SkillSpotter raises class-specific mAP from 12.40 to 21.82 and balanced accuracy to 60.40% on Ego-Exo4D by adding adaptive temporal suppression, gated pose fusion, and bidirectional cross-view attention to temporal action detectors.
Dual-stage reweighted moe for long-tailed egocentric mistake detection.arXiv preprint arXiv:2509.12990, 2025
2 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.CV 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
UE-MCM fuses a small CLIP4CLIP branch for workflow inconsistency and a large Qwen3-VL branch for fine-grained action errors via a collaboration gate, trained with reweighted cross-entropy, AUC learning, and label-aware adjustment for long-tailed egocentric mistake detection.
citing papers explorer
-
SkillSpotter: Pose-Aware Multi-View Skilled Action Detection and Grading in Ego-Exo Videos
SkillSpotter raises class-specific mAP from 12.40 to 21.82 and balanced accuracy to 60.40% on Ego-Exo4D by adding adaptive temporal suppression, gated pose fusion, and bidirectional cross-view attention to temporal action detectors.
-
Understanding-Enhanced Model Collaboration for Long-Tailed Egocentric Mistake Detection
UE-MCM fuses a small CLIP4CLIP branch for workflow inconsistency and a large Qwen3-VL branch for fine-grained action errors via a collaboration gate, trained with reweighted cross-entropy, AUC learning, and label-aware adjustment for long-tailed egocentric mistake detection.