OpenVMR uses normalizing flow to detect out-of-distribution queries and performs moment retrieval only on in-distribution queries.
Computing Research Repository arXiv Preprint, arXiv:2001.09308 (2020)
3 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.CV 3years
2026 3verdicts
UNVERDICTED 3representative citing papers
Models frames and words as cooperative game players to value uncertain vision-language correspondences for proposal-free moment localization, reporting superior results on Charades-STA and ActivityNet Caption.
MCMT improves weakly-supervised VMR by fusing multiple learnable Gaussian masks from proposals into a positive sample mask and using dual masked query reconstruction tasks for stability.
citing papers explorer
-
Not All Inputs Are Valid: Towards Open-Set Video Moment Retrieval Using Language
OpenVMR uses normalizing flow to detect out-of-distribution queries and performs moment retrieval only on in-distribution queries.
-
Rethinking Weakly-supervised Video Temporal Grounding From a Game Perspective
Models frames and words as cooperative game players to value uncertain vision-language correspondences for proposal-free moment localization, reporting superior results on Charades-STA and ActivityNet Caption.
-
Multi-proposal Collaboration and Multi-task Training for Weakly-supervised Video Moment Retrieval
MCMT improves weakly-supervised VMR by fusing multiple learnable Gaussian masks from proposals into a positive sample mask and using dual masked query reconstruction tasks for stability.