Learning to generalize without bias for open-vocabulary action recognition,

· 2025 · arXiv 2502.20158

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Gold Points Sniper: Self-guided Visual Reasoning in VLM for Fine-grained Action Understanding

cs.CV · 2026-06-21 · unverdicted · novelty 6.0

GPS framework adds self-guided reasoning modules to lightweight VLMs for fine-grained action understanding, claiming performance near GPT-4o with better factual accuracy on a custom CAP-based dataset.

TACO: Towards Task-Consistent Open-Vocabulary Adaptation in Video Recognition

cs.CV · 2026-06-24 · unverdicted · novelty 5.0

TACO proposes Relative Structure Distillation and a lightweight specialization projection to mitigate inconsistency between fine-tuning and evaluation objectives in open-vocabulary video recognition, claiming state-of-the-art results on cross-dataset and base-to-novel benchmarks.

citing papers explorer

Showing 2 of 2 citing papers.

Gold Points Sniper: Self-guided Visual Reasoning in VLM for Fine-grained Action Understanding cs.CV · 2026-06-21 · unverdicted · none · ref 8
GPS framework adds self-guided reasoning modules to lightweight VLMs for fine-grained action understanding, claiming performance near GPT-4o with better factual accuracy on a custom CAP-based dataset.
TACO: Towards Task-Consistent Open-Vocabulary Adaptation in Video Recognition cs.CV · 2026-06-24 · unverdicted · none · ref 49
TACO proposes Relative Structure Distillation and a lightweight specialization projection to mitigate inconsistency between fine-tuning and evaluation objectives in open-vocabulary video recognition, claiming state-of-the-art results on cross-dataset and base-to-novel benchmarks.

Learning to generalize without bias for open-vocabulary action recognition,

fields

years

verdicts

representative citing papers

citing papers explorer