pith. sign in

Egoschema: A diagnostic benchmark for very long-form video language understanding

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

citation-role summary

dataset 1

citation-polarity summary

fields

cs.CV 5 cs.LG 1

years

2026 5 2023 1

roles

dataset 1

polarities

background 1

representative citing papers

Adaptive Greedy Frame Selection for Long Video Understanding

cs.CV · 2026-03-20 · unverdicted · novelty 6.0

A question-adaptive greedy frame selector combines SigLIP relevance and DINOv2 coverage under a submodular objective with a text classifier routing to preset trade-offs, yielding accuracy gains on MLVU especially at low frame budgets.

citing papers explorer

Showing 6 of 6 citing papers.