Egotempo: A benchmark for egocentric video question answering requiring temporal reasoning

URL https://arxiv · 2025 · arXiv 2503.13646

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding

cs.CV · 2026-06-03 · unverdicted · novelty 7.0

VideoKR supplies 315K knowledge-intensive video reasoning examples and a dedicated benchmark, with experiments indicating post-training gains on reasoning tasks that require both video content and external knowledge.

EGOSTREAM: A Diagnostic Benchmark for Streaming Episodic Memory in Egocentric Vision

cs.CV · 2026-05-29 · unverdicted · novelty 7.0

Egostream introduces a diagnostic benchmark that expands 2,250 questions into 8,528 recall-conditioned evaluations to measure streaming episodic memory performance across detail, spatial, temporal, event, social, causal, and prospective dimensions in egocentric vision.

Flattery in Motion: Benchmarking and Analyzing Sycophancy in Video-LLMs

cs.CL · 2025-06-08 · unverdicted · novelty 7.0

VISE is the first benchmark for sycophancy in Video-LLMs, with two training-free mitigation strategies based on key-frame selection and internal representation steering.

citing papers explorer

Showing 3 of 3 citing papers.

VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding cs.CV · 2026-06-03 · unverdicted · none · ref 10
VideoKR supplies 315K knowledge-intensive video reasoning examples and a dedicated benchmark, with experiments indicating post-training gains on reasoning tasks that require both video content and external knowledge.
EGOSTREAM: A Diagnostic Benchmark for Streaming Episodic Memory in Egocentric Vision cs.CV · 2026-05-29 · unverdicted · none · ref 34
Egostream introduces a diagnostic benchmark that expands 2,250 questions into 8,528 recall-conditioned evaluations to measure streaming episodic memory performance across detail, spatial, temporal, event, social, causal, and prospective dimensions in egocentric vision.
Flattery in Motion: Benchmarking and Analyzing Sycophancy in Video-LLMs cs.CL · 2025-06-08 · unverdicted · none · ref 31
VISE is the first benchmark for sycophancy in Video-LLMs, with two training-free mitigation strategies based on key-frame selection and internal representation steering.

Egotempo: A benchmark for egocentric video question answering requiring temporal reasoning

fields

years

verdicts

representative citing papers

citing papers explorer