
CyberV: Cybernetics for Test-time Scaling in Video Understanding. arXiv preprint arXiv:2506.07971, 2025

4 Pith papers cite this work. Polarity classification is still indexing.


fields: cs.CV (4)

years: 2026 (4)

verdicts: unverdicted (4)


representative citing papers

Towards One-to-Many Temporal Grounding

cs.CV · 2026-06-04 · unverdicted · novelty 7.0

Introduces the OMTG benchmark with C-Acc and EtF1 metrics, a 56k dataset, and caption and temporal rewards, reaching a state-of-the-art 43.65% EtF1 on the new benchmark.

AVIS: Adaptive Test-Time Scaling for Vision-Language Models

cs.CV · 2026-06-10 · unverdicted · novelty 6.0

AVIS is an adaptive policy that jointly scales visual context via key-based token pruning and reasoning via difficulty-predicted self-consistency to improve the accuracy-compute curve on image and video tasks.

Watch, Remember, Reason: Human-View Video Understanding with MLLMs

cs.CV · 2026-06-05 · unverdicted · novelty 4.0

A survey that frames video MLLM research through a human-view formulation of perceptual representations, memory states, reasoning traces, and predictions, then reviews methods, datasets, benchmarks, and open problems.
