Timechat: A time-sensitive multimodal large lan- guage model for long video understanding

Shuhuai Ren, Linli Yao, Shicheng Li, Xu Sun, Lu Hou · 2024

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

DynaTok: Temporally Adaptive and Positional Bias-Aware Token Compression for Video-LLMs

cs.CV · 2026-05-19 · unverdicted · novelty 6.0

DynaTok introduces temporally adaptive budget allocation with EMA memory and spatial selection with memory to compress video tokens, retaining over 95% accuracy at 90% reduction on VideoQA benchmarks.

MARS: Technical Report for the CASTLE Challenge at EgoVis 2026

cs.CV · 2026-05-18 · unverdicted · novelty 3.0

MARS converts long videos to captions and summaries, maintains modality-specific memories, and deploys an agent to select evidence or answer, placing second on the CASTLE Challenge leaderboard.

citing papers explorer

Showing 2 of 2 citing papers.

DynaTok: Temporally Adaptive and Positional Bias-Aware Token Compression for Video-LLMs cs.CV · 2026-05-19 · unverdicted · none · ref 17
DynaTok introduces temporally adaptive budget allocation with EMA memory and spatial selection with memory to compress video tokens, retaining over 95% accuracy at 90% reduction on VideoQA benchmarks.
MARS: Technical Report for the CASTLE Challenge at EgoVis 2026 cs.CV · 2026-05-18 · unverdicted · none · ref 10
MARS converts long videos to captions and summaries, maintains modality-specific memories, and deploys an agent to select evidence or answer, placing second on the CASTLE Challenge leaderboard.

Timechat: A time-sensitive multimodal large lan- guage model for long video understanding

fields

years

verdicts

representative citing papers

citing papers explorer