Timechat- online: 80% visual tokens are naturally redundant in streaming videos

Penglei Wang, Ziming Quan, Danyang Wu, Jin Xu · 2025 · arXiv 6027.375483

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Mosaic: Cross-Modal Clustering for Efficient Video Understanding

cs.PF · 2026-04-11 · unverdicted · novelty 7.0

Mosaic uses cross-modal clusters as the unit for KVCache organization in VLMs to achieve up to 1.38x speedup in streaming long-video understanding.

NexusAI: Enabling Design Space Exploration of Ideas through Cognitive Abstraction and Functional Decomposition

cs.HC · 2026-04-12 · unverdicted · novelty 5.0

NexusAI decomposes LLM inspirations into navigable functional fragments and abstractions to improve creative design space exploration, with a user study showing reduced cognitive overhead.

citing papers explorer

Showing 2 of 2 citing papers.

Mosaic: Cross-Modal Clustering for Efficient Video Understanding cs.PF · 2026-04-11 · unverdicted · none · ref 39
Mosaic uses cross-modal clusters as the unit for KVCache organization in VLMs to achieve up to 1.38x speedup in streaming long-video understanding.
NexusAI: Enabling Design Space Exploration of Ideas through Cognitive Abstraction and Functional Decomposition cs.HC · 2026-04-12 · unverdicted · none · ref 65
NexusAI decomposes LLM inspirations into navigable functional fragments and abstractions to improve creative design space exploration, with a user study showing reduced cognitive overhead.

Timechat- online: 80% visual tokens are naturally redundant in streaming videos

fields

years

verdicts

representative citing papers

citing papers explorer