Mosaic uses cross-modal clusters as the unit for KVCache organization in VLMs to achieve up to 1.38x speedup in streaming long-video understanding.
Timechat- online: 80% visual tokens are naturally redundant in streaming videos
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
NexusAI decomposes LLM inspirations into navigable functional fragments and abstractions to improve creative design space exploration, with a user study showing reduced cognitive overhead.
citing papers explorer
-
Mosaic: Cross-Modal Clustering for Efficient Video Understanding
Mosaic uses cross-modal clusters as the unit for KVCache organization in VLMs to achieve up to 1.38x speedup in streaming long-video understanding.
-
NexusAI: Enabling Design Space Exploration of Ideas through Cognitive Abstraction and Functional Decomposition
NexusAI decomposes LLM inspirations into navigable functional fragments and abstractions to improve creative design space exploration, with a user study showing reduced cognitive overhead.