arXiv preprint arXiv:2602.17196 (2026)

Wang, Y · 2026 · arXiv 2602.17196

3 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

TOPS: First-Principles Visual Token Pruning via Constructing Token Optimal Preservation Sets for Efficient MLLM Inference

cs.AI · 2026-06-25 · unverdicted · novelty 6.0

TOPS formulates visual token pruning as constructing Token Optimal Preservation Sets using three information-theoretic principles and demonstrates superior performance on MLLM benchmarks.

Spectral Evolution-Guided Token Pruning in Multimodal Large Language Models

cs.CV · 2026-06-23 · unverdicted · novelty 6.0

CLSE prunes tokens in MLLMs by quantifying cross-layer spectral redistribution in the frequency domain to preserve semantically active tokens and reduce compute.

Accelerating Multimodal Large Language Models with Prior-Corrected Token Reduction

cs.CV · 2026-06-23 · unverdicted · novelty 6.0

PriorTR estimates model-induced prior attention via a null token in one forward pass and contrasts it with task-conditioned attention to improve visual token pruning accuracy-efficiency trade-offs in MLLMs.

citing papers explorer


TOPS: First-Principles Visual Token Pruning via Constructing Token Optimal Preservation Sets for Efficient MLLM Inference
cs.AI · 2026-06-25 · unverdicted · none · ref 57

Spectral Evolution-Guided Token Pruning in Multimodal Large Language Models
cs.CV · 2026-06-23 · unverdicted · none · ref 65

Accelerating Multimodal Large Language Models with Prior-Corrected Token Reduction
cs.CV · 2026-06-23 · unverdicted · none · ref 65
