InAAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25 - March 4, 2025, Philadelphia, PA, USA, pages 22128– 22136

Qianhao Yuan, Qingyu Zhang, Yanjiang Liu, Jiawei Chen, Yaojie Lu, Hongyu Lin, Jia Zheng, Xianpei Han, Le Sun · 2025 · arXiv 2504.00502

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

representative citing papers

Look Less, Reason More: Block-wise Attention Skipping for Efficient Multimodal LLMs

cs.CV · 2026-06-07 · unverdicted · novelty 6.0

V-Skip applies block-wise structured sparsity to skip saturated visual self-attention in deeper MLLM layers while retaining FFNs, using few-shot calibration for task-specific paths and achieving 94.16-100.31% performance retention.

From Inheritance to Saturation: Disentangling the Evolution of Visual Redundancy for Architecture-Aware MLLM Inference Acceleration

cs.CV · 2026-04-08 · unverdicted · novelty 6.0

HalfV disentangles MLLM visual redundancy into universal IVR and architecture-dependent SSR via a three-stage lifecycle, delivering 4.1x FLOPs speedup with 96.8% performance retention on Qwen25-VL.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Look Less, Reason More: Block-wise Attention Skipping for Efficient Multimodal LLMs cs.CV · 2026-06-07 · unverdicted · none · ref 9
V-Skip applies block-wise structured sparsity to skip saturated visual self-attention in deeper MLLM layers while retaining FFNs, using few-shot calibration for task-specific paths and achieving 94.16-100.31% performance retention.
From Inheritance to Saturation: Disentangling the Evolution of Visual Redundancy for Architecture-Aware MLLM Inference Acceleration cs.CV · 2026-04-08 · unverdicted · none · ref 6
HalfV disentangles MLLM visual redundancy into universal IVR and architecture-dependent SSR via a three-stage lifecycle, delivering 4.1x FLOPs speedup with 96.8% performance retention on Qwen25-VL.

InAAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25 - March 4, 2025, Philadelphia, PA, USA, pages 22128– 22136

fields

years

verdicts

representative citing papers

citing papers explorer