VLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced Reranking and Noise-injected Training

Chen, Zhanpeng, Xu, Chengjin, Qi, Yiyan, Jiang, Xuhui, Guo, Jian · 2025 · DOI 10.18653/v1/2025.findings-emnlp.432

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open at publisher browse 1 citing papers

representative citing papers

miniReranker: Efficient Multimodal Reranking through Visual Cache Reuse and Interaction Sparsity

cs.IR · 2026-06-09 · unverdicted · novelty 6.0

miniReranker reduces multimodal reranking runtime to under 1% of the dense baseline under high-reuse conditions while retaining over 96% of performance via vision-first prompting, early exit, sparse cross-segment attention, and embedder-guided token pruning.

citing papers explorer

Showing 1 of 1 citing paper.

miniReranker: Efficient Multimodal Reranking through Visual Cache Reuse and Interaction Sparsity cs.IR · 2026-06-09 · unverdicted · none · ref 24
miniReranker reduces multimodal reranking runtime to under 1% of the dense baseline under high-reuse conditions while retaining over 96% of performance via vision-first prompting, early exit, sparse cross-segment attention, and embedder-guided token pruning.

VLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced Reranking and Noise-injected Training

fields

years

verdicts

representative citing papers

citing papers explorer