SAM-MI: A mask-injected framework for enhancing open-vocabulary semantic segmentation with SAM

Lin Chen, Yingjian Zhu, Qi Yang, Xin Niu, Kun Ding, Shiming Xiang · 2025 · arXiv 2511.20027

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

WikiSeeker: Rethinking the Role of Vision-Language Models in Knowledge-Based Visual Question Answering

cs.CV · 2026-04-07 · unverdicted · novelty 6.0

WikiSeeker boosts KB-VQA performance by using VLMs to rewrite image-informed queries for better retrieval and to decide when to route to external LLM or rely on internal VLM knowledge.

Towards Realistic Open-Vocabulary Remote Sensing Segmentation: Benchmark and Baseline

cs.CV · 2026-04-17

citing papers explorer

Showing 1 of 1 citing paper after filters.

WikiSeeker: Rethinking the Role of Vision-Language Models in Knowledge-Based Visual Question Answering cs.CV · 2026-04-07 · unverdicted · none · ref 8
WikiSeeker boosts KB-VQA performance by using VLMs to rewrite image-informed queries for better retrieval and to decide when to route to external LLM or rely on internal VLM knowledge.

SAM-MI: A mask-injected framework for enhancing open-vocabulary semantic segmentation with SAM

fields

years

verdicts

representative citing papers

citing papers explorer