arXiv preprint arXiv:2507.21649 (2025)

Gao, S · 2025 · arXiv 2507.21649

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Can Multimodal Large Language Models Truly Understand Small Objects?

cs.CV · 2026-04-24 · unverdicted · novelty 7.0

Current MLLMs show weak performance on small object understanding tasks, but fine-tuning with the new SOU-Train dataset measurably improves their capabilities.

citing papers explorer

Showing 1 of 1 citing paper.

Can Multimodal Large Language Models Truly Understand Small Objects? cs.CV · 2026-04-24 · unverdicted · none · ref 12
Current MLLMs show weak performance on small object understanding tasks, but fine-tuning with the new SOU-Train dataset measurably improves their capabilities.

arXiv preprint arXiv:2507.21649 (2025)

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer