pith. machine review for the scientific record. sign in

OCR-Reasoning benchmark: Unveiling the true capabilities of MLLMs in complex text-rich image reasoning

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

fields

cs.CV 2 cs.LG 1

years

2026 3

representative citing papers

Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence

cs.LG · 2026-04-27 · unverdicted · novelty 5.0 · 2 refs

Nemotron 3 Nano Omni is an efficient open multimodal model supporting audio, text, images, and video with reported accuracy gains and leading results on document understanding and long audio-video tasks.

citing papers explorer

Showing 3 of 3 citing papers.