pith. machine review for the scientific record. sign in

Sophiavl-r1: Reinforcing mllms reasoning with thinking reward

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

years

2026 4 2025 1

clear filters

representative citing papers

Video-R1: Reinforcing Video Reasoning in MLLMs

cs.CV · 2025-03-27 · conditional · novelty 7.0

Video-R1 uses temporal-aware RL and mixed datasets to boost video reasoning in MLLMs, with a 7B model reaching 37.1% on VSI-Bench and surpassing GPT-4o.

citing papers explorer

Showing 4 of 4 citing papers after filters.