pith. machine review for the scientific record.

citation dossier

SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward

stub below hub threshold · 3 Pith inbound

Kaixuan Fan, Kaituo Feng, Haoming Lyu, Dongzhan Zhou, and Xiangyu Yue · 2025 · arXiv 2505.17018

3 Pith papers citing it
3 reference links
cs.CV top field · 2 papers
UNVERDICTED top verdict bucket · 2 papers

This arXiv-backed work is queued for full Pith review when it crosses the high-inbound sweep. That review runs reader · skeptic · desk-editor · referee · rebuttal · circularity · lean confirmation · RS check · pith extraction.

why this work matters in Pith

Pith has found this work in 3 reviewed papers. Its strongest current cluster is cs.CV (2 papers). The largest review-status bucket among citing papers is UNVERDICTED (2 papers). For highly cited works, this page shows a dossier first and a bounded explorer second; it never tries to render every citing paper at once.

fields

cs.CV: 2 · cs.LG: 1

years

2026: 2 · 2025: 1

representative citing papers

Video-R1: Reinforcing Video Reasoning in MLLMs

cs.CV · 2025-03-27 · conditional · novelty 7.0

Video-R1 uses temporal-aware RL and mixed datasets to boost video reasoning in MLLMs, with a 7B model reaching 37.1% on VSI-Bench and surpassing GPT-4o.

citing papers explorer

Showing 3 of 3 citing papers.