arXiv preprint arXiv:2505.19213 (2025)

Improving Medical Reasoning with Curriculum-Aware Reinforcement Learning , author= · 2025 · arXiv 2505.19213

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Token-Sparse Medical Multimodal Reasoning via Dual-Stream Reinforcement Learning

cs.CV · 2026-06-30 · unverdicted · novelty 6.0

ViToS uses dual-stream RL with cross-feedback optimization to prune medical image tokens to 77% length while reporting 108.27% and 104.16% relative performance on two 7B VLMs across seven benchmarks.

Look-Closer-Then-Diagnose: Confidence-Aware Ultrasound VQA via Active Zooming

cs.CV · 2026-05-20 · unverdicted · novelty 6.0 · 2 refs

Introduces Zoom-then-Diagnose paradigm and uncertainty-aware reward in GRPO for confidence-aware ultrasound VQA, reporting 39.3% improvement in lesion localization across liver, breast, and thyroid datasets.

citing papers explorer

Showing 2 of 2 citing papers.

Token-Sparse Medical Multimodal Reasoning via Dual-Stream Reinforcement Learning cs.CV · 2026-06-30 · unverdicted · none · ref 45
ViToS uses dual-stream RL with cross-feedback optimization to prune medical image tokens to 77% length while reporting 108.27% and 104.16% relative performance on two 7B VLMs across seven benchmarks.
Look-Closer-Then-Diagnose: Confidence-Aware Ultrasound VQA via Active Zooming cs.CV · 2026-05-20 · unverdicted · none · ref 17 · 2 links
Introduces Zoom-then-Diagnose paradigm and uncertainty-aware reward in GRPO for confidence-aware ultrasound VQA, reporting 39.3% improvement in lesion localization across liver, breast, and thyroid datasets.

arXiv preprint arXiv:2505.19213 (2025)

fields

years

verdicts

representative citing papers

citing papers explorer