0 technical report

Taohan Weng, Chi Zhang, Chaoran Yan, Siya Liu, Xiaoyang Liu, Yalun Wu, Boyang Wang, Boyan Wang, Jiren Ren, Kaiwen Yan, Jinze Yu, Kaibing Hu, Henan Liu, Haoyun Zheng, Zhenyu Liu, Duo Zhang, Xiaoqing Guo, Anjie Le, Hongcheng Guo · 2025 · arXiv 2509.25748

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

representative citing papers

Towards Real-World Ultrasound Understanding: Large Vision-Language Models from Multi-Image Examinations with Long-Form Reports

cs.CV · 2026-07-02 · unverdicted · novelty 7.0

A large examination-level ultrasound dataset with long-form reports enables simple LVLM fine-tuning to outperform prior complex methods.

Look-Closer-Then-Diagnose: Confidence-Aware Ultrasound VQA via Active Zooming

cs.CV · 2026-05-20 · unverdicted · novelty 6.0 · 2 refs

Introduces Zoom-then-Diagnose paradigm and uncertainty-aware reward in GRPO for confidence-aware ultrasound VQA, reporting 39.3% improvement in lesion localization across liver, breast, and thyroid datasets.

FADA: Accessible fetal ultrasound interpretation and annotation with a selectively distilled unified vision-language model

cs.CV · 2026-06-09 · conditional · novelty 5.0

FADA is a selectively distilled unified vision-language model for fetal ultrasound that performs interpretation, classification, detection, and segmentation in one pipeline, achieves strong metrics, and deploys offline on mobile devices.

citing papers explorer

Showing 3 of 3 citing papers.

Towards Real-World Ultrasound Understanding: Large Vision-Language Models from Multi-Image Examinations with Long-Form Reports cs.CV · 2026-07-02 · unverdicted · none · ref 20
A large examination-level ultrasound dataset with long-form reports enables simple LVLM fine-tuning to outperform prior complex methods.
Look-Closer-Then-Diagnose: Confidence-Aware Ultrasound VQA via Active Zooming cs.CV · 2026-05-20 · unverdicted · none · ref 25 · 2 links
Introduces Zoom-then-Diagnose paradigm and uncertainty-aware reward in GRPO for confidence-aware ultrasound VQA, reporting 39.3% improvement in lesion localization across liver, breast, and thyroid datasets.
FADA: Accessible fetal ultrasound interpretation and annotation with a selectively distilled unified vision-language model cs.CV · 2026-06-09 · conditional · none · ref 11
FADA is a selectively distilled unified vision-language model for fetal ultrasound that performs interpretation, classification, detection, and segmentation in one pipeline, achieves strong metrics, and deploys offline on mobile devices.

0 technical report

fields

years

verdicts

representative citing papers

citing papers explorer