pith. machine review for the scientific record. sign in

citation dossier

Efficient multi-turn rl for gui agents via decoupled training and adaptive data curation

stub below hub threshold · 2 Pith inbound

Pengxiang Li, Zechen Hu, Zirui Shang, Jingrong Wu, Yang Liu, Hui Liu, Zhi Gao, Chenrui Shi, Bofei Zhang, Zihao Zhang, et al · 2025 · arXiv 2509.23866

2Pith papers citing it
3reference links
cs.AItop field · 2 papers
UNVERDICTEDtop verdict bucket · 2 papers

This arXiv-backed work is queued for full Pith review when it crosses the high-inbound sweep. That review runs reader · skeptic · desk-editor · referee · rebuttal · circularity · lean confirmation · RS check · pith extraction.

read on arXiv PDF

why this work matters in Pith

Pith has found this work in 2 reviewed papers. Its strongest current cluster is cs.AI (2 papers). The largest review-status bucket among citing papers is UNVERDICTED (2 papers). For highly cited works, this page shows a dossier first and a bounded explorer second; it never tries to render every citing paper at once.

fields

cs.AI 2

years

2026 2

verdicts

UNVERDICTED 2

representative citing papers

citing papers explorer

Showing 2 of 2 citing papers.