Residual Off-Policy RL for Finetuning Behavior Cloning Policies

6 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 2

fields: cs.RO (4), cs.LG (2)

years: 2026 (5), 2025 (1)

polarities

background 2

representative citing papers

$\pi^{*}_{0.6}$: a VLA That Learns From Experience

cs.LG · 2025-11-18 · unverdicted · novelty 6.0

RECAP enables a generalist VLA to self-improve via advantage-conditioned RL on mixed real-world data, more than doubling throughput and halving failure rates on hard manipulation tasks.
