pith. machine review for the scientific record. sign in

Rlinf: Flexible and efficient large-scale reinforcement learning via macro-to-micro flow transformation.arXiv preprint arXiv:2509.15965, 2025a

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

years

2026 7

representative citing papers

Reinforcing VLAs in Task-Agnostic World Models

cs.AI · 2026-05-12 · unverdicted · novelty 6.0

RAW-Dream lets VLAs learn new tasks in zero-shot imagination by using a world model pre-trained only on task-free behaviors and an unmodified VLM to supply rewards, with dual-noise verification to limit hallucinations.

citing papers explorer

Showing 7 of 7 citing papers.