Sime: En- hancing policy self-improvement with modal-level exploration

Yang Jin, Jun Lv, Wenye Yu, Hongjie Fang, Yong-Lu Li, Cewu Lu · 2025 · arXiv 2505.01396

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

WorldSample: Closed-loop Real-robot RL with World Modelling

cs.RO · 2026-07-02 · unverdicted · novelty 5.0

WorldSample generates synthetic transitions from a post-trained world model grounded in real rollouts and uses Policy-Paced Learning to improve RL policies, reporting 28% higher success rates and 59% fewer training steps on contact-rich robot tasks.

RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation

cs.RO · 2025-10-20 · unverdicted · novelty 5.0

RESample uses exploratory sampling guided by a lightweight Coverage Function to expand VLA training data coverage, yielding 12% performance gains on LIBERO and real-world tasks with 10-20% added samples.

citing papers explorer

Showing 2 of 2 citing papers.

WorldSample: Closed-loop Real-robot RL with World Modelling cs.RO · 2026-07-02 · unverdicted · none · ref 12
WorldSample generates synthetic transitions from a post-trained world model grounded in real rollouts and uses Policy-Paced Learning to improve RL policies, reporting 28% higher success rates and 59% fewer training steps on contact-rich robot tasks.
RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation cs.RO · 2025-10-20 · unverdicted · none · ref 26
RESample uses exploratory sampling guided by a lightweight Coverage Function to expand VLA training data coverage, yielding 12% performance gains on LIBERO and real-world tasks with 10-20% added samples.

Sime: En- hancing policy self-improvement with modal-level exploration

fields

years

verdicts

representative citing papers

citing papers explorer