arXiv preprint arXiv:2508.12252 , year=

Hu, K · 2025 · arXiv 2508.12252

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

One Demonstration Is Enough for Real-World Robotic Reinforcement Learning

cs.RO · 2026-07-02 · unverdicted · novelty 6.0

AutoSERL achieves strong performance on six real-world robot manipulation tasks using RL guided by a single demonstration via sliding-window intervention, safety recovery, and automatic termination.

FADA: Few-Shot Domain Adaptation via Dynamics Alignment for Humanoid Control

cs.RO · 2026-06-26 · unverdicted · novelty 6.0

FADA is a three-stage Planner-IDM method that achieves few-shot domain adaptation for humanoid control by distilling an oracle policy then finetuning only the IDM on short target-domain rollouts via supervised learning.

Dynamics Are Learned, Not Told: Semi-Supervised Discovery of Latent Dynamics Geometries For Zero-Shot Policy Adaptation

cs.RO · 2026-06-01 · unverdicted · novelty 6.0

Contrastive learning bounds the Lipschitz constant of a trajectory dynamics encoder to support outcome-centric zero-shot adaptation in MuJoCo robotics tasks under severe dynamics shifts.

citing papers explorer

Showing 3 of 3 citing papers.

One Demonstration Is Enough for Real-World Robotic Reinforcement Learning cs.RO · 2026-07-02 · unverdicted · none · ref 8
AutoSERL achieves strong performance on six real-world robot manipulation tasks using RL guided by a single demonstration via sliding-window intervention, safety recovery, and automatic termination.
FADA: Few-Shot Domain Adaptation via Dynamics Alignment for Humanoid Control cs.RO · 2026-06-26 · unverdicted · none · ref 42
FADA is a three-stage Planner-IDM method that achieves few-shot domain adaptation for humanoid control by distilling an oracle policy then finetuning only the IDM on short target-domain rollouts via supervised learning.
Dynamics Are Learned, Not Told: Semi-Supervised Discovery of Latent Dynamics Geometries For Zero-Shot Policy Adaptation cs.RO · 2026-06-01 · unverdicted · none · ref 6
Contrastive learning bounds the Lipschitz constant of a trajectory dynamics encoder to support outcome-centric zero-shot adaptation in MuJoCo robotics tasks under severe dynamics shifts.

arXiv preprint arXiv:2508.12252 , year=

fields

years

verdicts

representative citing papers

citing papers explorer