Laura Smith, Ilya Kostrikov, and Sergey Levine

URLhttps: //arxiv · arXiv 2110.05457

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

FADA: Few-Shot Domain Adaptation via Dynamics Alignment for Humanoid Control

cs.RO · 2026-06-26 · unverdicted · novelty 6.0

FADA is a three-stage Planner-IDM method that achieves few-shot domain adaptation for humanoid control by distilling an oracle policy then finetuning only the IDM on short target-domain rollouts via supervised learning.

Bridging the Gap: Enabling Soft Actor Critic for High Performance Legged Locomotion

cs.RO · 2026-05-24 · unverdicted · novelty 5.0

Targeted changes to policy initialization, critic targets, and return estimation let SAC match PPO performance across legged locomotion tasks in massively parallel simulation.

citing papers explorer

Showing 2 of 2 citing papers after filters.

FADA: Few-Shot Domain Adaptation via Dynamics Alignment for Humanoid Control cs.RO · 2026-06-26 · unverdicted · none · ref 7
FADA is a three-stage Planner-IDM method that achieves few-shot domain adaptation for humanoid control by distilling an oracle policy then finetuning only the IDM on short target-domain rollouts via supervised learning.
Bridging the Gap: Enabling Soft Actor Critic for High Performance Legged Locomotion cs.RO · 2026-05-24 · unverdicted · none · ref 12
Targeted changes to policy initialization, critic targets, and return estimation let SAC match PPO performance across legged locomotion tasks in massively parallel simulation.

Laura Smith, Ilya Kostrikov, and Sergey Levine

fields

years

verdicts

representative citing papers

citing papers explorer