pith. machine review for the scientific record. sign in

arxiv: 1810.05687 · v4 · submitted 2018-10-12 · 💻 cs.RO · cs.LG

Recognition: unknown

Closing the Sim-to-Real Loop: Adapting Simulation Randomization with Real World Experience

Authors on Pith no claims yet
classification 💻 cs.RO cs.LG
keywords realworlddistributionpolicysimulationablepoliciesrandomization
0
0 comments X
read the original abstract

We consider the problem of transferring policies to the real world by training on a distribution of simulated scenarios. Rather than manually tuning the randomization of simulations, we adapt the simulation parameter distribution using a few real world roll-outs interleaved with policy training. In doing so, we are able to change the distribution of simulations to improve the policy transfer by matching the policy behavior in simulation and the real world. We show that policies trained with our method are able to reliably transfer to different robots in two real world tasks: swing-peg-in-hole and opening a cabinet drawer. The video of our experiments can be found at https://sites.google.com/view/simopt

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Solving Rubik's Cube with a Robot Hand

    cs.LG 2019-10 accept novelty 7.0

    Reinforcement learning models trained only in simulation using automatic domain randomization solve Rubik's cube with a real robot hand.