GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning

Ankur Handa; Dieter Fox; Jacky Liang; Miles Macklin; Nuttapong Chentanez; Viktor Makoviychuk

arxiv: 1810.05762 · v2 · pith:CGT4MQZAnew · submitted 2018-10-12 · 💻 cs.RO

Jacky Liang, Viktor Makoviychuk, Ankur Handa, Nuttapong Chentanez, Miles Macklin, Dieter Fox

classification 💻 cs.RO

keywords learning, deep, distributed, simulation, tasks, training, gpu-accelerated, locomotion

Most Deep Reinforcement Learning (Deep RL) algorithms require a prohibitively large number of training samples to learn complex tasks. Many recent works on speeding up Deep RL have focused on distributed training and simulation. While distributed training is often done on the GPU, simulation is not. In this work, we propose using GPU-accelerated RL simulations as an alternative to CPU ones. Using NVIDIA Flex, a GPU-based physics engine, we show promising speed-ups in learning various continuous-control locomotion tasks. With one GPU and one CPU core, we are able to train the Humanoid running task in less than 20 minutes, using 10-1000x fewer CPU cores than previous works. We also demonstrate the scalability of our simulator to multi-GPU settings to train more challenging locomotion tasks.
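
The abstract does not describe the simulator's interface, so the following is only a rough illustration of why running simulation on a GPU can help: many environments can be advanced in lock-step as one batched array operation instead of one process per environment. This is a minimal sketch under stated assumptions. It uses NumPy on the CPU as a stand-in for a GPU array library, it does not use NVIDIA Flex, and the function `step_batch`, the point-mass dynamics, and the batch size are all hypothetical choices made for illustration.

```python
import numpy as np


def step_batch(pos, vel, actions, dt=0.01):
    """Advance every environment by one time step with array operations.

    pos, vel, actions: arrays of shape (num_envs, dim).
    Toy dynamics (an assumption, not the paper's physics): each
    environment is a unit-mass point and the action is the applied force.
    """
    vel = vel + actions * dt  # semi-implicit Euler: update velocity first
    pos = pos + vel * dt      # then move using the updated velocity
    return pos, vel


# Hypothetical batch of environments stepped together.
num_envs, dim = 1024, 3
rng = np.random.default_rng(0)
pos = np.zeros((num_envs, dim))
vel = np.zeros((num_envs, dim))

for _ in range(100):
    # A real policy would map observations to actions; random forces
    # stand in for it here.
    actions = rng.uniform(-1.0, 1.0, size=(num_envs, dim))
    pos, vel = step_batch(pos, vel, actions)
```

The point of the sketch is the data layout: state for all environments lives in one array, so a single call updates every environment, and the cost per environment drops as the batch grows on hardware built for wide parallel arithmetic.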

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

OrbiSim: World Models as Differentiable Physics Engines for Embodied Intelligence
cs.RO 2026-05 unverdicted novelty 5.0

OrbiSim builds a differentiable physics engine from world models to support gradient-based policy optimization and contact modeling in robotics.