Parkour in the wild: Learning a general and extensible agile locomotion policy using multi-expert distillation and rl fine-tuning

· 2025 · arXiv 2505.11164

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

read on arXiv browse 7 citing papers

citation-role summary

background 1

citation-polarity summary

unclear 1

representative citing papers

roto 2.0: The Robot Tactile Olympiad

cs.RO · 2026-05-20 · unverdicted · novelty 7.0

roto 2.0 provides a standardized benchmark for end-to-end blind tactile RL on 16-24 DOF robots, with open-sourced baselines achieving 13 Baoding ball rotations in 10 seconds.

Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching

cs.RO · 2026-02-17 · unverdicted · novelty 6.0

A modular system uses motion matching to compose long-horizon human skill chains, trains RL experts, and distills them into a depth-based policy that lets a Unitree G1 humanoid autonomously climb, vault, and roll over obstacles up to 1.25 m tall.

Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning

cs.RO · 2025-11-06 · unverdicted · novelty 6.0

Isaac Lab is a unified GPU-native platform combining high-fidelity physics, photorealistic rendering, multi-frequency sensors, domain randomization, and learning pipelines for scalable multi-modal robot policy training.

SSR: Scaling Surefooted and Symmetric Humanoid Traversal to the Open World

cs.RO · 2026-05-29 · unverdicted · novelty 5.0

SSR is an end-to-end vision-based framework for humanoid traversal that learns imagined foothold guidance, equivariant latent-space symmetry augmentation, and terrain-specific multi-discriminator motion priors to enable safe locomotion on diverse real-world terrains.

Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient

cs.RO · 2026-05-26 · unverdicted · novelty 4.0

SDPG is a new on-policy visual RL algorithm that estimates gradients via stochastic perturbations of rollouts, achieving faster training and lower memory use than baselines on visual MuJoCo tasks while adding new robotics benchmarks and sim-to-real results.

Evaluation of an Actuated Spine in Agile Quadruped Locomotion

cs.RO · 2026-05-08 · unverdicted · novelty 4.0

Adding an actuated sagittal spine to a simulated quadruped increases agility and allows it to clear higher obstacles, steeper slopes, and tighter passages than the rigid-spine baseline.

Quadruped Parkour Learning: Sparsely Gated Mixture of Experts with Visual Input

cs.RO · 2026-04-21 · unverdicted · novelty 4.0

Sparsely gated MoE policies double the success rate of a real Unitree Go2 quadruped on large-obstacle parkour versus matched-active-parameter MLP baselines while cutting inference time compared with a scaled-up MLP.

citing papers explorer

Showing 7 of 7 citing papers.

roto 2.0: The Robot Tactile Olympiad cs.RO · 2026-05-20 · unverdicted · none · ref 1
roto 2.0 provides a standardized benchmark for end-to-end blind tactile RL on 16-24 DOF robots, with open-sourced baselines achieving 13 Baoding ball rotations in 10 seconds.
Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching cs.RO · 2026-02-17 · unverdicted · none · ref 33
A modular system uses motion matching to compose long-horizon human skill chains, trains RL experts, and distills them into a depth-based policy that lets a Unitree G1 humanoid autonomously climb, vault, and roll over obstacles up to 1.25 m tall.
Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning cs.RO · 2025-11-06 · unverdicted · none · ref 86
Isaac Lab is a unified GPU-native platform combining high-fidelity physics, photorealistic rendering, multi-frequency sensors, domain randomization, and learning pipelines for scalable multi-modal robot policy training.
SSR: Scaling Surefooted and Symmetric Humanoid Traversal to the Open World cs.RO · 2026-05-29 · unverdicted · none · ref 26
SSR is an end-to-end vision-based framework for humanoid traversal that learns imagined foothold guidance, equivariant latent-space symmetry augmentation, and terrain-specific multi-discriminator motion priors to enable safe locomotion on diverse real-world terrains.
Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient cs.RO · 2026-05-26 · unverdicted · none · ref 12
SDPG is a new on-policy visual RL algorithm that estimates gradients via stochastic perturbations of rollouts, achieving faster training and lower memory use than baselines on visual MuJoCo tasks while adding new robotics benchmarks and sim-to-real results.
Evaluation of an Actuated Spine in Agile Quadruped Locomotion cs.RO · 2026-05-08 · unverdicted · none · ref 11
Adding an actuated sagittal spine to a simulated quadruped increases agility and allows it to clear higher obstacles, steeper slopes, and tighter passages than the rigid-spine baseline.
Quadruped Parkour Learning: Sparsely Gated Mixture of Experts with Visual Input cs.RO · 2026-04-21 · unverdicted · none · ref 22
Sparsely gated MoE policies double the success rate of a real Unitree Go2 quadruped on large-obstacle parkour versus matched-active-parameter MLP baselines while cutting inference time compared with a scaled-up MLP.

Parkour in the wild: Learning a general and extensible agile locomotion policy using multi-expert distillation and rl fine-tuning

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer