pith. sign in

Parkour in the wild: Learning a general and extensible agile locomotion policy using multi-expert distillation and rl fine-tuning

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

fields

cs.RO 7

years

2026 6 2025 1

verdicts

UNVERDICTED 7

roles

background 1

polarities

unclear 1

representative citing papers

roto 2.0: The Robot Tactile Olympiad

cs.RO · 2026-05-20 · unverdicted · novelty 7.0

roto 2.0 provides a standardized benchmark for end-to-end blind tactile RL on 16-24 DOF robots, with open-sourced baselines achieving 13 Baoding ball rotations in 10 seconds.

SSR: Scaling Surefooted and Symmetric Humanoid Traversal to the Open World

cs.RO · 2026-05-29 · unverdicted · novelty 5.0

SSR is an end-to-end vision-based framework for humanoid traversal that learns imagined foothold guidance, equivariant latent-space symmetry augmentation, and terrain-specific multi-discriminator motion priors to enable safe locomotion on diverse real-world terrains.

Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient

cs.RO · 2026-05-26 · unverdicted · novelty 4.0

SDPG is a new on-policy visual RL algorithm that estimates gradients via stochastic perturbations of rollouts, achieving faster training and lower memory use than baselines on visual MuJoCo tasks while adding new robotics benchmarks and sim-to-real results.

citing papers explorer

Showing 7 of 7 citing papers.

  • roto 2.0: The Robot Tactile Olympiad cs.RO · 2026-05-20 · unverdicted · none · ref 1

    roto 2.0 provides a standardized benchmark for end-to-end blind tactile RL on 16-24 DOF robots, with open-sourced baselines achieving 13 Baoding ball rotations in 10 seconds.

  • Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching cs.RO · 2026-02-17 · unverdicted · none · ref 33

    A modular system uses motion matching to compose long-horizon human skill chains, trains RL experts, and distills them into a depth-based policy that lets a Unitree G1 humanoid autonomously climb, vault, and roll over obstacles up to 1.25 m tall.

  • Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning cs.RO · 2025-11-06 · unverdicted · none · ref 86

    Isaac Lab is a unified GPU-native platform combining high-fidelity physics, photorealistic rendering, multi-frequency sensors, domain randomization, and learning pipelines for scalable multi-modal robot policy training.

  • SSR: Scaling Surefooted and Symmetric Humanoid Traversal to the Open World cs.RO · 2026-05-29 · unverdicted · none · ref 26

    SSR is an end-to-end vision-based framework for humanoid traversal that learns imagined foothold guidance, equivariant latent-space symmetry augmentation, and terrain-specific multi-discriminator motion priors to enable safe locomotion on diverse real-world terrains.

  • Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient cs.RO · 2026-05-26 · unverdicted · none · ref 12

    SDPG is a new on-policy visual RL algorithm that estimates gradients via stochastic perturbations of rollouts, achieving faster training and lower memory use than baselines on visual MuJoCo tasks while adding new robotics benchmarks and sim-to-real results.

  • Evaluation of an Actuated Spine in Agile Quadruped Locomotion cs.RO · 2026-05-08 · unverdicted · none · ref 11

    Adding an actuated sagittal spine to a simulated quadruped increases agility and allows it to clear higher obstacles, steeper slopes, and tighter passages than the rigid-spine baseline.

  • Quadruped Parkour Learning: Sparsely Gated Mixture of Experts with Visual Input cs.RO · 2026-04-21 · unverdicted · none · ref 22

    Sparsely gated MoE policies double the success rate of a real Unitree Go2 quadruped on large-obstacle parkour versus matched-active-parameter MLP baselines while cutting inference time compared with a scaled-up MLP.