hub

Parkour in the wild: Learning a general and extensible agile locomotion policy using multi-expert distillation and rl fine-tuning

· 2025 · arXiv 2505.11164

14 Pith papers cite this work. Polarity classification is still indexing.

14 Pith papers citing it

read on arXiv browse 14 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 1

citation-polarity summary

unclear 1

representative citing papers

HumanoidArena: Benchmarking Egocentric Hierarchical Whole-body Learning

cs.RO · 2026-06-16 · unverdicted · novelty 7.0

HumanoidArena is a new benchmark of 7 leg-critical HOI/HSI tasks that evaluates egocentric hierarchical whole-body policies in humanoids and finds performance is strongly conditioned on the low-level GMT used.

roto 2.0: The Robot Tactile Olympiad

cs.RO · 2026-05-20 · unverdicted · novelty 7.0

roto 2.0 provides a standardized benchmark for end-to-end blind tactile RL on 16-24 DOF robots, with open-sourced baselines achieving 13 Baoding ball rotations in 10 seconds.

Learning Locomotion on Discrete Terrain via Minimal Proximity Sensing

cs.RO · 2026-06-30 · unverdicted · novelty 6.0 · 2 refs

Foot-mounted proximity sensors provide pre-contact feedback that, when integrated into RL, improves quadruped traversal robustness on discrete terrain with reliable sim-to-real transfer.

StairMaster: Learning to Conquer Risky Hollow Stairs for Agile Quadrupedal Robots

cs.RO · 2026-06-24 · unverdicted · novelty 6.0

StairMaster trains an RL policy that lets a Unitree Go2 quadruped climb hollow stairs up to 55 degrees via zero-shot sim-to-real transfer using cross-attention, SRU memory, and active-perception rewards.

SWAP: Symmetric Equivariant World-Model for Agile Robot Parkour

cs.RO · 2026-06-18 · unverdicted · novelty 6.0

SWAP embeds symmetry equivariance into world models and policies, enabling a quadruped to leap 2.13m gaps and climb 1.63m platforms with robust generalization to mirrored and outdoor terrains.

TAGA: Terrain-aware Active Gaze Learning for Generalizable Agile Humanoid Locomotion

cs.RO · 2026-06-04 · unverdicted · novelty 6.0

TAGA learns terrain-aware active gaze behaviors for humanoid robots via RL alone, enabling generalizable locomotion with 1.2m real-world gap traversal.

Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching

cs.RO · 2026-02-17 · unverdicted · novelty 6.0

A modular system uses motion matching to compose long-horizon human skill chains, trains RL experts, and distills them into a depth-based policy that lets a Unitree G1 humanoid autonomously climb, vault, and roll over obstacles up to 1.25 m tall.

Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning

cs.RO · 2025-11-06 · unverdicted · novelty 6.0

Isaac Lab is a unified GPU-native platform combining high-fidelity physics, photorealistic rendering, multi-frequency sensors, domain randomization, and learning pipelines for scalable multi-modal robot policy training.

LadderMan: Learning Humanoid Perceptive Ladder Climbing

cs.RO · 2026-06-04 · unverdicted · novelty 5.0

A hybrid motion-tracking and imitation-reinforcement pipeline produces a depth-based visuomotor policy that lets humanoids climb varied ladders zero-shot on hardware and perform teleoperated manipulation while climbing.

CoRe-MoE: Contrastive Reweighted Mixture of Experts for Multi-Terrain Humanoid Locomotion with Gait Adaptation

cs.RO · 2026-06-03 · unverdicted · novelty 5.0

CoRe-MoE uses a two-stage RL framework with contrastive reweighting in a Mixture-of-Experts architecture to enable gait transitions and multi-terrain adaptation for humanoid locomotion.

SSR: Scaling Surefooted and Symmetric Humanoid Traversal to the Open World

cs.RO · 2026-05-29 · unverdicted · novelty 5.0

SSR is an end-to-end vision-based framework for humanoid traversal that learns imagined foothold guidance, equivariant latent-space symmetry augmentation, and terrain-specific multi-discriminator motion priors to enable safe locomotion on diverse real-world terrains.

Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient

cs.RO · 2026-05-26 · unverdicted · novelty 4.0

SDPG is a new on-policy visual RL algorithm that estimates gradients via stochastic perturbations of rollouts, achieving faster training and lower memory use than baselines on visual MuJoCo tasks while adding new robotics benchmarks and sim-to-real results.

Evaluation of an Actuated Spine in Agile Quadruped Locomotion

cs.RO · 2026-05-08 · unverdicted · novelty 4.0

Adding an actuated sagittal spine to a simulated quadruped increases agility and allows it to clear higher obstacles, steeper slopes, and tighter passages than the rigid-spine baseline.

Quadruped Parkour Learning: Sparsely Gated Mixture of Experts with Visual Input

cs.RO · 2026-04-21 · unverdicted · novelty 4.0

Sparsely gated MoE policies double the success rate of a real Unitree Go2 quadruped on large-obstacle parkour versus matched-active-parameter MLP baselines while cutting inference time compared with a scaled-up MLP.

citing papers explorer

Showing 14 of 14 citing papers after filters.

HumanoidArena: Benchmarking Egocentric Hierarchical Whole-body Learning cs.RO · 2026-06-16 · unverdicted · none · ref 31
HumanoidArena is a new benchmark of 7 leg-critical HOI/HSI tasks that evaluates egocentric hierarchical whole-body policies in humanoids and finds performance is strongly conditioned on the low-level GMT used.
roto 2.0: The Robot Tactile Olympiad cs.RO · 2026-05-20 · unverdicted · none · ref 1
roto 2.0 provides a standardized benchmark for end-to-end blind tactile RL on 16-24 DOF robots, with open-sourced baselines achieving 13 Baoding ball rotations in 10 seconds.
Learning Locomotion on Discrete Terrain via Minimal Proximity Sensing cs.RO · 2026-06-30 · unverdicted · none · ref 10 · 2 links
Foot-mounted proximity sensors provide pre-contact feedback that, when integrated into RL, improves quadruped traversal robustness on discrete terrain with reliable sim-to-real transfer.
StairMaster: Learning to Conquer Risky Hollow Stairs for Agile Quadrupedal Robots cs.RO · 2026-06-24 · unverdicted · none · ref 10
StairMaster trains an RL policy that lets a Unitree Go2 quadruped climb hollow stairs up to 55 degrees via zero-shot sim-to-real transfer using cross-attention, SRU memory, and active-perception rewards.
SWAP: Symmetric Equivariant World-Model for Agile Robot Parkour cs.RO · 2026-06-18 · unverdicted · none · ref 10
SWAP embeds symmetry equivariance into world models and policies, enabling a quadruped to leap 2.13m gaps and climb 1.63m platforms with robust generalization to mirrored and outdoor terrains.
TAGA: Terrain-aware Active Gaze Learning for Generalizable Agile Humanoid Locomotion cs.RO · 2026-06-04 · unverdicted · none · ref 30
TAGA learns terrain-aware active gaze behaviors for humanoid robots via RL alone, enabling generalizable locomotion with 1.2m real-world gap traversal.
Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching cs.RO · 2026-02-17 · unverdicted · none · ref 33
A modular system uses motion matching to compose long-horizon human skill chains, trains RL experts, and distills them into a depth-based policy that lets a Unitree G1 humanoid autonomously climb, vault, and roll over obstacles up to 1.25 m tall.
Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning cs.RO · 2025-11-06 · unverdicted · none · ref 86
Isaac Lab is a unified GPU-native platform combining high-fidelity physics, photorealistic rendering, multi-frequency sensors, domain randomization, and learning pipelines for scalable multi-modal robot policy training.
LadderMan: Learning Humanoid Perceptive Ladder Climbing cs.RO · 2026-06-04 · unverdicted · none · ref 39
A hybrid motion-tracking and imitation-reinforcement pipeline produces a depth-based visuomotor policy that lets humanoids climb varied ladders zero-shot on hardware and perform teleoperated manipulation while climbing.
CoRe-MoE: Contrastive Reweighted Mixture of Experts for Multi-Terrain Humanoid Locomotion with Gait Adaptation cs.RO · 2026-06-03 · unverdicted · none · ref 29
CoRe-MoE uses a two-stage RL framework with contrastive reweighting in a Mixture-of-Experts architecture to enable gait transitions and multi-terrain adaptation for humanoid locomotion.
SSR: Scaling Surefooted and Symmetric Humanoid Traversal to the Open World cs.RO · 2026-05-29 · unverdicted · none · ref 26
SSR is an end-to-end vision-based framework for humanoid traversal that learns imagined foothold guidance, equivariant latent-space symmetry augmentation, and terrain-specific multi-discriminator motion priors to enable safe locomotion on diverse real-world terrains.
Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient cs.RO · 2026-05-26 · unverdicted · none · ref 12
SDPG is a new on-policy visual RL algorithm that estimates gradients via stochastic perturbations of rollouts, achieving faster training and lower memory use than baselines on visual MuJoCo tasks while adding new robotics benchmarks and sim-to-real results.
Evaluation of an Actuated Spine in Agile Quadruped Locomotion cs.RO · 2026-05-08 · unverdicted · none · ref 11
Adding an actuated sagittal spine to a simulated quadruped increases agility and allows it to clear higher obstacles, steeper slopes, and tighter passages than the rigid-spine baseline.
Quadruped Parkour Learning: Sparsely Gated Mixture of Experts with Visual Input cs.RO · 2026-04-21 · unverdicted · none · ref 22
Sparsely gated MoE policies double the success rate of a real Unitree Go2 quadruped on large-obstacle parkour versus matched-active-parameter MLP baselines while cutting inference time compared with a scaled-up MLP.

Parkour in the wild: Learning a general and extensible agile locomotion policy using multi-expert distillation and rl fine-tuning

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer