hub

Learning quadrupedal locomotion over challenging terrain

· 2020 · DOI 10.1126/scirobotics.abc5986

12 Pith papers cite this work. Polarity classification is still indexing.

12 Pith papers citing it

open at publisher browse 12 citing papers

hub tools

JSON dossier citing papers JSON publisher DOI

citation-role summary

background 3

citation-polarity summary

background 3

representative citing papers

MPC-Injection: Biasing Off-Policy Locomotion RL Toward Controller-Induced Behavior Basins

cs.RO · 2026-06-24 · unverdicted · novelty 7.0

MPC-Injection biases off-policy RL locomotion policies toward controller-induced behavior basins by injecting MPC transitions into the replay buffer.

Instrumentation for Imitation Learning: Enhancing Training Datasets for Clothes Hanger Insertion

cs.RO · 2026-05-22 · unverdicted · novelty 6.0

Instrumented objects boost diffusion policy success in robotic hanger insertion by 14-25 percentage points over vision-only baselines, and augmenting datasets with instrumented expert rollouts lets a vision-only student match the instrumented expert.

ARC-RL: A Reinforcement Learning Playground Inspired by ARC Raiders

cs.RO · 2026-05-19 · accept · novelty 6.0 · 2 refs

ARC-RL is a new suite of four MuJoCo continuous-control environments featuring game-inspired hexapod and quadruped morphologies, a single closed-form multi-component reward function, CPG demonstrators, and empirical comparisons of online and offline-to-online RL algorithms.

FlashSAC: Fast and Stable Off-Policy Reinforcement Learning for High-Dimensional Robot Control

cs.LG · 2026-04-06 · unverdicted · novelty 6.0 · 2 refs

FlashSAC improves training speed and final performance of off-policy RL on high-dimensional robot tasks by reducing update frequency, increasing model scale, and bounding norms to limit critic error accumulation.

Iteratively Learning Muscle Memory for Legged Robots to Master Adaptive and High Precision Locomotion

cs.RO · 2025-07-18 · unverdicted · novelty 6.0

Integrates iterative learning control with a torque library to enable high-precision adaptive locomotion on bipedal and quadrupedal robots, reducing tracking errors by up to 85% and achieving over 30x faster control rates.

T-GMP: Terrain-conditioned Generative Motion Priors for Versatile and Natural Humanoid Locomotion

cs.RO · 2026-06-05 · unverdicted · novelty 5.0

T-GMP learns a terrain-conditioned latent motion manifold via CVAE from demonstrations and integrates it into an adversarial pipeline with a foothold penalty for versatile, natural humanoid locomotion.

Learning Locomotion on Complex Terrain for Quadrupedal Robots with Foot Position Maps and Stability Rewards

cs.RO · 2026-04-03 · unverdicted · novelty 5.0

Integrating foot position maps into heightmaps and adding a locomotion-stability reward in an attention-based RL framework improves quadrupedal success rates on both trained and out-of-domain complex terrains.

KYON: Semi-Modular Wheel-Legged Quadruped With Agile Bimanual Capability

cs.RO · 2026-06-29 · unverdicted · novelty 4.0

KYON is a semi-modular wheel-legged quadruped with reconfigurable lower legs, base-mounted actuators, and bimanual manipulation, using whole-body control plus RL policy for dynamic locomotion and tasks in unstructured environments.

Learning Perceptive Platform Adaptive Locomotion Controllers for Quadrupedal Robots

cs.RO · 2026-06-23 · unverdicted · novelty 4.0

Empirical comparison of blind, critic-perceptive, and fully perceptive variants of morphology-aware RL locomotion controllers shows critic-only perception improves robustness over blind baselines while remaining more stable under perception noise than full perception.

Long-Distance Real-World Navigation of the Legged-Wheeled Robot Go2-W Using Deep Reinforcement Learning

cs.RO · 2026-06-19 · unverdicted · novelty 4.0

A DRL locomotion controller extended from prior quadruped work enabled the Go2-W robot to complete 2.8 km of autonomous real-world navigation including mixed terrain and stairs.

Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient

cs.RO · 2026-05-26 · unverdicted · novelty 4.0

SDPG is a new on-policy visual RL algorithm that estimates gradients via stochastic perturbations of rollouts, achieving faster training and lower memory use than baselines on visual MuJoCo tasks while adding new robotics benchmarks and sim-to-real results.

Neuromorphic Reinforcement Learning for Quadruped Locomotion Control on Uneven Terrain

cs.NE · 2026-05-10

citing papers explorer

Showing 3 of 3 citing papers after filters.

FlashSAC: Fast and Stable Off-Policy Reinforcement Learning for High-Dimensional Robot Control cs.LG · 2026-04-06 · unverdicted · none · ref 41 · 2 links
FlashSAC improves training speed and final performance of off-policy RL on high-dimensional robot tasks by reducing update frequency, increasing model scale, and bounding norms to limit critic error accumulation.
Learning Locomotion on Complex Terrain for Quadrupedal Robots with Foot Position Maps and Stability Rewards cs.RO · 2026-04-03 · unverdicted · none · ref 8
Integrating foot position maps into heightmaps and adding a locomotion-stability reward in an attention-based RL framework improves quadrupedal success rates on both trained and out-of-domain complex terrains.
Neuromorphic Reinforcement Learning for Quadruped Locomotion Control on Uneven Terrain cs.NE · 2026-05-10 · unreviewed · ref 24

Learning quadrupedal locomotion over challenging terrain

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer