Sampling Strategies for Robust Universal Quadrupedal Locomotion Policies

David Rytz; Ioannis Havoutis; Kim Tien Ly

arxiv: 2510.07094 · v2 · pith:5AWTGZZCnew · submitted 2025-10-08 · 💻 cs.RO

Sampling Strategies for Robust Universal Quadrupedal Locomotion Policies

David Rytz , Kim Tien Ly , Ioannis Havoutis This is my paper

classification 💻 cs.RO

keywords samplingjointrobuststrategiescomparedconfigurationsgainslocomotion

0 comments

read the original abstract

This work focuses on sampling strategies of configuration variations for generating robust universal locomotion policies for quadrupedal robots. We investigate the effects of sampling physical robot parameters and joint proportional-derivative gains to enable training a single reinforcement learning policy that generalizes to multiple parameter configurations. Three fundamental joint gain sampling strategies are compared: parameter sampling with (1) linear and polynomial function mappings of mass-to-gains, (2) performance-based adaptive filtering, and (3) uniform random sampling. We improve the robustness of the policy by biasing the configurations using nominal priors and reference models. All training was conducted using the RaiSim simulation environment, tested in simulation on a range of diverse quadrupeds, and zero-shot deployed onto hardware using the ANYmal quadruped robot. Compared to multiple baseline implementations, our results demonstrate the need for significant joint controller gains randomization for robust closing of the sim-to-real gap.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Learning Perceptive Platform Adaptive Locomotion Controllers for Quadrupedal Robots
cs.RO 2026-06 unverdicted novelty 4.0

Empirical comparison of blind, critic-perceptive, and fully perceptive variants of morphology-aware RL locomotion controllers shows critic-only perception improves robustness over blind baselines while remaining more ...