Trains embodiment-aware value functions on up to 50 robots and applies their gradients as differentiable surrogates to optimize held-out robot designs with over 1100 parameters.
Multi-embodiment locomotion at scale with extreme embodiment randomization
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.RO 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
A reinforcement learning method lets legged robots jointly learn information-seeking actions and predict joint-level and global embodiment parameters using a history-augmented URMA model in simulation.
citing papers explorer
-
Shape Your Body: Value Gradients for Multi-Embodiment Robot Design
Trains embodiment-aware value functions on up to 50 robots and applies their gradients as differentiable surrogates to optimize held-out robot designs with over 1100 parameters.
-
Active Embodiment Identification with Reinforcement Learning for Legged Robots
A reinforcement learning method lets legged robots jointly learn information-seeking actions and predict joint-level and global embodiment parameters using a history-augmented URMA model in simulation.