Learning humanoid locomotion with perceptive internal model

2024 · arXiv 2411.14386

10 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

SigLoMa: Learning Open-World Quadrupedal Loco-Manipulation from Ego-Centric Vision

cs.RO · 2026-05-05 · unverdicted · novelty 6.0

SigLoMa enables dynamic loco-manipulation on quadrupeds from ego-centric 5 Hz vision alone by using Sigma Points for scalable exteroception, an ego-centric Kalman Filter for high-rate state estimation, and an active sampling curriculum, matching expert human teleoperation performance.

HAIC: Humanoid Agile Object Interaction Control via Dynamics-Aware World Model

cs.RO · 2026-02-12 · unverdicted · novelty 6.0

HAIC enables robust humanoid interactions with underactuated objects by predicting their dynamics from proprioceptive history and using a world model for adaptive control.

TeleGate: Whole-Body Humanoid Teleoperation via Gated Expert Selection with Motion Prior

cs.RO · 2026-02-10 · unverdicted · novelty 6.0

TeleGate achieves high-precision real-time whole-body teleoperation of humanoid robots by dynamically gating between expert policies and using a VAE motion prior to infer future intent from history, outperforming distillation baselines on dynamic motions with only 2.5 hours of mocap data.

DreamPolicy: A Unified World-model Policy for Scalable Humanoid Locomotion

cs.RO · 2025-05-24 · unverdicted · novelty 6.0

DreamPolicy integrates an autoregressive diffusion world model with policy learning to produce a single scalable policy that generalizes to unseen composite terrains for humanoid locomotion.

VAIC: Vision-Guided Humanoid Agile Object Interaction Control via Decoupled Commands

cs.RO · 2026-06-08 · unverdicted · novelty 5.0

VAIC distills a teacher policy into a vision-and-proprioception student policy using recurrent adaptation and decoupled commands, enabling diverse real-robot tasks such as box carrying and skateboarding and outperforming baselines.

LadderMan: Learning Humanoid Perceptive Ladder Climbing

cs.RO · 2026-06-04 · unverdicted · novelty 5.0

A hybrid motion-tracking and imitation-reinforcement pipeline produces a depth-based visuomotor policy that lets humanoids climb varied ladders zero-shot on hardware and perform teleoperated manipulation while climbing.

Now You See That: Learning End-to-End Humanoid Locomotion from Raw Pixels

cs.RO · 2026-02-06 · unverdicted · novelty 5.0

An end-to-end policy learns robust humanoid locomotion directly from noisy depth images via high-fidelity sensor simulation, vision-aware distillation from privileged maps, and terrain-specific multi-critic reward shaping.

Towards Adaptive Humanoid Control via Multi-Behavior Distillation and Reinforced Fine-Tuning

cs.RO · 2025-11-09 · unverdicted · novelty 5.0

A two-stage distillation plus reinforced fine-tuning approach produces a single humanoid locomotion controller that adapts across skills and irregular terrains.

One-shot Adaptation of Humanoid Whole-body Motion with Walking Priors

cs.RO · 2025-10-29 · unverdicted · novelty 5.0

A one-shot adaptation technique for humanoid whole-body motion that computes order-preserving optimal transport distances between walking and target sequences, interpolates geodesic intermediate poses, optimizes for collision-free retargeting, and adapts via reinforcement learning.

TACT-ful: Multi-Channel Terrain Affordance and Compliance Training for Payload-Robust Perceptive Humanoid Locomotion

cs.RO · 2026-06-06 · unverdicted · novelty 4.0

A multi-channel terrain affordance reward combined with lower-body compliance training via virtual wrenches enables end-to-end PPO-trained humanoid policies to walk at 1 m/s on 0.2 m risers with improved payload robustness.

citing papers explorer


SigLoMa: Learning Open-World Quadrupedal Loco-Manipulation from Ego-Centric Vision cs.RO · 2026-05-05 · unverdicted · none · ref 28
HAIC: Humanoid Agile Object Interaction Control via Dynamics-Aware World Model cs.RO · 2026-02-12 · unverdicted · none · ref 39
TeleGate: Whole-Body Humanoid Teleoperation via Gated Expert Selection with Motion Prior cs.RO · 2026-02-10 · unverdicted · none · ref 35
DreamPolicy: A Unified World-model Policy for Scalable Humanoid Locomotion cs.RO · 2025-05-24 · unverdicted · none · ref 28
VAIC: Vision-Guided Humanoid Agile Object Interaction Control via Decoupled Commands cs.RO · 2026-06-08 · unverdicted · none · ref 45
LadderMan: Learning Humanoid Perceptive Ladder Climbing cs.RO · 2026-06-04 · unverdicted · none · ref 21
Now You See That: Learning End-to-End Humanoid Locomotion from Raw Pixels cs.RO · 2026-02-06 · unverdicted · none · ref 26
Towards Adaptive Humanoid Control via Multi-Behavior Distillation and Reinforced Fine-Tuning cs.RO · 2025-11-09 · unverdicted · none · ref 22
One-shot Adaptation of Humanoid Whole-body Motion with Walking Priors cs.RO · 2025-10-29 · unverdicted · none · ref 3
TACT-ful: Multi-Channel Terrain Affordance and Compliance Training for Payload-Robust Perceptive Humanoid Locomotion cs.RO · 2026-06-06 · unverdicted · none · ref 15
