hub Mixed citations

Mujoco: A physics en- gine for model-based control, in: 2012 IEEE/RSJ International Con- ference on Intelligent Robots and Systems, IEEE

Todorov, E · 2012 · arXiv 2012.638610

Mixed citation behavior. Most common role is background (67%).

72 Pith papers citing it

Background 67% of classified citations

read on arXiv browse 72 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 7 dataset 2

citation-polarity summary

background 6 use dataset 2 unclear 1

representative citing papers

Unleashing Infinite Motion: Scaling Expressive Quadrupedal Motion via Generative Video Priors

cs.RO · 2026-06-26 · conditional · novelty 7.0

Uni-Mo generates 7,488 language-annotated quadruped motions via LLM prompts and video diffusion, lifts them to 3D trajectories, and trains policies achieving 96.7% real-robot success on 392 sampled motions.

MPC-Injection: Biasing Off-Policy Locomotion RL Toward Controller-Induced Behavior Basins

cs.RO · 2026-06-24 · unverdicted · novelty 7.0

MPC-Injection biases off-policy RL locomotion policies toward controller-induced behavior basins by injecting MPC transitions into the replay buffer.

HARBOR: A Harness Framework for Agentic Robot Reinforcement Learning

cs.RO · 2026-06-07 · unverdicted · novelty 7.0

HARBOR is a new agentic harness framework that automates robot RL workflows end-to-end across 16 tasks in manipulation, locomotion, and dexterous control, matching or exceeding default configurations while enabling sim-to-real transfer.

SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects

cs.AI · 2026-05-19 · unverdicted · novelty 7.0

SceneCode compiles natural language prompts into executable code programs that generate editable, articulated indoor scenes for physics simulation.

Randomized Advantage Transformation (RAT): Computing Natural Policy Gradients via Direct Backpropagation

cs.LG · 2026-05-18 · unverdicted · novelty 7.0

RAT reformulates regularized natural policy gradients as vanilla gradients with a transformed advantage, computed efficiently via randomized block Kaczmarz iterations on on-policy data.

CelloCut: Constructive Watertight Remeshing via Tetrahedral Cell Cuts

cs.GR · 2026-05-18 · unverdicted · novelty 7.0

CelloCut formulates watertight remeshing as binary labeling on a Delaunay tetrahedral partition solved by graph-cut minimization with one-sided constraints to guarantee volumetrically consistent solids.

EgoFun3D: Modeling Interactive Objects from Egocentric Videos using Function Templates

cs.CV · 2026-04-13 · unverdicted · novelty 7.0

EgoFun3D creates a new task, 271-video dataset, and pipeline using function templates to model interactive 3D objects from egocentric videos for simulation.

HeiSD: Hybrid Speculative Decoding for Embodied Vision-Language-Action Models with Kinematic Awareness

cs.RO · 2026-03-18 · unverdicted · novelty 7.0

HeiSD delivers up to 2.45x faster inference for embodied VLA models by hybridizing speculative decoding with kinematic boundary detection and error-mitigation tricks while preserving task success rates.

Continuum Robot Localization using Distributed Time-of-Flight Sensors

cs.RO · 2026-02-06 · conditional · novelty 7.0

Distributed low-resolution time-of-flight sensors along a 53 cm continuum robot, fused with a shape prior, achieve 2.5 cm position and 7.2 degree orientation localization error in simulation and real experiments across multiple environments.

GLUE: Coordinating Pre-Trained Generative Models for System-Level Design

cs.CE · 2025-12-22 · conditional · novelty 7.0

GLUE orchestrates frozen pre-trained generative models into a system-level design generator that enforces feasibility, performance, and diversity, with data-driven and data-free variants benchmarked on UAV design.

BeyondMimic: From Motion Tracking to Versatile Humanoid Control via Guided Diffusion

cs.RO · 2025-08-11 · conditional · novelty 7.0

BeyondMimic combines compact motion tracking with a unified guided latent diffusion model to master diverse agile behaviors from human demos and solve unseen downstream tasks via test-time classifier guidance.

LLMPhy: Parameter-Identifiable Physical Reasoning Combining Large Language Models and Physics Engines

cs.LG · 2024-11-12 · unverdicted · novelty 7.0

LLMPhy uses iterative LLM-generated programs executed in physics engines to solve continuous parameter estimation and discrete scene layout problems, outperforming prior black-box methods on three new zero-shot physical reasoning datasets.

Scalable Behavior Cloning with Open Data, Training, and Evaluation

cs.RO · 2026-06-25 · unverdicted · novelty 6.0

Releases the largest open teleoperation dataset for robot manipulation together with hardware, simulation, and training infrastructure to support scalable behavior cloning.

Hallucination in World Models is Predictable and Preventable

cs.LG · 2026-06-25 · unverdicted · novelty 6.0

Hallucination in world models is a data coverage issue predictable by three signals and preventable through targeted training sampling and online data collection.

OmniContact: Chaining Meta-Skills via Contact Flow for Generalizable Humanoid Loco-Manipulation

cs.RO · 2026-06-24 · unverdicted · novelty 6.0

OmniContact introduces contact flow as a shared representation of body trajectories and contact signals to learn and chain loco-manipulation meta-skills, reporting 98.7% success on box carrying and 76.5% on push-stack tasks.

AutoDex: An Automated Real-World System for Dexterous Grasping Data Collection

cs.RO · 2026-06-22 · accept · novelty 6.0

AutoDex automates the full perception-execution-labeling-reset loop for real-world dexterous grasping data collection, delivering 4.8x throughput over teleoperation and 76% success for retrieved grasps versus 34% from simulation-only data.

NASDAQ: Normalized Observation Space Dynamics-Augmented Q-Learning

cs.LG · 2026-06-19 · unverdicted · novelty 6.0

NASDAQ normalizes observations in an online RL setting so that dynamics prediction losses are balanced across dimensions, yielding competitive performance with lower wall-time than prior model-based and self-predictive methods.

Inductive Generalization for Robotic Manipulation

cs.RO · 2026-06-19 · unverdicted · novelty 6.0

The paper introduces an inductive generalization evaluation protocol for manipulation policies and shows that SOTA vision-language-action models fail on progressively harder task variants.

Do as I Do: Dexterous Manipulation Data from Everyday Human Videos

cs.RO · 2026-06-17 · unverdicted · novelty 6.0

DO AS I DO reconstructs and retargets hand-object interactions from in-the-wild monocular RGB videos to produce dexterous robot manipulation trajectories, outperforming prior methods on ground-truth and online video datasets.

AnnotateAnything: Automatic Annotation of 3D Assets for Robot Manipulation

cs.RO · 2026-06-16 · unverdicted · novelty 6.0

AnnotateAnything converts passive 3D assets into manipulation-ready assets by combining vision-language reasoning for semantics with parallel physics pipelines for executable action annotations such as grasps and articulations.

Shield-Loco: Shielding Locomotion Policies with Predictive Safety Filtering

cs.RO · 2026-06-05 · unverdicted · novelty 6.0

A post-hoc predictive safety filter adjusts RL policy contact locations for quadruped robots via sampling-based optimization on a full-physics model, reducing safety violations in cluttered environments with minimal performance deviation.

AEGIS: A Backup Reflex for Physical AI

cs.AI · 2026-06-04 · unverdicted · novelty 6.0

AEGIS uses activation probes for early-warning detection of high-risk steps in weak policies and selectively escalates to stronger policies, recovering 10.1% of lost trajectories on LIBERO-Spatial while activating the strong policy on only 38% of steps.

Shape Your Body: Value Gradients for Multi-Embodiment Robot Design

cs.RO · 2026-05-30 · unverdicted · novelty 6.0

Trains embodiment-aware value functions on up to 50 robots and applies their gradients as differentiable surrogates to optimize held-out robot designs with over 1100 parameters.

Curriculum reinforcement learning with measurable task representation learning

cs.LG · 2026-05-22 · unverdicted · novelty 6.0

A VAE-based latent task representation enables automatic curriculum generation in CRL for non-Euclidean navigation tasks, outperforming interpolation and GAN-based methods in experiments.

citing papers explorer

Showing 22 of 72 citing papers.

Closed-Loop Sim-to-Real Reinforcement Learning for Deformable Microfiber Shape Control cs.RO · 2026-05-20 · unverdicted · none · ref 22
A closed-loop sim-to-real RL policy trained in a simplified frictionless simulator achieves sub-millimeter microfiber shape control on physical hardware via visual feedback without retraining.
SmoCap: Unified Scale-Pose Canonicalization with Proxy-Mapped Trust-Region QP cs.RO · 2026-05-20 · unverdicted · none · ref 25
SmoCap performs unified scale-pose canonicalization for motion capture by solving constrained trust-region QPs with analytical proxy-mapped Jacobians in a sparse control subspace.
Automatically Improving Simulation Physics for Articulated Objects cs.RO · 2026-05-18 · unverdicted · none · ref 14
A simulator-in-the-loop multi-modal method refines physical properties of incomplete 3D articulated objects to improve simulation stability and downstream robot policy performance.
Before the Body Moves: Learning Anticipatory Joint Intent for Language-Conditioned Humanoid Control cs.RO · 2026-05-14 · unverdicted · none · ref 3 · 2 links
DAJI is a hierarchical framework using distillation and autoregressive generation to learn future-aware joint intents for language-conditioned humanoid robot control.
Rethinking Priority Scheduling for Sequential Multi-Agent Decision Making in Stackelberg Games cs.MA · 2026-05-08 · unverdicted · none · ref 11
HPA dynamically selects agent decision orders in Stackelberg games to improve equilibria and performance in multi-agent MuJoCo control tasks.
Gated Memory Policy cs.RO · 2026-04-21 · unverdicted · none · ref 48
GMP selectively activates and represents memory via a gate and lightweight cross-attention, yielding 30.1% higher success on non-Markovian robotic tasks while staying competitive on Markovian ones.
ComSim: Building Scalable Real-World Robot Data Generation via Compositional Simulation cs.RO · 2026-04-13 · unverdicted · none · ref 43
Compositional Simulation generates scalable real-world robot training data by combining classical simulation with neural simulation in a closed-loop real-sim-real augmentation pipeline.
From Fold to Function: Simulation-Driven Design of Origami Mechanisms cs.RO · 2025-11-13 · conditional · none · ref 36
A simulation framework using MuJoCo deformable bodies and CMA-ES optimization enables rapid design and experimental validation of origami mechanisms like an improved catapult.
MOBIUS: A Multi-Modal Bipedal Robot that can Walk, Crawl, Climb, and Roll cs.RO · 2025-11-03 · unverdicted · none · ref 42
MOBIUS is a multi-modal bipedal robot with hybrid reinforcement learning and force control plus an MIQCP planner that enables walking, crawling, climbing, and rolling on varied terrains.
Geometric Analysis of Neural Regression Collapse via Intrinsic Dimension cs.LG · 2025-10-01 · unverdicted · none · ref 22
Neural regression collapse occurs when last-layer feature intrinsic dimension falls below target intrinsic dimension, creating over-compressed and under-compressed regimes that govern generalization based on data quantity and noise.
Behavior Synthesis via Contact-Aware Fisher Information Maximization cs.RO · 2025-05-18 · unverdicted · none · ref 50
Derives a contact-aware Fisher information measure to synthesize robot behaviors that maximize information-rich contacts for efficient object parameter learning.
Gymnasium: A Standard Interface for Reinforcement Learning Environments cs.LG · 2024-07-24 · accept · none · ref 31
Gymnasium establishes a standardized API for RL environments to improve interoperability, reproducibility, and ease of development in reinforcement learning.
Latent Linear Quadratic Regulator for Robotic Control Tasks cs.RO · 2024-07-15 · unverdicted · none · ref 15
LaLQR learns a latent linear-quadratic representation of robotic systems by imitating MPC to enable efficient LQR control.
Human2Any: Human-to-Robot Transfer via Constraint-Aware Compositional Planning cs.RO · 2026-06-27 · unverdicted · none · ref 73
Human2Any transfers human video demonstrations to robots by representing tasks as object-object interactions and composing learned priors with robot-side planning.
A Scalable Embodied Intelligence Platform for Seamless Real-to-Sim-to-Real Transfer of Household Mobile Manipulation Tasks cs.RO · 2026-06-17 · unverdicted · none · ref 13
BestMan is a robotics platform with ASG for scene reconstruction, simulation-guided skill learning, and HUM middleware to enable seamless real-to-sim-to-real transfer in household mobile manipulation.
TACT-ful: Multi-Channel Terrain Affordance and Compliance Training for Payload-Robust Perceptive Humanoid Locomotion cs.RO · 2026-06-06 · unverdicted · none · ref 39
A multi-channel terrain affordance reward combined with lower-body compliance training via virtual wrenches enables end-to-end PPO-trained humanoid policies to walk at 1 m/s on 0.2 m risers with improved payload robustness.
Silent Failures in Physical AI: A Literature Review of Runtime Action Authorization for Autonomous Systems cs.RO · 2026-05-23 · unverdicted · none · ref 110
A literature review that defines silent physical-action failures in Physical AI and identifies the lack of complete runtime authorization boundaries across surveyed technical streams.
Distilling Game Code World Model Generation into Lightweight Large Language Models cs.AI · 2026-05-23 · unverdicted · none · ref 31
SFT followed by RLVR on Qwen2.5-3B-Instruct raises syntactic and execution correctness when generating Game Code World Models across 30 games.
Enhancing RL Generalizability in Robotics through SHAP Analysis of Algorithms and Hyperparameters cs.LG · 2026-05-04 · unverdicted · none · ref 24
A SHAP analysis framework is introduced to decompose configuration impacts on RL generalization and guide selection for improved performance in robotics.
The embodied brain: Bridging the brain, body, and behavior with neuromechanical digital twins q-bio.NC · 2026-01-12 · unverdicted · none · ref 10
Neuromechanical digital twins embed neural controllers in simulated bodies to infer unmeasurable biophysical variables, generate testable hypotheses via perturbations, and bridge neuroscience with robotics and machine learning.
ARROW: Augmented Replay for RObust World models cs.LG · 2026-03-12 · unreviewed · ref 29
Frictional Q-Learning cs.LG · 2025-09-24 · unreviewed · ref 25

Mujoco: A physics en- gine for model-based control, in: 2012 IEEE/RSJ International Con- ference on Intelligent Robots and Systems, IEEE

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer