hub Mixed citations

Mujoco: A physics en- gine for model-based control, in: 2012 IEEE/RSJ International Con- ference on Intelligent Robots and Systems, IEEE

Todorov, E · 2012 · arXiv 2012.638610

Mixed citation behavior. Most common role is background (67%).

45 Pith papers citing it

Background 67% of classified citations

read on arXiv browse 45 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 7 dataset 2

citation-polarity summary

background 6 use dataset 2 unclear 1

representative citing papers

SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects

cs.AI · 2026-05-19 · unverdicted · novelty 7.0

SceneCode compiles natural language prompts into executable code programs that generate editable, articulated indoor scenes for physics simulation.

Randomized Advantage Transformation (RAT): Computing Natural Policy Gradients via Direct Backpropagation

cs.LG · 2026-05-18 · unverdicted · novelty 7.0

RAT reformulates regularized natural policy gradients as vanilla gradients with a transformed advantage, computed efficiently via randomized block Kaczmarz iterations on on-policy data.

CelloCut: Constructive Watertight Remeshing via Tetrahedral Cell Cuts

cs.GR · 2026-05-18 · unverdicted · novelty 7.0

CelloCut formulates watertight remeshing as binary labeling on a Delaunay tetrahedral partition solved by graph-cut minimization with one-sided constraints to guarantee volumetrically consistent solids.

EgoFun3D: Modeling Interactive Objects from Egocentric Videos using Function Templates

cs.CV · 2026-04-13 · unverdicted · novelty 7.0

EgoFun3D creates a new task, 271-video dataset, and pipeline using function templates to model interactive 3D objects from egocentric videos for simulation.

HeiSD: Hybrid Speculative Decoding for Embodied Vision-Language-Action Models with Kinematic Awareness

cs.RO · 2026-03-18 · unverdicted · novelty 7.0

HeiSD delivers up to 2.45x faster inference for embodied VLA models by hybridizing speculative decoding with kinematic boundary detection and error-mitigation tricks while preserving task success rates.

Continuum Robot Localization using Distributed Time-of-Flight Sensors

cs.RO · 2026-02-06 · conditional · novelty 7.0

Distributed low-resolution time-of-flight sensors along a 53 cm continuum robot, fused with a shape prior, achieve 2.5 cm position and 7.2 degree orientation localization error in simulation and real experiments across multiple environments.

GLUE: Coordinating Pre-Trained Generative Models for System-Level Design

cs.CE · 2025-12-22 · conditional · novelty 7.0

GLUE orchestrates frozen pre-trained generative models into a system-level design generator that enforces feasibility, performance, and diversity, with data-driven and data-free variants benchmarked on UAV design.

Frictional Q-Learning

cs.LG · 2025-09-24 · unverdicted · novelty 7.0

Frictional Q-Learning encodes supported actions as tangent directions on an action manifold using a contrastive variational autoencoder to reduce extrapolation errors in off-policy reinforcement learning.

BeyondMimic: From Motion Tracking to Versatile Humanoid Control via Guided Diffusion

cs.RO · 2025-08-11 · conditional · novelty 7.0

BeyondMimic combines compact motion tracking with a unified guided latent diffusion model to master diverse agile behaviors from human demos and solve unseen downstream tasks via test-time classifier guidance.

LLMPhy: Parameter-Identifiable Physical Reasoning Combining Large Language Models and Physics Engines

cs.LG · 2024-11-12 · unverdicted · novelty 7.0

LLMPhy uses iterative LLM-generated programs executed in physics engines to solve continuous parameter estimation and discrete scene layout problems, outperforming prior black-box methods on three new zero-shot physical reasoning datasets.

Curriculum reinforcement learning with measurable task representation learning

cs.LG · 2026-05-22 · unverdicted · novelty 6.0

A VAE-based latent task representation enables automatic curriculum generation in CRL for non-Euclidean navigation tasks, outperforming interpolation and GAN-based methods in experiments.

ARC-RL: A Reinforcement Learning Playground Inspired by ARC Raiders

cs.RO · 2026-05-19 · accept · novelty 6.0 · 2 refs

ARC-RL is a new suite of four MuJoCo continuous-control environments featuring game-inspired hexapod and quadruped morphologies, a single closed-form multi-component reward function, CPG demonstrators, and empirical comparisons of online and offline-to-online RL algorithms.

Global Convergence of Sampling-Based Nonconvex Optimization through Diffusion-Style Smoothing

cs.LG · 2026-05-15 · unverdicted · novelty 6.0

Recasts sampling-based nonconvex optimization as smoothed gradient descent to obtain non-asymptotic convergence guarantees and introduces the DIDA annealed algorithm that converges to the global optimum.

PhyMotion: Structured 3D Motion Reward for Physics-Grounded Human Video Generation

cs.CV · 2026-05-14 · conditional · novelty 6.0

PhyMotion scores generated human videos by grounding recovered 3D poses in a physics simulator across kinematic, contact, and dynamic axes, yielding stronger human correlation and larger RL post-training gains than prior 2D rewards.

R2R2: Robust Representation for Intensive Experience Reuse via Redundancy Reduction in Self-Predictive Learning

cs.LG · 2026-05-13 · unverdicted · novelty 6.0

R2R2 introduces a non-centered regularization objective for SPL that addresses conflicts with spectral properties, leading to better performance on continuous control tasks at high UTD ratios.

Mirror, Mirror on the Wall: Can VLM Agents Tell Who They Are at All?

cs.AI · 2026-05-09 · unverdicted · novelty 6.0

Stronger VLM agents use mirror reflections for self-identification in controlled 3D tests, while weaker ones inspect but fail to extract or correctly attribute self-relevant information.

Lucid-XR: An Extended-Reality Data Engine for Robotic Manipulation

cs.RO · 2026-04-30 · unverdicted · novelty 6.0

Lucid-XR uses XR-headset physics simulation and physics-guided video generation to create synthetic data that trains robot policies transferring zero-shot to unseen real-world manipulation tasks.

VADF: Vision-Adaptive Diffusion Policy Framework for Efficient Robotic Manipulation

cs.RO · 2026-04-17 · unverdicted · novelty 6.0

VADF adds an Adaptive Loss Network for hard-negative training sampling and a Hierarchical Vision Task Segmenter for adaptive noise scheduling during inference to speed convergence and reduce timeouts in diffusion robotic policies.

Solving Physics Olympiad via Reinforcement Learning on Physics Simulators

cs.LG · 2026-04-13 · unverdicted · novelty 6.0

Physics simulators generate synthetic QA data for RL training that improves LLM performance on IPhO problems by 5-10 percentage points.

FlashSAC: Fast and Stable Off-Policy Reinforcement Learning for High-Dimensional Robot Control

cs.LG · 2026-04-06 · unverdicted · novelty 6.0 · 2 refs

FlashSAC improves training speed and final performance of off-policy RL on high-dimensional robot tasks by reducing update frequency, increasing model scale, and bounding norms to limit critic error accumulation.

frax: Fast Robot Kinematics and Dynamics in JAX

cs.RO · 2026-04-05 · unverdicted · novelty 6.0

frax is a new open-source JAX library delivering low-microsecond CPU dynamics and over 100 million GPU evaluations per second for robot kinematics and dynamics with autodiff support.

HUSKY: Humanoid Skateboarding System via Physics-Aware Whole-Body Control

cs.RO · 2026-02-03 · conditional · novelty 6.0

HUSKY combines humanoid-skateboard dynamics modeling with adversarial motion priors and physics-guided lean-to-steer strategies to achieve real-world stable skateboarding on a humanoid robot.

Toward Reliable Sim-to-Real Predictability for MoE-based Robust Quadrupedal Locomotion

cs.RO · 2026-01-31 · unverdicted · novelty 6.0

MoE-based locomotion policy with RoboGauge metrics achieves reliable sim-to-real transfer, enabling robust quadrupedal walking on challenging unseen terrains up to 4 m/s.

Neural CDEs as Correctors for Learned Time Series Models

cs.LG · 2025-12-13 · unverdicted · novelty 6.0

Neural CDEs serve as correctors that reduce error accumulation in multi-step forecasts from learned time-series models across synthetic, physics, and real-world data.

citing papers explorer

Showing 1 of 1 citing paper after filters.

AWAC: Accelerating Online Reinforcement Learning with Offline Datasets cs.LG · 2020-06-16 · unverdicted · none · ref 53
AWAC combines offline data with online RL via advantage-weighted actor-critic updates to enable faster acquisition of robotic skills such as dexterous manipulation.

Mujoco: A physics en- gine for model-based control, in: 2012 IEEE/RSJ International Con- ference on Intelligent Robots and Systems, IEEE

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer