archive
Every paper Pith has read. Search by title, abstract, or pith.
1504 papers in cs.RO · page 1
-
Seamless blending cuts robot-hand takeover jitter by 99.8%
Hand-in-the-Loop: Improving Dexterous VLA via Seamless Interventional Correction
-
Single model tops VLM and world benchmarks while ranking near first on robot actions
Pelican-Unified 1.0: A Unified Embodied Intelligence Model for Understanding, Reasoning, Imagination and Action
-
VLMs fail to locate hidden functional objects from task instructions
SceneFunRI: Reasoning the Invisible for Task-Driven Functional Object Localization
-
Alignment lets robots predict tactile sensations from sight
Let Robots Feel Your Touch: Visuo-Tactile Cortical Alignment for Embodied Mirror Resonance
-
Unified GPU solver gives exact gradients for stiff heterogeneous soft bodies
DiffPhD: A Unified Differentiable Solver for Projective Heterogeneous Materials in Elastodynamics with Contact-Rich GPU-Acceleration
-
Language lets humanoids plan moves before the body starts
Before the Body Moves: Learning Anticipatory Joint Intent for Language-Conditioned Humanoid Control
-
Intermediate foot springs cut quadruped energy use by 17%
Energy-Efficient Quadruped Locomotion with Compliant Feet
-
Diffusion models uncover semantic attacks on vehicle maps
Systematic Discovery of Semantic Attacks in Online Map Construction through Conditional Diffusion
-
Distill refines robot task specs to match true user intent
Distill: Uncovering the True Intent behind Human-Robot Communication
-
Robots dodge obstacles with partial maps via local path fixes
Reactive Planning based Control for Mobile Robots in Obstacle-Cluttered Environments
-
Latent multi-agent rollouts boost closed-loop driving performance
MAPLE: Latent Multi-Agent Play for End-to-End Autonomous Driving
-
GCS optimization matches nonlinear AV trajectories efficiently
Motion Planning for Autonomous Vehicles using Optimization over Graphs of Convex Sets
-
CVaR policies top safety verification rates in robot navigation
Safety-Constrained Reinforcement Learning with Post-Training Reachability Verification for Robot Navigation
-
Sparse relevance heads cut ViT 3D detection time by 3x
SToRe3D: Sparse Token Relevance in ViTs for Efficient Multi-View 3D Object Detection
-
Low-res egocentric images suffice for robot active perception
Behavior Cloning for Active Perception with Low-Resolution Egocentric Vision
-
Target distribution from demos lets robots explore while staying grounded
Ergodic Imitation for Adaptive Exploration around Demonstrations
-
Action history prior straightens robot flow paths
WarmPrior: Straightening Flow-Matching Policies with Temporal Priors
-
Guidance sets safe speed for loiter UAV corridor reinsertion
Loiter UAV Reinsertion Guidance for Fixed-wing UAV Corridors
-
One diffusion model generates LiDAR scans across eight domains
OmniLiDAR: A Unified Diffusion Framework for Multi-Domain 3D LiDAR Generation
-
Language models turn satellite images into UAV search priors
LMPath: Language-Mediated Priors and Path Generation for Aerial Exploration
-
Draft model cuts diffusion VLA latency to 19 ms
Realtime-VLA FLASH: Speculative Inference Framework for Diffusion-based VLAs
-
RoboEvolve pairs a vision-language planner with a video-generation simulator in a…
RoboEvolve: Co-Evolving Planner-Simulator for Robotic Manipulation with Limited Data
-
FrameSkip lifts VLA success from 66.5% to 76.2% with 20% frames
FrameSkip: Learning from Fewer but More Informative Frames in VLA Training
-
One demo lets robots build walls of any length
Manipulation Planning for Construction Activities with Repetitive Tasks
-
CARS attributes AV collisions to driver faults
Learning Responsibility-Attributed Adversarial Scenarios for Testing Autonomous Vehicles
-
TinySDP runs real-time SDP on microcontrollers for robot control
TinySDP: Real Time Semidefinite Optimization for Certifiable and Agile Edge Robotics
-
Monocular RGB builds accurate 3D scene graphs room by room
LEXI-SG: Monocular 3D Scene Graph Mapping with Room-Guided Feed-Forward Reconstruction
-
Sliding-mode law hits targets on exact schedule inside acceleration limits
Bounded-Input True Proportional Navigation for Impact-Time Control
-
Survey ties hardware and methods to dexterous hand progress
Towards Robotic Dexterous Hand Intelligence: A Survey
-
Distilled policy trains quadrupeds to traverse any tunnel
Robot Squid Game: Quadrupedal Locomotion for Traversing Narrow Tunnels
-
Causal joint modeling raises driving score to 87.53
Causality-Aware End-to-End Autonomous Driving via Ego-Centric Joint Scene Modeling
-
User spatial cues lift robot success to 81.2 percent
Guide, Think, Act: Interactive Embodied Reasoning in Vision-Language-Action Models
-
Rotational rings tune magnetic robot tip response in fixed fields
Design of Magnetic Continuum Robots with Tunable Force Response Using Rotational Ring Pairs
-
One vector of atom scores certifies every formula in a temporal fragment
Vision-Based Runtime Monitoring under Varying Specifications using Semantic Latent Representations
-
Reweighting robot actions by velocity boosts performance
AttenA+: Rectifying Action Inequality in Robotic Foundation Models
-
Open standards let one agent model run consistently in three simulators
Integration of an Agent Model into an Open Simulation Architecture for Scenario-Based Testing of Automated Vehicles
-
Retrained VMAF cuts error in teleoperation video ratings by 27 percent
Beyond VMAF: Towards Application-Specific Metrics for Teleoperation Video
-
Refinement cuts UAV localization error in cold starts and against faults
Uncertainty-Aware 3D Position Refinement for Multi-UAV Systems
-
Shared tokens unify bimanual robot control
CUBic: Coordinated Unified Bimanual Perception and Control Framework
-
Flow-adaptive MMD preserves ergodic coverage in dynamic domains
Asymptotically Optimal Ergodic Coverage on Generalized Motion Fields
-
Two demonstrations yield 90% success under OOD robot conditions
SID: Sliding into Distribution for Robust Few-Demonstration Manipulation
-
Rotational latents lift VLA success to 98% on LIBERO
RotVLA: Rotational Latent Action for Vision-Language-Action Model
-
Trajectory augmentation enables offline RL from limited suboptimal data
Trajectory-Level Data Augmentation for Offline Reinforcement Learning
-
BlockVLA adapts pretrained autoregressive vision-language-action models into discrete…
BlockVLA: Accelerating Autoregressive VLA via Block Diffusion Finetuning
-
This paper compares three robot interaction styles—passive (no help)
Exploring Human-Robot Collaboration: Analysis of Interaction Modalities in Challenging Tasks
-
Stereo vision and location priors boost real-world robot navigation
What Limits Vision-and-Language Navigation ?
-
Robot success in crowds rises 14% with human intention forecasts
HCSG: Human-Centric Semantic-Geometric Reasoning for Vision-Language Navigation
-
Decoupled planes enable linear scaling in large VLA RL
D-VLA: A High-Concurrency Distributed Asynchronous Reinforcement Learning Framework for Vision-Language-Action Models
-
D-VLA delivers linear speedup for trillion-param VLA models
D-VLA: A High-Concurrency Distributed Asynchronous Reinforcement Learning Framework for Vision-Language-Action Models
-
Galilean symmetry keeps INS consistent with unknown delays
Galilean State Estimation for Inertial Navigation Systems with Unknown Time Delay