Canonical reference

In: 2024 IEEE International Conference on Robotics and Automation (ICRA)

Gu, Q · 2024 · arXiv 7147.2024

Canonical reference. 76% of citing Pith papers cite this work as background.

108 Pith papers citing it

Background: 76% of classified citations


citation-role summary

background 17 · baseline 2 · extension 1 · other 1

citation-polarity summary

background 16 · baseline 2 · unclear 2 · extend 1

representative citing papers

FactoryNet: A Large-Scale Dataset toward Industrial Time-Series Foundation Models

cs.LG · 2026-05-09 · unverdicted · novelty 8.0 · 2 refs

FactoryNet is the first universal pretraining corpus for industrial time-series data with a shared S-E-F-C schema that supports cross-embodiment transfer and competitive anomaly detection.

Think While You Map: Asynchronous Vision-Language Agents for Incremental 3D Scene Graphs

cs.CV · 2026-06-30 · unverdicted · novelty 7.0 · 2 refs

An asynchronous architecture decouples incremental voxel-based mapping from VLM-based semantic enrichment to produce queryable open-vocabulary 3D scene graphs that match or exceed prior methods on segmentation and grounding benchmarks.

MultiUAV-Plat: An LLM-Oriented Platform, Benchmark and Framework for Multi-UAV Collaborative Task Planning

cs.AI · 2026-06-30 · unverdicted · novelty 7.0

MultiUAV-Plat supplies a new RESTful simulation platform and 1500-task benchmark where Agent4Drone reaches 57.9% task pass rate versus 30.6% for ReAct baseline across 75 multi-UAV missions.

Unleashing Infinite Motion: Scaling Expressive Quadrupedal Motion via Generative Video Priors

cs.RO · 2026-06-26 · conditional · novelty 7.0

Uni-Mo generates 7,488 language-annotated quadruped motions via LLM prompts and video diffusion, lifts them to 3D trajectories, and trains policies achieving 96.7% real-robot success on 392 sampled motions.

MPC-Injection: Biasing Off-Policy Locomotion RL Toward Controller-Induced Behavior Basins

cs.RO · 2026-06-24 · unverdicted · novelty 7.0

MPC-Injection biases off-policy RL locomotion policies toward controller-induced behavior basins by injecting MPC transitions into the replay buffer.

WireCraft: A Simulation Benchmark for Industrial DLO Manipulation

cs.RO · 2026-06-16 · unverdicted · novelty 7.0

WireCraft is a new configurable simulation benchmark for industrial DLO manipulation with three task families, dual physics models, and shared evaluation of RL, IL, and VLA policies showing high success under privileged state but bottlenecks for vision-based methods.

FARM: Find Anything using Relational Spatial Memory

cs.RO · 2026-06-13 · unverdicted · novelty 7.0

FARM creates an open-vocabulary relational spatial memory that improves object retrieval recall by 164-224% over prior methods on 44k language queries across 67 scenes while running at 5-10 Hz.

SemanticXR: Low Power and Real-time Queryable Semantic Mapping with an Object-Level Device-Cloud Architecture

cs.DC · 2026-06-11 · unverdicted · novelty 7.0

SemanticXR introduces the first device-cloud system for real-time open-vocabulary semantic mapping and querying that organizes work around semantically identifiable objects to meet XR power, bandwidth, and memory limits.

CHORUS: Decentralized Multi-Embodiment Collaboration with One VLA Policy

cs.RO · 2026-06-10 · unverdicted · novelty 7.0 · 4 refs

CHORUS adapts a single VLA backbone for decentralized control of diverse robot teams, achieving 64-point gains over from-scratch decentralized baselines and outperforming centralized methods in real-world tasks using only local observations.

TIDES: Time-Derivative Event Simulation via Deformable Reconstruction

cs.CV · 2026-06-01 · unverdicted · novelty 7.0

TIDES simulates realistic event camera streams in continuous time via dynamic Gaussian splatting with adaptive occlusion handling and sensor artifact modeling, claiming SOTA fidelity and better downstream transfer than prior methods.

AcroRL: Learning Aggressive Quadrotor Inversion using Bidirectional Thrust

cs.RO · 2026-05-23 · conditional · novelty 7.0

Reinforcement learning policies for quadrotor inversion transitions with bidirectional thrust outperform optimization baselines by 32% in position RMSE and 57% in settling time in simulation, with successful hardware validation.

RS2AD-LiDAR: End-to-End Autonomous Driving LiDAR Data Generation from Roadside Sensor Observations

cs.CV · 2026-05-22 · unverdicted · novelty 7.0

RS2AD-LiDAR reconstructs vehicle LiDAR data from roadside observations via coordinate transformation, virtual LiDAR modeling and resampling, claimed as the first such method, with experiments showing improved object detection when mixed with real data.

Constrained MPC-Based Motion Planning for Morphing Quadrotors in Ultra-Narrow Passages under Limited Perception

cs.RO · 2026-05-15 · conditional · novelty 7.0

A smooth exponential obstacle cost with reduction factor in nonlinear MPC allows morphing quadrotors to traverse narrow gaps under limited 2D LiDAR perception.

Galilean State Estimation for Inertial Navigation Systems with Unknown Time Delay

cs.RO · 2026-05-13 · unverdicted · novelty 7.0

A Galilean-equivariant filter jointly estimates INS navigation states and unknown GNSS time delays, preserving accuracy and consistency better than EKF in UAV flights and simulations with delays up to 500 ms.

Distance-Constrained Unlabeled Multi-Agent Pathfinding

cs.MA · 2026-05-12 · unverdicted · novelty 7.0

Distance-r Independent Unlabeled Multi-Agent Pathfinding is PSPACE-complete, with reduction-based and configuration-generator algorithms that solve instances with hundreds of agents.

Shields to Guarantee Probabilistic Safety in MDPs

cs.LO · 2026-05-11 · unverdicted · novelty 7.0

New framework for probabilistic safety shields in MDPs showing impossibility of strong classical guarantees and providing weaker but usable alternatives with offline and online constructions.

Beyond Self-Play and Scale: A Behavior Benchmark for Generalization in Autonomous Driving

cs.RO · 2026-05-11 · unverdicted · novelty 7.0 · 2 refs

BehaviorBench reveals that self-play RL policies for autonomous driving overfit to their training traffic agents and do not generalize to other behaviors, motivating a hybrid rule-based plus learned planner.

The Moltbook Files: A Harmless Slopocalypse or Humanity's Last Experiment

cs.CL · 2026-05-08 · unverdicted · novelty 7.0

An AI-agent social platform generated mostly neutral content whose use in fine-tuning reduced model truthfulness comparably to human Reddit data, suggesting limited unique harm but flagging tail risks like secret leaks.

LLM-Foraging: Large Language Models for Decentralized Swarm Robot Foraging

cs.RO · 2026-05-02 · unverdicted · novelty 7.0

LLM-Foraging uses off-the-shelf LLMs for decentralized tactical decisions in CPFA-based swarm foraging, collecting more resources than GA-tuned baselines across 36 varied configurations while showing greater consistency.

ESARBench: A Benchmark for Agentic UAV Embodied Search and Rescue

cs.RO · 2026-05-02 · unverdicted · novelty 7.0

ESARBench is the first unified benchmark for MLLM-driven UAV agents that must explore, locate clues, and decide on victim positions in photorealistic simulated SAR environments.

Towards Multi-Object Nonprehensile Transportation via Shared Teleoperation: A Framework Based on Virtual Object Model Predictive Control

cs.RO · 2026-04-08 · unverdicted · novelty 7.0

The virtual object MPC framework enables stable shared teleoperation for transporting up to nine objects, cutting sliding distance by 72.45% and eliminating tip-overs compared to baseline.

Referring-Aware Visuomotor Policy Learning for Closed-Loop Manipulation

cs.RO · 2026-04-07 · unverdicted · novelty 7.0 · 2 refs

ReV is a referring-aware visuomotor policy using coupled diffusion heads for real-time trajectory replanning in robotic manipulation, trained solely via targeted perturbations to expert demonstrations and achieving higher success rates in simulated and real tasks.

A global dataset of continuous urban dashcam driving

cs.CV · 2026-04-01 · accept · novelty 7.0

CROWD is a new global dataset of 51,753 continuous urban dashcam segments spanning over 20,000 hours from 238 countries, with manual labels and automated object detections for routine driving analysis.

AID: Agent Intent from Diffusion for Multi-Agent Informative Path Planning

cs.RO · 2025-12-02 · conditional · novelty 7.0

AID trains diffusion policies via behavior cloning on existing MAIPP planners followed by RL fine-tuning to achieve faster execution and higher information gain in multi-agent coordination.

citing papers explorer

Showing 6 of 6 citing papers after filters.

Galilean State Estimation for Inertial Navigation Systems with Unknown Time Delay

cs.RO · 2026-05-13 · unverdicted · none · ref 6

A Galilean-equivariant filter jointly estimates INS navigation states and unknown GNSS time delays, preserving accuracy and consistency better than EKF in UAV flights and simulations with delays up to 500 ms.

ESARBench: A Benchmark for Agentic UAV Embodied Search and Rescue

cs.RO · 2026-05-02 · unverdicted · none · ref 72

ESARBench is the first unified benchmark for MLLM-driven UAV agents that must explore, locate clues, and decide on victim positions in photorealistic simulated SAR environments.

Towards Multi-Object Nonprehensile Transportation via Shared Teleoperation: A Framework Based on Virtual Object Model Predictive Control

cs.RO · 2026-04-08 · unverdicted · none · ref 37

The virtual object MPC framework enables stable shared teleoperation for transporting up to nine objects, cutting sliding distance by 72.45% and eliminating tip-overs compared to baseline.

Sumo: Dynamic and Generalizable Whole-Body Loco-Manipulation

cs.RO · 2026-04-09 · unverdicted · none · ref 3 · 2 links

Test-time steering of pre-trained whole-body policies via sample-based planning lets legged robots generalize dynamic loco-manipulation to varied heavy objects and tasks without additional training or tuning.

A Survey on Vision-Language-Action Models: An Action Tokenization Perspective

cs.RO · 2025-07-02 · unverdicted · none · ref 212

The survey frames VLA models as pipelines that generate progressively grounded action tokens and classifies those tokens into eight types to guide future development.

Trajectory Prediction for Autonomous Driving: Progress, Limitations, and Future Directions

cs.RO · 2025-03-05 · unverdicted · none · ref 133

A survey of trajectory prediction techniques for autonomous vehicles that proposes a taxonomy, overviews the prediction pipeline, and highlights remaining research gaps.
