Canonical reference

In: 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp

Romero-Sorozabal, P · 2024 · arXiv 8592.2024

Canonical reference. 73% of citing Pith papers cite this work as background.

55 Pith papers citing it

citation-role summary

background 12 · method 2 · other 1

citation-polarity summary

background 11 · use method 2 · support 1 · unclear 1

representative citing papers

GLENS: Global Search via Learning from Solver Iterates with Diffusion Models

cs.LG · 2026-05-29 · unverdicted · novelty 7.0

GLENS uses diffusion models on solver iterates to generate high-quality and diverse initial guesses for multimodal non-convex optimization, leading to faster solver convergence.

Hijacking Agent Memory: Stealthy Trojan Attacks Through Conversational Interaction

cs.CR · 2026-05-28 · unverdicted · novelty 7.0

MemPoison enables stealthy memory poisoning in LLM agents via dialogue by using semantic relational bridges, entity masquerading, and joint embedding optimization to bypass selective extraction and rewriting, achieving up to 0.95 attack success rate.

MDrive: Benchmarking Closed-Loop Cooperative Driving for End-to-End Multi-agent Systems

cs.RO · 2026-05-11 · unverdicted · novelty 7.0

MDrive benchmark shows multi-agent cooperative driving systems generally outperform single-agent ones in closed-loop settings but perception sharing does not always improve planning and negotiation can harm performance in complex traffic.

Beyond Self-Play and Scale: A Behavior Benchmark for Generalization in Autonomous Driving

cs.RO · 2026-05-11 · unverdicted · novelty 7.0

BehaviorBench reveals that self-play RL policies for autonomous driving overfit to their training traffic agents and do not generalize to other behaviors, motivating a hybrid rule-based plus learned planner.

Towards Multi-Object Nonprehensile Transportation via Shared Teleoperation: A Framework Based on Virtual Object Model Predictive Control

cs.RO · 2026-04-08 · unverdicted · novelty 7.0

The virtual object MPC framework enables stable shared teleoperation for transporting up to nine objects, cutting sliding distance by 72.45% and eliminating tip-overs compared to baseline.

Large Video Planner Enables Generalizable Robot Control

cs.RO · 2025-12-17 · conditional · novelty 7.0

A video foundation model trained on human demonstrations generates zero-shot plans that convert to executable robot actions on novel scenes and tasks.

X-Morph: Human Motion Priors for Scalable Robot Learning Across Morphologies

cs.RO · 2026-06-29 · unverdicted · novelty 6.0

X-Morph retargets human motions to kinematically plausible references for multiple legged morphologies, trains privileged RL trackers, and distills them into deployable policies that generalize and enable teleoperation and text-conditioned generation.

ReGuide: From Test-Time Guidance to Self-Improving Diffusion Policies

cs.LG · 2026-06-27 · unverdicted · novelty 6.0

ReGuide is a self-improving framework that uses phase-conditioned guidance to generate corrective rollouts and absorbs successful ones back into diffusion policy training, yielding 1.3-7.7x success gains on Robomimic tasks.

Flowing With Purpose: Latent Action Guided Flow Matching Policies For Robotic Manipulation

cs.RO · 2026-06-22 · unverdicted · novelty 6.0

LAFM adapts the source distribution in flow matching policies via a latent action model to better match fragmented robotic action spaces, claiming 23.4% higher real-world success and 10.4% on LIBERO-90 while beating larger pre-trained models.

HilDA: Hierarchical Distillation with Diffusion for Advancing Self-Supervised LiDAR Pre-training

cs.CV · 2026-06-18 · unverdicted · novelty 6.0

HilDA pre-trains LiDAR backbones via multi-layer and global distillation from vision models plus temporal occupancy diffusion, yielding SOTA results on detection, flow, and occupancy tasks.

See Selectively, Act Adaptively: Dual-Level Structural Decomposition for Bimanual Robot Manipulation

cs.RO · 2026-06-11 · unverdicted · novelty 6.0

A VLA policy using view-selective visual routing and interaction-aware action MoE improves average success by 27.7% in simulation and 43.3% in real-world bimanual tasks over monolithic baselines.

SpaceVLN: A Zero-Shot Vision-and-Language Navigation Agent with Online Spatial Cognitive Memory and Reasoning

cs.RO · 2026-06-08 · unverdicted · novelty 6.0

SpaceVLN proposes a stagewise closed-loop framework using Spatial Cognitive Memory and Spatial-CoT for zero-shot vision-and-language navigation and object-goal navigation, reporting SOTA results on R2R-CE, RxR-CE, GN-Bench, and HM3D-OVON plus real-robot tests.

RGB-S: Image-Aligned Tactile Saliency for Robust Dexterous Manipulation

cs.RO · 2026-06-07 · unverdicted · novelty 6.0

RGB-S projects tactile contacts onto images as force-modulated Gaussian saliency maps via kinematics and zero-initialized conditioning, raising real-world occluded dexterous manipulation success by 26.7 percentage points over implicit baselines.

Perceptive Behavior Foundation Model: Adapting Human Motion Priors to Robot-Centric Terrain

cs.RO · 2026-06-06 · unverdicted · novelty 6.0 · 2 refs

Perceptive BFM grounds human motion priors in robot terrain perception via terrain-conformal reference synthesis and teacher-student transfer from adapted to raw-reference tracking.

HORIZON: Recoverability-Governed Curriculum for Physical-Domain Scaling

cs.RO · 2026-06-03 · unverdicted · novelty 6.0

HORIZON is a recoverability-governed checkpointed frontier curriculum for on-policy physical-domain scaling on quadruped locomotion that identifies three regularities: uneven widening, non-monotonic composition, and the necessity of joint on-policy interaction.

AFUN: Towards an Affordance Foundation Model for Functionality Understanding

cs.RO · 2026-06-01 · unverdicted · novelty 6.0 · 2 refs

AFUN predicts task-conditional functional masks and 3D post-contact motion curves from RGB-D and language, trained via a standardized multi-source data pipeline, and reports large gains over baselines on segmentation, contact prediction, and motion tasks.

POIROT: Interrogating Agents for Failure Detection in Multi-Agent Systems

cs.AI · 2026-06-01 · unverdicted · novelty 6.0

POIROT protocol repurposes agents in LLM multi-agent systems as an internal diagnostic layer for failure detection, outperforming single-LLM evaluators with gains that increase with complexity, agent count, and fault types.

S2M-Trek: From Single to Multi-Sphere Transport via Per-Frame Deep Sets on a Wheel-Legged Robot

cs.RO · 2026-05-31 · unverdicted · novelty 6.0

Per-Frame Deep Sets enables scaling single-sphere to five-sphere transport on a quadruped by performing permutation-invariant pooling within each history frame, reaching 100% no-drop success in simulation where standard encoders plateau.

Scalable Multi-robot Motion Planning via Hierarchical Subproblem Expansion and Workspace Decomposition Refinement

cs.RO · 2026-05-19 · unverdicted · novelty 6.0

A hierarchical multi-robot motion planner that refines workspace decompositions to enable scalable coordination through discrete search over smaller decoupled subproblems.

Formal Methods Meet LLMs: Auditing, Monitoring, and Intervention for Compliance of Advanced AI Systems

cs.AI · 2026-05-15 · unverdicted · novelty 6.0

Combines LTL formal methods with LLMs for auditing, predictive monitoring, and runtime intervention on temporally extended behavioral constraints, outperforming LLM baselines and reducing violations.

Robust Instruction Compliance in Cooperative Multi-Agent Reinforcement Learning

cs.AI · 2026-05-12 · unverdicted · novelty 6.0

MAVIC corrects Bellman backups at instruction boundaries by adjusting the incoming objective and restoring continuation value, enabling consistent estimation under stochastic instruction switching in cooperative MARL.

Control of Fully Actuated Aerial Vehicles: A Comparison of Model-based and Sensor-based Dynamic Inversion

cs.RO · 2026-05-12 · unverdicted · novelty 6.0

On a fully actuated hexarotor, sensor-based INDI outperforms model-based geometric NDI under mismatches, gusts, and sensor degradation with lower position errors, but NDI tracks attitude better at reduced control rates, providing the first experimental full-pose INDI validation with decoupled axes.

VISOR: A Vision-Language Model-based Test Oracle for Testing Robots

cs.SE · 2026-05-11 · unverdicted · novelty 6.0 · 2 refs

VISOR is a VLM-based automated test oracle that evaluates robot task correctness and quality from videos while reporting its own uncertainty, tested on GPT and Gemini across four tasks and over 1000 videos with Gemini showing higher recall and GPT higher precision but low uncertainty-correctness tie

High Precision Hydraulic Excavator Control for Heavy-Duty Grading

cs.RO · 2026-05-10 · unverdicted · novelty 6.0 · 2 refs

Autonomous excavator controller achieves 1.8 cm RMSE in heavy-duty grading across different hydraulic architectures, outperforming commercial solutions by a factor of 2.6 in precision while better utilizing machine pressure.

citing papers explorer

DigiForest: Digital Analytics and Robotics for Sustainable Forestry

cs.RO · 2026-04-16 · unverdicted · none · ref 7 · 2 links

DigiForest integrates heterogeneous autonomous robots for data collection, automated tree trait extraction, a decision support system for growth forecasting, and autonomous harvesters for selective logging, with real-world tests in European forests.
