Canonical reference

Title resolution pending

· 2024 · arXiv 7147.2024

Canonical reference. 76% of citing Pith papers cite this work as background.

89 Pith papers citing it

Background 76% of classified citations

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 17 baseline 2 extension 1 other 1

citation-polarity summary

background 16 baseline 2 unclear 2 extend 1

representative citing papers

FactoryNet: A Large-Scale Dataset toward Industrial Time-Series Foundation Models

cs.LG · 2026-05-09 · unverdicted · novelty 8.0 · 2 refs

FactoryNet is the first universal pretraining corpus for industrial time-series data with a shared S-E-F-C schema that supports cross-embodiment transfer and competitive anomaly detection.

MPC-Injection: Biasing Off-Policy Locomotion RL Toward Controller-Induced Behavior Basins

cs.RO · 2026-06-24 · unverdicted · novelty 7.0

MPC-Injection biases off-policy RL locomotion policies toward controller-induced behavior basins by injecting MPC transitions into the replay buffer.

WireCraft: A Simulation Benchmark for Industrial DLO Manipulation

cs.RO · 2026-06-16 · unverdicted · novelty 7.0

WireCraft is a new configurable simulation benchmark for industrial DLO manipulation with three task families, dual physics models, and shared evaluation of RL, IL, and VLA policies showing high success under privileged state but bottlenecks for vision-based methods.

FARM: Find Anything using Relational Spatial Memory

cs.RO · 2026-06-13 · unverdicted · novelty 7.0

FARM creates an open-vocabulary relational spatial memory that improves object retrieval recall by 164-224% over prior methods on 44k language queries across 67 scenes while running at 5-10 Hz.

SemanticXR: Low Power and Real-time Queryable Semantic Mapping with an Object-Level Device-Cloud Architecture

cs.DC · 2026-06-11 · unverdicted · novelty 7.0

SemanticXR introduces the first device-cloud system for real-time open-vocabulary semantic mapping and querying that organizes work around semantically identifiable objects to meet XR power, bandwidth, and memory limits.

CHORUS: Decentralized Multi-Embodiment Collaboration with One VLA Policy

cs.RO · 2026-06-10 · unverdicted · novelty 7.0 · 4 refs

CHORUS adapts a single VLA backbone for decentralized control of diverse robot teams, achieving 64-point gains over from-scratch decentralized baselines and outperforming centralized methods in real-world tasks using only local observations.

TIDES: Time-Derivative Event Simulation via Deformable Reconstruction

cs.CV · 2026-06-01 · unverdicted · novelty 7.0

TIDES simulates realistic event camera streams in continuous time via dynamic Gaussian splatting with adaptive occlusion handling and sensor artifact modeling, claiming SOTA fidelity and better downstream transfer than prior methods.

RS2AD-LiDAR: End-to-End Autonomous Driving LiDAR Data Generation from Roadside Sensor Observations

cs.CV · 2026-05-22 · unverdicted · novelty 7.0

RS2AD-LiDAR reconstructs vehicle LiDAR data from roadside observations via coordinate transformation, virtual LiDAR modeling and resampling, claimed as the first such method, with experiments showing improved object detection when mixed with real data.

Constrained MPC-Based Motion Planning for Morphing Quadrotors in Ultra-Narrow Passages under Limited Perception

cs.RO · 2026-05-15 · conditional · novelty 7.0

A smooth exponential obstacle cost with reduction factor in nonlinear MPC allows morphing quadrotors to traverse narrow gaps under limited 2D LiDAR perception.

Galilean State Estimation for Inertial Navigation Systems with Unknown Time Delay

cs.RO · 2026-05-13 · unverdicted · novelty 7.0

A Galilean-equivariant filter jointly estimates INS navigation states and unknown GNSS time delays, preserving accuracy and consistency better than EKF in UAV flights and simulations with delays up to 500 ms.

Distance-Constrained Unlabeled Multi-Agent Pathfinding

cs.MA · 2026-05-12 · unverdicted · novelty 7.0

Distance-r Independent Unlabeled Multi-Agent Pathfinding is PSPACE-complete, with reduction-based and configuration-generator algorithms that solve instances with hundreds of agents.

Shields to Guarantee Probabilistic Safety in MDPs

cs.LO · 2026-05-11 · unverdicted · novelty 7.0

New framework for probabilistic safety shields in MDPs showing impossibility of strong classical guarantees and providing weaker but usable alternatives with offline and online constructions.

Beyond Self-Play and Scale: A Behavior Benchmark for Generalization in Autonomous Driving

cs.RO · 2026-05-11 · unverdicted · novelty 7.0 · 2 refs

BehaviorBench reveals that self-play RL policies for autonomous driving overfit to their training traffic agents and do not generalize to other behaviors, motivating a hybrid rule-based plus learned planner.

The Moltbook Files: A Harmless Slopocalypse or Humanity's Last Experiment

cs.CL · 2026-05-08 · unverdicted · novelty 7.0

An AI-agent social platform generated mostly neutral content whose use in fine-tuning reduced model truthfulness comparably to human Reddit data, suggesting limited unique harm but flagging tail risks like secret leaks.

LLM-Foraging: Large Language Models for Decentralized Swarm Robot Foraging

cs.RO · 2026-05-02 · unverdicted · novelty 7.0

LLM-Foraging uses off-the-shelf LLMs for decentralized tactical decisions in CPFA-based swarm foraging, collecting more resources than GA-tuned baselines across 36 varied configurations while showing greater consistency.

ESARBench: A Benchmark for Agentic UAV Embodied Search and Rescue

cs.RO · 2026-05-02 · unverdicted · novelty 7.0

ESARBench is the first unified benchmark for MLLM-driven UAV agents that must explore, locate clues, and decide on victim positions in photorealistic simulated SAR environments.

Towards Multi-Object Nonprehensile Transportation via Shared Teleoperation: A Framework Based on Virtual Object Model Predictive Control

cs.RO · 2026-04-08 · unverdicted · novelty 7.0

The virtual object MPC framework enables stable shared teleoperation for transporting up to nine objects, cutting sliding distance by 72.45% and eliminating tip-overs compared to baseline.

Referring-Aware Visuomotor Policy Learning for Closed-Loop Manipulation

cs.RO · 2026-04-07 · unverdicted · novelty 7.0 · 2 refs

ReV is a referring-aware visuomotor policy using coupled diffusion heads for real-time trajectory replanning in robotic manipulation, trained solely via targeted perturbations to expert demonstrations and achieving higher success rates in simulated and real tasks.

A global dataset of continuous urban dashcam driving

cs.CV · 2026-04-01 · accept · novelty 7.0

CROWD is a new global dataset of 51,753 continuous urban dashcam segments spanning over 20,000 hours from 238 countries, with manual labels and automated object detections for routine driving analysis.

AID: Agent Intent from Diffusion for Multi-Agent Informative Path Planning

cs.RO · 2025-12-02 · conditional · novelty 7.0

AID trains diffusion policies via behavior cloning on existing MAIPP planners followed by RL fine-tuning to achieve faster execution and higher information gain in multi-agent coordination.

BEVCALIB: LiDAR-Camera Calibration via Geometry-Guided Bird's-Eye View Representations

cs.CV · 2025-06-03 · unverdicted · novelty 7.0 · 3 refs

BEVCALIB performs LiDAR-camera calibration from raw data by fusing camera and LiDAR bird's-eye view features with a novel feature selector and reports state-of-the-art accuracy on KITTI and NuScenes.

Seeing Isn't Orienting: A Cognitively Grounded Benchmark Reveals Systematic Orientation Failures in MLLMs

cs.CV · 2025-05-27 · unverdicted · novelty 7.0

DORI benchmark shows top vision-language models reach only 54.2% accuracy on coarse orientation tasks and 33% on granular judgments, with sharp drops on reference-frame shifts and compound rotations.

Closed-Loop Vision-Language Planning for Multi-Agent Coordination

cs.AI · 2025-02-14 · unverdicted · novelty 7.0

COMPASS uses VLMs to generate and refine code-based strategies with structured communication, achieving 57% win rate on SMACv2 Protoss 5v5 versus 27% for QMIX.

Flowing With Purpose: Latent Action Guided Flow Matching Policies For Robotic Manipulation

cs.RO · 2026-06-22 · unverdicted · novelty 6.0

LAFM adapts the source distribution in flow matching policies via a latent action model to better match fragmented robotic action spaces, claiming 23.4% higher real-world success and 10.4% on LIBERO-90 while beating larger pre-trained models.

citing papers explorer

Showing 50 of 89 citing papers.

FactoryNet: A Large-Scale Dataset toward Industrial Time-Series Foundation Models cs.LG · 2026-05-09 · unverdicted · none · ref 19 · 2 links
FactoryNet is the first universal pretraining corpus for industrial time-series data with a shared S-E-F-C schema that supports cross-embodiment transfer and competitive anomaly detection.
MPC-Injection: Biasing Off-Policy Locomotion RL Toward Controller-Induced Behavior Basins cs.RO · 2026-06-24 · unverdicted · none · ref 55
MPC-Injection biases off-policy RL locomotion policies toward controller-induced behavior basins by injecting MPC transitions into the replay buffer.
WireCraft: A Simulation Benchmark for Industrial DLO Manipulation cs.RO · 2026-06-16 · unverdicted · none · ref 9
WireCraft is a new configurable simulation benchmark for industrial DLO manipulation with three task families, dual physics models, and shared evaluation of RL, IL, and VLA policies showing high success under privileged state but bottlenecks for vision-based methods.
FARM: Find Anything using Relational Spatial Memory cs.RO · 2026-06-13 · unverdicted · none · ref 53
FARM creates an open-vocabulary relational spatial memory that improves object retrieval recall by 164-224% over prior methods on 44k language queries across 67 scenes while running at 5-10 Hz.
SemanticXR: Low Power and Real-time Queryable Semantic Mapping with an Object-Level Device-Cloud Architecture cs.DC · 2026-06-11 · unverdicted · none · ref 53
SemanticXR introduces the first device-cloud system for real-time open-vocabulary semantic mapping and querying that organizes work around semantically identifiable objects to meet XR power, bandwidth, and memory limits.
CHORUS: Decentralized Multi-Embodiment Collaboration with One VLA Policy cs.RO · 2026-06-10 · unverdicted · none · ref 25 · 4 links
CHORUS adapts a single VLA backbone for decentralized control of diverse robot teams, achieving 64-point gains over from-scratch decentralized baselines and outperforming centralized methods in real-world tasks using only local observations.
TIDES: Time-Derivative Event Simulation via Deformable Reconstruction cs.CV · 2026-06-01 · unverdicted · none · ref 41
TIDES simulates realistic event camera streams in continuous time via dynamic Gaussian splatting with adaptive occlusion handling and sensor artifact modeling, claiming SOTA fidelity and better downstream transfer than prior methods.
RS2AD-LiDAR: End-to-End Autonomous Driving LiDAR Data Generation from Roadside Sensor Observations cs.CV · 2026-05-22 · unverdicted · none · ref 2
RS2AD-LiDAR reconstructs vehicle LiDAR data from roadside observations via coordinate transformation, virtual LiDAR modeling and resampling, claimed as the first such method, with experiments showing improved object detection when mixed with real data.
Constrained MPC-Based Motion Planning for Morphing Quadrotors in Ultra-Narrow Passages under Limited Perception cs.RO · 2026-05-15 · conditional · none · ref 13
A smooth exponential obstacle cost with reduction factor in nonlinear MPC allows morphing quadrotors to traverse narrow gaps under limited 2D LiDAR perception.
Galilean State Estimation for Inertial Navigation Systems with Unknown Time Delay cs.RO · 2026-05-13 · unverdicted · none · ref 6
A Galilean-equivariant filter jointly estimates INS navigation states and unknown GNSS time delays, preserving accuracy and consistency better than EKF in UAV flights and simulations with delays up to 500 ms.
Distance-Constrained Unlabeled Multi-Agent Pathfinding cs.MA · 2026-05-12 · unverdicted · none · ref 1
Distance-r Independent Unlabeled Multi-Agent Pathfinding is PSPACE-complete, with reduction-based and configuration-generator algorithms that solve instances with hundreds of agents.
Shields to Guarantee Probabilistic Safety in MDPs cs.LO · 2026-05-11 · unverdicted · none · ref 37
New framework for probabilistic safety shields in MDPs showing impossibility of strong classical guarantees and providing weaker but usable alternatives with offline and online constructions.
Beyond Self-Play and Scale: A Behavior Benchmark for Generalization in Autonomous Driving cs.RO · 2026-05-11 · unverdicted · none · ref 2 · 2 links
BehaviorBench reveals that self-play RL policies for autonomous driving overfit to their training traffic agents and do not generalize to other behaviors, motivating a hybrid rule-based plus learned planner.
The Moltbook Files: A Harmless Slopocalypse or Humanity's Last Experiment cs.CL · 2026-05-08 · unverdicted · none · ref 156
An AI-agent social platform generated mostly neutral content whose use in fine-tuning reduced model truthfulness comparably to human Reddit data, suggesting limited unique harm but flagging tail risks like secret leaks.
LLM-Foraging: Large Language Models for Decentralized Swarm Robot Foraging cs.RO · 2026-05-02 · unverdicted · none · ref 34
LLM-Foraging uses off-the-shelf LLMs for decentralized tactical decisions in CPFA-based swarm foraging, collecting more resources than GA-tuned baselines across 36 varied configurations while showing greater consistency.
ESARBench: A Benchmark for Agentic UAV Embodied Search and Rescue cs.RO · 2026-05-02 · unverdicted · none · ref 72
ESARBench is the first unified benchmark for MLLM-driven UAV agents that must explore, locate clues, and decide on victim positions in photorealistic simulated SAR environments.
Towards Multi-Object Nonprehensile Transportation via Shared Teleoperation: A Framework Based on Virtual Object Model Predictive Control cs.RO · 2026-04-08 · unverdicted · none · ref 37
The virtual object MPC framework enables stable shared teleoperation for transporting up to nine objects, cutting sliding distance by 72.45% and eliminating tip-overs compared to baseline.
Referring-Aware Visuomotor Policy Learning for Closed-Loop Manipulation cs.RO · 2026-04-07 · unverdicted · none · ref 25 · 2 links
ReV is a referring-aware visuomotor policy using coupled diffusion heads for real-time trajectory replanning in robotic manipulation, trained solely via targeted perturbations to expert demonstrations and achieving higher success rates in simulated and real tasks.
A global dataset of continuous urban dashcam driving cs.CV · 2026-04-01 · accept · none · ref 42
CROWD is a new global dataset of 51,753 continuous urban dashcam segments spanning over 20,000 hours from 238 countries, with manual labels and automated object detections for routine driving analysis.
AID: Agent Intent from Diffusion for Multi-Agent Informative Path Planning cs.RO · 2025-12-02 · conditional · none · ref 26
AID trains diffusion policies via behavior cloning on existing MAIPP planners followed by RL fine-tuning to achieve faster execution and higher information gain in multi-agent coordination.
BEVCALIB: LiDAR-Camera Calibration via Geometry-Guided Bird's-Eye View Representations cs.CV · 2025-06-03 · unverdicted · none · ref 11 · 3 links
BEVCALIB performs LiDAR-camera calibration from raw data by fusing camera and LiDAR bird's-eye view features with a novel feature selector and reports state-of-the-art accuracy on KITTI and NuScenes.
Seeing Isn't Orienting: A Cognitively Grounded Benchmark Reveals Systematic Orientation Failures in MLLMs cs.CV · 2025-05-27 · unverdicted · none · ref 36
DORI benchmark shows top vision-language models reach only 54.2% accuracy on coarse orientation tasks and 33% on granular judgments, with sharp drops on reference-frame shifts and compound rotations.
Closed-Loop Vision-Language Planning for Multi-Agent Coordination cs.AI · 2025-02-14 · unverdicted · none · ref 3
COMPASS uses VLMs to generate and refine code-based strategies with structured communication, achieving 57% win rate on SMACv2 Protoss 5v5 versus 27% for QMIX.
Flowing With Purpose: Latent Action Guided Flow Matching Policies For Robotic Manipulation cs.RO · 2026-06-22 · unverdicted · none · ref 32
LAFM adapts the source distribution in flow matching policies via a latent action model to better match fragmented robotic action spaces, claiming 23.4% higher real-world success and 10.4% on LIBERO-90 while beating larger pre-trained models.
From Pixels to Concepts: Growing Rich 3D Semantic Scene Graph Forests utilizing Foundation Models cs.RO · 2026-06-22 · unverdicted · none · ref 9
Uses VLMs to detect instance concepts and LLMs to infer abstract relationships, assembling them into 3D scene graph forests that are evaluated on uHumans2 and ScanNet and tested in open-vocabulary retrieval on a Spot robot.
World Engine: Towards the Era of Post-Training for Autonomous Driving cs.RO · 2026-06-18 · unverdicted · none · ref 13
World Engine generates realistic safety-critical driving variations from logs for reinforcement post-training, reducing benchmark failures more than data scaling and showing collision reductions plus on-road gains in a production system.
SceneMiner: Identity-Preserving Multi-Task Fine-Tuning for Unified BEV Scene Mining cs.CV · 2026-06-09 · unverdicted · none · ref 7 · 2 links
SceneMiner shows that identity-preserving multi-task fine-tuning removes cross-task interference by zero-initializing new heads and freezing shared-stream parameters, enabling unified BEV scene mining with preserved original heads.
SpaceVLN: A Zero-Shot Vision-and-Language Navigation Agent with Online Spatial Cognitive Memory and Reasoning cs.RO · 2026-06-08 · unverdicted · none · ref 55
SpaceVLN proposes a stagewise closed-loop framework using Spatial Cognitive Memory and Spatial-CoT for zero-shot vision-and-language navigation and object-goal navigation, reporting SOTA results on R2R-CE, RxR-CE, GN-Bench, and HM3D-OVON plus real-robot tests.
Perceptive Behavior Foundation Model: Adapting Human Motion Priors to Robot-Centric Terrain cs.RO · 2026-06-06 · unverdicted · none · ref 22
Perceptive BFM grounds human motion priors in robot terrain perception via terrain-conformal reference synthesis and teacher-student transfer from adapted to raw-reference tracking.
Customer-Agent: Overcoming Context Limitations in Ultra-Long Shopping Trajectories via Tool-Augmented Agents and RLVR cs.CL · 2026-06-06 · unverdicted · none · ref 108
Introduces ShopTrajQA long-context benchmark and an RLVR-trained tool-augmented agent that bypasses LLM context limits by external file storage and code-based retrieval for shopping trajectories.
How to Steer Your Multi-Agent System: Human-LLM Collaborative Planning cs.MA · 2026-05-21 · unverdicted · none · ref 4
Formalizes design space for human-LLM collaborative planning along mode, scope, and level axes; evaluates AMBIPOM prototype via user study and benchmark revealing hybrid workflows and trade-offs.
Scalable Multi-robot Motion Planning via Hierarchical Subproblem Expansion and Workspace Decomposition Refinement cs.RO · 2026-05-19 · unverdicted · none · ref 9
A hierarchical multi-robot motion planner that refines workspace decompositions to enable scalable coordination through discrete search over smaller decoupled subproblems.
LACE: Latent Visual Representation for Cross-Embodiment Learning cs.RO · 2026-05-16 · unverdicted · none · ref 1
LACE aligns human-robot visual features via semantic distribution matching on corresponding body parts plus Gram loss, yielding 65% better zero-shot policy transfer than baseline DINO.
Global Convergence of Sampling-Based Nonconvex Optimization through Diffusion-Style Smoothing cs.LG · 2026-05-15 · unverdicted · none · ref 43 · 4 links
Recasts sampling-based nonconvex optimization as smoothed gradient descent to obtain non-asymptotic convergence guarantees and introduces the DIDA annealed algorithm that converges to the global optimum.
VRA: Grounding Discrete-Time Joint Acceleration in Voltage-Constrained Actuation cs.RO · 2026-05-11 · unverdicted · none · ref 13 · 2 links
VRA grounds discrete-time joint acceleration commands in voltage-constrained actuator physics to eliminate unrealizable accelerations and reduce oscillations in electric motor systems.
VISOR: A Vision-Language Model-based Test Oracle for Testing Robots cs.SE · 2026-05-11 · unverdicted · none · ref 14 · 2 links
VISOR is a VLM-based automated test oracle that evaluates robot task correctness and quality from videos while reporting its own uncertainty, tested on GPT and Gemini across four tasks and over 1000 videos with Gemini showing higher recall and GPT higher precision but low uncertainty-correctness tie
MAG-VLAQ: Multi-modal Aerial-Ground Query Aggregation for Cross-View Place Recognition cs.CV · 2026-05-10 · unverdicted · none · ref 7
MAG-VLAQ fuses multi-modal ground and aerial data via ODE-conditioned vector-of-locally-aggregated-queries to nearly double recall@1 on aerial-ground place recognition benchmarks.
Ray-Aware Pointer Memory with Adaptive Updates for Streaming 3D Reconstruction cs.CV · 2026-05-07 · unverdicted · none · ref 19 · 2 links
The paper proposes ray-aware pointer memory with adaptive retain-or-replace updates to improve long-term stability and pose accuracy in streaming 3D reconstruction.
Navigating the Clutter: Waypoint-Based Bi-Level Planning for Multi-Robot Systems cs.RO · 2026-04-22 · unverdicted · none · ref 76
Waypoint-based bi-level planning with curriculum RLVR improves multi-robot task success rates in dense-obstacle benchmarks over motion-agnostic and VLA baselines.
SafetyALFRED: Evaluating Safety-Conscious Planning of Multimodal Large Language Models cs.AI · 2026-04-21 · unverdicted · none · ref 25
SafetyALFRED shows multimodal LLMs recognize kitchen hazards accurately in QA tests but achieve low success rates when required to mitigate those hazards through embodied planning.
WARPED: Wrist-Aligned Rendering for Robot Policy Learning from Egocentric Human Demonstrations cs.RO · 2026-04-12 · unverdicted · none · ref 5
WARPED synthesizes realistic wrist-view observations from monocular egocentric human videos via foundation models, hand-object tracking, retargeting, and Gaussian Splatting to train visuomotor policies that match teleoperation success rates on five tabletop tasks with 5-8x less collection effort.
A Coordinate-Invariant Local Representation of Motion and Force Trajectories for Identification and Generalization Across Coordinate Systems cs.RO · 2026-04-11 · unverdicted · none · ref 18
Introduces the Dual-Upper-Triangular Invariant Representation (DUTIR) as a coordinate-invariant local representation for motion and force trajectories with improved robustness to singularities and a supporting computational algorithm.
SynFlow: Scaling Up LiDAR Scene Flow Estimation with Synthetic Data cs.CV · 2026-04-10 · conditional · none · ref 36
SynFlow creates a 34-times larger synthetic LiDAR scene flow dataset that lets models trained only on simulation match or beat supervised real-data baselines on multiple benchmarks.
Model Space Reasoning as Search in Feedback Space for Planning Domain Generation cs.AI · 2026-04-09 · unverdicted · none · ref 16
An agentic LLM framework augmented with symbolic feedback and heuristic search over model space generates improved planning domains from natural language descriptions.
Sumo: Dynamic and Generalizable Whole-Body Loco-Manipulation cs.RO · 2026-04-09 · unverdicted · none · ref 3 · 2 links
Test-time steering of pre-trained whole-body policies via sample-based planning lets legged robots generalize dynamic loco-manipulation to varied heavy objects and tasks without additional training or tuning.
frax: Fast Robot Kinematics and Dynamics in JAX cs.RO · 2026-04-05 · unverdicted · none · ref 15
frax is a new open-source JAX library delivering low-microsecond CPU dynamics and over 100 million GPU evaluations per second for robot kinematics and dynamics with autodiff support.
SutureFormer: Learning Surgical Trajectories via Goal-conditioned Offline RL in Pixel Space cs.RO · 2026-03-19 · unverdicted · none · ref 23
SutureFormer models needle tip movement in video as sequential pixel-space actions via goal-conditioned offline RL with spline-based reward densification, cutting average displacement error by 58.6% on a new 1,158-trajectory kidney suturing dataset.
Acoustic Feedback for Closed-Loop Force Control in Robotic Grinding cs.RO · 2026-02-24 · unverdicted · none · ref 11
AFRG estimates grinding force from contact microphone audio for closed-loop robotic control, delivering 4-fold better consistency across disc conditions at roughly 200 times lower cost than force sensors.
Genie Sim 3.0 : A High-Fidelity Comprehensive Simulation Platform for Humanoid Robot cs.RO · 2026-01-05 · unverdicted · none · ref 8
Genie Sim 3.0 introduces an LLM-powered scene generator, the first LLM-based automated evaluation benchmark, and a large open synthetic dataset that demonstrates zero-shot sim-to-real transfer for robotic manipulation policies.
$\pi^{*}_{0.6}$: a VLA That Learns From Experience cs.LG · 2025-11-18 · unverdicted · none · ref 22 · 2 links
RECAP enables a generalist VLA to self-improve via advantage-conditioned RL on mixed real-world data, more than doubling throughput and halving failure rates on hard manipulation tasks.

Title resolution pending

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer