ProbeAct: Probe-Guided Training-Free Failure Recovery in Vision-Language-Action Models
Pith reviewed 2026-06-27 16:31 UTC · model grok-4.3
The pith
A training-free intervention detects and corrects failures in language-guided robot policies at runtime.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
PROBEACT uses a multi-target hidden-state probe to predict 3D object positions, an object-agnostic kinematic state machine to detect failures, and a hierarchical Control Barrier Function filter to encode safe constraints, allowing recovery from failures in language-guided robot policies without modifying their weights.
What carries the argument
The lightweight multi-target hidden-state probe predicting 3D positions from intermediate features with Hungarian-matched tracking, paired with the kinematic state machine and barrier function filter for failure detection and action correction.
If this is right
- Language-guided robot policies can achieve higher success rates on manipulation tasks under perturbations.
- The method works on both base and fine-tuned policies as a universal add-on.
- Failure recovery relies only on internal model signals and kinematics, without external sensors.
- Actions are corrected minimally to preserve the original policy behavior.
Where Pith is reading between the lines
- This could allow deployment of such policies in dynamic environments where initial training data is limited.
- Similar probing techniques might apply to other embodied AI systems for runtime safety.
- Testing on additional benchmarks would clarify the generality of the improvement.
Load-bearing premise
The hidden-state probe accurately predicts 3D object positions from language-guided robot policy features to guide recovery.
What would settle it
A test showing that disabling the position predictions or using inaccurate probes eliminates the success rate improvement on the benchmark.
Figures
read the original abstract
Vision-Language-Action (VLA) models demonstrate strong perfor-1 mance on language-conditioned robotic manipulation within their training dis-2 tribution, yet their generalization capabilities remain fundamentally limited. They3 lack the robustness required to handle perturbations, frequently failing when con-4 fronted with lighting changes, altered camera viewpoints, or small initial-state5 variations. We propose PROBEACT, a training-free runtime intervention frame-6 work that detects and recovers from grasping and placement failures in pre-7 trained VLA policies without modifying their weights or requiring additional8 demonstrations. PROBEACT combines three components: (i) a lightweight multi-9 target hidden-state probe that predicts the 3D positions of task-relevant objects10 from intermediate VLA features, with Hungarian-matched identity tracking for11 multi-object scenes; (ii) an object-agnostic kinematic state machine that detects12 grasp, transport, and placement failures using only gripper-internal signals and13 end-effector kinematics; and (iii) a hierarchical Control Barrier Function (CBF)14 filter that encodes repeated-failure locations as soft safe-set constraints, mini-15 mally correcting VLA actions while preserving baseline behavior. As a plug-and-16 play, training-free intervention loop, PROBEACT is orthogonal to existing train-17 ing pipelines. Evaluated on the LIBERO-plus benchmark, our framework acts as18 a universal safety net, improving the success rate of the OpenVLA-OFT model19 from 69.6% to 74.1%, while demonstrating broad applicability to both base and20 fine-tuned VLA policies.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes PROBEACT, a training-free runtime intervention framework for Vision-Language-Action (VLA) models. It combines (i) a lightweight multi-target hidden-state probe that regresses 3D positions of task-relevant objects from intermediate VLA features (with Hungarian matching for identity tracking), (ii) an object-agnostic kinematic state machine that detects grasp/transport/placement failures from gripper and end-effector signals, and (iii) a hierarchical Control Barrier Function (CBF) filter that encodes repeated-failure locations as soft constraints to minimally correct actions. The central empirical claim is that this plug-and-play loop raises success rate on the LIBERO-plus benchmark from 69.6% to 74.1% for the OpenVLA-OFT policy while remaining orthogonal to training pipelines and applicable to both base and fine-tuned VLAs.
Significance. If the probe's localization accuracy and the overall pipeline's causal contribution are rigorously validated, the result would be significant: a general, training-free safety net that improves robustness to distribution shifts without additional data or weight updates. This is orthogonal to existing VLA fine-tuning literature and could be broadly useful for deployment.
major comments (2)
- [Abstract, §4] Abstract and §4 (Results): The 4.5 pp success-rate gain is presented as direct evidence that the probe-guided recovery works, yet no quantitative probe metrics are supplied (position RMSE, localization success rate, or ablation that removes the probe while keeping the state machine and CBF). Because the probe is the sole source of 3D object state for the downstream kinematic detector and CBF, the absence of these numbers leaves the central claim untestable.
- [§3.1] §3.1 (Probe architecture): The claim that intermediate VLA hidden states contain sufficient information for accurate 3D regression is load-bearing, but the manuscript provides neither the probe's training objective details nor any validation against ground-truth 3D trajectories on LIBERO-plus. Without these, it is impossible to determine whether the reported gain stems from the probe or from other components.
minor comments (2)
- [§3.2] Notation for the Hungarian matching step and the CBF safe-set definition should be introduced with explicit equations rather than prose descriptions only.
- [§4] The LIBERO-plus benchmark description is referenced but not summarized; a brief table of task categories and perturbation types would improve reproducibility.
Simulated Author's Rebuttal
We thank the referee for the constructive comments, which identify gaps in the empirical validation of the probe component. We address each point below and will incorporate the requested details and experiments into the revised manuscript.
read point-by-point responses
-
Referee: [Abstract, §4] Abstract and §4 (Results): The 4.5 pp success-rate gain is presented as direct evidence that the probe-guided recovery works, yet no quantitative probe metrics are supplied (position RMSE, localization success rate, or ablation that removes the probe while keeping the state machine and CBF). Because the probe is the sole source of 3D object state for the downstream kinematic detector and CBF, the absence of these numbers leaves the central claim untestable.
Authors: We agree that quantitative probe metrics and a targeted ablation are necessary to substantiate the probe's contribution. In the revision we will add position RMSE and localization success rate for the probe on LIBERO-plus, together with an ablation that retains the kinematic state machine and CBF while removing the probe, thereby isolating its effect on the reported success-rate improvement. revision: yes
-
Referee: [§3.1] §3.1 (Probe architecture): The claim that intermediate VLA hidden states contain sufficient information for accurate 3D regression is load-bearing, but the manuscript provides neither the probe's training objective details nor any validation against ground-truth 3D trajectories on LIBERO-plus. Without these, it is impossible to determine whether the reported gain stems from the probe or from other components.
Authors: We will expand §3.1 to specify the probe's training objective, including the regression loss and any auxiliary terms, and will report validation results that compare the probe's predicted 3D positions against ground-truth trajectories available in the LIBERO-plus benchmark. revision: yes
Circularity Check
No circularity; empirical framework evaluated on external benchmark
full rationale
The paper introduces PROBEACT as a training-free runtime intervention with three components (hidden-state probe, kinematic state machine, CBF filter) and reports success-rate gains on the LIBERO-plus benchmark (69.6% to 74.1%). No equations, derivations, or parameter-fitting steps are described that reduce a claimed prediction to its own inputs by construction. Performance numbers are presented as direct empirical measurements rather than outputs forced by self-definition or self-citation chains. The central claims rest on observable benchmark results, not on any load-bearing self-referential step.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
M. J. Kim, K. Pertsch, S. Karamcheti, T. Xiao, A. Balakrishna, S. Nair, R. Rafailov, E. Foster, G. Lam, P. Sanketi, et al. Openvla: An open-source vision-language-action model.arXiv preprint arXiv:2406.09246, 2024
Pith/arXiv arXiv 2024
-
[2]
M. J. Kim, C. Finn, and P. Liang. Fine-tuning vision-language-action models: Optimizing speed and success.arXiv preprint arXiv:2502.19645, 2025
Pith/arXiv arXiv 2025
-
[3]
K. Black, N. Brown, D. Driess, A. Esmail, M. Equi, C. Finn, N. Fusai, L. Groom, K. Hausman, B. Ichter, et al. pi0: A vision-language-action flow model for general robot control.arXiv preprint arXiv:2410.24164, 2024
Pith/arXiv arXiv 2024
-
[4]
P. Intelligence, K. Black, N. Brown, J. Darpinian, K. Dhabalia, D. Driess, A. Esmail, M. Equi, C. Finn, N. Fusai, et al. pi0.5: a vision-language-action model with open-world generalization. arXiv preprint arXiv:2504.16054, 2025
Pith/arXiv arXiv 2025
-
[5]
S. Fei, S. Wang, J. Shi, Z. Dai, J. Cai, P. Qian, L. Ji, X. He, S. Zhang, Z. Fei, et al. Libero-plus: In-depth robustness analysis of vision-language-action models.arXiv preprint arXiv:2510.13626, 2025
Pith/arXiv arXiv 2025
-
[6]
X. Zhou, Y . Xu, G. Tie, Y . Chen, G. Zhang, D. Chu, P. Zhou, and L. Sun. Libero-pro: To- wards robust and fair evaluation of vision-language-action models beyond memorization.arXiv preprint arXiv:2510.03827, 2025
Pith/arXiv arXiv 2025
-
[7]
D. Goel, Y . Wang, T. Wu, G. Qiao, P. Piliptchak, D. Held, and Z. Erickson. Geometric red- teaming for robotic manipulation. InConference on Robot Learning, pages 41–67. PMLR, 2025
2025
-
[8]
X. Zeng, X. Zhou, Y . Li, J. Shi, T. Li, L. Chen, L. Ren, and Y .-L. Li. Diagnose, correct, and learn from manipulation failures via visual symbols.arXiv preprint arXiv:2512.02787, 2025
arXiv 2025
-
[9]
J. Yang, Y . Chen, Y . Xu, P. Li, X. Wu, Z. Wen, B. Fang, T. Yu, Z. Zhang, Y . Li, et al. Uaor: Uncertainty-aware observation reinjection for vision-language-action models.arXiv preprint arXiv:2602.18020, 2026
Pith/arXiv arXiv 2026
-
[10]
Y . Fang, Y . Feng, D. Jing, J. Liu, Y . Yang, Z. Wei, D. Szafir, and M. Ding. When vision overrides language: Evaluating and mitigating counterfactual failures in vlas.arXiv preprint arXiv:2602.17659, 2026
arXiv 2026
- [11]
-
[12]
A. Shah, W. Chen, A. Godbole, F. Mora, S. A. Seshia, and S. Levine. Learning affordances at inference-time for vision-language-action models.arXiv preprint arXiv:2510.19752, 2025
arXiv 2025
-
[13]
Z. Lin, J. Duan, H. Fang, D. Fox, R. Krishna, C. Tan, and B. Wen. Failsafe: Reasoning and recovery from failures in vision-language-action models.arXiv preprint arXiv:2510.01642, 2025
arXiv 2025
-
[14]
C. Ma, G. Yang, K. Lu, S. Xu, B. Byrne, N. Trigoni, and A. Markham. Cyclevla: Proactive self-correcting vision-language-action models via subtask backtracking and minimum bayes risk decoding.arXiv preprint arXiv:2601.02295, 2026
arXiv 2026
-
[15]
H. W. Kuhn. The hungarian method for the assignment problem.Naval research logistics quarterly, 2(1-2):83–97, 1955
1955
-
[16]
A. D. Ames, S. Coogan, M. Egerstedt, G. Notomista, K. Sreenath, and P. Tabuada. Control barrier functions: Theory and applications. In2019 18th European control conference (ECC), pages 3420–3431. Ieee, 2019. 9
2019
-
[17]
Zitkovich, T
B. Zitkovich, T. Yu, S. Xu, P. Xu, T. Xiao, F. Xia, J. Wu, P. Wohlhart, S. Welker, A. Wahid, et al. Rt-2: Vision-language-action models transfer web knowledge to robotic control. In Conference on Robot Learning, pages 2165–2183. PMLR, 2023
2023
-
[18]
O. M. Team, D. Ghosh, H. Walke, K. Pertsch, K. Black, O. Mees, S. Dasari, J. Hejna, T. Kreiman, C. Xu, et al. Octo: An open-source generalist robot policy.arXiv preprint arXiv:2405.12213, 2024
Pith/arXiv arXiv 2024
-
[19]
G. Alain and Y . Bengio. Understanding intermediate layers using linear classifier probes.arXiv preprint arXiv:1610.01644, 2016
Pith/arXiv arXiv 2016
-
[20]
Belinkov
Y . Belinkov. Probing classifiers: Promises, shortcomings, and advances.Computational Lin- guistics, 48(1):207–219, 2022
2022
-
[21]
H. Lu, H. Li, P. S. Shahani, S. Herbers, and M. Scheutz. Probing a vision-language-action model for symbolic states and integration into a cognitive architecture. In2025 IEEE Interna- tional Conference on AI and Data Analytics (ICAD), pages 1–8. IEEE, 2025
2025
-
[22]
El Banani, A
M. El Banani, A. Raj, K.-K. Maninis, A. Kar, Y . Li, M. Rubinstein, D. Sun, L. Guibas, J. John- son, and V . Jampani. Probing the 3d awareness of visual foundation models. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 21795–21806, 2024
2024
-
[23]
A. D. Ames, X. Xu, J. W. Grizzle, and P. Tabuada. Control barrier function based quadratic programs for safety critical systems.IEEE Transactions on Automatic Control, 62(8):3861– 3876, 2016
2016
-
[24]
Marley, R
M. Marley, R. Skjetne, E. Basso, and A. R. Teel. Maneuvering with safety guarantees using control barrier functions.IFAC-PapersOnLine, 54(16):370–377, 2021
2021
-
[25]
D. Kim, S. Yang, W. Zou, B. Shuai, D. Zhang, F. Zhang, C. Liu, and S. E. Li. Control safety function for explicit safety-critical control of autonomous vehicles. In2024 IEEE Intelligent Vehicles Symposium (IV), pages 843–850. IEEE, 2024
2024
-
[26]
X. Shen, E. L. Zhu, Y . R. St¨urz, and F. Borrelli. Collision avoidance in tightly-constrained en- vironments without coordination: A hierarchical control approach. In2021 IEEE International Conference on Robotics and Automation (ICRA), pages 2674–2680. IEEE, 2021
2021
-
[27]
T. Pan, A. M. Wells, R. Shome, and L. E. Kavraki. Failure is an option: Task and motion plan- ning with failing executions. In2022 International Conference on Robotics and Automation (ICRA), pages 1947–1953. IEEE, 2022
1947
-
[28]
Colledanchise and P
M. Colledanchise and P. ¨Ogren. How behavior trees modularize hybrid control systems and generalize sequential behavior compositions, the subsumption architecture, and decision trees. IEEE Transactions on robotics, 33(2):372–389, 2016
2016
-
[29]
Thananjeyan, A
B. Thananjeyan, A. Balakrishna, S. Nair, M. Luo, K. Srinivasan, M. Hwang, J. E. Gonzalez, J. Ibarz, C. Finn, and K. Goldberg. Recovery rl: Safe reinforcement learning with learned recovery zones.IEEE Robotics and Automation Letters, 6(3):4915–4922, 2021. 10
2021
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.