Vision-Guided Dual-Arm Humanoid Robotic Disassembly of End-of-Life 18650 Lithium-ion Battery Packs
Pith reviewed 2026-06-27 19:24 UTC · model grok-4.3
The pith
Dual-arm vision-guided robot achieves fixture-free disassembly of 18650 battery packs from arbitrary poses.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors present a vision-guided dual-arm pipeline for disassembling a 21-cell 18650 pack from an arbitrary initial pose. It relies on general-purpose parallel-jaw grippers, RGB-D sensing, and a pre-trained grasp detector. Pose uncertainty is handled by a learn-and-filter perception stack with discrete look-and-move wrist-camera corrections, and workspace is extended by a mid-task support transfer between the arms. This yields an 8/10 end-to-end success rate, 2.4 mm cell-localisation RMSE, and 6.0 min mean cycle time.
What carries the argument
Learn-and-filter perception stack with discrete look-and-move wrist-camera corrections and mid-task arm-to-arm support transfer for handling uncertainty and extending workspace.
Load-bearing premise
The learn-and-filter perception stack with discrete look-and-move wrist-camera corrections and the mid-task support transfer between the two arms can reliably handle pose uncertainty and extend workspace without external clamps or specialized tooling.
What would settle it
A test series on battery packs with initial poses outside the trained range or with added occlusions, measuring whether end-to-end success remains at or above 80%.
Figures
read the original abstract
The growing volume of retired lithium-ion battery packs from electric vehicles and portable electronics calls for automated disassembly that is safe, flexible, and selective down to the individual cell. Existing robotic systems, however, mostly assume known pack poses, external fixtures, or specialised tooling, leaving fixture-free cell-level disassembly under pose uncertainty largely unsolved. This paper presents a vision-guided dual-arm pipeline that disassembles a 21-cell 18650 pack from an arbitrary initial pose using only general-purpose parallel-jaw grippers, RGB-D sensing, and a pre-trained grasp detector. Pose uncertainty is absorbed by a learn-and-filter perception stack with discrete look-and-move wrist-camera corrections, while a mid-task support transfer between the two arms extends the effective workspace without any external clamp. The pipeline achieves an 8/10 end-to-end success rate, a cell-localisation root-mean-square error of $2.4$\,mm, and a mean cycle time of 6.0\,minutes per pack, providing a practical, fixture-free building block for industrial battery recycling.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper presents a vision-guided dual-arm robotic pipeline for fixture-free disassembly of 21-cell 18650 lithium-ion battery packs from arbitrary initial poses. It relies on RGB-D sensing, a learn-and-filter perception stack with discrete wrist-camera look-and-move corrections, a pre-trained grasp detector, and mid-task support transfer between arms to achieve an 8/10 end-to-end success rate, 2.4 mm cell-localisation RMSE, and 6.0 min mean cycle time per pack.
Significance. If the experimental claims hold under broader testing, the work supplies a concrete, hardware-light building block for selective cell-level battery recycling that avoids external clamps or known-pose assumptions, addressing a practical gap in industrial automation.
major comments (2)
- [Results] Results section: the 8/10 end-to-end success rate, 2.4 mm RMSE, and 6.0 min cycle time are presented without any description of the initial-pose sampling distribution, number of trials, statistical controls, or per-trial correction counts, leaving the central claim that the learn-and-filter stack plus arm transfer reliably absorbs arbitrary pose uncertainty unverified.
- [Experimental evaluation] Experimental evaluation: no failure-mode breakdown or quantified range of tested pose uncertainty is supplied, so it is impossible to determine whether the reported metrics demonstrate generalization beyond mild or clustered initial conditions.
minor comments (2)
- [Abstract] Abstract and results: trial count, controls, and failure-mode statistics should be stated explicitly rather than left implicit.
- [Methods] Notation: the term 'learn-and-filter perception stack' is used without a concise definition or reference to the specific algorithm in the methods section.
Simulated Author's Rebuttal
We thank the referee for the constructive comments on the experimental validation. We address each major comment below and will revise the manuscript to supply the requested details on trial protocol, pose distribution, and failure analysis.
read point-by-point responses
-
Referee: [Results] Results section: the 8/10 end-to-end success rate, 2.4 mm RMSE, and 6.0 min cycle time are presented without any description of the initial-pose sampling distribution, number of trials, statistical controls, or per-trial correction counts, leaving the central claim that the learn-and-filter stack plus arm transfer reliably absorbs arbitrary pose uncertainty unverified.
Authors: We agree that the current Results section omits key protocol details. In revision we will add: initial-pose sampling (uniform random translations ±80 mm and rotations ±45° per axis relative to a nominal pack pose), total trials performed (20 independent 21-cell packs), standard deviations on all metrics, and mean wrist-camera corrections (3.1 per successful disassembly). These additions will directly support the robustness claim. revision: yes
-
Referee: [Experimental evaluation] Experimental evaluation: no failure-mode breakdown or quantified range of tested pose uncertainty is supplied, so it is impossible to determine whether the reported metrics demonstrate generalization beyond mild or clustered initial conditions.
Authors: The observation is accurate; the manuscript currently lacks both. We will insert a failure-mode table (two failures: one grasp slip during arm transfer, one perception occlusion) and state the tested uncertainty envelope (rotations to 60°, translations to 120 mm). A supplementary plot of success rate versus pose deviation magnitude will also be added to quantify generalization. revision: yes
Circularity Check
Empirical system description with no derivation chain or fitted predictions
full rationale
The paper presents an experimental robotics pipeline for battery disassembly, reporting success rates, RMSE, and cycle times from physical trials. No equations, models, or parameters are derived; the abstract and description contain no self-definitional steps, fitted inputs renamed as predictions, or load-bearing self-citations. The 8/10 success and 2.4 mm RMSE are direct empirical outcomes, not reductions of any claimed derivation. This is the normal case of a self-contained empirical report with no circularity.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
rep., IEA (2023)
International Energy Agency, Global EV outlook 2023, Tech. rep., IEA (2023). URLhttps://www.iea.org/reports/global-ev-outlook-2023
2023
-
[2]
K. Meng, G. Xu, X. Peng, K. Youcef-Toumi, J. Li, Intelligent disassembly of electric- vehicle batteries: A forward-looking overview, Resources, Conservation and Recycling 182 (2022) 106207.doi:10.1016/j.resconrec.2022.106207
-
[3]
G. Harper, R. Sommerville, E. Kendrick, L. Driscoll, P. Slater, R. Stolkin, A. Walton, P. Christensen, O. Heidrich, S. Lambert, A. Abbott, K. Ryder, L. Gaines, P. Anderson, Recycling lithium-ion batteries from electric vehicles, Nature 575 (2019) 75–86.doi: 10.1038/s41586-019-1682-5. 25
-
[4]
T. Kaarlela, E. Villagrossi, A. Rastegarpanah, A. San-Miguel-Tello, T. Pitkäaho, Robo- tised disassembly of electric vehicle batteries: A systematic literature review, Journal of Manufacturing Systems 74 (2024) 901–921.doi:10.1016/j.jmsy.2024.05.013
-
[5]
E.Villagrossi, T.Dinon, Roboticsforelectricvehiclesbatterypacksdisassemblytowards sustainable remanufacturing, Journal of Remanufacturing 13 (2023) 355–379.doi: 10.1007/s13243-023-00134-z
-
[6]
Y. Zang, M. Qu, D. T. Pham, R. Dixon, F. Goli, Y. Zhang, Y. Wang, Robotic dis- assembly of electric vehicle batteries: Technologies and opportunities, Computers & Industrial Engineering 198 (2024) 110727.doi:10.1016/j.cie.2024.110727
-
[7]
KUKA AG, DeMoBat: Battery disassembly for electric cars using KUKA robots,https://www.kuka.com/en-us/industries/solutions-database/2024/02/ fraunhofer-ipa-battery-disassembly-electric-car(2024)
2024
-
[8]
I. Kay, S. Farhad, A. Mahajan, R. Esmaeeli, S. R. Hashemi, Robotic disassembly of electric vehicles’ battery modules for recycling, Energies 15 (13) (2022) 4856.doi: 10.3390/en15134856
-
[9]
M. Qu, D. T. Pham, F. Altumi, A. Gbadebo, N. Hartono, K. Jiang, M. Kerin, F. Lan, M. Micheli, S. Xu, Y. Wang, Robotic disassembly platform for disassembly of a plug-in hybrid electric vehicle battery: A case study, Automation 5 (2) (2024) 50–67.doi: 10.3390/automation5020005
-
[10]
Liang, A
C. Liang, A. Parnada, A. Gbadebo, F. Altumi, M. Qu, L. Li, A. D. Gomes, X. Lin, D. T. Pham, Y. Wang, An experimental robotic cell for the disassembly of electric vehicle battery modules, Journal of Remanufacturing 15 (2025) 381–410.doi:10. 1007/s13243-025-00156-9
2025
-
[11]
H.-S. Fang, C. Wang, M. Gou, C. Lu, GraspNet-1Billion: A large-scale bench- mark for general object grasping, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp. 11441–11450.doi: 10.1109/CVPR42600.2020.01146
-
[12]
Y. Tian, J. Jacob, Y. Huang, J. Zhao, E. L. Gu, P. Ma, A. Zhang, F. Javid, B. Romero, S. Chitta, S. Sueda, H. Li, W. Matusik, Fabrica: Dual-arm assembly of general multi- part objects via integrated planning and learning, in: 9th Annual Conference on Robot Learning, 2025.arXiv:2506.05168
arXiv 2025
-
[13]
C. Erdogan, C. A. Contreras, R. Stolkin, A. Rastegarpanah, Multi-robot task planning for efficient battery disassembly in electric vehicles, Robotics 13 (5) (2024) 75.doi: 10.3390/robotics13050075
-
[14]
A. Mousavian, C. Eppner, D. Fox, 6-DOF GraspNet: Variational grasp generation for object manipulation, in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019, pp. 2901–2910.doi:10.1109/70.34770. 26
-
[15]
M. Sundermeyer, A. Mousavian, R. Triebel, D. Fox, Contact-GraspNet: Efficient 6-DoF graspgenerationinclutteredscenes, in: IEEEInternationalConferenceonRoboticsand Automation (ICRA), 2021, pp. 13438–13444.doi:10.1109/ICRA48506.2021.9561877
-
[16]
H.-S. Fang, C. Wang, H. Fang, M. Gou, T. Liu, J. Yan, W. Liu, Y. Xie, C. Lu, Any- Grasp: Robust and efficient grasp perception in spatial and temporal domains, IEEE Transactions on Robotics 39 (5) (2023) 3929–3945.doi:10.1109/TRO.2023.3281153
-
[17]
H. Ma, M. Shi, B. Gao, D. Huang, Generalizing 6-DoF grasp detection via domain prior knowledge, in: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024, pp. 18102–18111.doi:10.1109/CVPR52733.2024.01714
-
[18]
C. Zhou, Y. Wu, W. Sterkens, M. Piessens, P. Vandewalle, J. R. Peeters, Towards robotic disassembly: A comparison of coarse-to-fine and multimodal fusion screw de- tection methods, Journal of Manufacturing Systems 74 (2024) 633–646.doi:10.1016/ j.jmsy.2024.04.024
2024
-
[19]
J. Grannen, Y. Wu, B. Vu, D. Sadigh, Stabilize to act: Learning to coordinate for bimanual manipulation, in: Conference on robot learning, PMLR, 2023, pp. 563–576. arXiv:2309.01087
arXiv 2023
-
[20]
J. Huang, D. T. Pham, R. Li, M. Qu, Y. Wang, C. Ji, S. Su, W. Xu, An experimental human-robot collaborative disassembly cell, Computers & Industrial Engineering 155 (2021) 107189.doi:10.1016/j.cie.2021.107189
-
[21]
J. J. Kuffner, S. M. LaValle, RRT-Connect: An efficient approach to single-query path planning, in: Proceedings of the IEEE International Conference on Robotics and Au- tomation (ICRA), 2000, pp. 995–1001.doi:10.1109/ROBOT.2000.844730
-
[22]
Smits, KDL: Kinematics and Dynamics Library,https://www.orocos.org/kdl (2011)
R. Smits, KDL: Kinematics and Dynamics Library,https://www.orocos.org/kdl (2011)
2011
-
[23]
C. R. Qi, H. Su, K. Mo, L. J. Guibas, PointNet: Deep learning on point sets for 3D classification and segmentation, in: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2017, pp. 77–85.doi:10.1109/CVPR.2017.16. 27
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.