An analysis of sensor selection for fruit picking with suction-based grippers
Pith reviewed 2026-05-08 02:31 UTC · model grok-4.3
The pith
Multimodal sensors allow over 90% accurate detection of apple pick success and failures in real orchards
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors show through orchard experiments that phase-dependent analysis of sensors in a suction gripper enables Random Forest and Multilayer Perceptron classifiers to detect successful picks and impending failures with over 90% accuracy, predicting events within 0.09 s of human ground truth.
What carries the argument
Phase-dependent multimodal sensor evaluation in a compliant suction gripper for pick state classification using pressure, force, and inertial data.
Load-bearing premise
The sensor informativeness and classifier accuracies will remain consistent under different orchard conditions, fruit sizes, stem strengths, and with gripper wear.
What would settle it
Collecting new data from a different orchard or with varied fruit and observing classifier accuracy significantly below 90% would falsify the reliability of the identified sensor sets.
Figures
read the original abstract
Robotic fruit harvesting often fails to reliably detect whether a fruit has been successfully picked, limiting efficiency and increasing crop damage. This problem is difficult due to compliant fruit and grippers, variable stem attachment, and occlusions in orchard environments. Prior work has explored vision-based perception and multi-sensor learning approaches for pick state estimation. However, minimal sensor sets and phase-dependent sensing strategies for accurate pick and slip detection remain largely unexplored. In this work, we design and evaluate a multimodal sensing suite integrated into a compliant suction-based apple gripper. Our approach is unique because it identifies which sensors are most informative at different phases of the pick, enabling predictive detection of failures before they occur. The contributions of this paper are a phase-dependent evaluation of multimodal sensors and the identification of minimal sensor sets for reliable pick state classification. Experiments in a real apple orchard show that Random Forest and Multilayer Perceptron classifiers detect successful picks and impending failures with over 90% accuracy, and Random Forest predicts pick/slip events within 0.09 s of human-annotated ground truth.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript designs and evaluates a multimodal sensing suite integrated into a compliant suction-based gripper for robotic apple picking. It performs a phase-dependent analysis to determine the most informative sensors during different stages of the pick operation, identifies minimal sensor subsets for reliable state classification, and reports orchard experiments in which Random Forest and Multilayer Perceptron classifiers achieve over 90% accuracy in detecting successful picks and impending failures, with the Random Forest also predicting pick/slip event times within 0.09 s of human-annotated ground truth.
Significance. If the performance claims hold under broader conditions, the work offers practical value for increasing the reliability of robotic fruit harvesting by enabling predictive detection of pick failures before they damage fruit or stems. The phase-dependent sensor ranking and minimal-set identification provide actionable guidance for hardware design in compliant manipulation. Real-world orchard data collection is a strength, as is the explicit focus on sensor selection rather than end-to-end vision-only approaches.
major comments (2)
- [Section 4 and Section 5] Section 4 (Experimental Setup) and Section 5 (Results): The central accuracy (>90%) and latency (0.09 s) figures are computed exclusively against human-annotated ground truth for pick success, slip onset, and impending failure. No inter-annotator agreement statistics, number of annotators, or objective corroboration (e.g., force-threshold slip detection or high-speed video verification) are reported. Because label noise or systematic bias would propagate directly into both the classifier metrics and the phase-dependent informativeness rankings, this omission is load-bearing for the primary claims.
- [Section 5] Section 5 (Results): The manuscript states that Random Forest and MLP classifiers were trained and evaluated but does not specify the cross-validation procedure, train/test split strategy, or whether temporal leakage was prevented when using time-series sensor data. Without these details it is impossible to assess whether the reported accuracies reflect genuine generalization or overfitting to orchard-specific conditions.
minor comments (2)
- [Abstract and Section 3] The abstract and Section 3 would benefit from a concise table listing the exact sensor models, sampling rates, and mounting locations on the gripper.
- [Figure captions] Figure captions for the phase-dependent sensor ranking plots should explicitly state the number of trials and fruit instances contributing to each phase.
Simulated Author's Rebuttal
We thank the referee for their constructive and detailed feedback, which highlights important aspects of ground truth reliability and experimental methodology in our work on multimodal sensor selection for suction-based apple picking. We address each major comment below and indicate the revisions planned for the next version of the manuscript.
read point-by-point responses
-
Referee: [Section 4 and Section 5] Section 4 (Experimental Setup) and Section 5 (Results): The central accuracy (>90%) and latency (0.09 s) figures are computed exclusively against human-annotated ground truth for pick success, slip onset, and impending failure. No inter-annotator agreement statistics, number of annotators, or objective corroboration (e.g., force-threshold slip detection or high-speed video verification) are reported. Because label noise or systematic bias would propagate directly into both the classifier metrics and the phase-dependent informativeness rankings, this omission is load-bearing for the primary claims.
Authors: We agree that the quality and validation of the ground-truth labels are central to the strength of our accuracy and latency claims. The annotations were performed by a single expert annotator using synchronized high-resolution video and sensor traces, with explicit labeling criteria for successful picks, slip onset, and impending failure. In the revised manuscript we will expand Section 4 to fully document the annotation protocol, the number of annotators (one), and the labeling guidelines. We will also add an explicit discussion of this choice as a limitation. Inter-annotator agreement statistics and independent objective corroboration (high-speed video or separate force-threshold detection) were not collected in the original study; we will therefore note these as limitations rather than provide new data. revision: partial
-
Referee: [Section 5] Section 5 (Results): The manuscript states that Random Forest and MLP classifiers were trained and evaluated but does not specify the cross-validation procedure, train/test split strategy, or whether temporal leakage was prevented when using time-series sensor data. Without these details it is impossible to assess whether the reported accuracies reflect genuine generalization or overfitting to orchard-specific conditions.
Authors: We appreciate the referee pointing out this lack of detail, which is necessary for assessing generalization. In the revised Section 5 we will specify that a 5-fold stratified cross-validation was used, with folds constructed so that entire pick sequences (and therefore all time steps from a given grasp) remained within a single fold. An 80/20 train/test split was applied at the sequence level, drawing the test set exclusively from unseen trees and orchard sessions to avoid temporal leakage and orchard-specific overfitting. These methodological choices will be stated explicitly together with the reported accuracy and latency figures. revision: yes
- Inter-annotator agreement statistics and objective corroboration of ground-truth labels (e.g., high-speed video verification or independent force-threshold slip detection), as these were not collected during the original orchard experiments.
Circularity Check
No circularity: empirical ML evaluation on independent orchard data
full rationale
The paper reports an experimental study that collects multimodal sensor data during real apple orchard picks, trains standard off-the-shelf classifiers (Random Forest, MLP) on those data, and evaluates detection accuracy and event timing against separate human annotations. No derivation chain, equations, fitted parameters presented as predictions, or self-citation load-bearing steps appear in the described contributions or abstract. Performance claims rest on held-out experimental measurements rather than any reduction to the inputs by construction, so the work is self-contained.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Training and test samples are independent and identically distributed.
Reference graph
Works this paper leans on
-
[1]
J. Davidson, S. Bhusal, C. Mo, M. Karkee, and Q. Zhang, “Robotic manipulation for specialty crop harvesting: A review of manipulator and end-effector technologies,”Global Journal of Agricultural and Allied Sciences, vol. 2, pp. 25–41, 09 2020
work page 2020
-
[2]
Design and control of an apple harvesting robot,
Z. De-An, L. Jidong, J. Wei, Z. Ying, and C. Yu, “Design and control of an apple harvesting robot,”Biosystems Engineering, vol. 110, no. 2, pp. 112–122, 2011
work page 2011
-
[3]
Exploiting plant dynamics in robotic fruit localization,
J. Senden, L. Janssen, R. van der Kruk, H. Bruyninckx, and R. van de Molengraft, “Exploiting plant dynamics in robotic fruit localization,” Computers and Electronics in Agriculture, vol. 196, p. 106860, 2022
work page 2022
-
[4]
Predicting fruit-pick success using a grasp classifier trained on a physical proxy,
A. Velasquez, N. Swenson, M. Cravetz, C. Grimm, and J. R. Davidson, “Predicting fruit-pick success using a grasp classifier trained on a physical proxy,” in2022 IEEE/RSJ Int’l Conf on Intelligent Robots and Systems (IROS), 2022, pp. 9225–9231
work page 2022
-
[5]
An autonomous strawberry-harvesting robot: Design, development, integration, and field evaluation,
Y . Xiong, Y . Ge, L. Grimstad, and P. J. From, “An autonomous strawberry-harvesting robot: Design, development, integration, and field evaluation,”Journal of Field Robotics, vol. 37, no. 2, pp. 202– 224, 2020
work page 2020
-
[6]
M. Sytsma, B. M. van Marrewijk, T. Tielen, A. Vroegop, and J. Ruizendaal, “Design and evaluation of a robotic prototype for gerbera harvesting, performing actions at never-seen locations,”Com- puters and Electronics in Agriculture, vol. 228, p. 109671, 2025
work page 2025
-
[7]
Investigating sensors and methods in grasp state classification in agricultural manipulation,
B. Walt, J. Westphal, and G. Krishnan, “Investigating sensors and methods in grasp state classification in agricultural manipulation,” Smart Agricultural Technology, vol. 12, p. 101533, 2025
work page 2025
-
[8]
Towards intelligent fruit picking with in-hand sensing,
L. M. Dischinger, M. Cravetz, J. Dawes, C. V otzke, C. VanAtter, M. L. Johnston, C. M. Grimm, and J. R. Davidson, “Towards intelligent fruit picking with in-hand sensing,” in2021 IEEE/RSJ Int’l Conf on Intelligent Robots and Systems (IROS), 2021, pp. 3285–3291
work page 2021
-
[9]
Compact robotic gripper with tandem actuation for selective fruit harvesting,
A. Velasquez, C. Grimm, and J. R. Davidson, “Compact robotic gripper with tandem actuation for selective fruit harvesting,” 2024
work page 2024
-
[10]
Multi-sensor vacuum gripper for closed-loop grasp in robotic apple harvesting,
J. Hemming, N. van Damme, M. Sytsma, R. van de Ven, G. Kootstra, and J. R. Davidson, “Multi-sensor vacuum gripper for closed-loop grasp in robotic apple harvesting,”IFAC-PapersOnLine, vol. 59, pp. 315–320, 2025, 8th IFAC Conf on Sensing, Control and Automation Technologies for Agriculture (AGRICONTROL 2025)
work page 2025
-
[11]
Vacuum suction end-effector development for robotic harvesters of fresh market apples,
W. Hua, W. Zhang, Z. Zhang, X. Liu, M. Huang, C. Igathinathane, S. V ougioukas, C. K. Saha, N. Mustafa, D. S. Salama, Y . Zhang, and M. Zhang, “Vacuum suction end-effector development for robotic harvesters of fresh market apples,”Biosystems Engineering, vol. 249, pp. 28–40, 2025
work page 2025
-
[12]
Elec- tronic sensing combined with machine learning models for predicting soil nutrient content,
S. Liu, X. Chen, X. Xia, Y . Jin, G. Wang, H. Jia, and D. Huang, “Elec- tronic sensing combined with machine learning models for predicting soil nutrient content,”Computers and Electronics in Agriculture, vol. 221, p. 108947, 2024
work page 2024
-
[13]
Cnn-mlp-based configurable robotic arm for smart agriculture,
M. Li, F. Wu, F. Wang, T. Zou, M. Li, and X. Xiao, “Cnn-mlp-based configurable robotic arm for smart agriculture,”Agriculture, vol. 14, no. 9, 2024
work page 2024
-
[14]
C. Zheng, A. Abd-Elrahman, and V . Whitaker, “Remote sensing and machine learning in crop phenotyping and management, with an emphasis on applications in strawberry farming,”Remote Sensing, vol. 13, no. 3, 2021
work page 2021
-
[15]
Applications of machine learning and deep learning in agriculture: A comprehensive review,
M. Waqas, A. Naseem, U. W. Humphries, P. T. Hlaing, P. Dechpichai, and A. Wangwongchai, “Applications of machine learning and deep learning in agriculture: A comprehensive review,”Green Technologies and Sustainability, vol. 3, no. 3, p. 100199, 2025
work page 2025
-
[16]
Detecting apple abscission during robotic harvesting with force and pressure sensing,
E. Krueger, O. Gehrke, M. Cravetz, M. Sytsma, J. Hemming, and J. R. Davidson, “Detecting apple abscission during robotic harvesting with force and pressure sensing,”IFAC-PapersOnLine, vol. 59, no. 23, pp. 419–424, 2025, 8th IFAC Conf on Sensing, Control and Automation Technologies for Agriculture AGRICONTROL 2025
work page 2025
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.