ReLeaf: Benchmarking Leaf Segmentation across Domains and Species
Pith reviewed 2026-05-07 17:58 UTC · model grok-4.3
The pith
A YOLO26 model trained on four leaf segmentation datasets reaches 83.9% mean mAP50-95 on their test sets but only 40.2% on a new 23-species benchmark, revealing substantial cross-domain generalization gaps.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
A model trained on all four existing datasets achieves a mean mAP50-95 of 83.9% across their corresponding test sets and 40.2% on our new benchmark, demonstrating improved generalization and highlighting the need for diverse leaf-segmentation datasets in robust precision agriculture.
Load-bearing premise
The semi-automatic annotation process for the new 23-species dataset produces sufficiently accurate ground-truth masks, and the four selected public datasets adequately represent the range of species and imaging conditions encountered in real precision agriculture.
Figures
read the original abstract
Rising global food demand and growing climate pressure increase the need for sustainable, precise agricultural practices. Automated, individualized plant treatment relies on fine-grained visual analysis, yet leaf-level segmentation remains underexplored despite its value for assessing crop health, growth dynamics, yield potential and localized stress symptoms. Progress is limited by a lack of dedicated datasets, especially regarding species coverage, and by the absence of systematic evaluations of modern instance-segmentation architectures for this task. We address these gaps by surveying current data and identifying four suitable, publicly available leaf-segmentation datasets. Using them, we compare one-stage, two-stage and Transformer-based detectors and identify a YOLO26 model configuration to provide the best trade-off for real-world precision-agriculture tasks. Extensive cross-domain generalization experiments reveal substantial performance drops across plant species and recording setups, especially for models trained solely on laboratory data. To strengthen data availability, we introduce a new benchmark dataset with leaf-level masks for 23 plant species, created via semi-automatic annotation of selected CropAndWeed images. A model trained on all four existing datasets achieves a mean mAP50-95 of 83.9% across their corresponding test sets and 40.2% on our new benchmark, demonstrating improved generalization and highlighting the need for diverse leaf-segmentation datasets in robust precision agriculture.
Editorial analysis
A structured set of objections, weighed in public.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Leaf counting with deep convolutional and deconvolutional networks
Shubhra Aich and Ian Stavness. Leaf counting with deep convolutional and deconvolutional networks. InProceed- ings of the IEEE international conference on computer vision workshops, pages 2080–2089, 2017. 2
work page 2080
-
[2]
Jonathan Bell and HM Dee. Aberystwyth leaf evaluation dataset: a plant growth visible light image dataset of ara- bidopsis thaliana.Zenodo, 2016. 3, 4
work page 2016
-
[3]
Enrico Bellocchio, Gabriele Costante, Silvia Cascianelli, Mario Luca Fravolini, and Paolo Valigi. Combining domain adaptation and spatial consistency for unseen fruits counting: a quasi-unsupervised approach.IEEE Robotics and Automa- tion Letters, 5(2):1079–1086, 2020. 2
work page 2020
-
[4]
Yolact: Real-time instance segmentation
Daniel Bolya, Chong Zhou, Fanyi Xiao, and Yong Jae Lee. Yolact: Real-time instance segmentation. InProceedings of the IEEE/CVF international conference on computer vision, pages 9157–9166, 2019. 2
work page 2019
-
[5]
Ozgul Calicioglu, Alessandro Flammini, Stefania Bracco, Lorenzo Bell `u, and Ralph Sims. The future challenges of food and agriculture: An integrated analysis of trends and solutions.Sustainability, 11(1):222, 2019. 1
work page 2019
-
[6]
End-to- end object detection with transformers
Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, and Sergey Zagoruyko. End-to- end object detection with transformers. InEuropean confer- ence on computer vision, pages 213–229. Springer, 2020. 2
work page 2020
-
[7]
Ekin Celikkan, Timo Kunzmann, Yertay Yeskaliyev, Sibylle Itzerott, Nadja Klein, and Martin Herold. Weedsgalore: A multispectral and multitemporal uav-based dataset for crop and weed segmentation in agricultural maize fields. InPro- ceedings of the Winter Conference on Applications of Com- puter Vision (WACV), 2025. 3
work page 2025
-
[8]
Computer vision annotation tool (cvat).https://github.com/opencv/cvat, 2026
CV AT.ai Corporation. Computer vision annotation tool (cvat).https://github.com/opencv/cvat, 2026. 4
work page 2026
-
[9]
Chad DeChant, Tyr Wiesner-Hanks, Siyuan Chen, Ethan L Stewart, Jason Yosinski, Michael A Gore, Rebecca J Nelson, and Hod Lipson. Automated identification of northern leaf blight-infected maize plants from field imagery using deep learning.Phytopathology, 107(11):1426–1432, 2017. 1
work page 2017
-
[10]
Imagenet: A large-scale hierarchical image database
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. Imagenet: A large-scale hierarchical image database. In2009 IEEE conference on computer vision and pattern recognition, pages 248–255. IEEE, 2009. 2
work page 2009
-
[11]
An overview of global leaf area index (lai): Methods, products, validation, and applications
Hongliang Fang, Frederic Baret, Stephen Plummer, and Gabriela Schaepman-Strub. An overview of global leaf area index (lai): Methods, products, validation, and applications. Reviews of Geophysics, 57(3):739–799, 2019. 1
work page 2019
-
[12]
Mulham Fawakherji, Ali Youssef, Domenico Bloisi, Alberto Pretto, and Daniele Nardi. Crop and weeds classification for precision agriculture using context-independent pixel-wise segmentation. In2019 third IEEE international conference on robotic computing (IRC), pages 146–152. IEEE, 2019. 2
work page 2019
-
[13]
Learning to count leaves in rosette plants.XXX,
Mario Valerio Giuffrida, Massimo Minervini, and Sotirios A Tsaftaris. Learning to count leaves in rosette plants.XXX,
-
[14]
Petr Hurtik, V ojtech Molek, Jan Hula, Marek Vajgl, Pavel Vlasanek, and Tomas Nejezchleba. Poly-yolo: higher speed, more precise detection and instance segmentation for yolov3.Neural Computing and Applications, 34(10):8275– 8290, 2022. 2
work page 2022
-
[15]
Kenta Itakura and Fumiki Hosoi. Automatic leaf segmenta- tion for estimating leaf area and leaf inclination angle in 3d plant images.Sensors, 18(10):3576, 2018. 1
work page 2018
-
[16]
Ultralyt- ics yolo.https://github.com/ultralytics/ ultralytics, 2024
Glenn Jocher, Ayush Chaurasia, and Jing Qiu. Ultralyt- ics yolo.https://github.com/ultralytics/ ultralytics, 2024. Accessed: 2025-10-11. 2, 5, 1
work page 2024
-
[17]
Zohaib Khan, Yue Shen, and Hui Liu. Objectdetection in agriculture: A comprehensive review of methods, applica- tions, challenges, and future directions.Agriculture, 15(13): 1351, 2025. 3
work page 2025
-
[18]
Jana Kierdorf, Laura Verena Junker-Frohn, Mike De- laney, Mariele Donoso Olave, Andreas Burkart, Hannah Jaenicke, Onno Muller, Uwe Rascher, and Ribana Roscher. Growliflower: An image time-series dataset for growth anal- ysis of cauliflower.Journal of Field Robotics, 40(2):173– 192, 2023. 3, 4, 8, 2
work page 2023
-
[19]
Data augmentation for leaf segmentation and counting tasks in rosette plants
Dmitry Kuznichov, Alon Zvirin, Yaron Honen, and Ron Kimmel. Data augmentation for leaf segmentation and counting tasks in rosette plants. InCVPR Workshops, pages 0–0, 2019. 2
work page 2019
-
[20]
Sweet pepper pose detection and grasping for automated crop harvesting
Christopher Lehnert, Inkyu Sa, Christopher McCool, Ben Upcroft, and Tristan Perez. Sweet pepper pose detection and grasping for automated crop harvesting. In2016 IEEE ICRA, pages 2428–2434. IEEE, 2016. 1
work page 2016
-
[21]
Lian Lei, Qiliang Yang, Ling Yang, Tao Shen, Ruoxi Wang, and Chengbiao Fu. Deep learning implementation of image segmentation in agricultural applications: A comprehensive review.Artificial Intelligence Review, 57(6):149, 2024. 3
work page 2024
-
[22]
Rui Li, Shunyi Zheng, Ce Zhang, Chenxi Duan, Libo Wang, and Peter M Atkinson. Abcnet: Attentive bilateral con- textual network for efficient semantic segmentation of fine- resolution remotely sensed imagery.ISPRS journal of pho- togrammetry and remote sensing, 181:84–98, 2021. 2
work page 2021
-
[23]
Microsoft coco: Common objects in context
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Doll´ar, and C Lawrence Zitnick. Microsoft coco: Common objects in context. In European conference on computer vision, pages 740–755. Springer, 2014. 2, 5
work page 2014
-
[24]
Rt-detrv2: Improved base- line with bag-of-freebies for real-time detection transformer
Wenyu Lv, Yian Zhao, Qinyao Chang, Kui Huang, Guanzhong Wang, and Yi Liu. Rt-detrv2: Improved base- line with bag-of-freebies for real-time detection transformer. arXiv preprint arXiv:2407.17140, 2024. 1
-
[25]
DINOv2: Learning Robust Visual Features without Supervision
Maxime Oquab, Timoth ´ee Darcet, Th ´eo Moutakanni, Huy V o, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, et al. Dinov2: Learning robust visual features without supervision. arXiv preprint arXiv:2304.07193, 2023. 5
work page internal anchor Pith review arXiv 2023
-
[26]
End-to-end instance seg- mentation with recurrent attention
Mengye Ren and Richard S Zemel. End-to-end instance seg- mentation with recurrent attention. InCVPR, pages 6656– 6664, 2017. 2
work page 2017
-
[27]
com/en/products/pd/equipment/scanners/ fi-8950-production-scanner
Isaac Robinson, Peter Robicheaux, Matvei Popov, Deva Ramanan, and Neehar Peri. Rf-detr: neural architecture search for real-time detection transformers.arXiv preprint arXiv:2511.09554, 2025. 2, 5 9
-
[28]
Gianmarco Roggiolani, Matteo Sodano, Tiziano Guadagnino, Federico Magistri, Jens Behley, and Cyrill Stachniss. Hierarchical approach for joint semantic, plant instance, and leaf instance segmentation in the agricultural domain.arXiv preprint arXiv:2210.07879, 2022. 2
-
[29]
U-net: Convolutional networks for biomedical image segmentation
Olaf Ronneberger, Philipp Fischer, and Thomas Brox. U-net: Convolutional networks for biomedical image segmentation. InMICCAI, pages 234–241. Springer, 2015. 2
work page 2015
-
[30]
Plant leaf disease detection and classification based on cnn with lvq algorithm
Melike Sardogan, Adem Tuncer, and Yunus Ozen. Plant leaf disease detection and classification based on cnn with lvq algorithm. In2018 UBMK, pages 382–385. IEEE, 2018. 1
work page 2018
-
[31]
Annotated image datasets of rosette plants
Hanno Scharr, Massimo Minervini, Andreas Fischbach, and Sotirios A Tsaftaris. Annotated image datasets of rosette plants. InEuropean conference on computer vision, pages 6–12. Suisse Z¨urich, 2014. 3, 4, 8, 2
work page 2014
-
[32]
Detection of plant leaf diseases using image segmentation and soft computing techniques
Vijai Singh and Ak K Misra. Detection of plant leaf diseases using image segmentation and soft computing techniques. Information Processing in Agriculture, 4(1):41–49, 2017. 1
work page 2017
-
[33]
The cropandweed dataset: A multi-modal learning approach for efficient crop and weed manipulation
Daniel Steininger, Andreas Trondl, Gerardus Croonen, Julia Simon, and Verena Widhalm. The cropandweed dataset: A multi-modal learning approach for efficient crop and weed manipulation. InProceedings of the IEEE/CVF Winter Con- ference on Applications of Computer Vision, pages 3729– 3738, 2023. 1, 2, 3, 4
work page 2023
-
[34]
Weed growth stage estimator using deep convolutional neural networks.Sensors, 18(5):1580,
Nima Teimouri, Mads Dyrmann, Per Rydahl Nielsen, Solvejg Kopp Mathiassen, Gayle J Somerville, and Ras- mus Nyholm Jørgensen. Weed growth stage estimator using deep convolutional neural networks.Sensors, 18(5):1580,
-
[35]
An easy-to-setup 3d phenotyping platform for komatsuna dataset
Hideaki Uchiyama, Shunsuke Sakurai, Masashi Mishima, Daisaku Arita, Takashi Okayasu, Atsushi Shimada, and Rin- ichiro Taniguchi. An easy-to-setup 3d phenotyping platform for komatsuna dataset. InProceedings of the IEEE inter- national conference on computer vision workshops, pages 2038–2045, 2017. 3, 4, 8, 2
work page 2038
-
[36]
Jehan-Antoine Vayssade, Gawain Jones, Christelle G ´ee, and Jean-No¨el Paoli. Pixelwise instance segmentation of leaves in dense foliage.Computers and Electronics in Agriculture, 195:106797, 2022. 3, 4
work page 2022
-
[37]
Mingjie Wang and Fuzhong Li. Real-time accurate apple detection based on improved yolov8n in complex natural en- vironments.Plants, 14(3):365, 2025. 2
work page 2025
-
[38]
Leaf segmentation using modified yolov8-seg models.Life, 14(6):780, 2024
Peng Wang, Hong Deng, Jiaxu Guo, Siqi Ji, Dan Meng, Jun Bao, and Peng Zuo. Leaf segmentation using modified yolov8-seg models.Life, 14(6):780, 2024. 2
work page 2024
-
[39]
Yunlong Wang and Zhiyong Zhang. Segment any leaf 3d: A zero-shot 3d leaf instance segmentation method based on multi-view images.Sensors, 25(2):526, 2025. 2
work page 2025
-
[40]
In-field phenotyping based on crop leaf and plant instance segmentation
Jan Weyler, Federico Magistri, Peter Seitz, Jens Behley, and Cyrill Stachniss. In-field phenotyping based on crop leaf and plant instance segmentation. InProceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pages 2725–2734, 2022. 2, 3
work page 2022
-
[41]
Jan Weyler, Federico Magistri, Elias Marks, Yue Linn Chong, Matteo Sodano, Gianmarco Roggiolani, Nived Che- brolu, Cyrill Stachniss, and Jens Behley. Phenobench: A large dataset and benchmarks for semantic image interpreta- tion in the agricultural domain.IEEE transactions on pattern analysis and machine intelligence, 46(12):9583–9594, 2024. 3, 4, 6, 8, 2
work page 2024
-
[42]
Yuxin Wu, Alexander Kirillov, Francisco Massa, Wan-Yen Lo, and Ross Girshick. Detectron2.https://github. com/facebookresearch/detectron2, 2019. 2, 5, 1
work page 2019
-
[43]
Detrs beat yolos on real-time object detection
Yian Zhao, Wenyu Lv, Shangliang Xu, Jinman Wei, Guanzhong Wang, Qingqing Dang, Yi Liu, and Jie Chen. Detrs beat yolos on real-time object detection. InProceed- ings of the IEEE/CVF conference on computer vision and pattern recognition, pages 16965–16974, 2024. 2
work page 2024
-
[44]
Rethinking semantic segmen- tation from a sequence-to-sequence perspective with trans- formers
Sixiao Zheng, Jiachen Lu, Hengshuang Zhao, Xiatian Zhu, Zekun Luo, Yabiao Wang, Yanwei Fu, Jianfeng Feng, Tao Xiang, Philip HS Torr, et al. Rethinking semantic segmen- tation from a sequence-to-sequence perspective with trans- formers. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 6881–6890,
-
[45]
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Xizhou Zhu, Weijie Su, Lewei Lu, Bin Li, Xiaogang Wang, and Jifeng Dai. Deformable detr: Deformable trans- formers for end-to-end object detection.arXiv preprint arXiv:2010.04159, 2021. 1 10 ReLeaf: Benchmarking Leaf Segmentation across Domains and Species Supplementary Material This supplementary document complements the main paper with additional statis...
work page internal anchor Pith review arXiv 2010
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.