pith. sign in

arxiv: 2605.20458 · v1 · pith:ZGH4456Onew · submitted 2026-05-19 · 💻 cs.CV

ELEMENT: Multi-Modal Retinal Vessel Segmentation Based on a Coupled Region Growing and Machine Learning Approach

Pith reviewed 2026-05-21 06:46 UTC · model grok-4.3

classification 💻 cs.CV
keywords retinal vessel segmentationmulti-modal imagingmachine learningregion growingconnectivity featuresfundus imagesscanning laser ophthalmoscope
0
0 comments X

The pith

ELEMENT segments retinal vessels by coupling region growing with machine learning on grey-level and connectivity features.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents a framework called ELEMENT for automatic segmentation of blood vessels in retinal images captured through fundus photography, scanning laser ophthalmoscopy, and fluorescein angiography. It extracts pixel features that combine intensity values with information about vessel connectivity obtained via region growing, then feeds these into a machine learning classifier that propagates the connectivity evidence to neighboring pixels. This reduces labeling inconsistencies that plague purely intensity-based or deep-learning methods. When tested on the DRIVE, STARE, CHASE-DB, VAMPIRE FA, IOSTAR SLO and RC-SLO collections, the approach records accuracies from 97.4 percent to 98.35 percent and exceeds the performance of twenty-five of twenty-six prior algorithms, including six deep-learning models.

Core claim

The ELEMENT method achieves superior vessel segmentation accuracy across multiple ocular imaging modalities by extracting complementary grey-level and connectivity features and propagating the connectivity information seamlessly through pixels during the machine-learning classification stage.

What carries the argument

Coupled region-growing and machine-learning classification that extracts and propagates vessel-connectivity features alongside grey-level evidence.

If this is right

  • Faster and more consistent vessel maps become available for diagnosing diabetic retinopathy, glaucoma and macular degeneration.
  • The same feature set and propagation step works without retraining across fundus, SLO and FA modalities.
  • Reduced need for post-processing corrections compared with methods that produce fragmented vessel segments.
  • Potential to lower the time cost of manual or interactive segmentation in clinical workflows.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The explicit connectivity propagation step may let the method maintain accuracy when training data are scarce, unlike purely data-driven networks.
  • The framework could be inserted as a connectivity-regularization module inside existing deep networks to improve their boundary consistency.
  • Clinical deployment would benefit from measuring wall-clock segmentation time on standard hospital hardware rather than only reporting accuracy.

Load-bearing premise

The proposed features capture complementary evidence based on grey level and vessel connectivity properties, and this connectivity information is seamlessly propagated through the pixels at the classification phase to reduce inconsistencies.

What would settle it

Re-evaluate the method on a new set of images containing heavy noise or vessel-disrupting pathologies where the connectivity-propagation step produces more labeling errors than a pure intensity or deep-learning baseline.

Figures

Figures reproduced from arXiv: 2605.20458 by Aura Conci, Erick O. Rodrigues, Panos Liatsis.

Figure 7
Figure 7. Figure 7: SLO images, (a) and (d), from the IOSTAR dataset. (b) and (e) show the ground truth of (@) and (d), respectively, while (c) and (1) are the results of ELEMENT's vessel segmentation, respectively. of 45 degrees. The dataset contains a total of 24 images. The first image in the folder was used for training purposes, and segmentations were produced in the testing phase of the system for the remaining 23 image… view at source ↗
Figure 8
Figure 8. Figure 8: sults. RC-SLO images and respective ELEMENT segmentation re￾F. SLO Retinal Fundus (RC-SLO Dataset) Finally, we provide a further last experiment with another SLO dataset, i.e., RC-SLO, provided by [46]. This dataset contains SLO image patches such as the ones shown in [PITH_FULL_IMAGE:figures/full_fig_p010_8.png] view at source ↗
read the original abstract

Vascular structures in the retina contain important information for the detection and analysis of ocular diseases, including age-related macular degeneration, diabetic retinopathy and glaucoma. Commonly used modalities in diagnosis of these diseases are fundus photography, scanning laser ophthalmoscope (SLO) and fluorescein angiography (FA). Typically, retinal vessel segmentation is carried out either manually or interactively, which makes it time consuming and prone to human errors. In this research, we propose a new multi-modal framework for vessel segmentation called ELEMENT (vEsseL sEgmentation using Machine lEarning and coNnecTivity). This framework consists of feature extraction and pixel-based classification using region growing and machine learning. The proposed features capture complementary evidence based on grey level and vessel connectivity properties. The latter information is seamlessly propagated through the pixels at the classification phase. ELEMENT reduces inconsistencies and speeds up the segmentation throughput. We analyze and compare the performance of the proposed approach against state-of-the-art vessel segmentation algorithms in three major groups of experiments, for each of the ocular modalities. Our method produced higher overall performance, with an overall accuracy of 97.40%, compared to 25 of the 26 state-of-the-art approaches, including six works based on deep learning, evaluated on the widely known DRIVE fundus image dataset. In the case of the STARE, CHASE-DB, VAMPIRE FA, IOSTAR SLO and RC-SLO datasets, the proposed framework outperformed all of the state-of-the-art methods with accuracies of 98.27%, 97.78%, 98.34%, 98.04% and 98.35%, respectively.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The manuscript proposes ELEMENT, a multi-modal framework for retinal vessel segmentation that couples region growing with machine learning for pixel classification. Features are extracted from grey-level and vessel connectivity properties, with the latter propagated during classification to reduce inconsistencies. The work reports superior performance on six public datasets, claiming 97.40% accuracy on DRIVE (outperforming 25 of 26 SOTA methods including six deep-learning approaches) and accuracies of 98.27%, 97.78%, 98.34%, 98.04%, and 98.35% on STARE, CHASE-DB, VAMPIRE FA, IOSTAR SLO, and RC-SLO respectively.

Significance. If the reported accuracies are robustly validated, the hybrid non-deep-learning approach could be significant for retinal vessel segmentation in ocular disease analysis, offering a potentially more efficient and interpretable alternative to deep networks while leveraging multi-modal data and connectivity cues across fundus, SLO, and FA modalities.

major comments (3)
  1. [Methods] The methods description of the classification phase states that connectivity information 'is seamlessly propagated through the pixels' but provides no explicit mechanism, graph-based term, CRF integration, or pseudocode for this propagation; without an ablation isolating its effect, the performance gains cannot be confidently attributed to the claimed multi-modal complementarity rather than tuning.
  2. [Experiments] The experimental evaluation reports overall accuracies and outperformance claims but supplies no information on training-test splits, cross-validation strategy, hyper-parameter search, or statistical significance tests for the DRIVE results; this omission is load-bearing for the central claim of beating 25/26 prior methods including deep-learning baselines.
  3. [Results] Table or results section comparing to SOTA lacks a baseline ablation removing the connectivity feature or region-growing component, making it impossible to verify that the reported improvements stem from the proposed coupled approach rather than dataset-specific factors.
minor comments (2)
  1. [Methods] Clarify the exact machine-learning classifier used (e.g., random forest, SVM) and list all free hyperparameters with their search ranges.
  2. [Figures] Add zoomed insets or error maps in the qualitative figures to illustrate where inconsistencies are reduced by the connectivity propagation.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive and detailed review of our manuscript on the ELEMENT framework. We address each major comment point by point below, outlining how we will strengthen the paper through revisions where appropriate.

read point-by-point responses
  1. Referee: [Methods] The methods description of the classification phase states that connectivity information 'is seamlessly propagated through the pixels' but provides no explicit mechanism, graph-based term, CRF integration, or pseudocode for this propagation; without an ablation isolating its effect, the performance gains cannot be confidently attributed to the claimed multi-modal complementarity rather than tuning.

    Authors: We agree that the current description of connectivity propagation during classification is insufficiently detailed. In the revised manuscript, we will expand the methods section to provide an explicit description of the propagation mechanism, including how vessel connectivity properties are updated and passed to neighboring pixels. We will also include pseudocode for the classification phase and add an ablation experiment that removes the connectivity propagation to isolate its contribution to the reported performance. revision: yes

  2. Referee: [Experiments] The experimental evaluation reports overall accuracies and outperformance claims but supplies no information on training-test splits, cross-validation strategy, hyper-parameter search, or statistical significance tests for the DRIVE results; this omission is load-bearing for the central claim of beating 25/26 prior methods including deep-learning baselines.

    Authors: We acknowledge that these experimental details are critical for reproducibility and for substantiating the outperformance claims. In the revision, we will add a dedicated experimental setup subsection specifying the standard training-test splits for each dataset (e.g., the 20/20 split for DRIVE), any cross-validation strategy used, the hyper-parameter search procedure, and statistical significance tests (such as McNemar's test) comparing ELEMENT against the baselines on DRIVE. revision: yes

  3. Referee: [Results] Table or results section comparing to SOTA lacks a baseline ablation removing the connectivity feature or region-growing component, making it impossible to verify that the reported improvements stem from the proposed coupled approach rather than dataset-specific factors.

    Authors: We concur that ablation studies are needed to attribute gains to the coupled region-growing and machine-learning components. We will incorporate new ablation experiments in the results section, reporting performance when the connectivity feature is removed and when the region-growing component is disabled, across the evaluated datasets, to demonstrate the contribution of the proposed coupling. revision: yes

Circularity Check

0 steps flagged

No circularity in empirical segmentation pipeline

full rationale

The paper presents an empirical multi-modal vessel segmentation framework (ELEMENT) that extracts grey-level and connectivity features, applies region growing, and performs pixel classification via machine learning, with performance measured by accuracy on public benchmarks (DRIVE, STARE, CHASE-DB, etc.). No mathematical derivations, predictions, or first-principles results are claimed that reduce to fitted parameters or self-referential inputs by construction. The central performance claims rest on direct comparison to external state-of-the-art methods rather than any internal reduction or self-citation chain, rendering the work self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on standard image-processing assumptions plus fitted machine-learning parameters; no new physical entities are postulated.

free parameters (1)
  • machine-learning classifier hyperparameters
    Parameters of the pixel classifier are chosen or fitted on training data to achieve the reported accuracies.
axioms (1)
  • domain assumption Grey-level and vessel-connectivity features provide complementary information that improves pixel classification when propagated together.
    Invoked in the description of feature extraction and the classification phase.

pith-pipeline@v0.9.0 · 5836 in / 1374 out tokens · 30472 ms · 2026-05-21T06:46:47.449898+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

4 extracted references · 4 canonical work pages

  1. [1]

    Parallel multiscale feature extraction and region growing: Application in retinal blood vessel detection,

    L. Ortega-Arjona, “Parallel multiscale feature extraction and region growing: Application in retinal blood vessel detection,” IEEE Trans. Inf. Technol. Biomedicine, vol. 14, no. 2. pp. S00-506, Mar. 2010. C. Lorenz, C. Carlsen, T. M. Buzug, C. Fassnacht, and J. Weese, “A multi- scale line filter with automatic scale selection based on the hessian matrix f...

  2. [2]

    Theoretical foundations of anisotropic diffusion in image processing

    Weickert, “Theoretical foundations of anisotropic diffusion in image processing.” Theor. Found. Comput. Vision, Comput. Suppl.. vol. 11, pp. 221-236, 1996. E. O. Rodrigues, “Combining minkowski and cheyshev: New distance pro- posal and survey of distance metrics using k-nearest neighbours classificr.” Pattem Recognit. Lett...vol. 110, no. 15, pp. 66-71, 2...

  3. [3]

    Retinal vessel segmentation using the 2-d gabor wavelet and supervised classification

    1. V.B. Soares. ]. 1. G. Leandro, R. M. Cesar, H. F. Jelinek. and M. J. Cree, “Retinal vessel segmentation using the 2-d gabor wavelet and supervised classification.” IEEE Trans. Med. Imag., vol. 25, no. 9, pp. 1214-1222, Sep. 2006. D. Marin, A. Aquino, M. E. Gegundez-Arias, and J. M. Bravo, “A new supervised method for blood vessel segmentation in retina...

  4. [4]

    Locating blood vessels in retinal images by piccewise threshold probing of a matched filter response

    [Online]. Available: hitps://www.sciencedirect.com/sciencefarticle/ abs/pii/S 1361841519300982 [61] [62] [63] [67] [68] [69] [70] 7 m) 1) [74] [75] [76] mm [78] 791 A.D. Hoover, V. Kouznetsova, and M. Goldbaum, “Locating blood vessels in retinal images by piccewise threshold probing of a matched filter response.” IEEE Trans. Med. Imag., vol. 19, 0. 3, pp....