pith. machine review for the scientific record. sign in

arxiv: 2604.14263 · v1 · submitted 2026-04-15 · 🧬 q-bio.TO · cs.CV· cs.LG

Recognition: unknown

A deep learning framework for glomeruli segmentation with boundary attention

Behnaz Elhaminia, Catherine King, Dimitrios Chanouzas, Jiaqi Lv, Lorraine Harper, Owen Cain, Paul Moss, Shan E Ahmed Raza

Authors on Pith no claims yet

Pith reviewed 2026-05-10 12:17 UTC · model grok-4.3

classification 🧬 q-bio.TO cs.CVcs.LG
keywords glomeruli segmentationdeep learningboundary attentionU-Netkidney pathologyinstance segmentationattention decoder
0
0 comments X

The pith

U-Net with boundary attention decoder segments glomeruli more accurately than existing methods.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a deep learning approach for segmenting glomeruli in kidney tissue images by adding a specialised attention decoder to a U-Net model. This decoder focuses on boundaries to better distinguish between closely packed glomeruli, which traditional semantic segmentation often merges incorrectly. Accurate segmentation supports better diagnosis of kidney conditions. The model builds on pathology foundation models and reports improved performance over prior techniques in standard overlap metrics.

Core claim

The authors present a U-Net-based architecture that incorporates a specialised attention decoder to emphasise boundary separation, enabling more precise instance-level segmentation of glomeruli. Experimental results show that this approach achieves higher Dice scores and Intersection over Union values than state-of-the-art methods, demonstrating superior performance in delineating individual glomeruli even when they are adjacent.

What carries the argument

A specialised attention decoder added to the U-Net decoder path that highlights critical boundary regions to improve separation of adjacent glomeruli instances.

If this is right

  • The new model can delineate adjacent glomeruli more precisely in pathology images.
  • It achieves better results than current methods according to Dice and IoU metrics.
  • The approach leverages existing pathology foundation models for feature extraction.
  • Improved segmentation accuracy supports more reliable diagnostic applications in kidney disease.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Similar boundary attention techniques might apply to segmenting other clustered structures in medical imaging, such as cells or tumors.
  • Testing the model on diverse kidney tissue samples from different sources could reveal if the gains hold generally.
  • The framework might reduce the need for manual boundary corrections in clinical workflows.

Load-bearing premise

The specialised attention decoder must actually enhance boundary separation for touching glomeruli, and the performance improvements must stem from the model design rather than specific dataset characteristics or selective evaluation.

What would settle it

A comparison on a new, independent dataset containing many pairs of adjacent glomeruli where the proposed model shows no gain in separation accuracy over a standard U-Net without the attention decoder.

read the original abstract

Accurate detection and segmentation of glomeruli in kidney tissue are essential for diagnostic applications. Traditional deep learning methods primarily rely on semantic segmentation, which often fails to precisely delineate adjacent glomeruli. To address this challenge, we propose a novel glomerulus detection and segmentation model that emphasises boundary separation. Leveraging pathology foundation models, the proposed U-Net-based architecture incorporates a specialised attention decoder designed to highlight critical regions and improve instancelevel segmentation. Experimental evaluations demonstrate that our approach surpasses state-of-the-art methods in both Dice score and Intersection over Union, indicating superior performance in glomerular delineation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 1 minor

Summary. The paper proposes a U-Net-based model augmented with a specialized attention decoder that leverages pathology foundation models to emphasize boundary separation for improved instance segmentation of glomeruli in kidney tissue images. It claims this architecture yields superior Dice scores and Intersection over Union (IoU) metrics compared to state-of-the-art methods.

Significance. If the performance claims can be substantiated through proper validation, the work could advance instance segmentation in renal histopathology by addressing the common failure of semantic segmentation to separate adjacent glomeruli. The integration of foundation models for boundary attention represents a relevant direction, but the absence of supporting experimental details prevents assessment of whether the gains are attributable to the proposed components.

major comments (3)
  1. [Abstract] Abstract: The assertion that 'Experimental evaluations demonstrate that our approach surpasses state-of-the-art methods in both Dice score and Intersection over Union' is unsupported by any dataset description, baseline implementations, quantitative results, statistical tests, or ablation studies, so the central empirical claim cannot be evaluated.
  2. [Model Description] Model architecture description: No ablation is presented to isolate the contribution of the specialized attention decoder (e.g., by comparing the full model against a baseline U-Net without the boundary attention component), leaving open whether reported gains stem from the decoder or from other unstated factors such as training regime or data handling.
  3. [Experimental Evaluations] Experimental evaluations: The text provides no information on the dataset (size, source, train/test splits), how state-of-the-art baselines were re-implemented under identical conditions, or any error analysis/statistical significance testing on the metric differences, undermining the fairness of the superiority claim.
minor comments (1)
  1. [Abstract] Abstract: 'instancelevel' is missing a hyphen and should read 'instance-level'.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive comments on our manuscript. We acknowledge that several key experimental details, ablations, and supporting analyses were insufficiently documented, which limits the ability to fully evaluate the claims. We will revise the manuscript to address these points directly by adding the missing information, studies, and statistical validations.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The assertion that 'Experimental evaluations demonstrate that our approach surpasses state-of-the-art methods in both Dice score and Intersection over Union' is unsupported by any dataset description, baseline implementations, quantitative results, statistical tests, or ablation studies, so the central empirical claim cannot be evaluated.

    Authors: We agree that the abstract claim requires explicit support in the manuscript. In the revision, we will ensure the main text includes a clear dataset description, baseline re-implementations, quantitative results tables, statistical tests, and ablations. We will also update the abstract to reference these supporting elements where space permits, so the superiority claim can be properly evaluated. revision: yes

  2. Referee: [Model Description] Model architecture description: No ablation is presented to isolate the contribution of the specialized attention decoder (e.g., by comparing the full model against a baseline U-Net without the boundary attention component), leaving open whether reported gains stem from the decoder or from other unstated factors such as training regime or data handling.

    Authors: We concur that an ablation study is necessary to isolate the boundary attention decoder's contribution. The revised manuscript will include a new ablation experiment comparing the full proposed model against a standard U-Net baseline (without the specialized decoder) trained under identical conditions, with results reported on the same metrics to demonstrate the decoder's specific impact. revision: yes

  3. Referee: [Experimental Evaluations] Experimental evaluations: The text provides no information on the dataset (size, source, train/test splits), how state-of-the-art baselines were re-implemented under identical conditions, or any error analysis/statistical significance testing on the metric differences, undermining the fairness of the superiority claim.

    Authors: We apologize for these omissions in the experimental section. The revised version will add a comprehensive experimental setup subsection detailing the dataset (size, source, and train/test splits), the exact re-implementation protocol for all baselines under matched conditions, and full error analysis including statistical significance testing (e.g., paired t-tests or Wilcoxon tests) with p-values and confidence intervals on the Dice and IoU differences. revision: yes

Circularity Check

0 steps flagged

No circularity: purely empirical architecture proposal and benchmark comparison.

full rationale

The manuscript describes a U-Net variant augmented with a boundary-attention decoder that leverages pathology foundation models. Its central claim is an empirical performance improvement (higher Dice and IoU) over prior methods on glomeruli segmentation tasks. No equations, derivations, fitted parameters renamed as predictions, or self-referential definitions appear in the provided text. The result is therefore not reducible to its own inputs by construction; it rests on standard experimental reporting whose validity is independent of any circular step.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

The abstract contains no explicit mathematical axioms, free parameters, or newly postulated entities. The work implicitly relies on standard deep-learning assumptions that attention mechanisms improve boundary delineation and that foundation-model features transfer to this task.

pith-pipeline@v0.9.0 · 5412 in / 1069 out tokens · 36226 ms · 2026-05-10T12:17:20.259360+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

26 extracted references · 1 canonical work pages · 1 internal anchor

  1. [1]

    A deep learning framework for glomeruli segmentation with boundary attention

    INTRODUCTION The glomerulus is a key structural and functional unit of the human kidney. Accurate evaluation of glomeruli in re- nal biopsy samples is crucial for diagnosing a wide range of kidney diseases. This assessment also informs treatment strategies and provides valuable prognostic insights. In recent years, deep learning approaches have been incre...

  2. [2]

    Datasets In this study, we used two datasets

    MA TERIALS AND METHODS 2.1. Datasets In this study, we used two datasets. (i) The public dataset was obtained from the Human BioMolecular Atlas Program (HuBMAP) [11] and comprises 20 PAS-stained kidney WSIs, including 11 fresh-frozen and 9 formalin-fixed paraffin- embedded (FFPE) samples. All images were scanned at a spatial resolution of 0.65µm/pixel. Gl...

  3. [3]

    The first model, TOM architecture [19], employs a U- Net–style design with a SeResNeXt101 encoder and an attention module

    EXPERIMENTAL RESULTS We evaluated our method in comparison with the top three models on the Kaggle HuBMAP challenge leaderboard [18]. The first model, TOM architecture [19], employs a U- Net–style design with a SeResNeXt101 encoder and an attention module. The second model, Gleb [18], uses an en- semble of four four-fold models, including three U-Net–styl...

  4. [4]

    This is likely due to the increased complexity of the REACTIV AS data closely adjacent glomeruli sharing bor- ders, making segmentation more difficult

    CONCLUSION The results in Table 1 and Table 2 show that the Dice and IoU decreased for all models when tested on the REACTI- V AS dataset. This is likely due to the increased complexity of the REACTIV AS data closely adjacent glomeruli sharing bor- ders, making segmentation more difficult. Despite these chal- lenges, the proposed model outperformed state-...

  5. [5]

    CK is funded by an MRC CRTF fellowship (MR/X006964/1) which supported the collection of the REACTIV AS dataset

    ACKNOWLEDGMENTS BE & SR report financial support from MRC (MR/X011585/1). CK is funded by an MRC CRTF fellowship (MR/X006964/1) which supported the collection of the REACTIV AS dataset. Recruitment of patients to the REACTIV AS dataset was partly supported by an Investigator-Initiated Program of Merck Sharp a Dohme Corp (MSD). JL is supported by the EPSRC UK

  6. [6]

    Computational segmentation and classi- fication of diabetic glomerulosclerosis,

    Ginley et al., “Computational segmentation and classi- fication of diabetic glomerulosclerosis,”Journal of the American Society of Nephrology, 2019

  7. [7]

    A deep learning-based approach for glomeruli instance segmentation from multistained re- nal biopsy pathologic images,

    Jiang et al., “A deep learning-based approach for glomeruli instance segmentation from multistained re- nal biopsy pathologic images,”The American Journal of Pathology, vol. 191, no. 8, pp. 1431–1441, 2021

  8. [8]

    Glomnet: a hover deep learning model for glomerulus instance segmentation,

    Noemie et al., “Glomnet: a hover deep learning model for glomerulus instance segmentation,” in2024 IEEE International Symposium on Biomedical Imaging (ISBI). IEEE, 2024, pp. 1–5

  9. [9]

    Glo-net: A dual task branch based neural network for multi-class glomeruli segmentation,

    Wang et al., “Glo-net: A dual task branch based neural network for multi-class glomeruli segmentation,”Com- puters in Biology and Medicine, vol. 186, pp. 109670, 2025

  10. [10]

    A hybrid cnn-transxnet approach for advanced glomerular segmentation in renal histology imaging,

    Liu, “A hybrid cnn-transxnet approach for advanced glomerular segmentation in renal histology imaging,” International Journal of Computational Intelligence Systems, vol. 17, no. 1, pp. 126, 2024

  11. [11]

    An image inpainting-based data augmenta- tion method for improved sclerosed glomerular iden- tification performance with the segmentation model efficientnetb3-unet,

    He et al., “An image inpainting-based data augmenta- tion method for improved sclerosed glomerular iden- tification performance with the segmentation model efficientnetb3-unet,”Scientific Reports, vol. 14, no. 1, pp. 1033, 2024

  12. [12]

    Ensembled segnext based glomeruli seg- mentation,

    Kumar et al., “Ensembled segnext based glomeruli seg- mentation,” inInternational Workshop on Medical Op- tical Imaging and Virtual Microscopy Image Analysis. Springer, 2024, pp. 202–209

  13. [13]

    Unsupervised stain augmentation en- hanced glomerular instance segmentation on pathology images,

    Yang et al., “Unsupervised stain augmentation en- hanced glomerular instance segmentation on pathology images,”International Journal of Computer Assisted Radiology and Surgery, pp. 225–236, 2025

  14. [14]

    Glomerulosclerosis identification in whole slide images using semantic segmentation,

    Bueno et al., “Glomerulosclerosis identification in whole slide images using semantic segmentation,”Com- puter methods and programs in biomedicine, vol. 184, pp. 105273, 2020

  15. [15]

    Region-based convolutional neural nets for localization of glomeruli in trichrome-stained whole kidney sections,

    Bukowy et al., “Region-based convolutional neural nets for localization of glomeruli in trichrome-stained whole kidney sections,”Journal of the American Society of Nephrology, vol. 29, no. 8, pp. 2081–2088, 2018

  16. [16]

    Hubmap - hacking the kidney,

    Addison et al., “Hubmap - hacking the kidney,” 2020, Kaggle competition, accessed: 2025-10-03

  17. [17]

    REACTIV AS — Subclinical cytomegalovirus reactivation in patients with newly diagnosed or relapsed ANCA-associated vasculitis and adverse clini- cal outcomes,

    Health Research Authority, “REACTIV AS — Subclinical cytomegalovirus reactivation in patients with newly diagnosed or relapsed ANCA-associated vasculitis and adverse clini- cal outcomes,”https://www.hra.nhs.uk/ planning-and-improving-research/ application-summaries/ research-summaries/reactivas/, 2021, ClinicalTrials.gov unique protocol ID NCT04916704

  18. [18]

    Leveraging pathology foundation models for panoptic segmentation of melanoma in h&e images,

    Lv et al., “Leveraging pathology foundation models for panoptic segmentation of melanoma in h&e images,” in Annual Conference on Medical Image Understanding and Analysis. Springer, 2025, pp. 58–72

  19. [19]

    Efficientnetv2: Smaller models and faster training,

    Tan et al., “Efficientnetv2: Smaller models and faster training,” inInternational conference on machine learn- ing. PMLR, 2021, pp. 10096–10106

  20. [20]

    I can find you! boundary-guided separated attention network for camouflaged object detection,

    Zhu et al., “I can find you! boundary-guided separated attention network for camouflaged object detection,” in Proceedings of the AAAI conference on artificial intelli- gence, 2022, vol. 36, pp. 3608–3616

  21. [21]

    Reverse attention for salient object detec- tion,

    Chen et al., “Reverse attention for salient object detec- tion,” inProceedings of the European conference on computer vision (ECCV), 2018, pp. 234–250

  22. [22]

    U-net: Convolutional networks for biomed- ical image segmentation,

    Olaf et al., “U-net: Convolutional networks for biomed- ical image segmentation,” Springer, 2015, pp. 234–241

  23. [23]

    Segmentation of human functional tissue units in support of a human reference atlas,

    Jain et al., “Segmentation of human functional tissue units in support of a human reference atlas,”Communi- cations Biology, vol. 6, no. 1, pp. 717, 2023

  24. [24]

    Kaggle hubmap repository,

    Tikutikutiku, “Kaggle hubmap repository,” https://github.com/tikutikutiku/ kaggle-hubmap, 2023, Accessed: 2025-10-24

  25. [25]

    Hubmap-3rd-place-solution,

    Shujun He, “Hubmap-3rd-place-solution,” https://github.com/Shujun-He/ Hubmap-3rd-place-solution/, 2025, Ac- cessed: 2025-10-24

  26. [26]

    Metrics reloaded: recommendations for image analysis validation,

    Lena et al., “Metrics reloaded: recommendations for image analysis validation,”Nature methods, vol. 21, no. 2, pp. 195–212, 2024