PGE-SAM: Prompt-Guided Feature Enhancement for Interactive Segmentation under Degradation
Pith reviewed 2026-06-30 06:40 UTC · model grok-4.3
The pith
PGE-SAM uses user prompts to spatially guide feature restoration and improve SAM segmentation on degraded images
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
PGE-SAM explicitly leverages user prompts and prior mask predictions to spatially guide the feature restoration process toward regions of interest through a Prompt Guidance Generator. To recover fine-grained details lost under degradation, it introduces Multi-Scale Features Interaction to incorporate low-level encoder features, along with a Foreground Reconstruction Loss that restricts feature-level supervision to the segmentation target. This produces state-of-the-art robustness on both medical and natural image domains across multiple degradation levels while maintaining generalization to clean images and adding less than one-fifth of the parameters of prior methods.
What carries the argument
The Prompt Guidance Generator, which uses user prompts and prior mask predictions to spatially direct feature restoration toward segmentation-relevant regions.
If this is right
- Achieves state-of-the-art robustness on medical and natural images at multiple degradation levels.
- Maintains original generalization performance on clean images.
- Adds less than one-fifth the parameters required by earlier restoration methods.
- Preserves SAM's iterative prompt-refinement loop in interactive use.
Where Pith is reading between the lines
- The same prompt-directed restoration pattern could be attached to other promptable vision models facing degraded inputs.
- Targeted guidance may let practitioners skip separate full-image restoration steps before running segmentation.
- Applying the approach to video or 3-D medical volumes would test whether temporal or volumetric consistency improves under analogous degradations.
Load-bearing premise
Spatially guiding feature restoration with prompts and prior masks will recover fine-grained details lost under degradation without introducing new artifacts or harming SAM's zero-shot behavior on clean images.
What would settle it
An experiment in which PGE-SAM produces lower mask accuracy or visible new artifacts on a set of degraded test images compared with the unmodified SAM would falsify the central claim.
Figures
read the original abstract
Segment Anything Model (SAM) has revolutionized promptable image segmentation with strong zero-shot generalization. However, its performance degrades substantially under real-world imaging artifacts such as noise, blur, and compression. Existing methods restore features globally without focusing on segmentation-relevant regions and neglect SAM's iterative refinement mechanism, leading to suboptimal performance in interactive settings. We propose Prompt-Guided Feature Enhancement SAM (PGE-SAM), a framework that explicitly leverages user prompts and prior mask predictions to spatially guide the feature restoration process toward regions of interest through a Prompt Guidance Generator. To recover fine-grained details lost under degradation, we introduce Multi-Scale Features Interaction to incorporate low-level encoder features, along with a Foreground Reconstruction Loss that restricts feature-level supervision to the segmentation target. Furthermore, we present DM-Seg, a benchmark for interactive segmentation on degraded medical images, spanning multiple imaging modalities with both general and modality-specific degradations at varying severity levels. Extensive experiments demonstrate that PGE-SAM achieves SOTA robustness on both medical and natural image domains across multiple degradation levels, while maintaining generalization to clean images and adding less than one-fifth of the parameters of prior methods.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes PGE-SAM to improve the Segment Anything Model (SAM) for interactive segmentation under real-world degradations (noise, blur, compression). It introduces a Prompt Guidance Generator that uses user prompts and prior masks to spatially direct feature restoration, Multi-Scale Features Interaction to incorporate low-level encoder features, and a Foreground Reconstruction Loss to focus supervision on the target region. A new benchmark DM-Seg for degraded medical images across modalities and severity levels is presented. The central claim is that PGE-SAM achieves SOTA robustness on both medical and natural image domains while preserving zero-shot generalization on clean images and adding less than one-fifth the parameters of prior methods.
Significance. If the performance claims hold with rigorous validation, the work would address a practical gap in deploying promptable segmentation models under realistic imaging conditions, particularly in medical domains. The DM-Seg benchmark could serve as a useful resource for future robustness studies. The parameter-efficiency aspect, if demonstrated, would be a notable strength relative to heavier restoration pipelines.
major comments (3)
- [Abstract] Abstract: The SOTA robustness claim on medical and natural domains across degradation levels is asserted via 'extensive experiments' on DM-Seg and other benchmarks, yet the manuscript text supplies no quantitative tables, baseline comparisons, error bars, ablation results, or statistical tests. This renders the central performance claim unverifiable and load-bearing for the contribution.
- [Abstract] Abstract (Prompt Guidance Generator and Foreground Reconstruction Loss): The assumption that spatially guiding feature restoration with prompts and prior masks recovers fine-grained details without introducing new artifacts or harming SAM's zero-shot behavior on clean images is stated but unsupported by any ablation, visualization of restored features, or quantitative check on artifact introduction. This is the weakest assumption underlying the robustness claim.
- [Abstract] Abstract (parameter count): The claim of adding less than one-fifth the parameters of prior methods is presented without any explicit parameter counts, comparison table, or breakdown of the added modules (Prompt Guidance Generator, Multi-Scale Features Interaction).
minor comments (2)
- [Abstract] The description of Multi-Scale Features Interaction lacks any diagram, equation, or pseudocode showing how low-level encoder features are fused with the prompt-guided restoration.
- [Abstract] DM-Seg is introduced as a new benchmark, but no details on its construction, number of images, degradation simulation procedure, or annotation protocol are supplied.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback. We address each major comment below and will revise the manuscript to improve verifiability of the claims.
read point-by-point responses
-
Referee: [Abstract] Abstract: The SOTA robustness claim on medical and natural domains across degradation levels is asserted via 'extensive experiments' on DM-Seg and other benchmarks, yet the manuscript text supplies no quantitative tables, baseline comparisons, error bars, ablation results, or statistical tests. This renders the central performance claim unverifiable and load-bearing for the contribution.
Authors: We acknowledge that the abstract asserts SOTA performance without embedding supporting numbers or references to specific results. The full manuscript contains experimental sections with comparisons, but to directly address the concern, we will revise the abstract to include key quantitative metrics (e.g., mIoU improvements across degradation levels) and add a concise results summary table. We will also ensure error bars and statistical significance are reported in the main text if not already explicit. revision: yes
-
Referee: [Abstract] Abstract (Prompt Guidance Generator and Foreground Reconstruction Loss): The assumption that spatially guiding feature restoration with prompts and prior masks recovers fine-grained details without introducing new artifacts or harming SAM's zero-shot behavior on clean images is stated but unsupported by any ablation, visualization of restored features, or quantitative check on artifact introduction. This is the weakest assumption underlying the robustness claim.
Authors: We agree that the current manuscript provides insufficient direct evidence for this assumption. We will add dedicated ablations isolating the Prompt Guidance Generator and Foreground Reconstruction Loss, include feature visualizations before/after restoration, and report quantitative metrics on clean-image performance and potential artifact introduction (e.g., via perceptual metrics or failure case analysis) in the revised version. revision: yes
-
Referee: [Abstract] Abstract (parameter count): The claim of adding less than one-fifth the parameters of prior methods is presented without any explicit parameter counts, comparison table, or breakdown of the added modules (Prompt Guidance Generator, Multi-Scale Features Interaction).
Authors: We accept this point. The revised manuscript will include a dedicated parameter-count table with breakdowns for each added module, explicit comparisons to prior methods, and confirmation of the 'less than one-fifth' claim with precise numbers. revision: yes
Circularity Check
No circularity: additive modules and empirical claims with no self-referential reductions
full rationale
The provided abstract and description introduce PGE-SAM as a framework with new components (Prompt Guidance Generator, Multi-Scale Features Interaction, Foreground Reconstruction Loss) and a new benchmark DM-Seg. These are presented as additive architectural elements rather than derivations. No equations, parameter fits, predictions that reduce to inputs by construction, or load-bearing self-citations appear. SOTA claims are asserted via 'extensive experiments' on external benchmarks, which are independent of any internal derivation chain. This matches the default expectation of no significant circularity.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption SAM's iterative refinement mechanism can be preserved while adding external feature restoration modules
invented entities (3)
-
Prompt Guidance Generator
no independent evidence
-
Multi-Scale Features Interaction
no independent evidence
-
Foreground Reconstruction Loss
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Data in brief28, 104863 (2020) 8, 28, 31, 36
Al-Dhabyani, W., Gomaa, M., Khaled, H., Fahmy, A.: Dataset of breast ultrasound images. Data in brief28, 104863 (2020) 8, 28, 31, 36
2020
-
[2]
Microscopy Research and Technique 87(12), 3089–3106 (2024) 9
Al-Tahhan, F., Fares, M.: An accurate paradigm for denoising degraded ultrasound images based on artificial intelligence systems. Microscopy Research and Technique 87(12), 3089–3106 (2024) 9
2024
-
[3]
Baid, U., Ghodasara, S., Mohan, S., Bilello, M., Calabrese, E., Colak, E., Farahani, K., Kalpathy-Cramer, J., Kitamura, F.C., Pati, S., et al.: The rsna-asnr-miccai brats 2021 benchmark on brain tumor segmentation and radiogenomic classifica- tion. arXiv preprint arXiv:2107.02314 (2021) 8, 28, 31, 36
work page internal anchor Pith review Pith/arXiv arXiv 2021
-
[4]
Radiographics 24(6), 1679–1691 (2004) 35
Barrett, J.F., Keat, N.: Artifacts in ct: recognition and avoidance. Radiographics 24(6), 1679–1691 (2004) 35
2004
-
[5]
saliency maps from physicians
Bernal, J., Sánchez, F.J., Fernández-Esparrach, G., Gil, D., Rodríguez, C., Vilar- iño, F.: Wm-dova maps for accurate polyp highlighting in colonoscopy: Validation vs. saliency maps from physicians. Computerized medical imaging and graphics 43, 99–111 (2015) 8, 28, 32, 36
2015
-
[6]
International journal of biomedical imaging2008(1), 184123 (2008) 35
Block, K.T., Uecker, M., Frahm, J.: Suppression of mri truncation artifacts using total variation constrained data extrapolation. International journal of biomedical imaging2008(1), 184123 (2008) 35
2008
-
[7]
Computers in biology and medicine109, 218–225 (2019) 8, 28, 32, 36
Buda, M., Saha, A., Mazurowski, M.A.: Association of genomic subtypes of lower- grade gliomas with shape features automatically extracted by a deep learning al- gorithm. Computers in biology and medicine109, 218–225 (2019) 8, 28, 32, 36
2019
-
[8]
Nature methods 16(12), 1247–1253 (2019) 8, 28, 31, 36
Caicedo, J.C., Goodman, A., Karhohs, K.W., Cimini, B.A., Ackerman, J., Haghighi, M., Heng, C., Becker, T., Doan, M., McQuin, C., et al.: Nucleus segmen- tation across imaging experiments: the 2018 data science bowl. Nature methods 16(12), 1247–1253 (2019) 8, 28, 31, 36
2018
-
[9]
In: Computer Vision – ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part VII
Chen, L., Chu, X., Zhang, X., Sun, J.: Simple baselines for image restoration. In: Computer Vision – ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part VII. pp. 17–33. Springer-Verlag, Berlin, Heidelberg (2022) 7, 13
2022
-
[10]
International Journal of Computer Vision131(8), 2198–2218 (2023) 20, 33, 38, 42
Chen, L., Fu, Y., Wei, K., Zheng, D., Heide, F.: Instance segmentation in the dark. International Journal of Computer Vision131(8), 2198–2218 (2023) 20, 33, 38, 42
2023
- [11]
-
[12]
In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Chen, W.T., Vong, Y.J., Kuo, S.Y., Ma, S., Wang, J.: Robustsam: Segment any- thing robustly on degraded images. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 4081–4091 (June 2024) 2, 4, 8, 10, 17, 19, 23, 32, 41
2024
-
[13]
Cheng, J., Ye, J., Deng, Z., Chen, J., Li, T., Wang, H., Su, Y., Huang, Z., Chen, J., Sun, L.J.H., He, J., Zhang, S., Zhu, M., Qiao, Y.: Sam-med2d (2023) 4, 10, 22
2023
-
[14]
IEEE transactions on pattern analysis and machine intel- ligence37(3), 569–582 (2014) 9, 26
Cheng, M.M., Mitra, N.J., Huang, X., Torr, P.H., Hu, S.M.: Global contrast based salient region detection. IEEE transactions on pattern analysis and machine intel- ligence37(3), 569–582 (2014) 9, 26
2014
-
[15]
In: 2018 IEEE 15th international symposium on biomedical imaging (ISBI 2018)
Codella, N.C., Gutman, D., Celebi, M.E., Helba, B., Marchetti, M.A., Dusza, S.W., Kalloo, A., Liopyris, K., Mishra, N., Kittler, H., et al.: Skin lesion analysis toward melanoma detection: A challenge at the 2017 international symposium on biomed- ical imaging (isbi), hosted by the international skin imaging collaboration (isic). In: 2018 IEEE 15th intern...
2017
-
[16]
Lippincott Williams & Wilkins (1990) 35
Curry, T.S., Dowdey, J.E., Murry, R.C.: Christensen’s physics of diagnostic radi- ology. Lippincott Williams & Wilkins (1990) 35
1990
-
[17]
IEEE Access8, 15983–15999 (2020) 35
Duarte-Salazar, C.A., Castro-Ospina, A.E., Becerra, M.A., Delgado-Trejos, E.: Speckle noise reduction in ultrasound images for improving the metrological evalu- ation of biomedical applications: an overview. IEEE Access8, 15983–15999 (2020) 35
2020
-
[18]
Duraš, A., Wolf, B.J., Ilioudi, A., Palunko, I., De Schutter, B.: A dataset for de- tection and segmentation of underwater marine debris in shallow waters. Scientific Data11(1), 921 (aug 2024).https://doi.org/10.1038/s41597-024-03759-2, https://doi.org/10.1038/s41597-024-03759-221, 33, 39, 42
-
[19]
In: proceedings of Medical Image Computing and Computer Assisted Intervention – MICCAI 2024
Gao, Y., Xia, W., Hu, D., Wang, W., Gao, X.: DeSAM: Decoupled Segment Anything Model for Generalizable Medical Image Segmentation . In: proceedings of Medical Image Computing and Computer Assisted Intervention – MICCAI 2024. vol. LNCS 15012. Springer Nature Switzerland (October 2024) 4
2024
-
[20]
In: Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)
Guo, G., Guo, Y., Yu, X., Li, W., Wang, Y., Gao, S.: Segment any-quality images with generative latent space enhancement. In: Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR). pp. 2366–2376 (June 2025) 2, 4, 6, 8, 9, 10, 17, 19, 23, 26, 40
2025
-
[21]
In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (June 2019) 9, 19, 26
Gupta, A., Dollar, P., Girshick, R.: Lvis: A dataset for large vocabulary instance segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (June 2019) 9, 19, 26
2019
-
[22]
In: Proceedings of the IEEE/CVF Winter Con- ference on Applications of Computer Vision (WACV)
Gupta, H., Kotlyar, O., Andreasson, H., Lilienthal, A.J.: Robust object detection in challenging weather conditions. In: Proceedings of the IEEE/CVF Winter Con- ference on Applications of Computer Vision (WACV). pp. 7523–7532 (January
-
[23]
Physics in Medicine & Biology65(23), 235039 (nov 2020) 3
Hao, S., Liu, J., Chen, Y., Liu, B., Wei, C., Zhu, J., Li, B.: A wavelet transform- based photon starvation artifacts suppression algorithm in ct imaging. Physics in Medicine & Biology65(23), 235039 (nov 2020) 3
2020
-
[24]
Proceedings of the International Conference on Learning Representations (2019) 1
Hendrycks,D.,Dietterich,T.:Benchmarkingneuralnetworkrobustnesstocommon corruptions and perturbations. Proceedings of the International Conference on Learning Representations (2019) 1
2019
-
[25]
Advancements and breakthroughs in ultrasound imaging1(8), 1–8 (2013) 35
Hiremath, P., Akkasaligar, P.T., Badiger, S., Gunarathne, G.: Speckle noise reduc- tion in medical ultrasound images. Advancements and breakthroughs in ultrasound imaging1(8), 1–8 (2013) 35
2013
-
[26]
In: ICLR (2025) 10
Hu, J., Jin, L., Yao, Z., Lu, Y.: Universal image restoration pre-training via degra- dation classification. In: ICLR (2025) 10
2025
- [27]
-
[28]
In: International con- ference on multimedia modeling
Jha, D., Smedsrud, P.H., Riegler, M.A., Halvorsen, P., De Lange, T., Johansen, D., Johansen, H.D.: Kvasir-seg: A segmented polyp dataset. In: International con- ference on multimedia modeling. pp. 451–462. Springer (2019) 8, 28, 32, 36
2019
-
[29]
In: Brain Tumor MRI Image Segmentation Using Deep Learning Techniques, pp
Jindal, S.K., Banerjee, S., Patra, R., Paul, A.: Deep learning-based brain malig- nant neoplasm classification using mri image segmentation assisted by bias field correction and histogram equalization. In: Brain Tumor MRI Image Segmentation Using Deep Learning Techniques, pp. 135–161. Elsevier (2022) 35
2022
-
[30]
Medical image analysis69, 101950 (2021) 8, 28, 31, 36 Appendix 51
Kavur, A.E., Gezer, N.S., Barış, M., Aslan, S., Conze, P.H., Groza, V., Pham, D.D., Chatterjee, S., Ernst, P., Özkan, S., et al.: Chaos challenge-combined (ct- mr) healthy abdominal organ segmentation. Medical image analysis69, 101950 (2021) 8, 28, 31, 36 Appendix 51
2021
-
[31]
URL https: //doi.org/10.3115/v1/d14-1086
Kazemzadeh, S., Ordonez, V., Matten, M., Berg, T.: ReferItGame: Referring to objects in photographs of natural scenes. In: Moschitti, A., Pang, B., Daelemans, W. (eds.) Proceedings of the 2014 Conference on Empirical Methods in Natu- ral Language Processing (EMNLP). pp. 787–798. Association for Computational Linguistics, Doha, Qatar (Oct 2014).https://doi...
-
[32]
In: NeurIPS (2023) 4
Ke, L., Ye, M., Danelljan, M., Liu, Y., Tai, Y.W., Tang, C.K., Yu, F.: Segment anything in high quality. In: NeurIPS (2023) 4
2023
-
[33]
In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)
Kirillov, A., Mintun, E., Ravi, N., Mao, H., Rolland, C., Gustafson, L., Xiao, T., Whitehead, S., Berg, A.C., Lo, W.Y., Dollar, P., Girshick, R.: Segment anything. In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). pp. 4015–4026 (October 2023) 1, 3, 10, 17, 22
2023
-
[34]
In: 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Lai, X., Tian, Z., Chen, Y., Li, Y., Yuan, Y., Liu, S., Jia, J.: LISA: Rea- soning Segmentation via Large Language Model . In: 2024 IEEE/CVF Con- ference on Computer Vision and Pattern Recognition (CVPR). pp. 9579–9589. IEEE Computer Society, Los Alamitos, CA, USA (Jun 2024).https://doi.org/ 10.1109/CVPR52733.2024.00915,https://doi.ieeecomputersociety.org...
-
[35]
NeurIPS (2025) 4
Lee, S., Gwon, Y., Hoyer, L., Kwak, S.: Gara-sam: Robustifying segment anything model with gated-rank adaptation. NeurIPS (2025) 4
2025
-
[36]
Medical Physics52(2), 899–912 (2025) 4
Li, Y., Jing, B., Li, Z., Wang, J., Zhang, Y.: Plug-and-play segment anything model improves nnunet performance. Medical Physics52(2), 899–912 (2025) 4
2025
-
[37]
In: Proceedings of the IEEE Winter Conference on Applications of Com- puter Vision
Liew, J.H., Cohen, S., Price, B., Mai, L., Feng, J.: Deep interactive thin object selection. In: Proceedings of the IEEE Winter Conference on Applications of Com- puter Vision. pp. 305–314 (2021) 9, 19, 26
2021
-
[38]
IEEE Transactions on Pattern Analysis and Machine Intelligence42(2), 318–327 (2020) 8
Lin, T.Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence42(2), 318–327 (2020) 8
2020
-
[39]
In: Computer Vision– ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13
Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollár, P., Zitnick, C.L.: Microsoft coco: Common objects in context. In: Computer Vision– ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13. pp. 740–755. Springer (2014) 9, 19, 26
2014
- [40]
-
[41]
In: International Conference on Medical Image Computing and Computer-Assisted Intervention (2025) 4
Liu, H., Gao, M., Luo, X., Wang, Z., Qin, G., Wu, J., Jin, Y.: Resurgsam2: Re- ferring segment anything in surgical video via credible long-term tracking. In: International Conference on Medical Image Computing and Computer-Assisted Intervention (2025) 4
2025
-
[42]
In: Proceedings of the AAAI Conference on Artificial Intelligence (2022) 1
Liu,W.,Ren,G.,Yu,R.,Guo,S.,Zhu,J.,Zhang,L.:Image-adaptiveyoloforobject detection in adverse weather conditions. In: Proceedings of the AAAI Conference on Artificial Intelligence (2022) 1
2022
-
[43]
Nature Communications15, 654 (2024) 4, 10, 22
Ma, J., He, Y., Li, F., Han, L., You, C., Wang, B.: Segment anything in medical images. Nature Communications15, 654 (2024) 4, 10, 22
2024
-
[44]
arXiv preprint arXiv:2504.03600 (2025) 4
Ma, J., Yang, Z., Kim, S., Chen, B., Baharoon, M., Fallahpour, A., Asakereh, R., Lyu, H., Wang, B.: Medsam2: Segment anything in 3d medical images and videos. arXiv preprint arXiv:2504.03600 (2025) 4
-
[45]
Medical Image Analysis82, 102616 (2022) 8, 28, 31, 36 52 T
Ma, J., Zhang, Y., Gu, S., An, X., Wang, Z., Ge, C., Wang, C., Zhang, F., Wang, Y., Xu, Y., et al.: Fast and low-gpu-memory abdomen ct organ segmentation: the flare challenge. Medical Image Analysis82, 102616 (2022) 8, 28, 31, 36 52 T. Nguyen et al
2022
-
[46]
Computers in biology and medicine135, 104623 (2021) 8, 28, 31, 36
Marzola, F., Van Alfen, N., Doorduin, J., Meiburger, K.M.: Deep learning segmen- tation of transverse musculoskeletal ultrasound images for neuromuscular disease assessment. Computers in biology and medicine135, 104623 (2021) 8, 28, 31, 36
2021
-
[47]
Montoya, A., Sterling, D., Hasnin, kaggle446, shirzad, Cukierski, W., yffud: Ul- trasound nerve segmentation.https://kaggle.com/competitions/ultrasound- nerve-segmentation(2016), kaggle 8, 28, 31, 36
2016
-
[48]
Radiological physics and technology6(1), 130–141 (2013) 9
Mori, I., Machida, Y., Osanai, M., Iinuma, K.: Photon starvation artifacts of x- ray ct: their true cause and a solution. Radiological physics and technology6(1), 130–141 (2013) 9
2013
-
[49]
Proceedings of the AAAI Conference on Artificial Intelligence40(10), 8107–8115 (Mar 2026).https://doi.org/10.1609/ aaai.v40i10.3775725
Moskalenko, A., Kuznetsov, D., Dudko, I., Iasakova, A., Boldyrev, N., Shepelev, D., Spiridonov, A., Kuznetsov, A., Shakhuro, V.: Breps: Bounding-box robustness evaluation of promptable segmentation. Proceedings of the AAAI Conference on Artificial Intelligence40(10), 8107–8115 (Mar 2026).https://doi.org/10.1609/ aaai.v40i10.3775725
2026
-
[50]
Mendeley Data1(2024) 28, 32, 36
Nam, D., Panina, A., Pak, A.: Lung cancer segmentation dataset with lung-rads class. Mendeley Data1(2024) 28, 32, 36
2024
-
[51]
Pattern Recognition173, 112868 (2026) 1
Nguyen, T.D., Le, D.T.: Mode: A model-agnostic framework for object detection under adverse weather conditions. Pattern Recognition173, 112868 (2026) 1
2026
-
[52]
Oktay, O., Schlemper, J., Folgoc, L.L., Lee, M., Heinrich, M., Misawa, K., Mori, K., McDonagh, S., Hammerla, N.Y., Kainz, B., Glocker, B., Rueckert, D.: Attention u-net: Learning where to look for the pancreas (2018),https://arxiv.org/abs/ 1804.039997, 13
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[53]
In: 10th International symposium on medical information processing and analysis
Pedraza, L., Vargas, C., Narváez, F., Durán, O., Muñoz, E., Romero, E.: An open access thyroid ultrasound image database. In: 10th International symposium on medical information processing and analysis. vol. 9287, pp. 188–193. SPIE (2015) 8, 28, 31, 36
2015
-
[54]
Qiao, Y., Zhang, C., Kang, T., Kim, D., Zhang, C., Hong, C.S.: Robustness of sam: Segment anything under corruptions and beyond (2023),https://arxiv. org/abs/2306.077132
-
[55]
Magnetic resonance in medicine38(3), 429–439 (1997) 35
Reeder,S.B.,Atalar,E.,BolsterJr,B.D.,McVeigh,E.R.:Quantificationandreduc- tion of ghosting artifacts in interleaved echo-planar imaging. Magnetic resonance in medicine38(3), 429–439 (1997) 35
1997
-
[56]
In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Rombach, R., Blattmann, A., Lorenz, D., Esser, P., Ommer, B.: High-resolution image synthesis with latent diffusion models. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). pp. 10684– 10695 (June 2022) 2, 4
2022
-
[57]
Turkish Journal of Electrical Engineering and Computer Sciences29(2), 598–615 (2021) 35
Saken, M., YAĞCI, M.B., YUMUŞAK, N.: Impact of image segmentation tech- niques on celiac disease classification usingscale invariant texture descriptors for standard flexible endoscopic systems. Turkish Journal of Electrical Engineering and Computer Sciences29(2), 598–615 (2021) 35
2021
-
[58]
arXiv preprint arXiv:2210.03416 (2022) 8, 28, 31, 36
Seibold, C., Reiß, S., Sarfraz, S., Fink, M.A., Mayer, V., Sellner, J., Kim, M.S., Maier-Hein, K.H., Kleesiek, J., Stiefelhagen, R.: Detailed annotations of chest x- rays via ct projection for report understanding. arXiv preprint arXiv:2210.03416 (2022) 8, 28, 31, 36
-
[59]
IEEE transactions on pattern analysis and machine intelligence38(4), 717– 729 (2015) 9, 19, 26
Shi, J., Yan, Q., Xu, L., Jia, J.: Hierarchical image saliency detection on extended cssd. IEEE transactions on pattern analysis and machine intelligence38(4), 717– 729 (2015) 9, 19, 26
2015
-
[60]
Expert Systems with Applications107, 15–31 (2018) 8, 28, 31, 36 Appendix 53
Silva, G., Oliveira, L., Pithon, M.: Automatic segmenting teeth in x-ray images: Trends, a novel data set, benchmarking and future perspectives. Expert Systems with Applications107, 15–31 (2018) 8, 28, 31, 36 Appendix 53
2018
-
[61]
In: Konukoglu, E., Menze, B., Venkataraman, A., Baumgartner, C., Dou, Q., Albarqouni, S
Singla, R., Ringstrom, C., Hu, R., Lessoway, V., Reid, J., Rohling, R., Nguan, C.: Speckle and shadows: Ultrasound-specific physics-based data augmentation for kid- ney segmentation. In: Konukoglu, E., Menze, B., Venkataraman, A., Baumgartner, C., Dou, Q., Albarqouni, S. (eds.) Proceedings of The 5th International Confer- ence on Medical Imaging with Deep...
2022
-
[62]
Sudre, C.H., Li, W., Vercauteren, T.K.M., Ourselin, S., Cardoso, M.J.: Generalised dice overlap as a deep learning loss function for highly unbalanced segmentations. Deep learning in medical image analysis and multimodal learning for clinical deci- sion support : Third International Workshop, DLMIA 2017, and 7th International Workshop, ML-CDS 2017, held i...
2017
-
[63]
In: Belgrave, D., Zhang, C., Lin, H., Pascanu, R., Koniusz, P., Ghassemi, M., Chen, N
Tang, H., Xie, C.W., Wang, H., Bao, X., Weng, T., Li, P., Zheng, Y., Wang, L.: Ufo: A unified approach to fine-grained visual perception via open-ended language inter- face. In: Belgrave, D., Zhang, C., Lin, H., Pascanu, R., Koniusz, P., Ghassemi, M., Chen, N. (eds.) Advances in Neural Information Processing Systems. vol. 38, pp. 83761–83791. Curran Assoc...
2025
-
[64]
RadioGraphics39(4), 1017– 1018 (2019) 35
Triche, B.L., Nelson Jr, J.T., McGill, N.S., Porter, K.K., Sanyal, R., Tessler, F.N., McConathy, J.E., Gauntt, D.M., Yester, M.V., Singh, S.P.: Recognizing and mini- mizing artifacts at ct, mri, us, and molecular imaging. RadioGraphics39(4), 1017– 1018 (2019) 35
2019
-
[65]
International journal of computer assisted radiology and surgery15(2), 183–192 (2020) 8, 28, 31, 36
Vitale, S., Orlando, J.I., Iarussi, E., Larrabide, I.: Improving realism in patient- specific abdominal ultrasound simulation using cyclegans. International journal of computer assisted radiology and surgery15(2), 183–192 (2020) 8, 28, 31, 36
2020
- [66]
-
[67]
European Conference on Computer Vision (ECCV) (2024) 10, 23
Wong, H.E., Rakic, M., Guttag, J., Dalca, A.V.: Scribbleprompt: Fast and flex- ible interactive segmentation for any biomedical image. European Conference on Computer Vision (ECCV) (2024) 10, 23
2024
-
[68]
Medical Image Analysis102, 103547 (2025) 4
Wu, J., Wang, Z., Hong, M., Ji, W., Fu, H., Xu, Y., Xu, M., Jin, Y.: Medical sam adapter: Adapting segment anything model for medical image segmentation. Medical Image Analysis102, 103547 (2025) 4
2025
-
[69]
In: European conference on computer vision (ECCV) (2022) 22
Wu, K., Zhang, J., Peng, H., Liu, M., Xiao, B., Fu, J., Yuan, L.: Tinyvit: Fast pretraining distillation for small vision transformers. In: European conference on computer vision (ECCV) (2022) 22
2022
-
[70]
Applied Sciences11(3) (2021).https://doi.org/10.3390/ app11031127,https://www.mdpi.com/2076-3417/11/3/112735
Yasutomi,S.,Arakaki,T., Matsuoka,R.,Sakai,A., Komatsu,R.,Shozu,K.,Dozen, A., Machino, H., Asada, K., Kaneko, S., Sekizawa, A., Hamamoto, R., Komatsu, M.: Shadow estimation for ultrasound images using auto-encoding structures and synthetic shadows. Applied Sciences11(3) (2021).https://doi.org/10.3390/ app11031127,https://www.mdpi.com/2076-3417/11/3/112735
2021
-
[71]
In: ProceedingsoftheIEEE/CVFConferenceonComputerVisionandPatternRecog- nition (CVPR)
Zamfir, E., Wu, Z., Mehta, N., Tan, Y., Paudel, D.P., Zhang, Y., Timofte, R.: Complexity experts are task-discriminative learners for any image restoration. In: ProceedingsoftheIEEE/CVFConferenceonComputerVisionandPatternRecog- nition (CVPR). pp. 12753–12763 (June 2025) 10
2025
-
[72]
Faster Segment Anything: Towards Lightweight SAM for Mobile Applications
Zhang, C., Han, D., Qiao, Y., Kim, J.U., Bae, S.H., Lee, S., Hong, C.S.: Faster segment anything: Towards lightweight sam for mobile applications. arXiv preprint arXiv:2306.14289 (2023) 10, 22 54 T. Nguyen et al
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[73]
arXiv preprint arXiv:2207.06799 (2022) 8, 28, 31, 36
Zhao, Q., Lyu, S., Bai, W., Cai, L., Liu, B., Cheng, G., Wu, M., Sang, X., Yang, M., Chen, L.: Mmotu: A multi-modality ovarian tumor ultrasound im- age dataset for unsupervised cross-domain semantic segmentation. arXiv preprint arXiv:2207.06799 (2022) 8, 28, 31, 36
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.