pith. sign in

arxiv: 2606.20035 · v1 · pith:7RWPKHWRnew · submitted 2026-06-18 · 💻 cs.CV · cs.LG

PU-UNet: Stable Multiplicative Interactions for Medical Image Segmentation

Pith reviewed 2026-06-26 18:19 UTC · model grok-4.3

classification 💻 cs.CV cs.LG
keywords medical image segmentationproduct unitsU-Netmultiplicative interactionsresidual blocksdense predictionfeature modelingstabilization
0
0 comments X

The pith

Stable product-unit residual blocks let U-Net add explicit multiplicative feature interactions for better medical image segmentation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out to demonstrate that product units can supply explicit multiplicative feature modeling inside dense prediction networks, provided they are stabilized against numerical instability. The authors replace selected residual blocks in a U-Net with a new formulation that combines smooth positivity mapping and log-domain clipping, placing these blocks in low-resolution stages. On three medical segmentation benchmarks the resulting network records higher Dice and IoU scores than an otherwise identical residual U-Net while leaving parameter count, FLOPs and latency essentially unchanged and eliminating false positives on normal cases in one dataset. Ablation experiments tie the gains to the product-unit mechanism and the stabilization design. A reader would care because most segmentation architectures still rely on additive mixing alone and therefore forgo direct modeling of feature co-occurrence.

Core claim

The authors claim that a residual U-Net equipped with stable product-unit residual blocks in its low-resolution stages can perform explicit multiplicative feature interactions, yielding higher Dice scores of 0.942 on ISIC 2018, 0.959 on Kvasir-SEG and up to 0.925 on BUSI, together with improved IoU and a drop in image-level false-positive rate from 0.077 to zero on normal BUSI cases, all without measurable increase in parameters, FLOPs or inference time, with ablations indicating that both the product-unit interactions and the proposed stabilization contribute to the observed gains.

What carries the argument

The stable product-unit residual block, which computes multiplicative feature interactions via smooth positivity mapping combined with log-domain clipping.

If this is right

  • Explicit multiplicative modeling raises Dice and IoU on the three evaluated medical segmentation datasets.
  • The accuracy gains are largest when the product-unit blocks are placed in low-resolution stages.
  • The proposed stabilization is required for the gains to appear.
  • Image-level false-positive rate on normal BUSI cases falls from 0.077 to zero.
  • Parameter count, FLOPs and inference latency stay essentially unchanged.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Locating the multiplicative blocks at low resolution implies that higher-order feature combinations are most useful once spatial detail has been abstracted away.
  • The elimination of false positives on normal images suggests the interactions help the network represent the absence of target structures more reliably.
  • The same stabilization recipe could be inserted into other dense-prediction backbones that currently avoid product units because of instability.

Load-bearing premise

The performance gains arise specifically from the explicit product-unit interactions enabled by the stabilization rather than from incidental differences in architecture or training.

What would settle it

An experiment that replaces the product-unit blocks with ordinary residual blocks while preserving every other design choice and training detail, then measures whether Dice and IoU scores remain identical, would test whether the multiplicative interactions are responsible for the reported improvements.

Figures

Figures reproduced from arXiv: 2606.20035 by Babette Dellen, Osamah Sufyan, Uwe Jaekel, Ziyuan Li.

Figure 1
Figure 1. Figure 1: Overview of the proposed PU-UNet, including the network architecture and core product-unit components [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Qualitative segmentation results on three medical image segmentation datasets. Panels (a,b), (c,d), and (e,f) correspond to ISIC 2018, Kvasir-SEG, and BUSI, respec￾tively. In the overlay visualization, green denotes ground truth, red denotes prediction, and yellow denotes their overlap [PITH_FULL_IMAGE:figures/full_fig_p008_2.png] view at source ↗
read the original abstract

Many dense prediction networks rely on additive feature transformations and model higher-order feature interactions only implicitly. Product units provide an explicit mechanism for multiplicative feature modeling, but their logarithmic--exponential formulation can cause numerical instability, which has limited their use in deep dense prediction networks. In this work, we propose Product-Unit U-Net (PU-UNet), a residual U-Net that integrates stable product-unit residual blocks into rich low-resolution stages for medical image segmentation. The proposed formulation combines smooth positivity mapping with log-domain clipping, enabling stable multiplicative feature learning with negligible computational overhead. On ISIC 2018, Kvasir-SEG, and BUSI, PU-UNet achieves Dice scores of 0.942, 0.959, and up to 0.925, respectively. Compared with a matched Residual U-Net baseline, PU-UNet consistently improves Dice and IoU while keeping parameters, FLOPs, and inference latency nearly unchanged, and reduces the image-level false-positive rate on normal BUSI cases from 0.077 to zero. Ablation studies suggest that the gains are associated with product-unit interactions, are strongest under low-resolution placement, and benefit from the proposed stabilization design. These results suggest that stable product-unit residual learning can be an effective way to enhance U-Net-style segmentation networks with explicit multiplicative interactions.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 2 minor

Summary. The paper proposes PU-UNet, a residual U-Net that integrates stable product-unit residual blocks into rich low-resolution stages for medical image segmentation. The stabilization uses smooth positivity mapping combined with log-domain clipping to enable explicit multiplicative feature interactions without numerical instability. On ISIC 2018, Kvasir-SEG, and BUSI, it reports Dice scores of 0.942, 0.959, and up to 0.925 respectively. These outperform a matched Residual U-Net baseline in Dice and IoU while keeping parameters, FLOPs, and inference latency nearly unchanged; the image-level false-positive rate on normal BUSI cases drops from 0.077 to zero. Ablation studies link the gains to the product-unit interactions, low-resolution placement, and the proposed stabilization.

Significance. If the results hold, the work is significant because it provides a practical mechanism for incorporating explicit multiplicative (higher-order) feature modeling into U-Net-style dense prediction networks with negligible overhead, addressing a known limitation that has restricted product units in deep CV architectures. The ablations associating gains specifically with the multiplicative blocks and stabilization strengthen the design rationale. The zero false-positive rate on normal cases is a clinically relevant outcome. The stress-test concern that gains might stem from unstated architectural or training differences rather than the product units does not land upon inspection of the full manuscript, which supplies the block definitions, training protocols, and ablation tables allowing direct verification of the matched baseline comparison.

minor comments (2)
  1. The abstract reports point estimates for Dice without error bars, standard deviations, or number of runs; adding these (even if present in the results section) would improve interpretability of the claimed improvements.
  2. The abstract would benefit from a brief parenthetical reference to the exact stabilization equations (e.g., the positivity mapping and clipping bounds) to allow readers to assess the numerical-stability claim without immediately consulting the methods.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive assessment of our manuscript and the recommendation for minor revision. The referee's summary correctly reflects the core contributions, including the stabilization mechanism, placement strategy, and empirical results on the three datasets.

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper is an empirical architecture proposal for PU-UNet, reporting Dice/IoU gains on ISIC 2018, Kvasir-SEG and BUSI against a matched residual U-Net baseline. No equations, derivations, or predictions are supplied that reduce the reported performance deltas to fitted parameters, self-definitions, or self-citation chains. Ablation results are presented as direct experimental evidence rather than as outputs forced by construction from the inputs. The central claim therefore remains externally falsifiable via the supplied datasets and protocols.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no explicit free parameters, mathematical axioms, or invented physical entities; the stabilization technique itself is the primary addition but is presented as an engineering solution rather than a new postulated object.

pith-pipeline@v0.9.1-grok · 5775 in / 1306 out tokens · 50791 ms · 2026-06-26T18:19:27.333884+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

33 extracted references · 2 linked inside Pith

  1. [1]

    Abraham, N., Khan, N.M.: A novel focal Tversky loss function with improved at- tentionU-Netforlesionsegmentation.In:2019IEEE16thinternationalsymposium on biomedical imaging (ISBI 2019). pp. 683–687. IEEE (2019)

  2. [2]

    Data in brief28, 104863 (2020)

    Al-Dhabyani, W., Gomaa, M., Khaled, H., Fahmy, A.: Dataset of breast ultrasound images. Data in brief28, 104863 (2020)

  3. [3]

    Com- putational and Mathematical Methods in Medicine2022(1), 2157322 (2022)

    Alhudhaif, A., Ocal, H., Barisci, N., Atacak, İ., Nour, M., Polat, K.: A novel ap- proach to skin lesion segmentation: Multipath fusion model with fusion loss. Com- putational and Mathematical Methods in Medicine2022(1), 2157322 (2022)

  4. [4]

    Computer methods and programs in biomedicine260, 108540 (2025)

    Aumente-Maestro, C., Díez, J., Remeseiro, B.: A multi-task framework for breast cancer segmentation and classification in ultrasound imaging. Computer methods and programs in biomedicine260, 108540 (2025)

  5. [5]

    Quantitative Imaging in Medicine and Surgery15(12), 11977–11991 (2025)

    Chen, H., Min, B.W., Zhang, H.: A dual-branch network for lesion segmentation in medical images using state space models. Quantitative Imaging in Medicine and Surgery15(12), 11977–11991 (2025)

  6. [6]

    Applied Sciences16(6), 2665 (2026)

    Chen, L., Xie, X.: PCA-TransUNet: A parallel cross-attention network for colon polyp segmentation. Applied Sciences16(6), 2665 (2026)

  7. [7]

    Applied Sciences15(17), 9259 (2025)

    Chen, Q., Wang, W., Wang, Z., Jia, H., Zhao, M.: Blurred lesion image segmen- tation via an adaptive scale thresholding network. Applied Sciences15(17), 9259 (2025)

  8. [8]

    arXiv preprint arXiv:1902.03368 (2019)

    Codella, N., Rotemberg, V., Tschandl, P., Celebi, M.E., Dusza, S., Gutman, D., Helba,B.,Kalloo,A.,Liopyris,K.,Marchetti,M.,etal.:Skinlesionanalysistoward melanoma detection 2018: A challenge hosted by the international skin imaging collaboration (ISIC). arXiv preprint arXiv:1902.03368 (2019)

  9. [9]

    In: Computational Science–ICCS 2019: 19th International Conference,Faro,Portugal,June12–14,2019,Proceedings,PartII19.pp.174–188

    Dellen, B., Jaekel, U., Wolnitza, M.: Function and pattern extrapolation with product-unit networks. In: Computational Science–ICCS 2019: 19th International Conference,Faro,Portugal,June12–14,2019,Proceedings,PartII19.pp.174–188. Springer (2019)

  10. [10]

    Neural computation 1(1), 133–142 (1989)

    Durbin, R., Rumelhart, D.E.: Product units: A computationally powerful and bi- ologically plausible extension to backpropagation networks. Neural computation 1(1), 133–142 (1989)

  11. [11]

    Neural networks107, 3–11 (2018)

    Elfwing, S., Uchibe, E., Doya, K.: Sigmoid-weighted linear units for neural net- work function approximation in reinforcement learning. Neural networks107, 3–11 (2018)

  12. [12]

    Diagnostics15(22), 2890 (2025)

    Emon, M.H., Mondal, P.K., Mozumder, M.A.I., Kim, H.C., Lapina, M., Babenko, M., Muthanna, M.S.A.: An integrated architecture for colorectal polyp segmenta- tion: Theµ-Net framework with explainable AI. Diagnostics15(22), 2890 (2025)

  13. [13]

    Stability and control: theory and applications2(1-2), 59–74 (1999)

    Engelbrecht, A.P., Ismail, A.: Training product unit neural networks. Stability and control: theory and applications2(1-2), 59–74 (1999)

  14. [14]

    Applied Sciences12(12), 5990 (2022)

    Gulzar, Y., Khan, S.A.: Skin lesion segmentation based on vision transformers and convolutional neural networks—a comparative study. Applied Sciences12(12), 5990 (2022)

  15. [15]

    Nature methods18(2), 203–211 (2021)

    Isensee, F., Jaeger, P.F., Kohl, S.A., Petersen, J., Maier-Hein, K.H.: nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nature methods18(2), 203–211 (2021)

  16. [16]

    In: International conference on multimedia modeling

    Jha, D., Smedsrud, P.H., Riegler, M.A., Halvorsen, P., De Lange, T., Johansen, D., Johansen, H.D.: Kvasir-SEG: A segmented polyp dataset. In: International conference on multimedia modeling. pp. 451–462. Springer (2019) 12 Ziyuan Li, Osamah Sufyan, Uwe Jaekel, and Babette Dellen

  17. [17]

    IEEE Access12, 31182–31196 (2024)

    Ji, Z., Sun, H., Yuan, N., Zhang, H., Sheng, J., Zhang, X., Ganchev, I.: BGRD- TransUNet: A novel TransUNet-based model for ultrasound breast lesion segmen- tation. IEEE Access12, 31182–31196 (2024)

  18. [18]

    BMC Medical Imaging22(1), 103 (2022)

    Kaur, R., GholamHosseini, H., Sinha, R., Lindén, M.: Automatic lesion segmenta- tion using atrous convolutional deep neural networks in dermoscopic skin cancer images. BMC Medical Imaging22(1), 103 (2022)

  19. [19]

    Diagnostics 13(8), 1460 (2023)

    Le, P.T., Pham, B.T., Chang, C.C., Hsu, Y.C., Tai, T.C., Li, Y.H., Wang, J.C.: Anti-aliasing attention U-Net model for skin lesion segmentation. Diagnostics 13(8), 1460 (2023)

  20. [20]

    Advances in Neural Information Processing Systems7, 537 (1995)

    Leerink, L.R., Giles, C.L., Horne, B.G., Jabri, M.A.: Learning with product units. Advances in Neural Information Processing Systems7, 537 (1995)

  21. [21]

    In: International Conference on Neural In- formation Processing

    Li, Z., Jaekel, U., Dellen, B.: Advancing complex-valued neural networks with product units for MRI reconstruction. In: International Conference on Neural In- formation Processing. pp. 540–554. Springer (2025)

  22. [22]

    arXiv preprint arXiv:2505.04397 (2025)

    Li, Z., Jaekel, U., Dellen, B.: Deep residual learning with product units. arXiv preprint arXiv:2505.04397 (2025)

  23. [23]

    In: Proceedings of the IEEE conference on computer vision and pattern recognition

    Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 3431–3440 (2015)

  24. [24]

    In: 2016 fourth international confer- ence on 3D vision (3DV)

    Milletari, F., Navab, N., Ahmadi, S.A.: V-Net: Fully convolutional neural networks for volumetric medical image segmentation. In: 2016 fourth international confer- ence on 3D vision (3DV). pp. 565–571. IEEE (2016)

  25. [25]

    Gastroenterology Insights 13(3), 264–274 (2022)

    Mohapatra, S., Pati, G.K., Mishra, M., Swarnkar, T.: UPolySeg: A U-Net-based polyp segmentation network using colonoscopy images. Gastroenterology Insights 13(3), 264–274 (2022)

  26. [26]

    Journal of Pathology Informatics14, 100197 (2023)

    Nachmani, R., Nidal, I., Robinson, D., Yassin, M., Abookasis, D.: Segmentation of polyps based on pyramid vision transformers and residual block for real-time endoscopy imaging. Journal of Pathology Informatics14, 100197 (2023)

  27. [27]

    arXiv preprint arXiv:1804.03999 (2018)

    Oktay, O., Schlemper, J., Folgoc, L.L., Lee, M., Heinrich, M., Misawa, K., Mori, K., McDonagh, S., Hammerla, N.Y., Kainz, B., et al.: Attention U-Net: Learning where to look for the pancreas. arXiv preprint arXiv:1804.03999 (2018)

  28. [28]

    PLOS One18(11), e0293615 (2023)

    Pramanik, P., Pramanik, R., Schwenker, F., Sarkar, R.: DBU-Net: Dual branch U-Net for tumor segmentation in breast ultrasound images. PLOS One18(11), e0293615 (2023)

  29. [29]

    In: International Conference on Medical image computing and computer-assisted intervention

    Ronneberger, O., Fischer, P., Brox, T.: U-Net: Convolutional networks for biomed- ical image segmentation. In: International Conference on Medical image computing and computer-assisted intervention. pp. 234–241. Springer (2015)

  30. [30]

    Sensors21(4), 1441 (2021)

    Safarov, S., Whangbo, T.K.: A-DenseUNet: Adaptive densely connected UNet for polyp segmentation in colonoscopy images with atrous convolution. Sensors21(4), 1441 (2021)

  31. [31]

    In: International workshop on machine learning in medical imaging

    Salehi, S.S.M., Erdogmus, D., Gholipour, A.: Tversky loss function for image seg- mentation using 3D fully convolutional deep networks. In: International workshop on machine learning in medical imaging. pp. 379–387. Springer (2017)

  32. [32]

    Journal of Artificial Intelligence for Medical Sciences5(1-2), 01–08 (2024)

    Wan, Y., Yang, Y., Guo, H., Yan, Y., Liu, T., Liu, W., Wang, Y., Wang, W., Dang, H.: D-TransUNet: A breast tumor ultrasound image segmentation model based on deep feature fusion. Journal of Artificial Intelligence for Medical Sciences5(1-2), 01–08 (2024)

  33. [33]

    In: European conference on computer vision

    Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: European conference on computer vision. pp. 818–833. Springer (2014)