hub Mixed citations

U-Net: Convolutional Networks for Biomedical Image Segmentation

Olaf Ronneberger, Philipp Fischer, Thomas Brox · 2015 · cs.CV · arXiv 1505.04597

Mixed citation behavior. Most common role is background (43%).

86 Pith papers citing it

Background 43% of classified citations

open full Pith review browse 86 citing papers arXiv PDF

abstract

There is large consent that successful training of deep networks requires many thousand annotated training samples. In this paper, we present a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently. The architecture consists of a contracting path to capture context and a symmetric expanding path that enables precise localization. We show that such a network can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks. Using the same network trained on transmitted light microscopy images (phase contrast and DIC) we won the ISBI cell tracking challenge 2015 in these categories by a large margin. Moreover, the network is fast. Segmentation of a 512x512 image takes less than a second on a recent GPU. The full implementation (based on Caffe) and the trained networks are available at http://lmb.informatik.uni-freiburg.de/people/ronneber/u-net .

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 11 method 6 baseline 3 dataset 1

citation-polarity summary

background 9 use method 6 baseline 3 unclear 2 use dataset 1

claims ledger

abstract There is large consent that successful training of deep networks requires many thousand annotated training samples. In this paper, we present a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently. The architecture consists of a contracting path to capture context and a symmetric expanding path that enables precise localization. We show that such a network can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segme

co-cited works

representative citing papers

LatentHDR: Decoupling Exposure from Diffusion via Conditional Latent-to-Latent Mapping for Text/Image-to-Panoramic HDR

cs.CV · 2026-05-11 · unverdicted · novelty 7.0

LatentHDR generates structurally consistent panoramic HDR images by producing one scene latent with a diffusion backbone then deterministically mapping it to multiple exposure latents via a lightweight conditional head.

EchoXFlow: A Beamspace Echocardiography Dataset for Cardiac Motion, Flow, and Function

cs.CV · 2026-05-06 · unverdicted · novelty 7.0

EchoXFlow is a new dataset of 37,125 beamspace echocardiography recordings with separable modalities, Doppler data, ECG, and clinical annotations that enables acquisition-aware learning not possible with standard scan-converted videos.

Generative diffusion models for spatiotemporal influenza forecasting

cs.LG · 2026-04-27 · unverdicted · novelty 7.0

Influpaint uses generative diffusion models on image-encoded influenza data to produce realistic and diverse epidemic trajectories that match leading ensemble methods in accuracy.

VitaminP: cross-modal learning enables whole-cell segmentation from routine histology

cs.CV · 2026-04-26 · unverdicted · novelty 7.0

VitaminP uses paired H&E-mIF data to train a model that transfers molecular boundary information, enabling accurate whole-cell segmentation directly from routine H&E histology across 34 cancer types.

Physics-informed, Generative Adversarial Design of Funicular Shells

cs.CE · 2026-04-17 · unverdicted · novelty 7.0

A modified DCGAN with an auxiliary discriminator using the membrane factor generates stable, previously unseen funicular shells optimized for pure compression in three dimensions.

Machine Learning Phase Field Reconstruction in a Bose-Einstein Condensate

cond-mat.quant-gas · 2026-04-10 · unverdicted · novelty 7.0

A U-Net-based ML pipeline reconstructs the complete phase field and quantized vortex charges in 2D Bose-Einstein condensates from density snapshots alone, using synthetic training data from projected Gross-Pitaevskii simulations.

Dual Triangle Attention: Effective Bidirectional Attention Without Positional Embeddings

q-bio.QM · 2026-04-09 · unverdicted · novelty 7.0

Dual Triangle Attention achieves effective bidirectional attention with built-in positional inductive bias via dual triangular masks, outperforming standard bidirectional attention on position-sensitive tasks and showing strong masked language modeling results with or without positional embeddings.

Diffusion Processes on Implicit Manifolds

cs.LG · 2026-04-08 · unverdicted · novelty 7.0 · 2 refs

Defines diffusion processes on implicit data manifolds via proximity-graph approximations to the infinitesimal generator and carré-du-champ operator, proves convergence in law to the continuous manifold process, and provides an Euler-Maruyama integrator validated on synthetic and MNIST manifolds.

Contour Refinement using Discrete Diffusion in Low Data Regime

cs.CV · 2026-02-05 · unverdicted · novelty 7.0

A CNN-based discrete diffusion method refines sparse contours from segmentation masks using simplified denoising steps and minimal post-processing, outperforming baselines on small medical and environmental datasets while running 3.5 times faster.

Radio-Interferometric Image Reconstruction with Denoising Diffusion Restoration Models

astro-ph.IM · 2026-01-22 · unverdicted · novelty 7.0

A diffusion model trained on real radio galaxy images reconstructs high-fidelity interferometric observations from VLA, EHT, and ALMA simulations and outperforms CLEAN on gridded visibilities.

SemanticBridge - A Dataset for 3D Semantic Segmentation of Bridges and Domain Gap Analysis

cs.CV · 2025-12-17 · unverdicted · novelty 7.0

SemanticBridge provides a new 3D dataset for bridge component segmentation and quantifies sensor-induced domain gaps that drop model performance by up to 11.4% mIoU.

Visual Diffusion Models are Geometric Solvers

cs.CV · 2025-10-24 · unverdicted · novelty 7.0

Standard visual diffusion models operating in pixel space can approximate solutions to the inscribed square, Steiner tree, and simple polygon problems.

Deep Learning for CMB Foreground Removal and Beam Deconvolution: A U-Net GAN Approach

astro-ph.IM · 2025-08-29 · unverdicted · novelty 7.0

A U-Net GAN reconstructs CMB T and E maps from Planck-like simulations with foregrounds and systematics, achieving under 1% error outside the Galactic region and demonstrating first-time correction for non-circular beams and asymmetric scans.

SinkSAM-Net: Knowledge-Driven Self-Supervised Sinkhole Segmentation Using Topographic Priors and Segment Anything Model

cs.CV · 2024-10-02 · unverdicted · novelty 7.0

SinkSAM-Net uses topographic priors and SAM with coordinate-wise bounding box jittering to create pseudo-labels for iterative self-supervised training of an EfficientNetV2-UNet, reaching about 95% of fully supervised performance on sinkhole datasets.

Normalizing flows for all-orders QED corrections in lattice field theory

hep-lat · 2026-05-21 · unverdicted · novelty 6.0

Normalizing flows enable all-order QED corrections in lattice scalar QED in 2-4 dimensions with reduced variance and transferability from small to large lattices.

Learning to Think in Physics: Breaking Shortcut Learning in Scientific Diffusion via Representation Alignment

cs.LG · 2026-05-20 · unverdicted · novelty 6.0

REPA-P aligns intermediate representations in diffusion models with physical states using first-principles PDE residuals to accelerate convergence and boost out-of-distribution robustness on PDE tasks.

SegRAG: Training-Free Retrieval-Augmented Semantic Segmentation

cs.CV · 2026-05-17 · unverdicted · novelty 6.0 · 2 refs

SegRAG is a training-free retrieval-augmented framework that extracts class-specific point prompts from a filtered DINOv3 feature bank to boost SAM3 semantic segmentation performance on standard and agricultural benchmarks.

A General B\'ezier Tree Encoding Counterfactual Framework for Retinal-Vessel-Mediated Disease Analysis

eess.IV · 2026-05-13 · unverdicted · novelty 6.0

BTECF encodes retinal vessels as Bézier trees to enable targeted, parameter-level counterfactual interventions on vessel geometry for causal analysis of vascular diseases.

EDGER: EDge-Guided with HEatmap Refinement for Generalizable Image Forgery Localization

cs.CV · 2026-05-12 · unverdicted · novelty 6.0

A dual-branch system using frequency edge cues and CLIP-based synthetic patch detection for accurate, resolution-independent image forgery localization.

Geometry-aware Prototype Learning for Cross-domain Few-shot Medical Image Segmentation

cs.CV · 2026-05-11 · unverdicted · novelty 6.0

GeoProto enriches appearance prototypes with geometric offsets from an ordinal shape branch to improve cross-domain few-shot medical image segmentation.

Don't Fix the Basis -- Learn It: Spectral Representation with Adaptive Basis Learning for PDEs

cs.LG · 2026-05-11 · unverdicted · novelty 6.0

ABLE learns a spatially adaptive Parseval frame from data via an ancillary density to replace fixed bases in spectral neural operators for PDEs.

Diffusion model for SU(N) gauge theories

hep-lat · 2026-05-07 · unverdicted · novelty 6.0

Implicit score matching trains diffusion models that successfully sample SU(3) Wilson gauge configurations on lattices, with a Hamiltonian-dynamics corrector needed for strong coupling.

Leveraging Image Generators to Address Training Data Scarcity: The Gen4Regen Dataset for Forest Regeneration Mapping

cs.CV · 2026-05-07 · conditional · novelty 6.0

Mixing real UAV imagery with 2101 AI-generated image-mask pairs improves semantic segmentation F1 scores for fine-grained forest species by over 15 percentage points overall and up to 30 points for rare classes.

A CNN--Transformer Denoiser for low-$S/N$ Galaxy Spectra: Stellar Population Recovery in Synthetic Tests

astro-ph.GA · 2026-05-06 · unverdicted · novelty 6.0

A hybrid CNN-Transformer denoiser trained on synthetic spectra substantially reduces noise and improves stellar population recovery for low-S/N galaxy observations in controlled tests.

citing papers explorer

Showing 36 of 86 citing papers.

Deep Learning for MRI Slice Interpolation: The Critical Role of Problem Formulation eess.IV · 2026-05-15 · unverdicted · none · ref 11 · internal anchor
Reformulating the input to adjacent slices for deep learning MRI interpolation yields 58% SSIM gains and 10.1% improvement over linear baseline, with problem formulation outweighing architecture choice.
Vision Transformer-Conditioned UNet for Domain-Adaptive Semantic Segmentation cs.CV · 2026-05-12 · unverdicted · none · ref 48 · internal anchor
ViTC-UNet adapts frozen ViT representations to biomedical semantic segmentation by conditioning a UNet via learnable tokens and two-way attention decoding.
Scalable Active Metamaterials for Shape-Morphing cs.CE · 2026-05-07 · unverdicted · none · ref 52 · internal anchor
A hierarchical SAM framework decouples macroscale mesh optimization from microscale inverse design to enable fast scalable creation of aperiodic shape-morphing metamaterials.
Full-chip CMP modelling based on Fully Convolutional Network leveraging White Light Interferometry cs.LG · 2026-05-06 · unverdicted · none · ref 8 · internal anchor
A fully convolutional network trained separately on WLI and AFM data predicts full-chip post-CMP nanotopography at nanometer accuracy.
Flow matching for Sentinel-2 super-resolution: implementation, application, and implications cs.CV · 2026-05-01 · unverdicted · none · ref 40 · internal anchor
Flow matching achieves single-step pixel accuracy and 20-step perceptual quality for Sentinel-2 super-resolution, outperforming diffusion and Real-ESRGAN while enabling large-scale 2.5 m land-cover products.
End-to-end Automated Deep Neural Network Optimization for PPG-based Blood Pressure Estimation on Wearables cs.LG · 2026-04-11 · unverdicted · none · ref 57 · internal anchor
An end-to-end hardware-aware optimization pipeline produces DNNs for PPG-based blood pressure estimation with up to 7.99% lower error and 83x fewer parameters that fit on ultra-low-power SoCs like GAP8.
RASALoRE: Region Aware Spatial Attention with Location-based Random Embeddings for Weakly Supervised Anomaly Detection in Brain MRI Scans cs.CV · 2025-10-09 · conditional · none · ref 30 · internal anchor
A novel weakly supervised anomaly detection method for brain MRI that uses discriminative dual prompt tuning for pseudo masks and region-aware spatial attention with location-based random embeddings to achieve SOTA results with under 8 million parameters on BraTS and MSD datasets.
Accuracy Improvement of Cell Image Segmentation Using Feedback Former cs.CV · 2024-08-23 · unverdicted · none · ref 24 · internal anchor
Feedback Former improves cell image segmentation accuracy by feeding detailed feature maps back from near the output to lower transformer layers, outperforming non-feedback baselines with lower computational cost on three datasets.
Online Inference and Detection of Curbs in Partially Occluded Scenes with Sparse LIDAR cs.RO · 2019-07-11 · unverdicted · none · ref 21 · internal anchor
Real-time deep network approach on 2D LIDAR bird's-eye views for detecting visible and occluded curbs with post-processing tracking.
The Ethical Dilemma when (not) Setting up Cost-based Decision Rules in Semantic Segmentation cs.CV · 2019-07-02 · unverdicted · none · ref 21 · internal anchor
Defining egoistic and altruistic cost functions for class confusions in semantic segmentation changes precision, recall, and segment-wise error rates relative to standard MAP decisions.
Physics-guided Convolutional Neural Network for Domain Growth Prediction in Systems with Conserved Kinetics cs.LG · 2026-06-09 · unverdicted · none · ref 38 · internal anchor
An attention-based physics-guided CNN surrogate is trained to predict long-time microstructural evolution under the Cahn-Hilliard equation for both critical and off-critical mixtures while preserving composition and matching Lifshitz-Slyozov domain growth.
World Action Models: The Next Frontier in Embodied AI cs.RO · 2026-05-12 · unverdicted · none · ref 34 · internal anchor
The paper introduces World Action Models as a new paradigm unifying predictive world modeling with action generation in embodied foundation models and provides a taxonomy of existing approaches.
Deep Learning-Based Segmentation of Peritoneal Cancer Index Regions from CT Imaging cs.CV · 2026-04-30 · unverdicted · none · ref 11 · internal anchor
nnU-Net segments rPCI regions on 62 CT scans with mean Dice 0.82, nearing inter-observer agreement of 0.88 and beating Swin UNETR at 0.76.
KAYRA: A Microservice Architecture for AI-Assisted Karyotyping with Cloud and On-Premise Deployment cs.LG · 2026-04-29 · unverdicted · none · ref 5 · internal anchor
KAYRA packages a cascade of EfficientNet-B5 + U-Net, Mask R-CNN, and ResNet-18 models into a microservice architecture that supports both cloud and on-premise deployment and reaches 98.91% segmentation accuracy in a pilot test on 459 chromosomes.
A Deep U-Net Framework for Flood Hazard Mapping Using Hydraulic Simulations of the Wupper Catchment cs.LG · 2026-04-22 · unverdicted · none · ref 21 · internal anchor
A U-Net surrogate model trained on hydraulic simulations predicts maximum water levels for flood hazard mapping in the Wupper catchment with results comparable to the original simulations.
A Wasserstein GAN-based climate scenario generator for risk management and insurance: the case of soil subsidence cs.LG · 2026-04-22 · unverdicted · none · ref 46 · internal anchor
A conditional Wasserstein GAN generates plausible future SWI drought trajectories for French insurance risk management under climate change.
Learning to count small and clustered objects with application to bacterial colonies cs.CV · 2026-04-21 · unverdicted · none · ref 71 · internal anchor
ACFamNet Pro reaches 9.64% mean normalized absolute error on bacterial colony images under 5-fold cross-validation, beating FamNet by 12.71%.
AI Approach for MRI-only Full-Spine Vertebral Segmentation and 3D Reconstruction in Paediatric Scoliosis cs.CV · 2026-04-20 · unverdicted · none · ref 27 · internal anchor
An AI pipeline using GAN-generated MRI-like images and U-Net segmentation produces automated 3D thoracolumbar spine reconstructions from MRI with 88% Dice score and reduces processing time from 1 hour to under 1 minute while preserving scoliosis deformity features.
DigiForest: Digital Analytics and Robotics for Sustainable Forestry cs.RO · 2026-04-16 · unverdicted · none · ref 44 · internal anchor
DigiForest integrates heterogeneous autonomous robots for data collection, automated tree trait extraction, a decision support system for growth forecasting, and autonomous harvesters for selective logging, with real-world tests in European forests.
AMO-ENE: Attention-based Multi-Omics Fusion Model for Outcome Prediction in Extra Nodal Extension and HPV-associated Oropharyngeal Cancer eess.IV · 2026-04-10 · unverdicted · none · ref 36 · internal anchor
An attention-based fusion model combining semi-supervised CT segmentation, radiomics, and clinical features predicts metastatic recurrence, overall survival, and disease-free survival in HPV+ oropharyngeal cancer with AUCs of 88.2%, 79.2%, and 78.1% on an internal cohort of 397 patients.
Uncertainty Estimation for Deep Reconstruction in Actuatic Disaster Scenarios with Autonomous Vehicles cs.RO · 2026-04-07 · unverdicted · none · ref 9 · internal anchor
Evidential Deep Learning outperforms other methods in accuracy, calibration, and speed for uncertainty-aware scalar field reconstruction in aquatic environments using autonomous vehicles.
SAGE-GAN: Towards Realistic and Robust Segmentation of Spatially Ordered Nanoparticles via Attention-Guided GANs cs.CV · 2026-04-04 · unverdicted · none · ref 18 · internal anchor
SAGE-GAN integrates a self-attention U-Net into a CycleGAN framework to generate realistic synthetic electron microscopy image-mask pairs that augment training data for nanoparticle segmentation without human labeling.
Optimizing Grasping in Legged Robots: A Deep Learning Approach to Loco-Manipulation cs.RO · 2025-08-24 · conditional · none · ref 11 · internal anchor
A U-Net-style CNN trained on synthetic multi-modal grasp data from the Genesis simulator enables a real quadruped robot to navigate to and precisely grasp objects in a loco-manipulation task.
Wildfire spread forecasting with Deep Learning cs.LG · 2025-05-23 · conditional · none · ref 48 · internal anchor
A deep learning framework forecasts final wildfire burned area extent from ignition-time data, with an ablation showing that a four-day pre- to five-day post-ignition temporal window improves F1 and IoU by nearly 5% over a single-day baseline on held-out Mediterranean test data.
Clinical utility of foundation models in musculoskeletal MRI for biomarker fidelity and predictive outcomes eess.IV · 2025-01-23 · unverdicted · none · ref 3 · internal anchor
Fine-tuned foundation models produce reliable MSK MRI biomarkers that support workload-reducing triage and calibrated 48-month prediction of knee replacement and incident OA.
Self-Adaptive 2D-3D Ensemble of Fully Convolutional Networks for Medical Image Segmentation eess.IV · 2019-07-26 · unverdicted · none · ref 3 · internal anchor
Self-adaptive 2D-3D FCN ensemble optimized by multiobjective evolution for prostate segmentation on PROMISE12 achieves top-10 ranking with smaller size than prior auto-designed models.
SAN: Scale-Aware Network for Semantic Segmentation of High-Resolution Aerial Images cs.CV · 2019-07-06 · unverdicted · none · ref 5 · internal anchor
SANet adds a re-sampling-based scale-aware module to semantic segmentation networks to better handle inconsistent object scales in aerial images.
Embedding Non-Distortive Cancelable Face Template Generation cs.CV · 2024-02-04 · unverdicted · none · ref 18 · internal anchor
Presents a non-distortive cancelable face template method via targeted image distortion that maintains identity signals for neural embedding models on MNIST and LFW data.
Blind Deblurring Using GANs eess.IV · 2019-07-27 · unverdicted · none · ref 13 · internal anchor
Modifications to GANs using non-local attention blocks, residual connections, combined losses, and edge feedback are proposed and tested for supervised blind image deblurring.
Machine Learning Techniques for Astrophysics and Cosmology: Lyman-$\alpha$ forest astro-ph.CO · 2026-05-21 · unverdicted · none · ref 183 · internal anchor
Review of machine learning applications for analyzing Lyman-alpha forest observations to probe cosmology, reionization, and dark matter.
Machine Learning as a Transformative Tool for (Exo-)Planetary Science astro-ph.EP · 2026-04-10 · unverdicted · none · ref 50 · internal anchor
The paper reviews ML applications for sequence modeling, pattern recognition, and generative Bayesian analysis to tackle heterogeneous data challenges in (exo)planetary science.
Enhanced Ionization Charge Identification in the Short-Baseline Neutrino Program Neutrino Detectors with Deep Neural Networks physics.ins-det · 2026-05-14 · unreviewed · ref 12 · internal anchor
REALISTA: Realistic Latent Adversarial Attacks that Elicit LLM Hallucinations cs.CL · 2026-05-12 · unreviewed · ref 29 · internal anchor
StereoPolicy: Improving Robotic Manipulation Policies via Stereo Perception cs.RO · 2026-05-11 · unreviewed · ref 88 · internal anchor
TRAS: An Interactive Software for Tracing Tree Ring Cross Sections cs.CV · 2026-05-08 · unreviewed · ref 17 · internal anchor
A theory of learning data statistics in diffusion models, from easy to hard stat.ML · 2026-03-13 · unreviewed · ref 22 · internal anchor

U-Net: Convolutional Networks for Biomedical Image Segmentation

hub tools

citation-role summary

citation-polarity summary

claims ledger

co-cited works

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer