U-Net: Convolutional Networks for Biomedical Image Segmentation

Olaf Ronneberger, Philipp Fischer, Thomas Brox · 2015 · cs.CV · arXiv 1505.04597

Mixed citation behavior. Most common role is background (43%).

111 Pith papers citing it

Background 43% of classified citations

abstract

There is large consent that successful training of deep networks requires many thousand annotated training samples. In this paper, we present a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently. The architecture consists of a contracting path to capture context and a symmetric expanding path that enables precise localization. We show that such a network can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks. Using the same network trained on transmitted light microscopy images (phase contrast and DIC) we won the ISBI cell tracking challenge 2015 in these categories by a large margin. Moreover, the network is fast. Segmentation of a 512x512 image takes less than a second on a recent GPU. The full implementation (based on Caffe) and the trained networks are available at http://lmb.informatik.uni-freiburg.de/people/ronneber/u-net .
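
The contracting and expanding paths described in the abstract can be illustrated by tracing feature-map sizes through the network. This is a minimal sketch, not the authors' Caffe implementation: the depth of four levels, two unpadded 3x3 convolutions per level, 2x2 pooling, and the 572-pixel example input are not stated on this page and are assumed from the configuration commonly described for the original architecture. The function name `unet_sizes` is hypothetical.

```python
def unet_sizes(input_size, depth=4, convs_per_level=2, kernel=3):
    """Trace spatial sizes through a U-shaped network with unpadded convolutions.

    Returns (output_size, crops), where crops lists (skip_size, upsampled_size)
    pairs for each step of the expanding path, deepest level first.
    All defaults are assumptions, not values taken from this page.
    """
    shrink = convs_per_level * (kernel - 1)  # pixels lost per level to unpadded convs
    size = input_size
    skips = []  # feature-map sizes saved on the contracting path

    # Contracting path: convolutions, remember the size, then 2x2 max pooling.
    for _ in range(depth):
        size -= shrink
        if size <= 0 or size % 2:
            raise ValueError(f"size {size} cannot be pooled 2x2 evenly")
        skips.append(size)
        size //= 2

    size -= shrink  # convolutions at the bottom of the "U"
    if size <= 0:
        raise ValueError("input too small for this depth")

    # Expanding path: 2x2 up-convolution, crop the skip map to match, convolve.
    crops = []
    for skip in reversed(skips):
        size *= 2
        crops.append((skip, size))  # skip map is center-cropped from skip to size
        size -= shrink

    return size, crops


if __name__ == "__main__":
    out, crops = unet_sizes(572)
    print("output size:", out)
    for skip, up in crops:
        print(f"crop skip map {skip} -> {up}")
```

Because the convolutions in this sketch are unpadded, the output map is smaller than the input, and each skip connection has to be cropped before it is concatenated with the upsampled features. Input sizes that leave an odd dimension before pooling raise an error rather than being silently rounded.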

citation-role summary

background: 11 · method: 6 · baseline: 3 · dataset: 1

citation-polarity summary

background: 9 · use method: 6 · baseline: 3 · unclear: 2 · use dataset: 1
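
The counts in the two summaries can be turned into percentage shares as a quick check on the headline figure. This is a small sketch that assumes the 21 citations listed in each summary are the full set of classified citations, which the page does not state; the helper name `shares` is hypothetical.

```python
def shares(counts):
    """Return each label's share of the total as a whole-number percentage."""
    total = sum(counts.values())
    return {label: round(100 * n / total) for label, n in counts.items()}


# Counts copied from the citation-role and citation-polarity summaries.
role_counts = {"background": 11, "method": 6, "baseline": 3, "dataset": 1}
polarity_counts = {
    "background": 9,
    "use method": 6,
    "baseline": 3,
    "unclear": 2,
    "use dataset": 1,
}

if __name__ == "__main__":
    print("roles:", shares(role_counts))
    print("polarities:", shares(polarity_counts))
```

Under that assumption, the 43% background share matches the polarity count (9 of 21), while the role count (11 of 21) works out to about 52%.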

authors

Olaf Ronneberger, Philipp Fischer, Thomas Brox

representative citing papers

From Phase to Phenomenon: Self-Supervised Learning of Subsurface Scattering with Minimal Phase-shift Inputs

cs.CV · 2026-06-28 · unverdicted · novelty 7.0

A self-supervised method pretrains an encoder on eight PSP images per view to learn generalizable subsurface scattering representations that transfer to relighting and dense footprint reconstruction on unseen complex objects.

Sampling the Schwinger Model with Gauge-Equivariant Diffusion

hep-lat · 2026-06-25 · unverdicted · novelty 7.0

A gauge-equivariant diffusion model samples Schwinger model configurations, yielding unbiased observables matching MCMC and qualitatively less topological freezing than HMC.

Inverse Critical Experiment Design via Gradient Optimization and a Multigroup Attention-Based Neural Network Architecture

cs.LG · 2026-06-01 · unverdicted · novelty 7.0

A U-Net surrogate with multigroup attention pooling is trained on OpenMC sensitivity data and combined with gradient optimization to generate grid-based critical experiment geometries that achieve c_k values up to 0.97757 for HALEU fuel validation.

Field-level multi-tracers simulation-based inference of cosmological parameters from 3D maps

astro-ph.CO · 2026-05-25 · unverdicted · novelty 7.0

The work demonstrates that multi-tracer field-level SBI on galaxy and HI maps yields 2-7 times better constraints on Omega_m and sigma_8 than single-tracer or summary-statistic approaches, with 3D maps performing best.

LatentHDR: Decoupling Exposure from Diffusion via Conditional Latent-to-Latent Mapping for Text/Image-to-Panoramic HDR

cs.CV · 2026-05-11 · unverdicted · novelty 7.0

LatentHDR generates structurally consistent panoramic HDR images by producing one scene latent with a diffusion backbone then deterministically mapping it to multiple exposure latents via a lightweight conditional head.

EchoXFlow: A Beamspace Echocardiography Dataset for Cardiac Motion, Flow, and Function

cs.CV · 2026-05-06 · unverdicted · novelty 7.0

EchoXFlow is a new dataset of 37,125 beamspace echocardiography recordings with separable modalities, Doppler data, ECG, and clinical annotations that enables acquisition-aware learning not possible with standard scan-converted videos.

Generative diffusion models for spatiotemporal influenza forecasting

cs.LG · 2026-04-27 · unverdicted · novelty 7.0

Influpaint uses generative diffusion models on image-encoded influenza data to produce realistic and diverse epidemic trajectories that match leading ensemble methods in accuracy.

VitaminP: cross-modal learning enables whole-cell segmentation from routine histology

cs.CV · 2026-04-26 · unverdicted · novelty 7.0

VitaminP uses paired H&E-mIF data to train a model that transfers molecular boundary information, enabling accurate whole-cell segmentation directly from routine H&E histology across 34 cancer types.

Physics-informed, Generative Adversarial Design of Funicular Shells

cs.CE · 2026-04-17 · unverdicted · novelty 7.0

A modified DCGAN with an auxiliary discriminator using the membrane factor generates stable, previously unseen funicular shells optimized for pure compression in three dimensions.

Machine Learning Phase Field Reconstruction in a Bose-Einstein Condensate

cond-mat.quant-gas · 2026-04-10 · unverdicted · novelty 7.0

A U-Net-based ML pipeline reconstructs the complete phase field and quantized vortex charges in 2D Bose-Einstein condensates from density snapshots alone, using synthetic training data from projected Gross-Pitaevskii simulations.

Dual Triangle Attention: Effective Bidirectional Attention Without Positional Embeddings

q-bio.QM · 2026-04-09 · unverdicted · novelty 7.0

Dual Triangle Attention achieves effective bidirectional attention with built-in positional inductive bias via dual triangular masks, outperforming standard bidirectional attention on position-sensitive tasks and showing strong masked language modeling results with or without positional embeddings.

Diffusion Processes on Implicit Manifolds

cs.LG · 2026-04-08 · unverdicted · novelty 7.0 · 2 refs

Defines diffusion processes on implicit data manifolds via proximity-graph approximations to the infinitesimal generator and carré-du-champ operator, proves convergence in law to the continuous manifold process, and provides an Euler-Maruyama integrator validated on synthetic and MNIST manifolds.

Contour Refinement using Discrete Diffusion in Low Data Regime

cs.CV · 2026-02-05 · unverdicted · novelty 7.0

A CNN-based discrete diffusion method refines sparse contours from segmentation masks using simplified denoising steps and minimal post-processing, outperforming baselines on small medical and environmental datasets while running 3.5 times faster.

Radio-Interferometric Image Reconstruction with Denoising Diffusion Restoration Models

astro-ph.IM · 2026-01-22 · unverdicted · novelty 7.0

A diffusion model trained on real radio galaxy images reconstructs high-fidelity interferometric observations from VLA, EHT, and ALMA simulations and outperforms CLEAN on gridded visibilities.

SemanticBridge - A Dataset for 3D Semantic Segmentation of Bridges and Domain Gap Analysis

cs.CV · 2025-12-17 · unverdicted · novelty 7.0

SemanticBridge provides a new 3D dataset for bridge component segmentation and quantifies sensor-induced domain gaps that drop model performance by up to 11.4% mIoU.

Visual Diffusion Models are Geometric Solvers

cs.CV · 2025-10-24 · unverdicted · novelty 7.0

Standard visual diffusion models operating in pixel space can approximate solutions to the inscribed square, Steiner tree, and simple polygon problems.

Deep Learning for CMB Foreground Removal and Beam Deconvolution: A U-Net GAN Approach

astro-ph.IM · 2025-08-29 · unverdicted · novelty 7.0

A U-Net GAN reconstructs CMB T and E maps from Planck-like simulations with foregrounds and systematics, achieving under 1% error outside the Galactic region and demonstrating first-time correction for non-circular beams and asymmetric scans.

SinkSAM-Net: Knowledge-Driven Self-Supervised Sinkhole Segmentation Using Topographic Priors and Segment Anything Model

cs.CV · 2024-10-02 · unverdicted · novelty 7.0

SinkSAM-Net uses topographic priors and SAM with coordinate-wise bounding box jittering to create pseudo-labels for iterative self-supervised training of an EfficientNetV2-UNet, reaching about 95% of fully supervised performance on sinkhole datasets.

Does Your ViT Still Need U-Net for Segmentation?

cs.CV · 2026-06-30 · unverdicted · novelty 6.0

EoSeg shows that modern ViT backbones support accurate medical image segmentation without U-Net-style decoders via multi-level query modeling and learnable block fusion, with strong results on seven benchmarks.

X-Mind: Efficient Visual Chain-of-Thought via Predictive World Model for End-to-End Driving

cs.CV · 2026-06-27 · unverdicted · novelty 6.0

X-Mind proposes an efficient internal visual chain-of-thought using compressed BEV sketches and recurrent block diffusion to embed predictive world models into end-to-end driving policies.

A Simulation Platform for Flapping-Wing Vehicles

cs.RO · 2026-06-01 · unverdicted · novelty 6.0

FWAV-Sim is a high-fidelity Unity simulation framework for flapping-wing vehicles that integrates blade-element aerodynamics with bluff-body drag, spatiotemporally correlated fractal turbulence, and realistic IMU/LiDAR/RGB sensor models to support autonomy development.

MoRE: A Mixture-of-Experts-Based Task-Adaptive End-to-End Network for Multimodal MRI Reconstruction

eess.IV · 2026-06-01 · unverdicted · novelty 6.0

MoRE integrates a sparsely activated MoE module with unsupervised routing into a variational network for stable multimodal MRI reconstruction on fastMRI brain and knee data at 8x undersampling.

Emergent Transfer of a Physics Foundation Model from Simulation to Laboratory Turbulence

physics.flu-dyn · 2026-05-31 · unverdicted · novelty 6.0

Finetuned physics foundation model generalizes zero-shot from few DNS runs to laboratory RTI data, matching experimental mixing growth rates and handling unseen stable stratification.

21cmEMUv3: a hybrid diffusion-LSTM emulator of 21cmFAST summary observables

astro-ph.CO · 2026-05-29 · unverdicted · novelty 6.0

21cmEMUv3 emulates the cylindrical 21cm power spectrum via score-based diffusion and six other 21cmFAST observables via LSTM networks at sub-percent accuracy, then uses the emulator to infer a lower limit on soft-band X-ray luminosity from HERA data.

citing papers explorer

Showing 11 of 11 citing papers after filters.

SemanticBridge - A Dataset for 3D Semantic Segmentation of Bridges and Domain Gap Analysis

cs.CV · 2025-12-17 · unverdicted · none · ref 69 · internal anchor

SemanticBridge provides a new 3D dataset for bridge component segmentation and quantifies sensor-induced domain gaps that drop model performance by up to 11.4% mIoU.

Visual Diffusion Models are Geometric Solvers

cs.CV · 2025-10-24 · unverdicted · none · ref 38 · internal anchor

Standard visual diffusion models operating in pixel space can approximate solutions to the inscribed square, Steiner tree, and simple polygon problems.

Deep Learning for CMB Foreground Removal and Beam Deconvolution: A U-Net GAN Approach

astro-ph.IM · 2025-08-29 · unverdicted · none · ref 30 · internal anchor

A U-Net GAN reconstructs CMB T and E maps from Planck-like simulations with foregrounds and systematics, achieving under 1% error outside the Galactic region and demonstrating first-time correction for non-circular beams and asymmetric scans.

Forecasting implied volatility surface with generative diffusion models

q-fin.CP · 2025-11-10 · unverdicted · none · ref 14 · 2 links · internal anchor

A conditioned diffusion model with SNR-weighted arbitrage penalty generates one-day-ahead arbitrage-free implied volatility surfaces and outperforms baselines on market data.

Recovering Sub-threshold S-wave Arrivals in Deep Learning Phase Pickers via Shape-Aware Loss

physics.geo-ph · 2025-11-10 · unverdicted · none · ref 14 · internal anchor

A shape-aware loss strategy recovers sub-threshold S-wave arrivals in deep learning seismic phase pickers by treating labels as coherent shapes, achieving a 64% increase in effective detections.

DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions

cs.LG · 2025-09-23 · unverdicted · none · ref 24 · internal anchor

DAWM introduces a modular diffusion world model with an inverse dynamics model to produce complete synthetic transitions that improve conservative offline RL algorithms like TD3BC and IQL on D4RL tasks.

Flow marching for a generative PDE foundation model

cs.LG · 2025-09-23 · unverdicted · none · ref 53 · internal anchor

Flow Marching jointly samples noise and physical time to learn a velocity field for generative PDE modeling, paired with a latent autoencoder and efficient transformer for large-scale pretraining on 2.5M trajectories.

RASALoRE: Region Aware Spatial Attention with Location-based Random Embeddings for Weakly Supervised Anomaly Detection in Brain MRI Scans

cs.CV · 2025-10-09 · conditional · none · ref 30 · internal anchor

A novel weakly supervised anomaly detection method for brain MRI that uses discriminative dual prompt tuning for pseudo masks and region-aware spatial attention with location-based random embeddings to achieve SOTA results with under 8 million parameters on BraTS and MSD datasets.

Optimizing Grasping in Legged Robots: A Deep Learning Approach to Loco-Manipulation

cs.RO · 2025-08-24 · conditional · none · ref 11 · internal anchor

A U-Net-style CNN trained on synthetic multi-modal grasp data from the Genesis simulator enables a real quadruped robot to navigate to and precisely grasp objects in a loco-manipulation task.

Wildfire spread forecasting with Deep Learning

cs.LG · 2025-05-23 · conditional · none · ref 48 · internal anchor

A deep learning framework forecasts final wildfire burned area extent from ignition-time data, with an ablation showing that a four-day pre- to five-day post-ignition temporal window improves F1 and IoU by nearly 5% over a single-day baseline on held-out Mediterranean test data.

Clinical utility of foundation models in musculoskeletal MRI for biomarker fidelity and predictive outcomes

eess.IV · 2025-01-23 · unverdicted · none · ref 3 · internal anchor

Fine-tuned foundation models produce reliable MSK MRI biomarkers that support workload-reducing triage and calibrated 48-month prediction of knee replacement and incident OA.
