Mixed citations

Title resolution pending

Olaf Ronneberger, Philipp Fischer, Thomas Brox · 2015 · Lecture Notes in Computer Science · DOI 10.1007/978-3-319-24574-4_28

Mixed citation behavior. Most common role is background (50%).

47 Pith papers citing it

56.5k external citations · Crossref

Background 50% of classified citations

open at publisher browse 47 citing papers more from Olaf Ronneberger

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 3 method 2 baseline 1

citation-polarity summary

background 3 use method 2 baseline 1

authors

Olaf Ronneberger Philipp Fischer Thomas Brox

co-cited works

representative citing papers

Text Dictates, Music Decorates: Energy-based Attention for Editable Dance Motion Generation

cs.AI · 2026-06-22 · unverdicted · novelty 7.0

STREAM decouples text and music conditioning in a diffusion transformer via AdaLN for structure and BEAM for beats, plus new Motorica++ dataset and editability metrics, claiming SOTA music alignment with preserved semantics.

GPROF-IR: An Improved Single-Channel Infrared Precipitation Retrieval for Merged Satellite Precipitation Products

physics.ao-ph · 2026-05-08 · unverdicted · novelty 7.0

GPROF-IR is a CNN-based retrieval that uses temporal context in geostationary IR observations to produce precipitation estimates with lower error than prior IR methods and climatological consistency with PMW retrievals for integration into IMERG V08.

AttentionBender: Manipulating Cross-Attention in Video Diffusion Transformers as a Creative Probe

cs.MM · 2026-04-22 · unverdicted · novelty 7.0

AttentionBender applies 2D transforms to cross-attention maps in video diffusion transformers, producing distributed distortions and glitch aesthetics that reveal entangled attention mechanisms while serving as both an XAI probe and creative tool.

Differentiable Surrogate for Detector Simulation and Design with Diffusion Models

physics.ins-det · 2026-01-09 · unverdicted · novelty 7.0

A LoRA-adapted conditional diffusion surrogate for electromagnetic calorimeter showers matches key observables within 2% RMSE and reproduces directional trends in design-utility gradients.

OOD-SEG: Exploiting out-of-distribution detection techniques for learning image segmentation from sparse multi-class positive-only annotations

cs.CV · 2024-11-14 · unverdicted · novelty 7.0

OOD-SEG reframes multi-class segmentation from sparse positive-only annotations as pixel-wise positive-unlabelled learning solved by integrating out-of-distribution detection techniques, with a proposed cross-validation evaluation on surgical imaging datasets.

Cyclic 2.5D Perceptual Loss for Cross-Modal 3D Medical Image Synthesis: T1w MRI to Tau PET

eess.IV · 2024-06-18 · unverdicted · novelty 7.0

Proposes a cyclic 2.5D perceptual loss with manufacturer SUVR standardization for T1w MRI to tau PET synthesis, reporting improved regional agreement on ADNI and SCAN cohorts across U-Net, UNETR, SwinUNETR, CycleGAN, and Pix2Pix.

DeepMine-Mamba: Mitigating Information Dilution in Mamba-Based State Space Models for Document Image Binarization

cs.CV · 2026-06-07 · unverdicted · novelty 6.0

DeepMine-Mamba adds an Anti-Dilution Gate to Mamba-based models to counteract feature dilution in document binarization and reports competitive FM and Fps scores on DIBCO benchmarks under leave-one-year-out evaluation.

Direct High-Magnetic-Field Coupling to Stripe Order in a Cuprate Superconductor

cond-mat.str-el · 2026-06-05 · unverdicted · novelty 6.0

High magnetic fields directly enhance the amplitude and correlation length of stripe order in a cuprate superconductor far above the vortex melting transition, indicating a coupling mechanism independent of superconductivity suppression.

Multi-Task Crack Foundation Model for Engineering-Reliable Crack Representation and Topology Preservation in Civil Infrastructure

cs.CV · 2026-06-04 · unverdicted · novelty 6.0

CrackGeoFM is a multi-task framework that adapts a frozen visual foundation model with FCEM, CFAM, and SMTD modules for crack mask prediction, skeleton reconstruction, and uncertainty estimation, reporting SOTA results across 20 datasets including few-shot settings.

XTinyU-Net: Training-Free U-Net Scaling via Initialization-Time Sensitivity

eess.IV · 2026-05-10 · unverdicted · novelty 6.0 · 2 refs

A Jacobian sensitivity curve computed at initialization identifies the narrowest U-Net configuration that avoids performance collapse, matching nnU-Net accuracy with 400-1600x fewer parameters on six medical datasets.

Controlling Transient Amplification Improves Long-horizon Rollouts

cs.LG · 2026-05-09 · conditional · novelty 6.0 · 2 refs

Commutativity regularization mitigates transient error amplification in autoregressive neural simulators by penalizing non-normality and non-commutativity of Jacobians, yielding stable long-horizon rollouts.

Spectral Lens: Activation and Gradient Spectra as Diagnostics of LLM Optimization

stat.ML · 2026-05-07 · unverdicted · novelty 6.0

Spectral analysis of activations and gradients provides new diagnostics that link batch size to representation geometry, early covariance tails to token efficiency, and spectral shifts to learning dynamics in decoder-only LLMs, backed by a mechanistic model.

A Robust Foundation Model for Conservation Laws: Injecting Context into Flux Neural Operators via Recurrent Vision Transformers

cs.LG · 2026-05-06 · unverdicted · novelty 6.0

A recurrent Vision Transformer hypernetwork injects context into Flux Neural Operators to infer and solve unseen conservation laws while preserving robustness and long-time stability.

SIAM: Head and Brain MRI Segmentation from Few High-Quality Templates via Synthetic Training

cs.CV · 2026-05-04 · unverdicted · novelty 6.0

SIAM achieves state-of-the-art whole-head MRI segmentation of 16 structures including extra-cerebral tissues by training on synthetic data from just six manual templates, matching or exceeding prior methods on 301 scans across eight heterogeneous datasets.

Cross-Domain Transfer of Hyperspectral Foundation Models

cs.CV · 2026-04-29 · unverdicted · novelty 6.0

Cross-domain transfer of remote-sensing HSI foundation models improves proximal sensing semantic segmentation over in-domain training and narrows the gap to cross-modality methods on the HS3-Bench benchmark.

Semantic Segmentation for Histopathology using Learned Regularization based on Global Proportions

eess.IV · 2026-04-27 · unverdicted · novelty 6.0

VSLP infers dense segmentations from global label proportions via a pre-trained transformer for initial confidence maps followed by variational optimization using Wasserstein fidelity and a learned regularizer, outperforming prior weakly supervised methods on histopathology datasets.

ICPR 2026 Competition on Low-Resolution License Plate Recognition

cs.CV · 2026-04-24 · accept · novelty 6.0

The ICPR 2026 LRLPR competition on real low-quality license plate images drew 99 valid submissions, with the winning team reaching 82.13% recognition rate and four teams exceeding 80%.

Localized Tornado Outbreak at the Upstream of a Tropical Easterly Wave in Camarines Norte, Philippines (13 September 2025)

physics.ao-ph · 2026-04-22 · unverdicted · novelty 6.0

A tornado outbreak with simultaneous tornadic supercells occurred in the Philippines within an easterly severe weather regime, documented as the first known instance there.

FlowForge: A Staged Local Rollout Engine for Flow-Field Prediction

cs.LG · 2026-04-21 · unverdicted · novelty 6.0

FlowForge predicts flow fields via staged local updates with a shared lightweight predictor, matching or exceeding baselines in accuracy while improving robustness to noise and reducing latency.

PSIRNet: Deep Learning-based Free-breathing Rapid Acquisition Late Enhancement Imaging

eess.IV · 2026-04-09 · accept · novelty 6.0

PSIRNet produces diagnostic-quality free-breathing PSIR LGE cardiac MRI from a single interleaved IR/PD acquisition over two heartbeats using a physics-guided deep learning network trained on over 800,000 slices.

Component-Adaptive and Lesion-Level Supervision for Improved Small Structure Segmentation in Brain MRI

cs.CV · 2026-04-09 · unverdicted · novelty 6.0

CATMIL augments nnU-Net with component-adaptive Tversky and MIL-based lesion supervision to raise Dice scores, small-lesion recall, and error control on the MSLesSeg dataset.

RABC-Net: Reliability-Aware Annotation-Free Skin Lesion Segmentation for Low-Resource Dermoscopy

cs.CV · 2026-04-07 · unverdicted · novelty 6.0

RABC-Net achieves 86.58% DICE and 79.47% JAC on skin lesion segmentation across ISIC-2017, ISIC-2018, and PH2 using only pseudo-labels and no manual masks for training or adaptation.

RSEdit: Text-Guided Image Editing for Remote Sensing

cs.CV · 2026-03-14 · unverdicted · novelty 6.0

RSEdit adapts off-the-shelf text-to-image models into a collection of editing systems that follow text instructions while keeping geospatial structure intact in remote sensing images.

Deeper detection limits in astronomical imaging using self-supervised spatiotemporal denoising

astro-ph.IM · 2026-02-19 · unverdicted · novelty 6.0

ASTERIS, a self-supervised spatiotemporal denoising algorithm, improves astronomical detection limits by 1 magnitude at 90% completeness while identifying three times more redshift >9 galaxy candidates in JWST images.

citing papers explorer

Showing 38 of 38 citing papers after filters.

Text Dictates, Music Decorates: Energy-based Attention for Editable Dance Motion Generation cs.AI · 2026-06-22 · unverdicted · none · ref 54
STREAM decouples text and music conditioning in a diffusion transformer via AdaLN for structure and BEAM for beats, plus new Motorica++ dataset and editability metrics, claiming SOTA music alignment with preserved semantics.
GPROF-IR: An Improved Single-Channel Infrared Precipitation Retrieval for Merged Satellite Precipitation Products physics.ao-ph · 2026-05-08 · unverdicted · none · ref 37
GPROF-IR is a CNN-based retrieval that uses temporal context in geostationary IR observations to produce precipitation estimates with lower error than prior IR methods and climatological consistency with PMW retrievals for integration into IMERG V08.
AttentionBender: Manipulating Cross-Attention in Video Diffusion Transformers as a Creative Probe cs.MM · 2026-04-22 · unverdicted · none · ref 62
AttentionBender applies 2D transforms to cross-attention maps in video diffusion transformers, producing distributed distortions and glitch aesthetics that reveal entangled attention mechanisms while serving as both an XAI probe and creative tool.
Differentiable Surrogate for Detector Simulation and Design with Diffusion Models physics.ins-det · 2026-01-09 · unverdicted · none · ref 39
A LoRA-adapted conditional diffusion surrogate for electromagnetic calorimeter showers matches key observables within 2% RMSE and reproduces directional trends in design-utility gradients.
OOD-SEG: Exploiting out-of-distribution detection techniques for learning image segmentation from sparse multi-class positive-only annotations cs.CV · 2024-11-14 · unverdicted · none · ref 48
OOD-SEG reframes multi-class segmentation from sparse positive-only annotations as pixel-wise positive-unlabelled learning solved by integrating out-of-distribution detection techniques, with a proposed cross-validation evaluation on surgical imaging datasets.
Cyclic 2.5D Perceptual Loss for Cross-Modal 3D Medical Image Synthesis: T1w MRI to Tau PET eess.IV · 2024-06-18 · unverdicted · none · ref 62
Proposes a cyclic 2.5D perceptual loss with manufacturer SUVR standardization for T1w MRI to tau PET synthesis, reporting improved regional agreement on ADNI and SCAN cohorts across U-Net, UNETR, SwinUNETR, CycleGAN, and Pix2Pix.
DeepMine-Mamba: Mitigating Information Dilution in Mamba-Based State Space Models for Document Image Binarization cs.CV · 2026-06-07 · unverdicted · none · ref 4
DeepMine-Mamba adds an Anti-Dilution Gate to Mamba-based models to counteract feature dilution in document binarization and reports competitive FM and Fps scores on DIBCO benchmarks under leave-one-year-out evaluation.
Direct High-Magnetic-Field Coupling to Stripe Order in a Cuprate Superconductor cond-mat.str-el · 2026-06-05 · unverdicted · none · ref 262
High magnetic fields directly enhance the amplitude and correlation length of stripe order in a cuprate superconductor far above the vortex melting transition, indicating a coupling mechanism independent of superconductivity suppression.
Multi-Task Crack Foundation Model for Engineering-Reliable Crack Representation and Topology Preservation in Civil Infrastructure cs.CV · 2026-06-04 · unverdicted · none · ref 28
CrackGeoFM is a multi-task framework that adapts a frozen visual foundation model with FCEM, CFAM, and SMTD modules for crack mask prediction, skeleton reconstruction, and uncertainty estimation, reporting SOTA results across 20 datasets including few-shot settings.
XTinyU-Net: Training-Free U-Net Scaling via Initialization-Time Sensitivity eess.IV · 2026-05-10 · unverdicted · none · ref 18 · 2 links
A Jacobian sensitivity curve computed at initialization identifies the narrowest U-Net configuration that avoids performance collapse, matching nnU-Net accuracy with 400-1600x fewer parameters on six medical datasets.
Spectral Lens: Activation and Gradient Spectra as Diagnostics of LLM Optimization stat.ML · 2026-05-07 · unverdicted · none · ref 55
Spectral analysis of activations and gradients provides new diagnostics that link batch size to representation geometry, early covariance tails to token efficiency, and spectral shifts to learning dynamics in decoder-only LLMs, backed by a mechanistic model.
A Robust Foundation Model for Conservation Laws: Injecting Context into Flux Neural Operators via Recurrent Vision Transformers cs.LG · 2026-05-06 · unverdicted · none · ref 20
A recurrent Vision Transformer hypernetwork injects context into Flux Neural Operators to infer and solve unseen conservation laws while preserving robustness and long-time stability.
SIAM: Head and Brain MRI Segmentation from Few High-Quality Templates via Synthetic Training cs.CV · 2026-05-04 · unverdicted · none · ref 65
SIAM achieves state-of-the-art whole-head MRI segmentation of 16 structures including extra-cerebral tissues by training on synthetic data from just six manual templates, matching or exceeding prior methods on 301 scans across eight heterogeneous datasets.
Cross-Domain Transfer of Hyperspectral Foundation Models cs.CV · 2026-04-29 · unverdicted · none · ref 16
Cross-domain transfer of remote-sensing HSI foundation models improves proximal sensing semantic segmentation over in-domain training and narrows the gap to cross-modality methods on the HS3-Bench benchmark.
Semantic Segmentation for Histopathology using Learned Regularization based on Global Proportions eess.IV · 2026-04-27 · unverdicted · none · ref 60
VSLP infers dense segmentations from global label proportions via a pre-trained transformer for initial confidence maps followed by variational optimization using Wasserstein fidelity and a learned regularizer, outperforming prior weakly supervised methods on histopathology datasets.
Localized Tornado Outbreak at the Upstream of a Tropical Easterly Wave in Camarines Norte, Philippines (13 September 2025) physics.ao-ph · 2026-04-22 · unverdicted · none · ref 91
A tornado outbreak with simultaneous tornadic supercells occurred in the Philippines within an easterly severe weather regime, documented as the first known instance there.
FlowForge: A Staged Local Rollout Engine for Flow-Field Prediction cs.LG · 2026-04-21 · unverdicted · none · ref 4
FlowForge predicts flow fields via staged local updates with a shared lightweight predictor, matching or exceeding baselines in accuracy while improving robustness to noise and reducing latency.
Component-Adaptive and Lesion-Level Supervision for Improved Small Structure Segmentation in Brain MRI cs.CV · 2026-04-09 · unverdicted · none · ref 2
CATMIL augments nnU-Net with component-adaptive Tversky and MIL-based lesion supervision to raise Dice scores, small-lesion recall, and error control on the MSLesSeg dataset.
RABC-Net: Reliability-Aware Annotation-Free Skin Lesion Segmentation for Low-Resource Dermoscopy cs.CV · 2026-04-07 · unverdicted · none · ref 20
RABC-Net achieves 86.58% DICE and 79.47% JAC on skin lesion segmentation across ISIC-2017, ISIC-2018, and PH2 using only pseudo-labels and no manual masks for training or adaptation.
RSEdit: Text-Guided Image Editing for Remote Sensing cs.CV · 2026-03-14 · unverdicted · none · ref 19
RSEdit adapts off-the-shelf text-to-image models into a collection of editing systems that follow text instructions while keeping geospatial structure intact in remote sensing images.
Deeper detection limits in astronomical imaging using self-supervised spatiotemporal denoising astro-ph.IM · 2026-02-19 · unverdicted · none · ref 99
ASTERIS, a self-supervised spatiotemporal denoising algorithm, improves astronomical detection limits by 1 magnitude at 90% completeness while identifying three times more redshift >9 galaxy candidates in JWST images.
Algebraic Language Models for Inverse Design of Metamaterials via Diffusion Transformers cs.CE · 2025-07-21 · unverdicted · none · ref 47
DiffuMeta uses diffusion transformers and algebraic language representations to generate diverse 3D shell metamaterials with targeted stress-strain responses under large deformations including buckling and contact.
MLFFM-SegDiff: A Multi-Level Feature Fusion Diffusion Model for Skin Lesion Segmentation eess.IV · 2026-06-25 · unverdicted · none · ref 4
MLFFM-SegDiff adds a multi-level feature fusion module and dual-path encoder to a diffusion U-Net, reporting improved Jaccard (0.8546) and Dice (0.9207) scores over baselines on three skin lesion datasets.
Test-Time Adaptation in Optical Coherence Tomography Using Trajectory-Aligned Time-Independent Flow cs.CV · 2026-06-17 · unverdicted · none · ref 21
Flow-matching TTA with histogram matching to synthetic reference trajectories and time-independent flow achieves SOTA segmentation of AMD biomarkers in OCT.
SegTME-UNI2: A Foundation Model-Based Framework for Generalisable Multiclass Cell Segmentation and LLM-Driven Tumour Microenvironment Characterisation in Histopathology cs.CV · 2026-06-16 · unverdicted · none · ref 22
SegTME-UNI2 pairs a UNI2-based dual-head segmentation model trained via progressive pseudo-labeling with an LLM to produce multiclass cell maps and narrative TME descriptions from H&E images.
Reliability of Probabilistic Emulation of Physical Systems cs.LG · 2026-06-11 · unverdicted · none · ref 18
CRPS-trained ensembles achieve better uncertainty reliability and speed than latent generative models for probabilistic emulation of 2D physical systems.
Physics-informed neural networks for quantitative assessment of cancellous bone microstructure from photoacoustic signals physics.med-ph · 2026-05-20 · unverdicted · none · ref 40
Biot-PINN embeds Biot poroelasticity into a neural network to decode photoacoustic signals for cancellous bone microstructure grading at 97% accuracy.
Observation-Guided Neural Surrogate Learning for Scientific Simulation Emulation: A Single-Gauge Flood-Inundation Proof of Concept physics.ao-ph · 2026-04-28 · unverdicted · none · ref 17
An EnsCGP coarse surrogate plus U-Net-ASPP corrector emulates LISFLOOD-FP flood depths on a 256x256 grid around one Chicago gauge, achieving R² ≈ 0.99 and MAE < 0.01 m on held-out events while matching the gauge depth at that single pixel.
Weighted Knowledge Distillation for Semi-Supervised Segmentation of Maxillary Sinus in Panoramic X-ray Images cs.CV · 2026-04-22 · unverdicted · none · ref 4
A semi-supervised framework using weighted knowledge distillation and SinusCycle-GAN refinement achieves 96.35% Dice score for maxillary sinus segmentation in panoramic X-rays from 2,511 patients.
Training-inference input alignment outweighs framework choice in longitudinal retinal image prediction cs.CV · 2026-04-18 · unverdicted · none · ref 20
Training-inference input alignment outweighs framework choice for longitudinal retinal image prediction, with deterministic regression matching complex models when acquisition variability dominates disease progression.
Physics Priors Offer Useful Accuracy-Carbon Trade-Offs in Spatio-Temporal Forecasting cs.LG · 2025-09-29 · unverdicted · none · ref 39
Stronger physics priors in neural networks for spatio-temporal shear flow forecasting yield substantially lower training carbon footprints than weak or no priors, though inference savings are less consistent.
Do We Really Need Diffusion? A Fast U-Net for Paired Medical Image Translation cs.CV · 2026-06-16 · unverdicted · none · ref 32
Lightweight U-Net outperforms DDPM on T2w-to-MRI-SFF translation (r=0.975 vs 0.962, MAE=0.014 vs 0.019) with 208x faster inference on 230k paired images from NAKO.
Patient-Level Diagnosis of Acute Myeloid Leukemia via Deep Learning Analysis of Bone Marrow Smear cs.CV · 2026-06-09 · unverdicted · none · ref 7
YOLO segmentation plus EfficientNet classification aggregates cell predictions to patient-level CBLC ratios, reporting weighted F1 scores of 0.87-0.91 on three external center cohorts from 89 patients.
$\mu$-FlowNet: A Deep Learning Approach for Mapping Flow Fields in Irregular Microchannels Using an Attention-based U-Net Encoder-Decoder Architecture cs.CE · 2026-04-19 · unverdicted · none · ref 39
μ-FlowNet applies an attention U-Net to map flow fields in irregular microchannels, reporting dice score 0.9317 and IoU 0.8731 on test data while outperforming standard U-Net and T-Net.
Few-Shot Left Atrial Wall Segmentation in 3D LGE MRI via Meta-Learning cs.CV · 2026-03-26 · unverdicted · none · ref 25 · 2 links
MAML with auxiliary cavity tasks and boundary loss improves 5-shot LA wall segmentation over standard fine-tuning (DSC 0.54 vs 0.48) and nears fully supervised performance at 20 shots.
Predicting parameters of a model cuprate superconductor using machine learning physics.comp-ph · 2025-12-03 · unverdicted · none · ref 34
An adapted U-Net model trained on mean-field phase diagrams accurately predicts Hamiltonian parameters for a cuprate superconductor when validated on Monte Carlo simulation data.
Respiratory Motion Correction in Abdominal MRI using a Densely Connected U-Net with GAN-guided Training eess.IV · 2019-06-24 · unverdicted · none · ref 15
Densely connected U-Net with GAN-guided training and perceptual loss corrects respiratory motion artifacts in abdominal MRI.
Topology-Driven Fusion of nnU-Net and MedNeXt for Accurate Brain Tumor Segmentation on Sub-Saharan Africa Dataset eess.IV · 2026-04-17 · unverdicted · none · ref 28
Pre-training nnU-Net and MedNeXt on BraTS 2025 data then fine-tuning on BraTS-Africa with added topology refinement yields NSD scores of 0.810, 0.829, and 0.895 for SNFH, NETC, and ET.

Title resolution pending

citation-role summary

citation-polarity summary

authors

co-cited works

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer