super hub Mixed citations

U-Net: Convolutional Networks for Biomedical Image Segmentation

Olaf Ronneberger, Philipp Fischer, Thomas Brox · 2015 · cs.CV · arXiv 1505.04597

Mixed citation behavior. Most common role is background (43%).

118 Pith papers citing it

Background 43% of classified citations

open full Pith review browse 118 citing papers more from Olaf Ronneberger arXiv PDF

abstract

There is large consent that successful training of deep networks requires many thousand annotated training samples. In this paper, we present a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently. The architecture consists of a contracting path to capture context and a symmetric expanding path that enables precise localization. We show that such a network can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks. Using the same network trained on transmitted light microscopy images (phase contrast and DIC) we won the ISBI cell tracking challenge 2015 in these categories by a large margin. Moreover, the network is fast. Segmentation of a 512x512 image takes less than a second on a recent GPU. The full implementation (based on Caffe) and the trained networks are available at http://lmb.informatik.uni-freiburg.de/people/ronneber/u-net .

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 11 method 6 baseline 3 dataset 1

citation-polarity summary

background 9 use method 6 baseline 3 unclear 2 use dataset 1

claims ledger

abstract There is large consent that successful training of deep networks requires many thousand annotated training samples. In this paper, we present a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently. The architecture consists of a contracting path to capture context and a symmetric expanding path that enables precise localization. We show that such a network can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segme

authors

Olaf Ronneberger Philipp Fischer Thomas Brox

co-cited works

representative citing papers

From Phase to Phenomenon: Self-Supervised Learning of Subsurface Scattering with Minimal Phase-shift Inputs

cs.CV · 2026-06-28 · unverdicted · novelty 7.0

A self-supervised method pretrains an encoder on eight PSP images per view to learn generalizable subsurface scattering representations that transfer to relighting and dense footprint reconstruction on unseen complex objects.

Sampling the Schwinger Model with Gauge-Equivariant Diffusion

hep-lat · 2026-06-25 · unverdicted · novelty 7.0

A gauge-equivariant diffusion model samples Schwinger model configurations, yielding unbiased observables matching MCMC and qualitatively less topological freezing than HMC.

HAMNO: A Hierarchical Adaptive Multi-scale Neural Operator with Physics-Informed Learning for Dynamical Systems

cs.LG · 2026-06-10 · unverdicted · novelty 7.0

HAMNO introduces adaptive gating between local and global operators in a hierarchical setup, with PI-HAMNO adding PDE residual constraints, demonstrating better performance on Allen-Cahn, Cahn-Hilliard, and Swift-Hohenberg equations.

Inverse Critical Experiment Design via Gradient Optimization and a Multigroup Attention-Based Neural Network Architecture

cs.LG · 2026-06-01 · unverdicted · novelty 7.0

A U-Net surrogate with multigroup attention pooling is trained on OpenMC sensitivity data and combined with gradient optimization to generate grid-based critical experiment geometries that achieve c_k values up to 0.97757 for HALEU fuel validation.

Field-level multi-tracers simulation-based inference of cosmological parameters from 3D maps

astro-ph.CO · 2026-05-25 · unverdicted · novelty 7.0

The work demonstrates that multi-tracer field-level SBI on galaxy and HI maps yields 2-7 times better constraints on Omega_m and sigma_8 than single-tracer or summary-statistic approaches, with 3D maps performing best.

LatentHDR: Decoupling Exposure from Diffusion via Conditional Latent-to-Latent Mapping for Text/Image-to-Panoramic HDR

cs.CV · 2026-05-11 · unverdicted · novelty 7.0

LatentHDR generates structurally consistent panoramic HDR images by producing one scene latent with a diffusion backbone then deterministically mapping it to multiple exposure latents via a lightweight conditional head.

EchoXFlow: A Beamspace Echocardiography Dataset for Cardiac Motion, Flow, and Function

cs.CV · 2026-05-06 · unverdicted · novelty 7.0

EchoXFlow is a new dataset of 37,125 beamspace echocardiography recordings with separable modalities, Doppler data, ECG, and clinical annotations that enables acquisition-aware learning not possible with standard scan-converted videos.

Generative diffusion models for spatiotemporal influenza forecasting

cs.LG · 2026-04-27 · unverdicted · novelty 7.0

Influpaint uses generative diffusion models on image-encoded influenza data to produce realistic and diverse epidemic trajectories that match leading ensemble methods in accuracy.

VitaminP: cross-modal learning enables whole-cell segmentation from routine histology

cs.CV · 2026-04-26 · unverdicted · novelty 7.0

VitaminP uses paired H&E-mIF data to train a model that transfers molecular boundary information, enabling accurate whole-cell segmentation directly from routine H&E histology across 34 cancer types.

Physics-informed, Generative Adversarial Design of Funicular Shells

cs.CE · 2026-04-17 · unverdicted · novelty 7.0

A modified DCGAN with an auxiliary discriminator using the membrane factor generates stable, previously unseen funicular shells optimized for pure compression in three dimensions.

Machine Learning Phase Field Reconstruction in a Bose-Einstein Condensate

cond-mat.quant-gas · 2026-04-10 · unverdicted · novelty 7.0

A U-Net-based ML pipeline reconstructs the complete phase field and quantized vortex charges in 2D Bose-Einstein condensates from density snapshots alone, using synthetic training data from projected Gross-Pitaevskii simulations.

Dual Triangle Attention: Effective Bidirectional Attention Without Positional Embeddings

q-bio.QM · 2026-04-09 · unverdicted · novelty 7.0

Dual Triangle Attention achieves effective bidirectional attention with built-in positional inductive bias via dual triangular masks, outperforming standard bidirectional attention on position-sensitive tasks and showing strong masked language modeling results with or without positional embeddings.

Diffusion Processes on Implicit Manifolds

cs.LG · 2026-04-08 · unverdicted · novelty 7.0 · 2 refs

Defines diffusion processes on implicit data manifolds via proximity-graph approximations to the infinitesimal generator and carré-du-champ operator, proves convergence in law to the continuous manifold process, and provides an Euler-Maruyama integrator validated on synthetic and MNIST manifolds.

Contour Refinement using Discrete Diffusion in Low Data Regime

cs.CV · 2026-02-05 · unverdicted · novelty 7.0

A CNN-based discrete diffusion method refines sparse contours from segmentation masks using simplified denoising steps and minimal post-processing, outperforming baselines on small medical and environmental datasets while running 3.5 times faster.

Radio-Interferometric Image Reconstruction with Denoising Diffusion Restoration Models

astro-ph.IM · 2026-01-22 · unverdicted · novelty 7.0

A diffusion model trained on real radio galaxy images reconstructs high-fidelity interferometric observations from VLA, EHT, and ALMA simulations and outperforms CLEAN on gridded visibilities.

SemanticBridge - A Dataset for 3D Semantic Segmentation of Bridges and Domain Gap Analysis

cs.CV · 2025-12-17 · unverdicted · novelty 7.0

SemanticBridge provides a new 3D dataset for bridge component segmentation and quantifies sensor-induced domain gaps that drop model performance by up to 11.4% mIoU.

Visual Diffusion Models are Geometric Solvers

cs.CV · 2025-10-24 · unverdicted · novelty 7.0

Standard visual diffusion models operating in pixel space can approximate solutions to the inscribed square, Steiner tree, and simple polygon problems.

Deep Learning for CMB Foreground Removal and Beam Deconvolution: A U-Net GAN Approach

astro-ph.IM · 2025-08-29 · unverdicted · novelty 7.0

A U-Net GAN reconstructs CMB T and E maps from Planck-like simulations with foregrounds and systematics, achieving under 1% error outside the Galactic region and demonstrating first-time correction for non-circular beams and asymmetric scans.

SinkSAM-Net: Knowledge-Driven Self-Supervised Sinkhole Segmentation Using Topographic Priors and Segment Anything Model

cs.CV · 2024-10-02 · unverdicted · novelty 7.0

SinkSAM-Net uses topographic priors and SAM with coordinate-wise bounding box jittering to create pseudo-labels for iterative self-supervised training of an EfficientNetV2-UNet, reaching about 95% of fully supervised performance on sinkhole datasets.

Does Your ViT Still Need U-Net for Segmentation?

cs.CV · 2026-06-30 · unverdicted · novelty 6.0

EoSeg shows that modern ViT backbones support accurate medical image segmentation without U-Net-style decoders via multi-level query modeling and learnable block fusion, with strong results on seven benchmarks.

X-Mind: Efficient Visual Chain-of-Thought via Predictive World Model for End-to-End Driving

cs.CV · 2026-06-27 · unverdicted · novelty 6.0

X-Mind proposes an efficient internal visual chain-of-thought using compressed BEV sketches and recurrent block diffusion to embed predictive world models into end-to-end driving policies.

Learning the Universe: Posterior Reliability of Neural Generative Models in High-Dimensional Field-Level Inference of Cosmic Initial Conditions

astro-ph.CO · 2026-06-08 · unverdicted · novelty 6.0

Generative models for cosmological field-level inference can reproduce posterior means and cross-correlations yet fail to capture correct uncertainty geometry when validated against HMC reference samples.

A Simulation Platform for Flapping-Wing Vehicles

cs.RO · 2026-06-01 · unverdicted · novelty 6.0

FWAV-Sim is a high-fidelity Unity simulation framework for flapping-wing vehicles that integrates blade-element aerodynamics with bluff-body drag, spatiotemporally correlated fractal turbulence, and realistic IMU/LiDAR/RGB sensor models to support autonomy development.

MoRE: A Mixture-of-Experts-Based Task-Adaptive End-to-End Network for Multimodal MRI Reconstruction

eess.IV · 2026-06-01 · unverdicted · novelty 6.0

MoRE integrates a sparsely activated MoE module with unsupervised routing into a variational network for stable multimodal MRI reconstruction on fastMRI brain and knee data at 8x undersampling.

citing papers explorer

Showing 45 of 45 citing papers after filters.

From Phase to Phenomenon: Self-Supervised Learning of Subsurface Scattering with Minimal Phase-shift Inputs cs.CV · 2026-06-28 · unverdicted · none · ref 39 · internal anchor
A self-supervised method pretrains an encoder on eight PSP images per view to learn generalizable subsurface scattering representations that transfer to relighting and dense footprint reconstruction on unseen complex objects.
LatentHDR: Decoupling Exposure from Diffusion via Conditional Latent-to-Latent Mapping for Text/Image-to-Panoramic HDR cs.CV · 2026-05-11 · unverdicted · none · ref 37 · internal anchor
LatentHDR generates structurally consistent panoramic HDR images by producing one scene latent with a diffusion backbone then deterministically mapping it to multiple exposure latents via a lightweight conditional head.
EchoXFlow: A Beamspace Echocardiography Dataset for Cardiac Motion, Flow, and Function cs.CV · 2026-05-06 · unverdicted · none · ref 38 · internal anchor
EchoXFlow is a new dataset of 37,125 beamspace echocardiography recordings with separable modalities, Doppler data, ECG, and clinical annotations that enables acquisition-aware learning not possible with standard scan-converted videos.
VitaminP: cross-modal learning enables whole-cell segmentation from routine histology cs.CV · 2026-04-26 · unverdicted · none · ref 46 · internal anchor
VitaminP uses paired H&E-mIF data to train a model that transfers molecular boundary information, enabling accurate whole-cell segmentation directly from routine H&E histology across 34 cancer types.
Contour Refinement using Discrete Diffusion in Low Data Regime cs.CV · 2026-02-05 · unverdicted · none · ref 36 · internal anchor
A CNN-based discrete diffusion method refines sparse contours from segmentation masks using simplified denoising steps and minimal post-processing, outperforming baselines on small medical and environmental datasets while running 3.5 times faster.
SemanticBridge - A Dataset for 3D Semantic Segmentation of Bridges and Domain Gap Analysis cs.CV · 2025-12-17 · unverdicted · none · ref 69 · internal anchor
SemanticBridge provides a new 3D dataset for bridge component segmentation and quantifies sensor-induced domain gaps that drop model performance by up to 11.4% mIoU.
Visual Diffusion Models are Geometric Solvers cs.CV · 2025-10-24 · unverdicted · none · ref 38 · internal anchor
Standard visual diffusion models operating in pixel space can approximate solutions to the inscribed square, Steiner tree, and simple polygon problems.
SinkSAM-Net: Knowledge-Driven Self-Supervised Sinkhole Segmentation Using Topographic Priors and Segment Anything Model cs.CV · 2024-10-02 · unverdicted · none · ref 18 · internal anchor
SinkSAM-Net uses topographic priors and SAM with coordinate-wise bounding box jittering to create pseudo-labels for iterative self-supervised training of an EfficientNetV2-UNet, reaching about 95% of fully supervised performance on sinkhole datasets.
Does Your ViT Still Need U-Net for Segmentation? cs.CV · 2026-06-30 · unverdicted · none · ref 33 · internal anchor
EoSeg shows that modern ViT backbones support accurate medical image segmentation without U-Net-style decoders via multi-level query modeling and learnable block fusion, with strong results on seven benchmarks.
X-Mind: Efficient Visual Chain-of-Thought via Predictive World Model for End-to-End Driving cs.CV · 2026-06-27 · unverdicted · none · ref 12 · internal anchor
X-Mind proposes an efficient internal visual chain-of-thought using compressed BEV sketches and recurrent block diffusion to embed predictive world models into end-to-end driving policies.
A Multimodal 3D Foundation Model for Light Sheet Fluorescence Microscopy Enables Few-Shot Segmentation, Classification, and Deblurring cs.CV · 2026-05-25 · unverdicted · none · ref 25 · internal anchor
A multimodal 3D foundation model pretrained on LSM volumes via masked reconstruction and image-text alignment enables improved few-shot segmentation, classification, and deblurring.
Plume Segmentation from MethaneSAT with Cross-Sensor Transfer Learning and Physics-Informed Postprocessing cs.CV · 2026-05-22 · unverdicted · none · ref 42 · internal anchor
Mask R-CNN with ResNet-50 pre-trained on MethaneAIR and fine-tuned on MethaneSAT, plus physics-informed postprocessing, yields instance-level precision 0.60/recall 0.98 at baseline, improving to 0.71/0.94 and 0.92/0.70 in two operational modes.
SegRAG: Training-Free Retrieval-Augmented Semantic Segmentation cs.CV · 2026-05-17 · unverdicted · none · ref 7 · 2 links · internal anchor
SegRAG is a training-free retrieval-augmented framework that extracts class-specific point prompts from a filtered DINOv3 feature bank to boost SAM3 semantic segmentation performance on standard and agricultural benchmarks.
EDGER: EDge-Guided with HEatmap Refinement for Generalizable Image Forgery Localization cs.CV · 2026-05-12 · unverdicted · none · ref 18 · internal anchor
A dual-branch system using frequency edge cues and CLIP-based synthetic patch detection for accurate, resolution-independent image forgery localization.
Geometry-aware Prototype Learning for Cross-domain Few-shot Medical Image Segmentation cs.CV · 2026-05-11 · unverdicted · none · ref 2 · internal anchor
GeoProto enriches appearance prototypes with geometric offsets from an ordinal shape branch to improve cross-domain few-shot medical image segmentation.
TRAS: An Interactive Software for Tracing Tree Ring Cross Sections cs.CV · 2026-05-08 · accept · none · ref 17 · 2 links · internal anchor
TRAS is an interactive GUI tool integrating CS-TRD, DeepCS-TRD, and INBD for automatic tree ring detection and measurement in cross-sectional images, evaluated on 18 Pinus taeda samples with 81% F-score and 20% manual effort reduction.
Leveraging Image Generators to Address Training Data Scarcity: The Gen4Regen Dataset for Forest Regeneration Mapping cs.CV · 2026-05-07 · conditional · none · ref 50 · internal anchor
Mixing real UAV imagery with 2101 AI-generated image-mask pairs improves semantic segmentation F1 scores for fine-grained forest species by over 15 percentage points overall and up to 30 points for rare classes.
Approaching human parity in the quality of automated organoid image segmentation cs.CV · 2026-05-04 · conditional · none · ref 20 · internal anchor
A composite SAM-based method segments organoid images with accuracy matching or approaching inter-observer variability among human annotators.
When Less Is More: Simplicity Beats Complexity for Physics-Constrained InSAR Phase Unwrapping cs.CV · 2026-04-28 · accept · none · ref 6 · internal anchor
A vanilla U-Net with 7.76M parameters achieves R²=0.834 and RMSE=1.01 cm on a global InSAR benchmark, beating larger attention models by 34% in R² and 51% in RMSE while running 2.5× faster.
From Boundaries to Semantics: Prompt-Guided Multi-Task Learning for Petrographic Thin-section Segmentation cs.CV · 2026-04-16 · unverdicted · none · ref 4 · internal anchor
Petro-SAM adapts SAM via a Merge Block for polarized views plus multi-scale fusion and color-entropy priors to jointly achieve grain-edge and lithology segmentation in petrographic images.
Self-supervised Pretraining of Cell Segmentation Models cs.CV · 2026-04-12 · unverdicted · none · ref 15 · internal anchor
DINOCell achieves a SEG score of 0.784 on LIVECell by self-supervised domain adaptation of DINOv2, improving 10.42% over SAM-based models and showing strong zero-shot transfer.
GIF: A Conditional Multimodal Generative Framework for IR Drop Imaging in Chip Layouts cs.CV · 2026-04-11 · unverdicted · none · ref 20 · internal anchor
GIF fuses geometrical image features and logical graph topology in a conditional diffusion model to generate high-quality IR drop images for chip layouts, outperforming prior ML methods on CircuitNet-N28 with SSIM 0.78, Pearson 0.95, PSNR 21.77, and NMAE 0.026.
ELT: Elastic Looped Transformers for Visual Generation cs.CV · 2026-04-10 · unverdicted · none · ref 59 · internal anchor
Elastic Looped Transformers share weights across recurrent blocks and apply intra-loop self-distillation to deliver 4x parameter reduction while matching competitive FID and FVD scores on ImageNet and UCF-101.
Label Dropout: Improved Deep Learning Echocardiography Segmentation Using Multiple Datasets With Domain Shift and Partial Labelling cs.CV · 2024-03-12 · unverdicted · none · ref 11 · internal anchor
Label dropout mitigates shortcut learning in multi-dataset partially labelled echocardiography segmentation, improving Dice scores by 62% and 25% on two cardiac structures.
SDXL-Lightning: Progressive Adversarial Diffusion Distillation cs.CV · 2024-02-21 · conditional · none · ref 52 · internal anchor
SDXL-Lightning uses progressive adversarial distillation to reach new state-of-the-art quality in one-step and few-step 1024px text-to-image generation from the SDXL base model.
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets cs.CV · 2023-11-25 · conditional · none · ref 73 · internal anchor
Stable Video Diffusion scales latent video diffusion models via text-to-image pretraining, video pretraining on curated data, and high-quality finetuning to produce competitive text-to-video and image-to-video results while enabling motion LoRA and multi-view 3D applications.
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis cs.CV · 2023-07-04 · conditional · none · ref 39 · internal anchor
SDXL improves upon prior Stable Diffusion versions through a larger UNet backbone, dual text encoders, novel conditioning, and a refinement model, producing higher-fidelity images competitive with black-box state-of-the-art generators.
Accurate Nuclear Segmentation with Center Vector Encoding cs.CV · 2019-07-09 · unverdicted · none · ref 15 · internal anchor
A bottom-up nuclear segmentation method using Center Vector Encoding outperforms prior state-of-the-art approaches.
CheXanatomy: Anatomy-Aware Vision-Language Modeling for Chest Radiographs cs.CV · 2026-06-07 · unverdicted · none · ref 19 · internal anchor
CheXanatomy trains VLMs to generate 2D anatomical masks via next-token prediction on synthetic CXRs from CT, matching U-Net performance with better domain-shift robustness and sample efficiency.
Coarse-to-Fine Domain Incremental Learning with Attentive Distillation for Mining Footprint Segmentation in Multispectral Imagery cs.CV · 2026-05-23 · unverdicted · none · ref 28 · internal anchor
MineC2FNet uses attentive distillation from coarse to fine domains in a teacher-student setup to boost mining footprint segmentation performance and releases a new expert-validated dataset of 219 precisely annotated images.
Systematic Evaluation of Vision Transformers for Automated Cervical Cancer Classification: Optimization, Statistical Validation, and Clinical Interpretability cs.CV · 2026-05-17 · unverdicted · none · ref 9 · internal anchor
Vision Transformer optimized on Herlev dataset reaches 94.9-95.2% accuracy in cervical cell classification with Grad-CAM attention aligning to nuclear and chromatin features.
Vision Transformer-Conditioned UNet for Domain-Adaptive Semantic Segmentation cs.CV · 2026-05-12 · unverdicted · none · ref 48 · internal anchor
ViTC-UNet adapts frozen ViT representations to biomedical semantic segmentation by conditioning a UNet via learnable tokens and two-way attention decoding.
Flow matching for Sentinel-2 super-resolution: implementation, application, and implications cs.CV · 2026-05-01 · unverdicted · none · ref 40 · internal anchor
Flow matching achieves single-step pixel accuracy and 20-step perceptual quality for Sentinel-2 super-resolution, outperforming diffusion and Real-ESRGAN while enabling large-scale 2.5 m land-cover products.
RASALoRE: Region Aware Spatial Attention with Location-based Random Embeddings for Weakly Supervised Anomaly Detection in Brain MRI Scans cs.CV · 2025-10-09 · conditional · none · ref 30 · internal anchor
A novel weakly supervised anomaly detection method for brain MRI that uses discriminative dual prompt tuning for pseudo masks and region-aware spatial attention with location-based random embeddings to achieve SOTA results with under 8 million parameters on BraTS and MSD datasets.
Accuracy Improvement of Cell Image Segmentation Using Feedback Former cs.CV · 2024-08-23 · unverdicted · none · ref 24 · internal anchor
Feedback Former improves cell image segmentation accuracy by feeding detailed feature maps back from near the output to lower transformer layers, outperforming non-feedback baselines with lower computational cost on three datasets.
The Ethical Dilemma when (not) Setting up Cost-based Decision Rules in Semantic Segmentation cs.CV · 2019-07-02 · unverdicted · none · ref 21 · internal anchor
Defining egoistic and altruistic cost functions for class confusions in semantic segmentation changes precision, recall, and segment-wise error rates relative to standard MAP decisions.
LETT-NeXt: A Lightweight RECIST-Guided Model for 3D CT Lesion Segmentation cs.CV · 2026-06-29 · unverdicted · none · ref 23 · internal anchor
LETT-NeXt uses RECIST line prompts in a cropped MedNeXt-v2 encoder-decoder to predict 3D lesion masks, reaching DSC 73.9 on hidden test data for a CVPR 2026 segmentation competition.
Efficient Transformer-Based Localized Patch Sampling for Choroid Plexus Segmentation in Multiple Sclerosis cs.CV · 2026-06-02 · unverdicted · none · ref 13 · internal anchor
SwinUNETR model with 32x32x32 patch sampling achieves DSC of 0.868 for LVCP segmentation in MS, outperforming UXNET with 99% lower computation.
Deep Learning-Based Segmentation of Peritoneal Cancer Index Regions from CT Imaging cs.CV · 2026-04-30 · unverdicted · none · ref 11 · internal anchor
nnU-Net segments rPCI regions on 62 CT scans with mean Dice 0.82, nearing inter-observer agreement of 0.88 and beating Swin UNETR at 0.76.
Learning to count small and clustered objects with application to bacterial colonies cs.CV · 2026-04-21 · unverdicted · none · ref 71 · internal anchor
ACFamNet Pro reaches 9.64% mean normalized absolute error on bacterial colony images under 5-fold cross-validation, beating FamNet by 12.71%.
AI Approach for MRI-only Full-Spine Vertebral Segmentation and 3D Reconstruction in Paediatric Scoliosis cs.CV · 2026-04-20 · unverdicted · none · ref 27 · internal anchor
An AI pipeline using GAN-generated MRI-like images and U-Net segmentation produces automated 3D thoracolumbar spine reconstructions from MRI with 88% Dice score and reduces processing time from 1 hour to under 1 minute while preserving scoliosis deformity features.
SAGE-GAN: Towards Realistic and Robust Segmentation of Spatially Ordered Nanoparticles via Attention-Guided GANs cs.CV · 2026-04-04 · unverdicted · none · ref 18 · internal anchor
SAGE-GAN integrates a self-attention U-Net into a CycleGAN framework to generate realistic synthetic electron microscopy image-mask pairs that augment training data for nanoparticle segmentation without human labeling.
SAN: Scale-Aware Network for Semantic Segmentation of High-Resolution Aerial Images cs.CV · 2019-07-06 · unverdicted · none · ref 5 · internal anchor
SANet adds a re-sampling-based scale-aware module to semantic segmentation networks to better handle inconsistent object scales in aerial images.
Non-frontal face recognition using GANs and memristor-based classifiers cs.CV · 2026-06-10 · unverdicted · none · ref 107 · internal anchor
A system pairing GAN pose frontalization with memristor neuromorphic classifiers reports up to 96% accuracy on non-frontal face datasets.
Embedding Non-Distortive Cancelable Face Template Generation cs.CV · 2024-02-04 · unverdicted · none · ref 18 · internal anchor
Presents a non-distortive cancelable face template method via targeted image distortion that maintains identity signals for neural embedding models on MNIST and LFW data.

U-Net: Convolutional Networks for Biomedical Image Segmentation

hub tools

citation-role summary

citation-polarity summary

claims ledger

authors

co-cited works

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer