SheafStain: Sheaf-Theoretic Schr\"odinger Bridge for Spatially and Biologically Coherent Virtual Staining

Daeky Jeong; Eunjin Jang; Hongjun Yoon; Hwamin Lee; Hyeongyeol Lim; Won June Cho

arxiv: 2606.11846 · v1 · pith:GI5J2YOOnew · submitted 2026-06-10 · 💻 cs.CV

SheafStain: Sheaf-Theoretic Schr\"odinger Bridge for Spatially and Biologically Coherent Virtual Staining

Hyeongyeol Lim , Hongjun Yoon , Eunjin Jang , Daeky Jeong , Won June Cho , Hwamin Lee This is my paper

Pith reviewed 2026-06-27 10:29 UTC · model grok-4.3

classification 💻 cs.CV

keywords virtual stainingsheaf theorySchrödinger Bridgevision foundation modelswhole slide imagescontext contaminationpathology imaging

0 comments

The pith

Treating vision foundation model embeddings as sheaf sections inside a Schrödinger Bridge produces virtual stains that remain consistent when patches are joined into full slide images.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Virtual staining converts one tissue stain into another to speed up cancer biomarker tests, but processing gigapixel slides patch by patch creates visible seams and biological mismatches at boundaries. The paper argues that self-attention in pathology vision models produces inconsistent embeddings for the same physical location because global context varies, which it formalizes as a presheaf that violates the gluing axiom. SheafStain fixes this by recasting class and patch tokens as sheaf-like sections within a Schrödinger Bridge, where the class token holds biological identity and patch tokens build a spatial map. A backbone pretrained jointly on H&E and IHC supplies consistent cross-stain mappings so one feature space guides both input and output. Evaluation on stitched 1024x1024 outputs for HER2, ER, PR and Ki-67 shows reduced boundary artifacts compared with six prior methods.

Core claim

The central claim is that embeddings from pathology vision foundation models form a presheaf violating the gluing axiom due to context contamination from self-attention, and that integrating class and patch tokens as sheaf-like sections into a Schrödinger Bridge framework, using a co-pretrained H&E/IHC backbone, enforces spatial and biological consistency for virtual staining of whole slide images.

What carries the argument

Sheaf-like sections of class and patch tokens inside a Schrödinger Bridge, where class tokens anchor biological identity and patch tokens supply per-position spatial maps.

If this is right

Single pretrained feature space supervises both conditioning and stain alignment without degenerate cross-stain stalks.
Evaluation on stitched full-resolution outputs rather than isolated patches reveals the stitching artifact reduction.
Class tokens maintain biological consistency while patch tokens enforce spatial continuity across the slide.
Results hold across HER2, ER, PR and Ki-67 stains when translating at 256x256 resolution.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same sheaf-section treatment could be tested on other large-image tasks that currently suffer from context-dependent embeddings, such as semantic segmentation of aerial or medical volumes.
If the gluing enforcement works, it may reduce the need for post-processing seam removal steps in any patch-based inference pipeline.
Extending the co-pretraining idea to additional stain pairs might allow one backbone to support multiple virtual staining directions without retraining.
A direct test would be to measure whether the method preserves quantitative biomarker scores on the stitched outputs at the same level as on isolated patches.

Load-bearing premise

That the observed inconsistency across overlapping patches is caused by a sheaf gluing violation that the Schrödinger Bridge integration will enforce.

What would settle it

Quantitative boundary continuity scores and visual inspection on stitched 1024x1024 virtual stain outputs versus ground-truth IHC images for the four markers.

Figures

Figures reproduced from arXiv: 2606.11846 by Daeky Jeong, Eunjin Jang, Hongjun Yoon, Hwamin Lee, Hyeongyeol Lim, Won June Cho.

**Figure 1.** Figure 1: Failure cases of prior virtual staining. Independent patch translation induces (a) patchboundary discontinuities, (b) stain-tone drift, and (c) chromatin-distribution mismatch across reassembled regions, disrupting morphological cues used for biomarker scoring. of whole-slide images (WSIs) precludes entire image processing under GPU memory limits, forcing independent generation on small patches (e.g., 25… view at source ↗

**Figure 2.** Figure 2: SheafStain training pipeline. (a) Overview. Reference patch Pref and two adjacent targets Padj1 , Padj2 sharing a triple-overlap are sampled; VFM-derived M and cn condition three weightshared generator passes that yield Gref, Gadj1 , Gadj2 . (b) Spatial map. VFM tokens of overlapping patches are aggregated into M, a sheaf-consistent section over Pref. (c) Pixel sheaf and cocycle losses. Penalize disagreem… view at source ↗

**Figure 3.** Figure 3: SheafStain inference pipeline. The input H&E is covered by a stride-driven reference grid with overlapping adjacent patches (white guides), supplemented with additional reference patches drawn from the high- and low-energy extremes of the mid-band Fast Fourier Transform (FFT) energy map (lower-left). Each reference patch is independently translated by the generator qϕ under VFMderived conditioning (M, cn)… view at source ↗

**Figure 4.** Figure 4: Qualitative comparison on BCI and MIST HER2 datasets. On stitching, D-VST is the closest competitor, yet SheafStain still leads by 6–13% across blocks. DAB-r gaps narrow on BCI HER2 (0.0267 vs. 0.0258) but widen to 1.6–1.8× the runner-up on the multi-stain MIST block (HER2, ER, PR, Ki-67), indicating that VFM-derived spatial conditioning carries cross-stain biomarker signal that prior methods miss. Additio… view at source ↗

**Figure 5.** Figure 5: Spatial map conditioning extraction pipeline in Section 3.4. (a) A reference patch (256 × 256) is randomly sampled from a 1024 × 1024 image. (b) Surrounding overlapping patches (224 × 224) are extracted in 8 directions at stride 80: two each along the horizontal, vertical, and the two diagonal axes (lower-left to upper-right, upper-left to lower-right); horizontal and vertical directions receive lateral ji… view at source ↗

**Figure 6.** Figure 6: Effect of the additional reference patches on MIST HER2 validation (n = 1,000; stride 192; 256 × 256 reference patches). (a) Histogram of per-image ∆TS = TSw/o − TSw/; bars to the right of zero are images for which adding the patches reduced TS. (b) Mean per-image % change for five paired metrics, with the sign flipped on higher-is-better metrics so that a positive bar always denotes improvement. Significa… view at source ↗

**Figure 7.** Figure 7: Dataset-level metric comparison on MIST HER2 validation (n = 1,000, stride 192). Each panel uses an independent y-axis to make the with/without contrast visible at the metric’s natural scale. KID is reported as KID × 103 for legibility. Adding the patches lowers FID, KID, and TS, raises DAB-r, and leaves mIOD and FOD absolute errors unchanged. H Dataset Statistics [PITH_FULL_IMAGE:figures/full_fig_p018_7.png] view at source ↗

**Figure 8.** Figure 8: Per-position analysis of Prov-GigaPath overlap-token consistency on 4,873 BCI images. (a) Two adjacent patches Ui , Uj at stride s share an overlap region Uij ; tokens in the overlap match across patches by spatial position k. (b) Context contamination: a token t at the same physical position in Ui vs. Uj aggregates attention over different non-overlapping contexts (Ui\Uij for fi(t); Uj \Uij for fj (t)), y… view at source ↗

**Figure 9.** Figure 9: Empirical verification of inherent restriction map learning on MIST HER2 validation [PITH_FULL_IMAGE:figures/full_fig_p022_9.png] view at source ↗

**Figure 10.** Figure 10: Additional HER2 qualitative comparison on BCI at 1024 × 1024. 25 [PITH_FULL_IMAGE:figures/full_fig_p025_10.png] view at source ↗

**Figure 11.** Figure 11: Additional HER2 qualitative comparison on MIST at 1024 × 1024. 26 [PITH_FULL_IMAGE:figures/full_fig_p026_11.png] view at source ↗

**Figure 12.** Figure 12: ER qualitative comparison on MIST at 1024 × 1024. 27 [PITH_FULL_IMAGE:figures/full_fig_p027_12.png] view at source ↗

**Figure 13.** Figure 13: PR qualitative comparison on MIST at 1024 × 1024. 28 [PITH_FULL_IMAGE:figures/full_fig_p028_13.png] view at source ↗

**Figure 14.** Figure 14: Ki-67 qualitative comparison on MIST at 1024 × 1024. 29 [PITH_FULL_IMAGE:figures/full_fig_p029_14.png] view at source ↗

**Figure 15.** Figure 15: Latency–quality trade-off on MIST/HER2. Four quality metrics vs. per-image latency: (a) FID, (b) KID×103 , (c) DISTS, and (d) DAB-r. All panels are oriented so that “up is better”: y-axis is inverted for the lower-is-better metrics (a–c); panel (d) uses a normal y-axis for higher-is-better DAB-r. Methods on the dashed curve form the non-dominated Pareto frontier (no other method is simultaneously faster a… view at source ↗

**Figure 16.** Figure 16: ROC curves on real BCI test for HER2 Low/High classifiers, each trained on a different [PITH_FULL_IMAGE:figures/full_fig_p032_16.png] view at source ↗

read the original abstract

Current virtual staining approaches offer the potential for time- and cost-efficient biomarker quantification in cancer diagnostics and prognostics. However, patch-wise inference for gigapixel whole slide images (WSIs) fails to maintain spatial continuity, yielding artifacts that cause catastrophic mismatches with ground-truth images. Although pathology Vision Foundation Models (VFMs) offer rich representations, their self-attention causes varying global contexts to produce inconsistent embeddings for the same physical region. We formalize and validate this ``context contamination'' as a sheaf-theoretic problem where these embeddings form a presheaf that violates the gluing axiom. To address this, we propose SheafStain, a new approach that reinterprets VFM features as sheaf-like sections for spatially and biologically coherent virtual staining. Specifically, SheafStain integrates class and patch tokens into a Schr\"odinger Bridge framework as sheaf-like sections. While the class token anchors biological consistency, patch tokens form a per-position spatial map. A backbone co-pretrained on Hematoxylin \& Eosin (H\&E) and Immunohistochemistry (IHC) yields non-degenerate cross-stain stalks, so a single VFM feature space supervises both input conditioning and output stain alignment. Departing from prior work that evaluates on isolated $256 \times 256$ patches and either random-crops or resizes the $1024 \times 1024$ ground truth, we translate at $256 \times 256$ and evaluate on the stitched $1024 \times 1024$ outputs across HER2, ER, PR, and Ki-67. SheafStain demonstrates promising results against six prior methods while mitigating patch-boundary stitching artifacts. Code will soon be released.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

SheafStain frames VFM context contamination as a presheaf gluing failure and routes class/patch tokens through a Schrödinger Bridge with co-pretrained H&E/IHC stalks, but the abstract supplies zero numbers to show the fix works.

read the letter

The central move is to treat self-attention embeddings from pathology VFMs as sections of a presheaf that fails to glue across overlapping patches, then enforce consistency by embedding those tokens into a Schrödinger Bridge whose cross-stain stalks come from a backbone pretrained on paired H&E and IHC data. The evaluation shift—translate at 256×256 but score on stitched 1024×1024 outputs for HER2, ER, PR, and Ki-67—is also explicit and avoids the usual crop-or-resize shortcuts.

What stands out is the practical target: gigapixel WSIs where boundary artifacts break downstream biomarker quantification. The six-method comparison and the co-pretraining step for non-degenerate stalks are concrete engineering choices that could matter if they hold up.

The soft spot is the complete absence of any quantitative result in the abstract. No PSNR, SSIM, FID, pathologist concordance, or even a statement that the sheaf construction satisfies the gluing axiom on the data. Without those, it is impossible to separate the contribution of the sheaf language from the co-pretraining or the Bridge itself. The circularity risk the reader flagged is real on the current evidence: the stalks are outputs of the same pretraining that feeds the final model.

This paper is for groups already running virtual staining pipelines who hit stitching problems on whole slides. A reader who wants a new formal vocabulary for an old inconsistency will find the framing, but anyone needing reproducible numbers or an ablation on the sheaf component will have to wait for the full text and code.

I would send it to review. The problem is genuine and the proposed mechanism is at least coherent on its own terms; a referee can check whether the experiments actually demonstrate the claimed consistency gain.

Referee Report

3 major / 2 minor

Summary. The manuscript claims that self-attention in pathology Vision Foundation Models produces context-dependent embeddings that form a presheaf violating the gluing axiom (termed 'context contamination'), and proposes SheafStain to reinterpret class and patch tokens as sheaf-like sections inside a Schrödinger Bridge. A co-pretrained H&E/IHC backbone supplies non-degenerate cross-stain stalks so that a single feature space supervises both conditioning and output alignment. The method translates 256×256 patches and evaluates on stitched 1024×1024 images for HER2, ER, PR and Ki-67, asserting improved spatial/biological consistency and superior performance relative to six prior virtual-staining baselines while mitigating patch-boundary artifacts.

Significance. If the formalization and empirical claims hold, the work supplies a principled mechanism for enforcing consistency across overlapping patches in gigapixel virtual staining, a practical bottleneck in computational pathology. The explicit shift to stitched-image evaluation (rather than isolated-patch or resized-GT protocols) is a methodological advance that better matches clinical use. The co-pretraining strategy for cross-stain stalks and the planned code release are concrete strengths that would aid reproducibility if the central sheaf construction is shown to be non-circular.

major comments (3)

[Abstract] Abstract (paragraph on formalization of context contamination): the claim that VFM self-attention embeddings constitute a presheaf violating the gluing axiom is presented as the central motivation, yet no explicit restriction maps, overlap diagrams, or verification that the gluing condition fails for the same physical region under different global contexts are supplied; without this derivation the sheaf framing remains an interpretive overlay rather than an independent justification for the subsequent Schrödinger Bridge construction.
[Abstract] Abstract (evaluation protocol paragraph): the manuscript states that SheafStain 'demonstrates promising results against six prior methods' on stitched 1024×1024 outputs, but reports no quantitative metrics (PSNR, SSIM, FID, biomarker-specific concordance, or statistical tests), no ablation isolating the sheaf or co-pretraining components, and no verification that the cross-stain stalk parameters remain independent of the final staining result; these omissions make the empirical support for the central claim impossible to assess.
[Abstract] Abstract (Schrödinger Bridge integration paragraph): the class token is said to 'anchor biological consistency' and patch tokens to 'form a per-position spatial map,' yet the manuscript supplies no equation or section demonstrating that the resulting sections satisfy the sheaf gluing axiom on overlaps or that the Schrödinger Bridge transport enforces this property beyond the co-pretraining step; this is load-bearing for the claim that the approach restores spatial and biological coherence.

minor comments (2)

[Abstract] The statement 'Code will soon be released' should be replaced by an explicit repository URL or a clear statement that the code is already available, consistent with reproducibility standards.
[Abstract] The phrase 'non-degenerate cross-stain stalks' is used without a preceding definition or reference to how degeneracy is measured; a brief parenthetical or citation would clarify the term for readers outside sheaf theory.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the detailed and constructive report. The comments correctly identify areas where the abstract and manuscript require additional explicit constructions and empirical details to strengthen the sheaf-theoretic claims and evaluation. We address each major comment below and will incorporate the requested elements in the revised manuscript.

read point-by-point responses

Referee: [Abstract] Abstract (paragraph on formalization of context contamination): the claim that VFM self-attention embeddings constitute a presheaf violating the gluing axiom is presented as the central motivation, yet no explicit restriction maps, overlap diagrams, or verification that the gluing condition fails for the same physical region under different global contexts are supplied; without this derivation the sheaf framing remains an interpretive overlay rather than an independent justification for the subsequent Schrödinger Bridge construction.

Authors: We agree that the abstract omits explicit restriction maps, overlap diagrams, and direct verification of gluing failure. While Section 3 of the manuscript defines the presheaf structure on VFM embeddings, the presentation would benefit from concrete illustrations. In the revision we will add a dedicated figure and subsection that specifies the restriction maps for overlapping patches, provides an overlap diagram, and empirically verifies gluing violation by comparing embeddings of the same physical region under differing global contexts extracted from the co-pretrained backbone. This will make the sheaf framing a self-contained justification rather than an overlay. revision: yes
Referee: [Abstract] Abstract (evaluation protocol paragraph): the manuscript states that SheafStain 'demonstrates promising results against six prior methods' on stitched 1024×1024 outputs, but reports no quantitative metrics (PSNR, SSIM, FID, biomarker-specific concordance, or statistical tests), no ablation isolating the sheaf or co-pretraining components, and no verification that the cross-stain stalk parameters remain independent of the final staining result; these omissions make the empirical support for the central claim impossible to assess.

Authors: The abstract summarizes outcomes under length constraints. Section 5 of the manuscript describes the stitched 1024×1024 evaluation protocol and comparisons to six baselines, yet we acknowledge the absence of the requested quantitative metrics, ablations, and stalk-independence verification in the current version. In the revision we will expand the results section to report PSNR, SSIM, FID, biomarker-specific concordance with statistical tests, ablations that isolate the sheaf construction and co-pretraining, and an analysis confirming that cross-stain stalk parameters are independent of the generated staining output. revision: yes
Referee: [Abstract] Abstract (Schrödinger Bridge integration paragraph): the class token is said to 'anchor biological consistency' and patch tokens to 'form a per-position spatial map,' yet the manuscript supplies no equation or section demonstrating that the resulting sections satisfy the sheaf gluing axiom on overlaps or that the Schrödinger Bridge transport enforces this property beyond the co-pretraining step; this is load-bearing for the claim that the approach restores spatial and biological coherence.

Authors: We recognize that the current text does not supply an explicit equation linking the class/patch token construction to satisfaction of the gluing axiom under the Schrödinger Bridge. Section 4 describes the integration of tokens as sheaf-like sections, but the load-bearing demonstration is missing. In the revision we will insert a new equation together with a short derivation showing that the optimal transport map, when composed with the non-degenerate cross-stain stalks, produces sections that satisfy the gluing condition on overlaps; this will directly substantiate the claim that the framework restores spatial and biological coherence beyond the co-pretraining step alone. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper's derivation begins with an empirical observation of context-dependent embeddings in VFMs, formalizes this as a presheaf violating the gluing axiom, and proposes integration of class/patch tokens into a Schrödinger Bridge using a co-pretrained H&E/IHC backbone. No equations, fitted parameters, or self-citations are exhibited that reduce the central claims (sheaf sections enforcing consistency, cross-stain stalks) back to the inputs by construction. The evaluation protocol on stitched 1024×1024 images is explicitly separated from prior patch-wise methods and serves as independent empirical support. The derivation remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 1 invented entities

Central claim rests on standard sheaf axioms plus the modeling decision to treat VFM tokens as sections; co-pretraining introduces fitted parameters whose independence is not demonstrated in the abstract.

free parameters (1)

cross-stain stalk parameters
Non-degenerate stalks produced by co-pretraining on H&E and IHC are described as enabling the framework but are not shown to be parameter-free.

axioms (1)

standard math Gluing axiom of sheaf theory
Invoked to diagnose that VFM embeddings violate consistent gluing across overlapping patches.

invented entities (1)

sheaf-like sections from class and patch tokens no independent evidence
purpose: To enforce spatial and biological coherence in the Schrödinger Bridge output
New interpretive layer placed on existing VFM tokens; no independent falsifiable evidence supplied in abstract.

pith-pipeline@v0.9.1-grok · 5873 in / 1382 out tokens · 34663 ms · 2026-06-27T10:29:13.332342+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

39 extracted references

[1]

S. Liu, C. Zhu, F. Xu, X. Jia, Z. Shi, and M. Jin. BCI: Breast cancer immunohistochemical image generation through pyramid pix2pix. InCVPR Workshops, pages 1815–1824, 2022. 9

2022
[2]

F. Li, Z. Hu, W. Chen, and A. Kak. Adaptive supervised PatchNCE loss for learning H&E-to-IHC stain translation with inconsistent groundtruth image pairs. InMICCAI, 2023

2023
[3]

Klöckner, J

P. Klöckner, J. Teixeira, D. Montezuma, J.S. Cardoso, H.M. Horlings, and S.P. Oliveira. GANs vs. Diffusion Models for virtual staining with the HER2match dataset. InDeep Generative Models, pages 120–130. Springer, 2025

2025
[4]

S. Yang, D. Wei, Y . Hu, Q. Peng, H. Liu, Y . Huang, X. Wu, Y . Zheng, and L. Wang. D-VST: Diffusion transformer for pathology-correct tone-controllable cross-dye virtual staining of whole slide images. In NeurIPS, 2025

2025
[5]

B. Kim, G. Kwon, K. Kim, and J.C. Ye. Unpaired image-to-image translation via neural Schrödinger bridge. InICLR, 2024

2024
[6]

F. Qiu, Y . Zhang, Z.-L. Huang, X. Zhu, and Z. Wang. PASB: Pathology-aware Schrödinger bridge for virtual immunohistochemical staining.Medical Image Analysis, 108:103869, 2026

2026
[7]

Saurav, T.L.H

J.R. Saurav, T.L.H. Pham, P. Mukherjee, P. Yi, B.A. Orr, and J.M. Luber. UNIStainNet: Foundation-model- guided virtual staining of H&E to IHC.arXiv preprint arXiv:2603.12716, 2026

arXiv 2026
[8]

Saleem, A

A.B. Saleem, A. Ahmed, A. Behera, H. Amin, I.Y . Liao, M. Khattab, J.W. Pan, and H. Makmur. HistDiT: A structure-aware latent conditional diffusion model for high-fidelity virtual staining in histopathology. In ICPR, 2026

2026
[9]

Madar and O

O. Madar and O. Fried. Tiled Diffusion.arXiv preprint arXiv:2412.15185, 2025

arXiv 2025
[10]

Isola, J.-Y

P. Isola, J.-Y . Zhu, T. Zhou, and A.A. Efros. Image-to-image translation with conditional adversarial networks. InCVPR, 2017

2017
[11]

J.-Y . Zhu, T. Park, P. Isola, and A.A. Efros. Unpaired image-to-image translation using cycle-consistent adversarial networks. InICCV, 2017

2017
[12]

F. Chen, R. Zhang, B. Zheng, Y . Sun, J. He, and W. Qin. Pathological semantics-preserving learning for H&E-to-IHC virtual staining. InMICCAI, pages 384–394. Springer, 2024

2024
[13]

Dubey, T

S. Dubey, T. Kataria, B. Knudsen, and S.Y . Elhabian. Structural cycle GAN for virtual immunohisto- chemistry staining of gland markers in the colon. InMICCAI Workshop on Machine Learning in Medical Imaging, pages 447–456. Springer, 2023

2023
[14]

S. Liu, K. Liu, S. Margolis, W. Wu, S.R. Knezevich, D.E. Elder, M.M. Eguchi, J.G. Elmore, and L. Shapiro. Generating seamless virtual immunohistochemical whole slide images with content and color consistency. arXiv preprint arXiv:2410.01072, 2024

arXiv 2024
[15]

T. Wang, M. Wang, Z. Wang, H. Wang, Q. Xu, F. Cong, and H. Xu. ODA-GAN: Orthogonal decoupling alignment GAN assisted by weakly-supervised learning for virtual immunohistochemistry staining. In CVPR, pages 25920–25929, 2025

2025
[16]

Kataria, B

T. Kataria, B. Knudsen, and S.Y . Elhabian. StainDiffuser: MultiTask dual diffusion model for virtual staining.arXiv preprint arXiv:2403.11340, 2025

arXiv 2025
[17]

H. Xu, N. Usuyama, J. Bagga,et al.A whole-slide foundation model for digital pathology from real-world data.Nature, 630(8015):181–188, 2024

2024
[18]

R.J. Chen, T. Ding, M.Y . Lu,et al.Towards a general-purpose foundation model for computational pathology.Nature Medicine, 30(3):850–862, 2024

2024
[19]

V orontsov, A

E. V orontsov, A. Bozkurt, A. Casson,et al.A foundation model for clinical-grade computational pathology and rare cancers detection.Nature Medicine, 30:2924–2935, 2024

2024
[20]

Bodnar, F

C. Bodnar, F. Di Giovanni, B. Chamberlain, P. Liò, and M. Bronstein. Neural sheaf diffusion: A topological perspective on heterophily and oversmoothing in GNNs. InNeurIPS, 2022

2022
[21]

I. Duta, G. Cassarà, F. Silvestri, and P. Liò. Sheaf hypergraph networks. InNeurIPS, 2023

2023
[22]

Di Nino, S

L. Di Nino, S. Barbarossa, and P. Di Lorenzo. Learning sheaf Laplacian optimizing restriction maps. In Asilomar Conference on Signals, Systems, and Computers, pages 59–63, 2024

2024
[23]

Ribeiro, A.L

A. Ribeiro, A.L. Tenorio, J. Belieni,et al.Cooperative sheaf neural networks. InICLR, 2026

2026
[24]

Bredon.Sheaf Theory

G.E. Bredon.Sheaf Theory. Springer, 2nd edition, 1997. 10

1997
[25]

Ayzenberg, T

A. Ayzenberg, T. Gebhart, G. Magai, and G. Solomadin. Sheaf theory: from deep geometry to deep learning.arXiv preprint arXiv:2502.15476, 2025

arXiv 2025
[26]

Schrödinger

E. Schrödinger. Sur la théorie relativiste de l’électron et l’interprétation de la mécanique quantique.Annales de l’institut Henri Poincaré, 2(4):269–310, 1932

1932
[27]

C. Léonard. A survey of the Schrödinger problem and some of its connections with optimal transport. arXiv preprint arXiv:1308.0215, 2013

arXiv 2013
[28]

Villani.Optimal Transport: Old and New

C. Villani.Optimal Transport: Old and New. Springer, 2009

2009
[29]

Chen, T.T

Y . Chen, T.T. Georgiou, and M. Pavon. Stochastic control liaisons: Richard Sinkhorn meets Gaspard Monge on a Schrödinger bridge.SIAM Review, 63(2):249–313, 2021

2021
[30]

Ruifrok and D.A

A.C. Ruifrok and D.A. Johnston. Quantification of histochemical staining by color deconvolution.Analyti- cal and Quantitative Cytology and Histology, 23(4):291–299, 2001

2001
[31]

Kingma and J

D.P. Kingma and J. Ba. Adam: A method for stochastic optimization. InICLR, 2015

2015
[32]

Heusel, H

M. Heusel, H. Ramsauer, T. Unterthiner, B. Nessler, and S. Hochreiter. GANs trained by a two time-scale update rule converge to a local Nash equilibrium. InNeurIPS, 2017

2017
[33]

Bi ´nkowski, D.J

M. Bi ´nkowski, D.J. Sutherland, M. Arbel, and A. Gretton. Demystifying MMD GANs. InICLR, 2018

2018
[34]

Zhang, P

R. Zhang, P. Isola, A.A. Efros, E. Shechtman, and O. Wang. The unreasonable effectiveness of deep features as a perceptual metric. InCVPR, 2018

2018
[35]

K. Ding, K. Ma, S. Wang, and E.P. Simoncelli. Image quality assessment: Unifying structure and texture similarity.IEEE TPAMI, 44(5):2567–2581, 2022

2022
[36]

Wang, A.C

Z. Wang, A.C. Bovik, H.R. Sheikh, and E.P. Simoncelli. Image quality assessment: From error visibility to structural similarity.IEEE TIP, 13(4):600–612, 2004

2004
[37]

S. Modi, W. Jacot, T. Yamashita,et al.Trastuzumab deruxtecan in previously treated HER2-low advanced breast cancer.New England Journal of Medicine, 387(1):9–20, 2022

2022
[38]

Wolff, M.E.H

A.C. Wolff, M.E.H. Hammond, K.H. Allison,et al.Human epidermal growth factor receptor 2 testing in breast cancer: American Society of Clinical Oncology/College of American Pathologists clinical practice guideline focused update.Journal of Clinical Oncology, 36(20):2105–2122, 2018. 11 A Sheaf Theory Background This section expands upon the brief sheaf-theo...

2018
[39]

restriction to Ui

Gluing:If a family of local sections {si ∈ F(U i)} is pairwise compatible on overlaps, there exists a uniques∈ F(V)whose restriction to eachU i equalss i. Locality says a section is determined by its local restrictions; gluing says compatible local sections assemble into a unique section. The compatibility condition and the very notion of "restriction to ...

arXiv 2036

[1] [1]

S. Liu, C. Zhu, F. Xu, X. Jia, Z. Shi, and M. Jin. BCI: Breast cancer immunohistochemical image generation through pyramid pix2pix. InCVPR Workshops, pages 1815–1824, 2022. 9

2022

[2] [2]

F. Li, Z. Hu, W. Chen, and A. Kak. Adaptive supervised PatchNCE loss for learning H&E-to-IHC stain translation with inconsistent groundtruth image pairs. InMICCAI, 2023

2023

[3] [3]

Klöckner, J

P. Klöckner, J. Teixeira, D. Montezuma, J.S. Cardoso, H.M. Horlings, and S.P. Oliveira. GANs vs. Diffusion Models for virtual staining with the HER2match dataset. InDeep Generative Models, pages 120–130. Springer, 2025

2025

[4] [4]

S. Yang, D. Wei, Y . Hu, Q. Peng, H. Liu, Y . Huang, X. Wu, Y . Zheng, and L. Wang. D-VST: Diffusion transformer for pathology-correct tone-controllable cross-dye virtual staining of whole slide images. In NeurIPS, 2025

2025

[5] [5]

B. Kim, G. Kwon, K. Kim, and J.C. Ye. Unpaired image-to-image translation via neural Schrödinger bridge. InICLR, 2024

2024

[6] [6]

F. Qiu, Y . Zhang, Z.-L. Huang, X. Zhu, and Z. Wang. PASB: Pathology-aware Schrödinger bridge for virtual immunohistochemical staining.Medical Image Analysis, 108:103869, 2026

2026

[7] [7]

Saurav, T.L.H

J.R. Saurav, T.L.H. Pham, P. Mukherjee, P. Yi, B.A. Orr, and J.M. Luber. UNIStainNet: Foundation-model- guided virtual staining of H&E to IHC.arXiv preprint arXiv:2603.12716, 2026

arXiv 2026

[8] [8]

Saleem, A

A.B. Saleem, A. Ahmed, A. Behera, H. Amin, I.Y . Liao, M. Khattab, J.W. Pan, and H. Makmur. HistDiT: A structure-aware latent conditional diffusion model for high-fidelity virtual staining in histopathology. In ICPR, 2026

2026

[9] [9]

Madar and O

O. Madar and O. Fried. Tiled Diffusion.arXiv preprint arXiv:2412.15185, 2025

arXiv 2025

[10] [10]

Isola, J.-Y

P. Isola, J.-Y . Zhu, T. Zhou, and A.A. Efros. Image-to-image translation with conditional adversarial networks. InCVPR, 2017

2017

[11] [11]

J.-Y . Zhu, T. Park, P. Isola, and A.A. Efros. Unpaired image-to-image translation using cycle-consistent adversarial networks. InICCV, 2017

2017

[12] [12]

F. Chen, R. Zhang, B. Zheng, Y . Sun, J. He, and W. Qin. Pathological semantics-preserving learning for H&E-to-IHC virtual staining. InMICCAI, pages 384–394. Springer, 2024

2024

[13] [13]

Dubey, T

S. Dubey, T. Kataria, B. Knudsen, and S.Y . Elhabian. Structural cycle GAN for virtual immunohisto- chemistry staining of gland markers in the colon. InMICCAI Workshop on Machine Learning in Medical Imaging, pages 447–456. Springer, 2023

2023

[14] [14]

S. Liu, K. Liu, S. Margolis, W. Wu, S.R. Knezevich, D.E. Elder, M.M. Eguchi, J.G. Elmore, and L. Shapiro. Generating seamless virtual immunohistochemical whole slide images with content and color consistency. arXiv preprint arXiv:2410.01072, 2024

arXiv 2024

[15] [15]

T. Wang, M. Wang, Z. Wang, H. Wang, Q. Xu, F. Cong, and H. Xu. ODA-GAN: Orthogonal decoupling alignment GAN assisted by weakly-supervised learning for virtual immunohistochemistry staining. In CVPR, pages 25920–25929, 2025

2025

[16] [16]

Kataria, B

T. Kataria, B. Knudsen, and S.Y . Elhabian. StainDiffuser: MultiTask dual diffusion model for virtual staining.arXiv preprint arXiv:2403.11340, 2025

arXiv 2025

[17] [17]

H. Xu, N. Usuyama, J. Bagga,et al.A whole-slide foundation model for digital pathology from real-world data.Nature, 630(8015):181–188, 2024

2024

[18] [18]

R.J. Chen, T. Ding, M.Y . Lu,et al.Towards a general-purpose foundation model for computational pathology.Nature Medicine, 30(3):850–862, 2024

2024

[19] [19]

V orontsov, A

E. V orontsov, A. Bozkurt, A. Casson,et al.A foundation model for clinical-grade computational pathology and rare cancers detection.Nature Medicine, 30:2924–2935, 2024

2024

[20] [20]

Bodnar, F

C. Bodnar, F. Di Giovanni, B. Chamberlain, P. Liò, and M. Bronstein. Neural sheaf diffusion: A topological perspective on heterophily and oversmoothing in GNNs. InNeurIPS, 2022

2022

[21] [21]

I. Duta, G. Cassarà, F. Silvestri, and P. Liò. Sheaf hypergraph networks. InNeurIPS, 2023

2023

[22] [22]

Di Nino, S

L. Di Nino, S. Barbarossa, and P. Di Lorenzo. Learning sheaf Laplacian optimizing restriction maps. In Asilomar Conference on Signals, Systems, and Computers, pages 59–63, 2024

2024

[23] [23]

Ribeiro, A.L

A. Ribeiro, A.L. Tenorio, J. Belieni,et al.Cooperative sheaf neural networks. InICLR, 2026

2026

[24] [24]

Bredon.Sheaf Theory

G.E. Bredon.Sheaf Theory. Springer, 2nd edition, 1997. 10

1997

[25] [25]

Ayzenberg, T

A. Ayzenberg, T. Gebhart, G. Magai, and G. Solomadin. Sheaf theory: from deep geometry to deep learning.arXiv preprint arXiv:2502.15476, 2025

arXiv 2025

[26] [26]

Schrödinger

E. Schrödinger. Sur la théorie relativiste de l’électron et l’interprétation de la mécanique quantique.Annales de l’institut Henri Poincaré, 2(4):269–310, 1932

1932

[27] [27]

C. Léonard. A survey of the Schrödinger problem and some of its connections with optimal transport. arXiv preprint arXiv:1308.0215, 2013

arXiv 2013

[28] [28]

Villani.Optimal Transport: Old and New

C. Villani.Optimal Transport: Old and New. Springer, 2009

2009

[29] [29]

Chen, T.T

Y . Chen, T.T. Georgiou, and M. Pavon. Stochastic control liaisons: Richard Sinkhorn meets Gaspard Monge on a Schrödinger bridge.SIAM Review, 63(2):249–313, 2021

2021

[30] [30]

Ruifrok and D.A

A.C. Ruifrok and D.A. Johnston. Quantification of histochemical staining by color deconvolution.Analyti- cal and Quantitative Cytology and Histology, 23(4):291–299, 2001

2001

[31] [31]

Kingma and J

D.P. Kingma and J. Ba. Adam: A method for stochastic optimization. InICLR, 2015

2015

[32] [32]

Heusel, H

M. Heusel, H. Ramsauer, T. Unterthiner, B. Nessler, and S. Hochreiter. GANs trained by a two time-scale update rule converge to a local Nash equilibrium. InNeurIPS, 2017

2017

[33] [33]

Bi ´nkowski, D.J

M. Bi ´nkowski, D.J. Sutherland, M. Arbel, and A. Gretton. Demystifying MMD GANs. InICLR, 2018

2018

[34] [34]

Zhang, P

R. Zhang, P. Isola, A.A. Efros, E. Shechtman, and O. Wang. The unreasonable effectiveness of deep features as a perceptual metric. InCVPR, 2018

2018

[35] [35]

K. Ding, K. Ma, S. Wang, and E.P. Simoncelli. Image quality assessment: Unifying structure and texture similarity.IEEE TPAMI, 44(5):2567–2581, 2022

2022

[36] [36]

Wang, A.C

Z. Wang, A.C. Bovik, H.R. Sheikh, and E.P. Simoncelli. Image quality assessment: From error visibility to structural similarity.IEEE TIP, 13(4):600–612, 2004

2004

[37] [37]

S. Modi, W. Jacot, T. Yamashita,et al.Trastuzumab deruxtecan in previously treated HER2-low advanced breast cancer.New England Journal of Medicine, 387(1):9–20, 2022

2022

[38] [38]

Wolff, M.E.H

A.C. Wolff, M.E.H. Hammond, K.H. Allison,et al.Human epidermal growth factor receptor 2 testing in breast cancer: American Society of Clinical Oncology/College of American Pathologists clinical practice guideline focused update.Journal of Clinical Oncology, 36(20):2105–2122, 2018. 11 A Sheaf Theory Background This section expands upon the brief sheaf-theo...

2018

[39] [39]

restriction to Ui

Gluing:If a family of local sections {si ∈ F(U i)} is pairwise compatible on overlaps, there exists a uniques∈ F(V)whose restriction to eachU i equalss i. Locality says a section is determined by its local restrictions; gluing says compatible local sections assemble into a unique section. The compatibility condition and the very notion of "restriction to ...

arXiv 2036