pith. sign in

arxiv: 2606.06983 · v1 · pith:XLITWFAQnew · submitted 2026-06-05 · 📡 eess.IV · cs.AI· cs.CV

DaX: Learning General Pathology Representations Across Scales

Pith reviewed 2026-06-27 20:37 UTC · model grok-4.3

classification 📡 eess.IV cs.AIcs.CV
keywords computational pathologywhole-slide imagingself-supervised learningfoundation modelmulti-scale representationshistopathologybenchmarkDINO
0
0 comments X

The pith

DaX adapts DINOv3 self-supervised learning with continuous magnification training and cross-scale views to produce pathology representations that top a 161-task benchmark.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

DaX is a pathology vision foundation model that starts from natural-image DINOv3 weights and adds training designs to handle the wide range of magnifications, staining, and resolutions found in whole-slide histopathology images. The designs include continuous magnification training, cross-scale tissue views, orientation-agnostic augmentations, multi-input-size training, and Gram-anchored dense consistency, all intended to link fine cellular details with broader tissue structure while keeping token-level features stable as input scale changes. The authors introduce a reproducible benchmark of 161 tasks drawn from 44 public datasets across four clinical domains and evaluate every model under patient-level cross-validation. On this benchmark DaX records the highest average performance and strong per-task rankings, covering diagnostic, molecular, tissue-context, and prognostic endpoints.

Core claim

DaX achieves the highest mean performance across tasks and consistently strong task-level ranking scores on a WSI-level benchmark comprising 161 clinically meaningful tasks from 44 public datasets covering 28,182 patients and 34,394 slides, with gains spanning diagnostic pathology, biomarker and molecular profiling, tissue/specimen context, and risk, response, and prognosis.

What carries the argument

Continuous magnification training together with cross-scale tissue views and Gram-anchored dense consistency, which connects local cellular morphology with global tissue architecture while stabilizing dense token-level representations across input scales.

If this is right

  • DaX functions as a transferable visual encoder for computational pathology tasks that vary in magnification and acquisition conditions.
  • The patient-level cross-validation protocol yields more stable model rankings than random slide splits.
  • The benchmark supplies a standardized test suite covering diagnostic, biomarker, context, and prognostic categories for future pathology foundation models.
  • Representations remain robust to changes in staining, scanner type, slide preparation, and input resolution.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same scale-bridging augmentations could be tested on other self-supervised backbones to check whether the gains are architecture-specific.
  • Extending the benchmark to entirely new scanner vendors or staining protocols not present in the current 44 datasets would test whether the robustness claims hold outside the training distribution.
  • If the dense consistency term proves critical, it may offer a general way to regularize multi-resolution token models in other imaging domains that exhibit scale variation.

Load-bearing premise

The specific training designs successfully connect local cellular morphology with global tissue architecture while stabilizing dense token-level representations across input scales.

What would settle it

A controlled ablation on the same 161-task benchmark in which removing continuous magnification training or cross-scale views causes DaX to fall below the performance of unmodified DINOv3 would falsify the claim that these designs drive the observed gains.

read the original abstract

Computational pathology requires visual representations that transfer across diverse clinical endpoints and remain robust to variation in magnification, staining, scanner type, slide preparation, and input resolution. We present DaX, a pathology vision foundation model that adapts DINOv3-style self-supervised learning to whole-slide histopathology. DaX is initialized from natural-image DINOv3 weights and incorporates continuous magnification training, cross-scale tissue views, orientation-agnostic and acquisition-robust augmentation, multi-input-size training, and Gram-anchored dense consistency. These designs aim to connect local cellular morphology with global tissue architecture while stabilizing dense token-level representations across input scales. We further construct a WSI-level benchmark comprising 161 clinically meaningful tasks from 44 public datasets, covering 28,182 patients and 34,394 slides across four clinical domains and nine task categories. All models are evaluated under a fixed patient-level cross-validation protocol with fold-level statistical ranking, enabling reproducible comparisons that are less sensitive to split-dependent variation. Across this benchmark, DaX achieves the highest mean performance across tasks and consistently strong task-level ranking scores, with gains spanning diagnostic pathology, biomarker and molecular profiling, tissue/specimen context, and risk, response, and prognosis. These results support DaX as a transferable visual encoder for computational pathology and provide a standardized evaluation framework for future pathology foundation models. Project page: https://alibaba-damo-academy.github.io/DaX/benchboard/.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 0 minor

Summary. The manuscript presents DaX, a pathology vision foundation model initialized from natural-image DINOv3 weights and adapted via continuous magnification training, cross-scale tissue views, orientation-agnostic and acquisition-robust augmentation, multi-input-size training, and Gram-anchored dense consistency. It constructs a WSI-level benchmark of 161 clinically meaningful tasks drawn from 44 public datasets (28,182 patients, 34,394 slides) spanning four clinical domains and nine task categories, evaluates all models under a fixed patient-level cross-validation protocol with fold-level statistical ranking, and reports that DaX attains the highest mean performance across tasks together with consistently strong task-level rankings.

Significance. If the reported performance gains are shown to arise from the listed training designs rather than data leakage and if the numerical results, ablations, and task-selection details are supplied, the work would supply both a transferable visual encoder for computational pathology and a standardized, reproducible evaluation framework that reduces sensitivity to split variation.

major comments (2)
  1. [Abstract] Abstract: the central claim that DaX achieves the highest mean performance across the 161 tasks is stated without any numerical values, error bars, ablation results, or description of how the 161 tasks were chosen or balanced; this prevents verification of the magnitude or robustness of the reported gains.
  2. [Abstract] Abstract: no information is supplied on pretraining corpus size, sources, or patient-level exclusion criteria ensuring disjointness from the 44 evaluation datasets. Because public pathology datasets frequently overlap, any shared slides or patients would allow exploitation of dataset-specific artifacts, directly undermining attribution of the ranking gains to the proposed designs (continuous magnification training, cross-scale views, etc.).

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address each major comment point by point below, indicating revisions to the manuscript.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the central claim that DaX achieves the highest mean performance across the 161 tasks is stated without any numerical values, error bars, ablation results, or description of how the 161 tasks were chosen or balanced; this prevents verification of the magnitude or robustness of the reported gains.

    Authors: We agree that the abstract would benefit from additional quantitative context. In the revised manuscript we have added the mean performance value across the 161 tasks together with a brief statement on the fold-level statistical ranking. Full numerical results, error bars, ablation studies, and the task-selection and balancing procedure are already detailed in Sections 3 and 4; we have also inserted a cross-reference in the abstract directing readers to these sections and the supplementary benchmark table. revision: yes

  2. Referee: [Abstract] Abstract: no information is supplied on pretraining corpus size, sources, or patient-level exclusion criteria ensuring disjointness from the 44 evaluation datasets. Because public pathology datasets frequently overlap, any shared slides or patients would allow exploitation of dataset-specific artifacts, directly undermining attribution of the ranking gains to the proposed designs (continuous magnification training, cross-scale views, etc.).

    Authors: We recognize the validity of this concern. The model is initialized from DINOv3 weights pretrained solely on natural-image corpora (ImageNet and related datasets) that contain no pathology slides. For the subsequent pathology adaptation stage we have now added, in the revised Methods section, the exact size and sources of the pretraining corpus together with the explicit patient-level exclusion protocol that removes every patient appearing in the 44 evaluation datasets. This documentation ensures that performance differences can be attributed to the listed training designs rather than data leakage. revision: yes

Circularity Check

0 steps flagged

No circularity; performance claims rest on external benchmark evaluation

full rationale

The paper describes an adaptation of DINOv3 self-supervised learning with listed training designs (continuous magnification, cross-scale views, augmentations, multi-size training, Gram consistency) and reports results on a separately constructed benchmark of 161 tasks drawn from 44 public datasets under patient-level cross-validation. No equations, fitted parameters, or self-citations are presented that would make the reported rankings or mean performance reduce to the training designs by construction. The evaluation protocol is external to the pretraining objective, and the central claim is an empirical ranking rather than a derived identity. This is the normal non-circular case for an empirical foundation-model paper.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review; no explicit free parameters, axioms, or invented entities are stated. The model relies on standard self-supervised learning assumptions and the new benchmark construction, but these are not itemized.

pith-pipeline@v0.9.1-grok · 5800 in / 1214 out tokens · 22039 ms · 2026-06-27T20:37:09.326921+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

79 extracted references · 13 canonical work pages · 2 internal anchors

  1. [1]

    Digital pathology and artificial intelligence

    Muhammad Khalid Khan Niazi, Anil V Parwani, and Metin N Gurcan. Digital pathology and artificial intelligence. The lancet oncology , 20(5):e253–e261, 2019

  2. [2]

    Artificial intelligence for digital and computational pathology

    Andrew H Song, Guillaume Jaume, Drew FK Williamson, Ming Y Lu, Anurag Vaidya, Tiffany R Miller, and Faisal Mahmood. Artificial intelligence for digital and computational pathology. Nature Reviews Bioengineering, 1(12):930–949, 2023

  3. [3]

    A survey of pathology foundation model: Progress and future directions

    Conghao Xiong, Hao Chen, and Joseph JY Sung. A survey of pathology foundation model: Progress and future directions. arXiv preprint arXiv:2504.04045 , 2025

  4. [4]

    Towards a general-purpose foundation model for computational pathology

    Richard J Chen, Tong Ding, Ming Y Lu, Drew FK Williamson, Guillaume Jaume, Andrew H Song, Bowen Chen, Andrew Zhang, Daniel Shao, Muhammad Shaban, et al. Towards a general-purpose foundation model for computational pathology. Nature medicine, 30(3):850–862, 2024

  5. [5]

    A whole-slide foundation model for digital pathology from real-world data

    Hanwen Xu, Naoto Usuyama, Jaspreet Bagga, Sheng Zhang, Rajesh Rao, Tristan Naumann, Cliff Wong, Zelalem Gero, Javier González, Yu Gu, et al. A whole-slide foundation model for digital pathology from real-world data. Nature, 630(8015):181–188, 2024

  6. [6]

    A foundation model for clinical-grade computational pathology and rare cancers detection

    Eugene Vorontsov, Alican Bozkurt, Adam Casson, George Shaikovski, Michal Zelechowski, Kristen Sev- erson, Eric Zimmermann, James Hall, Neil Tenenholtz, Nicolo Fusi, et al. A foundation model for clinical-grade computational pathology and rare cancers detection. Nature medicine, 30(10):2924–2935, 2024

  7. [7]

    Phikon-v2, a large and public feature extractor for biomarker prediction

    Alexandre Filiot, Paul Jacob, Alice Mac Kain, and Charlie Saillard. Phikon-v2, a large and public feature extractor for biomarker prediction. arXiv preprint arXiv:2409.09173 , 2024

  8. [8]

    A visual–language foundation model for pathology image analysis using medical twitter

    Zhi Huang, Federico Bianchi, Mert Yuksekgonul, Thomas J Montine, and James Zou. A visual–language foundation model for pathology image analysis using medical twitter. Nature medicine, 29(9):2307–2316, 2023

  9. [9]

    A visual-language foundation model for computational pathology

    Ming Y Lu, Bowen Chen, Drew FK Williamson, Richard J Chen, Ivy Liang, Tong Ding, Guillaume Jaume, Igor Odintsov, Long Phi Le, Georg Gerber, et al. A visual-language foundation model for computational pathology. Nature medicine, 30(3):863–874, 2024

  10. [10]

    A vision–language foundation model for precision oncology

    Jinxi Xiang, Xiyue Wang, Xiaoming Zhang, Yinghua Xi, Feyisope Eweje, Yijiang Chen, Yuchen Li, Colin Bergstrom, Matthew Gopaulchan, Ted Kim, et al. A vision–language foundation model for precision oncology. Nature, 638(8051):769–778, 2025

  11. [11]

    A multimodal knowledge-enhanced whole-slide pathology founda- tion model

    Yingxue Xu, Yihui Wang, Fengtao Zhou, Jiabo Ma, Cheng Jin, Shu Yang, Jinbang Li, Zhengyu Zhang, Chenglong Zhao, Huajun Zhou, et al. A multimodal knowledge-enhanced whole-slide pathology founda- tion model. Nature Communications, 2025

  12. [12]

    A generalizable pathology foundation model using a unified knowledge distillation pretraining framework

    Jiabo Ma, Zhengrui Guo, Fengtao Zhou, Yihui Wang, Yingxue Xu, Jinbang Li, Fang Yan, Yu Cai, Zhengjie Zhu, Cheng Jin, et al. A generalizable pathology foundation model using a unified knowledge distillation pretraining framework. Nature Biomedical Engineering , 10(3):545–564, 2026

  13. [13]

    DINOv2: Learning Robust Visual Features without Supervision

    Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, et al. Dinov2: Learning robust visual features without supervision. arXiv preprint arXiv:2304.07193 , 2023

  14. [14]

    Mind the gap: Continuous magnification sampling for pathology foundation models

    Alexander Möllers, Julius Hense, Florian Schulz, Timo Milbich, Maximilian Alber, and Lukas Ruff. Mind the gap: Continuous magnification sampling for pathology foundation models. arXiv preprint arXiv:2601.02198, 2026

  15. [15]

    Benchmark- ing foundation models as feature extractors for weakly supervised computational pathology

    Peter Neidlinger, Omar SM El Nahhas, Hannah Sophie Muti, Tim Lenz, Michael Hoffmeister, Hermann Brenner, Marko van Treeck, Rupert Langer, Bastian Dislich, Hans Michael Behrens, et al. Benchmark- ing foundation models as feature extractors for weakly supervised computational pathology. Nature biomedical engineering, pages 1–11, 2025

  16. [16]

    Oriane Siméoni, Huy V Vo, Maximilian Seitzer, Federico Baldassarre, Maxime Oquab, Cijo Jose, Vasil Khalidov, Marc Szafraniec, Seungeun Yi, Michaël Ramamonjisoa, et al. Dinov3. arXiv preprint arXiv:2508.10104, 2025

  17. [17]

    The cancer genome atlas pan-cancer analysis project

    John N Weinstein, Eric A Collisson, Gordon B Mills, Kenna R Shaw, Brad A Ozenberger, Kyle Ellrott, Ilya Shmulevich, Chris Sander, and Joshua M Stuart. The cancer genome atlas pan-cancer analysis project. Nature genetics , 45(10):1113–1120, 2013

  18. [18]

    The genotype- tissue expression (gtex) pilot analysis: multitissue gene regulation in humans

    GTEx Consortium, Kristin G Ardlie, David S Deluca, Ayellet V Segrè, Timothy J Sullivan, Taylor R Young, Ellen T Gelfand, Casandra A Trowbridge, Julian B Maller, Taru Tukiainen, et al. The genotype- tissue expression (gtex) pilot analysis: multitissue gene regulation in humans. Science, 348(6235): 648–660, 2015

  19. [19]

    Histai: an open-source, large-scale whole slide image dataset for computational pathology

    Dmitry Nechaev, Alexey Pchelnikov, and Ekaterina Ivanova. Histai: an open-source, large-scale whole slide image dataset for computational pathology. arXiv preprint arXiv:2505.12120 , 2025

  20. [20]

    Towards robust foundation models for digital pathology

    Jonah Kömen, Edwin D de Jong, Julius Hense, Hannah Marienwald, Jonas Dippel, Philip Naumann, Eric Marcus, Lukas Ruff, Maximilian Alber, Jonas Teuwen, et al. Towards robust foundation models for digital pathology. arXiv preprint arXiv:2507.17845 , 2025

  21. [21]

    Towards large-scale training of pathology foundation models

    Nanne Aben, Edwin D de Jong, Ioannis Gatopoulos, Nicolas Känzig, Mikhail Karasikov, Axel Lagré, Roman Moser, Joost van Doorn, Fei Tang, et al. Towards large-scale training of pathology foundation models. arXiv preprint arXiv:2404.15217 , 2024

  22. [22]

    Rotation-agnostic image representation learning for digital pathology

    Saghir Alfasly, Abubakr Shafique, Peyman Nejat, Jibran Khan, Areej Alsaafin, Ghazal Alabtah, and Hamid R Tizhoosh. Rotation-agnostic image representation learning for digital pathology. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages 11683–11693, 2024

  23. [23]

    Attention-based deep multiple instance learning

    Maximilian Ilse, Jakub Tomczak, and Max Welling. Attention-based deep multiple instance learning. In International conference on machine learning , pages 2127–2136. PMLR, 2018

  24. [24]

    A clinical benchmark of public self-supervised pathology foundation models

    Gabriele Campanella, Shengjia Chen, Manbir Singh, Ruchika Verma, Silke Muehlstedt, Jennifer Zeng, Aryeh Stock, Matt Croken, Brandon Veremis, Abdulkadir Elmas, et al. A clinical benchmark of public self-supervised pathology foundation models. Nature Communications, 16(1):3640, 2025

  25. [25]

    Multi- omics in primary and metastatic breast tumors from the aurora us network finds microenvironment and epigenetic drivers of metastasis

    Susana Garcia-Recio, Toshinori Hinoue, Gregory L Wheeler, Benjamin J Kelly, Ana C Garrido-Castro, Tomas Pascual, Aguirre A De Cubas, Youli Xia, Brooke M Felsheim, Marni B McClure, et al. Multi- omics in primary and metastatic breast tumors from the aurora us network finds microenvironment and epigenetic drivers of metastasis. Nature cancer, 4(1):128–147, 2023

  26. [26]

    Multi-omic machine learning predictor of breast cancer therapy response

    Stephen-John Sammut, Mireia Crispin-Ortuzar, Suet-Feung Chin, Elena Provenzano, Helen A Bardwell, Wenxin Ma, Wei Cope, Ali Dariush, Sarah-Jane Dawson, Jean E Abraham, et al. Multi-omic machine learning predictor of breast cancer therapy response. Nature, 601(7894):623–629, 2022

  27. [27]

    Predicting axillary lymph node metastasis in early breast cancer using deep learning on primary tumor biopsy slides

    Feng Xu, Chuang Zhu, Wenqi Tang, Ying Wang, Yu Zhang, Jie Li, Hongchuan Jiang, Zhongyue Shi, Jun Liu, and Mulan Jin. Predicting axillary lymph node metastasis in early breast cancer using deep learning on primary tumor biopsy slides. Frontiers in oncology, 11:759007, 2021

  28. [28]

    Multimodal data integration using machine learning improves risk stratification of high-grade serous ovarian cancer

    Kevin M Boehm, Emily A Aherne, Lora Ellenson, Ines Nikolovski, Mohammed Alghamdi, Ignacio Vázquez-García, Dmitriy Zamarin, Kara Long Roche, Ying Liu, Druv Patel, et al. Multimodal data integration using machine learning improves risk stratification of high-grade serous ovarian cancer. Nature cancer, 3(6):723–733, 2022

  29. [29]

    Bracs: A dataset for breast carcinoma subtyping in h&e histology images

    Nadia Brancati, Anna Maria Anniciello, Pushpak Pati, Daniel Riccio, Giosuè Scognamiglio, Guillaume Jaume, Giuseppe De Pietro, Maurizio Di Bonito, Antonio Foncubierta, Gerardo Botti, et al. Bracs: A dataset for breast carcinoma subtyping in h&e histology images. Database, 2022:baac093, 2022

  30. [30]

    Matched mri, seg- mentations, and histopathologic images of brain metastases from primary lung cancer

    Saahil Chadha, Durga V Sritharan, Darin Dolezal, Sampada Chande, Thomas Hager, Khaled Bous- abarah, Mariam S Aboian, Veronica Chiang, MingDe Lin, Don X Nguyen, et al. Matched mri, seg- mentations, and histopathologic images of brain metastases from primary lung cancer. Scientific Data , 2026

  31. [31]

    Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer

    Babak Ehteshami Bejnordi, Mitko Veta, Paul Johannes van Diest, Bram Van Ginneken, Nico Karssemei- jer, Geert Litjens, Jeroen A WM Van Der Laak, CAMELYON16 consortium, Meyke Hermsen, Quirine F Manson, et al. Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. Jama, 318(22):2199–2210, 2017

  32. [32]

    From detection of individual metastases to classification of lymph node status at the patient level: the came- lyon17 challenge

    Peter Bandi, Oscar Geessink, Quirine Manson, Marcory Van Dijk, Maschenka Balkenhol, Meyke Hermsen, Babak Ehteshami Bejnordi, Byungjae Lee, Kyunghyun Paeng, Aoxiao Zhong, et al. From detection of individual metastases to classification of lymph node status at the patient level: the came- lyon17 challenge. IEEE transactions on medical imaging , 38(2):550–560, 2018

  33. [33]

    Combining histology, medical imaging and molecular data for medical prognosis and diagnosis (chimera), 2025

    Daan Schouten, Robert Spaans, Khrystyna Faryna, Nadieh Khalili, Geert Litjens, Bas Dille, Catherine Chia, Pierpaolo Vendittelli, and Tahlita Zuiverloon. Combining histology, medical imaging and molecular data for medical prognosis and diagnosis (chimera), 2025. MICCAI 2025 Challenge

  34. [34]

    Cancer moonshot biobank (cmb)

    Cancer Moonshot Biobank. Cancer moonshot biobank (cmb). https://www.cancerimagingarchive. net/research/cmb, 2024. [Data set]

  35. [35]

    Proteogenomic data and resources for pan-cancer analysis

    Yize Li, Yongchao Dou, Felipe Da Veiga Leprevost, Yifat Geffen, Anna P Calinawan, François Aguet, Yo Akiyama, Shankara Anand, Chet Birger, Song Cao, et al. Proteogenomic data and resources for pan-cancer analysis. Cancer cell , 41(8):1397–1406, 2023

  36. [36]

    Proteogenomic landscape of breast cancer tumorigenesis and targeted therapy

    Karsten Krug, Eric J Jaehnig, Shankha Satpathy, Lili Blumenberg, Alla Karpova, Meenakshi Anurag, George Miles, Philipp Mertins, Yifat Geffen, Lauren C Tang, et al. Proteogenomic landscape of breast cancer tumorigenesis and targeted therapy. Cell, 183(5):1436–1456, 2020

  37. [37]

    Integrated proteogenomic characterization of clear cell renal cell carcinoma

    David J Clark, Saravana M Dhanasekaran, Francesca Petralia, Jianbo Pan, Xiaoyu Song, Yingwei Hu, Felipe da Veiga Leprevost, Boris Reva, Tung-Shing M Lih, Hui-Yin Chang, et al. Integrated proteogenomic characterization of clear cell renal cell carcinoma. Cell, 179(4):964–983, 2019

  38. [38]

    Proteogenomic analysis of human colon cancer reveals new therapeutic opportunities

    Suhas Vasaikar, Chen Huang, Xiaojing Wang, Vladislav A Petyuk, Sara R Savage, Bo Wen, Yongchao Dou, Yun Zhang, Zhiao Shi, Osama A Arshad, et al. Proteogenomic analysis of human colon cancer reveals new therapeutic opportunities. Cell, 177(4):1035–1049, 2019

  39. [39]

    Proteogenomic and metabolomic characterization of human glioblastoma

    Liang-Bo Wang, Alla Karpova, Marina A Gritsenko, Jennifer E Kyle, Song Cao, Yize Li, Dmitry Rykunov, Antonio Colaprico, Joseph H Rothstein, Runyu Hong, et al. Proteogenomic and metabolomic characterization of human glioblastoma. Cancer cell , 39(4):509–528, 2021

  40. [40]

    Proteogenomic insights into the biology and treatment of hpv-negative head and neck squamous cell carcinoma

    Chen Huang, Lijun Chen, Sara R Savage, Rodrigo Vargas Eguez, Yongchao Dou, Yize Li, Felipe da Veiga Leprevost, Eric J Jaehnig, Jonathan T Lei, Bo Wen, et al. Proteogenomic insights into the biology and treatment of hpv-negative head and neck squamous cell carcinoma. Cancer cell , 39(3): 361–379, 2021

  41. [41]

    A proteogenomic portrait of lung squamous cell carcinoma

    Shankha Satpathy, Karsten Krug, Pierre M Jean Beltran, Sara R Savage, Francesca Petralia, Chandan Kumar-Sinha, Yongchao Dou, Boris Reva, M Harry Kane, Shayan C A vanessian, et al. A proteogenomic portrait of lung squamous cell carcinoma. Cell, 184(16):4348–4371, 2021

  42. [42]

    Proteogenomic characterization reveals therapeutic vulnerabilities in lung adenocarcinoma

    Michael A Gillette, Shankha Satpathy, Song Cao, Saravana M Dhanasekaran, Suhas V Vasaikar, Karsten Krug, Francesca Petralia, Yize Li, Wen-Wei Liang, Boris Reva, et al. Proteogenomic characterization reveals therapeutic vulnerabilities in lung adenocarcinoma. Cell, 182(1):200–225, 2020

  43. [43]

    Integrated proteomic and glycoproteomic character- ization of human high-grade serous ovarian carcinoma

    Yingwei Hu, Jianbo Pan, Punit Shah, Minghui Ao, Stefani N Thomas, Yang Liu, Lijun Chen, Michael Schnaubelt, David J Clark, Henry Rodriguez, et al. Integrated proteomic and glycoproteomic character- ization of human high-grade serous ovarian carcinoma. Cell reports , 33(3), 2020

  44. [44]

    Proteogenomic characterization of pancreatic ductal adenocarcinoma

    Liwei Cao, Chen Huang, Daniel Cui Zhou, Yingwei Hu, T Mamie Lih, Sara R Savage, Karsten Krug, David J Clark, Michael Schnaubelt, Lijun Chen, et al. Proteogenomic characterization of pancreatic ductal adenocarcinoma. Cell, 184(19):5031–5052, 2021

  45. [45]

    Proteogenomic characteri- zation of endometrial carcinoma

    Yongchao Dou, Emily A Kawaler, Daniel Cui Zhou, Marina A Gritsenko, Chen Huang, Lili Blumenberg, Alla Karpova, Vladislav A Petyuk, Sara R Savage, Shankha Satpathy, et al. Proteogenomic characteri- zation of endometrial carcinoma. Cell, 180(4):729–748, 2020

  46. [46]

    Pathologist-level classification of histologic patterns on resected lung adenocarcinoma slides with deep neural networks

    Jason W Wei, Laura J Tafe, Yevgeniy A Linnik, Louis J Vaickus, Naofumi Tomita, and Saeed Hassanpour. Pathologist-level classification of histologic patterns on resected lung adenocarcinoma slides with deep neural networks. Scientific reports, 9(1):3358, 2019

  47. [47]

    Sebastian Fernandez-Pol, Yasodha Natkunam, Damir Vrabac, Rebecca Rojansky, Ranjana Advani, Pranav Rajpurkar, S, and Andrew Y. Ng. H&E and immunohistochemical stain images of 209 cases of diffuse large B-cell lymphoma linked with cytogenetic features and clinical outcomes, 2022. URL https://doi.org/10.7937/NVA3-N783. [Data set]

  48. [48]

    The digital brain tumour atlas, an open histopathology resource

    Thomas Roetzer-Pejrimovsky, Anna-Christina Moser, Baran Atli, Clemens Christian Vogel, Petra A Mercea, Romana Prihoda, Ellen Gelpi, Christine Haberler, Romana Höftberger, Johannes A Hainfellner, et al. The digital brain tumour atlas, an open histopathology resource. Scientific Data , 9(1):55, 2022

  49. [49]

    A multimodal dataset for precision oncology in head and neck cancer

    Marion Dörrich, Matthias Balk, Tatjana Heusinger, Sandra Beyer, Hamed Mirbagheri, David J Fischer, Hassan Kanso, Christian Matek, Arndt Hartmann, Heinrich Iro, et al. A multimodal dataset for precision oncology in head and neck cancer. Nature Communications, 16(1):7163, 2025

  50. [50]

    Deep learning trained on hematoxylin and eosin tumor region of interest predicts her2 status and trastuzumab treatment response in her2+ breast cancer

    Saman Farahmand, Aileen I Fernandez, Fahad Shabbir Ahmed, David L Rimm, Jeffrey H Chuang, Emily Reisenbichler, and Kourosh Zarringhalam. Deep learning trained on hematoxylin and eosin tumor region of interest predicts her2 status and trastuzumab treatment response in her2+ breast cancer. Modern Pathology, 35(1):44–51, 2022

  51. [51]

    Huncrc: annotated pathological slides to enhance deep learning applications in colorectal cancer screening

    Bálint Ármin Pataki, Alex Olar, Dezső Ribli, Adrián Pesti, Endre Kontsek, Benedek Gyöngyösi, Ágnes Bilecz, Tekla Kovács, Kristóf Attila Kovács, Zsófia Kramer, et al. Huncrc: annotated pathological slides to enhance deep learning applications in colorectal cancer screening. Scientific Data , 9(1):370, 2022

  52. [53]

    An interpretable machine learning system for colorectal cancer diagnosis from pathology slides

    Pedro C Neto, Diana Montezuma, Sara P Oliveira, Domingos Oliveira, João Fraga, Ana Monteiro, João Monteiro, Liliana Ribeiro, Sofia Gonçalves, Stefan Reinhard, et al. An interpretable machine learning system for colorectal cancer diagnosis from pathology slides. NPJ precision oncology , 8(1):56, 2024

  53. [54]

    Ipd-brain: An indian histopathology dataset for glioma subtype classification

    Ekansh Chauhan, Amit Sharma, Megha S Uppin, Manasa Kondamadugu, CV Jawahar, and PK Vinod. Ipd-brain: An indian histopathology dataset for glioma subtype classification. Scientific Data , 11(1): 1403, 2024

  54. [55]

    Deep learning artificial intelligence predicts homologous recombination de- ficiency and platinum response from histologic slides

    Erik N Bergstrom, Ammal Abbasi, Marcos Díaz-Gay, Loïck Galland, Sylvain Ladoire, Scott M Lippman, and Ludmil B Alexandrov. Deep learning artificial intelligence predicts homologous recombination de- ficiency and platinum response from histologic slides. Journal of Clinical Oncology , 42(30):3550–3560, 2024

  55. [56]

    Intratumoral resolution of driver gene mutation heterogeneity in renal cancer using deep learning

    Paul H Acosta, Vandana Panwar, Vipul Jarmale, Alana Christie, Jay Jasti, Vitaly Margulis, Dinesh Rakheja, John Cheville, Bradley C Leibovich, Alexander Parker, et al. Intratumoral resolution of driver gene mutation heterogeneity in renal cancer using deep learning. Cancer research, 82(15):2792–2806, 2022

  56. [57]

    Nascent prostate cancer heterogeneity drives evolution and resistance to intense hormonal therapy

    Scott Wilkinson, Huihui Ye, Fatima Karzai, Stephanie A Harmon, Nicholas T Terrigino, David J Van- derWeele, John R Bright, Rayann Atway, Shana Y Trostel, Nicole V Carrabba, et al. Nascent prostate cancer heterogeneity drives evolution and resistance to intense hormonal therapy. European urology, 80 (6):746–757, 2021

  57. [58]

    Reduced lung-cancer mortality with low-dose computed tomographic screening

    National Lung Screening Trial Research Team. Reduced lung-cancer mortality with low-dose computed tomographic screening. New England Journal of Medicine , 365(5):395–409, 2011

  58. [59]

    Histopathological whole slide image dataset for classification of treatment effectiveness to ovarian cancer

    Ching-Wei Wang, Cheng-Chang Chang, Muhammad Adil Khalil, Yi-Jia Lin, Yi-An Liou, Po-Chao Hsu, Yu-Ching Lee, Chih-Hung Wang, and Tai-Kuang Chao. Histopathological whole slide image dataset for classification of treatment effectiveness to ovarian cancer. Scientific Data , 9(1):25, 2022

  59. [60]

    Artificial intelligence for diagnosis and gleason grading of prostate cancer: the panda challenge

    Wouter Bulten, Kimmo Kartasalo, Po-Hsuan Cameron Chen, Peter Ström, Hans Pinckaers, Kunal Nagpal, Yuannan Cai, David F Steiner, Hester Van Boven, Robert Vink, et al. Artificial intelligence for diagnosis and gleason grading of prostate cancer: the panda challenge. Nature medicine, 28(1):154–163, 2022

  60. [61]

    Automatic cellularity assessment from post-treated breast surgical specimens

    Mohammad Peikari, Sherine Salama, Sharon Nofech-Mozes, and Anne L Martel. Automatic cellularity assessment from post-treated breast surgical specimens. Cytometry Part A , 91(11):1078–1087, 2017

  61. [62]

    Proteogenomic analysis of chemo-refractory high-grade serous ovarian cancer

    Shrabanti Chowdhury, Jacob J Kennedy, Richard G Ivey, Oscar D Murillo, Noshad Hosseini, Xiaoyu Song, Francesca Petralia, Anna Calinawan, Sara R Savage, Anna B Berry, et al. Proteogenomic analysis of chemo-refractory high-grade serous ovarian cancer. Cell, 186(16):3476–3498, 2023

  62. [63]

    Clinical-grade computational pathology using weakly supervised deep learning on whole slide images

    Gabriele Campanella, Matthew G Hanna, Luke Geneslaw, Allen Miraflor, Vitor Werneck Krauss Silva, Klaus J Busam, Edi Brogi, Victor E Reuter, David S Klimstra, and Thomas J Fuchs. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nature medicine, 25(8):1301–1309, 2019

  63. [64]

    Surgen: 1020 h&e-stained whole-slide images with survival and genetic markers

    Craig Myles, In Hwa Um, Craig Marshall, David Harris-Birtill, and David J Harrison. Surgen: 1020 h&e-stained whole-slide images with survival and genetic markers. GigaScience, 14:giaf086, 2025

  64. [65]

    Immune surveillance in clinical regression of preinvasive squamous cell lung cancer

    Adam Pennycuick, Vitor H Teixeira, Khalid AbdulJabbar, Shan E Ahmed Raza, Tom Lund, Ayse U Akarca, Rachel Rosenthal, Lukas Kalinke, Deepak P Chandrasekharan, Christodoulos P Pipinikas, et al. Immune surveillance in clinical regression of preinvasive squamous cell lung cancer. Cancer Discovery, 10(10):1489–1499, 2020

  65. [66]

    Compu- tational pathology applied to clinical colorectal cancer cohorts identifies immune and endothelial cell spatial patterns predictive of outcome

    Nicholas Trahearn, Chirine Sakr, Abhirup Banerjee, Seung Hyun Lee, Ann-Marie Baker, Hemant M Kocher, Valentina Angerilli, Federica Morano, Francesca Bergamo, Giulia Maddalena, et al. Compu- tational pathology applied to clinical colorectal cancer cohorts identifies immune and endothelial cell spatial patterns predictive of outcome. The Journal of Patholog...

  66. [67]

    VisioMel database: predicting cutaneous melanoma relapse, 2025

    Société Française de Pathologie. VisioMel database: predicting cutaneous melanoma relapse, 2025. URL https://doi.org/10.60597/wsg0-r316. [Data set]

  67. [68]

    H-optimus-1, 2025

    Bioptimus. H-optimus-1, 2025. URL https://huggingface.co/bioptimus/H-optimus-1

  68. [69]

    H-optimus-0, 2024

    Charlie Saillard, Rodolphe Jenatton, Felipe Llinares-López, Zelda Mariet, David Cahané, Eric Durand, and Jean-Philippe Vert. H-optimus-0, 2024. URL https://github.com/bioptimus/releases/tree/ main/models/h-optimus/v0

  69. [70]

    Virchow2: Scaling self- supervised mixed magnification models in pathology

    Eric Zimmermann, Eugene Vorontsov, Julian Viret, Adam Casson, Michal Zelechowski, George Shaikovski, Neil Tenenholtz, James Hall, David Klimstra, Razik Yousfi, et al. Virchow2: Scaling self- supervised mixed magnification models in pathology. arXiv preprint arXiv:2408.00738 , 2024

  70. [71]

    Hibou: A family of foundational vision transformers for pathology

    Dmitry Nechaev, Alexey Pchelnikov, and Ekaterina Ivanova. Hibou: A family of foundational vision transformers for pathology. arXiv preprint arXiv:2406.05074 , 2024

  71. [72]

    Exaone path 2.5: Pathology foundation model with multi-omics alignment

    Juseung Yun, Sunwoo Yu, Sumin Ha, Jonghyun Kim, Janghyeon Lee, Jongseong Jang, and Soony- oung Lee. Exaone path 2.5: Pathology foundation model with multi-omics alignment. arXiv preprint arXiv:2512.14019, 2025

  72. [73]

    Scaling self-supervised learning for histopathol- ogy with masked image modeling

    Alexandre Filiot, Ridouane Ghermi, Antoine Olivier, Paul Jacob, Lucas Fidon, Axel Camara, Alice Mac Kain, Charlie Saillard, and Jean-Baptiste Schiratti. Scaling self-supervised learning for histopathol- ogy with masked image modeling. MedRxiv, pages 2023–07, 2023

  73. [74]

    A pathology foundation model for cancer diagnosis and prognosis prediction

    Xiyue Wang, Junhan Zhao, Eliana Marostica, Wei Yuan, Jietian Jin, Jiayu Zhang, Ruijiang Li, Hongping Tang, Kanran Wang, Yu Li, et al. A pathology foundation model for cancer diagnosis and prognosis prediction. Nature, 634(8035):970–978, 2024

  74. [75]

    Benchmarking self- supervised learning on diverse pathology datasets

    Mingu Kang, Heon Song, Seonwook Park, Donggeun Yoo, and Sérgio Pereira. Benchmarking self- supervised learning on diverse pathology datasets. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages 3344–3354, 2023

  75. [76]

    A multimodal whole-slide foundation model for pathology

    Tong Ding, Sophia J Wagner, Andrew H Song, Richard J Chen, Ming Y Lu, Andrew Zhang, Anurag J Vaidya, Guillaume Jaume, Muhammad Shaban, Ahrong Kim, et al. A multimodal whole-slide foundation model for pathology. Nature medicine, pages 1–13, 2025

  76. [77]

    A multimodal biomedical foundation model trained from fifteen million image–text pairs

    Sheng Zhang, Yanbo Xu, Naoto Usuyama, Hanwen Xu, Jaspreet Bagga, Robert Tinn, Sam Preston, Rajesh Rao, Mu Wei, Naveen Valluri, et al. A multimodal biomedical foundation model trained from fifteen million image–text pairs. Nejm Ai , 2(1):AIoa2400640, 2025

  77. [78]

    A visual–omics foundation model to bridge histopathology with spatial transcriptomics

    Weiqing Chen, Pengzhi Zhang, Tu N Tran, Yiwei Xiao, Shengyu Li, Vrutant V Shah, Hao Cheng, Kristo- pher W Brannan, Keith Youker, Li Lai, et al. A visual–omics foundation model to bridge histopathology with spatial transcriptomics. Nature Methods, 22(7):1568–1582, 2025

  78. [79]

    Emerging properties in self-supervised vision transformers

    Mathilde Caron, Hugo Touvron, Ishan Misra, Hervé Jégou, Julien Mairal, Piotr Bojanowski, and Armand Joulin. Emerging properties in self-supervised vision transformers. In Proceedings of the IEEE/CVF international conference on computer vision , pages 9650–9660, 2021

  79. [80]

    Deep residual learning for image recognition

    Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition , pages 770–778, 2016. Appendix Table 1. Two-level task taxonomy and complete task inventory of the DaX benchmark. A Hierarchical abbreviation map of the benchmark task taxono...