Universal and Transferable Attacks on Pathology Foundation Models

Aydogan Ozcan; Che-Yung Shen; Nir Pillar; Xilin Yang; Yuntian Wang

arxiv: 2510.16660 · v1 · pith:BG3N25MCnew · submitted 2025-10-18 · 💻 cs.CV · cs.LG· physics.med-ph

Universal and Transferable Attacks on Pathology Foundation Models

Yuntian Wang , Xilin Yang , Che-Yung Shen , Nir Pillar , Aydogan Ozcan This is my paper

classification 💻 cs.CV cs.LGphysics.med-ph

keywords foundationpathologymodelsutapperformanceacrossmodelvarious

0 comments

read the original abstract

We introduce Universal and Transferable Adversarial Perturbations (UTAP) for pathology foundation models that reveal critical vulnerabilities in their capabilities. Optimized using deep learning, UTAP comprises a fixed and weak noise pattern that, when added to a pathology image, systematically disrupts the feature representation capabilities of multiple pathology foundation models. Therefore, UTAP induces performance drops in downstream tasks that utilize foundation models, including misclassification across a wide range of unseen data distributions. In addition to compromising the model performance, we demonstrate two key features of UTAP: (1) universality: its perturbation can be applied across diverse field-of-views independent of the dataset that UTAP was developed on, and (2) transferability: its perturbation can successfully degrade the performance of various external, black-box pathology foundation models - never seen before. These two features indicate that UTAP is not a dedicated attack associated with a specific foundation model or image dataset, but rather constitutes a broad threat to various emerging pathology foundation models and their applications. We systematically evaluated UTAP across various state-of-the-art pathology foundation models on multiple datasets, causing a significant drop in their performance with visually imperceptible modifications to the input images using a fixed noise pattern. The development of these potent attacks establishes a critical, high-standard benchmark for model robustness evaluation, highlighting a need for advancing defense mechanisms and potentially providing the necessary assets for adversarial training to ensure the safe and reliable deployment of AI in pathology.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Scalable, Energy-Efficient Optical-Neural Architecture for Multiplexed Deepfake Video Detection
cs.CV 2026-05 conditional novelty 6.0

Hybrid optical-digital architecture multiplexes 15+ video streams for parallel deepfake detection, reporting 97.79% average accuracy on Celeb-DF with resilience to degradation and attacks.
Beyond the Failures: Rethinking Foundation Models in Pathology
cs.AI 2025-10 unverdicted novelty 2.0

Foundation models stumble in pathology due to conceptual mismatches with biological tissue, requiring explicitly designed models rather than adaptations of natural-image methods.