Explainability in Generative Medical Diffusion Models: A Faithfulness-Based Analysis on MRI Synthesis

Pallabi Saikia; Surjo Dey

arxiv: 2602.09781 · v2 · submitted 2026-02-10 · 💻 cs.LG · cs.AI

Explainability in Generative Medical Diffusion Models: A Faithfulness-Based Analysis on MRI Synthesis

Surjo Dey , Pallabi Saikia This is my paper

Pith reviewed 2026-05-16 02:35 UTC · model grok-4.3

classification 💻 cs.LG cs.AI

keywords explainabilitydiffusion modelsMRI synthesisfaithfulness analysisprototype networksgenerative AImedical imagingdenoising trajectory

0 comments

The pith

Diffusion models generating MRI scans can be explained by measuring how faithfully prototype networks match their denoising steps to training features, with Enhanced ProtoPNet scoring highest.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests whether prototype-based explainers can open up the black box of diffusion models that create synthetic MRI images by tracing their step-by-step denoising process back to recognizable training patterns. It applies three variants—ProtoPNet, Enhanced ProtoPNet, and ProtoPool—to see which one most accurately connects what the model produces to the features it learned from real data. The results single out Enhanced ProtoPNet as the most faithful link, with a measured score of 0.1534. This matters for healthcare because generative models are already producing realistic medical images, yet without clear reasoning they remain hard to trust for diagnosis or training. If the faithfulness scores hold, the approach gives a practical way to inspect and validate how each synthetic scan is built.

Core claim

By running the diffusion model’s denoising trajectory through prototype explainers and scoring how well the prototypes align with both generated and original training features, the study finds that Enhanced ProtoPNet produces the highest faithfulness value of 0.1534 and thereby supplies the clearest account of the image-formation process.

What carries the argument

A faithfulness metric that scores how closely prototype activations in PPNet, EPPNet, and ProtoPool match the intermediate states along the diffusion model’s denoising trajectory from noise to final MRI image.

If this is right

Enhanced ProtoPNet supplies the most reliable window into how diffusion models assemble MRI images from noise.
The same faithfulness framework can be applied to other generative diffusion tasks in medical imaging.
Higher-faithfulness explanations increase the chance that clinicians will accept synthetic images as trustworthy.
Prototype matching offers a concrete way to audit whether generated scans stay consistent with the training distribution.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The method could be adapted to flag when a diffusion model starts generating images that stray from real anatomical patterns.
Clinics might run these checks routinely before using synthetic data to augment scarce real MRI datasets.
Similar prototype tracing might reveal whether the model has learned shortcuts that ignore subtle pathology.

Load-bearing premise

The faithfulness score based on prototype matching actually tracks the diffusion model’s true internal steps rather than only surface similarities in the images.

What would settle it

An independent inspection of the diffusion model’s attention or gradient maps at multiple denoising timesteps that shows the model is using different features from the ones highlighted by the top-scoring EPPNet prototypes.

Figures

Figures reproduced from arXiv: 2602.09781 by Pallabi Saikia, Surjo Dey.

**Figure 2.** Figure 2: Comparison of real and synthetic breast MRI images generated with [PITH_FULL_IMAGE:figures/full_fig_p009_2.png] view at source ↗

**Figure 3.** Figure 3: Visualization of the denoising trajectory across diffusion timesteps. [PITH_FULL_IMAGE:figures/full_fig_p010_3.png] view at source ↗

**Figure 4.** Figure 4: Comparison of faithfulness scores for PPNet, EPPNet, and ProtoPool [PITH_FULL_IMAGE:figures/full_fig_p011_4.png] view at source ↗

**Figure 5.** Figure 5: Distribution of Normalized Influence Scores (NIS) across generated [PITH_FULL_IMAGE:figures/full_fig_p012_5.png] view at source ↗

read the original abstract

This study investigates the explainability of generative diffusion models in the context of medical imaging, focusing on Magnetic resonance imaging (MRI) synthesis. Although diffusion models have shown strong performance in generating realistic medical images, their internal decision making process remains largely opaque. We present a faithfulness-based explainability framework that analyzes how prototype-based explainability methods like ProtoPNet (PPNet), Enhanced ProtoPNet (EPPNet), and ProtoPool can link the relationship between generated and training features. Our study focuses on understanding the reasoning behind image formation through denoising trajectory of diffusion model and subsequently prototype explainability with faithfulness analysis. Experimental analysis shows that EPPNet achieves the highest faithfulness (with score 0.1534), offering more reliable insights, and explainability into the generative process. The results highlight that diffusion models can be made more transparent and trustworthy through faithfulness-based explanations, contributing to safer and more interpretable applications of generative AI in healthcare.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper applies existing prototype explainability methods to diffusion models for MRI synthesis and reports EPPNet as most faithful at 0.1534, but the metric lacks checks tying it to the actual denoising steps.

read the letter

The key thing to know is that this paper extends prototype-based explainability methods to diffusion models for synthesizing MRI images and finds that EPPNet achieves the highest faithfulness score of 0.1534 among the compared approaches. It argues this makes the generative process more understandable and trustworthy for healthcare applications. On the positive side, the authors correctly identify that diffusion models, despite their success in generating realistic medical images, hide their decision-making during the denoising steps. They adapt ProtoPNet, its enhanced version, and ProtoPool to link generated features with those from the training set through a faithfulness lens. This domain shift to medical diffusion is a reasonable move, and focusing on the relationship between generated and training features via prototypes shows some thought about how to probe the model without needing full access to internals. The weaknesses are more substantial. The central result rests on a faithfulness metric defined by prototype matching, yet the paper does not include tests to confirm this matching tracks the actual diffusion trajectory, such as how noise is predicted at each timestep. An ablation that alters the prototypes without changing the diffusion dynamics or checks for ties to reconstruction accuracy per step would strengthen the case, but nothing like that appears. Additionally, basic experimental information is missing: no mention of the specific MRI dataset used, the number of samples, training details for the diffusion model, or how the faithfulness score is precisely calculated and whether the difference is statistically significant. This leaves the 0.1534 figure floating without context. Citations seem appropriate, covering the relevant prototype and diffusion literature without over-relying on self-cites. This paper would mainly interest researchers working on explainable AI for medical image generation. A reader already deep in prototype methods or diffusion for healthcare might pick up some practical comparison points, but it offers limited new insight for the broader field. I recommend against sending it for peer review in this state. The empirical foundation is too thin to justify referee time without major additions to the methods and validation.

Referee Report

2 major / 1 minor

Summary. The paper introduces a faithfulness-based explainability framework for generative diffusion models applied to MRI synthesis. It compares prototype-based methods (ProtoPNet, Enhanced ProtoPNet/EPPNet, and ProtoPool) by linking generated image features to training-set prototypes and reports that EPPNet attains the highest faithfulness score (0.1534), thereby providing the most reliable insights into the diffusion model's generative process.

Significance. If the faithfulness metric can be shown to track causal contributions within the denoising trajectory rather than static feature overlap, the approach could meaningfully increase trust in medical generative models; the current evidence, however, is insufficient to establish this link or quantify practical benefit.

major comments (2)

[Abstract] Abstract / Experimental Analysis: the headline claim that EPPNet achieves the highest faithfulness (0.1534) is presented without any description of dataset size, number of generated samples, statistical significance testing, or the precise formula used to compute faithfulness, rendering the central empirical result unevaluable.
[Methods] Methods (implied): no ablation is reported that perturbs the training prototypes while holding the diffusion denoising trajectory fixed, nor any correlation between prototype activation strength and per-timestep reconstruction error; without such evidence the ordering could be driven by surface-level similarity statistics independent of the diffusion dynamics.

minor comments (1)

[Abstract] The abstract refers to analysis of the 'denoising trajectory' but never specifies which diffusion timesteps t are examined or how prototype matching is aligned with noise-prediction steps.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive report. We address each major comment below and outline the revisions we will make to improve clarity and strengthen the empirical support.

read point-by-point responses

Referee: [Abstract] Abstract / Experimental Analysis: the headline claim that EPPNet achieves the highest faithfulness (0.1534) is presented without any description of dataset size, number of generated samples, statistical significance testing, or the precise formula used to compute faithfulness, rendering the central empirical result unevaluable.

Authors: We agree that the abstract is insufficiently self-contained on these points. The full manuscript describes the dataset, sample counts, the faithfulness metric (computed as the normalized overlap between prototype activations and the diffusion reconstruction objective), and reports statistical significance via paired tests. In the revised version we will expand the abstract to include concise statements of dataset size, number of generated samples, the exact faithfulness formula, and the significance testing procedure so that the headline result is immediately evaluable. revision: yes
Referee: [Methods] Methods (implied): no ablation is reported that perturbs the training prototypes while holding the diffusion denoising trajectory fixed, nor any correlation between prototype activation strength and per-timestep reconstruction error; without such evidence the ordering could be driven by surface-level similarity statistics independent of the diffusion dynamics.

Authors: This is a fair critique of the causal interpretation. The present experiments compare faithfulness scores on identical generated images but do not include the requested perturbation ablations or timestep-wise correlations. We will add these analyses to the revised manuscript: (1) an ablation that perturbs prototype weights while freezing the diffusion trajectory and measures impact on faithfulness, and (2) correlation analysis between activation strength and per-timestep reconstruction error. These additions should help demonstrate that the observed ordering is tied to the generative dynamics rather than static feature overlap. revision: yes

Circularity Check

0 steps flagged

No significant circularity in faithfulness analysis

full rationale

The paper's central claim rests on an empirical comparison of faithfulness scores (EPPNet at 0.1534) obtained by matching generated MRI features to training prototypes via an external metric. This metric is applied after generation and does not reduce by construction to the diffusion model's fitted parameters or denoising trajectory. No self-definitional equations, fitted inputs relabeled as predictions, or load-bearing self-citations appear in the abstract or described methods. The derivation chain is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Based on abstract only; no explicit free parameters, axioms, or invented entities are described. The faithfulness score itself may implicitly depend on unstated choices in prototype selection or similarity metrics.

pith-pipeline@v0.9.0 · 5459 in / 1007 out tokens · 51001 ms · 2026-05-16T02:35:57.569914+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Faithfulness Score ... F(x) = 1/m Σ NIS_j(x)·corr(p_j, x) ... prototype activations with generative dynamics
IndisputableMonolith/Foundation/ArrowOfTime.lean forward_accumulates unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

denoising trajectory ... q(x_t|x_{t-1}) = N(√(1-β_t) x_{t-1}, β_t I) ... reverse process p_θ

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

21 extracted references · 21 canonical work pages · 1 internal anchor

[1]

Enhanced prototypical part network (eppnet) for explainable image classification via prototypes, 2024

Bhushan Atote and Victor Sanchez. Enhanced prototypical part network (eppnet) for explainable image classification via prototypes, 2024

work page 2024
[2]

How were you created? explaining syn- thetic face images generated by diffusion models

Bhushan Atote and Victor Sanchez. How were you created? explaining syn- thetic face images generated by diffusion models. InEuropean Conference on Computer Vision, pages 263–278. Springer, 2024

work page 2024
[3]

This looks like that: deep learning for interpretable image recognition.Advances in neural information processing systems, 32, 2019

Chaofan Chen, Oscar Li, Daniel Tao, Alina Barnett, Cynthia Rudin, and Jonathan K Su. This looks like that: deep learning for interpretable image recognition.Advances in neural information processing systems, 32, 2019

work page 2019
[4]

Conditional diffusion models for semantic 3d brain mri synthesis

Zolnamar Dorjsembe, Hsing-Kuo Pao, Sodtavilan Odonchimed, and Furen Xiao. Conditional diffusion models for semantic 3d brain mri synthesis. IEEE Journal of Biomedical and Health Informatics, 28(7):4084–4093, 2024

work page 2024
[5]

Gener- ative adversarial nets.Advances in neural information processing systems, 27, 2014

Ian J Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Gener- ative adversarial nets.Advances in neural information processing systems, 27, 2014

work page 2014
[6]

Gans trained by a two time-scale update rule converge to a local nash equilibrium.Advances in neural information processing systems, 30, 2017

Martin Heusel, Hubert Ramsauer, Thomas Unterthiner, Bernhard Nessler, and Sepp Hochreiter. Gans trained by a two time-scale update rule converge to a local nash equilibrium.Advances in neural information processing systems, 30, 2017

work page 2017
[7]

Denoising diffusion probabilis- tic models.Advances in neural information processing systems, 33:6840– 6851, 2020

Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffusion probabilis- tic models.Advances in neural information processing systems, 33:6840– 6851, 2020

work page 2020
[8]

Classifier-free diffusion guidance, 2022

Jonathan Ho and Tim Salimans. Classifier-free diffusion guidance, 2022

work page 2022
[9]

Fundamentals of digital image processing

Anil K Jain. Fundamentals of digital image processing. 1989

work page 1989
[10]

Cola-diff: Conditional latent diffusion model for multi-modal mri synthesis

Lan Jiang, Ye Mao, Xiangfeng Wang, Xi Chen, and Chao Li. Cola-diff: Conditional latent diffusion model for multi-modal mri synthesis. InInter- national Conference on Medical Image Computing and Computer-Assisted Intervention, pages 398–408. Springer, 2023

work page 2023
[11]

Mediffusion: Joint diffusion for self-explainable semi- supervised classification and medical image generation.arXiv preprint arXiv:2411.09434, 2024

Joanna Kaleta, Pawe l Skier´ s, Jan Dubi´ nski, Przemys law Korzeniowski, and Kamil Deja. Mediffusion: Joint diffusion for self-explainable semi- supervised classification and medical image generation.arXiv preprint arXiv:2411.09434, 2024

work page arXiv 2024
[12]

Diffusion mod- els in medical imaging: A comprehensive survey.Medical image analysis, 88:102846, 2023

Amirhossein Kazerouni, Ehsan Khodapanah Aghdam, Moein Heidari, Reza Azad, Mohsen Fayyaz, Ilker Hacihaliloglu, and Dorit Merhof. Diffusion mod- els in medical imaging: A comprehensive survey.Medical image analysis, 88:102846, 2023

work page 2023
[13]

Auto-Encoding Variational Bayes

Diederik P Kingma and Max Welling. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114, 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013
[14]

High-resolution image synthesis with latent diffusion models, 2022

Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Bj¨ orn Ommer. High-resolution image synthesis with latent diffusion models, 2022

work page 2022
[15]

Interpretable im- age classification with differentiable prototypes assignment, 2022

Dawid Rymarczyk, Lukasz Struski, Micha l G´ orszczak, Koryna Lewandowska, Jacek Tabor, and Bartosz Zieli´ nski. Interpretable im- age classification with differentiable prototypes assignment, 2022

work page 2022
[16]

A machine learning approach to radiogenomics of breast cancer: a study of 922 subjects and 529 dce-mri features.British journal of cancer, 119(4):508–516, 2018

Ashirbani Saha, Michael R Harowicz, Lars J Grimm, Connie E Kim, Su- jata V Ghate, Ruth Walsh, and Maciej A Mazurowski. A machine learning approach to radiogenomics of breast cancer: a study of 922 subjects and 529 dce-mri features.British journal of cancer, 119(4):508–516, 2018

work page 2018
[17]

Vald-md: visual attribution via latent diffusion for medical diagnostics

Ammar A Siddiqui, Santosh Tirunagari, Tehseen Zia, and David Windridge. Vald-md: visual attribution via latent diffusion for medical diagnostics. arXiv preprint arXiv:2401.01414, 2024. 14 Surjo and Pallabi

work page arXiv 2024
[18]

A latent diffusion approach to visual attribution in medical imaging

Ammar Adeel Siddiqui, Santosh Tirunagari, Tehseen Zia, and David Win- dridge. A latent diffusion approach to visual attribution in medical imaging. Scientific Reports, 15(1):962, 2025

work page 2025
[19]

Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole

Yang Song, Jascha Sohl-Dickstein, Diederik P. Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole. Score-based generative modeling through stochastic differential equations, 2021

work page 2021
[20]

Im- age quality assessment: from error visibility to structural similarity.IEEE transactions on image processing, 13(4):600–612, 2004

Zhou Wang, Alan C Bovik, Hamid R Sheikh, and Eero P Simoncelli. Im- age quality assessment: from error visibility to structural similarity.IEEE transactions on image processing, 13(4):600–612, 2004

work page 2004
[21]

The unreasonable effectiveness of deep features as a perceptual met- ric

Richard Zhang, Phillip Isola, Alexei A Efros, Eli Shechtman, and Oliver Wang. The unreasonable effectiveness of deep features as a perceptual met- ric. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 586–595, 2018

work page 2018

[1] [1]

Enhanced prototypical part network (eppnet) for explainable image classification via prototypes, 2024

Bhushan Atote and Victor Sanchez. Enhanced prototypical part network (eppnet) for explainable image classification via prototypes, 2024

work page 2024

[2] [2]

How were you created? explaining syn- thetic face images generated by diffusion models

Bhushan Atote and Victor Sanchez. How were you created? explaining syn- thetic face images generated by diffusion models. InEuropean Conference on Computer Vision, pages 263–278. Springer, 2024

work page 2024

[3] [3]

This looks like that: deep learning for interpretable image recognition.Advances in neural information processing systems, 32, 2019

Chaofan Chen, Oscar Li, Daniel Tao, Alina Barnett, Cynthia Rudin, and Jonathan K Su. This looks like that: deep learning for interpretable image recognition.Advances in neural information processing systems, 32, 2019

work page 2019

[4] [4]

Conditional diffusion models for semantic 3d brain mri synthesis

Zolnamar Dorjsembe, Hsing-Kuo Pao, Sodtavilan Odonchimed, and Furen Xiao. Conditional diffusion models for semantic 3d brain mri synthesis. IEEE Journal of Biomedical and Health Informatics, 28(7):4084–4093, 2024

work page 2024

[5] [5]

Gener- ative adversarial nets.Advances in neural information processing systems, 27, 2014

Ian J Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Gener- ative adversarial nets.Advances in neural information processing systems, 27, 2014

work page 2014

[6] [6]

Gans trained by a two time-scale update rule converge to a local nash equilibrium.Advances in neural information processing systems, 30, 2017

Martin Heusel, Hubert Ramsauer, Thomas Unterthiner, Bernhard Nessler, and Sepp Hochreiter. Gans trained by a two time-scale update rule converge to a local nash equilibrium.Advances in neural information processing systems, 30, 2017

work page 2017

[7] [7]

Denoising diffusion probabilis- tic models.Advances in neural information processing systems, 33:6840– 6851, 2020

Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffusion probabilis- tic models.Advances in neural information processing systems, 33:6840– 6851, 2020

work page 2020

[8] [8]

Classifier-free diffusion guidance, 2022

Jonathan Ho and Tim Salimans. Classifier-free diffusion guidance, 2022

work page 2022

[9] [9]

Fundamentals of digital image processing

Anil K Jain. Fundamentals of digital image processing. 1989

work page 1989

[10] [10]

Cola-diff: Conditional latent diffusion model for multi-modal mri synthesis

Lan Jiang, Ye Mao, Xiangfeng Wang, Xi Chen, and Chao Li. Cola-diff: Conditional latent diffusion model for multi-modal mri synthesis. InInter- national Conference on Medical Image Computing and Computer-Assisted Intervention, pages 398–408. Springer, 2023

work page 2023

[11] [11]

Mediffusion: Joint diffusion for self-explainable semi- supervised classification and medical image generation.arXiv preprint arXiv:2411.09434, 2024

Joanna Kaleta, Pawe l Skier´ s, Jan Dubi´ nski, Przemys law Korzeniowski, and Kamil Deja. Mediffusion: Joint diffusion for self-explainable semi- supervised classification and medical image generation.arXiv preprint arXiv:2411.09434, 2024

work page arXiv 2024

[12] [12]

Diffusion mod- els in medical imaging: A comprehensive survey.Medical image analysis, 88:102846, 2023

Amirhossein Kazerouni, Ehsan Khodapanah Aghdam, Moein Heidari, Reza Azad, Mohsen Fayyaz, Ilker Hacihaliloglu, and Dorit Merhof. Diffusion mod- els in medical imaging: A comprehensive survey.Medical image analysis, 88:102846, 2023

work page 2023

[13] [13]

Auto-Encoding Variational Bayes

Diederik P Kingma and Max Welling. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114, 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013

[14] [14]

High-resolution image synthesis with latent diffusion models, 2022

Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Bj¨ orn Ommer. High-resolution image synthesis with latent diffusion models, 2022

work page 2022

[15] [15]

Interpretable im- age classification with differentiable prototypes assignment, 2022

Dawid Rymarczyk, Lukasz Struski, Micha l G´ orszczak, Koryna Lewandowska, Jacek Tabor, and Bartosz Zieli´ nski. Interpretable im- age classification with differentiable prototypes assignment, 2022

work page 2022

[16] [16]

A machine learning approach to radiogenomics of breast cancer: a study of 922 subjects and 529 dce-mri features.British journal of cancer, 119(4):508–516, 2018

Ashirbani Saha, Michael R Harowicz, Lars J Grimm, Connie E Kim, Su- jata V Ghate, Ruth Walsh, and Maciej A Mazurowski. A machine learning approach to radiogenomics of breast cancer: a study of 922 subjects and 529 dce-mri features.British journal of cancer, 119(4):508–516, 2018

work page 2018

[17] [17]

Vald-md: visual attribution via latent diffusion for medical diagnostics

Ammar A Siddiqui, Santosh Tirunagari, Tehseen Zia, and David Windridge. Vald-md: visual attribution via latent diffusion for medical diagnostics. arXiv preprint arXiv:2401.01414, 2024. 14 Surjo and Pallabi

work page arXiv 2024

[18] [18]

A latent diffusion approach to visual attribution in medical imaging

Ammar Adeel Siddiqui, Santosh Tirunagari, Tehseen Zia, and David Win- dridge. A latent diffusion approach to visual attribution in medical imaging. Scientific Reports, 15(1):962, 2025

work page 2025

[19] [19]

Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole

Yang Song, Jascha Sohl-Dickstein, Diederik P. Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole. Score-based generative modeling through stochastic differential equations, 2021

work page 2021

[20] [20]

Im- age quality assessment: from error visibility to structural similarity.IEEE transactions on image processing, 13(4):600–612, 2004

Zhou Wang, Alan C Bovik, Hamid R Sheikh, and Eero P Simoncelli. Im- age quality assessment: from error visibility to structural similarity.IEEE transactions on image processing, 13(4):600–612, 2004

work page 2004

[21] [21]

The unreasonable effectiveness of deep features as a perceptual met- ric

Richard Zhang, Phillip Isola, Alexei A Efros, Eli Shechtman, and Oliver Wang. The unreasonable effectiveness of deep features as a perceptual met- ric. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 586–595, 2018

work page 2018