Broken Memories: Detecting and Mitigating Memorization in Diffusion Models with Degraded Generations

Chen Chen; Feifei Li; Geng Hong; Min Yang; Mi Zhang; Xiaoyu You; Yuanmin Huang

arxiv: 2605.22050 · v4 · pith:7QV34EWNnew · submitted 2026-05-21 · 💻 cs.CV

Broken Memories: Detecting and Mitigating Memorization in Diffusion Models with Degraded Generations

Yuanmin Huang , Mi Zhang , Chen Chen , Feifei Li , Geng Hong , Xiaoyu You , Min Yang This is my paper

Pith reviewed 2026-06-30 17:33 UTC · model grok-4.3

classification 💻 cs.CV

keywords diffusion modelsmemorizationdetectionmitigationnumerical stabilitylatent normsimage generationprivacy

0 comments

The pith

Memorization in diffusion models produces numerical instability visible as broken artifacts in generated images.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that memorization in diffusion models creates internal numerical instability during the iterative generation process. This instability frequently appears as visually broken or degraded artifacts in the output. The authors define empirical stability regions using norms of latent updates at each step to distinguish stable from unstable behavior. They then build an on-the-fly detection and mitigation system that operates without modifying the prompt or guidance. Experiments on Stable Diffusion 1.4 show the method reaches AUC above 0.999 for detection and drives memorization rate to zero after mitigation while adding negligible time per image.

Core claim

The authors claim that memorization induces internal numerical instability in diffusion models, which manifests as visually broken artifacts. They introduce empirical stability regions based on latent update norms to characterize stable generation behavior. Using these regions, they develop a step-wise detection and adaptive mitigation framework that suppresses memorization on the fly, achieving an AUC greater than 0.999 for detection and a 0.0% memorization rate after mitigation on Stable Diffusion 1.4, with negligible overhead.

What carries the argument

Empirical stability regions based on latent update norms, which quantify the stability of the generation process at each diffusion step and enable distinction between memorized and non-memorized cases.

If this is right

Memorization can be detected with AUC exceeding 0.999 using the stability analysis.
Mitigation can reduce the memorization rate to 0% without altering prompts or guidance.
Semantic fidelity and image quality remain preserved during mitigation.
The method adds only about 0.01 seconds per image in overhead.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The stability-region approach might generalize to other iterative generative models beyond diffusion.
If the separation between stable and unstable regions holds across model scales, it could support prompt-independent auditing of training data leakage.
Combining this detection with existing membership inference methods could yield higher-precision privacy audits.

Load-bearing premise

Visually broken artifacts during generation are specifically caused by memorization rather than other sources of instability.

What would settle it

Observing broken artifacts in generations from clearly non-memorized prompts or stable clean outputs from known memorized prompts would falsify the claimed link between memorization and the observed instability.

Figures

Figures reproduced from arXiv: 2605.22050 by Chen Chen, Feifei Li, Geng Hong, Min Yang, Mi Zhang, Xiaoyu You, Yuanmin Huang.

**Figure 2.** Figure 2: Left: On-the-fly detection and mitigation progress. Each row visualizes predicted [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: Comparison of mitigations on SD 1.4 [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗

**Figure 4.** Figure 4: Qualitative results comparing the proposed approach with the baselines on SD 1.4. [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗

**Figure 6.** Figure 6: Comparison of latent update trajectories and gen [PITH_FULL_IMAGE:figures/full_fig_p009_6.png] view at source ↗

**Figure 7.** Figure 7: PNDM generation process on SD 1.4 using strong/mild/non- memorized prompts. [PITH_FULL_IMAGE:figures/full_fig_p014_7.png] view at source ↗

**Figure 8.** Figure 8: DDIM generation process on SD 1.4 using strong/mild/non- memorized prompts. [PITH_FULL_IMAGE:figures/full_fig_p014_8.png] view at source ↗

**Figure 9.** Figure 9: Similar stability regions by prompts from different [PITH_FULL_IMAGE:figures/full_fig_p015_9.png] view at source ↗

**Figure 10.** Figure 10: Similar stability regions by different numbers of [PITH_FULL_IMAGE:figures/full_fig_p016_10.png] view at source ↗

**Figure 11.** Figure 11: Detection AUC using different numbers of refer [PITH_FULL_IMAGE:figures/full_fig_p017_11.png] view at source ↗

**Figure 13.** Figure 13: Comparison of latent update trajectories and gen [PITH_FULL_IMAGE:figures/full_fig_p017_13.png] view at source ↗

**Figure 14.** Figure 14: Comparison with baselines on memorized prompts using finetuned SD 1.4 [PITH_FULL_IMAGE:figures/full_fig_p019_14.png] view at source ↗

read the original abstract

While diffusion models excel at generating high-quality images, their tendency to memorize training data poses significant privacy and copyright risks. In this work, we for the first time identify that memorization induces internal numerical instability, often manifesting as visually ``broken'' artifacts. Inspired by stability analysis in numerical methods, we introduce empirical stability regions based on latent update norms to quantitatively characterize stable behavior during generation. Leveraging this, we propose a principled, on-the-fly framework for step-wise detection and adaptive mitigation. Our approach suppresses memorization without altering prompts or guidance, thereby preserving semantic fidelity and image quality. Extensive experiments on Stable Diffusion 1.4 demonstrate that our method achieves an AUC $>0.999$ detection performance and a $0.0\%$ memorization rate after mitigation with negligible overhead ($\approx0.01$s per image).

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper links memorization to latent instability for on-the-fly detection and mitigation but does not establish that memorization is the actual cause.

read the letter

This paper claims to be the first to connect memorization in diffusion models to numerical instability shown as broken artifacts, and offers an on-the-fly detection and mitigation using latent norms. The reported performance is strong but the supporting evidence for the core claim looks thin.

The new part is the stability region idea based on update norms, which lets them detect and fix during sampling without side effects on the prompt. Experiments on SD 1.4 back up the AUC and zero memorization numbers, and the overhead is low. That part is useful for anyone who needs to reduce privacy risks in these models.

The soft spot is the causality. The method assumes the high norms come from memorization, but other factors in the diffusion process could produce similar patterns. The paper would be stronger with tests that vary memorization while holding other variables constant. Details on how they quantify memorization and define the regions are also light.

Readers interested in applied fixes for generative AI safety would get value here. It has a working approach and addresses a timely issue, so it deserves a serious referee to check the experiments and claims.

I would send this to peer review.

Referee Report

3 major / 2 minor

Summary. The paper claims that memorization in diffusion models induces internal numerical instability during the generation process, often visible as 'broken' artifacts. It introduces empirical stability regions derived from latent update norms to enable step-wise detection of memorization and adaptive mitigation, without modifying prompts or guidance scales. Experiments on Stable Diffusion 1.4 report AUC > 0.999 for detection and 0.0% memorization rate post-mitigation, with negligible computational overhead.

Significance. If the core causal link between memorization and the observed instability holds and generalizes, the work offers a practical on-the-fly detection and mitigation technique that preserves semantic fidelity, addressing privacy and copyright concerns in generative models. The approach is notable for operating during inference rather than requiring retraining or prompt engineering.

major comments (3)

[§3] §3 (Method): The central claim that memorization specifically induces the high latent update norms and broken artifacts lacks causal validation. No controlled interventions (e.g., targeted fine-tuning to induce memorization while holding other factors fixed, or ablation of memorization) are described to rule out confounders such as prompt complexity, guidance scale, or inherent diffusion stochasticity; the empirical stability regions could therefore act as a proxy rather than a memorization-specific detector.
[§4] §4 (Experiments): The reported 0.0% memorization rate after mitigation and AUC >0.999 require a precise definition of the memorization metric and how it is computed (e.g., exact threshold for considering an output memorized, use of membership inference or reconstruction attacks). Without this, or error bars across multiple runs/prompts, it is impossible to assess whether the mitigation truly suppresses memorization or merely alters generation dynamics.
[§4.3] §4.3 (Ablation or stability analysis): The separation into stable/unstable regions is presented as reliable across prompts and models, but no analysis shows robustness when varying generation hyperparameters or testing on non-memorized prompts that might naturally produce high-norm updates; this directly affects the load-bearing claim of reliable on-the-fly detection.

minor comments (2)

[Abstract / §3] The abstract and method sections use 'empirical stability regions' without an explicit equation or pseudocode for the norm threshold computation; adding this would improve reproducibility.
[Figures] Figure captions should explicitly state the number of samples, models, and prompts used to generate the reported AUC and memorization rates.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive comments and address each major point below. We outline planned revisions to strengthen the manuscript.

read point-by-point responses

Referee: [§3] §3 (Method): The central claim that memorization specifically induces the high latent update norms and broken artifacts lacks causal validation. No controlled interventions (e.g., targeted fine-tuning to induce memorization while holding other factors fixed, or ablation of memorization) are described to rule out confounders such as prompt complexity, guidance scale, or inherent diffusion stochasticity; the empirical stability regions could therefore act as a proxy rather than a memorization-specific detector.

Authors: Our work demonstrates a strong empirical correlation between memorization and elevated latent update norms across extensive experiments on Stable Diffusion 1.4. We agree that the manuscript would benefit from more cautious language and explicit discussion of potential confounders. We will revise §3 to frame the findings as an observed association rather than direct induction and add a limitations paragraph addressing alternative explanations such as prompt complexity. revision: partial
Referee: [§4] §4 (Experiments): The reported 0.0% memorization rate after mitigation and AUC >0.999 require a precise definition of the memorization metric and how it is computed (e.g., exact threshold for considering an output memorized, use of membership inference or reconstruction attacks). Without this, or error bars across multiple runs/prompts, it is impossible to assess whether the mitigation truly suppresses memorization or merely alters generation dynamics.

Authors: We will add a precise definition of the memorization metric in the revised §4, including the exact similarity threshold, the membership inference procedure used for evaluation, and error bars computed over multiple independent runs with different random seeds. revision: yes
Referee: [§4.3] §4.3 (Ablation or stability analysis): The separation into stable/unstable regions is presented as reliable across prompts and models, but no analysis shows robustness when varying generation hyperparameters or testing on non-memorized prompts that might naturally produce high-norm updates; this directly affects the load-bearing claim of reliable on-the-fly detection.

Authors: We will augment §4.3 with new ablation results that vary key hyperparameters (guidance scale, number of steps) and evaluate the stability regions on a held-out set of non-memorized prompts to verify that high-norm updates remain rare outside memorized cases. revision: yes

Circularity Check

0 steps flagged

No significant circularity in the derivation chain

full rationale

The paper introduces empirical stability regions based on latent update norms as a new characterization of generation behavior, then reports AUC >0.999 and 0.0% post-mitigation memorization as experimental outcomes. No provided text or equations show these regions or thresholds being fitted to the same data used for the performance claims, nor any self-definitional loops, fitted inputs renamed as predictions, load-bearing self-citations, or imported uniqueness theorems. The central claims rest on external experimental validation rather than reducing to the inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only; no equations or method details available to enumerate free parameters, axioms, or invented entities.

pith-pipeline@v0.9.1-grok · 5683 in / 1068 out tokens · 31988 ms · 2026-06-30T17:33:46.265421+00:00 · methodology

Review history (3 revisions) →

discussion (0)

Reference graph

Works this paper leans on

36 extracted references · 19 canonical work pages · 9 internal anchors

[1]

Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Floren- cia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, et al. 2023. Gpt-4 technical report.arXiv preprint arXiv:2303.08774 (2023)

work page internal anchor Pith review Pith/arXiv arXiv 2023
[3]

Tony Bonnaire, Raphaël Urfin, Giulio Biroli, and Marc Mézard. 2025. Why Diffusion Models Don’t Memorize: The Role of Implicit Dynamical Regularization in Training. arXiv:2505.17638 [cs.LG] https://arxiv.org/abs/2505.17638

work page arXiv 2025
[4]

Jonathan Brokman, Itay Gershon, Omer Hofman, Guy Gilboa, and Roman Vain- shtein. 2025. Tracking memorization geometry throughout the diffusion model generative process. InNeurIPS 2025 Workshop on Symmetry and Geometry in Neural Representations

2025
[5]

J.C. Butcher. 1996. A history of Runge-Kutta methods.Applied Numerical Mathe- matics20, 3 (1996), 247–260. doi:10.1016/0168-9274(95)00108-5

work page doi:10.1016/0168-9274(95)00108-5 1996
[6]

Nicolas Carlini, Jamie Hayes, Milad Nasr, Matthew Jagielski, Vikash Sehwag, Florian Tramer, Borja Balle, Daphne Ippolito, and Eric Wallace. 2023. Extracting training data from diffusion models. In32nd USENIX security symposium (USENIX Security 23). 5253–5270

2023
[7]

Ruchika Chavhan, Ondrej Bohdal, Yongshuo Zong, Da Li, and Timothy Hospedales. 2024. Memorized Images in Diffusion Models share a Subspace that can be Located and Deleted. arXiv:2406.18566 [cs.CV] https://arxiv.org/abs/ 2406.18566 KDD 2026, August 9–13, 2026, Jeju Island, Republic of Korea. Yuanmin Huang, Mi Zhang, Chen Chen, Feifei Li, Geng Hong, Xiaoyu Y...

work page arXiv 2024
[8]

Chen Chen, Daochang Liu, Mubarak Shah, and Chang Xu. 2025. En- hancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffu- sion Models. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8182–8191. https://openaccess.thecvf. com/content/CVPR2025/html/Chen_Enhancing_Privacy-Utility_Trade- offs_to_Mitigate_Memoriz...

2025
[9]

Chen Chen, Daochang Liu, Mubarak Shah, and Chang Xu. 2025. Explor- ing Local Memorization in Diffusion Models via Bright Ending Attention. arXiv:2410.21665 [cs.CV] https://arxiv.org/abs/2410.21665

work page arXiv 2025
[10]

Chen Chen, Daochang Liu, and Chang Xu. 2024. Towards memorization-free diffusion models. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8425–8434

2024
[11]

Dale R Durran. 1991. The third-order Adams-Bashforth method: An attractive alternative to leapfrog time differencing.Monthly weather review119, 3 (1991), 702–720

1991
[12]

Zihan Guan, Mengxuan Hu, Sheng Li, and Anil Vullikanti. 2025. UFID: A Unified Framework for Input-level Backdoor Detection on Diffusion Models. arXiv:2404.01101 [cs.CR] https://arxiv.org/abs/2404.01101

work page arXiv 2025
[13]

1993.Solving ordinary differential equations I: Nonstiff problems

Ernst Hairer, Gerhard Wanner, and Syvert P Nørsett. 1993.Solving ordinary differential equations I: Nonstiff problems. Springer

1993
[14]

Jack Hessel, Ari Holtzman, Maxwell Forbes, Ronan Le Bras, and Yejin Choi
[15]

CLIPScore: A Reference-free Evaluation Metric for Image Captioning

CLIPScore: A Reference-free Evaluation Metric for Image Captioning. arXiv:2104.08718 [cs.CV] https://arxiv.org/abs/2104.08718

work page internal anchor Pith review Pith/arXiv arXiv
[16]

Martin Heusel, Hubert Ramsauer, Thomas Unterthiner, Bernhard Nessler, and Sepp Hochreiter. 2018. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium. arXiv:1706.08500 [cs.LG] https://arxiv.org/abs/1706. 08500

work page internal anchor Pith review Pith/arXiv arXiv 2018
[17]

Jonathan Ho, Ajay Jain, and Pieter Abbeel. 2020. Denoising diffusion probabilistic models.Advances in neural information processing systems33 (2020), 6840–6851

2020
[18]

Jonathan Ho and Tim Salimans. 2022. Classifier-Free Diffusion Guidance. arXiv:2207.12598 [cs.LG] https://arxiv.org/abs/2207.12598

work page internal anchor Pith review Pith/arXiv arXiv 2022
[19]

Anubhav Jain, Yuya Kobayashi, Takashi Shibuya, Yuhta Takida, Nasir Memon, Julian Togelius, and Yuki Mitsufuji. 2025. Classifier-free guidance inside the attraction basin may cause memorization. InProceedings of the Computer Vision and Pattern Recognition Conference. 12871–12879

2025
[20]

Dongjae Jeon, Dueun Kim, and Albert No. 2025. Understanding and Mitigating Memorization in Generative Models via Sharpness of Probability Landscapes. Proceedings of the 42nd International Conference on Machine Learning(2025)

2025
[21]

Yue Jiang, Haokun Lin, Yang Bai, Bo Peng, Zhili Liu, Yueming Lyu, Yong Yang, Xing Zheng, and Jing Dong. 2025. Image-Level Memorization Detection Via Inversion-Based Inference Perturbation.The Thirteenth International Conference on Learning Representations(2025)

2025
[22]

Kingma and Max Welling

Diederik P. Kingma and Max Welling. 2019. An Introduction to Variational Autoencoders.Foundations and Trends®in Machine Learning12, 4 (2019), 307–392. doi:10.1561/2200000056

work page doi:10.1561/2200000056 2019
[23]

Luping Liu, Yi Ren, Zhijie Lin, and Zhou Zhao. 2022. Pseudo Numerical Methods for Diffusion Models on Manifolds. arXiv:2202.09778 [cs.CV] https://arxiv.org/ abs/2202.09778

work page arXiv 2022
[24]

Zihao Luo, Xilie Xu, Feng Liu, Yun Sing Koh, Di Wang, and Jingfeng Zhang. 2024. Privacy-Preserving Low-Rank Adaptation against Membership Inference Attacks for Latent Diffusion Models. arXiv:2402.11989 [cs.LG] https://arxiv.org/abs/2402. 11989

work page arXiv 2024
[25]

Ed Pizzi, Sreya Dutta Roy, Sugosh Nagavara Ravindra, Priya Goyal, and Matthijs Douze. 2022. A self-supervised descriptor for image copy detection. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 14532– 14542

2022
[26]

Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, and Ilya Sutskever. 2021. Learning Transferable Visual Models From Natural Language Supervision. arXiv:2103.00020 [cs.CV] https://arxiv.org/ abs/2103.00020

work page internal anchor Pith review Pith/arXiv arXiv 2021
[27]

Jie Ren, Yaxin Li, Shenglai Zeng, Han Xu, Lingjuan Lyu, Yue Xing, and Jiliang Tang. 2024. Unveiling and mitigating memorization in text-to-image diffusion models through cross attention. InEuropean Conference on Computer Vision. Springer, 340–356

2024
[28]

Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Björn Ommer. 2022. High-Resolution Image Synthesis with Latent Diffusion Models. arXiv:2112.10752 [cs.CV] https://arxiv.org/abs/2112.10752

work page internal anchor Pith review Pith/arXiv arXiv 2022
[29]

Christoph Schuhmann, Romain Beaumont, Richard Vencu, Cade Gordon, Ross Wightman, Mehdi Cherti, Theo Coombes, Aarush Katta, Clayton Mullis, Mitchell Wortsman, et al. 2022. Laion-5b: An open large-scale dataset for training next generation image-text models.Advances in neural information processing systems 35 (2022), 25278–25294

2022
[30]

Christoph Schuhmann, Richard Vencu, Romain Beaumont, Robert Kaczmarczyk, Clayton Mullis, Aarush Katta, Theo Coombes, Jenia Jitsev, and Aran Komatsuzaki
[31]

Laion-400m: Open dataset of clip-filtered 400 million image-text pairs.arXiv preprint arXiv:2111.02114(2021)

work page internal anchor Pith review Pith/arXiv arXiv 2021
[32]

1985.Numerical solution of partial differential equations: finite difference methods

Gordon D Smith. 1985.Numerical solution of partial differential equations: finite difference methods. Oxford university press

1985
[33]

Gowthami Somepalli, Vasu Singla, Micah Goldblum, Jonas Geiping, and Tom Goldstein. 2023. Understanding and mitigating copying in diffusion models. Advances in Neural Information Processing Systems36 (2023), 47783–47803

2023
[34]

Jiaming Song, Chenlin Meng, and Stefano Ermon. 2022. Denoising Diffusion Implicit Models. arXiv:2010.02502 [cs.LG] https://arxiv.org/abs/2010.02502

work page internal anchor Pith review Pith/arXiv arXiv 2022
[35]

Yang Song, Jascha Sohl-Dickstein, Diederik P Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole. 2020. Score-based generative modeling through stochastic differential equations.arXiv preprint arXiv:2011.13456(2020)

work page internal anchor Pith review Pith/arXiv arXiv 2020
[36]

Ryan Webster. 2023. A reproducible extraction of training images from diffusion models.arXiv preprint arXiv:2305.08694(2023)

work page arXiv 2023
[37]

transfer part

Yuxin Wen, Yuchen Liu, Chen Chen, and Lingjuan Lyu. 2024. Detecting, explain- ing, and mitigating memorization in diffusion models. InThe Twelfth International Conference on Learning Representations. A Detailed Proofs In this section, we provide detailed proofs for Theorem 1 and 3 presented in the main text. Theorem 1 (Normal Trajectories Stability).Let 𝛿...

work page arXiv 2024

[1] [1]

Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Floren- cia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, et al. 2023. Gpt-4 technical report.arXiv preprint arXiv:2303.08774 (2023)

work page internal anchor Pith review Pith/arXiv arXiv 2023

[2] [3]

Tony Bonnaire, Raphaël Urfin, Giulio Biroli, and Marc Mézard. 2025. Why Diffusion Models Don’t Memorize: The Role of Implicit Dynamical Regularization in Training. arXiv:2505.17638 [cs.LG] https://arxiv.org/abs/2505.17638

work page arXiv 2025

[3] [4]

Jonathan Brokman, Itay Gershon, Omer Hofman, Guy Gilboa, and Roman Vain- shtein. 2025. Tracking memorization geometry throughout the diffusion model generative process. InNeurIPS 2025 Workshop on Symmetry and Geometry in Neural Representations

2025

[4] [5]

J.C. Butcher. 1996. A history of Runge-Kutta methods.Applied Numerical Mathe- matics20, 3 (1996), 247–260. doi:10.1016/0168-9274(95)00108-5

work page doi:10.1016/0168-9274(95)00108-5 1996

[5] [6]

Nicolas Carlini, Jamie Hayes, Milad Nasr, Matthew Jagielski, Vikash Sehwag, Florian Tramer, Borja Balle, Daphne Ippolito, and Eric Wallace. 2023. Extracting training data from diffusion models. In32nd USENIX security symposium (USENIX Security 23). 5253–5270

2023

[6] [7]

Ruchika Chavhan, Ondrej Bohdal, Yongshuo Zong, Da Li, and Timothy Hospedales. 2024. Memorized Images in Diffusion Models share a Subspace that can be Located and Deleted. arXiv:2406.18566 [cs.CV] https://arxiv.org/abs/ 2406.18566 KDD 2026, August 9–13, 2026, Jeju Island, Republic of Korea. Yuanmin Huang, Mi Zhang, Chen Chen, Feifei Li, Geng Hong, Xiaoyu Y...

work page arXiv 2024

[7] [8]

Chen Chen, Daochang Liu, Mubarak Shah, and Chang Xu. 2025. En- hancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffu- sion Models. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8182–8191. https://openaccess.thecvf. com/content/CVPR2025/html/Chen_Enhancing_Privacy-Utility_Trade- offs_to_Mitigate_Memoriz...

2025

[8] [9]

Chen Chen, Daochang Liu, Mubarak Shah, and Chang Xu. 2025. Explor- ing Local Memorization in Diffusion Models via Bright Ending Attention. arXiv:2410.21665 [cs.CV] https://arxiv.org/abs/2410.21665

work page arXiv 2025

[9] [10]

Chen Chen, Daochang Liu, and Chang Xu. 2024. Towards memorization-free diffusion models. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8425–8434

2024

[10] [11]

Dale R Durran. 1991. The third-order Adams-Bashforth method: An attractive alternative to leapfrog time differencing.Monthly weather review119, 3 (1991), 702–720

1991

[11] [12]

Zihan Guan, Mengxuan Hu, Sheng Li, and Anil Vullikanti. 2025. UFID: A Unified Framework for Input-level Backdoor Detection on Diffusion Models. arXiv:2404.01101 [cs.CR] https://arxiv.org/abs/2404.01101

work page arXiv 2025

[12] [13]

1993.Solving ordinary differential equations I: Nonstiff problems

Ernst Hairer, Gerhard Wanner, and Syvert P Nørsett. 1993.Solving ordinary differential equations I: Nonstiff problems. Springer

1993

[13] [14]

Jack Hessel, Ari Holtzman, Maxwell Forbes, Ronan Le Bras, and Yejin Choi

[14] [15]

CLIPScore: A Reference-free Evaluation Metric for Image Captioning

CLIPScore: A Reference-free Evaluation Metric for Image Captioning. arXiv:2104.08718 [cs.CV] https://arxiv.org/abs/2104.08718

work page internal anchor Pith review Pith/arXiv arXiv

[15] [16]

Martin Heusel, Hubert Ramsauer, Thomas Unterthiner, Bernhard Nessler, and Sepp Hochreiter. 2018. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium. arXiv:1706.08500 [cs.LG] https://arxiv.org/abs/1706. 08500

work page internal anchor Pith review Pith/arXiv arXiv 2018

[16] [17]

Jonathan Ho, Ajay Jain, and Pieter Abbeel. 2020. Denoising diffusion probabilistic models.Advances in neural information processing systems33 (2020), 6840–6851

2020

[17] [18]

Jonathan Ho and Tim Salimans. 2022. Classifier-Free Diffusion Guidance. arXiv:2207.12598 [cs.LG] https://arxiv.org/abs/2207.12598

work page internal anchor Pith review Pith/arXiv arXiv 2022

[18] [19]

Anubhav Jain, Yuya Kobayashi, Takashi Shibuya, Yuhta Takida, Nasir Memon, Julian Togelius, and Yuki Mitsufuji. 2025. Classifier-free guidance inside the attraction basin may cause memorization. InProceedings of the Computer Vision and Pattern Recognition Conference. 12871–12879

2025

[19] [20]

Dongjae Jeon, Dueun Kim, and Albert No. 2025. Understanding and Mitigating Memorization in Generative Models via Sharpness of Probability Landscapes. Proceedings of the 42nd International Conference on Machine Learning(2025)

2025

[20] [21]

Yue Jiang, Haokun Lin, Yang Bai, Bo Peng, Zhili Liu, Yueming Lyu, Yong Yang, Xing Zheng, and Jing Dong. 2025. Image-Level Memorization Detection Via Inversion-Based Inference Perturbation.The Thirteenth International Conference on Learning Representations(2025)

2025

[21] [22]

Kingma and Max Welling

Diederik P. Kingma and Max Welling. 2019. An Introduction to Variational Autoencoders.Foundations and Trends®in Machine Learning12, 4 (2019), 307–392. doi:10.1561/2200000056

work page doi:10.1561/2200000056 2019

[22] [23]

Luping Liu, Yi Ren, Zhijie Lin, and Zhou Zhao. 2022. Pseudo Numerical Methods for Diffusion Models on Manifolds. arXiv:2202.09778 [cs.CV] https://arxiv.org/ abs/2202.09778

work page arXiv 2022

[23] [24]

Zihao Luo, Xilie Xu, Feng Liu, Yun Sing Koh, Di Wang, and Jingfeng Zhang. 2024. Privacy-Preserving Low-Rank Adaptation against Membership Inference Attacks for Latent Diffusion Models. arXiv:2402.11989 [cs.LG] https://arxiv.org/abs/2402. 11989

work page arXiv 2024

[24] [25]

Ed Pizzi, Sreya Dutta Roy, Sugosh Nagavara Ravindra, Priya Goyal, and Matthijs Douze. 2022. A self-supervised descriptor for image copy detection. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 14532– 14542

2022

[25] [26]

Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, and Ilya Sutskever. 2021. Learning Transferable Visual Models From Natural Language Supervision. arXiv:2103.00020 [cs.CV] https://arxiv.org/ abs/2103.00020

work page internal anchor Pith review Pith/arXiv arXiv 2021

[26] [27]

Jie Ren, Yaxin Li, Shenglai Zeng, Han Xu, Lingjuan Lyu, Yue Xing, and Jiliang Tang. 2024. Unveiling and mitigating memorization in text-to-image diffusion models through cross attention. InEuropean Conference on Computer Vision. Springer, 340–356

2024

[27] [28]

Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Björn Ommer. 2022. High-Resolution Image Synthesis with Latent Diffusion Models. arXiv:2112.10752 [cs.CV] https://arxiv.org/abs/2112.10752

work page internal anchor Pith review Pith/arXiv arXiv 2022

[28] [29]

Christoph Schuhmann, Romain Beaumont, Richard Vencu, Cade Gordon, Ross Wightman, Mehdi Cherti, Theo Coombes, Aarush Katta, Clayton Mullis, Mitchell Wortsman, et al. 2022. Laion-5b: An open large-scale dataset for training next generation image-text models.Advances in neural information processing systems 35 (2022), 25278–25294

2022

[29] [30]

Christoph Schuhmann, Richard Vencu, Romain Beaumont, Robert Kaczmarczyk, Clayton Mullis, Aarush Katta, Theo Coombes, Jenia Jitsev, and Aran Komatsuzaki

[30] [31]

Laion-400m: Open dataset of clip-filtered 400 million image-text pairs.arXiv preprint arXiv:2111.02114(2021)

work page internal anchor Pith review Pith/arXiv arXiv 2021

[31] [32]

1985.Numerical solution of partial differential equations: finite difference methods

Gordon D Smith. 1985.Numerical solution of partial differential equations: finite difference methods. Oxford university press

1985

[32] [33]

Gowthami Somepalli, Vasu Singla, Micah Goldblum, Jonas Geiping, and Tom Goldstein. 2023. Understanding and mitigating copying in diffusion models. Advances in Neural Information Processing Systems36 (2023), 47783–47803

2023

[33] [34]

Jiaming Song, Chenlin Meng, and Stefano Ermon. 2022. Denoising Diffusion Implicit Models. arXiv:2010.02502 [cs.LG] https://arxiv.org/abs/2010.02502

work page internal anchor Pith review Pith/arXiv arXiv 2022

[34] [35]

Yang Song, Jascha Sohl-Dickstein, Diederik P Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole. 2020. Score-based generative modeling through stochastic differential equations.arXiv preprint arXiv:2011.13456(2020)

work page internal anchor Pith review Pith/arXiv arXiv 2020

[35] [36]

Ryan Webster. 2023. A reproducible extraction of training images from diffusion models.arXiv preprint arXiv:2305.08694(2023)

work page arXiv 2023

[36] [37]

transfer part

Yuxin Wen, Yuchen Liu, Chen Chen, and Lingjuan Lyu. 2024. Detecting, explain- ing, and mitigating memorization in diffusion models. InThe Twelfth International Conference on Learning Representations. A Detailed Proofs In this section, we provide detailed proofs for Theorem 1 and 3 presented in the main text. Theorem 1 (Normal Trajectories Stability).Let 𝛿...

work page arXiv 2024