Generalization of Diffusion Models Arises with a Balanced Representation Space

Lianghe Shi; Meng Wu; Molei Tao; Qing Qu; Xiang Li; Xiao Li; Zekai Zhang

arxiv: 2512.20963 · v3 · pith:4ZCAS523new · submitted 2025-12-24 · 💻 cs.LG · cs.CV

Generalization of Diffusion Models Arises with a Balanced Representation Space

Zekai Zhang , Xiao Li , Xiang Li , Lianghe Shi , Meng Wu , Molei Tao , Qing Qu This is my paper

classification 💻 cs.LG cs.CV

keywords modelsdiffusionrepresentationgeneralizationmemorizationrepresentationstrainingarises

0 comments

read the original abstract

Diffusion models excel at generating high-quality, diverse samples, yet they risk memorizing training data when overfit to the training objective. We analyze the distinctions between memorization and generalization in diffusion models through the lens of representation learning. By investigating a two-layer ReLU denoising autoencoder (DAE), we prove that (i) memorization corresponds to the model storing raw training samples in the learned weights for encoding and decoding, yielding localized spiky representations, whereas (ii) generalization arises when the model captures local data statistics, producing balanced representations. Furthermore, we validate these theoretical findings on real-world unconditional and text-to-image diffusion models, demonstrating that the same representation structures emerge in deep generative models with significant practical implications. Building on these insights, we propose a representation-based method for detecting memorization and a training-free editing technique that allows precise control via representation steering. Together, our results highlight that learning good representations is central to novel and meaningful generative modeling.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Proximal-Based Generative Modeling for Bayesian Inverse Problems
math.OC 2026-05 unverdicted novelty 7.0

PGM replaces the intractable likelihood score in diffusion models with a closed-form Moreau score computed via proximal operators, enabling non-asymptotic sampling for inverse problems trained only on prior data.
A Theoretical Analysis of Memory and Overfitting Phenomena in Stochastic Interpolation Models
cs.LG 2026-06 unverdicted novelty 5.0

In the oracle continuous-time setting, stochastic interpolation models recover training samples exactly, with deviations controlled by discretization and estimation errors, leading to theoretical definitions of overfi...
Robust and Generalizable Safety Steering for Text-to-Image Diffusion Transformers
cs.AI 2026-05 unverdicted novelty 4.0

SafeDIG applies position-aware sparse feature transfer via SAEs in DiT models to reduce unsafe generations in target risk domains on FLUX.1 Dev and SD 3.5 while keeping source safety and quality.
Principles and Practice of Deep Representation Learning: or a Mathematical Theory of Memory
cs.LG 2026-06 unverdicted novelty 3.0

The book presents principles from optimization and information theory to explain deep network architectures and enable new interpretable models.