Breaking the Curse of Dimensionality: Diffusion Models Efficiently Learn Low-Dimensional Distributions

Huijie Zhang; Peng Wang; Qing Qu; Siyi Chen; Yi Ma; Zekai Zhang

arxiv: 2409.02426 · v5 · pith:F4DKGRLNnew · submitted 2024-09-04 · 💻 cs.LG · cs.CV

Breaking the Curse of Dimensionality: Diffusion Models Efficiently Learn Low-Dimensional Distributions

Peng Wang , Huijie Zhang , Zekai Zhang , Siyi Chen , Yi Ma , Qing Qu This is my paper

classification 💻 cs.LG cs.CV

keywords datadiffusionimagemodelsdistributionslearnlow-dimensionalsubspace

0 comments

read the original abstract

Despite their empirical success across a wide range of generative tasks, the fundamental principles underlying the ability of diffusion models to learn data distributions are poorly understood. In this work, we develop a new mathematical framework that explains how diffusion models can effectively learn low-dimensional distributions from a finite number of training samples without suffering from the curse of dimensionality. Specifically, motivated by the intrinsic low-dimensional structure of image data, we theoretically analyze a setting in which the data distribution is modeled as a mixture of low-rank Gaussians. Under suitable network parameterization, we show that optimizing the training objective of diffusion models is equivalent to solving the canonical subspace clustering problem over the training samples, where each subspace basis corresponds to the low-rank covariance of a Gaussian component. This equivalence allows us to show that the sample complexity for learning the underlying distribution scales linearly with the intrinsic dimension of the data, rather than exponentially with the ambient dimension. Our theoretical findings are further supported by empirical evidence that demonstrates phase transition phenomena in generalization on both synthetic and real-world image datasets. Moreover, we establish a correspondence between the learned subspace bases and semantic attributes of image data, providing a principled foundation for controllable image generation.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Transformers Learn the Optimal DDPM Denoiser for Multi-Token GMMs
cs.LG 2026-04 unverdicted novelty 8.0

Transformers converge globally to the optimal DDPM denoiser for multi-token GMMs via self-attention mean denoising, with explicit token and iteration requirements.
Intrinsic Wasserstein Rates for Score-Based Generative Models on Smooth Manifolds
cs.LG 2026-05 unverdicted novelty 7.0

Score-based generative models attain intrinsic Wasserstein-1 sample rates of order n to the power of -(beta+1)/(d+2beta) on d-dimensional smooth manifolds with beta-Holder densities.
The Interplay of Data Structure and Imbalance in the Learning Dynamics of Diffusion Models
stat.ML 2026-05 unverdicted novelty 7.0

Higher-variance classes are learned first in diffusion models; strong class imbalance reverses the order and imposes distinct delayed learning times on minority classes.