Critical windows: non-asymptotic theory for feature emergence in diffusion models

Marvin Li; Sitan Chen

arxiv: 2403.01633 · v2 · pith:UEESKDS3new · submitted 2024-03-03 · cs.LG · cs.CV · stat.ML

classification cs.LG · cs.CV · stat.ML

keywords diffusion · windows · models · bounds · critical · image · experiments · features

We develop theory to understand an intriguing property of diffusion models for image generation that we term critical windows. Empirically, it has been observed that there are narrow time intervals in sampling during which particular features of the final image emerge, e.g. the image class or background color (Ho et al., 2020b; Meng et al., 2022; Choi et al., 2022; Raya & Ambrogioni, 2023; Georgiev et al., 2023; Sclocchi et al., 2024; Biroli et al., 2024). While this is advantageous for interpretability as it implies one can localize properties of the generation to a small segment of the trajectory, it seems at odds with the continuous nature of the diffusion. We propose a formal framework for studying these windows and show that for data coming from a mixture of strongly log-concave densities, these windows can be provably bounded in terms of certain measures of inter- and intra-group separation. We also instantiate these bounds for concrete examples like well-conditioned Gaussian mixtures. Finally, we use our bounds to give a rigorous interpretation of diffusion models as hierarchical samplers that progressively "decide" output features over a discrete sequence of times. We validate our bounds with synthetic experiments. Additionally, preliminary experiments on Stable Diffusion suggest critical windows may serve as a useful tool for diagnosing fairness and privacy violations in real-world diffusion models.
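
As a toy illustration of the kind of quantity such bounds control (a sketch under simplifying assumptions, not the paper's definitions or results), take a balanced mixture of two one-dimensional, unit-variance Gaussians centered at +mu and -mu, and noise it with an Ornstein-Uhlenbeck forward process x_t = e^{-t} x_0 + sqrt(1 - e^{-2t}) z. Each noised component keeps unit variance while its mean shrinks to ±mu e^{-t}, so the total variation distance between the two components has the closed form erf(mu e^{-t} / sqrt(2)). The stretch of noise time over which this distance falls from near 1 to near 0 is a rough proxy for the window in which the sign (the "class") is decided. The forward process, the threshold `eps`, and all function names below are assumptions chosen for illustration.

```python
import math


def tv_between_noised_components(mu: float, t: float) -> float:
    """TV distance between N(+m, 1) and N(-m, 1), where m = mu * exp(-t).

    Assumes an OU forward process and unit-variance components, so each
    component keeps variance 1 and only its mean shrinks by exp(-t).
    """
    m = mu * math.exp(-t)
    return math.erf(m / math.sqrt(2.0))


def time_at_tv_level(mu: float, level: float, t_max: float = 50.0) -> float:
    """Find t in [0, t_max] where the TV equals `level` (TV decreases in t)."""
    if not 0.0 < level < tv_between_noised_components(mu, 0.0):
        raise ValueError("level must lie strictly between 0 and the TV at t = 0")
    lo, hi = 0.0, t_max
    for _ in range(200):  # plain bisection; far more steps than needed
        mid = 0.5 * (lo + hi)
        if tv_between_noised_components(mu, mid) > level:
            lo = mid  # components still too distinguishable: look later
        else:
            hi = mid
    return 0.5 * (lo + hi)


def toy_window(mu: float, eps: float = 0.01) -> tuple[float, float]:
    """Return (t_start, t_end): forward times where TV is 1 - eps and eps."""
    return time_at_tv_level(mu, 1.0 - eps), time_at_tv_level(mu, eps)


if __name__ == "__main__":
    start, end = toy_window(mu=10.0)
    print(f"toy window in forward time: [{start:.3f}, {end:.3f}]")
```

In this toy setting the width of the interval does not depend on mu: increasing the separation only shifts the interval later in forward time by log(mu), which is one simple way to see how separation between groups determines where such a window sits.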

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Towards More General Control of Diffusion Models Using Jeffrey Guidance
cs.LG · 2026-06 · unverdicted · novelty 7.0
Jeffrey guidance applies Jeffrey's rule of conditioning to diffusion models to target prescribed marginal distributions while preserving conditional structure, demonstrated via embedding matching and fairness enforcement.

Local Diffusion Models and Phases of Data Distributions
cs.LG · 2025-08 · unverdicted · novelty 6.0
The paper introduces a phase framework for data distributions connected by local denoisers and demonstrates that reverse diffusion consists of trivial and data phases separated by a transition where local score functi...

Statistical Properties of Training & Generalization
stat.ML · 2026-06 · unverdicted · novelty 2.0
Neural scaling laws in deep learning interact with physics constraints and inductive biases beyond classical statistics.

Statistical Properties of Training & Generalization
stat.ML · 2026-06 · unverdicted · novelty 1.0
Review of neural scaling laws and their relation to constraints and inductive biases when applying machine learning to physics problems.