pith. machine review for the scientific record. sign in

arxiv: 2601.00090 · v2 · submitted 2025-12-31 · 💻 cs.CV · cs.LG

Recognition: unknown

It's Never Too Late: Noise Optimization for Collapse Recovery in Trained Diffusion Models

Anne Harrington , A. Sophia Koepke , Shyamgopal Karthik , Trevor Darrell , Alexei A. Efros

Authors on Pith no claims yet
classification 💻 cs.CV cs.LG
keywords noiseoptimizationcollapsedifferentdiversityfrequencymodemodel
0
0 comments X
read the original abstract

Contemporary text-to-image models exhibit a surprising degree of mode collapse, as can be seen when sampling several images given the same text prompt. Previous work has attempted to address this issue by steering the model using guidance mechanisms, or by generating a large pool of candidates and refining them. In this work, we take a different direction and aim for diversity in generations via noise optimization. Specifically, we show that a simple noise optimization objective can mitigate mode collapse while preserving the fidelity of the base model. We also analyze the frequency characteristics of the noise and show that alternative noise initializations with different frequency profiles can improve both optimization and search. Our experiments demonstrate that noise optimization yields superior results in terms of generation quality and diversity.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. STRIDE: Training-Free Diversity Guidance via PCA-Directed Feature Perturbation in Single-Step Diffusion Models

    cs.CV 2026-05 unverdicted novelty 7.0

    STRIDE boosts diversity in one-step diffusion models by injecting PCA-aligned pink noise into transformer features while preserving text alignment and quality.

  2. Couple to Control: Joint Initial Noise Design in Diffusion Models

    cs.LG 2026-05 unverdicted novelty 6.0

    Coupled initial noises in diffusion models, with designed dependence but unchanged marginal Gaussians, improve generated image diversity on Stable Diffusion variants while preserving quality and alignment.

  3. Diverse Sampling in Diffusion Models with Marginal Preserving Particle Guidance

    cs.LG 2026-05 unverdicted novelty 5.0

    EDDY adds diversity to diffusion-model samples by using kernel-based anti-symmetric pairwise drifts that preserve marginal distributions via Fokker-Planck symmetries, with practical approximations for expensive cases.