Momentum Guidance: Plug-and-Play Guidance for Flow Models

Baiyu Su; Chi Zhang; Jian Yu; Lizhang Chen; Qiang Liu; Runlong Liao

arxiv: 2602.20360 · v2 · pith:3Z4RIBAPnew · submitted 2026-02-23 · 💻 cs.LG · cs.CV

Momentum Guidance: Plug-and-Play Guidance for Flow Models

Runlong Liao , Jian Yu , Baiyu Su , Chi Zhang , Lizhang Chen , Qiang Liu This is my paper

classification 💻 cs.LG cs.CV

keywords guidanceacrossmodelsaverageflowflow-basedgenerationimproves

0 comments

read the original abstract

Flow-based generative methods offer a simple and effective framework for high-fidelity generation, yet pretrained flow models are rarely used in their vanilla conditional form: in image generation, samples without guidance often appear diffuse and lack fine-grained detail. Existing guidance techniques such as classifier-free guidance (CFG) improve fidelity but reduce sample diversity. We introduce Momentum Guidance (MG), a guidance method that improves sample quality by extrapolating the current velocity away from an exponential moving average of past velocities along the ODE trajectory, while preserving the standard one-evaluation-per-step cost. MG provides gains beyond CFG, improving the precision-recall Pareto frontier. Experiments demonstrate the effectiveness of MG across benchmarks. On ImageNet-256, MG improves FID by 36.54% without CFG and 25.42% with CFG on average across sampling settings, attaining an FID of 1.553 at 16 sampling steps. Evaluations on large flow-based models, including Stable Diffusion 3 and FLUX.1-dev, further confirm improvements across standard metrics.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Training-Free Looped Transformers
cs.LG 2026-05 unverdicted novelty 7.0

Training-free looped transformers retrofit recurrence to frozen models via damped ODE sub-steps on mid-stack blocks, yielding gains such as +2.64 pp on MMLU-Pro for Qwen3-4B.