Preference Alignment on Diffusion Model: A Comprehensive Survey for Image Generation and Editing

Chi Xing; Gaojie Jin; Guangliang Cheng; Jianhong Wang; Lijun Zhang; Sihao Wu; Xiaonan Si; Xiaowei Huang

arxiv: 2502.07829 · v1 · pith:GCSVOLWEnew · submitted 2025-02-10 · 💻 cs.CV · cs.LG

Sihao Wu, Xiaonan Si, Chi Xing, Jianhong Wang, Gaojie Jin, Guangliang Cheng, Lijun Zhang, Xiaowei Huang

keywords preference · alignment · diffusion · editing · generation · image · models · aligning

Abstract

The integration of preference alignment with diffusion models (DMs) has emerged as a transformative approach to enhance image generation and editing capabilities. Although integrating diffusion models with preference alignment strategies poses significant challenges for novices at this intersection, comprehensive and systematic reviews of this subject are still notably lacking. To bridge this gap, this paper extensively surveys preference alignment with diffusion models in image generation and editing. First, we systematically review cutting-edge optimization techniques such as reinforcement learning with human feedback (RLHF), direct preference optimization (DPO), and others, highlighting their pivotal role in aligning preferences with DMs. Then, we thoroughly explore the applications of aligning preferences with DMs in autonomous driving, medical imaging, robotics, and more. Finally, we comprehensively discuss the challenges of preference alignment with DMs. To our knowledge, this is the first survey centered on preference alignment with DMs, providing insights to drive future innovation in this dynamic area.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Follow-Your-Preference++: Rethinking Preference Alignment for Image Inpainting
cs.CV 2026-06 unverdicted novelty 3.0

Empirical study shows reward model ensembles mitigate biases like brightness and composition in preference data for image inpainting, yielding better performance than prior methods without architecture changes.

Reinforcement Learning for Scalable and Trustworthy Intelligent Systems
cs.LG 2026-05 unverdicted novelty 3.0

Reinforcement learning is advanced for communication-efficient federated optimization and for preference-aligned, contextually safe policies in large language models.