Diffusion model alignment using direct preference optimization

7 Pith papers cite this work. Polarity classification is still indexing.

fields

cs.CV: 6 · cs.LG: 1

years

2026: 4 · 2025: 3

roles

baseline 1

polarities

baseline 1

representative citing papers

DanceGRPO: Unleashing GRPO on Visual Generation

cs.CV · 2025-05-12 · unverdicted · novelty 6.0

DanceGRPO applies GRPO to visual generation tasks to achieve stable policy optimization across diffusion models, rectified flows, multiple tasks, and diverse reward models, outperforming prior RL methods.

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

cs.CV · 2025-03-10 · unverdicted · novelty 6.0

Seedream 2.0 is a native Chinese-English bilingual diffusion model that integrates a self-developed LLM text encoder, Glyph-Aligned ByT5, and Scaled ROPE to reach claimed state-of-the-art results in prompt following, aesthetics, text rendering, and human preference alignment via RLHF.
