OeMDM unifies masked diffusion, autoregressive, and block diffusion models under various generation orders; LoMDM jointly optimizes ordering and diffusion backbone from scratch and outperforms prior discrete diffusion models on language benchmarks.
αmdlm(t) 1−α mdlm(t) logp θ,bd3lm(xb |z b t ,x <b) # (80) = BX b=1 Et∼Unif[0,1] Ezb t ∼qαmdlm(·|xb)
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.LG 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Unifying Masked Diffusion Models with Various Generation Orders and Beyond
OeMDM unifies masked diffusion, autoregressive, and block diffusion models under various generation orders; LoMDM jointly optimizes ordering and diffusion backbone from scratch and outperforms prior discrete diffusion models on language benchmarks.