SpecMaskGIT: Masked generative modeling of audio spectro- grams for efficient audio synthesis and beyond.arXiv preprint arXiv:2406.17672

Marco Comunità, Zhi Zhong, Akira Takahashi, Shiqi Yang, Mengjie Zhao, Koichi Saito, Yukara Ikemiya, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji · 2024 · arXiv 2406.17672

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

representative citing papers

Demystifying MaskGIT Sampler and Beyond: Adaptive Order Selection in Masked Diffusion

cs.LG · 2025-10-06 · unverdicted · novelty 7.0

Theoretical analysis reveals MaskGIT's implicit temperature sampling in masked diffusion; proposes equivalent moment sampler and efficiency techniques for adaptive unmasking with image and text experiments.

Fixed-Point Masked Generative Modeling

cs.LG · 2026-05-29 · unverdicted · novelty 6.0

FP-MGMs with consistency loss and three-state reuse (CoFRe) reduce parameters by up to 38.8% and improve low-budget perplexity and FID versus standard masked generative models on text and images.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Fixed-Point Masked Generative Modeling cs.LG · 2026-05-29 · unverdicted · none · ref 10
FP-MGMs with consistency loss and three-state reuse (CoFRe) reduce parameters by up to 38.8% and improve low-budget perplexity and FID versus standard masked generative models on text and images.

SpecMaskGIT: Masked generative modeling of audio spectro- grams for efficient audio synthesis and beyond.arXiv preprint arXiv:2406.17672

fields

years

verdicts

representative citing papers

citing papers explorer