pith. machine review for the scientific record. sign in

The unification of representation learning and generative modelling

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

eess.AS 1

years

2026 1

verdicts

UNVERDICTED 1

clear filters

representative citing papers

PoDAR: Power-Disentangled Audio Representation for Generative Modeling

eess.AS · 2026-05-11 · unverdicted · novelty 6.0

PoDAR disentangles audio signal power from semantic content in latents using power augmentation and consistency objectives, yielding 2x faster convergence and gains of 0.055 speaker similarity and 0.22 UTMOS when applied to Stable Audio VAE with F5-TTS.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • PoDAR: Power-Disentangled Audio Representation for Generative Modeling eess.AS · 2026-05-11 · unverdicted · none · ref 14

    PoDAR disentangles audio signal power from semantic content in latents using power augmentation and consistency objectives, yielding 2x faster convergence and gains of 0.055 speaker similarity and 0.22 UTMOS when applied to Stable Audio VAE with F5-TTS.