MACE-Dance generates music-driven dance videos via cascaded motion and appearance experts, claiming SOTA results on a new benchmark dataset.
Fabian Mentzer, David Minnen, Eirikur Agustsson, and Michael Tschannen
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CV 2verdicts
UNVERDICTED 2representative citing papers
EchoTorrent combines multi-teacher distillation, adaptive CFG calibration, hybrid long-tail forcing, and VAE decoder refinement to enable few-pass autoregressive streaming video generation with improved temporal consistency and audio-lip sync.
citing papers explorer
-
MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation
MACE-Dance generates music-driven dance videos via cascaded motion and appearance experts, claiming SOTA results on a new benchmark dataset.
-
EchoTorrent: Towards Swift, Sustained, and Streaming Multi-Modal Video Generation
EchoTorrent combines multi-teacher distillation, adaptive CFG calibration, hybrid long-tail forcing, and VAE decoder refinement to enable few-pass autoregressive streaming video generation with improved temporal consistency and audio-lip sync.