Emotional face-to-speech. arXiv preprint arXiv:2502.01046, 2025

Jiaxin Ye, Boyuan Cao, Hongming Shan · 2025 · arXiv 2502.01046

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Archon: A Unified Multimodal Model for Holistic Digital Human Generation

cs.CV · 2026-05-28 · unverdicted · novelty 5.0

Archon unifies seven modalities via modality-specific tokenizers and an autoregressive backbone pretrained on 72 tasks, plus a 4x-efficient video reparameterization and stepwise 'Thinking in Modality' procedure, and reports superior or comparable results on digital-human tasks.

citing papers explorer

Showing 1 of 1 citing paper.

Archon: A Unified Multimodal Model for Holistic Digital Human Generation · cs.CV · 2026-05-28 · unverdicted · none · ref 59
