PersonaGest uses a semantic-guided RVQ-VAE with a Semantic-Aware Motion Codebook and contrastive learning in stage one, followed by a Masked Generative Transformer and Style Residual Transformers in stage two, to achieve state-of-the-art co-speech gesture generation with semantic coherence and style
Mimicparts: Part- aware style injection for speech-driven 3d motion generation
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
verdicts
UNVERDICTED 2roles
background 2polarities
background 2representative citing papers
ViBES introduces a speech-language-behavior model using modality-specific transformer experts that jointly generates dialogue and 3D body actions, showing gains over separate co-speech and text-to-motion baselines on multi-turn metrics.
citing papers explorer
-
PersonaGest: Personalized Co-Speech Gesture Generation with Semantic-Guided Hierarchical Motion Representation
PersonaGest uses a semantic-guided RVQ-VAE with a Semantic-Aware Motion Codebook and contrastive learning in stage one, followed by a Masked Generative Transformer and Style Residual Transformers in stage two, to achieve state-of-the-art co-speech gesture generation with semantic coherence and style
-
ViBES: A Conversational Agent with Behaviorally-Intelligent 3D Virtual Body
ViBES introduces a speech-language-behavior model using modality-specific transformer experts that jointly generates dialogue and 3D body actions, showing gains over separate co-speech and text-to-motion baselines on multi-turn metrics.