Audio2gestures: Generating diverse gestures from speech au- dio with conditional variational autoencoders

· 2021

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Gelina: Unified Speech and Gesture Synthesis via Interleaved Token Prediction

cs.SD · 2025-10-13 · unverdicted · novelty 6.0

A unified discrete autoregressive model for joint text-to-speech and co-speech gesture synthesis via interleaved token sequences and modality-specific decoders.

citing papers explorer

Showing 1 of 1 citing paper.

Gelina: Unified Speech and Gesture Synthesis via Interleaved Token Prediction cs.SD · 2025-10-13 · unverdicted · none · ref 41
A unified discrete autoregressive model for joint text-to-speech and co-speech gesture synthesis via interleaved token sequences and modality-specific decoders.

Audio2gestures: Generating diverse gestures from speech au- dio with conditional variational autoencoders

fields

years

verdicts

representative citing papers

citing papers explorer