Speaker-aware modeling of conversational timing using per-speaker deviation distributions, Markov turn-taking, and unified KDE gap modeling improves alignment with real Switchboard patterns over independence-based baselines.
From Independence to Interaction: Speaker-Aware Simulation of Multi-Speaker Conversational Timing
1 Pith paper cite this work. Polarity classification is still indexing.
abstract
We present a speaker-aware approach for simulating multi-speaker conversations that captures temporal consistency and realistic turn-taking dynamics. Prior work typically models aggregate conversational statistics under an independence assumption across speakers and turns. In contrast, our method uses speaker-specific deviation distributions enforcing intra-speaker temporal consistency, while a Markov chain governs turn-taking and a fixed room impulse response preserves spatial realism. We also unify pauses and overlaps into a single gap distribution, modeled with kernel density estimation for smooth continuity. Evaluation on Switchboard using intrinsic metrics - global gap statistics, correlations between consecutive gaps, copula-based higher-order dependencies, turn-taking entropy, and gap survival functions - shows that speaker-aware simulation better aligns with real conversational patterns than the baseline method, capturing fine-grained temporal dependencies and realistic speaker alternation, while revealing open challenges in modeling long-range conversational structure.
fields
cs.SD 1years
2025 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
From Independence to Interaction: Speaker-Aware Simulation of Multi-Speaker Conversational Timing
Speaker-aware modeling of conversational timing using per-speaker deviation distributions, Markov turn-taking, and unified KDE gap modeling improves alignment with real Switchboard patterns over independence-based baselines.