From Independence to Interaction: Speaker-Aware Simulation of Multi-Speaker Conversational Timing

· 2025 · cs.SD · arXiv 2509.15808

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

We present a speaker-aware approach for simulating multi-speaker conversations that captures temporal consistency and realistic turn-taking dynamics. Prior work typically models aggregate conversational statistics under an independence assumption across speakers and turns. In contrast, our method uses speaker-specific deviation distributions enforcing intra-speaker temporal consistency, while a Markov chain governs turn-taking and a fixed room impulse response preserves spatial realism. We also unify pauses and overlaps into a single gap distribution, modeled with kernel density estimation for smooth continuity. Evaluation on Switchboard using intrinsic metrics - global gap statistics, correlations between consecutive gaps, copula-based higher-order dependencies, turn-taking entropy, and gap survival functions - shows that speaker-aware simulation better aligns with real conversational patterns than the baseline method, capturing fine-grained temporal dependencies and realistic speaker alternation, while revealing open challenges in modeling long-range conversational structure.

representative citing papers

From Independence to Interaction: Speaker-Aware Simulation of Multi-Speaker Conversational Timing

cs.SD · 2025-09-19 · unverdicted · novelty 6.0

Speaker-aware modeling of conversational timing using per-speaker deviation distributions, Markov turn-taking, and unified KDE gap modeling improves alignment with real Switchboard patterns over independence-based baselines.

citing papers explorer

Showing 1 of 1 citing paper.

From Independence to Interaction: Speaker-Aware Simulation of Multi-Speaker Conversational Timing cs.SD · 2025-09-19 · unverdicted · none · ref 1 · internal anchor
Speaker-aware modeling of conversational timing using per-speaker deviation distributions, Markov turn-taking, and unified KDE gap modeling improves alignment with real Switchboard patterns over independence-based baselines.

From Independence to Interaction: Speaker-Aware Simulation of Multi-Speaker Conversational Timing

fields

years

verdicts

representative citing papers

citing papers explorer