Cstr vctk corpus: English multi-speaker corpus for cstr voice cloning toolkit

· 2019 · DOI 10.7488/ds/2645

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

open at publisher browse 5 citing papers

representative citing papers

Ethical and Technical Limits of Deepfake Speech Datasets

cs.SD · 2026-06-09 · unverdicted · novelty 6.0

Audit of 39 deepfake speech datasets shows most lack demographic metadata making fairness checks infeasible and reveals substantial overlap in bona fide sources that undermines cross-dataset generalization claims.

Acoustic Interference: A New Paradigm Weaponizing Acoustic Latent Semantic for Universal Jailbreak against Large Audio Language Models

cs.CR · 2026-05-18 · unverdicted · novelty 6.0

AIA generates universal interference audio infused with Acoustic Latent Semantics to bypass LALM safety alignment, achieving SOTA attack success rates on 10 models across five datasets.

Streaming T5-based Text-to-Speech Synthesis with Limited Lookahead

cs.SD · 2026-06-20 · unverdicted · novelty 5.0

S5-TTS introduces a streaming T5-TTS variant with lookahead-causal masking and interleaved multi-source distillation that achieves comparable quality to full-context models while cutting end-to-end latency.

Single frequency filtering based multi-speaker direction of arrival estimation from stereo recordings

eess.AS · 2026-06-15 · unverdicted · novelty 4.0

An SFF-based DoA estimator using PHAT-weighted GCC on envelopes performs comparably or better than GCC methods on real reverberant multi-speaker recordings.

Robust Soft-Constrained Spatially Selective Active Noise Control for Hearables Under Secondary Path Variations

eess.AS · 2026-05-17 · unverdicted · novelty 4.0

A robust soft-constrained optimization framework for spatially selective active noise control that minimizes average cost over a set of secondary path estimates from human measurements to reduce performance variation under mismatch.

citing papers explorer

Showing 5 of 5 citing papers after filters.

Ethical and Technical Limits of Deepfake Speech Datasets cs.SD · 2026-06-09 · unverdicted · none · ref 76
Audit of 39 deepfake speech datasets shows most lack demographic metadata making fairness checks infeasible and reveals substantial overlap in bona fide sources that undermines cross-dataset generalization claims.
Acoustic Interference: A New Paradigm Weaponizing Acoustic Latent Semantic for Universal Jailbreak against Large Audio Language Models cs.CR · 2026-05-18 · unverdicted · none · ref 50
AIA generates universal interference audio infused with Acoustic Latent Semantics to bypass LALM safety alignment, achieving SOTA attack success rates on 10 models across five datasets.
Streaming T5-based Text-to-Speech Synthesis with Limited Lookahead cs.SD · 2026-06-20 · unverdicted · none · ref 50
S5-TTS introduces a streaming T5-TTS variant with lookahead-causal masking and interleaved multi-source distillation that achieves comparable quality to full-context models while cutting end-to-end latency.
Single frequency filtering based multi-speaker direction of arrival estimation from stereo recordings eess.AS · 2026-06-15 · unverdicted · none · ref 27
An SFF-based DoA estimator using PHAT-weighted GCC on envelopes performs comparably or better than GCC methods on real reverberant multi-speaker recordings.
Robust Soft-Constrained Spatially Selective Active Noise Control for Hearables Under Secondary Path Variations eess.AS · 2026-05-17 · unverdicted · none · ref 26
A robust soft-constrained optimization framework for spatially selective active noise control that minimizes average cost over a set of secondary path estimates from human measurements to reduce performance variation under mismatch.

Cstr vctk corpus: English multi-speaker corpus for cstr voice cloning toolkit

fields

years

verdicts

representative citing papers

citing papers explorer