SRD provides a threshold-independent, representation-level privacy assessment for voice anonymization that reveals system weaknesses not detected by equal error rate evaluation.
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
fields
eess.AS 3representative citing papers
Listeners detect automatic anonymization in pathological speech at 91-93% accuracy with a 30-point perceived quality drop, yet clinical severity ratings stay nearly unchanged for dysarthria, dysglossia, and dysphonia.
A self-supervised prosody encoder with speaker disentanglement strategies outperforms raw prosody and HuBERT baselines on pitch reconstruction and prosodic event detection while achieving strong speaker separation.
citing papers explorer
-
Evaluating voice anonymisation using similarity rank disclosure
SRD provides a threshold-independent, representation-level privacy assessment for voice anonymization that reveals system weaknesses not detected by equal error rate evaluation.
-
Privacy-preserving Prosody Representation Learning
A self-supervised prosody encoder with speaker disentanglement strategies outperforms raw prosody and HuBERT baselines on pitch reconstruction and prosodic event detection while achieving strong speaker separation.