Diffusion reconstruction creates hard samples for audio deepfake detection training, and when paired with feature aggregation and RACL, it reduces average EER versus baselines.
Semanticodec: An ultra low bitrate semantic audio codec for general sound
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
Classical codecs prove more robust to noise than neural codecs, speech enhancement significantly helps noise-affected codecs, and listening effort plus ASR-based metrics add useful nuance beyond basic intelligibility scores.
citing papers explorer
-
Diffusion Reconstruction towards Generalizable Audio Deepfake Detection
Diffusion reconstruction creates hard samples for audio deepfake detection training, and when paired with feature aggregation and RACL, it reduces average EER versus baselines.
-
Assessing the Impact of Noise and Speech Enhancement on the Intelligibility of Speech Codecs
Classical codecs prove more robust to noise than neural codecs, speech enhancement significantly helps noise-affected codecs, and listening effort plus ASR-based metrics add useful nuance beyond basic intelligibility scores.