MixFake is a new benchmark for mixed-authenticity audio and a multi-stream prompt tuning method achieves 0.95% EER foreground and 7.72% absolute gain in complex background deepfake detection.
Detect All-Type Deepfake audio: Wavelet prompt tuning
2 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.SD 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
EnvTriCascade is a tri-stage cascaded framework using mix-consistency detection followed by dual SSL-based five-class classifiers with cross-branch attention and RawBoost augmentation, achieving 0.8266 Macro-F1 on the ESDD2 2026 challenge test set.
citing papers explorer
-
MixFake: Benchmarking and Enhancing Audio Deepfake Detection in Diverse Real-world Mixed Audio
MixFake is a new benchmark for mixed-authenticity audio and a multi-stream prompt tuning method achieves 0.95% EER foreground and 7.72% absolute gain in complex background deepfake detection.
-
EnvTriCascade: An Environment-Aware Tri-Stage Cascaded Framework for ESDD2 2026 Challenge
EnvTriCascade is a tri-stage cascaded framework using mix-consistency detection followed by dual SSL-based five-class classifiers with cross-branch attention and RawBoost augmentation, achieving 0.8266 Macro-F1 on the ESDD2 2026 challenge test set.