hub

Audio Deepfake detection: A survey

· 2023 · arXiv 2308.14970

17 Pith papers cite this work. Polarity classification is still indexing.

17 Pith papers citing it

read on arXiv browse 17 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 3

citation-polarity summary

background 3

representative citing papers

Probing-Guided Layer Selection from Self-Supervised Speech Models for Generalizable Audio Deepfake Detection

cs.SD · 2026-06-29 · unverdicted · novelty 7.0

Probing-guided selection of depth zones from frozen SSL speech models yields compact classifiers with 28% relative EER improvement on cross-domain deepfake detection tasks.

MixFake: Benchmarking and Enhancing Audio Deepfake Detection in Diverse Real-world Mixed Audio

cs.SD · 2026-05-22 · unverdicted · novelty 7.0

MixFake is a new benchmark for mixed-authenticity audio and a multi-stream prompt tuning method achieves 0.95% EER foreground and 7.72% absolute gain in complex background deepfake detection.

Toward Fine-Grained Speech Inpainting Forensics:A Dataset, Method, and Metric for Multi-Region Tampering Localization

cs.SD · 2026-05-04 · unverdicted · novelty 7.0

A new dataset, iterative coarse-to-fine localization framework, and segment-level IoU F1 metric tackle the open problem of detecting multiple unknown word-level inpainted regions in speech.

Indic-CodecFake meets SATYAM: Towards Detecting Neural Audio Codec Synthesized Speech Deepfakes in Indic Languages

eess.AS · 2026-04-21 · unverdicted · novelty 7.0

Introduces the Indic-CodecFake dataset for Indic codec deepfakes and SATYAM, a novel hyperbolic ALM that outperforms baselines through dual-stage semantic-prosodic fusion using Bhattacharya distance.

ArtifactNet: Detecting AI-Generated Music via Forensic Residual Physics

cs.SD · 2026-04-17 · unverdicted · novelty 7.0

ArtifactNet extracts codec residuals from spectrograms with a 4M-parameter network to detect AI music at F1=0.9829 and 1.49% FPR on unseen tracks from 22 generators, outperforming larger baselines.

Listening Deepfake Detection: A New Perspective Beyond Speaking-Centric Forgery Analysis

cs.CV · 2026-04-14 · conditional · novelty 7.0

Introduces the LDD task, ListenForge dataset built from five listening head generation methods, and MANet model that detects listening forgeries via motion inconsistencies guided by audio semantics.

DetectZoo: A Unified Toolkit for AI-Generated Content Detection Across Text, Audio, and Image Modalities

cs.MM · 2026-06-02 · unverdicted · novelty 6.0

DetectZoo is a unified toolkit providing reference implementations of 61 detectors, native loaders for 22 benchmark datasets, and a standardized evaluation pipeline for AI-generated content detection across text, audio, and image modalities.

Asymmetric Phase Coding Audio Watermarking

cs.CR · 2026-05-08 · unverdicted · novelty 6.0

APC embeds compact Ed25519 signatures into audio phase data with error correction to achieve 97.5-98.3% cryptographic verification under eight attack types at mean PESQ 3.02.

Phoneme-Level Deepfake Detection Across Emotional Conditions Using Self-Supervised Embeddings

cs.SD · 2026-05-04 · unverdicted · novelty 6.0

Phoneme-level analysis using self-supervised embeddings identifies higher divergence in complex vowels and fricatives for emotional voice conversion deepfakes, enabling more interpretable detection across emotions.

Split and Conquer Partial Deepfake Speech

cs.SD · 2026-04-03 · unverdicted · novelty 6.0

A two-stage boundary detection plus segment classification method with multi-length training achieves state-of-the-art results for detecting and localizing partial deepfakes on PartialSpoof and Half-Truth benchmarks.

Synthetic Trust Attacks: Modeling How Generative AI Manipulates Human Decisions in Social Engineering Fraud

cs.CR · 2026-04-02 · unverdicted · novelty 6.0

The paper proposes Synthetic Trust Attacks (STAs) as a formal threat model with an eight-stage attack chain (STAM) that shifts defense focus from detecting synthetic media to protecting human decision processes in social engineering.

AuthGlass: Benchmarking Voice Liveness Detection and Authentication on Smart Glasses via Comprehensive Acoustic Features

cs.HC · 2025-09-25 · unverdicted · novelty 6.0

The AuthGlass dataset and proposed multi-modal models achieve state-of-the-art results on voice liveness detection and user authentication for smart glasses.

A Survey of Large Audio Language Models: Generalization, Trustworthiness, and Outlook

cs.SD · 2026-05-18 · unverdicted · novelty 5.0

A survey of Large Audio Language Models that establishes a taxonomy of trustworthiness vulnerabilities and proposes a Defense-in-Depth roadmap for audio intelligence.

Gender Fairness in Audio Deepfake Detection: Performance and Disparity Analysis

cs.SD · 2026-03-09 · unverdicted · novelty 5.0

Fairness metrics uncover gender disparities in audio deepfake detection error distributions that standard Equal Error Rate metrics obscure.

Advancing Zero-Shot Open-Set Speech Deepfake Source Tracing

eess.AS · 2025-09-29 · unverdicted · novelty 5.0

A zero-shot open-set speech deepfake source tracing framework using adapted SSL-AASIST embeddings and AAM loss achieves EER of 16.43% in OOD trials with cosine scoring, outperforming few-shot alternatives.

Dual-Granularity Orthogonal Disentanglement for Generalizable Audio Deepfake Detection

cs.SD · 2026-06-15 · unverdicted · novelty 4.0

Dual-granularity orthogonal disentanglement framework achieves EERs of 1.35%, 7.88%, and 21.58% on ASVspoof 2019 LA, ASVspoof 2021 DF, and In-the-Wild datasets, outperforming gradient reversal by 2.60% on cross-dataset transfer.

Classical Machine Learning Baselines for Deepfake Audio Detection on the Fake-or-Real Dataset

eess.AS · 2026-04-15 · unverdicted · novelty 3.0

RBF SVM achieves ~93% accuracy and ~7% EER on deepfake audio detection using prosodic and spectral features from the FoR dataset at 44.1 kHz and 16 kHz sampling rates.

citing papers explorer

Showing 3 of 3 citing papers after filters.

Asymmetric Phase Coding Audio Watermarking cs.CR · 2026-05-08 · unverdicted · none · ref 5
APC embeds compact Ed25519 signatures into audio phase data with error correction to achieve 97.5-98.3% cryptographic verification under eight attack types at mean PESQ 3.02.
Split and Conquer Partial Deepfake Speech cs.SD · 2026-04-03 · unverdicted · none · ref 4
A two-stage boundary detection plus segment classification method with multi-length training achieves state-of-the-art results for detecting and localizing partial deepfakes on PartialSpoof and Half-Truth benchmarks.
A Survey of Large Audio Language Models: Generalization, Trustworthiness, and Outlook cs.SD · 2026-05-18 · unverdicted · none · ref 38
A survey of Large Audio Language Models that establishes a taxonomy of trustworthiness vulnerabilities and proposes a Defense-in-Depth roadmap for audio intelligence.

Audio Deepfake detection: A survey

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer