Fooled Twice: People Cannot Detect Deepfakes but Think They Can

Kobis, N · 2021 · arXiv 2308.14970

9 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Toward Fine-Grained Speech Inpainting Forensics: A Dataset, Method, and Metric for Multi-Region Tampering Localization

cs.SD · 2026-05-04 · unverdicted · novelty 7.0

A new dataset, iterative coarse-to-fine localization framework, and segment-level IoU F1 metric tackle the open problem of detecting multiple unknown word-level inpainted regions in speech.

Indic-CodecFake meets SATYAM: Towards Detecting Neural Audio Codec Synthesized Speech Deepfakes in Indic Languages

eess.AS · 2026-04-21 · unverdicted · novelty 7.0

Introduces the Indic-CodecFake dataset for Indic codec deepfakes and SATYAM, a novel hyperbolic ALM that outperforms baselines through dual-stage semantic-prosodic fusion using Bhattacharyya distance.

ArtifactNet: Detecting AI-Generated Music via Forensic Residual Physics

cs.SD · 2026-04-17 · unverdicted · novelty 7.0

ArtifactNet extracts codec residuals from spectrograms with a 4M-parameter network to detect AI music at F1=0.9829 and 1.49% FPR on unseen tracks from 22 generators, outperforming larger baselines.

Listening Deepfake Detection: A New Perspective Beyond Speaking-Centric Forgery Analysis

cs.CV · 2026-04-14 · conditional · novelty 7.0

Introduces the LDD task, ListenForge dataset built from five listening head generation methods, and MANet model that detects listening forgeries via motion inconsistencies guided by audio semantics.

Asymmetric Phase Coding Audio Watermarking

cs.CR · 2026-05-08 · unverdicted · novelty 6.0

APC embeds compact Ed25519 signatures into audio phase data with error correction to achieve 97.5-98.3% cryptographic verification under eight attack types at mean PESQ 3.02.

Phoneme-Level Deepfake Detection Across Emotional Conditions Using Self-Supervised Embeddings

cs.SD · 2026-05-04 · unverdicted · novelty 6.0

Phoneme-level analysis using self-supervised embeddings identifies higher divergence in complex vowels and fricatives for emotional voice conversion deepfakes, enabling more interpretable detection across emotions.

Split and Conquer Partial Deepfake Speech

cs.SD · 2026-04-03 · unverdicted · novelty 6.0

A two-stage boundary detection plus segment classification method with multi-length training achieves state-of-the-art results for detecting and localizing partial deepfakes on PartialSpoof and Half-Truth benchmarks.

Synthetic Trust Attacks: Modeling How Generative AI Manipulates Human Decisions in Social Engineering Fraud

cs.CR · 2026-04-02 · unverdicted · novelty 6.0

The paper proposes Synthetic Trust Attacks (STAs) as a formal threat model with an eight-stage attack chain (STAM) that shifts defense focus from detecting synthetic media to protecting human decision processes in social engineering.

Classical Machine Learning Baselines for Deepfake Audio Detection on the Fake-or-Real Dataset

eess.AS · 2026-04-15 · unverdicted · novelty 3.0

RBF SVM achieves ~93% accuracy and ~7% EER on deepfake audio detection using prosodic and spectral features from the FoR dataset at 44.1 kHz and 16 kHz sampling rates.

citing papers explorer

Showing 9 of 9 citing papers.

Toward Fine-Grained Speech Inpainting Forensics: A Dataset, Method, and Metric for Multi-Region Tampering Localization
cs.SD · 2026-05-04 · unverdicted · none · ref 11

Indic-CodecFake meets SATYAM: Towards Detecting Neural Audio Codec Synthesized Speech Deepfakes in Indic Languages
eess.AS · 2026-04-21 · unverdicted · none · ref 80

ArtifactNet: Detecting AI-Generated Music via Forensic Residual Physics
cs.SD · 2026-04-17 · unverdicted · none · ref 11

Listening Deepfake Detection: A New Perspective Beyond Speaking-Centric Forgery Analysis
cs.CV · 2026-04-14 · conditional · none · ref 46

Asymmetric Phase Coding Audio Watermarking
cs.CR · 2026-05-08 · unverdicted · none · ref 5

Phoneme-Level Deepfake Detection Across Emotional Conditions Using Self-Supervised Embeddings
cs.SD · 2026-05-04 · unverdicted · none · ref 5

Split and Conquer Partial Deepfake Speech
cs.SD · 2026-04-03 · unverdicted · none · ref 4

Synthetic Trust Attacks: Modeling How Generative AI Manipulates Human Decisions in Social Engineering Fraud
cs.CR · 2026-04-02 · unverdicted · none · ref 11

Classical Machine Learning Baselines for Deepfake Audio Detection on the Fake-or-Real Dataset
eess.AS · 2026-04-15 · unverdicted · none · ref 3
