Deepfake video detection using convolutional vision transformer

· 2021 · arXiv 2102.11126

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

read on arXiv browse 7 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Listening Deepfake Detection: A New Perspective Beyond Speaking-Centric Forgery Analysis

cs.CV · 2026-04-14 · conditional · novelty 7.0

Introduces the LDD task, ListenForge dataset built from five listening head generation methods, and MANet model that detects listening forgeries via motion inconsistencies guided by audio semantics.

Architecture-Adaptive Uncertainty Fusion for Deepfake Detection

cs.CV · 2026-06-04 · unverdicted · novelty 6.0

COF fuses epistemic, aleatoric, calibration, conformal and distributional uncertainties via simplex optimization of Pearson correlation with errors, outperforming alternatives under distribution shift on CelebDF but collapsing with all methods on cross-dataset tests.

Enhancing Self-Supervised Talking Head Forgery Detection via a Training-Free Dual-System Framework

cs.CV · 2026-05-05 · unverdicted · novelty 6.0

A training-free dual-system framework refines anomaly score ordering on uncertain samples from self-supervised talking head forgery detectors to improve detection performance.

LAA-X: Unified Localized Artifact Attention for Quality-Agnostic and Generalizable Face Forgery Detection

cs.CV · 2026-04-05 · unverdicted · novelty 6.0

LAA-X uses multi-task learning with explicit localized artifact attention and blending synthesis to build a deepfake detector that generalizes to high-quality and unseen manipulations after training only on real and pseudo-fake samples.

PVLM: Parsing-Aware Vision Language Model with Dynamic Contrastive Learning for Zero-Shot Deepfake Attribution

cs.CV · 2025-04-19 · unverdicted · novelty 6.0

PVLM combines parsing-aware vision-language modeling with dynamic contrastive learning to enable fine-grained zero-shot attribution of deepfakes to unseen generators and outperforms prior methods on a new benchmark.

MFVLR: Multi-domain Fine-grained Vision-Language Reconstruction for Generalizable Diffusion Face Forgery Detection and Localization

cs.CV · 2026-05-11 · unverdicted · novelty 5.0

MFVLR uses multi-domain vision-language reconstruction with a fine-grained language transformer, multi-domain vision encoder, and vision injection module to achieve generalizable detection and localization of diffusion-synthesized face forgeries.

EMO-BOOST: Emotion-Augmented Audio-Visual Features for Improved Generalization in Deepfake Detection

cs.AI · 2026-05-19 · unverdicted · novelty 4.0

Emo-Boost augments low-level deepfake detectors with intra- and inter-modal emotion consistency checks to raise cross-manipulation generalization AUC by 2.1% on FakeAVCeleb.

citing papers explorer

Showing 2 of 2 citing papers after filters.

LAA-X: Unified Localized Artifact Attention for Quality-Agnostic and Generalizable Face Forgery Detection cs.CV · 2026-04-05 · unverdicted · none · ref 63
LAA-X uses multi-task learning with explicit localized artifact attention and blending synthesis to build a deepfake detector that generalizes to high-quality and unseen manipulations after training only on real and pseudo-fake samples.
MFVLR: Multi-domain Fine-grained Vision-Language Reconstruction for Generalizable Diffusion Face Forgery Detection and Localization cs.CV · 2026-05-11 · unverdicted · none · ref 22
MFVLR uses multi-domain vision-language reconstruction with a fine-grained language transformer, multi-domain vision encoder, and vision injection module to achieve generalizable detection and localization of diffusion-synthesized face forgeries.

Deepfake video detection using convolutional vision transformer

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer