DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models. CoRR abs/2506.03007

2025 · arXiv 2506.03007

2 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

SynCred-Bench: Benchmarking Synthetic Credibility in AI-Generated Visual Misinformation

cs.CV · 2026-06-02 · unverdicted · novelty 7.0

SynCred-Bench shows that 15 MLLMs reach only 10.5% TPR, open-source detectors under 5%, commercial APIs 57.6%, and humans 63% TPR at 5% FPR when identifying AI-generated images with synthetic credibility.

From Evidence to Verdict: An Agent-Based Forensic Framework for AI-Generated Image Detection

cs.CV · 2025-10-31 · conditional · novelty 6.0

A multi-agent forensic system integrates multiple evidence sources and debate to detect AI-generated images, reporting 97.05% accuracy on a 6,000-image benchmark while outperforming traditional classifiers.

citing papers explorer


SynCred-Bench: Benchmarking Synthetic Credibility in AI-Generated Visual Misinformation · cs.CV · 2026-06-02 · unverdicted · none · ref 16
From Evidence to Verdict: An Agent-Based Forensic Framework for AI-Generated Image Detection · cs.CV · 2025-10-31 · conditional · none · ref 49
