M4FC: a Multimodal, Multilingual, Multicultural, Multitask Real-World Fact-Checking Dataset

Iryna Gurevych; Jiahui Geng; Jonathan Tonglet

arxiv: 2510.23508 · v3 · pith:3YLKAXGHnew · submitted 2025-10-27 · 💻 cs.CL

Jiahui Geng, Jonathan Tonglet, Iryna Gurevych

keywords: dataset, fact-checking, m4fc, multimodal, prediction, real-world, tasks, available

Existing real-world datasets for multimodal fact-checking have multiple limitations: they contain few instances, cover only one or two languages, focus on only one task, or rely on external news article sets for sourcing true claims. To address these shortcomings, we introduce M4FC, a new real-world dataset comprising 4,982 images paired with 6,980 claims. The images, verified by professional fact-checkers from 22 organizations, represent a diverse range of cultural and geographic contexts. Each claim is available in one or two out of ten languages. M4FC spans six multimodal fact-checking tasks: visual claim extraction, claimant intent prediction, fake image detection, image contextualization, location verification, and verdict prediction. We provide baseline results for all tasks and analyze how combining intermediate tasks affects verdict prediction performance. We make our dataset and code publicly available.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

VeriTaS: The First Dynamic Benchmark for Multimodal Automated Fact-Checking
cs.IR 2026-01 conditional novelty 8.0

VeriTaS is the first dynamic benchmark for multimodal automated fact-checking that updates quarterly with real-world claims and a standardized scoring scheme to resist data leakage.

ReMMD: Realistic Multilingual Multi-Image Agentic Verification for Multimodal Misinformation Detection
cs.AI 2026-06 unverdicted novelty 7.0

ReMMD presents ReMMDBench (500 samples, 2756 images, five languages, five-way veracity) and ReMMD-Agent, which achieves 41.80% accuracy and 39.12% macro-F1 on five-way classification with GPT-5.2 while cutting costs v...