The Automated Verification of Textual Claims (AVeriTeC) Shared Task

Andreas Vlachos; Arpit Mittal; Chenxi Whitehouse; Christos Christodoulopoulos; James Thorne; Michael Schlichtkrull; Mubashara Akhtar; Oana Cocarascu; Rami Aly; Yulong Chen

arxiv: 2410.23850 · v1 · pith:TC5QP4P5new · submitted 2024-10-31 · 💻 cs.CL

Michael Schlichtkrull, Yulong Chen, Chenxi Whitehouse, Zhenyun Deng, Mubashara Akhtar, Rami Aly, Zhijiang Guo, Christos Christodoulopoulos, Oana Cocarascu, Arpit Mittal, James Thorne, Andreas Vlachos

classification 💻 cs.CL

keywords: shared task, averitec, claims, evidence, automated, score, submissions

The Automated Verification of Textual Claims (AVeriTeC) shared task asks participants to retrieve evidence and predict veracity for real-world claims checked by fact-checkers. Evidence can be found either via a search engine or via a knowledge store provided by the organisers. Submissions are evaluated using the AVeriTeC score, which considers a claim to be accurately verified if and only if the verdict is correct and the retrieved evidence meets a certain quality threshold. The shared task received 21 submissions, 18 of which surpassed our baseline. The winning team was TUDA_MAI, with an AVeriTeC score of 63%. In this paper we describe the shared task, present the full results, and highlight key takeaways.
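
The scoring rule in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the organisers' scorer. It assumes each claim already carries a precomputed evidence quality score in [0, 1], that "meeting" the threshold means greater than or equal to it, and that the threshold is passed in by the caller. The names `Prediction` and `averitec_style_score`, and the example labels, are hypothetical.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class Prediction:
    predicted_label: str   # verdict produced by the system
    gold_label: str        # verdict assigned by the fact-checkers
    evidence_score: float  # assumed: precomputed evidence quality in [0, 1]


def averitec_style_score(predictions: Sequence[Prediction], threshold: float) -> float:
    """Fraction of claims with a correct verdict AND evidence at or above the threshold."""
    if not predictions:
        return 0.0
    hits = sum(
        1
        for p in predictions
        # Both conditions must hold; a correct verdict with weak evidence scores nothing.
        if p.predicted_label == p.gold_label and p.evidence_score >= threshold
    )
    return hits / len(predictions)


example = [
    Prediction("Supported", "Supported", 0.8),  # counted
    Prediction("Refuted", "Refuted", 0.1),      # right verdict, evidence too weak
    Prediction("Refuted", "Supported", 0.9),    # good evidence, wrong verdict
    Prediction("Refuted", "Refuted", 0.5),      # counted (score equals the threshold)
]
print(averitec_style_score(example, threshold=0.5))  # prints 0.5
```

Because the two conditions are joined, improving verdict accuracy alone cannot raise the score for claims whose evidence falls below the threshold, and vice versa.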

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

The Warrant Gap: Claim-Conditioned Re-scoring for Fact-Checking
cs.CL 2026-06 unverdicted novelty 6.0

Introduces claim-conditioned re-scoring (SIFT) and the warranted supports proportion (WSP) metric, reporting accuracy recovery of up to 27.6 points and WSP calibration at AUC 0.92 on FEVER, SciFact, and other benchmarks.