pith. sign in

arxiv: 2410.23850 · v1 · pith:TC5QP4P5new · submitted 2024-10-31 · 💻 cs.CL

The Automated Verification of Textual Claims (AVeriTeC) Shared Task

classification 💻 cs.CL
keywords sharedtaskaveritecclaimsevidenceautomatedscoresubmissions
0
0 comments X
read the original abstract

The Automated Verification of Textual Claims (AVeriTeC) shared task asks participants to retrieve evidence and predict veracity for real-world claims checked by fact-checkers. Evidence can be found either via a search engine, or via a knowledge store provided by the organisers. Submissions are evaluated using AVeriTeC score, which considers a claim to be accurately verified if and only if both the verdict is correct and retrieved evidence is considered to meet a certain quality threshold. The shared task received 21 submissions, 18 of which surpassed our baseline. The winning team was TUDA_MAI with an AVeriTeC score of 63%. In this paper we describe the shared task, present the full results, and highlight key takeaways from the shared task.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. The Warrant Gap: Claim-Conditioned Re-scoring for Fact-Checking

    cs.CL 2026-06 unverdicted novelty 6.0

    Introduces claim-conditioned re-scoring (SIFT) and warranted supports proportion (WSP) metric, reporting accuracy recovery up to 27.6 points and WSP calibration at AUC 0.92 on FEVER, SciFact and other benchmarks.