pith. sign in

Integrity report for WCXB: A Multi-Type Web Content Extraction Benchmark

A machine-verified record of the checks Pith has run against this paper: detector runs, findings, signed bundle events, and canonical identifiers.

arXiv:2605.21097 · pith:2026:JPRDV4QHUEAPVQKEJCZOBNESYG

0Critical
0Advisory
6Detectors run
2026-05-25Last checked

Paper page arXiv integrity.json bundle.json

Detector runs

claim_evidence completed v1.0.0 · findings 0 · 2026-05-25 06:03:43.026291+00:00
doi_title_agreement completed v1.0.0 · findings 0 · 2026-05-25 06:02:41.645340+00:00
doi_compliance completed v1.0.0 · findings 0 · 2026-05-25 05:23:30.348255+00:00
citation_quote_validity completed v0.1.0 · findings 0 · 2026-05-21 09:50:51.505741+00:00
cited_work_retraction completed v1.0.0 · findings 0 · 2026-05-21 07:52:13.769490+00:00
ai_meta_artifact skipped v1.0.0 · findings 0 · 2026-05-21 01:33:59.466463+00:00

Findings

No public integrity findings for this paper.

Signed record

The machine-readable record for this paper lives at /pith/JPRDV4QH/integrity.json. Pith Number bundles also include signed pith.integrity.v1 events where a Pith Number exists.