pith. sign in
Pith Number

pith:QVD4L32Z

pith:2026:QVD4L32Z2F5ALHWMTX4IZCJSKQ
not attested not anchored not stored refs pending

Internal Data Repetition Destroys Language Models

Bo He, David Donoho, Jessica Chudnovsky, Joshua Kazdan, Mehmet Donmez, Noam Levi, Rylan Schaeffer, Sanmi Koyejo, Yegor Denisov-Blanch

arxiv:2606.24998 v1 · 2026-06-23 · cs.LG · cs.AI

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{QVD4L32Z2F5ALHWMTX4IZCJSKQ}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.
Receipt and verification
First computed 2026-06-25T00:17:47.875367Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

8547c5ef59d17a059ecc9df88c8932541eb817378dfcf055f3946ab3bf6f7b87

Aliases

arxiv: 2606.24998 · arxiv_version: 2606.24998v1 · doi: 10.48550/arxiv.2606.24998 · pith_short_12: QVD4L32Z2F5A · pith_short_16: QVD4L32Z2F5ALHWM · pith_short_8: QVD4L32Z
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/QVD4L32Z2F5ALHWMTX4IZCJSKQ \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 8547c5ef59d17a059ecc9df88c8932541eb817378dfcf055f3946ab3bf6f7b87
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "01946e2fa0e7047c2ae67b60a0c81192c32dc99b4036e8586f173f4018aec15e",
    "cross_cats_sorted": [
      "cs.AI"
    ],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.LG",
    "submitted_at": "2026-06-23T16:02:40Z",
    "title_canon_sha256": "f315da6f562fb1d59436e1186c94297f650d9bd8b235c1536e56ee49241d251c"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2606.24998",
    "kind": "arxiv",
    "version": 1
  }
}