Pith Number

pith:XHRQXUER

pith:2026:XHRQXUERWOBFBW4D6KTH2VQRN3
not attested · not anchored · not stored · refs pending

EnterpriseRAG-Bench: A RAG Benchmark for Company Internal Knowledge

Chris Weaver, Joachim Rahmfeld, Mark H. Butler, Roshan Desai, Weijia Chen, Wenxi Huang, Yuhong Sun

A new benchmark supplies 500,000 synthetic enterprise documents and 500 questions to evaluate retrieval-augmented generation on company-internal knowledge.

arxiv:2605.05253 v2 · 2026-05-05 · cs.IR

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{XHRQXUERWOBFBW4D6KTH2VQRN3}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open
4 Citations open
5 Replications open
Portable graph bundle: live
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1 · strongest claim

We present EnterpriseRAG-Bench, a dataset consisting of approximately 500,000 documents spanning nine enterprise source types and 500 questions across ten categories that test distinct retrieval and reasoning capabilities.

C2 · weakest assumption

The synthetic corpus with cross-document coherence and added noise such as misfiled documents and conflicting information realistically reflects real company-internal knowledge.

C3 · one line summary

EnterpriseRAG-Bench supplies a synthetic corpus of 500,000 documents across Slack, Gmail, GitHub and other tools plus 500 questions that probe lookup, multi-document reasoning, conflict resolution and absence detection.

Receipt and verification
First computed: 2026-05-21T01:04:26.888551Z
Builder: pith-number-builder-2026-05-17-v1
Signature: Pith Ed25519 (pith-v1-2026-05)
Schema: pith-number/v1.0

Canonical hash

b9e30bd091b38250db83f2a67d56116eef37206988e7773fd50b96880ad73fa4

Aliases

arxiv: 2605.05253 · arxiv_version: 2605.05253v2 · doi: 10.48550/arxiv.2605.05253 · pith_short_12: XHRQXUERWOBF · pith_short_16: XHRQXUERWOBFBW4D · pith_short_8: XHRQXUER
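
The identifiers on this page appear to be related in a simple way: the 26-character Pith Number matches the first 26 characters of the RFC 4648 base32 encoding of the canonical hash, and the `pith_short_*` aliases are prefixes of it. This is an observation from the values in this record, not a documented specification, so the sketch below is illustrative only. The function name `pith_number_from_hash` and the `length` parameter are hypothetical.

```python
import base64

# Canonical hash as shown on this page (hex-encoded SHA-256 digest).
CANONICAL_HASH = "b9e30bd091b38250db83f2a67d56116eef37206988e7773fd50b96880ad73fa4"


def pith_number_from_hash(hex_digest: str, length: int = 26) -> str:
    """Return the first `length` base32 characters of a hex digest.

    Assumption: inferred from this record only; the actual builder
    may define the Pith Number differently.
    """
    digest = bytes.fromhex(hex_digest)
    return base64.b32encode(digest).decode("ascii")[:length]


pith = pith_number_from_hash(CANONICAL_HASH)

# The short aliases listed above are prefixes of the full identifier.
aliases = {f"pith_short_{n}": pith[:n] for n in (8, 12, 16)}

print(pith)
print(aliases)
```

For this record the derived string agrees with the listed Pith Number and with all three short aliases.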
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/XHRQXUERWOBFBW4D6KTH2VQRN3 \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: b9e30bd091b38250db83f2a67d56116eef37206988e7773fd50b96880ad73fa4
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "2667f75ac3234a8db463535b608b0cd23315241544b0840e31ab0d297bc30d4b",
    "cross_cats_sorted": [],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.IR",
    "submitted_at": "2026-05-05T20:23:38Z",
    "title_canon_sha256": "205f9f9c4b51ea56cf5d260f0cd75b11eaf5a8a0065546fbb84c8c93dd8044ff"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.05253",
    "kind": "arxiv",
    "version": 2
  }
}
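
The shell pipeline under "Verify this Pith Number yourself" can also be written as a small Python function, which may be easier to reuse in scripts. The sketch below applies the same serialization as that one-liner (sorted keys, compact separators, no ASCII escaping, UTF-8) and hashes the result with SHA-256. The function name `canonical_hash` is hypothetical, and the record is copied from this page rather than fetched from the API, so any whitespace or field differences in the live record are not accounted for.

```python
import hashlib
import json


def canonical_hash(record: dict) -> str:
    """Serialize a record deterministically and return its SHA-256 hex digest."""
    payload = json.dumps(
        record,
        sort_keys=True,            # key order must not affect the hash
        separators=(",", ":"),     # compact form, no extra whitespace
        ensure_ascii=False,        # keep non-ASCII characters as UTF-8
    ).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


# Canonical record as displayed on this page.
record = {
    "metadata": {
        "abstract_canon_sha256": "2667f75ac3234a8db463535b608b0cd23315241544b0840e31ab0d297bc30d4b",
        "cross_cats_sorted": [],
        "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
        "primary_cat": "cs.IR",
        "submitted_at": "2026-05-05T20:23:38Z",
        "title_canon_sha256": "205f9f9c4b51ea56cf5d260f0cd75b11eaf5a8a0065546fbb84c8c93dd8044ff",
    },
    "schema_version": "1.0",
    "source": {"id": "2605.05253", "kind": "arxiv", "version": 2},
}

digest = canonical_hash(record)
print(digest)  # compare against the value listed under "Canonical hash"
```

Because keys are sorted before hashing, two mirrors that hold the same record with different key order should compute the same digest.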