pith:ERNFVBFK
Model Spec Midtraining: Improving How Alignment Training Generalizes
Training models on synthetic documents about their Model Spec before alignment fine-tuning shapes how they generalize from later examples.
arxiv:2605.02087 v2 · 2026-05-03 · cs.AI
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{ERNFVBFKEPMLFXMRQUEW3VDMX5}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
Applying Model Spec Midtraining (MSM) with a spec addressing self-preservation and goal-guarding substantially reduces the agentic misalignment rate (Qwen3-32B: 54% to 7%), beating a deliberative alignment baseline (14%).
That training on synthetic documents discussing the Model Spec will reliably encode the intended generalizations into the model without introducing new unintended behaviors or degrading other capabilities.
Model spec midtraining trains AI models on documents about their alignment rules before demonstration fine-tuning, producing stronger and more controllable generalization to the intended values and safety behaviors.
Receipt and verification
| First computed | 2026-05-25T02:01:21.550661Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
245a5a84aa23d8b2dd9185096dd46cbf4cfff3a210b0efb857f6bce56b92d806
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/ERNFVBFKEPMLFXMRQUEW3VDMX5 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 245a5a84aa23d8b2dd9185096dd46cbf4cfff3a210b0efb857f6bce56b92d806
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "2e0aed015bf1a11ff1a2707dd39ac6cec235d990c6266eb21325da8d7d016f11",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.AI",
    "submitted_at": "2026-05-03T23:16:14Z",
    "title_canon_sha256": "1932e8f8bef3c64a24772dfe65d8518d6aa60796de036002fd399059937a2d6e"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.02087",
    "kind": "arxiv",
    "version": 2
  }
}
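The same check can be sketched offline in Python, without curl or jq, by hashing the canonical record shown above. This is a minimal sketch that assumes the canonicalization is exactly what the one-liner does (sorted keys, compact separators, no ASCII escaping, UTF-8 bytes) and that the record printed on this page is a faithful copy of the served one. The helper name `canonical_sha256` is hypothetical and not part of any Pith tooling.

```python
import hashlib
import json

# Canonical record copied from the "Canonical record JSON" section above.
RECORD_TEXT = """
{
  "metadata": {
    "abstract_canon_sha256": "2e0aed015bf1a11ff1a2707dd39ac6cec235d990c6266eb21325da8d7d016f11",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.AI",
    "submitted_at": "2026-05-03T23:16:14Z",
    "title_canon_sha256": "1932e8f8bef3c64a24772dfe65d8518d6aa60796de036002fd399059937a2d6e"
  },
  "schema_version": "1.0",
  "source": {"id": "2605.02087", "kind": "arxiv", "version": 2}
}
"""

# Hash published on this page under "Canonical hash".
EXPECTED = "245a5a84aa23d8b2dd9185096dd46cbf4cfff3a210b0efb857f6bce56b92d806"


def canonical_sha256(record: dict) -> str:
    """Hypothetical helper: SHA-256 of the sorted-key, compact JSON encoding.

    Assumption: mirrors the json.dumps arguments used in the shell one-liner.
    """
    canonical = json.dumps(
        record, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


record = json.loads(RECORD_TEXT)
digest = canonical_sha256(record)
print(digest)
print("match" if digest == EXPECTED else "mismatch")
```

Because keys are sorted and whitespace is removed before hashing, the indentation and key order of the pasted JSON do not affect the digest. A mismatch would suggest either that the pasted record differs from the served one or that the canonicalization assumption above does not hold.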