pith:KMSLJJOB
Flow-OPD: On-Policy Distillation for Flow Matching Models
Flow-OPD trains domain-specialized teachers with single-reward GRPO then distills them into one flow-matching student using on-policy sampling and dense supervision.
arxiv:2605.08063 v4 · 2026-05-08 · cs.CV · cs.AI
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{KMSLJJOBIGX44SXX67E3LDWGTF}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Built upon Stable Diffusion 3.5 Medium, Flow-OPD raises the GenEval score from 63 to 92 and the OCR accuracy from 59 to 94, yielding an overall improvement of roughly 10 points over vanilla GRPO, while preserving image fidelity and human-preference alignment and exhibiting an emergent 'teacher-surpassing' effect.
That single-reward GRPO fine-tuning lets each domain-specialized teacher reach its performance ceiling in isolation and that the subsequent three-step orchestration of on-policy sampling, task-routing labeling, and dense supervision can consolidate heterogeneous expertise into one student without reintroducing gradient interference or reward hacking.
Flow-OPD applies on-policy distillation to flow matching models via specialized teachers, cold-start initialization, and manifold anchor regularization, lifting GenEval from 63 to 92 and OCR from 59 to 94 on Stable Diffusion 3.5 Medium.
Formal links
Receipt and verification
| First computed | 2026-05-20T01:05:15.919408Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
5324b4a5c141afce4af7f7c9b58ec6994020700309b366174a913882661e3e33
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/KMSLJJOBIGX44SXX67E3LDWGTF \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 5324b4a5c141afce4af7f7c9b58ec6994020700309b366174a913882661e3e33
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "461631e79734da5e5af6eb58614015c6fe61285d00aa3e34415eae71ee82b715",
"cross_cats_sorted": [
"cs.AI"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CV",
"submitted_at": "2026-05-08T17:50:15Z",
"title_canon_sha256": "17bb8772104084dc225d4714dee66f4d885f7966e4b1abdb0e02a5936f90f450"
},
"schema_version": "1.0",
"source": {
"id": "2605.08063",
"kind": "arxiv",
"version": 4
}
}