pith:Z43LXKWP
A Switching System Theory of Q-Learning with Linear Function Approximation
The mean dynamics of Q-learning with linear function approximation are exactly equivalent to a linear switched system whose stability determines convergence.
arxiv:2605.11021 v2 · 2026-05-10 · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{Z43LXKWPAMDLOHNY4PQIQY6JLV}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
We derive an exact linear switched model for the mean dynamics and relate convergence to stability of the corresponding switched system.
That the mean dynamics of Q-learning with linear function approximation admit an exact representation as a finite set of linear switching modes whose joint spectral radius governs convergence.
Q-learning with linear function approximation is recast as a switched linear system whose mean dynamics converge precisely when the joint spectral radius of the switching matrices is less than one.
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-20T01:05:16.522625Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
cf36bbaacf0306b71db8e3e08863c95d6d665df4fce51a86f60fe5cac9432b80
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/Z43LXKWPAMDLOHNY4PQIQY6JLV \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: cf36bbaacf0306b71db8e3e08863c95d6d665df4fce51a86f60fe5cac9432b80
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "50e911934cfb9d76dc0b2679b68867559464df43c129533a94dbdd78c07fee9c",
"cross_cats_sorted": [],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.LG",
"submitted_at": "2026-05-10T16:21:31Z",
"title_canon_sha256": "1367a8f9dfe7deeb7b6cda9fc40c07ae986d30178b072d47d5f2fd1ecdc94c61"
},
"schema_version": "1.0",
"source": {
"id": "2605.11021",
"kind": "arxiv",
"version": 2
}
}