pith:FYN4RFFB
RT-H: Action Hierarchies Using Language
Predicting fine-grained language descriptions of motions first helps robot policies share structure across diverse tasks and accept language corrections.
arxiv:2403.01823 v2 · 2024-03-04 · cs.RO · cs.AI
Record completeness
Claims
Our method RT-H builds an action hierarchy using language motions: it first learns to predict language motions, and conditioned on this and the high-level task, it predicts actions, using visual context at all stages.
That fine-grained language motion phrases capture shared low-level structure across semantically diverse tasks sufficiently well that predicting them improves downstream action prediction and enables effective language-based correction.
RT-H learns robot policies by first predicting language motions as an intermediate representation and then mapping those plus the high-level task to actions, yielding more robust multi-task performance and the ability to learn from language interventions.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:14.778993Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
2e1bc894a1a9656a528666771ee6447eb469f837ee05bea1f488a7363f760f38
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/FYN4RFFBVFSWUUUGMZ3R5ZSEP2 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 2e1bc894a1a9656a528666771ee6447eb469f837ee05bea1f488a7363f760f38
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "92b344e6172175696b4c494f461c8d89070242449cfbc5c528c263a6933d5a7a",
"cross_cats_sorted": [
"cs.AI"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.RO",
"submitted_at": "2024-03-04T08:16:11Z",
"title_canon_sha256": "6ce889699add44e9d8826eff1f6ba9e286a1dd915a3843ad177b00c773c46637"
},
"schema_version": "1.0",
"source": {
"id": "2403.01823",
"kind": "arxiv",
"version": 2
}
}