Dragan, Shankar Sastry, and Sanjit A

Sadigh, D · 2017 · DOI 10.15607/rss.2017.xiii.053

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

Beyond Monotonic Progress: Retry-Supervised Value Learning for Robot Imitation

cs.RO · 2026-06-23 · unverdicted · novelty 6.0

ReTVL uses retry events as sparse supervision to train mistake-sensitive value functions that reweight demonstration chunks for improved behavior cloning on real-robot manipulation tasks.

Robots That Know What to Ask: Recovering Misaligned Rewards through Targeted Explanations

cs.RO · 2026-05-21 · unverdicted · novelty 6.0

Robots detect underspecified reward features via demonstration variation and query targeted natural language explanations to improve reward recovery from imperfect demos.

citing papers explorer

Showing 2 of 2 citing papers.

Beyond Monotonic Progress: Retry-Supervised Value Learning for Robot Imitation cs.RO · 2026-06-23 · unverdicted · none · ref 39
ReTVL uses retry events as sparse supervision to train mistake-sensitive value functions that reweight demonstration chunks for improved behavior cloning on real-robot manipulation tasks.
Robots That Know What to Ask: Recovering Misaligned Rewards through Targeted Explanations cs.RO · 2026-05-21 · unverdicted · none · ref 40
Robots detect underspecified reward features via demonstration variation and query targeted natural language explanations to improve reward recovery from imperfect demos.

Dragan, Shankar Sastry, and Sanjit A

fields

years

verdicts

representative citing papers

citing papers explorer