MTA-RL predicts 3D driving affordances from multi-modal sensors with a transformer and uses them as the observation space for an RL policy, yielding better route completion and generalization than baselines in CARLA urban scenarios.
Exploring the limitations of behavior cloning for autonomous driving
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
citation-role summary
background 1
citation-polarity summary
fields
cs.CV 1years
2026 1verdicts
UNVERDICTED 1roles
background 1polarities
background 1representative citing papers
citing papers explorer
-
MTA-RL: Robust Urban Driving via Multi-modal Transformer-based 3D Affordances and Reinforcement Learning
MTA-RL predicts 3D driving affordances from multi-modal sensors with a transformer and uses them as the observation space for an RL policy, yielding better route completion and generalization than baselines in CARLA urban scenarios.