EasyChauffeur: A Baseline Advancing Simplicity and Efficiency on Waymax

Jiang-Jiang Liu; Jingdong Wang; Lingyu Xiao; Wankou Yang; Xiaoqing Ye

arxiv: 2408.16375 · v1 · pith:Q6KUBZSQnew · submitted 2024-08-29 · 💻 cs.RO

EasyChauffeur: A Baseline Advancing Simplicity and Efficiency on Waymax

Lingyu Xiao , Jiang-Jiang Liu , Xiaoqing Ye , Wankou Yang , Jingdong Wang This is my paper

classification 💻 cs.RO

keywords efficiencydataevaluationeasychauffeurperformanceplannersrobustnessfindings

0 comments

read the original abstract

Recent advancements in deep-learning-based driving planners have primarily focused on elaborate network engineering, yielding limited improvements. This paper diverges from conventional approaches by exploring three fundamental yet underinvestigated aspects: training policy, data efficiency, and evaluation robustness. We introduce EasyChauffeur, a reproducible and effective planner for both imitation learning (IL) and reinforcement learning (RL) on Waymax, a GPU-accelerated simulator. Notably, our findings indicate that the incorporation of on-policy RL significantly boosts performance and data efficiency. To further enhance this efficiency, we propose SNE-Sampling, a novel method that selectively samples data from the encoder's latent space, substantially improving EasyChauffeur's performance with RL. Additionally, we identify a deficiency in current evaluation methods, which fail to accurately assess the robustness of different planners due to significant performance drops from minor changes in the ego vehicle's initial state. In response, we propose Ego-Shifting, a new evaluation setting for assessing planners' robustness. Our findings advocate for a shift from a primary focus on network architectures to adopting a holistic approach encompassing training strategies, data efficiency, and robust evaluation methods.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Long-term Traffic Simulation via Structured Autoregressive Modeling
cs.AI 2026-06 unverdicted novelty 6.0

RosettaSim adapts frozen LLMs via structured autoregressive modeling of scene topology and agent states to reach SOTA short- and long-term traffic simulation on WOSAC, paired with RTE evaluation that correlates better...
ConsistencyPlanner: Real-time Planning with Fast-Sampling Consistency Models
cs.RO 2026-06 unverdicted novelty 4.0

ConsistencyPlanner applies fast-sampling consistency models for efficient multimodal trajectory generation and attention-based heterogeneous feature fusion to achieve superior safety in Waymax driving simulations.