Falsification-driven reinforcement learning for maritime motion planning

Florian Finkeldei; Hanna Krasowski; Marlon Müller; Matthias Althoff; Murat Arcak

arxiv: 2510.06970 · v2 · pith:MHN2LJ3Mnew · submitted 2025-10-08 · eess.SY · cs.LG · cs.SY

keywords: maritime, scenarios, training, agents, approach, compliance, falsification-driven, learning

Compliance with maritime traffic rules is essential for the safe operation of autonomous vessels, yet training reinforcement learning (RL) agents to adhere to them is challenging. The behavior of RL agents is shaped by the training scenarios they encounter, but creating scenarios that capture the complexity of maritime navigation is non-trivial, and real-world data alone is insufficient. To address this, we propose a falsification-driven RL approach that generates adversarial training scenarios in which the vessel under test violates maritime traffic rules, which are expressed as signal temporal logic specifications. Our experiments on open-sea navigation with two vessels demonstrate that the proposed approach provides more relevant training scenarios and achieves more consistent rule compliance.
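
The idea of falsifying a rule written in signal temporal logic can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a single hypothetical rule, "always keep at least d_safe metres from the other vessel", whose robustness is the minimum over time of the distance minus d_safe, and it uses plain random search over the other vessel's start position, heading, and speed. The function names, the constant-velocity motion model, the sampling ranges, and the fixed trajectory of the vessel under test are all assumptions made for illustration; in the approach described above, the vessel under test is an RL agent whose behavior depends on the scenario.

```python
import math
import random


def min_distance_robustness(own_traj, other_traj, d_safe):
    """Robustness of the STL formula G(dist >= d_safe).

    Positive: the rule holds with that margin. Negative: the rule is violated.
    """
    return min(math.dist(p, q) - d_safe for p, q in zip(own_traj, other_traj))


def straight_line(start, heading, speed, steps, dt=1.0):
    """Constant-velocity trajectory (assumed motion model) as a list of (x, y)."""
    x0, y0 = start
    return [
        (x0 + speed * math.cos(heading) * k * dt,
         y0 + speed * math.sin(heading) * k * dt)
        for k in range(steps)
    ]


def falsify(own_traj, d_safe, n_samples=200, seed=0):
    """Random-search falsification: return (robustness, params) with the
    lowest robustness found. Sampling ranges are arbitrary assumptions."""
    rng = random.Random(seed)
    best = None
    for _ in range(n_samples):
        params = (
            rng.uniform(-500.0, 500.0),      # start x of the other vessel [m]
            rng.uniform(-500.0, 500.0),      # start y of the other vessel [m]
            rng.uniform(0.0, 2.0 * math.pi), # heading [rad]
            rng.uniform(1.0, 10.0),          # speed [m/s]
        )
        other = straight_line(params[:2], params[2], params[3], len(own_traj))
        rho = min_distance_robustness(own_traj, other, d_safe)
        if best is None or rho < best[0]:
            best = (rho, params)
    return best


if __name__ == "__main__":
    own = straight_line((0.0, 0.0), 0.0, 5.0, steps=60)
    rho, params = falsify(own, d_safe=50.0)
    print("lowest robustness found:", rho)
    print("scenario parameters:", params)
```

Scenarios with negative robustness are the ones in which the rule is violated; in a falsification-driven training loop, such scenarios would be the candidates to feed back as training scenarios.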
