pith. sign in

arxiv: 2503.20654 · v4 · submitted 2025-03-26 · 💻 cs.CV · cs.AI

AccidentSim: Generating Vehicle Collision Videos with Physically Realistic Collision Trajectories from Real-World Accident Reports

Pith reviewed 2026-05-22 22:22 UTC · model grok-4.3

classification 💻 cs.CV cs.AI
keywords vehicle collision video generationphysical trajectory simulationaccident reportslanguage model fine-tuningNeRF renderingpost-collision dynamicsautonomous driving data
0
0 comments X

The pith

AccidentSim generates vehicle collision videos with physically realistic trajectories from real-world accident reports.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a framework to create rare vehicle accident videos needed for autonomous driving research. It pulls physical clues and context from existing accident reports to drive a physical simulator that produces accurate post-collision vehicle paths and assembles these into a dedicated trajectory dataset. The dataset fine-tunes a language model so it can predict consistent trajectories from new text descriptions of driving scenarios. These trajectories are then merged with high-quality scene backgrounds rendered by Neural Radiance Fields to produce complete videos. The resulting videos are shown to match both visual appearance and physical behavior of real collisions better than prior generation methods.

Core claim

AccidentSim extracts physical and contextual details from real-world accident reports, feeds them into a reliable physical simulator to replicate post-collision trajectories, assembles the results into a training dataset, fine-tunes a language model to predict trajectories from user prompts, and combines the trajectories with Neural Radiance Fields backgrounds to output collision videos that exhibit both visual and physical authenticity.

What carries the argument

The AccidentSim pipeline: physical simulator that turns report data into trajectory dataset, language-model fine-tuning for prompt-based prediction, and NeRF rendering to composite foreground vehicles with backgrounds.

If this is right

  • A new dataset of physically consistent collision trajectories becomes available from existing accident reports.
  • Language models can be prompted to generate realistic post-collision behavior across varied driving scenarios.
  • Generated videos supply training and testing material for autonomous driving systems on rare collision events.
  • The method separates trajectory physics from visual rendering, allowing independent improvement of each part.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same report-to-trajectory step could be applied to other accident types or vehicle classes if suitable reports exist.
  • Fine-tuned models might support interactive scenario editing inside driving simulators.
  • The approach could reduce reliance on expensive real-world crash data collection for edge-case testing.

Load-bearing premise

The physical clues and contextual information in real-world accident reports are sufficient for a reliable physical simulator to accurately replicate post-collision trajectories without needing additional real sensor data or validation measurements.

What would settle it

Direct comparison of the simulator-generated vehicle positions, velocities, and orientations over time against high-precision measurements from instrumented crash tests or detailed forensic reconstructions of the same reported accidents; large systematic deviations would falsify the replication claim.

Figures

Figures reproduced from arXiv: 2503.20654 by Longfei Han, Qiang Qu, Qian Zhang, Weidong Cai, Xiangwen Zhang, Xiaoming Chen.

Figure 1
Figure 1. Figure 1: AccidentSim generates vehicle collision videos from user-provided accident descriptions, producing physically realistic collision [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Architecture of AccidentSim: AccidentSim extracts physical cues, such as vehicle speeds and collision types, along with contex [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Videos generated by AccidentSim in comparison with baseline methods. For a fair comparison, all methods are given the same [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Vehicle collision scenarios across four typical road types. The results show that AccidentSim generates vehicle collision videos [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: The process of extracting physical clues and contextual information from accident reports. [PITH_FULL_IMAGE:figures/full_fig_p012_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Demonstration of physical simulations in CARLA for vehicle collisions. [PITH_FULL_IMAGE:figures/full_fig_p012_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: The collision videos generated by traditional methods such as ChatSim [ [PITH_FULL_IMAGE:figures/full_fig_p013_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Prompt used for GPT-4o evaluation [PITH_FULL_IMAGE:figures/full_fig_p014_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Results of human evaluation. E.2. GPT-4o Evaluation The process of the GPT-4o evaluation, mentioned in Sec￾tion 4.3 of the paper, is detailed below. We followed Phys￾Gen3D [6] and employed GPT-4o as a multimodal evaluator to quantitatively assess the quality of the generated vehicle collision videos. This AI-based evaluation serves as an au￾tomated counterpart to human judgment, offering an addi￾tional lay… view at source ↗
read the original abstract

Collecting real-world vehicle accident videos for autonomous driving research is challenging due to their rarity and complexity. While existing driving video generation methods may produce visually realistic videos, they often fail to deliver physically realistic simulations because they lack the capability to generate accurate post-collision trajectories. In this paper, we introduce AccidentSim, a novel framework that generates physically realistic vehicle collision videos by extracting and utilizing the physical clues and contextual information available in real-world vehicle accident reports. Specifically, AccidentSim leverages a reliable physical simulator to replicate post-collision vehicle trajectories from the physical and contextual information in the accident reports and to build a vehicle collision trajectory dataset. This dataset is then used to fine-tune a language model, enabling it to respond to user prompts and predict physically consistent post-collision trajectories across various driving scenarios based on user descriptions. Finally, we employ Neural Radiance Fields (NeRF) to render high-quality backgrounds, merging them with the foreground vehicles that exhibit physically realistic trajectories to generate vehicle collision videos. Experimental results demonstrate that the videos produced by AccidentSim excel in both visual and physical authenticity.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 0 minor

Summary. The paper introduces AccidentSim, a pipeline that extracts physical and contextual clues from real-world accident reports, feeds them into a physical simulator to generate post-collision trajectories and a corresponding dataset, fine-tunes a language model to predict trajectories from text prompts, and composites the resulting vehicle motions with NeRF-rendered backgrounds to produce collision videos. The central claim is that the resulting videos exhibit superior visual and physical authenticity compared to prior driving-video generation methods.

Significance. If the physical-authenticity claim is substantiated with quantitative validation, the work would address a clear gap in synthetic data generation for autonomous-driving research by supplying collision scenarios whose post-impact dynamics are derived from real reports rather than purely learned or heuristic motion models.

major comments (2)
  1. [Abstract / Experimental Results] Abstract and Experimental Results section: the assertion that 'experimental results demonstrate that the videos produced by AccidentSim excel in both visual and physical authenticity' is unsupported; the manuscript supplies no quantitative metrics (e.g., trajectory error, velocity RMSE, collision impulse consistency), no baseline comparisons, and no description of how physical realism was measured or validated against any ground-truth post-collision sensor data.
  2. [Methods] Methods / Pipeline description: the claim that trajectories generated by the external simulator from report-derived clues are 'physically realistic' is load-bearing for the entire contribution, yet the text provides no ground-truth post-collision measurements (final positions, velocities, or contact forces) against which simulator outputs could be compared; any systematic mismatch between the simulator's multi-body contact model and real deformation/friction therefore remains invisible to both the LM fine-tuning stage and the downstream evaluation.

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for the constructive feedback on the need for stronger validation of physical realism. We address each major comment below and outline planned revisions.

read point-by-point responses
  1. Referee: [Abstract / Experimental Results] Abstract and Experimental Results section: the assertion that 'experimental results demonstrate that the videos produced by AccidentSim excel in both visual and physical authenticity' is unsupported; the manuscript supplies no quantitative metrics (e.g., trajectory error, velocity RMSE, collision impulse consistency), no baseline comparisons, and no description of how physical realism was measured or validated against any ground-truth post-collision sensor data.

    Authors: We agree the current manuscript lacks quantitative metrics, baseline comparisons, and explicit measurement protocols for physical realism; the Experimental Results section relies on qualitative video examples. We will revise the abstract to remove the unsupported claim and add a new subsection describing evaluation methodology, including trajectory consistency metrics against the simulator outputs and visual quality scores, plus comparisons to prior video generation methods. revision: yes

  2. Referee: [Methods] Methods / Pipeline description: the claim that trajectories generated by the external simulator from report-derived clues are 'physically realistic' is load-bearing for the entire contribution, yet the text provides no ground-truth post-collision measurements (final positions, velocities, or contact forces) against which simulator outputs could be compared; any systematic mismatch between the simulator's multi-body contact model and real deformation/friction therefore remains invisible to both the LM fine-tuning stage and the downstream evaluation.

    Authors: The simulator generates trajectories from parameters extracted from accident reports using established multi-body physics. We will revise the Methods section to explicitly note the absence of direct ground-truth post-collision measurements in the source reports and to discuss the simulator's modeling assumptions as a limitation. We will also add any feasible indirect checks, such as consistency with reported final vehicle positions where available in the reports. revision: partial

standing simulated objections not resolved
  • Absence of ground-truth post-collision sensor data (positions, velocities, forces) in real-world accident reports, which prevents direct quantitative validation of simulator outputs against reality.

Circularity Check

0 steps flagged

No circularity in derivation chain

full rationale

The paper describes an engineering pipeline that extracts clues from accident reports, feeds them to an external physical simulator to generate trajectories, fine-tunes a language model on the resulting dataset, and renders with NeRF. No equations, parameter-fitting steps, or self-referential derivations appear in the abstract or described structure. The physical-authenticity claim rests on the simulator's external validity rather than any internal reduction of outputs to inputs by construction. No self-citation load-bearing steps or ansatz smuggling are present.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

The central claim rests on the unstated premise that accident reports contain enough quantitative physical information to drive an accurate simulator and that the resulting trajectories generalize when used to fine-tune the language model; no explicit free parameters, axioms, or invented entities are named in the abstract.

pith-pipeline@v0.9.0 · 5733 in / 1083 out tokens · 40026 ms · 2026-05-22T22:22:28.701696+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Learning physically grounded traffic accident reconstruction from public accident reports

    cs.LG 2026-04 unverdicted novelty 6.0

    A multimodal learning model with a new dataset of 6,217 cases reconstructs lane-consistent pre-impact motion and collision interactions from public accident reports, outperforming baselines in accuracy and consistency.

Reference graph

Works this paper leans on

46 extracted references · 46 canonical work pages · cited by 1 Pith paper · 3 internal anchors

  1. [1]

    Nmvccs (2005-2007) search

    National Highway Traffic Safety Administration. Nmvccs (2005-2007) search. 2016. Retrieved November 15, 2023 fromhttps : / / crashviewer . nhtsa . dot . gov / LegacyNMVCCS/Search. 1, 4

  2. [2]

    Cosmos World Foundation Model Platform for Physical AI

    Niket Agarwal, Arslan Ali, Maciej Bala, Yogesh Balaji, Erik Barker, Tiffany Cai, Prithvijit Chattopadhyay, Yongxin Chen, Yin Cui, Yifan Ding, et al. Cosmos world foun- dation model platform for physical ai.arXiv preprint arXiv:2501.03575, 2025. 1, 6

  3. [3]

    Uncertainty-based traffic accident anticipation with spatio-temporal relational learn- ing

    Wentao Bao, Qi Yu, and Yu Kong. Uncertainty-based traffic accident anticipation with spatio-temporal relational learn- ing. InACM Multimedia Conference, 2020. 3

  4. [4]

    Jimeng 3.0

    ByteDance. Jimeng 3.0. 2025.https://jimeng. jianying.com. 1, 6, 7

  5. [5]

    nuscenes: A multi- modal dataset for autonomous driving

    Holger Caesar, Varun Bankiti, Alex H Lang, Sourabh V ora, Venice Erin Liong, Qiang Xu, Anush Krishnan, Yu Pan, Gi- ancarlo Baldan, and Oscar Beijbom. nuscenes: A multi- modal dataset for autonomous driving. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 11621–11631, 2020. 1

  6. [6]

    Physgen3d: Crafting a miniature interactive world from a single image

    Boyuan Chen, Hanxiao Jiang, Shaowei Liu, Saurabh Gupta, Yunzhu Li, Hao Zhao, and Shenlong Wang. Physgen3d: Crafting a miniature interactive world from a single image. InProceedings of the Computer Vision and Pattern Recogni- tion Conference, pages 6178–6189, 2025. 6, 7, 14

  7. [7]

    Carla scenario runner, 2019

    Scenario Runner Contributors. Carla scenario runner, 2019. 6, 7

  8. [8]

    The cityscapes dataset for semantic urban scene understanding

    Marius Cordts, Mohamed Omran, Sebastian Ramos, Timo Rehfeld, Markus Enzweiler, Rodrigo Benenson, Uwe Franke, Stefan Roth, and Bernt Schiele. The cityscapes dataset for semantic urban scene understanding. InProceed- ings of the IEEE conference on computer vision and pattern recognition, pages 3213–3223, 2016. 1

  9. [9]

    Wenhao Ding, Minjun Xu, and Ding Zhao. Learning to collide: An adaptive safety-critical scenarios generating method.2020 IEEE/RSJ International Conference on Intel- ligent Robots and Systems (IROS), pages 2243–2250, 2020. 6, 7

  10. [10]

    CARLA: An open urban driving simulator

    Alexey Dosovitskiy, German Ros, Felipe Codevilla, Antonio Lopez, and Vladlen Koltun. CARLA: An open urban driving simulator. InProceedings of the 1st Annual Conference on Robot Learning, pages 1–16, 2017. 2, 11, 13

  11. [11]

    The Llama 3 Herd of Models

    Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Ab- hishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, et al. The llama 3 herd of models.arXiv preprint arXiv:2407.21783,

  12. [12]

    Scenic: a language for scenario specification and scene generation

    Daniel J Fremont, Tommaso Dreossi, Shromona Ghosh, Xi- angyu Yue, Alberto L Sangiovanni-Vincentelli, and Sanjit A Seshia. Scenic: a language for scenario specification and scene generation. InProceedings of the 40th ACM SIGPLAN conference on programming language design and implemen- tation, pages 63–78, 2019. 15

  13. [13]

    Magicdrive: Street view generation with diverse 3d geometry control.arXiv preprint arXiv:2310.02601, 2023

    Ruiyuan Gao, Kai Chen, Enze Xie, Lanqing Hong, Zhenguo Li, Dit-Yan Yeung, and Qiang Xu. Magicdrive: Street view generation with diverse 3d geometry control.arXiv preprint arXiv:2310.02601, 2023. 3

  14. [14]

    Vision meets robotics: The kitti dataset.The Inter- national Journal of Robotics Research, 32(11):1231–1237,

    Andreas Geiger, Philip Lenz, Christoph Stiller, and Raquel Urtasun. Vision meets robotics: The kitti dataset.The Inter- national Journal of Robotics Research, 32(11):1231–1237,

  15. [15]

    Sovar: Building generalizable sce- narios from accident reports for autonomous driving testing

    An Guo, Yuan Zhou, Haoxiang Tian, Chunrong Fang, Yun- jian Sun, Weisong Sun, Xinyu Gao, Anh Tuan Luu, Yang Liu, and Zhenyu Chen. Sovar: Building generalizable sce- narios from accident reports for autonomous driving testing. arXiv preprint arXiv:2409.08081, 2024. 5, 11, 12, 13

  16. [16]

    Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor

    Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, and Sergey Levine. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. InInterna- tional conference on machine learning, pages 1861–1870. PMLR, 2018. 15

  17. [17]

    Autovfx: Physically realistic video edit- ing from natural language instructions.arXiv preprint arXiv:2411.02394, 2024

    Hao-Yu Hsu, Zhi-Hao Lin, Albert Zhai, Hongchi Xia, and Shenlong Wang. Autovfx: Physically realistic video edit- ing from natural language instructions.arXiv preprint arXiv:2411.02394, 2024. 6, 7, 8

  18. [18]

    LoRA: Low-Rank Adaptation of Large Language Models

    Edward J Hu, Yelong Shen, Phillip Wallis, Zeyuan Allen- Zhu, Yuanzhi Li, Shean Wang, Lu Wang, and Weizhu Chen. Lora: Low-rank adaptation of large language models.arXiv preprint arXiv:2106.09685, 2021. 5

  19. [19]

    Vbench: Comprehensive bench- mark suite for video generative models

    Ziqi Huang, Yinan He, Jiashuo Yu, Fan Zhang, Chenyang Si, Yuming Jiang, Yuanhan Zhang, Tianxing Wu, Qingyang Jin, Nattapol Chanpaisit, et al. Vbench: Comprehensive bench- mark suite for video generative models. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 21807–21818, 2024. 6, 7

  20. [20]

    Ac3r: auto- matically reconstructing car crashes from police reports

    Tri Huynh, Alessio Gambi, and Gordon Fraser. Ac3r: auto- matically reconstructing car crashes from police reports. In 2019 IEEE/ACM 41st International Conference on Software Engineering: Companion Proceedings (ICSE-Companion), pages 31–34. IEEE, 2019. 11, 12

  21. [21]

    Kling 2.0, 2025.https : / / kling

    Kuaishou. Kling 2.0, 2025.https : / / kling . kuaishou.com/en. 1, 6, 7

  22. [22]

    Drivingdif- fusion: Layout-guided multi-view driving scene video generation with latent diffusion model.arXiv preprint arXiv:2310.07771, 2023

    Xiaofan Li, Yifu Zhang, and Xiaoqing Ye. Drivingdif- fusion: Layout-guided multi-view driving scene video generation with latent diffusion model.arXiv preprint arXiv:2310.07771, 2023. 3

  23. [23]

    Curse of rarity for autonomous vehicles.nature communications, 15(1):4808, 2024

    Henry X Liu and Shuo Feng. Curse of rarity for autonomous vehicles.nature communications, 15(1):4808, 2024. 1

  24. [24]

    Neuroncap: Photorealistic closed- loop safety testing for autonomous driving

    William Ljungbergh, Adam Tonderski, Joakim Johnan- der, Holger Caesar, Kalle ˚Astr¨om, Michael Felsberg, and Christoffer Petersson. Neuroncap: Photorealistic closed- loop safety testing for autonomous driving. InEuropean Conference on Computer Vision, pages 161–177. Springer,

  25. [25]

    Roadrunner

    MathWorks. Roadrunner. 2020.https : / / www . mathworks.com/products/roadrunner.html. 5

  26. [26]

    OpenAI. Gpt-4o. 2024.https://openai.com/ index/hello-gpt-4o/. 6

  27. [27]

    Generating useful accident-prone driv- ing scenarios via a learned traffic prior

    Davis Rempe, Jonah Philion, Leonidas J Guibas, Sanja Fi- dler, and Or Litany. Generating useful accident-prone driv- ing scenarios via a learned traffic prior. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 17305–17315, 2022. 3 9

  28. [28]

    Runway. Gen-4. 2025.https://runwayml.com/ research/introducing-runway-gen-4. 1, 6, 7

  29. [29]

    Cadp: A novel dataset for cctv traffic camera based accident analysis

    Ankit Parag Shah, Jean-Bapstite Lamare, Tuan Nguyen-Anh, and Alexander Hauptmann. Cadp: A novel dataset for cctv traffic camera based accident analysis. In2018 15th IEEE In- ternational Conference on Advanced Video and Signal Based Surveillance (AVSS), pages 1–9, 2018. 3

  30. [30]

    Scalability in perception for autonomous driving: Waymo open dataset

    Pei Sun, Henrik Kretzschmar, Xerxes Dotiwalla, Aurelien Chouard, Vijaysai Patnaik, Paul Tsui, James Guo, Yin Zhou, Yuning Chai, Benjamin Caine, et al. Scalability in perception for autonomous driving: Waymo open dataset. InProceed- ings of the IEEE/CVF conference on computer vision and pattern recognition, pages 2446–2454, 2020. 1, 6

  31. [31]

    Language conditioned traffic generation.arXiv preprint arXiv:2307.07947, 2023

    Shuhan Tan, Boris Ivanovic, Xinshuo Weng, Marco Pavone, and Philipp Kraehenbuehl. Language conditioned traffic generation.arXiv preprint arXiv:2307.07947, 2023. 3

  32. [32]

    Advsim: Generating safety-critical scenar- ios for self-driving vehicles.2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 9904–9913, 2021

    Jingkang Wang, Ava Pun, James Tu, Sivabalan Mani- vasagam, Abbas Sadat, Sergio Casas, Mengye Ren, and Raquel Urtasun. Advsim: Generating safety-critical scenar- ios for self-driving vehicles.2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 9904–9913, 2021. 6, 7

  33. [33]

    Driving into the future: Multiview visual forecasting and planning with world model for au- tonomous driving

    Yuqi Wang, Jiawei He, Lue Fan, Hongxin Li, Yuntao Chen, and Zhaoxiang Zhang. Driving into the future: Multiview visual forecasting and planning with world model for au- tonomous driving. InProceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition, pages 14749–14759, 2024. 1, 3

  34. [34]

    Editable scene simulation for autonomous driving via collaborative llm-agents

    Yuxi Wei, Zi Wang, Yifan Lu, Chenxin Xu, Changxing Liu, Hao Zhao, Siheng Chen, and Yanfeng Wang. Editable scene simulation for autonomous driving via collaborative llm-agents. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 15077– 15087, 2024. 5, 7, 13

  35. [35]

    Panacea: Panoramic and controllable video generation for autonomous driving

    Yuqing Wen, Yucheng Zhao, Yingfei Liu, Fan Jia, Yanhui Wang, Chong Luo, Chi Zhang, Tiancai Wang, Xiaoyan Sun, and Xiangyu Zhang. Panacea: Panoramic and controllable video generation for autonomous driving. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 6902–6912, 2024. 1, 3

  36. [36]

    Holodrive: Holistic 2d-3d multi-modal street scene generation for au- tonomous driving.arXiv preprint arXiv:2412.01407, 2024

    Zehuan Wu, Jingcheng Ni, Xiaodong Wang, Yuxin Guo, Rui Chen, Lewei Lu, Jifeng Dai, and Yuwen Xiong. Holodrive: Holistic 2d-3d multi-modal street scene generation for au- tonomous driving.arXiv preprint arXiv:2412.01407, 2024. 1

  37. [37]

    S-nerf: Neural radiance fields for street views.arXiv preprint arXiv:2303.00749, 2023

    Ziyang Xie, Junge Zhang, Wenye Li, Feihu Zhang, and Li Zhang. S-nerf: Neural radiance fields for street views.arXiv preprint arXiv:2303.00749, 2023. 3

  38. [38]

    Safebench: A benchmarking platform for safety evaluation of autonomous vehicles.Advances in Neural Information Processing Systems, 35:25667–25682, 2022

    Chejian Xu, Wenhao Ding, Weijie Lyu, Zuxin Liu, Shuai Wang, Yihan He, Hanjiang Hu, Ding Zhao, and Bo Li. Safebench: A benchmarking platform for safety evaluation of autonomous vehicles.Advances in Neural Information Processing Systems, 35:25667–25682, 2022. 6, 15

  39. [39]

    Bdd100k: A diverse driving dataset for heterogeneous multitask learning

    Fisher Yu, Haofeng Chen, Xin Wang, Wenqi Xian, Yingying Chen, Fangchen Liu, Vashisht Madhavan, and Trevor Dar- rell. Bdd100k: A diverse driving dataset for heterogeneous multitask learning. InProceedings of the IEEE/CVF con- ference on computer vision and pattern recognition, pages 2636–2645, 2020. 1

  40. [40]

    Is- safe: Improving semantic segmentation in accidents by fus- ing event-based data

    Jiaming Zhang, Kailun Yang, and Rainer Stiefelhagen. Is- safe: Improving semantic segmentation in accidents by fus- ing event-based data. In2021 IEEE/RSJ International Con- ference on Intelligent Robots and Systems (IROS), pages 1132–1139, 2021. 3

  41. [41]

    Chatscene: Knowledge-enabled safety-critical scenario generation for autonomous vehicles

    Jiawei Zhang, Chejian Xu, and Bo Li. Chatscene: Knowledge-enabled safety-critical scenario generation for autonomous vehicles. InProceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition, pages 15459–15469, 2024. 6, 7, 15

  42. [42]

    Cat: Closed-loop adversarial training for safe end-to-end driving

    Linrui Zhang, Zhenghao Peng, Quanyi Li, and Bolei Zhou. Cat: Closed-loop adversarial training for safe end-to-end driving. InConference on Robot Learning, pages 2357–

  43. [43]

    On adversarial robustness of tra- jectory prediction for autonomous vehicles

    Qingzhao Zhang, Shengtuo Hu, Jiachen Sun, Qi Alfred Chen, and Z Morley Mao. On adversarial robustness of tra- jectory prediction for autonomous vehicles. InProceedings of the IEEE/CVF Conference on Computer Vision and Pat- tern Recognition, pages 15159–15168, 2022. 6, 7

  44. [44]

    Language-guided traffic simulation via scene-level diffusion

    Ziyuan Zhong, Davis Rempe, Yuxiao Chen, Boris Ivanovic, Yulong Cao, Danfei Xu, Marco Pavone, and Baishakhi Ray. Language-guided traffic simulation via scene-level diffusion. InConference on Robot Learning, pages 144–177. PMLR,

  45. [45]

    Collision model for vehicle motion prediction after light impacts.Vehicle System Dynamics, 46(S1):3–15, 2008

    Jing Zhou, Huei Peng, and Jianbo Lu. Collision model for vehicle motion prediction after light impacts.Vehicle System Dynamics, 46(S1):3–15, 2008. 8 AccidentSim: Generating Vehicle Collision Videos with Physically Realistic Collision Trajectories from Real-World Accident Reports 10 AccidentSim: Generating Vehicle Collision Videos with Physically Realistic...

  46. [46]

    Algorithm for Pre-Collision Trajectory Planning The proposed algorithm for pre-collision trajectory plan- ning, mentioned in Section 3.1 of the paper, is detailed in Algorithm 1

    Table of Contents •Pre-Collision Trajectory Planning •Information Extraction from Accident Reports •Physical Simulation of Accident Scenario •Additional Qualitative Analysis of Collision Dynamics •Human and GPT-4o Evaluation •Collision Reduction with AccidentSim A. Algorithm for Pre-Collision Trajectory Planning The proposed algorithm for pre-collision tr...