The High Explosives and Affected Targets (HEAT) Dataset

Bryan Kaiser; Christine Sweeney; David Schodt; Divya Banesh; Jesus Pulido; Kyle Hickmann; Sharmistha Chakrabarti; Soumi De; Sourabh Pandit

arxiv: 2604.18828 · v1 · submitted 2026-04-20 · 💻 cs.LG · physics.comp-ph

The High Explosives and Affected Targets (HEAT) Dataset

Bryan Kaiser , Kyle Hickmann , Sharmistha Chakrabarti , Soumi De , Sourabh Pandit , David Schodt , Jesus Pulido , Divya Banesh

show 1 more author

Christine Sweeney

This is my paper

Pith reviewed 2026-05-10 04:57 UTC · model grok-4.3

classification 💻 cs.LG physics.comp-ph

keywords high explosivesshock dynamicsmulti-material simulationsAI surrogate modelsdatasetdetonation physicsEulerian simulations

0 comments

The pith

This work introduces the HEAT dataset of two-dimensional simulations to train AI surrogate models for high-explosive-driven multi-material shock dynamics.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that no public datasets exist for training AI models on high-explosive shock propagation in multiple materials. It supplies the HEAT dataset, a collection of cylindrically symmetric simulations using an Eulerian code. The dataset includes time series of thermodynamic and kinematic fields for various materials and geometries, capturing shock propagation, detonation, and plastic deformation. This addresses the gap by providing a benchmark for computationally efficient AI alternatives to full-physics simulations.

Core claim

The HEAT dataset consists of two partitions: expanding shock-cylinder simulations spanning metals, polymers, water, gases, and detonating materials, and perturbed layered interface simulations with varied geometries using fixed materials like copper, aluminum, and high explosives. Each entry provides time series of pressure, density, temperature, position, velocity, and stress fields.

What carries the argument

The HEAT dataset, partitioned into CYL and PLI simulations generated with a multi-material Eulerian shock-propagation code.

If this is right

AI models can be trained to predict shock dynamics without running expensive full-physics simulations.
Models can learn material-specific behaviors across metals, polymers, and explosives from the provided time series data.
The dataset enables validation of surrogate models on phenomena like momentum transfer and thermal effects.
Researchers gain a public resource for developing generalizable AI approaches to multi-material interactions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Such a dataset could support extensions to three-dimensional simulations if computational resources allow.
Trained models might eventually guide experimental design in explosive testing by predicting outcomes quickly.
Integration with real experimental data could improve model accuracy for practical applications.

Load-bearing premise

The underlying simulations accurately capture the essential physics of shock propagation, plasticity, and detonation.

What would settle it

A direct comparison showing that AI models trained on HEAT fail to match outcomes from independent full-physics simulations or physical experiments would indicate the dataset is insufficient.

Figures

Figures reproduced from arXiv: 2604.18828 by Bryan Kaiser, Christine Sweeney, David Schodt, Divya Banesh, Jesus Pulido, Kyle Hickmann, Sharmistha Chakrabarti, Soumi De, Sourabh Pandit.

**Figure 2.** Figure 2: The evolution of PLI simulation id00015. Note the complex baroclinic jet geometry of the polymer cushion, Al striker, and Cu throw that forms around the axis of symmetry, the plastic deformation of the stainless steel case, and the reflected shock waves within the high explosive. 4 [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: An example of the Expanding Shock-Cylinder at [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗

**Figure 4.** Figure 4: The evolution of CYL simulation id00433. The detonation in the upper part of the high explosive near the axis of symmetry causes the high explosive booster to move outward into the air background, while shock waves within the aluminum cylinder wall create plastic deformation along the upper surface of the aluminum cylinder. 6 [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗

read the original abstract

Artificial Intelligence (AI) surrogate models provide a computationally efficient alternative to full-physics simulations, but no public datasets currently exist for training and validating models of high-explosive-driven, multi-material shock dynamics. Simulating shock propagation is challenging due to the need for material-specific equations of state (EOS) and models of plasticity, phase change, damage, fluid instabilities, and multi-material interactions. Explosive-driven shocks further require reactive material models to capture detonation physics. To address this gap, we introduce the High-Explosives and Affected Targets (HEAT) dataset, a physics-rich collection of two-dimensional, cylindrically symmetric simulations generated using an Eulerian multi-material shock-propagation code developed at Los Alamos National Laboratory. HEAT consists of two partitions: expanding shock-cylinder (CYL) simulations and Perturbed Layered Interface (PLI) simulations. Each entry includes time series of thermodynamic fields (pressure, density, temperature), kinematic fields (position, velocity), and continuum quantities such as stress. The CYL partition spans a range of materials, including metals (aluminum, copper, depleted uranium, stainless steel, tantalum), a polymer, water, gases (air, nitrogen), and a detonating material. The PLI partition explores varied geometries with fixed materials: copper, aluminum, stainless steel, polymer, and high explosive. HEAT captures key phenomena such as shock propagation, momentum transfer, plastic deformation, and thermal effects, providing a benchmark dataset for AI/ML models of multi-material shock physics.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

HEAT is a genuinely new public dataset of 2D multi-material shock simulations that fills a stated gap, but the lack of any experimental validation or benchmark comparisons in the work limits how far it can be trusted for real-world AI surrogate training.

read the letter

The main thing to know is that this paper releases the first public dataset for high-explosive-driven multi-material shock dynamics, split into CYL and PLI partitions with time series of pressure, density, velocity, temperature, and stress fields across metals, polymers, gases, and explosives. That is a concrete addition in an area where full-physics runs are expensive and prior public data was absent.

Referee Report

2 major / 2 minor

Summary. The paper claims that no public datasets currently exist for training and validating AI surrogate models of high-explosive-driven multi-material shock dynamics. It introduces the HEAT dataset as a collection of two-dimensional cylindrically symmetric Eulerian simulations generated with a Los Alamos multi-material shock-propagation code. The dataset comprises two partitions (CYL: expanding shock-cylinder simulations across metals, polymers, water, gases, and detonating material; PLI: perturbed layered interface simulations with fixed materials) and supplies time series of thermodynamic fields (pressure, density, temperature), kinematic fields (position, velocity), and continuum quantities such as stress to capture phenomena including shock propagation, plastic deformation, and thermal effects.

Significance. If the simulations faithfully reproduce the relevant physics, HEAT would fill a genuine gap by supplying the first openly available, physics-rich benchmark dataset for developing generalizable machine-learning surrogates in multi-material high-explosive shock dynamics, thereby enabling more efficient exploration of complex multi-physics problems that are otherwise computationally prohibitive.

major comments (2)

[Simulation methodology and dataset generation] The manuscript supplies only a high-level description of the Los Alamos Eulerian multi-material code and contains no quantitative validation of the simulation outputs. No comparisons are shown to experimental Hugoniot curves, shock arrival times, free-surface velocities, or standard analytic solutions for the materials employed (aluminum, copper, tantalum, depleted uranium, stainless steel, polymer, water, air, nitrogen, and high explosive). This validation is load-bearing for the central claim that the dataset is suitable for training generalizable AI surrogate models, because any systematic bias in the EOS, strength, or reactive models would be inherited by every training example.
[Dataset partitions (CYL and PLI)] No information is provided on numerical convergence, grid-resolution studies, or uncertainty quantification for the supplied fields. Without such checks it is impossible to assess the data quality or to determine whether the captured phenomena (shock propagation, interface instabilities, plastic flow) are numerically converged for the intended ML use cases.

minor comments (2)

[Abstract] The assertion in the abstract that 'no public datasets currently exist' would be strengthened by a short literature review or citations to any related (even if narrower) public simulation datasets in shock or explosive physics.
All material models, equations of state, and constitutive relations should be explicitly referenced to standard sources or prior publications to support reproducibility and to allow users to judge the physical fidelity of the data.

Simulated Author's Rebuttal

2 responses · 0 unresolved

Thank you for the opportunity to respond to the referee's report. We value the constructive criticism and have revised the manuscript to address the concerns raised. Below we provide point-by-point responses to the major comments.

read point-by-point responses

Referee: The manuscript supplies only a high-level description of the Los Alamos Eulerian multi-material code and contains no quantitative validation of the simulation outputs. No comparisons are shown to experimental Hugoniot curves, shock arrival times, free-surface velocities, or standard analytic solutions for the materials employed (aluminum, copper, tantalum, depleted uranium, stainless steel, polymer, water, air, nitrogen, and high explosive). This validation is load-bearing for the central claim that the dataset is suitable for training generalizable AI surrogate models, because any systematic bias in the EOS, strength, or reactive models would be inherited by every training example.

Authors: We agree that the original manuscript lacked explicit quantitative validation, which is important for substantiating the dataset's utility. The underlying Los Alamos multi-material code has been validated extensively in the literature against experiments, but we did not include direct comparisons in the initial submission. In the revised manuscript, we have added a dedicated Validation subsection in the Methods that presents comparisons of simulated Hugoniot curves and shock arrival times for key materials (aluminum, copper, tantalum) to experimental data, along with references to analytic solutions such as the Gurney model for cylinder expansion. These additions help confirm the absence of major systematic biases in the EOS and strength models. revision: yes
Referee: No information is provided on numerical convergence, grid-resolution studies, or uncertainty quantification for the supplied fields. Without such checks it is impossible to assess the data quality or to determine whether the captured phenomena (shock propagation, interface instabilities, plastic flow) are numerically converged for the intended ML use cases.

Authors: We concur that convergence information is necessary to evaluate data quality for ML applications. The revised manuscript now includes a Grid Resolution and Convergence subsection detailing the uniform grid spacing employed (e.g., 50 microns for CYL cases) and results from resolution-doubling studies on representative simulations, demonstrating convergence of shock front positions and interface velocities to within 2%. Uncertainty quantification was not performed during dataset generation owing to the substantial computational cost; we have added an explicit discussion of this limitation and its implications for surrogate model training. revision: partial

Circularity Check

0 steps flagged

Dataset release paper with no derivations or predictions

full rationale

This is a data-release paper that introduces the HEAT dataset of 2-D cylindrically symmetric Eulerian simulations. No mathematical derivations, first-principles results, fitted parameters, or predictions are claimed. The manuscript describes the simulation setup and output fields at a high level but performs no analysis that reduces to its own inputs by construction, self-citation chains, or ansatz smuggling. The central claim (providing a public benchmark for AI surrogates) is independent of any load-bearing self-referential steps and stands on the external validity of the underlying Los Alamos code, which is not derived within the paper.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

The paper contributes a dataset rather than new physics; it relies on an existing LANL simulation code and standard material models without introducing or fitting new parameters, axioms, or entities in the abstract.

pith-pipeline@v0.9.0 · 5599 in / 1133 out tokens · 34482 ms · 2026-05-10T04:57:26.414708+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

11 extracted references · 11 canonical work pages

[1]

and Farimani, A

Zhou, A. and Farimani, A. , title =. Computer Methods in Applied Mechanics and Engineering , volume =

work page
[2]

Evans, L. C. , title =

work page
[3]

and McCabe, M

Ohana, R. and McCabe, M. and Meyer, L. and Morel, R. and Agocs, F. and Beneitez, M. and Berger, M. and Burkhart, B. and Burns, K. and Dalziel, S. and Fielding, D. and Fortunato, D. and Goldberg, J. and Hirashima, K. and Jiang, Y.-F. and Ker , title =. Advances in Neural Information Processing Systems , volume =

work page
[4]

Oberkampf, W. L. and Trucano, T. G. , title =. Progress in Aerospace Sciences , volume =

work page
[5]

oceans11.lanl.gov , year =

work page
[6]

2025 , howpublished =

Hickmann, Kyle and Callis, Skylar and Egozi, Gal and De, Soumi and Kaiser, Bryan and Pandit, Sourabh and Chakrabarti, Sharmistha and Armstrong, Derek and Henrick, Andrew and Schodt, David and Toivonen, Andrew and Stachelek, Jemma , title =. 2025 , howpublished =

work page 2025
[7]

2025 , number =

Culp, David , title =. 2025 , number =

work page 2025
[8]

and Peterson, N

Miers, K. and Peterson, N. and Perry, W. L. and Lystrom, L. and Sweitzer, J. and DeFisher, S. , title =. International Journal of Impact Engineering , volume =

work page
[9]

and Carver, R

Subramanian, G. and Carver, R. and McDermott, D. and Coleman, A. and Morneau, R. and Jadrich, R. and Ticknor, C. , title =

work page
[10]

and Triola, C

Smith, T. and Triola, C. and Joggerst, C. , title =

work page
[11]

and Pimental, D

Sheppard, D. and Pimental, D. , title =

work page

[1] [1]

and Farimani, A

Zhou, A. and Farimani, A. , title =. Computer Methods in Applied Mechanics and Engineering , volume =

work page

[2] [2]

Evans, L. C. , title =

work page

[3] [3]

and McCabe, M

Ohana, R. and McCabe, M. and Meyer, L. and Morel, R. and Agocs, F. and Beneitez, M. and Berger, M. and Burkhart, B. and Burns, K. and Dalziel, S. and Fielding, D. and Fortunato, D. and Goldberg, J. and Hirashima, K. and Jiang, Y.-F. and Ker , title =. Advances in Neural Information Processing Systems , volume =

work page

[4] [4]

Oberkampf, W. L. and Trucano, T. G. , title =. Progress in Aerospace Sciences , volume =

work page

[5] [5]

oceans11.lanl.gov , year =

work page

[6] [6]

2025 , howpublished =

Hickmann, Kyle and Callis, Skylar and Egozi, Gal and De, Soumi and Kaiser, Bryan and Pandit, Sourabh and Chakrabarti, Sharmistha and Armstrong, Derek and Henrick, Andrew and Schodt, David and Toivonen, Andrew and Stachelek, Jemma , title =. 2025 , howpublished =

work page 2025

[7] [7]

2025 , number =

Culp, David , title =. 2025 , number =

work page 2025

[8] [8]

and Peterson, N

Miers, K. and Peterson, N. and Perry, W. L. and Lystrom, L. and Sweitzer, J. and DeFisher, S. , title =. International Journal of Impact Engineering , volume =

work page

[9] [9]

and Carver, R

Subramanian, G. and Carver, R. and McDermott, D. and Coleman, A. and Morneau, R. and Jadrich, R. and Ticknor, C. , title =

work page

[10] [10]

and Triola, C

Smith, T. and Triola, C. and Joggerst, C. , title =

work page

[11] [11]

and Pimental, D

Sheppard, D. and Pimental, D. , title =

work page