Output Type Before Quality: A Standards-Derived XAI Admissibility Rubric for Autonomous-Driving Safety

Abhinaw Priyadershi; Jelena Frtunikj; Mandar Pitale; Maria Spence

arxiv: 2606.05461 · v1 · pith:5GW2PSQ7new · submitted 2026-06-03 · 💻 cs.AI

Output Type Before Quality: A Standards-Derived XAI Admissibility Rubric for Autonomous-Driving Safety

Abhinaw Priyadershi , Mandar Pitale , Jelena Frtunikj , Maria Spence This is my paper

Pith reviewed 2026-06-28 05:50 UTC · model grok-4.3

classification 💻 cs.AI

keywords XAIautonomous drivingsafety standardsassurance casescausal explanationsAMLASISO 26262evidence-type gap

0 comments

The pith

Causal XAI is required to meet evidence demands from autonomous-driving safety standards at three lifecycle stages.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper derives 19 evidentiary criteria from standards including AMLAS, ISO 26262, ISO 21448 and ISO/PAS 8800, then scores six classes of XAI methods against those criteria across seven lifecycle stages. It finds that only causal XAI produces the required directed cause-and-effect chains, quantified interventional effects and named root-cause variables for hazard identification, incident investigation and data management. At the other four stages, correlational or language-based methods meet the criteria. A single-VLA check on 1,996 real driving clips confirms that each method's observed output type matches the rubric prediction. The result implies that XAI selection for ADS assurance cases must follow stage-specific evidence needs rather than method popularity.

Core claim

From the cited standards the authors extract 19 testable evidentiary criteria that assurance cases must satisfy with directed cause-and-effect chains, quantified interventional effects and named root-cause variables. Structural scoring of XAI method classes shows causal XAI is required to close the evidence-type gap at hazard identification (+62 percent), incident investigation (+50 percent) and data management (+50 percent). The verdict set remains stable for thresholds from 0 percent to 50 percent and survives a worst-case single-cell flip down to 25 percent. At the remaining four stages correlational or language-based methods are comparable or sufficient. The rubric addresses structural a

What carries the argument

The 19-criterion admissibility rubric that scores XAI method classes by whether their output types can supply the directed cause-and-effect evidence required by the standards at each lifecycle stage.

If this is right

XAI method selection for ADS safety cases must be driven by the evidence demand of each lifecycle stage.
Causal XAI is the only class that satisfies the rubric at hazard identification, incident investigation and data management.
At the other four stages correlational or language-based methods remain admissible under the rubric.
Even an admissible method still requires separate validation that its concrete outputs are factually correct.
The stability of the verdict set down to a 25 percent threshold indicates the structural result is not fragile to small scoring changes.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same standards-to-rubric approach could be reused for other regulated domains that require causal evidence in assurance cases.
Developers might design stage-aware XAI pipelines that switch method class according to the current lifecycle stage.
The open fidelity-validation challenge points to a need for new benchmarks that test whether a causal graph or trace names the actual root cause in driving data.

Load-bearing premise

The 19 criteria derived from the standards correctly capture the evidence types that XAI outputs must supply in autonomous-driving assurance cases.

What would settle it

Demonstration that a non-causal XAI method, such as SHAP, can be implemented to produce directed cause-and-effect chains that satisfy the hazard-identification criteria would falsify the structural scoring.

Figures

Figures reproduced from arXiv: 2606.05461 by Abhinaw Priyadershi, Jelena Frtunikj, Mandar Pitale, Maria Spence.

**Figure 1.** Figure 1: The evidence-type gap. A safety standard demands one evidence type; a popular XAI method produces another. Left: ISO/PAS 8800 Cl. 6.7.1 defines a directed cause-and-effect chain from triggering condition to hazardous behaviour that Cl. 13.3.1– 13.3.3 normatively binds to safety-analysis evidence. Right: SHAP returns a ranked feature-importance list; in a silent-failure example from Section 5, the actual ca… view at source ↗

**Figure 2.** Figure 2: SCM is the only column with zero structural failures across the [PITH_FULL_IMAGE:figures/full_fig_p007_2.png] view at source ↗

**Figure 3.** Figure 3: Three stages clear the 30% causal-necessity threshold; all survive a one-step cell flip. Best causal-method score (filled square) vs best non-causal (open circle) per stage, with the verdict annotation. Hazard ID survives any one-step flip, incident investigation survives any one-step flip, and data management has a 3-criterion margin (D1, D2, D3 each favour SCM). incident investigation ≻ data management b… view at source ↗

**Figure 4.** Figure 4: The largest empirical gap in the paper: marginal effect estimates [PITH_FULL_IMAGE:figures/full_fig_p010_4.png] view at source ↗

read the original abstract

Safety standards for ML-based autonomous driving specify the kind of evidence an assurance case must contain (directed cause-and-effect chains, quantified interventional effects, named root-cause variables), yet the XAI literature is organised by output type and technique family (saliency maps, feature attribution, counterfactuals, causal graphs, language traces). SHAP, the most-recommended ADS XAI method, returns a ranked feature list that no implementation effort can convert into a directed chain (Fig.1). We name this mismatch the evidence-type gap. From AMLAS, ISO 26262, ISO21448, ISO/PAS 8800 we derive 19 testable evidentiary criteria across 7 lifecycle stages with representative clause-cited derivations and score six XAI method classes structurally. Causal XAI emerges as structurally required to satisfy the derived criteria at three stages: hazard identification (+62% rubric gap), incident investigation (+50%), and data management (+50%); the verdict set is stable across thresholds T in (0%, 50%]$ and survives a worst-case single-cell flip down to T = 25%. At the remaining four stages, correlational or language-based methods are comparable or sufficient. The rubric identifies structural admissibility (necessary but not sufficient for compliance): an admissible method's specific output content may still be wrong, and validating that fidelity (the edges a fitted SCM produces, the cause a trace names) is the open assurance challenge. A single-VLA proof of concept on 1,996 real-world driving clips (79,840 rows, ten splits) is consistent with each method's observed output type matching its rubric prediction. XAI method selection for ADS safety assurance should be driven by lifecycle-stage evidence demand, not by method popularity.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper turns safety standards into a 19-criterion rubric that flags causal XAI as required at three ADS lifecycle stages, but the criteria-to-standard mappings and method scorings need direct verification.

read the letter

The main takeaway is that the authors extract 19 evidentiary criteria from AMLAS, ISO 26262, ISO 21448 and ISO/PAS 8800, then score six XAI method classes on whether their output types can supply the required evidence at each of seven lifecycle stages. Causal XAI shows up as structurally necessary for hazard identification, incident investigation and data management, with reported gaps of 62% and 50%, while correlational or language methods suffice elsewhere. The verdict holds across a range of thresholds and a single-cell flip test.

What is new is the explicit focus on output type rather than technique family, plus the stage-by-stage application to assurance-case evidence demands. The paper does a solid job grounding the evaluation in external standards instead of XAI-internal benchmarks, and the 1,996-clip validation on real driving data at least confirms that observed outputs line up with the rubric predictions.

The soft spots sit in the two mapping steps the stress-test note flags. The clause-to-criterion derivations are cited but not shown in enough detail here to judge how much interpretation was involved, and the structural scoring of methods like SHAP treats output capabilities as fixed even after possible post-processing. If either mapping is off, the size of the gaps changes. The internal consistency check on the clips does not test whether the rubric itself matches what the standards actually require.

This is for safety engineers and assurance-case writers in autonomous driving who have to justify XAI tool choices to regulators. A reader working on compliance would get concrete criteria to apply. It deserves peer review because the core mismatch it identifies is real and the rubric gives a usable starting point, even if reviewers will press on the exact derivations and scoring rules.

Referee Report

1 major / 1 minor

Summary. The paper derives 19 evidentiary criteria from AMLAS, ISO 26262, ISO 21448 and ISO/PAS 8800 across seven lifecycle stages for autonomous-driving assurance cases. It structurally scores six XAI method classes (including SHAP, saliency, counterfactuals, causal graphs and language traces) on whether their output types can ever supply the required evidence (directed cause-and-effect chains, quantified interventional effects, named root-cause variables). Causal XAI is found structurally required at hazard identification (+62% gap), incident investigation (+50%) and data management (+50%), with verdicts stable for thresholds T in (0%,50%] and robust to single-cell flips down to T=25%. At the other four stages correlational or language methods suffice. A single-VLA proof-of-concept on 1,996 real-world clips (79,840 rows, ten splits) is reported as consistent with each method’s rubric-predicted output type. The rubric addresses structural admissibility only.

Significance. If the clause-cited derivations and structural scoring hold, the work supplies a standards-grounded, output-type-driven rubric for XAI selection in ADS safety cases rather than popularity-based choice. The explicit stability analysis across T thresholds and the 1,996-clip proof-of-concept (with ten splits) constitute reproducible empirical support for the internal consistency of the rubric. These elements strengthen the central claim that method selection must be driven by lifecycle-stage evidence demand.

major comments (1)

[Abstract (criteria derivation and validation paragraph); § on 19 evidentiary criteria and method scoring] The central claim that causal XAI is structurally required at the three named stages rests on the accuracy of the 19 criteria extracted from the cited standards and on the structural scoring of the six method classes. The single-VLA proof-of-concept validates only that observed output types match the paper’s own rubric predictions; it does not independently test whether the criteria correctly encode the evidence types demanded by the standards or whether the scoring accurately reflects what each method class can supply even after post-processing. This mapping is load-bearing for the reported gaps (+62%, +50%, +50%) and for the stability result.

minor comments (1)

[Abstract (Fig. 1 reference)] Figure 1 is referenced in the abstract but its caption and axis labels are not described in the provided text; ensure the figure explicitly shows why a ranked feature list cannot be converted into a directed chain.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive comment and for recognizing the significance of the standards-derived rubric and the stability analysis. We address the major comment point by point below.

read point-by-point responses

Referee: [Abstract (criteria derivation and validation paragraph); § on 19 evidentiary criteria and method scoring] The central claim that causal XAI is structurally required at the three named stages rests on the accuracy of the 19 criteria extracted from the cited standards and on the structural scoring of the six method classes. The single-VLA proof-of-concept validates only that observed output types match the paper’s own rubric predictions; it does not independently test whether the criteria correctly encode the evidence types demanded by the standards or whether the scoring accurately reflects what each method class can supply even after post-processing. This mapping is load-bearing for the reported gaps (+62%, +50%, +50%) and for the stability result.

Authors: We agree that the single-VLA proof-of-concept (1,996 clips, ten splits) only demonstrates internal consistency: that each method class produces output types matching the rubric predictions. It does not constitute an independent empirical test of whether the 19 criteria faithfully encode the evidentiary demands of the cited standards clauses, nor does it validate the structural scoring against post-processed implementations. The criteria are derived via direct clause extraction from AMLAS, ISO 26262, ISO 21448 and ISO/PAS 8800, with representative derivations supplied in the manuscript so that readers may assess the mapping themselves. The structural scoring evaluates output-type capability (e.g., whether a ranked feature list can ever supply a directed cause-and-effect chain) rather than post-processing potential. We will revise the abstract and the criteria-derivation section to state explicitly the limited scope of the empirical component and to flag that independent validation of the criteria-to-standard mapping against real assurance cases remains an open task. This makes the load-bearing assumptions transparent without altering the reported gaps or stability results. revision: partial

Circularity Check

0 steps flagged

No significant circularity; derivation from external standards is independent

full rationale

The paper derives its 19 evidentiary criteria from external standards (AMLAS, ISO 26262, ISO 21448, ISO/PAS 8800) with clause-cited derivations, then applies structural scoring to XAI method classes based on their inherent output types (e.g., SHAP cannot produce directed chains). This chain does not reduce to self-definition, fitted inputs renamed as predictions, or self-citation load-bearing. The single-VLA proof-of-concept tests consistency with the rubric but does not substitute for external validation of the rubric itself. No equations or steps exhibit the enumerated circular patterns; the central claims rest on mappings to independent standards rather than internal construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the accuracy of the standards-derived criteria and the classification of XAI method output types; no free parameters or new entities are introduced in the abstract.

axioms (1)

domain assumption The safety standards AMLAS, ISO 26262, ISO21448, ISO/PAS 8800 specify evidentiary requirements that can be translated into testable criteria for XAI output types.
The paper derives 19 criteria from these standards across 7 lifecycle stages.

pith-pipeline@v0.9.1-grok · 5869 in / 1322 out tokens · 39026 ms · 2026-06-28T05:50:33.068498+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

16 extracted references · 4 canonical work pages

[1]

arXiv preprint arXiv:2112.11561 (2024), revised 2024

Atakishiyev,S.,Salameh,M.,Yao,H.,Goebel,R.:Explainableartificialintelligence for autonomous driving: A comprehensive overview and field guide. arXiv preprint arXiv:2112.11561 (2024), revised 2024

work page arXiv 2024
[2]

Chickering, D.M.: Optimal structure identification with greedy search. J. Mach. Learn. Res.3, 507–554 (2002)

2002
[3]

arXiv preprint arXiv:2408.02379 (2024)

Fresz, B., Göbels, V.P., Omri, S., Brajovic, D., Aichele, A., Kutz, J., Neuhüttler, J., Huber, M.F.: The contribution of XAI for the safe development and certification of AI: An expert-based analysis. arXiv preprint arXiv:2408.02379 (2024)

work page arXiv 2024
[4]

Hawkins, R., Paterson, C., Picardi, C., Jia, Y., Calinescu, R., Habli, I.: Guidance on the assurance of machine learning in autonomous systems (AMLAS). Tech. rep., University of York (2021), arXiv:2102.01564

work page arXiv 2021
[5]

ISO: Road vehicles — functional safety (2018), iSO 26262:2018

2018
[6]

ISO: Road vehicles — safety of the intended functionality (2022), iSO 21448:2022

2022
[7]

ISO: Road vehicles — safety and artificial intelligence (2024), iSO/PAS 8800:2024

2024
[8]

arXiv preprint arXiv:2402.10086 (2024)

Kuznietsov, A., Gyevnar, B., Wang, C., Peters, S., Albrecht, S.V.: Explainable AI for safe and trustworthy autonomous driving: A systematic review. arXiv preprint arXiv:2402.10086 (2024)

work page arXiv 2024
[9]

In: Proc

Lundberg, S.M., Lee, S.I.: A unified approach to interpreting model predictions. In: Proc. NeurIPS (2017)

2017
[10]

https://huggingface.co/ datasets/nvidia/PhysicalAI-Autonomous-Vehicles (2025)

NVIDIA: PhysicalAI-Autonomous-Vehicles dataset. https://huggingface.co/ datasets/nvidia/PhysicalAI-Autonomous-Vehicles (2025)

2025
[11]

Pearl,J.:Causality:Models,Reasoning,andInference.CambridgeUniversityPress, 2nd edn. (2009)

2009
[12]

MIT Press (2017)

Peters, J., Janzing, D., Schölkopf, B.: Elements of Causal Inference. MIT Press (2017)

2017
[13]

In: Proc

Priyadershi, A., Frtunikj, J.: Lost in fog: Sensor perturbations expose reasoning fragility in driving VLAs. In: Proc. CVPR Workshops (SAIAD) (2026), to appear

2026
[14]

In: Proc

Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad- CAM: Visual explanations from deep networks via gradient-based localization. In: Proc. IEEE ICCV (2017)

2017
[15]

MIT Press, 2nd edn

Spirtes, P., Glymour, C., Scheines, R.: Causation, Prediction, and Search. MIT Press, 2nd edn. (2000)

2000
[16]

Harvard J

Wachter, S., Mittelstadt, B., Russell, C.: Counterfactual explanations without opening the black box. Harvard J. Law Technol.31(2) (2018)

2018

[1] [1]

arXiv preprint arXiv:2112.11561 (2024), revised 2024

Atakishiyev,S.,Salameh,M.,Yao,H.,Goebel,R.:Explainableartificialintelligence for autonomous driving: A comprehensive overview and field guide. arXiv preprint arXiv:2112.11561 (2024), revised 2024

work page arXiv 2024

[2] [2]

Chickering, D.M.: Optimal structure identification with greedy search. J. Mach. Learn. Res.3, 507–554 (2002)

2002

[3] [3]

arXiv preprint arXiv:2408.02379 (2024)

Fresz, B., Göbels, V.P., Omri, S., Brajovic, D., Aichele, A., Kutz, J., Neuhüttler, J., Huber, M.F.: The contribution of XAI for the safe development and certification of AI: An expert-based analysis. arXiv preprint arXiv:2408.02379 (2024)

work page arXiv 2024

[4] [4]

Hawkins, R., Paterson, C., Picardi, C., Jia, Y., Calinescu, R., Habli, I.: Guidance on the assurance of machine learning in autonomous systems (AMLAS). Tech. rep., University of York (2021), arXiv:2102.01564

work page arXiv 2021

[5] [5]

ISO: Road vehicles — functional safety (2018), iSO 26262:2018

2018

[6] [6]

ISO: Road vehicles — safety of the intended functionality (2022), iSO 21448:2022

2022

[7] [7]

ISO: Road vehicles — safety and artificial intelligence (2024), iSO/PAS 8800:2024

2024

[8] [8]

arXiv preprint arXiv:2402.10086 (2024)

Kuznietsov, A., Gyevnar, B., Wang, C., Peters, S., Albrecht, S.V.: Explainable AI for safe and trustworthy autonomous driving: A systematic review. arXiv preprint arXiv:2402.10086 (2024)

work page arXiv 2024

[9] [9]

In: Proc

Lundberg, S.M., Lee, S.I.: A unified approach to interpreting model predictions. In: Proc. NeurIPS (2017)

2017

[10] [10]

https://huggingface.co/ datasets/nvidia/PhysicalAI-Autonomous-Vehicles (2025)

NVIDIA: PhysicalAI-Autonomous-Vehicles dataset. https://huggingface.co/ datasets/nvidia/PhysicalAI-Autonomous-Vehicles (2025)

2025

[11] [11]

Pearl,J.:Causality:Models,Reasoning,andInference.CambridgeUniversityPress, 2nd edn. (2009)

2009

[12] [12]

MIT Press (2017)

Peters, J., Janzing, D., Schölkopf, B.: Elements of Causal Inference. MIT Press (2017)

2017

[13] [13]

In: Proc

Priyadershi, A., Frtunikj, J.: Lost in fog: Sensor perturbations expose reasoning fragility in driving VLAs. In: Proc. CVPR Workshops (SAIAD) (2026), to appear

2026

[14] [14]

In: Proc

Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., Batra, D.: Grad- CAM: Visual explanations from deep networks via gradient-based localization. In: Proc. IEEE ICCV (2017)

2017

[15] [15]

MIT Press, 2nd edn

Spirtes, P., Glymour, C., Scheines, R.: Causation, Prediction, and Search. MIT Press, 2nd edn. (2000)

2000

[16] [16]

Harvard J

Wachter, S., Mittelstadt, B., Russell, C.: Counterfactual explanations without opening the black box. Harvard J. Law Technol.31(2) (2018)

2018