Title resolution pending

Stop explaining black box machine learning models for high stakes decisions · 2019 · DOI 10.1038/s42256-019-0048-x

11 Pith papers cite this work. Polarity classification is still indexing.

11 Pith papers citing it

open at publisher browse 11 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Interpretable Machine Learning for Spatial Science: A Lie-Algebraic Kernel for Rotationally Anisotropic Gaussian Processes

stat.ML · 2026-05-11 · unverdicted · novelty 7.0

A Lie-algebraic kernel reparameterizes 3D rotationally anisotropic Gaussian processes with explicit principal length-scales and SO(3) orientations, matching full SPD flexibility but improving interpretability over axis-aligned ARD.

ProtoSSL: Interpretable Prototype Learning from Unlabeled Time-Series Data

cs.LG · 2026-05-07 · unverdicted · novelty 7.0

ProtoSSL discovers generalizable prototypes from unlabeled time-series via self-supervision and assigns them to new tasks for interpretable predictions, outperforming supervised baselines in low-data regimes on ECG datasets.

Measuring Faithfulness in Chain-of-Thought Reasoning

cs.AI · 2023-07-17 · conditional · novelty 7.0

Chain-of-Thought reasoning in LLMs is often unfaithful, with models relying on it variably by task and less so as models scale larger.

Correcting Influence: Unboxing LLM Outputs with Orthogonal Latent Spaces

cs.LG · 2026-05-12 · unverdicted · novelty 6.0

A latent mediation framework with sparse autoencoders enables non-additive token-level influence attribution in LLMs by learning orthogonal features and back-propagating attributions.

How Researchers Navigate Accountability, Transparency, and Trust When Using AI Tools in Early-Stage Research: A Think-Aloud Study

cs.CY · 2026-04-25 · unverdicted · novelty 6.0

A think-aloud study reveals that AI tools in early research misrepresent uncertainty, obscure provenance, and create fragile trust, leading researchers to develop compensatory strategies to preserve scholarly judgment.

Interpretable Quantile Regression by Optimal Decision Trees

cs.LG · 2026-04-22 · unverdicted · novelty 5.0

A novel algorithm learns sets of optimal quantile regression trees to predict full conditional distributions interpretably and efficiently.

From Large Language Model Predicates to Logic Tensor Networks: Neurosymbolic Offer Validation in Regulated Procurement

cs.AI · 2026-04-07 · unverdicted · novelty 5.0

A neurosymbolic pipeline extracts predicates from offer texts with an LLM and validates them via Logic Tensor Networks, delivering performance comparable to standard models plus built-in interpretability on a real corpus.

Robust and Explainable Divide-and-Conquer Learning for Intrusion Detection

cs.LG · 2026-05-03 · unverdicted · novelty 4.0

A divide-and-conquer method decomposes network intrusion detection into focused subtasks, allowing lightweight models to gain up to 43.3% higher local accuracy and 257x smaller size while improving robustness and explainability.

LLMs Should Not Yet Be Credited with Decision Explanation

cs.AI · 2026-05-01 · unverdicted · novelty 4.0

LLMs support decision prediction and rationale generation but lack evidence for genuine decision explanation, requiring stricter standards to avoid over-crediting.

Human Agency, Causality, and the Human Computer Interface in High-Stakes Artificial Intelligence

cs.HC · 2026-04-14 · unverdicted · novelty 4.0

The paper proposes a Causal-Agency Framework to restore human causal control at AI interfaces by combining causal models, uncertainty quantification, and human-centered evaluation.

Toward Aristotelian Medical Representations: Backpropagation-Free Layer-wise Analysis for Interpretable Generalized Metric Learning on MedMNIST

cs.CV · 2026-04-07 · unverdicted · novelty 4.0

A-ROM delivers competitive MedMNIST performance via pretrained ViT metric spaces, a concept dictionary, and kNN without backpropagation or fine-tuning, framed as interpretable few-shot learning under the Platonic Representation Hypothesis.

citing papers explorer

Showing 11 of 11 citing papers.

Interpretable Machine Learning for Spatial Science: A Lie-Algebraic Kernel for Rotationally Anisotropic Gaussian Processes stat.ML · 2026-05-11 · unverdicted · none · ref 19
A Lie-algebraic kernel reparameterizes 3D rotationally anisotropic Gaussian processes with explicit principal length-scales and SO(3) orientations, matching full SPD flexibility but improving interpretability over axis-aligned ARD.
ProtoSSL: Interpretable Prototype Learning from Unlabeled Time-Series Data cs.LG · 2026-05-07 · unverdicted · none · ref 6
ProtoSSL discovers generalizable prototypes from unlabeled time-series via self-supervision and assigns them to new tasks for interpretable predictions, outperforming supervised baselines in low-data regimes on ECG datasets.
Measuring Faithfulness in Chain-of-Thought Reasoning cs.AI · 2023-07-17 · conditional · none · ref 20
Chain-of-Thought reasoning in LLMs is often unfaithful, with models relying on it variably by task and less so as models scale larger.
Correcting Influence: Unboxing LLM Outputs with Orthogonal Latent Spaces cs.LG · 2026-05-12 · unverdicted · none · ref 183
A latent mediation framework with sparse autoencoders enables non-additive token-level influence attribution in LLMs by learning orthogonal features and back-propagating attributions.
How Researchers Navigate Accountability, Transparency, and Trust When Using AI Tools in Early-Stage Research: A Think-Aloud Study cs.CY · 2026-04-25 · unverdicted · none · ref 59
A think-aloud study reveals that AI tools in early research misrepresent uncertainty, obscure provenance, and create fragile trust, leading researchers to develop compensatory strategies to preserve scholarly judgment.
Interpretable Quantile Regression by Optimal Decision Trees cs.LG · 2026-04-22 · unverdicted · none · ref 14
A novel algorithm learns sets of optimal quantile regression trees to predict full conditional distributions interpretably and efficiently.
From Large Language Model Predicates to Logic Tensor Networks: Neurosymbolic Offer Validation in Regulated Procurement cs.AI · 2026-04-07 · unverdicted · none · ref 16
A neurosymbolic pipeline extracts predicates from offer texts with an LLM and validates them via Logic Tensor Networks, delivering performance comparable to standard models plus built-in interpretability on a real corpus.
Robust and Explainable Divide-and-Conquer Learning for Intrusion Detection cs.LG · 2026-05-03 · unverdicted · none · ref 33
A divide-and-conquer method decomposes network intrusion detection into focused subtasks, allowing lightweight models to gain up to 43.3% higher local accuracy and 257x smaller size while improving robustness and explainability.
LLMs Should Not Yet Be Credited with Decision Explanation cs.AI · 2026-05-01 · unverdicted · none · ref 33
LLMs support decision prediction and rationale generation but lack evidence for genuine decision explanation, requiring stricter standards to avoid over-crediting.
Human Agency, Causality, and the Human Computer Interface in High-Stakes Artificial Intelligence cs.HC · 2026-04-14 · unverdicted · none · ref 24
The paper proposes a Causal-Agency Framework to restore human causal control at AI interfaces by combining causal models, uncertainty quantification, and human-centered evaluation.
Toward Aristotelian Medical Representations: Backpropagation-Free Layer-wise Analysis for Interpretable Generalized Metric Learning on MedMNIST cs.CV · 2026-04-07 · unverdicted · none · ref 29
A-ROM delivers competitive MedMNIST performance via pretrained ViT metric spaces, a concept dictionary, and kNN without backpropagation or fine-tuning, framed as interpretable few-shot learning under the Platonic Representation Hypothesis.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer