Local Interpretable Model-agnostic Explanations of Bayesian Predictive Models via Kullback-Leibler Projections
read the original abstract
We introduce a method, KL-LIME, for explaining predictions of Bayesian predictive models by projecting the information in the predictive distribution locally to a simpler, interpretable explanation model. The proposed approach combines the recent Local Interpretable Model-agnostic Explanations (LIME) method with ideas from Bayesian projection predictive variable selection methods. The information theoretic basis helps in navigating the trade-off between explanation fidelity and complexity. We demonstrate the method in explaining MNIST digit classifications made by a Bayesian deep convolutional neural network.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
A Unified Framework for Uncertainty-Aware Explainable Artificial Intelligence: A Case Study in Power Quality Disturbance Classification
Formalizes explanation distributions from BNNs via push-forward measures and proposes UA-RAO operators to summarize them, with empirical gains in localization on a 15-class power quality disturbance task using deep ensembles.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.