arxiv: 1904.12991 · v2 · pith:RTRSXWW6new · submitted 2019-04-29 · cs.LG · cs.AI · stat.ML

"Why Should You Trust My Explanation?" Understanding Uncertainty in LIME Explanations

Yujia Zhang, Kuangyan Song, Yiming Sun, Sarah Tan, Madeleine Udell

classification cs.LG, cs.AI, stat.ML

keywords uncertainty, data, lime, explanations, models, outcomes, reliability, trust

Methods for interpreting machine learning black-box models increase the outcomes' transparency and in turn generate insight into the reliability and fairness of the algorithms. However, the interpretations themselves could contain significant uncertainty that undermines trust in the outcomes and raises concerns about the model's reliability. Focusing on the method "Local Interpretable Model-agnostic Explanations" (LIME), we demonstrate the presence of two sources of uncertainty, namely the randomness in its sampling procedure and the variation of interpretation quality across different input data points. Such uncertainty is present even in models with high training and test accuracy. We apply LIME to synthetic data and two public data sets, text classification in 20 Newsgroups and recidivism risk-scoring in COMPAS, to support our argument.
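
The first source of uncertainty named in the abstract, randomness in the sampling procedure, can be illustrated with a minimal NumPy sketch. This is not the authors' experimental code and it does not use the `lime` package. It assumes a simplified LIME-style procedure (Gaussian perturbations around the input, an exponential proximity kernel, and a weighted least-squares linear surrogate). The function names `black_box`, `local_surrogate`, and `coefficient_spread`, the kernel width, and the sample sizes are all hypothetical choices made for illustration.

```python
import numpy as np


def black_box(X):
    # Hypothetical nonlinear model standing in for any black-box classifier score.
    return 1.0 / (1.0 + np.exp(-(2.0 * X[:, 0] - X[:, 1] ** 2 + 0.1 * X[:, 2])))


def local_surrogate(x, predict, n_samples, rng, scale=1.0, kernel_width=0.75):
    # Draw random perturbations around the point being explained (assumed Gaussian).
    Z = x + rng.normal(0.0, scale, size=(n_samples, x.size))
    y = predict(Z)
    # Weight each perturbed sample by its proximity to x (assumed exponential kernel).
    dist = np.linalg.norm(Z - x, axis=1)
    weights = np.exp(-(dist ** 2) / kernel_width ** 2)
    # Fit a weighted linear model with intercept via least squares.
    A = np.hstack([np.ones((n_samples, 1)), Z - x])
    sw = np.sqrt(weights)
    coef, *_ = np.linalg.lstsq(A * sw[:, None], y * sw, rcond=None)
    return coef[1:]  # per-feature local importance, intercept dropped


def coefficient_spread(x, predict, n_samples, n_runs=30, seed=0):
    # Repeat the explanation with different random seeds and summarize the variation.
    runs = [
        local_surrogate(x, predict, n_samples, np.random.default_rng(seed + i))
        for i in range(n_runs)
    ]
    coefs = np.array(runs)
    return coefs.mean(axis=0), coefs.std(axis=0)


x0 = np.array([0.5, -1.0, 2.0])
mean_small, std_small = coefficient_spread(x0, black_box, n_samples=50)
mean_large, std_large = coefficient_spread(x0, black_box, n_samples=5000)

print("std across runs, 50 samples:  ", std_small)
print("std across runs, 5000 samples:", std_large)
```

Under these assumptions, the standard deviation of each coefficient across seeds gives a rough measure of how much a single explanation depends on the random draw. Comparing the two sample sizes shows how the spread changes as more perturbed samples are used.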

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Do Fair Models Reason Fairly? Counterfactual Explanation Consistency for Procedural Fairness in Credit Decisions
cs.LG 2026-05 unverdicted novelty 7.0

Outcome-fair credit models often exhibit hidden procedural bias through inconsistent reasoning across groups, which the CEC framework mitigates by enforcing consistent feature attributions via counterfactuals.

Fairness of Explanations in Artificial Intelligence (AI): A Unifying Framework, Axioms, and Future Direction toward Responsible AI
cs.AI 2026-05 unverdicted novelty 6.0

A conditional invariance framework defines explanation fairness as explanations being statistically independent of protected attributes given task-relevant features, unifying existing metrics and enabling procedural b...