arxiv: 1807.08024 · v3 · submitted 2018-07-20 · 💻 cs.CV

Explaining Image Classifiers by Counterfactual Generation

classification 💻 cs.CV
keywords: image, classifier, parts, change, decision, relevant, seen, ad-hoc

When an image classifier makes a prediction, which parts of the image are relevant and why? We can rephrase this question to ask: which parts of the image, if they were not seen by the classifier, would most change its decision? Producing an answer requires marginalizing over images that could have been seen but weren't. We can sample plausible image in-fills by conditioning a generative model on the rest of the image. We then optimize to find the image regions that most change the classifier's decision after in-fill. Our approach contrasts with ad-hoc in-filling approaches, such as blurring or injecting noise, which generate inputs far from the data distribution, and ignore informative relationships between different parts of the image. Our method produces more compact and relevant saliency maps, with fewer artifacts compared to previous methods.
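
The idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the "image" is a short 1-D list of pixel values, `classifier` is a hypothetical hand-written scoring function, `sample_infill` is a crude Gaussian stand-in for a conditional generative model, and an exhaustive search over small regions replaces the optimization described in the abstract. All names and constants are assumptions made for illustration.

```python
import math
import random


def classifier(x):
    """Hypothetical toy classifier: probability that the signal has a bright spot."""
    return 1.0 / (1.0 + math.exp(-(max(x) - 0.5) * 10.0))


def sample_infill(x, region, rng):
    """Stand-in for a conditional generative model (an assumption, not the paper's).

    Masked pixels are redrawn around the mean of the pixels that remain visible,
    so the in-fill depends on the rest of the image.
    """
    visible = [v for i, v in enumerate(x) if i not in region]
    mean = sum(visible) / len(visible)
    return [rng.gauss(mean, 0.05) if i in region else v for i, v in enumerate(x)]


def marginal_score(x, region, n_samples=200, seed=0):
    """Monte Carlo estimate of the classifier output with `region` marginalized out."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        total += classifier(sample_infill(x, region, rng))
    return total / n_samples


def most_relevant_region(x, width=1):
    """Exhaustively find the contiguous region whose in-fill most lowers the score."""
    base = classifier(x)
    best_region, best_drop = None, float("-inf")
    for start in range(len(x) - width + 1):
        region = set(range(start, start + width))
        drop = base - marginal_score(x, region)
        if drop > best_drop:
            best_region, best_drop = region, drop
    return best_region, best_drop


# A flat signal with one bright pixel at index 3.
image = [0.1, 0.1, 0.1, 0.9, 0.1, 0.1, 0.1, 0.1]
region, drop = most_relevant_region(image)
print(region, round(drop, 3))
```

In this toy setting, in-filling any region that leaves the bright pixel visible does not change the score at all, while in-filling the bright pixel itself replaces it with values typical of its surroundings and the score drops sharply. A blur or noise baseline would instead produce inputs that the classifier may never have seen, which is the contrast the abstract draws.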

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Explainable Human Activity Recognition: A Unified Review of Concepts and Mechanisms

    cs.LG 2026-04 unverdicted novelty 4.0

    The paper delivers a mechanism-centric taxonomy and unified perspective on explainable human activity recognition methods across sensing modalities.