Investigating Bias in Image Classification using Model Explanations

Schrasing Tong (1), Lalana Kagal (1) ((1) Massachusetts Institute of Technology)

arxiv: 2012.05463 · v1 · pith:3HLQBJRBnew · submitted 2020-12-10 · 💻 cs.CV · cs.LG

keywords: bias, explanations, change, classification, degree, image, model, additional

We evaluated whether model explanations could efficiently detect bias in image classification by highlighting discriminating features, thereby removing the reliance on sensitive attributes for fairness calculations. To this end, we formulated important characteristics for bias detection and observed how explanations change as the degree of bias in models changes. The paper identifies strengths and best practices for detecting bias using explanations, as well as three main weaknesses: explanations poorly estimate the degree of bias, could potentially introduce additional bias into the analysis, and are sometimes inefficient in terms of the human effort involved.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Privacy Beyond Pixels: Latent Anonymization for Privacy-Preserving Video Understanding
cs.CV 2025-11 conditional novelty 6.0

A plug-and-play Anonymizing Adapter Module removes private information from video latent features using self-supervised privacy objectives and consistency losses while retaining utility on action recognition, temporal...