Bias in Bios: A Case Study of Semantic Representation Bias in a High-Stakes Setting

2019 · cs.IR · arXiv 1901.09451

2 Pith papers cite this work. Polarity classification is still indexing.



abstract

We present a large-scale study of gender bias in occupation classification, a task where the use of machine learning may lead to negative outcomes on people's lives. We analyze the potential allocation harms that can result from semantic representation bias. To do so, we study the impact on occupation classification of including explicit gender indicators, such as first names and pronouns, in different semantic representations of online biographies. Additionally, we quantify the bias that remains when these indicators are "scrubbed," and describe proxy behavior that occurs in the absence of explicit gender indicators. As we demonstrate, differences in true positive rates between genders are correlated with existing gender imbalances in occupations, which may compound these imbalances.
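The true positive rate difference mentioned in the abstract can be illustrated with a minimal sketch. It assumes one plausible definition of the gap (per-occupation TPR for one gender minus TPR for the other); the function name `tpr_gap_by_occupation`, the `"F"`/`"M"` labels, and the toy records are hypothetical and are not taken from the paper or its data.

```python
from collections import defaultdict


def tpr_gap_by_occupation(records):
    """Return {occupation: TPR_F - TPR_M} (assumed gap definition).

    records: iterable of (gender, true_occupation, predicted_occupation),
    with gender given as "F" or "M" (hypothetical labels).
    Occupations lacking examples of either gender are skipped.
    """
    totals = defaultdict(int)  # (occupation, gender) -> number of true members
    hits = defaultdict(int)    # (occupation, gender) -> correctly classified members
    for gender, truth, predicted in records:
        totals[(truth, gender)] += 1
        if predicted == truth:
            hits[(truth, gender)] += 1

    gaps = {}
    for occupation in {occ for occ, _ in totals}:
        n_f = totals.get((occupation, "F"), 0)
        n_m = totals.get((occupation, "M"), 0)
        if n_f == 0 or n_m == 0:
            continue  # TPR is undefined for a gender with no examples
        tpr_f = hits.get((occupation, "F"), 0) / n_f
        tpr_m = hits.get((occupation, "M"), 0) / n_m
        gaps[occupation] = tpr_f - tpr_m
    return gaps


# Made-up toy data: (gender, true occupation, predicted occupation)
records = [
    ("F", "surgeon", "surgeon"),
    ("F", "surgeon", "nurse"),
    ("M", "surgeon", "surgeon"),
    ("M", "surgeon", "surgeon"),
    ("F", "nurse", "nurse"),
    ("F", "nurse", "nurse"),
    ("M", "nurse", "nurse"),
    ("M", "nurse", "surgeon"),
]
gaps = tpr_gap_by_occupation(records)
```

In this toy data the gap is negative for the occupation where women are misclassified more often and positive where men are. Relating such gaps to each occupation's gender proportion is the kind of correlation the abstract describes.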

representative citing papers

Estimating Grammatical Gender Directions in Contextual Embeddings under Controlled and Natural Contexts

cs.CL · 2026-06-29 · unverdicted · novelty 6.0

A framework estimates grammatical gender directions in contextual embeddings via controlled and natural contexts, finding unweighted controlled contexts and centroid estimators yield the purest directions.

AgentFairBench: Do LLM Agents Discriminate When They Act?

cs.AI · 2026-06-15 · unverdicted · novelty 6.0

AgentFairBench is a multi-domain benchmark for demographic disparity in LLM agent actions, with a pilot showing no significant effect for Claude Haiku 4.5 after arity-matched noise correction.

citing papers explorer

Showing 1 of 1 citing paper after filters.

AgentFairBench: Do LLM Agents Discriminate When They Act?

cs.AI · 2026-06-15 · unverdicted · none · ref 29 · internal anchor
