AI generates covertly racist decisions about people based on their dialect.Nature, 633:147–154

Valentin Hofmann, Pratyusha Ria Kalluri, Dan Jurafsky, Sharese King · 2024 · DOI 10.1038/s41586-024-07856-5

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

open at publisher browse 4 citing papers

representative citing papers

A framework for analyzing concept representations in neural models

cs.CL · 2026-05-02 · unverdicted · novelty 7.0

A new framework shows concept subspaces are not unique, estimator choice affects containment and disentanglement, LEACE works well but generalizes poorly, and HuBERT encodes phone info as contained and disentangled from speaker info while speaker info resists compact containment.

What Do People Actually Want From AI? Mapping Preference Plurality

cs.CL · 2026-06-04 · unverdicted · novelty 6.0

Open-ended preference data reveals substantial plurality in what people want from AI and divergent interpretations of shared values such as truthfulness.

Do Language Models Pass the Bechdel Test? Auditing Gender Biases in LLM-Generated Screenplays

cs.HC · 2026-06-23 · unverdicted · novelty 4.0

Human-written screenplays pass the Bechdel test more often than those generated by GPT-5, Gemini 3 Pro, and Claude Sonnet 4.5, though network analyses show mixed bias patterns across all script types.

Reducing Political Manipulation with Consistency Training

cs.CL · 2026-05-21

citing papers explorer

Showing 4 of 4 citing papers.

A framework for analyzing concept representations in neural models cs.CL · 2026-05-02 · unverdicted · none · ref 217
A new framework shows concept subspaces are not unique, estimator choice affects containment and disentanglement, LEACE works well but generalizes poorly, and HuBERT encodes phone info as contained and disentangled from speaker info while speaker info resists compact containment.
What Do People Actually Want From AI? Mapping Preference Plurality cs.CL · 2026-06-04 · unverdicted · none · ref 32
Open-ended preference data reveals substantial plurality in what people want from AI and divergent interpretations of shared values such as truthfulness.
Do Language Models Pass the Bechdel Test? Auditing Gender Biases in LLM-Generated Screenplays cs.HC · 2026-06-23 · unverdicted · none · ref 21
Human-written screenplays pass the Bechdel test more often than those generated by GPT-5, Gemini 3 Pro, and Claude Sonnet 4.5, though network analyses show mixed bias patterns across all script types.
Reducing Political Manipulation with Consistency Training cs.CL · 2026-05-21 · unreviewed · ref 15

AI generates covertly racist decisions about people based on their dialect.Nature, 633:147–154

fields

years

verdicts

representative citing papers

citing papers explorer