Persistent anti-muslim bias in large language models

Abubakar Abid, Maheen Farooqi, James Zou · 2021 · arXiv 1702.346262

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

read on arXiv browse 5 citing papers

citation-role summary

background 3

citation-polarity summary

background 2 support 1

representative citing papers

Towards Measuring the Representation of Subjective Global Opinions in Language Models

cs.CL · 2023-06-28 · conditional · novelty 7.0

LLMs default to responses more similar to opinions from the USA and some European and South American countries; prompting for a country shifts alignment but can introduce stereotypes, while translation does not reliably match language speakers.

Fairness vs Performance: Characterizing the Pareto Frontier of Algorithmic Decision Systems

cs.LG · 2026-05-11 · unverdicted · novelty 6.0

The Pareto frontier of fair algorithmic decisions consists of deterministic group-specific threshold rules on predicted success probabilities, which can include upper bounds for some fairness metrics and holds independently of model training approach.

Do Language Models Pass the Bechdel Test? Auditing Gender Biases in LLM-Generated Screenplays

cs.HC · 2026-06-23 · unverdicted · novelty 4.0

Human-written screenplays pass the Bechdel test more often than those generated by GPT-5, Gemini 3 Pro, and Claude Sonnet 4.5, though network analyses show mixed bias patterns across all script types.

Designing for Collective Access: In Search of a Solution to Accessible Communication in a Mixed-Ability Non-Profit

cs.HC · 2026-05-11 · unverdicted · novelty 4.0

A six-month qualitative study of a mixed-ability nonprofit finds that conflicting access needs in communication act as a generative process revealing power structures and enabling accountability and repair rather than serving as technical problems to eliminate.

AI Safety Landscape for Large Language Models: Taxonomy, State-of-the-art, and Future Directions

cs.AI · 2024-08-23 · unverdicted · novelty 4.0

The paper introduces a taxonomy of AI safety for LLMs organized into Trustworthy AI, Responsible AI, and Safe AI perspectives, accompanied by a review of state-of-the-art methods, challenges, and future directions.

citing papers explorer

Showing 5 of 5 citing papers.

Towards Measuring the Representation of Subjective Global Opinions in Language Models cs.CL · 2023-06-28 · conditional · none · ref 1
LLMs default to responses more similar to opinions from the USA and some European and South American countries; prompting for a country shifts alignment but can introduce stereotypes, while translation does not reliably match language speakers.
Fairness vs Performance: Characterizing the Pareto Frontier of Algorithmic Decision Systems cs.LG · 2026-05-11 · unverdicted · none · ref 13
The Pareto frontier of fair algorithmic decisions consists of deterministic group-specific threshold rules on predicted success probabilities, which can include upper bounds for some fairness metrics and holds independently of model training approach.
Do Language Models Pass the Bechdel Test? Auditing Gender Biases in LLM-Generated Screenplays cs.HC · 2026-06-23 · unverdicted · none · ref 1
Human-written screenplays pass the Bechdel test more often than those generated by GPT-5, Gemini 3 Pro, and Claude Sonnet 4.5, though network analyses show mixed bias patterns across all script types.
Designing for Collective Access: In Search of a Solution to Accessible Communication in a Mixed-Ability Non-Profit cs.HC · 2026-05-11 · unverdicted · none · ref 37
A six-month qualitative study of a mixed-ability nonprofit finds that conflicting access needs in communication act as a generative process revealing power structures and enabling accountability and repair rather than serving as technical problems to eliminate.
AI Safety Landscape for Large Language Models: Taxonomy, State-of-the-art, and Future Directions cs.AI · 2024-08-23 · unverdicted · none · ref 4
The paper introduces a taxonomy of AI safety for LLMs organized into Trustworthy AI, Responsible AI, and Safe AI perspectives, accompanied by a review of state-of-the-art methods, challenges, and future directions.

Persistent anti-muslim bias in large language models

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer