Quantifying bias in automatic speech recognition

· 2021 · arXiv 2103.15122

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

read on arXiv browse 6 citing papers

citation-role summary

background 2

citation-polarity summary

background 1 support 1

representative citing papers

Toward Fair Speech Technologies: A Comprehensive Survey of Bias and Fairness in Speech AI

eess.AS · 2026-05-02 · accept · novelty 7.0

The paper delivers a unified framework for fairness in speech technologies by formalizing seven definitions, organizing research into three paradigms, diagnosing pipeline-specific biases, and mapping mitigations to those sources.

"This Wasn't Made for Me": Recentering User Experience and Emotional Impact in the Evaluation of ASR Bias

cs.CL · 2026-04-22 · unverdicted · novelty 7.0

ASR bias causes users from underrepresented dialects to internalize failures as personal inadequacy and perform extensive emotional and linguistic labor, revealing harms missed by accuracy-only evaluations.

Voice, Bias, and Coreference: An Interpretability Study of Gender in Speech Translation

cs.CL · 2025-11-26 · conditional · novelty 7.0

ST models override masculine ILM biases with acoustic input, using first-person pronouns to link terms to the speaker and accessing gender cues across the full frequency spectrum rather than pitch alone.

Speak Your Mind: The Speech Continuation Task as a Probe of Voice-Based Model Bias

eess.AS · 2025-09-26 · unverdicted · novelty 7.0

The authors perform the first systematic bias evaluation in speech continuation tasks across three models, revealing gender interactions in text metrics and stronger reversion to modal phonation for female prompts.

Few-Shot Accent Synthesis for ASR with LLM-Guided Phoneme Editing

cs.SD · 2026-04-30 · unverdicted · novelty 5.0

Few-shot TTS adaptation combined with LLM-guided phoneme editing produces synthetic accented speech that improves ASR word error rates on real accented audio even in cross-speaker and ultra-low-data settings.

Demographic and Linguistic Bias Evaluation in Omnimodal Language Models

cs.CV · 2026-04-11 · unverdicted · novelty 5.0

Omnimodal models show reduced demographic bias in image and video tasks compared to substantial biases and lower performance in audio tasks.

citing papers explorer

Showing 6 of 6 citing papers.

Toward Fair Speech Technologies: A Comprehensive Survey of Bias and Fairness in Speech AI eess.AS · 2026-05-02 · accept · none · ref 116
The paper delivers a unified framework for fairness in speech technologies by formalizing seven definitions, organizing research into three paradigms, diagnosing pipeline-specific biases, and mapping mitigations to those sources.
"This Wasn't Made for Me": Recentering User Experience and Emotional Impact in the Evaluation of ASR Bias cs.CL · 2026-04-22 · unverdicted · none · ref 13
ASR bias causes users from underrepresented dialects to internalize failures as personal inadequacy and perform extensive emotional and linguistic labor, revealing harms missed by accuracy-only evaluations.
Voice, Bias, and Coreference: An Interpretability Study of Gender in Speech Translation cs.CL · 2025-11-26 · conditional · none · ref 14
ST models override masculine ILM biases with acoustic input, using first-person pronouns to link terms to the speaker and accessing gender cues across the full frequency spectrum rather than pitch alone.
Speak Your Mind: The Speech Continuation Task as a Probe of Voice-Based Model Bias eess.AS · 2025-09-26 · unverdicted · none · ref 14
The authors perform the first systematic bias evaluation in speech continuation tasks across three models, revealing gender interactions in text metrics and stronger reversion to modal phonation for female prompts.
Few-Shot Accent Synthesis for ASR with LLM-Guided Phoneme Editing cs.SD · 2026-04-30 · unverdicted · none · ref 10
Few-shot TTS adaptation combined with LLM-guided phoneme editing produces synthetic accented speech that improves ASR word error rates on real accented audio even in cross-speaker and ultra-low-data settings.
Demographic and Linguistic Bias Evaluation in Omnimodal Language Models cs.CV · 2026-04-11 · unverdicted · none · ref 9
Omnimodal models show reduced demographic bias in image and video tasks compared to substantial biases and lower performance in audio tasks.

Quantifying bias in automatic speech recognition

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer