AfriVox-v2: A Domain-Verticalized Benchmark for In-the-Wild African Speech Recognition

2026 · cs.CL · arXiv 2605.03590

1 Pith paper cites this work. Polarity classification is still indexing.



abstract

Recent large language models (LLMs) show strong speech recognition and translation capabilities for high-resource languages. However, African languages remain dramatically underrepresented in benchmarks, which limits the practical use of these models in low-resource settings. While early benchmarks tested African languages and accents, they lacked exhaustive real-world noise and granular domain evaluations. We present AfriVox-v2, a comprehensive benchmark designed to test speech models under realistic African deployment conditions. AfriVox-v2 introduces "in the wild" unscripted audio for all supported languages. We also introduce strict domain verticalization, evaluating model accuracy across ten sectors, including government, finance, health, and agriculture, and conducting targeted tests on numbers and named entities. Finally, we benchmark a new generation of speech models, including Sahara-v2, Gemini 3 Flash, and the Omnilingual CTC models. Our results expose the true generalization gap of modern speech models in specialized, noisy African contexts and provide a reliable blueprint for developers building localized voice AI.

representative citing papers

AfriVox-v2: A Domain-Verticalized Benchmark for In-the-Wild African Speech Recognition

cs.CL · 2026-05-05 · unverdicted · novelty 6.0

AfriVox-v2 is a benchmark that evaluates modern speech models on in-the-wild African audio with domain-specific tests for sectors including government, finance, health, and agriculture.

citing papers explorer

Showing 1 of 1 citing paper.

AfriVox-v2: A Domain-Verticalized Benchmark for In-the-Wild African Speech Recognition

cs.CL · 2026-05-05 · unverdicted · none · ref 2 · internal anchor

AfriVox-v2 is a benchmark that evaluates modern speech models on in-the-wild African audio with domain-specific tests for sectors including government, finance, health, and agriculture.
