RAS: a Reliability Oriented Metric for Automatic Speech Recognition

1 Pith paper cites this work. Polarity classification is still indexing.

abstract

Automatic speech recognition systems often produce confident yet incorrect transcriptions under noisy or ambiguous conditions, which can be misleading for both users and downstream applications. Standard evaluation based on Word Error Rate focuses solely on accuracy and fails to capture transcription reliability. We introduce an abstention-aware transcription framework that enables ASR models to explicitly abstain from uncertain segments. To evaluate reliability under abstention, we propose RAS, a reliability-oriented metric that balances transcription informativeness and error aversion, with its trade-off parameter calibrated by human preference. We then train an abstention-aware ASR model through supervised bootstrapping followed by reinforcement learning. Our experiments demonstrate substantial improvements in transcription reliability while maintaining competitive accuracy.
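
The abstract does not give the formula for RAS, so the following is only a toy sketch of how an abstention-aware score with a trade-off parameter might behave. It is not the paper's definition. The function name `toy_reliability_score`, the per-word labels, and the parameter `lam` are all hypothetical, chosen for illustration: a correct word earns credit, an abstained word earns nothing, and a wrong word is penalised by `lam`.

```python
from collections import Counter

VALID_LABELS = {"correct", "error", "abstain"}


def toy_reliability_score(outcomes, lam=1.0):
    """Hypothetical abstention-aware score (NOT the paper's RAS formula).

    outcomes: per-word labels, each one of "correct", "error", "abstain".
    lam: assumed trade-off weight; larger values punish errors more heavily.
    """
    if not outcomes:
        raise ValueError("outcomes must be non-empty")
    counts = Counter(outcomes)
    unknown = set(counts) - VALID_LABELS
    if unknown:
        raise ValueError(f"unknown labels: {sorted(unknown)}")
    # Correct words add credit, errors subtract lam each, abstentions add 0.
    return (counts["correct"] - lam * counts["error"]) / len(outcomes)


# Three made-up transcripts of the same four-word utterance.
confident = ["correct", "correct", "error", "error"]      # guesses everything
cautious = ["correct", "correct", "abstain", "abstain"]   # abstains when unsure
silent = ["abstain", "abstain", "abstain", "abstain"]     # abstains everywhere

for name, labels in [("confident", confident), ("cautious", cautious), ("silent", silent)]:
    print(name, toy_reliability_score(labels, lam=1.0))
```

Under these assumptions the cautious transcript scores highest: abstaining on the uncertain words avoids the error penalty, while abstaining everywhere earns no credit at all. Lowering `lam` shifts the balance back towards guessing, which is the kind of trade-off the abstract says is calibrated by human preference.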

fields

cs.SD 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

RAS: a Reliability Oriented Metric for Automatic Speech Recognition

cs.SD · 2026-04-27 · unverdicted · novelty 5.0 · 2 refs

RAS is a reliability-oriented metric for ASR that balances informativeness and error aversion via human-calibrated abstention, paired with a training method using supervised bootstrapping and reinforcement learning.

citing papers explorer

Showing 1 of 1 citing paper.

  • RAS: a Reliability Oriented Metric for Automatic Speech Recognition cs.SD · 2026-04-27 · unverdicted · none · ref 2 · 2 links · internal anchor