pith. sign in

arxiv: 2602.23171 · v2 · pith:QYUGG377new · submitted 2026-02-26 · 📡 eess.AS

Align-Consistency: Improving Non-autoregressive and Semi-supervised ASR with Consistency Regularization

classification 📡 eess.AS
keywords align-consistencymodelnon-araccuracyconsistencydecodingnon-autoregressiverefinement
0
0 comments X
read the original abstract

Consistency regularization (CR) improves the robustness and accuracy of Connectionist Temporal Classification (CTC) by ensuring predictions remain stable across input perturbations. In this work, we propose Align-Consistency, an extension of CR designed for Align-Refine -- a non-autoregressive (non-AR) model that performs iterative refinement of frame-level hypotheses. This method leverages the speed of parallel inference while significantly boosting recognition performance. The effectiveness of Align-Consistency is demonstrated in two settings. First, in the fully supervised setting, our results indicate that applying CR to both the base CTC model and the subsequent refinement steps is critical, and the accuracy improvements from non-AR decoding and CR are mutually additive. Second, for semi-supervised ASR, we employ fast non-AR decoding to generate online pseudo-labels on unlabeled data, which are used to further refine the supervised model and lead to substantial gains.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.