arxiv: 2106.09236 · v1 · pith:6DHRS2EVnew · submitted 2021-06-17 · 💻 cs.SD · eess.AS

Efficient Conformer with Prob-Sparse Attention Mechanism for End-to-End Speech Recognition

classification 💻 cs.SD eess.AS
keywords self-attention · conformer · attention · mechanism · prob-sparse · recognition · inference · memory

End-to-end models are favored in automatic speech recognition (ASR) because of their simplified system structure and superior performance. Among these models, Transformer and Conformer have achieved state-of-the-art recognition accuracy, with self-attention playing a vital role in capturing important global information. However, the time and memory complexity of self-attention grows quadratically with the length of the sentence. In this paper, a prob-sparse self-attention mechanism is introduced into Conformer to sparsify the self-attention computation, in order to accelerate inference and reduce memory consumption. Specifically, we adopt a Kullback-Leibler divergence based sparsity measurement for each query to decide whether to compute the attention function on that query. By using the prob-sparse attention mechanism, we achieve an 8% to 45% inference speed-up and a 15% to 45% reduction in memory usage for the self-attention module of Conformer Transducer while maintaining the same level of error rate.
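
The query-selection idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the paper's implementation. It assumes the sparsity measurement takes the log-sum-exp-minus-mean form commonly used for prob-sparse attention, and it assumes that queries which are not selected fall back to the mean of the values; the abstract specifies neither. The names `sparsity_scores`, `prob_sparse_attention`, and the parameter `u` (number of queries kept) are hypothetical. For readability the sketch builds the full score matrix to rank queries, so it shows the selection logic only and does not itself save time or memory; an efficient version would estimate the measurement from a subset of keys.

```python
import numpy as np


def sparsity_scores(Q, K):
    """Per-query sparsity measurement (assumed form): logsumexp(scores) - mean(scores).

    A query whose attention distribution is close to uniform scores near log(L_k);
    a query with a peaked distribution scores higher.
    """
    d = Q.shape[-1]
    S = Q @ K.T / np.sqrt(d)                      # scaled dot products, (L_q, L_k)
    m = S.max(axis=1, keepdims=True)              # row max for a stable logsumexp
    lse = m[:, 0] + np.log(np.exp(S - m).sum(axis=1))
    return lse - S.mean(axis=1), S


def prob_sparse_attention(Q, K, V, u):
    """Compute softmax attention only for the u highest-scoring queries.

    Assumption: the remaining queries receive the mean of V as their output.
    """
    M, S = sparsity_scores(Q, K)
    top = np.argsort(-M)[:u]                      # indices of the u "active" queries
    out = np.tile(V.mean(axis=0), (Q.shape[0], 1))
    W = np.exp(S[top] - S[top].max(axis=1, keepdims=True))
    W /= W.sum(axis=1, keepdims=True)             # softmax over keys
    out[top] = W @ V
    return out, top


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    Q = rng.normal(size=(6, 4))
    K = rng.normal(size=(5, 4))
    V = rng.normal(size=(5, 3))
    out, top = prob_sparse_attention(Q, K, V, u=2)
    print(out.shape, sorted(top.tolist()))
```

With `u` equal to the number of queries, the sketch reduces to ordinary full softmax attention, which makes it easy to compare the two on the same inputs.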


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Signed Dual Attention: Capturing Signed Dependencies in Time Series Forecasting

    cs.LG 2026-06 unverdicted novelty 3.0

    Signed Dual Attention is a parameter-free attention module that models signed dependencies in time series via dual message passing to achieve two-head expressiveness in one block.