Librispeech: An ASR corpus based on public domain audio books

· 2015

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Asymmetric Encoder-Decoder Based on Time-Frequency Correlation for Speech Separation

eess.AS · 2026-03-31 · unverdicted · novelty 6.0

SR-CorrNet introduces an asymmetric TF-domain architecture with separation-reconstruction strategy and correlation-to-filter estimation that yields consistent gains on WSJ0-Mix, WHAMR!, and LibriCSS under anechoic, noisy-reverberant, and real-recorded conditions.

Time vs. Layer: Locating Predictive Cues for Dysarthric Speech Descriptors in wav2vec 2.0

cs.SD · 2026-04-23 · unverdicted · novelty 5.0

Layer-wise aggregation from wav2vec 2.0 best predicts intelligibility in dysarthric speech, while time-wise aggregation is better for imprecise consonants, harsh voice, and monoloudness.

citing papers explorer

Showing 2 of 2 citing papers.

Asymmetric Encoder-Decoder Based on Time-Frequency Correlation for Speech Separation eess.AS · 2026-03-31 · unverdicted · none · ref 63
SR-CorrNet introduces an asymmetric TF-domain architecture with separation-reconstruction strategy and correlation-to-filter estimation that yields consistent gains on WSJ0-Mix, WHAMR!, and LibriCSS under anechoic, noisy-reverberant, and real-recorded conditions.
Time vs. Layer: Locating Predictive Cues for Dysarthric Speech Descriptors in wav2vec 2.0 cs.SD · 2026-04-23 · unverdicted · none · ref 24
Layer-wise aggregation from wav2vec 2.0 best predicts intelligibility in dysarthric speech, while time-wise aggregation is better for imprecise consonants, harsh voice, and monoloudness.

Librispeech: An ASR corpus based on public domain audio books

fields

years

verdicts

representative citing papers

citing papers explorer