WHAM!: Extending speech separation to noisy environments

Gordon Wichern, Joe Antognini, Michael Flynn, Licheng Richard Zhu, Emmett McQuinn, Dwight Crow · 2019 · DOI 10.21437/interspeech.2019-2821

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

CleanCodec: Efficient and Robust Speech Tokenization via Perceptually Guided Encoding

cs.SD · 2026-06-03 · unverdicted · novelty 6.0

CleanCodec reframes audio tokenization as a selective information bottleneck to encode only perceptually important features at 12.5 tokens per second, outperforming prior codecs in efficiency, speaker similarity, and intelligibility.

Asymmetric Encoder-Decoder Based on Time-Frequency Correlation for Speech Separation

eess.AS · 2026-03-31 · unverdicted · novelty 6.0

SR-CorrNet introduces an asymmetric TF-domain architecture with separation-reconstruction strategy and correlation-to-filter estimation that yields consistent gains on WSJ0-Mix, WHAMR!, and LibriCSS under anechoic, noisy-reverberant, and real-recorded conditions.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Asymmetric Encoder-Decoder Based on Time-Frequency Correlation for Speech Separation eess.AS · 2026-03-31 · unverdicted · none · ref 60
SR-CorrNet introduces an asymmetric TF-domain architecture with separation-reconstruction strategy and correlation-to-filter estimation that yields consistent gains on WSJ0-Mix, WHAMR!, and LibriCSS under anechoic, noisy-reverberant, and real-recorded conditions.

WHAM!: Extending speech separation to noisy environments

fields

years

verdicts

representative citing papers

citing papers explorer