The Interspeech 2020 deep noise suppression challenge: Datasets, subjective testing framework, and challenge results. arXiv preprint arXiv:2005.13981

2020 · arXiv 2005.13981

4 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

SelectTSL: Prompt-Guided Selective Target Sound Localization in Complex Scenarios

cs.SD · 2026-07-02 · unverdicted · novelty 6.0

SelectTSL is an end-to-end model using a Prompt-Guided Selective Attention Module and IPD enhancer to localize only prompt-specified target sounds and estimate their count and direction in complex acoustic scenes.

Position-Aware Target Speaker Extraction for Long-Form Multi-Party Conversations: A Diarization-Free Framework for ASR

cs.SD · 2026-06-28 · unverdicted · novelty 6.0

PATSE is a DOA-guided target speaker extraction system that produces speaker-attributed streams for diarization-free ASR in multi-party conversations.

SenSE: Semantic-Aware High-Fidelity Universal Speech Enhancement

eess.AS · 2025-09-29 · unverdicted · novelty 6.0

SenSE adds language-model semantic guidance to flow-matching generative speech enhancement via a dual-path masked conditioning strategy and reports SOTA results on distorted speech.

Fast-ULCNet: A fast and ultra low complexity network for single-channel speech enhancement

eess.AS · 2026-01-21 · unverdicted · novelty 4.0

Fast-ULCNet matches original ULCNet speech enhancement quality while cutting model size by more than half and latency by 34% via FastGRNN replacement and a state-drift filter.

citing papers explorer

Showing 3 of 3 citing papers after filters.

SelectTSL: Prompt-Guided Selective Target Sound Localization in Complex Scenarios
cs.SD · 2026-07-02 · unverdicted · none · ref 65
Position-Aware Target Speaker Extraction for Long-Form Multi-Party Conversations: A Diarization-Free Framework for ASR
cs.SD · 2026-06-28 · unverdicted · none · ref 38
Fast-ULCNet: A fast and ultra low complexity network for single-channel speech enhancement
eess.AS · 2026-01-21 · unverdicted · none · ref 15
