Audio Set: An Ontology and Human-Labeled Dataset for Audio Events

Jort F · 2017

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Whisper-AuT: Domain-Adapted Audio Encoder for Efficient Audio-LLM Training

cs.SD · 2026-04-12 · unverdicted · novelty 4.0

Whisper-AuT is a domain-adapted audio encoder obtained by fine-tuning Whisper-large-v3 on mixed speech, environmental, and music data, yielding gains of +23% on ESC-50, +5% on GTZAN, and +0.7% on Speech Commands.

citing papers explorer

Showing 1 of 1 citing paper.

Whisper-AuT: Domain-Adapted Audio Encoder for Efficient Audio-LLM Training
cs.SD · 2026-04-12 · unverdicted · none · ref 6
