BEARD adapts Whisper encoder for ATC domain via BEST-RQ and distillation on 5000h unlabeled speech then 2h labeled fine-tuning, delivering 12% relative WER gain over fine-tuned baseline.
ATCO2 corpus: A large-scale dataset for research on au- tomatic speech recognition and natural language understand- ing of air traffic control communications
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
verdicts
UNVERDICTED 2representative citing papers
SCOPE achieves 91.05% open-set detection accuracy and corrects 96.63% of anomalous ATC readbacks via frozen LLM with plug-in classifier and in-context learning on semi-synthetic data.
citing papers explorer
-
BEST-RQ-Based Self-Supervised Learning for Whisper Domain Adaptation
BEARD adapts Whisper encoder for ATC domain via BEST-RQ and distillation on 5000h unlabeled speech then 2h labeled fine-tuning, delivering 12% relative WER gain over fine-tuned baseline.
-
SCOPE: A Lightweight-training LLM Framework for Air Traffic Control Readback Monitoring
SCOPE achieves 91.05% open-set detection accuracy and corrects 96.63% of anomalous ATC readbacks via frozen LLM with plug-in classifier and in-context learning on semi-synthetic data.