Federated Learning for Keyword Spotting

Alice Coucke; David Leroy; Joseph Dureau; Thibault Gisselbrecht; Thibaut Lavril

arxiv: 1810.05512 · v4 · pith:NODZZRLAnew · submitted 2018-10-09 · eess.AS · cs.CL · cs.LG · cs.SD · stat.ML

classification eess.AS cs.CL cs.LG cs.SD stat.ML

keywords federated averaging learning wake word communication dataset adam

We propose a practical approach based on federated learning to solve out-of-domain issues with continuously running embedded speech-based models such as wake word detectors. We conduct an extensive empirical study of the federated averaging algorithm for the "Hey Snips" wake word, based on a crowdsourced dataset that mimics a federation of wake word users. We empirically demonstrate that using an adaptive averaging strategy inspired by Adam in place of standard weighted model averaging greatly reduces the number of communication rounds required to reach our target performance. The associated upstream communication costs per user are estimated at 8 MB, which is reasonable in the context of smart home voice assistants. Additionally, the dataset used for these experiments is being open sourced with the aim of fostering further transparent research in the application of federated learning to speech data.
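The difference between standard weighted model averaging and an Adam-inspired server update can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes the server treats the gap between the current global model and the weighted client average as a pseudo-gradient and applies the usual Adam moment estimates to it. The names `weighted_average` and `AdamServer`, and the hyperparameter values, are hypothetical and chosen only for illustration.

```python
import numpy as np


def weighted_average(client_weights, client_sizes):
    """Standard federated averaging: mean of client models,
    weighted by each client's number of local samples."""
    sizes = np.asarray(client_sizes, dtype=float)
    stacked = np.stack(client_weights)
    return np.tensordot(sizes / sizes.sum(), stacked, axes=1)


class AdamServer:
    """Adam-style global update (illustrative assumption, not the paper's code)."""

    def __init__(self, dim, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8):
        self.lr, self.beta1, self.beta2, self.eps = lr, beta1, beta2, eps
        self.m = np.zeros(dim)  # first-moment estimate
        self.v = np.zeros(dim)  # second-moment estimate
        self.t = 0              # number of communication rounds so far

    def step(self, global_weights, client_weights, client_sizes):
        # Pseudo-gradient: direction from the client average back to the global model.
        avg = weighted_average(client_weights, client_sizes)
        grad = global_weights - avg

        self.t += 1
        self.m = self.beta1 * self.m + (1 - self.beta1) * grad
        self.v = self.beta2 * self.v + (1 - self.beta2) * grad ** 2

        # Bias-corrected moments, as in Adam.
        m_hat = self.m / (1 - self.beta1 ** self.t)
        v_hat = self.v / (1 - self.beta2 ** self.t)
        return global_weights - self.lr * m_hat / (np.sqrt(v_hat) + self.eps)


if __name__ == "__main__":
    global_w = np.zeros(2)
    clients = [np.array([0.0, 0.0]), np.array([4.0, 4.0])]
    sizes = [1, 3]

    print("plain average:", weighted_average(clients, sizes))
    server = AdamServer(dim=2)
    print("adam-style update:", server.step(global_w, clients, sizes))
```

With plain averaging the global model jumps straight to the weighted client mean. With the Adam-style rule the step is rescaled per coordinate by the running moment estimates, so its size is governed by the server learning rate rather than by the raw magnitude of the client updates.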

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

What changes after deployment? A survey on On-device Learning in TinyML
cs.LG 2026-05 unverdicted novelty 6.0

A survey of on-device learning in TinyML organized by distribution change regimes, highlighting influences on applications, hardware, and solutions plus a gap between benchmarks and deployments.