Towards Personalized Federated Learning for Dysarthric Speech Recognition

· 2026 · cs.SD · arXiv 2606.13253

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Speech recognition is challenging for dysarthric speakers. While federated learning (FL)-based ASR can be an effective tool for protecting privacy, it suffers from heterogeneity issues caused by speaker variability. Forcing all speakers to share the same model components can be suboptimal under such heterogeneity, making personalization a promising direction; however, related research on dysarthric speech remains limited. To this end, this paper explores two aggregation strategies to achieve personalization, including the parameter-based averaging strategy and the embedding-based averaging strategy. Experiments on UASpeech and TORGO show that the proposed methods outperform the baseline regularized FedAvg by statistically significant WER reductions of up to 0.99% absolute (3.15% relative) on UASpeech and 0.56% absolute (4.73% relative) on TORGO, respectively.

representative citing papers

Towards Personalized Federated Learning for Dysarthric Speech Recognition

cs.SD · 2026-06-11 · unverdicted · novelty 5.0

Parameter-based and embedding-based averaging in personalized FL for dysarthric ASR yields up to 0.99% absolute WER reduction on UASpeech and 0.56% on TORGO versus regularized FedAvg.

citing papers explorer

Showing 1 of 1 citing paper.

Towards Personalized Federated Learning for Dysarthric Speech Recognition cs.SD · 2026-06-11 · unverdicted · none · ref 1 · internal anchor
Parameter-based and embedding-based averaging in personalized FL for dysarthric ASR yields up to 0.99% absolute WER reduction on UASpeech and 0.56% on TORGO versus regularized FedAvg.

Towards Personalized Federated Learning for Dysarthric Speech Recognition

fields

years

verdicts

representative citing papers

citing papers explorer