Activation Steering for Accent Adaptation in Large Audio Language Models

Eun-Jung Holden; Gongping Huang; Jinuo Sun; Qiuchi Hu; Sung Kyun Chung; Ting Dang; Yang Xiao

arxiv: 2603.05813 · v2 · pith:2YES2P6Fnew · submitted 2026-03-06 · 📡 eess.AS

Activation Steering for Accent Adaptation in Large Audio Language Models

Jinuo Sun , Yang Xiao , Sung Kyun Chung , Qiuchi Hu , Gongping Huang , Eun-Jung Holden , Ting Dang This is my paper

classification 📡 eess.AS

keywords accentactivationadaptationdirectionsencoderinformationlayer-wiselayers

0 comments

read the original abstract

Accent variability remains a major source of errors in automatic speech recognition, yet most adaptation methods rely on parameter fine-tuning without understanding where accent information is encoded. We treat accent variation as an interpretable subspace in hidden representations and investigate whether it can be identified and controlled directly in activation space. We extract layer-wise encoder activations and estimate mean-shift directions capturing accent-induced representation shifts. By injecting these directions into individual layers and measuring how they align accented and standard embeddings, we derive a layer-wise accent sensitivity profile, revealing that accent information concentrates in a narrow band of middle encoder layers. Leveraging this structure, we further introduce parameter-free accent steering that modifies representations during inference without updating model weights. Experiments across eight accents show consistent word error rate reductions.

This paper has not been read by Pith yet.

Activation Steering for Accent Adaptation in Large Audio Language Models

discussion (0)