pith. sign in

arxiv: 2603.05813 · v2 · pith:2YES2P6Fnew · submitted 2026-03-06 · 📡 eess.AS

Activation Steering for Accent Adaptation in Large Audio Language Models

classification 📡 eess.AS
keywords accentactivationadaptationdirectionsencoderinformationlayer-wiselayers
0
0 comments X
read the original abstract

Accent variability remains a major source of errors in automatic speech recognition, yet most adaptation methods rely on parameter fine-tuning without understanding where accent information is encoded. We treat accent variation as an interpretable subspace in hidden representations and investigate whether it can be identified and controlled directly in activation space. We extract layer-wise encoder activations and estimate mean-shift directions capturing accent-induced representation shifts. By injecting these directions into individual layers and measuring how they align accented and standard embeddings, we derive a layer-wise accent sensitivity profile, revealing that accent information concentrates in a narrow band of middle encoder layers. Leveraging this structure, we further introduce parameter-free accent steering that modifies representations during inference without updating model weights. Experiments across eight accents show consistent word error rate reductions.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.