Genre Controlled Music Generation via Activation Steering
read the original abstract
Computational Music Generation is evolving towards non-conventional styles, demanding methods that enable precise and controllable blending of diverse music elements. In this work, we present a method for fine grained control using inference-time interventions on an autoregressive generative transformer, MusicGen. Through our approach, we achieve genre control by steering the residual stream using weights of a linear probe on it. By framing activation steering as a human-controllable interaction, our work highlights how interpretable model behaviors can empower in co-creative music generation.Audio samples demonstrating our method are available on our demo page.
This paper has not been read by Pith yet.
Forward citations
Cited by 2 Pith papers
-
Closing the Loop: PID Feedback Control for Interpretable Activation Steering in Symbolic Music Generation
Introduces Dual Steering via Gram-Schmidt orthogonalization to reduce interference in multi-attribute activation steering for the Multitrack Music Transformer.
-
Latent Space Disentanglement via Activation Steering for Interpretable Attribute Control in Symbolic Music Generation
Activation steering with Gram-Schmidt orthogonalization enables disentangled, deterministic control of pitch and duration attributes in the Multitrack Music Transformer without retraining.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.