Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ai 90% Match Research Paper Music technologists,AI researchers,Composers,Audio engineers,Creative technologists 2 weeks ago

Steering Autoregressive Music Generation with Recursive Feature Machines

generative-ai › autoregressive
📄 Abstract

Abstract: Controllable music generation remains a significant challenge, with existing methods often requiring model retraining or introducing audible artifacts. We introduce MusicRFM, a framework that adapts Recursive Feature Machines (RFMs) to enable fine-grained, interpretable control over frozen, pre-trained music models by directly steering their internal activations. RFMs analyze a model's internal gradients to produce interpretable "concept directions", or specific axes in the activation space that correspond to musical attributes like notes or chords. We first train lightweight RFM probes to discover these directions within MusicGen's hidden states; then, during inference, we inject them back into the model to guide the generation process in real-time without per-step optimization. We present advanced mechanisms for this control, including dynamic, time-varying schedules and methods for the simultaneous enforcement of multiple musical properties. Our method successfully navigates the trade-off between control and generation quality: we can increase the accuracy of generating a target musical note from 0.23 to 0.82, while text prompt adherence remains within approximately 0.02 of the unsteered baseline, demonstrating effective control with minimal impact on prompt fidelity. We release code to encourage further exploration on RFMs in the music domain.
Authors (5)
Daniel Zhao
Daniel Beaglehole
Taylor Berg-Kirkpatrick
Julian McAuley
Zachary Novack
Submitted
October 21, 2025
arXiv Category
cs.LG
arXiv PDF

Key Contributions

Introduces MusicRFM, a framework that uses Recursive Feature Machines (RFMs) to steer frozen, pre-trained music generation models (like MusicGen) for controllable output. It enables fine-grained control over musical attributes by analyzing internal gradients to find 'concept directions' in the activation space, allowing real-time manipulation without retraining.

Business Value

Empowers musicians and content creators with powerful tools for generating customized music, accelerating creative workflows and enabling new forms of musical expression.