Differentiable Black-box and Gray-box Modeling of Nonlinear Audio Effects

Comunità M., Steinmetz C.J., Reiss J.D.

Abstract

Audio effects are extensively used at every stage of audio and music content creation. The majority of differentiable audio effects modeling approaches fall into the black-box or gray-box paradigms, and most models have been proposed for and applied to nonlinear effects such as guitar amplifiers, overdrive, distortion, fuzz and compressors. Although a plethora of architectures have been introduced for this task, there is still a lack of understanding of the state of the art, since most publications experiment with one type of nonlinear audio effect and a very small number of devices. In this work we aim to shed light on the audio effects modeling landscape by comparing black-box and gray-box architectures on a large number of nonlinear audio effects, identifying the most suitable for a wide range of devices. In the process, we also: introduce time-varying gray-box models and propose models for compressor, distortion and fuzz; publish a large dataset for audio effects research, ToneTwist AFx, that is also the first open to community contributions; evaluate models on a variety of metrics; and conduct an extensive subjective evaluation. Code and supplementary material are also available.

Resources

Paper: https://arxiv.org/abs/2502.14405
Code: https://github.com/mcomunita/nablafx
Data: https://github.com/mcomunita/tonetwist-afx-dataset
Webpage: https://github.com/mcomunita/nnlinafx-supp-material

Citation

Comunità M., Steinmetz C.J., Reiss J.D. "Differentiable Black-box and Gray-box Modeling of Nonlinear Audio Effects", arXiv preprint arXiv:2502.14405.