New neural LV2 plugin from Aida DSP, based extensively on the existing NeuralPi and reduced to the bone

Which preset / model are you using? What is the peak input level of the signal reaching the plugin? Thx

I used the MoonCrunch. Levels seem fine.

Unlikely, but you never know. Does this happen with any other plugin, or do you get anything resembling it if you connect the inputs straight to the outputs? Did you change something in the settings while SSHing in? Maybe the best thing would be a factory reset, although at this point in time that may be a bit hard :confused:

2 Likes

I’ll keep an eye out for whether it happens again and do my own private investigation :sweat_smile:

3 Likes

@madmaxwell Just got a chance to catch up on this thread, amazing work! And glad to see the community is getting excited about it. It’s unfortunate what’s happening to Mod-Devices, but this gives me some hope that it can continue in some form. Planning on checking out your code and training dataset soon! I’d be curious to get a recording of the Quad Cortex training input; I’ve only heard bits and pieces, but it seems to be some kind of generated frequency sweeps as opposed to actual guitar playing.

7 Likes

Are you on the Discord server for Mod? There’s a test wav file under the plugin category. It’s similar to what Kemper and Neural DSP use.

4 Likes

@micahvdm Very cool! I am not; how does one get access to that?

1 Like

Discord invitation

2 Likes

@rogeriocouto Thank you! I joined, and I think I see the file you’re referring to, @micahvdm: it’s 1:14 long and plays some tones followed by guitar sounds?

1 Like

Yeah, that’s the one. It’s a good starting point for capturing, since each guitar and each piece of hardware would ultimately affect the outcome of the capture, so using generated frequencies is a good way to overcome this. Does your neural net algorithm look at the reference vs the outcome and determine the clipping difference to get the profile?

3 Likes

@micahvdm In a way, but there’s nothing in the code specific to finding clipping characteristics. It’s more generic than that, I think. It looks at the difference between the input and output signals and optimizes the parameters (weights and biases) of the network to behave like the amp or pedal. The training process starts out by trying random weights and biases (literally just numerical values that are either multiplied (weights) or added (biases)) and checks the output audio signal to see if it got closer to the target. If it did get closer, it continues tweaking the numbers in that direction; if not, it tweaks them in the opposite direction, and keeps going until it gets as close as it can. The technical term for this kind of optimization is “gradient descent”.

The LSTM model which NeuralPi uses has a “memory”, which remembers the signal in the past to determine how it should behave in the future. Since the data is on the scale of 44.1 kHz or 48 kHz, this “memory” is happening on the millisecond scale. The exact same LSTM architecture can also be used to predict stock prices, weather, etc.; the data you train it on determines what it predicts. The numbers in the network (if you open up a json file and look at it) are just matrices of decimal values. They are abstracted representations of different features of the device you are modelling.
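If it helps, here’s a very rough, illustrative sketch of that training loop in PyTorch. This is not the actual NeuralPi training code; the network size, batch shapes, and signal names are all made up, it just shows the gradient-descent idea described above:

```python
import torch
import torch.nn as nn

# Tiny LSTM that maps the dry (DI) signal to the amp's output, sample by sample.
class AmpModel(nn.Module):
    def __init__(self, hidden_size=20):
        super().__init__()
        self.lstm = nn.LSTM(input_size=1, hidden_size=hidden_size, batch_first=True)
        self.lin = nn.Linear(hidden_size, 1)  # mix the hidden state down to one output sample

    def forward(self, x):                     # x: (batch, samples, 1)
        out, _ = self.lstm(x)
        return self.lin(out)                  # predicted amp output, same shape as x

model = AmpModel()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()

# In reality these would be windows of the training wav files (dry vs. recorded amp).
dry_signal = torch.randn(8, 2048, 1)
amp_signal = torch.randn(8, 2048, 1)

for step in range(100):
    pred = model(dry_signal)
    loss = loss_fn(pred, amp_signal)          # how far are we from the real amp?
    optimizer.zero_grad()
    loss.backward()                           # gradients tell us which way is "downhill"
    optimizer.step()                          # nudge the weights/biases in that direction
```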

I probably went into more detail than you wanted, I suppose the short answer to your question is “yes”.

8 Likes

Hi @keyth72
Loved the last part of your answer. :grin:

As a total noob on this subject, I wonder if you and the other guys here could clarify some terms and references that I found around the internet.

We are talking here about AI, deep learning, machine learning and such.
What is the difference between profiling and modeling an amp or a pedal?
What do “White Box” and “Black Box” mean, and how are these terms related to machine learning?
If the answers to these are too complex, a reference to some introductory reading may help me a lot.
Thanks

5 Likes

@rogeriocouto You are making my day! Sure thing, and anyone else feel free to correct me or add to my answers:

Profiling is a term that Kemper came up with for their process. I think it’s even trademarked when referencing guitar effects. I don’t think they use any kind of A.I. or machine learning. The best I can gather from random forums is that they have a handful of reference amp or distortion algorithms that are tweaked based on the information from their profiling process. They send a series of test tones through the device, but I believe that’s where the similarities between them and the Quad Cortex end. Modelling is just a generic term to say it’s a mathematical approximation of the real thing; there’s no specific tech associated with the term in this case. But everything I create, as well as the core tech in the Quad Cortex, is based on A.I.

Sidebar: there are a bunch of terms for A.I. that aren’t super clear. Machine Learning, Neural Networks, and Deep Learning are all slightly different flavors under the umbrella of “Artificial Intelligence”.

White box is modelling out the circuits and components: tubes, resistors, capacitors, etc. You know everything going on inside the box. It’s based on physics and the math used to represent the specific electronic components.

Neural nets are black box, where you know your inputs and what you want out, but don’t care how it gets there. The neural net abstracts information from the training data, and the math isn’t based on physics. Grey box is another term, where you mix the two: this would be if you used a neural net for just the tubes and non-linear components, and then used white box modelling for the capacitors/resistors and linear components.
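To make the white box / black box distinction concrete, here’s a made-up little example of the white-box side (mine, not taken from any actual plugin): the DSP coefficient of a simple RC tone circuit is derived straight from the schematic’s R and C values, whereas a black-box model like the LSTM sketched earlier would just learn the same behaviour from input/output recordings:

```python
import math

def rc_lowpass_coeff(r_ohms, c_farads, sample_rate=48000.0):
    # Physics first: the RC network's cutoff frequency, then the digital coefficient
    cutoff_hz = 1.0 / (2.0 * math.pi * r_ohms * c_farads)
    return 1.0 - math.exp(-2.0 * math.pi * cutoff_hz / sample_rate)

def process(samples, r_ohms=10e3, c_farads=22e-9):
    # One-pole smoothing whose behaviour is traceable straight back to the schematic
    a = rc_lowpass_coeff(r_ohms, c_farads)
    y, out = 0.0, []
    for x in samples:
        y += a * (x - y)
        out.append(y)
    return out
```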

Hope that helps!

I have a collection of research papers about audio and A.I. on github for anyone who really wants to go down the rabbit hole on this stuff: https://github.com/GuitarML/mldsp-papers

13 Likes

Sure it does!
Thanks for the clarification.

Loads of things to read now :grimacing:

4 Likes

@keyth72 thanks for the detailed explanation! Love it. The way I’m doing it is more along the lines of sending specific frequencies (aka that wav file) through the device that you wanna profile (or model), and then matching the EQ and storing the clipping characteristics. It seems to get pretty good results, but it’s not exactly where I want it yet.
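For anyone curious, a very rough sketch of what the EQ-matching step could look like (this is just an illustration of the idea, not my actual code): compare the magnitude spectrum of the reference test signal with what comes back from the device and keep the per-frequency gain difference:

```python
import numpy as np

def eq_match_curve(reference, captured, n_fft=4096, eps=1e-9):
    # Magnitude spectra of the test signal and of what came back from the device
    ref_mag = np.abs(np.fft.rfft(reference, n_fft))
    cap_mag = np.abs(np.fft.rfft(captured, n_fft))
    # Gain in dB the device applied at each frequency bin
    return 20.0 * np.log10((cap_mag + eps) / (ref_mag + eps))
```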

5 Likes

… And maybe also the cab on its own, for use with other amp sims? Although cab sims abound, it would make for a complete package.

3 Likes

Cabs in general are well handled by IR loaders. This is because we’re usually interested in the linear properties of the cab + mic system, and the impulse response captures them. In other words, the impulse stores all the linear information about the equalization that happens.
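As a rough illustration (file names, mono assumption and normalisation are just placeholders, not part of any plugin): convolving the dry amp output with a recorded cab IR is all it takes to reproduce that linear behaviour:

```python
import numpy as np
import soundfile as sf
from scipy.signal import fftconvolve

# Assuming mono wav files for simplicity
dry, sr = sf.read("amp_sim_output.wav")        # dry amp-sim output (placeholder name)
ir, sr_ir = sf.read("cab_412_sm57.wav")        # recorded cab impulse response (placeholder)
assert sr == sr_ir, "resample the IR to the signal's sample rate first"

wet = fftconvolve(dry, ir)[: len(dry)]         # linear convolution, trimmed to the dry length
wet = wet / max(1e-9, np.max(np.abs(wet)))     # crude normalisation to avoid clipping
sf.write("amp_plus_cab.wav", wet, sr)
```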

Let me explain a few things. This plugin uses deep learning to teach a very small network, one that can run in real time, how to process audio so that it sounds as close as possible to the real thing, whether a guitar amplifier or a distortion effect (stompbox). Generically speaking, this is similar to what Neural DSP is doing with their own products.

This is very different from the profiling technique; we have very few, if not only a single, example of an open source audio plugin that does profiling: https://github.com/olegkapitonov/Kapitonov-Plugins-Pack/blob/master/LV2/kpp_tubeamp/kpp_tubeamp.dsp. The key point here is that there is a generic model or algorithm representing an idealized guitar amplifier; some signals are sent through the real device to capture its “main” characteristics, which are then applied back to that fixed model. Still, some fine tuning of the resulting model needs to be done by ear against the original. In other words, some skill is required to profile. In addition, since the fixed model is a guitar amp and not a distortion device, the algorithm is oversized for emulating just a fuzz, and may eventually fail at doing that. Also note that certain types of guitar amplifiers may not be perfectly represented by the structure defined in the plugin. For all these reasons, which are also reported in the academic literature on the topic (as you can imagine, we don’t have massive publications on that), neural networks are the “next thing” for amp modelling, superior to anything we have heard so far. When I model a guitar amplifier, I know exactly what the resulting ESR (error-to-signal ratio) is, i.e. the difference between the original and the signal predicted by the network. This is oversimplified, since this stuff is PhD grade, guys.
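For the curious, the ESR metric I mentioned is basically the energy of the error normalised by the energy of the target signal; a tiny sketch (illustrative only, variable names made up):

```python
import numpy as np

def esr(target, prediction, eps=1e-10):
    # Energy of the error between the real amp and the model, relative to the
    # energy of the real signal; lower is better (0 would be a perfect match).
    target = np.asarray(target, dtype=np.float64)
    prediction = np.asarray(prediction, dtype=np.float64)
    return np.sum((target - prediction) ** 2) / (np.sum(target ** 2) + eps)

# e.g. esr(recorded_amp, model_output) -> 0.02 would mean ~2% residual error energy
```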

So what’s going on with my plugin? I really want to distribute it for Dwarf users, in particular given the current situation. But unfortunately, cross compiling my plugin with the Dwarf’s current toolchain (GCC 7.5) results in a plugin that produces a lot of noise. I don’t have a Dwarf, so @redcloud, @spunktsch and @dreamer are helping me; this needs to be done by trial and error. On my Mod derivative project (Aida DSP OS) I use a Yocto-generated SDK based on the Dunfell branch and GCC 9.3.0. I’ve uploaded a new demo here:

I’ve already moved away from what is public in terms of source code for training and other plugins. For example, I have implemented a new pre-emphasis filter for the loss function that uses A-weighting. From A/B comparisons I can say there is a huge improvement in the model sound; I still need to implement a frequency-dependent ESR measurement in order to quantify that “huge”. I’ll be pushing the new plugin source code and models in the following days, and I would like some feedback.
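The general idea of a pre-emphasised loss, very roughly sketched below. Note this uses a plain first-order pre-emphasis for brevity, not the actual A-weighting filter, and none of it is my real training code:

```python
import torch

def pre_emphasis(x, coeff=0.95):
    # y[n] = x[n] - coeff * x[n-1], a simple first-order high-pass,
    # applied along the time axis of a (batch, samples, 1) tensor
    return x[:, 1:, :] - coeff * x[:, :-1, :]

def pre_emphasised_esr(target, prediction, eps=1e-10):
    # Filter both signals before comparing, so the error is weighted
    # toward the emphasised frequencies rather than raw sample error
    t, p = pre_emphasis(target), pre_emphasis(prediction)
    return torch.sum((t - p) ** 2) / (torch.sum(t ** 2) + eps)
```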

:heart:

@keyth72 (amazing to have you here sir)

12 Likes

Impressive!

5 Likes

I do not have a Dwarf either :#

1 Like

Agreed, very impressive, and the demo sounds very good!
I have a Dwarf and would love to try this plugin out, but I’m afraid of bricking it if I do something wrong. Compiling, etc., sounds complicated, and like it might also be complex to revert if I did botch things up.
I guess I need to start getting into the deep water of experimentation, especially if the parent company is gone now.
It would be nice if there were a way to just sideload stuff, like on Android.

3 Likes