Best Audio to MIDI extractor, maybe AIAIAIAI?

What’s the currently best working audio to midi AI converter? I am trying that on a chiptune :sweat_smile: But melodyne fails…

This one looks fairly promising but I haven’t tried it. Uses the model from Spotify that is probably also running behind most “audio to midi” services, but it’s an open source and free vst plugin for all platforms.

oh thanks! I’ve just tried that, but it seems to detect one voice only, so you theoretically could convert a song to make it playable on a pc beep speaker thing :slight_smile:

I’m not sure that a good general-purpose polyphonic audio to MIDI converter even exists. It’s a hard problem. Other than Melodyne (and MIDI Guitar for guitar) I’m not even aware of a good special-purpose solution.

Edit: Have you tried the Samplab demo? I always kind of tune it out when I’m searching for this stuff due to my bias against companies that use “AI” as a selling point, but you know, I imagine current AI tech is a good fit for this task.

It transcribes as a single midi sequence but it does detect multiple voices and polyphony. I also gave it a quick test now and it did fairly well.

Well, had to manually edit a lot, that arp… :exploding_head:

I love this synth, so colorful yay

This so far seems to be the most accurate converter, single voice only:
https://midi.martinic.com/en/audio-to-midi

Tbh. I am kind of confused how the marketing claims about AI are vs. reality here… Still no algorithm seems to be able to extract polyphonic midi data accurately in 2025. Neither S1, Ableton, Cubase. And if the audio contains even a chiptune arp, all these tools will fail. Even the timing is wonky with all the tools.

Fender Studio Pro 8 / Presonus Studio 8 Pro now includes polyphonic midi note rendering… Will try that. They say it’s optimized for live instruments though, no chiptunz D:

EDIT: The AIAIAIAI in Fender Pro Studio Studio is not working nicely…. And can’t handle chiptunz!

Why use any of that?… Try picking it out with your own ear… That’s the best way… Back in the 70’s & 80’s that was the ONLY way & if you didn’t have an ‘ear’ to pick stuff up you were not invited to join the band…

Nowadays there’s so much FX & distortion that ‘fake out’ conversion scheme software, however using the ear one can bypass all that nonsense & get to the pitch & when you are done most times you will find it’s just a simple infantile lullaby or loop…

Years ago on KVR I would help out a beggar or two who pleaded to help with chords or melody of a tune… one guy wanted that tune ‘On Acid’ decompiled & was quite respectful so I even did a screencast of how to go about it going thru the entire tune & to prove my point that it’s just simple sounds with tons of FX added I used AXS tracker to do it with… I stopped doing this as several did not even bother to thank…

I suggest jamming to tunes to train the ear like oldskool, back in the day I met some musicians that were legendary picking out complex riffs after only hearing it once, Way better than me…

Problem nowadays is so many just buy FL or Live & think ‘I’m a musician now’ whilst back in the day just ‘buying a bass’ didn’t make you anything at all, took years of practice…

Did that in the past, but too lazy to fiddle exactly the fast arps in that original tune. Also a test for AI.

And again a proof that AI is dumb AF, can’t adapt to anything which wasn’t trained beforehand. IMO extracting notes from a chiptune arp is a very annoying, cumbersome work which should be done by the machines.

Presonus wrote that their extraction model is trained for real instruments like guitar… But isn’t an arp lead in a chiptune often very similar to a lead guitarist? Wow the level of inability of the AI is stunning.

OK, Well there’s always a way… For decades now there’s been all kinda players & soft that will slow down a tune whilst retaining pitch… Run thru a beatslicer which will do same thing with advantage of listening to individual slices until the mind goes numb… This must be a helluva ARP>>>

1 Like

Hehe, yes, it comes straight from hell:

It even has arp legato a bit later, nice. This is how far I came, I am very lazy these days (mix is an ugly failed Pro-C3/Pro-Q4 test):

I’ve rendered the single tracks of the chiptune with some UAE emu, and then used the above recommended AI thing. Still had to correct a lot. Original is way better. Well, it’s just some experiment.

1 Like

OK, I see… chiptune music, some of that can be hard to discern as the short note durations & the fast ‘fluttering’ which on the video most sounds like minor triads of the bass root using different inversions of the triad ‘moving about’ might be some different in the chorus…

Can’t play the ‘mini-player’ they don’t seem to work for me…

This reminds me of Klystrack which I was messing about with a year or so ago… Really like that chiptune tracker, I plan on making a jazz-rock chiptune in it before I die>>>

1 Like