Super vague AI question for MIR 

Super vague AI question:

I have a pile of wav files and I would like some AI to come in and do an unsupervised analysis of them where it comes up with some ways in which they're distinct from each other and give them a score of 1-5 on how close they are to each pole of whatever it decides makes them distinct. So it decides on some pronounced x-factor and then scores them accordingly.

My questions are:
1. What's the terminology to describe this process?

2. Is there code lurking around (preferably in python or SuperCollider) that can do the machine learning/analysis/(see question 1)?

3. Are the results of it likely to be something that humans can perceive? (Or is there a way to encourage that?)

Super vague AI question for MIR 

@celesteh not unsupervised but sounds like a 'K-nearest neighbour' could help with the classification?

Follow

Super vague AI question for MIR, not an answer 

@rra @celesteh en.wikipedia.org/wiki/Self-org is a nice technique to get a low dimensional map from a high dimensional set of points. The main problem is that different random seeds can give different maps, so the (e.g.) 2D coordinates aren't meaningful in themselves: however (this is the point of it) nearby 2D points should have similar high-dimensional characteristics.

I used it in one project:
1. calculate energy per octave of each WAV section (over 10 octaves or so)
2. make a 2D SOM out of the 10D data
3. analyzing a long WAV, make a Markov chain out of the 2D SOM coordinates ("nearest SOM node" is a discrete space)
4. run the Markov chain probabalistically to generate sound

Web 1 0 0

Super vague AI question for MIR, not an answer 

@mathr @rra

That is a cool project!

What I want to do is use the original audio files with the scores. So I've got a playback engine that picks a starting point and a target and interpolates between that over time. On the way, it plays audio files with scores near it's current state. This is a way to organise what files should be played when and impose musical values on a relatively random collection of sounds.

Sign in to participate in the conversation
post.lurk.org

Welcome to post.lurk.org, an instance for discussions around cultural freedom, experimental, new media art, net and computational culture, and things like that.