Using Computers for Sophisticated Music Analysis
Tom Avril writes "Need an accompaniment for your melody? Seeking a virtual dancer to try out your new choreography? Or perhaps you're making a new TV commercial, and you need a snippet of music that sounds something like Radiohead, but a bit more mellow. Increasingly, sophisticated software can help with these sorts of tasks. We got a look at the latest from the nascent field of Music Information Retrieval, after its conference in Philadelphia: 'A key part of the conference each year is the announcement of results from a sort of software shoot-out — a competition in which various universities pit their music-analysis algorithms against one another. Entrants from more than a dozen countries competed in 18 tasks, using their computers to "listen" to selections of music, then identify such things as the genre, mood, composer or title. The eventual goal: to help people search for music they might like by combing through millions of audio files in a database. ... In another task, the computer had to identify tunes that someone hummed. "The idea is, you go into the karaoke bar and start humming, and the computer retrieves your song," Downie said.'"
Artist: Britney Spears
Song: Hit Me Baby
Rating: Shit
Conclusion: Humans are weak and stupid
Action Plan: Terminate Britney Spears
You start humming and the RIAA deducts the money from your account for your reproduction.
Or perhaps you're making a new TV commercial, and you need a snippet of music that sounds something like Radiohead, but a bit more mellow.
MORE mellow than 'Fake Plastic Trees?'
This actually may work, especially if you are selling some sort of sleeping aid or anti-anxiety medication.
He who knows best knows how little he knows. - Thomas Jefferson
On the one hand, I'd love it if my home stereo could determine what song I was humming and start playing along.
On the other hand, my family would kill me.
How can I believe you when you tell me what I don't want to hear?
While this technology is very neat, programs which convert sound(wav/mp3) into Midi data have existed for many years. The programs featured in the competition are the next logical step. It's simply data-mining applied to music.
Music is math, but math is not necessarily music. Much of the computerized music based on mathematics alone sounds like atonal shit. Mathematical algorithms can be great for accompaniment but are not meant to replace a human composer.
The tune recognition task is easier than it sounds (ha). In fact it's enough to hum the *contour* of the music, i.e. whether it simply goes up or down, for a couple of bars, ignoring the rythm even.
This way of indexing and recognising music is called the Parson Code and is quite effective.
Self-regulating karaoke. If the computer can't tell what the hell you're singing it's probably best for you to stay off the stage.
-Peter
Or perhaps you're making a new TV commercial, and you need a snippet of music that sounds something like Radiohead, but a bit more mellow.
You don't need software algorithms for that, just go download a Coldplay album. Except maybe replace "mellow" with "soulless."
The problem with this technology, and what Pandora or LastFm applied, is that the programs tend to choose always the same kind of music, and it's boring.
When I listen to music, I like to have some variety, not always playing the same thing, again and again, in different forms.
I like listening to one genre, and then switch to another genre, and the programs are unable to provide that.
Last.fm may be good, but here's the Pandora summary for why it played a particular song (James Taylor's Handy Man):
the song features pop rock qualities, folk influences, a subtle use of vocal harmony, use of string ensemble, major key tonality, a vocal-centric aesthetic, a good dose of acoustic guitar pickin' (sic) a dynamic male vocalist, electric pianos, acoustic rhythm guitars, romantic lyrics...
While a sophisticated computer may be able to detect some of these characteristics, I stand by my comment.
The Music Genome Project is definitely tracking things that (at this point) take a human to notice. Features like "great trumpet solo" or "ambiguous lyrics" are quite a bit beyond the sorts of musical features being extracted by the tools described in the papers at the conference, based on the few I looked at.
Humans are fantastic musical processors. Computers not so much. Which is what makes the problem so fascinating, I think.
Computers are useless. They can only give you answers. -- Pablo Picasso