Napster, Audio Fingerprinting, and the Future of P2P
mjmalone writes "Napster founder Sean Fanning is poised for a comeback, seems the now 22 year old Fanning has developed technology which creates "audio fingerprinting" of individual tracks and compares them against fingerprints in his firm's database to determine legality. A fee may be set and collected on a copyrighted track by its rightful owner. Fanning is actively recruiting industry support as well as pushing the idea to p2p services such as kazaa and grokster. " This isn't exactly new technology, but it's still interesting to see what Fanning is up to these days besides movie cameos.
You have to give him credit. At least someone out there is actually trying to make p2p legit, and not just throwing their weight around *cough* RIAA *cough*
if(md5sum("myfile.mp3") == md5sum("Limp_Bizkit-Crap.mp3")
cout "PIRATE!";
Will it be able to tell the diff between...
:)
Backstreet boys, N'Sync and other boy bands?
Creed, Nickleback and other "rock bands"?
50-cent and DMX?
I wonder if record companies will accept mistakes when differentiating between these artists
--
"I'm not bright. Big words confuse me. But Wanda loves me and that should be enough for you." - Cosmo
I recall that in its dying days Napster was talking about adding this to appease the recording industry. The variation then was from a company called Relatable. Sounds like Shawn is stuck in a recursive loop.
Umm no.
This is not going to make P2P "legit".
This is going to further destroy legit and non infringing usage of P2P. Now, RIAA will still say "p2p has no purpose other than piracy ban it"! And if people start paying for music from these services, guess what LEGITIMATE users of p2p suffer.
Sean Fanning did not invent P2P. Before napster we used to have IRC/DCC bots etc. and web search pages. Sean Fanning made downloading mp3's easier for the masses because of his windows client that automagically shared files you had downloaded. He's great but he's no God.
I remember seeing a book once that helped you identify songs by whether the sequence of notes at the beginning of the piece went up, down or stayed the same pitch when compared to the previous note. It was about the size of a telephone directory.
A quick Google finds out that its called The Parson's Code, with a lot more information here.
Presumably the fingerprinting scheme works in a similar fashion (over a larger portion of the song, and probably over multiple fragments of the song as well).
Ian.
A physicist is an atom's way of thinking about atoms
Such people don't "change sides" or comit "treason". They don't have any morales at all and work basically for any bloke who has money in his pockets. And Fanning thinks that this bloke is the music industry. I wonder, however, if they'll take him. Elephants are said to have good memory and to be unforgiving.
And for this P2P thing: does anyone here really think at the music industry will just lean back and watch their profits flush aways through DSL customer lines ?
Owner of a Mensa membership card.
Or P2P networks could NOT verify "legality," NOT pay Fanning anything, remain distributed enough to avoid any serious legal problems, shift the responsibility (rightfully) onto users, and music will remain -- as it was and ever shall be -- completely free.
Avast, me hearty! Arrrr!
before the "audio fingerprint" changes? Say, speed it up by 5%, filter out some of the bass and drum, and profit.
Karma: Food Fight (Mostly affected by Date Plate).
What if you introduced a little bit of static or something into the MP3? Not enough to be annoying, or maybe even really perceptible, but just enough to throw the "fingerprinting" off. I wonder if the technology is good enough to detect that. Also, if you were to record a song from vinyl, clean it up, and post it online, it might be different from the "official" version of the file. Maybe the technology might be able to detect the general pattern of the song, rather than exact sounds, but if not, Fanning's technology might not work out.
I produce electronic music and write little games. Have a look.
So you have a way of authenticating that a song is legitimately bought? An audit trail for each track? Wonderful. It's not going to be taken apart and cracked within a week is it? No-one's going to take our model and release a free implementation with much wider popular appeal are they? Are you sure? Great! We'll buy your company and give you generous stock options then.
...
Please excuse me now, my pet unicorn needs feeding
"It's not your information. It's information about you" - John Ford, Vice President, Equifax
He has a good business plan: create a big problem, then solve it.
I use the MusicBrainz tagger sometimes, and it works by comparing the audio signature of a song to the one in its database. This seems like the same sort of idea, but MusicBrainz tags files completly wrong a good percentage of the time, even listing the wrong artist - title info as a 100% match. I think this kind of technology has a ways to go before it could hold up in court or whatever.
"Sic Semper Tyrannosaurus Rex."
the cat's already out of the bag. The real issue here is the existence of the middleman in the music industry. Prices of CDs are artificially inflated by the middleman (the music outfits behind the RIAA), because they control most of the musical output in this country. Consumers want this music, and some continue to purchase at these inflated prices. But when you can get the same music, albeit illegal, from an alternate provider (KaZaA et al.), why bother paying those prices at all?
The solution is to bring the price of music back down to a reasonable level. If consumers are able to more directly compensate the artist for their music, and they can do so at a more granular level (i.e purchase tracks, vs CDs), and the easy of use is comparable to the p2p networks, then I bet you'll see a rebound in purchases. Granted, not all the people who use p2p will buy legit copies -- but I bet you'll see a significant rebound.
This country is long overdue for some overhauls on copyright / fair use law. The RIAA likens consumers who use p2p as criminals, but the RIAA backers have already been convicted for price fixing and routinely screw the artists they purport to represent out of cash. Criminals calling their target market criminals? Even if they're right, it's a matter of the pot calling the kettle black.
The days where the music industry could rob consumers without consequence is coming to an end. Exactly how it turns out is anybody's guess, but consumers are on to the RIAA's schemes and have a found a way to get their music without their shenanigans. Expect to see year-over-year sales to continue to fall until some of these leviathans go belly-up, and artists gain more control over production and licensing -- the way it should be.
The concept of audio "fingerprinting" is an interesting one, but likely outside of Fanning (and his local folks) experience or abilities. Fingerprinting has to rely on one of two things. The first is the artificialities of files -- things like file length, name, checksums, etc etc. All of these are easily overcome, and likely not robust to differing compression/bit rates/etc. The second thing it could reply on is data content -- that is, things like how many beats per minute, the time/frequency pattern in segment(s) X, Y, etc etc. I'll call this analysis of content. Unfortunately, simple analysis of content and watermarking schemes are very easily detected and overcome (remember the Felten/audio protection challenge?). TRUE analysis of content (when certain instruments play, their timing, the singing, etc) is a very difficult signal processing problem that won't be overcome without serious mathematics. And as much as I like Fanning, I don't think he's got the juice for it. Just my $0.02
MusicBrainz already has a free music fingerprint program. It identified about 60-70% of my songs correctly. It also will rename your files and update the ID tags.
The 30-40% it did not find... I could easily find by doing some searching manually through the program.
It was a nice way to completely identify my mp3 collection. Yes, it's a legal collection, but I wanted an easy way to rename the files and id tags.
Anyhoo... the program is pretty buggy so save often. Help the cause.
Enjoy.
DavaK
What good is this going to do?
I thought the whole filesharing problem comes from people wanting to download music for free instead of paying for it. IMHO, the problem is not that there is trouble IDENTIFYING copyrighted songs, it's that it's hard to get people to PAY for them.
Imagine this -- you have a network that identifies what you try to upload, and if it determines that the file is copyrighted, it charges you a fee. What do you do? Well, what did millions of people do when Napster tried to limit what you could share? Simple: They go to other, freer networks and leave the more limited one in the dust, awaiting bankruptcy.
The technology of audio fingerprinting can be very cool when used in other ways... like, perhaps, a more forgiving method (compared to checksums) to make sure a song is the real deal. Or, like some company is doing, enabling cell phone callers to call in with a song playing in the background and have a database identify the song name for them.
Unfortunately, I fail to see how this technology will discourage people from sharing files. We already KNOW which songs are copyrighted. It's just a matter of convincing people to pay for them. To accomplish that, I believe there needs to either be some way to discourage users from not paying (such as the RIAA's legal actions) or, preferably, some incentives to make people WANT to buy legal music (supporting your artists, ease of use/ease of finding the right song, download speed, song quality, etc., but all at a reasonable price).
Crippling filesharing networks to prevent sharing is a stupid, ineffective solution. As history has shown, people will simply use other networks. Unless the RIAA can completely crack down on and close ALL the networks, nothing will change. If there is just ONE free network, it's the one people will migrate to and use.
Audio fingerprinting is not something like a hash function that leads to a deterministic identifier. It is more like a web search engine that finds the best fuzzy match.
If you use audio fingerprint scores in the aggregate, for example to see what's popular, it works. If you depend on any one audio fingerprint matchup being accurate, especially accurate enough to use for legal notices, it doesn't make sense.
Music is a semantic object. Saying whether two pieces of music are the same thing depends on stuff that even humans have a hard time figuring out, like how much originality there is in a tribute band's cover.
Sean's business model seems a tad flawed. His new software has already been written, and an SDK is freely available here. Source code for both the Linux and Windows clients (which includes the fingerprinting code) is a click away under their downloads section. Redhat and Debian packages are there too, as well as Ruby and Perl bindings.. so fire up apt-get and go to it!
Perhaps we can take the time to look at the root causes of the whole P2P/ music industry / RIAA debacle. We all know the context. But what are the hidden assumptions? Can we reanalyse these? And can we find a new model for buying and selling digital media that does not pit the greedy tycoons against the valiant hackers? I think so...
1. The first assumption is that consumption is completely elastic. In other words, people will pay whatever the goods they want cost. (Assumption of the media industry.)
2. The second is that value is constant. In other words, digital theft by a million people is equal to physical theft of a million CDs.(Assumption of the media industry, who come up with bizarre figures as to the "loss" sustained thanks to illegal file sharing.)
3. The third is that digital content has no value. In other words, digital theft is not theft because bits and bytes have no value. (Assumption of the file swappers.)
All these assumptions are wrong.
First, consumption is almost completely inelastic. People will spend every disposable penny they have. If goods are cheaper, they will buy more of them. Raising the price of goods simply decreases demand.
Secondly, value is not just constant, it is almost always inversely proportional to rarity. In other words, the more of an item is available, the less it is worth.
Thirdly, of course digital content has value: that people go to great lengths to aquire it demonstrates this. However, its value is subject to the law of rarity.
What does this all mean?
Firstly, whether or not people illegally share music (and the same applies to movies), the value of media is going down inexorably thanks to the huge volume produced. And I'm not speaking of the cost of manufacture, but the perceived value, the price people are willing to pay. Diamonds cost practically nothing to produce, their value comes from their rarity.
Secondly, an industry faced with this value equation has several options. They can try to restrict supply and eliminate competition, which is what the music industry has done for about 20 years since the CD eliminated the production bottleneck. In a competitive market they will lower their prices so that consumers stay loyal. We have also seen this. Finally, they can ignore reality and die.
Thirdly, one of the ironies about digital distribution is that it eliminates the rarity variable. This means that any object distributed digitally will inevitably tend towards zero. I can download music from the Net but I value my own (irreplacable) CD collection much more.
I believe that even the 'pay as you go' model is doomed to failure. The only sustainable model is one in which prices are set by the market and production by the producers.
So, what I propose (or rather, predict, for this is almost inevitable) is a media market that works as follows:
1. The producer of a work creates a specific number of instances of the work. This can be as large or small as they want, but they cannot change the quantity afterwards.
2. The instances are individually serialized so as to be traceable to their owner. They can be copied freely.
3. These instances are now auctioned and can be resold in an open market.
This scheme can be applied to music, writing, photographs, almost any digital creation. Imagine a famous writer produces a short story. They issue a series of 1000. Now, you can buy one of these copies. It will be, forever, an original that is certified and unlosable. The price is set by auction, and the rights to these copies can be traded in an open market. What's the cost of a 2003 Madonna? Around $1.20, these days. And a 1998 Leftfield? Up to $30, if you can get them. In fact, you have paid not for a real thing, but for a slice of rarity. Sound strange? What about shares and options...?
There is only one requirement for such a market, and that is the market place. All the rest follows from the natural laws of supply and demand.
Ceci n'est pas une signature
But audio fingering printing is very much a reality, and nobody uses a check sum.
There are many good papers on this.. I particularly like the one on "AudioDNA" visit your local google. You see with Audio Fingering Printing we are actually able to take a song that has been rerecorded onto an analog tape, slightly time stretched and still be able to tell that its the original song. It doesn't rely on bytes, but instead qualities of the audio signal.
There are many ways to do this, but one solid method is to analysis the audio signal for acoustic events that are resistant to change. Make a listing of these events and store there locations in time as a chain. Even if you only have a small segment of the chain you can search for it with techniques similar to the one's they use in biology (nobody looks for a complete DNA chain). Its a little difficult to explain without knowing something about signal processing so I suggest just searching the web. Here are a few good topics:
Music Information Retrival - (MIR)
Audio Finger Printing
Audio DNA
CUIDADO
ISMIR
MPEG-7
Oh and try not to insult all the people who research this stuff by claiming some goof at Napster invented it.
Rob
Sure- I'll volunteer all my files to be tested by some random company, and they can tell me whether I owe them money or broke the law in some way.
Just contact me here: Hao_Wu@not-likely-to-happen.com, care of GET FUCKED.
I suggest you read Slashdot
Nothing is worse than somebody who is too stupid to realize that their 15 minutes are up.
On Wall Street they say "buy low, sell high" On the pad we say, "buy high, sell high" Isn't that somehow better?
About these 'fingerprints' - are they SIMILAR for similar pieces of music? Or are they only useful for identifying the one piece of music that each fingerprint is for?
If the 'fingerprints' are similar enough, you could ALSO use the technique to search for songs that you may have never heard but match the general style of music that you like. Sounds like something independent musicians could really benefit from ("Hey, I'd never heard of THESE guys before, but their music is exactly the style that I like....")
And if this is NOT the case, is anyone working on a "music style" analysis of some sort that could be stored in a 'searchable' fashion? (i.e. take your favorite song, run it through an analysis program to get it's 'fingerprint', then feed that 'fingerprint' to a search engine to get a listing of similar songs...)
Hacker Public Radio is our Friend
As a class project, a friend and I built a music recognition database. You can read our paper.
The general approach is fairly straightforward. You extract a set of "features" (typically several Mel Frequency Cepstral Coefficients, or MFCCs) from each sample of the song, say 10ms. You then pick several (say, 16) arbitrary points and iteratively generate that many "average" feature vectors, along with their weights so that they all sum to a one vector. This data is turned into a Hidden Markov Model (HMM). To see what audio you have, you run it through each of the possible HMMs and see which produces the greatest likelihood.
This method is typically applied to speaker recognition, where a linear search through HMMs is reasonable. This obviously isn't the case when you know about hundreds of thousands of songs, so a large part of the challenge is narrowing the field of HMMs to check (which is one of the focuses in our paper). Relatable, who were working with Napster a long time ago, have clusters that can classify 1,000 songs per second; I'm pretty sure they use this technique.
This technique has several important features. First, it doesn't depend on any properties of files themselves. Checksums would be trivial to beat, looking at a file's length could be circumvented by inserting silence, etc. Since this creates an average of sample data, a song would need to be changed quite a bit to fail to match. (The system is robust to, for instance, changes in bitrate, slowing the music down, and rearranging bits of the song or putting it in reverse.) We didn't have enough "derivative" music to test how it handles sampled music vs. the original -- it depends how much is changed.
Finally, this sort of system is useful for much more than song identification. You can build a model for an artist or genre and determine how to classify the song. One of my focuses in the paper is unsupervised genre classification -- my tests indicated some fairly reasonable groupings. This technique could be used for music recommendation -- "You like Dropkick Murphys? Well, they sound like Flogging Molly, so you might want to check them out."
Ceci n'est pas une signature.
Dear Slashdot Family, Typically, I refrain from ever posting on Slashdot, despite my regular visits, mainly because you guys are usually talking over my head, but in the case of the RIAA/P2P fight, you're right up my alley. Very recently, my company released an ongoing series of video shorts on Kazaa/Altnet featuring footage of Hawaiian Tropic models with original music soundtracks. Although we just started, prospects look promising. Ergo, everybody involved in this process, so far, is pleased. We're even considering including independent recording artists music tracks, and offering them a royalty on sales. If 50 examples like this surfaced, say, in the next 60 days, wouldn't this be the key argument against the RIAA? i.e., content providers selling entertainment voluntarily using P2P? How could they continue to claim a position of "artists interest" if independents used P2P in a lucrative way? In my view, the only issue from the RIAA's perspective is - even if they don't say it upfront - they just don't want to lower their prices. They don't want to adjust to this intense force in the free market. Where would the software industry be if it spent this much time and money "fighting piracy"? Just some thoughts, SoSoHot.com
Father O'Malley,
Your music file "Angels We Have Heard On High.mp3" matches "Smack My Bitch Up" by The Prodigy in our audio fingerprint database. Our lawyers will be in contact with you about this infringement.
Sincerely,
Recording Industry Association of America