Slashdot Mirror


RIAA Tracking Songs by MD5 Hashes

aSiTiC writes "Apparently RIAA has obtained some technical experts in their prosecution of file swappers. Currently they are tracking traded mp3 files from the Napster network by matching MD5 hashes. This seems quite interesting but I was under the assumption that identical hashes could be created with identical rips and id3v2 tagging. Now may be the time to update your illegal mp3 file MD5 hash sums."

15 of 779 comments (clear)

  1. What if... by moehoward · · Score: 4, Interesting

    What if I own the CD but got files off the Internet because I was too lazy to rip them? Would I still be expecting to be sent to the prison camp?

    In other news, all songs produced by RIAA artists in the last 10 years all have the same MD5 hash anyway, because they're all the same.

    --
    "If you want to improve, be content to be thought foolish and stupid." - Epictetus
  2. What happen if by Anonymous Coward · · Score: 5, Interesting

    you just normalize or edit the begining or the end of the song? Does the MD5 Hashes still works?

  3. but will they target aol/tw? by Comsn · · Score: 4, Interesting
    The RIAA has said it expects to file at least several hundred lawsuits seeking financial damages as early as next month. U.S. copyright laws allow for damages of $750 to $150,000 for each song offered illegally on a person's computer, but the RIAA has said it would be open to settlement proposals from defendants.


    will they start sending subpeonas to aol/tw customers this time?
  4. hashes are kinda pointless by truffle · · Score: 4, Interesting


    It is generally believed amongst file traders that it is legal to download an mp3 for a song, when you own the CD. In other words, you don't need to rip and encode songs from your own CD. However, this may not be true (I am not a lawyer).

    The RIAA is using MD5 hashes as a basis for proof that the individual in question downloaded the files they are sharing, instead of ripping them from their own CD collection. This is supposed to show the individual is a willing participant in stealing and distributing music, instead of someone who is just sharing what they already own. But, see above.

    I think this is mostly just a FUD tactic. They can talk to the media about how their MD5 hashes prove so-and-so is a big mean pirate hacker. MD5 hash certainly sounds scary, especially when the technology is described by the media as a tool used by hackers.

    --

    ---
    I support spreading santorum
  5. Re:MD5 Hash by kzinti · · Score: 4, Interesting

    The only way for two files to have the same MD5 hash is for them to both be encoded with the same encoder, from the same WAV file, with the same bitrate and all advanced options, and to have exactly the same ID3 information, the same filesize, and to be identical to the last bit.

    If two people used the same ripping software set to all its default settings (as many unsophisticated users do), got a perfect rip off the CD, and relied on CDDB information for tagging the song, then it's possible that they got mp3s identical down to the last bit, and thus identical MD5 hashes. BUT to make this a plausible defense, you'd have to show that your rip was in fact perfect. In other words you'd have to be able to recreate the mp3 independently. If the old Napster mp3 had any ripping errors, then it would be hard to claim that the later rip just happened to have the same errors - assuming errors are essentially random.

  6. Where does it say MD5? by eddy · · Score: 5, Interesting

    Are we sure they're actually using MD5? The article doesn't even contain the string "md5" that I can see. It mentions hashes though, but there's something called Robust Hashing which can be used to identify, or at least, compare content in a "fuzzy" way.

    --
    Belief is the currency of delusion.
  7. How About An MP3 Outguess? by thecampbeln · · Score: 4, Interesting

    Lets see someone put together an app that flips bits here and there within MP3s to make each one it runs against unique enough to create a new MD5 hash!? (I would, but I can only program in a pseudo-language ;) It could even be as simple as adding in a trailing byte to all of your MP3s, though that could be easily filtered. Hell, if you can hide messages within compressed JPEGs without noticeably affecting their quality, why not do something similar to MP3s just to jack up this sort of tracking!?

    --
    "1984" was ment to be a warning, not a guidebook. You hear that Kim Jong-il!? BushCo?!
  8. Re:Now what? by utlemming · · Score: 5, Interesting

    No, we need to create a honeypot farm. You remember that article way back when on Slashdot? It described how to implenent a whole farm. Then we strictly prohibit scanning of the networks for MD5 checksums. Since RIAA is using bots, they won't read the warning and fire off the subeona. When you get a subeona, then you slam them with a computer crime lawsuit. See, you can still get rich from RIAA. But how do you get illegal MD5 check sums with out possesing the files? If you wanna screw with RIAA you have to be damned sure that you right.

    --
    The views expressed are mine own and do not express the views of my employer.
  9. Re:gee? by laird · · Score: 4, Interesting

    It's true that two different people could generate RIP's of the same track with the same MD5 hash, but the odds are low: they'd have to use exactly the same encoding settings, and enter exactly the same ID3 tags with exactly the same values. (Counterpoints: they could be default settings, and CDDB/Gracenote metadata, which would improve the odds a bit) And since we're talking about large music collections, the exact matching would have to have to happen across hundreds of tracks. And if the ID3 tags had notes like "ripped by so-and-so" that'd kinda blow the case. So while it's certainly true that MD5 hashes don't completely uniquely identify a particular RIP of a track, I think that when compared for large numbers of files, it'd be a pretty good indicator of file copying.

  10. What is illegal here? by emptybody · · Score: 4, Interesting

    If I use KaZaa to access indie artists who are
    sharing their songs - as is their right - AND I
    also rip my entire 1000+ CD/LP/8track collection
    to the same computer AND I intellegently store
    all the files in the same heirarchy.

    Have any laws been broken?

    KaZaa is configured to share everything in my
    heirarchy so that the indie songs can continue to
    be shared.

    Have any laws been broken?

    I go in for Jury Duty, meanwhile Another Kazaa
    user downloads the indie shared files.

    Have any laws been broken?

    Another Kazaa user downloads the rips from my
    personal collection because their 8track player
    is on the fritz.

    Have any laws been broken?

    Another Kazaa user downloads the rips from my
    personal collection because their LPs were
    destroyed in a flood.

    Have any laws been broken?

    Another Kazaa user downloads the rips from my
    collection because they want to see what the
    latest Madonna single sounds like before going
    out and buying the CD.

    Have any laws been broken?

    If any laws were broken here - who broke them?

    Just because I leave the front door open does not
    mean that anyone can enter and take what they
    want from my house. Same as my computer.
    The action of downloading is at question not
    making the article available.

    YMMV. Consult a lawyer.

    --
    comment directly in my journal
  11. Re:gee? by nearlygod · · Score: 5, Interesting

    About this interpretation of Fair Use: I agree that downloading mp3's of CDs that you have purchased should be fair use. I am in a similar situation. A couple of years ago I lost 90% of my CD collection in an apartment fire. I had about 20 of these CDs ripped at the time and since then, I have downloaded many of the others to replace what I had paid for. In some cases, I re-purchased the CD because I wanted to have an original for some of my favorite artists but I didn't mind the mp3 mastered replacements for many of the CDs. Would this fall under Fair Use? I would think that it does since the RIAA seems to think that we are only purchasing a license to listen to the music. However, if I had to present the original CDs to a judge to prove that I do/did own the physical CD, I would be SOL.

    --
    The Tools Of Ignorance wanna be a tool?
  12. Re:gee? by arth1 · · Score: 5, Interesting
    This wouldn't, though, be a defense for the central problem that she made all of these MP3s available for download by millions of anonymous strangers without the consent of the copyright holders.

    Unless she had an OC-48 or two going into her home, she didn't make the files available for download by *millions* of strangers. When the resource is limited, the magnitude of the crime is likewise limited. If you offer a stolen watch on the streets of New York, you can't be charged with trying to sell it to MILLIONS of people, cause there's only one watch. Likewise, in this case there's only enough bandwidth for a certain number of potential downloads, and speaking of millions here is plain misleading.
    If the people who downloaded files from her spread them further, that's THEIR crime and not hers, much as the guy who sold a stolen watch won't be found guilty for the watch buyer illegaly selling it to someone else.

    And in this case, it's even less severe, as it's not a theft, but a copyright violation.

    Regards,
    --
    *Art
  13. RIAA Taxes by brj · · Score: 5, Interesting

    Don't we already pay a small tax to the recording industry every time we buy blank audio CDs (but not data CDs)? I'd like to see some lawyer fight a case claiming that a P2P user has already paid the RIAA and is therefore exempt from their lawsuits when downloading the music and burning it to an audio CD. That would be an interesting lawsuit.

  14. Re:gee? by AJWM · · Score: 4, Interesting
    I have some doubts about the ripping process being as exact as you say.

    So did I, so I just ran the experiment:

    al% cdparanoia -d /dev/hdd 1 pf.wav
    cdparanoia III release 9.8 (March 23, 2001)
    (C) 2001 Monty <monty@xiph.org> and Xiphophorus

    Report bugs to paranoia@xiph.org
    http://www.xiph.org/paranoia/

    Ripping from sector 0 (track 1 [0:00.00])
    to sector 17511 (track 1 [3:53.36])

    outputting to pf.wav
    (--stuff omitted due to lameness filter--)

    al% cdparanoia -d /dev/hdd 1 pf2.wav
    cdparanoia III release 9.8 (March 23, 2001)
    (C) 2001 Monty <monty@xiph.org> and Xiphophorus

    Report bugs to paranoia@xiph.org
    http://www.xiph.org/paranoia/

    Ripping from sector 0 (track 1 [0:00.00])
    to sector 17511 (track 1 [3:53.36])

    outputting to pf2.wav

    (-- stuff omitted due to lameness filter--)

    al% md5sum pf*wav
    fd8ddaf41fd482a6aa1a492915a3e788 pf.wav
    fd8ddaf41fd482a6aa1a492915a3e788 pf2.wav
    al%
    Looks like under identical conditions (same drive) it'll rip consistently. Ripping off a different drive might give different results, that's more hassle than I want to try right now. If anyone wants to compare, the disc/track I ripped is Pink Floyd's Dark Side of the Moon, Capitol's catalog # CDP 7 46001 2, DIDX 226. (Different recordings will almost certainly give different results.)

    Oh, and to make RIAA happy:

    al% rm pf*wav
    al%
    ;-)
    --
    -- Alastair
  15. Re:gee? by Zigg · · Score: 5, Interesting

    Different drives, with the same disc, and identical software, certainly do give different results. Just tested. Identical versions of cdparanoia live on both systems.

    I also ran lame with default settings (makes a 128K CBR) on both WAVs and got different sums there as well.

    No tags involved.