Slashdot Mirror


Reputation System Fights P2P Junk

yeejiun writes "Many of the files that are shared on p2p networks tend to be junk. Organizations such as the RIAA and music labels regularly pollute these networks with nonsense files masquerading as real music/video files. These junk files make it difficult for users to find what they want on such p2p networks. Some researchers at Cornell University have developed a reputation system called Credence, that works on the Gnutella network, allowing users to tell the good files from the bad ones."

78 of 338 comments (clear)

  1. better answer by eight+and+a+quarter · · Score: 4, Insightful

    quit downloading crap off of kazaa/grokster/morpheous/etc. dont trust brittneyspearsporno.avi.mpeg.exe

    --
    lameness filter thwarted.
    1. Re:better answer by strider44 · · Score: 2, Insightful

      That doesn't stop people from generating a random file 700MB big and calling it Serenity.Leaked.avi

    2. Re:better answer by zaxios · · Score: 4, Funny

      brittneyspearsporno.avi.mpeg.exe
       
      Link please.

    3. Re:better answer by mibus · · Score: 2, Funny

      > > Try /dev/null

      > Wow! A lot of hip-hop is in there, along with the latest and newest pop-music from RIAA, too!

      No, that's /dev/urandom

  2. FP? - And that's why I use Bittorrent... by nonlnear · · Score: 2, Insightful

    Gotta love the torrents!

    --
    argumentum ad fallacium: Fallacy of defining a fallacy which allows one to dismiss the argument in question.
    1. Re:FP? - And that's why I use Bittorrent... by larry+bagina · · Score: 2, Informative
      mpeg/avi/wmv/mp3/ogg/etc are already compressed with an encoder specific for video/audio, so secondary compression from zip or rar isn't particularly helpful. However, zip and rar can password protect files, so if you want to see britney's 6-month pregnant sex video, the password id the 3rd word of the 2nd paragraph after you sign up for a "totally free" pr0n site.

      --
      Do you even lift?

      These aren't the 'roids you're looking for.

    2. Re:FP? - And that's why I use Bittorrent... by Anonymous Coward · · Score: 2, Informative

      A large amount of video releases posted to torrent sites are "scene" releases that come from usenet.

      These releases are typically rar-ed into multiple parts to allow for easy and reliable posting to usenet.

      People simply taking a scene release and uploading it to a torrent site is quite common, so these rar releases on places like The Pirate's Bay are nothing to worry about. It's usually a sign that it's a "good" release if you see many *.r0* or *rar files.

      Of course be on the lookout for *exes inside of compressed releases, but the presence of rars means nothing negative as far as a torrent being legit.

    3. Re:FP? - And that's why I use Bittorrent... by EvilMonkeySlayer · · Score: 2

      Sssh, you don't want them knowing about usenet do you?

    4. Re:FP? - And that's why I use Bittorrent... by MasterSLATE · · Score: 2, Informative

      Azureus has that functionality built in. There's a setting for prioritize first chunks (maybe its first/last, but memory says its first).

      --

      [sig]www.masterslate.org[/sig]
  3. I'm a little lost in this whole thing by ReformedExCon · · Score: 5, Funny

    I thought the primary purpose of P2P filesharing was to share legally swappable media files as well as other files like documents and useful freeware applications. Is there some nefarious entity flooding the P2P networks with garbage disguised as those files above? Why would you need to know the quality of the file's reputation?

    --
    Jesus saved me from my past. He can save you as well.
    1. Re:I'm a little lost in this whole thing by Chandon+Seldon · · Score: 2, Informative

      Ever heard of trojan horses? Spam zombies are worth good money.

      --
      -- The act of censorship is always worse than whatever is being censored. Always.
  4. this is stupid by Madd+Scientist · · Score: 3, Insightful

    if the RIAA is willing to create junk files, you really don't think they are going to create fake accounts to rate their junk files as "good"? ANY system you put in place that gathers "votes" from users can be manipulated.

    1. Re:this is stupid by Creepy+Crawler · · Score: 3, Insightful

      Look at kuro5hin's rating system. In a democratic system, participation is the key. AT that site, very few comments are rated upon. The few that are rated are the...

      1: Master parent in big thread.
      2: 1 and 2 child of master parent (in a big thread)
      3: Obscene trolls (...you donkey-raping shit eater...)

      Past that, not many care. Of course we have had a few mod-trolls who create a few accounts to run everything into hidden (similar to beiong -1'ed here), but are countered by the users actually participate for a short while.

      Since the voices on K5 get out, nobody cares about democracy. Just hop on over there if you dont believe me.

      --
    2. Re:this is stupid by TheophileEscargot · · Score: 2, Interesting

      This system specifically addresses kuro5hin's main problem.

      On K5, only a few people rate comments. Basically crapflooders cabals got together enough accounts to outweigh legitimate raters, giving them control of the rating system.

      With this system, the crapflooders would be able to rate each other up... but if you rate differently to them, your view would ignore or reverse their ratings.

      Wouldn't help anonymous users though.

  5. eDonkey by mnemonic_ · · Score: 5, Informative

    Doesn't the eDonkey2000 network already have a system like this? Users identify fakes and report them, then the phony file information propagates throughout the network and the fake file dies.

    1. Re:eDonkey by mnemonic_ · · Score: 4, Informative

      Ah, found it: donkey-fakes. eMule automatically downloads the fakes list upon startup, and prevents the files from spreading.

    2. Re:eDonkey by noidentity · · Score: 2, Interesting

      Doesn't the eDonkey2000 network already have a system like this? Users identify fakes and report them [...]

      So all the RIAA has to do is report all the real files as fakes? Well, along with the fakes, otherwise the real files would be marked as the fakes and the fakes as the real.

    3. Re:eDonkey by Jugalator · · Score: 2, Interesting

      Yes, it's not too uncommon. On BT, I just check how many seeders the file has. People don't want to keep and spend their bandwidth on seeding fakes.

      So far, I've never fallen to a faked torrent with lots of seeds either. Even if RIAA could in theory set up networks to seed fakes, they don't seem to be doing it.

      I think there'd be less confusion if the article title was "New Reputation System for Gnutella To Fight P2P Junk".

      --
      Beware: In C++, your friends can see your privates!
    4. Re:eDonkey by daikokatana · · Score: 5, Interesting
      Indeed - but there is a big problem with that system. eMule recognizes the file hashes and reports them as fakse, but it stops after that.

      For the past few weeks, I have been rewriting part of the eMule source to have the following changes:

      1. I offer a valid file with a valid hash (no fake) 2. People try to download the file from me and move up fast in my queue 3. Once they download a chunk from me, the data I send them is invalid (generated random) 4. Since this part is invalid, they need to redownload it 5. Since they move up faster in my queue than others, they redownload the part from me. 6. etcetera...

      To be honest - I want to sell this tactic, that's why I do it. And so far it works! I get loads and loads of requests and rerequests for files, so this is a perfect tactic to kill the download of valid files - reputation system or no reputation system.

      Remember, the file is valid, but they'll get it much much slower and spend x times the bandwidth to get it. I have unlimited bandwidth (up/down) so I always win in the end.

      If whatever organisation I sell it to employs this on a large scale, the network will be flooded.

      --
      http://jcsnippets.atspace.com/ - a collection of Java & C# snippets
    5. Re:eDonkey by __aaxwdb6741 · · Score: 2, Insightful

      This is easily counter-measured by "banning" sources which are unable to send me a valid chunk in X attempts.

      I think that there could be an easy counter-measure for every measure that the anti-pirate industry takes.

      Every thing they do is just an other inconvenience for the downloader, themselves, the legit downloaders, and Everybody Else (tm).

  6. Self-policing is needed by case_igl · · Score: 3, Interesting
    "...allowing users to tell the good files from the bad ones."

    Don't you mean the real illegal files from the fake illegal files? Seriously, it is no surprise to me why P2P has gotten a bad rap. Many of the users simply use P2P apps to commit piracy.

    Yes, there are legit uses as well. But honestly, if you are looking for free music from a band that has released it as such, you can usually find it. It's the copyrighted commercial music and video that have tons of fake files, porn movies, etc...Not Jim Blow Sings the Blues, Live from Natrona, PA!

    1. Re:Self-policing is needed by Penguin · · Score: 3, Informative

      Yeah, because 300 years certainly isn't enough for a word to be recognized...?

      From http://www.etymonline.com/index.php?term=pirate :

      "Meaning "one who takes another's work without permission" first recorded 1701"

      Come on, the term is older than RMS!

      --
      - Peter Brodersen; professional nerd
    2. Re:Self-policing is needed by EvanED · · Score: 4, Insightful

      But what the parent is saying (and which is a very legit argument if you ask me) is that if you're looking for a Debian repository, you're almost certainly not going to find a fake file!

      If you want to be sure, you can compare the file size to the official one. If it matches, you can be all but completely confidant that it's real.

      After all, there are probably far fewer people trying to flood P2P with bogus files just for the hell of it then there are trying to flood P2P with bogus files in an attempt to protect copyright.

  7. Bitzi all over again? by Motherfucking+Shit · · Score: 2, Insightful

    How is this any better than Bitzi and its Bitprints, which are already built into popular Gnutella servents like BearShare?

    "Our client provides a peer-based judgement that a given object will possess the properties with which it is labeled and enables users to evaluate search results for authenticity before downloading."

    Sounds exactly like Bitzi to me...

    "Many peer-to-peer reputation schemes have been proposed in academia. Credence is the first practical implementation of a peer-to-peer reputation scheme."

    I don't think so.

    --
    "BSD: Free as in speech. Linux: Free as in beer. Windows 10: Free as in herpes." --Man On Pink Corner in #52607549.
  8. Re:Torrents can be bogus too. by mnemonic_ · · Score: 2, Informative

    ...which only verifies file integrity. It doesn't check if the file is what its filename says it is. It only ensures correct data transfers, not correct data.

  9. Here's a simpler idea... by lightspawn · · Score: 4, Insightful

    If a file appears to by RIAA-affiliated music, treat it as a junk file.

    Why bother with music the artist doesn't want you to have? Just forget about it altogether and discover new music, even new types of music that you'd never realize existed, much less that you could enjoy.

  10. NO by zymano · · Score: 2, Informative

    OVERVIEW

    Credence is a robust and decentralized system for evaluating the reputation of files in a peer-to-peer filesharing system. Our goal is to enable peers to confidently gauge file authenticity, the degree to which a file's contents matches its advertised description.

    At the most basic level, Credence employs a simple, network-wide voting scheme where users can contribute positive and negative evaluations of files. On top of this, a client uses statistical tests to weight the importance of votes from their peers. And finally, Credence allows clients to extend the horizon of information by selectively sharing information with their peers.
    Authenticity and Pollution

    We define pollution broadly as any file with content that does not match its description. An authentic file, by contrast, has content that is accurately described by its metadata. We find in practice that pollution in current networks can be easily identified by users without any special knowledge or expertise. As pollution becomes more sophisticated, more advanced detection techniques will need to be developed to help users safely identify malicious content.
    Voting

    The Credence system relies on individual users as the first line of defense against pollution. After a user downloads and uses a file, she is given a chance to submit a single vote to the Credence system: a positive (thumbs-up) vote for authentic files, and a negative (thumbs-down) vote for a polluted file. Each vote is cryptographically signed and entered into the system.
    Vote Gathering

    Credence uses these votes collected in the network to determine the authenticity of content. Credence displays a rating for each file that appears in response to a user query.

    First, the client software executes a search for votes, and downloads a number of votes randomly selected from the network. These votes are then aggregated into a single estimate of the authenticity of the file in question.

    Each vote collected from the network is not used directly, however, since some peers in the network may accidentally vote incorrectly, or even lie intentionally about the file's authenticity. Therefore we assign to each peer a correlation coefficient, or weight, reflecting the historical usefulness of the peer's votes. In effect, this helps remove the incentive for an attacker to lie about the authenticity of files. A consistent liar is, after all, just as useful as an honest peer when it comes to distinguishing authentic files and pollutions. And an inconsistent voter will come to be be ignored by others in the network.
    Information Sharing and Transitive Correlation

    Peer-to-peer networks can grow quite large, and many clients might participate rarely, sharing and voting on only a few files. This means that alone, a client may have trouble quickly discovering peer correlations and other historical data. To alleviate this problem, Credence uses a technique called transitive correlation to quickly spread information among small groups of peers and help clients expand their horizon .

    In Credence, a client periodically requests historical data from selected peers in the network. This data contains information on how the peer voted in the past (cryptographically signed, as before), and information about how the peer is related to other peers in the network. The client can then validate this information for authenticity, then integrate it into its local databases. In this way, not only does the client take advantage of the work other peers do in evaluating files for authenticity, but also gains insight into the behavior of peers in the network. All this is done without need for user interaction, or any peer trust values, which can be difficult for a user to accurately determine.
    Changes to the LimeWire Client and Gnutella Network

    Credence is integrated into the LimeWire client, and works on top of the Gnutella network. The implementation is built entirely on top of existing primitives in the Gnutella protocol. It opens up no additional ports

  11. What outrage by dedazo · · Score: 2, Insightful
    Because we all know that P2P is only used to trade legal content. How dare those evil record companies "pollute" the system.

    Shocking.

    I don't know that their tactics are effective - after all, networks like eDonkey|eMule seem to be pretty good at self-policing. But it's amusing to see the undercurrent of outrage in these 'stories'.

    We all know damn well why the *AA folks do what they do.

    --
    Web2.0: I love when people Flickr my cuil and digg my boingboing until my google is reddit and I start to yahoo
  12. Good summary by kernel_dan · · Score: 3, Informative

    For those of you that can't be bothered to RTFA, this system takes a profile of how you vote on files and matches you with other people who voted similarly. Thus, the spammers would see different ratings than 'normal users.'

    --

    Illegal? Samir, This is America.
  13. You can already tell by ravenspear · · Score: 2, Interesting

    It is already very east to tell the junk files from the good ones. The junk ones will come from a very limited IP range. What usually happens is that the *AAs, and the companies they hire to pollute the networks will use the entire IP range they own to do that, but that usually still only amounts to a few class Bs. The good files on the other hand will come from all different class As.

  14. Its not all bad... by distantbody · · Score: 5, Funny

    The fact that I didnt get to play HL2 was compensated by the 2 hours of dwarf porn.

  15. rtfa, sucka. by knowles420 · · Score: 5, Informative

    7. Can a group of spammers game the Credence algorithm by voting thumbs-up for each others' spam ?

    No. The trustworthiness computation is designed to preclude such attacks.

    8. What happens when a large number of spammers vote each others' spam up ? Can they fool the reputation system ?

    No. Credence's reputation computation is similar to Google's PageRank, but is more general - every node computes a different rank based on its own votes. Reputation flows from a given good node along trust edges towards other nodes. Spammers can create tight cliques in which everyone votes on each others' spam, but the entire clique will be deemed untrustworthy. And if anyone in the spammer clique does a search, they will see each others' spam ranked high.

    or, just do whatever you want.
    --
    -knowles
    1. Re:rtfa, sucka. by PylonHead · · Score: 5, Informative

      No, the pot smoker is right. Your brain is too small to absorb their goodness.

      In their system there is no single "high reputation" metric. Everyone had a different reputation to each other. Three people, A, B and C. A may have a high reputation as far as B is concerned, but C thinks A has a low reputation.

      They do this by grouping people who vote the same way. So you trust the people that vote like you do.

      Assuming that you vote good files up and bad files down, you will be grouped with people who do the same. At some point, the spammers have to start voting differently than you do.. voting their spam up. This will distance them from your trust network, and cause you to value their opinion less.

      --
      # (/.);;
      - : float -> float -> float =
    2. Re:rtfa, sucka. by xquark · · Score: 3, Informative

      yes correct, and in-fact it can be taken one step further:

      assume the system is able to determine symmetric groups.
      that is groups that have totally (or near totally) different
      voting directions, an example would be the honest group and
      the spammers group.

      if say the spammers vote something up, instead of the honest
      group ignoring their rating, they can use the symmetric
      properties between their group and the spammer's group to
      re-enforce their vote (aka the credence) of the file in
      question - in this case rate it down even further.

      If the right restrictions were put in place such as the fact
      that the symmetric effect will only effect files that have a
      negative credence and not files that have a positive credence,
      then various forms of collusion can be over come.

      A lie can always be turned into a truth and a truth into a lie...

      Arash Partow
      __________________________________________________
      Be one who knows what they don't know,
      Instead of being one who knows not what they don't know,
      Thinking they know everything about all things.
      http://www.partow.net/

      --
      Arash Partow's Philosophy: Be a person who knows what they don't know, and not a person who doesn't know.
    3. Re:rtfa, sucka. by cahiha · · Score: 2, Interesting

      No, the pot smoker is right. Your brain is too small to absorb their goodness.

      The authors have not shown that their system is resistant to attacks. Maybe it seems plausible to them and to you that it is, but plausibility is not the same as actually demonstrating that property.

    4. Re:rtfa, sucka. by andersa · · Score: 2, Insightful

      You don't get it. You can't infiltrate the trust circle. Those inside would shove you out, once they find out you are no good, so your basic premise fails.

    5. Re:rtfa, sucka. by pv2b · · Score: 2, Interesting

      1000 accounts per hour for an hour isn't outside the realm of possibility at all.

      There are 3600 seconds in an hour. If you were to streamline the process of registering the accounts, so that the only human process were to decode the CAPTCHA image, I could definitely see myself performing one of these tests every 3.6 seconds, especially with some practice. (Depending on the difficulty of the test of course.)

      This is where you employ people at minimum wage, or even illegal immigrants below minimum wage. I'm not that in to the exact figures for the United States, but I'd guess minimum wage would be somewhere around $5/hour. This makes it pretty cheap to create 1000 accounts. :-)

      Now, going beyond say 1000 by a few orders of magnitude, the cost of passing CAPTCHA tests goes up the same way.

      This, of course, is ignoring advances in AI technology that are starting to be able to identify and pass CAPTCHAs. It doesn't matter if they suck. Even if you only get 1% accuracy on them (lowball estimate), that only slows down an automated account generation attack by a factor of 100, and with *cough* clandestine grid computing *cough*, this becomes a quite effective attack in todays world with zombies etc.

  16. Nice countermove... by barks · · Score: 3, Insightful

    I like this idea. Media hordes, read as RIAA and MPAA, will constantly try to find technical ways to put the P2P genie back in the bottle.

    For every Napster (Kazaa, etc.) they close, another will be spawned. For every fake or intrusive system they create to battle downloaders, another downloading method will be innovated. For every commercial they feature a celebrity crying copyright heresy, /. mobs will just mock them.

    It's no shattering concept there'll never be a checkmate for either side.

  17. Moderate Moderators by zymano · · Score: 2, Insightful

    Like what Slash does.

  18. Re:One problem with this Credence system: by Anonymous Coward · · Score: 5, Insightful

    I think the main insight and contribution of the system is that the reputation of a peer according to you is determined by whether he/she votes in a similar manner as you.

    So if the RIAA starts spamming Gnutella with lots of junk stuff, you will never vote in the same way as the RIAA dummy accounts, and you don't take their votes into account.

    In fact, it seems the system is even smarter than that - it can take votes from people that are strongly uncorrelated with you and use that as negative information. So anything these people vote as valid files, you can treat as garbage as their definition of good/bad files is completely opposite to yours. And assuming you trust your own judgement, that means those files must be bogus.

    Reminds me a lot of the google pagerank system, but with explicit learning/training instead of using back-links for determining correlation.

  19. Re:One problem with this Credence system: by patternjuggler · · Score: 2, Insightful

    Seems the trust system is prone to spamming itself. If the RIAA (or anyone for that matter) flood the system with bogus votes, then the "honest" votes will get ruled out.

    I haven't read the description closely, but it's hard to see why flooding the system would matter- it isn't majority rule, it's who do you trust and who do they trust. If the RIAA has ten million bogus users, I and a few hundred other people vote thumbs down on them but thumbs up for each other, then we have our little corner where a set of honest opinions exist (although it may take a while to initially connect to that group).

    The way you would have to spam the system would be to vote honestly for a time and then switch abruptly, but even then the damage would be quickly mitigated as your credibility disappears.

  20. You misunderstand by fbartho · · Score: 2, Interesting

    You misunderstand what you quoted... if they flood the system with votes, it matches them with the type of vote they make, when you use it you are matched with the type of vote you make... Thus, if you mod real files up then another user who mods real files up will trust your mods more than somebody else. If the evilpeople mod real files down and bad files up, then they will trust the mods of other evil people, but they won't trust your ratings, and you won't trust them.

    Thus if you wanted to have a really easy way to find a list of crap files, you just have to mod down every real file you have, and mod up every piece of crap you have, then do a search. Your results will be clustered by the trust that the file you are getting is "like yours" or in that case, a fake.

    --
    Gravity Sucks
  21. Re:Torrents can be bogus too. by nunchux · · Score: 3, Informative

    True... But a bogus torrent usually doesn't survive too long and certainly doesn't see too many seeders. If it's been up for a day or two you can be reasonably sure it's valid.

    Also, even the "pirate" torrent sites are centralized and often even have administrators, sometimes even comment boards. If a torrent is bogus, someone will take it down. (Not that I've been to those sites, of course...)

    Of course this could all be manipulated, but AFAIK it hasn't been yet by the powers-that-be... And I don't see why they'd bother, when a threatening letter is all it usually takes to take a torrent site down, and it would take considerably more effort than turning a bunch of scratchy mp3's loose on kazaa.

  22. Re:One problem with this Credence system: by Stauf · · Score: 2, Insightful
    I love shooting people down with their own links - from http://www.cs.cornell.edu/People/egs/credence/faq. html:

    7. Can a group of spammers game the Credence algorithm by voting thumbs-up for each others' spam ?

    No. The trustworthiness computation is designed to preclude such attacks.

    8. What happens when a large number of spammers vote each others' spam up ? Can they fool the reputation system ?

    No. Credence's reputation computation is similar to Google's PageRank, but is more general - every node computes a different rank based on its own votes. Reputation flows from a given good node along trust edges towards other nodes. Spammers can create tight cliques in which everyone votes on each others' spam, but the entire clique will be deemed untrustworthy. And if anyone in the spammer clique does a search, they will see each others' spam ranked high.


    So a 'good' rank is dependent on the whims of people who usually vote the same way that you do. So spammers will see high rated spam and non-spammer will see high rated non-spam. Simple.
  23. Re:This has to stop by jericho4.0 · · Score: 3, Insightful
    You're a troll, but your first paragraph accurately represents what many feel about this.

    The research and motivation for this is important. If peer to peer networks can be subverted, then they have lost their usefulness. IMO, the sharing of copyrighted data is unavoidable, and sacrificing the freedom of a protocol in an attempt to prevent it is shortsighted.

    It probably would have been better for Cornell if it had been left as a paper, rather than implementing it.

    --
    "A language that doesn't affect the way you think about programming, is not worth knowing" - Alan Perlis
  24. Re:Who cares by PhreakinPenguin · · Score: 2, Interesting

    I'm thinking this is a troll but what they hey, I'll bite. The problem with the P2P flooding is not that people don't know how to use it. I've used P2P for quite awhile now and would consider myself somewhat "skilled" at searching for things I need. The junk files are the exact same size as a normal file would be. A song for example, will show that it's a 192KB song, lasts 4:30, and is shared by 40 people. So you download it and start to listen. The first 30 seconds is perfect audio, then it's followed by 4 minutes of either dead silence or some sort of annoying tone. I can almost guarantee that there's more junk out there than real files. And as a side note, you may want to know what you're talking about before you start calling people dumb.

    --


    My sig of choice is Marlboro
  25. Taking advantage of the hoarder mentality by hellfire · · Score: 4, Interesting

    Many hardcore file shares and hosters, dare I say most that would call themselves hardcore, are not in it for getting free content on demand when they want it. They are into collecting absolutely anything and everything they can get their hands on. In some collections, people wouldn't possibly, in their lifetimes,be able to listen to all the music or watch all those movies. But just the thought of having it makes many hoarders happy. And it's not even necessarily reputation amongst others. It could be in many cases, but not always. They just have to have it.

    What's my point? Well, this is the greatest strength and weakness of peer to peer. Hoarders ensure a healthy flow of files, but they rarely actually check what they have. They don't check to see the software works, or if the music is a complete copy, or that the movie was cut down to a quarter of the original screen size.

    This is what companies take advantage of, both those who want to hurt swapping, and those who just want to seed files for the purpose of installing some evil spyware. It's nice to have a bunch of people trying to seed the masses but cmon the point of file sharing is to pool our independent resources. For someone who doesn't have all day to search for files and test quality and whatnot, it is sometimes less painful to just go buy the CD than it is to actually try to download it amongst the mess of files that are out there.

    --

    "All great wisdom is contained in .signature files"

    1. Re:Taking advantage of the hoarder mentality by cowscows · · Score: 2, Interesting

      And that's why there's such a great business opportunity for downloadable content. That's why Apple is selling so many songs with their music store.

      "Casual" downloading of shared content is hard, especially if you want decent quality. Convenience means a lot to people. There are lots of lazy individuals with plenty of money, and they'll happily give you a little money if you do some work for them. And figuring out a decent download counts as work.

      I don't mind the music and movie studios being against file sharing. I don't even mind them suing people who share files. I just think it's kind of silly for them to be as vocal about it without offering something to compete with it. I mean, these people run giant media conglomerates. You don't get into that position without at least a decent amount of business savvy. Why are they so unable to see a big potential source of business?

      But I agree with you with the hoarder mentality, and I find it sort of fascinating. A few years ago I had a roommate like that. He had every game imaginable, and no time to play them since he spent all his free time downloading more. He had every piece of graphics software I've ever heard of, and no time to really learn how to use them. He downloaded 320x240 copies of lots of crappy movies, and he watched some of them, but always secluded back in his room by himself, cause nobody else wants to watch sucky quality like that. While my other roommate and I would buy a few DVDs, and hang out in the living room with a couple other people and make it a decent social event. It was interesting to watch this kid go.

      --

      One time I threw a brick at a duck.

  26. Companies hurt by jamienk · · Score: 3, Interesting

    Many many companies (and individual artists) have faced SERIOUS economic damage by attempts to thrawt P2P from being absolutely ubiquitous and maximally effective. Estimates are in the BILLIONS of dollars (US only) of lost sales in broadband connections, blank media disks, large hard disk drives, software support, consulting fees, home audio/video equiptment, and the like. And Western countries are fast falling behind as the majority of educated citizens from developing nations take advantage of the black market for these goods and services while Western citizens are blocked in droves by propaganda, political corruption, inferior substitutes, and FUD from fully participating in the open exchange of science, the arts, poltical discorse, and culture in general.

    Credence will hopefully bring us a bit closer to reaching our current potential.

    1. Re:Companies hurt by Matey-O · · Score: 2, Insightful

      That's a lovely, scary, statement you've made there, and it's earned youa buncha karma...care to back up the claims with a citation or two?

      Billions? I highly doubt Billions.

      --
      "Draco dormiens nunquam titillandus."
    2. Re:Companies hurt by helgihg · · Score: 2, Interesting

      Yes, yes. Actually, the evidence behind sharing artwork simply does not stand with the case. It is ASSUMED that they're losing an X amount of money because of some Y factor, but no evidence has popped up yet to actually support these claims. I believe that it's a fundamental mistake to first of all assume that those who are downloading copyrighted material, are going to be less interested in buying a retail version. I think this whole thing is a misunderstanding. I think somebody who downloads Fight Club and loves it, is in fact MORE likely to buy the retail version than someone who never saw the movie to begin with (or experienced it as-good-as-it-gets in a moviehouse or something). I think the fundamental mistake here, is to assume certain behaviour upon dozens of millions of people, in a multi-billion dollar industry, and to me, that's not just scientifically shaky, it's also intellectually proposterous. People don't just work the way you (or the companies) assume they work. The companies are NOT losing money, quite on the contrary the industry is expanding faster than ever before, and absolutely nothing indicates that the free flow of information in general (regardless of copyright) has any consideriable negative impact on the interests of these companies, not to mention the good it actually does to the idea of a people, that have the opportunity to know what they're buying before they buy it. That's not a very high standard, to know what you're buying, and me being able to use my mother's car every once in a while does not mean that I don't have any reasons left to buy my own car. Quite on the contrary, if I'd *never* use my mother's car, I'd probably just be happy with bicycling. But I'm not. I like to drive, and I want my own car. I bought myself a Muse CD the other day after a buddy of mine showed me dozens of their songs, 100% illegally. This people-work-in-the-worst-possible-way theory just has to go. It just doesn't work like that. I'm betting on that not only the industry, but also the public, will have grown up from this scientifically shallow theory of a people that simply hate good products. It's just simply not the case.

  27. Even better answer by quadra23 · · Score: 2, Informative
    quit downloading crap off of kazaa/grokster/morpheous/etc

    Use a P2P program that actually includes some 'anti-junk" features. I typically use Shareaza (probably not the best, and I'm sure someone will state a better P2P but the points still remains, Shareaza does offer some features these clients do not -- including a rating/comment system that goes with the file whenever anyone finds a search result for it). Usually I know if the file is a fake before I download because I use some obvious signs:

    • How many sources have this file? (more can be just as suspicious as legitimate
    • Is the file size relatively the same to one fake file I already downloaded? (yes, sometimes they are just copies with different names)?
    • What kind of comments/ratings does the file have when I select it in the search list? (of course this could be a little flaky if the 'junk spreader' decided to positively review the file)

    I prefer the client program including these features, especially when it's available to connect to several networks at the same time. Nothing worse then getting a 100MB+ file and realizing you wasted the bandwidth for not, or the program you downloaded wasn't the same as the file name (more legit, but not what you were looking for).

    Do be careful because some files that are really a virus can be detected by AV as 'ok'. Thankfully I found the virus before it did much damage and by reading the Symantec AV report I was able to make sure I removed it completely. Just because one 'setup.exe' claims to be a setup program don't trust it unless you trust the name of the setup program -- "Program Setup Wizard" does not cut it!

    Since Shareaza also supports torrents I usually go through torrent sites and have rarely had any 'junk' files from the torrents. The more junk the RIAA (and other companies!) try to spread the better we get at ignoring and working around it!

    1. Re:Even better answer by eight+and+a+quarter · · Score: 2, Insightful

      This is what happens.
      Company XYZ comes up with P2P app.
      Distributed online for free.
      Users who claim to be legit flock on P2P site.
      **AA lawyers swarm.
      Site is closed down.
      People are sued.


      Real losers?
      Company XYZ.
      Users who are sued as examples.
      You just lost the newest P2P site.


      Wash. Rinse. Repeat.

      --
      lameness filter thwarted.
  28. Flaw in this approach by typical · · Score: 2, Interesting

    Actually, while I doubt the OP intended it, he has a good point.

    See, let's be honest about this. While there will *always* be jackasses out there who spam networks just because they can, and a few more people trying to shove spyware down people's throats, a pretty big chunk of the folks producing spam are those trying to prevent their copyrights (however overly-long-lived they may be) from being infringed upon.

    So, the point is, that it's a good bet that a sizeable chunk of the files being shared aren't exactly legal.

    Which means that you don't really want to make it especially obvious that you're sharing said file.

    What this system does is provides a cryptographic signature on a small, publically downloadable piece of data that establishes that you have downloaded and *consciously examined* the file.

    Frankly, this is pretty good evidence for someone trying to push an infringement lawsuit that you have infringed upon their copyright (yes, our work has MD5sum "foo" the same as the thing this guy is rating.

    That being said, I do think that a more sophisticated method to this approach will win.

    The largest problem on the Internet has always been rating and attributing data -- Google takes a pretty decent stab at some of the problem, and look how essential they've become. This just does a much better job.

    --
    Any program relying on (nontrivial) preemptive multithreading will be buggy.
  29. Renamers by DuranDuran · · Score: 2, Interesting

    Organizations such as the RIAA and music labels regularly pollute these networks with nonsense files masquerading as real music/video files. ...as do the "renamers". I wonder if anyone has studied why such people rename files in this way?

    --
    "You can justify anything by putting it in quotes, adding a famous name and making it a sig" - Albert Einstein
  30. rtfa, sucka.-Groupthink actually works? by Anonymous Coward · · Score: 2, Insightful

    So, when's Slashdot going to impliment this "golden" system?

    1. Re:rtfa, sucka.-Groupthink actually works? by LinuxHam · · Score: 2

      when's Slashdot going to impliment this "golden" system?

      Right after they implement a spell checker.

      --
      Intelligent Life on Earth
  31. Re:Problems by Beolach · · Score: 2, Informative
    Except the way it works is that the reputation of a file that you see is based not on the over-all votes of the total population (including spammers). The reputation of a file that you see is only based on the votes of other peers that you have a high correlation with, based on what files you rate as good and bad. So if you have rated 9 files, and I have rated those same 9 files in the same way you did, then Credence would trust my ratings for you.
    From the FAQ:
    3. How does Credence know who is trustworthy and who is a spammer?
    Initially, it doesn't. As you vote for files, it stores your votes and discovers the set of peers with whom your votes are correlated. It also communicates with peers to find out about other peers with whom they in turn are correlated. The outcome of this computation is a numerical value computed for each file appearing in query results that reflects the probability that the given file is trustworthy.

    If you vote thumbs-up for good files and thumbs-down for bad files, you will be grouped with the vast majority of people who also vote honestly. You will then compute a high trustworthiness metric for all files that this (potentially very large) group of users has ever voted on. If you vote inaccurately (i.e. you are a spammer), you will compute a low trustworthiness metric for other non-spam files, and honest users will compute a low trustworthiness coefficient for your opinion. It is thus in your best interest to vote honestly.
    ...
    6. I hate the music group X. Should I vote thumbs-down for their songs?

    No. See the question above - your votes should simply reflect whether the file's description is accurate and whether its contents are intact. Voting thumbs-down for a perfectly good file may cause your node to be lumped in with spammers and reduce the effectiveness of Credence for you (i.e. you will likely see more spam in your searches).
    --
    Join moola.com, play games to earn money.
  32. Re:Problems by Beolach · · Score: 2, Informative

    Whoops, posted too soon. The second potential problem you describe is more in line with how Credence is described to work, but I think it's unlikely to be a very big problem. Yes, the system will probably allow for "mistakes," but it will cull those mistakes out. So if the spammer rates most good files good and bad files bad, but rates their one spam file also good, then it is possible your client will report that spam file as having a high credibility. But, once you (or anyone else) download and find that it is not a good file, you will rate it bad, and as more people rate it bad, its credibility will go down. It's a case of diminishing returns for the spammer.

    --
    Join moola.com, play games to earn money.
  33. Re:Torrents can be bogus too. by frostw · · Score: 2, Informative

    Ummm, yes there is. For instance, VLC media player will play partly downloaded videos.

    --
    http://www.sydney-webcam.com
  34. Can this system work on ./ ? by fundflow · · Score: 3, Interesting

    This may automate the reviewing process

  35. Still an issue with "hit-and-run" by Kjella · · Score: 3, Interesting

    1. Mark a bunch of good files as good
    2. Mark your bogus file as good
    3. Spread your vote list on zombie network
    4. Your votes corrolate highly with "good files", and there's no counter-votes by others (yet)
    5. Trick lots of people to download it (the rating goes to shit eventually, but...)
    6. New bogus file. Goto 1.

    In addition, you have an issue with semi-good files. What if the encoding is flawed, should you mark it as bad or good? Either case can put you at odds with the general opinion.

    Third, you have an issue with files trolling for incorrect votes. Create a "non-obviously" bogus file, which some people will mark bad, others good. You'll create a lot of conflicting votes and "noise" in the system to make attacks like above possible.

    Kjella

    --
    Live today, because you never know what tomorrow brings
  36. Re:Torrents can be bogus too. by Irashtar · · Score: 2, Insightful

    True, but Torrents rely on the community, while with things like kaaza, many times what you want is hosted by one guy, and it's hard to kick fakes. In torrents, fakes die very quickly, thanks to the 'OMG fake' comments on the torrent sights.

  37. Huh by TCM · · Score: 5, Insightful

    Who actually searches for files in the P2P client? Normally you visit some site where the releaser himself posted a torrent or an ed2k link and you download that.

    I can't remember the last time I actually searched in eMule.

    --
    Of course it runs NetBSD. BTC: 1NT7QvbetmANwaMzhpVL6
  38. Re:This has to stop by cliffski · · Score: 3, Interesting

    couldnt agree more. P2P is superb stuff, and has all kinds of legit uses, but to pretend that its not 95% used to download copyrighted music and movies and thus save a few bucks is just denial.
    There are far too may slashdotters who reply to any article on copyright with "get with the system dude! copyright is over!" usually they seem to be 13 year old kids who dont understand what its like to have your income and career based on developing electronic products.
    Do people really think that Lord of the Rings deserved to sell just 1 copy, to the p2p hacker who ripped it?

    --
    DRM-free indie games for the PC and Mac: Positech Games
  39. Litigation index by xixax · · Score: 4, Interesting

    Can this also be used as a metric for the RIAA and MPAA to decide which people to take legal action against? Go for the most trusted, most highly rated individuals and take out the most influential (central? critical?) nodes. In the same way that cliques of poisoners would stand out.

    Xix.

    --
    "Everything is adjustable, provided you have the right tools"
  40. Re:This has to stop by utexaspunk · · Score: 2, Insightful

    Do people really think that Lord of the Rings deserved to sell just 1 copy, to the p2p hacker who ripped it?

    There was a time when home video didn't even exist, and yet movies still got made. After their initial run in theaters, movies would only be shown every now and then on tv late at night. They were basically filler programming for what would have otherwise been dead air. One might even argue that the average quality of new movies has been on a steady decline since then.

    Just because I can package something for individual sale and ask people to pay for it doesn't mean it's right, copyrighted or not. Especially when the producers more than recouped the cost of production long before going to video and that copyright is going to be extended ad infinitum.

  41. Why is that AC post modded "Troll"? by Travoltus · · Score: 4, Interesting

    I disagree that these scientists are breaking any *legitimate* law, but if you accept as a premise that they are, then they are in fact breaking the law using taxpayer dollars.

    Instead of modding that down it should be modded up so more people can discuss the ramifications.

    Do we allow taxpayer dollars to be spent on civil disobedience? On that issue, I am very unsure.

    --
    --- Grow a pair, liberals... stop letting the Republicans bully you!
  42. Setting a precadent by gallondr00nk · · Score: 2, Interesting

    Despite everyone's views on the use of p2p networks, isn't it a dangerous precadent to set to allow these companies to steamroller over *anyone* who dares share copyrighted material. Is living in a DRM world where consumer rights are constantly reevaluated as to give us the least amount of enjoyment and freedom from our purchases worthwhile? It doesn't matter *who* these p2p sharers are, isn't setting the precadent of removing consumer rights by DRM (to copy, rip, burn for backup etc.) far more demeening and indefencible?

  43. Re:This has to stop by wilsonao · · Score: 2, Insightful

    "Just because I can package something for individual sale and ask people to pay for it doesn't mean it's right, copyrighted or not." Hi, this is reality talking, we'd like to know where you could have possibly formed such a delusional idea that people don't deserve to sell the things they made. I guess I will just go over to McDonalds now and grab a big mac without paying. I don't think it's right to pay and I haven't eaten for about 2 hours, so why should I have to pay?

  44. This modding down has to stop by gorim · · Score: 2, Insightful

    Too bad he wrote as AC, but he has one of the most insightful points of the entire thread, an entire aspect that is overlooked.

    The basic premise of the slashdot story is how cool it is that researchers are defending the acts of people to trade in uncorrupted *illegal* file trading.

    After all, it seems the most if not all corrupted files are ones that, if they weren't corrupted, would have been illegal to trade anyway.

    I think the RIAA and MPAA are scum sucking pigs who need bacon carved off their arses and handed to them. Still, I also think their concern about massive illegal fileswapping is legitimate, even though the leaders of their respective industries are the ones truly responsible for raping their own artists...

  45. Another for the RIAA/MPAA Tool to use... by Kamiza+Ikioi · · Score: 3, Interesting

    The system seems like a tool to use against the RIAA/MPAA to block pollution efforts. However, then the other shoe drops, and the RIAA/MPAA has a tool to target the highest ranked nodes/cliches/people. No longer do they need to figure out how many files you have.

    They just have to find one file, extrapolate your rank to the average system rank, run a few numbers (and maybe a few inflated costs in there too), and bam... for sharing Happy Birthday To You.mp3, you get slapped with a $1 million infringement case because you happen to rank as a very high legitimate link.

    On the other hand, this might be benefitial to take the heat off of the majority of the file trading community that honestly is NOT costing them any money. They don't need to target the casual "weekend downloader", who's rank should be significantly lower (being a new node on the network) than some guy with 4 160GB HDD's of the latest releases to theater and DVD. Nobody feel sorry when these guys (or gals) get busted. When 14 year old choir girls get busted, there is PR hell to pay. This system allows them to do that.

    Didn't RTFA, but that's my first impression. A use to boost network quality, a use to increase (not decrease) the reach of the **AA's, and a use that may help both sides.

    "Every tool has at least 2 completely unassociated uses. A spoon can serve food to your mouth, or gouge the eyes out of your enemies." - Me

    --
    I8-D
  46. Re:Torrents can be bogus too. by badfish99 · · Score: 2, Insightful

    I haven't tried VLC, but mplayer will usually play partially-downloaded torrents if they are mpeg files. It just skips the bits that haven't been downloaded yet.
    It's less reliable with avi files: it doesn't seem to like it if the first part of the file is missing.

  47. Re:Torrents can be bogus too. by Spudds · · Score: 4, Insightful

    And I don't see why they'd bother, when a threatening letter is all it usually takes to take a torrent site down

    That's not really true. Depending on where the site is hosted, legal threats could be more humerous than scarry.

        Case in point.

          Btw, if you've got a few minutes to kill, you should really check out some of the emails to and responses from thepiratebay.com. They are hilarious!

  48. Downside? by Luveno · · Score: 2, Funny

    Will this make finding CCR on P2P harder?

  49. Evidence? by venomkid · · Score: 2, Interesting

    "As you can see, your honor, according to a ranking system on the pirate file-sharing network, the accused had a high rank for carrying real, pirated files."

    No, thank you.

    --
    vk.
  50. Re:exactly! by Secrity · · Score: 2, Insightful

    " if p2p files are legit, why do you need checksums.."

    To verify that the file is, in fact, legitimate. There are a number of unscrupulous folks out there that would just love to have even just a few people install their trojans. As Ronald Reagan said "Trust, but verify."

  51. Re:Torrents can be bogus too. by SpecBear · · Score: 3, Insightful
    It's happening. Contrary to what many on slashdot think, the *AA orgs employ some pretty clever people, and the pirate networks are far more vulnerable than is often assumed. Check out the comment threads on some new releases at one of the torrent sites, and you'll see that a number of spoofing tactics are in use:
    • Fake files. This is clearly a more primitive tactic and can be thwarted by clients that can be set to download the first parts of a file first.
    • Incomplete files. The seeder reports having the entire file, but will never deliver certain parts of it. Thus, downloaders get stalled at 98.5%. And it's amazing how long people will wait for that last bit.
    • Fake seeds. Haven't confirmed how this one works, but sometimes you'll see a torrent with an improbable number of seeders (e.g., 300 seeds and 100 leechers for a fairly new torrent). Lots of seeds attract more people.
    • Timing. For example, demand for a movie will rise in the days shortly before its release. If you get your fake tracker up and running during that critical time before there's a real pirate version out, then you'll attract downloaders and waste their time. And there's a snowball effect: when people go to download from BT, all of things being equal they usually go for the tracker that has the most people on it.
    Combine the tactics, and you've got a serious problem. Every user adds to the strength of the distribution network so tying up one client with a fake not only prevents that client from getting the material, it also keeps that client from helping others get it as well.

    If you're patient, persistent, and knowledgeable, you can avoid or minimize the impact of these spoofing tactics. But patient, persistent and knowledgeable don't really describe the average pirate (or just about anyone else, for that matter). The dedicated pirate simply won't be stopped, and the content producers know this.

    Like you, I once assumed that the various forms of moderation on the torrent sites would mitigate this. But the countermeasure are slow to work, as I've seen fake torrents stay up for weeks. It's easy to post multiple new fakes. And users are incredibly clueless. I have, on several occasions, seen comment threads where several people will post "This is a fake, don't bother," but the torrent will still have thousands of people downloading and the very next comment will be something like "I've been stuck at 99% for three days, will somebody fucking seed this!!" Remember, the goal isn't to elimiate the network. The goal is to make it so untrustworthy and unreliable that it's too much trouble for Joe User and he'll go to the theater instead.