Slashdot Mirror


Can Poisoning Peer to Peer Networks Work?

andrewchen writes "Can poisoning peer to peer networks really work? Business 2.0 picked up my research paper from Slashdot and wrote an article about it. In my paper, I argue that P2P networks may have an inherent "tipping point" that can be triggered without stopping 100% of the nodes on the network, using a model borrowed from biological systems. For those who think they have a technical solution to the problem, I outlined a few problems with the obvious solutions (moderation, etc.)."

147 of 391 comments (clear)

  1. The easiest solution to fix poisoning... by Blowit · · Score: 3, Insightful

    Have each user vote for each server they download from. If a specific server gives out bad files, the users would vote as a bad server. Then it would not be able to connect to the P2P network.

    This would be moderation however, it would be the smartest way as each user would have their word on who is allowed and not allowed on the network.

    --
    *Headline News* censorship shuts down the Internet! More at 6PM!
    1. Re:The easiest solution to fix poisoning... by Stonehand · · Score: 2, Insightful

      Does that require either centralization (which attracts lawyers and introduces a single point of failure) or trust (P2P propagation of votes, which might be spoofable by a small conspiracy)?

      --
      Only the dead have seen the end of war.
    2. Re:The easiest solution to fix poisoning... by jeremy+f · · Score: 3, Insightful

      Unfortunately, that would lead to bias from potential downloaders of music, as well as for manipulation of ratings by an individual or a group of individuals. Ultimately, this would only serve to flesh out targets by would-be P2P 'hunters', i.e. RIAA agents.

      If I see a list of servers, and a rating, I'm instinctively going to select one of the top rated servers. Most people's ratings of such servers would be a function of two distinct factors:

      - Does the server have what I'm looking for?
      - How quickly can I get this file from this server?

      If both factors are very favorable to me, I'm going give this server a good rating. If I can't connect, or the server doesn't have what I'm looking for, I'm going give the server a poor rating.

      If a server wants to become highly rated in this type of a system, the operators must provide

      - Lots of bandwidth
      - Lots of files

      Not many people can afford to do both. As a result, a 'cartel' of sorts would be formed, where the top few servers serve to a majority of the users, and the rest of the servers, of which there may be 20 times or more of, all serve to the minority.

      If the 'hunter' wants to kill this group, what does he do? He wouldn't want to poison each one systematically -- he'd want to go after the big targets that everyone feeds from. This rating system would only help him expedite this process.

    3. Re:The easiest solution to fix poisoning... by plover · · Score: 3, Insightful
      But if I were the RIAA, my legions of henchmen would be voting down the servers that supply "stolen" music, and voting up the servers that supply poison. And they would meta-mod down anyone who disagrees with their votes.

      So to be useful, votes would require authentication in order to avoid ballot box stuffing. But authentication goes hand in glove with identification, and that's something the users of the P2P networks seem to be trying to avoid.

      Bottom line: voting is subject to the same poisoning that the files are subject to. It adds a layer of complexity that simply delays poisoning, but probably not for long. Hell, with the inevitable bugs (that end up denying users unpoisoned files) and long-term ineffectiveness, voting would probably be smiled upon by the RIAA.

      --
      John
    4. Re:The easiest solution to fix poisoning... by Blowit · · Score: 2, Insightful

      However, if the voting is ONLY allowed after a download, then this poisoning can be significantly reduced...

      --
      *Headline News* censorship shuts down the Internet! More at 6PM!
    5. Re:The easiest solution to fix poisoning... by KelsoLundeen · · Score: 2

      If I were the goddamn RIAA or the MPAA (Jack "Maddog ... Grrrr!!" Valenti, I mean) I'd focus a little bit on image enhancement.

      If I were the RIAA, I'd tell my employees to stop acting like a bunch of two-bit hackers start giving the customers what they want.

      Really, this whole thing -- from poisoning P2P network to authorizing legal hacks on 14 year old uers -- is absurd.

      Hilary and Jack "Maddog ... Grrrr!!!" Valenti oughta take their fingers from the sockets and start talking with users and figuring out how they can get users what they want and the users can give the RIAA and MPAA what they want.

      It's a long process, but I'll tell you one thing: the more the RIAA and MPAA keep employing the shock-trooper tactics, the less goodwill and grace (if such goodwill and grace ever existed, but I think it did -- at least in part) they're gonna get from Joe and Joe-elle Consumer.

    6. Re:The easiest solution to fix poisoning... by perljon · · Score: 2, Interesting

      This would be changing constantly. First of all, joining a P2P is pretty easy assuming it is open to the public. And, as I am out searching for enimem (they throw out a lot of poison), I download a poision file, and now, I am a) blocked from the network or b) passing out poison myself.

      A blocking system can't work fast enough.

      --
      This isn't the sig you are looking for... Carry on...
    7. Re:The easiest solution to fix poisoning... by NineNine · · Score: 2

      Absolutely right. And I think most people do this (including myself). The trick is when downloading is to look for the versions that are shared by the largest group of people. And of course, after you download, delete the bogus ones ASAP.

    8. Re:The easiest solution to fix poisoning... by simm_s · · Score: 2

      Have each user vote for each server they download from. If a specific server gives out bad files, the users would vote as a bad server. Then it would not be able to connect to the P2P network.

      A voting system can be abused by creating a large group of malicious users giving each other positive feedback. Andrew already mentioned this on his webpage. Routing on a P2P network may not be direct, so you may not be able to give a site bad feedback anyhow.

    9. Re:The easiest solution to fix poisoning... by ImaLamer · · Score: 2

      But your solution is going to involve too much interaction and it's just moderation and letting kids control the network.

      What I want to have in the future of P2P is system level protocols which require no user interaction.

    10. Re:The easiest solution to fix poisoning... by plover · · Score: 2
      Not gonna happen...

      The RIAA and MPAA want money. Lots of money. The kind of money they're used to. The P2P sharers want music. Lots of music. For free, just like they're used to.

      Everybody keeps ranting "why don't they find a business model that works?" Here's your answer: There isn't one; there won't be one; there can never be one. First, it's an argument of corporations vs. the marketplace. Can you speak for every P2P user? Can anyone even claim to? Of course not, no one can. So it's already a one-sided discussion. The industries have no incentive to "talk" to the marketplace, since their only feedback comes in the form of "no revenue, no sales" in any case.

      Jack and Hillary aren't stupid -- they've already figured that much out, so I think they've come up with a simple plan. They've decided to squeeze every last nickel from every last legitimate consumer until the whole production system implodes from lack of revenue. Their business plan is to get to be so rich now that they won't care when it implodes.

      Under this plan, Jack and Hillary have no need to talk to anybody except to placate their respective industries. "Studios, crank out those movies. Recording companies, press those discs. We're taking good care of the whole Internet for you. We promise we'll have this piracy thing licked about the same time we reach $1,000,000,000 net worth (each.) So keep your stock prices up, please."

      --
      John
    11. Re:The easiest solution to fix poisoning... by kiatoa · · Score: 2, Funny
      Browse slashdot at -1 sometime. Valid points are moderated down as "trolls" or "flamebait" if the moderator disagrees with them. Goatse links are often moderated up once or twice if they are placed in an otherwise normal-looking comment.

      Blatently wrong posts often make it up to +5 informative, while a reply to it that is accurate will only get a +2 insightful.



      I've thought about this a little and was wondering What would happen if slashdot started selling higher rated posts. Say for $5.00 I could buy 20 posts. I would tend to use them more judiciously but would have my posts start out at +2. Just a thought.

      --
      90% of the wealth is in 2% of the pockets. Bummer to be in the majority.
    12. Re:The easiest solution to fix poisoning... by mfos.org · · Score: 2, Funny
      Unfortunatly I doubt there is one easy way of keeping P2P unpoisoned. It's one of those thorny issues that appear simple but really turn out to big bastards, like cryptography.

      I was reminded of one of the AI Koans

      One day a student came to Moon and said: "I understand how to make a better garbage collector. We must keep a reference count of the pointers to each cons."

      Moon patiently told the student the following story:
      "One day a student came to Moon and said: `I understand how to make a better garbage collector...

      [Ed. note: Pure reference-count garbage collectors have problems with circular structures that point to themselves.]
    13. Re:The easiest solution to fix poisoning... by Archfeld · · Score: 2

      why is this flamebait, He/she brings up valid points. I beleive the system is valid, but there ARE ISSUES with Karma and the moderation system, as evidenced by a thorough test on slashdot. Rather than scrap the system I'd like to see some constructive idea's on how to fine tune it...

      --
      errr....umm...*whooosh* *whoosh* Is this thing on ?
    14. Re:The easiest solution to fix poisoning... by Archfeld · · Score: 2

      they just need to start using the metamoderation data they collect and apply it to your moderation total. If your moderations get overturned you should suffer a karmic hit. Not a perfect system but a double check at least. It would add value to meta moderation.

      --
      errr....umm...*whooosh* *whoosh* Is this thing on ?
    15. Re:The easiest solution to fix poisoning... by Golias · · Score: 3, Insightful
      The whole moderation thing is pointless anyway, because poisoning will not work. From the article:
      Flooding a network with spoofed files would drive users to more reliable music sources -- like the labels' own online sites.

      This statement is obviously false. Nobody will move to the labels' own online sites, because the label sites don't provide what they are looking for: lots of music in vanilla MP3 files with no sharing restrictions, license headaches, or some kind of goofy-assed "copy once" encryption scheme.

      Users who become frustrated with crapflooders on their favorite P2P network will simply move on to whatever the next emerging P2P network is, and those who want use poison tactics will play a losing game of whack-a-mole indefinately.

      --

      Information wants to be anthropomorphized.

    16. Re:The easiest solution to fix poisoning... by budgenator · · Score: 2

      Cool, built in targeting information. RIAA would target the top three servers each day, just scan the quality ratings.

      --
      Apocalypse Cancelled, Sorry, No Ticket Refunds
    17. Re:The easiest solution to fix poisoning... by billstewart · · Score: 2

      Not much - it's the old "Tentacles Of Medusa" problem. Depending on how complex your rating system is, the Poisoners will probably have to do a little work to give their tentacles lots of really great karma by saying they've shared zillions of files with each other, or whatever else it takes to game the voting system besides voting early and often.

      --

      Bill Stewart
      New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  2. One big problem: Lazy users by Anonymous Coward · · Score: 3, Insightful

    Many users, when they download a "poisoned" file, get a little angry... and then they move on WITHOUT deleting the file! This leaves it in the system on yet another node and increases the chances that someone else will download it from them. If users take a little more responsibility for the network, these files wouldn't spread very well at all.

  3. it's already poisonned by users by curseur · · Score: 3, Insightful

    Because most users download files and never check them.
    Really annoying especially with large files you've downloaded at 1kbps

    1. Re:it's already poisonned by users by garcia · · Score: 2

      I could never understand the LONG lists of available files which are not usable.

      In addition, anyone using ATTBI should be forewarned that you should remove ANY and ALL movies from your shared folder on any P2P network. The MPAA is reporting violations to ATTBI's legal demands center and ATTBI *is* disabling users who violate rules.

      I suggest the removal of all shared movies if you are on ANY ISP, but especially large cable modem networks.

    2. Re:it's already poisonned by users by AndroidCat · · Score: 2
      I could never understand the LONG lists of available files which are not usable.

      You've never run a BBS right? :^) The number of junk files uploaded even when they didn't need it for download ratios was amazing. Or uploading renamed copies of software already uploaded (with fscking BBS ads inserted into the zip to make size checks impossible.)

      --
      One line blog. I hear that they're called Twitters now.
    3. Re:it's already poisonned by users by AndroidCat · · Score: 2
      What also sucks is that KaZaA doesn't seem to remove nodes from is "check list" of IPs unless it gets a definite port refused response. So when I reboot and get bombarded with port 1214 requests, and my firewall drops the packet completely, they keep trying for weeks.

      No biggie, I pop up a web server on the port. KaZaA is close enough to HTTP to confuse them into going away as well as logging all their user/download info. :^)

      --
      One line blog. I hear that they're called Twitters now.
    4. Re:it's already poisonned by users by digitalsushi · · Score: 2

      I'm not trying to be snide, but cannot you just have your firewall send a 'rejected' packet instead of having it do a 'drop'?

      --
      slashdot: where everyone yells sarcastic metaphors to themselves to understand the issue
    5. Re:it's already poisonned by users by fishbowl · · Score: 2

      "I could never understand the LONG lists of
      available files which are not usable. "

      Are you talking about the people who post their playlists on a website, which is what you find when searching for a song title, but has the files themselves elsewhere?

      --
      -fb Everything not expressly forbidden is now mandatory.
    6. Re:it's already poisonned by users by Arcaeris · · Score: 2, Interesting

      It's more ridiculous than you might think.

      Searching on Kazaa yesterday for LOTR - The Two Towers (yes, I know, I'm such a pirate), I found about 4 files of 800MB - 1 GB in size. They all said, "Incomplete" or "Does not Work" in the filename.

      It's not just that these files exist and don't work, but that people have them and just don't care. With HDs getting so large, who can blame them, either? Even a gig here and there isn't hurting most people.

      So, I decided to download one of them and see. The Kazaa description was "LOTR - The Two Towers." The filename was "Eight-Legged Freaks Part 1 of 2." The actual file, upon some A/V work on my part, turned out to be several hours of audio from a trailer for The Scorpion King.

      I mean, Jesus. Sorry if it's a little off-topic.

  4. Obvious technical solution take 2 by Kragg · · Score: 3, Interesting

    Although this idea [checksums] works for newsgroups and some other centralized services, it does not with P2P. Basically, it comes down to the fact that you must trust whomever is actually doing the checksumming, or else they can just lie and publish false checksums. In the case of P2P networks, the checksumming is done by the same person you want to figure out if you can trust! As far as I know, this is an unresolvable problem.


    So, um... how about this... If it's a standard file, such as, say, the deviance rip of neverwinter nights, or the new MPEG of Two Towers, then it should always have the same checksum.

    Somebody somewhere needs to maintain a website with these checksums on. Then there's no dependence on the person who you're pulling the file from.

    Obviously doesn't work for random porn videos (although it would for more popular ones... which might also tell you whether they're any good).

    And there's nothing illegal about it.

    Problems?

    --
    If you can't see this, click here to enable sigs.
    1. Re:Obvious technical solution take 2 by Yarn · · Score: 2

      Yes, but by the time you've downloaded it to check the checksum you've wasted n hours downloading trash.

      --
      -Yarn - Rio Karma: Excellent
    2. Re:Obvious technical solution take 2 by Dooferlad · · Score: 2, Informative

      eDonkey 2000 / http://www.sharereactor.com do this. The eDonkey network works by using links (as in clickable on web pages) that contain MD4 sums of the file + file size to let users know about files on the network. It does have some searching capabilities but they are limited. This is persumably fixed in the new Overnet project the guy is doing.

      The files are all downloaded in segments from multiple sources, and you sometimes get bad segments, but they are only a fraction of the total file size so you don't really care.

      You just plain can't poison eDonkey / Overnet - it won't work. It is also the only network that I would be tempted to use to distribute real content since it is guaranteed that the user will get what you want them to.

    3. Re:Obvious technical solution take 2 by dattaway · · Score: 2

      Solution (difficulty level: easy :)

      We can GPG sign each megabyte of the files to be downloaded. If the P2P clients downloading from the infected server raise enough red flags, the server can be voted off the island.

    4. Re:Obvious technical solution take 2 by moonbender · · Score: 2

      As mentioned above, sites like ShareReactor pose as a single point of failure. The RIAA could (arguably) close down SR, which would be a tremendous loss for the ED2k P2P community. Of course, other sites or other ways of checksum distribution would spring up to try and fill that void - while not unpoisonable, I'd agree that the eDonkey network is very well defended against it.

      --
      Switch back to Slashdot's D1 system.
    5. Re:Obvious technical solution take 2 by s20451 · · Score: 2

      If the RIAA release songs which are already in the public domain, but titled incorrectly (e.g., release a repeating loop of "Happy Birthday" with the title of "Coldplay--Yellow.mp3"), then they can add the tag line without fear of losing anything. I'm sure they have enough lawyers to argue that the tag line applies to the content of the mp3 and not its title.

      --
      Toronto-area transit rider? Rate your ride.
    6. Re:Obvious technical solution take 2 by Kragg · · Score: 2

      Checksum plus filesize is pretty damn good.
      I'm not saying this is perfect, but it would help.

      Incidentally, in answer to another point raised somewhere round here, it's true that the p2p system is the one providing you with the checksum, but there's still 2 Good Things.
      - After you download, you won't run it... you can do the checksum test yourself
      - If it was built into the p2p system then it would be indisputable... unless the server was lying, in which case you know not to trust that server.

      --
      If you can't see this, click here to enable sigs.
    7. Re:Obvious technical solution take 2 by Jedi+Alec · · Score: 2, Funny

      GPG signed by whom?

      Well, who do you think?

      Attention, the triple X movie you're about to download has not passed Microsoft Digital Signing and could endanger your personal stability...

      --

      People replying to my sig annoy me. That's why I change it all the time.
    8. Re:Obvious technical solution take 2 by Jeppe+Salvesen · · Score: 2

      Not at all. I am in fact considering coding something like that. I'm envisioning a separate p2p network where md5 checksums along with moderated content is kept as synchronized as possible. Users can submit new files/checksums, but those should be peer reviewed in some yet-undecided manner. It should be possible to blacklist md5s (VERY efficient in stopping virus propagation, bad mp3s etc).

      Then, the different clients can interface to the content p2p network, so that users that are considering downloading a file can have a better guess at the authenticity and quality of the file - given that they build in support for passing hashes along with the general search results.

      I would actually like to see a system where the content database is so well maintained that all systems can use it as a central QA tool, enhancing the file sharing experience.

      And folks - just because the technology can be abused, does not mean that it is inherently evil. I just would like to have the quality raised in some way.

      The downside is that it will probably become a way to censor information to some extent. We just need to minimize the risks, and maximize the benefits.

      --

      Stop the brainwash

  5. Checksumming can work by Pedrito · · Score: 3, Informative

    I disagree with your suggestion that checksums can't work. A way they could work is as follows.

    Create a website with logins for the users. Users of this web site can create lists of checksum for the files they create or have downloaded and verified as valid.

    Other users can check any given user's list, and perhaps even post comments about the user's list, a form of moderation, if you will.

    The validity of any single file on any random user's list would certainly be questionable, but some lists would become "trusted" by the community through trial and error. Others would be recognized as bogus and ignored.

    Just a thought. Give me more than a few minutes and I might be able to come up with a better one.

    1. Re:Checksumming can work by anonymous+loser · · Score: 2
      This is exactly what is addressed in the second part of his answer to this question in the FAQ:
      Another idea that is often proposed is moderation, specifically "webs of trust." That is, people keep lists of people they trust, and then they implicitly trust (often with diminishing degree) the people they trust, and so on. In the context of P2P, the each user would then receive a "trust rating," reflecting the number of people that trust them. However, this can also be defeated fairly easily, by creating groups of malicious users that trust each other - then, untrustworthy users may have high scores leading to problems in the future. This kind of fraud has happened on eBay, where people give themselves recommendations to mislead future partners.

    2. Re:Checksumming can work by Pedrito · · Score: 2

      This kind of fraud has happened on eBay, where people give themselves recommendations to mislead future partners

      Yeah, and it's run e-bay into the ground. Oh wait, no it hasn't.

    3. Re:Checksumming can work by Hast · · Score: 2

      See the previous comment about tree hashes. After you have downloaded 1k or such you'll compare to the source hash you have. If it doesn't match either the server is compromised or you have a tansmission error. You'd just try downloading it again, and if it seems like the server is feeding you bogus data then you just chose a new one.

      Using cryptographic hashes will make it impossible to generate a bogus datastream which generates the same hash. (CRC32 and such are not as suited.)

    4. Re:Checksumming can work by anonymous+loser · · Score: 2

      EBay has benefitted significantly from PayPal in this regard, where even if you get screwed (which seems to happen regularly) you can recover your investment. How are you going to recover lost bandwidth once you've already downloaded something?

    5. Re:Checksumming can work by Pedrito · · Score: 2

      I've purchased dozens of items from E-Bay over a period of years. I've never once been screwed, though I have spotted some obvious fraud in the past.

      As for recovering lost bandwidth, you can't, but you can use checksumming along with moderation to improve reliability.

  6. Always a way by Lumpy · · Score: 5, Insightful

    Most of us who have been on P2P looking for files have been used to the fact that a large number of users are misconfigured (their firewall blocks your incoming request but heppily tells you they have the file you want) or are trading crap quality files. At that point you resort to brue force and using a bot to just grab everything it can to a large holding drive... a 40gig ide is dirt cheap and can easily hold the results of running a bot searching for "radiohead mp3" and grabbing EVERYTHING it finds over the course of about 3 days. but then you have to manually go in and delete all the crud, cruft and garbage. It's still faster than the old days of IRC trading but the signal to noise ratio has always been really bad.

    Granted poisining it can start to drive away the gimmie-gimmie crowd or the newbies.. but the hardcore and old-timers will stay and simply find a way around it. Hell a group of about 100 of us now have our own private open nap network going and we have only high quality known good files. any clients connecting not sharing or sharing crap are instantly banned/blackballed... so we do the moderation thing.. with a side requirement that you must be asked to join and prove your worthyness to us. Maybe that will be the direction P2P will go... back to the roots of IRC where you had to prove your worthyness, ratios were encforced, and real people made decisions to keep out the troublemakers...(RIAA) granted you dont get 30 bajillion users that way, but then you dont have to spend a night and 10 gig trying to find that song or file you want.

    --
    Do not look at laser with remaining good eye.
    1. Re:Always a way by warpSpeed · · Score: 3, Insightful
      Hell a group of about 100 of us now have our own private open nap network going and we have only high quality known good files.

      You hit upon a good theme here. To counter act the problems, the signal to noise ratio, poisoning, etc, users will have to PUT MORE EFFORT into downloading warz, and MP3s. The P2P networks will thrive, but you will not have as much of the global swap fests, and free warz that you can get now. The most the people poisining the P2P world can hope for is to increase the level of effort required to use P2P effectivly. And along the way they will create some stonger social ties between the users. Ultimately they will end up strenthening the whole P2P movement...

    2. Re:Always a way by wa1rus · · Score: 2, Insightful

      Granted poisining it can start to drive away the gimmie-gimmie crowd

      To be fair though, that's pretty much the point, isn't it.

    3. Re:Always a way by frank_adrian314159 · · Score: 2
      Hell a group of about 100 of us now have our own private open nap network going and we have only high quality known good files.

      And that's what the *AA want. As long as the networks split and isolate, they can monitor them and pick them off as they become big enough. Also, since being a member of a closed "pirating ring" is as good as an admission of conspiracy, they can start to use RICO laws, too. Yummy...

      In reality, the only safety in P2P for illegal sharing was its ubiquity. Once that's gone, you become an easy target. It's a lot easier to control five people than a mob of thousands.

      --
      That is all.
    4. Re:Always a way by Lumpy · · Score: 2

      but you're missing the point... The group will never become bigger than a hundred or so.. we wont allow anymore users. BUT... most users are also members of other groups so the source material propagates... espically cince I can get on the opennap chat room while downloading that royalty free and open for all free music I download I can let others know I'm looking for butthole-surfers early years mp3's I already look for about 4-5 different things for my friends in networks I know they arent members of.

      it propagates... and as soon as someone makes a way for me to link my cient program with Network1 and networ2 and allow a friend to ask my client to look for and download XYZ elsewhere then things will start really hopping.

      the circles overlap... creating a GIGANTIC trading network that is as big as what existed... but not poisoned or crappy.

      Oh and we only trade what we create, or free artists... really we do.... really.... believe me... please....

      --
      Do not look at laser with remaining good eye.
  7. IP address block banning by oliverthered · · Score: 2

    Why not block all IP's in RIAA/MPAA IP ranges and any ranges that are putting crap onto the network.

    --
    thank God the internet isn't a human right.
    1. Re:IP address block banning by Ubi_NL · · Score: 3, Funny

      what if the they take a few AOL accounts to do the poisoning: mind you that these have flexible IP adresses. Therefore you have to block all of AOL, which is A-OK by the RIAA I suppose...

      Or you could not live in the US and have no problem

      --

      If an experiment works, something has gone wrong.
    2. Re:IP address block banning by psavo · · Score: 2

      Therefore you have to block all of AOL, which is A-OK by the RIAA I suppose...

      That would be nice to see, RIAA sat on by AOL.. cos ultimately that would be a breach of AOL's terms of usage.

      --
      fucktard is a tenderhearted description
  8. Some comments on the conclusions... by decarelbitter · · Score: 3, Insightful

    From the webpage:
    In particular, our analysis of the model leads to four potential strategies, which can be used in conjunction:
    1. Randomly selecting and litigating against users engaging in piracy
    This seems to be the option which involves the least technological action. However, randomly wouldn't work, if it were only because the P2P users don't all live in the same country, hence different laws apply. So some sort of not-so-random selection proces has to be implemented.

    2. Creating fake users that carry (incorrectly named or damaged files)
    Modern P2P programs support downloading files from multiple sources. If someone downloads such a fake file and discovers it, the file will almost always be deleted. So, these files will not propagate through the network, or at least not as fast and as much as the correct files. So a search where one file can be downloaded from many sources is in this case preferable before one with not many nodes serving the same file.

    3. Broadcasting fake queries in order to degrade network performance
    Now this is an interesting thing. The makers of the P2P programs who are being targeted by fake queries could ban such users, or could build in a feature where the user of a P2P program can ban a host his/herself, so that it will be excluded in further searches.

    4. Selectively targeting litigation against the small percentage of users that carry the majority of the files
    Some users carry gigs and gigs of files, but that doesn't mean they're very popular. If I setup a server where I host my 20CD collection of Mozart works I'll probably won't get as much traffic as when I publish the Billboard 100. It's not the quantity, but the content of the files served that counts. Search for Britney and you'll receive 1000's of hits. Search for Planisphere and a lot less results will show up.

    Nevertheless it's a good paper.

  9. GPG signatures and web of trust by FreeUser · · Score: 5, Insightful

    The answer is quite simple, and would be very difficult for the sabateurs to subvert.

    GPG signatures (which BTW include a checksum) of content, with said signatures refering to an online alias rather than a real person (thereby maintaining anonymouty).

    A web of trust is formed, in which HollywoodDude is known and trusted, and has signed RipperGod's key, who in turn has signed FairUsers key, and so forth.

    Provide a separate way of obtaining the keys (e.g. multiple independent websites, multiple independent keyservers, and so forth), and people can simply filter out anything submitted by untrusted users. If something submitted by someone outside of the trust ring, and someone who is trusted sees the item and determines that it is worthwhile/good/whatever and not a decoy, they could sign the item themselves.

    Gaining trust would of course take time, probably requiring many worthwile submissions, but that is true in real life anyway, so why should it be any different online.

    If someone violates their trusted status (or their private key is stolen, which BTW would be a violation of the law), others in the ring of trust could revoke their trusted access and blacklist their signature.

    It isn't as convinient as just being able to share something with little or no thought, but it is emminently doable, and there really is no straightforward way to undermine such an approach.

    --
    The Future of Human Evolution: Autonomy
    1. Re:GPG signatures and web of trust by Salamander · · Score: 2, Insightful
      It isn't as convinient as just being able to share something with little or no thought

      That's exactly what the paper's authors said, pointing out that the decrease in convenience is in itself a real danger, and they were right.

      --
      Slashdot - News for Herds. Stuff that Splatters.
    2. Re:GPG signatures and web of trust by ajs · · Score: 2

      You don't need to know who authored the file. My suggestion of long ago is this: maintain a service serperate from gnutella that rates content. Refer to that content by name, but include MD5 signatures. The first signature is for the first 1k, the second signature is for the first 10k and so on, through the logarithmic orders of magnitude, base 10. One final signature would be for the whole file.

      Now web sites can present reviews that tie into this new protocol with a URL (something like "gsig://sigs.mediahype.net/ab3827d9827eab39f2c-1") and that URL is then submitted to a signature-aware gnutella client which contacts the signature server, downloads the filename and signatures and then gets that file from Gnutella. The file download will be aborted if the signature fails to match at any of the signatures, or it will be aborted immediately if the file size is smaller or larger than the one in the signature server.

      Sure, you can still put out a 10-second clip with empty noise after, but the download will stop at that 10-second mark. What's more: a smart client can keep the section that DID match the signatures and look for an intact copy to CONTINUE from. Thus, truncated versions will now be ignored immediately.

      This introduces a centralized client-server model for trust purposes, but reviewers are not providing content, just reviewing it. The MPAA and RIAA could even put up servers that review valid promotional content, and warn users of copyright violations in other files! *This* is the way to solve everyone's problems at once (unless of course your problem happens to be a failing business model).

    3. Re:GPG signatures and web of trust by ImaLamer · · Score: 2

      Great.

      So when the MPAA downloads Star Wars Attack of The Clones they know that I'm the one who ripped it!

      I'm not going to put my GPG (PGP) signature on a document with plans to hijack planes either.

    4. Re:GPG signatures and web of trust by evilviper · · Score: 2

      No, I'm afraid that system would just not be able to take off. Far too much inital setup, and a lot of continual maintenance.

      What is needed is source obfustication. Instead of connecting directly to a node with the files, we lay the network out like a system of routers. Each node can only communicate with it's neighbor, which means you can only know the next hop, never the source. Without a doubt this exponentially decreases download speeds (each node downloads and uploads the file before it gets to you) but with swarming, and dynamic metrics for each node, it could work out. Of course caching would be the obvious next step.

      As for overloading the network with crap, go right ahead. I'm more than happy to waste RIAA/MPAA bandwidth time and time again. But that's just me.

      For a solution, let's be democratic about it. When we search for a file, we find that 10 people have it. If 8 have marked it as being a fake, I probably won't download it.

      So that takes care of everything except 3. Broadcasting fake queries in order to degrade network performance. The downside of p2p networks is that they have a lot of overhead. Privacy demands overhead. However, if one is sending excessive requests, they may be blocked temporarily. Of course the lower the TTL, the higher the tolerance for a large number of searches.

      Any questions?

      --
      Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
    5. Re:GPG signatures and web of trust by FreeUser · · Score: 2
      Great.

      So when the MPAA downloads Star Wars Attack of The Clones they know that I'm the one who ripped it!


      Go back and read my comment. The comment, not the title. To wit:


      GPG signatures (which BTW include a checksum) of content, with said signatures refering to an online alias rather than a real person (thereby maintaining anonymouty).


      There is absolutely nothing about GPG that requires the key to refer to an actual, human identity. If everyone knows that TrustedDude is a trustworthy person, that is sufficient. No one needs to know that TrustedDude is in fact a 15 year old kid in New Jersey who spends his free time violating copyright (or perhaps not, there are all kinds of legitimate uses for P2P networks, not least among them improved accessibility to popular legal content, like free software whose primary ftp servers are often overloaded).
      --
      The Future of Human Evolution: Autonomy
    6. Re:GPG signatures and web of trust by Saeger · · Score: 2
      You just better make sure that your pseudo private key is physically secure and securely passphrased when the MIBs show up to tie you to your evil nym. :)

      --

      --
      Power to the Peaceful
    7. Re:GPG signatures and web of trust by jbolden · · Score: 2

      Huh? Its not hard to figure out who heavy Kaaza users are. Most of them are either: minors or in college. Once you have the physical address getting the name the right person isn't hard; and then getting physical access to the computer will not be hard, "Hello, Mr Smith I'm Joe White from the RIAA it appears you son Michael has been involved in trafficking pirated music. We'd like to handle this without police involvement, do you mind come in clean the stolen files off your son's computer? Of course you can watch if you like while we do this..." While they are there they take the key...

      5000 visits at $200 visit = $1m not much money at all. And certainly enough to shatter any rings of trust.

    8. Re:GPG signatures and web of trust by ImaLamer · · Score: 2

      But having an online alias which stands in place of your true identity is a single point of failure.

      For me to earn 'mod points' I would need to keep the same nick/alias. If I do that I'm leaving a solid, airtight case for the Secret Service or the FBI.

      Basically putting my thumb print on the buddy glove. Sure, they find the glove - but it fits my hand, millions of times.

    9. Re:GPG signatures and web of trust by FreeUser · · Score: 2

      But having an online alias which stands in place of your true identity is a single point of failure.

      Only if you are the sole person in the ring of trust.

      The ring of trust need not include all the thousands of people who subscribe to the USENET list or P2P service, just a core of people who trust each other and the quality of their submissions. Others (the majority who just lurk and occasionally download stuff) could obtain the keyring from a third party (or better yet, several independent third parties.

      If your public key is compromised, one of the others can revoke it, and sign any good submissions from you themselves.

      For me to earn 'mod points' I would need to keep the same nick/alias. If I do that I'm leaving a solid, airtight case for the Secret Service or the FBI.

      Basically putting my thumb print on the buddy glove. Sure, they find the glove - but it fits my hand, millions of times.


      Use double blind remailer techniques (USENET) or FreeNet. See the cypherpunks information for details ... there are ways to maintain an online identity that is completely decoupled from your real self. It takes a little work, and requires you to be careful (e.g. don't go attaching your .signature file to an anonymous remailing), but it isn't rocket science.

      --
      The Future of Human Evolution: Autonomy
  10. faked hashes by vurtigo · · Score: 3, Interesting

    The problem faked hashes can be addressed using trees of checksums rather than just a simple checksum although a workable implementation would require embedding into the P2P protocol.

    The idea is you break the file up into smallish sized blocks (100k or so) and generate a hash for each one of these. For each 8 first level hashes, you feed them into a crypto hash function to generate a second level hash. For each 8 second level hashes... you generate a third level hash. This allows a continuous (per 100k blocks) proof that the content is valid... The size of the proof grows with the log of the content so it is not much of a problem.

    1. Re:faked hashes by bje2 · · Score: 2

      i think you're missing the point, becuase then you still need to have some sort of centralized listing of what the values/file sizes/etc, of all the files should be...that kind of ruins the point of the P2P network...

      --

      "Facts are meaningless. You could use facts to prove anything that's even remotely true." - Homer Simpson
    2. Re:faked hashes by vurtigo · · Score: 2, Insightful

      That doesn't address the problem. The problem is that in order to compute the hash function, you have to have already downloaded the entire file. What one needs is a mechanism that allows you to detect cheaters in the protocol nearly continuously.

      The appropriate reference to look for in the cryptographic literature is signing digital streams. For the compartively simple problem of P2P (where you don't have to worry about lossy channels), the tree based mechanism is pretty easy to implement.

  11. They Don't Need to Poison P2P by Anonymous Coward · · Score: 5, Insightful

    The RIAA/MPAA don't need to poison P2P networks. Nor do they need to use lawsuits and the threat of DMCA. The easiest, best way to stop illegal sharing of copyrighted materials is to provide a legal, reasonably priced electronic distribution alternative.

    Really. Most users, given the choice, will pick the "honest" legal way to get their music and videos. Will there still be pirates? Of course, but you can never stop them and, heck, you're not losing money on them anyway. They wouldn't spend the money on the music.

    Treat honest customers as honest, embrace new distribution methods. The problems go away. Think of the cost savings: they wouldn't have to buy any more senators.

    1. Re:They Don't Need to Poison P2P by mark-t · · Score: 4, Insightful
      Really. Most users, given the choice, will pick the "honest" legal way to get their music and videos. Will there still be pirates? Of course, but you can never stop them and, heck, you're not losing money on them anyway. They wouldn't spend the money on the music
      In fact, really... most users, given the choice will take the least expensive road available to them as long as their chances of being caught are minimal, and as long as it doesn't involve stealing anything tangible. If you think most people are decent, law abiding citizens, why not take a poll and see what percentage of drivers nowingly speed? The fact is that Piracy is perceived by many as a "victimless crime", so there's no justification for a law against it in most people's opinions. These people will continue to violate the law so long as they feel they can continue to get away with it.

      While lowering the price of the media would make *some* difference, it wouldn't make enough of a difference to be worthwhile.

    2. Re:They Don't Need to Poison P2P by PhilHibbs · · Score: 2
      Piracy is perceived by many as a "victimless crime", so there's no justification for a law against it in most people's opinions.
      And given that the US is supposed to be a democracy, doesn't that mean that it shouldn't be a crime?
    3. Re:They Don't Need to Poison P2P by mark-t · · Score: 2
      That doesn't matter.

      Remember, speeding on an open road where there are no other cars is a victimless crime too... but it's still illegal.

      Furthermore, piracy isn't victimless. If it were, changing the prices wouldn't make *ANY* difference, but it does. Think about it.

    4. Re:They Don't Need to Poison P2P by Reziac · · Score: 2

      I'd rephrase that to "most users will pick the *easiest* way (not necessarily the cheapest or most honest). But the principle is the same. Make it *easier* to find the desired MP3 from an RIAA server, make the downloads reliable-quality, dirt-cheap, and encumbrance-free, and no one will bother with the perils and pitfalls of P2P.

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    5. Re:They Don't Need to Poison P2P by captaineo · · Score: 2

      I totally agree with you... The problem is that the accessibility of the internet flies in the face of traditional licensing models for film and music. Record and film studios are accustomed to portioning out the rights to their products in nice tidy chunks - US broadcast rights for year X, European broadcast rights for year Y, VHS disribution rights for Africa, airline/radio play rights, sequel rights, rights to re-use the content in other productions, etc. (it used to be that natural borders kept these categories separate; now they have to use artifical borders like the DVD region coding system).

      But with internet distribution there is only one "right" to sell: once the content is on the net, anyone can get it, anywhere, anytime. While a tremendous boon for consumers, this completely destroys the old, picture-perfect system of nice little independent packages of "rights." And that is why traditional media companies are keeping their heads buried in the sand, horrified at the collapse of their nice neat rights packages, hoping that this whole internet distribution thing will finally blow over. They are praying for ubiquitous DRM systems to re-create all those nice little borders...

    6. Re:They Don't Need to Poison P2P by mark-t · · Score: 2

      Just because you don't see a reason for a law doesn't invalidate the law. The simple fact of the matter is that they own the media and are allowed, by virtue of free enterprise, to charge as much as they damn well please for it. You are free to voice your complaints about their pricing, but even if your complaints fall on deaf ears, that does not give you the license to copy their stuff outside the bounds of fair use (what constitutes fair use is explicitly outlined in that section of the copyright act).

    7. Re:They Don't Need to Poison P2P by Saeger · · Score: 2
      Just because you don't see a reason for a law doesn't invalidate the law.

      Sure it does; if enough people flaunt it.

      As John Perry Barlow said:

      "[IP] laws are so practically unenforceable and breaking them has become so socially acceptable that only a thin minority appears compelled...to obey them.... Whenever there is such profound divergence between the law and social practice, it is not society that adapts."

      --

      --
      Power to the Peaceful
    8. Re:They Don't Need to Poison P2P by mark-t · · Score: 2

      And I suppose if enough people went around killing other people too, they wouldn't be able to have laws against that either. If you don't draw the line at where the law is already laid, you can't stop it from being pushed back.

    9. Re:They Don't Need to Poison P2P by mark-t · · Score: 2

      I might also add that it is _because_ of people flaunting the law that stupid laws like the DMCA came into existence, vastly limiting the honest person's ability to copy materials in manners commensurate with fair use. Piracy is *NOT* victimless.

    10. Re:They Don't Need to Poison P2P by ivan256 · · Score: 2

      If you think most people are decent, law abiding citizens, why not take a poll and see what percentage of drivers nowingly speed?

      I'm going to turn your bad analogy around on you.

      That a large number of people knowingly speed is exactly why you are wrong. People don't want to wait, and they don't want to be hassled. If people could get music online quickly and easily for very cheap they would probably choose that over using P2P, because it would save time, and they'd get guaranteed quality. People don't want to wait a long time for a file that may or may not finish downloading and may or may not be what it says it is, but that's the only option available right now. The key is that the price has to be low enough that users won't give a second thought to paying it. If you want to hear that song, and it's $1.50, you might hesitate and not buy it. If it's only $0.20 then you might just buy it and not worry about the price. The upside to that is that it would bring the price of music back down to a reasonable level at the same time.

      If I was automatically billed $0.50 for use of the highway on the way home from work I would happily pay it if it meant that I could go 100mph without getting a ticket. (I don't want to slow down for tool booths though.)

    11. Re:They Don't Need to Poison P2P by Saeger · · Score: 3, Insightful
      Piracy is *NOT* victimless.

      "Copyright Infringement" is *NOT* piracy.

      Thanks.

      --

      --
      Power to the Peaceful
    12. Re:They Don't Need to Poison P2P by mark-t · · Score: 2
      You are right, people choose convenience over price almost every time. But given a convenient enough option that happens to cost less, I can pretty much guarantee that people will take it.

      And by the way... my wife told me off for that message when she saw it as well... apparently she wasn't too impressed with my opinions of humanity in general either. Oh well... I'll choose to blame it on a poor night's sleep and being woken up at 6:30 AM by a road construction crew. :)

  12. So if I try to download the latest.. by Anonymous Coward · · Score: 4, Funny

    tune, I may end up with somthing thats bland, repetitive and annoying.

    And, pray tell, how am I supposed to know the difference?

  13. Simple! by Eric_Cartman_South_P · · Score: 5, Funny
    Everyone posting a real song should name it beggining with, "RIAA sucks, fair use is good, and Disney love$ politicin$". They would never want to spread such text, so every song name beggining with the text simply MUST be real.

    1. Re:Simple! by decathexis · · Score: 4, Interesting
      A more 'toothful' modification of this idea would be to require all files to include some DMCA-protected text, like DeCSS.

      Or, maybe, a "licence":

      By making this File available on the Network, directly or through an Agent, the Distributor hereby gives up any and all Rights to its Content, as well as any other Works of Art matching this File in name.


      Having distributed content together with such licenses (or hired someone to do so), it might be a bit harder for the labels to defend copyright claims for individual songs.

  14. Trust webs by nuggz · · Score: 2

    I think webs of trust are a good idea.
    Poisoning such a web could prove difficult. I trust personal friends highly, the aren't a poisoning group.
    People I or they don't know well won't get a high trust rating, and would be suspected if they were poisoning the group.

    I think slashdot type moderation works well too, combined with a decent sized web of trust should be a pretty stable system

  15. From the article... by Fizzlewhiff · · Score: 2

    Flooding a network with spoofed files would drive users to more reliable music sources -- like the labels' own online sites.

    The problem is the labels don't have their own online sites. Sooner or later (its bound to happen) the labels are gonna hire some college grads who grew up on sharing and understand the problem. Maybe then a compromise will be reached.

    --

    'Same speed C but faster'
  16. Tree Hash EXchange (THEX) by Orasis · · Score: 2

    The crew at the Open Content Network have released a specification for serializing hash trees. The specification is called the Tree Hash EXchange (THEX) and is being implmented in both the Open Content Network and Gnutella. Furthermore, this specification is compatible with the TigerTree hashes used for Bitzi.

  17. I agree and always have, but.... by FallLine · · Score: 3, Insightful

    If this is what people are forced to do to achieve Napster-like results, then RIAA et. al have basically won all that they set out to achieve. By raising the bar high enough and by forcing higher transaction costs on the users, industry effectively shuts internet piracy out for 99.9% of the population. Of course people like me, that 1% or whatever it is, will always be able to circumvent whatever they throw in my path (presuming that I'm willing and wanting to do so of course). However, that number is so small that they really would not bother spending much effort to enforce from a simple cost / benefit point of view. Why spend millions in legal and related fees to track down a group of consumers that only account for half that amount? They won't bother, like they didn't really before Napster came along.

    In fact, I would further argue, against the conventional wisdom on slashdot, that RIAA has basically won the war against P2P and other forms of mass piracy. At least once they shut out networks such as Fasttrack, and let it be known that there will no financial return for those that fund the development of piracy networks. Certainly the average Schmoe can download that super popular song via GNUtella with some effort, but getting much more than that like, say, the entire album at decent quality from same artist, is like trying to extract blood from a rock. That is not to say that they will retire their guns, but rather that it will just be an on-going series of small battles, more like maintenance, to hammer down any network, system, or device that pops up and starts to hemmorage their intellectual property.

    1. Re:I agree and always have, but.... by swb · · Score: 2

      but getting much more than that like, say, the entire album at decent quality from same artist, is like trying to extract blood from a rock.

      I (sadly) only started using Napster about a year before it got shut down, but I never found it a particularly good source for downloading an entire album, especially one in the same bitrate and overall quality. I thought that was nearly impossible.

      I'd say overall that only about 75% of the stuff was worth keeping (eg, 128kbps+, no skips/cutoffs/distortion) and I searched for mostly mainstream stuff (rock n roll). I got a fair amount of cutoff tunes, tunes with skips in the middle or just bad overall audio quality.

      I'd agree thought that the RIAA has effectively killed off P2P, except for people that make a serious effort at maintaining their own networks or of putting real resources towards mining gnutella-type networks.

    2. Re:I agree and always have, but.... by FallLine · · Score: 2
      I (sadly) only started using Napster about a year before it got shut down, but I never found it a particularly good source for downloading an entire album, especially one in the same bitrate and overall quality. I thought that was nearly impossible.

      I'd say overall that only about 75% of the stuff was worth keeping (eg, 128kbps+, no skips/cutoffs/distortion) and I searched for mostly mainstream stuff (rock n roll). I got a fair amount of cutoff tunes, tunes with skips in the middle or just bad overall audio quality.
      While I agree that Napster was hardly ideal at this, it was VASTLY better than the current alternatives and it was actually quite workable if you knew how to take advantage of it. Namely, you find all the users that have a good organized collection of kinds of files that you're interested in on a decent network connection, add them to your hotlist, browse their lists directly, and download exclusively from them. I discovered these users, in the first place, by improving my search method by searching for directories (folders), rather than files, and by searching for higher bit rate mp3s (since high quality tends to imply a more caring user). When you sort by path and/or username it becomes quite evident when someone has a large collection of good music. Of course, this kind of technique was out of the technical reach of most of napster's users at the time...but it was effective. These same techniques are crippled on today's "P2P" networks because you have (in reality, not their claims) a much much smaller set of users to search from, horrible latency, and volatility of the network makes finding a user 5 minutes later, never mind a couple weeks later, quite unlikely....plus the bad searching and listing interfaces...ick.
  18. Doesn't Sharezilla do this too? by Ride-My-Rocket · · Score: 2

    I just started using it last week -- I think I remember something whereby each file has some type of key / checksum (I'm not too familiar with the nuances of encryption)........... but I could be wrong.

  19. Not really working... by Kjella · · Score: 3, Insightful

    Checksumming - no good. Any program could pretend to have the right checksum, but send false data. No point in figuring out *afterwards* the download is corrupt.

    Webs of trust - hardly. Imagine a network of antis giving eachother good reviews, they'd certainly be better off than someone without any reviews at all. It's very *unlikely* that the one you're P2P'ing with has a trust chain you accept.

    "Database" of who are good traders and not - Fake databases would screw that, you wouldn't know which ones to trust as you have no central server. The problem is that if there's to be any real P2P exchange happening, it's usually *strangers* meeting.

    My friends could do a web of trust or a database, but then we'd much more likely to setup some mutual leech ftp servers instead and skip the entire P2P-networks.

    Kjella

    --
    Live today, because you never know what tomorrow brings
    1. Re:Not really working... by slithytove · · Score: 2

      You seem to be suggesting the same thing as the author. The point you and he are missing is that, while check-sums and trust-chains can be poisoned, the scale on which it would have to happen is impractical. If a bunch of RIAA bots all trust eachother and I make the mistake of trusting them about a particular checksum, it will be the last time. That whole chain of trust will be suspect and that particular filename/checksum will be on my ignore list. All I need is to add a few regular file traders keys to my trusted list and I will be able to find what I want. Additionally as reliable chains grow in size, fake ones will also have to grow to be appealing to newbies.
      File trading was one of the first things I used the internet for, over a bbs shell using irc and ftp. It wasnt too hard for my 11yr old mind then and the choices have only increased and become easier since then. True, the RIAA, if it blew a huge wad hiring script-kiddies could make things a little more complicated for the first time ever, but I suspect that the progress in p2p would only start increasing faster since new features would be required.

    2. Re:Not really working... by leviramsey · · Score: 2

      That's the thing. IRC warezing won't die. FTP/HTTP warezing won't die. But there's no effective way to do a massive search of those media. In essence, the volume of trading will go down if KaZaa and friends are sufficiently poisoned. The RIAA will tolerate a small amount of piracy, especially by the technically literate, because it's too expensive to go after them. When the system becomes larger and more widespread, allowing anybody who can type and click a moust to get the files, the RIAA will swoop in.

  20. Re:Don't talk lame trash... by moonbender · · Score: 2
    You obviously aren't very experienced in the whole warez scene thing. Not that I'd blame you, but the original poster knows more about it than you do from reading PC Magazine.
    I say you're talking lame trash, unless you host it on YOUR site. YOU be the victim of **IA lawsuits. Unless you post a link to a site where you plan to host such a wonderfull page, shut the f**k up.
    There already are plenty of these sites, others mention concrete examples. So far, they have not yet had a problem with the RIAA, perhabs because what they do (provice checksums, not files) is not illegal, perhabs because the RIAA does not yet consider them relevant.
    However, in any case, it is way easier to spread checksums by various means - internet boards, email lists, usenet, IRC - than spreading the actual file. If the situation arises, and the P2P net is "poisoned" with invalid files (and invalid checksums) I'm sure it won't be hard to acquire the valid checksums and download the correct files. Of course, "poisoned" clients sending out fake files with wrong checksums will still be a problem.
    On a more technical issue, you you really think different rips of the same movie will have the same checksum? What if one rip is one second longer or shorter? Or the ripping prog compreses it in a slightlt different way? bang... different checksums.
    Why would there be different rips? Typically, each movie is released only once (by groups specialising in it), all other releases are "dupes" and are not do be distributed. The same is true for virtually any sub-category of the scene, such games ISO/RIP, utils and audio.
    --
    Switch back to Slashdot's D1 system.
  21. Use Limewire by asv108 · · Score: 4, Informative

    The latest versions of limewire use hashes from a specification called HUGE that probably defeat this type of posioning attack. You can check out a recent interview with limewire team here. Go here if you want to download the code or check out the dev docs(Which are pretty outdated).

  22. Cripes -- did anyone proof this paper? by KelsoLundeen · · Score: 3, Insightful

    What the second-to-last paragraph in the paper? There's a missing word. A pretty important word, too. (How can this paper be featured all over the map and have an error like this?)

    Anyway, is it:

    "Or perhaps the carrying capacity of a well-designed P2P network is huge, and *NO* amount of flooding can overwhelm the network."

    Or:

    "Or perhaps the carrying capacity of a well-designed P2P network is huge, and *ANY* amount of flooding can overwhelm the network."

    Which is it: "no" or "any?"

  23. Distributed trust and peer review by Jim+McCoy · · Score: 5, Insightful

    I love the smell of undergraduate sophistry in the morning...

    The author of this paper seems to suffer from the common practice of those in a hurry to finish their term papers that if they somehow ignore the elephant in the room that disproves their point they might end up getting partial credit for impressing people with how well they can tap dance around the elephant. In this case the well-established practice of using a secure hash function as a self-verifying mechanism to prevent DoS attacks that try to flood a network with garbage files is the elephant.

    In his FAQ regarding the paper, Mr. Chen correctly addresses the problem of a lack of centralized authority in using hash functions as distributed/P2P but apparently did not make more than a cursory examination of the subject or else he would have seen the various methods available for solving such a problem. I can only assume this is the case because reputation systems beyond simple moderation are not addressed and flow-constrained trust networks are never mentioned in this section.

    As someone who seeks to pass off a "bad" file (this report) as a "good" file, perhaps sooner rather than later Mr. Chen will learn how the distributed moderation and trust system known as peer reputation works. Surely I am not the only one who finds it more than a little ironic that a paper by an author who claims that distributed moderation doesn't work is being submitted to a peer-reviewed journal in an attempt by the author to bootstrap his own reputation?

    1. Re:Distributed trust and peer review by Salamander · · Score: 2

      It's also funny to see you present as a solved problem something that's actually a very active area of research and pretty much still in its infancy. If you ask ten people who've been working on peer reputation how it works, you'll probably get five saying "it doesn't...yet" and the other five giving you five (or more) different algorithms. You're probably correct that there's a solution in there somewhere, but please don't make people think all the interesting stuff in that area has already been done.

      In other words, watch out for that elephant. ;-)

      --
      Slashdot - News for Herds. Stuff that Splatters.
    2. Re:Distributed trust and peer review by Jim+McCoy · · Score: 2

      If I implied that the problem is "solved" then I was being a bit too enthusiastic about the directions that current research and practice in this area is evolving. There are many different approaches to this problem (distriburted trust management) and the theoretical groundwork is already in place for several "solutions" which deal with the specific applications being described here. The case of "poisoned files" is actually a much simpler subset of distributed trust management problem because it deals with a simple good/bad distinction and it is a case where it is relatively easy to use simple emergent effects like voting can provide effective protection from these sorts of simple attacks upon a p2p system. [Before you jump in with a "but the RIAA can cast millions of votes" please check out the flow-constrained networks Raph Levien is working on and the application of clustering mechanisms to simple reputation systems.]

      EBay has also provided a good source of research material and several papers in the past few years have provided in-depth analysis of the effectiveness of existing EBay mechanisms in dealing with semi-anonymous/pseudonymous peer-to-peer transactions where there is a significant incentive for dishonest behavior. In a situation like EBay where real money is changing hands the incentives for cheating is strong, yet simple reputation systems solve the largest class of such problems and simple refinements like cluster matching among those who cast votes. Raph Levien and others have also made significant strides in going beyond the theroetical and into the direct application of reputation and distrubted trust management to distributed peer groups (check out the www.advogato.org trust metric system and Raph's Ph.D. thesis for more info.) I am not denying that there is an elephant in this room as well, but it is a wounded elephant who is soon to be "pining for the fjords" if you catch my drift...

      The general case problem of distributed trust management is not solved, there are several tools available to coders that require minimal effort to implement but which can keep the p2p user a step or two ahead of the simple attacks described in the original paper.

    3. Re:Distributed trust and peer review by Jim+McCoy · · Score: 3, Interesting
      Mr. Chen correctly points out that an attacker can easily forge the hash values it reports to the network. self-verification won't happen until the user has downloaded a good portion (if not all) of the file. At that point the attack has already been successful.


      You can send out a bad copy once, but if well-known and trusted copies already exist on the network you are not going to be able to replace these with bad copies, the self-verification does not prevent the single-point attack you describe, it prevents the propogation of this attack throughout the network. If an attacker serves up bad files (ones that do not match the SHA1 hash advertised) then the downloader should treat the host as malfunctioning and query a more reliable source. The downloading agent does not need to unpack the file and see what is inside, it just checks the SHA1 hash and then can simple assume that there was a transmission error and try another source. Eventually the malicious node will be trimmed from everyone else's peer list and a new node identity will have to be generated and the game starts again.


      This single attack costs the attacker as much as it does the downloader (and you can bet the RIAA is paying more per MB of data sent than someone downloading the data via a DSL or cable modem line) and a few simple changes to the system like favoring trusted peers (ones who have not given you mismatched hash/payload data) as the first nodes to query and only moving down the local reputation food chain if you need to expand your query or search for alternate sources. Unless an attacker can pretend to be a vast majority of the nodes in the system it is not going to be able to make this attack scale-up in the manner you suggest.


      There is a difference between an attack that works on a single download and an attack that would be viable for a network-wide assault. The case you and Mr. Chen bring up here is clearly in the first category, an inconvenience for individual users but not something that will be a significant problem for the network as a whole.

      Moderation and peer reputation require some method of recording "ratings" of users on the network. Something not present in the current Gnutella network. But if implemented, it would have to be distributed as well. This means that there, at some point, must be a blind trust between clients to complete these "ratings". That blind trust will lead to poisioning of the ratings system and make it worthless.


      "Ring of trust" simply does not work in a distributed environment that is truly open to anyone. Closed distributed environments, or virtually closed environments within an open environment would be the only way. However new users would not be able to enter them and that is how Gnutella keeps itself alive.


      Which is why I think that things like Raph Levien's work in reputation systems (and actually coding up working examples of such a system, see refs below) are rather attractive because they solve this specific problem in a rather elegant fashion and make such simplistic attacks much more difficult and expensive to pull off. [Here's a quick hint: Have you ever noticed that most people seem to care about Roger Ebert's opinion rather than yours when it comes to what movies to go see? This is because distributed trust system can deal with voter flooding attacks by limiting how much influence comes from untrusted sources.]


      You seem to think, Mr. McCoy, that there are obvious solutions. Yet you really don't present any nor do you present any existing real-world examples.



      One of the problems I addressed in the original paper was the fact that it was poorly researched in certain aspects. It seems that everyone is too lazy to actually do any research these days, but since spending five minutes doing google searches on various terms related to reputation systems seems to be too much work for either you or Mr. Chen, here is a quick summary of a few minutes work (although I selected papers that I am familiar with after google returned a hit).


      1) For starters look at Google itself. Google is the single biggest distributed reputation system in the internet. That is what a pagerank is, the "repuation" of a particular link for a particular subject using link count as the voting mechanism. It can be attacked and subverted on a small scale as various Google-juicing experiments prove, but it is also very effective at filtering out these attacks (see some of the Scientology google-juicing wars to see how hard it is to really influence a massively distributed reputatioon system implemented my people who know how to pick the best ideas from current research and invent a few of their own.


      2) EBay seller rankings. These can also be attacked and tweaked, but even when money is involved (making the incentive for dishonest behavior very high, much more so than any p2p system will ever have to deal with) EBay manages to keep fraud to a manageable level and recent research into seller/buyer identity-blinding and reputation cluster filtering can make the seller ranking system even more attack-resistant.


      3) Amazon buyer ratings and recommendations. Yet another example of a real-world distributed trust management system.


      4) Advogato is a community forum site that implements some of Raph's Ph.D. work in reputaitons and distrubted trust management to create a flow-constrained reputation system that has some very good attack-resistance characteristics. Raph has been running Advogato using his distrubted trust metric for several years now.


      5) Pattie Maes' agents group at MIT, specifically the Yenta reputation clustering system but just about everything to come out of this group is a source of good ideas and practical research in this area.


      6) Check out some of the available research bibliographies (like this) and places like citeseer for other research in the subject.


      One thing you will notice about these real-world examples is that none of the systems tries to be "perfect", just good enough to get the job done.

  24. I disagree by FallLine · · Score: 2
    This seems to be the option which involves the least technological action. However, randomly wouldn't work, if it were only because the P2P users don't all live in the same country, hence different laws apply. So some sort of not-so-random selection proces has to be implemented.
    I disagree. I think this would be a highly effective means, should it become necessary. Once you eliminate US based servers you've already removed some 90% of the acceptible providers for US citizens. When you further remove those highly developed countries that have close ties with the United States, which are apt to go along with RIAA when force is brought to bear, then you will leave the remaining pool of servers to 1% or so of what it was. That 1% cannot sustain even 1% though, because the demand will be so high that it will effectively block all practical use. Now, mind you, this cooperation need not require super-active law enforcement or anything to that effect. In fact, I would argue that the the relatively simple complusion of prompt response from the servers' ISPs for suspension of service for, say, 90 days suspension of service would be more than enough to deter the file servers given that there is no benefit for being a file server and every reason not to be.

    Modern P2P programs support downloading files from multiple sources. If someone downloads such a fake file and discovers it, the file will almost always be deleted. So, these files will not propagate through the network, or at least not as fast and as much as the correct files. So a search where one file can be downloaded from many sources is in this case preferable before one with not many nodes serving the same file.
    Again, I disagree. It has been my experience than many users do not delete damaged files, they simply leave them. The so-called swarmed downloads only further expose the downloads to corruption since all it really takes is one corrupt segment to either cause the program to crash or at least play really unbearable sound (or whatever media). To further compound the problem, the industry could use their cash and their legitimacy to be the most available and desirable servers (so that your swarmed downloads are almost certain to select its servers).

    Now this is an interesting thing. The makers of the P2P programs who are being targeted by fake queries could ban such users, or could build in a feature where the user of a P2P program can ban a host his/herself, so that it will be excluded in further searche
    This is impossible in any current decentralized P2P scheme, don't you get it? How is any routing servent to know that the other servent it is connected to is not passing legitmate requests the hosts it is purporting to represent? It can't. It might attempt to throttle the traffic of any from any given node, but then that would necessarily mean throttling the ENTIRE network, which would be self-defeating.

    Some users carry gigs and gigs of files, but that doesn't mean they're very popular. If I setup a server where I host my 20CD collection of Mozart works I'll probably won't get as much traffic as when I publish the Billboard 100. It's not the quantity, but the content of the files served that counts. Search for Britney and you'll receive 1000's of hits. Search for Planisphere and a lot less results will show up.
    While it is almost certainly true that only 1% of the content accounts for 99% of the traffic, it is also true that only 10% of the hosts account for almost all of the servers. Of those 10%, roughly half of them, (those that HAVE the popular files, are SHARING, are on truly HIGH speed network, and are NOT FIREWALLED) account for the majority of it. If you take the biggest servers out first, you will have a big impact. What's more, once it becomes established that there are likely consequences for being an effective server of files, the industry need not literally attack every last one of them. They need only use fear to their advantage and allow the servers' own self-interest to take over.
  25. Re:You have to be kidding... by moonbender · · Score: 2
    Eh? I know that was supposed to be a joke, and it went right back at you for acting as if you had any clue about what you're talking about, when you, in fact, have not.
    Not one link, eh? Plenty?
    I didn't post any URLs, because as I said others already mentioned them in comments. But anyway, here you go.

    ShareReactor

    FileNexus

    Asia Movies

    Jigle

    Various sites specialised in files of certain languages (French, German), such as Spieleplanet

    etc etc etc - just search for eDonkey links.

    There are also IRC channels and uncounted web boards (similar to Asia Movies) dedicated to sharing ED checksums.

    Oh, and about the checksums... look at a popular song, and see how many variations exist in file size at the same quality. Are you saying different files with different sizes won't have the same checksum?
    No, I am saying, in fact, I said, that is completely irrelevant. We're not talking about sharing files as in Napster or Audiogalaxy (where you seem to draw your experience from). There's only ONE valid version of each single/album (single MP3s aren't usually spread), the first high-quality, complete release by a scene group. All later releases are dupes, and not distributed. You get the checksum to that release, and you're set.
    --
    Switch back to Slashdot's D1 system.
  26. Actually checksums should work. by jidar · · Score: 3, Interesting

    Taken from Andrew Chens responses to the solutions:

    Although this idea works for newsgroups and some other centralized services, it does not with P2P. Basically, it comes down to the fact that you must trust whomever is actually doing the checksumming, or else they can just lie and publish false checksums. In the case of P2P networks, the checksumming is done by the same person you want to figure out if you can trust! As far as I know, this is an unresolvable problem.

    Actually, the checksums should still work I believe, in much the same way that file sizes work now. Consider the reason the files that are being injected are set to the same size as the real file; the purpose is to mask these files to the naked eye. Checksums could be used for the same purpose.
    The reason for this is because as people find good files they will tend to keep them while deleting the bad files. Sure if we only get 1 result back then we don't know one way or other, but if we have 10 results back and 8 of the 10 of the same checksum, we can assume those 8 are the good files.
    Of course the problem with this is that a great many people don't bother to delete bad files after downloading, but should the poisoning become too much of a problem we can entice more people to clean up their shared files by way of the client interface.

    All in all, I think this would combat poisoning very well.

    --
    Sigs are awesome huh?
  27. Re:Shameless plug... by Erasmus+Darwin · · Score: 2
    "Hopefully this will make the network somewhat self-moderating since users sharing undesirable content will not rise in network status."

    This strategy fails to take into account the fact that an RIAA mole could easily share desireable content. For example, mp3.com has 7 free, legal tracks for download from Linkin Park (not my choice in music, but they are quite popular currently). There are quite a few other well-known bands with free tracks on there. Sharing all that content, which the various record labels have decided to share anyway, will only serve to get the sharing user voted up.

    Once the mole is voted up for carrying lots of valid files that people are interested in, the mole begins to distribute poison. Sure, this will cause the mole's ranking to fall somewhat, but damage will be done in the process. Furthermore, the legit files will continue to somewhat offset the attempts to vote the user down. Multiply this whole situation by a number of different automated users, and you've got an effective poisoning attack.

    In short: The mole has spread files that the RIAA already wants distributed (win for the RIAA, win for users), and the mole has spread poison for files that the RIAA doesn't want distributed (win for the RIAA, loss for users).

  28. What a bunch of hypocrits by Dionysus · · Score: 2

    I hope the same people who defends the right to distribute mp3 they don't own the copyright for, will be the same people who defends a person/company's right to violate the GPL.

    --
    Je ne parle pas francais.
  29. Sure it can if and only if you can buy online MP3s by MattRog · · Score: 2

    The RIAA and all the lawyers in the world will never be able to completely stop pirating. Look at how much money the feds throw at drugs and the number of addicts on the street. If enough people want something, they'll get it.

    I know one of my chief frustrations is to search for a song and either have it incomplete, or be of poor quality (e.g. pops or other defects) or to simply have it not be the same song that I downloaded. If I could search for a song, pay $SOME_SMALL_AMOUNT (e.g. $1US) for it and download a 'known perfect' copy at my choice of bitrates (e.g. 128, 160, etc.) then sure as heck I'd do it.

    Distributing these poisoned files would take an enormous amount of bandwidth, so they'd have to have some sort of agreement worked out with ISPs and a mass-content provider, say Akamai. Akamai has tens of thousands of servers located in hundreds (if not more) of ISPs throughout the nation. I think on peak usage they're pushing out 100 GB/sec. in the US (if not more). Simply say "Ok Akamai, can we buy 10GB on each of your servers and push all these MP3s out?". Then you write a gnutella client for each box which offers all the MP3s up for distribution.

    I can't remember how the gnutella protocol works but I think it broadcasts search requests to the nodes that store a cache of what they have and what their neighbors offer and then can pass the request off. Have your client log all the requests (so you can tell the record companies which songs were requested more) and of course offer up your files when requested. If you do this with 10,000 boxes full of identical content chances are you're going to drown out any signal out there.

    If you're really tricky, you can even have the client 'fake' files so you don't actually need to have the file on the box; you could send a pre-existing obfuscated file, or even dynamically build and stream the poisoned MP3.

    Of course, all of this is moot if you still don't have a very easy, cheap method of offering MP3s online for the mass public. You could pitch it like this "Yeah, so you won't make much money off of offering $SOME_SMALL_AMOUNT for each MP3. But you're a fool if you think simply shutting Morpheus off will result in even 10% of the Morpheus users buying the actual CD or using a painful, userUNfriendly pay-per-MP3 system. However, what if we have a method to net you 20 or 30% of users who wouldn't pay you anyway?" So the pitch would be "We can't get you all of them, but our method would give you more than you're getting now!". Frankly the people who post on SlashDot (from the very negative response to the Subscription model) are not a good cross-section of the vast majority of internet users out there :).

    So in your obfuscated file you have it play maybe 20 seconds of the file and then say "Sorry, this is a copywrited file. Pirating files costs artists money. If you want to buy this MP3 for $SOME_SMALL_AMOUNT, please visit http://www.somestore.com. 80% of $SOME_SMALL_AMOUNT earned will go directly to the artist."

    It gives them a reason to buy it - not only do you have SomeStore.com very easily accepting payment, but you ACTUALLY PAY THE ARTISTS A MAJORITY OF THE MONIES EARNED! So it can quell the naysayers who say "Well the artist wouldn't receive anything anyway!" (rant: but who are you hurting more, the billion dollar-industry or the Artist who NEEDS even the small cut they receive from each CD sold?).

    Some drawbacks could be of course that someone writes a 'detector' to find and ignore the invalid MP3s, or they block the IP addresses of the servers, etc. but that is easily fixed. Most non-power users (e.g. the great and huddled masses of the internet) don't want to update their Morpheus client every time a new version is released. Heck, even programs which offer hassle-free updating (e.g. antivirus, windowsupdate.com) very rarely are by the majority of internet users. Also, you'd work out the server IP settings with the ISP so that they would rotate to a random IP in their pool - since most of the servers are located in most ISPs you couldn't ban the single IP but perhaps a subnet. But since the IPs are in the ISP, you have now banned a large chunk of users. If they are in every ISP, you will have to ban every ISP (see the problem in banning IPs?).

    So, to boil it down to a sentence:
    Have very easy-to-use, hassle-free, cheap, reliable, etc. method for users to buy MP3s and they WILL

    --

    Thanks,
    --
    Matt
  30. this already exists by Trepidity · · Score: 2

    Bitzi stores information on files found on P2P networks, indexed by a TigerTree hash appended to a SHA1 hash. Support for it has been integrated into several Gnutella clients (ShareAza, Limewire, etc.), which have also come up with their own URL systems (gnutella:// and magnet:// are the two existing ones right now).

  31. Fake Checksums by nuggz · · Score: 3, Informative

    Here is a file
    Bobs_Song.mp3 5 M Hash -XXXXXXX
    You don't know that I gave you the wrong hash till you're done.
    It can only tell you that you have the wrong file, after you have it

  32. Poison chain by nuggz · · Score: 2

    If you find a poisoned file in a trusted chain, you can now discount that person, and that entire chain.
    Trust should work both ways.
    Several unrelated "I got a good file" ratings could give you a cloud of trust. I think it oculd work.

  33. Sharereactor and edonkey by dotslash · · Score: 3, Informative

    A P2P program call edonkey (don't laugh) has partially solved this problem.

    In order to dowload a file, you can use a URI such as (ed2k://|file|The_Adventrues_Of_Pluto_Nash(2002).C D1.FTF.eDKDistro.Sharereactor.bin|559778352|1b153e 31f5fdbe829488989d04dda2b1|/
    ). The URI contains the "local filename", size and SHA-1 hash. A companion web site acts as a directory of URI's for popular content. The content is screened by the folks running the site. It has now reached the point where the "pirate" teams have accounts and post SHA-1 encoded URIs before releasing the content into the wild. Most edonkey users don't use the embedded search and instead use directories such as sharereactor.

  34. Checksums and signatures work by mikec · · Score: 3, Informative
    Mr Chen apparently does not understand public key cryptography. Using a "web of trust" does in fact work.

    The author writes

    For the uninitiated, checksums work by examining a file and creating a string that "fingerprints" the data. It can be used in many situations, but the most common application is to verify that a file has been correctly transfered. The basic idea, in relation to P2P, is that every file on a user's computer is checksummed, and this checksum is then published to everyone else. Then, it may be possible to create a directory of "correct" checksums, to make sure you are actually downloading what you want. Although this idea works for newsgroups and some other centralized services, it does not with P2P. Basically, it comes down to the fact that you must trust whomever is actually doing the checksumming, or else they can just lie and publish false checksums. In the case of P2P networks, the checksumming is done by the same person you want to figure out if you can trust! As far as I know, this is an unresolvable problem.

    This is not an unresolvable problem at all; this is where web of trust comes in. The basic idea is for the publisher to sign the checksum using his or her private key. Others can then verify the signature using the publishers public key. This allows me to verify, using only a few bytes of information, that a publisher named SecretAgent did indeed publish a file. If I know that SecretAgent has previously published a lot of "good" files, then the file is probably good. If I don't have any experience with SecretAgent, but I do know that PrivateBenji is trustworthy, and PrivateBenji vouches for SecretAgent, then the file is probably good.

    The author fundamentally misunderstands webs of trust:

    Another idea that is often proposed is moderation, specifically "webs of trust." That is, people keep lists of people they trust, and then they implicitly trust (often with diminishing degree) the people they trust, and so on. In the context of P2P, the each user would then receive a "trust rating," reflecting the number of people that trust them. However, this can also be defeated fairly easily, by creating groups of malicious users that trust each other - then, untrustworthy users may have high scores leading to problems in the future. This kind of fraud has happened on eBay, where people give themselves recommendations to mislead future partners.

    A web of trust is not a "trust rating" ala eBay. A web of trust is a specific group of people who vouch for each other. Creating a malicious group of people who trust each other does not cause problems. (In fact, it can actually help.) If I trust A, based on experience, and if A trusts B, based on experience, then I can probably trust B. The fact that C, D, and E are malicious doesn't cause problems, because neither A nor B trusts them.

    1. Re:Checksums and signatures work by mikec · · Score: 2

      There is nothing to prevent the bad client to send a copy of somebody's elses signature of the file's checksum. Public Key Authentication is used to verify whether already received data is actually from so and so. It cannot be used to authenticate not yet sent data.

      I write a message that says, "File X, which has md5 sum Y, is good file." Notice that I didn't publish the content to begin with; I'm just vouching for the fact that it seems worthwhile. Then I sign that message with my private key and post to the net under the name FooBar. You see a message from FooBar, whom you trust because FooBar hasn't led you wrong in the past. You first check to make sure the message is really from FooBar by using FooBar's public key. (You kept a copy from previous messages.) If I'm not FooBar, the signature doesn't ckeck out, and you ignore the message---maybe 512 bytes wasted. If I am FooBar, it does check out. Then you look for a file named X with md5 sum Y. If you find it, you download it. If you find a server that tells you the sum is Y but then gives you something with a different sum, you have found a bogus server, and you don't use that server anymore.

      If bogus servers are a big problem, then you need to identify good servers, again using a web of trust. I.e., periodically send out messages that look like "The following servers seem to be bogus: X, Y, Z". (Bogus means that they are lying about the md5 sums of their content.) Again, you sign those messages so that people who trust you can believe the results.

      It is absolutely true that someone can trade good stuff for a year to become trusted, and then suddenly serve up junk. But of course, they can only do that once or twice. Then they aren't trusted anymore. And in the mean time, they have served up lots of good stuff.
  35. Overkill by Cryogenes · · Score: 3, Informative

    Distributed trust and peer review are fine and good but not even needed for the simple task at hand.

    Look at the warez scene to see how it goes. A handful of release groups whose names are known to everybody who is even vaguely interested is sufficient to ensure supply. If these groups are attacked by fake releases (rarely happens) they can use hash keys as you suggest (some already do).

    Websites like www.sharereactor.com also safeguard against fakes - another mechanism which is strong enough to defeat the entire problem by itself.

    What I am saying is that distributed moderating à la slashdot will not evolve. Instead, we will have a handful of "authorities" - Web sites or public keys - that everyone trusts.

    Note that authority - when not combined with power - is a Good Thing (TM).

  36. So, sharing is OK now right? by PotatoHead · · Score: 2

    If they get to poison the networks, then that means that they are using the networks --just as we are.

    I wonder what would happen if some ordinary user did the same things? Right or wrong?

    Dealing with the problem this way is far better than using the law because it is hard to define the law in a way that makes good sense for everyone long term particularly when we don't yet know how P2P could benefit us all.

    Besides, they can place any number of promotional information into their files just as easily as they can garbage and they should. Why not? They might even be able to write off more of the expense.

    What the media companies need is good marketing. They are the content source. (for now) All they need to do is add value in ways that leverage the network effect that P2P offers and they *will* make money.

    Anyway, the result of this is likely not all bad because file sharing will get somewhat marginalized, we all preview before we download large files and everyone is reasonably happy and free to use the net in creative ways.

  37. P2P sharing should leverage popularity by defile · · Score: 2

    Popular files are more likely to be valid. Poison is less likely to be popular. Poison sinks to obscurity.

  38. Easy to fix, really. by mmol_6453 · · Score: 2

    Public key encryption's been around for quite a while.

    Just give moderators private keys, and distribute the public keys. Bingo! Authenticated moderation...

    --
    What's this Submit thingy do?
    1. Re:Easy to fix, really. by adamshamblin · · Score: 2, Interesting

      This proposed solution, or most any solution based upon moderation, has a few serious flaws. First of all, the use of public key encryption would require some sort of central authority to both assign moderator status to select members of the P2P network, and to distribute the keys to the masses. In the case of the Gnutella network, this could be said to be both the antithesis of the network model, as well as being relatively impossible to enforce - with the disjointed nature of the Gnutella network, it is conceivable that segments of the network would not be visible to a logged in moderator. In fact, to insure moderator coverage, moderator status would have to be given to a statistically high number of individuals. Second, the creation of the central authority necessary to administer this proposed 'solution' would give organizations like the RIAA and MPAA easy and - from their point of view - logical individuals to target in their foolhardy quest for the Copyright Grail.

      Perhaps a means of voluntary moderation could be accommodated in the Gnutella protocol itself. 'Karma' could be built up on a network node based upon many criteria, which could include positive feedback from peers, etc. By writing moderation into the protocol itself, client developers could implement these features at their own discretion. The idea of moderation would then be put to the test of software natural selection.

      --
      http://iratepublik.com
  39. block checksum by bogado · · Score: 3, Interesting

    one could keep a trusted block signature for each file. Say you have signature file that has one MD5 for each x bytes of the file. This file and it's MD5 hash is the identity of the file. On would then choose to download this file before the file itself and then download the blocks of x bytes from the file in a rendomised order, and possibly from diferent nodes. I guess this would add some otherwise uneeded downloads, but would help to restart the stoped downloads and would detect poison nodes easily.

    To bad I am so late in posting this...

    --
    []'s Victor Bogado da Silva Lins

    ^[:wq

    1. Re:block checksum by bogado · · Score: 2
      No you first download a MD5 for the MD5 list (this wiil be the id of a file), then the MD5 list (this file could be potentialy big for CD images for instance), if this does not match the id blacklist the node. Now you will download a random block from the list, from one of the nodes that has the files with the same ID. If this block does not check with the MD5 black list the node. Try again from other node, and so on.


      A separed web of trust could apply to what IDs are from valid list of MD5 from a valid file and witch are not.

      --
      []'s Victor Bogado da Silva Lins

      ^[:wq

  40. What we need, to legitimize P2P by mmol_6453 · · Score: 2

    Is to create a network specifically dedicated to trading, say, opensource code, research papers, personal public diaries, and the like.

    (Bye bye, karma) I may sound like a troll, but at least I'm being honest.

    Peer-to-peer filesharing has a great deal of potential, but if its only popular use is piracy, well, we already get enough bad press, don't we? It'll only get worse.

    (Sorry about the soapbox I'm standing on...)

    --
    What's this Submit thingy do?
  41. Misapplications... by zunger · · Score: 2

    I wonder if the author has considered that the primary applications of this work are probably not in influencing file-sharing networks so much as in politics. The P2P network that first comes to mind is ordinary web access within China. This is a situation where the government has an active interest in preventing any politically sensitive information from being propagated within the country, and so the ideas of this paper are directly applicable.

    I'll leave the relevant ethical issues as a matter of discussion -- but I would suggest that this is a far more serious reason to be concerned about corporate research into network interruption.

  42. Re:Directories like Bitzi can stop fraudulent file by bje2 · · Score: 2

    since this website that collections strong file checksums, descriptions, etc, is now a centralized location (as opposed to P2P which isn't centralized), could the website fall under legal attack for aiding and abetting illegal activity of swapping copyrighted material? just curious...

    --

    "Facts are meaningless. You could use facts to prove anything that's even remotely true." - Homer Simpson
  43. Two Problems by Fascist+Christ · · Score: 2, Interesting

    I see two problems with this idea.

    1. Their problem They don't want to change. They don't want to give in to this non-physical technology. They don't understand it, so they condemn it. It's human nature. They aren't simply hard-headed.
      -or-
    2. Our problem They will sell it to us for $5 per 64-bit mp3 to make up for the "lost sales" on the "pirated" copies. 128-bit will cost you $10. They won't offer any higher quality because it would "take away from CD sales."
    --
    TodayTM BillyJoelTM GoogleTMd for StitchTMes due to WindowsTM while RollerbladeTMing with an AppleTM and a PopsicleTM
  44. Meet the demand, kill the network by NigelJohnstone · · Score: 2, Insightful

    Even simpler than all these attack strategies. Simply produce the produce the way customers want it.

    Enough people will defect to the faster, more direct, legitimate servers. Where they can get the whole album and a movie in 2 hours instead of 2 weeks. The price should be good enough to encourage this.

    The P2P networks relies on enough users mirroring enough copies of enough products. Reduce the user base and the number of nodes drops until it just doesn't work anymore.
    You can see this on the unpopular P2P networks now.

    So either you will end up with:

    1. a few users sharing lots of files (which can be picked off with civil copyright laws).

    2. a few users sharing few files (which means they can't find the files they want on the network, so are less likely to be running a P2P just to support other users, so the number of people spirals down).

    The one thing I don't think you will end up with is many people legitimately downloading and then sharing the files. Quite simply, you would eat up your bandwidth using P2P which you need to do the downloading.

    Another factor is the charging, many ISPs are moving to a download limit, e.g. TOnline is moving to 5GB limit per month, then pay 1.5 cents per MB.

    So a movie would cost $7 to download after you've used up the first 5GB. Or for that matter to upload to another user!
    So you could pull maybe 7 movies a month on the flat fee.
    A lot of users on P2P systems will disappear as this becomes the norm.

    So P2P is really just a temporary problem for copyright holders, just as long as they get their legitimate sales systems in place and don't go pissing off the consumers with DRM, funny licenses etc.

  45. trustable checksums by sacrilicious · · Score: 2
    From the article:
    In the case of P2P networks, the checksumming is done by the same person you want to figure out if you can trust! As far as I know, this is an unresolvable problem.
    The shortsightedness of the above is that it doesn't acknowledge that a checksum is merely a convenience, not the name of the game. If someone falsely publishes a checksum that a trusted checksum directory states is good, then it's simply a longer operation for a user to discover that the checksum is actually false... but this still leads to the blacklisting of the publisher, and not everyone needs to download the file in its entirety to discover this.

    .

    --
    - First they ignore you, then they laugh at you, then ???, then profit.
  46. Re:Pull The Story by gaudior · · Score: 2

    Mozart is not, but any random recording of a Mozart piece is.

  47. There is a business model that works by Archfeld · · Score: 3, Interesting

    look at DVD's...provide so much material that it is more work pirating than it is to buy. Why does a DVD cost the SAME as a CD ? Last time I checked a movie was SIGNIFICANTLY more expensive to produce than a ALBUM, and yet DVD's sell for the same or LESS, and quite often contain the BLOODY soundtrack as well. If a CD included multimedia stuff, editing room floor tracks, useless bio info and oodles of extra crap at a reasonable price it will be more trouble to rip it than it would be to buy it. When the RIAA wakes up and realizes that, maybe, just maybe things will turn around, otherwise, one way or another the industry is dead. The MPAA is actually beginning to come around, slowly and not without a FIGHT, but they are evolving. I don't hold out the same hope for the record industry.

    --
    errr....umm...*whooosh* *whoosh* Is this thing on ?
  48. MD5 Hash by Xannor · · Score: 2, Interesting

    After reading this and some of the comments from the old posting, I realised the MD5 hash is not a bad approach. When a client scans its HD it creates MD5 checksums of its files. when some one requets a file the checksum is sent with the reply. when the file is d/l'ed the checksum is checked. if the checksum fails the user is notified and they can either re-try the d/l or accept it. after they can test the file. if (with a valid checksum) the file is corrupt, the client can store the checksum and filter it from future requests, also they can be shared to prevent others from d/l'in as well. this system could still be temerarily defeted by having many versions of the same file, but again that could be tested as well (too many bad files flags a bad host, etc)

    --
    I sig therefore I am...
  49. Edonkey? Good lord... by GuyMannDude · · Score: 2

    A P2P program call edonkey (don't laugh) has partially solved this problem.

    I'd hate to see the kinds of porno AVIs that get traded on a P2P program named "edonkey"! (shudder). At least there isn't one called FistOfFiles.exe yet.

    GMD

  50. Re:This won't work. by ahfoo · · Score: 2

    That's the key point right there. The paper that the article was based on used the analogy of a pond being polluted. Well, there are good anaolgies and there are bad analogies and a fishing pond not a very good analogy here because a P2P network is much more like a swimming pool with not one or two, but millions of high powered filters. A standard filter and chlorine/ozone system on a swimming pool can remove enormous amounts of excrement. A pool with a million filters is going to require a hell of a waste stream to pollute for any length of time. Given that these filter systems are also the water inlets for the pool, the task of polluting the majority of the water for any length of time is problematic at best and unlikely to succeed.

  51. Re:Golf war? by Golias · · Score: 2
    Whyinthehell would he want to go to it? The only solution they came up with, for every problem they discussed, was "the US should give the Third World even more money."

    Anybody could have seen that coming. Were I president of the US, I would have skipped it as well.

    --

    Information wants to be anthropomorphized.

  52. You can do better than that :) by j3110 · · Score: 3, Insightful

    Do GPG signatures on blocks(about 50-100k) of files instead of entire files. When you have a contradiction of checksum's on blocks of files, alert that the user that someone is a liar. Take all the results of the search for that file, and all the gpg signatures and present the user with two options that are the sum of their trust levels. Most files can be previewed to check if it is bogus, and the user can blacklist anyone that even trusted that host, and their IP's as well. From then on, none of those IP's will be allowed to connect to this host. Eventually, they'll exhaust their IP supply before they end piracy.

    Obviously the user would get to select the appropriate action if one of the files are just better than the other with a rating mechanism as well :) (A per file rating instead of a per host rating)

    Other advantages to this method are:
    *Checksums can't be faked except in NP time. (use a random block size to thwart a super computer precalculating bad blocks that MD5 to the right hash... use multiple hashes)
    *Multiple host download is gauranteed to be the same file (even when being poisoned).
    *A computer need not have the entire file to share a block of the file, therefore files propogate the network in a more exponential manner. (host A gets block 1 from B. Host C gets block 2 from B, Host C and A trade blocks 1 and 2. Host D comes along and wants the same file, and can download from A and C instead of bogging down B. Works even better because all connections that I've seen are duplex even if they have a slower upstream. Conserve network bandwidth by refering downloaders to other people who have downloaded before... search for the GPG signature of the hosts on the network.)

    Overall, I see this kind of thing being implemented very soon because it's not that difficult, and it's pretty obvious. Maybe the next edition of Gnutella will support this.

    Of course there are loopholes where the RIAA/MPAA could buy half a million IP addresses or have a lot of computers on the network, but you don't have to have an unbreakable system, just a system that costs more to break than they think they will see in profits from breaking it.

    --
    Karma Clown
  53. Re:Poisoning is not possible by symbolic · · Score: 2


    This is faulty reasoning. Once 80% of the nodes are poisoned (and probably far fewer), it means that users looking for illegal mp3 files will stand only a 1 in 5 chance of getting something that isn't worthless. How many times do you think people are going to subject themselves to this before deciding that it's just to much trouble? It's a clever solution, because it's using the very trait that makes P2P so attractive (P2P caters to convenience, and by extension, laziness), to render it wholly ineffective for its intended purpose.

  54. Comparisons to the War on Drugs by bwt · · Score: 4, Interesting

    In particular, our analysis of the model leads to four potential strategies, which can be used in conjunction:

    1. Randomly selecting and litigating against users engaging in piracy
    2. Creating fake users that carry (incorrectly named or damaged files)
    3. Broadcasting fake queries in order to degrade network performance
    4. Selectively targeting litigation against the small percentage of users that carry the majority of the files


    This mostly summarizes the war on drugs and the government's strategy against alcohol prohibition in the 1920's. Neither worked and the countermeasures are simple and straight forward.

    A "directed" web of trust, objective quality measurement, and knowledge compartimentalization defeat the above strategy. The countermeasure of creating large numbers of mutally trusting attackers doesn't work when trust "flow" is taken into account. The keys to such a system are:
    1) trust is assymetric
    2) nodes define and change who they trust based on their own assessments
    3) Nodes protect their knowledge of the web of trust

    To see how this works, consider the cops and the drug dealers. The fact that the cops all trust each other does not result in the drug dealers trusting them. When a dealer is compromised, no matter how high up the chain it goes, trust shifts to rivals. Even when a kingpin falls, lines of trust will still exist that aren't compromised.

    Drug dealing is not as popular as file sharing, is substantially more damaging to peoples lives and society, and has motivated levels of funding that are not matchable by publicly traded firms (who must demonstrate at least mid-range ROI). Despite all of these advantages, the war on drugs has been a dismal failure. The bottom line is that the internet makes distribution of content a commidity, where it was formerly a task of enormous complexity and value add. Economics will determine the rest, unless the US adopts and maintains a totalitarian government.

  55. Re:Are they THAT blind? by symbolic · · Score: 2


    You make it sound like the RIAA/MPAA is conducting a criminal enterprise by negotiating contracts between artists and themselves, and then risking the capital required for the production, promotion, and distribution. Those THIEVES!

    The $15-$20/CD argument is a smokescreen. Not only do most consumers have the option to get CDs at fairly reduced prices (through a mail-order club), if they object to the price, they are free to keep their money, and let the RIAA/MPAA keep its property.

  56. Ya can't beat evolution by tealeaves844 · · Score: 2, Interesting

    Here's another way to look at the problem: the physics of evolution. If we can treat p2p as an ecosystem, we can apply the same types of energy balances. The paper isn't talking aobut extinction of p2p, it's talking about a change in the observable patterns it exhibits. Because stressing a network can't eliminate p2p, a new one will pop up in its place. If you treat user demand as "free energy" the most stable state of those users is in sharing. Fundamentally, when you stress an ecosytem, it can "fail" in that the species in it aren't the same, but new ones pop up. The dinosaurs went extinct, but here we are!

  57. "Authorities" are a distrib. trust management tool by Jim+McCoy · · Score: 2

    You are probably correct as far as how things will play out in the real world (fewer sources of authority, but well-known and trusted sources) simple because of how the background social networks that currently exist can be used as a bootstrapping mechanism by the trusted source solution. Part of my original point is that this solution, as long as multiple sources of authority are allowed to exist, is a part of the general distributed trust solution to the original problem. Distributed trust can be "client-server", "peer-to-peer" or some hybrid of the two.

    You only have to take a look around the real world to see that reputations are an efficient and attack resistant mechanism for allowing untrusted parties to exchange info/goods/services. Credit ratings, movie ratings, "best of" lists, gossip, etc. We are surrounded by and enmeshed within distributed trust and reputation systems so completely that most people do not even realize how many times a day they use such a system.

  58. Re:They need to lower PRICES! by mark-t · · Score: 2
    The problem is that a reduction in price does not result in a proportional gain in sales, just as jacking the price results in an exponential loss in sales. This has been repeatedly shown to be true in marketing of almost any product, and is especially true for products that are considered luxury items (and yes... owning tapes and CD's is considered a luxury). If they lowered the prices, the piracy would indeed slow down, but not nearly enough to justify it in any sort of business sense.


    The break-even point on this curve is actually not that far below the so-called outrageous amounts they are currently charging. I did a research paper on this topic last year for college, and had to admit that while CD's are in fact overpriced, they are nowhere even close to the amounts that you are claiming. Yes, they are far above their own costs for the media -- but they aren't that much above their costs if you factor in media piracy as lost revenue. Whether or not they ever would have seen the money is beside the point -- if you smuggle someone into a theatre to see a movie without them paying for a ticket, even if there *are* a lot of empty seats, you are still considered to be depriving the theatre of lost revenue as well.


    Anyways, you know the way the our market works, right? Everyone charges as much as they possibly can and still be able to convince some percentage of the people to buy the product. Every once in a while you find a philanthropic soul who will charge a modest amount above his own costs, but come on people! This is the real world... you can't seriously _expect_ a majority of people to run businesses like that. Heck, if most of them did, they probably wouldn't last more than a year!

  59. Please mod up parent - he's clueful by billstewart · · Score: 2
    Plover has the clue here - in an open system it's easy to create lots of identities that are just tentacles of yourself. So if you mainly count positive votes, the Poisoners can give their tentacles lots of positive votes. But if you count negative votes heavily, they can go slandering big sites, and unlike Slashdot Karma, it's hard to metamoderate, because the ratings are about private transactions. So it's pretty easy for the Poisoners to create a lot of good-looking sites with good reputations, and use them to spread poisoned files.

    What's worse, it's very difficult to identify bad files automatically, because different rips of the same original can have different checksums, so the poisoners can spread lots of versions with different checksums, so you can't tell whether two files claiming to be a 128kbps ogg of "Whoops I Cloned It Again" came from the same original, only that they're not the same, so you have to listen to the thing all the way through to be sure that it doesn't suddenly turn into an FBI/RIAA/KGB warning against copying music, or a commercial for the CD containing the FM version of the track, or that it doesn't have a lot of low-level static in it. (If I were an artist, I might be more annoyed about the latter.)

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  60. Cryptographic checksums fix that, sort of. by billstewart · · Score: 2

    CRCs aren't the only kind of checksum out there, though they're nice and fast. Cryptographic-quality checksums avoid the problems - if you change one bit of the input, they change about half the bits of the output, and it's nearly impossible to predict what the changes will be. MD5 was the most popular for a long time, though SHA1 has been replacing it for a variety of technical reasons. MD5 is 128 bits long, SHA1 is 160, so you don't need to worry about collisions unless you have more than 2**64 or 2**80 files.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  61. Why this fails for audio files by billstewart · · Score: 2

    Unlike Warez or some lossless compression systems, this doesn't work for audio or for video applications using lossy compression instead of distributing exact copies. The reason is that different compression runs don't need to have identical checksums, depending on your compression parameters, equipment, etc., so the Poisoners can go create lots of different files all claiming to be a rip of the real thing, and they can have multiple identities all claiming to have a version to share, so even if you burn one file and one identity, they can trivially create more. If they're clever, they can do this with very little extra work - each version has identical data except in the last block (448 bits for MD5, I forget how many for SHA1), which is juggled a bit. Since music files are large, this means they can do 99.99% of the work once and only have to repeat the last 0.01% multiple times. GPG signatures on the files don't help much either - they've provided a genuine signature saying that jack12345 and lars6789 both downloaded this file of "Whoops I Cloned It Again" and got checksum 12903849021834, but when you listen to it, it's just Poison singing "Happy Copyright Violation Lawsuit To You" with a burst of noise in the last few milliseconds.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  62. EBay's a different case by billstewart · · Score: 2
    Most E-Bay users are honest, and most of the dishonest ones are afraid of getting caught, and most of the dishonest ones who aren't afraid of getting caught are either too small-time to matter or too stupid to get away with it, and ripping people off takes a certain amount of Real Work and creates a certain level of traceability.


    This is different - there's no penalty other than your reputation, the Poisoners have a much stronger legal position than anybody who might complain (Hey - I tried to rip off their music and they gave me a Bad Copy!), identities can be created free by robots, reputations for the identities don't take too much work to forge, and there are lots of creative ways to cheat.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  63. Why public-key crypto doesn't solve the problem by billstewart · · Score: 2
    It's fundamentally a social problem, and public-key crypto webs of trust don't map very well to it. They work badly enough for the social problems they were *designed* to solve (e.g. Phil Zimmermann's anti-nuclear activists try to prevent forgery and eavesdropping by Feds infiltrators - it starts to get weak when one of your friends is gullible about signing keys for his new friends who are really Feds.) And maintaining really broad webs of trust is surprisingly difficult, except when there's a commercial enterprise to sustain it, i.e. either a record company or a lawsuit target.

    It's very easy to create a large number of identities in this system, each pretending to be a real person but really just Yet Another Tentacle of the Poisoners. They can all build up great reputations by signing each others's keys, and sending reports into the whoever-archives-reports-about-users system claiming to have done lots of downloads to each other, and they're all listed as having T3 or Ethernet connections so they're very attractive. And they can pump out a large number of files that they've signed, indicating correctly that the checksum on File#12345 is 290384098213 or whatever, for many different files with many different names, all of which are really Poison singing "Happy Copyright Violation Lawsuit To You!" with a different serial-number burst of noise at the end. They can distribute enough non-poisoned songs to create some good genuine reputations, use those to sign peoples' keys and get people to sign their keys, use these reputations to sign the keys of their other tentacles, and start distributing poisoned songs to people who trust them directly or indirectly, using their keys which have been outed as Poisoners to sign the keys of people who aren't tentacles. Even more fun, you can distribute lots of poisoned index data - some P2P systems are much easier to kill that way.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
    1. Re:Why public-key crypto doesn't solve the problem by mikec · · Score: 2

      Nah. Large numbers of malicious identities vouching for each other is actually good. As soon as you figure out that one of them is bogus, you can blacklist all of them.

    2. Re:Why public-key crypto doesn't solve the problem by billstewart · · Score: 2

      The problem is that they're also vouching for lots of non-malicious identities - you can set things up so it's pretty hard to spot the bad guys automatically without also getting good guys, and manually blacklisting a few thousand droids is too much work.

      --

      Bill Stewart
      New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  64. Doesn't work without trusted third parties by billstewart · · Score: 2
    If you don't have a trusted third party, you can't easily create a rule that voting is only allowed after downloads - either the downloader can be a Poisoner's Apprentice who's dishonestly claiming to have downloaded a bad file from you, trashing your karma, or else there's a mechanism for you to claim that he's lying, in which case any Poisoner who you download poisoned files from can use that mechanism to claim that you're lying if you complain.

    But if you *do* have Trusted Third Parties, Poisoners will either attack them technically, sue them, or pretend to be them, or all three. And Slashdot MetaModeration isn't directly applicable to this problem, because the disputed event is private, unlike Slashdot postings which third and fourth parties can look at and decide whether they're really Insightful or Trolls.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  65. This exists - Bitzi by ShaunC · · Score: 2
    Create a website with logins for the users. Users of this web site can create lists of checksum for the files they create or have downloaded and verified as valid.

    Other users can check any given user's list, and perhaps even post comments about the user's list, a form of moderation, if you will.
    Fortunately, someone's already done all the work. Such a system already exists; head on over to Bitzi and check it out.

    Bitzi is based on checksumming. After you download a file, you run it through the Bitcollider app to generate a unique checksum which is automatically uploaded to the Bitzi site. Meta-information like ID3 tags, etc. is also extracted from the file if present, and all of this data is combined to create what's known as a "Bitzi ticket." You can vote for the (in)validity of a particular file, and you can also leave comments about a particular file for other users. A ticket can be created for any file, not just MP3s; there are already lots of pornos with Bitzi tickets :)

    The eventual goal is that, before you take the time to download a file, you'll be able to look up its Bitzi ticket and determine whether or not it's what you're really looking for. If 10 people have already indicated that the file is bogus, corrupted, incomplete, etc. you'll be able to safely skip it without wasting time or bandwidth. In order for this to happen on a broad scale, Bitzi needs more users. It's totally a volunteer community effort; someone has to be the first person to run each file through the Bitcollider and generate the initial ticket. Please visit the Bitzi site, register (I can vouch for the fact that it's possible to register with an @example.com address and still access the site just fine), then run all your shared and/or downloaded files through Bitcollider. The more files that get into the Bitzi system, the better; this includes "bad" files, and in fact ticketing "bad" files is probably more useful than ticketing "good" files.

    Several popular P2P filesharing clients, including BearShare and eDonkey2K, already have built in support for Bitzi tickets. I hope others will follow suit.

    Shaun
    --
    Thanks to the War on Drugs, it's easier to buy meth than it is to buy cold medicine!
  66. Rating individual files instead of users fails by billstewart · · Score: 2

    I've separately posted a discussion about how it's easy to create large numbers of files with different checksums pretending to be different audio rips of the same tune. Not only does this flood the typical index system, but if the Poisoners can create lots of users, they can all rate the poisoned files as good, or rate non-poisoned files as bad, and they can probably give themselves great karma by first sending in lots of reports about having successfully shared lots of good files with each other.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  67. Doesn't help for Audio MP3/Ogg/etc by billstewart · · Score: 2
    Checksums can be useful for Warez distribution, where a given file has a unique checksum, but they're not useful for compressed audio files, where there isn't a 1-1 mapping between the original and the bitstream you're distributing. Your MP3s, Oggs, etc. depend on the version of compression program you're using, specific parameters you used to compressed with, phase of the moon, or whatever. So all that extra work you're doing with incremental checksums isn't very helpful, because there isn't One True Value for you to compare against, though it can make it easier to do blacklists for some of the more efficient poisoning techniques. The easy way to do poisoning is to take your standard copy of Poison singing "Happy Copyright Violation Lawsuit To You" followed by a small block of serial-number bits to make the checksums different for each copy - this lets you crunch the partial MD5 for the first 99.99% of the file once and only have to do extra work for the last 0.01% of each poison file you're creating, which lets you create ~10000 poison files with only twice as much work as creating one. If you go to the extra work of using your incremental hash techniques for whitelisting files, it doesn't gain you anything, though it will catch this type of blacklisted files after the first block. If you're doing a whitelist-based system, you actually have to have a human listen to the thing (or a robot that does music-to-text, but that isn't going to catch files with the original lyrics and lower resolution or different tunes.) And blacklisting users has other problems, if you're not extremely (and probably unscalably) careful about your web of trust, since Poisoners can do things like give each other good karma and distribute a few real files, then put out lots of Bad Karma blacklist reports about non-Poisoners.

    Do go read about BitTorrent, though - it does use a number of the ideas you've mentioned for efficient distrubtion.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
    1. Re:Doesn't help for Audio MP3/Ogg/etc by j3110 · · Score: 2

      The MP3/Ogg issue is exactly why you need 50-100K segments... You can play partial streams of either. If you don't like the quality, or it is the wrong file, you don't waste time trusting the original host to begin with. If some id10t downloads a bad file, I don't trust him to not have a few more either. My idea of a system would work because it uses the best of blacklist without going overboard, and still uses whitelist to notify the user that a file is more likely to be accurate (not really... just causes the poisoning party to obtain several independant hosts to lie as well, but the 100K block sample would clear up any network of poison very quick for any given host.) Each host makes it's own decision (actually the user) on who's the liar, but networks of bad files would be colapsed into one file each. The larger the network, the higher it floats. A network would require IP's of computers to vouch for the validity, so if you lie and you're caught, you need a new IP before I'll believe you ever again. Each user blacklists as he pleases, so it can't be tricked into blacklisting files on any other computer.

      The key to the whole system is the 50-100K blocks. It affords a lot of flexibility, the most powerful of which is a preview so you can kill off the idiots, leaches, and poisoners in mass. If they aren't a mass, then they'll probably rank below a good file :)

      They can only make you download another 50-100K if they lie about any given checksum.

      Basically: Anyone that mods up an idiot would be ignored as well as the idiot. If they don't mod up the idiot, he'll never be heard. If they do mod up the idiot, he and his friends are only heard once. The security is not scalable, because it isn't needed to be if you kill 100's of nodes at a time of liars. It's an N*M problem for them. They need N nodes to mod up 1 other node. They need M of those networks. N is the number of people that modded up the highest ranking good file. M is the number of times the user can be fooled.

      Ex) File A has 100 hosts online vouching for the good copy. The user will check three files before giving up. In order to poison the network, you need 101*3 hosts with unique IPs. Even if they had 303 IPs, in order to poison 10 files they would need 3030 IPs, because you will block each 303 IP's everytime you get a poisoned file.

      As for just making the first block work, that's why you use random block sizes. They can't make a random sized block of data conform to the hash on demand. If they could, then you might as well store passwords in clear-text. MD5 makes a 128 bit hash of data that is irreversable in less than 2^127 number of guesses on average. If it took one clock cycle to make a guess, and you had a 2Ghz machine, it would take ~3E21 years to come up with a fake hash on average.

      This could speed up P2P (as BitTorrent has shown), and it significantly increases the cost to poison the network.

      --
      Karma Clown
  68. How does the ad come across to the recipients? by Ride-My-Rocket · · Score: 2

    I have no desire to intentionally spam people, but if that ad isn't all in their faces, is it really that bad of a thing? And honestly, Shareaza works 5x as well for me as Gnucleus ever did. Plus, it looks / feels like a mature application....... not a big deal to some, but a trait that I definitely miss from the Napster days of yore.