Slashdot Mirror


Mathematical Analysis of Gnutella

jrp2 sent in a paper written by one of Napster's founding engineers. It is a mathematical evaluation of Gnutella discussing why the network won't be able to scale up to any reasonable size. I have been impressed with Gnutella in the past, and have wondered along these same lines in the past.

209 of 332 comments (clear)

  1. This is old news. by BitterOak · · Score: 3, Offtopic
    This has been discussed since shortly after Gnutella came out. It is essentially arguing that if each peer asks several other peers if a song is available, you get an exponential growth of search requests and the network clogs if you have too many users. This was basically the rationale for the Fasttrack type of systems, such as Morpheus, which most people use today.

    --
    If I can be modded down for being a troll, can I be modded up for being an orc, or a balrog?
    1. Re:This is old news. by RovingSlug · · Score: 4, Informative

      Speaking of the merits of Fasttrack, it also provides robust parallel download and auto-resume. Using Gnutella is painful by comparison.

    2. Re:This is old news. by mcspock · · Score: 2, Funny

      what would be funny is if an economist worked on the gnutella project, and they wrote up a paper on how the new napster business model would never succeed.

      or maybe only i would find that funny.

      --
      -- Patience is a virtue, but impatience is an art.
    3. Re:This is old news. by Bitsy+Boffin · · Score: 2, Insightful

      Mod that man up.

      I have never had much luck using Gnutella, the main problem seems to be the lack of parallel download, if you have 20 users all with the same file you want, it is dismally painful to have to pick one.

      Fasttrack on the other hand (Kazaa has a linux client that is IMHO better than the bloated windows offering) works very well in this regard. Choose a file and the client download it in parallel from as many clients as it can, makes for much quicker transfer.

      --
      NZ Electronics Enthusiasts: Check out my Trade Me Listings
    4. Re:This is old news. by Lappie · · Score: 1

      Very true and the link posted above can also be found in that older article under this link. The link looks different but the website it points to is an exact copy of the original one in the above older article.

  2. Hitchhiker's Guide, Part II. by Tackhead · · Score: 5, Funny
    Earth: Mostly harmless.

    Napster: Sucks ass.

    Gnutella: Doesn't scale.

    (Mod my ass as Flamebait for this, but didn't everyone know about Gnutella's scaling problems, and for-pay Napster sucking ass, based on Slashdot stories months and weeks before today?)

    1. Re:Hitchhiker's Guide, Part II. by Phexro · · Score: 4, Offtopic

      Earth: Mostly harmless.

      you must have the new edition.

    2. Re:Hitchhiker's Guide, Part II. by psych031337 · · Score: 2
      Earth: Mostly harmless.
      you must have the new edition.

      With the state of the world being as it is now, i'd say this is an outdated issue... Maybe "partially lethal" would be more appropriate.
      --
      +++ath0
    3. Re:Hitchhiker's Guide, Part II. by GiMP · · Score: 2

      mostly harmless, in relation to the universe.

      Although "mostly harmless, mostly suicidal" would fit the bill quite well.

    4. Re:Hitchhiker's Guide, Part II. by sharkey · · Score: 3, Offtopic

      Maybe "partially lethal" would be more appropriate.

      That's just the pretzels.

      --

      --
      "Outlook not so good." That magic 8-ball knows everything! I'll ask about Exchange Server next.
    5. Re:Hitchhiker's Guide, Part II. by istartedi · · Score: 2

      Yes, and in the latest version there is a lengthy explanation of why the previous guide was woefully underpowered, and how the next version is going to have a complete real-time model of the entire universe with a customized Total Perspective Vortex (TM) designed to match the buyer. They are just waiting for those dime-sized hard drives that hold 10 googolplex bytes. They are expected to hit the market Real Soon Now.

      However, we have to warn you that in testing there have been problems with the UPS (Universal Positioning System) which is required to make sure that the real-time universe model doesn't try to recursively model all the other real-time universe models in the universe. Testers have confirmed that when this happens, the electronics inside tend to blow up.

      --
      For all intensive purposes, "whom" is no longer a word. That begs the question, "who cares"?
    6. Re:Hitchhiker's Guide, Part II. by jo42 · · Score: 1

      > Earth: Mostly harmless.
      Infested by a highly destructive self-centered parasitic virus called 'humanity'.

  3. old news by Silver+A · · Score: 4, Informative
  4. mathamatical evaluation by Molina+the+Bofh · · Score: 2, Offtopic

    It is a mathamatical evaluation of Gnutella

    Someone has not passed his grammatical evaluations at school :)

    --

    -
    Roses are #FF0000, Violets are #0000FF, find / -name '*base*' |xargs chown -R us && mv zig greatjustice
    1. Re:mathamatical evaluation by Molina+the+Bofh · · Score: 2

      Duh, I know it's MATHEMATICAL, but when the story was posted, it read:

      jrp2 sent in a paper written by one of Napster's founding engineers. It is a mathamatical evaluation of Gnutella discussing why the network won't be able to scale up to any reasonable size. I have been impressed with Gnutella in the past, and have wondered along these same lines in the past.

      It was fixed shortly after my comment.

      --

      -
      Roses are #FF0000, Violets are #0000FF, find / -name '*base*' |xargs chown -R us && mv zig greatjustice
    2. Re:mathamatical evaluation by RatFink100 · · Score: 2

      mathamatical is a spelling error not a grammatical one.

  5. Ancient news. by RareHeintz · · Score: 5, Informative
    This story is over 11 months old!

    I mean, I know that none of us - including our fine moderators - are perfect, but are they at least paying attention?

    OK,
    - B

    1. Re:Ancient news. by roguerez · · Score: 2

      Worse, there is even an older article about this subject.

    2. Re:Ancient news. by kraada · · Score: 1

      yes, yes, we all know it's ancient . . .
      my only guess is maybe this is trying to get someone out there to take the initiative and write something that does scale?
      since it still works fine (as I download 2 episodes of TNG), I'm guessing nobody cares . . . but as the lack of scaling becomes an issue, i'm sure someone will get off their arse and write it . . .
      i can't wait to see it, either :)
      in the meantime . . . *shrug* it's better'n napster, so let's use what we've got :)

    3. Re:Ancient news. by SirSlud · · Score: 3, Funny

      Heaven forbid that the /. staff should pack those braincells with friends, family and life rather than the last 24 months of ~= 10 submissions a day! Man, I'm gunna start modding these 'yawn, been there done that' posts as redundant!

      --
      "Old man yells at systemd"
    4. Re:Ancient news. by bbh · · Score: 1

      Even worse, this guy is probably wondering why his server is feeling the Slashdot effect again 11 months after the first time. Watching the hits rise screaming NOT AGAIN!!!!

      So sad.

    5. Re:Ancient news. by amlutias · · Score: 1

      i dunno, maybe because it's their job? and there's a search engine, conveniently provided for all to use?

      back in the old school hobby days of /., nobody would really care, and it wouldn't happen because of the fewer submissions. but /. is now professional, discounting jon katz.

    6. Re:Ancient news. by Brendan+Byrd · · Score: 2

      Sorry, some of us don't remember stories from 11 months ago. They posted it again? Good. Let some of the other people a chance to read it.

  6. Re:old news by Fillup · · Score: 1

    whoa!! holy memory, batman!

    i haven't myself used any of this crap in a long long time. I am wondering when the CLUELESS entertainment industry will just realize that they should provide mp3, etc. I mean, these services/apps are neato, but they all suck! Filled with parasites and slow as hell -- dude, i don't care if you dial up or have a T1...i'd pay a buck or two per song easy.

    I swear, the music industry is a bunch of keystone kops.

    --
    "I think there is a world market for, maybe, five computers." __ IBM Chairman, 1943 __
  7. Re:GPL - Intellectual Theft? by rpack · · Score: 1

    I think that this could be a *little* off topic.

  8. Pay Napster beta testers allowed to speak by DrSkwid · · Score: 3, Redundant

    and on the same day someone from Napster says not Pay Gnutella won't scale

    .

    --
    There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
    1. Re:Pay Napster beta testers allowed to speak by xercist · · Score: 1

      Same day? hardly. This is about a year and a half old. Not to mention slashdot already posted it a year ago.

      --

      --
      grep "xercist" /dev/random ...you'll find me in there someday
    2. Re:Pay Napster beta testers allowed to speak by DrSkwid · · Score: 1

      the editor said it in his editorial part.

      --
      There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
  9. What the hell? by avalys · · Score: 5, Funny

    Heh...somehow I read the title as "Mathematical Analysis of Guatemala", since this article has been posted before and Slashdot never posts anything twice.

    --
    This space intentionally left blank.
  10. The Logarithmic value of the messages exchanged ! by Khalid · · Score: 5, Interesting

    The problem is not that difficult, if you want Gnutella to scale, then you need to avoid the exponential explosition of the number of messages exchanged between the clients as their number grows. The only solution is to structure the network by using "super clients" or "servants" or "super nodes", call them what you want, the later is what KaZaa and Morphus have accomplished; this makes the number of messages exchanged grows in a logarithmic way (this is an outrageous simplification of course, but gives an idea). There are many such expriments with Gnutella two with those ideas, this is what BearShare is trying to do.

  11. Growth of network relates to negative attention by totallygeek · · Score: 3, Interesting
    Everyone knows that Napster was basically a glorified DCC engine rip-off from IRC days of file trading. It made IRC file sharing easy for the average computer user. With the death of Napster as everyone knew it, you still see #mp3 and #mp3tunes and the like on IRC trading files person-to-person like Metallica never existed. I think that when something explodes in popularity you get too many bad people joining in ruining things for the users that are not abusers. When so many people jump on a bandwagon, you get media attention for wrong-doing and that is where the death nail is driven.


    Look at ICQ. It was fairly decent as an instant messaging client until the numbers hit one million or so and then it needed to control everything under the sun and companies could spam through it. File sharing happens through it all the time too.


    I don't care if Gnutella cannot scale to the levels that Napster saw. Smaller is better in my opinion!

    1. Re:Growth of network relates to negative attention by Dixie_Flatline · · Score: 2

      So you'd be perfectly content to receive answers to your queries that originate from your own machine and download your own content back to yourself?

      Smaller is better, so just one user to search must be best of all! And the download rates are incredible!

    2. Re:Growth of network relates to negative attention by AtaruMoroboshi · · Score: 1



      I totally agree with this.

      I use a small peer to peer program that is for a specific subgenre of music, and it has a few thousand users at most. It runs off of donated linux server space and is already buckling under the weight of the number of folks using it, so i'm not going to name it by name, but it's perfect. The ratio of quality music to crap is amazingly high. I regularly download a gig of worthwhile stuff in a day.

      on napster, it was such a pain in the ass, wading through the garbage.

      smaller is frequently better. (like the new imac, heh)
      .

    3. Re:Growth of network relates to negative attention by zerocool^ · · Score: 2

      Smaller is better in my opinion!


      Unless you want obscure stuff. If i hear that XYZ indie punk band has a great album (Self, or The Proms, for example) I want to hear what they sound like before i buy it, because I don't want to order something from CDNnow or whatever and pay $20/cd and $7/shipping or whatever to get a crappy CD (the juliana theory - emotion is dead: thanks for nothin). But i do want to support indie music if it doesn't suck. So for me, it's morpheus, old-skool napster, gnutella, whatever, as long as it is big, i'll check it out.

      ~z

      --
      sig?
    4. Re:Growth of network relates to negative attention by vawlk · · Score: 2, Interesting

      Each request generates a GUID and clients check each request vs a recent history of GUIDs. It prevents any sort of looping.

      Having developed the first host caching application for gnutella, I can say that the author never fully understood how the network worked.

      His equations may be accurate based on how he thought requests and replied propogated through the network, but he assumed every request had a reply.

      It is true that the bandwidth overhead was large, but I rarely used more than 15KB/s during the times when there were 4000+ clients connected. He says that it might not be possible to reach all 4000 people, but in order for me to know how many users were out there, they all replied to my ping, thus searchable.

      Finally, the very nature of the network doesn't lend itself to protocol updates at all. The protocol was extreamly limited, but once it caught on, not much could be done about updating it short of starting an entirly new protocol. You couldn't just shut it down, and thats the major problem.

      Many proposals were written on how to implement a system without the gnutella limitations, and you are seeing them in many different implementations.

    5. Re:Growth of network relates to negative attention by totallygeek · · Score: 2
      Unless you want obscure stuff. If i hear that XYZ indie punk band has a great album...I want to hear what they sound like before i buy it...


      I listen to punk music and have always enjoyed the openness of the companies that sell the music for non-fans and fans alike to listen before buying. Most indie labels have inexpensive samplers or online mp3 download segments from artists. I listen to many obscure punk bands, and almost always there was a venue to hear them before buying. Toxic shock had the Shock Report with floppy 7" recording samplers. Notes in Thrasher Magazine was an excellent review resource. Flipside had samplers. Nowadays you have The Fat Club or Punk-O-Rama. Cheap CD offerings where you get about 10 to 15 different bands showcased. Enjoy!

    6. Re:Growth of network relates to negative attention by zerocool^ · · Score: 2

      Actually, i don't know if you're going to check this post or what, since its anonymous, but I do go to a lot of shows. In the past year, i've seen the dave brockie expierence twice, rawg once, less than jake/ teen idols/ anti-flag 3 times, new found glory 5 times, weezer/ get up kids once, the living end/ american hi-fi once, and i've been to and seen a dozen or so local shows in fredrock, va, and even played in a few. And tonite, i'm driving from blacksburg, va, 6 hours to see GWAR in norfolk/va beach, and then tomorrow i'm driving 2.5 to see GWAR in Winston Salem N.C.

      And you're right, there is absolutely no comparason between a CD and a show. If i can, i tend to buy CD's at shows - it's like LTJ says: "Well I really don't know if it matters at all so, but we try to keep our prices low for records and our shows"

      --
      sig?
    7. Re:Growth of network relates to negative attention by maxpublic · · Score: 1

      While you at it add a dictionary to your list, and ponder why spelling 'school' incorrectly as 'skool' doesn't make you cool.

      (Neither does wearing a baseball cap backwards or sporting one of those stupid little fratboy mini-beards, but we'll save that lecture for another day.)

      Max

      --
      My god carries a hammer. Your god died nailed to a tree. Any questions?
    8. Re:Growth of network relates to negative attention by zerocool^ · · Score: 2

      While you at it add a dictionary

      Did you mean: "While you're at it..."

      --
      sig?
  12. Well, two things. by Anonymous Coward · · Score: 1

    1) Stale, as many have already pointed out.

    2) Irrelevant. Who gives a shit if I can see 'everyone in the world', as long as I can see _enough_ of the world to get done what I want to get done? The way things currently stand, a big chunk of the Gnutella network is 'beyond the TTL horizon' for any given user -- does this actually impede anyone from getting the files they want? It doesn't me, that's for sure...

  13. 20/20 Hindsight by nadaou · · Score: 4, Insightful

    Yes, but..

    It's sort of like calculating the maximum hull speed for steam ships crossing the Atlantic Ocean and saying there is a theoretical maximum speed to intercontinental travel. Then someone comes along and invents airplanes.

    Gnutella will mutate and evolve, and will at somepoint in the future be replaced by something better when it starts to fall over.

    The demand for Ms. Spears and the Backstreet Boys is just too damn strong for things to stand still.

    I enjoyed that this post was next to the announcement that of the new-and-not-so-improved preview of Napster was out..

    --
    ~.~
    I'm a peripheral visionary.
    1. Re:20/20 Hindsight by skelley · · Score: 1

      Great analogy. *Exactly* the lines I was thinking along. No accounting for innovation.

      I was amazed (11 months ago like the rest of you), that a Napster founder would go to the trouble to write a somewhat rigorous mathematical paper slamming an up-and-coming competitor.

      It seems like petty and childish behavior, probably from the stress of desperation Napster was in at the time (and still is in).

      One last thing ....

      .... f##k! RIAA. Sorry. It just slipped out.

    2. Re:20/20 Hindsight by ralphbecket · · Score: 3, Insightful

      No, the complaint is not about the concept, but the gross lack of understanding exhibited in the design.

      There are well known workable epidemic algorithms suitable for P2P that have been around for a long time. They generally provide statistical guarantees of success in return for scalable use of bandwidth.

      Epidemic distributed systems should not be attempted by people who do not grok exponential growth. Planning for somebody wiser to innovate around your mess is not responsible.

    3. Re:20/20 Hindsight by f0rtytw0 · · Score: 1

      I do believe jordan had already left napster at this time. Got hired by some other company, can't remember the name now. Anyway as I remember he was happy to not be constantly working on their servers.

      --
      this is the most important sig ever! In your face 446154!
    4. Re:20/20 Hindsight by skelley · · Score: 1

      "Planning for somebody wiser to innovate around your mess is not responsible."

      You mean like most major software vendors ?

      Be real, product grow and mature over time (or die). Products need different models to be affordable and scalable over time.

      Early products are are designed to be functional first and scalable second. Do you not build that first release because it is not scalable to the Nth degree.

      If so - start lining up Windows(any version), Apache, Squid, various TCP/IP stacks, Linux, Oracle, etc., etc. in your sights.

      How come I'm not using *your* kick-ass P2P system right now ?

    5. Re:20/20 Hindsight by jallen02 · · Score: 1

      Thats the thing, you can call it slamming, but it isnt really. If it is a technically sound paper that proves why, and does so in a professional manner, its all good. I would like to think of that as "education".

      Jeremy

    6. Re:20/20 Hindsight by ralphbecket · · Score: 1

      If your design brief specifies scalability to internet sized communities then you'd better take that into account early on in the design. Like security, it isn't something that can be effectively retrofitted.

      The point I'm trying to make is that for a very large number of engineering problems there will be existing work done on the problem which should be examined before leaping in feet first. I thought this was one of the things CS degrees were supposed to teach.

      In this case it seems that due attention was paid neither to the design objectives or the literature on the subject.

    7. Re:20/20 Hindsight by CaseyB · · Score: 2
      It's sort of like calculating the maximum hull speed for steam ships crossing the Atlantic Ocean and saying there is a theoretical maximum speed to intercontinental travel. Then someone comes along and invents airplanes.

      More to the point, it's like doing that TODAY, when airplanes already exist. Nobody is currently advocating flat p2p systems like the old gnutella in favor of supernode systems like FastTrack or extended gnutella.

      Of course, this paper was written over a year ago, but it shouldn't be news to anyone now.

  14. Well that's that then. by QuantumG · · Score: 1

    Obviously all these mp3s I have on my harddrive and listen to every day must have got onto my harddrive using some other file sharing program. Maybe I actually purchased all this music but supressed the memories for fear that I was supporting the music industry. This could explain why I'm so broke! The fact that I type in the name of any song that happens to cross my mind during the long fits of programming (usually accompanied by everything from rap music to beethoven) and it inevitably gives me a list of results in the hundreds is proof enough that the network "scales". When I look at my network stats and see that the small number of files I am sharing (about 150) each have hits in the hundreds of thousands, even though I restart my server at least twice a week, shows that I'm definitely contributing to the network. Surely this article is just a case of sour grapes.

    --
    How we know is more important than what we know.
    1. Re:Well that's that then. by bonk · · Score: 1

      Do you not understand the meaning of the word 'scale' when applied to computers?

      Basically, it takes significantly more bandwith/cpu/whatever in a peer-to-peer network as the number of users go up. It is not as simple as "10 times as many users cause 10 times as much bandwith". They cause far more than that.
      Just because you are still able to pirate songs whenever you feel like it isn't proof that the system is scaling well. Surely your post is just a case of ignorance.

      --
      I hope to die peacefully in my sleep like grandpa, not screaming like his passengers.
  15. gnutella by flynt · · Score: 3, Interesting

    On the topic of this program, a more current story running on msnbc.com right now is telling how it is becoming a severe security risk for users of the program. Here is the article.

    1. Re:gnutella by pyramid+termite · · Score: 3, Insightful

      This isn't a case of hackers getting into people's systems, it's a case of people who don't understand their own computer's directory structures sharing a bunch of files they shouldn't, unless there's something I missed in this poorly done news story. The real security risk here is not Gnutella, it's ignorance. I know the manual for Win ** is very thin and sketchy, but directories are covered in it.

      It's depressing to think that a lot of people put their computers on a network without even understanding basic concepts like this. (It's even more depressing to call tech support at an ISP and realize you understand more about the problem then they do, but now I'm rambling.)

    2. Re:gnutella by ImaLamer · · Score: 2

      This is very interesting since I got into people's hotmail boxes because the cookies were on the network.

      It was quite simple. Search Gnutella for text files containing the @ sign.

      But one quick question: Would a Linux gnutella program let me share /etc ? I don't know, and I don't want to try it. I'm guessing as long as you are running the program as a regular user you may not be able to actually read the files.

      But still: search for the @! There are plenty of cookies on gnutella for download. The funny thing though is that most users seem to be on dial-up.

    3. Re:gnutella by damiam · · Score: 2
      I'm guessing as long as you are running the program as a regular user you may not be able to actually read the files.

      On my Debian box (and my RedHat partition) just about everything in /etc is world-readable except for /etc/shadow. Not that people are really gonna be interested in a copy of my /etc/init.d/apache file anyway... It's /home you really have to worry about.

      --
      It's hard to be religious when certain people are never incinerated by bolts of lightning.
  16. Gnutella's spawn by PureFiction · · Score: 5, Informative

    What I find most interesting are the kinds of projects that have sprung up in Gnutella's wake. Many of these started out as attempts to improve Gnutella, and have since moved on (the Gnutella Next Generation working group never really materialized into anything)

    We had napster and one extreme, gnutella at the other, and in the middle a re a number of partially centralized systems with super peers like Fast Track, such as:
    Open FT
    JXTA Search
    GNet
    NEShare

    and many others...

    Then there are the alternative projects that use an entirely different mechanism. For example, social discovery as implemented in:
    NeuroGrid
    ALPINE

    Or distributed keyword hash indexes like:
    Chord
    Circle
    GISP
    JXTA Distributed Indexing

    And many others as well.

    The coming year(s) will see a lot of maturity in these areas, and searching large peer networks will become ever more efficient over time. Gnutella showed us the possibilities of a fully decentralized model, and refinements of its underlying architecture can produce vastly better solutions.

    2002 will be an interesting year for peer networking applications...

  17. Re:old news by guttentag · · Score: 1, Redundant
    Well, it is in the it-warrants-mentioning department.

    Besides, it's only appropriate to follow up a "heralding the death of Napster" discussion with a "Napster founder predicts the death of Gnutella" discussion.

  18. Even better by sulli · · Score: 1

    this comment - it was ancient news then!

    --

    sulli
    RTFJ.
    1. Re:Even better by (outer-limits) · · Score: 1

      Maybe if the /. crew had time to write code to prevent dupes and improve quality and functionality, we would all be happier. As it is, most of the recent work appears to be getting wasted on dealing with a few idiot trolls. I for one would like to see 'ongoing' stories, that evolve over time, rather than new topics having to be created to deal with each new development.

      --

      Microsoft - Where would you like to go today, Maybe Jail?

  19. Slashdotted? by Eric+Seppanen · · Score: 2, Informative
    Are you feeling Slashdotted? Server got that not-so-fast feeling?

    Maybe you should be trying the Google cache!

    --
    314-15-9265
  20. OT: Quick! Earn that Karma! by Cheshire+Cat · · Score: 3, Offtopic

    Since numerous people above have pointed out this is a repeat, everyone should browse the older article and repost all the comments that were modded up to +5, and reap the benefits when that karma comes rollin' in! ;)

    --

    Last night I shot an elephant in my pajamas. How he got in my pajamas I'll never know.
    1. Re:OT: Quick! Earn that Karma! by mister+sticky · · Score: 1

      Done and done.
      _____________________________________________
      Re:I challenge you... (Score:5)
      by PureFiction (coderman@mindspring.com) on Wednesday February 14, @12:56PM (#431980)
      (User #10256 Info | http://cubicmetercrystal.com/)
      I am nearing completion of a network that satisfies a, b, c, e.

      I havent started on d and f, but they could be added.

      This project is called The ALPINE Network
      http://cubicmetercrystal.com/alpine/

      It scales linearly, and provides a query mechanism that rivals the performance of a centralized directory. (Although the bandwidth is more than a centralized query, but at least you have direct control over how much bandwidth you use and how).

      At any rate, I could use development assistance a great deal. Let me know if anyone is interested.

      Regards...

      99.44% "You are the product of a mutational union of ~640Mbytes of genetic information."

    2. Re:OT: Quick! Earn that Karma! by snarkh · · Score: 1


      Am I right to understand, that your comment had been posted before?

    3. Re:OT: Quick! Earn that Karma! by AiX2 · · Score: 1

      Since numerous people above have pointed out this is a repeat, everyone should browse the older article and repost all the comments that were modded up to +5, and reap the benefits when that karma comes rollin' in! ;)

      Nah, too much work. I'll just copy and paste from the current thread. Slashdot moderators won't know the difference.

  21. Well, duh by Anonymous Coward · · Score: 1, Interesting

    Anyone who understands how Gnutella works (unfortunately, too few people) knows that Gnutella is horribly broken, will never work, and is basically unfixable.

    The more relevent question is whether you can have a peer-to-peer network without central servers that *can* scale. And the answer is "no".

    However, the REAL question is whether you can have a peer-to-peer network with decentralized servers, i.e., with clients that automatically establish a heirarchy among all the clients, and certain clients become more "server like". They only way to make a Gnutella work is by making it heirarchical, but the heirarchy needs to be automatic for it have the same general "virtual network" aspect of Gnutella.

    Is it possible? I don't know. You would probably have to have automatic bandwidth measurements, depth probes, all kinds of things to make it work. I simply don't know if it would be possible to automate something like that.

    1. Re:Well, duh by /ASCII · · Score: 1

      Heh... This is a copy-and-paste from the previous article about Gnutella. Don't mod it up.

      --
      Try out fish, the friendly interactive shell.
  22. FUD FUD FUD Re:gnutella by Anonymous Coward · · Score: 1, Funny

    Wow, that's the biggest load of steaming bullshit I've come across in quite some time.
    Tonight on Dateline: Dangerous computer hacker terrorists steal your files by... um, you sharing them on the Gnutella network like a fucking moron...

  23. Re:moochers + crap == worthless by Jeff+Probst · · Score: 1, Insightful
    I was putting out all the CPU time and bandwidth

    Did you use Linux as the server? There are known limitations to Linux, the main ones being:

    1. it sucks
    2. it sucks
    3. it sucks
    Hope that this clears things up.
  24. Re:moochers + crap == worthless by Guppy06 · · Score: 1, Troll

    Maybe next time you should look for material that's a bit more popular among servers and draws a slightly more highbrow collector. In other words, lay off the barnyard sex movies. :)

  25. Re:The Logarithmic value of the messages exchanged by zerocool^ · · Score: 5, Informative


    The only solution is to structure the network by using "super clients" or "servants" or "super nodes", call them what you want, the later is what KaZaa and Morphus have accomplished...

    This is exactly the point. This is the only way to properly distribute querys, as anyone who has set up a multi-homed ISP knows. It works on the same principle as BGP routing, i.e. there are routers (super-nodes, or whatever) that have a specific number (an ASN - or in P2P, the supernode address) but there are thousands of computers (casual modem users - p2p) on the internet that these routers have information about. If BGP routing worked this way, nothing would go anywhere. However, by having several nodes giving out information on who has what and how to get it, while the majority of users just download and give out their own info, not pass along info of others, things work much smoother. And with a correct implementation, everyone could have a route to everyone's file list at a minimal bandwidth useage.

    --
    sig?
  26. Re:moochers + crap == worthless by moncyb · · Score: 2, Insightful

    I was putting out all the CPU time and bandwidth and getting nothing in return.

    This is one of the biggest problems with P2P file sharing programs. Nearly everyone wants great content for free, but very few are willing to give back and supply any of it.

  27. Re:Repeat ? by aka-ed · · Score: 1

    Yep it sure was, quite a while ago, and at the time it was first published and acknowledged by the Gnutella crowd, work began in earnest to resolve the issue.

    That work resulted in research like this, and to major changes in Gnutella implementations.

    --
    I survived the Dick Cheney Presidency 7 to 9 AM 7-21-07
  28. Choice by Wheaty18 · · Score: 2, Insightful

    There is a major flaw in all P2P software, and it has nothing to do with the coding. More people tend to want to take than recieve. I remember seeing a line graph on LimeWire's page (I think?) that showed a monthly progression of the number of people sharing files compared to the number of people downloading files. The 'downloaders' were outweighing the 'uploaders' by a HUGE ammount.

    If everyone was willing to share their files, then there would be no such problem with P2P programs.

    1. Re:Choice by Ando[evilmedic] · · Score: 1

      Sorry, but the downloaders download from who?

      Wouldn't the # download(er)s equal the # upload(er)s?

    2. Re:Choice by Gaijin42 · · Score: 2

      Uh. Your logic is whack. People are downloading more than they are uploading. If people offer more files, it will increase the number of downloads. (More people want b-spears-naked.jpg than HAVE b-spears-naked.jpg)

      If this were not true, essentially, people would be offering files that nobody wanted, and that would just be stupid.

    3. Re:Choice by AMuse · · Score: 2

      Uh. Your logic is whack. People are downloading more than they are uploading

      Logically, if one person is downloading then another person (peer to peer) is uploading.

      How do people download more than upload?

    4. Re:Choice by chicolindo · · Score: 1

      Correct me if I am wrong but...
      Isn't downloading equal to downloading?
      And uploading equal to uploading?
      You aren't uploading to a centralized network!?
      People are just scooping up files from your box?
      Right??!!

    5. Re:Choice by Peyna · · Score: 2
      I believe that the point is that there are few people who make their machines available for others to download from. The number of downloads taking place will be equal to the number of uploads taking place, obviously.

      If more and more people use the server only to receive files, and do not make files they receive available to others, then in the end, the people who were making their files available to others will no longer be able to, or they will have to severely limit the bandwidth going out to those who are taking the files.

      The only way to avoid this would be to have nodes that are there simply to retrieve as many good quality files as possible and offer them up for download. But then, it's not really P2P anymore, is it?

      --
      What?
    6. Re:Choice by mshiltonj · · Score: 2, Informative
      When searching for content on the gnutella network, a lot of the results come back showing the host as 192.168.* 10.0.*, which means they are behind a firewall or otherwise not directly routable. In such a case, the user may be unable to correct it, or unaware of the issue entirely

      I was like this for about a week before I realized why I wasn't getting any uploads. I had to open up port 6346 on my home network (linksys router). Also, Napshare lets me "force local ip" to my firewall/ external ip (assigned by RoadRunner). The linksys router does port forwarding on outside requests, so only one computer on my home network can share on that port.

      This thread reminded me that RoadRunner had expired my old ip address and assigned me another and I had forgotten to update my gnutella client to reflect to new ip. So for the past few weeks or so, I had been one of the "non-sharing" people by simple oversight.

      I doubt most limewire/bearshare users know any of this stuff. When running a gnutella client from work, people couldn't do this even if knew about it and wanted to.

      There is a major flaw in all P2P software, and it has nothing to do with the coding. More people tend to want to take than recieve. I remember seeing a line graph on LimeWire's page (I think?) that showed a monthly progression of the number of people sharing files compared to thenumber of people downloading files. The 'downloaders' were outweighing the 'uploaders' by a HUGE ammount.

      If everyone was willing to share their files, then there would be no such problem with P2P programs.
    7. Re:Choice by Robotech_Master · · Score: 2

      Don't look at me, I always make my coupla gigs of stuff available for upload on the services I use.

      Problem is, though, the cable company caps me to a 128 kilobit per sec upstream, so there's an imbalance there that I can't do anything about.

      But I do what I can!

      --
      Editor Emeritus and Senior Writer, TeleRead.org
    8. Re:Choice by Johnny00 · · Score: 2, Informative

      Thats exactly how eDonkey2000 works!

      While your downloading a file, it's immediately made available for upload from you. It uses resume download to download parts of the file you want from multiple sources, some of which don't have all of the file yet too.

      --
      I live life on the edge ... of my desk.
    9. Re:Choice by Jhan · · Score: 1

      So, keep track of bytes downloaded per byte uploaded over the last few weeks for every user. Make the 50 users with the highest ratio appear in every client, with a smack the leech button. Nodes then offer less bandwith to much-smacked peers. Some dispensation should be made for new users.

      This has all been discused in the recent article about punishment as guarantee for altruism

      --

      I choose to remain celibate, like my father and his father before him.

    10. Re:Choice by Gaijin42 · · Score: 1

      I meant that there is a one to many relationship between uploaders and downloaders. Of course the actual number of uploads will be equal to the downloads. I mean that a given file will be downloaded multiple times, by many different people, from one source.

  29. Re:I need to know... by kilgore_47 · · Score: 1

    Kind of funny that the napster guy predicted gnutellas death oh-so-long-ago, and now napster is the bad joke that wont die and gnutella is still going strong. Who's laughing now, Mr. Ritter?

    --
    ___
    The way to see by faith is to shut the eye of reason. --Ben Franklin
  30. gnutella doesn't scale??? by npietraniec · · Score: 1

    Duh. OpenFT does though. giFT needs help testing thier network.

  31. Correction, Taco by bjtuna · · Score: 2

    have been impressed with Gnutella in the past, and have wondered along these same lines in the past.

    I think we could add:

    "... but since I was too busy doodling and writing dirty, hackish perl when I was in school, I'm glad someone else did the actual math."

  32. Massive DDOS by ThomasXSteel · · Score: 1
    Perhaps the most disturbing thing I've come to realize after reading the article is that any yutz with a text editor, a compiler, and a knowledge of the gnutella protocol could soak the net with a massive DDOS attack by firing off large numbers of search queries with a high TTL.

    Am I missing something or is gnutella a big risk to internet stability/security?

    1. Re:Massive DDOS by vawlk · · Score: 1

      it already happened. One of he original gnutella clients had a bug that would take requests with a TTL of 1 and set it to 254.

      Other people hacked, but luckilly this was prevented by quick updates to the software. Clients soon had filters to kick out bad requests, illegal TTLs, spammed searches, etc...

  33. Why Napster isn't P2P. No, Really. by Omega · · Score: 4, Insightful
    This suggestions of this article are quite thought-provoking, but they also illustrate an interesting point: Napster really isn't P2P.

    In theory, a true Peer-to-Peer file transfer network would exist in a decentralized fashion where you would never have to query a central host for routing or file availability. Napster requires you to route through one of the Napster servers for information. Even introducing Napigator still doesn't alter the Napster model because all it does is allow you to route through a different central host. It seems that all Napster did was integrate a search engine and nameserving into one element (coming from only one provider).

    This isn't to knock the accomplishments of Napster, it was certainly an original idea to incorporate these areas and provide a GUI access client to boot. But it is apparent that Napster developers weren't all that revolutionary in their thinking either.

    The suggestion of true P2P is revolutionary, and the perfect implementation (should it ever arrive) will also be revolutionary. But the Napster model is no different than everyone providing their MP3 list to a website who maintains a list of links on where to download MP3s. Napster simply automated this process. Napster is no more P2P than any TCP/IP connection not operated through a proxy.

    Is http P2P? I'm talking directly to another system, and there is no moderator/mediator. Normally, I have to find out about that system from a 3rd party (e.g. a search engine) -- just like someone obtains a list of links from Napster.

    True, I'm being no better than the author of the original article; because I too am offering no solutions. I'm just holding out hope for true P2P in the future.

    1. Re:Why Napster isn't P2P. No, Really. by mozkill · · Score: 1

      dude. wake up. the author of this paper is the co-founder of Napster!

      --

      -- Betting on the survival of the media industry is a serious risk. I advise investing elsewhere.
    2. Re:Why Napster isn't P2P. No, Really. by rho · · Score: 2

      True Peer-to-Peer networking is two "golden shower" porn stars exchanging business cards.

      --
      Potato chips are a by-yourself food.
    3. Re:Why Napster isn't P2P. No, Really. by telbij · · Score: 2

      The suggestion of true P2P is revolutionary, and the perfect implementation (should it ever arrive) will also be revolutionary.

      Wait a minute, I thought the point of this article is that true P2P can never scale. More to the point though, are you claiming Gnutella isn't true P2P? I don't know enough networking theory to argue one way or the other on this though.

      True, I'm being no better than the author of the original article; because I too am offering no solutions. I'm just holding out hope for true P2P in the future.

      From my understanding of the article, the scalability problems are directly linked to a raw P2P scheme. You can't hope for a more 'pure' implementation of P2P to solve that problem. P2P has already fulfilled it's promise: a de-centralized file-sharing service that can't be easily shut down by any one party.

      Technologically P2P isn't all that impressive though, at least not from a theoretical standpoint. The challenge now is to maintain sufficient decentralization while pragmatically improving the technology. Limewire has shown some recent success in this arena despite having a long way to go still.

      Time to dub a new term such as Multi-Tiered P2P (MTP2P) with the theoretical goal of connecting the most computers using the least bandwidth. Certainly this would require a degree of centralization that would be more vulnerable than a fully equal P2P system, but if done carefully in an open protocol would still be immune from legal attack.

  34. Re:moochers + crap == worthless by aarondyck · · Score: 1

    And what would you suggest as an alternative? There's always a Microsoft product, if you don't mind rebooting every ten minutes and having to perform countless daily updates to prevent crackers from taking down your system...at least Linux is stable and relatively hack-proof.

  35. So, you got a better solution? by 2Bits · · Score: 1, Flamebait
    Everyone seems to jump on the gun to say this is old news, blah blah...

    If it's such an old news, how come no one on /. has come up with a solution to fix it? Or better yet, a better architecture and protocol?

    I didn't get a chance to read the whole paper, as it stalled half way thru (usual /. effect!). But it looks like a quite serious analysis paper to me. We should at least give him credit for that, there's not many OSS developers who write serious analysis papers, after all.

    1. Re:So, you got a better solution? by coltrane99 · · Score: 2, Funny

      I have one, but there isn't enough room in the margin here to write it down...

    2. Re:So, you got a better solution? by gorilla · · Score: 2
      If no-one has fixed a potential problem, that can only mean one of two possible causes:

      1: No-one with the suitable skills has looked at the problem, and done the work to reimplement it.

      2: No-one is experiencing the problem, so there is no pressure to fix it.

  36. broadcast on O(n) time and O(1) space by shakazulu · · Score: 1

    There is more than one way to do broadcasts.

    Certain forms of randomized routing offer a
    guarrantee of a broadcast to complete after
    O(n) time for n clients in O(1) time with high
    probability. And there are trade-offs in
    between.

    I suggest you all study algorithms for
    broadcast before assuming that the most
    bandwidth intensive algorithm will be the
    only one used.

  37. Re:old news by Tattva · · Score: 1
    Don't forget here and here.

    --
    personal attacks hurt, especially when deserved
  38. Is there a limit to the gnutella horizon? by eyefish · · Score: 3, Interesting

    I'm not very familiar with the deep technical details of Gnutella, but isn't there a limit on how far the "horizon" is (i.e.:how many users near by you can see)? If this is correct, all the mathematics here presented apply only in theory and not in practice, as what will happen is that (1) most queries will not be relayed past a "reasonable horizon", and (2) there exists a good (or high?) probability that as long as you're searching for "popular" files, that you will eventually find them.

    Because of this basic and simple observation, I do not foresee gnutella to die anytime soon because of scalability reasons alone (however copy-protection issues are another story).

    Again let me stress that my observation here is based on the strong assumption that the "search horizon" is "reasonable sized" so as not to have to search the whole gnutella network.

    1. Re:Is there a limit to the gnutella horizon? by dragons_flight · · Score: 2

      The limit is his 'T' factor in the paper. By default it's set to 4, but as he points out it would have to be set to around 7 if you were ever going to see the whole network and what everyone has shared if there were a million users (ala Napster). Even considering the T=4 setting the article's results still show that the original Gnutella set up is a horrible bandwidth hog.

    2. Re:Is there a limit to the gnutella horizon? by Error27 · · Score: 2
      The other poster nearly got it right. The horizon is a combination of T (the number of times a query is repeated before it dies) and N (the number of connections each computer has). The values for the first client were T=5 and N=4. I'm fairly sure that most clients must have raised the value for T.

      His chart called "reachable users" describes how the horizon grows as T or N change.

      I think now that there are normally over 1000 people in your horizon possibly up to 8000.

      The other thing about the article is that it was written before clients started caching replies and that changes your horizon around quite a bit.

      Quite frankly caching the replies probably helps but the Gnutella protocol is still awful.

      I'm more impressed with Morpheus as a decentralized file sharing network. There is an open source Morpheus client called "gift."

      The weird thing is that the only way to get documentation about how Morpheus works is to download the source tarball for gift and poke around in the READMEs. There is no other public documentation for it any where on the net.

      Basically it sets up tons of little mini servers that index songs for up to around 300 people. Clients have a list of these servers and query them to find files. If you want to a "horizon" of 6000 computers then you only have to make 20 or 30 queries. In Gnutella (without caching) the same horizon would be 6000 queries. No one really knows what it would be with caching and it changes depending on whether it's a popular query or not.

      Actually Gnutella in that case is much worse than just 6000 queries because many computers have no songs shared and are stilled searched where in Morpheus computers that don't share songs do not index. And another thing that makes Gnutella worse is that I think the replies are relayed multiple times instead of just once.

      I'm not a gift developer or user myself... But I would say it was a far better way to go than Gnutella.

  39. They've been saying this for two years now by LM741N · · Score: 1

    Gnutella has always been usable for me. Even after the Napster collapse. But its kind of like stamp collecting, you have to put some work into finding the item you want. There is no guarantee that the song you want will be there today, but it might be there tomorrow, or an hour from now- who knows?

  40. Re:and while we're on the topic of moderation by slugfro · · Score: 1

    Good point. This often happens. However the post which you reference as being"(Score 5: Informative)" did include a link to the old slashdot article. So it was more informative than the first post. But you should be happy to know that the first post has since been rapidly gaining points!

    --

    -- Find the Truth...
  41. Even worse! Incosistent math! by hodeleri · · Score: 5, Informative

    If you read through this research paper it'll start with N=4 and T=5. As you continue to read through the paper he quotes bandwidth figures from his table using various other N and T values.

    For example, in the very last table (Bandwidth rates for 10qps) he says the bandwidth generated will by 8GB/s, which align with N=8, T=7. Where you to use the N and T values from the beginning, this would be 2.4MB/s, which is off by 3143 and one third times.

    Going back to Joe User's Greatful Dead query, it only generates ~250KB, not 800MB.

    Remember, very very few people are going to modify their TTL or open connections. This ``white paper'' grossly misstates the amount of bandwidth Gnutella generates and seems to be an anti-Gnutalla paper designed to mislead rather than an honest and fair judgment

    1. Re:Even worse! Incosistent math! by pyite · · Score: 1

      :s/Greatful/Grateful

      --

      "Nature doesn't care how smart you are. You can still be wrong." - Richard Feynman

    2. Re:Even worse! Incosistent math! by Alfred · · Score: 2, Informative

      You have missed the point of the paper. The N=4 and T=5 values equate to a maximum reach of 484 clients for a query. To reach "napster" sizes of 1million you need T=7 and N=8.
      That explains the different numbers.

      However, I do agree that a couple numbers seem to be plucked from mid-air, but the argument and maths seems fine :)

    3. Re:Even worse! Incosistent math! by ChazeFroy · · Score: 2

      Additionally, either my eyes are deceiving me or the beer I am consuming is clouding my thought processes.

      Look at his first chart and the "N = 2" case. Why is this only being incremented by 2 each time (2, 4, 6, 8...)? Shouldn't it be multiplied by 2 each time (2, 4, 8, 16...) just as "N = 3" is multiplied by 3, "N = 4" is multiplied by 4, etc?

    4. Re:Even worse! Incosistent math! by AbsoluteRelativity · · Score: 2, Interesting

      Not only are they less likely to modify TTL, but the need to do so for popular content is not that necesary, the more popular the content the more people will download and share the download with others. Sort of a user based way of propogating media across the internet (while Freenet does it automaticly).

      --
      disclaimer : My views do not represent those of every one else in slashdot.
    5. Re:Even worse! Incosistent math! by ajknott · · Score: 2, Informative

      I thought this was an error originally as well but it is not. When you have 2 connetsions (N=2 globally) and a node sends a request to the first two nodes, each of those nodes has 2 connections, one to the original requesting node and one to a new node. Thus the second level nodes can only send the request to 1 new node each.

    6. Re:Even worse! Incosistent math! by demultiplexer · · Score: 1

      Indeed, the values for N and T are silently increased and the figures you get for the defaults are much more reasonable.

      Thanks to that same N and T, Gnutella has an inherent limited visibility of 484 nodes by default. Use the author's own queries/user figures, we find that any area of the network is hit by 0.3 queries/second, a long way short of the quoted figure; the traffic flowing through it is proportionally lower at 74KB/s. As a consequence there cannot be "hot spots" in the network, the traffic is smoothed out.

      The total calculated traffic of 2.4MB/s (N=4, T=5) is absolute peanuts when it is spread over the entire Internet. The fact that a Gnutella net is rarely balanced tends to decrease, not increase, the bandwidth consumed. So in real life you won't reach even that.

      What's more, with an increasing number of users, the amount of traffic generated increases linearly. This isn't bad scalability - that is about as good as it gets! Where Gnutella scales badly is when network visibility (N and T) increases. That has nothing to do with the number of users though.

      - Peter

  42. This is well and truely FUD. by BeBoxer · · Score: 5, Insightful

    As I pointed out last time this was posted, this article is basically 100% FUD. Yes, the amount of traffic goes up. And no, gnutella doesn't scale very well. But the author goes out of his way to make the problem look worse than it actually is. You see, the article only computes the total amount of traffic in the entire network. A number which is both huge and meaningless. You see, by this math, if I send a packet somewhere and it takes 10 hops, well, thats like sending 10 packets!

    At the end of the paper, the author coughs up the big scary number of 63GBps of traffic in the Gnutella network when the nodes each have 8 connections and are using a TTL of 8. Wow! That's a lot of traffic. That certainly isn't scaling! Well, what the author never points out is that, by his own math, the network has 7,686,400 users at this point! When we divide up the total traffic among all of those network links, we get a different view. If you do the math you discover that this is a whopping 72Kbps! Oh no! It's the end of the world! Well, no, it's not. True, it's more than a modem can handle. But it's well within the reach of most cable modem connections. Given that your computer is being expected to handle the search requests of over 7 million other people, it's not that much traffic.

    Don't get me wrong, I agree that Gnutella doesn't scale all that well. But this paper is just plain FUD. The only number that really matters to users is the total bandwidth load on their pipe. By carefully avoiding that number, which isn't very big and scary at all, the auther is clearly lying by ommision. Given all of the real problems networks like Gnutella encounter, it isn't interesting to read this sort of drivel. Why don't we drag out Mathmatical and model how much bandwidth Napster wastes by transmitting the names of all the files being shared even though most of them will never get searched for. Hmmm. lets assume 7,000,000 users. Let's assume that they each share 1000 files with an average filename length of 32 characters. Why, that's 224 Gigabytes of data, and we haven't even done any searches yet! Cleary, Napster doesn't scale. Ugh. This guy might know how to use Mathematica, but I still suspect he worked in the Marketing department. With the same guys who will tell you about their 200Mbps fast ethernet.

    1. Re:This is well and truely FUD. by swillden · · Score: 2
      Huh? That's easy. Square brackets are for function parameters (whether you're defining use using the function), curly brackets are for sets/lists (nest them to get matrices, tensors, etc.) and parentheses are for grouping in arethmetic expressions.

      There are no exceptions, no corner cases, nothing unusual. If you said he has a hard time figuring out when to use commas and when to use semi-colons, that I could relate to, or when to use '=' and ':=', or especially how to write pattern substitution rules that always do what you think they do, but the brackets are really, really straightforward.

      --
      Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
  43. who is the author of this paper? by mozkill · · Score: 3, Interesting

    its important to know that the author of this paper is Jordan Ritter, who is the co-founder of Napster.

    --

    -- Betting on the survival of the media industry is a serious risk. I advise investing elsewhere.
    1. Re:who is the author of this paper? by Peyna · · Score: 1

      Maybe because it was noted by the poster of the article in the first place?

      --
      What?
    2. Re:who is the author of this paper? by mozkill · · Score: 1

      actually... right after i posted this message, the text of the article changed to include "written by one of Napster's founding engineers" . that was not there before! i think someone changed it.

      --

      -- Betting on the survival of the media industry is a serious risk. I advise investing elsewhere.
    3. Re:who is the author of this paper? by maxpublic · · Score: 1

      I read this as "John Ritter" and now I've got that damned song stuck in my head, the one that goes "Come and knock on my door...."

      Max

      --
      My god carries a hammer. Your god died nailed to a tree. Any questions?
    4. Re:who is the author of this paper? by mozkill · · Score: 1

      oh man! why'd you have to go and do that to me too!

      --

      -- Betting on the survival of the media industry is a serious risk. I advise investing elsewhere.
  44. Scaling is not the issue by Black+Art · · Score: 1, Offtopic

    Quality of files is the problem.

    There are a number of problems using Gnutella.
    Getting a complete and/or undamaged file is difficult. (Especially anything long.)

    Just because you find a file does not meanyou can get it. Huge numbers of the files "available" on Gnutella are either on non-routable addresses or on servers that refuse connection or timeout.

    Many of the files on Gnutella are misnamed or misattributed. Do a search on "Weird Al", for example. You will get all sorts of responses, few of them actually by Weird Al.

    It is useless for getting files that contain multiple parts, unless tared or the like. (For example, getting a complete album is next to impossible. The unreliability of the service ensures that.)

    Gnutella seems to have nothing to insure any sort of "quality of service" or file intergrity.

    Pretty much a waste of time.

    --
    "Trademarks are the heraldry of the new feudalism."
    1. Re:Scaling is not the issue by Strepsil · · Score: 1

      (For example, getting a complete album is next to impossible. The unreliability of the service ensures that.)

      Funny - just the other day I managed to collect a large chunk of the early Pink Floyd albums off Gnutella after I was hit by a wave of nostalgia and had nothing to play my old records on.

      It took me about 3 hours to locate and download the following albums:

      • Piper at the Gates of Dawn
      • A Saucerful of Secrets
      • More
      • Atom Heart Mother
      • Obscured by Clouds

      None of these could be said to be particularly current. The only track I had trouble finding was one of the ones on Obscured by Clouds. Trying my search again 10 minutes later got it for me.

      I've never had trouble at all getting complete albums using Gnutella.

      I agree with the misattribution problem though - but that's because a lot of people are morons. It's hardly the fault of the network. "Classic" Napster had exactly the same issues. It's hardly specific to Gnutella.

    2. Re:Scaling is not the issue by beanyk · · Score: 1

      Some kind of user moderation might be useful, then? Like "this source/individual only has crappy quality rips/doesn't attribute properly - look elsewhere for what you seek".

      Of course, then the user ratings would have to live somewhere central/trustworthy. Bah.

    3. Re:Scaling is not the issue by maxpublic · · Score: 1

      These objections apply to every p2p or pseudo-p2p service I've ever tried. It's a problem with how the tech is employed by clueless users, not the tech itself.

      Remember who we're catering to: people who're too stupid to get on IRC and grab their porn and music via DCC. Think about that for a moment.

      Max

      --
      My god carries a hammer. Your god died nailed to a tree. Any questions?
  45. Re:The Logarithmic value of the messages exchanged by Tattva · · Score: 1
    I agree that it is a possible solution, but it is not the only solution. Caching popular searches, having each node keep a cache of a random selection of adjacent nodes, and any number of other creative strategies to avoid the dumb graph crawling (granted, loop detection and timeouts do work) of gnutella may make it as useful as napster was in terms of breadth of material available.

    --
    personal attacks hurt, especially when deserved
  46. 20/20 Hindsight?! by mizhi · · Score: 2

    Sorry, but when Gnutella first came out, and I looked at the protocol, I thought to myself, "Gee, this is nice, but when that graph of connections starts getting highly connected and you have all those people spitting out queries and forwarding others there is going to be a humongous sucking sound as the bandwidth is taken." No, I didn't read a paper or do the math, but anyone with a basic grounding in graph theory and computer science would see the shortcomings immediately. Yeah, it will evolve and should since I like this kinda stuff... but it wasn't exactly rocket science. :-/

    --
    Humorless sig goes here.
  47. cryptic by Tattva · · Score: 1
    outdated

    cryptic

    --
    personal attacks hurt, especially when deserved
  48. I have problems with this 'analysis'... by rmckeethen · · Score: 3, Insightful
    I find it disturbing that the author neglects to mention some critical, and to me anyway, obvious points. Let's talk about just two, bandwidth usage and client optimizations.

    First, if I understand what he's driving at correctly, the bandwidth numbers he gives are for the Gnotella network as a whole, not for each and every client connected to it. This is equivelent to saying "average 'HTTP' usage generates n amount of bandwidth over the Internet", or "DNS traffic will consume x number of bytes on a given network". So what? Would anyone be really shocked if 7,000,000 web browsers generated HTTP and DNS traffic in the gigabyte range? Doesn't bother me. That might be an interesting number to your ISP but as a user of Gnotella I could care less about how much total bandwidth my query for 'The Grateful Dead' takes up. It sure sounds like alot of traffic, but it's distributed over the entire Gnotella network. As long as the traffic isn't high enough to overwhelm individual clients I don't see the problem. These numbers just don't seem to be that important, or am I missing something here?

    The other item the author fails to consider (and I'm going to guess that, as one of the engineers behind Napster, he probably knows better) are client-side optimizations like search caching and differentiation of the clients. The caching arguement goes like this:

    If client A sends out a query to client C looking for 'Grateful Dead' and client B sends out a very similar request to client C , say, 'The Grateful Dead', even basic caching would prevent client C from sending this request back out to the same hosts that responded to the first request made by client A. Again, am I missing something important here? I'm not sure that caching would reduce the traffic dramatically but I'd be willing to bet that it would improve matters significantly, especially for clients that remained 'up' for long periods of time (which is in itself another important factor that seems to be missing here). This just seems so obvious.

    There are bunches of optimizations like this that can be done with the Gnotella application to reduce the overall bandwidth. And this leads to the other half of my point, i.e. the author assumes that each and every client will be functionally the same. They aren't. The Gnotella FAQ tells you to reduce your N if your on a slow connection. This means that not all Gnotella clients are exactly the same now anyway; some have higher N's than others. The FastTrack guys (i.e. KaZaA, Morpheous, et. al.) have already shown that it makes sence from an efficency standpoint to have some clients do more then others via 'supernodes' and the like. This seems like a fairly obvious development on the client side and I can't for the life of me understand why this isn't addressed. I mean, really, isn't the 'client-client' vs. 'client-server' approach really the underlying assumption behind why Napster will scale and Gnotella won't?

    I hate to say it but it looks to me like the author is showing just a little bias here. Hey, I suppose that if I worked on a competing standard I'd trash-talk the competition too but I think his time would be better spent making the Napster approach work better. No matter how you slice it or dice it Napster is pretty much dead while the Gnotella network is still alive and kicking. Maybe it will never scale to 'billions and billions' of hosts but at least it's still around and going strong.

    1. Re:I have problems with this 'analysis'... by maxpublic · · Score: 1

      That's "billllyuns and billllyuns...."

      Max

      --
      My god carries a hammer. Your god died nailed to a tree. Any questions?
  49. Morpheus is better by xX_sticky_Xx · · Score: 1

    Not because of any difference in the clients (they're virtually identical and are both on the Fasttrack network) but because it doesn't contain the spyware that Kazaa does.

    --

    ---

    I didn't want to leave this space blank.
    1. Re:Morpheus is better by xX_sticky_Xx · · Score: 1

      I forget all of the spyware that it uses but I do know that it uses Cydoor for caching ads locally. Adaware will detect all of the spyware modules Kazaa contains.

      --

      ---

      I didn't want to leave this space blank.
  50. He is right by secondsun · · Score: 1

    Gnutella is a network HOG! Of course I was a idiot who decided to install LImeWire supernode on my PC and ran it for about 3 days before getting tired of it. The next week I am informed that my Internet is being discontinued until Feburary 4 (I live on campus). When I get around the BS from network services I find out that I was using 80% of my subnet and 10% of the outbound traffic from Gnutella. Of course, they see all the porn that is trafficed in Gnutella and assume it is a porn service. After explaining to them what Gnutella was they basically said well you weren't a porn server but still you are in trouble.

    SO I am at the LIbrary typing on /. while my PC in my room coolects dust.

    --
    There is nothing wrong with being gay. It's getting caught where the trouble lies.
    1. Re:He is right by secondsun · · Score: 1

      Well thanks to the fact that I am taking 22 hours and serveral online courses, I don't have time to do that. (And Yes I HAVE Explored the Madelbrot set, done mazez, played solitare, Written video games, and text adventures). Also, what I did with computers in the 1980's was play astro-grover on my C-64. Yes, I was 5 when the 80's ended.
      But your points were noted.

      --
      There is nothing wrong with being gay. It's getting caught where the trouble lies.
  51. A possible solution to the scaling problem by Bake · · Score: 2, Interesting

    How would a P2P with the scaling the likes of which IRC networks use?

    Since I believe IRC scales pretty good why not build the Gnutella network like that?

    1. Re:A possible solution to the scaling problem by lewp · · Score: 1

      Because IRC networks rely on known central servers, the same way Napster did. The IRC model is the exact thing these decentralized networks are trying to avoid.

      The RIAA, MPAA, etc. can shutdown 50 central servers the same way they can shutdown one. It may take a bit more effort, but that's what they pay their legal teams for.

      --
      Game... blouses.
  52. Clarification by Wheaty18 · · Score: 1

    It's quite simple. Someone might be offering a file, and 50 people are all downloading that file. If you download BearShare (a Gnutella client), it even has a little check box that says something like 'Share Files'. The fact is that more people are downloading than uploading.

  53. Not only is this old, it is outdated by codemachine · · Score: 3, Informative

    There were several responces to this article pointing out that the current Gnutella network is much more scalable than the one discussed in the article. Try looking here and here for articles discussing the changes since early 2000.

    Come on Slashdot, its 2002 not 2000. It looks pretty bad accepting this article right after the Napster one. Does Slashdot or VA own a stake in Napster or something?

  54. File Integrity and Multi-Source Downloads by Orasis · · Score: 1

    There is a new Gnutella standard extension called HUGE that will will fix a number of this file integrity and reliability problems. I think Bearshare is very close to releasing an implementation.

    There is also a sister specification to HUGE entitled the "Content-Addressable Web" which is for performing distributed downloads of content from normal web sites, and is thus not Gnutella specific. The CAW specification is available at http://onionnetworks.com/caw/

    --
    Justin Chapweske, Onion Networks
    http://onionnetworks.com/

  55. New algorithm needed at the connect phase? by _Knots · · Score: 2, Interesting

    Here's a really wacked-out thought I had that I've been working on.

    Gnutella clients can sometimes have more "potential" connections out to the network than MAX_CONNECT (because they open, say five, expecting two and get four). If so, why not do a traceroute to each of the hosts and crop out the one that is the most hops away? Iterate cropping until there are MAX_CONNECT active connections.

    This would tend to favor a network that closely reflected the underlying structure of the network - thus reducing any earth-shattering impact on the inet backbone?

    To further force a short-inet-distance perhaps clients should (optionally) not accept connections from far-flung hosts?

    Additionally, clients should count already-seen packets (which they are supposed to drop) against the goodness of a given link - thus reducing routing loops in the network and forcing it to flatten out instead of clump together.

    These might allow clients to have a higher TTL without increasing net net (har har) bandwidth - less duplicated, circularly-routed, lengthy-path, etc, data.

    I suspect (have not checked) that some clients do the latter (routing loop prevention), but I know of none doing the formers.

    I will get around to coding this soon, unless somebody can tell me it's a stupid idea (for a good reason).

    --Nathan

    --
    Anarchy$ dd if=/dev/random of=~/.signature bs=120 count=1
    1. Re:New algorithm needed at the connect phase? by gorilla · · Score: 2

      Traceroute is very expensive in terms of time, requiring a lot of packets to be sent out, and waiting on replies and on Unix based OS's, requires the program to have root privligages. It also doesn't really solve the problem. A server 7 hops from you can be on a congested link, or running on a really slow server, while a server 9 hops from you can be on a fast server with lots of bandwidth.

    2. Re:New algorithm needed at the connect phase? by _Knots · · Score: 1

      Yes, it's expensive to traceroute. But it's a one-time per-link cost. Seems reasonable to me, though maybe it isn't.

      I should clarify: traceroute hop count should not be the *only* statistic used when determining whether to make or drop a link. I think it should, however, be one. Include ping time for some sense of congestion on established links, include malformed packet count, include looped packet count, etc.

      Wouldn't a gnutella network that "flattened out" to mimic the underlying structure of the routers be better than one that insisted in creating links that spanned more routers? In the latter the odds of a given router handling lots of streams seems likely, whereas if we keep the connections netwise nearby, packets can potentially stay off the backbones for a hop or two and keep ISP's upstream traffic down (answer to all those "you do know your ISP can't handle all of its users at full bandwidth at once" issues).

      For instance: A and B are on the same trunk of a cable modem, C is on a different trunk handled by the same ISP, and D is on an entirely different network.

      In theory, nothing prevents gnutella from establishing any particular link setup. Ranging from "everybody connects to one host" (worst case here would be D) to A-B-C-D linkage. Wouldn't "nudging" the network to A-B-C-D instead of [ABC]-D be better? In the latter, every query from A,B,C,or D results in at least three (QUERY) packets that cross the cable-modem's upstream connection. (for instance, A. A->D, D->B and D->C).

      Now watch what happens if the network has flattened out to mimic the routers. A sends a query to B. That's on the same trunk! B forwards on to C. That doesn't hit any backbones - it's just ISP internal traffic. C forwards on to D. Ok, we hit the backbone *once*. Not three times.

      An answer from D will hit the backbone again. Answers from B or C will not. In the first "Everybody pile on D" senario, answers from B or C would have hit the backbone TWICE! (B->D, D->A)

      I remain convinced that tracerouting and "Flattening out" (it's my term, is there a better one) for router mimicing "Oh, a host with 250ms latency that's six fast routers away. I'll use it until I can get a closer one." is better than "OH, Look, there's a host I have no clue where it is but it's ping latency is 250ms. Let's go there." A ping latency of 250ms could be a modem right next to you (no backbone hitting, though slow) or a host across one of the big ponds and your ISP happens to have a favorable peering agreement with somebody who owns a cross-pond fiber line. Perhaps ping would urge the network to flatten out, but it does not seem to have a sufficient push if any.

      Perhaps the goodness function on a link should be something like:

      goodness() = pong_responses - ping latency - hops - malformed packets - duplicate packets. Perhaps with tuning coefficients on each term. More terms might be good, too. Drop links with the worst goodness if we have too many of them and/or drop links whose goodness value drops below a given threshold. (One may also wish to drop links based on other functions, like say just on malformed packets - any more than X or Y% of them and we drop the link).

      This should also work with the "ultrapeer" stuff that's crept into the gnutella protocol (on which I pass no judgement, not being familiar enough with it.)

      Maybe I'm barking up the wrong tree.
      --Nathan

      --
      Anarchy$ dd if=/dev/random of=~/.signature bs=120 count=1
    3. Re:New algorithm needed at the connect phase? by gorilla · · Score: 2
      Wouldn't a gnutella network that "flattened out" to mimic the underlying structure of the routers be better than one that insisted in creating links that spanned more routers?

      In theory, sorta yes. I say sorta, because the actual number of routers isn't important, it's more to do with the end to end reliability and bandwidth and transit time. It doesn't matter if there are 2 routers, or 28 routers, as long as all the other things are equal. However it's very complex to actually determine what are the 'best' peers to use. It certainly can't be done simply by hop counting. If I have A & B both 4 hops from a common point, and C 3 hops from that common point, then A to B is 8 hops, while A to C is 7 hops. So even though the link from the common point to C might be the most congested link, it's still the closest in terms of hops. It also can't be done by ping timing. Ping timing with normal (small) ping packets in a non-congested network is mainly affected by how fast the routers can pass on packets, which is largely independant of the speed of their links. In order to give a good indication of how fast the links will actually be, you have to simulate traffic with the same size packets, and transmitted at the same time intervals. In addition, the dynamics of the network mean that what was the right answer now, might not be the right answer in 10 minutes time or so.

      The upshot is, if I was to do this from scratch, I think the easiest & most effective way would be for the client to open N connections, and periodically check to see how 'good' those connections are. It would then drop the worse X connections, and re-establish them to different peers. After a period of time, you'll hopefully find that you've got N-X stable connections to the best possible peer, and X connections which are fluculating.

    4. Re:New algorithm needed at the connect phase? by _Knots · · Score: 1

      I think we're fighting different issues.

      You sound like you're fighting congestion. I'm worried about overloading the networks because we were not responsible with watching to whom we connected.

      There's nothing, I agree, a client can do if other clients are making a given link congested aside from not use that link.

      But I still think it's important that the network should not make unnecessary total traffic by selecting hosts without at least including as a minor factor their traceroute distance.

      This is, again, not an exclusive thing. Seems almost genetic to me: if the network's fine in terms of load then we needn't worry. But if the network is getting congested it makes more sense to hit fewer routers (lower connection count, pull the other end closer, find another route [not likely]). So if we have two links that are perfectly identical, shouldn't we drop the one that's further away? Your n-x idea, in essence.

      Maybe I'm still confused, but I think you misunderstand me. I don't want the network to exclusively do this - I just want there to be a minor "evolutionary nudge" towards a flattened out network.

      Hey, thanks for all the responses.

      --Nathan

      --
      Anarchy$ dd if=/dev/random of=~/.signature bs=120 count=1
    5. Re:New algorithm needed at the connect phase? by gorilla · · Score: 2
      You sound like you're fighting congestion. I'm worried about overloading the networks because we were not responsible with watching to whom we connected

      What is the difference in your eyes? To me, that's two different phrasings of the same thing. An overloaded network is a congested one.

    6. Re:New algorithm needed at the connect phase? by _Knots · · Score: 1

      True, an overloaded network is congested. There's nothing (or little) we can do about a congested network link that is carrying traffic besides ours. I'm not trying to necessarily reduce congestion on a given link - I'm just trying to cut down on total packets traversing the internet.

      Reading your responses I am becoming less sure that this is a worthy goal.

      --Knots

      --
      Anarchy$ dd if=/dev/random of=~/.signature bs=120 count=1
    7. Re:New algorithm needed at the connect phase? by gorilla · · Score: 2

      It's not a goal I'd be worried about. Given the choice of a link which is running at 40% capacity with 100,000 p/s and a link which is running at 99% capacity but 50,000 p/s, I'd rather use the first link. Of course the client can't tell this, but if using a UDP based protocol, it can monitor the number of packets dropped, and with either TCP or UDP, it can tell the transmission speed.

  56. P2P could be or not.. by f00zbll · · Score: 1
    Aside from all the math, there are other reasons why P2P programs will fail. If the financial analysts are right and unlimited bandwidth is coming, then P2P will happen. On the otherhand, there have been a lot of articles about obstacles preventing broadband.

    From a financial perspective, analysts believe unlimited bandwidth is coming because it typically costs the phone company about 40 bucks to service an account. For a typical low end cellular account ($20/month), the company looses money. The same rules apply to cable, dsl and normal phone line. If the progress of broadband comes to a halt and reverses, all of that math means absolutely nothing.

    Who care is Napster scales better when no one has broadband access. Really people, P2P technology is heavily dependent on broadband access to a large percentage of the population. If broadband gets priced above the average Joe's budget, do you think it will really matter? If only the top 5% of the population have broadband, the numbers cited in the article are meaningless.

    Time to look beyond the statistical FUD and look at the real issue at hand. The architecture of gnutella will only be a factor if unlimited bandwidth comes to the masses.

  57. Re:The Logarithmic value of the messages exchanged by Lumpy · · Score: 3, Interesting

    and I hate to say this, but take an idea from the windows networking world... each machine has an election to see who is going to be the master browser (based on average connected and up times.. the clients that are up and serving the longest and with the shortest down times historically) then we have the next few building the same master browser database but sitting dormant (just listening and cacheing) until the master browser disappears, then the next highest pipes up and says "ohhh lookie me!" thus keeping a master server up (and that master server could load balance with the sub servers by just sending a "busy use 127.0.0.2 or 127.0.0.3" back to the client.

    it could be fixed, and made powerful, self scaling.

    --
    Do not look at laser with remaining good eye.
  58. Re:The Logarithmic value of the messages exchanged by Adam+Fisk · · Score: 2, Informative

    LimeWire currently implements a variation of this -- what we call "UltraPeers." UltraPeers establish a significantly greater horizon on the network, and there are other distributed protocols that do this in other creative ways, such as Chord out of MIT, which can be found at: http://www.pdos.lcs.mit.edu/chord/ That aside, there is significant evidence to show that a distributed network can scale far better than any centralized network. Remember that Napster had serious scaling problems as well -- you could only see the files from the hosts on whichever server you happened to be logged in to. The only solution to that problem is the brain-dead purchase of a yet faster multi-million dollar server. I would not call that scaling. As everyone else has pointed out, this discussion began and ended in the Gnutella community about a year ago.

    --

    Adam Fisk

  59. Gnutella as a DDOS tool by zbuffered · · Score: 2

    If what he says is true, that you could generate 14 megs' worth of responses, what's to stop me from forging my IP address to be YOUR IP address, querying for the string mp3, and sitting back and watching the carnage? There would be almost no way to trace this, and it would certainly generate a significant amount of traffic, so what's to stop me? Maybe his statistics are a bit inaccurate, but all the same, you could cause a lot of data to be sent somewhere, while not causing yourself any significant lag at all.

    --
    Synergy is your friend
    1. Re:Gnutella as a DDOS tool by vawlk · · Score: 2, Insightful

      read the protocol spec, and you would understand why you can't do this. You don't reply directly to a request. You send it back through your connections and the clients you are connected to only accept replies with the correct information.

      If you had 8 connections and a request comes in from 1 of them, only that 1 connection would accept a reply with the request's guid. The IP information is taken directly from your connection.

  60. Re:GMy Program by Pinky · · Score: 1

    HEY! you forgot My program: Myster... :-)

    It would probably fall under the catagory of "social discovery" but I'm working on a proxying system that would allow many nodes to pool their resources under one node, effectively implementing a kind of super node.

  61. Ms. Spears and the development of Internet by porky_pig_jr · · Score: 1

    sometime in a future we'll all realize that Ms. Spears in fact was the major driving force behind the major Internet-related multimedia innovations. Including the recent IETF proposal on multimedia chat attachments. (not that I'm going to start listening to her music, of course)

    1. Re:Ms. Spears and the development of Internet by pacc · · Score: 1

      Not only music, but music videos and
      soon probably the virtual Ms. Spears.

      Well even science it seems: Britney Spears Guide to Semiconductor Physics

  62. And you can play DIABLO 1 without hacks :) by CrazyJim0 · · Score: 1

    I developed GNUTELLA in tandem. But I designed it mainly to have a theortical online RPG imagine "DIABLO 3" which wouldn't lag to hell like DIABLO 2(central server), but at the same time not incur the data hacking of DIABLO 1(client side).

    How's it work? Basically everyone stores OTHER people's characters on their computers. Its client side, but not on your side of the client :)
    If you log on, and everyone contests your new found power, then likely you didn't get it by honest means.

    There are conspiracy theory and stuff that would give this problems... But people marked as potential problems would be more closely monitored... And if you continued to be abusive, you get kicked out of the play group.

    I went on to note that if there was some way to get some IP seed with IP lists, then everyone could connect outside of a central server.

    When I heard of Napster, I automatically jumped and thought my idea could be adapted to file sharing, but lo and behold it was known for at least a year.

    Lots of main idea things come off in tadem.

    But there is definately a TON of GNUTELLA spin off uses... And most of them involve lowering the overhead to compete with corporate giants.

    You're using the client's power and bandwith to lessen the dependence on a central server's

  63. model, schmodel ... by porky_pig_jr · · Score: 1

    as the saying goes, 'All models are wrong. Some of them are useful.


    So assuming this model is useful, the question is: "useful for whom?"

  64. Re:The Logarithmic value of the messages exchanged by Patrick · · Score: 5, Informative
    The only solution is to structure the network by using "super clients" or "servants" or "super nodes", call them what you want, the later is what KaZaa and Morphus have accomplished; this makes the number of messages exchanged grows in a logarithmic way

    That's not logarithmic. If every client node connects to a "super node," and every other "super node," then what you have is a two-level tree. Growth at each level is O(sqrt{n}), not logarithmic.

    Chord, a p2p research project from MIT, is truly logarithmic. Go read their SIGCOMM'01 paper for an explanation of how their system works.

    --Patrick

  65. Oh come on! by Sanity · · Score: 3, Insightful

    If you are going to criticize a paper, do so on the basis of what they are claiming (there is no shortage of support for the claims he is making), not with conspiracy theories about the author's motivation.

    1. Re:Oh come on! by DrSkwid · · Score: 1

      you might notice no criticism in my contribution

      in fact I didn't claim anything or even make any suggestion of improprietry or otherwise

      I simply observed a phenomena

      --
      There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
    2. Re:Oh come on! by Sanity · · Score: 2
      you might notice no criticism in my contribution
      in fact I didn't claim anything or even make any suggestion of improprietry or otherwise
      I simply observed a phenomena
      Well, I won't argue with you as to what your intention was, since only you know that, but most readers would reasonably conclude that you were implying some form of improprietry even if it was not explicitely stated.
    3. Re:Oh come on! by DrSkwid · · Score: 1

      In the absence of proof it's not for me to offer conjecture.

      Now Napster is to go pay it's in their interests for Guntella to be denigrated.

      I thought it was interesting, therefore, that someone chose for these stories to appear almost next to each other on the front page.

      tbh. it's probably just a bit of associative submission queue scanning.

      .

      --
      There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
  66. And there's room for improvement, no less... by DarkEdgeX · · Score: 5, Informative

    Plus the author ignores (mostly due to the fact that they didn't exist back when it was written, this IS an old article) the innovations made with Gnutella (and other, newer competing technologies). Specifically, there are now 'search proxies' that exist on Gnutella that cache and return common queries, thus not saturating the network with redundant queries. For a modem user, this makes the network usable if they limit their connections to proxy servers, because the number of searches hitting their client directly shrinks as common queries are sifted through.

    Not to mention there's still room for improvement to the protocol itself-- there's no reason a proxy couldn't cache a list of all files shared by a connected client, then answer queries directly, NEVER sending a query directly to a client. (Ultimately, as people run proxies like this more and more, you'd end up having proxies talking directly to eachother.) The ultimate Gnutella proxy would cache commonly requested files and make them available over a bigger pipe.

    No money in it, but for the Gnutella enthusiast, I could see them running this kind of thing from work off of a QA box, for example, or from their support desk at an ISP. =)

    --
    All I know about Bush is I had a good job when Clinton was president.
  67. Freenet has addressed this issue from day one by Sanity · · Score: 3, Interesting
    The scalability issues with Gnutella are clear to anyone who understands how it works. From day one, Freenet was designed with scalability as a core goal. In Freenet, the number of nodes involved, and the time required to retrieve a piece of information, scales logarithmically as the size of the network increases.

    A good analogy might be a detective trying to find a suspect for a crime. The Gnutella approach is akin to going on TV and asking everyone in the area to let you know if they know who did it. It may work once, but the more you do it, the less effective it is. Freenet works as detectives do normally, they gradually home in on their suspect by gathering information, and using that information to refine their search.

    Some say that Freenet only achieves this scalability because it doesn't do the type of "fuzzy" search Gnutella does. You need to know exactly what you are looking for in Freenet to find it. This isn't true, the Freenet searching algorithm can be generalised to allow fuzzy searching. While this has not yet been demonstrated in practice, it is definitely possible in theory.

    It always amazes me that people continue to lament flaws in many current P2P architectures when Freenet has incorporated solutions to those problems almost from its inception.

    Disclaimer: I am Freenet's architect and project coordinator, so you could be forgiven for thinking I am biased, but you are free to review our papers and research to decide for yourself.

    1. Re:Freenet has addressed this issue from day one by npietraniec · · Score: 3, Insightful

      So... How come after... 2? years Freenet hasn't become a standard or even a well known in the file-sharing world? I'm not trolling, I'm curious. Napster has come and gone, gnutella has come and gone, Now we have fasttrack... Meanwhile, the freenet site just chugs along...

    2. Re:Freenet has addressed this issue from day one by donglekey · · Score: 2

      Freenet is technically superior and very cool. The last time I tried it though it had a cgi web page based UI. That and nothing worked. It was a while ago, but I had and still have a lot of hope for freenet, but it just does not need to be that complicated. The idea of dedicating space that is separate from the actual files is a cool idea and opens a lot of doors but most people will just see it as wasted space. If I want to share 1000 oggs I am not going to want to dedicate a duplicate 5 gigs just to share them on Freenet, and that made me cringe because it is a weekness to getting content out there. I am off right now to check if changes have been made, like I said those were the problems I saw a long time ago.

    3. Re:Freenet has addressed this issue from day one by Sanity · · Score: 2
      Simple - because it isn't ready for public consumption yet. 2 years isn't long for a project like Freenet - look at how long it took Linux to reach wide acceptance, in many ways Freenet is a more complex project since, unlike Linux, it isn't just a reimplementation of code that is already out there, it is a completely new concept.

      Secondly, Freenet isn't really a file-sharing app, despite receiving much inaccurate publicity as "the next Napster". It isn't well adapted to sharing mp3s, nor should it be given its goals.

      We will be releasing 0.5 soon, it will be a huge improvement.

    4. Re:Freenet has addressed this issue from day one by Spy+Hunter · · Score: 2
      It always amazes me that people continue to lament flaws in many current P2P architectures when Freenet has incorporated solutions to those problems almost from its inception.

      There is one reason, and one reason only, why this occurs: There is no Freenetster. No P2P file-sharing app that allows you to easily search for and download music/movies/etc. As soon as there is one, Freenet will explode (assuming it really is as scaleable and such as it is made out to be). You want Freenet to be popular? There's only one thing you have to do...

      --
      main(c,r){for(r=32;r;) printf(++c>31?c=!r--,"\n":c<r?" ":~c&r?" `":" #");}
  68. Re:What uncertainty? by DarkEdgeX · · Score: 3
    It has historical value but it no longer has any value as a file sharing protocol.

    You're either an idiot, a karma whore, or both. Go ahead, download Bearshare (or any Gnutella based client) and look at the number of query responses you get to common queries-- oh look, most of them are RIGHT. Sure, you get some dead hits, but most of the time this is someone who's behind a firewall that doesn't have their client setup to work behind said firewall (or can't, because of a non-static IP address).

    Saying it's "historical" is just flamebait though, but I guess I answered the call like an idiot. As the original poster said, it's all FUD-mongering. As your post is.

    --
    All I know about Bush is I had a good job when Clinton was president.
  69. Totally NOT true!!! by Jagasian · · Score: 5, Informative

    Obviously you haven't used GNUtella for the past year. Xolox is a GNUtella client that allows for parallel downloading, resuming, and Xolox will even look for other sources of the file that you are currently downloading, if the current sources are too slow or down. Basically, with Xolox, you search for a file that you want, and you get results with numbers by them depicting how many sources have the file. That way you don't have to decide which source you want to download from. You decide which file you want to download... and Xolox figures out the rest.

    My average download speeds on Xolox are around 160Mbs. Of course, I am use the ever so crappy AT&T cable modem service... so other people on faster DSL lines will most likely experience faster downloads.

    Next thing you are going to tell me is that Windows is better than Linux because Linux doesn't have any good GUIs or desktop environments for it. Yeah, lets just ignore everything thats out there right now.

    Not only that, but Limewire also supports multisource, segmented, or swarmed downloading. Though Limewire has only recently gotten such functionality, while Xolox has had it for the past year.

    Oh, and GNUtella is free as in beer and as in speech.

    1. Re:Totally NOT true!!! by Afrosheen · · Score: 2

      I think most of us that are familiar with gnutella are using sLimeWire which sucks, or gnapster with the gnutella servers, which also sucks. I'm glad there's a good alternative.

    2. Re:Totally NOT true!!! by |<amikaze · · Score: 1

      160Mbs? That sounds pretty fast.. probably faster than your NIC can handle, unless you have one of those uber-31337 gigabit cable modems.

    3. Re:Totally NOT true!!! by evil_roy · · Score: 1

      gnutella has nothing to do with GNU.

    4. Re:Totally NOT true!!! by burts_here · · Score: 1

      but has anyone noticed the bandwidth hit you get from running Gnuttela clients, leave it running for a few hours and i find that it's tacking up 1.5k a sec, dosen't sound bad but my sister is usually running it at the same times thats another 1.5k, I'm only on a 65K isdn line, thats quite a lot of my bandwidth...

      --
      Burt "Out of my mind back in 5 minutes"
    5. Re:Totally NOT true!!! by Jagasian · · Score: 2

      Doink! Fingure-Freudian-slip there. I meant "kbs", not "Mbs". Sorry bout that. The intent should have been obvious though, to people like you anyway.

  70. Download the new LimeWire by EllisDees · · Score: 2

    Seriously. The latest version (2.1) seems to have solved quite a few of the problems outlines in the 'study'. Anyone who is doubting the scalability of the protocol should give it a try.

    --
    -- Give me ambiguity or give me something else!
    1. Re:Download the new LimeWire by Tazzy531 · · Score: 1

      What your missing is that it is not a problem of the client but of the technology behind it. As the paper discussed, there is a number of scalability problems inherent within gnutella. Unless these are resolved, any new client will inherate these flaws.
      It's the same thing as saying a person building the fastest boat but only using a paddle. No matter how you design the boat, the paddle is your limiting factor

      --


      _______________________________
      "I'm not Conceited...I'm just a realist..."
    2. Re:Download the new LimeWire by EllisDees · · Score: 2

      Limewire does address these problems. Instead of every single client forwarding every single request among every attached client, they have implemented what they call 'ultrapeers'. Each of these clients is capable of handling the searches for up to 75 non-ultrapeer clients, and each 'normal' client usually connects to 3 of the ultrapeers, while the ultrapeers try to maintain 6 or so connections to other ultrapeers and older clients. They are changing the way not the network works, not just building a faster client.

      --
      -- Give me ambiguity or give me something else!
  71. the big picture by trapper7 · · Score: 1

    Is it just me or is everyone missing the big picture here?

    When I do my search for whatever the heck it is I don't expect or want 10 million results. Searching every user is almost always pointless.

    If you actually step back and think about it a distrubuted network like this functions perfectly as a few thousand overlapping smaller networks of 2 or 3 thousand users. This way each person's own mini-network is centered around him.

    A excelent side effect of this is it makes the content self filtering as dud material won't propagate far. This is because the user will delete it once they realise its a dud, hence stopping it moving on into neighbouring mini-networks. With a big centralised network like napster all it takes is a few extreamly fast clients offering decoy crap to much everyone around.

    Purhaps if no result is found in the immeadiate mini-network then the query could be passed a bit further away, but you would never need to query every last person. Its highly unlikely you would be looking for something so rare that only one person in a million has it...

    -trapper-

  72. Heh by autopr0n · · Score: 2

    Well, I would find that funny. Of course, you don't really need to be an economist to know the new napster will fail, just like you don't need to be a computer scientist to know the Gnutella network was fucked (at least in it's original conception)

    --
    autopr0n is like, down and stuff.
  73. Re:moochers + crap == worthless by BrookHarty · · Score: 2

    No. You cant download when all your upstream is being used.

    If there was bandwidth capping as the default this would help. Also need to fix resume. Basically, put a decent client that has QOS built in by default, and can resume files from multiple sites. I never had a problem uploading, but when I want a file on a modem, and only 2 have the file I want, its mostly likely 1/2 way during the download, the user will log off. I have a directory of incompletes that never get resumed. Also, I have to connect to a large (again) LARGE amount of hosts to find the file I need. Its like finding a needle in the haystack. This is where a directory service like napster kicked ass. Finding the file.

    But then if you want britney spears mp3s you will find thousands of hits...

  74. Transparent Proxy by Uncle+Dazza · · Score: 2, Interesting

    Has anyone considered that a transparent proxy might be the solution, or at least a partial solution?

    The internet is more of a tree than a net, at least for the smaller ISP's. So a site can run a transparent proxy that aggregates all it's gnutella clients, and only maintain a few outbound connections for the entire site, as opposed to a few per client. In addition, incoming gnutella connections are intercepted and directed at the proxy (which is essentially another gnutella node).

    This allows ISP's to limit the number of gnutella connections to the rest of the world. In fact, it would be best for them to connect only to other ISP's using a proxy as well.

    This would tend to greatly improve query response time for nodes that are close by, but on the other hand would make it harder to create connections to remote nodes, because that control has been moved from the client to the proxy.

    But an office or an net cafe or school could run the proxy and have a single link between it and the ISP's proxy, instantly connecting the site with all the ISP's users and cutting bandwidth considerably.

    Proxy's can do other things to accelerate searches. If a request for "Grateful Dead" has been forwarded, then there is no need to forward the same query string in the immediate future (say 1 minute). And of course the is the option of caching the file transfers themselves...

  75. Supernoding's other advantage by billstewart · · Score: 3, Insightful
    In addition to reducing the growth from exponential to sqrt or logarithmic or whatever, the other big advantage of supernodes in a Gnutella-like network is that you can limit supernodes to systems with fast network connections, while regular peers can be on slower network connections, which is a serious bottleneck in a network that needs to send all queries to all peers to be successsful.

    Of course, building an indexing system that scales arbitrarily is difficult, and building an indexing system that recognizes local topologies is also critical. A typical problem universities had with Napster was that if N people at the school wanted a given tune, most of them would be likely to fetch it across the school's limited outside bandwidth instead of most people fetching it from other sites on the fast LAN after the first one or two had downloaded it across the limited part. Napster was able to reduce this problem, at least at some schools, because having a centralized indexing service means that they can enforce more locality by making it easiest for people to find nearby peers. A decentralized system *may* be able to accomplish this, but it's a lot harder.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  76. Finding obscure stuff by billstewart · · Score: 2
    If all you're looking for is popular material, it's easy to get by with queries going to small numbers of people. Surely somebody else at foo.edu or somebody else in New Zealand has that Metallica album - no sense in going far away for it. But if you want something that's not common, you'll have to look more places to find it; if you want something that *nobody* has, you'll have to look everywhere :-)

    Or if you're looking for something more complex, you'll get better results by checking more places. For instance, I once searched Napster for every recording of a given Irish folk song - the versions done by the Chieftans got lots of responses, but some of the other bands who'd recorded it only got one or two, and they were performed entirely differently. Or if you're looking for live Grateful Dead performances, used in the paper's example partly because sharing them is legal, you'll probably find most of the albums on one music-sharing net or another, and the few hundred or a thousand best (or best-taped) shows they did, but you may be looking for that random show you attended in 1971 to compare how they played Dark Star with how they played it a few years later and to see how much of your memories were affected by the mood you were in (ok, or the drugs you'd been taking :-)

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  77. No solution needed. by Roadmaster · · Score: 1

    two reasons:

    1- The paper was written by someone at Napster. That's like someone at Ford writing a paper on how Chevrolet passenger transports can't scale.

    2- The math has been reviewed by people (at least here) and found to be flawed.

    So, it's probably a nonexistent problem, and the fact that gnutella keeps working and the whole internet hasn't slowed to a crawl because of it, is proof that maybe there's no solution needed.

  78. What about Mac OS X by corporatemutantninja · · Score: 1

    Ok, I'm convinced that I should stop using the Gnutella network. But what should I use with OS X if it's not LimeWire?

    --
    Actually, I was trying to be Insightful, not Funny.
  79. Mods, pour some karma on that guy by Sloppy · · Score: 1

    That's a good idea. There would probably be legal problems for the ISPs in the contributory/vicarious aspects, but technically, that would be a good improvement.

    --
    As copyright owner of this comment, I authorize everyone to defeat any technological measure which limits access to it.
  80. All fine and good but... by Jason_Knx · · Score: 1

    What actually will a distributed network do for business


    A business depends on relyable information and information sharing. If anyone can add info to the network to be searched through then it wont "fly" inmost businesses.


    Why, because any business that has any quality assurance, needs a relyable information source and needs to know that source and it has to be controled.


    I ran across a problem in this in trying to convince some clients to supply a company I worked for with CAD files instead of paper if they had them. I was told getting the files wasn't the problem but they were goverment agencies and making sure they were the current controled drawings was. They all told me if I could solve that then they would do it immediately as it would greatly reduce the load on hard copies they had to make.


    In the engineering world if you wanted a drawing of some item and could go into a Gnutella like client and do a search for it would have been great. It would save time in redrawing it yourself, which could be considerable hours depending on what it was. But the question is where you got the drawing and how can you be sure it was current and accurate. There's no quality checks. Even if you knew the source how did you know it was the most current. If all the engineers/drafters that were connected to the network had a client on there computers what if the drawing they had was out of date.


    Controlled networks with proper TTLs and maximum hops can work fine despite these numbers. But quality of info still needs to be determined and examined.


    In the end I kept coming up with a single or system of central repositories to check against but seemed to contradict the thinking of a distributed network with no central database to check against.

  81. RIAA by pclminion · · Score: 2
    From the article:

    From above, a whopping 1.2 gigabytes of aggregate data could potentially cross everyone's networks, just to relay an 18 byte search query. This is of course where Gnutella suffers greatly from being fully distributed.

    Actually, I think the RIAA suffers more, since there's no one to sue.

  82. Re:What uncertainty? by DarkEdgeX · · Score: 3

    Easy-- this is where proxies come in, and proxies that talk to other proxies across the world (ie: proxies that intentionally look for and connect to other proxies, essentially sharing information across a larger umbrella). I think you're referring to the issue of TTL in query packets, and a proxy can solve that problem as well (especially, as my other post in this same thread indicated, the proxy and protocol can be extended to immediatly poll clients upon connection for what files they're sharing (and thus, never pass queries on to clients directly)).

    Yes there are chances to optimize the protocol, but it's all fairly basic-- Kazaa's technology isn't that far removed from Gnutella. Supernodes (which is basically what I described above, a 'proxy/cache') are the next logical step to the Gnutella spec.

    --
    All I know about Bush is I had a good job when Clinton was president.
  83. Not just the moderators... by untulis · · Score: 1

    While we're at it, let's give anti-kudos to jrp2 who submitted this fine repeat. Where's the friend/foe button so I can ding him too?

  84. Re:old news by willfe · · Score: 1

    *sigh*. Guess we can't ever talk about something more than once around these parts.

    --
    Read my stuff.
  85. giFT enters network testing by Robotech_Master · · Score: 3, Informative

    It's worth noting that giFT/OpenFT just entered its first stage of network testing--and with that in mind, they need as many people as possible to download and run the client so they can test the network. Complete instructions for so doing are given on the website.

    --
    Editor Emeritus and Senior Writer, TeleRead.org
  86. Re:The Logarithmic value of the messages exchanged by dildofire · · Score: 1

    from a technical purely perspective, the supernodes idea makes a lot of sense. but wouldn't that just give the RIAA a spot to attack to shut down the network?

  87. His paper is flawed and misleading by kzadot · · Score: 1

    His article is similar to the following:
    Cars suck because they don't scale, if we build them to carry more than 1000 people, they get too big and heavy for the materials we build them with to hold together.
    If we drive them faster than 400km/h they are too dangerous for the roads, and use too much petrol.
    But, if you use a car normally, they are fine. Just as gnutella is fine if you keep hosts and hops to a reasonable level, not the silly settings the author of the article assumed.

    1. Re:His paper is flawed and misleading by PigleT · · Score: 2

      Which bit of "defaults in our gnutella client" did you not understand?

      If you're going to criticise the article, then you want to ask *where* this 8Gb/s is going to be - it's all very well summing up the total traffic that *might* be generated, but it's not as though I'm going to have to download all 8Gb down my 56K modem line in order to get every result matching "grateful dead live", is it? Not only am I only going to restrict myself to a couple of minutes' searching, but the traffic itself is distributed all over the 'Net so no one link has to experience the disruption.
      Without analysis of the size of the network as a denominator by which to divide the above, claiming "it won't scale" is utter tripe, and I'd expect better from any article claiming to be "mathematical". Duh.

      --
      ~Tim
      --
      .|` Clouds cross the black moonlight,
      Rushing on down to the circle of the turn
  88. Re:The Logarithmic value of the messages exchanged by Metrol · · Score: 3, Interesting

    Not if you couldn't predict which machines on the net would act as those supernodes. If, like another poster mentioned, machines that met a certain criteria (bandwidth, storage, time on line, whatever) simply won an election to act as a node, there's no single point to shut down. Shut down one supernode, the others are informed that a replacement is needed and another election is called for.

    Following an election, the supernodes update the clients as to the lookup machines. I suppose you could even have it where if all the supernodes were shut down that an entirely new election process takes place creating a new set of supernodes. Kind of like having a DNS server setup where any machine can act as one of the root servers based on a criteria based election by those machines doing a lookup.

    Way too much for my wee brain to work out all the details on. Sounds good in theory anyway :)

    --
    The line must be drawn here. This far. No further.
  89. Too late... by vrt3 · · Score: 3, Informative
    From http://www.xolox.nl:
    Dear XoloX-user, Taking into account the latest law suits against p2p clients based on Fasttrack-technology (such as Kazaa), we have decided to discontinue XoloX. As of the 1st of december, XoloX will be shut down and removed from distribution sites. We hope everybody has enjoyed XoloX as long as it has been around and we want to use this opportunity to thank everybody who made a contribution to its development. These last few days will give you some time to finish your downloads and we advise you not to start new transfers. Thanks again and goodbye! --Team XoloX--
    --
    This sig under construction. Please check back later.
    1. Re:Too late... by vrt3 · · Score: 1

      I was too fast... on Zeropaid's site you can still download a (patched) working version. That'll teach me.

      --
      This sig under construction. Please check back later.
    2. Re:Too late... by Jagasian · · Score: 2

      Yeah, thats why I posted the Zeropaid link. They have a cracked version that doesn't autoshutdown. Hopefully we start seeing Xolox-like functionality in more GPL GNUtella clients. The network is only great because it is open... and widely adopted... thats what gives it an edge over the other guys.

  90. why, why, why? by cascadingstylesheet · · Score: 1

    I must have my filter set too high.

    Why do I never see anyone say the obvious? Right or wrong, I feel less liable if I only dl, not ul. Yeah, I doubt anyone would go after me for sharing a few songs on gnutella. But I really doubt they'll go after me for grabbing a few songs from gnutella.

    I'd be glad to share all day long. Sorry, just can't take the risk. I don't want to have to explain to my wife that we lost our broadband, or got fined, or are getting sued or something because I wanted to play nice on gnutella.

  91. Will you pay attention at the SOURCE? by JCCyC · · Score: 2

    a paper written by one of Napster's founding engineers

    Just when they lauch their pay service. No, I assure you his/her analysis is totally and utterly impartial. Excuse me while I ask Bill Gates about the scalability of the Linux kernel.

    1. Re:Will you pay attention at the SOURCE? by jordan · · Score: 1

      I wrote the paper a year ago, moron.

      --jordan

    2. Re:Will you pay attention at the SOURCE? by JCCyC · · Score: 1

      Oops. The date wasn't totally explicit, and the math went way over my head. I'll apologize if you apologize for the "moron".

    3. Re:Will you pay attention at the SOURCE? by maxpublic · · Score: 1

      And in a year of explosive Gnutella-network growth you *still* think the system isn't capable of scaling? If so, there's definitely a moron in the conversation and it isn't the other guy.

      Max

      --
      My god carries a hammer. Your god died nailed to a tree. Any questions?
  92. Re:The Logarithmic value of the messages exchanged by chrohrs · · Score: 2, Informative
    LimeWire has attacked this problem by introducing "ultrapeers", which offload most of the bandwidth to a small subset of hosts. It works really well. Unlike FastTrack, this is an open-protocol with an open-source implementation available.

    The next step is to add more sophisticated routing protocols between ultrapeers. Many of the algorithms mentioned elsewhere in this post (Chord, CAN, etc.) are contenders for that, as is LimeWire's home-grown query-routing proposal.

    Christopher Rohrs
    LimeWire

  93. OT - Internet has limits? by jeff13 · · Score: 1

    If there is one thing this reminds me of, is that the limits of transmitting "bits" is a real thing. Why we are trying to broadcast bits and bytes is beyond me, I see no evidence of this ever becoming a high quality, practical medium. Isn't the Internet really a lot of hype? Other than transmitting basic text around it is not much good for anything heavier. IMHO.

    Don't start in with 3D online games. My point is, after my 10 years of TCP/IP, Web, Streaming, Gaming, etc. the Internet and Computers in general still look to me like the most clumsy, expensive, wasteful, way to transmit information ever created. And it all looks like shit still. Hey, Maybe you like watching movies on a 14" screen with jumpy pixilated images, I don't. The real tragedy here is that you spent far more money on it, even if you got it from some warez spot online.

    The world has been taken over by aliens who look just like little TVs with keyboards. Their mission, to waste time, present images so poorly eyesight worldwide will be strained, destroy as much information as possible and when that fails - change it, pollute by dumping masses of oil based plastic, bubble the economy to outrageous proportions - causing world monetary chaos, then control information by offering online shopping.

    Mission successful!

  94. Mojo Nation by Jonboy+X · · Score: 1

    A while back someone told me about Mojo Nation and it seemed pretty nifty. The idea is that users get "mojo" for contributing to the network as a whole, which they cam then "spend" on services that other users provide. It's trying to solve the problem of "freeloaders" on networks such as Gnutella who use lots of bandwidth with searches but basically bog the network down. Capitalism meets P2P. It's kinda-sorta decentralized in that files that are shared are broken down into little (redundant) bits that are stored for retrieval on lots of hosts. Again, neat idea, but the problem I had with it was that it cost mojo to share files! It costs mojo to do anything that uses bandwidth, basically, so users are implicitly discouraged from sharing. Oy vey.

    --

    "In a 32-bit world, you're a 2-bit user. You've got your own newsgroup, alt.total.loser." -Weird Al
  95. Just be glad by Mr.+Fred+Smoothie · · Score: 3, Funny

    Taco's not a patent examiner.

    --

  96. Mathematical Analysis of Guatemala by qwerpoiu · · Score: 1

    Population: 12,974,361 (July 2001 est.)
    Age structure: 0-14 years: 42.11% (male 2,789,189; female 2,674,747)
    15-64 years: 54.25% (male 3,518,209; female 3,519,851)
    65 years and over: 3.64% (male 220,640; female 251,725) (2001 est.)
    Population growth rate: 2.6% (2001 est.)
    Birth rate: 34.61 births/1,000 population (2001 est.)
    Death rate: 6.79 deaths/1,000 population (2001 est.)
    Net migration rate: -1.84 migrant(s)/1,000 population (2001 est.)
    Sex ratio: at birth: 1.05 male(s)/female
    under 15 years: 1.04 male(s)/female
    15-64 years: 1 male(s)/female
    65 years and over: 0.88 male(s)/female
    total population: 1.01 male(s)/female (2001 est.)
    Infant mortality rate: 45.79 deaths/1,000 live births (2001 est.)
    Life expectancy at birth: total population: 66.51 years
    male: 63.85 years
    female: 69.31 years (2001 est.)
    Total fertility rate: 4.58 children born/woman (2001 est.)
    HIV/AIDS - adult prevalence rate: 1.38% (1999 est.)
    HIV/AIDS - people living with HIV/AIDS: 73,000 (1999 est.)
    HIV/AIDS - deaths: 3,600 (1999 est.)
    Nationality: noun: Guatemalan(s)
    adjective: Guatemalan
    Ethnic groups: Mestizo (mixed Amerindian-Spanish or assimilated Amerindian - in local Spanish called Ladino), approximately 55%, Amerindian or predominantly Amerindian, approximately 43%, whites and others 2%
    Religions: Roman Catholic, Protestant, indigenous Mayan beliefs
    Languages: Spanish 60%, Amerindian languages 40% (more than 20 Amerindian languages, including Quiche, Cakchiquel, Kekchi, Mam, Garifuna, and Xinca)
    Literacy: definition: age 15 and over can read and write
    total population: 63.6%
    male: 68.7%
    female: 58.5% (2000 est.)

    Source: CIA

    1. Re:Mathematical Analysis of Guatemala by John+Sullivan · · Score: 1
      Languages: Spanish 60%, Amerindian languages 40% (more than 20 Amerindian languages, including Quiche, Cakchiquel, Kekchi, Mam, Garifuna, and Xinca)

      Real Men don't speak Quiche!

      --
      This is my World Wide Web of Whatever
  97. Re:The Logarithmic value of the messages exchanged by carm$y$ · · Score: 2, Insightful

    The only solution is to structure the network by using "super clients" or "servants" or "super nodes"[...]

    But won't this "super singularties" become, on the long run, bottlenecks themselves, prone to abuse, DoS etc., plus the logical target for the "other side" that wants this kind of p2p to be buried and forgoten?

    One of the strenghts of the p2p model is that is hard to pursue 1000's of (arguably) minor copyright infingements as opposed to charge one entity (Napster?) with all of them...

    --
    -- No sig today
  98. A better solution by TheSHAD0W · · Score: 2

    A solution has been proposed that is a hybrid of Napster and Gnutella; basically, it is a Gnutella-like network of volunteer-run Open-Nap-type servers. Most users would run as an end node, which would merely query, upload and download; a few, however, would run index servers, where all the searching would take place. End nodes would run as a client/server relationship to one (or more) of the indexing nodes, each of which would network with a few other indexing nodes. The result would be a file-sharing network nearly as efficient as Napster with the robustness of Gnutella.

  99. Point Missed by streetcat · · Score: 1

    It seems the author has missed the uniqueness of the gnutella protocol when comparing. The readings show nothing but the obvious, while the potential for gnutella as such is huge when these disadvantages are removed. It is as simple as this. A person can choose the city he wants to live in. Once chosen, he can visit the grocery store in his locality. e.g If he stays in Chicago, he doesn't need to go all the way to San Francisco to buy groceries. Similarly simple solution for gnutella: Organize the gnutella network.