Slashdot Mirror


You Can Navigate Between Any Two Websites In 19 Clicks Or Fewer

An anonymous reader writes "A study done by a Hungarian physicist found that of the billions of websites and over a trillion objects on the web, any given two are separated by no more than 19 clicks. 'Distributed across the entire web, though, are a minority of pages—search engines, indexes and aggregators—that are very highly connected and can be used to move from area of the web to another. These nodes serve as the "Kevin Bacons" of the web, allowing users to navigate from most areas to most others in less than 19 clicks. Barabási credits this "small world" of the web to human nature—the fact that we tend to group into communities, whether in real life or the virtual world. The pages of the web aren't linked randomly, he says: They're organized in an interconnected hierarchy of organizational themes, including region, country and subject area. Interestingly, this means that no matter how large the web grows, the same interconnectedness will rule.'"

28 of 185 comments (clear)

  1. Like most overgeneralizations... by mark-t · · Score: 3, Insightful

    ... this one is quite obviously false. And illustrates one of the dangers about assuming that extrapolation is equivalent to actual supporting data.

    I mean, there are objects behind paywalls that, all by themselves, can be more than 19 clicks away from a highly unrelated web page elsewhere online There are objects which are online that have no external links to them at all. And those are just the obvious ones.

    It's an interesting notion, but it's incorrect.

    1. Re:Like most overgeneralizations... by lxs · · Score: 5, Funny

      The guy is a physicist so it is only true for spherical websites in a vacuum.

    2. Re:Like most overgeneralizations... by Sique · · Score: 5, Insightful

      If your pages are not connected via links to any extern sites, then by definitionem, they are not part of the World Wide Web.

      --
      .sig: Sique *sigh*
    3. Re:Like most overgeneralizations... by Sique · · Score: 2

      What ever it is, it surely is not part of a world wide web. It's an island all of its own.

      --
      .sig: Sique *sigh*
    4. Re:Like most overgeneralizations... by Hentes · · Score: 3, Insightful

      Many times it's not the author being overenthusiastic but the media reports.

    5. Re:Like most overgeneralizations... by wonkey_monkey · · Score: 2

      And what if two sites are connected only to each other? Or fifty?

      --
      systemd is Roko's Basilisk.
    6. Re:Like most overgeneralizations... by Sique · · Score: 3, Interesting
      No, the World Wide Web was the vision which would emerge out of lots of HTTP-Servers serving pages of HTML with links to each other. Tim Berners-Lee explicely stated such:

      Making a web is as simple as writing a few SGML files which point to your existing data. Making it public involves running the FTP or HTTP daemon, and making at least one link into your web from another.

      So yes, to be part of the World Wide Web, your site has to have at least one link from another site -- otherwise it's not part of the public World Wide Web. It's the same with the Internet. Of course you can create another network using IPv4 or IPv6 to connect the nodes to each other, but as long as there is no external link into it, it's just an intranet and not part of the Internet.

      --
      .sig: Sique *sigh*
    7. Re:Like most overgeneralizations... by tverbeek · · Score: 2

      Incorrect. The original specifications of the World Wide Web did not require that every page be linked from others, only that they could be linked from others. That's why every Web browser has had a field for manual input of URLs. Directories like YAHOO and search engines like Altavista, which linked the Web more pervasively, were important to the development of the Web, but every Internet-accessible HTTP server was still part of the Web even without them.

      You're probably thinking of the definition of the Internet, which does require every network that is a part of it to be connected to all of the others in some way. If your network is linked to other networks, but none of those networks are connected to the whole of the Internet (e.g. within a totalitarian state that's been unplugged by a dictator), then you might be on an internet, but you aren't on the Internet.

      --
      http://alternatives.rzero.com/
    8. Re:Like most overgeneralizations... by im3w1l · · Score: 2

      The rules state nothing about *outbound* links.

  2. So Which Web Site is the Kevin Bacon Equivalent? by Biff+Stu · · Score: 4, Interesting

    It's got to be a reasonably good, well-liked site, but not a mega-site like Google or Facebook.

    How about Salon.com or theonion.com?

    I would say /., but by its nature, /. has too many connections to be used for a Kevin Bacon number equivalent. Conversely, The Onion probably doesn't link to enough stuff.

    I vote for Salon.com

  3. 19 clicks? No way...unless... by Anubis+IV · · Score: 5, Informative

    Not a chance, unless you're counting the number of clicks it takes to turn on the on-screen keyboard and type enough characters into Google search for a reliable suggestion to show up. Up until two years ago when I left academia I was with an Internet research lab at a major university, and I saw diagrams of some of the graphs collected by decently large web crawls of the time. None of them would have been clustered enough to allow jumping between two arbitrary sites in 19 clicks or less for three primary reasons:
    1) Most links are unidirectional, not bidirectional (e.g. you might link to a news story, but the news site is unlikely to link back to you). As a result, it's rather difficult to reach sites on the fringe of the graph, since many of them have few or no links pointing to them.

    2) Domains (as in domains like medicine, technology, and automobiles, not domain name like google.com) tend to be segregated from one another and oftentimes have long chains before they reach more clustered/common parts of the Internet (e.g. if you start at a particular site for a niche topic, there may be only one other site pointing to it, and then only one pointing to that one, and so on for quite awhile).

    3) Many sites don't have any links to other sites. It's not as uncommon as you might think, and they'd all count as a dead end, which would obviously end your traversal if you were starting from that site.

    When I used to see those graphs, most of them would exhibit chains that would dangle off of the main cluster and would stretch out for dozens or hundreds of sites in length, meaning that if you started from one of those sites in the middle, you'd have to go half that distance in either direction before you'd make it back to the main cluster. Even with as far as we've come in recent years, I seriously doubt that all of those chains have been eliminated.

  4. What a waste of time by dcollins117 · · Score: 5, Insightful

    If you read the actual article you'll find that his findings "involved a simulated model of the web that he created to better understand its structure. " So this article has nothing to do with the actual internet, but a simulation of it. It's not a noteworthy study, and I'm wondering why I wasted my time reading about it.

    1. Re:What a waste of time by SeaFox · · Score: 2

      It's not a noteworthy study, and I'm wondering why I wasted my time reading about it.

      Because you didn't do enough surfing to find the superior time-wasting content that was only a click (or 18!) away?

  5. I don't think so by twocows · · Score: 3, Funny



    HAH!

    1. Re:I don't think so by BeardedChimp · · Score: 4, Insightful
    2. Re:I don't think so by Rockoon · · Score: 3, Insightful

      The web is all about bad practices.

      --
      "His name was James Damore."
  6. Re:Assuming they're linked at all by Sique · · Score: 4, Insightful

    Then maybe your blog is not part of the World Wide Web, it's just based on the same technologies and can be reached via the same means.

    --
    .sig: Sique *sigh*
  7. I can improve on that. by Arancaytar · · Score: 2

    *click on URL bar*

    *enter URL*

    *hit enter*

    There, one click.

  8. Very old news by Lusitania+River · · Score: 2

    How is this news? The author has a book called Linked (published in 2002 and actually a very good book) that already mentioned, in chapter 3, that the degree of separation is 19 (18.59 to be exact). It's interesting that it has not changed in 11 years but it's certainly not news !!!!

  9. Re:Wikipedia game by beowulfcluster · · Score: 2
    The all knowing Wikipedia states:

    As of 6 February 2013, the highest finite Bacon number reported by the Oracle of Bacon is 9.

  10. nope. can't be done by __aaltlg1547 · · Score: 2

    Because: 1. new sites are created every day initially having no links to them. 2. websites can be and are created that have no external links in them.

  11. No, you cannot by allo · · Score: 2

    There are pages with no outgoing links. Before anyone yells "thats not part of the web", there are ingoing links, so its linked to the web.

  12. Re:So Which Web Site is the Kevin Bacon Equivalent by Impy+the+Impiuos+Imp · · Score: 3, Funny

    I don't think Slashdot is the great hub you think it is. Every time I click a link here to somewhere else, the page never loads and is clearly broken.

    --
    (-1: Post disagrees with my already-settled worldview) is not a valid mod option.
  13. Because not a single site exists with no links by wisnoskij · · Score: 2

    Because no one has ever created a website with no external links.

    --
    Troll is not a replacement for I disagree.
  14. Re:Assuming they're linked at all by ranton · · Score: 2

    This thread is one gigantic breeding ground for the No True Scotsman fallacy.

    How is that? Sique was just responding that merely having an HTML page does not mean that the page is on the World Wide Web. To be on the web you need at least some level of interconnectivity with the rest of the web.

    There are plenty of other entities, such as corporate intranets, that use the same underlying technologies as the World Wide Web but would not be considered to be on the web. That is the reason we have different terms such as Internet, Intranet, World Wide Web, etc. They are not completely interchangeable.

    --
    -- All that is necessary for the triumph of evil is that good men do nothing. -- Edmund Burke
  15. Re:Who cares? by TheCarp · · Score: 2

    Sigh, no. Actually, this is a mis-statement of Kevin Bacon theory, first proposed when Stanley Milgrim devised an experiment where he wrote fan mail to Kevin Bacon and distributed it through a random process to several people throughout the country, asking them to attempt to deliver it by handing it to a personal friend and asking them to do the same. On average, Kevin recieved them all within 6 transfers.

    --
    "I opened my eyes, and everything went dark again"
  16. Re:You're wrong, in my opinion. by geminidomino · · Score: 2

    No, you're on the internet. Don't confuse that with the web, please. I know /. has gone downhill over the past decade or so, but it hasn't reached AOL-ness yet. Let's not help it along, shall we?

  17. Re:Assuming they're linked at all by K.+S.+Kyosuke · · Score: 2

    Sique was just responding that merely having an HTML page does not mean that the page is on the World Wide Web. To be on the web you need at least some level of interconnectivity with the rest of the web.

    Step 1. Build two web sites, neither of them linking to the outside world.

    Step 2. Have the web link to both of them (if they're notable, try mentioning them on Wikipedia, e.g.)

    Step 3. Starting on web site 1, try reaching web site 2.

    --
    Ezekiel 23:20