Slashdot Mirror


A Statistical Review of 1 Billion Web Pages

chrisd writes "As part of a recent examination of the most popular HTML authoring techniques, my colleague Ian Hickson parsed a billion web pages from the Google repository to find the most popular class names, elements, attributes, and related metadata. We decided that publishing this would be of significant utility to developers. It's also a fascinating look into how people create web pages. For instance, one thing that surprised me was that <title> is more popular than <br>. The graphs in the report require a browser with SVG and CSS support (like Firefox 1.5!). Enjoy!"
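
For anyone curious what this kind of tally looks like in miniature, here is a rough sketch using only Python's standard library. It is not the method used for the report, which the summary does not describe. The names PageScanner and tally and the three sample pages are made up for illustration. It counts how many pages use each element and each class name at least once, which is one plausible reading of "popular".

```python
from collections import Counter
from html.parser import HTMLParser


class PageScanner(HTMLParser):
    """Collects the distinct element and class names seen in one page."""

    def __init__(self):
        super().__init__()
        self.elements = set()
        self.classes = set()

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        self.elements.add(tag)
        for name, value in attrs:
            if name == "class" and value:
                self.classes.update(value.split())


def tally(pages):
    """Return (element -> page count, class -> page count) for the given pages."""
    element_pages = Counter()
    class_pages = Counter()
    for html in pages:
        scanner = PageScanner()
        scanner.feed(html)
        scanner.close()
        element_pages.update(scanner.elements)
        class_pages.update(scanner.classes)
    return element_pages, class_pages


# Hypothetical sample pages, purely for illustration.
pages = [
    '<title>a</title><p class="x y">hi<br>there</p>',
    '<title>b</title><P CLASS="x">yo</P>',
    '<title>c</title><br/>',
]
element_pages, class_pages = tally(pages)
print(element_pages.most_common())
print(class_pages.most_common())
```

Counting pages rather than occurrences is a design choice here: a single page with hundreds of <br> tags would otherwise swamp everything else.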

21 of 294 comments

  1. I clicked I'm Feeling Lucky on this article by dada21 · · Score: 1, Funny

    and all I got was Britney Spears.

    Sheesh.

  2. We've come a long way by suso · · Score: 3, Funny

    if the tag isn't on the top elements list.

  3. Blink by suso · · Score: 4, Funny

    the <blink> tag.

    1. Re:Blink by mysqlrocks · · Score: 3, Funny

      the <blink> tag.

      I must have blinked, I didn't see it the first time.

    2. Re:Blink by ReverendLoki · · Score: 4, Funny
      Still, the only good use I ever saw for that tag was the line:

      Schrodinger's cat is <blink>not</blink> dead.

      Every other usage just caused me to browse elsewhere.

      --
      09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0
    3. Re:Blink by Repton · · Score: 2, Funny

      All you need to do is blink at the right frequency and you'll never see it at all!

      --
      Repton.
      They say that only an experienced wizard can do the tengu shuffle.
  4. <title> is more popular than <br> by InsideTheAsylum · · Score: 5, Funny

    well when people talk like this and dont bother using punctuation spacekeys or any of the skills that they have been taught in school its no wonder why webpages turn out like this not to mention those long runon sentences and also all that broken code that are the fist attempt at a webpage by a twelve year old kid who tried to steal someone elses layout and replaced the word with his own then you start to look at all of those dynamically generated webpages and the layouts and the style sheets and its no wonder why the good old br tag never get a work out.

    1. Re:<title> is more popular than <br> by Fr05t · · Score: 1, Funny

      "...out."

      Hooray! I've never been so happy to see a period!

    2. Re:<title> is more popular than <br> by aussie_a · · Score: 5, Funny

      Never been scared your girlfriend was pregnant? Oh wait, this is slashdot. Nevermind.

    3. Re:<title> is more popular than <br> by Anonymous Coward · · Score: 5, Funny

      Women and Compilers... miss a period and they go wild.

    4. Re:<title> is more popular than <br> by Anonymous Coward · · Score: 1, Funny

      Congratulations.
      Is it yours?

  5. Finally... by RandoX · · Score: 5, Funny

    An un-slashdottable server.

  6. Not complete by Anonymous Coward · · Score: 5, Funny

    It didn't have everything of course. Some elements were censored on behalf of the Chinese government.

  7. well this is new by Abstract_Me · · Score: 1, Funny

    we haven't slashdotted the google server... but it would appear that the firefox download site for extensions is.

  8. \. shows up in the Web Authoring Statistics by digitaldc · · Score: 4, Funny

    The 'br' element

    The br element is a simple one, yet used on so many pages that it is the 8th most-used element. It is used more than the p element.

    clear, style, class, soft, id, and \.


    Wow! I never knew you guys were that popular.

    --
    He who knows best knows how little he knows. - Thomas Jefferson
    1. Re:\. shows up in the Web Authoring Statistics by shrikel · · Score: 5, Funny
      You're confused. Backslashdot is across the street.

      (sheesh)

      --
      Any sufficiently simple magic can be passed off as mere advanced technology.
  9. Best bash I've seen in a long time: by Benanov · · Score: 4, Funny
    From TFA, the classes page:

    The rest of the top 20 classes are either presentational or otherwise meaningless (msonormal, for example, which is one of the classes that Microsoft Office uses in its "HTML" output).
  10. Re:No GOTOs? by the computer guy nex · · Score: 4, Funny

    How about:

    IF(Post=Old_And_Tired) GOTO Mod_Down

  11. Re:BR tag? by crumley · · Score: 2, Funny
    But don't we all use br's when we quote other people on slashdot?
    No.
    --
    Preventive War is like committing suicide for fear of death. - Otto Von Bismarck
  12. Re:One thing that screws up web page studies by Anonymous Coward · · Score: 1, Funny
    aaaarrrrggghhhhh!

    I just changed the results... now he has to redo all those pretty colors...

  13. Re:Worst use of SVG ever by jamesots · · Score: 2, Funny

    Yeah, and what's the point of using HTML? They could have posted an image of the text to the same effect.

    --
    Ho hum for the life of a bear