A Statistical Review of 1 Billion Web Pages

← Back to Stories (view on slashdot.org)

A Statistical Review of 1 Billion Web Pages

Posted by ScuttleMonkey on Wednesday January 25, 2006 @08:41AM from the demanding-a-recount dept.

chrisd writes "As part of a recent examination of the most popular html authoring techniques, my colleague Ian Hickson parsed through a billion web pages from the Google repository to find out what are the most popular class names, elements, attributes, and related metadata. We decided that to publish this would be of significant utility to developers. It's also a fascinating look into how people create web pages. For instance one thing that surprised me was that the <title> is more popular than <br>. The graphs in the report require a browser with SVG and CSS support (like Firefox 1.5!). Enjoy!"

9 of 294 comments (clear)

is more popular than by InsideTheAsylum · 2006-01-25 08:46 · Score: 5, Funny

well when people talk like this and dont bother using punctuation spacekeys or any of the skills that they have been taught in school its no wonder why webpages turn out like this not to mention those long runon sentences and also all that broken code that are the fist attempt at a webpage by a twelve year old kid who tried to steal someone elses layout and replaced the word with his own then you start to look at all of those dynamically generated webpages and the layouts and the style sheets and its no wonder why the good old br tag never get a work out.
1. Re: is more popular than by aussie_a · 2006-01-25 08:54 · Score: 5, Funny
  
  Never been scared your girlfriend was pregnant? Oh wait, this is slashdot. Nevermind.
2. Re: is more popular than by Anonymous Coward · 2006-01-25 10:42 · Score: 5, Funny
  
  Women and Compilers... miss a period and they go wild.
Finally... by RandoX · 2006-01-25 08:46 · Score: 5, Funny

An un-slashdottable server.
BR tag? by p0 · 2006-01-25 08:46 · Score: 5, Insightful

With css power you really do not need to use br, maybe that is the reason for the small stats for the tag's use?

--
This is my sig. There are thousands more, but this one is mine.
Not complete by Anonymous Coward · 2006-01-25 08:47 · Score: 5, Funny

It didn't have everything of course. Some elements were censored on behalf of the Chinese government.
Re:what's the point of a 1 billion page sample? by Anonymous Coward · 2006-01-25 08:53 · Score: 5, Informative

You get a decrease of the variance of the mean.
Re:what's the point of a 1 billion page sample? by Durinthal · 2006-01-25 08:55 · Score: 5, Insightful

If you can have a larger sample, why not use it? It's more accurate that way.
Re:\. shows up in the Web Authoring Statistics by shrikel · 2006-01-25 08:59 · Score: 5, Funny

You're confused. Backslashdot is across the street.
(sheesh)

--
Any sufficiently simple magic can be passed off as mere advanced technology.