Alexa, Amazon's Most Flawed Idea
Rub3X writes "The Alexa ranking system is naturally flawed. The data should never be treated as accurate, as it's easily manipulated, and not supported for most browsers in the world. It's an estimate, and nothing more.
" I've been saying that forever, but unfortunately for me, since it's a number on a website that is considered "Real" to some, I'm supposed to take it seriously. I imagine this is a problem for many webmasters out there.
Services like Megaupload.com force Non-American/Non-european users to install Alexa toolbar to download the file.
That explains why Alexa has file-upload sites such as Megaupload,rapidshare in the top 10 sites of most countries...
Wincopy
According to the article:
:P
"Alexa has no support for FireFox, Opera or Safari at all. "
According to Alexa's Wiki:
"Users running any browser except Internet Explorer and Mozilla Firefox are not represented. Thus users of Opera, Safari, mobile phone (WAP) browsers are all ignored. Nevertheless, this is still the vast majority of the browser market."
So its half right
I've pointed this out before. There are weird statistical anomolies that should show that Alexa's webratings are not perfect. Take a look at this data for Slashdot and Digg. The traffic ratings both shoot up withing a s short amount of time. It just doesn't make much sense. http://www.alexa.com/data/details/traffic_details? &range=2y&size=medium&compare_sites=www.digg.com&y =r&url=www.slashdot.org#top
Ooo man the floppy drive is broken. No wait. The computer is just upside down.
I remember for a while LewRockwell.com, which promoted alexa for its readers, was top-500, beating out worldnetdaily.com and gamefaqs.com. Now, nothing against LewRockwell.com, and it is indeed surprisingly popular, but there's no way in hell it's a top 500 site.
Apology to Ubuntu forum.
Everyone who owns or develops web sites knows this. Anyone who hints in a forum the numbers may be accurate immediately gets slapped down. It's the non-technical advertisers who don't know this. And they're the only ones who care about this ranking in order to gauge how much to spend on purchasing web site advertising. Since almost no web sites publicly display traffic info advertisers find Alexa rankings very convenient and probably just don't understand why they'd be useless.
Until advertisers "get it" or a much more accurate public metric is made available, Alexa rankings will unfortunately matter to web sites that are supported by advertising.
Developers: We can use your help.
The problem is that statisically it's nice to say that 30% does not make a majority but Im sure that spreads changes from website to website. Imagine looking at the statistics for a Linux website. The majority there better not be IE.
Ooo man the floppy drive is broken. No wait. The computer is just upside down.
Now it's clear that the rankings from this system are heavily skewed and misses a substantial portion of the user base.
This suggests it is useless as a way to estimate how much to pay for advertising on a web site (though since this is usually per click/per display I don't see why ranking matters here). However, it doesn't show that this data can't be usefull for other things. For instance it could be quite usefull to know what other sites the users (or IE users) of a site visit.
In other words the data seems useless for any statistical analysis but it could be quite helpful to know what sorts of users visit a site. Sure slashdot's traffic might be underrepresented but I bet you the data still show that slashdot users are quite likely to go browse gadget purchase sites or programming related sites. If you want to know where to advertise your new fancy gadget or a fancy new programming enviornment that would be very usefull information even if it wouldn't support a rigorous statistical analysis.
If you liked this thought maybe you would find my blog nice too:
One fact TFA and the Slashdot title both got wrong, is Alexa wasn't Amazon's idea. Until Amazon bought it in 1999, Alexa was the commercial offshoot of archive.org for three years. Alexa is still what gives the Wayback Machine its web crawls.
Slashdot Burying Stories About Slashdot Media Owned
It doesn't matter, though, since the distribution of toolbars is not uniform across all Internet users. A good example is the website I work on. We know our traffic, yet Alexa under-reports us. We also know a local competitor's traffic -- both sets of numbers are generally public information that advertisers use. They have a nice site but get about 1/2 of our traffic, yet Alexa over-reports them over us by a factor of 3-4.
You can pull accurate statistics if and only if your data points are distributed correctly. Because Alexa has no way to randomly and accurately assign toolbars to users, their data is not reliable in any form.
A similar example is how political polls are taken. You can get accurate numbers with 1,000 adults if, and only if, those 1,000 are random throughout the entire population. You can skew the poll numbers by polling 1,000 Democrats or Republicans only instead of 1,000 random. Your results are only accurate to your surveyed population -- in Alexa's case, their numbers are only accurate so far as "Rank ### amongst Internet Explorer 6.0 users who speak a limited number of languages who have voluntarily installed our toolbar to submit their surfing habits to us for analysis and are subjected to trade secret methods of ranking".
The only way that you could pull accurate numbers would be through all ISPs selecting random data points to find what hostnames people were using. It would have to be filtered, though, to produce accurate numbers in terms of actual "website hits" instead of just "website requests". Keep-alive would further impede accurate results. As would proxies, DNS caches, and HOSTS files.
Wikipedia constantly uses Alexa to see if linking to a website or profileing a website is "notable". Despite outrage by the people who submitted the content, usually everything that gets nominated for deletion has some editor cite alexa as a reason to delete it.