Slashdot Mirror


Alexa, Amazon's Most Flawed Idea

Rub3X writes "The Alexa ranking system is naturally flawed. The data should never be treated as accurate, as it's easily manipulated, and not supported for most browsers in the world. It's an estimate, and nothing more. " I've been saying that forever, but unfortunately for me, since it's a number on a website that is considered "Real" to some, I'm supposed to take it seriously. I imagine this is a problem for many webmasters out there.

26 of 113 comments (clear)

  1. File Upload Sites & their ranking by in2mind · · Score: 4, Informative

    Services like Megaupload.com force Non-American/Non-european users to install Alexa toolbar to download the file.

    That explains why Alexa has file-upload sites such as Megaupload,rapidshare in the top 10 sites of most countries...

  2. Error in article by tont0r · · Score: 3, Insightful

    According to the article:
    "Alexa has no support for FireFox, Opera or Safari at all. "

    According to Alexa's Wiki:
    "Users running any browser except Internet Explorer and Mozilla Firefox are not represented. Thus users of Opera, Safari, mobile phone (WAP) browsers are all ignored. Nevertheless, this is still the vast majority of the browser market."

    So its half right :P

    1. Re:Error in article by hclyff · · Score: 3, Informative

      According to this, there is no FF version. There are third party plugins for monitoring only.

    2. Re:Error in article by tclark · · Score: 2, Interesting

      I don't see anywhere where you can actually download a toolbar for Firefox - not that I would if they had one.

      That's the other problem with Alexa. It doesn't include clueful users. Of course, they aren't statistically significant.

  3. The data shows there are problems by technoextreme · · Score: 4, Interesting

    I've pointed this out before. There are weird statistical anomolies that should show that Alexa's webratings are not perfect. Take a look at this data for Slashdot and Digg. The traffic ratings both shoot up withing a s short amount of time. It just doesn't make much sense. http://www.alexa.com/data/details/traffic_details? &range=2y&size=medium&compare_sites=www.digg.com&y =r&url=www.slashdot.org#top

    --
    Ooo man the floppy drive is broken. No wait. The computer is just upside down.
    1. Re:The data shows there are problems by Anonymous Coward · · Score: 2, Insightful

      I wonder would the obvious spike in users at digg & /. be due to the introuction of an alexa plugin for mozilla firefox at that time (May 2006)?

      www.stevecastle.org

      Just askin'...

    2. Re:The data shows there are problems by jamie · · Score: 5, Interesting

      If so, it kind of makes the case that Alexa data is less than useful.

      But that's not all that's going on. In Nov-Dec 2005 it shows Slashdot's traffic roughly tripling, then settling down to roughly double its previous level, in the space of about a month. I have our traffic logs from that time. They were basically flat. All of the variance was Alexa anomalies.

    3. Re:The data shows there are problems by stevesliva · · Score: 2, Insightful
      All of the variance was Alexa anomalies.
      In the past three years, the one Slashdot article with Alexa in the title was in December 2005. No doubt a few slashdotters took a quick look at the toolbar, and the quickly decided it was worthless.
      --
      Who do you get to be an expert to tell you something's not obvious? The least insightful person you can find? -J Roberts
    4. Re:The data shows there are problems by badasscat · · Score: 5, Insightful

      f so, it kind of makes the case that Alexa data is less than useful.

      It's not "less than useful".

      In fact, this is both a completely obvious and a completely stupid article submission. The "duh" tag is appropriate, both because none of the current ranking/statistics systems are accurate, and because despite that, they are still useful.

      When you're looking at numbers like total reach, or you're comparing one web site with another, nobody needs statistics that are 100% accurate. I don't need to know if CNN has 4 million unique visitors per day or 4,409,765 unique visitors per day. You're using these services to get a general idea. If I'm running a web site, for example, I know what my own stats are - I don't need Alexa to tell me. But I can still use Alexa to tell me the basic gist of a competitor, and if they're not as accurate as internal stats would be, what does that matter?

      Moreover, Alexa's stats are no more or less accurate (or easy to manipulate) than those of major organizations like Nielsen. The fact of the matter is any system that's not using actual server logs is going to have some inaccuracies (and if you think otherwise, then you've just bought into marketing spin). You live with it and accept it. The main difference is that Alexa is free, whereas other stat compilers charge thousands of dollars per year.

    5. Re:The data shows there are problems by dumbfounder · · Score: 2, Insightful

      I think they changed to a different statistical model at that point, there are a ton of sites that make a jump on that same date. It is a good thing that they continually refine what they have (because it is FAR FAR FAR FAR from perfect) but they should have a little asterisk there letting people know what happened.

    6. Re:The data shows there are problems by Anonymous Coward · · Score: 3, Interesting

      I remember about 5 years ago we were making a new website and it wasn't launched yet, but it was publicly visible. About 5 of us had the Alexa toolbar installed and we were spending a bit of time on the site checking it and making sure links worked etc. A couple of weeks in to it - before we had even launched the site we had an Alexa ranking of around 60,000.

      Honestly Alexa must know it's a big joke. It still annoying to see it given any relevance.

  4. Duh by UbuntuDupe · · Score: 3, Interesting

    I remember for a while LewRockwell.com, which promoted alexa for its readers, was top-500, beating out worldnetdaily.com and gamefaqs.com. Now, nothing against LewRockwell.com, and it is indeed surprisingly popular, but there's no way in hell it's a top 500 site.

  5. The people that matter by truthsearch · · Score: 4, Insightful

    Everyone who owns or develops web sites knows this. Anyone who hints in a forum the numbers may be accurate immediately gets slapped down. It's the non-technical advertisers who don't know this. And they're the only ones who care about this ranking in order to gauge how much to spend on purchasing web site advertising. Since almost no web sites publicly display traffic info advertisers find Alexa rankings very convenient and probably just don't understand why they'd be useless.

    Until advertisers "get it" or a much more accurate public metric is made available, Alexa rankings will unfortunately matter to web sites that are supported by advertising.

  6. Yeah and MOST for slashdot is not IE by technoextreme · · Score: 3, Insightful

    The problem is that statisically it's nice to say that 30% does not make a majority but Im sure that spreads changes from website to website. Imagine looking at the statistics for a Linux website. The majority there better not be IE.

    --
    Ooo man the floppy drive is broken. No wait. The computer is just upside down.
  7. Useless or Used Wrong? by logicnazi · · Score: 3, Insightful

    Now it's clear that the rankings from this system are heavily skewed and misses a substantial portion of the user base.

    This suggests it is useless as a way to estimate how much to pay for advertising on a web site (though since this is usually per click/per display I don't see why ranking matters here). However, it doesn't show that this data can't be usefull for other things. For instance it could be quite usefull to know what other sites the users (or IE users) of a site visit.

    In other words the data seems useless for any statistical analysis but it could be quite helpful to know what sorts of users visit a site. Sure slashdot's traffic might be underrepresented but I bet you the data still show that slashdot users are quite likely to go browse gadget purchase sites or programming related sites. If you want to know where to advertise your new fancy gadget or a fancy new programming enviornment that would be very usefull information even if it wouldn't support a rigorous statistical analysis.

    --

    If you liked this thought maybe you would find my blog nice too:

  8. BZZT. by Rob+T+Firefly · · Score: 5, Insightful

    One fact TFA and the Slashdot title both got wrong, is Alexa wasn't Amazon's idea. Until Amazon bought it in 1999, Alexa was the commercial offshoot of archive.org for three years. Alexa is still what gives the Wayback Machine its web crawls.

  9. Re:But is supported for the #1 browser by daeg · · Score: 4, Insightful

    It doesn't matter, though, since the distribution of toolbars is not uniform across all Internet users. A good example is the website I work on. We know our traffic, yet Alexa under-reports us. We also know a local competitor's traffic -- both sets of numbers are generally public information that advertisers use. They have a nice site but get about 1/2 of our traffic, yet Alexa over-reports them over us by a factor of 3-4.

    You can pull accurate statistics if and only if your data points are distributed correctly. Because Alexa has no way to randomly and accurately assign toolbars to users, their data is not reliable in any form.

    A similar example is how political polls are taken. You can get accurate numbers with 1,000 adults if, and only if, those 1,000 are random throughout the entire population. You can skew the poll numbers by polling 1,000 Democrats or Republicans only instead of 1,000 random. Your results are only accurate to your surveyed population -- in Alexa's case, their numbers are only accurate so far as "Rank ### amongst Internet Explorer 6.0 users who speak a limited number of languages who have voluntarily installed our toolbar to submit their surfing habits to us for analysis and are subjected to trade secret methods of ranking".

    The only way that you could pull accurate numbers would be through all ISPs selecting random data points to find what hostnames people were using. It would have to be filtered, though, to produce accurate numbers in terms of actual "website hits" instead of just "website requests". Keep-alive would further impede accurate results. As would proxies, DNS caches, and HOSTS files.

  10. Real? by Poromenos1 · · Score: 2, Funny

    it's a number on a website that is considered "Real" to some

    That's not real, that's int.

    --
    Send email from the afterlife! Write your e-will at Dead Man's Switch.
  11. don't see the point by jasen666 · · Score: 2, Insightful

    Alexa is flawed from the start.
    What impetus or benefit would a user have to install a toolbar that tracks them? Other than out of charity to help out this company? I don't get it. Nor do I particularly trust them. Just one more thing to help crash IE.

  12. WTF is Alexa? by Yvan256 · · Score: 2, Insightful

    Seriously, why bother writing two or three sentences anymore? Just put a single link on a single word, that's even less helpful and even less work for the editors.

    WTF IS ALEXA?

    Another case of "I don't want to waste 30 seconds to explain WTF the news is about, let 50K users waste a few minutes and slashdot a website trying to figure out what it is".

  13. Polling data in general by MrNougat · · Score: 2, Informative

    Whenever you conduct a poll, and that's what Alexa is doing, you are always excluding data from those who do not respond to polls (for whatever reason). It's an inherent flaw in polling.

    --
    Web 2.0 == Giant Blogspam Circle Jerk
  14. Wikipedia editors constantly need to be smacked by Anonymous Coward · · Score: 4, Interesting

    Wikipedia constantly uses Alexa to see if linking to a website or profileing a website is "notable". Despite outrage by the people who submitted the content, usually everything that gets nominated for deletion has some editor cite alexa as a reason to delete it.

  15. High end advertising by alexhmit01 · · Score: 2, Interesting

    One of the things that most Internet marketers miss (myself included at times) is that as you move up the food chain, there are more buyers. If I am pushing a brand-less product, I focus on my CPC rate, conversion rate, etc., and don't care where the ad runs, only the conversion rate there and what I am paying per click.

    However, the big boys (Ford Motor Company, Warner Bros. Television, etc.) focus on brand building and budgets. They don't ask if they are making money off the impressions, they have a quarterly budget, spend the budget, and ideally aim for the biggest bang-for-the-buck. However, they don't want their brand associated with marginal content, so they have to approve the site carrying their ad.

    Now, they pay dearly for the privilege, but for a content site, a big-name ad agency placing a big brand may be willing to pay substantially more on a CPM basis because they need to cut a deal, take up time, etc. If I am making $5 CPM, I'm not going to spend hours with their legal department, etc., to make $5.05 CPM. Now for $6 or $7 CPM, or even $10, I'll spend time handling them as a client, not running an ad network.

    The flip side to this, the ad agency doesn't want to spend hours time from highly paid professionals for a site with 1000-2000 visits/day. They need a bang for the buck, and if their time is going to add $5-$10 on a CPM basis for compliance, then the advertising becomes less viable.

    So, when you hit thresholds on major rankings, whether it be Alexa, Nielson, etc., then some of the ad players notice you. So the rankings matter, because maybe hitting top-2000 means you can sign on a few premier accounts. While the traffic jump from spot 2001 to 2000 may not be huge, you may see a HUGE revenue jump if you can sell the inventory.

    That's why the things matter, as you move up, companies that won't deal with smaller players because transaction costs are too high show up... More demand for your ad inventory, holding supply constant, means higher prices. Even as you move higher, the supply increases (which would force you to take lower prices to move it), more buyers come on the scene, which usually means a still higher CPM.

    The rules change as you move up the food chain, which is why the people that define the food chain matter.

    Alex

  16. The problem: Lack of trust by Keybounce · · Score: 2, Interesting

    When Alexa first came out, I was willing to use it. There were two features that it provided, and page ranking was actually the least important. Far more important to me was the goal of building an inverted index of the web -- tracking who linked to this site I was looking at, rather than seeing who this site links to.

    All that changed when Alexa was bought by Amazon. And then the truth came out -- all the information that I thought was private was in the database, and now owned by a commercial company, with no restriction on how they used that information. All the information about me that came to the right of the question mark was now in a commercial database, just as bad as AOL's release of search engine queries.

    That gave a 100% loss of trust for me. And not just me.

    People who know what's going on won't install Alexa because it's giving unrestricted access to personal information to a commercial company for their own profit. And, the "backwards index" -- which helps the internet navigation globally -- is no longer the focus of the product.

    So for most people, it has lost any purpose and functionality.

    This is why it is so fundamentally off on any numbers it generates. Heck, Neilson ratings have to be more accurate :-).

  17. Re:masked domains by Rob+T+Firefly · · Score: 2, Insightful
    Unless the toolbar goes off of what you type into the address bar rater than the urls that are actually loaded, would I would consider more likely. Do they say how they do it?
    I don't know, but off the top of my head I doubt page-ranking services would count other sites loaded in an IFrame. Otherwise I could create one of those useless domain-squatting pages that just exist to throw ads at people who click or type wrongly, load a bunch of actually useful, respected sites in IFrames, and use all that content to boost my Alexa rank/Google rank whatever else.
  18. Google or Yahoo could create a better web ranking by sien · · Score: 2, Interesting
    The real reason for TV stats is for advertisers and TV stations to work out how much they can sell advertising for.

    What is the reason for web stats? If you're paying per view or per click then the information is directly available.

    This leads to an interesting possibility. The ad providers could provide a ranking of sites based on the number of adds that they show there and the number of clicks that are created. This is, of course, open to manipulation via click fraud and other techniques but it would probably be more accurate than Alexa's rankings.

    Then, if you wanted to improve this even more you could combine this with the number of searches that go to a page. A large net firm that provided these services could do such a ranking. Google or Yahoo could do this. Perhaps they do, for their internal consumption.