Slashdot Mirror


Why AltaVista Lost Ground To Google Sooner Than Expected

techtsp writes: Marcia J. Bates, UCLA Professor Emerita of Information Studies recently explained why Google's birth led to the downfall of AltaVista. According to Bates, early search engines including AltaVista adapted the classical IR methods. At the other hand, Google founders started off with a completely different approach in mind. Google successfully recognized the potential of URLs, which could be added to the algorithms for the sake of information indexing altogether. Google's modern age techniques were a huge boost to those older techniques. Whatever other business and company management issues AltaVista faced, it was the last of the old style information retrieval engines.

21 of 172 comments (clear)

  1. Link to quora post by Anonymous Coward · · Score: 3, Informative
    1. Re:Link to quora post by unrtst · · Score: 4, Informative

      I got no such thing from the quora page (https://www.quora.com/Why-did-Altavista-search-engine-lose-ground-so-quickly-to-Google/answer/Marcia-J-Bates)
      It doesn't even have a second page!

      The link from TFS is to http://www.pc-tablet.co.in/, which is just as long as the original quora page, but it's reworded to the point of being useless.

      For example, where the bastardized pc-tablet page says, "Google successfully recognized the potential of URLs", the original didn't use the word URL in the entire article. What they said was, "What the Google founders recognized about search on the Web was that information about LINKS could be added to the algorithms." See, that makes much more sense, and it's actually what google did! Link to your page from other pages significantly increased your page rank, even more so if those other pages were on topic. The URL of those other pages didn't matter, and they weren't extracting any additional information from the URL.

  2. Wow, way to fuck that up by Jack9 · · Score: 4, Insightful

    The article says "URLs" when the Quora post, cited as the source, says LINKS. Also the article is basically devoid of any information, other than "Google did better because it used LINKS to help determine ranking." Thanks for the headline, with a summary, linking to an article that misquotes the linked source, that has a healine worth of information. No really, thanks.

    --

    Often wrong but never in doubt.
    I am Jack9.
    Everyone knows me.
    1. Re:Wow, way to fuck that up by fermion · · Score: 5, Informative

      Altavista used keywords and the assumption that websites would be honest, because, what motivation would they have to not be honest. There was no real monitization on the web, and websites with bad reputations, websites that included keywords that were bogus, would simply fall off the web due to free market forces. However, about a year after Altavista was founded, 2o7 among other c tracking cookies began to monetize visits to web pages. Altavista, though a huge innovation over Yahoo, was still a simplistic model that really had no method to counteract the market forces that made keyword inflation profitable. Also, Altavista had no real way make money. Google was a hybrid of Altavista and 2o7 and had several advantages. First, because it used links and not keywords, it could actually use free market forces to evaluate the quality of the page. The assumption was that if a page were linked by a lot of other sites, then the page was useful and it could be ranked based on content. The second was that unlike 2o7, google actually provided a service to end users, so end users were in effect compensated for allowing tracking cookies on their computer. I myself had my browser set to reject all tracking cookies except for Google as I needed those cookies for other services. Third, the Google algorithm was quite sophisticated, so could be tweaked as the pure link based ranking failed due to link farms and the like. Now, honestly, in many cases the search results returned by google are no better than the search results returned by altavista at the turn of the century. What saves google is that it has funds and motivation to improve the results as the SEO people attempt to manipulate the rankings. I think google is looking at the secondary and tertiary levels of the links to determine ranking, which is helping a lot. Ultimately there is going to have to be some serious math done and graph theory developed to get the ranking back to the quality that allowed Google to pummel everyone else.

      --
      "She's a scientist and a lesbian. She's not going to let it slide." Orphan Black
    2. Re:Wow, way to fuck that up by Dynamoo · · Score: 4, Interesting
      I remember when AltaVista first came out.. it was a revelation. The result you wanted was normally on the first few pages. Don't laugh - that was a big frigging deal at the time. These days, if the result you want isn't number one then you assume something is borked.

      But it was quite easy to game the system. To begin with, if you wanted to be #1 for "SEX" you would just repeat the word "SEX" a lot of times. It was all done on in-page factors. Of course, AltaVista engineers eventually tried to counteract the spam (use a word too many times and it counts against you, for example), but the whole PageRank idea did lead to better results.

      I seem to remember that AltaVista was originally a project to show how powerful DEC's Alpha processors were. Instead, it opened up the idea that the whole web (or at least millions of pages) could be searchable on a full-text basis. That was pretty revolutionary at the time.

      --
      Never email donotemail@WeAreSpammers.com
  3. Grammar? by marovada · · Score: 4, Insightful

    I found the summary of this article very confusing. Phrases such as "At the other hand" and "indexing altogether"?? Oh, and call me ignorant for not understanding what "IR" means. Infrared? Then I read the article and found that the summary is just a badly strung together quotation of the text, including all of the grammatical errors. I'm still confused, but slightly less so.

  4. Infra-Red vs Links! by BenJeremy · · Score: 4, Funny

    Providing links to search results was obviously far more useful to web users than Infrared.

    Duh.

  5. Plagiarize much? by Doogie+Howser · · Score: 5, Informative

    First paragraph at Wikipedia: "AltaVista was an early web search engine founded in 1995. It was once one of the most popular search engines, but it lost ground to Google and was purchased by Yahoo! in 2003, which retained the brand but based all AltaVista searches on its own search engine. On July 8, 2013, the service was shut down by Yahoo! and since then, the domain redirects to Yahoo!'s own search site.[2]"

    Second and third lines of TFA: "Founded in 1995, AltaVista was a very popular Internet search engine website. Nevertheless, AltaVista lost ground to Google and was purchased by Yahoo! in 2003. Ten years later, Yahoo! officially shut down AltaVista in July 2013 and redirected the domain name to its own search engine website."

    Hmm...

  6. Paid placement by tomhath · · Score: 4, Informative

    I don't specifically recall using Alta Vista, but I do remember how terrible all of the search engines were before Google came along. They didn't return the most relevant results, they returned the web sites that paid them to be placed higher; Google was the first one to actually do what the user wanted from a search engine - return relevant results.

  7. The interface had something to do with it by EdwinV · · Score: 3, Insightful

    By the time Altavista got popular, the interface was a cluttered mess where you could hardly find the search line. Google came with an almost empty screen with a logo and a search line. You'd have switched just to save your eyes. More like the good old Webcrawler interface.

  8. What Do You Expect? by Frosty+Piss · · Score: 4, Informative

    The article says "URLs" when the Quora post, cited as the source, says LINKS. Also the article is basically devoid of any information, other than "Google did better because it used LINKS to help determine ranking." Thanks for the headline, with a summary, linking to an article that misquotes the linked source, that has a healine worth of information. No really, thanks.

    It's a paid-for "article" to a ad-infested link-farm.

    Here's a link to the ACTUAL story: https://www.quora.com/Why-did-...

    --
    If you want news from today, you have to come back tomorrow.
  9. The AltaVista Page Sucked by Sarusa · · Score: 3, Informative

    Altavista had better results than Google for years, especially because you could use all sorts of search modifiers that Google didn't support till later like -no_pages_with_this_word or +must +have +all +these and logical operators.

    But then as the leaders they got cocky and wanted to be a portal and filled up the page with so much crap and spam it hurt. Meanwhile Google's page was still just search box, go, I'm feeling lucky, and a few other tiny things.

    That's why I switched after Google got good enough that they were comparable, NOT better. It was just less annoying. That's why most of the people I knew back then switched.

    AltaVista realized too late what they'd done and tried to rebrand as 'Raging' with just a simple search page, but by then it was too late.

    I'm sure the Google approach is much more scaleable but the article seems terribly confused and like it's trying to make some bizarre sense out of a cultural artifact from a time they can't comprehend.

  10. Re:How the hell did this get on the front page? by Lumpy · · Score: 4, Insightful

    Editors? you must be new here.

    --
    Do not look at laser with remaining good eye.
  11. It was because of AstaLavista by future+assassin · · Score: 4, Funny

    not google that cause AltaVista to fail.

    --
    by TheSpoom (715771) Uncaring Linux user here. I have nothing to add to this but please continue. *munches popcorn*
  12. Re:Was Altavista... by towermac · · Score: 4, Insightful

    Yes, Altavista was better than Yahoo. I remember reading that Yahoo was a static directory, updated by humans; whereas AV had a newfangled web crawler. Anyone remember the term 'spider'? Altavista wasn't known all that well though, and it was part of my geek cred to show it to users. And usually, it found what they were looking for.

    Pretty much the moment Google came on the scene though, it was better than Altavista. AVs answer? Plaster the front page with ads and 'content'. Make it a 'portal' to the web.

    Heh. Wrong answer.

  13. Re:And today? by phantomfive · · Score: 4, Insightful

    Today search is ALMOST ENTIRELY SHIT. It is used because shit is king.

    If you think that, you don't remember Alta Vista, which had millions of links to "Page not Found" and in the search results had multiple listings to the same (often broken) page.

    --
    "First they came for the slanderers and i said nothing."
  14. Citation index by dbIII · · Score: 4, Informative

    Google did better because it used LINKS to help determine ranking

    There's a thing called the science citation index that sorts papers that are referenced more to a higher score than those that are not referenced much, and it's a good way to find those papers on a topic that others have found most useful.
    Google saw it worked and applied a similar method using links (as the above poster wrote). That method brought human judgment that had already been applied into the mix and enabled them to index far more rapidly than AltaVista with better results than AltaVista's simple keyword searches. It was more likely to lead people to a key site that many used instead of an abandoned fan site.
    That's the main difference.

  15. inertia? AltaVista was big before Google existed by raymorris · · Score: 4, Informative

    Inertia? AltaVista, Hotbot, and Excite had the inertia. They were the big players when a couple of college students thought up the idea that became Google. AltaVista and the other established players had the inertia.

    The established search engines also had algorithms based on word frequency in various parts of the page. I did search engine optimisation back then, so I studied it in detail. The simplified explanation is that searching for "Einstein" would return whichever page had the word Einstein repeated the most on the page. Minus points for repeating it "too many" times.

    Google had a revolutionary idea. If lots of good pages link to abouteinstein.com, It's probably a good page. That's Page Rank, and it worked quite well. That's the far and above the most important reason Google won - their ranking system was far superior because it was based on a different, better, idea.

    * You might wonder how Google knows which pages are "good", in order to calculate which pages are linked to by good pages, and are therefore also good. It's recursive across the whole internet. If lots of pages link to princeton.edu/physics/, and princeton.edu/physics/ links to lab.gov/particles/, then lab.gov/particles/ gains some "good" points. Specifically, it gains an equal share of the Princeton's pages rank value as all other links on Princeton's page. In other words, whatever value a page has, that value is divided equally among each page it links to. So a page "vouches" for each page it links to, but if it links to many pages, it can also pass a small amount of credibility to each.

  16. Re:inertia? AltaVista was big before Google existe by wisnoskij · · Score: 3, Interesting

    They had a complimentary idea, not a different idea. Page Rank ranks a page in general terms, but tells you nothing on if it has anything to do with Einstein (from what I understand). You still need some form of the old way of judging the Einsteininess of a page.

    --
    Troll is not a replacement for I disagree.
  17. Alta Vista was not supposed to make money by aberglas · · Score: 4, Interesting

    It was a technology demonstration of DEC's (remember Digital Equipment? If so you are old!) new Alpha chips and servers, so powerful that they could index the entire early 1990s web. A very minor side project.

    When Compaq bought DEC, they were surprised to find that they had also bought Alta Vista. Around then somebody tried to commercialize it and killed it in the process.

    https://en.wikipedia.org/wiki/...

  18. Re:And fast! by TheRaven64 · · Score: 3, Informative

    The 'fast' thing really can't be overstated. By the time Google launched, AltaVista's search page had become huge, to the extent that it took about 30 seconds to load on a 28.8Kb/s MODEM (the fastest that mine could connect at given the line quality, though on paper it could do 56Kb/s). Google took well under 5 seconds (not because Google devs were clever and actively aimed for this, quite the reverse: they didn't initially have anyone good at HTML/CSS stuff, so produced the simplest page that worked).

    I remember the search results on Google being worse than AltaVista, but getting them so much faster that I could start loading the first 3-4 before AltaVista showed me anything. Occasionally I'd go back to AltaVista if Google failed. A few years later, Google fucked up their UI enough to make me switch to DuckDuckGo.

    --
    I am TheRaven on Soylent News