Slashdot Mirror


AllTheWeb Claims Bigger Index Than Google

An anonymous reader writes: "Hoping to attract more mass appeal for an online search engine with a cult following, Norwegian search engine AlltheWeb on Monday declared that it indexes more Internet information than longtime pacesetter Google. Boston.com has the story." Of course, pages indexed is not the only measure of a search engine and probably isn't even the most important.

102 of 291 comments

  1. AllTheWeb by Bob McCown · · Score: 2, Informative

    This is the same AllTheWeb that has been buying banner ads that launch their website into pop-ups again and again...no thanks....

  2. ALL YOUR WEB PAGES ARE BELONG TO US! by Neck_of_the_Woods · · Score: 4, Funny


    Great, you have a huge index. I know a haystack that has more than one needle, but the stack is about the size of Texas.

    --
    Neck_of_the_Woods
    #/usr/local/surf/glassy/overhead
    1. Re:ALL YOUR WEB PAGES ARE BELONG TO US! by Moonshadow · · Score: 3, Funny

      Once again proving, it's not the size of your index, it's how you use it!

      *ahem*

  3. Simplicity not quantity. by MarvinMouse · · Score: 5, Insightful

    Well, I think this might finally answer the question I have been wondering about my love of google for a long time.

    Do I love google because it's so simple and easy to use, with very quick download times, a simple graphic interface, and good search algorithms that more often than not give me the sites I am looking for in one page?

    Or do I love google because it has a ton of useful sites logged in its database, including all copies, half sites, under construction sites, etc.?

    I am willing to say that it's likely the first one, and I think that might be true for most other people.

    But either way, it'll be neat to see what AllTheWeb.com does well.

    --
    ~ kjrose
    1. Re:Simplicity not quantity. by SerpentMage · · Score: 3, Insightful

      Exactly. I did a search for "real estate Alsace" in the database and it found all of the ones that google did.

      The only problem was it found many real estate sites within the US, and other places. And these places have absolutely nothing to do with Alsace.

      What I also love with google is that it will translate my english searches into the appropriate language search. So Real Estate becomes "immobilier", which brings up another ton of results.

      It is the little touches that keep me coming back to google.

      --

      "You can't make a race horse of a pig"
      "No," said Samuel, "but you can make very fast pig"
  4. Hmm... by eaeolian · · Score: 2, Funny

    Ya think *somebody* might be compensating for something here?

    1. Re:Hmm... by pokeyburro · · Score: 2, Funny

      Sure, their frontpage looks great and all, but look at the location!

      --
      Lately democracy seems to be based on the skybox, the Happy Meal box, the X-box, and the idiot box.
  5. Less webpages by oever · · Score: 5, Funny

    The ultimate test: how many webpages about me:

    Google: 185
    AllTheWeb: 57

    I'll stick with google. It indexes more interesting stuff. ;^)

    --
    DNA is the ultimate spaghetti code.
    1. Re:Less webpages by tenman · · Score: 3, Funny

      I like the Image Search better than google's.

    2. Re:Less webpages by tenman · · Score: 2

      does google even have an mp3 search area? I looked up mp3.google.com, and found nothing.

    3. Re:Less webpages by Zeinfeld · · Score: 2
      I get an even bigger difference if I use my full name

      Google 7,760. Search took 0.39 seconds
      AllTheWeb 1,761

      However if I search on only my surname (which is very rare) I get

      Google about 11,000. All the Web 11,374

      This basically corresponds to cases where I am cited by first initial only.

      This indicates to me that Google has much more accurate discrimination functions on cross matches.

      --
      Looking for an Information Security student project suggestion?
      Try http://dotcrimeManifesto.com/
    4. Re:Less webpages by KjetilK · · Score: 2
      A search for my name: :-)
      • Google: 2,750
      • Alltheweb: 1,023.

      AOL. Actually, I've been using alltheweb from time to time. It has happened that it has been able to dig up some really obscure things that Google hasn't found, but generally, I find that if there are at least twenty pages likely to come up with the search words, Google does a whole lot better job ranking them. That really counts a lot.

      --
      Employee of Inrupt, Project Release Manager and Community Manager for Solid
    5. Re:Less webpages by Zeinfeld · · Score: 2
      Rare my rear end.

      Rare as in there are only five living people who use the name and of them I am the only one with Internet access.

      --
      Looking for an Information Security student project suggestion?
      Try http://dotcrimeManifesto.com/
  6. Obvious Ads--bleah by dalassa · · Score: 4, Informative

    Unfortunately their ads are at the top of the page, followed by "top news" and then the links themselves.
    However the first two returns for Scientology are the Scientology homepage and Operation Clambake. I wonder how long it will be before AllTheWeb is threatened.

    Besides, I'll need more proof that this thing is more accurate than google before I would consider switching.

    --
    Feminism is the radical notion that women are people.
    1. Re:Obvious Ads--bleah by Mwongozi · · Score: 3, Interesting

      Google is currently listing Operation Clambake first.

    2. Re:Obvious Ads--bleah by Mwongozi · · Score: 2

      Funny, it lists scientology.org first when I click on your link.

      It's changed since I wrote that comment. They must have seen it. I should have kept quiet. ;)

  7. yeah but.. by SeicheWarning · · Score: 4, Funny

    are they going to have any cutesy cartoons made out of their name for special occasions?

    1. Re:yeah but.. by laserjet · · Score: 2

      That's exactly what I thought. I thought, "hmm.. these aren't really that funny... but maybe they are really going to change their logo."

      Then came the end of the week and I was majorly disappointed.

      --
      Moon Macrosystems. Sun's biggest competitor.
    2. Re:yeah but.. by plover · · Score: 2
      Did anyone find the Dilbert series to be funny AT ALL?

      Nobody I know...

      Remember, the only reason Dilbert is funny is because we all get to see our pathetic, cubic lives reflected in his mirror. And when we see fun-house images of ourselves, we laugh. Scott Adams has a gift for bending the mirrors just the right way.

      But when he "tries" to be original, he kind of falls flat IMHO. Sure he sells books & stuff, but I think it's only on the Dilbert name.

      Of course, he brings money home by the truckload and I don't, so what the hell do I know?

      --
      John
  8. good not to have google monoculture by kisrael · · Score: 5, Insightful

    Well, I was pretty happy with the results of a search on my name...happier than with Google in that one case, though that's but a single tiny datapoint.

    In any case, it would be terrific to have a viable alternative to Google...despite Google's almost unnerving ability to do *so* many things Right, it is good to have somewhere to turn just in case something went wrong there. Not having a monoculture (which is what we're almost on the verge of with Google) is generally a good thing.

    --
    SO YOU'RE GOING TO DIE: The Comic for Dealing with Death
    1. Re:good not to have google monoculture by Psiren · · Score: 2

      Well, I was pretty happy with the results of a search on my name...happier than with Google in that one case, though that's but a single tiny datapoint.

      Funny, my name didn't even get a hit, whereas it's the first match on Google. But, like you say, it's a tiny datapoint. I'll stick with Google though.

  9. American Tech Companies by Anonymous Coward · · Score: 5, Funny

    The story says AlltheWeb.com is owned by a Norwegian company. Should people really support a socialist Scandinavian country? Any real American should only use capitalist homeland-based search engines, like Google or MSN Search. Like Bush says, "You're either with us or against us." Only a traitor would go against Bush's wisdom. What are you, AN AMERICAN TALIBAN?

    1. Re:American Tech Companies by DrVxD · · Score: 2, Interesting

      > What are you, AN AMERICAN TALIBAN?
      No. I'm English - and you're a colonial who has a drinking song for a national anthem.

      --
      Not everything that can be measured matters; Not everything that matters can be measured.
    2. Re:American Tech Companies by Anonymous Coward · · Score: 2, Insightful


      Well Last time I checked, the Scandinavian countries were market economies and parliamentary democracies..

      Just cuz our chicks are hot and our health care is free doesn't mean you have to get your panties in a bunch and start throwing the socialist label around!

      3 cheers for scando chicks!

  10. Nothing to write home about... by peterdaly · · Score: 3, Insightful

    I just did some searches, and it appears to be ok for finding information. Whether its logic is as good as google's is hard to tell. It's a little slower than google. It doesn't look to me like there is any reason to use it over google. How many sites worth visiting are not in google's index?

    This may be a case of a company picking a poor benchmark as their performance measurement. Google's draw is their great ranking logic, not index size.

    -Pete

  11. In other news ... by Christianfreak · · Score: 5, Interesting

    Windows declares itself better than linux,
    Gnome declares itself better than KDE
    Emacs declares itself better than VI
    PHP declares itself better than Perl ...

    Let the flames fly :)

  12. How does this work? by Sludge · · Score: 4, Funny

    I did some searches, and I ended up with different results than google. Perhaps of note, the results I got with alltheweb are from 1998, whereas google's are from this year.

    1. Re:How does this work? by Peyna · · Score: 2

      Yeah, I've noticed google will have a fairly recent version of a page available when searching, even pages that you wouldn't expect to be updated all that often. (You would expect news sites to be crawled at least daily, but not some odd website in the middle of nowhere.)

      One thing I will give google a hand for is their ability/choice to not have large graphical ads anywhere. AllTheWeb has a banner ad at the top of each page, and another one that appears beside your search results. It is mildly annoying, and if the engine proves better, could be tolerable. Google's slim environment is very appealing though.

      I'll have to try AllTheWeb for a little bit first and see if it suits my needs before I can give a better evaluation than this.

      --
      What?
    2. Re:How does this work? by Coolfish · · Score: 3, Informative

      I did an even simpler test. I searched for "monkeys" - google returned interesting results all from different websites in the first 10 results. AllTheWeb returned 10 pages, SIX of which were all on the same domain, shacknews.com, and all of them had NOTHING to do with monkeys!

      If this search engine can't even find monkeys, then forget about it.

  13. They ignore Google's cache by Andy_R · · Score: 2

    Google counts as a single page both their cache and the site in its current form, so the number of web pages you can get to from a google search is significantly higher than the number of pages they have actually 'indexed'.

    This is far more important to me as a user than some extra pages that alltheweb may have (presumably because they ignored a few 'nobots' tags that Google's crawlers respected?)

    --
    A pizza of radius z and thickness a has a volume of pi z z a
  14. Google advantages by Midnight Thunder · · Score: 2
    I'll stick with Google for the following reasons:
    • Lighter interface
    • Adverts are less intrusive
    • Has a cache in case the site gets slashdotted or taken offline by the Scientologists ;-)
    • I like the special occasion cartoons
    • It appears to be clued in to what people want
    • I am used to it and biased :-p
    --
    Jumpstart the tartan drive.
  15. Banner ads by cjpez · · Score: 2

    I'm too spoiled by Google, I think. I took one glance at the search results screen that had a few banner ads, and decided never to go there again. I understand they want to offset costs/make money off of the engine, but banner ads are ugly as sin. I'll stick with Google.

  16. Where's TEOMA? by Hheero · · Score: 2, Insightful

    I think I remember Teoma making the same claim: "we're better than Google."
    They should be featured on one of those shows - Where Are They Now?

    1. Re:Where's TEOMA? by Jugalator · · Score: 3, Funny

      lol

      Slashdot.org not among 10 first matches when searching for "slashdot"...

      Needless to say, I never wasted more time by checking the next page...

      --
      Beware: In C++, your friends can see your privates!
    2. Re:Where's TEOMA? by danny · · Score: 2
      Slashdot comes up #1 when I search on "slashdot"... In fact it comes up #1, #3, and #5!

      Danny.

      --
      I have written over 900 book reviews
    3. Re:Where's TEOMA? by PurpleBob · · Score: 2

      Hey, at least it helpfully asks if you want to refine your search to "knitting books".

      --
      Win dain a lotica, en vai tu ri silota
  17. Enough with the Google worship, already. by HEbGb · · Score: 4, Interesting

    God forbid someone presents an objective comparison between Alltheweb and Google. Responses such as "Google is my God" and Timothy's little snip in the article do nothing for anyone really interested in using a useful search engine.

    I just used Alltheweb for some common searches I do, and you know what? It found a lot more useful hits than Google did. Yea, imagine that.

    But Alltheweb didn't seem to have a cache, which I thought was very useful in Google.

    So, come on, folks, give it a chance, and don't jump to conclusions without an objective analysis. The tendency to blindly worship things like google/linux/linus/transmeta is far too common on this site.

    1. Re:Enough with the Google worship, already. by jmu1 · · Score: 2
      The tendency to blindly worship things like google/linux/linus/transmeta is far too common on this site.

      Well, look at it this way... if you want to look at a fan site for something else, go there. Don't expect everyone to be completely objective. Fact of the matter is that most people who are subjective, are vocal about said topics. Folks that don't care, or don't want to inadvertently press anyone into anything generally don't peep.

    2. Re:Enough with the Google worship, already. by iomud · · Score: 2

      The tendency to blindly worship things like google/linux/linus/transmeta is far too common on this site.

      Yeah! *looks around* Fuck Transmeta!

    3. Re:Enough with the Google worship, already. by Gutboy_Barrelhouse · · Score: 4, Interesting
      Timothy's little snip offers at least as much as your post. He's just pointing out that pages indexed is (only) one measure of a search tool. That's a starting point - what are some others? Another poster mentioned up-to-dateness of the pages returned. That's good. Clean interface is another one. Features like a cache, which you mention, are another. Response time, configurability, documentation, what else?

      Other than that you just say that alltheweb gave more hits for you on a couple of searches. That's a pretty useless measure of search tool quality to anyone else.

      For my part I'm not sure what would make me switch from Google. I really value their interface, and none of the recent challengers are preferable to me in that regard. I switched TO Google because IMO it was a big leap in result quality over the earlier generation, but until we start using personalized intelligent agents, I'm hard pressed to imagine how a new engine could produce as big a leap. Basically, as search engine quality improves, for most people the sample of searches that they'd have to do in order for a comparison to produce clearly discriminable results also grows. Will very many Google users really sit down and do like 50 searches with both it and AllTheWeb, on the chance that ATW proves slightly better? I doubt it, and I'd hate to have to be in the planning room of a Google competitor, trying to think of the killer app that could get people to switch with just a few searches.

      Though I'd certainly switch if I could have the Librarian from Snow Crash.

  18. Nothing special this way comes by psililisp · · Score: 2, Informative

    when a search for

    "php regular expression" AND "tutorial"

    on AllTheWeb gives me 131 results, with more than half being a reference to a PHP website manual (and even a disclaimer footer because it had the words "PHP" and "and" in it ???). Moreover, it took my "and" literally as a search criterion, though my advanced searching techniques could probably use a bit of help ;)

    In comparison, Google gives me 73 links (without omitted results showing) with many results displaying ALL my keywords in bold and not ONE of them using "and" as a keyword.

    Dunno, I'm probably a bit biased anyway since "Google" types out so much easier for me (repetition i guess) than "alltheweb".

    pblt....

  19. Higher for a reason... by coene · · Score: 2, Informative

    Because it indexes all of the domain names of the same site as different hosts.

    Google returns one accurate site for the company "DataHive", one domain name (not the proper one, but how would it know =)

    This site returns 3 different domains, and tries to present them as different pages, though they all have the same content.

    I can imagine it's easy to claim more than google when you multiply the number of real hits.

    I must say though, the results I found were pretty good for a number of queries. Definitely a google competitor. It does not seem to find all of the newsgroup/mailing-list stuff that google returns, good or bad depending on what you are searching for.

    It's nice to have another competent option ;)
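
The point about one site showing up under several domain names can be sketched in a few lines. This is only an illustration of counting pages by content instead of by URL, not a description of how either engine actually counts; the URLs and page bodies below are made up for the example.

```python
import hashlib


def count_unique_pages(pages):
    """Count pages by distinct content rather than by distinct URL."""
    seen = set()
    for _url, body in pages:
        # Identical bodies hash to the same digest, so aliases collapse.
        seen.add(hashlib.sha256(body.encode("utf-8")).hexdigest())
    return len(seen)


# Hypothetical crawl: three hostnames serving the same page, plus one other page.
pages = [
    ("http://example.com/", "<html>DataHive home</html>"),
    ("http://example.net/", "<html>DataHive home</html>"),
    ("http://example.org/", "<html>DataHive home</html>"),
    ("http://example.com/about", "<html>About us</html>"),
]

url_count = len(pages)                     # what a per-URL index would report
unique_count = count_unique_pages(pages)   # what a per-content index would report
print(url_count, unique_count)
```

An index that counts per URL reports twice as many pages here as one that counts per distinct content, which is the kind of inflation the comment above suspects.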

    1. Re:Higher for a reason... by Peyna · · Score: 2

      Just make sure to click the link in the upper right that says: "Offensive Content Reduction: On" before you want to search for some good porno. It does admit the filter probably is not the greatest, but it's nice to see it applied. Google offers a similar feature on their images search, but not for the content.

      I have a feeling this could be similar to the recent Hotmail changes, where a friend of mine with the last name Hancock was told he had to change his last name because it violated their rules. Same went for someone with the last name Hacker. When filtering like that you have to be very careful, because just because something is a naughty word in one context, doesn't mean it is in every context.

      --
      What?
    2. Re:Higher for a reason... by arkanes · · Score: 2

      It's not linked on the main page, but you can filter content from the advanced search page.

  20. Targetted searching is needed, never mind size by eXtro · · Score: 3, Insightful
    When I saw Gladiator I was wondering how much of the story was actually historically accurate. I don't own an encyclopedia so I did a bit of research on the web. I looked up some of the main historical characters, Emperor Commodus, Marcus Aurelius and others, using google, altavista and others. What I found was that the spike of interest in the movie had completely swamped out any historical results. I had to wade through pages and pages of hits before I finally started getting to information that was useful in my context.


    Google is my favourite search engine, even now; its ads are unobtrusive and don't pollute the search results. They've been good net citizens and they've done substantial research into how to better search. Their results are typically the best as well.


    In this case their search results were very broken however, at least for the purposes of my search. What I'd like to see is google, or an engine as effective as google, add in the ability to constrain your search to subject areas. In this instance I'd constrain my search to historical sites and would have received mostly uncorrupted hits. This is different than a web directory. Web directories don't classify sites based on their quality. Google does in a roundabout fashion: it lists sites with more people linking to them higher than sites with fewer links.


    I'm not sure how the details of this would work, self-nomination would not necessarily work. Porn companies would gladly pollute the keywords on the off chance that somebody looking for history would buy a membership to their site. Letting individuals vote a site into or out of a keyword might work, though you'd be in danger of concerted efforts to say vote out anti-Scientologist information and vote in pro-Scientologist information when both actually could be under a religious keyword.


    Anyway, linking to more sites isn't necessarily helpful in my opinion. What I'd prefer is the ability to narrow the focus of my searches.
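
The "more people linking to it ranks higher" idea described above can be sketched as a plain inbound-link count. This is the simplification from the comment, not Google's actual ranking algorithm, and the page names are invented for the example.

```python
from collections import Counter


def rank_by_inbound_links(links):
    """Order pages by how many distinct pages link to them, most-linked first."""
    inbound = Counter()
    for source, target in set(links):  # set() drops repeated identical links
        if source != target:           # ignore self-links
            inbound[target] += 1
    # Sort by count descending, then by name so ties are deterministic.
    return sorted(inbound, key=lambda page: (-inbound[page], page))


# Hypothetical link graph: (linking page, linked page)
links = [
    ("fan-site", "movie-page"),
    ("blog", "movie-page"),
    ("forum", "movie-page"),
    ("university", "history-page"),
    ("blog", "history-page"),
    ("forum", "fan-site"),
]

ranking = rank_by_inbound_links(links)
print(ranking)
```

With a spike of links to the movie page, it outranks the history page regardless of which one the searcher wanted, which is the swamping effect described in the Gladiator example.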

    1. Re:Targetted searching is needed, never mind size by arkanes · · Score: 2

      Shoulda added "and NOT movie" or other negative search terms.

  21. The true test by Mr_Silver · · Score: 5, Funny
    Search for "sex":

    Google: 63,500,000
    AllTheWeb: 25,435,205

    I think I'll stick with Google :o)

    --
    Avantslash - View Slashdot cleanly on your mobile phone.
    1. Re:The true test by pubjames · · Score: 2

      Well, I think there both rubbish.

      I searched for "Juliette Binoche tits" on the image search of both sites (with filtering turned off in both cases) and neither returned satisfactory results.

    2. Re:The true test by pubjames · · Score: 2

      there => they're

      Damn. Spotted it just after I'd clicked submit.

    3. Re:The true test by cybercuzco · · Score: 2

      So since google searches 2,073,418,204 webpages, and you got 63,500,000, that means that about 3.1% of the web is at least somewhat "sex" related, and this doesn't mean porn either, just instances of the word sex. So the internet is hardly the haven of pr0n fiends that it once was (slashdot notwithstanding)
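
For anyone who wants to check the division, here it is worked through with the two figures quoted in this thread. Nothing else is assumed; the variable names are just labels.

```python
google_index_size = 2_073_418_204  # index size quoted in the comment above
hits_for_sex = 63_500_000          # Google hit count quoted in the parent post

share = hits_for_sex / google_index_size
share_percent = round(share * 100, 1)
print(f"{share_percent}% of indexed pages")
```

With these inputs the share comes out at roughly 3.1%.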

      --

  22. More pages = more crap by cr@ckwhore · · Score: 2

    More pages means more crap and shitty search results. Similar to programming, more lines of code doesn't mean better.

    --
    Skiers and Riders -- http://www.snowjournal.com
  23. Cached here by GothChip · · Score: 5, Funny

    For those who are unable to reach AllTheWeb here is the homepage through the usual Google cache.

    1. Re:Cached here by jesser · · Score: 2

      And the forms seem to cut out incomplete tags even when posting in plain text, which is a bit crap.

      Slashdot's names for comment modes are confusing. "Plain Old Text" is really a "do what I mean" mode: line breaks become paragraphs, but HTML tags are interpreted as HTML tags. If you don't want any tags interpreted, you have to use "Extrans" mode. Kuro5hin's names for the modes make more sense.

      --
      The shareholder is always right.
  24. Major, Major Flaw by limekiller4 · · Score: 4, Interesting

    I just tried to pull up one of my own pages with this engine. Got:

    "Redirection limit for this URL exceeded. Unable to load the requested page."

    Which, as near as I can tell, is their way of throttling commercial hits. Wonderful. Moving the mouse over the link doesn't reveal the address in the bottom bar, either, so the only way I can think of to obtain the address of the item it matches is by right-clicking and selecting 'copy link address', opening a new window and pasting it in (and having a browser that is capable of doing this), then editing the URL so only the target link text remains.

    You can't even right-click and open in a new window to do this. If you try, you get "about:blank" which, afaik, means they're using javascript.

    These people sure go through a lot of pains to render a result and then not let you anywhere near it. Saying they're bigger than Google is a bit like someone bragging about how their PDP-11 is bigger than my Athlon. Cripes.

    --
    My .02,
    Limekiller
    1. Re:Major, Major Flaw by foo fighter · · Score: 2

      I'm guessing you are using Mozilla.

      The link does the same thing to me when using 1.1alpha on Win2k SP2+ (My primary browser and platform).

      Internet Explorer 6 follows the link with no problems.

      So I'd say it's a browser issue and not a web page issue.

      --
      obviously no deficiencies vs. no obvious deficiencies
    2. Re:Major, Major Flaw by yomahz · · Score: 2

      I'm guessing you are using Mozilla.

      The link does the same thing to me when using 1.1alpha on Win2k SP2+ (My primary browser and platform).

      Internet Explorer 6 follows the link with no problems.

      So I'd say it's a browser issue and not a web page issue.


      I got the same error he got when using mozilla so I looked at the source and it's not javascript coming from that page. I followed the link and got the same error so I decided to telnet to port 80 and check out the source on the page and this is what I got:

      telnet www.alltheweb.com 80
      Trying 66.77.74.20...
      Connected to www.alltheweb.com.
      Escape character is '^]'.
      GET /go/1/H/web/http/www.kaosinc.com/jen.shtml HTTP/1.0

      HTTP/1.1 302 Found
      Date: Mon, 17 Jun 2002 16:27:31 GMT
      Server: Apache/1.3.24 (Unix) PHP/4.2.0-atw
      X-Powered-By: PHP/4.2.0-atw
      Location: http://www.kaosinc.com/jen.shtml
      Connection: close
      Content-Type: text/html

      Your browser does not support HTTP redirects.

      click here to reach http://www.kaosinc.com/jen.shtml.
      Connection closed by foreign host.

      Looks fine to me.... the 302 (redirect) should be working fine. I wanted to try with IE but my laptop seems to be having some network problems at the moment.

      Very, very strange...
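
To make the telnet reading concrete, here is a minimal sketch that pulls the status code and the Location header out of a raw response like the one pasted above. It is a hand-rolled parser for illustration only, not a real HTTP client, and the function name is made up.

```python
def parse_redirect(raw_response):
    """Return (status_code, location) from a raw HTTP response string."""
    text = raw_response.replace("\r\n", "\n")  # tolerate CRLF line endings
    head, _, _body = text.partition("\n\n")    # headers end at the first blank line
    lines = head.splitlines()
    status_code = int(lines[0].split()[1])     # e.g. "HTTP/1.1 302 Found" -> 302
    headers = {}
    for line in lines[1:]:
        name, sep, value = line.partition(":")  # split on the first colon only
        if sep:
            headers[name.strip().lower()] = value.strip()
    return status_code, headers.get("location")


# Response headers copied from the telnet session above.
RAW = """HTTP/1.1 302 Found
Date: Mon, 17 Jun 2002 16:27:31 GMT
Server: Apache/1.3.24 (Unix) PHP/4.2.0-atw
X-Powered-By: PHP/4.2.0-atw
Location: http://www.kaosinc.com/jen.shtml
Connection: close
Content-Type: text/html

Your browser does not support HTTP redirects.
"""

status, location = parse_redirect(RAW)
print(status, location)
```

A 302 with a Location pointing at the target page is what a redirect is supposed to look like, which matches the "looks fine to me" reading of the session.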

      --
      "A mind is a terrible thing to taste."
  25. AllTheWeb _has_ one advantage by rainer_d · · Score: 3, Informative
    Even if you totally discard it as a Web-Search-Engine, the FTP-Search is still one of the best.
    It was hidden as ftpsearch.lycos.com for some time, but now it seems to have come "home".
    BTW: the last time their OS was visible through the firewall, it was FreeBSD...

    Anyone remember archie ?

    --
    Windows 2000 - from the guys who brought us edlin
  26. still very important by shd99004 · · Score: 2

    True, the number of indexed pages is not the single most important thing for a search engine. But it is definitely up there near the top. Personally I would prefer more indexed pages over most other things one can measure a search engine by, simply because then I know there's a greater chance of finding what I'm looking for, even if it may be a little more difficult.

    --
    Will work for bandwidth
  27. RTFM by Hellkitten · · Score: 2, Informative

    There is actually a help link. "php regular expression" + "tutorial" would have given you what you wanted. If you want to compare two tools you should at least use enough time to see if you have to use the two differently, and then see what is best at getting the job done.
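
As a rough sketch of "use the two tools differently": a tiny helper that rewrites an AND-joined query into the + form quoted above. The only thing taken from this thread is that one example query; the helper name and the assumption that AND always maps to + are hypothetical, so check the engine's help page for the real syntax.

```python
import re


def and_to_plus(query):
    """Rewrite 'A AND B' into the 'A + B' form quoted in the comment above."""
    # Split on the standalone uppercase operator AND surrounded by whitespace.
    # Caveat: an " AND " inside a quoted phrase would be split as well.
    parts = re.split(r"\s+AND\s+", query.strip())
    return " + ".join(parts)


rewritten = and_to_plus('"php regular expression" AND "tutorial"')
print(rewritten)
```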

    --
    - We are the slashdot. Resistance is futile. Prepare to be moderated -
  28. How about the image search then? by Jugalator · · Score: 3, Funny

    I'll just say this:

    Google manages to get a graph of the slashdot effect among the first 20 hits, while AllTheWeb just manages to get Cliff showing a Think Unix book (in weirdo hawaiian clothes). :-)

    I don't know about you, but Google gives me more relevant matches as usual. ;-)

    --
    Beware: In C++, your friends can see your privates!
  29. Old pages, not grouped by site - less useful by mttlg · · Score: 2
    My completely unscientific and extremely quick and biased test resulted in a failing grade for this search engine. A search for mttlg (English only) brought up lots of stuff, most of it correctly, but it completely missed mttlg.net and its previous incarnations (they aren't even indexed). It did turn up plenty of /. posts, some of which link to mttlg.net, as well as some pages from a free web hosting service that I used to use but can't do anything with anymore (the service is essentially dead, but you can't tell this until you log in and try to use it...).

    Worse than the outdated and useless search results is the way they are presented - there is no grouping by site to put similar pages under one entry. Of the 167 results, almost all of them are from two distinct sites, but you have to wade through all of them to find any different ones. With a more common search string, it will be almost impossible to find what you are looking for, and it is still difficult with a narrow focus search. Google ain't going down that easy...
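
The grouping by site asked for above can be sketched as bucketing result URLs by hostname. This is just one plausible way to do it, not how any particular engine clusters results; the URLs are invented for the example.

```python
from urllib.parse import urlparse


def group_by_site(urls):
    """Bucket result URLs by hostname, keeping first-seen order of sites."""
    groups = {}
    for url in urls:
        host = urlparse(url).netloc.lower()
        groups.setdefault(host, []).append(url)
    return groups


# Hypothetical result list dominated by two sites.
results = [
    "http://forum.example/comments?id=1",
    "http://freehost.example/~user/index.html",
    "http://forum.example/comments?id=2",
    "http://freehost.example/~user/links.html",
    "http://forum.example/comments?id=3",
]

grouped = group_by_site(results)
for site, hits in grouped.items():
    print(site, len(hits))
```

Five hits collapse into two entries, so a result from a third site would no longer be buried under repeats from the first two.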

  30. It's not worship when... by YaRness · · Score: 2

    ...the object of claimed affection really IS as good as everyone says.

    And, Google forbid, should google start to suck, or something else start to be better, then I think most of us would find another search engine to "worship", like I (and I assume many others) did when Yahoo went down the toilet.

    For me, the one mention of pop-ups and heavy graphic ads is more than enough to make it not worth my while to check out (and yes, I know, at home, I can filter out all the banner ad and pop-up garbage, but here at work I don't have the luxury of arbitrarily installing proxies and browsers to do that sort of thing. besides, web sites that use pop-ups piss me off).

  31. My index... by Kingpin · · Score: 2


    ..is bigger than your index.

    Computer scientists - pfft...

    --
    Unable to read configuration file '/bigassraid/htdig//conf/14229.conf'
    Geocrawler error message.
  32. News? by Publicus · · Score: 2

    Today the New York Times claimed that it had published "All the News That's Fit to Print."

    One question remains unanswered: Will they be able to do it again tomorrow?

    Note to moderators: This is sarcasm. It isn't off-topic. I'm implying that some marketing ploy by alltheweb.com isn't exactly newsworthy. Thank you.

    --

    My Karma was at 49, then they switched to words. All that work for nothing!

  33. Re:wasn't a snip by HEbGb · · Score: 2

    Your comment sounded to me like it was specifically designed to diminish the importance of AllTheWeb's claim, in favor of Google. Perhaps it was unintentional, but I doubt it.

    I don't know if you're a Google-worshiper, but you certainly ran to its defense when faced with a strong claim from a competing search engine.

  34. Depends what you are searching for by nick255 · · Score: 2

    If you are looking for something really specific (eg. the DNS entry of your machine to see which webpages you look at publish log files), then alltheweb in my experience will find a number of pages which google misses.

    For general searching google still rocks.

  35. The old saying holds true: by nemesisj · · Score: 3, Funny



    It's not how big it is, it's how you use it.

    Google is still way more useful in my opinion.

  36. Real Life Example by hysterion · · Score: 2

    Alltheweb's claims are not unfounded, and I find it always worth checking when google fails.

    Here is one of several real life cases where it found software for me that google didn't.

    (It still does, and google still doesn't.)

  37. hmm passes the pr0n test by asv108 · · Score: 2, Redundant
    Pr0n search with alltheweb.com, 48,518 results found.

    Google Pr0n Search finds 46,200 results.

    Searching for pr0n via alltheweb.com leads to 2318 more potential pieces of pr0n to be seen.

  38. Vote Google by salmo · · Score: 5, Interesting

    The reason I'm for Google has little to do with technology. It has everything to do with advertisements and capitalism.

    I'd rather support a company that uses subtle advertisements like Google does than a company that uses in-your-face banner ads, etc. (Then again I'm posting on Slashdot!) Also I make a point to check out the ads every now and then on Google and visit the company's site. I may be getting hosting from an advertiser on Google soon.

    If people who advertise on Google make more money than they do with banner ads, pop-ups, etc. then we'll see the idea spread. I don't like in-my-face ads, so I do what I can to tell companies that. It's called being a responsible consumer.

    Plus more valid hits come up when I search for myself on Google ;-).

  39. Hrm... by autopr0n · · Score: 2

    Comes up with more hits for my name than google.

    I can't believe how many people have my "Subtle mind control? why do all the HTML buttons say 'submit'" quote on their sites.

    --
    autopr0n is like, down and stuff.
    1. Re:Hrm... by jesser · · Score: 2

      I can't believe how many people have my "Subtle mind control? why do all the HTML buttons say 'submit'" quote on their sites.

      Have you seen this? Submit

      --
      The shareholder is always right.
  40. Not entirely fair by palmech13 · · Score: 2, Interesting
    Sure AllTheWeb may have more links, but when I did a search on my name, I found several hits on pages that haven't existed for many moons. Every search engine has this problem, but Google has long since flushed those pages. And in the time between the page vanishing and Google noticing, they've got it cached. :)

    Of course, as has been mentioned a few times above, competition is a Good Thing (TM).

    - Ardenstone

  41. Re:wasn't a snip by foobar104 · · Score: 2

    Your comment sounded to me like it was specifically designed to diminish the importance of AllTheWeb's claim, in favor of Google.

    I think you're right, except for the "in favor of Google" part. Timothy said, "pages indexed is not the only measure of a search engine and probably isn't even the most important." AllTheWeb claims that their page index is big, and Timothy is reality-checking that claim.

    I think Slashdot editors get too snippy too often in their story posts. But this isn't one of those occasions.

  42. When you click? by autopr0n · · Score: 2

    I don't think there's anything wrong with opening a new window when you click an ad. It's not the same thing as a popup, and most of the time it's the choice of the website admin, not the advertiser.

    Also, lots of people prefer opening new sites in new windows. Myself included.

    --
    autopr0n is like, down and stuff.
    1. Re:When you click? by SoupIsGoodFood_42 · · Score: 2
      Also, lots of people prefer opening new sites in new windows. Myself included.

      But a lot of us don't. Which is why you should let the user choose. Or even better, do what Google does and give you an option to have the links open in a new window (if you get sick of right clicking each link).

  43. THE by FFNieko · · Score: 2, Interesting

    Norwegian search engine AlltheWeb on Monday declared that it indexes more Internet information than longtime pacesetter Google.

    Then how come the word with the most search results on Google (FYI: "the") returns fewer results on alltheweb?

  44. Re:Kuro5hin is dying!!!! by Mr.Sharpy · · Score: 2

    wow, that's the first "'insert name' is dying!!!" post I have ever seen that is legitimate. interesting.

  45. Main Pages Found by highlander123 · · Score: 2, Interesting

    Google always seems to give me what I want, faster than anything else. Either this is because of its search algorithms, or because it has only the indexes linked. Example: I search for engsoc (looking for Canadian University Engineering Societies) and I find all the "main" entry pages with google, and I find a littering of "inside" pages with obscure titles with this new one. I'll stick with google, and my chances of using the "I feel lucky" button are high, since what I want is usually the first or second link.

  46. slashvertisement? by bilbobuggins · · Score: 2
    'Clive, we need some way to get a lot of heavy computer users to our new spiffy search engine...'

    'I've got it Herbert! Let's make some inflammatory claim about Google that has nothing to do with the actual quality of either site's results and sit back and watch the hits roll in!'

    yeah, so it's an obvious troll, but i guarantee you it's true.

  47. bigger index doesn't always help by foobar104 · · Score: 2

    I searched both Google and AllTheWeb for the name of my company. (For privacy reasons, I'm not going to tell you the name.) We are a small company, and probably few pages on the web link to our site, but Google pulled up our home page as its first search result. AllTheWeb failed to list it in its first page of links.

    It's not hard to find our site, either. Our company's name is "foo bars"* and our URL is "foobars.com." Google nailed it, while AllTheWeb bombed.

    Doing a more complex search with lots of words from our home page did, finally, get AllTheWeb to cough up our site. So I know it's in there.

    So in my opinion it has little to do with how big their index is. It has to do with how good they are at finding what I'm looking for. For me, Google almost always finds what I'm looking for. I've even started using the "I Feel Lucky" button to skip the search results altogether and just take me straight to the first listed site.

    *Incidentally, I've always wanted to open a pub called the Foo Bar, but I don't think many people would get it.

  48. comments on AllTheWeb by OpenMind(tm) · · Score: 2, Interesting

    This did return more results for some search terms than google. Not many of the extras seemed all that useful, though. The signal to noise ratio seems a bit lower.

    The ordering of pages seems less helpful. In many cases, the page I'm looking for is farther down the page.

    The sponsored links and advertising are way more noticeable, and get in the way of the search results, although they're probably easy enough to ignore.

    Google seems to be better at rating by search term proximity, under the useful assumption that if the search terms occur close to each other, it is less likely to be a random hit. One irritation with AllTheWeb is that for many results, it doesn't show you the context of the search terms in the summary.

    Obviously AllTheWeb lacks the excellent USENET archive. The video and MP3 search features might be pretty useful, I haven't had a chance to try them.

    I realize I'm coming across as entirely pro-Google, but these are the only observations I have right now. I'll give AllTheWeb a chance, and let internet darwinism settle the issue.

  49. alltheweb claims by BadTuna · · Score: 2, Interesting

    Do a search for slashdot: Google = 2,250,000, AllTheWeb = 1,649,088. What's up with that?

    --
    Your sig here!
  50. Re:I have seen the Fast technology. by moogla · · Score: 2

    What was their pitch?

    I'm interested.

    --
    Black holes are where the Matrix raised SIGFPE
  51. It's your webserver! by yomahz · · Score: 2
    It doesn't seem to be a problem with alltheweb.com, your webserver seems to be stuck in some sort of 302 loop. It keeps redirecting to itself.


    telnet www.kaosinc.com 80
    Trying 192.203.175.245...
    Connected to www.kaosinc.com.
    Escape character is '^]'.
    GET /index.shtml HTTP/1.0

    HTTP/1.1 302 Found
    Date: Mon, 17 Jun 2002 16:51:47 GMT
    Server: Apache/1.3.23 (Unix) Debian GNU/Linux PHP/4.1.2 ApacheJServ/1.1.2
    Location: http://www.kaosinc.com/index.shtml
    Connection: close
    Content-Type: text/html; charset=iso-8859-1

    302 Found

    Found
    The document has moved here.


    Connection closed by foreign host.

    --
    "A mind is a terrible thing to taste."
  52. Your web server is broken by jesser · · Score: 3, Informative

    Redirection limit for this URL exceeded. Unable to load the requested page.

    That is a Mozilla error message (source) and does not come from alltheweb. Your web server is broken. http://www.kaosinc.com/jen.shtml redirects to http://www.kaosinc.com/index.shtml, which then redirects to itself. This happens regardless of where I find the link to http://www.kaosinc.com/jen.shtml, or what browser I use to load it. IE appears to just sit there, Opera bounces between various stages of trying to connect, and Netscape 4 gives up after a few redirects and displays a raw 302-found page ("The document has moved _here_") without redirecting.
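The redirect chain described above (jen.shtml to index.shtml, which then points back at itself) can be sketched without touching the network. This is a minimal illustration, not how Mozilla or any other browser actually implements its redirection limit: the `follow_redirects` helper, the `REDIRECTS` table, and the limit of 20 are all assumptions made up for the example.

```python
from typing import Callable, List, Optional, Tuple


def follow_redirects(
    start: str,
    next_location: Callable[[str], Optional[str]],
    limit: int = 20,  # assumed cap, chosen only for illustration
) -> Tuple[List[str], str]:
    """Follow redirects from `start` and report how the chain ends.

    `next_location(url)` returns the Location target for `url`,
    or None when the URL answers without redirecting.
    Returns (chain_of_urls, status) with status "ok", "loop" or "limit".
    """
    chain = [start]
    seen = {start}
    url = start
    while len(chain) <= limit:
        target = next_location(url)
        if target is None:
            return chain, "ok"
        chain.append(target)
        if target in seen:
            # We have been here before: the server is sending us in circles.
            return chain, "loop"
        seen.add(target)
        url = target
    return chain, "limit"


# Hypothetical redirect table mirroring the behaviour reported in this comment.
REDIRECTS = {
    "http://www.kaosinc.com/jen.shtml": "http://www.kaosinc.com/index.shtml",
    "http://www.kaosinc.com/index.shtml": "http://www.kaosinc.com/index.shtml",
}

chain, status = follow_redirects("http://www.kaosinc.com/jen.shtml", REDIRECTS.get)
print(status)
for hop in chain:
    print("  ", hop)
```

A client that only counts hops would report the same fault as "limit exceeded"; remembering visited URLs simply catches it sooner.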

    Moving the mouse over the link doesn't reveal the address in the bottom bar, either, so the only way I can think of to obtain the address of the item it matches is by right-clicking and selecting 'copy link address', opening a new window and pasting it in (and having a browser that is capable of doing this), then editing the URL so only the target link text remains.

    An easier way to see the URL of the link is to hold the mouse down over the link, and then move off of the link before you lift the mouse button. But I still get the infinite-redirect error message if I type your URL directly.

    You can't even right-click and open in a new window to do this. If you try, you get "about:blank" which, afaik, means they're using javascript.

    If I right-click on a link from the alltheweb search results and select "open link in new window", I see http://www.alltheweb.com/go/1/H/web/http/www.kaosinc.com/jen.shtml in the location bar and get the same error message. What version of Mozilla are you using?

    --
    The shareholder is always right.
  53. Re:wasn't a snip by HEbGb · · Score: 2

    I don't agree; the article was clearly making a comparison between ATW and Google, to which Timothy responded by diminishing ATW's claim, clearly to the benefit of Google.

    If the article had been about Google indexing its N-billionth page, do you think Timothy would have quipped something about its irrelevance? I doubt it. More likely there'd be the usual drooling.

  54. Re:Something else to consider... by symbolic · · Score: 2


    Being a Norwegian company, would they be under the same mandate to hand over all 'suspect' search queries for abuse by the US's new CIAFBINSASSSASD (known in PRSpeak as the Information Awareness Office)?

  55. Our old friend ftpsearch.ntnu.no reincarnated by Perianwyr+Stormcrow · · Score: 2

    Years ago, I used to use ftpsearch to find warez left in public incoming dirs by warez couriers.

    Glad to see it's back, after a sojourn as a non working component of Lycos.

    --

    What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey

  56. Catchy name by bperkins · · Score: 2

    The reason why AllTheWeb will surpass google is that it has a much catchier name.

    As a bonus, alltheweb (when properly separated with spaces) is proper English.

  57. Re:Something else to consider... by glitch! · · Score: 2

    Being a Norwegian company, would they be under the same mandate to hand over all 'suspect' search queries for abuse by the US's new CIAFBINSASSSASD (known in PRSpeak as the Information Awareness Office)?

    I would hope not, but perhaps there might be a profit angle involved.

    More seriously, do you have any knowledge that this "mandate" exists? Is it public law? Executive order? Secret executive order? A directive from "high levels"? Or is this more of an "intelligent concern" of yours? There's nothing wrong with that - everyone with half a brain should be concerned about these possibilities.

    --
    A dingo ate my sig...
  58. size matters? by kootch · · Score: 2

    come on, we all know that *cough* it isn't the size that matters, but how well you use it to um, achieve results.

    :)

  59. Re:wasn't a snip by foobar104 · · Score: 2

    Okay, I get it. You've got a bug up your ass. I'll stop trying to have an objective conversation with you now.

  60. A small test by Loki · · Score: 2, Informative

    Let's look at the numbers shall we?

    Fnord: Google: 104000 AllTheWeb: 46439
    Cheese: Google: 3690000 AllTheWeb: 7718252
    Linux: Google: 48000000 AllTheWeb: 26670311
    Windows: Google: 44600000 AllTheWeb: 66545303
    Extropian: Google: 4460 AllTheWeb: 3999
    Kumquat: Google: 32600 AllTheWeb: 42889
    Question Authority and the authorities will question you.: Google: 90 AllTheWeb: 74
    Hot man meat: Google: 229 AllTheWeb: 1661
    Hot pussy: Google: 104000 AllTheWeb: 770057

    "undefined reference to" error: Google: 31700 AllTheWeb: 8548
    "Antimatter-Catalyzed MicroFission / Fusion": Google: 6 AllTheWeb: 1

    Surprisingly alltheweb does return more hits in some areas, most notably for cheese, windows, and pr0n. With the cheese test, AllTheWeb helpfully cluttered my screen with a banner for food products. Google, thankfully, is still bannerless, and returns more linux hits, fnords, and Voltaire quotes. Alltheweb also stalled several times and I had to resubmit a search. Conclusion: If you're a linux geek or you want to know about fnords, futuristic philosophies, compilation errors, or advanced space propulsion concepts, google is better. If you're a horny windows user and want to find gay or straight pr0n, and if you for some reason like kumquats and want to learn more about cheese, use alltheweb.
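For anyone who wants to redo the tally, here is a rough sketch that counts which engine reported more hits per query. The figures are copied from the list above; the `COUNTS` table and the `tally` helper are made-up names for this illustration, and reported hit counts are only estimates from each engine, so treat the result as a curiosity rather than a measurement.

```python
# (query, Google hits, AllTheWeb hits), copied from the numbers posted above.
COUNTS = [
    ("Fnord", 104000, 46439),
    ("Cheese", 3690000, 7718252),
    ("Linux", 48000000, 26670311),
    ("Windows", 44600000, 66545303),
    ("Extropian", 4460, 3999),
    ("Kumquat", 32600, 42889),
    ("Question Authority and the authorities will question you.", 90, 74),
    ("Hot man meat", 229, 1661),
    ("Hot pussy", 104000, 770057),
    ('"undefined reference to" error', 31700, 8548),
    ('"Antimatter-Catalyzed MicroFission / Fusion"', 6, 1),
]


def tally(counts):
    """Group queries by whichever engine reported more hits."""
    wins = {"Google": [], "AllTheWeb": [], "tie": []}
    for query, google_hits, atw_hits in counts:
        if google_hits > atw_hits:
            wins["Google"].append(query)
        elif atw_hits > google_hits:
            wins["AllTheWeb"].append(query)
        else:
            wins["tie"].append(query)
    return wins


wins = tally(COUNTS)
for engine, queries in wins.items():
    print(f"{engine}: {len(queries)} queries")
    for query in queries:
        print("   ", query)
```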

    Seriously, I'll probably stick with google, better numbers or no. The only thing AllTheWeb has going for it is the ftp search. The original is owned by lycos now and broken.

  61. The REAL Measure of a Search Engine... by joshjs · · Score: 2

    ...is how often they include cute, relevant, wacky different variations on their logo.

    Right?

  62. Google creating a new information economy? by rwa2 · · Score: 2

    Hey, check out one of the new Google beta programs, answers.google.com. Even you can now earn fame and fortune, and yes, even internet cash, by searching google's archives for answers to people's questions...

  63. Clambake by KjetilK · · Score: 2

    I wonder how long it will be before AllTheWeb is threatened.

    Could be interesting, since AllTheWeb is based in Norway, the same country where Operation Clambake is. They might say "DMCA, what?"

    But then, they might not, since the index itself is probably in the US, and besides, our Big Sister Sunde thinks DMCA is Norwegian law anyway, so she'll be banging on the doors once she gets $cientology on the phone.

    --
    Employee of Inrupt, Project Release Manager and Community Manager for Solid
  64. Washington Post article by scubacuda · · Score: 2

    here.

  65. Simple Decision... by ffatTony · · Score: 2
    google.com = 10 characters, alltheweb.com = 13 characters

    I'll stick with google, those 3 extra characters are too much for me. Seriously though, alltheweb seems pretty good, I tried their mp3 search and it was ok, not really comparable to the myriad p2p clients, but still a nice feature.

  66. Re:Something else to consider... by symbolic · · Score: 2

    All I've read suggests that the IAO effort is geared toward building a massive data infrastructure that will allow fast access to all manner of information related to a specific target (which could be anyone that fits into a specific profile, for one reason or another). So, it's not a mandate per se, but the mere fact that our browsing habits, including search queries, could be part of it is, and should be, unsettling for every American citizen. The problem is that the government will have more and more access to information with less and less control or accountability.

  67. Total Documents? by lostchicken · · Score: 2

    AllTheWeb has summed all their formats to get 2.1 billion.

    If you add Google's 700mil USENET articles, 300mil images, etc, Google has >3,000,000,000 documents to search. That kills ATW.

    --
    -twb
  68. I dunno. by autopr0n · · Score: 2

    To me, Alltheweb really does have a superior sense of style; it's more aesthetically pleasing. And even with the banners it's still really fast.

    I'm probably going to start using both regularly.

    --
    autopr0n is like, down and stuff.