Slashdot Mirror


WebCrawler Turns 10 Today

Brian Pinkerton writes "WebCrawler, one of the first search engines on the 'Net, turns 10 today. You can read a short history of WebCrawler. When I wrote WebCrawler, one could do a credible job of crawling, indexing, and searching the Web from a single desktop PC. Today, the reality is a little bit different."

29 of 136 comments (clear)

  1. Guess this celebration... by oberondarksoul · · Score: 5, Funny

    ...won't have an accompanying Google Doodle?

    --
    And tomorrow the stock exchange will be the human race
  2. e-mailing results by qewl · · Score: 4, Interesting

    Ah the nostalgia of receiving search results via e-mail :)

    --

    (\_/)
    (O.o) This is Bunny. (> <)
    1. Re:e-mailing results by Brianwa · · Score: 5, Informative

      You can be emailed results from Google as well.
      Simply email google@capeclear.com with the search terms in the subject line, you will soon recieve a response with the results. I think there is a limit to how many times a day you can use this, but I cannot find the link to the project webpage.

  3. Birthdays by 7Ghent · · Score: 4, Funny

    Happy birthday to Webcrawler AND Hitler! Hurray!

  4. They used to be my google.... by wo1verin3 · · Score: 5, Interesting

    I remember when webcrawler was the only search engine I touched...

    In 1996 it was nice and simple. Then as the time went on it got a bit too cluttered for my liking. Now looks like they're trying to googlize themselves with the current interface.

    1. Re:They used to be my google.... by Basehart · · Score: 4, Funny

      I really like their pantyhosecrawler companion site.

      Very cool research tool.

  5. Whoa! by outZider · · Score: 5, Interesting

    Holy crap!

    I remember WebCrawler, but lost touch with it in around 1996, when I started religiously using AltaVista. They sure have changed a bit. ... but do they have any relevance anymore? They're owned by InfoSpace. :P

    --
    - oZ
    // i am here.
  6. Wow by z0ink · · Score: 5, Interesting

    Does anybody else remember getting a WebCrawler promotional CD 10 years ago? I didn't even have a CD-ROM then!

    --
    Steal This Sig
  7. I remember using Webcrawler before google... by John+Seminal · · Score: 5, Insightful
    It was a good search engine. I dunno why I stopped using it, I think it was a bit on the slow side and Google had more pages.

    Heck, while reminiscing, I remember when excite was my start page, and when I used them for email. I remember they were the first "start" page to have groups. I stopped using them 4 years ago when their email stopped working.

    I guess if anything, we can learn the web is not going to be the same in 5 years as it is today. My question is, "is it better"? Personally, I think it was better back in the day. I would like to see a search engine that does not display any spam or sales or sex sites as hits. I now do most of my searches on google doing "search parameters site:edu".

    --

    Rosco: "If brains were gunpowder, Enos couldn't blow his nose."

    1. Re:I remember using Webcrawler before google... by Anonymous Coward · · Score: 5, Interesting

      I think I remember why I left WebCrawler.

      WebCrawler was simple and effective. But then AltaVista emerged. I started using AltaVista.digital.com, and from there WebCrawler went down hill - lots of advertising and junk that kind of made me hate it. What was once seemless and simple became noisy.

      I used AltaVista for a number of years, but once again advertising got the best of it. It turned super-sophisticated, with a lot of advertising fluff and "features". Altavista was becoming overly commercialized. They had a "simple" version that was better (I forget the name [begins with an "R"?]), but soon the result sets were scewed towards advertisers and abusers.

      In 2001, I made the switch to Google. It was everything that WebCrawler once was in terms of ease of use and quality of results. I've been more or less happy with Google ever since.

    2. Re:I remember using Webcrawler before google... by qodfathr · · Score: 5, Informative

      You are remembering raging.com, still up-and-running today.

      --
      Yes, it's true. This man has no dick.
  8. Then and now... by jdreed1024 · · Score: 5, Funny
    When I wrote WebCrawler, one could do a credible job of crawling, indexing, and searching the Web from a single desktop PC. Today, the reality is a little bit different.

    No kidding. Back then, one could serve a website from most any machine, and it would be there for all to see. Today only the largest websites can avoid a slashdotting with only 9 posts in the thread.

    --
    There is no sig, there is only Zuul.
    1. Re:Then and now... by mph · · Score: 5, Funny
      Today only the largest websites can avoid a slashdotting with only 9 posts in the thread.
      Imagine how bad it would be if everyone actually read the articles.
  9. Birthday party by jacobhoupt · · Score: 5, Funny

    I'll be hosting my tenth annual WebCrawler birthday party tonight in the back of my Yugo.

    Feel free to drop in, there should be plenty of seating available for those interested.

    --
    -- the only good thing the French ever did was two chicks at one time
  10. my new hero by theMerovingian · · Score: 4, Funny


    Some guys are too cool for their own good. Brian Pinkerton has the domain 'thinkpink.com', AND he wrote his own search engine.

    I bet he even has a 3-digit UID, a beowulf cluster of Xboxes running linux, and he sold all his stock options during the bubble. :)

    --
    "If you think you have things under control, you're not going fast enough." --Mario Andretti
  11. When did they give up.... by David+Hume · · Score: 5, Interesting

    ...on their own web search technology and become a metasearch engine? From the WebCrawler About Page:

    WebCrawler uses innovative metasearch technology to search the Internet's top search engines, including Google, Yahoo, Ask Jeeves, About, Teoma, FindWhat, LookSmart, and many more.

    With one single click, WebCrawler searches the best results from the combined pool of the world's leading search engines -- instead of results from only one single search engine.

    And WebCrawler makes it easy to refine your search so you can find the most meaningful results right away. No wonder it's a leader in the search industry.


    Was it 2001? The History states:

    2001 InfoSpace acquires WebCrawler. Excite, now Excite@Home, went belly up. In the bankruptcy, Infospace acquired WebCrawler. Today Infospace runs WebCrawler as a meta-search engine. And they've given Spidey a new name and turned him purple!


    Oh, and if it is not being otherwise used, has the code for the WebCrawler spider been open-sourced? :)

  12. Boy, does this take me back... by Faust7 · · Score: 4, Interesting

    ...to the days when the search engine market resembled the microcomputer market of the '80s. Several competitors, all with (roughly) the same market share, each with a certain number of hits that the others didn't have. I had to use at least a few of them to assure myself that I was getting something reasonably close to what the whole Web could offer on my search topic (even though no search engine comes close to penetrating all of the pages out there).

    If I was looking for something, I'd query Lycos, AltaVistas, Infoseek, Excite, Webcrawler, and Magellan. And, later on, Google. Vastly different results, site designs, site objectives. I won't say it was the most streamlined, elegant experience, but it was kind of fun.

  13. Wow. Just. Wow. by TWX · · Score: 5, Interesting

    I remember using Webcrawler back when I got my first 14.4 Slirp connection back in 1994. It was the only way to search!

    and then came the marvels of altavista.digital.com.

    I'm so glad that google came along...

    --
    Do not look into laser with remaining eye.
  14. Wow - the 1996 wayback WebCrawler page STILL WORKS by Anonymous Coward · · Score: 5, Interesting

    http://web.archive.org/web/19961023234707/http://w ww.webcrawler.com/

    Presumably connects to the current crawler which still accepts the old format :)

    --
    Callas

  15. First query? by Xzzy · · Score: 4, Funny

    So who remembers the first search query they typed into Webcrawler?

    I was just crawling out of the gopher world, a short period where I was getting turned on to the web but there was no way to find links, almost everything came through the university homepage or word of mouth. Then someone pointed me to webcrawler.

    What did I search for first? "fart jokes". No kidding.

    "boobs" was second.

  16. Well isn't that ironic by Zygote-IC- · · Score: 5, Funny

    So, to read a story celebrating an anniversary about a search engine, we have to go through the cache of another search engine?

    Go figure.

  17. WebCrawler on NeXTStep - before Open Source by ben_kelley · · Score: 5, Interesting

    It is scary to think that at one point I e-mailed the WebCrawler people to ask them how it worked. In response they sent me a copy of the source (Objective C for NeXT) so I could compile it up on my NeXT PC (I had a "black" NeXT - 68000 based) to index my intranet web server.

    I doubt that someone like Google would send you a copy of their source these days - even if you asked nicely.

    I could never get it to compile, and I deleted it long ago, but I kind of wish I had kept it now. An interesting piece of internet history.

    1. Re:WebCrawler on NeXTStep - before Open Source by sacremon · · Score: 5, Interesting

      I sometimes sit back and think about some of the various projects that first saw life on the NeXT platform:

      the first web server
      Webcrawler
      Doom and DoomII

      Pretty good for a machine that only sold ~70,000 units total, not including the versions of NEXTSTEP for ix86/SPARC/PA-RISC.

      I still have a Color NeXTStation stashed away in a closet. I was using it as a print server till about two years ago.

      --
      If you can't beat them, embrace and extend them.
  18. The WebCrawler Search Voyeur by Faust7 · · Score: 5, Funny

    Anyone remember the WebCrawler Search Voyeur?

    It was a little Java applet that sat on your screen and displayed the pseudo-real-time search queries of other people.

    When I was a computer lab monitor at my college, we used to note in the log book any particularly amusing queries that we'd seen.

    "hairy woman"... "squirrel torture"... "tom AND cruise AND foot AND odor"... "asian girl underage spanking"...

    1. Re:The WebCrawler Search Voyeur by intangible · · Score: 4, Funny

      I remember that it wouldn't show every search of course, but you could verify it was working by searching for the same phrase over and over again. About 10 seconds later, you could see your search phrase. You could actually use it to communicate with other people, albeit a little slow, but it was amusing. I would type in silly things just so others watching the voyeur would see them.
      I bet you guys recorded some of my stuff :P

  19. public search engine by jacquesm · · Score: 4, Interesting

    I'd happily contribute cash to a publicly funded and publicly run search engine.

    Anyone game ?

  20. Hardly one of the first by btempleton · · Score: 4, Informative

    Internet searching way predates 1994. Archie by Peter Deutsch (the one from Montreal, not the American one) was one of the most popular applications on the internet in the 80s. The http search engines like Webcrawler and Lycos came much, much later on internet time scales.

    --
    Has it been over a year since you last donated to the Electronic Frontier Foundation
  21. WebCrawler Sale Sensation by Anonymous Coward · · Score: 4, Interesting

    WebCraweler's Brian Pinkerton formerly worked at NeXT, and I remember being in the the NeXT kitchen when news arrived in 1995 of his sale of WebCrawler to AOL. The sale price was around $1 million, and everyone was absolutely awed that a software company could sell for so much. This marked the beginning for me of the dot-com era: Just a few month later, other companies started or run by ex-NeXTers sold for millions, then tens of millions, and at least one for hundreds of million. Soon after that, NeXT CEO Jobs took Pixar through an IPO, for a personal gain of about $1 billion!

  22. nope by millette · · Score: 4, Informative
    You just need 8 desktop machines and you can index a 10th of what google does. From a recent article:
    Gigablast runs on eight desktop machines, each with four 160-GB IDE hard drives, two gigs of RAM, and one 2.6-GHz Intel processor. It can hold up to 320 million Web pages (on 5 TB), handle about 40 queries per second and spider about eight million pages per day. Currently it serves half a million queries per day to various clients, including some meta search engines and some pay-per-click engines.
    I also read it was going to expand it's index this year, but I wasn't able to find where I read that.