How to Build a Search Engine
CowboyRobot writes "Three years ago, former Infoseek developer Matt Wells decided to go solo and build his own search engine, Gigablast.
In this article, Infoseek founder Steve Kirsch interviews his former employee about the process and challenges of creating a modern, scalable search engine. From the article: 'Search is a fiercely competitive arena, even though there are really only five Web search companies today: Google, Yahoo (Altavista/AlltheWeb/Inktomi), Looksmart (Wisenut), AskJeeves (Teoma), and Gigablast. It's a tight little community, and a lot of the people know and watch each other. Microsoft is also coming to the party, and everyone's a little bit nervous to see what it's bringing.'"
"even though there are really only five Web search companies today: Google, Yahoo (Altavista/AlltheWeb/Inktomi), Looksmart (Wisenut), AskJeeves (Teoma), and Gigablast " Gigawho? You silly goose.
"and everyone's a little bit nervous to see what it's bringing.'"
Money. Lots and lots of money.
Mod point free since 2001
Microsoft at the party would probably look something like this
"Pass the dip, guys!"
Whoa, hold on. Wrong site. Never mind.
"Have you ever thought about just turning off the TV, sitting down with your kids, and hitting them?"
What about BOOBLE.
I found over one million hit for XXX and not even one hit as far as I could tell to do with the fucking vin desal pice of shit movie.
Fifth, Profit.
I'm glad you told everyone he's a good guy, for a minute there I just assumed he was an evil, scheming villain.
...
Google: "Searching 4,285,199,774 web pages" That's quite a big difference.
At least this Gigablast name is closer to the truth. They are only exaggerating their page count by a factor of 3.7 : 1.
By my math, Google comes up short by 2.3x10^90 : 1.
Gigablast sucks : Proof - I entered my name and Gigablast says "no results". Did u mean "something thats not my name". No thanx I did not
Google : My site is the first !!!
And of course I refuse to believr that anyone in the world would be interested in anything but my home page.
I dunno. I better google it.
'In other news, Google announced the buy-out of Gigablast. The newly-formed company will be called Giggle.'
'He who has to break a thing to find out what it is, has left the path of wisdom.' -- Gandalf to Saruman
When an American writes "there are only five companies that..." he really means: "there are only five companies IN THE USA that...".
I liked this quote: "Now that the Internet is very large, it makes for some well-developed memory. I would suppose that the amount of information stored on the Internet is around the level of the adult human brain. Now we just need some higher-order functionality to really take advantage of it. At one point we may even discover the protocol used in the brain and extend it with an interface to an Internet search engine."
The protocol used in the brain? That can't be a good direction to go. I mean, if it's anything like my memory and honestly, the memory of most people I know, it's definitely going to be a step backwards. Human brains can hold a lot of information, but retreival is definitely not its specialty. I can see it now. Type in my search terms and the engine comes back with, "ummm, it's right on the tip of my tongue. Okay, I don't have a tongue, but I just about remember it. Give me just a minute to think about it. umm... umm... Nope, it's gone. Nevermind."
I'd fire you in an instant with sloppy code like that.
You forgot to add "Order by nipple_size desc".
1. Buy license for existing web search engine.
2. ???
3. Profit!
+1 Insightful, -1 Troll. What can I say, I'm an Insightful Troll.