Google's Technology Explored
RobotWisdom writes "Internetnews offers a moderately detailed peek at Google's technology. For example, they use stripped-down Red Hat on a massively redundant network, and they're starting to have success with automatic clustering of concepts, so that pages can match even if none of the words in your query actually appear on the page." Additional analysis on InformationWeek and C|Net. From the article: "As a search query comes into the system, it hits a Web server, then is split into chunks of service. One set of index servers contains the index; one set of machines contains one full index. To actually answer a query, Google has to use one complete set of servers. Since that set is replicated as a fail-safe, it also increases throughput, because if one set is busy, a new query can be routed to the next set, which drives down search time per box."
That's now how google does it! This is their REAL secret:
http://www.google.com/technology/pigeonrank.html
If we could /. google, that would impress me
and they're starting to have success with automatic clustering of concepts, so that pages can match even if none of the words in your query actually appear on the page
So that's why I can search on the result page for my orginally query and find nothing. And all this time I was blaming Internet Explorer!
Coder's Stone: The programming language quick ref for iPad
The technology that is truly asstounding, is Google's ability to cache itself. Yeah, think about THAT one for a while.
Theoretically, he said, if someone searches for "Bay Area cooking class," the system should know that "Berkeley courses: vegetarian cooking" is a good match even though it contains none of the query words.
One word: cooking.
I'm sure the principle is sound. I just think the example is a leetle bit flawed.
What I say does not represent the views of my employers, my friends, my cats, or myself.
http://www.google.com/jobs/lunar_job.html
a snippet:
Hivemind harvest in progress..
Oh come now! You can always do a site:slashdot.org and search Google. All the knowledge about ANYTHING is right there at your fingertips. Sometimes in duplicate and triplicate!
What more could you need?
I always thougth that they used NT + Access Database.
Google's redundancy theory works on a meta level, as well, according to Hoelzle. One literal meltdown -- a fire at a datacenter in an undisclosed location -- brought out six fire trucks but didn't crash the system.
"You don't have just one data center," he said, "you have multiples."
The real idea behind Google Maps is so that as the server catches fire it use it's last cycles to send an eMail to the nearest fire cheif and include a map. I think it would also throw in a GMail invite for incentive.
.\.\att Clare
oh yee of little faith
-Parent AC
One literal meltdown -- a fire at a datacenter in an undisclosed location -- brought out six fire trucks but didn't crash the system.
/.ing could do this.
Gee.. I wish our
It is my belief that data center fires are caused by slashdot every day!
I submitted this story last night, and it didn't get posted.
Substitute "pron" for "knowledge" and the statement still stands.
They will, once the patches are out of beta.
For more information, click here.