Slashdot Mirror


What's It Like to be Google's Boss Techie?

We'd like to welcome Google Director of Technology Craig Silverstein as our next Slashdot interview victim... err... guest. You think you run a big Linux server farm? Craig's is bigger. Think your Web site gets a lot of traffic and creates a lot of headaches? Just think what Craig must face! Post whatever you'd like to ask Craig below, one question per post. About 24 hours after this runs we'll email Craig 10 of the highest-moderated questions, and we'll post his answers shortly after he gets them back to us.

657 comments

  1. I've wondered by lblack · · Score: 5, Interesting

    Google always seem to be early-to-market with some really highly developed software solutions, and also always seems to have the backbone to support them.

    I'm curious -- what drives the innovation? Is it the hardware team advancing architecture to permit the software team more room to play, or is it the software team saying, "Hey, look what we got!" and the hardware team dropping the iron to implement it?

    I understand there must be some level of synergy, but is it completely seamless or is one side of the equation effectively driving the other?

    Leem

    1. Re:I've wondered by kiatoa · · Score: 1

      I've wondered....

      How warm would the waste heat from your server farm keep a 20,000 gallon swimming pool in mid winter vermont? If the answer is > 80 deg F would you be interested in relocating your server farm to Burlington Vermont? I have a swiming pool that is still at 65 deg F. :(

      And yes, I am planning on the heat from my two servers but that will only be 500 or so watts.

      --
      90% of the wealth is in 2% of the pockets. Bummer to be in the majority.
    2. Re:I've wondered by xmundt · · Score: 1

      And speaking of wondering things...why is it that Google limits ALL searches to the top 1000 URLS in the database. Since the ranking depends, mostly, on the number of pages linked TO that URL, it cuts out finding those buried treasures out there, and, in many cases may make it impossible to get to certain web pages.
      OF course, if this was DOCUMENTED someplace it would be annoying, but, not an unpleasant surprise.

      --
      YAB - http://blog.beemandave.com/
    3. Re:I've wondered by Anonymous Coward · · Score: 0

      Do your duty in april? Dude, april is long gone.

  2. First Question! by Dannon · · Score: 1, Funny

    Can I have a job?

    --
    Good judgment comes from experience.
    Experience comes from bad judgment.
    1. Re:First Question! by littlerubberfeet · · Score: 4, Interesting

      This shouldn't be modded down. The question just needs to be asked in a clearer way.

      What job opportunities are there at Google, and what opportunities in the industry as a whole?

      --
      Sig (appended to the end of comments you post, 120 chars)
    2. Re:First Question! by Ashran · · Score: 1
      --

      Before you email me, remember: "There is no god!"
    3. Re:First Question! by Anonymous Coward · · Score: 0

      I think he was joking...

    4. Re:First Question! by Dannon · · Score: 1

      Actually, it was meant to be a joke. I've got a job. I just wish it were a job at Google!

      --
      Good judgment comes from experience.
      Experience comes from bad judgment.
    5. Re:First Question! by Anonymous Coward · · Score: 0

      says littlerubberfeet posting AC bc I am a Karma whore:

      ok, just when I saw the page, it said 0, redundant, so I thought it should be modded up as either funny or insightfull, just trying to help. Moderation now says redundant (2), funny (2)

    6. Re:First Question! by DrMaurer · · Score: 1

      Relatively off-topic, maybe.

      I've noticed that seemingly every resume I send on-line somewhere goes in the bit-bucket. I imagine huge circular servers with everything relocated to dev/null. This may be the case, but possibly not, maybe I really ain't qualified for anything. (Hate that feeling, I do.)

      Does Google use some of its own technology to sort through the deluge of resumes I'm sure it receives? Have you had or even thought of marketing this possible technology to companies?

      Did I just give away a really good idea?

      --
      Dan
    7. Re:First Question! by Anonymous Coward · · Score: 0

      Better question:

      Where do we send our resumes? :)

  3. Simple question by lwdupont · · Score: 4, Interesting

    What type of machines/setup does Google use?

    (I've heard thousands of PC's with everything in RAM, but I'd love to hear it from the horses mouth)

    1. Re:Simple question by ender81b · · Score: 3, Interesting

      Addendum:
      How much bandwith you guys use/have?

    2. Re:Simple question by Ralph+Wiggam · · Score: 2

      addendum addendum:
      How much does that cost a month?

    3. Re:Simple question by unicron · · Score: 1

      Indeed. I would have to guess multiple SONET connections, maybe 2 or 3 OC-3's. Dedicated.

      --
      Finally, math books without any of that base 6 crap in them.
    4. Re:Simple question by Pedrito · · Score: 2

      How about I give you the answer now. Google has around 10,000 rack-mount servers. They're 1U rackables with 2 independent servers inside, w/256MB of RAM and 80GB hard drives. The systems are made by Rackable Systems and King Star Computer.

    5. Re:Simple question by ender81b · · Score: 1

      indeed. A friend of mine works for Verio and a oc-3 (155.5mbps) runs something like 55,000$/month. Just for the line mind you. I can only imagine what google has.

    6. Re:Simple question by Electrum · · Score: 2

      It gets a lot cheaper when you buy more. We had four OC3's (OC12 equivalent) from AT&T and were paying $175,000 a month (though we also had a CTO with awesome bargaining skills).

    7. Re:Simple question by Anonymous Coward · · Score: 0

      "Simple" is an understatement. How about "Fuckin Retard Question"

    8. Re:Simple question by 4of12 · · Score: 4, Interesting

      How much bandwith you guys use/have?

      Addaddendum:

      How much of that BW is actually used and how does it vary during with the time of day and day of week?

      What's the ratio of traffic to and from user queries compared with the traffic searching the web and do you expect to scale indefinitely as growth continues on the current path?

      --
      "Provided by the management for your protection."
    9. Re:Simple question by UranusReallyHertz · · Score: 1

      Um, 256MB of RAM seems AWFUL low. I read something about how RAM actually turns out to be cheaper for Google than hard drives in the final analysis due to it being about a million times faster, and that Google was using special machines that had a 36 bit segmented adressing scheme to get past the 4 gig limit on RAM. 256 MB per machine comes to about 5 terabytes total, but Google's index is actually in the petabyte region due to redundency, so it does seem a bit small.

      --
      Smoking is an expensive, slow, and unreliable method of suicide.
    10. Re:Simple question by Alan · · Score: 2

      Addendum^3
      Are you guys making money? The cost of the huge amount of infastructure in bandwidth and hardware must be huge, and while I've heard of a few different paths of revenue income (custom installs/setups, and the text ads) is it profitable for you yet?

    11. Re:Simple question by addaon · · Score: 1

      special machines that had a 36 bit segmented adressing scheme to get past the 4 gig limit on RAM

      Ah yes, I've heard of such special machines... where I'm from, we call them 'intel's.

      --

      I've had this sig for three days.
    12. Re:Simple question by lwdupont · · Score: 1

      How about we wait for the answer from the dude who knows?

    13. Re:Simple question by snaphu · · Score: 1

      If that's a simple question (as said in subj line) I really don't know what I could ask

    14. Re:Simple question by Sparkle · · Score: 1

      Indeed we have heard thousands of PC's. No doubt a whole department to fix the ones that break each day. Why not thousands of linux instances in an IBM mainframe? Disclaimer: I do NOT work for IBM!

    15. Re:Simple question by Anonymous Coward · · Score: 0

      I would assume they have many huge "cages" at some very large CoLocation facility with banks of multiple servers, no doubt on some very fast bandwidth connections, like multiple T3's, or even faster connections.

      Something like this could cost quite a lot. I visited Exidus in Santa Clara (But didn't they go "belly up"?), and learned Exidus uses 80% of the power grid available to Santa Clara.

      If I'm not mistaken, I recall someone mentioning some of their servers are located there, but perhaps that has changed.

      Any of you Google "horses" out there want to comment on this?

  4. Weird queries? by Anonymous Coward · · Score: 1, Interesting

    What are the weirdest queries?

    1. Re:Weird queries? by Openadvocate · · Score: 1

      Must be: +transexualnazieskimo +faq

      --
      my sig
    2. Re:Weird queries? by TwP · · Score: 1

      Now that is funny! :) :) :) :) :)

      Five smiley face rating for you.

    3. Re:Weird queries? by monju_bosatsu · · Score: 1

      See here.

  5. Statistics by suwain_2 · · Score: 3, Interesting

    A relatively simple, non-intellectual question, but I've always wondered -- just how many hits/how much bandwidth do you consume, and how many servers do you have to handle the load.

    --
    ________________________________________________
    suwain_2 :: quality slashdot p
    1. Re:Statistics by Pedrito · · Score: 5, Informative

      Check out this to get most of your answers. Shouldn't we be asking him stuff that isn't sitting on their website?

    2. Re:Statistics by Anonymous Coward · · Score: 0

      Shouldn't you be asking him questions, period, instead of "correcting" other users like you're their cocksucking nanny?

    3. Re:Statistics by Anonymous Coward · · Score: 0

      His questions are not answered there, moron...

    4. Re:Statistics by PerryMason · · Score: 1

      I notice that they mention that they run the "world's largest commercial Linux cluster". My question is who the f*ck runs a larger non-commercial one?!?!

      --
      "I'm tired of all this 'Aren't humanity great' bullshit. We're a virus with shoes" - Bill Hicks
  6. Favoring Big Guys by PenguinRadio · · Score: 5, Interesting

    Does google's policy of "ranking" the sites that have hits favor the "big guys" over more specific smaller traffic websites? That is, would a story on a site like CNN get a higher ranking in google on a keyword "Gulf War" than say a site (gulfwarveterans.com) that deals 100% with the Gulf War? Do you think you are leading to the commercialization of the web (i.e. the big power players) over smaller sites?

    1. Re:Favoring Big Guys by shilly · · Score: 0

      Why not check your example. FWIW, neither site you mention made the top ten. However, gulfwarvets came in at 3 and the sites listed don't seem particularly dominated by big-cheese commercial players.

    2. Re:Favoring Big Guys by jeaton · · Score: 1

      That is, would a story on a site like CNN get a higher ranking in google on a keyword "Gulf War" than say a site (gulfwarveterans.com) that deals 100% with the Gulf War?


      In this specific example, it's pretty obvious that it doesn't. Searching for Gulf War returns a lot of pages dealing entirely with the war (gulfweb.org, gulfwarvets.com) and not a single hit for CNN in the first few pages of links.
    3. Re:Favoring Big Guys by Jobe_br · · Score: 1

      I think this can be attributed to the fact that transient sites v. permanent sites will be less-favored by Google's ranking. Does that make sense?

    4. Re:Favoring Big Guys by killmenow · · Score: 5, Insightful

      Furthermore, estimates are that search engines miss a large portion of Internet content available. There must be literally millions of web pages that don't even show up in your cache because they are too small, or because nobody links to them. But there may be a site out there that has all the information you could ever want to know about some esoteric topic that only the person who created the site and the few friends that person may have...but since nobody else links to it, nobody else knows about it.

      So how do you find those treasure troves? And how do you decide which ones are treasure troves and which ones are the millions of "all about me" web pages? Or do you care?

    5. Re:Favoring Big Guys by _Wrath_ · · Score: 1

      it is a typical paradox. You can't get linked unless you are linked. That is, you can't get in the google search to be found. Which came first? The link or the page? I would guess the link.

    6. Re:Favoring Big Guys by Hack+Shoeboy · · Score: 0

      No, this can be attributed to someone asking a question about what results the world's easiest search engine would give for a specific query, instead of just performing the query themself.

      --

      IN TEH FUCHAR, LITERSY WLIL EB OPSHANAL!!!!!111
    7. Re:Favoring Big Guys by Anonymous Coward · · Score: 0

      not to mention that there's no such website as gulfwarveterans.com...

    8. Re:Favoring Big Guys by jesser · · Score: 2

      So how do you find those treasure troves?

      1. Record URLs visited by Google Toolbar users. Google's Privacy Policy for their toolbar seems to allow this.

      2. Get Earthlink to record all URLs visited by Earthlink users and put that database in a format that Googlebot understands.

      3. Get Dreamhost to let Google index directories even where index.html exists. Let Dreamhost call the feature "[x] Submit all pages on www.squarefree.com to Google once a month" or "[x] Allow Google to find unlinked pages". Make it optional so that rare pages protected by "javascript passwords" can remain unindexed, but enable it by defalt.

      4. Give Google users a place to upload their browser history files and say the information will only be used to find unindexed pages.

      5. Don't worry about it, since unlinked pages will rank poorly anyway, and are likely to be old versions of pages that were renamed but not deleted.

      --
      The shareholder is always right.
    9. Re:Favoring Big Guys by bigsexyjoe · · Score: 1

      If nobody links to them or anything then it might be a page page full of lies. e.g. Evolution would return a page that says that hitler created the world in six days. Some unlinked page might be created by a lunatic who says hitler really created the world in seven days.

    10. Re:Favoring Big Guys by Anonymous Coward · · Score: 0

      Or... strangely enough, it might not be lies. It's irrelevant. A good search engine should STRIVE to include obscure information. After all, if it's not obscure, then it's already commonplace and easier to find, by definition.

    11. Re:Favoring Big Guys by shilly · · Score: 1

      You've gotta ask what the moderators are on, here. This guy is at 5 for posting a question that he can answer for himself. I point this out and get modded down...

    12. Re:Favoring Big Guys by Rock · · Score: 1

      I don't think Google would automatically find the isolated islands of information. That is why they allow us to submit URL's to be considered for indexing.

      It is quite unlikely that if I had not submitted the insanely detailed "open" analysis of my whole life insurance policy, it would have been exactly the kind of esoteric island of which you speak. But some of us take advantage of all the free "publicity" we can get. Hmmm, perhaps even here...

      The Visible Policy ;)

      --
      - - -
      "The sixth sick shiek's sixth sheep's sick."
  7. How much by Anonymous Coward · · Score: 0

    is your monthly bandwidth bill for google.com ?

  8. Windependence Day by scubacuda · · Score: 3
    What are YOU doing for Windependence Day?

    1. Re:Windependence Day by jesser · · Score: 2

      They could have a penguin among the animals waving the American flag around the Google logo. I don't expect Google to do anything, though. It's too bad the Windependence Day people chose July 4, since they get mentally grouped with all the corporations abusing the American flag in ads and product labels since September.

      --
      The shareholder is always right.
  9. I'm not sure when the change took place by SquadBoy · · Score: 5, Interesting

    but I noticed a few months ago that Cisco now uses the Google engine to search the CCO. Congrats on that one. I've also noticed this new search box that Google is starting to produce. And it looks *very* cool. So my question is basically which is more important to your job the website or selling the service and the engine to people who need it?

    --

    Cypherpunks: Civil Liberty Through Complex Mathematics. Those who live by the sword die by the arrow.
    1. Re:I'm not sure when the change took place by Jobe_br · · Score: 1

      I'd imagine the website since this is the driving force behind selling the services/products. If the website suffers, so does Google's image and that makes it infinitely more difficult to sell search appliances/services.

      Don't ya think?

    2. Re:I'm not sure when the change took place by Anonymous Coward · · Score: 0

      Starting about 1.5 - 2 years ago, there started to be cisco.google.com which would search the CCO.

      Go figure how much news that is now...

    3. Re:I'm not sure when the change took place by ceejayoz · · Score: 2

      Exactly. If Google's search engine sucks, no one will buy their intranet search appliance. If it's the best search engine on the web, the search appliance is probably similar high quality.

  10. Pigeon Computing by Black+Aardvark+House · · Score: 4, Funny

    Has there been any progress on the Pigeon Computing initiative?

    --

    I am the evil aardvark!

    1. Re:Pigeon Computing by Anonymous Coward · · Score: 0

      +n funny

    2. Re:Pigeon Computing by theblacksun · · Score: 1

      Exactly what do you feed those pigeons?

      --
      Ignorance kills, complacency kills, hatred kills, but usually not the ones guilty of them.
  11. technetcast.ddj.com/ by rblackwe · · Score: 3, Informative

    A little old but interesting.

    The Technology Behind Google 2000-10-19 (1hr 13min) By Jim Reese, Chief Operations Engineer, Google. How to build an internet search engine that indexes 1-2 terabytes of data 200 million web pages- and serves it up at a rate of 1000 requests/second. (Hint: Start with a farm of 10,000+ Linux servers). The technology behind Google: company overview, search parameters and results, hardware and query load balancing, Linux cluster topology, scalability, fault tolerance, and more. [420]

    http://technetcast.ddj.com/tnc_search.html?key=g oo gle.

    1. Re:technetcast.ddj.com/ by jomagam · · Score: 1

      Very very interesting. Do you know if the slides are available too ?

    2. Re:technetcast.ddj.com/ by rblackwe · · Score: 1

      I have not seen them anywhere. But I did think that was a good listen. Technetcast is great. Even with out the slides:(

  12. Why did you chose Linux? by RinkSpringer · · Score: 5, Interesting

    I am wondering why they chose Linux. Specifically, I wonder how they made the choice between all major OS-es (Linux, *BSD, Solaris and possibly Windows), as well as the software they use to power the site.

    1. Re:Why did you chose Linux? by Anonymous Coward · · Score: 1, Insightful

      you're dumb.

      10,000 Solaris, HPUX or 2k licenses? No-friggin-thank-you-very-much.

    2. Re:Why did you chose Linux? by pivo · · Score: 1

      I think they already answerd that, maybe on a FAQ at there website. Anyway I think the answer was biggest bang for the buck, which only makes sense. What I still wonder about though is why not *BSD? Perhaps it's something to do with *BSD's multi-processor support at the time.

    3. Re:Why did you chose Linux? by Anonymous Coward · · Score: 1, Interesting

      You don't need 10,000 solaris licenses, just buy bigger computers.. Sun sells quite a few of those. I would be interested he hear why they chose to implement their system with that many small machines, why not just use a few thousand big computers? You still have huge amounts of redundancy, but less computers to manage and better hardware.

    4. Re:Why did you chose Linux? by skotte · · Score: 3, Informative

      i was wondering this myself. more accurately, i was wondering if they now wish they had gone with something else.

      fFollow me a moment. it is certain to be cheaper to setup a thousand computers of linux, rather than, lets say, a dozen computers of windows. licensing is just out of this world. but what about now? 100,000 machines later, several years down the road, heavily entrenched in linux, do you, Craig Silverstein, wish you had a different platform? would it, at this point, be more convenient to have something else, if you could change it all in an instant?

    5. Re:Why did you chose Linux? by packeteer · · Score: 1

      these 10,000 machines ARE big computers... this is not a cluster of winpy p2-333 boxes with 64 megs of ram... these are big machines... and they WOULD need to buy thousands of licences...

      --
      unzip; strip; touch; finger; mount; fsck; more; yes; unmount; sleep
  13. Regression by Have+Blue · · Score: 5, Interesting

    The Internet is always described as a distributed system with no single point of failure. Google, however, has quickly become by far the most popular method of locating information. "Surfing" has been killed with modern search technology, it's so much easier to look through Google than the Web itself. If Google was down, I'm sure the Internet would be far less useful.

    Do you think Google has become an Internet point of failure? With the competition for larger and larger indexes, is the Internet becoming centralized? Do you think this is a bad thing?

    1. Re:Regression by Anonymous Coward · · Score: 0

      He's a techie, this is not the right person to ask ;-)

      Good question though. Although I never really 'surfed' much back before google, the only thing is that its much much easier to find using google than any of the others used to be.

    2. Re:Regression by Anonymous Coward · · Score: 0

      In my opinion that is very possible. I remember mentioning something about how people have become increasingly dependant on google to look for information on the WWW, because (at that point) there was no other alternative.

      HOWEVER, alltheweb.com seems to be on par with google as far as results go, and I'm sure that we will be seeing more and more people creating similar services. Basically, even if google goes down, there will _still_ be a demand for fast, FREE, accurate searches, and someone will come along to provide for that demand.

    3. Re:Regression by hagardtroll · · Score: 1

      Alta Vista, Yahoo, etc. These aren't as good as google, but they were the best we had before Google. We lived with it then, we can live with it again.

      Plus any void left by Google, should be filled quickly with someone else.

    4. Re:Regression by phatStrat · · Score: 1

      A bunch of people have already expressed that they like this question, but I'm different in that I really like this question.

      I think that if Google shutdown (either by massive pigeon die-offs from contracting a buffer overflow exploit in the Avian Carrier Transmission Protocol or copyright violations of some sort) I would be utterly lost and helpless. It's probably my own fault for not exploring alternatives, but in all fairness, how many people have actually had the need to look into alternatives? I consider it my personal index/table of contents to the Internet.

      Surfing is fine as a recreational use of the Internet, but to use it as a timely resource of information a centralized index is essential.

      Of course, having more than one index lessens the "single-point of failure" risk but begs the question - "Who will become the indexer to index the indicies that are indexing the Internet?"

    5. Re:Regression by Pinball+Wizard · · Score: 2
      Google can only stay at the top if if continually improves its technology. There are others nipping at Google's heels, and sooner or later they will provide similar results as Google now provides.


      But what if the search engine business really is unprofitable? Would you pay to keep something like Google around? If so, how much, and how would you pay?

      --

      No, Thursday's out. How about never - is never good for you?

    6. Re:Regression by skt · · Score: 2

      > I think that if Google shutdown... I would be utterly lost and helpless

      uh, there is always yahoo :)

    7. Re:Regression by ceejayoz · · Score: 2

      Yeah, for which you now have to pay to get included, provides no caches, and is so bloated and commercialized it's hard to tell what's a search result and what's an ad.

      I think I'll keep Google.

  14. Search engine spammers by I+Want+GNU! · · Score: 5, Interesting

    What are you doing to prevent the new generation of more sophisticated search engine spammers- spammers that use advanced software such as WebPosition Pro, spammers that feed fake pages to the Google crawler, spammers that make bogus link pages to their own sites? Doesn't this new level of sophistication on their part mean that in large part Google must emphasize human website reviewers, such as those provided by the Open Directory Project, to a greater degree?

    1. Re:Search engine spammers by Jobe_br · · Score: 1

      You're aware that sites submitted to dmoz are subsequently fed to Google, right? Just checking, not trolling or anything ... some people don't know just how many services pool info from dmoz (google isn't the only one).

    2. Re:Search engine spammers by I+Want+GNU! · · Score: 2

      Yes, I am very familiar with the ODP and the fact that Google's directory is made up of ODP sites and that it weights these heavily (possibly partly due to the fact that the pagerank will be increased since ODP links are all over the web on sites that use its data). I just was wondering if Google's bond with projects such as the ODP would be strengthened and enhanced due to these recent issues that have made it impossible to do everything automatically. If it runs purely on robots and fixed rules, its rules will be tested and broken by people submitting automatic queries and using other sophisticated methods.

    3. Re:Search engine spammers by SplatFileGoo · · Score: 1

      Before you go asking such questions, you might one to take some time to see what Google does to rank pages. Google is 100% immune to each and everyone of the tactics you mentioned. Besides, "sophisticated"? lol - webpos gold had been around since 98 and banned by most se's since 99.

    4. Re:Search engine spammers by Jobe_br · · Score: 1

      OK, I understand your question better now ... :)

  15. pictures or diagrams by z_gringo · · Score: 1

    How much bandwidth is running into his network? How many locations does he have, Are there pictures or diagrams of the network infrastructure side of his setup, and apart from Programmers how many network design type people work there?

    --
    -- -- Warning. Do not stare directly at the sun.
  16. Stumped by Bios_Hakr · · Score: 5, Interesting

    As a new network configuration guy, I am often stumped by a problem. I usually turn to google first, and my supervisor second. What has been the biggest problem that you have dealt with that will stand out in your mind years from now? As the "Head Techie", where did you turn, and what was the eventual resolution?

    --
    I'd rather you do it wrong, than for me to have to do it at all.
    1. Re:Stumped by ralmeida · · Score: 5, Funny

      I just pictured Google engineers turning to Google to fix some kind of problem that they might have... if we could kill the middleman, Google would achieve consciousness!

      --
      This space left intentionally blank.
  17. I got one by bravehamster · · Score: 2, Funny
    Who gets to clean up after the pigeons?

    --
    ---- El diablo esta en mis pantalones! Mire, mire!
    1. Re:I got one by freq · · Score: 1

      it would probably be a good idea to include a link next time :)

      --
      "Tension is the great integrity" -- R. Buckminster Fuller
  18. Internal Platform by Anonymous Coward · · Score: 0

    Has the Linux platform made inroads into the staff arena at Google? What platform do you develope on?

    1. Re:Internal Platform by Anonymous Coward · · Score: 0

      You went to the Dan Quail School of Spelling didn't you?

  19. Scientology by ender81b · · Score: 4, Interesting

    Does google plan on releasing more products like the Google Search Appliance in the near future - specifically those that are geared more towards the consumer level rather than business market? I would, personally, love to have some sort of google search engine on my machine to rummage through all the stuff I have. Does google plan on expanding into this market or will you remain focused on the web?

    I know, I know, Only one question but - it begs to be asked - how well is your technology going to be able to scale? Considering the near-expotential growth of the internet will PageRank be able to keep up?

    1. Re:Scientology by cybercuzco · · Score: 2

      Well first of all, why is the title of this questions scientology? Secondly, find in windows really sucks, I know, it takes like 15 minutes to just do a name search in a 5 gig hd. Try out sherlock on a mac sometime, you can search massive hds in seconds, it really kicks ass.

      --

    2. Re:Scientology by ender81b · · Score: 2

      I was originally going to ask a scientology question, but changed my mind and forgot to change the topic. =)

    3. Re:Scientology by Anonymous Coward · · Score: 0

      Try out sherlock on a mac sometime, you can search massive hds in seconds, it really kicks ass.
      Although you do have to rebuild the index in order to get the most up-to-date search results. If you do, the speed difference is negated. For some purposes, though, Sherlock is wonderful.

  20. Storage used by Steffan · · Score: 5, Interesting

    I understand that Google was using large numbers of IDE drives in lieu of more expensive but individually faster SCSI devices. What prompted the decision, and how have the concerns of reliability and performance been mitigated. What special technology, if any, was used to implement such a system

    1. Re:Storage used by Anonymous Coward · · Score: 0

      It's been my experience that modern IDE hard drives (particularly Maxtors and Western Digitals) are more reliable than many SCSI disks. I suspect they've mitigated disk failure at a much higher level -- I'd imagine their systems are designed so that if any single computer failed, nothing would go down. Each PC could (and should, in a clustered environment) be expendible.

  21. I'm curious... by rgoer · · Score: 5, Interesting

    ...as to what exactly Google does with the concepts it receives through the various Google-tech contests held. Have these ideas been made good use of? Do we see any of this in the Google we use every day? What about the ones that didn't win, do we see any of them?

  22. Question 1 of 2: Language of choice? by FortKnox · · Score: 4, Interesting

    Whats the google language of choice for web page building. I'd assume speed is the most important, so what language makes google so fast?

    --
    Good quote, too many chars. Seriously, the slashdot 120 char limit sucks!
    1. Re:Question 1 of 2: Language of choice? by gmarceau · · Score: 1

      Easy, check out their job openning. C++ is first, then python/perl, then Java comes up along with SQL - the usual stuff what.

      Along the years, C++ has become more proheminent on that page. Sugesting that, as they grew, they translated code from the nicer but slower languages to the harder but faster one.

      --
      This post was compiled with `% gec -O`. email me if you need the sources
    2. Re:Question 1 of 2: Language of choice? by Jobe_br · · Score: 1

      Hand optimized assembly .. mmm.

    3. Re:Question 1 of 2: Language of choice? by Anonymous Coward · · Score: 0

      It is ironic that most of the Google executives have come from Sun Micro's Java effort - yet they use C++ for Google. What does that tell everyone? C++ scales far better for a system of that enormous size and has a much smaller memory footprint.

    4. Re:Question 1 of 2: Language of choice? by WhiteDragon · · Score: 1

      I bet at least some is in Python
      Check out google's job application page

      --
      Did you mount a military-grade, variable-focus MASER on an unlicensed artificial intelligence?
  23. Creative Ideas by Domasi · · Score: 5, Interesting

    Is there anything new that Google is working on that is not currently displayed in your labs section? If so could you explain it to us?


    --
    If you could sum it up in a nutshell, maybe you should be writing O'Reily books. --- Domasi 2001
    1. Re:Creative Ideas by beebware · · Score: 1

      They probably could if they were allowed to, but they are a commerical organisation with rivals - so they probably keep things 'hush hush' until they are ready to announce them in one way or another...

  24. Success by discstickers · · Score: 2, Interesting

    Google's success has been well documented. Quick, relevant results are it's trademarks. Do you see any backlash against Google and what are you doing about people that use Google's success for their own purposes (ie Google bombing)?

    --
    I have a shitty sig!
    1. Re:Success by hkmwbz · · Score: 1

      And does "Google bombing" really work for normal searches and not just obscure search phrases no one else would use anyway?

      --
      Clever signature text goes here.
  25. On call? by rice_burners_suck · · Score: 2, Interesting

    Hi Craig. Google is my favorite search engine, mostly because it's so simple, fast, and has a very professional feel.

    I wonder, when you're in charge of something as huge as Google, are you on call 24/7 in case something goes wrong? Have you ever been called during, say, a nice dinner, or worse, in the middle of the night? Thanks.

    1. Re:On call? by SquadBoy · · Score: 1

      90% of all IT ,to include MIS and networks, people in the world are on call 24/7 and yes we all have been called at bad hours. Would you like to hear horror stories?

      --

      Cypherpunks: Civil Liberty Through Complex Mathematics. Those who live by the sword die by the arrow.
    2. Re:On call? by Anonymous Coward · · Score: 0

      I can't speak for the original poster but I think he might mean something like this http://www.google.com/search?hl=en&ie=UTF8&oe=UTF8 &q=rice+boy

      Pleae for the love of gawd don't do any of those things. :)

    3. Re:On call? by fruey · · Score: 1

      I have been called out

      • During sex
      • During dinner (many times)
      • After going to bed
      • Whilst very stoned

      I don't even get overtime for callouts, and rarely thankyous... some people just don't appreciate that you're stopping your PERSONAL life for them.

      --
      Conversion Rate Optimisation French / English consultant
    4. Re:On call? by rice_burners_suck · · Score: 2

      Alright, dude... So you got a Civic. That, in and of itself, is not a crime.

      But for the love of God, man, please don't lower it, put an exhaust tip the size of a coffee can on it, or a spoiler that makes it look like a shopping cart, or a bunch of Japanese lettering on the front fender and part of the hood, or racing stripes, or aircowls all over the body that don't actually lead anywhere, or a bunch of ground effects, or tail lights that look like head lights mounted on the wrong end of the car, or a triple subwoofer with a 3000 watt amp so we can hear your car vibrating itself apart all the way down the street, or any of the other 10,000 stupid things that rice boys spend $30,000 on to make their cars look fast and sound really stupid and annoying.

      We've already got too many of those ugly pieces of trash on the roads. If you wanna go fast, spend $2000 on an old car, like a Nova, Chevelle, Camaro (Chevy's), GTO (Pontiac), GS (Buick), Mustang (Ford) or Barracuda (Mopar) from the late 60's to early 70's, and spend $5,000 to make it haul ass. Girls actually wave at guys driving cars like this. (I swear to God!)

    5. Re:On call? by msim · · Score: 1

      Haha, that's very true, i can say yes to all of those too.

      But heres the BIG question. Have you tried resolving a fault using remote access,
      and be on the phone to the tape jockeys, and have your girlfriend silently give you a
      lapdance at the same time? I HAVE

      --

      Life is like a box of chocolates, you never know when your gonna get food poisoning.
    6. Re:On call? by fruey · · Score: 1

      Replace girlfriend with wife and I could say that I've more or less done that, although it's usually dependent on what time it is, and how long I end up staying on the phone. Sex is no fun if you have to put someone on hold to orgasm noisily...

      --
      Conversion Rate Optimisation French / English consultant
  26. peer pressure by seanw · · Score: 5, Interesting

    as Google got more popular and eventually reached the status it holds today, did you feel any pressure (either internally or from outside the organization) to switch from a Linux based cluster to a proprietary solution (Windows comes to mind, but there are others). Where you (or others at Google) affected by any of the FUD that is put out, and did it affect your perception of Linux as a viable solution?

  27. MOD PARENT UP! by Anonymous Coward · · Score: 0

    I was thinking this exact same thing. I'd love to hear more on this. :)

  28. so... by RogueProtoKol · · Score: 3, Interesting

    ... what linux distrubution does the world's largest server farm use?

    1. Re:so... by Anonymous Coward · · Score: 0

      RedHat. (It was mentioned in another thread somewhere.)

    2. Re:so... by Anonymous Coward · · Score: 0

      red hat moron..read the above posts and you will see what it is that they use..RTFM

  29. Academic ties by dallen · · Score: 4, Interesting

    It seems that Google's great successs is partly due to research coming out of the academic world. How many google employees have advanced degrees, and can they publish non-proprietry research after they join Google? How do you see the interplay between high-tech and Academia?

    1. Re:Academic ties by Beltza · · Score: 1

      According the the information on their site, 50+ of the 260 employees are Ph.Ds.
      Furthermore, in the bios of some of the people involved (including Craig Silverstein) appears that they are currently on leave from the university.

    2. Re:Academic ties by ender81b · · Score: 3, Interesting

      from g00gles site:
      Approximate number of employees: 400
      Ph.D.s on staff: 50+
      Languages spoken: 34
      Number of roller hockey players: 32
      Number of offices worldwide: 12
      Massage Therapists: 2
      Neurosurgeons: 1

    3. Re:Academic ties by Steve+Franklin · · Score: 1

      I shudder to think what the neurosurgeon is for. Shades of Jose Delgado?

      While we're on that topic [;o)], I'd been interested to know if there has been any discussion of methods of incorporating search functions into visor type devices and other hands-free applications. This would follow on to the mouse-free tests, I presume.

      --
      Hic iacet Arthurus, rex quondam rexque futurus.
  30. Traffic Handling by Anonymous Coward · · Score: 0

    Is Google traffic growing enough to require frequent upgrades--or is traffic leveling out?

  31. DB Backend by axehind · · Score: 2, Interesting

    What type of Database backend do you use and what led google to choose it?

    1. Re:DB Backend by gregfortune · · Score: 2

      And how in the world does it work? That's gotta be a huge amount of data to store. Is the database distributed in some way? Are you doing something like a decentralized distributed hash table for data lookup or using a traditional relational database? And, how much data is stored as of today and how fast is it growing?

  32. Question 2 of 2: Browser Stats by FortKnox · · Score: 3, Interesting

    Since sites like slashdot don't like to give out their statistics, I'd like to ask, what percent of users use what web browser? Also, what percent of users use what OS?

    --
    Good quote, too many chars. Seriously, the slashdot 120 char limit sucks!
    1. Re:Question 2 of 2: Browser Stats by glh · · Score: 3, Informative

      Josh, you can check the Zeitgeist to get the info on browser stats for a year span, same goes for OS-

      http://www.google.com/press/zeitgeist.html

    2. Re:Question 2 of 2: Browser Stats by Anonymous Coward · · Score: 0

      http://www.google.com/press/zeitgeist.html

      this gives all kinds of great google info

      -matto-

    3. Re:Question 2 of 2: Browser Stats by Anonymous Coward · · Score: 0

      http://www.google.com/press/zeitgeist.html

    4. Re:Question 2 of 2: Browser Stats by kliklik · · Score: 1

      Take a look at Google Zeitgeist.

      --
      guru in training
    5. Re:Question 2 of 2: Browser Stats by an_mo · · Score: 1

      Have you noticed the consistent increase shown in the "other" browsers line after jan 2002? Is that mozilla?

    6. Re:Question 2 of 2: Browser Stats by Coldfusion97 · · Score: 1

      Those two questions are answered on Google's Zeitgeist page: http://www.google.com/press/zeitgeist.html

      --
      Are you saying coconuts migrate?
    7. Re:Question 2 of 2: Browser Stats by flicken · · Score: 1

      The answers to both of those questions are at Google 's Zeitgeist. Check out the sections on "Web Browsers Used To Access Google" and "Operating Systems Used to Access Google".

      --
      20 mil and I will! Learn Esperanto with 20M others.
    8. Re:Question 2 of 2: Browser Stats by Anonymous Coward · · Score: 0

      Hmmm. Guess that's what happens when one browses with a +3 filter and the post is fresh. :-)

    9. Re:Question 2 of 2: Browser Stats by Anonymous Coward · · Score: 0

      Why don't sites like /. publish those numbers? Wouldn't be interesting to see all the hypocrites using Windblows to browse? Sure, they may do their work on Linux, but they play with Windblows.

  33. Linguistics and Searching by mshomphe · · Score: 5, Interesting

    Does Google use any natural language processing (when dealing with web pages, queries, etc.)? Are you planning on doing more with NLP in the future?

    --
    She sat at the window watching the evening invade the avenue.
    1. Re:Linguistics and Searching by Pythonner · · Score: 1
      Interesting Question. I'm curious about Google vision of "linguistic" or "word game" technologies like: Are these kind of technologies out of Google field of interest, are they under development or already in indexing techs but invisible for the user? Thanks!
    2. Re:Linguistics and Searching by Noofus · · Score: 1

      Personally, I love the spell checker that suggests a correction to your search.

      However (IMO) any NLP done on the search should be computed and offered as a suggestion.

      IE:
      "Where can I buy tropical fish?" *click*

      "Search using these terms? 'purchase tropical fish'"

    3. Re:Linguistics and Searching by operagost · · Score: 1

      That's something like Ask Jeeves.

      --

      Gamingmuseum.com: Give your 3D accelerator a rest.
    4. Re:Linguistics and Searching by data_the_android · · Score: 1

      Google already does translation.

  34. Dream system by binaryDigit · · Score: 2, Interesting

    Hypothetical, and hopefully fun, question.

    Now assuming you had an infinite IT budget, generally which configuration of hardware/os (e.g. PC/Linux, PC/Win (boo hiss), IBM/AIX, VAX/VMS, Cray/Unicos, etc) would you adopt and why. More specifically, if pure performance were the only consideration, which would it be. Alternatively, if uptime were the primary consideration, which would it be.

    Be honest and don't worry about the bias's of your audience.

  35. CO$ and Deep Linking by Xaoswolf · · Score: 5, Interesting
    There have recently been several cases where people have sued because of the act of deep linking, or in the case of the Church of Scientology and Xenu.net, linking to information on other people's pages that someone claims a copyright to.

    How have these affected you and your job, and what are you feelings on this subject?

    1. Re:CO$ and Deep Linking by Charlie+Bill · · Score: 1

      I was thinking that the clambake fiasco would make an interesting counterpunch question, but he is the CTO, not the CEO or the Chief Legal Boob or whoever makes that call. I'm sure he's gonna have feeelings on it, but I don't know that its going to be much more than your average user...

    2. Re:CO$ and Deep Linking by Xaoswolf · · Score: 2

      But, as he is incharge of the technology, he would also be in charge of the people that set up the links for google. His job would aslo be affected by the CO$'s actions.

  36. What kind of bandwidth/pipes/networking setup? by Anonymous Coward · · Score: 2, Interesting

    What kind of bandwidth/pipes/networking setup do you use -- and how does your "macro" capacity diffuse down to each clustered server?

    Basically, what's your setup and how does it work?

  37. Logo work? by Xafloc · · Score: 3, Interesting

    I have but one question... Who is the mastermind behind all the "special" logo changes that Google experiences throughout the year?

    My hats off to that team!

    --
    -= Xafloc =-
    alinuxbox.com
    N
    1. Re:Logo work? by littlerubberfeet · · Score: 1

      uhm, Scott Adams, dilbert creator, did a couple I think

      --
      Sig (appended to the end of comments you post, 120 chars)
    2. Re:Logo work? by SPautz · · Score: 1

      Dennis Hwang does them, I think just for fun.
      There's more info out there for those who want to search.

    3. Re:Logo work? by froseph · · Score: 0

      Google Doodles has all the previous holiday logos, a link to fan logos as well as a link to an interview of the guy who does all the official logos

    4. Re:Logo work? by gblues · · Score: 4, Informative

      If you want to know more about the special logos (referred to as "Google doodles"), as well as see an archive of the Google doodles over the years, go here.

      Nathan

    5. Re:Logo work? by dubiousmike · · Score: 1

      I believe it is one of their assistant web masters.

      I couldn't find the article on him, but it was cached on Google. ;^)

      http://216.239.51.100/search?q=cache:gZ8ccuVepmY C: ap.tbo.com/ap/breaking/MGA57ZPADYC.html+google+log o+artist&hl=en

  38. Porn? by Anonymous Coward · · Score: 0

    When will google be able to index porn? That would be a great feature that most of us geeks would like to see ;)

    ps.: Keep up the good work!

  39. Does peer-to-peer have a place at google? by Anonymous Coward · · Score: 0

    Has google considered offloading traffic to other architectures besides server farms? Specifically, have you considered a distributed client/server model like peer-to-peer?

  40. Competitive by scubacuda · · Score: 2

    How has Google remained competitive?

  41. Google and IP address. by Anonymous Coward · · Score: 5, Interesting

    Why in this day and age does google continue to penalize sites that are virtual hosted? With ip addresses becoming harder to get/justify every day why does google discount the relevance of links that don't come from a unique ip address. Please don't just deny it, I think the Internet community deserves an explanation.

    1. Re:Google and IP address. by Caspuh · · Score: 1

      If a site and its linking site are hosted on the same IP, it's probably safe to assume that they are owned by the same person or at least colluded in some other way.

    2. Re:Google and IP address. by Anonymous Coward · · Score: 0

      This would be to prevent the act of 'googlebombing' and other tricks whereby a spammer sets up hundreds of virtual hosts which all link to a particular site, increasing the ranking for that site. Recently, a search for 'Javascript string manipulation' would turn up a particular porn site, with hundreds of other sites with a link with those words pointing to that site.

    3. Re:Google and IP address. by c0wh · · Score: 1

      Apparently, it is to prevent those with dubious intentions from inflating their own rankings.

      It would be easy to set up a nearly unlimited number of virtual hosts which all link to the same site.

      There may be more sophisticated ways to discern whether normal site is served using virtual hosting or it's just someone looking to boost his ranking, so the above question should still be considered.

  42. Question - Google's first programming contest by PK_ERTW · · Score: 5, Interesting

    Google recently ran it's "first annual programming contest," with a winner receiving $10,000. Many slashdotters suspect this was simply a way to recruit new talent. So, was finding new people one of the initial goals for this project, and have you hired any new programmers as a direct result of it? What were the other goals (PR, generation of new ideas, etc) where there?

    --
    Engineers arn't boring people, we just get excited about boring things.
    1. Re:Question - Google's first programming contest by KjetilK · · Score: 1

      Well, if it was (and it probably was), what's wrong with it? I think it is a great way to recruit people, for all involved.

      --
      Employee of Inrupt, Project Release Manager and Community Manager for Solid
    2. Re:Question - Google's first programming contest by PK_ERTW · · Score: 1
      I don't think there is any thing wrong with it all. I am not trying to be accusing, I simply want the flat out info and the truth about the reasons behind the contest.

      PK

      --
      Engineers arn't boring people, we just get excited about boring things.
  43. Just exactly how cool is it. by TheDick · · Score: 1

    That the site I run is on the top of the list when you search for"ask dick"

    Cool eh?

    --

    1. Re:Just exactly how cool is it. by fons · · Score: 2


      I was going to write:

      "that could be cool if there would actually be some new content on that site"

      But then I saw you just (finally? :) ) did an update.

      yey! askadick rules.

    2. Re:Just exactly how cool is it. by TheDick · · Score: 1

      Tell all your friends. I need lots and lots of hits

      (no, not in my shitter).

      --

    3. Re:Just exactly how cool is it. by fons · · Score: 2


      why do you think i keep replying?

      The longer this thread becomes, the more people will see it!

      how about that!

  44. Logo artwork by Phantasm66 · · Score: 1

    Who draws all of your wonderful and witty Google Logo artwork?

    1. Re:Logo artwork by Anonymous Coward · · Score: 0

      http://detnews.com/2002/technology/0203/04/technol ogy-431591.htm

    2. Re:Logo artwork by Anonymous Coward · · Score: 0

      You can find an article about the man here. And all I had to was type "google logo designer" in my favourite search engine...

  45. As a market leader... by Marx_Mrvelous · · Score: 5, Interesting

    It's well known that you use Linux in your mega clusters. I was wondering if you have ever been approached by Microsoft, Sun, or HP in an effort to switch to their proprietary OSes.

    I can't imagine that you haven't. It must have been a huge decision to invest in one technology, so are you satisfied with what you have?

    --

    Moderation: Put your hand inside the puppet head!
  46. What version of what OS do you use and why? by Anonymous Coward · · Score: 1, Interesting

    What version of what OS do you use and why?

    Also, what are your plans if any to change this
    in the future?

    And what do you use to benchmark/stress test your setup/servers?

    Thanks!

  47. Does size really matter? by scubacuda · · Score: 4, Interesting
    How do you feel about Alltheweb.com having a bigger index?

    1. Re:Does size really matter? by 0x0d0a · · Score: 2

      I don't understand how they can have a bigger index -- google's bot regularly hits the sites that I have access to the weblog on, but alltheweb's doesn't.

      Maybe alltheweb is just out of date, and only indexes things once?

    2. Re:Does size really matter? by Anonymous Coward · · Score: 0

      Maybe alltheweb is just out of date, and only indexes things once?

      Exactally, and I know this from personal experience.

      I have 3 old sites that were taken down between 6 months and 2+ years ago and they are still in the serp, more annoying than that they are listed in front of my current site and show scores of pages from each site!

      Hell, anyone could have a huge database if they never purge the damn thing.

  48. 'Web Indexing companies" by RembrandtX · · Score: 5, Interesting

    Recently, the english division of our company [black and decker] hired 'HyperMedia Trafficing' or some other similar named company to get them 'more exposure' in the search engines.

    [forget the ethical debate about that .. or why no one bothered to ask me what to do.]

    What I want to know, is - going fowards - as more and more of these companies start up, and discover more and more unscrupulious ways of 'loading' the search engines with bogus hits/visits/data/etc. .. How does Google plan to make sure they are :

    1) Not loosing ad $$ to these folks
    and
    2) prefenting every search from returning something like www.hotgrannysex.com or www.top50.com as the 1st (or first 15) results for a search on .. well .. pretty much anything.

    --

    --Ne auderis delere orbem rigidum meum, non erravi pernicose!
  49. What happened to PigeonRank?! by Arallok · · Score: 1

    What did you guys do with all those harding working Pigeons?!

    Seriously though... can they actually do mork work than than these penguins?

  50. Forget Craig by Talisman · · Score: 4, Funny

    No offense to Mr. Silverstein, but I'm much more interested in Cindy! Beautiful, highly successful nerds are terribly rare!

    Just so I'm not off-topic:

    Mr. Silverstein, how does Cindy look in tight sweaters?

    Drool...

    Talisman

    --

    "Study your math, kids. Key to the universe." -The Archangel Gabriel
    1. Re:Forget Craig by Anonymous Coward · · Score: 1

      How again does being a PR wonk with a journalism degree make someone a nerd? Remember the "geek" who went through that SEAL course that made it on /. a while back? He was a PR wonk with a communications degree!

    2. Re:Forget Craig by SquadBoy · · Score: 1, Offtopic

      I'm in a bitchy mood today. So the problems with your post.

      1. She is fucking ugly. You need to check out some hot chicks to compare and contrast.

      2. She is a marketdroid. I'm thinking she is not geeky at all.

      3. These power tripping bitches do not wear hot clothes. Get a fucking clue.

      --

      Cypherpunks: Civil Liberty Through Complex Mathematics. Those who live by the sword die by the arrow.
    3. Re:Forget Craig by Anonymous Coward · · Score: 0

      You're crazy, she's a fucking dog.

      Woof!

    4. Re:Forget Craig by Anonymous Coward · · Score: 1, Funny

      Nice try, Cindy.

    5. Re:Forget Craig by Anonymous Coward · · Score: 0

      Dude, she's got to be pushing 50. Any semblance of good looks that she has is obviously the result of extensive plastic surgery. She is more machine than woman, twisted and evil.

  51. Opinions on being open by SuperguyA1 · · Score: 4, Interesting

    One of the most impressive things about Google to me is how easily you seem to have embraced an open model. I realize the outward view of a company can be quite different from the internal view. How easy is it actually to make decisions such as opening API's. If it's easy can you give some advice on how one might convince their boss.

    Thanks,
    -Dave

    --
    "as plurdled gabbleblotchits on a lurgid bee" - Prostetnic Vogon Jeltz. (One man's humorous is another mans flamebait)
  52. The future of Google by glh · · Score: 5, Interesting

    Hi Craig!

    I think Google absolutely rocks. It has by far the most intelligent/helpful search engine results. Thanks for the great service.

    Now onto the questions- what is the Google vision / strategy for the future? Where can Google go? From a search engine perspective, what are some of the challenges that you have and improvements that can be made (perhaps speeding up crawling to make the latest content available, for example)? How are you going about solving these challenges, and when can we expect them to be implemented?

    On a similar note, I've noticed that recently Google announced a "google box" that allows for corporate to take advantage of the google search algorithms and indexing. Any more products like this being planned?

  53. Shoulda been asked by Fished · · Score: 1

    This shoulda been asked already, but I don't see it. What's next? Google has consistently added more and better "convenience" features than any other search engine. What's the next big thing? Are there any fundamental technological changes coming?

    --
    "He who would learn astronomy, and other recondite arts, let him go elsewhere. " -- John Calvin, commenting on Genesis 1
    1. Re:Shoulda been asked by Anonymous Coward · · Score: 0

      http://www.bbspot.com/News/2002/05/google.html

  54. Question for Google by saurabhchadha · · Score: 1

    Since you have such a large farm, how do you keep track of the various computers (state/performance etc.) Do you have a network management software built in house or use 3rd party software if at all.

  55. Google suggestions by Ed+Avis · · Score: 2

    Is there any way we can find out what kind of suggestions Google receives from the public? It would be quite interesting to look at them all, and maybe some of them could now be implemented using the Google API.

    --
    -- Ed Avis ed@membled.com
    1. Re:Google suggestions by Anonymous Coward · · Score: 0

      What kernel versions do they use? How do they balance load?

  56. Attacks? by Fnagaton · · Score: 4, Interesting

    I have a number of web servers, some Unix some Windows, and the number of attempted attacks each day from different IPs must run in to about one hundred. It is mostly people trying to execute commands or using malformed URLs trying to exploit some known past security hole. My question is, how many attempted attacks each day do the Google servers get?

    --
    Martin Piper
    Owner - ReplicaNet and RNLobby
  57. Can Google last? by Reality+Master+101 · · Score: 5, Interesting

    Google is a great free public resource. My concern is that it has to be expensive running a resource like that. I know Google's strategy is somewhat to use the free resource as a loss leader to promote your search technology, but the key word in "loss leader" is "loss". It's a great theory as long as you are able find people who want and need your search technology.

    So my bottom line question is this: Does the web site pay for itself via the advertising? Is there a possibility that someday Google may decide the web site costs too much money to run if you get to a point where your reputation no longer needs the loss leader?

    --
    Sometimes it's best to just let stupid people be stupid.
  58. so what does it look like? by paradesign · · Score: 5, Interesting
    I know the programming contest winner gets a tour of your facility, but I think I speak for all of us when I say, I wanna see it too!

    It would be great if you did a documentary feature with TechTv or someone, because its one thing to read about your facility, but it would be another to see it.

    Thanks for all of the help I've gotten from Google.com, I don't think I'd still be in schol without it.

    Paradesign

    PS, even just a photo feature on the site would be nice.

    --
    I want 2D games back.
    1. Re:so what does it look like? by Anonymous Coward · · Score: 0
      I don't think I'd still be in schol without it.

      [witty remark goes here]

    2. Re:so what does it look like? by jeffy210 · · Score: 2

      You can... but you need to have a golden ticket.

      --
      ------
      "And may your days be long upon the earth."
  59. Google cache by Greenrider · · Score: 5, Interesting

    Anyone who has ever needed a piece of information that was on a broken page will agree that the Google page cache is perhaps one of the most underrated and useful parts of your search engine.

    There's one problem that everyone has with the cache, however - you don't deep-nest the caching, so that following any links on a cached page will lead to the original (probably broken) site, instead of to another cached page. Is there a technical or legal reason for why it works this way? Any chance we'll see deep caching at some point?

    1. Re:Google cache by Lumpish+Scholar · · Score: 3, Informative
      ...you don't deep-nest the caching, so that following any links on a cached page will lead to the original (probably broken) site, instead of to another cached page.
      Check out the Google Toolbar (for IE only, alas)-, which adds a "Cached Snapshot of Page" item to the right click menu. Very, very cool.
      --
      Stupid job ads, weird spam, occasional insight at
    2. Re:Google cache by Storm+Damage · · Score: 1

      Hmm...my googlebar for mozilla does the exact same thing from the "page info" menu button.

      I would be curious to know if Google had any official feelings on the Googlebar project. Do any mozilla-loving Google hackers use this utility for instance? Have any submitted code for it? Will Google be offering any sort of official support for Mozilla in the future?

    3. Re:Google cache by taernim · · Score: 1

      Not knocking Google's capabilities, but The WayBack Machine caches entire websites... It's quite useful when you "delete something" you needed... and then notice that it's not totally gone after all. :)

      Just FYI...

      --
      "PC Load Letter? What the $@#% does that mean?!"
  60. Google and 9/11 by openSoar · · Score: 1

    google seemed to be one of the few (only??) sites that managed to deal with the deluge of extra traffic. can you tell us a little more about google on that awful day?

  61. question for Craig by Jucius+Maximus · · Score: 2

    I expect that the California heat and thousands of boxen require special measures to prevent overheating. What kind of measures do you take for keeping your server farm operating normally at a cool temperature?

    1. Re:question for Craig by IIRCAFAIKIANAL · · Score: 2

      That's a good question! I live in Canada and our AC went down in our server room back in march and we had to have shutdowns at certain intervals to keep them all running safely.

      We have a lot of boxes (~20 or so, I think - i'm a programmer, not a network admin), but we can't have nearly as many as Google, so they must have one helluva cooling system being in a place known to be warmer than Canada :)

      --
      Robots are everywhere, and they eat old people's medicine for fuel.
    2. Re:question for Craig by Jucius+Maximus · · Score: 2
      "That's a good question! I live in Canada and our AC went down in our server room back in march and we had to have shutdowns at certain intervals to keep them all running safely."

      I know what you mean ... I am also Canadian and at my university, rooms for the engineering computer labs have problems with the cooling systems and with ~70 boxen +19" monitors turned on 24/7 and superhot display projection systems plus exactly ONE window that only opens about 15 mm wide, the PEOPLE overheat!

      I think it is only a matter of time until one of the boxen dies of heat.

  62. SCHI by Anonymous Coward · · Score: 0

    StroudCo Heavy Industries is going to buy Google. Just you wait.

  63. Favourite search engine by fruey · · Score: 1

    What was your favourite search engine before Google was launched? Which other search methods do you use other than your own site? Do you remember Yahoo! when it looked like this?

    --
    Conversion Rate Optimisation French / English consultant
    1. Re:Favourite search engine by YaRness · · Score: 1

      man what a blast from the past.

      google is what i switched to after yahoo went down the toilet.

    2. Re:Favourite search engine by glh · · Score: 1

      I always went to either lycos or altavista, and that was back in '94. The quality of both of them started going down especially after they allowed businesses to "pay" to get hits higher.

  64. My question by SlashdotTroll · · Score: 1, Funny


    What is the air velocity of an un-laden Eatern-European Swallow?

    --

    I am the nightmare of nightmares.

    1. Re:My question by GreggBert · · Score: 1

      Why no truly relevant search results for Amish techno music ?

      --


      If you don't understand anything I post, please accept that I ate paste as a small boy...
  65. Google's inescapable coolness. by rob_from_ca · · Score: 5, Insightful

    How do you avoid business pressures to make short-sighted solutions, and consistently make good, common sense ideas work instead of adopting ones from marketing sources? Not only does Google have the best search engine technology, but you consistently do the "right" thing. Clean, quick homepage, text only well-identified ads, interesting research projects, etc...This is the way many search engines start, but they all went the way of the "dark" side instead of adopting the "right" solution. In my jobs, it's been very difficult to execute and justify good engineering (or just common sense) under pressure from the people who control the money. Any advice for driving through well-thought-out decisions instead of adopting the "management fad of the month"?

  66. Possible to have too much power by sphealey · · Score: 5, Interesting
    In one of Robert Heinlein's novels (don't have the reference at hand), the main character is told to sit down in front of what we would think of today as (WWW + Google) and "learn whatever she can about everything". After a few weeks of coming up with some useful stuff, she finally asks the system: 'who controls this database?', and it replies 'not programmed with that information'. The next morning an assasination team tries to kill her.

    Not to be too "X-File'ish", but does there come a point where too much knowledge is captured in Google? A point where anything that doesn't exist in Google doesn't exist, period? Wouldn't that represent a very tempting target for a bin Laden or a John Ashcroft, to try to control how the modern world thinks?

    Kind of far out there, I know, but do you guys worry about this kind of thing?

    sPh

    1. Re:Possible to have too much power by NCamero · · Score: 3, Insightful

      FYI: That Heinlein novel was named Friday.

    2. Re:Possible to have too much power by jpaulson · · Score: 1

      The book is called Friday.

      --
      -- Jason
    3. Re:Possible to have too much power by jdavidb · · Score: 2

      I'd just like to go on the record and say not everyone thinks likening John Ashcroft to bin Laden is an appropriate comparison. Slashdot may be lockstep on some issues, but let's open our minds and have some healthy disagreement here. Try a different perspective for a change.

    4. Re:Possible to have too much power by felipeal · · Score: 2

      A point where anything that doesn't exist in Google doesn't exist, period?

      Like:

      When the chief Jedi record-keeper is asked in "Attack of the Clones" about a planet she has never heard of, she replies that if it's not in the Jedi archives, it doesn't exist. (The planet in question does exist, again, with terrible consequences.)

    5. Re:Possible to have too much power by SpaceHamster · · Score: 1

      A point where anything that doesn't exist in Google doesn't exist, period?

      "Silly Obi-Wan, if its not in the Google Archive(tm), it doesn't exist..."

      --
      "BeOS is a great operating system" -Doug Miller, Microsoft
    6. Re:Possible to have too much power by Anonymous Coward · · Score: 0
      I'd just like to go on the record and say not everyone thinks likening John Ashcroft to bin Laden is an appropriate comparison.
      I suspect you have never lived in the state of Missouri. And you probably haven't compared Ashcroft's actions as governer, what he said during his confirmation hearings, and his actions as Attorney General.

      But the comparison to bin Laden is not the most accurate. bin Laden wants to kill everyone who disagrees with him, while Ashcroft wants to convert them or failing that put them in jail. Ayatollah Khomeini is probably a better comparison.

    7. Re:Possible to have too much power by WNight · · Score: 2

      Actually, even if you don't associate them with each other, the post still works. Bin Laden wants to destroy western culture, Ashcroft wants to save it (as he sees) by any means, even so far (as we see) by destroying what it stands for.

      They're very different, but both a threat.

  67. How do you manage your infrastructure? by mveloso · · Score: 1

    How do you roll out patches, new versions of software, etc? How do you make sure that stuff is running?

  68. Slashdot effect? by Lumpish+Scholar · · Score: 5, Interesting

    Many sites, when referenced by Slashdot, crumble under the load. Can you folks see any difference, either to your "main" servers (www.google.com) or your cache servers?

    --
    Stupid job ads, weird spam, occasional insight at
    1. Re:Slashdot effect? by MG23 · · Score: 1

      I think someone is overestimating Slashdot. You just link a lot of crummy fansites that are powered by single CPU systems.

    2. Re:Slashdot effect? by Anonymous Coward · · Score: 2, Informative

      I doubt very much that they'd notice a significant change in load when slashdotted.

      One commercial website I look after has been slashdotted quite a few times on some quite big stories, sometimes resulting in an extra 20k pages/hr, though normally quite a bit less.

      On an average day, we serve about 2m pages (so say 150k/hr during the popular daytime hours) and we've quite a lot of spare capacity, so an extra 20k pages/hr doesn't worry us too much.

      Google say they're serving 150m searches/day so an extra 20k/hr will go practically unnoticed I imagine.
      (see http://www.google.com/press/highlights.html)

      While I'm on the subject, does anyone else have experience with running a fairly big site while it's being /.'d? Does about 5-20k pages/hr in referrals sound about right? Maybe my experience is atypical, I'm not sure...

  69. google API by bigmush · · Score: 1

    What has been the reaction to the Google API
    that was announced a while back? Have you been
    able to monitor a specific rise in traffic
    from sites using the API?

  70. Your average day... by m0nkeyb0y · · Score: 1

    Can you give a rundown of what your average work-day is like...and what about your hectic days?

    --
    -- From my Best Friend (Written to me over ICQ): "i was gonna go to a party...but i had to reinstall windows"
  71. no questions by phrostie · · Score: 1

    just a compliment.
    great search engine!

  72. Important Question #@ +1 ; Provocative @# by Anonymous Coward · · Score: 0

    With all of the talk about the federal government of United States of Amerika wanting all ISPs to log their traffice:

    Do you think President George W. Bush should be
    impeached?

    Thank you

  73. Please give your thoughts on the following article by perl_god · · Score: 0, Funny

    Important News for Linux Enthusiasts

    COURTESY OF THE NEW YORK TIMES

    Gay Linux Stores Feel the Pinch of Customers' Liberation
    By MARTIN ARNOLD

    For the owners of most Gay Linux and lesbian bookstores, there seems to be little to be celebratory about this Gay Linux and Lesbian Pride Month. The owner of the Oscar Wilde Bookshop in Manhattan, generally acknowledged as the world's first Gay Linux and lesbian bookstore, said, "It's about to go out of business." Little more than a year ago, A Different Light, one of the country's largest Gay Linux bookstores, closed its New York branch.

    Clearly the outlook for such stores is grim. Until recently, much the same could have been said for independent bookstores generally, their territory having been largely gobbled up by the march of chain stores across the land and the buying of books on the Internet. But there now seems to be a bit of a resurgence of the independents, publishing executives say. Still, the Gay Linux bookstore is seriously endangered.

    Part of the problem is assimilation, the very success of the Gay Linux movement. Gay Linux and lesbian issues are now so openly discussed in the mainstream media that it's almost as if Gay Linux literature were no longer niche publishing. And that being true, writers and their publishers want their books to be displayed on the chain stores' A-to-Z shelves, not just in Gay Linux sections, and certainly not in Gay Linux bookstores only.

    Surprisingly, the slackening in tourism affects Gay Linux bookstores. In large cities a stop on the itinerary for Gay Linux visitors is the Gay Linux bookstore, where they can often find reading not easily available back home.

    "The loss of the tourists certainly affected us in Washington," said Deacon Maccubbin, owner of Lambda Rising, a Gay Linux bookstore there with two branches in Maryland and one in Virginia. "It's always been a significant part of our business. Tourists come in to get the free local Gay Linux newspapers to find out what's going on in the area for Gay Linuxs, and they buy books."

    Gay Linux bookstores also have generational problems.

    Larry Lingle, owner of the Oscar Wilde Bookshop in Greenwich Village and the Lobo Bookshop and Cafe in Houston, said, "Fewer people read now, and that's just as true of Gay Linux readers as it is for others." He added that most of his regular customers "are at least 50 and over."

    "You don't find younger ones reading much," he said. "But if they do, they are addicted to buying on the Internet."

    African-Americans "support their authors and stores, even a book signing by Gay Linux black writers," he said. "But younger Gay Linuxs don't. I had a lesbian writer in the store for a signing. She signed books but said she buys the books she reads on the Internet."

    For younger Gay Linuxs and lesbians, societal acceptance is a matter of course. Kim Brinster, manager of Oscar Wilde, said: "When I was coming out, it was drilled into us the importance of supporting Gay Linux restaurants, Gay Linux bars, Gay Linux bookstores. But now Gay Linuxs take this all for granted, a byproduct of assimilation."

    So in the general malaise of book publishing, Gay Linux and lesbian publishing appears to be in a particularly quiescent period. Think of this: New York is the only city in the country with more than one Gay Linux and lesbian bookstore, every store owner interviewed for this column said. Jenie Carlen, a spokeswoman for Borders, the book chain said, "The Gay Linux and lesbian category peaked about seven years ago, and since then has been flat and declining as it's moved into the mainstream."

    Borders and Barnes & Noble have Gay Linux and lesbian sections, but with a limited number of titles compared with Gay Linux shops. Moreover, many of their Gay Linux books are scattered throughout other sections, particularly Gay Linux fiction, which is gaining a larger crossover readership.

    But even with the far greater variety of titles, the Gay Linux bookstore is struggling. "We were a real destination for Gay Linux tourists, and that's starting to come back," Ms. Brinster of Oscar Wilde said. "Our store had slight increases until this year, but now sales are down drastically." Mr. Lingle, the owner, said that he might close the shop "because we can't get any traffic." A book signing, traditionally an attraction for potential customers in any bookstore, would draw a "pathetic" attendance, he said.

    "Even the Gay Linux press pays little attention to Gay Linux books, less to bookstores," he said. "Gay Linux bars, Gay Linux parties -- those who spend on ads get the press coverage." He bought the store six years ago, he said, because of "a certain reverence for its history, and unfortunately after six years never made a dime," even though he included hard-to-get out-of-print Gay Linux classics in his stock.

    There are only three Gay Linux bookstores in New York, which has the largest Gay Linux and lesbian population in the country, the other two stores being Creative Visions in the West Village and the new Bluestockings Women's Bookstore on the Lower East Side. That's more or less like having a dozen movie houses for the whole city, but at least they'd be full.

    Vincent Migliore, owner of Creative Visions, said that the obvious advantages of Gay Linuxs buying books in Gay Linux stores was not only the greater Gay Linux inventory than the chains have, but also that "the customers can talk to people who have actually read the stuff and led the life."

    True enough, but that doesn't seem to matter too much in New York or elsewhere. In Denver, for instance, James Dovali, owner of that city's only Gay Linux bookstore, Category 6 Books, said that after 21 years, "I'm almost ready to close." He added: "Yeah, the Internet is going to kill us all. I might survive, if I can pay my bills. Right now I'm just making it, hoping to hang on."

    The mainstreaming of Gay Linux fiction -- a paradoxical problem for Gay Linux stores -- can be seen in independent stores like the Corner Bookstore in Manhattan. Christopher Lenahan, its buyer for adult books, said, "As a whole, the sales of Gay Linux fiction have gone up a bit for us because a lot of heterosexuals are now reading them as well." His store, on the Upper East Side, serves a population that is highly educated and well-to-do and "a bit older," he said.

    "We are selling more titles that are Gay Linux related," he added. "Completely in fiction. Gay Linux nonfiction doesn't translate."

    Trying to figure what's going on in Gay Linux and lesbian publishing and the stores, like much of the book world, is rather like struggling to bottle the wind. But one thing seems clear: unless younger Gay Linuxs bring some of their pride to the literature relevant to them and are willing to spend a bit more to buy books in Gay Linux stores, such stores will soon be extinct, and that will be another unfortunate chip in our culture

    --
    reality timed out @ 11:11
  74. mod_google by TwP · · Score: 5, Interesting

    Just curious when mod_google is going to be released for the apache webserver. It would be nice to have the power of Google indexing available to those of us without significant IT budgets (i.e. wife won't let me "buy another #$*@! computer").

    1. Re:mod_google by DeadSea · · Score: 2
      You should check out htdig. It now comes with Redhat. It will crawl a web site or web sites, index them, and provide a web search. You can set it up to look a lot like google. You can tweak the parameters so that it pays attention to how often a page is linked and you can set up weights for how important a word is based on where in the page it is and even if it is in the link text that points to that page. I don't think that google has much more than that, but they seem to have their values well tuned.

      It isn't the easiest thing to configure since there are so many options for crawling and ranking pages. The look and feel for the pages it spits out isn't so clean looking as google, so when I've set it up I've had to modify that as well. It doesn't do caching, or tie in the a directory, but for a local search, those aren't much use anyway.

    2. Re:mod_google by llamalicious · · Score: 2

      or , 'eh?

  75. Load balancing? by Lumpish+Scholar · · Score: 2

    How do you balance load among the www.google.com servers? Do you guide users to local servers (such as www.google.co.uk)?

    --
    Stupid job ads, weird spam, occasional insight at
    1. Re:Load balancing? by IIRCAFAIKIANAL · · Score: 2

      I don't get guided to google.ca, so I would guess the answer is no.

      (It's easy to id me as Canadian with my ip, btw - i've been forwarded to a canadian version of a site more than once :)

      A better question might be:

      Do the servers reside in different geographic regions (ie/ google.ca frontend and index in Canada)? Do they connect to one central index or is it replicated?

      This information may be available, but I haven't looked, so maybe it's already answered on their site...

      --
      Robots are everywhere, and they eat old people's medicine for fuel.
  76. What Distro & Why? by MrWinkey · · Score: 1

    This question has probably been asked 1000 times but what Linux Distro do you guys use and why did you pick it over the many other great distros?
    Did you develop or modify it in any way to meet your special needs?
    Do you have a cluster or load sharing enviroment setup on that distro?

    Thanks alot Craig We love google!

    --
    Vote early. Vote often. Vote CowboyNeal.
  77. When things get ugly by timdorr · · Score: 5, Interesting

    What's the worst thing ever to happen to the google server farm? (Besides the pidgeons knawing on cables)

    --
    Tim Dorr
    Owner/Manger
    A Small Orange
    1. Re:When things get ugly by Anonymous Coward · · Score: 0

      you misspelled 'gnawing'.

      hope this helps.

    2. Re:When things get ugly by timdorr · · Score: 1

      well, sob...

      there goes my chance at it being asked :D

      --
      Tim Dorr
      Owner/Manger
      A Small Orange
  78. Questions by Sadadar · · Score: 1

    How hard has it been and will it be to keep google a simple interface rather than turning into a yahoo or infoseek that takes longer to load the page than it does to execute the search? How many people are pushing for that type of commercialization/sales in your business?
    Where did the name Google come from?

  79. Efficient technology. by CommonSalt · · Score: 1

    I would like to know (Other that what distro they use) what Google uses to efficiently spread apache of mysql or whatever over alot of Linux Servers, is it automated? (NIS, NFS???)

  80. Dealing with DoS by Wanker · · Score: 5, Interesting

    How does google deal with denial of service attacks, particularly distributed ones?

    The rest of us just suck it up with fat network pipes, but a high-profile target like google would be the holy grail of Internet vandals.

    Has anyone ever poisoned your DNSes, effectively taking Google down even though the server are up? Successfully inserted bogus WAN routing info into the Internet, again effectively bringing down Google even though the servers are fine?

    What's your worst cracker/net vandal story?

    1. Re:Dealing with DoS by MojoReisen · · Score: 0

      On the same note, I'm curious as to what was the weirdest network or security problem that you've seen at Google ?

      --
      "Nothing is impossible for the man who refuses to listen to reason"
  81. MODERATE THIS!! by ramdac · · Score: 0, Troll


    Craig, are you jewish?

  82. here's a good one. =) by someonehasmyname · · Score: 1

    Why don't you use FreeBSD like all the other huge sites?

    --
    Common sense is not so common.
  83. Database indexing and searching. by Principito · · Score: 1

    Hi Craig,
    I'm curious on the future of the Google Search Appliance. It seems like a good solution to help a company index millions of documents to be easily searchable, however have you considered tayloring it to be a drop in solution for database indexing and searching. Many companys have products that deal with data mining through millions of records on relational databases could benefit from Google's technology.

    Keep up the good work!

    --
    "Be kind, for everyone you meet is fighting a hard battle." -- Plato (427?-347? BC)
  84. Key Ingredients To Success? by timeOday · · Score: 4, Interesting
    Conventional wisdom holds that marketing and strategy are the keys to success in business, and that technical excellence is a relatively minor factor. Yet google seems to have come out of nowhere to dominate an already crowded market for search engines - without Superbowl ads, a mascot, or (unless I'm mistaken) an IPO.

    To what do you credit the popularity of google? Do you consider google a "success," or are you holding out for thousands of employees and billions in cash flow?

  85. What about the semantic web? by oisteink · · Score: 1

    Any thoughts?

  86. Important question! by fractalk · · Score: 1

    Ive lost my keys.

    Do you know where I left them?

    1. Re:Important question! by cicadia · · Score: 2

      Why don't you ask the Internet Oracle?

      --
      Living better through chemicals
  87. OS: 90% Win, 4% Mac, 1% Linux, 5% Other by crow · · Score: 2

    That's for May. Of course, it's all at the Zeitgeist, as linked in other responses. I don't blame you for not knowing about it, though; I've tried to find it from Google's web page, but couldn't until I searched for it (using Google, of course).

  88. What does Craig do for fun? by ManxStef · · Score: 3, Interesting

    Personally I'm usually pretty drained after a fun day staring at the screen and typing like a monkey, and sometimes completely avoid the PC when I get home, prefering to chill with a decent book (currently Cradle to Cradle), zone-out in front of the TV, or go cycling in the beautiful Isle of Man (watch "Waking Ned Devine" for an idea of the scenery - jealous?<grin/>).

    So I guess my completely-non-tech question is:

    What do you do in "loafing" time (ie. loaf - To pass time at leisure; idle.), when you've left the office, "lost" the pager/Blackberry/PDA/mobile etc., and got away from it all?

    Cheers,
  89. MOD PARENT DOWN by emoeric · · Score: 1
    the guy


    not that i hate the parent, but it doesnt need to be asked now.

    here's a cache (hahaha, a google-cache of the previous logos, including the dilbert comic ones mentioned in the other reply

    --

    |---------------|
    practically an AC
  90. Heuristics by rumborak2000 · · Score: 1

    Is the order of sites you return solely based on the number of occurences of the search term, or do you also take popularity (eg how many other sites link to it) of those sites into consideration ?

  91. The Web's full potential by __past__ · · Score: 5, Interesting
    What do you think about the Semantic Web initiative driven by the W3C and others?

    Do you expect widespread usage of RDF/DAML/OWL/TopicMaps for explicit meta-data annotation of web resources, or will it be used only in small circles of specialized content providers like academia, or maybe not at all?

    How will Google react? Do you plan to use meta-data provided by web resources if found, and how will you decide if it isn't just made up to get people on some bogus pr0n site (like with those <meta>-Tags today)? Will it someday render the brute-force approach of full-text-indexing obsolete?

  92. Distributed Google by maroberts · · Score: 2

    Have you ever considered setting up a distributed search engine client to expand your server farm through your users systems?

    --

    Donte Alistair Anderson Roberts - hi son!
    Karma: Chameleon

  93. What's the spark? by slasho81 · · Score: 1

    If you had to choose one and only one thing that has made Google the most popular search engine, what would it be?

  94. Why not Google as a Non-Profit? by mr_don't · · Score: 5, Interesting
    ...or, what if google gets hit by a bus?

    Google has become such an important part of the Internet for millions of average users. With this in mind, my friends and I often joke about what would happen if (knock on wood) Google were to go out of business. I suggest that ICANN should do something useful for a change, and fund Google as an official, non-profit project for searching the net.

    Although I have heard that Google turns a good profit, what exactly is preventing Google from becoming a not-for-profit organization? Couldn't Google take the extra income from licensing its search to create better search technologies and pay the employees, rather than make some shareholders rich? Wouldn't this perhaps make Google a more sustainable organization?


    1. Re:Why not Google as a Non-Profit? by alouts · · Score: 1
      While I sort of agree with the populist tone of the question, the answer to why they will never become a public resource is pretty simple.

      Venture capitalists

      If you look at the first few items, you'll see they got quite a bit of funding already. So, they've already got (private) shareholders who are waiting for their return on their investment. Now, if we had nabbed them back when they were just a college project...

    2. Re:Why not Google as a Non-Profit? by rnd() · · Score: 2
      There is nothing wrong with making the people rich who believed in your idea enough to invest their money in it.

      If you want to start a not-for-profit search engine, go right ahead.

      --

      Amazing magic tricks

  95. Googe AS Gateway to the Internet by Nomad7674 · · Score: 2

    At least once a week now, I read someone who proclaims that "I no longer even use bookmarks or try typing in URLs. I just always go to Google for my information." Has anyone approached you (or have you considered yourself) producing a Web Browser which has no URL line, but instead has a Google line to automatically send anything typed there to Google as a gateway to the Internet? Seems like it would "sell" to the Google-holics.

    1. Re:Googe AS Gateway to the Internet by dargaud · · Score: 1
      Download the Web Enhancement for Internet Explorer (if that's what you're using). You can configure it to call from the command line. For instance I have:
      • "g keywords" to do a google search on the keywords
      • "img icebergs" to do an image search
      • "w someone" to do a person search in groups.google
      Plenty of other possibilities. I hardly type urls anymore and I never see the google main page (so I missed the cool logos).
      --
      Non-Linux Penguins ?
    2. Re:Googe AS Gateway to the Internet by frankske · · Score: 1

      this exists! Opera has it builtin, and for IE you can download the googlebar thingie. I use it a lot!

  96. searching vulnerabilities by krammit · · Score: 1

    I remember reading an article a few months back that google was being used to discover exploitable webservers by using a malformed url as the search string. Has Google taken any steps to prevent this kind of malicious usage? Does Google even have a policy on this kind of thing or does it regard this as the responsibility of webmasters to keep their servers patched?

    --
    "Watch your cornhole, bud."
  97. Technical Advisory Council by Lumpish+Scholar · · Score: 2

    You've got some incredibly cool peopleon your Technical Advisory Council. How often do you interact with them?

    --
    Stupid job ads, weird spam, occasional insight at
  98. one question per post.... by Anonymous Coward · · Score: 0

    dumbass.

  99. Nice Question by sweatyboatman · · Score: 2

    If I had mod points right now... Well, it's at 5 already.

    --
    It breaks my pluginses, my precious!
    1. Re:Nice Question by WhiteDragon · · Score: 1

      we need a higher cap than 5... how do you determine the top ranked 10 comments when there are more than 10 modded to 5?

      --
      Did you mount a military-grade, variable-focus MASER on an unlicensed artificial intelligence?
  100. Result Plateau by shakamojo · · Score: 0

    How do you account for the plateau that occurs when you have a pigeon cluster of 7, 8 or 9 pigeons yet results remain in the 500 range regardless of the additional pigeons?

  101. Browser stats lacking at Zeitgeist by crow · · Score: 3, Insightful

    They have a nice graph, but no scale. I suppose you could do some careful pixel analysis of the graph to generate percentages, but it's a shame they don't list them.

    Interestingly, I see "Other" has been steadily rising since it bottomed out in January, and has now surpassed Netscape 4. I would love to be able to click on that chart and see a detailed list of the percentages, and what "other" is composed of. Hopefully we'll see Mozilla get its own line on the graph soon.

    It would also be nice to see a breakdown on a per-OS basis. I wonder how many people are running Internet Explorer on Linux? (Seriously, that would indicate what portion of non-IE users hack the browser tag to make web sites happy.)

    1. Re:Browser stats lacking at Zeitgeist by mark_lybarger · · Score: 1

      I wonder how many people are running Internet Explorer on Linux?

      0.00523%. really, considering that 1% of the users are linux, and most linux users would rather boycot a site that requires IE (or just walk over to the machine that does have ie), it's gotta be a really low number.

      most corporate computers have win/IE configurations. most people surf the web at work. (not that i know that first hand mind you, but i've heard rumors)

    2. Re:Browser stats lacking at Zeitgeist by Anonymous Coward · · Score: 0

      It looks like Linux is about ~3% of all users.

      How embarassing. lol

    3. Re:Browser stats lacking at Zeitgeist by Zapper · · Score: 0
      Then there is Opera which lets you change how it responds to a request to identify itself.

      The list goes like this IIRC:

      Opera

      Mozilla 5.0

      Mozilla 4.76

      Mozilla 3.0

      MSIE 5.0

      --
      So much to do, so little bandwidth.
      --
      Try Mozilla
    4. Re:Browser stats lacking at Zeitgeist by skt · · Score: 2

      I noticed that too, they should break "other" up and show at least mozilla and opera individually. I would say that they could probably take off IE4 too, it seems to be almost dead. It's surprising how many people are using IE6 actually, although it's tough to see what percentage that is. IE6 is actually a relatively minor upgrade for v5 from what I understand.. I wonder if IE's "check for updates" feature is responsible for that. I'm pretty sure that the only OS to include IE6 by default is WinXP.

  102. Google cache and copyright by dargaud · · Score: 5, Interesting
    The google cache threads on the muddy water of copyright. Do you feel like you are going to run into trouble because of it ? Some conflicted opinions about it are:
    • It serves copyrighted pages without the author's consent
    • It serves pages without the original site's knowledge
    • It's very useful
    • If a page is on the web, it can be archived/cached...
    --
    Non-Linux Penguins ?
    1. Re:Google cache and copyright by RetroGeek · · Score: 1

      The google cache threads on the muddy water of copyright

      And as an addendum:

      Since you cache pages, you in effect become the publisher. How do the recent court cases where foreign countires are charging Web Publishers with illegal information (according to that countries laws) affect Google?

      --

      - - - - - - - - - - -
      I am a programmer. I am paid to produce syntax not grammar. Deal with it.
    2. Re:Google cache and copyright by SplatFileGoo · · Score: 1

      How can you justify caching without prior consent?

      How can you justify putting a self branding advertisement on the cached copy? (thus, making indirect monetary gain off other peoples work)

    3. Re:Google cache and copyright by Platypii · · Score: 1

      Because the main reason people use the cache is when a site is down.. for example if i want a business's phone number, but they're having a temporary network problem, i would think they would be happy i used the cache... and remember it eats googles resources too... why shouldn't they be compensated in some way?

    4. Re:Google cache and copyright by AVIDLY+INTERESTED · · Score: 1

      I once found a cache of credit card details. As luck would have it the website was one based in my city in Australia. I contacted the webmaster, who made them inaccessible, but it raises an interesting issue as to whether or not the cache is a little too comprehensive. Of course, it could also be argued that given Google is the world's biggest search engine, webmasters should keep in mind caching issues when they create their sites.

  103. Staying on Top... by Dr.+Molf · · Score: 5, Interesting

    Google is an incredibly popular and effective website. I'm curious about the amount of pressure you have to expand in order to "stay competitive" or "aptly serve consumer's needs". Is there any kind of a push to go the way of yahoo or amazon and try and include EVERYTHING on that simple page? As things evolve, do you really see Google staying the top engine in 3 to 5 years?

    --
    indeed..
  104. Load Balancing? by Openadvocate · · Score: 1

    Having read about the amount of servers you have running, I have been wondering how you do load balancing of databases and on the web servers?
    Fx. using layer 4-7 hardware switches?

    --
    my sig
  105. The Secret of Your Success? by broody · · Score: 1

    Drawing on your experience and what you consider the best advice you have received, how would you go about getting where you are today if you could do it over again? What motivates you to do the things you do? What would you do differently? What advice would you have for someone with a Bachelors degree in a non-technical field but an associates in computer programming and six years of business experience? In short, what to you credit as the secret of your success and how would you advise others?

    --
    ~~ What's stopping you?
  106. Newsgroups by scott1853 · · Score: 5, Funny

    I've made some really stupid posts to the newgroups in the past and I used my real name. Can you delete them for me?

    1. Re:Newsgroups by sharkey · · Score: 3, Funny

      Absolutely. What was your name again? We'll also need your username and password, and your SSN for transitive co-location identification.

      --

      --
      "Outlook not so good." That magic 8-ball knows everything! I'll ask about Exchange Server next.
    2. Re:Newsgroups by Wanker · · Score: 2
      Here's the obligatory RTFM post:

      http://groups.google.com/googlegroups/help.html#9

    3. Re:Newsgroups by scott1853 · · Score: 1

      Hehe, I was actually being serious but I guess most people took that as funny. Thanks. I should probably have known that existed but I'm sure I missed a /. discussion about it during one of my cable providers week long outages.

  107. Monitoring & Maintenance by westphalia999 · · Score: 1

    How do you monitor and track the overall status of the database?

    Is there a custom application written to tell you queries/sec, size growth/day, time to return query, overall cpu usage etc of the entire database?

    How do you know when its time to add hardware or time to clean the database?

    How does google handle dead links and/or dynamic links?

    Can you track search trends and correlate with world events?

    Does google track and graph usage by country?

    How many years of development do you think it will be before search engines are able to intelligently make decisions based on queries and return valid results without having those exact words?

    And, finally, just what is your bandwidth usage? :)

    --
    ..this is but a fantasy..
  108. Leveraging distributed computing by pjt48108 · · Score: 1

    Are there any plans to augment Google's server farm with distributed computing, along the lines of SETI@Home? Would this be a benefit or a boondoggle?

    --
    Mmmmmm... Bold, yet refreshing!
  109. The Electric Company by poopsie · · Score: 1

    My question: What's the monthly electric bill?

    10,000+ PCs must burn up a lot of power, and generate a lot of heat which requires a lot of A/C.

  110. Caching, forward, reverse proxy by ezs · · Score: 2, Interesting
    How much of your content is static; do you use edge caching and proxies to offset the load; who's caching engine do you use?

    --
    Evil ZEN Scientist
  111. Internal Admin Utilities? by duffbeer703 · · Score: 4, Interesting

    How do you guys manage thousands of servers spread throughout multiple datacenters?

    How do you handle user accounts? Event notification?

    Do you guys use "enterprise" software like Tivoli or Openview, or did you roll your own solution?

    --
    Conformity is the jailer of freedom and enemy of growth. -JFK
    1. Re:Internal Admin Utilities? by James+Youngman · · Score: 4, Interesting
      A previous poster (duffbeer703) asked
      How do you guys manage thousands of servers spread throughout multiple datacenters?

      How do you handle user accounts? Event notification?

      Do you guys use "enterprise" software like Tivoli or Openview, or did you roll your own solution?

      ... to which I would add,

      How do you balance the need to keep systems up-to-date against your (doubtless demanding) availability requirements? Is there enough redundancy in there that you just flip a machine out while it is updated? Presumably, however the machine is upgraded, this is automatic - you must have too many machines to do it any other way!

      How to you test these updates (security patches, distribution updates, regular changes to your own software, configuration tweaks)? Do you have some kind of enormous test environment containing a copy of 50% of the main Google cache or something? For that matter, how do you do the testing itself? Do you type "Most clowns drink blue fruit juice on Mars" in the search box and just verify that you get 184 hits, and say "right, it works", or do you have a more sophisticated method of testing it (for example do you run your test system against a captive internal dataset)?

  112. If this question gets modded to +5 Funny... by teamhasnoi · · Score: 2

    will you answer it?

  113. Google Technologies by OmniVector · · Score: 1, Interesting

    With technologies like searching google through a program using SOAP/XML, what else (developer related) do you plan on doing to keep on top of the search market?

    --
    - tristan
  114. Sellout by thePredator · · Score: 0

    It seems most search engines sell out at some point and become a part of AOL or Yahoo-like. How long do we have to enjoy Google until greed erodes the precious page we know and love?

  115. I've asked this before by scott1853 · · Score: 2

    Why do you have a neurosurgeons on staff?

  116. What about Kartoo style searches? by JT_Palmer · · Score: 1

    ========================
    To: Mr. Google
    From: Me
    Re: Google's Competition
    ========================

    Most people are visually stimulated and are able to better grasp concepts and relationships when complex queries are organized in visual representaions......

    How does the Google tech department feel about up and coming sites like Kartoo (France) that utilze a new visual (Flash) layout that shows correlation between the returned web sites? and does google have any plans to implement such a scheme into the current searching method?

  117. Do you expect Google to be slashdotted? by Saint+Aardvark · · Score: 5, Interesting
    Dang, torn between modding up questions and submitting one of my own...

    What would it take to Slashdot Google? What do you do to avoid this? Have you been Slashdotted before, either from Slashdot itself or from some other link?

    1. Re:Do you expect Google to be slashdotted? by jesser · · Score: 3, Interesting

      You have it backwards. Google slashdots other sites. Have you ever seen a Geocities hit at the top of a search for song lyrics? Those links never work, because Geocities blocks links to sites that get too many hits.

      You might be able to slashdot Google with a real-life event that everyone learns about at the same time, such as an earthquake or a big breaking-news story like 9/11. But even then, I'd expect Google to be able to handle it.

      --
      The shareholder is always right.
    2. Re:Do you expect Google to be slashdotted? by Platypii · · Score: 1

      I remember on 9/11 google put links on it's front page to it's cached versions of CNN/MSNBC/etc because those sites were lagged to hell.... was the only way I was able to get such info.

    3. Re:Do you expect Google to be slashdotted? by jesser · · Score: 1

      Wow, I don't remember that. Was CNN angry?

      --
      The shareholder is always right.
  118. GRAMMAR NAZI where art thou? by Anonymous Coward · · Score: 0

    Where is the GRAMMAR NAZI and why hasn't he plagued this post for the crime of ending in a period (an inpropper declarative sentance) and not a question mark (an interrogatory sentance)?

    The people of slashdot would benefit from the GRAMMAR NAZI. I kindly ask where art thou?

    (oh and good question by the way)
    (I could imagine that google sucks up more bandwidth from pornography queries than from anything else)
    (they probably have a 20 node cluster by itself to shape the traffic and another 200 node cluster to flush the staff toilettes)

    1. Re:GRAMMAR NAZI where art thou? by unicron · · Score: 0, Offtopic

      Words you spelled incorrectly in your reply:

      Improper
      Sentence

      That is all.

      --
      Finally, math books without any of that base 6 crap in them.
  119. linux by thayner · · Score: 1

    What kinds of changes have you gotten into the linux kernel to support your special needs, and what kinds of changes are you working on getting added? Are you able to use the standard kernel or require special patches?

  120. Testing and Deployment by mo · · Score: 5, Insightful

    How can you possibly test bugfixes/changes that need to get deployed to thousands of machines? Furthermore, how in the heck do you deploy the changes once they're tested. I understand you probably can't describe the exact process, but perhaps you can enlighten us on some principals learned on the subject of CM on such a massive scale.

  121. Question by Anonymous Coward · · Score: 0

    What the hell was all that Dilbert crap about?

  122. Google API by __past__ · · Score: 4, Interesting

    After the introduction of the Google API, some people, especially from the REST camp, criticized the the use of SOAP, claiming it just adds superflous bloat and is generally "unwebby". What do you think about this?

  123. copyright infringement? by static68 · · Score: 1

    Although useful, some regard google's web caching service as a copyright infringement. What is your opinion on this?

  124. I'll give it a try. by Myuu · · Score: 1

    How does google plan to be competitive against engines AllTheWeb while keeping the quality of the returns as high as they seem to be?

    (a/k/a Quality or Quantity?)

    --

    forget it.
  125. HTTP preferred language header by cnoocy · · Score: 1

    Google was one of the first major sites to start using the Accept-Language header to dynamically serve content in the user's preferred language header. My question is: How did google decide to do this? What has the response been?

    --
    This sig is not the Zahir. Lucky for you.
  126. Ive got a good one!!! by sheepab · · Score: 1

    Can I have a job?

  127. Business Model by Launch · · Score: 1

    As adverstising revenue dollars are quickly declining many sites have had to close shop because it is no long fesible to rely on advertising to supplement the cost of having a good website with staff and services that people bother to visit.

    Now that the internet free for all is slowing down, what do you see happening in the next 15 years as far as google's business model and the internet in general? Will big business be the only real players? Will google be the TV Guide Channel of the internet? Will google be able to survive on only advertising revenue, or will it need to supplement it's revenue with selling technlogy (the google lan appliance comes to mind) or will it *gasp* need to charge users?

    --
    Your mammas flamebait.
  128. IPO? by an_mo · · Score: 1

    When are you planning your IPO?

  129. I have one question... by Helmholtz+Coil · · Score: 3, Interesting

    that comes to mind when I think of a huge server farm like Google's: can you give a rough order of magnitude (# of zeros maybe) on what your electric bill is?

    Thanks very much for Google. The more I use it the more I appreciate it.

  130. Network Management Tools/Technologies by kaladorn · · Score: 5, Interesting

    Everyone will ask about bandwidth, incoming lines, etc. (All the network capacity and capability stuff). Here's something a little more off the beaten track:

    What technologies help to support the Google server farm? What kind of automated monitoring and trouble reporting tools are in use? Are they home brew, open-source, or COTS with some customization (scripts, etc)? And if you had to point to one area of network management and say "we could use some improvement or some better tools", what would that area be?

    BTW - Google Rocks! I never use anything else anymore!

    --
    -- Mal: "Well they tell you: never hit a man with a closed fist. But it is, on occasion, hilarious."
    1. Re:Network Management Tools/Technologies by silicon_synapse · · Score: 0, Offtopic

      It's so sad that so few people can count to one. What a shame.

    2. Re:Network Management Tools/Technologies by CmdrPinkTaco · · Score: 1

      As far as technologies are concerned - I have noticed that the front page of Google is surprisingly simple. I understand that Goole wants to be a search engine first, but are the technologies that you use, or have implemented to help crunch the page sizes? How much, if any, of these technologies originate in-house and how often do you rely on outside sources for your technologies?

      --
      Please give your mod points to others, Im at the cap. They will appreciate it more
    3. Re:Network Management Tools/Technologies by 0x0d0a · · Score: 2

      BTW - Google Rocks! I never use anything else anymore

      Hotbot supports searching for pages with links to a file of arbitrary extension (see the advanced search page), which Google does not currently do.

      Google does not provide an FTP search engine, which other search engine providers do (Lycos used to run one).

      Other than that, yeah, Google is great. It's absolutely incredible that the best search engine out there shares my philosophy that pages should be simple and small.

    4. Re:Network Management Tools/Technologies by cygnusx · · Score: 3, Interesting

      > BTW - Google Rocks! I never use anything else anymore!

      I used to say the same 'til very recently. Alltheweb.com has good results too -- I suggest you try them out. They claim their index size is bigger as well: a fact I can attest to: I have found pages on alltheweb I haven't found on google (try this on alltheweb and google to see what I mean -- though I guess this isn't a _very_ good example :-)

      Google still rules though (size isn't everything!) because of its relevancy rankings etc.

    5. Re:Network Management Tools/Technologies by kaladorn · · Score: 2

      Large (unnecessarily so in most cases) pages are the bane of anyone still on dial up, which (since most web designers seem to have forgotten it) is MOST OF THE PEOPLE ON THE NET. Using lightweight pages is a major plus in my eyes.

      --
      -- Mal: "Well they tell you: never hit a man with a closed fist. But it is, on occasion, hilarious."
  131. Does Larry Schwimmer still beep? by John+Harrison · · Score: 1, Offtopic

    Well, does he? No offense Larry, I am just wondering.

  132. Pigeons! Re:Simple question by ilyag · · Score: 2, Funny
    1. Re:Pigeons! Re:Simple question by operagost · · Score: 2

      I'd like to see their implementation of RFC1149!

      --

      Gamingmuseum.com: Give your 3D accelerator a rest.
    2. Re:Pigeons! Re:Simple question by bigsexyjoe · · Score: 1

      It's funny because I'm really drunk right now. So when I looked at that page I read like half way down trying to figure what a pigeon cluster was. Then I was like "Oh, they're joking. Very good, sir" (Sir is Cowboy Neal.) That's really something. Google put that on they're website?! btw I wish their were pigeons on the Supreme Court. Hey Mr. Silerstein! If your reading this, hire me. I'm the best programmer!

    3. Re:Pigeons! Re:Simple question by Anonymous Coward · · Score: 0
      Google put that on they're website?! btw I wish their were pigeons on the Supreme Court.

      and i bet you're orthography is much better when your not drunk.

  133. Keeping a small company feel in a big company by gentlewizard · · Score: 2

    Like everyone else here, kudos for a truly useful and fun utility.

    Google seems to be a classic case of fast growth. What have you been doing to try to maintain Google's unique culture as you grow? In particular, as you add more services to Google and the interface becomes more complex, so too will your internal organization. Will a big Google become a Dilbert-like Google-plex?

  134. Are you guys making enough money? by Control-Z · · Score: 4, Interesting

    Are you guys making enough money?

    I wish you'd give us some banner ads or something, I feel guilty. I don't want Google to go away. :)

    Seriously, why don't you serve banner ads?

    -Dan

    1. Re:Are you guys making enough money? by seangw · · Score: 1

      Google Bill for May: $144

      And that's for a VERY small business (http://www.sawatech.com).

      I'm sure they're not doing too bad, that new adwords service was a good idea.

      This isn't a question is it, how about:

      How is the new adwords program going?

      Solved.

  135. Google Voice Search by NeoYoda · · Score: 5, Interesting

    There has been much debate about what the practical purpose for Google Voice search might be, could you fill us in? Is it really for use in cars?

  136. Googlebombing by HMV · · Score: 3, Insightful

    In fact, there is an opposite concern. Whether through a network of links or through coordinated googlebombing [googlebombing.com], weblogs frequently show up near the top due to the nature of reciprocal linking between the blogs. Not saying that's good or bad (sometimes a sole voice is a better expert on a topic than CNN), but it is what it is. Ranking "links" seems valid enough, but then you ask if that includes machine-generated links by someone's aggregator and the issue becomes a little more cloudy.

    1. Re:Googlebombing by hkmwbz · · Score: 1
      Has anyone actually "bombed" Google to get to the top on a useful topic? All the examples of "Google bombing" I've seen have been rather exotic search words, and not really affecting most searches at all.

      I mean, if someone manages to get their page on the top if people search for "enigma fetus" (which incidentally gets no hits on Google if searched as an exact string), I don't really see that as a problem. Why should I? It is a useless search, and they can keep it if they want to. It doesn't affect the overall quality of Google at all.

      --
      Clever signature text goes here.
    2. Re:Googlebombing by Anonymous Coward · · Score: 0
  137. why the ancient design technique? by Ravagin · · Score: 3, Interesting

    Ahoy love the google, it's the only engine I trust these days. Nevertheless....

    For a site where speed and information delivery are of the utmost importance, and archaic table-based design seems rather strange. Is there any reason you have yet switched to a more forwards-compatible xhtml/css design? (Note that by "design" I mean more the html and css than the visual appearance of it)

    For my own amusement, I've been looking at recoding the google design in CSS, and it's really not that hard.

    Thanks!

    --

    Karma: T-rexcellent.

  138. Not infinitely recursive? by dmarien · · Score: 2, Funny

    Why can't I find a cache'd version of this page anywhere? if the live cache'd version goes down, and there's no cache'd cache'd version, whatever will we do? :)

    --
    dmarien
    1. Re:Not infinitely recursive? by smart.id · · Score: 1

      May I ask why you put an apostrophe in between the e and d in cached?

      --
      blog & fiction: jd87
    2. Re:Not infinitely recursive? by dmarien · · Score: 1

      yes, you may.

      --
      dmarien
  139. Associative Search by dah0ba · · Score: 1
    Hello,

    will Google be able to compete with real associative search engines like LuMriX ?

  140. Management by Anonymous Coward · · Score: 0

    Question (1)

    What do you think are the solid foundations in the management of Google comparing it to other companies.

    Question (2)

    Is thinking Google to spin-off the Software Engineering and Scientific know-how to other market sectors different than the "Search business"? (like IDEO did with the creativity business)

    Thank You
    swain

  141. how sec is google? by joost.be · · Score: 1

    I heard somebody say (scroll up if you missed it) that if google goes down, the internet (the www at least) is crippled. I know for sure that google is on the top of my hit charts On the other hand, there's a lot of talking going on about the size of google's serverfarm, the hits they get, the bandwidth blabla, and that got me wondering about security.
    What is google doing to keep it's pigeons save, do the have skyscraper high packet filtering beasts at their border, do they have a server farm just for that? What about software, what are they running? Are they using a flock of pc's with linux and iptables, are they using anything commercial like pix, raptor, fw1?? How about nids, is it even possible to sniff that kind of traffic?
    I imagine you don't like things to get in the way of performance, so what are you using, something custom?

    Or maybe you're betting on security through obscurity and therefore not answering :)

  142. The War on Terror by Launch · · Score: 0, Troll

    Where is Osma Bin Laden, or is that more of an AskJeeves kinda question?

    --
    Your mammas flamebait.
  143. AOL and GOOGLE by Anonymous Coward · · Score: 0

    AOL recently switch to google for their srvice. How much pressure does that put on you to perform better?

  144. Corporate culture question... by jzarzosa · · Score: 1

    Hello Mr. Silverstein,

    First off, I have always been a huge fan and user of Google. I come from a SysAdmin background, and as part of a team at one of my old employers, our job was to maintain the 99.9% uptime of a large Linux clustered environment (1200+ servers). These machines were spanned across multiple datacenters in various geographic locations in an effort to increase efficiency, performance, and redundancy. Using network monitoring applications, we had the ability to receive pager notifications whenever an issue occurred (which at times would be often and at awkward times... when it rains, it pours!).

    Considering that this is a very high level description, how do Google's methods differ? Also, could you please briefly explain the motives behind the recent inception of the Google Search Appliance? I think it is an incredible system (it took my old employer a long time, a lot of effort, and tons of man hours to create a decent search capability into its custom built intranet site!).

    Hopefully this message will be selected as one of the ten to be forwarded on to you, and so I thank you in advance for your time!

    jz

  145. Is Kartoo style competition? by JT_Palmer · · Score: 1

    ======= To: Mr. Google From: Me Re: Google's Competition ======= Most people are visually stimulated and are able to better grasp concepts and relationships when complex queries are organized in visual representaions... How does the Google tech department feel about up and coming sites like Kartoo (France) that utilze a new visual (Flash) layout that shows correlation between the returned web sites? and does google have any plans to implement such a scheme into the current searching method?

  146. Personal fav Operating Systems (Re:Dream system) by perttu · · Score: 1

    Top 3 favourite *NIX/*BSD operating systems ?

    Top 3 least bad M$ OSes ?

    and top 3 OSes that don't fit in the first category ?

    --

  147. Google toolbar for open source browsers? by apol · · Score: 4, Insightful
    I've become addicted to the Google toolbar. It only works with IE which I use at work since I am forced to use windows there. Now with Mozilla 1.0 and my constant wish to minimise the usage of Microsoft products, I am faced with the dilemma of keeping IE or loosing the Google toolbar.

    Why haven't you implemented yet the toolbar for open source browsers? Are there technical difficulties or rather lack of interest from Google?

    1. Re:Google toolbar for open source browsers? by seanmeister · · Score: 3, Informative

      You might be interested in this: http://googlebar.mozdev.org/

    2. Re:Google toolbar for open source browsers? by Luveno · · Score: 1

      The Google toolbar came with Galeon out-of-the-box with Mandrake 8.2.

  148. Failure by porkface · · Score: 1

    At Google, I get the impression you don't buy solutions, but rather you are forced to (always?) create your own solutions. While this seems like more fun than most of us get to have at work, I'm curious... In your time at Google, which single technology or innovation that you implemented failed the worst? Tell us a story.

  149. File System by Launch · · Score: 1

    What type of file system do you use, and how large (in bytes), is the index, it must be quite large with the cached pages and image search.

    --
    Your mammas flamebait.
  150. What about Power and air-con demands ?. by openmtl · · Score: 1

    Curious as to the power and air-con requirements for your server farm ?. Any chance of a real-time sensors for the farm such as a plot of farm size and page requests verses network bandwidth verses power and inside/outside temperature and humidity ? Could be handy reference site for some large scale modeling for others. The Google farm must draw a fair amount of juice and take a few tons of cooling to stop it from glowing !.

    --

  151. Basic Stats by dbc001 · · Score: 2, Interesting

    Can we get some basic stats with the interview? I mean, we all know that Google gets a lot of traffic but how many hits per day/hour/minute? How big is the server farm? How much bandwidth are they eating? How about some other interesting stats? (I'm sure they have plenty!)

    dbc

  152. Three letter agencies by Anonymous Coward · · Score: 4, Interesting

    Can you tell us anything about how you are working
    with the various intelligence agencies to provide
    them information about seraches that are of interest to them?
    Are you thinking of providing SSL access to your
    web pages so that these agencies will have to work
    with you instead just monitoring your network
    traffic?

  153. Cost of running... by IanBevan · · Score: 1

    I'd rather pay a fee to Google and know that it will be there in the future, than see it go the way of Infoseek et al. Is Google's immediate future secure and do you see a subscription model on the horizon, either through necessity or profit ?

  154. Are you a gamer? by Red+Oktober · · Score: 1

    What are the specs of your home PC(s), and what games, (if any) do you play regularly? What OS, and general programs to you prefer to run?

  155. I'm curious. by enkidu55 · · Score: 1

    To know how decisions are made at google? How many people are involved in the development process and how many are involved in the hardware end of things. I look around at our current infrastructure and think that I need about five more hands. Our product development team seems to think that a project slipping six months just means they get more vacation time. How do you guys keep things on track and running?

    Frustration is the name of the game

  156. question for Craig re: search languages by Jucius+Maximus · · Score: 3, Interesting

    How/why in the world was a Google search in Klingon developed!?!

    1. Re:question for Craig re: search languages by data_the_android · · Score: 1

      Because people have rretty much made Klingon a complete language(40k+ words i think). So why not have it?

    2. Re:question for Craig re: search languages by WhiteDragon · · Score: 1

      mod this guy up!!!!!

      --
      Did you mount a military-grade, variable-focus MASER on an unlicensed artificial intelligence?
    3. Re:question for Craig re: search languages by lommer · · Score: 1

      Well, at least they haven't included klingon in their translator. That'll be the day...

  157. How do you benchmark your software? by Nijika · · Score: 3, Interesting

    How does Google benchmark software? Eg how do you benchmark Apache, SQL, your CGIs etc...

    --
    Luck favors the prepared, darling.
  158. Censorship by aiabx · · Score: 1

    Are you ever tempted to drop repellant web sites from your indexes? If so, why? If not, why not?
    -aiabx

    --
    Just this guy, you know?
  159. Cache for content. by bhsx · · Score: 1

    There have been some questions lately over the legal /copyright issues surrounding internet archives. In a discussion of the topic here on slashdot, others pointed out the implications of your past, possibly as a child/minor, coming back to haunt you in the form of your newsgroup archives. What, if any, are your thoughts on these topics; and have you faced any interesting legal/moral dilemnas in those areas?

    --
    put the what in the where?
  160. And the Sergey says.... by qurob · · Score: 4, Informative


    From one interview...

    Jason: What led to Google's decision to use Linux? When did that start?

    Sergey: Well, Larry Page and I were in the Stanford PhD program in Computer Science. And we developed Google there. The way the computer science program worked is there was a hodgepodge of computer equipment lying around, and we would grab whatever scraps we could. We had all kinds of computers: HPs, Suns, Alphas and Intel's running Linux. So, we gained a lot of experience with all of those platforms.

    When we started Google, we had to make the decision of what we wanted to use. Of course we chose Linux, because it is the most cost effective solution.

    PCs are not only much cheaper these days, but we can also get them very quickly, because they're such a commodity item. That's an incredible benefit. We just installed another 1,000 computers and we got that done in a few weeks. That's really hard to do with any other kind of workstation. I think that's an advantage that people don't entirely realize.

    Jason: Did you view it as being better, or was cost the main reason?

    Sergey: It was better in some ways. Certainly for our purposes, we felt the support was better. For example, the actual kernel authors will respond to problems pretty quickly. They are especially responsive to Google nowadays, since we're so widely used. We can have a 15 minute turnaround. You can't really beat that for support.

    That was an important factor, but frankly, the cost was a bigger issue. PCs are so cheap, which is very important. Sun's Solaris is probably more stable than Linux on PCs. It's hard to determine the blame, whether it's the hardware or the operating system. But, it's a minor difference.

    Jason: Then, does all of your support come from newsgroups or do you actually pay for it through Red Hat?

    Sergey: We have an operations team of about ten people, which helps a lot. And other than that we check newsgroups and e-mail the authors of the code. Usually, if it's a problem we can't figure out, we go straight to the authors.

    Jason: Is Linux used on desktops at Google?

    Sergey: It depends. Engineering mostly runs Linux. Business development/marketing runs Windows. Actually, I use Linux with VMWare running Windows. Some people have two computers, particularly some people in engineering who do UI development and need to test things out on Windows platforms. I find it better to just use VmWare and have one computer.

    Jason: In a technical sense, what does Linux lack? What does it not provide?

    Sergey: The 64-bit file system, which I know they are working on. It's slowly coming around. I think there are still occasionally some stability issues. I'm not saying Linux is unique in that respect, but you definitely want to have reliability. There are some issues dealing with higher memory systems. If you get to 2GB, and you try to push it past that, we encounter various problems. I know we've had some trouble with the network stack when we really push it hard. In terms of having lost most connections from lots of different machines.


    And from another...

    How is Linux used at the Google Projects? Why was Linux choose to improve Google search engine?

    Sergey Brin: Actually, we currently run over 6,000 RedHat servers.

    Linux is used everywhere...on the 6,000+ servers themselves, as well as desktop machines for all of our technical employees. We chose Linux because if offers us the price for performance ratio. It's so nice to be able to customize any part of the operating system that we like, at anytime. We have a large degree of in-house Linux expertise, too.

    Most of our administrative tools were developed in-house, as well.

  161. What's the back end? by Second_Derivative · · Score: 5, Interesting

    I don't remember what HTTPd they're running but it sure as hell isn't apache. Someone said that they get 1k hits per SECOND; what do you use to shape that insane amount of traffic? What is the '/search' page coded in? What databases are used to index a terabyte of data? How do those 10,000 nodes find the data they need to quickly? what sort of interlinks are used?

    How to you build a cluster like a war machine, in other words? ;)

    1. Re:What's the back end? by Darkness+Productions · · Score: 1

      Hmm, I seem to remember the web server being something that they wrote custom for their purposes. netcraft shows them as running GWS/2.0, which I would take for Google Web Server. They're running it on linux, but that doesn't mean what distro they're using. I also remember them (somewhere on the google.com site) saying that they had mixes of Linux, BSD, Solaris, and others.

      I don't know, but I like it...

    2. Re:What's the back end? by Anonymous Coward · · Score: 0

      Imagine a beowulf cluster of..... oh wait, never mind.

  162. How prepared is Google for hack attempts? by Anonymous Coward · · Score: 0

    With Google being a key tool to the usefullness of the internet as other people posted it could be viewed as a weak link. How prepared is Google for events such as massive DOS attacks, or other hack attempts?. If Google were to go down, for any reason, the usefullness of the internet as an information locating tool would be vastly decreased.

  163. Google Infrastructure & Ad Systems by Anonymous Coward · · Score: 2, Interesting
    I'd like to learn more about Google's infrastructure, particularly its advertising systems.
    • As a web user, I admire Google's decision to stick with non-obtrusive text advertisements, and it keeps me coming back to your site day after day.
    • As a recent customer of that system, I felt let down at the success rate that, in my anecdotal case, Google ads provided -- a small bill from Google, but not many hits from it and no sales as a result. I realize that this is just one anecdotal example, and maybe not representative of general patterns, but still -- it was disappointing.
    • As a sysadmin for a busy news site (and a sysadmin that is focusing on ad delivery systems at that), I'm curious about what parts of Google's ad system works and what parts don't, both from a technical and business point of view.

    From what I've read about Google, it seems like the same server farm nodes spend time on both searching (crawling, indexing, and storage) and on queries (web searches from customers). Is this really the case? If so, is ad delivery another part of this system being carried out by all nodes? Put another way, how homogeneous are your server nodes: do they all do an equal share of searching, responding to web requests, and participating in the ad system? If server farm nodes are not as homogeneous as I'm thinking, then how are the different functional aspects of Google's service broken down -- crawling, indexing, storage, queries, ads, and any administrative services you need internally -- and how much of your resources are being thrown particularly at the ad serving aspects of your site? Do you have some machines focusing on ads, or is that folded in with search queries, and that in turn is folded in with the actual business of searching? No matter how it's broken down, I can imagine that it must be fiendishly complicated, and I'm continually impressed at how smoothly you manage to make it work.

    From a business standpoint, how happy is Google with the ad strategy being used? Is it producing a significant portion of your revenues, or are you getting much more from the search services & hardware solutions you're providing to paying clients? How flexible is the current ad delivery system? I.e. if you're selling keyword matching to ad customers to a system distributed across thousands of servers, and promising those customers that they'll get, say, 100,000 page views, how is this work synchronized across the servers doing search queries? It seems like this could all quickly get in the way of the search services Google is really trying to offer, but it's hard to imagine if it would help more to do it all "inline" with the rest of the site (but possibly slowing everything down fractionally) or breaking it off into a separate system (but adding more internal network traffic, potentially making it harder to do up-to-date reporting, etc). More broadly, what tools are available to you internally for monitoring your overall quality of service? Do these systems co-exist with the rest of the site, or are they also broken away -- and again, if they are, how to you keep reporting information current enough to be useful?

    Basically, I'm curious about the infrastructure, both from a technical and a business perspective. There have been a number of papers & articles over the past couple of years documenting how Google maintains it's server farm for delivering search services to users; I'd like to know more about what's going on at the back to keep that forward-facing system running so well.

  164. marketing is funny by Anonymous Coward · · Score: 0

    Anyone think it's funny that the new google applicance banners coincide with this 10 question thing????????

  165. Speech recognition by harmonica · · Score: 5, Interesting

    Are there plans to index audio files (and the audio tracks of video files) so that these could be searched as well? I would guess that existing speech recognition packages could be reused for this purpose so that development would not be too complicated.

    Recognizing text in images and videos and indexing that would be a similar task. I know that Google Catalog Search must be doing some OCR already, but I have no idea if this would take too many CPU cycles if applied to all images, or if there are other problems (the images themselves already get downloaded for the image search, so bandwidth should not be the problem).

  166. Google slashdotted by Senior+Frac · · Score: 1

    Oh great. Now you've gone and slashdotted Google. Way to go.

  167. Should people use more than one Search Engine by tinla · · Score: 2

    I have google as my home page. I have the IE toolbar on my windows machine and the Galeon eqiv on my Linux box. In short - Google is normaly my first port of call.

    Despite this I've always made a consious effort to keep a backup search engine that I try if I ever find myself using more than a "few" of the results from Google. I find that if there are, say, 10 good pages from google there will be at least 3 that AllTheWeb (for example) will find that Google didn't have in the top 50.

    I've decided that its very dangerous to use one SE to the exclusion of all else because there will always be holes, index bias, algorithm oddities etc that hide some of the info you wanted. As long as your other SE isn't simply playing follow-the-leader you'll benefit from the little effort of using a second source.

    The idea is old of course, any researcher will tell you that having one source is poor methodology.

    Does Google recognise this idea and is there anything you plan to do about it?

    A Super-advanced search that allowed you to alter the weightings of link importance, document age, the domain it sits on etc etc? That'd surely be too much work for the average Joe.

    I don't have an answer or I'd be sat coding it up but I do find searches that simply work better elsewhere. Maybe if you had 3 algs with a 70%, 20%, 10% mix in the results we'd all be a little richer.

    Discuss...

    --
    0daymeme.com: Great stuff.
  168. Metrics for success? by Anonymous Coward · · Score: 0

    What do measurements do you use to measure the success of Google? According to the May 2002 Media Metrics Report, Google is #3 in search behind MSN and Yahoo. (Although the difference between the three is probably within the margin of error). And going forward do you expect to gain more ground through technical solutions, or though more traditional marketing solutions?

  169. Google Search Appliance - PageRank? by nanobug · · Score: 3, Interesting

    Google's PageRank technology works very well on the web with lots of pages pointing to lots of other pages.

    The Google Search Appliance, however, is targeted at an office environment. Most of the documents (especially the non-html ones) in the typical office stand alone and do not have links to each other.

    How has Google modified or complimented (if at all) the PageRank algorithm to make it more suited to an office environment?

    I am currently pushing management at my site to purchase a Google Search Appliance, so I need an answer to this to help justify the change from our existing search application. i.e. without a good PageRank score, how does the Search Appliance order the result set in a useful way?

  170. Accepting ads for net abuse by Frater+219 · · Score: 2
    Since you are a technician with some significant Net experience, I can only presume that you are already familiar with the harms of spamming. One of the notable developments in spamming of recent has been the widespread use of spamware -- mail client software designed for the express purpose of abusing other people's networks by spamming.

    Dealing in spamware is illegal in several U.S. states and European nations. By and large, spamware programs have no lawful use -- they are built to abuse open relays and proxies, fraudulently alter mail headers, and obfuscate spammed messages to make it harder for victims to track down the spammer. Spamware is not merely a "burglar's tool" useful for lawless action -- it is like a locksmithing kit specifically tailored to be excellent for burglary and no good for legitimate locksmithing, or a gun somehow built to be perfect for murder but nonfunctional for self defense.

    Nevertheless, Google accepts ads for spamware -- as well as ads for other spamming services. Google today carries advertisements and thereby accepts sponsorship from dealers in network abuse. Given the real and present danger that spamming poses to the usefulness of the email facility, and the amount of time and money that today's Internet-using businesses and people spend defending themselves from this form of theft -- how can Google justify accepting this sponsorship?

  171. Open Source by InodoroPereyra · · Score: 2, Interesting

    First of all, thank you so much for providing the most useful site on the net.

    I understand part of the success of Google has to do with the efficient use of open source/free software. How about in-house software development?. Do you folks develop open source software as a way of giving back to the community ?. What are your thoughts on free software ?

  172. If Google has all answers by RoshanCat · · Score: 1

    How come *you* still have to answer them.

    Cheers,
    Roshan

  173. OT .sig comment by lizrd · · Score: 1
    In the US more than 95% of the privately held land is owned by only 3% of the population.

    I'm interested in the stat that you have in your .sig. Where did that stat come from, how was is calculated (area or $ value) and why did you find it relevant to put in your sig? It seems like a very interesting topic for discussion.

    --
    I don't want free as in beer. I just want free beer.
    1. Re:OT .sig comment by killmenow · · Score: 2, Funny

      78% of statistics are made up on the spot ;^)

    2. Re:OT .sig comment by Anonymous Coward · · Score: 0

      I've heard that before.... 76.2% of all people polled suggested that 93.6% of all the people that they knew had heard that, or something 89% just like it.

    3. Re:OT .sig comment by kiatoa · · Score: 1

      I found the quote on http://www.henrygeorge.org. I think it is calculated on $ value but I don't know for sure. I think it is a very interesting topic for discussion and that is the only reason I use it as my sig. If someone has pointers to information that refutes or substantiates this statistic I'd love to hear it. Also I'd like to know of a good forum to discuss this as it is rather off topic here.

      --
      90% of the wealth is in 2% of the pockets. Bummer to be in the majority.
    4. Re:OT .sig comment by imr · · Score: 1

      It's not that off topic.
      After all, you're telling us all that you can't find pointers to information that refutes or substantiates this statistic and a good forum to discuss this on a page dedicated to the best search engine so far.
      I guess it means there is room for improvments.

    5. Re:OT .sig comment by dynoman7 · · Score: 1

      5/4 of all people have problems with fractions...

      --
      Blarf.
    6. Re:OT .sig comment by Anonymous Coward · · Score: 0
      Heil Hitler. I love the Nazis.

      Laff time: Jew Boy

  174. Regexp Support Someday? by ReadParse · · Score: 4, Interesting

    A big part of Google's strength is in the supported search syntax, most notably that you can search for phrases instead of just keywords, that you can filter OUT certain phrases or keywords, and that you can search for content on specific sites, or NOT on specific sites. The next step for me and probably a lot of other Unix/Perl types is regular expression support.

    For example, let's say I'm looking for 80's brat pack member Anthony Michael Hall (not that I would do such a think), but I can't remember his middle name. Looking for "Anthony Hall" will do me little if any good, but looking for "Anthony \w+ Hall" could do the trick nicely.

    Another example is that the user can provide their own limited fuzzy searching, by searching for optional prefixes and suffixes along with the root, instead of having to get the word or phrase exactly as it's indexed.

    Thanks,
    John

    1. Re:Regexp Support Someday? by Istealmymusic · · Score: 2, Informative
      For example, let's say I'm looking for 80's brat pack member Anthony Michael Hall (not that I would do such a think), but I can't remember his middle name. Looking for "Anthony Hall" will do me little if any good, but looking for "Anthony \w+ Hall" could do the trick nicely.
      You can already do this by searching for Anthony * Hall . I use Google's wildcard feature all the time, definitely not a replacement for regexes but it works.
      --
      "The lesson to be learned is not to take the comments on slashdot too literally." --Vinnie Falco, BearShare
    2. Re:Regexp Support Someday? by ReadParse · · Score: 2

      Ah, indeed it DOES support wildcards. Something told me that I had checked and had found that it did not. I'm now glad that I got moderated down a point (for being "overrated") after having reached the elusive 5, so as not to get too embarrased in public about that.

      Yes, you're right -- no substitute for regular expressions. But definitely useful for all my brat pack searches :)

      Thanks,
      John

  175. Searching for new pages by good-n-nappy · · Score: 1

    I am interested in the beta news searching function that is currently available on Google. My assumption is that you are exploring this because it takes several days to crawl the entire web so the normal indices can be out of date. And likely many people are looking for a particular story while it is still "news." I'm wondering if you see any long term, more general purpose ways of keeping your indices up to date. For example, do you see any tricks with looking at modification dates of pages or diffing the contents against your cache? Maybe you are already doing this to some extent?

    --
    Never underestimate the power of fiber.
  176. Next big thing? by byee · · Score: 5, Interesting
    A few years ago Google came along with their new ranking algorithim and blew away all other competition. Now it's the only search engine I, and the vast majority of the people I know, use.


    What is Google doing to keep itself on top? Do you think there is a lot of room for improvement? How do you think web searching can get better?

  177. What's hte process then by Anonymous Coward · · Score: 0

    So what exactly happens or have to happen from when I get my website online, to where i can see it on google? DO i need to submit it to google to have it appear or will it find its way there eventually and HOW??

  178. "The Slashdot Effect" by Zzootnik · · Score: 4, Interesting

    So I'm truly surprised no one has asked this one yet, as it's the first thing that popped into my head...

    The masses of Slashdotters have slashed and dotted many an unlucky website over the years...Pushing webservers to their limit and often breaking them outright...

    With Google's Massive resources, Is there any noticeable difference when a /. story gets posted and people go stampeding to google to find out more? Or is that happening right now? (I'd hate to think of myself as part of a huge herd of individually acting DDOS'ers, but unfortunately, that's about what it ends up being...)

    --
    Sig currently under construction. Mind the gap....
  179. Google toolbar for Netscape / Mozilla? by sphealey · · Score: 2
    When will you be releasing the Google Toolbar for Netscape 6 (7?) and/or Mozilla?

    sPh

  180. Operating a network? by Anonymous Coward · · Score: 0

    How do you operate a network of the size that Google uses? What operating tools and software do you utilize to manage your servers. Do you use any special techniques for gathering network usage (e.g MRTG vs. other data warehouse)? Lastly, what's the biggest problem you've faced from an operational perspective?

    Thanks,
    a cowardly systems and network administrator

  181. Combating The Evil Scourage Of Network-Land by psypete · · Score: 1

    How do you deal with hackers? I would ask if anyone has ever hacked you, but the answer is pretty obvious. Do you sick the feds on them? Do you deal with them in your own "internet"-style revenge? Do you care at all?

  182. What about incorporating Cyc? by raylee5 · · Score: 2, Interesting

    Since Cyc has done a stint with Lycos, what about with the Google engine? Especially since Google maintains one of the largest and most relevant databases, a single question asked may result in huge amounts of additional, relevant information flowing into Cyc?

  183. how about what browser by johnjones · · Score: 3, Interesting

    everyone asking about hardware and to be honest its not what makes google good
    after all thewayback machine does kind of the same thing

    its software

    so this is my question

    what browser do you use ?

    regards

    john '1.1alpha' jones

  184. Recruitment with Google by dazdaz · · Score: 1


    I applied for to Google Japan and did'nt receive a reply. I then posted a followup and still did'nt receive a reply. Surely, it's courteous to reject candidates as opposed to ignoring their applications.

    Is Google really a great place to work in, or is there a lot of hype going on about what a great place it is to work in.

    Please correct this misconception I have as from what i've read, it seems a fantastic environment.

  185. Database by totallygeek · · Score: 2
    I have always been amazed by Google, and can certainly see why Linux was chosen for its Unix-like use without the Unix-like pricing. However, there have always been databases for Linux that would work for your environment, and especially within the last few years. Has there been talk of or pressure to use something like DB2 or Oracle for your indexing? If so, how has that been countered?

  186. Strengths, Weaknesses, Opportunities, Threats by thelizman · · Score: 1

    All The Web just announced that their index has surpassed yours in number of pages indexed. As a webmaster, I have also observed that not only does Googlebot fail to fully index my site, but its index of sites (mine as well as others) grows increasingly outdated. While I still worship Google, reality dictates that Teoma and All The Web are eroding the hegemony of Google. What technology upgrades to your basic search service does your team envision in order to keep Google on top?

  187. Operating System by evilviper · · Score: 2

    Every major operating system has it's example of a major corporation which is perhaps the flagship company associated with the OS. e.g. Yahoo and FreeBSD, Earthlink/Ebay and Solaris, Adobe and OpenBSD, and Google and Linux.

    So, having had to deal with Linux on a large scale, would you say that Linux was the right choice for what you are doing? With the benefit of heindsight, would you rather have gone with another operating system, such as FreeBSD, OpenBSD, Solaris, AIX, etc?

    --
    Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
  188. Has Google been sued? by dazdaz · · Score: 1


    With all of these linking issues, think NPR controversy, has Google ever been sued for trawling a website for which the owners did'nt want indexing, assuming they don't know of disabling the robot.

  189. Pictures would be cool. by bakuretsu · · Score: 2, Interesting

    I think I speak for everyone when I say I'd like to see this Linux cluster Google is running. Just a Matrix-esque shot of the wall(s) of rackmount servers would be enough to make me happy.

    --

    --
    The Bailiwick - DESIGNHUB2005
  190. Image / video searching by openSoar · · Score: 1

    Are there any plans to implement the generic searching of images/frames within movies? In general terms, it'd be nice to be able to look for images using something else other than keyword-matching the page it came from. Specifically, I'm thinking of the ability to say "find me more images similar to 'this' one" where 'this' is the url of an existing image.

  191. LINUX profitability by Anonymous Coward · · Score: 0

    Though LINUX is almost free, people must make a living.

    What will happen when more and more application running on Linux will turn to people asking for money???

  192. Are you making any money yet? by whatthef*ck · · Score: 1

    All of the technical questions are pretty much moot if google has no long-term future. Are you guys profitable yet? If not, does it look like your current business model will lead you to profitability before the wolves find their way to your door?

  193. Mozilla Google toolbar by dazdaz · · Score: 1


    When will a "mozilla compatible" Google Toolbar be made available and what can we do to help with this project.

    When will a "Konqueror compatible" ...

    1. Re:Mozilla Google toolbar by ironfroggy · · Score: 1

      maybe we need a standard browser toolbar API.

  194. DOJ or HLS talking to you? by randomErr · · Score: 2

    So is the Dept. of Justice(DOJ) or Home Land Security(HSL) talk to you? In partictular, are you(google.com) being asked to track certian 'security' queries(bombs, antrax, Command Taco, ect.) of your system?

    --
    You say things that offend me and I can deal with it. Can you?
  195. who pays for the bandwidth required? by cdn-programmer · · Score: 1

    Who pays for the bandwidth requited to server this many requests and how is the delivery organised? Does it come generally from one major server farm plugged into the backbone or is it distributed?

  196. editor by ppetru · · Score: 2

    vi or emacs?

    --

    Petru
  197. searching by alexo · · Score: 1

    Not directly related to the server farm but nonetheless...

    Google currently restricts searches to:
    * 10 words or less
    * No wildcards
    * No stemming
    * No boolean expressions to speak of
    * No proximity operators.

    While I can use AltaVista (or some other competing search engine that supports more complex queries) for Web search, there is no alternative for Usenet search.

    It does not impress me that Google presents its list of relutls in less than a second if I have to spend several minutes composing different queries in an attempt to overcome the abovementioned limitations.

    Why won't Google offer a more advanced query language as an option?

  198. will it still be free by Anonymous Coward · · Score: 0

    I use google all the time! But I was wondering if the
    serach engine would be free forever. Does google have any plans to charge users to use their search engine in the near or far future? I hope it does not.

  199. Public Utility by killmenow · · Score: 1

    Craig, as everyone will attest, Google has become indespensible. As such a tool, virtually required for useful Internet work, I see Google as similar to a public utility. What do you see as the pros and cons of being viewed in this light?

  200. Why does it take to so long to get a new newsgroup by Anonymous Coward · · Score: 0


    I know many people have requested alt.tv.alias be carried by google groups, this has been going on for months, and no reply from google...?

  201. Waiting by Anonymous Coward · · Score: 0

    for the inevitable straight answer to this question. I'll say it: you can remove your old posts yourself.

  202. Philosophy of leading google by hnchou · · Score: 1

    You head google's technology and people who make the technology possible. What's your philosophy of leading such a team?

  203. Natural Language Processing Market? by Anonymous Coward · · Score: 0

    Natural language processing has been advancing rapidly in the last 10 years due to new conceptualizations( statistical and connectionist approaches ). Do you see these advances making thier way into the mainstream IT market in the near future? Does Google have any plans to implement systems using these new technologies?

    -jjzeidner@yahoo.com.nospam ( take off the .nospam please )

  204. Data mining search queries by AAAWalrus · · Score: 1

    Since you have a personal interest in efficiently clustering large data sets, I was wondering if Google did data mining research on submitted queries. For instance, looking at how many searches are submitted by one person in a short period of time might indicate whether or not they found what they were looking for. An interesting piece of data might be, on average, how many unsuccessful searches are submitted before a successful one is returned. Combining this with being able to categorize searches by topic might lend itself to making suggestions on search techniques for different types of searches.

  205. Logos by Tablespork · · Score: 1

    What was the deal with the Dilbert logos? Were you just having fun, or were you actually looking for new logo submissions?

  206. Gov.Agencies by Anonymous Coward · · Score: 0

    Has Google beeing contacted by Gov Agencies like FBI, CIA to search certain information? and has google found it?

    With all the information google has, is there a plan for the future to process all that information to get some kind of results, or an intelligent robot that learn?

  207. More popular cached pages by Richard5mith · · Score: 2, Interesting

    I read recently that you cache many of the more popular pages every 15 minutes, which was a surprise. Exactly how many pages are counted in this "popular" set, how do you decide when to move a page from the normal every 28 day rotation to this one, and what's the process for getting one of these pages (say from my server) cached on yours, indexed, page ranked and available across your whole server farm for searching.


    -- Chatbear - http://www.chatbear.com -- Free messageboards, Highly Customisable

  208. Uptime by Tablespork · · Score: 1

    What's your uptime? How often do you crash? How often do you have major problems and/or what are your biggest problems?

  209. opening up the data for science by fons · · Score: 3, Interesting

    While doing some scientific research I discovered that the the Google Seitgeist is a very interesting source of information for research in the area's of social and communication sciences (marketing, lifestyle, ...).

    However, the available information and the explanation of used methodology is too limitid to make this information usefull scientifically.

    This is a shame because the Seitgeist is just the tip of the iceberg. There must be an enormous amount of information available.

    I know for sure that a few professors I know would have a field day if they were to be able to analyse all this data.

    My question is: would it be possible to open all the available data to scientists for statistical analysis?

    It doesn't even have to be free I think. Universities and research organisations pay a lot of money for survey's that result in datasets that are relativly small to the dataset available at Google.

  210. Use of Python by BobRoss · · Score: 5, Interesting

    I have heard that Google uses Python extensively to manage its data, grab new data, etc.

    As an avid fan of the Python language, I am interested in exactly how Google puts it to use. Can you clue us in?

    P.S. - Keep up the good work!

  211. The interview by inkfox · · Score: 2
    "Craig, can you tell me..."

    Your question netted 53,496 possible answers. I've filtered out similar answers, so let me just give you the first 1,000...

    --
    Says the RIAA: When you EQ, you're stealing bass!
  212. Distro by Anonymous Coward · · Score: 0

    Which distribution do you use on your server? How do you keep them updated?

  213. downtime disaster stories by fons · · Score: 4, Interesting


    We've all had servers crashing on us just before a deadline. We've all had to go to the office in the middle of the night to prevent a disaster. (we've all been hacked by a scipt-kid, once)

    Do you have any stories of disasters or difficult moments in the datacenters that kept you all up for a few nights in a row, but went by unnoticed by the public?

  214. uh, got toolbar by benthayer1 · · Score: 1

    if google is so linux heavy, where is the google toolbar for a linux browser?

  215. Investing in Google? by nautical9 · · Score: 1
    Sure, this isn't exactly a technical question, but I think there may be some interest in it's answer nonetheless.

    Does Google have any plans to become a public company, so all of us devoted fans can start throwing our hard-earned money into your IPO?

  216. My question... by Kickstart70 · · Score: 4, Funny

    What's the root password?

    :)

    Kickstart

  217. The New Competition: Teoma and Wizenut by cjohnson · · Score: 1

    You have young upstarts headed your way. But fostering competition can be good. What kind of advice would you give to a company like Teoma or Wisenut in order to keep up? What do you think gives you the edge, technologically, over these guys? What do you think they do well that Google does not?

  218. power, censorship and transparency by mml · · Score: 1
    To what extent is google's policy determined by the technical staff? There are a bunch of things which have happened on google which suggest that there's some sort of body making policy decisions. Who makes these decisions?

    Examples:

    Pages can be manually removed from google for doing dubious things (e.g. cloaking). Someone must have actively decided that the benefits of this type of censorship exceeded the costs.

    Someone decided to pull the anti-scientology pages, and then someone decided to put some of them back. Who?

    Someone decided against making a list of removed sites so that we can all see what is being supressed by google.

    Matt

  219. Does he ever feel lucky? by Newer+Guy · · Score: 2, Funny

    Well...does he?

  220. If we're dragging out penis rulers ... by Anonymous Coward · · Score: 0

    I've had root on ~13000 Akamai machines.

  221. My secret shame... by Anonymous Coward · · Score: 0

    is that i use IE, but i must, for the google bar rocks my bitch ass. will it be available soon for any of the other browsers?

  222. speed? by Anonymous Coward · · Score: 0

    What's your biggest bottlekneck for speed?

  223. See who you are interviewing! by jbarr · · Score: 1

    Craig Silverstein's picture is
    here and pictures of a number of other employees are here.

    --
    My mom always said, "Jim, you're 1 in a million." Given the current population, there are 7000 of me. God help us all!
  224. Beowulfs ate all the pigeons! by brodin · · Score: 1

    They couldn't get it to work because the Beowulfs (beowulves?) had quite an appetite for squab!

  225. Python at google by Corvus · · Score: 1

    What role does the programming language Python play within Google?

  226. Ogle the Google by Anne_Nonymous · · Score: 2

    The Google image search is wonderful, but shouldn't that have been called Ogle?

  227. That's a good question, My question is: by DRAGONWEEZEL · · Score: 4, Interesting

    Is there anything on the internet that you personally couldn't find with google and if so what was it?

    p.s.
    Thanks for all your help with my school research

    --
    How much is your data worth? Back it up now.
    1. Re:That's a good question, My question is: by Anonymous Coward · · Score: 1, Funny
      Is there anything on the internet that you personally couldn't find with google and if so what was it?

      I once used Google to search for "porn" and didn't find anything ;)

    2. Re:That's a good question, My question is: by Anonymous Coward · · Score: 0

      How do you know it's on the internet if you've never found it?

      Stay in school.

    3. Re:That's a good question, My question is: by Anonymous Coward · · Score: 0

      Umm, just try to search for piece of java code like '.equals("")' :-)

  228. Google Toolbar for non-IE/Windows boxen? by mshomphe · · Score: 2

    The Google toolbar is one of the coolest things about IE (maybe the only one <grin/>). However, you need a Windows system with IE in order to install and use it. Are there plans to have to toolbar available for Mozilla, and non-Microsoft systems in general?

    --
    She sat at the window watching the evening invade the avenue.
  229. Weighting of heuristics by jolshefsky · · Score: 5, Interesting
    As the web develops, methods of matching a set of search keywords to a set of websites related to those keywords must change with it. I envision that the Google algorithms rank search hits by summing weighted factors such as overall site popularity, META tag keywords, META tag descriptions, TITLE tag contents, text contents, keywords containted in URLs, and so on.

    Can you talk a bit about how those weights have changed over time? Have there been any surprising shifts?

    --
    --- Jason Olshefsky

    Karma: Poser (mostly affected by adding this line long after everyone else did)

    1. Re:Weighting of heuristics by danny · · Score: 2
      META tags have never been used by Google, as far as I know, nor popularity in the sense of number of visitors.

      The original algorithm used only TITLEs and the number and quality of links to the page (ie PageRank). These days, the text in and near the anchors of incoming links is a major part of the algorithm.

      Danny.

      --
      I have written over 900 book reviews
    2. Re:Weighting of heuristics by searchtools · · Score: 1

      Right, the public webwide Google search enigne ignores meta tags of all kinds, I think because there's too much spam in them. The Google Search Appliance indexes them and searches all the meta tags automatically.

  230. Logging queries and reporting to the FBI by Anonymous Coward · · Score: 0

    Do you report searches that may be "against us" in the words of our idiotic commander-less-in-chief.

    Examples might include:

    plutonium bomb assembly dirty radioactive fake
    staged Cheney Bush Rumsfeld Carlysle

    Thanks in advance,
    Prepared for the second fake war: The war on fat

  231. Constant Availability by wide_awake · · Score: 1
    Hi.

    Google rules. What I'd like to know is how it is *always* up, even in the face of huge demand and complex servers backing it up. I've used Google for a few years, and I've never seen it go down, even though I usually visit the site several times a day.

    What are some of the measures that allow Google to achieve this incredible availability?

  232. What keeps you up at night? by Anonymous Coward · · Score: 0

    What do you worry most about about your job?

  233. Do you..... by pjdepasq · · Score: 2

    Do you Yahoo?

    ;-)

  234. What would you do different...? by Anonymous Coward · · Score: 0

    Hindsight is 20/20. What would you do different, technology wise, company wise, etc, etc... If you were to start again?

  235. See who you are interviewing! by jbarr · · Score: 1

    Craig Silverstein's picture is here and a number of other google employees are pictured here.

    --
    My mom always said, "Jim, you're 1 in a million." Given the current population, there are 7000 of me. God help us all!
  236. Google Security by calib0r · · Score: 1

    As a network security analyst, I have to deal daily with intrusion attempts and DDoS, and such. Our company, being a large domain registrar, has to pu t forth a large number of resources dealing with these issues. What kind of intrusion attempt traffic does Google see on a daily basis? How much of Googles resources have to be used to deal with these problems?

    --
    -===- "Those who would sacrifice freedom for security deserver neither" -===-
  237. Does Google log, report strange searches to FBI? by Anonymous Coward · · Score: 0

    to find those "against us" in the words of the idiot resident of The White House?

    An example of a strange search is:

    > dirty radioactive bomb staged fake Cheney Rumsfeld Bush Carlysle

    Thank you

  238. I wanted to know... by quasi_steller · · Score: 1

    Why did you first decide to start building an internet search engine, and what have you learned in the process of becoming one of the biggest and best search engines out there?

    --
    ...interesting if true.
  239. googlewhacking by Prowl · · Score: 2, Interesting

    has there been a noticeable increase in hit rate since google whacking became popular?

    have you had to take counter measures?

    also, you already have an elmer fudd language, how about duke nukem or yoda?

    --
    That man tried to kill mah Daddy
  240. Will Google continue profitability in the future? by mike3411 · · Score: 1

    Considering the extremely low .com success rate and the current terrible advertising market, what do you think has set Google so apart from the many failing, similar businesses? Do you see your business model as having endured what may have been the most difficult economic environment, or do you see larger challenges in the future? And, of course, I'd like to say I love google dearly and would like to pursue a physical relationship in the future.

    --
    Mod me down, and I will become more powerful than you can possibly imagine!
  241. Google Office Systems by scriptopia · · Score: 2, Interesting

    According to the Google site, in the career section one of the perks to working at Google is having a high powered linux workstation. My question is: How well does it work using linux as the Desktop/Workstation OS. What kind of compatibility issues have you run into when working with partners who use Windows. Additionally what kind of custom software soutions have been needed to make it work.

    -ryan

  242. When do we get true exact substring searches? by Anonymous Coward · · Score: 0

    I've been waiting a long time for an exact phrase search that really is what it claims to be.

    If I want to search for the character string "/proc/net/dev" or "C++", then currently I'm totally screwed.

    Any chances of us ever getting this?

  243. King of the search engines by mikosullivan · · Score: 5, Interesting
    • CUI was king of the search engines. WebCrawler took them down.
    • WebCrawler was king of the search engines. AltaVista took them down.
    • AltaVista was king of the search engines. Google took them down.
    • Google is king of the search engines.

    Does this chain of thought keep you up at night?

    --
    Miko O'Sullivan
    1. Re:King of the search engines by Anonymous Coward · · Score: 1, Interesting

      Not necessarily "Keeping me up at night," but I do say that is an interesting pattern.

      With every new search technology (isn't Google's thing is that it doesn't search the pages for keywords but some way of judging content?) the old is made obsolete, and the new is made king for a long time, that is, until the next greatest thing comes along and knocks it out. When you think about it, it's almost like a trend... People flock to the newest and greatest thing. But personally speaking, I still use Altavista now and then, even for a little trip of nostalgia.

    2. Re:King of the search engines by mikosullivan · · Score: 1
      The one thing I still use AltaVista for occasionally are boolen searches. If I just can't find it on Google (which is dan rare, of course) I can often find it with an long boolean expression:

      ("html" | "attribute") ~
      ("frame size" | "frame sizes" | "big frames")

      --
      Miko O'Sullivan
    3. Re:King of the search engines by Kickstart70 · · Score: 2, Insightful
      For a long while Lycos claimed to have knowledge of the most pages as well. There are many different ways of saying who is 'king'. I don't disagree that Google beats the heck out of everything else currently available though.

      Now...if Google wants to stay on top with me, I'd like to see the following:

      • 'regexp.google.com', so that we have a little more control over how and what we search
      • If not regexp, then at least wilcards for words. I would like to search for something like "word1 * word3" and have the search engine return all the instances where word1 and word3 have any number of word2' between them.
      • Dropping country specific google sites. I absolutely hate the fact that when I type in 'www.google.com' it autoforwards me to 'www.google.ca' with Canadian content enabled. I've spoke to them about it, and there are workarounds, but they refuse to make it easy to not be categorized by country.
      • I'd like to see the last time a returned link was indexed by Google, to know how up-to-date a link is. This especially becomes vital for technical issues, where a returned link on a specific piece of software is no longer useful because of version changes.

      I can come up with a lot of things saying 'how to make search engines more useful'. Now if only someone would listen.

      Kickstart

    4. Re:King of the search engines by jesser · · Score: 1

      What don't you like about www.google.ca? It's in English. It's closer to you, and therefore it should be faster.

      --
      The shareholder is always right.
    5. Re:King of the search engines by Kickstart70 · · Score: 1
      Basically, what it comes down to is that I don't like being typed by my location. I typed in 'www.google.com' instead of 'www.google.ca' because I wanted the standard interface.

      Now...I have to say that Google doesn't appear to specifically target me with 'ads for Canadians', but there is nothing to stop them from doing so, and no way to tell if that is indeed what they are doing at some point in the future. Even further, how can I be sure that the actual search results I get from Google aren't tainted by my location? If the Canadian gov't cracks down on some form of cryptography, will I be unable to do searches in Google for it because of my location? It's a slippery slope.

      As far as speed...there is no difference that I have found. But that brings up another issue...what if the www.google.ca server starts dying? Will I be forced to go to a server that is incapable of handling my request?

      Kickstart

    6. Re:King of the search engines by bons · · Score: 1

      Wow. It's amazing what asking to be moderated up on UserFriendly will do for your Karma. Is this a common practice now?

    7. Re:King of the search engines by Kickstart70 · · Score: 1

      No worse than the standard "Old Geeks Network" which permeates Slashdot.

      KS

    8. Re:King of the search engines by mikosullivan · · Score: 1

      The first rule of success: if you want something to happen, make it happen. :-)

      --
      Miko O'Sullivan
    9. Re:King of the search engines by mikosullivan · · Score: 1
      As far as speed...

      Furthermore, who says it's faster because you're in Canada? IIRC, you're in Vancouver. I'm in Virginia. Going geographically, you're probably closer to their California servers than I am.

      --
      Miko O'Sullivan
    10. Re:King of the search engines by jesser · · Score: 1

      Now...I have to say that Google doesn't appear to specifically target me with 'ads for Canadians', but there is nothing to stop them from doing so, and no way to tell if that is indeed what they are doing at some point in the future.

      Is that a bad thing? Google prides itself on serving relevant ads that users actually want to look at. For some ads, location matters when deciding whether an ad is relevant. When I put up my ads, I had a choice of languages and countries. I chose to advertise to everyone.

      Btw, www.google.com probably gives you ads targeted to Canada if it can tell you're from Canada. And it can tell you're from Canada :)

      --
      The shareholder is always right.
  244. food by asv108 · · Score: 2

    I noticed that at google has free gourmet lunches for all its employees couresy of Chef Charlie. My question is how good is the food and has Charlie told you any interesting stories from his days with the Grateful Dead?

  245. Apologies to Steven Wright by Joao · · Score: 2

    I have been pondering this question for quite some time, and I think I finally found the one person who might give me the answer:

    Dear Mr. Silverstein. If you could have everything, where would you put it? ;-)

  246. Good Question by Anonymous Coward · · Score: 0

    This is a good question.

  247. Downtime by FlukeMeister · · Score: 1

    With all the effort that you must put into improving other people's experience of the web, what do you do to relax when you're not working?

  248. Corporate Culture by zpengo · · Score: 4, Interesting

    As an addendum to this, what is it about the corporate culture at Google that makes it work so well while other "hip" dot coms went down the toilet? What's the magic ingredient that made Google turn out differently?

    --


    Got Rhinos?
    1. Re:Corporate Culture by Anonymous Coward · · Score: 0

      I'd say the magic ingredient was an extremely well designed internet search portal!

  249. IRC? by Anonymous Coward · · Score: 0

    There's something that I fear may happen someday, so I'll just ask it now -

    When will we see the Google IRC archive?

  250. Searching multiple sites by mikosullivan · · Score: 2
    Right now on Google you can either search a single host or the whole web. It would be great if you could search a set of hosts, and also if you could restrict the search to specific directories within those hosts. The specific reason I ask is that I've written an open source bookmarking system and it would be great if users could search the pages and sites they've bookmarked.

    Speaking of bookmarks, do you have any plans to offer a bookmarking service?

    --
    Miko O'Sullivan
  251. Working for Google by Anonymous Coward · · Score: 0

    What I really would like to know is what it is like working for Google in the post-dot.com era? A lot of tech companies these days don't treat their employees very well (Makes me I'm glad I'm not a software developer). Have you had to lay off people yet? Are you still hiring? How are people rewarded at Google? I am asking is because you can tell a lot about a company by the way they treet their engineers.About your competition Alltheweb, a friend of mine considering seeking employment there was warned by someone who already works there that he could not in good conscience recommend it since he described the culture as "The same as KGB during Stalin: hero one day, a bullet in the back of the head the next". Is this normal for the industry or is Alltheweb different?

    RICK

  252. If you had to reincarnate as a camel.... by Blaede · · Score: 1

    ...which would you be, a bactrian or a dromedary camel, and why?

  253. Business Question by Anonymous Coward · · Score: 0

    What is google's business model?

  254. question by Anonymous Coward · · Score: 0

    In your opinion, if Google is overtaken as the premiere search engine, what would have been the cause ?

  255. searching images by abdulwahid · · Score: 1

    How exactly does the image search feature work? Apart from the filename, what makes an image more likely to be returned in the search results? Have you considered using OCR software to read text in i mages and an image as text option on the search page that will extract the text from an image?

    --
    perl -e 'print $i=pack(c5, (41*2), sqrt(7056), (unpack(c,H)-2), oct(115), 10);'
  256. This process takes too long. by Captoo · · Score: 1

    . . . and we'll post his answers shortly after he gets them back to us.

    I want to see the answers shortly before he gets them back to you!

  257. pron by Anonymous Coward · · Score: 0

    c'mon... we all want to know what percentage of searches are actually looking for porn.

  258. Dynamic VLAN's by Malbosia · · Score: 0

    I had heard from a friend that you guys are using some sort of dynamic VLAN's and that one day a server can be doing one job, and the next day will be dynamically assigned to another group of servers doing another job. Is this true? If so can you elaborate on this?

  259. Google's Ethical Leadership by Dissident · · Score: 2, Interesting

    How has Google managed to deal so well with ethical issues in the current economic environment? For instance, how has Google managed to avoid going to pop up ads, unfair treatment of search returns based on payoffs, and other challenges? Most search engines and news sites have already caved in to these methods citing budget shortages. Google, on the other hand, seems to be expanding existing services, and acquiring or developing new services. How has Google managed to avoid some of the other pitfalls like clueless corporate officers who push the company into adopting bad technologies or technologies that don't fit the company? How much input and control does the non-management, but highly technical types in the company exert over the corporate vision of technology?

  260. Jakob Nielsen by icqqm · · Score: 2

    Just what impact has Jakob Nielsen had on Google's interface?

  261. LBS etc by briancnorton · · Score: 1

    While I think we all understand that you wont make major service announcements here, but what role do you think Location Based Services, (GPS) Telematics, and Wireless technology will play in Google's future? What major technological hurdles do you forsee? What privacy issues need to be resolved technologically vs politically?

    --

    People who think they know everything really piss off those of us that actually do.

  262. What programming languages where? by webmaven · · Score: 2

    From various articles and references on the net, it's clear that Google uses a mix of languages in developing and deploying it's services. Languages I've seen cited are Python, Java, and C++. I assume this is not a complete list.

    What programming languages do Google developers use, for which tasks are they used, and why?

    --
    The real Webmaven is user ID 27463. I don't rate an imposter, because my ID is such a lame-ass high number.
  263. How often do you have problems with hackers ? by Anonymous Coward · · Score: 0

    It's common knowledge that people like to break systems run by companies that are universally disliked.. no need to name names on slashdot, but I don't know anyone that dislikes what google does - they seem to be held in pretty high regard all round.

    We have people portscanning the place I work (education) every day, and code red continues to try and infect us (but we run mainly linux webservers), so I regard that sort of thing as background noise.

    I wonder - does this make you relatively immune from these sort of things ? Do you find people trying to break or DOS google ?

  264. A few questions: by Anonymous Coward · · Score: 0
    What exactly are you wearing? And, do I make you horny baby? Do I?

    Also, you are bold, but are you also...daring? One last question: Have you ever been in a Turkish prison?

  265. wild card searching by mcsneedy · · Score: 1

    do you have any plans to support wildcard searches? it's tedious to have to type (recipe OR recipes) all the time. how about regex pattern matching?

  266. Internationalization by casolorz · · Score: 0

    How do you guys handdle i18n? at my job we are currently working on that, and we just find it hard to keep the pages the same across different languages without replicating data or code that doesn't need to be replicated. We also think we will have issues with later releases as translations might not be done on time, do you guys have such a thing as different versions for different languages?

  267. Technology behind news.google.com by itsmarcos · · Score: 2, Interesting
    Is it possible to have some information on the technology behind news.google.com?

    In particular:

    1. Do you use any Topic Detection & Tracking techniques.

    2. How do you cluster news stories? Do you use a Scatter/Gather approach.

    3. Is the news site going to be available through an API?

    --
    Marcos
  268. Information Visualization by briancnorton · · Score: 2, Interesting

    Your search results are undeniably the best available commercially on the web, but what thinking has been given to graphical information visualization? Some new search engines are trying out presenting results topologically. While these may not be very useful, there may be potential. What has google done technologically in this area if anything? Are there any plans to explore this avenue?

    --

    People who think they know everything really piss off those of us that actually do.

  269. Google and the Google Answers Project by SyFryer · · Score: 1

    Hi Craig,

    Great props towards one of my favourite sites, keep up the good work.

    My question:

    With the introduction of this service (and later searchable answers e.t.c), how long can we (and any aliens planning roadworks) expect to be able to access googles version of the HHGTG?

    I realise the net is this to some extent already, but the Answers KB is pure signal.

    Do you have any other plans for the sum of this collected knowledge than the web (mobile devices, schools e.t.c) and how well is it going?

    Thanks,

    Sy

  270. I like to watch... by odbodbo · · Score: 4, Interesting

    I used to kill hours watching the search requests scroll by on Metacrawler's Metaspy page back when people still used Metacrawler. Any chance we could have something like that on Google? I would *almost* even pay to subscribe to a site where I could watch uncensored Google search requests go by.

  271. Will there be a mozilla toolbar? by Anonymous Coward · · Score: 0

    I hope that this is not a redundate post - will mozilla ever get the famous google toolbar. (For the poor no galeon in reach win32 mozilla people...)

    And what is he thinking about AllTheWeb?

  272. Re:Forget Craig exactly by sk8king · · Score: 1

    Unfortunately, the people who post under anonymous coward appear to think that the only "hot" chicks out there appear in Maxim and that a woman who doesn't frequent raves and is older than 18 can't be beautiful.

    Your comments would carry more weight if you would put your name behind them.

  273. AllTheWeb versus Google by Anonymous Coward · · Score: 0

    People in the know say that it is mostly down to
    two search engines now: AllTheWeb and Google. Do you agree? I've also heard that while Google is better on searching the web for english content Alltheweb is built on radically more flexible and scalable technology that works better in other contexts as well.

    According to what I've heard they have an index about the same size of yours, but they only use a fraction of the hardware you use. How many servers do you and AllTheWeb use?

  274. Government intervention by Anonymous Coward · · Score: 0

    Do you come under pressure from any government organisations to monitor users' queries?

  275. Has Google Replaced URLs? by billnapier · · Score: 5, Interesting

    With the success and popularity of Google, I find myself using URL's for places less and less and just entering names into Google to find places (they are almost always on the first page...) Do you think that you have almost replaced the URL?

  276. Google Toolbar by Anonymous Coward · · Score: 0

    When will it be available for Mozilla?

  277. Google's Catalog and Image Searches by hlynna · · Score: 1

    These two new features are very cool. What I wonder: How do you index images? And how do you get all those catalogs on the web? Your site says they are catalogs that aren't available elsewhere on the web, so did you enter them by hand? Or did you have some cool solution? Thanks! Hlynna

    --
    The one in the corner looking clueless at most everything, but enjoying it, nonetheless.
  278. What's the deal with Google and RAM? by Anonymous Coward · · Score: 0

    In a recent interview, Eric Schmidt mentions that, "At Google, we found it costs less money and it is more efficient to use DRAM as storage as opposed to hard disks". [http://www.pcworld.com/news/article/0,aid,81685,0 0.asp].There was a lot of interest about this remark. Could you elaborate a little more on this idea.

  279. besides, pageRank by stryk9 · · Score: 1

    A lot of search engines truncate/prune low-scoring terms. For instance, do you really need to hold the scores of the word "the" for all the 2.5 billion documents that have it? Would it really make a big difference if you only held it for the top-scoring, say, 100 million documents?

  280. Google, positioning, and the url by sqwark.com · · Score: 1

    I've noticed recently that hyphenated keyworded urls are placing highly in Google. Could you confirm that Google is looking strongly to the url for content information and to determine relevance?

  281. Other content types? by barnaclebarnes · · Score: 2

    Google does a good job of indexing html and has added new types of content over the last few months.

    My question is what other contents types do you want/plan to index?

    For example I searched Google for about an hour today looking for a solution to my PHP compilation problem with no luck. I turned to IRC and got an answer in a couple of minutes. If Google archived that conversation then the next person to search for the error message would get an exact hit.

    /b

    --
    [Please type your sig here.]
    1. Re:Other content types? by J'raxis · · Score: 1

      Um, that wouldnt be possible unless Google had a bot sitting in every channel on every IRC network. Or, the IRC daemon could be rewritten to dump all public channel messages to a logfile which could be archived somehow (like posted to a website daily).

      Along those lines, you could certainly set up your own bot to record your favorite channel conversations, and save them as HTML or text files to a webserver somewhere, and then have Google spider the server like it does every other webserver. I suggest you advertise this prominently in the channel topic that everything is being archived so no one gets pissy about having their conversations recorded.

  282. Global Load Balancing by dloyer · · Score: 1

    How does Google solve the global load balancing problem? Foundry Networks claims that you use their ServerIron products. Is it true?
    Do you do anything special to map different parts of the network to server farms? How do you collect performance metrics while avoiding active probe compaints?

  283. age of cache vs wayback machine by ajrs · · Score: 1

    how long is an old version of a page that no longer refers to a page kept in the cache?

    How was that decision made?

  284. Bookmark Keywords by elysian1 · · Score: 1
    I've also found bookmark keywords to be extremely useful. I have mine set up so that if I want to search for something on google, I just type into the address bar:
    • g "whatever i want to search for"
  285. Would Google consider licensing their database? by jl138 · · Score: 1

    Having access to gigabytes of data is always valuable for testing new algorithms and ideas. Would Google considering licensing access to their data at a level beyond the search capabilities available today?

  286. Re:Forget Craig exactly by Anonymous Coward · · Score: 0

    Some of us ACs do have taste. Cindy's beautiful. A major babe.

  287. Google and Mozilla by Wraithlyn · · Score: 2

    I'm not very familiar with the Google toolbar, but IMHO, Google access from Mozilla 1.0 couldn't be much easier... just type your query in the address bar, then press up-arrow to select the 'Search Google for "fubar"' from the bottom of the drop-down menu, and hit Enter... presto! Google search in 2 keystrokes. Add Ctrl to that and you even get it in a new tab. Mozilla rocks! I just wish it had smooth auto-scroll, more customizable toolbars (such as small icons, optional text, Home on the nav bar instead the personal bar), and native support for Back/Forward buttons on mice, like IE and Opera 6 do.

    --
    "Mind, as manifested by the capacity to make choices, is to some extent present in every electron." -Freeman Dyson
    1. Re:Google and Mozilla by RedWizzard · · Score: 2
      Here are some of things that the Google Toolbar gives you that aren't covered by Mozilla. These are the ones I use frequently:
      • Search the current site as easily as the whole web (equivalent to adding site:... to your search terms)
      • Quick access to Google's cache of the open page
      • An 'up' button that allows you to quickly navigate a site by trimming trailing directories from a URL (i.e. www.site.com/dir1/dir2/page -> www.site.com/dir1/dir2/ -> www.site.com/dir1/ -> www.site.com/).
      • Search term highlighted in the current page
      The Google toolbar is the only thing I miss from IE.
  288. Backups by octalgirl · · Score: 1

    Can you describe your backup solutions? And in that vein, has there ever been a time when a critical hard drive or system crashed, calling for an all-nighter? (What has been your worst nightmare?)

  289. images.google.com and PNG images by Rock · · Score: 4, Interesting

    Craig,

    When will images.google.com include PNG images in its search base? Why were the image types limited to GIF and JPEG, when most browsers could also display PNG? Now, virtually all non-text browsers support Portable Network Graphics.

    Questions done. I'll take this opportunity to thank Google for groups.google.com, the searchable usenet archive. In my opinion, 15% of the total value of the internet is contained therein. Excellent!

    --
    - - -
    "The sixth sick shiek's sixth sheep's sick."
    1. Re:images.google.com and PNG images by Rock · · Score: 1

      I just noticed that Rob said "one question per post". So my proposed question is the first, "When will images.google.com include PNG images in its search base?".

      --
      - - -
      "The sixth sick shiek's sixth sheep's sick."
    2. Re:images.google.com and PNG images by gjuyn · · Score: 1

      I second that. It's about time PNG gets full coverage. Most magazines, almost all software, and many experts consider it a given that PNG exists.

    3. Re:images.google.com and PNG images by wwwillem · · Score: 1

      It's a simple question, which deserves a simple but interesting answer. When, when, oh when ....

      --
      Browsers shouldn't have a back button!! It's all about going forward...
    4. Re:images.google.com and PNG images by 3dr · · Score: 1


      I would like to echo similar interest -- being able to index PNG images would be a nice addition. Given the chunkular format of PNG and extra information that can be stored in a PNG, it seems Google would be able to index the images with slightly more depth... IF creators of PNG would use these fields.

  290. Coolest missing feature? by Anonymous Coward · · Score: 0

    What would be the one very cool feature you would love to expand Google with, that is simply too difficult to program (for now)?

    Tom

  291. Slashdotted? by gafferted · · Score: 2, Funny

    Can I have a prize for slashdotting google?

  292. Ultra-advanced search by 4D.uk · · Score: 1

    Sometimes I wish that Google had additional advanced search capabilities such as regular expressions or an Altavista-like NEAR boolean (preferably one that lets me specify my own value for the 'nearness'). In fact I'd be prepared to pay for "advanced googler" status, even just to have those two available (say, $10/year at least).

    Do you have any thoughts about the technical feasability of these ideas? And what about the business case; doesn't Google need just such a revenue stream from the people who actually use its service?

  293. Bandwidth monitors. by _ph1ux_ · · Score: 4, Interesting

    I was gunna mod... but I ahve a re-quest-ion.

    Can you guys put up bandwidth graphs for the public to see. Like mrtg graphs page showing daily google request traffic flow. so we can see what type of overall trends in searching happens during the day.

    I would love to be able to see just how massive you traffic is and what it looks like.

    and let us know what tools you use to monitor all your stuff.

  294. Are the pigeons ok? by dubstop · · Score: 0

    Are those pigeons being humanely treated?

    I've been worrying about this for months now. It just doesn't seem right, making those pigeons do all the work.

  295. How do you manage the Google by Zorlon · · Score: 2, Interesting

    I work for a large ISP and am interested in how large systems are managed. How do you manage the Google? Do you use open source, commercial or roll-your-own monitoring. Do you use a SNMP agent and, if so to what extent. How fast can you detect a problem, troubleshoot it and fix it.

    --
    - Things are the way they are because they're coded that way -
  296. It's not how big your index is... by bani · · Score: 2

    ...it's how you use it! =)

  297. Not a dumb question at all! by cjsnell · · Score: 2


    That's a good question and not "redundant". Let me expand on his/her question:

    Is Google actively hiring? If so, what kind of job titles are most important for Google to fill these days?

    [I'm not a headhunter]

    Chris

  298. Google has a lot of keys by fons · · Score: 2


    Hey Google seems to have all the keys to success. So that would be like looking for a needle in a ...

  299. Why Linux (over BSD, etc)? by cjsnell · · Score: 4, Interesting


    When you were selecting the OS to run Google, why did you choose Linux? I'm partial to FreeBSD but I'm pretty sure that you evaluated it and found something a) that you didn't like or b) something about Linux that you liked better. If so, what?

    Second part of this question: Do you continue to evaluate alternative operating systems?

    Chris

  300. Not Jealous, I live in Santa Barbara :-P by brodin · · Score: 1

    S.B. California, Oh, yeah!

  301. Google Whacking by Anonymous Coward · · Score: 0

    Is it possible for Google (or the people at Google) to calculate a finite number of possible "Google Wacking" searches that result in
    only one web site?

    In other words, given the content of all the webpages that Google can search, is it possible to determine how many search requests will yield only one site as a result?

  302. Resumes@Google by bobdole34 · · Score: 0

    What does it take to even be considered for a position at google? ie. What sets one resume above the 10000/week submitted?

    --
    "Failure of Windows operating systems is extremely rare. If it happens, it is usually due to operating system file c
  303. Question for Craig by euroBob · · Score: 1

    Cutting the BS out of the way... What makes google different from the many razor scooter riding, Latte drinking, "hi we all have PHDs" and look at us do something on the 'net' companies? Rob

    --
    try { println( SigString ); } catch( Exception e ) { println( 'Who cares?' ); }
  304. Something wrong with making the people rich by mr_don't · · Score: 2

    Actually, economic inequity, and not lack of food, is the prevalent cause of hunger. Thus, in a way, making more people rich does create something wrong. Unless you consider it is ok that one sixth of the world's population faces chronic hunger. Like energy, there is only so much capital in the world...

    I am saying ICANN (or someone like them) - not Venture Capitalists - should fund projects like Google.

    1. Re:Something wrong with making the people rich by rnd() · · Score: 2
      Ironically, the fact that our society is free reveals that Capitalism is the economic system of choice in the world's richest nation. What should we do with that wealth? Whatever we want! That's why it's called a free society.

      Rich people in the USA do not cause world hunger. In fact, they provide a lot of help to the world's hungry, both in terms of increasing awareness and generating funding.

      If ICANN were left to its own devices to fund 'projects' like Google, such 'projects' (Google is very much a profit-making business, not a 'project') would likely never come into existence. This is nothing against ICANN, but let's face it, nobody is more stingy with a dollar than a venture capitalist. The same cannot be said about non-profits, because there just isn't the same level of competition out there in the np space.

      If you look at the distribution of wealth throughout the world, it is easy to say that economic inequity is the cause of hunger. But in fact the cause of hunger is the simple lack of productive economies in hunger-stricken regions. Why isn't Afghanistan competing with India and China in the tech sector? Culture plays a big part in the equation. If the Taliban says that you can't have a computer or any modern means of making money, of course you'll be poor in the modern world.

      The source of the economic inequity between nations is largely a result of the laws that govern patents and trade, which are themselves a product of the nation's culture. Don't take for granted the combination of good ideas and good luck that gave us our western system of government. It is what makes this discussion possible at both the idiological and the technological level.

      --

      Amazing magic tricks

    2. Re:Something wrong with making the people rich by mr_don't · · Score: 2

      Capitalism is the economic system of choice in the world's richest nation. What should we do with that wealth? Whatever we want! That's why it's called a free society.

      It's not a Free Society for everyone in this country. Many American Blacks and Asians face major economic inequity. In West Oakland, CA, the percentage of people who can't pay their mortgage is up around 60%. The amount of children who live under the poverty line in Alameda County CA is staggering - around 40%!

      Rich people in the USA do not cause world hunger.

      Again, I would have to disagree with this. A major cause of hunger, is, as you've said, a lack of productive economies in hungry nations. But rather than pursue fair trade with producers in other nations, the small minority of people in the United States who control our trade policies (who I would call "Rich people in the USA") insist on subsidizing US farm crops (and dumping them all over the world) while forcing developing nations to export luxury crops to the US (like coffee - 1% of the world's arable land is used for coffee). The WTO and the World Bank, major tools of developed industrial nations, heavily favor the interests of the United States and Europe. The United States especially poor when it comes to pursuing sustainable development... Do you actually believe that wealthy interests in the US have a positive effect on world hunger? Exporting cheap food to poor countries destroys local economies (outcompeting local farmers!), the same way that a Wal-Mart comming to your town destroys local businesses...

      In fact, they provide a lot of help to the world's hungry, both in terms of increasing awareness and generating funding.

      You must be speaking about these people, otherwise, I think the wealthy in the US need to learn a lot about fundraising for the impoverished...

      If ICANN were left to its own devices to fund 'projects' like Google, such 'projects' (Google is very much a profit-making business, not a 'project') would likely never come into existence.

      Hunh? How did any of the following come about: Apache, Linux Kernel, GNU utilities, PHP, Perl, The Gimp, GNOME, KDE, OpenBSD, FreeBSD, NetBSD, Debian, etc, etc, etc...

      there just isn't the same level of competition out there in the np space.

      Again, what about: GNOME vs. KDE, Apache vs. Anything, Mozilla vs. Konqeror vs. everything else, Linux vs. BSD vs. etc. etc. etc...

      If you look at the distribution of wealth throughout the world, it is easy to say that economic inequity is the cause of hunger. But in fact the cause of hunger is the simple lack of productive economies in hunger-stricken regions. Why isn't Afghanistan competing with India and China in the tech sector?

      A "computer" economy will not feed the people of a developing nation. Who will grow the food? Will those people be a perpetual underclass?

      The source of the economic inequity between nations is largely a result of the laws that govern patents and trade, which are themselves a product of the nation's culture.

      Wrong. The laws that govern patents and trade are dictated not by culture, but by the threat of force and sanctions by more powerful countries. Do you actually think that people in Jamaica, the Phillipines, or China want to be used as cheap labor for European and American tastes? What about Cuba? Do you think that the culture of Cuba dictates their economic situation? Or was if force from the United States?

      It is what makes this discussion possible at both the idiological and the technological level.

      People in non-western countries also talk freely about politics, technology, and other issues... I myself am Persian, and persians love to talk politics... It seems like you have been watching too much Fox News Network... In the United States, there is no guarantee of free discussion about political issues, as the squelching of public dissent in the face of ridiculous media conglomeration. Our current Federal Legislature and Presidential Administration is not exactly

      For more information check out: the EFF, and FAIR...

    3. Re:Something wrong with making the people rich by Anonymous Coward · · Score: 0

      Let me guess: you are a white man, mid-20's to mid-30's, upper-middle class, college educated. Aren't you tired of always thinking you are right about politics, economics and media? You sound like my little brother.

    4. Re:Something wrong with making the people rich by rnd() · · Score: 2
      Not that your comment was a productive addition to the discussion, but I'll respond anyway:

      I posted the opinions I posted not because I am absolutely convinced that they are true. In fact, I'm only partially convinced that they are true, which is why I posted them in a discussion with someone who obviously disagrees.

      I invite you or anyone else to put forth an argument that will change my mind. In fact, I want to have my mind changed on these issues.

      The problem is, lots of people are out there claiming that capitalism is bad, but none who I've spoken to can articulate in a meaningful way why it is bad. Sure people are starving. Show me how capitalism is responsible for that. If you can point this out logically and intelligently argue against all of the counter-arguments, then I'm sure you will change the mind of any rational person who reads your response, including myself.

      Unfortunately, you can't do any of that stuff. It may not be within your ability level, or you may not care enough about the issue to attempt it.

      Let me guess, you are an angry white male with little or no college education who wants to find something to blame his failures on, anonymously.

      --

      Amazing magic tricks

    5. Re:Something wrong with making the people rich by rnd() · · Score: 2
      It's not a Free Society for everyone in this country. Many American Blacks and Asians face major economic inequity. In West Oakland, CA, the percentage of people who can't pay their mortgage is up around 60%. The amount of children who live under the poverty line in Alameda County CA is staggering - around 40%!

      So you are saying that the poor people you mention would rather live under communism? Let me inform you that many whites also live under major economic inequality. Economic inequality is a reality of life in a capitalistic society. Those who work hard and strive to make the most out of their lives will tend to rise above poverty and enter the middle class. Why do you think immigrants fight to come to the US? It is because the US offers unparalleled opportunity. How many people are fighting to live in Cuba or China? On my last cab ride in Philadelphia the cab driver who barely spoke English had an O'Reilly book on web programming. What a country!

      Again, I would have to disagree with this. A major cause of hunger, is, as you've said, a lack of productive economies in hungry nations. But rather than pursue fair trade with producers in other nations, the small minority of people in the United States who control our trade policies (who I would call "Rich people in the USA") insist on subsidizing US farm crops (and dumping them all over the world) while forcing developing nations to export luxury crops to the US (like coffee - 1% of the world's arable land is used for coffee). The WTO and the World Bank, major tools of developed industrial nations, heavily favor the interests of the United States and Europe. The United States especially poor when it comes to pursuing sustainable development... Do you actually believe that wealthy interests in the US have a positive effect on world hunger? Exporting cheap food to poor countries destroys local economies (outcompeting local farmers!), the same way that a Wal-Mart comming to your town destroys local businesses...

      If low prices are more important than a shop owner who knows everyone's name, people will go to Wal-Mart. If making a bit more per hour is more important than preserving the local economic traditions, then people will go to work in the next American factory that is built. It's a matter of people choosing A over B. This brings me to my point about culture. If those 3rd world nations had a vibrant culture that prioritized preserving the local economies through legislation, then the Big Evil Americans wouldn't want to build factories there in the first place.

      You must be speaking about these people [foodfirst.org], otherwise, I think the wealthy in the US need to learn a lot about fundraising for the impoverished...

      Look at the combined giving of the Gates Foundation and the Ford Foundation. Compare that total to the amount donated by all communist nations combined. Which system is more effective at producing positive results for the world's needy?

      Hunh? How did any of the following come about: Apache, Linux Kernel, GNU utilities, PHP, Perl, The Gimp, GNOME, KDE, OpenBSD, FreeBSD, NetBSD, Debian, etc, etc, etc...

      These are all qualitatively different from Google. Running Google requires tons of bandwidth, tons of hardware, and a full time staff of developers in order to remain competetive. I did not notice any OSS search engines on your list. Sure, an OSS search engine would be possible, but only if someone donated the bandwidth, hardware, etc... If someone donated all that stuff to Google, it could become a non-profit. Unforunately, nobody did and so Google has to resort to being a profit-seeking business.

      Wrong. The laws that govern patents and trade are dictated not by culture, but by the threat of force and sanctions by more powerful countries. Do you actually think that people in Jamaica, the Phillipines, or China want to be used as cheap labor for European and American tastes? What about Cuba? Do you think that the culture of Cuba dictates their economic situation? Or was if force from the United States?

      Where are the revolutions that overthrow the regimes that enforce this poor quality of life? Where is the vision and the struggle for independence from tyrany? They don't exist. Why do Cubans not just tolerate but love Castro? Why do the majority of Chinese tolerate burocracy and communism and not demand a free society? The answer is, these nations lack the cultural technology to create change. By technology I mean the ideas of the French philosophers, British common law, and vision of the founding fathers of our nation.

      People in non-western countries also talk freely about politics, technology, and other issues... I myself am Persian, and persians love to talk politics... It seems like you have been watching too much Fox News Network... In the United States, there is no guarantee of free discussion about political issues, as the squelching of public dissent in the face of ridiculous media conglomeration. Our current Federal Legislature and Presidential Administration is not exactly

      I don't think many people count on the mainstream media for their views on politics or the world. All media is biased in one way or another. The point is that we're all free to be biased in our own way. Nobody is going to be put in jail for having a particular belief.

      I think you seriously underestimate the good things about the USA. Of course it has flaws, and I applaud you for pointing some of them out. But if you think Communism would be a better answer, the be careful what you wish for.

      The role of government is to give businesses the ground rules to operate in a completely self-interested manner and still act ethically. As we have learned from Enron, there is still a long way to go, but at least we have a system under which it will eventually be possible.

      --

      Amazing magic tricks

    6. Re:Something wrong with making the people rich by digger3001 · · Score: 1
      So you are saying that the poor people you mention would rather live under communism? Let me inform you that many whites also live under major economic inequality. Economic inequality is a reality of life in a capitalistic society. Those who work hard and strive to make the most out of their lives will tend to rise above poverty and enter the middle class. Why do you think immigrants fight to come to the US? It is because the US offers unparalleled opportunity. How many people are fighting to live in Cuba or China? On my last cab ride in Philadelphia the cab driver who barely spoke English had an O'Reilly book on web programming. What a country!

      Just to back him up on this, check out Japan after World War 2 vs. say the Philippines or North Korea. Japan has to import a crapload of everything it needs just as those countries, yet the Japanese rose up after being pretty well beaten down and became a world power while the other two nations have kinda sat on their hands for the past 60 years.

      It's a shame really, but in the end the populations as a whole always seem to blame it on some outside 'force'. They always point the finger at others.

  305. neurosurgeon by jwillem · · Score: 1

    In the staff section you say you have a neurosurgeon on payroll. I can't possibly imagine what job he/she has, except for operating on linux-machines with hart problems (yes, linux has a hart!).

    1. Re:neurosurgeon by gerardrj · · Score: 1

      Except that a neurosurgeon operates on brains. A cardiologist works with hearts (harts)

      --
      Article X: The powers not delegated... by the Constitution...are reserved...to the people
  306. why didn't you hire me? by Anonymous Coward · · Score: 0

    just because i told you i didn't want a job that involved 24/7 oncall at times. sheesh. :)

  307. No shareholder pressure?(n/t) by univgeek · · Score: 1

    ha ha ha lamenesss filter crapping...

    hoo hooo hoo..... he he he

    --
    All bow to his Noodliness!! His Noodle Appendage has touched me!
    1. Re:No shareholder pressure?(n/t) by rob_from_ca · · Score: 1

      Shareholders are just one form of money people. Google's accepted several million dollars (something like 50 I think; it's in the last article slashdot listed about Google's history) from venture capital firms; these are basically shareholders. Unless you're doing it yourself, with your own money and a small group of people you trust, there's always a trail of influence leading to the money.

  308. PR0 by SplatFileGoo · · Score: 1

    Why is this page currently PR0?

  309. How do you see Privacy working out for search? by WillSeattle · · Score: 1

    Basically, what I'd like to know is how you see the implications of Privacy, personal and corporate, in terms of technological choices you need to make?

    For example, do you see us moving like the rest of the civilized world towards a more personal-privacy oriented legal structure, and if this occurs, how would you and other search engines deal with this? Do you see any areas that Europe and Canada have already legislated that would prove very difficult for you (e.g. retention of server logs, query logs (comparitive to HIPAA regs), user logs) - or easier?

    Do you think we'll move (not counting the current paranoid few months) towards a more totalitarian regime - and what implications does that have for your tech side? Would you need to mirror search queries to help hunt down Americans posting things disparaging the White House or Congress - for example.

    --
    --- Will in Seattle - What are you doing to fight the War?
  310. Question - How to avoid feature creep ? by willamowius · · Score: 2

    The ultra lean user interface is a key factor for Googles success. Now that Google grows and more and more people are working there, how do you deal with the feature creep of "can we add this here and that there" ?

    Some new features are great, but how do you draw the line to keep that lean interface ?

  311. That's a completely bogus question! by mbauser2 · · Score: 1

    You don't understand how Google works at all, and you've created a completely useless hypothetical situation. Submitting that question to Google is a waste of time.

    Google doesn't rank "sites", it ranks "pages" (URIs), and it doesn't rank them according to "hits". Google ranks a page according to the number (and type) of links to the page, combined with analysis of the page content.

    In fact, topic-based pages often beat "news" pages in Google results because they're more stable. Pages that stick around longer generally acquire more incomig links.

    --
    Proud to be / Smiley-free / Since Nineteen / Ninety-Three
  312. How often do you crawl frequently changing sites? by labrinid · · Score: 3, Interesting

    I was amazed to find out that the new site for Elizabeth Smart was crawled by Google (and ranked first) within only a few days from the kidnapping (and the site being registered). CNN pages are also routinely showing up in Google within hours of being published.

    My question is: how do you estimate the rate of change for each site and how often do you crawl frequently updated sites (and update the index)? What is the range of re-crawl (few hours to one month?).

    thanks a lot,
    alex

    PS: Congrats for the Webbys.

  313. "Semantic Web" by currentdirectory · · Score: 1

    Hello Craig,
    Google might provide better results if the web is more structured right? Is google involved in developing any standards for authoring web pages (e.g., RDF, WebOnt etc.)? How about a GUI for assembling a complex query (" I need a ticket from Boston to Atlanta for 100$ etc.". Kind of stuff which semantic web claims to do?

    And also is google willing to give the data that have been collected from the past years for academic research purposes?

    thanks
    rajesh

  314. extensions to linux? by Anonymous Coward · · Score: 0

    A little more than a year ago, Dr. Dobb's Technet cast ran a presentation given by a PR manager at google. He talked about how google had developed in-house driver modules for linux to pump up disk speed. What other modifications or extensions have the google programmers made to the linux kernel? Any chance of releasing this under open source?

  315. Google bar on Galeon by hypatia · · Score: 1

    It's also a one-click install option when you install Galeon on Debian. I suspect it's a pretty universal Galeon thing.

  316. Business Model by ReginaldBarclay · · Score: 1

    Behind google has to stand an enormous mass of personnel and machines and bandwidth et al

    Yet the major business model is the one of a few sponsored links (I think I haven't seen one in the last 1k searches or so. So I must google for pretty useless stuff, eh?).

    How much money do you make per search, overall? And how does that fit with the averaged cost/search?

    I know that some portion of your income flows in from appliances and stuff, but I guess the sheer volume of searches makes all the difference for you.

    cheers.

  317. What Linux distribution does Google use? by Anonymous Coward · · Score: 0

    I've always wondered what Linux distribution the Google machines use.
    Is it a completely custom one build from the ground up by Google staff? Is it a customization of one of the well known distros (in that case which one?)? or is it just an off-the-shelves standard Linux distro that just runs the Google specific apps (again, which one)?

  318. Pressure to buy tech support? by Anonymous Coward · · Score: 0

    Now that Google is a BIG player with high visibility, what kind of pressures are being applied from Sun to M$ on buying thier tech support/ebusiness solution?

    My vote goes to keep independant, use you own tech genius, and stand tall (you don't need big help to run big business).

  319. popularity vs. quality correlation by Anonymous Coward · · Score: 0

    what makes you think that lots of links to a site qualifies that site to be in the upper ranks of your query results? Is there any scientific work about this issue, or is it a design by intuition/common sense? who can check out the quality website ranking 123,457 with your current design?

  320. Mozilla Toolbar Support by Hector73 · · Score: 3, Interesting

    Any plans to release a Google Toolbar for Mozilla 1.0?

  321. How much repetition on the Internet by mnemotronic · · Score: 1

    How much of the info on the Internet is copied or repeated from other web pages vs. how much is original? A link to info does not count as copy or repetition.

    How much of what appears on the Internet is crap (using the movie "Battlefield Earth" as the "10" on a scale of 1 to 10). I already have an estimate of this, and things ain't lookin' good.

    --
    The Russians have won. They have made the world a cesspool of distrust, greed, fear and hate.
  322. I wanna buy google stock by Anonymous Coward · · Score: 0

    When is Google going to go public?

  323. google is teh best by quannump · · Score: 1

    Can I marry google?
    Are you hiring? :)

    --

  324. Dumbass by Anonymous Coward · · Score: 0

    Your sig sucks. Why not just use latitue and longitude? Retard.

    1. Re:Dumbass by Reality+Master+101 · · Score: 1

      Because longitude and latitude doesn't work at the post office, and arguably would take more ASCII characters.

      I was getting bored of that sig anyway.

      --
      Sometimes it's best to just let stupid people be stupid.
    2. Re:Dumbass by Anonymous Coward · · Score: 0

      35N42E, yup, that's alot more than what you had. Even so, zip alone doesn't work at the post office. Even in really small town you need the name. Anyway, I see you agree with me because you changed your sig, not because you were getting bored with it.

    3. Re:Dumbass by Reality+Master+101 · · Score: 1

      35N42E, yup, that's alot more than what you had.

      Yeah, that narrows it down to about 100 miles.

      Even so, zip alone doesn't work at the post office. Even in really small town you need the name.

      Try it and see. It'll work fine.

      Anyway, I see you agree with me because you changed your sig, not because you were getting bored with it.

      Hey, if it gives you that much of a thrill to believe I changed it because of you, knock yourself out.

      --
      Sometimes it's best to just let stupid people be stupid.
  325. urls, positioning and hyphens by sqwark.com · · Score: 1

    I have noticed a high incidence of hyphenated urls in the top positions on Google. Could you confirm that you are weighting the url heavily in determining content and relevance? Additionally, and I know we're only supposed to ask one question, but this is important: Your Adwords programme seems to need some finetuning. Some terms show much higher searches on Overture, while other common queries show no results at all. Are you aware of this? And if so, do you have plans to finetune this tool in the future?

  326. Re:Forget Craig exactly by Anonymous Coward · · Score: 0

    you're lucky she works for Google, she'll need it to find your ****

  327. Google Groups by Chasing+Amy · · Score: 2

    Google also tends to succeed where others have failed--take for instance Google Groups, formerly DejaNews. What motivated that purchase? Google also seemed very interested in augmenting the USENET archive with missing data, by hunting down CDs and other media that were published years before DejaNews started its archive--that seems like a genuine desire to preserve USENET for the ages, so what inspired that? Lastly as a corollary, Google Groups is missing one feature that DejaNews used to implement, but eliminated a year or so before it went down: Deja had been keeping archives of the text posts in alt.binaries.* groups, which can be valuable since many groups have active text discussion; will Google ever re-introduce the ability to search the text messages in the alt.binaries.* hierarchy that Deja used to offer, even if it's limited to the old archive Deja had?

    --

    Chasing Amy
    (We all chase Amy...)
    "The more corrupt the state, the more numerous the laws"-Tacitus
  328. Site locations by rhaig · · Score: 3, Interesting

    How many physical sites do you use to host your systems? And is this due to network redundancy issues, disaster management issues, or simply realestate issues? If they're all in one site, is it because you feel things are easier to manage that way, or is it a limitation some crazy developer didn't think of?

    What I'm getting at is I'd love to work for google, and actually like some of the current job postings, but I don't want to move to California. (don't get me started on the reasons) If Gogle had sites in other locations, wouldn't it make sense to hire local admins to go deal with situations there? And thus the concept of the Google branch office is born...

    --
    "We are not tolerant people. We prefer drastically effective solutions"
  329. Content Filtering by telstar · · Score: 2

    Google, along with other search engines, filters content for explicit content. Is there any other type of content that Google has considered, or currently filters as a matter of practice, and what led to this decision?

  330. PigeonRank question... by Anonymous Coward · · Score: 0

    What is the airspeed velocity of a a PigeonRank pigeon?

  331. PigeonRank question... by bnitsua · · Score: 1

    What is the airspeed velocity of a PigeonRank pigeon?

  332. CS by geekindustries · · Score: 1

    You must use that bandwidth at work for "personal" use at least once (a day..heh). What kind of ping do you get in Counter-Strike? Come on...we know you play it!

    --
    Hard work usually pays off over time, but procrastination pays off now.
  333. Does Google use IBM GXP hard drives... by Anonymous Coward · · Score: 0

    ...in their servers? And if so, what has been their failure rate?

    1. Re:Does Google use IBM GXP hard drives... by Anonymous Coward · · Score: 0

      Sorry, forgot the all-important link:

      http://www.tech-report.com/news_reply.x/3494

  334. Mail software you use flawed? by Anonymous Coward · · Score: 1, Interesting

    Did you know the mail software you use is open to denial of service attacks(hint ident, no ident)?

    Why hasn't this been fixed?

    What precautions do you take in reguards to security of your databases? What processes do you use to avoid users of your service being 0wnz0r3d(eg preventing js attacks appearing in search results)?

  335. vi/emacs mode by BurningSpiral · · Score: 1

    I saw the keyboard mode google at google labs. Will there ever be a version with vi/emacs style key bindings?

  336. obligatory question by Anonymous Coward · · Score: 0

    boxers or briefs?

  337. Comment removed by account_deleted · · Score: 1

    Comment removed based on user account deletion

  338. the "exponential growth" in my pants.... by Anonymous Coward · · Score: 0
    The exponential growth in awareness of Google is largely attributable to McCaffrey's efforts to dis seminate information about the company through public relations and targeted marketing initiatives in lieu of large expenditures on advertising or promotion.


    I'm disseminating just THINKING about her....
  339. Database solution by BurningSpiral · · Score: 2, Interesting

    Could you tell us a little about the back end of your search eangine. It's extremely fast. What database software do you use? What optimisations have you implemented at the Operating System level (cluster sizes, Raw IO...)? What type of hardware do you use (Disk drives, Raid, CPU(s)...)? How do you handle load balancing/redundancy?

  340. Comment removed by account_deleted · · Score: 1

    Comment removed based on user account deletion

  341. Money and Google by Hyperhaplo · · Score: 1
    Greetings, As we all know the web is being taken over by commercialisation and is being swamped with calls to make it a 'pay for what you use' environment. This is considered offensive to us dinosoaurs who have been around for a long time and who assume that the net is free.. and always will be. There are sites who initially grab you with a 'free' service, such as http://www.schoolfriends.com and nytimes, who get you to use and depend on their products.. and then (schoolfriends) slap you with charges if you wish to continue using the site. On the other hand there are sites, http://www.megatokyo.com, that have a nice link on them saying 'If you want to help keep this site alive please donate'.

    So, my question is: Would google look to splitting its services in this manner? That is, the first tier method where you get either
    1: Free services (basic services)
    2: Subscription services (advanced paid for services)

    or perhaps the 'request for donations' line where the site is free, but donations are welcome?

    --
    You have a sick, twisted mind. Please subscribe me to your newsletter.
  342. The real question on everyones mind by Narkov · · Score: 2, Funny

    A/S/L?

  343. a better question: by Anonymous Coward · · Score: 0

    Why do you tell everyone that google is running on linux, when it is really running on Win2k?

    Don't you think that it's time to put your childish bias behind you and admit to the truth.

  344. Argument list too long by MrDelSarto · · Score: 1

    It's quite easy to get around the "argument list too long" error

    this linux journal article covers it quite nicely.

    1. Re:Argument list too long by Istealmymusic · · Score: 1

      Yeah, I used perl -e"unlink('glob *.mp3')" and it removed my complete MP3 collection quite nicely. But I like flaunting my close encounter with rm, because it shows in a Unixy way how many mp3s I have, and therefore I'm 31337. :)

      --
      "The lesson to be learned is not to take the comments on slashdot too literally." --Vinnie Falco, BearShare
  345. Google cracked? by Max+the+Merciless · · Score: 2, Interesting

    I'm interested to know whether Google has ever been sucessfully cracked? (errrm, perhaps this is sensitive info)

    How often do you detect cracking attempts?

    If this occurs what do you do about it?

    Thanks

    --
    * * Always question "the National Interest" - 9 times out of 10 it is a cover for evil
  346. How does Goggle raise money? by DABANSHEE · · Score: 2

    You forgot the most important question.

    Are there any plans to make a profit? Yeh I know paying divendends are unpopular with tax accounts, but VCs eventually do eventually say no. So I assume one day Google will have to make enough money to at least pay for its self.

    Afterall logging onto Google doesn't seem stimulate half a dozen porn 'n casino pages popping up, so that arn't getting their fractions of a cent that way.

  347. How does Google take it? by crashnbur · · Score: 2
    Okay, so we know that a web site servicing [enter insanely high number here] requests per minute and referencing [enter insanely high number here] web pages and boasting an insanely high bandwidth needs, well, a little bit more than what I'm writing from to get it done. My question: What's in Google's closet? What are the technical specs of the machines working behind the scenes? What are the limits?

    How does Google do so much so smoothly?

  348. Ad Syndication for Smaller Sites by firstmark · · Score: 1

    It would seem that google's advertising syndication efforts have been focused on only a few large web properties. This is ironic considering its all of the little small sites who spread word or mouth that grew google to where they are today. Overture and others use thousands of smaller affiliates while google just uses a few large sites for ad syndication. Are there any major technical or practical considerations that prevent cobranded solutions such as Earthlink uses being offered to loads of smaller sites? I am sure there are thousands of small sites that would link to google instead of PPC search sponsors if only header and footers were allowed for our ads. Add to that a revenue share and the best search results in the world and google could grow even faster than it has in terms of reach and ad revenue. So why no revenue programs for smaller sites at the present is there a technical reason? Is this being looked into technically at the present?

  349. Google Downtime by neeraj_iitd · · Score: 1

    I HAVENT seen Google not working till now - how do you ensure this kind of redundancy?

  350. Question to Silverstein by Anonymous Coward · · Score: 0

    1. What is your favourite colour?
    2. Do you ride horses?
    3. Do you sleep in the nude?

  351. Will you donate to the Wayback Machine? by teamhasnoi · · Score: 2

    How about giving a server?/Rackmount thing to the Wayback machine, so we can use the power of Google to navagate into the past! (their search sucks) Come to think of it, so does /.'s!

  352. Leaving Google? by Anonymous Coward · · Score: 1, Interesting


    There has been an open posting for a Director of Technology at Google for a few months now. Are you returning to the world of research or is HR confused?

  353. Regular Expressions by Anonymous Coward · · Score: 0

    Why not support regular expressions in a web query?

  354. Lightheartedness by TwistedTR · · Score: 1

    The staff of google seems to have a nice lightheared stance on things, and example being the recent April Fools "technology release", as well as repeated alterations to the google banner. Do you feel actions such as these help to lighten the spirts of a team of people who much be under great pressure to deliver.

  355. adwords by bilbobuggins · · Score: 2

    maybe this isn't the right guy to ask this question, but i've always wondered, was something like AdWords planned from the start or did you guys have to throw them up in a hurry because you found yourself low on cash?
    also, are there more plans for parts of Google like this that are strictly to make money or is most of the new coding focused on projects like what can be found on the labs page?
    is there ever any tension deciding what to focus on, things that make money or things that are useful or cool? do the engineers have any input or are these all management decisions?

  356. Cool ideas versus just getting the code working by Wile+E.+Heresiarch · · Score: 1

    I wonder how cool ideas at Google fare against the need to get into some serious code optimization, scaling problems, etc. By "cool idea" I mean something like the page rank algorithm. Does Google spend a lot of time coming up with more cool ideas, or do you already have plenty of ideas, and you need to invest more in developing the existing ideas? How many developers are there for every ideas person -- 1 to 1, 10 to 1, 100 to 1 ??

  357. OT: IE6 by crisco · · Score: 2
    IE6 is a significant upgrade, it is the first IE/Win browser to come significantly closer to implementing W3C standards correctly. Unfortunately it's 'standards mode' is only on when the correct doctype is declared, leaving it acting like the old IE browsers for most of the pages out there. But at least now pages can be standards compliant and get closer to looking the same across standards compliant browsers.

    And yes, IE6 is WinXP, IE5 is Win98 SE, IE4 is Win98. I think Win2k shipped with IE5 and WinME shipped with 5.5 but I could be mistaken.

    --

    Bleh!

  358. Hard to make an inroad by bigsexyjoe · · Score: 1
    When google started Yahoo was the super-dominant search engine. It seemed that no one could overthrow them. Why do you think google was able to become successful? Why did some people stop using Yahoo when it got the job done to try some new search engine? Did Google take any actions to help create a buzz? Do you think you'll overthrow Yahoo? Do you think a better search engine could come along and destroy you? How do you make sure this doesn't happen?

    Thank you for your time. Thank you for providing a great search engine?

  359. Fine-tuning Googlebot's visits? Canonical URLs by Anonymous Coward · · Score: 0

    Do you have / plan to have any way of using canonical URLs to get more accurate page hits/listings? If that sounds obscure, let me give an example from my own experience, which I'm sure is not the only one.
    To avoid the technical support problems that come from using cookies, we use URLs for customisation: (in our case, 10 regions and 28 timezones) - thus you might use /usa/-0400/ if you're using EDT. So people can link to any given page using up to 10x28=280 URLs, but in most cases the page doesn't vary whatever URL they choose, and there are never more than 4 actual variant pages coming out of all those URLs. Weird, but it works, and I'm sure that quite a few other sites adopt the same sort of strategy.
    Trouble is, Googlebot sees 280 possible URLs and sedulously visits them all: this wastes the robot's time, distorts page rankings, and clutters up Google's search results.
    I don't know of any way to tell Googlebot that "all these linked URLs are variants of one canonical URL, so use that instead" - I can imagine that simple rewrite rules could be put in robots.txt, or we could use "303 See Other" or something... Is there a way? Is it documented? Ought to be a way? (that can't be subverted by evil webmasters trying to subvert the page ranking system).
    Do you have any pigeons working on this?

  360. beautiful?? by Anonymous Coward · · Score: 0

    man, she looks like david spade.. that aint beauty..

  361. Google toolbar redundant in Moz? by oojah · · Score: 1

    Just curious, but which part of the toolbar do you use that isn't already supported in Moz?

    I only ever used the google toolbar for searching - ie. never any of the other fluff, whatever it was :)

    Moz lets you search google (or other sites or your choice) directly from the url bar so I no longer have the need for the google toolbar.

    Cheers,

    Roger

    --
    Do you have any better hostages?
  362. Google Answers (http://answers.google.com) by Martin+Spamer · · Score: 2

    What and When are you planning to to do about the frauds being perpetuated on Google Answers (http://answers.google.com) with questioners paying their own alias instead of the expert poster who deserves to be paid. They are destroying the cedibility of what has the potential to be an absolutely brilliant service.

    I think you need to be a much more strigent about suspending and banning these abusers if you wish this service to take off.

  363. I can answer this question for them by AnonymousHero · · Score: 1

    It's called Netscape 4.7.

    I think the real question is whether Google could singlehandedly eliminate Netscape 4.7 by switching. They'd be the savoir of webmasters... er ... more so.

    1. Re:I can answer this question for them by Ravagin · · Score: 2

      I don't think so. There are easy ways to turn off styling for NS4, leaving the page usable but not so pretty. Anyway, I think it's still possible to make it look good in ns4.

      NS4 doesn't deserve any love from us web creators anyway....

      --

      Karma: T-rexcellent.

  364. Topic Sensitivity by Calum+I+Mac+Leod · · Score: 1

    PageRank was the innovation that made Google's results better than the rest. This, along with the quick loading, uncluttered interface was why people switched to Google.

    When is Google going to move from an 'importance' method to a 'theme' method; boosting pages that are widely cited from pages relevant to the query, not just widely cited pages?

  365. Would yo do it again? by kistel · · Score: 1
    Suppose Google doesn't exist, but you have your own knowledge, experience, team etc. for building such a system. Would you do it? What would you do differently? The initial design, the marketing, the techs? Would it be a free or a subscription based system?

    (That's one question, of course, only it has four question marks :-)

  366. How core to Google is google.com? by paulrollo · · Score: 1

    Google provide data both directly and indirectly to the searching public through the Google sites and through partnerships with other sites. The recent launch of the API beta program suggests a move towards encouraging results to be delivered by others. Syndication of Adwords lessens reliance on the Google site for income from that stream.

    By ceasing to provide a search function directly Google would benefit from removing the onerous responsibility of providing customer service to millions of search users, savings in hardware and bandwidth and remove the situation where you are in competition with paying customers for providing search results.

    How core to Google's business is the provision of public access to the Google database through the www.google.tld sites?

  367. Google Answers by Martin+Spamer · · Score: 2


    There is the Google Answers Service (answers.google.com) only problem is it riddled with frauds.

    1. Re:Google Answers by LegendLength · · Score: 1

      Can you expand on that a bit? I can't think of how anyone could defraud it.

  368. Cache and Copyright by Hellkitten · · Score: 2, Interesting

    There has been discussions about the legality of caching material that is copyrighted, do you have any thoughts of this?

    Have you considered any technical solutions to this problem?

    eg: a cache.txt file that tells robots if their allowed to cache a site as opposed to robots.txt that tells if they're allowed to index the site. Or perhaps the other way around that people can have a cache.txt that tells robots that they are allowed to cache a site. This would let people opt-in instead of opt-out and would also eliminate a lot of unnessesary pages. (I wouldn't mind if you cached my homepage but I won't expect anyone to be interested in the cached copy)

    --
    - We are the slashdot. Resistance is futile. Prepare to be moderated -
  369. This information is almost two years old by t482 · · Score: 1

    Particularlly the scalability issues mostly have been resolved -if not in the 2.4.18 kernel then in the 2.5 dev series.

    Anthony

  370. 1.0*10�100 by rgoer · · Score: 1

    At the rate the web has been growing, how long do you think it will be before Google is actually indexing 1.0*10100 pages at a time? How long would it take, under the current setup, to perform a string search of that much web space? How do you plan on growing the tech so that such hefty tasks won't daunt the pigeon flock (for that matter, how many pigeons will it take to search through an index containing 1.0*10100 items?)

  371. Hardware failures by Tet · · Score: 2
    Google has around 10,000 rack-mount servers.

    This begs the question... how do they cope with hardware failures? Even using the wildly exaggerated MTBF figures published by manufacturers, that's a significant number of failures *every* day. Does Google have dedicated hardware techs running round replacing broken drives, fried memory and faulty power supplies?

    --
    "The invisible and the non-existent look very much alike." -- Delos B. McKown
  372. What if you had only one user? by splattertrousers · · Score: 1
    Your web site says you have 10,000 servers to handle searching an archive of 2 billion pages at incredible speed (e.g., it took Google 0.05 seconds to find 225,000 matches for "Carlos Santana").

    How many servers would you need to have if you only had one user? (That is, how many of the servers are there for faster searches, and how many are there for handling all the users?)

  373. Alias by Martin+Spamer · · Score: 2

    http://science.slashdot.org/comments.pl?sid=34509& cid=3742297

    1. Re:Alias by LegendLength · · Score: 1

      That link just took me back to this slashdot article?

  374. Ph.D.s by splattertrousers · · Score: 1
    Your web site says you have 50+ Ph.D.s.

    What percentage of them are CS Ph.D.s who focused on something that relates directly to what Google does? Do they get paid significantly more than the average senior technical Google employee? Are they treated differently (as they often are in biotech), or are they mixed in with the rest of the folks? Do you feel that having so many Ph.D.s it is a competitive advantage so you sought them out, or is it just something that happened?

    (It sounds like I'm asking because I'm considering getting a Ph.D., but I'm not. I'm just curious.)

  375. Taking Google Down by splattertrousers · · Score: 1

    When was the last time Google was taken offline (for upgrades, to fix a problem, etc.)?

  376. MOD PARENT UP by Phrogz · · Score: 2

    This deserves more than a 3. PNG really is the lossless image format of the future, and needs adoption on as many fronts as possible.

  377. PNG files in the image archive? by Phrogz · · Score: 2

    Google's image archive is fantastic, but it seems to only archive PNG and JPG files. Given Google's general trailblazing attitude on algorithms and technology, it would seem appropriate that more media would make it into the archive than just these legacy formats, like PNG files.

    Google is crawling Word docs and PDF files nowadays...what other media types are in store for the future?

    1. Re:PNG files in the image archive? by Phrogz · · Score: 2

      Ack...of course I meant to write "...to only archive GIF and JPG files"

      Please correct the question appropriately if it makes it high enough to be submitted.

      PS - Hi Craig! :)

  378. "Other" Languages? by Dubber · · Score: 1

    While I admit I am a native english speaker (which means I get to trash the language without appearing completely stupid and/or uneducated) I read several European (live and dead) languages passably.
    Are you going to be expanding your non-latin alphabet coverage? How do you reconcile the various codepages and character sets to return a comprehensive (language independent) results set? As a codicil, do you plan to incorporate other translation features to make "other" language pages available in other languages besides english? (e.g. translate a Chinese page directly into Greek instead of Chinese to English to Greek as it curenly seems)

    --
    Your complaints about being offended offend me.
  379. Dublin Core metadata by joeclark1159 · · Score: 1

    I know Google relies heavily on links to weight its search results, but it also uses meta keywords. When will Google reward those of us who slog through the process of creating Dublin Core metadata? When will Google use that metadata to index pages? While some people load up meta keywords with extraneous entries, Dublin Core users are likely not to do that, producing more reliable self-indexing information.

    Just a thought.

  380. What about Linux on zSeries by qwepoi198273 · · Score: 1

    Have you thought about giving this a shot? Once you get as big as you, this has gotta make sense!

    --
    I've wasted a lot of money in my life, the rest I spent on motorcycles and women.
  381. Avoiding the "Freashmeat Effect" by Shlomi+Fish · · Score: 1

    Pardon me for the shameless plug, but as you see it has some relevance to the comment. I am the author of Freecell Solver, which is a library and a stand-alone command line program that solves games of Freecell and similar solitaire variants. Now, since its first public version, I posted announcements for its subsequent releases on Freshmeat, each time announcing new features.

    As a side-effect of this publicity, the Google search for "freecell solver" yields almost exclusively hits that has something to do with it. However, the query itself is generic enough that a user would just want to find a solver for Freecell, not necessarily my own.

    I call this phenomenon the "Freshmeat Effect", albeit it is by no means restricted to Freshmeat. Is there anything Googlers plan to do to restrict such clogging of searches by constant publicity of a package with a mis-chosen title?

    (Refer here for a slightly earlier record of this effect, and a call for developers to use original names to avoid it.)

    --
    We have two eyes and ten fingers so we will type five times as much as we read. http://www.shlomifish.org/
  382. hi Craig Silverstein by Anonymous Coward · · Score: 0

    i was wondering if you had ever cuckoo for Cocoa Puffs?

    http://www.google.com/search?hl=en&ie=UTF-8&oe=U TF 8&q=cocoa+puffs

  383. correct link by bcaulf · · Score: 1

    Original link had a gratuitous space courtesy of slashcode. Correct link: http://science.slashdot.org/comments.pl?sid=34509& cid=3742297

    Body:

    Google Answers (http://answers.google.com)

    by Martin Spamer on 05:27 AM -- Friday June 21 2002 (Score:2) (#3742297)
    (User #244245 Info) http://www.kitv.co.uk/ [ Neutral ]

    What and When are you planning to to do about the frauds being perpetuated on Google Answers (http://answers.google.com) with questioners paying their own alias instead of the expert poster who deserves to be paid. They are destroying the cedibility of what has the potential to be an absolutely brilliant service.

    I think you need to be a much more strigent about suspending and banning these abusers if you wish this service to take off.

  384. Re:CLIT says take it down. by Anonymous Coward · · Score: 0

    I didn't know you had to be white to be intelligent and successful.
    Let me ask you this: What's it like to be a total dick?

  385. Re:CLIT says take it down. by Anonymous Coward · · Score: 0

    Drummond was first introduced to Google in 1998 as a partner in the corporate transactions group at Wilson Sonsini Goodrich and Rosati, one of the nation's leading law firms representing technology businesses. Drummond served as Google's first outside counsel, and worked with Larry Page and Sergey Brin to incorporate the company and secure its initial rounds of financing. During his tenure at Wilson, Sonsini, he worked with a wide variety of technology companies, advising them on all aspects of their business and financial activities and helping them manage complex transactions such as mergers, acquisitions and initial public offerings.

  386. future of Google API? by xerofud · · Score: 1

    Curious if there are any plans to extend the Google API for web searches to include searching the Google Groups archives?

  387. Havent seen this question pop up yet. by Dr_Auknix · · Score: 1

    Putting aside the techie style questions; Who at google owns the coolest car ? Romero has his Hummers and Ferraris. Carmack has his twin turbo Ferraris some making upwards of 1000 horsepower at the wheels. Gates has some Porsche 959's that arent legal to drive on the roads in America. Do you drive anything out of the ordinary ? what about your coworkers ?

  388. Just how? by Splab · · Score: 1

    I was wondering how you find sites...

    A friend of mine are running linux at home and use some kind of webbased quing system for his mp3 files (yes yes all legal of course *erhm*), he mentioned that every once in a while the google bot drops by and ques some of the files, so the question is, how did it ever find his site given the fact that probably no one links to his mp3 que list? (and what is the prefered music of the bot?? (seems to go for the dance/techno genre))

  389. 0 Pageranks and Google Penalties by nutsandbolts · · Score: 1

    I will keep this brief! 0 Pageranks and site penalties are used on a site by Google if they link to "bad neighbourhoods" - but what happens if the Webmaster does this without knowing the full dangers and then, once aware of dangers, removes the dodgy links? Currently there is no response from Google to repeated requests on restoring a sites rank to "normal". Despite many, many e-mails people never get a reply. Personally I've had 2 sites restored but 5 of my sites are still affected despite the fact they are so clean they squeak! As it's over 6 months since this happened, does Google plan on lifting such penalties completely soon? Or at least giving some time to such questions via E-mail to people?

  390. Yeah... by Anonymous Coward · · Score: 0

    Well yeah, she's probably not too much of a nerd.

    But do you need a degree to be a nerd?

    She's pretty for the older woman category, which is probably what a lot of the other posters are protesting. Nice eyes. But unfortunately no pictures on Google other than that one :)

    And given her position, you won't be seeing her outside of respectable wear, as it'd be a bad career move for her.

    Too bad.