Slashdot Mirror


How Many Google Machines, Really?

BoneThugND writes "I found this article on TNL.NET. It takes information from the S-1 Filing to reverse engineer how many machines Google has (hint: a lot more than 10,000). 'According to calculations by the IEE, in a paper about the Google cluster, a rack with 88 dual-CPU machines used to cost about $278,000. If you divide the $250 million figure from the S-1 filing by $278,000, you end up with a bit over 899 racks. Assuming that each rack holds 88 machines, you end up with 79,000 machines.'" An anonymous source claims over 100,000.

476 comments

  1. Nice Rack! by turnstyle · · Score: 4, Funny

    No wonder I'm'a Googlin'

    --
    Here's what I do: Bitty Browser & Andromeda
    1. Re:Nice Rack! by p00p+at+instable.net · · Score: 0

      Gotta love Booble! Err, Google

    2. Re:Nice Rack! by djsmiley · · Score: 1

      Anyone relsied yet the STUPID mistake in that report...

      80,000 something cpus.....

      doesn't mean 80,000 something computers.

      I talked to the guy that helped write one of the first nodel programs for it, he reckoned they they had under 1k of machines.

      V.V.V.clever programing (think mega gridcomputing)

      --
      - http://www.milkme.co.uk
    3. Re:Nice Rack! by Igmuth · · Score: 1
      From the summary:
      88 dual-CPU machines

      Your point?
    4. Re:Nice Rack! by jo42 · · Score: 1

      > Nice Rack!

      Me thinks you want Booble...

  2. What is that as a percentage ... by Alain+Williams · · Score: 3, Interesting

    * of servers in the world
    * of servers in the USA
    * of servers running Linux

    1. Re:What is that as a percentage ... by Anonymous Coward · · Score: 5, Funny

      Well, let's count. I have two servers at home and 8 at work. They all run linux. Now, if everyone else in the world joins this thread, we can find out.

    2. Re:What is that as a percentage ... by Phurd+Phlegm · · Score: 3, Funny
      "* of servers in the world"

      0.0001%

      * of servers in the USA

      0.00000045%

      So there are 222 times as many servers in the United States as in the entire world, and there are 1.96e13 servers in the USA. Too bad there isn't a "-1 bad arithmetic" moderation....

    3. Re:What is that as a percentage ... by Anonymous Coward · · Score: 0

      Even if everyone in the world replied, if they post as you did it wouldn't work out right at all. If more than one person works where you work you'll end up with the same machines counted multiple times.

    4. Re:What is that as a percentage ... by JWSmythe · · Score: 1

      Score us for 200 servers, 180 running Linux, all in the US. Oh, and just two admins running the show. :)

      --
      Serious? Seriousness is well above my pay grade.
    5. Re:What is that as a percentage ... by Lispy · · Score: 4, Funny

      Nono, only those you have root access to. Post your pass as proof.

    6. Re:What is that as a percentage ... by jrockway · · Score: 1

      One linux server, three linux worstations at home :) So far 100% of computers are Linux!!

      --
      My other car is first.
    7. Re:What is that as a percentage ... by identity0 · · Score: 2, Funny

      Hello, my name is Thabo Mugabe and I am a subsistance farmer in Africa. I have 5 head of cattle and a herd of sheep, of which none run Linux. Thank you for asking. :)

    8. Re:What is that as a percentage ... by Anonymous Coward · · Score: 0

      Too bad there isn't a "-1 dickhead" morelike.

    9. Re:What is that as a percentage ... by theCoder · · Score: 4, Funny

      OK, it's 1-2-3-4-5 -- the same as my luggage!

      --
      "Save the whales, feed the hungry, free the mallocs" -- author unknown
    10. Re:What is that as a percentage ... by LinuxLuver · · Score: 1

      I'm in New Zealand. I have 5 Linux systems running at the mo' (2 servers / 3 desktops) and one lonely WinME desktop system. In other words.....over 80% Linux here at my house.

      --
      Only boring people are ever bored.
    11. Re:What is that as a percentage ... by thetaikung · · Score: 1

      haha I was SO expecting you to ask to help you access a deceased relative's root password and store his 25.5 Million (25,500,000) MB of porn. All you need me to do is post root and 30% of the porn is mine!

      --
      P226 .40cal
  3. $278k ?? by r_cerq · · Score: 5, Insightful

    That's $3159 per machine, and those are today's prices... They weren't so low a couple of years ago...

    1. Re:$278k ?? by Anonymous Coward · · Score: 1, Interesting

      You might be able to get machines slightly cheaper than retail if you, say, buy 79,000 of them.

    2. Re:$278k ?? by toddler99 · · Score: 4, Informative

      google doesn't buy pre-built machines they have been building costum machines from the very beginning. Although, with fab'n their own memory, i'm sure today they do a lot more. Google runs the cheapest most unreliable hardware you can find. It's in the software that they make up for the unreliable hardware. Though unreliable hardware is ok so long as you have staff to get the broken systems out and replaced with a new unreliable cheap ass system. When google started they used lego's to hold their costum built servers together

    3. Re:$278k ?? by Dave419 · · Score: 0

      I think this article should read Google should have 79,000 machines based on what they spent. In another /. posting about sparc, google was saying they were never going to buy high end servers again because they are alternatives where you balance quanity with quality and get the same reliability and performance for less money. I'm paraphrasing a bit here, but whoever inferred that tidbit of information made a bad assumption when they wrote that article, why wold google lie about how many servers they had or how much it cost?

      --
      ~ there are 10 types of people in this world, those that can read binary and those that can't
    4. Re:$278k ?? by hjf · · Score: 3, Flamebait

      so it means if you are smart enough, you don't need to have a $1,500,000 Sun server or that kind of shit. leave that for big corporations with lame-ass programmers.
      imagine what google could do with that kind of shit

    5. Re:$278k ?? by Gilk180 · · Score: 5, Interesting

      I really doubt they are spending anywhere near this for the machines themselves. A former student a google employee made one of those recruiting/marketing visits to my university last semester. I got to speek to him at length about Google's operation. According to him (and he had pictures to back this up). All of their boxen are a motherboard, an ide drive and a processor sitting on a shelf in the rack. No cases, no fans, no cd, etc. Plus they buy in bulk and get good prices.

    6. Re:$278k ?? by Starborn · · Score: 1

      Google just use a different approach to the problem - intead of buying 'better' hardware, they buy more cheap hardware.

    7. Re:$278k ?? by jarich · · Score: 2, Insightful
      Agreed. If you are able to code in your fault tolerance, it's a heck of a lot cheaper than buying it.

      What's cheaper... buying a round robin DNS router (hardware) or coding your client to try the next web server in it's list (software). Now, multiply that savings for every customer you sell to.

      The problem is finding someone who knows how to do that robustly and reliably. Most places have troubling finding developers whose programs don't crash every 15 minutes. This sort of thing is a little more advanced.

    8. Re:$278k ?? by sql*kitten · · Score: 5, Insightful

      so it means if you are smart enough, you don't need to have a $1,500,000 Sun server or that kind of shit. leave that for big corporations with lame-ass programmers. imagine what google could do with that kind of shit

      The difference is that if Google loses track of a few pages due to node failure it's no big deal because a) they don't guarantee to index every page on the web anyway and b) the chances are that page will be spidered again in the near future - and it may not even still exist anyway.

      Your bank, on the other hand, can't just "lose" a few transactions here and there. FedEx can't just lose a few packages there and there. Sure they occasionally physically lose one, but they never lose the information that at one point, they did have it. Your phone company can't just lose a few calls you made and not bill you for them. Your hospital can't just lose a few CAT scans and think oh well, he'll be in for another scan eventually.

      Now, I'm not saying that Google's technique isn't clever - I'm saying that it can't really be generalized to other applications. And that's why very smart people - and big corporations can afford to hire very smart people - keep on buying Sun and IBM kit by the boatload.

    9. Re:$278k ?? by Robotech_Master · · Score: 1

      They may well have spent a lot more on their machines back then.

      The key word here is, say it with me, de-pre-ci-a-tion--the accounting process by which an asset's value declines over time. The machines may have cost more when they were bought, but they're not worth that much now.

      --
      Editor Emeritus and Senior Writer, TeleRead.org
    10. Re:$278k ?? by It'sYerMam · · Score: 1
      I'm sure they could find something more durable and less expensive than lego to hold their machines together.
      Mind, if they just replaced their existing kit with the alien lego, they'd get the advantages of alien technology for about 50 per go.

      Incidentally, you have one lego brick, two lego bricks, one piece of lego, two pieces of lego and some lego. Lego does not refer to the individual pieces. (You can't correctly say "a lego")

      --
      im in ur .sig, writin ur memes.
    11. Re:$278k ?? by Anonymous Coward · · Score: 2, Informative

      Google Filesystem replicates same data on three nodes (by default, can be configured to more), so the probability of data loss is rather small. Source here.

    12. Re:$278k ?? by geniusj · · Score: 5, Insightful

      I can confirm this as well.. I have seen their racks in Equinix in Ashburn, VA. I pass by their cages every time I go to my cage there. I believe I also saw them in Exodus in Santa Clara a couple of years ago. They are 1U half depth and do indeed lack a case. There are definitely thousands of their servers in Ashburn, VA, and they are very space efficient (as they would need to be).

    13. Re:$278k ?? by Anonymous Coward · · Score: 5, Informative

      The high-end Sun machines are designed for high availability. Not only will a CPU failure not crash the machine, the CPUs are hot swappable so you can replace a failed CPU without so much as a reboot.

    14. Re:$278k ?? by sql*kitten · · Score: 5, Informative

      Any of those 64 CPUs fails, and your system will crash.

      Doesn't work like that, kid. A CPU on a high-end Sun fails, and the system will keep on running. You can swap the CPU out and replace it with a new one, the system will simply pick it up, assign threads to it, and keep on running. Had a couple of CPUs fail a little while ago... the first we users noticed of it was that the application slowed down slightly. Sysadmin just said yeah, I know, I'll replace 'em when the parts arrive this afternoon. Cool, we said. No data lost, no need to shut down or even restart our app. 'Course you gotta architect your app to deal with that - like don't have just one thread that does a crucial task, 'cos there's a chance that might be on the CPU that fails. But still, it's no big deal.

    15. Re:$278k ?? by Anonymous Coward · · Score: 5, Informative

      Dude, big iron is not comparable in the slightest to that dinky little dual PPro Linux 'server' you keep in your closet. A CPU can fail, on a live running system, and the machine and Solaris or AIX won't even hiccup. Your application will notice, because suddenly a couple of its threads will quit, but that's ok, software like Oracle already knows how to deal with failed transations. And if you can schedule a CPU board removal/swap, then there won't be ANY problems at all, as the OS will migrate threads to other CPUs and allow the removal or hardware.

      And hey, if you want to mix and match CPU types (uSparc 2 and 3, etc), speeds, etc, no problem either. So if you wanna upgrade your server's CPUs, there will be zero downtime, you just do it a board at a time (board = 2 or 4 CPUs).

    16. Re:$278k ?? by Anonymous Coward · · Score: 0

      hotmail used to be the same way, dunno how they are now though.

    17. Re:$278k ?? by Anonymous Coward · · Score: 0

      go away troll

    18. Re:$278k ?? by mikis · · Score: 2, Informative

      google doesn't buy pre-built machines

      Yes, they do.

    19. Re:$278k ?? by drinkypoo · · Score: 1

      Your bank can employ redundancy. Just commit every transaction as many times as the available capacity of the system will allow. During peak periods the system will be theoretically less reliable but you will be able to use the odd lost transaction to pinpoint areas in which your system is lacking, and fix them. Never commit anything via less than three paths, that should be sufficient.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    20. Re:$278k ?? by jburroug · · Score: 4, Interesting

      Your hospital can't just lose a few CAT scans and think oh well, he'll be in for another scan eventually.

      You've never worked in a medical field have you? You'd think that that would be a big deal and in theory data integrity is a very high priority but in reality...

      I used to work as the IT Manager for a diagnostic imaging and cancer treatment center (and still do contract work with them because my replacement is kind of a noob) While loosing studies isn't exactly a "no big deal" situation it's still far more common than patients will ever realize. The server that stores and processes all of the digital images from the scanning equipment is a single CPU home rolled P4 using some shitty onboard IDE raid controller (doesn't even do RAID5!) running Windows 2K. The most money I could get for setting up a backup solution was the $200 an external firewire drive cost. Somehow we never managed to loose a study once it reached my network in the 9 months I worked there but I know three or four were deleted from the cameras themselves before being sent properly so whoops it's gone, gotta reschedule (and bill their insurance or Medicare again!) Two weeks ago one of the drives in that 0+1 array failed and despite my pleadings they still haven't ordered a replacement yet...

      Now it's tempting to think that this place is just a special case of cheapness and sloppiness but from talking to the diagnostic techs (the people that operate the cameras) that's not so. That clinic is a little worse than average in terms of loosing patient information but by no means the worst some of them at seen/heard of/worked at in their careers. It's worse in general at small facilities but even large hospitals often suffer from the same unprofessionalism.

      Your bank and the phone company keep much better track of your calls or your ATM transactions than most hospitals do with your CT or MRI scans...

      --
      "Listen: We are here on Earth to fart around. Don't let anybody tell you any different!" - Kurt Vonnegut
    21. Re:$278k ?? by Anonymous Coward · · Score: 0, Informative

      You are dreaming. FWIW, I am responsible for 5 15k's and lots of 6800's. Sun expects that a 15k domain with 60 CPU's and 480 DIMMs will experience 7 - 8 hard failures per year.

      When a Sun server loses a CPU or DIMM it will crash. The benefit is that it will generally come back up with those parts blacklisted so you won't see another crash from the same parts. When the parts are replaced, you can bring the new ones back without taking an outage. If you happen to see that you are getting soft errors (e.g. persistent ECC corrections), you can take it offline preemptively without causing an outage.

    22. Re:$278k ?? by Anonymous Coward · · Score: 0

      Don't have a thread perform a critical task? Where, then, are the critical tasks going to be performed? On a ribbon?

    23. Re:$278k ?? by Anonymous Coward · · Score: 2, Informative
      DIMMs is one thing, CPU's are another.
      If its a soft error - "cosmic ray" or what ever then it will log it and keep going. For CPU's if you haven't affinitied any processes/threads to it you should be able to do :
      psradm -v -f cpuid
      To take the processor offline yourself, obviously failing is not exactly the same but I thought I have had some fail without crashing. - Unless, of course you are trolling and if so, you got me.
    24. Re:$278k ?? by Anonymous Coward · · Score: 0

      He was saying have more than one thread that can perform [any given] critical task because a thread/CPU can be killed.

      I guess you could have a PHB thread that makes sure there are surviving thread for each task type and if not it would hire new low-cost LWT's that did that task or outsource those critical tasks to another process, preferably a process on a low-overhead Indian CPU...

    25. Re:$278k ?? by omeomi · · Score: 1

      Your phone company can't just lose a few calls you made and not bill you for them.

      I lived in an apartment for almost a year where we didn't get a single long-distance bill (and, yes, we made plenty of long-distance calls). They eventually started billing us for long distance, but they never did mention the old bills that we never received or paid...

    26. Re:$278k ?? by Anonymous Coward · · Score: 0

      Now, I'm not saying that Google's technique isn't clever - I'm saying that it can't really be generalized to other applications. And that's why very smart people - and big corporations can afford to hire very smart people - keep on buying Sun and IBM kit by the boatload.


      This can be generalized a lot more than you imply, including to bank software. Just because you have a machine fail doesn't mean you lose data. I'd agree that big companies can afford to hire very expensive people and keep wasting money on Sun and IBM by the boatload. They certainly don't need to though.

    27. Re:$278k ?? by Hans+Lehmann · · Score: 2, Funny
      When google started they used lego's to hold their costum built servers together

      They clearly had to put an end to that practice once they reached their first 10,000 machines or so. Have you looked at the price of Legos lately?

      --
      09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0
    28. Re:$278k ?? by devilspgd · · Score: 1

      Don't have ***ONE*** thread perform a critical task. Or at least, don't do it without something redundant checking the data.

      --
      Give a man a fish, he'll eat for a day, but teach a man to phish...
    29. Re:$278k ?? by Anonymous Coward · · Score: 0

      I can understand why your no longer employed with that company and question your abilities as a consultant considering the company is unwilling to allocate the required resources to do the job properly. It is part and parcel your job to insure the customer gets what they need even when the customers understanding is lacking. Sometimes you have to force the issue. As long as your credible they should comply. If they don't find you credible, then your out of a job. Again. Credibility comes in part by ones ability to communicate and contextual errors in the utilization of simple words and phrases does cause credibility to suffer.

      Ok. Once more with feeling.

      Lose ... "you will lose the job"
      Loose ... "the screw was loose"
      Loss ... "your loss is another mans gain"
      Lost ... "you lost the job"
      Loser ... "which makes you a loser"

      Now I'm no grammer nanny but these are simple words and simple concepts. Know them. They are your friends.

    30. Re:$278k ?? by Anonymous Coward · · Score: 0

      OK, so the "loose" vs "lose" thing bugs the hell out of me too... but maybe you should check your usage of "your" vs "you're" before you start throwing stones. Glass houses and all, y'know?

      "your" is the possessive form of "you"
      "you're" is a contraction of "you are"

    31. Re:$278k ?? by tekunokurato · · Score: 2, Informative

      Funny--I've spent LOADS of time on the consumer side of Brigham & Women's/Dana Farber in Boston getting various operations, chemotherapies, catscans, etc, and seeing many different doctors and nurses (don't worry I'm fine now ^_^). I'm consistently impressed with both the ubiquity and the reliability of their information systems. They're extremely universal and always seem to display quite simply exactly what Medical Care Personnel X needs to access. Perhaps a model to suggest to your clinic.

    32. Re:$278k ?? by nelsonal · · Score: 1

      They spent the vast majority of that cash over 2003 and 2004. Look in their cash flow statement under capital expenditures. Figure most of the 50 million building was over the recent years, but it was somehting like 80 million in 2002, 200+ in 2003 and 60-80 so far in 2004. I am going by memory (read the thing on Thurs).

      --
      Degaussing scares the bad magnetism out of the monitor and fills it with good karma.
    33. Re:$278k ?? by Dephex+Twin · · Score: 1

      Don't forget that he spelled "grammar" wrong.

      --

      If you want to make an apple pie from scratch, you must first create the universe. -- Carl Sagan
    34. Re:$278k ?? by destiny_uk · · Score: 1

      I suppose the thing about CAT scans is you do one, look at it straight away, and if you see a massive tumour, you know there's something wrong. If it gets lost at some time in the future, well, it wasn't current data anyway, they'd probably be much better taking another one to see how you are progressing...?

    35. Re:$278k ?? by Craigy · · Score: 2, Informative

      As it happens on high end (regatta-class) pSeries Kit from IBM, you don't need to worry about how your app works either as the firmware reassigns your (single, very important) thread to a still-working CPU. Craigy

    36. Re:$278k ?? by Anonymous Coward · · Score: 0

      Everyone is missing the point here: The parent is correct.

      Depending on the nature of the failure a domain will either recover from the error and keep running or panic, reboot and blacklist the component. I have personally witnessed both types of behavior.

      Sounds like we have a lot of book knowledge running around here, but then again that's typical of Slashdot.

    37. Re:$278k ?? by malfunct · · Score: 1

      Its crazier than that, Google runs under the premise that switching out a machine is a large cost so they let the machine die and they may not get to it for months. The idea is to make thier system so overengineered that they don't spend very much maintaining it. As far as I can tell it works well for them.

      --

      "You can now flame me, I am full of love,"

    38. Re:$278k ?? by TNLNYC · · Score: 1

      Cool... Real info... I've learned from another thread that they seem to be using machines from rackable. Based on the info on the rackable site, it looks like they are using C2000 series. If you get a chance, could you check out if that assumption is correct (disclaimer, I'm the author of the estimates listed at the top of the post :) )

      --
      Check out http://www.tnl.net/blog
    39. Re:$278k ?? by treat · · Score: 1
      The high-end Sun machines are designed for high availability. Not only will a CPU failure not crash the machine, the CPUs are hot swappable so you can replace a failed CPU without so much as a reboot.

      This is a LIE. In a domainable system, a failed CPU will cause only that domain to crash. But that is it. In a domainable system, a separate domain might as well be a separate system. A failed CPU will CRASH THE KERNEL THAT KNOWS ABOUT THAT CPU!

      How did a lie like this get modded to +5 when my truthful statements got modded to -1. Is this based on Sun marketing lies?

    40. Re:$278k ?? by treat · · Score: 1
      Doesn't work like that, kid. A CPU on a high-end Sun fails, and the system will keep on running. You can swap the CPU out and replace it with a new one, the system will simply pick it up, assign threads to it, and keep on running.

      This is a LIE. The BEST you will get is that it detects what failed and after the PANIC AND REBOOT you will have that board or CPU disabled.

      Cool, we said. No data lost, no need to shut down or even restart our app. 'Course you gotta architect your app to deal with that - like don't have just one thread that does a crucial task, 'cos there's a chance that might be on the CPU that fails. But still, it's no big deal.

      Even if what you were saying is true, which it is not, what is the likelyhood that a multi *threaded* application could handle the death of an individual thread at a random instant? Probably about 0% of multithreaded applications can handle that. If they're using fork() they will probably be OK. But this is irrelevant because when a CPU fails your ENTIRE SYSTEM PANICS AND REBOOTS no matter what kind of fancy Sun hardware you are using.

      WHY ARE YOU LYING?

    41. Re:$278k ?? by treat · · Score: 1
      To take the processor offline yourself, obviously failing is not exactly the same but I thought I have had some fail without crashing. - Unless, of course you are trolling and if so, you got me.

      If you were FORCED to do this because the CPU was bad, it is already TOO LATE. Perhaps you got a correctable error, e.g. on that CPU's cache and you disabled it as a precaution. But the only time I've used psradm to disable a CPU was after a PANIC AND REBOOT.

    42. Re:$278k ?? by treat · · Score: 1
      Dude, big iron is not comparable in the slightest to that dinky little dual PPro Linux 'server' you keep in your closet. A CPU can fail, on a live running system, and the machine and Solaris or AIX won't even hiccup.

      This is NOT TRUE. Solaris will CRASH.

      Your application will notice, because suddenly a couple of its threads will quit, but that's ok, software like Oracle already knows how to deal with failed transations.

      If Oracle or Sybase or any other application I've EVER SEEN has one of its thread crash the WHOLE THING IS FUCKED. What if that thread held a lock? What you say is hopelessly stupid.

      And if you can schedule a CPU board removal/swap, then there won't be ANY problems at all, as the OS will migrate threads to other CPUs and allow the removal or hardware.

      There's a big difference between disabling the hardware and then removing it, and having the hardware fail. Once it has failed there has ALREADY BEEN MEMORY CORRUPTION and the system *MUST* PANIC AND REBOOT or it only risks propogating the corruption.

      And hey, if you want to mix and match CPU types (uSparc 2 and 3, etc), speeds, etc, no problem either. So if you wanna upgrade your server's CPUs, there will be zero downtime, you just do it a board at a time (board = 2 or 4 CPUs).

      Funny, I was just in a sales meeting with Sun in which they promised the feature of mixing CPU speeds *IN THE FUTURE*. Today you can ABSOLUTELY NOT MIX CPU TYPES and mixing clock speeds is a MARKETING PROMISE FOR THE FUTURE.

      Most of your statements are lies, the remainder are simply wrong.

    43. Re:$278k ?? by sql*kitten · · Score: 1

      This is a LIE. The BEST you will get is that it detects what failed and after the PANIC AND REBOOT you will have that board or CPU disabled.

      The feature is called "CPU Offlining" - go look it up. The CPU will be flagged as bad at the next reboot it's true, but you don't have to reboot.

    44. Re:$278k ?? by treat · · Score: 1
      The feature is called "CPU Offlining" - go look it up. The CPU will be flagged as bad at the next reboot it's true, but you don't have to reboot.

      CPU Offlining is only available on the 3800 and up. CPU offlining only offlines a CPU after multiple corrected ECC errors in the ecache. It will not prevent your system from panicing after, for example, an uncorrectable ecache error.

      There's a big difference between disabling a suspected faulty component before it has failed and continuing after a component failure has caused memory corruption. It may be a nice feature, but the fact is that statements that Sun systems do not crash if a CPU or memory FAIL are simply WRONG. A single bit correctable error is not a hard failure.

  4. Can you imagine by Sadiq · · Score: 4, Funny

    Can you imagine a beowul.... oh.. wait..

    --
    SysWear - Geek T-shirts (UK/Europe)
    1. Re:Can you imagine by mkavanagh · · Score: 5, Funny

      can you imagine a beowulf cluster of karma in soviet russia whoring YOU, you insensitive cliched clod?

    2. Re:Can you imagine by Anonymous Coward · · Score: 1, Informative
      Three things:

      First, funny moderations don't affect karma so making a joke like that isn't karma-whoring as the user is not gaining karma.

      Second, you can turn off the meta-moderation messages by simply deselecting the check box that says "willing to moderate" in your user preferences - which is what I am assuming you want based on your somewhat cryptic sentence in all caps.

      Third, decaf is your friend.

    3. Re:Can you imagine by Isaac-Lew · · Score: 1

      AFAIK, Google uses their own clustering software (possibly derived from an existing project, but I wouldn't be able to confirm or deny).

    4. Re:Can you imagine by Anonymous Coward · · Score: 1, Funny

      ...

      profit!

    5. Re:Can you imagine by MoFoQ · · Score: 1

      why not mod him up just to tick him off....hehehe

      the only thing in that post that offended me was the "goat f*cker" part...the rest; I hear worse in the john.

      The classification of "goat f*cker" should be reserved for the Taliban, Al Qaeda, and other f*cks who have small penises (george carlin/freudian), beat their wives, and can only do it with a goat as they are being hunted down. Poop in, poop out; if you try to f*ck someone, you'll likely get f*cked back.....o wait...could that be? YES...it's KARMA!

      Salute to our fighting men and women; yes, women!

  5. Google, will you marry me? by Anonymous Coward · · Score: 5, Funny

    1) google is so pretty and smart
    2) google is worth so much money
    3) google has a huge rack!!

    1. Re:Google, will you marry me? by Anonymous Coward · · Score: 5, Funny

      whoops, forgot to sign the letter!

      Love,
      Yahoo.

    2. Re:Google, will you marry me? by slickwillie · · Score: 3, Funny

      Of course you can't marry Google, but you might have a chance with Sergey.

    3. Re:Google, will you marry me? by Bobdoer · · Score: 2, Funny

      There's only one problem: she's always being used by other men.

    4. Re:Google, will you marry me? by Hogwash+McFly · · Score: 1

      Are you feeling lucky, Mr Proposer?

      --
      Mother, do you think they'll like this sig?
    5. Re:Google, will you marry me? by madsh · · Score: 2, Funny

      Don't worry...

      just send it through your gmail...

      and you will also get a few adds about rings and flowers to go with that...

      Mads

    6. Re:Google, will you marry me? by Anonymous Coward · · Score: 0

      3) google has a huge rack!!

      Dear God! It's got over 900 of them...

  6. IPO changes things by Have+Blue · · Score: 5, Interesting

    There was an article recently about how Google constantly understates various statistics about itself to mislead potential competitors. This article also said that the SEC would not allow them to do this once they became a publically traded company.

    1. Re:IPO changes things by JPriest · · Score: 0, Insightful

      I am sort of suprised at the 250 million IPO though. Obviousy the IPO is not the value of the entire company, but the little company I work for that I bet almost nobody here has heard of, is actually worth nearly 80 times Google's IPO. Amazing how such a small company can have such a large impact.

      --
      Saying Java is nice because it works on all OS's is like saying that anal sex is nice because it works on all genders.
    2. Re:IPO changes things by danila · · Score: 1

      250 millions is only the cost of their servers (if you can't RTFA, read the blurb at least), the IPO is $e*10^9, or more than 10 times bigger.

      --
      Future Wiki -- If you don't think about the future, you cannot have one.
    3. Re:IPO changes things by Anonymous Coward · · Score: 0

      Why the hell is this comment moderated as troll???

    4. Re:IPO changes things by slim-t · · Score: 1
      Why the hell is this comment moderated as troll???

      Maybe it's a troll.

      I personally want to hear more about the poster's $40 billion small company that no one hear has heard of.

    5. Re:IPO changes things by fullofangst · · Score: 1

      Hi, nice post.

      Unfortunately the article said, "How many google machines, really?"

      Perhaps try reading next time rather than seeing the word "GOOGLE" and instantly hitting POST REPLYLYLYYYY "!!!1111

    6. Re:IPO changes things by nelsonal · · Score: 1

      Cargill might be that big (I'd guess more like $20 billion, outside of farmers no one really hears about them because they dont really care to have you know about them much less how big they are. Some of the bigger law firms and other partnerships in the country might be that big. Think along the lines of specialized business services, since to be that big you have to be national, and we would know about them if the sold goods to average folk.
      Usually private comapnies are undervalued compared to public companies so it is worth going public (as your company will be much more valuable (at the cost of quite a bit of information disclosure). Go read a 10-K and proxy for a company some time, they usually tell you quite a bit about their operations, income, assets, strategy, and other things. As a result companies that remain private, they do so to retain a signficant amount of secracy regarding their operations and other activities.

      --
      Degaussing scares the bad magnetism out of the monitor and fills it with good karma.
    7. Re:IPO changes things by slim-t · · Score: 1
      Cargill might be that big (I'd guess more like $20 billion, outside of farmers no one really hears about them because they dont really care to have you know about them much less how big they are. Some of the bigger law firms and other partnerships in the country might be that big. Think along the lines of specialized business services, since to be that big you have to be national, and we would know about them if the sold goods to average folk.

      I'm sure there are plenty of huge companies that most people have never heard of. To get that big, you've obviously got some customers, so some people with have to have heard about you.

      Cargill was the wrong example to give to me. I had a couple employees from Cargill using my pulp lab (at a paper mill) a couple weeks ago. They left a mess. Also, I think a girl I went to school with (elementary-high school) is married to a Cargill.

  7. Pretty Broad by haydenth · · Score: 1

    From the Article: "Google is managing between 45,000 and 80,000 servers"

    Last time I checked, there was a pretty big difference between 45,000 and 80,000 servers. I mean, 80,000 is almost twice 45,000.

    Cheers,

    --
    tom

    --
    - tom -
    1. Re:Pretty Broad by avalys · · Score: 4, Insightful

      Yes, but aside from dealing with hardware failures and other physical / logistical problems, there really isn't much of a difference between managing 45,000 computers and managing 80,000. They're both Really Big Numbers, and I'm sure whatever software they're using is scaleable enough to smoothly handle many more machines than that.

      --
      This space intentionally left blank.
    2. Re:Pretty Broad by Anonymous Coward · · Score: 0

      any broad with a rack like that is going to be a pretty broad!

    3. Re:Pretty Broad by Anonymous Coward · · Score: 0

      She's also going to be pretty broad.

    4. Re:Pretty Broad by victor_the_cleaner · · Score: 4, Funny

      Yeah it's kind of like:

      Your wife has slept with 80 other men, or was it 200?

      Either way, it's not good for you.

    5. Re:Pretty Broad by Anonymous Coward · · Score: 0

      Depends on how much she learned while she was in bed with those other men, doesn't it?

    6. Re:Pretty Broad by Ayaress · · Score: 0, Offtopic

      Or what they gave her. And I'm not talking about flowers.

    7. Re:Pretty Broad by Anonymous Coward · · Score: 0

      That would count as a hardware failure. The OP already admitted that they are an exception.

    8. Re:Pretty Broad by p0rnking · · Score: 1

      If you RTFA you'd see why there's a difference between 45,000 and 80,000.
      But for the lazy, here's the part where the author explains himself ...

      "If you divide the $250 million figure from the S-1 filing by $278,000, you end up with a bit over 899 racks. Assuming that each rack holds 88 machines, you end up with 79,000 machines. However, one must recognize that equipment is not all CPUs. As a result, you must discount the figure of $250 million to account for routers, firewalls, machines for employees, etc... So let's assume for a minute that only about $200 million is going to the CPUs. That still leaves us with 719 racks or a bit over 63,000 machines. Even if we discount other equipment to be costing $100 million, we end up with a bit over 47,000 machines on 539 racks."

    9. Re:Pretty Broad by marauder404 · · Score: 1

      The point of the estimate is not to deduce exactly how many servers that Google has, because that figure surely changes from week to week. It's to show that the previous figures of 10k-15k servers is totally off base and that they've long exceeded the previously-thought figures. It's to show that it's simply a really big number.

  8. Why do we care? by the_raptor · · Score: 4, Funny

    Seriously? What is the point of this article? What next? Linus found to prefer blue ink, over black ink?

    --

    ========
    CINC, 4th Penguin Legion
    1. Re:Why do we care? by saden1 · · Score: 1

      Seems like quite a bit of guestmation was done and the over all hardware cost analysis was shady to say the least. I'd like to announce that the Milky Way galaxy has billions of stars.

      --

      -----
      One is born into aristocracy, but mediocrity can only be achieved through hard work.
    2. Re:Why do we care? by 0xC0FFEE · · Score: 5, Funny

      Well, everybody knows that black ink is best for the job and that Linus prefers it.

    3. Re:Why do we care? by dsheeks · · Score: 1

      I agree and wouldn't mod your post as funny. This falls more in line with how many hairs do you have on your head. Unless that number is getting uncomfortably low, you don't really care as long as it does the job of keeping the sun off your dome.

      Does the number of machines google uses enable the engine to produce good results? THAT is a question of more importance and the answer seems to be more "questionable" all the time. I hope that trend reverses and we get reliable search results. Who cares if it takes 10,000 machines or 100,000 as long as Google works and can stay in business.

    4. Re:Why do we care? by hobbsbutcher · · Score: 1

      Blue ink is better for signatures so that you can tell it from the b&w printout.

      --
      Jonathan B.
  9. Which brings up an interesting question... by phrogeeb · · Score: 1

    For all the econ majors out there. What exactly does Google have to tell us now that it's an IPO?

    Ok, its not an interesting question, Im just hoping for first post.

    --

    ------

    "Will the highways on the Internet become more few?" --George W. Bush, in Jan. 2000

    1. Re:Which brings up an interesting question... by gregwbrooks · · Score: 3, Interesting
      Not a thing, in terms of the number of their servers, or internal data such as line-item hardware purchases.

      This is how it should be, since knowing the size of Google's hardware capacity is a very, very strategic bit of information, and the kind of thing that would allow Yahoo/MSN/whoever to get a feel for how much capital would be necessary to duplicate or improve upon it.

      --


      "It was a summer's tale: Just a boy, his Linux, and a head full of dreams..."
    2. Re:Which brings up an interesting question... by treat · · Score: 1
      This is how it should be, since knowing the size of Google's hardware capacity is a very, very strategic bit of information, and the kind of thing that would allow Yahoo/MSN/whoever to get a feel for how much capital would be necessary to duplicate or improve upon it.

      What you say would make sense, except that there must be thousands of people who know the size of Google's hardware capacity, or who have access to the information. You can't convince me that someone with the resources of Microsoft could not easily determine how many machines Google has.

    3. Re:Which brings up an interesting question... by fermion · · Score: 1
      The hardware cost is not the issue. If it were MSN would be the leader already. MS, the company that will likely lose a couple billion on XBox by the second aniversary of the console, would not likely consider even a quarter of a billion dollars, or even a half, a signifcant barrier to entry.

      The issue, as in XBox, is not being the first or even second product to market, and in the long run, stagnating products. Xbox is a well designed machine, but Sony has the games. Windows and the Office suites are very competative, but the later is stagnating and the development of the former has devolved into a series of minimially neccesary me-too features. The significant evolution of the OS, something that other companies started on in the mid-90's and have available now, might be available and stable in two or three years.

      --
      "She's a scientist and a lesbian. She's not going to let it slide." Orphan Black
    4. Re:Which brings up an interesting question... by nelsonal · · Score: 1

      It varies from company to company, but most tech companies are pretty open about things like revenue sources (by industry or product or whatever). As an example, most insutry folk guessed that google had revenue of about $1 billion, but now everyone knows that they have about 600m in self advertising, $200 million in network advertising (those google style text ads on other pages), and $50 million in revenue from those corporate network searching Google boxes. They also know that margins have been declining over the past few years, but that revenue is growing at about a 100% annual clip. Once they are public they will have to do some selling of their performance to owners and analysts, which are generally availible on a public company's web page. In those they are usually questioned about successes and failures over the past three months, and even the most secretive company has to answer a few, or everyone figures they are making results up and the stock tanks.

      --
      Degaussing scares the bad magnetism out of the monitor and fills it with good karma.
  10. Not unexpected... by avalys · · Score: 4, Insightful

    I don't think this is that strange: after all, that 10,000 machines figure is several years old. It's only logical that Google has expanded their facilities since then.

    --
    This space intentionally left blank.
  11. At $699 per CPU by earthforce_1 · · Score: 5, Funny

    SCO now knows how big an invoice to send Google! :-D

    --
    My rights don't need management.
    1. Re:At $699 per CPU by dicepackage · · Score: 1

      If you charge by the CPU then it is $ 1398 US dollars per computer.

    2. Re:At $699 per CPU by kunudo · · Score: 1, Insightful

      Doesn't google run it's own proprietary OS? Thought I read that somewhere...

    3. Re:At $699 per CPU by simcop2387 · · Score: 0

      and at 88 per rack, and 899 racks

      1398*88*899=$110,598,576

      i think sco might give them a discount of 600,000 and just ask for the $110,000,000 that they feel they deserve because they do own linux :)

    4. Re:At $699 per CPU by irokitt · · Score: 1
      --
      If my answers frighten you, stop asking scary questions.
    5. Re:At $699 per CPU by RavidgeMole · · Score: 1

      I know that they created their own custom clustering system, was this what you were thinking of?

      --
      "It is better to keep your mouth closed and have people think you a fool than to open it and prove them right." M. Twain
    6. Re:At $699 per CPU by ComaVN · · Score: 4, Funny

      Yeah, but they stole it from SCO.

      --
      Be wary of any facts that confirm your opinion.
    7. Re:At $699 per CPU by Xshare · · Score: 1

      Lindon, Utah-based SCO claims that parts of Linux were directly copied from Unix, which SCO owns. Huh?

    8. Re:At $699 per CPU by no+longer+myself · · Score: 1
      I was just wondering... What was Google's response to SCO's approach? I'd like to think is was FOAD.

      Other than that, I find the whole Google speculation game to be a bit curious. It almost reminds me of my elementary school days on the playground when we had contests to see who could top the last guy's claim while still maintaining some credibility.

      In the coming weeks, I wouldn't be surprised to see reports that Google has more servers than all other servers combined, and that their bandwidth is twice that of the entire federal government.

    9. Re:At $699 per CPU by damiam · · Score: 2, Informative

      No. They run Linux, with their own proprietary software over it.

      --
      It's hard to be religious when certain people are never incinerated by bolts of lightning.
  12. What a waste by Anonymous Coward · · Score: 3, Funny
    I'm sure a single IBM mainframe could do the same amount of work in half the amount of time and cost a fraction of what that Linux cluster cost.

    I hang around too many old-timer mainframe geeks. MVS forever!!! and such.

    1. Re:What a waste by phoxix · · Score: 5, Interesting

      If you've ever read a white paper of Google's, you'd realize that they even tell people why they deal with massive clusters over mainframes: lower latency.

      Sunny Dubey

    2. Re:What a waste by Anonymous Coward · · Score: 0

      You wouldn't happen to be Denis Nicole by any chance? ;)

      http://www.ecs.soton.ac.uk/~dan/

    3. Re:What a waste by Waffle+Iron · · Score: 5, Informative
      I'm sure a single IBM mainframe could do the same amount of work in half the amount of time and cost a fraction of what that Linux cluster cost.

      Mainframes are optimized for batch processing. Interactive queries do not take full advantage of their vaunted I/O capacity.

      Moreover, while a mainframe may be a good way to host a single copy of a database that must remain internally consistent, that's not the problem Google is solving. It's trivial for them to run their search service off of thousands of replicated copies of the Internet index. Even the largest mainframe's storage I/O would be orders of magnitude smaller than the massively parallel I/O operations done by these thousands of PCs. Google has no reason to funnel all of the independent search queries into a single machine, so they shouln't buy a system architecture designed to do that.

    4. Re:What a waste by Anonymous Coward · · Score: 0

      " I'm sure a single IBM mainframe could do the same amount of work"

      why would you even think that? it's just so wrong

    5. Re:What a waste by Anonymous Coward · · Score: 0

      YHBT YHL HAND

    6. Re:What a waste by bwy · · Score: 1

      Mainframes are optimized for batch processing. Interactive queries do not take full advantage of their vaunted I/O capacity.

      Historically true and IMHO probably still true today, but I do have customers running our Java based server product on the host under Z/OS and the administrators would be sure to argue with us.

      One admin was walking me through a config where he could have all these various instances of Red Hat running on their host. It would be interesting to see some real life case studies on a setup like this.

    7. Re:What a waste by RajivSLK · · Score: 1

      I'm sure a single IBM mainframe could do the same amount of work in half the amount of time and cost a fraction of what that Linux cluster cost.

      Yes, I'm sure your right and Google is wrong. You are so much smarter than everyone.

    8. Re:What a waste by RageEX · · Score: 1

      Interesting post. I'm surprised they maintain multiple copies of their database, it's gotta be huge. Someone should sell them a large SAN/NAS machine and a copy of CXFS ;)

  13. More interested... by WiKKeSH · · Score: 1

    I'd be more interested in the specs of those machines rather than the number of machines. Wonder if that will ever be revealed. :)

    1. Re:More interested... by WiKKeSH · · Score: 1, Insightful

      That's all speculation.

    2. Re:More interested... by Solosoft · · Score: 1

      I remember watching some video of the google guy discussing how google works. The Machines are middle end PIII's. (I believe) but I remember hearing him keep mentioning that as the way to go.

    3. Re:More interested... by mikis · · Score: 1

      How 'bout RTFM?

      "So how much processing power is that? Well, once again, the Google cluster document provides some interesting tidbits. Per the document, the racks that were used were

      88 dual-CPU 2Ghz Intel Xeon servers with 2 Gbytes of RAM and an 80-Gbyte hard disk."

    4. Re:More interested... by Arctic+Fox · · Score: 1

      Dude.
      There's no such thing as "middle end". It's either in the middle, or it's at the end!
      Other than that, you're right. Given enough of them, it doesn't matter how good they are.

  14. Obligatory comment by SILIZIUMM · · Score: 0, Offtopic

    Of course, someone *will* say the obligatory "imagine a beowulf..." comment in this thread.

  15. And this isn't even all... by scifience · · Score: 1

    Think of how many racks filled with nothing but data storage they must have. This only counts the actual dual-processor machines, so there is bound to be tons of computer stuff that isn't even counted.

    1. Re:And this isn't even all... by DetrimentalFiend · · Score: 1

      I believe that google spreads the data and the processing across the entire cluster. So each machine is storing a little, processing some searches, and possibly even crawling.

  16. Assumptions? by waytoomuchcoffee · · Score: 4, Interesting

    According to calculations by the IEE, in a paper about the Google cluster, a rack with 88 dual-CPU machines used to cost about $278,000

    Um, don't you think if you were buying 899 racks you might actually, you know, negotiate for a better price?

    This isn't the only assumption in your analysis, and the problems with them will be compounded. What's the point of this, really?

    1. Re:Assumptions? by digitac · · Score: 5, Funny

      That's right, they probably got in on the "Buy 899 Get 1 Free" Sale. So in reality they have a nice even 900 racks. Makes much more sense that way.

    2. Re:Assumptions? by 2MuchC0ffeeMan · · Score: 4, Insightful

      i thought of this too, but then i thought that they probably bought them 5/10/20 at a time as they grew.

      --
      Runnin' On Empty .... I'm Still Alive
    3. Re:Assumptions? by heliocentric · · Score: 1

      But I see no information about how quickly all of their machines can calculate an infinite loop. Perhaps we can work backwards then with this information (when known) to dervie the number of systems.

      --
      Wheeeee
    4. Re:Assumptions? by Loosewire · · Score: 1

      still if you guarantee to buy 5/10/20 systems per month youll still get a funkingly big discount

      --
      Slashdot - The one stop shop for procrastination
    5. Re:Assumptions? by bagsc · · Score: 1

      The combination of depreciation, dropping prices, and financing costs probably means they install new systems when they need it. No one in their right mind buys 9000 computers if they only need 4500.

      --
      http://www.accountkiller.com/removal-requested
    6. Re:Assumptions? by bkocik · · Score: 1
      Assumptions? (Score:5, Interesting) by waytoomuchcoffee (263275)

      Re:Assumptions? (Score:5, Insightful) by 2MuchC0ffeeMan (201987)

      Are you guys brothers?

    7. Re:Assumptions? by tekunokurato · · Score: 1

      Interesting. Too much coffee man is a comic, though, while the other one could just be anything.

    8. Re:Assumptions? by ari_j · · Score: 1

      ...and then you thought that they probably have to replace more or less complete machines frequently enough to put at least a few hundred together at a time. ...and then you thought that they probably have long-term purchase agreements or contracts, so they don't have to buy all X thousand machines at a time but still get the "buy X thousand, get Y gree" deal.

    9. Re:Assumptions? by TNLNYC · · Score: 1

      It's true and that's why I went with the low-medium-high approach. Even if they get a discount, you can still go with the high estimate and assume that other costs (desktops, routers, firewalls, etc...) is being discounted. The point of the analysis was to get a new watermark for where google stands machine-wise. I think the general agreement is more than 10k. The question is how much more. Care to offer a better approach at guessing this info?

      Disclaimer: I'm the author of the original analysis

      --
      Check out http://www.tnl.net/blog
  17. Cheap hardware by Anonymous Coward · · Score: 1, Interesting

    I was always under the impression that Google used a lot of "cheap" hardware. Meaning, they only used IDE and non-rackmount machines.

    So, they probably don't used "racks" but if they were, that means they could only get about 12-15 desktop machines (single proc) per rack. That's a whole lot less than 42 - 1U rackmounts to fill the rack.

    1. Re:Cheap hardware by Anonymous Coward · · Score: 0

      You are right, actually. Google is nothing more than 20-30 really old desktop computers running Redhat 5.2. And the Google impementation is just Perl (Perl 4 no less), Apache and Mysql.

    2. Re:Cheap hardware by flxkid · · Score: 2, Funny

      Right, 12-15 per rack...they're smart enough to develop an amazing search engine, but not to understand proccessing power density issues...

      --
      Better VDF than VD...check it out: Data Access
    3. Re:Cheap hardware by Ancient+Devices+King · · Score: 1

      "Cheap" in this kind of situation usually means non mainframe. In any case, rack machines don't cost that much more than a regular desktop PC. A 1U rack case+PSU costs about $200-300. That's only about 3x more than a regular mid range server tower (and about the same as a monster double wide server case), and that's what I would pay for one -- volume discounts are going to cut that down. Now consider how much it's going to cost them to have enough space to have desktops instead of racks. The "cheap" solution is going to be the rack one, not the desktop one. It's the same reason everybody uses racks for servers.

      As for disk, who cares? Google caches everything in ram anyway.

      --
      -"It seems like you're trying to exploit a security hole. Would you like help?"
    4. Re:Cheap hardware by Derkec · · Score: 1

      If I understand things properly, this is somewhat close to what google started like. Except that they thought that cases for their handful of desktop machines were too expensive. They just took out the motherboard and such. Without the cases, you can fit a lot into some tight space.

  18. Maybe just me... by hot_Karls_bad_cavern · · Score: 4, Insightful

    Might just be me, but damn, don't you think this has raised the interested of our three letter entities? i mean, damn that is just some serious computing and indexing power on cheap, "disposable" hardware...with a filesystem that can keep track of that many machines? If i headed one of such entities, i'd sure want to know more about it!

    1. Re:Maybe just me... by Anonymous Coward · · Score: 0

      Filesystem?

      Are you kidding?

      It's called a "database".

      Sheesh. I think the 3 letter agencies you mention have all heard of a "database" before.

    2. Re:Maybe just me... by hot_Karls_bad_cavern · · Score: 1

      Database?

      Are you kidding?

      The new WinFS is a database there, sparky. So yes, "filesystem" is what i'll call this implementation by google...get over yourself.

    3. Re:Maybe just me... by Anonymous Coward · · Score: 0

      What Col. Sanders from KFC want to know about Google?

  19. Come on! Does it really matter? by diegomontoya · · Score: 2, Insightful

    My guess is just as your guess which would be:

    your guess + 1 = my guess.

    We already know they have enough servers to saturate a T1000 line so might as well stop here and talk about something more constructive.

    1. Re:Come on! Does it really matter? by Anonymous Coward · · Score: 0

      Don't you mean:

      my guess += your guess

    2. Re:Come on! Does it really matter? by DA-MAN · · Score: 1

      We already know they have enough servers to saturate a T1000 line so might as well stop here and talk about something more constructive.

      How in the hell does a present day search engine saturate a fictional liquid metal robot from the future???

      --
      Can I get an eye poke?
      Dog House Forum
    3. Re:Come on! Does it really matter? by MrHanky · · Score: 4, Funny
      How in the hell does a present day search engine saturate a fictional liquid metal robot from the future???

      Well, that depends on what sort of time portal they use. Now, a T1000 would probably be saturated by a time portal following the Terminator rules: one way only. But Google seems to favour Back to the Future rules, as shown by number of hits:

      13,500,000 for back to the future

      3,460,000 for terminator

      This would make saturating a T1000 a lot easier, since you could saturate it while travelling back in time yourself, or maybe even while standing still in time. This would make Google's bandwidth infinite, as a measly T1000 would stand still. Unless it was using its own time portal to travel back in time to destroy Google, but that would create a paradox, since, as we all know, Google will become Skynet, which will create the T1000 in the first place.

      What I'm trying to say is: I don't know, but I'm sure Google could do it.

  20. wait by Docrates · · Score: 4, Insightful

    Remember there's a little thing called "volume discount"...

    It's gotta be more than that.

    --

    There are two kinds of people in the world: Those with good memory.
  21. Did you factor in screws and cable? by Anonymous Coward · · Score: 0

    That can add 40% to the cost of a cluster.

    1. Re: Did you factor in screws and cable? by Patrik_AKA_RedX · · Score: 1

      How do you mean scre

    2. Re: Did you factor in screws and cable? by Darthmalt · · Score: 2, Funny

      NO they just use legos duh. Though I personally prfer duct tape

  22. All that power by Chucklz · · Score: 5, Funny

    With all those TFlops, no wonder Google converts units so quickly.

  23. "my god .... by Neuropol · · Score: 1

    ... it's full of stars"

    1. Re:"my god .... by Anonymous Coward · · Score: 0

      Wow.

      That's the gayest thing I've read on Slashdot all day.

      Congratulations.

  24. Really? by irikar · · Score: 5, Funny

    You mean the PigeonRank(tm) technology is a hoax?

    1. Re:Really? by Reteo+Varala · · Score: 1

      Either that, or the PETA found out, and decided to pay the company a midnight visit. I guess they didn't want the pigeons to develop beak tunnel syndrome.

    2. Re:Really? by GregChant · · Score: 1

      Well, according to google, they treat their pigeons fairly well:

      Isn't it cruel to keep pigeons penned up in tiny data coops?

      Google exceeds all international standards for the ethical treatment of its pigeon personnel. Not only are they given free range of the coop and its window ledges, special break rooms have been set up for their convenience. These rooms are stocked with an assortment of delectable seeds and grains and feature the finest in European statuary for roosting.

  25. How detailed is the SEC filing? by jgaynor · · Score: 1

    Does the SEC filing mention whether theses are datacenter boxes or employee workstations? Beyond that, do they take into account storage array costs, telco equipment, etc? Not that I don't love speculating about the awe that is the google datacenter, but an SEC filing seems like a fairly inaccurate, roundabout way to make an assumption about cluster nodes . . .

  26. The things you could do with that... by drsmack1 · · Score: 0

    With that much computer power at their disposal they could do some cool things - maybe some sort of distributive computing thingie or big database of some kind. Of course they would have to network all those systems togther - that could be expensive. Imagine how much a 80,000 seat license for lantastic would cost!

    1. Re:The things you could do with that... by polyp2000 · · Score: 2, Funny

      With that much computer power at their disposal they could do some cool things - maybe some sort of distributive computing thingie or big database of some kind.

      What about building a really big search engine?
      To build a really big search engine, your going to need some serious distributed computing, and a big database! Hey wait a minute thats what they are doing!

      --
      Electronic Music Made Using Linux http://soundcloud.com/polyp
  27. This is actually useful by 2MuchC0ffeeMan · · Score: 2, Interesting

    because with ~80,000 machines, they can easily put a few hard drives in each, and give everyone 1gb of gmail space... I didn't think it was possible.

    where do you go to buy 80,000 hard drives?

    --
    Runnin' On Empty .... I'm Still Alive
    1. Re:This is actually useful by Anonymous Coward · · Score: 0

      The internet. Obviously. Check what Froogle says.

    2. Re:This is actually useful by Simon+(S2) · · Score: 1

      where do you go to buy 80,000 hard drives?

      at walmart.

      --
      I just don't trust anything that bleeds for five days and doesn't die.
    3. Re:This is actually useful by Patrik_AKA_RedX · · Score: 1

      Nah! Ebay! That nigerian guy, the son of some general, sells those very cheaply. All he asks you is to pay upfront for shipping.

    4. Re:This is actually useful by Anonymous+Brave+Guy · · Score: 4, Insightful
      where do you go to buy 80,000 hard drives?

      You don't; their Sales Director comes to you...

      --
      If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
    5. Re:This is actually useful by drinkypoo · · Score: 4, Funny

      You almost have it right... first, a couple of hookers come to you. Then, a few hours later, they are followed by the sales director.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    6. Re:This is actually useful by merlin_jim · · Score: 1

      You almost have it right... first, a couple of hookers come to you. Then, a few hours later, they are followed by the sales director.

      I don't know who you buy your drives from, but as I haven't been getting the BJ + on-site-service pitch, we may have just found the first argument why NOT to buy drives from IBM Enterprise...

      --
      I am disrespectful to dirt! Can you see that I am serious?!
  28. 88 machines per rack? hardly. by cyclop5 · · Score: 3, Interesting

    In your standard 42U cabinet, you're talking a half-U per server. Umm.. not happening. Let's just say I happen to know they use 2U servers, for a total of 21 per cabinet. Custom jobs - just the "floor pan" (i.e. no sides, or top for the case), system board, power supply, and I think a single (or possibly dual) hard drive (I didn't want to be too nosy staring into someone else's colo space). Oh, and network. And rumor has it, they're putting in close to 200 cabinets in just this location alone.

    1. Re:88 machines per rack? hardly. by Anonymous Coward · · Score: 0

      I think they're using computers where the length of them allow them to put one in the front of the rack, and another in the back. (hence getting 2u of space from a 1u vertical area)

    2. Re:88 machines per rack? hardly. by natet · · Score: 1

      I was just about to say something similar. It is possible to purchase racks that are 44U, so I was thinking it was far more likely that the number quoted was 88 CPU's, not 88 dual CPU boxes. In addition, You would have to fit some network equipment in that figure somewhere. I wonder if this guy knows what he is talking about...

      --
      IANAL... But I play one on /.
    3. Re:88 machines per rack? hardly. by Anonymous Coward · · Score: 2, Informative

      In your standard 42U cabinet, you're talking a half-U per server. Umm.. not happening.

      IBM has a blade center that can hold 84 2-way blades in a 42U cabinet.

    4. Re:88 machines per rack? hardly. by Anonymous Coward · · Score: 0

      there using custom chassis. theres nothing standard about what google does with there colo setups at all

    5. Re:88 machines per rack? hardly. by PenguinOpus · · Score: 4, Informative

      Racksaver was selling dual-machine 1U racks for several years and I owned a few of them. Think deep, not tall. Racksaver seems to have renamed itself Verari and only has dual-Opteron in a 1U now. Most dense configs seem to be blade-based these days. Verari advertises 132 processors in a single rack, but I suspect they are not king in this area.

      If Google is innovating in this area, it could either be on price or in density.

    6. Re:88 machines per rack? hardly. by Jeff+DeMaagd · · Score: 1

      I think someone confused CPU for computer.

      I also thought Google just used white-box type computers.

    7. Re:88 machines per rack? hardly. by njcoder · · Score: 1

      Have a look at www.rackabelsystems.com. They use a lot of these. They can fit two servers in a 1u space, back to back.

    8. Re:88 machines per rack? hardly. by cyclop5 · · Score: 3, Interesting

      From the cabinets I saw, it was definitely 2U vertical space. It was one of those things that surprised me a little - I would have assumed they'd use blade servers, or at least 1U boxes just to get the rack density. So when I had the opportunity to "sneak a peek", I tried to notice as much as I could, without poking and prodding. Unfortunately, there wasn't much to notice, other than what I mentioned previously. That, and they were all pre-installed in the cabinets before shipping out to the colo. (There were 30 or 40 cabinets in the shipping/receiving area of the colo).

    9. Re:88 machines per rack? hardly. by Random+Web+Developer · · Score: 1

      Also have a look at this Apparently google is using 1/2 (depth) unit's

      --
      Artists against online scams http://www.aa419.org/
    10. Re:88 machines per rack? hardly. by sforman · · Score: 1

      rackable sell 1U high, half depth servers so 88 in a rack is easily do-able.

    11. Re:88 machines per rack? hardly. by Anonymous Coward · · Score: 0

      There are actually 80 servers per rack.

      The racks are 23" racks (NOT 19" racks) which is what most people miss.

      For each 2U there are two motherboards side-by-side - the motherboard/CPU/etc take up about 1U and there's 1U of space above (for air-flow - there are no fans).

      So on one side of the rack are 40 systems - 44U - 4U (switches) = 40U / 2U per shelf x 2 CPUs per shelf.

      The racks are double-sided, so duplicate that number for the back and you have 80 systems in a rack.

      As for 100,000 servers... only if they're in a single facility. I've been in a datacenter with over 900 rack which make some 72,000 servers. I think 100,000 is a conservative estimate.

    12. Re:88 machines per rack? hardly. by geniusj · · Score: 1

      Google does not use 2U servers (at least for most of their servers.) They use 1U half depth servers. They are nothing more than a disk and a power supply sitting on top of a motherboard with no case. I have seen them many times.

      I'm sure they do have 2U servers in some places for things like databases perhaps? However, their standard deployment is the 1U half depth .. box.. if you can even call it that :)

      Regards,
      -JD-

    13. Re:88 machines per rack? hardly. by TNLNYC · · Score: 1

      Looks like Rackable is selling a system that does 88 machines per rack: the C1000 series. They list Google as a customer so I'm assuming that that may be it...

      Disclaimer: I'm the author of the original analysis.

      --
      Check out http://www.tnl.net/blog
  29. googling files.... by Anonymous Coward · · Score: 0

    How abt a file search on google? Does it will bring dooms day for software makers?

  30. Power by ManFromAnotherPlace · · Score: 3, Funny

    This many computers must use quite a bit of power and they probbably also need some serious airconditioning. I sure wouldn't want to receive their electricity bill by mistake. :)

    1. Re:Power by RogerWilco · · Score: 1

      Let's assume 200 Watts per machine for power and cooling, that makes 16 MW, at a price of 40/MW gets you a bill of around 6 million a year.
      compared to the prices of 80.000 machines it's peanuts.
      The electricity bill of something like a big aluminium smelter would be realy nasty, they might need several 100 MW

      --
      RogerWilco the Adventurous Janitor
  31. Google hosting by titaniam · · Score: 4, Interesting

    I wonder if google will start up a web-hosting business? I bet you can't beat their uptime guarantees. They could provide sql, cgi, etc, and build in multi-machine redundancy for your data just like they do for theirs. It'll be the google server platform, just one more step to replacing Microsoft as the evil monopoly.

    1. Re:Google hosting by Rui+Lopes · · Score: 1

      just one more step to replacing Microsoft as the evil monopoly.

      And then /.'ers will start saying "Google, that evil monopoly" and will hate it.

      --
      var sig = function() { sig(); }
    2. Re:Google hosting by cyberformer · · Score: 4, Insightful

      If they did, there's a real chance that there could be no more Internet for a lot of applications: people would just upload their Web pages to Google, users would log on to Google to search, and most email will go through Gmail.

      This is a good thing for Google, but not for the world as a whole.

    3. Re:Google hosting by Angostura · · Score: 4, Interesting

      Actually, I would be more worried if I was Akamai. If Google went after the corporate market and offered some kind of grid-esque caching-and-execution environment, that would be something to look at. However it would need some rather nifty scheduling an admin tools, and would add a lot complexity, so I don't think that's too likely.

    4. Re:Google hosting by wfberg · · Score: 1

      Google itself uses akamai for DNS.

      --
      SCO employee? Check out the bounty
    5. Re:Google hosting by Anonymous Coward · · Score: 0

      And then /.'ers will start saying "Google, that evil monopoly" and will hate it.

      with 60% of them still using it as their search engine..

    6. Re:Google hosting by himalayantraveller · · Score: 1

      Replace Microsoft with a new monopoly? Which gets evil over time?

    7. Re:Google hosting by green+pizza · · Score: 1

      If they did, there's a real chance that there could be no more Internet for a lot of applications: people would just upload their Web pages to Google, users would log on to Google to search, and most email will go through Gmail.

      If I invest in Google, I sure hope they will consider web hosting, especially "professional" hosting, not just yahoo-style personal pages. What's good for Google is good for Google's investors.

    8. Re:Google hosting by Ian+Bicking · · Score: 3, Interesting

      There's an interesting article comparing Google and Akamai which talks about that as well, since they have technical similarities, but are strategically very different -- Akamai does massive web hosting, while Google does massive web applications.

    9. Re:Google hosting by darkmeridian · · Score: 1

      Why would this be bad? It would raise the bar for it's competitors. Yes, Google would have a huge lead, but it's not like MS where it could actively leverage that lead to exclude others. Another guy could come along a create a better Google and then people will flock to him for searching and Google for the other stuff.

      --
      A NYC lawyer blogs. http://www.chuangblog.com/
    10. Re:Google hosting by Basje · · Score: 1

      Exept for the fact that akamai is geographically distributed, and google is not.

      Google has a strong brand, but would require massive investments to pull this off. That does not guarantee that they won't, but if akamai pays attention they should be able to maintain their advantage at this point.

      --
      the pun is mightier than the sword
    11. Re:Google hosting by k3y · · Score: 1

      In Soviet Russia the Internet googles YOU!

  32. Google employee? by Anonymous Coward · · Score: 0

    Surely there must be some Google tech out there reading this.. Just post AC and tell us.

  33. Someone call the FBI by Durandal64 · · Score: 4, Funny

    The number of machines Google uses is considered a trade secret. By attempting to determine how many machines they have, you're in violation of the DMCA. I'm calling the FBI.

    1. Re:Someone call the FBI by whovian · · Score: 0

      Trade secret, yes, but in violation of DMCA?! Copyright != trade secret. Besides, at that rate you might as well make accusations against investment houses at large -- they do their own calculations and speculations for their purposes. Why is it wrong to discuss it in public using public records?

      --
      To-do List: Receive telemarketing call during a tornado warning. Check.
    2. Re:Someone call the FBI by Durandal64 · · Score: 1

      Oh Christ, it was a joke, man.

  34. Hmmm....Google by Anonymous Coward · · Score: 0

    ows SCO over $90million USD assuming that all of those machines run Linux, which they probably do.

  35. uh.... by TheCabal · · Score: 1

    100% of the money isn't devoted towards servers and rackspace. There's operational costs, network infrastructure, bandwidth, software costs, maintenance contracts, spare parts... it ain't all just hardware.

  36. I have seen the light by Anonymous Coward · · Score: 3, Informative

    working at abovenet google has pulled there machines in and out of our data centers many a times. its incredible the way they have there shit is setup.

    they fit about 100 or so 1u's on each side of the rack, there double sided cabinets that look like refrigerators. there seperated in the center by noname brand switches and they have castor wheels on the bottoms of them. google can at the drop of a dime roll there machines out of a datacenter onto there 16 wheeler, move, unload and plug into a new data center in less than a days time.

    1. Re:I have seen the light by Myrrh · · Score: 1

      Plus, for the TRULY anal English majors in the audience, starting a sentence with "Working at abovenet" is a dangling participle. Just who are you talking about, exactly? Do YOU work at Abovenet? Do the Google employees work at Abovenet? Do the Google machines work at Abovenet?

    2. Re:I have seen the light by Vrallis · · Score: 1

      I'll leave the grammar/spelling issue alone.

      100U each side of the rack? These would be racks over FOURTEEN FEET tall. 1U = 1.75"

      I've never heard of any racks over 50-55U tall.

  37. Makes Perfect Since by peterdaly · · Score: 2, Insightful

    Since the 10k server number was first floated, I believe google has added quite a few, meaning 6 to 10 whole new datacenters around the world.

    It would only make sense that the server count would now be in the ballpart of what is mentioned here.

    Google hasn't been standing still, and I've heard the "Google has 10k servers" for 1-2 years now.

    -Pete

  38. Auxiliary costs? by zCyl · · Score: 1

    Does it include additional costs like maintenance costs, administrator salary, electricity, etc.

    At an estimated 300W per machine and 7 cents a KWH, that's $14M just for the electricity to run those machines, not including air conditioning for that many machines. So is the figure quoted JUST for the computing hardware?

    1. Re:Auxiliary costs? by Anonymous Coward · · Score: 0

      In the SEC filing, the $250M is the cost of their hardware. Also, they host all of their servers in data centers operated by a third party.

    2. Re:Auxiliary costs? by DrMaurer · · Score: 1

      I can tell you that they don't pay 7 cents a KWH. Their price is probably about 3 or 4 cents, considering the amount of electricity they buy.

      Not that that's a figure to sneeze at, by your calculations that's still $7M, but companies don't pay residential rates.

      Of course, maybe that's the price for companies out wherever their based. Here, Rockford, IL., or thereabouts, it's 3 or 4 cents per KWH for a substantial user of power.

      --
      Dan
    3. Re:Auxiliary costs? by Garak · · Score: 1

      300W is way high. If they all had 3 HD's, CDROM, video card and fully loaded down then it may be close to 300W. But these machines are just a mb and a hd. I would guess around 100W a machine at peak.

      --
      God, root, what is the difference?
  39. I know, I know! by Anonymous Coward · · Score: 1, Funny

    >where do you go to buy 80,000 hard drives?

    You call IBM.

    I heard they have a large amount of 75GXP drives ready to rock'n'roll[0]!

    (fineprint)[0] Raid-1 capable interface not included.

  40. you're forgetting one thing. by cnb · · Score: 1

    a cluster costs more then just machines

  41. Ask by RoadkillBunny · · Score: 1

    Couldn't we just ask google?

    --
    Cheers,
    RoadkillBunny
    1. Re:Ask by wed128 · · Score: 1

      man...if i had a system with that much power under the hood, i'd call it from the rooftops...it's simply amazing, and i wonder why it's such a big secret that we have to speculate...

    2. Re:Ask by FuzzyBad-Mofo · · Score: 2, Funny

      Or they could ask Jeeves:

      Say, Jeeves old boy: how many servers does Google have?

      Jeeves: Piss off!

  42. 15 Megawatts by SuperBanana · · Score: 4, Interesting

    ...assuming 200W per server, which is probably low, but probably compensates for 79,000 being most likely an overestimate. However, that doesn't even begin to account for the energy used to keep the stuff cool.

    Anyone know how many trees per second that would be? Conversion to clubbed-baby-seals-per-sec optional.

    1. Re:15 Megawatts by gspr · · Score: 5, Funny

      According to Google herself dried wood contains 15.5 MJ of energy per kg. It seems that Google consumes about 1 kg of wood per second (if they've found a way to utilize 100% of the energy, which they of course have - they're Google, after all), and that the pigeons are just there to use their wings to dry the wood!
      We're on to you, Google!

    2. Re:15 Megawatts by Anonymous Coward · · Score: 0

      why would they use trees?

      that seems kind of a stupid method to get power.

    3. Re:15 Megawatts by Sponge+Bath · · Score: 3, Funny

      The servers could be powered by 15 Megahamsters on treadmills (@ 1 watt/hamster). But that would require sufficient management to motivate the hamsters with the threat of off-shoring their jobs.

    4. Re:15 Megawatts by glenstar · · Score: 4, Funny
      According to Google herself...

      Hm... Google seems decidedly male to me.

      1) Answers questions rapidly without offering any description of how the answer was derived? Check.

      2) Works in short, fast bursts of energy and then tells you proudly it only took them .009 seconds? Check

      3) Has an inability to accessorize his appearance? Check.

      4) Returns 82,200,000 results when asked about porn? Check and match!

  43. Heat by gspr · · Score: 5, Informative

    A Pentium 4 dissipates around 85 W of heat. I don't know what the Xeon does, but let's be kind and say 50 W (wild guess). Using the article's "low end" estimate, that brings us to 4.7 MW!
    I hope they have good ventilation...

    1. Re:Heat by Neil+Blender · · Score: 5, Funny

      Google engineer reading your post: OH SHIT!

      <sound of door slamming>

      <sound of car engine starting>

      <sound of tires squeeling away>

    2. Re:Heat by gammelby · · Score: 5, Informative
      I once attended a talk by google fellow Urs Hölzle on the google architecture, and he mentioned how they handle the cooling issue: They do not depend on each individual unit to be cooled separately - instead they have an enormous flow of air between the racks (sitting back to back), generated by some large fan in the roof.

      Ulrik

    3. Re:Heat by bob_jordan · · Score: 4, Funny

      More like a google engineer taking wet squelching footsteps to the door crying out,

      "I'm meeeelllllttttiinnnnnggggg"

      Bob.

    4. Re:Heat by Dawn+Keyhotie · · Score: 2, Funny
      Well, by my calculations, taking into account the effects of Einstein's theory of general relativity, once the data center is accelerated to 88mph, it will consume exactly 1.21GW of electricity!

      Now, where's my Delorean?

      Cheers!

      --
      "The only good windmill is a tilted windmill."
    5. Re:Heat by Anonymous Coward · · Score: 0

      Well..
      John Carmack has a 1000HP Ferrary, thats about 0.77MW

  44. SCO by WindBourne · · Score: 2, Insightful

    Since it is known that Google has the largest installed base of Linux and now they are about to go IPO in the billions, I wonder why SCO has not gone after them? Apparently, it is not use of Linux that makes SCO persue a company.

    The interesting thing is, that if SCO really has MS backing and MS is pulling strings, then I would think that MS would want SCO to persue google to tie them up for awhile.

    --
    I prefer the "u" in honour as it seems to be missing these days.
    1. Re:SCO by Rick+Richardson · · Score: 1

      SCO will wait until as close as possible to Google's IPO date to file the suit.

    2. Re:SCO by WindBourne · · Score: 1

      I am not so sure of that. I suspect that SCO knows that they have no real case against Linux/GPL. So far, everybody that they have filed suits against, or have sent letters to, are only companies that they have delt with in the past.

      In addition, I do not think that MS and Sun are really pulling strings, just funding a loose gun. Sometimes that is enough. This way both keep their fingers mostly clean. Even now, Sun is doing all that they can to disavow SCO while at the same time, funding them.

      --
      I prefer the "u" in honour as it seems to be missing these days.
  45. hardcore by mooosenix · · Score: 5, Funny
    After many scientific and time consuming experiments, we have found the number of servers to be.........

    42.

    1. Re:hardcore by Seek_1 · · Score: 1

      Someone mod this up, I just spit coffee all over my keyboard because of the parent. :)

    2. Re:hardcore by njcoder · · Score: 2, Interesting
      "42"

      Actually, that's pretty close to the number of copies of Red Hat Google actually paid for in 200.

      The price was right; Google doesn't pay any significant amount of money to Red Hat. Google downloads the software for free and gets support in-house and from the Linux community. Google actually paid for only about 50 copies of Red Hat, and those purchases were more of a goodwill gesture. "I feel like I should be nice, so when I go to Fry's I pick up a copy," Brin said.
      From here
    3. Re:hardcore by Ether3k · · Score: 1

      Wow, the Yeare of our Lorde, 200 AD, huh?

      I didn't know Google was that old!

      --
      END
  46. What? by Jozer99 · · Score: 1

    How are they fitting 88 dual cpu computers in a rack? Last I checked, racks were between 42U and 46U. That is 42-46 1U computers. I have seen 1/2 U computers, but never dual cpu 1/2U computers. I can see 88 processors per rack, but not 172 processors per rack. Plus, when you have such a high density of computers, you need lots of power and cooling. Most racks have UPSes and cooling fans that take up rack space.

    1. Re:What? by lcracker · · Score: 1

      Ever seen blades?

    2. Re:What? by Anonymous Coward · · Score: 0

      Use both sides of the rack.

    3. Re:What? by mikis · · Score: 1

      Their servers are 1U but half depth, so they put them from the both sides of the rack. See here

    4. Re:What? by Jozer99 · · Score: 1

      Yes, but since the article says that this spec is several years old, I assumed that was before blades were available/practical.

  47. Hm? Did we forget everything we've been told? by MrPerfekt · · Score: 1

    In every other Google article I've read, they say that they use old slow PCs in mini to mid tower cases. You can fit about... 12-16 machines in a rack if they're not rackmount cases. Not 88.

    In any case, all this calculation is a bit over speculatory and pointless.

    --
    I just wasted your mod points! HA!
  48. Too late for that! by Anonymous Coward · · Score: 0

    I was going to start my own Google clone, and after I figured out how to crawl the web, index it, and cache it, and how to do the keyword-based advertising, and after I wrote the programs for it, which were actually pretty standard things out of the academic literature, the only thing left I needed to know was the exact number of machines they have. Now I have a much smaller search-space for that number. Bwa-ha-ha!

  49. In other words by InfiniteWisdom · · Score: 1

    If the statistics are lumpy when they reach the SEC, they will be lumpy when they reach you.

  50. Have you seen the dictionary? by Anonymous Coward · · Score: 0

    n/t

  51. Why reverse engineer... by SporkLand · · Score: 5, Informative

    When you can just open "Computer Architecture: A Quantitavie Approach, 3rd Edition" by Hennessy and Patterson to page 855 and find out that in summary:
    Google has 3 sites (two west coast, one east)
    Each site connected with 1 OC48
    Each OC48 hooks up to 2 Foundry BigIron 8000 ...
    80 Pc's per rack * 40 racks(at an example site)
    = 3200 PC's.
    A google site is not a homogenous set of PC's instead there are different types of PC's that are being upgraded on different cycles based on the price/performance ratio.

    If you want more info get the patterson hennessy book that I mentioned. Not the other version they sell. This one rocks way harder. You get to learn fun things like Tomosulo's algorithm.

    If I am violating any copy rights feel free to remove this post.

    1. Re:Why reverse engineer... by Anonymous Coward · · Score: 2, Insightful

      That book was written about 3 years ago.

    2. Re:Why reverse engineer... by Anonymous Coward · · Score: 0

      I was just about to ask about the bandwidth. Is there a site out there that lists that kind of thing for large installations? I'm just wondering what kind of connection places like MIT and M$, etc. have.

    3. Re:Why reverse engineer... by blanktek · · Score: 1

      Also, check out a lengthy sample chapter.

    4. Re:Why reverse engineer... by fullofangst · · Score: 1

      Don't worry dude, slashdot don't remove posts.

      They just moderate them to -100 and no-one can ever see them (except the admins, who jack off nightly after reading where to bittorrent Triumph of the Geeks)

    5. Re:Why reverse engineer... by jabbo · · Score: 1

      All of the above may have been correct at one point in time, but the specifics have all changed.

      For starters, Foundry switches are trash ;-)

      --
      Remember that what's inside of you doesn't matter because nobody can see it.
  52. Hmm by CrystalFalcon · · Score: 1

    The reason this is considered funny (perhaps not by moderators, but in general) is why the Swedes took to the streets for the abolishment of IP harassment yesterday.

    What are you doing?

  53. inside information by sir_cello · · Score: 4, Interesting

    Interesting People 2004/05:
    I know for a FACT they passed 100,000 last November. One thing the Louis calculation may have missed is Google's obsession with low cost. For example read the company's technical white paper on the Google file system. It was designed so that Google could purchase the cheapest disks possible, expecting them to have a high failure rate. What happens when you factor cost obsession into his equation?

    1. Re:inside information by Anonymous Coward · · Score: 0

      Its hard to tell exactly from the filing and I'm not an accountant (though sometimes I feel like one) but usually when you buy a piece of equipment it has a certain life, say 4 years, and then you spread the cost of the aquisition over the 4 years from an accounting point of view -- this is called amortization.

      That is, if I spend $1000 on a computer that I expect to last me 4 years, then despite the fact that I spent the whole $1000 this year I only account for $250 of it as a cost each year (the remainder gets tied up in an asset account to keep everything balanced).

      Someone with a better knowledge of accounting correct me if I'm wrong, but this could significantly magnify the purchasing power of that $250M number.

    2. Re:inside information by gammelby · · Score: 4, Informative
      In the talk mentioned in a previous posting, mr. Hölzle also talked about disk failures: They have so many disks (obviously of low quality, according to you) and read so much data, that they cannot rely on standard CRC-32 checks. They use their own checksumming in a higher layer to circumvent the fact that CRC-32 gives false positive results in one out of some-large-number.

      Ulrik

    3. Re:inside information by duffbeer703 · · Score: 1

      Depreciation isn't related to what you can or cannot buy. That's what cash flow is for.

      For tax purposes, they have to depreciate the costs of their equipment across its useful life.

      If you purchase a computer for $1,000 and depreciate it over four years, you can "write off" $250 per year.

      --
      Conformity is the jailer of freedom and enemy of growth. -JFK
  54. 10,000 or 80,000? by Anonymous Coward · · Score: 0

    first off, don't they list the 10,000 figure as number of machines per data center? and i know they have at least 5 or 6 data centers around the country. so as far as i'm concerned, these numbers still add up...

  55. I'm more interested.. by diegomontoya · · Score: 5, Funny

    in how they recycle their gigantic heat output...perhaps move data center to the windy city, open up a homeless shelter next door, and put the hot air to good use for once. They might even get a tax break on this.

    Better yet, open up a nursery (plant type) next door , build a green house, and piple 25% of the heat to it. Have you guys see the price of trees lately? Google could make a killing with the "recycling" plant.

    1. Re:I'm more interested.. by Anonymous Coward · · Score: 0

      They give it to SCO in exchange for a server license :)

      Don't bother modding me up im AC today.

    2. Re:I'm more interested.. by Anonymous Coward · · Score: 0
      • etter yet, open up a nursery (plant type) next door , build a green house

      Dood. You can do a lot better than that. Just go for weed! You bet weed is better priced than trees, man. And the fibres from the remaining stalks are almost as strong as any tree-fibres, so their still be usefull remians.

      You could probably supply a statesworth of stoners with a rig like this :) And this is not tax deduction, this is pure profit!

      Oh, that's right. There's a prohobition.... Which haven't been lifted yet. D'Oh.

      /covardly posting as A.C., after some nasty recent karma burning :)

    3. Re:I'm more interested.. by bgarcia · · Score: 3, Funny
      I'm more interested in how they recycle their gigantic heat output...perhaps ... open up a homeless shelter next door, and put the hot air to good use for once.
      By cooking the homeless people?
      I guess that's one way to solve the homeless problem.
      --
      I'm a leaf on the wind. Watch how I soar.
    4. Re:I'm more interested.. by JDWTopGuy · · Score: 2, Funny

      Hey, with a little hot sauce... after all, beggars can't be choosers, but they make great tacos!

      --
      Ron Paul 2012
    5. Re:I'm more interested.. by identity0 · · Score: 2, Funny

      Even better - open up a greenhouse next door, and buy some grow lamps. I'm sure geeks will pay a lot of cash for "Google Doobies" :)

      Google could corner the geek pothad market with their revolutionary "plant ranking" engine, and the power consumption from the grow lamps can be easily hidden in Google's normal power bill, too.

    6. Re:I'm more interested.. by void* · · Score: 1

      Google Green Is PEOPLE!

      --


      Code or be coded.
    7. Re:I'm more interested.. by Anonymous Coward · · Score: 0

      That is right... I will market it as soyent green.

    8. Re:I'm more interested.. by phaze3000 · · Score: 1
      Now there's an idea - Google-brand skunk.

      Have they got a data centre in Amsterdam?

      --
      Blaming GW Bush for the Iraq war is like blaming Ronald McDonald for the poor quality of food.
  56. Absolutely Beautiful by Anonymous Coward · · Score: 5, Insightful

    All those machine, all that complexity and activity, all boiled down to one little box under a Google logo. The most useful input box on the internet.

    Thanks Google!

  57. I'd hate to see their power bill! by penginkun · · Score: 1

    Can we blame them for the rolling blackouts? ;)

  58. Nobody has 88 systems in a rack by Glasswire · · Score: 0

    ...due to simple arithmetic: if a std 42U rack was full of 1U, dual proc systems (42) that's 84, not 88 cpus AND CERTAINLY not 88 DPs (196 cpus).

    Now normally you don't fill all 42U with systems either -for thermal and other logistic reasons.

    Probably, your 899 racks (assuming you're right about THAT) would really hold about 35 systems so you're at about 31465 systems (62930 cpus).

    1. Re:Nobody has 88 systems in a rack by ColdZero · · Score: 0

      Blades maybe?

    2. Re:Nobody has 88 systems in a rack by Grimster · · Score: 4, Interesting

      I was in Exodus - Toyama facility in Sunnyvale, CA back in 2001 and was talking to some of the data center techs, they were bitching because Google DOES stack 44 -half depth- servers in a rack, on EACH SIDE (aka 88 servers per rack indeed) and how the heat that produces is absolutely fucking insane and how he can't believe they don't meltdown. He was comlaining how frugal google was not giving the systems more room to breath.

      --
      --- www.f-theocean.com
    3. Re:Nobody has 88 systems in a rack by Anonymous Coward · · Score: 0

      Somehow, I'm reminded of battery hens.

      Boycott Google! They're cruel to computers! Slashdotters demand free-range servers! Give them room to breathe!

    4. Re:Nobody has 88 systems in a rack by fullofangst · · Score: 1

      Don't believe you, sorry.

    5. Re:Nobody has 88 systems in a rack by Anonymous Coward · · Score: 0

      *hint* ...

      custom racks

    6. Re:Nobody has 88 systems in a rack by tigga · · Score: 1
      ...due to simple arithmetic: if a std 42U rack was full of 1U, dual proc systems (42) that's 84, not 88 cpus AND CERTAINLY not 88 DPs (196 cpus).

      There is NO std 42U rack. 42U is a least common denominator. In AboveNet facility in San Jose my company had 22 2U servers per rack + space for Portmaster and power management boxes. Exodus and Qwest have I think smaller racks but still you may put 42 1U servers plus switches there. Those racks have some spare space to allow cable management etc. No wonder Google could stack 44 boxes. By the way you are forgetting they have non-standard boxes - they might be thinner.

    7. Re:Nobody has 88 systems in a rack by Grimster · · Score: 1

      Ok, don't. Not like I give 2 shits who or what you believe.

      --
      --- www.f-theocean.com
  59. You're not factoring in Google's culture by gregwbrooks · · Score: 3, Interesting
    Google is all about two things from an operational standpoint:

    • Keep costs down; and
    • What happens inside the company, stays inside the company.
    Figuring out the number of servers they have is why we're noodling over the second point, but the first point is what probably as us all thrown off. Someone in a position to know said recently that he could state as a an absolute fact they have more than 100,000 servers -- and added that merely mentioning it probably violated multiple NDAs he had.
    --


    "It was a summer's tale: Just a boy, his Linux, and a head full of dreams..."
    1. Re:You're not factoring in Google's culture by edgedmurasame · · Score: 1

      It's no surprise that #2 is there, but the wording gives them away. It should be no surprise, since they come from Stanford. Just need someone to Fastlink them. Takers?

      --
      "Forget the engineers." -Carly Fiorina, briber of MIT Technology Review.
  60. When the CIO was at SVLUG by nbahi15 · · Score: 2, Informative

    The CIO and Head Brainsurgeon (he really is a medical doctor) was at SVLUG last year he said there were about 11500 Linux boxes at Google.

  61. Reverse engineered? by bartwol · · Score: 1
    "It takes information from the S-1 Filing to reverse engineer how many machines Google has..."

    I knew a woman who "reverse engineered" the lines on the palm of my hand and determined that I had 10 inches of manhood.

    I think she went to the same school as Slashdot's engineer.

    <bart

  62. The formula works! by Tablizer · · Score: 1

    1. Implement a Beowulf Cluster
    2. Index Soviet Russia before they index you
    3. Index images of panties and Natalie Portman
    4. Profit!

  63. Choke point? by PurifyYourMind · · Score: 1

    This is more of a technical question than a business one, but what exactly is the "choke point" in the server side of a Google search? Sure, those 10,000+ machines have no problem obliterating even complex searches. But surely there must be "gateway" type machines which route the searches. How does Google prevent these from being overloaded? (Or is that a trade secret?)

  64. False advertising. by duckpoopy · · Score: 5, Funny

    They better have at least 10^100 machines, or they will be getting a call from my lawyers.

    --
    word.
    1. Re:False advertising. by cayfer · · Score: 1

      10**100 is a "googol", nor "google"

    2. Re:False advertising. by Anonymous Coward · · Score: 0

      "Not" is spelled "not", not "nor", you pedantic fuckwit.

  65. Environmental impact: power to 68,000 homes by XavierItzmann · · Score: 3, Funny

    Did anyone think of the electricity needed to power and cool 50,000 servers

    The 1,100 Apple cluster at Virginia tech uses 3 megawatts, sufficient to power 1,500 Virgina homes
    http://www.research.vt.edu/resmag/2004resmag/HowX. html

    Yes, it is true: every time you hit Google, you are polluting the Earth.

    --
    The next pasture is always greener
    1. Re:Environmental impact: power to 68,000 homes by A.T.+Hun · · Score: 3, Interesting

      Yes, it is true: every time you hit Google, you are polluting the Earth.

      Whereas Slashdot uses nothing but solar power.

    2. Re:Environmental impact: power to 68,000 homes by Anonymous Coward · · Score: 0

      I am sure their machines are a lot more efficient than yours. You waste more running a search on Google than they do preforming it.

      Yes, you made the sky a little darker by making this post.

    3. Re:Environmental impact: power to 68,000 homes by Anonymous Coward · · Score: 1, Insightful

      Each apple requires more power than an entire house? well i guess that makes sense considering that only 1 out of 10 Virginia homes have power.

    4. Re:Environmental impact: power to 68,000 homes by XavierItzmann · · Score: 1

      He he.

      I live surrounded by woosies who are into the whole oganic/save the planet/Kyoto fashions.

      It is great to pull their strings by pointing out that they are great Earth killers, merely by doing things like taking a shower or using Google.

      --
      The next pasture is always greener
    5. Re:Environmental impact: power to 68,000 homes by XavierItzmann · · Score: 1

      You make a very good point: even with A/C, it is unreasonable that each G5 uses so much power.

      Looking again at the article (VT Research Mag): http://tinyurl.com/27pzy

      I am thinking that they probably meant a net increase of just 1.5MW for the 1,100 machines; i.e., the equivalent of 750 homes.

      Even this figure seems too large, as each Apple would consume 70% of a home's power. Maybe they installed overcapacity in order to acommodate further expansion?

      Oh well, point remains: each click on Google and each posting on /. results in consumption of oil, coal, or nuclear power.

      --
      The next pasture is always greener
    6. Re:Environmental impact: power to 68,000 homes by lawpoop · · Score: 4, Insightful

      Yes, it is true. We can't exist without polluting. However, I'm willing to bet, without doing the calaulations, that the pollution you personally generate by querying google is much less than what you generate browsing slashdot on your home computer.

      --
      Computers are useless. They can only give you answers.
      -- Pablo Picasso
    7. Re:Environmental impact: power to 68,000 homes by Anonymous Coward · · Score: 0

      Every time you shit, you pollute the earth too

    8. Re:Environmental impact: power to 68,000 homes by drinkypoo · · Score: 1

      It's also a lot more efficient than driving around the USA looking shit up in libraries. Afer all, interlibrary loans, when they are even available, often do not operate outside of a given county let alone any greater geographical boundaries. The exception to this is of course major Universities... In any case it's better than some of the alternatives.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    9. Re:Environmental impact: power to 68,000 homes by fermion · · Score: 1
      Well we obviously need to find those 1500 homes and burn them down. We can't have then using the power needed for the Apple Cluster.

      No, really. I understand and what you are saying, I just think it is silly to compare one use of power to another, as if one were a good use of power and one were a bad. No matter what, creation of power increases the entropy of the universe and leads us closer to the big heat death (or cold death, or perhaps nothing, depending on your image of the shape of the universe). Why is powering a home better than a data server. Is someone cooking meth more valuable than discovering a new state of matter?

      OTOH, comparing equal things might make sense. For example, that asault vehicle might use 5 times as many resources as the mid sived car. Or the Intel computer might use twice as much power at the PowerPC. Or the 15,000 housing running the air conditioning 24 hours a day might use as much power all the poject housing in the entire state.

      --
      "She's a scientist and a lesbian. She's not going to let it slide." Orphan Black
    10. Re:Environmental impact: power to 68,000 homes by Anonymous Coward · · Score: 0

      You think you're so powerful you can kill the earth? That is ignorant and arrogant.

      Mother nature bats last. We could possibly wipe out the human race, but Earth? HA!

      You keep pulling people's strings... I'm sure it makes you feel very productive. Or what? What's the big thrill?

      Of course, if you really believe this, show us you care and stop taking showers (assuming you are showering or bathing), so we all know to avoid you IRL too.

    11. Re:Environmental impact: power to 68,000 homes by RTMFD · · Score: 1

      Yes, it is true: every time you hit Google, you are polluting the Earth.

      ...and it is true that every time you fart, you are polluting the Earth.

      To me, it seems like Greenies will find any way they can to "rain on the parade of progress." Go back to living in your urban ivory-tower and leave the rest of us schlubs alone to pursue the fruits of technology nag-free.

    12. Re:Environmental impact: power to 68,000 homes by eclectro · · Score: 1

      Whereas Slashdot uses nothing but solar power

      Slashdot runs on hot air.

      Except for the mods - they use crack.

      --
      Take the cheese to sickbay, the doctor should see it as soon as possible - B'Elanna Torres, "Learning Curve"
  66. Not all bought at once by hey · · Score: 1

    A major thing wrong with those price calculations is that fact that Google has probably neen buying a few (hundred) machines a month for the last few years. Also old machines (and some new ones) die and are retired. So the cost is probably very messy. Presumably when they back a bunch they go for the best price/performance ratio at the time. As the years have gone you get more cyles per buck.

  67. Returns 404..Google cache by Pranjal · · Score: 1


    Here's the Google Cache of that page. The link returns a 404 for some reason.

    1. Re:Returns 404..Google cache by Anonymous Coward · · Score: 0

      No it doesn't.

  68. Amazing by Anonymous Coward · · Score: 0

    Any formula to calculate how many google stocks the slashdot owners bought ?

  69. Some Actual Numbers by William+Marcy+Tweed · · Score: 1
    Well, if you want to use some real numbers for the back of the envelope calculations then you should at least use google's own numbers. From their own site at http://www.google.com/appliance/hardware.html take a look at the GB-8008. Eight 1U systems that can index 15M documents. 15M / 8 = 1.875M. This is spread out over five collections of 3M each. With google claiming 4,285,199,774 pages (docs) indexed divided by the above figure will get you 2,286 machines.

    Now this figure seems pretty low to me. Even if you double it, considering one cluster on each coast, you're still not near what others have speculated.

  70. What make you think it's all spent on *machines*? by Moderation+abuser · · Score: 1

    Ah, how much do you think a machine room costs?
    How much for UPS provision?
    How much for cooling?
    How much for LAN/switching?

    While individual machine costs may well be low these days, the infrastructure to support them is *not* cheap.

    I reckon the estimates of the numbers (40k, 79k etc) of machines based on that 250 million dollar figure *seriously* overestimates the numbers of actual computers.

    --
    Government of the people, by corporate executives, for corporate profits.
  71. Sentience by Anonymous Coward · · Score: 0

    A good gamma radiation blast could turn those 100,000 machines against us! The once great internet oracle could turn and wipe out humanity.

    "GOOGLe" I see the 3 6's.

  72. Scary... DDOS? by moosesocks · · Score: 2, Interesting

    Isn't it scary that according to these figures, Google's datacenter should theoretically be able to DDOS the entire Internet?

    Someone mentioned that they have enough bandwidth/processing power to saturate a T1000 line. Scary...

    --
    -- If you try to fail and succeed, which have you done? - Uli's moose
    1. Re:Scary... DDOS? by Anonymous Coward · · Score: 0

      Isn't it scary that according to these figures, Google's datacenter should theoretically be able to DDOS the entire Internet?

      but they're not running Windoze, so no need to worry.

    2. Re:Scary... DDOS? by FunkSoulBrother · · Score: 1

      wouldn't the entire attack be coming from the same (few) places though? Couldn't you just drop packets from specified IP ranges?

  73. Re:Mirror , just in case by kinzillah · · Score: 1, Funny

    So tell me, do you have a script that automatically makes useless mirrors? How come we never see mirrors of things that actually get brought down?

    --
    Douglas P. Price
  74. surely Big Iron would have been cheaper? by christophersaul · · Score: 1

    Wouldn't it have been easier just to have put up a couple of racks running traditional Unix Big Iron, instead of wasting all that money on lots of crappy little Intel boxes?

    1. Re:surely Big Iron would have been cheaper? by name773 · · Score: 1

      resource allocation becomes more flexible as you get more computers. scaling is also easier, say you want only a bit more power to do an extra page, you're not going to buy a multi processor ultra-sparc when a pentium II could do the job.

    2. Re:surely Big Iron would have been cheaper? by christophersaul · · Score: 1

      I know, I was only joking really. Any time anyone mentions Google on Slashdot, it's usually making the case that since Google run their business on thousands of broken 386s running Linux, everyone else should be able to as well.

    3. Re:surely Big Iron would have been cheaper? by name773 · · Score: 1

      man am i dumb...
      but hey, if anyone does decide to switch, i run a "big iron" recycling center

  75. i didnt believe it ... by Anonymous Coward · · Score: 0

    i must have gotten dumber over the years, but i
    honestly think google used to do a much better job
    maybe two years or more ago ...

    it's not how many computers you have but how fast
    your backbone connection is, how "good" your
    database is and how efficient your search/index
    algorithem is ...
    anyways methinks this "how-many-computer" thingy
    is just to boost the initial IPO offering price
    since just going public for some MELLIONS because
    of some data on HDD that might just be obsolete in
    a few weeks is ..errr... another bubble?

    IF google goes public for MELLIONS you can count
    me in on the search wars, since the amount
    everyone is ready to pay for a google stock is
    mighty redicilous!!!

    maybe googles has "revers-kick-my-butt" problems
    since they're putting their own ads on some pages?
    dunno ...

  76. Acquisition by MrChuck · · Score: 4, Insightful
    recall that important mantra:
    The cost of acquiring the machine is a fraction of the cost of owning it.

    And lets not forget the overhead of 2 networks per machine and all the patch panels, wiring, switches. Toss in console management (which may not be on all machines at all time), monitoring and management of said machines. Oh, and one really tired guy running around.

    Disks are going to fail at a rate of several hundred or thousand PER DAY, just statistically. (along with power supplies etc)

    Toss in that in three years, ALL of those machines are obsolete.
    That's huge.

    I've got ~300 racks in a half full data center upstairs from me. All network cables run to a room below it to patch panels. Around 50% the size of the DC is cable management. Next to that is a room FILLED with chest high batteries - these are used during outages until the generators need to be kicked on. And a NOC takes up about 1/5th the space of the DC (monitoring systems worldwide, but it's got seating for maybe 40 people - tight and usually filled with 10 folks, but in a crunch we live up there).

    So that $3159 is only a bit of it. And in 3 years, all those machines will likely be replaced for whatever $3k buys then. That's about to be a 2 CPU Athlon64 box. If Sun can pull a rabbit out of its ass, we'll have 8 and 16CPU Athlon64 boxes. At least with that, some of the CPUs can talk to each other really really really fast.

    1. Re:Acquisition by Anonymous Coward · · Score: 5, Informative

      >>Disks are going to fail at a rate of several hundred or thousand PER DAY

      that's a little over the top big guy. i've worked at a 10,000 node corp doing desktop support. We lost ONE disk perhaps a week....if that much. We often went several weeks with no disks lost.

      even if you factor in multiple drives per server, say TWO (because they are servers not desktops)

      Interpolate for 100,000, that's a max of 20 disks per week...on the high end.

    2. Re:Acquisition by igotmybfg · · Score: 1

      disks don't fail THAT fast/much, also management needs are way less than you envision - they don't need to f**k with each box individually, it can be done remotely

    3. Re:Acquisition by onepoint · · Score: 2, Informative

      Well it really depends on what your willing to spend for a drive and the quality. I will agree, 10000 drives should give you about 2 to 6 failures per week. But I have seen that sometimes in a web server situation 10000 drives have a failure rate of about 15 per week. In one case ( very very bad case ) we had a batch of bad drives come in, the first 70 had a complete failure within 4 weeks then the rest of the order failed within 6 months.... we nevered ordered that model of drive again.

      Now we have had some great luck also, where we found a brand that almost never failed for 12 to 18 months at a time, so we set up a specific policy that we used those drives as back-up redundancy drives for every main drive ( about 2500 drives ), to this day I have yet to see more than 1 failure per every 3 weeks with those drives.

      Now I have a pc at home that has been abused daily and have never had drive failure, it's been turned on every day since 1999, so it cycles completely from hot/cold and sleep/aware. maybe I'm lucky but I've abused that drive consistantly ( and back up weekly ) so maybe I'm due.

      Drive spin has become a huge factor in relation to drive failure in a web server farm. You want the fastest spin rate and at the same time you need the fast read times, but the faster spin rates give you higher failures, so you really have to learn to blend cache's, hardware and software and the dreaded mix drive raid.

      best of luck to all

      Onepoint

      --
      if you see me, smile and say hello.
    4. Re:Acquisition by ron_ivi · · Score: 1
      Parent wrote: "The cost of acquiring the machine is a fraction of the cost of owning it. ... Oh, and one really tired guy running around. "

      Only in a really poorly managed network. Failover is a wonderful thing.

      Cringley analyzed this before:

      Now here is the part that sticks in my mind: the fault tolerant nature of the cluster is such that if a machine fails, the other machines simply take over its functions. As a result, <b>whenever a server fails at Google, THEY DO NOTHING. They don't replace the broken machine. They don't remove the broken machine. They don't even turn it off.</b> In an army of drones, it isn't worth the cost of labor to locate and replace the bad machines. Hundreds, maybe thousands of machines lie dead, uncounted among the 10,000 plus.

      We have reached the point where we are totally dependent on computers, yet the marginal cost of a computer&#151;at least for Google&#151;is nothing. This may be an historical first.
    5. Re:Acquisition by ostiguy · · Score: 1

      I have a tough time believing this - when you are paying for hosting, paying for dead boxen will eventually add up. I tend to think, that if google is as smart as we think they are, they have a rack monitoring system that could establish at what point they should have a tech visit the hosting facility - i.e, 3 boxes dead costs them X a month, etc.

      ostiguy

    6. Re:Acquisition by ron_ivi · · Score: 1
      I suspect it's a combination of both approaches. No need for immediate action; and you just let them sit and wait until you're running low on rackspace.

      Then when you need to expand, "ping" them all (no need for overly complex monitoring SW in this case) and reclaim them in groups of hundreds. If you do this once/quarter or so it's not too much of an administrative burden.

    7. Re:Acquisition by Anonymous Coward · · Score: 0

      Right, Chingy is wrong.

      I read a reliable report that the Google server room monkeys wheel around carts of disks, etc and replace as necessary during their at least daily rounds of the server racks.
      The white "box" days are gone and the various naked and non-naked 1U machines are in.

    8. Re:Acquisition by afidel · · Score: 1

      My guess it that they actually NEVER replace individual machines, they put new racks together as capacity demands and when a new rack is put together it is either assigned to an underserved DC or is sent out to replace the oldest/most degraded rack. This way they simply wheel in the new rack, hook it up, and wheel out the old rack, no downtime, no wasted personell time.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    9. Re:Acquisition by duffbeer703 · · Score: 1

      If you see that many failures, you definately have some heat or quality control issues.

      The organization that I work with has around 30,000 workstations and 4,000 servers. Last year we had 76 disk failures.

      --
      Conformity is the jailer of freedom and enemy of growth. -JFK
    10. Re:Acquisition by crucini · · Score: 1

      As I understand, there is no console management. And broken machines are never fixed or replaced - the system re-replicates the data that was (redundantly) stored on that node.

      Of course I could be wrong - I'm basing this on the few glimpses Google has allowed of their operation in the past.

      I don't think these PC's cost $3159. More like $500. They are just cheap PC's with two disks, either in a 1RU case or naked on a plate - Google has used both packaging techniques.

    11. Re:Acquisition by RabidMonkey · · Score: 1

      sounds like a fun place to work .. you hiring? :)

      --
      We emerge from our mother's womb an unformatted diskette; our culture formats us. - Douglas Coupland
    12. Re:Acquisition by Hoser+McMoose · · Score: 1

      Just to throw some numbers at that hard drive stat of yours...

      Mean Time Between Failures means the number of drive-hours between a hard drive failure within the warranty, ie it's exactly what we need to solve this problem. Low-end IDE drives have an MTBF of somewhere around 500,000 hours.

      So, if we figure that Google has 78,000 servers, each with a single IDE drive (apparently that is how they have them setup), they should have 1 drive failure every 6.4 hours, or about 3-4 drives a day. Definitely a non-trivial amount, and obviously some days will be a lot worse than others, but certainly not hundreds per day, let alone thousands. Even if the MTBF is somewhat overstated and is only 300,000 hours you're still looking at single-digit drive failures per day on average.

    13. Re:Acquisition by Anonymous Coward · · Score: 0
      Sure. First we'll have a couple meetings to decide who should be on the committee to ponder the formation of a plan. Then we'll need to write up plans on the several ways to actually implement the plan to hire you but don't worry about that. It'll be next year before we get to that.

      and your username will look like a generated password

    14. Re:Acquisition by vuo · · Score: 0

      >Interpolate for 100,000,

      Extrapolate, not interpolate. We'd need a comment from someone who manages >100000 machines to "interpolate". Interpolation means you evaluate between two or more known data points. Here you're using just one point, and that's always extrapolation.

    15. Re:Acquisition by jabbo · · Score: 1

      So, were your disks running continuously, 24 hours a day, 365 days a year? And did some of your desktops have more than two (say, 4, 6, or even 8) disks piled into them? Your calculations are on the right track, but your assumptions are flawed.

      10,000 nodes is no big deal unless they're all continuously active. Multiply that several fold, into multiple data centers, and you start to see statistically valid patterns emerge. One of those patterns is that drive manufacturers are rather optimistic about their MTBF. Another is that even ECC RAM has more flaws than you would expect (!).

      Google Has Over 10,000 Computers... how many more, no one who has signed an NDA is permitted to say.

      But I'll tell you one thing, Google has never paid $3000 for an ordinary production server and never will... that would be apostasy.

      --
      Remember that what's inside of you doesn't matter because nobody can see it.
  77. They use low power Pentium IIIs by Anonymous Coward · · Score: 0

    I read an article on The Register saying they laughed at Intel when they wanted them to use the 200W Itanium chips. Power/heat ratio and low cost are the single biggest factors in Google's hardware decision.

  78. They don't need 40K machines! by Horia · · Score: 1

    Do the math. Gigablast.com said they use 8 machines (each P4 2.4, 2Gb ram, lots of hdd) and they handle an index of 250Mil pages.

    So, of the search engine part of google has like 4.2 billion pages, that would scale up to 135 machines!

    So with 200 machines they could do the job. OK, that is not completely correct because we didn't account for the difference in traffic.

    Gigablast says it handles 50searches/sec on 8 machines - 6.25 per machine. Google has 200million searches/day that is approx 2400/sec so that would scale to 384 boxes. Still way lower that 45,000.

    So I really don't get why they need so much power.

    1. Re:They don't need 40K machines! by DiscoOnTheSide · · Score: 2, Insightful

      Google also indexes images, newsgroups, has things like froogle, as well as the upcomming gmail. Not to mention all the research and other things they have going, on top of redundancy...

      --
      Viva La Revolucion! Buy a Mac!
    2. Re:They don't need 40K machines! by mikis · · Score: 1

      Yes, but the point is that they cache entire web in RAM, and that is hard to do with only 200 machines.

  79. But his low end number are Wrong... by quasi-normal · · Score: 3, Interesting

    He displayed a little numerical dyslexia... it's 359 racks, not 539 for $100 Mil. which makes the stats a little different: 31592 machines 63184 CPU's 63184 GB RAM 2527.36 TB of Disk space and I'm not sure what his logic is behind the Teraflops calculations... looks like he's taking 1Ghz==1TFlop which would give about 126.4 TFlops. Aside from that error, the figures sound pretty realistic to me. But I wanna know how much bandwidth they use.

    1. Re:But his low end number are Wrong... by TNLNYC · · Score: 1

      Thanks for the catch, I corrected the figures :)

      --
      Check out http://www.tnl.net/blog
  80. lego? by sfraggle · · Score: 2, Insightful

    Sounds like a pretty stupid idea to me. Lego is expensive stuff.

    --
    were you expecting to see a sig here? perhaps you'd rather see the inside of an ambulance!
    1. Re:lego? by james+b · · Score: 2, Interesting

      I think the parent is probably referring to some of the pictures on google's early hardware photos page, courtesy of the wayback machine. If so, the lego never necessarily went into `production', it was just when they were messing around.

    2. Re:lego? by Anonymous Coward · · Score: 0

      Those are Duplo, not LEGO.

    3. Re:lego? by Anonymous Coward · · Score: 0

      And Duplo blocks aren't the same brand or something?

    4. Re:lego? by michrech · · Score: 1

      I think the parent is probably referring to some of the pictures on google's early hardware photos page, courtesy of the wayback machine. If so, the lego never necessarily went into `production', it was just when they were messing around.

      Plus, it says (and shows) very clearly that it was only a cover for a stack of HDD's they had. It didn't say it was holding servers together as the origional poster said.

      --
      bork bork bork!
  81. Re:88 machines per rack? Blades, dude. by AJWM · · Score: 1

    In your standard 42U cabinet, you're talking a half-U per server. Umm.. not happening.

    One word: blades.

    I'm not saying that's what Google uses (I have no idea), but that is how to cram servers in at less than 1U per server (eg 8 or more server blades in 4U of rack). There are also some 1U boxes that cram 2 servers in side by side.

    --
    -- Alastair
  82. Google's Trusted Computing GRID by mytho · · Score: 1

    Google is building their trusted computing grid, where the brute power of cheap hosts are used in a global trusted matrix. It's there software that is amazing and builds this huge cluster around the world. There's no point in guessing how many machines they have now, the number rises every second. But sooner or later whether you want it or not, we will all become a part of one fancy trusted grid where we (hopefully) all share resources. If it will be Google's grid or my grid? Who know... just my 2 cents.

  83. Google Jokes by Anonymous Coward · · Score: 1, Funny

    What did one Google machine say to another Google machine?

    - Don't worry, I've got your backup.

    Why did the Goggle machine cross the road?

    - To get to the other cluster.

  84. Corrected version - Re:I have seen the light by imroy · · Score: 2, Funny

    Geez dude, go back to school and learn how to punctuate properly and the proper use of there/they're/their. I'm not a grammer/spelling nazi. Even though mistakes annoy the shit out of me, I usually let it pass. I know I make the occasional mistake myself. But your post was just too much.

    I don't know why I'm doing this, but here's a corrected version of your post:

    Working at AboveNet, Google has pulled their machines in and out of our data centers many a time. It's incredible the way they have their shit set up.

    They fit about 100 or so 1U machines on each side of the rack. They're double sided cabinets that look like refrigerators, separated in the center by noname brand switches and they have castor wheels on the bottom. Google can at the drop of a dime roll their machines out of a data centre onto their 16 wheeler then move, unload and plug into a new data centre in less than a days time.

    1. Re:Corrected version - Re:I have seen the light by NoData · · Score: 4, Funny

      I'm not a grammer/spelling nazi.

      Obviously.

    2. Re:Corrected version - Re:I have seen the light by Danny+Rathjens · · Score: 0, Offtopic
      Why is it that people whining about spelling inevitably spell words incorrectly? For a detailed correction and a mod of 3, you also missed quite a bit.

      • grammer -> grammar
      • noname -> no-name
      • castor wheels -> casters (the castor mispelling is just about common enough to be proper(although m-w.com doesn't say so, and 'wheels' is redundant in either case)
      • 16 wheeler -> 18 wheeler
      • plug into -> plug in to
      • centre -> center (I'd let that one by if you had changed both instances, but using both spellings doesn't make sense.)
      • a days time -> a day's time
    3. Re:Corrected version - Re:I have seen the light by kylegordon · · Score: 1

      You missed a comma before and after 'at the drop of a dime', since it's not a required piece of information to make an understandable sentence.
      :-p

    4. Re:Corrected version - Re:I have seen the light by ponds · · Score: 0, Offtopic

      Correcting your post:

      For some reason you bolded random words, they shouldn't be bolded. Please go back to school and learn acceptable use of bold.

    5. Re:Corrected version - Re:I have seen the light by imroy · · Score: 1

      /me pulls out his dictionary...

      • grammer -> grammar
        D'oh! I was afraid of missing something. I was being really careful too.
      • noname -> no-name
        That's debatable. Either way, you're making a compound word. Down here in .AU, "noname" actually is a brand name. Although they certainly don't make network switches.
      • castor wheels -> casters
        Nup, "castor" is in my Oxford dictionary. It specifically mentions wheels on furniture. Perhaps "caster" is an Americanism.
      • 16 wheeler -> 18 wheeler
        Do I look like a trucker? Is this TruckDot? Anyway, this doesn't have anything to do with spelling or grammar.
      • plug into -> plug in to
        Alright. Note to self: "into" is quite different to "in to".
      • centre -> center
        Once again, an Americanism. The correct Commonwealth spelling is centre. (BTW, it's annoying that the American spelling is in the HTML standard)
      • a days time -> a day's time
        I always thought this was another Americanism but it seems it isn't. I'm trying really hard to remember what I learnt in school. Hmmm...

      Slowly becomming a grammar and spelling nazi...

    6. Re:Corrected version - Re:I have seen the light by u38cg · · Score: 1
      Yo, if you gonna go all pedantic man, at least do it right:

      Working at AboveNet, Google have pulled their machines in and out of our data centers many times. It's incredible the way they have their equipment set up.

      They fit about 100 or so 1U machines on each side of the rack. They are double sided cabinets that look like refrigerators, separated in the center by no-name brand switches and they have caster wheels on the bottom. At the drop of a dime, Google can roll their machines out of a data centre onto their 16 wheeler then move, unload and plug into a new data centre in less than a day's time.

      --
      [FUCK BETA]
    7. Re:Corrected version - Re:I have seen the light by Caedar · · Score: 1

      So this is what Slashdot has come to? +5 Funny for pointing out spelling mistakes? (Yes, I know. Mod me down for offtopic, but I'm pointing out something here!)

    8. Re:Corrected version - Re:I have seen the light by imroy · · Score: 1

      I know, it's weird. I thought I'd be modded down "offtopic" or "flamebait" in the blink of an eye. Slashdot moderation is just plain weird nowadays.

    9. Re:Corrected version - Re:I have seen the light by darkmeridian · · Score: 3, Funny

      I'm not a grammer/spelling nazi.

      Obviously.

      I'm not a grammer/spelling Nazi.

      Obviously.

      --
      A NYC lawyer blogs. http://www.chuangblog.com/
    10. Re:Corrected version - Re:I have seen the light by DrEasy · · Score: 1

      "To a million eyeballs all spelling mistakes are shallow..."

      --
      "In our tactical decisions, we are operating contrary to our strategic interest."
    11. Re:Corrected version - Re:I have seen the light by Tet · · Score: 1
      The correct Commonwealth spelling is centre. (BTW, it's annoying that the American spelling is in the HTML standard)

      True, although it's deprecated. It is, however, still part of the CSS standard. There was some discussion about allowing "centre" as a synonym, but it didn't make it into the final standard IIRC.

      --
      "The invisible and the non-existent look very much alike." -- Delos B. McKown
    12. Re:Corrected version - Re:I have seen the light by Anonymous Coward · · Score: 0

      ``It's'' is incorrect. That implies ownership. The correct version is ``Its''.

    13. Re:Corrected version - Re:I have seen the light by AVryhof · · Score: 3, Funny

      According to Google...

      Did you mean: grammar

    14. Re:Corrected version - Re:I have seen the light by wheany · · Score: 1
    15. Re:Corrected version - Re:I have seen the light by Anonymous Coward · · Score: 0

      It's grammar not grammer.

    16. Re:Corrected version - Re:I have seen the light by dougkramer · · Score: 1

      It's grammar Nazi, not grammer Nazi, Mr. Spelling-Bee. - Kelsey Grammer

  85. That's Incredible! by moltar77 · · Score: 1

    Over 100,000 servers? Amazing... I wonder then how many pigeons they must have, and where do they get them all?

  86. Web in memory by Sajma · · Score: 1, Interesting

    Given how fast Google is, we expect that they keep all the text of all the web pages that they index in memory. If we estimate 100K machines and 4,285,199,774 web pages, that's 42,852 web pages per machine. Let's guess 1 GB RAM per machine, then that's an allocation of about 25 KB per page (quite a bit larger than the average page size, I suspect). Of course, they've probably replicated the web a few times; let's guess 3 times, so that's about 8 KB per page -- still room to spare, and it's possible that the average memory per machine is greater than 1 GB. Plus, they could compress less popular pages -- the delay of decompression in memory is probably small.

    Of course, once you consider that they keep thumbnails of al the images they index, things get tight very quickly. Plus, we can't forget the actual INDEX from words to documents -- that's in memory, too. And Orkut (which is probably pretty small, come to think of it).

    GMail is another story altogether. 1 GB per user for 100K users would saturate their cluster. Plus indexes for searching mail. It seems unlikely that we'll have all-memory mail accounts anytime soon.

    1. Re:Web in memory by Alain+Williams · · Score: 1

      They need to keep the indexes in memory for fast searching, they don't need to keep the cache in RAM.
      Similarily: for images they only need keep the keywords, the imgage itself can live on disk -- it isn't going to be needed very often.

      Their mail system - why keep mail in RAM ? Only one person is going to be interested in reading it, normal disk is plenty fast enough for that - OK, they might temporarily cache subject lines/... once someone loggs in, but I can't see the need to keep the entire message in RAM.

    2. Re:Web in memory by Sajma · · Score: 1

      Ah, but notice that Google returns snippets that contain the search terms for each page in the search result. If the page contents are not in RAM, then creating these snippets would need to read the page from disk. Google's performance suggests that this is unlikely. (I doubt caching just snippets is worthwhile relative to caching the whole page) You're probably right about the images (keywords only) and mail (subjects only), though.

  87. one of these days by rnd() · · Score: 1

    One of these days (in the next 10-15 years) my PDA will have as much computing power and storage capacity as Google does today.

    --

    Amazing magic tricks

    1. Re:one of these days by Anonymous Coward · · Score: 0

      ...but you'll use it all to view porn

  88. Even E-Mail by 3D+Lover · · Score: 1

    Yea, That'd be a good idea, web hosting. Oh, and they could offer e-mail too. Wait..... I think they are working on that.

  89. Hey by daishin · · Score: 2, Funny

    Google executives sir,mam,person, do you mind if you could lend me a few boxes?

    --
    (\_/)
    (O.o) This is Bunny. Add Bunny to your signature
    (> <) to help him achieve world domination.
  90. Fire sale by kansei · · Score: 1

    I work at a major investment bank and I've had a chance to see how procurement on a large scale works. I've seen them get servers / disk / etc at 50% to 65% discount in volume. It seems that when you buy a 1000s of anything directly from the manufacturer, they will bend into pretzel shapes to sell it to you at a huge discount.

  91. google is starting to resemble... by Anonymous Coward · · Score: 2, Funny

    google is starting to resemble the wonka factory from willy wonka and the chocolate factory.

  92. Google doesn't own ANY machines! by DrDebug · · Score: 1

    Here's what actually is happening...

    Google is running on YOUR MACHINE, and everyone elses machine that is connected to the internet. Through very sophisticated software and anti-detection techniques, Google uses your 'spare' CPU cycles to search, index, and respond to queries.

    Has anyone actually ever SEEN any of Google's machines?

    Scary, huh? :-)

  93. Interesting list... by glpierce · · Score: 4, Funny

    "Your phone company can't just lose a few calls you made and not bill you for them."

    Wait, what's wrong with that one?

    --
    G
    1. Re:Interesting list... by Anonymous Coward · · Score: 2, Insightful

      Working for a phone company, I can say that "we can" and "we do" :-)

  94. Sunday, Sunday, Sunday! /. vs. Google! by NarrMaster · · Score: 0

    Let's /. Google everyone!
    Click Me!

    --
    That's right. All your base.
  95. Why does anybody care? by AhBeeDoi · · Score: 1

    If you're not a competitor, why does anybody care how many servers Google uses? Is there a light bulb joke in here that I'm missing?

  96. Been there, done that... by pyrros · · Score: 1
  97. Not to sound like your Mom [or Señor Ashcroft by mosel-saar-ruwer · · Score: 1


    ...but if I were Joe al Q'aeda, with maybe a Bachelor's or a Master's in EE, and me and my friends had taken our annual stipend from the Saudi embassy to build ourselves a nice little EMP weapon and to purchase a van to house it, well, I think you've just told us where we'll be heading...

  98. Server pricing by JWSmythe · · Score: 5, Informative

    His pricing in the summary may be a bit off.

    Every article I've read about Google's servers says they use "commodity" parts, which means they buy pretty much the same stuff we buy. They also indicate that they use as much memory as possible, and don't use hard drives, or use the drives as little as possible. From my interview with Google, they asked quite a few questions about RAID0, RAID1 (and combinations of those), I'd believe they stick in two drives to ensure data doesn't get lost due to power outages.

    We get good name brand parts wholesale, which I'd expect is what they do too. So, assuming 1u Asus, Tyan, or SuperMicro machines stuffed full of memory, with hard drives big enough to hold the OS plus an image of whatever they store in memory (ramdrives?), they'd require at most 3Gb (OS) + 4Gb (ramdrive backup). I don't recall seeing dual CPU's, but we'll go with that assumption.

    The nice base machine we had settled on for quite a while was the Asus 1400r, which consisted of dual 1.4Ghz PIII's, 2Gb RAM, and 20Gb and 200Gb hard drives. Our cost was roughly $1500. They'd lower the drive cost, but incrase the memory cost, so they'd probably cost about $1700, but I'm sure Google got better pricing, buying the quantity they were getting.

    The count of 88 machines per rack is a bit high. You get 80u's per standard rack, but you can't stuff it full of machines, unless you get very creative. I'd suspect they have 2 switches, and a few power management units per rack. The APC's we use take 8 machines per unit, and are 1u tall. There are other power management units, that don't take up rack space, which they may be using, but only the folks at Google really know.

    Assuming the maximum density, and equipment that was available as "commodity" equipment at the time, they'd have 2 Cisco 2948's and 78 servers per rack.

    $1700 * 78 (servers)
    +
    $3000 * 2 (switches)
    +
    $1000 (power management)
    --------
    $139,600 per rack (78 servers)

    Lets not forget core networking equipment. That's worth a few bucks. :)

    Each set of 39 servers would probably be connected to their routers via GigE fiber (I couldn't imageine them using 100baseT for this) Right now we're guestimating 1700 racks. They have locations in 3 cities, so we'll assume they have at least 9 routers. They'd probably use Cisco 12000's, or something along that line. Checking eBay, you can get a nice Cisco 12008 for just $27,000, but that's the smaller one. I've toured a few places who had them, and pointed at them citing them to be just over $1,000,000.

    So....

    $250,000,000 (ttl expenses)
    - $ 9,000,000 (routers)
    ------
    $241,000,000
    / $ 139,600
    ------
    1726 racks
    * 78 (machines per rack)
    ------
    134,682 machines

    Google has a couple thousand employees, but we've found that our servers make *VERY* nice workstations too. :) Well, not the Asus 1400r, those are built into a 1u case, but other machines we've built for servers are very easy to build into midtowers instead. Those machines don't get gobs of memory, but do get extras like nice sound cards and CD/DVD players. The price would be the same, as they'd probably still be attaching them to the same networking equipment. 132,000 servers, and 2,682 workstations and dev machines is probably fairly close to what they have.

    I believe this to be a more fair estimate, than the story gave. They're quoting pricing for a nice fast *CURRENT* machine, but Google has said before that they buy commodity machines. They do like we do. We buy cheap (relatively) and lots of them, just like Google does. We didn't pattern ourselves after Google, we made this decision long before Google even existed.

    When *WE* decided to go this router, we looked at many options. The "provider" we had, before we went on our own, leasing space and bandwidth directly from Tier 1 providers, opted for the monolythic sy

    --
    Serious? Seriousness is well above my pay grade.
    1. Re:Server pricing by Andy_R · · Score: 3, Insightful

      "hard drives ... they'd require at most 3Gb (OS) + 4Gb (ramdrive backup)"

      Which is why they have no problems finding space for GMail - you can't buy full size drives as small as 7Gb anymore, so they already have countless Tbs of unused drive space in their racks.

      --
      A pizza of radius z and thickness a has a volume of pi z z a
    2. Re:Server pricing by Anonymous Coward · · Score: 0

      Why the hell would they need a ramdrive backup? Didn't you read their SOSP 2003 paper? GFS makes it possible to build a filesystem that NEVER is written to disk, EVER.

    3. Re:Server pricing by JWSmythe · · Score: 1

      I'd have to assume that the smallest drives they have in operation are 20Gb, but I'd suspect they probably be smarter not to use extra storage across all those servers, they'd probably go with some large arrays.

      We can buy this for about

      Promise VTrak 15100 $4075
      15 WD 250Gb SATA $230/ea $3450
      ---------------
      3.5Tb (single RAID5) $7525

      Just looking at raw space, you could put 14 of those on a SCSI chain, and multiple cards in one machine (say 4 to keep things simple). For just $422,000, you could have 196Tb of space on a single machine. I'd strongly suspect they'd distribute it over a few more servers than just one. :)

      They're very smart offering the 1Gb storage. It'll get users in, who don't like the limits. I've offered my users "unlimited" storage for years. They were on a 45Gb drive for a long time, but are now on a 330Gb array. With 246 users, they're only using 7Gb, and honestly, most of that space is my usage. I archive several busy mailing lists, for my own usage, and plenty of accounts that exist, "just in case" we need a message from it from years ago. :)

      Google will see a very small number of it's users getting anywhere close to 1Gb. I'm sure there'll be plenty of /. readers hitting them in the first months, trying to fill their own boxes, but outside of that, it will only be very occasionally that.

      One account we have, that simply collects spam sent to other accounts, took 7 months to collect 1.1Gb of mail. Hmmm, maybe I should purge that one. Nah, we still have 300+Gb free. :)

      --
      Serious? Seriousness is well above my pay grade.
    4. Re:Server pricing by Anonymous Coward · · Score: 0

      Wow, I had no idea. You guys run a great site. I'm a proud subscriber of RC.

      Glad to see you're definitely not some two-bit chop shop! Keep kicking ass and taking pictures!

    5. Re:Server pricing by JWSmythe · · Score: 1

      You'd be really surprised to see what we have. Ya, it's far less than a single leased machine in someone elses room. I've been tempted to put pictures online of the colo's and our equipment, but every time I have a decent set of photos together, we've already made a change at one of the colo's, and alas, no camera.

      I haven't been back to New York, since another admin was up replacing machines, which is sad, it probably looks much prettier there now, than when I first set it up. :) I found a way to fit 12 4u machines in a cabinet, and still get the doors to shut. :)

      --
      Serious? Seriousness is well above my pay grade.
    6. Re:Server pricing by JWSmythe · · Score: 1


      I'm surprised no one corrected me on this. I was just writing something else and realized you can't fit 80 servers per rack. :)

      1u = 1.75"

      A standard rack is 70" tall.

      You can fit 40 machines per rack. Knock 1u off for the switch, so 39 per rack, not 78. The price for servers and routers comes out the same, just twice as much floor space, which to us non-multi-billion-dollar companies costs money. :)

      I'm fairly sure Google outright owns their facilities, so they don't have to worry about that square footage. That's just a little less space for the foosball room.

      --
      Serious? Seriousness is well above my pay grade.
  99. Yes, but.... by Anonymous Coward · · Score: 0

    > After many scientific and time consuming
    > experiments, we have found the number of servers
    > to be.........
    > 42.

    Yes, but each server is the size of a planet.

  100. Re:Not to sound like your Mom [or Señor Ashcr by Anonymous Coward · · Score: 4, Funny

    You might also be interested to know that there are a lot of government buildings in Washington DC.

  101. Re:Not to sound like your Mom [or Señor Ashcr by geniusj · · Score: 1

    Ha! It is quite a neat place. One of the nicest datacenters I've been in. They do try to take some precautions I guess. For example, you won't find the addresses to any of their datacenters on their website and the buildings are completely unmarked. They're usually not in business parks or anywhere you'd really expect them to be either. However, I'm sure that information could be dug up rather easily :)..

    Now would you have to get into the datacenter first? Or are the concrete walls a non-issue?

    -JD-

  102. I think they include infrastructure & air cool by melted · · Score: 2, Interesting

    I think they include infrastructure and air cooling into their $250M figure. I these things can actually cost MORE than the racks themselves, especially if these racks consist of commodity hardware, and considering the size of their data center.

  103. You'll go blind. by Anonymous Coward · · Score: 0

    You just trolled yourself.

  104. google uses solid state hd's by nester · · Score: 1

    wasn't there recenly an article here on /. about how, due to the unreliability of mechanical/magnetic drives, it was cheaper (and faster, of course) for them to use solid state drives? i know i read that about some company. i couldn't find the article, though.

    1. Re:google uses solid state hd's by dumbfounder · · Score: 1

      please tell me you found the article, I have been looking for a decently priced solid state disk for years. What is so hard about having a battery backed up memory board you can put 16gb or more of cheap RAM on? Seems almost trivial. If I were google, I would buy a company just to make me hardware like that. But I am dumbfind, and thus it still doesn't make financial sense.

  105. ya, but... by Teh_monkeyCode · · Score: 1

    does it run Linux?

    --
    -------
    Chunky Bacon
  106. Largest supercomputer? by artlu · · Score: 1

    Based on that assumption, if they do have 100k servers and if those servers were each 1ghz wouldnt they be bigger than any supercomputer in terms of pure performance?
    100k Ghz vs. VA Tech's 4400 Ghz.
    Or am I completely wrong?

    --
    -------
    artlu.net
  107. "Google" is singular not plural, get it right by Myrrh · · Score: 1

    Google "have?"

    What is with this movement to consider corporations as plural rather than singular?

    A corporation, i.e., Google, is SINGULAR. As in, "Google HAS pulled its machines out of a datacenter..."

    If you said something like, "Google employees HAVE pulled their machines out of a datacenter..." then that would be correct.

    If you're going to be pedantic, so am I.

    1. Re:"Google" is singular not plural, get it right by swillden · · Score: 1

      What is with this movement to consider corporations as plural rather than singular?

      It's not a "movement", it's a cultural difference. Americans use the singular, Britons and Australians use the plural.

      I'm American but I actually think that their way is better because it emphasizes the fact that organizations are comprised of people. Many fallacious arguments are built on the notion that an organization has "an" opinion, intention or goal, when the reality is that the public actions and decisions of a company, government, etc. are just the external effects of many separate -- and often poorly coordinated -- internal actions and decisions.

      Of course, I've seen the same fallacious arguments from Brits and Aussies, so maybe it doesn't really help...

      --
      Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
    2. Re:"Google" is singular not plural, get it right by swillden · · Score: 1

      Oh, forgot to mention: I'm not sure which way is correct in Canadian English. I've seen Canadians do it both ways though, in my experience, more of them follow the American style.

      --
      Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
    3. Re:"Google" is singular not plural, get it right by Myrrh · · Score: 1

      Hmm. I hadn't considered that. I figured it was just poor grammar (which is, sadly, characteristic of computer programmers and scientists of my generation).

      Legally, however -- at least in the United States -- when a company incorporates, the corporation itself becomes a person. The corporation can be sued, but the individual boardmembers and employees cannot. These people are protected from both creditors and customers by way of incorporation.

      That is why I think it's most correct to refer to incorporated companies as singular. It takes a bit more work to say something like "the employees of X corporation" or "the board members of X corporation," but that indicates without ambiguity that you are talking about the people in the corporation and not the corporation itself.

    4. Re:"Google" is singular not plural, get it right by Anonymous Coward · · Score: 0
      More information:

      collective noun
      A collective noun, or group noun, is a noun which designates a group of people or animals. Crew, committee, gang, government, audience, family, and herd are all collective nouns.

      When such a group is considered as a single unit, its collective noun is used with a singular verb and singular pronouns: The committee has reached its decision. But when the focus is on the individual members of the group, British English tends to use a plural verb and plural pronouns with its collective nouns: The committee have been arguing all morning over what they should do. American English usually uses a singular verb and pronouns in these circumstances.

      A determiner in front of a singular collective noun is always singular: this committee, never these committee (but of course when the collective noun is pluralized, it takes a plural determiner: these committees).

  108. 15000 as of 2002 by yagu · · Score: 1

    If you read (actually, watch) this article, you'll find google was using 15,000 linux servers as of the year 2002....

    "how google does it"

  109. I know how much power google uses. by DarkHelmet · · Score: 1
    1.21 gigawatts.

    They use plutonium to power their servers.

    They also have lightning rods all around their server farms to harness bolts of lightning if and when they do strike.

    --
    /^[A-Z0-9._%+-]+@[A-Z0-9.-]+\.[A-Z]{2,4}$/i
  110. No by metalhed77 · · Score: 4, Informative

    It would not be a very distributed DDOS and that would stop any attack quite quickly. Quite simply google's bandwidth providers (or the providers above them) would just unplug them. They may be global, but they probably have less than 40 datacenters. It would not be distributed enough to sufficiently attack. If you could take over the same number of machines with the same amount of bandwidth, but distributed globally on various subnets (say a massive virus), *then* you'd have a DDOS machine. As is, google's DDOS would be shut down quite quickly.

    --
    Photos.
    1. Re:No by rayvd · · Score: 3, Insightful

      The best way for Google to accomplish a DDOS if they _really_ wanted to would be to make every search result point to the target website. :)

      Now that would be impressive...

    2. Re:No by caluml · · Score: 1
      The best way for Google to accomplish a DDOS if they _really_ wanted to would be to make every search result point to the target website. :)

      Yeah - imagine www.google.com pointed to any IP address. Slashdot. Microsoft. Yahoo. Anything. It would just disappear from the internet. As would their upstream ISP, I would imagine. Now - how can I get that A record changed? :)

      calum@gk calum $ nsupdate
      > server 216.239.36.10
      > update add www.google.com 86400 A 66.35.250.150
      >
      > calum@gk calum $

      Slashdot still here? Nope, didn't work.

  111. Google a toxic waste hazard ? by scattol · · Score: 1

    If the numbers floated here of about 100,000 unreliable PCs, doesn't this make Google a serious industrial toxic waste hazard? There's a lot of lead and other exotic elements in all of these machines put together. Especially if they die quickly.

    I am half kidding here but it still makes for a large amount of hardware to dispose at one point.

  112. Re:Not to sound like your Mom [or Señor Ashcr by Anonymous Coward · · Score: 1, Insightful

    Getting into datacenters is trivial: Set up a shell company, and call the sales team and insist on inspecting their colo before comitting to anything. You'll get a guided tour of any hosting facility you want. Security at most hosting facilities is a joke - the CTO at a company I worked for used to spend quite a lot of time tracking down reliable hosting companies to use for our systems, and more than one place maintenance doors had been left unlooked etc.

  113. Who benefits most from disinformation? by Anonymous Coward · · Score: 0

    Is there any correlation between the amount of FUD, disinformation and general rumer mongering that has recently exploded around Google and the fact that Microsoft is now trying to play their game?

    Are we watching an attempt to unlevel the playing field?

    Surely all this can't be due to Google's recent move to go public.

  114. Re:Obligatory jackass by Anonymous Coward · · Score: 0

    I would also like to take this opportunity to call SILIZIUMM a jackass.

  115. Board Members/Employees can't be sued? by Firethorn · · Score: 1

    When a company incorporates, the ownly people 'protected' from suit are the stockholders. Their liability is limited to their investment. CEO's, board members, and employees can most certainly be sued if they're involved. This is why companies have so many policies and rules, it's so that if anything happens they can say 'it wasn't our fault, our employee violated orders'.

    Most lawsuits focus on the company, however, because they're considered responsible for the employee, and the fact that the corporation has loads of money where the employee/s in question don't.

    --
    I don't read AC A human right
  116. Re:Not to sound like your Mom [or Señor Ashcr by geniusj · · Score: 2, Informative

    I'm not saying that it's impossible. I'm sure any dedicated individual could do it. However, tours in datacenters are typically guided (especially at equinix). As far as getting in via unlocked doors, I'd say definitely would not happen here. You have to go through about 4 doors and 4 hand scanners to get in. There are no other entrances.

    Of course, most of it is more for show than practicality. I mean, they have hand scanners on every single cage. Definitely a little bit excessive :). However, I'm sure it impresses many decision makers.

    -JD-

  117. Re:Not to sound like your Mom [or Señor Ashcr by afidel · · Score: 1

    Who cares, MAE East and MAE West are MUCH more critical to the functioning of the internet and their position has been known for years. MAE East is on the other side of a wall from a parking deck, if you wanted to do a decent job of crippling the internet a truck bomb on the other side of that wall would do a pretty good job. Of course it would have been better before private peering facilities decentralized a lot of cross domain traffic but it would still cause enough traffic to be screwed up that it would have a very serious impact.

    --
    There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
  118. Drive failures by Anonymous Coward · · Score: 0

    Depends on if you're using IBM drives or not.

    Our IBM SAN has a dead drive more often than not. We've gone from using RAID5 in each chassis, to RAID5 + 2 hot spares because we've had multiple failures (took out the entire chassis and made some customers very angry)

  119. Gb vs GB by green+pizza · · Score: 1

    They're very smart offering the 1Gb storage.

    I'm not sure if there's an offical standard, but generally a lowercase "b" is shorthand for "bit", while an uppercase "B" stands for "byte". The difference is almost an order of magnitude.

    Google is offering 1 GB of storage, that is, 1 GigaByte, or roughly 1024 MB.

    Many networks now use 1 Gb Ethernet, that is, 1 GigaBit per second, or roughtly 128 MB per second.

    Then there's the whole MiB vs MB and GiB vs GB issue... (because 1000 != 1024) ((2^10 = 1024, 10^3 = 1000))

    1. Re:Gb vs GB by JWSmythe · · Score: 1

      So let me see if I got this right. You bothered to take the time to write your message, for one character in the wrong case? You probably read through the whole thread looking for someone to make that mistake too, didn't you?

      gigabyte. happy?

      "They are very smart offering the one gigabyte storage."

      So, what's it feel like, being so right? Been looking for it for a while, haven't you? Feel good?

      --
      Serious? Seriousness is well above my pay grade.
  120. Its obvious really by Hecatonchires · · Score: 1

    Google found the full text of the REAL ownership papers withe their caching robot, and SCO know. After all, it's Googles job to find obscure information.

    --

    Yay me!

  121. More info: The Secret Source of Google's Power by Phaith · · Score: 1
  122. Re:You fucking faggot by fullofangst · · Score: 0, Flamebait

    shut up, you fucking retard.

  123. Why use PCs? by RageEX · · Score: 1

    Google is dropping some serious $$$ on large system(s) here. Why did they choose to use clusters of PCs? It's got to be a nightmare to manage. Why not go for something from a company like SGI or IBM where you can get very large NUMA systems?

    1. Re:Why use PCs? by tigga · · Score: 1
      Why not go for something from a company like SGI or IBM where you can get very large NUMA systems?

      Let's try some arithmetics here: 1000 boxes with 2GB memory each cost about 2 million dollars approximately. How much would IBM or SGI system with 2000 GB cost?

      Well, I have no idea but have suspition it could cost ten times more.

    2. Re:Why use PCs? by jabbo · · Score: 1

      Why pay that sort of overhead? It's not like they can't just lure the appropriate engineers away from IBM or SGI. Hell, Google recently moved into SGI's old headquarters. You tell me who has more financial sense... here's a hint, it's not SGI.

      If deploying vast numbers of cheap redundant computers was core to your business model, would YOU outsource it? Not if you're smart, you wouldn't. Not if you had smart engineers (in the true sense of the word -- people who build stuff).

      Do some patent searches and you may eventually find out the real answer.

      --
      Remember that what's inside of you doesn't matter because nobody can see it.
  124. fuck racist losers with nothing better to do. by Anonymous Coward · · Score: 0

    see title.

  125. Re:Not to sound like your Mom [or Señor Ashcr by Anonymous Coward · · Score: 0
    Now would you have to get into the datacenter first? Or are the concrete walls a non-issue?

    Do they have concrete walls protecting the cabling from sabotage as it leaves the building? What about 50 ft or 100 ft or 500 ft away? I'm sure a backhoe can be rented cheaply enough...

  126. Redundancy by crucini · · Score: 3, Interesting
    The google file system is redundant. Loss of one node does not lose data.

    Some of the reasons these techniques aren't used in enterprise computing:
    1. They're hard, and business programmers are not that bright. And nobody has encapsulated these technologies in an IT product.
    2. The system can only respond quickly to a finite set of transactions that was known at design time. It lacks the flexibility of a standard file system or relational database.
    3. By the time a business has a lot of data, it usually has enough money to store the data conventionally. Search engines are a bit different.

    Since I've seen it up close a few times, I can say that the standard "enterprise way" (Oracle/Sun/EMC) delivers very poor bang for the buck. If Google wanted to, they could deliver a modified GFS with any desired level of reliability by increasing the redundancy. And even after that bloating, it would still deliver greater bang for the buck than the conventional solutions.
    1. Re:Redundancy by sql*kitten · · Score: 2, Interesting

      They're hard, and business programmers are not that bright. And nobody has encapsulated these technologies in an IT product.

      Hmm, yes. The really bright programmers are living in their parents' basement and working for IBM for free. The dumb ones are getting paid a pile of money to code up forms and reports in fancy code-generation tools, then clocking off at 5 and enjoying themselves.

      The system can only respond quickly to a finite set of transactions that was known at design time.

      Those dumb business programmers left that paradigm behind in the 80s. The tech to do it (the relational database) was developed in the 70s.

      Since I've seen it up close a few times, I can say that the standard "enterprise way" (Oracle/Sun/EMC) delivers very poor bang for the buck.

      You've "seen it up", I've "set it up", kid. Once you've been around the block a few times, you'll drop your tech-snobbery and just choose the right tool for the job.

    2. Re:Redundancy by crucini · · Score: 1
      Hmm, yes. The really bright programmers are living in their parents'basement and working for IBM for free. The dumb ones are getting paid a pile of money to code up forms and reports in fancy code-generation tools, then clocking off at 5 and enjoying themselves.

      There are some bright programmers working for free. But I'd say that bright programmers are generally working on device drivers, embedded systems, game engines, file systems, search engines, compilers, and similar things. Stuff that pushes the envelope.

      The comment about "a pile of money" is sad. People who think they're making a pile of money generally aren't. At any rate, the ability to live well, manage one's time and enjoy oneself is not positively correlated with intelligence. In fact, it may be negatively correlated. In other words, knowing how to get off work at 5 may not help you to write a fast, scalable, fault-resilient distributed datastore.
      Once you've been around the block a few times, you'll drop your tech-snobbery and just choose the right tool for the job.

      This isn't snobbery; quite the reverse. I'm quite happy with Oracle/Sun/EMC if someone else is paying. I spend a lot of time working with Oracle.

      But would you dig into your savings and buy that stuff if you were starting an internet company? If you want to choose the right tool for the job, you should start by knowing the tradeoffs of the various tools available.
  127. Sure, 10 years ago... by B4RSK · · Score: 2, Informative

    The high-end Sun machines are designed for high availability. Not only will a CPU failure not crash the machine, the CPUs are hot swappable so you can replace a failed CPU without so much as a reboot.

    Yes, 10 years ago this was a important thing to have... As were many other "big iron" features. And it still sounds very cool in a geeky kinda way.

    But with redundant relatively cheap clusters available, these types of things aren't worth the $$$ they used to be.

    Except at the extreme high end of the computing world hardware is steadily progressing to commodity level.

    --
    Some people are like slinkies--basically useless but they bring a smile to your face when pushed down the stairs.
    1. Re:Sure, 10 years ago... by dubious9 · · Score: 1

      But with redundant relatively cheap clusters available, these types of things aren't worth the $$$ they used to be.

      This is a myth is see on slashdot alot. Yes, it is partly true, and there are also many cases where it just makes sense to have the big iron. For one thing when you are doing distributed stuff on clusters there's a whole lot more network stuff going on. And sun had it right, the network is the computer. If you can't talk to anybody else, you can't get work done. Meaning if your network is fubared, it doesn't matter how many boxes you have.

      So if you have dozens more switches and routers to maintain, not to mention more boxen, you may have to require a dedicated network engineer. And if you have to hire an extra person, then the big iron alternative gets real cheap real quick. And all that cicso stuff isn't that simple, that's why CCNE's are so popular to get these days.

      This is not to mention that for some tasks, such as some database setups, are better at big systems, like Sun's with their high throughput, than a bunch of networked intels.

      There is cost/benefit to every choice in IT, and the "throw more boxen at it" philosophy can get you in trouble real quick if that's all you ever consider.

      --
      Why, o why must the sky fall when I've learned to fly?
    2. Re:Sure, 10 years ago... by treat · · Score: 1
      This is a myth is see on slashdot alot.

      How can you talk about a myth you see on slashdot a lot without mentioning the mistaken impression that Sun servers have enough redunancy to handle a CPU failure without crashing?

  128. Re:Not to sound like your Mom [or Señor Ashcr by Anonymous Coward · · Score: 0

    Yes, but you'd be surprised how many mid-level datacenters rely on security through obscurity. I know a semi-major datacenter in downtown Unnamed where you can walk into a large local landmark, go up in an elevator, and be one locked door from millions of dollars of equipment. I also know other datacenters in the same unnamed city with armed guards, biometric handscanners, and ID checks.. choose your datacenter wisely!!

  129. obligatory "Clerks" reference by teridon · · Score: 1
    Dante: 37! My girlfriend sucked 37 dicks!

    Randall: In a row?

    --
    I hold it, that a little rebellion, now and then, is a good thing. -- Thomas Jefferson
  130. Wanna bet? by zeitgeist_chaser · · Score: 1

    Have a look at Rackable. Notice that Google is one of their customers. They offer up to 88 half-depth 1U servers in a rack and can even do DC power to cut down on heat generation from such a dense configuration. I'm sure Google is taking advantage of their custom configuration services, too.

    --
    While thinking philosophically, we see problems in places where there are none. -Wittgenstein
  131. Well, then by glpierce · · Score: 1

    Thank you.

    --
    G
  132. IP addresses? by Anonymous Coward · · Score: 0

    Do these machines all have real IP addresses, or do all but the front-end webservers have non-routable addresses?

    If nothing else, keeping track of 100K addresses is pretty amazing.

  133. Google search appliance? by pineapples10 · · Score: 1

    Is this one of their machines? Google Search Appliance I wonder if all of them have that fancy yellow paint job?

  134. Comment removed by account_deleted · · Score: 1

    Comment removed based on user account deletion

  135. That is assuming Google is using all 899 racks... by byrnereese · · Score: 1

    If I were Google, I would probably buy out space that I could move into later. But that is me.

    --

    ^byrne :/

  136. Duh! by Cinematique · · Score: 1

    The exact number of servers that Google uses is trivially easy...

    2,718,281,828

    ; )

  137. Dude by Anonymous Coward · · Score: 0

    You are SO full of horseshit. I checked out your little site and all I see is thinly veiled bigotry. The biggest problem is that a lot of dinosaurs here in the United States are trying to push things back to the 50s in terms of "values" and the economy. Hence the operations in Iraq instead of Afghanistan where they should have been. Face it. The United States had it's time in the sun during the 50s and now it's OVER. It's time for us to accept our place and rather being citizens of the United States, we must become citizens of Earth. I'll bet your grandfather knew more about being a citizen of Earth than you do. Fucking get over it it.

  138. A Mnemonic Device by justkarl · · Score: 1

    Remember: Language and grammar; or, Kelsey Grammer.

  139. Repeat... by vuo · · Score: 0

    "I pass by their cages every time I go to my cage there."

    Does anyone else find this funny?

  140. $3000 for a server?!?!?!?! by dumbfounder · · Score: 1

    is you crazy? that is a lot for a cheapo commodity server. I am starting up a search engine and I spend an average of about $300 for a server. Bang for the buck is what it's all about. Ok, even with the dual action going on, I can put together a 1u rack system with dual 2ghz xeons, 2gb RAM, and 2 160gb drives for under $1100. And I bet google can get stuff way cheaper than I can. fyi, that is using an Intel motherboard, samsung memory, and maxtor drives, so it isn't even the worst stuff you can buy.

  141. Shit - off topic rant by bezuwork's+friend · · Score: 1
    This caught my eye.

    FedEx can't just lose a few packages there and there.

    Yes they can. They did for me, anyway. When I went through their claim procedure it was denied. It took a lawsuit, a couple of court visits, and a year later to get my money.

    Apparently, they claim it was a communication problem - apparently there are 4 Fed Ex companies, and somehow, my complaint got to the wrong one. I don't know how that was my fault (as they insinuated), though, as they have only 1 claim reporting number.