Slashdot Mirror


Cooling Challenges an Issue In Rackspace Outage

miller60 writes "If your data center's cooling system fails, how long do you have before your servers overheat? The shrinking window for recovery from a grid power outage appears to have been an issue in Monday night's downtime for some customers of Rackspace, which has historically been among the most reliable hosting providers. The company's Dallas data center lost power when a traffic accident damaged a nearby power transformer. There were difficulties getting the chillers fully back online (it's not clear if this was equipment issues or subsequent power bumps) and temperatures rose in the data center, forcing Rackspace to take customer servers offline to protect the equipment. A recent study found that a data center running at 5 kilowatts per server cabinet may experience a thermal shutdown in as little as three minutes during a power outage. The short recovery window from cooling outages has been a hot topic in discussions of data center energy efficiency. One strategy being actively debated is raising the temperature set point in the data center, which trims power bills but may create a less forgiving environment in a cooling outage."

55 of 294 comments (clear)

  1. This is number 3 by DuctTape · · Score: 5, Informative
    This is actually Rackspace's number 3 outage in the past couple days. My company was only (!) affected by outages 1 and 2. My boss would have had a fit if number 3 would have taken us down for the third time.

    Other publications have noted it was number 3, too.

    DT

    --
    Is this thing on? Hello?
  2. Which only shows by CaptainPatent · · Score: 2, Informative

    If you want 100% uptime, it's important to have back up power for the cooling as well as the server systems themselves.
     
    Is this really news?

    --
    Well, back to rejecting software patent applications.
    1. Re:Which only shows by jandrese · · Score: 4, Informative

      If you want 100% uptime (which is impossible, but you can put enough 9s in your reliability to be close enough), you need to have your data distributed across multiple data centers, geographically separate, and over provisioned enough that the loss of one data center won't cause the others to be overloaded. It's important to keep your geographical separation large because you never know when the entire eastern (or western) seaboard will experience complete power failure or when a major backhaul router will go down/have a line cut. Preferably each data center should get power from multiple sources if they can, and multiple POPs on the internet from each center is almost mandatory.

      --

      I read the internet for the articles.
    2. Re:Which only shows by lb746 · · Score: 3, Interesting

      I actually use a vent duct to suck in cold air from outside during the winter to help cool a server in my house. Originally I was more concerned with random object/bugs/leaves so I made it a closed system(like water cooling) to help protect the actual system. It works nicely, but only for about 1/3 or less of the year when the temperature is cold enough to make a difference. I've always wondered about a larger scale of something like this such as how the parent suggested servers in a colder/arctic region.

    3. Re:Which only shows by Ironsides · · Score: 2, Interesting

      Yes, actually. This was looked into by multiple companies during the late 90's. I'm not sure if any were ever built. I think one of the considerations as a byproduct was the savings of not having to run chillers with the cost of getting fibre and power laid to the facility.

      --
      Fly me to the moon Let me sing among those stars Let me see what spring is like On jupiter and mars
    4. Re:Which only shows by Azarael · · Score: 2, Interesting

      Some data centers also have multiple incoming power lines (which hopefully don't have a single transformer bottle-neck). Anyway, I know for sure that at least one data center in Toronto had 100% uptime during the big August 2004 Blackout, so it is possible to prevent these problems.

    5. Re:Which only shows by blhack · · Score: 3, Interesting

      I think the problem is availability of power. When you are talking about facilities that consume so much power that, when built, their proximity to a power station is taken into account, you can't just slap one down at the poles and call it good. I would imagine that lack of bandwidth is a MAJOR issue as well..... ...one field where I think storing servers at the poles would be amazing is super computing. Supercomputers don't require the massive ammounts of bandwidth that webservers etc do. You send a cluster a chunk of data for processing, it processes it, and it gets sent back. For really REALLY large datasets (government stuff)...just fill a jet with hard-disks and have it to the server center in a few hours.

      --
      NewslilySocial News. No lolcats allowed.
    6. Re:Which only shows by afidel · · Score: 2, Interesting

      It sounds like they DID have backup power for the cooling but that they switch over to backup power caused some problems. This isn't really all that unusual because cooling is basically never on UPS power so the transition to backup power may not go completely smoothly unless everything is setup correctly, tested, and there are no or little unusual circumstances during the switchover. I've seen even well designed systems have problems in the real world. One time we lost one leg of a triphase power system so the automatic transfer switch failed to flip over and startup the generator. The UPS realized it wasn't getting good power so it flipped over to battery power. Luckily the UPS sent out its notification and we were able to manually switch over to generator and get the cooling online, but there is almost no chance of it working in we only had 3 minutes to get things corrected.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    7. Re:Which only shows by NickCatal · · Score: 3, Interesting
      I can't stress this enough. When I talk to people about hosting and they rely on 100% availability they NEED to go with geographically diverse locations. Even if it is a single backup somewhere you have to have something.

      For example, Chicago's primary datacenter facility is in 350 E. Cermak (right next to McCormick Place) and the primary interconnect facility in that building is Equinix (which has the 5th and now 6th floors.) A year or so ago there was a major outage there (that mucked up a good amount of the internet in the midwest) when a power substation caught on fire and the Chicago Fire Department had to shut off power to the entire neighborhood. So the backup system started like it should, with the huge battery rooms powering everything (including the chillers) for a bit while the engineers started up the generators. Only thing is, the circuitry that controls the generators shorted out, so while the generators themselves were working, the UPS was working, the chillers were working, this one circuit board blew at the WRONG moment. And this isn't the only time this circuit has been used, they test the generators every few weeks.

      Long story short, once the UPSes started running out of power the chillers started going, lights flickered, and for a VERY SHORT period of time the chillers went out before all of the servers did. Within a minute or two it got well over 100 degrees in that datacenter. Thank god the power cut out as quick as it did.

      So yes, Equinix in that case did everything by the book. They had everything setup as you would set it up. It was no big deal. But something went wrong at the worst time for it to go wrong and all hell broke loose.

      It could be worse, your datacenter could be hit by a tornado

      --
      -nick
    8. Re:Which only shows by autocracy · · Score: 2, Informative

      Reading the article, they WERE on backup power. Emergency crews shut down the backup power to the chillers temporarily while they were working in the area, so the chillers had to start again. Cycling these big machines isn't instant.

      --
      SIG: HUP
    9. Re:Which only shows by spun · · Score: 2, Interesting

      Hmph. We have backup power for the cooling in our server room, but we had to deal with a fun little incident two weeks ago. Trane sent out a new HVAC monkey a month ago for routine maintenance. I was the one who let this doofus in, and let me tell you, he was a slack-jawed mouth-breathing yokel of tender years. He took one look at our equipment and said, I quote, "I ain't never seen nutin' like this'un before, hee-yuck!" I was a bit taken aback, but he seemed to go through all the proper motions.

      Fast forward to three weeks ago. The temp is fine, but the humidity keeps going down. I tell management, but this is a state agency and everything around here takes three times as long as it should. For a state agency, that's outstanding, by the way. Anyway, noting gets done. Then we find out WHY the humidity is going down: seems the HVAC monkey didn't screw in the water bottle all the way and the entire 5 ton fills up with water, until it shorts out at 4 pm on a Friday afternoon and dumps water everywhere.

      Well, we got our four emergency portable coolers in with little tubes leading out into the hall, the fans on, and the doors open right quick, but the temp still shot up to over 100 in under ten minutes. Well, I told hem something was up, and anyway, I'm on the VMware/BladeCenter server consolidation team, and this is just more of an argument to fund us better. But I guess the moral of the story is, don't let slack-jawed mouth-breathing yokels fix your mission critical systems.

      --
      - None can love freedom heartily, but good men; the rest love not freedom, but license. -- John Milton
    10. Re:Which only shows by afidel · · Score: 2, Interesting

      Ok, I specifically said UPS power, as in it takes time to spinup the generators and switching from one source to the other does not always go perfectly in the real world. One factor is minimum cycle time on the compressors. The 3 minute time frame was from TFA which says that at a density of 5KVA per cabinet thermal shutdown can happen in 3 minutes due to thermal load.

      Oh and as far as the one leg collapsing thing, yes we were VERY pissed at everyone involved in that little problem, it turns out it was a design flaw in the transfer switch. Because it happened during the day we ended up taking more of an outage for replacement of the switch then we did from the incident but it just proves that even a well designed system can have problems. That datacenter was small enough to only have single source power, my current datacenter has dual feed including dual generator and fully redundant cooling so a single transfer switch malfunction wouldn't take it down but you have to work within the parameters set by budget and need.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    11. Re:Which only shows by Critical+Facilities · · Score: 2, Interesting

      Although our servers are on uninterrupted power (same as the Air Con)

      I guarantee your HVAC systems are NOT on UPS power. If by some massive failure during construction and commissioning they were and it was missed, I'd recommend firing your entire engineering department and any development contractors involved with building and maintaining your facility. There is no reason to put HVAC systems (chillers, pumps, air handlers, CRACs) on UPS as they can all manage just fine with losing their power and restarting once power is restored (either from utility or generator). To subject your UPS system(s) to the massive inrush current that would occur when various HVAC component loads are thrust on it would be....well, stupid at best.

      Your power systems sound pretty consistent with what is in most Data Centers (the "Essential Power" is often referred to as Emergency Power in Data Center environments). 30 seconds is a pretty good turnaround time for generators to start up, although 15 seconds is better (and very attainable).

      So to answer your question, no, Data Centers do not have a "slacker" design than hospitals. They are actually quite similar in their requirements in terms of HVAC and of course power.
    12. Re:Which only shows by shoemakc · · Score: 2, Informative

      Well, it's clear by that statement that you have no idea of the infrastructure of a Data Center.


      Believe it or not, I've designed both, and while I certainly don't claim to be an expert on all the IT equipment, I've got a pretty good idea of the electrical systems that go into them.

      My description of the emergency branches was intentionally vague because their full definitions comprise some dozens of pages in NFPA 99. I assumed most people wouldn't care about that level of detail :-)

      Anyway, my point was that while a typical data center has 3 types of power available (Normal, Emergency and UPS), a typical hospital usually has at least 5:

      Normal
      Emergency (Life Safety)
      Emergency (Critical)
      Emergency (Equipment)
      Emergency (UPS)

      These generally include separate panels, feeders, automatic transfer switches...etc, so I still stand by my claim that hospitals have the more complex electrical system. Also consider that hospitals now contain increasingly critical data center facilities. Of course I will concede that the UPS topology of a large data center is generally far more complex then a hospitals....but again, that's just one part of the puzzle.

      "Starting" and closing to the Buss are 2 very different things. If you believe that large generators are starting and closing to the buss at full voltage and balanced frequencies in 3 seconds, I have a bridge that you may be interested in purchasing. To give you some perspective, our 2 generators for our Data Center (2 Megawatts each) start and close to the buss (and are assume the building load) in 15 seconds. We, of course, circulate the heated jacket water to keep the oil, cylinders, etc warm and ready as you described.


      I'll take that bridge. The reason your generators take 15 seconds to start is that they comprise a Level 2 system (as defined in NFPA 110), and not the Level 1 system that hospitals require. Level 1 includes a whole bunch of additional requirements (ie...expense) that are simply not required where the outage will not potentially risk human life, ie, datacenters. Now i'm not sure about all the modifications that manufacturers must make to their gen sets to meet these requirements, but I can assure you that that 10 second start (which includes startup, sync and bus connection) is required by code. Also, I've been there at the monthly test that hospitals are required to perform and yep...they really are that quick.

      Now again...it's not that your generators are bad...it's just that theirs no reason for a company to spend the extra cash on that sort of system when a longer startup time will do; typically the UPS is sized for 15 minutes of runtime and the HVAC equipment can go down for a few minutes without the room overheating.

      There are many components that you're probably unaware of and layers of redundancy that are invisible to those who do not work in the "back of house" Critical Environments. To reiterate, I'm not saying that hospitals aren't complex nor am I saying that they do not have Critical Environments within them. I'm simply saying that you may have a perception of what a Data Center is that is not necessarily consistent with what is actually the case.


      Similarly, I'm not claiming that hospitals are more complex overall systems....just that their electrical distribution systems typically are.

      -Chris
      --
      --an unbreakable toy is useful for breaking other toys--
  3. How to estimate the cooling needs? by Dynedain · · Score: 2, Interesting

    Actually this brings up an interesting point of discussion for me at least. Our office is doing a remodel and I'm specifying a small server room (finally!) and the contractors are asking what AC unit(s) we need. Is there a general rule for figuring out how many BTUs of cooling you need for a given wattage of power supplies?

    --
    I'm out of my mind right now, but feel free to leave a message.....
    1. Re:How to estimate the cooling needs? by CaptainPatent · · Score: 4, Informative

      Is there a general rule for figuring out how many BTUs of cooling you need for a given wattage of power supplies? I actually found a good article about this earlier on and it helped me purchase a window unit for a closet turned server-room. Hope that helps out a bit.
      --
      Well, back to rejecting software patent applications.
    2. Re:How to estimate the cooling needs? by Critical+Facilities · · Score: 2, Informative
    3. Re:How to estimate the cooling needs? by trolltalk.com · · Score: 4, Interesting

      Believe it or not, but in one of those "life coincidences", pi is a safe approximation. Take the number of watts your equipment, lighting, etc., use, multiply by pi, and that's the # of btus of cooling. Don't forget to include 100 watts per person for body heat.

      It'll be 90F degrees outside, and you'll be a cool 66F.

    4. Re:How to estimate the cooling needs? by Bandman · · Score: 2, Insightful

      I hate to pick, but I think you'll find rotund people actually have a lower surface to mass ratio than thin people.

    5. Re:How to estimate the cooling needs? by JUSTONEMORELATTE · · Score: 2, Funny

      Believe it or not, but in one of those "life coincidences", pi is a safe approximation. Take the number of watts your equipment, lighting, etc., use, multiply by pi, and that's the # of btus of cooling. Don't forget to include 100 watts per person for body heat.

      It'll be 90F degrees outside, and you'll be a cool 66F.
      And if that doesn't work, you can always tell your VP that you were taking your numbers from some guy named TrollTalk on ./
      I'm sure he'll understand.
    6. Re:How to estimate the cooling needs? by Anonymous Coward · · Score: 2, Funny

      > Don't forget that nerds and geeks often have much more surface area than
      > the standard professor, leading to higher heat loss into the cooler server room.

      Not to even think about the actual situation during power outtakes:

      When power is lost, all the geeks are going to be really excited and sweating (and heating) very much more than normally!

    7. Re:How to estimate the cooling needs? by Nf1nk · · Score: 2, Interesting

      Personal energy output is a function of a number of variables, but the most important, are the ambient temperature and the movement of air through the room. The 100 watts per person is a conservative estimate based on a roughly 75 F room.

      The Prof in a box experiment has a large issue that contributes to error. He is breathing with a tube, the heat exchange in your lungs is a convection exchange and has too large a magnitude to ignore. If you have doubts about how much heat flows out through breathing next time you are cold in bed pull the covers up over your head and breath under the covers. You will find that the bed gets nice and warm in a very short time.

      --
      I used to have a cool sig, back when I cared
    8. Re:How to estimate the cooling needs? by Anonymous Coward · · Score: 2, Informative

      Well, specifically, a ton, in HVAC terms, is the amount of energy one ton of ice can absorb. i.e. when refrigeration replaced ice, if you had one ton of ice delivered per day, a one ton cooler would fit the bill.

    9. Re:How to estimate the cooling needs? by R2.0 · · Score: 2, Informative

      The general rule I always follow is HIRE AN ENGINEER! Ferchrissake, there are people who do these calculations for a living, and have insurance in case the screw up. You want to trust your data center to advice from slashdot and a back-of-the-envelope calculation?

      Sheesh - what's the name of your company so I can sell short?

      --
      "As God is my witness, I thought turkeys could fly." A. Carlson
    10. Re:How to estimate the cooling needs? by zippthorne · · Score: 2

      Recommended diet is typically between 1500 kcal and 2500 kcal depending on body type and other factors. It is, I believe, a measure of how much energy you would be able to extract from the food, rather than what you'd be able to obtain by burning it directly.

      Anyway, if you use an average of 2000 kcal, whether that goes into heating or moving around, a control volume around yourself will experience the same thing: 2000 kcal of waste heat generated over the course of a day. Everything turns into waste heat, eventually. The chemical energy you are unable to extract from the food is irrelevant: it will be passed and remains available for something to extract in your refuse.

      Further assuming that your usage of energy is mostly constant (i.e. that your physical exertions are << the things you keep doing to stay alive (generate heat, pump blood, etc)) you would assume that a person would, on average, be a 100 W heat source.

      So, what does this mean for the professor's experiment? Well, obviously practical experience trumps theory, so it has to be reconciled with the 2000 kcal estimate on diet. Is the professor eating enough? He looks healthy. Perhaps he's not moving enough. Is the air-displaced relevant? I'd think correcting for it (they cleverly used a capacitance method) would reduce he estimates of the professor's output. Perhaps estimates of energy conversion are incorrect and need to be corrected.

      The only glaring flaw I can see is that the breathing tube does not appear to have any temperature probes or flow meter. This is important because, due to the huge area of the lungs, every breath will reach equilibrium temperature inside the professor, so there is the question of how much is lost in that manner.

      I've always heard ~50W, as well. Though the food-calc suggests otherwise. 50-100W isn't really a bad estimate though, since you'd only be using it to make order of magnitude calculations, anyway.

      --
      Can you be Even More Awesome?!
    11. Re:How to estimate the cooling needs? by trolltalk.com · · Score: 2, Interesting

      Think for 2 secs ... each kw of electricity eventually gets converted to heat. Resistive heating generates ~ 3,400 btus per kilowatt, so multiplying electrical consumption by pi gives you a decent cooling capacity. Add an extra 10% and you're good to go (you *DO* remember to add in a fudge factor of between 10 and 20% for "future expansion", right?)

  4. And the answer is: Liquid Nitrogen by Bombula · · Score: 2, Informative

    Liquid nitrogen is the cooling answer, for sure. Then you're not dependent upon power of any kind at all. The nitrogen dissipates as it warms, just like how a pool stays cool on a hot day by 'sweating' through evaportation, and you just top up the tanks when you run low. It's cheap and it's simple. That's why critical cold storage applications like those in the biomedical industry don't use 'chillers' or refrigerators or anything like that. If you really want to put something on ice and keep it cold, you use liquid nitrogen.

    --
    A-Bomb
    1. Re:And the answer is: Liquid Nitrogen by DerekLyons · · Score: 2, Informative

      Liquid nitrogen is the cooling answer, for sure. Then you're not dependent upon power of any kind at all.

      Except of course the power needed to create the LN2.
       
       

      That's why critical cold storage applications like those in the biomedical industry don't use 'chillers' or refrigerators or anything like that. If you really want to put something on ice and keep it cold, you use liquid nitrogen.

      As above - how do you think they prevent the LN2 from evaporating? The LN2 is a buffer against loss of power, but typically they have a pretty serious cryocooler to keep the LN2 there when they do have power.
    2. Re:And the answer is: Liquid Nitrogen by MenTaLguY · · Score: 2, Informative

      Even oxygen levels elevated to as little as 23% oxygen can lead to a violent increase in the flammability of materials like cloth and hair. Controlling gas concentrations so they remain at safe levels can be very tricky.

      Setting aside evaporation, be careful not to get it on anything. LOX can easily saturate anything remotely porus and oxidisable, effectively turning it into an unstable explosive until the LOX evaporates... at LOX or LN temperatures, that can even become an issue with oxygen condensing from the air onto your equipment/insulation. Forget just avoiding the creation of sparks -- better be sure that the safety measures have been successful in eliminating all LOX-incompatible materials and be careful not to bump anything too hard!

      And of course, even a tiny fire or explosion can easily lead to a rapid boiloff. Sudden boiloff can be an issue simply because of drastically increased pressure and still-cold temperature. Liquified gasses like LOX, NOX, etc. expand a LOT when they boil (about an 600-800x increase in volume, simply transitioning from a liquid to a gas), even while remaining dangerously cold. Imagine being in a closed room with a punctured dewar. Assuming you've escaped being hit by the dewar which has gone flying like a deflating balloon with reinforced-concrete-shattering force, you've potentially got ruptured eardrums and possibly internal injuries due to the abrupt pressure change which has also jammed the door. You fall to the floor from the pain of burns on your lower body from the ultracold gas which has quickly filled the lower part of the room -- which then starts to burn your face and lungs out too as you start breathing it.

      Hopefully the facility you're in has proper emergency ventilation measures, adequate room size, properly constructed doors, and protective equipment to avoid this scenario, but you still don't want to be in the room if it happens if you can help it... Cryogenic gasses are seriously dangerous. Don't underestimate them or treat them lightly.

      --

      DNA just wants to be free...
  5. New cooling strategy needed? by MROD · · Score: 5, Interesting

    I've never understood why data centre designers haven't used a different cooling strategy to re-circulated cooled air. After all, for much of the temperate latitudes for much of the year the external ambient temperature is at or below that needed for the data centre so why not use conditioned external air to cool the equipment and then exhaust it (possibly with a heat exchanger to recover the heat for other uses such as geothermal storage and use in winter)? (Oh, and have the air-flow fans on the UPS.)

    The advantage of this is that even in the worst case scenario where the chillers fail totally during mid-summer there is no run-away, closed loop, self re-enforcing heat cycle, the data centre temperature will rise but it would do so more slowly and the maximum equilibrium temperature will be far lower (and dependant upon the external ambient temperature).

    In fact, as part of the design for the cluster room in our new building I've specified such a system, though due to the maximum size of the ducting space available we can only use this for half the heat load.

    --

    Agrajag: "Oh no, not again!"
    1. Re:New cooling strategy needed? by afidel · · Score: 3, Informative

      The problem is humidity, a big part of what an AC system does is maintain humidity in an acceptable range. If you were going to try to do once through with outside air you'd spend MORE power during a significant percent of the year in most climates trying to either humidify or dehumidify the incoming air.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    2. Re:New cooling strategy needed? by cjanota · · Score: 2, Insightful

      Where do you think current AC units dump all the heat that they extract? What the GP is suggesting just cuts out the middle man (AC). The AC units produce quite a bit if heat themselves.

      --
      You can fix anything with duct tape and sticks.
    3. Re:New cooling strategy needed? by R2.0 · · Score: 2, Insightful

      Part of the problem is that it is a lot easier to move heat via liquid than air. the conventional design uses chillers mounted outside the space to cool a liquid medium/refrigerant, which is then pumped very efficiently to cooling coils in the space (modify for DX coils). the air inside the condityioned space makes a very short trip through the servers, across the room, over the coil, and back out again.

      Under your scenario, the AIR is the working medium - it is cooled on the outside, and then moved inside via relatively inefficient fans. And it is a SHITLOAD of air - that means either high volumes (huge ductwork) or high velocity (how do you like working in a wind tunnel?).

      Fans on UPS? Are you kidding? How big do you want your UPS to be? Fans suck a LOT of power, especially when you have them doing what you propose.

      "as part of the design for the cluster room in our new building I've specified such a system"

      You've specified? From your post, it's obvious you aren't an HVAC engineer, so what are your qualifications? Did you do an analysis to see what the real ROI is? Or is it just so obvious to you why years of HVAC design are totally wrong?

      --
      "As God is my witness, I thought turkeys could fly." A. Carlson
    4. Re:New cooling strategy needed? by R2.0 · · Score: 2, Informative

      What you and the OP are describing is called "free cooling", a long established principle in HVAC design. It is used in commercial and industrial buildings all the time. The reason that it is not used in residential all that much is that

      1) until relatively recently, houses "breathed" quite well on their own due to loose construction. With tightening energy codes and the use of Tyvek and better windows, houses don't have a lot of air exchange through the boundaries, and problems ensue - "stuffiness", moisture, mold, "sick building". Residential construction hasn't thought this through yet - there are some builders who now refuse to use Tyvek due to ventilation (and liability) issues.

      2) Controls become an order of magnitude more complicated. Most residential systems are "bang bang" systems - it's on or off based on 1 criteria. To introduce free cooling, you need outside air sensors, dampers, actuators, and a controller a lot more complex than a home t-stat. For most ersidential builders, that's a couple thousand in extra costs that can't be recouped in sale price - most owners just don't care, and when you are building 5000 of the same unit, "most owners" rule.

      As for dehumidification, you have it backwards - dehumidification typically required a COLDER coil than necessary for cooling alone, and then you reheat the air. It is horribly inefficient, but sometimes necessary - with a "tight" building, you have to get the moisture out somehow, and supercooling the air inside just isn't a good idea (other than making for lots of erect nipples, that is)

      Finally, what makes sense for one situation may not for another - a data center uses orders of magnitude more cooling than a house or common office building. Moving the amount of air necessary to provide that cooling gets really hard - the amount of energy a fan requires increase with the CUBE of the flow required. So to get twice the airflow you use 8x the power. It's the same with pumps, but because the heat capacity of water or glycol is so much greater than that of air, the effects are minimized.

      --
      "As God is my witness, I thought turkeys could fly." A. Carlson
    5. Re:New cooling strategy needed? by R2.0 · · Score: 2, Informative

      Are you stupid? A heat exchanger (btw, not "very common" at all in residential) is the OPPOSITE of free cooling! In free cooling, COLD outside air is brought directly into the space, bypassing the cooling coils. Why? Because the air is cool already.

      With a heat exchanger, you are bringing cool air in, and then HEATING IT UP with the waste heat from the exhaust air. Great for saving energy in a residence, when one wants to stay toasty warm - not so great in a data center or office building when there is still a cooling load in winter. So absolutely nothing you said has anything to do with free cooling. Heat exchangers are great for what they do, but free cooling isn't it.

      "That's also untrue. The direct, free-flowing heat exchange between hot and cold coils allows dehumidifiers to be much more energy efficient, using typically around 1/3rd as much power for the same volume of air."

      True - IF you are using a heat exchanger. But if one is not - lets say, in an office building on a cool spring morning - then you have a problem. You bring in nice 65F air, at 65-70% RH - it's wet. You don't heat it up through a HX, because you need the 65F air to maintain temp setpoint. But now you are dumping a lot of water into the space, and it doesn't *feel* cool. So, you run your cooling coil at, say 50F discharge temp. That is below dewpoint, and it pulls moisture out of the air. But now you are dumping 50F air into the space, so the space temp gets driven down, and you get the nipple effect. So what do you do? REHEAT the air to 65F. Which, BTW, is exactly what home humidifiers do - the discharge air is reheated to a temp greater than the intake air, reflecting the energy added by the electricity. TANSTAAFL.

      You can throw a HX in that equation, but it certainly isn't a dumb device - the control logic needs to know when to open the air dampers and close them, so as not to interfere with free cooling.

      "As to coil temperature, obviously any temperature will work, to varying degrees of effectiveness. You'll need to provide some numbers to back up your claim. General-purpose dehumidifiers are usually just slightly modified AC units"

      Bullshit. The coil temperature MUST be less than the dew point of the air, by the very definition of "dew point". Practically, it needs to be substantially less for the dehumidification to really work. Often, that temp is less than desired for discharge air temp. See above example.

      Call me when you've bought a psychrometric chart and a ductulator. There are plenty of design decisions to be made when designing an HVAC system, unfortunately including appeasing owners who think they are design geniuses.

      --
      "As God is my witness, I thought turkeys could fly." A. Carlson
  6. Damn dihydrogen monoxide by wsanders · · Score: 2, Funny

    They should ban that stuff. (dhmo.org)

    --
    Give a man a fish and you have fed him for today. Teach a man to fish, and he'll say "WHERE'S MY FISH, YOU IDIOT?"
  7. Ironic advertisement by davidwr · · Score: 2, Funny

    Ah, the dangers of context-sensitive advertising.

    Ad on the main page when this article was at the top of the list.

    Does "50% off setup" mean you'll only be set up halfway before they run out of A/C?

    --
    Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
  8. Funny you mention this by Leebert · · Score: 4, Interesting

    A few weeks ago the A/C dropped out in one of our computer rooms. I like the resulting graph: http://leebert.org/tmp/SCADA_S100_10-3-07.JPG

  9. Physics by DFDumont · · Score: 3, Informative

    For those of you who either didn't take Physics, or slept through it, Watts and BTU's/hr are both measurements of POWER. Add up all the (input) wattages, and use something like http://www.onlineconversion.com/power.htm/ to convert. This site also has a conversion to 'tons of refrigeration' on that same page.
    Also note - Don't EVER user the rated wattage of a power supply because that's what it SUPPLIES, not uses. Instead use the current draw multiplied by the voltage (US - 110 for single phase, 208 for dual phase in must commercial blgs, 220 only in homes or where you know thats the case). This is the 'VA' [Volt-Amps] unit. Use this number for 'watts' in the conversion to refrigeration needs.
    Just FYI - a watt is defined as 'the power developed in a circuit by a current of one ampere flowing through a potential difference of one volt." see http://www.siliconvalleypower.com/info/?doc=glossary/, i.e. 1W = 1VA. The dirty little secret about power calculations is that there is another factor thrown in, typically about 0.65, called the 'power factor' that UPS and power supply manufacturers use to lower the overall wattage. That's why you always use VA (rather than the reported wattage) because in a pinch you can always measure both voltage and amperage(under load).
    Basically do this - take all the amperage draws for all the devices in your rack/room/data center, multiply them by the applied voltage for that device (110 or 208) and add all the products together. Then convert that number to tons of refrigeration. This is your minimum required cooling for a lights out room. If you have people in the room, count 1100 BTU's/hr for each person and add that to the requirements (after conversion to whatever unit you're working with). Some HVAC contractors want specifications in BTU's/hr and other want it in tons. Don't forget lighting either if its not a 'lights out' operation. A 40W florescent bulb means its going to dissipate 40W (as in heat). You can use these numbers directly as they are a measure of the actual heat thrown, not of the power used to light the bulb.
    Make sense?

    Dennis Dumont

    1. Re:Physics by timster · · Score: 2, Informative

      The dirty little secret about power calculations is that there is another factor thrown in, typically about 0.65, called the 'power factor' that UPS and power supply manufacturers use to lower the overall wattage.

      It's not "thrown in" by the manufacturers. The dirty little secret is simply that you are talking about AC circuits. 1W = 1VA in AC circuits only if the volts and the amps are in phase -- which they aren't.

      Take a sine wave -- in AC, that's what your voltage looks like, always changing. If you're powering something purely resistive like an incandescent bulb, your amps follow the same sine wave and 1W=1VA. But inductive loads like power supplies introduce a lag in the current, so that the amps aren't in phase with the volts. As a result, you cannot naively multiply the RMS volts by the RMS amps to get the average wattage -- you have to take the integral of volts times amps through the curve. And for part of that curve, the voltage and the current flow in different directions, which represents negative power (that is, the inductive circuitry is pushing current back across the wire). As a result of this the overall power will always be less than the volt-amps.

      --
      I have seen the future, and it is inconvenient.
    2. Re:Physics by EmagGeek · · Score: 2, Insightful

      First, 1 Watt is the movement of energy at the rate of 1 Joule per second, and need not be electrically related at all. A watt is energy per unit time.

      Second, power factor is irrelevant to cooling calculations because reactive power does not generate heat, even though it does generate imaginary current in the generating device. This is why power companies bill industrial power based on VAH and not on KWH.

      Generators are rated for the magnitude of their output current, not just the real component of it. This is also why power companies try their best to load all three phases equally - because in that case the net instantaneous current out of the generator is zero and the physical forces on the windings and stators is constant and uniform.

      Also, most server-class power supplies have power factor correction which adjusts the power factor to 1 by adding shunt capacitance to the input of the supply.

      The major point that most people seem to be missing in this dialogue is that a 500W PC power supply does not draw 500W from the wall by simply being plugged in. A PC power supply will deliver only what power is needed by the devices connected to it. For example, my server at home is an X2-4800 with 8 hard disks in it, 4 cooling fans, and a 600W power supply. The total power draw on the server box, two UPS units, the 24 port ethernet switch, the router, the cable modem, and the overhead light, is 262W at CPU idle. Just because it has redundant 500W supplies doesn't mean it's going to draw 1000W just sitting there. I have not measured the power with both CPUs at 100%.

    3. Re:Physics by hobbesmaster · · Score: 2

      Just FYI - a watt is defined as 'the power developed in a circuit by a current of one ampere flowing through a potential difference of one volt." see http://www.siliconvalleypower.com/info/?doc=glossary/, i.e. 1W = 1VA. The dirty little secret about power calculations is that there is another factor thrown in, typically about 0.65, called the 'power factor' that UPS and power supply manufacturers use to lower the overall wattage. That's why you always use VA (rather than the reported wattage) because in a pinch you can always measure both voltage and amperage(under load).


      If you don't want to learn anything about AC power, please don't call it a "dirty little secret". Wikipedia for example would be a good place to start.

      I'll summarize for you though:
      Real Power (P) - has units of W. This is the amount of power dissipated into the "real" (resistive) part of a complex load.
      Reactive Power (Q) - has units of VAR (volt-amp reactive). This is the amount of consumed by the reactive (capacitive/inductive) part of a complex load.
      Apparent power (|S|, S = P+jQ) - has units of VA (volt-amp). If you plot P and Q on the complex axis, you this is what completes the triangle.
      Power factor - ratio between real and apparent power. (P/|S|) For purely resistive loads, this is 1. For purely reactive loads, this is 0.

      A UPS is basically just a battery with a bit of supporting circuitry. The battery can only deliver a certain amount of energy per time (some type of power). If this energy is all going to a resistive load, then yes, it would be what you think it is, Watts. This is also the apparent power the UPS can deliver in VA (P=|S|, PF=1). If the load is purely reactive, the apparent power will remain the same in VA, however the real power will be 0. For this reason, UPSs are rated in VA. They can make no guarantee about the amount of real power (W) they can deliver, as it will depend on the load.

      Hopefully that makes sense and I haven't confused things further. (power EEs may have something to add/correct, I'm more of an electronics one going off memory)
  10. Short-cycling protection by Animats · · Score: 5, Interesting

    Most large refrigeration compressors have "short-cycling protection". The compressor motor is overloaded during startup, and needs time to cool. So there's a timer that limits the time between two compressor starts. 4 minutes is a typical delay for a large unit. If you don't have this delay, compressor motors burn out.

    Some fancy short-cycling protection timers have backup power, so the the "start to start" time is measured even through power failures. But that's rare. Here's a typical short-cycling timer. For the ones that don't, like that one, a power failure restarts the timer, so you have to wait out the timer after a power glitch.

    The timers with backup power, or even the old style ones with a motor and cam-operated switch, allow a quick restart after a power failure if the compressor was already running. Once. If there's a second power failure, the compressor has to wait out the time delay.

    So it's important to ensure that a data center's chillers have time delay units that measure true start-to-start time, or you take a cooling outage of several minutes on any short power drop. And, after a power failure and transfer to emergency generators, don't go back to commercial power until enough time has elapsed for the short-cycling protection timers to time out. This last appears to be where Rackspace failed.

    Dealing with sequential power failures is tough. That's what took down that big data center in SF a few months ago.

  11. Highlights Serious Flaw - Neglecting Outside by Ron+Bennett · · Score: 3, Insightful

    While many here are discussing UPSes, chillers, set-points, etc the most serious flaw is being glossed over ... the lack of redundency outside the data center, such as multiple, diverse power lines coming in...

    From the articles, it appears that Rackspace datacenter doesn't have multiple power lines coming in and/or many come in via one feed point.

    How else is it that a car crash quite some distance from the datacenter can cause such disruption. Does anyone even plan for such events - I get the feeling most planners don't, since I've seen first-hand many power failures occur in places where one would expect more redundency from dumb things like a vehicle hitting a utility pole, etc.

    Ron

    1. Re:Highlights Serious Flaw - Neglecting Outside by PPH · · Score: 2, Informative
      You have to pay for redundant feeds from the local utility company. And they aren't cheap. If you don't select a location on the boundary of two independent distribution circuits, the two feeds are worthless.

      I live near a hospital which is located on the boundary between two distribution circuits, each fed from a different substation. That redundancy cost the hospital tens or hundreds of thousands of dollars. But the two substations are fed from the same transmission loop, which runs through the woods (lots of trees and on inaccessible rights-of-ways), so the most probable fault will take both stations, circuits, and sources to the hospital off line.

      The moral of the story: Don't depend on an outside organization (the local utility) for service when its your neck on the line and not theirs.

      --
      Have gnu, will travel.
  12. Maxwells data center by techpawn · · Score: 2, Funny

    We've summoned a small demon to let in cool air particles and shunt out hot ones. Sure the weekly sacrifice gets to be a pain after a while, but there's always a pool of willing interns right?

    --
    Ask not what you can do for your country. Ask what your country did to you
  13. computers convert 100% electricity to heat by mwilliamson · · Score: 2, Insightful

    Every single watt consumed by a computer is turned into heat, and generally released out the back of the case. Computers behave the same as the coil of nichrome wire as is used in a laundromat clothes dryer. (I guess a few milliwatts gets out of your cold room via ethernet cables and photons on fiber)

  14. Re:Why run data centres in hot states? by arth1 · · Score: 4, Interesting

    (Disregarding your blatant karma whoring by replying to the top post while changing the subject)

    There's several good reasons why the servers are located where they are, and not, say, in Alaska.
    The main one is light speed through fiber, and a cable from Houston to Fairbanks would induce a best case of around 28 ms latency, each way. Multiply by several billion packets.

    This is why hosting near the customer is considered a Good Thing, and why companies like Akamai have made it their business of transparently re-routing clients to the closest server.

    Back to cooling. A few years ago, I worked for a telephone company, and the local data centre there had a 15 degree C ambient baseline temperature. We had to wear sweaters if working for any length of time in the server hall, but had a secure normal temperature room outside the server hall, with console switches and a couple of ttys for configuration.
    The main reason why the temperature was kept so low was to be on the safe side -- even if a fan should burn out in one of the cabinets, opening the cabinet doors would provide adequate (albeit not good) cooling until it could be repaired, without (and this is the important part) taking anything down.
    A secondary reason was that the backup power generators were, for security reasons, inside the server hall themselves, and during a power outage these would add substantial heat to the equation.

  15. 5kw? ow. by MattW · · Score: 2, Insightful

    5 kilowatts is a heck of a lot to have on a single rack - assuming you're actually utilizing that. I recently interviewed a half dozen data centers to plan a 20-odd server deployment, and we ended up using 2 cabinets in order to ensure our heat dissipation was sufficient. Since data centers are usually supplying 20 amp, 110 or 120v power, you get 2200-2400 watts available per drop; although it's considered a bad idea to draw more than 15 amps per circuit. We have redundant power supplies in everything, so we keep ourselves at 37.5% of capacity on the drops, and each device is fed from a 20amp drop coming from a distinct data center pdu. That way even if one if the data center pdus implodes, we're still up and at 75%- capacity.

    Almost no data center we spoke to would commit to cooling more than 4800 watts of power at an absolute maximum per rack, and those were facilities with hot/cool row setups to maximize airflow. But that meant they didn't want to drop more than 2x20amp power drops, plus 2x20 for backup, if you agreed to maintain 50% utilization across all 4 drops. But since you'd really want to maintain 75%- even in the case of failure, you'd only be using 3600watts. (In the facility we ended up in, we have a total of 6 20 amp drops, and we only actually utilize ~4700 watts.

    Ultimately, though, the important thing is that cooling systems should be on generator/battery backup power. Otherwise, as this notes, your battery backup won't be useful.

  16. Re:Why run data centres in hot states? by arth1 · · Score: 2, Informative

    While thinking outside the box is all well and fine, it's even better when combined with Common Knowledge. Like knowing that caves and mines (a) tend to be rather warm when deep enough, and (b) have a fixed amount of air.

    As for the power efficiency of pumping air from several hundred meters away compared to pumping it through the grille of an AC unit, well, there's a reason why skyscrapers these days have multiple central air facilities instead of just one: Economics.

    I'd like to see you pump air for any long distance with your exercise bike :-)

  17. Re:Datacenter cooling should be on generator/ups by Skapare · · Score: 2, Insightful

    A large data center should not have one big massive UPS anyway. It should all be divided out into various load sections, each with its own UPS+battery system. Once you do that, then you can have cooling on its own UPS without any risk of the cooling system impacting the UPS feeding the computers ... if you really want cooling on UPS (it can be done, but generally is not the best way). Surely you would have the cooling on it's own three phase circuits.

    Perhaps a better approach is a smart cooling system that rotates the starting of compressors on various units so you always have some number of units running and some number not running, at the ratio needed for the current thermal demands. Then where there is an outage that has to go to generators, only a limit number of units will have been recently started just before the outage and need to be thermally protected. The controller skips those and starts the idle units (unless you are already maxxed out in which case you'd have no idle units). But you will need to have the cooling on the generators.

    If you are going to have a backup distribution circuit from the utility, it should be physically separate from the primary circuit so that it is not necessary to shut down both to deal with things like a traffic accident.

    --
    now we need to go OSS in diesel cars
  18. Power + Heat + Data Centers: a tough problem by markjl · · Score: 2, Interesting

    Disclaimer: I work with SGI, so I can shed some light on their customer's perspective (NASA, gov't, research labs, etc.) and solution to this problem.

    The increasing density of servers is exacerbating the problem of power and cooling in every data center. This week is the SuperComputing trade show where the the new top 500 supercomputers edition was released with "Big Turnover Among the Top 10 Systems," where you can see the first examples to address these issues.

    SGI's new ICE blade system was launched a few months ago, it was designed to address the power consumption, real estate density, and cooling issues everyone will probably experience on their next server cycle. ICE has shipped and one installation is now #3 on the Top 500. It's a welcome sign that SGI is back from bankruptcy. I'm sorry if this seems like an advert, so I'm not going to link to SGI -- you can go find out more easily if you want.

    --
    My opinions are my own, but you may share them!
  19. Re:Datacenter cooling should be on generator/ups by Critical+Facilities · · Score: 2, Informative

    I agree with almost all of your post with the only exception being the cooling systems on UPS. There is absolutely no reason to put cooling systems on UPS power. Large, inductive loads are a UPS's enemy. A big inrush current of a chiller starting up would beat the crap out of your battery string(s).

    Having said that, you are exactly right on having both your UPS system(s) and your cooling system(s) diversified. I tend to get into this argument with people regarding what constitutes a "data center" and one of the most significant parts of determining what actually constitutes a "data center" is redundancy. This means not just redundant utility power feeds, but redundant UPS systems/modules, redundant generators, redundant chillers/CRACs, redundant PDU's, etc etc.

    For our cooling systems, we have 4 Chillers (we only need 2) and 20 CRACs (we only need 10. Any problems with any system can be mitigated by rolling to the redundant system.

  20. Re:Why run data centres in hot states? by Spazmania · · Score: 2, Informative

    the local data centre there had a 15 degree C ambient baseline

    Well that's just incompetent. For one thing, commercial electronics experience increased failure as you move away from an ambient 70 degrees F regardless of which direction you move. Running them at 59 degrees F (15 C) is just as likely to induce intermittent failures as running it at 80 degrees F.

    For another, you're supposed to design your cooling system to accommodate all of the planned heat load in the environment. If your generators will be adding heat then the A/C needs to have sufficient capacity to take that heat back out.

    And anyway, your generators shouldn't be adding heat. They should be walled off from the data center with exterior air exchange. Otherwise an error in the exhaust ducting risks killing your operators with CO poisoning.

    --
    Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.
  21. Re:Why run data centres in hot states? by RockDoctor · · Score: 2, Interesting

    the local data centre there had a 15 degree C ambient baseline
    Well that's just incompetent. For one thing, commercial electronics experience increased failure as you move away from an ambient 70 degrees F regardless of which direction you move. Running them at 59 degrees F (15 C) is just as likely to induce intermittent failures as running it at 80 degrees F.
    I was considering asking why the GP poster was bothering with a sweater when working (as opposed to sleeping) in his server room at 15centigrade, but decided that he must just be one of those people who can't stand normal temperatures. But electrical engineers know that a lot of their equipment is going to be used in "ambient" conditions which are not the "ambient" of their climate-controlled office. In my work, for example, 20C would be an abnormally hot temperature for our sensor equipment ; -20C would be by no means unknown; -50C quite credible. On the other hand, some of our analytic equipment has to run for months between service visits at +50C in 90%+ condensing humidity and with forced ventilation carrying salt dust and oil spray. You design your equipment for the conditions that it's going to face, not the conditions in your office today.
    Additionally, you appear to be conflating the air temperature in the data centre (15C) with the temperature of the components. Since having a heat flux requires having a thermal gradient, then the components will be warmer than your heat sink.
    In this town, we can tell the nationality of the boss of any office instantly on walking in - European bosses keep the HVAC (heating ventilation air-conditioning, or climate control) set to about 20C ; American bosses have it re-set to 25C (until over-ruled for wasting money).

    For another, you're supposed to design your cooling system to accommodate all of the planned heat load in the environment. If your generators will be adding heat then the A/C needs to have sufficient capacity to take that heat back out.
    There's an Indian HVAC company (in Abu Dhabi), and a instrumentation engineer (last heard of in Houston, America) who need to be taught this lesson. Again. If you meet them, please apply the clue-bat before agreeing to take the equipment they design out to the Empty Quarter to rig it up.

    [your generators] should be walled off from the data center with exterior air exchange. Otherwise an error in the exhaust ducting risks killing your operators with CO poisoning.
    Your carbon dioxide flood for fire suppression would be as effectively lethal. Operators would need to be kept out of the controlled zone while enclosed generators are running; the fire suppression system should be overridden while operators are in the controlled zone, or you need to be rigged up with cascade air supplies and work-pack SCBA while working in the control zone. This isn't rocket science - there are plenty of corpses that point the way to proper management of work in potentially lethal atmospheres. (Of course, there are plenty of work places that like to cut corners and put their workers at risk. Don't work there and do report them to the relevant authorities.)
    --
    Birds are not dinosaur descendants;birds are dinosaurs, for all useful meanings of "birds", "are" and "dinosaurs"