When Does Powering Down Servers Make Sense?
snydeq writes "Powering down servers to conserve energy is a controversial practice that, if undertaken wisely, could greatly benefit IT in its quest to rein in energy costs in the datacenter. Though power cycling's long-term effects on server hardware may be mythical, its effects on IT and business operations are certainly real and often detrimental. Yet, development, staging, batch processing, failover — several server environments seem like prime candidates for routine power cycling to reduce datacenter energy consumption. Under what conditions and in what environments does powering down servers seem to make the most economic and operational sense, and what tips do folks have to offer to those considering making use of the practice?"
If you virtualized your servers, you could create a managed power-down/power-up scenario. In the morning, your servers would turn on, your virtualized instances would move around (so they have more power for the day's activities), and then at night they'd retreat to a smaller group of servers. The unused servers could shut down for the night. You could even rotate which servers stay on overnight keeping the virtual servers running to spread the wear around if there is some.
Seriously, this analogy makes sense. When a car is cruising on the high way it's able to maintain speed using 4/8 cylinders. Servers could be cycled in a similar fashion. Do you really need 20 web servers running at 3 am on a Sunday?
Why have 16 terminal servers (sorry, couldn't think of anything else) running when no more than 10-20 users are on it after working hours? Then in the morning, power them back on again using WakeOnLan.
And that backup server with a whole lot of disks? Why not only have it running during the night when stuff is being backed up?
Powering down your servers tends to introduce response issues. :-)
Some servers, like the HP ProLiant line, has power management features. Try experimenting with features like these first.
Uh I am the administrator of a server that archives all email for our company. We no longer use this solution for our email archiving, but according to federal regulations this email needs to be accessible for at least another 26 months. The only people who use the server anymore are the various alphabet soups of regulators who came in twice a year, maybe I'm the exception but not the rule but I can't see a reason to keep the server on...
Full disclosure, I work for Citrix. Check out XenServer, which can remotely provision server workloads to virtual and bare metal machines - based on load, you can remotely power up resources as needed. I have seen the future, and it is awesome. And green.
Atari, System V, C64, Amiga, College Unix (?, I wasn't like root or nothin'), Mac, WinNinetySighs, NT, Novell, Win2K, So
Your correct.
A lot of places I know shut completely down at night but leave the servers up and running. Often it is so they can run end of night jobs or just so they can get up and running quickly in the morning.
A lot of it is just waste and a lot of it is just habit.
Now for people that run 24/7? That is totaly up to you.
See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
The real problem with powering down servers is that you won't know there's a problem until you power them up again. The result is that the problem always occur when you need the servers (otherwise you wouldn't be turning them on). This instead of the problem mostly occurring when the servers are not in use or at least not all servers at the same time.
If you power up 1000 servers in approximately 15 minutes (once per day) and 10 don't power up, then you have 10 problems to solve asap. If you don't power up 1000 servers but they also fail approximately 10 per day then you don't have 10 non-working servers at the same time, but randomly distributed over the day. Meaning the problems don't queue up.
Example Setup The organization I work for has a well known usage patterns that we use to make decisions like this. 95% or more of our traffic occurs during business hours which we define as 7:00 AM - 7:00 PM. During business hours we have dedicated servers for various functions. We have a cluster of servers running virtual server instances that duplicate the dedicated servers. During off hours the dedicated servers are powered down and the virtual server instances take over. It works for us and we have seen a significant decrease in power usage with no impact on our users.
Analogy (this time, light bulbs).
- Light bulbs fail just as you turn them on. Or off.
- They hardly ever fail whilst switched on.
I think servers are the same. You're in trouble if a server you've had switched on for two years and forgotten about loses power and doesn't come back up. If it'd been switched off every weekend it would have failed earlier -- but probably at a more convenient time.
Light bulbs fail when they're turned on because there's a warmup spike as the electricity flows through the wire. This causes the tungsten to heat up more than it will when the circuit is complete, and break the circuit.
If your lightbulb fails it's because the filament has worn down - it's just that you usually find out about it when you turn the light bulb on or off.
Also, your analogy means nothing. Servers are nothing like light bulbs.