Power Problems Force Seattle To Throttle City Data Center For Days

Hey, so let's post it to Slashdot! by Anonymous Coward · 2012-08-24 12:22 · Score: 4, Funny

That should help the situation.

Re:Hey, so let's post it to Slashdot! by Artaxs · 2012-08-25 03:42 · Score: 1

Article doesn't mention the power draw from PAX and Guild Wars 2 launching on the same weekend?

--
Militant Agnostic: "I don't know, and damn it, neither do you!"

Looking to move to Seattle... by Dripdry · 2012-08-24 12:27 · Score: 1

While looking at the prospect of moving to Seattle, I've repeatedly read that the city is in political gridlock and seems totally unable to get any meaningful long-term infrastructure additions put in place despite wide support for them. It seems to me this is the case in many cities out there, but can anyone say what it's like in their city?
I suspect that the first ones to finally get something useful done (rather than just repaving a few highways) will be the ones to reap a lot of growth when this recession finally begins to fade away.

--
-

Re:Looking to move to Seattle... by 93+Escort+Wagon · 2012-08-24 13:56 · Score: 3, Informative

I live south of Seattle, and work in the city.
Any political gridlock is largely because current Mayor McGinn is a joke. Seattle is a fairly liberal city, but McGinn was largely seen as too extremely left-wing to be electable even there; so he remade himself into a pragmatist - a change that lasted until he was sworn in. McGinn made specific promises pre-election that he wouldn't let his personal ideology affect policies where the citizenry clearly differed from him... then he turned around and spent most of his time fighting ideological political battles, ignoring real problems while devoting 100% of his time tilting at his personal windmills.

--
#DeleteChrome
Re:Looking to move to Seattle... by Anonymous Coward · 2012-08-24 14:10 · Score: 1

That's probably the only significant thing that's wrong with the city. There's tons of activists and there's a lot of good folks that believe they're entitled to get their way every time. But, in all honesty, it tends to get sorted out and it really doesn't have as much of a negative impact as you might think. For the most part there isn't a whole lot that really needs fixing that isn't already addressed.
Re:Looking to move to Seattle... by Anonymous Coward · 2012-08-24 14:29 · Score: 1

It's definitely gotten worse under the current mayor, but we've had gridlock before. We're just now breaking ground on the viaduct replacement, over 20 years and one earthquake after we learned that the engineering design was flawed. But the current mayor is going to be a one-term mayor; he's been so bad that I actually deeply regret not having voted for Nickels in the primary that year. I remember hearing the results the next day and having an "oh shit" moment when I realized that Nickels wasn't even an option as he placed 3rd.
Re:Looking to move to Seattle... by Anonymous Coward · 2012-08-24 15:41 · Score: 1

We should be so lucky. Political gridlock would be wonderful. However, that is not the case. We have very active political drivers in this city.
I used to only understand such issues theoretically, but I've seen their application in plain use all around me in this city. One of my roommates is an ex-architect who is getting deep in the politics of the council's plans for the Capitol Hill area. It is a constant fight between city planners selling off blocks of space to the highest briber (high-rise developers who will opt for buildings that fit into higher tax code levels as an additional perk for our bureaucrats) and the people that actually live in the area. No thought is given to the supposed property owners at all (who effectively only have a user license for what is theirs), unless they have some political pull. We are effectively barred from competition in the realm of urban development, while a special few are actively selected and supported.
An area that is more in my field of expertise (and hurts a lot more too) is telecommunication utility control. I live in one of the denser areas in the heart of the city, just outside of the true downtown area. The particular spot I'm at has Qwest/CenturyLink as the sole provider (Broadstripe supposedly has some permission to work here, but it's effectively barred due to the limited access). This means near the heart of the damn city, I'm stuck on a shit DSL connection. This is all because the city bestows franchise utility monopoly privileges to particular providers in different areas of the city. There is some overlap, like mandating that whoever is given control over some area must let some number of other ISPs use their cable for some fee. It is a token gesture however, because unless the rest of society is permitted total access to serve customers as they are willing, it doesn't matter since the entrenched and protected provider can stay just ahead of the extra cost any competitor has to deal with to shut out competition. In my particular case, I suspect (but I admit I have no proof) that the privilege to CenturyLink is related to the nearby Seattle University which probably has some deal made between them, CenturyLink and the city. In any case, it is a simple fact that better service is lacking in one of the denser areas of the city for no justifiable reason. It isn't even so upsetting that we have terrible service provision so much as it is the fact that the reason our providers suck is so unnecessary.
So, I'll stop there since I'm sure you get my point. I'll end with a caveat that most mature cities suffer from these types of problems, so I cannot say if it is any worse than where you are coming from.
Re:Looking to move to Seattle... by 93+Escort+Wagon · 2012-08-24 15:50 · Score: 1

Yeah, actually you are right that gridlock is not really new up here. McGinn is just such a breathtakingly bad mayor though.
I find it a bit funny, because in Seattle it's not like there's any real right versus left division - all the crap that goes on is about individual agendas. Plus people want all this stuff and want new laws to save puppies and orphans and transgendered left-handed bicyclists, but don't really want to pay for any of it - it surprises me how anti-tax Seattle can be for such an ostensibly liberal place.

--
#DeleteChrome
Re:Looking to move to Seattle... by symbolset · 2012-08-24 15:56 · Score: 4, Funny

Seattle has great parking. You can park your car on I5 for several hours each day without concern that traffic might move forward while you're shopping.

--
Help stamp out iliturcy.
Re:Looking to move to Seattle... by drainbramage · 2012-08-24 16:27 · Score: 1, Funny

For those of you not from Seattle:
He is complaining that he voted for the Democrat instead of the other Democrat.

--
No brain, no pain.
Re:Looking to move to Seattle... by SuricouRaven · 2012-08-24 18:26 · Score: 1

"Plus people want all this stuff and want new laws to save puppies and orphans and transgendered left-handed bicyclists, but don't really want to pay for any of it"

I think you just described every country ever. I'm sure if you went through the cuneiform tablets dug up from Babylon you'd eventually find a letter complaining that there aren't enough guards on the street and the taxes are too high.
Re:Looking to move to Seattle... by drainbramage · 2012-08-25 02:35 · Score: 1

Please give him some slack, he graduated from a Seattle school.

--
No brain, no pain.
Re:Looking to move to Seattle... by cthulhu11 · 2012-08-26 18:10 · Score: 1

McGinn made specific promises pre-election that he wouldn't let his personal ideology affect policies where the citizenry clearly differed from him... then he turned around and spent most of his time fighting ideological political battles, ignoring real problems while devoting 100% of his time tilting at his personal windmills.
Sounds kinda like Schwartz and Sun.

Seattle Times, Where Are You? by Frosty+Piss · 2012-08-24 12:30 · Score: 2

Interesting that this is not on the front page of the Seattle Times. In fact, I can't find it at Washington's biggest paper at all.

--
If you want news from today, you have to come back tomorrow.

Re:Seattle Times, Where Are You? by cthulhu11 · 2012-08-26 18:12 · Score: 1

The "Seattle Times" really needs to come clean and just rename itself the "Microsoft Times". This story didn't concern Microcultists, so it doesn't rate their attention.

Re:Okay, A Point Here by Anonymous Coward · 2012-08-24 12:33 · Score: 1

Problem is that it contained at least bill paying and other personal information, and the government has very strict laws about how stuff like that is to be stored and processed. Even if they can get over the red tape just for using a private vendor, there are laws saying that they have to use vendors that meet certain special interest criteria, and then automatically pick the lowest price because of budget laws. In the end they get a data center that is down on weekends, holidays, and all other cruft for a freaking fortune. I remember when one of my relatives told me about a similar tale; they were forced by all sorts of stupid laws to buy office chairs (padded, but still pretty much junk) for a government office for hundreds or thousands of dollars apiece, chairs that would cost at very most $100 at the local OfficeMax.

Re:Okay, A Point Here by Anonymous Coward · 2012-08-24 12:33 · Score: 1

Feel free to bid out the project, and then see if it's worthwhile. Check out the costs of something being under another entity's control.

In this case, they COULD pay the extra to have this fixed while running, but for them, I'm guessing the temporary shutdown over a 3-day weekend is the more cost-effective option. With two days on either side, it's hardly a gross inconvenience. The city's key operations will go on.

Or just assume that contracting out is the better way, and sprinkle on a little magic fairy dust from the Cloud.

5 days no government, is that so bad? by DevotedSkeptic · 2012-08-24 12:35 · Score: 2

If you lived in Podunk nowhere then no, probably not; if emergency services continue to operate it wouldn't be a big issue. But for such a large municipality to go dark for 5 days... that would definitely be impactful locally and possibly regionally/nationally to a smaller degree. Emergency services are very important, but the business of government (no matter how I feel about it from time to time) needs to continue and serve its people... I am sure (at least I hope) that they looked into portable power generation, but it seems that this is a poor solution. Just my 2 pennies.

--
Chief Thinker www.devotedskeptic.com

Re:5 days no government, is that so bad? by jrmcferren · 2012-08-24 12:41 · Score: 3, Insightful

They have the power, they just can't get it where they need it without equipment overheating. Since it is a busbar overheating you can't just switch over to emergency power to fix it, you have to route power around the issue which is not economically feasible in this case except for the emergency services systems which can use their redundant power supplies.

--
sudo mod me up
Re:5 days no government, is that so bad? by DevotedSkeptic · 2012-08-24 12:47 · Score: 2

Well, being able to keep up emergency services is definitely most important. I don't think we are getting the whole story, since either something was added to create extra electrical draw or something is failing. I wonder if that is the 2.1 million spoken of to add capacity... or repair.

--
Chief Thinker www.devotedskeptic.com
Re:5 days no government, is that so bad? by dnay · 2012-08-24 12:48 · Score: 2

They have the power, they just can't get it where they need it without equipment overheating. Since it is a busbar overheating you can't just switch over to emergency power to fix it, you have to route power around the issue which is not economically feasible in this case except for the emergency services systems which can use their redundant power supplies.
Run down to Autozone and grab a couple dozen jumper cables.

--
Since I gave up hope, I feel much better.
Re:5 days no government, is that so bad? by Isaac-1 · 2012-08-24 12:54 · Score: 2

Am I the only one to think, how many modern servers does the city of Seattle really need? Google says the population was only 608,000 in 2010.
Re:5 days no government, is that so bad? by TubeSteak · 2012-08-24 14:47 · Score: 1

Am I the only one to think, how many modern servers does the city of Seattle really need?

By my calculations, the city of Seattle needs exactly two electrical buses worth of modern servers.

--
[Fuck Beta]
o0t!
Re:5 days no government, is that so bad? by Anonymous Coward · 2012-08-24 15:05 · Score: 3, Funny

The city of Seattle, or any modern city, needs exactly three modern servers to provide their public services. And two of them are to provide redundancy for the one that does the actual work. Internally they may need more servers for VDI or some such, or need to physically isolate one service from another. But one modern server is adequate to provide all of the public services Seattle provides, and two more provide geographic redundancy through their fiber network, which could be upgraded to 100 Gig for a reasonable cost because they own the fiber and the endpoints. The devil is in the I/O, and SSD takes care of that.
But I can't tell them that. I sell their multitudinous departments a lot of servers.

The dog ate my data center by fustakrakich · 2012-08-24 12:42 · Score: 1

If bills don't get paid, there better not be any late fees imposed. The banks could make millions on this.

--
“He’s not deformed, he’s just drunk!”

Oh crap.... by Anonymous Coward · 2012-08-24 12:55 · Score: 1

iCarly will be pissed.

Wow by koan · 2012-08-24 13:13 · Score: 1

Sounds like Seattle's 911 system is quite fragile.

--
"If any question why we died, Tell them because our fathers lied."

Forgetting something? by LostCluster2.0 · 2012-08-24 13:19 · Score: 2

If power problems are downing the city's datacenter for a holiday weekend, couldn't they just rent a few $100/mo servers and run the city apps on them for the downtime and make the problems transparent to the end user? No one-place site is ever safe for important apps, we call that a Single Point of Failure around here.

--
I'm LostCluster but I lost my password to that user. Hey Slashdot, how about helping me get it back!

Re:Forgetting something? by Skapare · 2012-08-24 17:39 · Score: 1

No. They should get the money to put in 2:3 redundancy separate data centers, away from downtown, in widely separated locations.

--
now we need to go OSS in diesel cars

the cloud by Lord+Ender · 2012-08-24 13:22 · Score: 2

Seattle? The home of Amazon? Why on earth don't they just move their datacenter to Amazon Web Services? They could probably do it for less than the $2.1 million they're spending on this single part!

--
A slashdotter who didn't build his own computer is like a Jedi who didn't build his own lightsaber.

Re:the cloud by 93+Escort+Wagon · 2012-08-24 13:59 · Score: 2

It's also the home of Microsoft; and Google is also strongly represented. You can't afford to piss off any of these guys...

--
#DeleteChrome
Re:the cloud by musicalmicah · 2012-08-26 10:40 · Score: 1

Seattle? The home of Amazon? Why on earth don't they just move their datacenter to Amazon Web Services? They could probably do it for less than the $2.1 million they're spending on this single part!
Migrating huge amounts of data and services is very expensive, and especially difficult to do in years when the tax revenue is down. Government is also typically more conservative with new technologies and processes than the private sector, and apprehensive about outsourcing when proper stewardship of citizens' data is their #1 priority.

911 and emergency services by girlintraining · 2012-08-24 13:58 · Score: 4, Insightful

What I'm trying to figure out is why 911 and emergency services didn't have a separate offsite backup. I mean, how much more mission critical can you get than that? Every time I see one of these articles I think to myself: Why are they mentioning this if there wasn't some risk of failure? And the answer is... because quite obviously, there was some risk.

I don't want my cause of death to be "Your call could not be completed as dialed. Please check the number and try your call again later..."

--
#fuckbeta #iamslashdot #dicemustdie

Re:911 and emergency services by adolf · 2012-08-24 15:31 · Score: 2

In my experience with 911 and emergency communications (none of which is anywhere near the scale of what Seattle must have), they have power redundancy (consisting of one or more UPS and one or more standby generator), connectivity redundancy (multiple telephone/data circuits going to different places), and physical redundancy.
So if one 911 PSAP goes completely offline for any reason, there is one or more geographically independent backup PSAPs which can take over in quid-pro-quo fashion.
Do things get a little bit hairier when this happens? Absolutely: You've got folks who, no matter how good they are at doing their usual job, are now doing a somewhat different and more complex job. Efficiency goes to shit, but more hands are easily called in/moved around to help with that in short order.
So. The 911 phone will still be answered, and your ambulance/fire brigade/armed posse is still within easy grasp.

--
Kid-proof tablet..
Re:911 and emergency services by gsogeek · 2012-08-26 06:29 · Score: 1
Efficiency goes to shit, but more hands are easily called in/moved around to help with that in short order.
So. The 911 phone will still be answered, and your ambulance/fire brigade/armed posse is still within easy grasp.
While the redundancy is built into the system to allow the call to go somewhere, it may not be a place that can handle the call the best. We try to build the systems to account for these failovers, but even then, calltakers/dispatchers will start working on a sort of "muscle memory" when things get bad and busy. Just because the 911 phone is answered, doesn't mean that the ambulance/fire brigade/armed posse is within easy grasp. It's not as simple as adding capacity if the extra capacity is just as unaware of the layout of the area they are serving, and you can only drill on these situations so much.
For example: Assume CityA and CityB. CityA is a rather large city with 250k people. CityB is a fair-sized city with 100k people. PSAP for CityA goes dark and fails over. A call is received for 203 Main St. in CityA for a 911 hangup/Check Welfare. Since the system has failed over, this call is now being handled by the PSAP for CityB. CityB also has a 203 Main St. The calltaker sees on their display that there's a call for 203 Main St and puts that address in the CAD. CAD finds 203 Main St and the dispatcher sends the police to do a welfare check at that address. CityB Police get there, nothing to be found, and the call is closed out as unfounded. After a few minutes, another call comes in for CityA's 203 Main St. (Remember during this time that CityB has been processing their own call volume as well, which can be quite substantial, as CityB's PSAP was not structured to handle the call volume for CityA as well.) At this point, dispatch has their first "red flag" moment and realizes that they need to mentally switch over and send CityA Police to check the area. Luckily, this case would be a kid playing with the phones, but it could turn out very different if this were for say a heart attack, or some active crime in progress.
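To make the CityA/CityB mix-up concrete, here is a minimal Python sketch of the difference between searching only the local address table and carrying the originating city along with the failed-over call. Everything in it is an assumption for illustration: the `GAZETTEER` table, the `Call` record and both lookup functions are hypothetical and do not reflect any real CAD product.

```python
from dataclasses import dataclass

# Hypothetical address tables; a real CAD would query its GIS layers instead.
GAZETTEER = {
    "CityA": {"203 Main St", "9 Pine Ave"},
    "CityB": {"203 Main St", "41 Oak Rd"},
}


@dataclass(frozen=True)
class Call:
    address: str
    origin_city: str  # city whose PSAP the call was originally routed to


def naive_lookup(call, local_city):
    """What 'muscle memory' does: search only the local city's addresses."""
    return local_city if call.address in GAZETTEER[local_city] else None


def failover_lookup(call, local_city):
    """Prefer the originating city, and flag addresses that exist in several cities."""
    matches = [city for city, addrs in GAZETTEER.items() if call.address in addrs]
    ambiguous = len(matches) > 1
    if call.origin_city in matches:
        return call.origin_city, ambiguous
    if local_city in matches:
        return local_city, ambiguous
    return None, ambiguous


call = Call(address="203 Main St", origin_city="CityA")
print(naive_lookup(call, "CityB"))     # CityB
print(failover_lookup(call, "CityB"))  # ('CityA', True)
```

The ambiguity flag is the part that matters: it is the software version of the "red flag" moment, raised on the first call instead of the second.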
Oh how I wish the redundancy planning for these particular systems were so well thought out, and some are (One agency I know of has a full hot-standby PSAP built that can be staffed and processing in about 10 minutes, they actually test this on a regular basis), but a lot of other, usually smaller sites or very large sites, simply flip the switch and shunt the calls to neighboring PSAPs to handle the load, with the results of the example above, until they can finally get their disaster plan options up and running.
Mostly, this is a matter of cost and, as has been mentioned before, everybody wants 911 answered on the first ring, but nobody wants to pay the taxes needed to make that happen, which usually only comes up when you get that nice "All circuits are busy" message, or the dreaded busy signal when you dial 911 (both of which can and do happen in some areas).

Definitions:
- PSAP - Public Safety Answering Point (the place the phone rings)
- calltaker - the person that talks to the public that is calling in
- dispatcher - the person that tells the cops where to go (I love saying that)
- CAD - Computer Aided Dispatch (This has very little to do with drawing things beyond basic GIS, even though spammers think it does)
--
All systems working, customers satisfied, and staff eagerly enthusiastic. All pigs fed and ready for flight.
Re:911 and emergency services by adolf · 2012-08-26 20:26 · Score: 1

No system is without faults.
But in this context (the Seattle non-fiasco), it doesn't seem to be a big deal. Things were/are fine.
That said: Everyone wants and expects their 911 services to be absolutely bullet-proof, but nobody paying for it gives a fuck about the funding for that. 911 (in these parts) is funded in ways that are more straight-forward than gasoline taxes, and relatively easily understood. And these taxes are currently on the chopping block, for the benefit of no-one and the detriment of all.
But that's a different issue than 911 being generally available in the face of catastrophic failure, which it is. The issues you list are those of training deficiency, and the training is lax perhaps only because the regular system(s) are so reliable that nobody bothers to consider the concept that failure is a very realistic option.

--
Kid-proof tablet..

Good plan by Guerilla+Antix · 2012-08-24 14:37 · Score: 1

Nice, so they're running their mission critical operations on reserve systems. Hope nothing too important happens while they're getting bombed by a /. post.

Re:Okay, A Point Here by AK+Marc · 2012-08-24 14:50 · Score: 2

I've had a similar issue with a private data center. There wasn't a UPS bypass switch because the UPS had an internal bypass switch (installed with the datacenter years before). But the UPS was old, and a new UPS was cheaper than replacing all the batteries (and more powerful with better features). So my coworker planned out the switch, 2 days outage over a weekend. Of course, since I took most of the classes to be an EE, I re-drew the plans and got the project done with half the labor time and two 30-second outages (well, both were about a second, but longer than the time a server could live without power, so it was safer to turn everything off as if it were a longer outage). The problem was caused by a stupid "cost saving" choice on installation.

Sounds like something similar here, where there's an issue with part of the redundancy, but it's not actually capable of running fully redundantly. Otherwise, cut everything over, then fix it. Or just turn it off and fix it (and the power will flow). I've seen it more than once in corporate world, so it's not an example of governmental oops, just IT oops.

--
Learn to love Alaska

Re:It has to be said... by AK+Marc · 2012-08-24 14:55 · Score: 2

The cloud doesn't need power?

--
Learn to love Alaska

Use the remote site by hawguy · 2012-08-24 15:04 · Score: 2

Why don't they just fail over the critical life and fire safety systems to the backup datacenter, and keep normal services up at the primary datacenter while they do the work? They do have a second site, right? Surely no one would host a system deemed "critical" and "life safety" at a single site?

Re:Use the remote site by Glendale2x · 2012-08-24 15:29 · Score: 1

Because while things may have been well designed originally or planned including all the fancy redundancy, after years of no major issues it becomes a target of its own success: cutbacks and people saying "see, we never needed it, and look at how much money we can save". Such is the way of things.
If you personally are worried about 911 services being out then go write down the various 7-digit (or 10-digit if your exchange requires it) numbers for your local emergency services. 911 is not the exclusive way to reach them, just the easiest. Whatever happened to the days of the list of those various numbers on the fridge? I'm not even that old and I remember my parents having the list posted just in case.

--
this is my sig
Re:Use the remote site by hawguy · 2012-08-24 15:52 · Score: 2

Because while things may have been well designed originally or planned including all the fancy redundancy, after years of no major issues it becomes a target of its own success: cutbacks and people saying "see, we never needed it, and look at how much money we can save". Such is the way of things.
If you personally are worried about 911 services being out then go write down the various 7-digit (or 10-digit if your exchange requires it) numbers for your local emergency services. 911 is not the exclusive way to reach them, just the easiest. Whatever happened to the days of the list of those various numbers on the fridge? I'm not even that old and I remember my parents having the list posted just in case.
I thought I was already paying for a reliable E-911 service through the 911 service fees we've all been paying on our phone bills for years.
So what you're saying is that even though we've been paying for 911 for years, we've been paying for cheap, non-redundant service, and if we expect the type of multi-site redundancy that's normally reserved for moderately successful websites, then we need to pay even more? What value are we getting from the hundreds of millions of dollars already collected?
I've called 911 a handful of times, but never from my own house so I'm not sure how that list of phone numbers taped to the fridge is supposed to help me. There used to be a time when you could count on finding a phone book under the phone in your friend's house with the local emergency numbers inside the front cover, but I haven't seen a phone book at a friend's house in years.
Re:Use the remote site by Glendale2x · 2012-08-24 15:57 · Score: 1

Next you're going to tell me that the USF fees are always used precisely for what they say they're for.

--
this is my sig
Re:Use the remote site by johnnick · 2012-08-24 16:02 · Score: 1

>Because while things may have been well designed originally or planned including all the fancy redundancy, after years of no major issues it becomes a target of its own success: cutbacks and people saying "see, we never needed it, and look at how much money we can save". Such is the way of things.
Part of this is also people who are bad at math. I once had a major disagreement with a business guy trying to explain that there was a significant difference between a server that had been 100% available for a given time period and one that was _architected_ to be 100% available. He couldn't understand that the former scenario involves getting lucky, while the latter is the result of (more expensive) design.
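A rough way to put numbers on "lucky" versus "architected", as a minimal Python sketch. The exponential failure model, the 3-year MTBF, the 99% single-server availability and the assumption that redundant copies fail independently are all illustrative assumptions, not figures from this thread.

```python
import math


def prob_zero_failures(window_years, mtbf_years):
    """Chance a single server shows no failure in the window (exponential model)."""
    return math.exp(-window_years / mtbf_years)


def redundant_availability(single_availability, copies):
    """Availability of N copies where any one suffices, assuming independent failures."""
    return 1.0 - (1.0 - single_availability) ** copies


# Assumed: one server, 3-year MTBF, observed for one year.
lucky = prob_zero_failures(window_years=1.0, mtbf_years=3.0)
print(f"{lucky:.2f}")  # 0.72

# Assumed: each server is 99% available on its own; two of them in parallel.
designed = redundant_availability(single_availability=0.99, copies=2)
print(f"{designed:.4f}")  # 0.9999
```

Under those assumptions a lone server has roughly a 72% chance of posting a perfect year by luck alone, which says nothing about how it behaves in the other 28% of years. The redundant pair is what actually moves the expected downtime.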

--
"The plural of anecdote is not data."
Re:Use the remote site by CAIMLAS · 2012-08-24 16:25 · Score: 1

You just don't fail over to an off-site facility when you're short staffed and haven't thoroughly tested your off-site. Very few locations can effectively fail to an off-site location gracefully for one reason or another.

--
~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers
Re:Use the remote site by forkazoo · 2012-08-24 18:20 · Score: 1

Presumably because then, there won't be a backup available for the critical systems. There probably is some extensive backup infrastructure available, but you never activate it unless you genuinely *absolutely have to.* If something bad happens to the active systems while you have voluntarily taken down half your 911 infrastructure, "we didn't want to take down any convenience systems," really won't cut it as an excuse. Besides, the presumed backup probably isn't seamless, doesn't work quite as well, etc. You almost never have 100% capacity in your DR secondaries. They are usually just to tide you over in an emergency, and maintain some functionality.
Re:Use the remote site by Glendale2x · 2012-08-24 19:29 · Score: 1

With 911 services the other issue is consolidation. In my area there used to be multiple centers with dispatchers for various agencies in diverse locations. By nature it was redundant. If one had problems the others could assist. Now there's just a single unified emergency communications center for everyone. Even if there is a backup site or plan, all of the dispatchers on duty are only at the unified center and it takes time to shift things around.

--
this is my sig

overheating power buses / wires are a fire risk by Joe_Dragon · 2012-08-24 15:40 · Score: 2

Overheating power buses / wires are a fire risk, and that comes from them being undersized for the load.

See The Towering Inferno to see where that can get you.

Re:overheating power buses / wires are a fire risk by thegarbz · 2012-08-24 22:48 · Score: 1

and that comes from them being undersized for the load.
While you're technically right, they may not be undersized for the load. Very few substations suddenly become so warm because they are sized incorrectly for the loads they are driving under normal conditions.
The vast majority of overheating in switchgear comes from either malfunctioning equipment (overdrawing current without tripping out), maintenance problems (dust and such, though more of a problem in remote RMUs), or faulty installations or faults developing over time causing poor connections.
Last time we had an overheating event at work it was due to the switchgear not being racked in correctly. Everything worked but we found the problem with a thermo scan that looked like we had a heater in the very middle of our switchgear. Unfortunately we could not rack it out as it was jammed and the answer was taking an outage of the substation. Fortunately we had redundancy.

Re:Okay, A Point Here by CAIMLAS · 2012-08-24 16:23 · Score: 2

I had an almost identical situation happen to me this past spring, too. I was the sysadmin at one of the facilities. It happened right after I gave my two weeks, and damn was I busy. :P I ended up having to take all my UPSes off the mains and run them over some two phase at one point to get additional power onto a secondary genset, because the amp load simply was too high (oops, poor planning - someone forgot to figure high load overhead amperage requirements).

Unlike this situation, my situation only had a single power run due to the topographical location of where we were: on top of a hill/small mountain, on the edge of a park. There were 5 fairly sizeable facilities on the hill, some of which have some fairly significant power requirements due to the type of work they perform (lots of sciencey stuff).

Fortunately, all of the buildings had (100 kW+) gensets. Unfortunately, only one of the 5 was NG, and the others were diesel. This gets really costly, really quickly, since it's California, diesel's at something like $4.50/gallon, and the things will burn through a full 500 gallon tank in a day at around 60% utility. So we're talking ~$10k a day just to keep these things fueled (including an extra pulled up due to additional crunch demand).
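For anyone checking the arithmetic, a minimal Python sketch using only the figures quoted above (four diesel gensets, a 500 gallon tank per day each, $4.50/gallon). The function name is made up for illustration, and treating the "extra" unit as a fifth identical diesel genset is an assumption.

```python
def daily_fuel_cost(diesel_gensets, gallons_per_day_each, price_per_gallon):
    """Fuel spend per day for a group of identical diesel gensets."""
    return diesel_gensets * gallons_per_day_each * price_per_gallon


# Four of the five building gensets were diesel.
base = daily_fuel_cost(diesel_gensets=4, gallons_per_day_each=500, price_per_gallon=4.50)
print(base)  # 9000.0

# Assumption: the extra genset brought in burns fuel at the same rate.
with_extra = daily_fuel_cost(diesel_gensets=5, gallons_per_day_each=500, price_per_gallon=4.50)
print(with_extra)  # 11250.0
```

So the ~$10k a day figure sits between the four-genset and five-genset cases, which is about what you would expect if the extra unit was not running flat out.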

Plant faculty - probably a good 30-60 people in all - were in the conduit going up the hill for a day trying to figure out where the fault was, and then another three days getting new cable run and relay substation. (God, I hate how slow many union workers work.) Turns out the relay fused up pretty solidly, welding itself nicely into the culvert.

I seem to recall talk back and forth that the total damage was going to be over $500,000, so it really doesn't surprise me that a large city's power infrastructure would cost a multiple of that. If cities are like some of the hospitals I've seen, they've got lecherous IT sales people at their door on an almost-daily basis. They also buy a lot of the crap the sales people are peddling, much of which seems to (still) require being run on its own proprietary platform and/or a dedicated piece of hardware. And then, the old systems don't really go away until they die, and there's a cost incurred to recover the lost data - because they're non-profit, they don't really seem to understand cost of maintenance, depreciation, or anything like that. So, I can certainly see the power requirements for some poorly designed cluster for public facing things, a handful or three of interface systems to tie in with the governmenty systems, and so on.

In my mind, it makes sense that they just shut those services down temporarily. "Forced vacation use" for city workers, maybe? They'll save a lot more than 2.5 million that way, if they can do it, I'm sure (funny how government is able to cut costs when there's no alternative :P). I imagine it's too much of a cost and/or risk to try to move essential services (fire/PD/911) to the hot site, and really no reason to do so, especially when they've not yet tested their DR plan.

--
~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers

Re:It has to be said... by Skapare · 2012-08-24 17:37 · Score: 1

The cloud is somewhere else with their own power problems.

--
now we need to go OSS in diesel cars

Re:2.1 million? What? by Xero · 2012-08-24 17:57 · Score: 4, Informative

The datacenter is on the 26th floor of the municipal tower and the overheating bus runs up to that floor. The power company in question is municipally owned, so either way it would be the city's problem.

Story doesn't make sense by Xero · 2012-08-24 18:18 · Score: 2

McGinn had quite a few facts wrong in the press conference. The equipment is working fine now and the overheating only caused a minor amount of downtime. The major issue, though, was that the backup generator never kicked in because, as it turns out, the electric starter for the diesel generator is connected to the same bus. Labor Day weekend was then chosen to fix this majorly obvious design deficiency.

Re:Okay, A Point Here by PPH · 2012-08-25 04:45 · Score: 1

when was the last time a well-run private datacenter was offline for five days barring a natural disaster?

Boeing's datacenter in Bellevue, Washington (about 10 miles east of Seattle). About a decade ago, they had to shut down their entire operation because a purpose-designed and built data center had power problems and didn't have the system redundancy they thought it did. In this case, the problem had to do with the use of incorrectly specified parts in some panelboards (main lug bolts). When a few were discovered to be overheating due to loose connections, the extent of the problem was revealed.

That datacenter was supposed to have been designed with fully redundant systems, including two utility sources. But that turned out not to have been the case and the only solution was to shut down everything over a long weekend and replace the suspect parts.

Having been in the commercial power biz in addition to my time at Boeing, I have become aware of a number of inadequate data center power systems installed in the past few decades. Without pointing any fingers, it seems the Seattle area has suffered from a few engineering firms and contractors that rode the dot-com boom, swept in installing crap and moved on. The City of Seattle's IT infrastructure could be suffering from an additional problem in that they consolidated their city operations into a building that they picked up cheap. And this building was designed prior to the past few decades' growth in IT systems. So its data center power systems were a retrofit to that building with all the shortcomings that this entails. Not enough room to install redundant power risers, switchgear and other assorted equipment in an existing structure.

--
Have gnu, will travel.

Re:Okay, A Point Here by evilviper · 2012-08-25 06:10 · Score: 1

Diesel here in CA is under $4, usually a few cents less than unleaded, and in any case, generators can use the road-tax-free supply (à la home heating oil), which drops the price significantly further.

--
Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant

It's all about how you look at costs by Vrtigo1 · 2012-08-27 13:18 · Score: 1

If you've got enough compute resources to have your own data center, it's probably cheaper for you to have your own data center instead of paying someone else to do it for you. So then, you build your own data center, and you decide on compromising on certain things like power bus redundancy, because in any given data center environment there are a million things that can fail, but you have to prioritize the systems you make redundant by looking at their failure probability and expected failure impact. You can't make everything redundant. That would be foolish because you'd spend so much money on redundancy that you'd have no money left for functionality. You probably want redundancy for routers, switches, servers, storage, etc. because that's all stuff that's likely to fail, but I'd bet most single tenant datacenters probably don't have power grid redundancy because that's really expensive, and the grid is not as likely to fail. You would probably be better served by staying on a single power grid and spending the money that a second power connection would cost on a generator instead.

The point I'm trying to make is that there is a level at which you have to say something is "redundant enough". I think they call that the point of diminishing returns. I would say an overheating power bus is probably an acceptable failure because I would've considered that as something that has a pretty low failure risk, so I wouldn't have spent the money to have two of them.
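To make the "probability times impact" idea concrete, here's a minimal Python sketch. Every number in it is invented purely for illustration (nothing here comes from the article or from any real datacenter), and the `Component` class and `worth_duplicating` function are hypothetical names. It just compares each component's expected yearly loss against what a spare would cost per year.

```python
from dataclasses import dataclass

@dataclass
class Component:
    name: str
    annual_failure_prob: float  # made-up chance of a failure in a given year
    outage_cost: float          # made-up cost of one failure, in dollars
    redundancy_cost: float      # made-up yearly cost of keeping a spare

    @property
    def expected_loss(self) -> float:
        # Expected yearly loss = probability of failure * cost when it fails.
        return self.annual_failure_prob * self.outage_cost

def worth_duplicating(components):
    """Return components whose expected loss exceeds the cost of a spare,
    ordered by net benefit (largest first)."""
    candidates = [c for c in components if c.expected_loss > c.redundancy_cost]
    return sorted(candidates,
                  key=lambda c: c.expected_loss - c.redundancy_cost,
                  reverse=True)

# Hypothetical inventory; all figures are placeholders, not real data.
components = [
    Component("core switch", 0.10, 200_000, 8_000),
    Component("storage array", 0.05, 500_000, 20_000),
    Component("power bus", 0.002, 2_000_000, 50_000),
]

for c in worth_duplicating(components):
    print(c.name)
```

With these placeholder numbers the switch and the storage array make the cut and the power bus doesn't, even though a bus failure is the most expensive single event. That's the "redundant enough" call in miniature; plug in different numbers and the answer can flip.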

Remember I'm talking about a single tenant, privately owned datacenter for a small entity here, i.e., a municipality that probably has somewhere between 100 and 500 servers. Naturally if you're a company that is in the business of doing business online, or a huge company, then this isn't the right path for you. At the end of the day, a municipality offers online services as a convenience for its residents. When the DC blows up, you can still write a check and drop it in the mailbox to pay your water bill.

When you figure out your critical services, you separate them and define another SLA which applies only to them. And from the article, it sounds like that is exactly what they did - they kept the critical life safety systems running and took down the convenience systems for an acceptable period of time. So what's all the fuss about?
