Lightning Strike KOs Amazon, Microsoft EuroClouds

So Cloud v Cloud.... by artor3 · 2011-08-07 15:04 · Score: 5, Funny

...nature wins?

Re:So Cloud v Cloud.... by Torinir · 2011-08-07 15:07 · Score: 3, Funny

Flawless Victory!
Re:So Cloud v Cloud.... by Oxford_Comma_Lover · 2011-08-07 15:27 · Score: 1

Sephiroth is rumored to have been stuck in the middle.

--
-- IANAL, this isn't legal advice, and definitely isn't legal advice for you. Also, Squee!
Re:So Cloud v Cloud.... by Trilkin · 2011-08-07 15:30 · Score: 3, Funny

There is bad fanfiction depicting this event. Lots of it.

--
Nobody cares what the CAPTCHA for your post was.
Re:So Cloud v Cloud.... by Anonymous Coward · 2011-08-07 15:59 · Score: 1

On sale in the dark corners of comics shops all across Japan for 400 yen a copy.
Re:So Cloud v Cloud.... by Anonymous Coward · 2011-08-07 16:01 · Score: 1

...nature wins?
Sadly, not so true.
EC2 is set up so that you can move your instance quite simply to another availability zone such as America.
Leveraging the internet and routing around damage is one of the great strengths of "the cloud". If you had a server traditionally hosted in that datacenter you'd be quite screwed right now.
I dislike "the cloud" term as much as the next /.er but right now its working as intended.
Re:So Cloud v Cloud.... by Anonymous Coward · 2011-08-07 16:06 · Score: 0

Or Cloud v McCloud, depending on if there was a Quickening going down.
Re:So Cloud v Cloud.... by CrankyFool · 2011-08-07 16:17 · Score: 2

US is not another availability zone -- it's a different region. There are multiple AZs per region and -- if Amazon is doing their job -- a lightning strike should not take down more than one AZ in a region.
Re:So Cloud v Cloud.... by Z00L00K · 2011-08-07 16:44 · Score: 2

And with all eggs in one basket you can be sure to crack them all in one punch.

--
If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
Re:So Cloud v Cloud.... by Anonymous Coward · 2011-08-07 16:48 · Score: 0

As Nelson would say, HAH-HAH!

As yo mama might say: NIGGERS!
Re:So Cloud v Cloud.... by rtfa-troll · 2011-08-07 18:39 · Score: 2

I dislike "the cloud" term as much as the next /.er but right now its working as intended.
That seems to be true for Amazon; the outage is exactly what they documented in the case of the loss of a data centre. I'll give them (tentative) points for a job acceptably done*. You can't, however, say the same for Microsoft. They had a user visible application level outage for the loss of a single location. That's a screw up and clearly shows why you shouldn't trust anything "business critical" to just one cloud.
* we still don't have clarity about the physical separation between their generator and phase synchronisation system. I don't know if they could have saved themselves with a proper physical layout of their separated power supplies. Also we have no idea what caused the transformer to be struck instead of just some lightning protector.

--
=~ s,(.*),<sarcasm>$1</sarcasm>,g if any_point_you_wish();
Re:So Cloud v Cloud.... by erroneus · 2011-08-07 19:12 · Score: 0

At least spell it right... "MacLleod" not McCloud.
Re:So Cloud v Cloud.... by ozmanjusri · 2011-08-07 19:25 · Score: 1

Actually, it's more like an earth-shattering BOOM.

--
"I've got more toys than Teruhisa Kitahara."
Re:So Cloud v Cloud.... by ozmanjusri · 2011-08-07 19:28 · Score: 1

You can't, however, say the same for Microsoft. They had a user visible application level outage for the loss of a single location. That's a screw up and clearly shows why you shouldn't trust anything "business critical" to just one cloud.
Don't you mean 'you shouldn't trust anything "business critical" to Microsoft'?

--
"I've got more toys than Teruhisa Kitahara."
Re:So Cloud v Cloud.... by jimicus · 2011-08-07 19:39 · Score: 1

That would be an Illudium Q-36 explosive space modulator, not a lightning strike.
Re:So Cloud v Cloud.... by Guspaz · 2011-08-07 20:18 · Score: 3, Informative

That might be true if Amazon didn't have multiple AZs in single datacenters.
The fault isn't necessarily Amazon's for stuff like this. The whole point of cloud infrastructure is that you use many cheaper instances to scale load and provide high availability caused by the failure of any one (or group of) node. Take Netflix, for example. While they do have their share of outages, they were completely unaffected by Amazon's big EC2 failure a few months ago, despite the fact that a significant portion of Netflix' infrastructure was hosted out of the affected region. Why? Because they built failure into their system, to the extent where they have a process that goes around killing random instances to keep them on their toes. They've planned for and built their system around the possibility that large chunks of the system might just up and vanish without warning.
If you're building a large-scale cloud system, *geographic* diversity should obviously be a part of any high availability plan. I'd also say that having provider diversity isn't a bad idea, but it seems like a lot of big cloud customers just stick with one provider.
Re:So Cloud v Cloud.... by SMoynihan · 2011-08-07 20:33 · Score: 4, Informative

I live in Dublin, and that was some seriously targeted lightning. No sign of storms here, that I saw...
Re:So Cloud v Cloud.... by outsider007 · 2011-08-07 21:37 · Score: 1

Hmm, 20% of the eggs in each basket I'd say.

--
If you mod me down the terrorists will have won
Re:So Cloud v Cloud.... by c0mpliant · 2011-08-07 21:43 · Score: 2

I was thinking the EXACT same thing... I know it was raining pretty hard on Saturday, but I didn't see any lightning or hear any thunder

--
There is no -1 disagree
Re:So Cloud v Cloud.... by Anonymous Coward · 2011-08-07 21:48 · Score: 0

Nature always wins in the long run.
Re:So Cloud v Cloud.... by mabhatter654 · 2011-08-07 22:50 · Score: 1

"Epic Zapp Battles of History"
Re:So Cloud v Cloud.... by SMoynihan · 2011-08-07 23:01 · Score: 1

Yep, and from what I am reading this happened on Sunday (amazon.co.uk's accounts section was down then too, I guess it was related). We had surprisingly good weather yesterday - considering the forecast. No giant electrical discharges from the sky that I saw!
Re:So Cloud v Cloud.... by Blobule · 2011-08-07 23:07 · Score: 1

I don't usually correct another's spelling, but since you incorrectly spelled MacLeod while pointing out someone else's spelling mistake, I thought it prudent :)
Re:So Cloud v Cloud.... by chill · 2011-08-07 23:12 · Score: 1

The biting humor is you also spelled it wrong. Considering it was a reference to The Highlander, the correct spelling would be with only one "L", not two. MacLeod.

--
Learning HOW to think is more important than learning WHAT to think.
Re:So Cloud v Cloud.... by 2phar · 2011-08-07 23:24 · Score: 2

The lights were flickering here in South Dublin last night around 10pm and there had been an outage of some sort 3-4 hours earlier. I was wondering what was going on, because brownouts/blackouts are extremely rare here. The only real memories I have of power outages in Dublin were rolling blackouts due to industrial action back in the 1970s.
Re:So Cloud v Cloud.... by Muad'Dave · 2011-08-07 23:56 · Score: 1

"History shows again and again how nature points out the folly of men....GODZILLA!" - Blue Oyster Cult

--
Tiller's Rule: Never use a word in written form that you've only heard and never read. You will end up looking foolish.
Re:So Cloud v Cloud.... by TheRaven64 · 2011-08-08 00:06 · Score: 1

There was thunder and lightning over Swansea on Sunday night. From the direction and the speed the wind was blowing, I wouldn't be surprised if it had been in Ireland a little earlier.

--
I am TheRaven on Soylent News
Re:So Cloud v Cloud.... by c0mpliant · 2011-08-08 00:13 · Score: 2

Yeah it was daecent weather, too bad I still recovering from Saturday night to be able to enjoy it!
Yeah I just asked around the office here and no one heard any either. I'm beginning to wonder if someone was just playing with a set of balloons and had an unfortunate accident involving static...

--
There is no -1 disagree
Re:So Cloud v Cloud.... by Anonymous Coward · 2011-08-08 00:40 · Score: 0

I thought the idea behind the Internet was a distributed computing so that if any one point went down you would still be able to get where you were going. So wouldn't one also build the same kind of redundancy into a centralized cloud service? It must be that a massive centralized cloud service isn't cost effective unless you only have a single point of failure.
Re:So Cloud v Cloud.... by Anonymous Coward · 2011-08-08 02:51 · Score: 0

Curse you. *shakes fist*
Not how I wanted to start my morning...
Re:So Cloud v Cloud.... by erroneus · 2011-08-08 03:38 · Score: 1

yeah... obvious typo... it was too early to be typing accurately... ... but I should have known all along because...
"There can be only one!!!... L in MacLeod"
Re:So Cloud v Cloud.... by Chris+Mattern · 2011-08-08 06:04 · Score: 1

FATALITY.
Re:So Cloud v Cloud.... by slick7 · 2011-08-08 10:13 · Score: 1

Actually, it's more like an earth-shattering BOOM.
More like, SNAP, CRACKLE, POP. So much for surge protection and isolation transformers.

--
The mind conceives, the body achieves, the spirit manifests.
Re:So Cloud v Cloud.... by Anonymous Coward · 2011-08-08 15:51 · Score: 0

I agree. You need to have a backup plan.
http://networkingexchangeblog.att.com/technology/outages-happen-disaster-recovery-the-cloud-and-a-lesson-from-cycling/
Re:So Cloud v Cloud.... by SMoynihan · 2011-08-11 03:40 · Score: 1

I've just been over at the Met Eireann website, and stumbled across this interesting service recording lightning strikes each day. If I am reading it right, it seems there about 11 near Dublin over that day. Indeed, quite targeted to hit Amazon! Someone has been irritating Greek deities...
http://www.met.ie/climate/lightning.asp?ReportDate=06/08/2011
Re:So Cloud v Cloud.... by ksandom · 2011-08-12 09:38 · Score: 1

US is not another availability zone -- it's a different region. There are multiple AZs per region and -- if Amazon is doing their job -- a lightning strike should not take down more than one AZ in a region.
Correct, except that it was only one availability zone that went down. That AZ maps to one of 3 different labels for different customers. For us it was eu-west-1a, for one of our customers, it was eu-west-1b. The AZs did their job exactly as they were supposed to.

--
Funnyhacks - Wierd, unusual, and fun hacks

All I can say is...... by Anonymous Coward · 2011-08-07 15:07 · Score: 0

Shocking

BPOS? by Anonymous Coward · 2011-08-07 15:07 · Score: 0

Did anyone vet that abbreviation before they launched it?

Just imitiating Verizon. by DWMorse · 2011-08-07 15:08 · Score: 5, Funny

I see how it is. Verizon workers go on strike, MSFT and Amazon gotta call in for something strike-related that's bigger and flashier. Show-offs.

--
There's a spot in User Info for World of Warcraft account names? Really?

My Sympathies by smpoole7 · 2011-08-07 15:08 · Score: 3, Insightful

Considering that my radio stations have been getting hammered for weeks now by this horrible weather in the Southern United States, my sympathies are with them.

I don't care how much protection you put on your system (and when you have giant lightning rods that are hundreds of feet tall, like we do, you DO try to protect things), an occasional strike is going to slip through. When it does it can get ... messy. :)

--
Cogito, igitur comedam pizza.

Re:My Sympathies by Hartree · 2011-08-07 15:57 · Score: 2

"When it does it can get ... messy. :)"
I'm often amazed at all the weird failures even when lightning doesn't hit directly and just induces currents.
Used to see long RS232 runs that woudn't fail instantly, but would act flakey soon after a near strike, and then take a day or two to fail completely.
Re:My Sympathies by kent_eh · 2011-08-07 16:07 · Score: 5, Interesting

I've been in the same spot. (10KW, 3 tower array). It's amazing how far the parts of a capacitor on a P&M panel can spread when propelled by a lightning strike.
Even with ball gaps, chokes, and all the other effort, ultimately the transmitter has to be connected to the tower. 50 ohms is not that much different than "the shortest path to ground" when you put a few thousand KV against it.

It took several years after my career change to enjoy the spectacle of a lightning storm

--

---
"I can't complain, but sometimes still do..." Joe Walsh
Re:My Sympathies by Z00L00K · 2011-08-07 16:47 · Score: 1

And those 50 ohms aren't resistive ohms either, so the lightning just ignores them.
A lightning can be very powerful - a 2" steel tube can get disintegrated completely by it.

--
If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
Re:My Sympathies by Agripa · 2011-08-08 02:29 · Score: 1

The component failure mode was "disappearance".
Re:My Sympathies by adolf · 2011-08-08 07:22 · Score: 2

Used to see long RS232 runs that woudn't fail instantly, but would act flakey soon after a near strike, and then take a day or two to fail completely.
And that, my friend, is why the good lord gave us RS-422 to use for long runs instead RS-232.
Balanced, differential signalling and sensible grounding FTW.

--
Kid-proof tablet..
Re:My Sympathies by cusco · 2011-08-08 07:53 · Score: 1

We get PTZ cameras where the actuators fail just because lighting strikes 50 meters higher up the antenna that they're mounted on. We don't have any problem with the RS-485 serial connections though, even though some of the runs are >300 meters.

--
"Think about how stupid the average person is. Now, realise that half of them are dumber than that." - George Carlin
Re:My Sympathies by Anonymous Coward · 2011-08-08 10:01 · Score: 0

Well, I don't think anyone faults Amazon for having the single site failure. The big fail is that the supposed failover mechanisms in these "cloud" systems DON'T WORK. 60% up after over 2 hours? That is a fail.
Re:My Sympathies by Hartree · 2011-08-08 11:11 · Score: 1

In most cases, the tech is called in to clean up the smoldering remains of what was put there before he was in the scene.
This had been put into a small machine shop to link the PCs in the office with the CNC machines to transfer files (late 1980s/early 90s).
An employee of the place had hooked it up, and with at most a couple hundred feet, they normally got away with it. Even in an electrically noisy environment like a machine shop.
The owner was of the mindset "Just fix it. It works for us."

Ouch... by __Paul__ · 2011-08-07 15:11 · Score: 1

I don't know a lot about EC2, but I believe they lose all data when they're powered down, unless special provisions are taken, don't they?

This could result in another series of fuckups like this, where a bitcoin exchange lost its wallet.dat due to a misconfigured EC2 instance.

--
worldmobilenet.com -- World Prepaid Wireless Internet plans

Re:Ouch... by __Paul__ · 2011-08-07 15:13 · Score: 1

(Clarification, by "they", I mean "EC2 instances")

--
worldmobilenet.com -- World Prepaid Wireless Internet plans
Re:Ouch... by lgarner · 2011-08-07 16:56 · Score: 2

Yes. When an EC2 instance is "turned off" it's destroyed, along with any data- unless you're using an EBS volume (the special provision) which is persistent, or S3. Not to say that there couldn't be issues with the data on those either, in the face of an extremely sudden, unexpected shutdown. Shutting off an EC2 instance is equivalent to deleting a VMware VM. You then have to start a new one from a template (AMI).
Re:Ouch... by Firehed · 2011-08-07 20:15 · Score: 1

Is it somehow Amazon's fault that the developer didn't RTFM and understand EC2 is effectively the same as remoting into a system booted off a LiveCD with no hard drive attached? Great for computing rainbow tables or, indeed, farming BitCoins; rather poor choice for running a database.

--
How are sites slashdotted when nobody reads TFAs?
Re:Ouch... by JamesP · 2011-08-08 01:00 · Score: 1

This could result in another series of fuckups like this, where a bitcoin exchange lost its wallet.dat due to a misconfigured EC2 instance.
GEEZ I didn't know that.
I admit this is not very clear from EC2 users (that is unless you read the docs), but Amazon should have a provision where if it shuts down because of them, then no data is lost.
And btw use S3 and s3cmd, very good and very easy (even with encryption)

--
how long until /. fixes commenting on Chrome?
Re:Ouch... by stoanhart · 2011-08-08 14:16 · Score: 1

Only if you're idiotic enough not to save your important data to persistent storage. Did you know unsaved Word documents are lost when the power goes off?
Re:Ouch... by __Paul__ · 2011-08-09 00:39 · Score: 1

So, where was the line where I was blaming Amazon?

--
worldmobilenet.com -- World Prepaid Wireless Internet plans

The Cloud by keithpreston · 2011-08-07 15:12 · Score: 3, Funny

Sounds about like
http://xkcd.com/908/

Re:The Cloud by Anonymous Coward · 2011-08-07 16:04 · Score: 1

i dont get it, not really like that at all
Re:The Cloud by Anonymous Coward · 2011-08-07 20:23 · Score: 0

Two large clouds based in the same centre.

Don't say I didn't warn you! by fuzzyfuzzyfungus · 2011-08-07 15:13 · Score: 1

In my capacity as a Certified Solution Architect(tm), I often warned that The Cloud was suitable only for dynamic workloads. But did you listen? Oh, no, you just went and let your static workloads build up in the Cloud, increasing TCO and, now, bringing down Disaster on your heads!

Re:Don't say I didn't warn you! by afidel · 2011-08-07 15:46 · Score: 1

Dude, if you follow proper procedures then your EC2 instance is mirrored in another availability zone and is using multizone S3 replication. It's not cheap but it's available. If you were really a solutions architect you'd know this =)

--
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
Re:Don't say I didn't warn you! by fuzzyfuzzyfungus · 2011-08-07 16:10 · Score: 2

True enough. I was just trying to eke out a static + clouds = lighting joke.
Re:Don't say I didn't warn you! by plover · 2011-08-07 16:14 · Score: 1

In my capacity as a Certified Solution Architect(tm), I often warned that The Cloud was suitable only for dynamic workloads. But did you listen? Oh, no, you just went and let your static workloads build up in the Cloud, increasing TCO and, now, bringing down Disaster on your heads!
Let me see if I understand you. Static buildup in the clouds caused a sudden discharge redistributing the static to a different cloud with its own static buildup, which then failed to discharge its duties after being charged by the other cloud's discharge.

--
John
Re:Don't say I didn't warn you! by dwreid · 2011-08-07 16:50 · Score: 1

Of course the reason for using cloud based platforms is that they are cheap. Now you tell me that if I want them to be reliable they're not cheap. Soooo.... why use the cloud again? Oh that's right... it's the new buzz word.
Re:Don't say I didn't warn you! by afidel · 2011-08-07 17:10 · Score: 2

Depends on your needs, if you need capacity only occasionally or have a workload where the peak is an order of magnitude or more from the base then it can make perfect sense to use a cloud provider, it's not like multisite replication and large amounts of bandwidth are cheap when you do them yourself.

--
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
Re:Don't say I didn't warn you! by hedwards · 2011-08-07 19:11 · Score: 1

Weren't those the same zones that turned out to be hosted in the same facility? Or am I misremembering that.
Re:Don't say I didn't warn you! by jimicus · 2011-08-07 19:44 · Score: 1

Looks like the old "Good, fast, cheap: pick two" adage might need a little rewording. How about "Fast, reliable, cheap: pick two"?
Re:Don't say I didn't warn you! by Anonymous Coward · 2011-08-07 20:35 · Score: 0

failed to discharge
Butt-head: Huh huh huh huh. He said, "discharge."
Beavis: Oh, yeah.
Re:Don't say I didn't warn you! by dkf · 2011-08-07 20:43 · Score: 2

Looks like the old "Good, fast, cheap: pick two" adage might need a little rewording. How about "Fast, reliable, cheap: pick two"?
Since when was "reliable" anything other than one of the classic metrics for "good"? The old adage needs no changes at all.

--
"Little does he know, but there is no 'I' in 'Idiot'!"
Re:Don't say I didn't warn you! by F.Ultra · 2011-08-07 20:53 · Score: 1

It didn't "turn out", the zones are described that way in their document as being in the same facility. However you can select other geographic zones such as US, EU and Asia. They are most certainly not in the same facility :-)
Re:Don't say I didn't warn you! by aug24 · 2011-08-07 22:44 · Score: 1

Rubbish. We stripe across two EC2 zones. We would always want at least two servers anyway in case of any server failure. We still had 100% uptime over last night because failover worked correctly.
We are now seeing servers come back up gracefully and so far have not had to take any remedial action - a watching brief.
And it's cheap.

--
You're only jealous cos the little penguins are talking to me.
Re:Don't say I didn't warn you! by sgt+scrub · 2011-08-08 03:35 · Score: 1

It just isn't the same without a blond in there somewhere.

--
Having to work for a living is the root of all evil.
Re:Don't say I didn't warn you! by sgt+scrub · 2011-08-08 03:38 · Score: 1

I don't think so. I had a girlfriend that was cheep and reliable. Unfortunately, she was anything but good.

--
Having to work for a living is the root of all evil.
Re:Don't say I didn't warn you! by afidel · 2011-08-08 05:24 · Score: 1

I don't think so. I had a girlfriend that was cheep and reliable. fortunately, she was anything but good

FTFY =)

--
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.

Bpos huh ? by Anonymous Coward · 2011-08-07 15:19 · Score: 0

for Bpos users - the change is probably not noticeable anyway.

YOU !! by Anonymous Coward · 2011-08-07 15:19 · Score: 0

Am I first ??

Cloud fail by Baloroth · 2011-08-07 15:20 · Score: 5, Insightful

My understanding of the point of cloud computing was that it would be distributed. I.e. the failure of any one data or computing center would mean the data was still available. Hence, the term "cloud": nebulous, non-localized. Apparently, someone forgot to tell Microsoft and Amazon what the buzzwords they were using actually mean. I more or less expected that of M$, but the fact that Amazon failed too, well, thats pretty a little surprising. I guess it's kinda the norm for all large corporations.

Glancing at the article, it looks like this outage effected only a certain area, but still, cloud should mean other data centers would take over. I particularly love the quote "Dublin has become a key cloud computing gateway." If one city serves as a "gateway", its not a cloud system. I understand using it as one data center, but others should take over automatically for that area in case of a failure. If you don't have a failover system, you don't have a real cloud computing platform. You have a wannabe cloud computing platform. Or maybe they are just taking a buzzword and redefining it to suit their purposes. That's... exactly what we should expect, I suppose.

Or am I completely misunderstanding the meaning of this latest buzzword? It's quite possible, I never quite got down what "Web 2.0" was supposed to mean either. Beyond lots and lots of Flash.

--
"None can love freedom heartily, but good men; the rest love not freedom, but license." --John Milton

Re:Cloud fail by HTMLSpinnr · 2011-08-07 15:26 · Score: 4, Informative

For EC2, it's only distributed if you pay to have your "service" running in more than one availability zone.

--
$ man woman *
-bash: /usr/bin/man: Argument list too long
Re:Cloud fail by Concerned+Onlooker · 2011-08-07 15:31 · Score: 1

As near as I could tell, Web 2.0 boiled down to one thing: The HTTP request object in javascript.
Where I work, people who are normally major control freaks are seemingly eager to let their processing and data storage go to The Cloud. It will be interesting to see how long The Cloud is down from this lightning strike. The control freaks don't like it when a company web app isn't accessible for even a few minutes, let alone an hour.

--
http://www.rootstrikers.org/
Re:Cloud fail by Anonymous Coward · 2011-08-07 15:37 · Score: 0

Distributed locations are only useful when one makes use of them. I assure you that Amazon has more than one datacenter in Dublin, and others in Europe. But if you're an EC2 client, you need to be willing to pay to make use of them.
Re:Cloud fail by Anonymous Coward · 2011-08-07 15:40 · Score: 1

No, what you're completely misunderstanding is the event. Yes, Amazon EU-WEST-1 went down. Whether people's applications were affected depends on whether people followed those very same cloud architecture principles you mentioned. If they did, the EU-WEST-1 failure shouldn't affect them. Amazon has dozens of availability zones all over the world that AWS users can deploy their code to.
The failover mechanisms you talk about are for AWS' users to follow, not Amazon itself.
Re:Cloud fail by TubeSteak · 2011-08-07 16:00 · Score: 2

Or am I completely misunderstanding the meaning of this latest buzzword?
The main feature of cloud computing is the ability to almost instantly increase your capabilites and capacities.
At its most basic level, you're talking about a pool of computing resources that can be doled out without regards to the underlying hardware.
There's no promise that this cloud (the aforementioned pool of hardware) is geographically distributed.
Like any other hosting, you have to pay extra if you want your data replicated at another hosting center.

--
[Fuck Beta]
o0t!
Re:Cloud fail by Anonymous Coward · 2011-08-07 16:02 · Score: 0

My understanding of the point of cloud computing was that it would be distributed. I.e. the failure of any one data or computing center would mean the data was still available. Hence, the term "cloud": nebulous, non-localized. .
Sorry - you must be thinking of our n+1 release of cloud computing. You are still on n right?
Re:Cloud fail by Anonymous Coward · 2011-08-07 16:08 · Score: 0

EC2 lets you lease servers by the hour. Your server has to live somewhere. If that somewhere loses electricity, your server is gone. There's no way around that.
Re:Cloud fail by Anonymous Coward · 2011-08-07 16:21 · Score: 0

Next you're going to be saying that 4G implies some sort of minimum data speed and set of features as defined by some industry organization.
You're never going to get very far in marketing, you know.
Re:Cloud fail by Wolfling1 · 2011-08-07 16:26 · Score: 4, Insightful

Ah, yes. There is that.

At the moment, my company is aggressively encouraging our customers to avoid the Cloud at all costs. Let me explain why.

Whilst the technology exists for the cloud to deliver fault tolerant distributed storage, when you choose to put data in the cloud, you are choosing to relinquish control of the data. You are placing it in the hands of someone else. Quite probably an organisation that you do not know intimately. Quite probably an organisation that is based in a different legislative region - probably another country.

You have little to no capacity to audit their system. You have little to no capacity to test their fault tolerance. And here's the sucker punch - you have little to no legal comeback for the consequences if something bad happens.

If your data contains any personal information about another person, you are placing the privacy of that person in the hands of an organisation you do not control, and upon whom you cannot enforce any legislative restrictions.

So, unless you are seriously geared up to investigate and audit your prospective cloud provider - and they are willing for you to do so, the only data you can safely put in the cloud is data that would be basically irrelevant to your core business anyway. Until the fundamental issues of privacy, security and accountability are resolved - or dramatically improved - placing core business data in the Cloud is a massive corporate risk.

They should not have called it the 'Cloud'. They should have called it the 'Arse' - because if your management are planning to stick their heads in one, they may as well stick their heads up the other. I don't imagine that 'Arse Computing' would be as popular though.
Re:Cloud fail by Anonymous Coward · 2011-08-07 16:26 · Score: 1

Compared to under "the control" of a lot of very confused systems people, it's actually far superior to local administration. I've seen far too many confused systems people who've never actually worked with virtualization or providing systems monitoring who insist on inventing their own environments from scratch, and cause far too much chaos doing so.
In these cases, the ill trained local administrator who's busy clearing away the viruses on the boss's son's computer, or flushing the porn caches that people have been storing on the in-house storage because "disk is cheap", can work on the smaller local tasks and point the finger at someone outside the company for insisting on security policy, proper backup, or actually setting a realistic Service Level Agreeement.
Re:Cloud fail by Anonymous Coward · 2011-08-07 16:56 · Score: 1

Using EC2 or Azure there is nothing stopping you from having redundancy across multiple data centers in totally different countries if required. If people don't bother, their bad.
Re:Cloud fail by mcrbids · 2011-08-07 17:46 · Score: 2

Perhaps it would be a good idea to start by defining what exactly "cloud computing" means?
Because looking at the Wikipedia article I see only a brief mention of reliability through redundancy: Reliability is improved if multiple redundant sites are used, which makes well-designed cloud computing suitable for business continuity and disaster recovery.
As CTO of a data hosting "cloud services" provider myself, I'm proud of our track record for reliability and redundancy. All our systems are backed up offsite to multiple locations every 24 hours. At all times we provide off-site hosting facilities at about 50% of the capacity of our primary production cluster for disaster recovery scenarios. All our systems are redundant on site; the loss of any single system can be healed within an hour or so. Our system uptime averages approaches 4 nines over years in an industry where 2 nines is considered exceptional.
Having systems replicate in near-real-time to multiple locations and autonomically route around large network outages on extremely short notice is an extremely tough and expensive thing to do. It sounds simple enough, but the devil is in the details. The number of things that can go wrong is simply staggering, and trying to account for every possible thing that can go wrong is simply infeasible to proactively account for.
What do you do when the problem is due to a router outage offsite? What if 70% of routes work, and 30% don't due to a BGP burp? What if latency is 50ms? 100ms? 200ms? At what point is it "down" ? What if network conditions are excellent but there's a problem with DNS? What if a power surge generated by your UPS (yes, it happens!) takes out 20% of your production cluster? Do you heal or switch to hot backup? What if it takes out 30% of your production cluster? 40% 60%?
People think of "online" is a boolean yes/no question, but it's not. We calculate our uptimes based on expected end-user experience, but rarely is our production cluster actually working at 100%. Nearly always, some system or other is in need of attention and it doesn't constitute an emergency because there's still additional redundancy in the system.

--
I have no problem with your religion until you decide it's reason to deprive others of the truth.
Re:Cloud fail by Anonymous Coward · 2011-08-07 17:57 · Score: 0

But even that failed not long ago didn't it?
Re:Cloud fail by Sarten-X · 2011-08-07 18:00 · Score: 2

I quit paying attention to your explanation/rant after one particular choice of words:

avoid the Cloud at all costs.
I immediately envision a scenario where the cost of setting up management, infrastructure, and equipment is significantly larger than the cost of losing a portion (or perhaps even all) of a company's data or processing capacity. Rejecting cloud services as a viable option regardless of the actual cost is just as asinine as rejecting the option for turning on hot water in the bathroom sink, because it just might be too warm for somebody.
When I risk my sanity long enough to pick out a few more words, I find you dismissing the cloud as being only suitable for "irrelevant" data. Apparently, all data is either "core business data" or "irrelevant", and there's no such thing as "nice to have around", "those old backups", or simply "not worth handling on our own". Of course, the existence of special-purpose clouds is ignored, along with private and internal clouds.
As for auditing, uptime, and legal consequences, you've apparently never dealt with a service contract. If the contract mandates five nines of uptime, and includes a clause making them liable for all damages and loss, that's a pretty hefty legal comeback.
I do sincerely hope I'm never a customer of your company.

--
You do not have a moral or legal right to do absolutely anything you want.
Re:Cloud fail by Sarten-X · 2011-08-07 18:03 · Score: 1

Gravity-powered fluidics?

--
You do not have a moral or legal right to do absolutely anything you want.
Re:Cloud fail by Anonymous Coward · 2011-08-07 19:26 · Score: 0

VMs are being bogus-ly advertised by companies as Cloud. If you dig deep down, its just a hosted VM labelled as Cloud.
Re:Cloud fail by Anonymous Coward · 2011-08-07 19:51 · Score: 0

For EC2, it's only distributed if you pay to have your "service" running in more than one availability zone.
yep
Re:Cloud fail by jimicus · 2011-08-07 20:02 · Score: 3, Informative

Cloud computing is a buzzword meaning "don't run your own hardware, run your business on someone elses". Which might mean anything from a virtual server that you manage at one end of the sophistication scale to a SaaS product at the other.
All sorts of aspects of this are optional. Including:
1. Whether or not you manage the underlying operating system - including things like security patches and hardening. You can choose a cloud computing provider that has sysadmins deal with that for you and just run the application yourself; they are a LOT more expensive than Amazon.
2. How much effort your provider puts into making their systems geographically redundant. Few will talk openly about this; I'm prepared to bet hard cash this is because the vast majority that offer you a virtualised server are just using a web interface to expose a fairly vanilla Xen-with-a-SAN infrastructure to the world with everything sat in one place. Providers that will run the OS for you and can honestly say their infrastructure accounts for complete data centre loss are like hen's teeth.
3. If you've gone for a SaaS provider - how much effort their developers went to to ensure their application can stand up to everything up to and including total loss of a data centre. And whether or not they test for such an occurrence.
Re:Cloud fail by sjames · 2011-08-07 20:10 · Score: 1

Well, according to the trade press, "the cloud" is both a floor wax and a dessert topping (plus a bag of chips).
In reality, it's fine grained virtual server rental with an API. Nothing more and nothing less.
While that is quite useful if you need it, it doesn't magically fix any problems other than highly variable server loads. As we see here, it isn't a magic wand for uptime/availability.
Re:Cloud fail by dkf · 2011-08-07 20:26 · Score: 1

In reality, it's fine grained virtual server rental with an API. Nothing more and nothing less.
That's exactly what IaaS is. Higher levels (notably SaaS) are a bit more; in those cases you're buying the service — wherever that's running $mdash; and not the server. You get less control but have to do less of the legwork yourself. It's the classic value-add model, and it makes a bunch of sense for many people. (Not everyone, but that's OK. Nobody can be all things to all people.)

--
"Little does he know, but there is no 'I' in 'Idiot'!"
Re:Cloud fail by jimicus · 2011-08-07 20:31 · Score: 3, Interesting

As for auditing, uptime, and legal consequences, you've apparently never dealt with a service contract. If the contract mandates five nines of uptime, and includes a clause making them liable for all damages and loss, that's a pretty hefty legal comeback.
I agree with you entirely, it's an absolutely beautiful piece of legal comeback. But every service contract I've ever seen is so full of ifs, buts and other assorted get-outs that it's very rare to actually be able to hold someone to it.
The one time I have seen an SLA that was actually quite good, the company in question didn't refuse to honour it. Oh no. They went one better - they hadn't even told their staff that it existed, so if you asked about it you'd get a response along the lines of "What's an SLA, then?" The only way you'll get an SLA honoured in those circumstances is to take your provider to court, and you can bet that if you do they'll drop you like a hot potato. So you probably wouldn't bother in any but the most egregious of circumstances.
Re:Cloud fail by Anonymous Coward · 2011-08-07 20:40 · Score: 0

I have to take some issue with that definition. We already had instantly available resource tailoring before the cloud, through virtualization.
Granted, redundancy is not a core attribute of the cloud, it's a feature. The core attribute is CDN (content distribution network) capability. But I think you can see how people would assume that a sane, robust CDN would have some redundancy built in. Frankly, I don't understand how they can offer a CDN without redundancy and geographically distributed caching. If the whole thing still depends on a single datacenter than it shouldn't be called a cloud, I don't see the difference between a normal datacenter and a "cloudy" one.
I call marketing as the main culprit here. When you advertise your service as being "in the cloud", people assume both CDN and redundancy. They should have made it clear that the low-end service is NOT in the cloud.
Re:Cloud fail by DrXym · 2011-08-07 20:52 · Score: 1

As near as I could tell, Web 2.0 boiled down to one thing: The HTTP request object in javascript.
Well that was the main thing but greater maturity of DOM, HTML and CSS played their part. Even before XMLHttpRequest turned up it was still possible to do something analogous by doing a form submitting within an invisible iframe and parsing the result. XMLHttpRequest made things a lot easier although same origin policy was a huge pain. Most modern browsers allow cross origin requests these days but it's still a pain to set up.
Re:Cloud fail by Trogre · 2011-08-07 22:48 · Score: 1

You were lied to.

--
"Nine times out of ten, starting a fire is not the best way to solve the problem." - my wife
Re:Cloud fail by aug24 · 2011-08-07 22:48 · Score: 1

Oh silly underinformed person. There is a datacentre in Dublin. It is one of six Amazon datacentres. The others were unaffected, as were (our) public facing services, because only some (of our) servers are placed in Dublin.
It looks like a cloud from the outside. Those of on the inside know where the servers are because we want to choose where we place them for latency / redundancy reasons.

--
You're only jealous cos the little penguins are talking to me.
Re:Cloud fail by Overzeetop · 2011-08-08 00:12 · Score: 2

(1) How would you not be backing up to a local resource?
(2) Why would you not be using encryption at your user site before sending data to the remote server?
If you're small, choose a small service like SpiderOak.
If you're large, build a custom front end.
I don't have particularly sensitive data, so I don't encrypt at my end, though in theory I could (but it would be a pain) and SO had issues with their distributed synchronization back then (note: if I were big enough, I could have hired an IT person to manage the my servers and set op a client side sync that worked). I have no less than 4 copies of my data at two separate local locations, at least one of which is always off-line, in addition to the remote service.
Distrust is a good thing, but it can, imho, be managed.

--
Is it just my observation, or are there way too many stupid people in the world?
Re:Cloud fail by AtomicJake · 2011-08-08 01:22 · Score: 1

Whilst the technology exists for the cloud to deliver fault tolerant distributed storage, when you choose to put data in the cloud, you are choosing to relinquish control of the data. You are placing it in the hands of someone else. Quite probably an organisation that you do not know intimately. Quite probably an organisation that is based in a different legislative region - probably another country.

Which is the real issue: No way for a European company to use a US cloud provider - Amazon, Azure, Google. The Patriot Act is prohibitive here.
Re:Cloud fail by codepunk · 2011-08-08 02:30 · Score: 1

"At the moment, my company is aggressively encouraging our customers to avoid the Cloud at all costs. Let me explain why."
Because it is eating your profits.
The same lightning strike we are talking about could have just as easily taken out a in-house data center. I would also argue with that being the case the situation would have likely even been worse for the customer.

--

Got Code?
Re:Cloud fail by sjames · 2011-08-08 08:59 · Score: 1

Of course, SaaS is much older than any use of the term "Cloud Computing". Depending on where you want to draw a line, virtual web and mail hosting from the '90s could be considered SaaS. If that's too old-school, then Salesforce from right around 2000 would certainly qualify.
Re:Cloud fail by Rich0 · 2011-08-09 00:56 · Score: 1

If your data contains any personal information about another person, you are placing the privacy of that person in the hands of an organisation you do not control, and upon whom you cannot enforce any legislative restrictions.
Yes, but it does give you somebody to point your finger at when there is a data loss. That is almost always enough for a manager to get to keep his job (assuming he wasn't already promoted for all the cost-savings years ago). Often it is enough to keep a company out of legal trouble (but this is AMAZON - what person wouldn't assume they know what they're doing with personal data?).
Unless a company is privately owned there is rarely enough oversight to keep it from betting the farm on the latest get-rich-quick scheme. Just look at Wall Street a few years ago.
Re:Cloud fail by Anonymous Coward · 2011-08-09 03:30 · Score: 0

My experience in the corporate world tells me that once a hot new buzzword comes out, it's applied to EVERYTHING: Because corporate executives are always either complete idiots, in which case they're not interested in something unless it's accompanied by the new buzzword, which they actually have no comprehension of; or are utter and complete psychopathic con artists, in which case words contain no meaning except for how they can manipulate people. In neither case does anyone give the slightest crap whether something has any basis in reality. Which is why you DON'T want corporations running the world, it presages nothing but an endless spiral into oblivion.

Customers should be looking for alternatives by Anonymous Coward · 2011-08-07 15:23 · Score: 0

Such as Virtustream

clouds and lightning go together by karuna · 2011-08-07 15:24 · Score: 1

Well, when there are clouds it often rains and occasionally they produce thunder and lightning. I guess Amazon though that they can have clouds without worrying about their byproducts.

More 9's falling from the sky? by kolbe · 2011-08-07 15:28 · Score: 1

We have all read about the difficulty and expense of providing reliability in the cloud (the so-called five nines) as well as the fact that as more popular web services rely on cloud platforms, the more people rely on those services. As such, I cannot help but wonder what kind of fallout will happen after this latest event, but I do get the feeling that this "Lightening strike" may erode the vCloud marketing of 5x9's uptime just a wee bit more.

Re:More 9's falling from the sky? by Anonymous Coward · 2011-08-07 15:40 · Score: 0

I do get the feeling that this "Lightening strike" may erode the vCloud marketing of 5x9's uptime just a wee bit more.
If that's the problem, then I think they just need to get a "lightening breaker"
Re:More 9's falling from the sky? by bluemonq · 2011-08-07 15:49 · Score: 1

It's 5x9 IF you pay for more than one instance. You DID pay for instances in multiple geographic locations, right?
Re:More 9's falling from the sky? by Roachie · 2011-08-07 16:58 · Score: 1

Oh, you wanted the remote control that that? Well, that's another $45.99. Yea, you DIDNT say you wanted a remote control before, yea, glad to get that out of the way...
Can I tell you about our maintenance program?

--
This sig is not paradoxical or ironic.
Re:More 9's falling from the sky? by bluemonq · 2011-08-08 12:58 · Score: 1

The point of having a cloud backup is that it's far removed from whatever local setup you have. If you have a cloud setup then it should be relatively simple to have geographically distributed backups/VMs/instances. Some PHB misunderstood that as cloud computing as being inherently more reliable and secure. Having a single instance in the cloud as your sole asset doesn't get you redundancy. You're an idiot if you believed otherwise.

at that level the safety's tipped foreing a manual by Joe_Dragon · 2011-08-07 15:28 · Score: 1

at that level the safety's tipped foreing a manual cut over to backup. Also Surge Protectors can't really take a direct lighting strike.

But any ways for the cloud to work you will need data centers all over the place for good lag and for back up. Now Dublin is good for not needing lot's of cooling but still you don't want to put all the severs in one area so other stuff like over seas data line cuts can't take down systems far away. If on side severs at least you can get some work done with no or a slow back up link to the out side.

The selling point... by Genda · 2011-08-07 15:33 · Score: 1

... And if we aren't 1000%, absolutely, positively reliable may God Strike Us... BLAM!!!!!!

Re:The selling point... by Oxford_Comma_Lover · 2011-08-07 15:41 · Score: 1

... And if we aren't 1000%, absolutely, positively reliable may God Strike Us... BLAM!!!!!!
Sorry sir. The thousands place is the sign bit in our percent calculations.

--
-- IANAL, this isn't legal advice, and definitely isn't legal advice for you. Also, Squee!
Re:The selling point... by Roachie · 2011-08-07 17:03 · Score: 1

"I swear to God baby, I aint cheated on you. If I cheated may God strike this data center dead...
Whoooooop.
"Uhhhha... funny that...."

--
This sig is not paradoxical or ironic.
Re:The selling point... by pinkushun · 2011-08-07 19:38 · Score: 1

(posting to undo wrongful mod, a bug in the system? dropdown mod combo box, use arrows to scroll, press escape to cancel, it then applies the last highlighted mod option :/)
was meant to mod funny btw :)
Re:The selling point... by Pieroxy · 2011-08-07 21:09 · Score: 1

(posting to undo wrongful mod, a bug in the system? dropdown mod combo box, use arrows to scroll, press escape to cancel, it then applies the last highlighted mod option :/)
It's not a bug, it's a feature. If you want to cancel your mod, you have to unplug the RJ45 at the back of your computer first. RTFM for God's sake !!!

--
Write boring code, not shiny code!

Re:two words : Surge Protector by sjames · 2011-08-07 15:34 · Score: 2

Have you ever seen a surge protector after a direct strike? The MOVs don't help much once they vaporize.

A surge protector is mostly useful against the more common near misses.

Microsoft renamed its product by countertrolling · 2011-08-07 15:36 · Score: 5, Funny

Office 364

--
For justice, we must go to Don Corleone

Re:Microsoft renamed its product by Anonymous Coward · 2011-08-07 16:55 · Score: 0

No, the code name inside Microsoft for it was Office 180.
Re:Microsoft renamed its product by SynthaxError · 2011-08-07 19:54 · Score: 1

Hey, I do think that "Microsoft's BPOS" sounds good.
Even if it may be redundant since you can't expect something else as a POS SW from Microsoft :-)

--
"There is no dark side of the moon really. Matter of fact it's all dark."
Re:Microsoft renamed its product by hedleyroos · 2011-08-07 20:06 · Score: 1

I thought Bloated Piece Of Sh*t was a good acronym :)

Sunny by garyoa1 · 2011-08-07 15:45 · Score: 1

So you're saying there wasn't a cloud in the sky, huh?

--
Wuddooeyeno? IITYWYBMAD? Like nuts? eclecticallyincorrect.com

Re:two words : Surge Protector by tpstigers · 2011-08-07 15:45 · Score: 1

Read the article. Please?

Serves them right by RobinEggs · 2011-08-07 15:59 · Score: 2, Informative

Those massive data centers only existed because Microsoft and Amazon channeled profits through Irish subsidiaries to avoid US taxes. They serve some legitimate functions for customers in the UK as a matter of convenience (why build two data centers?), but they're primarily money laundering centers.

I'd call a few lightning strikes the least of the punishments those data centers - and the entire infrastructures to which they're attached - really deserve.

Re:Serves them right by XaXXon · 2011-08-07 16:18 · Score: 2

The AWS services out of dublin aren't through an Irish subsidiary. It's just regular AWS.
If you know differently, please document it.
Re:Serves them right by Anonymous Coward · 2011-08-07 17:02 · Score: 0

You're an idiot. Why should money both made and kept out of the US be subject to US taxes? This is why they're called "multinational corporations" - they don't exist just in one place.
Re:Serves them right by Anonymous Coward · 2011-08-07 17:31 · Score: 0

They serve some legitimate functions for customers in the UK as a matter of convenience (why build two data centers?), but they're primarily money laundering centers.
FTA:
"Dublin has become a key cloud computing gateway to Europe and beyond for U.S. companies due to several factors, including the city’s location, connectivity, climate and ready supply of IT workers. Dublin’s temperature is ideal for data center cooling, allowing companies to use fresh air to cool servers instead of using huge, power-hungry chillers to refrigerate cooling water."
Also, please not, that if the UK had more a progressive, enterprise friendly tax system...... oh, sorry you're a MONARCHY
Re:Serves them right by Billly+Gates · 2011-08-07 18:02 · Score: 1

Ironically, if you or I bought a PO Box to avoid state taxes we would be thrown in jail. Why do businesses not worry about this?

--
http://saveie6.com/
Re:Serves them right by Anonymous Coward · 2011-08-07 19:08 · Score: 0

That's not entirely true - there are also laws that require certain data to be kept inside the EU. Without an EU data center, companies could not use the cloud for this.
Re:Serves them right by TubeSteak · 2011-08-07 19:28 · Score: 1

The AWS services out of dublin aren't through an Irish subsidiary. It's just regular AWS.
I don't think that's what the GP was getting at.
Due to their complex tax [strike]avoidance[/strike] management plans, American based multi-nationals end up with billions of dollars overseas.
They can't bring those billions back to the USA without being taxed, which would completely negate the point of booking them offshore to begin with.
The end result is that they invest it overseas instead of domestically, because it looks better for their bottom line.
Just because AWS LLC isn't an Irish subsidiary doesn't mean that the money used to create their infrastructure didn't come from that route.

--
[Fuck Beta]
o0t!
Re:Serves them right by romiz · 2011-08-07 19:48 · Score: 1

Also, please not, that if the UK had more a progressive, enterprise friendly tax system......

A comparatively small country will always be able to set lower taxes for enterprises than their larger neighbors. This is because it only needs to attract a small percentage of the larger countries taxable income to offset the loss due to the lower tax rate, whereas the big country only lowers its income when it does this. Fiscal competition in an open market is a fraud against the people of larger countries, preventing them to set the tax rates of the companies that do business there.
Re:Serves them right by Anonymous Coward · 2011-08-07 20:00 · Score: 0

"Also, please not(sic), that if the UK had more a progressive, enterprise friendly tax system...... oh, sorry you're a MONARCHY"
Please note, that if you had >1 brain cell you would know that the mostly ceremonial duties our monarch performs as part of a constitutional monarchy/parliamentary democracy does not include the setting of any taxation system or policies.
The challenge to the powers wielded by a UK monarch can be traced back to the year 1215 in the document Magna Carta and more so in the Bill of Rights passed by parliament in 1689.
Re:Serves them right by Anonymous Coward · 2011-08-07 22:50 · Score: 0

Oh sorry, you're a RETARD.
Re:Serves them right by nedlohs · 2011-08-08 00:28 · Score: 1

Because it is for US citizens/residents, and corporations are people right?

Comment removed by account_deleted · 2011-08-07 16:08 · Score: 1

Comment removed based on user account deletion

Power Co-Generation by anubi · 2011-08-07 16:12 · Score: 4, Interesting

While working at Chevron Oil Pascagoula Mississippi refinery, I noted Chevron had the same problem. Loss of electrical power to the refinery would be catastrophic. No one wants to be around tons of petrochemical products undergoing serious chemical reactions when one loses control.

To mitigate this threat, Chevron worked with Mississippi Power to operate a power generation facility at the refinery.

I would think that anywhere there is a substantial "data processing farm" with critical power requirements, business arrangements should be made with the power generation utilities to run a natgas power plant in the immediate area.

The utilities often run these plants as "topping" plants, as they needed anyway to even out short-time load variances on the line.

But, in the event of a serious loss of grid power, it can be awful handy to have a few megawatts of power coming from down the street.

--
"Prove all things; hold fast that which is good." [KJV: I Thessalonians 5:21]

Re:Power Co-Generation by antifoidulus · 2011-08-07 17:00 · Score: 2

RTFA! They do have generators! The lightning strike was apparently so powerful that is affected the backup generators synchronization equipment.

"Normally, upon dropping the utility power provided by the transformer, electrical load would be seamlessly picked up by backup generators,â Amazon said in an update on its status dashboard. âoeThe transient electric deviation caused by the explosion was large enough that it propagated to a portion of the phase control system that synchronizes the backup generator plant, disabling some of them.â

--
Monstar L
Re:Power Co-Generation by Anonymous Coward · 2011-08-07 17:46 · Score: 0

RTFA! They do have generators! The lightning strike was apparently so powerful that is affected the backup generators synchronization equipment. .....which means there wasn't anyone there to manually start them up?
Re:Power Co-Generation by anubi · 2011-08-07 17:47 · Score: 1

IIRC, the generators at Chevron were not "backup" generators, per se. They were fullbore online generators, run by the utility.

These were not backup generators. Even though that was their alternate function. These were online and ran 24/7.

The problem with backup generators is they sit idle too much and you are likely not to discover a problem until you fire them up.

The continuously running generators, operated and maintained by the utility, had a very low probability of failure, and were sized to easily accommodate the refinery load, so that if a divorce from the grid was inevitable, we could continue.

There are economies of scale from running a continuous generation facility locally that justify the costs of more robust equipment than can be justified by something that sits idly by waiting until you need it.

Proper generator operation is verified 24 hours a day by the circumstance of it being online. I would consider the possibility of a rogue lightning bolt doing it in, along with the whole grid, but having the skill and resources of the utility to bring it back up would help a lot.

Never underestimate the skills of a determined utility crew.

--
"Prove all things; hold fast that which is good." [KJV: I Thessalonians 5:21]
Re:Power Co-Generation by Billly+Gates · 2011-08-07 17:58 · Score: 2

When I was an undergrad I worked a crappy job at a Florida amusement park. Lightning capital of the world is in Pasco County Florida which was about 30 minutes away.
They generated their own power and the power lines and even the roller coasters were designed to be struck by lightning. Let me tell you they were struck every 3 or 4 days during the raining season in summer from the monsoons from the Carribean and Gulf. During a bad storm the power lines and rides could be struck 2 to 3 times each in a 30 minute period. They always kept running as soon as the storm would pass.
If a shitty amusement park can handle it I would think a datacenter would have much more expensive and critical components. Disney World in Orlando even generates its own power and powers part of Orlando.

--
http://saveie6.com/
Re:Power Co-Generation by Monoman · 2011-08-07 22:00 · Score: 1

Yeah but how many times did the amusement park get taken offline before they came up with a solution. The old saying "it is not a matter of if, it is a matter of when" often applies. Unless compelled by some law, upper management often listens to the bean counters until something goes wrong ... then they start asking questions and opening the checkbook.

--
Keep the Classic Slashdot.
Re:Power Co-Generation by director_mr · 2011-08-07 23:30 · Score: 1

Sure you could make a datacenter handle a lightning storm like that easily. Simply take it off line every time you see a lightning storm. When the storm has passed, you bring it back online again. Just like they do with the amusement park.

Some people may have problems with that kind of availability and prefer the risk of the datacenter going down 1 day over a period of years instead of going down every lightning storm. But they are just being overly picky I am sure.
Re:Power Co-Generation by Billly+Gates · 2011-08-08 00:54 · Score: 1

Well in my case the bean counters looked at maintance costs and rides being down for the whole park because they did get struck by lightning in a continual basis.
If the bean counters stress no redundancy for the customers who demand 99.99 uptime in their contracts then the are retarded and not doing thier job. I do agree I hate PHBs too but a proper bean counter will do the analysis and look into all angles. We simply hate them because they never want to pay us more than the minimium.

--
http://saveie6.com/

Re:two words : Surge Protector by kent_eh · 2011-08-07 16:16 · Score: 1

Have you ever seen a surge protector after a direct strike? The MOVs don't help much once they vaporize.

It's amazing how many people who call themselves engineers don't get that surge suppressors are non-resettable.

"what do you mean the power supplies got destroyed. We've had surge suppressors in there for years..."

--

---
"I can't complain, but sometimes still do..." Joe Walsh

Re:two words : Surge Protector by DigiShaman · 2011-08-07 16:27 · Score: 1

When you buy a surge protector, what you're really buying is that little insurance policy that you're supposed to fill out. That's it.

--
Life is not for the lazy.

what a senseless act by DSS11Q13 · 2011-08-07 16:28 · Score: 1

Cloud on cloud violence like this cannot be tolerated!

Lesson by Anonymous Coward · 2011-08-07 16:28 · Score: 0

I just love how Mother Nature gives Mankind a spanking now and again, just to remind us who is boss.

Come to the cloud... by Jawnn · 2011-08-07 16:33 · Score: 1

...where life is grand and all your apps are immune to...BzzzzzZZZZZT!

BPOS by Anonymous Coward · 2011-08-07 16:40 · Score: 0

Blue Pane of Sin?

Re:BPOS by maxwell+demon · 2011-08-07 18:59 · Score: 1

Ballmer's POS?

--
The Tao of math: The numbers you can count are not the real numbers.

How shocking! by NicknamesAreStupid · 2011-08-07 16:41 · Score: 4, Informative

This is not the first time -- http://www.datacenterknowledge.com/archives/2009/06/11/lightning-strike-triggers-amazon-ec2-outage/ .

Re:How shocking! by Anonymous Coward · 2011-08-07 17:50 · Score: 0

Here's the only Irish media coverage of it I've found so far.
http://searchtopics.independent.ie/article/0g8m5GIcvz8Lr?q=Dublin%2C+Ireland

distributed != replicated by dutchwhizzman · 2011-08-07 17:05 · Score: 1

Distribution means your virtual machine can be on a number of machines inside a cloud. There's nothing in the definition of a cloud that says it has to be in different locations, or running mirrored copies of your instance. Sure, it's possible, just like it's possible with single machines. When will people stop assuming that "cloud" means "indestructible"? This is exactly what happened before with EC2 and lots were hurt then by the same assumption.

--
I was promised a flying car. Where is my flying car?

Re:at that level the safety's tipped foreing a man by Animats · 2011-08-07 17:26 · Score: 4, Informative

Also Surge Protectors can't really take a direct lighting strike.

But lightning arrestors can. A serious lightning arrestor is a spark gap (sometimes open air, sometimes in an inert gas) to ground, with a very heavy cable or busbar to multiple ground rods, and no sharp turns in the path to ground. This is followed up by an inductor which is a few turns of busbar. This gear is usually placed where power lines or antenna feeds enter a building. MOV-type protection is further downstream.

Antenna towers are struck by lightning frequently, and the associated radio gear routinely continues to operate. This isn't rocket science. It's big hunks of copper.

The Hartford Steam Boiler Inspection and Insurance Company, in their publication "The Locomotive" (they've been at this since 1867) has a good article on lightning protection. Hartford Steam Boiler insures not only against boiler explosions, but things like downtime due to lightning strikes. But only after their inspectors (they have 1200) have visited the plant and are satisfied with the equipment.

A question to ask your "cloud" provider - who handles your business interruption insurance, and do they inspect your faclities?

2 things tto remember by rossdee · 2011-08-07 17:43 · Score: 1

Murphy was an Irishman

Every Silver Cloud has a leather lining.

Lightning in the Cloud....go figure by Anonymous Coward · 2011-08-07 17:52 · Score: 0

I can see where that possibility went over their heads.

THAT's NOT a CLOUD by Crypto+Gnome · 2011-08-07 17:55 · Score: 2

Despite ALl the market-hype and brew-haha going on, the simple fact remains:

If ALL your computing power is in ONE SINGLE DATACENTRE then what you have is a DAMP SPOT not a CLOUD.

--
Visit CryptoGnome in his home.

Good! Now maybe people will know! by erroneus · 2011-08-07 19:23 · Score: 1

All of the cloud computing hype has business everywhere, once again, buying what they don't understand just as they did during the dotCOM bubble. That particular bandwagon caused all sorts of damage to the industry including a flood of people unsuited to the line of work and suppression of wages that don't seem to have ever returned. Now business continues to crave cheap, Walmart-ized IT services and are seeking to get it any way they can; outsourcing to 3rd world nations and most recently entrusting cloud computing where they put not just all of their eggs, but the eggs of hundreds if not thousands of other businesses in one basket waiting for moments like this.

And it's not like these lessons were never made available in smaller doses. There has been more than one Blackberry service outage to highlight the fact that all Blackberry traffic passes through Blackberry servers and will halt when Blackberry fails. How much worse when you entrust whole parts of critical business functionality to "the cloud"?

13 - 10 = 2 by Anonymous Coward · 2011-08-07 19:40 · Score: 0

Am I the only one that feels TFA isn't correct? From TFA:

Amazon said the power outage began at 10:41 a.m. Pacific time, with instances beginning to recover about two hours later at 1:47 p.m.

To me, that seems like 3 hours, not 2...

Sounds strange by drolli · 2011-08-07 19:59 · Score: 1

(see below) Why does synchronization have to be manual? Or is this a safety feature that the automatic synchronization decides to go offline if things get weird?

From the Article:

âoeNormally, upon dropping the utility power provided by the transformer, electrical load would be seamlessly picked up by backup generators,â Amazon said in an update on its status dashboard. âoeThe transient electric deviation caused by the explosion was large enough that it propagated to a portion of the phase control system that synchronizes the backup generator plant, disabling some of them.â

âoePower sources must be phase-synchronized before they can be brought online to load. Bringing these generators online required manual synchronization. Weâ(TM)ve now restored power to the Availability Zone and are bringing EC2 instances up.â

marketing bullshit by Tom · 2011-08-07 20:20 · Score: 4, Insightful

And there is the marketing bullshit revealed. All the promises of the cloud - down by one lightning strike.

Because, let's face it, the whole "cloud" thing as they sell it is just advanced virtual hosting with a different name. The only real cloud capabilities are those the big companies build for themselves, and they did things like that 10 years ago already, when nobody had ever heard the term "cloud" used in computing contexts.

In the end, it's about selling something to people who already have the older version and convincing them to buy the new one. So you give it a different name because a "new" product sells easier than the upgraded version of an "old" product.

Anyone remember when "Web 2.0" was all the hype? It really wasn't a 2.0 as we all know. There was nothing new in it, all components had been around for a long time. It was a conceptual bundle, but not a new version like the name suggested.But "we're doing more Javascript" now doesn't sell nearly as good as "we're moving to Web 2.0 now".

--
Assorted stuff I do sometimes: Lemuria.org

Re:marketing bullshit by aug24 · 2011-08-07 22:52 · Score: 2

But they weren't down. I have servers in Dublin. I also have striped redundancy across other EC2 datacentres. We had 100% uptime last night and I'm now watching the Dublin based machines recover gracefully.
Yes, I agree it's "advanced virtual hosting with a different name". But it didn't break its promises.

--
You're only jealous cos the little penguins are talking to me.
Re:marketing bullshit by Tom · 2011-08-08 01:14 · Score: 1

Yes, I agree it's "advanced virtual hosting with a different name". But it didn't break its promises.

That depends on what promises you mean.
"The Cloud" has been hyped as this mystical thing that means you just move your servers "into the cloud" (whatever that means) and mystically physical location ceases to matter.
And that promise was broken. You need to worry about things like redundancy, multiple data centers, etc. etc. - just like you did before. I've designed HA systems myself many, many years ago when "cloud" was something in the sky. We set up our own redundant upstreams and taught our routers BGP. Outsourcing that doesn't magically turn it into a mystic entity.
It's the same stuff under a different name, that's all. That doesn't mean it doesn't work - far from it. But you still have to worry about the same things. Because the real world doesn't change just because marketing has come up with a new campaign. Some of these days I swear that everyone in marketing has gone to lala land and they actually believe that by changing names they can change the world. Confusing the menu with the meal, the old problem.

--
Assorted stuff I do sometimes: Lemuria.org
Re:marketing bullshit by Junta · 2011-08-08 02:44 · Score: 1

But it didn't break its promises.
I can read countless stories of people who use things like EC2 and express disdain toward users afflicted by downtime because they didn't understand how to work it. When all is said and done, the users using EC2 'right' seem to end up having to worry all the frustrating details of HA they worried about before, 'Cloud' doesn't change that. The 'promise' is that you naively slap your workload into a 'cloud' provider and you don't have to think so hard about all of that. Even if Amazon doesn't explicitly make that promise, they certainly benefit from the general buzz along those lines and don't do a whole lot to quell that impression with non-technical decision managers.
Now you may say people who naively do that are overvaluing the 'cloud', and while that may be true, one has to seriously wonder what the value of 'cloud' is. I would say a large population of companies 'doing it right' end up spending the money to 'rent' computing power and then end up having to keep on their IT staff anyway and end up saving nothing or spending more.
*Particularly* initiatives like the US government 'moving to the cloud' needs to keep in mind they can't just ditch all those IT experts, and with that in mind, do you still achieve meaningful savings?

--
XML is like violence. If it doesn't solve the problem, use more.
Re:marketing bullshit by Anonymous Coward · 2011-08-08 03:59 · Score: 0

^ We also have instances in Dublin, 100% uptime, and noticeably faster since the 7th. Not sure how that can be related, but it's definitely apparent.
Re:marketing bullshit by aug24 · 2011-08-08 20:27 · Score: 1

I see your point. It's not my impression that moving things 'into the cloud' will magically make any complexity go away. But it does make that complexity easier and easier to manage.
We can stripe servers around the world at service setup time with one command.
Load balancers now intelligently remove defunct machines and increase capacity (from other zones) to compensate, provided you have defined how you want them to act (and no system will ever define that for you for a known cost).
It's not a panacea for all IT ills, but it is strong medicine.

--
You're only jealous cos the little penguins are talking to me.
Re:marketing bullshit by aug24 · 2011-08-08 20:29 · Score: 1

We estimate savings of ~30% of server cost, none on IT staff. If you do it right. If you do it wrong, costs can rise.

--
You're only jealous cos the little penguins are talking to me.
Re:marketing bullshit by Tom · 2011-08-08 23:56 · Score: 1

It certainly is. I do think it's a good evolutionary step. I just don't think it is a revolution. All the stuff (load balancers, redundancy over multiple locations, moving VMs around, etc.) has been around for years before "cloud" was an IT term.

--
Assorted stuff I do sometimes: Lemuria.org

Microsoft failed over to Amsterdam by Anonymous Coward · 2011-08-07 22:51 · Score: 0

BPOS in Europe runs from Dublin as primary, Amsterdam as secondary.

Microsoft's BPOS by markbark · 2011-08-07 23:11 · Score: 3, Funny

Business Productivity Online Suite?
I always thought it stood for "Big Piece of Sh..... never mind

Obviously by arcite · 2011-08-07 23:16 · Score: 1

They should have had three generators!

FF7/Star Fox crossover by tepples · 2011-08-08 00:08 · Score: 1

McCloud? What does the Star Fox team have to do with any of this?

Zeus Anonymous by Crock23A · 2011-08-08 00:44 · Score: 1

So... Zeus accomplished what Anonymous could not.

Or M$ could port it to Xbox by tepples · 2011-08-08 00:44 · Score: 1

If Microsoft really wanted to cut costs, it would allow for up to five or six days a year of downtime and port its client software to a thin client comparable to a game console. Then it could be called Office 360.

Nature always "wins" by drinkypoo · 2011-08-08 00:51 · Score: 1

Nature bats last.

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"

The people fail to understamd by Anonymous Coward · 2011-08-08 04:13 · Score: 0

Regarding Amazon, there was no cloud fail. If you have your instance in more than one availability zone (usually means datacentre) then you wouldn't have an outage. A cloud never implied redundancy like this, its an optitional feature.

Re:The people fail to understamd by Anonymous Coward · 2011-08-08 16:04 · Score: 0

Silly, that's spelled optician, not optition.
- Grammer Notsi

A broken design by Skapare · 2011-08-08 04:34 · Score: 1

You don't need to synchronize the phases if the system is designed right. Keep each generator separate (do not even try to parallel them). Have every machine covered by UPS/battery so they ride through the switching. In high redundancy cases, use an extra transfer switch for a pair of generators for each section. Only do open-transition power transfers (the lights blink out briefly during the change but the UPS/battery system keeps things running). For machines with multiple power input, split them across multiple UPSes for extra redundancy.

--
now we need to go OSS in diesel cars

Cloud or Not you still need to be prepared by Anonymous Coward · 2011-08-08 15:42 · Score: 0

It is important to have a backup plan no matter where you run your apps. Even owned and operated datacenters fail from time to time. http://networkingexchangeblog.att.com/technology/outages-happen-disaster-recovery-the-cloud-and-a-lesson-from-cycling/trackback/

Anonymous by Anonymous Coward · 2011-08-09 18:21 · Score: 0

Is this how Anonymous is going to attack Facebook?

Slashdot Mirror

Lightning Strike KOs Amazon, Microsoft EuroClouds

189 comments