Major Outage At the Amazon Web Services
ralphart writes "The Northern Virginia datacenter for Amazon Web Services appears to be having a major outage that affects EC2 services. The Amazon Forums are full of reports of problems. Latest update from the status page: 2:49 AM PDT We are continuing to see connectivity errors impacting EC2 instances, increased latencies impacting EBS volumes in multiple availability zones in the US-EAST-1 region, and increased error rates affecting EBS CreateVolume API calls. We are also experiencing delayed launches for EBS backed EC2 instances in affected availability zones in the US-EAST-1 region. We continue to work towards resolution."
How am I supposed to be able to not do work?
But how can this be possible? It's The Cloud . This sort of this simply doesn't happen.
If you want news from today, you have to come back tomorrow.
Severe weather hit the area. They shutdown Surry Power Station in Surry County, Virginia after a tornado took the power out that powers the power station.
http://www.stopacop.so -- You have rights. How about standing up for them before they go away?
I'm glad everyone's moving to the cloud for reliability and scalability purposes!
Bummer.
Hebrews 11:8
Jeremiah 33:3
My instance is on us-east-1d which is still up.
The DDoS didn't work so they tried something else.
I know I'm a coward member.
Well... I am sure the additional server load from curious slashdotters like myself can only be helping.
Wait. Stop scrolling for a sec. O.K. Thanks. - P
Slashdot and Digg have one day traffic surges because Reddit is down. I'm getting way too much done today not being distracted by the GoneWild girls. This productivity must cease at once!
Does go to show what can happen when your business depends on an outsource provider. Everyone has to depend on service providers to some extent, but sometimes it's a good exercise to see how many of your company eggs are in one basket. Redundancy is expensive, but so is losing business. Even Google has had Gmail interruptions, lost some customer data and experienced slow downs.
That's our life, the big wheel of shit. - The Fat Man, Blue Tango Salvage
so this is why tested.com is down...
Else I don't know what to do? I almost went to Digg! so please amazon guys, work on your stuff!
"Science will win because it works." - Stephen Hawking
I didn't even realize that one of our partners was using Amazon EWS until suddenly they were down all day. Amazon is really stable historically, but it's frustrating when you're out of business and all you can do is wait and see if Amazon will fix it soon.
In the "old school" thinking, smart companies have a redundant data center somewhere, humming along and waiting to be switched on if the main data center ever goes down. "The cloud" was supposed to solve that - massive redundancy within Amazon's services were supposed to protect you from outages. Not the case, apparently, since it looks like Amazon is going to fall below their promised 99.95% uptime (4.38 hours per year downtime).
I think the answer is to have redundant cloud services online, so you could switch from Amazon to Google or DevGrid if you had issues. The problem is, there's nothing quite like Amazon right now, it's not easy to switch from Amazon to some random service. This might be the biggest argument against virtual services - lack of standardization makes it hard to move from one to another, and hard to set up backup services in case of emergency.
This is why, regardless of whether you're in the cloud or not, you need to have the ability to fail over to multiple datacenters in different geographical locations. Availability Zones are good but don't cut it. Unfortunately, Amazon doesn't make transferring backups between regions easy or cheap.
I was wondering why it took longer to start up my hadoop cluster this morning on EC2, but it still beats the living hell out of buying and configuring large numbers of machines for short term testing.
so we wait for the fog to lift...
We're now approaching our final destination, a datacenter of the future where nothing can possi-blye go wrong. Er, possi_bly_ go wrong. Heh, that's the first thing that's ever gone wrong.
Hmmmm... today *is* Judgement Day... perhaps Skynet's first target is AWS's East-Coast data center. Coincidence? I think not.
Gotta wonder what kind of flack Amazon is going to take for this one. I've had a couple clients looking into cloud services including moving to AWS and have already had one of them call me and cancel a meeting about it. While I understand stuff happens, the entire sales pitch for AWS was redundancy and build as you grow. Redundancy has obviously not worked in this case, while I usually support cloud services, this is definitely going to be a hard example to counter when trying to sell it to potential customers.
Is anybody else suffering from Reddit withdrawal?
~Syberz
It means inclement weather; it rains; it pours; it delays air traffic; it's gloomy. You can look up at it and see whatever you can imagine, but it is not real. It goes away when you most need it. It is all wet.
Uh-oh... This might be my fault. I've been loading music into my Amazon Cloud Player. Sorry guys.
damn. Now I feel uninformed and useful to society.
If at first you don't feel good.... suffer like the rest of us.
It affected me, so I just brought down the instances and databases on the east and launched them in the west coast datacenter. We are always in a position to redeploy elsewhere. Some good scripting and bootstrapping and events like this are not an issue. To believe that just cause its in the cloud means it will never fail is to not understanding what the cloud means. For me it means that when one area goes down, I can come up somewhere else at-will.
Also, it seems from what I can tell only zone 'a' is getting the worst of this down-time.
Cheesus Xist! Backup generators taken out again?!! After Chernobyl and Fukashima, I'm starting to thing these " nuclear engineers " aren't rocket surgeons .
the preceding comment is my own and in no way reflects the opinion of the Joint Chiefs of Staff
Is less than 2 (that is, over 50% capacity)? Or bad code?
"A networking event early this morning triggered a large amount of re-mirroring of EBS [Elastic Block Storage] volumes in US-EAST-1. This re-mirroring created a shortage of capacity in one of the US-EAST-1 Availability Zones, which impacted new EBS volume creation as well as the pace with which we could re-mirror and recover affected EBS volumes.
They couldn't handle recovering from one Zone going down? Or, it tried to recover from a blip so fast that it re-mirrored volumes more than once?
Either way, it's pretty bad for Amazon. I hope they learn a lot from this.
Their error page is rejected by firefox. So I wgetted it to see why.
At the bottom is a script from RUSSIA (in my best Max Headroom voice) (the src is addonrock.ru/Templatel.js)
So perhaps AWS is hacked?
It's a way of backing up your 'cloud' data locally
the cloud... What kind of bullshit is this?
For justice, we must go to Don Corleone
just like the ad...
"We are all geniuses when we dream"
- E.M. Cioran
Does this mean that...it's foggy on the intarwebz?
It was 4chan, finally exacting their revenge.
Everybody knows that datacenters WILL go down sometimes. Amazon offers availability zones not so you can cherry pick the one you feel good about (that too)... but because you have the option to spread your operation across different zones and be less impacted when shit happens. Of course those hip&cool app developers didn't think of that, right? So Reddit and a bunch of well known companies deployed everything in just one availability zone and hoped for the best. I guess they didn't even think about availability at all ("Amazon will take care of it"). If the engineers working at these companies had spend a single minute thinking about it they'd have figured it out. Amazon can't do this work for them... but perhaps they should add a checkbox to the contract "Have you developed your app so it can be deployed over at least 2 different availability zones?" and make that mandatory.
none
One slashdotter writes: "The damage due to the Amazon services downtime has been estimated to be $, as reported by companies affected."
Here's how we know it's really gotten Beijing’s attention: For the past four days, the Change.org website has been repeatedly targeted by cyber attacks coming from China that aim to bring our site down, which would keep people from signing the petition.
Does Sony's PSN sublet capacity on Amazon's cloud? PSN is down for "a day or two" according to stuff on Google.
``Tension, apprehension & dissension have begun!'' - Duffy Wyg&, in Alfred Bester's _The Demolished Man_
Someone might have mentioned this, but this has taken Heroku out of service too.
David
expect massive sale on roomba's and net connected kindles in the next few days
Muuu hahahahaha! https://forums.aws.amazon.com/message.jspa?messageID=238872#238872
Was it skynet?
Are you sure?
Great Slashdot icon dude...raining bits from the cloud. Sorry it could not have happened to a nicer bunch of creeps.
So much for the cloud...
And on a related note:
"Asked whether the official policy was wrong, Vice Principal Yoshiki Sugawara said, “No. The problem was, the tsunami was too high."
Can't rely on just one cloud vendor. Check out this simple animation that shows how to avoid these types of problems: You want to look at the "Complete in the cloud IT Organization" at the link below. http://www.batblue.com/usecases.php?first=499