Amazon EBS Failure Brings Down Reddit, Imgur, Others
Several readers have sent word of a significant Amazon EBS outage. Quoting:
"Amazon Web Services has confirmed that its Elastic Block Storage (EBS) service is experiencing degraded service, leading sites across the Internet to experience downtime, including Reddit, Imgur and many others. AWS confirmed on its status page at 2:11 p.m. ET that it is experiencing 'degraded performance for a small number of EBS volumes.' It says the issue is restricted to a single Availability Zone within the US-East-1 Region, which is in Northern Virginia. AWS later reported that its Relational Database Service (Amazon RDS) and its Elastic Beanstalk application plaform also experienced failures on Monday afternoon."
Productivity reached a record high this afternoon.
After 3 days without programming, life becomes meaningless
- The Tao of Programming
It's the cloud! It's like never like down, and webscale!
Since no one can go on reddit, they will come back to /. only to find out why reddit is down!
Coursera is also down as a result.
/. is working just fine.
Are those karma points in the mail?
It's as if millions of geek voices cried out in terror & were suddenly silenced.
We are seeing EBS problems across multiple AZs with our services, as are many others. Amazon is downplaying the issue.
See HN for ongoing discussion as well: http://news.ycombinator.com/
Bad luck if you're hosted in the US-East-1 Region, I guess.
Heh, I should really start advertising the LVS clusters I tend to as 'private clouds with better uptime than Amazon'.
My God, it's Full of Source!
OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
I have to admit, due to this outage I just logged in to Slashdot for the first time in a year. We're experiencing our own outages at work, unrelated to AWS, but I'd hate to be an AWS admin during one of these major outages. This makes me wonder why Reddit, Imgur, etc., don't have presences in multiple availability zones to prevent this kind of outage.
Do you still think that putting your digital life in the "cloud", without any ability to fall back on a physical hard drive or device, is a good idea?
[End Of Line]
An honest question, why don't these large, big-name sites utilize the Multi Availability Zone failover that Amazon offers? It seems these AWS outages make for good headlines, but shouldn't any large site be co-located in multiple physical locations to ensure uptime? If they WERE using Multi AZ, or there is some other technical reason why it wouldn't help, I'm really curious to know why...
There's an oblig xkcd: http://xkcd.com/908/ Guess someone tripped over the wire.
s/[stupid comments]/[intelligent discourse]/gi
I'm just glad I moved my hosting away from AWS. It seems they've had a few problems lately in their datacentres. Local Aussie hosting seems to have better bandwidth anyway.
If only there were some lessons learned over decades and decades of mainframe use that that could be applied to the cloud.
Well... as it's currently referred to, the "cloud" is a singular entity. So, as long as there's one single server running as part of that infrastructure, you could weasel your way around any downtime and reassure the ignorant masses that "the cloud" is is still up, even if the only remaining piece is a Raspberry Pi running over a cable modem in some guy's basement.
Hey, look everybody, the cloud is still up! You can't do near as much as you usually can, but it's up! 100% uptime! Woo!