Amazon EC2 Crash Caused Data Loss
Relayman writes "Henry Blodget is reporting that the recent EC2 crash caused permanent data loss. Apparently, the backups that were being made were not sufficient to recover the lost data. Although a small percentage of the total data was lost, any data loss can be bad to a Website operator."
srsly, as in your own
... the confusion of ideas that would lead someone to treat their live web server as their primary/master data repository.
I guess I'm still stuck in Commodore 64 World, or something..
Was the lost data... all the stuff the PSN network lost? I think I see a connection!
There's a spot in User Info for World of Warcraft account names? Really?
Who knew?
-- Braden's law of data: All data spends some of its lifetime in an excel spreadsheet.
OMFG Run for the hills! Social Medial Data was lost. How Will I know in 30 years what you said about bobby!?!??!
EC2 is not meant to be used for data storage, that is what S3 is designed for. You store data and backups on S3, and use EC2 to serve high bandwidth websites to the masses.
^^^
If you have no need for this snapshot, please delete it to avoid incurring storage charges.
This is completely unacceptable.
As a former sysadmin though, I think one way of keeping database backups less then 12 hours old is by writing compressed binary logs to a tape streamer. If the tape is double sided, it can continuously keep recording binary logs. But for the amount of data Amazon handles I think this can grow expensive, tapes are slow media so even a rack full of them might not cut it. Still, there has to be a way to cache volatile data to non-volatile storage. But then, even Google can loose a file.
I like to send syslog to a remote site, so in case of an outage the investigation can begin immediately. Because even if there are several redundant servers, if they all have the same flaw they're all going down the same way.
If Amazon can fuck things up as badly as this, then surely so can Google. Which means the cloud is dead.
Back to mainframes, folks. It really works. It's really secure. You have full control. And not really more expensive.
I'm not even joking.
Guess Wikileaks feels good about not being hosted there anymore.... their critical information could have been "lost" as well....
What is more scaring for me, is that Amazon tell you that they have multiple availavility zones on each zone, and recomends you to distribute replicated servers, on each of this zones, for example I have a project with the master database in one zone, and the replica on the other zone. Why both zones fail?? Are not isolated/independent? Amazon charges you for data transfer between zones. As other says fails the servers, anyone must had backups on other place (S3, or Amazon external).
Damia
Amazon never pretended that data on EBS volume is completely safe. If you want that store them on S3.
Amazon always advoctaed to back EBS volumes. Actually i'm surprised that only this little data is lost.
What's a girlfriend?
Post morten Amazon explanation:
http://aws.amazon.com/message/65648/
Damia
When is Linux and cloud computing going to be ready for the desktop? Any day now?
http://www.stopacop.so -- You have rights. How about standing up for them before they go away?
any data loss can be bad to a Website operator.
any data loss is catastrophic, if it's your data. They claim "a small percentage" of data was lost... 1% is a small percentage... 10% is also small percentage, but it's a huge amount of data.
Fortunately where I live and work there isn't really sufficient and reliable connectivity to "the cloud" to make it a worthwhile endeavor, so hopefully all the mistakes are learnt from before I have to worry about it.
Unless you pay extra, they say you can expect to lose data stored in S3 on a regular basis. There's nothing wrong with that per se, but it's something you need to plan for.
S3:
http://aws.amazon.com/s3/
EBS:
http://aws.amazon.com/ebs/
This is not the first time I've heard about a big hosting centre losing data even though it never happens, and they are keeping backups, etc.
It if it's at all manageable, keep one copy safe at your own place in addition to the replication at the hosting centre. You can set up a cheap box at the office with a couple of terabytes disk space and suck down the data periodically with something like rsync and rdiff-backup. It's not a whole lot of work and can make the difference between having a big problem and total disaster.
It would help if hosting centres actually told you how exactly they store and backup your data and what they do in case of emergency instead of throwing meaningless phrases like "99.999% uptime!" and "fully redundant storage backbone!" at you. Fully redundant storage backbone is nothing if it means it's built with some big arse proprietary SAN stuff where the whole array goes down if the main controller goes down. Which it of course does because it's a flaky embedded thing with 2k memory that has to be programmed in assembler and C with dangling memory pointers all over the place.
As I understand the article, an inferno-like situation destroyed servers at AWS. That can happen. But why should a customer ever get notice of such a situation? Apart from public notice about a desaster somewhere within a company?
Several years back I learned about techniques, where a second live system would jump in immediately without the customer even noticing the switch to another physical machine. And I am not an expert in high-availability...
The second thing which makes me wonder is the notice of snapshots. Why do AWS talk of data likely not useful and charge their customers at the same time for their storage?
cb
The durability you quote for S3 (99.99%) is for the reduced redundancy option. The standard storage lists 99.999999999% durability.
You can check out the full disclosure of the event details here: http://aws.amazon.com/message/65648/