Amazon EC2 Crash Caused Data Loss
Relayman writes "Henry Blodget is reporting that the recent EC2 crash caused permanent data loss. Apparently, the backups that were being made were not sufficient to recover the lost data. Although a small percentage of the total data was lost, any data loss can be bad to a Website operator."
srsly, as in your own
... the confusion of ideas that would lead someone to treat their live web server as their primary/master data repository.
I guess I'm still stuck in Commodore 64 World, or something..
Was the lost data... all the stuff the PSN network lost? I think I see a connection!
There's a spot in User Info for World of Warcraft account names? Really?
Who knew?
-- Braden's law of data: All data spends some of its lifetime in an excel spreadsheet.
EC2 is not meant to be used for data storage, that is what S3 is designed for. You store data and backups on S3, and use EC2 to serve high bandwidth websites to the masses.
Guess Wikileaks feels good about not being hosted there anymore.... their critical information could have been "lost" as well....
What is more scaring for me, is that Amazon tell you that they have multiple availavility zones on each zone, and recomends you to distribute replicated servers, on each of this zones, for example I have a project with the master database in one zone, and the replica on the other zone. Why both zones fail?? Are not isolated/independent? Amazon charges you for data transfer between zones. As other says fails the servers, anyone must had backups on other place (S3, or Amazon external).
Damia
I think people miss the point of the cloud - saying the cloud is worthless because it "brings people that would otherwise have nothing against you trying to take down your server" is like saying that the internet is worthless because it opens up security risks.
I for one am glad to be connected, and obviously so are many others. Don't use services that aren't good for you - there are some cloud based services that are great, and some that aren't. It's pretty clear that in the future, things will be more connected, not less - adapt and take advantage of the good parts, the rest will fade anyway.
There's something simplistically technocratic about assuming that what is now is better than what has been.
Buy X! It's newer, thus better, than Y!
Because the economy's like a religion and set up so people lose their jobs and their homes if you don't needlessly produce and consume nothing of value.
What's a girlfriend?
Post morten Amazon explanation:
http://aws.amazon.com/message/65648/
Damia
any data loss can be bad to a Website operator.
any data loss is catastrophic, if it's your data. They claim "a small percentage" of data was lost... 1% is a small percentage... 10% is also small percentage, but it's a huge amount of data.
Fortunately where I live and work there isn't really sufficient and reliable connectivity to "the cloud" to make it a worthwhile endeavor, so hopefully all the mistakes are learnt from before I have to worry about it.
Unless you pay extra, they say you can expect to lose data stored in S3 on a regular basis. There's nothing wrong with that per se, but it's something you need to plan for.
S3:
http://aws.amazon.com/s3/
EBS:
http://aws.amazon.com/ebs/
This is not the first time I've heard about a big hosting centre losing data even though it never happens, and they are keeping backups, etc.
It if it's at all manageable, keep one copy safe at your own place in addition to the replication at the hosting centre. You can set up a cheap box at the office with a couple of terabytes disk space and suck down the data periodically with something like rsync and rdiff-backup. It's not a whole lot of work and can make the difference between having a big problem and total disaster.
It would help if hosting centres actually told you how exactly they store and backup your data and what they do in case of emergency instead of throwing meaningless phrases like "99.999% uptime!" and "fully redundant storage backbone!" at you. Fully redundant storage backbone is nothing if it means it's built with some big arse proprietary SAN stuff where the whole array goes down if the main controller goes down. Which it of course does because it's a flaky embedded thing with 2k memory that has to be programmed in assembler and C with dangling memory pointers all over the place.
The durability you quote for S3 (99.99%) is for the reduced redundancy option. The standard storage lists 99.999999999% durability.
Yes, because building your own datacentre, or paying hosting fees to a five-nines-plus facility, costs nothing. Air conditioning, batteries, generators, fire supression, multiple, redundant network connectivity: that stuff''s all free. A mainframe solves it all!!
Look, a quality DC costs millions to build or tens of thousands to rent space in. Servers and mainframes cost money to manage, support and spare out. If you're starved for capital, why wouldn't you use EC2+EBS+S3 for a few bucks a month, rather than tie up dollars that could be spent on developers, marketing or suchlike in hardware and facilities that you're not really benefitting from. To build something like EC2 and the like is seriously expensive. Can your average startup with a server or two claim five nines? Really?
All these people who chant "Don't use the cloud, there could be an outage/breach!!" are just one screw-up away from the same, and it's often pure luck that they haven't been whacked yet.
--srj/mmv