Slashdot Mirror


If Your Cloud Vendor Goes Out of Business, Are You Ready?

storagedude writes: With Amazon Web Services losing an estimated $2 billion a year, it's not inconceivable that the cloud industry could go the way of storage service providers (remember them?). So any plan for cloud services must include a way to retrieve your data quickly in case your cloud service provider goes belly up without much notice (think Nirvanix). In an article at Enterprise Storage Forum, Henry Newman notes that recovering your data from the cloud quickly is a lot harder than you might think. Even if you have a dedicated OC-192 channel, it would take 11 days to move a petabyte of data – and that's with no contention or other latency. One possible solution: a failover agreement with a second cloud provider – and make sure it's legally binding.

26 of 150 comments (clear)

  1. incremental backups by gbjbaanb · · Score: 3, Interesting

    This is the same problem we've always had, whether its someone's website on a shared host or a colo server. You need to back it all up and doing a naive dump of the entire thing will take too long and cost too much in bandwidth, so you take a dump of the entire thing once (preferably when you have the thing you're deploying locally) and then take incremental backups from there.

    The big question is what's the best backup tools to do this, and do they work on cloud systems that don't look like real servers? eg. I recall rsoft that did very good incrementals based on disk blocks changing so the backups were also continuous. Not sure if that'd fly on AWS.

    1. Re:incremental backups by jbolden · · Score: 3, Interesting

      You can do the same things just not on the file levels. Most of your clouds are using a SAN for databases. SAN's do backups like you say. You create matching LUNs on both devices and incrementally track changes to blocks between the SANs. You can just ask your cloud provider what SAN they are using and have them configure this sort of setup for you to another provider.

      In the case of AWS they provide backup servers directly connected to their production servers so you just backup to AWS. And if you want to move the backup from there to somewhere else it is trivial since it is already organized for backup.

    2. Re:incremental backups by pjt33 · · Score: 2

      Depends on what the problem with the colo server is. It's not entirely unknown for police to seize an entire rack of servers from a colo. (E.g. 1, 2, and I half-remember incidents in other countries too).

    3. Re:incremental backups by Ol+Olsoc · · Score: 5, Insightful

      Part of the issue with this is that people are hosting their entire servers on the cloud, not just a website

      Because it is safe, secure, always up, and the way of the future. A company can lay off half or more of it's IT staff going to this wonderful cloud, and no more worries about backing up files, because the cloud saves money, is safe, secure, always up, and the way of the future.

      Except when it isn't.

      --
      The shepherds did so well protecting the flock that the sheep no longer believed that wolves existed.
    4. Re:incremental backups by m2pc · · Score: 2

      This is the same problem we've always had, whether its someone's website on a shared host or a colo server. You need to back it all up and doing a naive dump of the entire thing will take too long and cost too much in bandwidth, so you take a dump of the entire thing once (preferably when you have the thing you're deploying locally) and then take incremental backups from there.

      I agree with this approach. If you can get an initial full backup and then use something like RSync with a cron job to handle the incrementals, that would be ideal.
      Some info regarding RSync with EC2 is here: http://stackoverflow.com/quest...

      One of the worst offenders when it comes to data exports has to be salesforce.com. If you delete a single custom object they charge you upwards of $10K USD to recover your data: https://help.salesforce.com/HT...
      Even worse, you get it back in CSV format. My former employer decided to go with them for their entire operation (sales, marketing, and production/warehouse). I left around the time they started implementation, and it was a complete nightmare!

  2. Re:Not for Federal Customers by Anonymous Coward · · Score: 2, Funny

    i suspect they'd get *everybody's* data, not just theirs.

  3. Legally binding? by louic · · Score: 4, Insightful

    Legally binding?
    I thought the goal was to get your data back*, not to start a lawsuit.

    * ...which you should have backed up somewhere obviously, not only on a single cloud storage location

    1. Re:Legally binding? by aaarrrgggh · · Score: 2

      Moreover, if a big cloud vendor quickly closes shop, interdependencies and network effects are likely to have an impact on your contingent vendor.

      We have hosted email, and will likely move to Amazon Glacier for DR backups; we have local snapshot backups that give all the information locally that would go to Glacier; it is just the earthquake/sprinkler/sabotage scenario that offsite would protect us against, and Glacier is starting to get competitive for our needs.

      Like everything, it is a scenario you need a plan for. Depending on the impact, the plan needs to be developed, tested, and re-validated as appropriate.

  4. Re:I hate hardware by Nyder · · Score: 4, Insightful

    I hate hardware and for all intents and purposes it can go shove itself up its own ass. As a result I very much love the cloud, no matter how much of a buzzword it is. Let someone else worry about the tedious busywork it is to get one piece of hardware to talk to another. Oh what's that? A disk died? I don't give a damn because I don't have to drive 30 minutes each direction just to change it. Ha!

    You missed the point of the article then.

    What if your cloud service provider goes down? How you going to get all your data if you get only 1 day, or a week notice? How about if you get no notice, the shit just stops working? The company goes poof! So does all your data.

    And you think that won't happen? Please, it is going to happen, sooner or later. Why do you think it's named The Cloud? Because clouds evaporate and disappear.

    --
    Be seeing you...
  5. Re:My thoughts by jellomizer · · Score: 4, Interesting

    Cloud offers a lot of real advantages over doing it all yourself.
    1. Having resources when you need it. Lets say you have some seasonal tasks that needs extra horse power (Quarterly Reports, Christmas Rush, Back To School, Exam Time...) so you can request short term extra power to keep up. Without having all this extra equipment running nearly idle for 3/4 of the year.

    2. Hardware upgrade, maintenance, backups... Keeping your hardware up to date is expensive, if you are keeping your old servers you will try to keep them for 6-8 years normally as to get the most out of them. However that means your servers are getting behind the times and are at risk of breaking. As well having to pay for redundant backup servers. Managing good data backup and storage policies. While might be easy for a couple of servers for larger demand it could become a huge waste of your time. The cloud company should be doing this maintenance. They get the advantage of dealing with a larger set.

    But I don't like cloud for the sake of cloud. Adobe Cloud, Office 365 and most of your software that you really just want on your PC and not pay a monthly fee for. Sure cloud backup such as Google Drive but your device has adequate power to run these apps without a huge server farm in the background.
     

    --
    If something is so important that you feel the need to post it on the internet... It probably isn't that important.
  6. Local Backups by Jason+Levine · · Score: 4, Interesting

    I find that local backups are better than cloud backups. I have a 1TB external hard drive that's nearly filled up. This drive cost me around $100 a few years ago. To get 1TB of backup from Google, for example, I would need to pay $9.99 a month. So I can either pay $120 yearly for 1TB of storage space or I can buy a new hard drive every year with increasing disk space. (Currently, $120 will get me a 3TB external hard drive.) With two of the drives, I can have one located somewhere "off-site" in case something happens to the location of my primary hard drive (fire, theft, etc).

    Don't get me wrong, cloud backups can be useful. I can have my phone auto-backup photos and videos to the cloud which is helpful in case something happens to my phone. It also means I don't need to worry about backing up my phone as often. Still, for the most part, I've found local backups to be easier to manage and less expensive than cloud backups.

    --
    My sci-fi novel, Ghost Thief, is now available from Amazon.com.
    1. Re:Local Backups by Naatach · · Score: 2

      Local backups are well and good until a fire or some other disaster takes out the data and the backup device. While it's easy to plan to take a backup off-site, it's another matter to get someone to remember to do it unfailingly every day. Finally, lugging external drives around subjects them to physical damage. A cloud provider solves this dilemma.

      --
      There may be no "I" in team, but there's also no "F" in way.
    2. Re:Local Backups by Voyager529 · · Score: 2

      Both systems have their advantages and drawbacks:

      Local backups, advantages:
      1.) Lower cost/GB.
      2.) Control over data.
      3.) Backups done on demand.
      4.) Multiple users/devices can be backed up on the same drive.

      Local backups, disadvantages:
      1.) Backup of mobile devices gets interesting.
      2.) Backup schedule needs to be adhered to; most people forget until the day after they need it.
      3.) Cost/GB narrows if more than one external disk is purchased to protect against disk failure.
      4.) Opportunity cost - performing backups take time and some level of technical expertise.

      Cloud backups, advantages:
      1.) Streamlined and convenient, so they're generally actually performed.
      2.) Realtime backup, usually including versioning.
      3.) Simple to do on all form factors.
      4.) Additional benefits - sharing, access from other devices, etc.

      Cloud backups, disadvantages:
      1.) No recourse if data is lost or "shared" on the provider's end.
      2.) Necessity of trust that the cloud provider will honor their Super Pinky Promise to not sell your data to the highest bidder.
      3.) No guarantee that the cost won't double tomorrow.
      4.) Backups make messes of data transfer quotas.
      5.) Initial transfer / complete data recovery can take a VERY long time.

      Now, the first three Cloud disadvantages can be somewhat mitigated by a decent SLA, but consumer grade cloud services aren't going to have a decent one, and a company who signs an agreement to be sued into oblivion for technical mishaps will be prohibitively expensive for end users - and even then, it boils down to enforcement. Voyager529 Cloud Services, Inc. goes into contract with Jason Levine for cloud backups. 99.999% uptime, no data gets out, and $25/month for 1TB for life, no matter what. Next month, my colo goes belly up overnight, and the servers with your data on it get sold at auction for $50. I file bankruptcy. Go ahead and sue me - you probably won't see your money, and you definitely won't see your data.

      It's for these reasons (and others) that I agree with you and do all my own personal backups on my own hard drives, and give an undesirable hand gesture to The Cloud (tm). For the majority of people though, the Saturday spent shopping at Microcenter for parts, 2-3 hours unboxing and assembling their own FreeNAS, installing the software, configuring the storage array, and port forwarding in their router like I did, just isn't worth the hassle. As much as I'd rather people do that (or even get a WD MyCloud), I'm begrudgingly happy that Dropbox and Google Drive and iCloud are all getting better traction because, even though Google/Amazon/Microsoft end up with plenty of that data, for most people, they actually have backups.

    3. Re:Local Backups by Slashdot+Parent · · Score: 2

      You might want to check out cloud backup services again. I think you could get by with more like $50 or $60/year these days. I'd include links, but I don't have any specific provider to endorse.

      --
      They don't grade fathers, but if your daughter's a stripper, you fucked up. --Chris Rock
  7. Failover Agreement? by wisnoskij · · Score: 2

    Do those actually accomplish anything when the entity who signed teh agreement just when bankrupt and disappeared? I cannot imagine there is anyway to enforce any agreement on an entity who is legally dead and without any remaining assets of any kind.

    --
    Troll is not a replacement for I disagree.
  8. You don't need the bandwidth by jbolden · · Score: 4, Insightful

    Even if a large cloud provider were to get out of the business they are going to handle things in a responsible way and move their datacenter, hardware and data to someone else. And that's almost certainly true for the smaller players as well. That hardware and data is worth money even if not as much as it cost to buy. The bondholders are going to want $.60 on the dollar rather than $.00 on dollar if they can. But even if we assume that weren't true there are still options. Many of the colo companies which remember sell 30% of their space to the telcos are already using their cross connects for cloud-to-cloud moves the same way they do now for carrier-to-carrier. So for example from Equinix you can go between AWS, Azure and Verizon (Tarramark).

    Almost all the small cloud players are renting space and will move data to physical drive or DAS or SAN. If they are growing broke just find out where they host, buy their hardware storage and keep it in the same colo your data is at now as a colo deal.

    This is the sort of thing your cloud agent can handle for your company for free. I get that Joe-IT manager isn't plugged in enough to the industry to know whose hosting what where and what interconnects they have but that doesn't mean the data isn't readily available. This article is mainly just ignorant of how the industry works.

  9. Only one way to deal with this problem by VAXcat · · Score: 2

    Make sure you keep your resume up to date and on local storage. When your company's cloud vendor fails...print your resume and use it to get a job somewhere else.

    --
    There is no God, and Dirac is his prophet.
  10. Re:I hate hardware by Amouth · · Score: 2

    This should be part of any companies DR strategy. If they don't have the provider going down, the same way they assume a DC goes down, then they aren't doing their job.

    We use DropBox, and we keep a local mirror sync'ed within our own servers. We use the service for the efficiency, but if they went belly up, we already have all our own data and would just be looking for a way to fill the efficiency gap.

    --
    '...if only "Jumping to a Conclusion" was an event in the Olympics.'
  11. AWS losing $2 billion a year? by ehynes · · Score: 2

    Citation please.

    1. Re:AWS losing $2 billion a year? by Enry · · Score: 3, Informative

      Yeah this. I can't find a source for this claim. According to Wall Street Journal, AWS' revenue is only a $1.2billion per quarter. It would have to be losing at least $500mil/quarter to make a $2 billion/yr loss. In other words, for every dollar you spend on AWS, they're really losing $.50 or so.

    2. Re:AWS losing $2 billion a year? by Anonymous Coward · · Score: 4, Funny
  12. Dumbing down your organisation by biodata · · Score: 2

    Mostly clouds are more expensive than doing it yourself, unless you fire your sysadmin(s) as part of the deal.

    --
    Korma: Good
  13. Re:My thoughts by jbolden · · Score: 4, Insightful

    Why would you want it to die? What was the upside of company's having to constantly worry about hardware budgeting when they wanted to do software projects?

  14. Re:I hate hardware by Amouth · · Score: 2

    while not a complete data center, we do have a few racks.

    The mentality you have to have when dealing with the "cloud" is how can i utilize it to make things better. Not "lets just shove it all up there" just like having that one box in the back-office that is the only one that can do something, you have to treat the cloud as a point of failure. When you design IT systems, you to the best of your budget try to mitigate single points of failure. Your budget is what limits the bounds in which you can do that, and the impact to the business should justify the budget.

    For dropbox what we did was setup a synology san unit with with Cloud Sync attached to a single admin account. We then implemented a business process for creation of dropbox folders for use with clients (we are a consulting firm). When a folder needs to be created, it is created on that account, and then shared out. This ensures that it gets a local copy of all dropbox folders used for clients. We then back the synology unit to our normal in-house back.

    We didn't go to dropbox because we wanted to replace our san or our local file storage abilities (honestly having everyone sync over the net connection vs local lan would be horrid). But rather wanted the convenience of their service, and the ease of use for the end users. But having that be the only centralized place anything existed would cause a single point of failure and should not be acceptable. It's the difference between leveraging something and relying on something.

    We also use AWS for outward facing web services, But we have our own IP Block with addresses reserved for fallback, anything we push to out is first pushed to a local production system for final QA then to AWS production. If AWS would disappear, we would just update DNS settings and be back on-line within a MAX of 24 hours. Now our local pipe and production system couldn't handle the same load that AWS would in the same manner but would be functional (stress test shows page loads going from ~.1 to 4-5 sec), but that is ok as the risk is low and the impact is only moderate so ultimately it's a risk we are willing to accept.

    Too many places now days don't do proper risk analysis and mitigation, and very few companies have real DR strategies. DR doesn't have to be expensive or complex, but it must be planned for and maintained, else you will be bitten when the disaster happens.

    --
    '...if only "Jumping to a Conclusion" was an event in the Olympics.'
  15. I'll worry about this ... by davide+marney · · Score: 2

    when Netflix stops using AWS. And Expedia. And NASA. And the CIA, fer cryin' out loud.

    Sheesh.

    --
    "We receive as friendly that which agrees with, we resist with dislike that which opposes us" - Faraday
  16. change paradigm to *onsite* backups by ron_ivi · · Score: 2
    Remember how "off-site backups" were important back before the cloud?

    I think the best of both worlds is to have the live system in the cloud, but have on site backups of all those systems.

    That way if/when the cloud dies, you can still have access to all your data.