Slashdot Mirror


Mega-Uploads: The Cloud's Unspoken Hurdle

First time accepted submitter n7ytd writes "The Register has a piece today about overcoming one of the biggest challenges to migrating to cloud-based storage: how to get all that data onto the service provider's disks. With all of the enterprisey interweb solutions available, the oldest answer is still the right one: ship them your disks. Remember: 'Never underestimate the bandwidth of a station wagon full of tapes hurtling down the highway.'"

20 of 134 comments (clear)

  1. Pro photography is a huge problem by Anonymous Coward · · Score: 5, Interesting

    Returning from a site with a tethered computer full of 80 MP 16-bit raw files from a day's worth of shooting would break most bandwidth bills if you tried uploading all these images.

    1. Re:Pro photography is a huge problem by HapSlappy_2222 · · Score: 4, Interesting

      Just because you take pro pictures at 80MP doesn't mean your business has an extra $10,000 per year laying around for a business grade gigabit pipe; as the sole employee of my company, that'd mean I'm paying myself $20,000 a year instead of $30,000 (TBH, I'm lucky to be in the black, period, with only two years in). I store my images at my studio, back them up daily to a removable disk, and bring in a 3rd removable to copy them over once per week. All told? $500 for the 2 drives and a striped array on my studio PC. The self-storage backup technique works well for me.

      Realistically, though, if I want to I can just upload them all to home or a cloud storage in batches overnight, the same way I download 10 gigabyte files at home. It's just plain easier to cart em around, though.

    2. Re:Pro photography is a huge problem by rthille · · Score: 4, Informative

      The tiny town of Sebastopol CA, population ~7800 has gigabit fiber to the (some) doorstep for $69/month.

      --
      Awesome furniture, accessories and cabinetry in Santa Rosa, CA: http://humanity-home.com/
  2. Station Wagon Full of Tapes by viking099 · · Score: 5, Funny

    Remember: 'Never underestimate the bandwidth of a station wagon full of tapes hurtling down the highway.'"

    Yeah, the bandwidth is great, but the latency SUCKS.

  3. Second biggest challenge by WOOFYGOOFY · · Score: 4, Insightful

    Second biggest challenge: trusting any of these places have the motivation to keep your data more secure than credit card companies do.

    1. Re:Second biggest challenge by gestalt_n_pepper · · Score: 5, Insightful

      No, that's third. The second biggest challenge is believing that those fine hosting companies with servers hosted in lower Slobbovia won't have a few entrepreneurial employees who will *actively* be searching your data for all that is monetizable.

      --
      Please do not read this sig. Thank you.
  4. Re:The real hurdle by icebike · · Score: 4, Insightful

    Getting around all the buzzwords

    Well that's one hurdle.
    The next is RECOVERY when ICE or FBI or some other 3letter agency walks in an takes your data because one tiny customer use the service for some allegedly nefarious purpose.

    The key here is to use a service so big that even god himself would not dare take it down, although the Ayatollah might try. Small cloud services, even if multi-homed are a risky proposition. Even if you do manage to get all your data into them, they are not large enough to push back against any subpoena or search warrant that any misguided judge in some backwater jurisdiction may issue.

    --
    Sig Battery depleted. Reverting to safe mode.
  5. Comment removed by account_deleted · · Score: 3, Funny

    Comment removed based on user account deletion

  6. Backups by SJHillman · · Score: 5, Informative

    My last employer offered offsite backups to clients. For the initial seed, we always tried to get them to put it on an external HDD and ship it to us (or at least DVDs). The only major exceptions were clients that were also on FiOS - that was the only case where over-the-net transfer was faster than the backup-and-ship-it method for the initial seed.

  7. Bandwidth of a Station Wagon by Surazal · · Score: 5, Funny

    Yes, never underestimate the bandwidth of a station wagon full of disks hurtling down the highway. The latency, on the other hand, leaves much to be desired, and I've heard the packet loss can be downright fatal.

    --
    --- Journals are boring; Go to my web page instead
    1. Re:Bandwidth of a station wagon by BradleyUffner · · Score: 3, Informative

      I have never liked the station wagon analogy, because it misunderstands the thing we are trying to measure. In the example, we measure the bandwidth of the station wagon. But that's like measuring the bandwidth of a packet -- a nonsense concept. We measure the bandwidth of the channel, not the chunks of data which fly through it. To really get the right analogy, we should talk about the bandwidth of a freeway, not the station wagon which drives upon the freeway.

      Bandwidth in the colloquial sense means "the amount of data which passes a given point, per second." So, imagine that you can load 25 TB in the form of tapes into a station wagon. For safety, these station wagons must drive a distance of 75 meters apart and a speed of 100 kilometers per hour. That means that one station wagon passes a given point every 2.7 seconds. That's 9.2 TB per second. Adding a second lane to the highway would double the bandwidth.

      The stupid calculation which is often performed, on the other hand goes like this. You have 25 TB in the wagon, and you drive it to a location 10 hours away... Already you've gone off the tracks, because you are mentioning the TIME it takes to get to the destination, i.e. the LATENCY. And as anybody knows, the latency (or equivalently the distance between the points) has NOTHING to do with bandwidth.

      How can you say Time has nothing to do with bandwidth when, in your own example, you measured it in TB per SECOND?

      Following your example again of 9.2TB/sec, that can be changed to 9.2TB * 60 /min, or 9.2TB * 60 * 60 /hour, or 9.2TB * 60 * 60 * 10 / 10 hours, which is the exact measurement that you seem to have a problem with earlier in your post (data in a 10 hour period).

    2. Re:Bandwidth of a station wagon by Firethorn · · Score: 4, Interesting

      Indeed. He also ignored the core reason for having said bandwidth - you have X amount of data to move in Y time (at under Z cost); what's the best way to do so?

      As such, a 'packet' on the freeway system is rather expensive, so you don't want to be putting multiple station wagons on the system if you don't have to. Figure the driver costs $20/hour, the vehicle itself $.50/mile(gas, maintenance, insurance, tolls, etc...), and you're looking at 300 miles in 10 hours. For a single packet you're looking at $350 for that single 'packet'. If a single station wagon doesn't do it, perhaps a cargo van would, which doubles the capacity of the packet while only raising the cost $50, to $400. Still not good enough? Upgrade to a 'package van' like UPS/Fedex trucks. Next step would be a Semi.

      In any case, I'd say that you could fit 25TB into a motorcycle today - 3 TB drives are fairly common now, and I can fit 10 into my saddlebags easily. Heck, I can get 1.5TB native tapes, about the same size as a HD. Padding it's dimensions up, it's 11 x 11 x 3 cm = 363 cm^3, or 2,755 per cubic meter.

      A 2008-11 Dodge Grand Caravan Cargo van - 143.8 cubic feet = 4.07 cubic meters, giving me room for 11k 1.5TB tapes. 16.5k TB, in 10 hours, if I have a single cargo van. Ouch. Disregarding media cost, that's ~$400.

      Do this daily, we're looking at 1.5 terrabits per second. Don't know of any connections that fast.
      Monthly, we're down to ~50 gigabit (rounding down). I can guarantee that a 50 gigabit connection will cost more than $400.
      Annually, it's 'only' 4 gigabit, and I pay more than $100/month for my megabit class connection, which ISN'T utilized 100%, unlike my calc.

      You don't normally need to figure out the bandwidth of the freeway because:
      1. Generally 1 vehicle 'packet' is sufficient, and due to the high marginal cost per said vehicle, you normally only want to send one.
      2. The roads are used for more than data shipment, which would be like trying to figure out how much bandwidth you have available for VOIP by looking at total circuit bandwidth.

      Don't need to ship that much? You should be able to ship about 30 of them for $60, second day air. That's 45TB, or about 140 Mbit of 100% saturated traffic for a month. BTW, during my calcs for paying fedex to ship them, I think that weight might actually be enough of an issue to increase gasoline consumption - but I think I've established that even $800 would be cheap if you need to ship that ridiculous of an amount of data.

      --
      I don't read AC A human right
  8. Exit strategy by scsirob · · Score: 3, Insightful

    No, the second biggest challenge is to come up with a viable exit strategy. Once you have several TB at this service provider, how will you move it out of there when the next provider has a better deal? That was one of the major big points for having a cloud in the first place, to have the freedom to move your compute requirements to a better, cheaper, faster (pick two) provider.

    Even if you moved it in with a station wagon full of tapes or disks and your provider let you import it, I'm sure your provider will not be so helpful when you need to move it back out.

    Blatant plug: Perhaps Actifio (www.actifio.com) can fix this for you, by replicating your data in, and also back out of production systems in deduped and compressed format.

    --
    To Terminate, or not to Terminate, that's the question - SCSIROB
  9. A problem bigger than getting your data on ... by petes_PoV · · Score: 4, Insightful

    ... is getting it all back OFF again when you want to switch service providers.

    The one thing you want never to happen is that you get locked in to a single cloud service. They might go bust, they might become uncompetitive. They may become politically "unfriendly" or tainted with customers you have no desire to be associated with - or any of a number of other reasons to say "adios".

    Just like with disaster planning, all the processes and procedures, agreements and SLAs are worthless until you've actually PERFORMED the operation and done so without a major service interruption. How many cloud users have gone that far - and how many are locked in but don't know it?

    --
    politicians are like babies' nappies: they should both be changed regularly and for the same reasons
  10. Re:why dodge this question? by Bogtha · · Score: 3, Insightful

    You don't know the exact dialogue between the journalist and the rep. I've been quoted in print in similarly stupid ways when what I said made absolute sense in context to what was asked. "Pressed if disks are accepted" could have been something like the rep telling them about a new CSV import tool they had built, the journalist saying "So if I mailed you a 5TB database on a disk, could you import that?", and the rep replying "Sure, but you'd need to export the data first...".

    --
    Bogtha Bogtha Bogtha
  11. Aspera and Friends by PhillC · · Score: 3, Interesting

    You you always use a UDP solution such as Aspera. Fast transfer speeds, bandwidth management and they have a specific AWS implimentation.

    Other options to look at include Smartjog, whose new Bolt product looks quite interesting, Riverbed's Steelhead product, Filecatalyst and Signiant.

    There are many solutions around now to deal with large file transfers for both small and large business. Most of them use UDP instead of TCP/IP, with Checksums to ensure all data is reliable delivered. Even with just 1Mbps upload speeds, something like one of the above named products will be advantageous. I've worked in the media industry for a number of years, and this type of thing is being used in Film and Television all the time. Of course, there are still tapes being shipped around, but in emerging markets, such as Russia for instance, the file transfer really beats a tape being stuck in customs for weeks or months.

    --
    Brought to you by the author of such childrens' classics as "Some Kittens can Fly!" and "All Dogs go to Hell."
  12. But then... by TemperedAlchemist · · Score: 4, Insightful

    How did you manage to fix armed FBI storming your servers located in another country problem?

  13. Re:The real hurdle by CAIMLAS · · Score: 5, Informative

    That is just one of many of the hurdles.

    Really, these problems are problems because most 'cloud' shit is done wrong.

    It's a bit of a worn out record here on Slashdot, but anyone or any company which is fully dependent upon The Cloud for business continuity is a fool.

    * First off, there is no such thing as 'utility computing', and probably never will be due to the volatile nature of storage and its ongoing cost of maintenance.
    * Second, if you do not maintain primary physical control of something, to the best of your ability, you do not control it.
    * For primary IT infrastructure, it will cost more to do "Cloud" than local. If you can afford 2-3 servers a year, but not much more, and a nominal IT operations budget, chances are you should have an in-house "cloud" with off-site replication.
    * Bandwidth costs both ways will kill you, as will latency in many cases, will kill Cloud functionality.

    At this point, I still strongly recommend against public Clouding your systems unless they are:

    a) (very!) low volume with use-based billing. This only makes sense for a low-volume public-facing site where you don't already have IT infrastructure (on a cost basis)
    b) off-site 'hot' replication. You've got your inside 'private Cloud' which replicates to off-site systems. (Cloud is basically just colocated virtualization, after all.)
    c) Other geographic/distribution requirements (eg. multisite organization with none serving as a good central hub). In this case, colocation of your own equipment makes more sense in many regards.

    --
    ~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers
  14. Canada by DarwinSurvivor · · Score: 3, Interesting

    In canada, unless you need low latency, the internet is about the most expensive method you could possibly use to transfer data. source

  15. Re:The bandwidth of a fully laden alimentary canal by FunkSoulBrother · · Score: 3, Funny

    African or European?