Slashdot Mirror


BitTorrent For Enterprise File Distribution?

HotTuna writes "I'm responsible for a closed, private network of retail stores connected to our corporate office (and to each other) with IPsec over DSL, and no access to the public internet. We have about 4GB of disaster recovery files that need to be replicated at each site, and updated monthly. The challenge is that all the enterprise file replication tools out there seem to be client/server and not peer-to-peer. This crushes our bandwidth at the corporate office and leaves hundreds of 7Mb DSL connections (at the stores) virtually idle. I am dreaming of a tool which can 'seed' different parts of a file to different peers, and then have those peers exchange those parts, rapidly replicating the file across the entire network. Sounds like BitTorrent you say? Sure, except I would need to 'push' the files out, and not rely on users to click a torrent file at each site. I could imagine a homebrew tracker, with uTorrent and an RSS feed at each site, but that sounds a little too patchwork to fly by the CIO. What do you think? Is BitTorrent an appropriate protocol for file distribution in the business sector? If not, why not? If so, how would you implement it?"

291 comments

  1. Sneakernet by 91degrees · · Score: 5, Insightful

    The bandwidth of a DVD in the postal service isn't great but it's reasonable and quite cost effective.

    1. Re:Sneakernet by tepples · · Score: 4, Insightful

      The bandwidth of a DVD in the postal service isn't great but it's reasonable and quite cost effective.

      From the summary: "I would need to 'push' the files out, and not rely on users to click a torrent file at each site." I imagine that the following is also true: "I would need to 'push' the files out, and not rely on users to insert a disc and run setup.exe at each site."

    2. Re:Sneakernet by fxkr · · Score: 1

      Problem is, both latency and packet loss are quite high...

    3. Re:Sneakernet by 91degrees · · Score: 1

      Since these are disaster recovery files, I'm assuming they only need to be used in case of a disaster.

      If so, the data is just as good on a DVD as on a hard disk.

      If not my idea's rubbish but suggesting it was inexpensive.

    4. Re:Sneakernet by Tokerat · · Score: 1

      Well it is also not described what TYPE of disaster recovery it might be - some of that info might be "What to do if there is a fire on store property.pdf" but some of it might also be "CustomerTransactionDataRecovery.exe"

      If that's the case, you would need a store to run setup.exe every time...

      --
      CAn'T CompreHend SARcaSm?
    5. Re:Sneakernet by Anonymous Coward · · Score: 0

      First: You mean latency, not bandwidth.

      Second: Things change so you can't just repeat this line year after year, you have to revisit the question regularly. What does the math look like in 2008? What would be an estimate of disc cost + mailing cost + recurring employee time cost vs file transfer cost?

    6. Re:Sneakernet by the_B0fh · · Score: 1

      What has that got to do with anything? If that's the case, the files that are currently pushed out to DR still has to be executed manually or automagically.

      The OP is not asking for that - the OP wants the files to be transferred automagically. A DVD works perfectly fine, just has high latency.

    7. Re:Sneakernet by neomunk · · Score: 1

      I don't think that putting a DVD into a (hopefully) physically secured computer is as automagical as doing absolutely nothing on the client end while a script/daemon takes care of all the work.

    8. Re:Sneakernet by 91degrees · · Score: 1

      First: You mean latency, not bandwidth.

      No. I mean bandwidth. Latency is meaningless here because you're just sending a disc out. The data transfer rate is 4GB over a couple of days which is somewhere in the order of 10-100Kbits/s.

    9. Re:Sneakernet by maxume · · Score: 4, Insightful

      Also, burning (and packaging and mailing...) a bunch of DVDs isn't necessarily cheap/quick/easy, so it breaks down pretty quickly as the number of stores increases.

      --
      Nerd rage is the funniest rage.
    10. Re:Sneakernet by Anonymous Coward · · Score: 0

      No, bandwidth is meaningless because it can scale trivially (just send more DVDs).

      Oh and I guess you did the math and it didn't work out in your favor. I thought so too.

    11. Re:Sneakernet by gbjbaanb · · Score: 2, Insightful

      surely "push the files" to a remote site is the same as "posting the files" via a different transport mechanism. When people say that they need to remotely push the files, its not that the users can't/won't be able to handle them if they're not there already setup, its because they'll forget or just be too lazy to click the button to retrieve them. A DVD in the post is difficult to miss.

      However, a DVD in the post may not arrive or may be corrupt.

    12. Re:Sneakernet by 91degrees · · Score: 1

      No, bandwidth is meaningless because it can scale trivially (just send more DVDs).

      So bandwidth isn't an issue either. That's good.

      Oh and I guess you did the math and it didn't work out in your favor. I thought so too.

      You guessed wrong.

    13. Re:Sneakernet by Anonymous Coward · · Score: 2, Funny

      "What to do if there is a fire on store property.pdf"

      Step one: turn off the computer.

      Oh crap! what's step two?

    14. Re:Sneakernet by palegray.net · · Score: 1

      Actually, the bandwidth is great... it's the latency that poses a problem.

    15. Re:Sneakernet by derfy · · Score: 2, Insightful

      I dunno, but step three is profit.

    16. Re:Sneakernet by Sentry21 · · Score: 1

      The bandwidth can be great, it's the latency that kills you.

    17. Re:Sneakernet by 91degrees · · Score: 1

      Other people have said as well. Still not convinced. But ultimately it's total transfer times that we're concerned about so whichever it is... It's just a question of how much of a problem a couple of days (plus potential lost DVDs) will cause.

    18. Re:Sneakernet by louiswins · · Score: 1

      Then step two must be "Call insurance agency"?

    19. Re:Sneakernet by Anonymous Coward · · Score: 0

      Actually, the latency could be of benefit, the postal system acts as off-site storage.

    20. Re:Sneakernet by Anonymous Coward · · Score: 0

      bandwidth is actually great (4.7GB per day, if you assume it can be delivered overnight)... but latency (the time it takes for the first bit of data to arrive) is not... word :)

    21. Re:Sneakernet by Dan541 · · Score: 1

      Or "Buy insurance"

      --
      An SQL query goes to a bar, walks up to a table and asks, "Mind if I join you?"
    22. Re:Sneakernet by Zygamorph · · Score: 1

      Actually the bandwidth of a DVD is excellent, its the latency that is a problem. Picture a postal truck full of DVDs, lots of bits but the round trip time is bit high.

  2. Different torrent client ? by drsmithy · · Score: 5, Informative

    No need to get fancy with an "RSS feed". rTorrent, at least, can be configured to monitor a directory for .torrent files and automatically start downloading when one appears. You could set this up, then simply push out your .torrent file to each site with something like scp or rsync.

    1. Re:Different torrent client ? by Anonymous Coward · · Score: 5, Interesting

      rtorrent watching a directory for .torrent would be the way to go. And then use unison to keep the .torrent directory in-sync.

    2. Re:Different torrent client ? by Anonymous Coward · · Score: 0

      I think even uTorrent can monitor a directory =]

    3. Re:Different torrent client ? by Anonymous Coward · · Score: 0

      uTorrent and Azureus both do this.

    4. Re:Different torrent client ? by rusl · · Score: 1

      ya, that's a pretty standard BT client feature.

      --
      Stupidity is its own reward.
    5. Re:Different torrent client ? by msobkow · · Score: 1

      Azureus automatically pulls it's updates as well, so maybe there's a way to get Azureus to auto-subscribe to files, too.

      --
      I do not fail; I succeed at finding out what does not work.
    6. Re:Different torrent client ? by antikaos · · Score: 1

      I'd guess that that's via an RSS stream

      --
      I don't believe you, I'm here for a seat on the secret spaceship.
    7. Re:Different torrent client ? by smARMie · · Score: 1

      I use this setup to backup up 18 sites on two backup servers:
      t0 script generates backup file and .torrent in every location, then uploads the .torrent to the tracker, then starts seeding
      t1 script pushes .torrent from tracker to backup servers
      -> launchmanycurses (in screen) does it's job

      --
      Beware of programmers who carry screwdrivers!
    8. Re:Different torrent client ? by Neoncow · · Score: 1

      I love unison, but for simplicity rsync would probably do fine for a push environment.

    9. Re:Different torrent client ? by Anonymous Coward · · Score: 0

      WINE 1.0 is out. Only GNU HURD, a Mac Pro Mini, and Duke Nukem Forever to go !

      Not to mention Chinese Democracy. If Duke turns up it'll be a cold winter in hell.

    10. Re:Different torrent client ? by marnaq · · Score: 1
      No need to poll the watch directory. Rtorrent has an XMLRPC interface.

      xmlrpc rtorrent-client-1:port load_start "http://some-server/latest-stuff.torrent"

  3. Storm or some other botnet by Anonymous Coward · · Score: 1, Funny

    Ask a warez site.

    1. Re:Storm or some other botnet by vilgefortz · · Score: 3, Funny

      That's what he just did.

  4. Dedicated Server by Szentigrade · · Score: 0, Flamebait

    Wouldn't a dedicated server provide what you need? Upload your recovery files once and than have the server transfer them to each client at high speed. Simple and cost effective.

    --
    When I read about the evils of drinking, I gave up... reading.-Henny Youngman
    1. Re:Dedicated Server by Anonymous Coward · · Score: 0

      I don't know why parent is considered flamebait. Although the summary had already said that they would prefer something that would cost less, and be easier for low bandwidth.

      But flamebait? Off-topic would be more appropriate.

    2. Re:Dedicated Server by nog_lorp · · Score: 1

      Looks more like -1 Troll to me.

  5. technologies working together isn't patchwork by Yonkeltron · · Score: 1

    these are technologies that have been proven effective when working together by people everywhere. if you put it together, test it and build a system for fail-safes etc., you should be fine!

    --
    Keep the faith, share the code
  6. ask us by TheSHAD0W · · Score: 4, Informative

    Next time you should ask at the official BitTorrent IRC channel.

    The Python BitTorrent client, which runs on Unix, has a version called "launchmany" which is easily controlled via script. It should fit your needs very nicely.

  7. Works great by Anonymous Coward · · Score: 5, Insightful

    BitTorrent is an excellent intranet content-distribution tool; we used it for years to push software and content releases to 600+ Solaris servers inside Microsoft (WebTV).

    -j

  8. Sure, why not? by sexybomber · · Score: 5, Insightful

    Is BitTorrent an appropriate protocol for file distribution in the business sector?

    Sure! BitTorrent, remember, is only a protocol, it's just become demonized due to the types of files being shared using it. But if you're sharing perfectly legitimate data, then what's wrong with using a protocol that's already been extensively tested and developed?

    Just because it's been used to pirate everything under the sun doesn't make it inappropriate in other arenas.

    1. Re:Sure, why not? by tylerni7 · · Score: 0

      I don't know what the poster meant exactly when he said appropriate, but I figured it was something like "is it inappropriate to use the client's bandwidth to push our software?"
      If that wasn't what he meant, well, maybe it should be inappropriate. There is certainly nothing wrong with bittorrent, but I'm not sure how the clients would react if they knew that they had to use their bandwidth to push your software.

      They might be fine with it, especially if it's a closed network, and that bandwidth can't be put to any good use. However, unless it was already written in a contract somewhere that they agree to let you use their connection for anything, I don't think it would be appropriate to silently push bittorrent clients on their machines that they don't know about.

    2. Re:Sure, why not? by Bert64 · · Score: 2, Insightful

      Pirates still prefer FTP, it seems all of the big warez groups are still pushing files around using FTP...

      --
      http://spamdecoy.net - free throwaway anonymous email - avoid spam!
    3. Re:Sure, why not? by hedwards · · Score: 2, Interesting

      The main problem is that it introduces an extra vulnerability. With it the capability of very efficiently spreading malware and viruses around. Depending upon how locked down things are, it might not be a problem, but still it's definitely something to worry about.

      And yes, I am assuming that somebody's going to get their machine infected or that somebody's going to break into the VPN traffic. Not necessarily likely, but still has to be considered.

    4. Re:Sure, why not? by Xugumad · · Score: 4, Informative

      One of the things that always amused was when people claimed Bram Cohen was "selling out" by working with the movie/music industry. BitTorrent was never intended for piracy use, it's merely it's most common use.

      It's very regularly used for Linux distros, game patches (World of Warcraft!), etc.

    5. Re:Sure, why not? by Jbcarpen · · Score: 2, Informative

      Um, he's pushing files to retail stores that are part of a chain. He's doing this from the corporate office. I don't think the retail stores get much say in the matter, since they aren't customers, they're subsidiaries.

      --
      GENERATION 667: The first time you see this, copy it into your sig on any forum and add 1 to the generation
    6. Re:Sure, why not? by tylerni7 · · Score: 1

      Ahh I need to work on my reading comprehension I guess, thanks for clearing that up...

    7. Re:Sure, why not? by Fumus · · Score: 3, Insightful

      Don't forget about USENET. It's way more convenient than waiting days because the seeders to leechers ratio is 1:30.

    8. Re:Sure, why not? by nog_lorp · · Score: 2, Informative

      You're talking about the difference between the provider pirates and the end-user pirates. SCENE people hate p2p. Average Joe-wants-stuff-for-free doesn't know what the "scene" is, and uses p2p (always wondering why torrents say RELOADED or RAZOR1911).

    9. Re:Sure, why not? by Anonymous Coward · · Score: 0

      And yes, I am assuming that somebody's going to get their machine infected

      Thinking about USB sticks with targeted malware? Clients should be kept updated, only necessary rights granted (storage usage, user rights, ..) and standard compliance requirements followed (IDS, firewalls, AV).
      There really should be an OS project for implementing regulatory compliant corporate networks with different open source components and free operating systems. Of course, the project would step over a few toes of consultants.

    10. Re:Sure, why not? by atraintocry · · Score: 1

      Still, the context of "appropriate" was most likely the pool of data transfer methods, not social issues.

    11. Re:Sure, why not? by adolf · · Score: 1

      With BitTorrent's hash checking, clients are unable to poison the torrent.

      Therefore, in order for widespread infection to happen, it'd have to occur at the source. This is just about exactly as likely to happen with BitTorrent as it is with any other distribution method (from rsync, to postal service, to armed couriers).

    12. Re:Sure, why not? by Anonymous Coward · · Score: 0

      Idiot, learn how the technology works before opening your mouth. Seeders and the seed to leech ratio have nothing to do with how fast you get the content, as you also download from leechers. The only time there would be a problem is when the original seeder has'nt pushed out a full copy of the file yet.

    13. Re:Sure, why not? by Daengbo · · Score: 1

      Didn't he used to be on Slashdot all the time when he was developing the protocol? I seem to remember that he was promoting it in his sig. Anyway, supporting your point, when he was proposing the protocol, it was specifically for taking loads off of FTP servers during the first few days after a large release without having to use mirrors. His primary example was a Linux distro release.

      I ask if you remember because it's obvious you would have been around at that time and my memory isn't that great.

    14. Re:Sure, why not? by heson · · Score: 1

      The SCENE is a MMO, the strictly hierarchial structure is needed for the game to work.

    15. Re:Sure, why not? by Cato · · Score: 1

      Whatever software you put on a server to handle replication could have vulnerabilities. I don't see why BitTorrent peers should be more vulnerable than anything else, and it's possible they are sufficiently well tested out on the Internet that they are more secure than a tool used only within the enterprise.

    16. Re:Sure, why not? by Anonymous Coward · · Score: 1, Interesting

      It's not the pirates that prefer FTP, it's the Scene.

      They're not the same. If those very FTP sites were the only places you could get pirated material online neither Hollywood nor the music industry would care about online piracy. Common people wouldn't know about the sites, just like normal users that copied game floppies from their friends in the 90s didn't know what a BBS was.

      The "pirates" today are the "common people" (at least for my own generation), ie p2p users. The Sceners hates it when their releases "leak" to torrent-sites just as much as the original authors of the pirated content does.

    17. Re:Sure, why not? by StikyPad · · Score: 1

      USENET must be extremely efficient indeed, considering that you've managed to forget both the First Rule of USENET and the Second Rule of USENET at the same time!

    18. Re:Sure, why not? by StikyPad · · Score: 1

      Ah, provider pirates... So detached from reality that they attempt to forbid redistribution of "their" releases via torrent. For even more irony, check the user agreement of any private tracker, which typically prohibits redistribution of "their" torrents to any other tracker.

      I guess the ultimate irony would be your friend refusing to give you a copy of the title he just downloaded from TPB because the README says he'll get banned for P2P.

    19. Re:Sure, why not? by Fumus · · Score: 1

      It's not a problem any more since nowadays almost no ISP provides free USENET access.

    20. Re:Sure, why not? by Bert64 · · Score: 1

      Yes, and i can't stand the "scene"...

      The level of arrogance, deleting a perfectly good rip of something because it was done by some random joe and not a "scene release", even if the scene version is noticeably worse in quality or such...

      This ridiculous business of chopping every file up into thousands of small rar files, and often compressing a file multiple times (hint: compressing a file more than once does not save you more space, and video/audio files are typically already compressed)... Leaving users with the hassle of having to decompress it after downloading.

      An obsession with FTP, despite the fact that there are newer better designed and more secure protocols out there these days... There really is no good reason to use FTP anymore.

      --
      http://spamdecoy.net - free throwaway anonymous email - avoid spam!
  9. rsync by timeOday · · Score: 5, Informative

    How much do these disaster recovery files change every month? If they stay mostly the same, using rsync (or some other binary-diff capable tool) may let you keep your simple client/server model while bringing bandwidth under control.

    1. Re:rsync by Anonymous Coward · · Score: 5, Informative

      Yes, and there are ways you can use rsync from well-planned scripts that are very powerful beyond just file transfer.

      1. The basic case of "transfer or update existing files at destination to match source." It always takes advantage of existing destination data to reduce network transfers.

      2. The creation of a new destination tree that efficiently reuses existing destination data in another tree without modifying the old tree. See --copy-dest option.

      3. In addition to the previous, don't even create local disk traffic of copying existing files from the old tree to new, but just hard link them. This is useful for things like incremental backup snapshots. See --link-dest option.

      It may not be as sexy as p2p protocols, but you can implement your own "broadcast" network via a scattered set of rsync jobs that incrementally push their data between hops in your network. And a final rsync with the master as the source can guarantee that all data matches source checksums while having pre-fetched most of the bulk data from other locations.

      I've been enjoying various rsync applications such as the following (to give you an idea of its power): Obtain any old or partial mirror of a Fedora repository and update it from an appropriate rsync-enabled mirror site, to fill in any missing packages. This is a file tree of packages and other metadata. Concatenate all of the tree's files into one large file. Then use rsync to "update" this file to match a correponding DVD re-spin image on a distro website. Rsync will figure out when most of those file extents cooked into the ISO image are already in the destination file, and just go about repositioning them and filling in the ISO filesystem's metadata. An incredibly small amount of traffic is spent performing this amazing feat.

    2. Re:rsync by Kozz · · Score: 1

      You know, I'd had a need for an rsync-like tool for Windows (specifically between Windows Server 2003 machines). I found a Windows-based rsync implementation (whose name I can't recall), but the tool was clunky and unreliable. I saw someone suggest Unison, but do you have any other suggestions specifically for Windows?

      --
      I only post comments when someone on the internet is wrong.
    3. Re:rsync by kinema · · Score: 1

      Even better, rsync against a local binary diff that was distributed via BitTorrent.

    4. Re:rsync by timeOday · · Score: 2, Informative

      I swear by cygwin, which lets you use rsync plus hundreds of other POSIX/Linux applications. These are the real McCoy, not half-baked re-implementations.

    5. Re:rsync by cronin1024 · · Score: 1

      This is a little crazy, but you might be able to get away with modifying the .torrent file, then forcing a recheck of the existing files. If the files are mostly the same (in the same position, i.e. no sliding data) this will work just fine.

    6. Re:rsync by Anonymous Coward · · Score: 0

      mod parent up

  10. In a word, Yes by cullenfluffyjennings · · Score: 4, Informative

    I've seen bittorrent used for several business critical functions. One example is world of warcraft distributing updates using it.

    1. Re:In a word, Yes by hedwards · · Score: 1

      I suspect that the GP is either referring to professional farmers or to Blizzard's own staff.

    2. Re:In a word, Yes by nabsltd · · Score: 5, Insightful

      For Blizzard, updates to World of Warcraft are very much a "business critical function".

    3. Re:In a word, Yes by Thundersnatch · · Score: 1

      I've never heard of any other organizational use besides Linux distribution and Blizzard's "critical business function". What are your other examples?

      It would seem to me that IP multicast tools and efficient client-server tools like rsync are far more useful in an organization for content distribution than BitTorrent for "critical" processess. You get better traffic locality, deterministic behavior, logging, and a lot of other necessities that you can't get with BitTorrent clients.

  11. World of Warcraft uses it. by gblackwo · · Score: 0

    Must be good enough for the rest of us.

  12. Cisco already makes a product to do this - WAAS by colinmcnamara · · Score: 5, Informative

    It is like Rsync on steroids. Cisco's Wan optimization and Application Acceleration product allows you to "seed" your remote locations with files. It also utilizes some advanced technology called Dynamic Redundancy Elimination that replaces large data segments that would be sent over your WAN with small signatures.

    What this means in a functional sense is that you would push that 4 Gig file over the WAN one time. Any subsequent pushes you would only sync the bit level changes. Effectively transferring only the 10 megabytes that actually changed.

    While it is nice to get the propeller spinning, there is no sense reinventing the wheel.

    Cisco WAAS - http://www.cisco.com/en/US/products/ps5680/Products_Sub_Category_Home.html

    --
    Colin McNamara - CCIE #18233 "The difficult we do immediately, the impossible just takes a little longer"
    1. Re:Cisco already makes a product to do this - WAAS by CaymanIslandCarpedie · · Score: 2

      I'm a huge fan of WAN accelorators (though I prefer the products from Riverbed), but not sure of the fit here (and is certainly isn't anything like what the OP is asking about). First, these devices aren't cheap especially if you need to communicate between tons of locations as seems to be the case here as each location will require a unit. Even the lower-end product in the category will easily run 10k. Second we don't know how much the files being moved once a month are similar. If not a majority identical, this product wouldn't really provide any benefit. If the file is basically identical then you can do the same thing with rsynch or similar for free. Even if they are identical, the functionality you are talking about is based on local caching. Since the files being moved are only moved once a month (depending on other data moving between sites and size of disk cache the unit has) that cached could likely have been cleared by the next months data is moved. Again rendering this expensive solution useless.
        use
      Again, I love WAN acceloration and if properly used and understood in the right situations, it is some of the most useful and worthwhile pieces of kit in any datacenter. However, for the use mentioned. Doesn't seem the right fit to me.

      --
      "reality has a well-known liberal bias" - Steven Colbert
    2. Re:Cisco already makes a product to do this - WAAS by Bert64 · · Score: 4, Interesting

      Bittorrent will transfer the differences too, if you make a new file overwrite an old one, it will replace any chunks which are different.

      --
      http://spamdecoy.net - free throwaway anonymous email - avoid spam!
    3. Re:Cisco already makes a product to do this - WAAS by Anpheus · · Score: 3, Informative

      BitTorrent is not very flexible in this regard and so if you have bits -added- to the middle, then everything after the first added bit will need to be updated.

      The worse case is of course, if you have new material at the beginning and everything is shifted. BitTorrent is not designed for that.

    4. Re:Cisco already makes a product to do this - WAAS by jamiebecker · · Score: 3, Insightful

      presumable "on steroids" means "with a fancy GUI".

      rsync does this too. rsync can push or pull.

      besides, there are plenty of rsync gui's, too.

      however, bittorrent is almost certainly the best solution for this purpose -- the real question is coherency. You always know that eventually you'll have a complete and perfect copy at each location -- but how do you know WHEN that copy is complete so you can work on it? if this is strictly a backup system, then it's not needed, but it's probably not a good thing to be using files as they're being written:

      some scripting -- rsync or BTdownload -- would fix this. copy the files to a working location when the update is complete, and then work from there while updates are restarted on the temp dir.

      --
      https://jamiesonbecker.com
    5. Re:Cisco already makes a product to do this - WAAS by Anonymous Coward · · Score: 1, Informative

      >BitTorrent is not very flexible in this regard and so if you have bits -added- to the middle, then everything after the first added bit will need to be updated.

      I disagree. Your point holds true if and only if we are talking about a single large file (e.g. a dvd image). As the question pertains to replicating fileS, bittorrent does appear to have the neccessary flexibility. Adding a new file or modifying a file in the middle of a torrent does not force the redownloading of the entire torrent or even any files that appear after the addition - it simply requires downloading the file that has been changed/added.

    6. Re:Cisco already makes a product to do this - WAAS by Anonymous Coward · · Score: 0

      "on steroids" is the secret sauce which costs $3000 per client!

    7. Re:Cisco already makes a product to do this - WAAS by psycho12345 · · Score: 1

      some scripting -- rsync or BTdownload -- would fix this. copy the files to a working location when the update is complete, and then work from there while updates are restarted on the temp dir.

      Scripting isn't even necessary, most torrent clients have a feature to move a completed torrent to another directory when completed.

    8. Re:Cisco already makes a product to do this - WAAS by he-sk · · Score: 3, Informative

      Even with a large file only the differences can be retransmitted with bittorrent, provided that the overall filesize doesn't change. At startup, bittorrent will verify the local data and then discard and redownload the chunks that don't match the checksum in the torrent file.

      But rsync would be a better solution in this scenario as it was explicitly designed for such a use and will handle changes to the file much better.

      --
      Free Manning, jail Obama.
    9. Re:Cisco already makes a product to do this - WAAS by jamiebecker · · Score: 1

      Right, several things to consider:
      1) Is it necessary to continuously seed
      2) Ongoing sync -- i.e., are these files changed on the fly

      --
      https://jamiesonbecker.com
    10. Re:Cisco already makes a product to do this - WAAS by LordMyren · · Score: 3, Interesting

      Use bittorrent to distribute git blobs. They are immutable & append only; perfect for something like bittorrent. All you'd really need is a good means of syndication via Atom, & end users capable of understanding SCM.

    11. Re:Cisco already makes a product to do this - WAAS by jelle · · Score: 1

      I wouldn't be surprised if it uses xdelta:

      http://xdelta.org/

      --
      --- Hindsight is 20/20, but walking backwards is not the answer.
    12. Re:Cisco already makes a product to do this - WAAS by complete+loony · · Score: 1

      I wouldn't say that.

      If you use bittorrent on a directory of files, and the files are not modified, but a new file is inserted in the middle. At worst a piece of the start and end of a couple files will be redownloaded (as the piece will overlap 2 files). The rest of the existing files should have the same contents even though the piece boundaries have moved. And even though the hash values would changed, the client should be able to confirm their contents.

      --
      09F91102 no, 455FE104 nope, F190A1E8 uh-uh, 7A5F8A09 that's not it, C87294CE no. Ah! 452F6E403CDF10714E41DFAA257D313F.
    13. Re:Cisco already makes a product to do this - WAAS by TheThiefMaster · · Score: 1

      Only if it all stays aligned. If you add a byte to the start, it thinks the entire file has changed.

    14. Re:Cisco already makes a product to do this - WAAS by Anonymous Coward · · Score: 0

      Bittorrent will transfer the differences too, if you make a new file overwrite an old one, it will replace any chunks which are different.

      And distribute destruction of your previous backup across all the machines.

    15. Re:Cisco already makes a product to do this - WAAS by adolf · · Score: 1

      Except, of course, for the fact that rsync isn't P2P. The questioner would end up flooding a central server (an issue which was noted in the question).

    16. Re:Cisco already makes a product to do this - WAAS by he-sk · · Score: 1

      That depends on the size of the changes. If only a few MB of the 4GB change then there wouldn't be much flooding.

      --
      Free Manning, jail Obama.
    17. Re:Cisco already makes a product to do this - WAAS by Anonymous Coward · · Score: 0

      IF it's actually just one big .ISO, then jigdo might do the trick. It's designed to reassemble CD images from old images and updated files. It would still be possible to retransmit the individual files for an update via bittorrent, and run jigdo to reassemble the image.

    18. Re:Cisco already makes a product to do this - WAAS by Bert64 · · Score: 1

      Only if the file changes in size, but there are way round that like using many small files, or a fixed size disk image... Really depends on the nature of your backup data.

      --
      http://spamdecoy.net - free throwaway anonymous email - avoid spam!
  13. Azureus by Anonymous Coward · · Score: 0

    Azureus, for instance, will happily check a directory regularly for torrents and just start downloading those. It should be trivial to apply some sort of external mechanism to PUTting such torrents in place on needed computers.

  14. Bittorrent is not secure by Hal_Porter · · Score: 1

    DHT or the like might seed your files outside the company. Ok, I'm too lazy to work out if that really is a threat, but I'm not sure that bitorrent is appropriate for data that you don't want to end up in the public domain.

    You could probably rig up a system where scripts check secure FTP servers for updates, and download them. Cascade the SFTP servers so that each one feeds out to two more, geographically close ones and you'll be ok. If possible only download diffs, not the whole thing. And find an SFTP client which will pull several files at a time since that gives better throughput on high latency connections which are window size limited.

    --
    echo -e 'global _start\n _start:\n mov eax, 2\n int 80h\n jmp _start' > a.asm; nasm a.asm -f elf; ld a.o -o a;
    1. Re:Bittorrent is not secure by jd142 · · Score: 5, Informative

      While security is always something to be considered, this from the question:

      "private network of retail stores connected to our corporate office (and to each other) with IPsec over DSL, and no access to the public internet"

      Private network? Check.
      No access to public internet? Check.

      So pretty much no way for the files to be seeded outside the company.

      And even if there were a way to seed on the internet when they don't have access to it, password protect the file so only a client with the password can download it. That's not unbreakable, but if a competitor wanted the information there are easier ways to get it.

    2. Re:Bittorrent is not secure by Hal_Porter · · Score: 1, Offtopic

      I've worked at places that use IPSec or VPN. A common problem is that the server is loaded down so the secure connection is rather slow. So people will use VPN to pick up email/access the intranet when they need to and plain IP for internet access. Or people will bring laptops home and use plain IP on their unsecured home wireless network. Problem with this scheme is that if one of the machines with the files on is abused like this you could potentially have a problem. And if the files end up on pirate bay or you'll probably get fired no matter how encrypted they are.

      Now in an ideal world everyone would understand things enough to not connect a secure machine to the public internet and also that it doesn't matter if files leak if they are sufficiently encrypted. Or that as you put it "if a competitor wanted the information there are easier ways to get it".

      But we don't live in that world.

      Mind you I'd define sufficiently encrypted as something much more secure than a passworded ZIP file.

      --
      echo -e 'global _start\n _start:\n mov eax, 2\n int 80h\n jmp _start' > a.asm; nasm a.asm -f elf; ld a.o -o a;
    3. Re:Bittorrent is not secure by rusl · · Score: 1

      Well, you could easily turn off DHT function (not all clients have that ability anyway). Then you could maybe have an IP whitelist similar to the functionality of the azureus plugin that blacklists certain RIAA/MPAA peers. And then you could even define certain ports only - those ports could be encrypted tunnels or something? I wouldn't know how to do all that stuff myself (well, turning off DHT is simple) but it seems that all these kinds of features could be tweaked if someone really was serious about it.

      --
      Stupidity is its own reward.
    4. Re:Bittorrent is not secure by nabsltd · · Score: 1

      DHT or the like might seed your files outside the company. Ok, I'm too lazy to work out if that really is a threat, but I'm not sure that bitorrent is appropriate for data that you don't want to end up in the public domain.

      Every BitTorrent client that supports DHT also has the ability to disable it.

      In addition, since this is a VPN network, the client IP addresses are likely to be non-routable, so even if you did leak the torrent through DHT, it's pretty unlikely that anyone outside the company would be able to connect to a client running at 192.168.1.1.

    5. Re:Bittorrent is not secure by SanityInAnarchy · · Score: 1

      password protect the file so only a client with the password can download it.

      I don't know of a good way to do that with BitTorrent. Simpler to just encrypt the whole file, so anyone who downloads it is just helping seed, and can't read the file.

      That's not unbreakable

      With a large enough key, and properly applied crypto, it can be unbreakable until quantum computers become feasible.

      As for DHT, I don't see where that's a problem -- trivial to simply disable it, or use a client which doesn't support it.

      --
      Don't thank God, thank a doctor!
    6. Re:Bittorrent is not secure by mlts · · Score: 1

      I would go as far as to recommend encrypting the files before putting them on a seed node. The simplest would be to use an archiving program that offers AES encryption (7Zip, WinZip, WinRAR, StuffIt) and give all branch sites the password.

      You can also use TrueCrypt volumes with a keyfile sent via E-mail and encrypted with the site admin's PGP or S/MIME key for better security.

    7. Re:Bittorrent is not secure by hezekiah957 · · Score: 1

      ...which totally misses the point of not needing the retail stores have to do anything...

    8. Re:Bittorrent is not secure by supernova_hq · · Score: 1

      Assuming (hoping) that these are *nix machines, and he has SSH access into them, the retail outlet shouldn't have to do anything. Just write a script that knows the location of the encryption key and have it decrypt and store the files. rTorrent has all the available functionality to run scripts after completing a torrent even after moving said completed torrent to another location (separate hard drive, etc).

    9. Re:Bittorrent is not secure by turbidostato · · Score: 1

      "So pretty much no way for the files to be seeded outside the company."

      So what? Since so many studies point out that your major security concern is *within* your company, and talking about hundreds of satellites, faulty seeding is not out of concern.

      Fortunately you can build up a P2P solution thant will check a centrally served hash signature previous to "bless" any downloaded stream.

    10. Re:Bittorrent is not secure by magamiako1 · · Score: 1

      You are vastly overcomplicating the situation, the OP said "There is no outside access to the internet through this connection".

      And how, exactly, is it going to go "outside the company" if the torrent is entirely internal anyway? You could have outside access and it's not going to start sending your data to the internet if your tracker and peers consist of entirely internal hosts.

  15. If the CIO expects "official" support... by aktzin · · Score: 5, Informative

    Personally I like the portable media shipment suggestions. But if your CIO/company requires enterprise software from a large vendor with good support, have a look at IBM's Tivoli Provisioning Manager for Software:

    http://www-01.ibm.com/software/tivoli/products/prov-mgrproductline/

    Besides the usual software distribution, this package has a peer-to-peer function. It also senses bandwidth. If there's other traffic it slows down temporarily so it won't saturate the link. Once the other traffic is done (like during your off-hours or maintenance windows) it'll go as fast as it can to finish distributing files.

    --
    Quantum mechanics: the dreams that stuff is made of.
    1. Re:If the CIO expects "official" support... by scottme · · Score: 3, Insightful

      Actually, there is a Tivoli product that does more or less exactly what the OP asks for: IBM Tivoli Provisioning Manager for Dynamic Content Delivery

  16. Re:Snail-mail USB sticks by SirLurksAlot · · Score: 5, Insightful

    Why would they want to pay for those USB sticks (and any shipping fees that might be involved) when they have a perfectly good network already in place to send the data in a secure manner? There are too many variables involved in using USB sticks as a means of transferring back-up data. Sticks could get damaged, lost, stolen, etc, not to mention that the server at each store would need to allow USB access which could potentially open them up to other security risks. Just imagine if someone at a store decided to plug in their own USB stick and swipe a few files. Nice idea, but there are too many risks involved with a physical transfer of data.

    --
    God, schmod. I want my monkey man!
  17. CIO's want pre-built software by obstalesgone · · Score: 1

    Get it pre-built and externally supported. It'll be a lot easier to fly by your CIO.

    The solution you suggested makes sense.

    1. RSA keys are shared across the network.

    2. A new file becomes available on your "central" server and is placed into a directory automatically shared by a bt client on the central server.

    3. A simple script on the central server checks a list of servers it needs to update, and tells each of them to initiate a transfer using the bittorrent protocol.

    4. ???

    5. Profit.

    1. Re:CIO's want pre-built software by obstalesgone · · Score: 2, Insightful

      Better yet, tack on:

      6. Give the script that handles this a name, build deployment tools, and release them under GPL.

  18. No, you fool! by bistromath007 · · Score: 5, Funny

    Haven't you been reading the warnings around here about how bad it is for the Internet? If big business starts using BT we'll microwave the baby!

    1. Re:No, you fool! by Mad-Bassist · · Score: 2, Interesting

      Oooooh... I can see the whole issue of throttling suddenly becoming very amusing as the corporate behemoths start slugging it out.

      --
      "The only legitimate use of a computer is to play games." - Eugene Jarvis
    2. Re:No, you fool! by Anonymous Coward · · Score: 0
  19. WAFS from GlobalScape by Anonymous Coward · · Score: 1, Informative

    We do something similiar using WAFS by GlobalScape (Previously Availl).

    http://www.globalscape.com/wafs/

    It provides bit-level updates to data either on a schedule or continuously, and can keep a specified file version archive too. The continuous update to HQ should keep DSL utilisation low.

  20. Chained client/server by Manfre · · Score: 4, Insightful

    Have you thought about building up a distribution tree for your sites?

    Group all of your stores based upon geographic location. State, region, country, etc. Pick one or two stores in each group and they are the only ones that interact with the parent group.

    E.g. Corporate will distribute the files to two locations in each country. Then two stores from each region will see that the country store has the files and download them. Repeat down the chain until all stores have the files.

    1. Re:Chained client/server by Anonymous Coward · · Score: 1, Informative

      Bittorrent will do this for you.
      Especially with Super-seeding/Initial seeding.

  21. load balancing? by abigsmurf · · Score: 1
    Bittorrent is incredibly wasteful for the initial seeding and is pretty intense on network equiptment. You have to be careful configuring all of the network settings, last thing you want is all of the stores either crashing their routers or maxing out the connections.

    why not spread out the backups? Limit the bandwidth of the backups to allow enough regular traffic and have different stores send their backups on different days

    1. Re:load balancing? by rusl · · Score: 1

      It's not rocket science to configure the bandwidth limits in just about any BT client. If I can seed things fast enough on my tiny little home ADSL connection with only 30kb up (and not overload things because I set the bandwidth caps below my maximum) I don't see why a much faster fancier network would fail.

      --
      Stupidity is its own reward.
    2. Re:load balancing? by abigsmurf · · Score: 4, Interesting

      It's not usually the bandwidth that kills networks for BT (although it does if you're not careful), It's the hundreds of temporary connections and half open connections it does. Lots of routers weren't designed for this and give up the ghost. You can configure them to lessen this (port forwarding, limiting it in the client) but some routers just can take it. There are also lots of routers which degrade over time with heavy BT usage and need occasional reboots.

    3. Re:load balancing? by Zironic · · Score: 1

      It doesn't appear like it's meant to get pushed to terribly many computers so a really low connection cap like 20 should probably be enough and I have a hard time seeing how that would trash any router.

    4. Re:load balancing? by Barny · · Score: 1

      Considering its all going to be running over an IPSEC tunnel it could cause problems unless his off-site routers are well built (dlink 4tl).

      A basic box running monowall is pretty good at this kind of routing requirements, mainly because all the IPSEC requires is more memory and processing power, which can be upgraded then :)

      --
      ...
      /me sighs
    5. Re:load balancing? by Anonymous Coward · · Score: 0

      have you been taking something?
      just use enterprise class hardware for enterprise class networks..

    6. Re:load balancing? by Zironic · · Score: 1

      Apparently D-Link Routers can die if you just look at them wrong, I don't think they're a good baseline for what can cause problems with routers :P

  22. BITSAdmin by Anonymous Coward · · Score: 0

    If you're using Windows XP or above, take a look at the built in tool "BitsAdmin."

  23. Captain disillusion by jonaskoelker · · Score: 4, Informative

    with IPsec over DSL, and no access to the public internet.

    Unless you have very long wires, some box is going to route them. Are those your own?

    Otherwise, your ISP's router, diligent in separating traffic though it may be, can get hacked.

    Why am I saying this? Not to make you don your tinfoil hat, certainly, but just to point out that if the scenario is as I describe, you're not 100% GUARANTEED to be invulnerable. Maybe a few tinfoil strips in your hair would look nice... ;)

    About the actual question: bit torrent would probably be fine, but if most of the data is unchanged between updates, you may want to compute the diff and then BT-share that. How do you store the data? If it's just a big tar(.gz|.bz2) archive, bsdiff might be your friend.

    If you push from a single seeder to many clients, maybe multicast would be a good solution. But that's in the early design phase I think, which is not what you need :)

    Best of luck!

    1. Re:Captain disillusion by Anonymous Coward · · Score: 0

      Otherwise, your ISP's router, diligent in separating traffic though it may be, can get hacked.

      It is clear you missed the bit about IPSec... or perhaps don't know what it does. The whole idea of VPNs, or encryption in general, is that they assume every point between you and your destination is Carol and they protect your data anyway. So yeah, you're right, it's not 100% guarenteed to be invulnerable, but to break it you'd have to guess an unreasonably hard to guess number. More like 99.9999% guarenteed (assuming you do your initial key distribution properly and don't let people do dumb things on the machines on each end of the VPN tunnerl)

    2. Re:Captain disillusion by Anonymous Coward · · Score: 0

      Yes the ISP's routers could get compromised, but that's the whole point of IPsec - so that someone in the middle can't read your traffic.

    3. Re:Captain disillusion by Anonymous Coward · · Score: 0

      Unless you have very long wires, some box is going to route them. Are those your own?

      Otherwise, your ISP's router, diligent in separating traffic though it may be, can get hacked.

      Unless the traffic is encrypted with IPsec before it gets to the ISP's router.

      Many large ISPs offer a hosted VPN service, where the ISP manages everything for you. From the article summary it isn't clear if they use a hosted VPN service or they just purchase regular DSL and have a VPN router that is configured to null-route non-VPN traffic.

      And it doesn't matter who owns the router, but who pwns it :)

    4. Re:Captain disillusion by Seth+Kriticos · · Score: 1

      As far as I know there is only one thing in life that is somewhat 100% GUARANTEED, and that is, that it ends. Everything else is just a question on probability.

      To go back to the topic: If you don't trust your ISP (legitimate thing) then you should encrypt the data before sending.

      If you use a sophisticated encryption algorithm (like AES or serpent) and then send it out, then the listeners will have some problems reading the data. You probably even could use public BT in such a case with rsync'ed .torrent files.

      To go back to my original topic, the smartest thing to do is to lower the probability of data infiltration. So use a IPSec'ed VPN to send out encrypted form of the archives via rtorrent.

      I know, eats resources, but it would be fun, no?

    5. Re:Captain disillusion by TheRealMindChild · · Score: 1

      Seriously... if you do your encryption right, it doesn't matter who is in-between. Have the initial transfer between both of them be a public end of a PGP key. There is no man-in-the-middle attack for that.

      --

      "When life gives you lemons, don't make lemonade. Make life take the lemons back!" -- Cave Johnson
    6. Re:Captain disillusion by Anonymous Coward · · Score: 0

      You don't even have to calculate the difference between updates to push just the difference - bittorrent automatically splits the files up into chunks which it calculates the md5 to. If you set it up right bittorrent will calculate the md5s of all the chunks of the previous update and update only the chunks with different md5s.

    7. Re:Captain disillusion by jozmala · · Score: 1

      assuming you don't let people do dumb things on the machines on each end of the VPN tunnerl

      Thats quite an assumption. Perhaps the one that fails when scaled to tens or hundreds of shopkeepers.

      --
      ©God :Copyright is exclusive right for creator to determine the use of his creation.
    8. Re:Captain disillusion by jonaskoelker · · Score: 1

      You don't even have to calculate the difference between updates to push just the difference

      I think it'd help

      bittorrent automatically splits the files up into chunks which it calculates the md5 to.

      It uses sha1, not md5: http://en.wikipedia.org/wiki/BitTorrent_(protocol)

      If you set it up right bittorrent will calculate the [hash value]s of all the chunks of the previous update and update only the chunks with different [hash value]s

      So if you push_front a singe 0, every chunk will change (they all have the same size, which you can configure), so you have to download everything again.

      If you do compression with an infinite window, it's likely that a single small change in the source will change a lot in the compression.

      Computing the diff seems like it most reliably will limit your bandwidth usage.

    9. Re:Captain disillusion by jonaskoelker · · Score: 1

      Yes the ISP's routers could get compromised, but that's the whole point of IPsec - so that someone in the middle can't read your traffic.

      They can do traffic analysis to see which protocol(s) you're using. They can launch attacks against your boxes. They can start messing with BGP.

      If someone wants to be evil towards you, having them break into your ISP routers is bad. Without encryption it's worse, but with encryption it's still bad.

      What I found is that a machine that costs 10 megadollars can break 1024bit RSA in a year; I think it was key recovery, not just message recovery. Depending on who you think your adversary is, this might make you uneasy.

    10. Re:Captain disillusion by Anonymous Coward · · Score: 0

      with IPsec over DSL, and no access to the public internet.

      Unless you have very long wires, some box is going to route them. Are those your own?

      Otherwise, your ISP's router, diligent in separating traffic though it may be, can get hacked.

      If the ISP for this business provides a private "cloud" such as is often the case with old Frame Relay technology, and the ISP provides the head end IPSec targets (to which corporate also connects), then this would not hammer the corporate connection save for one full seeding of the files.

      OTOH, if the OP has the VPN head end at his corporate office, and each field site connects to it (via IPSec) then the corporate head end is routing for ALL the sites ... thus crushing the corporate connection.

      One recommendation of course is to limit the seeder's bandwidth to keep the corporate connection from being hosed during distribution.

    11. Re:Captain disillusion by UncleTogie · · Score: 1

      What I found is that a machine that costs 10 megadollars can break 1024bit RSA in a year; I think it was key recovery, not just message recovery. Depending on who you think your adversary is, this might make you uneasy.

      So we should be worried about someone breaking a key that's changed at least twice in the one-year interim?

      Call me crazy for asking, but why?

      --
      Don't tell me to get a life. I'm a gamer; I have LOTS of lives!
  24. Re:SurgePlus Offsite File Synchronization. by Blowit · · Score: 1

    Here are the direct links for the product:
    http://www.netwinsite.com/surgemail/index.htm

    http://www.netwinsite.com/surgeplus/index.htm

    --
    *Headline News* censorship shuts down the Internet! More at 6PM!
  25. How I would do it... by LuckyStarr · · Score: 5, Interesting

    ...is quite straight forward in fact.

    1. Create a "Master" GnuPG/PGP Key for yourself. This key is used to sign all your data as well as your RSS feed (see below).
    2. Set up an RSS feed to announce your new files. Sign every entry in it using your "Master-Key".
      • All the stores check the validity of your RSS feed via your public key.
      • All the stores have one (or the same) GnuPG/PGP key to decrypt your files. The beauty of GnuPG/PGP is that given many destinations you can encrypt your data so that every recipient (each with their own key) can decrypt them. Nice, eh?
    3. Set up a standard BitTorrent server to distribute your files.
    4. Announce all your new files via your RSS feed.

    This has many advantages:

    The beauty of this system is that it relies heavily on existing technology (BitTorrent, RSS, GnuPG, etc), so you can just throw together a bunch of libraries in your favourite programming language (I would use Python for myself), and you are done. Saves you time, money and a lot of work!

    Furthermore you do not need to have a VPN set up to every destination as your files are already encrypted and properly signed.

    Another advantage is: As this is a custom-built system for your use-case it should be easy to integrate it into your already existing one.

    --
    Meme of the day: I browse "Disable Sigs: Checked". So should you.
    1. Re:How I would do it... by w1d3 · · Score: 0, Redundant

      Using BT is pointless in the scenario you suggested. Every client will be getting a different file (encrypted by his public key) so use of BT will not save any bandwidth.

    2. Re:How I would do it... by Anonymous Coward · · Score: 0

      Um, no. Just no. For fuck's sake, try actually using PGP/GnuPG and understanding its capabilities.

    3. Re:How I would do it... by msoftsucks · · Score: 4, Informative

      Not necessarily true. PGP allows you to sign with multiple keys. Each site would have their own key that they would use to decrypt the file. One file, multiple keys, multiple users. Simple.

      --
      Quit playing Monopoly with Bill.
      Linux - of the people, by the people, and for the people.
    4. Re:How I would do it... by itzdandy · · Score: 1

      why would they need to use gnuPG? The submitter did say that this was on a private network over ipsec links with no access to the internet.

    5. Re:How I would do it... by LuckyStarr · · Score: 1

      Because all his shops must then be connected to every other shop. This are many ipsec links to maintain. It's easier to just use the internet and encrypt your data.

      --
      Meme of the day: I browse "Disable Sigs: Checked". So should you.
    6. Re:How I would do it... by itzdandy · · Score: 1

      because his shops are not on the internet. The router is on the internet but the shop behind the router is VPN tunneled to the main shop.

    7. Re:How I would do it... by LuckyStarr · · Score: 1

      You said it. He would gain nothing from using BitTorrent over - say - a HTTP download if he has to route the traffic of all(!) his shops through his main shop.

      --
      Meme of the day: I browse "Disable Sigs: Checked". So should you.
    8. Re:How I would do it... by itzdandy · · Score: 1

      except that his routers are on the internet. Because of that, he may be able to setup ipsec tunnels between remote branches allowing bittorrent to be usable.

      His clients would all still be locked away inside his private network, but that network would have a mesh layout where each site can see some or all of the other sites.

    9. Re:How I would do it... by LuckyStarr · · Score: 1

      Yes he may. Though if he uses of-the-shelf VPN routers the number of ipsec tunnels may be limited (via license or something else) and if he has only 10 shops he would already need 24 tunnels, which have to be configured at either end. So that makes 48 configuration entries. A lot of work and a lot to maintain.

      --
      Meme of the day: I browse "Disable Sigs: Checked". So should you.
  26. How is the VPN setup by eagle486 · · Score: 5, Informative
    If the VPN is setup in a standard hub and spoke configuration then bittorrent would not help since all traffic between sites has to go via the central site.

    Your best bet is multicast, there are programs for software distribution that use multicast.

    1. Re:How is the VPN setup by Anonymous Coward · · Score: 0

      If the VPN is setup in a standard hub and spoke configuration then bittorrent would not help since all traffic between sites has to go via the central site.

      Did you read the article summary? (emphasis added)

      "private network of retail stores connected to our corporate office (and to each other) with IPsec over DSL"

    2. Re:How is the VPN setup by turbobug · · Score: 1

      Multicast is indeed the best solution unless the network is fully or highly meshed. Ideally, rsync like behavior over multicast (either bandwidth controlled or in the network quiet times) would be my choice.

    3. Re:How is the VPN setup by Thundersnatch · · Score: 1

      To get rsync-like behavior using a multicast file transfer tool, one could use something like xdelta3 to distribute difference files repeatedly. And the file format is actually an IETF draft standard (which cannot be said of rsync or rdiff-backup).

  27. it's called dsync by slashdotmsiriv · · Score: 5, Interesting

    and you can find documentation for it here:
    http://www.cs.cmu.edu/~dga/papers/dsync-usenix2008-abstract.html

    It is rsync on steroids that uses a BitTorrent-like P2P protocol that is even more efficient because it exploits file similarity.

    You may have to contact the author of the paper to get the latest version of dsync, but I am sure they would be more than happy to help you with that.

    1. Re:it's called dsync by slashdotmsiriv · · Score: 4, Informative

      I hate to reply to my posts, but this link has an even shorter description of the tool:

      conferences.sigcomm.org/sigcomm/2008/papers/p505-puchaA.pdf

    2. Re:it's called dsync by kava_kicks · · Score: 1

      I read the abstract and it sounds very interesting. Any idea where to get the source code from???

    3. Re:it's called dsync by chill · · Score: 1

      Unfortunately there is a program out there called dsync, which isn't what you're referencing. This sounds very interesting. Any idea where to see sample source code?

      --
      Learning HOW to think is more important than learning WHAT to think.
    4. Re:it's called dsync by chill · · Score: 1

      Better to reply to you than to my own "where's the source?" post.

      Check this out: http://www.cs.cmu.edu/~dot-project/

      --
      Learning HOW to think is more important than learning WHAT to think.
    5. Re:it's called dsync by kava_kicks · · Score: 1

      The program is actually called dsynch ... they just didn't pick an original name. See this link for DOT source and article: http://www.cs.cmu.edu/~dot-project/ http://www.cs.cmu.edu/~dga/papers/dsync-usenix2008-abstract.html

    6. Re:it's called dsync by kava_kicks · · Score: 1

      Close ... DOT is the underlying transfer protocol they are using, but not the whole application that makes up dsync. From what I can tell, dsync takes DOT and adds some throttling algorithms etc. Anyway, DOT looks pretty damn good too.

  28. Call me old fashioned by Hognoxious · · Score: 1

    I'd get a station wagon and fill it with tapes. Go on, mod me "-1 old fashioned"

    --
    Confucius say, "Find worm in apple - bad. Find half a worm - worse."
  29. Cleversafe? by Anonymous Coward · · Score: 1, Informative

    You should take a look at cleversafe.org - it's an opensource 'dispersed storage' infrastructure which allows you to slice up files and distribute them across a network of storage servers. Not sure if this would get you what you want, but it's worth looking into.

  30. Foldershare? by MunkieLife · · Score: 1, Informative

    I like the bittorrent idea more... but if you're looking for something simple and free - Foldershare. Not sure if this works for you, but I use Foldershare to sync files between several of my offices. It is peer to peer, with a central server to initiate the connection. If you have a 4GB file, perhaps you could rar it into smaller pieces, then this could work for you. If you don't have an internet connection though, this totally won't work for you. Heh.

    1. Re:Foldershare? by HotTuna · · Score: 1

      Thought about Foldershare, but it needs to talk to MS servers, and that's not going to happen... Also, not sure if there's a limit to the number of peers...

  31. Rsync or DFS sound like good choices by Anonymous Coward · · Score: 0

    You don't say if the files are changed at the remote sites, or just at head office.

    Rsync is an option - have 10 remote sites replicate from the master, then have other stores replicate from the submasters.

    You don't say if you're running windows, but the distributed file system works pretty well. Supports remote differential compression.

  32. uhhhhh multicast? by branto · · Score: 0

    sounds like a problem that multicast-based file transfer is designed to solve. http://www.tcnj.edu/~bush/uftp.html You said IPSec VPNs, but is it just ipsec, or is it gre inside ipsec? If there's no GRE, then forget what I said.

  33. Use existing technology by Mostly+a+lurker · · Score: 5, Funny

    CIOs are notoriously conservative. Any solution you suggest that involves building a solution from scratch will scare them. The solution is to use existing proven technology. In the MS Windows world, at least, root kits have been distributing updates successfully for years. You should be looking at simply modifying an existing root kit to your requirements.

    1. Re:Use existing technology by Anonymous Coward · · Score: 0

      CIOs are notoriously conservative

      ...unless you buy them dinner.

    2. Re:Use existing technology by jozmala · · Score: 1

      You have to be outside of company for that to work.

      --
      ©God :Copyright is exclusive right for creator to determine the use of his creation.
  34. IPSec over DSL by kabloom · · Score: 1

    Are you using IPSec in Tunnel mode or Transport mode? If you're using it in tunnel mode, then you're not going to fix your bandwidth problem, because all data has to go through corporate HQ anyway because that's where the tunnels end.

  35. Downtime? by Anonymous Coward · · Score: 0

    Um, ok the data goes down and you have it everywhere but the main site that needs to download it again at a capped rate. How do you get it back to the hosting data site rapidly enough to be useful?

    A encrypted usb memory key and a stamp go a long way.

    Seriously it's a good idea and nice in practice but have you ever tried sitting there on your hands while a boss with a whip watches you download the company files at 150k/s. If this is to be able to backup your branch office sites and restore remotely that is fine, I just wouldn't want a 3hr downtime to show up on my record while you retransmit data.
    I actually have backups go to other sites across the nation now because of hurricanes damage or if the world ends and the future civilization's life hangs in the balance of our spreadsheet data.

    It is more my last line of defense.

  36. Re:Yes yes yes!!! by kurt555gs · · Score: 0, Offtopic

    To bad I don't have any mod points, this one (above troll) is better than the usual.

    --
    * Carthago Delenda Est *
  37. Re:Yes yes yes!!! by Anonymous Coward · · Score: 2, Funny

    Slashdot: news for nerds, stiffs that matters.

  38. Re:WTF? by the_B0fh · · Score: 1

    Please, think of the PFYs. His DR fileset is only 4Gigs. My pr0n is bigger than that. ASCII/text pr0n!

    Others have already given him the best solution for his case - DVDs. Overnight them, and he is done. Latency may be a bit much, but not that much more than doing it over DSL or dialup.

    Now, lets go back to discussing OT stuff.

  39. Depending on VPN topology by Razron · · Score: 1

    Most VPN setups like this are hub and spoke with the central office being the spoke. So connections that go from one remote sit to another still have to go through the central office. So you still have a bandwidth problem at the central office. If you have your VPN setup as a mesh so it has connections to multiple sites you might be able to get this to work. The problem you run into then is most inexpensive VPN solutions will only be able to handle so many different VPN tunnels before they run out of CPU. Not know what you used to setup your remote offices as a VPN concentrator this may not be a problem.

  40. Hadoop DFS sounds more appropriate by kevinodotnet · · Score: 1

    Why not use the Hadoop distributed file system? It offers automatic replication and you can treat each "store" as a "rack" to guarantee multiple remote backups.

    You also get the immediate advantage of having a single file namespace and instant streaming access to all of the files from any single location.

    The only advantage to Bittorrent that I can see is faster recovery time since a single store can source the backup from from N other stores (instead of 2, or whatever number of replications you have decided on).

  41. Re:Yes yes yes!!! by Anonymous Coward · · Score: 0

    No way, that's kiddie pr0n! And with a (probably) illegal alien at that!

    For shame.

  42. Re:Snail-mail USB sticks by hedwards · · Score: 2, Insightful

    Because depending upon the actual files that might be overkill. For recovery files there's probably a lot of similar or same files in each batch. Something like Jigdo, rsync or distributing diffs might be a lot more efficient.

    With those the main concern is having an appropriate client to automatically handle the updating on that end.

    Most of those options would also be capable of checking the integrity of previous updates and could be run more frequently just to verify that the data is uncorrupted. I think that bittorrent has similar capabilities.

  43. Sub.TV already does this by Anonymous Coward · · Score: 0

    Sub.tv use bittorrent to distribute large video files to plasma screens in student unions - they auto-download - IIRC, it's an older Azureus client, presumably written with a plug-in, that ran on an always-on windows box.
    It seems an entirely appropriate mechanism for it, and they're already doing what you seem to want!

  44. Re:Yes yes yes!!! by Anonymous Coward · · Score: 0

    A 17 y/o having sex with a 15 y/o is legal almost everywhere, except some US states.

  45. rsync by ratsbane · · Score: 1

    I've set up something similar to this. You almost certainly don't need to transfer ALL of the 4gb every month - you just need to update a copy in the corporate office with all of the changes from the locations. Rsync is the answer. It figures out what's changed and only transfers the changed stuff, which is typically a trivial amount. Rsync is a brilliant piece of work. it's made for exactly the sort of thing you're trying to do. It will work so well you'll think there's some kind of quantum voodoo going on. Also, check out rdiff-backup. There's a version for windows and you can rsync easily between windows and *nix. If security is an issue (and it sounds as if it isn't) you can rsync over ssh, too.

  46. Use Existing Tape Backup Software Features by Anonymous Coward · · Score: 0

    I have used commercial packages like the Enterprise Backup Solution we already use to backup data to tape to mirror files. Even across a SLOW AS CHRISTMAS T1 connection it works VERY well to only copy the files that change on a daily basis. So, unless you are modifying GIGS of data at-a-time, keep it simple.

  47. Lotus Domino! by Anonymous Coward · · Score: 0

    Lotus Domino! It's replication keeps databases/websites and documents/files contained within them in sync across multiple servers. You can specify how the data is distributed across the network with connection documents.

  48. Integrate a datacenter server by Korkman · · Score: 1

    Setup a cheap file server in a datacenter, hook it up into your VPN network and store all backups there. Use rsync - very fast, uses SSH nowadays for auth and encryption. Encrypt the whole backup partition (dmcrypt, truecrypt, etc.) and keep the key private. Manual mount and key entry after rebooting. That way datacenter operators can't (easily) gain access to the files. Or transfer already encrypted files, which will destroy rsync performance, though.

    Set SSH and all the other services to listen on the VPN IP only, making the machine invisible to the common internet.

    Not as fancy as Peer-To-Peer distribution, but very reliable and fast. Also you get less administrative headaches, I think.

  49. Windows DFS -- Dont use FRS by anexkahn · · Score: 5, Informative

    In windows 2003 R2/Windows Server 2008 they really improved DFS. It lets you set up throttling in 15 minute increments, and with Full Mesh replication, it decentralizes your replication..kind of like bit torrent. However, you have to make sure you don't accidentally use FRS, because it sucks. Where I work we have 5 branches that pull data from our data center. I have DFS replication setup so I can have all our software distribution at the local site. I need to keep the install points at all the sites the same, so I use DFS to replicate all the data, then to get to it I type \\mydomain.com\DFSSharename Active Directory determines what site I am in, then points me to the local share. If the local share is not available, it points me to the remote share, or to a secondary share in the same site...so it gives you failover for your file servers. If you don't have any windows boxes, this wont work, and this really locks you into Microsoft, but it won't cost you anything more than what you have already paid. Below is a link to Microsoft's page with more information, including how to set it up: http://www.microsoft.com/windowsserver2003/technologies/storage/dfs/default.mspx

    --
    Curious about Storage and Virtualization? Check out
    1. Re:Windows DFS -- Dont use FRS by Anonymous Coward · · Score: 0

      mod parent up, this is how to do it. As much as I hate windows, I hate to say that R2 replication works and it works well.

    2. Re:Windows DFS -- Dont use FRS by Anonymous Coward · · Score: 0

      Yup - I second DFS. We currently use it from one central location to push Ghost Images that are customized for each computer type to over 80 locations. Some of these locations are on a 768/128 DSL pipe. Drop a 4GB file on and in 24 hours, it's at every one of our locations, even if we drop it first thing in the morning - doesn't mess with our network. I would recomend that your primary location have a decent amount of upstream bandwidth though.

    3. Re:Windows DFS -- Dont use FRS by Olematon · · Score: 1

      DFS is the easiest answer, IMO. Also, Samba 3 supports DFS out-the-box (well at least version 3.2, which we are using).

  50. Rely on someone else's bandwidth by Anonymous Coward · · Score: 0

    Use an existing service to provide it: http://bitsrepublic.com/

  51. NFS with DFS by flyingfsck · · Score: 1

    You could set up a NFS distributed file system. That may be more amenable to your boss and will have other advantages too.

    --
    Excuse me, but please get off my Pennisetum Clandestinum, eh!
  52. Try looking into CleverSafe by Anonymous Coward · · Score: 0

    www.cleversafe.com

  53. Do not use BitTorrent by Anonymous Coward · · Score: 0

    Take a look at your company's network topology. If it is a typical branch setup, like "hub and spoke" where your branches are all remotely connected through the central head office, then BitTorrent will waste bandwidth. Why have a peer to peer application like Bittorrent routing traffic from a branch, up to the head office, and back down to another branch? You do not want to impact other applications running across the WAN on a remote branch.

    Unless your WAN topology is fully meshed, peer-to-peer apps are probably not so efficient. It's better to use a direct-push strategy. Take a look at Microsoft DFS (distributed file system) - you can control replication links and times, or use a protocol like FTP and put QoS network restrictions on it. Schedule pushes for off-peak hours, where possible. Stagger updates to each branch, if necessary. My company uses an IBM product called Tivoli to push updates to branches, because it has bandwidth control capabilities. There are other apps like this out there (probably cheaper as well).

    BitTorrent is better suited to Internet downloads, and because bandwidth is controlled autonomously in each client, what's to prevent client's in different sites from hogging all the bandwidth in any given branch?

  54. More commercial solutions... by Strawberry · · Score: 1

    Both Kontiki and Ignite sell enterprise-type (supported, maintained etc.) P2P systems that can be deployed internally if you need something off-the-shelf.

  55. Push? by unixan · · Score: 1

    I would need to 'push' the files out, and not rely on users to click a torrent file at each site.

    Every heard of remote login, especially ssh <host> <command>?

    Sure, go ahead, mod me -1 Obvious.

    --
    This signature intentionally left unblank.
  56. Just don't call it BitTorrent by Anonymous Coward · · Score: 2, Interesting

    ...or P2P when you first mention it to the CIO.

    I would venture most CIOs' exposure to such things has been limited to what the popular media is pushing: BitTorrent == PIRACY.

    I'd recommend sticking to vague terms like "Distributed file transfer".

    1. Re:Just don't call it BitTorrent by 1u3hr · · Score: 1
      I would venture most CIOs' exposure to such things has been limited to what the popular media is pushing: BitTorrent == PIRACY.

      It's CIO, not CEO. If a CIO is so clueless, don't waste time on this, concentrate on getting your resume in order.

  57. More questions.. by Junta · · Score: 1

    What platform is used?
    Is it scriptable readily?
    How scheduled are the updates?
    How similar is the data day to day?

    Things come to mind as a tradtionally Unix admin:
    -cron job to download the file using screen and btdownloadcurses
    -ssh login to each site and do the same (if need to push at arbitrary times)
    -rsync (if the day-to-day diff is small, might as well do this)

    Analogous procedures can probably be down for whatever platform you choose. Learning how to generically apply this strategy in the platform of choice is vital for any administrator of a distributed system.

    --
    XML is like violence. If it doesn't solve the problem, use more.
  58. Re:better approach by Anonymous Coward · · Score: 1, Insightful

    Too late, bub. There is a better one above.

  59. The question remains.. by Junta · · Score: 2, Insightful

    How are they connected to each other? If the same bottleneck router is used to reach each other, then it is a mott point. People often forget about the underlying network workings and abstract away that important detail. They can reach each others IPs, but that is not to say all traffic goes through the same weak link in the chain regardless.

    --
    XML is like violence. If it doesn't solve the problem, use more.
    1. Re:The question remains.. by FishWithAHammer · · Score: 1

      I would think that "and to each other" would mean connections to other stores as opposed to a central router, but the summary does kind of suck.

      --
      "You can either have software quality or you can have pointer arithmetic, but you cannot have both at the same time."
    2. Re:The question remains.. by Junta · · Score: 1

      If using IPSec to maintain a private network over an untrusted provider, I find it hard to believe that they actually have full mesh configured. It's possible, but unlikely...

      --
      XML is like violence. If it doesn't solve the problem, use more.
    3. Re:The question remains.. by itzdandy · · Score: 2, Informative

      I also assumed that this was hub and spoke and that the "to each other" statement was just routing. Depending on the number of remote sites, and that he did not mention a specific hardware supplier, I would assume that a meshed ipsec VPN setup would be a task to maintain as it would likely be all manual.

      I am all for open source systems but find that Cisco 8xx series routers are well priced(under $500) and easily managed for easy mesh vpn setups for up to 20 links. I run this setup with a ASA5510 at the center and each site connected to the ASA and 4 other sites for remote administration office and any other connections are just routed. Basically a hybrid hub&spoke + appropriate meshing.

    4. Re:The question remains.. by Curmudgeonlyoldbloke · · Score: 1

      The way that he phrased the summary suggested not to me.

      It read like he's got a central VPN server at the "corporate office" with the shops connecting to that. I would guess that shops can route to each other, but it's not going to help corporate office bandwidth if shop A can only get to shop B via the centre.

    5. Re:The question remains.. by FishWithAHammer · · Score: 1

      Well, yeah, I just assumed that nobody would be that dumb, I think. I mean, it isn't even a QUESTION if it's all being routed over a central router.

      --
      "You can either have software quality or you can have pointer arithmetic, but you cannot have both at the same time."
    6. Re:The question remains.. by mevets · · Score: 1

      | Well, yeah, I just assumed that nobody would be that dumb

      you must be new here....

  60. I must say.. by Junta · · Score: 1

    I;m guilty of abstracting away that detail in contemplating his article.

    If it proves his network architecture has the same bottleneck either way, all the more reason he needs to take a hard look at is data and how amenable it is to rsync.

    --
    XML is like violence. If it doesn't solve the problem, use more.
  61. Kontiki by MikeD83 · · Score: 2, Informative

    I work for a large company (>50,000 employees). IT recently rolled out a new "video delivery service." The system delivers videos to everyone's desktop. The system is designed by Kontiki. It's basically an enterprise BitTorrent tool which Kontiki prefers to call, "peer-assisted."

    1. Re:Kontiki by thintz · · Score: 2, Informative

      I too work for a large company that rolled out Kontiki. Like the previous poster mentioned Kontiki is a commercial, enterprise class BitTorrent like tool. We also use it to deliver video to the desktop. I havn't worked directly with those guys for years but believe you could use it for most any type of content. I believe they can handle your security needs as well as dynamically adjust how much bandwidth they are using based on a number of different criteria. I'd give them a call to at least inquire further. http://www.kontiki.com/

  62. Why do you have 20th century bandwidth? by Anonymous Coward · · Score: 0

    A company well-funded enough to have "C-level" execs, shouldn't have ghetto bandwidth.

  63. Re:Snail-mail USB sticks by darkpixel2k · · Score: 1

    4GB of files once per month, why bother using the network?

    No one ever seems to answer the question. The dude has his reasons.

    I find myself in a similar situation. 7 offices connected via Comcast cable. Every single office has a local backup to a USB-attached external hard drive. But they also want off-site backups in case of fire or flood. Making a rount-trip between the 7 offices takes half a day. None of the staff at the offices are technically competent. They used to do tape-backups at each office, but people would forget, tapes would go back, staff didn't know how to check/verify backups, etc... They want an automated system that doesn't require their staff to do anything. Take the human failure component out.

    It's easy enough to script a local backup using a .vbs file and ntbackup, but it's difficult to replicate all those remote offices back to the main office. It overloads the connection.

    I've considered and played copying the ntbackup file off-site every night, but the bks files are anywhere from 2 GB to 50 GB. I've tried BackupPC and a few other apps that run well on Linux, but they don't run so well when accessing Windows boxes.

    It would be great to be able to script a copy of the data to a backup directory, create a .torrent file, and then drop the .torrent file into a directory on the servers that needed to download and store the torrent.

    --
    There's no place like ::1 (I've completed my transition to IPv6)
  64. use udpcast by Anonymous Coward · · Score: 0
  65. This is a solved problem by Antique+Geekmeister · · Score: 3, Interesting

    It's spelled 'NNTP'. Look at how Usenet newsgroups, especially for binaries, have worked for decades for a robust distribution model. The commands to assemble the messages can be scripted as well.

    Similarly, the bottorrent files you describe can also be pushed or pulled from a centralized target list and activated via SSH as needed.

    1. Re:This is a solved problem by Anonymous Coward · · Score: 0

      I'm glad that someone else thought of NNTP. If you set up the peering properly (fairly easy to do) it would be trivial to have each site feed the files to two (or more) sites that would in turn feed it to two (or more) other sites, so on and so forth. So you end up feeding the original files a small number of times with each site then sharing the load to feed other sites.

      Further if you feed a close and a far peer, they can both start feeding towards each other and have a flood rather than a gradual wave across the network.

      As a bonus, you can feed files from any where in the network and the distribution system will work the same.

  66. Re:Yes yes yes!!! by Anonymous Coward · · Score: 0

    but its illegal to write stories about it...

  67. We are... by Anonymous Coward · · Score: 0

    We have been working on our own proprietary protocol that resembles BitTorrent but offers a bunch of features BitTorrent doesn't.

    It's not BitTorrent, just like it. We need it for transferring up to 50 GB+ of data around the world every week.

    I agree that for your purposes, a simpler solution is probably in order though. RSync can be very powerful with a scripting layer on top of it. Others have also mentioned iTorrent which is an option.

  68. linux and bittorrent, and some light scripting. by itzdandy · · Score: 1

    consider scripting the process of creating a torrent file of the data that needs replication. At each remote site, run some linux or bsd system and setup ssh keys so the central server can run a script on each remote machine.

    setup a local bittorrent tracker.

    On the main server, script building the torrent file and run an upload script against a list of remote sites that would download the torrent file via scp and run it until it has seeded out a given amount OR has run for x days.

    The only issue here that I see is that you said that you are using ipsec over DSL which implies that all of your bandwidth goes through the central site anyway. You would need to build ipsec tunnels between sites and make sure that you have routes in place to use the secondary tunnels for appropriate IP addresses.

  69. Multicast? S3? by _LORAX_ · · Score: 1

    Why not send it simultaneously to all locations using multicast?

    What about uploading an encrypted version to S3 which can then be downloaded via torrent or the S3 API?

    1. Re:Multicast? S3? by doctorcisco · · Score: 1

      Why not use multicast? Because the actual traffic from the hub site is IPSec packets sent individually to each remote site. Putting multicast packets inside each of those IPSec packets doesn't change the fact that a copy needs to cross that wire for each remote site. It won't help his bandwidth issue unless the spoke routers are meshed and doing multicast routing, neither of which is terribly likely.

      doc

  70. How I did it by Anonymous Coward · · Score: 0

    We needed a better solution for pushing out server images to all of out data centers automatically. These images were 5 to 50 gb in size. We have 80gb pipes connecting data centers but we needed to get the images to all imaging servers as fast as possible. We setup a central server that acted as our index of torrents as well as our tracker. The daemons on the remote servers were configured to monitor the torrent index and when a new one was added it downloaded it. Once the server downloads the file it seeded indefinitely so when a new imaging server came online it would seed for it. We are a .net shop so we used mono torrent for both the tracker and integrated it into our imaging daemon. The index was just a web server directory with directory browsing turned on. Works like a charm.
     

  71. Dropbox by Brainimus · · Score: 1

    What about something such as dropbox... http://www.getdropbox.com/

  72. Re:Why do you have 20th century bandwidth? by JSBiff · · Score: 1

    Well, why provision the data center with more expensive bandwidth, if a p2p solution can solve the problem without spending much/any extra money? Don't ever buy more of a resource until you are efficiently using the resource. Only if you are using it efficiently (or at least, as efficiently as you really can), and it's *still* not enough, should you actually buy more.

    Businesses are pretty adamanant about expense justification (and they should be). You have to justify any expenses, and even when they are justified, if the company doesn't have the money, they won't spend it (usually).

  73. bittorrent URLs need to work with browsers ... by Lazy+Jones · · Score: 1

    Browsers should support bittorrent-URLs right out of the box, there's really no excuse for not doing this. It would make hosting (large-ish) static content so much easier.

    --
    "I love my job, but I hate talking to people like you" (Freddie Mercury)
    1. Re:bittorrent URLs need to work with browsers ... by louiswins · · Score: 1

      You mean like Opera already does?

    2. Re:bittorrent URLs need to work with browsers ... by Lazy+Jones · · Score: 1

      You mean like Opera already does?

      No, it should work like any other kind of URLs i.e. <img src=http://blah.foo/image.jpg.torrent> should work and embed the image directly. AFAIK Opera just has a built-in downloader but no seamless support like this (correct me if I'm wrong).

      --
      "I love my job, but I hate talking to people like you" (Freddie Mercury)
  74. We do a simple P2P at work... by Pe_Ell · · Score: 1

    I've thought about working with either RSS or pushing .torrent files and then having torrent daemon's synce files for me. But so far that's been overly complicated as I have ~1500 production web servers to keep up to date on different code releases depending on what cluster they're in.

    Currently we have a database of our production equipment and we keep track of what cluster/role a server is in. So the boxes all run a cron every minute that checks to see if a new "version" of a release is available (any code change whether a new code release or just an update increments version for that cluster).

    Now we're use rsync for our transfer. So we keep our /etc/rsync.conf up to date with the potential subnets that we have internally so random outside machines on the WAN can't rsync to our boxes. But we also keep a "lock" table in our database along with the state of each server.

    So we have one super seed if you will which keeps a copy of code for all clusters. When you update a release (kept elsewhere) and then "push" the change you made gets copied to the super seed machine. Once that is done the version gets incremented and then the crons start to look for the new code.

    Basic idea to locking we we limit the number of peers a box is allowed to have. So in order to not impact production traffic it's set to 3 currently (plenty fast). So if no "peers" (any machine in the cluster that is not the seed) has the code it will be used first and the boxes that fired off the cron last will sit in a queue (waiting to get a lock on anything with code that hasn't reached it's limit of locks). So when it first starts out the first three boxes will fail over to the super seed. Then the peers will start to get it from those boxes once they are updated.

    Doing a code push to our larger cluster used to take 45 minutes to an hour (was done in sequence and not in parallel) but now takes about 5 minutes.

    My next goals are to distribute the super seeds and potentially use RPM distribution since I'm working on making the code release only restart the fewest service necessary in order to pick up the changes. And with RPM I can have those commands in the install portion. :)

    --
    Midget Tosser
    1. Re:We do a simple P2P at work... by Anonymous Coward · · Score: 0

      You might want to check out zsync, since it uses precomuted checksums it would reduce load on your seed machines and speed up transfers since the machines download the checksums all at once, not the rsync - "computes some checksums - check with the server to compare if the same, etc" loop

  75. Re:SurgePlus Offsite File Synchronization. by pipatron · · Score: 1

    So where is the '-1, Spam' mod?

    --
    c++; /* this makes c bigger but returns the old value */
  76. Re:Why do you have 20th century bandwidth? by itzdandy · · Score: 1

    agreed. Have you been watching the economy? well-funded shouldnt imply retarded.

    If you maintain a culture of appropriate thriftiness at every level of your organizations, you will likely never get to the point of having 1 executive riding a private jet that can move 20 people for a dinner meeting.

    That being said, bandwidth can be pretty cheap and at most places around the country you can get 20Mb of fiber for $500-$700/m.

    Remeber the key word in the phrase, "appropriate" thriftiness.

  77. Re:Yes yes yes!!! by Anonymous Coward · · Score: 0

    This being Slashdot, I was half-expecting Veronica to have a cock 'n' balls, or for the protagonist to end up eating her feces, or something like that. Instead, I was pleasantly surprised and aroused by the finale. Bravo!

  78. Re:better approach by Anonymous Coward · · Score: 0

    I never understood this crap. Is it supposed to drive people away? is it supposed to push slashdot into moderating? Is it supposed to prevent people from reading at -1?

  79. DistriBrute (aka BitRain) & DMVPN by kava_kicks · · Score: 1

    A group in the Netherlands has already commercialised BitTorrent to manage enterprise patch deployment. The product used to be called BitRain but was renamed to DistriBrute. You can talk directly to one of the developers Leo Blom: lblom AT iteleo.nl He was really helpful last time I talked to him about it. Also, as one of the other posts pointed out, if you are going to do this within your VPN cloud, you need ot make sure that the VPN tunnels are multi-point (each site can talk directly to the other) or you will not solve your problem (cause all traffic will go via the main hub). Please MOD this up as I am pretty sure this is exactly what he is after. http://www.4m88.nl/ Leo Blom

    1. Re:DistriBrute (aka BitRain) & DMVPN by HotTuna · · Score: 1

      Beautiful! Thank you...

  80. Re:WTF? by forkazoo · · Score: 2, Insightful

    Others have already given him the best solution for his case - DVDs. Overnight them, and he is done. Latency may be a bit much, but not that much more than doing it over DSL or dialup.

    I don't like the DVD option. If it was a matter of sending out to "the other site," that'd be one thing. But, if you need to burn hundreds of DVD's for all the locations it suddenly becomes practically a full time job that could be replaced with a shell script and the WAN. I mean, 300 stores, assuming 15 minutes per DVD (including everything -- verify the data, put it in the envelope, print the envelope label, take it to the mail room, etc.) makes for almost 80 hours (about two work weeks!) of work. If your data needs grow to where you need two DVD's, or you add more remote locations, then it literally becomes a matter of a full month of work to get each month's backups out.

    My inclination would be to not bother with RSS, and just sftp the torrent to each remote location as a push. But, that's a minor matter of which technology you happen to be more familiar with. (If he can implement the RSS plan faster than it takes him to look up sftp command line switches, then more power to him -- I'm certainly the other way around.) But, somebody posted some information about dsync which seems even better than that - bit torrent style peer sharing, and rsync style efficient replication. All as one tool. Minimizes the needed upload from the central site from (4 GB * number of stores) every month to just (1*changed data). I truly can't imagine DVD's being better.

  81. Miro by Anonymous Coward · · Score: 0

    Miro (formerly known as Democracy player) is multi platform, has a rss feed reader and a bittorrent client built in.
    It's multiplatform and being open source I bet it can be run as a daemon.

  82. Already included! by Hurricane78 · · Score: 1

    Doesn't a BitTorrent folder already allow adding additional stuff later?

    I would recommend making a small modification of an existing open source torrent client:
    Let the download never stop. Make it look for now parts, updates to downloaded parts (via sha1), and new files in the directory structure of the torrent until the end of time.

    That way you have an instant error-resistant peer-to-peer backup and replication service that is as easy to use, as copying (or linking) the files into the right folder.

    --
    Any sufficiently advanced intelligence is indistinguishable from stupidity.
    1. Re:Already included! by Archon-X · · Score: 1

      ..and a fantastic way to seed all sorts of nasties into an originally clean download.

  83. Re:Snail-mail USB sticks by JamesTRexx · · Score: 2, Informative

    If it's between Windows servers, you can try DFS (although I haven't seen it really do one-way replication) or just use robocopy.
    We use both to replicate data between windows servers internally and on external sites.

    --
    home
  84. Re:WTF? by the_B0fh · · Score: 1

    Did you actually read what the OP wrote? IPsec to home office. NO PUBLIC INTERNET. NOTHING IN THE BITTORRENT SPEC WILL HELP because all the bittorrent traffic *still has to come home* to go back out.

    The easiest way is just to script a push out to the individual stores.

    Explain to me how bit torrent is going to help his home office wan traffic congestion?!

  85. Perl! by Anonymous Coward · · Score: 0

    Just write up a couple perl scripts. one to send data, and one to sit on the client machines constantly monitoring a port. fast and easy.

  86. Potential solution by jimmypw · · Score: 1

    After you have set up the infrastructure as in rules and a torrent server what you could do is set up rtorrent at each site to watch a directory for torrents then simply scp the latest torrent to all sites. Rtorrent will grab this and start downloading it. This leaves the issue though of potentially purging old files but thats for another topic.

    Sorry if somones already posted this solution i dont have time to read all of the replys.

  87. Re:WTF? by darkpixel2k · · Score: 1

    Please, think of the PFYs. His DR fileset is only 4Gigs. My pr0n is bigger than that. ASCII/text pr0n!

    Others have already given him the best solution for his case - DVDs. Overnight them, and he is done. Latency may be a bit much, but not that much more than doing it over DSL or dialup.

    Now, lets go back to discussing OT stuff.

    I don't know about you--but I don't trust my DVDs in the hands of UPS, FedEx, or the USPS. Especially if they have customer data or credit cards. Yeah, I know--encrypt. And if there's a problem decrypting the data in the DVD? Ship another DVD? The latency is outrageous. It could take weeks to get a successful backup, encryption, shipment, decryption, and verification. The network is faster.

    --
    There's no place like ::1 (I've completed my transition to IPv6)
  88. Re:better approach by spazdor · · Score: 1
    --
    DRM: Terminator crops for your mind!
  89. But I thought the Enterprise used by unassimilatible · · Score: 3, Funny

    subspace for its communication needs.

    I'm confused.

    --
    Slashdot "libertarians": Small government for me, big government for those I disagree with. -1, I disagree with you
  90. Re:Snail-mail USB sticks by supernova_hq · · Score: 1

    Overlooking the fact that mailing potentially secret confidential files with the postal service is a bad idea, why use USB sticks?

    If you were to use the postal service (which I highly discourage), why not use DVD's. They are over 4GB, WAY cheaper and you don't have to worry about them getting erased.

    Also, you could set up the server to have a "Read-Only" DVD drive, so no-one could use the new hardware to DOWNLOAD files OFF the server.

  91. Comment removed by account_deleted · · Score: 3, Interesting

    Comment removed based on user account deletion

  92. Multicast file carousel by Dr+J.+keeps+the+nerd · · Score: 1

    If you own your network enough, you could consider multicasting. udpcast is one tool for doing this, or you could look into implementing a file carousel. If you don't have the network support for multicast, then this won't be very helpful.

  93. Re:Snail-mail USB sticks by darkpixel2k · · Score: 1

    If it's between Windows servers, you can try DFS (although I haven't seen it really do one-way replication) or just use robocopy. We use both to replicate data between windows servers internally and on external sites.

    I use it to replicate some data between sites, and a few one-way copies for backups, but it's a horrid system. No easy way to see what's going on or control the progress. I can tweak rsync into next tuesday and figure out what's going on like nothing else, but windows DFS and DFSR (or whatever they call the new stuff in R2) is buggy and difficult to troubleshoot or fix.

    --
    There's no place like ::1 (I've completed my transition to IPv6)
  94. several possiblities by ico2 · · Score: 1

    Well, there are a few torrent clients that can be run from the command line, just install one on each machine with a shell script/batch file to do the work.

    You could also consider writing something simple using split and wget or whatever.

  95. Re:Yes yes yes!!! by Anonymous Coward · · Score: 0

    You guys kept reading after the first 5 words?

  96. No by Weezul · · Score: 1

    the .torrent file includes the hash for each part of the file, so your client won't complete the download if the hashs don't match.

    --
    The Christian religion has been and still is the principal enemy of moral progress in the world. -- Bertrand Russell
  97. ruby torrent phptorrent ssh tunnel yeah! by Anonymous Coward · · Score: 0

    Our company has a constantly changing source. Sometimes files are just moved about a little.

    I started looking into bittorrent for keeping our vendor in sync but the IT guys ended up home brewing it for a variety of reasons.

    Our main office has a slow internet connection, and we were driving a hard disk up to our datacenter for it's high speed internet, but we needed some files uploaded as soon as possible, and we didn't want to duplicate transfers.

    So The idea was to have multiple seeds for our vendor, and then use the seeds for our off-site backup.

    When the file system changing every few minutes starting and stopping doesn't work too well..

    We would have had to hack it a ton, and we didn't need all of the features, (we wanted all the features, but basic needs came first)

  98. APT by Anonymous Coward · · Score: 0

    Debian's "apt" package managing utility has some new torrent support, as well it's long established version, caching, etc. capabilities. You could possibly (depending on the format of your data) distribute it in .dpkg packages via apt.

  99. Re:WTF? by b0bby · · Score: 1

    That was my thought too - if all the stores are simply VPN'd to the HQ, then through the HQ is the only route so you're not going to gain anything. You'd have to have store to store VPNs running as well to get any benefit; at that point you might be just as well off doing the distribution tree someone suggested above.

  100. Congressnet by Ostracus · · Score: 4, Funny

    Hey I resent that. None of my DVDs have ever taken a bribe.

    --
    Shai Schticks:"You don't make peace with friends, you make peace with enemies"
    1. Re:Congressnet by StikyPad · · Score: 1

      Yeah, none...that you know of.

      <shifty grin>

  101. Re:WTF? by the_B0fh · · Score: 1

    Psst - I dunno if you know, but people can sniff your network traffic! Really, fer real! Mum's the word, OK!

    This is DR, folks, it ain't that hard.

  102. Re:WTF? by the_B0fh · · Score: 1

    That is to say - the encrypted media can be verified prior to shipping, perhaps even with the aid of a script.

  103. Re:BitTorrent not efficient for this scenario by iammani · · Score: 4, Informative

    uTorrent supports this. It is called Initial Seeding. And it does exactly what your script intended.

  104. Re:WTF? by nick0909 · · Score: 1

    I thought about doing this same thing about a year ago. After actually thinking about it I realized that same thing; with the hub-and-spoke network model of all branches connected back to one HQ you are going to make things worse using BT and having every client trying to receive data from every other client. For my company, we just moved to an MPLS mesh network, so it might be time to revisit the scenario.

  105. Old School Solution by swabeui · · Score: 1

    *drum roll* .... FTP! With a bit of scripting and command line Winzip you can transfer everything automatically. Get a $5 a month unlimited bandwidth hosting account and have your clients check for updates every day. The new stuff is downloaded automatically and you can have the script even email you when the transfer is complete (with details of the CRC and file size).

  106. snakebite by jjbarrows · · Score: 1

    snakebite from actlab.tv should be a good solution - if it works, i haven't got it to work yet. it watches a folder for normal files, creates .torrent files and lists them on a tracker or html page ... not sure about automated downloading of them, i was going to send out an RSS feed to the clients (each only needed a subset of the total file pool anyway)

  107. Re:WTF? by darkpixel2k · · Score: 1

    Psst - I dunno if you know, but people can sniff your network traffic! Really, fer real! Mum's the word, OK!

    This is DR, folks, it ain't that hard.

    Yeah, that's why you encrypt your traffic.

    And before you say 'encrypt the DVD', if there's an error, a scratch, or whatever, the DVD is somewhat worthless. A network, however has the ability to recognize the error and retransmit... It's a lot faster than your general package handlers...

    Not to mention, I'd hate to be in the situation of explaining to my boss that the file he needs restored immediately will be here in about 5 days--after all, we shipped UPS ground.

    --
    There's no place like ::1 (I've completed my transition to IPv6)
  108. Re:WTF? by darkpixel2k · · Score: 1

    That is to say - the encrypted media can be verified prior to shipping, perhaps even with the aid of a script.

    Perhaps--but backups via the network can be entirely automated.

    Via a package delivery service, someone has to verify the media, address and pack, arrange for a pickup, hand-off the media, do something with the tracking number (maybe email it to me), I have to do something with the tracking number, someone has to receive the package and sign for it, someone has to verify the tracking number to make sure shipments of customer data haven't been lost, unpack the media, examine it for defects, insert it into the drive and make sure it came out on the other end in the same condition as it went in, and then toss it in the archive pile, and then finally after some amount of time--it must be destroyed.

    The network is easier.

    --
    There's no place like ::1 (I've completed my transition to IPv6)
  109. Re:SurgePlus Offsite File Synchronization. by Blowit · · Score: 1

    How could this be even considered Offtopic let alone spam if I am currently using such a product and does EXACTLY what this guy wants it to do?

    Guess if you have not tried it = SPAM.

    Either way, his loss for not trying it.

    --
    *Headline News* censorship shuts down the Internet! More at 6PM!
  110. Re:SurgePlus Offsite File Synchronization. by Blowit · · Score: 1

    OHH NOW I GET IT...
    Cause I said it was a MAIL Server...
    Well yes it is but it has a FILE STORE section that meets the needs of the original writer of this article.

    --
    *Headline News* censorship shuts down the Internet! More at 6PM!
  111. bitsadmin by professorfalcon · · Score: 1

    bitsadmin, to create jobs for the Background Intelligent Transfer Service (BITS) in Windows.

  112. No Unix admins here? (wasRe:Different torrent cli) by sowth · · Score: 1

    Maybe I haven't done enough work in the "enterprise", but wouldn't a script in a cron job be more appropriate here? Program it to check for a new .torrent file every day (or an appropriate frequency), and when new, start bittorent.

    Off the top of my lame head, I can think of at least two easy ways to check for new torrents. The easiest would be to just download the previous torrent file, and cmp it with the old one.

    The second would be to write a python script which keeps track of the etag or modification time of your .torrent update. It is easy, just read Chapter 11 of Dive into Python. Section 11.6 appears to have what you want.

  113. Re:WTF? by the_B0fh · · Score: 1

    Gee, you hire idiot savants or what? Much of the things you bitch about in the other post about manually tracking tracking numbers, etc, can all be automated. About the only thing that needs to happen is to slap it into an envelope and send it off. And if your company is cheap enough to use ground for shipping, then you certainly can't afford to upgrade the network so that you can send DR files over it.

    In case you forgot, the OP's issue *is* the network, and he wants some magic pixie dust to get more bandwidth. Ain't gonna happen. He can either:

    1) upgrade network
    2) redesign network
    3) suck it up and keep moaning and bitching about network
    4) use something else, such as a DVD.

    Your "hey, lets keep using the network" is keeping him at option 3, and, seriously, I don't want to hear him moaning and bitching.

  114. Re:BitTorrent not efficient for this scenario by Anonymous Coward · · Score: 0

    At first glance this would be easily implemented via BITS and local squid proxy server.

  115. netapp by Shotgun · · Score: 2, Insightful

    What you really want is a solution like Netapp file servers. It will distribute the files at the file system block level, updating only the blocks that have changed. You install a filer at your central office, then have multiple mirrors of that one at the various field offices. All the PCs get their boot image off the network file server (the local one). With one update, you can upgrade every PC in the entire company.

    --
    Aah, change is good. -- Rafiki
    Yeah, but it ain't easy. -- Simba
  116. Re:Snail-mail USB sticks by guruevi · · Score: 1

    I don't know why anybody hasn't brought up rdiff-backup yet. It works great, the initial backup takes a bit but it's plenty fast, it only transfers the 'changed' bits and leaves you with a working mirror everywhere and an incremental backup on each destination. It uses the rsync library and I have been using it to back up ~8 TB (usually around 500 GB of change) on a weekly basis.

    --
    Custom electronics and digital signage for your business: www.evcircuits.com
  117. DFS by Anonymous Coward · · Score: 0

    DFS is your friend.

  118. Source of Replication by Anonymous Coward · · Score: 0

    This really depends on:
    * If the sites all use the same 4G DR file
    * How many sites you have

    For instance, if you had 10 sites that all used the same file you would seed from the source site and other sites would join the p2p download as they join the share. Let's say that you base it on time zones. And you have 2 sites start the download at time A, another 3 join at time B, and the final 4 (one is "hosting" the seeded file) join at time C. A will download directly from your hosting server, B will be able to download from the hosting server and the A servers, C can join the fun and download from any of the sites.

    From a bandwith perspective, the hosting site will have the entire contents downloaded from it (uplink), the A sites will most likely get the next largest traffic, followed by the B sites and other C sites. (I realize that you cannot guarentee how much is downloaded from where). So, you have to be mindfull of link speeds, upload caps / limits, etc per site.

    If you do not need the same 4G file at each site; it may just be easier to use a DVD and ship it off site.

    Just my 2 cents.

  119. uTorrent remote interface by Anonymous Coward · · Score: 0

    uTorrent has a network interface built-in that I use to add torrents to my dad's machine from my machine, the file downloads on his server and is then available on our home network. If you have uTorrent at all these disparate sites their interface can be made available to you where ever you are, the interface can also be locked down to only be available to a specific ip address, etc.

  120. Re:Yes yes yes!!! by Anonymous Coward · · Score: 0

    You missed a perfect opportunity to say "news for nads" there.

  121. Get Drop Box by planckscale · · Score: 1

    I've found that GetDropBox.com is a great tool for replicating files across machines. It has a 2gb "Free" version but I'm wondering if there is a paid service for more space? Only those files that are changed are replicated, very bandwidth friendly, etc.

    --
    Namaste
  122. Re:Yes yes yes!!! by po134 · · Score: 1

    Stories ain't illegal as far as I can recall, it is only if you take pictures/video ;). They have to have the age of the sexual maturity, in canada it is now 16 so yes this would be illegal to talk about it, although the act itself is not illegal as both party have less than 3 years of age difference.

    anyway this is irrelevant :)

  123. Re:Yes yes yes!!! by Bastard+of+Subhumani · · Score: 1

    this would be illegal to talk about it, although the act itself is not illegal

    That would make it rather difficult to bring a prosecution. Or are public prosecutors/district attorneys chosen on their miming ability where you come from?

    Hmm. Not a nice image. Forget I said that.

    --
    Only three things are certain; death, taxes, and apocryphal quotations - Ben Franklin.
  124. 2 prjcts no one has mentioned yet-konspire2b/zsync by Anonymous Coward · · Score: 0

    konspire2b - http://konspire.sourceforge.net/
    _claimed_ to be faster for distribution than BT at the time. Maybe that is not true anymore, but I always thought konspire2b looked interesting for intranet stuff.

    zsync - http://zsync.moria.org.uk/ zsync precomputes the checksums for rsync so it moves the checksum load to the client side. It also claims to have a way to deal with compressed files efficiently. Again, looks interesting for intranet distribution.

    Now if we could combine some of the features of konspire2b, zsync and dsyncwe'd have the ultimate file distribution system.

  125. Lots of fallacious assumptions being made by Anonymous Coward · · Score: 0

    First understand the problem.

    Does the "main" site have 4G of data that needs to be protected for recovery in the case of a disaster? If so, just about every post before this one will produce a solution that will not work. For DR, you should be designing a system that recovers fast. Unless you have an incredible pipe, there is no way you are going to pull 4G of data from your remote sites in a business-acceptable amount of time. Chances are you will probably go with a tape system or similar architecture.

    If you're just distributing 4G of data from a "master" to many other locations, the rsync or BT ideas will work, but don't mistake this as being a DR solution.

    I would be money that your CIO had some kid relative who is just entering the IT world make the suggestion. No-one who does any real DR planning would seriously consider what most of this post & thread are about.

    -a *real* professional DR planner

  126. Groove! by Anonymous Coward · · Score: 0

    Groove (http://groove.net) does exactly what you want. It's already in use for file distribution in the private and public sectors; coast guards, hurricane warning systems, as well as many large-scale businesses use it successfully.

  127. Solaris at Microsoft by fm6 · · Score: 1

    I got curious about this post, which has already attracted some attention in the blogosphere. ("Solaris at Microsoft?") Some googling reveals that WebTV was originally developed on BSDi, then moved to Solaris. At that point (1997) they were acquired by Microsoft.

    I seem to recall read that many of Microsoft's Unix-based acquisitions have had trouble moving to NT, despite the obvious pressure to do so. So there were probably Solaris servers at Microsoft's Mountain View campus (where WebTV is located) for some years. But it's been 11 years, and I'm sure those Solaris servers are long gone. You'll notice that J refers to them in the past tense.

  128. File distribution management by brrgo · · Score: 1

    I wrote an application that I implemented on 20,000+ machines that replicated files over 256kb WAN lines using UDP, because UDP is connectionless is at a lower priority than TCP.
    It sent 1 kB/sec for a total of about 8GB a day. Of course it was no easy task (2yrs development) to create a protocol that can detect and recover missing parts of a file then reconstruct the file when it is complete. As well as replicating to machines that were off when they were turned back on. It used MS SQL server to maintain the file lists, sends and WAN connections. It ran for a couple of years.
    My work canned the whole thing in favor of Microsoft SMS, because it has web reports I guess. Not many company have as many branches as we do (>1,000)

  129. Not too difficult./ by spxZA · · Score: 1

    I am implementing something similar using trackerless torrents, DHT and LSD from libtorrent. All you need to do is distribute the torrent hashes to the clients, and libtorrent will take care of the rest.