Slashdot Mirror


Easy, Reliable Distributed Storage and Backup?

RichiH writes "Most of you are the free IT staff of friends and family, just as I am. One of my largest headaches is backing up their data. What I am looking for allows for off-site storage on multiple server machines running Linux, has Linux & Windows clients that Just Work and require zero everyday effort (although a large-ish effort to set them up is just fine), allows for granular access control, is versioned and will, ideally, allow me to grab data automagically (think photo pool for your family where your mother, sister, etc., share each other's photos). This is something I've been trying to find for years, but I've never seen anything even closely resembling what I want. With the Wall Street Journal handing out its Technology Innovation Award to Cleversafe recently, I was once again reminded of this particular itch which needs scratching. Before I deploy it, I want to ask the Slashdot community for its opinion on that piece of software, and on potential alternatives. How do you solve this problem?"

222 comments

  1. Git... by paul.schulz · · Score: 1

    Git and the git-web web based tool are very useful for maintaining a tree of archived data, and browsing it.

    1. Re:Git... by quinks · · Score: 3, Informative

      Yes, because storing thousands of jpg images and other binary data is exactly what git was intended for. Get people to store their data on Samba fileservers. Set up home directories in their name as well as shared directories accessible by everybody or Samba groups. Use ACL if you need to. To backup, use rsync and OpenSSH, write a few batch scripts and hey - presto! Instant solution that'll even work with cheapo webhosts and your home linux box as backup servers. Versioning can be done for any amount of time by using rsync's backup feature, and you can allow people to browse old versions within Windows Explorer connected to a Samba share in that way.

    2. Re:Git... by tsa · · Score: 1

      Yep. I have something similar at home too. Works well.

      --

      -- Cheers!

    3. Re:Git... by RichiH · · Score: 3, Insightful

      Being easy to use (as in, not more than 3-5 mouse clicks, total) is one of my main concerns. Git definitely fails in this regard.

    4. Re:Git... by marcello_dl · · Score: 2, Interesting

      actually, for my own digital assets repo - see signature - i see two features of git which might be handy, atomicity of commits and hashes which avoid storing duplicates. git has "plumbing" commands which might help. Still haven't explored it.

      BTW if you have enough band you could do away with a doxroom instance on a host, don't forget to backup files and db and remember it's alpha quality.

      --
      ---- MISSING MISCELLANEOUS DATA SEGMENT --- [sigdash] trolololol
    5. Re:Git... by Anonymous Coward · · Score: 0

      Oh well, shouldn't be to difficult to place the git commands into a file and run that with a single click.

    6. Re:Git... by Anonymous Coward · · Score: 0

      I've been considering doing this for a while but I can't find a cheap webhost that offers plenty of storage (say 500GB).

      All the hosts I've looked at offer cheap slow servers with little storage, or expensive fast servers with lots of storage. A cheap single processor server with limited network bandwidth and one big disk would be more than adequate, but I can't find one.

      Does anyone have any recommendations?

    7. Re:Git... by debatem1 · · Score: 2, Interesting

      I'd be happy to write a script that will handle that concern, but somebody else would have to do the UI unless you want it looking like it escaped from Windows 95.

    8. Re:Git... by SanityInAnarchy · · Score: 1

      As I understand it, in normal use, Git never forgets a commit. Is that really what you want for photos?

      The other poster isn't entirely accurate -- I suspect Git does just fine with binary data. It's just that said binary data will stay in the tree forever.

      --
      Don't thank God, thank a doctor!
    9. Re:Git... by RichiH · · Score: 2, Interesting

      If you are being serious..:

      Afaik, Git supports Meta/recursive repos where I have one master repo with many subrepos. Thus, it would be best to have a master repo that contains all other repos. That will make replication easier.

      The only other requirements would be that it adds all files in a given directory to repo foo and pulls repos bar, baz, quux. Preferably, it would happen automagically & regularly with a throttled connection. Requiring them to click a button in a butt-ugly app is fine, as well.
      If Windows had cronjobs or I knew VB, I would do it myself, but..

      _If_ you decide to do something like this, I will definitely give it a try. And it will finally give me a reason to poke Git :)

    10. Re:Git... by pebcak · · Score: 1

      I agree. The only thing I would consider changing is to use rdiff-backup instead of rsync in order to make tracking, and restoring revisions more straightforward. Both utilities would do the job though.

    11. Re:Git... by debatem1 · · Score: 2, Interesting

      I'm perfectly serious. It's a useful app and a pretty easy problem. If you'll email me at CTO@Openmigration.net and let me know more about your specific requirements (number of remote hosts, total archive size, etc) I can start figuring out what the best way to do this is. Also, I'll need to know all of the platforms you're running on (will you need support on cell phones? Xbox?), the level of redundancy you're comfortable with, will you need a web interface, etc.

    12. Re:Git... by mcpkaaos · · Score: 1

      If Windows had cronjobs...

      Windows Scheduled Tasks might do the trick. If not, you can always install Cygwin, which includes cron as a Windows service.

      ...or I knew VB

      Forget VB. Python is your friend.

      --
      It goes from God, to Jerry, to me.
    13. Re:Git... by RichiH · · Score: 1

      Cygwin can do cronjobs? Neat, I will remember that if I ever use Windows, again.

      I would code it in Perl, but I do _not_ want to expose my mom to a CLI interface.

    14. Re:Git... by RichiH · · Score: 1

      Will do, thanks!

      FYI, the navigation buttons on the left of hand side of openmigration.net are black in Konqueror 4.1.2. Works fine with Iceweasel 3.0.1, though.

    15. Re:Git... by pthisis · · Score: 1

      python + wxpython for the GUI (with wxglade as the gui-builder). If you follow the wxwindows standards the UI will have native look and feel on Windows, GNOME, and Mac machines (including more major differences like the menubar being in the window on Windows/GNOME at at the top of the screen on Macs).

      --
      rage, rage against the dying of the light
    16. Re:Git... by ryszard99 · · Score: 2, Insightful

      I would code it in Perl, but I do _not_ want to expose my mom to a CLI interface.

      why not make a CGI and use the web as a GUI?! :-)

      Even if your parents are non techy, they've probably had some experience browsing the web, so a web GUI shouldnt be too foreign to them.

      couple that with something like CGI::Application and you've got a great framework in which to build and expand what you/your parents want/need.

      --
      -- $_='ab-bc ratvarre';tr"'a-z'"'n-za-m'";print
    17. Re:Git... by debatem1 · · Score: 1

      Thanks for the info, I'll have somebody fix that right away.

    18. Re:Git... by marcello_dl · · Score: 1

      the demo of doxroom is on bluehost which offers unlimited space, shell and so on. vps are more flexible though

      --
      ---- MISSING MISCELLANEOUS DATA SEGMENT --- [sigdash] trolololol
    19. Re:Git... by darkpixel2k · · Score: 1

      To backup, use rsync and OpenSSH, write a few batch scripts and hey - presto! Instant solution

      Instant? Meh. apt-get install backuppc.

      Most home firewalls support either the true DDNS or that dynDNS crap. Give 'em a hostname or get a static IP for the people you need backed up. Install backuppc on one of your machines and remotely connect in and back them up.

      I have 10 machines that get backed up nightly across my comcast connection. (That's part of the reason I'm so near that damn 250 MB cap).

      I'm sure if my friends and family cared, I could enable remote access through my firewall so they could restore when needed--but usually when it gets to that point, they call me anyways.

      If a box dies, I just drag it over to my place, hook it up to my network, replaced the dead drive, and restore it over my LAN.

      --
      There's no place like ::1 (I've completed my transition to IPv6)
  2. Mozy? Duplicity? by Anonymous Coward · · Score: 0

    How about Mozy? I really like Duplicity, but it's probably not for these users if they're asking you for help.

  3. The Realms of Science Fiction by warewolfsmith · · Score: 0, Flamebait

    What you are seeking is still in the realms of science fiction, and would probably cost a bomb as well. Good luck with your search, please let us all know if you find this digital nirvana.

    1. Re:The Realms of Science Fiction by Anonymous Coward · · Score: 0

      An amazon S3 backup tool could work for him. He could even roll his own with rsync

  4. Use the pr0n method! by dogganos · · Score: 4, Funny

    Rename your data to 'Barely legal college girls having first time sex - XXX Vol1/256.r001' and use p2p to spread them all over the world!

    1. Re:Use the pr0n method! by estarriol · · Score: 0, Offtopic

      Mod parent up! Clever + funny = win.

    2. Re:Use the pr0n method! by Anonymous Coward · · Score: 0

      www.mozy.com

    3. Re:Use the pr0n method! by electrictroy · · Score: 1

      What an excellent way to backup my photo collection! I'll get to work on it right away.

      --
      The government is not your daddy. Its purpose is not to raid middle-class neighbors' wallets and give it to you.
    4. Re:Use the pr0n method! by kdemetter · · Score: 3, Funny

      It's funny , but it might be practical in way.

      It's possible to put data in images , so why not in a video.

      You just take something highly demanded ( could be porn , could be a movie ) , and punt your data in it , well encrypted and without anyone knowing it.

      The file gets shared because of the content people want to see , and if you ever lose your data you just lookup the file via P2P , and you have it back.

    5. Re:Use the pr0n method! by madnis · · Score: 3, Funny

      What if all your data is just porn anyway?

    6. Re:Use the pr0n method! by debatem1 · · Score: 1

      Not a bad idea, actually, using the p2p grids for backup. Wonder if that's possible.

    7. Re:Use the pr0n method! by linear+a · · Score: 1

      Ugh. Too recursive.

    8. Re:Use the pr0n method! by Anonymous Coward · · Score: 0

      I download it, get pissed that it is not porn, upload real porn with the same name and file size. Best of luck.

  5. I can tell you how I solve it in a business by jimicus · · Score: 5, Informative

    I can tell you how I solve it in a business context, but whether or not it could be scaled down to personal I'm not sure.

    The problem: 2 sites each with 70-100GB of data needs offsite backup with similar criteria to your own. Bandwidth available to these sites is 2-4Mbps. The only OS involved is Linux, though I'm sure Windows could be shoehorned in somehow. A third site which has a tape streamer and someone to take tapes offsite is available. Data protection legislation means that storing it with a hosted service is illegal unless I encrypt it myself before sending it offsite - I'm only aware of one tool which claims to be able to do this and still send data as a binary delta (it uses the rsync library) and that tool is still not particularly common in Linux distributions and not very widely used. I'm nervous of trusting my backups to a tool that isn't on heavy use, particularly if strong encryption is being employed.

    The Solution: A server in the third site and some judicious scripting with rsync allows it to mirror the data in the other two sites. The first sync is fairly painful, of course, but provided you don't have too much data regularly changing subsequent syncs aren't too bad. The server is backed up to tape which provides versioning capability so if someone only realises that they lost a file a week after the fact it can still be restored,

    Initial effort to set up was pretty great but now it's done it JFW and requires no brain power whatsoever to run on a daily basis. I can make the data available over the VPN (of course the access speed will be dog slow) more-or-less immediately and I can make it available at LAN speed by copying it to a hard disk and courier it to the remote office in under 48 hours. A full restore of 100GB across a 2Mbps connection will take at least 4-5 days.

    1. Re:I can tell you how I solve it in a business by maino82 · · Score: 1

      You don't necessarily need to make that first backup painful. Rsync while you've got both servers in the same room over a LAN, and from then on you just have to deal with the delta and don't need to worry so much about bandwidth.

    2. Re:I can tell you how I solve it in a business by PC_Freak · · Score: 3, Interesting

      Data protection legislation means that storing it with a hosted service is illegal unless I encrypt it myself before sending it offsite - I'm only aware of one tool which claims to be able to do this and still send data as a binary delta (it uses the rsync library) and that tool is still not particularly common in Linux distributions and not very widely used. Based on my limited understanding of crypto, when you encrypt data it should turn into pseudo-random noise, so if *any* bits change the whole thing changes (unless you're doing a block-cypher, but if it's chained-block then every portion *after* that will also change). So for large files, this seems like the delta would end up being practically the entire file, wouldn't it?

    3. Re:I can tell you how I solve it in a business by PC_Freak · · Score: 1

      (Re-posting with formatting that'll survive the SAVE process...)

      Data protection legislation means that storing it with a hosted service is illegal unless I encrypt it myself before sending it offsite - I'm only aware of one tool which claims to be able to do this and still send data as a binary delta (it uses the rsync library) and that tool is still not particularly common in Linux distributions and not very widely used.

      Based on my limited understanding of crypto, when you encrypt data it should turn into pseudo-random noise, so if *any* bits change the whole thing changes (unless you're doing a block-cypher, but if it's chained-block then every portion *after* that will also change). So for large files, this seems like the delta would end up being practically the entire file, wouldn't it?

    4. Re:I can tell you how I solve it in a business by jimicus · · Score: 1

      You don't necessarily need to make that first backup painful. Rsync while you've got both servers in the same room over a LAN, and from then on you just have to deal with the delta and don't need to worry so much about bandwidth.

      The source server was in another country and in daily use, the destination server was bolted into a cabinet, weighed about 40kg and also in daily use.

      (Nice idea though).

    5. Re:I can tell you how I solve it in a business by jimicus · · Score: 1

      Based on my limited understanding of crypto, when you encrypt data it should turn into pseudo-random noise, so if *any* bits change the whole thing changes (unless you're doing a block-cypher, but if it's chained-block then every portion *after* that will also change). So for large files, this seems like the delta would end up being practically the entire file, wouldn't it?

      I'm not sure how it works, but I can think of a few ways you could work around that in theory.

      The most obvious is to encrypt every file individually and then ship a tar of the whole lot up. Though for best results, you'd need to download each file, decrypt it, perform a binary delta against the source file, encrypt the delta and ship that up.

      End of the day though, it sounded rather too complicated for my liking. I get the benefit of offsite backups stored with someone like Amazon but I'm using a tool which may or may not work, isn't in particularly common use (so hasn't had the exposure to many eyes that rsync has) and goes to great effort to ensure that the data is very hard to decrypt - one cockup and the person who's going to be locked out of the data is me.

    6. Re:I can tell you how I solve it in a business by Anonymous Coward · · Score: 0

      You can also sneaker-net a temporary disk between sites. I've used my laptop via jumbo-jet as a carrier-pigeon to seed my own trans-pacific backups. Two LAN transfers with a 24 hour transit delay between them. Then re-run rsync over Internet to fix up the small stuff that gets out of sync.

      Rsync is my favorite sledge-hammer. Also easier than jigdo at creating Fedora ISOs... just cat all the RPMs from an unpacked repo tree into one big file, and rsync from a mirror site to convert the file into a DVD ISO image! It's amazing how well it sorts out and reuses the contents in the file.

    7. Re:I can tell you how I solve it in a business by mysidia · · Score: 1

      What tool are you aware of?

      You don't have to backup as one file. You can use multiple independently encrypted segments.

      I.E. Divide every megabyte of plaintext into a chunk, encrypt the chunk independently.

      If nothing in that 'chunk' changes, then you avoid re-sending that chunk.

      If part of that chunk does change, then you apply the rsync algorithm normally.

      Or perhaps you devise an algorithm to provide the information compute the early change and automatically derive the changes to the rest of the blocks that occur on the remote side.

    8. Re:I can tell you how I solve it in a business by Anonymous Coward · · Score: 0

      DFS

      Wins.

      We have about 30 office. All backing up user and share folders to central data center.

      Central data center has tape library.

      Encrypts data, puts on tape. Done.

      No muss. No fuss.

      First DFS sync is painful. After that, it's bit-level changes only.

    9. Re:I can tell you how I solve it in a business by exekewtable · · Score: 1

      You would be better off switching to rdiff-backup after the initial sync. Then you get versions, a web interface for restores, a nagios plugin to check its working for you (instead of reading logs!). Fully managed versioned backups. Works on pretty much all platforms too. http://www.rdiff-backup.org

  6. Two questions by Geoffreyerffoeg · · Score: 3, Informative

    You're asking two questions. The first is that you want backup, so that all their data just gets thrown somewhere and they lose the last few days' work their hard drive dies. You don't even necessarily want this on the network; just back up to a DVD-R every so often, and take every month's DVD-R offsite (a friend's house, a bank's vault, whatever). There's lots of backup software for this. Most can do fancy stuff like incremental backups. You can probably find something opensource you can host for your friends and family on a decently-available server.

    The second question is networked file storage, where you don't care about automatically archiving files, but you do want frequent access and a good UI. For this I recommend something like Dropbox, which has good support for OS integration and a web interface.

    1. Re:Two questions by RichiH · · Score: 3, Informative

      Actually, I was asking one question. I need both rolled into one. And requiring me to be on site is not feasible. Ideally, they don't even notice that Backups are being made.

    2. Re:Two questions by Geoffreyerffoeg · · Score: 5, Insightful

      If you try to roll backup and distributed file-storage into the same application, you're not going to get anything useful. Aunt Sally is going to want every single file including her OS and her tax returns backed up, in case her hard drive dies, but only wants the photos -- and only some of the photos, actually -- to be visible to Grandma Suzie. If Suzie can see every file on Sally's computer, and the entire history of each file, she's not going to be able to browse the photos in a way that's at all intuitive.

      And worse yet, if Sally wants to send out links to her photos to fifteen of her friends by e-mail, she needs some sort of interface to mark parts of her backup as world-readable but the rest (like her passwords and e-mail) not. If the network backup program even lets you do this, it won't give Sally a UI that she'll be able to figure out.

      You can certainly get network backup services: Mozy was mentioned in an earlier comment.

      If you rethink your requirements in terms of your goals, you'll probably find that both rolled into one isn't what you want, and not just because a product doesn't exist at the moment that does that — a product that does that can't possibly have a good UI. If they shouldn't notice or care about how backups are being made, how are they going to figure out how to share photos with each other?

    3. Re:Two questions by rnswebx · · Score: 1

      I wish I had mod points. Well said.

    4. Re:Two questions by dbIII · · Score: 1
      DVDs are not paticularly reliable. I've seen dozens that can't be read. Tape is pretty expensive but I've seen quite a few tapes from the 1980s that can be read (if somewhat expensively).

      Forget about portable hard drives for anything other than temporary storage. Enough dust will kill them off if nothing else.

    5. Re:Two questions by RichiH · · Score: 1

      Well, this ideal app would treat the directory with private stuff differently than the the one containing photos.

      Also, all the end users see is a local directory where they can put stuff into. Nothing more, nothing less. All complexity would be hidden and only visible to me.

    6. Re:Two questions by Angostura · · Score: 1

      You need Mozy for the backup. You need Dropbox for the sharing. Unless you simply want to alias all of their files and folders into the Dropbox,

    7. Re:Two questions by Znork · · Score: 1

      Not sure if it suits your situation, but you could take a look at drbd. Current stable versions only support 2 node mirroring, but future versions are planned with further nodes.

      Personally I've used it for shared-device semantics for backing storage on Xen VM's (and prefer it over my previous iSCSI-in-VM config). It is also, however, eminently suitable for remote-site mirroring of block devices. It isn't too difficult to build a stack with backing devices remote-mirrored over drbd, shared out over iscsi in the local location and layered with whatever you want on top of that. Essentially the equivalent of what some storage vendors do with remote-mirrored SAN stuff.

      And in my experience this far (after about a year of running it) drbd is stable regardless of what I throw at it.

    8. Re:Two questions by ishobo · · Score: 1

      Tape is not that expensive. A single LTO4 drive costs ~$3500. The cassette is $0.10 per GB. The big money will be if you invest in a library.

      --
      Slashdot - The great and glorious cluster fuck of Internet wisdom.
    9. Re:Two questions by Anonymous Coward · · Score: 0

      I do this with two scripts, GnuPG, and dar. The only intervention needed is pressing a button to burn the monthly CD-Rs with PAR2 volumes and driving one copy to a bank vault, sending one to family, and putting another in a CD binder. Local, near field, and remote backups, all for the cost of three CD-Rs.

    10. Re:Two questions by Team503 · · Score: 1

      $3500 for home backup? Who are you, Jobs?

    11. Re:Two questions by ishobo · · Score: 1

      Some people spend $500+ for a video card, or two, for a home system. How valuable is your data? Archives are very important to me. The most expensive part will be the tape unit whether it is DAT, LTO, AIT, or VXA. My home server is using a DDS3, which cost me $600 in 2004. The DDS3 replaced a DDS1 I had been using since 1994. You could go with DAT72 ($600) or DAT160 ($900). Far cheaper for the DAT drive than LTO4, but you will spend more per GB on each cassette: ~$0.50 for DAT72 or DAT160 vs ~$0.10 for LTO4.

      Do not forget the fire safe if you have no off-site storage.

      --
      Slashdot - The great and glorious cluster fuck of Internet wisdom.
  7. Allmydata tahoe by Anonymous Coward · · Score: 1, Interesting

    Have a look at http://allmydata.org/trac/tahoe which might provide what you're looking while being way simpler to setup than Cleversafe.

  8. Dropbox by operator_error · · Score: 5, Informative

    Ars technica did a nice review of Dropbox, titled, "How Dropbox ended my search for seamless sync on Linux" (but it works on OSX 7 Windows too) http://arstechnica.com/news.ars/post/20080914-how-dropbox-ended-my-search-for-seamless-sync-on-linux.html

    1. Re:Dropbox by Firehed · · Score: 4, Insightful

      Dropbox is absolutely fantastic as a sync tool (and also has some degree of versioning), but there's no practical way as of yet to make it into a full-system backup. When 'watch folders' show up, it'll get a lot closer, but like any web-based system, it becomes impractically slow for anyone dealing with lot of data. Even digital snapshots add up quickly with the resolution of the point-and-shoot cameras, never mind if there's an actual photographer shooting RAW.

      --
      How are sites slashdotted when nobody reads TFAs?
    2. Re:Dropbox by Foodie · · Score: 0

      Sounds like a combination of HP Mediasmart Server + Dropbox is a good solution. Use the MSS/WHS to perform daily backups of the clients automatically, provide user accounts and storage space, has web share and DLNA capabilities. Then add Dropbox to backup that MSS to another remote box. This way, you get daily backups, and can do full system restore, while you can also get daily or weekly backups of the MSS into a central location.

    3. Re:Dropbox by Anonymous Coward · · Score: 0

      How much does it cost? The one thing dropbox's pricing page doesn't mention is the price.

    4. Re:Dropbox by Klaus_1250 · · Score: 1

      Seems very interesting, but this disturbs me:

      Dropbox cooperates with government and law enforcement officials and private parties to enforce and comply with the law. We will disclose any information about you to government or law enforcement officials or private parties as we, in our sole discretion, believe necessary or appropriate to respond to claims and legal process (including but not limited to subpoenas), to protect the property and rights of Dropbox or a third party, to protect the safety of the public or any person, or to prevent or stop any activity we may consider to be, or to pose a risk of being, illegal, unethical, inappropriate or legally actionable.

      If I read this correctly, your data is anything but secure or private, as Dropbox can use any arbitrary reason to give your data to any party.

      --
      It only takes one man to change the Wisdom of the Crowd to Tyranny of the Masses.
    5. Re:Dropbox by operator_error · · Score: 1

      from memory (mine) the 1st 2 GBs are free. So far that's all that's available because the firm is still in 'beta'. My understanding is once they come out of beta, they'll offer larger packages for a fee. I'll venture a guess versioning will arrive at some point too.

    6. Re:Dropbox by Fweeky · · Score: 1

      I don't think I'd want any of my backups on a service that clearly has access to my data. All such a service should be able to see is utterly opaque encrypted binary blobs they don't have the key for. Dropbox clearly think that's too hard, and prefer to err on the side of making their implementation easier.

    7. Re:Dropbox by pla · · Score: 1

      Ars technica did a nice review of Dropbox

      I think Dropbox has the right idea, with one glaring flaw - You need to trust your data to a third party (Amazon S3), and that third party needs to continue to exist (and offer the service) for Dropbox to keep working.

      For most purposes, I wouldn't consider that a major problem - I doubt Amazon really cares about the contents of my personal collection of apps to which I'd like to have access anywhere I go, or my family photos, or the contents of my to-do list; and I highly doubt they'll go under any time soon. But as the only option, I'd call that a showstopper.

      If I could run a Dropbox server (by which I mean the coordinating server, not the client-side daemon) on a box at home, I'd use it in a heartbeat. Even if I never actually used it, just having the option makes a world of difference between "Dependent on Amazon" and "viable backup/synchronization solution".

    8. Re:Dropbox by Blakey+Rat · · Score: 1

      If you want backup, use Mozy.com. It's already been around and perfected for many years, it integrates into "Shadow Copy" on Windows and "Time Machine" on OS X, it's cheap and effective. My media PC is backed-up to Mozy, over 250 GB with no problems.

  9. online backup by derekcohen · · Score: 2, Interesting

    what's wrong with getting an account with Connected/Iron Mountain - easy to use intelligent online storage that doesn't cost a lot - saved my bacon many a time

  10. JungleDisk with Amazon S3 Storage by kefa · · Score: 3, Informative

    Have you considered the JungleDisk client that works with the Amazon S3 storage cloud? This has backup clients for Windows, Linux, and Mac and with suitable configuration of 'buckets' would allow you to do most of what you are trying to achieve. Okay so it's a pay-for service (albeit cheap) but it does provide the all important off-siting, strong security/encryption and unlimited capacity.

    1. Re:JungleDisk with Amazon S3 Storage by RichiH · · Score: 1

      I have. But I don't want to put the data in anyone else's hands.

    2. Re:JungleDisk with Amazon S3 Storage by kefa · · Score: 1

      I understand your sentiment but at least with JungleDisk the data is encrypted by the client on *your* machine so Amazon don't have access to any of your data. True, they can still lose it for you, but it really does take the effort/pain out of off-siting. I guess I'm just worried about losing 10+ years (and 400GB+) of music and photos when my house is burgled/flooded/raised to the ground.

    3. Re:JungleDisk with Amazon S3 Storage by crt · · Score: 2, Informative
      From a privacy perspective, Jungle Disk encrypts your data with a key you control prior to upload - no one else can read it. From a security perspective, you can read their Security Whitepaper here, but suffice it to say they take security really seriously.

      As far as redundancy goes, your data gets stored in multiple Amazon datacenters around the country, which provides redundancy and high availability. At the end of the day, it's a far superior solution to anything you can cook up at home.

      Of course there is a small cost involved, but at $0.15/GB it's quite inexpensive for what you are getting.

    4. Re:JungleDisk with Amazon S3 Storage by giostickninja · · Score: 4, Funny

      raised to the ground.

      Wow. How far below ground is your house?

    5. Re:JungleDisk with Amazon S3 Storage by Anonymous Coward · · Score: 0

      Great solution. They pay for the S3 service and JungleDisk application and you set the application up for them.

    6. Re:JungleDisk with Amazon S3 Storage by Anonymous Coward · · Score: 0

      I use jungledisk on multiple systems running linux not only for backup but also to keep my svn versioned data synced on all systems. If I loose my laptop, I don't have to worry of data loss. The best part is that it is very affordable and hence I don't have to beg USC's IT dept every year to allocate more storage space. I pay about 70 cents a month to AWS :-).

  11. wimps by G3ckoG33k · · Score: 4, Funny

    "Only wimps use backup. Real men just upload their important stuff on ftp, and let the rest of the world mirror it."

    God

    1. Re:wimps by pjameson · · Score: 1

      Are we calling Linus God now?

    2. Re:wimps by RoceKiller · · Score: 2, Informative

      I think this is the quote you where looking for:

      "Backups are for wimps. Real men upload their data to an FTP site and have everyone else mirror it." -Linus Torvalds

    3. Re:wimps by Anonymous Coward · · Score: 0

      "Only wimps use backup. Real men just upload their important stuff on ftp, and let the rest of the world mirror it."

      God

      Actually, it was Linus who said that.

    4. Re:wimps by Eil · · Score: 1

      Er, Linus Torvalds is many things, but God is certainly not one of them.

    5. Re:wimps by G3ckoG33k · · Score: 1

      "My name is Linus Torvalds and I am your god."

              * Jokingly introducing himself, at the 1998 Linux Expo in Durham, North Carolina

      http://en.wikiquote.org/wiki/Linus_Torvalds

  12. Update from OP by RichiH · · Score: 2

    I looked at Cleversafe, trying to get through the PR bubblespeak. It seems they are emulating disks, not offering integrated _backup_. As saving from my mom's SD card to a distributed online disk via a DSL line is not feasible, I will most likely need to scratch that idea.

  13. you haven't thought this through by speedtux · · Score: 4, Informative

    Backup isn't the same as sharing. And do you want actual replication or merely fault tolerance to node failure? Actual n-fold replication means you're going to pay n times the amount of money for storage. And why do you insist on one application to do everything?

    My suggestion: set up automatic backups to one of the many backup services on the net. They worry about how to replicate your data, you don't have to. For the same service to support both backup and sharing is hard and it's probably a bad idea. It's much easier if you know that the backup service simply cannot access the contents of any of your files.

    For sharing, use services designed for that: Flickr Pro, Picasa, Google Docs, whatever. They are designed for sharing, they know about users and permissions, and they can only publish what you actually upload to them.

    As for Cleversafe, the idea is as old as forward error correction, but the economics and management never seem to quite work out. And basically, you're getting the same functionality from hosted storage: Amazon, Google, Box.NET, etc. are already figuring out how to keep your data available and secure, and are probably doing a better job than you could do with a homebrew system.

    1. Re:you haven't thought this through by RichiH · · Score: 1

      I did think this through. I want distributed backups with several, for lack of a better word, working copies checked out on different machines.

      I don't need to worry about encrypted backups or anything as all machines involved will be under my control. If any machine is compromised, I lose confidentially, anyway.

      That a homebrew solution would suck is a given. This is why I asked /. for readymade solutions.

    2. Re:you haven't thought this through by Geoffreyerffoeg · · Score: 4, Insightful

      I want distributed backups with several, for lack of a better word, working copies checked out on different machines.

      Aha, now I figured out why we're all misunderstanding you. Those aren't backups. "Backups" to my ears means that you copy the entire contents of your disk or your Documents folder nightly onto tape or some other archival medium, so that in case of hardware failure you have something to restore from. Potentially you also keep prior versions around. The tapes are stored in a corner somewhere because they're never actually accessed except in an emergency, and they're destroyed after a few months.

      What you want isn't backups, since it doesn't make sense for different people to share backups any more than it makes sense for different people to share a single networked hard disk or networked home directory. You just want a distributed file storage system, with automatic syncing / commits.

    3. Re:you haven't thought this through by RichiH · · Score: 1

      Indeed. But as it should support versioning etc, we are back at it being a backup. Not in the sense of 'grab tape, stow it in bank', though.

    4. Re:you haven't thought this through by Anonymous Coward · · Score: 0

      seeing your uid, i can tell you're not a reel geek. real geek cred doesn't come from purchasing a service online, it comes from learning how to set up the same service for yourself, and without having to pay someone else a monthly fee.

      whomever relies on you as a computer guy, i can tell you don't enjoy working with computers and finding new ways to get more functionality.

      yeah, we all only have so much time in the day, and how we like to spend it differs, but TFA was asking about which software might be useful, as long as the only hard part was the initial configuration. something with a steep learning curve that 'just plain works' is okay, in other words he wasn't looking for shortcuts.

      many other posters suggested that while he'd like an all in one package, that really two pieces of software fit the bill better than one, because a nice simple UI that can be taught in seconds for sharing photos etc, isn't synonymous with regular remote backups of data.

    5. Re:you haven't thought this through by RiotingPacifist · · Score: 1

      wait so what you want is concurrent versioning to be dealt with by some sort of system?

      if you don't want user to have to learn about the subversion controls (because it can be a real GIT to use sometimes), then many programs implement similar functionality using plugins.
      e.g
      Web2.0 OpenOffice.org collaboration & document management extension
      OOoSVN

      I would setup an SVN/cvs/git for everything then find extension tools to deal with each use case as the need for transparent svn access becomes apparent

      --
      IranAir Flight 655 never forget!
    6. Re:you haven't thought this through by nikin · · Score: 1

      Something like CrashPlan (http://www.crashplan.com) may be just what you're looking for. There are Home and Business systems, Win, Lin, Mac and Solaris options. The home option allows for backup online to their service or, through a buddy system (read other family members in your case) you can backup to multiple offsite locations (family computers) for free. Basic client is like $60 and that's it if you do the BYOB method. It's fast, it's fully encrypted prior to leaving your computer, it's smart (no duplicates, accessible anywhere, tons of options: scheduling, realtime backup, network and cpu throttling, great compression, storage quotas etc) and they are readying a new rev that will speed up transfers around 400%--according to a recent blog post (http://crashplan.blogspot.com). It's a nobrainer to setup. I've got a small business setup with them for about 30 systems and the server side is just as easy to setup. Their support is free, fast, efficient, effective, and curteous. They don't support bare-metal restore and are clear about that but I don't think you are looking for bm. I have restored several times with great success. Backup sucks for reasons we are all all-too aware of. Obviously, each situation requires a unique solution, which explains the plethora of vendors and methods out there. It's important that each constituent understands the full capability of whatever system you end up implementing, and the limits therein. Good luck.

    7. Re:you haven't thought this through by k8to · · Score: 1

      You want the mooon.

      You want a system that stores data enough times to be usefully deal with hardware failure, and with enough versioning to allow people to usefully recover from errors. So your storage requirements are an order of magnitude larger (at least) than your live data.

      Okay, now you have an order of magnitude more live disks on your network. Now you need a network filesystem that intelligently deals with faults in all cases. Also, I hope you don't have mobile users who leave the network, or temporary network outage situations. In such cases the nodes will either be entirely non-operational or need to maintain private data production that gets synched up later. Resynching involves implementing a local cache concept via a conceptually different datastore technique than the network system, and it involves the network filesystem dealing with split-brain, which is going to be enormously difficult.

      And that's just getting started. You also want performance if you're going to use it for everything, and you also seem to want access control. If you need the access control to be trustworthy when some nodes are under control of users (I assure you, that some of your nodes are), then you need encryption implemented at the store and retrieve point, killing your performance. Or you need to start having different classes of nodes with different trust metrics, where some data can only live on trusted servers, and some data (readable by others) can live anywhere.

      The system, as described, has way way way too much complexity to be feasably implemented in any reasonable amount of time. Keep on dreaming, I'd like it too.

      --
      -josh
    8. Re:you haven't thought this through by Colin+Smith · · Score: 1

      So... Basically... You want a distributed version control system...

      google for "subversion", or "git", or "distributed version control system".

      hth

       

      --
      Deleted
    9. Re:you haven't thought this through by pla · · Score: 1

      What you want isn't backups, since it doesn't make sense for different people to share backups any more than it makes sense for different people to share a single networked hard disk or networked home directory

      Although I agree that the GP has something other than just "backups" in mind, I would still consider the result quite a decent form of backup (moreso even than the notorious "works perfectly until you need to recover" tape archive) - If you have copies available locally, at a remote repository, and at several additional nonlocal users' machines, as well as versioning to allow recovery of previous incarnations of the files, you have an impressively resilient means of recovery in the event of disaster/accidents/hardware failure.

    10. Re:you haven't thought this through by RichiH · · Score: 1

      Not quite, but I am pondering Git.. It's the one SCM which can do distributed storage right.

    11. Re:you haven't thought this through by Anonymous Coward · · Score: 0

      seeing your uid, i can tell you're not a reel geek. real geek cred doesn't come from purchasing a service online, it comes from learning how to set up the same service for yourself

      Distributed storage was fun to play with in the 1990's. These days, it's something for script kiddies.

      From your response, I can tell you're a moron.

  14. Re:Mozy? Duplicity? by Firehed · · Score: 4, Insightful

    No Linux client, AFAIK (though I do run it on my MBP). It's become rather impractical for me as a photographer though, as sometimes I'll shoot enough photos that my internet connection would be completely maxed out for days on end trying to sync up the new data - and I have a decent-for-cable 1Mbps upload rate.

    rsync to Amazon S3 might be an option, if only for cross-platform capabilities. No versioning though, but outside of Apple's Time Machine (obviously useless for Windows and Linux), you're not going to get that without some major headache. Any remote system is going to be horribly slow for the first sync with any typical internet connection, and quite possibly problematically slow for photographers, media horaders, and in general people with big hard drives.

    --
    How are sites slashdotted when nobody reads TFAs?
  15. rdiff-backup and chironfs by Delgul · · Score: 3, Informative

    The subject says it all:

    - rdiff-backup to backup your data one backup server.
    - chironfs to clone the file system to another remote server.

    rdiff-backup runs on *nix and windows (with the help of Cygwin).

    Once set up, rdiff-backup needs virtually no maintenance. If needed, setup Nagios to warn you if things run afoul.

    Used this for years, never disappointed me so far!

    1. Re:rdiff-backup and chironfs by happyslayer · · Score: 1

      haven't used chironfs (I'll check it out), but rdiff-backup has been working at a medical clinic for off-site backup for 10 months without a glitch.

      Basic data from my backup:

      • 2.5 GB of original data (including the application)
      • mysql-autobackup script to dump daily, weekly, monthly copies of all relevant databases
      • Average daily diff is about 25-35 MB
      • Off-site server
      • cron job to run a custom script

      The results are that it's been running without a hitch for 10 months, and I've used the backed-up data to create a mirror of the medical clinic site in less than 3 hours.

      For me, that's all I need: The ability to take my backups and go from clean machine to fully functioning system in less than a day.

      It's been working so well that I've been quietly offering the service to some of my other IT customers who need to get up and running quickly.

      Legal and confidentiality issues are the biggest obstacle. The medical office data is covered because of previous arrangements and agreements. For other customers with these issues, you can set up a similar layout that works within their requirements.

      --
      Never confuse movement with action. --Hemingway
    2. Re:rdiff-backup and chironfs by Albanach · · Score: 1

      Where you have sensitive data you can look at duplicity - it uses gpg to create encrypted diffs.

      With Amazon S3 integration there's even a cheap distributed storage host available.

    3. Re:rdiff-backup and chironfs by Anonymous Coward · · Score: 0

      I completely agree about rdiff-backup. It's simple, fast, reliable and it makes restoring completely trivial (no need to wonder if your tape really has the file that you just lost). Plus there's a native Windows port now.

      http://wiki.rdiff-backup.org/wiki/index.php/Installations#Using_native_win32

  16. If you had Windows & Mac - Mozy by tkrotchko · · Score: 1

    If you had only Windows and Mac, I'd opt for Mozy (http://www.mozy.com) which is owned by EMC. It's $50/year for unlimited storage and their agent is unobtrusive and backs up even open files.

    The downside is that it limits upstream bandwidth to 1Mb/s, so your initial backup might take a week. But after that, it takes 3 minutes a night and it does it without prompting. I've strong-armed my immediate family into using it because it also allows me to monitor remotely the status of all backups.

    It's seriously good stuff.

    --
    You were mistaken. Which is odd, since memory shouldn't be a problem for you
    1. Re:If you had Windows & Mac - Mozy by penguin_man101 · · Score: 1

      Another vote for Mozy. If you want to try it out, up to 2GB is free.

    2. Re:If you had Windows & Mac - Mozy by Anonymous Coward · · Score: 0

      Mozy is a great product. It uses data de-duplication which reduces the delta set on each backup and creates the equivalent of a daily full backup.

      Restoration is easy and on a granular level. I have asked recently about a Linux version and have been told that there is one in the works.

      It's easy to use (even for Aunt Sally) and definitely worth looking into.

  17. Tape? Its old, but it is still useful by mlts · · Score: 1

    As an alternative, use tape. It may not have the shine of offsite backups, but if you need data backed up reliably, one easy option might be a DLT or another recent capacity tape drive. Combine a backup program that does encryption (bru, amanda, zmanda) and then set up a contract with Iron Mountain.

    Then, if you do a basic tape rotation schedule, periodically running recent tapes offsite, you should be protected against known disasters. And, because the tapes are encrypted with a high quality and long passphrase (this is assuming), if the tapes get lost or stolen, they won't do an attacker any good.

    On the low end, if tape is too expensive, there is purchasing external mini hard disks that only require power via the USB ports, combining those with TrueCrypt or another sturdy encryption program and using those instead of tapes.

  18. Use rsnapshot by Anonymous Coward · · Score: 2, Interesting

    Get 4 x 1TB disk and minimum RAID 6. Install Linux. Install rsnapshot, which offers:

    * Filesystem snapshot - for local or remote systems.

    * Database backup - MySQL backup

    * Secure - Traffic between remote backup server is always encrypted using openssh

    * Full backup - plus incrementals

    * Easy to restore - Files can restored by the users who own them, without the root user getting involved.

    * Automated backup - Runs in background via cron.

    * Bandwidth friendly - rsync used to save bandwidth

    You may also find CentOS or Debian tutorial useful.

    Good luck!

    1. Re:Use rsnapshot by RT+Alec · · Score: 1

      4 disks and RAID 6? That make little sense. If you have 4 drives and are willing to give up 50% to redundancy (which is not out of the question), RAID 10 (pair of striped drives + pair of striped drives, mirrored) is much less complex.

      Now a ZFS RAIDz/2 with 8 drives...

    2. Re:Use rsnapshot by Anonymous Coward · · Score: 0

      A two disk failure on RAID 10 has a 50% chance of catastrophic data loss. Not so with 6.

  19. But that's plain replication by Anonymous Coward · · Score: 0

    You're merely proposing data replication again.

    TFA mentioned Cleversafe, and specifically wanted feedback on that "dispersal" approach as it seems better than simple replication.

    If I understood TFA correctly, he proposes to develop (although he wrote "deploy") something similar, hence wants feedback on the idea first.

  20. Bacula? by up4fun · · Score: 5, Informative

    http://www.bacula.org/

    Runs pretty tight (low bandwidth), supports channel encryption and datastore encryption, can even create Bare Metal Recovery disks. I have a server room with LTO3 tape drives that I use to backup my clients' incremental data changes nightly, including Linux, Mac and Windows clients and servers. I have VPN's out to each client, so don't use the built-in channel encryption, but I maintain a keypair for each client.

    Backup only, but I /could/ present a maintained volume as a share over the VPN. Bacula supports disk and tape volumes as backup stores. I've personally had no need to do that to date.

    We're not talking terabytes here - my ISP would pwn me if that was going on, but I do circa 20G of data changes every night from clients. Some of them are laptops that are not always on or connected. Most are friends and family PC's, so it backs up when it can. I have to do almost no maintenance apart from changing a tape occasionally. The backup client is tiny and unobtrusive, even when running. On Windows it uses VSS, so it is reliable.

    I have had a number of panic phone calls (esp from my kids at Uni) who have lost a thesis or the like and are utterly amazed when, after a few clicks over the phone they look at their webmail and yesterday's version is in their inbox. That's what it's all about! I am the god of lost data! Which, of course, works for me.

    1. Re:Bacula? by Anonymous Coward · · Score: 4, Funny

      - plural of baculum.

      Do you know what a baculum is? It is the penis bone found in most male mammals with the exception of humans.

      Great product naming!

    2. Re:Bacula? by Anonymous Coward · · Score: 0

      this is why open source software often gets renamed when it gets popular.

      everyone has to be clever about a name that hasn't been taken...

    3. Re:Bacula? by Doctor+O · · Score: 1

      The best thing is that despite being posted by an AC it's not only funny but actually true.

      I'm going to introduce Bacula at work immediately. That will be the first IDIFTL meeting in company history.

      --
      Who is General Failure and why is he reading my hard disk?
    4. Re:Bacula? by JoCat · · Score: 1

      That puts a new spin on their motto:

      "It comes by night and sucks the vital essence from your computers." (From the site)

    5. Re:Bacula? by Anonymous Coward · · Score: 0

      So the guy who stars in Star Trek: Enterprise is Scott Multiple-penis-bones?

    6. Re:Bacula? by ZerdZerd · · Score: 1

      If you don't use tapes, BackupPC is a good alternative. I like it because it pulls the files to the backup-server and it compresses the backups very well (never stores anything twice).

      --
      I'm not insane! My mother had me tested.
  21. Amazon S3 by MosesJones · · Score: 1

    There are a bunch of people offering this sort of service (or build your own) on Amazon's S3. It has the advantage of being accessible to everyone, has the security built in and you only have to worry about the data not server availability.

    Backup not on the cloud just doesn't make much sense to me these days.

    --
    An Eye for an Eye will make the whole world blind - Gandhi
  22. Why not keep it simple? by rnswebx · · Score: 1

    I think this may have been said before, but what's wrong with setting up a basic samba server on one of your machines, and then simply using cron (Mac/Linux) or scheduled task(Windows) to dump the backups across the WAN via rsync/scp? (Depending on how important managing multiple versions of the same file is, perhaps using cron on the server-side do some SVN magic would make sense.)

    You'd be able to allow multiple users access to other folders with simple Samba ACLs, and it'd all be right on their desktop with an interface they're used to. As far as maintaining the backend, simple rsyncs between your linux servers keep everything up to date in the case of node failure.

    The solution also allows for easy drop-in replacement in terms of switching target backup/sharing servers. All you would need to do is email your new rsync script and tell LuserX to put it in XYZ directory. There'd be no walking them through configuring a new backup application they've probably never seen before.

    Clean and simple, but certainly not feature-rich.

  23. clouds everywhere! by demmer · · Score: 0

    RichiH obviously is more a Stallman guy asking for a diy and possibly opensource solution, that kept all his family data on privately owned systems...

    so why all the jungle/dropbox/flickr answers?
    has slashdot sunken that deep?

    what i do:
    - a fileserver with raid5 at my place and one at my parents'
    - nightly rsync replication of their data to my server and my data to theirs over ssh

    - ... so we always have two copies of the data and local redundancy

    - allows fast access to all data even huge amounts in the hundreds of gigs dimension and also if internet is somehow slow or down
    - this of course is no backup in the classic meaning! however rsync does not delete data on the replication site if you don't tell it too, so you kind of have protection against mistakenly deleted data too (unlike with only raid)

  24. Online backup / file sharing by dongola7 · · Score: 1

    For backups, I'd recommend one of the many online services. I used to do backups using a custom shell script, but you really can't beat the online services in terms of ease-of-use. Personally, I use Mozy. It's $55 a year for unlimited storage, but they offer 2GB free (and for a lot of folks, that's really all they need). If you have a lot of computers, you can set up a single account to manage all of them.

    As for file sharing, if all you're doing is sharing photos, I'd recommend a site like flickr. For other stuff, dropbox seems to work well.

  25. Silver Bullet for file ownership/ACLs? by Gazzonyx · · Score: 2, Insightful

    For storing permissions and the such, are you using a .tar container? My biggest stumbling block with my backup scheme is storing ACLs and permissions.

    I've got a few ideas about doing it, but they're all kludgy or force me to walk away from my rsync scripts which are really fairly mature at this point. Furthermore, I need to get deltas downstream and packing everything in to one file pretty much defeats that purpose at the several gig level unless I'm running an rsync server to calculate the diffs. These kinds of things become problematic due to the infrastructure I'm working with.

    I'm really starting to lean towards running everything over iSCSI, but then I've got to get the VPN thing going which could require some re-subnetting at either end of the tunnel. Needless to say, I'd prefer to avoid that or any other solution that requires messing with stuff that Works Right Now.

    Have you dealt with these issues at all, or at least know what won't work? I'd appreciate any insights before I use a brute force method.

    --

    If I mod you up, it doesn't necessarily mean I agree with what you've said, sorry.

    1. Re:Silver Bullet for file ownership/ACLs? by jimicus · · Score: 2, Informative

      Recent versions of rsync fully support POSIX ACLs (including, if asked, setting up ACLs on the receiving end that don't make any sense because they refer to uids that don't exist - though you could work around that one with a common authentication mechanism such as LDAP) - I've not tried to get Windows working so I'm not sure how well that would work.

      Be warned that full POSIX ACL support hasn't made it into every Linux distribution yet - IIRC Debian Etch's rsync doesn't, for instance. If you're paranoid, you could add a line to call getfacl before you call rsync in your existing script script - shouldn't require more than one line.

    2. Re:Silver Bullet for file ownership/ACLs? by Anonymous Coward · · Score: 0

      Best rsync backup tool is snapback2 (see cpan)

      However, for general backup see brackup. This thing backs up encryped or plain to targets including ftp, filesystem, ssh and S3. Files are sliced and diced into chunks and MD5 sums ensure that the data is only stored once. The file tree is stored separately so that you have a versioned filelist separate to the data.

    3. Re:Silver Bullet for file ownership/ACLs? by jhealy1024 · · Score: 1

      Even better, recent versions of rsync allow you to shoehorn all metadata into xattrs on files, so you can (for example) store Mac OS X metadata and ACLs on a linux box with no special file system setup. You can even store the files as an unprived user and have the real perms stored in xattrs as well.

    4. Re:Silver Bullet for file ownership/ACLs? by jimicus · · Score: 1

      Be warned that with Mac OS X metadata, that gets stored under the same filename as the original with ",_" prefixed - I'm not sure what happens if a file with that name already exists.

      Also, if you want to make full use of rsync options, you need the same version on both ends of the tunnel.

      (That being said, props to Mr. Tridgell, rsync is an absolutely awesome tool which has saved me I-don't-know-how-much in terms of time and effort. I really must make a donation to the project at some point),

    5. Re:Silver Bullet for file ownership/ACLs? by Em+Adespoton · · Score: 1

      To add to the other post: BackupPC plus some scripting of the latest rsync. BackupPC lets you pull all the data from the various sources into one pool, and tracks your ACL for you. Rsync then lets you mirror your BackupPC pool to multiple sites. Data is restored/retrieved via web interface.

      Not quite a transparent SAN, but I think it meets all the listed requirements.

  26. If I might plug a favorite project! by Gazzonyx · · Score: 2, Informative

    Check out S3Backer. It lets you mount an Amazon S3 bucket to your Linux/Mac/BSD/*NIX box. GPL F/OSS as icing on the cake.

    --

    If I mod you up, it doesn't necessarily mean I agree with what you've said, sorry.

    1. Re:If I might plug a favorite project! by Anonymous Coward · · Score: 0

      and s3fs http://code.google.com/p/s3fs Gazzonyx: we gotta stop running into each other like this!

    2. Re:If I might plug a favorite project! by Gazzonyx · · Score: 1

      Randy!
      It's a small internet, no? :)

      --

      If I mod you up, it doesn't necessarily mean I agree with what you've said, sorry.

  27. AFS? by TheRaven64 · · Score: 4, Interesting
    AFS is only about 20 years old, and supported on Windows, Mac, and most flavours of *NIX, so it might not be sufficiently mature for your needs, however it does provide the following capabilities:
    • Remote storage with local caching.
    • Snapshots, allowing coarse-grained versioning.
    • Replication on the server.

    As well as all of the standard things you'd expect from a networked filesystem (ACLs, authentication, and so on).

    If you set up an AFS cell with your volumes replicated across a few remote servers and get your clients to connect to this cell then it should be fine. Set a cron job to take regular snapshots, and dump them to some offline medium periodically.

    --
    I am TheRaven on Soylent News
    1. Re:AFS? by Eil · · Score: 1

      AFS is only about 20 years old, and supported on Windows, Mac, and most flavours of *NIX, so it might not be sufficiently mature for your needs,

      Sheesh, condescend much?

    2. Re:AFS? by emj · · Score: 1

      AFS is not "mature", maybe it works flawlessly on Solaris but the Linux and Mac clients are unstable and prone to crashes.

    3. Re:AFS? by Doug+Neal · · Score: 1

      AFS is only about 20 years old, and supported on Windows, Mac, and most flavours of *NIX,

      AFS as a standard may be 20 years old, but are there any actively-maintained, feature-complete, open source, and cross platform implementations around? I looked into using AFS about four years ago and decided that the answer to that question was "no".

      OK, maybe it's time to give it another chance...

      The Wikipedia entry for AFS says:

      There are three major implementations, Transarc (IBM), OpenAFS and Arla, although the Transarc software is losing support and is deprecated. AFS (version two) is also the predecessor of the Coda file system.

      Transarc is off the list straight away, then. On to OpenAFS, the open-source fork of Transarc. Seems to be actively maintained, the last development release being within the last few weeks... but:

      1.5.53 is also the most recent in the series of releases intended to provide new experimental features including the Demand Attach File Service and Disconnected AFS

      Disconnected operation is still "experimental". Not encouraging.

      On to Arla. Seems to be a client-side only implementation and hasn't reached a 1.0 release yet. Next!

      So, AFS 2 became Coda? The Wikipedia entry for Coda states that Coda has been superceded by InterMezzo, another dead project that was dropped from the Linux 2.6 kernel and has itself been superceded by Lustre... which actually looks damn impressive, but is designed for Linux HA clusters, and doesn't have the Windows and Mac support that AFS does.

      So yeah, still can't find an actively-maintained, feature-complete, open source, and cross platform distributed filesystem. Would love to be proven wrong though; any takers?

  28. Linux-based NAS with built-in applications by golodh · · Score: 1
    It sounds as if the author of the opening post is looking for a Network-Attached Storage device that will function as a server, is based on Linux, and comes with pre-loaded applications.

    I found and tested the predecessor of the following device (which I can recommend on basis of a year-long test of a sample with N=1): Bubba (see http://excito.com/bubba/about-bubba.html ). A Swedish NAS device. I have to note that it's certainly not "distributed" in the sense that it's easy to mirror data across multiple devices (I didn't try and wouldn't know an easy way of doing that). It's basically a server, so you'd still need to take care of backups yourself.

    It's a metal box the size of a lunch-box, contains a HDD, a PowerPC processor, two ethernet interfaces, and comes pre-loaded with Linux 2.6 (Debian Etch), and has a web-based control interface for adding users (see http://excito.com/bubba/about-bubba.html ). It can act as a server (Samba), torrent and email downloader, and router (if you want). It's got decent tech support through this forum (see http://forum.excito.net/). You can buy the box with or without HDD.

    Nevermind the website (they brought in a consultant who made something I really dislike), the box and its applications are solid. Have a look and see if it's what you need.

  29. Wuala by Anonymous Coward · · Score: 0

    Not sure if I understood your request correctly but check out Wuala. Great for storing and sharing information in a secure manner over the internet.

    1. Re:Wuala by Juser · · Score: 1

      Why would you allow someone to store their data on your machine? On top of that its encrypted? A) You have no idea what they are storing in your house. B) You have no idea where your data is being stored and if it got accessed.

  30. rsync and dyndns by MMC+Monster · · Score: 1

    First set each computer up with a dyndns account so that remote administration is easy.

    Then set up folders in each computer for each member of the family. For each family member's main computer, make symbolic links to other family members picture folder, etc.

    Set up a schedule to use rsync to copy the contents of the folders on a daily basis.

    While you are at it, I suggest adding one more computer to the mix that will copy the home folders for all family members and keep them in a svn folder so they can call you to undelete files.

    --
    Help! I'm a slashdot refugee.
  31. Try JungeDisk by pmeinl · · Score: 1

    Try JungleDisk http://www.jungledisk.com/ It uses Amanzon S3 Storage.

  32. Mozy by Anonymous Coward · · Score: 0

    What about Mozy? http://mozy.com/ 2GB for free, or $5 a month for unlimited storage. Does versioning and is really easy to use.

  33. JungleDisk by Anonymous Coward · · Score: 0

    I use JungleDisk to backup everything to Amazon S3.

  34. BackupPC by squidinkcalligraphy · · Score: 1

    BackupPC might do what you're after. From the blurb:

    high-performance, enterprise-grade system for backing up PCs
    BackupPC is disk based and not tape based. This particularity allows
    features not found in any other backup solution:
      * Clever pooling scheme minimizes disk storage and disk I/O.
          Identical files across multiple backups of the same or different PC are
          stored only once (using hard links), resulting in substantial savings
          in disk storage and disk writes.
      * Optional compression provides additional reductions in storage.
          CPU impact of compression is low since only new files (those not already
          in the pool) need to be compressed.
      * A powerful http/cgi user interface allows administrators to view log files,
          configuration, current status and allows users to initiate and cancel
          backups and browse and restore files from backups very quickly.
      * No client-side software is needed. On WinXX the smb protocol is used.
          On linux or unix clients, rsync or tar (over ssh/rsh/nfs) can be used
      * Flexible restore options. Single files can be downloaded from any backup
          directly from the CGI interface. Zip or Tar archives for selected files
          or directories can also be downloaded from the CGI interface.
      * BackupPC supports mobile environments where laptops are only intermittently
          connected to the network and have dynamic IP addresses (DHCP).
      * Flexible configuration parameters allow multiple backups to be performed
          in parallel.
      * and more to discover in the manual...

    --
    "I think it would be a good idea" Gandhi, on Western Civilisation
    1. Re:BackupPC by gardyloo · · Score: 1

      BackupPC is nice. Its pooling strategy is very good, it works brilliantly and painlessly when backup up linux -> linux (though I have to re-try it Windows -> linux), and their UI is what a lot of the other solutions need for people to browse/restore their own data using a web browser.
          Its devs are responsive, too!

    2. Re:BackupPC by Anonymous Coward · · Score: 0

      Thats what I use. I have a Linksys NSLU2 + ext. HDD running Debian and backuppc. It makes backups of 2 servers. Making backups is not super fast and concurrent backups are just too slow. But basically I set it up a few years ago and it is making backups ever since without any maintenance.

      Oh, I choose an NSLU2 with USB HD so I can build it into a small water-/fireproof box and keep it in my cellar or in the shed. It should be safe there in case of a disaster..

    3. Re:BackupPC by Polimath · · Score: 1

      I use BackupPC at home and at work. It backups up linux clients easily. Setting up Windows XP pro as a client is actually pretty easy, too, using a very small version of rsyncd that is available at the BackupPC site. I have not had luck setting up XP Home as a client using rsyncd, I suspect because of the way it handles permissions. If an XP Home client is on an internal network, you could share the folder you want backed up I don't do that because the XP Home machine I want to back up is a laptop -- shares bad on laptop.

  35. SyncToy by paylett · · Score: 1
    For the slightly more mundane task of just backing up from one PC to another, I've started using Microsoft's free SyncToy tool on family and friend's networks.

    It's simple. It works.

    --

    Believing something doesn't make it true. Not believing something doesn't make it false.

  36. A Free Open Source Backup Tool... by onitzuka · · Score: 1
    ... might be what you need.

    I wrote the open source backup tool "Gazoo!" to perform fast reliable backups using rsync. Think of it as a command-line TimeMachine. ;-)

    Give it a whirl! Never loose a file again! :-D

  37. www.jungledisk.com by Phurge · · Score: 1

    www.jungledisk.com - does all you want, its cheap and its hosted on Amazon's multiple servers.

    --
    I'll see your hokum and raise you a boondoggle.
  38. Re:Mozy? Duplicity? by shadow349 · · Score: 2, Informative

    rsync to Amazon S3 might be an option, if only for cross-platform capabilities. No versioning though

    cough ... JungleDisk ... cough

  39. Our Stuff... a bit of a rambling post by ka9dgx · · Score: 3, Insightful

    I think that the issue is faced by far more people than is readily apparent... it's the need for a VERY easy to use tool to share Our Stuff with Our Family. If my Mom and sisters were able to share all their photos with each other by carrying a USB drive around when they see each other... the most important thing they have on their computers would be backed up... the need for social file sharing is huge... we just don't have the tools to do it well yet. Something that does auto-discovery of stuff, remembers previous decisions, and just goes to work making copies in the right directions is what we need.

  40. Online service by Anonymous Coward · · Score: 0

    I use Spideroak ( https://spideroak.com/ ). They do backup and sharing like you need.

  41. CloudBackup is what I use by scottm52 · · Score: 2, Informative

    I just didn't want to deal with it. I use cloudbackup.openrsm.com and have them buy an account. It can do a whole network of Linux, MAC, and Windows machines with one account, or just a laptop. The client software is free and does network drive of the backup space too. I figure easy and my friends paying for it works. It's saved my butt a couple times too.

    1. Re:CloudBackup is what I use by homeofficeblogger · · Score: 1

      I use this for my work network. Three PC's and a MacBook AIR.. It works great. Especially with the network drive. Makes it easy to grab files from the home office if I'm out and about and forget to copy something over. It's pretty cheap for what you get too.

  42. AhSayOBS by kyouteki · · Score: 1

    AhSay's free version of their Offsite Backup Server (http://www.ahsay.com/en/freeedition/ahsay_free_edition_index.html) does versioning and, well, everything you're really asking for. I use this at work with about 20 clients, and it's rock solid.

    --
    A slashdotter who didn't build his own computer is like a Jedi who didn't build his own lightsaber.
  43. Re:Mozy? Duplicity? by TuaAmin13 · · Score: 1

    JungleDisk looks good.

    I have no idea what type of cost that's going to incur after a while though. It's probably going to be like an additional cell phone bill per month, which I don't know is feasible.

    Another option I kind of thought of--can you buy a server from a datacenter at a one time fee, put in as much storage as you want, and then just get charged for bandwidth? The up front costs of that would be fairly high, but it seems like it would be the cheapest long run cost, IF you could find some place willing to do it.

  44. It's doable.... by cptdondo · · Score: 1

    I do this commercially. I ship a small embedded box with custom firmware that works as a samba client and runs a VPN back to my server.

    Then I run rsnapshot to rsync the remote.

    That way the clients don't have to do any installation at all, I can admin my box remotely without any local representation, and it will work with any system as long as it supports samba.

    The only setup required on the local site is a userid for the backup client.

    The devil, of course, is in the details.

    1. Re:It's doable.... by happyslayer · · Score: 1

      What equipment do you use for the embedded box? I'd love to do the same for my customers.

      --
      Never confuse movement with action. --Hemingway
    2. Re:It's doable.... by cptdondo · · Score: 1

      Typically off-the-shelf wifi access points. Currently the Asus 500G Premium running OpenWrt.

      I think my samba build is part of the openwrt distribution (they were using 2.x and I built 3.0.xx for it.)

      Disable the wifi, take the antenna off, and you have a really capable embedded linux computer.

    3. Re:It's doable.... by happyslayer · · Score: 1

      Nice...never would have thought of that. I've been meaning to play with OpenWrt. Sounds like I should get off my butt and do it.

      --
      Never confuse movement with action. --Hemingway
  45. sshfs by cenc · · Score: 0

    Google it for more info, it will do everything you are asking for in a secure manner.

  46. Have you looked at BackupPC by Xylaan · · Score: 1

    It supports rsync, ssh, tar, and SMB. Performs pooling which reduces the number of stored files. Only issue is it uses the local account password file, so you'd have to set up an account for each user you wanted to give direct access too. http://backuppc.sourceforge.net/

  47. SVN by dheltzel · · Score: 1

    I use SVN to backup my sister's important stuff to my home server. It was easy to teach them to commit changes and add new files to be versioned because I installed Tortoise SVN on their Windows computers. It has full versioning and can use an encrypted link if that's important.

    Everything else just seemed like too much work to implement.

  48. Evidently... by yttrstein · · Score: 1

    You've not purchased a Mac in the last five years.

  49. Our Solution - by jchawk · · Score: 1

    I run a company called Real Pro Data Solutions, LLC.

    You can check us out at - http://www.realprodata.com/

    We run on open standards and can provide assistance with setup. We have solutions that will work on all of your platforms, Windows, Linux, Mac...

    We're a small business so you can always work with the same people who helped build the company!

    And you can actually call and talk to us! :-)

  50. Datumguard will backup windows and linux servers by datumsoft · · Score: 1

    My company also struggled with an easy to use and robust offsite backup solution that backed up Windows servers with exchange along with our Linux and Unix servers running MySQL and other services. Backups were always a large headache for us and our customers. About two years ago we started testing multiple backup systems and running large scale recovery scenarios for our company and our customers. We needed it to be dead simple to use for both customers and administrators, offer revision control, encrypt the data, compress the data, and be able to store the data for up to 7 years or longer. In our quest to find and implement the ultimate solution we started our own company offering offsite backup, and we had over two years of testing and breaking many other solutions. Now we are the company our customers look towards to ease the stress of their backups both large and small. http://datumguard.com/

  51. Re:Mozy? Duplicity? by isorox · · Score: 1

    rsync to Amazon S3 might be an option, if only for cross-platform capabilities. No versioning though, but outside of Apple's Time Machine (obviously useless for Windows and Linux), you're not going to get that without some major headache.

    Server running opensolaris/*bsd with ZFS, rsync to that, create a snapshot every day.

  52. Sounds familiar by ThreeGigs · · Score: 1

    Don't most businesses already do this? On laptops, I used roaming profiles, and synched My Docs with the user's home directory on the server. All additional backups, versioning, etc. were handled on, and by the server.

    Downside is it's not a complete solution, as any data stored in Program Files or Common Files dirs wasn't mirrored.
    Upside is that it's simple network management, and even lets you use login scripts.

    I don't think you're ever going to find a 'simple' (as in 3 clicks) solution usable by non-techies with versioning. Backup, yes. Sharing, versioning....not so much. It looks like you're simply going to have to be the server admin, and let the server deal with the versioning, multiple sites, and sharing.

  53. Mozy for backup by isoga · · Score: 1
    Hi

    I've been using Mozy.com for back ups. The client is totally unnoticeable after it's installed. It just runs some time each day when you are not using your PC. Handles large files like .pst nicely as well. They give you 2GB free or unlimited for 5USD/month and bonus space if you refer people :)

    https://mozy.com/?code=WAQ9DM/ and scroll down to 2GB free offer.

  54. What i do/have tried. by pjr.cc · · Score: 1

    What I personally use is all linux so it may not be much use to you however, i've gone thru several iterations of a simple script for my machines.

    Originally it was just a usb connected drive at home and work that got rsynced to (it would look for an lvm volume with specific name and if it existed, kick it off). The first iteration of the script basically just rsynced the data across and once a week did a dump. The second iteration took lvm snapshots on top of that (with some minor automated management) and added rsync -d to the mix (deletes files on the target that dont exist on the source).

    The next iteration was a much bigger change (and probably the smarter one) so basically it would rsync and snapshot files that didnt come from (or had changed from) packages that existed on the machine. i.e. it would go thru all the files on my harddrive, if it was from a package it would leave it alone (getting the OS back is simple then just overlay the backup stuff on top). a little while ago i switch to zfs for the external drive (my only real regret is zfs will probably never be a part of the linux kernel) and thats been pretty good cause zfs is a brilliant filesystem.

    In the OSS space i've played with things like afs, coda, drbd and things that wrap around svn and cvs (how I wish lvm had replication in built).

    But, having worked in the big-boy space for a long time i've seen alot of commercially available implementations most of which are available cross platform. Some are based on backup solutions (netbackup, backupexec, backbone, etc) and some work at the storage level.

    One implementation I was mostly impressed with though was using falconstor. Its basically a block-level replication software that connects to iscsi volumes and is really quite impressive in the way it manages backing up data. The company itself had mostly rover types who were in the office maybe once a week, the rest of the time their volumes would sync across links of varying speed (even over vpn) and was able to be configured not to chew up the entire link space. It all spoke back to an iscsi storage device (equalogic or emc with its iscsi head, i cant remember) and was also snapshotted occasionally. The best part about it was they never really had to deal with it, it just seemed to work 99.99% of the time.

    So far i've found the OSS side a little friendlier to those who know how to use them (mostly because they're just so much easier to modify), while the commercial side do everything you expect them to with varying degrees of success without being flexible enough.

    Of course, the one definitive thing i've found with people is that you'll always find someone who'll end up saving data in odd locations then get cranky when you cant restore it because you just weren't backing up c:\windows\temp ;).
    The whole "you can please some of the people some of the time" holds true 99% of the time.

  55. Datto Z series by Anonymous Coward · · Score: 1, Interesting

    Datto's new line uses ZFS with snapshots. If they're willing to spend a couple hundred bucks, it's a really easy (and foolproof) solution.

  56. The answer you don't want by slashdot_commentator · · Score: 1

    The current state of open source backup technology is abysmal. Currently, I'd say reliable would by rsyncing to a large, removeable hard drive, and then couriering it to a remote location or "secure" physical storage service.

    For "long term" backup, get a DLT tape drive, and selectively backup to tape. The tape, if properly stored, will be more likely to recover data than a hard drive. Also note, this is a few hundred dollar investment, with large capacity DLT tapes going for a hundred a pop as well.

    =====

    As anyone who's experienced this can attest, DVD-R/+R media really sucks for long term data storage. (We're talking 5+ year ranges.) But the latest error correction technologies (parchive2) has got me rethinking the problem.

    parchive2, for those not familiar with it, uses reed-solomon ECC algorithm to produce checksum files against a datafile (or set of datafiles). Its conceptually similar to raid5 data storage. So even with an inevitable failure of a sector of DVD-dye, you can still recover the data intact after running a reconstruction. A beautiful demonstration of a datafile's recoverability, with even 15% loss of data, would be to download a DVD (or even bluray disk) off USENET NEWS, delete a few data chunks, and then run par2 to rebuild the lost chunks. (And unrar to restore the original DVD disk.)

    The general idea is that you process the directory you want backed up using the RAR program (which "archives" it into a dump file, and then chops up the dump file into even pieces), create enough par2 data to allow a recovery with 20% data loss, and then split the data files and the par2 ECC files over 6+ DVD disks. (That would allow a recovery, even if you lost a disc, and lose a few sectors on another.)

    One "trivial" problem would be to write up a convenient utility to automate this whole process. The difficult problem, as I see it, is to increase data survivability when the DVD header block is lost, and the whole DVD disk becomes unreadable. I suspect there are alternate forms of DVD data encoding which allows one to either retrieve an alternate header block, or do sector recovery of disc data.

    A payoff here would be awesome; one wouldn't need to split a backup over many DVD disks, and still ensure data recovery, say, 10 years from now. So I will have to rip into hardware DVD ISOs to get an answer, but if someone has a better esoteric knowledge of DVD technology, any help would be much appreciated.

    --
    There is no America. There is no democracy. There is only IBM and AT&T and DuPont, Dow, General Electric, and Exxon
    1. Re:The answer you don't want by cdrguru · · Score: 1

      While it is true that if the initial border zone on a DVD becomes unreadable the disc cannot be accessed normally, there are recovery techniques that will allow the data to be read from the disc. No, there isn't any way to "demand" access to the data - you have to go through the drive's normal protocol. But you can play some tricks.

      No, there aren't any "alternate forms" of DVD encoding. DVD discs are rather complicated and the drive and chipset are very, very much required as an intermediary. And the drive only works one way. Completely different than a hard drive.

      And finally, an "ISO image file" contains only the most basic information from DVDs. There is lots and lots of other stuff on the disc that isn't represented there at all. The idea that you can represent a CD or DVD completely as an ISO image file is utterly false. Sure, you can get the basics, but that is all.

    2. Re:The answer you don't want by slashdot_commentator · · Score: 1

      No, there aren't any "alternate forms" of DVD encoding.

      So UDF encoding doesn't write backups to its "bad sector" table, or alternate DVD block headers in case the master one gets blown away?
      With the current, standard ISO9660 spec, is there a way to retrieve raw data on each sector on a disc? One then could still arrange the data format within each written sector. Then if the master block on the DVD disc gets blown, one could do raw reads of each sector on the disk, patch it together as one dump file, and then select certain written sectors as checksum data, create a checksum volume file out of it, and then rebuild the dump file, which would result in the recovered data.
      Are there reference spec details that limit the ability to do that?

      And finally, an "ISO image file" contains only the most basic information from DVDs. There is lots and lots of other stuff on the disc that isn't represented there at all. The idea that you can represent a CD or DVD completely as an ISO image file is utterly false. Sure, you can get the basics, but that is all.

      Apologies, I misspoke. I meant that I would have to rip into the hardware reference specs for DVD writing. They're referred to as RFC's on the Computer Science end. I'm not sure how those ISO committee reference documents are referred to (ISO docs?)

      --
      There is no America. There is no democracy. There is only IBM and AT&T and DuPont, Dow, General Electric, and Exxon
  57. Wuala by numLocked · · Score: 2, Interesting

    I'm surprised no one has mentioned Wuala - www.wua.la - which is a distributed online storage system. You agree to store (encrypted) bits of others' files in exchange for the ability to do so on others' machines across the wuala network. It's free and pretty damn cool. They can explain it better than I can: http://wua.la/en/learn/why

  58. wuala anyone?? by Guille+le+Bon · · Score: 1

    wuala works great for me. It's free, distributed, encrypted, and you can have as much space as you want as long as you share a corresponding portion of your hardrive (you get 20gigs online if you share 20 gigs of your hdd * the time percentage your machine is online). It has clients for Windows and Linux, even Mac. You can keep all your files private or decide you want to share a few folders either with friends, or everybody http://wua.la/

  59. rysnc.net will work for you by Anonymous Coward · · Score: 0

    Hmmm. One great solution: http://www.rsync.net/
    Their blurb:
    "Business continuity and disaster recovery built on open standards and common sense.
    - Simple backup/restore for Windows/Mac/Unix
    - Encrypted backups, snapshots, IPV6, sshFS
    - rsync, ftp/sftp/scp, rdiff-backup, WebDAV...
    - Map as a drive letter or mount in the Finder
    - Subversion/MySQL/Postgres/Exchange/SQL
    Choose the only provider with geographic redundancy around the world.
    Standard Offsite Filesystem: Full featured account, data stored in one backup location. $1.60GB/mo
    Geo-Redundant Filesystem: Data is automatically replicated to a second, redundant location. $2.80GB/mo "

    1. Re:rysnc.net will work for you by Kz · · Score: 1

      I've used them for small jobs. with rsync it's great for backups; but the WebDAV access isn't as good.

      i think it's somewhat cheaper than Amazon's S3, but don't know how well compares to other services.

      The best part is it's terms of service. it's short, and each paragraph is followed by a translation from legalese to english. they really inspire trust.

      --
      -Kz-
  60. advertising spam? by Uzik2 · · Score: 1

    This is just advertising spam taken to the next level.
    If you was bright you'd notice you don't control distributed backups.
    Not having control of your own data is double plus ungood.
    I.E. This is just someone shilling for a company.

    --
    -- Programming with boost is like building a house with lego. It's a cool but I wouldn't want to live in it
  61. Re:Mozy? Duplicity? by Firehed · · Score: 1

    Either you have approximately three libraries of congress worth of data, or a very cheap cell phone bill. S3 storage is pretty cheap considering the redundancy and offsite and all that good stuff - 15c/GB-month, and 10c/GB for transfer in. So up to about 30GB of so worth of stored data, it's cheaper than Mozy ($5/mo), but I'd need to be storing over 400GB-month of data plus a good chunk of rsync transfer bandwidth before it would cost as much as my cell line.

    And given the cell reception I get in S. Bumfuckville, NH, it'd probably be a much better investment.

    Most servers are going to have some sort of monthly charge associated with them. A co-lo will charge you out the wazoo for this kind of thing - remember, rack space is expensive. Cheap hosts that offer plenty of oversold storage (they tend to be more concerned about CPU cycles though, so using it as a remote drive probably wouldn't be an issue) don't typically give you SSH access, and anything beyond a cheap host and you might as well just go for S3/jungledisk unless you have a tremendous shit-ton of data in which case just buy a couple of Drobos and befriend a UPS guy and a relative in a different state. With 1.5TB drives hitting the market, you can pack about 4TB of drive-failure-safe storage in each unit. But honestly if you have those kinds of redundant+offsite storage needs, you're probably not asking Slashdot how to achieve it.

    --
    How are sites slashdotted when nobody reads TFAs?
  62. Something to keep an eye on by ElecCham · · Score: 2, Interesting

    ObStdDisc: I work for the company I mention here... but suffice it to say that I left a very stable job to do so - so's to indicate that I do actually believe in the excellence of the product.

    Keep an eye on Rebit. It doesn't do what you're asking about as of this moment... but (without treading into realms of "I'm not allowed to talk about that") I can safely say that the future holds some interesting things along this sort of direction.

    --
    Sig broken, watch for .finger
  63. restore-backup.com by Anonymous Coward · · Score: 0

    restore-backup.com

  64. Re:Mozy? Duplicity? by Fweeky · · Score: 1

    rsync to Amazon S3 might be an option, if only for cross-platform capabilities. No versioning though, but outside of Apple's Time Machine (obviously useless for Windows and Linux), you're not going to get that without some major headache

    Um, there are plenty of incremental backup tools dotted about, just upload the dumps?

    Alternatively tarsnap is currently in beta testing, uses Amazon S3, and the client is written by the top FreeBSD security bod, with the client coming as source (though the service isn't free).

  65. wuala by Profane+MuthaFucka · · Score: 1

    I'm testing it out right now for porn backup. So far, so good.

    --
    Fascism trolls keeping me up every night. When I starts a preachin', he HITS ME WITH HIS REICH!
  66. Allmydata Tahoe by Anonymous Coward · · Score: 0

    I'm not sure if it's precisely what you were looking for, but Allmydata Tahoe looks like an interesting possibility. It sounds like it meets most of your criteria, except versioning, but I think it exposes a FUSE interface, so you could probably just run a versioning filesystem like Git atop it.

  67. anyone mention Unison yet? by AgileGuru · · Score: 1

    unison is like rsync, but _mirrors_ changes on both sides, so you could create a star-type file distribution network based on a single backup location. So, anytime people drop a picture into their 'public pictures' folder, it will eventually get synced to everyone's 'public pictures' folders. uses rsync under the covers so is efficient. Good documentation; didn't take too long for me to get it set up right on a mixed windows/linux/mac environment. http://www.cis.upenn.edu/~bcpierce/unison/

  68. Clarification from OP by RichiH · · Score: 1

    First of all, thanks for all the feedback. I appreciate it. :)

    It seems that I did not state my requirement well enough and there's some confusion about them. So, here goes again:

    * The clients are distributed all over Germany, sometimes all over the world. DSL or better is available at all sites.
    * Data synchronization must be asynchronous.
    * Clients must have local shares, slices, working copies, whatever. They must work offline, as well.
    * Data must be partitionable. While my sister and mother share all their pics, others do not.
    * Automagic sync as soon as network connectivity is available is a plus. Requiring them to click a button is fine, as well.
    * Almost all clients will be Windows.
    * Total amount of storage used is a few dozen GiB. Using a few times that amount on the servers is fine.
    * It's OK if brains need to be poured over this baby during initial setup.
    * It's _not_ OK if tech knowledge is required during day-to-day operations.
    * I will not be on site very often, depending on how far away they live.
    * It's OK if they are not able to restore anything by themselves. I will just grad whatever and send it per eMail.
    * I don't like S3 or other cloud implementation. That data is mine/theirs and not storing it elsewhere is better than encryption. Hard drives are too large, anyway. Storage space is not a concern, reproductions of the complete data make it fault tolerant.

    Once again, thanks for all feedback :)

    1. Re:Clarification from OP by Anonymous Coward · · Score: 0

      Funny, it's like you want DropBox, but without the cloud. How about using DropBox with truecrypt containers?

  69. Tahoe by ajb44 · · Score: 1

    Sounds like tahoe could be what you want. However, it's pretty young, so I wouldn't necessarily rely on it yet.

  70. SpiderOak is my solution by Anonymous Coward · · Score: 0

    SpiderOak https://spideroak.com/engineering_matters is a fault tolerant, fully encrypted remote backup, and supported Linux since day 1. It does block level dedupe, and preserves all historical versions, deleted items, etc. I use it to combine my backup of multiple machines into one archive. Only real drawback is it uses more CPU while archiving items than other archiving systems.

    1. Re:SpiderOak is my solution by grikdog · · Score: 1

      Thanks for the tip. Does complete backup versioning force you to continually add on more storage? Are users allowed to delete unwanted items from the SpiderOak servers? 100 Gb at $10 a month/$120 per year adds up in a hurry for some of us.

      --
      ``Tension, apprehension & dissension have begun!'' - Duffy Wyg&, in Alfred Bester's _The Demolished Man_
  71. Just me? by Hognoxious · · Score: 1

    When I saw the words "easy", "reliable" and "distributed", I was expecting the punchline to be "choose any two".

    --
    Confucius say, "Find worm in apple - bad. Find half a worm - worse."
  72. carbonite by keraneuology · · Score: 2, Informative

    I use carbonite. Small app, I can have multiple machines within the same account, unlimited data for something like $49/year. I got it for a work machine - and it has already been used to retrieve deleted files (very painless process), liked it so much that I got it for a couple of the family machines that I support. I set it up for them and the only instructions they have to remember is "don't save tax returns under c:\windows\system32, save them under My Documents".

    --
    If the g'vt kept the data on you that google does you'd better believe you'd be calling it "doing evil"
  73. Great technology, not so great product by apankrat · · Score: 3, Insightful

    I watched their CTO's Google Talks presentation and it was really interesting. I got all excited, joined their beta only to realize that they - IMO - misused the technology they had and designed a rather mediocre product. Wuala wants to be a backup tool, a sharing tool, a social networking medium as well as few other things. In other words it lacks focus and wants to do everything - an approach that rarely works.

    --
    3.243F6A8885A308D313
  74. for the lazy by Anonymous Coward · · Score: 0
  75. crashplan.net by osssmkatz · · Score: 1

    http://www.crashplan.net/ has done exactly what you describe. everyone in your 'backup network' backs up to each other, and for free. They make money from selling their own offsite backup. --Sam

  76. NovaBackup 10 for a windows solution. by Anonymous Coward · · Score: 0

    Personally I use NovaBackup 10 from http://www.novastor.com/. I just the DR image capability in it to go to my 'backup server' (NAS), as I found it more reliable than ghost or true image. Though I did test the Amazon S3 and FTP parts of it along with burning to dvds, BD backups are still too expensive for my taste. Though it does do tape and other stuff, but isn't tape dead? :)

  77. CrashPlan by milesw · · Score: 1

    CrashPlan is an excellent option: supports Windows, OS X, and Linux, backups to other computers you own or trust (or even to CrashPlan servers, if you prefer), everything is encrypted, responsive dev team, etc, etc. Check out the video.

  78. CrashPlan by Anonymous Coward · · Score: 0

    I've been using CrashPlan on my OS X machines for quite awhile, and it's been working nicely. It's not free (for the person that wants to do a backup), but they do have Windows, Mac and Linux clients. And it's completely free to run on machines that will only run as backup destinations.

    It can't do your photopoll stuff, as far as I know, but the rest seems fine.

  79. Try a 21st century solution by Anonymous Coward · · Score: 0

    Make them all use Macs and use TimeMachine. flawless.

  80. Re:Mozy? Duplicity? by nachoboy · · Score: 1

    +1 vote for JungleDisk. I use it on my Windows and *nix machines and couldn't be happier. I really like the idea of paying for the software once (use on as many machines as you like, with free upgrades forever) and paying Amazon for storage "at cost." So many other internet services rely on oversubscribing limited resources, with heavy users eventually getting ejected in favor of more profitable clientele. With Amazon, I know I'm getting exactly what I pay for and they're not going to disappear with my data anytime soon.

  81. iSCSI by Zenzilla · · Score: 1

    Run an iSCSI target for each person you want to store info for. They can connect with their windowz and to them it acts like a slow hd. If you need security add vpn.

  82. Crashplan for backups and recovery by Darkk · · Score: 1

    I have heard about this site and it looks darn easy to use:

    http://www.crashplan.com/

    They support Win, MacOS and Linux

    If you have a "buddy" to store your backups then it'll be free otherwise to store stuff on their servers they charge a fee for it.

    Darkk

    1. Re:Crashplan for backups and recovery by Anomalyst · · Score: 1

      Crappy flash based home page, if they make such moronic decisions in marketing, I shudder to think how they butcher their development technical decisions.

      --
      There is no right to feel safe thru security vaudeville at the expense of everyone's freedom, privacy and tax money.
  83. Re:Mozy? Duplicity? by mydn · · Score: 1

    S. Bumfuckville, NH? That must be near Jaffrey.

  84. Consider Manent by gsasha · · Score: 1

    http://trac.manent-backup.com/ Easy: yes, after a first setup. Reliable - yes. Versioned: you bet! Actually, every backup you do is accessible as a different version, with a very little overhead.

  85. Funnily enough, I've been thinking "family server" by Colin+Smith · · Score: 1

    A low power "set top" linux box; 1 lan, 1 wifi, 4 USB, no internal storage; plug in external usb drives. Tv out, usb audio & video in.

    Serving; Email, web, files, printers, p2p, music & videos, video calling.
    easy VPN between trusted boxes
    easy sharing files (rsync over vpn)
    easy sharing calendar & addressbook(with outlook, thunderbird integration).

    The key is an easy and secure way to set up trusted vpns between multiple set top home servers to form friend & family networks. Perhaps email an URL "invite".

    --
    Deleted
  86. Re:Easier than ever. by Anonymous Coward · · Score: 0

    STFU, Twitter.

  87. Box Backup by Anonymous Coward · · Score: 0

    www.boxbackup.org

    Encrypted online backup to a server you control.

  88. CrashPlan. by Jay+L · · Score: 1

    (Repurposing a post I made to the VMWare Fusion forum...)

    I've found CrashPlan ($25/seat) to do a pretty good job of cross-platform, Time-Machine-like, peer-to-peer backup between Mac, Windows, and Linux servers - with the added advantage of off-site backups (for a fee, from them, or for free, from your own machines and your friends - who don't have to buy CrashPlan, either).

    On the Mac, like Time Machine, it appears to use the FSEvents system to back up only the changed files. (On Linux, it uses inotify, but that has some bugs; on Windows, I think it may use Shadow Volume Copy or something like that.) It stores only the portions of the files that have changed, in an xdelta-like format, so it's highly compressed and deduplicated (kinda like git). I sprang for the Pro version, which at $60 can keep any number of previous versions for any number of days, so you've really got point-in-time restore as far back as you want it. Best: It regularly checks the integrity of the backups. Anyone who's tried to do tape backups has discovered the joy of a corrupted backup file.

    Downsides: It's a CPU hog, even on Apple's 64-bit Java 6 VM. You can set it to limit its own CPU when you're at the keyboard, but obviously, that slows down your backups, and it doesn't seem entirely accurate; on an 8-core Mac Pro, I've seen it use up 100% of a core even when it was theoretically limited to less than that. There's an upgrade coming in the next few weeks that's supposed to offer 400% faster backups with 30% CPU, so that may get better.

    Also, the UI for restoring is a bit clunky, and forces you to go date-first, rather than tree-first; if you know you need an older version of a file, but don't know what the last "known good" version was, you're in for a lot of mousing. It has had a number of bugs (fewer lately) that cause it to lose track of which files have actually changed. This doesn't cause any problems, since your backup peer will store only the changed bytes (=0 bytes), but it does make the CPU problems worse, and waste a lot of disk and network bandwidth.

    There are free automatic updates every few months, but they're forced and unannounced, which gives me the willies a bit. (I don't know if the enterprise version has more control over that. BTW, the enterprise version is named "Pro Server", not to be confused with the home "Pro" version I bought, which of course has a server component as well..) Also, although your backups are encrypted, the logs (which are apparently either sent to, or retrievable by, their support team) have your filename and pathnames in them, which is a pretty big privacy leak that I've alerted them to.

    That said, having once done a complete tour of EVERY Windows backup solution, from free to $10K, and finding them all pathetically lacking and buggy, and nonetheless having bought my own DDS-4 drive and, later, VXA-2 10-tape carousel, and nonetheless still having had to send drives off to OnTrack three or four times... CrashPlan is the best damn backup I've seen, and the only one that's been hands-free enough to use and rely on, and for under $100 it's crazy.

  89. BackupPC by jimbo-nally · · Score: 1

    Even though I've never used it, I can say that from years of reading the K12LTSP listserve, BackupPC has always been mentioned as a good way to backup multiple machines. It supports Windows and Linux and you can browse the archives via a web browser.

  90. Sounds like Dropbox to me. by fremean · · Score: 1

    http://www.getdropbox.com/ - Store, Sync and share your files - versioning is standard. Piss easy to set up on Windows, Linux and Mac.

    Not much more to say :)

  91. audiojunky by audiojunky · · Score: 1

    have a look at www.datacastlecorp.com as their backup service works well for friends and family.

  92. SuperFlexible and Amazon S3 by Anonymous Coward · · Score: 0

    (Assuming windows clients)

    I use "Super Flexible file Synchronizer" (www.suplerflexible.com) and schedule it for nightly uploads to my S3 account. It runs as a service and has more options then you can shack a stick at, it really is super flexible.

    I run it on my server to back up about 600mb of databases and websites every night. I keep two weeks of copies on S3 and my monthly bill from Amazon is about $1.25. That is not a typo, it's a little over a dollar a month for no fuss backups.

  93. Box.Net by Soham · · Score: 1

    Box.Net (they have 2 million users) has everything that you listed above, except desktop clients, which they claim are under development.

  94. Re:online bacon backup by Anonymous Coward · · Score: 0

    what's wrong with getting an account with Connected/Iron Mountain - easy to use intelligent online storage that doesn't cost a lot - saved my bacon many a time

    You've got online bacon storage? Wow! The wonders of the internet never cease.

  95. is there an open source project that does this... by RandySC · · Score: 1

    http://sonicwall.com/us/products/2057.html

    CDP detects new or changed files, even when files are open. When this information is found, CDP immediately and automatically replicates it to dedicated hardware locally. Unlike most traditional backup products, no user intervention or additional software or hardware is required with CDP.

    It also keeps track of 15 versions of a file, so you can restore any or all of them.

    --
    Organization: alphabetical, sometimes numerical or messy
  96. Covered Extensively on Previous Thread by CranberryKing · · Score: 1

    Right here. Good stuff.

  97. JungleDIsk + Amazon S3 by forq · · Score: 1

    I use JungleDisk and the Amazon S3 service. This solution does everything you're asking for. It doesn't cost much, and you take on the S3 costs.

  98. Rsync by ResidentSourcerer · · Score: 1

    Rsync can be versioned just fine. You tell it to rename the old version of foo.jpg to foo.jpg.back. After the backup runs, another script runs on the server that pulls the mtime of foo.jpg.back and uses that to rename it to foo.timestamp.jpg.

    I implemented this in my last job where home directories were served by samba. The clients were winsooze. I had a large list of directories from the users' profile directory that were not backed up, because they changed too often, and few users would be aware of their existence. (browser caches, doc temp files...)

    Anyway the backup folder for a user would typically be 2-3 times the size of their primary directory over the course of a year.

    This size was maintained by another script that pruned the backups. The weekly prune removed all but the last version of the previous week. The monthly prune removed all but the last version of the previous month. The quarterly prune removed all but the last version of the previous quarter.
    So at any given time, you usually had dailies for this week and last week, weeklies for the current and previous month. Monthlies for the current and previous quarter.

    Implementation:
    Assuming you have shell access on a unix server somewhere that you have space, then you need to establish a communication channel between the client and the server.

    You need to put ssh and rsync on the client, and either run rsyncd as a service on the client, and pull from the server, or some form of cron on the client and push from the client.

    If you have samba running on the server, you have a browse mechanism for users to restore files. Store the server's location as one of the user's "Network Places"

    Of course all of this is run either through some form of VPN or each channel runs through ssh.

    Caveats: As with many backup systems, this doesn't handle changes in the file system in a fully robust manner. There will be errors if the user moves or renames a file between the script starting and the file being backed up. Since this system keeps multiple versions, and since backups are handled at the file level, the occasional single bad file or missed directory is not the end of the world -- it just means that a less recent version is used.

    It also doesn't handle huge files such as large MS Access file or Photoshop files with complete grace due to their size. However someone who uses these programs really needs an in house backup system.

    In use the most common request from my users was for a file from 2-3 days ago. Usually what had happened was that they logged out without saving, and large files would not be saved completely. Most of the time this happened with powerpoint files.

    --
    Third Career: Tree Farmer Second Career: Computer Geek First Career: Teacher, Outdoor Instructor, Photographer.
  99. How about a P2P, mirrored, encrypted, virtual disk by rajid · · Score: 1

    Take a P2P system which splits up files and distributes the contents across the net already. Put an encryption front-end onto it so that any files written into it are encrypted. Then put a front-end onto all of that which allows you to "mount" the whole thing as a virtual drive. Anyone who wants to have "X"Gb of data storage on the system needs only buy an "X"Gb drive and make it available to the system. In exchange for making "X" Gb available, you would have your files written (in encrypted form) to "the net" and automatically spread out across the Internet. If you want those files mirrored, then you can make "2X"Gb available in exchange for "1X"Gb of space, mirrored. Just doesn't seem as though it would be that hard to do given all of the P2P software out there which already does a lot of this. I just, unfortunately, do not have the time to do it. Sigh.

  100. Tahoe by chamalulu · · Score: 1

    Saw three suggestions about Tahoe so far, low scores though. So i thaught I'd just add my own low score suggestion about Tahoe (http://www.allmydata.com/).

  101. Re:Mozy? Duplicity? by Anonymous Coward · · Score: 0

    CrashPlan -
    It's entirely cross platform - Windows, Mac, Linux and Solaris.

    It uses far less bandwidth than rsync, doesn't slow your systems down, backs up differentially in real time, and lets you go off-site to multiple destinations.

    It has faster backup & restore than Mozy or Carbonite.

    If you use their online service - you can put as many computer as you want under a single service for $0.10/GB/month which is cheaper than S3.

    Oh, and it has unlimited versioning and it's 64 bit clean meaning no filesize limits.

  102. Spider Oak by WhiteDragon · · Score: 1

    I've heard good things about Spider Oak (first 2GB free) so you might check them out. (disclosure, some of the people who work there are friends of mine)

    --
    Did you mount a military-grade, variable-focus MASER on an unlicensed artificial intelligence?