Slashdot Mirror


What Software Do You Use for Unix Backups?

jregel asks: "Linus has stated that dump should not be considered a reliable backup program, and both tar and cpio have their limitations. So what are Slashdot readers doing for backing up Linux servers and workstations? (you do backup, right?)" Given this bit of news, have you used anything other than the standard Unix staple to back up your Linux boxes? If you were forced off of tar, cpio and dump, what would you use as a replacement?

17 of 212 comments (clear)

  1. Easy. by torpor · · Score: 4, Funny

    If you were forced off of tar, cpio and dump, what would you use as a replacement?

    I'd use dd of course...

    --
    ; -- the corruption of government starts with its secrets. a truly free people keep no secrets. --
  2. dump on solaris... by Polo · · Score: 4, Informative

    You know, I was thinking about the same thing since I had problems with a recent restore from a compressed dump archive. I was missing some files probably because I ran the dump from an active file system.

    I found out that solaris has a very interesting command: fssnap

    It creates a read-only snapshot of your filesystem intended for backup operations.

    You create a snapshot, dump the snapshot, then delete the snapshot and the dump is consistent.

    I wonder if there's something like this for linux...

    1. Re:dump on solaris... by AlexA · · Score: 3, Informative

      Yes there is. It's called LVM. I've used its snapshot capabilities before on my Linux server, it's very nice.

  3. Roll Your Own by JimR · · Score: 3, Interesting

    I wrote my own (Perl) script, that copies all my "important" files (basically stuff in my home directory that can't be reconstructed by other means and all the system config files) to a new directory tree (using cpio) it then burns the copied tree to CD-RW and verifies the CD against the copied tree.

    I operate a 4 disc system, so I always have the last four backups on CD and I keep the copied trees around (uncompressed) for as long as I have disk space. So far I've not needed the CDs (I store 2 of them offsite in case of disaster) but the copied filesystem trees have come in useful a couple of times.

    The only drawback of this is it's not appropriate for backing up huge quantites of data (like lots of audio or video files) as the CD media is quite limited in size - but when rewritable holographic storage comes along I'll be able to just change my function that decides which files are "important".

    --
    #exclude <ms/windows.h>
  4. BackupEDGE vs. Taper by mindslip · · Score: 3, Informative

    I think the 2 above are both excellent, Taper for the less demanding environment, BUpEdge for a system with multiple drives.

    I'm actually doing a 100gb backup as we speak... so good timing on the Ask Slashdot.

    My only beef with Taper (and I'd use it otherwise, on my home system) is that when you do an "e"xclude or "i"nclude of a directory, it scans the entire subtree, which can take *forever*, (like when excluding /var/squid) instead of just simply skipping that directory.

    mindslip

  5. Why not fix dump and/or Linux? by Dahan · · Score: 3, Insightful
    Dump has been the standard Unix backup program for decades... I don't use Linux, but if I did, I'd consider it a bug that dump didn't work properly.

    Seems to me that Linus (or another kernel hacker) should fix the ext2 race condition reported in that thread, rather than blithely dismiss the problem with, "dump was a stupid program in the first place."

  6. Re:rsync by Colitis · · Score: 3, Informative

    I use rsync over ssh too; I back it up to a machine at work (which I can reach from home). It basically does my whole home directory except for a few excludes for stuff that's a bit sensitive (ssh keys, keychain, ICQ history) which I manually backup to CD now and then. The machine at work is then backed up with TSM.

    The rsync over ssh style of backup is so easy it's addictive!

  7. Amanda! by nathanh · · Score: 5, Informative

    I have been extremely happy with Amanda. Single centralised backup server running amanda-server. Multiple workstations running the amanda-client. Amanda automagically schedules backups based on sensible heuristics. I just tell Amanda how many tapes I have, how many workstations I have, and Amanda does all the hard work of working out how much tape capacity is required and how often it should schedule incrementals/fulls.

    The server/client protocol has been designed to avoid reliance on dangerous security holes like rsh. The server sends the client a "send me your dump" message. The client then connects back to the server and delivers it the output from dump or tar. You can configure exclusion lists on the client if you're worried about sending certain files or filesystems. You can also encrypt the data stream and/or use Kerberos for authentication.

    If I forget to load a blank tape then Amanda plays it safe. It doesn't overwrite last night's backup: instead it stores incrementals into the "holding disk". Amanda will then flush the held backups to the next blank tape.

    Amanda emails me reports after every backup with a neat summary of what went right/wrong. It also gives you several hours advance warning if you forget to load a blank tape or if any of the workstations are offline.

    The only downside of Amanda is that it is fiddly to setup. The documentation is poor and the configuration files are cryptic. But if you're willing to invest some time and effort then you can't do much better (for free) than Amanda.

  8. afbackup by Vairon · · Score: 4, Informative
    Website URL: http://sourceforge.net/projects/afbackup/
    Features:
    • Server & Client programs
    • Supports multiple clients streaming backups at the same time
    • Webmin module for easy configuration
    • Support for many tape drives and autoloaders
    • SSL and DES encryption support
    • Remote or local start of backups
    • Compatible with most *NIX systems (personally used it with Linux, Solaris & FreeBSD)
    • Non-root users can restore their own files
    • Unlike AMANDA:afbackup can actually append to tapes

    For those who don't know: AMANDA cannot append to tapes.
    Every time you backup with AMANDA it must start from the beginning of the tape.
    So, if you want backups every day, you must have a tape for every day.
    (http://amanda.sourceforge.net/fom-serve/cache/29. html
    1. Re:afbackup by martin · · Score: 4, Informative

      amanda doesn't append to tapes so there is not possibility of blowing away that tape. This is a problem I've experienced with other commercial software that appends to a tape each run - tape write error and it marks the entire tape bad. which means you have to scrap the entire entire tape and start again.

      Also tisk of appending is loss of tape or drive due to environmental factors - fire/flood (plane being driven into data centre).

  9. I cannot be forced off tar by mnmn · · Score: 4, Interesting


    They say tar has its limitations. I really dont understand.

    Ive worked with different unixen and Linux distros, so I just dont want to be dependant on something that isnt installed by default everywhere. tar already has a VERY well known format and execution parameters.

    Ive lost my fair share of data to buggy harddrives and dumb mistakes like pulling off the ide cable while the system is running. So cron does daily backups using tar cfj using a file that has a list of other files to be backed up. This way I dont have to backup the whole partition. To restore a certain file, just tar xvfj backup2.tar.bz2 /pathtofile --root=/

    The cron setup renames backup.bz2 to backup2.bz2 and removes backup2.bz2 so I have the data for the past two days. Beside incremental backup which I dont need due to this setup, what else could I need? And by the way the backup.bz2 is copied off onto an NFS share elsewhere incase my whole RAID setup crashes, or the XFS filesystem bombs out. This setup can be replicated onto FreeBSD Solaris and many others.

    --
    "Give orange me give eat orange me eat orange give me eat orange give me you." -Nim Chimpsky
    1. Re:I cannot be forced off tar by mcelrath · · Score: 3, Insightful
      The horrible problem with linux right now though is that because the memory management is so braindead, that backup will swap out everything in memory in favor of caching your multi-gigabyte backup file. Thus your method brings the machine to a standstill while the backup is occuring (which can take hours to days depending on the size of your filesystem).

      Not a criticism of your method (in fact, I use this), just a rant that the Linux MM system NEEDS TO BE FIXED. I'm sick of watching as some trivial process that will only read or write once gets the whole filesystem cached for it while programs I'm using interactively get swapped to disk. Video recording and playing programs (mplayer, ogle) have the same problem.

      Let's hope 2.6 is better than 2.4. Can any kernel hackers comment on this? In 2.5 will tar cvjf /home /mnt/backup/home.tar.bz2 bring my system to its knees?

      -- Bob

      --
      1^2=1; (-1)^2=1; 1^2=(-1)^2; 1=-1; 1=0.
  10. Try star by J�rg Schilling by Corporate+Gadfly · · Score: 4, Informative

    Some people have already mentioned Amanda.

    In addition to amanda, I have good luck with star coded by Jörg Schilling. star is very feature-rich, fast, standards compliant and has been around since 1985. Give it a try!

    The star-users mailing list is here . You can also look at the man page and finally download it

    --
    Corporate Gadfly
    Jonathan Archer: the most beaten up Enterprise captain in Star Trek history
  11. BackupPC by dissy · · Score: 3, Informative

    http://backuppc.sourceforge.net/

    Automated backups to an online disk server, open source, and a really nice web interface as well as command line interface.

    It uses samba and ssh to backup and restore to windows and unix machines.
    You can have it restore any files/folders in a backup you select, using the same methods (samba or ssh) as well as it can send the restore files to your browser in a tar or zip file.

    I recently replaced a machine using amanda and a DLT drive with a fileserver using a raid 5 array and backuppc. Best switch ever.

  12. TSM by duffbeer703 · · Score: 3, Interesting

    Tivoli Storage Manager is the only "backup solution" that I have ever seen that truly works well without alot of tweaking and twiddling.

    I've worked at places using Legato and Amanda, where restoring from backup was an unreliable and error-prone process more likely to be a waste of time than anything else.

    TSM is not cheap, but is worth every penny. We have one full time and one part time employee handle the backup/restore jobs for about 2000 servers. Try that with Legato or Amanda.

    --
    Conformity is the jailer of freedom and enemy of growth. -JFK
  13. Re:tar does not do incremental backups by dissy · · Score: 3, Informative

    > The problem is tar always archives the entire space which makes it difficult to
    > backup, say gigabytes of data, daily.
    >
    > A decent backup tool (as opposed to an archival tool) must absolutely have
    > incremental backup support.

    Er?

    tar --help
    [snip]
    Operation modifiers:
    -G, --incremental handle old GNU-format incremental backup
    -g, --listed-incremental handle new GNU-format incremental backup
    [snip]
    Local file selection:
    -N, --newer=DATE only store files newer than DATE
    --newer-mtime compare date and time when data changed only
    [snip]

    This is in tar (GNU tar) 1.12
    (Which is really really old actually.. slackware 3.2 dist)

    There are also tons of options to exclude directorys and files, to force it to span disks, and pretty much match in any way you need.
    I've been making incremental backups (and even restored a few) for awhile now.

  14. Re:Use dump and lose data by coyote-san · · Score: 3, Informative

    Have you even read Linus's comments?

    Dump works by reading the raw data partition. That works great with an unmounted partition, or if you have a very limited OS that does not perform any caching.

    But Linux is different - it's now using the cached pages as the primary content, usually flushing them to disk only as the pages are dropped. This is the approach used by most mature OSes, but Linux doesn't yet have an interface for "dump" programs to query the OS for updated but unwritten sectors.

    So dump is the worst of all possible things now. Not only will you get incomplete live files, you can get incomplete files even if the users have all terminated but the pages haven't been flushed to disk yet. That's non-deterministic, and there's simply no way for you to perform reliable dumps.

    On the practical side, dump is specific to the filesystem. When everyone ran ext2, that wasn't a problem. But now people may have a mixture of ext2, ext3, reiserfs, xfs, jfs, and probably even other formats. Each requires their own dump and restore, and that requires a lot more effort.

    --
    For every complex problem there is an answer that is clear, simple, and wrong. -- H L Mencken