What Software Do You Use for Unix Backups?
jregel asks: "Linus has stated that
dump should not be considered a reliable backup program, and both tar and cpio have their limitations. So what are Slashdot readers doing for backing up Linux servers and workstations? (you do backup, right?)" Given this bit of news, have you used anything other than the standard Unix staple to back up your Linux boxes? If you were forced off of tar, cpio and dump, what would you use as a replacement?
If you were forced off of tar, cpio and dump, what would you use as a replacement?
I'd use dd of course...
; -- the corruption of government starts with its secrets. a truly free people keep no secrets. --
9 out of 10 network admins smack their tape monkeys when they forget about modprobe zftape after reboots.
Do none of the decent filesystems have a snapshotting facility? That's the best solution to coherent backups...
You know, I was thinking about the same thing since I had problems with a recent restore from a compressed dump archive. I was missing some files probably because I ran the dump from an active file system.
I found out that solaris has a very interesting command: fssnap
It creates a read-only snapshot of your filesystem intended for backup operations.
You create a snapshot, dump the snapshot, then delete the snapshot and the dump is consistent.
I wonder if there's something like this for linux...
I wrote my own (Perl) script, that copies all my "important" files (basically stuff in my home directory that can't be reconstructed by other means and all the system config files) to a new directory tree (using cpio) it then burns the copied tree to CD-RW and verifies the CD against the copied tree.
I operate a 4 disc system, so I always have the last four backups on CD and I keep the copied trees around (uncompressed) for as long as I have disk space. So far I've not needed the CDs (I store 2 of them offsite in case of disaster) but the copied filesystem trees have come in useful a couple of times.
The only drawback of this is it's not appropriate for backing up huge quantites of data (like lots of audio or video files) as the CD media is quite limited in size - but when rewritable holographic storage comes along I'll be able to just change my function that decides which files are "important".
#exclude <ms/windows.h>
Amanda comes up a lot. They can't span tapes.
Veritas also comes up a lot. Aside from cost, did you know Veritas can't back up single files larger than 2GB in size on Linux clients?
On paper, BRU looks pretty darned good. I haven't yet put that theory into practice.
I think the 2 above are both excellent, Taper for the less demanding environment, BUpEdge for a system with multiple drives.
/var/squid) instead of just simply skipping that directory.
I'm actually doing a 100gb backup as we speak... so good timing on the Ask Slashdot.
My only beef with Taper (and I'd use it otherwise, on my home system) is that when you do an "e"xclude or "i"nclude of a directory, it scans the entire subtree, which can take *forever*, (like when excluding
mindslip
This Slashdot user('s IT) is using HP Omniback and was happy to do it more than once.
I plan to use Linux' Software RAID to mirror all my data to one or two harddisk in an exchangeable frame. A RAID restore took only two thirds of the time for a file-by-file copy on my old machine. It is already using two disks in a Software RAID mirror. So the next step is to threw in another disk, and make linux mirror the data to the third drive. Should run nicely in background.
Sure, this is a home solution, not a professional solution, but it should be simpler than fiddling with tapes that don't have enough capacity for all my data.
Denken hilft.
we use rsync for backup over ssh onto another machine at a remote location. This works really well, especially if you do a cp with hardlinks each night. rsync will download just the changes since yesterday, and files that are the same just end up being hard links to the same data. This also makes restores reasonably trivial. (just ssh onto the backup machine, cd into the directory for the date you want, cd into the directory and grab the file -- done.)
Seems to me that Linus (or another kernel hacker) should fix the ext2 race condition reported in that thread, rather than blithely dismiss the problem with, "dump was a stupid program in the first place."
DAR's what I use:
only 25 blank CD-R's to backup my home dir
*cough*
( yeah, OK, so I should stop trying different distros and settle on one, or two, .. alright, three, then, and I'd be able to wipe all that stuff I keep for installing-from-disk, but .. Oh, Later. )
Messages to/for me ( in me journal )
This backup machine keeps seven generations of daily backups on one disk (cp -al, so no duplicating of static data), and a few weekly ones on the other disk. Every night it rsyncs things off-site (to my home). That rsync has turned out to be unreliable (probably my adsl), so I have a script that does it in small bits and pieces. Takes a few hours in the early morning.
In Murphy We Turst
I use "mkisofs /etc /root /home -R -T -o backup.iso && cdrecord dev=0,0,0 speed=4 blank=fast -data backup.iso" to create an ISO image, which will be burned to the CDRW disk. That's all I need to backup my workstation. And restoring the data doesn't require any special tools.
www.amanda.org
nice - can use tar or dump as the back end system. Works on *nix/ MaxOSX/windows via samba or cygwin.
cdbkup is a little more sophisticated - multiple levels, multiple disks.
"CDBKUP is a professional-grade open-source package for backing up filesystems onto CD-Rs or CD-RWs."
Note to ACs: I won't mod you up, even if you are being funny or insightful. So take a chance! It's not real life!
Just use your favorite volume managment system to create a snapshot of whatever volume you would like to backup and use dump, tar or whatever to write that snapshot to tape. This way you'll have a consistent backup of the filesystem.
I'm using LVM as a wrapper around Amanda to create and remove snapshots of every filesystem I'd like to backup.
I have been extremely happy with Amanda. Single centralised backup server running amanda-server. Multiple workstations running the amanda-client. Amanda automagically schedules backups based on sensible heuristics. I just tell Amanda how many tapes I have, how many workstations I have, and Amanda does all the hard work of working out how much tape capacity is required and how often it should schedule incrementals/fulls.
The server/client protocol has been designed to avoid reliance on dangerous security holes like rsh. The server sends the client a "send me your dump" message. The client then connects back to the server and delivers it the output from dump or tar. You can configure exclusion lists on the client if you're worried about sending certain files or filesystems. You can also encrypt the data stream and/or use Kerberos for authentication.
If I forget to load a blank tape then Amanda plays it safe. It doesn't overwrite last night's backup: instead it stores incrementals into the "holding disk". Amanda will then flush the held backups to the next blank tape.
Amanda emails me reports after every backup with a neat summary of what went right/wrong. It also gives you several hours advance warning if you forget to load a blank tape or if any of the workstations are offline.
The only downside of Amanda is that it is fiddly to setup. The documentation is poor and the configuration files are cryptic. But if you're willing to invest some time and effort then you can't do much better (for free) than Amanda.
To dump dump, or not to dump dump. That is the question.
(Spudley Strikes Again!)
If you were forced off of tar, cpio and dump, what would you use as a replacement?
I'd cvs of course.
From time to time i burn CD's using follow-symbolic-links from a dir with symbolic links to all important files and directories.
The rest is the default distro installation anyways.
Features:
For those who don't know: AMANDA cannot append to tapes.
Every time you backup with AMANDA it must start from the beginning of the tape.
So, if you want backups every day, you must have a tape for every day.
(http://amanda.sourceforge.net/fom-serve/cache/29
Arkeia is a powerful one, but not free software. there are two versions, a free one for small offices and a more powerful costly one. ...quick browse of the site does not reveal the free version, i don't think it exists anymore for 5.x (maybe i am not recalling correctly).
anyway, arkeia can back up windows, linux, unix, and mac osx.
Use my userscript to add story images to Slashdot. There's no going back.
...and your RAID backup seems kind of comical.
.sig that's floating around...
Kinda like that "Offsite backup? Yeah, it's in the other tower!"
It's not free, but that's probably why it's only been mentioned once so far.
BRU
I don't use tapes because I hate them. Granted in order not to use them, you need lots of spare hard drives, which I do.
First, I use mysqlhotcopy to get all sql data. Then, my backup server uses samba to tar and gzip up all data from various servers, Windows and Linux, into one place. Then it uses scp to send it all across the WAN to another backup server which keeps a business week rotation, and one month rotation. The other site does the same, and so far no problems at all.
This way you don't have to *hope* that your tapes aren't corrupt and jump through ugly hoops to retrieve your data. I would *not* recommend bzip, however; too damn slow and processor intensive, even with a dedicated backup server. Simply not scalable.
It's all going according to
Veritas Bare Metal Restore 4.5
Works for windows and Unix(AIX, Solaris, HP-Ux) but I don't see support for Linux but I would guess that if you can get it to work on the above there is a tweak to get it working.
Just anouther option, I know it is not going to be the flavor of the month because it is not free or OSS.
Enjoy,
Neck_of_the_Woods
#/usr/local/surf/glassy/overhead
Jason
"FORMAT C:" - Kills bugs dead!
On topic, but adrift at sea a bit:
what tools will restore a backup done with Windows 2000/XP under Linux?
Under win95/win98, you can smbtar the entire remote drive into a compressed tarball. To restore, fdisk a new drive, format it, and tar -xjpf tarball.tar.bz2, and possibly sys C: it once it's back in the windows machine. Windows takes care of anything else that needs to be done.
Under Win2000/XP, obviously this won't work, so you need to use Windows's backup or other tools. But if you want to restore a file that a user deleted, you need to find a windows machine(difficult if you're not onsite), and restore the file using windows backup.
Under the Win95/98 method, just untar the backup, and copy it back to the client's pc while they continue to work on other things.
They say tar has its limitations. I really dont understand.
Ive worked with different unixen and Linux distros, so I just dont want to be dependant on something that isnt installed by default everywhere. tar already has a VERY well known format and execution parameters.
Ive lost my fair share of data to buggy harddrives and dumb mistakes like pulling off the ide cable while the system is running. So cron does daily backups using tar cfj using a file that has a list of other files to be backed up. This way I dont have to backup the whole partition. To restore a certain file, just tar xvfj backup2.tar.bz2
The cron setup renames backup.bz2 to backup2.bz2 and removes backup2.bz2 so I have the data for the past two days. Beside incremental backup which I dont need due to this setup, what else could I need? And by the way the backup.bz2 is copied off onto an NFS share elsewhere incase my whole RAID setup crashes, or the XFS filesystem bombs out. This setup can be replicated onto FreeBSD Solaris and many others.
"Give orange me give eat orange me eat orange give me eat orange give me you." -Nim Chimpsky
I use mkisofs and cdrecord to copy all my data to a CD-RW. I have two of those; the oldest data gets overwritten. At work I use cp to copy all data to an extra harddisk that is removed after the copy is made. And I keep my backup for home at work and vice-versa. Very important in case your building burns down.
-- Cheers!
Just because it's broken on Linux doesn't mean that:
* it's not better on other platforms
* the other tools aren't worse
Elizabeth Zwicky's classic Torture-testing Backup and Archive Programs will give a whole list of reasons why you should be suspicious of tar or cpio.
Heck, the FreeBSD Handbook answers the question "Which backup program is best?" by saying "dump(8). Period."
"The purpose of argument is to change the nature of truth." -- Bene Gesserit Precept
Small, important or irretrievable stuff gets mkisofs [even -J!] to CD-R.
I use Lonetar locally to tape and rsync/ssh over DSL to my house, works great, I have not had to go to a tape for months to recover a file, just ssh into my home server and copy to the office network. I have saved many an ass this way. But I leave the tape running just to have a couple layers of bckup. We are also adding a new office in another city, I will rsync/ssh to that location too.
Some people have already mentioned Amanda.
In addition to amanda, I have good luck with star coded by Jörg Schilling. star is very feature-rich, fast, standards compliant and has been around since 1985. Give it a try!
The star-users mailing list is here . You can also look at the man page and finally download it
Corporate Gadfly
Jonathan Archer: the most beaten up Enterprise captain in Star Trek history
For backing up my FreeBSD home server I use a second (identical) HDD in a swappable IDE bracket on a standard plain ole onboard IDE controller (the 2nd channel to be precise). Though hotswapping isn't really supported on these controllers, it does seem to work :)
/home, just everything) to the hotswap HDD, using the "ssync" tool /var/log/backup
Making a backup is easy. I just plug in the bracket and start a homebrew script which:
- enables and inits the hotswap IDE channel
- mounts the partitions on the hotswap HDD
- removes system immutable flags on files on the hotswap HDD (so that they can be overwritten)
- copies over all new files (/sbin,
- resets the system immutable flags to their original state
- umounts the partitions
- disables the IDE channel
- logs the above to
- mails the log to me
A whole backup takes about 25 minutes on an almost full 30GB disk in a P200 (it only copies the new/changed files), and compared to tape it's very cheap. If the master drive fails, I swap HDDs and the whole server works again, without any configuration whatsoever.
dump isn't broken; Linux is broken.
The question of how to perform backups on linux remains perfectly valid, of course, but recognizing where the problem lies makes other solutions more obvious.
http://backuppc.sourceforge.net/
Automated backups to an online disk server, open source, and a really nice web interface as well as command line interface.
It uses samba and ssh to backup and restore to windows and unix machines.
You can have it restore any files/folders in a backup you select, using the same methods (samba or ssh) as well as it can send the restore files to your browser in a tar or zip file.
I recently replaced a machine using amanda and a DLT drive with a fileserver using a raid 5 array and backuppc. Best switch ever.
MakeItSo has a very small footprint (daemon is about 1.4 meg) and it spikes load about .5 while running - very tollerable (some of the commercial packages were spiking load over 4.0!! we use SUN servers). The nice thing is that because we wrote it ourselves (all C) we can tweak it and do just about anything we wish to improve it's performance.
We use LoneTar at a couple of different clients. Not much to dislike except slow file restore seeks on tape but apparently this has been fixed within the last year.
Tivoli Storage Manager is the only "backup solution" that I have ever seen that truly works well without alot of tweaking and twiddling.
I've worked at places using Legato and Amanda, where restoring from backup was an unreliable and error-prone process more likely to be a waste of time than anything else.
TSM is not cheap, but is worth every penny. We have one full time and one part time employee handle the backup/restore jobs for about 2000 servers. Try that with Legato or Amanda.
Conformity is the jailer of freedom and enemy of growth. -JFK
We use tapeware here, simply because it's what came with the tape drive and it supports network backups from windows machines to the backup server running on linux. It has a curses, X11 and windows interfaces that are all quite similar (X11 and win32 interfaces being identical). I was wondering if anyone else uses this software and what your comments are on it.
Companies with money can get a netapp box for critical data. Here you can absolutly use dump, tar or cpio. They create a "snapshot" of a file before backing it up.
Unfortunately we are talking a minimum of $40k for this type of solution.
If the snapshot comcept could be written into ext4, then dumps would be great.
I always put a caveat into my backup policies to cover this issue.
Anybody out there know if Bru does any better?
Veritas have a work arund to this?
Amanda is just a wrapper around dump.
Although S tar is a nice utility, as mentioned by previous posters. You can get pax directly from the OpenBSD people. Debian also packages pax, if you run Linux.
assert(expired(knowledge));
At our dotCom company, we bought EMC boxes and I was REALLY excited about the TimeFinder concept. But then I found out that it doesn't really find time, it just makes backups.
:)
I had thought we had found the answer to getting a six-month project done in 3 months - use "TimeFinder" by EMC.
-Peter
For linux we create a SMB share with samba that the backup server has access to. All files are either tar-gzip'd or just copied over to the directory. Everything in the directory is backed-up.
1;
A decent backup tool (as opposed to an archival tool) must absolutely have incremental backup support.
...major OS that doesn't have some kind of filesystem-snapshot support.
And this, more than any marketspeak CTO buzzwords, is what prevents Linux's entry into really mainstream server usage. Every sysadmin senior enough to make the decision of "do we use Linux for " knows that backing up a live filesystem is simply stupid. Right now -- and for the foreseeable future, since nobody seems to be working on it -- you have to unmount a filesystem and then use a program that speaks raw devices. And there went your 24/7 customer uptime.
(The alternative solution is usually something like "run a mirrored RAID, then take one of the mirrors offline, back it up, and bring it back online." That only works when the RAID implementation isn't constantly being rewritten between every kernel release.)
You cannot apply a technological solution to a sociological problem. (Edwards' Law)
I don't really have all that much on my home computer that I couldn't afford to lose. I use IMAP for email, so that's stored in two places already. I use CVS for software I right, so that's backed up too. The very few commercial programs I've bought I have CDs for. Same thing with the mp3s I have (plus I do back those up to a separate hard drive every once in a while using handy dandy cp -R). Any passwords or receipts I've saved would be lost, but I could get the passwords back by filling out lost password requests, and hopefully the receipts wouldn't be necessary. I see no reason to back up my entire drive, because 99% of it is already backed up in one way or another.
subject says it all :-( ARCserve sucks!
Yes, Arkeia is commercial, but sometimes you get what you pay for. Arkeia has a decent UI (but you can do everything from the command prompt, too), and it can backup just about any flavor of UNIX (including OS X) and Windows. It works with any SCSI tape device, including tape libraries. It embraces the standard full/incremental backup paradigm, so its pretty straight-forward to migrate to Arkeia from dump, etc. It has features that expedite disaster recovery (full-box rebuild), and can do on-line backups of MySQL, Oracle, PostgreSQL, DB2, etc.
Note that Arkeia offers a free Linux version if you only need to backup the local machine and a couple of clients--perfect for the SOHO user.
And, no, I don't work for them--I just used the software for a couple of years when I was responsible for a tiny server farm. Arkeia always worked, unlike some of the tempermental WinNT boxes I had to support...
The problem with most suggestions here is that it seems the average slashdot reader is a linux hobbyist or works as the IT manager for a small office that happens to run linux. What happens when you need to backup 6TB/night and don't want to pay someone to sit around swapping tapes all night. Sometimes it just isn't practical to purchase another SAN solution to facilitate an rsync. Or what if you have a collection of high capacity LTO tape drives at your disposal, but don't have the budget for something larger and automated, or smaller with an autoloader. I think automation and efficiency is almost as important as reliability and cost. Not everyone can afford a Storagetek Powderhorn Silo, or needs the versatility of expensive products such as Veritas Netbackup. Then again, sometimes tar or rsync just don't cut it in an enterprise environment where data is mission critical.
This simply generates a file list that's piped to afio. I generate many small afio files, one per home folder for example. Then another Perl script takes those files and figures out how to distribute them so that the CDs are as full as possible (currently takes 2).
Finally, a bash script that runs from cron calls xmessage to make windows pop up requesting me to insert CD-RWs. It marks CDs, so that if I insert the wrong CD-RW I get a warning. Once this is set, it all happens automatically, I only need to insert the CD when it asks for it.
It doesn't do incremental backups, but I don't need them for now. If anybody is interested in this stuff I could try to make a few changes to make it more customizable and upload it somewhere.
For daily work, I use CVS. This is nice because I can access it at home or work or laptop, and backup is not needed if you don't care about revision history.
rsync over ssh
afio using bzip2 with 100k blocks. With tar -cz, if you have a single bad bit, the whole archive is bad. You're screwed. With the afio approach, you lose the file where the corruption occurred, and if you're lucky, you only lose the 100k block where the corruption occurred.
Rsync can also be used to make some very nice incremental "snapshot" backups.
I do rsync to a dedicated backup server at a different location. As a fallback, I backup portions regularly to CD-ROM.
I use rsync. my script then for offline backup I use BRU (*not* BRU pro but regular BRU). This backs the data up to my DAT.
That isn't reliable. ISO9660 directories can only have ~1024 entries, any more are dropped on the floor. There are also limitations on the length of a filename with Rock Ridge extensions, possibly 32 characters.
Then there's the other things that don't translate well. Do you deference symbolic links? What about fifos and special devices?
If you want to be safe, you need to either check the directory tree first or put everything into a container without these restrictions. I've been developing some tools for this, but keep flipping back and forth between compressed tar files and zipfiles. The former can be read with standard tools, but requires an explicit index (for performance when seeking single files) and doesn't scale to multiple discs. The latter has an index and supports multiple discs, but isn't widely used in the Unix world.
For every complex problem there is an answer that is clear, simple, and wrong. -- H L Mencken
Taper has a limitation in that the archive can not be greater than 4GB. If larger, it will appear to write OK, but it'll segfault when you go to try and read the archive.
Give me my freedom, and I'll take care of my own security, thank you.
I believe you are wrong. EVMS (which was built by IBM) and is distributed under the GPL license for free, provides software raid (0,1,5), filesystem snapshots, has both GUI and CLI tools for linux.
It's a simple patch you can add to any 2.4 kernel.
A lot of the problems backing up live systems are because of poor coding practices. (The other problem is people attempting to back up things that shouldn't be backed up at the filesystem level. A classic example of this is relational databases - they should usually be dumped and restored with their own tools.)
Specifically, how many programmers routinely get advisory write locks on files they plan to update? How many home-brewed or ad-hoc backup solutions bother to get advisory read locks?
I've written some backup utilities that do the open()/flock()/mmap() dance, and while it's mildly annoying to get a couple error messages per run, I greatly prefer having no file to having a corrupted one in the archive.
For every complex problem there is an answer that is clear, simple, and wrong. -- H L Mencken
Well, at work I use Arcserve to backup files from some Red Hat servers via the Unix agent. While I'm not jumping up and down and waving my hands, this solution works, it's easy to setup and it's not that expensive, actually.
Another note. Arcserve used to crash my NetWare servers time and again, but so far the Unix agent has been running without problems on Red Hat 7.3 - at least for me. :-)
zWhat would an EWOULDBLOCK block, if an EWOULDBLOCK could block would? -- me
Have you even read Linus's comments?
Dump works by reading the raw data partition. That works great with an unmounted partition, or if you have a very limited OS that does not perform any caching.
But Linux is different - it's now using the cached pages as the primary content, usually flushing them to disk only as the pages are dropped. This is the approach used by most mature OSes, but Linux doesn't yet have an interface for "dump" programs to query the OS for updated but unwritten sectors.
So dump is the worst of all possible things now. Not only will you get incomplete live files, you can get incomplete files even if the users have all terminated but the pages haven't been flushed to disk yet. That's non-deterministic, and there's simply no way for you to perform reliable dumps.
On the practical side, dump is specific to the filesystem. When everyone ran ext2, that wasn't a problem. But now people may have a mixture of ext2, ext3, reiserfs, xfs, jfs, and probably even other formats. Each requires their own dump and restore, and that requires a lot more effort.
For every complex problem there is an answer that is clear, simple, and wrong. -- H L Mencken
I have two identically sized drives (80GB). Every weekend, I take down the machine for about an hour and Ghost disk 1 to disk 2 (60GB used takes 50 minutes). Then I hide disk 2 in the BIOS (disable the controller). An EXACT backup of the OS. People ask why I don't just do RAID - well, I trust the hardware not to fail more than myself - if I f* it up, with RAID, yippee I have to copies of crap.
Another option I did in the past is to have a very basic host OS and a virtual machine with all the important stuff. You take down the virtual machine, copy the file that is its "hard drive" to somewhere else, and boot it back up. Now you can split that file across CD-R, move to another PC, etc. This solution is also very nice if you have catastrophic failure, you can recover much faster with new hardware and not worry about the right drivers, etc, because your important system always has the same peripherals - the VM ones. Kernel just boots right up.
- RR
I should put something clever here. Maybe someday.
Prayerware(TM)2.0 http://www.prayerware.com/
ADSM Rocks.
God, I love this backup system. I've used it in three different places for various cross-platform backups.
Novell, NT/2k, Unices, etc etc. It just works. Of course, to truly use it right, you need a huge tape library. Huge = 5000 tapes, at one bank I was at. We had one in Manhattan and one in Jersey and a 100mbps link between. We'd backup to Jersey at night for instant disaster recovery preparation, and copy back to NYC during the day. We were even setting up a third automated tape library in Brooklyn for added redundancy.
ADSM client configs were a pain until you understood the software. It ISN'T a backup program like you are thinking. It's really a snapshot storage system, with configurable numbers of file copies kept in buffer, we had it set to 5 unique copies of each file. You can restore snapshots based on file dates, which is a nice feature, though you'd need to extend it beyond 5 copies to really gain the benefit.
Loved the stuff. Wish I could get it back - using crappy Veritas here.
Strongly recommended, the parent poster hit it on the head.
HBI's Law: Frequency of calling others Nazis is directly correlated with the likelihood of the accuser being Communist.
Yes, rsync is the dog's.
On the Mac, I use RsyncX, which knows about resource forks, even when transferring them to systems which don't have them.
And on Windows, I use rsync again.
I've tried every damn sync program for the Mac. I've tried tar and dump on UNIX. I've tried fancy network backup tools. I've not found anything that compares with rsync.
I hate the complexity of the command-line syntax, but it has the required functionality:
1. Automatically incremental.
2. Works locally from disk to disk or across a network.
3. Works via secure VPN or SSH.
4. Works between any two platforms I happen to be using, so I can back up to wherever the spare disk space is.
5. Easily scriptable, easily scheduled.
6. More efficient at using network bandwidth than any other protocol I've found.
7. Doesn't fail over on systems with incorrect system clocks or bad timestamps.
8. Data ends up in original native format, not some format that needs a special program to read.
9. Partial restore is trivial.
10. Works great with large capacity but slow-to-write backup media like DVD-RAM.
11. It has never damaged my data.
The only downside I've noticed is that the rsync ports to Windows tend to be comparatively CPU-intensive for some reason. Turning off compression helps.
I also use rsync for maintaining my web site, sharing my iCal calendar, syncing my browser bookmarks...
GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
Standard tar does *not* support incremental backups. Look at page 127 of O'Reilly's _Unix_Backup_&_Recovery_. Are you claiming that Curtis Preston doesn't know what he's talking about? I guess O'Reilly's technical editors don't either. Also, the Posix standard says they don't. I guess you're also going to claim they're all wrong.
Just because you have some tar that someone jammed a cheap hack into, doesn't mean that tar supports incremental backup. Hey, the world doesn't revolve around you kid. Get over it.
Hey moderators, thanks yet again for making slashdot a little less useful to clueful people.
Before I start looking for new backup tools, I would look for the one responsible for removing my tools in the first place.
A possible(read: theoretical) form of backup would be to use the various online search engines as distributed backup mediums. Ie, convert your data into various web pages which are encoded. Since webcrawlers will crawl a site and attempt to store/cache te data(google, the wayback machine, etc), your data is, in theory, cached on those crawler databases.
The only problem with this idea is the fact that you can't really determine the time and place of the backup. Only hope that they cache it in a reasonable way. ;)
The corollary of that would be to encode it as DNS with LONG LONG expire times so that people can serve your domain and by extension your data.
Seriously though, the utilities mentioned in the original post represents functions which, after a fashion and some long hours tinkering, could be simulated so that you can perform your backups in a half-way reasonable way. (Read: custom code your own backup tool)
There is, of course, the good old tried and true backup a hard drive with another hard drive... but that get's a wee bit expensive and hard when you start talking terabytes of data...
Winged Power Photography
Why use taper or tar or whatever? Ninnle, as usual, comes with an excellent backup package: Ninnle Backup, of course!
I use Backup Express made by SyncSort. I've been pretty happy with it overall. It's designed with very large libraries in mind though, which can be semi-annoying if you don't have a very large library, like me. You have to define media before the backup begins, i.e. if you put in blank tape, the software won't automatically use the tape if the software needs a blank tape. You have to define each tape to be in a certain media pool. If you have a very large library, you'd define everything once and let it go at it. It's not a major inconvenience, but it is my one complaint.
It has most of the features of more expensive packages for quite a bit less money. They have good, responsive tech support, with technicians you can actually understand, unlike some other companies, *cough* CA. They also offer a demo period, a month or more, when you have an unrestricted copy of the software and full access to their technical support staff, so you can get a good idea if it will work for you.
For our environment, 15 Windows clients, 5 Solaris clients, 3 Linux clients, a 2 drive 14 slot library, and a 1 drive 10 slot library, including a year of support and upgrades for about $18,000.
This document describes a method for generating automatic rotating "snapshot"-style backups on a Unix-based system, with specific examples drawn from the author's GNU/Linux experience. Snapshot backups are a feature of some high-end industrial file servers; they create the illusion of multiple full (up to ownership/permission) backups per day without the space or processing overhead. All of the snapshots are read-only, and are accessible directly by users as special system directories. It is often possible to store several hours, days, and even weeks' worth of snapshots with slightly more than 2x storage. This method, while not as space-efficient as some of the proprietary technologies (which, using special copy-on-write filesystems, can operate on slightly more than 1x storage), makes use of only standard file utilities and the common rsync program, which is installed by default on most Linux distributions. Properly configured, the method can also protect against hard disk failure, root compromises, or even back up a network of heterogeneous desktops automatically.
I've always been taught that if you want to have a reliable backup of your OS do it in single user mode (if possible, although normally not really practicle). For real world backup one nice way I found of doing a *full* backup is the following, Some might say it is risky but it does works
:)
Have a backend storage system which has redundancy. Say two NetApps that replicate each other. When you want to do you backup of data take one of the machines out of the cluster so that there is no I/O. Back it up using your favourite software. (Amanda, Veritas, dd, cpio,tar...). Put it back in the cluster and let it sync up and there you go.
Now I would like a beowolf of those
Rus
Cheap UK and US VPS
What I did, since Amanda doesn't have very good robotic library support, was to set up our NT server with an old version of Backup Exec (it was available used, from another company that upgraded, and it was cheap). Then I added a surplus Exabyte 8mm library (an EXB210, if I recall correctly) that I got for about $50 at the local used computer place.
Next, I installed Samba on our NetBSD boxes, set up the shares and permissions, and viola! Centralized backup with minimal hassle and cost.
Granted, this was done with older software and surplus components. While the exact hardware, and specific backup software I used may not be suitable for "enterprise" environments, the same principle applies. Samba is a lot more useful than I think some folks give it credit for. Thank you, Andrew Tridgell!
Bruce Lane, KC7GR,
Blue Feather Technologies
We're currently using the free Arkeia version as we have all stuff mirrored on one machine (rsync).
I recently looked at Bacula (www.bacula.org), it looks promising I think, I'm curious why nobody mentioned it sofar.
...run RedHat and integrate into our enterprise backup solution with the Linux client agent for ARCServe.
Unfortunately it's not free, but it didn't change our corporate licence by hardly anything.
LOAD "SIG",8,1
LOADING...
READY.
RUN
I primarily use FTP (enter favorite ftp software here) to backup data to another system. Though I have not broken down and automated it through scripting/cron. If I need a permanent storage type of backup, then off to CD the items go.
Regards,
Ryan Pritchard
Fun Extends All Basic Life Expectancies
At my (large) company we are Veritas Netbackup which works well.
At home until recently I had been using one of those rsync/hard link backup systems with good results (links to that quoted here elsewhere).
I'd been looking for a somewhat simple solution that I could run on a low end linux box at my kids school to backup 1 linux box and 1 NT server. After a bunch searching I finally settled on flexbackup because it is fairly simple, and can use tar in incremental mode, emulating dump's levels. Since I'm just using tar and bzip2, restores can be done just on a Windows box. I have it backup the NT box by using the smbfs to mount it, and then have the backups stored compressed on another harddisk. After the backups are complete, the system uses (http://www.gnupg.org) to encrypt a copy of the files, and puts them in a "pickup" directory. After that, the system sends a signal to a couple of home boxes via http/syslog, upon which the home boxes use rsync to copy those files down over peoples cable modems.
While this solution is obviously only useful for small amounts of data (the downloading to home part), it does allow for secure offsite backups, and even the home backup machines cannot decrypt the data because they don't have the required private key.
The home backup box doesn't have any access to the school server other than the ability to do rsync's, as I'm using a ssh/rsync "forced-command" setup, so even if the home boxes are rooted they cannot get back into the school.
I've started using flexbackup on my home network as well, and it works great, although I wish it had the ability to push the tar files across a SSH connection (it CAN run dump/etc over SSH, but I just want tar backups of files dumped over the ssh connection).
I hadn't see backuppc yet though, and it looks pretty good, and looks like it could easily work in a small replicated environment.
-- I speak only for myself.
Ah Linux, the home of the extended 2 filesystem. The filesystem which still insists that you should fsck it every few reboots or days, whichever comes first. That inspires confidence!
But really, Write Caching is evil, and that's been known for some time. Why is it still an issue for Linux? How can you expect a backup to contain data that the drive doesn't even contain yet? Seems unreasonable to me.
(BTW, IBM appears to have discontinued their Linux LVM project - the Sistina Linux LVM is the best way to go. This Linux LVM is quite similar in commands/functionality as the HP-UX LVM. In fact, full-featured LVMs for Linux have been around longer than similar capabilities on Solaris ...)
Bu has served us well over the last several years. It is a
filesystem to filesystem backup tool that we normally use to
backup to an NFS server over the network, and it can back up
to multi-volume CDRW's. Although, the CDRW dump feature is
not finished for Linux yet, the rest of the features work on
both Linux and FreeBSD. It is designed for immediate
backups of individual files or directories throughout the
work day as well as full or incremental backups via cron.
It is very configurable, light weight, has include and
exclude lists, nice log files, good documentation, and is
free under the GNU public license.
I'm not claiming that dump provides the snapshot. The sequence goes like this:
Of course you don't dump a live filesystem. It's just stupid.
Moderators, this person was not informative, they were simply missing the point.
You cannot apply a technological solution to a sociological problem. (Edwards' Law)
"Backups are for wimps. Real men upload their data to an FTP site and have everyone else mirror it."
...
I'll consider this Linus' guarantee to me that ftp(1) will always work under Linux
I vote for ftp over dump, in this case.
If you want to do multi-volume check out Bacula (Google for it).
Tapeware- for Linux, anyway. Dunno about other Unices.
"Hello, World", 17 errors, 31 warnings
Of course, you don't have to put ISOs on CDs:
/tmp/backup.tar /whatever && cdrecord /tmp/backup.tar
/dev/scd0 or whatever to restore, which under UNIX shouldn't count as special tools.
tar cf
Then you need to tar x
> Ah Linux, the home of the extended 2 filesystem. The filesystem which still insists that you should fsck it every few reboots or days, whichever comes first. That inspires confidence!
Google is your friend: "linux journalling filesystem".
Before I write my own and reinvent the wheel, does anyone know of any scripts which will create ISO image files of an MP3 collection?
Here's what I want to do. On my home LAN (as opposed to work, where we've already got a good backup strategy), I've got a large (20Gb) MP3 collection. I back up everything else with a little shell script, and that's good enough.
But the MP3 collection is hard to back up, and I've only ever done it twice because of the work involved. The technique thus far has been to simply dump 'em in sequential blocks by filename until I cannot add any more files before I exceed the CD's capacity. This is inefficient, of course, because if the next file is 5 megabytes and I've only got 4 megabytes left, then I go on to the next CD-R and preserve the order (so that I can still keep track of what files I've put in).
What I'd like to do is have a script which automatically sorts them so that I have efficient use of the CDs and tars each file individually to preserve the long filenames. (Why not tar all the files in the CD image? Damage to the CD might make that whole disc unreadable.) For the same reason, I'm not interested in disc spanning.
Of course, such a script would be useful anywhere that a large quantity of relatively big files have to be backed up to CD-R... or, even larger files have to be backed up to DVD.
Anyone know of anything like this, or am I rolling up my sleeves and kludging something together?
Fire and Meat. Yummy.
If you use LVM (Logical Volume Manger) for your partitions you can use its built-in snapshot feature.
Anyone use Arcserve(it)? I use it to backup multiple machines to a tape library off a server. Kind of a poor man's veritas from CA. It supports a lot of OSs through propietary agents or through network shares. It has nice restores, keeps a DB of backup information that you can query. The only problems I have with it are the cost is a ripoff, CA is a horrible company to deal with, and the software crashes and forgets device configurations regularly without reason.
Anyone else out there use it?