Unix Backup And Recovery
Cover Image
The Scenario You're a system administrator suddenly tasked with handling the backup of all your employer's mission-critical data. Or maybe you've been handed a tape of questionable origin with the instructions "I need all the files off of this." Perhaps you're working on your company's disaster-recovery plan and are looking for advice about how to restore all the computers to operation in the event of catastrophe. Unix Backup & Recovery is a comprehensive volume designed to help with all of these tasks and many others. What's Bad While the organization of topics is clear, the sheer scope of the book prevents easy digestion of the material by the casual reader. Those expecting to read a chapter or two at random may find some of the concepts hard to follow unless they first read the full 65 pages of introductory material. Also, I would have liked to see a clearer discussion of the differences in procedure and general philosophy between a typical small shop (where tapes are organized based on the day the backup was made) and the kind of unique-volume labeling that tends to accompany larger systems or commercial backup products. Since a lot of Unix systems are being managed lately by people whose background is in smaller systems, making this kind of transition is a very important topic.
One part of the book's design may be good or bad depending on how you intend to read it. Areas deemed especially time-sensitive, like what features are included with which commercial backup system, are not addressed in the book. Instead, readers are referred to the author's backupcentral.com site for the latest information. While assuming that any Unix administrator has Internet access is probably not unreasonable, I found myself reading a lot of this book during spare moments while waiting for routine chores to complete. It was not helpful that I needed to access the Web site in order to follow the chapter I was reading while I waited for my car's oil to be changed.
What's Good With many years' worth of practical experience, several specialist contributors, and dozens of technical reviewers, this book leaves few stones unturned. No matter how experienced you are at managing backups, you could probably learn at least a few tricks from Curtis Preston and his crew. Normally discussions about backups are relegated to, at best, a single chapter in a Unix administration book. Unix Backup & Recovery is the first title I've ever seen that covers this territory in full detail. In fact, even if you aren't specifically a Unix administrator, the discussion of topics like the most common causes of system failure and how to pitch a more reliable backup scheme to management are very cross-platform. They're worth reading no matter what type of computer system you rely upon. So What's In It For Me? The first two chapters of the book provide a real-world approach to backups that include often-unaddressed topics like the availability of the backup hardware in the future, dealing with off-site storage, and exactly how high the cost of poor backups can be. With that basis, the native Unix utilities (dump, cpio, and tar) are evaluated. One particularly good part of that coverage is a discussion of tape portability, and notes on how the GNU versions of those utilities stack up in that and other contexts. Even Unix administrators who aren't involved with backups regularly might find this chapter interesting, as the information about how to read an unfamiliar tape you've been given is alone is worth the price of the book if you're ever stuck in that situation.For those looking to back up systems without much of a budget, a discussion of free backup tools ranges from writing scripts to automate the built-in Unix tools to coverage of the popular AMANDA backup system. The third section covers what to look for in a commercial backup product. This is light on specific recommendations, instead trying to educate the reader well enough to perform his or her own product selection. A somewhat related chapter covers the main ideas behind High Availability, which is obviously too big of a topic to cover fully in a 15-page section.
The next few chapters cover bare-metal backup and recovery, where the goal is to make a backup of the system capable of being used to create a new system in the event of a total failure. Many traditional solutions to this problem involve first re-installing the operating system, then restoring the backup. The author maintains this is a bad approach, and instead focuses on constructing a small bootable system (i.e. a Linux rescue floppy) capable of partitioning the drive and restoring the backup without laying down the OS first. SunOS/Solaris, Linux, Compaq True-64 Unix, HP-UX, IRIX and AIX are all covered.
Four chapters on database backup and recovery suggest how to integrate your backup solution with the database vendor's tools. Along with a general discussion aimed at bringing non-database administrators up to speed on DB lingo, separate chapters cover Informix, Oracle and Sybase. Finally, the three closing chapters to the book include miscellaneous information like backing up Rational's ClearCase product and selecting backup hardware, as well as some notes on upcoming trends.
Competent system administrators, either through forward thinking or past battle scars, develop a level of paranoia about their computers and how strongly their data should be protected that people outside the field find it hard to fathom. If you'd like to hone your own sense that everyone is out to get you, and know how to stop them, Unix Backup & Recovery is as good of an introduction to that topic as you'll find anywhere.
Buy this from ThinkGeek.
Table of Contents- Preparing for the Worst
- Backing It All Up
- Native Backup & Recovery Utilities
- Free Backup Utilities
- Commercial Backup Utilities
- High Availability
- Bare-Metal Backup & Recovery Methods: SunOS/Solaris
- Bare-Metal: Linux
- Bare-Metal: Compaq True-64 Unix
- Bare-Metal: HP-UX
- Bare-Metal: IRIX
- Bare-Metal: AIX
- Backing Up Databases
- Informix Backup & Recovery
- Oracle Backup & Recovery
- Sybase Backup & Recovery
- ClearCase Backup & Recovery
- Backup Hardware
- Miscellanea
Yeah, but you can't practice LAW unless yer a LAWYER, right? Any book on data destruction would always end with: "Laws vary. Always check with your legal representation before destroying any data."
Seriously, the last shop I adminned for was under a court order to NEVER overwrite our monthly backups. Our off-site storage costs were INSANE.
Hey, first off, I'm not bashing anything. This isn't a troll, isn't a flame, nothing. I reall y like linux, but I don't like the hidden expenses. Read on:
.tar.gz file and 30 min to get it unpacked. tell 'em they can win $100. then tell them to read the man pages. don't worry about losing your money.)
This book, several hundred pages talking about backup and recovery, supplements the O'Reilly book line for Linux. If you put all of their books about Linux and administration (I'm not talking about Perl or Apache or SQL books mind you) you'd have a several thousand page volume.
This is outrageous. Hardcore linux proponents (almost as annoying as the hardcore mac users) LOVE to talk about cost of ownership. ("M$ is SO awful bilking people") Well, when you have to buy $120 dollars in documentation just to run the damn thing, that $100 copy of Win2k Professional doesn't look so bad after all.
Yeah, yeah, you can find it all free on the net. But what good does that do you when you can't get your PPP configured just to connect to the internet? What are you supposed to do in that case? Print them out? Might as well buy the book considering the time and expense in printing them out.
It's too bad Linux books (with very few exceptions) suck some hardcore cock. You can't just buy one or two linux books, you have to buy "Running Linux" then some Linux security book, then some samba book, then some apache book, and now some backup book. Of course, you don't have to buy these books, but you stand to fuck shit up if you don't. The documentation on the internet is pretty crappy, man pages are totally WORTHLESS (give a new user a
It's not just a little under $.03, it's pretty true.
email me to bitch:
supershroom@rocketmail.com
As you said, see www.colltech.com for more information.
I've been using Amanda for backing up my systems for about 2 years. To me it looks a lot like Solstice/Legato Backup, which is used at the univ/work.
There's one difference: Solstice/Legato has a GUI (which I never really used BTW). I think it would be nice if someone wrote a GTK GUI for Amanda.
It searches sector-by-sector to find the locations of partitions. You can usually feed the output back into sfdisk to recreate them.
It works good, saved my butt :-)
I have not read the book, but I agree with tha idea that bare-metal backup are the way to go.
I work as a tech monkey for a school board and I deal almost exclusively with M$ crap. Win95 and friend keep of lot of state data that change constantely, so the only way to have a restorable backup is to take a snapshot of the disk. The file in your backup is'nt randomely accessible, so this solution can't be used for archival. I personnaly use Norton Ghost 5 (does good work).
The other great thing about this approach is that I can now afford to be lazy on problem-solving. Computer X had been doing a lot of BSOD lately ? Pop in that network-enabled boot floppy and dump the last-known-good image to disk from the server.
The only drawback I am aware of is that you can dump these image only on identical hardware. A NIC in a different slot is enough to fsck-up your restore. As I said, Windoze and friend keep a lot of state data (including hardware-related thing such as IRQ, IO, etc.).
:wq
Try this. I originally heard of it off the ReiserFS FAQ page... never used it, but it's worth a shot.
Try the links here. Look at the ASDM mini-how-to and the backup with MSDOS Mini-HOWTO.
Try to hack my 31337 firewall!
Well, you can make a full-blown GUI app that does backup and restore, but there's one little thing: you probably need to be able to run the restore while a tiny root is mounted and no /usr. That means no X.
Of course, that's no reason that backups and "casual" restores (i.e. not restoring the whole system) shouldn't be GUI. But they damn well better have an X-less alternative. I wonder if that's one of the reasons that tar is still around.
---
As copyright owner of this comment, I authorize everyone to defeat any technological measure which limits access to it.
Nice try at the usual cynicism around here, but:
It almost makes me wonder if someone genuinely thought this book or topic is interesting. Maybe Hemos recently had a "disk accident" and the subject is on his mind. :-)
---
As copyright owner of this comment, I authorize everyone to defeat any technological measure which limits access to it.
I am an ignorant crawling worm, and so lazy and slothful that typing a a company's URL into a web browser and reading a list of what they sell, is too much work for me. It wouldn't be so bad if I weren't so foolish, opinionated, misinformed, and philosophically bankrupt. But I am. Oh great all-knowning Oracle, I ask -- whoops, wait, this isn't an oracle grovel. What I meant to say, is that I was pretty wrong about ThinkGeek selling books. They do sell books.
But not O'Reilly books!
---
As copyright owner of this comment, I authorize everyone to defeat any technological measure which limits access to it.
I have to setup a backup system for a bunch of Sun machines mostly running Oracle for an Oracle software shop.
We have decided on Veritas NetBackup.
My *painful* experience with CAI ArcServe IT/Open told me to stay far away and then farther from that beast. If you decide to try it out, think of proprietary tape formats and Raima DBMS (at least that's on NT). Also consider the insane number of patches and the fact that it's prone to crash boxes of Intel variety, especially the Netware ones.
You come to store, you see lots of UNIX books. My favorite local shop (Borders of Palo Alto) has literally 7 shelves full of Linux books, and maybe 30 on other flavors. Ironic, isn't it?
Well, there were none that were dedicated to backups and were current. I'll pick this book up and see if it sheds some light so I can sleep at night.
--
Leonid S. Knyshov
Network Administrator
Leonid S. Knyshov
Find me on Quora
It's called a l33t skr1p7 k1dd33. They like to break into networks, or, sometimes, destroy the network they're on. Their favorite tool is rm -rf /*
Wouldn't it be nice to know when you get one of those jerkoffs on that you have a tape full of all the important data?
If you think you know what the hell is going on you're probably full of shit. -- Robert Anton Wilson
If you think you know what the hell is going on you're probably full of shit. -- Robert Anton Wilson
jdube is who
Could it be because this book is sold by ThinkGeek, a company owned by Andover.net ?
True, a novice sysadmin (or one interested in expanding his/her knowledge) might need to purchase books for easy, reliable reference. This is a one time purchase unlike the expensive licensing model of Windows or NT.
If a tool doesn't exist, and you are technically inclined enough to REALLY need it, you should be technically inclined enough to write your own. I challenge you to write the same kind of utility for NT, with a non-open OS and a non-open filesystem.
In summary, the benefits of Linux cover far more than initial cost, they stem from an operating system, file system, windowing environment and utilities that are all open source. If you can't understand that this kind of environment allows one to create one's own tool and modifications to allow the system to do WHATEVER ones heart desires, then you better fdisk, format and install NT because the open source movement has just passed you by.
It's been a while since I used it, but I seem to recall Legato Networker having a quite useable GUI. Sure it wasn't stunning but it worked and made simple backup/recovery jobs easy to do by just pointing and clicking.
Of course the price for Networker is rather high. Esp. if you use a tape jukebox or want to back up a number of client machines in a multi OS network.
lone-tar from cactus software...
Their SCO Unix product was a dream (Creates "Airbag" boot/root disks w/ a fully automated (auto partition/mkfs/etc...) restore function. Like 80.00. Trial D/L available @ web site. Linux product looking pretty good, too. (Only have it installed on a test machine right now...)
Worth every penny (IMNSHO)
Technology -- No Place For Wimps! Grateful Dead and Jerry Garcia Chatroom -- http://www.wemissjerry.org
In general, anything which really needs to be kept should be printed out and archived in duplicate (this also has the advantage of settling once and for all what time a document was created, unlike electronic formats),
This is dang expensive, bulky, and could put your company out of business. You need a lot of space, and every 20-50 years you would need to copy the documents (acid based paper).
Caterpillar has chosen to electronically archive almost everything, to save money and time in printing repair manuals.
You can read other stories about electronic document management at the Document Management Alliance homepage.
Disclaimer, I work for a Fortune 50 company that specializes in Document management, so I do have a vested interest in this.
George
And I got a frickin' link wrong.
Document Management Alliance.
George
True, its increasingly meningless to use backups for disk crash recovery. Backups should be used to
recover individual files or directories which were deleted or corrupted. To be sensible, you need mirroring or other RAID methods to protect against disk failure these days.
Thats why ufsdump etc becomes increasingly meningless by the second. A GPL'd application like IBM's TSM/ADSM built upon a GPL DBMS would kick ass!
Linus Thorvalds
Yup, the linux dump/restore programs used to be unmaintained. :\
But now, there is a new maintainer and a sourceforge site.
No, *nix never crashes :) But the ideea of backups is good when:
- lots of users play with rm, wildcards and their "important" data
- you have a shitload of disks and the mtbf/nr_of_disks ratio is very low.
- you||your bo$$ are paranoid
- you have VERY important data which must survive an axe-trough-cpu error.
ps: mtbf==mean time between failures
1% APY, No fees, Online Bank https://captl1.co/2uIErYq Don't let your $$$ sit in a no-interest acct.
I have ADSM running on a linux server here. Been happy with it so far (its backing up to a AIX server that another dept. manages)
What other backup software for Linux do you have experience with (both good & bad)?
> tar works fine except it duplicates hardlinks
Do you mean that it adds the file all over again to the tar, or that it records the link to the tar? (Wasted space vs. recording it at all.)
The tar distributed with Solaris 7 (no handy earlier versions) behaves in the second manner, though it'll give errors if you don't include the file being hardlinked to in the tarball.
(I'm not posting this to say "You should use Solaris instead of [FLAVOR OF OS].", but rather to determine if your particular problem with tar is an across the board behavior. I don't want anyone reading the discussion to think that no tar handles hardlinks well.)
Or perhaps I'm misinterperting your use of hardlinks. I'm thinking "symbolic vs. hard".
-- Chapman's Observation #1: Nothing is ever simple
Ahh yes... Networker one of my favorites... I loved this product at my previous job, ease of use, add-on options, allow the secretary to get her own files (without hardly any training: point click, what time period, where do you want it...).
We are currently slamed with EMC's edm sofware... ugly, but it is kinda nice to be able to take a couple of terabytes directly from a box and throw it onto tape without using any host or network resources. At least it's not Alexandria...
ps. The price of Networker is only ~1k which for some shops seem pretty high but it is so damn worth it, ease of installation, GUI for everyone, wonderful multiplatform support: Solaris, Irix, HP, Linux, Windows..., encryption, archiving...
Of course your results may vary, I've had awful experiences with many other products (which shall remain nameless) that others have had great luck with.
As a newbie to linux, this sort of information is important to me, and it's good to see that a topic like this gets the attention it deserves, rather than a chapter in a book about linux in general.
BTW, I liked the review, but I thought it was sort of simplistic.
More race stuff in one place,
than any one place on the net.
beep....beep....beep....
Ouch! Stop hitting me.
tcd004
LostBrain
I am a newbie starting out with Linux. Is there any resource which gives a small how-to for backups. A complete book looks intimidating at first sight.
**Life is too short to be serious**
Just to be sure about the actual day it was counted (sic). U speak with the pschological mindset of a lawyer in saying that data should be destroyed just because one company indulged in illegal activities and got caught at it. Also the reason RDBMSs sell is that the actually help to reduce data redundancy without losing any data. The preservation of all this data can lead to unexpected benefits(aka data mining) which one can not be aware of at the moment of making daily backups and to throw it away is wasteful. Also I suppose the community always considers data as valuable as unlike a lawyer they dont automatically think everything to be incriminating
**Life is too short to be serious**
What the reviewer did not mention is that all the contributors and reviewers came from a consulting company in Austin, TX called Collective Technologies. My brother was one of the reviewers :-) You can visit their site at www.colltech.com.
Orpheus2000 - Hell and Back, Again!For a start, any file with the extension ".ppt" should have a sunset period of no more than two weeks, unless there are powerful reasons not to destroy.
People often say that you shouldn't throw the baby out with the bathwater, but you have to admit that most sysadmins take the attitude that you shouldn't throw out the bathwater either. Which is why so many corporate archiving systems stink of piss.
-- the most controversial site on the Web
Humberto Molena Rodriguez, Press Officer
Humberto Molena Rodriguez, Press Officer
Overdrive Trucking & Hauling Corporation
Interesting comment. Stupid but interesting. But then it's from a coward isn't it.
Damn you beat me to the punch. :-)
Back in the early '90s I made a pair of programs which did exactly this. (I was way into virus research back then and made a little bit of a living in my early teens doing just this) All it did was scan the drive from sector 1 looking for a partition signature, analysed it and jumped to where it indicated a boot sector was and if it existed, wrote it in to the MBR. Made a partition resizer too. Also played a lot with DOS MCBs and managed ot make a permanent LoadHigh program by altering the last MCB in the chain.
I never thought much of these programs until years later when Partition Magic came out and I realized I'd yet again screwed myself out of a cool idea. Other ideas? Hooking a modem up to the original NES (I think I still have drawings for this), 2-way paging, etc.
sigh.
>The other problem is an incredible black hole of documentation. I've gone through everything at freshmeat, and none of them met my
;-) This book is valuable because it discusses strategies like that, as well as covers issues like hot & cold backups of databases, & related technologies like HSM & High Availability -- as well as storage technology.
>criterion of being able to do multi-level backups and could span volumes of variable size. These two criterion aren't exactly difficult to
>satisfy in the Windows world or even commercial UNIXes, but for linux OSS projects, it was nearly impossible to find it.
This ``incredible black hole" extends way beyond just how to use the program. Several months ago, my boss asked me to look into backup strategies, one of which is called ``Towers of Hanoi". Needless to say, nothing on Deja.com led me to understand just WTF this was. (Although I wasted several hours on reading about the math problem of the same name.
My criticism of the book (which I have open at my elbow) is that Preston should have discussed commercial backup utilities in more detail -- even though he states a persuasive reason for this decision. (``Products change constantly. It would be impossible to keep this book up to date with the 50 different backup products that are available for Unix.") I still feel that providing an intelligent criticism of one or two products -- their strnegths, their weaknesses, how they work -- would help the newbie sysadmin, who seems to be the one usually delegated with this important, but unsexy task.
Geoff
I think I see a trend here. Maybe for them it really would be easier to muzzle the entire internet than to produce p
From www.ora.com: Slashdot.org book reviewer Greg Smith awards Unix Backup & Recovery a rare 10/10 rating and says it "is the first title I've ever seen that covers this territory in full detail." Read the entire review and discover why this book is essential reading for the well-prepared sysadmin.
(emphasis added)
People wonder why I complain about the lack of real journalism on Slashdot. They also wonder why I complain about the consistently (and usually undeservedly) high ratings ALL book reviews get (I've never seen anything below a 6).
Well, folks, here's the reason: Because whether Slashdot is real journalism or not, people will treat it that way. Like it or not, what Slashdot says is the perceived reality. Let's make sure perceived reality and actual reality are at least on speaking terms, shall we?
--
Linux MAPI Server!
http://www.openone.com/software/MailOne/
(Exchange Migration HOWTO coming soon)
Any suggestions as to how to rebuild a blown partition table when you dont have the original info?
I've got a system with a zorched out MBR (Dont Ask) on a 30GB EIDE drive, I bought a second identical drive on which I mirrored the original drive. So I can experiment without risking the original data. I've found the first partition, start and end , and then found the start of the second, but then the tool I'm using cant get beyond 8GB.. And then I'm stuck as to how do I reconstruct the Partition Table given raw sector numbers?
Any suggestions as to what Linux tools whould be useful for raw sector read/write and translation for Cyl/HD/Sect to LBA, then to Partition Format?
Closed source Tools are OK, but Iv'e tried Norton, and NT's DiskProbe, they dont work. Most tools assume you have a MBR backup (Dont Ask.. again...)
(OK... I tried replacing a MBR from 1 disk to another to fix a problem with LILO, forgot about the Partition Table, and didnt have a rescue disk... So now I get to learn all about disk partitioning, the hard way)
Starman97@Gmail.com (bring it on spammers)
I think this type collabration is the spirit of the Open Source movement, the spirit of cooperation towards a common goal. After all, the tag line of Collective Technologies is 'The power of many minds.'
I guess I should also point out that our company is the exclusive onsite support for Redhat as well, so we have way too many Linux experts for our own good. `8r)
--
Gonzo Granzeau
Gonzo Granzeau
"Nothing the god of biomechanics wouldn't let you into heaven for.." -Roy Batty
Well, I can address your issues WRT my own backup setup, which is Amanda w/ Dump.
Amanda has the pitfall you mentionned of only wanting to write to tapes. It uses lots of tape IOCTL's. But the next version is going to have a TAPER API, that will allow it to write to anything from a serial port to a RAID-0 array raw device.
Also, as to "getting down and dirty with the filesystem", this is exactly what dump does. Pretty much every unix has some type of dump/restore program, which reads the raw device of the disk. Linux is no exception. Dump has higher performance than tar, and backs up the same things the kernel sees - sparse files, hard links, the whole bit.
Thanks for filling me in on what MTBF means. I mean, I'm a WINDOWS person. There's NO WAY I could know what that means. That's what I meant about Physical deteriation of disk drives. ps: I realize the needs for backups. I was just being a smart-ass!
---
DO NOT DISTURB THE SE
Why would you need a backup? I thought *nix's never crashed.... Oh, well, I guess the physical deteriation of disk drives :)
---
DO NOT DISTURB THE SE
I have to do this sort of thing for a living, so it's nice to have a report on this book, but I'm wondering why more tech book reviews don't make it to the main slashdot page. Any special reason this did?
I'll start with Legato Networker. I have kind of a love-hate relationship with with Legato. The product has many strengths to recommend it, but also many significant weaknesses. It has a good graphical interface on both unix and nt. The nt interface is a lot better for configuration, while the unix version is better for operations and monitoring. Both GUI's connect via the network to the backup server and are installed with the agent on all client machines. It also has a well-rounded set of cli tools that again are network-based and installed on all clients. In general, everything in the gui can be done from the command line, but some of it is rather painful. Still, if you are planning to support the product 24x7, you better learn the command line for those nights when the VPN server craps out and you have to dial-in by modem instead of using DSL. The overall architecture is well thought out and works pretty good for the most part. I can sustain 50MB/s on a Sun E450 writing to 10 DLT7000 tape drives in a single robotic library, and have seen the peak go over 80MB/s. The biggest weakness of the current version is the index structure. This is the system that stores which files were backed up from which client, when, and to what tape. Legato uses a hacked-up b-tree structure stored in compressed binary files. The lookups are pretty fast, but it can choke if you are backing up many streams of small files simultaneously because it can't write as fast. The real problem, though, is that the indexes get corrupt too easily. The result is a lot of time spent cross-checking and recompressing the file indexes. The media index is worse because it can't be repaired. If an error is found, you have to restore from an earlier version (the media index is written to tape several times a day). This doesn't happen very often, but it shouldn't happen at all. Acknowledging the problem, Legato will be replacing their index system in the next version. Another annoyance is the lack of a decent global management utility. I have many E450 backup servers, each ignorant of the others, and each has to be configured seperately. The final major drawback is its use of a proprietary tape format. You can only read the data with Legato. Still, I throw several terabytes at the system each day, and it gets the job done, for the most part.
Veritas NetBackup is the newest of these three products, but has come on strong in the large datacenter segment these products play in. It supports several advanced features like dynamic robotic tape library sharing, which is very usefull in a Fibre-Channel Storage Area Network. The index structure is flat-file based, so it doesn't get corrupt and is human readable, but takes up more hard drive space. That last part is a non-trivial point. My oldest Legato server has accrued over 120GB of index information. If this were flat files, I would need to buy a lot more disk than I currently have. Another positive is the data is written in tar format, whether to tape, optical, or filesystem. NetBackup supports more client OS's than Legato, including support for Linux, but not BSD. Legato has unsupported clients for Linux, NetBSD, and BSDi. The major drawback is administration. NetBackup is more complicated to configure correctly, particularly in a large environment. It is also harder to maintain as the environment expands.
IBM's Tivoli Storage Manager is the only package that can backup the entire enterprise, from the Mac desktop in PR to the OS/390 in the datacenter. TSM supports just about any client platform you can think of that's still in use, except (curiously) Linux or BSD. For the index structure, TSM doesn't mess around: it comes with a custom version of DB2 specifically hardened for use with TSM. Because it uses a DBMS, TSM has by far the best reporting abilities of the three. You can buy a package of reports from IBM, or roll your own using standard SQL. Another major advantage is the backups are 'incremental always,' to use the IBM marketese. The first time a client does a backup, it is a full. From then on, only changed files are sent to the server. While the other packages support this, rolling through all the incrementals in the case of a full restore is painfully slow and requires a lot of tape mounts. TSM can do this because of the DB2 index system and very advanced media management inherited from the mainframe world. Like NetBackup, TSM writes all data in tar format. All this power comes at a price, unfortunately. TSM is extremely complicated to configure across a large enterprise and appallingly expensive.
On a final note, a word of caution: backup administration is the most thankless job in all of IT. No one notices the 99+% of backups that run successfully every day, but one failure on a business-critical system and you get crucified. Also, be prepared for your damned pager to go off at the most unfortunate times, day and night. To anyone considering a job as a backup admin, Just Say No. Trust me.
- Isn't this really one of the skills a sysadmin should know already? Why is this book necessary? If you don't know this, you shouldn't be an admin! It's that simple.
Albert Einstein had one of the most comprehensive libraries on mathematics, physics, logic, and philosophy in the world.When asked why he had so many books, he said: "Why waste all my precious time learning all these facts, when all I really need to know is where to find them in which book?"
Most rational adults will agree that Dr. Einstein pretty much knew what he was talking about, and he got that way by using his time efficiently at every opportunity. That's how he changed the understanding of the known universe in his lifetime.
The best System Administrators have the best libraries, or the best reference resources available at all times.
The --worst-- admins only rely on their own memory.
This book is an essential piece of a comprehensive reference library.
That's "why".
AC:
y .html
While your point about the "hidden cost" of linux books is open to debate, it's moot here since this book's only Linux-focused chapter (bare metal recovery) has been made available for gratis:
http://www.backupcentral.com/bare-metal-recover
One of the problems with standard UNIX backup utilities is the fact that they don't have a friendly interface. This isn't a problem for the sysadmin, if he needed a GUI I wouldn't hire him! But what about the secretary or other low-level employee that changes the tapes? Someone has to do that, and in many cases, such as smaller companies, the sysadmin isn't in there full time. A easy to use interface saying exactly what needs to be done is a must. Of course you just need a little prompt saying "Remove current tape" "Inset tape 1" but that can mean the difference between a backup system that gets used and one that is ignored.
"In general, anything which really needs to be kept should be printed out and archived in duplicate..."
No offense, but this is a bad idea.
I have worked in the biotech industry for a number of years. Now for starters, any data supporting a publication, invention, or patent, has to be kept for seven years. (In Canada--I think it's the same in the US) The difference between seven years and 'permanent' isn't much when the average lifetime of achival media is less than that. In other words, if you're looking for a way of storing data for more than three or four years, your looking for essentially 'permanent' archives.
Secondly, the 'printing out in duplicate' idea implies that all data worth archiving is textual or visual. In one lab, we generated four-dimensional data sets, and did data interpretation on processed slices of extracted cubes. There's no WAY we could print out the data set, and even if we could, it would only be the processed data, using somewhat subjective processing parameters. The original data would be lost.
You do make a good point, though, that much of user 'data' is utter junk. Thing is, if you told people that it would be destroyed at the end of the month, we'd decimate an entire rainforest, printing out 'mouse balls.' One of the nice things about archiving computer data is that it's (relatively) cheap, resource-friendly, and easy. Makes it tempting to archive stuff that you never cared about keeping before.
"People who do stupid things with hazardous materials often die." -- Jim Davidson on alt.folklore.urban
In short, for a backup scheme to be effective, it needs to get down and dirty with the filesystem - abstraction layers invariably lose performance (which in this case is defined by backup speed and how much tape is required).
The other problem is an incredible black hole of documentation. I've gone through everything at freshmeat, and none of them met my criterion of being able to do multi-level backups and could span volumes of variable size. These two criterion aren't exactly difficult to satisfy in the Windows world or even commercial UNIXes, but for linux OSS projects, it was nearly impossible to find it.
The list goes on. I hope this book can provide step-by-step documentation for setting up atleast ONE backup program. AMANDA I hear is nice, but when I downloaded the distribution.. I couldn't make heads or tails of it. This is coming from a guy who wasn't phased when setting up procmail recipes and getting Sendmail working in, uhh, unusual configurations.
That's my $0.02. In short, linux offerings are limited. People focus on the more glamorous things like kernel development or creating a GUI.. but I could really go for the basics - like an easy to use CLI-based backup program that has a decent feature set.
If I could, I'd moderate StreetLawyer up in the hopes of starting a discussion (alas moderators don't hit the book reviews often)
While it may smack of a dark corporate culture of ingrained cover-ups to us geek-types, the fact is that excesive records can be a genuine danger even to those of us who feel we have nothing to hide.
IANAL, but I was recently the plaintiff in a civil suit, and I was surprised by the dismay of my attorney at the voluminous records I had kept, documenting every meeting with the defendant for the past two years (all cc'd to the defendant within days of the meeting, with requests for comments). I thought I was being diligent and even praiseworthy.
Not so. It turns out that my words, even if cc'd for comment, can almost always be used against me, but are apparently rather weak support for my version of events.
Fortunately, (much to my lawyer's surprise) we found nothing it those memos to injure my case, but they also were of no help when the defendant (more accurately, the defendant's employees - the defnadant was a large organization) simply pled "I don't know", "I don't remember the letter", and "I skim those things and throw them away. I don't have any of them in my files"
[It was infuriating, somehow we thing Big Outfits file everything -- and they probably do, but how are you going to prove it? This outfit had defended against such suits in the past, and had learned its lession well)
Most of us could stand to improve the organization in our lives, and are bitten by "I wish I had that file" more than "wish I didn't", but "too much data" is potentially harmful. As cases involving e-mail and USENET have shown, casual, ill-framed, or out-of-context remarks can be damning.
The cleaner your backups, the fewer irrelevant details (especially details even *you* didn't know) to mess you up. If you get sued for copying code, you don't want the plaintiff to be able to find a backup showing a bootleg copy of his program on your network -- even if it was just something your summer intern installed to help him/her understand your product.
If you had something in your house that was useless and potentially toxic, I hope you'd get rid of it (even if it is related to you by marriage
__________
If you can go to bed, knowing you did a valuable thing today, you're very lucky. If you can't... it's not bedtime
Speaking as a lawyer, my view of the IT profession in general is that they are, for some weird psychological reason, obsessed with preserving all sorts of data of any sort forever, without any thought to whether keeping it is useful, or even downright destructive. In general, anything which really needs to be kept should be printed out and archived in duplicate (this also has the advantage of settling once and for all what time a document was created, unlike electronic formats), while "backups" of most user information should be more focused on deleting the useless and incriminating crap which most users clog up their hard drives with. Although I suppose that nobody's going to get rich selling new database programs or "enterprise level" record management systems that way.
just my opinion
John Saul Montoya
-- the most controversial site on the Web