Best Home Backup Strategy Now?
jollyreaper writes "Technology moves quickly and what was conventional wisdom last year can be folly this year. But the one thing that's remained constant is hard drives are far too large to backup via conventional means. Tape is expensive and can be unreliable, though it certainly has its proponents. DVDs are just too small. There are prosumer devices like the Drobo, but it's still just a giant box of hard drives, basically RAID. And as we've all had drilled into our heads, 'RAID is not backup.' When last this topic came up on Slashdot, the consensus was that hard drives were the best way to backup hard drives. Backup your internal HDD to an external one, and if your data is really important, have two externals and swap one off-site once a week. Is there any better advice these days?"
Back up to a honking flash drive?
rsync.
Switching off-site backups every week is an unnecessary hassle. Back up to an external hard drive and an online backup service. Anything more than that is overkill unless you have really important data.
A Magic the Gathering Article and Forum Aggregator
Maybe it's unconventional to use, I dunno, another hard drive?
Confucius say, "Find worm in apple - bad. Find half a worm - worse."
Not the same as external backup, but it provides redundancy against a single drive failure and provides history. Otherwise, run backup overnight every now and then to an external drive and store it away.
Fuck systemd. Fuck Redhat. Fuck Soylent, too. Wait, scratch the last one.
You should try Mozy - free for 2GB, $5/month for unlimited. I've been using them for a year and its seamless.
https://mozy.com/?code=WAQ9DM
... Amazon S3 mounted on FUSE. USD $0.17 per gigabyte to store offsite, and with FUSE I can browse it like it's local. I am very happy, and saving money too.
Thank for reading to the sig. You may stop reading now. It is safe. There is no more content. Why are you still reading?
What do you mean Take a Backup?
My Hard disk is nowhere near full.
Hmmmm, I must download some more Music.
Ok, Only joking.
Seriously, how many Windows users know that lots of important stuff is in hidden direcetories that just copying your files is not going to work?
Almost 50GB per disc and brand name blanks aren't too expensive if you know where to look. (Hey Newegg: surely y'all could save us some nuisance if you'd import a shipping container or two of blanks direct from Japan...) Nero Linux supports Blu-ray drives. RAID1 for primary storage with BD-R DL backup, with the backups ideally stored securely off-site should be sufficiently paranoid for most home users though Blu-ray is too new to have real-world long-term integrity statistics.
Remote backup to a rented dedicated server is also a possibility though not terribly practical in America due to certain monopoly carriers (<cough>AT&T</cough>) being too cheap to build FTTH, at least until they run out of duct tape and bailing wire to keep their WWII-era copper plant patched together, and even then.
All of the online backup strategies are a joke. Due to bandwidth restrictions, it would take years just to make a backup of a typical user's hard drive, and they don't offer enough space (seriously). The cheapest form of medium currently is hard drives, so my current backup system is to have 2 equally sized 500 GB drives and I use Acronis on a schedule to do a differential backup of one drive to the other once a week during early morning hours. If the differentials start to get too large, I'll do a new full backup and start doing differentials from there again. I haven't found any backup solution that is "totally" automatic in this regard, but since it only requires manual intervention once every several months it's not a huge deal.
Aside from the choice of backup medium (home users back up to flash if they have less than 8GB of data and to hard disks if they need more space), the big reason why people don't backup is that doing it right is ridiculously complicated, due to lack of software support. There's basically no way to produce a reliable full backup from within a running system. If people are supposed to make backups that they can rely on, this must change.
I decided that I have three main "categories of data":
- easily replaceable: This is stuff that is fairly easy to replace.. for instance I have ripped a huge portion of my DVD collection (for my own use). If I lost this data, it would not be a tragedy .. just a pain in the ass.
- hard to replace: This is stuff that does exist "out there".. but would not be easy to replace. This includes old TV shows that you can't buy or if you can are very hard to find.
- irreplaceable: Self explanatory.. this is my documents, code, photos, etc that could not be replaced if lost
I keep everything besides OS files on a file server. Raid 6 (two parity stripes).. this is the first layer..
to me this is adequate to protect "easily replaceable" stuff (which in my case constitutes a huge chunk of file space).
I backup everything in the "hard to replace" and "irreplaceable" categories to a seperate (removable but stays in the system) hard disk (so far 1TB has been enough to hold all this data). I make a
secondary backup to a second removable drive and store this "off site". This secondary backup does not get updated very often.. which is the trade off I guess... but it provides a "last hope" if something
crazy ever happened.. like my house burning down.
Oh.. and backups are encrypted!
rsync.
That's the protocol. Now what media do you recommend? Another hard drive?
Thank for reading to the sig. You may stop reading now. It is safe. There is no more content. Why are you still reading?
OpenSolaris and 8 drive RAIDZ-2. PHYSICALLY disconnect that fileserver (and turn it off) and sync up to it once a month.
Use GlusterFS or RSync to sync that up to your main computer. If you can figure it out, make incrimental backups to DVD once a week (or day, if it's that important). Take those DVDs off-site into a vacuum sealed (not expensive, you can make one that uses a hand pump and a box). If everything goes to hell, restoring from DVDs takes forever but you have that option, and that's what's important.
Windows Home Server actually has very good backup options. a)It allows for folder duplication on shared folders, protecting your shared files against a single hard drive failure. b)It allows you to add a hard drive as a backup drive, basically to dump all the shared folders, which can then be taken offsite. c)Jungle Disk has a WHS plugin, and there's an alternate Jungle Disk plugin which is allegedly better on whsplus.com, which provides your online protection. Automated daily backups mated with Volume Shadowing means that not only is your data safe, but previous versions are available too.
I bought a pair of Infrant ReadyNas NV+ systems a couple of years ago; I kept one for myself, and gave one to my parents.
My computers back up to my nas box, my parent's computer backs up to their nas box.
I keep a ssh tunnel open between both of our networks, and each nas box uses rsync to back up to the other one.
The only problem I've run into so far is Comcast's 250 gig cap; but so far I've been edging in slightly under the limit.
I've re-purposed a computer as a backup server, which lives at my parents house. It runs Ubuntu, with ZFS running over FUSE. Each night, a scripted CRON event will run zpool scrub on my storage pool, and if there is a problem, it will send me a text.
My MacBook Pro will use Time Machine over NFS over SSH to make the actual backups from my dorm/wherever I happen to be.
Commence CDDL/GPL/BSD Flamewar.
-jX
Don't you just love politics? It's like a comedy of errors.
I recently started with an off-site rsync copy (daily) to a remote linux box. The linux box has an encrypted partition for added security.
Advantages: automatic, offsite, encrypted (rsync over ssh, over an Openvpn tunnel), rsync algorithm (only copy differences, etc..)... An extra step in the future will be adding BackupPC to the remote linux box.
Why all the hassle? I need to safe-guard approx. 25 GB (mostly pictures) and online backup services. If they're free, no thanks.
I store all of my porn videos and ripped music in the Limewire cloud, and let other people back it up for me. Works great, and I often realize I have backed up songs that I don't even remember ripping!
Not really, keep doing it like that. for how to do that read this: http://jwz.livejournal.com/801607.html
I'm kinda a 'option 1' guy, but stuff that's really important, I just burn on to DVD every so often.
The other option, now that most folk now have halfdecent connections is to set up an rsync to a buddies machine, (and reciprocate) , using encryption, you now have an automatic off site back up.
HDD are fast, cheap, and sort of reliable. Keep two separate (physically different locations) double sets of them and its relatively safe.... and probably still the best way to do it for large data collections.
If the data only has small changes to it day by day and you have fast upload access consider adding over the internet off site back up (two locations).
Anyone know the long term stability of SSD drives?
Mozy is good - it's offsite backup with nice shell integration. Sadly it's Windows only though :(
-- Mike
I have an external harddrive attached to my Macbook and Time machine takes care of the rest. And my important document and photos I upload to my dropbox That way I have a local backup of my entire harddrive in case something happens to my Macbook and one stored on the "cloud" that I can reach if my house burns down.
I fought the corporate America, and the corporate America bought the law.
It depends on how many important files you have. If you have just a few documents, you can still burn them to CD periodically, or use an online for-pay backup service such as Carbonite or Rsync.Net. The reason to use HDDs is because you have lots of data, or your computer data, including OS installs is very important to you, and you need a way to recover rapidly. (E.g. you _really_ can't wait, and it's worth the cost of external HDDs and accessorie to avoid waiting)
If money is no object, ioSafe makes some fireproof, waterproof, shock-proof drive enclosures, which could help against disaster situations. The alternative is indeed use of an offsite location. You need a lock box or safe regardless of method, to help protect against human risks to your drives. Or utilize encryption to help prevent data from fallign into the wrong hands.
Otherwise, if you use HDs for backup, consider a hard drive docking station. Like one of these or a voyager Q (who makes a model supporting Firewire800 also); docking stations are more convenient to buying a bunch of external HDs. Eventually, when you upgrade your hard drive, use the old one to store important files.
If you have a stack of old hard drives, you can actually use them also. So a dock, and some plastic cases to put your internal HDs in could be favorable to buying a bunch of external HDs. (There are companies that specialize in selling rugged anti-static plastic cases for HDs, but I just pile them in a box, and use the original anti-static bags that came with new HDs)
If you are using old HDDs for archival purposes, make sure to spin them up every few motnhs, or you suffer bit rot, and the mechanical components of the drive may fail.
Or get one dock + multiple cheap HDDs for important documents.
And possibly one large HD for a full system backup. Apple users are blessed with Time machine. Linux users can dd or rsync their files, and even have a script do it nightly (so long as you have multiple HDs, and cycle them after backups).
Windows users have got to use third-party software or do some scripting.
But the one thing that's remained constant is hard drives are far too large to backup via conventional means.
Maybe it's unconventional to use, I dunno, another hard drive?
From said fucking summary:
"When last this topic came up on Slashdot, the consensus was that hard drives were the best way to backup hard drives. Backup your internal HDD to an external one, and if your data is really important, have two externals and swap one off-site once a week. Is there any better advice these days?"
http://backuppc.sourceforge.net/
Get an old P3 for free somewhere and load this up on it with a big disk or two for storage, put it on your network, and run it. That's what I do and it works like a charm. I went through all the options over the years, tape, DVDs, manual copying to a server.
Backuppc backs up all my windows and linux PCs. It backs up only what I tell it to, and it does both full and incremental. Sort of a pain in the ass to set up (I use cygwin rsyncd on the windows boxes, and regular rsyncd on the linux boxes), and it works well.
Only drawback is it is still on site.
For some people their livelihood depends on the safety of their information. For most of us though, it's really little more than attachment. If you've gotten to the point where you need to backup to tape "just in case", perhaps your problem isn't so much the danger of data loss, but you fear of data loss.
AccountKiller
Just because the backup solution _uses_ RAID doesn't mean the old adage applies to it. As long as you are using it as external backups all is well.
What that phrase IS telling you to do however is not use RAID on the machine you want to back up and expect it to do what you want.
My UID is prime... is yours?
i use rsync + samba on a linux box over the network with 2TB drives in a mirror (encrypted, mirrored with debian) for primary backup and have a LG Blue Ray burner for secondary backup. I get 50GB DL blu rays for $10 each from ebay (shipped from japan in 10 packs) mostly using verbatims. I burn two blu rays at a time with copies of the sam data on both and store them separately. $20 for 50 gig is not a bad bargain.
Once upon a time, the computer you wanted always cost (at least) $5,000.
This trend ended in the late 80's. All of a sudden, package system prices started trending seriously downwards, because due to Moore's law, computer speed started outrunning almost everything you'd want to run on it. Not true for certain specific apps, including graphics and games, but for office use it was perfectly fine.
I remember buying a 200 MB hard drive for $500 and thinking about what a great price it was.
Up until recently hard drives were one of the more expensive components left in a computer package. Now? Most are under $100. That's lower than tape backups used to be at their lowest prices. It's true, right now the best way to back up your hard drive is a second hard drive.
IMO the big question now is where that second hard drive will be. You can stick it in your computer and mirror your main drive in real time easily enough, but that means a virus or software issue will ruin both drives simultaneously. Better to sync them once a week? Perhaps.
Of course, this won't help you if there's a house fire. The fireproof hard drives are still darned expensive. Internet-based remote backup is great, if your broadband can handle it.
Anything else is *so* last century.
Warning: this article may contain humor, sarcasm, parody, and perhaps even irony. Read at your own risk.
For home backups, blue ray will do a fine job.
Cuneiform tablets work well for me. Don't store them in a flood zone, though.
I have so many drives from so many years of external data that I do 2 Time Machine backups to 2 separate drives, a 3d SuperDuper backup about every 3 months to a drive stored in a safe deposit box and I avail myself of MobileMe sync services. Like I said, I use SuperDuper for the backups to the externally stored drive. That way, if a Time Machine backup is corrupted or wonky, the pooch is not you know what. On an ad hoc basis I backup my user folder using SuperDuper to a portable firewire bus-powered drive from OWC... so, I really have 4 backups. Between MobileMe, my smtp email, my use of gmail and the panoply of manually run backups, I feel pretty safe. Why feel safe? Bc I have done backups to a single drive before and ruined the backup and the main drive simultaneously. It was my fault, but I learned to never depend on a single backup again. Yes, it takes time, but the 3 months I needed to sift through 5 years of data (I used Data Rescue II from ProSoft successfully) was a lot worse. The other 2 Macs in the house are also backed up to 3 backup drives each using the same approach.
I use a pretty wide set of media for backups for my closet of hacked together servers; anywhere from SATA hard drives that I just hot-add and copy to, to still using CDs and DVD media for a tangible backup. But in the end, I really love tape. I've got a Dell Powervault 100T 20/40GB DDS-4 external SCSI tape drive ($40 eBay tops) and with a few open source tape backup tooks from freshmeat.net and a handful of tapes, I feel really confident in my backups. To me, tapes have ALWAYS been a reliable tangible fashion to back super important data or server/OS content and get it back, reliably. I've been doing this really light setup for almost 3 years now and it's never failed me and it's really not that expensive at all if you stay on the back half of tape technology. Upfront cost wouldn't be any more than $100 if you did some serious online shopping for the tape drive, 5-10 tapes and a SCSI LV card.
Tape isn't cheap but it is cost-effective. I love, absolutely adore, the HP 1/8 LTO-4 autoloader that I use at my office, and I'm thinking about getitng one for personal/home use. Nothing else gives the kind of throughput we get from that thing (both reading and writing) and as for reliability, I would like to hear from anyone who has ever had an LTO-4 tape fail in service.
-fb Everything not expressly forbidden is now mandatory.
Since when is tape unreliable? My DLT has a MTBF of 250,000 hours. I've used DLT, DDS, and Travan for years and I've seen far more HDD failures. I've seen plenty of tape drives fail, but not the tapes themselves. I trust my tapes far more than any spinning platter. Come to think of it, I trust my tapes more than any other backup I use (Optical disc, HDD, and Cloud). Once my station wagon full of tapes caught fire on the highway, but I blame that cheap-ass roach clip.
That's still roughly 10 bucks a disk for 500GB. You can buy 2x the HD storage for less than that price.
While optical media has its advantages, the convenience of an automated backup solution to an HD or multiple HDs means it's more likely to happen, thus is more useful. I do incremental backups to an external HD on an hourly basis. How do I do that on BD without it becoming very quickly A) expensive and B) damn inconvenient?
Let's face it, at the cost of HD storage, there's really no better general case solution.
I'm curious if this is breaking some kind of record for repeated questions. I mean I know it's a common thing but come tf on.
One problem I've run into using hardware raid is that if there is a problem not with the disks, but with the hardware controller, you can be locked in to buying another of the device that screwed you.
However, with RAID 1 or software raid, you can easily just put the software on another machine and be up and running. For this kind of thing where performance isn't the biggest factor, that's what I go with.
If you look up the reviewes on the drobo, everyone seems to love it until it fails. Then you seem to be completely SOL.
For really valuable files (the ones I won't ever be able to replace if I lose them: my own documents, my photos), I burn a monthly DVD and drop it alternatively at my parents' and brother's.
For the rest of the junk (media files: music, videos, books...)that are very large but not that important (or easily replaceable), I have a large external HD to which I clone my main HD once a week. I then keep the Backup HD off-line until the next time.
The Cloud - because you don't care if your apps and data are up in the air.
So what if you have to use a bunch of them? You can buy a giant stack of them for dirt cheap. That porn collection will impress your friends more when it takes up physical space.
I'm thinking of backing up to another hard drive and I'm torn between an external USB drive or another SATA disk. Considering I would probably (if we're being honest) leave the USB disk connected at all times (for a daily backup) with full knowledge that in the case of a fire it'd be toast along with the machine... is there any incentive to use USB over SATA? It's pricier and slower, right?
As a Slashdot discussion grows longer, the probability of an analogy involving cars approaches one.
it depends mostly on how much data you have. I have a couple of terabytes. I also have a daughter living 5 miles away so I backup at my house to external drives and swap those backups with another set kept at her house. If you have 50megs of data (many people have very small requirements) then an online backup strategy might be very handy. You can even get 32gig and larger(?) USB flash drives that are more than adequate for most people who just want to backup their email and pictures/data. Tape drives are fine for geeks but access is slow and rebuilding a drive becomes more of a chore. Definitely not for mom and dad if they aren't geeks. External drives give total bootability (or the potential for it) and for most people are the easiest way to do a complete mirror of your HD. For data, most people can fit all their data (if they can even find it) on a USB Thumb Drive.
rsync.
Ok, that covers the <5% of users who can set up and maintain a backup systems based on rsync. What about the other 95%?
As an interface to set up a backup system for a moderately adept geek with sufficient focus to set up and maintain a recurring rsync backup, an above average grasp of the layout of their filesystem, and the presence of mind to alter their rsync script as their computer changes over time, rsync is extremely powerful. For everyone else, it's next to useless.
hardware RAID 5 for minimising impact of disk failures. And one external hard drive backup, kept in offsite location. Also, sync the most important data with your laptop, if you have one, so you have kind of two external backup media.
Backups are:
- off line (viruses, power surge, sabotage...)
- off site (fires, theft...)
- tested (i've got horrors stories of people that THOUGHT they had backups...)
- multiple (... and of backups that turn bad at the worst possible moment)
So backing up data is a hassle, and can be expensive depending on what you need: a DVD, a BD, an HD... But pretty much the only foolproof solution anyway is to burn your data onto a media you then send away to your parents' or other trusted 3rd party. Once a month is the very minimum.
If you're using HDs, you may want to re-use them after a while, but don't forget to keep some very old ones, for when you realize ages after the facts that one of your files got corrupted.
The Cloud - because you don't care if your apps and data are up in the air.
There are three kinds of data:
1. If you lose this data you will go to jail.
2. If you lose this data, your business will be impacted.
3. If you lose this data, you will have less options for entertainment.
#1 tends to be a megabyte or less.
#2 tends to be a few hundred megabytes of documents.
#3 tends to be terabytes.
My company has a PDF of every document that we've touched in the past decade (federal law requires this retention), and our entire business continuity backup fits easily on four LTO-4 tapes, plus a very less-than-full tape that we rotate for offsite storage weekly. We've explored every backup system out there and this is by far the most cost-effective for us.
I don't understand why the OP claims "tape is unreliable", as I have not heard of a single instance of in-service failure of an LTO-4. As for it being expensive, it is, but before we went to tape we were using Firewire800 external drives, much more expensive than tape cartridges, and not as reliable as some people have been led to believe.
USB and FW external drives almost never fail as long as they are powered on. They fail in storage, which seemed pretty weird to me, since they should be able to sit on a warehouse shelf indefinitely. My low-sample unscientific data from experience says otherwise.
Since everybody is going from LTO-2/3 to LTO-4, you should be able to get LTO-3 transports pretty cheaply.
But my first advice is to identify the data in categories #1 and #2, where you might realize that it's a good practice in any case, to store the important stuff with its own priority. This is the hard part. Identifying what's actually important. If you don't do this, no matter what backup system you end up using, you're going to be burying the important stuff in the noise, introducing risk.
The OP also mentioned Drobo. I have a Drobo and I love it, but I must warn you that it's pretty slow, even with really fast drives. Don't expect to be able to copy a terabyte to it in less than 40 or 50 hours, even with firewire 800. This is the problem that drove us to tape, which is much faster than any filesystem we can feed it from.
-fb Everything not expressly forbidden is now mandatory.
I just can't be bothered with slashdot any more. It's full of dummies with mod points. How do I get off the Internet?
Do I need a megabyte of backup capacity for every megabyte of storage? No, I decide what's important and how long it's important for.
Deleted
Well, I don't think there's an ideal solution, but I can tell you what I do.
For software, I find Bacula to be a very effective solution. It's open source, cross platform, and very flexible. Bacula was designed with tape in mind, so it takes a bit of wrangling to make it work well on hard drives - but once you get it set up properly, it works quite well with disk.
Now, I back everything "critical" up using bacula onto my ZFS array of cheap drives on my Solaris box. I just let ZFS itself do the compression (I didn't benchmark this, Bacula's compression may be more effective) and retain the backups on disk for about 2 months. I do nightly incremental backups and monthly full backups, but Bacula gives you lots of different options in this respect.
I then take the Bacula backups and rsync them over to external media weekly. I also take and keep 4 zfs snapshots of these backups on my external media, so that I can go back 4 weeks prior if I need to. I also rsync over to a separate smaller external drive "every now and then." I keep that other drive in the opposite side of the house (better would be to leave it at a friend's house, but I'm lazy - I just have to hope that one end of the house survives a fire / theft unscathed).
I have another class of data - data that I deem important, but also capable of being re-acquired at minimal expense. This data gets no incremental backups, and is only rsync'd around. It gets put on the larger external hard drive, but not on the smaller secondary drive. Beyond that, I have a third class of data, which I deem completely expendable. This is mostly normal recordings from my MythTV machine, which I consider an acceptable loss, and these aren't backed up at all.
At the end of the day, there really is no magic bullet. I really like disks + bacula, but what works best for you will depend on what you're trying to back up (and how much value you place on making sure that this happens properly).
The backup strategy is the same as it's been for the past 40 years or more: keep copies of your data offsite, where you can get at them in a reasonable timeframe should anything happen to your local storage. The technology used to implement that strategy changes with time, though. Personally, I made the decision many years ago that removable media was a waste of time. The only sensible option is to have storage powered up and accessible at all times. So I rsync to a hosted server in a remote datacentre (the initial sync took forever over ADSL, but the nightly diffs are pretty quick)
"The invisible and the non-existent look very much alike." -- Delos B. McKown
Here's a silly idea: buy a hard drive and a low-power always-on computer and negotiate with a friend to do the same. Backup over internets to each other's new computers. Then you can get away with just one backup device that also happens to be offsite.
Ghost Virtual Machine gives 15gigs of Amazon.com data storage and right now if you use the promotion code of "launch" you get 10Gigs more as a bonus for 25Gigs. If you want to give me a referral my id is orion_blastar there, and each person you referred grants you 5Gigs more in a bonus.
Google Docs also has document storage but does not give as much as G.ho.st does. The Ghost Virtual Machine can access your Google Docs drive as well.
Here is a review of the top 5 online cloud storage sites so you can take your pick.
MyBloop offers unlimited free storage, but I am not 100% sure of that or their privacy policy.
Lifehacker talks about using your Yahoo Mail account for unlimited storage and also that Google's GMail almost offers the same service as well.
Remember, Slashdot does not have a -1 disagree moderation, and no, troll, flamebait, and overrated are not substitutes.
Do nothing(TM) is the cheapest solution to the backup problem. By doing nothing you ensure that no amount of time will be wasted looking fr the backups if you lose your primary storage !
As an interface to set up a backup system for a moderately adept geek with sufficient focus to set up and maintain a recurring rsync backup, an above average grasp of the layout of their filesystem, and the presence of mind to alter their rsync script as their computer changes over time, rsync is extremely powerful. For everyone else, it's next to useless.
On the other hand, if you thought you could ask on /. you probably match this description...
Like most people, I have a small amount of truly irreplaceable content (documents, pictures) and a whole bunch of "it'd be annoying if I lost that" content (music, movies). One of the really convenient things about this split: the truly irreplaceable stuff is not very large. My docs and pictures occupy about 15 GB, and most of that is pictures.
I have an external hard drive where I back up everything at least nightly. This protects me from accidental deletions and a failed hard drive. It doesn't protect against fire or theft, though.
Services like Mozy and Carbonite offer off-site backup for about $5/month (there are many others -- these are the two best known, I think). I could string together something with a spare drive and a friend, but frankly, it would take a year or two before that approach matched the cost of Mozy et al., and frankly, I just don't WANT to worry about this crap. I'll pay the $60/year to make it someone else's problem.
One interesting option: Crash Plan at http://www4.crashplan.com/consumer/index.html . They offer free backups to friends' machines, and paid backups to their own fileservers. Sounds like the best of both worlds, but I haven't gotten around to trying it yet.
--
Now first of all, I think it is perfectly fine to use RAID as a backup as long as it is *not the primary storage* of the machine you are backing up.
I have been considering Drobo as a backup device but the high price keeps putting me off. However, unlike standard RAID implementations, Drobo's is unique where it is expandable. Just pop out the smallest disk and pop in a larger one. I think for people as myself that have growing libraries and growing home directories, maintainence of a backup device that easy should be appealing, since it gives us more time to think about how to migrate data off primary storage to a larger disk, or how to add an extra disk to the LVM, etc.
Aren't external usb/firewire/esata drives cheaper than $200/500 gig? I think they are around $100, so you get $10 per 50 gig. It's secondary backup while they aren't permanent you can rotate through them and reuse them.
Tahoe ( http://allmydata.org/~warner/pycon-tahoe.html ) uses crypto to allow friends to share their storage space without exposing what is being stored.
So, you could get a bunch of friends to agree to get 1TB drives and set up Tahoe so that you could all push backups to these drives for backup purposes. However, I'm not sure how you would push the backups to the Tahoe file-system. Since the company behind Tahoe (All My Data) uses it to provide backup services, there must be some thought that has gone into this, but I don't know if the open-source side of it has any tools for it. I've heard rumors that something like Duplicity will support Tahoe as a backend, but I haven't looked into it.
But, this would take care of the off-site secure replication component.
As long as you don't lose your crypto key to the data when your house burns down, of course...
Sean
We must lay out the kinds of failures and goals of a backup to determine how best to back up.
1. We would like to protect against mechanical drive failure. This can be done with a RAID.
1.5. We may also want to protect against the failure of other components of the computer. I recently had a computer die because its motherboard died, and it took about two weeks to get a new computer, and the new computer was a significant upgrade so it had SATA instead of IDE. In the mean time, I needed my data on other systems, and when the new computer came, I needed to borrow a USB-IDE bridge to recover some stuff that I wasn't backing up.
2. We would like to protect against accidental deletion of files, file corruption, or edits to a file that we have now reconsidered. This can be done with snapshotting. In source code, to reconsider and edit to a file is fairly common, and is the reason why most programming projects use revision control systems. Other options like nilfs or ZFS snapshots can also fill this goal. This goal is accomplished more easily if the backups area automatic and the backup device is live on the system.
Depending on your needs, this goal may be counterbalanced by a need to not retain the history of files for legal or other reasons, and this should inform your choice of backup strategy.
3. We would like to protect against filesystem corruption, whether by an OS bug, or by accidentally doing cat /dev/random > /dev/hda. This can be done by having an extra drive of some sort that isn't normally hooked up to the computer. Tape drives, CDs, and DVDs have traditionally fulfilled this purpose, and this is where the use of additional hard drives is being suggested. Remote backups, via rsync can also accomplish this. For this I use git.
4. We would like to protect against natural disasters. For someone living in New Orleans, it would be nice to have a backup somewhere outside the path of Hurricane Katrina. Remote backups may be pretty much the only way to accomplish this, unless you're a frequent traveler and can hand-deliver backup media to remote locations.
5. In addition to any of the above, the code you use create said backup may be buggy, or may become buggy or misconfigured over time. Checking the integrity and restorability of your backups after creating them, and keeping several (independent) previous versions of a backup may help here.
You may not be concerned with the various modes of failure described here occuring simultaneously. For example, it may be unlikely that you need to deal with file system corruption at the same time that you regret one of the edits you made on your file. In that case, your offline backup device doesn't need to hold all of your snapshots.
Just encrypt and twitter your backups. As long as you use decent encryption, should be fine for a decade or 2.
I just rdiff-backup to an external drive connected by USB every night. Just put the command in cron.daily and it will run at about 3:00am. If you have to take action yourself to get a backup, you aren't doing it right. You need to have a process that runs for you so you can forget about it. Rdiff-backup provides a capability like Apple's time machine so you can get snapshots of data from past days. It is available for linux, mac, maybe windoes. The external drive is in case some tragic event happens to the computer and fries the components, the external drive may still survive.
When the backup got too big for my old external drive, i just swapped it for a bigger one. The old one then became an "archival backup" that I can save for long term storage.
Can you find some thing that can retain data for, say, 40+ years and be still able to restore them?
Everything comes to a cost. If the tapes are too expensive, then just buy the same storage you are using now and make just a second copy. But that would not be a backup. Just a second copy as unreliable as the original.
Maybe Computers will never be as intelligent as Humans.
For sure they won't ever become so stupid. [VR-1988]
My files are rsync'ed every night to an off-site other machine. And the other machine's data are rsync'ed to mine. After the rsync, the remote machine creates an incremental tarball of all files that have changed since the last tarball. Occasionally, two consecutive tar increments are merged into one to save space.
I hate to be that bitter old pessimist, but this has been debated to death and back here on Slashdot many times over. I swear, it should be in the FAQ by now.
All of the times this question has come up (feels like at least once a month), there have been many very good suggestions. Why should we rehash them for the nth time?
Blu-ray will become cheaper as soon as the new Blu-ray competition gets started.
Bender: "Hey, what's this? Hermes' dreadlocks, and his arm? Leela, I'm shocked! Food goes in the disposal, hair and flesh go in the trash."
I really think a home file server (NAS) is the way to go now. Yes, still hard drives. If you really want to keep your data safe you should plan on purchasing duplicate drives each time you shop. 1 will remain in your home NAS. The NAS could be as simple as an old 800 mhz computer that you throw into a case with 10 internal bays. Personally I use an open source solution called SyncBack. It's extremely basic but it allows me to backup any drive connected to my home network to my NAS. Similarly, there are a lot of suggestions for Acronis. I like my SyncBack just fine.
Back it up to a server on the moon
But the one thing that's remained constant is hard drives are far too large to backup via conventional means.
Since drives are expanding faster than my critical data, all my computers and external drives have the same directory unison synced to a master copy on my mail fileserver.
Machine specific files live in a subdirectory of the shared backup directory. For example ~/backup/server or ~/backup/mythtv
Task specific files live in a subdirectory of the shared backup directory. For example ~/backup/genealogy
Sensitive files are mcrypt'd to a .nc file. For example, financial spreadsheets, scans of important documents.
Portable devices generally spend most of their time far away from the computers or are clipped to my keychain.
I also occasionally manually run a handy backup script on my computers to copy /etc/network/interfaces to ~/backup/$machine-name/etc/network/interfaces which is later synced. I backup about 10 config files on all my computers, and more on special purpose machines.
I have no idea if unison and mcrypt are available on non-linux machines... I don't use windows or mac to do anything important, only to play games.
So, pretty much every mass storage device I own has a complete and full backup of all my important, partially encrypted, data. As of this morning, all my important stuff is about six hundred megs.
I do pretty much the same thing with bulk media such as music collection, etc.
People whom are digital packrats might have trouble with this lifestyle.
"Science flies us to the moon. Religion flies us into buildings." - Victor Stenger
The first pre-requisite is that it's automated.
Whether that's through cron or Windows Service or whatever, it needs to happen transparency at a regular interval, because otherwise people will forget about it.
And setting up a calendar remind doesn't count, as it will be simply delayed/dismissed. At most you may have a reminder to swap offsite disks (put the current one in you bag/briefcase NOW to take away).
"Another hard drive?"
Yes. Tape has not kept up with hard drives, in fact they're now more per gigabyte than hard drives, very different from 90s prices. Even just the tapes themselves are more expensive than hard drives per gigabyte, and that doesn't include the thousands it can cost for a multiple terabyte autoloader tape drive.
There is really nothing else as cheap as hard drives per gigabyte. I use a external USB2 SATA dock and swap a few SATA drives. And honestly I'm not all that worried about backing up with modern operating systems. We've come a long away from Windows 95, where I was restoring from my Eagle Travan-3 1.6/3.2gb once a month.
my karma will be here long after I'm gone
Jungledisk is a nice front-end to Amazon S3 for which I pay around $1 a month. The transfer fees for S3 are irrelevant over time, and I end up paying around $15 a month to back up my entire personal dataset from 3 different machines. I can get to any of my stuff from anywhere if needed, and the incremental backup system keeps things current with no overt action from me. I include my personal mp3 collection (big), digital photos, and everything else I care about. I spend a lot more than $15 a month on soda, so the $15 seems like a bargain to me.
There may be cheaper solutions out there, but why bother? This one is plenty cheap enough, and works great for me.
Online backup solutions are only as safe as the administrators make it, and they have to rely on a trusted method. What method do they use, and why is it any better than one we use ourselves (i.e. a redundant backup in a different location using said method)? What's more, online backup solutions could put our data at risk in other ways.
Twinstiq, game news
That's the protocol. Now what media do you recommend? Another hard drive?
Personally I use a dedicated backup server which has 2 x 1TB discs in a RAID1 configuration. The cron-scheduled backup uses Dirvish to create multiple historic backups, and the backup server is in a different building from the machines which it is backing up.
Not quite as good as multiple historic tapes, but have you seen the price of 1 TB tapes these days?
Accidental file deletion Check
Disc failure Check
File system corruption Check
Fire damage Check
Obviously there are other risks which it doesn't cover, but you sometimes have to ask yourself just how important is your data? I'm not running a business these days.
Rsync -av --delete my-hard-drive offsite-hard-drive
Simple.
Effective.
*GASP!*
It's N-sync!
... Amazon S3 mounted on FUSE. USD $0.17 per gigabyte to store offsite, and with FUSE I can browse it like it's local. I am very happy, and saving money too.
What happens if some malware comes along and does an 'rm -rf [...]' at the location of your FUSE mount point?
I think one pre-requisite for a proper backup is that it's offline (or somehow detached from the live system). If it's online then the 'backup data' can be taken out just like your 'regular' data in some cases.
Of course it's up to you to determine how much of a risk this is, but for me swapping external hard drives once a week between home and work is quicker and easier.
That's exactly what happened to us... the hit took out 2 Macs and the power bricks/adapters for nearly everything else electronic in the house. And it was a strike across the street that travelled thru the dsl line, not the well-protected outlets. I always have at least one backup NOT connected and stored off-site since then. The other awful thing that convinced me to use 3 external drives was backing up to a single drive and having a bad thing happen to both the main drive in my PowerBook and the backup at the same time. The screwup was a funky restore from backup (I'll never use Intego Personal Backup again). Yes, the stupid things happen and you'd better be ready...
Yeah, "another hard drive" is a good system. My policy is to never rebuild a flaky hard drive. If Windows goes wonky, which seems to happen every so often (with four XP machines at home, I seem to need to do some form of system repair every year at least), I just buy a new hard drive and either restore to it or start from scratch.
Then, the older hard drive gets shelved. That way, I have at least a snapshot in time, even if it's a few years old, and even if Windows won't boot I do have the critical files accessible. It's easy to access them by sticking the drive in a USB external shell.
But my DAILY backup routine is Norton Ghost doing weekly baselines and nightly incrementals, to external USB drives. This saved me many hours of rebuilding my wife's machine this week, when XP encountered the dreaded MUP.SYS freeze on bootup. I simply bought a new (larger) hard drive, shelved the dead one, and restored from Norton Ghost's backup. Aside from a few hours of MUP.SYS troubleshooting (to determine it WAS a dead drive and not some wonky peripheral), total restore time to a "just like she left it" system was a few hours, mostly waiting for the 100+ Gb to transfer from the USB external.
Finally, every so often I dupe my backup externals to a second external. These "safety" backups get stored in a second building on my property, so I have fire insurance.
--Brandon / Split Infinity Music
Rule #1 - It's not backup until its out of the building (3.5 kilometres away for insurance purposes) Rule #2 - Do 2 backups with different brands of software ( system & retail ) Rule #3 - Do a restore frequently to check the backup (don't trust the logs)
I just got myself a QNAP NAS (http://www.qnap.com/) on which all my systems backup. That NAS gets replicated to a USB disk from time to time, and I keep that disk offsite. My plan is to get rid of the USB disk and buy a second QNAP NAS to put at the remote site, and let them replicate over the internet. Reason: I can schedule that so it is not forgotten, and no more hassle of moving disks offsite. :)
Reason I went with QNAP: runs linux and is hackable out of the box
I back up all my hard drives with external hard drives. I have two internal RAID 0 and I back up each of them with a single encrypted hard drive (example two 640's backed up to 1.5T, two 500's to a 1TB). I then stick them in a copper mesh lined, fire and water proof safe.
I'm pondering building an ejection seat/escape pod to activate whenever the safe is in danger. But then I'd have a giant hole in my roof that I'd have to explain to my landlord.
I've run with two hard drives of equal capacity for around 8 years. For a while these were stored on a linux box. Every once in a while I'd simply run a cp with update to basically move only changed/new files. This saved me many times. These days I have 1 internal 1TB HD and 1 external 1TB HD in my Windows desktop. I'm eventually going to purchase a third external 1TB HD and swap it out in my safe deposit box at my bank. Luckily, my bank is across the street, so swapping that will be no problem and take very little time. I don't plan to do this once a week...maybe once every 4-5 weeks. The odds of a hard drive crash are probably greater than my apartment burning down anyway.
Can you find some thing that can retain data for, say, 40+ years and be still able to restore them?
The only reason those tapes failed is that some idiot decided to erase them.
Anybody want my mod points?
The only true redundant solution is DFS and DFS Replication with the addition of Shadowcopy. If anyone knows of a linux option that rivals these please speak up, I've been looking for one for years. RAID is just a really large single point of failure.
DFSR gives you the option to replicate live data and it works REALLY well. Plus you can add extra replicas on a replication lag. You could even have an extra replica on a replication lag across a VPN link to someone else's house. There are other off-site options like Mozy which work well.
Of course, this is all pretty complex. When it's working, though, your data is pretty safe.
Don't tell us you're using LVM for critical data such as backups.
LVM does not implement file system barriers.
Betcha didn't know that, did you? Scary, ain't it?
Heck, might as well google for why barriers aren't enabled by default in Linux....
Safest home backup solution
Use standard commodity parts for easy replacement/reconstruction.
ie. Consumer grade writable disc's and external hard drives.
Backup Home data to DVD/BR/CD regularly (Monthly)
Backup Bulk data with 3 Hard disks!!!!
1) Use one Live HD
2) Use one to backup Live
3) Use ANOTHER to backup Live again
The Reason?
If your backup is large, You will probably need to erase the target drive.
At this point you are VERY vulnerable.
With your only backup erased,
You had better hope your live drive doesn't fail while backing up again.
Or you may accidentally delete something from the live drive WHILE backing up.
With a 3rd hard drive, At least you can recover from your last backup if your current backup somehow nukes itself.
Dangers....
1) Don't have a live drive and think your safe with another live drive synced, You really want to keep hard drive backups IN
STORAGE. If it's live it can and probably will die. It's just a matter of time or circumstances. This goes for raid too.
2) This is home storage, So 6/12 month hard disk backups should root out any bit rot or hard disk seizures.
"What about the other 95%?" Over the years I became an old and bitter sysadm... you know what ? They just need to do what the 5% did: Put their asses in a chair and Read The Fucking Manual... and read again, and again until they understand the subject.
No, I'm not user friendly, I do not need to be... people are asking me for help anyway.
For example you deleted a file last week that you need now. A supplier has sent this month's data with last month's filenames and overwritten last month's data. Your database has 'acquired' corruptions and now you need to go back to find a working or clean version. A duplication strategy just means you have two copies of the bad stuff!
Here is what I do
The history chain has extending time gaps between copies eg 0.9,1,3,7,15,30,90 days. So for a daily backup of a file that changes every day the two most recent copies are always shifted down the chain and every 3 days the second is bumped down to 3rd position which in turn might bump 3 to 4 if 3 is more than a week newer than 4. This is ever so easy to program - I even did it in a DOS batch file.
Let's review what happens if the computer goes bang! - Reload from USB hard drive and flash. Alternatively if data gets corrupted - Trawl through the history on the HD.
This is the only story where being modded "Redundant" is a laudable goal.
Personally, In-home backup to mirrored Iomega NAS, and offsite to a USB enclosure which goes away weekly.
This is an ugly but workable script I wrote for linux/mac to make multiple snapshot backups where the data set is rather smaller than the target media. That's not the case with me, or probably most of slashdot, but it works for Moms and laptops.
I like music
On the other hand, if you thought you could ask on /. you probably match this description...
If you had to ask on /., you already don't match the description.
Of course, this won't help you if there's a house fire. The fireproof hard drives are still darned expensive.
Data backed up to a set of USB hard drives in rotation, with no more than one in the same building at the same time, should survive a burning building as long as it's not part of a fire of 1871 scale.
"What about the other 95%?" Over the years I became an old and bitter sysadm... you know what ? They just need to do what the 5% did: Put their asses in a chair and Read The Fucking Manual... and read again, and again until they understand the subject.
That's not what they did.
First, they were born/nurtured in such a way to have above average technical aptitude.
Second, they were interested enough in how computers work to tinker and learn and gain a broad base of knowledge about their computer and OS.
Only then did they "put their asses in a chair and Read The Fucking Manual... and read again, and again until they understand the subject."
If you expect the 95% who did not go through the first two parts to skip right over into the third part, you're in dire need of taking your ass out of the chair and meeting some Real Fucking People.
No, I'm not user friendly, I do not need to be... people are asking me for help anyway.
Do what I do. Tell them, "yes, there's a way but it's rather complex. Do you want me to explain it?" The answer is almost always "no". Because they really don't want you to explain it, they want you to do it. If they say yes, you'll probably be asked to stop in less than 60 seconds.
1. For my media I do two 1 TB external HDDs with a robocopy run every 48 hours (to avoid the "oops, I deleted something I shouldn't have" RAID 1 problem)
2. For my cannot-lose-it small documents (KeePassX file, financial spreadsheets, etc.) I put them in an encrypted archive format and upload periodically to Gmail, to work as an offsite backup.
It'd be sweet if I could simply back up everything offsite, but with only 100 KB/s of upload bandwidth to work with on my DSL connection, that's just not feasible.
Ubuntu users are blessed with Back in time. All it is is a gui interface to rsync, but the gui is quite nice. Since the rsync keeps the same directory structure, you can always check your backups by simply viewing the files on the backup drive.
"Be grateful for what you have. You may never know when you may lose it."
Put simply: No, there isn't.
Backup your internal HDD to an external one, and if your data is really important, have two externals and swap one off-site once a week. Is there any better advice these days?"
There isn't any better advice, but you ALWAYS have "really important" data.
The most likely things you are going to want to back up are documents and spreadsheets, pictures, videos and of course code. If you can't afford more than one external drive, or even don't want to spend anything at all, the big G (yeah, I'm a fanboy, but there are probably equivalent options) provides help. Google Docs for the first two: search for "google docs synch" and the first option is freeware (not that I've used it - nowadays I just use Docs itself). Picasa allows you to keep pictures unpublished, not so sure about youtube etc. And code you can mail to yourself, (g)zipped, although it might be on your home test/dev machine and also on your commercial web server.
I store and backup all my "couldn't live without" files in a particular place, and back it up regularly to a
I agree that HDD is the best solution. But as for the software, dirvish has our preference: it does nice incremental rsync backups over ssh, using hardlinks for files that are identical. It also warns you when a backup fails.
They should use Time Machine.
Where are my mod points when I need them...+65535 insightful
Yes, "RAID is not backup", in that you shouldn't simply RAID your primary drive and consider the backup problem solved, but backing up to a RAID array can be advantageous -- you do disk-to-disk backup (via any of a variety of methods), and monitor the health of the RAID array closely -- if any disk in the array goes south, replace it promptly and your backup stays consistent. And, if you keep a spare drive or two around, you can swap a drive out occasionally to take off-site (and let the array rebuild onto one of your spares).
Personally, I like the ReadyNAS Duo a lot more than the Drobo (hard to explain, I just trust their tech better, and the ReadyNAS is natively networked, rather than needing an afterthought add-on). Last I checked, Amazon will sell you an empty ReadyNAS Duo and a couple WD Green 1TB drives for ballpark $500. That said I haven't got a ReadyNAS yet (because money has so many uses these days); I'm using my second most favorite backup setup, a 500GB laptop drive in an external bus-powered FireWire enclosure. I'm using a MacAlly PHR-S250CC enclosure (which I'm very happy with), using a drive I already had, but for a complete setup, I'd probably go with one of Other World Computing's packages for about $150. This loses RAID (which I ultimately want very much to have, for reliability), and isn't networked (which would be good for backing up multiple machines, and ease of use), but the bus-powered drive is so damned easy to use that I actually do it every day (set the drive next to my laptop and plug one cable between them, Time Machine notices the drive and starts a backup, 5-10 minutes later it's done, and I unmount the drive, unplug the cable, and put it back on the shelf).
My primary machine is a Mac; I use Time Machine for daily backups, and use SuperDuper to clone my MBP's drive onto the same backup disk every few weeks (minus a number of large directories that I know Time Machine is getting anyway); this gives me a backup drive I can boot from (via SuperDuper), and a lot of incremental history stored in a very usable manner (via Time Machine). And a backup system that I actually use because it's painless.
Add a ReadyNAS, and I could have my laptop automatically backing (hourly) up any time it's on the home network.
As far as on-line backup goes, I haven't been convinced yet. It eats a lot of bandwidth, and it means that someone else (that I don't know personally) has a copy of all my data, with only their promise of encryption keeping them honest. Sure, there isn't much there for anyone else to get worked up about (a variety of legally purchased music and software, a bunch of old email and vacation photos), but if it's not out of my hands, then that's one less thing I have to worry about. I do love DropBox for moving non-confidential files around, but I wouldn't use it for backup.
I use Time Machine + 2 external drives for my laptop. One drive is in my lab at school, the other drive is at home. It gets synced to both regularly and I have an offsite solution in case one burns down or gets stolen. If you have a desktop you could always swap external drives once a week and keep one at the office or other remote place. If you were more paranoid you could leave one in a safe deposit box as well -- a 1TB external drive is all of $100 now.
Like others have said, there are usually a few different types of data you have.
I categorize mine as follows
1) Entertainment. Music, Movies etc
2) Important stuff. Photos, cv, code etc
For me, there is no need to back up 1. If I lose it, I can buy it again or do without. 2 must be backed up. Tar and compressed by date onto my one and only local HD. Each time I tar, I copy changes to a 16GB usb key. It's cheap and simple
Here's a partial list of my backup directory
gav@r2d2:/data/sda4/backups$ ls -l
total 2257188
-rw-r--r-- 1 gav gav 99853231 2009-02-08 11:38 2008.02.08.home.tar.gz
-rw-r--r-- 1 gav gav 235315200 2008-12-06 15:52 2008.12.06.docs.tar
-rw-r--r-- 1 gav gav 126540165 2009-01-04 10:55 2009.01.04.home.tar.gz
-rw-r--r-- 1 gav gav 328816640 2009-01-12 23:09 2009.01.12.home.tar
-rw-r--r-- 1 gav gav 98706437 2009-04-05 18:07 2009.04.05.devkitPro.zip
-rw-r--r-- 1 gav gav 6322458 2009-04-05 17:45 2009.04.05.documents.zip
-rw-r--r-- 1 gav gav 91485066 2009-04-05 18:06 2009.04.05.workspace.zip
-rw-r--r-- 1 gav gav 5239082 2009-04-22 22:12 2009.04.22.documents.tar.bz2
-rw-r--r-- 1 gav gav 249277582 2009-04-22 22:16 2009.04.22.home.tar.gz
-rw-r--r-- 1 gav gav 230007959 2009-05-17 10:43 2009.05.17.home.tar.gz
-rw-r--r-- 1 gav gav 574637109 2009-04-05 18:54 2009.05.17.photos.zip
mozy == No Linux support
until that happends, its trash for most of us
Higuita
Here's what I do.
Nightly (my data do not change that frequently) I backup important directories to another drive (used to be internal but is now an external 1TB disk). I have set rdiff-backup to keep 30 days worth of backwards diffs. I weekly mirror the latest copy to an off-site disk which happens to be connected to my workstation at work as they don't mind. I do not mirror the backwards diffs that rdiff-backup use to get old revisions of files.
This gives me screwup protection up to 30 days and theft/fire protection of the current data. The disk at work is encrypted. I thought of encrypting the external 1TB disk at home too but the disk of not being able to restore the primary backup outweighs the protection from datatheft if the disk should be stolen in a break-in.
No matter what backup software or script you use to backup your files see to it that you can go back at least a couple of revisions to be able to recover from the most common source of data corruption... you.
Disks are pretty cheap these days. Some might argue that the space needed for backups are too much, as much as the data you are backing up. Yes it often does but it's worth it. Once you have lost a great amount of files in a screwup or disk failure you gladly pay the price for an extra disk.
Now if I only could find a very user-friendly backup software for Windows that gives rdiff-backup functionality and is also wallet-friendly I could convince my father to start backing up his data.
What the phrase means is that simply installing a RAID on a given system is not a backup for that system. It is still more than acceptable to use *another* system, with a RAID or not, as the backup for the original system. For years, we've been running one large systems with tens of disks in a large RAID array as our backup system, for the other systems. The point of backups is to have the data in more than one place, regardless of where that place is. DVD, tape, an external disk, etc.
Drobo, or something similar, would be a perfect device for a home backup server.
I have data on my hard drive that is probably illegal in certain jurisdictions. Now that governments are prosecuting people for merely possessing information, like demolition training materials or photographs of one's own children playing in the bathtub, regardless of what jurisdiction you live in, there is no way I'm going to let that out of my physical control. If I encrypt it before uploading it, they'll just discover that it's encrypted and use the rubber hose attack until I give them the key.
I keep an external hard drive that's five times as big as my normal one, and do a daily backup keeping up to five previous copies. Of course most of them don't ever change, like executables, so the drive doesn't really need to be that big.
I piss off bigots.
Only wimps use tape backup: _real_ men just upload their important stuff on ftp, and let the rest of the world mirror it ;)
- Linus Torvalds
http://en.wikiquote.org/wiki/Linus_Torvalds
This is a pretty cool sumary of why and how to backup data. It deserves a +5 interesting.
WHS provides automatic backups, multiple images. Use an offiste storage solution such as Mozy for Uber important files.
I too use backuppc to a external HD to backup my main computer and my family systems
it do data deduplication (file level, not block level) and compression... its easy to use and works perfect.
i swap HDs every week
now backuppc needs to add a option for block level deduplication (usually is a little more efficient in size for many files) and, more important, upgrade its rsync protocol (the latest versions of rsync improved alot the protocol, faster and smarter)
Higuita
I have about 60GB of *stuff*.. the usual digital excrement, with about 6GB of absolutely irreplaceable stuff, so I rsync the whole 60GB of stuff to a USB 500GB drive, and to my home server nightly, and then once a week do a sync of the 6GB of REALLY important stuff to Amazon S3 via Jungledisk.. Two fantastic products for backing up stuff AND getting it offsite in case of disaster.. I used to use Mozy, but they don't support Linux and even if I was still on Windows, their backup manager is pretty braindead.. The cost for Amazon S3 is dirt-cheap.. My initial upload of the 6GB of stuff and 3 weeks of incrementals after made my first Amazon S3 bill a whopping $3.42.. Now with just incrementals, it averages around $1/mo... Don't have any financial interest in either of them, just like them...
aws.amazon.com
www.jungledisk.com
THANK YOU, Edward Snowden!! Americans owe you a debt of gratitude (whether they know it or not..)
I use CrashPlan. I set it up for my family and you can then backup to their PCs and they can backup to you for off-site backup (via Invites), likewise, you can still backup to an external drive internally. I've been using this method the past year and its been fabulous.
It depends on what you want to backup. I, for example, like to keep an image of my primary boot drive available for each of my systems just in case I do something to destroy my OS. For this, I have a 3.6TB RAID5 managed by Slackware 12.2 on a headless box. The RAID5 ensures that my backups won't be lost to a hard drive crash (the system has 5 1TB drives), and the amount of space allows me to have multiple backups of my primary systems (each is partitioned to have a 60GB partition for the OS/major programs and the rest of the space is available for games/other fun things). Fortunately, the 3.6TB is overkill, so I also use it as a media server for my house. By the way, I built that system for $700.
-Rigor Mortis
Not popular on Slashdot, but I'm very happy with Windows Home Server. We've got a bunch of Windows machines around, and it images everything automatically. I do manual backups of important stuff (e.g., vacation pictures, mail archives) on DVD and thumb drives, and do a total clone of the WHS backup directory every two months (it's less than a terabyte, and fits comfortably on a cheap HD).
The Linux boxes just have source code, so a git push to a Windows box suffices there.
Backups are stored both off-site and in a local fire safe.
Any sufficiently advanced technology is insufficiently documented.
But RAID is a perfectly acceptable backup when it is a backup - as in a second system.
The first data protection should be to run ZFS with redundancy - that is already much safer than "RAID" (for reasons easily discoverable by brief web reading).
This can be mirrored to a second system.
you had me at #!
Like the major banks, my HDD is too big to fail :)
Personally, I have a two-fold system:
My real important data, textfiles, spreadsheets, business-related data, etc. (100MB at the moment) lies encrypted (via encfs) on my main machine, my laptop and another off-site-machine. The encrypted folders are synced via Dropbox.
My not-quite-so-important-data like, uhm, self-made and legal rips of my bought dvds and cds reside on external harddrives. Metadata of all of this (folder- and filenames, avi-metadata, varying checksums) is backed up with the real important data[tm](c) and I rely on the power of the internets to being able to resurrect my data in case of a crash.
This system clearly is not suited for an amateur video cutter, but in my and similar cases, where there's a few magnitudes of difference between the volume of irreplaceable and internet-enabled-reconstructable data, this method should work pretty good.
FYI, even if you use your own key, Mozy only encrypts the contents, not the filenames. That's rather insufficient. A court could subpoena them for a list of your files, establish that particular files exists, and require you to produce them. See http://michaelshadle.com/2007/05/07/mozy-the-backup-client-damn-close-but-still-no-cigar/
Plug: In 2006 I founded https://spideroak.com/ specifically to provide a zero-knowledge approach to online backups. We don't know anything about your data, including your file and foldernames. On the servers we just see sequentially numbered data blocks. It's written in Python and C and we've always supported Linux and OS X (and Windows if that's what you're into.) SpiderOak keeps historical versions of your files and deleted files forever (or until you decide to remove them) and will sync folders for you across several computers. Some reviews are http://www.linuxplanet.com/linuxplanet/reviews/6644/1/ and http://www.maclife.com/article/reviews/online_storage_battle_which_cloud_backup_service_reigns_supreme
... of backing up stuff by e-mailing it to a forwarding loop using a bunch of free e-mail accounts/aliases.
"I love my job, but I hate talking to people like you" (Freddie Mercury)
We use rsync between the various machines in our home office. Every machine runs Linux except my daughter's, which runs XP (her Care Bears game doesn't like WINE).
I have a laptop machine, a desktop machine, and a headless fileserver for my personal use, and my wife has a laptop. That is 4 machines that are used to back up her work directory (she works from home in recruitment for a franchise company) as well as photos. I use them to back up code, emails, and local copies of websites. Everything is scheduled via rsync and cron to run in the wee hours.
I keep my ebooks collection on all 3 of my machines, I guess that qualifies as the medium category.
Unimportant stuff like music and video are kept strictly on the fileserver.
Periodically I zip up the work-related crap and upload it to a cheap VPS that also serves as a secondary DNS server. The VPS is less than $8 per month and comes with 2.5 gigs of storage and 100 gigs of transfer.
Multiple redundant nightly backups using existing hardware, with offsite backup for $8 per month. Can't beat it for SOHO.
I'm not a crazy hard drive filler like a lot of people but here's my attempt to make my life simple. Try not to fill your personal hard-drive with media you didn't create yourself: all sorts of movies, books, and mp3's. If you truly must have this stuff in archives, then put it on another hard-drive not mixed up with your personal stuff.
Your personal stuff:
-----------------------
1)do keep your emails in a dedicated directory named with the date of creation i.e. from20July2009email
2)do keep your personal pictures in another directory...ditto
3)do keep your personal videos in another directory
4)do keep your work related projects in another directory
There now we've got it simplified down to 4 main directories to backup.
I've successfully backup to external hard drive by simple directory copy because I don't have that many directories to deal with.
It takes about 20 minutes to backup while attending and observing it closely.
I've tried "pybackpack" recently which is in the ubuntu repositories. It's a gui for selecting the directories and keeping them in a backup project file. It's worth a look.
Now with regards to backup of your computer's customized "just for you" settings, well from what I've seen for Linux recently is partimage
http://en.wikipedia.org/wiki/Partimage
They've even got an article recently published a couple of weeks ago in the Chinese weekly computer newspaper called "Dian Nao Bao" which has dedicated half-page to Ubuntu every week. I can't understand a word of it except they do publish the english buzzwords in it like 3g, android, partimage, wardriving, and mplayer.
prefixsuffix is cool. It's a little tool that changes groups of file names quickly. It might be useful for your backups.
Ok, that covers the <5% of users who can set up and maintain a backup systems based on rsync. What about the other 95%?
As an interface to set up a backup system for a moderately adept geek with sufficient focus to set up and maintain a recurring rsync backup, an above average grasp of the layout of their filesystem, and the presence of mind to alter their rsync script as their computer changes over time, rsync is extremely powerful. For everyone else, it's next to useless.
So can anybody say something to rsync-like features in other backup software? Surely there should be loads of commercial ones that do this differential copying thing?
Yahoo News : " On one end of the power-brick-shaped device are two USB connections, and on the other end sits a single 10/100 Ethernet port. Attach that side of the device to your network and you can hotswap any number of USB-based storage devices (be they hard drives or flash-based) to the two connectors on the other side." Apparently the product has an unfortunate conflict with an existing web hosting service. Product page here.
So can anybody say something to rsync-like features in other backup software? Surely there should be loads of commercial ones that do this differential copying thing?
Just googled for it and at least found DeltaCopy, a Windows wrapper around rsync incorporating a scheduler. If that's done properly, it should be alright for the average Windows user.
And honestly I'm not all that worried about backing up with modern operating systems.
Modern operating systems don't protect you from:
Best thing at the moment for home backup is to mount an encrypted external hard drive and copy to it, then take it off-site. If you think that sounds over the top, then I predict one day you'll be sitting at your terminal saying "aw, shit".
Advice: on VPS providers
*LOTS* of floppies. You might be surprised at how cheap they are now, too. :D
Cheers,
Dave
They that can give up essential liberty to obtain a little temporary safety deserve neither safety nor liberty.
Ben
It has to be automated or it won't work. If your backup scheme requires physical intervention to function normally, like swapping drives, it is destined to be a failure. It also has to provide live, directly addressable snapshots. Archival formats just don't work very well these days because it takes way, way too long to get to the data you might need to get to. Sure, a prudent company will always have some sort of archival or off-line storage for extreme emergencies, but the primary means of backup and restore today is live access. Only a really stupid person uses tape in the first two layers of backup.
One has to consider what one can do with the ridiculous amount of storage available on today's drives. Just a few weeks ago I slapped together a 10-Terrabyte 'test' filesystem with 2TB drives and a port multiplier enclosure. One E-SATA cable, 10TB. Its ridiculous how cheap this stuff has gotten over the last few years. But in terms of using all that storage, one of the goals of HAMMER (and to some extent ZFS) is to integrate live-snapshots into the filesystem itself. In the case of HAMMER's transactional storage (on DragonFlyBSD), you get fine-grained history, trivialized snapshot management, and bandwidth-controlled streaming mirroring for backup.
So the setup I have now is each box running HAMMER (on DragonFlyBSD) stores about 60-days worth of daily snapshots and one day's worth of fine-grained history on the production filesystem itself. Some of my boxes don't run HAMMER yet, and those I just cpdup/rdist over to the LAN backup box. Then there is a daily backup from the boxes on the LAN to a HAMMER-based LAN backup machine, and a weekly off-site backup from the LAN backup box to the off-site box. As I convert more of the machines over to HAMMER I can use the far more efficient streaming mirroring feature for HAMMER-HAMMER backups, and even make the daily backup a near real-time stream (and then cut to a daily snapshot each day), but particularly between the dedicated HAMMER filesystem on the LAN backup box and on the off-site box.
The big advantage of having the ability to maintain hundreds of snapshots is of course that you do not need to use the horrible cruft-multiplying 'hardlink' trick to maintain each snapshot. My LAN backup box can maintain about 120 daily snapshots of all my machines and my off-site box can store over a year's worth of weekly snapshots. My base backup set is well over 200GB now. Each day adds a few gigs, so a single terrabyte drive can hold a considerable number of snapshots.
And in terms of taking drives off-line for storing on a shelf, which is a reasonable thing to do as long as NOT doing so doesn't interfere with normal backup operations... it is far easier to do that with fewer larger drives then to do that with drive arrays. If you are taking physical possession you really only want to have to deal with one drive. In the case of HAMMER I just have a second mirroring stream going to a dedicate easily-pullable drive (via an E-Sata hot swap enclosure) which does not interfere with the nominal automated chain backups.
ZFS's raid-z is quite nice, but not really suitable for a backup system unless your backup needs exceeds a few terrabytes. It's a waste of power and money to construct redundancy for a single filesystem when 2TB drives can be bought and one can construct redundancy by creating multiple, independant backups. Ultimately it does get large enough to warrant at least hard-mirroring so things continue to click along if a drive failure occurs, but for most people its just wasteful to do that. If one already has a chain of backups (and thus redundancy in the form of completely independant copies), the occasional failure of a portion of the chain isn't that big a deal.
Where ZFS really works well is on the production system in a non-clustered environment. HAMMER doesn't really do in-filesystem redundancy (i.e. we'd have to depend on a hardware or software RAID layer), but HAMMER's goal is ultimately (as in the ultimate goal of the project) to integrate into a cluster of independantly-run copies of the same filesystem and in that sort of topology one does not actually want each individual copy to itself be redundant... it would just be wasted storage.
-Matt
rdiff-backup to an external server. Retain as many incremental versions of the data as needed. I keep 90 days for my HOME and 30 days for every server (currently 6 running in VMs)
Use a ZFS-based NAS or server storage in a RAIDz config and take snapshots constantly. Use `zfs send` to remotely mirror your data to an offsite server - Amazon supports OpenSolaris servers and ZFS storage CHEAP.
Oh, if you're on windows, there is an rdiff-backup version, but it's only good for data. Use Clonezilla to backup your apps and OS.
I run backuppc on a free Pentium III running linux. It backs up itself (tar), 2 window machines (SMB), a macbook (ssh and tar), and my website (ssh and tar). It is fully automatic, and does full, differential, and incremental backups. My wife can browse backups and restore individual files herself. If a file appears in more than one backup, even if from different machines, a single copy is stored. I now have 20 full backups, and 93 incrementals using just 36 GB. For our document folders, we do incrementals twice a day.
I make an occasional offsite backup by imaging the disk to another one (craigslist!) giving it to a family member for storage.
CONS: doesn't know about Mac resource forks, windows multiple data streams, ntfs permissions, ...
Configuration required condsiderable tinkering. A full restore would require reinstalling the operating system, then putting user files in their proper places.
Don't mess with The Phone Company. Piss them off and you'll be using two tin cans and a piece of string.
I am always annoyed by the repetitive alert beeps coming from the hard drive when I back it up.
The best home backup strategy is "now, securely, and with understanding". Get 99% of the benefit by doing it now.
The best way to improve "now" is "secure". Secure your backup by tying it down, and taking a copy to a remote location.
If you do it "now" you will have a chance of recovery when disaster strikes. If you do it "securely" then you will have a chance of recovery when a bigger disaster strikes. If you understand how the backup process works and what you are backing up, you have a good chance of being able to do the recovery yourself.
If you want to protect against a house fire or theft 3 drives is the minimum.
1st drive is your main drive, which you want to back up
2nd drive is your primary on-site backup. Preferably only switched on for backups, which are performed regularly (at least weekly)
3rd drive is your offsite backup. It's there in case your first 2 drives are corrupted, stolen or lost. It is your backup of last resort and lives at a trusted relative's house that lives more than 10 minutes away but no more than about an hour. Backup to this drive every month or two.
Remember also that your exposure is limited to however often you back up your drives.
Some more tips, if your data is really important and you have the money to do it:
NEVER overwrite existing data on a backup drive if you can help it. Incremental backups are best. Once you fill a backup drive, stow it and get a new one. Only do incremental backups to the drive. Anything you overwrite may be overwritten with something that has become corrupted.
If you want to get more secure, rotate your offsite backup drive. You'll end up with multiple offsite archive drives.
Once a drive is more than 5 years old, copy it to a new drive. Do not dispose of or overwrite the old one. Hold onto it until it dies.
These posts express my own personal views, not those of my employer
Best thing at the moment for home backup is to mount an encrypted external hard drive and copy to it, then take it off-site. If you think that sounds over the top, then I predict one day you'll be sitting at your terminal saying "aw, shit".
Replace that external hard drive with a service like Amazon S3 and a piece of software like Time Vault and you have a real contender for a viable Home Backup strategy (albeit one for geeks). Expecting people to perform off-site tape backups from home is a bit much.
As someone with a fair amount of experience with backups in corporate environments, I realized years ago that achieving the same thing at home with tapes was simply not practical. I'll assume you're using something like BSD or Linux like I am, so I'll ignore commercial backup software, but even then there's plenty left to be confused about.
/root, /tmp and /var) on a 4-disk RAID-5 array. This system performs very nicely, but the only thing I'm perhaps not 100% comfortable with is the fact that the backup partition is also located on the array.That may seem safe enough, but now I feel that maybe I should have used an extra disk, or even a pair of mirrored disks, for the backups instead.
First of all: hardware. Years ago, I gave tapes a try, but the only affordable drives I could find, from Onstream, kept breaking down on me and the tapes were not too cheap. I would have preferred DLT, but that technology was prohibitively expensive for home use. So, that left disk-based backups. The good thing is that disks are fast and the space is cheap these days.
Software. Mainly I use a backup package called faubackup, which I think is simple and easy to use. It's very economical with disk space because subsequent backups consist largely of hard links to files that were backed up previously and have have not been changed since. Data from remote machines is first copied to the backup server with rsync, after which that data is backed up depending on rsync's success. I use my own scripts to coordinate faubackup and rsync.
Redundancy. I'm luckily in that I also maintain similar servers for some friends in other parts of the country and have managed to convince them of the advantages of backing up our data to each other's servers. Rsync is really good for doing this, but obviously I leave out the directories with the bulky media files when copying data across the Internet (we all have ADSL with only 1 Mb of upload bandwidth).
Disks. Until recently I've always configured servers with two disks: one for the system and data and the other for backups only. It's a cheap solution that I've always been very pleased with. Since each of the two disks has hundreds of gigabytes of space available, I can afford to maintain many more backups than with a typical 21-tape strategy. The only problem that I've encountered so far is that moving the backups to a new disk can be very time-consuming: as opposed to copying only it's contents, you're forced to copy the entire partition over because of all the hard links it contains.
Alternative disk configuration. The last server I installed uses an SSD for it's boot and root file systems (using the 'noatime' partition attribute) and houses all of its data (along with
CDs have pressed pits, in CD-Rs the information is written into an organic dye layer bye the laser -- no pits, no pressing.
better backup solution? It is simple :) :)
slice backup files to 750MB, name it as XXX.SuperPorno.avi, and share it into any P2P network
... pay the electric bill for your server backup project?
I find that FreeNAS and a pair of mirrored raid1 drives is suitable.
Storing data on a single RAID array, on your daily use machine is not a backup.
Storing your data on a RAID array, then copying that data onto another RAID array, on another machine, *is* a backup.
(of course, it's not a spectacular, super awesome backup that will protect your data from flood, fire, and Jerry Bruckhiemer, but it is a backup, and it's a lot more than most people and many businesses ever think to do.)
Cheap, Easy, Good. Pick two. A network aware 1TB SATA NAS should set you back less than $400 shipped.
There are some people that if they don't know, you can't tell 'em.
Someone was just telling me yesterday about the virtues of rsync.net, which is around 65 cents/gb under their best discount plan. I felt that was reasonable for backing up important documents or even my source code repository (a few gb) but certainly not for bulk backups of multi terabytes.
> until they run out of duct tape and bailing wire to keep their WWII-era copper plant patched together, and even then.
Yeah, um, that's FiOS.
I'm completely convinced that the reason Verizon started deploying FiOS -- particularly in Florida -- is because they've been practicing Cut-to-clear for 30 years, and their outside plant is, essentially, unrecoverable; it was cheaper to spend $10B doing FTTH than to try to regroom it.
I've never found a Verizon outside-plant type for whom that did *not* ring true. And, as a datapoint, I have 45 T-1s in 2 buildings; I lose approximately one a month because someone from Verizon stole (usually only) one pair to give to another customer.
I report it, they fix it, and presumably, some other customer then loses one pair.
Most backup solutions are silly. The common user doesn't need a massive RAID array nor would they even understand how to recover if they ever experienced a disaster. Online backup solutions are *ok* but it is more likely that your hard drive dies than your house burns down. Now this isn't the perfect solution, but it is easy and it works. I highly recommend the ShadowProtect image based backup software. Matter of fact I would recommend any type of image based backup software. Being able to restore your entire computer with all your applications and data is really freakin' nice. More importantly, if your motherboard dies, you can do a hardware independent restore with your image so that you can restore your image to a totally different machine with totally different hardware and Windows won't bluescreen all to hell. All thanks to hardware abstraction layer. Yes we all wish we had off-site fireproof backups, but I trust my data with myself and only myself. So a couple external HD's with a backup image I think would be plenty sufficient for most people "in-home". And who said tapes were unreliable? Tapes are the longest lasting form of media ever in existence. hard drives die, and ink-based CDs and DVDs have a 5 year life expectancy.
*plays the Apogee theme song music*
I have a 2TB Hammer Storage array, configured as RAID-1 at home to backup my laptops. Do I need to backup daily? Nope. I tend to copy on occasion to the backup array. If one of the drives goes, the other is still readable. Seems to work fine for me. :)
Whoa cowboy.
Sure, "RAID is not backup" in the sense that your primary storage, if RAIDed, is only protected from disk failure, not file deletion or filesystem blowout.
RAID very mych is a backup if you've set up a second seperate box (maybe on a seperate power grid, better yet in another building, or perhaps on two sets of removable hard-drives that get swapped between your home and an offsite loaction (your drawer in the office? your mum's place?) routinely.
If you're willing to go the extra mile, here's my take on it:
Failure modes:
[a] Single Hard-drive failure
[b] I deleted a file accidentally, or a file got corrupted by an improper shutdown, or the filesystem got hosed.
[c] Two Hard-drive failure
[d] Machine blowout, or Major Filesystem blowout (>20% (or whatever % you put aside) of the blocks on the volume got changed),
Solutions:
---------------
Failure Mode [a] -> RAID 5
Cost: Pay for 1 extra hard-drive whose volume you can't actively use (If you wanted 4TB, you gotta buy 5 x 1TB harddrives or a blackbox that incorporates these)
Failure Mode [b] -> Run linux on your fileserver, with LVM or LVM2, allocating 10%-20% (my figures) of your volume to snapshots (depending on how much change you expect) and having an automated scheduled job perform a snapshot weekly, mount it so it can be browsed anytime, and blow out redundant historic ones (while keeping, say, one from the beginning of every month for yay many months).
Cost:
* Have to run a linux box rather than off-the-shelf storage blackbox
* Some linux work required to set up and work out what to do in a real-life failure mode.
* Some linux scripting knowhow required to automate properly
* 10-20% of your overall post-RAID volume allocated to snapshots (YMMV).
=== a bit about snapshots === /movie.iso, the entire block (4MB would be a good ballpark for an LVM block) that this naughty bit lived in will be replicated - the old (pre-corruption Jan01 copy) 4MB block will be copied onto the snapshot space (that 20% we set aside earlier), and the new (corrupted) one will be copied onto the live filesystem.
To the uninitiated, a snapshot only "saves" the old versions of blocks that got changed when they actually got changed. It doesn't stress the machine for an hour duplicating your gazillion-terabytes at any stage, nor does it care about files or filesystems as it sits below the filesystem.
Hence, If you took a snapshot on Jan 1st, nothing would physically get copied anywhere on Jan 1st.
If, consequently, on January 4th, a single bit got changed due to corruption following an unclean shutdown in a 9GB DVD iso called
We now have the same thing we would have had without the snapshot (a corrupted file on the primary volume), except, the snapshot retains an uncorrupted version of a 9GB file, which is costing us just 4MB to retain. And even protects us from filesystem corruption.
In real-wold terms, since our Jan-1 snapshot has been read-only live-mounted since Jan01, say, under /snapshots/0101/movie.iso, we can go back to our historic snapshot once we discover the file is b0rked and copy the pre-corrupted one out of there.
=== end bit about snapshots ===
Failure Mode [c] -> RAID-6
Cost:
* have to run a linux box rather than off-the-shelf storage blackbox
* Some linux work required to set up and work out what to do in a real-life failure mode.
* Buy 2 extra hard-drives whose volume you can't actively use (If you wanted 4TB, you gotta buy 6 x 1TB harddrives)
Failure Modes [a,c,d] Full copy on secondary storage array
Cost:
* Any Secondary storage.
* Extreme inefficiency. If you're running a 4TB (post-RAID/LVM) array, you will invest in 4TB of disk and be able to retain one historic image, or pay for 8TB to retain two, etc. You can go fancy and do snapshots here too tho.
Failure Modes [a,b,c,d]
Cost:
* Linux with RAID + LVM on your primary storage. This gives me snapshots
-
I'm using a windows home server (HP but Asus started making them too). Basically it's a little box that's barely taller than the 4 hard drives it can hold. It's quiet and power efficient. It can centralize and store media files making them easy to share throughout the house but it also backs up the other computers in the house -- in fact it will even wake up a sleeping or hibernating laptop in the middle of the night to do the backup and then put it back to sleep when its done (something that just blew my mind when I first realized what was happening).
WHS doesn't have a concept of "raid" but you can flag folders to be duplicated and WHS will ensure that if it's possible they will be duplicated across hard drives. Adding hard drives is fairly simple. When you add a new one you can chose to add it to the storage pool or to use it to back up the WHS itself. The storage pool is pretty cool, there really aren't letter drives, WHS just maintains a giant pool of storage made up from all the connected hard drives so if you're running low on server space just add another hard drive.
Generally I'm happy letting the WHS backup my laptop and game machine. If anything happens to them the WHS can rebuild them. If anything happens to a drive on the WHS the folder redundancy will keep things healthy. I'd basically have to have a simultaneous, catastrophic loss of two or more hard drives on the WHS to start losing data and the odds on that are pretty long. One of these days I might build a small raid to backup all the media on the WHS but that's low priority right now.
For businesses Windows server 2008 has much the same capacity although it supposedly does image based backups (ie norton ghost, but WHS will do ghost backups as well with service pack 3). And of course apple's solution is time machine.
I heard this on a podcast somewhere. I don't remember which one....
The 3-2-1 rule.
3 copies of your data
on 2 different types of media
and 1 copy offsite.
Personally I use Macs, so my strategy involves Time Machine and an external HD AND a copy of Mozy for online/offsite backup.
On the LInux side you could use an external drive and either rsync, or any number of Time Machine clones, and for your offsite backups, you could use Jungle Disk to do online backups to Amazon S3.
"Replace that external hard drive with a service like Amazon S3"
I don't know about you, but I'd be a old man before my internet connection would upload a terabyte worth of data.
If it's only a few gigabyte and you might get away with uploading to Mozy. Anything larger than that and you're back to hard drives.
my karma will be here long after I'm gone
I'm using a backup scenario which uses swapping out mirrors (RAID 1) in combination with Windows Volume Shadow Copy services. Twice per day my system (Server 2003 or Server 2008) takes a Volume Shadow snapshot (VSS?), which is how you get that "oops" protection - its similar to having an always available, instant restore tape library built right into the filesystem (BTW, Novell had this feature 10 or more years before Microsoft, but I digress). You just need to make sure you have plenty of free space on the drive to accommodate the snapshots, but the algorithm is very efficient since it only grows when files have changed.
Then, once per week, for the off-site disaster protection, I swap the external eSATA software mirror drive out, remove the broken mirror under disk manager, import the foreign disk from last week and recreate the mirror. Bingo - just a single drive to keep up with. I have a hot on-site mirror and an offsite mirror no more than a week old. Its quick and convenient and performance is excellent (and cost is minimal since the features are built-in to Windows)
I'm not sure if there's an equivalent to Volume Shadow Copies under Linux, but the software mirroring is there and works quite well.
There's no place like 127.0.0.1
"my current backup system is to have 2 equally sized 500 GB drives and I use Acronis on a schedule" - by junglebeast (1497399) on Sunday July 19, @03:44PM (#28749481)
See subject-line, because I agree with, & use, your mixture & solution here - HDD's are "the way", & don't co$t too much (a 40gb unit WD JB series 7,200 rpm unit of EIDE PATA variety - these are CHEAP to find nowadays, in used parts/mom & pop shops for computer parts)... I use FULL BACKUPS though, only, & I use "maximum compression", & it fits a 150gb image down under 30mb in size iirc...
HDD's (slightly older & smaller ones) are very fast compared to other forms of media also (for faster restores & backups)...
APK
P.S.=> I backup once a month only, after MS patch Tuesday, & until then, I use a RAID - 1 mirror (not depending on it here for backup, just uptime moreso (but, some folks consider that a form of backup (I do, & I don't - mixed bag on THAT account))... apk
"I want to make a ham sandwich. Conventionally these contain bread and ham, but I'm an idiot so I want to make it from dog hair and epoxy resin".
Leela: And that sandwich you're eating is made of old discarded sandwiches. Nothing just gets thrown away.
Fry: The future is disgusting!
A thumb drive is not a reliable backup, but it's a crucial backup, if all your other plans keep your data at home.
If you've got some crucial data which you absolutely cannot lose, or which you'd need in a hurry if a tragedy happened, you should have that up to date on your keyring. (Encryption is your friend -- use it.)
Why? Because when your house burns down, you lose all your backups except those you've move off site.
So yes, back up to another HD. Back up to disk. Mail those disk somewhere. etc. But also make a copy of the truly crucial stuff you need right away, and key in on your hip when you leave the house.
Once a month I set my computer up in a circle of lit candles, spray a bit of booze around, and behead a live chicken.
Ok, that covers the <5% of users who can set up and maintain a backup systems based on rsync. What about the other 95%?
They should use Time Machine.
Which requires OS X Leopard.
So you're saying that all of those users who don't own a Mac with the latest version of OS X should either replace their non-Mac PCs with new Macs (Mac mini starts at $600) or upgrade to OS X Leopard ($130)?
Insightful indeed.
I currently have two raptor drives with a mirrored ZFS setup that I use only for backups. I take snapshots regularly so that if a file gets deleted or overwritten I can just roll it back. Again, this is just for backups. I work on my desktop PC and copy documents, etc. over to my server once a week or so.
It does the job and the redundancy is comforting.
Homonyms are fun!
You're driving your car, but they're riding their bikes there.
A hybrid approach:
An external HD in whatever format you choose (USB drive, NAS device, etc.) is an excellent start since it's fast and local. However, it doesn't handle the "Holy fucking shit!" scenario (e.g. flood, theft, earthquake). In that case, you should use an online backup service like Mozy and let them handle the "Holy fucking shit!" scenario since it's their job.
"Modern operating systems don't protect you from: Oops. Didn't mean to delete that."
Honestly, I've never accidentally deleted something that couldn't be recovered.
"Oops, my wife/kids didn't mean to delete that."
Best solution for that is a second PC. I bought a used 2.4ghz IBM for $65 and a slightly damaged 19" LCD for $50 on Craigslist. That's probably about the same price as the external 1 terabyte drive and far more convenient than taking it off-site, and you could just backup files across the network to that PC.
"A bug in the new release of Gnomovision ate my existing Gnomovision files."
Virtual PC
"Break-ins, electronic or otherwise. Your hard drive eats itself. Fire, flood, etc."
All very rare, but there's always that chance...
Honestly my dream backup solution has always been a low-power PC on a UPS inside a fire-proof cabinet in my attic. Unfortunately I didn't think 802.11g was up to the task with only 54mbps, but now that wireless-N's 300mbps is practically here I think this might be the year.
my karma will be here long after I'm gone
Disk is cheap nowadays.
Use RAID and snapshotting (previous versions, shadow copies, whatever your OS of choice calls it) to take care of the "oh my disk broke and I need to get this TPS report done" factor (staying up and working).
Add an external disk for offsite backup ability.
For corporate use, there are other ways and means, but for home use that would be my method of choice at the moment.
I run: Windows, OS X, Linux, FreeBSD. Just because you have a hammer, doesn't mean everything is a nail.
Translation: I can't keep my tracker ratio if I waste my parent's upstream backing up their financial records and my baby pictures to Mozy. I downloaded all my movies and have no physical pressed DVD as backup.
Yea, when diffs are large it totally makes sense to just do the whole thing instead. \o/
It's pretty cool if you ask me. I just invoke the script, and it automatically rsyncs a few directories with a local mirror. rsync is very nice because it only copies the changes from the folders that it's syncing. It's very efficient. while the local mirror is being updated with all of the latest changes, the script uses ssh to create a newly timestamped copy of the last mirror I uploaded to my server. The script then compares the local mirror and the remote mirror, and only sends the changes. This way I can back up all of my important files locally and remotely while using as little bandwidth and local processing power as possible.
Well its been almost a month. Technology changes very rapidly, and JollyRoger wants to know "Is there any better advice these days?" Since, you know, last month. Lets see there was Gbridge, PathSync, DirSync Pro, Jake, SyncToy v2.0, Unison File Synchronizer, cron run rsync's, and JBOD in a Storage Tower from Addonics.
Since you know technology changes so rapidly, you'd definitely want to trust your backups to something bleeding new. And "these days" are so much better than "those days".
I tackled this problem with an encrypted (dm-crypt) RAID-1 server equipped with easy-removal SATA cages.
I have a four-drive RAID-1 configuration (that is, every bit gets written to four different drives).
Every two weeks or so, I run a script that dismounts the file system briefly (to render the on-disk state consistent), and pop out two of the RAID drives, leaving two working.
I then mail the two removed drives to my two friends who hold on to my off-site backups (priority mail flat rate box $4.95) and they mail back the drives I sent them previously (I include prepaid labels).
In the meantime, while I'm waiting for my friends to mail drives back, I take the two drives I'd previously gotten back in the mail and put them back in the server. The server recognizes them as out of date and starts rebuilding, and a day or so later, I'm back to having four identical drives.
Lather, rinse, repeat.
There are usually four drives in the server (the two permanent ones, plus the two that will make the next backup), and four drives "out" (two being sent to the undisclosed locations, and two being sent back). So in the event of subtle corruption, I have two complete redundant backup sets: one two weeks old, and one four weeks old.
This works for me because my storage needs are modest enough that a single 1-TB drive holds them comfortably. Next server I build like this will be 2-TB, and that will hold me even longer. Astonishingly, drive capacity is growing faster than my ability to consume it (I used to have multiple boxes of external drives).
It's a lot of drives: eight drives doing the work of one. But drives are cheap: the current sweet spot seems to be $130 for 1.5TB, or about $1000 for the disks and another $500 or so for the rest of the server (those SATA cages are surprisingly expensive).
You could use fewer drives: maybe only single drives as backups, rather than pairs in different locations, and maybe just wait for drives to return from the field, rather than having a shadow set always there and spinning. But the extras are pretty cheap as cheap insurance goes. And, of course, you don't have to mail them (and actually I hand-deliver one of each pair, because one friend is local, but it's comforting to know that my other shadow is 2500 miles away).
If I wanted, I could just pop in an extra drive (or dissimilar pair) for a day and create archival snapshots to store permanently off-site but I haven't really had that need.
I haven't lost a drive in the mail yet, and the rebuild process is a pretty good proof that the drives are working.
I made a previous version of this idea with external USB disks, but SATA drives are MUCH more convenient (and cheaper to mail). All those USB enclosures and their inefficient little power supplies and twisty cables made for a very clumsy (and toasty) configuration. Now, I have just a single full-tower chassis and the cooling is efficient and built in.
This solution is entirely based on commodity parts. Anything that breaks, I can replace with something new that's as good or better. I deliberately made my RAID pairs from dissimilar drives (Seagate and WD), to stave off concurrent failure modes, and even sourced my Seagates from two vendors so I got two made in Thailand and two in China (I think). I used to do tape backup, and man, was that a pain: drives go obsolete, formats disappear, media always gets more expensive, and it's slow and linear. Of course, if I had 100 GB of enterprise data to protect, I might be more interested in a tape-based solution--but it would be a lot more expensive. Disk is clearly where the cost-effectiveness is being pushed most heavily.
Encryption is essential. It would be absurd to be mailing around cleartext drives. But a strong password means that the whole-disk encryption is effective against any plausible attack. I keep the password well-protected and never change it, although it would be easy enough to do that with dm-crypt.
I w
An external HDD that also archives individual files. It is very cool.
As shentino (perhaps unconsciously) points out, the proper verb form is "back up", not "backup". The word "backup" is a noun, not a verb. (This is true of most compound words ending with "up" or "down", such as "makeup", "shutdown", "letdown", "screwup", etc.) So, in the summary, the word "backup" should be "back up" in the following phrases: "too large to backup via conventional means", "the best way to backup hard drives", and "Backup your internal HDD". The word "backup" was correctly used (as a noun) in the following phrases: "Best Home Backup Strategy Now?" and "RAID is not backup".
Backup your internal HDD to an external one, and if your data is really important, have two externals and swap one off-site once a week.
For the vast majority of home users this is still the best advice.
In Soviet America the banks rob you!
When people say that raid is not a backup, they mean that simply putting raid on your existing storage will not give you backup, not that something using raid cannot be a backup. Anything which results in two or ideally more independent copies of something is a perfectly valid backup mechanism, regardless of whether the two copies are both on raid devices or neither is. The biggest question with any backup mechanism is what disasters will wipe out your backup and your original, and whether you can live with them. A copy of a file on the same physical medium will protect against accidental deletion and small scale disk corruption, but it won't protect against a failure of the medium. A copy on an external drive constantly connected to the PC will protect you against drive failure, but may or may not protect you against a power surge(depending on what causes the surge and a certain element of luck). A copy on a NAS device in the same building will protect you against the destruction of your computer, and some surges, but may or may not protect you against damage from a lightening strike, fire or other natural disaster. A copy in another building offers additional levels of protection.
Of course as you move out along that scale, cost increases sometimes dramatically. An external copy is more expensive than a NAS device is substantially more expensive than an external hard drive, offsite backup is generally substantially more expensive than a NAS device, etc. It all really depends on how important the data you're trying to back up is to you, and how difficult/expensive it would be to replace. For regular home users, most stuff isn't really all that important and any archived copy will probably do. There are some exceptions of course, family photos, wedding videos, a lot of the things which actually don't get seriously considered in backup routines, but most stuff really just isn't all that important. Even for those things though, it's probably enough to keep multiple dvd copies with a few stored at another location, and reasonably frequent new copies to prevent bitrot.
Because hard drive technology has outpaced tapes etc, I'm afraid they seem to be the only solution as far as I can tell. And if your backup requirements exceed the size of a single drive, yes I think Drobo is the way to go.
Sure I wish there was a tape solution that could do the job, but unless you have major corporate sized budget, I can't find a product that can do it.
So go DROBO as far as I can see. On the other hand, if you've got that much data you probably are already on Drobo, and now need a 2nd one to back it up.
Drobo is good.
Paradigm: 1. backup daily as disks do fail. 2. store offsite in case you have a burglary or fire
If you have a broadband and a flatrate: /home/myaccount/myrsa.key" /myvolume myaccount@myfriend.dydndsn.org:/myvolume"
Mirror your Linux box with the one of a trusted friend using rsync - and he mirrors to your box.
"rsync -avz --delete -e "ssh -i
If you have no broadband or flatrate - use 2 USB disks (Truecrypted) - one of them stored at friends or at work.
Do daily backup using rsync (linux) or robocopy (windows).
Swap the 2 disks in regular intervals - say weekly or monthly.
use www.mozy.com to backup your "crown jewels" that change daily - 5GB are free there and its all encrypted.
But, not Apple's.
If I have a disk crash I use my time machine to go back in time, buy a backup disk and back up the one that's going to crash.
And as a bonus, sometimes doing this alters the future enough so my original disk runs just fine without crashing!
Except one guy who had it right. Buy a fucking docking station, buy 2 fucking 1-terabyte drives (times however many fucking terabytes of fucking porn you own), back up the fucking shit on each fucking drive, take one off fucking site.
That's fucking IT! There IS NO better fucking solution.
Now if you only have 100 fucking MB of fucking email, you can find a better fucking solution than fucking 1TB drives, I'm sure. But there IS NO one fucking solution that fits every fucking situation.
Now - WHAT THE FUCK DON'T YOU UNDERSTAND?
Sorry, been listening to the Christian Bale tape again...
Richard Steven Hack - This sig is TOO GODDAMN SHORT TO DO ANYTHING USEFUL WITH! MORONS!
Sun is putting SSDs on their storage devices.
Those are enterprise quality devices, not pr0n backup servers ....
IANAL but write like a drunk one.
That is the real problem that few people want to address: that we don't need to backup everything that we have.
IANAL but write like a drunk one.
To drive home the inadequacy of online backups in the *cloud" (Web 2.0 music here please).
IANAL but write like a drunk one.
In which planet do you live???
IANAL but write like a drunk one.
Use a caddy/hdd - keep it secure and safe from fire.
Test your backups
This is a home computer isn't it?
What is so vital that you need to do organized backups?
Movies? How many movies do you have there? Will you ever watch them all?
Ditto for music, holiday photographs and videos (be honest, most of them are eminently forgettable).
Important stuff is on paper or off site in a safe place (contracts of any kind, etc).
Are you a developer? Your code is in your company's servers. OSS developer? Get an account in Sourceforge.
If you really must (most likely you don't) just buy another hard disk, make an exact copy of what is in your computer, and forget about it for 6 months.
I personally think that most people need only the above strategy. If you as an individual *think* you need to backup 2TB of data reliably I will say you have far too much crap in your hard drive. I humbly suggest that you get a life.
IANAL but write like a drunk one.
If you spend some time analyzing your data, you'll find out that not all data is equal. Loosely your data is split among:
1. actual work (spreadsheets, documents, etc.)
2. media
3. programs
4. OS There are different attributes to those different types. Your work files are most likely the most important, they are also the least in size, and the hardest to re-create. Media does not change over time. New is added, but whatever is there, never changes. Programs are easily re-installed, same as OS.
What I'm saying is you need different backup strategies. For me, work files are backed up online via a service. Music media is backed up on DVDs, incrementally, so when new is added, only the new is backed up (and for me media is music only - for movies I keep the original DVD media, or if downloaded, I don't bother. They are re-downloadable. Same for programs - either I have the original, of if downloaded, I create a DVD media for it.
For my pictures collection I use a different scheme. I use online redundant synchronized copies. I have 3 copies. If one disk dies, after I fix the computer, I have 2 more copies to re-sync from.
I think that like any big problem - cutting into smaller ones gives simpler if different solutions to the parts of the problem.
5a. You might have been foolish enough to install a videogame on your PC, and the copy-protection routines might have sabotaged your backups.
Great summary, with one quibble:
If you actually have a remote site, UPS/Fedex will carry the tapes for you. You don't have to send your own human along. I imagine that with most home users the issue is that they don't have a reliable remote site to send tapes to, not that they can't get the tapes there.
The expression "RAID is not back up" is meant to illustrate that if you're only copy of something (ergo you're working-copy) is on RAID, that is not enough. Significant hardware failure, file corruption through application or OS, or file deletion by accident or a virus - any of these will destroy your local data even if it is RAID'd.
If your working copy is on computer A, however, and you're backing up to a RAID on computer B (or NAS B, whatever), that is back up. It is not fantastic back up (offsite, multiple layers of redundancy, encrypted, yada yada yada), but it is back up.
In short, "RAID is not back up" should be rephrased as "RAID alone is not back up."
"The crows seemed to be calling his name, thought Caw."
Best Mac backup solution ever = external HDD + SuperDuper
> First, they were born/nurtured in such a way to have above average technical aptitude.
Yeah, but that "nurturing" was when I had no choice but to RTFM and figure it out on my own... You do have to be interested enough to sit through explanations you don't understand and break them down piece by piece until you finally understand them, though. And implicit in that, you have to be able to remember things you don't understand at all accurately for long enough to digest them.
Just pop in another HD and use a script like http://storebackup.org/ to do the incremential backups. I do not zip my files, so the restore is the most importand part. What I use it for is if I did something stupid, more then being afraid of hardware failure.
The main things I backup are settings and some files.
Things I do not backup: CDs and DVDs I ripped. Those are on another HD that is read-only. Website stuff, as that is already backed up with my hoster.
Scripts and such I have are with my webhoster as well.
I often see that people do a complete overkill with backups. I understand that if you are a geek, it is fun to play with it. If you are a standard user, first lok at how you want the restore to be happening.
So what I do is have the backup HD mounted as read-only and read-write by root when doing the actual backup. Yes, I have lost data, but as it was for my home, who cares if I lost 3 years from a mailing list? In all that time I have never really missed it. I now even do a yearly cleanup of all my mails. I just delete them, as I never look at them anyway.
Just because I can do backups does not mean I must.
Don't fight for your country, if your country does not fight for you.
1.5 can be done with SAN storage and/or virtualization in a cluster. With the SAN solution, you just point the LUN's at another similar box and start it up.
2,3,4,5 can be done with IBM's TSM
Install a SFF system (Mac Mini, Shuttle, whatever) at a neighbours' house (across the street, preferably), but within reach of your wlan. Using your preferred method (cronned rsync, time machine, whatever) automate the creation of daily, hourly, whatever backups. Offer free internet access, PC maintenance, whatever in return. Make sure it's internet accessible for those trips.
Done: Free, fast, regular offsite backups.
Alternatively, use very long CAT cable.
Looks good, will you support OpenSolaris??
Whatever strategy you use, verify your backups! there is nothing worse than having backups that don't work or which are stuck in some proprietary format you can no longer access for some reason.
It last came up about a month ago. Even singularitards don't think things change that quick.
Confucius say, "Find worm in apple - bad. Find half a worm - worse."
My setup:
1 FreeNAS which can hold 500GB of data, running 24/7: cost about $100
1 Western Digital External HDD which can hold 650GB of data, some of it important: cost about $60
2 Windows XP client PC's with some important data
My software:
DSynchronise (http://dimio.altervista.org/eng/) runs on each client as a real time service copying all stuff from selected directories to the NAS, keeping 2 changes: cost $0
Unstoppable Copier (http://www.roadkil.net/program.php?ProgramID=29) manually runs on one client once a month to copy everything from the NAS to the Western Digital External HDD: cost $0
Benefits:
Data on client goes directly to NAS
Real time backup with 2 changes directly available on NAS via Samba
Very improbable that one client and NAS and month old backup on WD get trashed
Pretty cheap 2 way backup
Bad:
WD External HDD should not be in the same house as the NAS in case of fire.
Don't forget to do WD HDD process otherwise failover situation gets older and older each month.
Tip: Give family pictures and movies to as much people possible. In case of manual retrieval you have lots of sources.
Pfff, a full backup of my fotolibrary (17Gig) to my rackserver takes about 2-3 Days,
incrementals are done in minutes, or 2 Hours max (daily increments)
Hardly time to get to old, but good for me, for not getting to much gray hair.
DSL with good upload rates are not really expensive here in germany.
we need an "-1 Plain wrong" moderation option!
you can just plug in an external HD and click the time machine button and forget about it.
For things important enough to be offsite you can just save them on your MobileMe drive and let apple mirror it for you.
Irony here is I am one but do neither. I use rsync all over the place here and don't presently have an offsite.
I just got done downloading and building Rsync 3.06 and need to figure out how to build it for a different architecture, as I only have dev tools on my intel and I need to get rsync updated on the ppc's. (I know only just enough to be dangerous with fink/dev tools/etc and follow instructions for downloading and compiling builds)
I work for the Department of Redundancy Department.
"The safety of a offsite facility is inversely proportional to the height of the builds around it." - Population density tends to correlate to the height of building making these location statistically prone to issues. .""The safety of offsite storage increase with distance from the source of the data"." - Environmental catastrophy, the further the distance from home the more likely to avoid the same catastrophy. .""The cost of offsite storage increases with distance from the source of the data"."- .""The convienence of offsite storage increases with distance from the source of the data"." - .""Never use offsite storage facilities near Denver Co. or damn near anywhere in the Dakotas."." - Primary nuclear targets. Simply not an option for risk. .""Never store data in a city with a population greater then four times the median city size for your state nor in the state's capital city"." - While mirroring the first rule, the basis for this specific rule is in the event of calamity traffic in\out, power, riots, etc. All those population driven risks come into play. The larger the city the more risk involved. Now with terrorists factoring into risk analysis the larger cities are bigger targets.
Usually this is the result of shipping costs or fuel to get the backup from home to storage.
The closer the storage, the easier to drop tapes\disks off and to pick them up.
and the last golden rule I offer: .""Just because you backup online doesn't absolve you from due dilligence on the physical location where that online backup ends up."." - Online backups aren't magical. 'Where does the data end up and how "safe" is that location?' are questions that still need to be asked.
-=[ Who Is John Galt? ]=-
I've been using OPENRSM CloudBackup for my systems. It works well (even on my crappy DSL connection).
Their software client works on Win MAC and Linux so I even run it on my VPS and Dedicated Servers besides the home systems. And they let me do all those systems (6 and growing) with my one account.
I'm sure there's others. But this one works for me.
You folks are wasting time backing up data. It's a well known fact that hard drive do NOT fail. Data showing otherwise has just been fabricated by manufacturers of data backup products and services.
I have set my technology challenged mom's system up on a 500 GB USB external drive and have NTI Shadow running in the background. Shadow copies whatever file you are working on to the external drive in realtime. If you accidentally delete a file, it keeps the last copy you have for a restore left on the disk. For mission critical files (family business) I copy a set to my laptop 1/week.
I really should ghost her machine if I need to rebuild after a crash.
People pay you to help them. To make their computers work. And you want them to do what you've done, read all about computers and fix it themselves....and then why would they keep your mean angry little self around?
Which NOT easy method do you recommend?
There is also the option to install push-agents on each windows client, for daily backup.
The client-backup repository is based on Windows Server 2003 principles. It creates an initial image of the host, and then does sector-based differential backup. This greatly reduces the time needed to backup individual clients.
The backups can be used to restore complete images of the host harddrive (in case of complete failure) or individual files. Individual files are accessed through the Windows Home Server Console where the user can simply open the backup for any day (where a backup was performed) and browse to the desired file(s) using a normal Explorer window.
I have been using WHS for quite some time now and I am extremely satisfied with its features and performance. To my knowledge there are no push-agents for non-windows operating systems so I am scripting the backups of my Mac and openSUSE mahines. And obviously there is nothing "open source" involved. But the server runs a ton things for me that no NAS device could ever do. In addition to backup and media streaming my WHS also serves as a mailserver, enhanced local DHCP, network booting, centralized antivirus control, media library for ripped DVDs (with plugins for various players and a ton of metadata for the ripped movies) a bunch of homebre web applications (i use ASPX but it is no problem to install a mySQL and Apache webserver if that is your flavor) and remote access to my files from anywhere on the internet.
I am not trying to turn this into a FOSS-vs-Microsoft flamewar. I am simply saying that WHS is actually a pretty good product regardless of your personal feelings about Microsoft.
- Jesper
My security clearance is so high I have to kill myself if I remember I have it...
When I recently rebuilt my main home machine, I bought three brand new 750GB drives.
Two are in the machine at any given time, as a RAID1 mirror. The third drive is kept offsite. Once a week, I pull a drive from the mirror and swap it out with the drive carried back home from my offsite location, rebuild the mirror, and then take the freshly pulled drive back to my offsite storage location.
If something bad happens to my home or the primary RAID1 mirror, then I've still got an offsite copy of all my stuff no older than a week old. That's good enough for me.
Disclaimer: I am in no way associated with the following companies and/or any of its representativs or employees. I am merely an very very satisfied customer.
If you don't want to build your own system you could consider buying a prebuilt system. It saves you a ton of trouble, lowers your electrical bill, reduces your carbon footprint, and has cool features that are hard to get right in a homebuilt system (such as small size, passive cooling, hot-swap bays or wall-mount kits).
I researched the market for prebuilt WHS servers and ended up with a Tranquil SQA-5H (see http://tranquilpc.co.uk/ for the full product range) and it is probably the best piece of computer hardware I have ever purchased. Room for 5 harddrives, easy to work with, and the machine itself uses only 29 W.
Similar products are available from HP (check out http://www.hp.com/united-states/campaigns/mediasmart-server/) and a lot of other vendors (like Acer, Velocity Micro, Niveus). They are all available under the "Buy" tab of the Windows Home Server website: http://www.microsoft.com/windows/products/winfamily/windowshomeserver/default.mspx
- Jesper
My security clearance is so high I have to kill myself if I remember I have it...
I mean, google has all the answers.
http://www.ss64.com/bash/rsync.html
Get one of those free, secure, online email accounts with unlimited storage somehwere like hotmail, gmail, yahoo, etc. Then just zip your hard drive contents and email it to yourself.
Ta-da.
If you do what you always did, you get what you always got.
Okay, well, as a data protection engineer at work and a geek at home, here's my strategy (I do BOTH)
1) Near line/onsite backups: I do a multiple-times daily sync to an external USB drive for critical data. In this case, this includes things like my MP3 library, video garbage that I've collected. Basically anything that I can't replace _really_ easily.
2) Offline/Offsite backups: I use Mozy (as some other users have mentioned). $5 a month, large storage capacity. It's an always incremental backup, automatically runs when the computer is idle. It takes quite a while for the initial backup, but who cares. I don't personally allow my MP3s or some of my video junk to backup to Mozy, but the stuff I seriously care about is there.
For the stuff I'm really paranoid about, I try to make sure I've got copies dumped to my iPhone, which is always in my pocket, so it's not going to be at my home if the place burns down.
I value my photos. I keep them on RAID1 with an auto-sync to external drive. I then run CrashPlan to backup to 2 remote datacenters where I have servers co-located. I also run an occasional CrashPlan local backup to external drive which I take to the bank and place in my safe deposit box.
1 colocation box is at work, $0. 1 colocation box is in DC ($45/mo). Safe deposit box is included with my bank account ($0). CrashPlan cost me $60 or so for the software to get the advanced features.
I used to run JungleDisk but figured running my own machines would be safe enough since I have two of them at separate sites as well as my other levels of redundancy. SpiderOak looks interesting but it's not the fastest thing out there... Anyhoo, those are the 3 that I narrowed down to, you may have different tastes.
HDD are getting fat & fatter but I can't believe people need to backup 1000 MB of personal data.
Put the important stuff (= what you created yourself, not downloaded/copied/...) somewhere and back that up using some old time solution, and some new-fangled "online" backup with revision control.
For the other 999 MB (of porn) do nothing, losing that is a godsend (= terabytes of *new* porn woohooo !!!!) .....
Personally I think it's stupid to backup/burn/keep anything that's easily found on the web (like ant kind of media), by the time you'll need it again:
- you wont be able to find your hardcopy
- the particular version you backed up will be obsolete
- it'll never happened
Whenever I build a machine, I put in a second internal hard drive to which I make a periodic image. Personally, I use Acronis, but I am sure there are a ton of decent applications out there that can do it. You could even do a bootable flash drive with Linux that just runs a dd to copy the drive over.
Hard drives are cheap.
I use crashplan - it's free as in beer... cross platform and does "to disk" backups - either to a local disk or a disk on a remote system. It does data de-duplication, compression, encryption and only backs up thing that have changed.
www.crashplan.com
BlackNova Traders
What exactly is the title referring to? It means simply this: if you put a flash drive on a shelf, and NOTHING breaks EVER, you will still lose your data within 10 years. This is because the data on flash is less "permanent" than a hard disk drive - it is an electric charge trapped between insulating gates. Unfortunately, the insulation is imperfect, and the charge slowly leaks away. This means that every drive in existence will lose it's data, unless that data is periodically refreshed (i.e. manually rewrite all the data).
I would say that hard drives have a longer shelf-life in terms of the lifetime of magnetic moment, but at the same time the mechanical portions have poor shelf life. Assuming nothing else breaks, if you leave a hard drive on a shelf for years without powering it up, the mechanical parts will seize (corrosion, drying of lubricants, etc).
Your best best is to backup early, and often, and KNOW the weaknesses of your media. Basically, if you want to use flash, periodically clean the drive and re-write all data. If you want to use hard disks, power them up regularly.
Man is the animal that laughs.
And occasionally whores for Karma.
I have a Time Capsule and is not too impressed by time machine.
After months the backup got corrupt, and I find out that all of the
backup is stored in one single object, a sparse bundle. So Time
Machine has one gaint single point of failure.
Disk Utility failed to repair it so I bought DiskWarrior that all are raving
about, didn't even work repairing the sparsebundle itself on a remote
disk, and then of course can't repair it. With lots of trouble copied the
sparsebundle to a USB connected disk, to find that because DiskWarrior
stores all information in RAM and a backup disk the way Apple has done
the backup storage contains tons of files (inside the sparsebundle I
guess), DiskWorrior has to begin swap, and after 24 hours it had not
showed any progress at all. Guess it would require a week or more.....
(Alsoft support thinks it is no need to inform customers about the RAM
requirements with many files, even if swapping it *will* eventually do
its job weeks later so not a bug, we customers should after all
understand the internals of DiskWarrior).
Time Machine to my Time Capsule is also slow like hell and loads
my machine more than I expected.
Backing up to hard drives is good. Backing up to removable harddrives is better (the combination removable tray and hard drive is called "media"). But backing up to RAIDED media is the best! presently, the only company I know of which makes RAIDED media is http://www.high-rely.com. They offer a 3 drive RAID 5 media and a two drive RAID 1 (mirror) media as well as RAID 0 media. They also offer "Softwareless backup". Pretty cool stuff. Bit pricey.
Actually, because of Falcon4's comments, I am seriously considering a WHS system. Specifically, I'm looking at the HP LX195 because of the low power use. What helped push me over the line of reticence was his mention that you could run remote backups through Hamachi. This makes setting up a system of automated backups on our home's 5-7 systems so simple, I can't just leave it to chance anymore.
If this is as simple as you all are saying, all I'll have to do is make sure to do the additional backup to an external so that I can move it offsite every month.
By my opinion, one of the best solution is to set up low power consuming server such as Bubba2, at offsite location (at your parent's / friend's) house. First (full) backup will take forever, but the incremental ones will be ok. There is hard disk at the other side (if it is in RAID1, even better). When using such a server you don't mind if it's turned on all the time, and you can also set it up to serve some other servers (for personal use, as it has not enough power for anything more).
http://www.excito.com/bubba/products/overview.html
If you are using Linux at the machine, that has to be backup-ed, you can use sshfs to mount the remote filesystem and rsync to sync it. Versioning can be done using hard-links. This is simple, easy to understand solution, but that's just one of the many posibilites.
If anyone know for any similar low-power linux server, please share the info.
P.S.: If you have too much time, you can create a low-power linux server using LinuxStamp and external USB hard drives (but now I am drifting away from original topic).
http://www.opencircuits.com/Linuxstamp
I painted my data onto the walls of caves... 30,000 years from now, when the rest of you are crying in your beer over your lost data, the precious pictures of my hunting trip will still be retrievable!
99% of people using 1 TB of data for personal storage, are already "Backing up" their movies and music.
So what you are really asking is "what's the best way to backup my backup"?
Once we have that figured out, you are going to want to know how to backup your backup of your backup.
BTW, I use Mozy, and love it. As long as your backup isn't huge, or you are willing to wait a long time for your backup to be complete, and you have a good internet connection, you should be set. I started my backup as a small set of folders that I considered most important, and then expanded the folders as they completed. Took many nights, but I was sleeping, so I don't care.
I submit that "a cigarette in my mouth" is sufficient proof of concept.
Normally I ascribe all life to intelligent design, but in your case I'll make an exception.
I had a problem with Mozy as well. It turned out my osx-partition had too little space, and some big files couldn't be processed anymore. No mention of this in the status window or anything. It even reported that the backup succeeded. Just because I noticed that the backup had been uploaded too quick to be true, I checked the logs, and found an error message. Mozy support still doesn't understand that this is a bug.
So it may be that your problem was reported in the error logs. But I guess you don't trust it anymore.
I have about 100GB to upload as well, which succeeded in the end. I am still thinking about changing to S3 though.
6. Someone may break into your home/place of business and steal anything that looks remotely "technical." This requires some sort of remote backups.
This has been my system for years and it works. HD's are so cheap these days I just buy two at a time.
http://texturedstatic.blogspot.com/2009/02/one-approach-to-backup-your-computer.html
A flash drive is probably the most stable technology.
I disagree. They wear down gradually, you know?
I had MP3 player once, 1GB. I made fairly regular backups on it. After 3 months or so it started to hang Vista when I put it in. It went to the drawer for next 2 years as broken.
Just recently I looked through my stuff and found it. I booted up my Linux system and made a raw copy. It stopped on 204th megabyte with permanent read error.
Reliable, don't you think?
Take my advice - never, ever do backups on your USB or store your only copy of data on it.
-1 insightful?
We would like to protect against accidental deletion of files, file corruption, or edits to a file that we have now reconsidered. This can be done with snapshotting. In source code, to reconsider and edit to a file is fairly common, and is the reason why most programming projects use revision control systems. Other options like nilfs or ZFS snapshots can also fill this goal. This goal is accomplished more easily if the backups area automatic and the backup device is live on the system.
/tmp, /home, media folders, or other things that you really don't want versioned), every time you make a change to configuration files, you simply check in the changes:
/path/to/filename
/etc files. Or for figuring out what changed that might have broken something. Plus I can (with the proper permissions on the SVN repository) open up the repository and browse the history from another machine. Such as looking at changes from my laptop running TortoiseSVN and using the visual diff tool of TSVN.
The solution for this on a Linux box is FSVS paired with a Subversion backend repository.
Once you have the machine configured (ignoring directories like
# fsvs ci -m "blah blah blah"
You could also script that to run at say 4am every morning to check-in things that you forgot to check-in.
It's a wonderful tool for tracking changes to
Wolde you bothe eate your cake, and have your cake?
On the Windows side, try Second Copy 7.
On the Linux side, rdiff-backup has a very simple syntax.
Wolde you bothe eate your cake, and have your cake?
Your irreplaceable pool is larger than mine, my really critical stuff fits on a pair of DL DVDs, so only about 14GB (I use "dvdisaster" software ECC in the images). However, I did spring for a Blu-Ray burner, which hasn't gone production yet but promises 20GB+ECC now, and possibly double that when/if 50GB media is available and affordable.
I have one more layer you don't, I inherited a large gun safe, and I have DVD backups in a small fireproof security box in that. That gives me hours of fire protection, and the data is in the format of encrypted ext2 filesystems, copies of loop mounted backup images.
The nice thing about DVDs is that they can be mailed to a secure location without needing a fast network connection.