Best Online Remote Backup Service w/Linux Client?
technocraft asks: "I've been searching for an online service to backup data from my Linux file server and have come up with nothing. For many users, Carbonite looks to be a great solution: Affordable, with 'unlimited' capacity. Unfortunately for me, you can only backup from Windows XP and explicitly NOT from external drives or network mapped drives (like my file server)."
Is anyone aware of an online backup services without these restrictions?
They use rsync over ssh:
http://www.rsync.net/
Base rate: $1.80/gb/month
Volume discounts:
25-49GB - 10% Off
50-99GB - 20% Off
100-199GB - 30% Off
200-399GB - 40% Off
400-999GB - 50% Off
1TB+ - 60% Off
You get supposedly unlimited storage, and pay for only what you use.
I haven't actually tried them though.
Simply the best. Windows and linux clients. One computer on your network runs it and can backup all the network shares you want it to. It's even HIPAA compliant.
VaultLogix is a little expensive compared to other services though. But there are reasons for that. I use it on a couple servers.
Get a cheap hosted site from some place like 1and1.com, and just use it as a fileserver. 100GB + MySQL + PHP is only $10 a month, and there's a cheaper plan available w/o MySQL.
I want to delete my account but Slashdot doesn't allow it.
I use Blacksun which doesn't have the "unlimited" storage you had before, but they are very affordable and offer rsync, ssh, sftp, and the regular linux services as well as the typical dragndrop interface clients. Very nice, and their tech support is helpful and quick to respond!
Did you ever notice that *nix doesn't even cover Linux?
Do you have any more information regarding your situation? For example, why does it have to be online storage? Is there a reason tape/optical/hard drive doesn't work for you? Also, how much data are we talking about? If it is hundreds of GB, then online may not be the best solution; the time to restore your data may be prohibitive. In line with that same question, what kind of bandwidth are we talking about? DSL is great for a couple of GB, but you need a lot more than that if you have a lot of data. Finally, how critical is this data? Is it a personal file server, or is it for a business? How long before downtime becomes an issue?
If time is critical, you might want to use tape or a spare hard drive and store it in a safe rated for digital media. If you need to restore, it can be a matter of minutes instead of hours or days. If you do daily backups of the entire server, you could just get a new machine, load the data, (or plug in the hard drive), and be back up in minutes. For absolutely critical files (financials, etc.) you could use online backup, and have all the non-critical backed-up locally.
So, if you can provide more information about your situation, the advice might be better.
Reading code is like reading the dictionary - you have to read half of it before you can go back and understand it.
Presumption: After a crash or loss, you may not have access to any of your own encryption keys.
Query: How can you possibly trust any third party not to take liberties with personal or business information entrusted to their care?
I really think you're better off taking care of backups in-house, along with of course keeping some of those same backups off-site in a secure manner.
--
I don't want to rule the world... I just want to be in charge of mayonnaise.
Don't forget to include the costs your ISP will charge you for the bandwidth usage... or if you need to restore 10GB over a T1, I hope you didn't need that data that quickly :)
It's not the online backups that sound like your concern but "outsourcing backups", otherwise you'd backup to a RAIDed pair of disks or backup to tape, both which will be cheaper than $onlineserver + $bandwidth costs. If the tech (could be you) loses the tape, breaks the tape, corrupts the filesystem that is the target for the backups, you can't sue them. But if a company that you are paying to keep your data, loses it, you can sue them for $lost per minute.
If you are going to outsource to a backup company, make sure they a) will be around for a while b) have been around for a while c) have references from people more complex than you.
Some people say backups aren't exciting, try losing a backup tape sometime...
-b.
http://www.dataprotection.com/
I work for these guys, vaults in 3 redundant datacenters, 10 years doing remote backup for almost EVERY operating system out there and, most importantly, a thiry day free (as in beer) trial just to check out what we can do. Not to mention we're one of, if not the biggest, player in the continental US when it comes to remote data storage.
-b.
I current have about 500GB on the server. That will likely double over the next year.
Query: How can you possibly trust any third party not to take liberties with personal or business information entrusted to their care? Answer: I wouldn't trust with my real email address, much less my data... I would encrypt the data before backing it up.
My reaction to rsync.net was similar. I'd really love to try them out, but I can't come up with any really good ways to justify that kind of expense.
... I bet you could get the whole thing down to a few watts that way and under $200.)
If you have multiple locations available for your use -- and if you're a home user, who doesn't? (parents, friends, etc.) -- it's a lot cheaper and you can get a lot more flexibility if you take an old PC, put a bunch of drives in it, and set it up somewhere. Then just have your systems replicate to it at night. (Yes, it's not bidirectional if you just use vanilla rsync, so if you have a lot of file churn you'll need to script something to keep the backup from bloating.)
I guess the 'setup cost' is higher than a managed service like rsync.net, but a minimalist system doesn't even take that long. Install Debian, install rsyncd if it's not already there, and open a port in the firewall for it. (Actually you don't really even need to run it as a daemon, now that I think about it.) WebDAV could take longer, but you'd have to really value your time highly to pay rsync.net's prices in exchange for an afternoon setting up Apache and a couple of cron jobs.
Personally I just have an old 600MHz Celeron machine that I set up with rsync and ssh/sftp with dyndns, and then traded a friend for a similar system that he had set up. I keep his box in a closet, plugged into my router, and he does the reverse. We both get off-site backups, and the only real cost of ownership is the electricity. (And if I was doing it today, you could get one of those routers that can run Linux from CompactFlash and can mount an external HD via USB 2.0
I am normally very skeptical of the DIY route -- it's tough to compete with mass-production in many instances. But I think that this is one situation where even a low-grade geek can toss themselves together something in a day that will be nearly the equal of rsync.net's service, for workstation-backup use. Hopefully the rsync.net guys will adjust their pricing accordingly and make a liar out of me soon, though.
"Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
Regarding your question on Carbonite: "Carbonite does not currently back up files that are larger than 2GB in size, removable hard drives (e.g., USB drives) or mapped network drives." "Carbonite will back up all the supported files on your hard drive whether you have 1GB, 10GB or more. As a practical matter, the speed of today's DSL and cable Internet services will make it very slow to back up more than, say, 40-50GB of data. Our Terms of Use will allow us to deny service to abusers." Given those terms, it's going to be tough to get past 100GB or so. I will have to consider getting a dedicated server at a hosting facility or do some kind of peer-sharing with friends and/or fellow photographers.
I realize this wouldn't be suitable for large companies but for my home network, I just use a regular webhosting provider for system backups. I use one-and-one which is $4.95/mo for 50gb of space, duplicity on the Linux side which is able to gpg encrypt the volumes and incrementally backup the systems, transfering the volumes over FTP to the webhost. The backup directory is configured so as not to be publicly accessible but it's gpg encrypted in any case.
I'm not a big fan of rsync backups. In the case of an accidental deletion or worse, gradual FS corruption, the corrupt data gets transfered over in the nightly rsync and your backups are useless. Same if a cracker deletes content, those deletes hapilly spread to the rsync mirrors if not caught before the scheduled run.
Jungle Disk is a cross-platform front-end for Amazon S3 that supports Windows, Mac, and Linux. You only pay the Amazon fees ($0.15/gig/month). On Linux you can mount it directly using DavFS then backup using any software you want (rsync, etc). It supports encryption and caching as well.
You might want to check out S3/Jungledisk. S3 by amazon is relatively cheap (15 cents/GB + transfer) and Jungledisk acts as WebDAV middleware. The middleware is still rather basic, but it works for backups, and passes the "are they going to be around in a couple of years" test. All data is encrypted by the Jungledisk middleware.
Moderation is for Monks!
This is quite true, and every current implementation of rsync that I've run across (not that I'm exactly a scholar on the topic or anything, but I've used it on Mac OS X and a bunch of Linux distros) uses SSH as the default shell when the address is specified with a single colon ('rsync -avz ~/Documents jdoe@foobar.com:/var/backup/'), although I suppose its probably best practice to specify the shell explicitly in a script.
So just to restate what I should have said: the only thing you'd need to have running on your remote server is sshd; if you don't care about WebDAV access you could lock down everything else. That would give you scp and sftp as well which seems like it ought to be good enough for most people.
I wouldn't recommend logging in as root when doing the backups though (since it has to have an un-passworded private key on the client machine to login as part of a cron job).
Anyway, given that you can use most features of the rsync daemon via a shell (including named modules), there doesn't seem to be any good reason to run the daemon on a machine that's exposed to the public net. In retrospect, I have no idea why I mentioned it.
"Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
http://press.xtvworld.com/article13092.html I am sure the storage landscape will change even further over the next few years so keep your hopes up for something better.
I provide consulting services to Divinsa. They use BackupPC to provide online backups for unix machines such as Linux, Mac, and FreeBSD. Dual datacenters with the data replicated between them. Check them out at http://www.divinsa.com./
Check it out... NetMass data centers And yes, the company does Linux. Bare metal restores and all that jazz
http://www.zmanda.com/quick-backup-setup.html
Amanda: Open Source Backup Software
Try Sago Networks. http://www.sagonet.com/ Data centers in both Atlanta & Tampa FL.
Generally I'm pretty fond of a roll-your-own multi-site online-over-ssh backup solution.
But if you weren't going to do that, Amazon offers:
15c / GB/month + 20c / GB xfer
Some middlewares already exist, like Jungledisk.
Looking for freelance Actionscript (Flash/Flex) or ColdFusion work and/or freelance developers. Email me, put Slashdot
www.dreamhost.com Disk: 20GB/month, increases by 800MB/month; Bandwidth: 1TB/month, increases by 32GB/month. Price: $7.95/month. The longer you're a customer, the more space and bandwidth you get. Includes shell accounts, etc. On top of that, ssh/sftp/rsync bandwidth doesn't even count against your account usage. I use Duplicity (rdiff-backup+gzip+gnupg) to backup several gigs of data on a regular basis, and it's great.
"Those who consume the bulk of goods are those who make them. We must never forget this secret of our prosperity."