Why Mirroring Is Not a Backup Solution

← Back to Stories (view on slashdot.org)

Why Mirroring Is Not a Backup Solution

Posted by kdawson on Friday January 2, 2009 @05:25AM from the pointed-lesson dept.

Craig writes "Journalspace.com has fallen and can't get up. The post on their site describes how their entire database was overwritten through either some inconceivable OS or application bug, or more likely a malicious act. Regardless of how the data was lost, their undoing appears to have been that they treated drive mirroring as a backup and have now paid the ultimate price for not having point-in-time backups of the data that was their business." The site had been in business since 2002 and had an Alexa page rank of 106,881. Quantcast said they had 14,000 monthly visitors recently. No word on how many thousands of bloggers' entire output has evaporated.

51 of 711 comments (clear)

DUH! by Anonymous Coward · 2009-01-02 05:27 · Score: 5, Insightful

DUH!
1. Re:DUH! by djupedal · 2009-01-02 06:07 · Score: 5, Funny
  
  As if millions of voices suddenly cried out in terror, and were suddenly silenced.
2. Re:DUH! by larry+bagina · 2009-01-02 06:43 · Score: 5, Funny
  
  I pity the fool who hahas?
  
  --
  Do you even lift?
  These aren't the 'roids you're looking for.
3. Re:DUH! by severoon · 2009-01-02 07:58 · Score: 4, Funny
  
  Journalspace CTO: We don't need an expensive off-site backup solution b/c we mirror all of our data real-time. It's genius!
  -entire database gets overwritten-
  Journalspace CTO: Ohhhhhh...now I get it.
  
  --
  but have you considered the following argument: shut up.
4. Re:DUH! by NickFitz · 2009-01-02 08:51 · Score: 5, Funny
  
  What about archive.org?
  Ah, apparently not... :-D
  
  --
  Using HTML in email is like putting sound effects on your phone calls. Just say <strong>no</strong>.
5. Re:DUH! by jcuervo · 2009-01-02 09:18 · Score: 4, Funny
  
  What we've got here is failure to administrate.
  
  --
  Assume I was drunk when I posted this.
Again a frost post to a red story by Anonymous Coward · 2009-01-02 05:27 · Score: 5, Funny

While this mirrors previous comments, it's not really a backup solution.
When is backing up *not* an option? by wandazulu · 2009-01-02 05:29 · Score: 5, Interesting

Mirroring, RAID, grid, whatever. At some point, you want your data safe and secure on something not physically attached to any power source.
1. Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 05:37 · Score: 5, Insightful
  
  Incremental backups to tape every night, full backup at the weekend. Tapes must be stored off-site at a proper storage location. Got lots of data and a small backup window? Get a faster tape drive and a tape robot. It costs money, but you data costs more.
  
  This is at a minimum people. Come on!
2. Re:When is backing up *not* an option? by Wdomburg · 2009-01-02 06:52 · Score: 4, Insightful
  
  Even accepting your price that's a cost of about 12.7 cents per gigabyte and you can get 800GB native LTO-4 tapes for about $50, which comes out to about 6.3 cents per gigabyte.
  But quoting costs for desktop grade SATA drives severely understates the true cost. For any non-trivial site installation you're talking near-line rated drives, drive caddies, storage shelves and additional SAN fabric. Then price out the additional power, cooling and rack space. Then price offsite shipping and storage for the bulkier, heavier and more delicate disk option.
  Mirroring has its place. Snapshotting has its place. And backups to stable media still has its place too.
3. Re:When is backing up *not* an option? by Trixter · 2009-01-02 07:53 · Score: 4, Insightful
  
  That's not my company's policy, that's *my* policy. I can take a 3-month hit to my personal data. AND YET MY LAX PERSONAL POLICY WOULD HAVE SAVED JOURNALSPACE.
  My *company's* policy is daily offsiting. Expensive, but very many of our locations could become a smoking hole in the ground and we'd still be able to restore and operate.
4. Re:When is backing up *not* an option? by Wdomburg · 2009-01-02 08:15 · Score: 4, Insightful
  
  Fine. Get the cartridges, but what about the capital cost minus depreciation of the drive? What about random access?
  Random access is why snapshots also have their place. :) Archival backups and nearline backups solve different sets of problems.
  Now weigh those against an inexpensive jbod frame with a 2gb FC backplane.
  What kind of capacity are we talking. For a small site you can pick up a little 2U unit that'll store 6.4TB uncompressed for under $5k. Or if you're running a larger site you can snag a 4U unit with two drives for about $15k that'll handle 30.4TB with optional expansion to 60.8TB native.
  What's the write speed of LT vs a tasty little GB SAS drive?
  120MB/sec per drive without compression. And now that you've talking about SAS drives your per TB cost is hopelessly optimistic. Even OEM packaged terabyte SAS drives are going to run you about a quarter a gigabyte, which is now four times the media cost of an LTO-4 solution.
  Rackspace? You can put a dozen into about 4U.
  So about 12TB in 4U compared to the 30TB unit I mention above.
  Cooling? Although I'll grant you green cost, the random accessibility out-classes the seek time and tape insertion by a human cost dramatically.
  Have you never heard of a tape library?
  Stable media? Tape? Sometimes.
  Properly handled tape is incredibly stable.
  Shelf space?
  If you're doing off-site storage, that's going to be an issue regardless of what media you're using. And as I pointed out, tape is far more compact and far lighter than disks.
  No need to use tape anymore. Get out of the reality distortion field, but do the right thing by testing what you have and doing drills to ensure that whatever you have, works and is a procedure understood by all.
  I'm not the one dismissing an entire class of technology while demonstrating ignorance of its costs and benefits.
Dear Every Corporate Tool in the Universe: by yttrstein · 2009-01-02 05:29 · Score: 5, Insightful

And that's why your IT department actually needs funding. Sleep tight.
1. Re:Dear Every Corporate Tool in the Universe: by TubeSteak · 2009-01-02 05:43 · Score: 5, Insightful
  
  And that's why your IT department actually needs funding. Sleep tight.
  They've had the site live for 6 years.
  This wasn't a lack of funding, it was just sheer stupidity.
  6 years and nobody ever thought it'd be a good idea to back everything up to dvd or an external hard drive. HTML compresses really well in case they didn't know.
  
  --
  [Fuck Beta]
  o0t!
2. Re:Dear Every Corporate Tool in the Universe: by yttrstein · 2009-01-02 05:45 · Score: 4, Insightful
  
  Stupidity IS a lack of funding. Pay the salary of someone smart enough to handle your data correctly if you have no interest in becoming smart yourself. Simple.
3. Re:Dear Every Corporate Tool in the Universe: by mrchaotica · 2009-01-02 05:56 · Score: 4, Insightful
  
  Hell, they could have spent $50 on a USB hard drive (i.e., half-assed it) and been better off!
  
  --
  "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz
4. Re:Dear Every Corporate Tool in the Universe: by slugtastic · 2009-01-02 05:57 · Score: 5, Funny
  
  I'm more surprised that the site lived for 6 years without back-up. That's pretty hardcore.
5. Re:Dear Every Corporate Tool in the Universe: by modmans2ndcoming · 2009-01-02 06:01 · Score: 5, Funny
  
  Screw that!! IT Departments are cost centers and have absolutely no benefit to the bottom line of a company... none at all... nope.
6. Re:Dear Every Corporate Tool in the Universe: by Kjella · 2009-01-02 06:09 · Score: 5, Insightful
  
  Being too stupid to recognize your own shortcomings is also a form of stupidity. Or hubris, whichever is more appropriate.
  
  --
  Live today, because you never know what tomorrow brings
7. Re:Dear Every Corporate Tool in the Universe: by bb5ch39t · 2009-01-02 06:24 · Score: 4, Funny
  
  Absolutely correct! Why management here is drooling over how much they money could save if they just didn't need the damn IT department. And all those damn desktop computers! Why, life was much better back in the days of paper ledgers and pencils! Sigh - if only we could have the perfect company. One which only has high level managers and none of the "riff raff" that infects them. Oh! Wait! That's Congress. And they have a monopoly.
8. Re:Dear Every Corporate Tool in the Universe: by slushdork · 2009-01-02 07:05 · Score: 5, Funny
  
  Maybe they should have used this backup strategy, although this one looks more like this...
rm -rf / by corsec67 · 2009-01-02 05:29 · Score: 5, Informative

rm -rf /

That is one reason why mirroring isn't a backup, and why backups should ideally be off-line.

--
If I have nothing to hide, don't search me
1. Re:rm -rf / by Piranhaa · 2009-01-02 06:06 · Score: 5, Funny
  
  C:\>rm -rf /
  'rm' is not recognized as an internal or external command,
  operable program or batch file.
  Everything's still running here...
Excellent! by GravityStar · 2009-01-02 05:30 · Score: 5, Funny

Excellent! We can use their demise as yet another cautionary tale.
1. Re:Excellent! by El+Torico · 2009-01-02 05:48 · Score: 5, Funny
  
  Excellent! We can use their demise as yet another cautionary tale.
  Ironically, it's more useful than the entire collection of blogs that they stored.
  
  --
  In the land of the blind, the one-eyed man is usually crucified.
That's what backups are for by MBCook · 2009-01-02 05:31 · Score: 5, Interesting

It's really unfortunate that this happened. If they had simply had a backup snapshot of the DB they could have restored it. RAID only saves you from disk failures. It doesn't work on OS/user failures.
Unfortunately this is the kind of thing you tend to learn from experience (either yours or someone else). It's very easy to think "RAID 1 = disks are safe".
Just like a database cluster wouldn't have saved them. A clustering database can save you from load, or you can swap servers if a disk goes bad. But when someone issues "DELETE * FROM..." the other cluster nodes start to happily run the same thing and now you have 2 (or 3 or 10 or...) empty database boxes.
I hope those bloggers had a backup of some sort of their own.

--
Comment forecast: Bits of genius surrounded by a sea of mediocrity.
1. Re:That's what backups are for by MBCook · 2009-01-02 05:56 · Score: 5, Insightful
  
  My guess (and this is a guess, I'd never heard of the site before yesterday) is that this is some guy who started his own little site and it got bigger and bigger. Basically he never designed the backup, the system was just slowly pieced bigger and bigger until it got to it's current state.
  The comments in the messages from the site's operator about the cost of the drive recover and thinking both drives just died at once indicate to me that this site was basically a hobby for him and he isn't experienced as an admin.
  
  --
  Comment forecast: Bits of genius surrounded by a sea of mediocrity.
How hard is it to remember: by computersareevil · 2009-01-02 05:39 · Score: 4, Insightful

Mirroring: High availability
Backups: High reliability
The rules of backups by Anonymous Coward · 2009-01-02 05:40 · Score: 5, Informative

The rules of backups:
1. Backup all your data
2. Backup frequently
3. Take some backups off-site
4. Keep some old backups
5. Test your backups
6. Secure your backups
7. Perform integrity checking
To many shops think HA==DR by uncledrax · 2009-01-02 05:41 · Score: 4, Informative

It's more an issue that some people think that HA == DR.. which obviously this story reminds us that it is not the same thing.
Mirroring / RAID == HA.. if one of your HDDs let the smoke out, you still don't incur downtime. If you have a hot-spare, you're even better.. all it does it let you have alittle time to correct the
issue (ie: "It can wait until morning").
Also, one other very important thing.. mirroring doesn't prevent/restore data corruption. If you're mirroring your rm -rf (as pointed out by Corsec67 below), your RAID will happy do what it does.. and span your command to all your disks.... Congrats, you just successfully gave yourself HA to your disk erasing! :]
Backups are DR.. If your RAID croaks.. your SOL if you don't off-machine backups. If you accidently nuke your disks with an rm or something, you can still go back and restore data.. sure you'll likely loose -some- data, but -some- is better then all in this case.

--
----- The internet has given everyone the ability to have their voice heard equally as loud.. even if they shouldn't be
1. Re:To many shops think HA==DR by xyphor · 2009-01-02 06:14 · Score: 5, Informative
  
  DR is Disaster Recovery
  HA is High Availability
2. Re:To many shops think HA==DR by cbiltcliffe · 2009-01-02 06:54 · Score: 5, Funny
  
  I tried Googling, but the only results I got were a medical office in Chinatown.....
  
  --
  "City hall" in German is "Rathaus" Kinda explains a few things......
Re:stunned silence by conureman · 2009-01-02 05:42 · Score: 5, Funny

I am experiencing a strange phenomenon. The jaw-drop reflex has been popping my mouth open for several minutes and won't stop. If I focus I can close it, but then it pops open again. wow.

--
The cost of that cleanup, of course, will be borne by taxpayers, not industry.
Only 2 drives? by lalena · 2009-01-02 05:42 · Score: 4, Insightful

Maybe I could understand that there might be issues with backing up live databases, and they didn't want to deal with it. Still not an excuse.
BUT, according to the site "the server which held the journalspace data had two large drives in a RAID configuration". Only TWO drives.
All they had to do was pull one of the drives, replace it, and lock up the original off site. In a couple of hours the drives would have been mirrored again.
To the HR department by squeegee_boy · 2009-01-02 05:42 · Score: 5, Funny

Important note: don't hire the IT dude with Journalspace.com on his resume.
Re:El Oh El by kurtmckee · 2009-01-02 05:45 · Score: 5, Insightful

I'm really surprised that with all the users they had, they are so quick to say "everything is gone and we're giving up"
Considering how complete and unrecoverable the loss is, they have no idea who their users are. The accounts would have to be recreated from scratch, but who would try? Their users have no reason to ever trust them again. Journalspace would have a difficult time wooing back their original users, and no new user would seriously consider using them.
Bowing out is the only recourse, but I'm glad they're considering releasing their source code.
Re:Ouch by conureman · 2009-01-02 05:45 · Score: 4, Insightful

Or even one, stale, backup.

--
The cost of that cleanup, of course, will be borne by taxpayers, not industry.
A lesson for admins, and users too by gzipped_tar · 2009-01-02 05:46 · Score: 4, Insightful

No doubt this incident is the result of the admin's fault. He's been confusing mirroring and backup and carried on the mistake until it's too late, as pointed out in other comments.
Now what about a user's angle? The morale is you can never think your data is safer when it's "in the cloud". If you value your blog and your readers, you *should* save a copy of your work as well as the readers' info, *locally*, somewhere you have control over.
There's no place like $HOME.

--
Colorless green Cthulhu waits dreaming furiously.
1. Re:A lesson for admins, and users too by djmurdoch · 2009-01-02 05:57 · Score: 4, Insightful
  
  And a corollary to the parent's good advice: if you can't easily get a complete copy of your work, find another host. Manual one-by-one downloads don't cut it.
Re:Noobs. No, really. by emag · 2009-01-02 05:51 · Score: 4, Informative

Even the greenest IT employee knows that mirroring is to protect against hard drive failure and not software corruption.
I only wish that were true. I've given up arguing with friends about this, who insist that their mirrors are good enough backups. I just stare at colleagues who think such, especially those who SHOULD know better. And I *know* coworkers are doing this @ work, too, and I'm just waiting for about 50TB of data to suddenly go missing...

--
"The urge to save humanity is almost always a false front for the urge to rule." --H.L. Mencken
No Archive.org either by computersareevil · 2009-01-02 05:52 · Score: 5, Informative

They also purposely blocked archive.org via a robots.txt exclusion, so the bloggers can't use that to try and recover some of their blogs.
There is a denial going on by hwyhobo · 2009-01-02 05:52 · Score: 5, Insightful

In today's world where primary storage and protection storage are well-defined, and where entire industry grew around it (examples: NetApp, Data Domain), one is hard-pressed to understand the reason for such a debacle. The reading of the note referred to in the article leads me to believe, unfortunately, that Journalspace's IT department did not understand the difference.
It is sometimes considered a bad form to say something bad about fellow techies. We prefer to look for 'outside' causes. Still, to learn and avoid the same problems in the future, one has to admit his mistakes first. This paragraph from the Journalspace's page:
The value of such a setup is that if one drive fails, the server keeps running, using the remaining drive. Since the remaining drive has a copy of the data on the other drive, the data is intact. The administrator simply replaces the drive that's gone bad, and the server is back to operating with two redundant drives.
makes me believe there is a denial going on.

--
End anonymous moderation and posting on /.
Someone needs to be FIRED by spitek · 2009-01-02 05:53 · Score: 4, Funny

You pay your infrastructure people to maintain business, continuity I mean the tittle of this post made me go, "Really, no shit" That's like systems admin 101! If the admin was aware then the manager that didn't listen needs to be fired. If the manager listened and they are just run by retards then they got what they deserve. You'd think 17,000 visitors a month would be worth enough to do it right, in add revenue alone. The cost of a consumer machine running linux with a few TB's of SATA space - $1200 How much the company paid to have a system's admin play video games all day - $50,000 The cost of a 17,000 vistor a month site going down because they had no data base backups - Priceless.
Mirroring by jav1231 · 2009-01-02 06:04 · Score: 5, Insightful

See mirroring is like...well a mirror. If you stand before one and stick a fork in your eye your mirror-image does the same. In real time. Analogies are there for a reason.
1. Re:Mirroring by gEvil+(beta) · 2009-01-02 08:30 · Score: 4, Funny
  
  See mirroring is like...well a mirror. If you stand before one and stick a fork in your eye your mirror-image does the same. In real time. Analogies are there for a reason.
  
  There's a major flaw in your analogy. See, if I stick a fork in my right eye, the mirror image will stick a fork in his left eye. Between the two of us, however, we still have one good left AND right eye. So ipso fatso, I have a complete backup.
  
  --
  This guy's the limit!
Google cache diving by Chris+Pimlott · 2009-01-02 06:10 · Score: 5, Informative

Looks like at least some content is still in Google's cache, those looking to salvage their journals should act quickly.
You can limit google's search results to a particular site by using the "site:domainname.com" search term (example) and then click the "Cached" links of each result to see Google's copy.
There's also a Greasemonkey script for Firefox that can automatically add Google Cache links next to page links, so you can navigate from one cached page to another easier.
You need more than backups ... by blowdart · 2009-01-02 06:11 · Score: 5, Insightful

You don't just need backups. You need to TEST them. Having a backup run every night is nice and all; but if the tapes are unreadable and no error was reported, or if you're doing it wrong and the backup is corrupted and you only find out when you come to restore ....
Double Duh! by Roger+W+Moore · 2009-01-02 06:22 · Score: 4, Interesting

Since they apparently used OSX Server this is particularly bad. All they needed was a large enough USB attached disk and then to turn on Time Machine. Might not be the best solution for their needs but it is hard to imagine one which requires less effort.
1. Re:Double Duh! by MarkRose · 2009-01-02 07:22 · Score: 4, Informative
  
  Not quite. Backing up a live database can be a bit tricky. By the time you finish copying part of the database, the first bit can change again. So you have to create a snapshot of some kind. And that has to be supported in the database setup (at the application or server level) in order for the backup to be in a consistent state. And you don't want your backup process to degrade site performance, either. So a simple file copy is totally inadequate.
  A common solution is replication. Backup is then performed by creating a replication point on the slave database machine then taking a snapshot and copying that while while master database machine continues serving at full speed. Replication can then catch up when the backup is complete. Another advantage to having replication is duplication on the machine level -- if the master fails, go live to the slave with minimal to no downtime. Set both machines up in a master-master configuration and you can swap back and forth as needed, allowing live maintenance and backup with no performance degredation.
  
  --
  Be relentless!
2. Re:Double Duh! by MBCook · 2009-01-02 08:09 · Score: 4, Informative
  
  *BZZZZT*
  The GP was 100% correct. If you had kept reading, you'd see that the suggestion was to use replication so you can lock the DB into a consistent state while backing up. When the backup is done, the box starts replicating again. If you didn't have the backup box, you'd have to lock the production database while your backup was going on.
  He was suggesting replication purely as a way to avoid having to pause the application during backup, not as the backup it's self.
  
  --
  Comment forecast: Bits of genius surrounded by a sea of mediocrity.
Personal backups of online data by RevWaldo · 2009-01-02 06:29 · Score: 4, Insightful

This is why users should be able to easily back up their own data for any online service. If a service entrusted with your data provides no straightforward way to drop a copy of it onto your own hard drive, don't trust it. I'd go as far to say that any service that doesn't strongly recommend you keep your own backups shouldn't be trusted.

Do the big kahunas of the "Web 2.0" world give users that option? Gmail, Myspace, Facebook, Twitter etcetera ad nauseam?

--
Prisencolinensinainciusol. Ol Rait!