Why Mirroring Is Not a Backup Solution

DUH! by Anonymous Coward · 2009-01-02 05:27 · Score: 5, Insightful

DUH!

Re:DUH! by frovingslosh · 2009-01-02 06:01 · Score: 1

Let us consider the advice of the great philosophers Mr. T and Nelson on this subject.

--
I'm an American. I love this country and the freedoms that we used to have.
Re:DUH! by djupedal · 2009-01-02 06:07 · Score: 5, Funny

As if millions of voices suddenly cried out in terror, and were suddenly silenced.
Re:DUH! by larry+bagina · 2009-01-02 06:43 · Score: 5, Funny

I pity the fool who hahas?

--
Do you even lift?
These aren't the 'roids you're looking for.
Re:DUH! by hesiod · 2009-01-02 07:44 · Score: 2, Insightful

We can only hope they remain silent.
Re:DUH! by severoon · 2009-01-02 07:58 · Score: 4, Funny

Journalspace CTO: We don't need an expensive off-site backup solution b/c we mirror all of our data real-time. It's genius!
-entire database gets overwritten-
Journalspace CTO: Ohhhhhh...now I get it.

--
but have you considered the following argument: shut up.
Re:DUH! by Anonymous Coward · 2009-01-02 08:44 · Score: 0

Journalspace CTO: We don't need an expensive off-site backup solution b/c we mirror all of our data real-time. It's genius!
-entire database gets overwritten-
Journalspace CTO: Ohhhhhh...now I get it.
If only they were using a real ACID database, with real logging and rollback, such as oracle or MS sql server. Properly configured, they are very handy.
But still no comparison to tape backup.
Re:DUH! by NickFitz · 2009-01-02 08:51 · Score: 5, Funny

What about archive.org?
Ah, apparently not... :-D

--
Using HTML in email is like putting sound effects on your phone calls. Just say <strong>no</strong>.
Re:DUH! by vrmlguy · 2009-01-02 09:12 · Score: 1

Sort of reminds me of this.
You know, I've thought once or twice about building something that reads robots.txt files and archives everything that's denied to archive.org. Until today, I couldn't think of a good business case.

--
Nothing for 6-digit uids?
Re:DUH! by jcuervo · 2009-01-02 09:18 · Score: 4, Funny

What we've got here is failure to administrate.

--
Assume I was drunk when I posted this.
Re:DUH! by socsoc · 2009-01-02 10:38 · Score: 1

Except they didn't block archive.org, they just didn't specifically grant them an exclusion to their default block all in the robots.txt.
Re:DUH! by jez9999 · 2009-01-02 13:10 · Score: 1

If only they were using a real ACID database, with real logging and rollback, such as oracle or MS sql server. Properly configured, they are very handy.
But still no comparison to tape backup.
Yeah.
A database might actually stand a chance of being readable in case of an emergency.

--
== Jez ==
Do you miss Firefox? Try Pale Moon.
Re:DUH! by darthflo · 2009-01-02 13:46 · Score: 3, Insightful

As if millions of voices constantly cried out in angst, and were finally silenced.
Fixed that for you. ;)
Re:DUH! by lgw · 2009-01-02 14:06 · Score: 2, Insightful

If your tape is readable the day you make it, it will be readable for many years. Tape (especially half-inch tape) doesn't really go bad over time - 20 year shelf life is common.
The reason you hear the "oh noooes, my tapes aren't readable" horror stories is because people are too lazy to verify the tapes at the time of creation, and the tape drives go silently bad over the years. Schedule a verify after every backup (and you don't need to verify everyhting you wrote, just a sample) and you won't have a problem with "bad tapes". Well, except maybe with QIC tape; what garbage.

--
Socialism: a lie told by totalitarians and believed by fools.
Re:DUH! by Kyril · 2009-01-02 16:07 · Score: 2, Insightful

Amen to this -- at Fermilab we had a setup with lots of 8mm tapes, and we thought the MTBF was awfully high (several were failing each week), much more than the 30,000 hour MTBF specified...until we realized it was 30,000 hours with a 5% duty cycle, or 600 hours of use. 600 hours divided by even a dozen tapes is 50 hours, about 6 8-hour days and these were in use up to 24x6... The system also let us empirically confirm the single-bit error rate of DRAM, something on the order of 1 in 10^13 bits at the time.
Hot spare, hot swap, hot plug...that's how you gotta do it when you have so much hardware on hand that failures need to be planned for rather than prevented.
Re:DUH! by Kazoo+the+Clown · 2009-01-02 20:55 · Score: 1

I suppose it's possible someone has *already* been doing that. Isn't robots.txt a voluntary scheme?
Re:DUH! by Anonymous Coward · 2009-01-02 22:23 · Score: 0

Journalspace CTO: We don't need an expensive off-site backup solution b/c we mirror all of our data real-time. It's genius!
-entire database gets overwritten-
Journalspace CTO: How did that happen? It is YOUR fault!
There, fixed that for you.
Re:DUH! by darkmeridian · 2009-01-03 00:50 · Score: 1

It's not clear that he's learned his lesson. From the skeleton JournalSpace page, "Here is what happened: the server which held the journalspace data had two large drives in a RAID configuration. As data is written (such as saving an item to the database), it's automatically copied to both drives, as a backup mechanism."
It's not backup! RAID provides reliability through redundancy. An on-site disaster, power outage, or software problem will cause you to lose your data.

--
A NYC lawyer blogs. http://www.chuangblog.com/
Re:DUH! by Anonymous Coward · 2009-01-03 01:41 · Score: 0

I would be devastated if this happened to c64web.com
run from a 1982 commodore 64
Re:DUH! by Anonymous Coward · 2009-01-03 22:43 · Score: 0

âoeOnly wimps use tape backup: _real_ men just upload their important stuff on ftp, and let the rest of the world mirror itâ -Linus Torvalds
Now since they did not let the world mirror their stuff due to robots.txt, ... (post your reply and let the story continue)
Re:DUH! by rholland356 · 2009-01-05 04:32 · Score: 1

RAID 0 on a two-disk system isn't much of a mirror. More like an image reflected from the wrapper on a stick of gum. No mention of keeping a mirror on another system.
They suspect a former employee, but how is it possible that a former employee gained access to their system? An utter lack of security, most likely.
The real kicker is, of course, that the fee from DriveSavers would have been as much as an entire year of revenue for Journalspace.
Faulty tech? Yeah. Faulty business model? Definitely! At some point you have to take the system out of the basement and put it in a storefront.
And to those who would blog--get to know your host.
Re:DUH! by severoon · 2009-01-05 07:32 · Score: 1

RAID 0 on a two-disk system isn't much of a mirror.
You don't know how right you are—RAID-0 isn't a mirror at all. RAID-1 is a mirror, RAID-0 is a stripe set.

More like an image reflected from the wrapper on a stick of gum. No mention of keeping a mirror on another system.
What does this mean? Isn't a "mirror on another system" a backup? (Unless you mean real-time, like a replication service type thingy...)

They suspect a former employee, but how is it possible that a former employee gained access to their system? An utter lack of security, most likely.
Yea, I love that explanation for their problem: It's not that our backup was insufficient...it's that several components of our operation were teetering on the edge all at the same time...security, backup...it was all a complete mess! Nice damage control.

At some point you have to take the system out of the basement and put it in a storefront.
What does this mean? Why would you put your IT systems in a storefront?

And to those who would blog--get to know your host.
Huh?

--
but have you considered the following argument: shut up.
Re:DUH! by budmang · 2009-01-05 09:14 · Score: 1

While doing backups - good backups - seems like an obvious necessity, all too many people and companies put it off, hoping it won't happen to them. This was another critical reminder.
Gleb
--
Backblaze - backup, before you wish you had.

Again a frost post to a red story by Anonymous Coward · 2009-01-02 05:27 · Score: 5, Funny

While this mirrors previous comments, it's not really a backup solution.

Re:Again a frost post to a red story by moranar · 2009-01-02 06:04 · Score: 1

Now this is what I call First Post. Well done!

--
"I think it would be a good idea!"
Gandhi, about Internet Security
Re:Again a frost post to a red story by nocomment · 2009-01-02 06:41 · Score: 1

So the pipes are clean then?
http://ars.userfriendly.org/cartoons/?id=20070101
btw can't they 'rollback' the changes to the DB?

--
/* oops I accidentally made a comment, sorry */
/* http://allyourbasearebelongto.us */
Re:Again a frost post to a red story by Anonymous Coward · 2009-01-02 08:06 · Score: 0

Real cloggers probably got as annoyed by that cartoon as hackers do by journalists that use hacker interchangeably with cracker. Of course, cracker has an alternative meaning as well.
Re:Again a frost post to a red story by Anonymous Coward · 2009-01-02 08:33 · Score: 0

btw can't they 'rollback' the changes to the DB?
How do you think a rollback works? By magic fairies? It's all gone.

Oh Come on! by Anonymous Coward · 2009-01-02 05:28 · Score: 0

Who was the IT person there? It's been obvious for years and years that mirroring is a crap solution for backup! I.D.I.O.T.S!

When is backing up *not* an option? by wandazulu · 2009-01-02 05:29 · Score: 5, Interesting

Mirroring, RAID, grid, whatever. At some point, you want your data safe and secure on something not physically attached to any power source.

Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 05:37 · Score: 5, Insightful

Incremental backups to tape every night, full backup at the weekend. Tapes must be stored off-site at a proper storage location. Got lots of data and a small backup window? Get a faster tape drive and a tape robot. It costs money, but you data costs more.

This is at a minimum people. Come on!
Re:When is backing up *not* an option? by z_gringo · 2009-01-02 05:55 · Score: 2, Informative

nightly dumps of the database and rsync of the data directories to servers in different locations should be adequate. If you have lots of data, I don't see how tapes are really going to do the daily backup jobs.

backing up nightly to a large mirrored NAS and a periodic copy to a removable device seems like a good way to go these days. I haven't used tapes for years.

--
-- -- Warning. Do not stare directly at the sun.
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 05:59 · Score: 0

Not attached to *any power source*? Surely you must be joking, Mr. Wandazulu!
Re:When is backing up *not* an option? by Nutria · 2009-01-02 06:04 · Score: 0, Troll

If you have lots of data, I don't see how tapes are really going to do the daily backup jobs.
More tape drives, idiot, and parallel backups. Mainframes and OpenVMS have been doing it that way for decades.

--
"I don't know, therefore Aliens" Wafflebox1
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 06:12 · Score: 0

Incremental backups to tape every night, full backup at the weekend.

Daily incrementals and weekly fulls? How about using backup technology invented after the dark ages?
Re:When is backing up *not* an option? by postbigbang · 2009-01-02 06:12 · Score: 3, Interesting

Nope.
Mirrors are fine, just snapshot them and store them offsite regularly. Do delta backups as needed but close-in for fast restoration.
There is no rational justification for tape anymore, what with the cost per TB stored on hard disks now under $130, total $$. Random accessibility unless you're stalling a subpoena, is just mandatory on backup media.

--
---- Teach Peace. It's Cheaper Than War.
Re:When is backing up *not* an option? by ba_hiker · 2009-01-02 06:15 · Score: 2, Informative

how 'bout this though.. hot-swap mirrored drives. pull 1/2 of the mirror at any time to make a backup. replace the pulled drives with blanks. keep a short stream of backup drives, say 8 or 9. drives are cheap. store in well padded metal boxes, offsite.
Re:When is backing up *not* an option? by z_gringo · 2009-01-02 06:20 · Score: 3, Insightful

NAS devices are cheaper and faster now. Lower end removable drives are not much more expensive than tapes, and they are a lot faster and easier to manage.

--
-- -- Warning. Do not stare directly at the sun.
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 06:21 · Score: 0

My database backups take 8MB of space after a simple tar+bz2. I just put them on all of my desktops and laptops. Safety in numbers, especially when it's not secure data.
Re:When is backing up *not* an option? by Anpheus · 2009-01-02 06:33 · Score: 1

There's a USB 2.0 hard drive dock on NewEgg that will accept any 2.5" or 3.5" SATA hard disk, top-loading, no mounting bracket, no screws. Every day, press a button to release the current drive and then put in a new one.
Re:When is backing up *not* an option? by Nutria · 2009-01-02 06:34 · Score: 1, Insightful

NAS devices are cheaper and faster now. Lower end removable drives are not much more expensive than tapes, and they are a lot faster and easier to manage.
Having 21 days of off-site backups stored on NAS is kinda difficult.

--
"I don't know, therefore Aliens" Wafflebox1
Re:When is backing up *not* an option? by Wdomburg · 2009-01-02 06:52 · Score: 4, Insightful

Even accepting your price that's a cost of about 12.7 cents per gigabyte and you can get 800GB native LTO-4 tapes for about $50, which comes out to about 6.3 cents per gigabyte.
But quoting costs for desktop grade SATA drives severely understates the true cost. For any non-trivial site installation you're talking near-line rated drives, drive caddies, storage shelves and additional SAN fabric. Then price out the additional power, cooling and rack space. Then price offsite shipping and storage for the bulkier, heavier and more delicate disk option.
Mirroring has its place. Snapshotting has its place. And backups to stable media still has its place too.
Re:When is backing up *not* an option? by Trixter · 2009-01-02 07:01 · Score: 2, Informative

Amen, and why I just love ZFS (or any filesystem that supports instant snapshots). I use mirroring to cover drive failures, and I use weekly snapshots as backups. Once every three months, I offline to external disk. Minimum cost, more than reasonable protection.
Re:When is backing up *not* an option? by postbigbang · 2009-01-02 07:02 · Score: 1, Insightful

Fine. Get the cartridges, but what about the capital cost minus depreciation of the drive? What about random access?
Now weigh those against an inexpensive jbod frame with a 2gb FC backplane. What's the write speed of LT vs a tasty little GB SAS drive? Rackspace? You can put a dozen into about 4U. Cooling? Although I'll grant you green cost, the random accessibility out-classes the seek time and tape insertion by a human cost dramatically. Stable media? Tape? Sometimes. Shelf space?
SAN fabric is dirt these days. You can get a nice Silkworm and a cheap-but-reliable SAS backplane for dirt as well. Perhaps a couple of GBICs.... or some handy-dandy fiber cables (also dirt these days) and you're in business. Or, put up a 10dot network off your public-face grid, and just use iSCSI. No need to use tape anymore. Get out of the reality distortion field, but do the right thing by testing what you have and doing drills to ensure that whatever you have, works and is a procedure understood by all.

--
---- Teach Peace. It's Cheaper Than War.
Re:When is backing up *not* an option? by falcon5768 · 2009-01-02 07:02 · Score: 1

not any more difficult than tapes. Both are bulky but depending on how much data you have tapes can be much more so.

--
"Slashdot, where telling the truth is overrated but lying is insightful."
Re:When is backing up *not* an option? by SaDan · 2009-01-02 07:02 · Score: 1

Not if the NAS is off-site from your main data storage.
I'm a huge fan of tapes, but I've done some backup implementations that were better suited for a couple of hard drives in an off-site rotation.
Lots of ways to skin this cat, and apparently all were too difficult for the admin in question in the article.
Re:When is backing up *not* an option? by Omega996 · 2009-01-02 07:05 · Score: 1

well, incremental daily backups for sure, but not to tape. I think archiving weekly to a tape is fine, but daily? Too slow for writes and recovery, and too expensive.
Use a disk-based intermediate storage for your incrementals, and archive weekly to LTO.
Re:When is backing up *not* an option? by sirsnork · 2009-01-02 07:08 · Score: 1

Uhh, so what happens if there is a fire/earthquake/hurricane or any other disaster that wipes out your site? You've got data thats 3 months old? While thats better than nothing I'd be surprised if your company survived that.

--

Normal people worry me!
Re:When is backing up *not* an option? by FalcDot · 2009-01-02 07:10 · Score: 1

Not feasible since the array would need to be rebuilt every time you swap a drive.
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 07:12 · Score: 0

It's always nice to have something truly offline to protect against virus or malicious users - a tape in a box has a physical layer of security associated with it that NAS does not.
For the small cost of a tape rotation service you can ensure against more then accidental data loss.
Re:When is backing up *not* an option? by juuri · 2009-01-02 07:16 · Score: 0

If you are are still doing backups to tape, well, old habits die hard.
Have a solid data validation strategy, test method and process. Migrate any data that must be kept for reasons, be they contractual, governmental, what have you, on the current bulk online storage medium. Currently that is simply hard drives. It's cheaper than tapes. The cost to host the data stays relatively stable, unlike with tapes as your sizes are increasing (more tapes vs larger capacity drives).
Some may say fall back on the old ways of tape as more "standard" or "stable" or "guaranteed" to work. Firstly BS, tape is never a sure thing and if you don't trust the drives to hold the data while not online, why are you trusting them to hold the data while online?
Tapes have got to go. Now.

--
--- I do not moderate.
Re:When is backing up *not* an option? by apache6131 · 2009-01-02 07:18 · Score: 1

That cost per TB of disk is based on consumer grade drives, enterprise drives cost quite a bit more. I think anywhere from 8 to 25k per TB depending on the tier of storage array.
Re:When is backing up *not* an option? by AvitarX · 2009-01-02 07:20 · Score: 1

Do you really want to spend so much time in Single drive fails the system mode though?

I like the idea, but it would be much smarter to do it with something with something like the Linux RAID 10, and 3 lives drive with 3 copies of data.

This will speed you up, especially during the frequent recover modes. Always protect you from at least one drive failure (often times 2). You will only need one extra drive (12.5% of disk cost, if 8-9 isn't too much, 9 or 10 certainly isn't).

What I do at work (without too much storage by todays standards) is have 2 500 GiB drives mirrored using a RAID 10 "far" copy (in tests it wrote barely slower (probably because writing can allow for near perfect caching), but read nearly as fast as a stripe).

I do nightly snapshots to a 1 TiB external drive. I have these backups shared read only, so people can browse by date and do their own data recovery.

With our work flow, 1 TiB is more than enough to save snapshots for a month. This lets me switch drives on a monthly basis. I also never delete the first of the month out of the snapshots, though I doubt it will ever be worth the effort to search for an old revision of a file.

FWIW, I use pdumpfs for the snapshots, it's not the best, but damn is it easy. Other options wither wanted a GUI (flyback, TimeVault), wanted too much setup in cron (glasstree, Dirvish), or didn't allow for a browsable result (rdiff-backup).

YMMV

--
Wow, sent an e-mail as suggested when clicking on "use classic" banner, and got a fast response that addressed my msg
Re:When is backing up *not* an option? by postbigbang · 2009-01-02 07:28 · Score: 0

You're welcome to pay for some EMC's salesperson's golf trip if you want. Enterprise class equipment certainly costs a bit more, but you're often paying for luxury where utility will do nicely.
Seagate bulk SAS drives 1tb barracudas are about $220 in oem bulk.... less if you want ata/300s. Compressed snapshots are fine in my experience, and better still, just in a nice, easy to slab hypervisor mountable.... or any other format that fits the bill. Delta snapshots are fine, too, depending on the transactional nature of the data. If it's transactional, then there are lots of great techniques to have multiple concurrent data instances that are easily coalesced when needed. The trouble: remembering to do fire drills.

--
---- Teach Peace. It's Cheaper Than War.
Re:When is backing up *not* an option? by Nutria · 2009-01-02 07:28 · Score: 1

not any more difficult than tapes.
But disks are much more fragile. A dropped "box" (that's how Iron Mountain keeps track of them) of tapes is much less likely to result in damage than dropping that same box of disks. Same with throwing the box into the back of a truck and hauling them back and forth to the off-site warehouse.
Both are bulky
IDE drives are ~30% larger (by volume) than DLT & LTO tapes, and that doesn't include the padding needed to protect hard drives.

--
"I don't know, therefore Aliens" Wafflebox1
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 07:29 · Score: 1, Insightful

"There is no rational justification for tape anymore, what with the cost per TB stored on hard disks..."
Pardon? Once you buy the tape library (which admittedly can be pricey), the cost per TB on tape is a hell of a lot less with tape. Check out LTO4 for an example of a high capacity, high performance tape system. (Also, keep in mind that your high-end disk systems also drag along a big chunk of change in infrastructure before you can plug in disks, so the initial cost of a tape library is not such a straight win for disk systems either.)
Also, the monthly cost of spinning disk is also considerable in terms of power, real estate and cooling. And with the costs of disk x 2 for HA, you can get better value for your money with other stuff, *depending* on your requirements.
And before someone else brings it up, data deduplication can work just as well for tape as it does for disk. IBM's Tivoli Storage Manager (TSM) next version will have a data de-dup'ing built in, and it's due out in Q1 2009. (Let the "brand X de-duping is better than brand Y de-duping" wars begin!)
The moral of the story about disk vs tape vs software is that one solution does NOT fit all situations. Simple to implement doesn't mean cost-effective or even rational. Unfortunately, a good DR plan still requires people to think about disaster scenarios end-to-end and be focused on the business requirements rather than on one narrow definition of "good."
"There is no silver bullet."
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 07:30 · Score: 0

If you have lots of data, I don't see how tapes are really going to do the daily backup jobs. You've been looking at the wrong tape drives. How does 1TB per cartridge and 120MB/sec transfer sound? How about 30 years archival life and 100,000 load/unload cycles? You don't get that with a disk drive! Your NAS is likely to have these on the server side, either in a tape library for primary storage or backing up the disk farm. Don't bet your business on non-removable media!
Re:When is backing up *not* an option? by jstockdale · 2009-01-02 07:34 · Score: 1

No offense, but if you're in disaster recovery mode, I don't think you're going to care about drive depreciation or random access. You don't pull your tapes in from off-site because someone deleted the e-card their mom sent them for christmas -- you pull your tapes when a meteor hits your DC and you don't have any other option.

--
**AA: a bunch of mindless jerks who'll be the first against the wall when the revolution comes
Re:When is backing up *not* an option? by CdBee · 2009-01-02 07:35 · Score: 1

Pricewise, the reduced-hassle option of Amazon S3 at 15c/Gigabyte/month is nearly competitive with that

--
I have been a user for about 10 years. This ends Feb 2014. The site's been ruined. I'm off. Dice, FU
Re:When is backing up *not* an option? by Jon_E · 2009-01-02 07:37 · Score: 2, Interesting

that's where trixter needs to zfs send/recv the snapshots to an offsite location (and probably roll snapshots more frequently)
Re:When is backing up *not* an option? by femtoguy · 2009-01-02 07:39 · Score: 1

The biggest problem is that backup has two different purposes, and people are often confused about these.
1. Disaster recovery: You need backup for you system that covers you for failure of a disk. For this RAID 1 is adequate, though higher levels of protection would be nicer. In our data system, we do a complete live mirror of the file system, with double parity disks.
2. Archiving: This is to cover the loss of data not due to disk failure. The problem described in the article is this kind of failure, and is not covered by the disaster recovery setup. These problems also include people deleting files, people altering files, and wanting original files, curruption of files, whatever problems alter the filesystem. These are not fixed with disaster recovery because they alter the filesystem properly, and permanently.
The archiving problem can be done with snapshotting, which does provide coverage, and combined with RAID can provide both kinds of backup, but still leaves one weakness. If the entire system is always on line, the failure of the controller can take out multiple disks and disk systems, so something should really be kept off-line. For our system we have 5 sets of disks in ESATA holders that are swapped out in groups for off-line storage.
Finally these don't cover physical problems, such as fires. For this we store one of the sets of disks off-site for certainty.
If data is lost during the day, the snapshotting allows recovery of the data. If it happens during a week, either the snapshot or one of the sets of off-line disks provide backup. The off-site backup has the longest time horizon, but provides the longest time horizon of safety.
Re:When is backing up *not* an option? by postbigbang · 2009-01-02 07:40 · Score: 1

Yes, the silver bullet is to drive it right through the heart of your expensive heaping honking tape library. Maybe garlic will help, too.
You can data dupe (2x cost). But you can also just say no, and start the procedure that gets you alternate storage, randomly accessible, easily tested, highly portable, and without proprietary media and infrastructure for about the same price as drive+tape+wiretime+hotelcost+human_intervention costs.
This is neither immoral or silver bullet-mentality. This is real and now. The sooner we shake tape, the better off we are. I've spent toooooooooo many hours watching jukeboxes shuffle stuff around to find out a vendor screwed up archiving, blew their compression algorithm, had a crappy library transport, or whatever-- all proprietary. Days of my life have been spent watching these things; they should be recycled at the earliest opportunity for the madness that they are.

--
---- Teach Peace. It's Cheaper Than War.
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 07:47 · Score: 1, Insightful

So let's get this straight: rather than use "old fashioned" tape to produce offsite backups, you use...consumer grade hard drives to produce an onsite backup?

Do you and the admin at Journalspace.com share tips, by any chance?
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 07:49 · Score: 0

Oh come on, you can't seriously list all those added costs for SATA without listing the high cost of the tape drive for the tapes and the robot. Those are two *major* expenses.
Re:When is backing up *not* an option? by WuphonsReach · 2009-01-02 07:51 · Score: 1

Even accepting your price that's a cost of about 12.7 cents per gigabyte and you can get 800GB native LTO-4 tapes for about $50, which comes out to about 6.3 cents per gigabyte.

Except that:

- You have to spend $2500-$3500 on the LTO-4 tape drive.

- Oh, better make it two. Just in case the first one breaks. Or make sure that you know two other people with LTO-4 capable tape drives. (On the upside, LTO is pretty stable, with multiple vendors, so getting a replacement drive is probably not hard.)

Until you've bought and used 50 tapes, those individual tapes are costing you $100 each due to the cost of the drive.

2.5" 500GB SATA drives that can be hot-plugged are now about $110. A SATA to USB style miniature case is $15. So for $125 times 3, a small business can get started with an off-site backup for $375. Or go with 1TB 3.5" drives for $100, plus about $30 for a caddy. And they can simply grow the solution by buying additional drives/caddies/enclosures. And until they get to the point where they are juggling a few dozen of these external drives, it's still less expensive then the tape solution.

And restoration is a whole lot easier with hard drives. *Every* computer out there can (assuming the proper software) read from hard drives without the need for expensive tape drives. That ubiquity is worth an awful lot when it comes to disaster recovery.

Yeah, you can spin all sorts of numbers about how tape saves you more money over the long term. But in small / medium business - the ability to avoid a large cash outlay for an expensive tape drive generally wins.

(My preferred backup solution for small/medium companies is to 1. Have a in-house target server where backups get written to. 2. Shove daily updates of key information offsite over the wire every night. 3. Get a hard disk offsite weekly. All of which is generally a lot more then most companies manage to do. You can even script in Linux, with /devfs, that backup scripts will fire when good backup target drives are hot-plugged into the system.)

--
Wolde you bothe eate your cake, and have your cake?
Re:When is backing up *not* an option? by deviator · 2009-01-02 07:52 · Score: 1

hear hear - there's a reason tape is not dead yet in the enterprise.
Re:When is backing up *not* an option? by Trixter · 2009-01-02 07:53 · Score: 4, Insightful

That's not my company's policy, that's *my* policy. I can take a 3-month hit to my personal data. AND YET MY LAX PERSONAL POLICY WOULD HAVE SAVED JOURNALSPACE.
My *company's* policy is daily offsiting. Expensive, but very many of our locations could become a smoking hole in the ground and we'd still be able to restore and operate.
Re:When is backing up *not* an option? by FatdogHaiku · 2009-01-02 08:01 · Score: 1

AND not at the same location. I just pulled files off of a small business file server (RAID 0 with an automatic spare HD) that was sooty and water stained... where were the backups, on top of the server, of course. Big help when the building burned down.

--
You have the right to remain sentient. If you give up the right to remain sentient, you will be elected to public office
Re:When is backing up *not* an option? by RichardJenkins · 2009-01-02 08:02 · Score: 1

Not for us, we rent a server from Rackspace, dump all of our interesting data onto it overnight (overwriting any existing data), and use their 'managed backup' to actually keep point in time backups of the mirror.
We're going to looking to close the windows from one day to a few minutes for files, and real time backup from the databases using transaction logs.
What could possibly go wrong? Well, getting complacent could. A while ago we realised that backups ad stopped working, and no one had noticed. Sure am glad I learnt that lesson the EASY way. Still, I think we've built a pretty robust, straightforward, fast restoring process using internet based backups.
Re:When is backing up *not* an option? by BoberFett · 2009-01-02 08:12 · Score: 1

Nothing says you can't make a backup of a snapshot. What exactly are proposing? Replication? Replication isn't a backup either. Failover yes. Backup no.
Re:When is backing up *not* an option? by BoberFett · 2009-01-02 08:14 · Score: 1

How the hell does a SAN work as off site storage?
Re:When is backing up *not* an option? by Wdomburg · 2009-01-02 08:15 · Score: 4, Insightful

Fine. Get the cartridges, but what about the capital cost minus depreciation of the drive? What about random access?
Random access is why snapshots also have their place. :) Archival backups and nearline backups solve different sets of problems.
Now weigh those against an inexpensive jbod frame with a 2gb FC backplane.
What kind of capacity are we talking. For a small site you can pick up a little 2U unit that'll store 6.4TB uncompressed for under $5k. Or if you're running a larger site you can snag a 4U unit with two drives for about $15k that'll handle 30.4TB with optional expansion to 60.8TB native.
What's the write speed of LT vs a tasty little GB SAS drive?
120MB/sec per drive without compression. And now that you've talking about SAS drives your per TB cost is hopelessly optimistic. Even OEM packaged terabyte SAS drives are going to run you about a quarter a gigabyte, which is now four times the media cost of an LTO-4 solution.
Rackspace? You can put a dozen into about 4U.
So about 12TB in 4U compared to the 30TB unit I mention above.
Cooling? Although I'll grant you green cost, the random accessibility out-classes the seek time and tape insertion by a human cost dramatically.
Have you never heard of a tape library?
Stable media? Tape? Sometimes.
Properly handled tape is incredibly stable.
Shelf space?
If you're doing off-site storage, that's going to be an issue regardless of what media you're using. And as I pointed out, tape is far more compact and far lighter than disks.
No need to use tape anymore. Get out of the reality distortion field, but do the right thing by testing what you have and doing drills to ensure that whatever you have, works and is a procedure understood by all.
I'm not the one dismissing an entire class of technology while demonstrating ignorance of its costs and benefits.
Re:When is backing up *not* an option? by SmurfButcher+Bob · 2009-01-02 08:23 · Score: 2, Insightful

I'm not sure what planet you're on, but I wish the rest of us were there with you.
Backup media should be and must be transported offsite every freakin day. You'd do that with a hard disk? Or more correctly, you'd do that with a STACK of hard disks? Or is your building fire, flood (including broken sprinkler pipes), gas leak, and drunken-truck-driver proof.

--

help me i've cloned myself and can't remember which one I am
Re:When is backing up *not* an option? by mabhatter654 · 2009-01-02 08:25 · Score: 1

you refer to the rack that can hold SAS diks. You can load them up with 3.5" 10k SCSI 150GB for processing or with 7200 1TB disks for backup in RAID 6 so you can lose two and be OK. When the 1TB disks cost what a backup tape does, it's a no brainer.
The only problem is that you don't have a real off-site backup... so you still gotta put that data to tape at some point and sent it where it some place safe like a bank safety deposit box. So when the server rack falls over and squashes YOU taking the equipment with it, somebody can get that tape to rebuild.
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 08:26 · Score: 0

Looks like LTO drives run in the $3K-$4K range, which means that that first cartridge is going to cost you an extra $3.75-$5.00 per GB. As long as you plan on using a couple dozen tapes, you'll be ahead.
I think I'll stick to SATA drives for my (trivial) collection...
Re:When is backing up *not* an option? by synnthetic · 2009-01-02 08:41 · Score: 0

Here's my setup...
Backup Server, Old Compaq DL380, 4 x 500GB SATA drives ($350)
SATA 4 port card ($20)
LTO-3 Drive ($1500)
LT0-3 Tapes 30 x $30 = ($900)
SCSI 160 card ($25)
I've got Disk2Disk2Tape, 1.3TB online, 400gb x 30 on tapes.. nightly rsync from 4 different servers, weekly full backup to tape. Complete DR data backup (and windows system state bak's).. under $3k.
And if you set the blocksize to 256k, and use STAR, it'll fill a 400gb tape in 2 hours. That's on dual pentium 3's.
Re:When is backing up *not* an option? by gd2shoe · 2009-01-02 08:43 · Score: 1

For your consideration, a mirror does not need to be 2 drives. It could be 3 or more. In your hypothetical situation, you could be swapping out a third drive. It would allow for the extra performance and reliability of a mirror. Also, do it at night or another low usage time when the drives are otherwise idle. (FYI, I'm not convinced I like this idea, I'm just willing to add to it.)

--
I won't join Slashcott. OTOH, If Beta goes live, I just won't be back until it's fixed. Sorry Dice.
Re:When is backing up *not* an option? by mabhatter654 · 2009-01-02 08:48 · Score: 2, Insightful

can you restore a RAID with different hardware? With LTO3 tape I have several drive choices.. notably I can by a NEW drive and know the tape will work even 3-4 years out. What happens when the maker of your RAID solution moves on and wants to send you next year's model? Will the encryption and striping still line up on different hardware made by a different company?
Re:When is backing up *not* an option? by mabhatter654 · 2009-01-02 08:54 · Score: 2

can a stored disk drive sit on a shelf for a year or two of being tussled about and still work? Tapes have nearly all the moving parts external and replaceable in the drive, not the media. A cardboard box of backup tapes will survive storage pretty well.. a box of disk drives not so much.
Re:When is backing up *not* an option? by Wdomburg · 2009-01-02 09:05 · Score: 1

My intent wasn't to provide a thorough cost breakdown of the technology. That's impossible to do without taking individual site requirements and existing infrastructure and capacity into account.
My point was that quoting a wholesale drive cost was far from an accurate represent and that the cost of tape is often highly overestimated.
Is it right for every client? Of course not. The buy-in cost for the drive is far too high if a client has a relatively small data set. It can also be a poor fit for clients who consider nearline access of the data a priority, though in those cases a hybrid approach including snapshots is often the right answer.
On the other hand, the larger the data set the more attractive the lower media cost and higher density becomes. Likewise customers who have a need for deep backup cycles or true archiving (say for auditing puprosed).
In other words - don't trust people who try to sell you on the One True Way.
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 09:07 · Score: 0

a tape might be useful in the event of a sun flare or atomic weapons going off and EMPing all of your hard drives...tape would be nice in that scenario.
Re:When is backing up *not* an option? by moggie_xev · 2009-01-02 09:15 · Score: 1

If you have lots of data, I don't see how tapes are really going to do the daily backup jobs.
More tape drives, idiot, and parallel backups. Mainframes and OpenVMS have been doing it that way for decades.
If you have lots of data you need to do the maths about how much it costs to back the data up. And how much it costs to reconstruct the data again and decide whether it is worth backing the data up or not.

You also need to consider how long it will take you to do the restore from the backup. If you have 4 petabytes for example backing up a sizable proportion of that space takes a lot of time and energy.

However I would be surprised if this was the case here and not having a database backup is incompetent unless the sysadmin has writtern proof of the rejected po's.
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 09:21 · Score: 1, Insightful

You'll even be able to archive that LTO3 tape off-site for years and know that if you ever need to, you'll be able to read it in your new LTO5 drive.

With a SCSI or SAS hard drive you'll be lucky to even have the correct adaptors to be able to plug the sodding thing into your controller and power it up after two or three years...
Re:When is backing up *not* an option? by vrmlguy · 2009-01-02 09:23 · Score: 0

You store the disk drive in an off-site server, and keep it powered up all the time, slowly re-reading all the data so you can spt drive failures in a hurry. The big boys do off-site replication for disaster recovery, and take snapshots of that for off-site backups. The media never has to leave the DR site, yet it's safe from almost anything that would take out the primary data center.
(OK, I'll concede that an eruption of the Yellowstone supervolcano could take out data centers thousands of miles apart, but if that happens you've got other things to worry about.)

--
Nothing for 6-digit uids?
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 09:36 · Score: 0

LTO4=1600GB Compressed. 50 dollars a tape. Yea, tapes are no longer useful. What a load of $hit.
I run IBM Powwer6 P595's, IBM SAN behind SVC with pprc metro mirrors in dual data centers with HACMP/XD up the wazoo, we are talking hardware in the 10's of millions of dollars.
Guess what? We also put everything on tape. Yes, terabytes, lots of them, on tape, every night.
Re:When is backing up *not* an option? by rrohbeck · 2009-01-02 09:43 · Score: 1

How the hell does a SAN work as off site storage?
If the fiber is long enough or you use a fast WAN.
I've been involved in a project where a customer rented space on the other side of a river and a fiber, then ran full speed FC to a tape library across the river. Clearly good enough to protect against fire and tornadoes, this customer's main concerns. Hurricanes or earthquakes, probably not so much.

--
thegodmovie.com - watch it
Re:When is backing up *not* an option? by postbigbang · 2009-01-02 09:51 · Score: 0

And have you noted the difference between "consumer grade" hard drives and others? Do you realize the madness in MTBF statistics used by the various hard drive vendors? Do you track the failure modes between OEM and "consumer grade" drives? Do your research and put your money where you believe. I do.

--
---- Teach Peace. It's Cheaper Than War.
Re:When is backing up *not* an option? by postbigbang · 2009-01-02 09:53 · Score: 1

Truly, you've been listening to too many sales guys.

--
---- Teach Peace. It's Cheaper Than War.
Re:When is backing up *not* an option? by rrohbeck · 2009-01-02 09:55 · Score: 2, Insightful

Anybody who uses disk based backup for a while finds out that it needs to be augmented by tape sooner or later. A disk based subsystem gets full pretty soon once you get used to the convenience. If you continue to buy disk drives it gets really expensive so people find that the best of both worlds is D2D2T. This reduces the size and speed of the tape subsystem you need, but doesn't make it obsolete. You need offsite storage anyway.

--
thegodmovie.com - watch it
Re:When is backing up *not* an option? by postbigbang · 2009-01-02 09:56 · Score: 0

Oh-- do we ever agree about that. But who says you have the data in just one building? My snapshots and mirrors are 13mi from where I sit. The backups for those are nine states away. Another mirror of those is in Canada. Cost? Not that much, but my payloads aren't the size of other organizations.
And listen: there's not one freaking tape. Not one. And it's randomly accessible fully, and the MTTR for primary systems is under eight minutes from click to verified. You, too, can do it.

--
---- Teach Peace. It's Cheaper Than War.
Re:When is backing up *not* an option? by postbigbang · 2009-01-02 09:59 · Score: 1

WIth no drives to restore onto, whatcha gonna do?
What about persistence, bleed through, good old oxidation, and leaving them in the back seat of a taxi?

--
---- Teach Peace. It's Cheaper Than War.
Re:When is backing up *not* an option? by Fulcrum+of+Evil · 2009-01-02 10:02 · Score: 1

I have somewhat lower requirements, so my options are a) tarballs copied somewhere on another server and b) a SATA DDS3/4/72 drive and some $8 DDS tables. One tape is quite sufficient to my needs, and backup tapes are a lot sturdier than hard disks.

--
"We returned the General to El Salvador, or maybe Guatemala, it's difficult to tell from 10,000 feet"
Re:When is backing up *not* an option? by postbigbang · 2009-01-02 10:06 · Score: 0

We disagree.
If you design a sane availability system, you can obviate tape and all the crap behind it. It's all about availability, audit, and a systematic approach to provisioning, archiving, and lifecycle.

--
---- Teach Peace. It's Cheaper Than War.
Re:When is backing up *not* an option? by Matt+Perry · 2009-01-02 10:25 · Score: 0

Even accepting your price that's a cost of about 12.7 cents per gigabyte and you can get 800GB native LTO-4 tapes for about $50, which comes out to about 6.3 cents per gigabyte.
You're conveniently ignoring the cost of the tape drive. The cheapest LTO-4 drive on Newegg is $2230. That makes a tape backup solution much more expensive per TB than hard drives, at least until you have a *lot* of tapes.

--
Slashdot: Failed Car Analogies. Amateur Lawyering. Anecdote Battles.
Re:When is backing up *not* an option? by speedingant · 2009-01-02 10:50 · Score: 1

Tapes are old news if you're looking at a shit load of data. I recycle JBOD arrays, and archive monthly to tape. Tapes are slow.
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 10:57 · Score: 1, Insightful

If you design a sane availability system, you can obviate tape and all the crap behind it. It's all about availability, audit, and a systematic approach to provisioning, archiving, and lifecycle.
And itsy-bitsy data sets apparently. But in a world where one's data is measured in tens of terabytes rather than hundreds of gigabytes, tape is still king.
Re:When is backing up *not* an option? by X0563511 · 2009-01-02 11:11 · Score: 1

SMART doesn't work over USB.
You really should be checking smartcrl -a before touching a drive for backup...

--
For large sets, this will be our guide even unto death, for the LORD will work for each type of data it is applied to...
Re:When is backing up *not* an option? by X0563511 · 2009-01-02 11:12 · Score: 1

It's called risk management. You can't have it all.

--
For large sets, this will be our guide even unto death, for the LORD will work for each type of data it is applied to...
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 11:15 · Score: 0

Do you track the failure modes between OEM and "consumer grade" drives?
Do you realize that this sentence doesn't even make sense?
Do you also realize that the components of enterprise drives have to meet stricter tolerances than the components of consumer equipment? That it's not just a matter of reading some MTBF statistic?
Re:When is backing up *not* an option? by mabhatter654 · 2009-01-02 11:24 · Score: 2, Insightful

again with the power requirements! To keep even month and year would require a massive amount of extra hardware and power, not to mention people to tend it. I'd agree it's super good at recovery but how much more value are you getting versus tapes in a safety deposit box?
Re:When is backing up *not* an option? by michaewlewis · 2009-01-02 11:28 · Score: 0

I hope you're joking about the raid 0 with hotspare.... What good is a raid 0 with a hotspare going to do? If you lose one drive, the array is dead. No hot spare is going to revive it.
Re:When is backing up *not* an option? by Nutria · 2009-01-02 11:50 · Score: 1

dump all of our interesting data onto it overnight
How much? We back up 10+ TB every night, across a range of mid- and mainframe systems.
We're going to looking to ... real time backup from the databases using transaction logs.
We've been doing this for decades. (Since at least 1984 on OpenVMS, and most probably earlier. It's the Durability in ACID.)

--
"I don't know, therefore Aliens" Wafflebox1
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 11:52 · Score: 0

One of my former employers was going to great pains to do off-site backups to a whole ton of different places. We had rented servers all over the world that existed only to be the target of automated daily backups, along with backing up to tape and taking that away to store.
This was in an organisation with only five employees that grazed bankruptcy every few months. One of my co-workers one day made the very good point that if there was such a catastrophic event that it simultaneously took out both our datacenter, our office and the facility where the tapes were stored (which, I seem to remember, was the CEO's house) the data we had stored probably wouldn't really matter that much because the company -- and probably much of the country -- would be finished anyway.
Re:When is backing up *not* an option? by Nutria · 2009-01-02 12:01 · Score: 1

If you have lots of data you need to do the maths about how much it costs to back the data up. And how much it costs to reconstruct the data again and decide whether it is worth backing the data up or not.
Of course. We develop/run apps & databases for state and multi-state gov't agencies, which those agencies derive much income from. They'd be highly pissed we couldn't restore the system to the minute of the crash.
Which we've had to do a couple of times after "unfortunate incidents".
You also need to consider how long it will take you to do the restore from the backup.
Again, of course. PCI compliance auditors make us, quarterly, restore a full database to a test rig, and yearly do a complete DR test at our Sungard cold site.
Stove-piped systems have the benefit that data can accumulate on one machine while we restore the "next phase" system. Only the customer-facing system needs to be clustered/hot-failover, etc.

--
"I don't know, therefore Aliens" Wafflebox1
Re:When is backing up *not* an option? by Nutria · 2009-01-02 12:05 · Score: 1

I'm a huge fan of tapes, but I've done some backup implementations that were better suited for a couple of hard drives in an off-site rotation.
Must have been small systems.

--
"I don't know, therefore Aliens" Wafflebox1
Re:When is backing up *not* an option? by dbIII · 2009-01-02 12:09 · Score: 1

There is no rational justification for tape anymore
How do I read that disk from 1982? How can somebody read my current IDE drive in 2029 after the capacitors have leaked or corrosion has occured or one of the many points of possible failure in comparison to very simple tapes? Also disk transport is still tricky in comparison to tape, and to ensure it works people do things like ship entire systems around for a mere 57GB of data instead of putting it on a tape (or USB drive).
I'm not just making this up, I do read in tapes from 1982 becuase some data is really expensive to collect again. Rocks don't move about much in twenty years so the data is still valid and a lot more can be obtained from it with the computing power we have now.
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 12:09 · Score: 0

And have you noted the difference between "consumer grade" hard drives and others?
Speed, density (LOWER density is better in enterprise systems), quality of the components (platers, spindles), testing, environmental ratings, MTBF, warranties.

Anything else?

Do you track the failure modes between OEM and "consumer grade" drives?
Huh? What does that even mean?

Do your research and put your money where you believe. I do.
Er, right. Well I believe that using hard drives, and particularly consumer grade drives, as your primary backup solution is an utterly terrible idea and that you or I would be laughed out of any competent IT department for even suggesting it.

You seem to have a blinkered idea of what "backups" actually mean. Throughout this thread you have made no provision for large datasets (you can't offsite TBs of data over a WAN), "snapshot" or audit backups (I.e. monthlies), archival backups or basic DR provisions. You have a wonderful online "backup" solution that works for you, for now, and you're happy with it: but I wonder just how much it cost you to setup and maintain, how that would have compared to an "old fashioned" D2D2T setup, how reliable it will prove when you need it and just how well it will scale in the future.

Of course you, like so many others before you, think you know better than the combined wisdom of the decades of knowledge of previous generations of admins. So why worry?
Re:When is backing up *not* an option? by postbigbang · 2009-01-02 12:16 · Score: 1

In 1982, your tape format was proprietary. Maybe, if you're lucky, the tape hasn't oxidized. Maybe, if you're lucky, you can still read the ink on the label.
Or, perhaps you put the archive into a new SAN cache when you copied over the data set and bindery.
There might be media in Wang format, or maybe Lanier, or a hard-sectored 8". 3740 format tapes can still be read, unbelievably.
But you're citing 1982, and now that the tape is 26yrs old, was found in some forsaken warehouse, and now how are you going to get at the data? Now that there's an Internet, and online storage is not only feasible but desirable, rethinking all of this takes a measure of disbelief coupled to courage. That's how SANs are working, virtualized infrastructure, and highly audit-compliant availability systems are constructed. Newspapers are also dying; can't wait for Liquid Paper to become the analog, just like I waited for disks to drop in price to make a practical alternative to the hideous nature of tape archiving.
We don't CLOAD anymore. We don't use punch cards. We got rid of punch-tape. Now let's get rid of tapes.

--
---- Teach Peace. It's Cheaper Than War.
Re:When is backing up *not* an option? by Darkk · 2009-01-02 12:17 · Score: 1

Just hope you're also dumping backed up data from the NAS to some kind of a removable media, i.e. tapes so it is physically separated from the servers into a vault of some kind.
Lightning can strike and wipe out your servers. Long as your data are kept elsewhere you can easily rebuild your systems from backup.
Re:When is backing up *not* an option? by OzRoy · 2009-01-02 12:17 · Score: 1

He sounds like someone who has actually done his research.
Every technology has it's place and tape is still importatant.
Re:When is backing up *not* an option? by Vanders · 2009-01-02 12:18 · Score: 1

Tapes are slow.
Who told you that?

HP StorageWorks LTO-4 Ultrium 1840 Tape Drive represents HP's fourth-generation of LTO tape drive technology positioned as HP's first tape drive capable of storing up to 1.6 TB per cartridge... Capable of data transfer rates up to 240 MB/sec
If God forbid your old JBOD lets out the smoke the day before your next tape archive is due, or something else happens and you lose the site, you've just lost a months worth of data. What's a months worth of data worth to your company?

--
Syllable : It's an Operating System
Re:When is backing up *not* an option? by postbigbang · 2009-01-02 12:20 · Score: 0, Troll

You mistakenly believe that consumer grade drives have different MTBFs than OEM or 'enterprise' drives. What you're paying for is a vacation cruise for your sales rep.
I'm fully aware of differing needs for differing applications. I firmly believe all can be done with hard drives and even (in the future) SSD drives.
The availability systems I've engineered were through through, with audit and compliance and low MTTR in mine. It scales like crazy, and even more cost-effectively every month that hard drive costs go down-- and they do, every month since about 1988 in terms of cost per GB stored through a five year life cycle.
My decades of experience are unable to be exposed here. My years of battling the farce of overpriced, underperforming availability systems is the crux of my posts.

--
---- Teach Peace. It's Cheaper Than War.
Re:When is backing up *not* an option? by dbIII · 2009-01-02 12:21 · Score: 1

Nice put down - invincible ignorance FTW. Meanwhile the rest of us will listen to a lot of people, consider a lot of options and do the best for the circumstances we can budget for. You might consider looking up SDLT and LTO - hell even an IBM 3480 from decades ago blows your objections above out of the water. There are other things - the sort of real time data streaming that you could do to two tape drives at the same time for twenty years is only recently reliable now to disks for example. The tape drives have also got a lot cheaper along with pretty well all other electronics.
Re:When is backing up *not* an option? by postbigbang · 2009-01-02 12:22 · Score: 1

The place for tape technology is in a suitable recycling facility where it belongs. Go ahead: ask the hard questions to your vendors about what drives they use, the origin of them, and how they're different. Go ahead. Then do the checking on your own by pulling out some spares and finding out from Seagate, IBM, Hitachi, etc more info about those drives. Then find out how much the label cost, and how much that means to you.

--
---- Teach Peace. It's Cheaper Than War.
Re:When is backing up *not* an option? by Darkk · 2009-01-02 12:29 · Score: 2, Insightful

Not too bad of an idea, least there is duplicate data at a different location which is better than what these guys are proposing.
Ideal thing is to have the duplicated data be MILES from the main DC site but it's not always practical when you have large volume of data to replicate and backup. Yes I know all about rysnc and smart data replication but still nothing like good old fashioned complete full backups at a expense of time. That always seems to work well for most people.
Re:When is backing up *not* an option? by speedingant · 2009-01-02 12:39 · Score: 1

Well it really all depends on your budget. I was basing that on our backup solution, so didn't delve into details. Not everyone can afford expensive tape arrays, and we have a single HP drive, which is slow.
We have roaming homes, which are all stored locally on each machine as well as the server. We have those backing up to a RAID 5 array, which is backed up nightly to a JBOD array. Each user machine is also backed up incrementally every 5 hours to another RAID 5 array. We have two JBOD arrays which get recycled. The archives are stored in a firesafe, both offsite and onsite.
The most we would lose is a days worth of data if anything silly happened, or if there was a fire in the server room. If a major fire broke out and we lost absolutely everything, a months worth of data lost is the least of anyones worries ;)
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-02 13:24 · Score: 0

It also helps to actually be able to restore from the backup once it happens. Nothing is more pathetic than a bunch of "IT Experts" standing around with not a single clue of how to restore the data from backups. Please people, practice before the fire!
Re:When is backing up *not* an option? by MightyYar · 2009-01-02 13:24 · Score: 1

With services like Mozy, daily offsiting is $5/month for home use. And you can use your own encryption key, so security isn't really an issue even for us paranoid geeky types.

--
W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
Re:When is backing up *not* an option? by dbIII · 2009-01-02 13:35 · Score: 2, Interesting

I'm not talking in the abstract here - the tapes was on nine inch reels with the data in SEGD format and now the data is now on the system. Nice set of assumptions you have there, but I was using a real example.
Disks are still crap for archival storage and for most of us they are still crap for daily offsite backups. It would be nice to have a second site and a really fast, cheap way to get terabytes of data to it - but for me the expense of that is a lot more than taking a box of tapes to a storage shed. Eventually those daily tapes become archival tapes when projects finish. To cap it all off, most of the incoming data where I work comes in on tape. DVDs are utter crap when you want to be sure a lot of data can actually be read at the endpoint and only hold 4.5GB anyway. Portable USB drives are replacing tapes for transport but they are just as slow to read and fragile to transport, and even then you get the time consuming crap of spending twenty minutes going around the building to find someone with the same sort of USB drive because it came without cables.
The final point is I really do not want to have to maintain the systems to have a couple of hundred terabytes of storage when the working set is well under 5 terabytes and there would be a lot of repition in that few hundred terabytes of recovered tapes. It then also raises security problems to ensure client data is not easily accessable by other clients. Disks are not the best for archival storage. One of the reasons I was using tapes from 1982 is that nobody was interested in the area the data came from between that data and this year. Disk space is still too expensive to keep two and a bit copies of something big for more than twenty years :)
Tapes are often annoying and I sopmetimes wish we could be rid of them but there are many circumstances where they are useful - paticularly archives, transport and backups. While I've been happily using rsync to take daily snapshots of important stuff for years and distributed it to machines in different buildings it still cannot always replace tape. I've had too many dead hard drives or unreadable DVDs (plus they are too small) to seriously consider them as archival storage. As for transport - why are USB hard drives so horribly slow? The ease of random access is irrelevant when you want a lot of files from the disk so LTO and SDLT still win there.
Re:When is backing up *not* an option? by postbigbang · 2009-01-02 13:53 · Score: 1

USB write times are horrible. SSDs are getting faster, but there's a need to RAW that makes them slow, and the method that's used to write isn't like rotational media.
There are reasons to consider things like virtualization, FastScale, and other methods to cut down on the datasets. Then you need to have what's really needed in terms of audit, legal, and compliance needs. Then you factor reliability and availability.
The old maxim of online, near-line, and offline storage doesn't hold water these days. Random access and lack of human intervention and media management costs just push the fulcrum the other way now.

--
---- Teach Peace. It's Cheaper Than War.
Re:When is backing up *not* an option? by lgw · 2009-01-02 14:12 · Score: 1

Of course, if his company can't survive the physical office being destroyed by the disaster in the first place, then his backup plan is perfectly sound.

--
Socialism: a lie told by totalitarians and believed by fools.
Re:When is backing up *not* an option? by timeOday · 2009-01-02 14:18 · Score: 1

Amen, and why I just love ZFS (or any filesystem that supports instant snapshots).

This sounds good. Does performance degrade once you make a few snapshots? I've noticed with VMWare, snapshots REALLY kill performance, presumably since the current state of each file is strung out across a number of files in different places. (Then again, SSD Drives might be fixing that for us).
Re:When is backing up *not* an option? by lgw · 2009-01-02 14:20 · Score: 2, Insightful

Your setup sounds *completely* vulnerable to a single malicious employee (with the right passwords). Typical engineer: protect against multiple failure modes, and disregard malice.
You don't have a backup until you seperate the data from the ability to destroy that data. Of course, there is WORM disk storage that meets that need, but that's far more expensive than tape (though handy for meeting auditing requirements).

--
Socialism: a lie told by totalitarians and believed by fools.
Re:When is backing up *not* an option? by dbIII · 2009-01-02 14:20 · Score: 1

Then you need to have what's really needed in terms of audit, legal, and compliance needs
Forget that stuff for a moment - sometimes you want to keep a lot of data beacuse it will actually be useful later on. That can vastly exceed the amount required for audit, legal, and compliance. In my case it is geophysical data which used to be very expensive to process the way we can now. Conversely the data would be very expensive and time consuming to recollect, and if you keep the tapes you don't need to since the recording resolution is still about the same. It's not much more than 1000 of those old reels and you have a terabyte (God knows how many are in all those boxes but it's more than that - plus they are actually all copies and the clients have the originals), then you have progressivley more modern tapes that hold up to that much each. A single simple marine seismic survey is going to give you around 5TB of data, and you never know how much of it is useful and each filter applied to the entirity of that data is going to give you another 5TB. It all adds up to a lot of file servers hopefully located at two sites if you want to keep that all on disk. I'm in Australia so the hosting costs for a lot of data make it out of the question and the bandwidth charges to keep things in sync far more so (no option of leasing lines or whatever you do in the USA - all traffic is metered and has to be paid for due to a telecommunications monopoly). We are talking about the difference between a multimillion dollar expense or hiring a storage shed. Since the data comes in in tape I'd still need the drives even if I had the mirrored sites.
Re:When is backing up *not* an option? by lgw · 2009-01-02 14:23 · Score: 1

And if Amazon loses or trashes your data they .. apologize? Refund that month's fee? What's in the SLA - does Amazon make any promise at all to actually protect your data?
Of course, you might just decide that it's unlikely that Amazon and your data center will lose the same data on the same week, and maybe that's OK. Of course, a malicious employee could still trash both copies, so I'd still want offline, offsite backup if only for that reason.

--
Socialism: a lie told by totalitarians and believed by fools.
Re:When is backing up *not* an option? by FatdogHaiku · 2009-01-02 14:31 · Score: 1

Sorry, it was RAID 1. The spare dropped in when something happened to one of the other drives, worked nicely and the array is what saved the files... steam and soot is not a good component coating.

--
You have the right to remain sentient. If you give up the right to remain sentient, you will be elected to public office
Re:When is backing up *not* an option? by saynt · 2009-01-02 14:43 · Score: 1

I think that you aren't really considering the power of scale in this issue. As of last Friday, I had nearly 6000 LTO-3 tapes in rotation. Granted, about 2500 of those are long term archives, and only get pulled in to do a restore, or when we re-tension or migrate media (2yrs. & 6yrs), but this kind of scale makes the cost of drives (14), and even tape robots level out greatly. My concerns are media stability, transportability, and tracking. Snapshots to drive surely have a place, and help us keep our service level for quick restores high, but for any large (or medium, like us) storage installation, tape is the only real option.
Re:When is backing up *not* an option? by postbigbang · 2009-01-02 15:06 · Score: 1

Seismic data is a bit different than transactional business data and I'm guessing it's subject to non-lossy codecs and other warehousing techniques that are just plain different.
I'll bet there's a way to compress that data and squish it into something that's more manageable.... especially if it's of a low sample rate. But I really don't know.....
My rationale: while no one's been looking, drive prices have (sorry to use this word) crashed. But drive reliability is getting really good. I don't see having to have two drives, one an enterprise 'branded' drive, another an 'OEM' drive, and still a third, a 'consumer' drive. You might pay for overnight service, but the quality differences aren't very much. A high speed SCSI or SAS or FC-modded drive is still damn fast in any one of the aforementioned categories and getting cheaper by the week. Tape drives are proprietary, slow, not randomly accessible, and require a lot of human intervention.
Throwing 5TB into a server mosh pit and crunching it until you get some meaning is a huge dataset, and unusual. Yet arrays can do this easily, and as they're arrays, often quickly. I'm guessing that a non-tape solution is viable, but I'm not experienced in such datasets and how they're processed.

--
---- Teach Peace. It's Cheaper Than War.
Re:When is backing up *not* an option? by CAIMLAS · 2009-01-02 16:14 · Score: 1

I know a guy who's a bit of a technophile, and did a similar thing. Local cable company was laying fiber optics in his locality a while back. Small town, few reasonable concerns of disaster other than maybe flooding by work, or rare case a tornado. He lived about a quarter mile from work, up a large hill.
At about the same time as the cable ocmpany was laying fiber he started to replace a lot of the old copper with fiber for the longer switch runs.
This locality was/is very cold, and for whatever reason the cable company dug a lot of their ditches in length (ie multiple blocks at a time) and left them late fall to finish the job in the spring after the thaw (yes, this is in the sticks). The cable was in the ground and partially covered, just not 'finished'.
Long story short, he bought extra fiber as well as a long spool of a hard plastic conduit of sorts - it was enough to reach his place. His boss got approval from the city to use the ditch the cable company used (IIRC) and rented a bobcat for a weekend (mainly to clear the snow). It was like what you'd use for a fish tank, but bigger. He used the conduit as a sheath for the fiber, and just laid it on top of the cable company's bundle conduit - from his office to his house. Talk about job security...

--
~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers
Re:When is backing up *not* an option? by WuphonsReach · 2009-01-02 18:23 · Score: 2, Insightful

Agreed. Tape makes a lot of sense for high-volume applications.

It's just being crowded out of the low-end market by ever larger and ever cheaper hard drive sizes. Tape costs would have to drop by about a factor of 4 (or more) to compete in the lower end of the market where 100 tapes is a lot.

(If I could backup 800GB for $10, that would be much more of a no-brainer decision. The cost-advantage would be high enough to pay for the expensive tape drive. And $50 LTO-4 tapes are a lot better then back when a lot of large-capacity tapes cost $100 each.)

--
Wolde you bothe eate your cake, and have your cake?
Re:When is backing up *not* an option? by JoelKatz · 2009-01-02 19:05 · Score: 1

When the SAN supports snapshotting and remote synchronization. However often you want, you snapshot the SAN and synch the image to a remote SAN over your WAN.
Re:When is backing up *not* an option? by saynt · 2009-01-02 19:42 · Score: 1

Good points, all. I use a 500GB drive at home to do back up. I have an DLT8000, but it's just not worth the trouble to dig it out and get it hooked up anymore. It doesn't really protect me against physical hardship, since the backup drive is kept in the same house as the primary, but it's a nice insurance policy against drive failure and human error. For a small web company that I'm working with, we're using Amazon S3 and JungleDisk for their Windows based app server. I'm not sold on it yet, but it's another option and seems to be a reasonable way to make sure critical data is off site.
The move to commodity pricing on tapes has been a nice development for us, I remember paying upwards of $200 for DTF2 tapes...
Re:When is backing up *not* an option? by dbIII · 2009-01-02 20:15 · Score: 1

I'll bet there's a way to compress that data and squish it into something that's more manageable
There is but since there is no established standard it can only really be used for transport and only then if you include the compression and decompression software. Think of it as several hundred tracks of audio data - zip etc just doesn't work and usually produces bigger files. Stuff like segyzip which does mp3 like compression is produced by single individuals and closed source so cannot be relied upon for archival storage (in fact I get a 404 not found for it's web page).
Also while hard drive price have crashed so have tape drive prices. Even an LTO2 for under $2000 can record faster than a mid range server with RAID5 can feed it, and an LTO4 is a lot faster than that, and there are other options (some tape drives can sustain 350MB/s and hold a TB per tape)
Anyway, the point I have been gently trying to get across is that tape still has a place because there are things you have not considered. Random access is not a big deal for the primary uses of tape anyway - recovering the entire contents of dead storage areas or transporting a whole lot of files. They are a pain for getting that deleted email on a paticular users account from last week but that does not render them useless in other situations.
The last point in the earlier post about 5TB arrays is not what I'm referring to really. You have that data on disk but when the project is finished it all gets backed up onto a couple of sets of tapes and you use the disks for a different project. You actually need a bit more disk than that, typically the project grows to a large size and then is trimmed down at the end of the project when the dead ends are deleted. You end up with three datasets while the project is running - the original data (usually multiple copies at multiple sites), a list of the transforms applied to the data (that is where disk snapshots are good and it's only in the gigabyte range) and the working datasets which can be terabytes and can be recreated given the other two bits of information. In the end you free up a lot of storage by shifting the finished data to tape where you can forget about it until something happens like the client merges with another company five years later and the new management want the data examined again. Having a couple of fileservers running for five years just to store that data is a huge amount more expensive than just putting it all on tape and storing it in a couple of places.
Re:When is backing up *not* an option? by nacturation · 2009-01-02 22:00 · Score: 1

His boss got approval from the city to use the ditch the cable company used (IIRC) and rented a bobcat for a weekend (mainly to clear the snow). It was like what you'd use for a fish tank, but bigger.
A bobcat for a fish tank? Isn't that a bit overkill?

--
Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
Re:When is backing up *not* an option? by OzRoy · 2009-01-02 22:53 · Score: 1

You are dangerous. You are the sort of IT person who has had their own limited set of experiences and believes that everything they know is a suitable solution for every situation.
You have read the reasons for the continued existence of tape. You can continue to believe what you want to believe. I hope for your sake you never end up in a situation like these Journalspace guys where you end up with a major problem because you thought you knew better, and you ignored other people's experience.
Re:When is backing up *not* an option? by CAIMLAS · 2009-01-03 06:30 · Score: 1

Hah! Yes, it is. I was drunk/tired and not thinking linearly.
The hose he used to protect the fiber was like the tubing for a fishtank, but bigger. The bobcat was for the ditch. :P

--
~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-03 12:38 · Score: 0

Amen, and why I just love ZFS (or any filesystem that supports instant snapshots).
This sounds good. Does performance degrade once you make a few snapshots?
ZFS snapshots cost nothing other than disk space for the bits that have changed. Performance is not degraded; if anything, it takes marginally less effort to have snapshots than to not have them, because ZFS uses copy-on-write and so snapshots relieve some of the effort of freeing blocks.
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-03 15:29 · Score: 0

You missed a step - make sure you can read the tapes back and the data is actually there!
I';ve seen backup tapes that could not be read at all, and tapes that could be read but - oops - the data had not actually been saved, just some headers.
Re:When is backing up *not* an option? by Anonymous Coward · 2009-01-04 10:04 · Score: 1, Insightful

SAN fabric is dirt these days. You can get a nice Silkworm and a cheap-but-reliable SAS backplane for dirt as well. Perhaps a couple of GBICs.... or some handy-dandy fiber cables (also dirt these days) and you're in business.
Out of curiosity do you have any idea what you're talking about? Fiber or GBICs? You're gonna need both...
Also SAN/LTO are hardly mutually exclusive. Plenty of tape libraries are san attached.
Re:When is backing up *not* an option? by tehcyder · 2009-01-05 03:15 · Score: 1

I thought we all had our data on the Cloud now?

--
To have a right to do a thing is not at all the same as to be right in doing it
Re:When is backing up *not* an option? by EvilBudMan · 2009-01-05 07:20 · Score: 1

Full always at night if time is not an issue. Off site one a week. If you have a fire take out your business you have to decide how much time you can loose. I have experienced this with few backups. We rebuilt everything from scratch and got rid of a lot of crap at the same time. We decided a week max. with only 30 employees.
If you gotta be on all the time (1000 employees) though then fiber to another building is the way to go.
Tape is still the way to go though and will probably be so till I'm dead. It just will not die.
Re:When is backing up *not* an option? by EvilBudMan · 2009-01-05 07:24 · Score: 1

They are probably not any more expensive but tapes have advantages of their own. You wouldn't think sequential access would be an advantage but it is when storing the data.
Re:When is backing up *not* an option? by vrmlguy · 2009-01-07 03:19 · Score: 1

Most data is static. Once you receive an email or IM, it never changes, while blogs mostly just append new posts and comments. As a result, pointer-based snapshots only need to track data that has changed since the snapshot was taken. If you have one TB of new/updated data per year, then 12 monthly snapshots will need 6.5 TB, 12 weekly ones will need 1.5 TB, and 12 daily 0.214 TB, totalling 8.214 TB. That may seem like a lot, but if you're backing up 20 TB then it's 42% overhead. And since the snapshots will probably be kept on 7+1 RAID, it could be less than 24% of your raw storage.

--
Nothing for 6-digit uids?

stunned silence by Anonymous Coward · 2009-01-02 05:29 · Score: 0

Slashdot is stunned into silence.

Re:stunned silence by conureman · 2009-01-02 05:42 · Score: 5, Funny

I am experiencing a strange phenomenon. The jaw-drop reflex has been popping my mouth open for several minutes and won't stop. If I focus I can close it, but then it pops open again. wow.

--
The cost of that cleanup, of course, will be borne by taxpayers, not industry.

Dear Every Corporate Tool in the Universe: by yttrstein · 2009-01-02 05:29 · Score: 5, Insightful

And that's why your IT department actually needs funding. Sleep tight.

Re:Dear Every Corporate Tool in the Universe: by TubeSteak · 2009-01-02 05:43 · Score: 5, Insightful

And that's why your IT department actually needs funding. Sleep tight.
They've had the site live for 6 years.
This wasn't a lack of funding, it was just sheer stupidity.
6 years and nobody ever thought it'd be a good idea to back everything up to dvd or an external hard drive. HTML compresses really well in case they didn't know.

--
[Fuck Beta]
o0t!
Re:Dear Every Corporate Tool in the Universe: by yttrstein · 2009-01-02 05:45 · Score: 4, Insightful

Stupidity IS a lack of funding. Pay the salary of someone smart enough to handle your data correctly if you have no interest in becoming smart yourself. Simple.
Re:Dear Every Corporate Tool in the Universe: by Vanders · 2009-01-02 05:48 · Score: 2, Insightful

Apparently they only had the two drives: one mirroring the other. They had one single drives worth of data. Just think how little data that really is, for a moment. They could have bought a second hand tape drive from eBay and a box of new tapes for a couple of hundred dollars and had a complete backup solution going in one afternoon.

For want of a nail, the shoe was lost.
For want of a shoe, the horse was lost.
For want of a horse,the rider was lost.
For want of a rider, the battle was lost.
For want of a battle, the kingdom was lost,
And all for the want of a horseshoe nail.

--
Syllable : It's an Operating System
Re:Dear Every Corporate Tool in the Universe: by Volante3192 · 2009-01-02 05:48 · Score: 3, Insightful

Never underestimate the beancounter's desire to save every cent possible. If your site's working perfectly fine, well, what's the point of having backups? Seriously, I see this happen all the time with small businesses. "Oh, it's never failed before, why do we need backups?" Then the server implodes.
Course, they then get pissed at us for not preventing it, but what do they expect us to do, shell out for a tape drive with our own cash? I think not.
Re:Dear Every Corporate Tool in the Universe: by IMarvinTPA · 2009-01-02 05:56 · Score: 1

Do the backups, then charge them back-costs on the service they didn't think they needed, but now do. If not, let them play with the disk recovery companies, etc.
IMarv

--
Trusting software vendors is no smarter than trus
Re:Dear Every Corporate Tool in the Universe: by mrchaotica · 2009-01-02 05:56 · Score: 4, Insightful

Hell, they could have spent $50 on a USB hard drive (i.e., half-assed it) and been better off!

--
"[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz
Re:Dear Every Corporate Tool in the Universe: by slugtastic · 2009-01-02 05:57 · Score: 5, Funny

I'm more surprised that the site lived for 6 years without back-up. That's pretty hardcore.
Re:Dear Every Corporate Tool in the Universe: by modmans2ndcoming · 2009-01-02 06:01 · Score: 5, Funny

Screw that!! IT Departments are cost centers and have absolutely no benefit to the bottom line of a company... none at all... nope.
Re:Dear Every Corporate Tool in the Universe: by Kjella · 2009-01-02 06:09 · Score: 5, Insightful

Being too stupid to recognize your own shortcomings is also a form of stupidity. Or hubris, whichever is more appropriate.

--
Live today, because you never know what tomorrow brings
Re:Dear Every Corporate Tool in the Universe: by JoJo's883 · 2009-01-02 06:19 · Score: 1

That is true until a company gets sued for a couple million $$ by a customer for losing all of the customers mission critical data. The payout will indeed have an impact on the bottom, middle, and top lines...
Re:Dear Every Corporate Tool in the Universe: by zippthorne · 2009-01-02 06:22 · Score: 1

"HTML compresses really well in case they didn't know"
That's a good point. Why aren't websites/browsers taking the openoffice approach and sending some kind of zhtml files. /. discussions could really benefit, and sites could package up most of the common elements into a single cache-populating download.

--
Can you be Even More Awesome?!
Re:Dear Every Corporate Tool in the Universe: by bb5ch39t · 2009-01-02 06:24 · Score: 4, Funny

Absolutely correct! Why management here is drooling over how much they money could save if they just didn't need the damn IT department. And all those damn desktop computers! Why, life was much better back in the days of paper ledgers and pencils! Sigh - if only we could have the perfect company. One which only has high level managers and none of the "riff raff" that infects them. Oh! Wait! That's Congress. And they have a monopoly.
Re:Dear Every Corporate Tool in the Universe: by Matheus · 2009-01-02 06:36 · Score: 1

Speaking of which "two large drives mirrored"
6 years of journalspace if all that was being stored is text data probably still fit on a couple hundred gigs. Have they seriously never upgraded this machine in the past 6 years? Is this the first time anything has gone wrong? No one *ever* did a backup? There's probably a DVD sitting around on someone's desk that has their entire business on it..
Re:Dear Every Corporate Tool in the Universe: by Anonymous Coward · 2009-01-02 06:38 · Score: 0

slugtastic says: I'm more surprised that you lived for 6 years without back-up.
journalspace.com says: I told you I was hardcore.
Re:Dear Every Corporate Tool in the Universe: by Matheus · 2009-01-02 06:38 · Score: 1

..and BTW I'm not saying that a "couple hundred gigs" would fit on a DVD.. their first 3 years worth of text data probably did tho.
Re:Dear Every Corporate Tool in the Universe: by digitalgiblet · 2009-01-02 06:42 · Score: 2, Funny

Pay the salary of someone smart enough to handle your data correctly if you have no interest in becoming smart yourself.
The first step is admitting you are stupid. That is hard for most people. Of course today they are having NO trouble making that cognitive leap...
Re:Dear Every Corporate Tool in the Universe: by larry+bagina · 2009-01-02 06:50 · Score: 1

slashdot doesn't even need to store comments. They could just randomly generate some "frosty piss!!", some "brown rope", some "kdawson is a retard", and some "dupe!" posts.

--
Do you even lift?
These aren't the 'roids you're looking for.
Re:Dear Every Corporate Tool in the Universe: by Anonymous Coward · 2009-01-02 07:04 · Score: 0

Salary does not = quality nor smarts.....
Re:Dear Every Corporate Tool in the Universe: by Trixter · 2009-01-02 07:04 · Score: 1

A lot people like to brag that their servers have three years or more of uptime. That's not hardcore, that's a server that hasn't been patched for three years!
Truly hardcore servers are those that can be patched weekly with little or no service downtime.
Re:Dear Every Corporate Tool in the Universe: by slushdork · 2009-01-02 07:05 · Score: 5, Funny

Maybe they should have used this backup strategy, although this one looks more like this...
Re:Dear Every Corporate Tool in the Universe: by Troy · 2009-01-02 07:11 · Score: 1

And here I thought "Extreme Hosting" was just a slogan.
Re:Dear Every Corporate Tool in the Universe: by Sorthum · 2009-01-02 07:12 · Score: 1

They do. :-)
Take a look at a packet capture for www.google.com sometime...
Re:Dear Every Corporate Tool in the Universe: by berend+botje · 2009-01-02 07:14 · Score: 1

Yeah, they could have, they should have, but the relevant part is: they didn't.

Firing IT and general management seems nowhere near sufficient in this case.
Re:Dear Every Corporate Tool in the Universe: by sjames · 2009-01-02 07:15 · Score: 3, Insightful

A USB drive is an excellent non-archival backup. Two or more in rotation is even better. That plus a decent RAID for the primary storage will cover most data losses. Even better if the drive goes home with the admin at night.
Re:Dear Every Corporate Tool in the Universe: by mrchaotica · 2009-01-02 07:19 · Score: 1

It depends on the amount of data. IIRC, above a few TB tapes become cheaper.

--
"[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz
Re:Dear Every Corporate Tool in the Universe: by Anonymous Coward · 2009-01-02 07:22 · Score: 0

i'd be surprised if it wasn't raised as an issue and torpedoed by finance coupled with legal stopping IT rolling their own for business security issues.
Re:Dear Every Corporate Tool in the Universe: by Anonymous Coward · 2009-01-02 07:29 · Score: 0

no kidding, we do have a half-assed backup system, but it's quite useless and we have to do it manually, naturally we want to make one, but the people in charge dont see a need, no matter how many doomsday scenarios we paint. We do have user data replicated over various sites in the southern california area though, which would help in the event of a catastrophic failure at our corporate branch.
but there's other data there too..
Re:Dear Every Corporate Tool in the Universe: by Meumeu · 2009-01-02 07:41 · Score: 1

slashdot doesn't even need to store comments. They could just randomly generate some "frosty piss!!", some "brown rope", some "kdawson is a retard", and some "dupe!" posts.
dupe!
Re:Dear Every Corporate Tool in the Universe: by ahodgson · 2009-01-02 07:56 · Score: 1

Most high volume web sites dynamically compress HTML and other text as they serve it; browsers send their compression capabilities in the request and automatically handle it.
Re:Dear Every Corporate Tool in the Universe: by Anonymous Coward · 2009-01-02 08:03 · Score: 0

A proper BOFH would.
Re:Dear Every Corporate Tool in the Universe: by 4D6963 · 2009-01-02 08:09 · Score: 1

What's really stupid is using "stupid" in the place of "not competent" (and "smart" in the place of "competent"). No one has to be stupid. You just can't be competent with anything.

--
You just got troll'd!
Re:Dear Every Corporate Tool in the Universe: by Anonymous Coward · 2009-01-02 08:13 · Score: 0

Being too stupid to recognize your own shortcomings is also a form of stupidity. Or hubris, whichever is more appropriate.
Why are you dragging George W. Bush into a technical discussion?
Re:Dear Every Corporate Tool in the Universe: by troll8901 · 2009-01-02 08:14 · Score: 2, Interesting

Even better if the drive goes home with the admin at night.
Would the admin be tempted to look at other people's data?
Re:Dear Every Corporate Tool in the Universe: by teg · 2009-01-02 08:17 · Score: 2, Interesting

Never underestimate the beancounter's desire to save every cent possible.
That's contrary to my experience. Other expenses have been skimped on occasionally, but just mention the word "backup" and the funding was there.
Re:Dear Every Corporate Tool in the Universe: by Anonymous Coward · 2009-01-02 08:45 · Score: 0

It was the database, not the html templates.
Re:Dear Every Corporate Tool in the Universe: by sjames · 2009-01-02 08:49 · Score: 2, Insightful

If so, he can do so anyway.
Re:Dear Every Corporate Tool in the Universe: by mabhatter654 · 2009-01-02 09:03 · Score: 2, Insightful

Don't send tapes home with Admins... send them to the bank to be put into a safety deposit box with the days checks if you have to. Admins don't want tapes in their home, it's a corporate security risk and the admin WILL forget to bring some back because one or two is no big deal... until they're not at your company anymore. I know I wouldn't do that because I wouldn't want to be the guy who's laptop bag gets ripped off with customer data on tapes inside it. It's just bad mojo waiting to happen.
Treat data media just like the companies cash money.
Re:Dear Every Corporate Tool in the Universe: by LWATCDR · 2009-01-02 09:10 · Score: 1

There where blogs so they where mostly text. Text is really small. 16GB flash drivers are getting cheap now. Put two flash drives on the server and do a nightly on one and weekly on the other.
I think the problem here was that the computer was probably at a COLO facility. No easy way to get to the server everyday for a back up. They might have had to do backups across the internet using a Cable Modem or DSL.
Just not a great way to work.

--
See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
Re:Dear Every Corporate Tool in the Universe: by Karellen · 2009-01-02 09:12 · Score: 1

I prefer Linus's

--
Why doesn't the gene pool have a life guard?
Re:Dear Every Corporate Tool in the Universe: by Anonymous Coward · 2009-01-02 09:35 · Score: 0

Geoff? Is that you? ...sorry, I thought you were my boss.
Re:Dear Every Corporate Tool in the Universe: by rrohbeck · 2009-01-02 10:14 · Score: 1

Especially since this was a blogging site... the data stream must have been astounding, probably on the order of megabytes per day!!!
They could have backed up incrementals on floppies.

--
thegodmovie.com - watch it
Re:Dear Every Corporate Tool in the Universe: by rrohbeck · 2009-01-02 10:18 · Score: 1

Just point out to them some realistic incidence numbers for the typical disasters in your area, together with the stats that 90%+ (can't remember the exact figure) of companies that lose their data go under. Plus the few-percent AFR of disk drives and maybe one for servers of course.

--
thegodmovie.com - watch it
Re:Dear Every Corporate Tool in the Universe: by X0563511 · 2009-01-02 11:18 · Score: 1

HTML compresses really well in case they didn't know.
The problem is, that they wouldn't be backing HTML up. They would be backing up a database.

--
For large sets, this will be our guide even unto death, for the LORD will work for each type of data it is applied to...
Re:Dear Every Corporate Tool in the Universe: by sjames · 2009-01-02 11:18 · Score: 1

Where feasible, yes. For a small site, the admin takes it home solution is valid. If the data is at all sensitive, encrypt the backup (a good idea anyway for data going anywhere offsite).
Re:Dear Every Corporate Tool in the Universe: by mabhatter654 · 2009-01-02 11:27 · Score: 1

if they're big enough to pay my salary, they're big enough to have a small safety deposit box at the bank they deposit customer checks in. Unless this is a VERY small non-profit it's highly unprofessional.
This is in the same vein as guys that think their nephew can set up their network, or their website.
Re:Dear Every Corporate Tool in the Universe: by Pyrion · 2009-01-02 13:25 · Score: 1

If there's enough accountability within a company, then yeah, your beancounter doesn't want to be "the guy" responsible for turning down a backup that could've saved the company $millions.

--
"There is much pleasure to be gained from useless knowledge." - Bertrand Russell.
Re:Dear Every Corporate Tool in the Universe: by LackThereof · 2009-01-02 14:10 · Score: 1

From the blog linked in TFA (which is very concise, seriously, you can read it in under a minute, fucking do it):

It was the guy handling the IT (and, yes, the same guy who I caught stealing from the company, and who did a slash-and-burn on some servers on his way out) who made the choice to rely on RAID as the only backup mechanism for the SQL server. He had set up automated backups for the HTTP server which contains the PHP code, but, inscrutibly, had no backup system in place for the SQL data. The ironic thing here is that one of his hobbies was telling everybody how smart he was.
This doesn't excuse what happened, though: I should have taken a better look at what he'd left behind, and fixed all of the things that needed fixing.

--
Legalize recreational marijuana. Seriously.
Re:Dear Every Corporate Tool in the Universe: by beaviz · 2009-01-03 07:40 · Score: 1

Other expenses have been skimped on occasionally, but just mention the word "backup" and the funding was there.
That is so true! Backup is the system administrators get out of jail card.
PHB: "I need my personal laptop fixed NOW."
The techies started to look worried until some brave guy said something like: "Uhm.. Sorry.. We are investigating a possible problem with the nightly backup."
PHB: "Oh my God! I'll be on my way then, please carry on - and let me know if you need ANYTHING."
Good times back in the day :)
Re:Dear Every Corporate Tool in the Universe: by bruceslog · 2009-01-06 05:18 · Score: 1

Maybe they were counting on the Wayback Machine, or Google archives, to restore their lost blogs.

--
If it has tires or tits, it will give you problems.

rm -rf / by corsec67 · 2009-01-02 05:29 · Score: 5, Informative

rm -rf /

That is one reason why mirroring isn't a backup, and why backups should ideally be off-line.

--
If I have nothing to hide, don't search me

Re:rm -rf / by Piranhaa · 2009-01-02 06:06 · Score: 5, Funny

C:\>rm -rf /
'rm' is not recognized as an internal or external command,
operable program or batch file.
Everything's still running here...
Re:rm -rf / by morgan_greywolf · 2009-01-02 06:13 · Score: 1

You're looking for 'del C:\*.* /s /q /y'

--
My blog
Re:rm -rf / by KDR_11k · 2009-01-02 06:14 · Score: 1

Use
deltree /y C:\*.*

--
Justice is the sheep getting arrested while an impartial judge declares the vote void.
Re:rm -rf / by loonycyborg · 2009-01-02 06:21 · Score: 1

Most likely in this case that was dd if=/dev/zero of=/dev/hda
Re:rm -rf / by Anonymous Coward · 2009-01-02 06:22 · Score: 3, Funny

Ha! I store all my data in directory names!
Re:rm -rf / by Anonymous Coward · 2009-01-02 06:34 · Score: 0

dd if=/dev/urandom of=/dev/hda is a lot more fun. Especially if you then do a cat /dev/hda | strings... my god, my hard drive is full of llamas! er wrong command.
Re:rm -rf / by nicolas.kassis · 2009-01-02 06:42 · Score: 1

osx not linux /dev/hda won't work
Re:rm -rf / by Matheus · 2009-01-02 06:44 · Score: 1

Nah.. the data recovery co would have been able to get through those zeros with one hand tied behind their back.
try this:
while( true ) {
dd if=/dev/random of=/dev/hda
}
after even one iteration it gets hard..
after two expensive..
after three 'what are you kidding me'..
after four, five, six.. we're just having fun now :)
Re:rm -rf / by The+MAZZTer · 2009-01-02 06:45 · Score: 1

Install cygwin and try again.
Re:rm -rf / by Anonymous Coward · 2009-01-02 06:47 · Score: 0

Surely you mean sudo dd if=/dev/zero of=/dev/hda and then type in your password.
There are reasons OSes have permissions, after all, and this is one of them.
Re:rm -rf / by thousandinone · 2009-01-02 06:53 · Score: 1

Are there any production servers remaining anywhere that still recognize the deltree command?

If so, someone has failed big time... Need to support Legacy Applications maybe? If so, your company is one of those standing in the way of breaking out of MSFT for good...
Re:rm -rf / by BagOCrap · 2009-01-02 06:54 · Score: 1

rm -rf /
That is one reason why mirroring isn't a backup, and why backups should ideally be off-line.
That's basically how I found out my mirroring was working on a small Linux server (more for hobby)...
# cd /etc
# rm -rf *
D'oh!
What I meant to type was "rm -rf *~", to remove Vim backups of bunch of files.

--
-- Chaos, panic, pandemonium... My job here is done!
Re:rm -rf / by Anonymous Coward · 2009-01-02 07:01 · Score: 0

C:\>del C:\*.* /s /q /y
Invalid parameter.
C:\>
Re:rm -rf / by Seth+Kriticos · 2009-01-02 07:01 · Score: 1

tool@corporate: rm -rf /
rm: deleting '/': Operation not permitted

On the other hand, if you can not trust your operator(s), you are most probably owned - no matter what system you are on.

Now get of my lawn!
Re:rm -rf / by Poltras · 2009-01-02 07:03 · Score: 2, Funny

$ del C:\*.* /s /q /y
-bash: del: command not found
Guys...

--
Of Code And Men
Re:rm -rf / by jonbryce · 2009-01-02 07:05 · Score: 1

or dd if=/dev/zero of=/dev/disk0
saying as it is OSX, not Linux.
Re:rm -rf / by dfdashh · 2009-01-02 07:08 · Score: 2, Funny

Judging by your OS, not for long

--
df -h /my/head
Re:rm -rf / by Havok3114 · 2009-01-02 07:16 · Score: 0

Deltree hasn't been used in some time. Try: RD C:\ /S /Q
Re:rm -rf / by Anonymous Coward · 2009-01-02 07:30 · Score: 0

C:\>rm -rf /
'rm' is not recognized as an internal or external command,
operable program or batch file.
Everything's still running here...
Really? I'll try that command from Cygwin. Everything still seems to be running fi
Re:rm -rf / by mksql · 2009-01-02 08:01 · Score: 1

IIRC, the equivalent is called "Windows Update"...

--
I should have been a Geologist.
Re:rm -rf / by NIMHrat · 2009-01-02 08:15 · Score: 1

C:\> del /q /s . Not anymore...
Re:rm -rf / by TheRaven64 · 2009-01-02 08:37 · Score: 2, Informative

What happened to deltree?

--
I am TheRaven on Soylent News
Re:rm -rf / by TheRaven64 · 2009-01-02 08:38 · Score: 1

And this is why rm should always be aliased to rm -i for the root user.

--
I am TheRaven on Soylent News
Re:rm -rf / by Anonymous Coward · 2009-01-02 09:00 · Score: 0

It's been gone since XP at least. The supposedly equivalent del /S functionality just plain sucks. Thank FSM for Cygwin.
Re:rm -rf / by mabhatter654 · 2009-01-02 09:09 · Score: 2, Funny

operators make mistakes... the higher up they are paid the more likely they are to do it. Take the manager covering for somebody on vacation... they jump in and mistype something... boom! data gone. Happens to the best of us... the really good ones have ways in place to make sure mistakes have a way of being undone.
how many AS400 operators typed PWRDWNSYS *IMMED and got a surprise!
Re:rm -rf / by mcrbids · 2009-01-02 09:21 · Score: 1

Preacher, meet choir.
Choir, meet preacher. (Queue preaching)

--
I have no problem with your religion until you decide it's reason to deprive others of the truth.
Re:rm -rf / by Anonymous Coward · 2009-01-02 10:15 · Score: 0

With the advent of user-friendly desktop distros, every idiot talks like a Linux expert. You do realize that you can log into a system as root, right? There are sysadmins (myself included) who do not see the point in logging in as an unprivileged user for day-to-day admin tasks. But then again, we're not all retarded.
Re:rm -rf / by Anonymous Coward · 2009-01-02 10:28 · Score: 0

In this case you've got bigger problems than data backup.
Re:rm -rf / by rrohbeck · 2009-01-02 10:31 · Score: 1

rm -rf /
That is one reason why mirroring isn't a backup, and why backups should ideally be off-line.
Even better, something like

dd if=/dev/zero of=/dev/hda
or similar. This wouldn't be the first time a RAID or disk controller, let alone a buggy file system or disk driver, did something like this. All it takes is a flaky DIMM contact. I've seen exactly this happen (except that it wasn't all zeroes but random garbage.) memtest86+ found a memory problem, and after removing and re-inserting a DIMM the system ran fine. Nothing was found wrong with the slot or DIMM contacts.

--
thegodmovie.com - watch it
Re:rm -rf / by rrohbeck · 2009-01-02 10:35 · Score: 1

What happened to deltree?
Replaced with RMDIR /S. Better yet, use /S /Q:

C:\>rmdir /?
Removes (deletes) a directory.
RMDIR [/S] [/Q] [drive:]path
RD [/S] [/Q] [drive:]path /S Removes all directories and files in the specified directory in addition to the directory itself. Used to remove a directory
tree. /Q Quiet mode, do not ask if ok to remove a directory tree with /S

--
thegodmovie.com - watch it
Re:rm -rf / by Anonymous Coward · 2009-01-02 11:39 · Score: 0

Just the wrong time to discover that you did actually install cygwin...
Re:rm -rf / by darkonc · 2009-01-02 18:10 · Score: 1

You're looking for 'del C:\*.* /s /q /y'
Ah, nothing quite like nice, easy, human readable, DOS commands to show why Windows is superior to Un*x.

--
Sometimes boldness is in fashion. Sometimes only the brave will be bold.
Re:rm -rf / by Anonymous Coward · 2009-01-03 01:32 · Score: 0

technically with a log based FS you would be safe from rm... but then:
- there is always dd
- a backup oughta be kept away from the original, not next to it.
Re:rm -rf / by cheekyboy · 2009-01-03 01:33 · Score: 1

its not hard to install unix tools in windows, if any linux person doesnt do that to his windows box hes forced to use, then PUH....
Im supprised the bash author or rm author doesnt check for that specific command and tell the user, "Stimpy you idiot!!!!!!!"

--
Liberty freedom are no1, not dicks in suits.
Re:rm -rf / by Anonymous Coward · 2009-01-04 01:33 · Score: 0

Just install cygwin, then it should work..

Ouch by scubamage · 2009-01-02 05:30 · Score: 3, Informative

We do data hosting, and I can't imagine how catastrophic that would be. Jebus. Let this be an ultimate example of why numerous backups are needed. Always. Without question.

Re:Ouch by conureman · 2009-01-02 05:45 · Score: 4, Insightful

Or even one, stale, backup.

--
The cost of that cleanup, of course, will be borne by taxpayers, not industry.
Re:Ouch by jabithew · 2009-01-02 06:02 · Score: 3, Insightful

This story put the fear of god into me. The first thing I did since reading it is to back up the website I admin (for my dad) locally. I'd always assumed our host would have good backup, but that seems naÃve now.

--
All intents and purposes. Not intensive purposes.
Re:Ouch by slugstone · 2009-01-02 06:32 · Score: 3, Interesting

Working at several hosting places I would say,you are correct. Never trust a hosting service backup. I always told our customers to never trust our backup. Sometimes backups just never happened. They are not high on the list of things to keep working.
Re:Ouch by berend+botje · 2009-01-02 07:21 · Score: 1

Your host might have a very good backup solution. They might even monitor it. And, perhaps, they are capable of restoring your backup within a reasonable timeframe.

But have you tried their response time and accuracy? I'm guessing no.

Your data, you do the backup. It isn't rocket surgery.
Re:Ouch by MyHair · 2009-01-02 13:06 · Score: 1

Definitely. Never trust someone else with your data. I lost a VPS whose host was presumably RAID'ed and backed up. Never got my VPS or data back from them, but I only lost a few hours of data since I backed it up myself daily. After 5 days of dumb excuses I opened an account with another provider, reconfigured my name servers and got everything going again.
I didn't have to wait 5 days; it's just that my sites weren't revenue generating.
Re:Ouch by noidentity · 2009-01-02 15:00 · Score: 1

Nevermind the host's data being corrupt, how about them being shut down for an extended period/permanently? Always assume volatility to anything you store outside your machine.

Excellent! by GravityStar · 2009-01-02 05:30 · Score: 5, Funny

Excellent! We can use their demise as yet another cautionary tale.

Re:Excellent! by El+Torico · 2009-01-02 05:48 · Score: 5, Funny

Excellent! We can use their demise as yet another cautionary tale.
Ironically, it's more useful than the entire collection of blogs that they stored.

--
In the land of the blind, the one-eyed man is usually crucified.
Re:Excellent! by Volante3192 · 2009-01-02 05:54 · Score: 1

With all these cautionary tales, you'd think someone out there would catch on...
Re:Excellent! by Rearden82 · 2009-01-02 06:00 · Score: 1

Exactly! These kinds of examples are ideal for demonstrating to PHB types that data loss isn't just some kind of theoretical once-in-a-billion-years kind of thing, and that backups actually serve a purpose.

I also find it amusing that they could apparently afford to blow money on nice shiny Mac servers, yet a tape drive (hell, even a $99 external hard drive or two from Best Buy) was out of the question.
Re:Excellent! by PriceIke · 2009-01-02 06:18 · Score: 0, Offtopic

This is not a troll, it's actually quite funny. As long as you're not one of the people who lost their blog on Journalspace. Mods, please correct, kthx.

--
It's not a lie. It's the truth with lossy compression.
Re:Excellent! by TwilightXaos · 2009-01-02 06:38 · Score: 2, Insightful

Yes.
But you wouldn't think everyone would catch on...
Re:Excellent! by dzelenka · 2009-01-02 07:09 · Score: 1

Excellent! We can use their demise as yet another cautionary tale.
"many thousands of bloggers' entire output has evaporated"
I'm still trying to figure out what of value was lost, and if a backup was ever actually needed.

--
Bah!
Re:Excellent! by SmurfButcher+Bob · 2009-01-02 08:38 · Score: 1

Someone should start a blog about failures like that!
Oh....

--

help me i've cloned myself and can't remember which one I am
Re:Excellent! by Pfhorrest · 2009-01-02 13:07 · Score: 1

Excellent! We can use their demise as yet another cautionary tale.
Ironically, it's more useful than the entire collection of blogs that they stored.
It could be that the purpose of your life is only to serve as a warning to others.

--
-Forrest Cameranesi, Geek of all Trades
"I am Sam. Sam I am. I do not like trolls, flames, or spam."

Mirroring is not intended as a data backup by zaibazu · 2009-01-02 05:31 · Score: 3, Informative

It is an inexpensive protection against a total harddisc failure, but effective at this part. A software going rogue or a user deleting the wrong files can't be helped by it.

Re:Mirroring is not intended as a data backup by Anpheus · 2009-01-02 06:45 · Score: 1

Partition the mirror into two halves, and have your OS on the first partition do a backup to the second once every 24 hours. Or do three partitions, and do the same.
You can make mirroring a haphazard disaster recovery solution, it just requires contributing sizable portions of your hard disk space to it.
Of course, the above won't help you if a meteorite takes out your server and the surrounding building.
Re:Mirroring is not intended as a data backup by Anonymous Coward · 2009-01-02 07:25 · Score: 0

>It is an inexpensive protection against a total harddisc failure, but effective at this part. A software going rogue or a user deleting the wrong files can't be helped by it.
I know you didn't mean it like that, but just in case someone reads that wrong: it's NOT a protection against a total harddisc failure in the usual sense. The total harddisc failure will happen.
What it does is: if you are lucky, only one disc fails at first and you can go get another one (and maybe, if you are really lucky, even plug it in time and have the rebuild finish in time) before the other(s) fail, which they will.
That is, it buys you time.
Re:Mirroring is not intended as a data backup by TheRaven64 · 2009-01-02 08:41 · Score: 1

Or if a power fluctuation fries both drives, or if an error in the OS corrupts the drives at the block layer, or if someone steals the server...

--
I am TheRaven on Soylent News
Re:Mirroring is not intended as a data backup by Hyppy · 2009-01-02 09:08 · Score: 1

It also won't help much if a controller failure mangles all the disks writes.
Re:Mirroring is not intended as a data backup by Anpheus · 2009-01-02 10:23 · Score: 1

For 24 hours (or however long since the last backup) without anyone noticing? That would be pretty impressive if the RAID controller was reading and writing bad data for 24 hours without the OS or any of your applications noticing.
Re:Mirroring is not intended as a data backup by Anpheus · 2009-01-02 10:24 · Score: 1

What do you mean "block layer."
Partition 1 start Partition 2 start.
See how they don't overlap? OS messes up partition 1, all the blocks, all the sectors are good.
Re:Mirroring is not intended as a data backup by TheRaven64 · 2009-01-02 11:23 · Score: 1

That's not really how it works. The partition table is an artificial construct that the OS knows about but the hardware doesn't. A bug in the block layer or the driver causing it to write the wrong value to the DMA instruction telling the hard drive where to write the block. It might be in the partition or in any other part of the disk. Now, ideally, the bottom level of the block layer will do some checking to ensure that requests are in the right partition, but a bug in the device driver or in the bottom of the block layer (or a failure of the validation there to spot the error) can write blocks to a random location.

--
I am TheRaven on Soylent News

That's what backups are for by MBCook · 2009-01-02 05:31 · Score: 5, Interesting

It's really unfortunate that this happened. If they had simply had a backup snapshot of the DB they could have restored it. RAID only saves you from disk failures. It doesn't work on OS/user failures.

Unfortunately this is the kind of thing you tend to learn from experience (either yours or someone else). It's very easy to think "RAID 1 = disks are safe".

Just like a database cluster wouldn't have saved them. A clustering database can save you from load, or you can swap servers if a disk goes bad. But when someone issues "DELETE * FROM..." the other cluster nodes start to happily run the same thing and now you have 2 (or 3 or 10 or...) empty database boxes.

I hope those bloggers had a backup of some sort of their own.

--
Comment forecast: Bits of genius surrounded by a sea of mediocrity.

Re:That's what backups are for by LWATCDR · 2009-01-02 05:46 · Score: 1

No it really isn't unfortunate this happened. It was going to happen sooner or later. The bad thing is that it didn't happen when there was less data to loose.
What where they thinking?

--
See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
Re:That's what backups are for by Boron55 · 2009-01-02 05:50 · Score: 1

The right query would be "DELETE FROM..." (no star)

You just failed the database module exam. ;)

Seriously, I do this mistake too, from time to time.
Re:That's what backups are for by ergo98 · 2009-01-02 05:51 · Score: 2, Interesting

Unfortunately this is the kind of thing you tend to learn from experience
Even a moment of thought would have made it abundantly clear that this was not a backup situation, and it certainly should not require loss to pound it into someone's head.
They were clearly way over their heads if they thought this protected them from anything other than a single drive failure. More likely they were entirely aware of the risk, but decided to wing it anyways.
Re:That's what backups are for by MBCook · 2009-01-02 05:56 · Score: 5, Insightful

My guess (and this is a guess, I'd never heard of the site before yesterday) is that this is some guy who started his own little site and it got bigger and bigger. Basically he never designed the backup, the system was just slowly pieced bigger and bigger until it got to it's current state.
The comments in the messages from the site's operator about the cost of the drive recover and thinking both drives just died at once indicate to me that this site was basically a hobby for him and he isn't experienced as an admin.

--
Comment forecast: Bits of genius surrounded by a sea of mediocrity.
Re:That's what backups are for by SoloHan · 2009-01-02 05:59 · Score: 1

Argh, I just logged in to say the right one was DELETE FROM.... But as long as I am already logged on, I have to say that I once saw an SQL-like DBMS accepting the star.
Re:That's what backups are for by sdpuppy · 2009-01-02 06:00 · Score: 1

The bad thing is that it didn't happen when there was less data to loose.
Absolutely - they really needed to tighten up their data much earlier!
But grammatical pet peeves aside, you're absolutely right:

It was going to happen sooner or later.
I mean really - even if you don't know any better and you just spend a minute thinking about it, it so obvious that you can run into a situation like this with mirroring - even with just the thought: "what is an important file is deleted? " - never mind the whole shebang.
Oops - reminds me - better back up my har@^#&* .
Re:That's what backups are for by sdpuppy · 2009-01-02 06:06 · Score: 1

some guy who started his own little site and it got bigger and bigger. Basically he never designed the backup, the system was just slowly pieced bigger and bigger until it got to it's current state
Would someone with mod points give this fellow a +5 ? I would say that your post describes is a very likely scenario.
Re:That's what backups are for by sakshale · 2009-01-02 06:40 · Score: 1

My sister is a heavy user of Journal Space and based on reading between the lines from my conversations with her, I suspect you are 100% accurate. There will be a lot of people in that community who will be emotionally hurt by this as they had built a tight community of friends.

--
For every problem there is a solution that is simple, obvious and wrong.
Re:That's what backups are for by mzito · 2009-01-02 06:41 · Score: 3, Interesting

Ah, it totally depends on the type of database cluster. For example, with Oracle, if you're using Oracle DataGuard, even in synchronous replication mode you can define an "apply delay" - basically, "Don't acknowledge this commit until it is written locally, and copied and acknowledged on the remote side, but don't actually apply the transaction for two hours"
That way, if someone does a delete * from blogs;, it will be reflected immediately on the production, but you've got a nice window to sort it out.
Plus, if you've got database flashback turned on, you can simply say, "Flash my database back to what it looked like before someone was an idiot", and all your data comes back.
These features are expensive in Oracle, but they can be very useful when you actually need them.

--
me@mzi.to
Re:That's what backups are for by larry+bagina · 2009-01-02 06:57 · Score: 1

DELETE ... has to file a journal record for every row deleted (assuming you're using a DBMS with journalling). TRUNCATE generally doesn't do journaling and is faster.

--
Do you even lift?
These aren't the 'roids you're looking for.
Re:That's what backups are for by Jeremi · 2009-01-02 07:04 · Score: 1

RAID only saves you from disk failures. It doesn't work on OS/user failures.
Hell, it doesn't even save you from disk failures, if the disk failure is one of the side effects of the computer room burning down...

--

I don't care if it's 90,000 hectares. That lake was not my doing.
Re:That's what backups are for by Aristos+Mazer · 2009-01-02 10:37 · Score: 1

Yes, it would. A tight community of friends that no longer has any contact info for each other would certainly be disrupted. Just because the people do not know anything other than screen handles doesn't mean they haven't been talking for years, sharing stories, helping each other with problems, etc.
Re:That's what backups are for by fnj · 2009-01-02 18:57 · Score: 1

It's very easy to think "RAID 1 = disks are safe".
They are safer. Just not safe enough. And the data is not comprehensively safe.
Re:That's what backups are for by too2late · 2009-01-02 23:00 · Score: 1

when someone issues "DELETE * FROM..."
They will get a syntax error :-P

--
My rights don't end where your feelings begin.
Re:That's what backups are for by Johnno74 · 2009-01-03 16:26 · Score: 1

In SQL server if you are using full recovery model then you can recover up to the second before the bad command was run.
Backup the transaction log (which will include all transactions beween the prior log backup, and now - including the transaction you need to "undo") then restore your last full backup, then restore the transaction logs up until the one you did AFTER the bad transaction, then restore that last one specifying a stop time of just before the bad transaction was committed.
Point-in-time recovery is why you use full recovery model and not simple. Sadly I've worked on plenty of "mission critical" sql dbs that don't use full recovery. When this is brought to their attention they typically say "we don't need point in time recovery, we use RAID and do nightly backups".
Thats all fine, as long as you are prepared to lose everything between backups.
This can be done in every version of SQL server, even the free version. I'd be stunned if you couldn't do the same thing in Oracle.

Filed away under "Shoulda known better". by Anonymous Coward · 2009-01-02 05:32 · Score: 0

Sad, very sad.

Noobs. No, really. by urbanriot · 2009-01-02 05:32 · Score: 1

Even the greenest IT employee knows that mirroring is to protect against hard drive failure and not software corruption. Obviously someone felt they knew better than people that actually know better, or someone didn't consult the right people. This is the end result. Tape, USB keys, disc backup... there's so many debatable methods of backing up that there's no excuse for this one.

Re:Noobs. No, really. by emag · 2009-01-02 05:51 · Score: 4, Informative

Even the greenest IT employee knows that mirroring is to protect against hard drive failure and not software corruption.
I only wish that were true. I've given up arguing with friends about this, who insist that their mirrors are good enough backups. I just stare at colleagues who think such, especially those who SHOULD know better. And I *know* coworkers are doing this @ work, too, and I'm just waiting for about 50TB of data to suddenly go missing...

--
"The urge to save humanity is almost always a false front for the urge to rule." --H.L. Mencken
Re:Noobs. No, really. by etnoy · 2009-01-02 06:06 · Score: 1

"There are two types of people in the world. Those who take backups, and those who have never had a disk crash/fs corruption/IDE driver bug/whatever"

--
Quantum hacker.
Re:Noobs. No, really. by Anonymous Coward · 2009-01-02 08:22 · Score: 0

I think it's impossible to use computers without having first-hand experience in accidentally overwriting important files. Making it up to any sort of "IT employee" without that experience is mind-boggling.
This also leads to the reason for having at least two backups: because you then can format the wrong drive when trying your first restore form backup.
Re:Noobs. No, really. by Anonymous Coward · 2009-01-02 11:02 · Score: 0

i've lost count of the number of times someone excitedly sets up a box with mirroring, runs out of space, and breaks the mirror, realizing it's irrelevant to the operation of the system, once they need the drive space.
Re:Noobs. No, really. by WarlockD · 2009-01-02 12:42 · Score: 1

I hear you. I can't tell you how many sites I have been at where the customer is made at us because they lost two drives in a RAID5. I had one site that lost THREE (One hot-spare), but it clearly showed drives 1-2 died close to a year ago....

While I will blame user's I personaly hate those crappy motherboard raid's. There is no way for you to know when a drive goes bad unless you reboot or you install their utiltiy that may or may not notify the user.

El Oh El by greymond · 2009-01-02 05:33 · Score: 3, Insightful

That's all I can say at this. I'm really surprised that with all the users they had, they are so quick to say "everything is gone and we're giving up" instead of just starting over and maybe implementing protocol that would make sure this doesn't happen again.

--
Ave Molech Setting

Re:El Oh El by kurtmckee · 2009-01-02 05:45 · Score: 5, Insightful

I'm really surprised that with all the users they had, they are so quick to say "everything is gone and we're giving up"
Considering how complete and unrecoverable the loss is, they have no idea who their users are. The accounts would have to be recreated from scratch, but who would try? Their users have no reason to ever trust them again. Journalspace would have a difficult time wooing back their original users, and no new user would seriously consider using them.
Bowing out is the only recourse, but I'm glad they're considering releasing their source code.
Re:El Oh El by Kjella · 2009-01-02 05:48 · Score: 1

That's all I can say at this. I'm really surprised that with all the users they had, they are so quick to say "everything is gone and we're giving up" instead of just starting over and maybe implementing protocol that would make sure this doesn't happen again.
With such a backup policy, is it certain they even have the system? Clearly they don't have a development server with a recent database copy. Sounds to me like they're totally screwed and by the time they got something new together all their users would be gone and all of them would loudly warn anyone else from coming back. I think it shows a good degree of self-insight to simply say tha this screwup is beyond fixing. Even reforming under a new name would be better than the stigma of this disaster.

--
Live today, because you never know what tomorrow brings
Re:El Oh El by spuke4000 · 2009-01-02 06:06 · Score: 3, Funny

Indeed. Everyone knows that when you drive your company into the ground through incompetence you don't give up! You go to Washington to get your bail out. That's the American way.

--
This post cannot be rebroadcast without the express written constent of Major League Baseball.
Re:El Oh El by Frosty+Piss · 2009-01-02 06:44 · Score: 1

I'm really surprised that with all the users they had, they are so quick to say "everything is gone and we're giving up"

According to the message at the web site, they have sent the drives to some recovery firm to work on...

--
If you want news from today, you have to come back tomorrow.
Re:El Oh El by Chris+Pimlott · 2009-01-02 06:57 · Score: 2, Interesting

Their post said that only the task-specific server for data was hosed. If Journalspace offered paid services, then their billing system should still have all their customer's details.
Re:El Oh El by ralphdaugherty · 2009-01-02 11:34 · Score: 1

Google probably has all their data. Maybe they can diversify into "here's what we have, don't let it happen again" recovery services.
rd
Re:El Oh El by Anonymous Coward · 2009-01-02 21:54 · Score: 0

but I'm glad they're considering releasing their source code.
"Oh, it was on the DB... but if you can recover it, you can have it."
Re:El Oh El by Anonymous Coward · 2009-01-03 07:09 · Score: 0

Maybe they wanted this to happen so they could turn to new things?

Bizarre by Courageous · 2009-01-02 05:33 · Score: 1

It's bizarre that anyone would ever, under any circumstances, consider a "mirror" to be a backup. Mirrors automatically replicate errors, including the human variety.

Point in time snapshots might be a sort of lazy man's backup, but even then, consider the possibility of fire or disaster, and not having some sort of second location is just plain foolhardy.

C//

Mirroring is *NEVER* a "backup" by sisukapalli1 · 2009-01-02 05:33 · Score: 1

Mirroring is more of a safeguard for hardware failure. It does not replace backup. Both serve entirely different purposes.

If they had at least some online rsync'ed backups from at least a day/week/month/etc., that would be a backup.

The site *didn't* have any backups, and they pay the price.

S

Re:Mirroring is *NEVER* a "backup" by Anonymous Coward · 2009-01-02 09:07 · Score: 0

Well, if you do your mirroring with a version control system (eg, Git or Mercurial), it serves both purposes quite nicely. Of course, this is an inefficient solution if you have big binary blob databases.

Thank you by ari_j · 2009-01-02 05:36 · Score: 2, Funny

This is fascinating and altogether newsworthy. I had never before thought of this. I am very pleased, indeed, that kdawson engaged his most finely-honed editorial faculties to post this article to the front page, as it is not only stunning and fascinating in substance but also rather eloquently written.

It's so easy even a caveman can do it. by GPLDAN · 2009-01-02 05:37 · Score: 1

With the proliferation of snapshot technology (for instance Parallels on the Mac contains the ability to snapshot the VM and restore any snapshot) plus the ability to back the VM up to a USB2 disk that you can just plugin and then pull and put in a safe place - having point in time backups of servers has become so trivial and easy that it is unbelieveable that anybody would run a buisiness without it.

A leading SAN/NAS vendor (no commercials here) has a solution for SMB that includes a disk shelf and controller integrated together with 4TB of storage for about $14k. Maybe closer to $20k if you license the Linux and Windows backup agents to the SAN. Really - ANYBODY can do this.

If you're really THAT hard up for operating cash, get a 2U server with an integrated tape drive at the very very very least.

Re:It's so easy even a caveman can do it. by Anonymous Coward · 2009-01-02 07:26 · Score: 0

Really - ANYBODY can do this.
Especially since, in real life, backups cost an order of magnitude less than the figures you provided.

Inconceivable? by LSD-OBS · 2009-01-02 05:38 · Score: 3, Funny

I do not think it means what you think it means.

--
Today's weirdness is tomorrow's reason why. -- Hunter S. Thompson

Re:Inconceivable? by Farmer+Tim · 2009-01-02 06:27 · Score: 1

Indeed, this would appear to be one huge conceive-up.

--
Blank until /. makes another boneheaded UI decision.
Re:Inconceivable? by Anonymous Coward · 2009-01-02 14:29 · Score: 0

Werd, nice usage! :) LOL

How hard is it to remember: by computersareevil · 2009-01-02 05:39 · Score: 4, Insightful

Mirroring: High availability
Backups: High reliability

The rules of backups by Anonymous Coward · 2009-01-02 05:40 · Score: 5, Informative

The rules of backups:

1. Backup all your data
2. Backup frequently
3. Take some backups off-site
4. Keep some old backups
5. Test your backups
6. Secure your backups
7. Perform integrity checking

Re:The rules of backups by Anonymous Coward · 2009-01-02 05:47 · Score: 0

8. ???
9. Profit !
Re:The rules of backups by Anonymous Coward · 2009-01-02 05:48 · Score: 2, Funny

1. Do not talk about backups
2. DO NOT TALK ABOUT BACKUPS
Re:The rules of backups by Anonymous Coward · 2009-01-02 06:20 · Score: 0

1. Mirrors are not backups
2. No admin is to mistreat the database in any way... if there's anybody watching
3. Mirrors are not backups
4. I don't want to catch anyone not drinking at the console after lights out
5. Mirrors are not backups
6. Mirrors are NOT... part of rule 6
7. Mirrors are not backups
Re:The rules of backups by thousandinone · 2009-01-02 07:03 · Score: 1

Hey! You backed up Rule 1 three times! Your post is secure!
Re:The rules of backups by Anonymous Coward · 2009-01-02 07:16 · Score: 3, Insightful

1. Backup all your data
2. Test your backups
3. Backup frequently
4. Test your backups
5. Take some backups off-site
6. Test your backups
7. Keep some old backups
8. Test your backups
9. Secure your backups
10. Test your backups
11. Perform integrity checking
10. Test your backups
Every company I've worked at has had a backup plan. Exactly zero have had a recovery plan.
Re:The rules of backups by Thundersnatch · 2009-01-02 10:07 · Score: 1

You forgot:
8. Securely destroy old backups when they are not needed.
9. Run your backup and retention policies past your lawyers to be sure you're not saving too much or too little data.
Re:The rules of backups by MyHair · 2009-01-02 13:14 · Score: 1

I actually worked for a company that did have a recovery plan and tested it several times a year. I was impressed. They maintained multiple DR sites and would send a small team or sometimes an entire department to work a real shift at the DR site. The backups were well organized, had daily offsite transport and were tested. Awesome. Then they laid a bunch of us off. Oh well.

To many shops think HA==DR by uncledrax · 2009-01-02 05:41 · Score: 4, Informative

It's more an issue that some people think that HA == DR.. which obviously this story reminds us that it is not the same thing.

Mirroring / RAID == HA.. if one of your HDDs let the smoke out, you still don't incur downtime. If you have a hot-spare, you're even better.. all it does it let you have alittle time to correct the
issue (ie: "It can wait until morning").

Also, one other very important thing.. mirroring doesn't prevent/restore data corruption. If you're mirroring your rm -rf (as pointed out by Corsec67 below), your RAID will happy do what it does.. and span your command to all your disks.... Congrats, you just successfully gave yourself HA to your disk erasing! :]

Backups are DR.. If your RAID croaks.. your SOL if you don't off-machine backups. If you accidently nuke your disks with an rm or something, you can still go back and restore data.. sure you'll likely loose -some- data, but -some- is better then all in this case.

--
----- The internet has given everyone the ability to have their voice heard equally as loud.. even if they shouldn't be

Re:To many shops think HA==DR by Anonymous Coward · 2009-01-02 06:09 · Score: 1

Not to be a pain, but I can figure out that DR is 'Data Recovery'. Is HA 'Hardware Availability'? It makes sense that way, but it took me a moment to figure it out (not easy to google those two things, no?)
Re:To many shops think HA==DR by xyphor · 2009-01-02 06:14 · Score: 5, Informative

DR is Disaster Recovery
HA is High Availability
Re:To many shops think HA==DR by Anonymous Coward · 2009-01-02 06:24 · Score: 0

That would be ... HalfAssed == DeadReckoning?????
Re:To many shops think HA==DR by Anonymous Coward · 2009-01-02 06:25 · Score: 0

No-one knows what HA or DR means.
Re:To many shops think HA==DR by Anonymous Coward · 2009-01-02 06:26 · Score: 0

+1 Informative. Thanks. Was wondering that myself.
Re:To many shops think HA==DR by cbiltcliffe · 2009-01-02 06:54 · Score: 5, Funny

I tried Googling, but the only results I got were a medical office in Chinatown.....

--
"City hall" in German is "Rathaus" Kinda explains a few things......
Re:To many shops think HA==DR by Anonymous Coward · 2009-01-02 07:15 · Score: 0

HA or DR... you are SOL if you look like SA.
Re:To many shops think HA==DR by klasikahl · 2009-01-02 07:16 · Score: 0, Troll

I hope you don't work anywhere near a datacenter.
Re:To many shops think HA==DR by The+Blue+Meanie · 2009-01-02 07:22 · Score: 2, Informative

People who care about their data and their business know what they mean.
Although, at my particular shop, we use the term "BC" instead of "HA".
BC = Business Continuance (HA = High Availability)
DR = Disaster Recovery
BC = "Looks like we just lost a drive in the array. Better replace that right away." or "Oops, broke one of the multiple fibers to the SAN. Where's the spare again?"
BC also applies to our load-balanced clusters of web servers and application servers that allow for the offlining or loss of entire machines without losing functionality. You need more than your data existing on media to Continue Business - you and your customers need to be able to GET to it somehow.
DR = Your building just burned to the ground, taking every single piece of furniture, equipment, paper, and magnetic media inside along with it. Now what?
Please note that the coolest, slickest, snapshotted NAS with terabytes and terabytes of awesome cheap SATA storage in it is worth exactly JACK in this scenario if it's in the same building as the source material. Offsite backups are not optional, and offsite storage of hard drives isn't exactly the easiest thing to do.

--
"I feel that if a person can't communicate, the very least he can do is to shut up." -- Tom Lehrer
Re:To many shops think HA==DR by dfghjk · 2009-01-02 07:41 · Score: 1

Backups don't have to be off-machine. They just have to be asynchronous. Off-machine provides additional protections.
Re:To many shops think HA==DR by GravityStar · 2009-01-02 08:31 · Score: 1

Remember, one of the RAIDed disks could fail while rebuilding the RAID array. A failure in one of the disks could set of a domino chain of events that will cause you to replace the entire array over the course of several hours.
Re:To many shops think HA==DR by Anonymous Coward · 2009-01-02 10:07 · Score: 0

> I tried Googling, but the only results I got were a medical office in Chinatown.....
Hey! A name like Dr. Ha is no laughing matter...
Re:To many shops think HA==DR by klasikahl · 2009-01-02 10:24 · Score: 1

Okay, so maybe that was a bit troll-ish. But, if you work on production systems and you /don't/ know what HA or DR are, you should probably go find a new job and spare your bosses the inevitable losses that you'll cause. I would suggest front-line phone support.
Re:To many shops think HA==DR by michaewlewis · 2009-01-02 11:22 · Score: 0

those are the only results you got? I got around 67,000,000 results........
Re:To many shops think HA==DR by Anonymous Coward · 2009-01-02 12:02 · Score: 0

HA can be DR, provided it is done with both goals in mind. For web apps, HA and DR can usually be accomplished fairly easily.
OTOH, any shop that hasn't figured out backups, mirroring and load balancing isn't likely to get DR too.

Only 2 drives? by lalena · 2009-01-02 05:42 · Score: 4, Insightful

Maybe I could understand that there might be issues with backing up live databases, and they didn't want to deal with it. Still not an excuse.
BUT, according to the site "the server which held the journalspace data had two large drives in a RAID configuration". Only TWO drives.
All they had to do was pull one of the drives, replace it, and lock up the original off site. In a couple of hours the drives would have been mirrored again.

Re:Only 2 drives? by Anonymous Coward · 2009-01-02 06:48 · Score: 0

Unless you shut down the db, then the filesystem and data on the drive you pull is not consistent. Also, during the rebuild time, you're running the risk of losing your live drive with no backup (higher than normal risk, since it's going to be under increased load in order to remirror.
Pulling a drive from a live mirror is not a proper backup strategy any more than pulling the power cable is a proper shutdown (they both leave your disk in the same state).
Re:Only 2 drives? by Kineticabstract · 2009-01-02 06:55 · Score: 1

That's not really a backup strategy, unless you have many, many hard drives. It's not just a matter of backing up today's data, but also of being able to access yesterday's data if something got accidentally deleted today. You also need to be able to access last week's data, in case something got trashed the day before yesterday and no one noticed until today. Swapping in new hard drives every day just isn't going to cut it.
Tape drives are cheap - tape cassettes are cheaper yet. Software to automate your backup scheme is a little more expensive, but it's a one-time expense that you can use for a very long time. Over time, a tape backup system is a lot cheaper than swapping in new drives every so often, and as a bonus, it's also more reliable.
Re:Only 2 drives? by lalena · 2009-01-02 07:11 · Score: 1

Wasn't saying it was a good idea.
But if they were avoiding doing backups because the only thing on the drive was one big SQL DB, and they didn't have a way of backing it up without taking it offline, then swapping a drive is better than nothing.

I haven't done anything with incremental backups for a drive with a single SQL database, so I have no idea how that works.
But let's say they don't care about incremental backups or old backups. They only need to spend about $5K for a handful of matching 15K RPM SAS drives.

Every day or so, swap out one of the drives and replace it with the oldest backup drive. You then have 5-10 days of full daily backups.

Again, I'm not saying it is a good idea, but someone with zero IT knowledge could do both backup and recovery tasks with no training and no investigation.
No need to figure out how to backup a live Database without locking the whole thing up for hours, which is probably why they didn't do tape backups in the first place.
Re:Only 2 drives? by WuphonsReach · 2009-01-02 07:58 · Score: 1

All they had to do was pull one of the drives, replace it, and lock up the original off site. In a couple of hours the drives would have been mirrored again.

I can't recommend that method unless you're running triple-active RAID1 (which is simply mirroring across 3 disks instead of just 2 disks). With a triple-mirror, when you pull that 3rd drive to take it offsite, you're at least still running on a 2-drive mirror.

And it's still best if you shut down everything (especially the databases) before you pull that drive and pop in a new one.

Alternately, temporarily hook up a 3rd drive (USB/Firewire), extend the raid array to it, then disconnect it before trimming the array back again.

--
Wolde you bothe eate your cake, and have your cake?
Re:Only 2 drives? by MoogMan · 2009-01-02 08:37 · Score: 1

maybe, however rebuilding a raid live takes a performance hit on disk I/O, so this may not be desirable.
Re:Only 2 drives? by Anonymous Coward · 2009-01-02 09:33 · Score: 0

You are nearly as bad as the Journalspace admin.
The point of RAID is to improve AVAILABILITY. If you've pulled out one of those disks, even for a few hours, then you've destroyed the purpose of RAID. If the first disk dies when it's syncing to the second, you're screwed.
You might claim that it's unlikely that the disk will fail at the same time as you're doing a "backup", but if we could rely on such things we'd not backup at all.
Also, hotswap SCSI connectors are not designed to be pulled out on a regular basis, so doing so is just asking for trouble.
Re:Only 2 drives? by Fulcrum+of+Evil · 2009-01-02 10:07 · Score: 1

I'd like to point out that using RAID rebuild as a backup strategy is a massive abuse of the system and will likely be far less reliable than something more aligned with your use case. Hell, even tarballs will beat this.

--
"We returned the General to El Salvador, or maybe Guatemala, it's difficult to tell from 10,000 feet"
Re:Only 2 drives? by Darkk · 2009-01-02 13:05 · Score: 1

I wouldn't advise anybody messing around with the active RAID drives...PERIOD.
A simple database dump to an external hard drive for off-site backup would be far more ideal and cheap solution to do.
My first choice for any business always been tapes but sometimes they can't spend the money on it so I usually tell them then we need multiple portable external hard drives for off-site until the company can afford a tape backup solution.
Re:Only 2 drives? by kasperd · 2009-01-03 00:15 · Score: 1

Unless you shut down the db, then the filesystem and data on the drive you pull is not consistent.
If the system is working correctly the drive you pull out will be consistent. If you lose power at any point in time your database is supposed to guarantee that the data on disk will be in a consistent state once the system is back. That may involve some kind of roll back or a journal replay. But in the end you will have a consistent data base.

As others have already pointed out, if you do this, you have to have three mirrors for safety. And the additional load to resync when you put in another drive may be a problem. But if a single disk can handle normal workloads while the second disk is being replicated to the third disk, you might have a chance of doing that trick without hurting performance too much. Of course writes to the database have to be done to all three disks during the rebuild, so the normal workload will slow down the rebuild significantly. If you are using really large disks, don't be surprised if it takes a day to finish the rebuild.

Someone said that hot swap connectors are not designed for swapping on a daily basis. That might be true. But then you can do the trick logically rather than physically. Run RAID-1 across three disks. Tell the RAID to pull one disk out. On the same machine you can then access that single disk to make a backup to an off-site location. Once you are done with your backup put the disk back in the RAID. Of course before starting the procedure you have to verify that the RAID is actually running on all three disks. If the RAID is already short, that has to be resolved before starting the backup.

I'm not saying this is a good way to backup a database. But it certainly is doable without introducing unacceptable risks in the process. But you really have to know what you are doing.

--

Do you care about the security of your wireless mouse?

To the HR department by squeegee_boy · 2009-01-02 05:42 · Score: 5, Funny

Important note: don't hire the IT dude with Journalspace.com on his resume.

Re:To the HR department by DaveV1.0 · 2009-01-02 05:45 · Score: 3, Insightful

The only problem with that idea is that it may not have been the IT guy's decision to save money by not having a true backup system. I have seen companies skimp on backup systems because they thought their RAID system was enough.

--
There is no "-1 offended" or "-1 you don't agree with me" mod options for a reason.
Re:To the HR department by ergo98 · 2009-01-02 05:54 · Score: 2, Insightful

A better backup solution needn't cost much, or even anything. Simply FTPing to your own home machine on occasion would have been a millionfold improvement (given the popularity metrics, I don't think this was like a staffed operation or anything. Just a guy or two)
Re:To the HR department by DaveV1.0 · 2009-01-02 06:13 · Score: 1

Not necessarily. Remember the over head of a datadump or the like. I know what you are saying, though.

--
There is no "-1 offended" or "-1 you don't agree with me" mod options for a reason.
Re:To the HR department by Bender0x7D1 · 2009-01-02 06:26 · Score: 1

Wrong. Wrong. Wrong.
This huge mistake is some of the best, (and most expensive), training a person can get. Do you think they will EVER get caught without proper backups/recovery plans in place? I doubt it.

--
Reading code is like reading the dictionary - you have to read half of it before you can go back and understand it.
Re:To the HR department by Translation+Error · 2009-01-02 06:46 · Score: 1

Not a problem. His resume was lost with the rest of the data.

--
When someone says, "Any fool can see ..." they're usually exactly right.
Re:To the HR department by squeegee_boy · 2009-01-02 06:50 · Score: 2, Funny

OK, you hire him first. I think he's available.
Re:To the HR department by Anonymous Coward · 2009-01-02 06:52 · Score: 0

Journalspace.com 2003-2008 Systems Admin
* Mid sized Blog site
* Real world experience with Data loss Scenarios
Re:To the HR department by squeegee_boy · 2009-01-02 07:05 · Score: 0, Redundant

I lol'd :)
Re:To the HR department by Anonymous Coward · 2009-01-02 07:24 · Score: 0

Sorry, not an excuse. I was once at a company that relied on mirrors for source code control backup. I used CD-Rs for periodic backups. Relying on the "no budget" excuse is just incompetence.
Re:To the HR department by Anonymous Coward · 2009-01-02 08:43 · Score: 0

If this is the case, anyone competent enough to get another job should do so quickly, because you know who everyone is going to point fingers at when the server fails. It's almost as stupid to stay at such a place as it is to make the decision that mirroring is good enough.
Re:To the HR department by Xugumad · 2009-01-02 09:29 · Score: 1

Copying work data onto a personal machine is all sorts of dubious. I certainly would not admit to the fact that I regularly take an encrypted dump of our database and SVN repository off-site, for example.
I mean, if I did such a thing. Which clearly I don't.
Re:To the HR department by Darkk · 2009-01-02 13:07 · Score: 1

A good IT guy convinces the bosses that the company NEEDS a good backup solution, not just on RAID.
Re:To the HR department by kill+-9+$$ · 2009-01-02 16:44 · Score: 1

Something tells me that the folks involved aren't going to forget this lesson very soon. You'd need to verify that there is some working gray matter upstairs during the interview process, but so long as they know its an f'up, and why it was one, I can sorta justify they won't make that same mistake down the road on my dime.

--

-- A computer without COBOL and Fortran is like a piece of chocolate cake without ketchup and mustard
Re:To the HR department by DaveV1.0 · 2009-01-03 01:52 · Score: 1
Yeah, you keep thinking that. Meanwhile, I will keep believing my experiences in working world.
Experiences like
- being told to overwrite good current data with old data, even though I have told him we have been having troubles with the tape drive, which he won't pay to replace.
- being told "we can't afford a $500 tape unit for the server" by a boss who is ordering $5000 worth of new office furniture.
- being told "It will be OK. We have that RAID thing." * sound familiar
- being told "It won't matter if we lose the data. We will just shutdown, change the name of the company, and start again" * This is not a joke. This actually happened. Fortunately, it was for a contract consulting gig.
- being told "I don't want backups in case we get sued."
- being told "We can add the backup system later." then later "Why didn't you get it with a backup system?"
- My personal favorite "This is what my boss has approved. He won't spend the money on a backup system."
You may think it is your job to convince the boss, but often the boss either doesn't care, doesn't have a clue, or can't do anything about it.
--
There is no "-1 offended" or "-1 you don't agree with me" mod options for a reason.
Re:To the HR department by JoelKatz · 2009-01-03 03:33 · Score: 1

So what do you do? You have two choices:
1) Quit.
2) Go down with the ship.
If you pick 2, you're just as responsible.
"You may think it is your job to convince the boss, but often the boss either doesn't care, doesn't have a clue, or can't do anything about it."
Yes, it is your job. And if you can't do your job (whether it's your fault or not) you're in the wrong job.

A Great Disturbance ... by Anonymous Coward · 2009-01-02 05:43 · Score: 0

This morning I felt a great disturbance in the Blogosphere, as if tens of voices suddenly cried out in terror.

Virtual Machines for Backup happiness by MosesJones · 2009-01-02 05:46 · Score: 1

One thing that I've been switching to recently has been backing up not just the disk or data but creating a full virtual machine backup of the server. Space wise this can be a big hit so incremental data backups are done daily, with a full VM hit once a month alongside the full data dump. Now I'm shifting to doing a daily VM in addition which gives me the last close of play.

The reason for this is restore time, if it takes a few days to restore then its a right pain (or for some companies fatal) but a VM restore I can fire up on temporary kit in a matter of an hour or less and give a downgraded service while we patch up the full servers.

--
An Eye for an Eye will make the whole world blind - Gandhi

Re:Virtual Machines for Backup happiness by Darkk · 2009-01-02 13:13 · Score: 1

What tools do you use to create a VM image of an active server? I currently use the VMWare Server 2.0 running in Linux that hosts other Linux and Windows Servers. Creating a new VM image is easy BUT not sure if I can actually create a snapshot of a real physical server that can be hosted under VMWare.
Appreciate any insights you have.

A lesson for admins, and users too by gzipped_tar · 2009-01-02 05:46 · Score: 4, Insightful

No doubt this incident is the result of the admin's fault. He's been confusing mirroring and backup and carried on the mistake until it's too late, as pointed out in other comments.

Now what about a user's angle? The morale is you can never think your data is safer when it's "in the cloud". If you value your blog and your readers, you *should* save a copy of your work as well as the readers' info, *locally*, somewhere you have control over.

There's no place like $HOME.

--
Colorless green Cthulhu waits dreaming furiously.

Re:A lesson for admins, and users too by djmurdoch · 2009-01-02 05:57 · Score: 4, Insightful

And a corollary to the parent's good advice: if you can't easily get a complete copy of your work, find another host. Manual one-by-one downloads don't cut it.
Re: A lesson for admins, and users too by lalena · 2009-01-02 06:01 · Score: 1

If I had mod points, I would mod this up.
No one thinks about the user's responsibility to backup their data on the web.

I assume that almost no one backs up their blog entries, but how many people would lose all of their friend's email addresses if they lost their email account. How about their family pictures hosted online.

Even if someone did have a popular blog and backed up all of their data, they have still lost any fans, links, page rank... that they had accumulated.
Re:A lesson for admins, and users too by gzipped_tar · 2009-01-02 06:22 · Score: 1

Hmm, indeed. However, this kind of service usually comes at a cost.
For most bloggers, it's the content of their posts that matters. They don't need byte-to-byte backup. They don't need to preserve the data structure. Usually a text editor/word processor plus storage space for photos, videos, etc. are enough to defeat such a disastrous crash. As long as the content you created is still readable, it's OK.
This is apparently not the most efficient way to save the readers' comments, but anyway, it's the community's existence, not what they said, that matters more. If all you have is just a plain list of your frequent readers' emails, you can still keep in touch with them and hopefully rebuild the community.
Sometimes small measures can do very much. It's just that we are not *doing* them.

--
Colorless green Cthulhu waits dreaming furiously.
Re:A lesson for admins, and users too by djmurdoch · 2009-01-02 12:24 · Score: 1

However, this kind of service usually comes at a cost.
That's not always true, but when it is: don't put anything valuable there. If you have something valuable that you want to put online, and can't find a free host that lets you keep backups, then spend some money.

For most bloggers, it's the content of their posts that matters. They don't need byte-to-byte backup.

By valuable, I mean something that you'd like to come back to and re-use sometime in the future. Throwaways like /. posts don't need backups. Things you spend time on do.
Re:A lesson for admins, and users too by Darkk · 2009-01-02 13:17 · Score: 1

I don't think "confusing" with mirroring is his fault. I think it's pure laziness and the chief didn't bother to check on his own guy's work to make sure it's properly configured and running.
So in a nutshell laziness have no place in IT world given the fact there are plenty of both free and commercial tools that are easy to use.

Really!? by k3v1n · 2009-01-02 05:51 · Score: 1

Really hard to imagine something like this, especially when database backups are SO EASY. mysqldump, gzip, scp to a different machine or upload to S3. gzip'ed database dumps take up so little space!

Re:Really!? by daveime · 2009-01-02 06:01 · Score: 1

Unless you fancy killing your production server for 45 minutes while he dumps out a 1.5GB table with numerous indexes (thus locking the table for any writes), I'd suggest this policy.
Replicate the database onto a second server, and on that server, do all of the above. Believe me, it saves a lot of pain and disk grinding on your production server.
Re:Really!? by Anonymous Coward · 2009-01-02 06:28 · Score: 0

If the database doesn't have hot backup capabilities, then maybe it's the wrong database to use in a situation where you can't afford the downtime to back up.
Re:Really!? by daveime · 2009-01-02 06:47 · Score: 1

The thing with hot backups, is that for one or two tables, it's great ... try 200 or 300, with all tables locked while the raw database files are copied, and we are talking longer than 2 or 3 minutes. And believe me, you MUST stop all writes to ANY table during the hotcopy process, otherwise all you'll end up with is a bunch of tables with screwed up referential integrity.
Also, as the GP stated, gzipped raw SQL dumps compress very nicely. Try compressing a mysql MYI (index file), and you'll save nothing, as it's already in compressed b-tree format.
Doing a true dump from a replicated copy means zero downtime, as you can simply disable the replication slave, do the dump, bring replication back up and let it catch up on the statements still in the log. And in case your server really is fried, you can even load the SQL dumps back into a different (more modern) version of the database system on a newer server.
Try using hotcopied files, and you may run into version incompatibility problems ...
I'll stick with my method anyway.
Re:Really!? by TheSunborn · 2009-01-02 07:00 · Score: 1

Why should the tables be locked while doing a backup?
I mean any database that support transactions(That is anything, other then mysql using MyISAM) should be able to
just run the backup in it's own transaction. The entire point of a transaction, is that changes that start after the transaction start, are not visible to queries running within the transaction. Perfect for backup).
Re:Really!? by daveime · 2009-01-02 08:45 · Score: 1

Correct me if I'm wrong, but transactions are usually implemented using a transaction log as an intermediate queue ... i.e. a transaction that does an UPDATE will end up queued on the transaction log, because another transaction is already performing a "SELECT * FROM", and as it encompasses thw whole dataset of the table, it cannot be changed by another transaction until that transaction has completed.
My original point is that backups are always bound by disk I/O, and while hotcopies can speed this up by simply doing a cp *.MYD, *.MYI etc, rather than converting the files back into raw SQL that essentially reloads the table from scratch, I still feel that using replication allows a zero downtime solution, and allows much better control of the exact snapshot point of the backup itself.
Sorry if my knowledge it limited predominently to MyISAM tables, this has been my domain for a lot of years, and somehow I've never quite trusted InnoDB tables due to the reported time needed to do things like adding columns etc (it's bad enough on MyISAM).
Re:Really!? by Fulcrum+of+Evil · 2009-01-02 10:13 · Score: 1

I mean any database that support transactions(That is anything, other then mysql using MyISAM) should be able to just run the backup in it's own transaction.

Try it some time - see what that does to the Redo logs.

--
"We returned the General to El Salvador, or maybe Guatemala, it's difficult to tell from 10,000 feet"
Re:Really!? by Anonymous Coward · 2009-01-02 10:34 · Score: 0

I'll stick to my method.
BACKUP DATABASE AdventureWorks
TO DISK = 'Z:\SQLServerBackups\AdvWorksData.bak'
(Zero downtime, no transactions impacted, virtually all server operations remain available while the backup is in progress (ALL non-DDL operations are) and, yes, the backup IS consistent. All transactions will either be committed or not.)
Sheesh, Linux dweebs. This is the sort of thing that SQL Server users have had for the past decade - they take it for *GRANTED*. MySQL users, meanwhile, are slowly getting their head around 'transactions'. Sigh.
(And before you mumble 'redo log blowout' - nope. It actually works the other way - the BAK file is a raw dump of the data and a segement of the transaction log to unwind any transactions that got half-committed into the backup while it was being taken.)
Try it sometime.
Re:Really!? by TheSunborn · 2009-01-02 11:50 · Score: 1

Well, I don't hope that taking a backup writes to the database, so the redo log should not cause problems should it?
Or do the redo log grow because the long running backup transaction will cause all other transactions to write more to the redo log(Or rather, not be able to clear it)?
The largest database I manage, is still so small that I can do a backup within 20 seconds, so I newer bothered to do any benchmark to see how the backup effect the performance.
Re:Really!? by Fulcrum+of+Evil · 2009-01-02 12:18 · Score: 1

The second one; if your DB is big and busy, then a huge ass transaction to dump the whole thing is just not happening. I like the idea of hooking up a replicated slave - you can tell it to stop processing logs, back it up, and then tell it to play catchup without impacting the online DB.

--
"We returned the General to El Salvador, or maybe Guatemala, it's difficult to tell from 10,000 feet"
Re:Really!? by yalap · 2009-01-02 12:59 · Score: 1

I think you're correct. Modified pages are written to the t-log and the database file is read-only. If a page read is requested the t-log file is checked for a copy otherwise it is read from the database file. When a checkpoint occurs all processing halts and the t-log pages are written to the database file. The checkpoint occurs either at specified times or when a threshold of transactions is reached. So the backup can be a sequential read of the database file plus a read of the t-log and checkpoints are prevented (or run as part of the backup job). The restore process will handle applying the t-log to the restored database. HTH
Re:Really!? by daveime · 2009-01-03 01:53 · Score: 1

"unwind any transactions that got half-committed"
WTF ? A transaction is either committed or not comitted ... how can a transaction be "half-comitted", while other users are trying to access the same data supposedly under the impression that they are seeing the state of the database either before or after (but NOT in the middle) of any other transaction ?
Why does this scare me ? Maybe I'm just a Linux Dweeb who doesn't like terminology like half-committed.
Re:Really!? by JoelKatz · 2009-01-03 03:38 · Score: 1

"WTF ? A transaction is either committed or not comitted ... how can a transaction be "half-comitted", while other users are trying to access the same data supposedly under the impression that they are seeing the state of the database either before or after (but NOT in the middle) of any other transaction ?"
By "committed", he means written to the main database, rather than just log or journal files. Since it will take more than one write to the main database to complete a typical transaction, there can always be transactions that are "half-committed" in this sense.
What makes transactional databases reliable is that they use multi-phase commits. And they can overlap the phases of multiple commits.
It sounds counter-intuitive, but because the commit is done in multiple phases, the database can fully recover from a crash that occurs in, at, or between any phase or phases.

Hoax by jamesl · 2009-01-02 05:51 · Score: 0, Troll

This is so ignorant that it must be a hoax.

Google Cache to the rescue? by Anonymous Coward · 2009-01-02 05:51 · Score: 0

Perhaps the people at Google can help out?

Re:Google Cache to the rescue? by LordSnooty · 2009-01-02 06:34 · Score: 2, Interesting

I hope affected users are looking into this, I just did a search of a random JS blog and 2,000 entries were returned, all cached it would seem. So many people might be able to recover their work in a very painstaking manner.
Re:Google Cache to the rescue? by vrmlguy · 2009-01-02 09:37 · Score: 1

I thought that I'd do a not-so-random search of site:journalspace.com plus various common words, like "web". From looking at the results, I've got to go with the story's tag, "andnothingofvaluewaslost".

--
Nothing for 6-digit uids?

Professionalism by theskipper · 2009-01-02 05:51 · Score: 1, Informative

From TFL:

The data server had only one purpose: maintaining the journalspace database. There were no other web sites or processes running on the server, and it would be impossible for a software bug in journalspace to overwrite the drives, sector by sector.

The list of potential causes for this disaster is a short one. It includes a catastrophic failure by the operating system (OS X Server, in case you're interested), or a deliberate effort. A disgruntled member of the Lagomorphics team sabotaged some key servers several months ago after he was caught stealing from the company; as awful as the thought is, we can't rule out the possibility of additional sabotage.

First, it's somewhat lame/unprofessional to list "sabotage" as a possibility. Even if it's the strongest possibility. Adding the OSX comment and that a bug in their code is impossible is even lamer.

More importantly, if the key servers were sabotaged months ago, the first thing that I'd want to do is make a full image stored in multiple offsite locations. Ignorance of the RAID/backup issue is one thing, but knowing that the sabateur could have sprinkled the db with crap is even scarier.

Smells like there's more to the story than this. Or not.

Re:Professionalism by jav1231 · 2009-01-02 06:09 · Score: 1

Exactly. "Technical incompetence or ingnorance about security. We're not sure which." Okay so which makes you look better?
Re:Professionalism by moderatorrater · 2009-01-02 06:21 · Score: 2, Informative

Adding the OSX comment and that a bug in their code is impossible is even lamer.
The drives were overwritten sector by sector on a machine that didn't have any of their code running on it. Their application couldn't have done it because it couldn't execute arbitrary code on that server. The "impossible" comment makes sense to me.

As for it being lame/unprofessional to name the possibilities, I disagree. He states the OS it was running on and said that it was either an OS problem or sabotage. There might be a few possibilities, but that about sums them up right there. He was being thorough and open; what's the problem with that?
Re:Professionalism by Anonymous Coward · 2009-01-02 07:31 · Score: 0

The problem is that the OP is from a closet Apple fanboi.
Re:Professionalism by Anonymous Coward · 2009-01-02 08:34 · Score: 0

From TFL:
There were no other web sites or processes running on the server, and it would be impossible for a software bug in journalspace to overwrite the drives, sector by sector.
Yes, we all know that bugs that make bad things happen are impossible. Hint: if you know what it was going to do, it wouldn't be a bug.

No Archive.org either by computersareevil · 2009-01-02 05:52 · Score: 5, Informative

They also purposely blocked archive.org via a robots.txt exclusion, so the bloggers can't use that to try and recover some of their blogs.

Re:No Archive.org either by ds_job · 2009-01-02 06:02 · Score: 1

I was going to post that they had enough nous to get a robots.txt but not to actually maintain their system. I wouldn't want that company on my CV even if I was the janitor. This is going to be a millstone for anyone who was involved in this going back from day 2 (I can just about forgive them for being stupid on day 1)
Re:No Archive.org either by 42forty-two42 · 2009-01-02 06:03 · Score: 1

They blocked every spider except for a short list of whitelisted ones. There are still some random pages in google's cache if you search site:journalspace.com
Re:No Archive.org either by Civil_Disobedient · 2009-01-02 06:47 · Score: 1

I thought Coral Cache might work, but the only thing cached right now is the "out of business" message. If anyone has any actual links to content they could try Coral Cache themselves.
I'm a bit dumbfounded at the completeness of the stupidity. Let's see...
1. Use RAID mirroring as a backup solution? Check!
2. Don't make a copy of your data files anywhere else, ever? Check!
3. Actively prohibit public, free site mirrors from getting to your sacred content? Check!
4. Practice naked fire juggling in front of server farm? Wouldn't be surprised.
Re:No Archive.org either by Anonymous Coward · 2009-01-02 07:26 · Score: 1, Insightful

So let me get this straight... Journalspace.com was smart enough to have someone there setup a robots.txt file, but nobody there asked if anything was being backed up to a tape/external drive/DVD/CD/Floppy Disk/Cocktail Napkin?
I'm just glad I've never used Journalspace.
Re:No Archive.org either by troll8901 · 2009-01-02 08:56 · Score: 1

5. Shout at server disks, causing vibration? Check!

There is a denial going on by hwyhobo · 2009-01-02 05:52 · Score: 5, Insightful

In today's world where primary storage and protection storage are well-defined, and where entire industry grew around it (examples: NetApp, Data Domain), one is hard-pressed to understand the reason for such a debacle. The reading of the note referred to in the article leads me to believe, unfortunately, that Journalspace's IT department did not understand the difference.

It is sometimes considered a bad form to say something bad about fellow techies. We prefer to look for 'outside' causes. Still, to learn and avoid the same problems in the future, one has to admit his mistakes first. This paragraph from the Journalspace's page:

The value of such a setup is that if one drive fails, the server keeps running, using the remaining drive. Since the remaining drive has a copy of the data on the other drive, the data is intact. The administrator simply replaces the drive that's gone bad, and the server is back to operating with two redundant drives.

makes me believe there is a denial going on.

--
End anonymous moderation and posting on /.

Re:There is a denial going on by jbezorg · 2009-01-02 07:37 · Score: 2, Informative

The temptation for "But Macs can't fail!" bashing was strong, but I resisted. It did lead me to a question though. That is: Had they been mislead by the Mac culture? Could there been something in Apples ads or documentation that would lead them to this mistake?
The answer? No. At least not from Apple.
From page 32 of TFM: http://images.apple.com/server/macosx/docs/Server_Administration_v10.5_2nd_Ed.pdf

Defining Backup and Restore Policies
All storage systems will fail eventually. Either through equipment wear and tear, accident, or disaster, your data and configuration settings are vulnerable to loss. You should have a plan in place to prevent or minimize your data loss.

--
I've lost all my marbles except one & It's fun to test angular & centripetal acceleration in my skull
Re:There is a denial going on by Lost+Race · 2009-01-02 09:46 · Score: 2, Insightful

It is sometimes considered a bad form to say something bad about fellow techies.
Yeah, right. If there's anything professionals love to do, it's talk trash about their peers. What's the first thing a computer guy says when you bring him in to fix a broken system? "My god, what idiot spec'd/built/installed/configured this piece of garbage? It's a miracle it ever worked at all!" Ditto every other kind of professional, from plumber to surgeon to architect to accountant.
(As such a professional, I often discover that the idiot I'm complaining about was me.)
Re:There is a denial going on by hwyhobo · 2009-01-02 10:31 · Score: 2, Insightful

Ditto every other kind of professional, from plumber to surgeon to architect to accountant.
My experience is the exact opposite, particularly when comes to a medical profession. It's like mafia, and no one dares speak ill of another 'made man', at least not on the record.
I worked for over a decade as an independent networking consultant, and some of the most daring statements I heard people make when criticizing someone else's design were of the kind "perhaps it is not the most ideal for your environment. Needs change quickly, and not everything can be foreseen". Being a loudmouth rarely buys you a lot of business. Even your clients don't want to see that. It's not in good taste, and then, there is always a possibility somewhere in their minds that if you speak that way of others, you may speak that way of them.

--
End anonymous moderation and posting on /.
Re:There is a denial going on by Pfhor · 2009-01-02 11:54 · Score: 1

Actually there is a culture of "I am on a mac, why do I need backups" that exists because things are so easy to setup, and faily so infrequently.
Not on the server level, but on the desktop / user experience level. And if they are buying a machine that apple is selling as "easy to use as their desktop machines" with "no IT required," then yes, Apple is mis representing their product.
Of course, Apple can't sell enterprise level equipment worth shit, and knows it. 90% of their sales staff (in enterprise) would rather sell a product and never have to maintain contact with that customer until they need to buy something more again. Anything requiring more work or knowledge than what they had anticipating and deployed in house will rarely get done, unless you are buying MILLIONS of dollars of equipment.
As I've said up thread, the chances that they have an IT person at all, as in someone whose job it is to maintain the functioning and support of their IT infrastructure and services, and to secure them, is probably ZERO. They are web coders and dot commers, who just had an idea, bought an xserve, and threw their code up on it and had it run, and then collected ad revenue.
Re:There is a denial going on by jbezorg · 2009-01-02 13:30 · Score: 1

Statements like this "90% of their sales staff (in enterprise) would rather sell a product and never have to maintain contact with that customer until they need to buy something more again" make me think of XKCD.
I just Googled "Setting up X server" clicked on the first link ( http://xray0.princeton.edu/~phil/Facility/osx-mskcc.html ) who blatantly say they took most of it from this site ( http://sage.ucsc.edu/xtal/ ) and there, first item in the third column, Backing up and Cloning Disks.
I will agree on your assessment of the probability of them having an IT person. This seems like a problem any user who didn't RTFM and didn't have IT experience would make on any OS. Had these guys ran Linux, I think their actions, the outcome and apparent denial would have been the same.
Note, the last Apple I had the pleasure to use is a IIe and I'm not feeling the urge to rush out and get one.

--
I've lost all my marbles except one & It's fun to test angular & centripetal acceleration in my skull

Someone needs to be FIRED by spitek · 2009-01-02 05:53 · Score: 4, Funny

You pay your infrastructure people to maintain business, continuity I mean the tittle of this post made me go, "Really, no shit" That's like systems admin 101! If the admin was aware then the manager that didn't listen needs to be fired. If the manager listened and they are just run by retards then they got what they deserve. You'd think 17,000 visitors a month would be worth enough to do it right, in add revenue alone. The cost of a consumer machine running linux with a few TB's of SATA space - $1200 How much the company paid to have a system's admin play video games all day - $50,000 The cost of a 17,000 vistor a month site going down because they had no data base backups - Priceless.

Re:Someone needs to be FIRED by macbuzz01 · 2009-01-02 06:02 · Score: 2, Funny

I think they are all fired now.
Re:Someone needs to be FIRED by Anonymous Coward · 2009-01-02 06:20 · Score: 0

Um... Everyone is being fired. They're going out of business.
Re:Someone needs to be FIRED by Anonymous Coward · 2009-01-02 07:11 · Score: 0

More like Windows Users for Dummies 101
Re:Someone needs to be FIRED by 91degrees · 2009-01-02 11:10 · Score: 1

Some guy's going to have a bit of a problem getting a job.

"So, you were the IT guy for a company that collapsed because of an IT failure. Excellent! We hire the best here. You'll join the financial wizards from Enron and the Legal minds from SCO."
Re:Someone needs to be FIRED by jadavis · 2009-01-02 11:23 · Score: 1

You'd think 17,000 visitors a month would be worth enough to do it right, in add revenue alone.
If it's one dollar per visitor per month, that's only $17000/mo in total revenue. Hardly enough to do anything right, let alone everything.

--
Social scientists are inspired by theories; scientists are humbled by facts.
Re:Someone needs to be FIRED by spitek · 2009-01-02 11:37 · Score: 1

Granted you have a point that it is not a extremely large amount but that's still way more than needed to take database backups. That's what the $1200 computer I mentioned was for, really you could have done it with half that
Then an hour of a sys admins time to set a cron job to do a database dump then scp or rsync it somewhere. I have customers with 1 /100th the traffic yet they still have database backups. They even grab them manually down to their home machine every couple weeks via a web based open source tool
Money has no place in defending this even. It's human ignorance or laziness , period
Re:Someone needs to be FIRED by jadavis · 2009-01-02 13:30 · Score: 1

Money has no place in defending this even.
It takes money to make people care. Granted, this is an extreme case of apathy, but I assume that is at least partially explained by low revenue.

--
Social scientists are inspired by theories; scientists are humbled by facts.

Wow. Like seeing an amazing car wreak on video. by 1_brown_mouse · 2009-01-02 05:58 · Score: 1

I don't know how you can run a business without protecting your bread and butter.

You have insurance on your car. Hopefully some health insurance.

How about some Database Insurance? Hell, $150 for a simple 1TB USB HD?

Can you be that incompetent? Why yes, I guess you can be.

Just give up? by girlintraining · 2009-01-02 06:02 · Score: 1, Troll

Slashdot must be populated mostly by engineers and programmers that work on the software side of things, because nobody's considered mentioning a data recovery service, instead giving the "You're doooooooomed! You should have paid your IT people more" line. How very kind everyone here is of other people's technical mistakes. Because none of us have ever seen a bunch of dot files in a directory and typed "rm -rf .*" and then cried after or screwed up some production server with a "minor change"...

--
#fuckbeta #iamslashdot #dicemustdie

Re:Just give up? by IMarvinTPA · 2009-01-02 06:10 · Score: 2, Informative

The article says the data recovery company has found the drives wiped. There is no recoverable data.
It seems like the actual site failure was on the 23rd or so.
IMarv
PS, the internet archive was blocked by their robots, so there isn't even that to look at. http://web.archive.org/web/*/http://www.journalspace.com

--
Trusting software vendors is no smarter than trus
Re:Just give up? by DaveV1.0 · 2009-01-02 06:11 · Score: 1

I can see why you are "in training". DriveSavers is a data recovery service. Next time, RTFA.

--
There is no "-1 offended" or "-1 you don't agree with me" mod options for a reason.
Re:Just give up? by Anonymous Coward · 2009-01-02 06:11 · Score: 0

Slashdot must be populated mostly by engineers and programmers that work on the software side of things, because nobody's considered mentioning a data recovery service
No, Slashdot is populated mostly by people who don't bother to read TFA...
From TFA...

DriveSavers called today to inform me that the data was unrecoverable.
Re:Just give up? by Rakshasa+Taisab · 2009-01-02 06:11 · Score: 1

There were no other web sites or processes running on the server, and it would be impossible for a software bug in journalspace to overwrite the drives, sector by sector.
Doesn't sound like an 'rm -rf .'...

--
- These characters were randomly selected.
Re:Just give up? by Anonymous Coward · 2009-01-02 06:17 · Score: 0

The drives are still good were overwritten. Blanked out, wiped, nulled, zapped, zeroed. No data recovery service is going to get that data back.
That's why no one has mentioned it ya dork.
Re:Just give up? by rodeored · 2009-01-02 06:23 · Score: 1

Definitely management material
Re:Just give up? by moose_hp · 2009-01-02 06:25 · Score: 1

TFA mentions that the disks were erased sector by sector, not just a rm -rf / or a DELETE FROM users CASCADE; or something to that effect.

I remember a slashdot story about a guy sending hard disks to data recovery services, which were pretty much dealt with a dd if=/dev/zero of/dev/hda, and nobody claimed the "king of recovery" title yet.

--
DON'T PANIC.
Re:Just give up? by Anonymous Coward · 2009-01-02 06:31 · Score: 0

The "data recovery service" was mentioned right in the journalspace 'we died' message.
Also a Data Recover Service can't be expected to recover data if the drive is sufficiently overwritten.
They really are dooooooooomed.
And this mistake is among the most basic. Much like assuming you don't need to change the oil in a pickup truck with two gas tanks.
Re:Just give up? by girlintraining · 2009-01-02 06:35 · Score: 2, Informative

Since you're the only poster to reply without yelling "idiot" (thanks, btw) -- Zeroing the drive makes software recovery impossible. It doesn't make data recovery impossible. There are ways to read the offset data, though this is getting harder as magnetic densities increase every year. Ontrack data recovery specializes in that kind of thing. I've seen them do it. Granted, it's not a 100% thing -- you don't get back something that even resembles a filesystem. At least a third of it is uselessly garbled binary.

--
#fuckbeta #iamslashdot #dicemustdie
Re:Just give up? by Anonymous Coward · 2009-01-02 06:37 · Score: 0

...
backing up data correctly: 99% reability
recovering lost data: you're kidding, right?
the reason people here do not suggest recovery tools is not to spread the "backup? who needs it? we can always recover somehow" way of thinking...
Re:Just give up? by Darth_brooks · 2009-01-02 06:44 · Score: 1

No, apparently slashdot is populated by readers and mods who don't RTFA:
"Tuesday:
Journalspace is no more.
DriveSavers called today to inform me that the data was unrecoverable."

--
There are some people that if they don't know, you can't tell 'em.
Re:Just give up? by jeffmeden · 2009-01-02 06:51 · Score: 1

Maybe this is a case where Slashdotters Read TFA, whose third line is:

DriveSavers called today to inform me that the data was unrecoverable.
Cue the "doooooooomed!" music.
Re:Just give up? by nrozema · 2009-01-02 06:52 · Score: 1

Because none of us have ever seen a bunch of dot files in a directory and typed "rm -rf .*"
I'm quite sure most of us have done something stupid like that. That's what backups are for - to prevent unavoidable human error from becoming a catastrophic failure.
Re:Just give up? by Sockatume · 2009-01-02 07:01 · Score: 1

Indeed, even a hard-core multiple-overwrite wiping is at least partially recoverable, as the heads do not align perfectly and fail to cover the whole region of the old bit. The trick there would be distinguishing the real data from the intermediate overwrites. It's also possible to distinguish, say, a one overwriting a zero from a one overwriting a one. However that's very far into the "hypothetical and expensive" range, and will be of little consolation to Journalspace. If the drives were given a truly random, multiple overwrite (and not, say, zero zero zero...), I dare say that a recovery would be a couple of PhD theses worth of effort.

--
No kidding!!! What do you say at this point?
Re:Just give up? by Kjella · 2009-01-02 07:30 · Score: 1

You might have a point, if the data was accidentally overwritten once with something. Given the circumstances, it might just as easily have been wiped. Either way that would cost absurdly lot, take lots of time and as you say, probably leave a mess that you could try piecing together to a working database but would probably fail.

--
Live today, because you never know what tomorrow brings
Re:Just give up? by Anonymous Coward · 2009-01-02 07:40 · Score: 0

Sigh. There's actually an open offer of prize money for someone that formats over a disk and then is able to recover the data. For a number of reasons, no-one has been able to claim the prize. (Too lazy to find a link)
A data recovery service is not a license to be stupid. A data recovery service can help you, for example, if your disk array gets submerged in sewage, but will not be able to help you if you use disk scrubbing hardware.
It may be theoretically possible to recover the data using devices capable of detecting extremely small charges on a platter, but the time and money required in order to use this approach would be practically prohibitive.
Re:Just give up? by imsabbel · 2009-01-02 08:10 · Score: 2, Interesting

Thats bullshit, and has been for decades.
Its a myth. Just learn about it. Even if we use our newest AFM, or XMCD microscopy, you wont see an overwritten byte in any drive of the last 5 years. And even the last decade is very doubtful (basically, since GMR drives are around).
There IS NO SPACE between tracks anymore. Bits are right next to each other. If you overwrite, nothing above the superparamagnetic limit is left.
Not even the NSA could get anything useful out of a single overwrite with zeros (well, except relocation sectors and other specialities that might compromise security, but doesnt help with a backup)

--
HI O WISE PRINCE. WHT TOOK U SO DAM LONG?
Re:Just give up? by Sorthum · 2009-01-02 11:06 · Score: 1

Your sig is hilariously appropriate for this article...
Re:Just give up? by Sockatume · 2009-01-02 12:09 · Score: 1

Well, I can only apologise for assuming that review papers written on the topic since 2000 were up-to-date.

--
No kidding!!! What do you say at this point?
Re:Just give up? by Sockatume · 2009-01-02 12:12 · Score: 1

(I'd like to emphasise, again, that it's well into the range of the hypothetical, and thus very expensive, and would probably be novel enough to get a couple of grad students to their theses.)

--
No kidding!!! What do you say at this point?
Re:Just give up? by Darkk · 2009-01-02 13:31 · Score: 1

Infected with some kind of a trojan?
Re:Just give up? by Anonymous Coward · 2009-01-03 09:24 · Score: 0

They are very smart. It's just like it never happened. There was no site.

In other news... by talon77 · 2009-01-02 06:03 · Score: 1

The sky is blue, the earth is round, and U2 sucks.

Thanks.

Re:In other news... by Anonymous Coward · 2009-01-02 09:05 · Score: 0

No, the sky is black at night, the earth is flat, and U2 has multiple platinums.

Source code may be released! by nih · 2009-01-02 06:04 · Score: 0

We're considering releasing the journalspace source code to the open source community

I can't wai

--
I'm a rabbit startled by the headlights of life :(

Mirroring by jav1231 · 2009-01-02 06:04 · Score: 5, Insightful

See mirroring is like...well a mirror. If you stand before one and stick a fork in your eye your mirror-image does the same. In real time. Analogies are there for a reason.

Re:Mirroring by Anonymous Coward · 2009-01-02 06:45 · Score: 0

Can you repost your analogy with something more common, like a car?

Actually, your mirror image don't fork it's eye on _realtime_, even light have a speed, you know.
Re:Mirroring by jav1231 · 2009-01-02 07:31 · Score: 1

Actually, your mirror image don't fork it's eye on _realtime_, even light have a speed, you know.
Well played...well played! :p
Re:Mirroring by gEvil+(beta) · 2009-01-02 08:30 · Score: 4, Funny

See mirroring is like...well a mirror. If you stand before one and stick a fork in your eye your mirror-image does the same. In real time. Analogies are there for a reason.

There's a major flaw in your analogy. See, if I stick a fork in my right eye, the mirror image will stick a fork in his left eye. Between the two of us, however, we still have one good left AND right eye. So ipso fatso, I have a complete backup.

--
This guy's the limit!
Re:Mirroring by Aristos+Mazer · 2009-01-02 10:45 · Score: 1

Not that you'd see this analogy -- you'd have a fork in your eye.
Re:Mirroring by rrohbeck · 2009-01-02 10:49 · Score: 1

Ooh. Delay line backup. Now if we used something slower than light speed, say, a delay line?

--
thegodmovie.com - watch it

Even rsync mirroring would be better by Anonymous Coward · 2009-01-02 06:05 · Score: 0

Blimey. When I saw "mirroring" in the title, I thought it was going to talk about rsync. But raid mirroring being your entire backup strategy? That's special.

Look at the bright side by Anonymous Coward · 2009-01-02 06:05 · Score: 0

The first real backup they do is going to run really, really fast.

Running Mac OS server, seriously?! by Anonymous Coward · 2009-01-02 06:06 · Score: 0

Is this a joke? Did I read that correctly? The server was running on Mac OS X server? No wonder they are dead now! Only a Macretard could make such decisions.

Data recovery services? by Anonymous Coward · 2009-01-02 06:06 · Score: 0

They should still be able to send the drives to a data recovery service and recover a lot, if not all, of the data that was overwritten, right?

Re:Data recovery services? by Jerry+Smith · 2009-01-02 06:56 · Score: 1

They should still be able to send the drives to a data recovery service and recover a lot, if not all, of the data that was overwritten, right?
AFAIK DriveSavers gave up on their data, and they have quite a reputation on data-recovery. To me it looks like the data (i.e. both drives) was overwritten.

--
All those moments will be lost in time, like tears in rain. Time to die.

Liberal Bloggers by Nom+du+Keyboard · 2009-01-02 06:07 · Score: 0, Troll

If it's only liberal bloggers content...

--
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."

If you can't be a good example... by Anonymous Coward · 2009-01-02 06:07 · Score: 1, Funny

"If you can't be a good example, you'll just have to be a horrible warning."

Catherine Aird
Quoted in the Book Practical UNIX & Internet Security

Re:If you can't be a good example... by ColdWetDog · 2009-01-02 16:00 · Score: 1

Maybe we should get a Paypal account together and send them a copy of this.

My two cents: ..

--
Faster! Faster! Faster would be better!

RAID has never been a backup solution by David_Hart · 2009-01-02 06:08 · Score: 1

RAID is used for three purposes, to speed up access to data, to create a volume larger than any one disk, and to mitigate against disk failure. It has never been, nor ever will be a backup solution on its own. A backup solution involves making a copy of the data to independent storage, be it tape, disk, etc. Ideally a copy will also be sent offsite in case of fire, etc. In addition, the restore process needs to be tested on a seperate restoration box to make sure that the backup process and the media are working correctly.

Personally, I am not sure why this is a story on Slashdot as everyone here should have at least this basic understanding of how to protect their data.

David

I Can't belive it.... by s0litaire · 2009-01-02 06:08 · Score: 1

all these posts and not ONE OS X / Mac-Fanboy joke!! not even a "They did have a backup solution: they prayed to their God Jobs every night to protect the data" Or "Apple servers are to cool to corrupt!" Come on guys! i know it's Friday night but you can't still be hungover from New Years... :D:D

--
Laters Sol "Have you found the secrets of the universe? Asked Zebade "I'm sure I left them here somewhere"

Re:I Can't belive it.... by Cormophyte · 2009-01-02 10:20 · Score: 1

It's not that we missed the opportunity to spread our love for all that is Mac, we haven't said anything because we know that as we speak Steve is on his way with The Disk and then you'll all see the true power of Jobs.

(Written from a Mac Pro)

The simplest things by kylegordon · 2009-01-02 06:08 · Score: 1

Many years ago, I had a minor disk failure. However, it resulted in the loss of the root directory inode. Everything within the directory (I think it was mounted as /var/) vanished... and mirroring would never have saved it.

sjmurdoch very kindly wrote python and debugfs magic that recovered about 95% of the structure and files, but it was a lesson learned against using mirroring as a form of backups...

Re:The simplest things by Darkk · 2009-01-02 13:34 · Score: 1

Yep, mirroring even mirrors disasters and screw ups as some people found out.
I think when people hear "RAID" it's protected. What they don't realize RAID is for HARDWARE protection not software. DUH!!!
Ah well, it's a shame a company is now out of business as they can't recover ANY of the lost data.

So, what was it worth in dollar terms? by jcr · 2009-01-02 06:09 · Score: 1

Anyone have an estimate of how much equity in this business just vanished?

-jcr

--
The only title of honor that a tyrant can grant is "Enemy of the State."

Sabotage by phorm · 2009-01-02 06:09 · Score: 1

A disgruntled member of the Lagomorphics team sabotaged some key servers several months ago after he was caught stealing from the company; as awful as the thought is, we can't rule out the possibility of additional sabotage.

Seems to be quite possible that a previous admin did this. In this case, the only real backup would be something disconnected (and tested). Risk factors are otherwise still high:

a) RAID: Data overwrite, controller failure, PC failure,multiple disk failure, data corruption/RAM/drive issues, deliberate erasure, out-of-space related errors

b) RAID+backup disk: Controller failure, major hardware malfunction (power surge, explosion).

c) On-site sync: Site-wide catastrophe (explosion, flood,electrical surge, etc), network issues (usually temporary)

d) Off-site sync: Deliberate sabotage (privilaged user), which could be somewhat offset using a pull-based backup rather than a push (backup server having very limited access and none from the server being backed up). Can also be redundant servers in different locations

e) Permanent storage media: Even this could go bad if the tapes/disks/etc get damaged or demagnetized etc. Tape backups can also be slow'ish (incremental-differential backups a bit faster to do)

There is no silver bullet, but relying on just RAID ignores a huge number of potential failures, ESPECIALLY after existing issues with attempted sabotage. No as it was the database that was nuked and not necessarily the webserver/etc, I'd be interested in knowing what logs were in place and if there traces of sabotage. A login at an odd time, but a weird account, or by a script set to execute on say "01-01-2009 00:00:01" would be a good start.

Wonder if a data-recovery firm might be able to get something bad on a DB if it's just been erased. Might be easier than if it were actually overwritten with bad data...

I can only say Gentoo-wiki.com by Anonymous Coward · 2009-01-02 06:10 · Score: 0

Gentoo-wiki.com is a awesome site. However, the guy who runs it either has no backup plan or really a lot of bad luck. I think it is at least the third time the main site says "Gentoo-Wiki recently had it's database lost; this is the rewrite of the site.".

Backup components by starfishsystems · 2009-01-02 06:10 · Score: 1

Disk mirroring is a useful component of a total disk backup solution. It's not itself a total solution, as it only protects against a subset of possible data loss scenarios.

Okay, that wasn't too hard to understand. Now let's move on.

--
Parity: What to do when the weekend comes.

Re:Backup components by Anonymous Coward · 2009-01-02 13:20 · Score: 0

Disk mirroring is a useful component of a total disk backup solution.
No! (swats parent with rolled-up newspaper) No!
Mirroring is to keep the server and applications running in the event of a drive failure, not to backup data.
Okay, for home use, whatever. But for any remotely serious serving mirroring and RAID has nothing to do with preserving data.

Google cache diving by Chris+Pimlott · 2009-01-02 06:10 · Score: 5, Informative

Looks like at least some content is still in Google's cache, those looking to salvage their journals should act quickly.

You can limit google's search results to a particular site by using the "site:domainname.com" search term (example) and then click the "Cached" links of each result to see Google's copy.

There's also a Greasemonkey script for Firefox that can automatically add Google Cache links next to page links, so you can navigate from one cached page to another easier.

Re:Google cache diving by Darkk · 2009-01-02 13:21 · Score: 1

Very nice catch but doesn't really help to bring the site backup (pardon the pun).
At least the bloggers can recover much as they can before it disappear forever in digital heaven.

Re:No Archive.org either-Compound Foolishness by Nom+du+Keyboard · 2009-01-02 06:11 · Score: 2, Interesting

They also purposely blocked archive.org via a robots.txt exclusion, so the bloggers can't use that to try and recover some of their blogs.

This is just compound foolishness. I gather they did it in an attempt to control bandwidth costs since it's hard to imagine any other reason.

--
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."

You need more than backups ... by blowdart · 2009-01-02 06:11 · Score: 5, Insightful

You don't just need backups. You need to TEST them. Having a backup run every night is nice and all; but if the tapes are unreadable and no error was reported, or if you're doing it wrong and the backup is corrupted and you only find out when you come to restore ....

Re:You need more than backups ... by __aasqbs9791 · 2009-01-02 06:37 · Score: 1

Yep, there was just a story on one of the sites I visit (it was a week or so ago and I can't recall which one at the moment) where that is exactly what happened. I think it was the backup for the database, IIRC, and when they needed it for recovery, they found out it was corrupt.

So does anyone have suggestions for the best automated way to backup and to test those backups?
Re:You need more than backups ... by nolife · 2009-01-02 07:03 · Score: 1

Verify.
Well not completely. That will tell you what was sent is what is on the backup media, what was sent may not have been correct though and might not actually be usable for a backup. The media could become corrupt after that as well.
If anything, at least using some type of verify option should put you one more step away from being the "problem" if a restore goes bad.
Actually walking through the whole backup/restore process (at least restore to a different location) is the only way.

--
Bad boys rape our young girls but Violet gives willingly.
Re:You need more than backups ... by nabsltd · 2009-01-02 07:14 · Score: 2, Interesting
The best way I have found to test the backup is to nuke the data and restore.
Seriously, if you know what files store the data (and that you are backing up), just stop services and rename a directory or two so the data is "gone". Then, restore from backup, start the service, and see how things look. Another good way is to restore the data to a VM that runs the same software as the production server. You can sandbox a simulation of the entire Internet inside a few VMs if you want, and test what happens.
I just did something similar when I upgraded the OS on a VM that runs a MySQL server:
1. Create and configure new VM
2. Stop services on old VM
3. Run backup on old VM
4. Stop old VM
5. Reconfigure new VM with correct IP, etc., and restart
6. Restore data to new VM from backup
7. Test
Basically, if things had gone poorly, I could just stop the new VM and revert back to the old one.
Re:You need more than backups ... by SaDan · 2009-01-02 07:14 · Score: 1

Automation of the entire process only goes so far. You have to manually verify each step of the backup process anyways, so just dedicate a bit of your time each month (or quarter) to verify backups are in fact working.
Fun stuff, but worth the time and effort if disaster ever strikes.
Re:You need more than backups ... by mortonda · 2009-01-02 07:17 · Score: 3, Informative

Backups must be:
1) Automated - if you need human intervention, it will fail
2) Point-in-time - the system must be able to provide restores for a set of times, as fitting for the turn around on your data. A good default is: daily backups for a week, weekly for a month, and monthly for a year
3) TESTED: You must fully test the restoration process (if this can be automated, even better). Backups that you can't restore from a bare machine are worthless.
For better disaster recovery, backups should be:
4) offsite - if a fire or tornado hits, is the backup somewhere else?
5) easily accessible - how long will it take to get the restore going?
Re:You need more than backups ... by scubamage · 2009-01-02 07:47 · Score: 1

This is hugely important, especially with CDR's. It hit me recently for personal archives. I found an 8 year old burnt CD and was extremely excited. Until I realized that it was no longer readable without constant CRC errors. This was a cd that was put in a case immediately after burning and had never once received a scratch. The ink just broke down over time (as its known to do). The data could most likely have been saved if I'd checked it 3-4 years ago. My fault.
Re:You need more than backups ... by __aasqbs9791 · 2009-01-02 08:22 · Score: 1

But what about when you have several TB of already compressed data (group 4 TIFFs) that changes frequently? I had this situation at a place I used to work.
Re:You need more than backups ... by __aasqbs9791 · 2009-01-02 08:24 · Score: 1

But how do you verify several TB of already compressed data (group 4 TIFFs)? I'm asking because I had this exact situation at a place I used to work. We had no budget set aside of IT, and they never let me spend the time to actual develop something, and I don't work there anymore, but I'm still curious as I have friends still there.
Re:You need more than backups ... by spandex_panda · 2009-01-02 08:53 · Score: 1

You don't just need backups. You need to TEST them. Having a backup run every night is nice and all; but if the tapes are unreadable and no error was reported, or if you're doing it wrong and the backup is corrupted and you only find out when you come to restore ....
This happened to my dad, he has a small business and it has many many entries in a patient database. The staff religiously placed the tapes into a drive for backup and stored them in a fireproof safe. One day the server died... everything was lost. Backups weren't working properly and around 3 months of data was lost. (luckily only 3 months) It should have been one day lost but perhaps only the monthly backups were actually working! Bugger!

--
like phosphorescent desert buttons singing one familiar song
Re:You need more than backups ... by rsax · 2009-01-02 09:11 · Score: 1

But how do you verify several TB of already compressed data (group 4 TIFFs)?
Create checksums of each file before backup and then verify the restored files using the previous checksums?
Re:You need more than backups ... by SaDan · 2009-01-02 09:20 · Score: 1

Bingo.
It's IT. It's not supposed to be easy. ;-)
Re:You need more than backups ... by __aasqbs9791 · 2009-01-02 09:47 · Score: 1

And if you can create and verify 50 checksums per second it would take over 38 hours to check one project that we had (and not even the largest we had at the time at 7 million images). Now maybe 50 per second isn't reasonable (I actually think that's probably generous unfortunately) but I think you now see my problem. With large datasets backups are even more important (since they represent so many man hours) but they are so time and resource intensive they are not practical. Someone must have dealt with this problem and solved it by now.
Re:You need more than backups ... by Anonymous Coward · 2009-01-02 10:42 · Score: 0

I work for a company (NetApp) as support, and believe me, this happens way too often. The customer has YEARS of daily and weekly backups, and NONE are any good. Seems to happen more for Exchange Admins ;).
If that happens, once whatever caused the data to get corrupted is fixed, you get to see how good you are at rebuilding data (raid label edits, rebuilding the raid array by hand in software, going back to previous raid stripes that were written then discarded, etc.)
Re:You need more than backups ... by Anonymous Coward · 2009-01-03 09:51 · Score: 0

I had a Colorado tape backup on my home computer in 1994. I had half a dozen tapes and made regular backups. After a few years I suddenly got 'tape corrupt' (or similar) error and that was that. The tapes were useless since the index was on the first tape. Now I burn an image once a year to an external drive and copy files to cd or dvd periodically. Still, I haven't restored an image to a new drive to make sure it is really good. It's the only way to be sure. I also don't trust compression or encryption or password protection of backups. Just another excuse for something to go wrong. How about those thumb drives? My son works in IT and has gone through about one a year. They just quit working.
Re:You need more than backups ... by nabsltd · 2009-01-03 18:37 · Score: 1

Do the same thing...nuke and restore.
You don't have to nuke everything, just enough to know that your backup/restore solution is functional. It's completely impossible to test every backup, and with terabytes of data, you might not even want to do a single complete restore test, but you can do enough to know that you can put back files to the state they were.
Re:You need more than backups ... by __aasqbs9791 · 2009-01-03 19:02 · Score: 1

That's what I was afraid of. That's basically a "it can't be done" answer (no offense meant). If you can't test a significant percentage then there's little to no point in testing at all as you can't have faith in the backup working if you need it, and if you don't have faith in it, then you are doing "backup theater" rather than a real backup.

Yes, it is a backup solution by PearsSoap · 2009-01-02 06:13 · Score: 2, Interesting

Since when is Slashdot in the habit of disagreeing with Linus Torvalds?

Only wimps use tape backup: _real_ men just upload their important stuff on ftp, and let the rest of the world mirror it ;)

journalspace source (run your server as root) by marmoute · 2009-01-02 06:16 · Score: 1

#!/bin/sh rm -rf /*

google cache by lukas.mach · 2009-01-02 06:17 · Score: 1

Can't they recover most of the posts by some resonably complex Google Cache data mining? Shouldn't be that hard.

Re:google cache by clawoo · 2009-01-02 06:26 · Score: 1

Well actually it is not as easy as it seems, because each page had a design of its own. I mean, writing a crawler to go through Google's results list (or any other service that indexes pages) is quite a simple task. But the problem here is that apparently members could customize their journalspaces which means that unless the HTML output was standardized, it would be hell to try to get some sense out of each page.

--
This is not your signature.
Re:google cache by jafiwam · 2009-01-02 06:40 · Score: 1

Quick, someone go post this idea as a new revenue stream to Google "muse" (a few articles down).
An 800 number and Google takes your credit card to send you a copy of the Google spider results (what would be in Google Cache) in a nice zip file.
Re:Google cache by Anonymous Coward · 2009-01-02 08:30 · Score: 0

Unfortunately, all they had in the robots.txt was a "piss off" message to all the free web archivers. Genius.
Re:Google cache by Dogtanian · 2009-01-02 09:49 · Score: 1

They could recover much of the data from the Google cache.
56,200 results, the first few at least seem to be intact and useful.

I'd grab them while I could if I was Journalspace.

--
"Slashdot - News and Chat Sites Deviant". (Click "homepage" link above for details).
Re:Google cache by paimin · 2009-01-02 11:49 · Score: 1

Well, here's a business plan:
1. Download all the data previously hosted by journalspace.com from the Google cache.
2. Wait for the Google cache to expire.
3. ???
4. Profit!
The question being, does #3 begin with grabbing a % of journalspace.com's previous users, or selling the data back to journalspace.com? There's also the little problem of getting their user account data back. Hmm...the idiocy of this situation boggles the mind.

--
Facebook is the new AOL
Re:Google cache by Matheus · 2009-01-02 18:42 · Score: 1

or ARCHIVE.ORG!
Their entire business may be sitting on archive's servers..

Uggg by kenp2002 · 2009-01-02 06:18 · Score: 1

Jebus Punting Mary Jane Rice Almighty!!

What fucking good is disk mirroring, raid 5, 6, or raid 5 gazillion when the fucking building burns down?! Hey the raid control failed and wrote Calvin and Hobbes quotes all across the array! Once again the apple of common sense has fell so far from the tree that we need to explain this kind of stuff. We are in a shit load of trouble.

"The apple of common sense has fallen so far from the tree that we now have to, so to speak, explain what an apple is..."

--
-=[ Who Is John Galt? ]=-

Amateur level by gweihir · 2009-01-02 06:18 · Score: 1

Sorry to say this, but it is. Having competent engineers (and thus expensive engineers) is a requirement for professional IT operation. Cheap, "just working", operation comes, among others, with this type of risk.

--
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.

Job advertisement by Bromskloss · 2009-01-02 06:21 · Score: 1

Motivated by recent events at journalspace.com, we seek you who up until now have had computer related duties at the blog host, particularly with a responsibility for backups, to let you know that we are not hiring you.

--
Swedish plasma phys. PhD student; MSc EE; knows maths, programming, electronics; finance interest; seeks opportunities

Re:Job advertisement by troll8901 · 2009-01-02 09:13 · Score: 1

... to let you know that we ARE hiring you to handle almost 100TB's worth of data.
(I kid, I kid!)

Double Duh! by Roger+W+Moore · 2009-01-02 06:22 · Score: 4, Interesting

Since they apparently used OSX Server this is particularly bad. All they needed was a large enough USB attached disk and then to turn on Time Machine. Might not be the best solution for their needs but it is hard to imagine one which requires less effort.

Re:Double Duh! by azav · 2009-01-02 06:38 · Score: 3, Insightful

Or attach a 4 TB Drobo to it and then use Time Machine.
Then make a backup and test the restore.
Their admin is criminally incompetent.

--
- Zav - Imagine a Beowulf cluster of insensitive clods...
Re:Double Duh! by SaDan · 2009-01-02 07:00 · Score: 1

Absolutely correct on the incompetent part. Wow, that's just ignorant.
Re:Double Duh! by CarpetShark · 2009-01-02 07:11 · Score: 2, Informative

All they needed was a large enough USB attached disk
Correction: all they needed was a large enough, functional, external disk.
Finding functional external drive products isn't so easy, I've discovered.
Re:Double Duh! by Anonymous Coward · 2009-01-02 07:13 · Score: 2, Informative

Not saying OS X is not pretty good on the desktop/laptop, just that for your servers you should use Linux or possibly Solaris or BSD, but not OS X or Windows.
So, you say they should use BSD (among other options) yet say they shouldn't use BSD or windows?
I do agree with your windows point, but as to your confusing BSD comment, i'd have to say you were right the first time and wrong the second.
And since i know this joke will go right over your head, heres a tip:
OS X is BSD
Re:Double Duh! by Poltras · 2009-01-02 07:16 · Score: 1

Actually, OSX is UNIX (BSD) under the hood. There are different tools (launchd, domain tools, net tools), but many well-known are there (samba, cups, PAM, ldap, etc).

People are afraid of it as a server option because it has a pretty interface, but if you ssh on one for more than 5 minutes you'd see it's pretty much unix-solid for server solutions. And if you like pretty interfaces, OSX Server is basically the same OSX with GUI tools added that calls/config the cli tools behind. It's the same tools, easier to manage/maintain.

Personally, between a Linux and an OSX server to admin I just flip a coin if there is no price difference.

--
Of Code And Men
Re:Double Duh! by gd2shoe · 2009-01-02 07:21 · Score: 2, Insightful

OS X is not BSD in the same way that Ubuntu is not Debian (and to a much greater extent, might I add). Mull over that one for a while.

--
I won't join Slashcott. OTOH, If Beta goes live, I just won't be back until it's fixed. Sorry Dice.
Re:Double Duh! by MarkRose · 2009-01-02 07:22 · Score: 4, Informative

Not quite. Backing up a live database can be a bit tricky. By the time you finish copying part of the database, the first bit can change again. So you have to create a snapshot of some kind. And that has to be supported in the database setup (at the application or server level) in order for the backup to be in a consistent state. And you don't want your backup process to degrade site performance, either. So a simple file copy is totally inadequate.
A common solution is replication. Backup is then performed by creating a replication point on the slave database machine then taking a snapshot and copying that while while master database machine continues serving at full speed. Replication can then catch up when the backup is complete. Another advantage to having replication is duplication on the machine level -- if the master fails, go live to the slave with minimal to no downtime. Set both machines up in a master-master configuration and you can swap back and forth as needed, allowing live maintenance and backup with no performance degredation.

--
Be relentless!
Re:Double Duh! by bev_tech_rob · 2009-01-02 07:29 · Score: 1

If nothing else (assuming you don't have USB fobs, burnable DVDs, and external drives), at least copy the DB to another computer on the network as a backup that can be retrieved quickly. Only problem with that setup is if the building burns down or a disgruntled employee knows where the backup is kept and nukes both (as looks like this case may have been, possibly)....

--
You're messin' with my Zen Thing, man.....
Re:Double Duh! by CrazedWalrus · 2009-01-02 07:34 · Score: 0

A common solution is replication.
Which misses the point of this article: Mirroring is not a backup solution. Replication is essentially mirroring, but via the database instead of on the disk/controller level. If someone issues a "delete from important_table" on your database, it'll be replicated down to the slaves. Replication solves the problem of availability -- not the problem of data backups.
Databases are all about consistency, and your concerns about snapshots are unwarranted if your app is correctly using transactions. The backup process will not see partial transactions -- only complete ones. The in-flight changes will be picked up in the next snapshot. Every database worth its salt has a way of dumping internally-consistent (committed) data to a file for later restoration.
Re:Double Duh! by Anonymous Coward · 2009-01-02 07:38 · Score: 0

Since they apparently used OSX Server this is particularly bad. All they needed was a large enough USB attached disk and then to turn on Time Machine.
Unless the loss was the result of a deliberate attack -- if the attacker had been as competent as the journalspace folks were incompetent, the attacker would've cleaned out any Time Machine drives, too.
Re:Double Duh! by Anonymous Coward · 2009-01-02 07:57 · Score: 0

Your suggestion of them using Time Machine is of course only relevant if they were using Leopard, which there is no indication they were.
Re:Double Duh! by mksql · 2009-01-02 08:08 · Score: 1

Exactly how does the choice of OS make a difference in this case?

--
I should have been a Geologist.
Re:Double Duh! by MBCook · 2009-01-02 08:09 · Score: 4, Informative

*BZZZZT*
The GP was 100% correct. If you had kept reading, you'd see that the suggestion was to use replication so you can lock the DB into a consistent state while backing up. When the backup is done, the box starts replicating again. If you didn't have the backup box, you'd have to lock the production database while your backup was going on.
He was suggesting replication purely as a way to avoid having to pause the application during backup, not as the backup it's self.

--
Comment forecast: Bits of genius surrounded by a sea of mediocrity.
Re:Double Duh! by Slashdot+Parent · 2009-01-02 08:09 · Score: 1

Personally, between a Linux and an OSX server to admin I just flip a coin if there is no price difference.
I dunno. A buddy of mine had 3 kernel panics this year under OSX, and I have had zero under Linux. I'm convinced.

--
They don't grade fathers, but if your daughter's a stripper, you fucked up. --Chris Rock
Re:Double Duh! by mksql · 2009-01-02 08:12 · Score: 2, Informative

> Backing up a live database can be a bit tricky.
Seriously? If your database of choice is a chore to backup while live, you need to rethink your choice.
Full or incremental backups should be a trivial operation, with support for intra-backup change capture only a little more effort (log shipping, replication, etc.)
Of all the reasons to lose data, "Backups are hard!" should not be in the list.

--
I should have been a Geologist.
Re:Double Duh! by mabhatter654 · 2009-01-02 08:16 · Score: 2, Informative

Except if somebody issued a drop table... then the repliants get dropped to... nice faithful mirroring!!! Been there, done that.
A good solution is to use mirroring like this, and then take the replicant offline to do real, full backups without taking down the production box. Then you have a live copy if drives or processors go bad to bring up immediately, and a backup tape to cover boo boos like this one. I believe that's what parent is getting at.
Re:Double Duh! by Anonymous Coward · 2009-01-02 08:24 · Score: 0

Losing 1 hour of database changes compared to losing the whole thing? I'll take losing 1 hour of changes that happened when the backup was running.
Re:Double Duh! by gEvil+(beta) · 2009-01-02 08:26 · Score: 1

Try reading a little bit further before responding, Sparky. The GP didn't miss the point of the article. YOU missed the point of what he was saying, despite it all being written in plain English.

--
This guy's the limit!
Re:Double Duh! by jadavis · 2009-01-02 08:28 · Score: 1

Replication is essentially mirroring, but via the database instead of on the disk/controller level.
"Replication" simply means that you keep two copies of the same data somewhere, somehow. There are many different ways to do this, and there are many reasons you might want to do this.
That's why replication is not a "checkbox" kind of feature. It's a set of many features, and to choose the right combination (that is, to make the right trade-offs), you need to look at the system as a whole.
These people were doing one kind of replication -- mirroring at the physical level. That did not guard them against overwrites at all (whether malicious, programmer error, or administrator error).

--
Social scientists are inspired by theories; scientists are humbled by facts.
Re:Double Duh! by jadavis · 2009-01-02 08:35 · Score: 1

Just the mere fact that you need to be aware that you're backing up a specific RDBMS makes it more complicated than backing up your photo collection.
It's not tricky as some problems, but if you don't read the manual, you're probably not going to get it right. And most people don't read the manual, so I tell them it's tricky so that they are aware that they might not be safe. If they care, then they usually would rather pay a high hourly rate than read the manual.

--
Social scientists are inspired by theories; scientists are humbled by facts.
Re:Double Duh! by AmberBlackCat · 2009-01-02 08:37 · Score: 1

They could have just shut the site down long enough to do a backup every now and then. Or they could have done the USB thing, or offsite backups, or whatever else. The point is, any solution any of us thinks of is better than what they did. It's like we're playing a game of finding better solutions and Journalspace did the Konami code for us.
Re:Double Duh! by mortonda · 2009-01-02 08:54 · Score: 1

A common solution is replication.
Which misses the point of this article: Mirroring is not a backup solution.
No, you missed the point - replication is a solution to give you a point where you can make a snapshot, which can then be backed up safely to get a valid point in time backup. It's part of the overall chain.
Re:Double Duh! by sinclair44 · 2009-01-02 09:05 · Score: 1

A friend of mine summed this up pretty well when he said, "OS X was BSD once in the same way that Orcs were Elves once." Debian and Ubuntu are very very similar, where OS X is nowhere close to what you would probably think of as BSD.

--
Omnes stulti sunt.
Re:Double Duh! by moggie_xev · 2009-01-02 09:06 · Score: 1

Three kernel panics in 2 days is pretty bad...
I have seen more linux kernel panics than OSX ones however we have more of those than the macs.
Re:Double Duh! by MarkRose · 2009-01-02 09:18 · Score: 1

Mirroring IS a backup solution, but not a complete one. The whole point to having the replication is to have a live backup, should a machine failure occur, as well as making backups painless and easy to do -- meaning a backup won't seriously degrade performance (copying large amounts of data will kill database I/O performance) because the backup is happening on the slave machine. The backups made on the second machine can be stored on it, offsite, or on any appropriate media.
Snapshots with a replication point set are important. Why? For synchronizing between servers. In the case something catastrophic happened, such as a "delete * from important_table", a restore from backup will be necessary. Unless a snapshot is taken with a replication point, the individual servers have no way of knowing their restored copies are synchronized. Without the replication point saved, a new one will be set and an entire database copy from master to slave may have to happen before the slave begins committing any updates. The reason snapshots should be used is because the journal containing the snapshot point could easily be played through and over-written by the time all the data has been copied to backup.

--
Be relentless!
Re:Double Duh! by MarkRose · 2009-01-02 09:28 · Score: 1

Why take the replicant offline? Just create a snapshot of the disk and copy the files from that. The replicant will continue to replicate all the updates as fast as it can while you copy off the snapshot. If something should happen to the primary box while you're making the backup, you'll be better off.

--
Be relentless!
Re:Double Duh! by Fulcrum+of+Evil · 2009-01-02 09:28 · Score: 1

Backing up a live database can be a bit tricky
That's why most RDBMSes have some level of support for generating live snapshots and keeping them stable for a while. Get a DB backup at midnight and replicate rewrite logs somewhere else and you'll lose very little data when someone thermites your server.

--
"We returned the General to El Salvador, or maybe Guatemala, it's difficult to tell from 10,000 feet"
Re:Double Duh! by Poltras · 2009-01-02 09:48 · Score: 1

I dunno. A buddy of mine had 3 kernel panics this year under OSX, and I have had zero under Linux. I'm convinced.
Oh I've seen my share of kernel panics in both OSes. And I've seen my shares of 0 panics in both too. You just have to be conscious about what you put in kernel space... the only time I had panics in either OSes is because I had installed sh!t or weird drivers from bizarre vendors. The kind of driver you shouldn't let eat after midnight and shouldn't get wet either.

Anecdotical facts are no proof of anything.

--
Of Code And Men
Re:Double Duh! by penguinstorm · 2009-01-02 10:02 · Score: 2, Insightful

BSD is no longer BSD either. You need to pick your flavour, whichever one suits your poison.

--
Skot Nelson music is my saviour / i was maimed by rock and roll
Re:Double Duh! by Feyr · 2009-01-02 10:05 · Score: 2, Interesting

did they ever fix their multithreading issues, where the performance went to shit as soon as you got more than 4 threads going at once?
THAT is what puts me off os x server, not teh pretties
Re:Double Duh! by Murpster · 2009-01-02 10:13 · Score: 2, Funny

Their admin is criminally incompetent.
Isn't that the same as saying "they had a Mac admin"?
Re:Double Duh! by Anonymous Coward · 2009-01-02 10:18 · Score: 0

Perhaps on a Macintosh, but it's a piece of cake on Windows. All you do is create a VSS (Volume Shadow Service) snapshot of the volume and then make the backup from that. (Unfortunately, VSS shapshot creation is command line utility only but any backup tool calls the VSS API to do it.) VSS snapshots are copy-on-write managed by the OS (Software VSS provider), or are provided by the underlying SAN (who provide the 'hardware VSS provider' so Windows knows how to make a snapshot.
Programs, such as SQL Server, register "VSS writers" which are notified when a VSS snapshot is being taken. These writers then make sure the on-disk structures are consistent before the snapshot is taken (everything is flushed to disk). The snapshot is then taken (which takes milliseconds - they're normally copy-on-write or something) and then operations continue.
Of course, you can always do it the easy way with SQL Server - "BACKUP DATABASE northwind TO DISK = 'd:\backups\northwind\nwind.bak'". Tada! Fully consistent backup taken online with zero downtime. Piece of cake. (You even have SQL Server maintience plans that keep the previous month's backups in a directory - automatically deleting old ones - that you can then copy to tape/DVD/whatever. Heck, we write transaction logs to another server every two hours and put them off to tape nightly.)
Re:Double Duh! by Pfhor · 2009-01-02 10:26 · Score: 1

Actually, since they were using an OS X Server, this is more telling of their skills than just how pathetic their backup plan was. I admin OS X boxes, but the last thing I would imagine using them for would be a HA data store for a website. The builds of php and mysql are not kept up to date, and really the only cases I have seen an OS X box deployed for webhosting was by web coders with no experience running an actual server. They liked they could get the box up and running with multiple IP addresses quickly, and would then just go about their work in apache. Not thinking about backup, not thinking about disaster recovery.
And while the hardware may look tasty and easy to buy (hey, one click ordering at store.apple.com, unlimited user licenses!!!), you can get a lot more bang for your buck with a dell or hp linux box.
Someone who had the ability to setup a Linux box might have had a few more grains of intellect to think to atleast run rsync with cron to backup the data store offsite, if even just once a month.
I mean, how did these guys plan to do a staged update, incase an apple software update broke their apache install? Break the raid, install on drive 1, and if it works, mirror over drive 2, therefore giving them no snapshot in time to roll back to?
Re:Double Duh! by Khyber · 2009-01-02 10:28 · Score: 1

Most OSX Kernel panics, just like Linux and now Vista, are usually caused by faulty RAM.
Default procedure for any Kernel panic at an Apple Repair Depot is to pull all system memory and replace with known good - 95% of the time this is the issue.
At least, that's how it was when I worked Flex several years ago.

--
Still waiting on Serviscope_minor to wake up to fucking reality and realize that Jessica Price isn't going to fuck him.
Re:Double Duh! by speedingant · 2009-01-02 10:45 · Score: 1

Might not be best solution at all for a database. Depending on the size of it, it would be quite a pain for time machine backing up 50+Gbs of a "modified file" every hour.
Re:Double Duh! by Anonymous Coward · 2009-01-02 10:54 · Score: 0

the suggestion was to use replication so you can lock the DB into a consistent state while backing up. When the backup is done, the box starts replicating again. If you didn't have the backup box, you'd have to lock the production database while your backup was going on.
What database is this? I haven't seen a database that needs to be locked for backup in years.
Any real database solved the backup problem a long time ago.
Backing up oracle or MS sql server is quick, easy and does not impede normal operations. You get a solid, ACID-compliant backup. You can do complete backups, incremental backups, differential backups, transaction log backups, etc. The database remains online and working during the backup.
I don't know how backups work with MySql.
The article doesn't say what database they were using, but I'm guessing if they were too cheap to buy backup media, they probably used a free database.
Re:Double Duh! by Jim4Prez · 2009-01-02 11:31 · Score: 1

Seriously. Good point. This was a small site with very few journalspace posters. Heck, you could probably backup and compress the entire DB to fit on a typical thumb drive.

Funny how they tried to shoot a point about the OS and said it could have been a "catastrophic failure". I didn't see mention of the DB system. Does anyone know what DB server was used? I still don't think that would be a point of failure because Oracle, SQL Server, MySQL and PostgreSQL all have VERY simple and reliable ways to backup.

This was simple a situation of journalspace hiring a "dba" that knew crap about being a real DBA. They probably paid some college kid $35K - $40K (with only MS Access experience) to be their "dba". Well, guess what you get!

I am a software dev and if I mentioned just using mirrored drives as our _only_ database backups, I would probably be tarred-and-feathered by our DBA's and for good reason (not that I would do such a stupid thing).
Re:Double Duh! by Darkk · 2009-01-02 12:05 · Score: 1

I find this totally dumb on their thinking mirroring would save their butts. Problem it's database corruption and they didn't catch it in time so an hourly replication would have compound the problem even further.
I always trusted daily and weekly backups on both tapes and disks. This way after the backup is done it's a "snapshot" or "frozen in time" of the data which protects it from viruses or other disasters.
So whenever you see a site that is down for scheduled maintenance it means they are doing it the RIGHT WAY as opposed to other solutions.
Of course, if the site been down for days then something really fubered on their end.
Re:Double Duh! by Anonymous Coward · 2009-01-02 12:07 · Score: 0

Filesystem Snapshots are better (Depending on how many data you write while backing up, but I think that ZFS has removed this limit also.).
No need for any special DB (Or any system, BTW) setup.
Re:Double Duh! by datapharmer · 2009-01-02 13:00 · Score: 1

Um... wtf do you mean by that? If you buy a USB 2.0 drive from best buy, then yes it might fail, but you can buy a drive enclosure with any interface you'd like and put any drive you want in it - even SCSI. Or do what others have said and get another inexpensive computer with raid and do a backup, or dump a tar ball with cron to a desktop or NAS every night. Heck even making a tar of the database to the SAME drive once a week would have made this problem recoverable. This isn't rocket science.

--
Get a web developer
Re:Double Duh! by timeOday · 2009-01-02 13:19 · Score: 1

Yeah, I remember there was somewhat of a flap when it was realized that OSX was a pretty slow server OS. I wonder if this has been remedied?
Re:Double Duh! by Anonymous Coward · 2009-01-02 13:31 · Score: 0

the database will change after the snapshot is created too. the point of a back up is to not lose everything, if you copy a live database and the first part changes before the copying is done, oh well at least you have a back up that would have prevented journalspace's current predicament.
he was talking about using time machine so for this discussion doing things perfect and proper was already out the window.
unless it would cause a crash it's still better than what journal space had happen, heck even if it did cause a crash it would probably have been better.
Re:Double Duh! by tomknight · 2009-01-02 13:39 · Score: 2, Interesting

Here's what I've found being said on the topic:
"I think I can reveal this much: an EX IT guy didn't do something he was supposed to have done. This wasn't discovered until AFTER the disks crashed. So, there were probably other reasons for this guy being an EX, too. Anyway, the crash happened, the mistake was discovered but now too late to fix."
From: http://dorrie.de/F1/viewtopic.php?t=194&start=375&sid=a65b1d9b0fbcc3c3e8df874fc167d495

--
Oh arse
Re:Double Duh! by kimvette · 2009-01-02 13:59 · Score: 1

Mirroring IS a backup solution, but not a complete one.
No, it's not. It's a fault-tolerance solution so you can achieve no downtime arising from disk failures.
I'd go as far as to suggest that an always-attached external drive is not a true backup solution either, because if that box gets rooted and cracker does "rm -rf /" then the game is over; not only is the live environment and data wiped, but the backup is as well.

--
The Christian Right is Neither (Christian nor right). See: Matthew 23, Matthew 25, Ezekiel 16:48-50
Re:Double Duh! by adolf · 2009-01-02 14:17 · Score: 1

Just about any problem is easy, given sufficient funding.
Our own backup solution at work (we don't have much data, and daily is plenty frequent enough) uses 2.5" Seagate IDE drives in (I kid you not) $6.00 Rosewill USB 2.0 external enclosures. Why? Well, it turns out that it works. I tested it, and tested it, and tested it some more.
Not a problem yet. Daily incrementals take a few minutes; weekly bare-metal backups take a few minutes more. At least one complete and recent backup set is always off site, and at least two are on site. Worst case should be that no more than 24 hours of data is lost, unless a backup drive fails too, in which case we're back 48 hours...which is good enough for what we're doing. At least, restores only take a few minutes to complete, and should work on any hardware which is reasonably similar to the ML330 server we're using (which we also keep a spare of).
It wasn't exactly cheap, since I got paid hourly to test, retest, fuck with it, devise problems, try to break it, so on, so forth, but it works brilliantly.
But as far as the boss is concerned, it WAS easy -- all he had to do was approve the plan and write the checks.

--
Kid-proof tablet..
Re:Double Duh! by adolf · 2009-01-02 14:22 · Score: 1

Can't these problems generally solved with snapshots (in FreeBSD and Linux LVM lingo) or shadow copies (in Windows parlance)?

--
Kid-proof tablet..
Re:Double Duh! by Architect_sasyr · 2009-01-02 15:00 · Score: 1

Mmmm no I believe the term I tend to use for the Mac admins I know who refuse to think outside the box is "Windows Admin". The rest of us are just fine with how we do things thanks.

--
Me failed English...
FreeBSD over Linux. If my comments seem odd, this may explain...
Re:Double Duh! by GigaplexNZ · 2009-01-02 15:48 · Score: 1

Creating a snapshot of the disk doesn't really work for two main reasons. One is that it takes so damn long that something is almost guaranteed to go wrong if it is still running. Also, a live database typically runs in memory with changes not yet synchronised to disk.
Re:Double Duh! by CAIMLAS · 2009-01-02 15:57 · Score: 1

Either I misread the GP or you did:
A common solution is replication. Backup is then performed by creating a replication point on the slave database machine then taking a snapshot and copying that while while master database machine continues serving at full speed. Replication can then catch up when the backup is complete.
IE, set a break point, back up everything prior to that point to another medium, presumably a tape or some other long-term chronologically organized medium), and then resume replication. Replication, in this case, provides a decent means through which a database (as opposed to files) can be reliably backed up, as simply "copying the database files" won't cut it.

--
~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers
Re:Double Duh! by Kyril · 2009-01-02 16:21 · Score: 1

Backing up a live database is tricky for you. It is easy as pie for a vaguely adequate database. It will dump the dataspace then the transaction logs, and on reload it will replay the transaction logs up to the last committed checkpoint.
Someone else pointed out that replicas don't protect you from malicious alterations of the master (and someone who can crack the master can crack the replicas too) but I like hot replication for failover purposes (and spreading high-CPU/low-cacheability OLTP loads) and actual external dumps that spool off to tapes that go offsite for protection against data corruption.
Re:Double Duh! by _Hellfire_ · 2009-01-02 17:57 · Score: 1

Backing up a live database can be a bit tricky.
Hmmm. "pg_dump -D" usually does the trick for me.
Or am I missing something?

--
"And then I visited Wikipedia ...and the next 8 hours are a blur..."
Re:Double Duh! by darkonc · 2009-01-02 18:13 · Score: 2, Insightful

"everybody knows that MACs don't need admins!"
<sigh...>

--
Sometimes boldness is in fashion. Sometimes only the brave will be bold.
Re:Double Duh! by darkonc · 2009-01-02 18:25 · Score: 1

an EX IT guy didn't do something he was supposed to have done. This wasn't discovered until AFTER the disks crashed. So, there were probably other reasons for this guy being an EX, too. Anyway, the crash happened, the mistake was discovered but now too late to fix."
... and just how long ago was it that this ex IT guy didn't do what he was supposed to have done? If it was more than 4 weeks (and I'm guessing that it was far more than 4 weeks), then there was someone else asleep at the switch.

--
Sometimes boldness is in fashion. Sometimes only the brave will be bold.
Re:Double Duh! by JoelKatz · 2009-01-02 18:59 · Score: 1

If mirroring is a backup solution, which copy is the backup? Dual-active is not backup, it's fault tolerance. But thanks for playing.
Re:Double Duh! by Macka · 2009-01-02 23:15 · Score: 1

Personally, between a Linux and an OSX server to admin I just flip a coin if there is no price difference
I'm pretty disappointed with OS X under the hood. Apple don't seem to pay as much attention to it as they do the Aqua front end. How for instance are you supposed to be able to tell when you have an I/O disk bottleneck problem on OSX? Neither iostat or vm_stat report %iowait for the CPU, and the pittance of information you get out of iostat on OSX is a joke.
Linux on the other hand shows a very clear picture of what's going on. If there's a problem then iostat -mx 5 will show high cpu %iowait values, and those disks with high % utilization, queue service times, etc.
The ability to easily and quickly do this kind of trouble shooting is bread and butter for daily admin tasks, and OSX comes up well short. None of the DTrace tools provided help here either.
And don't get me started on the state of the man pages. They aren't accurate or up to date. Where for example is the pstat command? It must be part of the BSD tool set, because it's mentioned in some man pages, but doesn't exist on OSX.
OSX is a great desktop OS and I use it a lot. But I would never choose it over Linux for a server.
Re:Double Duh! by MarkRose · 2009-01-03 03:16 · Score: 1

What? Snapshots take at most a second. They're done on the filesystem layer, and updates are written to different blocks. Once you've finished copying the data off the snapshot, delete the snapshot and the old blocks are freed for use again. Databases often have the handy write pattern where most updates are done to the same or contiguous areas, so you usually have many hours/days before the filesystem is forced to overwrite the blocks in the snapshot if you don't delete it in time.
Any database worth its salt will keep everything synchronised to disk at all times. A query won't return unless the changes are written. The only issue can be if the OS or disk itself has buffered the writes, but transactional databases ensure consistency with an update log on disk (updates are first written to the log and another process commits those updates from the log to the database files).
And yes, in most situations a properly tuned and configured database will return most reads from memory.

--
Be relentless!
Re:Double Duh! by MarkRose · 2009-01-03 03:19 · Score: 1

Semantics. It's a backup on the hardware level, but not the software level. It's like having two pumps at a well. Either can do the job so one is a backup, but someone can still poison the water.

--
Be relentless!
Re:Double Duh! by MarkRose · 2009-01-03 04:14 · Score: 1

That's exactly what I'm talking about (though I didn't know the Windows parlance). The reasons for adding the slave machine are two: first, to have a hot spare should the first one go down, which is handy for doing software updates and other maintenance; and two, to avoid degraded performance during the backup. The second isn't an issue on a lightly loaded server, but on a busy box, it's a choice between keeping the box quick or having a quick backup.

--
Be relentless!
Re:Double Duh! by JoelKatz · 2009-01-03 04:20 · Score: 2, Insightful

"Either can do the job so one is a backup..."
Which one is the backup?
The whole point of a backup is that it is *stable*. Neither copy is stable, so there is no "backup on the hardware level". There are two active systems.
If you cannot restore an accidentally-deleted file from it, it's not a backup.
It is a serious mistake to use the term "backup" in relation to a RAID 0 array. There is only one correct way you can do that, "either disk can serve as a backup for the other, should its media fail".
Either disk can serve as a backup for the other *drive*. However, there is no backup copy of the data. It is *not* a backup solution. There is no backup.
There is, however, fault-tolerance. A media fault can be tolerated. But if the active copy of the data is corrupted, there is no backup.
Re:Double Duh! by MarkRose · 2009-01-03 04:26 · Score: 1

And watch all your busy tables write-block and your site go unresponsive while pg_dump does its thing?
And wait for hours and hours while pg_restore executes millions of insert queries and rebuilds indices?
Sure, pg_dump (or whatever another database might use) works for tiny databases where a few seconds of latency is tolerable once a day. It's not tolerable when you're down for hours.
Keeping the database responsive when backing up a live database is what makes it tricky.

--
Be relentless!
Re:Double Duh! by MarkRose · 2009-01-03 04:44 · Score: 1

And as long as the database can do that in a reasonable amount of time without affecting the performance of on-going database activity, that's great. But setting up a system capable of that is more complex than doing a straight file-system copy, hence "a bit tricky" in comparison, though not necessarily difficult.

--
Be relentless!
Re:Double Duh! by IdleTime · 2009-01-03 05:10 · Score: 1

No, it is not.
Replication is used to move data in a simple way between two running databases that needs access to SOME of the data.
Replication is extremely bad for backups, mostly because it is a very slow process and not meant to be used to duplicate the whole database. In such cases, you use a cluster system with multiple data points.
I work with one of the large database vendors and have for many years. Backup is not an issue, as we, at least, deliver some very good tools so you can manage backups and never miss more than a second or so of transactions, if any at all.

But when that is said, I have worked with so many companies where the DBA is running their system, resignation filled out and signed, the only thing missing, is the date as they are bound to fail and lose their data because of incompetence. No matter how many and how good the tools are, some DBA's are set on screwing up everything for some reason.

--
If you mod me down, I *will* introduce you to my sister!
Re:Double Duh! by MarkRose · 2009-01-03 05:13 · Score: 1

I think we're saying the same thing, just in different words. There are two things to backup: physical and temporal. First is the ability to keep everything working. That requires backup hardware and a working copy of the data. Second is the ability to access the data as it existed at a previous point in time. That requires a stable, detached copy of the data, usually stored on separate media for space and reliablity concerns.
I agree that not having a detached, stable copy of the *data* is not having a backup of the *data*. So having a second machine is definitely not a backup of the *data*. A second machine is a backup of *hardware* only. But having a backup of just the *data* is also not a *complete* backup. We've all heard stories of old media that can't be accessed because the hardware doesn't work or can't be found. A *data* backup is useless if you can't access it.
So a backup of hardware or a backup of data is indeed a backup, by definition. But neither is a *complete* backup solution, which includes both physical and temporal backups. Journalspace.com had some of the former and none of the latter.

--
Be relentless!
Re:Double Duh! by turbidostato · 2009-01-03 06:55 · Score: 1

"Can't these problems generally solved with snapshots (in FreeBSD and Linux LVM lingo) or shadow copies (in Windows parlance)?"
Short answer: no.
Long answer: filesystem level "things" don't know about database "things" so there may be "things" at the database level that lead to a non-functional database when coped with at the filesystem level.
This assumption is valid not only when talking about filesystems and databases but on every two unrelated items. Learn this and you probably know half you need to be a proper IT professional.
Re:Double Duh! by _Hellfire_ · 2009-01-03 10:41 · Score: 1

Ah! Thanks for the clarification.

--
"And then I visited Wikipedia ...and the next 8 hours are a blur..."
Re:Double Duh! by adolf · 2009-01-03 11:57 · Score: 1

In that case, please allow me to submit that the database is broken.
Snapshots are a frozen moment in time. Failing, under any circumstances, to be able to recover a working database from a snapshot, is the same as failing to recover a working database after hardware or power failure or an obscure kernel bug crashes the system.
Things should not be so fragile.

--
Kid-proof tablet..
Re:Double Duh! by MarkRose · 2009-01-03 12:09 · Score: 1

Yeah, I'm speaking from experience with MySQL's version of the same thing. It took almost two hours for the mysqldump to complete :/

--
Be relentless!
Re:Double Duh! by _Hellfire_ · 2009-01-03 12:22 · Score: 1

So would the solution then be to have a database replicate live to another machine, then take your daily dumps off the replicated machine, so that the forward facing database is unaffected?
Where I used to work we had a replicated database on a different box for most of the live systems for reporting. The reporting would kick off at a regular time and grind the replicated machine almost to a halt, but the live system would experience no slowdown.

--
"And then I visited Wikipedia ...and the next 8 hours are a blur..."
Re:Double Duh! by MarkRose · 2009-01-03 13:16 · Score: 1

That's exactly the solution.
And as an added bonus, if you configure the two machines to replicate both ways, you can switch between which one is the primary at any time. This allows you to take either machine offline to upgrade software or hardware or move the machine, etc. Some highend database software may have this functionality already. And there's the MySQL Multi-Master Replication Manager at http://code.google.com/p/mysql-master-master/ for MySQL. I'm not personally aware of anything for Postgres, but it may exist (and I imagine MMM could be easily re-written for it).

--
Be relentless!
Re:Double Duh! by Harik · 2009-01-03 13:25 · Score: 1

Short answer: no.
Long answer: filesystem level "things" don't know about database "things" so there may be "things" at the database level that lead to a non-functional database when coped with at the filesystem level.
actually yes, you can do just that - tell the database that the filesystem level 'thing' needs to get a coherent snapshot of the database level 'thing'
http://en.wikipedia.org/wiki/Quiesce
Two very common ways to backup huge datasets:
1) Cluster mode, pull one server out of the cluster, quiesce it, and back it up (tenerally with DB aware tools). Once the backup is complete, have it reconnect to the cluster and it will resynchronize.
2) Mirrored NAS. Quiesce the database, break the mirror, resume database on one half of the NAS, backup the disk image on the other half. When done, resynchronize NAS for the next day.

This assumption is valid not only when talking about filesystems and databases but on every two unrelated items. Learn this and you probably know half you need to be a proper IT professional.
And the other half is knowing when you CAN do things like this, and how.
Re:Double Duh! by Harik · 2009-01-03 13:27 · Score: 1

It will "probably" work. Which means no, it will NOT work when your database has shit itself. Murphy's law dictates that you do not want to rely on the database unclean shutdown recovery as part of your backup solution.
Quiesce, snapshot, resume does work however.
Re:Double Duh! by Directrix1 · 2009-01-03 13:33 · Score: 1

I thought most modern RDBMSes version all data internally so when you run a backup it just sets a note about the current version and knows not to vacuum up old versions until after the backup operation is done. This allows it to operate completely normally with regards to any transactions which occur during this time.

--
Occam's razor is the blind faith in the natural selection of least resistance and in universal oversimplification. -- EF
Re:Double Duh! by turbidostato · 2009-01-04 04:32 · Score: 1

"actually yes, you can do just that - tell the database that the filesystem level 'thing' needs to get a coherent snapshot of the database level 'thing'"
Thank you for actually reinforce my point: in order to do "a database thing" (a backup) you "tell the database..." (your own words).
Re:Double Duh! by turbidostato · 2009-01-04 04:46 · Score: 1

"In that case, please allow me to submit that the database is broken."
No, it isn't.
"Snapshots are a frozen moment in time."
Sorry but no, they aren't. Snapshots are *filesystems* frozen in a moment of time. Nothing more, nothing less. They explicitly are not RAM frozen in time and they are explicitly not done at a database-safe moment of time.
"Failing, under any circumstances, to be able to recover a working database from a snapshot, is the same as failing to recover a working database after hardware or power failure or an obscure kernel bug crashes the system."
That's right, so what? Nobody is forcing you to have database snapshots if not needed and there's no way that your database backlog wil be sincronized to your snapshot policies and procedures unless a sysadmin has worked for them to be that way.
"Things should not be so fragile."
They are, and you better work under reality checks, not based on what they "should" be.
Of course, you can have whole *systems* to be such and such reliable but in order to achieve that you need to "tell" each subsystem to do their own homework: the database-thing to take care of their own "things" (i.e.: transaction-based checkpoints); the RAM-thing to take care of their own "things" (like ECC parity); the persistance-thing to take care of their own "things" (like LVM, RAID, system-level backups, etc.)...
Once you learn which "things" are part of what "things", you can design and deploy a reliable system; failing that you will have people actually thinking that RAID is a form of backup; that filesystem-level copies can allow you to forget about how to properly secure your database, etc.
Re:Double Duh! by adolf · 2009-01-04 07:29 · Score: 1

It is broken.
Things really should not be so fragile.
Reliable systems are built to withstand problems. A database which crashes hard because of a kernel bug or a hardware failure, or (gasp!) an inopportune snapshot is a broken database.
That I accept that most or all databases are broken in this fashion, and that there are good and simple methods to work around this problem (for the purpose of backing them up) does not somehow make them less broken.

--
Kid-proof tablet..
Re:Double Duh! by CarpetShark · 2009-01-05 20:24 · Score: 1

Personally I use faubackup+ssh (with keys) for both network and local backups, and it works great. I'm not advocating the lack of backups based on the problem of finding a decent external drive; just saying that finding a decent external drive isn't that easy, in itself.
The Western Digital MyBook I bought packed up within three days. After buying a replacement PSU, it worked for another two days, then gave up the ghost completely. I hear many USB enclosures are similar. The drive itself works fine, and is now inside my desktop, but the enclosures seem to be pretty poor, generally speaking. I do know of one Mac guy who loves his wifi-enabled NAS/drive/thing.
Re:Double Duh! by turbidostato · 2009-01-06 06:38 · Score: 1

"It is broken."
It is not. You can repeat it one thousand times and it won't become true. Some "things" depending on other "things" means not "broken by design", it is called "coperativeness".
"Reliable systems are built to withstand problems."
Yes. But a database "system" is not a "system" (the system is the computer and network hardware, the operative system, the user space tools, the database manager, the data *and* the software accessing such data). Unless, of course, you want to consider the database a "system" by itself in which case, it should be built to withstand *database* problems, not any other kind of problems. Both the whole system and the database system can be built to withstand their scope-related problems so your point is... you don't have a point.
"A database which crashes hard because of a kernel bug or a hardware failure, or (gasp!) an inopportune snapshot is a broken database."
It is a broken database no more than a computer is broken if it stops working whenever electricity stops passing through: electric-dependability is an out-of-scope problem regarding stand-alone hardware systems; that's why computers do work *provided* electricity is there (note you still can build *whole* systems that can withstand increasingly severe restrictions regarding electric disposibility); operative-system stability is an out-of-scope problem regarding *any* user space application, not only RDBMs: if the underlying OS fails you can't and shouldn't expect *any* user-space executable to be reliable; you still can build *whole* systems under given severity constraints and -as I already said, one of you most basic considerations will be knowing what you can and should expect from any given subsystem so you don't ask "A" to accomplish tasks under "B's" responsibilities.
"I accept that most or all databases are broken in this fashion"
Taking into account the ammount of talent and money devoted to the database management bussiness, I'll ask you to consider the slim chance that it might be you, not them, the one in the wrong position.
Re:Double Duh! by garaged · 2009-01-08 07:22 · Score: 1

The perfect example of "similarity" would be kernel interchangeability, I don't think is totally transparent between debian-ubuntu, but I would surely guess is mostly impossible with freebsd-macosx

--
I'm positive, don't belive me look at my karma
Re:Double Duh! by garaged · 2009-01-08 07:36 · Score: 1

have you make the same kind of tasks ??
I haven't seen a BSOD on XP for a while, but can surely see one if I do similar tasks as in linux (testing LAMP, firewall, mail, snort, VPN, etc).

--
I'm positive, don't belive me look at my karma
Re:Double Duh! by adolf · 2009-01-09 06:39 · Score: 1

Interesting points.
Your argument is successful in that I'm actually beginning to believe you.
Thank you for your continued and instructive rebuttals.
That said, I welcome the day when database systems are more resilient to failure. Perhaps the rise of SSDs, with their increasingly fast access times, will allow future database systems to eliminate some of the performance considerations which are currently in place to deal with the slowness of disks, and enable transaction-level safety from sudden and unforeseen crashes.
At least, I can hope.

--
Kid-proof tablet..
Re:Double Duh! by rundgren · 2009-01-12 23:52 · Score: 1

That would have been a good point, except that you're wrong: For the purpose of a server OS (and probably most other purposes) OS X is so far removed from it's BSD ancestry that your point is not valid at all.

Google cache by crow · 2009-01-02 06:23 · Score: 1

They could recover much of the data from the Google cache. I could even see Google providing a recovery tool for use in situations like this, possibly charging some fee for it. There also ought to be something that they could put in the robots.txt file to tell robots to use the previous scan instead of what's there now until they recover it.

Re:The rules of backups (FAIL) by Anonymous Coward · 2009-01-02 06:23 · Score: 0

1. Do not talk about restores.
2. DO NOT TALK ABOUT RESTORES.

Good grief. by zorkmid · 2009-01-02 06:25 · Score: 1

To think I was actually so paranoid that I missed our Datasafe offsite pickup thursday that I drove in, swapped the tapes out and took them home.

Earthquakes, building fires, meteor strikes, Super Volcano blowing. If you don't have offsite backup and redundant offsite servers Murphy's and Finagle's laws are going to spank you. Hard.

A lesson for geeks and suits alike by E-Lad · 2009-01-02 06:29 · Score: 1

It's tough to not by cynical after reading about something like this happening to an established company. To many of us, this is being careless about data in the 1st degree.

But as one other poster pointed out, let this be a cautionary tale - but not only to those who fund and lead IT departments (aka, the "Suits") but for system administrators as well.

I don't claim at all to have any deep knowledge about this particular Journalspace setup, but I can get a decent idea just by gleaning tidbits from TFA and elsewhere. In the end, I have to conclude that Journalspace is/was a company that had a great idea or product, the product being an application, but the people whose full-time job was to maintain the app had no idea how to design the underlying infrastructure to properly run it.

This is a situation I find more and more these days since the advent of Web 2.0. The scenario goes something like this: A company is formed around an application, an assemblage of Java/PHP/Ruby/whatever code that runs a neat service or online tool. The company brings in people who know the idea/code/language well and improve it. *Running* the application, however, on the systems infrastructure, is not their strong point. They're coders. They know Java classes and whatnot inside and out, but there's a reason why these people are full-time app coders and not systems/storage administrators.

One of two scenarios then happen. Sometimes both occur. First, one of the app coder employees who knows just enough about running a OS or designing a systems infrastructure is co-opted by the group to do this, to run their app. This person can get a lot of things right, but the devil is in the details and misconceptions or misunderstandings about certain things lead to stuff like RAID mirroring being considered as a backup mechanism or choosing to run your company on OSX Server because its GUI is familiar or whatever the reason may be.

The other scenario is that the group of app coders try to hire in people with the right kinds of experience to setup a infrastructure, but because they're interviewing people for a job they don't know how to properly quantify, they end up hiring under-experienced admins who are good at feeding them BS.

In the end, you get a turd of a infrastructure that works most of the time, but always has at least one hidden domino threatening to topple. When that domino eventually does, Journalspace happens. Single database servers? No real backups? That's a some basic stuff right there. It makes me wonder about more nuanced things that help keep a infrastructure straight such as security policies (network, host, physical), change control, funding, and so on.

Personal backups of online data by RevWaldo · 2009-01-02 06:29 · Score: 4, Insightful

This is why users should be able to easily back up their own data for any online service. If a service entrusted with your data provides no straightforward way to drop a copy of it onto your own hard drive, don't trust it. I'd go as far to say that any service that doesn't strongly recommend you keep your own backups shouldn't be trusted.

Do the big kahunas of the "Web 2.0" world give users that option? Gmail, Myspace, Facebook, Twitter etcetera ad nauseam?

--
Prisencolinensinainciusol. Ol Rait!

Re:Personal backups of online data by Anonymous Coward · 2009-01-02 07:04 · Score: 0

Web 2.0: for amateurs, by amateurs
Re:Personal backups of online data by AndrewRUK · 2009-01-02 07:20 · Score: 1

In the case of Gmail, yes. Simply go in to your account settings and ensure that IMAP is enabled (or POP, if you prefer it) and then point your favourite mail client at imap.googlemail.com.
Re:Personal backups of online data by RealTime · 2009-01-02 07:40 · Score: 1

Google, at least, supports user data portability, and not just with. social networking.

GMail lets you import and export your contacts in a variety of formats and access your email (for back-up or whatever) via IMAP and POP.

Picasa syncs the web albums to your local machine (and runs under Linux, thanks to Google's open source contributions to Wine).

Blogger lets you push blog content elsewhere.

There are probably more (like stuff you can access via GData APIs), but these are just the ones I could think of off the top of my head.

--
Yesterday it worked; today it is not working; Windows is like that...
Re:Personal backups of online data by dcollins · 2009-01-02 08:37 · Score: 1

Don't think MySpace has a built-in feature, but there's a 3rd-party web app for it here:
http://makedatamakesense.com/myspace/export/

--
We know where leadership by an anti-intellectual "strongman" who scapegoats minorities and likes boisterous rallies goes
Re:Personal backups of online data by Anonymous Coward · 2009-01-02 09:18 · Score: 0

Gmail certainly does. You can download your entire mail collection via POP3 easily.
Re:Personal backups of online data by Anonymous Coward · 2009-01-02 09:53 · Score: 0

Facebook doesn't even store your chat sessions. And they have so many third party apps, they couldn't possibly offer to backup anything but your mail, pokes, etc. Easier for me to just PDF a mail thread or Wall I want to keep.
With Gmail you can mirror all messages via a POP download, without disrupting the data online. But that shouldn't preserve your keyword labels, which is why I enjoy Gmail so much.
Hmm, time I go do that.
Re:Personal backups of online data by Anonymous Coward · 2009-01-02 10:01 · Score: 0

Dunno... i do know tho that duplicity+gmail makes for nice online backup space ;-)
Re:Personal backups of online data by Ironica · 2009-01-02 10:18 · Score: 1

I'd go as far to say that any service that doesn't strongly recommend you keep your own backups shouldn't be trusted.
From the geek perspective, you're right of course... but Joe the Enduser won't usually see it that way. "Well, if they don't think they can hang onto my data, I should probably put it elsewhere!"
Probably the best idea for a site that lives and dies by users' data is to (1) have a great backup policy/procedure in place; (2) make that policy public to the users; (3) seriously consider user feedback on that policy that may point out limitations; and (4) make it easy for users to back up their own data to flat files. An optional (5) would be to write a tool that lets those files be uploaded easily to restore the data after a loss.
It's a good policy even if you never have a disaster to recover from; after all, what if someone's little brother (wow, stream of consciousness time; I JUST NOW got the title of Cory Doctrow's book) gets access to their blog account and (after reading all the private ones) deletes every post? Now it's easy for users to re-upload their own data without even having to contact anyone. (This, in fact, could be the basis for recommending self-backup to end-users; heck, even state that there's a fee for the site to restore data lost due to compromised accounts, and you're golden.)
There are, of course, situations where user backups can't be trusted by the server, such as MMOGs... but for most sites, it's an excellent idea.

--
Don't you wish your girlfriend was a geek like me?
Re:Personal backups of online data by Anonymous Coward · 2009-01-02 10:35 · Score: 0

I highly agree.
Dreamhost.com is a "by the seat of their pants" type of web host sometimes, but they do offer a tool to export a backup once every 30 days.
Re:Personal backups of online data by Intrinsic · 2009-01-02 11:33 · Score: 1

Yep. I run my sites: blogging, image gallery, and forums on my own virtual host and back up the data bases once a month. I dont trust other services to maintain my data, I think too many people dont understand that the backups are not being created for them...
Re:Personal backups of online data by Anonymous Coward · 2009-01-02 13:19 · Score: 0

Livejournal does, and I back up on a monthly basis. Gmail does in that you can store your mail via POP. Twitter I don't know, Myspace I refuse to use, and I don't believe Facebook does.
Re:Personal backups of online data by j_sp_r · 2009-01-02 22:33 · Score: 1

Better tell people to enable POP3, as some clients only download the headers of IMAP messages (and remove them if they are removed from the server)
Re:Personal backups of online data by Anonymous Coward · 2009-01-03 11:33 · Score: 0

Gmail does for my contact list, at least.

Damn, they we just about to change their logo by ds_job · 2009-01-02 06:30 · Score: 1

New Logo

Re:So, what was it worth in dollar terms? by symbolset · 2009-01-02 06:31 · Score: 1

I have no idea about the dollars, but the traffic (and presumably advertising revenues) is about 1/40th of a slashdot. The demise of linux.com at 1/5th of a slashdot is a much bigger deal.

--
Help stamp out iliturcy.

FOSS backup software by Anonymous Coward · 2009-01-02 06:31 · Score: 0

I haven't had to do backups in a long time (it is someone else's job now). I was wondering is Amanda still the best thing you can get for nothing these days?

They misunderstood Linus by Jusii · 2009-01-02 06:33 · Score: 1

(Only wimps use tape backup: _real_ men just upload their important stuff on ftp, and let the rest of the world mirror it ;)"

Oblig. Despir poster by Anonymous Coward · 2009-01-02 06:35 · Score: 0

"It could be that the purpose of your life is only to serve as a warning to others." http://despair.com/mis24x30prin.html

Re:So, what was it worth in dollar terms? by Heddahenrik · 2009-01-02 06:36 · Score: 1

My site Elftown seems to be 66% bigger if you compare http://www.quantcast.com/journalspace.com and http://www.quantcast.com/elftown.com

I don't make much money at all and can't live on it, but I guess they made more money by having really crappy content so that people are more likely to click the ads.

Is there any better ways to make money from a free website except by forcing people to click on links? It is great advertising to be present on my site, but my visitors will simply not click on ads and advertisers don't pay for branding.

This was dumb... (archive.org) by Anonymous Coward · 2009-01-02 06:39 · Score: 0

We're sorry, access to http://journalspace.com has been blocked by the site owner via robots.txt.

You may want to:

* Read more about robots.txt
* See the site's robots.txt file.
* Try the page on the live web: http://journalspace.com
* Search for all pages on the site journalspace.com/
* Try a different page address, at top

See the FAQs for more info and help, or contact us.

The master said... by Anonymous Coward · 2009-01-02 06:40 · Score: 0

http://www.taobackup.com/

Personal Backups are a Must, if Possible by huh69 · 2009-01-02 06:45 · Score: 1

I use VPS servers and run my own site using wordpress for blogging, as well as some other software, and I encountered a situation similar to this with VPSLand (which, by the way, is the worst VPS hosting service I have ever used). I have since changed providers, obviously.

After having already been through several periods of unexplained downtime, and an instance where the VPS was unexpectedly rolled back due to some unexplained problems (you can see where this is going), my VPS seemed to be permanently down, and customer service would not, or could not, assist me.

I had been using their backup service, but I could not restore the VPS to a running state with the tools I was provided. However, I figured out that I could put the VPS in maintenance mode and I could atleast SSH to it, so I used WinSCP across SSH to download my files and databases.

Bottom line. While I didn't lose anything, I learned the hard way (which was a failure on my part since I am a systems administrator) that you can't become complacent if you care about your data when you don't control the environment. I trusted VPSLand's administrators to safeguard my data and they failed. If I had not discovered the maintenance mode option, I would have lost everything. The only way I can ensure my data is safe is to back it up myself. I eventually wrote a VBscript that connects and downloads all of the web and MySQL directories to my machine at home on a regular basis.

I know all situations are different, but if you can backup your data yourself in any way... even if it's just additional copies on some form of removable media, it's best to do it. Do not trust that whatever service you are paying for will ensure that it's never lost. The company might have a great looking website and have well written policies, but if you don't know what's going on behind the scenes, you don't really know anything about the company hosting your data. Go the extra mile and safeguard your data yourself.

They do have a back up from 2007. by Bill,+Shooter+of+Bul · 2009-01-02 06:45 · Score: 1

From their twiiter feed (http://twitter.com/jsupgrades)

new server is powered up. We are copying the database. 9:23 PM Oct 7th, 2007 from web

So Unless they nuked the old drives from the old server they should have a back up of oct 7 2007. That's better than a sharp poke in the eye. Sort of.

--
Well.. maybe. Or Maybe not. But Definitely not sort of.

Re:They do have a back up from 2007. by Ironica · 2009-01-02 10:53 · Score: 1

If they kept the backup after the new server was up, that is.

--
Don't you wish your girlfriend was a geek like me?
Re:They do have a back up from 2007. by paimin · 2009-01-02 11:52 · Score: 1

Something tells me they nuked the drives. Just a hunch.

--
Facebook is the new AOL

Stupid should hurt by YourMomLikesIt · 2009-01-02 06:53 · Score: 0

Every day I see these self proclaimed IT professionals that shouldn't be working do stupid things and make ignorant decisions. This is one of them. It's amazing that someone would actually think that mirroring is an actual solution for backup. Rule one, you can never have enough backups. Rule two, stupid should hurt. This has gotta hurt! Mirroring and RAID fail, plain and simple. You should always have backups, plural! Anyone who hires staff be on the lookout for anyone from this company, unless you need someone to work the grill!

There is no excuse for this! by jvschwarz · 2009-01-02 06:53 · Score: 1

This is just poor IT admins, or maybe none at all.

And to those of you who want to blame the bean counters, there are cheap ways of making backups, even if it means manually doing a sqldump to some other server, disk or even a PC with a big hard drive laying around.

There is absolutely no excuse for any server admin not to have tested backups.

Since I'm currently unemployed, if any of you admins need help with setting up adequate backup for your servers, please feel free to hire me!

--
... if that's your best, your best won't do... - Twisted Sister

Blogs destroyed??? by Anonymous Coward · 2009-01-02 06:57 · Score: 0

Online journals DESTROYED?!??!?

Oh happy day...

Wow, Just wow... by skyride · 2009-01-02 06:57 · Score: 1

So, An established company managed to not make any backups of their data and thought it was safe because it was on a RAID 1? (im assuming RAID 1 since there was only 2 drives). Offsite backup is designed to protect against 2 things, Hardware failure and accidental deletion (like this scenario). RAID 1 only greatly reduces the likeliness of the former. Heck, Im 15 and I know that! I take regular backups of the MySQL database on my server. Please explain to me how and establish company is incapable of that..

History repeats itself by Anonymous Coward · 2009-01-02 07:01 · Score: 0

It's happened before, http://www.laobserved.com/archive/2006/05/journalspace_we_shall_ret.php

Evolution at work by gooneybird · 2009-01-02 07:03 · Score: 1

Evolution even applies to the 'Net:

The stupid die...

How many times does the story get repeated? The DATA is what's most important, not the hardware... They valued HA over DR. The fact that they never made a backup in 7 years (since 2002) is unbelievable.

I wonder if they considered asking all of their customers to reseed their site with their entries? After all, the bloggers kept a local copy right??? RIGHT???? I know I do... Unreal.

Thats like saying... by C_Kode · 2009-01-02 07:04 · Score: 1

Why Mirroring Is Not a Backup Solution. ...is like saying: Why a two door coup isn't a long haul transport vehicle.

While the two door coup will do the job in some respects, you're not going to keep you job using it. Just like using RAID as a backup won't allow you to keep your job.

Never needed a restore? by Thyamine · 2009-01-02 07:04 · Score: 1

I just don't understand how after 6 years no one ever needed something restored. Really? I have plenty of power users that occasionally delete something. Mistakes happen. Restore from tape, everything is back to normal. I would love to know what they thought they were telling people/users: 'Oh, restore your file/data, no that's impossible. No one can do that after you delete it.'

I mean, I'm freaked out that my church (where I'm the network admin) doesn't have a proper backup solution yet (cost being the issue; any suggestions welcome). Let alone a site with thousands of users a month.

--
I will shred my adversaries. Pull their eyes out just enough to turn them towards their mewing, mutilated faces. Illyria

Re:Never needed a restore? by Clover_Kicker · 2009-01-02 07:43 · Score: 1

I mean, I'm freaked out that my church (where I'm the network admin) doesn't have a proper backup solution yet (cost being the issue; any suggestions welcome).

Rsync snapshots to a USB drive. Maybe a couple of USB drives, one that you take home every week while the other stays plugged in, swap every Sunday after the service.
If your server is *nix I like rsnapshot, a wrapper script for hourly/daily/weekly/monthly snapshots - dead simple and painless.
If your server is Win32 I'm told rsync works on cygwin, never tried it myself.
Re:Never needed a restore? by Darkk · 2009-01-02 15:37 · Score: 1

Rsync snapshots to a USB drive. Maybe a couple of USB drives, one that you take home every week while the other stays plugged in, swap every Sunday after the service.
If your server is *nix I like rsnapshot, a wrapper script for hourly/daily/weekly/monthly snapshots - dead simple and painless.
If your server is Win32 I'm told rsync works on cygwin, never tried it myself.
It does and I have used it on a couple of occasions. Although I prefer using rsync between Linux boxes anyway. That is just me.

I hope the mainstream tech press picks this up by Clover_Kicker · 2009-01-02 07:04 · Score: 1

I'd like to see a nice post-mortem in a mainstream IT magazine once the company actually goes broke.

A clipping from Infoworld or CIOwhatever magazine might allow even beancounters to understand the risks of Scrooging on backup and DR infrastructure.

One RAID + backup solution by SlashDev · 2009-01-02 07:06 · Score: 1

is NetApp snapshots. In a shell, it mirrors AND backups

--

TOP DSLR Cameras Reviews of the top DSLRs

Re:One RAID + backup solution by OrangeTide · 2009-01-02 07:17 · Score: 1

Any NAS worth its salt will support fast/instant snapshot creation.
It also doesn't hurt to have replication, so if your building burns down or the server room is hit by lightning you still have your data.
But it's hard to argue with the reliability of writing backup tapes and then having a data archiving company send a van out to pick up your tapes and keep them locked up in a safe place. Tapes aren't very good when you want to go back to a previous date (that's why we have snapshots). But tapes are really good if you want a uncomplicated process (although time consuming) to reliably archive your data. (and I say this coming from three storage companies that wanted to just sell people a box full of harddrives to replace tape backup)

--
“Common sense is not so common.” — Voltaire
Re:One RAID + backup solution by Darkk · 2009-01-02 15:35 · Score: 1

Tapes aren't very good when you want to go back to a previous date (that's why we have snapshots). But tapes are really good if you want a uncomplicated process (although time consuming) to reliably archive your data. (and I say this coming from three storage companies that wanted to just sell people a box full of harddrives to replace tape backup)
Let me guess... folks from Dell?
Re:One RAID + backup solution by OrangeTide · 2009-01-03 12:31 · Score: 1

no. There are several companies all pushing the same thing. It's hard to sell a tape solution to small customers that have lots of data, because the cost is prohibitive.

--
“Common sense is not so common.” — Voltaire

Only 14k users monthly by grimJester · 2009-01-02 07:06 · Score: 1

Something like blogger.com has 23 million. 14k a month is really not a large user base. This is something one single guy does as a hobby, not a large organization of volunteers. Starting over would probably make the user base stabilize at less than half the previous number, even that only with months of hard work.

Stupidity rewarded, conclusions deserved by kaizendojo · 2009-01-02 07:08 · Score: 1

Not only were they stupid enough to never even make a single copy of their site, they blocked www.archive.org from their robots.txt file so you can't even recover anything from that avenue.

Some of the pages seem to still be in Google's cache, so if you had a journal there and you were too stupid to back it up as well, there might be some hope for your content.

Thanks, journalspace.com for providing me with the best sales tool ever for convincing my web clients it's worth the extra few bucks to do a maintenance contract including off site backups....

Cache? by Seth+Kriticos · 2009-01-02 07:13 · Score: 1

Can't they just ask Google - for instance - to give the cache records of the site? I mean, we all know that what is released on the web is mirrored over and over again? No?

Re:Cache? by Skapare · 2009-01-02 08:11 · Score: 1

Did Google cache the user database and passwords?

--
now we need to go OSS in diesel cars
Re:Cache? by troll8901 · 2009-01-02 09:35 · Score: 1

Should Google even be able to see the passwords?
Re:Cache? by Shadyman · 2009-01-02 15:35 · Score: 1

Nope, as AcquaCow pointed out, they had web spiders blocked.

Death by robots.txt?
Re:Cache? by tru24rm · 2009-01-02 16:55 · Score: 1

Or, perhaps REAL spiders were the culprit? Someone may have forgotten to blow the dust, cobwebs, critters & other random debris from the machine, causing that yummy smell of burning dust we all love. ****I just LOVE the smell of roasted arachnids in the morning!**** The best part of wakin' up is arachnids in your comp! (This comment was invented by Shampoo & Company, LLC. All Rights Reserved.)

When even I am doing the near-offline backup... by sam0737 · 2009-01-02 07:18 · Score: 1

...through wake-on-LAN + rsync + shutdown to the box next to the room. I can't imagine something with that famous aren't taking at least the same measure to prevent such thing from happening.

I had seen and experienced too many accidents, which taught me one thing - if you are tight on budget, make a offline backup solution, or at least point-it-time, incremental, non-realtime first, only after that, go for RAID/Mirror, but not in the opposite way.

Most of the "Ouch" moment are due to user faults, mirroring doesn't protect anything silly user action.

Even if you can't invest time to figure out the FS snapshot, flushing database before starting backup, rotating whatever database update log, still much better than mirroring.

BTW, for mirroring, two hard drive of the same lot has a much higher rate to die at or around the same time.

Last but not least, buy a new harddrive every year or two, so your harddrive is always new and healthy. Bundle the old one to the backup array in JBOD so it's once again big enough to backup the new disk.

Hot backup vs rebuilding RAID array under load? by sublimino · 2009-01-02 07:29 · Score: 1

During those couple of hours the site is running from a degraded RAID array and the only backup is on the way to a remote storage area. Table locking may be an issue with very large data sets, but there are still solutions: using a redundant server with block level replication (DRBD) the replication can be paused and a backup taken while there is no disk or database I/O. A "re-sync" of even very large disks is much faster to restore than a RAID rebuild due to block level meta data stored on the disks by DRBD.

double-click by Anonymous Coward · 2009-01-02 07:29 · Score: 0

C:\>rm -rf /
'rm' is not recognized as an internal or external command,
operable program or batch file.

Everything's still running here...

Double-click on the attachment I just e-mailed you. That should fix the problem.

rdiff-backup by cca93014 · 2009-01-02 07:41 · Score: 1

http://www.gnu.org/savannah-checkouts/non-gnu/rdiff-backup/

Great solution for off-site incremental backups...

--

Invoicing, Time Tracking, Reporting

Re:rdiff-backup by jimmyharris · 2009-01-02 12:09 · Score: 1

Or even better, Safekeep which uses rdiff-backup but adds easy per-server configurations without writing scripts, LVM snapshots, database dumps and SSH key configuration.

How to backup, the Slashdot way by Anonymous Coward · 2009-01-02 07:44 · Score: 1, Funny

1. The file system must be ZFS. If your FS' acronym is more than 4 characters long, your data is fucked from the beginning.

2. Backup often: if it's not done every five minutes, your data is stale and totally not fresh.

3. Off-site backups: you need to Fedex your shit to five different corners of the world at a minimum. If a fire breaks out, you can call your offices in Uganda for the restore disks/tapes.

4. Secure your backups: keep guard dogs, armed mercenaries and Jason Statham nearby. In the likely event that your hard drives get jacked, you can always rely on Statham to kick down a door or two to save the day.

5. ???

6. Profit.

Re:How to backup, the Slashdot way by Anonymous Coward · 2009-01-02 08:21 · Score: 0

The file system must be ZFS. If your FS' acronym is more than 4 characters long, your data is fucked from the beginning.
So NTFS is in then, thats good ;) well kind of...
ext2 and ext3 are still fine, infact most file systems fit into the less than or equal to 4 characters...

Anthony Fremont, Journalspace.com CEO/CTO, Age 6 by RevWaldo · 2009-01-02 07:45 · Score: 1

(Anthony is at his PC playing Call of Duty, the sound of gunfire filling the office. The VP of Sales walks in.)

Vice President of Sales (nervously): Morning Anthony! It's a real good day we're havin' here in Peaksville! A real good day!

Sys Admin (rushing in): Good Lord! Both of the RAID drives are blank! The entire site is gone!

VP: Anthony, do you know anything about this?

Anthony: A message kept popping up on my screen saying the drives were nearing capacity. I don't like it when popups come up on my screen! So I deleted all the files. (waves his hands like a magician) Now all our drives are at 100% capacity!

SA: Why you little..

VP: Why it's real good that you done that Anthony! Real good! I hate when popups come up on my screen too, don't you Bill?

SA: Yes, I really hate that.

VP: Bill, why don't you go and restore the files from the backups?

SA (barely containing his anger): I would but when I went to the vault for the tapes it was filled with Pokemon cards.

VP: Anthony, do you know where the tapes are?

Anthony: I wished them into the cornfield. Silly stupid tapes! I needed the vault for my cards! What if there was a fire? Or a dragon attacked?

SA: Well, we can still restore from our offsite storage..

Anthony: Oh I shut that down, when did I do that? My Birthday! I needed money for my party. The accountants said there wasn't any and I got real sore! So I shut down nonessential services to fund my party. We had a petting zoo and everything!

SA (losing it): Nonessential! Do you realize what you've done!? The site's dead! Nearly six years of data, thousands of users gone! This means disgrace and bankruptcy!

(Anthony pauses the game and points his fingers at the SA, turning him into a jack-in-a-box. The VP stifles a screen.)

Anthony (unpausing the game): I didn't like that dumb old site anyway. We'll make a site so that people can play Call of Duty online for free!

VP: But Anthony we don't have the rights to..(Anthony shoots her a mean look) Why that's a fine idea, Anthony, real fine! We'll get started on it right away! Right away...

--
Prisencolinensinainciusol. Ol Rait!

Don't you love the suits? by Anonymous Coward · 2009-01-02 07:53 · Score: 0

First, they blame Mac OS X. Then they blame someone who sabotaged them months ago. At least they didn't blame DriveSavers.

At no time do they ever accept blame as a company. All they had to do was make one fricking full backup per week. That was too much for them. They decided to do something fun instead of mind the tape drive for a little while. Then something bad happened, and now the data, and the company, are toast.

One fricking offline backup per week. That would have saved their company.

And you know who said "no" to spending money on tapes or DVDs. It was some suit.

The geeks need to become the leaders of this society, or we are in big trouble. And no, using a Blackberry, iPhone, or a Mac or an does not qualify you as a geek.

OS X Server by DTemp · 2009-01-02 07:56 · Score: 2, Interesting

The site was run on OS X Server... I think this may be indicative of the level of IT effort with the company. Look, *I* run an OS X Server... but *I* am a Biology major that knows approximately dick about the UNIX command line, and use it to run a server that I probably wouldn't be able to run any other way. I also have it backup nightly to a cheap NAS, archiving old backups, and I've tested a restore to make sure it works.

This is probably just a couple guys who ran a website in their spare time... not a huge IT effort that failed.

Comment removed by account_deleted · 2009-01-02 07:58 · Score: 1

Comment removed based on user account deletion

Licensing for IT Pros? by deviator · 2009-01-02 08:00 · Score: 1

I've been in IT for a long time - and my company manages a whole lot of backup systems. Our customers usually have mirroring/RAID, and volume shadow copy, and disk backups, and tape backups, and...

As networks grow in complexity I am starting to think it would make sense to require IT pros to get licenses. Even though it would cost me a ton of time & money to license my staff, I think it would prevent idiotic stuff like this from happening--or at least make the licensing board directly responsible for a screwup like this.

Kinda like an electrician, but more like the BAR than anything else - screwing something up big-time (through ignorance or incompetence) could get an IT Pro "disbarred."

There's *no* good excuse for this, period. If your database can't be snapshotted so you can get a clean backup of it--use a different database technology which can.

Re:Licensing for IT Pros? by troll8901 · 2009-01-02 09:33 · Score: 1
As networks grow in complexity I am starting to think it would make sense to require IT pros to get licenses.
When you raise the entry bar ...
- How would fresh grads get their experience?
- Who would companies hire for desktop and helpdesk support?
- How would no-certs-but-very-experienced people find jobs?
- How would hirers know whether to hire a person with a cert, or a person with experience?

Damn that robots.txt... by AcquaCow · 2009-01-02 08:01 · Score: 1

Not even the Wayback Machine can save them...

"We're sorry, access to http://www.journalspace.com/ has been blocked by the site owner via robots.txt."

-- Dave

--

up 12 days, 22:30, 2 users, load averages: 993.20, 994.21, 994.56
*makes note to limit user processes...

Re:Anthony Fremont, Journalspace.com CEO/CTO, Age by mksql · 2009-01-02 08:03 · Score: 1

So true, many IT shops are located in or near the Twilight Zone.

--
I should have been a Geologist.

google cache ? by Anonymous Coward · 2009-01-02 08:07 · Score: 0

you're telling me google cache is not a valid backup mechanism??

Re:google cache ? by Anonymous Coward · 2009-01-02 08:21 · Score: 0

http://74.125.45.132/search?q=cache:7NRRj4HOxSwJ:damestjernelys.journalspace.com/+site:journalspace.com&hl=en&ct=clnk&cd=1&gl=us
With crap like that, I say good riddance.

File -> Export for the web by Onymous+Coward · 2009-01-02 08:09 · Score: 1

Over-aggressive attempt at vendor lock-in. Many content hosting businesses are perfectly content (hm) to let the hurdle of scraping your data out be barrier enough.

I don't believe many customers think about the importance of being able to File -> Export.

I hope eventually consumers come around to understanding, and that this feature becomes a primary criterion in selecting services.

Of course a mirror is a backup! by Slashdot+Parent · 2009-01-02 08:15 · Score: 1

Break the mirror, pull out one drive, relocate it to a secure location, replace the drive with another, and rebuild the array.

There. A mirror is a backup. QED.

--
They don't grade fathers, but if your daughter's a stripper, you fucked up. --Chris Rock

Re:Of course a mirror is a backup! by hesiod · 2009-01-02 08:29 · Score: 1

Meanwhile, your server's file access slows considerably, and during each HD swap you are running a live server with one hard drive. If that drive dies, then your site goes down. With "real" backup methods you can make your backups and survive that disk failure without the site going down.
Still, if you are just using that as proof that it COULD be used for backup... well, yeah, a bad backup method is still backup.

I remember Karl Rove had a JournalSpace.... by bodland · 2009-01-02 08:24 · Score: 2, Interesting

I'm just sayin'

Ship packets of data, not storage media by Anonymous Coward · 2009-01-02 08:41 · Score: 0

While I don't doubt that somebody here is trucking around racks of disks as a data transport, I think the majority of those advocating disk backup assume you can make your daily data transports via Internet or other WAN links, e.g. running rsync or another incremental transfer protocol between two disk servers. By having disk arrays at both sites, the incremental algorithms can effectively scan a local and remote snapshot and send only the deltas.

Many businesses do not have that much new data per day that they could not transmit the deltas over their existing WAN links. For these businesses, it is nothing but habit that would demand sending tapes out versus sending packets. For my own small software consulting business, I even did this via a GPRS radio link for over a year... piggybacking on the same GPRS link (think SLOW modem) that was also carrying my job-related inbound and outbound email.

haha by Tatsh · 2009-01-02 09:06 · Score: 1

hahahahahaha

where have i heard this story before... by Anonymous Coward · 2009-01-02 09:18 · Score: 0

"server crashed sometime between 18:59:40 EST (GMT -5:00) and 19:00:00 EST (GMT -5:00) on Dec 31, 2008 which remarkably corresponds to within at most 20 seconds of the New Year in GMT. I have been running this same hardware non-stop for more than six years..."

are you sure this isn't the same people?

Re:where have i heard this story before... by JoelKatz · 2009-01-02 19:09 · Score: 1

If this is them, it could definitely be an LS60 bug. GMT time went from 23:59:59 to 23:59:60, which can cause some software to think there are -1 seconds left in the year.

Just use Google Cache to recover your posts by Anonymous Coward · 2009-01-02 09:19 · Score: 0

You can use Google Cache to restore most, if not all of your posts --

http://digg.com/software/Recover_most_all_of_your_JournalSpace_posts_images_w_Google

google cache? by Anonymous Coward · 2009-01-02 09:31 · Score: 0

Maybe google can help people recover some things

why I wish there was a slashdot age limit by SuperBanana · 2009-01-02 09:31 · Score: 1

There is no rational justification for tape anymore, what with the cost per TB stored on hard disks now under $130, total $$.

Tape media is cheaper and has a higher density than any hard drives on the market. Only laptop drives are starting to come close in density, but they're several times more expensive per GB once you go beyond needing to store more than a couple of TB of data.

Also, tapes are designed to be reused thousands of times. SATA connectors are only spec'd for something like 500 connect/disconnect cycles at most.

--
Please help metamoderate.

Re:why I wish there was a slashdot age limit by postbigbang · 2009-01-02 10:03 · Score: 1

The procedure isn't to connect and disconnect SAS or SATA cables, rather to pipe the data across network wires, in-band or out-of-band. Having a shadow network to perform backups, provision machines, re-slab servers and so on (especially VMs!) is the goal.
Tape media is fraught with difficulties, tho not as much as when we dealt with DC600s and 3740's. The cute little cartridges today are nice, and wonderfully fast, and still lack random accessibility, require physical manipulation (and are more subject to loss, theft, and environmental disasters), are proprietary to specific vendor solutions, and while more compatible with various OS blends than ever before, are still a hassle worth retiring, IMHO.

--
---- Teach Peace. It's Cheaper Than War.

Drive mirroring is for fail-over redundancy.... by ducomputergeek · 2009-01-02 09:35 · Score: 1

I've always treated drive mirroring as fail-over redundancy. If the primary drive fails, use the mirrored copy until the drive can be replaced. There have been times that we've used the mirror drive as a back up if the primary drive failed. But every night, the entire site database and file structure is backed up on-site via tape and a FTP copy of critical data (database dump) is sent to a dedicated server at another hosting company in another country as a second copy of this data is backed up to a server in the office. And that server gets backed up to tape weekly.

Does this all cost money? Yes. Has it completely saved our ass before. The worst we've had is a disc failure that took the database server down for 30 minutes until I could reboot it and use the mirrored drive. Drive was swapped out later that day, took the site off line for an hour that night (when it doesn't get a lot of traffic anyway), copied the HDD, and rebooted. That was 98 days ago. *knocking on hard brownish furniture*

I load random copies of the back ups on a local development server twice a week, just to make sure that nothing weird is going on. Some of the other people in the department think their boss (me) is crazy about this stuff, but they're programmers not systems people.

Granted we run e-commerce for others. Every minute it is down is costing people money. Now we run only about 600 transactions a day at this point, but that averages to $15k a day.

--
"The problem with socialism is eventually you run out of other people's money" - Thatcher.

Re:Drive mirroring is for fail-over redundancy.... by Darkk · 2009-01-02 15:18 · Score: 1

Very sound system you have in place. Keep using it.
They will thank you for it and keep your job.
Re:Drive mirroring is for fail-over redundancy.... by JoelKatz · 2009-01-02 19:11 · Score: 1

"I've always treated drive mirroring as fail-over redundancy. If the primary drive fails, use the mirrored copy..."
No, you've got it wrong. There is no "primary drive". RAID 0 treats both drives precisely the same. Reads are distributed evenly over both both drives and all writes go to both drives.
Because there is no primary, it follows that there is no backup. A backup would be the one that you weren't using or changing constantly.
RAID 0 does not create a primary and backup. It creates a dual-active system. It provides fault tolerance (if one drive fails, data can still be read from and written to the other), but it doesn't provide any backup at all.
Re:Drive mirroring is for fail-over redundancy.... by Guido+von+Guido · 2009-01-03 06:49 · Score: 1

RAID 0 does not create a primary and backup.
You can say that again.
(Looking at the timestamp, I can understand the mistake....)
Re:Drive mirroring is for fail-over redundancy.... by JoelKatz · 2009-01-03 06:55 · Score: 1

Hehe, yeah. I mean RAID 1. But then, the same is true for RAID 5, RAID 6, RAID 10, and on down the line. In fact, I can put it this simply -- RAID is not backup. It's fault-tolerance.

Darwin awards by ZiggyM · 2009-01-02 09:45 · Score: 3, Funny

Are there Darwin awards for websites?

business fatigue by epine · 2009-01-02 10:00 · Score: 1

I haven't seen a comment yet about small business fatigue.

Was the disgruntled employee a founding member? Was he a stake-holder on any other level? Had all back salaries been paid to cover any ... uh ... dry spells in the startup plan? Were they confident they would be cash flow positive entering a difficult business year? Do they really not have any back media stashed somewhere? Maybe they just looked at the recovery cost from their dated (and possibly tampered) spares, the cost to their business credibility, and decided the prudent business decision was to close the doors and move on.

Maybe the disgruntled party was throw out the door but the parties responsible for creating the dysfunctional environment hung around. They usually do. Does the closure unlock any business assets that one or more of the existing principals can roll forward into another opportunity? Is the "failed drive" story just a lot more sanitary for public consumption than the sordid story about disgruntlement and personality conflict?

There's a troll out there who is suggesting maybe the same thing about Madoff's too easy confession.

http://www.globalresearch.ca/index.php?context=va&aid=11488

Not very good and puts far too much faith in "board level oversight" and never once mentions the Enron factor: even at the top (maybe especially at the top) people refuse to question black boxes if the profit stream appears reliable. How did Enron get away with not supplying detailed balance sheet? The usual refuge of "trade secret": if we tell you how we do it, the chicken recipe will cross the road.

Isn't that the bonus of being at the top? Everyone lets you into their secret tree forts? There are a lot of empty suits out there stalking the putting greens who aren't much motivated to puncture the veil of a secret handshake. They have other agendas.

http://www.edge.org/q2009/q09_10.html#taleb

The problem with the Madoff analysis is that it presumes his operation was legitimate for some (long) period of time, then he's wiped out by the big melt-down LTCM style, after which he concocts this bogus pyramid excuse. How then did he really achieve these implausibly consistent results over that long period of time? Does he really have a system that works as portrayed (until it blows up) LTCM style? Is there a secret he's still trying to keep?

I'd put my own bet on the square that the fund return levels were massaged since way back, and that the empty suit oversight boards were as gullible as you'd have to imagine, despite the glass and granite whitewash of financial controls and oversight.

As far as Journalspace is concerned, if it turns out that this "backup oversight" was the only bad seed at the core of this apple, it would be a case of truth stranger than fiction.

Hard Drive Backup - its how you use them by Hairy1 · 2009-01-02 10:01 · Score: 1

Hard Drives are great backup devices. Unlike tapes they don't stretch, and can be easily verified. Picking out specific files is far easier than with tape.

However, Mirroring is NOT a backup. In the commercial backup system I wrote I was responsible for a postgres database. Every night a cron job would do a local SQL dump to a file. This file would then be zipped and archived to several other computers, some of which were offsite.

We also had many large files, which were not stored in the database. For these files we used rsync configured in such a way that files could never be deleted. Generally files didn't change once generated. The database zip file was named based on the date, and a complete copy kept for each day.

It was a very brute force way of doing the backup, but had massive redundancy. Several computers at various locations would need to be comprimised or destroyed to destroy the data. The scripts and permissions were such that an attacker could not get universal access from one point.

I do worry about systems like GMail - and how they store and back up your data. I have invented various decentralized backup systems, so I assume that larger organisations use a similar approach - that is redundant data on many hard drives.

Archive/redo logs too by DamnStupidElf · 2009-01-02 10:07 · Score: 3, Informative

ACID compliant databases use a log, much like a filesystem journal, that contains all the changes made to the database before those changes were actually written out to the main database storage. When you back up the raw database, you back up all the logs since at least the time you started backup up the raw files until the time the backup was finished, and when you need to restore the database you put the raw data back and then let the database replay the logs.

Re:Archive/redo logs too by MarkRose · 2009-01-03 04:03 · Score: 1

Yep. The only trick is making sure your backup process completes before the log has over-written itself at the point when the backup started, and that the log is copied last. If you were backing up say 100 GB at 50 MB/s, it's going to take 34 minutes -- meaning the log better hold at least 34 minutes of updates, and many times that for a comfortable margin.
It's much simpler to create a snapshot and copy from that.

--
Be relentless!
Re:Archive/redo logs too by DamnStupidElf · 2009-01-04 11:03 · Score: 1

It's much simpler to create a snapshot and copy from that.
I'm not sure exactly what configuration you're talking about, but if the database server load is so high that you need to backup from a snapshot, wouldn't replaying the logs to keep the replicate up to date cause a huge load on the server hosting the replicate, too? Or are you talking about taking a snapshot of the raw storage using a feature of the SAN/OS and then applying enough archive logs to that to make it a consistent database, and then backing that up?
There are only about a hundred different ways to do proper database backups...
Re:Archive/redo logs too by MarkRose · 2009-01-04 14:13 · Score: 1

Yes, it would cause a high load on the replicant server, too. But it's okay if the replicant falls behind a little bit. Once the heavy demands of the backup are complete, the replicant can catch back up. Obviously, if both machines are loaded to near capacity, this won't work -- but by then you'd already be experiencing intermittant slowdowns (increased latency) at busy times of day.
The main delay is caused by disk bandwidth regardless of the backup method. It simply bogs things down when you read gigs of data off the disk.

--
Be relentless!

Re:No Archive.org either-Compound Foolishness by Anonymous Coward · 2009-01-02 10:08 · Score: 0

This is just compound foolishness. I gather they did it in an attempt to control bandwidth costs since it's hard to imagine any other reason.

Or, they did it to avoid database load. Mirroring an entire website with wget can generate a lot of database queries.

They get what they deserve by Anonymous Coward · 2009-01-02 10:19 · Score: 0

I'm sorry they lost their data, but what kind of idiotic plan is that? As for backing up the database, just quiesce the freakin thing at 4AM.. split the mirrors and start the database up again. That whole part should take 15mins top? Then backup the mirror disk and when done reattach the mirror. Jeezz..... What a half-ass setup!!!!

NOTE to SELF:
If I ever see a resume with journalspace as an employer... never hire them.

And that's why you should always... by Namlak · 2009-01-02 10:38 · Score: 1

...leave your cork on your fork.

Re:No Archive.org either-Compound Foolishness by psydeshow · 2009-01-02 10:49 · Score: 1

They also purposely blocked archive.org via a robots.txt exclusion, so the bloggers can't use that to try and recover some of their blogs.

This is just compound foolishness. I gather they did it in an attempt to control bandwidth costs since it's hard to imagine any other reason.

Umm, utter incompetence?

The same admin cocky enough to declare that their data is safe because it's stored in a RAID-1 is likely cocky enough to believe that archive.org and other scrapers are "stealing" the content of his website.

Google by Anonymous Coward · 2009-01-02 10:55 · Score: 0

I thought Google backed up the Internet

clever in 1964, but today we have ZFS. by toby · 2009-01-02 11:07 · Score: 1

= Unlimited, cheap snapshots.

Want to know more?

--
you had me at #!

Re:clever in 1964, but today we have ZFS. by Anpheus · 2009-01-03 12:35 · Score: 1

Oh, ok, let me know when they decide to loosen up the licensing on that driver since the code is wholly owned by Sun and they can choose to do that if they actually wanted to increase adoption and allow Linux (or Windows!) to be distributed with compiled in drivers.

Correct by toby · 2009-01-02 11:11 · Score: 1

mysqldump --single-transaction

--
you had me at #!

Mirroring isn't backup by kilodelta · 2009-01-02 11:12 · Score: 1

I know the difference. A mirror is simply to protect against hardware failure in most cases.
Higher RAID levels offer faster read/write throughput while encapsulating some form of error correction to contend with hardware failures.
But you must, must, must do regular backups of data. At my last job we were so paranoid we replicated databases to other servers and did daily backup dumps of all databases to tape.
There were some databases that go backed up every 15 minutes, those tended to be more transaction based databases. We used an open source product called Rsnapshot to do the backups.

blogs eh? by Anonymous Coward · 2009-01-02 11:22 · Score: 0

And nothing of value was lost......

Defense In Depth by maz2331 · 2009-01-02 11:31 · Score: 1

My approach to a business-critical system is to employ a strategy of defense in depth.

First level: RAID-10 array in NFS/DB boxes.

Second level: DRBD said NFS/DB boxes such that one server failure results in seconds of downtime max.

Third level: Automated dump of DB and make a tarball of website content and code.

Fourth level: RSYNC the tarballs offsite.

Fifth level: Occasionally backup the offsite backups to some form of removable storage.

It's concievable that this strategy could still have a failure whereby the data was all lost, but such disaster would need to be on the scale of a global thermonuclear war, asteroid impact, etc. In such an eventuality, nobody is going to care at all about the backed-up data anyway.

Get blog data back by oneofthose · 2009-01-02 11:45 · Score: 1

To get their blog entries back people might be able to use Google Reader. If journalspace provided RSS feeds and if the entire blog posts were part of the RSS feed and if somebody subscribed to the RSS feed in Google Reader one should be able to obtain all blog posts from when the first user followed the blog with Google Reader. Here are some details: http://www.niallkennedy.com/blog/2005/12/google-reader-api.html

tape backs up perfectly.... by SethJohnson · 2009-01-02 11:49 · Score: 2, Funny

Back in the nineties, a friend of mine was backing his mac system up weekly with a tape drive. The thing is, he was using the same tape to repeatedly back up onto. One day he calls to tell me he needed some help recovering files on his hard drive after a crash. I asked, "What about the tape backups?" He said, "That thing backs up perfectly. The problem is, it doesn't restore at all."

Seth

--
$5 / month hosted VPS on linux = awesome!

my sympathy by SethJohnson · 2009-01-02 11:57 · Score: 1

My condolences go out to the Journalspace.com operator. It was probably a huge part of that person's life and now it's all gone because of a mistake. I hope it doesn't leave him disillusioned about embarking on other ambitious efforts.

Seth

--
$5 / month hosted VPS on linux = awesome!

Human mistakes? Seen it before by yalap · 2009-01-02 12:53 · Score: 1

Maybe they had the same problem we had with our former Tampa/Atlanta based hosting company. When a drive in the RAID 1 failed they jumped in to replace the drive. But they replaced the good drive, not the bad drive. (And don't ask about their SAN where we had a backup and it went offline every few days. Excuse-of-the-minute included they had to replaced all the hard disks, or it cost $200,000 so it must be working, or they "replaced all the wires" because they might have cracked when they moved it from Tampa to Atlanta, even though it was still in Atlanta. Then it wasn't their problem any more because they couldn't do anything about it (except keep billing us $100 per month). Absolutely unbelievable what excuses they could pull out of the air. Thanks idrive.com for clean, reliable off-site backups. ) There is some computer usability study out there where they instructed sys admins in RAID 5 then put them in front of a test server and created various fault scenarios. All the data loss was caused by people pulling out the wrong drive or doing the wrong thing.

Hanlon's Razor by Anonymous Coward · 2009-01-02 13:23 · Score: 1, Informative

I'm the guy you reach, assuming you have a valid support contract, in situations like this. In real life, I work as a backline support engineer for midrange disk arrays, which range in price from 70K bux to over 500K bux. I've also taken operating system, network and security cases in the past with a previous employers. Our team takes these Severity-Cluster F*ck-data loss cases daily. We can salvage the data, sometimes. Frequently, there is nothing that can be done. After that, I write the Root Cause Analysis document and assist with the presentation to the customer's management.

I can understand where this guy is coming from. This is a potential career limiting move, and no one ever admits they screwed up. When you're truly scared, you loose the capacity for rational analysis, and grasp at straws. Unfortunately, this is when you are most likely to make mistakes.

A Microsoft filesystem support engineer once gave a really good analogy. Assume the big expensive disk array is a brand new Ferrari sports car. Just because it's really expensive with all the bells and whistles doesn't make it immune to flat tires or rear end collisions, does it?

Malicious actions *are not* the most likely cause of data loss or corruption. All of the specific situation below never occurred in the same data loss event, but I've personally seen each.

1) The array firmware is over two years out of date, because uptime was so important that maintenance was never scheduled. Same for the host OS and HBA drivers.
2) Failure to heed a published service alert requiring an upgrade or workaround.
3) Failure to save the current array configuration information.
4) The site does not have tape backup, instead the data is remote replicated to a similar array at the DR site.
4a) Someone convinced management that a disaster recovery site situated below mean sea level in New Orleans is a good idea. Oh, the date is early Sepember 2005, a week after Katrina hit.
4b) Alternatively, the DR site is in Florida, just after another hurricane. Someone forgot to buy a diesel fuel contract to top off the emergency generators every three days. After a week, the site goes dark.
4c) The telco routed data center primary and alternate fibre lines through the same physical conduit under the street. The utility crew with a ditch witch severs both. The ditch witch *always* wins! Anyways, your DR site is now out of the picture.
5) If tape backups are available, the cassettes are stored on top of a cabinet marked "Danger High Voltage".

6) A minor failure triggers the chain of events. For example, a drive fails and reconstruction to a hotspare drive begins. This is ignored.
7) The array's "call home for help" feature was never configured.
8) The array's data scrubbing feature was not active.

The array continues to operate in a degraded state, but since data availability is maintained, no one notices.
9) A reconstruction read failure on another spindle (second fault) in the Raid 5 volume occurs, taking the entire volume group offline. This read failure also kills off all other hotspares. This situation would have been prevented with data scrubbing.
Up to here, support can almost always recover the existing data on the array drives without too much work or restoring from backup.

10) Someone runs to the array, hears the array alarming and sees the flashing lights. End users are complaining. Management wants something done "right now". The replacement drives are a few hours away. It's time to make a command decision- call for help, sit tight and wait for the cavalry or go into kamikaze sysadmin mode and save the day?
11) The sysadmin recalls there is another identical array, which is running less important applications. He decides to yank parts from this "installed spare".
12) The stolen drives have not fixed the problem. A replacement controller or two from the other array didn't either. End users report additional problems with previously unaffected hosts.
13) Someone finally decided to call support, b

Everybody just calm down by MyHair · 2009-01-02 13:24 · Score: 1

What JournalSpace didn't say is that their mirrored drives are actually 30GB Zunes. Just let them run down overnight, charge them back up and everything will be fine tomorrow.

Make that 8 inch reel - morningitus by dbIII · 2009-01-02 13:52 · Score: 1

It's early Saturday morning and I grew up with metric.

Anyway, that should be enough to answer the "There is no rational justification for tape anymore" comment especially since tapes have also been improving over the years.

thousands of bloggers' output has evaporated by John+Hasler · 2009-01-02 14:51 · Score: 1

At least something good came of it.

--
Warning: this article may contain humor, sarcasm, parody, and perhaps even irony. Read at your own risk.

Mirror or corrupt database? by noidentity · 2009-01-02 14:56 · Score: 1

On the bright side, if the site gets clogged up with blogs again, they have a nice mirror of it being free of them, for quick restoration.

But wait... They're hosting experts! by bigtallmofo · 2009-01-02 15:36 · Score: 2, Informative

The company that runs Journalspace (or used to, anyway) is Lagomorphics. They will host your site for you...

http://www.lagomorphics.com/hosting/

At Lagomorphics, we're OS X hosting experts. We've been using the Mac mini and Xserve platforms for years, and we're proud to offer you the opportunity to use our colocation facility. Just send us your Mac mini, or let us provide the hardware.

--
I'm a big tall mofo.

Something is fishy here..... by FlyingGuy · 2009-01-02 16:07 · Score: 2, Insightful

For everything to be just gone and I mean LONG gone, then something besides a truncation or un-linking of the file had to occur.

Now I don't know all that much about the apple file system, but I would imagine it is like most file systems in that it links clusters and sectors of data together using some sort of allocation table, hash, b-tree or something.

Now unless they had file scrubbing turned on and the OS purposefully went out and overwrote every segment of the file with 01010101 and 10101010 then the vast majority of the data should still be there, at least I would think it would be. I mean even the nastiest revenge oriented guy, would have to be able to invoke some kind of program to do that.

I am assuming that it was an SQL database of some flavor. I don't know much about MySQL internals but I am pretty sure a

delete from table

simply goes through the index and marks pages deleted and does not physically go out and scrub ever page that has data on it. I know that is how Oracle works.

So this leaves me wondering about the data recovery house.... I they were doing a sector by sector read on the entire drive ( either of them ) they should till see all sorts of data on the disk. Now I don't know if the database compresses data on the fly ( some do, some don't) and I don't know if drive compression is an option on OS-X. If so, I can see where they would see just mostly larges amounts of compressed data ( making things VERY difficult if not impossible to recover, but baring that, most OS's have the hooks built in do simply do a sector by sector read of the storage device and although your binary data ( images and the like ) might be unrecoverable, you could probably get most if not all of the text.

Just a thought, but hey I might be crazy, it is just the hacker in me that brings these things to mind...

--
Hey KID! Yeah you, get the fuck off my lawn!

Compression. by bored · 2009-01-02 16:13 · Score: 1

I work for a Disk->Disk->Tape vendor. There is definitely space for tape, for the following reasons.

Compression: Tape drives do in-band compression. This means that the LTO4 cart at 2:1 data can read/write data at over 300MB/sec. It also means that your getting 1.6T on that $50 cart. The lowest average compression ratio I've seen at a customers site is 1.5:1.
Encryption: The most recent versions of LTO/T10k/359x all support in-band encryption at full data rates. This means that when you backup 5T a night and send it offsite and it falls off the truck you don't have to worry about your data.
WORM: You can buy WORM media, which allows you to conform to various government regulations for track ability and accountability if you happen to be a health institution, financial institution etc.
Robust long term storage: Tape is rated and guaranteed for significantly longer than any disk subsystem on the market
Offsite bandwidth: Tape is suited to shipping, and its far easier/cheaper to send 10T of data on a half dozen tapes via fedex every night than trickle it over a high bandwidth high cost interdata center connection.
Power costs: With tape you can have exabytes of storage (near)online, in a library for just pennies a TB a year in power/cooling costs. MAID type disks help to fix this for disk, but the power costs are still significantly higher due to periodic spin-ups, keeping the electronics online, etc.
Raw bandwidth scalability/backup window: When configured properly, i've seen tape backup systems getting 30GBytes/sec+ the bottleneck is almost always how fast the source database disk, or whatever can feed the data at. The flip side is the price of the disk necessary to feed at these data rates. At those data rates the disks cost 1000x as much as the tape subsystem.

Serves em Right by Anonymous Coward · 2009-01-02 16:22 · Score: 0

Most brilliant IT folks by any other definition don't get that the DBMS vendors including Oracle, Sybase, Ingres (CA of old), Informix, etal. use every trick in the book as artifices to get decent performance. Hey, I use the commercial vendors in major applications, so I'm not being a bigot in any sense. I'm just bemoaning the idea that DBMS's tend to be massively overrated as secure reliable technological mechanisms for storing data. Yeah they can be any one of a variety of "ilities" (you know reliability, availability, etc.) but at a typically huge cost, and not always one which has any bearing on rational $ per increment of functionality.

Most specifically at issue here (in a design which appears to have attempted to avoid the vendor proprietary solutions) seems to be the fact that much of the performance for transaction based database applications depends upon huge (multi gig) physical memory caches of which normally would (in a ISAM or other file based sense) be written to the disk regularly, and in the best cases asynchronously. Sure the file approach has huge downsides, but my comment is strictly about the technology solution which has evolved in the market, and is being bought by notionally smart folks. In this case the IT pilot fish seems to have ignored that the DBMS caches would have a tight affinity in terms of node, and cpu (in a multinode or multi-processing environment), and that the large caches would have to be regularly flushed (also costly in terms of performance) to avoid potential corruption so...

Despite depending upon great SATA and/or RAID or even fiber channel disks to write everything twice, the timing of any failure would make it nearly absolute that even a well timed flush of a large memory (dbms) cache and the subsequent write to disk command of a large multigigabyte cache would not be complete (== corrupt transaction state, corrupt logs, with no time to rollback) on both disks to meet the demands of a simultaneous kernel panic or hardware exception. No disk is that fast at this point, even SSD.

I guess I would chalk this one up to naivete or ignorance on the part of the application designers or architects. You get what ya pay for. Kind of hard to have sympathy. I cleaned up a couple of messes like this when I worked for DEC.

Re:Serves em Right by FlyingGuy · 2009-01-02 17:46 · Score: 2, Insightful

I see your point, but something about this does not pass the smell test.
To have nothing on the HD(s) then someone had to very very carefully wipe the entire disk by overwriting every block and sector that the data occupied, and that would have made whatever DB system shit its pants as it started seeing data disappear so it would have been really obvious, really fast that something was amiss and as you relate would have more then likely caused a kernel panic and or at least a core dump of the DB system.

--
Hey KID! Yeah you, get the fuck off my lawn!
Re:Serves em Right by Anonymous Coward · 2009-01-02 19:57 · Score: 0

And where would this core dump get written?
Re:Serves em Right by FlyingGuy · 2009-01-02 20:32 · Score: 1

Huh?? I don;t understand your question in the current context. Now we are missing some information, like were both the HD's completely wiped? I don't think you can completely wipe a running systems HD without the OS complaining very loudly.
Hence my my notion that the data file might have been simply un-linked or truncated, but for the DBMS to have overwritten every record, every index, in every table seems a bit on the fishy side.
See my previous post on the subject.

--
Hey KID! Yeah you, get the fuck off my lawn!

cron plus rsync equals survival by gregraven · 2009-01-02 18:02 · Score: 1

An account with rsync.net and a couple of cron jobs would have saved their butts.

--
Greg Raven
As long as there's any left, I'll take mine first.

Common confusion: Sectacular vs likely by darkonc · 2009-01-02 18:30 · Score: 1

In my experience, the most common use of a backup is to recover from "I deleted file 'X', but it was a mistake". Drive disasters, however are more spectacular, so they're what your average PHB is going to hear about. Thus it is that an exasperated IT guy might get told by his PHB: We don't need backups. That's why we mirror!

It's just like the way that -- despite the fact that you're far more likely to be killed by a drunk driver than a terrorist, (this may even hold true for members of the US military), it's the terrorist deaths that make the news....

Think about it for a minute -- If, at the top of every hour, CNN spent 30 seconds on each DWI associated death during the last day,

they wouldn't have much time for other news, and
people would stop watching

This wouldn't be that much of a problem, however, if it wasn't for the fact that many members of the public -- and especially decision-makers tend to choose their response to 'reality' based on what's in the news... It's that kind of confusion between what's in the news and what's commonly occurring that often results in tragically incorrect focuses for la rge chunks of society.

--
Sometimes boldness is in fashion. Sometimes only the brave will be bold.

It never goes away by Raynor · 2009-01-02 18:52 · Score: 1

Nothing is ever deleted from the internet... that's more true than false.

Between Google Cache, Archive.org and local temp files, I would expect a gross majority of the information is recoverable if the people know what they are doing.

I had a professor who had accidentally corrupted both his website and the backups. It took all of five minutes to recover good copies.

--
"Dictator Flakes. They WILL be delicious."

Real men don't backup by Anonymous Coward · 2009-01-02 19:23 · Score: 0

They just upload their shit to an FTP site and have others back it up for them.

Wow by Anonymous Coward · 2009-01-02 19:55 · Score: 0

Running Windows without Cygwin is even more hardcore than running an internet company without backups.

Mirroring can be a solution by CormacJ · 2009-01-02 21:59 · Score: 1

Assuming that you have spare drives, you can use mirroring as a backup solution.

I had a huge database that I was was responsible for and we'd lock the database and split the mirror, take the drive offsite.

If the system died, we had a spare drive available for immediate recovery.

It's all in how you do it.

the timing does suggest a "time bomb" by Anonymous Coward · 2009-01-02 22:25 · Score: 0

seriously, I can see this disgruntled admin thinking, "let me set the whole thing to disappear on Jan 1, which is far enough away from my departure so no one will directly connect this to me"

Two comments: by RichiH · 2009-01-02 23:49 · Score: 1

1) This: http://journalspace.com/this_is_the_way_the_world_ends/not_with_a_bang_but_a_whimper.html is amusing. "The guy who I fired for stealing and who told people how smart he was did not have backups. After he left, I should have checked on stuff." -- This is wrong. If you don't _know_ proper off-site backups exist at any time, you are making a huge mistake. Every single day. Your responsibility does not start when you fire some guy. And in a shop small enough that one guy can handle all IT, the boss of a blogging _website_ _must_ know that.

Not much he can do now and no use crying over spilt milk. But to imply that his (shared) responsibility is less than 100% is a joke.

2) To save affected bloggers the trouble of posting: http://i154.photobucket.com/albums/s257/MyDoom111/btarded/outrage4yvdj3bf67oq.jpg ;)

rm -i doesn't save you... by Sits · 2009-01-03 00:05 · Score: 1

...when you add -f on the command line. There's a warning about why trying to make commands safe automatically is not a good idea in the Unix Haters Handbook (Changing rm's Behavior Is Not an Option section). Better to have an entirely different command written by yourself that won't vary it's behaviour platform (and at worse will be missing)...

My own cautionary tale (unrelated to the GP) is don't delete directories you think are completely empty by using rm -rf. Use rmdir - it's safer.

Oh and don't use -r if you aren't actually deleting directories. And watch out for GNUisms on GNU userlands - that * might mean more than you think it does on Linux...

Which is why you need an offsite data backup! by Gravesie · 2009-01-03 01:02 · Score: 1

I've been looking for more stories like this - i.e. people who failed to back up and their businesses closed. We know there are lots more out there but this is a real coup. Another guy had to re-do three months worth of accounts, but that doesn't even register on the scale of this one. Just goes to show yet again that offsite data backup is the only solution for anyone planning to stay in business.

--
Peter Graves Channel Computing

If the drive really is zeroed by Sits · 2009-01-03 01:05 · Score: 1

I'm not sure what you would get back (aside from remapped sectors). Years ago it was definitely possible to get significant data back from zeroed (as in dd if=/dev/zero rather than a "format") drives. However in recent years there is apparently so little space and densities are so high very few bits can be recovered. The reasoning for this is that there have been cases where drives have been blanked and vast sums of money have been at stake but the data was not recovered (old Slashdot thread about data recovery questioning if it's possible after one pass).

If you have a reliable link saying that this is to the contrary I would really like to see it though. Solid state disks are a whole 'nother kettle of fish though.

Mandatory Hui Neng quote by xactuary · 2009-01-03 01:44 · Score: 0

Bodhi is no tree, nor is the mind a standing mirror bright. Since all is originally empty, where does the dust alight?

--
Say hello to my little sig.

14,000 uniques a month? by level4 · 2009-01-03 02:56 · Score: 1

That's less than 500 a day! Christ, my personal blog gets more than that. Double that, in fact. And it's not like I put any effort into it whatsoever. It's roughly half talking about bands I like and half ranting about Ruby. No professional ambitions or concessions whatsoever. Not even updated regularly.

So isn't that pathetically low by any modern standard? Who could possibly make any money at all on 14k uniques a month?

I was under the impression that numbers like this were really low and basically meant nothing. "Enough traffic to call yourself popular" starts at about a quarter million uniques a month in my mind, and that would be the absolute minimum.

Don't mean to add insult to injury here, but if you've been soldiering on for more than 6 years and have less traffic than some random guy's zero-effort personal blog, then maybe you should just give up.

--
Let my new 7-digit UID be a lesson to all - write down your passwords.

wake up!! by Anonymous Coward · 2009-01-03 03:46 · Score: 0

coward.. u can easily show and explain the difference + in a mail to the boss if needed.. It *is* an unthankable job in most cases, but it *is* part of your earning a salary.

there are already comments here, and this story itself should provide enough material even for the most staunch believers of mirroring..

Now... ask yoourself how *valuable* are those 50TBs?

thousands of bloggers content gone?! by tfiedler · 2009-01-03 06:16 · Score: 1

Oh wait, who cares but the other bloggers that think the world revolves around them? No one.

--
Democrats and Republicans are like AIDS and Cancer, I want neither!

Bare SATA drive caddies by Kadin2048 · 2009-01-03 18:50 · Score: 1

There are actually several products like that now.

One is this one, which has USB and eSATA -- it's probably available on other places as well, but who doesn't like ThinkGeek? They don't say the manufacturer, but if I'm reading the photo correctly I think it's "NewWave". It goes for $40. In the ad copy there's a mention of a 1TB limit but I don't know if they really mean that or not.

Another, nicer option (IMO), is this one from NewerTech; they call it the "Voyager Q" if that link dies. It speaks USB, eSATA, and both 400 and 800 Mb/s FireWire. MSRP is $100. It specifically mentions that it's compatible with 2TB and larger drives.

I've seen several other models floating around from various manufacturers (some I think are just re-brandings of the same product ThinkGeek is selling) that are all substantially similar.

I'd be a bit concerned about putting a drive through too many insert/remove cycles -- the internal SATA connector isn't really made for repeated connection and disconnection -- but for backup purposes it's a pretty darn slick idea. I'd been thinking for a while about getting my SCSI tape drive set up and working again, but I think I'm just going to do 2.5" disks instead. Yes, the tapes admittedly last longer, but this assumes you can get the equipment to read them (and DAT was always a bit finicky in this regard). Plus, there's no comparing the sheer volume of media produced; it's a lot easier to take care of a small stack of hard drives (for under $100 you can get an air/water-tight Pelican case and a small media-rated fire safe and keep a few drives secure against just about anything except getting nuked) than it is to try and protect a big stack of tapes.

--
"Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."

Three words. by Anonymous Coward · 2009-01-03 20:19 · Score: 0

Multiple. Offsite. Backup.

Re:IT people by conureman · 2009-01-04 05:12 · Score: 1

I question anyone who claims to be "Professional".

--
The cost of that cleanup, of course, will be borne by taxpayers, not industry.

Anonymous Coward by Anonymous Coward · 2009-01-04 14:37 · Score: 0

The JournalSpace blog

http://journalspace.com/blog/

What were they thinking? by Juandachu · 2009-01-04 22:41 · Score: 1

Here in my company, which is in the earth moving business, we have daily backups to tape drives and weekly offsite storage. This is a company who invests very little in IT (our department is just me and one more person for some 2500 total employees) and even we can get that right...

BOFHalicious by nweyer · 2009-01-05 05:01 · Score: 1

LUSER: "hello, support? our db server disk is full" ... BOFH: "clickity click click" .... "nope - looks empty to me"

Slashdot Mirror

Why Mirroring Is Not a Backup Solution

711 comments