Tech Magazine Loses June Issue, No Backup
Gareth writes "Business 2.0, a magazine published by Time, has been warning their readers against the hazards of not taking backups of computer files. So much so that in an article published by them in 2003, they 'likened backups to flossing — everyone knows it's important, but few devote enough thought or energy to it.' Last week, Business 2.0 got caught forgetting to floss as the magazine's editorial system crashed, wiping out all the work that had been done for its June issue. The backup server failed to back up."
Then the swearing started again.
The simple truth is that interstellar distances will not fit into the human imagination
- Douglas Adams
who needs a magazine?
Ha Ha!
Maybe not so bad as losing your entire monthly product, per se... but it does happen. I'll bet their accounting, HR, and other back office systems are probably fine. This stuff is always ugliest at the department server level in smaller operations. I'll bet they get some good Mea Culpa 2.0 editorials out of it, though.
Don't disappoint your bird dog. Go to the range.
Some stories should just come with Nelson Muntz sound files embedded.
Ha-ha!
erroneous: look me up in a dictionary
*coughs
TFA: Business 2.0 never had to rely on their backup software until that day, which is why they probably did not realize that it was either obsolete or dysfunctional.
sorry, their MAIN problem is not in any way a dysfunctional backup system. ever heard of verifying backuped data?
I imagine that they still can resemble a lot of it from other files - they should still have all the layout pieces for one, and all the authors ought to have at least rough drafts of their stories on their personal computers. The deadline's screwed, but they can probably get it out a few weeks late (or in July, depending on how often they normally publish).
Why isn't it a default for an OS to ask where the backups should go when it is installed? Backups are like any other kind of security ... necessary. You shouldn't have to hunt for the instructions on how to back-up, they should be in your face.
Oh God! I'm sure we're missing soooo much!
HAHAHA
No June issue?
That's OK, nobody reads Business 2.0 anyway.
It seems unlikely to have crashed in such a way that a data recovery specialist would be unableto get most of the data back.
But whatever the case - there is a useful lesson here. Make sure your backups are backing something up.
Hold true, once again.
sometimes, nothing.
This reminds me of the recent uproar over a car crash involving the New Jersey governor. He was critically injured because he wasn't wearing his seatbelt, and people freaked, asking what sort of role model he could possibly be. I argued that he was an awesome role model, because sometimes people need to see a mistake end badly for someone else before they'll do what's necessary to protect themselves from making the same mistake. Seeing a high-profile magazine get hit like this can do the same for backup slackers the world over.
I don't know about you people, but after reading this (and giving it the "haha" tag) I'm going home and catching up on a couple of backups I've been slacking off on for a while.
Slashdot Burying Stories About Slashdot Media Owned
As long as those magazines that come in the smarmy black plastic covers still arrive I can't complain.
-m
Anyone know what "editorial software system" (CMS) they use?
There aren't a lot of ways for a machine to "crash" that loses all its data. Even a lightning-fried hard drive can have its platters removed by a data recovery lab and many files can be pulled off. A mechanical failure doesn't grind the platters into sand. As a network server it really should have a RAID too. So how exactly can "the server crash" so spectacularly that the RAID, backups, and widely available data recovery services all fail? Did the building blow up?
Tens of IT managers are getting Hundreds of IT minions to check Thousands of backup tapes and befor a senior manager walks in.
In the not too distant future, next Sunday A.D.
They should have just installed p2p software and had backup with multiple off-site copies. (Okay, maybe some copyright and security issues, but one fix at a time.)
the problem was, as always, not the backup. I've rarely seen problems resulting from the backup process. The troublesome process is the restore. Or as a friend put it once:
Nobody wants backups, what everybody wants is a restore.
In my twenty years of IT i've seen several companies making backups like a well oiled machine. The backup process was well documented and everyone was trained to a degree, they could do it with their eyes closed. But everything fell apart in the critical moment, because all they had planned was making the backup. Nobody ever imagined or tried a restore on the grand scale. So they ended up with a big stack of tapes with unuseable data.
Backup is the mean, not the goal.
Regards, Martin
And the server wasn't setup with a RAID1 at least for the partitions that hold critical data?
Backup alone is not enough, in some cases there needs to be multiple levels of protection.
Mistakes
"It could be that the purpose of your life is only to serve as a warning to others."
The International Herald Tribune http://www.iht.com/articles/2007/05/01/news/magazi ne.phparticle about it makes a good comment.
Business 2.0 has drawn attention for skewering firms, including, occasionally, other magazines, in its annual list of the "101 Dumbest Moments in Business."
I wonder if will include itself in the upcoming one?
They should also have remembered the old saying "There are two types of computer users, those who have lost data and those who are about to..."
The first issue of Business 3.0.
http://www.nytimes.com/2007/05/01/business/media/0 1mag.html?_r=1&oref=slogin
How about doing a restore practice run whilst at it? Just kidding. Sort of. ;)
ISO certified == THX certified
backup of that 2003 article?
I'm just providing them with free off-site backup, that's all.
Maybe some White House officials had secondary email accounts on the system.
Since the parent company, Time named "you" the person of the year they were simply following "you" and not doing regular backups
End of line
The problem comes with the need of a restore.
There are invariably 2 possible situations that can happen when a restore is in order:
Either the backups simply don't work 'cause they've been using the same media for 5+ years and nobody ever bothered checking the error messages (or because there were no errors produced in the first place 'cause the backup was done without a verify).
Or the last person who knew how to do a restore with those tapes was fired a year ago and the others just followed the backup procedure which has been documented down to the most trivial steps, but there isn't a single piece of information to be found how a restore would work. The reason for this is simply that, since that techie has to get a vacation sooner or later, too, even the secretary has to be able to do a backup (thus the documentation), but the techie alone is good enough to know the restore procedure.
This again is due to:
a) Him being too lazy to write down the procedure (hell, without his boss requiring him to write down the backup specs he wouldn't have done that either) and
b) His boss having not the foggiest idea that restores could be a tad bit more complicated than "doing a backup in reverse".
We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
Huh, I guess I wasn't paying attention to when Slashdot turned into Digg, even though I read both. Here's a link to the original article, rather than what might be a splog. Especially since the article text was copied verbatim.
recovery systems and methods.
Having a backup you can'r recover from is useless.
The Kruger Dunning explains most post on
OMG, this happened to me too! Instead of Business 2.0, they switched me to PC Magazine, one of the biggest and most annoying MS shills. Not only had I lost a cherished subscription, but it had been replaced with the very antithesis of what I had subscribed to. And!, I had a three year paid subscription.... I tried unsuccessfully to get a refund, and eventually just resorted to tossing the new PC Magazine issue when it arrived. What a ripoff!
If they deleted the files in error, then the RAID would faithfully mirror that deletion across all physical disks...
I know that where I work we run a Disaster Recovery (DR) exercise every time we do either a hardware or software upgrade to prove that everything still works. If nothing has changed one is done every second year. We actually pull our off-site production tapes and restore to a new machine that is not in the same city where the current production machine resides. It may be over-kill for them, but, a test of that for them (or any other business) would be a fruitful exercise in that they prove that the backups are good and they can restore from a given point and carry on with a minimal loss of work.
For one of our server apps we actually have two laptops configured with all of the required software and we do restore production data from backups on a regular basis as we use that for our system testing on projects. This happens several times a year so we know that the backup and restore procedures truly work. It is also very cool walking in to the client site, plug in the laptop and show them that in an emergency they have a working machine very quickly. Not as fast as a server, but, it gets them a working machine until the replacement server arrives.
Panic now, beat the rush!
101 Dumbest Moments in Business I think they might want to revise their list. I'm sure I would like to
At home I back things up to my Buffalo LinkStation fileserver, but I also make secondary backups to other boxes and to the same box on which I work (different disk).
At work I tend to back the UNIX stuff I'm working on to my PC, my mainframe files to my PC, and my important PC stuff to either the UNIX box or to CD-R. The UNIX stuff is also in CVS.
I've invested a year or more in some of this software, and I don't want to lose it even if the entire data center fails.
Mainframe/UNIX Bit Twiddler and long time Windows/Linux Hobbyist.
The Theorem Theorem: If If, Then Then.
SSIA
I can't believe I was the first to post that.
Felt a great disturbance in the Server, as if millions of files cried out in terror and were suddenly silenced...
I hadn't had my morning coffee yet...
You know it....
Their next article on backups and recovery will be damned good...
And their next review of the backup software they use won't be written by the vendor's PR flack...
deleting the extra space after periods so i can stay relevant, yeah.
This is my story and I bullshit you not! I work for a manufacturing company, the second largest in its field in the world. Great. However the boss really does not like spending money. We eventually got a backup system using offsite backups (with a special client) and it seems to work ok. However, when it got to 100 GBs I was told to start pruning stuff. So I did. Long and short of it, even with the most important files backed up, we still have most things not backed up. Basically I have almost half a TB of data that I am not allowed to back up because its expensive. I can only backup 5 days worth of data as they are unwilling to pay anymore money for it. The fun will come when someone wants a restore from last year. This people, is the reality sometimes. Me, well, I really dont care anymore. Im sick of having servers, important, mission critical machines sitting on single IDE disks. We sell online, great, problem is our firewall is non redundant single IDE disk. If it goes (like it has in the past) we were down for days, loosing emails, web traffic, web orders, remote ordering systems, EDI data, remote sessions, ftp, everything. DR? the solution proposed by upper management is, oh we will buy some dells and restore. Yeah thats a good idea. After waiting a week for them to arrive, what exactly are you going to restore ? This is more typical than you think, unfortunatly. Im just the guy that has to make do with what i can. No doubt when it fucks up I will be blamed.
http://www.writeitfor.us - Writing IT for the IT generation.
Mirror your servers, not just your data.
One crashes, you reassign an IP and everything cruises along as before.
I've had to do this TWICE in the past year. It's a real timesaver.
Jerry: I don't understand. Do you have my data?
IT: We have your backup, we just can't restore it.
Jerry: But the backup keeps the data here, that's why you have the backup!
IT: I think I know why we have backups.
Jerry: I don't think you do. You see, you know how to MAKE the backup, you just don't know how to RESTORE the backup. And that's really the most important part of the backup: the restoring. Anybody can just make them.
All about backups & Verification of Server Archives...
From first hand experience.
"I think I speak for most of us when I say .... (Score:-1, Redundant)"
Ha ha!
"I like to lick butts!" by MobileTatsu-NJG (#32700246) (Score:5, Informative)
Time to fire the infrastructure team :\
My Motto for Backups is: "I have never had a backup that failed to loose information." or "All my backups have failed to restore in the critical moment."
After a few backup/disaster/restore cycles, I learned my lesson. A backup is only good if you check that you can actually restore it, and are able to restore it easily with freshly purchased computer equipment. Always assume that when the thieves/flood/disaster comes, you are left without any equipment whatsoever.
When disaster occurs, you need a credit card with a big limit, and the ability to restore onto the hardware that you just purchased that morning from the local Computer Store/Staples/Best Buy. Those expensive tape backup drives are difficult to replace quickly. If you use one, you need an additional tape drive off-site that works. Further, make sure you have the disaster recovery software and operating system software with you off-site.
A few hard drive images stored on portable USB drives can be a really handy component of the emergency backup strategy.
Saying the "editorial system crashed" is the modern day equivalent of "my dog ate my homework". To all the PHBs out there, when you read a phrase like that I want you to replace it with "I am too incompetent to do my job!". I'm tired of hearing people talk about computer systems as if they are some wild, uncontrollable magic when anyone who have worked in I.T for long enough knows that exactly the opposite is true -- the only difference is knowledge of how the system works.
No trees were harmed in the posting of this message. However, a great number of electrons were terribly inconvenienced.
smarmy: adjective ( smarmier , smarmiest ) informal - ingratiating and wheedling in a way that is perceived as insincere or excessive : a smarmy, unctuous reply.
// This is not a sig.
What I want to know is how they got that far with SOX auditing? We have to verify EVERYTHING for them.
I work for an IT department that had a serious crash that resulted in lost data.
Contracts, reference materials, source code, passwords, you name it, some of it was lost.
For us it happened because a tech without a lot of experience set up a new server many years ago. As years passed, he gained competence by leaps and bounds and the server he set up also gained a role as being relied on much more than it was originally expected to.
One day a drive crashed (hard) and it was then we discovered that it was RAID 0. Yes, the wails of anguish were painful to listen to. The tech was sorrowful but somewhat supported by the fact that management knew they were sending a rookie when he set it up the first time and also by the fact that nobody ever checked to see if the rookie had done the job right. Of course, he would never make that mistake now, but we all make noob mistakes when we're noobs.
So we sent the raid set out and they recovered the data, reconstructing piece by piece but many files were damaged beyond repair, some probably forever lost and the recovery process took a month when our department had come to rely on it for daily interaction.
Backup time right? Nope, it turns out that the backup software had been so overburdened that the department which managed it had started making judgement calls about what needed to be purged and what needed to run. Everything critical was on tape it was reasoned... but recall this server was not originally intended to be a critical server, and thus had no tape backup. The disk based backups had long since been purged and this server was (still considered non-critical by the people managing the backups because they'd never been told differently) without any backup, save those made before its role changed, which for all purposes were useless.
That is how it happens.
I see a lot of comments saying that you must test your backups, you must make sure your backups are successful but I don't see a lot of comments relaying this first rule of backups: MAKE SURE YOU ARE BACKING UP WHAT IS ACTUALLY NEEDED.
Nothing else. Backup has to be tested regularly. And it costs money. These people deserve exactly what they got.
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
Too bad they don't have a rating for this other than funny. I've seen this mindset in too many companies regarding disaster recovery. Unfortunately, the IT guy ends up holding the bag when things go south.
I suspect that, in the not too distant future, we'll be seeing instances where backup media gets destroyed by those battery free emergency flashlights that use simple shaking and a capacitor as a means of power. They usually have a warning saying not to have the things within a couple of feet of magnetic media.
If you DO have one of those flashlights in your desk or vehicle, check the instructions on them. You might be slowly killing any backup mag media you have around it.
Lots of people do backups because they are relatively easy. But doing a restore, especially with expensive hardware, can be a major pain because you may not have a test machine to restore it on.
Then you need to add in the time factor of the restore. I remember one case where the backup and restore worked perfectly, but the amount of time spent restoring things was eating into the time available for running payroll. We had to do a partial restore of essential payroll files, and let the rest hang, in order to get payroll out on a timely basis.
Oddly enough, the machine was going to be upgraded to RAID-5 a couple of months after the crash. There had been a little complaining about the costs of the upgrade before the crash. That complaining disappeared about half way through the restore process.
If management is going to brag about cost savings, make sure that you get documentation on their comments and your warnings. That way, if/when things turn to slime, you can put it in your resume that you tried to warn them.
This may be needed for your personal recovery plan. It may also be needed if lawyers get involved and you end up facing charges.
That's the usual fly in the ointment with backups. Backups are difficult and expensive. Management always thinks they can make them easy and cheap by buying some automated solution. But the main cost of the backup is regularly testing the backups... which can only be done properly by doing a full restore... which requires available disk space equal to the size of the backup, and hours of operator time.
Story #1. Fortune 500 company. Lost some source. Big brouhaha. Edict went out: all files are to be backed up to diskettes and the diskettes sent to offsite storage which the management had contracted for with an outside firm. It took a lot of extra time, but people did it. After about two years, an important server with source code for a major product crashed. Developers tried to get the source back from offsite storage. It turns out that nobody at any point had taken any responsibility for cataloging, identifying, or indexing the diskettes. The diskettes might as well have not been labelled: the developers couldn't identify what diskettes were needed, and the offsite storage firm couldn't have retrieved them if they had.
Story #2. Medium-size scientific research organization with a Digital 11/70 running RSTS. Enlightened manager pays operator overtime pay to stay late three nights a week and do backups. Backups are performed with the "verify" option enabled. Tapes are placed in a fire-resistant tape vault every night. But no actual restores are performed. Database (Oracle, in the days when Oracle Corporation's name was still Relational Systems, Inc). is corrupted. A restore is attempted. It transpires that this version of Oracle uses the maximum record length for its files, which happens to be 65,536 bytes, and the Digital-supplied backup-restore utility... you guessed it... has a bug with records of that length. Yep. Writes 0 bytes, verifies 0 bytes.
Story #3. I worked at a place that recommended that individual developers perform individual backups using a cartridge tape system and some standard PC software. I set it up. There were two "verify" options. One used the cartridge system's read-after-write feature to read every block as it was written. The second performed the entire backup, then verified the entire backup in a second pass. Took twice as long, of course. I opted for the second method. The problem was: more than half the time, the verify would report one or two errors. And for some reason, probably efficiency of use of the tape, it didn't write file by file, it munged them into blocks. And it didn't even report the names of the files affected. Just "2 errors were encountered" or something like that. So, when that happened, I didn't see that a rational person had any alterative except to perform the whole backup again. And more than half the time, it would report a couple of errors the second time, and...
When I asked colleagues about this, it turned out that I was the only one ever to have picked the second verify option. Everyone else had picked the read-after-write-verify option, "because it was faster."
And told me not to fuss because "if it was only a couple of errors, the chances they were on files you needed to recover was too small to worry about."
"How to Do Nothing," kids activities, back in print!
used ext3cow versioning file system?
Excuse me, but please get off my Pennisetum Clandestinum, eh!
In the bad old days, one had to periodically backup, init the drive, and restore it, on a periodic basis for defraging the disk. This was before there were programs to do this automatically, and online.
This also had the effect of testing the full backup procedures. Not to mention spending a weekend at work.
Instead of Business 2.0 the June issue will carry the new name of the magazine:
Backups 101
It is a little bit better, but it still doesn't say much about the types of systems involved, both at the primary level and the backup/restore level.
If you are running relatively low cost stand-alone servers, then practice restores can be relatively easy with test machines.
If you are running something that is closer to a mainframe, practice restores can be hard because you may not have a test machine.
Although most of the comments have been regarding corporate back ups of huge amounts of data. Backing up data is just as important for the Home EU. I can't count how many times I have had customers bring in their units with fried hard drives that had bunches of irreplaceable pictures, and documents of all sorts they had downloaded, or created but had failed to burn to discs, or put on flash drives. Sometimes I have been able to get the drive working long enough to copy said files, but in most cases not. It is a message I preach and preach and preach, is a .25 cent disc, and 30 min. of your time to much to pay for these memories????
IF you can't be famous be infamous. But for GODS sake be something
Nothing to see here. Move along.
Have gnu, will travel.
Mirrored disks are great protection against hardware failure of the disk only. If the server crashed due to a software bug or human error it would not be hard to imagine both copies of the data being damaged. As an earlier poster pointed out - the data is critical but only for a short period of time and then becomes out-of-date. A disk to 2nd server or disk to tape backup is necessary is this type of environment, it is similar to a large development project. Constantly changing data needs to backed up. Anything that is backed up needs to be validated by testing the restore and recovery process. A good trick is to backup the data and restore it to a different partition / drive / LUN with a different name and see if the application recoginizes the data.
The shielding can reduce the coupling of radio waves, electromagnetic fields and electrostatic fields, though not static or low-frequency magnetic fields. (A conductive enclosure used to block electrostatic fields is also known as a Faraday cage.) The amount of reduction depends very much upon the material used, its thickness, and the frequency of the fields of interest.
Unless the safe was made of mu metal, I suspect the kettle could have demagnetized the tape if it was producing a low-frequency or static magnetic field.
The data recovery services I've worked with , Ontrack etc, can respond same day. If you can wait a few days, days not weeks, you will probably only get nicked ~$2,000. Unless the drive is physically damaged the entire process would be trivial for a company of any means.
I'm sure your right about the deadlines being a huge problem if you miss them, but unless its just literally about to print in most situations a recovery service should be able to help you out.
If you wanna get rich, you know that payback is a bitch
Setting up a regular, daily or weekly backup while not necessarily a super simple task, is pretty goddamn trivial. Esp. for a large business like Time.
Non impediti ratione cogitationus.
Soooo...What kind of software was running on the server machine(s) that failed... Linux, OS X, BSD, Windows? Also, what software gave empty or unusable backups without error or warning messages?
???? Please clarify.
"I don't know, therefore Aliens" Wafflebox1
it's called backup software, not restore software!
so it does backups. that's all.
You would be better off to buy a restore program. That usually would need to have a backup part build in to actually ba able to do the restore.
Atari rules... ermm... ruled.
--
.nosig