Data Recovery from ReiserFS RAID Array?
Ruatha asks: "We've recently had a problem with a ReiserFS RAID-5 array - two of the disks failed and, of course, some of the people using the array didn't have backups of their data...Ontrack have returned the disks because they can do nothing with them due to the FS we used on the array. Does anyone know of a company that can deal with data recovery from a ReiserFS RAID-5 array?"
Can the users who were not responsible enough to back up their data work some overtime and recover it the hard way? It doesn't seem to make much sense to go to all that trouble and expense to have an outside source recover data on the disks, when the end users showed lack of responsibility in the first place. Was there any sort of policy like this present at your company?
I should have picked out the nickname Demosthenes!Tecumseh.
Well, i guess you are screwed.
If i am right, in most raid configurations, you can resurect the data if one disk is missing, but two and go fetch the tapes.
The problem is that tape backup is too expensive, as it has been discused recently here.
and i guess it did...
What we see depends on mainly what we look for. -- John Lubbock Now search for that bug slave!
Could they have recovered from ext3?
Dave
I write a blog now, you should be afraid.
from google --> datarec
Am I the only one wondering how 2 disks failed at the same time? What happened? Power surge and no power strip or ups? I don't really want to point fingers, but the odds of two disks failing at the same time seems really, REALLY, freaking slim. Like if this happens to you, go out and buy a lotto ticket.
Could they have recovered from ext3?
Very likely, yes. Ext2 recovery techniqes are well known, documented, and tools exist (if rudimentary) for recovering files from it. I believe that this translates well to ext3.
Also, if you want someone to recover the data and you're willing to drop some money on it...I strongly suspect that there are people on the reiserfs team that would take on a recovery job quite happily. No one knows reiser better!
That being said, there is currently no good, easy-to-use, powerful recovery tool for Linux filesystems, rather depressingly. Now, you *could* argue that this is because the filesystems are so great, but even so...
May we never see th
First of all, since _two_ disks got screwed at the same time, you've lost the "normal" chance of getting the data back. RAID-5 ensures that if ONE disk fails, you can get the data back due to the parity-stuff - but not if two disks fail. You probably know this.
;)
So, what needs to be done, is to get one of the disks back online again. That *should* be possible, and if nothing really really bad happened, should, i think, in theory, be as easy as getting a lab to pull the disk-platters out and put a new motor / new electronics on them. I'm not sure about this though
Preferrably it could be done by the disk-manufacturer.
You could also check out an excellent company in Norway called IBAS. Check out http://www.ibas.com/america/index.html for their american office. They are really excellent at data reconstruction.
"Rune Kristian Viken" - http://www.nwo.no - arca
This may or may not work, however, I've successfully recovered data about 10 or 12 times using this method.
Find a working drive of the same model, take the electronics board off of it and swap it onto the bad drive. Typically when I have a drive fail, it's the electronics, not the mechanical portion of it. So far, this has worked every time for me, one was a Quantum Fireball, and the rest were all Seagate SCSI drives (some FCAL and some ultrawide).
If you had two disks fail at the same time, chances are it's the electronics. Once you recover the data, I would take a serious look at your RAID controller and possibly replace it. I had a bad RAID controller that kept frying drives, and once I replaced it I didn't have anymore problems.
Need Free Juniper/NetScreen Support? JuniperForum
I just heard a tip from a hardware guy. He says you can "freeze" a drive in a freezer to make it work for that one last spin. You'll have to be fast though and copy the stuff quickly somewhere else. After it warms up again its gone for good.
He told me this since his laptop hard drive went bad a few days ago and he had succesfully used the trick on it, saving all the data one would have thought lost.
According to him this should work on both drives with broken electronics or broken mechanics. He said he has many theories why it works, but has proven none. YMMV.
Assuming you can find one of them, use google for gods sake. Get them to read the drive linearly. At this point, they should be able to put each drive onto new drives for you. Build a raidtab that matches the new drives they give you, do raidstart /dev/mdXX, do a fsck.reiserfs on it. Mount the thing. Poof like magic you have data again.
I've heard of several people recovering Ph.d and masters disertations this way. It's why the DoD has such strict standards about destruction and disposal of a hard drive. It's very difficult to delete data from a drive so it can't be recovered. It's can just get out of control expensive to recover the data.
If it's worth $5-10K, and you can deal without having it for several days, this might actually work... If the company bitches, just tell that it's about what a good tape setup would have cost if they had bought one.
The two guys who said try stripping the electronics are pretty brave SOB's, but that might be worth trying if you can't spring for some experts to deal with that for you.
Kirby
For me the guys are www.vogon-data-recovery.com
There are others, but I always seem to see these guys at the forefront...
just my 2 pence..
It is very obvious, but the same guys that develop ResierFS offer support for this kind of problems.
Go to Namesys.com, and purchase support there. Maybe they can help you.
I work on the design of hardware RAID controllers and the original post is missing a key piece of information -- are the two drives really bad, or were they just marked as bad by the RAID driver? I ask because often (particularly with parallel SCSI drives) you have a bad connection or a bad drive which causes one drive to hang, which makes the RAID set go degraded, but since the bus is still corrupted the next drive it tries to talk to will also appear be bad so the array is marked offline. This happens at least ten times as often as genuine double drive failures. How the RAID software reacts to double drive failures depends upon the author. You should have some kind of log or console printout -- the timestamp of the errors is the tell-tale clue, if the failures happened within a minute or so then you can be pretty sure that only the first failure is real.
First thing's first -- put in a set of scratch drives and see if the bus and HBA is working ok. Test it thoroughly! Then, using a read-only tool like IOMeter, check each of your original data drives individually to see if it can read reliably across the platters (If using NT for this test, do NOT let DiskAdmin write signatures on the drives!!). Hopefully one of the drives will be bad -- if so, set it aside. Perhaps you are certain you know which was the first drive to fail -- remove that one if you know it. Reboot, and see if the array metadata is recognized. If not, concentrate your efforts on fixing the second drive to fail. If there is a gap in time between the first and second drive failures then the data on the first drive to fail is no longer of any use to you as it is out of sync with the other drives.
If you have more details please post them here and I will try to give you more detailed advice.
One other soapbox comment -- people who sell RAID technology should always provide some kind of metadata debugger because, as they say, sh!t happens.
I lost 40gigabytes (of mostly irreplaceable!) data on a 40gb UFS filesystem.
:(
It seems I am SOL.
I have searched and searched and SEARCHED high and low on google and other search engines. There appears to be no UFS undelete tool. extfs undelete tools exist, but nothing for UFS.
It was actually a FreeBSD 4.5 FFS filesystem to be exact, using soft updates. I unmounted the filesystem within seconds of realising it was gone. (and no, i didn't do rm -rf, it was a bad mistake of find and --delete)
So... does anybody know, or do i just give up?
D.
You can tell how powerful someone is by the magnitude of the crime they can commit and be able to get away with.
You've got an array with multiple users so, I assume that this is a server and not a workstation. Or possibly it is a SAN with multiple servers(users)? What I don't understand is how the users are responsible for the missing backups.
In the case of a server, it is obvious that a server level backup should be backing up the data of all users. If you are running a SAN and you are referring to certain servers not backing up their particular partitions, it is still inexplicable because you should be doing SAN level backups in this case.
So, while it is easy and common to blame the users for not doing backups, this case sounds more like a management problem on the network side. If it's in the datacenter it should be backed up. Period! If it's not in the datacenter, well, we all take our chances don't we. If you run without backups it is completely unreasonable to expect any form of safety from total data loss.
Of course, the normal precautions (encryptions, performing sensitive tasks on a machine not on the network, et al) still apply.
One CPU cycle wasted on digital restrictions management is ONE TOO MANY.