Can a Regular Person Repair a Damaged Hard Drive?
MrSeb writes "There's a lot of FUD when it comes to self-repairing a broken hard drive. Does sticking it in the freezer help? The oven? Hitting it with a hammer? Does replacing the PCB actually work? Can you take the platters out and put them in another drive? And failing all that, if you have to send the dead drive off to a professional data recovery company, how much does it cost — and what's their chance of success, anyway? They're notoriously bad at obfuscating their prices, until you contact them directly. This article tries to answer these questions and strip away the FUD."
What has been your experience with trying to fix broken drives?
No.
Sometimes just the controller portion fails. If you remove it and replace it with a working one from a identical drive you're back in business. Only tool needed is a torx driver I believe.
Don't ask me how, but I had a failing drive that couldn't even manage to be on for 30 seconds before being unreadable. Since I was curious, as a control, I first let the drive sit at room temperature for an hour. Afterwards, again, only 30 seconds of read time. I then put it in the freezer for an hour, and was able to read for 10 minutes, just enough time for the data I needed. I have no idea what actually happened, and am still skeptical to attribute the success to the freezer, but I did get what I wanted.
In the very limited (3) cases that I've had to try and revive a client's dead desktop drive, replacing the PCB board from an identical model - usually purchased cheaply, used or new, online - has always worked.
The other advantage of this approach is that if the first drive becomes revivable, even a time, you now have a second same-capacity drive to transfer the data to (using intermediate storage media if in fact it was the PCB that was the problem and you can only get one drive working at a time).
If it doesn't work, you're no worse off and still have a replacement drive to load data from your (hopefully recent) backups.
I broke an external USB hard disk once (it tipped over while running). It cost me AUD $2600 to get it repaired. They got most of the data off; some was corrupted but fortunately nothing important. I take more regular backups now!
I like my coffee the way I like my women - roasted and ground up into little tiny pieces.
Had a disk at work with our sourcesafe database on break. Due to responsibility falling between chairs, there was no backup at all. Sent it to one rescue firm, came back without successful restore, sent it to another one, got more than 99% back, lost nothing important, cost somewhere in the low 4 figures.
With private disks where data rescue is out of the question, I've had good experiences with freezing and in other cases replacing the circuit board. If doing it yourself, always mount RO and have somewhere with enough with enough space to make first a "cp" of selected really important stuff, a recursive "cp" of everything, and last a "dd" or "rescue_dd" of the whole disk. I've had better luck copying files from within a read-only mounted filesystem at first, you are fighting the clock after all.
I've tried the freezer trick to help what sounded like an ailing bearing , but with limited success. No amount of freezing seemed to help. To make things worse, when I took the drive out of the freezer, moisture started condensing immediately on the cold PCB. I tried to place it on a sponge to help sop up the water, but I can't imagine this helped the drive at all.
I have some friends that swear by this, but I am extremely doubtful especially because of the condensation problem. I feel like this is an a apocryphal bit of "knowledge" that has been passed down from a time when drivers were larger, slower and had less precise bearings. I can imagine that on a big old drive freezing the drive *may* have helped. But then again, perhaps it's something like throwing a pinch of spilled salt over your shoulder or touching wood--something your grandma told you to do, but doesn't actually do anything.
This one's tricky. You have to use imaginary numbers, like eleventeen... --Hobbes
Just 3D print whatever new part you need. A new read/write head? Just pop some plastic in the 3D printer and print one out. Then head over to the clean room and the tool box and jigs and use your dexterity and skill to change the head. Bad IC somewhere? 3D print out a new chip. Yes, 3D printing is the future!!
I took my hard drive to the Geek Squad and they wanted $500 to send my hard drive away to get the data.
I yelled at them and I told them that was robbery. Asked for the manager. But, when I was leaving one of the Geeks told me a secret.
He said just go home and drill a hole in the hard drive and then set it on top of your new hard drive with the hole facing down. All the data will just pour out to your new drive.
It didn't work for me, but maybe I didn't do it right?
Some HD problems (stuck platters so it doesn't spin up) are user-fixable. Most are not. There is a syndrome called "sticktion" where the read/write heads settle on the platters when shut off (most modern drives will elevate the heads when shut off, but some, including many older drives, do not). Because the platters and heads are so flat, they mechanically weld themselves together over time. To fix this (a technique I have used often in the past), you need to remove the drive, and then snap rotate it on the plane of the platters, so that the momentum of the platters trying to counter rotate against the impetus of the rotational momentum you are applying to the drive will break the "weld" loose. If you then quickly re-install the drive and turn it on, it will most likely spin up and continue to operate without problems. Other failure modes include head "crashes", spindle bearing failures, drive motor failures, controller circuitry problems (bad electronic components), and mechanical breakage of connectors, solder joints, etc. These typically are not user repairable.
Sometimes, real fast is almost as good as real-time.
Maybe the PCB swap used to work, it almost certainly won't work anymore. When a HDD powers on, it needs to load some parameters for the servo system (i.e. positioning the arm) and other tuned parameters for the controller to read back off the disk. These parameters are probably stored in flash memory on the PCB and the parameters will vary from disk to disk. So, parameters for drive A will not work to spin up drive B because of small variances in their manufacturing even if they're made on the same day in the same plant on the same line by the same underpaid employee
You can't swap disks because even if you get a tiny fingerprint on the disk, it's the size of Mt. Everest compared to the distance between the read head and the media. You'll be putting your own home-grown media defects all over it. Forget about getting your files back.
Aside from common firmware related problems (search for "reparing 7200.11" in google for an example), you're not going to have much luck.
The only other thing I've seen work: a guy took his neighbors HDD (which was not responding in Windows) and had to use an oscilloscope to realize the read waveform from the read head was a low amplitude. He built a small in-line amplifier which brought the amplitude back up to spec so the data could be read off. I was impressed.
Source: I have work experience on manufacturing processes for HDDs.
My first job in "the industry" was in a PC repair shop in 1991. Back in those days, we had a huge crop of bad Seagate 40MB (yes, that's "mega" children) hard drives. The usual problem was that the spindle had frozen up, and if we took the circuit board off and gently tapped the spindle, you could often (about 75% of the time) get the drive to start spinning again long enough to get your data off.
Hard drives have gotten a lot more reliable and a lot smaller since then. I don't know whether this would be a wise thing to do with a modern hard drive.
"He who would learn astronomy, and other recondite arts, let him go elsewhere. " -- John Calvin, commenting on Genesis 1
Backup
You will have great difficulty taking the platters out. The read heads have to be removed without physically coming into contact with the platters. You'll need specialized fixtures and tooling to even begin. If the data is that important then send it to a professional.
Only the State obtains its revenue by coercion. - Murray Rothbard
We fixed a drive by trading the pcb with another *IDENTICAL* drive (same rev of board etc)..
The funny part was that when we went to recover the files they desperately needed back from that drive, all we found were shortcuts to a network drive, where the files had been safe and sound the entire time.. The user just had no idea that they hadn't lost their files..
Sometimes it's possible to ressurect a dead drive by swapping the controller board with another from the same model (or a very close model from the same range). Unfortunately with modern drives there is often information stored on the controller board which is needed for the drive to start. This information seems to be stored on a serial memory chip (usually an 8-pin device in a SOIC or similar package) on the controller board.
What i've found you can do is remove the serial memory chip from the dead controller board and solder it to the donor controller board. Provided you have a hot air rework station it's pretty easy to remove and re-fit the serial memory chips. So-far i've tried this twice and it's worked both times, YMMV of course.
note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
If you do that you want to put a dessicant in the bag with the drive. Otherwise you are just sealing the humid air in.
note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
In preparation for Y2K, we had to turn off our text archive server (at a newspaper) for the first time in, literally, years. The machine itself has been in production for six years, the last two or so of which without a reboot.
It was an IBM AIX machine with an array of 4.5GB SCSI drives. After sitting with its power off for a couple hours, we turned it back on and Nothing Happened. No drives were spinning. Crap.
Called IBM tech support. Got the run-around. Finally got to a guy who said something along the lines of "you're going to think this is crazy but do what I say in this order" followed by...
* turn machine off
* remove drives
* turn the machine on
* bang the drives on their edge a few times on the floor - don't go crazy but harder than you think is a good idea
* spin the drives flat on the ground as though they were tops
* immediately, put the drives in the enclosure
* reboot the machine but do not power it off
Damn if the guy wasn't right.
His guess was that the drives had been powered for eight or so years and the lubricant had either broken down or the heads were simply stuck to the platters. The thumping dislodged the heads and the spin gave the grease a fighting chance. {shrug}
In any case, we dared not turn it off for another year and a half until at such time it was replaced. We thought about buying replacement drives but IBM wanted something along the lines of $600 for a 4.5GB drive. Even on eBay, they were three times what we felt was reasonable.
Cheers,
Matt
At least do it the right way.
1. Get ahold of long cables that can reach *outside* the freezer to your machine.
2. Wrap the drive *before* you put it in the freezer. Heat a towel in the oven to make sure it's dry, then wrap the drive in the towel. Now stick it in a plastic baggie, along with some silica gel packs to suck up more moisture. Try to close the mouth of the baggie around the cables as much as possible. Use duct tape if necessary.
3. Put it in the freezer, route the cables through the door seal, and make damn sure the door is shut tight as possible. Seal it with more duct tape if you have to. Let it sit in the freezer for at least 6 hours to get really cold.
4. Make all your preparations before plugging in the drive. Situate your primary machine right next to the freezer, make sure you're ready to go. If you can somehow manage it, and you know what you're doing, boot into an old copy of DOS, or a command-line interface of your preferred *nix distro. Don't waste time loading Windows if you can help it.
5. Turn off your machine, plug the drive in, then reboot.
6. Move *fast*. Start copying the drive contents over to the backup drive as fast as you can. If you can do it via a script or batch file, then even better. Speed is of the essence. In fact, if you know the locations of the files you need, as well as their general file names, then creating a batch file BEFORE starting would be your best option. Just tell it to copy everything in C:\MyLifesWork\coldfusion*.*
7. MOST IMPORTANT STEP!!! If this does not work, and you can't pull anything off the drive, then don't panic just yet. Turn off your machine, unplug the drive, then unplug the freezer.
Do NOT open the freezer until it has reached ambient temperature, which will take at least 24 hours or more.
This will prevent the drive from getting roached from the condensation, and make it more feasible for a drive recovery company to save your data.
[End Of Line]
Few years back a 20GB drive I salvaged from an old dead Thinkpad stopped working. No whirring sound, nothing...the green light on the USB enclosure stayed on.
There was no important data, but I thought "this is the chance to learn how to salvage a hard drive".
I did the freezer option. I had already used the freezer to kill ants in sugar and bugs in rice. Froze the drive overnight, took it out and immediately connected and waited for whirring sound. No sound. The drive is dead.
Gave the drive couple of almighty whacks. Still no sound. No life at all.
I threw it in the dust bin.
The next day I tried to connect a camera. The SD card on the camera failed to be identified on Windoze and Linux.
I tried another USB cable. And the camera connected fine.
It took me a few seconds to remember the old hard drive. Took it out of the trash, wiped it clean and connected.
The drive works perfectly fine even today. But it still got the smell of decomposed tea leaves.
Tat Tvam Asi
It really depends on what has failed and how. I've repaired a number of drives at work well enough to get data off with just basic software tools, like using Knoppix to force mount problematic partitions and so on. The drive may be failing but not completely so a software-only solution can do the trick.
Also I've had Spinrite work. It has about a 40% success rate but on drives that nothing else could read, I've had it make them readable again. In one case I ran Spinrite (it takes many hours, put a fan on the disk), copied the data to a new disk with Ghost, did a chkdsk, did a repair install of Windows and the system functioned flawlessly, no data or app loss. Of course the other 60% of the time it destroys the disk beyond any repair so it is a "Use only as a last resort and only if the data isn't important enough to pay for professional recovery," tool.
Replacing controllers can work if the controller is what has failed. Needs to be the precise controller so one from a like disk but different size won't work and occasionally even the firmware version can matter.
However if the problem is with the heads themselves or the platters then no, you can't do shit. You need a clean room to open the drive up without destroying it, and then of course you need something to put the platter in to for reading them.
So you can try to self repair a drive. As I said using recovery software (Knoppix with force mount is a great thing to try first) is a good first step, so long as the BIOS can see the drive. May be that you can just copy the data and call it good. However there are also plenty of situations where you can't repair it so don't count on it working. If the data is really important, send it to a pro.
WAY BACK WHEN I took the cover off a 40MB RLL disk, pushed on the spindle by hand (it had so much stiction it could not be repaired by any other means) to free it, put the cover back on, and it worked. Note, 40 megabytes. I didn't even try to improvise a clean room. My cleaning procedure was to blow on the top platter gently before I closed the drive. The drive spun up and I was able to recover 100% of the data from it, and it was nearly full. Before this happened this disk actually burned a power-carrying trace off the board and I replaced it with a wire jumper. Then later that wire got so hot that the solder melted and it fell off, and I put it back on and used it some more. You guessed it, Seizegate.
MANY TIMES I have got a non-spinning drive spinning again by whacking one corner (from the side of the drive) with a screwdriver. The last one I did this with was 80GB or so, but there's no reason why this technique should not be valid today. Connect to power, give it a sharp rap in the appropriate direction, listen to it spin. I started doing this with ST-225s which needed it very often, but I've applied it to many different disks successfully over the years.
My experiences aside, many people have put disks in the freezer or even the oven (not hard to stay below reflow temperature) and got them to free up. If it's a stiction problem it's all about thermal expansion and contraction. If the drive spins but does not work, if you're very lucky you might have a PCB problem, and if you can find a disk of the same model and version then the PCB from the other disk might work on your disk.
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
You need to hang around with people who eat more insoluble fibre.
Regular person? This is slashdot, there are no "regular persons" here.
Semantics is the gravity of abstraction
What had happened was that he had some minor NTFS corruption problem, so he went to our IT dept. Some IT monkey removed the laptop drive and tried to hook it up to a SATA - IDE converter. However he managed to wire up the power backwards. That fried the drive, but actually all it really did was burn/short the power polarity protection diode.
So with magnifying glass and soldering iron, I simply removed the shorted diode, and voila (not wahlah or viola), the drive was working again. I was then able to easily clean up the NTFS problem. Boy was he happy to get all his stuff back.
You need to hang around with people who eat more insoluble fibre.
They typically don't have to worry about being backed-up
Please put dates on your war-story recovery experiences. If you don't remember, give the drive's size.
Please don't be pointlessly coy about naming the recovery vendor who could or couldn't recover your data, the price they quoted, the price you finally paid them. Don't say nothing. Don't say "a certain major vendor, har har." Just tell us who they are. You don't have long before these sketchy tradies see the article and flood the comments with self-serving FUD.
Please state what was wrong with the drive before you tried to recover it, ex:
* didn't spin
* didn't identify
* latent sector errors (ex. "OS keept crashing")
None of the upvoted comments have these details which is making the article useless for me.
For my own experience,
(a) I had full, easy success swapping the controller of an IBM 9GB SCSI drive, but the old controller had been visibly damaged. SMD gull-wing (?) pins were smashed. This is a rare failure case for me.
(b) I'm often able to recover data from the third category of drive with 'dd if=/dev/olddrive of=/dev/newdrive bs=512 conv=noerror,sync'. (there is also dd_rescue, but I don't use it). This works on Linux or Mac, but on Solaris you must use GNU dd for it to work---the included one is just broken. Obviously you need to boot off a drive other than the one that's failing. The drive must be unmounted when you use dd. It takes several days to read a failing drive this way, and about a quarter-day to copy a fully-working drive this way, so the excessively slow/dumb retry cycles in the firmware and storage stack mean your drive is spinning long enough to get worse, if it's decaying (something to consider for self-service vs. pro recovery).
The issue is also what the humidity was in the room in which the head-disk module was assembled/sealed.
Except that hard drives are generally not sealed. They have a filtered breathing hole to exchange air with the outside. Otherwise, the casing would balloon when you took your computer on an airplane or when the drive is shipped via air.
Enigma
I've been a repair tech for the last 10 yrs. (and I don't mean I'm a "I built my own PC, I'm a computer god! I fix my friend/family''s computers" I actually know what I'm doing and have electrical engineering experience) I'd estimate I've seen around a thousand bad hard drives in that time. Of those, I'd say 65% would tap repeatedly, 25% had some io errors but were still working, 8% would sound normal but would never post on the bus, and the other 2% were the other weird issues like chirping or no power at all.
The tappers were very rarely recoverable by me. Every now and then I'd see one that if you powered it up dozens of times, you might get lucky and it would post properly and you could get data from it. None of the other common methods were helpful.
Over 90% of the drives with io errors and slow blocks could be recovered from. Most of those simply required a file level copy from bad drive to good. Most would have a handful of unrecoverable files. Depending on what was lost, an OS reinstall was sometimes required on the new drive, but not usually. A small percentage of them would have a large number of errors and require days to recover, or would fail completely during the recovery. A few of them would look promising but then quickly becomes apparent that almost nothing will be recoverable.
Sometimes a drive would stop responding during recovery and require a break. Trips to the freezer helped on about 30% of the drives. Some drives required numerous trips to the freezer, using rsync to resume copying where it left off last time, a process which could take days but could result in a complete recovery. I pondered ways to cool a drive during the recovery such as using a peltier, but never got anything implemented. I also use ddrescue and another custom script I wrote that works in a similar way, doing block-level recovery while splitting problem areas for smaller recovery chunks. That's useful for windows or other foreign OS where you can't do a file copy. (mac shop here)
I've never dried "drop therapy" or "impact maintenance". I'm sure it could help under specific circumstances like a stuck spindle or loose connection but I've never witness it.
I've done a little bit of onboard controller card ("OBCC") swaps for identical drives where the bad one wouldn't power on at all. About 25% success there. For that reason I tend to keep old tapping drives because their cards can work in dead drives. I assume the tapping drives have head failures, which isn't related to the OBCC. I've talked with multiple data recovery places about this process, and to my surprise every single one of them has told me "that won't work". They usually explain the remaps are stored on the OBCC, which makes sense, but isn't a good excuse not to try when the remaps probably don't account for more than one in a hundred thousand blocks. I think they just want me to send the drive to them.
The sled you place the drive into makes a HUGE difference in recovery. Avoid usb. I don't care if you insist on windows, install a firewire card. Almost all USB bridge chips handle misbehaving drives very badly. Only use one of those little external adapters with the build-on 2ft usb cord on it as an absolute last resort. OWC's "mercury elite aluminum" series are the best (reasonably priced) recovery sled I have found, and I have tried many. USB (39MB/sec, not 36, 26, 16, 12, etc), FW400, FW800, AND esata interface. In the past I used a Granite Digital "fireview", those absolutely rocked for drive recovery (LCD panel with diag menu....) but they stopped making them and they were IDE only. Someone needs to make a modern sled like that for sata please.
As for paid recovery, results seem random. Techs tend to have a recovery place they swear BY, and others they swear AT. But my observation is simply that methods vary and different places handle different problems with varying success. I think many techs' impressions are based on their first few experiences - if good they like, if bad they don'
I work for the Department of Redundancy Department.
Pure snake-oil today. It used to have some merit in the MFM and RLL days, but these are long over. The only thing SpinRite can do today is to cause more damage to the drive if it has mechanical problems. If the drive is mechanically fine, repeated read accessed do exactly the same as SpinRite does, because it does not have any other possibility on modern drives.
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.