IBM DeskStar 75GXP Hard Drive Failures?

← Back to Stories (view on slashdot.org)

IBM DeskStar 75GXP Hard Drive Failures?

Posted by Cliff on Thursday October 4, 2001 @07:07AM from the you-might-want-to-skip-this-product dept.

Sean Kelly asks: "Like a lot of other people, I went out and bought myself a nice 60GB IBM DeskStar 75GXP (ATA100, 7200rpm) hard drive to put in my sparkling new computer. Boy was that a mistake! A few months after I got the drive, it failed with horrific grinding and clicking noises, plus random data loss. So I RMA'd the first one and got a 'SERVICEABLE USED PART' replacement from IBM, which died of the same death after another few months. Not getting the hint, I RMA'd that one. Last week, I got the refab. drive back from IBM and it has already died, in less than a week! This time I did some site searching and found many people are having problems with this drive. Sites such as The Inquirer, Hexus, Tech Report, Hardware One, Sysopt, and even this PCWorld have dedicated articles, forums and user reviews to these failing and defective drives. From what I can understand, IBM is not publicly acknowledging that they screwed up here. How many other people out there have had their 75GXP (or 60GXP) drives fail? What size were they? What part number? What did IBM do about it? It is my opinion that IBM should do something about this, since I've seen an unnaturally high number of complaints about this drive now that I started looking for customer feedback. Also, here is a letter I sent to IBM explaining my frustration with them. It has more information in it."

22 of 695 comments (clear)

Min score:

Reason:

Sort:

No problems here by Lethal_Geek · 2001-10-04 07:09 · Score: 2, Interesting

Mine's been working fine for a full year now. I've never heard a peep about them failing more than they should or anything...*shrug*
1. Re:No problems here by jonfromspace · 2001-10-04 07:50 · Score: 3, Interesting
  
  We have 5 of the 75GB drives here at my office... so-far, 3 have died loud and terrible deaths. Not a great average... However, when they are working (knock on wood) they are sweet.
  
  Oddly enough, we keep getting the same 80GB Maxtor drive as a temp replacement from our dealer, and last time it came here it was full of Porn, DiVX, and MP3's.
  
  --
  I am become Troll, destroyer of threads
Re:Mine are pretty good by Anonymous Coward · 2001-10-04 07:13 · Score: 1, Interesting

The problem is quite simple: IBM sells Western Digital drives re-labeled. I think that spells it out, don't you?
Heat by maiden_taiwan · 2001-10-04 07:19 · Score: 4, Interesting

How's the ventilation in your computer case? Is it possible your drives are overheating?
Certain lots apparently very bad by hpa · 2001-10-04 07:20 · Score: 3, Interesting

The rumour going around the people who work on the disk subsystem in Linux is that certain lots/fabrication plats have lots of problems, and others are A-OK.

I recently got to experience the latter, when I got a machine with six of these disks as a RAID. To date, FIVE of the disks have had to be replaced, thanks God that did not include the system disk...
Drives too fast for the electronics? by jandrese · 2001-10-04 07:21 · Score: 4, Interesting

There was some rumbling on Storage Review that these drives may just be too fast for their electronics, and once you start filling up the outer sectors on the disk you will start getting errors. My friend has a pair of the 45GB 75XPs, and at least one of them has "issues". Every so often (now that the drive is full) the kernel will spit out:
ad4s1g: hard error reading fsbn 76293856 of 26874736-26874751 (ad4s1 bn 76293856; cn 8073 tn 63 sn 37)
followed by:
ad4: DMA problem fallback to PIO mode

So far the 60GXPs that I use have had no problems (knock on wood). I've seen at least once source that suggests that the 45GB versions of this drive are the most suseptable to having this problem. I suspect there was some poor quality control on these drives and some very marginal hardware was released onto the world (bad IBM, bad!), but that's more of a feeling since I don't have much evidence to support the claim.

--

I read the internet for the articles.
1. Re:Drives too fast for the electronics? by DarkHelmet433 · 2001-10-04 08:38 · Score: 2, Interesting
  
  This is consistent with what we see at work. 95% of the problems happen on the inner cylinders. This also would explain why windows users dont see it so much, since they would have to fill the disk 95% full before they went near the troublesome area. I would expect that windows users would tend to suffer from the reported mechanical failures more than the data corruption problems. On unix systems, the data corruption is an absolute curse.
  
  At work, this is with a sample size of several thousand disks, The DTLA's are a nightmare - the entire DTLA product range has inner-cylinder corruption problems. Scrubbing would "fix" them (for a while). We didn't see much of "disk died" type failures on the DTLA 30G drives, but saw many more "dead drive" failures with the larger 75G drives in addition to the corruption problems.
  
  Also, in one test, adding some electrical noise to the mains power (plug an electric drill into the power strip, it made a mess of it) caused the power supply rail voltages to drop a small amount and caused instant errors. The DTLA was the only drive in the machine that this problem. The older DPTA and other brand drives that were also under test didn't care at all.
  
  We have been considering changing our disk partioning layout to trim a few GB from the end of the disk and never use it. We expect that will "avoid" the corruption problems.
Threats of Class Action Lawsuits by bstrahm · 2001-10-04 07:23 · Score: 2, Interesting

Why file a class action lawsuit where your lawyer will get rich and you will get $1.24 for your efforts...

You are much better off sueing either in Small Claims court,where the limits tend to be around 1-1.5K dollars, don't require a lawyer on your part, and tend to be settled pretty quickly.

Threats of filling a class action lawsuit are a waste of time, you are much better off going to your county courthouse, filling the paper work, doing a quick web search on where to send the papers, and hire a courier to deliver them to IBMs local legal representative. When the day is done they will pay you your 1500 because it is much easier/cheaper to do that then to send two lawyers at $200/hr to your location to fight it (and still loose quite a bit of money)
9/18/36GB 10Krpm drive recall by sparcv9 · 2001-10-04 07:27 · Score: 4, Interesting

At the place I work, we had a few 18GB IBM drives fail on us. They were standard inclusions with the Sun gear we use here (Sun ships a lot of machines with IBM and Seagate drives.) We found out from IBM that there was a recall on 9, 18 and 36 GB, 10,000 RPM drives manufactured between certain dates. These drives are pretty much guaranteed to fail, period. One of our other departments had over 90% of their suspect drives fail already. Our Sun reps came out to count how many we had, so that they could replace them. My department has well over 100 of the recalled drives. Fortunately, most of them are in gear that hasn't been put into production yet.

--

This is not a Fugazi .sig
Yes, they've been causing me grief by jestapher · 2001-10-04 07:42 · Score: 3, Interesting

We got two IBM 75GXPs almost a year ago and they've consistantly given us trouble. At least once a week now we get something like:

hdg: dma_intr: status=0x51 { DriveReady SeekComplete Error } hdg: dma_intr: error=0x40 { UncorrectableError }, LBAsect=38535423, sector=38535360 end_request: I/O error, dev 22:01 (hdg), sector 38535360

Over and over again.

It used to be more frequent and would cause the system to completely die after a while, requiring some console-based fscking. But many months back we changed some kernel option -- forgive me as I can't recall which, though I think it was the "burst bit" or something related -- and it hasn't been completely crashing, though we still get those errors about once a week.

I've heard stories of drive completely dying, but thankfully -- knock on wood -- that hasn't happened here. Here's what our setup looks like (from dmesg):

Uniform Multi-Platform E-IDE driver Revision: 6.31 ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx VP_IDE: IDE controller on PCI bus 00 dev 21 VP_IDE: chipset revision 16 VP_IDE: not 100% native mode: will probe irqs later ide0: BM-DMA at 0xd800-0xd807, BIOS settings: hda:pio, hdb:pio ide1: BM-DMA at 0xd808-0xd80f, BIOS settings: hdc:pio, hdd:pio PDC20265: IDE controller on PCI bus 00 dev 88 PCI: Found IRQ 10 for device 00:11.0 PDC20265: chipset revision 2 PDC20265: not 100% native mode: will probe irqs later PDC20265: (U)DMA Burst Bit ENABLED Primary PCI Mode Secondary PCI Mode. ide2: BM-DMA at 0x8400-0x8407, BIOS settings: hde:pio, hdf:pio ide3: BM-DMA at 0x8408-0x840f, BIOS settings: hdg:pio, hdh:pio hde: IBM-DTLA-307030, ATA DISK drive hdg: IBM-DTLA-307030, ATA DISK drive ide2 at 0x9800-0x9807,0x9402 on irq 10 ide3 at 0x9000-0x9007,0x8802 on irq 10 hde: 60036480 sectors (30739 MB) w/1916KiB Cache, CHS=59560/16/63, UDMA(100) hdg: 60036480 sectors (30739 MB) w/1916KiB Cache, CHS=59560/16/63, UDMA(100)
1. Re:Yes, they've been causing me grief by Snowfox · 2001-10-04 08:15 · Score: 3, Interesting
  
  We got two IBM 75GXPs almost a year ago and they've consistantly given us trouble. At least once a week now we get something like:
  hdg: dma_intr: status=0x51 { DriveReady SeekComplete Error }
  hdg: dma_intr: error=0x40 { UncorrectableError }, LBAsect=38535423, sector=38535360
  end_request: I/O error, dev 22:01 (hdg), sector 38535360
  Over and over again.
  
  The IBM 7200RPM drives take longer to wake up after being spun down. I seem to recall seeing similar problems under FreeBSD that turned out to be read/write operations timing out.
  Is it possible that your BIOS or drive parms were set to power down the drive after some amount of inactivity? Anyone know more?
Take them to court if they're being uncooperative by psychonaut · 2001-10-04 08:17 · Score: 2, Interesting

I have undergone a similar ordeal, only with KDS monitors. The original monitor I purchased failed within a few months, and each of the three replacement monitors I was provided with died within a day of receipt. After repeated verbal requests for a refund followed by several demand letters, I have now initiated a lawsuit against KDS and its business associates for failure to provide warranty service.

I am doing this through my jurisdiction's small claims court; the procedure, while lengthy, is not so complicated that I require a lawyer. If you decide to opt for the same route, I would be pleased to provide you with a copy of my claim. You might want to examine it and adapt it for your own purposes and jurisdictional requirements.
Don't just ask who has had problems by tmark · 2001-10-04 08:38 · Score: 4, Interesting

We also need to ask how many people have had *good* experiences. Now, it is clear from reading some of these posts that many users have had good experiences with them. But posting this sort of question here, where a large selection bias probably exists and where people who have had good luck are much *less* likely to post their success stories, is going to result in a very skewed picture of IBM - or other - problems. So even if a dozen Slashdot users wrote in with their own horror stories, I wouldn't know what to make of the results, especially since horror stories of other manufacturers are not solicited at the same time. No basis is therefore provided here for us to evaluate any results.

Don't ask a question if the results you get back won't mean much !
Use Seagate by Simba · 2001-10-04 08:39 · Score: 2, Interesting

I've used pretty much every form of hard drive out there, and in my experience the Fujitsu (whom makes IBM's drives for them) and Western Digital drives are the worst to be had.

For IDE applications, I recommend the Seagate 7200rpm Baracuda line.

For SCSI, the Cheetah line. I have one of the age-old original 4.5GB Cheetahs. I've low level formatted it a few dozen times, high level formatted it a few hundred times, run a dozen operating systems on it, dropped it numerous times, whacked it, smacked it, and just generally abused the hell out of it in my torture (Q&A) server for many years. Once, when all the little brushless fans in my drive coolers decided to die within a hour of each other, it got so hot I couldn't even touch it for a few hours.

And.. it's still working. :) I've never, ever had a bad experience with Seagate, and will recommend them to anyone who listens. The few extra bucks you'll pay over a Maxtor or similar mass market drive are worth it a hundred times over when you've got critical data floating around in a non-raid situation. That's the situation for most college kids whom are cash-starved, yet need to have their thesis survive to the end of the term. :)

--
Hippies smell.
Funny Goofs by harlows_monkeys · 2001-10-04 09:03 · Score: 2, Interesting

This topic reminded me of an earlier IBM drive that had a problem. Way back in the 80's, when 80 meg was a good sized drive, IBM had a new SCSI drive with the unheard of capacity of 1 gig. I worked for a company that developed firmware for caching disk controllers, and we had one of the earliest of those drives for testing.

When I hooked it up and powered up the system, the drive fried. We got a replacement, and that worked for a while, but when it got moved to another system, it got fried.

We eventually figured out that what was happening was that in the systems these drives were going in, you could not easily see the power connector when the drive was in its bay. That's not normally a problem, because power connectors are keyed. However, the plastic on these connectors was not very rigid. If you tried to put a power connector in wrong, the plastic would simply bend and allow the connection. There was no noticable difference in the force required to put the connector in right and that required to put it in wrong. So, anyone using the "the right way is the one that actually goes in" method of hooking up power had a 50/50 chance of getting it backwards.

Now here's the funny part. We found out from a contact in IBM that IBM was having something like a 20% failure rate on these drives during testing at their plant, because of their own QA people plugging the power in backwards!
I've only seen one IBM drive fail by MrResistor · 2001-10-04 09:12 · Score: 2, Interesting

...and it was a travelstar that somebody dropped on concrete.
I have 2 75GXPs in my home box (20G and 30G) and they have both been great. I've also installed another 5 or 6 75GXPs in other peoples computers and they've been totally reliable. I've also had a few older IBM drives that have been passed on to other family members. I've had such great experiences with IBM drives that I haven't bought any other brand (for myself or anyone else) in over a year.
Kind of a glowing endorsement, I know, but we tend to find what we're looking for. If you're looking for problems with a particular manufacturer you will most likely find what seems to be an inordinate number of them. How do the complaints you're finding for IBM stack up in comparison to other manufacturers?
Also, could it be a heat problem? I always make sure that any hard drive I install has at least one adjacent empty bay for air circulation, preferably one on each side.
With that many drives in a row going bad, I would suggest that the drives aren't the problem. I had a friend who rebuilt/upgraded his computer several times last year because he was having weird stability problems. He went through 3 mobos with 2 Athlons, 2 mobo's with 2 P3s, 2 video cards, 2 sound cards, 3 hard drives, 3 IDE controller cards, and God only knows how many cables/connectors before he finally figured out it was the power supply that was the problem.
Of course, at every step he would sell the "bad" hardware to somebody at about 1/3 what he paid for it and they would have no problems with it (which is how I got my Athlon, mobo, the 20G 75GXP, and a RAIDed Promise Ultra66 without being murdered by my wife ;)

--
Under capitalism man exploits man. Under communism it's the other way around.
Re:"the problem isn't the drives, but bad handling by Anonymous Coward · 2001-10-04 10:29 · Score: 1, Interesting

When you think about it, it makes sense. The drive is traveling at perhaps 1 foot per second, and then it comes to a complete stop in less than a thousandth of an inch.
Acceleration is dv/dt, which is the change in velocity over time. So what you're really concerned about is how quickly the drive goes from 1 meter/second to its next velocity. What you don't want to see in a velocity vs. time graph are spikes, where, for instance, a drive goes from 1 m/s to -.5 m/s (rebound, ouch) in 1/1000 of a second. That'd be, uh, roughly 150G's peak acceleration.
But you're conceptually correct: generally, if something stops quickly, it does so in a short distance.
OK, enough nitpicking. :)
--
chahast at pangaea FOO dhs FOO org
s/foo/dot
My Deskstar and what i found out afterward by Anonymous Coward · 2001-10-04 10:43 · Score: 1, Interesting

I've got a 40GB Deskstar 60GXP. Here's what I've dug up on these uberdrives.

The platters in the 60GXPs are 20GBs each. Apparently, this is incredibly dense for consumer-grade hard drives (the 75's were 15GB each). The problem is this: heat. Everyone knows what happens if you stick a magnet on a grittle and let it cook for a bit. If you don't, here's the skinny: there are two ways to quickly and permanently demagnetize something. One is extreme shock. The other is heat. Harddrives heat up. If this heat is not dissipated, the platters heat up. When a platter is too hot, and the read/write head attempts to magnetize/demagnetize a sector, the magnetism may occur on the adjacent track.

My tip: A HDD cooler, even a passive one (giant heatsink) is better than nothing. I have one, and it's loud and obnoxious. But my HDD is fast, and that was my point in buying the Deskstar.

Hope this helps.
holy failures, batman! by soellman · 2001-10-04 11:08 · Score: 2, Interesting

Yea I have 5 of them and an extra in a 3ware raid and already I've had 5 failures. I'm getting to the point where I just do a low level format and put it back in the raid (5) because I can't get the RMAs fast enough! I'm just going to have to take it down after a backup and send them all back. grr..
and I find it hilarious that IBM says on their product page that cnet praises their new 75gig drive, and then when you go to cnet, 50% of the readers have given it a thumbs down! nice work IBM..
first IBM hd trouble ever - switched to Seagate by unger · 2001-10-04 16:15 · Score: 2, Interesting

been consulting for 16 years. standardized on IBM hard drives long ago. never had one fail.

purchased two 40GB DTLA-305040 drives in feb 2001.

one failed within 1 month.

the other developed bad spots within 1 month.

moved my clients to Seagate until IBM proves it can act responsibly.

ok, i fibbed a bit. i once replaced an old IBM laptop hard drive. that was the only dead IBM drive i'd ever seen, until this year.
my 75gxp 45gb by flerchin · 2001-10-04 16:33 · Score: 3, Interesting

i ordered my drive from some retailer on pricewatch. After about 3 months or so it started making noises that i can only liken to a clutch slipping out on a manual transmission car. Concurrent with these noises would be long seek times, even though i had disabled powersaving spin-down. I think the individual platters spin down or something. Anyways, i d/l'd the ibm diagnostics, it said i had a bad drive. So i called ibm tech support, they said too fucking bad, bec apparently my vendor had bought the drive from dell, thus making it 2nd hand and without warranty. I, being the trusting fool that i am, had thrown out all documentation, and felt screwed. Then i put the drive back in and decided to keep using it until it completely died. Only, and here's the catch, i didn't screw it back in, just left it on the slides in my case. The plm was solved! I was overjoyed and figured i could screw it in, since it would be there for a while. Immed after putting the screws in, the plm is back. I've left the screws out and haven't had a plm since. Yeah!!!

--
--why?
Mine is dead by opkool · 2001-10-04 17:14 · Score: 2, Interesting

Oh boy!

Mine was working great... but it died. It is on a home computer. Linux only. Good ventilation, usual stuff. No silly overcloking nor anything wierd. Plain desktop system. I even have a fan blowing air under the HDD!

After about 6 months of home use, I had trouble with my e-mail (using Kmail). Configuration got screwed up? I checked configuration, created new user, copyed config files in new user... and it didn't work.

Bug in old KDE 2.0? I tried upgrading to KDE 2.2.1. No success. I tried uninstalling and installing back the whole KDE 2.2.1 thing. No success. Still problems. Bad ones.

Software problem? The software was OK. I tried with other users. So next I checked the partition.

Oh Boy! CLinks! Clanks! It was an ReiserFS (I like to play safe). And some nasty errors showed up on the mail files.

I managed to save most of my email, copying the files to another partition. The /home partition was all clingy. Some mail files went through lotsa clings and clangs. Unable to be recovered whatsoever.

So I got myself an old drive and, while trying to install a linux on that 2nd drive, inorder to burn a CD from a reliable system, I made a mistake. With all the panic and pressure, I obliterated my partition table on the IBM 60 GB Desktar IDE-100 HDD instead of backing up the partition table.

OH boy!

It has been more than a month. I am still trying to recover that partition table. parted / gparted are no good: they have issues with ReiserFS and extended aprtitions (my data is on a ReiserFS extended partition).

I am very sad. I actualy bought that disk because it was an expensive IBM disk, because I wanted my data safe.

Of course the place where I bought it have ofered to replace it. But recovering the data costs many $$$ that I do not have, specially after getting layed off by a big blue company.

Oh boy! I'm pissed!