Seagate Firmware Update Bricks 500GB Barracudas
Voidsinger writes "The latest firmware updates to correct Seagate woes have created a new debacle. It seems from Seagate forums that there has yet to be a successful update of the 3500320AS models from SD15 to the new SD1A firmware. Add to that the updater updates the firmware of all drives of the same type at once, and you get a meltdown of RAID arrays, and people's backups if they were on the same type of drive. Drives are still flashable though, and Seagate has pulled the update for validation. While it would have been nice of them to validate the firmware beforehand, there is still a little hope that not everyone will lose all of their data."
I'm glad to see them trying though. It's nice of a company to realize they made a mistake, and work to fix it.
It's all fun and games till someone divides by 0. Then it's hilarious.
clay tablets.
Ay Caramba already.
They'll be no different from other HDD manufacturers. I recently got a Seagate external because the price and 5-year warranty were a great combo. I hear they are going to lower the warranty period and now these problems; makes me wonder where I will be able to buy reliable drives in the future.
Arguably, when version "latest and greatest" -1 has a cool bug that causes it to permanently and (without hardware intervention) irrecoverably brick itself for no obvious reason, applying version "latest and greatest", at the manufacturer's recommendation, is a fairly reasonable thing to do.
Anybody who thinks that RAID=backup is going to learn an exciting lesson; but I don't think we can, in fairness, blame people for applying the update.
what is happening with seagate? did they downsize their qa staff or something?
Normally, they wouldn't, but these drives already had issues. Seagate recommended updating the firmware (with their 'handy' windows only updater). Unfortunately, that made the problem worse.
I would like to know where the hell the firmware update IS? I have opened a ticket with Seagate for each drive. Followed the directions (which were linked to here last week) in detail, and I have heard back NOTHING.
Not even an acknowledgment that they have looked at my tickets. I got a "your ticket was created" email, and that is it.
Seagate is getting very close to losing a lot of customers.
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order- Ed Howdershelt Via Tass
Barracuda Flounders
There are no karma whores, only moderation johns
Once upon a great while back, Seagate was one of the première names in hard disk technology. These days, the only press I'm seeing them get is bad firmware, questionable reliability, etc. They've been around longer than Microsoft, they really have no excuse at this point for not even testing their bugfixes on their own hardware. It's not like they even have to test third-party stuff.
What leads to this sort of decline?
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
Over 48 hours now .... Starting to think I better not hold my breath
I agree, but good job for holding your breath for so long :P
The firmware updater uses FreeDOS from a CD image (ISO). Users had to burn it to a CD and boot from it. Here's an example when I tried it (first release that crashed while upgrading -- did not brick for people and me) under VMware to see if my CD booted: http://img403.imageshack.us/img403/7128/screenshotsa7.gif from Sunday night. I didn't bother to try the second one because that one totally bricked 500 GB HDDs which I have!
Ant(Dude) @ Quality Foraged Links (AQFL.net) & The Ant Farm (antfarm.ma.cx / antfarm.home.dhs.org).
I work for a web hosting company and we get these drives by the case. I couldn't guess how many are deployed throughout the datacenter but on some of our backup servers alone I've calculated that I have almost 100 drives that need the firmware update. Thankfully none of the disks on the systems that I admin have shown problems yet, but we try to run a quality operation and that includes preventive maintenance wherever possible.
I was all set to update the firmware on these when one of our guys found that the update rendered unusable 8 of the 8 drives he upgraded the day before Seagate pulled the update. We currently have some massive amount of Western Digital 500GB and 750GB disks on rush order as a result of this debacle. It wouldn't surprise me if management tells us to swap the Seagate disks for the WDs and decides to just sell the whole lot of Seagate disks off in bulk as defective. It would be cheaper than paying people to update each one by hand.
Before this, Seagate used to mean "quality" in my opinion as their failure rate seemed to be lower than the competition and their 5-year warranty was unmatched. For the average home user, this situation is a headache. For people running datacenters filled with these disks, it's an outright fiasco.
Go here http://forums.seagate.com/stx/board?board.id=ata_drives to see the angry users and posts in Seagate's official forum. Most of us are pretty angry and upset. Definitely read this super long thread: http://forums.seagate.com/stx/board/message?board.id=ata_drives&message.id=6272 (42 pages).
I find it ironic that our HDDs are about to be bricked EITHER way (on its own) or with the pulled firmware updater (released twice already too; first one crashed with memory dumps and stuff for everyone; second one bricks 500 GB models).
FYI, http://support.seagate.com/firmware/MooseDT-32MB-SD1A.ISO was the ISO file that was released (404 error now due to brickings) according to my download history. Seagate needs to get the next one right!
Ant(Dude) @ Quality Foraged Links (AQFL.net) & The Ant Farm (antfarm.ma.cx / antfarm.home.dhs.org).
It's not bricked if you can fix it without modifying the hardware. It's a nice term -- stop destroying it.
is this seagategate?
Media that can be recorded and distributed can be recorded and distributed.
-kfg
I talked with A/S 10 minutes ago
After talking with Seagate A/S a few days ago and told I needed to update my firmware and sent an email on how to update, no fireware was downloadable from the links in the email provided.
Annoyed I talked to Seagate A/S again today, it seems I do not need a firmware upgrade anymore, and only some of the hard drives made in Taiwan between some date seem to be defective and updating firmware in non-defective drives seems to be causing problems. Hence they removed all links to firmware. Since they are not 100% sure of what I mentioned above yet, they told me they are going to update their site and call me back when things get finalized next week.
Can we, for God's sake, just permanently ban the use of the word "brick" or "bricked" in the summaries. I have yet to see it used correctly.
Brett
I work for Seagate. I was there when the fit hit the shan, and I saw everything going in internally, as well as externally.
I really love my job, so please excuse the sock-puppet nature that creating a brand new account and claiming to be an authority on the subject I must seem to be. But I am a geek, and I really think you all need to know the true story behind the scenes.
This whole thing started with the 1.5 Terabyte drives. It had a stuttering issue, which at first we all thought was a simple bad implementation of SATA on common chipsets. Seagate engineers promptly jumped in and worked to try to duplicate the issue and prove where the problem was. This wasn't a massive rush as 1.5tb drives are what? 5% of the drives on the market. When it became obvious that the issue was more widespread, they buckled down and put out a couple of firmware revisions to fix it.
Now, in the 1.5tb drives, there are 2 main revisions. the the product line that gets the CC* firmware, and the line that gets the SD* firmware. They came out with firmware CC1H and SD1A to fix these issues and started issuing them.
But, seagate has always been restrictive of handing out their firmware, so such updates required calling in with your serial so that the people who had access to hand out the firmware could check a) model, b) part number, and c) current firmware just to make absolutely sure that they were giving the right firmware out. This has been a procedre that has worked for YEARS up until now.
Then the bricking issue came to their attention. It took so long because it's an issue that's hard to track down - pretty much the journal or log space in the firmware is written to if certain events occur. IF the drive is powered down when there are 320 entries in this journal or log, then when it is powered back up, the drive errors out on init and won't boot properly - to the point that it won't even report it's information to the BIOS.
This is a rare, but still obviously bad issue. Up until now, we all figured it was just some standard type of failure, as it was such a rare event, so we'd RMA the drives.
So, for whatever reason, mid management started freaking out (as it could be a liability for seagate, I suspect - ontop of the already potentially liable issue of the stuttering problem causing drives to fail in RAIDs). So, they pushed the release of the SD1A firmware to the general public. They took a few days to 'test', though it was mostly just including some code in the batch file that kicks off the firmware updater, to check that it is a BRINKS drive, and the proper model number. Then it was kicked out to the public.
Please understand, this firmware had to go through five different checks to make sure it applies to the specific conditions to qualify sending to a customer, before now. 5 chances for us to go your drive needs the other (or none) firmware update. Suddenly, it's down to ONE check, and even that was more designed for a contingency just incase the wrong firmware was sent out.
Of course, it starts bricking drives.
Right now, the engineers are crapping themselves, the firmware's been pulled, the support agents are told to say "The firmware will be released soon" and no real procedure to fix this issue is in place. Our phones are flooded so bad that it locks the system up when there are too many calls in queue, and emails are coming in at hundreds an hour.
We simply cannot keep up.
The good news is, the chance of your drive simply not spinning up one day is very low. And for those of you who flashed the wrong firmware - be patient. It's not bricked, just unable to write data to the platters properly. When they have a *GOOD* firmware out, a new flash should un-brick the drives. If not, flashing it back to SD15 should make it work again.
Seagate really pushes the idea of being open and honest as much as we can without being sued to hell. They let agents make choices and use their skills instead of scripting us to death. They worked hard to bring their support back t
If the problem was the uncertainty of your data's integrity due to a firmware bug, then your problem is solved.
Now you are sure.
Which means they've proved its never too late to have a happy childhood.
Blank until
This whole debacle is *exactly* the reason I prefer software RAID to hardware RAID. I deliberately make my RAID arrays with disks manufactured at different times and by different brands, and when possible on different controllers as well -- having a totally homogenous RAID array has always struck me as dangerous.
This problem isn't anything to do with the drives being SATA versus anything else, and the FC lobby shouldn't get too smug. Some (with hindsight, at least) bad engineering decisions got taken in a complex product, and the result was that the product got into trouble. All disks are a mixtures of electronics, mechanicals and firmware, and although this happened today on a SATA drive it could happen equally well tomorrow on an FC drive. The answer to your question is ``anyone who wants to be power, space and money efficient''. There are products now shipping in volume --- Pillar, Sun's Fishworks boxes spring to mind --- where the performance of SATA is brought up to FC standards for many workloads (your mileage may vary, objects may be closer than they appear, etc) by the application of appropriate filesystem structures, battery-backed RAM, flash, SSD, etc, etc. There are, before anyone jumps in, workloads where nothing this side of a gazillion independent spindles of 15000rpm FC is going to work. But conversely, there are other workloads where performance isn't as much of an issue as space and power density (backup, for example) or where capacity causes the business far more issues than performance. I've got a Pillar stuffed full of SATA. There's FC available for people who need the performance bump, but I don't: my application workload saturates on other factors long before it maxes out the NAS server, and even if that were not the case, the value to my business of small deltas of performance (and the difference between FC and SATA is a lot smaller than people make out) is less than the massive difference in price. In general terms, SATA today is where FC was five years ago, and even if you end up short stroking it it's _still_ cheaper than FC. My Pillar allows me to effectively short-stroke SATA for performance and use the residue for non-critical data, which is nice, of course. Performance isn't everything, as otherwise we'd all be going to the supermarket in Formula 1 cars. There are other criteria, and SATA may be appropriate for your business, depending on what your business is. And slightly more controversially, I'm suspicious of admins who claim their application needs the latest bleeding edge of a component --- disks --- which is on a slow development curve for performance. The speed of disks scales, loosely, at the square root of the capacity times any increase in rotational speed, but seek times have only improved by a factor of four over the twenty years I've been running fileservers for. If you're seek bound, you've got deeper problems that disk technology won't always help you with. ian
A brick's value is the cost of creating a brick to replace it.
So if it is less expensive to throw something out and buy a new one than it is to repair it, it's bricked.
paintball
Maxtorman, I'd mod you up if I had the points. Your comments are the first ones to alleviate a very significant knot that formed in my stomach after reading this. I'm still a little concerned though, and have some questions at the bottom I hope you could answer.
I'm a little late to the party because I only use these only for non-critical stuff like home office and family PC's, but the prospect of having all my drives inevitably die really scares me. I've bought 18 drives (ST31000340AS and ST3500320AS all w/ FW SD15) in the last half of 2008 that sound like they match those reported to fail on the forum.
Funny enough I was complaining to my vendor about 4 drives that had to be replaced because they died within a month of use. Thought it was a bad batch they were pawning-off on customers, but I still trusted the Seagate brand.
So, my questions:
1) Is this definitely fixable in firmware? Should I be buying new drives right now?
2) What are the honest chances of a drive dying before a working firmware patch? My critical stuff is in RAID5 so I can always rebuild, but the gaming rig is RAID0, and off-site stuff like mom's media PC is only a single drive.
I appreciate your comments. Good to know there's a guru in the Slashdot community :)
-Matt
--- Need web hosting?
It's not "moving the head to prevent wear". It's SMART data gathering. smartctl will soon sort you out. However, I would personally not recommend it.
smartctl --smart=on --offlineauto=off
Raid has never been a backup. A backup is something stored outside of the running set. That way you can restore the data if your running system would, you know, break down.
c++;
I absolutely agree with you.
If we had been allowed proper development and proving time, this may not have been an issue at all.
But the moment Seagate even admitted there was an firmware issue with the 1.5Tb drives, lawyers began recruiting for class action suits.
Disgusting ambulance chasing fecal sniffers.
Since I had not heard of massive numbers of Seagate drives failing I already suspected that this is a rare occasion in which the drives would not spin up. I was wondering why Seagate announced this bug berfore they have a fix ready. Looks like they announced at very early. Maybe they also should have put more emphasis on the fact that it is a very rare bug.
It was announced. And people were freaking out about a bug from Seagate without a fix ready. What happens when customers freak? Right: Tons of pressure on getting a fix ASAP. I thought the chance of a bad firmware would be much higher with that much pressure and upgrading the drives would pose a more severe risk than just doing nothing. So I think I will keep doing that for another couple days at least.
The sad part about that story is that companies will be more reluctant to be open about bugs in the future.
Ok, maybe it's just me, but who the hell updates drive firmware anyway? Just because I'm a techie, doesn't mean I am suddenly willing to do more work than other customers.
Do you think a single consumer out there goes through the trouble of updating their drive firmware? (unless there's an automatic procedure in place, like probably mac and some windows manufacturers have)
To me, any drive which requires an firmware update to function (not just perform better) after purchase, is a failed product and I would surely hesitate to buy another ever again.
I used to buy Seagate drives in pretty large numbers for some of my datacenter activities and every time a drive locked up for some reason, I insisted on a new drive through EMA. Had Seagate refused, they would have taken away a large chunk of their added value, to me. I would probably never buy another drive from them again.
It is the old High Availability versus Disaster Recovery question. Two completely different things aimed at two completely different problems.
The first is to make sure that your system remains available as long as possible even if some of your hardware goes belly-up. The latter is for when your DATA goes belly-up.
Seagate never played Whack-a-mole growing up.
The game you played with a server full of seagate drives "growing up" is that if it was off long enough to cool down it was a virtual certainty that at least one of those drives wouldn't spin up. The odds of another disk developing stiction while you were taking the first one out of the case and whacking it with a screwdriver approached infinity as the number of disks became large enough to make the machine or enclosure heavy. Is that enough like whack-a-mole for you?
On the plus side, most of those drives had really huge filters in them. I had a 40MB RLL disk that I opened, de-stuck, and closed with nary a data error. I just did it in my bedroom (huh huh) with no dust or static control in the environment whatsoever. That drive eventually burned a trace right off of its control board, and then burned off the jumper wire I replaced it with, but that was over a year later and I'm quite sure one had nothing to do with the other.
I don't know how Seagate actually got its good name. I've used a lot of their disks, since way back when, and I just don't see it. I used to like Maxtor in spite of the noise, but WD has pretty much been my best friend all along. Today I will hardly buy anything else.
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
it starts to randomly move the head to prevent wear
I have to ask what wear you think is being prevented by INDUCING activity. If the head arm DIDN'T move, that would be *preventing* wear compared to moving it. But "reducing" wear?
Well unfortunately you will get the occasional clunker no matter who you go with. Any product cranked out in those kinds of numbers is bound to have at least small bad batches no matter how good the QA. Now as for ExcelStor, when I said small I was talking 80-160Gb range. I have built plenty of office machines with ExcelStor drives in that range and they are quite popular. They are VERY quiet, which if you are building a machine which is going to be on the desktop is a plus.
With WD I've always had those fail really quick(and thus be under warranty) or not at all. I've got a drawer filled with 40Gb WD IDE drives from upgrades that I'll need to figure out what to do with. Samsung I've had good luck with on very large capacity, as well as Hitachi. But if you are wanting a 1.5TB you are pretty much stuck in Seagate land-they are pretty much the only game in town. Might I suggest either the Samsung I linked to earlier or perhaps one of these, two in a RAID 0 if you really must get above the 1TB range? Because until Seagate gets their collective shit together I would be afraid if picking up one of their drives ATM. My WDs may only be 500GB each(and that strikes me as funny as hell that I can say "only" when my first HDD was 2GB) but they are VERY quiet and give me more space than I will ever need.
ACs don't waste your time replying, your posts are never seen by me.
I am one of the victims and your report confirmed all the problems which I expected to occur inside your company. I previously worked with an electronic giant and the problems are just too similar.
The catastrophic problems which Seagate is facing now could have been prevented - if there would have been one single person in customer service who would have cared and pushed the issue, which was known for months, up to the right people. A little googling some months ago would have proven that this issue is far bigger than a "one time" incident.
After all it doesn't happen every day that Data Recovery companies announce with joy that they are able to handle widespread 7200.11 firmware problems. Or that the two major companies which provide recovery solutions race for being the first to have a two click solution for this cash cow.
Data recovery companies were flooded with drives. They figured out an easy way to fix the firmware and kept it secret. They made a great profit, charging prices as if it was a hardware failure.
Seagate Datarecovery did the same by quoting up to 1800 USD for a 10 minute fix. Although I am sure that they were the only ones not aware of the easy fix.
The problem with the undetectable bios drives really isn't new. Your customer service knew it for a long time, but they are paid so little and probably have such strict procedures that they don't care about Seagates customers and no one dared to report the drive failures as a major incident. Everyone shut up about it and the people which are responsible and do care only learned about it months later when (or shortly before) it got out to the press.
Seagate had months of time to fix it. Two months ago when my drive broke, there was already plenty of information about the problem on the net. The only one who would deny any problem was Seagate.
I warned your board moderator of the disaster which will strike Seagate months ago. I tried to show him that these were not normal failure rates but the poorly paid guy didn't care.
The email support who takes two weeks to respond, and the phone and live support were just as ignorant.
There were people reporting how 4 out of 6 drives broke within weeks, and Seagate would only respond that such failure rates are normal.
People on the Seagate boards were constantly reporting the problem, but your board moderator shut them up. Threads where getting deleted and locked, including a big thread where the community was working on a fix. The reason, according to Seagate, was that it added nothing to the community.
The board moderator would consistently tell everyone that there is no known problem with the drive - the same message as your customer service.
It went as far as blocking links in private messages to a posting on another board which could help the victims. So how could Seagate expect from those people now to actually believe that the company cares?
The posting on the new board had within a short time 10.000 views. That's when things started to get out of hand for Seagate.
People were pissed off for months about Seagate. Everyone knew that the firmware was broken, but the company denied any problems. We knew that it is not that difficult to recover the data if you have the tools and knowhow, but the company wouldn't give any assistance. Many would have accepted the fate if the drive would truly be broken. But not if it is inaccessible because of a firmware bug which makes every single drive a -clicking- time bomb.
People everywhere were calling Seagate harddrives junk drives which are so unreliable that they will never buy them again.
So I, as many others, went on to warn every single person we knew about the problem with Seagate drives. The hilarious/sad thing is that before, I would recommend Seagate to everyone I knew. If someone would ask me which drive to buy I would reply with no doubt: Seagate.
This could have been prevented if Seagate would have acknowledged the problem much earlier. I wasted day after day,
I never understood why equipment capable of being flash-updated by users does not include the 1.0 drivers as a ROM onboard the device. This way if you completely and utterly bork the flashing, you can reset a jumper, press a recessed button with a paperclip, so SOMETHING that will cause the EPROM to be reflashed from the known good ROM. "Hey, here's baseline firmware again, people. Let's try this again."
The only possible explanation I can think of for not doing this is that the known-good ROM would add another half-cent to the manufacturing process and we know how manufacturers watch their pennies.
Kwisatz Haderach
Sell the spice to CHOAM
This Mahdi took Shaddam's Throne
I do a lot of video and audio work- so with about 350 gb worth of data that is in jeapordy, I find myself doing multiple backups.
I work in computer repair, and you would be surprised how many people lose all their data, despite having a backup. No single backup is a good idea. Multiple backups are ALWAYS a good idea. So here's what I do on a small budget with a ton of data:
NOTE: Backups need to be painless. If they're not, I'll never do them. Be honest with yourself. Making a note to sit down and burn DVDs every sunday will NEVER happen. It might happen a few times, but not *just* before you need it. Plan around convenience, for your data's sake.
1. Backup number 1 is basic. I have another desktop on my network, with a raid mirror and a network share. I run a backup regularly. Incremental backups to seperate files works nicely to conserve space. CHECK YOUR BACKUPS REGULARLY. Nothing's worse than realizing your backup has been bunk for 3 weeks, and then have a system go down.
2. My second backup is in case of major disasters that burn down my house or something similar. I use MOZY to do online backups. Now, with 350gb, it's not a very fast thing. But because it's incremental, eventually you get past the hump, and it only has bits and peices to do to keep up. A major point for me on online backups was security. My life is in this data, including SSNs, Pictures, Embarrassingly bad poetry, and so on, so I was a bit nervous about starting an online backup.
Mozy offers local encryption with a private key BEFORE uploading, although it's not neccessary. Mozy states in their terms, if you decide to go with the private key, and you lose that key- they can't help you. I like the sound of that.
If you don't use your own private encryption, they say they don't look at your data, but you know how companies fold to government wishes. I'm not hiding anything illegal, but I do want my privacy.
Mozy is $5/month. With my connection (768kbps up) it's taking months to finish the first upload. But after that's done, I'll have a bit more comfort, and for $5, it's not worth not doing. They allow you to throttle the bandwidth during different times of the day, so I have it upload during the night and while I'm at work, then have it stop while I'm home, so I have the internet connection all to myself!
*Note: I do not work for, nor am I affiliated with mozy.
Belief? Hope? Preference?The Existential Vortex