Backblaze Releases Billion-Hour Hard Drive Reliability Report (extremetech.com)

So... by Anonymous Coward · 2016-05-17 22:07 · Score: 0

...can these statistics be used to find parcel trowing delivery drivers???

Japanese? Not anymore. by johnsmithperson123 · 2016-05-17 22:09 · Score: 4, Informative

HGST is owned by WD now if I recall, so it's not Japanese anymore. (Sorry if somebody already mentioned this.)

Re: Japanese? Not anymore. by Anonymous Coward · 2016-05-17 22:18 · Score: 1

so if Ford bought Honda would it make it Japanese I think so?
Re:Japanese? Not anymore. by Anonymous Coward · 2016-05-17 23:26 · Score: 0

They own the company, but all the techonology and build is done by the Japanese, hence the superior quality.
Re:Japanese? Not anymore. by AmiMoJo · 2016-05-17 23:28 · Score: 1

It's still run as a separate company though, with engineers and design in Japan and manufacturing in Japan and I think Malasia.
HGST is owned by WD, but they are not the same drives or engineering.

--
const int one = 65536; (Silvermoon, Texture.cs)
SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
Re: Japanese? Not anymore. by Anonymous Coward · 2016-05-18 00:15 · Score: 1

I really think so.
Re:Japanese? Not anymore. by Anonymous Coward · 2016-05-18 00:28 · Score: 0

Maybe they practice the Soviet method of corporate takeover? Kill the leadership and send the old workers to a prison camp, where they have to train their indian counterparts for menial pay.
Re:Japanese? Not anymore. by Anonymous Coward · 2016-05-18 00:49 · Score: 0

That has changed over the last couple months. They are integrating into a single company now. A condition of the sale was WD has to sell some of it's production to Toshiba (I think it was the desktop drives) so new WD drives are made by what was HGST. Overall what is being kept is HGST engineering and WD management/marketing,
-- someone who works in the drive industry.
Re:Japanese? Not anymore. by Solandri · 2016-05-18 01:18 · Score: 4, Informative

IBM sold their storage division to Hitachi, who renamed it HGST. So it was never Japanese to begin with.

Several countries objected to the HGST and WD merger since it would leave only two manufacturers of 3.5" HDDs (WD and Seagate). So to push the merger through, HGST agreed to sell its 3.5" assets to Toshiba (which until then only made 2.5" HDDs) so we would have three manufacturers of 3.5" HDDs
Re: Japanese? Not anymore. by houstonbofh · 2016-05-18 01:30 · Score: 1

No, if Ford bought Mazda...
Re:Japanese? Not anymore. by fnj · 2016-05-18 02:21 · Score: 2

[HGST is] still run as a separate company though, with engineers and design in Japan and manufacturing in Japan and I think Malasia.
I've got some bad news for you. MOFCOM has approved a full integration of the two companies, so (1) the HGST brand will disappear, and (2) all that HGST tradition of reliability is headed sraight down the crapper.
MOFCOM in 2012 at the time of the merger approval "restricted the two companies to a 'hold separate' ..., which prevented the companies from combining products and workforce."
For, actually, three years the two companies were forced to act as separate entities. But since late last year, WD has been allowed to integrate "substantial portions" of its business with HGST. R&D and manufacturing will become fully integrated within another one to two years. Separate product lines will no longer be mandated after two years.
It's going to be a repeat of what happened when Seagate swallowed Samsung. All the goodness of the Samsung products evaporated. The failed morass of Seagate's lackadaisical incompetence annihilated it.
Re:Japanese? Not anymore. by Anonymous Coward · 2016-05-18 02:28 · Score: 1

Still somewhat wrong.
Toshiba didn't make 3.5" consumer drives before. They already had the MGxx enterprise nearline series which are descendants of a Fujitsu design.
They now also have a consumer series based on that line.
DT01ABA/ACA = e300/p300 = Toshitachi. DT01ACA100 / DT01ABA100 started out literally identical to Deskstar 7K1000.D / 5K1000.B until they changed the manufacturer/model string - yet to this day they still haven't bothered to change even the FW rev#.
MD04ACA = x300 = Toshijitsu. MD04ACA500 is mechanically identical to MG04ACA500e but crippled in firmware.
Re: Japanese? Not anymore. by Grishnakh · 2016-05-18 05:17 · Score: 1

There's a difference between buying a stake in a company and buying out the whole company. Ford bought a stake in Mazda a while back, and co-developed some vehicles and platforms with them; they no longer have any ownership now however and the two are completely separate. This is rather different from one company completely buying out another company and merging them together (or just killing off the acquisition).
Re:Japanese? Not anymore. by mattventura · 2016-05-18 11:47 · Score: 1

That's bad news, I'm hoping by the time they go downhill, we'll actually have a production-quality ZFS implementation for Linux so you can use whatever crap drives you can find.
Re:Japanese? Not anymore. by Swave+An+deBwoner · 2016-05-18 13:23 · Score: 1

Looking at an HGST HUS724040ALE640 with manufacturing date Aug-2015, it says "MADE IN THAILAND".
Re:Japanese? Not anymore. by Anonymous Coward · 2016-05-18 19:48 · Score: 0

Fortunately, WD sold a sizeable chunk of HGST to Toshiba in order to meet the requirements of the anti-cartel authorities. Toshiba still makes good and reliable 3,5 inch HDDs and will continue to do so.

Re:Why does this matter? by Anonymous Coward · 2016-05-17 22:10 · Score: 5, Informative

It will affect you, if you ignore the results and choose to buy a Seagate drive. Trust me, I've been there...

Re: Why does this matter? by Anonymous Coward · 2016-05-17 22:24 · Score: 0

Is this a troll?

The point isn't that any drive lasts that long. It's an appeal to the statistical significance of the result.

Re:Why does this matter? by Anonymous Coward · 2016-05-17 22:26 · Score: 5, Informative

Can anyone tell me how this affects anyone? A billion hours is a ridiculous amount of time that makes this irrelevant to any reasonable person. No one cares if a hard drive lasts a billion hours.

I suggest you look at the definition of the word "cumulatively".
Here is a hint: divide 1,000,000,000 by the 60,000 HDD of the report, this makes 16,667 hours which is approximately 2 years.

they only run wd reds (non pro) by Anonymous Coward · 2016-05-17 22:44 · Score: 0

it puzzles me why they aren't running red pros, which have almost twice the warranty, the vanilla reds do suck in terms of reliability

Re:they only run wd reds (non pro) by Anonymous Coward · 2016-05-17 23:30 · Score: 1

Probably because, as explained in the report, they buy their disks in batches of 5000 to 10000 at at time.
I suppse WD can deliver reds in those quantities, but not red pros?
And price may come into the question too. At those quantities, it may be cheaper to replace 100 more drives than to buy longer lasting models.
Re:they only run wd reds (non pro) by fuzzyfuzzyfungus · 2016-05-17 23:51 · Score: 2

Probably price: Backblaze's thing is using some sort of software abstraction and redundancy layer to get away with providing storage on the cheapest drives that they can get their hands on.

Makes them a pretty good value among providers of offsite backup/cold-ish storage; but they have a very limited interest in paying for more reliability at the hardware level, since that would fairly quickly push them into the domain of traditional storage vendors who use more expensive hardware to provide fault tolerance for software that isn't designed to handle that itself.

They obviously have an interest in getting the best value for money, hence the gathering reliability data, and they'd presumably be willing to pay a nonzero premium if the reliability difference were large enough; but their whole approach is a 'paper over lousy hardware in software' strategy. It makes their storage designs a poor drop-in replacement for many applications(even if you are using a fairly clever filesystem like ZFS that has good tolerance for some drives dying, the sight of SATA port multipliers hanging off the cheapest HBAs they can find might make you a bit nervous); but it's pretty difficult to buy a storage system where a lower percentage of the total cost is non-disk hardware.

That webpage by Anonymous Coward · 2016-05-17 23:06 · Score: 2, Informative

Good god! opening that webpage is like walking trough treacle. I had to turn on Ghostery - 25 trackers!!

Re:That webpage by Anonymous Coward · 2016-05-18 00:13 · Score: 0

displays NOTHING for me.. content on page relies upon scripting. noscript = no content = no visit
i saw this article first on ars anyway. here's the actual source at backblaze that tfs should have linked to instead of that polluted extremetech page.
Re:That webpage by Anonymous Coward · 2016-05-18 00:34 · Score: 1

"The company that owns Ghostery, Ghostery, Inc. (previously Evidon), plays a dual role in the online advertising industry. Ghostery blocks marketing companies from gathering website user information, but it makes money from selling page visit, blocking and advertising statistics to corporations globally, including corporations that are actively engaged in collecting user information to target ads and other marketing messages to consumers."
Fighting evil with evil, nice.

Re:Why does this matter? by RogueyWon · 2016-05-17 23:08 · Score: 3, Interesting

Yeah, I've been there. There's nothing quite like the sinking feeling you get when you first hear the "bird chirp" sound which is the first sign of impending catastrophic failure.

I had 3 of those drives fail in a 6 month period, all of them relatively new and only subjected to consumer-level usage. It got to the point where I was getting agitated every time there was birdsong outside my window. Seagate drives don't get anywhere near my home PC since then.

Mixed up results by Anonymous Coward · 2016-05-17 23:09 · Score: 0

I think the author of the summary is mixing up the results for Seagate and WD.

From the summary:

Seagate comes out the worst, though it is suspected that much of that rating was warped by the company's crash-happy 3 TB drive (ST3000DM001).

That drive isn't even in the table in the report. It's the 4TB drives that pull Seagate's rating down.

Also, looking at the graph by manufacturer and year, it looks like it is Seagate that improved much the last year, not WD. And it's Seagate that comes out third for 2016 in that graph, while WD is last, not the other way around.

Math is hard! by Anonymous Coward · 2016-05-17 23:11 · Score: 0

From TFS:

Backblaze also notes that the 8.63% failure rate on the Toshiba 3TB is misleadingly high — the company has just 45 of those drives, and one of them happened to fail.

Wut?

Re:Math is hard! by Anonymous Coward · 2016-05-17 23:49 · Score: 0

Annual failure rate. If you have one drive and it fails after 3 months, it has an annual failure rate of 400%, because at that rate you'd need four drives to have one drive available for one year. Yes, math is hard. Go look at some cat pictures instead.
Re:Math is hard! by Anonymous Coward · 2016-05-17 23:58 · Score: 0

It said 47 in the report, not 45.
1 out of 47 in 1 quarter extrapolates to 8.51% per year.
The first quarter of 2016 counted 91 days, the entire year is 366 days.
1 out of 47 in 91 out of 366 days gives 8.55%
Damn, I expected to get 8.63 from at least one of those computations.
But math isn't hard, math is fun :)

Re:ST3000DM001? In a DATA CENTER? by Anonymous Coward · 2016-05-17 23:17 · Score: 1

That's what they call 'disk nearline storage'. MAID arrays.
I also remember that RAID originally stood for "Redundant Array of Inexpensive Disks", not for "top-of-the-line SCSI moneyburners" :)

Re: Why does this matter? by Anonymous Coward · 2016-05-17 23:18 · Score: 0

Why does the summary compare annual failure rates if the measure that matters is bytes*MTBF/dollar ?

Not very useful by TheRaven64 · 2016-05-17 23:18 · Score: 2

As they note, one of their drives has an 8% annual failure rate because they have 45 and one happened to fail this quarter. A lot of the others are similar numbers, with the difference between 0 and 1 failures being 4-8%. The only ones where they have enough data to be useful are HGS, one WD, and two Seagate models. One Seagate is a lot less reliable than most HGST drives (and less reliable than the worst HGST model), the other is the most reliable disk in the set. The WD drive is the least reliable.

--
I am TheRaven on Soylent News

Re:Not very useful by thegarbz · 2016-05-18 02:17 · Score: 2

You say not very useful and then point out that they provide all the stats needed to get all required information about the usefulness, and then also go on to say that there are several models with some really large data sets.
This is the very opposite of "not very useful".
Re:Not very useful by TheRaven64 · 2016-05-19 23:00 · Score: 1

No, I said it's not useful and then pointed out that they only have enough data to tell you anything reliable about one company and have data about two other drives from other companies that tell you very little about whether it's representative.

--
I am TheRaven on Soylent News
Re:Not very useful by thegarbz · 2016-05-20 04:00 · Score: 1

Hardly. Look at their published datasets. They have more than enough data to paint a picture about a company on the whole. Now as for specific models you are right, there are a few drives where they have too few models to draw an accurate picture.

Re:ST3000DM001? In a DATA CENTER? by Anonymous Coward · 2016-05-17 23:19 · Score: 5, Insightful

Ever price out "enterprise" drives?

When you're buying 10 drives, you pay the premium because man-hours to deal with failures are expensive. When you're buying 10,000? Not so much because failures are built into the design at that scale.

At the scale Backblaze operates, it's cheaper to build redundant systems that can handle consumer drive failures and just buy twice as many drives.

Hitachi made a mistake by Anonymous Coward · 2016-05-17 23:21 · Score: 0

when they sold out to Western Digital. This is the kind of thing they could capitalize on, to have the absolutely highest quality storage drives available. Anyway, what's clear from these charts is that the American products are of the lowest quality, and if quality is what you want, in general you buy Japanese or German.

Re:Hitachi made a mistake by Anonymous Coward · 2016-05-18 00:04 · Score: 0

Like with everything.

Re:Why does this matter? by GrumpySteen · 2016-05-17 23:25 · Score: 5, Informative

It's how statistics work.

There are over 7 billion people on the planet divided among 100 or so ethnicities and about 200 countries. If you're trying to determine the demographics of the world, checking only 10 random people will not give you any meaningful data. Checking a million random people, on the other hand, will give you a fairly good idea of the demographics of the world.

Same with hard drives. Statistics on 5 hard drives won't tell you anything about the likelihood of a 6th drive failing. Statistics on 100,000 drives will.

Re: Why does this matter? by Anonymous Coward · 2016-05-17 23:26 · Score: 0

Yes, it is a troll. "Why does this matter?"

Amazing how bad Seagate management is by jez9999 · 2016-05-17 23:31 · Score: 1, Troll

Not only have Seagate chained themselves to the declining HDD market, apparently happy with their inevitable fate of oblivion, but they can't even get that right! Their HDDs are totally crap as well.

--
== Jez ==
Do you miss Firefox? Try Pale Moon.

Re:Amazing how bad Seagate management is by Anonymous Coward · 2016-05-18 00:57 · Score: 0

Not a bad plan really, if you're top management.
1. Cut costs and make crappy HDDs; keeps the books in the black.
2. PROFIT!
3. People finally start to notice, and the company goes under.
4. Float away on golden parachute.
Re:Amazing how bad Seagate management is by houstonbofh · 2016-05-18 01:44 · Score: 1

Not only have Seagate chained themselves to the declining HDD market, apparently happy with their inevitable fate of oblivion, but they can't even get that right! Their HDDs are totally crap as well.
And they continue to sell because they are "good enough" and cheap enough, and because they are easy to buy in quantity. This is why BackBlaze does not buy as many Toshiba or WD drives... Getting them in quantity is difficult.
Re:Amazing how bad Seagate management is by Anonymous Coward · 2016-05-18 04:25 · Score: 0

Not only have Seagate chained themselves to the declining HDD market, apparently happy with their inevitable fate of oblivion, but they can't even get that right! Their HDDs are totally crap as well.
"Declining HDD market"? I see no such thing.
We have multiple sixty-HDD-drive file server chassis, we some SSDs for read/write caching (and booting). There's a lot of cold data out there, and people aren't putting it on premium (per GB) SSDs.

Re: Why does this matter? by Traxton · 2016-05-17 23:34 · Score: 2

My 3x Seagate NAS 4TB drives sitting in my file server have power-on hours of about 19000 without any sign of problems. I will replace them at the 3 year running mark anyway, but mostly because I am running out of space. If you wanna talk about really horrible drives, look at first generation WD Green. I sold mine after 2 months because of extremely lacking performance and worrying mechanical noises. During the 20 years I have been using hard drives I have come to the conclusion that one should never buy the largest drives and not the first generation of new technology drives. Choose the safe middle-ground.

Re:Why does this matter? by fuzzyfuzzyfungus · 2016-05-17 23:34 · Score: 1

You misinterpreted: this is a billion drive-hours worth of data, not a drive operating for a billion hours(given that that's a bit over 110,000 years, we don't really have that sort of reliability data, even if anyone cared).

And, when it comes to reliability analysis, that 'ridiculous amount of time' is enormously helpful. How else are you going to draw statistically significant conclusions about something with such an element of chance?

Re: Why does this matter? by Anonymous Coward · 2016-05-17 23:38 · Score: 0

You're right, I misunderstood. Reliability does matter. I don't understand why people think Seagate has awful reliability, though. The real issue isn't which manufacturers make good drives, but which models are reliable.

Re:ST3000DM001? In a DATA CENTER? by Anonymous Coward · 2016-05-17 23:41 · Score: 0

Someone hasn't been paying attention. They consumer drives actually have performed better over time than enterprise grade drives. You should read their previous reports before typing on the internet and showing everyone else you don't know what you're talking about.

What happened to laptop hard drives? by Anonymous Coward · 2016-05-17 23:42 · Score: 0

Four years ago, when I bought my old laptop, there were so many with 1 tb hard drives, although a bit expensive. That one died on me (not the hard drive, but the hinge snapped) and so I went to look for s new laptop. Turns out we've regressed, and now laptop hard drives are only a fraction of what they used to be at double the price, all for a tiny bit of speed and thinness. Fucking bullshit, I say.

Re: Why does this matter? by fuzzyfuzzyfungus · 2016-05-17 23:43 · Score: 4, Insightful

Depends on your use case: the Backblaze people are operating a system specifically designed for cheapo drives that are expected to have a fairly high chance of falling over and dying(pragmatically speaking, that's part of why they are so nice and friendly about drive reliability data and sharing the designs for their 'pods': their real asset as a company is the software sauce that allows them to offer cheap, reliable, storage through software-level redundancy on top of a pile of low-end drives packed tight and connected with really cheap HBAs and SATA port multipliers: no fancy hardware RAID, no redundant-controller SAS, etc.)

If you are buying drives to use as the boot volume for computers that only get a single HDD, or even systems with small RAID arrays, you are going to be seriously inconvenienced by drive models that drop dead atypically fast, even if you save a few bucks upfront. Re-imaging a replacement drive or swapping out a failed RAID disk and rebuilding the volume take time and trouble.

If your purposes are very similar to theirs, then your sensitivity to failure is lower and getting a slightly better deal per GB might start to make sense; but you have to be pretty failure insensitive(or the price of reliability really steep) to be in the same boat.

Re: Why does this matter? by dwywit · 2016-05-17 23:47 · Score: 2

IIRC, the WD green drives had firmware issues - they were "green" because the FW would power them down prematurely in an effort to save energy, only to have them powered up again because the OS requested a read/write. Too many off/on cycles = premature failure.

Also, I must be lucky - I've had one Seagate failure in 12 years, and it was replaced under warranty. Small sample, admittedly - somewhere between 100 and 150 in domestic use.

--
They sentenced me to twenty years of boredom

Re:ST3000DM001? In a DATA CENTER? by fuzzyfuzzyfungus · 2016-05-17 23:54 · Score: 1

Their whole thing is a software-level redundancy arrangement designed to provide adequate reliability through redundancy on top of utter shit hardware. That's the company's niche. It does mean that they massacre drives like crazy; but their cost/GB is pretty impressive, so long as you are doing fairly cold storage, not something IOPS intensive.

Re:WARNING: Parent link redirects to goatse by Anonymous Coward · 2016-05-18 00:05 · Score: 0

Actually, look at the URL your browser shows before modding anything down.

It's a plain simple google search link that will prove that what he said is true, the top (and probably third) comment in this thread is a troll.
And a dumb one at that, who starts all of his comments with the exact same words.

Re:ST3000DM001? In a DATA CENTER? by Anonymous Coward · 2016-05-18 00:08 · Score: 0

ST3000DM001 is just a particularly problematic drive. Backblaze has been using consumer drives for years, with great success. At their scale, It's much more cost-effective to keep replacing cheaper, maybe less reliable drives. Enterprise drives may make sense for some architectures, but certainly not for them.

Re:Why does this matter? by Anonymous Coward · 2016-05-18 00:21 · Score: 0

Don't say "X is bad" or "Y is bad" without adding "today", and keep an eye on how things evolve.

5 years ago WD had a piss poor reputation here at work, while Seagate counted as decent.
I think a certain shipment of WD drives (don't remember the exact type) didn't contain a single one that has lasted a full year before giving up.

Similarly, at a certain point in time, I think about 15 years ago, I wouldn't have recommended Hitachi to anyone, and Samsung was even worse.

Seagate loser as usual by Anonymous Coward · 2016-05-18 00:30 · Score: 0

Funny how the industry is still making excuses for Seagate.
Worst crap in the industry for over 20 years.

Seagate Sucks? by Anonymous Coward · 2016-05-18 00:53 · Score: 0

I have an SGI Octane @ work with 15K Cheetahs that hasn't been shut off since 2003.

Re:Seagate Sucks? by Anonymous Coward · 2016-05-18 01:04 · Score: 1

Oh yeah, guess wut?
They think Nefertit's tomb is hidden behind a wall in King Tut's burial chamber.
You know what they will find, bro?
They're gonna find a fucking NETWARE 3.12 server. That's what. ANd it's gonna be running on a bank of BAGHDAD BATTERIES. And it's gonna have full-height 5.25 SEAGATE CHEETAS. And they're gonna be NOISY. And ALL this will MAKE servers GREAT AGAIN. And the PEOPLE will REJOICE and MICROSOFT will DIE.
Re:Seagate Sucks? by citylivin · 2016-05-18 04:35 · Score: 1

hasn't been shut off since 2003.
Drives usually fail on power cycle so i'm not surprised.
I dare you to reboot it and see if any of the drives come back up.
Still thats an impressive uptime. So impressive that its not likely true. Power failures happen once every 10 years at the least in most places. UPS batteries die every 3 years. Generators fail in weird ways (i had one start delivering only 80v consistently as the voltage regulator was going out).

--
As a potential lottery winner, I totally support tax cuts for the wealthy

Re:Why does this matter? by marcansoft · 2016-05-18 01:00 · Score: 3, Insightful

No, it will affect you if you choose to ignore the results and buy a *3TB* Seagate drive.

When will people stop picking stupid manufacturer sides when it comes to drive reliability? It has nothing to do with manufacturers and everything to do with models. *Every* drive maker has put out shitty models that fail in dumb ways, from HGST (ex-IBM)'s DeathStars to Samsung's firmware fail (I still own a bunch of HD204UIs with an unfixed firmware bug that eats data if you dare use SMART self-tests) to Seagate's 3TB failures. Picking manufacturer sides just means you'll get hit whenever they make the next broken drive.

If you actually look at their per-drive stats, you'll see that Seagate's 4TB drive is, so far, *more* reliable than WD's current drives. I have a bunch of those and they're mostly running fine - though I had one drop off the controller last weekend (came back after reboot), first failure in years, I need to look into that. We'll see. Right now, 4TB Seagates seem to be the best bang per buck with decent reliability. Next year it might be another brand/drive.

Re:ST3000DM001? In a DATA CENTER? by silas_moeckel · 2016-05-18 01:03 · Score: 1

They are bad drives for sure, but they are not exactly using 90's raid tech either. It's distributed mirrors the OS just sees a JBOD and higher levels deal with making copies etc. Looking at their hardware spec they are not realy worried about performance with a lot of sata port multipliers. But their industry is write it twice and probably never access it again outside of bitrot detection and correcting for failures.

--
No sir I dont like it.

Actual link to report by Solandri · 2016-05-18 01:06 · Score: 4, Informative

And not some news website which doesn't even have the courtesy to provide a link to the actual source report.

https://www.backblaze.com/blog/hard-drive-reliability-stats-q1-2016/

It includes historical models as well as statistical confidence intervals - very useful for determining which model drive is more reliable. I know everyone wants to use an easy rule like "Seagate bad" when buying, but it's not that simple. Each new model of drive includes new design changes to try to increase capacity, improve speed and reliability, and/or reduce cost. Sometimes these design changes work, sometimes they don't and the model is less reliable (e.g. Samsung 840 EVO). The statistics have the greatest orthogonality when broken down by model, not by manufacturer.

Re:Actual link to report by PhrostyMcByte · 2016-05-18 02:44 · Score: 2

Slashdot should not link to sites that hide the actual article. So annoying.
Re:Actual link to report by Anonymous Coward · 2016-05-25 04:43 · Score: 0

It are de way of they're kind. They're kind.

Terrible Data Table by iCEBaLM · 2016-05-18 01:07 · Score: 1

So they label the data table as being for the first quarter 2016, but then for some inexplicable reason they change the failure rate to be annual? Are they using historical or projected data? Why skew the failure rate?

And then the bar graph - failure rates by manufacturer. How are they getting this data? For example, 2016 for HGST they list a failure rate of 1.03%, but that isn't borne out in the table data. The table data suggests only a 0.2% failure rate (44 failures / 22731 drives).

Re:Terrible Data Table by Solandri · 2016-05-18 01:32 · Score: 2

The article linked in the summary is a (bad) tech website's take on the actual report. Look above, I've provided a link to the actual Backblaze report.

The different drives have been in operation for different lengths of time. So they have to normalize the failure rate to an annual number in order to compare. e.g. If a drive model has been in use for 3 years, you just give the number of failures in the last year. If a drive model has been in use for 3 months, you multiply its failure rate by 4 to get its projected annual failure rate. It's not perfect, but at least this way you're comparing based on the same number of operating hours.

The bar graph (or anything comparing based on manufacturer) is pretty useless. I'd suggest just ignoring it. You want to concentrate on looking at the different model drives. The bar graph is made by lumping the statistics of each manufacturer's drives they used for the year. So it's a non-normalized amalgam of (1) a different mix of drive models every year, (2) in different quantities. Those two variables pretty much wipe out any statistical meaning, which you can get directly from the other charts they provide anyway. It's something that would be useful to Backblaze internally (see how well they're doing at filtering out unreliable drive models year over year), but useless for anyone else.

HGST best, Seagate worst - again! by blind+biker · 2016-05-18 01:10 · Score: 1

This is NOT the first report in which HGST hard drives resulted to be the most reliable, and very much not the first report where Seagate came dead last in reliability. In fact Seagate's unreliability is becoming legendary.

--
"The agriculture ministry is not in charge of Gundam" - Japanese ministry official.

Re:HGST best, Seagate worst - again! by U2xhc2hkb3QgU3Vja3M · 2016-05-18 01:50 · Score: 1

Seagate HHDs unreliability is so legendary, in fact, that they should partner up with Old Spice.
Re:HGST best, Seagate worst - again! by Anonymous Coward · 2016-05-18 02:53 · Score: 0

They should present data by model rather than manufacturer.
Re:HGST best, Seagate worst - again! by Anonymous Coward · 2016-05-18 06:56 · Score: 0

Already done... if you look at original report
Yawn

Re:Why does this matter? by Bongo · 2016-05-18 01:16 · Score: 1

Indeed. Of the (only) 100 desktops I deal with, exactly one had a 3TB Seagate, and it failed.

Re:Why does this matter? by houstonbofh · 2016-05-18 01:27 · Score: 1

When will people stop picking stupid manufacturer sides when it comes to drive reliability?

When people stop continuing to get tons of anecdotal evidence. I was refurbishing some old storage boxes, and testing all the drives. Found about 10 bad out of 50. All the bad ones were Seagate. (Farious sizes and models.) None of the WD were bad. This reinforces my belief that Seagate is likely to be crap. It works the same way for others.

Re:ST3000DM001? In a DATA CENTER? by houstonbofh · 2016-05-18 01:35 · Score: 1

What the fuck is this company doing using consumer hard drives in a goddamn data center?

Making lots money while still offering services cheaper then anyone else. Their B2 storage is even cheaper then Amazon S3. Yet, the service overall is very reliable, and the customers seem very happy.

Re:Why does this matter? by Anonymous Coward · 2016-05-18 01:39 · Score: 0

Yes! And similarly, seeing the sun rise in the east every day for the last few thousand years of human recorded history tells us nothing about what'll happen tomorrow!

Stop being an ass. Obviously there are assumptions that go into predictions about the future, cheif of which is that observations of the past are representative of general behaviour. But then dismissing that as utterly invalid is to dismiss the predictive power of the entirety of science.

never had a hard drive fail on me by Anonymous Coward · 2016-05-18 01:44 · Score: 1

not even once, and i run seagates everywhere and even an old maxtor 80gig drive on a 2002 desktop i use for torrenting, then again all have a fan for cooling and none use those fancy alternate power states in which the computer isnt on, but isnt off either, im not a faggot here, i like women with big titties and binary power states

but if you are running a seagate 3tb drive thats a monstrosity with 4 plates and 5 heads or whatever, or a model thats been on the market for 2 months and you love to put it to sleep and wake it up again until the firmware craps on you, you are dumb, and probably a faggot too

Less than useful by Lawrence_Bird · 2016-05-18 01:44 · Score: 1

This data is only for a 91 day period. To actually be useful, data should be presented for a rolling 6M, 1Y, 2Y and 3Y periods of time, or at least for however long they keep drives in service. They should also include mean and median age of the group of drives. Perhaps Backblaze has that info elsewhere but nothing like it in the article.

Re:Less than useful by Anonymous Coward · 2016-05-18 02:42 · Score: 0

Maybe look at the original report instead of a tech "journalist" butcher... "summarizing" it?
Hell, they even provide the source data so you can do your own statistical analysis.
Oh wait, that'd require more effort than whining in a /. comment.

Re:Why does this matter? by gumbi+west · 2016-05-18 01:57 · Score: 2

If you randomly select drives from a population than it absolutely does tell you something about the unsampled units. Obviously they don't run their drives for one hour an then retire them.

Re:Why does this matter? by Anonymous Coward · 2016-05-18 01:57 · Score: 0

Stop being a Grade-A cunt, you know damn well what he meant.

These drive statistics aren't used to give a 100% insight into how the next batch WILL behave, they're used to ESTIMATE what COULD happen to the next batch of drives, and sample sizes DO affect the results.
Go back and finish high school.

Re:Why does this matter? by fnj · 2016-05-18 02:03 · Score: 1

No, it will affect you if you choose to ignore the results and buy a *3TB* Seagate drive.

I will resist the impulse to shout "hey, stupid". If you even bothered to glance at the small table in the report, you would see that no Seagate 3 TB at all were covered. But the ST4000DX000 4 TB (5 failures, 9.63% failure rate) and ST4000DM000 4 TB (198 failures, 2.54% failure rate) were.

Re:Why does this matter? by Anonymous Coward · 2016-05-18 02:03 · Score: 1

This is the most wrong reply I've seen on Slash this week, kudos!

Re:Why does this matter? by thegarbz · 2016-05-18 02:14 · Score: 2

You must be one of those the chance of winning the lottery is 50:50, you either win or you don't people.

A long analysis of statistics of 100000 drives most definitely gives you information about the 100001th drive when it's in a population group compared to another population group.

Build a drive that self destructs after 2 hours, run a billion of them for 1 hour ... billion hours of time with no failures!!!!!!

Your absurd abuse of statistics would give very valuable insight into the assembly process and QA process of a manufacturer. This would produce very valuable information despite your attempt to show it's worthless, especially since infant mortality is a thing.

Re:Why does this matter? by Bengie · 2016-05-18 02:14 · Score: 1

YOU don't understand how statistics ACTUALLY work.

You don't seem to understand. The whole point of statistics is to give predictive power. If you can't predict what the 100,001st drive will probably do, then you're not using statistics.

Re: Why does this matter? by Traxton · 2016-05-18 02:15 · Score: 1

Correct. I disabled the power down timeout. It stopped the on/off cycles, but didn't improve performance. I ended up buying regular 7200 rpm Seagate disks. Sold them after 4 years and replaced with my current Seagate NAS disks. They run 5 degrees cooler than my old disks with no noticable performance decrease. Highly worth the small price difference.

Interesting trends by thegarbz · 2016-05-18 02:19 · Score: 2

Aside from comments on specific models and specific manufacturers, has anyone else noticed the downward trend?

I wonder if this is due to more careful selection or (except in the case of Seagate which is quite obvious) the manufacturers are actually getting better, or age related issues in the way the stats are reported.

Re: Why does this matter? by Gr8Apes · 2016-05-18 02:23 · Score: 2

I had two of the terrible 1.5TB Seagates fail early. Didn't even do a warranty exchange on them, wasn't worth having to do another one 3-6 months down the road, and then another, and another and... So I bought WD, Toshiba, HSGT, pretty much anything but Seagate. I still won't buy Seagate. Trust once lost is hard to earn back. Their drives just haven't been better than the brands I do trust, so no reason to go back to them.

--
The cesspool just got a check and balance.

Anyone remember Maxtor? by scorp1us · 2016-05-18 02:30 · Score: 1

You were lucky to get a year out of one of their drives.

--
Slashdot's rate-of-post filter: Preventing you from posting too many great ideas at once.

Re:ST3000DM001? In a DATA CENTER? by Bengie · 2016-05-18 02:32 · Score: 1

They're using all of the drives the same way and one brand fails 3x+ more than others. It doesn't matter if they're using the harddrives "wrong".

Re:Why does this matter? by Anonymous Coward · 2016-05-18 02:35 · Score: 0

God I hope you're not in a position where anybody might mistake you for someone who knows what they're talking about.

In case ExtremeTech is listening by justthinkit · 2016-05-18 02:40 · Score: 1

In case ExtremeTech is listening, I added them to my hosts file (several months back) and now never go there any more. Used to be worth a periodic visit...

--
I come here for the love

Billion hour test by rossdee · 2016-05-18 02:42 · Score: 1

A billion hours is 114 thousand years
civilization didnt exist that long ago
at least on this planet

even if they tested a thousand drives for 114 years that would still be amazing. what sort of hard drives were around in 1902 ?

What about drive warranties? by h4ck7h3p14n37 · 2016-05-18 03:14 · Score: 1

I know it's a reliability report, but shouldn't drive warranties be considered?

If a drive is still under warranty, do I really care if it fails at time X versus 2*X? Rather than choosing a drive based on overall reliability, shouldn't I make the decision based on reliability after the warranty period has elapsed?

Better Source by Krazy+Kanuck · 2016-05-18 03:29 · Score: 2

I realize advertising is king here, but a link to the original and far more detailed report would have been nice. https://www.backblaze.com/blog...

Re: Why does this matter? by omnichad · 2016-05-18 03:31 · Score: 1

They also intentionally broke RAID functionality to force you to buy their more expensive drives. I used one or two in a RAID anyway after correcting all the firmware settings, but it still caused problems.

Re:Why does this matter? by omnichad · 2016-05-18 03:34 · Score: 2

They were still using 3TB Seagates in their last report (Q4 2015). They discontinued all use of them as a result of their findings.

Is cheaper really better? by h4ck7h3p14n37 · 2016-05-18 03:36 · Score: 1

Does it really pay off in the long-run to buy lower quality drives?

For example, a 5400 RPM 4 TB WD Blue (desktop) drive is $130 with a 2-year warranty. The 4 TB WD Gold (datacenter) is $264 with a 5-year warranty, but spins faster and has twice as much cache. The more expensive drive is slightly cheaper per warranty-year and provides more IOPS, but does draw almost 4 Watts more power when active.

Without knowing how long the drives last beyond their warranty period, which the Backblaze report doesn't mention, isn't it less risky to buy the more expensive drive?

Re:Is cheaper really better? by Grishnakh · 2016-05-18 04:51 · Score: 1

I haven't bothered to look into Backblaze and exactly how they work, but if your business is all about providing the cheapest mass storage possible by using cheap drives in big RAID arrays, most likely the double-priced drive is a worse deal because it costs twice as much, and the extra performance isn't that important to you. In a massive array, it's going to get replaced when it dies, so you're not worried about risk because redundancy handles that. Plus the extra power consumption is going to cost you. So even if it only lasts exactly half as long, the cheaper drive is still a better deal.
Re:Is cheaper really better? by brianwski · 2016-05-18 06:09 · Score: 5, Informative

> Does it really pay off in the long-run to buy lower quality drives?

Disclaimer: Brian from Backblaze here. We use a fairly small, simple spreadsheet to answer that exact question. If Drive A is the same size as Drive B but fails 1% more often, then we might choose the drive that fails at a higher rate if is 2% cheaper, and if it is 10% cheaper it is a slam dunk. Make sense?

You ask about warranty. We enter the warranty information into the simple spreadsheet. If a warranty is 5 years long, then replacement drives are free during that time. If the failure rate is 1% per year, then that warranty is worth exactly 5% to us. If a drive with no warranty at all is 10% cheaper, then it is cheaper. If the drive with no warranty is 2% cheaper then we purchase the drive with the warranty.

In reality, the simple spreadsheet has a few more categories. For example, an 8 TByte Hard Drive takes half the datacenter space rental as two 4 TByte drives and the 8 TByte drive takes about half the electricity of the two 4 TByte drives. So if they were the same price we would obviously choose the 8 TByte drive. But they aren't the same price, so the additional cost of the 8 TByte drive has to be recovered over three years of reduced cabinet space rental costs and reduced electricity costs. We purchase drives once per month, so we get 20 bids from our cheapest suppliers, and right now SOME months Backblaze ends up purchasing the 8 TByte drives because they will pay for themselves within 3 years, and some months we go back to the 4 TByte drives because they are so ridiculously cheap it would take 7 years for the 8 TByte drives to pay for themselves.
Re:Is cheaper really better? by brianwski · 2016-05-18 06:13 · Score: 5, Informative

Brian from Backblaze here. This is exactly correct. We have redundancy across multiple computers in multiple locations in our datacenter, so losing one drive is usually a calm, non critical event that we take up to 24 hours to replace at our leisure during business hours.

If you are interested in details of our redundancy, here is a blog post about our "Vaults": https://www.backblaze.com/blog...

Summary of article: Backblaze uses Reed-Solomon coding across 20 computers in 20 locations in our datacenter. It is a 17 data drive plus 3 parity configuration, so we can lose any 3 entire pods in 3 separate racks in our datacenter and the data is still completely intact and available.
Re:Is cheaper really better? by Mr.CRC · 2016-05-18 14:59 · Score: 2

I'd rather have two of the $130 drives in RAID1, vs. the $264 drive. Then buy a cup of coffee.
I just experienced the first drive failure ever, since getting my first 20MB Winchester hard drive in the 80s. Fortunately it happened while testing before using as temp. storage to allow repartitioning another drive. Granted, my drives have only ever seen "desktop" workloads.
Re:Is cheaper really better? by Mr.CRC · 2016-05-18 15:02 · Score: 1

Fucking capitalists kick ass!
Re:Is cheaper really better? by Coren22 · 2016-05-19 07:46 · Score: 1

By chance do you guys sell the hardware for the storage boxes? I would love to have one of these in my house.

--
APK likes to ask for responses to the same things over and over. Maybe he just likes the responses?
Re:Is cheaper really better? by brianwski · 2016-05-27 10:01 · Score: 1

> By chance do you guys sell the hardware for the storage boxes?

You are in luck! Backblaze does NOT sell the hardware, but we give the design away entirely for free (and others sell it unassembled or assembled for a tiny markup). You can review the latest design here including downloading schematics and specs and parts lists to assemble your own: https://www.backblaze.com/blog...

It sounds like you only want one, and you may not want to worry about assembling it yourself, so you should definitely check out: http://www.45drives.com/ who will sell you a completely assembled storage pod without drives, or may still even sell you a "kit" of the parts that you have to build yourself to save some money.

Backblaze doesn't get anything at all from this, so you might ask why it is all this way. Two things: first of all, we aren't in the business of making and selling hardware, we sell raw storage as a service (our B2 product line) and also we sell online backup. It doesn't HURT Backblaze to release the designs and we get a little free press and good will about it and people hear our name and might want to purchase the OTHER products we actually charge money for. Also, the very nice people at "45 drives" helped us when we were starting out by prototyping our sheet metal and helping with industry cad drawings and such (we were mostly software people, don't know much about manufacturing) so we simply want good things for them. Finally, Backblaze benefits by a larger ecosystem of people using this design. Some of the past improvements have been contributions from OTHER companies and people improving our original design and giving back the improvements.
Re:Is cheaper really better? by Coren22 · 2016-05-31 02:47 · Score: 1

Thank you, that is exactly what I was looking for.

--
APK likes to ask for responses to the same things over and over. Maybe he just likes the responses?

Re:Why does this matter? by funwithBSD · 2016-05-18 04:11 · Score: 1

If I have 3 women have a baby, it will get done in 3 months.

MS Project told me so.

--
Never answer an anonymous letter. - Yogi Berra

Re: Why does this matter? by MachineShedFred · 2016-05-18 04:15 · Score: 3, Informative

They buy Seagate because Seagate will allow them to do volume purchases.

It's a bit easier to go to your local Best Buy and get one or two drives of whatever manufacturer you want then to buy 10,000 drives in a single order. The article specifically says that WD and Toshiba haven't been able to get that done, where Hitachi and Seagate have.

--
Slashdot still doesnâ(TM)t support Unicode after it was added to the HTML standard in 1997.

Re: Why does this matter? by Anonymous Coward · 2016-05-18 04:51 · Score: 0

The sun could explode right now. Any number of reasons could cause such a catastrophic failure.
We might get ripped out of Orbit by a small black whole, or our rotation nullified.
The UNIVERSE could just blip out of existence, rip, reverse time or worse.
Past events can only guess at the future. We only have one universe and one star as reference.

But just like weather prediction, a million little discrete weather events can bubble up and make it snow in the middle of a heat wave. That was a nice summer.
No matter how many drives you have, even if a million lasted 10 years, the next could easily fail to turn on.

It is like arguing 777777777 isn't as random as 564568962, they are both random, 2 different forms of random numbers.
Statistics != evidence. It's an estimated guess, a risk assessment, but not evidence.
Unless we have 100% reliable prediction, in which case the future is deterministic and boring!

Re:Why does this matter? by marcansoft · 2016-05-18 04:59 · Score: 1

Yup, they don't have any Seagate 3TB drives this time around... because they were so bad they ditched them all late last year. Meanwhile, as you mention, the ST4000DM000 (at 2.54% failure, sample size 34k drives) is doing better than the WD drives. The ST4000DX000 stat is not statistically significant, as they don't have many of those drives.

Re: Why does this matter? by Anonymous Coward · 2016-05-18 05:08 · Score: 0

You do know that large part of the information necessary to know if a drive is good or not, is if it's reliable, right?

And how does one measure reliability? By operating a shitload of them for a reasonable period of time, and look for failure trends. Hey, wait...

Yes, dealing with replacement costs more than buy by raymorris · 2016-05-18 05:12 · Score: 1

> If a drive is still under warranty, do I really care if it fails at time X versus 2*X?

I certainly do. Buying the drive costs maybe $130. Compare the cost to handle a failure:
Having a tech pull the pod, hook it up to the pod tester to find the bad drive, run that drive through the test sequence to prove (to the manufacturer) that it really is bad, fill out the RMA request, box it up and ship it, put a replacement drive in the pod, reinstall and activate the pod, handle receipt of new drive later.

Handling costs of a failure (under warranty or not) is probably $200. That's more significant than the purchase cost of a drive that's out of warranty.

The above description is for Backblaze. In my case, the procedure for a failed disk starts with "drive over to the datacenter". It ends with "hope that the firmware on the replacement drive doesn't have any glitches in my environment". I don't want to drive over there and deal with it. I'm more interested in drives that don't fail than drives that will be replaced under warranty.

On a local PC, the process probably begins with "hope that the backups worked correctly last night, and that nothing goes wrong with the restore".

Re:Why does this matter? by nmb3000 · 2016-05-18 05:20 · Score: 1

When will people stop picking stupid manufacturer sides when it comes to drive reliability? It has nothing to do with manufacturers and everything to do with models.

Completely disagree. Of course there are variations between models made by the same company, but it's the company's Big Bosses that decide on the margins and upper limits on tolerances and failures. If HGST is content with a 20% profit margin but Seagate expect 30% and both companies sell in the same segment, that extra 10% has to come from somewhere. Perhaps by using cheaper electrical components on the control board or more lax quality controls -- more likely some combination. That's the difference.

--
"What do you despise? By this are you truly known." --Princess Irulan, Manual of Muad'Dib
/)

Re:Why does this matter? by nmb3000 · 2016-05-18 05:23 · Score: 1

If there was ever a website that needed a "-1, Objectively Wrong" moderation option, it's Slashdot.

--
"What do you despise? By this are you truly known." --Princess Irulan, Manual of Muad'Dib
/)

Re:Why does this matter? by Bengie · 2016-05-18 05:25 · Score: 1

My cousin worked in a multi-petabyte datacenter and told me Seagate was crazy and Hitachi was the best.

Re: Why does this matter? by Bengie · 2016-05-18 05:27 · Score: 1

Statistics can give you 99.9999%+ predictability of a group.

Re:ST3000DM001? In a DATA CENTER? by brianwski · 2016-05-18 06:40 · Score: 5, Informative

> What ... is this company doing using consumer hard drives in a ... data center? .... they will fall out of an array every time there's a URE

Brian from Backblaze here. You assume we use RAID (inside of one computer), which is incorrect. We wrote our own layer where any one piece of data is Reed Solomon encoding across 20 different computers in 20 different locations in our datacenter (which is using some of the excellent ideas from RAID and ditching some of the parts that don't work well in our particular application). Our encoding happens to be 17 data drives plus 3 parity. We can make our own decisions about what to do with timeouts. When doing reads, we ask all 20 computers for their piece, and THE FIRST 17 THAT RETURN are used to calculate the answer. Now if one of the computers does not respond at all we send a data center tech to replace it. But if it was just momentarily slow a few times a day we let it be (we don't eject it from the Reed Solomon Group).

> These drives are only meant to be powered on a few hours a day and consumer workload duty cycles

I think a really interesting study would be to power a few thousand drives up once per day for an hour and shut them down. Compare it to a control group of the same drives left on so their temperature did not fluctuate. See which ones last longer without failure. I honestly don't have the answer. (Really, I don't.) What I do know is that Backblaze has left 61,590 hard drives continuously spinning, most of these are often labeled as "consumer drives", and that the vast majority of drives last so long that we copy the data off onto massively more dense drives (like copying all the data off a 1 TByte drive into an 8 TByte drive) not because the 1 TByte fails, but because it ECONOMICALLY MAKES SENSE. An 8 TByte drive takes less electricity per TByte, takes 1/8th the rack space rental, etc. So Backblaze honestly wouldn't care if the "Enterprise Drives" lasted 10x as long in our environment-> we would STILL replace them at the same moment.

Re:Why does this matter? by thegarbz · 2016-05-18 08:34 · Score: 1

Another person who doesn't understand data.

If 3 women have a baby and all end up the same you known nothing. If two of them miscarry in those three months, you learn a hell of a lot without ever getting a baby. You don't need to run every life to failure to learn something from statistics.

Well you do if you can't understand statistics, ... I recommend you do your master's thesis on mayflys, otherwise you'll never get done.

Re:Why does this matter? by GrumpySteen · 2016-05-18 12:53 · Score: 1

seeing the sun rise in the east every day for the last few thousand years of human recorded history tells us nothing about what'll happen tomorrow!

When you look a little closer and have enough data points, you'll find that the sun doesn't rise in the same exact place every day. The position varies over a cycle of about 365.25 days. You can indeed see that pattern with hundreds of thousands of data points. You cannot see it with 10.

Thanks for proving my point, even if you were being an ass in the process.

Re:Why does this matter? by funwithBSD · 2016-05-18 17:25 · Score: 1

"Another person who does not understand sarcasm."

Fixed that for you.

--
Never answer an anonymous letter. - Yogi Berra

Re:Why does this matter? by Anonymous Coward · 2016-05-19 02:10 · Score: 0

Oh look! It's the autism-hating Slashdot troll again!

Re:Why does this matter? by thegarbz · 2016-05-19 02:21 · Score: 1

Sarcasm is easily understood when the context or the content alludes to it.

You're communicating using a form where 90% of the context is absent. I can't hear the tone of your voice, I can't see your facial expressions, and I just finished dealing with someone who had absolutely no idea. Hence your reply looked like just another person who had no clue.

In order to communicate sarcasm you need to actually communicate it. Someone failing to understand it is a symptom of a filed communication.

But no one on Slashdot would EVER do that .... /sarcasm.

Re:Why does this matter? by funwithBSD · 2016-05-20 04:18 · Score: 1

Well, the phrase "MS project told me so" was sort a huge fucking clue mate. Unless you are insulting my intelligence by thinking that was a serious comment.

So was the fact that "2 women can't make a baby in half the time" is a well known axiom about the futility of trying to shorten the timeline of a single threaded task.

--
Never answer an anonymous letter. - Yogi Berra

Re:Why does this matter? by thegarbz · 2016-05-20 23:26 · Score: 1

Unless you are insulting my intelligence by thinking that was a serious comment

I was just talking to someone who had no fucking clue at all. So... yes I actually took a good chunk of your comment seriously.

But again. Communication is a two way street and maybe I'm just a complete and utter idiot who takes everyone's word as literal. You would be wise to remember someone could always confuse what you're saying especially in an impersonal context free communication medium.

is a well known axiom

And to extend on this I've never heard this before. Not all slashdotters are programmers. ... well not yet ... according to the government we'll all be programmers soon :-)

Slashdot Mirror

Backblaze Releases Billion-Hour Hard Drive Reliability Report (extremetech.com)

130 comments