8TB Drives Are Highly Reliable, Says Backblaze (yahoo.com)

← Back to Stories (view on slashdot.org)

8TB Drives Are Highly Reliable, Says Backblaze (yahoo.com)

Posted by BeauHD on Tuesday August 2, 2016 @10:10AM from the data-loss dept.

An anonymous reader writes from a report via Yahoo News: Cloud backup and storage provider Backblaze has published its hard drive stats for Q2 2016. Yahoo News reports: "The report is based on data drives, not boot drives, that are deployed across the company's data centers in quantities of 45 or more. According to the report, the company saw an annualized failure rate of 19.81 percent with the Seagate ST4000DX000 4TB drive in a quantity of 197 units working 18,428 days. The next in line was the WD WD40EFRX 4TB drive in a quantity of 46 units working 4,186 days. This model had an annualized failure rate of 8.72 percent for that quarter. The company's report also notes that it finally introduced 8TB hard drives into its fold: first with a mere 45 8TB HGST units and then over 2,700 units from Seagate crammed into the company's Blackblaze Vaults, which include 20 Storage Pods containing 45 drives each. The company moved to 8TB drives to optimize storage density. According to a chart provided in the report, the 8TB drives are highly reliable. The HGST HDS5C8080ALE600 worked for 22,858 days and only saw two failures, generating an annualized failure rate of 3.20 percent. The Seagate ST8000DM002 worked for 44,000 days and only saw four failures, generating an annual failure rate of 3.30 percent." For comparison, Backblaze's reliability report for Q1 2016 can be found here.

UPDATE 8/2/16: Corrected Seagate Model "DT8000DM002" to "ST8000DM002."

209 comments

Min score:

Reason:

Sort:

Yeah, but... by by+(1706743) · 2016-08-02 10:13 · Score: 5, Funny

...they use helium in the drives, so all your music sounds like Alvin and the Chipmunks.
1. Re: Yeah, but... by Luthair · 2016-08-02 10:17 · Score: 1
  
  Jokes aside not all the 8tb drives are helium based iirc.
2. Re:Yeah, but... by Anonymous Coward · 2016-08-02 10:28 · Score: 0
  
  ...they use helium in the drives, so all your music sounds like Alvin and the Chipmunks.
  Tell me about it. All of my pr0n money-scenes sound like a pair of budgies fighting.
3. Re: Yeah, but... by by+(1706743) · 2016-08-02 10:32 · Score: 1
  
  Ah, you're right.
4. Re: Yeah, but... by Jeremiah+Cornelius · 2016-08-02 12:05 · Score: 1
  
  This much capacity is overkill for music storage, and only useful in the homes of pr0n enthusiasts. SATAn storage.
  
  --
  "Flyin' in just a sweet place,
  Never been known to fail..."
5. Re:Yeah, but... by TheGratefulNet · 2016-08-02 14:27 · Score: 1
  
  to fix that, just run your 45's at 33.
  (GOML)
  
  --
  
  --
  "It is now safe to switch off your computer."
6. Re:Yeah, but... by TheGratefulNet · 2016-08-02 14:35 · Score: 1
  
  speaking of music, I have a hard drive based music player in my car and its been in the car since about 2003. it had whatever was the best IDE (not sata, too early for sata) notebook drive of its time. I put as much mp3 fileage on that as would fit.
  in all these years of hot and cold (bay area, but still we get some hot days when the car is left out in the sun) and vibration from daily driving, I have yet to replace that drive! it could have been a samsung or ibm or maybe hitachi. funny enough, none of those really make drives anymore, do they?
  I keep expecting this old ide notebook drive to fail but it keeps on running. in the trunk of my car right over a wheel well storage area, so it gets more than its share of rough road bumps.
  
  --
  
  --
  "It is now safe to switch off your computer."
7. Re:Yeah, but... by Hardness · 2016-08-02 14:46 · Score: 1
  
  Damn, that's some serious Monster Cable logic right there...!
8. Re:Yeah, but... by jedidiah · 2016-08-02 16:53 · Score: 1
  
  I have a 1.5TB Seagate drive that is still spinning.
  You can manage to get lucky even with the most notorious hardware.
  
  --
  A Pirate and a Puritan look the same on a balance sheet.
9. Re: Yeah, but... by Anonymous Coward · 2016-08-02 20:28 · Score: 0
  
  what about automatic backups?
10. Re:Yeah, but... by NormalVisual · 2016-08-02 20:30 · Score: 1
  
  Agreed. In the last 20 years, I've had only two drives fail out of more than a dozen - the original 30GB (was either Quantum or Maxtor) in my G4 Power Mac back in 2008, and a 2TB Seagate in a home-built machine in 2012.
  
  --
  Please stand clear of the doors, por favor mantenganse alejado de las puertas
11. Re:Yeah, but... by Anonymous Coward · 2016-08-03 01:45 · Score: 0
  
  And it's good for over an hundred years!
  
  The Seagate ST8000DM002 worked for 44,000 days ...
  I wonder who used 8TB drives in 1916?
12. Re:Yeah, but... by beastofburdon · 2016-08-03 05:17 · Score: 1
  
  All of them still make hard drives.
13. Re:Yeah, but... by RockDoctor · 2016-08-03 07:07 · Score: 1
  
  My original 60MB hard drive was still working when the burglars took it (and the computer it was in) 12 years after I got it. (Actually it was 40MB when I got it, but you know that trick, I'm sure.)
  
  --
  Birds are not dinosaur descendants;birds are dinosaurs, for all useful meanings of "birds", "are" and "dinosaurs"
14. Re: Yeah, but... by Anonymous Coward · 2016-08-03 15:22 · Score: 0
  
  For the youngsters: run your 7200s at 5400.
More that HGST are reliable by Anonymous Coward · 2016-08-02 10:16 · Score: 0

I think it's more about HGST drives being highly reliable
1. Re:More that HGST are reliable by Anonymous Coward · 2016-08-02 10:22 · Score: 0
  
  Maybe you should finish reading the summary before thinking then.
2. Re:More that HGST are reliable by Anonymous Coward · 2016-08-02 10:27 · Score: 0
  
  I think it's more about HGST drives being highly reliable
  Ok, then explain the 8TB Seagate drives with a 3.3% annualized failure rate ...
3. Re: More that HGST are reliable by Anonymous Coward · 2016-08-02 10:42 · Score: 0
  
  By comparing it to the HGST with 3.2 percent?
  I dunno about you, but 3.3 vs 3.2 isn't blowing anyone's mind. Not even back blaze, and they make their money by crunching the numbers.
4. Re:More that HGST are reliable by gweihir · 2016-08-02 10:57 · Score: 2
  
  More like Seagate 8TB not being utter trash (like so many other Seagate drives).
  
  --
  Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
5. Re: More that HGST are reliable by sexconker · 2016-08-02 11:43 · Score: 1
  
  HGST n = 45
  Seagate n = 2700
  I'd have much more confidence in the Seagate sample's rating.
6. Re: More that HGST are reliable by dgatwood · 2016-08-02 13:33 · Score: 2
  
  I dunno about you, but 3.3 vs 3.2 isn't blowing anyone's mind. Not even back blaze, and they make their money by crunching the numbers.
  They're both terrible numbers, though perhaps not terrible by Seagate standards. The best of the HGST 4 TB drives had an annualized failure rate of only 0.4%. If these numbers are correct, then these drives are about an order of magnitude less reliable than previous generations of hardware....
  Of course, the confidence intervals on these numbers are huge. On the low end, the HGST 8 TB drive could be approximately as reliable as the 4 TB HGST drives (.4%). On the high end, it could be as bad as a 12% annualized failure rate, which would put it into the "complete junk" category. In other words, 45 drives just aren't enough data points to be much more reliable than the anecdotal evidence from folks posting on Slashdot.
  
  --
  Check out my sci-fi/humor trilogy at PatriotsBooks.
7. Re:More that HGST are reliable by NormalVisual · 2016-08-02 20:42 · Score: 1
  
  I don't know that I'd call them "utter trash" - every drive manufacturer has had problems over the years. The pair of Seagates in my server had more than 60,000 hours each on them and still had completely clean SMART reports when I finally pulled the machine, and of all the dozen or so other Seagates I've had in the machines at home (a 2TB Black), I've only ever had one fail. In fairness though, the server lived in a data center where the environment was tightly controlled and was turned on/off only twice during that period, and all of the other machines have always been on a UPS.
  
  I know that's not enough data points to be anything more than anecdotal, but they've worked pretty well for me. I even had an IBM Deathstar for a few years and never had problems with it, so maybe I'm a disk whisperer or something. :-D
  
  --
  Please stand clear of the doors, por favor mantenganse alejado de las puertas
8. Re:More that HGST are reliable by AmiMoJo · 2016-08-02 21:06 · Score: 1
  
  It's simply down to where they position themselves in the market. HGST cost a little more but you get better testing and reliability. Seagate are cheaper but more hit-and-miss. If your product is at the low end of the market, a basic DVR sold in supermarkets at the lowest possible price, you fit a Seagate drive and don't worry too much about failures after the 1 year warranty period. If it's a quality product that sells for a little more you fit a Hitachi drive and your reputation for relaibility increases.
  This happens with almost all components. Cheap electronics have cheap parts, caps that leak after a few years, power supplies that fail when the cheap transistors die.
  If you can stand some failures and upgrade your drives regularly anyway, Seagate might be cheaper for you. If you prefer reliability and long term data retention, pay a little more for HGST.
  
  --
  const int one = 65536; (Silvermoon, Texture.cs)
  SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
9. Re:More that HGST are reliable by vikingpower · 2016-08-02 21:30 · Score: 1
  
  If you can stand some failures and upgrade your drives regularly anyway, Seagate might be cheaper for you. If you prefer reliability and long term data retention, pay a little more for HGST.
  Makes sense. At the start-up where I'm currently being chief engineer, we're foreseeing a storage problem in a couple of months, and writing down specs and requirements for a long-term storage solution. I'm leaning towards the HGST solution, for reasons of long term data retention: it will definitely be a use case that a customer asks for 5- or 10-year old data, and we don't want to be able to fulfill that request by being penny wise pound foolish on Seagate. YMMV.
  
  --
  Religous speak to God. Insane are spoken to by God. When all shut up, one can finally hear Shostakovich in peace
10. Re: More that HGST are reliable by Bengie · 2016-08-03 05:32 · Score: 1
  
  From a 2007 report from Google, the 1 year annualized failure rate for their HDs, not broken up by brand or model, was about 1.5%. Between RMA rates from NewEgg and reported failure rates, 4% seems to be the norm. 0.4% sounds pretty good.
11. Re: More that HGST are reliable by rahvin112 · 2016-08-03 06:06 · Score: 1
  
  Small sample size on the HGST. Pay attention to the number of drives or it's not a reliable statistic. The long term 2013 - 2016 Seagate number is 3%, this is really low.
12. Re:More that HGST are reliable by gweihir · 2016-08-03 22:35 · Score: 1
  
  Probably best give the storage job to Backblaze if possible. They really know what they are doing. For such requirements, redundancy is key.
  
  --
  Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
Reliability by Hylandr · 2016-08-02 10:19 · Score: 1

Reliability is not so great an issue with raid systems being what they are today. What the bean counters fail to consider is the cost in man power required to replace seagate drives on a constant basis. Not just in the racks but process RMA's or the proper destruction and disposal of drives which may contain sensitive data.
I wonder how those numbers would look if other vendors were offered an equal analysis period. I know WD was mentioned but it didn't appear they got equal share.
Also: First. :)

--
~ People that think they are better than anyone else for any reason are the cause of all the strife in the world.
1. Re:Reliability by Anonymous Coward · 2016-08-02 10:26 · Score: 0
  
  Nope, look at comment timestamps, you were like 5th.
  You have to show all comments (i.e. hidden) below your threshold score
2. Re:Reliability by rthille · 2016-08-02 10:28 · Score: 2
  
  OTOH, given SSDs and the inability to guarantee the erasure of all data on the drive, unencrypted data should never hit the drives at all, and the key should of course also never be stored on the same media (unencrypted).
  That said, only my newer systems use encrypted volumes. My old drives I take apart and shatter/melt the platters.
  
  --
  Awesome furniture, accessories and cabinetry in Santa Rosa, CA: http://humanity-home.com/
3. Re:Reliability by Hylandr · 2016-08-02 10:31 · Score: 1
  
  the inability to guarantee the erasure of all data on the drive, unencrypted data should never hit the drives at all
  Wow, that's not something I had considered. Thanks for that bit of info!
  
  --
  ~ People that think they are better than anyone else for any reason are the cause of all the strife in the world.
4. Re:Reliability by Jahoda · 2016-08-02 10:36 · Score: 1
  
  Clearly, something is up with the Seagate DX series, but they have thousands of the DM models with a 2.66% failure rate. That's pretty remarkable. I've personally been very pleased with this series from seagate.
5. Re:Reliability by Anonymous Coward · 2016-08-02 10:44 · Score: 0
  
  I hope you don't mean the ST3000DM001 (3TB), which has a failure rate well above 30% in 2yr, I have *personally* junked 5 out of 7 of these crap Seagate drives, they start losing data (aka weak sectors) *or* suffer massive media damage (goes from a few hundred bad sectors to thousands then b0rks in the span of a few hours).
  I should open one of these to check what happened (head crack? media failure? contamination?)... maybe I will.
6. Re:Reliability by JustAnotherOldGuy · 2016-08-02 10:48 · Score: 1
  
  My old drives I take apart and shatter/melt the platters.
  I use a drill press to bore 8 or 10 holes straight through my old drives, then for fun I hit 'em with a hammer a few times while whispering my ex-wife's name. If the CIA/NSA/FBI wants my data bad enough to recover it after that, they're welcome to it.
  
  --
  Just cruising through this digital world at 33 1/3 rpm...
7. Re:Reliability by Archangel+Michael · 2016-08-02 10:51 · Score: 1
  
  If you are looking at the Economics of this from a Labor / Parts issue, you're already behind the curve.
  The real value is the data on the drives, and how much you'll miss that data if and when it goes away. The problem is, nobody values their data, until it is gone. And then it is too late.
  
  --
  Agent K: A *person* is smart. People are dumb, stupid, panicky animals, and you know it.
8. Re:Reliability by Jahoda · 2016-08-02 10:53 · Score: 2
  
  You know what? You're absolutely right and I do stand corrected - I recall this about the 3 TB - probably from Backblaze's data - and I want to say that I think they were first hitting the market after the Thailand disaster? It seems like the 4 TB models are pretty resilient. Anecdotally, I have 8 handling my home library and backups, and have had no failures since I started buying them in March 2013..
9. Re:Reliability by chuckugly · 2016-08-02 11:02 · Score: 1
  
  RaidZ2
10. Re:Reliability by PCM2 · 2016-08-02 11:22 · Score: 2
  
  Reliability is not so great an issue with raid systems being what they are today.
  
  At the scale Backblaze is talking about, I would say it's an issue. Somebody has to keep all those drives in stock and walk back to a cage to replace them. It's not data loss we're worried about here, it's costs.
  
  --
  Breakfast served all day!
11. Re:Reliability by aliquis · 2016-08-02 12:14 · Score: 1
  
  Only Seagate and HGST (was willing to?) provided the volumes they needed / wanted (seem weird) I think was the claim the last time this was posted.
12. Re:Reliability by Hylandr · 2016-08-02 13:10 · Score: 1
  
  And that's exactly what I stated in my post too.
  
  --
  ~ People that think they are better than anyone else for any reason are the cause of all the strife in the world.
13. Re:Reliability by Wrath0fb0b · 2016-08-02 13:55 · Score: 1
  
  OTOH, given the inability to guarantee the erasure of all data on any drive, unencrypted data should never hit the drives at all, and the key should of course also never be stored on the same media (unencrypted).
  FTFY.
  You are absolutely correct though -- you should never rely on making data inaccessible via erasure instead of via encryption.
  Incidentally, the ST8000DM002s that we are talking about here support for OPAL which makes it trivial to "throw away the key" by sending the drive a reset-DEK command.
14. Re:Reliability by AK+Marc · 2016-08-02 13:57 · Score: 4, Insightful
  
  OTOH, given SSDs and the inability to guarantee the erasure of all data on the drive,
  Wow, SSD even survives incinerators? Where I used to work, the policy for drives was to open them up and strip them for their magnets, then have magnet fun. The platters made good frisbees, but the problem is that they go through car windows, and the dents in cars are deep, so frisbee with care.
  
  --
  Learn to love Alaska
15. Re:Reliability by adolf · 2016-08-02 14:10 · Score: 1
  
  At the cost of these drives, isn't it just cheaper to rack the arrays on wheels, and shove them out the back door into the recycling trailer?
  
  --
  Kid-proof tablet..
16. Re:Reliability by Anonymous Coward · 2016-08-02 14:16 · Score: 0
  
  If you are looking at the Economics of this from a Labor / Parts issue, you're already behind the curve.
  The real value is the data on the drives, and how much you'll miss that data if and when it goes away. The problem is, nobody values their data, until it is gone. And then it is too late.
  If you could be sure you could get a very small failure rate, then all is well, and you may be able, to some extent, to have faith in a RAID array. Sadly, if a drive fails, there is a fair chance a second failure will occur during rebuild due to similar failure profiles. If your on RAID6 you might still be okay, unless of course a third drive fails. For home use, I have been using Software 6 with drives from 2 different manufacturers (western digital/seagate) to try to stagger the failure rates. I've already replaced 2 of the 5 drives. Smartctl reports the following ages after doing a short test. 3.1yr, 3.6yr, 1.4yr, 3.8yr, 3.8yr. This machine is idle 99% of the time, and it still chews through them fairly often.
  I'm guessing the first failed early on. The most recent one replaced was a western digital caviar green. Sadly the last one just failed a short test. It is the same brand. I reran the test, and the lba advanced by 1 for the first error.
  Time to order yet another drive it seems...
17. Re:Reliability by dbIII · 2016-08-02 14:34 · Score: 1
  
  Backblaze has redundant servers. It's the only way their "jam drives in where they will fit" pod designs make sense. At max load most of those drives will still be idle.
  IMHO raidz2 is a better idea but it's not what they are doing.
18. Re:Reliability by jedidiah · 2016-08-02 16:55 · Score: 1
  
  Even for a small home array, it's terribly annoying when all of your drives die young and all at the same time (Seagate).
  
  --
  A Pirate and a Puritan look the same on a balance sheet.
19. Re: Reliability by Anonymous Coward · 2016-08-02 19:45 · Score: 0
  
  WD greens aren't appropriate for RAID arrays.
20. Re:Reliability by NormalVisual · 2016-08-02 20:44 · Score: 1
  
  Belt sanders are fun too!
  
  --
  Please stand clear of the doors, por favor mantenganse alejado de las puertas
21. Re:Reliability by NormalVisual · 2016-08-02 20:45 · Score: 2
  
  The platters made good frisbees, but the problem is that they go through car windows, and the dents in cars are deep, so frisbee with care.
  
  And they can hurt too. Not that I'd have any personal experience with that....
  
  --
  Please stand clear of the doors, por favor mantenganse alejado de las puertas
22. Re:Reliability by AmiMoJo · 2016-08-02 20:54 · Score: 1
  
  HDDs are even worse. They silently remap blocks all the time, with no way to erase the data off the old partially-failed ones. At least most SSDs use encryption, and doing an ATA secure erase command will wipe and regenerate the key.
  Also, SSDs are much easier to physically destroy in a shredder or with a blowtorch. Much less metal and armour on them.
  
  --
  const int one = 65536; (Silvermoon, Texture.cs)
  SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
23. Re:Reliability by Anonymous Coward · 2016-08-02 20:57 · Score: 0
  
  You should also consider that this is the result of wear leveling.
  Spinning platter drives might also use wear leveling so if you need to dispose of drives with private information on them you should use physical destruction methods, regardless of type.
24. Re:Reliability by tigersha · 2016-08-02 20:59 · Score: 1
  
  Jesus H. Jeremiah Christ, I just installed one of those in a ProxMox virtualisation machine on the weekend. Will have to pull it tonight, thanks for the heads-up.
  
  --
  The dangers of excessive individualism are nothing compared to the oppressiveness of excessive collectivism
25. Re:Reliability by Anonymous Coward · 2016-08-02 21:04 · Score: 0
  
  RAID protects uptime, not data. Protection against data loss is done with backups.
  Sure, RAID gives you some extra margin before your data is gone, but that is when the data loss is due to a manufacturing quality issue.
  If the drive dies because of external factors like a power supply failure then there is a high probability that both drives will fail.
  No raid in the world will protect you against user errors or a malicious hacker that encrypts or removes your data.
  RAID is for the few cases where you need to be able to replace a drive without shutting down the computer.
  If you just want the extra performance you use striping.
26. Re:Reliability by AK+Marc · 2016-08-02 21:55 · Score: 1
  
  Figured anything that can go through a car window (by breaking it) would likely hurt.
  
  --
  Learn to love Alaska
27. Re:Reliability by i.r.id10t · 2016-08-03 01:01 · Score: 1
  
  Had to destroy a drive that had a lot of student data on it.
  Used a FN-FAL and 147 grain FMJ bullets at about 2700 feet per second.
  
  --
  Don't blame me, I voted for Kodos
28. Re:Reliability by chuckugly · 2016-08-03 02:11 · Score: 1
  
  I was too brief. I was trying to say that anyone who thinks the "real value is the data on the drive" as the GP did and is still going to put that data on a single drive without some redundancy or at least backup has already lost the plot no matter how reliable they believe their single drive solution to be. The folks at BB have the right idea IMO.
29. Re:Reliability by JustAnotherOldGuy · 2016-08-03 03:41 · Score: 1
  
  Had to destroy a drive that had a lot of student data on it.
  Used a FN-FAL and 147 grain FMJ bullets at about 2700 feet per second.
  Yeah, I used to take them out and shoot them (it's fun) but I got tired of picking up all the little bits and pieces of the hard drive. I don't want to litter my shooting areas with fragments of stuff like that. But it is fun to shoot a hard drive and watch it turn to metallic confetti. :)
  
  --
  Just cruising through this digital world at 33 1/3 rpm...
30. Re:Reliability by Anonymous Coward · 2016-08-03 03:58 · Score: 0
  
  What you have to do with the consumer SSDs is to leave them at the window frame unpowered for a year. Thank JEDEC, this will ensure that every bit in the drive is irrecoverably lost!! With enterprise models this can be done in a month!
31. Re:Reliability by brianwski · 2016-08-03 05:04 · Score: 1
  
  > Protection against data loss is done with backups, not RAID.
  
  RAID helps against data loss for some causes of data loss (like hard drives going bad).
  
  However, RAID doesn't protect against human error or software bugs - if you tell a RAID system to delete a file it is deleted - RAID does not mean you can roll back time. If you have a "backup" from a few days ago, if you realize you just destroyed some data with user error, you can use the backup to recover most of the data you just lost.
  
  Just to be absolutely clear - Backblaze does not use RAID inside each pod anymore, we use our own Reed-Solomon encoding across 20 drives in 20 different pods in 20 separate locations inside the datacenter. We open sourced the Reed-Solmon we use here: https://www.backblaze.com/blog... and you can read about how we organize the 20 different pods into a "Backblaze Vault" here: https://www.backblaze.com/blog...
32. Re:Reliability by Anonymous Coward · 2016-08-03 05:11 · Score: 0
  
  Tiny bits of ceramic smash car windows easily, but don't really hurt. :)
33. Re:Reliability by Bengie · 2016-08-03 05:42 · Score: 1
  
  Been building and repairing computers for 25+ years and have worked in IT for quite a few. I have never seen a harddrive die from power issues. I have seen burnt motherboards, and melted traces where the power comes in, but never had an HD die from a surge or lightning strike. Pretty much only unexpected shutdowns in need of a scandisk. I have seen drives die for a myriad of other reasons.
  
  How common are surge/lightning/PSU-blow-up HD deaths? My limited experience is "not often" since I've never seen one.
34. Re:Reliability by Anonymous Coward · 2016-08-03 08:31 · Score: 0
  
  A degausser would remove all the data off of a failed drive.
35. Re:Reliability by Anonymous Coward · 2016-08-03 16:48 · Score: 0
  
  Just to be absolutely clear - Backblaze does not use RAID inside each pod anymore, we use our own Reed-Solomon encoding across 20 drives in 20 different pods in 20 separate locations inside the datacenter. We open sourced the Reed-Solmon we use here: https://www.backblaze.com/blog... and you can read about how we organize the 20 different pods into a "Backblaze Vault" here: https://www.backblaze.com/blog...
  
  From my brief skim, the information coding is basically the same technology as Raid6, save you have 3 parity drives instead of 2, which likely ups the complexity a bit.
  If you are still reading this, I'm curious, what is the reliability of 17 data drives and 3 parity drives? I suspect I'm missing something. I'm fairly sure the coding for the 17+3 solution could be put back into Linux's software raid without a huge amount of trouble. The storage pods I glance at look like 3 sets of 20 drives. At a guess, you could probably use linux to assemble that same set into 3 sets of 20 drive software raid 6 arrays, and again, could probably do the 17+3 solution with work. With a fast intel cpu (AES-NI), you could probably offer encrypted storage as well, though again, there are details involved in mounting it and such automatically.
  I'm not doubting that the Java solution works, nor that it is fast and reliable. It just seems odd not to just modify the Linux Software Raid solution directly.
  Either way, much thanks for the hard drive failure data...
36. Re:Reliability by drsmithy · 2016-08-03 20:18 · Score: 1
  
  You can't (easily and reliably) stretch a RAID across hosts.
  Backblaze take a chunk of data and break it up into 20 smaller chunks (17 data + 3 parity) and then spread those 20 chunks across 20 different physical servers. You can't do that with RAID.
  It would also reduce the overall load during disk rebuilds as well.
  https://www.google.com.au/sear...
DO SPIES EXIST THOUGH? WHICH COMPANIES? by Anonymous Coward · 2016-08-02 10:25 · Score: 0

If your 8 TB hard drive is highly reliable, can you rely on your network connection and operating system to spy on everything you do?
Which city is Ed Snowden in presently?
Correct Seagate 8 TB Model by Jahoda · 2016-08-02 10:29 · Score: 1

Totally not trying to be pedantic, but the Seagate model they reference should actually be the "ST8000DM002"
1. Re:Correct Seagate 8 TB Model by BeauHD · 2016-08-02 10:36 · Score: 1
  
  Ah, good catch! It has been corrected.
2. Re:Correct Seagate 8 TB Model by Anonymous Coward · 2016-08-02 10:54 · Score: 5, Funny
  
  I've found one! The mythical Slashdot editor who edits. All hail the editing editor!
3. Re:Correct Seagate 8 TB Model by TheDarkMaster · 2016-08-02 23:29 · Score: 1
  
  Quick ! Preserve him in carbonite before he gets away!
  
  --
  Religion: The greatest weapon of mass destruction of all time
Not SSD Drives by Dracos · 2016-08-02 10:33 · Score: 1

These are all platter drives, but you can only discover that in the comments at TFA.
There are so few 8GB HGST drives, and they're so new, that the current data about them is statistically insignificant/unreliable, as is any model with less than 500 units and 200k drive days.
1. Re: Not SSD Drives by Anonymous Coward · 2016-08-02 10:45 · Score: 0
  
  Read up on back blaze.
  They only report numbers that they determine are significant. They likely have more 8tb drives, just not in big enough numbers.
  There is a chance that they even report error values in their report.
  Seriously, if you are interested in failure rates of drives. Backblaze are the only public dataset that exist for that information.
2. Re:Not SSD Drives by barc0001 · 2016-08-02 10:58 · Score: 1
  
  Why is heaven's name would anyone think that a cheap cloud backup company would be installing 8 TB SSDs in their massive storage arrays? Those things are thousands of dollars per drive!
3. Re:Not SSD Drives by msauve · 2016-08-02 12:30 · Score: 3, Interesting
  
  "There are so few 8GB HGST drives, and they're so new, that the current data about them is statistically insignificant/unreliable"
  
  The numbers in the summary come from different places, because the first chart in the linked article, for the April-June quarter says:
  
  Seagate 8TB, 2720 drives, 35840 drive days, 3 failures (13 days average per drive, 3% annual failure rate)
  HGST 8TM, 45 drives, 3825 drive days, 0 failures (85 days average per drive, 0% annual failure rate)
  
  The second chart, from April 2013 through the end of June, doesn't show drive numbers, just days, failures, and rates. The numbers in the summary seem to be pulled from both.
  
  Assuming that the 8TB drives stay in use until they die, here's where the stats seem to come from (drive days/# of drives). Drive days pulled from the "all time" chart, # of drives from the latest quarter chart):
  
  22858/45= 507 days average use HGST HUH728080ALE600
  44000/2700= 16 days average use Seagate ST8000DM002
  
  Now, anyone experienced with Seagate wouldn't expect the 3.3% annualized failure rate to be that low in another year and a half. The HGST rate _is_ after almost a year and a half.
  
  --
  "National Security is the chief cause of national insecurity." - Celine's First Law
4. Re: Not SSD Drives by brianwski · 2016-08-03 05:10 · Score: 1
  
  Brian from Backblaze here.
  
  > back blaze ... only reports numbers that they determine are significant.
  
  That's not true. We provide a COMPLETE dump of the raw data for anybody who wants to download it. Here is a link for the lazy: https://www.backblaze.com/b2/h...
Drive Qual yourself then? by Anonymous Coward · 2016-08-02 10:43 · Score: 0

Just buy a price point and run in your production gear? Never take a peak at the firmware, or drive performance over time? Errors handled correctly? Wow dude. That takes guts.
I guess if you're cloud, you don't give a crap about corruption, but if you're looking at over 5% AFR's, you might want to take a closer look. You've probably got a firmware problem, and they probably have a fix for it. It may be something specific to your work load. I'm sure you buy enough drives from them so you can at least get your distributor to set something up. Seagate and HGST have some really good engineers that would probably want to know about the fallout before a year has passed. That RMA thing can get swallowed in the numbers.
High failure rate by JustAnotherOldGuy · 2016-08-02 10:44 · Score: 5, Insightful

"...the company saw an annualized failure rate of 19.81 percent with the Seagate ST4000DX000 4TB drive"
A failure rate of almost 20% in a data center? Geez, that's pathetic.
A temperature-controlled environment, clean power, low shock and vibration, and 1 out of 5 still fails? Remind me never to buy Seagate. Oh, wait, I already vowed never to buy another Seagate- about 10 years ago after experiencing their unequaled propensity to die fast and hard.
Maybe other people have had better luck with Seagate than I have, but for me they've always been disappointing.

--
Just cruising through this digital world at 33 1/3 rpm...
1. Re: High failure rate by Anonymous Coward · 2016-08-02 10:50 · Score: 1
  
  Old quote from the fortune file...
  Friends don't let friends buy Seagate.
2. Re:High failure rate by fustakrakich · 2016-08-02 10:53 · Score: 1
  
  A failure rate of almost 20% in a data center? Geez, that's pathetic.
  Horribly so... And I don't believe that 3% is any good either. Move that decimal point one more to the left, then I might be impressed.
  
  --
  “He’s not deformed, he’s just drunk!”
3. Re:High failure rate by PRMan · 2016-08-02 11:09 · Score: 1
  
  I had one of those. 1 year warranty. And it died in 14 months.
  
  --
  Peter predicted that you would "deliberately forget" creation 2000 years ago...
4. Re:High failure rate by CanadianMacFan · 2016-08-02 12:05 · Score: 1
  
  Depends on how old the drives are. When I was working in a data centre I was having a lot of hard drive failures but they were laptop SCSI drives in blades that had been running continuously for over three years so it was expected for them to be hitting their end of life. (It was around 2007 so that's why the those were the drives.)
  So if the 4TB drives are a few years old getting a lot of use then I can see why they would be failing at a high rate. If they are newer then I would be worried. I'm more concerned that brand new 8TB drives are failing at an annualized rate of just over 3%. Maybe there's something about their data centre that's causing the failure rates? Perhaps they don't keep the temperature as cool as they should in order to save a few bucks?
5. Re:High failure rate by 0100010001010011 · 2016-08-02 12:06 · Score: 1
  
  Having owned a lot of the Seagate 2-3TB drives 20% is way too low from my experience. I think I have 2-3 still running out of a batch of 8. Including RMAs.
6. Re:High failure rate by JustAnotherOldGuy · 2016-08-02 12:29 · Score: 1
  
  Perhaps they don't keep the temperature as cool as they should in order to save a few bucks?
  That could be, but the other brands were failing at a much lower rate so it does make you wonder about the overall longevity of Seagate drives.
  
  --
  Just cruising through this digital world at 33 1/3 rpm...
7. Re:High failure rate by Pharmboy · 2016-08-02 12:50 · Score: 1
  
  They are using consumer drives for data center needs, this is the big reason their failure rate is relatively high. Still, with the redundancy, it is cheaper to run this way. Rumor is that Google ran that way with off the shelf computers. Use dirt cheap commodity products that are good quality, have exceptional redundancy, throw them away as they implode.
  
  --
  Tequila: It's not just for breakfast anymore!
8. Re:High failure rate by radarskiy · 2016-08-02 13:01 · Score: 2
  
  If you wrote off every manufacturer that hit a 20% annualized failure rate you would now be unable to buy any drives.
9. Re:High failure rate by Anonymous Coward · 2016-08-02 13:05 · Score: 0
  
  A temperature-controlled environment, clean power, low shock and vibration, and 1 out of 5 still fails?
  Moisture fluctuations were mentioned in a slashdot submission some time ago as a significant factor in disk drive destruction. Adaptive temperature control should be able deal with the relative moisture issues, however.
10. Re:High failure rate by waveclaw · 2016-08-02 13:44 · Score: 4, Informative
  
  If only that blackbaze pods were even remotely like other datacenter equipment. As far as vibration is concerned they are still pretty much a torture test for anything with a spinning motor. Minimal vibration protection while being mechanically coupled to a weak foundation while crammed in as tightly as geometry allows.
  
  A temperature-controlled environment, clean power, low shock and vibration, and 1 out of 5 still fails
  The density and structure of a pod is only temperature-controlled in that it is going to get hot, quickly.
  
  Remind me never to buy Seagate.
  The numbers from Backblaze you'll actually see that you shouldn't buy one particular desktop model of hard drive for your "datacenter." Numbers like Backblaze releases are quite fascinating in that you can analyze them. You can find which models at any vendor to prefer or avoid.
  
  Oh, wait, I already vowed never to buy another Seagate- about 10 years ago after experiencing their unequaled propensity to die fast and hard.
  Sorry to hear about your loss. I hope you kept backup copies. If not, I hope it taught you that if you don't have a copy then you don't have a backup.
  It is certainly reasonable to avoid a vendor when a lot of their products from many lines have defects at a given time. Seagate's desktop line certainly took a hit from the initial Backblaze numbers. The DM1000's huge failure rate is almost as legendary as the IBM Death Star line or Maxtor click-of-death. But stuff from before or after a given run may have better or worse quality. And of course even manufactures can get batches of bad parts. (Hidden variables like that are one of the reasons why the singular of data isn't anecdote.)
  I also wonder if we'll ever get numbers from Backblaze on things like the actual temperature, decibels and power these drives lived through. More than just avoiding a particular model. It would be nice to know how hot, loud and nasty you can get before your commodity-class storage starts pooping out.
  
  --
  
  "You cannot have a General Will unless you have shared experiences. You cannot be fair to people you don't know."
11. Re:High failure rate by adolf · 2016-08-02 13:58 · Score: 1
  
  Is there a real mechanical difference between "consumer" and "enterprise" drives, these days, at the bleeding edge of the storage-per-unit curve?
  Mostly I see differences in firmware, which (IMHO) ought to be end-user selectable anyway.
  (Before anyone replies, I chose those words carefully to avoid outliers like Raptor little-drive-in-a-big-heatsink configurations, or any other stuff that puts any metric other than capacity-per-dollar as a primary criteria.)
  
  --
  Kid-proof tablet..
12. Re:High failure rate by Zaelath · 2016-08-02 14:02 · Score: 1
  
  They seemed fine to me until they bought Maxtor in 2006; then you never knew what you were going to get, a Maxtor w/ a Seagate badge or a HDD that might have less than a 20% annual failure rate, in the first year.
  I'd guess since then; they closed all the Seagate factories and run exclusively from the cheaper Maxtor facilities. (all of that is a guess, but MBAs always think reducing cost > * so probably in the ballpark).
13. Re:High failure rate by dbIII · 2016-08-02 14:04 · Score: 1
  
  I also wonder if we'll ever get numbers from Backblaze on things like the actual temperature
  We got some numbers earlier from another story but they were entirely useless since they were average temperatures on machines that are idle most of the time. Maximums could tell us something useful. I won't bother linking to the earlier story because it was like a high school project. If it wasn't for their niche use of distributed archiving where their machines are unlikely to get very hot (but possibly individual drives would get very hot) I'd call their server design a high school project as well, but it actually makes sense for them. If a disk dies they can just shut down the server it is on, drag it out of a rack and dig the drive out of the middle of the "pod" out from under all the other drives without impeding their operations. They can afford a lot of server downtime just to replace single disks because others server have copies.
14. Re:High failure rate by vovin · 2016-08-02 14:11 · Score: 1
  
  Yes. There are specific mechanical differences in build quality around stability and vibration dampening between enterprise and consumer level drives. It's more than just flashing some different firmware (but that may be a part of what differentiates drives).
  The best indicators are length of warranty and specification of purpose, in my experience.
15. Re:High failure rate by dbIII · 2016-08-02 14:18 · Score: 1
  
  Perhaps they don't keep the temperature as cool as they should in order to save a few bucks?
  They have a few things about their "pod" design on their website and if you look at it you will see that you are correct. It looks like an utterly insane design until you consider that the things are mostly idle, so typically don't generate a lot of heat, and that they have distributed servers with distributed workloads so they can afford to lose one entirely for a while.
16. Re:High failure rate by dbIII · 2016-08-02 14:21 · Score: 1
  
  Yes but if the drives are failing due to heat it's not really useful information for someone else with file servers with very good airflow.
  Their results should be taken as an strong indication and not 100% reliable unless you have a situation very similar to Backblaze.
17. Re:High failure rate by dbIII · 2016-08-02 14:28 · Score: 1
  
  A temperature-controlled environment
  Not exactly. Take a look at their web page and their "pod" design. They have jammed in drives where they will fit but they have very different loads to a normal data center so can get away with it most of the time. Unlike a normal data center they will never be running all the drives in a "pod" flat out so something that would be a smoking mess elsewhere merely has drives in the middle that cannot shed heat properly.
  That's not to excuse Seagate, it's just to point out why failure rates are higher than what the rest of us experience and probably with a different mode of failure as well.
18. Re:High failure rate by Anonymous Coward · 2016-08-02 15:31 · Score: 1
  
  Wow.
  So you pull one statistic out of a long article, and reach a premature conclusion about an entire company.
  If you actually LOOK at the failure rate they've seen, Segate is around average. The failures have far more to do with model than they do with manufacturer. This voodoo of "driver maker X is worse than drive maker Y because I had bad luck once with drive maker X" is nonsense. Every drive maker has bad models that sometimes have high failure rates.
  Also, if you bothered to pay attention, you'll notice that they actually have a high percentage of Seagate drives.
19. Re:High failure rate by adolf · 2016-08-02 16:13 · Score: 1
  
  Don't you mean "damping"?
  And isn't the spindle motor still affixed firmly to the chassis, which is affixed to the enclosure?
  
  --
  Kid-proof tablet..
20. Re:High failure rate by jedidiah · 2016-08-02 17:00 · Score: 1
  
  ...so then you must obviously be able to point to some recent notorious models like the 1.5TB and the 3TB Barracudas. You should be able to do this off the top of your head. HELL, you should have included an example in your post.
  Although I suspect that you're just talking BS.
  I think my first batch of WD replacements are coming up on the age at which my last batch of Seagates started to become bothersome.
  
  --
  A Pirate and a Puritan look the same on a balance sheet.
21. Re:High failure rate by tlhIngan · 2016-08-02 18:01 · Score: 1
  
  Not exactly. Take a look at their web page and their "pod" design. They have jammed in drives where they will fit but they have very different loads to a normal data center so can get away with it most of the time. Unlike a normal data center they will never be running all the drives in a "pod" flat out so something that would be a smoking mess elsewhere merely has drives in the middle that cannot shed heat properly.
  That's not to excuse Seagate, it's just to point out why failure rates are higher than what the rest of us experience and probably with a different mode of failure as well.
  Backblaze is a cloud backup provider. They sell you online storage to backup your data, and I think they maybe one of those where you pay a fixed price for unlimited storage.
  The storage pods they use are designed for one thing - maximizing the bytes per cube unit of volume. So they're going to pack as many drives as possible. It's why they use SATA multiplexers - each multiplexer is limited to the upstream speed, so if you have a 3Gbps SATA uplink, you're limited to 3Gbps total speed even though you're driving 4+ drives. So its clear from this layout that throughput is not the key factor. Given I think their pods only have GigE interfaces, that too means it's not for general data center high-speed storage.
  But given their customers are limited in upload and download speeds, it's probably more than sufficient.
22. Re:High failure rate by dbIII · 2016-08-02 20:36 · Score: 1
  
  Indeed, as I could have told you or you could have learned from their website.
  I suggest you take a look at their description of how they pack their disks in and you will understand the heat issue I mentioned above.
23. Re:High failure rate by AmiMoJo · 2016-08-02 20:48 · Score: 1
  
  True, but Seagate are particularly bad as they have a history of releasing unreliable drives on a regular basis and then just endlessly swapping them for more unreliable drives until the warranty expires.
  For individuals (not datacentres) HGST is the best bet. As well as being generally very reliable they do proper testing and fix their problems. They might cost a bit more, but what's a few quid here and there to avoid all that hassle?
  
  --
  const int one = 65536; (Silvermoon, Texture.cs)
  SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
24. Re:High failure rate by Anonymous Coward · 2016-08-02 22:19 · Score: 0
  
  "A temperature-controlled environment, clean power, low shock and vibration"
  Why do you think they're low vibration? These are installed in custom designed pods that place large numbers of consumer drives very close together. This can lead to problems with heat (data room is cool, but there is localised hot spots), vibration, harmonics, resonance, and noise.
  Normally Backblaze doesn't see much difference between enterprise and consumer drivers, but in this case it looks like they ran out of luck and had a harmonics issue with this particular drive.
  You can't compare the failure rate in this use case with the failure rate for the intended desktop premium application.
25. Re:High failure rate by jabuzz · 2016-08-02 23:32 · Score: 1
  
  If you are doing a file server, then SATA multiplexers are more than adequate especially for what Backblaze are doing. Let put it this way 45 drives is 9 SATA multiplexers which at 3Gbps SATA is a total of 27Gbps throughput, more than enough to saturate two 10Gbps Ethernet links.
  However you can get 6Gbps SATA multiplexers these days, and Backblazes latest pods have 60 drives, so that is 72Gbps, which is nearly enough to saturate a couple of 40Gbps Ethernet links.
  People always and I mean *ALWAYS* overestimate what is required for throughput to file servers.
26. Re:High failure rate by Anonymous Coward · 2016-08-03 00:57 · Score: 0
  
  I guess you're never going to be impressed. The aggregate failure rate per year is 2%.
  Hard drives fail. That's why we have RAID, and backup data.
27. Re:High failure rate by drinkypoo · 2016-08-03 01:40 · Score: 1
  
  Don't you mean "damping"?
  That's one of my pet peeves too, but we've lost that war and now we'll never know for sure if people mean getting things soggy or cancelling oscillations.
  
  --
  "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
28. Re:High failure rate by Anonymous Coward · 2016-08-03 02:36 · Score: 0
  
  I also wonder if we'll ever get numbers from Backblaze on things like the actual temperature, decibels and power these drives lived through.
  Since the drives are installed into the same configuration these details are irrelevant. If the Seagates lived through higher temps, for example, it would only mean that the Seagates run hotter, not that they were put into a special situation.
29. Re:High failure rate by sglewis100 · 2016-08-03 03:38 · Score: 1
  
  I had one of those. 1 year warranty. And it died in 14 months.
  Thanks for this. I was just about to read the article, till I saw your comment that you had 1 and it failed in 14 months. Saved me all the trouble.
30. Re:High failure rate by brianwski · 2016-08-03 05:27 · Score: 1
  
  Brian from Backblaze here.
  
  > Perhaps they don't keep the temperature as cool as they should in order to save a few bucks?
  
  The colocation datacenter is SunGard in Rancho Cordova California and there are other tenants. I assume the temperature of the datacenter is industry standard? But even better, in the raw data dump it includes all the temperatures of all the hard drives, so you (or anybody) could check the correlation. We looked into it in 2014 and didn't find much correlation between temperature and hard drive failure as long as we kept the temperature of any one hard drive well below a tipping point (which we do). Here is the blog article and stats behind our analysis: https://www.backblaze.com/blog...
31. Re:High failure rate by macs4all · 2016-08-03 05:44 · Score: 1
  
  They are using consumer drives for data center needs, this is the big reason their failure rate is relatively high. Still, with the redundancy, it is cheaper to run this way. Rumor is that Google ran that way with off the shelf computers. Use dirt cheap commodity products that are good quality, have exceptional redundancy, throw them away as they implode.
  Yes, but it wasn't a rumor. They actually published a detailed analysis of their drive reliability a couple of years ago. And yes, they specifically said that they purchase commodity drives and simply replace as needed.
  
  FYI, they said there wasn't any specific brand they could point a finger at, good or bad, which I found utterly amazing.
  
  The Report also said there was little correlation between how hard the drives were worked, or even the temperature they ran at, and the individual drive failure rates.
32. Re:High failure rate by macs4all · 2016-08-03 05:47 · Score: 1
  
  If you wrote off every manufacturer that hit a 20% annualized failure rate you would now be unable to buy any drives.
  Except for HGST.
  
  So far, it looks like even WD has not been able to ruin their stellar Reliability.
  
  Yet...
33. Re:High failure rate by macs4all · 2016-08-03 05:50 · Score: 1
  
  For individuals (not datacentres) HGST is the best bet.
  After reading the reports of several Datacenters that have published same, and recommending HGST drives for anyone who asks, I am FIRMLY in the HGST camp!!!
34. Re:High failure rate by brianwski · 2016-08-03 05:59 · Score: 2
  
  Brian from Backblaze here.
  
  > I also wonder if we'll ever get numbers from Backblaze on things like the actual temperature ... power these drives lived through.
  
  The raw data dump includes drive temperatures as reported by "smartctl". You can find a dump here: https://www.backblaze.com/b2/h...
  
  We analyzed the failures correlated with temperature in this blog post in 2014: https://www.backblaze.com/blog...
  
  In a conversation with some of the Facebook Open Storage people, they said hard drives have increased failure rates at extremely high temperatures but our drives never get anywhere NEAR the temperatures required to cause failures. We monitor every drive for temperature, taking readings once every 2 minutes, and we have had situations where the drive temperatures caused our internal warning alerts to go off (well below those catastrophic levels Facebook saw failures at). When we go to investigate, the most common cause of rising pod drive temperature is that some of our fans in that pod have died. We used to have 6 gigantic fans to keep it cool, but we reduced it to 3 with no increase in drive temperature. If one of the fans dies it doesn't get warm enough to set off any alerts, but if 2 out of 3 fans die it can't move enough air to keep the pod within reasonable operating temperatures. We don't monitor the fans directly, but drive temperature has been such a good proxy for it we don't feel any pressing need to figure out how to monitor the fans.
35. Re:High failure rate by brianwski · 2016-08-03 06:19 · Score: 2
  
  Brian from Backblaze here.
  
  > I think their pods only have GigE interfaces
  
  Originally (up until 3 years ago) that was true, but all new pods have 10 GbE interfaces, and 100% of the pods in our "Backblaze 20 pod Vaults" have 10 GbE interfaces. And there are some really strange (and wonderful) performance twists on using 20 pods to store each file: when you fetch a 1 MByte file from a vault, we need 17 pods to respond each supplying only 60k bytes to reassemble the complete file from the Reed Solomon. So the actual bandwidth when fetching just one medium size file can reach more like 170 Gbit/sec theoretical bandwidth. However, if you tried to fetch ALL the files from a pod all at once, the raw 7200 RPM drive performance is our current limiting factor.
  
  Here is a link to a blog post on the 20 pod Backblaze Vault architecture: https://www.backblaze.com/blog...
  
  Here is a link to the Reed Solomon encoding we open sourced that we use on the 20 pod Vaults: https://www.backblaze.com/blog...
36. Re:High failure rate by stoatwblr · 2016-08-03 09:54 · Score: 1
  
  "Use dirt cheap commodity products that are good quality, have exceptional redundancy, throw them away as they implode."
  Quite simply, there are two philosophies to consider:
  One is to try and build everything as robust as possible and make it as reliable as possible at component/system level. (expensive hardware)
  The other is simply to say "Everything fails sooner or later. Design accordingly" (complex software)
  The best cost/benefit approach lies somewhere within the 2 extremes.
37. Re:High failure rate by stoatwblr · 2016-08-03 10:06 · Score: 1
  
  "Is there a real mechanical difference between "consumer" and "enterprise" drives, these days, at the bleeding edge of the storage-per-unit curve?"
  Mechanically, no. But enterprise drives frequently have better ECC/processors and are programmed to be allowed to draw more power, which translates to more force being able to be used in the arm and better tracking (in many cases the CPU and driver chips are the the same, it's just the power constraints which are changed in firmware, and they might have larger onboard caps.)
  That said, Seagate enterprise drives are just as trashy as their consumer cousins and have equally high failure rates - We had a 350% failure rate on our fleet of Seagate Constellations in raid arrays over their warranty period as one example.
  5400 vs 7200rpm only makes any real difference for sequential reads/writes. As soon as you start seeking there's virtually nothing to choose between them in terms of IOPs (it's usually something like 115 vs 125 IOPs at best)
  This must be getting close to the Last Hurrah for spinning drives in the enterprise, considering that read-optimised enterprise 4TB SSDs are now around 10x the price of 4TB consumer drives and still falling fast, with performance that blows spinning drives away no matter how you measure it.
38. Re:High failure rate by stoatwblr · 2016-08-03 10:15 · Score: 1
  
  You need to temper the HGST fanboiism with the knowledge that the chinese competition authority have finally allowed WD to merge HGST into the mothership.
  Whilst that means some HGST tech will show up in WD equipment, history shows that it is far more likely to mean that WD kit will start showing up with a HGST sticker on it. I regarded the $250k of HGST 4TB drives I purchased late last year as the last chance we had to obtain unsullied units.
  Seagate was allowed to merge in the Toshiba unit they've been forced to operate as a separate unit too, with the same caveats being applied to any preference for Toshiba based on past reliability.
  My personal opinion is that the chinese held off allowing those mergers (creating a world HDD duopoly in the process) until they decided SSDs are finally cheap enough to challenge them.
39. Re:High failure rate by stoatwblr · 2016-08-03 10:17 · Score: 1
  
  "Sorry to hear about your loss. I hope you kept backup copies"
  I haven't had a drive failure result in data loss in more than a decade. The backup tapes only tend to be pulled because someone deleted something critical and then went "oops"
  It's still annoying and time consuming to replace the things. I have better things to do and so do my staff.
40. Re:High failure rate by macs4all · 2016-08-03 12:04 · Score: 1
  
  Whilst that means some HGST tech will show up in WD equipment, history shows that it is far more likely to mean that WD kit will start showing up with a HGST sticker on it. I regarded the $250k of HGST 4TB drives I purchased late last year as the last chance we had to obtain unsullied units.
  I have been worried about exactly that thing with HGST. We shall see if their stellar reliability starts taking a dive, which would be bad (or WD reliability starts going up, which would be good).
  
  And as far as "fanboiism", I would say that someone who spent $250k on a bunch of rust on a stick, just to try and "beat the downslide" has some serious fanboi tendencies himself... ;-)
41. Re:High failure rate by ebvwfbw · 2016-08-03 13:57 · Score: 1
  
  Before you buy any new hard drive, check to see if the drive you're buying was made in a new facility. I used to sell drives by the pallet. Didn't matter who, Seagate, Maxtor, WD, etc, they would move their plant to some other country. For the next year usually they produced crap. Then they were reliable again.
  Some of my best drives are seagate.
42. Re:High failure rate by radarskiy · 2016-08-03 18:11 · Score: 1
  
  HGST come from the merger of Hitachi and IBM storage divisions. The HGST Deskstar comes from the IBM side and the Deskstar 75GXP model was notorious as the "Deathstar" for failures.
43. Re:High failure rate by drsmithy · 2016-08-03 20:30 · Score: 1
  
  Hi, Brian,
  What kind of sustained network Tx/Rx do you see to your nodes. Or - if you keep track of it - what sort of numbers do you see for disk utilisation (in terms of performance) ?
  I'm curious how hard those drives are working, in the context of the temperatures you see.
44. Re:High failure rate by adolf · 2016-08-03 21:44 · Score: 1
  
  So, to synopsize: Yep, things are the same -- mechanically. Therefore, they'll probably wear out at the same rate, whether "Enterprise" or "Consumer."
  I'll accept that enterprise drives are faster. I'll also accept that they're more demanding on their requisite (internal and external) power supplies, to keep the heat actuator moving as fast as is possible.
  And I'd like to suggest that the main difference there is indeed firmware: Seagate, at one point (around a decade ago, it seems), stopped allowing end-users to modify the acoustics of their drives. This used to be a common thing amongst the tweaker crowd: The silent PC folks would dial down the acceleration of the head stack, and the performance folks would go for full-loud.
  These used to be user-adjustable parameters inside Seagate hard drives.
  Obviously, with feedback, the more-silent settings draw less power and offer less performance. And the louder settings used more power, and offered greater performance. (Negative feedback is obviously at play here on all sides, as it is in any assembly involving an amplifier, a voice coil actuator, and a head which must be quickly and precisely positioned.)
  I also remember when there was a time when that "Enterprise" drives and "Consumer" drives were manufactured very differently. I think I still have some 4.5GB and 9GB IBM 9ES 7200RPM SCSI drives in a drawer, which I once paid dearly for, and toward which I'd bet a vacation on them still working fine.
  They're almost 20 years old, but things aren't expensive like those were at the time and perhaps it shows.
  I'm lead to wonder, then: Was the time that Seagate got rid of (or started ignoring) the acoustic adjustment parameters coincident with the same time when their drives became mechanically-identical?
  And therefore, can this not all be extrapolated to mean that there was a time when high-dollar spinning rust was actually generally more reliable than lower-dollar spinning rust? And can we assume that this time -- if it existed -- is past?
  
  --
  Kid-proof tablet..
45. Re:High failure rate by macs4all · 2016-08-03 21:45 · Score: 1
  
  HGST come from the merger of Hitachi and IBM storage divisions. The HGST Deskstar comes from the IBM side and the Deskstar 75GXP model was notorious as the "Deathstar" for failures.
  And then they obviously got their shit together...
46. Re:High failure rate by ilsaloving · 2016-08-04 02:20 · Score: 1
  
  Ditto here. I have had a number of seagate drives over the years. Way back when, Seagate was *it*. You simply didn't bother buying anything else. I don't know what happened, but their quality has gone through the floor so hard that it made a crater.
  All the seagate drives I've bought in the last decade are all long dead. External, internal, 3.5", 2.5", doesn't matter. They all died.
  The final straw was when I bought their hybrid drive. Died after 6 months. Had it RMA'ed. The new one died a month later. RMA'ed. The new one had bad sectors right out of the box. RMA'ed one more time. It died after maybe 6 weeks or so. I gave up at that point.
  Now I only buy Western Digital, and so far I've yet to have a failure among the dozen or so drives I've purchased.
47. Re:High failure rate by radarskiy · 2016-08-04 08:51 · Score: 1
  
  That being my entire point. You can't write off an entire storage division forever since they've all had problems at one time or another.
48. Re:High failure rate by toddestan · 2016-08-05 11:01 · Score: 1
  
  I have to echo that. I've found hard drives the past few years to be incredibly reliable compared to what they used to be. Even more so than SSD's, which is pretty remarkable as SSD's, in theory, should be very reliable. I've even given up on running RAID on my home systems as all the problems I experienced were due to the crappy built-in motherboard RAID solutions and never the drives themselves (yes, I could buy a proper RAID solution but in the end decided I didn't need it). I still back up to external drives as well as have my computers automatically back up to each other over the network, which I figure is good enough should I have a drive fail.
  Of course, I could be lucky. I did buy four of the infamous Seagate Barracuda 1.5TB drives from several years back. While I don't use them for anything important anymore (they are seven years old), I still have all of them, they all work, and have never given me any trouble.
49. Re:High failure rate by stoatwblr · 2016-08-08 05:58 · Score: 1
  
  "I've found hard drives the past few years to be incredibly reliable compared to what they used to be"
  Right up to the 2011 Thai floods. At that point reliabiilty plummeted and prices are still slightly higher than before that event.
  My point was actually that adequate RAID and backup policies mean losing a drive is no big deal.
  I would never _ever_ trust any device enough to keep the only copy of any data on it.
50. Re:High failure rate by stoatwblr · 2016-08-08 06:04 · Score: 1
  
  "Can this not all be extrapolated to mean that there was a time when high-dollar spinning rust was actually generally more reliable than lower-dollar spinning rust? "
  In brief: No.
  Enterprise drives used to be higher priced because they were individually tested, etc and had longer/better warranties with faster RMAs. In practice they failed more quickly than desktop drives, but that doesn't take into account any differences in loading/headseeking over that period.
Desktop drives by Doub · 2016-08-02 10:59 · Score: 1

It seems the ST8000DM002 is a desktop drive. I've had 3 of the Archive variants (ST8000AS002), and they all failed within a week of use. The third time I got a refund.
1. Re:Desktop drives by BenJeremy · 2016-08-02 11:26 · Score: 1
  
  Yeah, it will be a LONG time before I trust Seagate anything, after burning through every Seagate drive I've bought in the past 6 years. Every. Single. One. Not even heavy duty usage, some were just archival drives.
  Utter trash. Ever since they bought Maxtor... they took a terrible turn for the worse.
  I can't help but wonder if they saw an order for Backblaze... and said... gee guys, make sure QA sends only the most reliable bins to those guys.
2. Re:Desktop drives by Anonymous Coward · 2016-08-02 13:20 · Score: 0
  
  WTH!!, they bought Maxtor?!, THAT EXPLAINS EVERYTHING!!. People, don't ever touch a Seagate hdd ever again, NEVER, not even with a stick. Using a Maxtor drive was like playing russian roulette with a fully loaded semi-automatic gun in burst mode. Maxtor had 90% failure rate at best.
  This is their only legitimate use: https://www.youtube.com/watch?v=g3plw-oye90
Re:comment by cheater512 · 2016-08-02 11:13 · Score: 4, Insightful

If you've got 3,000 drives at home to come up with directly home applicable numbers, then please share them.
This is mostly useful to compare models vs models as the environment is kept the same.
It's completely legitimate to say model X is more reliable than model Y, it's not valid to say model X has a Z% failure rate in a home environment however.
22,858 and 44,000 days?!? by Linsaran · 2016-08-02 11:32 · Score: 4, Funny

I presume there's some detail I'm missing here since we did not have 8 TB hard drives 120 years ago.

--
In a bit of shameless internet panhandling, I accept Litecoin Donations at Lbd2oH9QsthD1GfuUXPyka12YxvWJYnBVf
1. Re:22,858 and 44,000 days?!? by sexconker · 2016-08-02 11:47 · Score: 2
  
  Drive days.
  1000 drives for 44 days each would get you 44000 drive days.
2. Re: 22,858 and 44,000 days?!? by Anonymous Coward · 2016-08-02 11:48 · Score: 0
  
  That figure is drives * operating time.
3. Re:22,858 and 44,000 days?!? by Anonymous Coward · 2016-08-02 11:51 · Score: 0
  
  10 drives at 20 days each = 200 operating days for that drive model. Yeah, it seems kind of fishy, but if you want actual operating hour analysis before you buy things, you'll never buy another drive with a warranty period remaining.
Re:comment by amicusNYCL · 2016-08-02 12:00 · Score: 1

I've got a cool story, bro. We put a NAS device online in our new data center with 4 Seagate Archive 8000 drives in it, and 2 of them died within 24 hours, trashing the RAID array. Thankfully, since it was a new NAS, it wasn't a big deal.

--
"Our two-party system is like a bowl of shit looking at itself in a mirror." - Lewis Black
"Drive days"?? by Anonymous Coward · 2016-08-02 12:04 · Score: 0

Who measures uptime in drive days? That is like saying man hours or multiplying uptime by the number of disks in an array. Where does it end?
Maybe I'm old but damnit I want min / max / avg - in DAYS, per drive. Can we please stop changing units on everything?
Unless we're talking years of uptime, these numbers don't come close to my desktop hard drives let alone something MADE FOR A DATA CENTER.
He He -- wait till the Helium leaks out by Anonymous Coward · 2016-08-02 12:06 · Score: 0

Anyone who uses a Helium filled Hard drive won't want to be reminded what happens when the Helium leaks out ...
Re:comment by Anonymous Coward · 2016-08-02 12:17 · Score: 0

If you've got 3,000 drives at home to come up with directly home applicable numbers, then please share them.
This is mostly useful to compare models vs models as the environment is kept the same.
It's completely legitimate to say model X is more reliable than model Y, it's not valid to say model X has a Z% failure rate in a home environment however.
I would agree with you. The environment is the same for all devices. The sample sizes vary, but I think the useful numbers come from the very large sample sizes.
It's interesting to note that Backblaze tracks failures across a specific model, but they do not provide numbers on failures within a specific model. For example, did a large number of devices fail and used the same firmware? or fell within the same manufacturing date range? or came from the same country or specific serial number range? Another way of saying that is: Can Backblaze do a better job of isolating their failures down to a more specifc identifier??? Some of the needed info is actually printed on the drive labels and not accessible via SMART, like manufacturing date & country of origin.
I noticed that Backblaze uses some WDC RED drives. I cringed when I read that. I believe that WDC designs RED drives from 1 to 4TB for systems with 5 drives or less. I am guessing here on that design limit. It must be "common sense" to expect failures from those drives when used in high density platforms like the pods at Backblaze. I mean using a device in an environment for which it was not designed by the manufacturer is like expecting a Ferrari 308 to be a great "off road" race car for the Baja 1000!!!
As an owner of 30+ RED drives in that size range, and all installed in homemade NAS systems of 5 drives or less, (knocks on wood...many times) my own experience has been extremely positive: no failures across 10,000 hours per drive. I have at least 300,000 hours of drive "uptime" without a failure for WDC RED 3 & 4 TB drives, but I do have spares stored..."just in case".
Here's what I find is important for getting the longest life out of a hard drive: (1) extremely secure and stable hard drive mountings; (2) all of my systems are on "prosumer" or "business" branded UPS systems (Eaton, in my opinion, is one of the best UPS out there); (3) OS--NAS integration (Linux is a great OS) such that NAS goes into automatic "safe" shutdown when the UPS runs on batteries for more than 5 minutes; (4) use drives designed for NAS or RAID usage (likely to have appropriate TLER tweaks in firmware); (5) check all drive firmware and burn-in all drives before placing them into "service" (avoids many premature failures and problems due to shipping).
I wonder if some of Backblaze's issues with WDC 4 TB and below drives was a lack of firmware upgrades. I distinctly remember there being a firmware upgrade that was not needed by my 4TB REDs but a number of my older 3TB REDs needed that upgrade. I forgot what WDC changed in that firmware upgrade, but I remember thinking it was necessary to do the upgrade work based on reading the "sparse" release notes from WDC that I could find.
Riiiiiight. by TheRealQuestor · 2016-08-02 12:36 · Score: 1

Come back in 3 or 5 years and tell me out of all the 8TB sold in 2016/2017 just how many are still functional and THEN what the failure rate is/was.

My "prediction" is it will most likely be that there is an 70% failure rate with Seagate being the top offender.
1. Re:Riiiiiight. by Anonymous Coward · 2016-08-02 13:23 · Score: 3, Insightful
  
  Come back in 3 or 5 years and tell me out of all the 8TB sold in 2016/2017 just how many are still functional and THEN what the failure rate is/was.
  My "prediction" is it will most likely be that there is an 70% failure rate with Seagate being the top offender.
  By then the data is worthless to anybody except the manufacturer. We necessarily have to accept a deficit of statistical quality to make forward predictions that are actually worth something, like knowing if I'm building a SAN, what drives I should buy.
  In 5 years, I'm not going to be buying 8TB drives, so knowing what the failure rate for some 8TB drive was is inconsequential. Either HDDs continue to improve and I buy 32TB or larger HDDs, or they don't, and I'll be filling my SAN with 8TB or larger SSDs, Xpoint memory, memristor, who knows.
  I'm looking at this data and it's informing me that I ought to be buying HGST drives, and that I made a mistake installing 3TB Seagate drives (though the drives tested are not the capacity or exact model ones I have), and that as they begin to fail, I would be better to replace them with 4TB HGST.
  I don't really care what happens to my drives 5 years out, they'll probably be replaced with higher capacity stock whether they start to fail or not. If my capacity needs to grow, I can buy new JBOD cards, a bigger mainboard to accomodate the extra channels, more JBOD trays, more racks, upgrade the AC, and pay a higher power bill -or- I add an extra slice to a mirror with a 2x density drive, resilver, replace one of the old drives with a 2x density drive, resilver, and continue until all the drives in the mirror have been 2x'd, rinse and repeat gradually throughout the array until sufficient capacity is reached. The cost works out lower. Sure the bigger drives are more expensive than the cheap drives and I only get the incremental value, but the balance of costs is such, that it's still cheaper than endlessly growing JBODs.
  The cost is lower still, considering that the drives are bought according to schedule, and when failures occur, the replacements are the larger capacity according to our schedule, and drives removed from mirrors due to capacity upgrades are put in the hotspare pool, ready to repair older, unupgraded mirrors.
  All 2TB drives are outgoing at the end of this month, 6TB drives are incoming for replacements and capacity growth.
2. Re:Riiiiiight. by dbIII · 2016-08-02 13:51 · Score: 1
  
  I've got a large cardboard box full of dead Hitachi drives for some reason, along with a smaller number of dead Seagate drives. I've stopped buying both. Maybe I just have less dead Seagates due to being turned off that brand earlier.
Re:comment by __aaclcg7560 · 2016-08-02 12:57 · Score: 1

I believe that WDC designs RED drives from 1 to 4TB for systems with 5 drives or less.
WD RED drives are available in 6TB and 8TB. Regular drives are 5400RPMs with 64MB cache. Pro drives are 7200RPMS with 128GB cache.
Re:comment by Anonymous Coward · 2016-08-02 13:24 · Score: 1

The 8TB Archive drives with shingled recording are not suitable for RAID arrays.
If it's working for them by dbIII · 2016-08-02 13:47 · Score: 4, Insightful

If it's working for them in their packed in boxes with crap airflow and really poor heat transfer then it will work even better in conventional file servers with hot swap drives at the front and a heap of airflow.

Take it with a grain of salt when Backblaze say a drive is crap since it may only be crap in their very hostile environment, but if they didn't break it then it's very likely to work well anywhere.
1. Re:If it's working for them by umafuckit · 2016-08-02 15:21 · Score: 1
  
  Take it with a grain of salt when Backblaze say a drive is crap since it may only be crap in their very hostile environment, but if they didn't break it then it's very likely to work well anywhere.
  What's the typical drive temperature in Backblaze's cases in their environment?
  
  --
  soylentnews.org
2. Re:If it's working for them by dbIII · 2016-08-02 16:06 · Score: 1
  
  What's the typical drive temperature in Backblaze's cases in their environment?
  They are not saying apart from an entirely useless average for machines that are idle a lot of the time with drives spun down. I'm not entirely sure they know or care what their maximums are and how long drives are hot for.
  I suggest taking a look at their web pages that describe their pod designs to get a better idea of the situation instead of just taking my word that they shove drives in wherever they will fit without taking heat into account. Given their workflow that's not as stupid as it sounds but there are going to be some times when some drives get very hot due to a lack of airflow.
3. Re:If it's working for them by bobloblaw54 · 2016-08-02 17:47 · Score: 1
  
  They implement a new storage pod and it is filled with user data for a week until it reaches capacity then the disks sit idle until a user needs a backup. So basically the hard drives environment my be "hostile" for a week but then they basically sit until they die. They do not quantify what is considered a failure if its a read write error or bad sectors or a smart failure its a different story each way.
4. Re:If it's working for them by Anonymous Coward · 2016-08-02 20:01 · Score: 0
  
  Actually, having the disks at the back is more reliable. As the temperature doesn't really impact the life time too much. It's humidity that kills drives especially good.
  http://www.datacenterdynamics....
5. Re:If it's working for them by dbIII · 2016-08-02 20:40 · Score: 1
  
  So basically the hard drives environment my be "hostile" for a week
  
  It's those temperature excursions beyond the design limits that matter and not if the average is 22C instead of 20C over months. So being hostile for a week is somewhat obviously (a lot more than IMHO) likely to be why they have extremely high failure rates across the full range of brands and models (with some much higher than others but all higher than others seem to experience).
6. Re:If it's working for them by dbIII · 2016-08-02 20:45 · Score: 1
  
  They have the disks at the back, the front, the top, the bottom, the middle. The ones right in the middle of the mass would get quickly cooked under the sort of loads seen elsewhere but they distribute the writes over several servers at once.
  
  the temperature doesn't really impact the life time too much
  Temperatures beyond design limits do. I've seen it several times especially back when WD drives ran very hot and some people had inadequate server cases and/or no alarms when fans died.
7. Re:If it's working for them by macs4all · 2016-08-03 05:59 · Score: 1
  
  So basically the hard drives environment my be "hostile" for a week
  It's those temperature excursions beyond the design limits that matter and not if the average is 22C instead of 20C over months. So being hostile for a week is somewhat obviously (a lot more than IMHO) likely to be why they have extremely high failure rates across the full range of brands and models (with some much higher than others but all higher than others seem to experience).
  Within reason, most reliability studies on electronics overall curiously do not equate temperature with average failure rates. It's kind of one of those "Of course higher temps make things fail faster" common-sense things that eventually just became "accepted as true". But statistically, that doesn't really happen.
  
  Now, having said that, spinning-rust HDDs are electro-MECHANICAL devices; so perhaps temperature has more of an effect of reliability than with a purely electronic devices...
8. Re:If it's working for them by brianwski · 2016-08-03 08:12 · Score: 1
  
  Brian from Backblaze here.
  
  > What's the typical drive temperature in Backblaze's cases in their environment?
  
  Short answer: the coolest drives are 21.92 Celcius and the hottest drive was 30.54 degrees.
  
  I wrote this up above in response to a temperature question, copy and pasted here. The raw data dump from Backblaze includes drive temperatures as reported by "smartctl". You can find a complete set of historical data of all drive temperatures in the Backblaze datacenter here: https://www.backblaze.com/b2/h...
  
  We analyzed the failures correlated with temperature in this blog post in 2014: https://www.backblaze.com/blog...
  
  In a conversation with some of the Facebook Open Storage people, they said hard drives have increased failure rates at extremely high temperatures (somewhere up near 40 degrees Celcius) but our drives never get anywhere NEAR the temperatures required to correlate with failures. We monitor every drive for temperature, taking readings once every 2 minutes, and we have had situations where the drive temperatures caused our internal warning alerts to go off (well below those catastrophic levels Facebook saw failures at). When we go to investigate, the most common cause of rising pod drive temperature is that some of our fans in that pod have died. We used to have 6 gigantic fans to keep it cool, but we reduced it to 3 with no increase in drive temperature. If one of the fans dies it doesn't get warm enough to set off any alerts, but if 2 out of 3 fans die it can't move enough air to keep the pod within reasonable operating temperatures. We don't monitor the fans directly, but drive temperature has been such a good proxy for it we don't feel any pressing need to figure out how to monitor the fans.
9. Re:If it's working for them by brianwski · 2016-08-03 08:17 · Score: 1
  
  Brian from Backblaze here.
  
  > most reliability studies on electronics overall curiously do not equate temperature with average failure rates.
  
  Backblaze looked into it in 2014 and we found no correlation: https://www.backblaze.com/blog...
  
  In a conversation with some of the Facebook Open Storage people, they had seen increased failure rates at extremely high temperatures (somewhere up near 40 degrees Celsius) but our drives never get anywhere NEAR the temperatures required to correlate with failures. We monitor every drive for temperature, taking readings once every 2 minutes, and in all but a few unusual conditions (such as some fans have failed) most drives are really running cool at around 25 degrees Celsius.
10. Re:If it's working for them by stoatwblr · 2016-08-03 10:28 · Score: 2
  
  "Short answer: the coolest drives are 21.92 Celcius and the hottest drive was 30.54 degrees."
  Based on the Google stats from a few years ago it was pretty clear that drive temperature was only a problem above 55C
  I target 45C as allowable maximum and 35C as normal with no apparent increase in mortality over colder temperatures, but that saves a lot in terms of running the cooling plant. The batches of Seagates Constellations we had with stupidly high failure rates ran well under 30C
  For home use my fileserver's drives have peaked out at 50C in hot weather (AC is rare in a UK house) and those drives currently have 42-48,000 hours on them (5-6 years) so it doesn't seem to have affected their reliability. OTOH Seagate ST2000DM drives in the same fileserver lasted less than 9 months (The DL series tended to run for 3 years before failing)
  Based on years of experience the best advice for RAID work I can offer is "Don't use a raid composed of the same make/model drives and if you must do that, then FFS try to ensure the drives come from different batches. Otherwise one drive failure is the only warning you have of impending array doom" (RAIDZ3 is good, but multiple drive failures within the same batch is still a high risk thing)
11. Re:If it's working for them by dbIII · 2016-08-03 13:36 · Score: 1
  
  Within reason, most reliability studies on electronics overall curiously do not equate temperature with average failure rates
  Seriously? Where did you hear that or did you make it up?
  
  "Of course higher temps make things fail faster" common-sense things
  It comes from temperature versus resistance charts, thermal expansion, lubricant breakdown at temperature (a very bit problem resulting in seized disks in the 1990s) and statistical analysis and not "common sense".
12. Re:If it's working for them by dbIII · 2016-08-03 13:51 · Score: 1
  
  With respect Brian, I've read that some time ago (and again to make sure) and the temperatures quoted are averages for disks that have a lot of idle time so are somewhat useless information when failure is going to be due to temperature excursions. Maximums would tell us something but are not in that blog.
  I've seen a lot of failures in the past in poorly designed cases with drives running hot back when the WD drives were especially hot. It's not so common now but apart from the lubricants used being better able to handle heat there are still plenty of things that go wrong when a drive is operated beyond it's design maximum. With different materials expanding at different rates it should be no shock that something with very tight tolerances fails at high temperatures.
  
  There was a lot of work done on hard disk failures due to heat in the 1990s published in some tribology journals but I'm not sure what is available online, I've been out of materials science for more than a decade and do stuff with computers now.
13. Re:If it's working for them by macs4all · 2016-08-03 16:48 · Score: 1
  
  Within reason, most reliability studies on electronics overall curiously do not equate temperature with average failure rates
  Seriously? Where did you hear that or did you make it up?
  
  "Of course higher temps make things fail faster" common-sense things
  It comes from temperature versus resistance charts, thermal expansion, lubricant breakdown at temperature (a very bit problem resulting in seized disks in the 1990s) and statistical analysis and not "common sense".
  My boss at the time, the Director of Engineering at the place where I was employed as an embedded designer/developer, is who told me about some stuff he had read that dispelled the common-sense "Logic" equating elevated temperature (but still within specs) with accelerated failure in electronics. Notice I said "electronics", not "mechanicals". So, before HDDs all got equipped with essentially frictionless and wear-free fluid bearings, heat probably was an issue with lubricant migration. But now, not so much.
14. Re:If it's working for them by dbIII · 2016-08-03 18:16 · Score: 1
  
  Take a look at charts for semiconductor resistance versus temperature from a first year engineering materials science text to get a greater insight than the one you described above.
  
  but still within specs
  I'm pretty sure the drives in the middle of the pods are going to go well above design temperatures at times.
  Also your mechanical utopia has with hard disks has not arrived, far from it. Different rates of expansion with different materials are one way to ruin your day when drives get hot.
15. Re:If it's working for them by macs4all · 2016-08-03 21:52 · Score: 1
  
  Take a look at charts for semiconductor resistance versus temperature from a first year engineering materials science text to get a greater insight than the one you described above.
  What? Are you concerned with Thermal Runaway? In digital circuitry, or even analog circuitry employing silicon semiconductors with temperature-compensating biasing circuitry?
  
  Well, alrighty then!
Re: comment by Stewie241 · 2016-08-02 15:03 · Score: 1

Is that a typo or is there really 2000 times the cache?
Re:comment by lgw · 2016-08-02 15:29 · Score: 1

You wot mate?
Seagate Archive drives are designed for cold storage, as they say 6 times on their web page for the drive. If you don't know what "cold storage" means, it means "not RAID".
So, you build a RAID array out of drives designed for "not RAID", and they started failing on you. And this is somehow Seagate's fault? The mind boggles.

--
Socialism: a lie told by totalitarians and believed by fools.
Re: comment by __aaclcg7560 · 2016-08-02 15:34 · Score: 1

Yes, a typo. But those pro drives are really expensive.
Re:comment by jedidiah · 2016-08-02 16:51 · Score: 1

Drives of that size are no longer limited to gimped "archival" roles.
On the one hand, a drive is probably likely to be more reliable when you pamper it and don't really do much with it. On the other hand, I've had plenty of Seagates fail in just that kind of use case.
Gimped archive disks? Who cares if they are reliable or not?

--
A Pirate and a Puritan look the same on a balance sheet.
Did they really have 8TB drives 62 years ago? by Anonymous Coward · 2016-08-02 17:25 · Score: 0

> The HGST HDS5C8080ALE600 worked for 22,858 days and only saw two failures
The Maths on this summary are worded really strangely. I am guessing they mean N drives for X days = 22,858 days, but it sounds like their 8TB drives were powered on 62 years ago and ran great except for two that died from the Germans bombing their data center in WW II. Oh wait, that was 72 years ago, so it must have been the 4 TB drives that died. My mistake. Really though, 22,858 days needs to be phrased like "man-hours", "man-years", etc. How about 22,858 unit-days?
1. Re:Did they really have 8TB drives 62 years ago? by ihtoit · 2016-08-02 18:02 · Score: 2
  
  yes it would be unit-days, as in nX=22858 so each drive in the array (n) had an uptime of X=22858/n. We know what n is. It's 45. Therefore, X=22858/45=~508 days. The stated MTBF of the HGST Enterprise-class drives is 2.5 million hours. That would put the expected array failure rate at 2,314 days (2.5mill. divided by array size).
  So don't be impressed, this is actually a failure report.
  
  --
  Political debates have me rolling my eyes so much I think I got optical whiplash. I should sue. - Foamy The Squirrel
80GB drives also reliable by ihtoit · 2016-08-02 17:53 · Score: 1

I had a 1st Gen Seagate 80GB SATA fail last month after 11 years and change, of 24/7 daily operation and very few power-off cycles.

--
Political debates have me rolling my eyes so much I think I got optical whiplash. I should sue. - Foamy The Squirrel
1. Re:80GB drives also reliable by macs4all · 2016-08-03 06:02 · Score: 1
  
  I had a 1st Gen Seagate 80GB SATA fail last month after 11 years and change, of 24/7 daily operation and very few power-off cycles.
  Yeah, power cycling seems to be the worst killer of HDDs, followed by high humidity.
2. Re:80GB drives also reliable by ilsaloving · 2016-08-04 02:21 · Score: 1
  
  Ah yes, the good old days when Seagate actually put effort into quality. You'll *never* see that kind of lifetime on their current drives.
Seagate failures... by Anonymous Coward · 2016-08-02 18:31 · Score: 0

are directly attributable to the controller boards. I have a 3TB from one of their USB drives that failed on me (It would just randomly start writing garbage to the drive, low level format would fix it, but reading back after would return garbage.) Another is a 500GB still in service that will start throwing read errors when the temp sensor reads around 45-50C (common during the summer here if the cooling is impeded the drive will heat up and begin malfunctioning, requiring both a cool off period and a full power-cycle to come back up properly. Oftentimes it results in data corruption across much of the directory tree resulting in most of the file system ending up in the lost+found folder, and a reinstall being needed to fix the system.
That particular system has a mix of third party drives in it, none of which see failures except for the seagates. (A diff seagate drive with ext2 filesystems does not see similiar corruption, although it is also accessed less frequently.)
Contrasting anecdote by billcopc · 2016-08-02 18:57 · Score: 4, Interesting

I'm an independent white-box NAS guy, and with the exception of the truly awful 1.5TB Seagate drives from 2008-2009 or so, I have not had any significant problems with them. I've got a few thousand 3 to 8 TB drives deployed with my clients, most of them cheap consumer drives (not even the "NAS" editions), and the annual failure rate is roughly 2% across all brands. This has been consistent for many years and I factor these stats into my costs and warranty projections. I have
The thing that bothers me about Backblaze, and the reason why I have a very hard time taking their results seriously, is the way they design their pods. They take a custom fabbed chassis, then fill it with the most ghetto components known to man: SATA port multipliers, ultra-low-end HBAs, dual "gamer" power supplies, very substandard cooling, and until recently they used super sketchy desktop boards. It's only last year that they finally changed the board for a Supermicro, primarily to get 10GbE very cheaply. For that same money, you can buy a ready-made 60-bay Supermicro chassis with redundant power and SAS - and a warranty. Hell, I bet SM would deliver directly to Backblaze's doorstep *and* give them a friendly discount.
Anyway... epic digression aside, when people ask me which brand is better, I tell them to buy whichever has the best warranty. A hard drive *will* die, the question is when, so the only logical course of action is to plan around its inevitable demise by keeping backups and redundancies, and learning the ins and outs of the RMA process.

--
-Billco, Fnarg.com
1. Re:Contrasting anecdote by Anonymous Coward · 2016-08-02 21:43 · Score: 0
  
  backblaze also RMAs the drives, even desktop drives come with warranty. They just RMA a lot of them. Still cheaper than a server drive.
  Also their workload is much different from a normal server farm, most of their drives are idle (spinned down) most of the time, which is actually more like a desktop workload. Since only one drive in a pod is active at once, it can use the metal of the other drives as cooling.
2. Re:Contrasting anecdote by TheDarkMaster · 2016-08-02 23:49 · Score: 1
  
  Well, think of it this way ... If the hard drive can survive in the environment provided by Backblaze, then they will certainly do better in a home computer properly built and will last longer than the hard drives that fail prematurely in the Backblaze environment. There is nothing better to test if a hardware is weak than to put it in a hostile environment and see what happens
  
  --
  Religion: The greatest weapon of mass destruction of all time
3. Re:Contrasting anecdote by goarilla · 2016-08-03 00:59 · Score: 1
  
  The thing that bothers me about Backblaze, and the reason why I have a very hard time taking their results seriously, is the way they design their pods. They take a custom fabbed chassis, then fill it with the most ghetto components known to man: SATA port multipliers, ultra-low-end HBAs, dual "gamer" power supplies, very substandard cooling, and until recently they used super sketchy desktop boards. It's only last year that they finally changed the board for a Supermicro, primarily to get 10GbE very cheaply. For that same money, you can buy a ready-made 60-bay Supermicro chassis with redundant power and SAS - and a warranty. Hell, I bet SM would deliver directly to Backblaze's doorstep *and* give them a friendly discount.
  
  I agree completely, especially about all the marvell crap, but what's so wrong about a gamer PSU.
  What besides form factor makes a zippy PSU that much better ?
4. Re:Contrasting anecdote by brianwski · 2016-08-03 08:30 · Score: 1
  
  Brian from Backblaze here.
  
  > If the hard drive can survive in the environment provided by Backblaze, then they will certainly do better in a home computer properly built
  
  I suppose it matters. The pods are in a professional datacenter with air filters and sticky paper that we step on before entering the clean datacenter. When we open pods in the datacenter they are NEVER filled with dust bunnies. But when I open up my "properly built" gaming computer at home there are ALWAYS dust bunnies, air in homes with pets and carpets is simply going to have some dust.
  
  Backblaze also monitors everything and fixes every problem, a home computer usually monitors nothing. Recently I was editing video on my laptop and it just shut off (I lost 10 minutes of edits). Turns out it was overheating due to a bad fan, but nothing WARNED me about this so I was subjecting all the components in the laptop to dangerously high levels of heat before the CPU shut down to protect itself. That won't happen in the Backblaze datacenter where we monitor everything, including the temperature of every last one of the 68,813 drives and go fix it when they deviate from normal for any reason.
  
  One of the main things Backblaze does which may or may not occur in a home office is that we do leave the drives powered up 24 hours a day, 7 days a week. If you leave your computer shut down half the time, there may be situations where that extends it's life in your home. For example, if the drives bearings inherently are built for 2 years of continuous run time and you only have the computer turned on 1/2 the time then your drive will last 4 years at home and only 2 years in the Backblaze datacenter.
5. Re:Contrasting anecdote by stoatwblr · 2016-08-03 10:49 · Score: 1
  
  " Turns out it was overheating due to a bad fan, but nothing WARNED me about this"
  I have specific daemons on my linux laptop to warn of overheating CPUs. Temperature monitoring in Windows is available but virtually noone makes use of it.
6. Re:Contrasting anecdote by heson · 2016-08-06 20:08 · Score: 1
  
  Warranty is only used as an indicator on how the producer values their product. We have stopped doing RMA on failed disks, as the refurbished ones we get back are crap and usually fails within a year (often within a week), with the work that causes and considering the cost of labor, it is cheaper to buy new drives, and scrap the broken ones.
Every time they post new stats by Anonymous Coward · 2016-08-02 19:12 · Score: 0

I end up reading the EXACT same comments and arguments.
What I've learned... by adolf · 2016-08-02 20:01 · Score: 3, Insightful

What I've learned from reading the comments here is that people are just as clueless when it comes to storage reliability as they ever were, and are just as capable of throwing the baby out with the bathwater as at any other time.
Dear Slashdot: Never change.

--
Kid-proof tablet..
Exhibit A: OP by Anonymous Coward · 2016-08-02 22:12 · Score: 0

If ever you thought that nerds were more scientifically minded than others, just ask them about their hard-drive preferences and watch them wax anecdotal.
By far the most used drive in the above report is the 4GB Seagate ST4000DM000: ~34,000 drives with a 2.7% failure rate. Two Toshiba's and one WDC show failure rates of nearly 9%. HGST is the only manufacturer with consistently sub-5% failure rates.
Re:comment by goarilla · 2016-08-02 23:11 · Score: 1

So, you build a RAID array out of drives designed for "not RAID", and they started failing on you. And this is somehow Seagate's fault? The mind boggles.

Aren't we all building raid arrays with non-raid disks for SMB and home use nowadays anyway ?
If you're being pedantic and take RAID to mean "Redundant Array of Inexpensive Disks" then the Seagate Archive is in fact the most ideal candidate.
I wouldn't though but that doens't mean he shouldn't.
Re: comment by Chas · 2016-08-02 23:21 · Score: 1

No. The archive disks aren't designed for a long duty cycle. They are meant to have data dumped onto them and then work as a read only disk.
The constant usage of a RAID array will cause drive failure via thermally induced URE in short order

--

Chas - The one, the only.
THANK GOD!!!
consumer vs enterprise by Anonymous Coward · 2016-08-02 23:45 · Score: 0

The enterprise drives always seem heavier to me.
1. Re:consumer vs enterprise by Anonymous Coward · 2016-08-03 02:19 · Score: 0
  
  It's the photon torpedoes - and on the later models, the damned family quarters.
Re:comment by Richard_at_work · 2016-08-02 23:45 · Score: 1

Drives designed for RAID use typically have different firmware which react differently to issues - RAID friendly drives react quicker to failures, meaning they are less likely to fail the RAID over correctable errors. Put a drive not intended for RAID use in an array and you will see more failures over drive level correctable errors.
Archival disks are one of those drives you will see this issue with.
The biggest drives always are by bravecanadian · 2016-08-03 01:40 · Score: 2

the most unreliable.
That is why you buy in the sweet spot for best value and let someone else prove new technologies and HD densities for you..
1. Re:The biggest drives always are by thegarbz · 2016-08-03 02:57 · Score: 1
  
  Yeah I always shop for best value. So I now have 8TB drives in my system.
  Oh what you didn't realise that 8TB SMR drives were the cheapest per megabyte before posting?
2. Re:The biggest drives always are by sims+2 · 2016-08-03 07:08 · Score: 1
  
  Got any in a NVR? I've been switching out 6TB WD Purple drives with 8TB HGST enterprise drives.
  I've heard that SMR drives don't handle 24/7 constant writes well.
  
  --
  Minimum threshold fixed. Thanks!
3. Re: The biggest drives always are by thegarbz · 2016-08-04 03:25 · Score: 1
  
  You heard right. They act quite a bit like SSDs and have periods where write speed stops of dramatically. Great storage pretty buck for a backup or to keep your data on, just don't use it for your OS paging or any other heavy workload.
4. Re: The biggest drives always are by thegarbz · 2016-08-04 03:27 · Score: 1
  
  Dick me autocorrect fail. Again with proofreading:
  You heard right. They act quite a bit like SSDs and have periods where write speed drops off dramatically. Great storage per buck for a backup or to keep your data on, just don't use it for your OS, paging, or any other heavy workload.
Recover This! by Anonymous Coward · 2016-08-03 01:44 · Score: 0

My local recycler has two machines that shred drives like a paper shredder. It doesn't matter what type of drive. You pour drives in the top and flakes of metal fall out the bottom within 3 seconds. Not even SSD chips survive these machines. Encryption, overwriting, drilling, are all pointless wastes of time. Shred em and forget about it.
When I perform a new system roll out and old systems are being recycled, I pop out the drives and the recycler shreds them en mass right in front of me. 50 drives takes ~5 minutes.
The machines are ridiculously simple so when the recycler told me that they cost $15,000 to $20,000 each I didn't believe him. But I later verified it to be true. Even the cheap little portable "wood splitter" type device that presses and splits two or three drives at a time is over $4,000
Re:comment by lgw · 2016-08-03 02:15 · Score: 1

SMR drives are different - the S is for Shingled. It's an oddball recording technology that requires an entire track be written to change any block. They're really the worst choice for random write patterns.

--
Socialism: a lie told by totalitarians and believed by fools.
The American brands are lowest quality by Anonymous Coward · 2016-08-03 02:26 · Score: 0

and the Japanese brands are highest quality, it's just like their previous reports. HGST, while being owned by an American company, is still doing all their research and development in Japan, by Japanese engineers, and this is likely the key difference that makes the HGST drives come out so far above the American drives.
Re:comment by Anonymous Coward · 2016-08-03 02:27 · Score: 0

Uhm... No. SMR drives only have very huge sector size, as far as the filesystem is concerned. If anything, the performance would be incredibly bad, but no reason to fail. Not within 24 hrs.
Seagate haven't bought one in 20 years by mandy2tom · 2016-08-03 04:28 · Score: 1

Why would anybody buy a spinning hard drive today so slow and solid-state drives are so cheap . I have a terabyte SSD drive in the new MacBook has read and write speeds close to a gigabyte a sec I have an eight-year-old MacBook, that before I swapped the hard drives for ssd had a 22 MB a second read write speed. Samsung has dominated the market micron might have a chance but anybody making spinning hard drive is wasting time.it's time the world implements a lifecycle tax. Meaning if the product last hundred years there's no tax if it lasts one day its thousand percent we don't need anymore happy meal toys
nope. by Anonymous Coward · 2016-08-03 05:13 · Score: 0

These are the same guys that claim HGST hard drives are reliable. Yeah, no thanks!
Re:comment by Bengie · 2016-08-03 05:17 · Score: 1

OpenZFS has been working to become aware of shingled storage. The CoW nature of ZFS already plays well with shingled recording, but it will become much better once the FS is aware of the layouts. In theory it's not much work, in practice, it's a lot of refactoring.
Re:comment by Bengie · 2016-08-03 05:22 · Score: 1

To change any earlier block. Changing earlier data requires later data to be re-written because the write head is wider than the read head. As long as you append data, you're fine. There in lies the rub. How do you know if you're near the front or back of a shingled region? If it's always per track, then that information is available. Even then, most/all file systems don't care. OpenZFS will care in the future. CoW nature plays well with being able to almost always append to these regions, reducing the amount of re-writing.
Re:comment by amicusNYCL · 2016-08-03 05:42 · Score: 1

If you don't know what "cold storage" means, it means "not RAID".
Well, it would be fantastic if they would mention that in one place or another. Instead I get lines like this:

Enjoy peace of mind with a drive engineered for 24×7 workloads of 180TB per year
Store your data faster with a SATA 6Gb/s interface that optimizes burst performance
Have confidence with a drive that provides reliable, low-power data retrieval based on Shingled Magnetic Recording (SMR) technology
Yes, this sounds like exactly what I'm looking for when I store my backups. I don't need to write to them often, really only once a day. Reliable retrieval sure would be nice though.

learn how these affordable, high density drives can meet your needs for long-term, cold storage that's quickly and readily available online.
That's great, I would like my long-term backups quickly and readily available online when needed.

Seagate Archive HDD has won the "Product Award of 2015" in the 3.5" segment by Kakaku.com
Ah, sounds like they thought it was the best 3.5" drive.

...one of the best all around hard drives on the market.
That's good, I'd like one of the best all around hard drives on the market.

Seagate Archive HDD 8TB: A lot of TBs for a relatively small investment.
Yes, I need a lot of TBs.

The Seagate Archive HDD 8TB is a high capacity, energy efficient, and lower cost hard drive for active archive purposes.
Active archive sounds like backup storage. This must be for me.

The drives are intended for use in large-scale data centers where density, power consumption, data integrity and data retrieval are paramount.
That's good, because I'm going to put these in a large-scale data center where data integrity and data retrieval are paramount.

Best fit applications:
Online archiving
Large data object storage
Big data cold storage
Cloud active archive
Web-scale archiving
All of those buzzwords sure sound similar to "where you put your backups".

Delivering absolute lowest cost/TB along with the performance and reliability required for massive scale applications, the new 8TB HDD is ideal for meeting the needs of our enterprise and service provider customers who demand optimized hardware and the cost structure needed for massive scale out.
Yes, massive scale, like you would find in a redundant array of disks.
But, if I pull up the data sheet, then it includes this footnote which is missing from the same section on the web page:

Archive HDDs are not intended for surveillance or NAS applications, and you may experience lower performance in these environments.

By "may experience lower performance", I'm guessing they mean that if I put these in a RAID array and point my servers there as a backup location, then I can expect a 50% failure rate in 24 hours.

--
"Our two-party system is like a bowl of shit looking at itself in a mirror." - Lewis Black
Re:comment by stoatwblr · 2016-08-03 09:44 · Score: 1

"RAID friendly drives react quicker to failures, meaning they are less likely to fail the RAID over correctable error"
Your're referring to TLER - which used to be a tunable value until Seagate/WD started using it to differentiate enterprise/domestic drives (it dictates how hard a drive will try to recover sector errors before marking them bad and moving on)
On the other hand to your example, if you put a RAID-friendly drive in standalone use and there's a sector issue you're far more likely to lose data.
It would be interesting to know if the TLER is tuneable on these drives (it isn't on lower capacity STx000DM-x drives), but given a 200%+ failure rate in the warranty period on Seagates's DM001 drives (2 and 3TB) I would still be very wary.
Re: comment by Anonymous Coward · 2016-08-03 15:16 · Score: 0

24 hours? You're having a laugh.
Re: comment by Chas · 2016-08-03 16:45 · Score: 1

No. No I'm not. Those drives simply don't have the features to survive in an array environment.
So, like an ordinary desktop drive (which is also missing those features), they'll eventually desync and fall out of the array.
If they tried to put the drives under load (like migrating the contents of one NAS to another), it's ENTIRELY possible that the drives died due to thermal excess (which is what happens when you run them for long periods of time).
And if they're packed in a small NAS box (think Synology DS1515, Drobo, etc), all up tight to one another? They'll cook themselves in short order.
Again, SMR Archive drives ARE NOT meant to be run in RAID/NAS environments! PERIOD! Talk to the manufacturers. They'll tell you the same thing.

--

Chas - The one, the only.
THANK GOD!!!
Those drives are not SSDs by dbIII · 2016-08-04 14:42 · Score: 1

Read what I have written it is very simple. Semiconductors behave differently when hot and that sometimes leads to failure. There is a lot of heat input from the mechanical side of the drives. If it can't be transferred away you get hot electronics no matter what you do on the electronic side.
Does that make sense yet?
1. Re:Those drives are not SSDs by macs4all · 2016-08-04 15:23 · Score: 1
  
  Read what I have written it is very simple. Semiconductors behave differently when hot and that sometimes leads to failure. There is a lot of heat input from the mechanical side of the drives. If it can't be transferred away you get hot electronics no matter what you do on the electronic side. Does that make sense yet?
  I designed DC motor drives for a living (including some battery drives); so I grok thermal transfer, ok? But I also still stand by what my boss told me. Not only was he a good Electrical Engineer, but he was a Physicist, too; which, IMHO, gave him a deeper insight into a LOT of things than most engineering PHBs.
2. Re:Those drives are not SSDs by dbIII · 2016-08-04 17:16 · Score: 1
  
  Maybe you misunderstood what you were told or are taking it out of context. I suggest reading a first year engineering text on materials science among one of the many online so you can get it from an independent source instead of your weird knee-jerk reaction to me.
  You've got no idea how amusing it is for me to hear a non-engineer say to an engineer than you will not believe the engineer because another one told you something that doesn't quite fit the situation. Think for yourself FFS! Stop throwing that appeal to authority around when there are so many on this site that have the same amount of authority! If your boss graduated in the late 1990s he may have been one of my students.
  Put that understanding of heat transfer to use. It's a very trivial situation in this case and nothing like the sort of stuff I used to model.
Re:comment by toddestan · 2016-08-05 10:26 · Score: 1

Even if the drive was not intended to be used for RAID, or designed to spend most of its time sitting on a shelf, I would still expect the drive to last more than 24 hours. Heck, even writing 8TB of data to the drive once then reading it back would take a good fraction of that 24 hour "lifetime". Now, if the drive died after, say, 2 months then maybe your comment would apply.