New Seagate Shingled Hard Drive Teardown
New submitter Peter Desnoyers writes: Shingled Magnetic Recording (SMR) drives are starting to hit the market, promising larger drives without heroic (and expensive) measures such as helium fill, but at a cost — data can no longer be over-written in place, requiring SSD-like algorithms to handle random writes.
At the USENIX File and Storage Technologies conference in February, researchers from Northeastern University (disclaimer — I'm one of them) dissected shingled drive performance both figuratively and literally, using both micro-benchmarks and a window cut in the drive to uncover the secrets of Seagate's first line of publicly-available SMR drives.
TL;DR: It's a pretty good desktop drive — with write cache enabled (the default for non-server setups) and an intermittent workload it performs quite well, handling bursts of random writes (up to a few tens of GB total) far faster than a conventional drive — but only if it has long powered-on idle periods for garbage collection. Reads and large writes run at about the same speed as on a conventional drive, and at $280 it costs less than a pair of decent 4TB drives. For heavily-loaded server applications, though, you might want to wait for the next generation. Here are a couple videos (in 16x slow motion) showing the drive in action — sequential read after deliberately fragmenting the drive, and a few thousand random writes.
At the USENIX File and Storage Technologies conference in February, researchers from Northeastern University (disclaimer — I'm one of them) dissected shingled drive performance both figuratively and literally, using both micro-benchmarks and a window cut in the drive to uncover the secrets of Seagate's first line of publicly-available SMR drives.
TL;DR: It's a pretty good desktop drive — with write cache enabled (the default for non-server setups) and an intermittent workload it performs quite well, handling bursts of random writes (up to a few tens of GB total) far faster than a conventional drive — but only if it has long powered-on idle periods for garbage collection. Reads and large writes run at about the same speed as on a conventional drive, and at $280 it costs less than a pair of decent 4TB drives. For heavily-loaded server applications, though, you might want to wait for the next generation. Here are a couple videos (in 16x slow motion) showing the drive in action — sequential read after deliberately fragmenting the drive, and a few thousand random writes.
Sounds like an interesting idea to increase capacity, but the downsides are huge. This really needs filesystem level optimisations to get any performance out of it. For rarely modified bulk storage, this should be fine.
09F91102 no, 455FE104 nope, F190A1E8 uh-uh, 7A5F8A09 that's not it, C87294CE no. Ah! 452F6E403CDF10714E41DFAA257D313F.
This seems that it would be beautiful for a backup server that backs up every few weeks.
One thing I've tended to wonder, why have a single read-write needle on conventional drives (especially in multi-platter situations). Why not have two needles, one on either side so they can't touch.
Alternately, why not a "track" that runs across the drive with shuttles on either side to perform the reads/writes. You could have two perpendicular tracks to increase performance
This reminds me of the original 'Log Filesystem' research in the 80s, back when drive geometry was known to the OS, and the OS could take steps to optimize for it.
The Log Filesystem concept was to write all data sequentially to disk, and update metadata during idle times.
The basic research influenced a number of filesystems, such as NetApp's WAFL, Sun's ZFS, Linux's JFFS2, etc.
Interesting to see the concept implemented 'in hardware'.
The problem with those helium filled hard drives is they float away as soon as I take them out of the package.
Why is the camera a potato?
The summary says "Reads and large writes run at about the same speed as on a conventional drive, and at $280 it costs less than a pair of decent 4TB drives", one of the 7 links in the summary mentions a 5TB model. 5TB for the price of two 4TB drives doesn't sound that great.
The real question is whether or not Seagate can maintain similar full drive performance compared to a non-SMR drive.
No.The real question is longevity. Per backblaze and my own anecdotal experience, Seagate drives already have a higher failure rate. Looking at this, any firmware bug or flaw could result in massive data loss of an entire 'band' if written incorrectly.
I understand that in any environment backups are crucial, but I live in the real world. A world where small and medium size business (for good or ill) neglect IT until it bites them. At least with regular drives recovery is often possible with block for block copies, and baring that a clean room has a good chance of recovering crucial data.
If a user has a performance need, I can suggest an SSD or SSD+HDD config with appropriate redundancy and backups. For pure space, large HDDs in an appropriate RAID or ZFS work fine. Per TFS, this is not ready for heavily drive loaded server configs yet, and i do not see a need in residential, or small biz or workstation use where other solutions are far better. To me this is currently a product looking for a solution, and one that is risky to data to boot.
Silence is a state of mime.
Those two videos have less pep than an empty soda can.
Find me an 8TB SSD that is even within spitting disttance (hell, within ICBM distance) of $300 and you win the prize, otherwise the suggestion is sless. Hint: Not today, not next year. Possibly this decade . The cost has come down a ton, but it was absolutely astronomical before.
The systems I'm buying now will be obsolete by the time SSD can even think about touching hard drives in terms of capacity per $. Typically, the ONLY reason to go full SSD now for large storage capacities is because you absolutely need the performance and are willing to pay essentially "whatever it costs" (at least 8x+ the price) because it's that important to get the IOPS. Maybe by the end of next year we'll get it down to "only" 4x the price (not counting that though because price per GB for large capacity hard drives still continutes to fall, balancing out a part of the cost reduction in SSDs).
1 minute and 46 seconds of my life back. Please?
"A 'person' is smart. 'People' are dumb, panicky animals and you know that."
Does it come with the latest NSA firmware release?
What if you never delete anything, then there's no garbage.
Lost me at "Seagate" - I just don't care.e
They've screwed their reputation as far as I'm concerned - like in 2007. Tried them again in 2014. Screwed again.
If they last until 2024 ... I _might_ try seagate again. Maybe.
SSD for boot/OS/swap, and slow spinner for data gives 99% of the performance for 99% of people.
That would be great except 99% of people don't want more than one disk.
Hell, *I* don't want more than one disk, and I can ably manage them. But there's no way I can afford the SSD it would require to store everything I have (never mind the backups).
"There is more worth loving than we have strength to love." - Brian Jay Stanley
imagine, something with actual moving parts for data storage in the early-21st century.
9% of people don't care. I set up my mother's computer with the OS on C and everything else
She sure will care if there's every a problem (and there will be a problem) with one of those drives.
As a rule of thumb it's way better not to double someones possible failure rates if they don't know themselves how to recover from it...
I've spent my life helping people get set up technically so they never need to talk to me again - at least not about their systems. It creates a lot less work for yourself, unscheduled works that generally comes at very bad times.
"There is more worth loving than we have strength to love." - Brian Jay Stanley
How impatient must one be to tear down a Seagate hard drive before it breaks down?
data can no longer be over-written in place, requiring SSD-like algorithms to handle random writes.
Good, now when my clients get hit by ransomware there is still hope that the "over-written" file can be recovered.
Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.