Slashdot Mirror


Seagate Firmware Performance Differences

Derkjan de Haan writes "The Seagate 7200.10 disk was the first generally available desktop drive featuring perpendicular recording for increased data density. This made higher-capacity disks with excellent performance cheaper to produce. Their sequential throughput actually exceeded that of the performance king — the Western Digital Raptor, which runs at 10,000 RPM vs. the more common 7,200 RPM. But reports began to surface on the Net claiming that some 7200.10 disks had much lower performance than other, seemingly identical disks. Attention soon focused on the firmware, designated AAK, in the lower-performing disks. Units with other firmware, AAE or AAC, performed as expected. Careful benchmarks showed very mixed results. The claims found on the Net, however, have been confirmed: the AAK disk does have a much lower throughput rate than the AAE disk. While firmware can tune various aspects of performance it is highly unusual for it to affect sequential throughput. This number is pretty much a 'fact' of the disk, and should not be affected by different firmware."

31 of 177 comments (clear)

  1. bug by Anonymous Coward · · Score: 5, Insightful

    When the performance of a lower-end drive is better than that of a higher-end (or, god forbid, a SCSI drive!) this is a serious bug that of course needs to be fixed in the firmware update.

  2. Reliability by PlusFiveInsightful · · Score: 5, Insightful

    I'll take reliability over performance of a hard drive any day. Nothing sucks more than swapping out drives.

    1. Re:Reliability by RingDev · · Score: 2, Insightful

      Nothing sucks more than swapping out drives. Spoken like a man who's never been kicked in the nuts...

      I'd rather hot swap a failed raid drive than bring down a server to increase memory or redesign a solution from scratch in order to achieve the same performance gains. Heck, for the cost of having a coder just look at the I/O intensive code I could have bought another hard drive.

      -Rick
      --
      "Most people in the U.S. wouldn't know they live in a tyrannical state if it walked up and grabbed their junk." - MyFirs
    2. Re:Reliability by tepples · · Score: 2, Insightful

      Heck, for the cost of having a coder just look at the I/O intensive code I could have bought another hard drive. In which country? In some countries, high import duties and a weak local currency mean that the price of a hard drive is worth a lot more hours of labor than it would be in, for example, the United States or the United Kingdom. And across how many machines does your app run?
    3. Re:Reliability by rcw-work · · Score: 4, Insightful

      Compared to just replacing the hard drive for $150. Hardware is cheap. Labor is not.

      Your example makes sense, but what if you've already done that? Say your app is SQL-based and does some queries that are O(n^2) complex. You've already spent $20k on a bad-ass server with RAID10, a bunch of spindles, separate transaction log drives, and as much RAM as can fit. Now, a year later, there's more records in the system and performance sucks again. Where do you go from there? These disks don't go to 11. If you want to double the performance of that $20k box, you're likely going to spend not $40k but $200k.

      Once you outgrow commodity parts, if you want a 2x speedup, you'll usually have to pay 10x for it. Or wait three years. The price/performance curve is deceptively shallow towards the bottom end.

  3. Re:Linux check by Nimey · · Score: 2, Informative

    Sigh, never mind. Ubuntu's been updated since I put this computer together, so now hdparm /can/ talk to a SATA drive.

    Wouldn't you know that I've got an AAK disk.

    --
    Hail Eris, full of mischief...

    E pluribus sanguinem
  4. AAE vs AAK: It's a tie by Bellum+Aeternus · · Score: 3, Insightful
    So the whole article comes down to the fact the new Seagates are really good if you use them for what they're designed for, but are not as good at what they're not designed for. Surprise...

    Looks like Seagate designed the new drives for servers (probably file servers) because they're really good a moving large chunks of data around, doing large reads, and large write, but not so good a a ton of little reads and writes. So, don't buy them for your desktop/workstation.

    --
    - I voted for Nintendo and against Bush
  5. Well, at least you know... by jd · · Score: 2, Funny

    ...why they named it AAK!

    --
    It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
  6. RAID1 by Anonymous Coward · · Score: 4, Interesting

    Disks are cheap. I *always* run a RAID1 mirrored pair in my PCs, as pretty much all mobos these days have RAID1 capability built into the chipset's SATA controller anyway.

    On my main machine at home, I always buy my disks in groups of three drives whenever I upgrade. Two drives stay in the machine as the mirrored pair, and once a month I pull one out and stash it in a safety deposit box at my bank, and put the third drive into the machine and re-sync the mirror. That way if my house burns down / tornado smashes it or whatever bad thing that might happen, I've got a drive with my machine's image on it, no older than one month, stashed away offsite in a secure place so I can recover most all my stuff to a new machine.

    1. Re:RAID1 by Gandalf_the_Beardy · · Score: 3, Insightful

      It works for me - we have at least a thousand disks in our datacentre in raid5 arrays with 10+ disks per array - all the same make, model and build date and haven't yet had any fail so close that we couldn't leisurely swap the duff one out and rebuild onto the replacement. Quite why people suddenly think that drives are going to fail catastrophically at the same time like this is beyond me when the real world experience says it just isn't so.

    2. Re:RAID1 by GooberToo · · Score: 4, Insightful

      as pretty much all mobos these days have RAID1 capability built into the chipset's SATA controller anyway.

      And many of those are actually slower than a pure, software-only, RAID solution. Sometimes the "hardware RAID" does nothing but offload checksum calculations or other bits onto slower hardware resulting it in being a major performance hinderence rather than a performance boost. Worse yet, if your controller card dies, ALL of your data is now inaccessible. Worse yet again, there is not guarantee future hardware releases, even by the same manufacturer, will be compatible. Heck some of the really low end hardware solutions don't even provide mirrored reads, which should provide a 2x read-only performance boost.

      Not all RAID is created equal. And for many, software RAID, especially for Linux users, provides a solution faster than many RAID hardware solutions, is future proof, and only costs a couple of precent in additional CPU load. Best of all, it's free and works well with LVM. In a day and age where multiple cores are common and few actually use more than one, this option doesn't have much of a downside until you're willing to look at *REAL* RAID hardware.

    3. Re:RAID1 by Cef · · Score: 4, Interesting

      I've had disks fail almost all at the same time before.

      It's really annoying when the following happens:

      - Disk 1 dies in a RAID5 set
      - Hot spare (Disk 4) comes online and starts rebuilding
      - Disk 2 dies during the rebuild thrashing
      - Rebuild never completes
      - Put in 2 new disks
      - Restore a backup
      - Disk 3 fails during restoration, pulling in the hot swap (one of the new disks)
      - A year later, the original hot spare (Disk 4) fails, leading to another rebuild

      From my own experiences, the main culprit in these sorts of cases tend to be the bearings. Why they have a tendency to go at the same time, I have no idea. Haven't had it happen lately, but I know I'd rather avoid the problem.

      Usually though, it's not the make/model/build date that is the issue, but the batch number (especially for the parts rather than the drive). Parts tend to get allocated in batches, so if you get a batch of say.... bearings, that aren't up to snuff, that batch of drives will probably fail earlier, while others (even ones manufactured on the same date) will be fine.

    4. Re:RAID1 by evilbessie · · Score: 2, Insightful

      Use RAID 6, you can lose any 2 disks and still have all the data, means that data is secure whilst the array is rebuilding from a single failed drive. Alright you could lose 3 disks at once but that is much less likely than losing one or two, especially if the failure/rebuild occurs quickly.

  7. Not really same drives by zdzichu · · Score: 5, Insightful

    From TFA page 6:

    A sad detail is that updating an AAK disk to other firmware is impossible, due to physical differences of the two disks.
    (emph. mine)
    Different disks have different performance. News at 11.

    --
    :wq
    1. Re:Not really same drives by DAldredge · · Score: 4, Insightful

      Two drives sold under identical make and model identifiers should not be that different.

    2. Re:Not really same drives by jhesse · · Score: 4, Informative

      Tell that to D-Link.

      They were selling a USB 802.11G dongle (Model DWG-122, IIRC), one model number, *THREE* different chipsets (each requiring different drivers, only one of which had drivers for other than Windows)

      Nothing on the box other than a "A" "B" or "C" in tiny print in a corner.

      --

      --
      "I have also mastered pomposity, even if I do say so myself." -Kryten
  8. iis kdawson's spell-check firmware broken? by garcia · · Score: 2, Funny

    This number iis pretty much a 'fact' of the disk, and should not be affected by different firmware.

    Poor spell checking is pretty much a 'fact' of the browser you use when you submit articles to Slashdot, and should be affected by different editors.

    Perhaps kdawson's firmware is broken? :)

    1. Re:iis kdawson's spell-check firmware broken? by ruiner13 · · Score: 2, Funny

      Perhaps kdawson loves windows and IIS so much that he added it to his dictionary? ;)

      --

      today is spelling optional day.

  9. The Day the Earth Stood Still by Anonymous Coward · · Score: 5, Funny

    Whatever you do, don't stream audio from one of the -K drives across Vista!

  10. Why the mixed results? by Froggie · · Score: 2, Informative

    It's interesting to note that the general purpose benchmarks come out with AAK in the lead while the others, all very much sequential read focussed, don't. So the question is, what exactly are the operations that the AAK is doing faster in the mixed benchmarks? Seeking? Or maybe it's a bus bandwidth limit at the hard drive end?

    Sadly, we can't tell, because the author has focussed on the sensationalism of poor performance rather than asking these questions. Seems to need a few more experiments setting up, or alternatively an answer from the horse's mouth.

    Some candidate theories:
    - microcontroller software bug (unlikely)
    - hardware cost-down such as a slower, cheaper microcontroller or less RAM on the drive (quite likely)
    - rebalancing the performance optimisation, changing the cacheing or readahead algorithms to suit typical loads (possible, but it seems odd that this would limit linear read performance)

    1. Re:Why the mixed results? by Devistater · · Score: 2, Interesting

      Its not less RAM, all the 7200.10 perp drives are 16 meg cache, at least all the ones above 300 gigs are. And looks like some of the 250gig as well
      http://www.seagate.com/docs/pdf/datasheet/disc/ds_ barracuda_7200_10.pdf

      Its only when you get down to the 80 and 120 gig sizes that the cache is reduced. And thats to save money on the production costs since the drive itself sells for less. If people want a cheaper, smaller capacity drive, they aren't likely to be willing to pay more for the 16 meg cache.

      So "less RAM" can pretty much be eliminated. Your other theories could still be correct though. I personally would lean towards a bug, one that passed the Q&A because it didn't affect all performance characteristics of the drive.

  11. It's true by fifirebel · · Score: 5, Informative

    I have been setting up a couple of 8-drive RAID-5 arrays with these drives for some customers, and I also found out that 3.AAE drives performed much better that 3.AAK. No idea why. Seagate was unresponsive to queries about flashing the firmware and I had to replace all the 3.AAK drives by 3.AAEs.

    The manufacturing country had nothing to do with it. I had some chinese 3.AAE and 3.AAK as well as taiwanese (or was that thai?) 3.AAE and 3.AAK. 3.AAE would always perform better.

    The kind of testing I performed was:

    • hdparm -t /dev/sdN (AAK: 50 MB/s vs AAE: 72 MB/s)
    • time dd if=/dev/sdN of=/dev/null bs=1M (AAK was 10-15% slower)
    • iozone over ext3 showed slighly worse results with AAK than with AAE, but it was probably within the sampling/error margin (< 5%).

    Also, if you buy a retail kit (which I found cheaper than OEM at Fry's), there is no way to find out the firmware level on the box. You had to open the retail boxes to check the firmware revision on the drive itself.

    One theory I have is that these drives can supposedly be configured for server or workstation workloads. It could be that AAK drives are configured for server workloads by default (unless overridden) while the AAE are configured for workstation workloads by default. I have no idea how to toggle this under Linux.

  12. drive failure by leuk_he · · Score: 3, Interesting

    Quite why people suddenly think that drives are going to fail catastrophically at the same time like this is be

    An experienced administrator would know there is one item in the data center everything is relying on no-one could ever think of it failing, and it will fail at the most catastrophic time you think of. It won't be all fo those 1000'thns drives failing at the same time because some plane mistook your server lights for the landing runway, It will be some cheap sprinkler, the security lock of the door, Or some manager that decides to shutdown a machine to protect it from a Denial of service attack.

    If there is no such item a good BOFH will create such red button.

    1. Re:drive failure by Gandalf_the_Beardy · · Score: 3, Funny

      That's why we have a hot standby datacentre with real time replication to it. Shame that one of our contractors reversed over the gas main and we evacuated leaving all the access cards to the hot standby in the evacuated building.....

  13. Re:why do girls always abandon me? by Lars+T. · · Score: 3, Funny

    Because you have a harddrive but no firmware.

    --

    Lars T.

    To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

  14. three out of five ain't bad by peterxyz · · Score: 2, Interesting

    yup, about a decade ago I worked somewhere where this was an issue - they had a RAID configuration of somekind (I'm a nerd, but not a hardware one) and they had bearing failures in sufficiently close succession that the third failure occurred before all of the swapping from the second failure hadn't been completed.

    supposedly it was traced to a common fault in the bearings

  15. Seagates, and Linux HD optimization by straponego · · Score: 3, Insightful
    First, a comment on the Seagate 750G drives: If you run these, and you want to keep them running, make sure you have clean power. I've seen several of them die, usually after a power outage. Never seen one on a UPS die.

    Also, if you're concerned about Linux block device performance, look at the various kernel tunables. On a single drive, such as those Seagates, I can get extra ~10MB/s. On RAIDs and LVM volumes, the differences can be much higher-- more than twice as fast, in some cases. There are a few parameters that make a difference, and many values you might want to try for each. I have a script iterate through the various permutations, running IOZone on each, so I can see what does best for read vs. write and large vs. small file performance. But I can't release it just yet (employer makes 100% of income from Open Source; employer hates Open Source). Anyway, somebody out there can do better than I, I'm sure :)

    This discusses the tunables you'd want to check: http://www.3ware.com/KB/article.aspx?id=11050

    Note that these do NOT apply only to 3Ware controllers. And the differences in performance can be massive.

  16. Big Red Button by Jeff+Carr · · Score: 5, Funny

    If there is no such item a good BOFH will create such red button. One of the data centers I worked at had just such a red button. It was designed to immediately kill all power to the room. Behind a plastic case, clearly marked "Emergency Shutoff".

    The security for the door was malfunctioning earlier this summer, and the alarm was going off. The security guard thought the button was a shutoff switch for the security system... Luckily we had redundant servers at another location... Of course half of those didn't work...

    Luckily also, this was the smaller data center at that site, so it only housed a few hundred servers... including the servers that ran many of our ATMs, and our server inventory and trouble tracking software... which didn't fail over to their backups... of course.

    In addition, we had no idea where the server housing our server inventory information was... It turns out it was housed on a server called Skywalker... which we couldn't find... It turned out to be a cluster of Anakin and Amidala...

    Fracking geeks.
    --
    The television will not be revolutionized.
  17. Perpendicular Recording Overview Video by Jaxoreth · · Score: 4, Funny

    For those who are unclear on what perpendicular recording is, Hitachi made a video explaining how it works. It's a bit dry and technical, but I figure the Slashdot crowd is savvy enough to grok it.

    --
    In general, it is safe and legal to kill your children. -- POSIX Programmer's Guide
  18. Observation from the "inside" by Distan · · Score: 5, Informative

    I am an insider in the drive industry, so while I need to be vague on some things, I can add clarification on others.

    A hard drive is a very complex subsystem inside your computer, more complex than many people realize. A hard drive contains one or more CPUs, memory, firmware, and dedicated hardware devoted to the functions of storing and retrieving data.

    There is no single "right" way to draw the line between what is firmware and what is hardware in a hard drive. Algorithms could be coded in VHDL or Verilog and synthesized into the silicon, or they could be compiled in C (or hand coded in assembly) and be embedded in firmware. Each drive company has their own philosophy for where to draw the line.

    Some drive companies choose to implement only fundamental functions in silicon, and implement everything else in firmware. For these companies, comparing their firmware to the BIOS in a PC is a poor analogy. A better analogy would be to compare the firmware to the operating system.

    In a system with "lite" firmware, the firmware typically would be responsible for configuring a few control registers and buffers, and then the hardware would take over. But for a system with "heavy" firmware, the firmware behaves much more like a kernel. Data is not going to be moved in or out of buffers, or be sent to and from platters, without the active involvement of the firmware scheduling and ordering that activity.

    The author of the OP wrote "it is highly unusual for (firmware) to affect sequential throughput". The author is wrong. In a system with "heavy" firmware, all performance is highly dependent on the firmware. It can easily make the same difference in performance as you would see running Windows 95 v. Windows XP v. Windows Vista v. RH 7.2 v. RHEL 3.0 on the same PC hardware.

    I do not know if the Seagate drive in question is a "heavy" or "lite" firmware drive, but I do know that the assumption that firmware takes a minor role in hard drive performance is mistaken.

  19. Not a bug, it is an OEM feature by Klaus_1250 · · Score: 2, Informative

    The difference between these drives is not only the firmware, the hardware is also different. If you look a bottom of the drives, you can see the board has a completely different layout and presumably (the pictures I've seen were too low quality and the memory was not on the visible side on the AAK-drives) different chips. According to Seagate, the AAK drives were for an OEM-customer (unfortunately, they didn't mention which one). But how or why those drives made it to retail-channels (Seagate and the OEM-customer knowing the drives had a different performance profile)?

    --
    It only takes one man to change the Wisdom of the Crowd to Tyranny of the Masses.