Slashdot Mirror


Intel Confirms Data Corruption Bug, Halts New SSDs

CWmike writes "Intel has confirmed that its new consumer-class X25-M and X18-M solid state-disk drives (SSDs) suffer from data corruption issues and said it has pulled back shipments to resellers. The X25-M (2.5-inch) and X18-M (1.8-inch) SSDs are based on a joint venture with Micron and used that company's 34-nanometer lithography technology. That process allows for a denser, higher capacity product that brings with it a lower price tag than Intel's previous offerings, which were based on 50-nanometer lithography technology. Intel says the data corruption problem occurs only if a user sets up a BIOS password on the 34-nanometer SSD, then disables or changes the password and reboots the computer. When that happens, the SSD becomes inoperable and the data on it is irretrievable. This is not the first time Intel's X25-M and X18-M SSDs have suffered from firmware bugs. The company's first generation of drives suffered from fragmentation issues resulting in performance degradation over time. Intel issued a firmware upgrade as a fix."

12 of 137 comments (clear)

  1. Test before you ship by alain94040 · · Score: 4, Interesting

    Maybe they should have used HW/SW co-verification (like Seagate in that study - an example of how a storage company tests their firmware).

    For you software developers out there who enjoy free debuggers, you should know that we, hardware designers, also have our own debuggers. Except they are a little bit more expensive (think $500,000+) and can be quite bulky. But they are the only way to really test firmware before taping-out a chip.

    1. Re:Test before you ship by Anonymous Coward · · Score: 5, Informative

      As a professional FW tester, I can say 1) firmware can be tested easier than the hardware verification the parent is talking about, and 2) Parent is confusing HW verification with firmware verification. Don't confuse HW verification with Firmware, and don't confuse Software testing with hardware verification. They are vastly different than each other, and have their own set of tools and methods (try sitting through a STAR East or STAR West seminar as a FW tester - it is a total waste of time).

      I can (and do) test firmware on buggy hardware all day long - its not an issue.

  2. Re:I find this disturbing by jtownatpunk.net · · Score: 5, Insightful

    Future? You must be new to computers. I updated the firmware in my very first 80's printer to give it more features. Had to pop out the old chips and put in the new ones. I upgraded the firmware in modems from several different manufacturers (some more than once) to add features and fix bugs. I've updated the firmware (BIOS) on most of my motherboards. I've updated the firmware on optical drives. I've updated the firmware on a scanner. I've updated the firmware on SCSI controllers. I've updated the firmware on hard drives. I've updated the firmware on switches and routers. Hell, I've updated the firmware on keyboards.

    This is hardly a new phenomenon.

  3. Re:Ugh... summary.... by ShadowRangerRIT · · Score: 4, Informative

    The X25-M's initial firmware was unusually bad; the degradation was more rapid and more severe than necessary. Thus, they issued a firmware update. The results were quite impressive. It not only reduced the perf degradation, but it seems to have made writes faster across the board.

    --
    $_ = "wftedskaebjgdpjgidbsmnjgcdwatb"; tr/a-z/oh, turtleneck Phrase Jar!/; print
  4. Re:Ugh... summary.... by Krizdo4 · · Score: 4, Informative

    The performance degradation in the Intel X-25 is not because of a "firmware bug".

    Bugs can cause slowdowns, too

    Though it's highly regarded, Intel's X25-M SSD had a firmware bug that adjusted the priorities of random and sequential writes, leading to a major fragmentation problem that dropped throughput dramatically. The issue was originally uncovered by PC Perspective after two months of testing. Those tests showed that write speeds dropped from 80MB/sec. to 30MB/sec. over time, and read speeds dropped from 250MB/sec. to 60MB/sec. for some large block writes.

    https://www.techworld.com.au/article/302571/ssd_performance_--_slowdown_inevitable?pp=3

    Before firmware update

    the result suggested a write speed of 30 MB/sec.

    http://pcper.com/article.php?aid=691&type=expert&pid=3

    After firmware update

    After composing myself, I did the same file copy I had tried earlier. 76 MB/sec.

    http://pcper.com/article.php?aid=691&type=expert&pid=4

    Not a firmware bug?

  5. Feature Not A Bug by mrbene · · Score: 5, Insightful

    Seriously, I'd say this is in the By Design bucket. For the security conscious - set a BIOS password. If the (feds/aliens/wife/others) remove the password, all access to the data is gone.

    Brilliant! Secure!

    Mind you, not being able to change my password once every other day might hinder my current security model.

  6. Re:Well.. by rickb928 · · Score: 4, Interesting

    Is this a cost issue, or a thoroughness issue?

    No, we dont catch every possible scenerio here, either, but we do try very, very hard. Knowing one of the coders in Intel's RAID drivers groups, he goes crazy with stuff. And he just writes Linux drivers. I do not envy him - this past year, every bug he's had to fix has been caused by someone else's code. Someone not writing Intel drivers. And he gets slammed every time for bad testing, as if he can test all the rest of the kernel team's stiff, NTM every fly-by-night Chinese hardware outfit. They're killing him.

    I can't even say 'ext4', he just goes insane. Though he chuckles when I whisper 'ReiserFS', and opens another beer.

    I'm glad I'm not in that line of work.

    --
    deleting the extra space after periods so i can stay relevant, yeah.
  7. Re:I've seen this before by Grishnakh · · Score: 4, Insightful

    Why bother though? If someone breaks in, you'll have to fix or replace your front door, even though the motion-detecting laser robots zapped him. If you just leave your front door unlocked instead, intruders can just walk in, and the laser-wielding robots can zap him, and then automatically dispose of the body for you too. This way, the intruder won't cause any damage.

  8. Re:Typical redditor by Movi · · Score: 4, Insightful

    Because suddenly your code becomes time-based, eg it matters WHEN x=0 becomes x=1, and what's in between.

    Believe me, this kicks you in the balls really hard. I still remember the frustration on my Altera course, where in simulation everything worked fine, but once flashed onto a FPGA everything went to shit.

  9. Re:I've seen this before by NotQuiteReal · · Score: 4, Funny

    To keep out the innocent neighbor kids or the maid who comes on the wrong day. You only want to dispose of bodies that deserve it.

    You'll sleep better that way.

    --
    This issue is a bit more complicated than you think.
  10. What took them so long to report this? by AllynM · · Score: 4, Informative

    Welcome to 2 weeks ago:

    http://www.pcper.com/comments.php?nid=7544

    Allyn Malventano
    Storage Editor, PC Perspective

    --
    this sig was brought to you by the letter /.
  11. Re:Typical redditor by NP-Incomplete · · Score: 4, Informative
    On a chip, adding 2^256-1 and 1 may not equal 2^256 when:
    1. Your destination register is 256 bits.
    2. Your destination register is in a different clock domain.
    3. Your timing constraints are wrong.
    4. Your power grid cannot support switching 256 registers.

    Functional simulations will only catch #1.