Slashdot Mirror


Data-Corrupting ext3 Bug In Linux 2.4.20

linuxjack55 writes "Kerneltrap is reporting a data-corrupting bug in the ext3 code of kernel 2.4.20. The scope of the problem (and workarounds) are described in the article, which also includes a link to an interesting interview with kernel hacker Andrew Morton. In it, he states that the '2.4.x core has only stabilized very recently' and the 2.4.x kernel is 'even now...in a late beta state.' He was also asked when the 2.4 kernel could be considered stable. His reply: 'Six months, perhaps?' If that prediction is accurate, 2.6.x could arrive before a 'stable' version of 2.4.x does." (The interview with Morton is from last February -- how stable you consider 2.4 right now is up to you.)

27 comments

  1. Duplicate. by Naikrovek · · Score: 3, Insightful

    they get paid to *not* do this. Why do they not read their own site?

    Mod me down if you must, but I have a good point here.

    http://developers.slashdot.org/article.pl?sid=02 /1 2/02/0128206&mode=nested&tid=106

    1. Re:Duplicate. by Naikrovek · · Score: 4, Insightful

      To be fair, I suppose these guys see *thousands* of submissions per week and easily forget what's been posted and what hasn't.

      But writing a simple filter to check past stories for the same hyperlinks shoudln't be too hard, i wouldn't think.

      Ah well.

    2. Re:Duplicate. by PinkX · · Score: 1

      Like I said on a previous post on another dupe story, perhaps a mechanism which filters on the links attached to a story would surely eliminate a big deal of dupes...

    3. Re:Duplicate. by rlowe69 · · Score: 2

      To be fair, I suppose these guys see *thousands* of submissions per week and easily forget what's been posted and what hasn't.

      There are thousands of submissions, but much much much fewer make it as stories. Surely people that get paid to do this can keep track of just the stories.

      --
      ----- rL
  2. Dupe by FattMattP · · Score: 2, Redundant

    It's a dupe. Please move along.

    --
    Prevent email address forgery. Publish SPF records for y
    1. Re:Dupe by FattMattP · · Score: 2

      Redundant? Give me a fucking break. This entire story is redundant.

      --
      Prevent email address forgery. Publish SPF records for y
  3. first post haiku! by Anonymous Coward · · Score: 4, Funny

    this is a first post
    the editor's are on crack again
    let's beat the dead horse

  4. Corruption includes duplication of data blocks by rakarnik · · Score: 0, Redundant

    An example of such duplication can be seen at this leading news site.

  5. Does it fix buggy code? by heldlikesound · · Score: 1

    If my code is buggy, will the data corrupting bug fix it? Kind of one of those reverse deals, like "I don't not hate you!"

    Well, it would be nice....

    --


    Cloud City Digital: DVD Production at its cheapest/finest
  6. Does /. run 2.4.20 by mhesseltine · · Score: 4, Funny

    Because, the corruption of the databases would explain the duplicate story postings. Come on guys, ride this one for all it's worth.

    --
    Overrated / Underrated : Moderation :: Anonymous Coward : Posting
  7. FULL Interview by MattCohn.com · · Score: 3, Funny

    Andrew Morton then continued... 'One popular company to be affected by the bug was the OSDN. Their technology news site Slashdot.org updated their database server with the new 2.4.20 kernal.'

    It was a disaster. 'The problem was with syncing during unmount. When the people at Slashdot upgraded, their databases became corrupted. Suddenly, duplicate stories began to appear!'

    Added Andrew, 'Wow, duplicate stories. There's a shocker'

    The full interview will be availible soon on The Onion

  8. Journaling by MrWa · · Score: 1
    Wasn't the big benefit that came with ext3 the journaling capacity? Is there some type of journaling that the normal, lay user, isn't aware of?

    How can you verify that this option is not enabled? What, if it is enabled, can be done about it now - can you change the filesystem type (e.g. revert to ext2) or is all hope lost?

    1. Re:Journaling by GreenHell · · Score: 1

      If ext3 is selected as the filesystem type, then journaling is enabled automatically. You can however boot an ext3 filesystem as ext2 since it is backwards compatible, so no, all hope is not lost.

      --
      "I won't mod you down - I feel the need to call you a twit explicitly, rather than by implication."
    2. Re:Journaling by Anonymous Coward · · Score: 0

      The default mode of ext3 journalling is "ordered", and the bug doesn't affect this mode. SO, users of stock ext3 filesystems won't be affected, only people who changed the default journalling mode.

    3. Re:Journaling by 0x0d0a · · Score: 4, Informative

      The default mode is ordered. Basically, this journals only metadata, preventing your filesystem from becoming corrupted. This is the big worry for most people -- losing everything on your partition because of a power loss at a bad time. This may sound not so great, but it's what most other journalling filesystems do --only worry about metadata.

      Journalled mode journals everything, including file data and metadata. This is the uber-reliable (well, when it doesn't have corruption-causing bugs) mode that most filesystems don't bother with because of the speed hit.

      How can you verify that this option is not enabled

      You can look for options in /etc/fstab...it's ordered by default, but if there's an option data=journal, then it's journalled.

      If you're using 2.4.20 right now, I think I'd reboot into your older kernel right now. :-)

    4. Re:Journaling by crimsun · · Score: 2

      As others have stated, the default is data=ordered. If you haven't explicitly specified data=journal in /etc/fstab or via mount, then you're not affected (i.e. safe). /sbin/mount will spit out the parameters you used.

      This bug is in 2.4.20-pre5+, and Stephen and Andrew have both proposed workarounds. Ultimately, it might just be "more" "worth it" to switch to data=ordered if you've been using data=journal after syncing, dropping to single user, and remounting your ext3 partitions as data=ordered.

      Or you could just back out the offending diff.

    5. Re:Journaling by crimsun · · Score: 2

      Err, /bin/mount, rather. Stupid fingers. ;-)

  9. So... by J'raxis · · Score: 1
    The symptoms are that any file data which was written within the thirty seconds prior to the unmount may not make it to disk. A workaround is to run `sync' before unmounting
    So I guess you posted this story twice just in case the last one didn't get synced to disk properly, eh? *ducks*
    1. Re:So... by josepha48 · · Score: 2
      Amazing thing is that my netbsd box sync's disks as part of the shutdown script.

      Interestingly enough so does shutdown on Linux. So what's the problem?

      --

      Only 'flamers' flame!

    2. Re:So... by J'raxis · · Score: 1

      The `sync` program simply executes the fsync system call, which is what does the actual work (tells the kernel to flush its filesystem buffers to disk). The thing is, `umount` is also supposed to call fsync but apparently in this EXT3 configuration it isn't calling it properly.

      Running `sync` before a `umount` should be redundant -- if `umount` is working correctly.

  10. Still? by tpv · · Score: 1
    Come on, everyone has known about this bug for more than 24 hours.

    You think they'd have fixed it by now.

    --
    Read more of this story at Slashdot.Read more of this story at Slashdot.Read more of this story at Slashdot.
  11. Re:Hold up a minute! by Anonymous Coward · · Score: 0

    HaHaHa!

    And this is the only post that gets modded down to -1.
    Mods, your bias is showing...

  12. Wait! Wait! by Anonymous Coward · · Score: 0

    Where's the 750-plus comments to the effect of "Oh, we've fixed that already - and n times faster than (disdain)Microsoft(/disdain) did.

    Oh, and where's the obligatory "But we're so much better than *they* are" comments?

    Since all of you are such m4d l337 hax0rs, where's the karmawhored link to a detailed fix or workaround?

    Where's the +5 "Linux is so much better" comment now? How long did this take to be found? Was there not enough caffeine or time off from college classes to locate and squash it before I could fire off this choice little missive?

    I noticed there are 20 comments to the 1 thousand plus associated with any potential opportunity to bash non-Open Source/Linux products or ideas... I'm not even talking Microsoft itself here. Run out of nasty, antisocial things to say or is it just your favorite pet project has a flaw and you just can't bear to face facts?

    Just because you're still living with your parents doesn't mean there are hundreds of thousands of people trying to make a living, not a hobby, out of their favorite work.

  13. Re:Wait! Wait! by Anonymous Coward · · Score: 0

    There's none of those as this (and the previous one story on this) never got posted to the front page.

  14. Mod down by Anonymous Coward · · Score: 0

    FattMattP is such a fucking dumbass

    1. Re:Mod down by FattMattP · · Score: 2

      Strong words from a coward.

      --
      Prevent email address forgery. Publish SPF records for y