Slashdot Mirror


Serious Bug In 2.4.15/2.5.0

John Ineson writes: "There is a bug in the latest kernel releases, that causes fs corruption on umount. A lot of people have already been hit by this, so for now I suggest you hold fire on booting those new kernels. More dead-duck than greased-turkey. Two possible fixes are being discussed on linux-kernel." Colin Bayer adds links to a story at the Register and Al Viro's fix. Update: 11/25 00:39 GMT by T : Tarkie writes "Linux 2.4.16-pre1 is out, as detailed at NewsForge. If you've been having the filesystem corruptions, might be worth a try so that 2.4.16 can be out ASAP!"

275 of 498 comments (clear)

  1. Filesystems by fishebulb · · Score: 2

    From the looks of the post this bug occurs regardless of filesystem. Is that accurate? or would certain fs's be unaffected, im guessing that it doesnt matter, anyone care to clarify that

    1. Re:Filesystems by MShook · · Score: 5, Informative

      You're correct, it is regardless of filesystem. If you happen to be running 2.4.15 or 2.5.0, just remember to force a fsck for the next reboot (shutdown -F) that's the only way to clear the fs because it will be marked clean even if it's not). Right now, the developpers don't know how reseirfs would deal with this bug...

    2. Re:Filesystems by Colin+Bayer · · Score: 5, Interesting

      It afflicts every filesystem. However, rebooting with the file /forcefsck extant forces it to run an fsck (and fix the corruption) on boot.

      Also of help might be the Alt+SysRq keys; if you sync the drives and unmount them in single user mode before reboot, you should reduce or eliminate the corruption.

      --
      Want Linux games? HERE.
    3. Re:Filesystems by Free+Bird · · Score: 1

      Great, I never compile my kernels with SysRq (except on non-production machines)...

    4. Re:Filesystems by Lumpy · · Score: 2

      is it what was causing every one of my disks to report that there was media change every 3 seconds?

      I hate it when these things happen, espically when I'm fighting with ieee1394 bugs and patches..

      I though I screwed it up so re-image my dev box with a new RH7.2

      Ack, tis the price for living on the bleeding edge.

      --
      Do not look at laser with remaining good eye.
  2. Alan Cox by linux_warp · · Score: 1, Funny

    Good thing we have alan cox who tries to keep his tree somewhat more stable. Anyone know if his kernels were affected?

    1. Re:Alan Cox by linux_warp · · Score: 4, Interesting

      Also, straight out of alans diary:

      September 29th - Much kernel patching going on. The -ac kernel tree seems to be turning into the stable tree as Linus merges odder, weirder and more alarming things. I just hope he knows what he is doing.
      ---

      Sounds like confidence to me :)

    2. Re:Alan Cox by Rohan427 · · Score: 1

      Note: This is not supposed to be a stable release.

      - Rohan

  3. Stick with 2.4.15-pre8 by ShawnX · · Score: 2, Informative

    No problems with this kernel pre release :)

    --
    Everyone wants a Tux in their life.
    1. Re:Stick with 2.4.15-pre8 by fok · · Score: 1

      Ive just booted 2.5.0 on 2 boxes, and 2.4.15pre8 on the production server... hehehe

      I use ext3 or reiserfs and no problems so far...

      --
      \m/
    2. Re:Stick with 2.4.15-pre8 by X-Dopple · · Score: 1, Offtopic

      # make menuconfig

      <snip compiling stuff>

      Cannot find ncurses, please install it on your system (paraphrased)

      #rpm -q ncurses
      ncurses-5.2-16mdk

      This is on a Mandrake 8.1 system. What is wrong with the kernel?

    3. Re:Stick with 2.4.15-pre8 by Carlos+Laviola · · Score: 2

      You need ncurses development libraries as well. I'm a Debian user, so Debian's package is named 'libncurses5-dev'. They might be called 'ncurses-dev' on Mandrake.

    4. Re:Stick with 2.4.15-pre8 by redhog · · Score: 2

      Nope, he needs libncurses5 and libncurses5-devel in addition to ncurses...

      --
      --The knowledge that you are an idiot, is what distinguishes you from one.
  4. quality assurance by xah · · Score: 1, Insightful

    When are we going to start giving kernels to a QA team before releasing them?

    --
    I am not a lawyer. Do not take my words as legal advice. If you need legal advice, consult an attorney.
    1. Re:quality assurance by Sj0 · · Score: 1

      As soon as you get a professional group of developers who are compensated for their work. You can't rely on the garbage thats being churned out be the long-haired communist hippies that are in charge now.

      Let he who has never had windows corrupt a part of a FAT partition bitch about how it's because they don't get paid. The rest of us will deal with it as a part of trying new things (innovating, if you will) is making mistakes and fixing them, and anyone who runs out and installs the latest patch the second it's posted is asking for trouble. If it ain't broke, don't fix it.

      --
      It's been a long time.
    2. Re:quality assurance by gweihir · · Score: 1

      Release early and often is not the problem here. The problem is that things that should have been done in development kernels (i.e. 2.5.x) where done in the stable series. _Not_ a good idea!

      Maybe some people just failed to reconfigure their mindset on stability for 2.4.x. This would also explain the high number of unusable 2.4.x kernels. IMO 2.5.x should have been branched off after 2.4.7.

      Apart from that, nothing wrong with the current developments.

      --
      Most ACs are not even worth the keystrokes to insult them. Be generically insulted and ignored otherwise.
    3. Re:quality assurance by Anonymous Coward · · Score: 2, Insightful
      This is xah, posting anonymously to protect what little karma I have left.

      I have a response to those who have said that the open source QA process is to release early and let early adopters suffer the consequences. Are you sure? Are you saying this is good example of open source development? Are you saying this is the exemplar of the open source development process? This is a data loss bug.

      In the open source development process, it's not a problem if the new release of Mozilla has a small problem with frames in XHTML, or if the new Linux kernel breaks support for USB joysticks. These are problems that can be fixed.

      Some bugs are so serious, however, that they deserve extra attention. These are the "showstoppers." In every kernel release, Linux says something about not finding any showstoppers. That is, there are no data loss bugs or other serious bugs that he knows of. He wouldn't release it if he thought it had such a serious problem.

      All I am saying is have a process that can perform rudimentary checks on the kernel to pick up any showstoppers. This process would take a few hours or at most a day. It would prevent situations like this, where the Linux community opens itself up for attack by all the brainwashed Microsoft zealots. Is this really flamebait?

    4. Re:quality assurance by KagatoLNX · · Score: 2, Interesting

      Finally some good points. This probably shouldn't be a reply to this
      message but it's too late now.

      I would point out that this bug does not turn up readily.

      This bug allows a system to boot up normally, run fine, and then when you
      reboot (and only when you reboot) some files are missing if (and only
      if) their buffers were dirty when you rebooted.

      This is NOT easy to catch. The average Linux system has upwards of 50,000
      files. A few disappearing is not easy to notice. In addition, buffers
      tend to get flushed pretty well during the shutdown process, so it
      wouldn't show up too often either (I avoided on accident it due to a
      peculiar RAID shutdown script I have that sync's and sleep's for a bit).

      For the M$ zealots out there be careful to practice what you preach. One
      of the core arguements of the Slaves of the Empire is that the Linux
      zealots bash M$ but can't take criticism themselves. If you'll check your
      precious windowsupdate.microsoft.com on a fresh Win98 install, you'll find
      the IDE Hard Drive Cache Update. For the uninitiated, this patch fixes a
      problem where Windows doesn't write all of the data to disk on
      shutdown. Ironically, this tended to completely hose Win98 systems
      beyond fixing by Scandisk (usually registry damage).

      So, Win98 and Linux have similar problems. In a week, the Linux bug will
      be history, but the M$ one is still being minted on CD and requires an
      Internet download (because it's a "minor problem", the fix is to "wait
      before shut down so the data is written"). I don't remember too much
      babbling on Slashdot about that bug and it's been there for YEARS.

      Gosh, I guess I should write this off as being dribble by "Linux
      Bashers".

      Oh, and to completely trash my karma, I've had disk corruption in a stable
      FreeBSD due to a bug in FreeBSD code, so don't get too high on your own
      superiority yet. You've got older code--sometimes it's a strength,
      sometimes it's a weakness. Like the FreeBSD development process isn't
      ever rocky.

      --
      I think Mauve has the most RAM. --PHB (Dilbert Comic)
    5. Re:quality assurance by TheLink · · Score: 2

      Face it, Linus is screwing up big time with 2.4. Sure he's human, but he's got to do something to catch all these stupid bugs.

      2.4 is supposed to be a STABLE release. However so far it has not been anything like that so far - VM issues, symlink probs, etc etc etc.

      Doesn't look like there's any QA worth talking about. No regression testing.

      Heck even the oft-flamed Redhat is doing a better job releasing stable kernels.

      Yeah you've had disk corruption in FreeBSD, but overall it looks like Linux 2.4 standards are rather poor.

      Why use Windows 98 for comparison? Windows is the far low end of the scale. Linux would be crap if it is just slightly better than Windows. So you got to aim a LOT higher than that. Looking at Linus' 2.4 STABLE, you'd be able to call Windows 95 stable too.

      Linux has great potential - heck the Linux on a mainframe thing is great. But someone should come up with some decent regression tests so that we don't get STUPID problems like this for STABLE releases. I don't care if stupid stuff happens in dev releases.

      Very disappointing.

      --
    6. Re:quality assurance by KagatoLNX · · Score: 1

      Admittedly, he is kinda screwing the pooch. But it's not like he's breaking new ground.

      Certainly, a lot of really nasty bugs showed up after release (around 2.4.5). I'd have to say that his solution, while bumpy, will probably give us the best fix.

      I mean, did you actually look at some of the VM code? After a while it became quite clear that there were just layers of hacks being put on to fix it. It *NEEDED* Andrea's rewrite.

      That being said, while there were about two months of real ugliness, it seems that the worst is behind us.

      However, I do concede that there might need to be a bit more testing, but I really think that we're doing marvelously.

      A very good assumption, however, is that when Linus "releases a new kernel", it is not like a "new release of RedHat". He's always pretty much maintained that QA is up to distros. He's not releasing each one in the context that tomorrow the whole world will upgrade. Truthfully, they seem to be more a release to the developer community. Sometimes there are rough edges, but, in general, the people don't get hurt (I don't care what OS you run, if your data is important--you DO NOT upgrade on a whim).

      --
      I think Mauve has the most RAM. --PHB (Dilbert Comic)
  5. Does anyone know... by Griim · · Score: 4, Insightful

    ...how something like this could have creeped in, and be missed? Was it a last-minute change that just didn't have time for testing, or was it (bad)luck-of-the-draw that no one noticed it?

    1. Re:Does anyone know... by Colin+Bayer · · Score: 5, Informative

      This bug was introduced when the kernel coders were trying to fix a bug that existed earlier (but, AFAIK, didn't cause fs corruption). It was introduced in pre9, but the final kernel was released within a few hours, so I guess nobody caught it in time.

      --
      Want Linux games? HERE.
    2. Re:Does anyone know... by fishebulb · · Score: 1

      no, thats wrong. 2.2 is meant for mission critical. 2.4 will work, but it doesnt have the time tested "garuntee." Anyone running 2.4 as mission critical system had better have one hell of a good backup system in place. Not to say 2.4 is completely unstable and such, but for mission critical applications, you dont want to take those chances.

    3. Re:Does anyone know... by Dionysus · · Score: 1, Flamebait

      I can certainly agree with that if 2.4.x just came out (like in the 2.4.0-2.4.5 range). The thing is, y.[even].x was supposed to be the stable branch, not the experimental branch.

      Meaning you don't introduce stuff unless is it really tested. Something like the USB stuff before it was backported to 2.2 series.

      If 2.4 doesn't have the timetested guarantee, why should anyone believe 2.2 has it? Why not go with 2.0? Oh, wait, neither 2.0 nor 2.2 has some of the features that was introduced just for the enterprise...

      Stop making excuses for the 2.4 series. It's been really shoddy. Doesn't reflect too well...

      --
      Je ne parle pas francais.
    4. Re:Does anyone know... by stefanlasiewski · · Score: 2, Interesting

      Did the marketing people take over the Kernel release process?

      Why the rush between the pre9 and final versions? Why the lack of QA? Are the kernel developers rushing to meet a deadline or something?

      Alot of people complain that Open Source projects develop too slowly, and cite the slow pace of Mozilla and Gnome as an example.

      Pro-OSS folks say "That's a BENEFIT to the OSS model, we don't rush things through the door before they are ready, therefore there are less bugs in our released products.

      But here we are, with a product that was rush, and that was released with a serious bug.

      --
      "Can of worms? The can is open... the worms are everywhere."
    5. Re:Does anyone know... by Telek · · Score: 2

      obviously it was gross incompetance on the programmers part.

      Or at least that's what most people here say when much less serious problems happen in Microsoft software.

      But since it's Linux, it must just be bad luck dude!

      --

      If God gave us curiosity
    6. Re:Does anyone know... by dinivin · · Score: 2, Insightful

      Pro-OSS folks say "That's a BENEFIT to the OSS model, we don't rush things through the door before they are ready, therefore there are less bugs in our released products.

      On the contrary... Pro-OSS folks who know what they're talking about will say that one of the benefits of the OSS model is "release early, release often". They'd also point out that while really show stopping bugs will make their way in to a stable release (of whatever project we're talking about), just as they'll make their way into the stable releases of closed-source projects, with OSS software you're not forced to wait till some company finally decides to admit there's a problem and release a patch.

      Anyone who says that all releases of OSS software are inherently more stable and secure than closed source software is a moron. And anyone who says that all releases of closed source software are inherently more stable and secure than OSS software is also a moron.

      Dinivin

    7. Re:Does anyone know... by iabervon · · Score: 2

      That explains why pre9 was released with the bug, but not why the final version was released with it. There really needed to be a "release candidate" notice on things that could become final versions and then a QA pass, before something gets blessed as a "final" version.

      The advantage with OSS is that you get frequent releases, which enables you to keep up with development and test the upcoming version to see if it works for you. But that doesn't help much when the upcoming version gets changed and then released without testing.

      Of course, there is the other advantage: that the person responsible for a bad bug in a final version will actually spend the day after thanksgiving fixing it, so that this sort of accident gets fixed in a day or two.

  6. If you are already running it... by krogoth · · Score: 3, Funny

    I recomment turning your computer off with the power switch or by unplugging it, after you've made sure you can boot an older kernel. Since umounting is done when you shut down cleanly, you don't want to do that.

    --

    They that quote Benjamin Franklin on liberty and safety deserve neither.
    1. Re:If you are already running it... by xah · · Score: 1

      But first make sure you can boot into a different kernel next time, either from hard drive or from removable media such as floppy.

      --
      I am not a lawyer. Do not take my words as legal advice. If you need legal advice, consult an attorney.
    2. Re:If you are already running it... by PeterM+from+Berkeley · · Score: 4, Informative

      I wouldn't do what this guy says.
      You're pretty much guaranteed to corrupt your
      filesystem this way. Probably nothing fsck
      couldn't fix, but still.

      Other posters have suggested that you use
      "shutdown -F" after running "sync",
      and rebooting into a NON-2.4.15 kernel.

      "sync" will write all the unsaved data to
      the disk, and "shutdown -F" will cause
      an fsck to start after rebooting.

      PM

    3. Re:If you are already running it... by MAXOMENOS · · Score: 2

      What moron rated the parent "Informative?" If anything this should be rated "+1 Funny" or "-1 Troll." Aren't Slashdot readers supposed to be Linux-clueful?

    4. Re:If you are already running it... by afrojer · · Score: 1

      This worked just fine for me! I re-compiled 2.4.14 and used a "sync && shutdown -rF now" command. Back up and running just perfectly!

      That's the last time I try out a cutting edge kernel on a semi-production box :-)

      --
      -- Jeremy C. Andrus http://www.jeremya.com/
  7. Bad start by C0vardeAn0nim0 · · Score: 1

    for the brasilian guy, hum ?

    Well, this is not the first, and probably it won't be the last too, dangerous boog in the 2.4 series. IIRC (too lazy today to check) 2.4.11 is marked as "do not use" in the kernel mirrors.

    --
    What ? Me, worry ?
    1. Re:Bad start by alhague · · Score: 2, Informative

      > for the brasilian guy, hum ?

      Nope. 2.4.15 was released by Linus ...

      al

    2. Re:Bad start by Dr.Dubious+DDQ · · Score: 2
      > for the brasilian guy, hum ?

      Nope. 2.4.15 was released by Linus ...

      True, but it's still a "bad start", in the sense of "unpleasant", for the "brasilian guy", because his very first task involves the urgent need for a quick release of a bug-fixed 2.4.16...Imagine getting hired as a sysadmin and the very first morning you walk into the office 100 or so computers belonging to senior management all start propagating MS VB viruses amongst themselves and the rest of the system, crashing machines, emailing sensitive data to random people outside of the company, slowing network traffic to a crawl, etc. etc.....

      Talk about "trial by fire"....

  8. "QA" by Anonymous Coward · · Score: 2, Insightful

    The users are the QA (why do you think Linus moved to 2.4 so early? To get more testers). If you don't like being a guinea pig, then wait about a week before moving to the newest kernel. Seriously, 7 days isn't that long, and all show-stoppers will have shown up long before then.

    1. Re:"QA" by Anonymous Coward · · Score: 1, Informative

      You cannot compare Linux and FreeBSD that way. FreeBSD is a complete OS, not just the kernel.

      I've NEVER seen filesystem corruption caused by my distribution (Red Hat) kernel.

      Compare FreeBSD with the whole Red Hat Linux (probably the same for Debian, don't know about the others), and you'll see neither have this sort of problems.

    2. Re:"QA" by theLime · · Score: 1

      Yes and no....

      2.4.11 had BIG bad problems, but there was no "2.4.11 fixed"

      They just marked the file something like "2.4.11-DO-NOT-USE" and put out 2.4.12

  9. FS corruption? by be-fan · · Score: 2, Interesting

    Dude. I hate to say this, but Windows 2000, while it may crash more, doesn't hose you're filesystem nearly as often as Linux seems to these days. At what point do we get to start making the LinSux jokes?

    PS> Don't flame me please. I just wiped Win2K off my harddrive this morning. Luckily, I downloaded the 2.4.15 tree but have been too lazy to compile it yet.

    --
    A deep unwavering belief is a sure sign you're missing something...
    1. Re:FS corruption? by fishebulb · · Score: 1

      not a flame, but ive had more corruption on win2k. our server at work had a corrupt file that hosed backups. MS's response to getting rid of this file, format the drive. that may not count as corrupt filesystem, but it was quite a serious problem.

      Personally ive never had a corrupt filesystem (that wasnt fsck fixable) in linux using ext2, reiser, or xfs.

    2. Re:FS corruption? by be-fan · · Score: 2

      Neither have I (and I track the -pre kernels), I was just being facetious. It gives MS more ammo anyway, though.

      --
      A deep unwavering belief is a sure sign you're missing something...
    3. Re:FS corruption? by A_Non_Moose · · Score: 2, Insightful

      Not going to flame you, just trying to amuse.

      I thought the reason for installing *nix's was so you'd never have to shut down? Therefore this should not be a problem.

      Now does this occur during *any* unmounting operation? Manually vs Shutdown?

      Oh, and be-fan, don't install XP and use Ext3 (hey, that rhymes) because if XP uses your Ext3 as swap space and 2.4.15 corrupts itself...woah, double whammy.

      Hey, any chance of getting iTunes 2.0 on Linux and Windows? Wanna play Russian Roulette...with an Uzi?

      Whip me, beat me, make me write bad checks (or install windows...same same)

      --
      Have you read the moderator guidelines? Well, have you, PUNK? (and I want a Karma: Gnarly option)
    4. Re:FS corruption? by Jagasian · · Score: 5, Insightful

      Thats funny. I have been running Debian (stable) for a long time now, and I haven't had any filesystem corruption. In fact, I haven't had the OS crash either.

      Its better to compare Windows 2000 to another complete operating system, NOT a bleeding edge kernel. Compare Windows 2000 to Debian (stable), and Windows 2000 will look like a house of cards.

    5. Re:FS corruption? by vrmlknight · · Score: 1

      this release isnt exactly something i'd want to put on a production server im sure some of the alpha and/or internal releases of win2k or nt had problems too

      --
      This must be Thursday, I never could get the hang of Thursdays.
    6. Re:FS corruption? by SecretAsianMan · · Score: 1

      compare Windows 2000 to another complete operating system, NOT a bleeding edge kernel

      I think, though, that the stable Linux kernel should not be considered bleeding edge. It should be considered conservative and trustworthy. There are (should be?) unstable tracks/trees in which the developers can toy with bleeding-edge stuff, and those tracks/trees should be the only place where such experimentation is done. If the bleeding edge is not separated from the trustworthy, then we have no right to bitch when the industry shuns Linux-based solutions.

      I, for one, would love to see Linux and other free Unices offer real competiton against the status quo, but it won't happen until more intelligent software engineering is in place.

      --

      Washington, DC: It's like Hollywood for ugly people.

    7. Re:FS corruption? by Score+Whore · · Score: 1

      2.4 ain't alpha or internal.

    8. Re:FS corruption? by Lennie · · Score: 1

      Yes it is, don't use a kernel that's out before it's atleast a month old and a new stable not until after 6 months or so on a _missioncritical_ system.

      If you want a good system, good with Debian and there standard kernel, it works.

      --
      New things are always on the horizon
    9. Re:FS corruption? by Tassach · · Score: 2

      Someone mod parent up. Any experienced (and cautious) sysadmin knows that you don't put ANY new release or patch on a production system until it's been out in the wild for at least a few weeks. This applies for both Unix and Windows -- anyone remember when NT Service Pack 6 first came out? Even critical security patches need to be applied with caution.

      --
      Why is it that the proponents of "one nation under God" are so eager to get rid of "liberty and justice for all"?
    10. Re:FS corruption? by Score+Whore · · Score: 1

      Err, I wasn't talking about what a prudent sysadmin would do. I'm talking about the engineering processes of the linux kernel. Even releases are meant to be stable, feature complete, etc. No point in modding the parent up, that'd just be dorky. So there.

    11. Re:FS corruption? by Dahan · · Score: 2
      And how the hell do you know M$ doesn't fuck up your disk? Because their own tools tell you?

      No, because I access my files, and they aren't corrupted. I'd notice if one of my source code files turned into random binary garbage. Or if one of the DLLs on my system wouldn't load because it no longer had a valid PECOFF header. Or if my MP3s no longer played properly. Et cetera...

    12. Re:FS corruption? by SmittyTheBold · · Score: 1

      If Win2K looks like a house of cards, then Debian (stable) looks like...ah hell, something that alludes to ANCIENT TECH. Win2K has USB support. Has that been backported to (stable)? The best comparison to Windows 2000 (and now XP) is probably Debian (unstable). It is resonable stable, (much more so than (testing)) but has *some* current tech...

      --
      ± 29 dB
    13. Re:FS corruption? by psyfybre · · Score: 1

      You may complain all you like. the single benefit of this is that an application could well have been released from microsoft with a bug like this and not been discovered until it is on the shop floor. (the GPF that everyone gets with winword 97 perhaps!)

      ahhh winge all you like, at least this way a handful of people get a serious problem and 18 hours later there is a patch. and all the willing brain boxes get together and fix the problem instead of paying someone who doesnt want to code anymore and is only working to pay his bills.

      psy.

    14. Re:FS corruption? by CH-BuG · · Score: 1

      It reminds me of a nice virus that worked by scrambling your files one by one, but who also intercepted the read/write operations so that everything looked "fine" from the user point of view. Then one day, the virus decided to kill itself, and let the user with a bunch of scambled files (which were even carefully backuped long enough to let you with no working copy)
      :-}

    15. Re:FS corruption? by Tassach · · Score: 1

      My point, which you apparantly missed, is that even a so-called "stable" software release shouldn't be used on a mission-critical system until it's been subjected to a good bit of real-world testing. Bleeding edge software, be it free or propriatary, needs to prove itself before it can be trusted.

      --
      Why is it that the proponents of "one nation under God" are so eager to get rid of "liberty and justice for all"?
  10. Really... by J.C.B. · · Score: 2, Insightful

    Isn't the 2.4 branch supposed to be stable? You know, the one that doesn't eat your disk. I think that this kernel should have gotten a little more testing for bugs of the catastrophic nature before it was deemed fit for general consumption.

    1. Re:Really... by Colin+Bayer · · Score: 2, Interesting

      Just like so many things with Linux, you'd think that this would be true, but it isn't. ;) There are two trees that are in development at all times:

      The "stable" tree (which has an even minor version number), and the "development" tree (which has an odd minor).

      When kernel 2.2n.0 (n being a non-negative integer, in this case, 2) is released, development stops on 2.2n-1.x, and all newly-submitted-and-approved-by-Linus patches are applied to the 2.2n.x tree (because 2.2n-1.x is out of date). While 2.2n.x is still called the stable tree, it becomes the development tree (because some of the newly-patched stuff is untested, and there's no "development" tree to put it in). The "stable" role falls back to 2.2(n-1).0, in this case, the 2.2.x tree.

      As far as this goes, it was a stroke of bad luck and hurried timing that this bug wasn't ironed out in 2.4.15-pre9 before it went final (and somewhat of a stroke of stupidity on the parts of the early adopters, myself included).

      When 2.2n+1.0 is released, 2.2n continues development, making it the stable tree. Any fixes to bugs found in the 2.2n+1.x tree are back-merged to the current stable tree so that end-users can enjoy a stable, debugged kernel without riding the bleeding edge.

      The problem with the Linux kernel numbering system is that the "stable" tree is only stable when there's a "development" tree to test the uncharted waters for it... if there isn't one, it's best to stay back a few revisions unless you like running fsck. ;)

      --
      Want Linux games? HERE.
    2. Re:Really... by Anonymous Coward · · Score: 5, Insightful

      (Inven: r-r, * to see, ESC) Wear/Wield which item? r
      You are wielding a Rant Stick (1d2) (+0,+0) (*slay* kernel developer)(a).

      It's not so much that it wasn't stable enough when it was released, but rather that they keep messing with 2.4 instead of making a 2.5. I think maybe Linus had this idea (at the end of 2.3) that the developers could focus on fixing bugs and make 2.4 really great. Unfortunately, they're volunteer developers, so they're working on things that excite them, which means insane stuff like VM rewrites and other "hey, let's try this" changes.

      This is why I still use 2.2 and will until there has been a 2.5 for a while (so the developers have a place to try their unstable new ideas) and 2.4 has gone into "bug-fix" mode (like 2.2 is now). It's really annoying, because I want some of the new features of 2.4 (the ones introduced back in 2.3), but can't afford to have the thing crashing on me, and don't want to spend a long time looking for a stable 2.4.X.

      Maybe next time, Linus won't wait so long to introduce a development version, or will at least refuse anything but bugfixes in so-called "stable" branches. Still, despite my complaining, I am happy that people have gone through all the trouble to write the Linux kernel, and will try to remember that. :)

    3. Re:Really... by caerwyn · · Score: 1

      I've been using 2.4.x for some time now, without any issues. The trick? Give the new kernel a few days before using it. Not only will it give people a chance to find these errors, but you'll avoid trashing the kernel mirror sites, too.

      FWIW, 2.4.14 still seems to be running quite fine for me.. :)

      --
      The ringing of the division bell has begun... -PF
    4. Re:Really... by HorsePunchKid · · Score: 1

      It should really be a Rant Stick (1d2) (+2 Insightful,-1 Troll) of *Slay* Kernel Developer... No match for my Quarterstaff of Burning (1d9) (+1 Flamebait) (Blessed), though ;).

      --
      Steven N. Severinghaus
    5. Re:Really... by Miniluv · · Score: 1

      Be careful discussing the VM rewrite as experimental because its a somewhat more complex issue.
      The VM issue had been nagging since 2.4.x opened last January, and finally Arcangeli wrote something useable that works better than Van Riels design. There was much patching, tweaking and benchmarking of the two from 2.4.9 onward, until around 2.4.14 or .15 at which point Alan and Linus and the bulk of the mailing list agreed that AA's VM was better than Van Riel's.
      The 2.4.15 mistake was a mistake, and had mostly to do with Linus trying very hard to get 2.4.15 out ASAP in order to get 2.5.0 really underway.
      He was pretty blatant in telling people that there were some issues being worked out in the 2.4.x tree after the VM issue reared its head.
      It all boils down to not trusting the vendor, in this case Linus and co., implicitly, but instead doing your own quality assurance and sanity checking before deploying a new kernel anywhere important.

  11. Don't throw stones in Glass Houses by Anonymous Coward · · Score: 1, Insightful
    Hmm..wow this is a serious bug alright.

    Yet there's no snide commentary from the editors whenever something like this happens with Microsoft (M$ to all the haters) software.

    Maybe you zealots will realize that nobody is perfect, and open-source is not necessarily better than closed-source.

    This also makes a case for not announcing new kernels not slashdot (aka not freshmeat). Most people here are linux newbie wannabees so they're not the most qualified people to be running the latest and greatest kernels.

    1. Re:Don't throw stones in Glass Houses by fishebulb · · Score: 3, Interesting

      yes this is quite a serious bug, but 2 things set this apart from MS. It will be fixed within 24-48 hours. The frequency of these bugs are a bit smaller than MS's bug of the day (which very often are large holes).

    2. Re:Don't throw stones in Glass Houses by toast0 · · Score: 2

      how hard is it really to compile a kernel?

      download the source, read teh kernel-howto, go through the menu (or x config), make bzImage, etc

      then repeat as necessary to get it to boot properly (ide root drive, load ide driver as module is always a good combo :)

    3. Re:Don't throw stones in Glass Houses by Vincepb · · Score: 1

      Debian goes through more quality assurance then Mandrake, RedHat and SuSE combined.
      You do realise that Debian stable has been tried and tested over the course of *years*?

      I highly recommend you go and read up on Debian's release schedual and release criteria before you condemn it as an 'amateur distro'.

      http://www.debian.org

      Vince.

      --

      I need a sig.
    4. Re:Don't throw stones in Glass Houses by Telek · · Score: 2

      Please, I'm obviously not as smart as you are, so can you please give me a list of the "large" holes of Windows that happen on a daily basis? My memory is obviously failing me already as I don't remember very many at all. Certainly not more than 400 "large holes" since W2K was released.

      And odder still I do remember that every time that I have heard of a "major" flaw it was fixed very quickly, and then took a few days to go under the standard regression tests on all platforms and machines before it was publically released. If you were affected by one of these problems, you could get the "unsupported" patch as soon as it was developed, but before they could complete testing.

      You can't do complete testing of a patch in 24-48 hours and release it as public with support.

      Also, when a "serious" problem does come out, the relevant MS developers are told to work 18 hours a day 7 days a week until it's solved.

      It's one thing to say "hey, it looks like here's the problem, here we just corrected it and compiled it, that should do", and another completely to have performed all of the tests required to make sure that one small "fix" didn't corrupt something on some obscure hardware configuration that other major clients are using.

      You're all so quick to cut down Microsoft and defend Linux when worse problems happen. You'll also have to explain to me how this is not completely hypocritical, because the logic on that one eludes me as well.

      --

      If God gave us curiosity
    5. Re:Don't throw stones in Glass Houses by sg_oneill · · Score: 2

      Actually dude. That's really unfair. Slashdot for all it's zealotry keeps it's bias on it's sleave. You *know* Slashdot has it's bias, that's why your here right?

      Seriously, the professional astroturfers on this site love to whinge about how the slashdot sysops are anti-ms, but hey; they admit it don't they?

      Compare that to MS owned news and the fact it NEVER critisizes windows, but pretends to be unbiased (One would even believe it if one didn't know better)

      At the end of the day , it reminds me of a comment by Aust media theorist John Hartley that "Propaganda is more honest than news, because at least it admits it's bias".

      Think about it.

      --
      Excuse the Unicode crap in my posts. That's an apostrophe, and slashdot is busted.
  12. A Workaround by kanelephant · · Score: 4, Informative
    Al Viro gave this comment and workaround on lkml.
    Breakage happens when you umount filesystem (_any_ local filesystem, be it ext2, reiserfs, whatever) that still has dirty inodes.

    As a workaround - sync before umount (and don't boot unpatched 2.4.15/2.4.15-pre9 again, obviously).

    IOW, if you are running 2.4.15 - build a patched kernel, install it and do the following:
    * switch to single-user
    * sync
    * umount everything non-busy
    * remount the rest read-only
    * turn the thing off
    * boot with patched kernel or with anything before 2.4.15-pre9

    The filesystem corruption can be fixed by a forced fsck. (The fsck must be forced since the filesystem is marked clean.)
    1. Re:A Workaround by Peyna · · Score: 1

      fsck on a 30 gig drive for each unmount isn't my idea of fun.

      --
      What?
    2. Re:A Workaround by kanelephant · · Score: 3, Informative

      sorry I didnt make that clear. If you follow the above advice you should not get any filesystem corruption. The last line is what to do if you have already got a corrupt filesystem!

    3. Re:A Workaround by Anonymous Coward · · Score: 3, Interesting

      The strange thing is, out of habit (years ago, you always had to remember to sync on Unix, and due to a bug, you always had to sync more than once), I always sync, sync, sync, umount...

    4. Re:A Workaround by toast0 · · Score: 1

      yes, but if you compile a patched kernel, you'll only have to do it once.

      when was the last time you fscked your drive? maybe its prime time for you to do so (all sorts of things can cause fs corruption without marking the drive unclean... happens more often with poor/overstressed components, but it can happen anyhow)

    5. Re:A Workaround by Anonymous Coward · · Score: 1, Funny

      Yeah, I always get wierd looks from people, since (and this is deeply stupid), I've been known to type sync; sync; sync; umount /mnt/cdrom

      - which is completely stupid and pointless...

    6. Re:A Workaround by Jetson · · Score: 1
      Breakage happens when you umount filesystem (_any_ local filesystem, be it ext2, reiserfs, whatever) that still has dirty inodes.

      I didn't think ReiserFS had any inodes to get dirty. Aren't inodes a feature of ext2/ext3?

    7. Re:A Workaround by sconeu · · Score: 2

      Ah! An old-timer! Me too... I remember doing this on SysIII type machines!

      --
      General Relativity: Space-time tells matter where to go; Matter tells space-time what shape to be.
    8. Re:A Workaround by Score+Whore · · Score: 1
      (all sorts of things can cause fs corruption without marking the drive unclean... happens more often with poor/overstressed components, but it can happen anyhow)


      Yer kiddin' right? A fs should never be marked clean if there's any possibility that it's fuxored. It sounds like you're of the opinion that when an operation occurs on an fs that might make it "unclean" it is marked as such. However that's just plain wrong. The purpose of marking an fs clean is so that the system can look at it and see if whatever caused the last shutdown was something that would sanely put things away. You mark an fs clean when you are unmounting it and you know that you've flushed all the data, metadata and whatnot. Next time you mount it RW, you unmark it clean.
    9. Re:A Workaround by RFC959 · · Score: 1

      Hmm...I always thought that it wasn't due to a bug (after all, the same bug in every version of sync on every version of Unix?), but rather to the fact that writes might be going on at the same time as the sync. If you do 'time sync; time sync' on a busy system, you'll notice that the first one often takes a lot longer than the second. If you only type sync once, you run the risk of having things that /still/ haven't been written to disk because they were going on at the time you typed sync.

    10. Re:A Workaround by prog-guru · · Score: 1

      Anybody else thinking of adding this to their init.d scripts? On by Debian system, in /etc/init.d/umountfs, you can add "sync" at line 18, before it unmounts.

      Of course we should all build with the inode.c fix, but for the future, would syncing before umounting at every shutdown really hurt?

      --

      chris@xanadu:~$ whatis /.
      /.: nothing appropriate.

    11. Re:A Workaround by ViXX0r · · Score: 1

      No, he's right. If your drive is old/poor quality, and has bad sectors, data may get written perfectly from the OS's viewpoint, but if that data lands on a bad sector, it's going to be damaged and the partition SHOULD be marked unclean so that an fsck can try to sort things out next time.

      This isn't an oversight on the part of the kernel or it's developers, but a fault of hardware not doing it's job. This is what the author of that comment was referring to.

      --
      University - a box of academia nuts.
    12. Re:A Workaround by yugami · · Score: 1

      well, since i switched to reiser sometimes i just power down anyways ;)

      have fun w/ your fsck

    13. Re:A Workaround by aulendil · · Score: 1

      Umm, no...

      ReiserFS do also have inodes (as do most filesystems, I suppose). The difference is really that ext2/ext3 has a fixed number of inodes (initialized by mkfs) whereas ReiserFS has dynamicaly created inodes. IE created when there is need. Therefore a ReiserFS could _theoreticly_ hold an infinite number of files onm contrast to ext2 where the number of possible files is fixed by mkfs.

    14. Re:A Workaround by greenrd · · Score: 1
      How is typing sync twice any better? Couldn't there be unsaved data after the second sync has run?

      Surely the proper solution would have been to kill all possible processes before synching.

    15. Re:A Workaround by jamie · · Score: 1
      "How is typing sync twice any better?"

      According to The Unix-Haters Handbook, the original reason was that some older versions of "sync" required the operator to wait several seconds before the filesystem realized, yes, it really did have to flush those sectors to disk. Somehow typing "sync" three times became a magical incantation, with most admins unaware that the second two "syncs" only served as a pause, to slow down the typist so the first one had time to take effect.

      Superstition has preserved the incantation although all modern implementations of "sync" flush the data immediately.

      Kind of like how a lot of unix admins who got started "back in the day" will still stick a -print in their list of find arguments, even though most modern finds assume it by default. Habit...

    16. Re:A Workaround by Score+Whore · · Score: 1
      ...but if that data lands on a bad sector, it's going to be damaged and the partition SHOULD be marked unclean so that an fsck can try to sort things out next time.


      Err. Your comment there isn't too coherent. How exactly will a FS be marked "unclean" due to data landing on a bad sector that the OS can't detect? It's not correct behavior for any OS to ignore failed writes to media and hope that file system repair utilities will correct the problem later.
    17. Re:A Workaround by Dahan · · Score: 1
      Kind of like how a lot of unix admins who got started "back in the day" will still stick a -print in their list of find arguments, even though most modern finds assume it by default.

      Well, the implied -print isn't exactly the same as just sticking a -print on at the end... at least with BSD's find (I dunno about GNU),
      find . -some -list -of -things
      is equivalent to
      find . (-some -list -of -things) -print
      not
      find . -some -list -of -things -print

      So depending on what you want, the -print might be important. Real Life example:
      find . -name compile -prune -o -type f -print
      leave off the -print, and a directory named "compile" will show up in the output, when the intention is to only show files.

    18. Re:A Workaround by toast0 · · Score: 2

      His comment isn't very coherent, but there is still the possibility of data landing on a bad sector and the OS not detecting it. (One of my system has marginal ram, and this occasionally causes fs corruption, usually the os detects it, and has a cow (as well it should), but I'd imagine, on occasion it causes damage without the os detecting it, and an occasional fsck would catch that)

  13. Re:I know I'm going to get modded down by tollieman · · Score: 1

    Well a least it is more secure...

  14. the patch from the kernel list by MentlFlos · · Score: 4, Informative

    I hope /. dosent mangle this up too bad, but if it does:
    http://marc.theaimsgroup.com/?l=linux-kernel&m=100 658174003122&w=2

    List: linux-kernel
    Subject: Re: 2.4.15-pre9 breakage (inode.c)
    From: Linus Torvalds
    Date: 2001-11-24 5:55:42
    [Download message RAW]

    On Sat, 24 Nov 2001, Andrea Arcangeli wrote:
    >
    > --- 2.4.15pre9aa1/fs/inode.c.~1~ Thu Nov 22 20:48:23 2001
    > +++ 2.4.15pre9aa1/fs/inode.c Sat Nov 24 06:30:20 2001
    > @@ -1071,7 +1071,7 @@
    > if (inode->i_state != I_CLEAR)
    > BUG();
    > } else {
    > - if (!list_empty(&inode->i_hash) && sb && sb->s_root) {
    > + if (!list_empty(&inode->i_hash)) {
    > if (!(inode->i_state & (I_DIRTY|I_LOCK))) {
    > list_del(&inode->i_list);
    > list_add(&inode->i_list, &inode_unused);

    I have to say that I like this patch better myself - the added tests are
    not sensible, and just removing them seems to be the right thing.

    Linus

    1. Re:the patch from the kernel list by MentlFlos · · Score: 2
      of course something would go wrong.... that link is whacked out.

      I'm not sure where its going (vs where I was).

      I'm about to try and apply that patch and see what happens.

    2. Re:the patch from the kernel list by aussersterne · · Score: 2

      An alternate fix from Al Viro is here.

      I'm using it now.

      --
      STOP . AMERICA . NOW
  15. NO! by Anonymous Coward · · Score: 2, Informative

    This is a common misconception! 2.4 is *not* "stable"! It is "testing"! Well, now that it's split in two I suppose it can officially be called "stable" (what a bad start!), but I don't consider it stable (though I'm just a lowly AC). AFAIC, 2.2 = "stable" and 2.4 = "testing". In a month or so, things we'll change and we'll have 2.4 = "stable" and 2.5 = "experimental". Until 2.5 turns into 2.6/3.0, at which point it will be "testing", and the cycle continues :)

  16. Strange by imrdkl · · Score: 5, Insightful

    that a successful reboot of the system running the kernel is not in the regression suite. Does this error occur on every architecture?

    1. Re:Strange by Colin+Bayer · · Score: 2, Informative

      Does this error occur on every architecture?

      Yep... since the affected files are in fs/, not arch/*, it's an architecture-independent problem. Good thing I have the Magic SysRq enabled. ;)

      --
      Want Linux games? HERE.
    2. Re:Strange by Sleuth · · Score: 2, Funny

      Regression suite? What's that? Don't you have to pay for software to get one of them?

      But seriously, how much of a regression can be run if pre9 and release are only split by a few hours?

    3. Re:Strange by imrdkl · · Score: 1
      Linux is not an OS, Redhat and SUSE are

      Boy, was I confused. :{)

      Seriously, with two AC's on this thread, this seems a touchy question. It wasn't meant to be. I think most folks on this thread appreciate the fact that Linus took the Thanksgiving holiday to make a release.

      Then again, if Linu had a real boss, he would have told that employee to go home and eat turkey instead of releasing one. heh.

    4. Re:Strange by Erasmus+Darwin · · Score: 2
      "a successful reboot of the system running the kernel is not in the regression suite."

      Well, considering that a successful compile of all the modules isn't in the regression suite either, it's not exactly surprising.

      2.4.14 required a patch to get the loop-back device to compile. One of the 2.4.x releases a few before that had a problem with one of the sound modules (IIRC).

  17. this post is not a troll by xah · · Score: 1

    The author has very valid advice. Yet, some moderator marked this post a "troll." That's like saying Click and Clack on NPR are opponents of the automobile industry.

    --
    I am not a lawyer. Do not take my words as legal advice. If you need legal advice, consult an attorney.
  18. Re:to clarify by krogoth · · Score: 2

    You would still have to be careful until then - people who regularly mount and unmout read/write might want to be careful. I wonder if mounting read-only would help, or if the bug is below that level (from the discussion, it doesn't sound like it)?

    --

    They that quote Benjamin Franklin on liberty and safety deserve neither.
  19. This is why I use FreeBSD by cperciva · · Score: 4, Insightful

    Come on guys, nobody is going to take linux seriously as long as problems like this -- or the VM saga -- keep popping up in supposedly stable kernels. FreeBSD has no trouble keeping separate -CURRENT and -STABLE trees; why can't linux do the same?

    1. Re:This is why I use FreeBSD by dmelomed · · Score: 1

      Yes, Linux developement needs better QA akin to BSDs.

    2. Re:This is why I use FreeBSD by Colin+Bayer · · Score: 2

      Come on guys, nobody is going to take linux seriously as long as problems like this -- or the VM saga -- keep popping up in supposedly stable kernels.

      Read my earlier post on the subject. This is not a stable kernel, as there is no development tree to iron out all the bugs. If you ask me, anyone who upgraded to something as bleeding-edge and untested as this (myself included) deserves to get burned a little bit as a warning that you don't really need the newest kernel. ;)

      --
      Want Linux games? HERE.
    3. Re:This is why I use FreeBSD by Colin+Bayer · · Score: 1

      Eep... didn't mean to give myself +1. D'oh!

      --
      Want Linux games? HERE.
    4. Re:This is why I use FreeBSD by drsoran · · Score: 1

      Because this was a merge between a development and a stable tree. For some reason someone had the bright idea to make them equivalent. 2.4.14 however has been fine for me. 2.4.15 should have just been renamed 2.5.0 and left it at that. Then if it caused corruption, so be it.. it is development.

    5. Re:This is why I use FreeBSD by FattMattP · · Score: 3, Informative
      This is not a stable kernel, as there is no development tree to iron out all the bugs.
      Well, I disagree with you there. The way things have always been done, and the way we tell people that they are done is that x.<even#>.x is a stable kernel and x.<odd#>.x is a development kernel. Once you make that second number even, then it's interpreted by the whole community as stable, whether there's a development kernel or not, because that's what we've been taught and that's the way Linus has always done it. Continuing to put new features into the 2.4 tree rather than opening up 2.5 has led us to this unfortunate position. Hopefilly, in the future, the development tree will open as soon as the next major stable release is made and we can avoid things like this.
      --
      Prevent email address forgery. Publish SPF records for y
    6. Re:This is why I use FreeBSD by zmooc · · Score: 1

      This was not a merge; it was a regular update to the "stable" kernel which was then chosen as the base for the new development-branch. I sure hope they start 2.7 as soon as 2.6 comes out.

      --
      0x or or snor perron?!
    7. Re:This is why I use FreeBSD by mrjohnson · · Score: 1

      > You missed the point. Colin Baker was saying that while this is supposedly a stable series 2.4.x, it's been anything but stable.

      eh? It's been plenty stable for me. I've taken to accepting Red Hat's installation of 2.4.9 on the servers. It works fine for me... And the uptime keeps counting. :-)

      > 1. Kernel 2.4.0 was released much too early.

      Heh. It might have been. I remember several bugs with the early ones. But, as Linus said, it doesn't help much to have the same people testing the same kernels over and over again. Things were fixed quickly after the release to the public.

      > 2. There aren't enough eyeballs testing kernels, as there supposedly should be with OSS.

      Hm... This kernel was released yesterday. Today we have two (?) fixes for it, and articles on the subject from both /. and LT.

      Seems as if lots of eyeballs spotted this one.

    8. Re:This is why I use FreeBSD by chrysalis · · Score: 2

      Don't blame developpers. They are doing their best. But human people can't always be 100% right, and bug-free software doesn't exist. Sometimes you are pretty sure that your code is bug-free. 100 people have read it and found it ok. But just after releasing a new official version, a very vicious bug that nobody saw before is found.
      So what?
      Bugs aren't that bad. Found (and immediately fixed) bugs mean two things :

      - The project is active. No new bug means no new code.
      - The project is getting better.

      Usually, software with no known bug is dead software. Every piece of software has bugs. So if no bug is reported, it means that nobody uses the software, or that developpers don't care.
      Actually, I trust projects that have bugs, but whoose bugs are immediately fixed. I don't trust projects with bugs, that are waiting 6 months to release a new version that fixes 5000 bugs at once.

      You are saying that FreeBSD provides "real" bug-free releases. That's false.
      For instance, all kernels And when it comes to user tools, for instance, KDE doesn't compile from the port tree on FreeBSD 4.4-release.
      And when it comes to FS reliability : I have a FreeBSD 4.3-release box that crashed at the first run (the X server crashed), I had to reboot it by pressing the 'reset' button. It created disk errors that fsck was never able to fix. Doing 'ls' in a directory causes an immediate reboot. I tried every possible fsck option, fsck itself went boo-boo and it wasn't able to fix anything, and the directory can't even be deleted. I have to format the disk and reinstall everything.
      Every operating system, every software has bugs. The quality isn't relative to the number of bugs (it's almost a fixed percentage of the project's size) . It's relative to how fast they are fixed.


      --
      {{.sig}}
    9. Re:This is why I use FreeBSD by Shane · · Score: 2

      Are you honestly sitting here and telling us that this has never occured with FreeBSD?

      Comparing a FreeBSD-STABLE release to a Linux kernel release is silly, compare FreeBSD-STABLE to Debian-stable and then we have something to compare. That being the case, I would dare say that if you were to take all of the "releases" of Linux kernels and compare them to all of the releases of Freebsd-stable kernels you would find two things: a) That there are 10x more Linux kernel releases then there are FreeBSD ones in a given time period. b) That the percentage of serious bugs would be about the same. Which is saying something in Linux's favor seeing at how much more development is going on in that Camp (i.e. code being written).

      FreeBSD is a great OS, but it is nothing special or uniqe even in the OSS world no matter what its loyal users claim.

      --
      -- You can be a geeklord too :)
    10. Re:This is why I use FreeBSD by Arjuna+Theban · · Score: 1


      Agreed. Just last night I put 2.4.15 on a friend's computer to fix some stuff up for him. After a couple reboots (sync'ing before every reboot is a good thing) I noticed I needed the slack install cd which I didn't have with me, reverted to 2.2.19 and told him I'd fix it up later.

      Now what tf was I supposed to tell him if 2.4.15 corrupted his HD in the process? "Yea we say Linux is rock solid but the stable tree might fuck your partitions every once in a while"?

      I mean, come on, this is the tool for M$ FUD.

      ---

    11. Re:This is why I use FreeBSD by El_Nofx · · Score: 1

      Ohh, noone is going to take it seriously? THis is an EXPERIMENTAL kernal. It has not been thoughly tested and has bugs. That is why it has been released to be tested. Microsoft software has worse bugs and it has been beta tested and released. If you look at kernal 2.2x it is rock solid. You should be flamed for making such an idiotic comment.

      --
      It's not the OS it's the user that sucks. If it's user friendly, you get stupider people. - clinko
    12. Re:This is why I use FreeBSD by cperciva · · Score: 2

      eh? It's been plenty stable for me. I've taken to accepting Red Hat's installation of 2.4.9 on the servers. It works fine for me... And the uptime keeps counting. :-)

      Well, I guess 2.4.9 is fine, as long as you don't care about local root holes.

    13. Re:This is why I use FreeBSD by efgbr · · Score: 1

      Because FreeBSD is an operating system distribution and Linux is only the kernel of an operating system.

    14. Re:This is why I use FreeBSD by efgbr · · Score: 1

      I apologize for a second reply to the same post.

      Anyway, you should be comparing FreeBSD to a GNU/Linux distribution, not to the kernel. For example, Debian has stable, testing and unstable trees. Just like FreeBSD.

    15. Re:This is why I use FreeBSD by Just+Some+Guy · · Score: 2
      That there are 10x more Linux kernel releases then there are FreeBSD ones in a given time period.


      FreeBSD is developed and updated via CVS. You can have a new kernel version (whatever that means) every morning if you want it. Yes, both -STABLE and -CURRENT are tracked like this.
      --
      Dewey, what part of this looks like authorities should be involved?
    16. Re:This is why I use FreeBSD by Cro+Magnon · · Score: 1

      I belive RH patched their 2.4.9 kernel so it doesn't have that hole.

      --
      Slow down, cowboy! It has been 4 hours since you last posted. You must wait another few hours.
    17. Re:This is why I use FreeBSD by Dahan · · Score: 2
      Come on guys, nobody is going to take linux seriously as long as problems like this -- or the VM saga -- keep popping up in supposedly stable kernels. FreeBSD has no trouble keeping separate -CURRENT and -STABLE trees; why can't linux do the same?

      'cuz Linus hates CVS, even though it'd work just fine? (Sure, it's not perfect, but it's much better than the current situation of not using any revision control).

    18. Re:This is why I use FreeBSD by Baki · · Score: 2

      This strange even-uneven scheme means that for a while (since 2.4.0 arrived) no development kernel exists. The unavoidable result is that development goes into the "stable" kernel.

      Also, when stable and development exist, all fixes from development to stable have to be made by hand, without the aid of a decent version control system.

      All this because Linus thinks that CVS (which serves huge open source projects quite well) somehow doesn't fit him, and he has been waiting for years now for the ultimate version-control tool to arrive (bitkeeper). In the meantime it is apparently better to use nothing. I have my doubts.

      This difficult management of the two branches is also evident from the huge time it took between 1.2 and 2.0, 2.0 and 2.2 and now between 2.2 and 2.4 (every time Linus promised that the next time between two stable releases should not take so long, but every time it failed). This creates so much and long during stagnation of "stable" that adding new functionality to stable becomes almost unavoidable (making stable quite unstable at times).

      Most projects based on CVS create new stable versions (aided by CVS) much faster, leaving less discrepance between development and stable which makes catching up much easier and safer.

    19. Re:This is why I use FreeBSD by nick-less · · Score: 1


      This is not a stable kernel, as there is no development tree to iron out all the bugs.


      Thats the problem, after 2.4.0 was out, development should have been stopped and 2.5.0 should have been forked off for new features. If done like this, we would have 2.4.x with for example the old vm but vastly stable and a 2.5.x with a serios filesystem bug, but that would'nt be a problem since 2.5.x is a development three by number not by declaration.

      I know it's hard to deny the guys shouting "Just these 20 more lines, and it will also walk the dog...", but maybe it'll be better for 2.6.0 ;-)

  20. Ok so Apple isn't the only one to screw up by geek · · Score: 1, Insightful

    In your face! I sat here and read all the flames to apple about the iTunes screw up, and here we are with one just as big and glaring from the kernel developers themselves.

    Hypocrites!!!!!!!

    1. Re:Ok so Apple isn't the only one to screw up by Colin+Bayer · · Score: 1

      The iTunes bug was a broken shell script; this is a bug in some obscure, not-very-often-modified C code in the core of the Linux kernel... there's a big difference.

      --
      Want Linux games? HERE.
    2. Re:Ok so Apple isn't the only one to screw up by amccall · · Score: 3

      Maybe, just maybe, that's because the iTunes player was an end-user product, and the kernel source is intended for adventerous users, developers, and distributions. If the default RedHat kernel of a stable RedHat release had a FileSystem corruption error, that would be something to write home about - this isn't.

      --
      ------ 24.5% slashdot pure
    3. Re:Ok so Apple isn't the only one to screw up by KagatoLNX · · Score: 1

      Is it slightly more permissible for an OS upgrade to trash a filesystem
      than, say, a piece of multimedia software, no?

      I mean, this problem just requires a check of a the disk (something I
      believe Win9x does at the drop of a hat). iTunes nuked files and it
      didn't even have any business with the file access beyond reading them.

      In conclusion, all generalizations are bad.

      --
      I think Mauve has the most RAM. --PHB (Dilbert Comic)
    4. Re:Ok so Apple isn't the only one to screw up by Lurker · · Score: 1
      The iTunes bug was a broken shell script; this is a bug in some obscure, not-very-often-modified C code in the core of the Linux kernel... there's a big difference.

      You bet there was a difference...the Apple bug was only triggered under very specific circumstances (involving not following explicit directions to delete the old version first) while the kernel bug would have affected every single user who installed the kernel and did a shutdown/reboot. Which sounds more severe to you? Not that either of them are OK. Both groups acted quickly to fix the bugs, which is good.

  21. actually no there isn't by geek · · Score: 1

    FS corruption is FS corruption, you can't justify it at all, especially since this is supposed to be a stable kernel. It can't be all that obscure if it's changed enough since the last revision to do damage now could it?

    1. Re:actually no there isn't by Colin+Bayer · · Score: 1

      FS corruption is FS corruption, you can't justify it at all, especially since this is supposed to be a stable kernel.

      You shouldn't expect stability from the latest version of *anything* if it hasn't been out a day yet.

      There is a difference, too; the iTunes bug resulted from bad shell script written for the installer on an in-house program written for their own flagship OS by a large, trusted company. This is a bug in a part of the kernel maintained by a small group of people working without pay for the project.

      It can't be all that obscure if it's changed enough since the last revision to do damage now could it?

      Uh... the changes were to fix a long-standing filesystem bug (albeit, one that didn't cause FS corruption). Granted, Linus should have tested the kernel first, but he was in a hurry to get 2.4.15/2.5.0 out.

      --
      Want Linux games? HERE.
  22. Yeah by Evro · · Score: 1

    Welcome to "why not to grab every new piece of software as soon as it's released". Other examples include Apple's iTunes 2 and MS Windows 98 (first edition). Also works for hardware, and heck, even cars (first year of a new model is usually riddled with problems).

    --
    rooooar
  23. How can this be avoided in the future? by imrdkl · · Score: 3, Insightful

    Can someone give a joe-user guide to helping test new kernels?

    1. Re:How can this be avoided in the future? by Colin+Bayer · · Score: 1

      Can someone give a joe-user guide to helping test new kernels?

      /me stretches, getting ready for some extended typing...

      "Don't."

      You shouldn't really be testing new kernels unless you know what you're doing and you eat kernel compiles for breakfast (they go great with strawberries and low-fat milk). If you've mastered ksymoops, gdb (and its brother, kdb), and recovering a borken kernel, then have at it... just don't come complaining to me when your kernel panics. :)

      --
      Want Linux games? HERE.
    2. Re:How can this be avoided in the future? by GigsVT · · Score: 2

      http://linuxquality.sunsite.dk/articles/testsuites /

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
  24. The discussion isn't over by Carnage4Life · · Score: 4, Informative

    The last post in that thread is this one by Andrea Arcangeli sometime this morning and from the looks of things (if you read the entire thread) there is conflict between Alexander Viro and Andrea on which is the better solution.

    Linus saying he prefers a patch on an initial viewing isn't the end of the situation for now. I'd suggesting waiting a week and revisiting the thread to find out what the final word was.

    1. Re:The discussion isn't over by Lord+Omlette · · Score: 1

      No room for credit in your sig?

      --
      [o]_O
    2. Re:The discussion isn't over by krogoth · · Score: 2

      The bug was originally caused by a change to the filesystem code, and the AA patch undoes this change, but there is also a discussion on whether the change should be made.

      --

      They that quote Benjamin Franklin on liberty and safety deserve neither.
    3. Re:The discussion isn't over by |<amikaze · · Score: 1

      Strangely, I just downloaded the 2.4.15 kernel from ftp://ftp.ca.kernel.org, and it already appears to have the Linus-blessed patch applied to it... Very very strange

  25. Sad by Platinum+Dragon · · Score: 1, Troll

    I have to wonder, with all the bizarre bugs that have been creeping into "stable" kernels... are they even being tested before release, or is Linus just slapping on some patches and putting out a new kernel as 2.4. instead of 2.4.-prewhatever?

    The latter would seem to indicate frustration and burnout on his part.

    --

    Someday, you're going to die. Get over it.
  26. What I don't get ... by pauljlucas · · Score: 3, Insightful

    ... is why there seems to exist this rampant tendency among Linux-folk to upgrade one's kernel constantly. Unless a new kernel solves a problem you have, there is no reason to upgrade.

    --
    If you reply, do so only to what I explicitly wrote. If I didn't write it, don't assume or infer it.
    1. Re:What I don't get ... by rseuhs · · Score: 1

      There are so many prejudices about "linux-folk" it's no funny anymore. One day "Linux-folk" are guys who keep all their computers running without reboot for years just for the heck of it, the next day "Linux-folk" are updating the kernel dayly.

    2. Re:What I don't get ... by pauljlucas · · Score: 1
      There are so many prejudices about "linux-folk" it's no funny anymore.
      I think "stereotypes" was the word you were looking for.
      --
      If you reply, do so only to what I explicitly wrote. If I didn't write it, don't assume or infer it.
    3. Re:What I don't get ... by CentrX · · Score: 1

      No they don't. If there's a security fix, it would say "blah blah security fix blah blah, not being specific here because of DMCA".

      --

      "The price of freedom is eternal vigilance." - Thomas Jefferson
    4. Re:What I don't get ... by krorvik · · Score: 1

      Apart from the fact that it's *fun*, you mean?

      Seriously, the more people running the latest kernel, the better - how long would it have taken for this bug to be found if there weren't thousands of people testing it as soon as 2.4.15 was released?

      As for production use, most people are smart enough to not run the most recent kernel releases on critical systems.

  27. Make sure you have removed ext3 option, too by willamowius · · Score: 2, Informative

    For those who have tried ext3 in 2.4.15:
    Make sure you have reset the journaling flag on your filesystems, because your older kernel will not mount an unclean ext3 volume.

    Do a "tune2fs -O ^has_journal /dev/whatever".

  28. How Extensive Is This? by grahamkg · · Score: 2, Interesting

    I had an fs corruption with RH 7.2, using the kernel that came with the distro. It trashed the geometry of an entire drive. I was using a combo of ext2 and ext3 on the drive. I didn't lose anything, as I backup my system regularly.

    I've since migrated to Mandrake 8.1, which is much more solid than RH 7.2. Yet, it too runs a 2.4 kernel variant. This distro on one boot failed to recognize the ext3 partitions. I migrated all of the ext3 partitions back to ext2.

    I'd be very interested in learning if this is a problem that extends far back into the kernel tree.

    --
    Graham
    Linux - Fast Pane Relief
    1. Re:How Extensive Is This? by jjshoe · · Score: 1

      more then likely the stock kernel with mandrake didnt have ext3 support built in, run mandrake and rebuild a kernel with ext3 support and im certain you wont have any problems

      --
      -- botsex is {grep;touch;strip;unzip;head;mount} /dev/girl -t {wet;fsck;fsck;yes;yes;yes;umount} {/de
    2. Re:How Extensive Is This? by ViXX0r · · Score: 1

      Someone please correct me if I'm wrong, but how can an OS trash the geometry of a drive? The geometry of a drive is something fixed in hardware - a drive has 8 heads, 1024 cylinders, etc... software can't change that.

      Of course there are ways of interpreting the geometry differently (BIOS HD autodetects often report different geometry than what's on the drive case) but again, this should be out of the OS's hands.

      --
      University - a box of academia nuts.
  29. What do I do now? by bluelarva · · Score: 1

    I compiled and ran 2.4.15 for few hours and now I'm back to 2.4.14. As for me it appears my file system is intact. At least I don't think I did. How do I know for sure my files won't disappear on me? What kind of error messages would I see if my file system is corrupt? How do I correct it?

  30. Quality testing by Cyclone66 · · Score: 1

    Ok, no if this is still in "testing" forgive me.

    Should this darn thing be tested before it's released as "final"! I mean just a few weeks ago you guys were bashing Apple for their iTunes install that wrecked the hard drive, and now you're just coming up with solutions. How bout complaints? How about "This code should never have been released with such a serious bug". Again if this is test code then fine, it comes with the territory. Even if it's "Implied" test code, that's not good enough.

    1. Re:Quality testing by rtaylor · · Score: 2

      There is a big difference between this and iTunes. iTunes affected ONLY those with spaces in their Disk Labels. This affects everyone on linux with moderate disk writes (probably won't damage an idle computer).

      Similar response times. I'd classify this issue way worse ESPECIALLY since it should have gone through standard testing.

      --
      Rod Taylor
  31. Patch download here by DeeKayWon · · Score: 4, Informative

    The mailing list converted tabs into spaces, causing patch to choke. Get the patch here.

    1. Re:Patch download here by tijnbraun · · Score: 1

      Has anyone tries this patch already... would like to know.
      I was just 1 step away from "make install"...

    2. Re:Patch download here by buserror · · Score: 1

      You can also read the patch, it's a oneliner after all.
      $ vi fs/inode.c
      :1047
      delete the " && sb && sb->s_root" on that line
      :wq!
      $ make bzImage

    3. Re:Patch download here by Skuto · · Score: 2

      Or just run patch with the -l option

      patch -p1 -l virofix

      --
      GCP

  32. Things are working right not wrong: by amccall · · Score: 5, Insightful
    I've already seen 2 posts refering to "QA" and keeping the kernel stable, etc... If you are going to try the latest version of each package that comes out, you are going to get burned.

    This is one reason why distributions are so important. They do the QA, they make sure packages are stable, they apply the patches. If you want to download and run the latest edition of every package out, including the kernel, then you should expect some bumps in the road, because you are beta testing - even on a "stable" kernel series. Remember: release early, release often. You will have to do the QA, you will have to apply the patches, you will be burned. Some people like doing this to stay on the bleeding edge, others are a bit more cautious.

    If you want stable, solid kernels, that are heavily QA'd wait for packages to come out. Otherwise, post a bug report, and quit whining.

    --
    ------ 24.5% slashdot pure
    1. Re:Things are working right not wrong: by amccall · · Score: 4, Insightful
      I say linux is an OS NOT a distro and the OS had a bloody problem with something that was declared stable. Waiting for distros is not an option for people who role their own because of whatever special requirements they need. Wow you run debian good for you, not everyone has that luxery.


      Actually, I do "roll-my-own" and maintain a Linux distribution.I was not burned by this, because like other people "rolling their own/maintaining a distro" I do keep track of LKM posts.


      Anyone else doing this type of work, will hopefully learn from this - and NOT install the latest kernel the day after it's out. This type of thing has happened in EVERY series of stable kernels I can remember. And it will happen again.

      --
      ------ 24.5% slashdot pure
    2. Re:Things are working right not wrong: by Codifex+Maximus · · Score: 2

      I think you both have a point.

      Point #1: We have had a "bloody problem" with something that was declared stable. If it's declared stable then it should not have a bunch of new bells and whistles put in it to fail. New bells and whistles should be put in the next point release.

      Point #2: If you rush out and get the "latest thing" and "roll-your-own" then you need to expect running into instability even in software declared stable.

      I understand that Linus is used to working on development kernels. As such, he is used to working on stuff that isn't necessarily stable yet. Linus has been updating the "stable" tree up to and including 2.4.15 - I consider such kernels to be post 2.3.x kernels - with a foot still in the development tree. After all, he (Linus) hadn't yet turned the tree over to official maintainers so some addition tweaking should've been expected. Maybe the 2.4.x kernel went a little too far into the stable kernel revision with major development still going on... should he have waited in releasing 2.4 and kept it as 2.3? It's open to debate. All I know is that I'm using 2.4.14 and I'm damn happy with it.

      I'll also put in my vote on the debate of should a stable kernel get new major changes in systems. My vote is NO - it should only get tested bug-fixes for existing functionality - no matter who does the patching and maintaining. It only good engineering practice... isn't that what we're all about?

      --
      Codifex Maximus ~ In search of... a shorter sig.
    3. Re:Things are working right not wrong: by warpeightbot · · Score: 2
      Actually, I do "roll-my-own" and maintain a Linux distribution.I was not burned by this, because like other people "rolling their own/maintaining a distro" I do keep track of LKM posts.

      Anyone else doing this type of work, will hopefully learn from this - and NOT install the latest kernel the day after it's out.

      Actually, I simply keep an eye on Slashdot; if there's anything seriously wrong with a given rev, it will show up here... Funny thing, I was tinkering with 2.4.13 last night, and considered snagging .15 this morning, and I wake up to this.... heh.

      But you're right, it's a Good Idea to stay about a week or two behind the Leading Bleeding Edge... and that's true for ANYTHING, Microsoft, Linux Kernel, Cisco, or even cars.... remember how the first few years of the Taurus were real lemons? <shrug> Every once in a while this happens. It doesn't bother me. Bugs happen. They also get fixed. A lot faster than You-Know-Who...

      --
      Be sure you're right. Then go ahead.
      -- Davy Crockett

    4. Re:Things are working right not wrong: by iabervon · · Score: 2

      You should risk getting burned on the "pre" versions. That's why they're "pre" versions. There's no reason to rush out a "final" version, especially after even a small change. The reason the "pre" versions were there was so that people could do QA on them. The right move would have been to fork 2.5.0 off of pre9, and just leave pre9 around for a bit. This bug would have turned up in a few days, gotten fixed, and then 2.5.0 would get patched, pre10 would come out, that would sit around a bit, and, if the patch is good, it would become final.

      Nothing should change between the last release candidate and the final version, except the version number, and the last release candidate should stay a release candidate until the QA has been done. That's why there are final versions.

  33. See? by SuiteSisterMary · · Score: 4, Insightful

    If only this was Open Source Software, the source code could have been examined by thousands of highly motivated and intelligent hackers, who would have noticed the problem immediately. Wait....

    --
    Vintage computer games and RPG books available. Email me if you're interested.
    1. Re:See? by Vardamir · · Score: 1

      Funny thing, I compiled the kernel (with the preempt ion patch) and realized I had only selected reiserfs as a module so I could not mount my root file system, so I rebooted and started recompiling and went to read slashdot and saw this ....

      Personally, I'm thankful the bug was caught right away! One and a half days isn't bad. Although, I agree with some others that perhaps a constant development tree and stable tree may be a good thing - but who am I to say?

    2. Re:See? by SuiteSisterMary · · Score: 2

      No, actually, people lost data. Oh, and my post wasn't a troll, it was a shout of 'the emperor has no clothes!' which will, of course, lead to me getting lynched by the imperial court.

      --
      Vintage computer games and RPG books available. Email me if you're interested.
    3. Re:See? by Telek · · Score: 1

      I love how someone moderates you as "Troll".

      How was that trolling?

      Oh I see, it wasn't anti-ms and pro-linux so thus it's trolling.

      Too bad they're all hypocritical. IF this was the other way around, everyone here would be yelling up and down how it was obviously due to gross incometance in Microsoft that caused these problems to go unchecked (when the MS problems are never of this severity), but since it's linux it is obviously just bad luck, and just about nobody here goes to say any disperaging remarks about the linux programmers.

      And this, IMHO, is one of the major reasons that linux won't be a major desktop OS anytime soon. The people who use it have turned it into a religon. It's all about how MS is the big bad wolf and by using Linux somehow they'll have less to repent for when they die. It does Linux no good to have their supporters running around immaturely saying how "MS SUX!" all over the place, to the rest of the world they're just hurting Linux's image. This isn't all users, of course, but enough that it gets noticed.

      --

      If God gave us curiosity
    4. Re: See? by SuiteSisterMary · · Score: 2
      They didn't loss data. All data will be recovered after a manual fsck.
      Bloody lucky, too, but still completely unacceptable in ANYTHING that wants to be ANYWHERE NEAR a server.
      --
      Vintage computer games and RPG books available. Email me if you're interested.
    5. Re: See? by Cro+Magnon · · Score: 1

      Anybody running a server shouldn't be using yesterday's kernel. On my desktop at home I use the latest and greatest just for the heck of it. I was just lucky that I didn't get bitten by this bug, but if I had, I knew my backups were current. OTOH, if I were using Linux on the job, I'd be using 2.2.20 unless I NEEDED 2.4. I'd never use yesterday's kernel on anything but my personal box!

      --
      Slow down, cowboy! It has been 4 hours since you last posted. You must wait another few hours.
    6. Re:See? by AbsoluteRelativity · · Score: 1

      You should read the posts point count types, not the *last scored type*. As of right now she has only 1 troll, 1 insightful, and 5 funny, and 1 overrated, for a total of 8. So why waste time focusing on the 1 troll? I mean do you expect someone to get a perfect score or something? And 1 troll by far does not represent "everyone here". So just sit back, relax, and simmer down to a slow boil, before you explode.

      Also I dont see the 'religious' few Microsoft users preventing Microsoft from being "a major desktop OS". And Linux is not about how bad Microsoft is, its more about a business model that allows freedom. You want to really know what makes Microsoft bad, its not linux, its Code Red, Nimda, etc, and these are not necesarily microsoft incompetence, more so then probability. The probability that the more simple minded users you attract the less respect they have for the under lying technology. IMO most of the people who waste their time hating microsoft, are microsoft users, not linux users.

      --
      disclaimer : My views do not represent those of every one else in slashdot.
  34. 2.4.15-greased-turkey-donteat by bi11 · · Score: 2, Funny

    It's rotted.

  35. No problems for me by Peale · · Score: 1

    Nope, no sir. I'm sticking with 2.4.11.

  36. Re:The discussion isn't over [OT] by Carnage4Life · · Score: 1

    No room for credit in your sig?

    Actually, no there wasn't. The 120 char limit on sigs is a pain. I tried shortening the quote a little bit all I ended up being able to do was get as far as "Lord Oml" before running out of room. If it bothers you that much I can change my sig.

    PS:Moderators don't bother modding this thread down as offtopic, that's what the [OT] in the subject is for. Instead go find something insightful or informative to mod up instead.

  37. irony by FlyingDragon · · Score: 4, Funny
    From yesterday's discussion:

    > So who else is downloading 2.5 (Score:5, Funny)
    > by Chuck Chunder on Friday November 23, @02:23AM
    >
    > so they can be cool and trendy and be on the development tree while it's still stable?
    >
    > The Great Chunder Page - Alcohol Induced Fun!

    If you didn't think it was funny before, admit it -- it's pretty damn funny now.

    1. Re:irony by KjetilK · · Score: 1

      Hey! I spent a lot of time compiling that kernel! You don't know how hard it is being a newbie getting Linux for first time, trying to live up to the social pressure! ;-)

      --
      Employee of Inrupt, Project Release Manager and Community Manager for Solid
    2. Re:irony by Skuggan · · Score: 1

      >> So who else is downloading 2.5 (Score:5, Funny)

      >If you didn't think it was funny before, admit it -- it's pretty damn funny now.

      What about the moderator that already did know about this bug...

      --
      http://www.millnet.se/ GO/U d- s+:+ a C++ UL++++ P- L+++ E W+++ N+ w++ M-- PE+ t+ X++
    3. Re:irony by ddent · · Score: 1

      even funnier?

      Someone said they wouldn't because they would just have to downgrade again for a fixed bug. hmm... :)

    4. Re:irony by Cro+Magnon · · Score: 1

      Yeah, I thought 2.5 would be as stable as 2.4.15. Instead 2.4.15 as as flaky as a 2.5 version!

      --
      Slow down, cowboy! It has been 4 hours since you last posted. You must wait another few hours.
  38. grre by Anonymous Coward · · Score: 1, Interesting

    Grrr... Definitely time for a three-pronged development. STABLE, TESTING, DEVEL, trees, please!

    And would it kill them to use a versioning system ??? - CVS is great, but there's other options, too...

    This sort of thing is NOT good, in a supposedly "stable" tree, and gives MS, BSD, and proprietary Unix folk lots of Ammo against Linux.

    To be fair, people adminning production boxes should be using the kernels from their Distro's site, since, really, it's the distro that is the OS, and RH, SuSe, and Mandrake all heavily regression-test and fine tune their kernel, it's not necessary to be on the bleeding-edge.

  39. Re:to clarify by macinslak · · Score: 1

    To avoid corruption on unmount, simply type

    # sync

    before you unmount the filesystem, as the bug only affects dirty inodes(so mounting ro would fix it). Also, if you can't or don't want to physically pull the plug, you could always do a

    # telinit S
    #
    # umount
    # sync
    # reboot

    There was a mail that read something like this on LKML, but I'm having trouble finding it. You might want to find it if you're thinking about shutting down that way, just in case I forgot something horribly stupid.

  40. Re:to clarify by macinslak · · Score: 2, Informative

    It seems the second set of commands got mangled, sorry:

    telinit S
    kill everything but your shell
    sync
    unmount everything but root
    sync
    reboot

  41. Erroneous comment - not a long standing bug by Rares+Marian · · Score: 1

    Bugs happen and now you're criticizing people who go it fixed in a very short time. How about being fair and telling it to Microsoft when they fuck up and don't fix for weeks?

    --
    The message on the other side of this sig is false.
  42. hooray for 56k by RestiffBard · · Score: 2, Troll

    for once I'm glad i have 56k and decided against downloading the new kernel just yet. for all those bitching cause their system got hosed. well what did you expect? thats why you wait for the next post on slashdot saying somethings wrong with the new kernel. besides what about 2.4.15 was so necessary that you had to have the latest incremental kernel? I'm rather happy with 2.4.8. unless you're a developer/bug-tester/bleed-freak what reason do you have to upgrade to the very latest kernel?

    --
    - /* dead coders leave no comments */
    1. Re:hooray for 56k by noperoblimo · · Score: 1

      bzip2'd patches are small enough for a 56k modem, why would you want to download the whole tarball?

      Slightly more on topic, I'm still sticking with 2.2.20.

    2. Re:hooray for 56k by fredlwm · · Score: 1

      Yes, right, but I download both. The patch to tmpfs and the tarball to /home/ftp/pub/src/Linux/kernel/...

      --
      How to contact me - http://www.pervalidus.net/contact.html
  43. The question begs to be asked... by imrdkl · · Score: 1
    Is there a correlation between the "shaking out" of the commercial Linux companies, and the quality of the kernel releases?

    Weren't there more testers to catch this sort of thing before?

  44. ReiserFS; kernel testing by j3110 · · Score: 1

    I think that ReiserFS might be able to cope with it. The transaction log is not stored in/as inodes, therefore the transaction log replay should fix the B*Tree of data if it's broken. I'm not sure of other journalling filesystems... I'm glad I use raw partitions for my MySQL/InnoDB databases :) I could be wrong about the ReiserFS thing, so if anyone has any info to the contrary, it would be much appreciated. It only seems logical that FS's that were written for the possibility that Inodes wouldn't be flushed would be just fine.

    BTW... has anyone ever seen NTFS corrupt. I've seen it happen to many before. It's not pretty... you can't recover a single file. Still its better than FAT (with all the corruption there, I wonder why it hasn't formed it's own lifeform from the randomness) Still... this is no excuse for not testing software/major changes in a stable tree.

    Those who say that no one should have downloaded the new kernel just aren't thinking. We all should for this particular reason. We need to find the bugs fast and keep our beloved kernel developers on their toes :) I'ld rather know now than when I actually need to run the kernel.

    --
    Karma Clown
  45. Fun with Version Numbers by hearingaid · · Score: 2

    So, will we start seeing -post releases?

    Heh. I can see it now. 2.4.15-post1 :)

    --

    my old sig used to be funny, but then slashcode ate it and now it's not funny anymore

    1. Re:Fun with Version Numbers by serial+frame · · Score: 1

      But you can't have a 2.4.15-post without a 2.4.15-firstpost!

      --

      -
      And the Angel said unto me, "These are the cries of the carrots! The cries of the carrots!"
  46. NT ok, Win2k fixes NTFS errors pretty well. by Otis_INF · · Score: 2, Informative

    As an owner of a lovely IBM 75GXP hdd, I can say Win2k fixes corrupted files on NTFS pretty well. NT4 is perhaps a different ballgame, there you have the chance to indeed get stuck with files which are not recoverable at all.

    --
    Never underestimate the relief of true separation of Religion and State.
    1. Re:NT ok, Win2k fixes NTFS errors pretty well. by j3110 · · Score: 1

      Yes, Win2k does very well at fixing problems. So does fsck, but that's a whole other issue. NTFS on Win2k is mighty good, but it's hosed my system twice. I thought it was my fault until I had a couple of friends go through the FS loss.. Not just files, the whole system. Win2k is excellent at fixing errors on the system if you can make it that far. Ext2 hasn't ever corrupted to the point that I couldn't fix it with fsck to get my system back up. I've had to reinstall windows a few times now, but I have to use it to make money. I've found a way to keep things pretty safe with windows. I have a linux system with Samba and Reiser where I keep my really important mission critical data. I keep a copy (for performance) on a FAT32 drive of mine (so I can access in windows and linux... I've heard that there is a windows device driver for Ext2 but I can't find it, only an explorer app.)

      Anyhow, saying windows can fix errors on the system is only comparing ScanDisk(TM) with fsck. I was merely pointing out that linux filesystems don't really loose the whole fs like Win2k has done to me and a few friends. As I understand it, Win2K can still corrupt the MFT, but even this kernel bug doesn't hose the filesystem.

      (I run linux on the same computer, and I've never had a problem even with 2.4.15 (running it for a few weeks now) Guess I should patch to be on the safe side :) )

      My view point is still the same on this kernel though... I've considered anything since the new VM developmental. I still love the kernel hackers even if they've been drinking Decaf. for the past few months :) I'm not trying to justify buggy kernels in a stable tree, but this whole issue is blown out of proportion. Most people are using ReiserFS, EXT3, or XFS. I don't think this bug effects them. I can find no information nor cases where the FS has been corrupted on a journalled filesystem. I'm not a subscriber to the LKML and no one has posted any info about that except of uncertainty. I've done quite a bit of research on ReiserFS (http://www.namesys.com for more info on ReiserFS) and we all know about Ext2(have no link). I can see easily how Ext2 would be effected if Inodes where not written back from cache, but I'm not so certain about ReiserFS. If the Inodes aren't written back, the data should still be valid, just outdated since "metadata" is re-written instead of over-written.

      Still would like some more info!! :)

      --
      Karma Clown
    2. Re:NT ok, Win2k fixes NTFS errors pretty well. by DarkWarriorSS · · Score: 1

      Well... I'm not sure what happened, but I made sure that I made a backup of my system. And I'm glad I did too. I'm running ReiserFS, wouldn't run anything else, and I still got a currupt filesystem. Infact, I'm recompile QT and KDE right now, after the install of Slack just finished. So yes, from my experience, this bug does affect ReiserFS users. Altho I'm not sure about JFS, XFS, ext3, and so on.

  47. And What? by J.C.B. · · Score: 1
    Am I supposed to say, "It's okay only a few people's disks got eaten?" This is/was a very serious sort of bug, it should have never made its way into the branch that is generally considered to be the stable non-development one. Someone needs to be rethinking their development/testing methods. Any kernel that hasn't been tested sufficiently should be clearly marked as such, and extra-special care should be taken when messing with filesystem/disk I/O code.

    And what the fuck does Microsoft have to do with any of this?

    1. Re:And What? by Rares+Marian · · Score: 1

      Shit happens and gets fixed here.
      Go complain about people who DON'T DO THEIR JOB.

      1. This was a very serious bug
      Perhaps, but it's not the evil kind of bug like Windows locking up after several hours of non use.
      This bug is not going to start the LinSux jokes.
      This bug is the result of inevitable mistakes not carelessness.

      2. It should have never made its way into the stable branch.
      It didn't make its way. It had no history. One day no bug, next day bug, next day no bug.

      3.Any kernel that hasn't been tested...
      Ok I'll give you that. Kernels should have a standard subsystem test suite applied. Hmm...
      I think I'll go outline one. Glad I thought of the idea. Maybe I could even make a buck off this...

      4. Special care with messing w/ io code.
      If bug in subsystem A needs changes in subsystem B check subsystem A,B, and if subsystem C uses A or B check C and if E,F,and G use C,A, or B check A,B,C,D,E,F,G. Check D just so we don't start wondering which have and haven't been checked.

      Hmm... wait a minute why am I givin any of this away...

      --
      The message on the other side of this sig is false.
    2. Re:And What? by khyron4eva · · Score: 1

      You said:

      "3.Any kernel that hasn't been tested...
      Ok I'll give you that. Kernels should have a standard subsystem test suite applied. Hmm...
      I think I'll go outline one. Glad I thought of the idea. Maybe I could even make a buck off this..."

      Better minds have long beat you to this one...

      http://ltp.sourceforge.net/

      Unless you can do better than SGI, IBM, Bull and the OSDL.

  48. regression tests? by treat · · Score: 4, Redundant

    Is there any project to create a set of regression tests for the Linux kernel? This is not the first serious bug that would have been found with even the most basic set of regression tests.

    1. Re:regression tests? by ansible · · Score: 2

      I agree that regression tests are not a substitute for other testing and thinking. However, if such-and-such patch does break test #4751, then that's an indication that the patch and maybe the test need to be investigated further.

      I've loosely followed the progress of 2.4.x. Because of NFS problems until recently, I wouldn't have dreamed of puttint it into production. Every time it's gotten close to stable (by my standards), something else has cropped up (like the VM change).

      I'm gonna wait until we go at least 2 months without a major bugfix before I implement 2.4.x. I want to see a ChangeLog that mostly consists of "added new driver for NNN".

    2. Re:regression tests? by GigsVT · · Score: 2

      http://linuxquality.sunsite.dk/articles/testsuites /

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
  49. Hate it say it but... by Caligari · · Score: 1

    Linux makes a very nice workstation OS, but things like this that cause me to recommend FreeBSD for the server...

    I've never heard of something like this happening in FreeBSD-STABLE.

    While Linux may have more cutting edge features and perhaps has some speed advantages (this is debatable), the FreeBSD (and OpenBSD too!) coders are more conservative about what gets integrated into something which is supposedly stable.
    BSD gives you more peace of mind IMHO.

    --
    The moving cursor writes, and having written, blinks on.
  50. Big deal. by Shane · · Score: 5, Insightful

    It amazes me how big of a deal people make these types of issues out to be. I have heard of high standards but SH*T!. The more I read slashdot the more I realize that very few posters here actully work with much commerical grade software. These type of issues occure freqently with every software vendor I deal with professionally: Cisco, Microsoft, IBM, RedHat, Checkpoint ect.. ect.. The difference is when Cisco releases a new IOS image (which they do about twice as freqently as Linus does) They will quitely mark saym a 1/4th of them DF which stands for _DEFFERED_ i.e. SERIOUS BUG DON'T USE once it is discovered.

    This is why production implentations of software go through testing before deployment when at all possible. If you are running Cisco IOS that is say less then a month old you are taking a risk that there will be a serious bug that will hurt you. The same holds true for Linux kernels or any other peice of software. The more complicated the software the harder it is to keep serious bugs from slipping through the cracks, It is _AMAZING_ that Linux has a few major issues as it does.

    Here is an exercise for you all: Go to www.microsoft.com go to their support section and read through all of the changelogs (they are hard to find) for all of the hot fixes, service packs and general software updates and you will see what I mean (And yes you will find file system corruption there too).

    --
    -- You can be a geeklord too :)
    1. Re:Big deal. by Sentry21 · · Score: 2

      It's odd, you know. I'm used to software being released AFTER it's been tested, not before. In fact, in my experience, that's been the case.

      I don't think it's 'high standards' to expect your filesystem to stay intact after unmounting it. I don't care how new the kernel is, it's just the sort of thing people expect.

      Admittedly, important servers shouldn't be upgraded to the latest new kernel, but we should have clearly defined branches of 'stable' and 'testing'. 2.2.x is monstrously old, but still being updated, supposedly. So is 2.4.x, and now we hav 2.5.x to worry about too? Which is stable? Which is unstable?

      Debian can divide thousands of packages into these categories, why can't the kernel developers divide their kernel into them and make it obvious? I used to trust 2..x releases, because I was told they weren't devel. I didn't know that !devel didn't mean !going to corrupt your drive.

      --Dan

    2. Re:Big deal. by Shane · · Score: 3, Insightful

      The software _was_ released after it was tested. It was tested, a problem was found.. a patch was provided.. the patch was tested.. it was included.. kernel got released.. problem was discovered a patch was created and its about to be released.. thats how software works. You don't catch all of the issues..

      Now you can sit there and say "If Linus would of waited _blank_ period of time someone would of caught the problem before the release and this wouldn't of happend. You could also says that if Linus would just release -pre kernels and only release -stable kernels once a year we would have a REALLY stable kernel... the problem is thats not how the release early/release often model of development works. If you want that model use Microsoft we all know how stable their software is.

      If you want serious QA use redhat.. they do serious QA.. If you are running 0day software you get burned.. wether its the latest linux kernel, the latest microsoft service pack or the latest Cisco IOS.

      Question: what is your example of software that is released "AFTER it's been tested". I can't wait to go read through the change logs and find some bugs that should of been caught by this software superior QA.

      --
      -- You can be a geeklord too :)
    3. Re:Big deal. by Shane · · Score: 2

      Up to a point I will agree with you. Some key conflicts I have with your statements is that a) Microsoft has these issues all the time, they are documented in their changelogs. b) Microsoft isn't evil, they are greedy and controlling and have too much power given their past actions. In my opinion this is why so many people dislike microsoft and not Linus, Alan ect..

      --
      -- You can be a geeklord too :)
    4. Re:Big deal. by Codifex+Maximus · · Score: 2

      Sentry21 wrote:
      > It's odd, you know. I'm used to software being
      > released AFTER it's been tested, not before. In
      > fact, in my experience, that's been the case.

      Sentry21... you ARE the tester. This is Open-Software remember? The users report back to the maintainer their problems - those problems then receive attention in the form of patches or new versions etc... etc... etc...

      If you don't want to be part of the testing process then hang back a few kernel revisions and choose one that didn't have an major issues. Such is what the Distribution maintainers do.

      --
      Codifex Maximus ~ In search of... a shorter sig.
    5. Re:Big deal. by Telek · · Score: 2

      There wasn't any bugs so serious in a public release that when you turned off your computer it corrupted your hard drive on just about any computer. There were some, yes, where with an obscure set of hardware the drivers would fuck something up, and BTW it's *driver* problems in the majority of cases.

      It's not "_AMAZING_" that Linux has as few bugs as it does, when you consider that is one of the major good points about OSS. Add to that fact that many many users are able to actually look at the source code when they do find a bug and fix it themselves, or at least point out where they think the problem is, and it's not all that crazy to believe that there aren't very many bugs. It's still a badge, yes, absolutely, but I don't think it's "_AMAZING_" that is the case.

      What amazes me is how quickly everyone and their brother post disperaging remarks here on /. whenever there's a MS problem, but as soon as there is a major linux one, it's just bad luck dude.... If that isn't hypocritical I don't know what is.

      --

      If God gave us curiosity
  51. QA/Release cycles/et al by Tom+Rini · · Score: 2, Insightful

    I've seen lots of posts about 'We need to QA this!'
    and 'Are there any projects to try and QA the kernel releases?' Both of these miss the point. While we do need more people running the tests which do exist on the -pre releases, it comes down to Linus having an itchy trigger finger, so to speak. 2.4.15 in it's final form did exist for a little while, but it wasn't long enough for anyone to go and give it a good test. There's often been requests for Linus to wait a few days from the last -pre to -final so other arches and sync up (2.4.15 only compiles on x86/sparc64/arm and alpha). If this was released on monday, none of this would happen.

  52. Testing / QA by vlad_petric · · Score: 1

    Just wanted to remind you that testing doesn't guarantee lack of errors, it's only the other way around (lack of testing guarantees errors). Even with tons of testing, showstoppers still make it to public releases.

    The only problem is that a lot people download the latest kernel imediately after its release and put it on a production machine. My message to them (to you): "You're nothing but insane!". IMHO this should be a very valuable lesson to you ...

    The Raven

    --

    The Raven

  53. That is a cop-out by Sanity · · Score: 5, Insightful
    Transferring the responsibility to distribution maintainers is a cop-out.

    The real problem is that new functionality is being added to the stable branch.

    The solution to this type of problem is simple, when a stable kernel is released, an unstable branch should be created immedately. New functionality was being added to the 2.4 branch by developers simply because there is nowhere else to put it.

    New functionality should never be added to a stable branch in a piece of software as mission-critical as a kernel, that is what the unstable/development branch is for.

    If the kernel maintainers want to accelorate the pace at which new functionality gets into a stable branch then they should increase the frequency with which development branches become stable.

    1. Re:That is a cop-out by ViXX0r · · Score: 2, Informative

      > The real problem is that new functionality is being added to the stable branch

      In this case, the real problem was that a bugfix (which is supposed to occur in stable kernels) was faulty and caused another bug.

      --
      University - a box of academia nuts.
  54. Re:Those bastards! by reddeno · · Score: 1

    You're _SO_ funny! Hahahaha!

  55. Compare like with like... please... by smcv · · Score: 1

    ::sigh:: If you want a complete in-house-tested Linux OS, and you don't want to be a beta tester, get something like Debian stable.
    Yes, it's stale and out of date when compared with other distributions (e.g. Debian unstable), and yes, the default kernel is 2.2.19 (and when the current testing distribution becomes Debian 3.0, they're probably going to include a 2.2 kernel with that too). This is the price you pay for stability.

    Here's the difference in philosophy:

    MS: Only use the next version of Windows in-house until they're confident that it's nearly ready, then release it to beta testers, then fix bugs they report, then release when they're confident that it's ready.

    Linux kernel team: Release N.N.NNpreN versions until it's nearly ready, then release a "final" version; let distributions and "advanced users" (i.e. those who compile their own kernels) decide where they want to be on the spectrum going from cutting-edge features to known reliability.

    When do you think the Windows kernel developers last made major (i.e. non-bugfix) changes to, say, the released WinXP kernel, anyway? Probably quite a while back... and yet nobody complains about that, precisely because we never see the latest version until it's considered stable and used in Windows ZQ or something in a couple of years.

    1. Re:Compare like with like... please... by Vincepb · · Score: 1

      2.2.20 is stable. Considered stable, proven stable.
      It also supports USB.

      Vince.

      --

      I need a sig.
  56. Re:It's called Win9x by seann · · Score: 1

    hey, if 2.4.15 causes you trouble, then don't use it.
    You don't have to be on the bleeding edge. (I chose 2.4.15-pre6 because it fixed a DRM issue that affects 4.1.0), I was happy with 2.4.6.

    --
    I'm a big retard who forgot to log out of Slashdot on Mike's computer! LOOK AT ME.
  57. Yikes! by rsimmons · · Score: 1

    Between this and the security hole that surfaced recently, the kernel has had some really scary problems. I'm wondering if this latest problem could have been avoided?

  58. Please spare us by VividU · · Score: 1

    Expecting a OS to not destroy files is not holding a high standard. Just the opposite.

    1. Re:Please spare us by Shane · · Score: 2

      Oh it isn't? Name one that has not had this same problem in the last year? Just one.

      --
      -- You can be a geeklord too :)
    2. Re:Please spare us by VividU · · Score: 2, Interesting

      "Name one that has not had this same problem in the last year? Just one"

      Windows 2000

      Take this post as a challenge. Reply with a link that shows that there is/was a bug in Windows 2000 that caused the loss of a ENTIRE FILE SYSTEM ala Linux or Apples iTunes.

    3. Re:Please spare us by Shane · · Score: 4, Informative

      First: This linux bug does not the loss of the ENTIRE FILE SYSTEM. It leaves .lock files with invalid INODES which can be repaired by manully running fsck. As to you're challenge, these are just a few corruption problems with windows 2000 that I found doing a simple search on www.microsoft.com.

      http://support.microsoft.com/support/kb/articles /Q 268/8/97.ASP

      http://support.microsoft.com/support/kb/articles /Q 258/0/75.ASP

      http://support.microsoft.com/support/kb/articles /Q 273/2/45.ASP

      http://support.microsoft.com/support/kb/articles /Q 298/9/36.ASP?LN=EN-US&SD=gn&FR=0&qry=file%20system %20corruption&rnk=16&src=DHCS_MSPSS_gn_SRCH&SPR=WI N2000

      http://support.microsoft.com/support/kb/articles /Q 261/1/22.ASP?LN=EN-US&SD=gn&FR=0&qry=file%20system %20corruption&rnk=19&src=DHCS_MSPSS_gn_SRCH&SPR=WI N2000

      http://support.microsoft.com/support/kb/articles /Q 255/5/69.ASP?LN=EN-US&SD=gn&FR=0&qry=file%20system %20corruption&rnk=23&src=DHCS_MSPSS_gn_SRCH&SPR=WI N2000

      --
      -- You can be a geeklord too :)
  59. ReiserFS by Artana+Niveus+Corvum · · Score: 1

    I have been testing ever since I first heard whispers of this bug this morning and both of my test machines (one VIA KT266A/AthlonXP, one 440BX/dual pentium-II) which use ReiserFS seem completely unaffected... More after continued testing.

    --
    -----------------------------------------
    Remove the Greed which plagues mankind.
  60. Mind if I puke here... by linuxjack55 · · Score: 1

    Windows is the place where Bill Gates displays all of the ideas he's stolen, and all of the companies he's crushed. It's a museum dedicated to the multifarious ways in which greed and power corrupt. I wouldn't use his OS even if it was worth a shit. As for being "a paradign of total customer satisfaction", I'm surprised most Windows users can get any work done at all. I mean holding a mouse with your right hand and your nose with the other doesn't leave a hand free to do anything else...

    --
    The trouble with practical jokes is that very often they get elected. -- Will Rogers
  61. Need Linux regression testing! by Jeppe+Salvesen · · Score: 2

    When the so-called stable kernel can be released with such a huge bug, how can we tell the managers that Linux is stable and hassle-free?

    Really - we need to make scripts that test right about every critical aspect of a kernel. That would be file systems, VM, IPC, SMP, hardware drivers, SCSI, IDE, ethernet, token ring and more.

    Has anybody made such scripts? One thing is a broken, obscure driver, another thing is bugs that break everybody - like VM and now unmount.

    --

    Stop the brainwash

    1. Re:Need Linux regression testing! by GigsVT · · Score: 2

      http://linuxquality.sunsite.dk/articles/testsuites /

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
  62. GAH!!!! by strredwolf · · Score: 2

    Installed 2.4.15 the day this post came out. GAH! Now trying to deinstall the bird and go back to 2.4.14, and no matter what I do it says it's the greased turkey.

    Back to 2.2.19 now to recompile 2.4.14...

    --

    --
    # Canmephians for a better Linux Kernel
    $Stalag99{"URL"}="http://stalag99.net";
  63. Stable (?) kernels by Frank+T.+Lofaro+Jr. · · Score: 1

    These sorts of things should not creep into the STABLE kernel series.

    --
    Just because it CAN be done, doesn't mean it should!
  64. Lucky for me... by slamb · · Score: 1
    ...that there's another bug. It didn't link. Otherwise, I'd be running it now and merrily corrupting my beautiful filesystem.

    fs/fs.o: In function `dput':
    fs/fs.o(.text+0x118a8): undefined reference to `atomic_dec_and_lock'
    make: *** [vmlinux] Error 1

  65. Give it up. by J.C.B. · · Score: 1
    Linus fucked up. A bug was introduced that caused data loss, and it was in a non-prerelease kernel. It was fixed, but that doesn't make up for the fact that such a serious bug should have been caught and fixed before the release. You're just going to have to admit this to yourself, the Linux kernel will justly receive criticism whenever a major bug is found in a released version, especially one that could corrupt data.

    Well, at least you aren't expecting me to bitch about microsoft, in this totally unrelated story, anymore.

    1. Re:Give it up. by Rares+Marian · · Score: 1

      I got a little tired of the chicken little mob whining. I honestly believe it's a waste to criticize over a bug that gets fixed fast regardless what is.

      I'm more concerned about the code that rots and gets stale because no one wants to do their chores.

      --
      The message on the other side of this sig is false.
  66. It's like deja vu all over again! by 42forty-two42 · · Score: 1

    This is sorta like the glitch in 2.4.11, only worse...

  67. This bug was already mentioned on Slashdot... by exceed · · Score: 1

    This bug had already been mentioned on a post on the original article that informed us of 2.5's release.

    http://slashdot.org/comments.pl?sid=24074&cid=2605 154
    What happened? Didn't anyone take this seriously?

    --

    void women (int money, time_t time);
  68. ridiculous comparison by David+Jao · · Score: 2
    Redhat Linux 7

    Take this post as a challenge. Reply with a link that shows that there is/was a bug in Redhat Linux 7 that caused the loss of an ENTIRE FILE SYSTEM.

    The point (which I'm sure you'll miss, but anyway) is that linux-2.4.15.tar.gz is not an operating system. Anyone with the knowhow to download, compile, and install 2.4.15 from source had better be able to run fsck when something like this happens.

    Furthermore you way overstate the case when you assert this causes lost file systems. The vast majority of 2.4.15 corruption cases can be repaired with a fsck.

    Personally, I consider the code red II worm to be a far greater threat to my data than linux-2.4.15.tar.gz.

  69. Great start by Skuggan · · Score: 1

    Now it can only get better :-)

    --
    http://www.millnet.se/ GO/U d- s+:+ a C++ UL++++ P- L+++ E W+++ N+ w++ M-- PE+ t+ X++
  70. Phew... by Sir+Joltalot · · Score: 1

    Whoa.. I was running it there for a few minutes until I saw this.. then I compiled 2.4.14 right away and rebooted, there don't seem to be any problems yet.

    This all makes me wonder, though, about 2.4. I mean it seems like there have been relatively many "big" problems with 2.4 throughout its life. 2.4.10 had the SMP thing, 2.4.15 has this, and there have been other problems big enough to pretty much make a version unusable.

    I don't seem to remember as much of this happening with 2.2, I mean obviously every major new version of the kernel will be rough around the edges for the first few releases, but 2.2 seemed not to have as many of these big problems. So, what's going on, are kernels getting worse or am I just smocking crack?

    --
    "Caffeine is not an option. Caffeine is a way of life."
  71. Version 2.3? by Decimal · · Score: 1

    I'm a complete Linux Newbie, so please don't mod me down for just asking. Was there ever a kernal 2.3? Or will we be seeing it when 2.2 reaches a certain development point?

    --

    Remember "Bring 'em on"? *sigh
    1. Re:Version 2.3? by phaze3000 · · Score: 2
      Yes there was a 2.3, in fact there were over 90 2.3 releases IIRC..

      Odd minor version numbers are unstable (so 2.1, 2.3 and 2.5 are all unstable kernel branches).

      --
      Blaming GW Bush for the Iraq war is like blaming Ronald McDonald for the poor quality of food.
    2. Re:Version 2.3? by -brazil- · · Score: 1

      I've always heard the third version number being called the "minor" one. If the second is the minor, what's the third? The minuscule version number?

      --

      The illegal we do immediately. The unconstitutional takes a little longer.
      --Henry Kissinger

    3. Re:Version 2.3? by randomgeek · · Score: 1

      it goes major.minor.revision/Build. I've seen the third referred to as both revision and build.

  72. What kernels are stable? by Leimy · · Score: 1

    Since I first started using linux there have been two development trees.. 2.even.x [stable] and 2.odd.x [unstable].

    What makes one more stable than the other?
    There must not be too much testing of these so called bug fixes going on. I mean a simple reboot seems to be all that was needed to bring this bug to light.

    I would really like to know that when a "stable" kernel comes out that it has at least been tested somewhat before release. As the kernel grows larger I sense that these problems will increase in frequency.

  73. I found a REAL good patch by Anonymous Coward · · Score: 1, Informative

    I found a patch that solves the problem TOTALLY! The URL is www.freebsd.org

    Linux Sucks by the way

  74. 2.4 seems to have had some serious problems by prismatic · · Score: 2, Insightful
    between the VM issues in 2.4.[5-9] (iirc), and the big media fuss over the VM change in the .9->.10 transistion, then the FS corruption bug in .15. well, i'm glad i went from 2.4.5 to 2.4.12 and am still at it.

    <pseudo-rant>
    maybe there's a good side to your ISP going out of business and qwest dsl fscking you over changing your isp, making it harder to update your kernel 8)
    </pseudo-rant>

    but ultimately, i can't see its all that big of a deal. all you have to do is take a couple of weeks to get to the newest kernel. wait till its been out a fortnight, and you're golden

    --
    Brian Voils
    "A university is what a college becomes when the faculty loses interest in students."
  75. But what about unforseen complications? by marick · · Score: 1

    Look, writing software is difficult, and anything as complicated and flexible as the Linux kernel is a management headache, to say the least.

    It's fine and dandy to say "don't add new features in the stable series", but then what about important features like USB? When the 2.2 kernel series went stable, USB was a pipe-dream, so at around 2.2.15 (IIRC) USB was back-ported into the 2.2 kernel. This was generally considered a GOOD THING, not too risky.

    Even when not adding new features, sometimes just fixing bugs can cause new bugs to appear. A fact of software development is that sometimes things like this happen. Unforseen necessity can destabilize stable software. Try as we might, following proper coding practices and the like, writing software has continued to be hard simply by virtue of its complexity. Writing software is somewhat of a craft, not a pure engineering, and sometimes things don't go as planned (think leaky pottery).

    Ultimately, though, the goal is to improve the process as much as possible, keep/sell the "good" products (such as kernel 2.4.14) and throw away the ones that developed leaks (such as 2.4.15).

  76. Crack Team by redcliffe · · Score: 1

    What Linus needs to find is a crack team of kernel breakers. People with varieties of hardware. They need to build an automated testing tool that would automatically put the release kernel through thousands of iterations of tests, to ensure that nothing is wrong. What they need is a GNU/Autotest for testing kernels in a peer-to-peer SETI@home style, on non essential machines.

    David

  77. why can't MS buy some quality?? by Anonymous Coward · · Score: 1, Insightful

    MS has 36 billion bucks. Linux is a volunteer effort.

    Why is is that they are even in competition, again? Why is it MS can't buy some fucking good PROGRAMMING???

  78. 2.4.16-pre1 is out by the+COW+OF+DOOM+(tm) · · Score: 1

    ..including the patch for this bug. get it here

  79. wait.. by RyuuzakiTetsuya · · Score: 1

    Don't you have to reboot to make the patch work? Does'nt that require an unmount?

    --
    Non impediti ratione cogitationus.
  80. Got Slack? by L.+J.+Beauregard · · Score: 1

    The 2.4.15 bug didn't bite me. Of course, Slackware's shutdown script runs sync automatically.

    --
    Ooh, moderator points! Five more idjits go to Minus One Hell!
    Delendae sunt RIAA, MPAA et Windoze
  81. I think you missed his point a bit. by Chuck+Chunder · · Score: 4, Insightful

    People downloading kernels from kernel.org, particularly in the first few days of a release, are part of the QA process, not the ultimate beneficiaries of one.

    The Open Source (or more correctly, bazaar or distributed) development model also distributes responsibility. If the possibility of losing your data is something you can't afford then you simply shouldn't be sitting on the cutting edge of kernel development.

    --
    Boffoonery - downloadable Comedy Benefit for Bletchley Park
    1. Re:I think you missed his point a bit. by SuiteSisterMary · · Score: 2
      People downloading kernels from kernel.org, particularly in the first few days of a release, are part of the QA process, not the ultimate beneficiaries of one.
      The funny thing is that if you substitute 'buying Windows' for 'downloading kernels' then you're saying the same thing about microsoft, only instead of being earnestly serious, you're being snide and condescending.
      --
      Vintage computer games and RPG books available. Email me if you're interested.
    2. Re:I think you missed his point a bit. by barneyfoo · · Score: 1

      Ahhh, if only microsoft let the general public have access to weekly kernel builds and the sourcecode to come along with it.. You're living a fantasy if you think the two examples are in any way equal. What would be equal is the MS bug level, and say the Redhat 7.2 kernel bug level. You might be on to something there. Otherwise you're deluding yourself and (hopefully not) bringing others down with you into the mire.

    3. Re:I think you missed his point a bit. by SuiteSisterMary · · Score: 2
      Ahhh, if only microsoft let the general public have access to weekly kernel builds and the sourcecode to come along with it..
      Bullshit. With OSS, it's "Release early, release often." With Microsoft, it's "They obviously didn't even bother to test this. Typical Microsoft slipshod work. Haven't they even heard of QA?
      --
      Vintage computer games and RPG books available. Email me if you're interested.
    4. Re:I think you missed his point a bit. by barneyfoo · · Score: 1

      I'll state it again. Compare microsoft to redhat release kernels (2.4.9-2, 2.4.7-9) custom built and tested under the most strenuous conditions.. You haven't answered this point. Maybe you are too blindly pro microsoft to realize that one man (linus) releasing a kernel does not equal that of a corperation. You own the GPL'd kernel as much as linus does (he doesn't require assignment of copyright to him for submitted code and patches).

      So I state again. Compare the redhat release kernel to microsoft. Maybe you dont want to. Maybe it will throw your whole world view into the shitter.

    5. Re:I think you missed his point a bit. by SuiteSisterMary · · Score: 2

      FYI, I stopped using Linux in a production environment when Mandrake 7 and 8, and RedHat 7.2, when told to run a cron job at 4 AM every morning, using Vixie crond, would run it at perfectly random times, day or night, and multiple times, day or night, on several different machines. Linux desperately wants to be in the same world as Windows NT and Solaris, but people like you then complain long and loud when one tries to apply the same rules.

      --
      Vintage computer games and RPG books available. Email me if you're interested.
    6. Re:I think you missed his point a bit. by barneyfoo · · Score: 1

      Hrm. sounds like you're pretty incompotent to me.

      Slashdot is your only forum for bringing this issue up? I'm sure you're such a lame admin that you'd explain your problem to me, in detail, on slashdot, before you would to the appropriate redhat mailing list or developer. Shame, really.

    7. Re:I think you missed his point a bit. by SuiteSisterMary · · Score: 2

      This was a year back, and I went through the appropriate support forums, such as they were. It wasn't worth my time to deal with, though.

      --
      Vintage computer games and RPG books available. Email me if you're interested.
    8. Re:I think you missed his point a bit. by GigsVT · · Score: 1

      Not to jump into the fray, but coming from someone who has been looking for a job via their sig on /. for at least 2 months, it sort of loses some effect. :)

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    9. Re:I think you missed his point a bit. by SuiteSisterMary · · Score: 1
      Not to jump into the fray, but coming from someone who has been looking for a job via their sig on /. for at least 2 months, it sort of loses some effect. :)
      Two years ago, I'd have agreed. In this economy, no. :-)
      --
      Vintage computer games and RPG books available. Email me if you're interested.
  82. Re:Wait a second... by Telek · · Score: 2

    I think I'm just anti-anti-microsoft, that's all. I think they get beat up far too often for things that aren't always their fault.

    --

    If God gave us curiosity
  83. Typical by The+Bungi · · Score: 1

    There's the OS for you, typical piece of untested fucked up crap.

    Oh, wait. This is Linux we're talking about. Hmmm. OK, I know:

    Go Linus! Go Alan! Go kernel Gods! We're sure this is a very minor thing that will be fixed in no time! It doesn't matter!! We can take anything!! We love you!!!

    Micro$soft sucks! Linux roolz!!!1!!!

    OK, now moderate up, si vouz plez.

  84. Linux: Amateur hour by kivi · · Score: 1

    This is just the the thing "my mother/reast of the world except the converted" has against Linux.. Non "fully" tested code coming out a "stable" release...

    Marcelo your first priority should be to come up with a plausible patch plan.. ie:

    <UL>
    <LI>There should be more than a few days between patches, preferably months.</LI>
    <LI>There should be a REAL test task force, not
    only rely on "nerds".</LI>
    </UL>

    Without STABILITY Linux will NEVER succeed...

    And for Linus on the 2.5 side: Come up with a plan where the kernel is EASY to update.. We "nerds" know how to do this.. But the "TOTAL WORLD DOMINATION" is about the non "nerd/technical" people...

    --
    -- Kalle Kiviaho - kivi@picox.se -- Another day, another coma..
    1. Re:Linux: Amateur hour by PigleT · · Score: 2

      "Without STABILITY Linux will NEVER succeed... "

      ITYM `without stability we will be unable to continue using it for production purposes'. HTH. HAND.

      "But the "TOTAL WORLD DOMINATION" is about the non "nerd/technical" people..."
      Precisely why I don't give a fig about total world domination.

      Go read the Advocacy-HOWTO.

      --
      ~Tim
      --
      .|` Clouds cross the black moonlight,
      Rushing on down to the circle of the turn
  85. Re:You are right. by SuiteSisterMary · · Score: 2
    A fix was released in merely hours. I don't see Microsoft doing that.
    Well, I do seem to recall /. yelling at microsoft for Code Red, which went wild WEEKS after the Microsoft patch which fixed it was out, and MONTHS after the IIS HowTo which told you how to avoid the whole mess in the first place was out.
    --
    Vintage computer games and RPG books available. Email me if you're interested.
  86. 2.5, how appropriate. by supabeast! · · Score: 2

    Well, at least 2.5 was fucked up! Now nobody can really say that 2.5 was ever stable!

  87. It's things like this... by Teferi · · Score: 2

    ...that make me glad I switched to FreeBSD a while ago.
    Linux does have a lot of things I miss - DRI/DRM still isn't working right, X and GTK in particular seem a bit slower - but it's absolutely rock solid. I've only managed to crash it once, and that was my own fault - loaded a KLD from 4.4-RELEASE into a 4.4-STABLE kernel. Nice panic there.
    The ports system also is really nice; it could do dependencies a bit better, but it's generally fairly smart about it. And having the entire system source on dosk and available is nice.
    I like how the system is in CVS - bugs get patched and fixes checked in fast, and all one has to do is a 'make update' in /usr/src and do some rebuilding to update. No waiting for a release.

    --
    -- Veni, vidi, dormivi
  88. Re:The discussion isn't over [OT] by Lord+Omlette · · Score: 1

    It doesn't, I was just curious. Stupid 120 char limit.

    --
    [o]_O
  89. 2.4.15 = 2.5.0 = dontuse by SurfsUp · · Score: 2

    If you've been having the filesystem corruptions,

    *Everybody* will get corruption using 2.4.15/2.5.0, just don't use it.

    --
    Life's a bitch but somebody's gotta do it.
  90. There is hope by burden123 · · Score: 1

    > pre6: fix Intel 8xx agptlb flush Oooh, thats my board! This gives me hope that i can run linux again on my P4 1.4Ghz longer then 15 minutes without a hardware lockup.

  91. Re:Stop making excuses. by NDPTAL85 · · Score: 1

    2.0.x is still being maintained as well. So whats that make it, the extra-stable branch? :P

    --
    Mac OS X and Windows XP working side by side to fight back the night.
  92. Worried about new kernels by SnapperHead · · Score: 1

    I am not sure whats going on these days, but things like this are poping up way too much in the stable kernels over the past 2 months. The 2.2 series kernel had issues, but not this offtean. Plus, memory management wasn't as serious. (as far as the eariley 2.4 issues)

    Linus said when 2.3 was started that 2.4 would be a smaller quicker release style. I am wondering if that was a bad idea. Too much changed in 2.4, I think it really affected its development. I would rather see 2.5 development stopped ASAP to fix 99% of the 2.4 issues before going on. I could careless if 2.5 took another 2 years before it was started, as long as 2.4 was stable.

    Becuase of all these recent events, I have decied to wait at LEAST 2 weeks before upgrading kernels. Durring 2.2 (at eailry 2.4) I would upgrade within a few hours of release. (By the time I found out :)

    _if_ BSD would provide the same things that Linux can, (for example, a full /proc, which I find VERY usefull), I would think about switching, or dual booting. Most likley, moving my server over. BSD is quite stable compaired to Linux over the recent months. Its just not as easy to use.

    Anyway, I really hope things start improving.

    --
    until (succeed) try { again(); }
  93. This is why I use Commodore Basic V2.0 by PbHead · · Score: 1
    Oh Ya Baby. The good old commodore C=64!

    Nice and stable. Forget the Idea that it's rarely updated, and when it is you don't see anything but the finished product. I saw the C=128 is up to Commodore Basic V7, but thats too risky for me. I prefer to stay with the good ole rock solid stuff.

    Well sombody had to say it.

    --
    Opinions Expressed by Me should be Forced on Others - PbHead
  94. Yes we do have high standards. by TheLink · · Score: 2

    Yeah we have high standards. If not we'd write the stuff ourselves right? ;)

    Seriously tho, what do you want, a Linux that's slightly less unstable than Windows. Or a Linux that's actually stable.

    I don't understand why so many people here are using Microsoft to show why Linux isn't that bad.

    This is STABILITY we are talking about. If you have to resort to mentioning Microsoft then Linux has become rather bad hasn't it?

    If you are talking Joe Public acceptability then yeah mention Microsoft.

    Maybe Linux should go towards the FreeBSD style of releasing - STABLE, CURRENT, DEVELOPMENT.

    More regression tests before an actual release would help too.

    Cheerio,
    Link.

    --
  95. Re:Wait a second... by barneyfoo · · Score: 1

    Microsoft is a business. They pretend they have feelings to affect people like you. How can anyone feel sorry for a company that has $35 billion dollars in the bank. (that's cash folks, not equity or stock. Cold hard green american dollars). Even if linux destroys microsoft's business (which I'm sure you're inherently worried about.. gasp!), Microsoft could buy up 100 other businesses, write down the inital capital loss and start making bucket loads again.

    So please, the sky is _not_ falling, microsoft defenders. Stop pretending like everyone is out to get you. If you use windows, linux users dont hate you. If you like Microsoft Word, linux users aren't going beat down your door. If you feel left out of the linux party, you can join whenever you want, just remember to leave your whining, snivelling, righteous attitudes at the door, because we're all sick of it.

  96. Re:You are right. by barneyfoo · · Score: 1

    Awwwwww.. Poor microsoft. I'm crying. I really am. They are so deserving of our sympathy and fair treatment. I mean, look at how well they've treated us all in the past 10 years. Dont be fooled folks. Microsoft is doing VERY well without our brain-washed sympathy. Lets not feed into the marketing hype from redmond that says microsoft is embattled and they need our support cause they are down-and-out and they need to be cuddled. Give me a fucking break.

  97. Re:Wait a second... by Telek · · Score: 2

    man that really made me laugh out loud.

    Set your comment level to 0 and read any stories on /. that deal with Microsoft or Linux, and then say all of that again. The majority of users here do nothing but slam microsoft at every opportunity that they get, and yes, insult microsoft users. Sometimes even browsing at a level of 2 isn't enough to keep them out.

    Wow what a hoot.

    --

    If God gave us curiosity
  98. Debian Stable isn't running 2.4.x by Bob_Robertson · · Score: 1
    Since this bug is stated to be only in the 2.4.x series, the 2.2.19(i think) current Debian stable kernel is not effected.


    However, I'm running 2.4.12 in Debian Testing, and have not yet seen this problem.


    I will, however, be forcing fsck on every boot as soon as I can find out how to set that option.


    Bob-

    --
    The Ludwig von Mises Institute. The reasoning individuals economics
    1. Re:Debian Stable isn't running 2.4.x by Cro+Magnon · · Score: 1

      2.4.12 should be safe (as safe as any 2.4 version). This bug is just in 2.4.15-pre9 & 2.4.15-greased-turkey/dead-duck

      --
      Slow down, cowboy! It has been 4 hours since you last posted. You must wait another few hours.
  99. damn skippy! by mosch · · Score: 2

    enterprise software, by Linux.

  100. Re:Spelling lesson time by AveryT · · Score: 1

    I could careless "careless" is a word but you should have used "care less"

    Ummm... if you want to be picky about grammar, it's "I couldn't care less".

    "I could care less" is a bastardization which means the exact opposite of what people who say it think it does.

  101. Re:Wait a second... by barneyfoo · · Score: 1

    The point is to NOT read slashdot at level 0. If you come on slashdot to respond to level 0's, then I'll just roll my eyes at your hypocrisy.

    Please, enhance the culture, dont get stuck in the mud and bring it down. And dont be a paraoind MS defender. 1) They dont need your help. and 2) it makes you look like a tool[1].

    [1] proper use of tool. Most people use tool to refer to a lamer or some such person. The actual proper use is more literal. A "tool" is a mechanical device used by humans. A "tool" in the l33t sense is a person who acts as a mechanical device for a corperation or overlord, without thinking. For example, if I go into McDonalds and ask for a tall cup of water and the Minimum wage earning dork says "sorry I cant give you water in a large cup. you can have a kiddie cup of water, or buy $1 Evian water", then that person is a tool.

  102. Re: your sig by shibboleth · · Score: 1

    Obviously, I have answered the q. But like everyone else posting to /., i'm interested in others' opinion as well.

    --
    "Be thankful you are not my student. You would not get a high grade for such a design :-)" - Minix pro
  103. It is the best practice you may have by frog51 · · Score: 2

    For absolutely needing to upgrade your enterprise-wide linux base in a hurry. Which could happen.

    I completely stuffed my first 2 or 3 kernel patches/upgrades/compiles etc, but after a couple of dozen it becomes second nature, and in a stressful (read-Manager/client on your back wanting it done yesterday!!) situation that's what you need.

    Plus, it is kind of fun and interesting.

  104. Re:No, I don't. by SuiteSisterMary · · Score: 2

    Unless it was Apple. Then it still would destroy hard drives. :-)

    --
    Vintage computer games and RPG books available. Email me if you're interested.
  105. Re:Wait a second... by Cro+Magnon · · Score: 1

    Set your comment level to 0 or less and read any stories on /. on ANY subject and you'll get a bunch of bovine fertelizer. Sometimes even browsing at a level of 2 isn't enough to keep it out.

    --
    Slow down, cowboy! It has been 4 hours since you last posted. You must wait another few hours.
  106. MacDOOM? by stonewolf · · Score: 2

    This so called history doesn't seem to have noticed that there is/was a version of DOOM for the Apple Macinstosh.

    It was a great game. My only exposure to the Mac
    was through that game.

    Stonewolf