Slashdot Mirror


Linux 4.0 Has a File-System Corruption Problem, RAID Users Warned

An anonymous reader writes: For the past few days kernel developers and Linux users have been investigating an EXT4 file-system corruption issue affecting the latest stable kernel series (Linux 4.0) and the current development code (Linux 4.1). It turns out that Linux users running the EXT4 file-system on a RAID0 configuration can easily destroy their file-system with this newest "stable" kernel. The cause and fix have materialized, but the fix hasn't yet worked its way into the mainline kernel, so users should be warned before quickly upgrading to the new kernel on systems with EXT4 and RAID0.

226 comments

  1. Linux is clearly unstable! by Anonymous Coward · · Score: 5, Funny

    I'll stick with Windows Vista, thanks.

    1. Re:Linux is clearly unstable! by Anonymous Coward · · Score: 0

      Deflector shields up!

    2. Re:Linux is clearly unstable! by Anonymous Coward · · Score: 0

      I prefer Windows ME. Vista is too buggy.

    3. Re:Linux is clearly unstable! by lsatenstein · · Score: 1

      I'll stick with Windows Vista, thanks.

      I have the 1.44meg and the large 5½inch meg soft floppies, and my msdos 3.1 system just flies. Everything is written in 8086 assembly. I have a small btree application. If it is not broken, why change it. I have accumulated a lifetime supply of spares.

      --
      Leslie Satenstein Montreal Quebec Canada
  2. It's RAID 0 by Anonymous Coward · · Score: 1, Insightful

    Losing data goes with the territory if you're going to use RAID 0.

    1. Re:It's RAID 0 by Anonymous Coward · · Score: 0

      Not necessarily. It could be 0+1 or similar, which gives some of both advantages and would still be affected by this.

    2. Re:It's RAID 0 by Enry · · Score: 2

      I have 4 drives in a RAID 10, so two RAID 1 arrays of two drives each combined together in a RAID 0. I did it mostly because I can add new drives at any time and just chain them onto the RAID 0.

    3. Re:It's RAID 0 by Qzukk · · Score: 1

      Linux's MD raid10 isn't the same as RAID 1+0, so I'm not entirely sure it would be affected by this.

      --
      If I have been able to see further than others, it is because I bought a pair of binoculars.
    4. Re: It's RAID 0 by AvitarX · · Score: 1

      Linux raid 10 does not allow adding the drives. That is the advantage to the GP's setup.

      I always use the raid 10 driver, for the purpose of 3 disk arrays, and the performance benefit of the far layout, but not being able to grow them is annoying.

      --
      Wow, sent an e-mail as suggested when clicking on "use classic" banner, and got a fast response that addressed my msg
    5. Re:It's RAID 0 by kthreadd · · Score: 2, Insightful

      Or it could work just fine. RAID 0 is not dangerous; you may just as well lose your data even if you only use a single drive. Hard drives and SSDs don't go bad so often that it's a problem.

    6. Re:It's RAID 0 by tehlinux · · Score: 1

      The 0 in RAID0 stands for how much data you are going to get back if you have to recover.

      --
      Most linux users don't know this, but the man pages were named after Chuck Norris. Chuck Norris fsck'ing hates noobs!
    7. Re:It's RAID 0 by Anonymous Coward · · Score: 0

      Except statistically adding a second drive adds a second point of failure. Like buying two lottery tickets: double your chances! 2 x diddly-squat is still diddly-squat!

    8. Re:It's RAID 0 by Ksevio · · Score: 1

      You should switch to a RAID5! Then you'd get a little extra capacity while still being protected against 1 drive failing.

    9. Re:It's RAID 0 by Anonymous Coward · · Score: 0

      RAID doesn't protect against data loss. It only protects against disk failures. A faulty motherboard, PSU or waterpipe in the floor above the server room could kill the entire disk array. A filesystem bug or accidental deletion will still cause data loss.
      Backups on a separate system protect against data loss. RAID is for protecting your uptime during disk failures, except for RAID0, which is only for performance.
      In my opinion an unhealthy amount of effort is spent on using RAID to protect data when what most people need is better backup systems. (Or in some cases, start taking backups.)

    10. Re:It's RAID 0 by Swave+An+deBwoner · · Score: 1

      The standard mantra you are chanting is correct. But given the human propensity for failing to do something that they planned to do (regular fine-grained backups), a redundant array (pretty much anything except RAID 0) can mean the difference between losing some valuable data or development work and not losing it if a disk fails.

    11. Re:It's RAID 0 by KingMotley · · Score: 1

      Raid 10 can survive SOME 2-drive failures (in a 4-drive raid 10), and has significantly faster write speeds than Raid 5.

      Personally, I use a combination of RAID-0 and RAID-6 (not the same array), because Raid-5 for large arrays is almost useless. I've seen too many raid-5's die when the bad drive is replaced and the added stress of the rebuild then kills a second drive. Ouch.
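
The "SOME 2-drive failures" part can be counted directly. A minimal sketch, assuming a classic nested RAID 1+0 where drives 2k and 2k+1 mirror each other (this is not how the Linux md raid10 driver lays out data, and the function name is made up for illustration):

```c
#include <assert.h>

/* Count the two-drive failure combinations an array survives.
 * Assumption: nested RAID 1+0, drives 2k and 2k+1 form one mirror pair.
 * The array dies only when both failed drives sit in the same pair. */
static int survivable_two_drive_failures(int n_drives)
{
        int survivable = 0;

        assert(n_drives >= 2 && n_drives % 2 == 0);
        for (int a = 0; a < n_drives; a++)
                for (int b = a + 1; b < n_drives; b++)
                        if (a / 2 != b / 2)     /* failures hit different mirrors */
                                survivable++;
        return survivable;
}
```

For 4 drives there are 6 possible pairs of failed drives, and under this assumption 4 of them leave the array alive.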

    12. Re:It's RAID 0 by sound+vision · · Score: 1

      It's not quite "as well", with RAID 0 if you lose either of two drives, the data is gone. That effectively doubles the chance of failure.
      If you only have the budget to play with 2 drives, you should be using one drive normally and the second drive as an external backup. Not RAID 1, that leaves you open to software/file system errors and the like. Having the backup as a separate drive that's not plugged in except when running backups negates a lot of those failure modes.
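
The "doubles the chance" arithmetic can be checked with a small sketch. It assumes every drive fails independently with the same probability p over whatever period you care about, which is a simplification, and the function name is only illustrative:

```c
#include <assert.h>

/* Probability that a stripe set (RAID 0) loses its data, i.e. that at
 * least one of n_drives fails. Assumes independent failures, each with
 * probability p. */
static double stripe_loss_probability(double p, unsigned n_drives)
{
        double all_survive = 1.0;

        assert(p >= 0.0 && p <= 1.0);
        for (unsigned i = 0; i < n_drives; i++)
                all_survive *= 1.0 - p; /* every drive must survive */
        return 1.0 - all_survive;
}
```

With p = 1% per drive, two striped drives come out at about 1.99%, so "doubles" is a good approximation as long as p is small.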

    13. Re:It's RAID 0 by Forever+Wondering · · Score: 4, Informative

      Based on the commit fixes, it's in a function called raid0_make_request, which is only used in raid0.c
      raid 10 is in raid10.c, so it doesn't use this function.

      The bug is based on the fact that a macro "sector_div" modifies its first argument [and returns the remainder]. I've removed the obligatory backslashes for clarity:

      # define sector_div(n, b)(
      {
              int _res;
              _res = (n) % (b);
              (n) /= (b);
              _res;
      }
      )

      This is used in some fifty files. Some just want the remainder [and they don't want the first arg changed so they do]:

      sector_t tmp = sector;
      rem = sector_div(tmp,blah);

      This is effectively what the code wanted, but the actual fix was to do a restore afterwards:

      sector_t sector = myptr->sector;

      ...
      rem = sector_div(sector,blah);

      ...
      sector = myptr->sector;

      ... // use sector [original value only please ;-)]

      The last line to restore sector with the original value was the fix.
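
To make the side effect concrete, here is a minimal userspace sketch, not the kernel source. It assumes a 64-bit sector_t and a simplified copy of the macro quoted above with the backslashes restored; the two helper names are hypothetical and exist only for illustration. The ({ ... }) form is a GNU C extension, so it needs gcc or clang.

```c
#include <assert.h>
#include <stdint.h>

typedef uint64_t sector_t;      /* assumption: 64-bit sector numbers */

/* Simplified copy of the quoted macro: divides n IN PLACE and
 * evaluates to the remainder. */
#define sector_div(n, b) ({                     \
        uint32_t _res;                          \
        _res = (uint32_t)((n) % (b));           \
        (n) /= (b);                             \
        _res;                                   \
})

/* Fragile pattern: 'sector' is used again after the macro overwrote it. */
static sector_t sector_after_div_buggy(sector_t sector, uint32_t chunk_sects)
{
        assert(chunk_sects != 0);
        (void)sector_div(sector, chunk_sects);  /* sector now holds the quotient */
        return sector;
}

/* Safer pattern: divide a scratch copy and leave 'sector' untouched. */
static sector_t sector_after_div_safe(sector_t sector, uint32_t chunk_sects)
{
        sector_t tmp = sector;

        assert(chunk_sects != 0);
        (void)sector_div(tmp, chunk_sects);     /* only tmp is modified */
        return sector;
}
```

With sector 1000 and a 384-sector chunk, the first helper hands back 2 (the quotient) while the second still hands back 1000.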

      They should do a full code audit as there may be other places that could be a problem. I've reviewed half the files that use this macro and while they're not broken, some of the uses are fragile. I paraphrase: "sector_div considered harmful"

      What they really need are a few more variants which are pure functions that could be implemented as inlines:
      rem = sector_rem_pure(s,n)
      s2 = sector_div_pure(s1,n)

      Or, a cleaner sector_div macro:
      sector_div_both(s,n,sector_return,rem_return)
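
A rough sketch of what such variants might look like as inlines. None of these helpers exist in the kernel as far as I know; the names are the hypothetical ones proposed above, and sector_t is assumed to be 64 bits. This is plain userspace C using / and %; kernel code for 32-bit targets would have to go through its own 64-bit division helpers instead.

```c
#include <assert.h>
#include <stdint.h>

typedef uint64_t sector_t;      /* assumption: 64-bit sector numbers */

/* Remainder only; s is passed by value, so the caller's copy is safe. */
static inline uint32_t sector_rem_pure(sector_t s, uint32_t n)
{
        assert(n != 0);
        return (uint32_t)(s % n);
}

/* Quotient only; again no side effect on the caller's variable. */
static inline sector_t sector_div_pure(sector_t s, uint32_t n)
{
        assert(n != 0);
        return s / n;
}

/* Both results through explicit out-parameters, so it is obvious at
 * the call site which variables get written. */
static inline void sector_div_both(sector_t s, uint32_t n,
                                   sector_t *sector_return,
                                   uint32_t *rem_return)
{
        assert(n != 0);
        *sector_return = s / n;
        *rem_return = (uint32_t)(s % n);
}
```

The design point is that nothing is modified unless the caller passes a pointer to it.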

      --
      Like a good neighbor, fsck is there ...
    14. Re: It's RAID 0 by cthulhu11 · · Score: 1

      Beat me to it. RAID5 is almost always the wrong answer.

    15. Re:It's RAID 0 by Anonymous Coward · · Score: 0

      RAID 0 has no redundancy so calling it RAID is not only incorrect, it is stupid.

  3. Which RAID are they referring to? by fang0654 · · Score: 1

    Article isn't very clear - are they referring to softraid, fakeraid, and/or hardware raid?

    1. Re:Which RAID are they referring to? by Anonymous Coward · · Score: 0

      Softraid, obviously.

    2. Re:Which RAID are they referring to? by bakaorg · · Score: 5, Informative

      md raid. The bug was in the commit "md/raid0: fix bug with chunksize not a power of 2", causing, you guessed it, a bug with a chunksize not a power of two. I guess "fix" was a bit diversionary.

      The real problem was a macro modifying arguments that were later expected to be the unmodified version.
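
Why the chunk size matters, as a hedged userspace sketch rather than the actual raid0.c code: with a power-of-two chunk size, the offset inside a chunk is a cheap bit mask, while any other size needs a real division, which is where a macro like sector_div comes into play. The function names are illustrative.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* True when n has exactly one bit set. */
static bool is_power_of_2(uint64_t n)
{
        return n != 0 && (n & (n - 1)) == 0;
}

/* Offset of 'sector' inside its chunk, with chunk_sects given in sectors. */
static uint32_t offset_in_chunk(uint64_t sector, uint32_t chunk_sects)
{
        assert(chunk_sects != 0);
        if (is_power_of_2(chunk_sects))
                return (uint32_t)(sector & (chunk_sects - 1));  /* mask path */
        return (uint32_t)(sector % chunk_sects);                /* division path */
}
```

Both paths give the same kind of answer; only the second one involves a division, so only it can be hurt by a macro that rewrites its argument.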

    3. Re:Which RAID are they referring to? by Anonymous Coward · · Score: 1, Informative

      That patch also says that the bug exists since Linux 3.14. So it is NOT 4.0, but every kernel since 3.14 that is affected here.

    4. Re:Which RAID are they referring to? by msauve · · Score: 4, Informative

      No. There was a minor bug introduced at 3.14. The patch to fix that, completely different issue, went into 4.0 and caused this corruption issue.

      --
      "National Security is the chief cause of national insecurity." - Celine's First Law
    5. Re:Which RAID are they referring to? by MSG · · Score: 3, Informative

      That fix is actually in the wrong place. The fix for that is tracked in kernel.org's bugzilla # 98501. I'm not linking directly as linking to bugzilla tends to place too high a load on those systems. It's impolite.

      Neil Brown said that he'd push the fix to Linus "shortly" at 2015-05-20 23:06:58 UTC. I still don't see the fix in Linus' tree.

      Watch for a fix titled "md/raid0: fix restore to sector variable in raid0_make_request"

    6. Re:Which RAID are they referring to? by nightsky30 · · Score: 1

      Hmmmm, let's call it RAID PI

  4. The disks usually don't ... by CaptainDork · · Score: 1, Offtopic

    ... need to be debugged, so using Raid® is probably the cause of this.

    --
    It little behooves the best of us to comment on the rest of us.
    1. Re:The disks usually don't ... by Culture20 · · Score: 1

      You laugh, but the first computer bug was a moth.
      https://www.youtube.com/watch?...

    2. Re:The disks usually don't ... by Anonymous Coward · · Score: 0

      It's a myth that the moth was the "first computer bug". The commonly cited log entry is pretty obviously a joke based on the fact that "bug" was already in use at the time.

    3. Re:The disks usually don't ... by Anonymous Coward · · Score: 1

      No, the cited log entry shows that the word "bug" was in use, but that they weren't commonly "found". That's because at the time "bug" meant some kind of mysterious issue with electronics, aka gremlin. The joke was that a bug had been *found*, at last. Computer bugs are often found, implying that this was the first application of the word "bug" to computer issues (with actual solutions) i.e. the first "computer bug". It's also where "debugging" was coined.

      Fairly well explained here.

  5. stable by rossdee · · Score: 4, Funny

    this is obviously some strange usage of the word "stable" that I wasn't previously aware of.

    1. Re:stable by Anonymous Coward · · Score: 5, Funny

      If you ever owned horses, you would understand what "stable" means in this context

    2. Re:stable by Anonymous Coward · · Score: 0

      I don't see any discrepancy. At least in the Wikipedia article the horses look a bit quirky.

    3. Re:stable by Anonymous Coward · · Score: 0

      this is obviously some strange usage of the word "stable" that I wasn't previously aware of.

      In this context, "stable" is short for "linux stable" which has an entirely different definition.

    4. Re:stable by Deep+Esophagus · · Score: 2

      This. My first thought upon reading TFS was, how did this ever pass peer review and testing to get into the "stable" kernel? They do still perform peer review and unit testing, don't they?

    5. Re:stable by Trevelyan · · Score: 5, Informative

      It's stable as in terms of features and changes. i.e. No longer under development and will only receive fixes.

      However! Kernels from kernel.org are not for end users, if someone is using these kernels directly then they do so at their own risk.
      They are intended for integrators (distributions), whose integration will include their own patches/changes, testing, QA and end user support

      There is a reason that RHEL 7 is running Kernel 3.10 and Debian 8 is running 3.16. Those are the 'stable' kernels you were expecting.

      When kernel development moved from 2.5 to 2.6 (that later became 3.0), they stopped their odd/even number development/stable-release cycle. Now there is only development, and the integrators are expected to take the output of that to create stable-releases.

    6. Re:stable by dave420 · · Score: 3, Insightful

      I understand if you are emotionally attached to Linux to the point where someone accidentally criticising it makes you feel uncomfortable, but you really should be able to figure out that "but... but... they're worse!" is no rational response :)

    7. Re: stable by Anonymous Coward · · Score: 0

      Most Linux users don't have that kernel so the world should keep spinning for us. Meanwhile, my Win keeps BSOD. Wish I could just change kernel to fix, that would be easy.

    8. Re:stable by Anonymous Coward · · Score: 1

      Microsoft has made bigger blunders, and then refused to fix them.

      So what? Saying that does not lessen the severity of this Linux bug.

    9. Re:stable by Anonymous Coward · · Score: 0

      It's really not that severe, that is cutting edge. People who use cutting edge get cut. Linux users who need their RAID do not have that Kernel.

    10. Re:stable by Anonymous Coward · · Score: 0

      [Citation Needed]

    11. Re:stable by Anonymous Coward · · Score: 0

      So, roughly like "eBay mint", then? (I've finally accepted that on eBay, "mint" actually refers to the flavor, not the condition.)

    12. Re: stable by andydouble07 · · Score: 1, Insightful

      Meanwhile, my Win keeps BSOD.

      Really? Sounds like you're screwing something up pretty bad, haven't seen one of those in about 6 or 7 years.

    13. Re: stable by Anonymous Coward · · Score: 0

      Anyone who consistently pushes their machines limits- and runs windows- will find the BSOD. If you do NOTHING, you will likely be ok until the registry corrupts. ENJOY.

    14. Re:stable by Anonymous Coward · · Score: 0

      No matter how bad things are, it's still far more stable than Typo3 CMS or any of its "stable" extensions. So, from that perspective this is a win!

    15. Re: stable by jbengt · · Score: 2

      I can routinely cause a BSOD (about 1/3 of the time) on my HP laptop running Windows 7 Pro if I use the touchpad at the log-in screen on start-up. It's apparently a known bug in the touchpad driver that will not get fixed.

    16. Re:stable by duke_cheetah2003 · · Score: 1

      This. My first thought upon reading TFS was, how did this ever pass peer review and testing to get into the "stable" kernel? They do still perform peer review and unit testing, don't they?

      Testing? Who does that anymore? That is the user's job.

      MMO's and Microsoft have made it so.

    17. Re: stable by Anonymous Coward · · Score: 0

      So it's a bug in third party software that is crashing the os. Why is it MS's responsibility to fix it?

    18. Re: stable by oobayly · · Score: 3, Insightful

      It's not. However it isn't beyond a reasonable expectation that a dodgy touchpad driver shouldn't be able to kill an OS.

    19. Re:stable by Anonymous Coward · · Score: 0

      Sometimes people become irrational when someone criticizes their favorite computing system or other stuff they love.

    20. Re: stable by Anonymous Coward · · Score: 0

      I can write you a driver that when you use the touchpad while linux is starting up it crashes it too. And it'll be linux's fault.

    21. Re: stable by Anonymous Coward · · Score: 0

      Then you should start using a microkernel based OS. On any monolithic OS a faulty driver can bring down the whole system.

    22. Re: stable by jeremyp · · Score: 1

      I can write you a touchpad driver that will crash Linux even if you don't use the touchpad on start up. That Linus Torvalds really doesn't know what he's doing.

      --
      All I want is a secure system where it's easy to do anything I want. Is that too much to ask ~~ Randall Munroe
    23. Re: stable by jeremyp · · Score: 1

      On every popular PC operating system in use today (and Linux), the kernel and the drivers share the same address space. It is thus beyond reasonable expectation for the OS to be able to protect itself from faulty drivers. Although Linux was designed this way from the start, OS X and Windows both started as microkernels (where the drivers have their own address space) but were "downgraded" to monolithic kernels because of performance concerns. Switching between address spaces has an enormous cost.

      --
      All I want is a secure system where it's easy to do anything I want. Is that too much to ask ~~ Randall Munroe
    24. Re: stable by drinkypoo · · Score: 1

      On every popular PC operating system in use today (and Linux), the kernel and the drivers share the same address space.

      There are linux drivers which run in userland.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    25. Re:stable by Anonymous Coward · · Score: 0

      While that is bad, it is not in any way related to the issue at hand.

    26. Re: stable by Anonymous Coward · · Score: 0

      You are incredibly lucky. BSOD's still affect the majority of people, although admittedly many of those are caused by hardware failure. Linux still tends to fare better when faced with hardware failure though; you may get strange behaviour depending on the component, but it won't instantly flog your system as Windows does.

  6. Warning: RAID 0 by Culture20 · · Score: 2, Interesting

    RAID 0 is unstable to begin with. Medium case scenario here (for legitimate use) is some data gets corrupted on a compute node. Run the program on two nodes; if you get the same result on both, that result is probably fine. If you're running RAID0 on any filesystem that isn't temporary or at least easily replaceable, you're doing it wrong.

    1. Re:Warning: RAID 0 by Enry · · Score: 2, Insightful

      RAID 0 is only as unstable as its least stable component. In this case it's most likely a drive failure, and most drives have fairly long MTBFs. The chances of a disk failure increase as a function of time and number of drives deployed. A two-drive RAID 0 will be more stable than a five-drive RAID 0 which will be more stable than a 10 drive RAID 0 that's three years old. In the case of higher RAID levels, you can remove a single (or multiple) drive failure as the point of failure. In this case, the point of failure is the kernel, so it's perfectly legitimate to consider this a really bad problem. Would you say the same thing if the bug affected RAID 1 or RAID 5?

    2. Re:Warning: RAID 0 by houstonbofh · · Score: 0

      RAID0 is a component of RAID10 and RAID0+1, which are considered both secure and stable. (Except on Linux 4.0)

    3. Re:Warning: RAID 0 by danbob999 · · Score: 1

      There is no valid reason for corruption to occur on RAID0 any more than on any other setup. The problem of RAID0 is data loss (drive failure).

    4. Re:Warning: RAID 0 by nine-times · · Score: 4, Insightful

      Would you say the same thing if the bug affected RAID 1 or RAID 5?

      I suspect not, since his point seemed to be that you shouldn't be using RAID 0 for data that you care about anyway.

      It doesn't really make it ok for a bug to exist that destroys RAID 0 volumes, but it does mitigate the seriousness of the damage caused. And it's true: Don't use RAID 0 to store data that you care about. I don't care if the MTBF is long, because I'm not worried about the mean time, but the shortest possible time between failures. If we take 1,000,000 drives and the average failure rate is 1% for the first year, it's not that comforting to the 1% of people whose drives fail in that first year.

    5. Re:Warning: RAID 0 by TheCarp · · Score: 1

      I have been running a 4 disk RAID 5 array for a few years now at home, and did a replacement upgrade a couple of years back.

      Overall I find in a 4 disk scenario I lose just a bit less than one disk per year. Maybe one disk every year and a half.

      So when you say RAID 0 that is 3 years old, that sounds about right. I would call such an array in serious danger of loss.

      --
      "I opened my eyes, and everything went dark again"
    6. Re:Warning: RAID 0 by AthanasiusKircher · · Score: 1

      Would you say the same thing if the bug affected RAID 1 or RAID 5?

      I suspect not, since his point seemed to be that you shouldn't be using RAID 0 for data that you care about anyway.

      Exactly. About the only reason I would ever use RAID 0 is for some sort of temp data drive where for some reason I wanted to string multiple drives together. You've basically taken a bunch of drives that each would be vulnerable without redundancy and have produced one big drive that will fail whenever any component does, thereby greatly increasing failure rate over individual drive failure rate. There are only a limited set of use cases where this is a helpful thing, and basically all of them are situations where you can assume that 100% data loss won't hurt you.

      It doesn't really make it ok for a bug to exist that destroys RAID 0 volumes, but it does mitigate the seriousness of the damage caused.

      Well, it mitigates the seriousness of the damage a bug should cause, assuming that people use RAID reasonably. But that's obviously a poor assumption, since many people seem to love playing Russian roulette with essential data.

      I was lucky enough to have a significant (but not catastrophic) data loss hit me when I was young and didn't have a lot of essential data to lose. It taught me the importance of redundancy and backups. Most people who haven't experienced such things are more cavalier with data -- and a RAID 0 bug could be catastrophic for them.

    7. Re:Warning: RAID 0 by Anonymous Coward · · Score: 0

      I use md RAID-0 with 2 SSDs and ext4 on top. It's really fast: write 1GB/s, read 6GB/s linear. Perfect for development and heavy git use with large repos.

      But of course I also bit the dust when trying 4.0 once... I lost almost all git repos (no work was lost though), kernel trees, portage trees, some binaries too. It was quickly fixed though. The git repos were quickly reobtained from the server, kernels checked out again, portage resynced and the corrupt binaries replaced by recompiling them with emerge (Gentoo). No actual work was lost, just one morning worth of time.

    8. Re:Warning: RAID 0 by iONiUM · · Score: 1

      For the record, I have a 6 year old machine running Windows 7 with a RAID-0 setup (asus p5k-e motherboard, WD 250gb drives), and it has never had an issue. It is typically on 24/7, but it has gone through many power outages where the UPS ran out of battery and it hard-reset.

      I do, of course, keep all data on a separate regular drive, along with an external back-up of that. So if the RAID-0 did die, it wouldn't be a big deal (and I could finally move to SSD!).

      Anyways, the point I am trying to make is that RAID-0 is not as "crazy unreliable" as some people would have you believe.

    9. Re:Warning: RAID 0 by nine-times · · Score: 2

      Well, it mitigates the seriousness of the damage a bug should cause, assuming that people use RAID reasonably.

      I'm going to go ahead and say that it mitigates the seriousness of the damage caused in actuality since most IT people entrusted with serious and important data aren't going to be that stupid. I mean, yes, I've seen some pretty stupid things, and I've seen professional IT techs set up production servers with RAID 0, but it's a bit of a rarity. There could still be some serious damage, but much less than if it were a bug affecting RAID 5 volumes.

    10. Re:Warning: RAID 0 by Anonymous Coward · · Score: 0

      Speak for yourself. My home server consists of two 4TB disks in a RAID-0 using md. I have nightly rsync backups going to USB for my most important 1 TB of data. I have been using this setup since 4TB became available and it's been rock solid for me. I guess this means that I'm not taking this server to the 4.0 kernel, since I can't see spending the money to buy extra drives (for other RAID modes) or a tower capable of running extra drives.

    11. Re:Warning: RAID 0 by Anonymous Coward · · Score: 0

      That's odd, my 6 disk RaidZ2 array is going with no failures in 2 years - when should I expect to start replacing drives?

    12. Re:Warning: RAID 0 by Anonymous Coward · · Score: 0

      If you buy a USB cable (and use it appropriately) it will gracefully shut down instead of faceplanting.

    13. Re:Warning: RAID 0 by Anonymous Coward · · Score: 0

      Hey, Y'all, watch this!

      FTFY. You've been lucky. Make sure you rotate those rsyncs on the USB, making hard links for duplicates where appropriate. It is a 3-4TB USB disk, right? And make some full backups elsewhere. An introduced error or two could ruin your important 1TB of data for a long stretch of backups. A RAID-1 setup would give you more assurance of continuous uptime, but would do nothing for your backups. I guess that depends on how important the extra 4TB of free space is worth.

    14. Re:Warning: RAID 0 by AthanasiusKircher · · Score: 1

      I'm going to go ahead and say that it mitigates the seriousness of the damage caused in actuality since most IT people entrusted with serious and important data aren't going to be that stupid.

      And that's where your assumptions are different from mine. I was discussing people who are probably NOT "entrusted with serious and important data," but nevertheless have their own personal data (which they think is at least somewhat valuable) and choose to run a RAID 0 setup because of some stupid reason, like it makes their system run a bit faster.

      (Well, that's not a completely stupid reason, but it is a reason to have a good backup strategy for essential files and to segregate your data so only the minimum files are at risk on RAID 0. Many people don't worry about these things until it's too late.)

      If you doubt such people exist, do an internet search or read some gamer forums. And given such people are more likely to be running a bleeding-edge new version of software than a IT pro with a server who does thorough stability testing before upgrades, I'd imagine that a bug like this will disproportionately affect those hobbyists.

      I'm not talking about IT pros here. I'm talking about random idiots who run RAID without thinking of the consequences. For them, this bug could be really serious.

    15. Re:Warning: RAID 0 by nine-times · · Score: 1

      If you doubt such people exist, do an internet search or read some gamer forums.

      I think you missed my point. I don't doubt such people exist. I doubt such people are generally safeguarding information that I think is important.

    16. Re:Warning: RAID 0 by TheCarp · · Score: 1

      Hard to say but it will happen eventually. I have seen it go a few years, then lose 2 within a few months. Always make sure monitoring works and will alert you if it's degraded. You can run degraded mode for a long time without monitoring.....till the next one fails.

      They are mechanical, so manufacturing quality and environment will factor in. My drives likely see a lot of shake and heat being on the third floor of a 100 year old house, between the wind, the washing machine and seasonal heat.... it's no data center in here.

      --
      "I opened my eyes, and everything went dark again"
    17. Re:Warning: RAID 0 by Enry · · Score: 1

      I suspect not, since his point seemed to be that you shouldn't be using RAID 0 for data that you care about anyway.

      I meant, what if there was a bug in the RAID 5 code that caused similar corruption? This is equivalent (almost) to blaming the victim. Yes, you did risky behavior, but the problem wasn't caused because of the risky behavior.

    18. Re:Warning: RAID 0 by Enry · · Score: 1

      I was really just throwing out drives and times. I had name-brand systems that were in a RAID 0 to consolidate two drives (the drive contents were expendable since this was just scratch space) and they ran for many years with few failures.

    19. Re:Warning: RAID 0 by nine-times · · Score: 1

      I meant, what if there was a bug in the RAID 5 code that caused similar corruption?

      Yes, I understood. And I was saying, yes, it seems clear that we would all care more if it were a problem with RAID 5.

      I understand that you think "we would respond differently if this were RAID 5" is a sign of hypocrisy or something. But it's not really that.

      It's a little like saying, "There was a design flaw in trash cans that caused items stored in the trash can to be damaged." And people respond by saying, "Yeah, well... that's not great, but it could be worse. Things stored in trash cans are usually things nobody cares about anyway."

      And then you say, "Would you respond differently if this trash can problem were discovered in long-term storage bins?"

      And so the response is, "Yes, we would care more about that. Of course we would all care more about that. Because people probably care about things in long-term storage bins, and usually put trash in trash cans. I understand that someone somewhere may be storing their valuable family heirlooms in trash cans, but they probably shouldn't be doing that."

    20. Re:Warning: RAID 0 by Enry · · Score: 1

      I understand that you think "we would respond differently if this were RAID 5" is a sign of hypocrisy or something. But it's not really that.

      Yes it is, and that's a very short sighted approach. I hope you're not a developer.

    21. Re:Warning: RAID 0 by nine-times · · Score: 1
      As I said:

      It doesn't really make it ok for a bug to exist that destroys RAID 0 volumes, but it does mitigate the seriousness of the damage caused.

    22. Re:Warning: RAID 0 by styrotech · · Score: 1

      Linux md RAID10 is a 'non standard' single level layout that does not have a RAID0 component/layer.

      There are 3 layouts available for it, one of which can mimic the underlying block layout of the 'standard' layered/nested RAID10.

    23. Re: Warning: RAID 0 by AvitarX · · Score: 1

      This is an important concept. Corruption of temporary / high performance scratch data can have terrible consequences, when the loss may not be that big of a deal.

      I'm shocked at how people have trouble grasping that.

      --
      Wow, sent an e-mail as suggested when clicking on "use classic" banner, and got a fast response that addressed my msg
    24. Re:Warning: RAID 0 by strikethree · · Score: 1

      Erm, uh... what about the situation where you are running 4 drives in RAID 0 but the pairs are RAID1?

      I keep forgetting if that is called RAID10 or RAID0+1. Meh. Regardless, RAID0 is not just used for unimportant data.

      --
      "Someone needs to talk to the tree of liberty about its ghoulish drinking problem." by ohnocitizen
    25. Re:Warning: RAID 0 by nine-times · · Score: 1

      Yeah, that's RAID 10. If this bug affects RAID 10, then it's a bigger deal than if it only affects RAID 0.

    26. Re:Warning: RAID 0 by Anonymous Coward · · Score: 0

      If you doubt such people exist, do an internet search or read some gamer forums. And given such people are more likely to be running a bleeding-edge new version of software than a IT pro with a server who does thorough stability testing before upgrades, I'd imagine that a bug like this will disproportionately affect those hobbyists.

      gamers won't be hit by this defect anyway, they don't run linux

    27. Re:Warning: RAID 0 by Anonymous Coward · · Score: 0

      What if your dad pulled out earlier?

  7. Why ext4 by silas_moeckel · · Score: 2

    If you're running a brand spanking new kernel with data you do not care about, why use an old FS? There are plenty of newer, better FSs to choose from.

    --
    No sir I dont like it.
    1. Re:Why ext4 by Anonymous Coward · · Score: 0

      Looks like ext4 was introduced in 2008. What are some prominent FSs that are newer than that?

    2. Re:Why ext4 by Anonymous Coward · · Score: 0

      If you're running a brand spanking new kernel with data you do not care about, why use an old FS? There are plenty of newer, better FSs to choose from.

      Ext4 should in theory be the best choice. It's widely used and has broad enterprise support. Lots of business people get angry if it does not work properly.

    3. Re:Why ext4 by pla · · Score: 1

      Ext4 should in theory be the best choice. It's widely used and has broad enterprise support. Lots of business people get angry if it does not work properly.

      On a modern system with multiple disks you want to configure as some variety of soft-RAID, ZFS hands-down counts as the clear best choice (short of going for a "cluster" FS). It allows an arbitrary number of extra parity drives (think "RAID 8"), as well as arbitrarily many hot spares; it quickly and easily recovers from having someone pull out all your drives, shuffle them around, and put them back in (for a more real-world version of that: ever updated your BIOS only to find all your drives remapped?); it detects and (usually) corrects corrupted files; it supports online snapshotting and exporting snapshots to another system; it uses dynamically sized storage pools rather than fixed "partitions", and can even grow the underlying total available space.

    4. Re:Why ext4 by houstonbofh · · Score: 1

      Name one that actually boots the Linux kernel, and doesn't just run in user space. (Yes, I am a fan of ZFS, but not the Linux implementation.)

    5. Re:Why ext4 by silas_moeckel · · Score: 1

      XFS, for starters; it's the default nowadays on RHEL/CentOS.

      --
      No sir I dont like it.
    6. Re:Why ext4 by Anonymous Coward · · Score: 0

      why an old FS

      Because they're more stable.

      Oh, wait ...

    7. Re:Why ext4 by Enry · · Score: 1

      You can't remove drives from a ZFS pool - once they're in (even if you have free space on other drives), the number of drives can't go down. Which really bothers me. With LVM you can evacuate data off of drives and shrink the pv. LVM in itself isn't a filesystem, but if you think of a pool as an LVM volume the functionality is somewhat similar.

    8. Re:Why ext4 by Anonymous Coward · · Score: 0

      XFS isn't newer than ext4.

    9. Re:Why ext4 by Rich0 · · Score: 2

      The problem is that the feature-list for ZFS is very enterprise-oriented.

      Why would you want to add just one drive to a server with 5x 6-drive RAID6 arrays? Just add another 6 drives at a time.

      On the other hand, if you have a PC with 3 drives in RAID5, you could easily want to turn that into a 4-drive RAID5 or a 5-drive RAID6 in-place.

      Btrfs has a lot of features that are useful for smaller deployments, like being able to modify the equivalent of a vdev in-place. ZFS on the other hand has a lot of features like ZIL that are very useful for larger deployments.

    10. Re:Why ext4 by Pope+Hagbard · · Score: 1

      It's newer than ext2, which ext4 is based on.

    11. Re:Why ext4 by Anonymous Coward · · Score: 0

      docker containers

    12. Re: Why ext4 by Anonymous Coward · · Score: 0

      I liked ReiserFS, but then he decided to kill his wife, which I understand, what I don't understand is why he stopped developing his FS which, IMO, was great.

    13. Re:Why ext4 by Anonymous Coward · · Score: 0

      ... and can't ever shrink the underlying total available space.

      FTFY

    14. Re:Why ext4 by goarilla · · Score: 1

      Why would you want to add just one drive to a server with 5x 6-drive RAID6 arrays? Just add another 6 drives at a time.

      ZFS isn't ideal for growing like that since it doesn't do rebalancing. Your younger raid arrays will always have more data on them.
      Also zfs destroy is very expensive.

    15. Re:Why ext4 by goarilla · · Score: 1

      *Your older raid arrays*

    16. Re:Why ext4 by pla · · Score: 1

      I will readily admit that as a "shortcoming" of ZFS, but honestly, I don't quite see any obvious use cases for it. On the short term (months), I've only ever needed to *add* storage, never remove it.

      On the longer term (years), I have found that I go back and forth on how many drives I need, but when I do eventually upgrade my home NAS to bigger and better hardware, I don't even try to salvage old drives 1/10th the size of modern ones - I bring up the new system, with however many brand new drives I consider appropriate, and clone the old one to the new one.

    17. Re:Why ext4 by fnj · · Score: 2, Informative

      Name one that actually boots the Linux kernel, and doesn't just run in user space. (Yes, I am a fan of ZFS, but not the Linux implementation.)

      You really should get out more. ZFS on Linux is not to be confused with the ZFS Fuse project. You can boot from a ZoL filesystem. In general ZoL is about as stable, complete, and reliable as any ZFS.

    18. Re: Why ext4 by Anonymous Coward · · Score: 0

      https://en.wikipedia.org/wiki/ReiserFS#Criticism
      He stopped because prison. Everyone else stopped because it wasn't that great.

    19. Re: Why ext4 by wed128 · · Score: 2

      ReiserFS predates ext4, and it's hard to be an active software developer in prison.

    20. Re: Why ext4 by kthreadd · · Score: 1

      If the file system actually was great I'm sure someone else would pick it up, but I don't know if it was that great.

    21. Re:Why ext4 by kthreadd · · Score: 1

      XFS isn't newer than ext4.

      Depends on how you look at it. XFS is continuously developed and improved; they just don't stick a version number after it like the ext developers.

    22. Re:Why ext4 by Rich0 · · Score: 1

      Why would you want to add just one drive to a server with 5x 6-drive RAID6 arrays? Just add another 6 drives at a time.

      ZFS isn't ideal for growing like that since it doesn't do rebalancing. Your younger raid arrays will always have more data on them.
      Also zfs destroy is very expensive.

      Perhaps, but my point was more that if you want to grow ZFS this is the ONLY way to actually do it, as far as I'm aware. You can't add individual drives to individual "vdevs."

    23. Re: Why ext4 by Anonymous Coward · · Score: 0

      You owe me one new keyboard.

    24. Re:Why ext4 by Anonymous Coward · · Score: 0

      NetApp must be thrilled, because it seems the ZFS people have re-invented RAID-DP.

    25. Re:Why ext4 by goarilla · · Score: 1

      Well you can grow the underlying component devices of the vdev. But yes even the "perfect" ZFS has its weaknesses.

    26. Re:Why ext4 by pla · · Score: 1

      Perhaps, but my point was more that if you want to grow ZFS this is the ONLY way to actually do it, as far as I'm aware. You can't add individual drives to individual "vdevs."

      You can replace all the drives in the array with bigger ones, resilvering after each replacement, and when you get to the last one, poof, you magically have a bigger pool. I certainly won't claim that as terribly efficient, though. :)

      It has its shortcomings, no doubt. But compared to old-school RAID or even LVM, it takes a huge step forward.
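
      To see why the extra space only shows up after the last swap, here is a back-of-the-envelope model. It assumes a single parity-style vdev whose usable space is roughly (number of disks minus parity) times the smallest disk, and it ignores metadata and padding overhead. The function names are made up for illustration:

```python
def raidz_usable(disk_sizes_tb, parity=1):
    """Approximate usable space: the smallest disk limits every member."""
    return (len(disk_sizes_tb) - parity) * min(disk_sizes_tb)


def replace_one_by_one(old_sizes_tb, new_size_tb, parity=1):
    """Usable space before any swap and after each successive swap."""
    sizes = list(old_sizes_tb)
    history = [raidz_usable(sizes, parity)]
    for i in range(len(sizes)):
        sizes[i] = new_size_tb  # swap one disk, then resilver
        history.append(raidz_usable(sizes, parity))
    return history


if __name__ == "__main__":
    # Four 2 TB disks replaced by 6 TB disks, single parity.
    print(replace_one_by_one([2, 2, 2, 2], 6))
```

      In this model the capacity stays flat through every intermediate step, because the one remaining small disk still caps the whole vdev.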

    27. Re: Why ext4 by Anonymous Coward · · Score: 0

      It was great... until the other filesystems caught up while it was not under development.

    28. Re:Why ext4 by Rich0 · · Score: 1

      Sure, but with btrfs you can just add one drive and sometimes get its entire capacity added to your array - it works fine with mixed-size disks.

      Of course, it might just decide not to boot the next day, and that is the downside to btrfs. It does tend to be a bit more friendly in scenarios where you have a small number of disks, though, which was my main point.
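
      As a rough illustration of the mixed-size point, assume a two-copy mirror profile where every chunk must live on two different disks. Usable space is then about the smaller of half the raw total and the raw total minus the largest disk. This ignores metadata and chunk granularity, and `raid1_usable` is a made-up name:

```python
def raid1_usable(sizes_tb):
    """Approximate usable space when every chunk is stored on two disks."""
    total = sum(sizes_tb)
    # The largest disk can only be mirrored against the space on the others.
    return min(total / 2, total - max(sizes_tb))


if __name__ == "__main__":
    before = raid1_usable([4, 4])
    after = raid1_usable([4, 4, 2])
    print(before, after, after - before)
    # A lopsided set wastes space on the big disk.
    print(raid1_usable([8, 1, 1]))
```

      Adding a 2 TB disk to two 4 TB disks contributes all of its raw space (halved by mirroring) in this model, while a single large disk paired with small ones leaves part of the large disk unusable.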

    29. Re:Why ext4 by Rich0 · · Score: 1

      Agree, as the other reply pointed out as well. And you can do the same with mdadm raid too (though obviously with none of the benefits btrfs/zfs bring for data integrity, like checksumming and copy-on-write). Mdadm will also let you reshape an array in place (that is, change raid levels or the number of disks), though with mdadm that will often mess up your stripe alignment, and of course it is more likely to eat your data if something goes wrong, since if it finds a parity mismatch it has no way to know which copy is bad.

      I was just commenting that btrfs tends to have a lot of features that appeal to small system users that you'll actually find missing on zfs, even if it is far less mature overall, and lacking in many enterprise-scale features. It just reflects the emphasis of the developers behind it.

      I really can't complain about zfs - it is a great filesystem. However, things like not being able to reshape an array or mix disk sizes in an array are some of the things that hold me back from adopting it. Heck, btrfs will let you switch from raid1 to raid5 without touching any of the data already written - newly-allocated chunks will use raid5 and existing chunks will continue to use raid1 - it doesn't manage arrays at the whole-device level. In practice though you're likely to tell it to rebalance your data of course.

    30. Re:Why ext4 by sjames · · Score: 1

      You're thinking of the ZFS that goes through FUSE. There is also ZFS on Linux that runs as kernel modules like any other fs.

      There's also btrfs.

      Of course, neither of those needs the md driver at all, they have their own raid like systems.

    31. Re:Why ext4 by Anonymous Coward · · Score: 0

      Isn't it dangerous to use ZFS without ECC memory? I had wanted to upgrade my backup drives to a filesystem that automatically verified the files, but if part of your memory goes bad the filesystem gladly destroys all your data for you. Did I understand that correctly?

    32. Re:Why ext4 by sjames · · Score: 1

      I'm using ZFS in production for now but I'm actively testing btrfs for that reason among others.

    33. Re:Why ext4 by houstonbofh · · Score: 2

      If you trace it back, all of that fear originates from one post on the FreeNAS forums. A post from one of the key developers says that you should use ECC for any server with critical data, but ZFS is neither more nor less sensitive to it.

    34. Re:Why ext4 by sjames · · Score: 1

      In a production server, I can see value in stepwise evacuating old drives and then swapping them for new larger drives only once the data is stable on the filesystem. Done right you could pull it off with zero downtime without opening a window where a single failure brings you down.

    35. Re: Why ext4 by sjames · · Score: 1

      It had some REALLY ugly failure modes.

    36. Re:Why ext4 by jabuzz · · Score: 1

      If you really care about the checksumming then switch to SAS/FC and use the Data Integrity Field. ZFS fixes something that has already been fixed years before it even came into existence. It offers better protection from silent corruption than ZFS does as well, because with ZFS you have no guarantee that what you send to the disk is what actually gets written, DIF mitigates against that.

    37. Re:Why ext4 by Rich0 · · Score: 1

      Somehow I doubt that is going to work on my $70 Seagate 7200RPM hard drive plugged into my $50 motherboard or $20 SATA controller card. However, by all means let me know if it will.

      Plus, COW filesystems offer a lot more than just data checksumming.

    38. Re:Why ext4 by goarilla · · Score: 1

      So DIF is both an on disk checksum and an inflight error check like UDMA CRC ?

    39. Re:Why ext4 by goarilla · · Score: 1

      Also the wikipedia article https://en.wikipedia.org/wiki/... makes it look like it's a failed/superseded technology. Is it still being used ?

    40. Re: Why ext4 by drinkypoo · · Score: 1

      It was great... until the other filesystems caught up while it was not under development.

      It was great... until it went on a rampage and murdered your data. I kid, but I'm also serious. When it was in current development, no other fs was as efficient with small files, and there's a lot of those on the average Unix system so that's of great interest. But it also was the least reliable filesystem in common use. So it was really never worth using.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    41. Re: Why ext4 by vilanye · · Score: 1

      That should be fixed.

      Sadly, there are a lot of skilled and educated people in prison just rotting away. They could still be very useful and productive.

    42. Re:Why ext4 by Anonymous Coward · · Score: 0

      You really really really should not be.
      ZFS design is f**** up, and it's hidden by marketing bs.
      UNLESS of course, there is only ONE and SINGLE I/O happening at any given time, and that single IO is always sequential. Then it's amazing.

      Technical reason:
      ZFS is designed to activate all drives every time for every single IO, and utilize every single drive for even the smallest IO. End result is IOPS is pathetic even by the 90's standards.

      Further, it has no fault tolerance where it matters the most: drives fail, and they tend to fail in bunches. If you have, say, a 32-drive array with parity 3 and 4 drives fail at the same time (not inconceivable, and it can even happen on smaller arrays), it will happily continue writing to the remaining disks, corrupting your filesystem.
      Never mind that, due to the aforementioned design failure, it would be stupid to make a 32-drive array, since you get 1 drive's worth of IOPS out of it on large requests, and on small requests you are lucky to peak at 6 drives' worth of IOPS. At the end of the day, performance is dictated by IOPS.

    43. Re: Why ext4 by wed128 · · Score: 1

      Part of what makes prison a punishment is that you're disallowed to be productive...

    44. Re: Why ext4 by vilanye · · Score: 1

      That is stupid and wasteful.

      Prison is supposed to be rehabilitation, treating it as punishment is counter-productive for both society and the inmate.

  8. Two issues in play? by jones_supa · · Score: 1

    There seems to be a fix in RAID code and a fix in Ext4 code.

    The latter was incorporated in Linux 4.0.3 (changelog), and according to the Phoronix article the RAID bug is still unfixed.

  9. New version ... by JasterBobaMereel · · Score: 5, Insightful

    This is the new 4.0 kernel, a major version update, less than a month old, that most Linux systems will not have yet... and the issue has already been patched.

    Bleeding edge builds get what they expect, stable builds don't even notice

    --
    Puteulanus fenestra mortis
    1. Re:New version ... by Anonymous Coward · · Score: 0

      and the issue has already been patched

      Proof?

    2. Re:New version ... by Anonymous Coward · · Score: 1

      Since this was undiscovered until now, you also have to consider how far back this bug goes. Apparently, it appeared in 3.14-rc1. For some people, that's significant.

    3. Re:New version ... by Anonymous Coward · · Score: 2, Insightful

      The last major Linux version update that actually meant something was 1->2. The "major version" bumps in the kernel are now basically just Linus arbitrarily renumbering a release. The workflow no longer has a notion of the next major version.

    4. Re:New version ... by Anonymous Coward · · Score: 0

      Proof?

      It's underlined for you in the fucking summary, loser. "The cause and fix have materialized" if you click that, it takes you to an article that discusses the problem and links to the fix.

    5. Re:New version ... by houstonbofh · · Score: 1

      The down side is that since no one runs business critical loads on new stuff, business critical tools do not get tested as well as simple stuff.

    6. Re:New version ... by Anonymous Coward · · Score: 0

      So it wasn't a stable release?

    7. Re:New version ... by torqer · · Score: 1

      Agreed... Torvalds having a Google+ poll for changing major version numbers is as arbitrary as not having Windows 9.

      https://plus.google.com/+Linus...

    8. Re:New version ... by Anonymous Coward · · Score: 0

      Except those distros that decided to backport that patch.

    9. Re:New version ... by jedidiah · · Score: 1

      No. They just don't run PRODUCTION on the bleeding edge code. That doesn't mean that this stuff isn't being tested with non-trivial use cases. Any reputable IT shop is going to be putting version n+1 through its paces before it does anything important because everyone wants to keep their jobs.

      The last time I used RAID0 for anything it was a high volume R&D project. The OS vendor probably got a couple of good bug fixes out of us.

      --
      A Pirate and a Puritan look the same on a balance sheet.
    10. Re:New version ... by Anonymous Coward · · Score: 1

      Skipping Windows 9 is not arbitrary. They do that because some way old java stuff out there would think it's Windows 95 or 98 because it does some shitty version string parsing.
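
      For illustration only, this is the kind of prefix check being described. It is a hypothetical sketch, not code taken from any real product, and the function name is made up:

```python
def looks_like_win9x(os_name):
    """Naive check that treats any name starting with 'Windows 9' as 95/98."""
    return os_name.startswith("Windows 9")


if __name__ == "__main__":
    for name in ("Windows 95", "Windows 98", "Windows 9", "Windows 10"):
        print(name, looks_like_win9x(name))
```

      A check like this would lump a hypothetical "Windows 9" in with 95 and 98, while "Windows 10" does not match the prefix.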

    11. Re:New version ... by Anonymous Coward · · Score: 0

      "Java engineers are weenies"

    12. Re:New version ... by Yunzil · · Score: 1

      Uh, 4.0 is a stable build, chief.

    13. Re:New version ... by Anonymous Coward · · Score: 0

      No it isn't. The last stable/unstable kernel releases were 2.6/2.5; after that, the vanilla kernel is only a "development kernel". The kernel supplied by your distribution is the stable one.

    14. Re:New version ... by F.Ultra · · Score: 1

      No it wasn't. The patch that caused this problem was a fix to another problem that was introduced in 3.14-rc1.

    15. Re:New version ... by Yunzil · · Score: 1

      kernel.org:

      stable: 4.0.4 2015-05-17

    16. Re:New version ... by KingMotley · · Score: 1

      I'll wait for 4.1, and then I'll wait for 4.1.2 just to be safe.

    17. Re:New version ... by sound+vision · · Score: 1

      And which stable distros are using 4.0? My Debian stable box that gets updated at least weekly is on 3.2.

    18. Re:New version ... by houstonbofh · · Score: 1

      Most places I know do not have identical hardware for testing. They have retired production hardware for testing, so it is older stuff, with older drivers.

    19. Re:New version ... by vilanye · · Score: 1

      "Stable" distros like Debian and redhat patch in a lot of bug fixes and other things into their kernel, so there is no real sense of where they are feature and bugfix-wise.

      3.2 on Debian is not the same kernel as a distro that ran 3.2 when 3.2 was the current kernel.

      Those "stable" distros crack me up, the kernel and almost all of its packages are significantly behind and they are no more stable than my opensuse 13.2 servers that have been upgraded in place since 11.3, but this is a rant for another topic.

    20. Re:New version ... by petermgreen · · Score: 1

      My Debian stable box that gets updated at least weekly is on 3.2.

      Your box is now a Debian oldstable box.

      Debian stable (jessie) has 3.16; Debian testing (stretch) also has 3.16 right now, though I'd expect it to get 4.0 in the not too distant future. Debian unstable has 4.0.

      --
      note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
  10. Avoid even-number OS major releases by Anonymous Coward · · Score: 1

    That's a good rule of thumb for Windows and Linux. Not sure about Apple :)

    1. Re:Avoid even-number OS major releases by Anonymous Coward · · Score: 0

      But no Windows 9...

    2. Re:Avoid even-number OS major releases by Anonymous Coward · · Score: 0

      Yeah, Linux 2.x was no good.

    3. Re:Avoid even-number OS major releases by houstonbofh · · Score: 1

      And 2.6 was the worst since both numbers were even!

    4. Re:Avoid even-number OS major releases by Anonymous Coward · · Score: 0

      Wrong. 4.0 was an arbitrary release, because Linus thought the last number was getting too high. Linux hasn't obeyed the "major number = major change" rule since 2.6.

    5. Re:Avoid even-number OS major releases by Penguinisto · · Score: 1

      That's a good rule of thumb for Windows and Linux. Not sure about Apple :)

      On the Apple side, the rule seems to be an iOS-only thing (and even then only recently... thanks iOS 8!)

      On the OSX side? 10.0 sucked pretty hard, and (IIRC) 10.2 had some problems, but it's been rather rock-stable since then (at least from my POV - I've used OSX from the ill-fated 10.0 all the way up to Yosemite, but YMMV).

      --
      Quo usque tandem abutere, Nimbus, patientia nostra?
    6. Re:Avoid even-number OS major releases by Anonymous Coward · · Score: 0

      dude, it's a joke

  11. WOOT! Communist Linsux is indeed worthless by Anonymous Coward · · Score: 0

    Several people have been point out to the fucktarded shitdot sheeple that communist open-sores, especially communist linsux, are more worthless than shit sundae with toe-fucking-jam, Richard "RMS Titanic" Stallman's favorite dessert. Now we can say why communist open-sores sucks so fucking much. Microsoft doesn't have any of these flaws in any version of Windows that has been released to the public. If a file system becomes corrupt in Windows it is the fucking hardware to begin with but yet you communist loving, fudge packing, twinkie sucking fucktarded shitdot sheeple always claim it is "Teh M$" like your butt buddy Twitter. The best solution to this is to ban communist open-sores, especially since the next "bug" will lead to mass identity theft. Perhaps that's the only way the fucktarded shitdot sheeple will ever get any fucking money. LMFAO!!!

    GO AHEAD FUCKING FLAME AWAY
    OR WASTE YOUR GODDAMNED
    MOD POINTS FUCKTARDED SHITDOT SHEEPLE OR BETTER
    YET GO SLIT YOUR FUCKING WRISTS
    FUCKTARDED SHITDOT SHEEPLE

    1. Re:WOOT! Communist Linsux is indeed worthless by Anonymous Coward · · Score: 0, Flamebait

      You sure about that?

      I remember NTFS failing a lot when it was released... people were having to defrag daily or the system would just not run...

      Losing data, performance,

      And subject to the most trivial of viruses - so bad that you had to have third party programs to suck up more performance just to keep running at a crippled rate.

    2. Re:WOOT! Communist Linsux is indeed worthless by Anonymous Coward · · Score: 0

      I remember NTFS failing a lot when it was released...

      Uhm, that was in 1993.

    3. Re:WOOT! Communist Linsux is indeed worthless by Anonymous Coward · · Score: 0

      Actually, communism and socialism have been:

      1) promoted by Wall St. (and other international bankers) since their inception
      2) is seen as a "good business move"
      3) eliminates competition
      4) guarantees a government-enforced monopoly (stability, reliability, efficiency, predictability)

      Communism and socialism are a way to manage people. They are Big Business, not geeks at an orange terminal
      at 3 AM "subverting" the system.

      Communism IS the worldwide banking, currency, educational, "business" and cultural system.

      Saying "Linux is communism" misses the point. MS promotes communism all the time. They all do. Wall St. does. Everyone who is anybody does.

      International elites LOVE communism. It means no nationality, no independence, no sovereignty. It means the elites can rule from above and apportion out resources, always out of the reach and untouchable to everyone else.

      It means anytime bankers start betting their non-existent capital (leveraged 1:10 or worse) the taxpayer will bail them out, because "stable economy" is written into law not as:

      1) jobs are available
      2) the people have jobs
      3) the people have savings
      4) the people have food
      5) the people have houses

      But merely means "the banks are still in business and operating" and that's it.

      The whole communist and collectivist system is worthless, from currency to education to planned economies (on regional levels, from neighborhood to city to state to country to continent and beyond).

      Linux is but a small part of that. Microsoft has been funding and in favor of communism from the start. It means more profits.

      Communism is effectively a way of getting a taxpayer-supported monopoly written into law. Big business and international corporations in particular, LOVE communism.

      Big business will never ban communism. It is more profitable for them, they will not give up their government-enforced monopolies and planned economies. They worked hard over the course of a century and more to get everyone on board.

      Linux is but a small piece of communism, and pretty much irrelevant. The currency itself nowadays has no value.

      Whatever your opinion on copyright and whether it is moral to restrict copying of digital bits, communism as far as code goes is such a tiny fraction of the communist agenda it is not relevant.

      Linux can be seen as communism being used to fight against communism, which would mean everyone must use the same standards, by law. Linux allows choice.

      While the methods Linux uses are communist in nature, it is inaccurate and wholly misses the big picture to ascribe communism as originating from, or even that being the ultimate goal of the Linux community.

      If you want to fight communism, start with Wall St., start with Ford Motor, General Electric, IBM, Google, Apple (tech cartel), tax-exempt foundations, organizations revolving around Rockefellers, JP Morgans, Chases, Rhodes, Fabian socialists, Rothschilds, Carnegies, Harrimans, the Bush family, Goldman Sachs, the Federal Reserve (and equivalent wherever you live) -- the list goes on and on.

      I should add, "capitalism" was defined by Karl Marx. Communism is the tool of the international financial elite for power and control. It is no "enemy" of capitalism, it is what every "capitalist" prefers and strives for: a government-enforced monopoly with taxpayer money.

      Anyone who is an actual capitalist and an actual communist knows these things. It is only useful idiots and ill-educated folks who spout such rubbish.

      Communism is a method of making rich capitalists richer and ensuring no one can ever touch them. By giving everyone subsistence wages and income tax and thousands of incentives/tax breaks for "desired behavior" the population is conditioned into submission and compliance with communist objectives.

      "Capitalists" and the rich use communism to gain control of populations. It is funded, fomented, and kept in place by the international financial elite, not "Linux users"

      There

    4. Re:WOOT! Communist Linsux is indeed worthless by Anonymous Coward · · Score: 0

      Communism is Rhodesians setting up Round Tables (CFR in the U.S., "Royal" institute of foreign affairs the U.S. populace would not accept)

      Communism is JP Morgan Chase traders setting up "the cartel" (check your local newspaper TODAY you moron)

      Communism is central banks around the world and fiat currency and the IMF and countless others

      Communism is education departments around the world promoting a "global economy"

      Communism is "tech cartels" with Intel, Apple, Microsoft, the list goes on and on (see pando.com)

      Communism is the NSA (and other agencies around the world) spying on everyone and building dossiers and profiles so they may be psychologically manipulated towards communist (mostly economic) agendas. Economic espionage is communism.

      Communism is the CIA running around the world overthrowing democratically-elected leaders to put in people of any political persuasion (communist, anti-communist, whatever sells) to keep global corporations happy.

      Linux may or may not have communist tendencies, but is mostly just a bystander. Destroying Linux...would not change communist takeover of the world one bloody red bit.

      If anything, destroying Linux would be a false sense of security, could be seen as a revolutionary move and brainwashing to condition people into not doing anything about the very real and very active and very well-funded communism that surrounds them the world over.

      If you are not a communist yourself, then you are a dupe or ignorant. Try educating yourself?

      I've given you enough leads to follow.

      Bill Gates loves communism, the Gates Foundation only gives grants to pre-approved groups (no individuals) -- look for yourself.

      All the big name foundations that promote communism around the world, are funded by so-called "capitalists" and "business people."

      They got theirs a century ago (or more), and want to ensure no one else ever does or ever touches their money.

      Linux is a tiny little blip on a giant red communist radar. So what? Destroying Linux wouldn't change a thing.

      Even if you think it is 100% communist, it is such a small blip not to matter.

      Big business LOVES communism. International corporations LOVES communism. Billionaires all LOVE communism.

      It means "no one can touch me" to them, and they can sit on top of everyone and demand "full employment" and that everyone serve their interests instead of their own.

      Sorry anonymous coward, your communists are in another castle.

      Totally missed the point
      And you're to blame
      You give anonymous cowards a bad name (bad name)

    5. Re:WOOT! Communist Linsux is indeed worthless by Anonymous Coward · · Score: 0

      No wonder the communists have won...you think "Linux" is communism.

      Are you the only opposition? We are all doomed to godless communism and totalitarianism I guess.

      You are the best the opposition has got? We are all fucked, forever and forever.

      Communism is mastery learning and Pavlovian conditioning (even Pavlov was against it being used on human beings for political purposes)

      Start with all the educational institutions around the WORLD that use communist methods. Start with Khan Academy and "flipped classroom"

      That is your communism right there. Courtesy of Microsoft, Apple, Google, the U.S. Dept. of Edu, the finance industry, etc.

      Big Business LOVES communism. International "capitalists" LOVE communism.

      Free trade "libertarians" LOVE communism and fund it every chance they get.

      An undefined "Linux" stereotype that you can't even name members of, or where their funding comes from, or how they are brainwashing people... is just not on the "communist" radar.

      IF anything, if there is any truth to your assertion that "Linux is communism"...then the reality is Linux is very late on the scene, so we have some possibilities:

      1) Linux wants capitalism but must succumb to communist currency/education/everything else, just the same as any other movement, because that is what the wealthy folks demand and have pushed onto and enslaved the whole world with

      2) Linux is apolitical, not concerned with such things either way; there may be sects and subgroups with strong opinions, but there is no requirement or enforced political viewpoint

      3) Linux was created and funded by communists, and is just one small blip on the communist plan, which was already achieved decades ago, long before Linux ever appeared...funded by "capitalists" all the way, from the very start

      4) you are a complete brainwashed fucking moron and/or just ignorant

      Even if you have supporting evidence for 3) and maintain that, Linux is just one of 10 million heads of the communist hydra.

      You can cut off the supposed Linux head of the communist party, so what? Wouldn't change a thing.

      Linux is collectivism at best. Has no communist culture, is independent of communist "currency", does not mandate communist brainwashing.

      There may or may not be overlap, so what?

      You are pissing in the communist red hurricane. You already lost. Long before Linux showed up on the scene.

      Fighting a straw man. Whoop-de-doo. Hooray, you want to destroy what is, at best, although you have offered no proof, a shadow, a puppet!

      "stop hitting yourself" as they say.

      If "Linux is communism" then the communists have won, the brainwashing is 100% complete. No one will ever know the truth, because your disinformation will cancel it out.

    6. Re:WOOT! Communist Linsux is indeed worthless by Anonymous Coward · · Score: 0

      Saying "Linux is communism" is like whining that there is one Cheerio left in your diarrhea-filled breakfast bowl.

      Of course Linux has no chance, it will succumb to communism like everyone else. The whole system the world over is communist diarrhea-filled.

      The "Linux community" has no chance to NOT be communist, even if it wanted to. They are stuck in the same communist ocean as everyone else.

    7. Re:WOOT! Communist Linsux is indeed worthless by Anonymous Coward · · Score: 0

      Communism is every Operation Mockingbird and Mighty Wurlitzer CIA media operation going back decades. Never mind what the KGB or others did/still do.

      There are your communists right there, brainwashing the population of the world into compliance with globalist communist economic objectives.

      Here in the U.S., godless communism is the "Office of Faith Based and Community Initiatives" (not to mention blasphemy) -- the merger of "God" and "State", the "State" must "provide" for everyone like "God" formerly used to

      "Linux" is "communist" ? Not even close.

      MickeySoft has been communist from their inception, and they LOVE communism the same as every other large corporation in pretty much every country around the world.

    8. Re:WOOT! Communist Linsux is indeed worthless by Anonymous Coward · · Score: 0

      Bill Gates loves communism, the Gates Foundation only gives grants to pre-approved groups (no individuals) -- look for yourself.

      Now when Wild Ol' Billy says he is an excellent poker player, you know what he means.

      What do you think "revolution" and fundamentally changing the shape of "education" meant?

      Communism is just more profitable. Corporate takeover of the public schools and taxpayer money, directing it towards "workforce training" is just more profitable.

      Microsoft LOVES communism, always has. It is just more profitable that way.

      Fell for Gates' bluff
      And you're to blame
      You give anonymous cowards a bad name (bad name)

  12. It all makes sense now by Anonymous Coward · · Score: 0

    Now I understand why this version of Linux got a major new version number: this issue is a major new "feature."

    (Posting as AC as a defense against humor-impaired moderators. :-)

  13. From the Article... by Rashkae · · Score: 1

    It also looks like if dropping the discard mount option you will also avoid being hit by this serious issue.

    There's very little good reason to use 'discard' on Linux, and many reasons not to. (This isn't the first data corruption problem, and there are several performance issues as well.) Fstrim in a cron job is the way to go.
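
    A rough sketch of how one might spot ext4 filesystems mounted with 'discard', assuming the usual /proc/mounts layout (device, mount point, type, comma-separated options). The function name ext4_mounts_with_discard and the sample text are made up for illustration:

```python
def ext4_mounts_with_discard(mounts_text):
    """Return mount points of ext4 filesystems whose options include 'discard'."""
    hits = []
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) < 4:
            continue  # skip blank or malformed lines
        device, mount_point, fs_type, options = fields[:4]
        # Options are comma-separated; match the exact token, not a substring.
        if fs_type == "ext4" and "discard" in options.split(","):
            hits.append(mount_point)
    return hits


# Hypothetical sample in /proc/mounts format, not taken from a real machine.
SAMPLE = """\
/dev/md0 / ext4 rw,relatime,discard,stripe=32 0 0
/dev/sda1 /boot ext4 rw,relatime 0 0
/dev/sdb1 /data xfs rw,discard 0 0
"""

if __name__ == "__main__":
    print(ext4_mounts_with_discard(SAMPLE))
```

    On a live system you would pass it the contents of /proc/mounts instead of the sample.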

    1. Re:From the Article... by Rashkae · · Score: 1

      Having said that, considering the nature of this bug, I wouldn't be surprised if using fstrim would also trigger this particular bug.

    2. Re:From the Article... by unrtst · · Score: 1

      Good thing all those 'eyes on the source code' caught this little nasty before it went out the door.

      Name one distro where this is out the door.
      For example, the latest ubuntu, released just a month ago, is using 3.19.

    3. Re:From the Article... by FredK · · Score: 1

      Using 4.0 on Gentoo for about a week now.

    4. Re:From the Article... by marsu_k · · Score: 1

      There's very little good reason to use 'discard' on Linux

      Care to elaborate on that? My bible says that discard is the first choice, fstrim when that isn't applicable for whatever reason. Bear in mind that I use Linux mostly as a desktop OS, so whatever caveats there may be in server use do not affect me.

    5. Re:From the Article... by marsu_k · · Score: 1

      4.0.2-1-ARCH #1 SMP PREEMPT Thu May 7 06:47:54 CEST 2015 x86_64 GNU/Linux

      Arch, at least.

    6. Re:From the Article... by present_arms · · Score: 1

      uname -r 4.0.4-pclos1

      --
      http://chimpbox.us
    7. Re:From the Article... by Anonymous Coward · · Score: 0

      Name one distro where this is out the door.

      Debian.

    8. Re:From the Article... by Rashkae · · Score: 1

      This is the first time I've found someone suggesting discard as the first choice over fstrim. The reasons to use fstrim are stated right in that article: performance bottlenecks when there are file delete operations. (And no real benefit to trimming on the fly vs trimming in a batch process.) However, while I usually have nothing against debating my betters and making a spectacular fool of myself, I'm not going to go out of my way to contradict the Arch Linux documentation.

    9. Re:From the Article... by marsu_k · · Score: 1

      Oh, I'm not saying Arch Wiki is infallible (although, it is correct pretty much the whole time). I was just looking for rationalization to discard or not to discard. As a personal anecdote, this Zenbook has been running discard since day 1 (24GB SSD and 500GB HDD, discard on the first drive only of course) - the OS partition (the 24GB drive, ext4) is still spanking fast. Although, it has never been close to running out of space (/var is on the HDD).

    10. Re:From the Article... by Anonymous Coward · · Score: 0

      Debian stable is using 3.16. What are you talking about?

    11. Re:From the Article... by unrtst · · Score: 1

      Ok, I stand corrected.
      Still, not many (I don't count gentoo, as that's just whatever you compile; and unstable (ex. debian unstable) shouldn't count either).
      Running down the list in distrowatch:
      mint: 3.13
      ubuntu: 3.19.0
      debian (stable/testing): 3.16.7
      mageia: 3.19.8
      fedora: 3.17.4
      opensuse: 3.16.6
      arch: 4.0.4
      centos: 3.10
      pclinuxos 2014.12: 3.18.1
      slackware: 3.18.11
      freebsd: ... it's freebsd, not linux ...

      So, out of the top 11 (I don't know why freebsd is even on there), arch is the only one whose current release is on 4.x.

    12. Re:From the Article... by marsu_k · · Score: 1

      If that is your criteria, then perhaps Arch shouldn't count either - sure, it is not a source-based distro (the only package I compile frequently is Firefox, and that is due to it being the KDE-friendly fork, kudos to OpenSUSE for that), but still very much bleeding edge. Remarkably stable at that, but comparable to Debian Unstable.

    13. Re:From the Article... by stooo · · Score: 1

      >> I hope there are lawsuits from people who lose their valuable data from this.
      ????? is there a law against bugs in the US ?
      can you accuse source code ?

      --
      aaaaaaa
    14. Re:From the Article... by ZeroEpoch · · Score: 0

      Any distro using kernel 3.19.7+ is affected. This includes the second most popular distro Fedora as well. That's how I originally discovered the bug and tested the fix. I only used 4.0.3 to first confirm the bug was still there before starting to bisect the commits.
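
      For anyone wanting to check a `uname -r` string against that range, here is a minimal sketch. It takes the "3.19.7+" claim above at face value, knows nothing about which releases carry the fix, and the names kernel_version and in_reported_range are invented for this example:

```python
import re


def kernel_version(release):
    """Parse the leading 'X.Y[.Z]' of a uname -r style string into a tuple."""
    match = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?", release)
    if match is None:
        raise ValueError("unrecognised kernel release: %r" % release)
    # A missing patch level (e.g. '3.10') is treated as 0.
    return tuple(int(part) if part else 0 for part in match.groups())


def in_reported_range(release, first_bad=(3, 19, 7)):
    """True if the release is at or after the first version reported as affected.

    Assumption: 'first_bad' mirrors the comment's claim; no upper bound is
    applied because the thread does not say which version fixes the bug.
    """
    return kernel_version(release) >= first_bad


if __name__ == "__main__":
    for rel in ("4.0.2-1-ARCH", "4.0.4-pclos1", "3.19.0", "3.16.7"):
        print(rel, in_reported_range(rel))
```

      Being in range only means the kernel may carry the bug; whether it bites still depends on the RAID0 and discard setup discussed in this thread.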

  14. Bob Wilson knows they're real! by Thud457 · · Score: 1

    Way to go Barney Buzzkill!
    Next you're gonna try to tell us that gremlins aren't real.

    --

    the preceding comment is my own and in no way reflects the opinion of the Joint Chiefs of Staff

  15. Re:Happy Corruption Day from The Golden Girls! by Anonymous Coward · · Score: 0

    "Confidante". They clearly say "confidante" not "cosmonaut".

  16. Re:Happy Corruption Day from The Golden Girls! by Anonymous Coward · · Score: 0

    The intro has been corrupted

  17. Re:Happy Corruption Day from The Golden Girls! by Anonymous Coward · · Score: 1

    Thank you for feeding the troll
    Took the bait and back again
    Your brain is weak, you're a doof and a space cadet.

    And if they threw a inquest
    Invited everyone you blew
    You would see the biggest sore would be from me
    And the verdict would say, thank you for feeding the troll.

  18. Ahh there it is by drinkypoo · · Score: 2

    Tunneled down into the articles, http://git.neil.brown.name/?p=... has the patch. I'm building a system with 4.0.4 right now so this was material to me

    --
    "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    1. Re:Ahh there it is by TeknoHog · · Score: 1

      I'm building a system with 4.0.4 right now

      Where's that? For some reason I can't find that version anywhere.

      --
      Escher was the first MC and Giger invented the HR department.
    2. Re:Ahh there it is by Anonymous Coward · · Score: 0
    3. Re:Ahh there it is by TeknoHog · · Score: 1

      I love replies like this, especially the whooshing sound they make as they go by.

      --
      Escher was the first MC and Giger invented the HR department.
  19. Or just use a power of 2 chunk size? by tlambert · · Score: 3, Insightful

    Or just use a power of 2 chunk size?

    What idiot configuration did someone have to have to trigger this bug?
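
    A quick sanity check for a chunk size, as a sketch. It assumes, as the question above implies, that only non-power-of-2 chunk sizes are at risk; is_power_of_two is just an illustrative helper:

```python
def is_power_of_two(n):
    """True for 1, 2, 4, 8, ...; False for zero, negatives and everything else."""
    # A power of two has exactly one bit set, so clearing the lowest set bit
    # with n & (n - 1) must leave zero.
    return n > 0 and (n & (n - 1)) == 0


if __name__ == "__main__":
    for chunk_kib in (64, 512, 384):
        print(chunk_kib, is_power_of_two(chunk_kib))
```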

    1. Re:Or just use a power of 2 chunk size? by samwichse · · Score: 1

      But I always use 315 for my chunk size because that's my grandma's birthday!

  20. Unit Testing by Anonymous Coward · · Score: 0

    When did they ever?

    There is some degree of testing that happens, but it's mostly just the peer review. I'm not sure it's really possible to unit test the kernel, especially when you start talking about interacting with hardware. Also, the kernel is 17+ million lines of code. How many lines of unit tests do you think you would need for adequate code coverage? How many thousands of man-years do you think that it would take to accomplish this? What would happen to kernel development if you tried? For that matter, how long is that test going to take to run?

    1. Re:Unit Testing by Anonymous Coward · · Score: 0

      Also, the kernel is 17+ million lines of code. How many lines of unit tests do you think you would need for adequate code coverage? How many thousands of man-years do you think that it would take to accomplish this? What would happen to kernel development if you tried? For that matter, how long is that test going to take to run?

      Sounds as though it is time to rewrite Linux as a micro-kernel and avoid these issues by isolating file system code to be external to the kernel.

    2. Re:Unit Testing by Anonymous Coward · · Score: 0

      You get right on that. I'm sure that will solve more problems than it causes, and probably it will be accepted upon first submission. See you in a few thousand man-years.

  21. Re:Woot! I pissed off the fucktarded shitdot sheep by Anonymous Coward · · Score: 0

    And got a Troll mod! WOOT! This story and latest rage-modding proves the fucktarded shitdot sheple can't handle the truth that Communist Opensores sucks and caters to the fudgepacking, twinkie sucking fucktarded faggots (all members of shitdot) who should go slit their fucking wrists. I can't wait for you fucktarded shitdot sheeple to rage mod this comment as well so I can continue to laugh at your fucking stupidity. LMFAO!!!

    GO AHEAD FUCKING FLAME AWAY OR WASTE YOUR GODDAMNED MOD POINTS FUCKTARDED SHITDOT SHEEPLE OR BETTER YET GO SLIT YOUR FUCKING WRISTS FUCKTARDED SHITDOT SHEEPLE

    Sorry everyone, my fault. I forgot to force feed Grandpa his medication today.

  22. Re:Happy Corruption Day from The Golden Girls! by lister+king+of+smeg · · Score: 1

    best response to troll feeding ever

    --
    ---Saying gnome 3 is better than windows 8 not so much a compliment as it is damning with light praise.
  23. Re:Woot! I pissed off the fucktarded shitdot sheep by tshawkins · · Score: 1

    You need to feed him MOAR BRAINZZ

  24. Linux has become like Windows by Anonymous Coward · · Score: 0

    Linux has become so bloated, it is really disgusting. The desktop is crashing every couple of days. The file system is not stable any more. And it has become so easy to kill the entire system by launching too many processes.

    All the mess has started with these Intel ACPI patches. Since then, Linux has become a major mess.

    Linus, can't you do anything about it?

    1. Re:Linux has become like Windows by Anonymous Coward · · Score: 0

      I do not think anyone can say with a straight face anymore that Linux is a robust operating system, or a good choice from a stability or performance standpoint. The reason why Linux remains so popular is that you can always change components and work around problems. It's a lot of work and frustration, but users will do anything to not have to pay for a proper operating system.

    2. Re:Linux has become like Windows by Anonymous Coward · · Score: 0

      The desktop is crashing every couple of days. The file system is not stable any more.

      The desktop? The file system?

    3. Re:Linux has become like Windows by vilanye · · Score: 1

      Not even a decent troll.

      Running opensuse 13.2 on desktop and server, no crashes and with KDE with all the bells and whistles it still comes in under 400 MB of RAM when the boot process is completed.

      Sure the kernel sources are getting large, but it supports a ton of hardware on several processor architectures.

      Talking about "the file system" and "the desktop" not only shows you are in fact a troll, but displays your total ignorance of Linux.

  25. Raid kills bugs dead! by TeknoHog · · Score: 5, Funny

    Well, there goes that slogan.

    --
    Escher was the first MC and Giger invented the HR department.
    1. Re:Raid kills bugs dead! by toddestan · · Score: 1

      In Soviet Russia, bug kills raid!

  26. In particular, NO redundancy. Reliability drops. by Ungrounded+Lightning · · Score: 5, Informative

    Losing data goes with the territory if you're going to use RAID 0.

    In particular, RAID 0 combines disks with no redundancy. It's JUST about capacity and speed, striping the data across several drives on several controllers, so it comes at you faster when you read it and gets shoved out faster when you write it. RAID 0 doesn't even have a parity disk to allow you to recover from failure of one drive or loss of one sector.

    That means the failure rate is WORSE than that of an individual disk. If any of the combined disks fails, the total array fails.
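
    To put a number on it, here is a back-of-the-envelope sketch. It assumes every disk fails independently with the same probability p over some period, which real drives only approximate, and the function name is mine:

```python
def raid0_failure_probability(p_disk, n_disks):
    """Probability that a RAID 0 array of n_disks loses data.

    Assumes independent, identical per-disk failure probability p_disk.
    The array survives only if every disk survives, so the failure
    probability is 1 - (1 - p)^n.
    """
    if not 0.0 <= p_disk <= 1.0:
        raise ValueError("p_disk must be between 0 and 1")
    if n_disks < 1:
        raise ValueError("need at least one disk")
    return 1.0 - (1.0 - p_disk) ** n_disks


if __name__ == "__main__":
    for n in (1, 2, 4):
        print(n, raid0_failure_probability(0.05, n))
```

    With one disk the result is just p; every extra disk in the stripe pushes it higher.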

    (Of course it's still worse if a software bug injects additional failures. B-b But don't assume, because "there's a RAID 0 corruption bug", that there is ANY problem with the similarly-named, but utterly distinct, higher-level RAID configurations which are directed toward reliability, rather than ONLY raw speed and capacity.)

    --
    Bantam Dominique roosters crow a four-note song. Once you've heard it as "Happy BIRTHday" you can't NOT hear it that way
  27. Systemd by Anonymous Coward · · Score: 0

    everything switches to it... "file corruption"... NSA Dr. Evil pinky finger.

  28. If it only affects RAID0, SAY SO, DAMN IT by Anonymous Coward · · Score: 0

    I was worried that my RAID1 and RAID6 arrays might be affected, and it took a lot of reading to figure out the truth.

    Is it too much to add one more character to the headline to save a lot of people who care about data integrity from panicking?

  29. Raid0 and ext4 is a bad combination by Anonymous Coward · · Score: 0

    If you want high throughput, use XFS, it can get near the speed of the "metal". ext4 will start hitting the limit at 60 MB/s.

    1. Re:Raid0 and ext4 is a bad combination by twistedcubic · · Score: 1

      This is not true.

  30. Re:Happy Corruption Day from The Golden Girls! by Darinbob · · Score: 1

    I thought it was "commandante".

  31. Someone let Hans Reiser out of jail? by Anonymous Coward · · Score: 0

    And now he's doing to ext4 what he did to ReiserFS, and his ex wife?

  32. Bleeding edgy by ShreddingSplinters · · Score: 1

    Since when did bug reports become viral news?