Linux 4.0 Has a File-System Corruption Problem, RAID Users Warned
An anonymous reader writes: For the past few days kernel developers and Linux users have been investigating an EXT4 file-system corruption issue affecting the latest stable kernel series (Linux 4.0) and the current development code (Linux 4.1). It turns out that Linux users running the EXT4 file-system on a RAID0 configuration can easily destroy their file-system with this newest "stable" kernel. The cause and fix have materialized but it hasn't yet worked its way out into the mainline kernel, thus users should be warned before quickly upgrading to the new kernel on systems with EXT4 and RAID0.
I'll stick with Windows Vista, thanks.
Losing data goes with the territory if you're going to use RAID 0.
Article isn't very clear - are they referring to softraid, fakeraid, and/or hardware raid?
... need to be debugged, so using Raid® is probably the cause of this.
It little behooves the best of us to comment on the rest of us.
this is obviously some strange usage of the word "stable" that I wasn't previously aware of.
RAID 0 is unstable to begin with. Medium case scenario here (for legitimate use) is some data gets corrupted on a compute node. Run the program on two nodes; if you get the same result on both, that result is probably fine. If you're running RAID0 on any filesystem that isn't temporary or at least easily replaceable, you're doing it wrong.
If your running a brand spanky new kernel, with data you do not care about why an old FS. Plenty of newer better FS's to choose from.
No sir I dont like it.
There seems to be a fix in RAID code and a fix in Ext4 code.
The latter was incorporated in Linux 4.0.3 (changelog), and according to the Phoronix article the RAID bug is still unfixed.
This is the new 4.0 kernel, A Major version update , less than a month old, that most Linux systems will not have yet ...and the issue has already been patched
Bleeding edge builds get what they expect, stable builds don't even notice
Puteulanus fenestra mortis
That's a good rule of thumb for Windows and Linux. Not sure about Apple :)
It also looks like if dropping the discard mount option you will also avoid being hit by this serious issue.
There's very little good reason to use 'discard' on Linux, and many reasons not to. (This isn't the first data corruption problem, and there are several performance issues as well.) Fstrim in a con job is the way to go.
Way to go Barney Buzzkill!
Next you're gonna try to tell us that gremlins aren't real.
the preceding comment is my own and in no way reflects the opinion of the Joint Chiefs of Staff
Thank you for feeding the troll
Took the bait and back again
Your brain is weak, you're a doof and a space cadet.
And if they threw a inquest
Invited everyone you blew
You would see the biggest sore would be from me
And the verdict would say, thank you for feeding the troll.
Tunneled down into the articles, http://git.neil.brown.name/?p=... has the patch. I'm building a system with 4.0.4 right now so this was material to me
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Or just use a power of 2 chunk size?
What idiot configuration did someone have to have to trigger this bug?
best response to troll feeding ever
---Saying gnome 3 is better than windows 8 not so much a compliment as it is damning with light praise.
You need to feed him MOAR BRAINZZ
Well, there goes that slogan.
Escher was the first MC and Giger invented the HR department.
Losing data goes with the territory if you're going to use RAID 0.
In particular, RAID 0 combines disks with no redundancy. It's JUST about capacity and speed, striping the data across several drives on several controllers, so it comes at you faster when you read it and gets shoved out faster when you write it. RAID 0 doesn't even have a parity disk to allow you to recover from failure of one drive or loss of one sector.
That means the failure rate is WORSE than that of an individual disk. If any of the combined disks fails, the total array fails.
(Of course it's still worse if a software bug injects additional failures. B-b But don't assume, because "there's a RAID 0 corruption bug", that there is ANY problem with the similarly-named, but utterly distinct, higher-level RAID configurations which are directed toward reliability, rather than ONLY raw speed and capacity.)
Bantam Dominique roosters crow a four-note song. Once you've heard it as "Happy BIRTHday" you can't NOT hear it that way
I thought it was "commandante".
This is not true.
Since when did bug reports become viral news?
Not even a decent troll.
Running opensuse 13.2 on desktop and server, no crashes and with KDE with all the bells and whistles it still comes in under 400 MB of RAM when the boot process is completed.
Sure the kernel sources are getting large, but it supports a ton of hardware on several processor architectures.
Talking about "the file system" and "the desktop" not only shows you are in fact a troll, but displays your total ignorance of Linux.