EXT4 Data Corruption Bug Hits Linux Kernel

← Back to Stories (view on slashdot.org)

EXT4 Data Corruption Bug Hits Linux Kernel

Posted by Soulskill on Wednesday October 24, 2012 @07:22AM from the plenty-of-time-to-fix dept.

An anonymous reader writes "An EXT4 file-system data corruption issue has reached the stable Linux kernel. The latest Linux 3.4, 3.5, 3.6 stable kernels have an EXT4 file-system bug described as an apparent serious progressive ext4 data corruption bug. Kernel developers have found and bisected the kernel issue but are still working on a proper fix for the stable Linux kernel. The EXT4 file-system can experience data loss if the file-system is remounted (or the system rebooted) too often."

31 of 249 comments (clear)

Min score:

Reason:

Sort:

Re:Bisected? by Slayne · 2012-10-24 07:30 · Score: 5, Informative

Nope - bisection is a common technique for tracking down the cause of a bug by doing a binary search through the code history.
https://en.wikipedia.org/wiki/Code_Bisection
Re:Bisected? by Gothmolly · 2012-10-24 07:31 · Score: 4, Funny

No this means the kernel has bug-like tendencies from time to time, but is not exclusively buggy. For instance when it's in college, or if its at a bar, and has had a few drinks, well then it might be buggy, but normally at work and at home and to all its friends it acts stable.

--
I want to delete my account but Slashdot doesn't allow it.
This is why I stick to Reiser by Anonymous Coward · 2012-10-24 07:33 · Score: 5, Funny

I know he'd never do anything to harm me or my data.
I don't see the problem then... by Zapotek · 2012-10-24 07:34 · Score: 5, Funny

The EXT4 file-system can experience data loss if the file-system is remounted (or the system rebooted) too often.
We're talking about Linux users here...move along.
Really clever... by K.+S.+Kyosuke · 2012-10-24 07:36 · Score: 5, Funny

The EXT4 file-system can experience data loss if the file-system is remounted (or the system rebooted) too often."
They're trying to boost the average uptime of all installations by making people keep their machines turned on. It's just a continuation of the uptime war waged with the BSD folks!

--
Ezekiel 23:20
Interesting bug, but don't get excited. by dacut · 2012-10-24 07:38 · Score: 5, Informative

From Ted Ts'o's commentary, it's an optimization ("jbd2: don't write superblock when if its empty") gone awry:

The reason why the problem happens rarely is that the effect of the buggy commit is that if the journal's starting block is zero, we fail to truncate the journal when we unmount the file system. This can happen if we mount and then unmount the file system fairly quickly, before the log has a chance to wrap.
Basically, this optimization has the side effect of not updating the transaction log in this rare case. You can end up replaying old transactions after new ones, which will scramble metadata blocks. Given the rather unique conditions needed to hit this one, I'm not going to lose any sleep over any servers running without Ted's fix (though I'll certainly apply it once RedHat releases the patch).
1. Re:Interesting bug, but don't get excited. by Tough+Love · 2012-10-24 07:58 · Score: 4, Informative
  
  It means you could get an incorrect replay after a crash and end up needing to do a fsck. Good thing Ext2/3/4 fsck is awesome. Of course, having no replay bug will be much better. Note: the bug was introduced this October 8th. You are not running this kernel on your server or workstation unless you are a dev, it hasn't filtered through to distros yet.
  
  --
  When all you have is a hammer, every problem starts to look like a thumb.
2. Re:Interesting bug, but don't get excited. by Shimbo · 2012-10-24 08:47 · Score: 3, Insightful
  
  There are certainly distributions out there using 3.4 and 3.5 kernels.
  Yes, but not many of them will push kernel updates all the way through to end users in a couple of weeks.
3. Re:Interesting bug, but don't get excited. by Anonymous Coward · 2012-10-24 09:34 · Score: 5, Informative
  
  Ubuntu users are at risk.
  http://www.ubuntuupdates.org/package/core/quantal/main/proposed/linux-image-3.5.0-18-generic
  Look for " jbd2: don't write superblock when if its empty
  - LP: #1066176"
  If any Ubuntu users have proposed repo enabled and they've updated to 3.5.0-18, they're vulnerable.
4. Re:Interesting bug, but don't get excited. by fatphil · 2012-10-24 10:29 · Score: 4, Informative
  
  $ git show eeecef0af5e
  commit eeecef0af5ea4efd763c9554cf2bd80fc4a0efd3
  Author: Eric Sandeen <sandeen@redhat.com>
  Date: Sat Aug 18 22:29:40 2012 -0400
  
  jbd2: don't write superblock when if its empty
  
  --
  Also FatPhil on SoylentNews, id 863
The file system dug too greedily... by Bovius · 2012-10-24 07:43 · Score: 3, Funny

...and too deep. It awoke a being of segfaults and kernel panics.
Re:Bisected? by petermgreen · 2012-10-24 07:51 · Score: 4, Informative

What they actually split in half is a sequence of changesets (also known as commits).
The idea is you have a seqence of changesets that take you from the last known good revision to the first known bad revision. By splitting that sequence in half and determining if the revsion in the middle is good or bad you can in principle halve the number of revisions between last known good and first known bad until you find the revision that introduced the bug. Reality is messier because of nonlinear history, because some revisions may be "broken" such that it is not possible to determine if they are "good" or "bad" and because some bugs may be difficult to test for but still bisection is a useful tool for finding problem revisions among a long history relatively easill.

--
note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
Re:Reiserfs became 'murderfs'... by Anonymous Coward · 2012-10-24 07:59 · Score: 5, Funny

So clearly the answer is General Tso's FS. Delicious, but you'll lose your data an hour later.
Your Papers Please by Anonymous Coward · 2012-10-24 08:05 · Score: 5, Funny

grammar nazi's
grammar Nazis
Summary is wrong by DrJimbo · 2012-10-24 08:05 · Score: 5, Informative

The EXT4 file-system can experience data loss if the file-system is remounted (or the system rebooted) too often.
This is wrong. The problem occurs when the fs is unmounted too *soon*. Twice in a row. The bug only appears if the journal buffer does not wrap. You only get catastrophic results if this happens twice in a row.

--
We don't see the world as it is, we see it as we are.
-- Anais Nin
1. Re:Summary is wrong by Anonymous Coward · 2012-10-24 08:27 · Score: 5, Interesting
  
  This appears to be untrue. My latest tests suggest that it happens if a single unclean umount happens while the fs is mounted in 3.6.3. (At least, I saw corruption in /var after a single boot, followed by a rescue boot into 3.6.1 and fsck: every filesystem that had journal replay invoked also had corruption.)
  -- N., original reporter, not much enjoying his fifteen minutes of fame since it comes with happy fun filesystem corruption attached: captcha is 'contrite', how appropriate
Re:Reinventing the wheel by UnknownSoldier · 2012-10-24 08:09 · Score: 4, Interesting

I have to agree with you. This is one of the best demos of ZFS around :)
http://www.youtube.com/watch?v=QGIwg6ye1gE
ZFS solves 3 problems by taking a wholistic approach:
* Volume Management
* File System
* Data Integrity
Instead of fragmenting the problem into 3 layers which only have limited access and knowledge by using a unified layer you have more meta-information available to make smarter decisions.
Some interesting essays:
https://blogs.oracle.com/bonwick/entry/raid_z
https://blogs.oracle.com/bonwick/en_US/entry/rampant_layering_violation
Re:Low impact by jedidiah · 2012-10-24 08:14 · Score: 5, Insightful

> Windows has never had anything as serious as a file system corruption bug.
That you know of...
Since the Windows development process isn't open, there's no way for you to tell. You don't get to see Microsoft's development versions and you don't get to see Microsoft's bug database.

--
A Pirate and a Puritan look the same on a balance sheet.
Re:Low impact by h4rr4r · 2012-10-24 08:16 · Score: 4, Informative

http://answers.microsoft.com/en-us/windows/forum/windows_cp-files/bug-report-serious-filesystem-corruption-and-data/17f69e19-92ca-4e1e-b9d5-f78f1ac4e963
Bugs happen. The difference here is that Linux development is done in the open so people find out about them.
Re:Reinventing the wheel by UnknownSoldier · 2012-10-24 08:49 · Score: 4, Interesting

> Blame SUN, they choose a license for ZFS to ensure it never had proper in kernel linux support.
That's a myth / blatant lie.
Fork Yeah! The Rise and Development of illumos
http://www.youtube.com/watch?feature=player_detailpage&v=-zRN7XLCRhc#t=1460s
Why You Need ZFS
http://www.youtube.com/watch?v=6F9bscdqRpo
@5:40 I just want to clarify you comment "It would be illegal to ship"
@5:45 I think there is a perception issue that we need to tackle.
@5:55 One point that I would like to make because I think said earlier that I think we have much more in common then that separates us.
@5:58 One of the most important things we all have in common is we are all open source systems.
@6:02 And we need to end this self inflicted madness of open source licensing compatibility.
@6:12 I think that it is a boogey man and we letting it us hold us back.
@6:19 You say it would be illegal to ship. I say no one has standing
@6:24 The GPL was never ever designed to counter-act other open source licenses.
@6:33 That is a complete rewrite of history to believe the GPL was designed to be at war with BSD or with Cuddle.
@6:39 The GPL was at war with properiety softwware. And thank the GPL and Stallman open source won.
@6:45 That is the whole point. Open source won.
@6:49 We are pissing on our own victory parade by not allowing these technologies to flow between systems.
Re:Bisected? by EMR · 2012-10-24 09:06 · Score: 3, Funny

If God forks the Universe every time you roll a die, he'd better have a damned good memory.
Nah, He only needs the latest SHA1 for each roll outcome commit as that'll point up the GIT tree :-D
Re:Low impact by the_other_chewey · 2012-10-24 09:35 · Score: 4, Insightful

That isn't a file system bug, that is progress. Would you consider it a bug if a Linux system from 1998 caused corruption on an ext4 volume?
Hell yeah.

If it'd tell me it doesn't know the file system and has no idea what do do with it,
that would be perfectly fine.

But corrupting a file system just because it is unknown to/unsupported by the
system trying to read it would be a huge bug.
Re:Low impact by sk999 · 2012-10-24 10:17 · Score: 4, Informative

Still, for all of the shit that Linux users talk about Windows, WINDOWS has NEVER had anything as serious as a FILE system CORRUPTION bug.
Finally, someone talking sense ... oh wait.
http://www.computerworld.com/s/article/9054178/Microsoft_s_Windows_Home_Server_corrupts_files
"Microsoft's Windows Home Server CORRUPTS FILES"
"'Don't edit' list includes photos, as well as Quicken and QuickBooks files, warns Microsoft; no word on patch"
Never mind ...
Re:Bisected? by Nivag064 · 2012-10-24 10:31 · Score: 3, Funny

Nah!
Your'e wrong!!
The 0's go to the top of the page, and the 1's to the bottom!!!
(As the 0's have air bubbles that make them float...)
[An irrelevant irrelevancy?]
Re:Part of the game by fatphil · 2012-10-24 10:33 · Score: 3, Informative

It is *not* 10 days old.

linux-stable$ git show 14b4ed22a6
commit 14b4ed22a6b5fc1549504336131be4f5f6ba1bf4
Author: Eric Sandeen <sandeen@redhat.com>
Date: Sat Aug 18 22:29:40 2012 -0400

jbd2: don't write superblock when if its empty

commit eeecef0af5ea4efd763c9554cf2bd80fc4a0efd3 upstream.

This sequence:

# truncate --size=1g fsfile
# mkfs.ext4 -F fsfile
# mount -o loop,ro fsfile /mnt
# umount /mnt
# dmesg | tail

results in an IO error when unmounting the RO filesystem:

[ 318.020828] Buffer I/O error on device loop1, logical block 196608
[ 318.027024] lost page write due to I/O error on loop1
[ 318.032088] JBD2: Error -5 detected when updating journal superblock for loop1-8.

This was a regression introduced by commit 24bcc89c7e7c: "jbd2: split
updating of journal superblock and marking journal empty".

Signed-off-by: Eric Sandeen <sandeen@redhat.com>
Signed-off-by: "Theodore Ts'o" <tytso@mit.edu>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

diff --git a/fs/jbd2/journal.c b/fs/jbd2/journal.c
index e149b99..484b8d1 100644
--- a/fs/jbd2/journal.c
+++ b/fs/jbd2/journal.c
@@ -1354,6 +1354,11 @@ static void jbd2_mark_journal_empty(journal_t *journal)

BUG_ON(!mutex_is_locked(&journal->j_checkpoint_mutex));
read_lock(&journal->j_state_lock);
+ /* Is it already empty? */
+ if (sb->s_start == 0) {
+ read_unlock(&journal->j_state_lock);
+ return;
+ }
jbd_debug(1, "JBD2: Marking journal as empty (seq %d)\n",
journal->j_tail_sequence);

--
Also FatPhil on SoylentNews, id 863
Re:Bisected? by Just+Some+Guy · 2012-10-24 10:56 · Score: 4, Informative

The summary should say "bisected and found" not "found and bisected". Bisecting is a way of finding bugs.
No. They found the bug, then bisected the commits between "last known working" and HEAD to discover what patch caused it.

--
Dewey, what part of this looks like authorities should be involved?
Re:Low impact by sk999 · 2012-10-24 11:36 · Score: 4, Informative

Nice try, but fail. That wasn't a bug in Windows, it was a bug in applications.
Really? Not according to Microsoft.
http://support.microsoft.com/kb/946676
"A BUG has been discovered in the way that the initial release of Windows Home SERVER manages FILE transfer and balancing across multiple hard drives. In certain cases, depending on application use patterns, timing, and the workload that is placed on the Windows Home Server-based computer, certain FILES could become CORRUPTED."
"... For distributing data across the different hard drives that are MANAGED by WINDOWS Home Server, the WINDOWS Home Server mini-filter driver REDIRECTS I/O ... A BUG has been discovered in the REDIRECTION mechanism which, in certain cases, depending on application use patterns, timing, and workload, may cause interactions between NTFS, the Memory Manager, and the Cache Manager to get out of sync. This causes CORRUPTED data to be written to FILES."
Re:Bisected? by Tough+Love · 2012-10-24 13:08 · Score: 3, Informative

Ah I see, we have ambiguity about what "find a bug" means. From the user's perspective, "finding a bug" means producing the buggy behavior. But from the developer's perspective, "finding a bug" means finding the erroneous code. And we are talking about developers here. From my perspective, until the bug was "found" by bisecting it was only "known to exist", not found. See?
By the way, I've actually bisected bugs, have you? No? OK.

--
When all you have is a hammer, every problem starts to look like a thumb.
Most of the early stories on the web are wrong.... by tytso · 2012-10-24 13:42 · Score: 5, Informative

I have a Google+ post where I've posted my latest updates to this still-developing story:
https://plus.google.com/117091380454742934025/posts/Wcc5tMiCgq7
Also, I will note that before I send any pull request to Linus, I have run a very extensive set of file system regression tests, using the standard xfstests suite of tests (originally developed by SGI to test xfs, and now used by all of the major file system authors). So for example, my development laptop, which I am currently using to post this note, is currently running v3.6.3 with the ext4 patches which I have pushed to Linus for the 3.7 kernel. Why am I willing to do this? Specifically because I've run a very large set of automated regression tests on a very regular basis, and certainly before pushing the latest set of patches to Linus. So while it is no guarantee of 100% perfection, I and many other kernel developers *are* willing to eat our own dogfood.
Re:Low impact by tirnacopu · 2012-10-24 23:00 · Score: 3, Interesting

I got bit by this one: http://support.microsoft.com/kb/925308 on volumes with hundreds of thousands of small files. All who had a size multiple of 4kb were corrupted.
Re:Well of course! by isorox · 2012-10-25 03:26 · Score: 3, Funny

Lastly, my geek friends, mounting too often can cause burning friction which can destroy data and cause irritation and discomfort.
I never had a problem with frequent mounting, however I have now found a side effect from a mount I performed last year. A child-process was forked into existence shortly after the mount, and now we find we're continuously receiving interrupts from the process, which has affected pretty much every aspect of system administration.
I find that performing the mount is occasionally possible, but having to umount to give resources to deal with the child process (which often core dumps, and needs a lot of user interaction), before ejecting can lead to frustration and cold showers.
Most of the time my team is simply trying to run sleep whenever we can.