A Short History of Btrfs


Posted by Soulskill on Friday July 31, 2009 @09:13PM from the new-and-shiny dept.

diegocgteleline.es writes "Valerie Aurora, a Linux file system developer and ex-ZFS designer, has posted an article with great insight on how Btrfs, the file system that will replace Ext4, was created and how it works. Quoting: 'When it comes to file systems, it's hard to tell truth from rumor from vile slander: the code is so complex, the personalities are so exaggerated, and the users are so angry when they lose their data. You can't even settle things with a battle of the benchmarks: file system workloads vary so wildly that you can make a plausible argument for why any benchmark is either totally irrelevant or crucially important. ... we'll take a behind-the-scenes look at the design and development of Btrfs on many levels — technical, political, personal — and trace it from its origins at a workshop to its current position as Linus's root file system.'"


Looks promising by PhunkySchtuff · 2009-07-31 21:30 · Score: 5, Informative

This looks like a promising filesystem - as ZFS on linux is, at present, doomed to die an ugly death, btrfs looks to address a lot of the shortcomings of other filesystems and bring a clean, modern fs to linux. It goes beyond ZFS in some areas too, such as being able to efficiently shrink a filesystem, and keeps a lot of the cool things that ZFS made popular, such as Copy-On-Write.
It looks like Btrfs also revisits some of the decisions made for ZFS, both in the direction it was going and in how it handles certain problems, that the developers would possibly have made differently with hindsight.
Apple are really struggling with ZFS, with it being announced as a feature in early betas of both Leopard (10.5) and Snow Leopard (10.6), as well as being there in a very limited form in Tiger (10.4) - maybe development on Btrfs will leapfrog ZFS for consumer-grade hardware and Apple can finally look at deprecating HFS.

--
Specialist Mac support for creative pros, Melbourne
1. Re:Looks promising by PhunkySchtuff · 2009-07-31 23:31 · Score: 5, Informative
  
  Apple has, and does, use GPL'd code and complies with the terms of the license.
  Take, for example, WebKit, which is a fork of KHTML. It's now released as LGPL:
  http://webkit.org/coding/lgpl-license.html
  This code powers the browser that Apple ship with Mac OS X, Safari - which is arguably one of the most important pieces of code in the whole OS.
  As a result of its quality, speed and standards adherence, it's now used by companies like Nokia and Adobe...
  
  --
  Specialist Mac support for creative pros, Melbourne
Re:So, by PhunkySchtuff · 2009-07-31 21:54 · Score: 5, Interesting

Aside from Copy on Write, one other feature that this filesystem has that I would consider essential in a modern filesystem is full checksumming. As drives get larger and larger, the chance of a random undetected error on write increases and having full checksums on every block of data that gets written to the drive means that when something is written, I know it's written. It also means that when I read something back from the disk, I know that it was the data that was put there and didn't get silently corrupted by the [sata controller | dodgy cable | cosmic rays] on the way to the disk and back.
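To make the idea concrete, here is a minimal sketch of per-block checksumming in Python. It is an illustration only, not how Btrfs or ZFS lay things out on disk: the names `write_block` and `read_block` are made up for this example, the "disk" is just a dict, and `zlib.crc32` stands in for whatever checksum a real filesystem uses.

```python
import zlib

def write_block(store, checksums, index, data):
    """Store a block and record its checksum separately from the data."""
    store[index] = bytes(data)
    checksums[index] = zlib.crc32(data)

def read_block(store, checksums, index):
    """Return the block only if it still matches the recorded checksum."""
    data = store[index]
    if zlib.crc32(data) != checksums[index]:
        # The data changed somewhere between write and read.
        raise IOError(f"checksum mismatch on block {index}")
    return data
```

The point of keeping the checksum apart from the block is that a block which gets silently altered can no longer vouch for itself: the read fails loudly instead of handing back bad data.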

--
Specialist Mac support for creative pros, Melbourne
Re:So, by borizz · 2009-07-31 22:00 · Score: 4, Insightful

Snapshots are nice too. Makes stuff like Time Machine and derivatives much more elegant. ZFS has built in RAID support (which, I assume, works on the block level, instead of on the disk level), maybe Btrfs will get this too.
So, what is the status of btrfs? by MMC+Monster · 2009-07-31 22:29 · Score: 4, Interesting

Is it Beta? The fact that Linus runs it as his root fs doesn't tell me much. Now, if you told me that's what he uses for ~/, I would be more impressed.
The important question to me is, how long 'til it gets in the major distributions?

--
Help! I'm a slashdot refugee.
1. Re:So, what is the status of btrfs? by joib · 2009-07-31 22:39 · Score: 4, Informative
  
  The important question to me is, how long 'til it gets in the major distributions?
  The article predicts a couple of years until it's safe enough as default in new distros.
2. Re:So, what is the status of btrfs? by TheRaven64 · 2009-07-31 23:51 · Score: 5, Interesting
  
  Meanwhile, FreeBSD and OpenSolaris are shipping with a version of ZFS that is usable now...
  
  --
  I am TheRaven on Soylent News
3. Re:So, what is the status of btrfs? by joib · 2009-08-01 00:40 · Score: 5, Informative
  
  Just because I replied to your snarky message with another equally snarky one, doesn't mean I'm not able to put it into words. For instance, a few reasons why I prefer Linux over *BSD or Solaris:
  - better package management
  - better hw support
  - better ISV support
  - the uncertain future of Solaris (after all, Sun got bought because they were bleeding red ink left and right, will the Solaris devs escape the inevitable layoffs and Oracle continue pumping money into Solaris development just to try to keep up with Linux?)
  - Lack of tier-1 commercial support for *BSD.
  - Much larger community
  - Better availability of qualified Linux sysadmins
4. Re:So, what is the status of btrfs? by asaul · 2009-08-01 01:17 · Score: 4, Informative
  
  For hardware support it really depends what segment of the market you are arguing about. If you are talking white box, low end mostly self supported stuff then no doubt, Linux wins hands down. But as a sysadmin I find Linux to be one of the most painful platforms to work on compared to Solaris or AIX - predominantly because of the lack of standardised, stable and properly supported management interfaces.
  Fibre channel support is a joke. Sure, for the most part you can dynamically bring stuff in and out, and udev goes a short way to bringing some consistency. The problem is when something goes wrong you are left with pretty much just rebooting - messages tell you nothing - is the device there or not? Usable details are buried away in /proc and /sys and typically are only useful for developers. Solaris and AIX had cfgadm/cfgmgr and lsdev and friends to tell you what state things are in or what has happened. There are useful and informative error messages (typically). So far on RHEL 3/4/5 all I ever see is odd octal dumps from drivers when errors occur, and weird hangs and IO errors when devices get broken. It gets worse as you change fibre drivers and versions. Options which exist in one disappear in others. Vendor drivers add customisations which cause other issues.
  The lack of stability in terms of being able to do things between versions gets me as well. On AIX/Solaris you write a script for Solaris 8, and it just works going forwards to other versions. Solaris 10 changes things a bit, but for the most part you can still poke around the same places or the same way to get info back. In short they tend not to break things that work.
  Linux goes the other way - a change is made, and that's that; it seems to be up to you to either track it or figure it out. You find yourself having to customise things for many many variations of platform - not just major versions, but minor versions as well. Changes to config file locations, the ways those files are defined etc.
  Don't get me wrong, I got into UNIX on Linux and I won't dispute its strength in drivers or community, but that community is not "Enterprise" focused. It's why I use it for my PVR and not my file server. The rapid changes in Linux are why the DVB-T cards I got became supported so quickly after the hardware changed. I get the differences, but it's not one size fits all.
  
  --
  "If everybody is thinking alike, somebody isn't thinking" - Gen. George S. Patton
Re:So, by joib · 2009-07-31 22:41 · Score: 4, Informative

ZFS has built in RAID support (which, I assume, works on the block level, instead of on the disk level), maybe Btrfs will get this too.
Yes, btrfs currently has built-in support for raid 0/1/10; 5 and 6 are under development.
Oh great by teslatug · 2009-07-31 23:50 · Score: 5, Funny

As if fsck wasn't bad enough to use in business talks, now I have to get prepared for btrfsck
1. Re:Oh great by toby · 2009-08-01 08:35 · Score: 4, Interesting
  
  I'd rephrase that. It eliminates the common cases where you'd need fsck on a conventional filesystem.
  ZFS' design makes consistency failure extremely unlikely. I understand why they claim it doesn't need fsck ("always consistent on disk"). There is controversy over whether there should be a scavenging tool. Some people want one for peace of mind.
  But again, most cases of ZFS pool loss where some believe a scavenger may have saved them, may actually have been solved by more aggressive rollback (I believe work is being done on this).
  Anyone interested in this issue should follow the ZFS mailing list.
  
  --
  you had me at #!
Re:So, by PhunkySchtuff · 2009-07-31 23:57 · Score: 4, Informative

What you do know is that when you read a block of data back from the disk, that block is what was supposed to be written to the disk.
If a file that is never read is corrupted somehow, then you will only discover that corruption when you read the file.
Having checksums is very good if you have a RAID-1 mirror. With full block checksums, you can read each half of the mirror and if there is an error, you know which one is correct, and which one isn't. At present, if a RAID-1 mirror has a soft error like this, due to corruption, you don't know which half of the mirror is actually correct.
With ZFS, for instance, you can create a 2-disk RAID-1 mirror and then use dd to write zeroes to one half of the mirror, at the raw device level (ie, bypassing the filesystem layer) and when you go to read that data back from the mirror, ZFS knows that it's invalid and instead uses the other side of the mirror. It then has an option to resilver the mirror and write the valid data back to the broken half, if you so want.
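The mirror behaviour described above can be sketched in a few lines of Python. This is a toy model under stated assumptions, not ZFS code: `MirroredStore` and its methods are hypothetical names, the two "disks" are plain lists, the checksums sit in a separate list rather than in the filesystem's own metadata, and `zlib.crc32` is a stand-in checksum.

```python
import zlib

class MirroredStore:
    """Toy two-way mirror that uses per-block checksums to pick the good copy."""

    def __init__(self, num_blocks, block_size=16):
        self.block_size = block_size
        empty = bytes(block_size)
        # Two independent copies of every block, one per "disk".
        self.disks = [[empty] * num_blocks, [empty] * num_blocks]
        # One checksum per block, kept apart from the data it covers.
        self.checksums = [zlib.crc32(empty)] * num_blocks

    def write(self, index, data):
        if len(data) != self.block_size:
            raise ValueError("data must be exactly one block long")
        for disk in self.disks:
            disk[index] = bytes(data)
        self.checksums[index] = zlib.crc32(data)

    def read(self, index, repair=True):
        expected = self.checksums[index]
        for disk in self.disks:
            candidate = disk[index]
            if zlib.crc32(candidate) == expected:
                if repair:
                    # Copy the good data over any half that disagrees,
                    # roughly what the resilver option does for one block.
                    for other in self.disks:
                        if other[index] != candidate:
                            other[index] = candidate
                return candidate
        raise IOError(f"no valid copy of block {index} on either half")
```

Overwriting one entry of `disks[0]` with zeroes plays the role of the dd run: the next `read` skips the zeroed copy because its checksum does not match, returns the copy from the other half, and (with `repair=True`) writes it back. A plain mirror without the checksum list would have no way to tell which of the two halves to trust.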

--
Specialist Mac support for creative pros, Melbourne
Meh by Dachannien · 2009-08-01 02:00 · Score: 5, Funny

Who cares? In a few years' time, this will be obsoleted by its successor, icantbelieveitsnotbtrfs.
Re:Duh... by caseih · 2009-08-01 05:09 · Score: 5, Informative

Wow. FUD flies fast and hard on slashdot. Zealots? Are you serious? Rather than mod your post as +1 Funny, I think I'll blow some karma and respond, just to set the record straight.
Laying aside misconceptions about the GPL, the main reason BtrFS is GPL is because it's part of the Linux kernel which is also GPL! How hard is it to grasp that? If Apple or anyone else wants to license Oracle's BtrFS code, they are welcome to negotiate and get the code under a different license than the GPL. It's that simple. BtrFS is an implementation of an idea, a specification. If Apple wants to write their own BtrFS driver, they are welcome to do that. Or Microsoft.
Why are developers who don't want their code to be ripped off by companies (used without payment in a closed product) labeled zealots? How is this different than software companies requiring code to be licensed by third parties? So is a company that creates some really cool technology and licenses it for a fee to others for use in products also a zealot? There really is no difference.
While I haven't written any software of note, I also use the GPLv2 (evaluating v3) since I want my software to be able to be freely used by those that want to use it, but if my code is that valuable to a company, I want to get paid for my trouble. If no one is willing to pay me, then that's fine. They are welcome to use my software without restriction, but if they redistribute it, they must do so under the terms of the GPL. Guess that makes me a zealot.