Slashdot Mirror


Ext4 Advances As Interim Step To Btrfs

Heise.de's Kernel Log has a look at the ext4 filesystem as Linus Torvalds has integrated a large collection of patches for it into the kernel main branch. "This signals that with the next kernel version 2.6.28, the successor to ext3 will finally leave behind its 'hot' development phase." The article notes that ext4 developer Theodore Ts'o (tytso) is in favor of ultimately moving Linux to a modern, "next-generation" file system. His preferred choice is btrfs, and Heise notes an email Ts'o sent to the Linux Kernel Mailing List a week back positioning ext4 as a bridge to btrfs.

510 comments

  1. BTRFS? REALLY? by erroneus · · Score: 4, Interesting

    Couldn't they come up with a better name than "BuTteR FaSe?" I know I can't be the only one who read it like that. Call it anything but that.

  2. BTRFS? by Anonymous Coward · · Score: 5, Funny

    So it incorporates compression by vowel ommission? Brllnt!

    1. Re:BTRFS? by mqduck · · Score: 1

      They also omitted the double T. So you should have said "brlnt!"

      --
      Property is theft.
    2. Re:BTRFS? by Anonymous Coward · · Score: 0

      Not sure about you, but a lossy compression ain't what I'd want for my filesystem, BTR or not.

    3. Re:BTRFS? by Bromskloss · · Score: 1

      So it incorporates compression by vowel ommission?

      What about extra consonants? For error correction, perhaps?

      --
      Swedish plasma phys. PhD student; MSc EE; knows maths, programming, electronics; finance interest; seeks opportunities
    4. Re:BTRFS? by timelorde · · Score: 1

      Double-T is the name of a diner, hon.

      I think y'all meant double L...

    5. Re:BTRFS? by Tevo-D · · Score: 1

      They also omitted the double T. So you should have said "brlnt!"

      Um... wasn't that a double L in there? Or did they change the alphabet when I wasn't looking?

    6. Re:BTRFS? by koxkoxkox · · Score: 1

      The double T of butter, not of brilliant

    7. Re:BTRFS? by Reverend528 · · Score: 1

      You misspelled "Brillant".

    8. Re:BTRFS? by Anonymous Coward · · Score: 0

      Better not try to patent that. Hebrew's got thousands of years of prior art.

  3. Why not ZFS? by mlts · · Score: 5, Interesting

    Unless ZFS has patent issues, why not just work on having ZFS as Linux's standard FS, after ext3?

    ZFS offers a lot of capabilities, from no need to worry about a LVM layer, to snapshotting, to excellent error detection, even encryption and compression hooks.

    1. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      Unless i'm mistaken, ZFS is patented by its creators over at Sun

    2. Re:Why not ZFS? by PhrostyMcByte · · Score: 5, Informative
      I am not aware of the differences, but from Theodore Ts'o:

      people who really like reiser4 might want to take a look at btrfs; it has a number of the same design ideas that reiser3/4 had --- except (a) the filesystem format has support for some advanced features that are designed to leapfrog ZFS, (b) the maintainer is not a crazy man and works well with other LKML developers (free hint: if your code needs to be reviewed to get in, and reviewers are scarce; don't insult and abuse the volunteer reviewers as Hans did --- Not a good plan!).

    3. Re:Why not ZFS? by Anonymous Coward · · Score: 5, Informative

      The ZFS developers specifically wanted the open sourced code to be under a GPL incompatible license, hence it has been released under CDDL (there was a interview with the Sun open source rep, can someone provide info/links about this). So ZFS cannot be part of the kernel, but there is a FUSE port of ZFS and according to http://en.wikipedia.org/wiki/ZFS#Linux Sun is investigating a Linux port, so there may be something good coming

    4. Re:Why not ZFS? by calidoscope · · Score: 1

      IIRC, it is that ZFS was released with a license that is not compatible with GPL v2 and thus cannot be part of the Linux Kernel. Will be interesting to see how btfs stacks up with ZFS.

      --
      A Shadeless room is a brighter room.
    5. Re:Why not ZFS? by Mad+Merlin · · Score: 4, Insightful

      ZFS offers a lot of capabilities, from no need to worry about a LVM layer, to snapshotting, to excellent error detection, even encryption and compression hooks.

      ...and that's it's biggest problem. ZFS duplicates a lot of functionality that belongs outside of a filesystem. All of the above can already be done using any Linux filesystem, so why keep around a second copy of all that code that implements those features for just a single filesystem?

      ReiserFS was (is) in a similar situation, where it also duplicates a lot of functionality that doesn't belong in the filesystem. Not only does this make it harder to maintain, but it makes a lot of features filesystem specific that shouldn't be.

    6. Re:Why not ZFS? by volsung · · Score: 4, Informative

      I don't know about the patents, but the current major obstacle is the license. ZFS, as part of the OpenSolaris kernel, is available under the CDDL. The CDDL is incompatible with the GPL, ruling out ZFS inclusion directly in the Linux kernel. Sun has hinted that they could dual license the Solaris kernel under CDDL and GPL, but that hasn't happened yet. Small parts of the ZFS filesystem code have been GPLed so they could be added to grub to support booting ZFS root filesystems.

      There is a userspace port of the ZFS code and utilities which avoids the license problem by using FUSE to separate the filesystem code into a separate process: ZFS-FUSE.

      If Sun were to ever dual-license ZFS, the ZFS-FUSE codebase would be a good place to start for porting the code to direct kernel inclusion. (Note: Sun, via their subsidiary, Cluster File Systems, now employes the author of ZFS-FUSE to use his port as an optional backend for the Lustre file system.)

    7. Re:Why not ZFS? by Wonko · · Score: 5, Informative

      ZFS duplicates a lot of functionality that belongs outside of a filesystem. All of the above can already be done using any Linux filesystem, so why keep around a second copy of all that code that implements those features for just a single filesystem?

      It wouldn't be possible to duplicate RAID-Z with LVM. Other features of ZFS are very handy, but RAID-Z is by far my favorite. Same storage density as RAID 5 but without the horrible write performance. RAID-Z uses copy-on-write to avoid RAID 5's required read for every non-cached write.

      Being able to create filesystems just as easily as creating directories is quite handy as well, though. IIRC, the filesystem sizes in ZFS are controlled by a quota style system. That is much simpler than shrinking an LV (if your filesystem supports shrinking), then adding a new LV, and then creating a filesystem. I don't know about you, but I am always a bit nervous when I have to resize an LV.

    8. Re:Why not ZFS? by 42forty-two42 · · Score: 3, Informative

      Sun has some patents on ZFS; the CDDL grants a license to these patents if you're deriving from the original ZFS source, but then you can't link it to linux.

      FWIW, I doubt ZFS-FUSE would be a good place to start - FUSE is totally different from Linux's actual vfs layer, after all.

    9. Re:Why not ZFS? by mritunjai · · Score: 4, Informative

      The ZFS developers specifically wanted the open sourced code to be under a GPL incompatible license, hence it has been released under CDDL (there was a interview with the Sun open source rep, can someone provide info/links about this). So ZFS cannot be part of the kernel, but there is a FUSE port of ZFS and according to http://en.wikipedia.org/wiki/ZFS#Linux Sun is investigating a Linux port, so there may be something good coming

      Rather, GPL is incompatible with anything else that can't be re-licensed as GPL, and that includes GPL v2 and v3, which can't even be mixed among themselves. May first we clear that mess, right ?

      ZFS is present in both Mac OSX and FreeBSD, thank you! They have no license issues whatsoever.

      --
      - mritunjai
    10. Re:Why not ZFS? by setagllib · · Score: 2, Insightful

      A FUSE ZFS guarantees it will never be the "default" filesystem anyway. BTRFS has a good shot at being your / in a couple of years.

      --
      Sam ty sig.
    11. Re:Why not ZFS? by clarkkent09 · · Score: 5, Insightful

      (b) the maintainer is not a crazy man and works well with other LKML developers

      Also important, he might be more focused due to not being in prison for first degree murder

      --
      Negative moral value of force outweighs the positive value of good intentions.
    12. Re:Why not ZFS? by k33l0r · · Score: 1

      ZFS has licensing issues. Sun has licensed it under the CDDL, which is incompatible with the GPL. If Sun doesn't change the license, our best hope will be to get ZFS running on FUSE, like NTFS-3G.

    13. Re:Why not ZFS? by Michael+Hunt · · Score: 1

      The ZFS developers specifically wanted the open sourced code to be under a GPL incompatible license, hence it has been released under CDDL

      No. The ZFS developers' management wanted the code to be under a license which served their purposes. The fact that CDDL is GPL-incompatible is, largely, GPL's fault. Don't blame the Sun guys for releasing free code simply because everything must be GPL in your world.

      Hypocrite.

    14. Re:Why not ZFS? by Anonymous Coward · · Score: 1, Informative

      that includes GPL v2 and v3, which can't even be mixed among themselves.

      [citation needed] My memory of the GPL compatibility matrix doesn't support that, but it may be true for GPLv2-only rather than the ordinary GPLv2-or-any-later-version (which does cover the Linux kernel).

      That said, I thought Sun was trying to GPLv3 ZFS? It's certainly a lovely filesystem, though I guess they're fighting with NetApp over patents on it.

    15. Re:Why not ZFS? by GrievousMistake · · Score: 5, Interesting

      Huh. One of the interesting things things about Reiser4 from an end-user perspective was Hans Reisers plans for file metadata. From what I can find about btrfs, it currently doesn't even support normal extended attributes. There was also talk about making it easy for developers to extend the filesystem with plugins that could add e.g. compression schemes.
      I can't really recognize anything from Hans Reiser's ramblings in the btrfs documentation that isn't standard file system improvements already seen in e.g. ZFS. does anyone have any specific examples of the ZFS-leapfrogging features referred to?

      --
      In a fair world, refrigerators would make electricity.
    16. Re:Why not ZFS? by Ivlis · · Score: 3, Informative

      Parts of ZFS are patented, but the license allows running it in userspace using FUSE.

      I'm confused: if we ask people why not run ZFS using FUSE, they reply because it's slow (I'm assuming it's possible to load ZFS at boot time using an initrd). And if we ask people which is better monolithic or microkernel, they reply microkernel. But ZFS using FUSE would be like a microkernel driver, so which is it?

    17. Re:Why not ZFS? by scott_karana · · Score: 1

      Linux is GPL, and ZFS is CDDL. As such, you can't merge ZFS into the Linux kernel. Sun is apparently considering dual-licensing ZFS, but until that happens, we'll have to use half-assed solutions like running ZFS in userspace.

    18. Re:Why not ZFS? by Anonymous Coward · · Score: 5, Funny

      Huh. One of the interesting things things about Reiser4 from an end-user perspective was Hans Reisers plans for file metadata.

      No, the most interesting feature of ReiserFS is this one (look to the far right).

      --
      ReiserFS: It puts the "stab" in "/etc/fstab".

    19. Re:Why not ZFS? by erikdalen · · Score: 1

      For starters most microkernels can boot from any filesystem even if the driver is in userspace.

      --
      Erik Dalén
    20. Re:Why not ZFS? by Xaria · · Score: 3, Informative

      No, it wouldn't. A microkernel loads modules into the kernel space. You're talking about running in user space. So when an application makes a system call, the kernel has to translate it to the FUSE layer into user space. So there's an extra layer consuming time. On top of that, kernel space isn't generally swapped out, but user space can be. Obviously it should never happen, but wouldn't it suck if your disk driver was swapped out?

      See the diagram at the bottom of this page: http://fuse.sourceforge.net/

      Also, ZFS (like ReiserFS) handles its metadata differently from ext3, so you have to translate the differences between the virtual file system and ZFS. This is why writes are significantly slower. Reads are not so bad. The NFS penalty would be huge. See http://www.linux.com/feature/138452

    21. Re:Why not ZFS? by deniable · · Score: 5, Funny

      Yep, BeaTeR FS is a kinder, gentler alternative to Reiser FS.

    22. Re:Why not ZFS? by mml · · Score: 5, Informative

      > Rather, GPL is incompatible with anything else that can't be re-licensed as GPL, and
      > that includes GPL v2 and v3, which can't even be mixed among themselves.

      Saying that GPLv2 and GPLv3 "can't even be mixed among themselves" is wrong and
      misleading.

      Section 14 of GPLv2 specifically deals with the problem of later versions of the
      licence and sets out the options. A copyright holder can choose to allow work to be used
      with later versions, such as GPLv3, or can choose not to. There are also more
      complex options. The licence itself doesn't force the choice one way or the other.

      Matt

    23. Re:Why not ZFS? by beav007 · · Score: 0

      does anyone have any specific examples of the ZFS-leapfrogging features referred to?

      Yes - (1) It has longer legs.

    24. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      Having the filesystem require FUSE on Linux is the kiss of death.

      If ZFS is implemented for Linux, it needs to be on the VFS level, and usable by /, /boot, /usr, and other critical partitions. FUSE is great for implementing stuff like EncFS or filesystems that use Gmail for storage, but for a root filesystem, no way.

      IMHO, filesystems is where Linux is starting to lag compared to other operating systems. Windows, AIX, Solaris, and soon MacOS when Snow Leopard gets released will have snapshotting out of the box. Linux distribution makers need to get with the times and look for a replacement for ext3 which has 21'st century features. This is not to belittle ext3 and ext2 for that matter, but their time is beginning to pass, and something new needs to replace it.

    25. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      Yes, it is "the GPL's fault" that it's so restrictive, but Sun have opened plenty of other code under that license without a murmur so their choice of an incompatible license does make one wonder why.

      The 'purpose being served' by the choice of CDDL was specifically to exclude inclusion in the Linux kernel, and I don't think that's being overly paranoid.

      Not that I'm blaming them. In fact I think it's great for Linux, as it's the a massive factor in spurring on development of competing technologies, like BTRFS and - more excitingly, I think - Tux3, and perhaps even HammerFS, which looks like it'll get a useful Linux port at some point.

      All of those have the potential to eat Sun's lunch in the storage space and if they do I'll laugh heartily, because their protectionist choice of license may ultimately cause ZFS to fall behind.

      rob

    26. Re:Why not ZFS? by BrainInAJar · · Score: 0, Flamebait

      Bullshit, the developers ( Sun ) didn't choose the CDDL to be GPL incompatible, they chose the CDDL so that they could 1) be compatible with a bunch of other stuff ( there's more than just linux out there, you know ) and 2) so that vendors that don't want to open-source their drivers could still provide Solaris drivers.

      GPL is almost as bad as proprietary ... you can't link to it if you decide you want to give your users more freedom than the GPL.

    27. Re:Why not ZFS? by BlackSabbath · · Score: 1

      > ...duplicates a lot of functionality that belongs outside of a filesystem...

      Belongs outside the FS? According to whom? There are certainly good historical reasons for why we have the block-driver/volume-manager/file-system split we currently do. However, great innovations sometimes require that convention be damned. Are the benefits worth the different architecture?

      Having said that I'm currently using ZFS on Mac OS X to host my VMware images. I can clone Windows guests as easy as pie without taking another 8Gb hit per VM. It's still not perfect on the Mac but it is pretty cool. We are also using ZFS at work (on Solaris SPARC) with mixed results. High-volume batch oriented (or low-connection/large-trx OLTP) databases run well. Databases with hundreds of connections making thousands of very short trx per sec. - not so well. We're still playing with it. Great for dev/test environments though. You can clone as many copies of some big DB as you like - instantaneous and space-saving - then destroy as you like.

    28. Re:Why not ZFS? by BrainInAJar · · Score: 1

      You're talking out of your ass.

      There's a very good reason why they can GPL Java and not their OS, and that has to do with the fact that you, as a commercial vendor, won't be linking in to the core JavaVM ( as that would probably break the java cert. compliance tests ).
      Meanwhile, nVidia, Veritas, etc. still want to write non-GPL drivers and have them link to the core Solaris operating system without worrying about legalities. (I'd love to see some minor kernel dev sue nV for GPL violation and have that little loophole closed )

    29. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      Are you saying that SUN did not know what it did? The GPL is out there, and have been out there for ages. It and it's effects are well known, and have been so for a long time. Yet you pretend that the blame for ZFS being GPL-incompatible lies with the GPL? If SUN had wanted to make it open source in anything but name, what would have stopped them from using for instance the BSD license or dual licensing it?

      Laying the blame at the feet of GPL reeks of hypocrisy.

    30. Re:Why not ZFS? by QuoteMstr · · Score: 2, Informative

      You can clone as many copies of some big DB as you like - instantaneous and space-saving - then destroy as you like.

      I've been able to do this forever with LVM snapshots under Linux.

    31. Re:Why not ZFS? by mvdwege · · Score: 3, Interesting

      Come back when ZFS has decent filesystem maintenance tools.

      And don't give me that 'ZFS doesn't need a fsck' crap. SGI tried to pull that with XFS, and it didn't work. Filesystem (at least metadata) corruption will happen, and once it does, ZFS doesn't have the tools to fix it.

      Mart

      --
      "I know I will be modded down for this": where's the option '-1, Asking for it'?
    32. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      Oh wow, how naive are you?
      If Sun was interested in wide adoption, they would dual license the code. They do not, because they want to weaken Linux. OSX and FreeBSD are not competitors and probably never will be, that's why it makes sense to let those support ZFS and take user share away from Linux.

    33. Re:Why not ZFS? by adrianwn · · Score: 5, Interesting

      A microkernel loads modules into the kernel space.

      No, that's the opposite of a microkernel. A microkernel loads its modules (then often called "servers") into user space. If the kernel and its drivers etc. run in the same address space (as is the case with, e.g., Linux), then we're talking about a monolithic kernel, even if it can dynamically load modules.

    34. Re:Why not ZFS? by Sentry21 · · Score: 1

      Because the GPL is incompatible with the CDDL, and so ZFS (without a complete from-scratch reimplementation) cannot be included into the Linux kernel codebase.

      It should be possible for someone to write a patch to the kernel which could be shipped separately under the CDDL and which end-users could apply to their kernels, but then you start getting into pointless grey areas.

      For my money, Solaris, FreeBSD, and Leopard are better bets - ZFS and dtrace, and Solaris/FreeBSD have containers/jails as well.

    35. Re:Why not ZFS? by Daniel+Phillips · · Score: 1

      Sun is perfectly able to GPL whatever code of theirs they wish, including ZFS.

      Personally, it does not bother me a bit that Sun has not put ZFS under GPL, that is Sun's loss not ours. ZFS is hardly the last word in filesystem design.

      --
      Have you got your LWN subscription yet?
    36. Re:Why not ZFS? by chromatic · · Score: 1

      Food for thought: would a BSD-style license have accomplished the same goals?

    37. Re:Why not ZFS? by Daniel+Phillips · · Score: 1

      It wouldn't be possible to duplicate RAID-Z with LVM.

      I often hear that claim but never see any support of that claim.

      --
      Have you got your LWN subscription yet?
    38. Re:Why not ZFS? by BrokenHalo · · Score: 4, Interesting

      not to belittle ext3 and ext2 for that matter, but their time is beginning to pass, and something new needs to replace it.

      I'm not sure that I see why, unless you're simply bored with the older filesystems. Something as critical as this should not be driven by what is trendy at any given moment. If one has no need for particular advanced bells or whistles, there is no need to use them.

      For instance, since for historical and security reasons I keep /boot on its own separate partition which is mounted readonly, it makes sense here to not have anything trying to write to a journal, so ext2 is still a very good choice here. As the partition is tiny (only 20MB) it takes a fraction of a second to run e2fsck over it when or as required, so there is nothing to be gained by journalling it anyway.

      I still use ReiserFS3 on most of my other partitions, since I don't have any intention of changing the filesystem until I change the drives. ReiserFS is still a good choice for my purposes anyway.

    39. Re:Why not ZFS? by BrentH · · Score: 1

      But it certainly is the best word in filesystem design, and isn't about to be bested by anything yet. Also, ZFS is right here right now, and btrfs is a long ways off.

    40. Re:Why not ZFS? by BrentH · · Score: 4, Insightful

      The things you think belong outside of a filesystem only 'nelong' there because that's what years of narrowminded developing have tought you. Look at it this way: /everything/ related to filestorage is managed by ZFS. What could be more convenient than that? Because of this, ZFS can do things much faster and much more reliable than any combo of LVM with a filesystem. Why chain together tools yourself, and manually think about things you really shouldn't be thinking about, when you can have a good filesystem take care of it for you.

      ZFS is easier to maintain, from a users perpective (and that's the job of development, to make usage easier, not ever the other way round).

    41. Re:Why not ZFS? by Kjella · · Score: 4, Informative

      Rather, GPL is incompatible with anything else that can't be re-licensed as GPL, and that includes GPL v2 and v3, which can't even be mixed among themselves. May first we clear that mess, right?

      With a copyleft license, you intend to secure certain rights to the end user to the work as a whole. It is at the very essence of what the GPL tries to do compared to non-copyleft open source licenses or the LGPL that only covers the parts consisting of LGPL code, not any sort of "flaw" or "mess". Licenses work so that you must simultaniously fulfill all of them, so the GPL denies using GPL code with code that denies end users the four freedoms the FSF profess. That is the intention by design, but then there is some collateral damage as well-intended licenses are rendered GPL-incompatible due to details since the GPL (or any copyleft license) couldn't allow open-ended arbitrary restrictions without losing all meaning. The GPLv2 was particularly flawed in this area since it was made fairly long ago with this not much in mind, and in the GPLv3 they did a lot of work to improve compatibility leading to section 7 that among other things say:

      Notwithstanding any other provision of this License, for material you add to a covered work, you may (if authorized by the copyright holders of that material) supplement the terms of this License with terms:
      a) Disclaiming warranty or limiting liability differently from the terms of sections 15 and 16 of this License; or
      b) Requiring preservation of specified reasonable legal notices or author attributions in that material or in the Appropriate Legal Notices displayed by works containing it; or
      c) Prohibiting misrepresentation of the origin of that material, or requiring that modified versions of such material be marked in reasonable ways as different from the original version; or
      d) Limiting the use for publicity purposes of names of licensors or authors of the material; or
      e) Declining to grant rights under trademark law for use of some trade names, trademarks, or service marks; or
      f) Requiring indemnification of licensors and authors of that material by anyone who conveys the material (or modified versions of it) with contractual assumptions of liability to the recipient, for any liability that these contractual assumptions directly impose on those licensors and authors.

      That vastly improves compatibility with the licenses the GPL wants to be compatible with so collateral damage is reduced to a minimum. It's still very easy to write a license, even a free software license, that isn't GPL compatible though. If you look at the reason the CDDL and GPL are incompatible it's that the CDDLs copyleft conditions and the GPLs copyleft conditions clash because they both try to do the same thing. It's almost impossible to write two copyleft licenses where one (or both) doesn't see the other as adding "additional restrictions" on the end user. Even the GPL can't escape that as it tries to improve the GPL unless you have the "and later" clause. Then again, there's no reason such a license should have to be revised often - it took 16 years before releasing version three and it'll probably be longer until next time it's needed.

      --
      Live today, because you never know what tomorrow brings
    42. Re:Why not ZFS? by tkinnun0 · · Score: 3, Insightful

      It's not wrong or misleading. If you have GPLv3 and GPLv2 code, you can mix them if the GPLv2 code's copyright holder gives you the permission. Likewise, if you have BSD and GPLv2 code and wish to retain the BSD licence. The mechanisms may be different but the end result is the same: GPLv2 in itself doesn't give you the permission, you need permission from the copyright holder.

    43. Re:Why not ZFS? by mike_sucks · · Score: 2, Insightful

      The GPL is restrictive as it is /because/ it ensures freedom for users. It is the /developers/ that the GPL bugs.

      Bring on the GPL, I say! Boo to Sun for being anti-users.

      /Mike

      --
      -- "So, what's the deal with Auntie Gerschwitz et all?"
    44. Re:Why not ZFS? by Daniel+Phillips · · Score: 2, Informative

      I don't know about that either. There are consistent reports that ZFS is slower than Ext3 on many common workloads. Also reports of instability.

      While I do respect some of the engineering achievements in ZFS, I do not consider it to be the last word in filesystem design, or even the best filesystem for many applications. I also have doubts about the wisdom of some of the design decisions, such as inhaling the LVM into the filesystem, using 128 byte block pointers, and making a distinction between filesystem snapshots and clones.

      --
      Have you got your LWN subscription yet?
    45. Re:Why not ZFS? by gbjbaanb · · Score: 3, Insightful

      Why chain together tools yourself, and manually think about things you really shouldn't be thinking about, when you can have a good filesystem take care of it for you.

      Because that's the Unix way - build small components (applications) and chain them together to create something out of the parts. I mean, why have ls and grep when you can have lsgrepsortfind? Really, the point is to have small, easily maintained apps that do 1 thing well than 1 app that does everything possibly well, but more usually poorly as its difficult to maintain and ensure it works properly. Not to mention the bloat when it replicates functionality already provided.

      This may not be the best model for a critical component like a filesystem, but on the other hand, reliability of a filesystem is paramount, so keeping it as small as possible is probably a good idea.

    46. Re:Why not ZFS? by Anonymous Coward · · Score: 1, Insightful

      Look at it this way: /everything/ related to filestorage is managed by ZFS. What could be more convenient than that?

      Having it in a layer where it would work with every file system, without needing to reinvent everything again.

    47. Re:Why not ZFS? by Daniel+Phillips · · Score: 1

      By the way, take a look at Hammer if you want to see something closer to the truth than ZFS. It has a cluster replication model too, considerably in advance of ZFS replication.

      --
      Have you got your LWN subscription yet?
    48. Re:Why not ZFS? by lucas+teh+geek · · Score: 1

      it's interesting that people will go to great lengths commit copyright infringement on music/movies/tv/videogames, but there doesnt appear to be people willing to step up and ignore sun's licencing and port it anyway. sure; it could never be incorporated into the official linux kernel releases, but I'm suprised that it doesnt exist as a patch available through "other channels".

      --
      TIAEAE!
    49. Re:Why not ZFS? by ObsessiveMathsFreak · · Score: 0, Redundant

      Rather, GPL is incompatible with anything else that can't be re-licensed as GPL,...

      Your statement is a tautology.

      --
      May the Maths Be with you!
    50. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      No, because BSD license doesn't grant you rights on patents?

    51. Re:Why not ZFS? by ciderVisor · · Score: 1

      Fantastic ! I LOL'd.

      --
      Squirrel!
    52. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      The ZFS developers specifically wanted the open sourced code to be under a GPL incompatible license

      when did microsoft release FAT under a GPL compatible licence? um, never. But the kernel devs made a GPL implementation.

    53. Re:Why not ZFS? by Wonko · · Score: 3, Interesting

      I often hear that claim but never see any support of that claim.

      The closest thing to RAID-Z in the Linux kernel is the RAID 5 DM. If you want to write a 4k block to some random location that isn't currently fully cached the DM has to read 1 stripe from each disk in the array, make the 4k change, recompute the checksums, and then flush that stripe back to each disk. The default stripe size is 64k. That means if you have 4 drives you would be performing a 256k read and a 256k write just to change a single 4k block. Of course, that is worst case. Best case is you have to overwrite the entire stripe with a fresh 256k block of data.

      ZFS and RAID-Z get around that problem by just writing the changed blocks to an unused part of the disk. Once the write is complete it just moves the pointer to the new block location. This is copy-on-write, and this is where the performance boost comes in over RAID 5. With RAID-Z you should never be required to read the whole stripe to do a write.

      RAID-Z also allows for dynamic stripe sizing. That helps get more optimal efficiency on small files and large files.

      The dynamic stripes aren't terribly important, but if you could figure out a way to do the copy-on-write without the filesystem have very fine grained control and knowledge of the underlying array we would all love to hear about it :).

    54. Re:Why not ZFS? by Znork · · Score: 2, Insightful

      RAID-Z uses copy-on-write to avoid RAID 5's required read for every non-cached write.

      Of course, the very same copy-on-write will also result in massive file fragmentation, carefully smearing your dbf files over the entire platters, making your SAN caches useless. Over time resulting in horrible read performance.

      ZFS is certainly a huge improvement for anyone used to ufs and disksuite, but I have to say that using it in the real world it's not all it's cracked up to be. A more layered approach would have made it easier to switch in and out features that turned out to be misfeatures in certain situations.

      Mixing together the features of various layers is, imo, no matter how tempting, simply the wrong approach. Proceed further along that road and you get to record based filesystems or even more special-purpose variants. I mean, there are even more optimizations that you can do if you know the _contents_ of the files. But once you go down that road complexity will grow with every possible different situation you need to handle and you end up either with something far too complex or something unsuitable for many cases. Better then to do the best you can without extra knowledge, code special layers for special features, assume nothing, and let the possibly competent admin add appropriate layers for appropriate data.

    55. Re:Why not ZFS? by Jah-Wren+Ryel · · Score: 1

      The GPL is restrictive as it is /because/ it ensures freedom for users. It is the /developers/ that the GPL bugs.

      +1 truth

      --
      When information is power, privacy is freedom.
    56. Re:Why not ZFS? by BrentH · · Score: 1

      Just because historically it has been the Unix way, doesn't mean we can't do it right from now on. I know what you're mean, 1 program, 1 task, is stabler in the end, but ZFS actually is a very reliable piece of software. It works, it's stable (actually, with its automatic integrity checking, it safer than anything I know) and its extremely usable. Again, just because it replicates functionality doesn't mean it can be (much) better. Linux itself replicated functionality in the beginning, but I'm damn glad that didn't stop Linus or GNU.

    57. Re:Why not ZFS? by BrentH · · Score: 1

      It's true that ZFS is in flux, and it's cutting edge functionality isn't always stable, but if you use a subset of it's funtions (which already make life easier than any LVM/ext combo) it's an inprovement. Speed is very much a function of the functions you use, but with disks these day's I gladly sacrifice 10% for the increased ease of use and data integrity. And it's design decisions are what I consider it's strongpoints, once you see it's practical simplicity, you'll love it.

    58. Re:Why not ZFS? by Anonymous Coward · · Score: 1, Informative

      And that is exactly what ZFS achieves:

      http://blogs.sun.com/eschrock/entry/ufs_svm_vs_zfs_code

      UFS+SVM = 320k lines of code
      ZFS = 70k lines of code

      And ZFS offers massively more functionality, flexibility and performance that UFS+SVM.

      Reversing your analogy - sure you can do anything you want with grep, sed, awk and cut. But sometimes you just make your day easier with a 3 line perl script. It doesnt mean the earlier tools are redundant, it just means you have a better way of doing things.

    59. Re:Why not ZFS? by EyelessFade · · Score: 1

      It has license problems. ZFS sun's CDDL is not compatibly with GPL. http://en.wikipedia.org/wiki/ZFS#Linux

    60. Re:Why not ZFS? by richlv · · Score: 1

      ext4 has a shot at 2 years for root :)
      i don't see btrfs becoming widespread default fs sooner than some 5 years.
      i'd be glad if proven wrong, though ;)

      --
      Rich
    61. Re:Why not ZFS? by richlv · · Score: 1

      it could have been dual-licensed. and "GPL is almost as bad as proprietary" bit was unnecessary trolling.

      --
      Rich
    62. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      It is because LVM is a generic block device with no idea of the data content - when the filesystem writes to it LVM has to rewrite the entire stripe i.e read the full stripe, modify data, calculate parity and write. Because LVM cannot assume to know what is contained within the RAID stripe, it must do full read-modify-write cycles and rewrite the entire stripe, even if you are writing to a previously unused area or just modifying one bit in an entire stripe.

      RAID-Z works because ZFS knows exactly what is written where and uses variable stripe widths to avoid the read-modify cycle. It calculates the full write, finds somewhere that fits it, and writes the full stripe - no read/modify cycle. Because of the COW operation the old block gets cleaned up later (or not if you are using a snapshot).

    63. Re:Why not ZFS? by richlv · · Score: 1

      to be fair, reading about brtfs, the design also mentions things like snapshots, mirroring, included raid stuff.
      http://btrfs.wiki.kernel.org/index.php/Main_Page
      i guess, some middle ground will be reached with brtfs for code reuse and functionality.

      --
      Rich
    64. Re:Why not ZFS? by Wonko · · Score: 3, Interesting

      Of course, the very same copy-on-write will also result in massive file fragmentation, carefully smearing your dbf files over the entire platters, making your SAN caches useless. Over time resulting in horrible read performance.

      If you want good database performance you probably want as little file system overhead as possible between your database and the disk. I wouldn't have expected ZFS to be the most efficient place to store a database.

      I would have to imagine your SAN is just doing uninformed readaheads. That would be a very good way to fill up a cache with useless data if you are reading from a fragmented file system. :)

      This issue with copy-on-write will be entirely sidestepped in a few years by flash storage's lightning fast seek times and smarter caching. IIRC, isn't the reason that zfs-fuse uses so damn much ram because ZFS has its own caching logic built in? If the file system knows where all the blocks in a file are it can do readaheads on its own.

      ZFS is certainly a huge improvement for anyone used to ufs and disksuite, but I have to say that using it in the real world it's not all it's cracked up to be.

      I don't have enough of my own real world experience with ZFS to argue with your experience. In fact, what I know of how ZFS works makes me believe that it can cause exactly the problems of which you speak.

      However, I don't think that means that there aren't a ton of workloads that wouldn't be impacted by these problems. I also believe that a large percentage of those workloads could benefit greatly from some of the features ZFS brings to the table.

      RAID-Z is nice when you need write performance but can't afford the drives for RAID 10. I can think of plenty of times when it would have been nice to have a writable snapshot to chroot into.

      Hell, I would even love to have ZFS on my laptop for snapshotting and cloning. It also seems like ZFS send/recv would make for much more efficient backups of my laptop than rsync buys me.

      Mixing together the features of various layers is, imo, no matter how tempting, simply the wrong approach. Proceed further along that road and you get to record based filesystems or even more special-purpose variants. I mean, there are even more optimizations that you can do if you know the _contents_ of the files.

      I think we are getting some pretty neat new features out of our file systems by blurring the lines between the layers. I wouldn't be surprised if we stumble upon a few more neat ideas before we're through.

      There is still quite a bit of improvement to make even before we have to make the file system aware of what is inside our files. :)

    65. Re:Why not ZFS? by richlv · · Score: 1

      i guess zfs way does not allow reuse of most of the functionality by other filesystems. thus while being very efficient in the short run, it could hinder long term development of new filesystems (not only as replacements for it but also as specific niche products)

      --
      Rich
    66. Re:Why not ZFS? by Berzelius · · Score: 1

      Sun could have tripple-licensed the ZFS code under both CDDL and GPL v2 and v3.

      They didn't do that. So obviously Sun choose to make ZFS incompatible with Linux.

    67. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      What is wrong with "zfs scrub"? Any corruption it detects it will fix, provided you have a redundant copy. If it finds corruption it will try to fix it on the fly anyway, or mark it to be repaired (normally removing the corrupted file).

      The design of the filesystem itself means it will detect far more corruption than any other filesystem can (to a fault, bad non-ECC memory on my box caused some corruption which ZFS detected but UFS would have paniced on).

    68. Re:Why not ZFS? by uglyduckling · · Score: 1

      Maybe so, but the vast majority of GPLv2 software uses the standard preamble which states "This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version."

    69. Re:Why not ZFS? by QuoteMstr · · Score: 1

      You do realize that some cruel person is going to code up "lsgrepsortfind" in Perl and inflict it on the world, right? :-)

    70. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      Because it *does* have patent issues, in fact it has 56 at least.

      http://h3g3m0n.wordpress.com/2007/06/09/zfs-on-linux-freedom-can-be-so-restrictive/

    71. Re:Why not ZFS? by jcookeman · · Score: 1

      ZFS is not only outstanding, administration is a breeze. Why would we not want it on Linux? The simple answer: CDDL. Sadly. Competition is good. And, it's good to have btrfs. But, we have btrfs not for this reason. We have btrfs because of GPL versus everyone else wars. Sometimes it's an absolute shame that the wheel must be reinvented -- and in many cases -- badly. I'm still waiting on the list of features from btrfs that will "leapfrog" ZFS.

    72. Re:Why not ZFS? by asc99c · · Score: 1

      I don't see ZFS as un-UNIXy though. It still does 'one thing', just at a higher level - file storage - as pointed out by the GP.

      It looks to me that they've figured out the stuff that people wanted in numerous lower layers, and built a component that does one bigger thing. But it's still a nice contiguous block of functionality.

    73. Re:Why not ZFS? by QuoteMstr · · Score: 3, Informative

      I'm definitely in the layered-design-is-good, ZFS-is-an-abomination camp. But I do have to point out that mlockall would keep a userspace filesystem server from being swapped out, and with realtime priority, the process could even have some guaranteed CPU time. Userspace isn't that bad.

    74. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      Didnt mean to start a license war here. You are right about the GPL being the reason ZFS cannot be included and not CDDL, however just stating that GPL is incompatible with anything else that can't be re-licensed as GPL does not explain the reasons nor give GPL the credit it deserves.

      The GPL incompatibilities come from GPL's primary goal of ensuring the end-user's freedom to the code and their freedom to modify it to their needs. This also means for the freedom to be effective, the user should be able to obtain all the dependencies, be it libraries or even the build system, without the potential risk of being denied unrestricted access to them. While a license not as restrictive as GPL may remain freely available forever, there is still the chance that someone or some organization may add some crucial change and introduce restrictions on the availability of the modifications. This will indirectly restrict the freedom of the end-user even if the original license didn't appear to restrict them. A piece of code which can no longer be freely used/modified by the user due to license (or even patent restrictions) no longer give freedom to the user, even if the code is available.

      This is were GPL ends up being incompatible as it does not make a compromise with its goal. It has always been consistent with its goal, even if it may alienate many developers/distributors who would prefer a developer/distributor friendly license over a end-user friendly one, even if the approaches to achieve it get changed over time to tackle any loopholes at the cost of the newer versions being incompatible with older version.
      The price of freedom is eternal vigilance - Thomas Jefferson.

    75. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      The citations that i could not find before: http://kerneltrap.org/node/8066 , also the video of debconf 2006 is online, you could watch it (from the 25 min mark).

      It is clear on what Sun's intent was, and if they really wanted to be compatible with all other OSS license, they could have just multiple licensed it.

      You may not like the GPL or even plain misunderstood it, but given the comparison to proprietary licenses, general tone and all the misleading/false statements looks like you are a troll

    76. Re:Why not ZFS? by segedunum · · Score: 1

      And if we ask people which is better monolithic or microkernel, they reply microkernel.

      Do they?

      But ZFS using FUSE would be like a microkernel driver, so which is it?

      Running a next generation Linux filesystem purely from userspace is simply not an option. The obvious performance issues kill it for one thing, not to mention integration with VFS, RAID and LVM.

    77. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      Why chain together tools yourself, and manually think about things you really shouldn't be thinking about, when...

      What you're saying is, "Why have a Unix mindset, when you could pull a monolithic Microsoft?" Bravo, Sir. Bravo. I look forward to your patent on Wheel Mk 2.

      ZFS comprises lots of good ideas, none original, all brought together wrongly. It's "good enough", which is what you'd expect from Sun, but it's the kind of thing open source should be looking at and improving on, not porting over and playing catch-up with.

    78. Re:Why not ZFS? by diegocgteleline.es · · Score: 4, Informative

      One of the differences I can find between btrfs and ZFS is that ZFS explicitely avoided a fsck utility, and btrfs is explicitely designed with features designed to make fsck even more powerful than it's on usual filesystems like ext3. In btrfs, data structures have "back references", and the fsck can be used while the filesystem is mounted.

      IMO, this is a a btrfs advantage. ZFS has checksums and will find errors, but only will be able to self-heal the errors in a redundant configuration. On a single disk, ZFS will find the error thanks to checksums but will not be able to recover your data. Since ZFS was mainly designed for systems that will use redundant configurations, it may have sense there, but desktops are not never going to do such things. IMO the ZFS people were a bit elitist here - "let's going to build a filesystem so good that we won't need a fsck". But in the real world you _are_ going to need a fsck util. Only in excepcional and very rare cases, but you're going to need it.

      Of course that doesn't makes ZFS a bad filesystem, but it's an advantage for btrfs and linux.

    79. Re:Why not ZFS? by segedunum · · Score: 0

      Funny, but not the sort of thing that should be on a Wikipedia page describing filesystems.

    80. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      ZFS while great has a few limitations; you need at least a 64bit processor with 2Gb RAM minimum for it to run smoothly. Zpools are inflexible that you can only add to the storage pool and not remove and raidz are fixed to the number of disks you specify.

      It's an enterprise filesystem that needs planning to get right and doesn't really offer that much advantage to average joe linux user with one or two drives. Furthermore, the GPL is incompatible with the CDDL which ZFS is licensed under.

    81. Re:Why not ZFS? by segedunum · · Score: 1
      Hmmmmmm. I'm just wondering why Sun didn't manage to do the dual licensing thing for ZFS and Solaris that they did for Java ;-). That would really have meant no license issues whatsoever.

      ZFS is present in both Mac OSX and FreeBSD, thank you! They have no license issues whatsoever.

      Good for them (if indeed Apple manages to actually use ZFS). Unfortunately, they get far fewer contributions of source code, and there'll be even less, if any, flowing into OpenSolaris ;-).

    82. Re:Why not ZFS? by makomk · · Score: 3, Informative

      Yeah, and if you get any sort of metadata corruption, you're apparently fscked. See, for example, this thread in alt.sysadmin.recovery. Several of the posters say they basically had to manually fix the filesystem after it got screwed up - how very 1970s.

    83. Re:Why not ZFS? by segedunum · · Score: 2, Interesting

      ZFS has checksums and will find errors, but only will be able to self-heal the errors in a redundant configuration. On a single disk, ZFS will find the error thanks to checksums but will not be able to recover your data. Since ZFS was mainly designed for systems that will use redundant configurations, it may have sense there, but desktops are not never going to do such things.

      I find this checksumming and self-healing interesting, but the real question is what do you actually do to really solve it? With ZFS, an awful lot of people over at OpenSolaris get excited about detecting 'bit rot', but answers are a bit thin on the ground when you ask what can be done about it or what some of the errors actually mean. Yer, you're a bit less likely to get data loss, but you can only really avoid that if you have redundancy. Also, most of the problems ZFS has detected that I have seen have, at a best guess, probably been caused by a Solaris device driver doing something no one had known about. The filesystem can't help you there, no matter how advanced it is.

      The problem is our current storage technology, and more needs to be done where the problems occur - within disk drives themselves. I'm hoping SSDs will end up giving us a better fundamental starting point when it comes to storage.

    84. Re:Why not ZFS? by setagllib · · Score: 1

      That's certainly true for stable distributions, but seeing how quickly distributions like Ubuntu jump on new features, it's possible that btrfs will be an option for / within 2-3 years. Certainly not default, of course.

      Well, reiserfs shot to default status for some distributions almost immediately after its release, and long before it was stable.

      --
      Sam ty sig.
    85. Re:Why not ZFS? by DarkOx · · Score: 1

      Actually, You can mix gpl2 and gpl3 code period. Linus specifically struck the "or any later version" clause out of the copy of the gpl2 he used to license Linux. You could argue that Linux is not truly licensed under the gpl2 at all because he changed the license.

      --
      Repeal the 17th Amendment TODAY! Also Please Read http://www.gnu.org/philosophy/right-to-read.html
    86. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      No more than any other filesystem. Granted at the moment ZFS is a little panicy when it does hit a bad pool, but I have personally had mirrored pool corruption caused by bad non-ECC memory (wrote bad checksums or blocks, on read back they were marked as bad. The fix was to rm the affected files and recover).

      But how is that different to any other corrupted filesystem? In UFS I have had situations where fsck simply removes every inode in the filesystem because of some recursive error. At some point you will hit a situation where you need to throw it out and recover data. The difference is ZFS will generally tell you when that is and normally let you know when something is corrupting your data.

    87. Re:Why not ZFS? by not+already+in+use · · Score: 1

      Unless ZFS has patent issues, why not just work on having ZFS as Linux's standard FS, after ext3?

      It's ironic, isn't it? The Linux community's strength is supposed to lie in the fact that they can share (and easily port) other people's work. Instead, in true Linux (and OSS) fashion, they create yet another implementation of essentially the same thing. After almost 10 years of following and using Linux, it's become apparent to me that it's totally an ego thing, too.

      --
      Similes are like metaphors
    88. Re:Why not ZFS? by jhol13 · · Score: 1

      How does those "back references" recover your data in case of a corrupted sector? Honest question, I do not know brfs.

      AFAIK ZFS has no fsck because there is no failure case where it would really help.

    89. Re:Why not ZFS? by jhol13 · · Score: 3, Insightful

      If a filesystem detects errors it is helping me (at least) there. No matter what creates them.

      I do not think SSDs will solve storage problems: there will be flaky adapters and other IF chips/firmware, etc.

    90. Re:Why not ZFS? by BrainInAJar · · Score: 0, Flamebait

      it doesn't make a lick of difference to the users if the code is GPL, CDDL, BSD, or for that matter even closed-source, they just want something that works.

      License debates are debates precisely among developers.

      and fuck you "Sun are anti-user"... Sun is the largest contributor to open-source in the world, even leaving out Java and OpenSolaris. Plus, no other company has had the balls to open-source their crown jewels. Look at happy open-source friendly IBM... Lotus: closed. AIX: closed. Websphere: closed... and the list goes on.

    91. Re:Why not ZFS? by MikeBabcock · · Score: 1

      The standard text of the GPLv2 DOES in fact give you that permission. "Version 2 or later." Therefore, I can take a GPL'd project that was released before GPLv3 and so long as the author did not change that text to "GPL version 2 ONLY" (like the Linux kernel), I can then use it as though it were GPL version 3 without talking to anyone.

      --
      - Michael T. Babcock (Yes, I blog)
    92. Re:Why not ZFS? by MikeBabcock · · Score: 1

      Given the option, I'd use a self-healing non-journaled filesystem for my /boot and root partitions (which does not include /var or /home), and journalled faster file systems for /var and /home.

      As it stands, I use ext2 for /boot, ext3 for almost everything else with different mount options.

      --
      - Michael T. Babcock (Yes, I blog)
    93. Re:Why not ZFS? by MikeBabcock · · Score: 1

      Why chain together tools yourself, and manually think about things you really shouldn't be thinking about, when you can have a good filesystem take care of it for you.

      Beats me, real sysadmins write scripts that do it for them. We have shell scripts for a reason.

      --
      - Michael T. Babcock (Yes, I blog)
    94. Re:Why not ZFS? by Waffle+Iron · · Score: 1

      On top of that, kernel space isn't generally swapped out, but user space can be. Obviously it should never happen, but wouldn't it suck if your disk driver was swapped out?

      It's silly to worry about hypothetical problems like that. They're just not an issue in the real world.

      In fact, I configured my system to use a swap file stored in a FUSE filesystem, and I've been running lots of apps just fine with no probl

    95. Re:Why not ZFS? by GleeBot · · Score: 2, Informative

      How does those "back references" recover your data in case of a corrupted sector? Honest question, I do not know brfs.

      AFAIK ZFS has no fsck because there is no failure case where it would really help.

      Back references could help you reconstruct the file system tree during fsck, but if random data is getting corrupted, you're not going to get it back without redundancy (or forward error correction, I suppose, but that amounts to the same thing).

      I can't think of many scenarios where the only kind of data corruption I'm worried about is corruption to file system metadata (which is incidentally all journaling is supposed to protect you from), but who knows.

    96. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      IMO, this is a a btrfs advantage. ZFS has checksums and will find errors, but only will be able to self-heal the errors in a redundant configuration. On a single disk, ZFS will find the error thanks to checksums but will not be able to recover your data.

      Not entirely true.

      By default this is the case, but you can tell ZFS to make multiple copies of each data block on the same disk:

      # zfs set copies=2 homedirsFS
      http://blogs.sun.com/relling/entry/zfs_copies_and_data_protection

      By default a lot of the metadata is stored in multiple locations (even in single-disk configurations).

    97. Re:Why not ZFS? by Goaway · · Score: 1

      Jokes are only allowed in explicitly marked humour zones.

    98. Re:Why not ZFS? by samkass · · Score: 1

      Since ZFS was mainly designed for systems that will use redundant configurations, it may have sense there, but desktops are not never going to do such things.

      From what I understand, Macs are the most likely systems to get ZFS as a standard mainstream desktop file system. And MacOS these days highly encourages all users to have an external or network-accessible backup drive where deltas get backed up once an hour or so and are extremely easy to restore from. If ZFS found a problem, recovering the bad data from online backup shouldn't be rocket science.

      --
      E pluribus unum
    99. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      so why would you want to run fsck on read-only partitions, again?
      if that partition appears unclean, aren't you already doomed and no fsck is going to help you?

    100. Re:Why not ZFS? by ckaminski · · Score: 1

      Umm.... Linux with LVM has had snapshotting for over five years, at least as long as Windows (Windows 2003).

    101. Re:Why not ZFS? by BobNET · · Score: 2, Informative

      Linus specifically struck the "or any later version" clause out of the copy of the gpl2 he used to license Linux.

      That piece of text isn't part of the license itself, it's part of a separate standard notice that states that the software is copyrighted and gives permission to redistribute or modify it under the terms of the GPL. It could just as easily have said "either version 2 of the License, or (at your option) any license you want in exchange for buying Linus a beer" and still be under version 2 of the GPL.

    102. Re:Why not ZFS? by compro01 · · Score: 1

      In fact, I configured my system to use a swap file stored in a FUSE filesystem, and I've been running lots of apps just fine with no probl

      I'd take that as more of a sign that the swap is semi-never used, rather than a sign that FUSE is quick.

      Unless I'm missing the joke with the end of the sentence cut off.

      --
      upon the advice of my lawyer, i have no sig at this time
    103. Re:Why not ZFS? by Just+Some+Guy · · Score: 1

      What, it tickles your hard drive to death?

      --
      Dewey, what part of this looks like authorities should be involved?
    104. Re:Why not ZFS? by compro01 · · Score: 2, Informative

      Which is why it got edited out. Note the "oldid" bit in the URL.

      --
      upon the advice of my lawyer, i have no sig at this time
    105. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      # cd /usr/src/linux
      # wc -l fs/ext3/*.c fs/jbd/*.c drivers/md/*.c

          70004 total

      And that with comments included.

      If you start bloated, everything seems better.

    106. Re:Why not ZFS? by Just+Some+Guy · · Score: 1, Insightful

      In btrfs, data structures have "back references", and the fsck can be used while the filesystem is mounted.

      Welcome to the party!

      Sincerely, FreeBSD 5.0 - 2001

      --
      Dewey, what part of this looks like authorities should be involved?
    107. Re:Why not ZFS? by Just+Some+Guy · · Score: 1

      You could argue that Linux is not truly licensed under the gpl2 at all because he changed the license.

      You can't, as Linus did not distribute a modified copy of the GPL. Instead, he used his own wording instead of the FSF's suggested suggested phrasing to state that he was releasing Linux under the GPLv2.

      --
      Dewey, what part of this looks like authorities should be involved?
    108. Re:Why not ZFS? by compro01 · · Score: 1

      The problem is that ZFS DOES have patent problems, 50 or so of them IIRC, in addition to the fact that Sun's CDDL license is GPL-incompatible and Sun apparently doesn't feel like dual/multi-licencing it.

      Bar Sun changing the licensing, I do not see ZFS hitting it big on Linux anytime soon.

      --
      upon the advice of my lawyer, i have no sig at this time
    109. Re:Why not ZFS? by feld · · Score: 1

      LVM snapshots are neutered and almost a complete waste of time. Can you restore an LVM back to the state at the time of the snapshot? Not yet.... Instead you're forced to do cp and dd operations. Inexcusable.

    110. Re:Why not ZFS? by MSG · · Score: 1

      Rather, GPL is incompatible with anything else that can't be re-licensed as GPL... May first we clear that mess, right ?

      Please, let's.

      First and foremost: only the original author or copyright holder may re-license any copyrighted work. Period. Including software under licenses like BSD in GPL copyrighted works does not re-license the BSD code. The BSD licensed code remains under the BSD license; it continues to allow re-use under the BSD terms.

      Because confusion on this issue has been fairly common, the Software Freedom Law Center published a paper on the subject after the Linux ath5k debacle.

    111. Re:Why not ZFS? by diegocgteleline.es · · Score: 1

      When I said "back references", I meant that some data structures have pointers back to their "parent" data structure, I wasn't talking about softupdates.

      AFAIK, softupdates in practice never were better than journaling, and since these days filesystems like ZFS and btrfs avoid the problems that softupdates and journaling tries to solve by maintaining the on-disk data always in a consistent state, softupdates is just a useless source of complexity and overhead.

    112. Re:Why not ZFS? by kwalker · · Score: 0

      While I don't have much to add, I have noticed at least two comments from you in this article stating that LVM snapshots are read-only. This is incorrect.

      While LVM+ext[3|4] does have shortcomings, namely replication and versioning, along with a few minor things like tail-packing, LVM2 (Which debuted in Linux 2.4 or 2.6 I forget) does have writable snapshots. I have used them many times for backups as well as cloning VMs.

      LVM2 is capable of a lot.

      --
      ... And so it comes to this.
    113. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      With ZFS, an awful lot of people over at OpenSolaris get excited about detecting 'bit rot', but answers are a bit thin on the ground when you ask what can be done about it or what some of the errors actually mean. Yer, you're a bit less likely to get data loss, but you can only really avoid that if you have redundancy.

      How thin is bit-thin?

    114. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      My memory of the GPL compatibility matrix doesn't support that, but it may be true for GPLv2-only rather than the ordinary GPLv2-or-any-later-version (which does cover the Linux kernel).

      The program may specify that future versions of the GPL apply to the program, but this is not part of the stock GPL itself. Large projects suffer greatly from being able to adapt to new licenses if they take patches from developers without requiring that the copyright be assigned to the owner of the project. This is why the Linux kernel is effectively stuck at GPLv2 forever.

      That said, I thought Sun was trying to GPLv3 ZFS? It's certainly a lovely filesystem, though I guess they're fighting with NetApp over patents on it.

      I've heard lots of rumors regarding this, but nothing substantial. It certainly fits in with the rumors that they want to keep it out of the Linux kernel, however IIRC, OpenSolaris is CDDL, and it would make sense to keep their licenses consistent.

    115. Re:Why not ZFS? by Fri13 · · Score: 1

      GPL is restrictive as it is _because_ it sensures freedom for users and technology. It is the _marketing people_ that the GPL bugs.

      (And money hungry people, who cant control the world.)

      I think the developers are happy if they dont need to reinvent all technologies... but can use and help whole world to develop faster when information is free...

    116. Re:Why not ZFS? by Ayanami+Rei · · Score: 1

      Running your filesystem in userspace could be quite satisfactory provided the service was well written (for example, NTFS-3g).
      The problem that you couldn't code around with a performance impact was the inability to use mmap to write files. Reading was okay, but writing, no. It would be necessary to fall back to using read/write/seek system calls and more copies of buffers in and out of kernel memory. Recently patches have been added that permit a FUSE user-space process to implement writable-mmaps and the associated performance enhancements, but few FUSE filesystems have been modified to take advantage of this.

      --
      THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
    117. Re:Why not ZFS? by BrentH · · Score: 1

      It by the way does work with every filesystem, but what wrong with reinventing the wheel? I automobiles we have done it multiple times, from wooden wheels with spokes to metal ones of of s single sheet of steel. Improvement is possible, really ;)

    118. Re:Why not ZFS? by BrentH · · Score: 1

      Yeah, and real men write their own OS... Why write scripts when a computer can do the same automagically for you?

    119. Re:Why not ZFS? by chromatic · · Score: 1

      Fair point. The Apache 2 license might have worked then.

    120. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      I've been able to do this forever with LVM snapshots under Linux.

      Where forever is defined as "since 1998".

    121. Re:Why not ZFS? by Anonymous Coward · · Score: 1, Insightful

      ZFS was designed without fsck because fsck doesn't recover filesystem data just metadata. ZFS metadata is written using "ditto" blocks. This means its written in two different places on the disk. When scrub is called and zfs finds metadata problems they are corrected using this redundant data. Thus fsck is useless.

      What you're talking about is if you're actual data is damaged (ie the checksum doesn't match) then zfs can only correct this if its applied some sort of raid to the disk. As far as I'm aware fsck utilities can only correct metadata issues with the filesystems. Actual data corruption issues are beyond the scope of the standard fsck utility.

    122. Re:Why not ZFS? by Wonko · · Score: 1

      While I don't have much to add, I have noticed at least two comments from you in this article stating that LVM snapshots are read-only. This is incorrect.

      I apologize for my outdated knowledge. I very much remember them being read only in the LVM1 days. I imagine I have never made an attempt to mount a snapshot read/write since.

      LVM2 is capable of a lot.

      I would agree that it certainly does a lot, and I would be very upset if someone tried to take it away from me. It is just that we are to a point now where we've squeezed most of the possible functionality out of DM+LVM that we can.

    123. Re:Why not ZFS? by Courageous · · Score: 1

      Well, yes, however to be clear, it's not the CDDL that prohibits this, but rather the GPL.

      C//

    124. Re:Why not ZFS? by Haeleth · · Score: 1

      I'm confused: if we ask people why not run ZFS using FUSE, they reply because it's slow. And if we ask people which is better monolithic or microkernel, they reply microkernel. But ZFS using FUSE would be like a microkernel driver, so which is it?

      Please provide your citations proving that there is a significant overlap between the "they"s who make these two claims, and your benchmarks proving that FUSE is not slower than a real microkernel driver. If you don't have those two things, then your confusion is ill-founded.

    125. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      OSX and FreeBSD are not competitors and probably never will be, that's why it makes sense to let those support ZFS and take user share away from Linux.

      And I'm sure both FreeBSD users and all four people using OS X on servers were delighted by this decision.

    126. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      Because that's the Unix way - build small components (applications) and chain them together to create something out of the parts. I mean, why have ls and grep when you can have lsgrepsortfind?

      Bad analogy. Explain to me why I need ext2/ext3/ext4/btrfs/reiserfs/xfs/jfs/etc, fsck, a script to automatically run fsck, fdisk, fstab, mount, AND /dev/sda0+ (in addition to /dev/sda)? In the zfs world, all you have is zfs and zpool.

      Let's be honest here. You're not going to be changing the fstab on a day to day basis and you're not going to be changing your file system or partitions just because you felt like it. But I'll tell you, the day you do want to do that, you're going to be pissing in your pants as to why it is so ridiculously stupid to change. Get some hardware and throw freebsd 7 on it and try out ZFS before making bad analogies. ZFS does take more responsibility but also simplifies unnecessary complexities.

    127. Re:Why not ZFS? by Haeleth · · Score: 1

      Linux users by and large respect other people's copyrights.* After all, the GPL only works because of copyright -- without it, there's no way we could stop companies taking GPL code and using it without giving anything back in return.

      Most of the pirates who suck up bandwidth downloading illegal copies of music, movies, software, etc. are Windows users. (It's easy to tell where software is concerned: just look at what platform it runs on.)

      Don't let the fact that both kinds of people post on Slashdot confuse you. There's lots of different overlapping communities here.

      * Patents are a different matter -- plenty of Linux users have no problem openly violating patents on things like MP3 and MPEG4. But that's completely beside the point.

    128. Re:Why not ZFS? by ArsonSmith · · Score: 1

      How about:

      Sun could have made ZFS compatible with Linux.

      They didn't do that. So obviously Sun choose to make ZFS incompatible with Linux.

      --
      Paying taxes to buy civilization is like paying a hooker to buy love.
    129. Re:Why not ZFS? by kwalker · · Score: 1

      I apologize for my outdated knowledge. I very much remember them being read only in the LVM1 days. I imagine I have never made an attempt to mount a snapshot read/write since.

      They very much were read-only in LVM1. It was one of the design goals for LVM2 to make snapshots writable.

      I would agree that it certainly does a lot, and I would be very upset if someone tried to take it away from me. It is just that we are to a point now where we've squeezed most of the possible functionality out of DM+LVM that we can.

      This is where we differ. I don't think DM is at its end. Did you know LVM2 can do its own mirroring (i.e. RAID-1)? You probably do know about the DM-crypt, which makes whole filesystem encryption almost trivial.

      I can imagine a system (Maybe LVM3) that allows the filesystem sitting on top of it to request changes to the underlying LV, maybe allocating additional PEs for snapshots or space for additional redundancy for specific objects like RAID-Z, or automatic object replication between PVs over iSCSI.

      --
      ... And so it comes to this.
    130. Re:Why not ZFS? by Ant+P. · · Score: 1

      And don't give me that 'ZFS doesn't need a fsck' crap.

      Hmm... so that must mean you do give a fsck?

    131. Re:Why not ZFS? by tony.damato · · Score: 1

      If one looks at that thread, it digresses into the extreme, talking about one's flatmate and discussions on restaurants...

      Besides, it was posted back in May, and about ZFS on FreeBSD... the version on Solaris and OpenSolaris has progressed massively since then.

      For information from "the horse's mouth", so to speak, here is some interesting reading:

      http://www.solarisinternals.com/wiki/index.php/ZFS_Best_Practices_Guide

    132. Re:Why not ZFS? by Luminary+Crush · · Score: 1

      ZFS duplicates a lot of functionality that belongs outside of a filesystem.

      That may be your opinion. The most advanced storage technology companies tend to disagree with you.

      Some of the features, such as the pointer-based manipulation of the filesystem for snapshot purposes, are not possible to be performed outside the filesystem. Let's consider a snapshot. First (side note), a beef I have: Sun calls ZFS snapshots "Copy on Write" - which never made sense to me; there's no copy on write happening during a ZFS snapshot, the new data is written to a new location (not copied) and a new pointer laid down for it; the old data sits in it's initial location with the 'old' pointer. What is traditionally called a "copy on write" snapshot is how most of the world does snapshots: 1) read old data block, 2) copy to 'snapshot' area, 3) write new data block. This is three disk I/Os for each block written while a snapshot is active on the system (add another write copy if you have a second snapshot active, and keep going as you add snapshots until the system crawls to a stop). "Traditional" filesystems can't move their pointers around to manipulate themselves; that's what makes ZFS snapshots so powerful (and also why NetApp is suing them - because this type of filesystem was patented in NetApp's WAFL many years ago). Maybe they chose to co-opt the C.O.W. name to somehow try to dodge a NetApp lawsuit (oops). Anyway.

      With a pointer-based snapshot your I/O looks like this: 1) write new data block (along with pointer) to active filesystem. Done. One I/O. The old pointer is captured in the 'snapshot' filesystem and you don't need to do anything to it to manipulate it. So, almost no performance penalty to carry a live snapshot (or two, or three, etc).

      That's why it's routine for people running NetApp filers to be live with multiple, or even tens of snapshots 'online' for instant backup recovery... and why competing storage vendors like EMC rarely run with a snapshot for more time than it takes to make a backup or replicate that point-in-time data. It costs too much in I/O overhead. ZFS has a similar scheme (hence the lawsuit) and provides similar advantages (NetApp WAFL is also a quota filesystem, which simplifies volume sizing/management very nicely; ZFS, from what I've used of it, seems quite similar). This makes life in the storage management world better. A lot better.

      Historical vendor FUD about 'fragmentation' and 'Read I/O' has been directed at NetApp/WAFL for years; yet they continue to be a leader in benchmark performance. Maybe if you own the filesystem (WAFL) you have a much better idea about where to look for the next blocks, rather than using complex and arbitrary SAN read-ahead cache schemes as implemented by other vendors.

      So, I'd say that there are some compelling reasons to have this intelligence in the filesystem and not at some arbitrary level below it.

    133. Re:Why not ZFS? by Rutulian · · Score: 1

      No, you are misunderstanding what copy-on-write means. See here for a good explanation. In the filesystem world, copy-on-write means you have multiple pointers (for multiple snapshots) pointing to the same block until the block changes. Then the changes are written to a new block and the pointers are updated ("past time" snapshots continue to point to the old block while the "current time" snapshot is changed to point to the new block)...almost exactly what you explained for ZFS.

      I haven't been following the details on the NetApp case but, as far as I can tell, it's not about copy-on-write per se, but about their specific implementation of it which preserves filesystem consistency ("the tree-of-blocks" design). There have been several implementations of copy-on-write in filesystems, so I don't imagine the concept by itself is patentable.

    134. Re:Why not ZFS? by BrokenHalo · · Score: 1

      so why would you want to run fsck on read-only partitions, again? if that partition appears unclean, aren't you already doomed and no fsck is going to help you?

      I didn't say my /boot partitions had ever sprung an error. They do, of course, get remounted read-write for kernel upgrades and so forth, and it's not a bad policy to fsck it occasionally.

      And no, I'm far from doomed if it does spring an error. I've got plenty of other ways of booting the system.

    135. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      Just because Sun hasn't sued the BSD community or Apple doesn't mean they never will. At least the Linux community has learned from history (SCO, AT&T).

    136. Re:Why not ZFS? by evilviper · · Score: 1

      AFAIK, softupdates in practice never were better than journaling,

      Several benchmarks directly disagree. No I don't have links handy, it's been years. No doubt you can find them as easily as I.

      and since these days filesystems like ZFS and btrfs avoid the problems that softupdates and journaling tries to solve by maintaining the on-disk data always in a consistent state, softupdates is just a useless source of complexity and overhead.

      What? Maybe you know what you're talking about, but it sure doesn't sound like it, here.

      What does "maintaining the on-disk data always in a consistent state" even mean? And how is it any difference than the age old practice of synchronous (or atomic) writes? How is reordering metadata, so that partial or incomplete writes can be rolled back, any different? What's more, how is it "better" in any way?

      The overhead of softupdates is really very minimal, hence being even faster than journaling (which really does have some notable overhead). No doubt there's always some room for improved performance, but very little where softupdates are concerned. Complexity is also really quite minimal, and hardly notable next to the functionality any modern filesystem has (ie. reordering metadata is nothing compared to reordering bulk data to minimize seeks).

      ZFS' copy-on-write allows it to avoid FSCK, but it has some fairly nasty trade-offs, and there's plenty of examples where an FSCK tool was needed. I know next to nothing about BTRFS, so I admit there may be something here I'm missing, but ZFS certainly isn't a good example of not needing FSCK. Never mind that UFS2 with softupdates doesn't specifically NEED an FSCK after a power failure, allowing immediate read/write use of a filesystem, and only using background FSCK to really only to reclaim free blocks still marked as allocated. ie. very minor housekeeping that can really be put off indefinitely.

      --
      Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
    137. Re:Why not ZFS? by iowannaski · · Score: 1

      Hardly. Replace "GPL" with "BSD" and the statement is false.

      --
      i forget
    138. Re:Why not ZFS? by volsung · · Score: 1

      True, but the ZFS-FUSE author also had to port over some Solaris-specific utility code that ZFS depended on. From what I remember following the ZFS-FUSE development initially, that was almost as time consuming as writing the FUSE wrapper.

    139. Re:Why not ZFS? by drinkypoo · · Score: 1

      I'm confused: if we ask people why not run ZFS using FUSE, they reply because it's slow (I'm assuming it's possible to load ZFS at boot time using an initrd). And if we ask people which is better monolithic or microkernel, they reply microkernel. But ZFS using FUSE would be like a microkernel driver, so which is it?

      You ARE confused, but you're confused about your basic premise. ZFS using FUSE would be NOTHING like using a microkernel driver. Here's why: In a Microkernel system, everything is a user-space process which communicates with the kernel (and/or some other process) via a message passing interface. This can well be based on shared memory but there is still some kind of overhead, which is why we generally don't use them; we want to be able not only to wring every bit of performance out of the machine, but also to decide precisely when an event will occur.

      Now, ZFS using FUSE is kind of like using a microkernel driver, because it would run in user space. But it would be unlike it as well, because Linux is not a microkernel.

      AFAIK you can load FUSE from an initrd, with whatever you would need to load ZFS. But FUSE is dog slow. It is simply not a solution for a root filesystem. It's also bloaty for that purpose, although that is less of an issue than the speed.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    140. Re:Why not ZFS? by drinkypoo · · Score: 1

      The standard text of the GPLv2 DOES in fact give you that permission.

      You mean this:

      9. The Free Software Foundation may publish revised and/or new versions of the General Public License from time to time. Such new versions will be similar in spirit to the present version, but may differ in detail to address new problems or concerns. Each version is given a distinguishing version number. If the Program specifies a version number of this License which applies to it and "any later version", you have the option of following the terms and conditions either of that version or of any later version published by the Free Software Foundation. If the Program does not specify a version number of this License, you may choose any version ever published by the Free Software Foundation.

      In other words, if you just say "I licensed under the GPL" then you can use any version. If you specifically say "I licensed under the GPL version 2" then as per section 9 you licensed only under the GPL version 2, as you stated. Only in the case in which the author specifically states that the software is licensed under Version 2 or any later version is this the case; in fact, a user who simply states that the code is released "under the gnu public license" has just cross-licensed their code under ALL versions of the GPL that have EVER been released, at least as per GPL version 2!

      See also:

      How to Apply These Terms to Your New Programs

      (Et cetera)

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    141. Re:Why not ZFS? by drinkypoo · · Score: 1

      it doesn't make a lick of difference to the users if the code is GPL, CDDL, BSD, or for that matter even closed-source, they just want something that works.

      The users want something that they won't be locked up in. The GPL provides them that. They don't know they want it, and those who want to prevent them from getting it have spent a lot of time and money convincing them that they can't have it.

      License debates are debates precisely among developers.

      The GPL, however, was written to protect the user, by protecting the software. As a side effect, it protects the interests of some developers... who are also users.

      and fuck you "Sun are anti-user"... Sun is the largest contributor to open-source in the world, even leaving out Java and OpenSolaris.

      If only we could.

      Plus, no other company has had the balls to open-source their crown jewels.

      I guess SGI doesn't count? (Let's face it, there was nothing left in IRIX to care about besides XFS.) Anyway, OpenSolaris isn't all of Solaris, AIX is hardly the crown jewels any more (IBM sells more Linux than AIX... Linux is IBM's crown jewels now) and Open Sourcing Java is a move of desperation, not a warm embrace. Sun kept Java closed as long as possible, until it was clear that they were going to be marginalized if they didn't offer some more compelling reason to use Java.

      P.S. I hope and pray that IBM never opens Lotus Notes or Websphere. Those projects might do as much damage to Open Source as, say, OpenGroupware.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    142. Re:Why not ZFS? by mvdwege · · Score: 1

      Erm, no? The original poster that started that thread on Usenet was specifically referring to ZFS on Solaris. A second poster piped up that he had similar problems on FreeBSD. And your posted link gives no information on how to solve the problem posted. It does give a lot of 'ZFS is great! Rah! Rah! Rah!' twaddle. Obviously written by a PR flack, not a real sysadmin. I like Sun kit, and they do produce good software, but their attitude to reported problems could be a lot better.

      And thread drift? That's normal in asr.

      Mart

      --
      "I know I will be modded down for this": where's the option '-1, Asking for it'?
    143. Re:Why not ZFS? by fatphil · · Score: 1

      Care to cite the exact paragraph of the GPLv2 which says "version 2 or later"?

      --
      Also FatPhil on SoylentNews, id 863
    144. Re:Why not ZFS? by Anonymous Coward · · Score: 0

      First it r(e)ises, then it gets a beating... oh, right, pr0n is what drives tech anyway.

    145. Re:Why not ZFS? by harry666t · · Score: 2, Interesting

      What about kernels written in type-safe languages? (Singularity, all the Java OSs)

      In these systems, ALL the programs are run in one address space. Does it make the whole OS (not just the kernel) monolithic or what?...

    146. Re:Why not ZFS? by Znork · · Score: 1

      I would have to imagine your SAN is just doing uninformed readaheads.

      Yah, if you want to hear me swear about SAN technology drop by one of those discussions some time. I'm not blaming ZFS for that, it's more an example where ZFS's best intentions and assumptions don't quite match reality, and the lack of a multilayer/changeable feature set gets a seriously adverse impact.

      This issue with copy-on-write will be entirely sidestepped in a few years by flash storage's

      Perhaps. But then again, perhaps not. I was personally stunned by the fact that SAN vendors did what they did with the caches (and let's just say, it's not just readaheads they do. And the cache segments are not small separate blocks, either you have the whole huge stripe or you don't). I'm sure it made sense in the days you were caching tape reels for mainframes, because believe me, that's where the tech is coming from.

      So any file and block layers that aspire to be, in any way, generic, simply cannot make assumptions that the underlying layers aren't punchcards. Or people hacking away at stone tablets. Sometimes that's almost exactly what the admins will be stuck with, and playing politics in large organizations to be allowed to buy a thumper or something close to what was intended to be underlying isn't always feasible.

      could benefit greatly from some of the features ZFS brings to the table.

      Without a doubt, and for Solaris it's a huge step forward. I'm just saying it's not as clear-cut as the marketing :).

      Hell, I would even love to have ZFS on my laptop for snapshotting and cloning.

      Mmm, if you're looking for something that might be possible to use for that and you're on Linux, take a look at drbd. It's a device-replication layer (implemented as a device-mapper layer :). I'm personally using it for a shared-device semantics layer between machines below virtual machines (I could just go with straight iSCSI, but drbd gives me some advantages in prettiness and controllability), but I've been thinking about using it along the lines of laptop synchronization and/or remote replication. Having run it for about a year I'm impressed and surprised by its stability and capability to deal with me doing various nasties to it without negatively affecting data integrity.

      I wouldn't be surprised if we stumble upon a few more neat ideas before we're through.

      I'm sure we will, and as long as one is careful not to close doors on the way it's a good idea to transfer such hints and information between layers, but coming to depend on the assumptions is the risk that needs to be avoided.

    147. Re:Why not ZFS? by Znork · · Score: 1

      namely replication

      Not certain it does exactly what you want, but take a look at DRBD. Device replication implemented as a device-mapper layer. I've been using it for about a year as underlying replicated shared storage for xen virtual machines, and found it very useful.

    148. Re:Why not ZFS? by kwalker · · Score: 1

      DRDB is nice for a two-system active/passive failover solution, but does nothing for larger needs or active/active situations. For that (From everything I've read) you need a SAN and GFS2, which is all fine and good except we'd also like to be able to replicate between sites for a site failover solution.

      --
      ... And so it comes to this.
    149. Re:Why not ZFS? by Znork · · Score: 1

      Actually, drbd does active/active these days (since 8.0), so you can do without the SAN.

      (I actually moved over from xen vm's on iSCSI SAN LUN's to drbd as it made some things easier and nicer (having lv's for each device, rather than separate iscsi LUN's, etc) Not quite the same performance, but in the same range at least, and for my purposes in this case the tradeoff was worth it).

      You still need GFS2 or some other cluster-aware file system on top if you want the whole shared-file system semantics with all it entails, but it's certainly doable.

    150. Re:Why not ZFS? by badkarmadayaccount · · Score: 1

      Keyword : 'yet'

      --
      I know tobacco is bad for you, so I smoke weed with crack.
    151. Re:Why not ZFS? by adrianwn · · Score: 2, Interesting

      Obviously the common definition of "microkernel" does not apply to SAS (Single Address Space) systems. The difference between Singularity and Linux is that in Linux all the modules logically belong to the kernel, while they are logically separated in Singularity: in Linux all data structures can potentially be accessed by every module; this is not the case in Singularity. Hence you can call Singularity a microkernel system, even though everything runs in the same address space.

    152. Re:Why not ZFS? by Wonko · · Score: 1

      How thin is bit-thin?

      Depends if he's laying down or if he's perpendicular.

    153. Re:Why not ZFS? by doom · · Score: 1

      ZFS is present in both Mac OSX and FreeBSD, thank you! They have no license issues whatsoever.

      Except for that tendency toward proprietary forks... like the way OSX forked off from FreeBSD>

    154. Re:Why not ZFS? by doom · · Score: 1

      ...and that's it's biggest problem. ZFS duplicates a lot of functionality that belongs outside of a filesystem. All of the above can already be done using any Linux filesystem, so why keep around a second copy of all that code that implements those features for just a single filesystem?

      ReiserFS was (is) in a similar situation, where it also duplicates a lot of functionality that doesn't belong in the filesystem. Not only does this make it harder to maintain, but it makes a lot of features filesystem specific that shouldn't be.

      This is a bit of a peeve of mine: the basic unix file-systems have been broken for so long, and people have gotten so used to dancing around their limitations, that you get people like this who can't imagine why you would want to actually fix the problems with them.

      E.g. Who cares about being able to handle lots of small files? Everyone knows that you should very carefully pack all that data into large files that won't choke the glorious file system.

    155. Re:Why not ZFS? by asaul · · Score: 1

      My personal example - I have my kids photos stored on a ZFS mirror pool on a cheap little Athlon PC. Worked fine, until one day I stopped being able to read some files - some checking and I found ZFS had picked up checksum errors on them on *both* sides of the mirror. The strange part was the more I scrubbed, the more errors I found. I noted that new files were more prone to it, and not all of them either.

      After some consideration I realised the only way both mirrors could be corrupted was if the data was written that way. If it was disk or controller it would be random, this was perfectly consistant on both sides. For that to happen and ZFS to detect it the checksum or data would have to have changed in memory after ZFS calculated the checksum and then be written to disk that way.

      Long story short - I had non-ECC memory and the motherboard had blown a capacitor. It seems that occasionally a flakey area of memory would corrupt some writes and later on ZFS would pick it up.

      With any other filesystem I would never have known.

      --
      "If everybody is thinking alike, somebody isn't thinking" - Gen. George S. Patton
  4. What I'd like by grasshoppa · · Score: 4, Interesting

    I would like transparent, administrator controlled, versioning. Modified a word document and saved it in place? root can go back and get the old version ( and, alternatively, the user can. root could disable this functionality ).

    The pieces are in place, it's doable, just someone needs to program it.

    --
    Mod me down with all of your hatred and your journey towards the dark side will be complete!
    1. Re:What I'd like by corsec67 · · Score: 4, Interesting

      So, you want a Versioning file system? Just make sure you never let that run on /var.

      OSS is like capitalism: If you see a need, then make it and distribute it.

      --
      If I have nothing to hide, don't search me
    2. Re:What I'd like by Anonymous Coward · · Score: 0

      You can do your own if you need it, by either using a version control system for your files, or doing snapshot every X h/m/s.

    3. Re:What I'd like by XMode · · Score: 1

      Oh god yes.. +1 for this.

      User: Oh, I've just accidentally overwritten that hyper important document we had laying around in some random directory.
      Me: Dont worry, its on backup tape. I'll get the old copy for you.
      *20 mins of finding the right tape and restoring it*
      User: That's not the one. We made a whole bunch of changes to it this morning and they aren't there anymore.
      Me: *hair turns gray and falls out*

    4. Re:What I'd like by bendodge · · Score: 4, Interesting

      That leads to space-bloat.

      What I'd like are files with expiration dates. When I make up some twiddly chart or download some funny video, I keep it because I'll probably want it tomorrow or next week, but then I tend to forget to delete it later. It would be really cool if creating a user data file prompted you with a simple dialog specifying how long you want it. Common options like 1 Week, 1 Month, 6 Months, 2 Years, Forever would do most of the time, and an option to choose a custom date would cover the rest. When a file expired, it would be placed in some kind of psudo-Trash Bin that could be reviewed and emptied when you want more space.

      I'd also love something tag-based instead of hierarchy-based. For example, I store photos by Year > Month > Event, but sometimes I want to make another category for photos of a specific person. This means I either make duplicates or have to dig around to find things. If I could tag them with dates (that should actually be auto-generated from the EXIF), event, place, and people I could then just browse for files with a particular tag.

      Come to think of it, these ideas are both somewhat akin to how a human brain stores stuff.

      --
      The government can't save you.
    5. Re:What I'd like by Anonymous Coward · · Score: 2, Informative

      wayback, copyfs, and ext3cow are all fairly stable versioning filesystems for linux. I'm not sure if they let you stop non-root users from getting old versions, but I don't see why you'd want people to have to ask an admin to get old versions of their files?

    6. Re:What I'd like by fuzzyfuzzyfungus · · Score: 4, Funny

      Wouldn't the world be so, so, so much nicer if users understood that the actual importance of a document is reflected in how carefully they stored it, not how angry they get when you can't get it back?

    7. Re:What I'd like by Tubal-Cain · · Score: 3, Insightful

      It sounds useful, but I think it would turn out to be about as annoying as UAC. Better to keep your files organized and prune occasionally.

    8. Re:What I'd like by Fweeky · · Score: 1

      So, something like HAMMER, then?

      A HAMMER filesystem can be mounted with an as-of date to access a snapshot of the system. Snapshots do not have to be explicitly taken but are instead based on the retention policy you specify for any given HAMMER filesystem. It is also possible to access individual files or directories (and their contents) using an as-of extension on the file name.

      Released and stable in DragonFlyBSD 2.0, and obviously BSD licensed.

    9. Re:What I'd like by spazdor · · Score: 1

      Did you just undo your own modding?

      --
      DRM: Terminator crops for your mind!
    10. Re:What I'd like by darkjedi521 · · Score: 1

      If you want a versioning file system, go use VMS for a while. They've been doing it since at least the early '90s. Not sure if ADVFS has that feature or not, never got a chance to use Tru64.

    11. Re:What I'd like by Anonymous Coward · · Score: 0

      3 words:

      overly

      zealous

      admins :P

    12. Re:What I'd like by Anpheus · · Score: 1

      You're contorting what you want your FS to be with what you want your user interface to be.

      Most of what you ask for could be done automatically with fast indexing search and automatically created snapshots/versioning that reclaims space as needed.

    13. Re:What I'd like by eric2hill · · Score: 1

      Tagging and time are two *very* underused metaphors in today's computer programs. Ever tried to tag an email with both a project and subproject? It doesn't happen. Gmail sort of allows this, but won't show you all the tags for a specific conversation in the main display, just that it's tagged with the most recent tag.

      I'd love to have an application (or OS) that brought in and stored data (email, documents, whatever) in a better system than just a flat tree.

      --
      LOAD "SIG",8,1
      LOADING...
      READY.
      RUN
    14. Re:What I'd like by mlts · · Score: 1

      This exists in both NTFS (Windows Server 2003 and 2008 have the Previous Versions... feature, but it needs to be explicitly enabled), and on NetApp servers. The NetApp server snapshots are lifesavers. Users can just pull out an old version of a file out of the .snapshot directory (well, .snapshot/hourly.0 or what time they want) and not have to ask IT for a restore.

      I'd like to see Linux have this, as well as MacOS (before they move to ZFS) without requiring the use of Time Machine, but it does take up a good amount of space to store the snapshots.

    15. Re:What I'd like by Anonymous Coward · · Score: 0

      yeah aside from all the cash that capitalism usally brings ofc.

    16. Re:What I'd like by mlts · · Score: 1

      That is an interesting feature, and useful especially for tagging log files which have to expire for policy reasons. However, there is so much potential for abuse here that it may not be worth having.

      A disgruntled employee can tag all files on a Web server to expire at the end of the month, then have a manual script they run weekly to retag them so they exist until the end of the next month. Then, if they get fired, the files get purged, and the admin can blame the filesystem for trashing everything while snickering.

      Another issue could be the clock. A hacker could just skew the clock forward on a box a year, forcing destruction of all the objects tagged with an expiration date.

      If it has to exist, it should just be a piece of metadata as opposed to an active destruction object.

    17. Re:What I'd like by SanityInAnarchy · · Score: 1

      This is very, very tricky to get right.

      Consider that POSIX provides no transactional guarantee other than sync and fsync.

      How does the filesystem know when to create a new version? Should every byte ever written to the file be construed as a new version? If so, how does the admin figure out which precise version, out of the literally billions that would be created, is the right one?

      And how do you reasonably prune that wasted space?

      No, what you really want is version control software. Users too. At the very least, it ensures that each commit was deliberate, and represents a valid state.

      --
      Don't thank God, thank a doctor!
    18. Re:What I'd like by grasshoppa · · Score: 1

      Good point about /var. But the important thing is, user data is recoverable if necessary.

      And I would write it, except I suck at coding. :(

      --
      Mod me down with all of your hatred and your journey towards the dark side will be complete!
    19. Re:What I'd like by SanityInAnarchy · · Score: 1

      That seems overly difficult to maintain.

      Just treat the hard drive as a gigantic cache. When you run out of space, automatically delete files which haven't been so much as read in six months.

      To keep it sane, keep an explicit break between what you're using as cache, and what is to be kept forever.

      I do agree on tag-based, though I'd take it a step further and do search-based. Come to think of it, Spotlight (among others) already does this.

      --
      Don't thank God, thank a doctor!
    20. Re:What I'd like by steveha · · Score: 1

      I want it too. When I used to run Windows 98 with Norton Utilities, I had this ("Norton Protected Recycle Bin") and it was slick. See my 6-year-old comment:

      http://linux.slashdot.org/comments.pl?sid=39457&cid=4213113

      To do this properly, it should be in the file system. (The undelete tool and the expire-old-data tool can and should be in user space, but the data protection features should be in the file system.)

      steveha

      --
      lf(1): it's like ls(1) but sorts filenames by extension, tersely
    21. Re:What I'd like by BenBoy · · Score: 1

      Yeah, we could call it, I dunno, "time machine" or something.

    22. Re:What I'd like by EvanED · · Score: 2, Interesting

      How does the filesystem know when to create a new version? Should every byte ever written to the file be construed as a new version? If so, how does the admin figure out which precise version, out of the literally billions that would be created, is the right one?

      True, you may not be able to get it perfect, but you can get way more useful than nothing.

      For instance, many programs that work on small files (the kind you'd most want to version) don't keep the file open, and instead open the file, write to it, and close it each time you save. (Some will move the file to a backup name (eg file~), create a new file, and write to that. This is in part so you at least have the previous version in the ~ file, and in part to compensate for non-ACID file systems because there will always be at least one copy of either the old or new data at any given time.) So creating a version when a process calls fclose() is a reasonable thing to do.

      Sure, it won't work for programs that keep the file open and update it by seeking around and writing, but it will work for the vast, vast majority of the cases that at least I personally would want.

      And how do you reasonably prune that wasted space?

      What you see as wasted space I see as space going to a pretty darn good use.

      As for pruning, you'd have to be fairly clever. But you could create policies that specify how long to keep old versions, how many versions to keep in a certain time period, etc. You could also pay attention to how often a file is opened, how often old versions of files are opened, etc. There's a paper on a file system called Elephant written for FreeBSD where they discuss some ideas on how to do this.

      There's also a hypothesis that at least I would agree with that things recently saved are much more likely to be useful. If you remember the "last lecture" guy Randy Pausch, he did another talk about time management in which I think he told a story about an experiment they did where the goal was to clean up the lab. People were too hesitant to throw things out because "I might need it later," so they set up a rotation of the trash bins. Things you throw out would stick around for a week, which meant that you could still safely retrieve it. But if you didn't need it within a week, you almost certainly wouldn't need it, so it was still basically safe to do. It it helped a lot with cleanliness since people actually threw things away. (He said the biggest problems were when the janitors emptied trash bins at the wrong time.)

      Finally, you could restrict the versioning by the file size, so for instance it would only store past versions for files under a certain size, etc. If you set it to 200K or something that would cover almost all of the files that I would really like versioning on, and yet keep the extra space relatively low.

      No, what you really want is version control software.

      That may be what you want, but it's not what I want.

      At the very least, it ensures that each commit was deliberate, and represents a valid state.

      This is also a downside: it means you can't see anything but valid states.

      Personally, I would like it if things like text editors and word processors saved the entire edit history of documents, persistently. You could use a scroll bar to go through the history, saves would be marked with small tick marks, and deliberate commits would be marked with larger tick marks.

    23. Re:What I'd like by Anonymous Coward · · Score: 0

      The idea here is that we don't want or won't remember to do either of those.

    24. Re:What I'd like by phyreskull · · Score: 1
      I'm currently working on a tag-based file system (but it's not stable at the moment) and there are others that have been written, although I don't have any handy links. Part of the problem is deciding which features should be part of the core file system, and which can be done externally.

      For example, it would certainly be possible to tag files for automatic deletion with my current implementation - then you could just run a daily/hourly/weekly/whatever cron script that deletes all files with an "expires" tag greater than or equal to the current date/time.

    25. Re:What I'd like by EvanED · · Score: 1

      Windows and OS X both have this now sort of, but it's not quite "right" (at least for what I want): both Windows's Previous Versions and OS X's Time Machine do time-based snapshots, as opposed to triggering the backup from file system events (e.g. an fclose()). This means that changes made within the backup span (1 hr for Time Machine) won't be recorded.

      (Time Machine also has a couple other deficiencies compared to what I want. The main one is that you have to use an external drive for the backup, as opposed to being able to use the same partition as your main data store. This means that you have to manage free space on two pools instead of one.)

      Finally, Linux has some projects that do some of this stuff, both snapshot-style and true versioning file systems, but none has the clear support that Previous Versions and Time Machine have, and I doubt any have anywhere near the GUI integration of especially Time Machine.

    26. Re:What I'd like by jesterzog · · Score: 1

      What I'd like are files with expiration dates.

      Would you want it in the file system, though? I'd have thought it'd make more sense to have the meta data stored somewhere else and just have something to go and purge things every so often, or otherwise move them out of the way if you're the kind of person who doesn't like purging. We do this kind of thing on a much larger scale with our document management system at my work, although it's a well documented process where the Document Management people will go through and delete certain information every so often depending on where it's been filed and what the legal/policy requirements are for certain kinds of info to be kept around.

      It would be kind of cool to be able to sort by some kind of expiration date, or automatically hide stuff that's passed such a date (but be able to get it back if it's really needed).

    27. Re:What I'd like by rohan972 · · Score: 1

      ~/.gnome2/nautilus-scripts/auto-delete
      #!/bin/bash
      echo rm -f $1 | at now + \
      `zenity --title="Auto delete file" \
      --text="Number of weeks to keep file" \
      --entry` weeks

    28. Re:What I'd like by Kjella · · Score: 1

      If you don't remember to delete it, is there any reason to believe you'd remember what expirery you set on it? I imagine pretty fast you'll be looking for some file you can't find but you can't remember exactly where you put it, only to eventually discover it's been silently moved to the pseudotrash as you need it a little longer than you thought. Personally I would instead recommend a utility to show which directories take up space, I used a nice one some time ago but don't remember the name - realized I had some 50GB down a path I didn't realize. Also do you want speed or cleanliness? For the former, you want to find some big files quickly and delete those. For the latter, well I never got around to that. What's sorted is sorted, the junk is not and if I need a bigger landfill the easiest is getting another disk. With TB disks, unless it's obviously useless just make a new archive directory from time to time and move it there so you only have the new junk...

      --
      Live today, because you never know what tomorrow brings
    29. Re:What I'd like by Seth+Kriticos · · Score: 1

      Well, if you are good at something else, than make money with it and pay a coder to do it. It's called cooperation. You know, these guys also like to eat once in a while.

    30. Re:What I'd like by Anonymous Coward · · Score: 0

      OS X "Time Machine" is functionally the same as rsync, but it has a GUI.

    31. Re:What I'd like by grotgrot · · Score: 1

      What I'd like are files with expiration dates.

      I'd also like to see the opposite. Even ZFS administration is ridiculously complicated if you want to add randomly sized multiple drives over time. I'd like to be able to tag directories and files and say that they must be able to survive the failure of at least N drives. Under the hood I want my drives to always be full with the filesystem making as many copies as possible of my data. When I plug in a new drive it should automatically and almost instantly fill it. Then the filesystem implementation becomes a question of what to free next to accommodate new data.

    32. Re:What I'd like by deniable · · Score: 1

      Oh hell yes. I learned Fortran on VMS and learned to love the file versioning. If there was one thing MS should have stolen from VMS for Windows NT, it's file versioning. You could take crap like VSS and ignore it. Office apps wouldn't need to do stupid tricks inside the files. It would make my life simpler.

    33. Re:What I'd like by Macka · · Score: 1

      No, AdvFS doesn't have that. No Unix OS uses a versioning FS as far as I know.

      I used to use VMS about 18 years ago before switching to Unix, and I don't miss versioning. IMO it was a bit of a pain in the ass anyway. You just run out of space faster and then have to "purge" your old versions to claw space back, which runs the risk of deleting file versions that someone else might want to keep around. In other words, it's messy.

    34. Re:What I'd like by Ornedan · · Score: 1

      Personally I would instead recommend a utility to show which directories take up space, I used a nice one some time ago but don't remember the name - realized I had some 50GB down a path I didn't realize.

      Konqueror, one of the standard directory views does that.

      Or du.

    35. Re:What I'd like by grasshoppa · · Score: 1

      I would definitely contribute to any effort out there that I felt would bring this to linux.

      --
      Mod me down with all of your hatred and your journey towards the dark side will be complete!
    36. Re:What I'd like by grasshoppa · · Score: 1

      That leads to space-bloat.

      I have the space with which to bloat, on both my immediate storage system as well as my backups.

      Besides, it wouldn't need to store much, possibly just a diff and a timestamp.

      --
      Mod me down with all of your hatred and your journey towards the dark side will be complete!
    37. Re:What I'd like by ultranova · · Score: 1

      Most of what you ask for could be done automatically with fast indexing search

      There is no such thing. Not only is the damn thing always out of date in regards to the not-so-current files, but it also requires occasionally reading through the entire file list of the disk, the IO load of which is annoying on desktop.

      Of course a simple solution is to have a filesystem/userspace data relay which lets an userspace daemon to listen to filesystem events (file/directory created/deleted/modified), and update the database as they arrive.

      --

      Forget magic. Any technology distinguishable from divine power is insufficiently advanced.

    38. Re:What I'd like by richlv · · Score: 1

      but there is no need to have something like that in _filesystem_.
      this could be implemented with existing userspace software - browser could ask this question, then it would write this data to some control file that is checked by a cron job periodically. additionally, file manager could be used to manipulate these proprties.
      i don't think i'd use something like that, though. the thought of some files being automatically deleted... i'm sure some video file i have downloaded would get deleted a day before i'd get to watching it (because i would have not thought about having no time to see it during the set period ;) ).
      as an even easier method you could try out setting up directories like download/{day,week,month,year}, downloading things in those and just run a cronjob daily or so and delete files older than {period} in each directory... then you could see how useful such a feture is to you.

      --
      Rich
    39. Re:What I'd like by Isao · · Score: 1

      I would like transparent, administrator controlled, versioning...

      I suggest looking at how VMS did it. The 2.6 kernel already incorporates VMSs distributed lock manager (woot!)

    40. Re:What I'd like by andy.ruddock · · Score: 1

      I used a nice one some time ago but don't remember the name

      Filelight?

      --
      God: An invisible friend for grown-ups.
    41. Re:What I'd like by jez9999 · · Score: 1

      Pthhhlbbt.

      I have a mere 80GB hard drive and haven't filled it with all the crap I download yet. With 500GB drives coming out, pruning will soon be unnecessary (at least, owing to lack of space).

    42. Re:What I'd like by the+99th+penguin · · Score: 1

      Windows:
      WinDirStat - http://windirstat.info/

      Mac OS X:
      Disk Inventory X - http://www.derlien.com/

      Both are released under the GPL.

    43. Re:What I'd like by foniksonik · · Score: 1

      Lots of people write simple bash scripts to do what you describe. The use find with some options to get all files modified past a week or month or year and move them.... obviously they only use it in a User folder (don't want to mess with apps or system).

      Maybe try that before you start hacking the filesystem itself?

      --
      A fool throws a stone into a well and a thousand sages can not remove it.
    44. Re:What I'd like by the_olo · · Score: 1

      Most userspace apps, like OpenOffice, orderly open a file, write to it, then close it. close() is a potentially perfect place for the required hooks that solve the problem.

      BTW, you mentioned fsync and sync. It seems that you fail to understand the problem. It's quite irrelevant when the data hits persistent storage, we're not discussing recovery strategies in case of hardware of system malfunction. We're not discussing concurrent writes, atomic transaction and all this mess.

      We're talking about retrieving older versions of data which has been destroyed by perfectly valid, yet unintended/unwelcome user space operations, performed by a single user to a non-shared file (well, not shared in the sense it's not continuosly accessed by multiple parties - it can of course lie in a shared directory where it's written to by multiple people occasionally, but the chance of them writing at the same moment is minimal).

      Concerning amount of space used for this feature - I think it should be configurable (e.g. maximum and minimum number of historical versions both per single file and globally per filesystem; maxmimum and minimum amount of used space - likewise).

      Ten years ago I were using files shared from a Novell NetWare file server and the Windows explorer-integrated context menu had the tools to restore (salvage) their older versions, and the history it provided was clearly automatically purged, but it provided reasonably many backdated versions. There was also a command to manually purge older versions in that context menu.

      Strange that nobody here refers to that Novell solution, it's been around for quite a long time now.

    45. Re:What I'd like by bendodge · · Score: 1

      My idea is that the file in pseudotrash will still be there in a dir listing (hence why it's 'pseudo'trash). That way you can still find it, but it's easy to delete when you want to clean.

      --
      The government can't save you.
    46. Re:What I'd like by Medievalist · · Score: 1

      So, you want a Versioning file system? Just make sure you never let that run on /var.

      Why not? I admin'd and coded on VMS systems with native versioning file systems for decades, and I don't see any problem with running /var on a versioning fs. It's not like versioning is some out-of-control monster that will eat your head if you aren't armed with a bullwhip and a chair... it will do what you tell it to do, appending to files does not have to create a new version, and version management is trivial compared to the amount of file management and backup make-work it obviates.

      People who haven't used a strong versioning system have no idea how wonderful it is (although I expect we'll see some folks telling me different any minute now). Most *nix filesytems are really very primitive; 30 year old paradigms like ugo-rwxrwxrwx and nesting ACLs.

    47. Re:What I'd like by bendodge · · Score: 1

      They wouldn't be automatically deleted, they'd just be put into a bin where you could delete them when you wanted space. As I mentioned above, I imagined the file still appearing in both the original dir and the pseudotrash. That way you could find it or purge it.

      I'd want a lot of things to be deleted, not just downloads. For instance, I'll often install some widget or other that falls into disuse. I eventually uninstall it, but it almost always leaves a few settings or cache files around.

      --
      The government can't save you.
    48. Re:What I'd like by Shadowruni · · Score: 1

      Oh well someone has to say it. Windows has this already and so does MOSS...

      (Ow my karma....)

      --
      "Chinese Amazons, power armor, laser swords.... things just meant to be." - Shampoo, A Very Scary Bet
    49. Re:What I'd like by Medievalist · · Score: 1

      That leads to space-bloat.

      Not necessarily. With a really good versioning paradigm, quite the opposite, in practice.

      In a solid versioning system you always know what you can delete. In a non-versioning system everybody has to assign a non-conflicting name or a separate storage space for prior versions of files, and humans don't use consistent naming, so one guy calls his backup files file.BAK and another calls 'em file.SAV1 with increasing numbers, etc.

      In a VMS versioning system, for example, programmers code all day, then go home and the nightly backups run, and come in next morning and their login scripts execute purge/keep=2 on their code folders, and presto they have the two latest versions live with all preceding versions available from backup.

      There are many other advantages, but the biggest one is preventing the duplicate file clutter that humans generate on non-versioning systems.

    50. Re:What I'd like by Medievalist · · Score: 1

      No, AdvFS doesn't have that. No Unix OS uses a versioning FS as far as I know.

      I used to use VMS about 18 years ago before switching to Unix, and I don't miss versioning. IMO it was a bit of a pain in the ass anyway. You just run out of space faster and then have to "purge" your old versions to claw space back, which runs the risk of deleting file versions that someone else might want to keep around. In other words, it's messy.

      Why didn't you just implement a daily purge in your LOGIN.COM file? It'd only be three of four lines of DCL tops (just to make sure it only happened on the first login of the day).

      If purging causes you to delete file versions someone else wanted to keep, you're doing it wrong.

      Ghods, I miss versioning... it saved so much time and typing when writing large complex systems. I watch *nix programmers fiddling with CVS, SVN, etc. and I think how much easier OS-based versioning was... whatever floats your boat, though, I guess.

    51. Re:What I'd like by Anonymous Coward · · Score: 0

      About the hierarchy... In most modern file systems you can create links or symbolic links and have things appear in a number of "branches" of the directory structure. I could see: /pictures/date/2008-09-05/*
      linked to: /pictures/events/Bob_21st_birthday/*
      and some of the images linked to: /pictures/people/Bob

      Given a decent script or program, I could see this being a simple interface that reads the time date information on the image, show the picture and asks for the event and a list of people or tags. I don't see tagging as a function of the FileSystem, but of the software (like most music players "tag" with artist, album, composer, etc.).

    52. Re:What I'd like by Tubal-Cain · · Score: 1

      Sure, you won't run out of space. But actually finding a useful file is easier

    53. Re:What I'd like by Anpheus · · Score: 1

      I was going to reply with the obvious solution that the filesystem should make its journal of recent changes available to applications, but... You did that for me.

      Your solution is, as I alluded to earlier, filesystem independent. It's a feature that you can tack on any filesystem easily if you chose to.

    54. Re:What I'd like by EvanED · · Score: 1

      For Linux, KDirStat: http://kdirstat.sourceforge.net/

      Basically the same as WinDirStat, except actually perhaps older and not quite as pretty. I am a tremendous fan of both programs.

    55. Re:What I'd like by SanityInAnarchy · · Score: 1

      Most userspace apps, like OpenOffice, orderly open a file, write to it, then close it. close() is a potentially perfect place for the required hooks that solve the problem.

      That's a big assumption, which will fall apart horribly if the user happens to be using something like Amarok on SQLite, or whatever the hell Firefox does for its own storage.

      It's quite irrelevant when the data hits persistent storage,

      It is, however, relevant to when the filesystem decides to make a snapshot.

      Concerning amount of space used for this feature - I think it should be configurable (e.g. maximum and minimum number of historical versions both per single file and globally per filesystem; maxmimum and minimum amount of used space - likewise).

      Which means we can now consider how much reliability we then lose in consideration of space.

      The reason I mention fsync, and transactions, and the whole concurrent write mess, is that solving this problem solves the other. If we could identify with certainty where a transaction is, that would be an ideal place to snapshot.

      --
      Don't thank God, thank a doctor!
    56. Re:What I'd like by michaelmuffin · · Score: 1
      venti might fit your needs. it preserves permissions too, so any user can grab files from backup, not just root.

      http://en.wikipedia.org/wiki/Venti

    57. Re:What I'd like by drinkypoo · · Score: 1

      Of course a simple solution is to have a filesystem/userspace data relay which lets an userspace daemon to listen to filesystem events (file/directory created/deleted/modified), and update the database as they arrive.

      The other part to this solution is to have an intelligent search system which, when you perform a search that might match documents in the cache to-be-indexed, will make some guesses about which uncached documents are most likely to contain your search terms (including if the old version of the document has them) and then reindexes them as part of your search.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    58. Re:What I'd like by drinkypoo · · Score: 1

      Here's a really stupid idea: why not just allocate free blocks with a LRU scheme? If you don't have a versioning filesystem, this would at least permit the undeletion of the most recently removed files. You could use a much more complicated system including scoring up files in /tmp and the like, of course. But why does no operating system in existence seem to do this? (And where's my versioning filesystem? Is it in my flying car?)

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    59. Re:What I'd like by drinkypoo · · Score: 1

      Personally I would instead recommend a utility to show which directories take up space, I used a nice one some time ago but don't remember the name

      du -sk * | sort -rn | more

      and get off my lawn.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    60. Re:What I'd like by grasshoppa · · Score: 1

      No, I don't want version control software. I use version control software for my source code, it serves an entirely different purpose than what I have in mind.

      My users, and users in general, don't know how to use version control software. Training would be a pain when their tech level means they forget how to login, randomly.

      A versioning file system is exactly the right solution for this environment. Further, we already have the concepts and solutions well developed individually. We'd only need to pull them together in to a file system.

      Create a file. Modify and save the file. On save, the file system does a diff, stores it with a time stamp.

      Of course this type of file system isn't exactly the most useful when dealing with large files, or /var, or in a variety of other circumstances, but it's dead useful when considering normal user data.

      --
      Mod me down with all of your hatred and your journey towards the dark side will be complete!
    61. Re:What I'd like by badkarmadayaccount · · Score: 1

      Me too. BTW, why does this have to be purely in the filesys? Forget the hierarchical structure, just make a monster fast flat fs, and implement dirs, versioning, snapshots, multiple views, etc. on a small db on top. That way, you could even make a high level raid (think about a fancy '/' unionmount), or other cool shit. Crypto support goes under the fs, that way you won't even see inodes that you shouldn't see, so you get stego. I still haven't figured out where compresion goes.

      --
      I know tobacco is bad for you, so I smoke weed with crack.
    62. Re:What I'd like by jonadab · · Score: 1

      > I would like transparent, administrator controlled, versioning.

      Amen.

      There are only two filesystem features I care about that FAT32 doesn't have.

      One is symbolic links, which ext2 has had for, approximately, the entire history of the filesystem. (I wouldn't be surprised if the original extfs had it, maybe even minixfs too, but I wasn't using Linux yet back then.)

      The other is automatic versioning. I've seen a rudimentary form of this in action on VMS, and even with the limited features of that implementation it's cooler than liquid helium, no fooling. If it were actually done *right*, to where the system administrator could do things like set the number of versions to keep for entire subdirectory trees (e.g., don't keep multiple versions in /var/log, but keep up to six versions in /etc, keep up to 50 versions in /home, and so on),

      I suppose journaling is also worthwhile, but it's nowhere near as cool as automatic versioning.

      As far as all that stuff about optimally storing small files, in an era when the price difference between a 100GB drive and a 300GB drive is less than the cost of shipping, does anyone actually care, anyone *other* than the guys writing filesystem code? I mean, it sure doesn't *hurt* anything if small files are stored efficiently, but that's not going to be the factor that makes up my mind as to which filesystem I want to use, I can tell you that for free.

      Versioning, on the other hand... I would give up any number of other features to gain versioning. I would *certainly* be willing to give up journaling to gain versioning (so, if there were a non-journaling filesystem that did automatic versioning, yeah, I'd use it). I'd give up hard links in a heartbeat. I *might* give up symbolic links for versioning, though that's a tough call, because symlinks are also incredibly useful.

      --
      Cut that out, or I will ship you to Norilsk in a box.
    63. Re:What I'd like by jonadab · · Score: 1

      I wouldn't mind being *able* to set an expiration date on files, and then being *able* to run a find command later and get a list of all the expired ones.

      But it would have to be optional, and it would be nowhere near as useful as versioning.

      Seriously, you cannot understand how amazingly excellent automatic versioning is until you have used a system that has it. It is at least five times as cool as having an OC3 to your bedroom.

      Here's an example for programmers. You know how your code is cluttered with stuff you've commented out because you weren't certain you wouldn't need it back, and then you never bothered to delete it once you had the revised version working? You know how periodically you go through and find all the old commented-out bits and evaluate whether you still want them or should delete them now? With automatic versioning, you don't have to comment most of that stuff out in the first place. You can just hit save and then delete it.

      Here's one for system administrators. You know how, when you do a dist-upgrade, apt prompts you for every configuration file in /etc that you've ever edited, and asks you whether you want to keep your edited version or the new version, and you don't remember when or why you edited it, and looking at the difference is meaningless because any real changes get lost in a sea of slightly-changed comments, and you have to choose? With automatic filesystem versioning, you can just tell it to go ahead and install the new file, and your edited version will STILL BE THERE to refer back to.

      Versioned filesystems rock. If one ever makes it into the mainstream Linux kernel, I will be jumping up and down for a week.

      --
      Cut that out, or I will ship you to Norilsk in a box.
    64. Re:What I'd like by jonadab · · Score: 1

      > I don't see why you'd want people to have to ask an admin to get old versions of their files?

      I can easily imagine wanting the users to have to ask an admin to get the settings changed for how many versions are kept, but that's not exactly the same thing.

      --
      Cut that out, or I will ship you to Norilsk in a box.
    65. Re:What I'd like by jonadab · · Score: 1

      I've used VMS, and the versioning is really nice. I can think of any number of ways it could be *better*, but compared to not *having* versioning, the VMS implementation is Really Nice To Have.

      There are some other really cool things about VMS, too. I think anyone who's doing serious work in operating systems is really missing something if they haven't looked seriously at VMS. It's got a lot in it that's worth knowing about. Moreso, I would say, than BeOS.

      Not that I'm saying I'd want a point-for-point clone of VMS, or anything. In addition to some cool things, it's also got some really ugly things, which should preferably *not* be copied into other systems.

      --
      Cut that out, or I will ship you to Norilsk in a box.
    66. Re:What I'd like by jonadab · · Score: 1

      > How does the filesystem know when to create a new version?

      This isn't even hard. When the file is opened for writing, that starts a new version.

      --
      Cut that out, or I will ship you to Norilsk in a box.
    67. Re:What I'd like by jonadab · · Score: 1

      > What you see as wasted space I see as space going to a pretty darn good use.

      Indeed. With the price difference between a 100GB drive and a 200GB drive being less than the cost of shipping, I think I can afford to keep a few copies of anything so important that I edit it regularly.

      > As for pruning, you'd have to be fairly clever. But you could create
      > policies that specify how long to keep old versions

      Not how long, but how many. You want a pruning rule that leaves you with geometrically increasing ages on the versions going backward. That is, the most recent versions are adjacent, but the further back you go the further apart the retained versions get.

      --
      Cut that out, or I will ship you to Norilsk in a box.
  5. Re:BTRFS? REALLY? by pizzach · · Score: 1

    No I did not. Thanks to you, now whenever I am eating or drinking at the computer, everything will taste like land-o-lakes butter faces now. Bleh!

    --
    Once you start despising the jerks, you become one.
  6. Obligatory by Anonymous Coward · · Score: 1, Funny

    Nice, but does it murder your wife?

    1. Re:Obligatory by PunkOfLinux · · Score: 1

      No, that's one of the listed features, actually!

              * Extent based file storage (2^64 max file size)
              * Space efficient packing of small files
              * Space efficient indexed directories
              * Dynamic inode allocation
              * Won't kill your family
              * Writable snapshots
              * Subvolumes (separate internal filesystem roots)
              * Object level mirroring and striping
              * Checksums on data and metadata (multiple algorithms available)
              * Integrated multiple device support, with several raid algorithms
              * Online filesystem check
              * Very fast offline filesystem check
              * Efficient incremental backup and FS mirroring
              * Online filesystem defragmentation

  7. WHATS WRONG WITH RIESERFS? by Anonymous Coward · · Score: 0, Funny

    This is what it would be like, if the majority of people were athiests.
    ATHIEST KID: Mom, I'm going to go fuck a hooker.
    ATHIEST MOM: Okay, son.
    ATHIEST KID: Afterwards, I'm going to go smoke pot with my friends, since it's "not addictive."
    ATHIEST MOM: Okay, come home soon!

    The athiest kid leaves the room. The father comes home from work several minutes later.

    ATHIEST DAD: Hey!
    ATHIEST MOM: Hi, honey! I'm pregnant again. I guess I'll just get another abortion, since "fetuses don't count as human life."
    ATHIEST DAD: Okay, get as many abortions as you want!
    ATHIEST MOM: Oh, and don't go in the bedroom.
    ATHIEST DAD: Why not?
    ATHIEST MOM: There are two gay men fucking eachother in there.
    ATHIEST DAD: Why are they here?
    ATHIEST MOM: I wanted to watch them do it for awhile. They just aren't finished yet.
    ATHIEST DAD: Okay, that's fine with me!

    Suddenly, their neighbor runs into the house.

    ATHIEST NEIGHBOR: Come quick, there's a Christian outside!
    ATHIEST MOM: We'll be right there!

    The athiest couple quickly put on a pair of black robes and hoods. They then exit the house, and run into the street, where a Christian is nailed to a large, wooden X. He is being burned alive. A crowd of athiests stand around him, all wearing black robes and hoods.

    RANDOM ATHIEST: Damn you, Christian! We hate you! We claim to be tolerant of all religions. But we really hate your's! That's because we athiests are hypocritical like that! Die, Christian!

    THE END

    Scary, isn't it?

    1. Re:WHATS WRONG WITH RIESERFS? by buswolley · · Score: 0, Redundant

      Interestingly, I've never heard of an Athiest.

      --

      A Good Troll is better than a Bad Human.

    2. Re:WHATS WRONG WITH RIESERFS? by daVinci1980 · · Score: 0, Offtopic

      You were modded offtopic and I will be too, but I feel I have to respond.

      So if atheists ran things we'd legalize legitimate behaviors between consenting adults and would stop pretending that religious morality has a place in government?

      Except for the disgusting lynching at the end, I fail to see what's wrong with the rest of your post.

      --
      I currently have no clever signature witicism to add here.
    3. Re:WHATS WRONG WITH RIESERFS? by Jesus_666 · · Score: 0, Offtopic

      It's someone who is athier than everyone else within the reference frame. Given that most people aren't very athy at all it's not really difficult to be the athiest for moderately sized reference frames.

      --
      USE HOT GRITS WITH STATUE OF NATALIE PORTMAN (NAKED AND PETRIFIED)
    4. Re:WHATS WRONG WITH RIESERFS? by Idiomatick · · Score: 0, Offtopic

      I know this is just flame bait but incase anyone reads it. Hilter was a christian but he didn't attend church world conquering is busy work i guess. Stalin Was raised to be a priest, hated became athiest, then later became a born again devoted catholic. Obama is obviously a christian even if his parents don't practice.

    5. Re:WHATS WRONG WITH RIESERFS? by SanityInAnarchy · · Score: 0, Offtopic

      Well, it was another copy/paste troll. And I found quite a lot more wrong with it than the lynching -- which was downright comical. Black robes and hoods?!

      Way up at the beginning, "ATHEIST KID" is probably meant to be a teenager. I can see where some people might have a problem with that -- especially a parent. If nothing else, there's STDs, and there's the working conditions of the hooker.

      Marijuana is almost certainly not as bad as it's made out to be, and I would say the impact of outlawing it is worse than the drug itself. But from what my parents tell me of their own experience (so to speak), it absolutely has a negative effect on memory. You're going to tell me that atheists, who don't believe (or don't know) that they're going anywhere after they die, are going to risk the health of their one and only brain for a high?

      Some would, I'm sure. Some Christians would (and do) as well. But I doubt there's a correlation, except in extreme cases -- like Rastafarians, who consider it a religious experience.

      And, well, abortions are expensive, potentially dangerous, and time consuming. Even if you know nothing about atheists -- and hey, there's no reason an atheist can't be pro-life -- why would anyone choose a $500 abortion over a $1 condom?

      Even if you assume atheists are inherently amoral (NOT true), why would you also assume they're complete morons?

      --
      Don't thank God, thank a doctor!
    6. Re:WHATS WRONG WITH RIESERFS? by xstonedogx · · Score: 0, Offtopic

      If there were an atheist authority, I would totally report you to it.

    7. Re:WHATS WRONG WITH RIESERFS? by Anonymous Coward · · Score: 0

      W T F ?

      I'm not even sure which level of reverse trolling we're at here or wtf you're trying to do, but that whole post is just stupid.

    8. Re:WHATS WRONG WITH RIESERFS? by sumdumass · · Score: 0, Offtopic

      As for the abortion issue, that is actually the problem I have with abortions. There are literally dozens of effective forms of birth control that can and do stop the majority of unwanted pregnancies. You have condoms, spermicidal foams, the pill, implants, the ping pong ball that a women put between her knees and doesn't let it drop to the floor (abstinence), and many others including snipping the tubes or nuts. All of which are cheaper then an abortion and doesn't require the destruction of life.

      Of course it take some personal responsibility to use the method but I sort of think that if you can't be responsible in that tinny way, you probably shouldn't be having sex. Now I do understand that sometimes birth control doesn't work. I'm not sure I can support an abortion in this manor because there is always adoption and people will often cover all the expenses of giving birth but it isn't my decision to make. I just really have a problem with Abortion as a form of birth control.

      As for assuming Atheist as complete morons, I wouldn't make that a blanket statement but I do know a few that would likely fit that classification. And the worse part about it is that they actually think they are somehow elite in the process. It doesn't take long to look around and see them. But then again, I wouldn't limit the amount of morons to just Atheist either.

    9. Re:WHATS WRONG WITH RIESERFS? by Fred_A · · Score: 0, Offtopic

      It's someone who practices Atheism

      One doesn't "practice" atheism, you silly troll.

      --

      May contain traces of nut.
      Made from the freshest electrons.
    10. Re:WHATS WRONG WITH RIESERFS? by Fished · · Score: 1

      Disclaimer... I am a Christian. So, go ahead and take me with a grain of salt if you need to. Just for the record, I neither approve nor agree with the OP's characterization of atheists. Sadly, religious belief (or the lack thereof) seems to have little impact on a persons morals, at least in this day and age.

      It seems to me that this sort of discussion of abortion evades the issue. Either a fetus is a human life, entitled to rights (human rights even) or it's not. It's not a question of whether abortion is a method of birth control, it's not a question of "privacy rights", it's not even a question of "rape and incest." It's a question of: "when *exactly* does a fertilized egg become a human being, entitled to full protection of the law?" Think about it for a second: would you (or anyone) seriously complain that infanticide was a sloppy and expensive method of birth control, were it legal? This isn't a hypothetical example: infanticide, especially of girl children, has been widely practiced in human history and is not uncommon in some parts of the world even today (e.g. China.)

      For a long time, I was pretty wishy-washy on the subject of abortion, and played around with everything from viability on down to implantation as the point at which a fetus becomes human. Then my wife had identical twins. Even when they were in the womb, my daughters (who have virtually identical genetic codes) showed different personalities--one would react to Jazz and Rock (by kicking), the other would react most strongly to Classical. Now they're 7, and they're near-opposites in everything but appearance. To me, the simplest explanation for this is that these 2 infants had a soul and this "soul" was present from the first moment it could possibly be detected.

      So, I'm categorically opposed to legal abortion, just as I would be categorically opposed to legal infanticide. This isn't because I'm squicked by the sexual aspects of abortion--I'm not opposed, for example, to the widespread availability of birth control--but because I am honestly convinced by reason and observation (not religious dogma) that every abortion kills a baby. And I truly wish that those who call themselves "pro-choice" would give a lot more sober consideration to exactly what the choice they advocate entails.

      --
      "He who would learn astronomy, and other recondite arts, let him go elsewhere. " -- John Calvin, commenting on Genesis 1
    11. Re:WHATS WRONG WITH RIESERFS? by McKing · · Score: 1

      I'm glad that you came up with that rational and reasonable argument for why you are opposed to abortion.

      Disclaimer: I am a Christian as well and I do not believe in abortion. I am also an American soldier who has served my country and actually fought to uphold the rights and freedoms that a lot of Americans take for granted.

      The pro-choice camp advocates *choice*. That's it. Most don't and never will advocate abortion. "Pro-choice" != "pro-abortion". Most pro-choice groups advocate choices that range through all of the options available pre-conception: abstinence, contraceptives, etc..., and the post-conception options of adoption or abortion.

      The issue is that people who are pro-choice would like those out there to make up their own minds about the issue, to take the personal responsibility that such a choice entails, and not to be limited by moralistic government control. I find it anathema to the (historically) conservative Republican tenets of small government controls and personal responsibility that one of the primary issues that the "single-issue" voter uses to determine their vote is the issue of abortion. I also find it funny that Sarah Palin actually used the word "choice" when she announced that she had "chosen" to keep her Down's syndrome baby, yet she is against Roe v. Wade....

      I don't and never will suggest abortion to anyone, and I will counsel them against it, but ultimately, it is the woman's (and her partner's) personal decision and always should be.

      --
      If only "common" sense was actually that common...
    12. Re:WHATS WRONG WITH RIESERFS? by Fished · · Score: 1

      I really think that this is a pretty cowardly dodging of the issue. Imagine for a moment someone saying: "I don't and never would suggest exposing of an infant to any parent, and I would counsel them against it, but ultimately, it is the woman's (and her partner's) personal decision and always should be."

      This is exactly what I'm railing against here--you are dodging the key issue, and in fact are begging the question! If a fetus is an infant, then it is absurd to claim that ending it's life is just a "choice". If a fetus is an infant, abortion is no choice, it's murder. And we should no more leave that up to the individual parent than we would allow individual parents to neglect, abuse, or murder any other infant. By leaving this decision up to the individual mother (and, for the record, Roe v. Wade gives the father no voice at all, which is also wrong) you are in effect saying that there is some unexplained and inexplicable difference between a fetus and a baby, because you are treating the two cases differently. And this is absurd under any moral philosophy worthy of the name--that is, any moral philosophy capable of giving actual moral guidance. (N.B. I do not consider the currently popular utilitarian ethics to be a worthy moral philosophy. I can get along with deontological ethics of any sort, but consequentialism gives me gas.)

      The bottom line is this: until someone can offer me some reasoned argument for calling the same 5 lbs. of protoplasm "infant" 5 minutes after a delivery (or, more chillingly, a failed abortion) and "fetus" 5 minutes before, here I stand. God help me, I can do no other.

      Incidentally, you seem to assume I'm a Republican. Actually, I'm holding my nose and voting for Obama. But what should have been an easy decision has been made hard by the failure of either party to be consistently "pro-Life" (i.e. anti-abortion, anti-capital punishment, and anti-military-adventurism.)

      --
      "He who would learn astronomy, and other recondite arts, let him go elsewhere. " -- John Calvin, commenting on Genesis 1
    13. Re:WHATS WRONG WITH RIESERFS? by SanityInAnarchy · · Score: 1

      Even when they were in the womb, my daughters (who have virtually identical genetic codes) showed different personalities--one would react to Jazz and Rock (by kicking), the other would react most strongly to Classical.

      Firstly: You're mistaking a reaction to a personality. All you understand is that you did something (played rock, or played classical), and your infants reacted (kicked). Everything after that is an assumption, unless your daughters can actually tell you that they remember hearing one or the other.

      To put another way: When I play Half-Life, the Marines react one way when I shoot them (by shooting back), and scientists react another way (by screaming and running away). Similarly, they react differently when I walk up and press the "use" key -- the Marine kicks at me (because I'm close enough that he can't shoot me), and the scientist starts following me -- or stops, if he was.

      Yet these are machines. Less than that, they are programs. Their whole existence can be distilled down to a few lines of code.

      What I am trying to say is: We don't know. No one can say for sure even whether adult human beings have a soul, much less a fetus. We cannot tell whether

      What we do know is that aborting the pregnancy will save the mother a great deal of pain, and sometimes, a great deal of danger. (Keep in mind: McCain is unwilling to make an exception for the life and health of the mother.)

      We kill for far less than that. We kill cows, chickens, turkeys, fish, and other things for no reason other than that we like the way we taste. (After all, we can survive just as well on a vegetarian diet.) Arguably, most of the things we kill for food have more personality than a first-trimester fetus.

      Now they're 7, and they're near-opposites in everything but appearance.

      Which again proves nothing -- this is actually quite common among twins.

      every abortion kills a baby.

      Go find out what a fetus looks like at various stages of development. I'll bet your daughters were at least in the second trimester when you felt them kick.

      To say that every abortion kills a baby is a bit like being outraged that every period kills an egg.

      --
      Don't thank God, thank a doctor!
    14. Re:WHATS WRONG WITH RIESERFS? by daVinci1980 · · Score: 1

      To say that every abortion kills a baby is a bit like being outraged that every period kills an egg.

      Greatest line ever.

      --
      I currently have no clever signature witicism to add here.
    15. Re:WHATS WRONG WITH RIESERFS? by daVinci1980 · · Score: 1

      First off, I disagree with your characterization of the GP as a coward. He's hit the nail right on the head: this isn't an issue of pro-abortion, it's pro-choice. As in, "I don't know the circumstances of your situation, so I shouldn't be sticking my nose into your business to make a decision for you. You should have a choice." You can call it cowardice, but then we're just talking past each other. Maybe we are anyways.

      I'm surprised you don't agree that viability is a reasonable standard: if the fetus cannot survive on its own, that's not a baby. It's a parasite. Does your tapeworm have a right to live because it's inside and a part of you? They're both carbon-based lifeforms and have an equal chance of surviving if removed from your body before they're ready.

      Moreover, there's a difference between "counseling against" and "making illegal." In general, I don't think most people who have abortions do so as a form of birth control. Some of my very close friends are ob/gyns, and the number one cause they wind up performing any form of abortion is because the baby isn't viable (there are literally hundreds of reasons this can happen) and the mother makes the decision to not deliver a full-term, stillborn child. Make no mistake: these fetuses would've been dead long before they became a babies.

      Let me ask you this: if we made abortion illegal, and a woman went and had an illegal abortion, do you believe she should be punished? Should we throw her in jail?

      Without a punishment, the criminalizing of abortions would just serve to make you feel better and wouldn't actually stop someone who was really determined to have an abortion.

      --
      I currently have no clever signature witicism to add here.
    16. Re:WHATS WRONG WITH RIESERFS? by SanityInAnarchy · · Score: 1

      Since no one else seems to have noticed... Wow, I must've been really tired when I posted this. (I'd say drunk, but that's impossible...)

      No one can say for sure even whether adult human beings have a soul, much less a fetus. We cannot tell whether

      I can't tell whether I intended to finish that sentence or not. Whoops.

      We kill cows, chickens, turkeys, fish, and other things for no reason other than that we like the way we taste.

      The way they taste!

      --
      Don't thank God, thank a doctor!
    17. Re:WHATS WRONG WITH RIESERFS? by Fished · · Score: 1

      Abortion as a means of birth control is not done by "stock" OB-Gyn's. It's done in abortion clinics. And, for what it's worth, the statistics I've been able to find place abortions due to "fetal health" as little as 1% of abortions, while abortions due to "personal choice" (i.e. abortion as a means of birth control) weigh in at between 78-98%.

      Simply put, as usual, the "pro-choice" crowd wants to make policy on the basis the marginal cases. You want to focus on fetal health, when all statistics show that to represent on only a vanishingly small fraction of the abortions done. Others want to focus on rape & incest, which represent and even smaller fraction. The fact is, no matter where you look, over 90% of abortions are done as a form of birth control.

      As for the fetus being a parasite... if you truly believe that, I can only say that that is sick and disgusting. I would also observe that people could (and have!) say much the same thing about a girl infant, for whom the father would be obligated to come up with a dowry. I propose a simple principle: parents have a choice whether or not conceive a child. If they choose to conceive a child, either by intent or by negligence, then they bear responsibility for that child from the moment of their choice until that child reaches it's majority.

      Another reason the "fetus as parasite" notion is philosophically absurd... Let's suppose a man impregnates a woman. As the law currently stands, he has absolutely no rights or responsibility with regard to said fetus until it is born. Then, suddenly, he is held legally responsible for 50% of that childs support for the next 18 years. How can a man be responsible for his sperm, then NOT be responsible for the fetus, then be responsible for the infant that results? This is simply absurd, because it breaks continuity.

      I remember some years ago seeing a made-for-tv movie about the original host of "Romper Room", who had a Thalidomide Baby and a pre-Roe v. Wade abortion. The abortion apparently became something of a cause celebre, and a doctor on the ethics board o the hospital where the abortion was to be performed made a comment (in the movie) that I will always remember: "What kind of moral cause is abortion!?"

      I ask you this same question... what kind of moral cause is a "choice" that ends a human life? In a world where desperate people are chasing all over the world to find babies to adopt, what sort of moral crusade is it to ensure the right to kill America's so that a careless woman will be saved from the fairly minor consequences of her own negligence? Are 3-7 months of pregnancy for the mother really worth less than a human life?

      Man, I still can't get over that parasite line. Dude, you disgust me.

      --
      "He who would learn astronomy, and other recondite arts, let him go elsewhere. " -- John Calvin, commenting on Genesis 1
  8. I can't believe... by arrenlex · · Score: 5, Funny

    Butter FS? Are you kidding me?

    Here is your first official list of jokes. Please contribute.

    1. You're still running ext4? I can't believe it's not ButterFS!
    2. But will it run on toast?
    3. Will fsck be renamed to butterknife?
    4. If your system overheats will your filesystem melt?
    5. If you use ButterFS too much, will it turn into FAT?
    6. If you leave ButterFS on your volume too long, will your hard drive start to reek?
    7. Will the next version of ButterFS be called GoatButterFS, just like the next version of Leopard is Snow Leopard?
    8. "Tough" notebooks will never have their hard drives formatted with ButterFS, because if you dropped them, they would always land hard drive down.
    9. When you submit your dead ButterFS hard drive to a data recovery centre, will they have an intern lick it to get the data off instead of putting it under a read head?

    These are getting kind of desperate -- your turn now.

    Honestly, what is it with FOSS and crappy names? (looking at you, gimp)

    1. Re:I can't believe... by Anonymous Coward · · Score: 2, Funny

      Honestly, what is it with FOSS and crappy names? (looking at you, gimp)

      All the good ones are trademarked. And it's The Gimp, to you, mister!

    2. Re:I can't believe... by penguinchris · · Score: 3, Funny

      When your hard drive fails and you hear those awful noises, you can say it's churning butter.

    3. Re:I can't believe... by Anonymous Coward · · Score: 0

      I think it started with the recursive acronym GNU. That set the stage for the nerdy animal acronym based naming schemes and their variations that are used throughout FOSS.

    4. Re:I can't believe... by Anonymous Coward · · Score: 0

      I really don't see why they chose "Butter FS", when it could just have easily been "Better FS". A bit snoody, but also a whole lot less ridiculous.

    5. Re:I can't believe... by Ivlis · · Score: 1

      Butters, what the hell are you doing?

    6. Re:I can't believe... by Anonymous Coward · · Score: 5, Funny

      These are getting kind of desperate -- your turn now.

      Yeah, you're spreading yourself a bit thin.

      • I hear some of the features in btrfs have been refined from ext3cow.
      • I touch'd a file on a btrfs disk, and now it's sticky!
      • I hear the standard block size of btrfs is 8 oz.
      • How can I make a business case for btrfs? I'm all for introducing new tech, but my boss only cares about how it will affect our margarins.
      • Will btrfs keep my servers from grinding? I'm a bit worried that if they churn too much, my files will separate!
      • And most importantly, In an emergency, can I use btrfs for a smoother fsck?
    7. Re:I can't believe... by RuBLed · · Score: 2, Funny

      ButterCupFS - Just when you thought it had everything built up, it will then turn you down and mess things around.

      you said for yourself that this was getting desperate

    8. Re:I can't believe... by GFree678 · · Score: 1

      Honestly, what is it with FOSS and crappy names? (looking at you, gimp)

      Programmers don't know/give a damn about marketing. Remember, they generally create something for themselves, so the name really doesn't matter to them. For better or worse.

    9. Re:I can't believe... by garphik · · Score: 1

      Somehow I don't know why (maybe because of the current nomenclature trend), but this doesn't seem funny (or a joke) to me. Those things actually make sense, goodness ...

    10. Re:I can't believe... by Anonymous Coward · · Score: 0

      Time to wake up the gimp...

    11. Re:I can't believe... by Anonymous Coward · · Score: 0

      If you drop a hard drive formatted with btrfs, does it always land pcb-side down?

    12. Re:I can't believe... by Tribbin · · Score: 1
      --
      If you mod this up, your slashdot background will turn into a beautiful sunset!
    13. Re:I can't believe... by foniksonik · · Score: 1

      Maybe they should set up an X-Prize - $10 to anyone who can come up with a better name.... of course companies with money actually pay hundreds of thousands to marketing firms to do the same thing. I know, I work at one.

      --
      A fool throws a stone into a well and a thousand sages can not remove it.
    14. Re:I can't believe... by foniksonik · · Score: 1

      I could only think of a quote:

      "It's good cholesterol, but it spreads like bad cholesterol."

      -Dr. Zoidberg

      --
      A fool throws a stone into a well and a thousand sages can not remove it.
    15. Re:I can't believe... by mortonda · · Score: 1

      you butter stop cooking up any more puns, or your audience might slip away....

    16. Re:I can't believe... by Hillgiant · · Score: 1

      2. But will it run on toast?

      I'm pretty sure there is a netBSD port out there somewhere for toast....

      --
      -
    17. Re:I can't believe... by Anonymous Coward · · Score: 0

      I understand a file system is the bread and butter of a computer, but if I print into a file on ButterFS, will it jam too ?

  9. Butters' FS! by russlar · · Score: 3, Funny

    Great for playing "Hello Kitty! Adventures"

    --
    Anybody want my mod points?
    1. Re:Butters' FS! by renegadesx · · Score: 1

      I hear the project has got a new lead developer.. Professor Chaos.

      --
      Make SELinux enforcing again!
  10. Re:BTRFS? REALLY? by Anonymous Coward · · Score: 4, Funny

    Butter Fase probably intended as Butter Face.

    Sounds like "But Her Face" as in: She has a great body, but her face...

  11. Whoa! by aevans · · Score: 5, Funny

    A Linux article on Slashdot!?

    1. Re:Whoa! by Anonymous Coward · · Score: 1, Funny

      You're right.. the terrorists must have won ;)

    2. Re:Whoa! by icydog · · Score: 3, Funny

      You must be... old here.

  12. Re:BTRFS? REALLY? by initialE · · Score: 5, Insightful

    Why not? It's a good analogy for FOSS after all. Great software, robust and all, but her face...

    --
    Starbucks, Harbuckle of Breath.
  13. what's a "next generation" file system? by seanadams.com · · Score: 2, Interesting

    Something like ZFS immediately comes to mind... but is there some generally accepted definition of what makes a file system "next generation"? TFA doesn't say, and I hate to diminish anyone's efforts here, but the new features in ext4 (according to wikipedia) aren't much to write home about: higher precision time stamps, larger volumes, larger directories, faster fscking. These may be worthy accomplishments but they are incremental improvements, not anything new. Or did I miss something?

    1. Re:what's a "next generation" file system? by jonasj · · Score: 1

      Or did I miss something?

      Yes, it seems that you did actually miss one thing, namely that it wasn't ext4 that was referred to as a next-generation filesystem, but rather btrfs. So while ext4 might only have incremental improvements, noone claimed anything else. btrfs is what they claimed was next-gen, with ext4 being, as the title says, an interim step.

      As to a definition of what makes a filesystem "next-generation", I cannot say.

      --
      You know, Microsoft's street address also says a lot about their mentality.
  14. Re:BTRFS? REALLY? by hampton · · Score: 5, Funny

    You're right. BTRFS is really silly. I recommend that the shortened form be ButtFS.

  15. Re:BTRFS? REALLY? by Dwedit · · Score: 1, Redundant

    I think it reads more like "Bit Rot" filesystem, perfect for 20 year old EPROM chips.

  16. B-tree based Filesystem by Zombie+Ryushu · · Score: 1

    I saw that and couldn't help but think, are they trying to make a filesystem based on the B-tree concept?

    1. Re:B-tree based Filesystem by AmberBlackCat · · Score: 3, Funny

      That's exactly what they're doing. The plan is to limit every directory to exactly two files or subdirectories that will be kept in alphabetical order. That way, you can find any file on your drive in log(n) time. Future updates are planned for people who have more than two songs by the same artist.

    2. Re:B-tree based Filesystem by hitchhacker · · Score: 2, Interesting
      B-Tree:

      Not to be confused with binary tree.

      -metric

    3. Re:B-tree based Filesystem by glwtta · · Score: 1

      B-trees are not the same as binary trees.

      --
      sic transit gloria mundi
    4. Re:B-tree based Filesystem by Anonymous Coward · · Score: 2, Informative

      A B-Tree can have N children per node, where N is determined by the number of child links you can fit in one block. You are thinking of a binary-tree.

    5. Re:B-tree based Filesystem by mike_sucks · · Score: 1

      For want of mod points: LOL

      --
      -- "So, what's the deal with Auntie Gerschwitz et all?"
    6. Re:B-tree based Filesystem by AmberBlackCat · · Score: 1

      I personally think my comments, about a system that would hold two items and hold more than two in the future, fit a b-tree well. But I really wasn't expecting to have to legitimize comments that were meant to be a joke in the first place.

    7. Re:B-tree based Filesystem by MikeBabcock · · Score: 1

      But it really is funny seeing people not get that.

      I still liked Reiser's B* trees that get slowly reordered and sorted in passes.

      --
      - Michael T. Babcock (Yes, I blog)
  17. Re:BTRFS? REALLY? by Anonymous Coward · · Score: 0

    Personally, I like "BuTt RiFfS" better.

  18. No by RuBLed · · Score: 1

    buttered only

  19. Re:BTRFS? REALLY? by blahplusplus · · Score: 5, Insightful

    "Couldn't they come up with a better name than "BuTteR FaSe?" I know I can't be the only one who read it like that. Call it anything but that."

    I read it as:

    BeTteR FileSystem

    I guess we'll have to part was :P

  20. Re:BTRFS? REALLY? by spazdor · · Score: 5, Funny

    Good, strong file-bearing hips!

    --
    DRM: Terminator crops for your mind!
  21. Btrfs = Bit torrent file system? by phreakv6 · · Score: 1

    that outta make a good marketing strategy

    --
    fifteen jugglers, five believers
  22. Re:BTRFS? REALLY? by coaxial · · Score: 1

    Yeah. You are.

  23. You're both right. by SanityInAnarchy · · Score: 5, Interesting

    ZFS duplicates a lot of functionality that belongs outside of a filesystem.

    Very true.

    It wouldn't be possible to duplicate RAID-Z with LVM.

    Also true.

    And the features which could be duplicated, couldn't be done nearly as well without a little more knowledge of the filesystem.

    The real problem here is that we're finding out that generic block devices aren't enough to do everything we want to do outside the filesystem itself. Or, if they are, it's incredibly clumsy. Trivial example: If I want a copy-on-write snapshot, I have to set aside (ahead of time) some fixed amount of space that it can expand into. If I guess high, I waste space. If I guess low, I have to either expand it (somehow, if that's even possible) or lose my snapshot.

    A filesystem which natively implemented COW could also trivially implement snapshots which take up exactly as much space as there are differences between the increments. But because of the way the Linux VFS is structured, this kind of functionality would have to be in a single filesystem, and would be duplicated across all filesystems. Best case, it'd be like ext3's JBD, as a kind of shared library.

    A humble proposal: We need another layer, between the block layer and the filesystem layer -- call it an extent layer -- which is simply concerned with allocating some amount of space, and (perhaps) assigning it a unique ID. Filesystems could sit above this layer and implement whatever crazy optimizations or semantics they want -- linear vs btree vs whatever for directories, POSIX vs SQL, whatever.

    The extent layer itself would only be concerned with allocating extents of some requested size, and actually storing the data. But this would be enough information to effectively handle mirroring, striping, snapshotting, copy-on-write, etc.

    It wouldn't be universal -- I've said nothing about the on-disk format, and, indeed, some filesystems exist on Linux solely for that purpose -- vfat, ntfs, udf, etc. Those filesystems could be done pretty much exactly the way they're done now. After all, the existence of a block layer in no way implies that every filesystem must be tied to a block device (see proc, sys, fuse, etc.)

    But I think it would work very well for filesystems which did choose to implement it. I think it would provide the best of ZFS and LVM.

    I haven't actually been seriously following filesystem development for years, so maybe this is already done. Or maybe it's a bad idea. If not, hopefully some kernel developers are reading this.

    --
    Don't thank God, thank a doctor!
    1. Re:You're both right. by Daniel+Phillips · · Score: 1

      A humble proposal: We need another layer, between the block layer and the filesystem layer -- call it an extent layer -- which is simply concerned with allocating some amount of space, and (perhaps) assigning it a unique ID. Filesystems could sit above this layer and implement whatever crazy optimizations or semantics they want -- linear vs btree vs whatever for directories, POSIX vs SQL, whatever.

      Woohoo, you've invented an LVM.

      --
      Have you got your LWN subscription yet?
    2. Re:You're both right. by jdogalt · · Score: 1

      "
      A humble proposal: We need another layer, between the block layer and the filesystem layer -- call it an extent layer
      "

      Hmm... I guess it's below the block layer, but... devicemapper/lvm?

      "
      But this would be enough information to effectively handle mirroring, striping, snapshotting, copy-on-write, etc.
      "

      Yeah, dm does all that... (but I'm guessing not quite as much as you were looking for)

    3. Re:You're both right. by Anonymous Coward · · Score: 1, Interesting

      The catch is that coming up with such a layer is kind of tricky. That's probably why Sun didn't bother - they intended to only ever implement one filesystem using it, and had an interest in not making those extra features available in the other supported filesystems.

      Another intermediate layer that sits between the block device and the filesystem, along with some kind of support from the VFS for the new features, would probably be enough to allow implementing a ZFS-like filesystem in Linux. The big ones are that it has to be able to interact better with LVM, and has to be able to handle COW semantics (that'd be a fantastic feature to expose to userspace). With that as a base, most of the features are either already there (in LVM, or Linux software RAID) but need a bit of improvement (like the page cache), or can be done in the filesystem itself (checksumming).

    4. Re:You're both right. by Wonko · · Score: 2, Interesting

      Trivial example: If I want a copy-on-write snapshot, I have to set aside (ahead of time) some fixed amount of space that it can expand into. If I guess high, I waste space. If I guess low, I have to either expand it (somehow, if that's even possible) or lose my snapshot.

      That still only covers one deficiency of LVM snapshots. LVM snapshots are read only and intended to be temporary. I'm also pretty sure you can't snapshot a snapshot, which wouldn't be at all helpful with a read only snapshot anyway.

      A humble proposal: We need another layer, between the block layer and the filesystem layer -- call it an extent layer -- which is simply concerned with allocating some amount of space, and (perhaps) assigning it a unique ID. Filesystems could sit above this layer and implement whatever crazy optimizations or semantics they want -- linear vs btree vs whatever for directories, POSIX vs SQL, whatever.

      We'd never be able to get it right and it would probably be more likely to get in the way. We seem to be learning that we can do much niftier things by tightly coupling what used to be very separate layers.

      I haven't actually been seriously following filesystem development for years, so maybe this is already done. Or maybe it's a bad idea. If not, hopefully some kernel developers are reading this.

      I don't really believe it is a bad idea. I do think it would have to be too heavy of a layer, though. It would have to track which file systems own each extent, and if you want to come close to matching RAID-Z you are going to need to be able to return very small extents (LVM defaults to 4MB, IIRC). If a file system is going to be requesting 4k extents you're going to have a lot of overhead in storing the extent ownership and size information. You're also going to have a lot of overhead in checking who owns each extent on any given read or write. I can think of ways to optimize that a bit, but I imagine it'll still have a significant space+performance impact.

    5. Re:You're both right. by SanityInAnarchy · · Score: 1

      Well, actually, no. Not unless you're suggesting that LVM is appropriate at the file level -- as in, creating a new device per-file.

      --
      Don't thank God, thank a doctor!
    6. Re:You're both right. by MikeBabcock · · Score: 1

      You beat me to it. A lot of people aren't familiar with how powerful LVM is on Linux though.

      --
      - Michael T. Babcock (Yes, I blog)
    7. Re:You're both right. by Anonymous Coward · · Score: 0

      Check out 'fossil' and 'venti', plan9's block storage server and filesystem respectively. It's a similar idea.

    8. Re:You're both right. by Anonymous Coward · · Score: 0

      Tux3 will have versioning and extent handling, so what you are asking is in development.

    9. Re:You're both right. by ArsonSmith · · Score: 1

      It's been over a year ago now, but I've done COW snapshots of Xen volumes useing LVM. basicly created a Xen volume and installed an OS on it and shut it down. Then I would take a COW snapshot of it and boot it for any further VMs that I needed of that OS. It worked great and saved a lot of space on my constrained little test environment.

      --
      Paying taxes to buy civilization is like paying a hooker to buy love.
    10. Re:You're both right. by ArsonSmith · · Score: 1

      "The catch is that coming up with such a layer is kind of tricky. That's probably why Sun didn't bother - they intended to only ever implement one filesystem using it, and had an interest in not making those extra features available in the other supported filesystems."

      'round here we call that "Vendor Lock In"

      --
      Paying taxes to buy civilization is like paying a hooker to buy love.
    11. Re:You're both right. by Daniel+Phillips · · Score: 1

      Well, actually, no. Not unless you're suggesting that LVM is appropriate at the file level -- as in, creating a new device per-file.

      I do not see what in your description above would require a device per file.

      --
      Have you got your LWN subscription yet?
    12. Re:You're both right. by DamnStupidElf · · Score: 1

      The extent layer itself would only be concerned with allocating extents of some requested size, and actually storing the data. But this would be enough information to effectively handle mirroring, striping, snapshotting, copy-on-write, etc.

      I think the biggest problem is that the filesystem would lose the ability to make intelligent decisions about where to allocate extents from and how to index them, without the scheme devolving into a "allocate one extent the size of the disk and manage it locally" being the best performer. For instance, how should a file system agnostic extent management system allocate and index 100 extents? 1,000,000,000,000 extents? What would the inter-block slack policy be? Intra-block slack? Most of the hard parts of reliably packing extents into a block device are done different ways by different file systems for different reasons. Abstracting that out of the file system layer would probably not help performance or reliability.

      ext3 is probably already close enough to what you describe; it has a way of allocating extents and naming them (files) and using them for whatever you want. You could implement RAID on top of independent ext3 file systems pretty easily, but it would be inefficient. In a traditional RAID, block access is a simple calculation based on the stripe size, block size, and number of drives. In a RAID on top of extents, it would be necessary to store a list of extents that each stripe uses in order to locate a block within that stripe.

    13. Re:You're both right. by Electrum · · Score: 1

      We need another layer, between the block layer and the filesystem layer [...] which is simply concerned with allocating some amount of space [...] Filesystems could sit above this layer

      This is exactly how ZFS is designed:

      • SPA - Storage Pool Allocator: allocates blocks from all the devices in a storage pool
      • DMU - Data Management Unit: consumes blocks from the SPA and exports objects (flat files)
      • ZPL - ZFS POSIX Layer: makes DMU objects look like a POSIX file system

      See page 7 of this ZFS presentation for a comparison with the traditional block device model.

    14. Re:You're both right. by Electrum · · Score: 1

      The catch is that coming up with such a layer is kind of tricky. That's probably why Sun didn't bother

      Actually, they did. ZFS has the SPA and DMU layers that sit above the ZPL.

    15. Re:You're both right. by SanityInAnarchy · · Score: 1

      My description was of an extent layer -- that is, when a filesystem needs an extent, it asks this layer. The idea is that, if a filesystem were to use this for individual files, it would allow the per-file striping/mirroring of ZFS, but would work with any filesystem implementing this layer.

      I doubt either LVM or device-mapper would work for the number of extents I'm talking about.

      --
      Don't thank God, thank a doctor!
    16. Re:You're both right. by SanityInAnarchy · · Score: 1

      We'd never be able to get it right

      All the more reason to make it pluggable. But hey, at least we could try.

      It would have to track which file systems own each extent

      I don't really see why. All it needs to keep track of is a unique id for the extent. Garbage collection could be left up to the filesystems -- or to another layer, depending on what people end up building.

      All it would have to keep track of is at least one superblock. That would be enough for another system to bootstrap, and handle "ownership" of the rest -- but I still don't think it needs to be tracked beyond, say, a superblock per filesystem.

      Another analogy: This is like Ring 0. Yes, any filesystem could accidentally trample all over another filesystem's extents, just as any kernel driver can trample over another kernel driver's memory. If you really need the protections of userspace, you build something on top of it.

      That's not entirely a new idea -- LLVM handles the low-level details of linking, native machine code, etc, but does absolutely nothing for garbage collection. Elsewhere, some languages use garbage collection libraries, and some use simple reference counting -- but LLVM is designed to be able to host them all.

      if you want to come close to matching RAID-Z you are going to need to be able to return very small extents (LVM defaults to 4MB, IIRC).

      Which is why LVM is not appropriate for this. Nor is device-mapper.

      I can think of ways to optimize that a bit, but I imagine it'll still have a significant space+performance impact.

      I don't think it would have a significant space impact -- especially as it would utterly trivialize the process of adding transparent compression to any filesystem.

      I don't know enough to really say whether it would have a significant performance impact. I don't think it would.

      I also think that it would be worth a performance impact. I can sacrifice a few cycles if it means a better structured, more reliable system.

      Thanks for your response -- it looks like most people thought I was just asking for LVM. I'm glad at least one person gets it.

      --
      Don't thank God, thank a doctor!
    17. Re:You're both right. by SanityInAnarchy · · Score: 1

      I think the biggest problem is that the filesystem would lose the ability to make intelligent decisions about where to allocate extents from and how to index them

      And the extent layer would pick up that ability.

      What additional information does the filesystem have, that it would make this choice with?

      without the scheme devolving into a "allocate one extent the size of the disk and manage it locally" being the best performer.

      Certainly, most database engines have devolved into "allocate one file the size of the data and manage it locally". I would guess that's mostly because existing filesystems don't provide a better option.

      Most of the hard parts of reliably packing extents into a block device are done different ways by different file systems for different reasons.

      Most are also done making large assumptions about the underlying media -- mostly that it is a single magnetic disk.

      ext3 is probably already close enough to what you describe; it has a way of allocating extents and naming them (files) and using them for whatever you want.

      I never said they would be named.

      And the names introduces a whole new world of filesystem-specific performance implications (Trivial example: dir_index or not?) and actual limitations (UNIX file permissions? POSIX ACLs? Extended attributes? Separate streams?) that belong more to a filesystem (or database) than to a simple storage manager.

      Rather, you would allocate an extent, and it is automatically assigned a number, which you memorize (somehow). So it's much closer to an inode -- but still not the same, as an inode stores metadata about the file besides its physical location on disk.

      In a RAID on top of extents

      No, the point is that the extent layer itself would be where you implement RAID. Not above it (because then it's filesystem-specific, and you've got another ZFS), and not below it (because then you lose information).

      Trivial example: RAID-Z, from ZFS, will only bother reconstructing space which is actually allocated, somehow. Traditional block-level RAID can only reconstruct the entire disk, including unallocated (and thus worthless) space -- because with no knowledge of the filesystem, it can't know what's allocated or not.

      --
      Don't thank God, thank a doctor!
    18. Re:You're both right. by SanityInAnarchy · · Score: 1

      Cool!

      I guess I can't claim originality, but at least that's some reassurance that I know what I'm talking about.

      --
      Don't thank God, thank a doctor!
    19. Re:You're both right. by badkarmadayaccount · · Score: 1

      I think GEOM is better, personally.

      --
      I know tobacco is bad for you, so I smoke weed with crack.
  24. ZFS's "LVM" is not like Linux's LVM. by jafo · · Score: 1

    The volume and snapshotting functionality in ZFS is quite different from that in the Linux kernel.

    When you have the LVM and RAID inside the file-system, you can easily do things like do a RAID rebuild of *ONLY* the data the file-system is using. So if you have a 5TB file-system with 100MB of space in use, you can do a RAID rebuild in a few seconds instead of several hours.

    Ditto for the snapshots. Linux's LVM implements snapshots, but you have to allocate storage to the snapshots specifically, where in ZFS you can use free space inside the file-system volume for snapshotting.

    For example, one ZFS system I have currently has 229 snapshots on it, and I don't have to worry about any of them running out of space unless the whole file-system runs out.

    So, saying that these are features that should be implemented outside of the file-system is easy to say, but loses a lot of the functionality you gain if they are all closely tied together.

    Sean

    1. Re:ZFS's "LVM" is not like Linux's LVM. by Daniel+Phillips · · Score: 1

      When you have the LVM and RAID inside the file-system, you can easily do things like do a RAID rebuild of *ONLY* the data the file-system is using.

      Surely you can imagine a relatively trivial mechanism whereby LVM raid could be instructed to resync only data which is currently allocated by the filesystem, a "you can treat these ranges as empty until written" advisory.

      --
      Have you got your LWN subscription yet?
  25. tytso by Anonymous Coward · · Score: 0

    rgtfo..

  26. Re:BTRFS? REALLY? by AmberBlackCat · · Score: 1

    It could also be read as BeTteR File System.

  27. Re:BTRFS? REALLY? by msormune · · Score: 1

    I read it as "BitterFS" :)

  28. Re:BTRFS? REALLY? by Anonymous Coward · · Score: 0

    So, cue the jokes.
    How is this filesystem? Like Btr.

    "ButterFS"--they're afraid to claim it is "BetterFS"?

    That would mean we'd still have to wait for bstFS.

    I guess if it's going to stay "butterFS", there ought to a special loopback driver so that you can put the butter on top of something else: brdFS.

  29. If you want a blazingly fast file system.... by FlyingGuy · · Score: 2, Informative

    Then look no farther then NSS ( Novell Storage Services ).

    It is Open Source, you get the full source if you download SLES.

    It has more of the desired features then anything else on the block right now.

    This should be the default file system for Linux. It has years of very heavy duty R&D behind it, it is pretty much completely de-bugged and ready to rock.

    --
    Hey KID! Yeah you, get the fuck off my lawn!
    1. Re:If you want a blazingly fast file system.... by moosesocks · · Score: 4, Interesting

      Max Volume Size: 8 TiB.

      That's not enough. Given that 1TB storage devices are on the market now, that could become outdated quite quickly. You'd be foolish to adopt that sort of filesystem, unless you were absolutely positive that you'd never upgrade (unlikely).

      Honestly, ZFS seems like it's the holy grail of filesystems. There are a few small issues that might need to be worked out, though it seems as close to "ideal" as you'd ever be able to get.

      --
      -- If you try to fail and succeed, which have you done? - Uli's moose
    2. Re:If you want a blazingly fast file system.... by QuoteMstr · · Score: 1

      Various limits can be increased easily be increasing the sizes of the relevant fields. The underlying code and design doesn't need to change much at all.

    3. Re:If you want a blazingly fast file system.... by mrjb · · Score: 1

      Honestly, ZFS seems like it's the holy grail of filesystems. There are a few small issues that might need to be worked out, though it seems as close to "ideal" as you'd ever be able to get.

      Depends on the features you need. I for one can see huge benefits in companies using a distributed filesystem across their computers. ZFS doesn't do that, so there's still room for improvement. Oh, and at some point 16 EiB won't be enough anymore.

      --
      Visit http://ringbreak.dnd.utwente.nl/~mrjb/growingbettersoftware to download your free copy of the book
    4. Re:If you want a blazingly fast file system.... by Kent+Recal · · Score: 3, Interesting

      Well, it looks interesting feature-wise but they seem to be explicitly targeting SuSE - which is a no-go for most people.
      From a glance at the docs (hey, at least they have docs, that's a plus) it also seems like it's tied to specific versions of EVMS and other parts of the kernel, thus if you don't run a "blessed, certified" SuSE kernel with all the nasty patches then you're on your own.

      Just google for "debian|gentoo|redhat|... novell nss filesystem". Apparently nobody even tried to run NSS on another distro, or at least didn't write about it.

      I, for one, would only touch this on a blackbox, vendor-supported appliance but never consider it for a production server of my own (none of which run SuSE).
      If they worked towards integrating it into the mainline kernel, now that would be nice.

    5. Re:If you want a blazingly fast file system.... by chrysalis · · Score: 1

      This is why HAMMER is the holy grail of filesystems.

      Oh, and it's BSD-licensed.

      --
      {{.sig}}
    6. Re:If you want a blazingly fast file system.... by hab136 · · Score: 1

      This is why HAMMER is the holy grail of filesystems.

      Well, just like the holy grail, it certainly was a pain to find: http://www.dragonflybsd.org/hammer/. Hint to developers: don't reuse common words as your project name, makes it difficult to Google.

    7. Re:If you want a blazingly fast file system.... by Shados · · Score: 1

      Hey, at least they didn't pick a 3 letter word, with punctuation, that is used in all of the URLs of a major top level domain, and is also not searchable in most forum softwares like others

    8. Re:If you want a blazingly fast file system.... by FlyingGuy · · Score: 1

      Yes it is only ported to SUSE at this point, but lets face it that is Novell's focus. It is however FOOS and that is the point. Where are you going to get a file system, that has that much R&D and debugging behind it that is at that level and completeness and correctness.

      One of the most critical and outstanding features of this file system is it's ability to add storage space On The Fly, no re-booting, no taking the server down. Shove another drive into a spare hot-swap cage slot, partition, mount and add to any volume and no one knows the event occurred.

      --
      Hey KID! Yeah you, get the fuck off my lawn!
    9. Re:If you want a blazingly fast file system.... by moosesocks · · Score: 1

      Oh, and at some point 16 EiB won't be enough anymore

      I wouldn't be too sure of that.

      From Wikipedia:

      Although we'd all like Moore's Law to continue forever, quantum mechanics imposes some fundamental limits on the computation rate and information capacity of any physical device. In particular, it has been shown that 1 kilogram of matter confined to 1 litre of space can perform at most 1051 operations per second on at most 1031 bits of information.[10] A fully populated 128-bit storage pool would contain 2128 blocks = 2137 bytes = 2140 bits; therefore the minimum mass required to hold the bits would be (2140 bits) / (1031 bits/kg) = 136 billion kg. To operate at the 1031 bits/kg limit, however, the entire mass of the computer must be in the form of pure energy. By E=mcÂ, the rest energy of 136 billion kg is 1.2x1028 J. The mass of the oceans is about 1.4x1021 kg. It takes about 4,000 J to raise the temperature of 1 kg of water by 1 degree Celsius, and thus about 400,000 J to heat 1 kg of water from freezing to boiling. The latent heat of vaporization adds another 2 million J/kg. Thus the energy required to boil the oceans is about 2.4x106 J/kg * 1.4x1021 kg = 3.4x1027 J. Thus, fully populating a 128-bit storage pool would, literally, require more energy than boiling the oceans.

      --
      -- If you try to fail and succeed, which have you done? - Uli's moose
    10. Re:If you want a blazingly fast file system.... by joib · · Score: 1

      Uh, you can do that today with LVM and ext3.

    11. Re:If you want a blazingly fast file system.... by FlyingGuy · · Score: 1

      LVM/ELVM each have their own problems that are well documented. NSS has all of that built in , debugged and is proven correct!.

      --
      Hey KID! Yeah you, get the fuck off my lawn!
    12. Re:If you want a blazingly fast file system.... by Kent+Recal · · Score: 1

      Well, NSS may be the best filesystem on the planet (many think the same about ZFS) but its close ties to SuSE linux with all the proprietary long-tail is still a showstopper for many if not *most* potential users. Most sysadmins I know (and I know a few) wouldn't touch SuSE with a 10ft pole and not remotely consider it for a server.

      Furthermore the features you named are not earth-shaking. Feature-wise it seems to rival ZFS but with a much smaller user-base and much less momentum. It's honestly the first time I hear at all about NSS and I'm sure I'm not alone...

      Anyways, I still stand to my point. It'd be nice if novell opened up and made NSS available on all linux distros. New filesystem options on linux are always welcome (reiser being dead, xfs being flakey and ext2/3 showing their age), so even if Novell charged a *reasonable* fee for commercial use I'd think it could gain some ground - if it really proves as solid in production as you describe it.

      In reality, though, they'll probably stick to their proprietary route and NSS will stay in its niche until it's eventually obsoleted by a new mainsteam linux filesystem like hammer, butter, ext5, zfs-for-linux or so....

    13. Re:If you want a blazingly fast file system.... by EvilRyry · · Score: 1

      With a max volume size of 8TB, you'll never get to grow it too much. 8TB might sound really wicked for your desktop, but its small game for even small servers these days.

    14. Re:If you want a blazingly fast file system.... by FlyingGuy · · Score: 1

      That size limit is very easily correctable.

      --
      Hey KID! Yeah you, get the fuck off my lawn!
    15. Re:If you want a blazingly fast file system.... by FlyingGuy · · Score: 1

      NSS has no proprietary tail. It is now FOOS! Novell Open Sourced it like 3 years ago. The full and complete source is on the SLES DVD. It lived on forge.novell.com for a while as well.

      --
      Hey KID! Yeah you, get the fuck off my lawn!
    16. Re:If you want a blazingly fast file system.... by Kent+Recal · · Score: 1

      NSS does have a proprietary tail: It only runs on SuSE linux with the custom SuSE kernel, as I explained in my previous post.
      Ofcourse you can correct me if I'm wrong, just post a link to an installation tutorial for any other distro (Debian, Gentoo, Redhat etc.).

    17. Re:If you want a blazingly fast file system.... by FlyingGuy · · Score: 1

      Well gosh golly! What a surprise! Novell invested their time and money into porting NSS to an OS they are supporting, shocking I tell, simply shocking!

      I do believe that SUSE uses the same kernel that all other *nix distro's use, correct me if I am wrong, but anything they change in the kernel gets pushed back out to the world right to opensuse,org

      Does it need some dev work, yes I bet it does.

      The point is, that is is FOOS, ZFS is NOT, it definitely has proprietary code, that SUN has no control over in it as has been discussed on /. many times.

      NSS; however, does not since it was developed BY Novell FOR Novell to be the files system for NetWare. Novell open sourced the complete NSS system, all of it, every bit of it. The source is included with a SELS server, a bright DEV could port it over as a native FS for *nix and the *nix community would have a KILLER file System.

      Since SLES is FOOS software then everything Novell does gets sent back into the community.

      the "community" needs to get a clue, NSS is better then anything out there as of this writing. And it has one other thing that NONE of the others do and that is the ability to undelete ANY file in ANY directory, as long as the file and or directory has not been flagged as "purge immediate"

      --
      Hey KID! Yeah you, get the fuck off my lawn!
    18. Re:If you want a blazingly fast file system.... by Kent+Recal · · Score: 1

      Dude, you're making a fool of yourself. If NSS really is this "killer" filesystem (wasn't that term reserved for reiser?) that is better than anything else out there... then why did no "bright dev" ever bother to port it all over the place? Why did nobody even bother to at least make it available on other linux distros?

      Why do all these bright hackers rather work on porting ZFS or developing new filesystems (hammer, butter, ext4)?

      Maybe because here in the real world, NSS has a reputation for being slow and buggy?

    19. Re:If you want a blazingly fast file system.... by FlyingGuy · · Score: 1

      I don't know what world you are living in, but I have been working with NSS since its inception, and it is neither slow or buggy.

      You want to know the real problem, well here it is... *nix heads, thats the problem. Instead using a tried and true solution that could be ported to generic *nix very quickly, they choose to re-invent the wheel and why? because its a religion to them. Novell broke a rule, and did what they had to do to cover their and their customers asses, namely the agreement with Microsoft. m well lets see what Novell has done to protect *nix shall we?

      • They dumped HUGE amounts of cash countering the law suite from SCO to protect Linux, but hey FUCK THEM right?
      • They brought Linux from obscurity and have turned out a distro that is clean, runs fast, is secure, has made its way into more corporations and paved the way for Linux acceptance in the the Fortune 500 then Red Hat ever even thought about, but hey FUCK THEM right?
      • Ohh yeah, they basically have funded the entire MONO project, but hey FUCK THEM right?
      • Their distro of OOo kicks ass and they shove everything they do right back to OOo dot org, which is the same as saying, they pumped as much if not more then Sun ever did into OOo, but hey FUCK THEM right?

      I will put up an NDS powered NetWare server running F&P services against ANY Linux distro and it will blow its doors off. Hell I will put NetWare 3.2 ( still the record holder for F&P services ) against ANY OTHER OS out there and it will just leave it in the dust, and all of that speed comes down to the File System. before NSS it was the NetWare native file System. Slow to mount to be sure, but once mounted NOTHING could touch it in terms of speed an security. Then came NSS.. An 8TB volumes mounts in about 10 seconds and uses a MOST 4 megs of RAM and STILL blows the doors off of anything out there.

      You want to do the Linux community a favor? Tell them to drop the "its a religion" bullshit and get them to use what has been made FREELY available to them.

      --
      Hey KID! Yeah you, get the fuck off my lawn!
    20. Re:If you want a blazingly fast file system.... by Kent+Recal · · Score: 1

      Calm down mate. I'm a bit baffled at how hysteric you get over this. We're talking about filesystems, right?

      It's difficult to find a point in your 3rd grade polemics, so I'll just state the obvious:
      There is no such thing as "the *nix heads" who have decided in a central board meeting to "FUCK NOVELL" and collectively boycot NSS.

      The simple fact that NSS has been around for a while but hasn't gained any significant popularity is a strong indicator that it is apparently not as perfect as you paint it here.

  30. Re:BTRFS? REALLY? by deniable · · Score: 4, Funny

    I read it as BeaterFS and wondered if it was too soon for ReiserFS jokes.

  31. Missed one. by xstonedogx · · Score: 1

    ParkayFS?

    Butter.

  32. Why all the fragmentation? by 0x000000 · · Score: 1

    This is something that has bothered me about open source in general for a while now, why is that there is so much fragmentation? So many wheels that are being re-implemented for the sake of being re-implemented? I agree that a new file system that supports all the new features of btrFS and ZFS are required, at the same time I don't understand all of this duplication. ZFS has some features that btrFS does not have, and vice-versa, why not spend the time developing a hybrid of the two, thereby massively increasing the usability and stability of both products, or rather, of just one product since the time and effort would only be but into the hybrid.

    If it is possible for Nvidia to use binary blobs for their graphics cards, it should be possible to use CDDL code with a compatibility shim in the Linux kernel. All this duplicated effort could instead be focused on one project, thereby having an all around better file system. btrFS has just recently started coming into fruition, would that time not be better spent improving ZFS?

    It seems that license issues are the only thing that is causing all of this trouble in the first place. As a user of a system I don't want to spend valuable time testing all the various file systems, I also don't want to have to support all of the different file systems that are available. With a project as large as Linux, and the amount of file systems that are available, how can it be guaranteed that the file system I ultimately go with has been properly bug tested, has had the proper code review done, and is not going to be shoved aside for the next new shiny file system that is introduced? As an end-user (and I hereby don't mean the home user group) I want stability. FreeBSD gives me UFS2, I know I can depend on it, I know it will still exist tomorrow, and I know that it is still being looked at for performance improvements and improvements in general, ZFS has recently been imported and is will exist for a long time. OpenSolaris gives me UFS, and ZFS, I know it is going to be around, I know it is going to be improved. Linux gives me XFS, JFS, ext2, ext3, ext4, ReiserFS, Reiser4, and now btrFS. Depending on my workload and who I ask I get told to use different file systems on Linux. Individually testing each and everyone would be time consuming and error prone, instead of all of these different file systems make one unified file system.

    For that reason, and that reason alone I use FreeBSD 7.0 and Solaris 10 on my servers. Stability is a good thing, I need some way to relay to my clients that there is a reasonable time schedule for new releases, that what they are storing their data on right now is going to be around tomorrow, and that it is stable, that it has been time proven and tested. Linux can not provide that at the moment.

    --
    cat /dev/null > .signature
    1. Re:Why all the fragmentation? by atraintocry · · Score: 2, Funny

      So you're saying someone should run a defrag on these filesystem projects?

    2. Re:Why all the fragmentation? by Martin+Soto · · Score: 1

      This is something that has bothered me about open source in general for a while now, why is that there is so much fragmentation? So many wheels that are being re-implemented for the sake of being re-implemented?

      People who make arguments of this type seem to think that there is some sort of Linux Corporation doing all the work. If this were the case, it would make no sense to have the large number of competing projects we actually see in the Open Source community. The Corporation's upper management would decide which projects continue and which get killed, and that would be the end of the story.

      In practice however, the Open Source community is a very heterogenous bunch of people, with no central control or source of founding, no common motivation, and no unified objectives. Some people contribute for altruistic reasons, some because they want interesting technical challenges, and some just get paid for doing it. Each participating company has its own reasons to be there, and companies doing Open Source software are often competing with each other.

      Rather than thinking of Open Source as a single company working for a single objective, you should compare the Open Source community with the open market. There are often companies offering similar products and competing for the clients, and people usually see this as possitive, not negative. Indeed, the actual surprise is that there is so much collaboration around Open Source.

    3. Re:Why all the fragmentation? by vadim_t · · Score: 1

      Why? Many reasons.

      Filesystems are sexy. Developing a successful one will make you famous, being a contributor not nearly so much.

      Filesystems are specific pieces of software that are carefully optimized and have to be designed with tradeoffs in mind. You can't just go and add bits of ZFS into say, ReiserFS. I'm sure it would have been a far from trivial undertaking, Hans would have disagreed a lot back when he was free, it probably wouldn't be doable elegantly, perhaps performance wouldn't be good, the kernel developers wouldn't accept the resulting mess, etc, etc.

      Then not nearly everybody agrees on what is the best filesystem. I for one don't like the approach of sticking everything into ZFS and would prefer to keep my RAID and LVM separate, thank you very much. On a longer term I'd like to see things like Z-RAID integrated into the Linux RAID, and be available to all filesystems.

      If it is possible for Nvidia to use binary blobs for their graphics cards, it should be possible to use CDDL code with a compatibility shim in the Linux kernel. All this duplicated effort could instead be focused on one project, thereby having an all around better file system. btrFS has just recently started coming into fruition, would that time not be better spent improving ZFS?

      Eww. That sort of thing needs to die, badly. Do you really want a kernel that the kernel developers will refuse to troubleshoot because your filesystem is always loaded, thus always tainted? Are you going to run your servers on that?

      There's no rush. I will wait for a GPLd, properly integrated into the kernel, supported by the developers version. ZFS may be cool but cooler things will come after it.

  33. when ext4 is feature complete it will be the #3 fs by ZeekWatson · · Score: 4, Interesting

    I'd like to know why Ted Tso and others are working on ext4? Even when ext4 is feature complete it will be the #3 filesystem in linux in terms of features and scalability behind xfs and jfs. I'd like to know what Ted Tso and others grudge against xfs and jfs is because they basically wont even acknowledge those filesystems.

    btrfs does have some nice looking features, its basically a gpl rewrite of zfs.

    The weakness with linux is in the LVM or EVMS layer. They both suck in that they are not enterprise ready (ie multi TB filesystems, 100+ MB/s sustained read/write) in that they cause unexplained IO hicups, lockups and kernel panics. LVM/EVMS certainly work fine for Joe Blow's HTPC, or a paltry 100GB database but they fall down when under serious load.

    This is the problem with open source. Certain areas, like filesystem development attract all the developers, and other areas like LVM/EVMS are seen as busting rocks and nobody wants to work on them. The results is we get a plethora of second rate filesystems (ie ext4) and a buggy LVM/EVMS layer that nobody wants to work on.

  34. Re:BTRFS? REALLY? by Anonymous Coward · · Score: 0

    I think of it more like BART Filesystem

  35. Re:BTRFS? REALLY? by msormune · · Score: 1

    Yeah maybe, but I was thinking about a dual meaning: "Bitter" could also be thought as "someone who sets bits". Or maybe it means the devs like their beer bitter. Or maybe BTRFS could be translated as "BetterFS"... BTR-60 and BTR-80 are also Russian armoured personal carriers so maybe this means BTRFS is solid enough to be used in heavy Russian military equipment...

  36. Ring 1 and 2? by krischik · · Score: 1

    Have you considered that modern CPU have more then 2 privilege levels?

    1. Re:Ring 1 and 2? by Anonymous Coward · · Score: 3, Interesting

      yes, IIRC Windows NT uses rings 0 and 4. However, the problem would not be made better by having more rings, the performance cost is the transition between rings, nothing special about the rings themselves. eg progressing from ring 10 to ring 9 is as expensive as going from ring 0 to 1, or from ring 0 to ring 100.

    2. Re:Ring 1 and 2? by Anonymous Coward · · Score: 0

      The cost of transitions between processor privilege levels ('rings' in your Intel-based language) is entirely architecture-dependent. It can be as cheap as changing a value in a single register.

    3. Re:Ring 1 and 2? by Eli+Gottlieb · · Score: 1

      Well sure, but no operating system actually uses them.

    4. Re:Ring 1 and 2? by DamnStupidElf · · Score: 3, Interesting

      Not exactly. To effectively change the actual permissions that the permissions rings allow, stacks, segment registers, i/o permission bitmaps, and page tables (among other things) have to be changed. Generally this means reading values from memory into caches, which is slow. Probably the slowest of them all is the page cache. Invalidating the entire page cache is godawful slow, and is necessary if each separate user-space has a truly private address space and not simply a chunk out of the entire virtual address space. Even for operating systems that partition the virtual address space into regions for each user process, the local descriptor (or equivalent) table for segment access needs to be reloaded. This has to happen for every cross-privilege-level call. It is *much* faster to simply call another kernel mode function (push some stuff on the stack, change the instruction register, and you're done) without messing with caches.

      In fact, it would be even faster to not separate the kernel and user space processes at all, and instead use formal verification or a virtual machine (which really just means a smaller instruction set that's easier to verify) to prove that no user process could ever mess with the kernel or other processes. Virtual machines for languages are essentially at this stage today; they implement what would constitute a kernel as the run-time level portions of the virtual machine, running the virtualized software in the same address space. There have been some attacks based on virtual machine weaknesses or memory corruption that break the protection model by changing data structures so that they violate the security model. This can happen in OS's that use hardware protection as well, there are just fewer places in memory that random changes can cause problems (just the page tables and other security paraphernalia), making it less likely.

    5. Re:Ring 1 and 2? by Ant+P. · · Score: 1

      x86-64 doesn't.

    6. Re:Ring 1 and 2? by Anonymous Coward · · Score: 0

      Your comment, like the great-grandparent comment, makes broad general statements while actually being x86 specific. For example:

      Probably the slowest of them all is the page cache. Invalidating the entire page cache is godawful slow, and is necessary if each separate user-space has a truly private address space and not simply a chunk out of the entire virtual address space. Even for operating systems that partition the virtual address space into regions for each user process, the local descriptor (or equivalent) table for segment access needs to be reloaded.

      Flushing an entire cache only to reload it with the same contents moments later is not only slow but it's moronic. However Intel doesn't give you much choice. Architectures implementing caches which tag entries with address space identifiers can avoid global invalidation and allow entries from any combination of processes to age out of cache normally.

      To effectively change the actual permissions that the permissions rings allow, stacks, segment registers, i/o permission bitmaps, and page tables (among other things) have to be changed.

      No. Think of user and group IDs in Unix. A setUID program can jump between levels of privilege quickly because this only involves changing an integer or two. This wouldn't be the case if, for instance, privileged access to files were instead implemented by changing permissions on all the files themselves. It's easier to change your credentials than to change everything else.

    7. Re:Ring 1 and 2? by DamnStupidElf · · Score: 1

      Flushing an entire cache only to reload it with the same contents moments later is not only slow but it's moronic. However Intel doesn't give you much choice. Architectures implementing caches which tag entries with address space identifiers can avoid global invalidation and allow entries from any combination of processes to age out of cache normally.

      That would definitely be useful, but it seems like a much larger cache would be required if there were a different address space for each process and hundreds of processes per system. Basically each page table entry in the cache would need entries for each address space that could access it, or it would need to store an owner id that could be matched against a process id or something similar. Another possibility is a capability based approach where each cache entry has an identifier and each process has a list of capability identifiers that allow it to access cache entries with matching identifiers. I haven't seen any architectures that actually implement a smart cache like you've mentioned, so I'd be interested in knowing which ones you're referring to.

      No. Think of user and group IDs in Unix. A setUID program can jump between levels of privilege quickly because this only involves changing an integer or two. This wouldn't be the case if, for instance, privileged access to files were instead implemented by changing permissions on all the files themselves. It's easier to change your credentials than to change everything else.

      I think the problem is that it's hard to implement permissions with only a uid and gid. There needs to be an arbitrary number of identifiers (capabilities) that a process can hold, or each page needs an arbitrary number of ACLs for user and group identifiers. I suppose the CPU could support loading a fixed set of identifiers or capabilities for a process at any given time, and the process could securely swap them out as necessary, but this could have a performance impact if a large number of different identities needed to be used. Perhaps I'm missing something about the security model you're talking about, though.

  37. Re:BTRFS? REALLY? by Hucko · · Score: 1

    Bloke, it is Canadian...

    --
    Semi-automatic amateur armchair Australian philosopher; conjecture ready at any moment...
  38. Re:when ext4 is feature complete it will be the #3 by Dionysus · · Score: 2, Insightful

    I'd like to know why Ted Tso and others are working on ext4? Even when ext4 is feature complete it will be the #3 filesystem in linux in terms of features and scalability behind xfs and jfs. I'd like to know what Ted Tso and others grudge against xfs and jfs is because they basically wont even acknowledge those filesystems.

    NIH

    --
    Je ne parle pas francais.
  39. In the Kernel? by DrSkwid · · Score: 1

    How primitive. Those Linux guys are savages!

    --
    There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
  40. Re:BTRFS? REALLY? by Hordeking · · Score: 0

    Nice legs, shame about the boat race.

    --
    Disclaimer: The opinions and actions of the US Gov't are in no way representative of those held by this author or its ci
  41. Re:BTRFS? REALLY? by Ragzouken · · Score: 5, Funny

    This is the internet, it's never too soon.

  42. Back when there was only fat16, ntfs, ext2 used by Anonymous Coward · · Score: 1, Insightful

    ext2 beat them all, hands down. Then for GNU Linux came cfs, hpfs, jfs, ext3, xiafs, reiserfs, onto ext4fs, and you know what was still easier to tune and outperformed all others in data retention and speed? You guessed right: ext2. Everything else is slowly becoming convalescent by needing complex database code that only makes stability no better when not a NAS mirror node, and worse is software writers depending on file system constructs through libraries outside ANSI C or worse by abstracting standard IO with said database. Doesn't anyone remember what simple functionality made Linux great, and now the same take it into a dark age of complexity to market their *ise Service route?

    1. Re:Back when there was only fat16, ntfs, ext2 used by vadim_t · · Score: 5, Informative

      I hope you're joking.

      ext2 is nice and simple, but it's neither fast not reliable. It uses a linear search to find directory entries, which means it's very slow on large directories, like Maildir mailboxes. It doesn't do tail packing which means it wastes space and is slower with small files. It's not reliable because without a journal it needs a fsck after a bad shutdown which takes ages on a modern disk, and recovers it worse than a journal would.

      Just search for benchmarks, something like reiserfs beats ext2 by huge margins when it comes to important workloads such as a mail server.

      There are very good reasons why distributions generally go with ext3, or one of the other filesystems. I haven't seen ext2 as the default option for the root FS in a very long time.

    2. Re:Back when there was only fat16, ntfs, ext2 used by jez9999 · · Score: 4, Funny

      Just search for benchmarks, something like reiserfs beats ext2 by huge margins when it comes to important workloads such as a mail server.

      Hell, it probably beats it to death.

    3. Re:Back when there was only fat16, ntfs, ext2 used by IceCreamGuy · · Score: 4, Insightful

      Yeah, I remember they used to talk about this in the Gentoo handbook; use ext2 for /boot, but ext3 for everything that you actually care about.

    4. Re:Back when there was only fat16, ntfs, ext2 used by branchingfactor · · Score: 1

      I'll back this up. I benchmarked reiserfs, ext2, ext3, xfs, and jfs for a real-world disk-intensive application and reiserfs stomped all over the other filesystems. ext3 was second but xfs and jfs were so far behind I would never use them.

    5. Re:Back when there was only fat16, ntfs, ext2 used by Chemisor · · Score: 4, Interesting

      > Just search for benchmarks, something like reiserfs beats ext2 by huge margins

      You mean like these ones where ext2 beats reiserfs in most cases and is at least as fast in the others?

      > I hope you're joking. ext2 is nice and simple, but it's neither fast not reliable.
      > It uses a linear search to find directory entries, which means it's very slow on
      > large directories, like Maildir mailboxes.

      Believe it or not, the world does not revolve around huge mail servers. Some of us actually run Linux on a desktop, and so don't really care about how well an fs handles a million maildir mailboxes. Latency is the most important criteria, and reiserfs is just too complicated to deliver it, as well as being a largely fringe fs. Especially now with Hans gone, it would become even more fringe.

      > It doesn't do tail packing which means it wastes space and is slower with small files.

      Yup, I'd like to have efficient small file handling. But really, it is better to avoid having many small files in the first place. Use compressed archives to store such things; it's quite a bit more efficient, and does not require exotic file systems which most normal people (i.e. your customers) will not use.

      > It's not reliable because without a journal it needs a fsck after a bad shutdown

      I used to do that, and then I got a UPS instead and switched back to pure ext2. The performance hit from journalling is simply too high to tolerate. A decent UPS (pretty much anything made by APC) will prevent the crashes in the first place, solving the problem completely and without any unnecessary overhead. With UPS prices being as low as they are, there is no excuse for not having one, so I think that journalling will become obsolete in some near future.

    6. Re:Back when there was only fat16, ntfs, ext2 used by Anonymous Coward · · Score: 0

      You mean, like a dead horse?

    7. Re:Back when there was only fat16, ntfs, ext2 used by MBGMorden · · Score: 4, Insightful

      so I think that journalling will become obsolete in some near future.

      I bet in 1992 you were still thinking color TV's wouldn't last either . . .

      Look, a UPS is a great thing. I run one myself. Heck with more and more people switching to laptops a lot of people are running a "UPS" without even realizing it. The simple fact though is that modern processors and disks are so fast that the minimal speed impact of journaling is barely noticeable. It's certainly not worth giving up over some marginal speed gains.

      I mean we're talking about a world where people will give up tons of speed in their computer just to make the WINDOWS WOBBLE when you move them, or to make teddy bears wave at them from the system tray. Do you honestly believe that they're going to risk having their files corrupt on an unexpected power outage for a fraction of a percent increase in meaningful speed?

      --
      "People who think they know everything are very annoying to those of us who do."-Mark Twain
    8. Re:Back when there was only fat16, ntfs, ext2 used by vadim_t · · Score: 4, Insightful

      You mean like these ones where ext2 beats reiserfs in most cases and is at least as fast in the others?

      Look at the bottom of the page. That's from 2003. Of kernel 2.6.0. A lot of code changed since then.

      Believe it or not, the world does not revolve around huge mail servers. Some of us actually run Linux on a desktop, and so don't really care about how well an fs handles a million maildir mailboxes. Latency is the most important criteria, and reiserfs is just too complicated to deliver it, as well as being a largely fringe fs. Especially now with Hans gone, it would become even more fringe.

      I'm not sure what exactly you mean by this. Latency is mostly influenced by the hard disk. And on a desktop the disk shouldn't be a bottleneck anyway.

      Yup, I'd like to have efficient small file handling. But really, it is better to avoid having many small files in the first place. Use compressed archives to store such things; it's quite a bit more efficient, and does not require exotic file systems which most normal people (i.e. your customers) will not use.

      Except there's lots and lots of those files in a modern Linux system. Config files, icon files, and small libraries for instance. Additionally many files are searched in different paths, making a fast directory search important.

      I used to do that, and then I got a UPS instead and switched back to pure ext2. The performance hit from journalling is simply too high to tolerate. A decent UPS (pretty much anything made by APC) will prevent the crashes in the first place, solving the problem completely and without any unnecessary overhead. With UPS prices being as low as they are, there is no excuse for not having one, so I think that journalling will become obsolete in some near future.

      Just as a RAID is not a backup, an UPS isn't a disk journal. One of those days you'll get a long outage, or the power cable will turn out to fit badly into the power supply, have a kernel panic, the UPS won't switch to battery fast enough, etc. And then after several minutes of fsck something important might end up broken.

      If the journal causes you a noticeable slowdown you probably aren't a typical user. In typical usage the disk should be mostly idle after boot.

      I don't see a point in going forward insanely fast without brakes. I'll take the safety. I have an UPS on every computer, and still have a journalled FS, because there were times when the UPS was of no help. Like yesterday, when I upgraded my laptop's RAM, booted it, and found that with more than 2GB RAM, the BIOS maps the video RAM above 4GB. The video card showed its displeasure with that state of affairs by corrupting the display and locking up. Had no choice but to powercycle the box.

    9. Re:Back when there was only fat16, ntfs, ext2 used by illumin8 · · Score: 5, Insightful

      I used to do that, and then I got a UPS instead and switched back to pure ext2. The performance hit from journalling is simply too high to tolerate. A decent UPS (pretty much anything made by APC) will prevent the crashes in the first place, solving the problem completely and without any unnecessary overhead. With UPS prices being as low as they are, there is no excuse for not having one, so I think that journalling will become obsolete in some near future.

      Yeah, because systems never kernel panic, or crash for any other reason than power outages... Wake me up after you've been waiting for fsck to finish on your 1TB drive and it's been running for the last 72 hours.

      Whether or not you've had a system shutdown uncleanly in the past, you certainly will at some time in the future, so why not just use ext3 and save yourself the headache of a 3 day long fsck?

      It's also painfully obvious that you've never worked as a sysadmin before. You try explaining to your manager that the reason why your company's server will take 3 days to come back online is that you wanted to save a few microseconds of latency when users were accessing files...

      --
      "When the president does it, that means it's not illegal." - Richard M. Nixon
    10. Re:Back when there was only fat16, ntfs, ext2 used by Anonymous Coward · · Score: 0

      Too soon?

    11. Re:Back when there was only fat16, ntfs, ext2 used by Anonymous Coward · · Score: 1, Insightful

      UPS is nice and so on, but how do you prevent crashes introduced by producers like ATI and Nvidia?

    12. Re:Back when there was only fat16, ntfs, ext2 used by Anonymous Coward · · Score: 0

      Someone needs introduction to murphy. New ups, first power failure killed the ups and shutdown the computer. Power supply in system dies, whoops.

    13. Re:Back when there was only fat16, ntfs, ext2 used by Anonymous Coward · · Score: 1, Insightful

      Interesting benchmark from 2003... Reiserfs moves some of the work to your CPU to increase IO performance. Let's see those same benchmarks on a modern system. Unless you're running Linux on a PIII-500, you're better off with reiserfs or XFS.

    14. Re:Back when there was only fat16, ntfs, ext2 used by mortonda · · Score: 2, Insightful

      A decent UPS (pretty much anything made by APC) will prevent the crashes in the first place, solving the problem completely and without any unnecessary overhead. With UPS prices being as low as they are, there is no excuse for not having one, so I think that journalling will become obsolete in some near future.

      While a UPS is certainly a must, it does not protect you from hardware faults completely. Ever have a cap burn out on your motherboard, or lightning strike through your network?

      Or the most irritating one of all, get a static shock through the keyboard that resets the system?

    15. Re:Back when there was only fat16, ntfs, ext2 used by RAMMS+EIN · · Score: 2, Interesting

      ``Believe it or not, the world does not revolve around huge mail servers. Some of us actually run Linux on a desktop, and so don't really care about how well an fs handles a million maildir mailboxes.''

      What if I have large Maildir mailboxes on my desktop system? Or anything else that puts many files in a single directory? Just because _you_ don't need that case to be fast doesn't mean it isn't a good idea to have it be fast, anyway.

      ``Latency is the most important criteria, and reiserfs is just too complicated to deliver it''

      Excuse me? Do you have any numbers to back up that claim? Because I'm having a hard time taking it on face value.

      ``as well as being a largely fringe fs''

      A filesystem that has been included in the mainline Linux kernel for several years, is offered as a prominent choice during installation of various distros, used to be the default fs on some distros, and is widely used by people who make conscious and informed choices about which filesystem to use. But yes, if you want to call it a "fringe fs", go right ahead.

      ``Especially now with Hans gone, it would become even more fringe.''

      This, unfortunately, is all too true. ReiserFS still is a great filesystem in terms of reliability and performance, from tiny files to huge ones, under a wide range of scenarios. Reiser4 was going to be even better: faster and more flexible and extensible, with fast arbitrary attributes and a lot of other goodness. But it never made it into the mainline kernel, and, with Hans Reiser in jail, the future doesn't seem bright for Reiser4. On the other hand, there are various new contenders: ZFS, btrfs, and ext4, just to name a few. None of them seem to be quite there yet, but hey, neither was Reiser4.

      ``Yup, I'd like to have efficient small file handling. But really, it is better to avoid having many small files in the first place. Use compressed archives to store such things; it's quite a bit more efficient''

      Kindly point me at this compressed archive format that lets me fetch files (small and large) by name and other attributes more efficiently than Reiser4 or even ReiserFS. Then please point out how I can use this as I would a filesystem: so that the good old Unix software can access the files. And remember: I need random access to the file contents, and I need to be able to add, remove, write, etc. files. And if any operation is interrupted suddenly and unexpectedly, the integrity of my tree needs to be preserved. Bonus points for full data integrity preservation.

      ``The performance hit from journalling is simply too high to tolerate.''

      Performance hit from journalling? And you're using ext2 to avoid it? Your usage patterns must be very different from mine. True, ext2 running in async mode (i.e. no consistency guarantee at all) is slower than ext3 with journalling which guarantees consistency. On the other hand, with ReiserFS, I can have journalling, guaranteed consistency of at least the filesystem structure, and better performance. Plus, for some strange reason, ext3 seems to lose a lot of files on my systems (although they can be recovered by running fsck) during normal operation. Among the 3, ReiserFS is the clear winner for me. I am not disputing that you may be seeing other data, but let's at least conclude that ext2 is _not_ faster than all journalled filesystems for everyone, and that the performance hit of journalling, if any, is not "too high to tolerate" for everyone.

      ``With UPS prices being as low as they are, there is no excuse for not having one, so I think that journalling will become obsolete in some near future.''

      I think smart people realize that having a UPS is no guarantee that your system will never fail in the middle of a write. So a method to bring the system back to a consistent state is needed in any case. Let's also realize that journalling isn't only for recovery. It is one way to implement transactions, and transactions are useful for more than recovery alone; for example, they can be used to ensure consistency of da

      --
      Please correct me if I got my facts wrong.
    16. Re:Back when there was only fat16, ntfs, ext2 used by Frank+T.+Lofaro+Jr. · · Score: 1

      Why use ext2 for boot?

      ext3 works fine, you can even set data=journal if you want (as I do), by passing the in the rootflags.

      --
      Just because it CAN be done, doesn't mean it should!
    17. Re:Back when there was only fat16, ntfs, ext2 used by Frank+T.+Lofaro+Jr. · · Score: 1

      Great, a file system called "btrfs" supports "tail packing".

      Let's not go there.

      --
      Just because it CAN be done, doesn't mean it should!
    18. Re:Back when there was only fat16, ntfs, ext2 used by Frank+T.+Lofaro+Jr. · · Score: 1

      I have a UPS, and it doesn't help when the kernel crashes.

      For that, you still need ext3.

      With ReiserFS, your system could end up dead.

      --
      Just because it CAN be done, doesn't mean it should!
    19. Re:Back when there was only fat16, ntfs, ext2 used by Frank+T.+Lofaro+Jr. · · Score: 1

      Plus, for some strange reason, ext3 seems to lose a lot of files on my systems (although they can be recovered by running fsck) during normal operation.

      With ReiserFS, Hans himself will tell you where missing items have gone.

      --
      Just because it CAN be done, doesn't mean it should!
    20. Re:Back when there was only fat16, ntfs, ext2 used by IceCreamGuy · · Score: 1

      I think (at least for whatever kernel they were using in '03, when I first tried Gentoo) that ext2 was significantly faster than ext3. They were saying that you could cut some decent boot time by going with ext2, and because of the small size, fsck time was trivial. Even if you ended up with a corrupted /boot because you didn't have a journal, it's really the only partition you can remake easily from scratch.

    21. Re:Back when there was only fat16, ntfs, ext2 used by Anonymous Coward · · Score: 0

      [..blahblahblah..] Gentoo [..blahblahblah..]

      [Insert generic I'm-oh-so-clever snide remark about compile times here]

    22. Re:Back when there was only fat16, ntfs, ext2 used by Chemisor · · Score: 1

      > It's also painfully obvious that you've never worked as a sysadmin before.

      Of course not, and that's my point: the world does not revolve around servers. Some of us use Linux on the desktop, and if the kernel developers don't get their head out of their asses and realize that, the year of "Linux on the desktop" will never come.

      > Wake me up after you've been waiting for fsck to finish on your 1TB
      > drive and it's been running for the last 72 hours.

      If you are a sysadmin and it takes you 72 hours to fsck a 1TB drive, you should be fired.

      > Yeah, because systems never kernel panic, or crash for any other reason than power outages.

      You must be running Windows. Serves you right.
      If you were running Linux, you wouldn't be having these problems. I haven't seen a kernel panic since 1996, and you wouldn't either if you stopped running the daily build from Linus's git tree on crappy hardware from the thrift shop.

    23. Re:Back when there was only fat16, ntfs, ext2 used by lewiscr · · Score: 1

      Do you honestly believe that they're going to risk having their files corrupt on an unexpected power outage for a fraction of a percent increase in meaningful speed?

      Does the file system wave at them from the system tray? Maybe with a cool woobly effect?

      No? Then nobody cares about the FS.

      --
      Still running FVWM. Get your fancy Desktop Environment off my lawn.

    24. Re:Back when there was only fat16, ntfs, ext2 used by Chemisor · · Score: 1

      > The simple fact though is that modern processors and disks are so fast
      > that the minimal speed impact of journaling is barely noticeable. It's
      > certainly not worth giving up over some marginal speed gains.

      If you look at the benchmarks I linked to, you'd see that ext2 is twice as fast as a journalled ext3 for sequential access. I wouldn't call that "barely noticeable".

    25. Re:Back when there was only fat16, ntfs, ext2 used by maestroX · · Score: 1

      I haven't seen ext2 as the default option for the root FS in a very long time.

      USB sticks, or other flash with wear-levelling controller.

    26. Re:Back when there was only fat16, ntfs, ext2 used by compro01 · · Score: 1

      Benchmarks from 5 years ago, when ext3 had only been around (in the kernel) for barely 2 years, compared to ext2, which had been around for over 10. A LOT has been changed in those 5 years. Do you happen to have anything using a recent kernel?

      --
      upon the advice of my lawyer, i have no sig at this time
    27. Re:Back when there was only fat16, ntfs, ext2 used by Chemisor · · Score: 1

      >> Latency is the most important criteria, and reiserfs is just too complicated to deliver it''
      > Excuse me? Do you have any numbers to back up that claim?
      > Because I'm having a hard time taking it on face value.

      You are the one trying to get me to switch file systems, so you are the one who has to provide the numbers. I look at those benchmarks, and ext2 looks like the winner. If you have others, I'll be happy to take a look at them.

      > A filesystem that has been included in the mainline Linux kernel for several years,
      > is offered as a prominent choice during installation of various distros, used to be
      > the default fs on some distros, and is widely used by people who make conscious and
      > informed choices about which filesystem to use.

      Most people just take the default filesystem for their distro, and I don't know of any that have reiserfs as the default. That would meet my definition of "fringe". From this perspective, it would be the height of foolishness for me to write any code depending on tail packing for performance. Speaking as a user, I would probably just choose applications that don't create lots of files. The filesystem is not a replacement for compound files, having been designed for a different purpose.

      > Kindly point me at this compressed archive format that lets me fetch files
      > (small and large) by name and other attributes more efficiently than Reiser4
      > or even ReiserFS.

      I don't know of any. Care to write one? 'Cause I'd sure like to have it. I think you can do this with Kioslaves, but I'm no expert there. My point is that it should exist, because it is the right solution. When you have many small files, that's just bad design. They are hard to keep track of, and there is no reason for them to be individually user-visible in the first place.

      > Then please point out how I can use this as I would a filesystem:
      > so that the good old Unix software can access the files.

      If you want that, you can use FUSE on a loop, but then, of course, Linux would make you mount it first. I was talking about doing it inside the application, transparently to the user.

      > ext2 is _not_ faster than all journalled filesystems for everyone, and that
      > the performance hit of journalling, if any, is not "too high to tolerate" for everyone.

      According to the benchmarks I linked to, ext2 is twice as fast as journalled ext3 for sequential access. I would call that "too high to tolerate". From personal experience, my system have felt noticeably more responsive since I had switched to ext2, so I'm inclined to keep it.

      > I think smart people realize that having a UPS is no guarantee that your system will
      > never fail in the middle of a write.

      Neither is a journal. But consider that the penalty is just an fsck, which on my disks takes about five minutes. I haven't had a power failure crash since I got the UPS a few years ago and I haven't seen a kernel panic since 1996. All in all, I think that performance gains outweigh the negligible "risk" of an fsck. And even if there is data loss, it would likely be very minor, and I keep backups anyhow.

      Now, don't get me wrong, I would probably have been more concerned with fs reliability if I were running a data center. But on my own machine the chances of a disaster are so slight and the potential data loss is so small, that reliability in the face of a system crash is simply not an issue. As I keep saying, I'm not running a server for a Fortune 500 company; it's a whole different world out here, with an entirely different set of concerns, and it would be nice if kernel developers took notice of us once in a while.

      > ReiserFS still is a great filesystem in terms of reliability and performance, from
      > tiny files to huge ones, under a wide range of scenarios. Reiser4 was going to be
      > even better: faster and more flexible and extensible, with fast arbitrary attributes
      > and a lot of other goodness.

      Oh, sure, I w

    28. Re:Back when there was only fat16, ntfs, ext2 used by FunkyRider · · Score: 0

      You must be talking from your ass. Just because you don't understand it and can't use it doesn't mean no one does. A UPS to solve file system corruption problem instead of Journaling or other FS level technique? Close your eyes and tell me this world doesn't exist.

      --
      just wonder why there are so many anonymous cowards in this world....
    29. Re:Back when there was only fat16, ntfs, ext2 used by MentlFlos · · Score: 1

      You mean, like a dead horse?

      Only if you are into bestiality

    30. Re:Back when there was only fat16, ntfs, ext2 used by Anonymous Coward · · Score: 0

      Hell, it probably beats it to death.

      Tasteless.

    31. Re:Back when there was only fat16, ntfs, ext2 used by batkiwi · · Score: 1

      > If you are a sysadmin and it takes you 72 hours to fsck a 1TB drive, you should be fired.

      Tell us how you magically speed up fsck? Or do you mean that you just skip it an manually set the disk as clean, for which you should be fired...

    32. Re:Back when there was only fat16, ntfs, ext2 used by illumin8 · · Score: 1

      Of course not, and that's my point: the world does not revolve around servers. Some of us use Linux on the desktop, and if the kernel developers don't get their head out of their asses and realize that, the year of "Linux on the desktop" will never come.

      Well, regardless of whether the world revolves around servers or not, if the kernel developers can make Linux stable and fast on large servers with 64GB-512GB of memory and 16-32 cores, chances are it'll run pretty well on your desktop as well.

      If you are a sysadmin and it takes you 72 hours to fsck a 1TB drive, you should be fired.

      Try doing an fsck on a large ext2 filesystem with lots of small files and directories. It could take between hours and days, depending on the number of inodes. I've seen some seriously wtf moments when people realized how long it would take their mailserver to come back online, and management was not pleased that they chose to use a non-journaling filesystem. Mind you, this is before ext3 was default and journaling filesystems on Linux were pretty experimental. Of course, at the time, a better choice for mailserver would have been a Solaris box running UFS with journaling, but they thought the poor little Dell running Red Hat with ext2 would be fine.

      You must be running Windows. Serves you right. If you were running Linux, you wouldn't be having these problems. I haven't seen a kernel panic since 1996, and you wouldn't either if you stopped running the daily build from Linus's git tree on crappy hardware from the thrift shop.

      Bullshit. I run Linux on the best HP hardware money can buy, brand new Proliants, and you still get a kernal oops/panic if you have a bad memory DIMM or a failed CPU or something. No operating system is immune to hardware failures.

      --
      "When the president does it, that means it's not illegal." - Richard M. Nixon
    33. Re:Back when there was only fat16, ntfs, ext2 used by drinkypoo · · Score: 1

      It's not reliable because without a journal it needs a fsck after a bad shutdown

      I used to do that, and then I got a UPS instead and switched back to pure ext2.

      Personally, I went with XFS, but I guess it takes all kinds.

      With UPS prices being as low as they are, there is no excuse for not having one, so I think that journalling will become obsolete in some near future.

      Those who forget the lessons of history are condemned to repeat them. Which is to say, people have said ridiculous shit like that before. Computers have gotten more and more complicated all along, "wasting" more and more power... And doing more and more behind the scenes.

      I have a laptop, but I know that my built-in UPS will not protect me from data loss, because Linux is not immune to software problems. My machine will panic every now and then... Running the LTS Ubuntu's release-version kernel, and the release-version nVidia driver (or the beta one.) A UPS does not provide any type of protection in a situation like this one.

      Incidentally, my APC UPS recently died. I have already replaced the battery, but this time the UPS is dead. Let me back up a little, too:

      A decent UPS (pretty much anything made by APC) will prevent the crashes in the first place, solving the problem completely and without any unnecessary overhead. With UPS prices being as low as they are, there is no excuse for not having one

      So uh, my "decent" UPS died the death of a thousand dogs amen, does not prevent software crashes anyway, and right now I can't justify spending the money to replace it. I have more money than a whole lot of people, especially on slashdot. I don't know if you've noticed, but the American economy is seriously fucked. People are skipping their chemo treatments because they can't afford the gas to get the hospital, burning their heirloom furniture to keep their kids warm, shit like that. But even people who can keep the power on and thus use their computers are trying to save their money, and furthermore, might like it if an operating system crash didn't mangle their data.

      You are ignorant, or a troll, or an ignorant troll.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    34. Re:Back when there was only fat16, ntfs, ext2 used by Anonymous Coward · · Score: 0

      If you are a sysadmin and it takes you 72 hours to fsck a 1TB drive, you should be fired.

      Which is why you should journal. Otherwise, the amount of time it takes isn't up to you.

    35. Re:Back when there was only fat16, ntfs, ext2 used by RAMMS+EIN · · Score: 1

      > >> Latency is the most important criteria, and reiserfs is just too complicated to deliver it''
      > > Excuse me? Do you have any numbers to back up that claim?
      > > Because I'm having a hard time taking it on face value.

      > You are the one trying to get me to switch file systems, so you are the one who has to provide the numbers. I look at
      > those benchmarks, and ext2 looks like the winner. If you have others, I'll be happy to take a look at them.

      You misunderstand. I am not trying to get you to switch filesystems, I am asking you to back up your claims. In this
      case, the claim that "reiserfs is just too complicated to deliver [low latency]".

      > > A filesystem that has been included in the mainline Linux kernel for several years,
      > > is offered as a prominent choice during installation of various distros, used to be
      > > the default fs on some distros, and is widely used by people who make conscious and
      > > informed choices about which filesystem to use.
      >
      > Most people just take the default filesystem for their distro, and I don't know of any that have reiserfs as the default.

      According to Wikipedia:

      ``ReiserFS is the default file system on the Elive, Xandros, Linspire, GoboLinux, Kurumin Linux[citation needed], and Yoper Linux distributions. ReiserFS was the default file system in Novell's SUSE Linux Enterprise until Novell decided to move to ext3 on October 12, 2006 for future releases.''

      > That would meet my definition of "fringe".

      I won't argue against that.

      > > Kindly point me at this compressed archive format that lets me fetch files
      > > (small and large) by name and other attributes more efficiently than Reiser4
      > > or even ReiserFS.
      >
      > I don't know of any.

      And yet, you said:

      ``Yup, I'd like to have efficient small file handling. But really, it is better to avoid having many small files in the
      first place. Use compressed archives to store such things; it's quite a bit more efficient''

      If you claim that compressed archives are more efficient for storing many small files, you had better actually know
      at least one compressed archive format that actually is more efficient. Otherwise, you would just be making an
      unfounded claim. Again, I ask you to back up your claims.

      > Care to write one? 'Cause I'd sure like to have it. I think you can do this with Kioslaves, but I'm
      > no expert there. My point is that it should exist, because it is the right solution.

      I disagree. There is no reason that having many small files should be as slow as some file systems make it. Finding an
      alternative way to store the data may solve the performance problem, but it increases complexity and runs the risk of
      making standard utilities useless for your chosen storage format. Given that it _is_ possible to store and retrieve the
      small pieces of data efficiently, why don't we just make the filesystem do it?

      > When you have many small files,
      > that's just bad design. They are hard to keep track of, and there is no reason for them to be individually user-visible in
      > the first place.

      So you say. I think that claim may be true in some scenarios, but certainly not in all scenarios. I _like_ the units of
      data that I think about to coincide with files; this allows me to use tools that manipulate files to manipulate a lot
      of things on my computer: documents, programs, configuration files, messages, etc. I don't like the idea of having to
      use specialized tools for each of these. If anything, I would try to _extend_ the number of things I can apply my
      standard tools to, rather than shrink it by hiding many objects in a single unit.

      > > Then please point out how I can use this as I would a filesystem:
      > > so that the good old Unix software can access the files.
      >
      > If you want that, you can use FUSE on a loop, but then, of course, Linux would make you mount i

      --
      Please correct me if I got my facts wrong.
    36. Re:Back when there was only fat16, ntfs, ext2 used by MBGMorden · · Score: 1

      No? Then nobody cares about the FS.

      Specifically? No. But they care whether or not their computer "works". As such they care about the filesystem, albeit indirectly. Between "computer works, but slightly slower" and "computer is slightly faster but might not boot back up if it crashes at the wrong time", users will prefer the first. It might not be a conscious decision, but they will still flock to the platform that has a reputation for not breaking down on a crash, and that platform will be one that has a journaling FS. They're not exactly a fad here. Every serious OS in the last 10 years has been shipping with a journaling FS. There's a reason that's so.

      --
      "People who think they know everything are very annoying to those of us who do."-Mark Twain
    37. Re:Back when there was only fat16, ntfs, ext2 used by lewiscr · · Score: 1

      Between "computer works, but slightly slower" and "computer is slightly faster but might not boot back up if it crashes at the wrong time", users will prefer the first.

      I think we're talking about different users here. Most of the users I'm thinking of would pick the slightly faster machine. That's something they can see and feel immediately. How many people re-install windows after it gets so hosed it won't boot anymore? That's the kind of user I'm thinking of.

      If everybody made informed rational decisions, they would pick good filesystems, mortgages and presidential candidates. Until this shining utopia arrives, I'm glad we protect the apathetic from themselves by default. I'm wholly in support of Journaled FS, and wouldn't run non-journaled on any rw mount point (well, maybe a USB drive). There are some good for-pay fully journalled Filesystems that are faster than ext2. I've used Veritas FS, and really wish I could use it for / and /boot too. From what I understand, JFS and VxFS have a lot in common.

    38. Re:Back when there was only fat16, ntfs, ext2 used by Chemisor · · Score: 1

      > Just because journalled ext3 is slower than unjournalled ext2 doesn't mean that
      > journalling in general imposes an intolerable performance hit.

      Journalling, by its very nature, implies turning every write into two writes. Furthermore, you have to add various hacks to avoid problems from out-of-order writes, to determine when the cache actually gets to the physical disk, and I am sure many other little things. I am not an fs designer, having learned everything I know about journaling from wikipedia, so I am not going to speculate whether there are ways to implement these things efficiently. The only thing I am certain of is that an unjournaled filesystem will always be simpler and faster than a journaled one, if only by a little bit.

      > But one advantage of having a journal is that you can maintain consistency
      > even if you suffer a failure in the middle of a write.

      This is relevant and interesting if you are running a bank. It is completely uninteresting if you spend your day writing text files and compiling them, like I do. The most I can possibly lose in a crash is five minutes of work, and the likelihood of the crash is so low, as to be beyond consideration. Since the magnitude of the disaster journaling averts is miniscule in my case, performance considerations far outweigh it, and I would always try to turn journaling off and trade it for performance and free memory that would save me time in the edit-compile-test cycle. That is my whole justification, and I see it as a damn good one.

      > If you claim that compressed archives are more efficient for storing many small files, you had
      > better actually know at least one compressed archive format that actually is more efficient.
      > Otherwise, you would just be making an unfounded claim. Again, I ask you to back up your claims.

      What sort of foundation would you like to hear? After all, if a particular compressed archive API does not exist, I can not give you benchmarks for it, can I? Nevertheless, I would consider it obvious for the simple reason that compressing a hundred files into an archive will always take less space than a hundred uncompressed files, no matter how efficiently they may be aggregated. While you can compress the entire filesystem, that is usually not a good idea, because there is no generic way to determine which files should be compressed and which ones should not. The only way I can think of doing it is to collect access patterns on each file and compress the ones that are read more often than written. Unfortunately, the access log has to be stored somewhere, and will naturally wipe out any savings you may get on your 4 byte files :)

      > I _like_ the units of data that I think about to coincide with files; this allows me to use tools
      > that manipulate files to manipulate a lot of things on my computer: documents, programs, configuration
      > files, messages, etc. I don't like the idea of having to use specialized tools for each of these. If
      > anything, I would try to _extend_ the number of things I can apply my standard tools to, rather than
      > shrink it by hiding many objects in a single unit.

      > Yes, exactly. You are proposing storing things in such a way that only that one application can access them. I don't
      > like that idea at all, for the reasons I have pointed out above.

      First of all, I do not think that allowing the user to muck with application internal files with "standard tools" is a good idea. Just as I encapsulate private implementation of a class, making it inaccessible to the code that uses it, likewise I would seek to encapsulate the private details of an application and "hide" them from the user. That does not mean that I would go out of my way to prevent the user from manipulating it, only to hide unnecessary details and clutter from his filesystem. Packaging all the application's files into a suitable collection of archives would be very appropriate. Not only is this neat and space-efficient on every fi

    39. Re:Back when there was only fat16, ntfs, ext2 used by RAMMS+EIN · · Score: 1

      > Journalling, by its very nature, implies turning every write into two writes.

      First of all, that isn't necessarily true; I think Log-structured Filesystem) is one example of a filesystem that uses a journal, yet does not write everything twice. It also depends on how much consistency you want to maintain: many journalled filesystems have the option to journal structure, but not data. Secondly, extra writes don't necessarily cause a performance hit. It all depends on the usage patterns. If the writes can be performed when the disk would otherwise be idle, no performance is lost.

      > > But one advantage of having a journal is that you can maintain consistency
      > > even if you suffer a failure in the middle of a write.
      > This is relevant and interesting if you are running a bank. It is completely uninteresting if you spend your day > writing text files and compiling them, like I do.

      Right. Different jobs, different tools.

      > > If you claim that compressed archives are more efficient for storing many small files, you had
      > > better actually know at least one compressed archive format that actually is more efficient.
      > > Otherwise, you would just be making an unfounded claim. Again, I ask you to back up your claims.
      > What sort of foundation would you like to hear?

      Anything that justifies your claim of compressed archives storing many small files more efficiently than any filesystem.

      > I would consider it obvious for the simple reason that compressing a hundred files into an archive
      > will always take less space than a hundred uncompressed files, no matter how efficiently they may
      > be aggregated.

      So you meant "space-efficient". I thought we were talking about time-efficiency. Ok, fair enough. But what if the filesystem compressed the files, as well?

      > While you can compress the entire filesystem, that is usually not a good idea, because there is no
      > generic way to determine which files should be compressed and which ones should not.

      True, but the same goes for archive files, of course. Any metric that the archiver uses to determine which files should be compressed, the filesystem can use, too.

      I think I see what you're getting at, though. The filesystem exposes a certain programming interface. Traditionally, this interface does not give applications a lot of control. For example, the application might know which files should be compressed, and which ones shouldn't, but the VFS does not typically allow this information to be communicated to the filesystem. A purpose-built storage system specific to the particular application could, of course, incorporate anything and everything that might be useful.

      > First of all, I do not think that allowing the user to muck with application internal files with
      > "standard tools" is a good idea.

      Well, see, generalizations. In some cases, there is no significant gain to being able to use standard tools to manipulate some data. In that case, feel free to lump all that data together in a single file that only your application knows how to deal with. But that wasn't the case I was talking about. I was talking about the case where you have pieces of data that are meaningful to the user, like individual emails. I thought that was what you were talking about, as well. I'm certainly not proposing every byte of the email should be in its own file. But I can see where having a one-to-one correspondence between emails and files, and perhaps even email parts to files, or even email headers to files, would be useful. You seem to be arguing I shouldn't want to do this, because filesystems are bad at dealing with such small parts - I say, filesystems can be made to be good at this, let's do it! And _then_ we can decide if having a file per email header is a good idea or not.

      > you aren't planning to tell all your users that they must use ReiserFS to use your ap

      --
      Please correct me if I got my facts wrong.
    40. Re:Back when there was only fat16, ntfs, ext2 used by Chemisor · · Score: 1

      > Secondly, extra writes don't necessarily cause a performance hit. It all depends
      > on the usage patterns. If the writes can be performed when the disk would
      > otherwise be idle, no performance is lost.

      That's what the disk cache already does for you. You are still making two writes instead of one, so no matter how long you decide to postpone it, it will catch up with you eventually. Not only that, but you are increasing the interval between the time you finish writing and the time when the data gets to the disk by a factor of two, thus increasing your chances of catching a hardware failure in it. I bet it also makes making meaningful benchmarks incredibly hard. Oh well. I'll just drop this subject now due to lack of information.

      > So you meant "space-efficient". I thought we were talking about time-efficiency.

      On the disk, space efficiency is time efficiency. The bottleneck of the io system is on the physical disk surface, and the less of it you use, the faster you can get the data to you. Any form of compression should give significant benefits, especially with today's fast CPUs.

      > True, but the same goes for archive files, of course. Any metric that the archiver
      > uses to determine which files should be compressed, the filesystem can use, too.

      I was not talking about the archiver; I was talking about you. The difference is that with an archive, the programmer or the user decide which files go together, which files should be compressed, and which files are not very important. This is not a decision that can be made automatically because it depends on the contents of the files and the meaning of that contents to whoever needs it. I don't see the computer being able to make this decision in the general case, barring significant advances in AI.

      > For example, the application might know which files should be compressed, and which
      > ones shouldn't, but the VFS does not typically allow this information to be communicated
      > to the filesystem. A purpose-built storage system specific to the particular application
      > could, of course, incorporate anything and everything that might be useful.

      Well, yes, you could go this far, and build a specialized backend for every application. Heck, that's what everyone has been doing for decades, with proprietary file formats. What I'm suggesting is a standardized general compound file API, much like the OLE compound documents on Windows, only not quite as ugly and bloated. Applications already write their own data formats, and that will continue, but we are missing an easy way to package multiple related files together.

      > You are saying that adding at least one additional API and updating all the
      > standard utilities is not a significant complexity increase. I beg to differ.
      > This would be changing the foundations on which Unix is built.

      "The foundations of Unix" are not sacred. They are just ossified into the OS. That does not mean they are perfect, only that we are used to them. A VFS extension, IMO, is not all that hard in principle, and it will not in any way change your "foundations". All it will do is permit you treat compound files like directories. The basic capabilities otherwise stay the same.

      > But that wasn't the case I was talking about. I was talking about the case where
      > you have pieces of data that are meaningful to the user, like individual emails.
      > I thought that was what you were talking about, as well.

      No, what I had in mind specifically was the application internal data files. Look in the various subdirectories under /usr/share and you'll find plenty of tiny little files that ought to be aggregated. Another example is config files. I've seen proposals to create a registry-like storage where each value is a file! Sure, throw it in and let the filesystem figure it out. Right...

      Saving a document into a multiple files is another bad idea. Database packages, like, say ingres, do that. A documen

  43. Re:BTRFS? REALLY? by tomatensaft · · Score: 1

    Couldn't they come up with a better name than "BuTteR FaSe?" I know I can't be the only one who read it like that. Call it anything but that.

    Missed the point! It's BTR FS, the bullet-proof filesystem for Linux!

  44. Reiser has time and no need to work by r00t · · Score: 3, Funny

    They feed him. They put a roof over his head.
    They even bathe him.

    He might as well devote himself to filesystems.

    1. Re:Reiser has time and no need to work by deniable · · Score: 1

      A new variant of the BSD advertising clause:

      This product includes software developed by California Prison Inmates.

    2. Re:Reiser has time and no need to work by standbypowerguy · · Score: 2, Interesting

      Jail is supposed to be punitive & reflective, not fun or interesting. There are plenty of worthwhile jobs in prison... laundry, cook, librarian, janitor, license plate stamper, etc.

      --
      This isn't the sig you're looking for... Move along.
    3. Re:Reiser has time and no need to work by r00t · · Score: 1

      At a typical jail in the USA, inmates spend their
      days playing video games and lifting weights.
      This is so they can be anti-social and strong.

      (actually true)

    4. Re:Reiser has time and no need to work by CarpetShark · · Score: 1

      Agreed on reflective, and not fun. But punitive and uninteresting? I'd say not. To me, those ideas seem to come out of faults in our justice system -- the corruption of the system to satisfy people who want revenge, rather than to ensure a peaceful, healthy, harmonious society.

      It seems to me that, if someone can be a useful, contributing member of society by working on code, then prison should encourage that, not discourage it. The only caveat I'd add is that coding is a corner case, given its relatively unsociable workflow. But that's more of a general problem for the IT industry, than a problem for the justice system.

  45. What about Tux3 by obi · · Score: 2, Interesting

    While btrfs looks quite cool, I'm even more interested to see whether http://tux3.org/ will go anywhere. Let's hope both will materialise and mature soon.

    1. Re:What about Tux3 by kwalker · · Score: 1

      I checked that out last night, and I'm sort of confused about it. Apparently Tux2 was scuttled by patents, and maybe that's got the information I need, but when I tried to see what features Tux3 will have, I wasn't able to find anything that made much sense.

      Apparently the main benefit of tux3 is versioning. Great, wonderful. That'll work well on the big company MSA but what about anything else?

      --
      ... And so it comes to this.
    2. Re:What about Tux3 by obi · · Score: 1

      Maybe this article will shed some light: http://kerneltrap.org/Linux/Comparing_HAMMER_And_Tux3

  46. layering hurts non-ZFS by r00t · · Score: 0, Flamebait

    It hurts performance. You're doing RAID/LVM stuff
    on disk blocks that could be empty. Wasting effort
    on unused bits is stupid.

    It endangers data. RAID/LVM stuff isn't normally
    transactional. (there is no journal or log)
    You can handle booting with a dead disk, and you
    might be able to handle a disk dying at some
    arbitrary point in time, but you can't handle
    disks that mangle your data in interesting ways.

  47. Re:when ext4 is feature complete it will be the #3 by waba.be · · Score: 1

    Do you have any reference regarding the LVM/EVMS issues? I'd be interested to find out more about that.

  48. Files-11 on RSX by AliasMarlowe · · Score: 1

    If you want a versioning file system, go use VMS for a while. They've been doing it since at least the early '90s. Not sure if ADVFS has that feature or not, never got a chance to use Tru64.

    Actually, the versioning file system FILES-11 originated in 1980 or perhaps earlier. At least, I recall that it was standard on RSX systems in 1980, and supported file versioning (albeit with wierd "directory" organization: UIDs like [310,54] were used instead of directories).

    File names could have a semicolon and version number attached, or if no version was specified then the highest version was implied. This led to the need to clean up old versions regularly, with a command such as:
    PIP LB2:[310,54]*.TSK/PU:3
    which would purge all but the three highest versions of each task file in [310,54] on the library 2 device. More explicitly, single file versions could be deleted with a command such as:
    PIP LB2:[310,54]F11ACP.TSK;4033/DE
    which would delete only version 4033 of the named file.

    The PIP commands acquired saner aliases in VMS, and the organization of the file system was somewhat improved. But the versioning was there already in 1980 in FILES-11 under RSX-11.

    --
    Those who can make you believe absurdities can make you commit atrocities. - Voltaire
    1. Re:Files-11 on RSX by jonadab · · Score: 1

      > The PIP commands acquired saner aliases in VMS, and the organization of the file system was somewhat improved.

      And the pathname syntax became even more complicated.

      --
      Cut that out, or I will ship you to Norilsk in a box.
  49. Re:when ext4 is feature complete it will be the #3 by Jah-Wren+Ryel · · Score: 5, Interesting

    The weakness with linux is in the LVM or EVMS layer. They both suck in that they are not enterprise ready (ie multi TB filesystems, 100+ MB/s sustained read/write) in that they cause unexplained IO hicups, lockups and kernel panics. LVM/EVMS certainly work fine for Joe Blow's HTPC, or a paltry 100GB database but they fall down when under serious load.

    LVM has been rock-solid for me with a ~7TB and 2 2TB ext3 filesystems (24 500GB disks) over the course of a year and a half. No problems migrating extents all over the place when I needed to swap disks in and out. Almost identical to HPUX in functionality, but without the sizing constraints.

    But, when I tried xfs for kicks I found out that a 7TB filesystem means you need 7GB of RAM to fsck it - impossible on a 32-bit system, I also had a week where I it all went in the shitter because I ran free-space to zero and started getting OS panics and data corruption.

    I'm definitely considering jfs for the next generation, my main complaint with ext3 has been ridiculously slow deletes and fsck's. Problems I have read don't exist with jfs.

    --
    When information is power, privacy is freedom.
  50. My filesystem wish... by shic · · Score: 1

    While I do see the point in file-system research and advancement, to me the usefulness of a 'better' hierarchical file system is limited by the traditional API used to access the data stored thereon.

    I'd like to efficiently store data direct to a block device in a different structure. I want to have support for (multi-user) transactions and to replace the concept of a 'file' with a mapping from one (arbitrarily long) sequence of (arbitrary) bytes to another... and to be able to read back pairs of byte-sequences by specifying domain ranges.

    From a low-level implementation perspective, I can't imagine that this is wildly different to a hierarchical file system with efficient directories (relaxed to allow the zero byte as part of a filename) but I can't find any work that has been done on such a facility.

    If such an alternative "filesystem" were to be implemented and have a reliability on a par with existing linux filesystems... it could revolutionise data processing.

    If anyone is aware of such a project, I'd be very interested.

    1. Re:My filesystem wish... by MikeBabcock · · Score: 1

      Look up the data storage format used on the old Newton OS by Apple. It was a stream object-oriented data structure, quite neat personally.

      --
      - Michael T. Babcock (Yes, I blog)
    2. Re:My filesystem wish... by shic · · Score: 1

      While I've never used the Newton "Soup" data storage, that's certainly a similar sort of thing, from a conceptual perspective - at least. The closest thing I found to it was with the Palm Pilot 'database' format.

      The distinction with what interests me is that I want this sort of a facility on server-class hardware, backed by terabytes of disk - with multiple multi-core processors accessing it (for both read and write) in real-time. I don't want to use a DBMS; I don't want to use an ISAM layer on top of my filesystem... I want to have low-level support for persistent associative arrays from binary strings to binary strings... with minimal restrictions.

      It seems an obvious facility to want... but (and I've tried) it is harder to implement than it, at first, sounds. A lot of the challenges are identical to filesystems, however.

  51. Re:gnaa by ciderVisor · · Score: 0

    Maya Rudolph uses Linux.

    --
    Squirrel!
  52. Re:BTRFS? REALLY? by Curtman · · Score: 1

    It's a good analogy for FOSS after all. Great software, robust and all, but her face...

    Was a good analogy... Maybe.

    Compiz shot us way out front in the looks department.

  53. Re:when ext4 is feature complete it will be the #3 by QuoteMstr · · Score: 1

    xfs is useful, but xfs should be used with a UPS. Sure, the filesystem metadata is safe on power failure, but files with outstanding uncommitted blocks at the time may be zeroed out when the system comes back up.

    The exact failure mode involves a file's metadata being updated before its data blocks are committed. You hit this problem frequently when doing the write-new-file-then-atomic-rename dance if you don't fsync the new file before the rename; most applications neglect that technically-required fsync because things just work on non-xfs filesystems.

    In other respects, xfs is excellent. I just wish it were more robust against dataloss.

  54. Re:GNAA! by Anonymous Coward · · Score: 0

    there needs to be a "-1: Fail" moderation

  55. Re:when ext4 is feature complete it will be the #3 by discord5 · · Score: 1

    7TB filesystem means you need 7GB of RAM to fsck it

    7GB of swap will do, it doesn't need to be RAM. And yes, 32-bit is a no go.

  56. Re:when ext4 is feature complete it will be the #3 by YoungHack · · Score: 1

    I've switched to jfs on pretty much all of my systems and I've had very little reason to complain. I had been using xfs, but saw fs corruption in several instances.

  57. buttfsck!! by Zaiff+Urgulbunger · · Score: 5, Funny

    You think that's bad? The file system check command is buttfsck!

  58. Something Linux has to leave behind... by argent · · Score: 1

    One of the things that we do in order to strongly discourage out of tree drivers/filesystems is that we constantly make changes to the API, without regard to making life easy to the out-of-tree kernel module. Most of the time the changes are justified --- although sometimes people have suspected that some changes made had benefits that were so marginal that it seemed that the main justification was to screw over externally maintained drivers/filesystems. -- Theodore Tso mit.edu>

    This kind of thing has been a huge obstacle to Linux on the desktop. The lack of a stable kernel API tells hardware manufacturers to stick with Windows. Even the past few years, with the introduction of Vista, has been less disruptive than every year is on Linux.

    1. Re:Something Linux has to leave behind... by perbu · · Score: 1
      Wrong.

      If you want stability stick to a Linux distribution - Ubuntu LTS and Red Hat Enterprise Linux both have stable APIs with a 5 and a 7 year lifespan. Desktop users have no business compiling their own kernels anyway.

      The lack of a stable API allows Linux to move forward unhindered by old cruft.

    2. Re:Something Linux has to leave behind... by argent · · Score: 1

      Hi there, Mr "Desktop users have no business compiling their own kernels anyway.", meet Mr "I just want to support Linux with my Pornmaster Digital Dildo". To do this, Mr D needs to be able to provide a driver for it that he can ship on the support CD in the Pornmaster single unit package, and the Pornmaster Bedtop Deluxe. And Pornmaster customers expect their teledildonics investment to work on their old hardware. So what does he build he drivers for, Mr K? Debian Stable Cobweb Edition, or Ubuntu Shiny Ferret, or the new Horny Hobbit distro? It's got to work on all of them.

      The problem is precisely BECAUSE desktop users have no business compiling their kernel that the kernel APIs have to be stable, so that users can expect to buy a new teledildonics device and install it in their 5 year old Lindows box and just have it work, just like they can do with their Windows XP laptop that their nephew bought them for their birthday in 2002.

      The lack of a stable API allows Linux to move forward unhindered by market share.

    3. Re:Something Linux has to leave behind... by Anonymous Coward · · Score: 0

      The lack of a stable kernel API tells hardware manufacturers to stick with Windows.

      The only reason any hardware manufacturer would care is if they intend to keep the source closed and be the only maintainers of the driver. Otherwise, they can make the driver, release the source with some specs, and if they don't want to deal with following the API changes, others will take care of it.

      Personally, I vote for having as many obstacles to closed source bullshit in linux as possible.

      And before you reply that people like me are "a huge obstacle to Linux on the desktop" I'll let you know that there are other operating systems that work great on the desktop. I have a mac, and even windows is fine since XP. The only reason to want a linux OS desktop over the other OSes is for the openness, and giving that up in order to make it more popular completely defeats the point.

    4. Re:Something Linux has to leave behind... by argent · · Score: 1

      The only reason any hardware manufacturer would care is if they intend to keep the source closed and be the only maintainers of the driver.

      That's just the tip of the iceberg.

      See, most hardware manufacturers don't REALLY care about the source to their drivers... but they DO need to be able to ship updated drivers to patch firmware problems, and ship drivers in the box with a new product without telling potential users they need to update to an experimental kernel to use it. So whether they ship source or not they need stable APIs.

      The only reason to want a linux OS desktop over the other OSes is for the openness,

      You're selling Linux short. I'm not even a big fan of Linux... I'm one of the guys who was already working on the other free UNIX before there was such a thing as Linux out there... and even I think it's got a lot of potential that people making a political statement about *one* of the kinds of openness are unfairly limiting.

  59. Re:when ext4 is feature complete it will be the #3 by jez9999 · · Score: 1

    a 7TB filesystem means you need 7GB of RAM to fsck it

    That's what SHE said!

  60. Re:BTRFS? REALLY? by Quarters · · Score: 1

    How about something really catchy and descriptive like, "Ogg VobisFS"

  61. 1982 called and wants it's product name back by SpinyNorman · · Score: 1

    Well, almost.

    Anyone else remember Novell Btrieve (B-tree based database library)?

    The scary thing about Btrieve was that each month the rate of growth of the known/fixed bug list seemed to be increasing, although the product itself seemed pretty stable in our use at least.

    1. Re:1982 called and wants it's product name back by etwills · · Score: 1

      [Novell Btrieve (B-tree based database library)]: The scary thing about Btrieve was that each month the rate of growth of the known/fixed bug list seemed to be increasing, although the product itself seemed pretty stable in our use at least.

      But presumably you could search the list *really* quickly!

    2. Re:1982 called and wants it's product name back by Wowlapalooza · · Score: 1

      No Child Left Behind called and wants its punctuation lessons back...

      (Sorry, pet peeve. Grrr).

  62. Re:BTRFS? REALLY? by h4rm0ny · · Score: 1, Insightful


    More tarty make-up, than actual looks, imo. Looks is a clean intuitive interface the responds quickly. Wobbly window frames and cubic desktops are just lipstick and eyeshadow. i.e. attention-seeking flash. Of course that's what some go for and it's good to be able to apply it if you want it. But I would liken it to real looks.

    --

    Aide-toi, le Ciel t'aidera - Jeanne D'Arc.
  63. Re:BTRFS? REALLY? by V!NCENT · · Score: 1

    BeTterFS? Better filesystem? I don't know how much you've been drinking but... ;)

    --
    Here be signatures
  64. ButterFS by Keeper+Of+Keys · · Score: 2, Funny

    I can't believe it's not better.

  65. Re:BTRFS? REALLY? by RavenChild · · Score: 0

    To fix problems in the filesystem, would you have to run ButtFSck?

  66. fs-driver by FunkyELF · · Score: 1

    anyone happen to know if the driver from fs-driver.org will work with ext4? It works with 2 and 3

    1. Re:fs-driver by EvilRyry · · Score: 1

      Nope. The ext3 driver cannot mount ext4 volumes after extents have been enabled.

  67. Re:BTRFS? REALLY? by not+already+in+use · · Score: 2, Funny

    Exactly. I couldn't even imagine where Linux would be right now if it weren't driven by a bunch of egotistical nerds clamoring for their own implementation of something rather than incorporating someone else' extremely capable and far more mature existing implementation.

    --
    Similes are like metaphors
  68. Re:BTRFS? REALLY? by David+Gerard · · Score: 1

    But crikey, goes all night without stopping. None of that Windows "I'm so sorry, this has never happened to me before, honest."

    --
    http://rocknerd.co.uk
  69. MOD PARENT UP by David+Gerard · · Score: 1

    I just choked.

    --
    http://rocknerd.co.uk
  70. Re:BTRFS? REALLY? by staeiou · · Score: 1

    All I want to know is if I can fsck this BtrFS.

  71. Re:BTRFS? REALLY? by aevans · · Score: 1

    For me, it's "do I have to fsck her all the time?"

  72. All hardware can fail, including UPSes. by Medievalist · · Score: 5, Insightful

    I used to do that, and then I got a UPS instead and switched back to pure ext2. The performance hit from journalling is simply too high to tolerate. A decent UPS (pretty much anything made by APC) will prevent the crashes in the first place, solving the problem completely and without any unnecessary overhead. With UPS prices being as low as they are, there is no excuse for not having one, so I think that journalling will become obsolete in some near future.

    Our industrial UPS (which is orders of magnitude more reliable than any APC product ever made) recently exploded, burnt, and shorted out the entire building's power. It spiked thousands of volts through the protected equipment and destroyed a half-dozen servers. The fire was fierce enough to cause our fm200 system (halon equivalent) to dump, which put out the fire before the main battery bank was breached.

    This was the first time I've ever seen an UPS bigger than a Chrysler fail, but I've seen dozens of failures from those crappy little APC units. At one time I had a stack of burnt-out ones in my basement (I used to salvage the batteries for cash).

    If your disaster survivability plan depends on any single piece of hardware never failing, it's no good. Offsite backup is your friend.

    1. Re:All hardware can fail, including UPSes. by Frank+T.+Lofaro+Jr. · · Score: 1

      That's scary. If that happened in a home, it is very likely the structure would catch fire.

      I wouldn't want to buy from that brand.

      Stuff breaks, but it should fail safe.

      I had a ViewSonic monitor die.

      No flames.
      No smoke.
      No sparks.

      Just wouldn't turn on (power light would only go to yellow, like there was no signal).

      --
      Just because it CAN be done, doesn't mean it should!
    2. Re:All hardware can fail, including UPSes. by Medievalist · · Score: 1

      Monitors can die pretty spectacularly too, but usually don't. Nowadays they even have overfrequency protection (so you can't make them catch on fire with a bad XF86Config file) and the flybacks are jacketed in hard plastic instead of rubber, which cuts way down on the fire and electric shock hazard.

    3. Re:All hardware can fail, including UPSes. by Chemisor · · Score: 1

      > Our industrial UPS (which is orders of magnitude more reliable than any APC
      > product ever made) recently exploded, burnt, and shorted out

      Well, I admit, that if my building burns down, I just might have to suffer through the inconvenience of running fsck on my filesystems for ten minutes or so, while you with your journalled fs will be able to restart in less than a minute. Call me a fool, but I think I just might be willing live with such a horrible risk.

    4. Re:All hardware can fail, including UPSes. by Cramer · · Score: 1

      I saw an Exide Powerware (100kva?) system do that. One of it's many batteries exploded. *boom* exploded. Datacenter went dark. It was the only time I've ever seen a populated datacenter silenced. (I've seen plenty of *empty* silent datacenters.) Exide was there within the hour to fix it. (i.e. inspect and replace every battery.) I've seen many smaller (rack mount) UPSen melt/boil their batteries. I've seen a transfer switch convert itself into a pool of metal and chared plastic -- that was a Good Day(tm). (and many more...)

    5. Re:All hardware can fail, including UPSes. by Cramer · · Score: 1

      Indeed it would. It's not so much the fault of the UPS as the battery itself. A lot of people are more interested in cheap than good.

    6. Re:All hardware can fail, including UPSes. by Medievalist · · Score: 1

      We lost hard drives and power supplies in multiple servers. I've seen the same thing happen with an APC branded small UPS in Baton Rouge, except that there it fried the PC motherboard too.

      All UPSes wear out. A significant fraction spike the output side when they fail.

      But of course, your data might not be important, in which case you're making a good tradeoff to get more performance. I don't even have UPSes on my family's desktop machines at home, because it's not worth it - we back up everything important to the server. I do have a 30A UPS on the server, though.

  73. Re:when ext4 is feature complete it will be the #3 by jabuzz · · Score: 1

    Er, that limitation of XFS has long been fixed. Now 4GB of RAM is good for something like a 100TB file system. If you have a larger file system then it would not be unreasonable to have more RAM.

    The only issue with XFS is the inability to shrink it at the moment.

  74. Re:when ext4 is feature complete it will be the #3 by drfireman · · Score: 1

    You hit this problem frequently when doing the write-new-file-then-atomic-rename dance if you don't fsync the new file before the rename; most applications neglect that technically-required fsync because things just work on non-xfs filesystems.

    I don't know much about filesystem design, but isn't this just a bug in the journalling? Does it apply equally to all of the reasonably recent versions of xfs?

    As an application developer, I'm certainly sympathetic to anyone who avoids fsync in certain situations -- depending on the details, it can certainly make things more sluggish. How much responsiveness to trade for how much security is a pretty individual thing.

  75. Butter FS by Amazing+Quantum+Man · · Score: 1

    I understand that Bennigans' will be changing all their servers over to Butter FS.

    --
    Fascism starts when the efficiency of the government becomes more important than the rights of the people.
  76. How is JFS these days anyway? by Anonymous Coward · · Score: 0

    It seems everyone has different experiences with different filesystems. My experience is:

    - ext2/ext3 : Utterly failed to work on a 1 TB filesystem I was setting up. Would not even format. Doesn't matter anyway because the thought of running fschk on a volume that size is not appealing. On the "normal" sized setups I have had numerous filesystems totally obliterated due to corruption (and fschk often just makes the problem worse). Nowadays I only use it for boot partitions which are basically read-only.

    - ReiserFS3 : File corruption like mad. System freeze/crash, accidentally hitting reset switch, etc. anything like that caused many files to get corrupted. Sometimes just rebooting a machine would cause stuff to get corrupted. This was well after (years) it had been fixed up and was "stable". I gave ReiserFS a long tryout for several years but it really did suck.

    - JFS : Unstable (kernel panics, etc). I have tried it a number of times over the years and it never seemed to get better. It has been probably 3 or 4 years since I last tried it though. Seemed very unstable every time.

    - XFS : In the old days it used to use a lot of CPU/RAM but nowadays this is my primary FS. Fast, handles large filesystems no problem, no file corruption at all even in the worst system failures I have seen (including hardware failure). Lots of features and tools (but not perfect). I do have some gripes... Although it's better than ext3 in every way, it's still a bit slow when deleting extremely large files or large directories of files. Also, when the filesystem is close to being full the performance degrades rapidly. And finally, it's annoying that boot utilities can't work off an XFS partition (this one is stupid in my mind, a single tiny tweak to the spec would fix this).

  77. Re:when ext4 is feature complete it will be the #3 by MrNemesis · · Score: 1

    We have an ~8TB JFS array used for shared storage of databases on our linux boxes, as well as for another smaller arrays used for VM's and general file serving duties. Bitchin' fast even with small files, no extra latency incurred by using ACL's as you get with ext3, very fast to format and fsck (ever tried this on a 2TB ext3 array? Painful!), lowest CPU utilisation of any filesystem we tested, frags up alot less than ext (yay for extent-based filesystems), ridiculously quick deletes, equal to XFS in speed at dealign with large files with less overhead at the same time. Only real disadvantage is that it doesn't support shrinking, but that wasn't considered a big downside for us.

    Never tried LVM on anything that big though, ours is just stacked hardware RAID10 stripes across a starburst of cheap SATA drives. Use it on a 3TB LVM at home though, not had any problems with it - but that's comparatively zero load.

    It wasn't popular initially since it wasn't supported by Redhat, but we have less issues with the JFS partitions than we've ever had with the ext3 ones (yeah yeah, famous last words ;)) - besides, we use CentOS boxes for these anyway (yeah yeah, more famous last words...).

    Most underrated filesystem available for Linux IMHO, and those of you who've used it on AIX will already know its awesomeness and how comparatively gimped the linux version really is :) Bottom line: we use JFS on any disc-based IO that's performance critical, and ext3 everywhere else in order to attempt compliance with corporate support ;)

    --
    Moderation Total: -1 Troll, +3 Goat
  78. Re:when ext4 is feature complete it will be the #3 by jabuzz · · Score: 1

    I want to know who in their right mind would even trust ext4 the brand new kid on the block over say JFS/XFS that have been round the block many times.

    ext4 is frankly an utter waste of time. The whole ext2/3/... series is well past it sell by date and going for another upgrade is a pointless waste of effort, that would be better spent fixing any issues with JFS/XFS (the primary one being the inability to shrink them).

  79. Re:BTRFS? REALLY? by Hal_Porter · · Score: 1

    Bitter yeah. But it never killed anyone.

    --
    echo -e 'global _start\n _start:\n mov eax, 2\n int 80h\n jmp _start' > a.asm; nasm a.asm -f elf; ld a.o -o a;
  80. It's all just for fun, you know. by Medievalist · · Score: 1

    I'd like to know why Ted Tso and others are working on ext4?

    Because he wants to. Some people play softball, some people write fs code. It's because they derive some satisfaction or enjoyment from it.

    Those of us who run Ted's code (remember, he's the linux capabilities guy) benefit greatly from his activities, because he's smart and capable. But really, he's doing it for himself; he's a self-actualized human being.

  81. Re:when ext4 is feature complete it will be the #3 by kwalker · · Score: 1

    Not to persuade you away from JFS (To each their own), but ext4 fixes the slow deletes (Extents free up a lot of metadata over ext3's method, especially for large files) and the long fsck's (fsck only checks the parts of the FS that are in use and can be run while the fs is online), from what I've read.

    --
    ... And so it comes to this.
  82. Re:BTRFS? REALLY? by Adm.Wiggin · · Score: 1

    (speaking of poor names...)

    But she's such a GIMP!

  83. Re:when ext4 is feature complete it will be the #3 by Orgasmatron · · Score: 1

    I've been running a multi-TB fileserver for a few years now. It is ext3 on LVM on RAID5 (MD).

    My only complaint the whole time has been the block size in ext3. If ext3 had 1MB blocks, I would be VERY happy.

    --
    See that "Preview" button?
  84. Re:BTRFS? REALLY? by Frank+T.+Lofaro+Jr. · · Score: 1

    You call Windows "extremely capable"??

    --
    Just because it CAN be done, doesn't mean it should!
  85. OS/2 and VMS do by krischik · · Score: 1

    Actually OS/2 used ring 2 for privileged DLLs and VMS used 4 rings for all sorts of stuff - ever since the VAX that is because even the VAX had 4 rings. But then VMS is 10 years younger then UNIX and offers a lot of stuff UNIX still hasn't got.

    So there are operating systems which uses more then 2 rings.

    Martin

    1. Re:OS/2 and VMS do by Eli+Gottlieb · · Score: 1

      OK, no modern operating systems for modern processors use more than 2 rings. When was the last time OS/2 or VMS made a new release?

      Though I'll give you that VMS has a lot of stuff that Unix simply can't ever have because Unix was designed off an entirely different basis.

  86. Re:BTRFS? REALLY? by Haeleth · · Score: 1

    Wobbly window frames? Cubic desktops? I use Compiz and I have neither of those things. Just because a desktop can be tweaked to appeal to people with no taste doesn't mean that it's inherently attention-seeking. You might as well judge cars based on what happens to them in "Pimp My Ride".

  87. Re:when ext4 is feature complete it will be the #3 by EvilRyry · · Score: 1

    I've fscked a 12TB (mostly full) XFS volume with 2GB of RAM in ~10 minutes with the newer versions of xfsprogs. If its going that slow and taking that long, you're well past due for an update. They've made some really amazing improvements.

  88. Re:BTRFS? REALLY? by not+already+in+use · · Score: 1

    Oh jeez, I know this is just asking to be modded down, but I really don't care. Yes, Windows is the most capable desktop operating system. Prove me otherwise.

    --
    Similes are like metaphors
  89. Re:That's right, muhboy! No NinaFS for you. by Anonymous Coward · · Score: 0
    Perhaps we could rename it YAFS; Yet Another File System, or if that's taken, SAFS; Still Another File System

    Personally, I'd rather move to IBM's JFS, but if you gotta chase the next "dream FS"...

  90. Re:BTRFS? REALLY? by Curtman · · Score: 1

    More tarty make-up, than actual looks, imo.

    Sour grapes IMO.

  91. Re:BTRFS? REALLY? by Wdomburg · · Score: 1

    Well it does support tail packing. Draw your own conclusions.

  92. e'who gives up essential speed for a little safety by Anonymous Coward · · Score: 1, Interesting

    deserved to fsck.btrfs /

    Maybe the GPP intended a read-only mount, as a diversified file system is the intent of stability and performance.
    Notice how ext2 is fastest, but none rememered to mount their / in "ro" (read-only) and mount their /usr to a peculiarly advantaged filesystem. Tell me, who has the base system directories constantly in read-write mode, as though they just can't decide what software they want on their computer? Some people have already decided and installed their user applications and libraries; we aren't shuffling everything around like mad XP, OSX, and muVista. Tell me who is faster in read-only? Now consider why one needs journal outside their /home (better to symlink that to the actual, /usr/local/home or /usr/home)? What keeps people from remounting their system root fs to read-write but only to move system binaries and libraries (maintenance), and then just remount read-only when done?

  93. Re:e'who gives up essential speed for a little saf by IceCreamGuy · · Score: 1

    Now consider why one needs journal outside their /home

    Yeah, good point, I mean who the fsck would ever want to write to somethings like /var/logs?

  94. If you actualy use a GNU/Linux, by Anonymous Coward · · Score: 0

    not a fancy automated shell to abstract the essential liberty you inherently have available, then subdivide the / filesystem and keep journal file systems only on data that needs it.

    / ext2 ro /var jfs rw /tmp jfs rw /home reiserfs rw /usr ext2 ro /usr/share reiserfs rw /usr/local ext2 ro

    Well, of'course /tmp should be a ramdisk and maybe /var as well, but when do you need somthing outside /home to be read-write except when installing software? /home and various /usr/ and /usr/local/ mountpoints are all that need the necessary journal and mirroring because that's where the certain data is being held. Holding the entire system on a / filesystem in read-write is asking for a roothack to knock you out of the water after the blackhole sun sucks out all the crew and passengers.
     

  95. Re:when ext4 is feature complete it will be the #3 by evilviper · · Score: 1

    xfs is useful, but xfs should be used with a UPS.

    IMHO, if you need to depend on a UPS, that doesn't bode well for the filesystem. How is it any better than an ancient filesystem like UFS/Ext2 (mounted async), if it doesn't do a good job with integrity? What's "excellent" about XFS?

    A UPS can't prevent a system crash either, so you need a decent FS.

    --
    Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
  96. Re:BTRFS? REALLY? by Anonymous Coward · · Score: 0

    Let me know when a tiling window manager has compiz support. Then we'll talk. Until then, XMonad wins for usability.

  97. Re:BTRFS? REALLY? by Anonymous Coward · · Score: 0

    OS X is at least as capable, seeing as how it can run Windows as an application. Linux can also, though Windows can run Linux as an application now. Neither Windows nor Linux can run OS X as an application.

  98. I still don't understand: Why not ZFS? by Anonymous Coward · · Score: 0

    There is no problem when linking CDDL code with code under a different license, so the changes that need to done to the kernel could be done under gplv2 with ZFS as a module. That shouldn't be a problem since they do the same with nvidia already. Wouldn't it be great if the user had access to a compatible filesystem?

    Talking bout compatible Filesystems:
    ZFS is supported under MacOS, FreeBSD and Solaris. NetBSD and DragonflyBSD support are planned iirc. Btrfs and ext4 are supported under Linux only. Finally there is a filesystem which is adopted by a wide range of operating systems, it's open source, it's advanced, powerful and very easy to manage, supports transparent compression and encryption(coming soon), integrates nfs and cifs management and so much more..... and the only major operating system that DO NOT implement it are windows and linux?

    I've used linux for the last 12 years, but this license foo that is going on right now, which is keeping very useful open source software (zfs and dtrace) away from me, really soured my linux experience. I can live with a few less drivers for hardware i will never ever use, but i won't renounce features that save me hours and hours of work just because some guys don't like another open source license or chooses to spread FUD over those features (Zemlin.. I'm looking you're way)

    Regards,
    Bjoern

  99. high integrety world by krischik · · Score: 1

    OK, no modern operating systems for modern processors use more than 2 rings.

    So you consider a 39 year old system more modern then a 28 year old system? Ok, Unix had more face lifts done so if you look at the GUI of a current Unix it look more fresh.

    When was the last time OS/2 or VMS made a new release?

    VMS - "Latest stable release OpenVMS 8.3-1H1/ October 25, 2007". It is still used for high integrity systems. There are quite a few tools in the high integrity world which the low integrity world considers long dead. And you know what: If the low integrity world would sometimes have a look at those tools they might learn how to create less bug ridden systems.

    Though I'll give you that VMS has a lot of stuff that Unix simply can't ever have because Unix was designed off an entirely different basis.

    More importantly: If UNIX do get some of those hight integrity features bolted on (ACL, extended attributes, POSIX capabilities spring to mind) they are generally not used - usually because of backward comparability concerns.

    I remembers Brams dismissive reply when the communities asked for ACL support in Vim. If the standard editor (most system don't have a dedicated vi any more) won't support ACS - who does - who will.

    This is why I like my Mac - at least Apple does use all those nice new extras.

    Martin

  100. Ring -1 .. 3 by krischik · · Score: 1

    Not as far as I know and actually with the so called "hypervisor mode" we now have 5 rings of operation in x86 architecture:

    http://en.wikipedia.org/wiki/Ring_(computer_security)

    1. Re:Ring -1 .. 3 by Ant+P. · · Score: 1

      Guess I'm wrong then.

      I could have sworn I saw something ages ago saying they've given up on rings for the NX-bit stuff.

  101. Re:BTRFS? REALLY? by h4rm0ny · · Score: 1


    Well each their own. I did say that if it's what you like then fine. But my opinion is that most of the compiz stuff is more about attention seeking and showing off than it is about real substance. Hence my likening it to make-up and bling, rather than natural beauty which comes before both. You can turn it on or off, so it's good that it exists. And it does look cool. But I've got it turned off on mine. I pay more mind to what's underneath. ;)

    Regards,
    H.

    --

    Aide-toi, le Ciel t'aidera - Jeanne D'Arc.
  102. Re:BTRFS? REALLY? by not+already+in+use · · Score: 1

    OS X is at least as capable, seeing as how it can run Windows as an application.

    You mean it can run Windows in a VM with limited functionality, specifically in the graphics department. OS X is slow and bloated, but thanks to common perceptions most people believe this not to be the case. I for one, had a MacBook Pro that I dual booted Windows on, and I can testify to this fact 100%.

    --
    Similes are like metaphors
  103. Re:BTRFS? REALLY? by Yfrwlf · · Score: 1

    Which is perfectly OK as long as all Linux users still have the choice as to which software they choose. Making that choice an easy one for users who don't know how to compile stuff would also be a very nice Linux advancement, and even that can deal with fragmentation, as long as the fragmentation rests on top of some of the most common standards.

    --
    Promote true freedom - support standards and interoperability.
  104. Re:BTRFS? REALLY? by DaVince21 · · Score: 1

    Right, OpenOffice.org started looking so much more better thanks to Compiz.

    --
    I am not devoid of humor.
  105. Re:BTRFS? REALLY? by Curtman · · Score: 1

    No, OOo looked good before compiz.