Slashdot Mirror


Reiser4 Benchmarks

unmadindu writes "Hans Reiser has benchmarked Reiser4 against ext3 and Reiserfs 3. Reiser4 turns out to be way faster than V3, and for ext3, why don't you check out the results yourself ? Hans Reiser states, "these benchmarks mean to me that our performance is now good enough to ship V4 to users", and he will be probably sending in a patch within the next couple of weeks to be included in the 2.6/2.5 kernel."

108 of 414 comments (clear)

  1. Reliability by prestwich · · Score: 5, Interesting

    My one concern is reliability and recovery from failure; I've had a few cases where my belief in ReiserFS has been questioned; however I can't get Ext3 to build on larger than 500GB arrays.

    At this point I'd happily choose based on reliability/recoverability/stability not raw speed.

    1. Re:Reliability by globalar · · Score: 5, Insightful

      Exactly. RAM, CPU, and storage space are ever increasing. Now we need better ways to organize data, access it, protect it, and back it up.

      The fact of the matter is, it is easier to make a fast system than a stable, reliable one.

    2. Re:Reliability by SaDan · · Score: 5, Informative

      ReiserFS has worked pretty good on 1.2TB RAID-5 array I helped build. We're running RedHat 7.2 on a box with a Promise SX6000 RAID controller.

      The drivers are crap, and the box dies about every week or so. Haven't lost a single file yet, and we're at 91% filesystem useage (millions of files).

      The / filesystem is ext3. It's about 20gigs, and has had to have files restored several times.

      I have a lot of confidence in ReiserFS, after seeing the incredible amount of abuse on this one particular machine. I have run ReiserFS for quite a while now (ever since it was part of the kernel) for all of my home systems, and have never had a single issue with those filesystems.

      Looking forward to what ReiserFS4 will bring.

    3. Re:Reliability by cvd6262 · · Score: 5, Interesting

      We had a massive failure of our primary database server while I was out of the country. (Trust me, nothing puts a damper on your day more than having one of your techs call you at midnight from 7,000 miles away.) I blame Reiser. Not because it caused the outage (it was hardware), but because it was so good, it made us a bit lax.

      We're just a small grant lab at a university, so it's not like this was a corporate system or anything, and there had been hardware problems before. Given that most of the people are not techies, they did not know how to ssh in and shutdown -r now, so they would just hit the reset button whenever they thought something was wrong and I wasn't around.

      Anyway, because of Reiser's journalling, the system would come right back up after a forced reboot. I think that the guys in the lab cut the power a couple of times to many and the hard drive just gave out.

      By the way, I just had a tech install a new drive, and Debian base with ssh. I knew the password he would use for root, and I was able to rebuild the entire system and restored 250,000 records in half a day.... From North Africa.

      Try that with a non-*nix.

      --

      I'd rather have someone respond than be modded up.

    4. Re:Reliability by FyRE666 · · Score: 5, Funny

      Given that most of the people are not techies, they did not know how to ssh in and shutdown -r now, so they would just hit the reset button whenever they thought something was wrong and I wasn't around.

      I've found users doing that to my servers before now. I find that hitting them on the nose with a rolled up newspaper and shouting "No! Bad monkey!" in a stern voice tends to stop this behaviour...

    5. Re:Reliability by Zerbey · · Score: 3, Funny

      I find putting my servers in a locked room where the users cannot get at them even more effective. Just an observation :-)

    6. Re:Reliability by 91degrees · · Score: 3, Funny

      Most admins I know would rip off all the user's limbs, execute their family, and say a firm "No". Your way is a little more user friendly.

    7. Re:Reliability by Anonymous Coward · · Score: 3, Informative

      Troll.

      First I don't get what you mean by "huge amount of stuff in the partition"... In no place does he actually say that _anything_ is being stored on the root partition (we can assume it's an OS only partition).

      It's probably a server with an internal 20GB drive connected to the 1TB raid. If you're suggesting the 20GB is a partition off the raid, well then you have no clue how these things work.

    8. Re:Reliability by Billings · · Score: 5, Interesting

      Yeah, you won't lose files, but you'll lose data. It's been noted elsewhere in this article's comments in more technical jargon, but it is a known flaw in ReiserFS that blocks of data can be written to flat out wrong areas. As an example, I had an outage while I was working with my config files and running an apt-get update;apt-get dist-upgrade. Reiser then managed to write the middle of a debian package file to whatever config file I was working with.

      Had me confused to hell until I saw a newsgroup discussion that mentioned the exact problem I was having. Does Hans Reiser know about this problem? Oh, yeah. He does. Is he concerned about it? No, he's not. In his own words he's not. And ReiserFS fails silently; you'll never know until you find it.

      When I setup ReiserFS on my machine, I was aware of similar complaints, but I dismissed them as fear of trying something unproven. And I was happy with ReiserFS for quite awhile, because I never saw anything wrong (unlike ext2/3). But I really can't support a FS that has these kinds of data integrity issues if the team has that kind of attitude towards them.

    9. Re:Reliability by hankaholic · · Score: 4, Informative

      You obviously know nothing about ReiserFS.

      ReiserFS does have speed as a goal; however, with ReiserFS 4, all filesystem operations are now atomic, which is functionally equivalent to having full data (not just metadata) journalling.

      In addition, having the fastest CPU in the world won't make ext[23] better at things for which ReiserFS is fast.

      CPU speeds are increasing. Storage space is increasing. RAM is cheap.

      However, none of that equates to "disks are fast". Having a fast CPU with a slow filesystem is like having a gigabit LAN connected to the Internet via dialup. Sure, internally you're quite good, but throughput will still suck eggs.

      The fact of the matter is, it is easier to have no clue what you're talking about than to read a little bit before posting.

      --
      Somebody get that guy an ambulance!
    10. Re:Reliability by jd · · Score: 3, Funny

      Given that techs are supposed to... well, use high-tech, I sugest using a high-tech version of the newspaper. Now, what high-tech device can you read the news on? Hmmmm. How about this 21" monitor?

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    11. Re:Reliability by haoto · · Score: 2, Funny

      Hey man, you're talking just like my boss!

    12. Re:Reliability by SaDan · · Score: 2, Interesting

      The root partition is on a 20gig drive that is not part of the RAID array. It ONLY has the operating system stored on it.

      I'll say it another way... We don't boot from the array, and we don't store any operating system information on the array.

      Guess I should have mentioned that earlier, although I didn't consider that my decisions on how to lay out the filesystems would come under fire here.

    13. Re:Reliability by RoLi · · Score: 2, Funny
      From North Africa.

      Try that with a non-*nix.

      Aw, that's a non-issue. MCSEs are so cheap that it's no problem to always keep at least one MCSE within a 200 meter radius (300 meters if the MCSE is not overweight and under 30) from the system for 99.9% uptime.

    14. Re:Reliability by shokk · · Score: 2, Interesting

      When you're talking about that much data, backups become something you're only going to pull out for catastrophic recovery. Something like snapshots become more and more important. Are any of the upcoming Linux filesystems going to address this? I imagine that something like that would begin to take customers away from EMC and Network Appliance, and would put high reliability in the hands of smaller organizations.

      --
      "Beware of he who would deny you access to information, for in his heart, he dreams himself your master."
    15. Re:Reliability by GigsVT · · Score: 2, Interesting

      I'm using ext3 on a 1.2 TB array, a 2.0 TB array, a 1.9TB array, a 550GB array, another 2.0 TB array, and a 1.0 TB array (If I remembered them all right). They are all ATA or SATA arrays, some straight 3ware, some 3ware with software RAID on top to bind multiple cards together, and some are AC&C ATA-SCSI boxes.

      Anyway, haven't had any trouble, which is more than I can say for when I last tried Reiser. It couldn't handle files larger than 2GB about a year ago, that's since been fixed. It also caused some strange stability problems.

      The 1.9 TB and the 1.0TB array had XFS on it for a while, I'm not sure if I ever got around to changing the 1.9 to ext3, but XFS (linux) has also performed flawlessly. Of course our IRIX boxes also run XFS flawlessly on some smaller legacy SCSI-only arrays.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    16. Re:Reliability by nathanh · · Score: 2, Funny
      It's probably a server with an internal 20GB drive connected to the 1TB raid. If you're suggesting the 20GB is a partition off the raid, well then you have no clue how these things work.

      Wow, that's a neat technique. He said nothing even remotely like your first sentence, but you claimed that's what he said so you could easily "destroy" his argument. Let me try:

      If you're saying that Queen Elizabeth and her band of merry men is responsible for the block size on his RAID array then you have no clue how these things work.

      How's that? Please teach me, O Master.

    17. Re:Reliability by JLester · · Score: 3, Informative

      His point is that if you have a 2TB partition made of 8 250GB drives, you have no data protection at all. That's known as striping. If one drive fails, you lose the entire array with no way to rebuild it without a professional data recovery service (and even that might be iffy with Reiser).

      RAID5 would give you a 1.75TB partition with the equivalent space of one drive reserved for fault tolerance. My preference would be RAID5+Spare with that much space which would give you a 1.5TB partition with 250GB reserved for fault tolerance and one hot spare drive in case one of the others fails. With that setup, you're okay until you lose three drives.

      Jason

      --
      "FORMAT C:" - Kills bugs dead!
    18. Re:Reliability by globalar · · Score: 2, Insightful

      "You obviously know nothing about ReiserFS."

      I didn't say one thing about ReiserFS. Not being an expert (and this being /.), I purposely avoided making any judgements.

      On the record, the only opinion in my entire post was that it is easier to make something fast than it is to make something stable.

      Perhaps I was too plain. This was not a judgement of ReiserFS. It was an opinion formulated by observation.

    19. Re:Reliability by Anonymous Coward · · Score: 2, Informative

      RAID5 stores N drives' worth of data on N + 1 drives, for any N >= 2. For each range of sectors, one of the N + 1 drives is used for parity of the other drives' data (this is equivalent to reserving one drive for parity, but produces less contention).

      Your spare machine is more resistant to drive failure (you don't lose data unless corresponding drives in each machine fail) but you're paying for N * 2 drives (like RAID1) plus another computer and enough bandwidth to keep them in sync.

      Broken controller? Viruses? If your computers are broken, nothing can save you, certainly not buying more of them.

    20. Re:Reliability by jdew · · Score: 2, Interesting

      ick, 3ware... gotta hate raid cards with no onboard cache, and do retarded tricks with the drives cache for performance :(

    21. Re:Reliability by hansreiser · · Score: 4, Informative

      If you are using metadata journaling, then a file that you are in the middle of writing to when it crashes can have garbage added to it. Note that Unix filesystems have had this feature since the days of FFS and UFS. Use data-journaling if you find that unacceptable. ReiserFS V3 supports both data-journaling and meta-data journaling now.

      Be warned though, that all fixed location journals double the transfer time cost of performing writes because the data must first be written to the journal, and then written somewhere else. This is why we don't make data journaling the default in v3. Trust me, full data journaling would have been far easier to code first than meta data journaling, but it isn't in the interest of the 'average' user.

      Now V4 is an atomic filesystem, which is much better than data journaling, because it means that all filesystem operations are performed fully atomically. Your write syscall either fully happens or it does not. Applications can have multiple filesystem operations performed atomically. We do this without writing the data twice through use of a technique called wandering logs, which I describe in a posting below (and on our website).

  2. It's really good that filesystems are maturing too by 192939495969798999 · · Score: 2, Insightful

    With all of the focus on the latest hardware and graphics, it's good to know that improvements are being made across the board like this. It gives me faith that Moore's law is a long way from failing!

    --
    stuff |
  3. Conversion? by avalys · · Score: 4, Interesting

    Does anyone know if there will be a conversion utility available - i.e, to convert ReiserFS v3 partitions to v4?

    --
    This space intentionally left blank.
    1. Re:Conversion? by mlg9000 · · Score: 3, Informative

      ReiserFS v4 is backward compatible with v3. So if there isn't a conversion tool yet (don't know) you'll still be able use your v3 filesystems along side any new v4 filesystems you might add.

    2. Re:Conversion? by Anonymous Coward · · Score: 2, Informative

      No, Reiser4 (not reiserfs4) is not backwards compatible with ReiserFS 3. The filesystem structure has been built from ground up, and one of the reasons why I partitioned my new hard drive to only half its full capacity, for the cp -aR command.

    3. Re:Conversion? by hansreiser · · Score: 5, Informative

      No, V4 is not backward compatible with V3. V3 and V4 are kept as separate codebases so that the new V4 features don't destabilize V3. We are very serious about avoiding adding new features to V3, so that it can become a zero defect product.

      However, there is a tool called convertfs (as well as tar) which can convert V3 to V4. It can also convert ext2 to V3 or V4 or V3 or V4 to ext2. It is pretty clever (and written by someone outside our team), in that it creates a loop back mounted target filesystem inside a file inside the source filesystem, copies everything from the source to the target, and then reshuffles the blocks of the file so that they are at the offsets on the device that they were at within the file.

    4. Re:Conversion? by Anonymous Coward · · Score: 5, Funny

      C'mon folks, can't we find someone more authoritative than this guy. I mean, what would someone whose nick is "hansreiser" know about this stuff.

    5. Re:Conversion? by Jah-Wren+Ryel · · Score: 3, Interesting

      Hans - How is scalability? SGI seems to think that ReiserFS doesn't scale at all with multiple CPUs, unlike ext2. At least according to the paper they presented at the Ottawa linux symposium last month:

      (see page 9 of this PDF for the graph)

      The implication is a lack of fine-grain locking. Does this new all-atomic, all-the-time implementation automagically bring better locking too?

      --
      When information is power, privacy is freedom.
    6. Re:Conversion? by Jah-Wren+Ryel · · Score: 2, Informative

      I didn't "fail" - I chose not to. It isn't a competition with ext*, it is a question about implementation.

      --
      When information is power, privacy is freedom.
  4. wait! by BigBadDude · · Score: 5, Insightful


    hey, I can live with an unstable gnome or Kicq, but a beta filesystem?? no thanks dude!

    1. Re:wait! by justsomebody · · Score: 2

      you could if you have spare machine:)

      As it goes for me I can do that with no pain. Seeing what r4 provides migrating my things to r4 would be good. But first place for beta fs is spare machine that is just used to crash test this filesystem.

      --
      Signature Pro version 1.13.2-3 release 83.5 beta3try7 after-breakfast edition
    2. Re:wait! by quasi_steller · · Score: 2, Informative

      Yep. Testing a beta filesystem is something you do on a spare machine that you are not using at the moment. It would be really cool to help test ReiserFS out, but I only have one machine and can't risk losing all of my work!

      --
      ...interesting if true.
  5. Reiser4? Competition? by GreyWolf3000 · · Score: 4, Insightful

    After reiser4, what filesystems are actually decent competition for it? It'd be nice for OSS to claim not only the best web server (apache), best kernel, and best filesystem.

    --
    Slashdot: Where people pretend to be twice as smart as they really are by behaving like children.
    1. Re:Reiser4? Competition? by 1s44c · · Score: 5, Funny

      After reiser4, what filesystems are actually decent competition for it? It'd be nice for OSS to claim not only the best web server (apache), best kernel, and best filesystem.
      --

      Sick of gentoo zealots throwing plugs in completely unrelated topics? Me too!


      Not to mention the best software installation system ( portage ).

    2. Re:Reiser4? Competition? by Doktor+Memory · · Score: 2, Informative

      Reiser's competition is, in order: SGI XFS and IBM JFS. Oh, and I guess VxFS.

      --

      News for Nerds. Stuff that Matters? Like hell.

    3. Re:Reiser4? Competition? by Znork · · Score: 2, Insightful

      Most of them?

      Dont get me wrong, reiserfs has impressive performance, but the recurring problems with bugs causing crashes when using reiserfs makes me extremely reluctant to use it on any production system.

      A filesystem can have incredible performance, great features and impressive data consistency, but if it crashes the machine it's unusable.

      Currently I dont see the developers of reiserfs paying nearly enough attention to stability, nor do they seem particularly interested in the subject, preferring features and performance over stability.

      Laudable goals, but goals that puts the safe use of reiserfs into a small niche of high-performance failure tolerant systems.

    4. Re:Reiser4? Competition? by jd · · Score: 5, Insightful
      Actually, OSS claims several of the best filing systems! :)


      XFS is probably one of the fastest journalling filesystems out there, all-round, and probably offers the best competition to Reiser4. I'm actually surprised not to see some benchmarks against it, as XFS has gathered quite a following in places.


      The port of the Plan9 filing system is said to be one of the fastest filing systems out there - enough so that it's a part of a Government research program called "Pink", run by some mad scientists at Los Alamos. Yes, that Los Alamos. Again, this would be an excellent FS to have some benchmarks against.


      Last, but not least, Reiser4 didn't do spectacularly well against Ext3 in the benchmarks. I saw plenty of results both ways. Reading vs. Deleting, for example, shows a definite penalty whichever FS you choose, depending on the operations you're performing.


      In the end, if you truly want the fastest system, you should format partitions according to the type of workload they'll be doing. You want fast deletes on a /tmp partition, for example, but you will likely care much more about reading times on your application binaries, and modification times on your data files.


      (Unless you're using the suspend patch a lot, you probably won't want journalling on the /tmp partition, either.)


      A truly optimized system, therefore, isn't about picking your "one true love" of the filesystems. It's about deciding what criteria apply, and then looking to see what filesystem best meets that criteria.


      A mixed-fs machine should be capable of out-performing ANY homogenous-fs machine, no matter what fs the homogenous-fs machine has picked, because a homogenous system will always be a compromise. A mixed-fs system need compromise nothing. (Other than your sanity. Which, being a geek, is just a hinderence anyway.)

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    5. Re:Reiser4? Competition? by Wesley+Felter · · Score: 2, Informative

      The port of the Plan9 filing system is said to be one of the fastest filing systems out there - enough so that it's a part of a Government research program called "Pink", run by some mad scientists at Los Alamos.

      I think they are using the Plan 9 network filesystem protocol (9P2000), not the Plan 9 local filesystem (which isn't anything special AFAIK).

    6. Re:Reiser4? Competition? by lewp · · Score: 2

      I disagree.

      --
      Game... blouses.
  6. Honest Portability Question by jstockdale · · Score: 5, Interesting

    I am curious as to whether there are any projects to port Reiser4 to *BSD, particularly FreeBSD 5.x. Does anyone have any thoughts on how difficult a port might be? Can somone more versed in filesystems on *nix enlighten me as to the implimentation differences?

    --
    **AA: a bunch of mindless jerks who'll be the first against the wall when the revolution comes
  7. The Paul Reiser filesystem... by Anonymous Coward · · Score: 2, Funny

    ...is mostly funny, but is too much of a company filesystem to trust completely.

  8. Computer's names translation by vadim_t · · Score: 5, Informative

    In case anybody cares, "strelka" means "arrow", and "belca" means "squirrel"

    Wonder what naming system they're using. I use names from Alice in Wonderland.

    1. Re:Computer's names translation by kliklik · · Score: 5, Informative

      Strelka and Belka are the names of the dogs that were sent into space.

      Quote from first google search result: "Belka("Squirrel")and Strelka("Little Arrow") were launched into space on board Sputnik 5 on August 19, 1960. They were accompanied on their historic flight by 40 mice, 2 rats and a number of plants. Belka and Strelka were safely recovered after spending a day in orbit. Strelka eventually gave birth to a litter of 6 healthy puppies, one of which was given to President Kennedy as a gift."

      --
      guru in training
    2. Re:Computer's names translation by hansreiser · · Score: 4, Interesting

      These are the names of the two dogs that were sent into outerspace by Russia.

    3. Re:Computer's names translation by cygnus · · Score: 2, Funny
      Strelka eventually gave birth to a litter of 6 healthy puppies, one of which was given to President Kennedy as a gift.
      are you serious?? those dogs had sex in space!??

      ;)

      --
      Just raise the taxes on crack.
    4. Re:Computer's names translation by waferbuster · · Score: 2, Funny

      That makes the mile high club seem downright inadequate.

      --
      I'm an individual! Just like everyone else!
  9. So how much better is this really? by Man+In+Black · · Score: 2, Insightful

    All these numbers are nice, but I can't make heads or tails of them. Since these are times, I assume lower numbers are better? If so, they why are they usually in red in the report?

    Would this increase the speed of a normal system by any noticable amount? Is it worth my time to switch over my EXT3 filesystem to reiser4? Are there programs that can perform the filesystem change for me?

    --
    -"One machine can do the work of fifty ordinary men. No machine can do the work of one extraordinary man." -EH
    1. Re:So how much better is this really? by gumpish · · Score: 3, Informative
      Since these are times, I assume lower numbers are better? If so, they why are they usually in red in the report?

      The columns marked B/A and C/A are the performance ratios of ext3 data journaling and ext3 to reiserV4 respectively. A ratio greater than 1 means more time was needed for the operation than reiserV4 while a ratio less than 1 means faster performance.
  10. I don't understand the statistics by 0x0d0a · · Score: 4, Insightful

    The statistics on that page are measured in seconds, no? So larger numbers are worse.

    The comparisons are done with [foreign filesystem] divided by reiser4.

    One would think that numbers greater than one, where the foreign filesystem has a long running time and reiser4 a short one, would be the ones that benefit reiser4.

    Yet the numbers *less* than one are green, where Hans says reiser4 is considered better.

    What's going on?

    (Incidently, after having a friend lose a filesystem to buggy reiser code, I'm a bit inclined to wait until people have *seriously* hammered on this).

    1. Re:I don't understand the statistics by silvaran · · Score: 3, Informative

      A. reiser4
      B. ext3 data journalling
      C. ext3


      That's how the filesystems are lettered. In the column headers, you see this:

      A B/A C/A

      So it looks like the first column is the actual time for reiser4 (A), the second column is the ratio between ext3-j (B) and reiser4 (A) which is B divided by A, and the third is ext3 divided by reiser4. So if the number is > 1 (red), it means reiser4 took less time and might be "better". If the number is 0 (green) it means reiser took MORE time.

      I think.

    2. Re:I don't understand the statistics by hansreiser · · Score: 5, Informative

      The script that creates the comparison tables divides the other filesystems by the base filesystem. The problem is that Reiser4 was used as the base filesystem in one of the benchmarks, but not the other. So in one benchmark, green is good, and the other benchmark, red is good.

      I would have fixed this before posting to lkml, but I had to catch a plane, sorry about that.

  11. Bugs by Kaladis+Nefarian · · Score: 4, Insightful

    While great, this announcement/benchmark/statement does not mean that ReiserFS V4 is ready for production use, just that it is fast. It needs a lot more bug testing before then, so don't rush out and mass-convert to V4 just yet! See here for the full thread, rather than just the first post...

    --
    * Several monkeys are here, playing banjos and wearing small hats.
  12. Re:ok by ElGuapoGolf · · Score: 3, Interesting

    You'll probably have to compile it in yourself for now.

    RH probably will include it in the future, but probably won't give you the option to install on it without jumping thru major hoops.

    RH seems to suffer from a big case of "not-invented-here-itis", and RH users sometimes suffer for it. Not having ReiserFS is one way in which they do.

  13. Which to choose for DBs? by Openadvocate · · Score: 4, Interesting

    I realise that it is a bit early to adopt V4, but stable issues aside, which filesystem would YOU choose to for database volumes for fx. Oracle or MySQL?

    --
    my sig
    1. Re:Which to choose for DBs? by Just+Some+Guy · · Score: 4, Informative

      The database is already massively journaled. There's little advantage in passing every byte through two seperate journal, and plenty of disadvantages (speed, resources, etc.).

      --
      Dewey, what part of this looks like authorities should be involved?
    2. Re:Which to choose for DBs? by DarkVein · · Score: 2, Interesting

      I'd go with XFS. I've never heard of corruption from regular use, while I've heard far too many horror stories about ReiserFS. Aside from the ineptitude at reliability, Reiser's one of the most unpleasant characters I've ever run into in Free/Open software. Stallman at least has reason for what he does. Reiser sends off flames to dev mailing lists (sometimes legally) threatening people for removing the kernel message and command line commercials from his code. Usually, he fires off fiction about "the next version of GPL" that someone from the FSF invariable shoots down. Bleh.

      If you ever get a soap commercial in the output of dmesg, you'll know who to blame.

      --

      I'm as mimsy as the next borogove but your mome raths are completely outgrabe.

    3. Re:Which to choose for DBs? by julesh · · Score: 2, Informative

      There's little advantage in passing every byte through two seperate journal

      1. If you're using a filesystem for databases, then the chances are you're keeping other stuff besides the database on them. Otherwise, you'd use a raw partition.

      2. I believe that by default data changes to files aren't journalled in either of the systems that were being compared; only filesystem metadata changes. This doesn't cost much in terms of resources, particularly if the metadata rarely changes (as I suspect it doesn't on oracle data files, which I assume are fixed size allocations that are opened when the server is started and never closed...).

  14. Re:Comparison by Anonymous Coward · · Score: 3, Funny

    Like "Three Dogs Playing Poker" to the Mona Lisa.

  15. XFS? by leoboiko · · Score: 4, Interesting

    How does it compare against everyone's favorite, XFS?

    --
    Prescriptive grammar:linguistics :: alchemy:chemistry. Stop being a nazi and learn some science.
    1. Re:XFS? by lvdrproject · · Score: 2, Insightful
      Yeah, really. Currently i'm a big XFS fan, but i've heard a lot of great things about Reiser. That said, i'm not going to switch over to Reiser until i see some good information that suggests that it's worth it. What the advantages are, what the disadvantages are, what happens if you do this or that, numbers about speed or whatever, et cetera, et cetera. I've seen a few good comparisons, but they're either all numbers, or all conjecture. :/

      I'd also like to see it compared to JFS and maybe ext2 (if not just for reference).

  16. but will it make it by DemiKnute · · Score: 4, Interesting

    So he's submitting it to 2.6, but what are the chances it'll get submitted? Isn't this what caused all of Reiser's bitching a couple of years ago? He waited to long to get RFS into the kernel and ran into the feature freeze, and then pitched a hissy fit.

    --
    .
    1. Re:but will it make it by Nothinman · · Score: 3, Interesting
      not many people are going to use 2.6.0, and I'm sure V4 will make it into 2.6.1. The wait will be what, like a month?

      The point is that unless reiser4 is 100% self contained and stable it shouldn't make it into 2.6 at all now because there's been a feature freeze and a new filesystem and anything else it adds to the kernel are features. Hans waited too long, again, and technically reiser4 shouldn't be included in the standard/Linus kernel until 2.7 now.

      XFS wasn't allowed to be included in 2.4 officially because of those reasons but the SGI developers didn't cry about it, they just kept their patches up to date and waited for 2.5 to start so they could get included.

      If the filesystem is actually stable and has real benefits over ext3, XFS, JFS, etc it'll probably be patched into distros like RedHat anyway so it's not a huge deal.

  17. Re:ok by silvaran · · Score: 4, Informative

    I'm not sure about the exact reasons why they don't support various other filesystems. The default bootup sequence of a RH system uses an initial ramdisk, and actually scans each partition available to find out where they should be mounted (they created nash, NAno SHell, which is just simple support for shell commands as well as fs label scanning). That's why you see the LABEL=/ in your /etc/fstab on a RH system. ResiserFS didn't support filesystem labels until 3.6, so using this setup could mess things up (with 3.5 or older), and justifies your point about having to "jump through hoops" to get reiserfs working. The simplest way I found to move to reiserfs was to change all the LABEL=??? specifications to actual device files, boot from a recovery disk, move everything around while reformatting the partitions as another filesystem, then finally rebooting.

  18. Filesystems for the laptop user? by niko9 · · Score: 4, Interesting

    Anybody know what, if any, features are being added for the laptop user? Last time a checked, journaled filesystems, like ext3, were generally a no-no if you wanted you battery to last.

    Maybe a filesystem just for laptop/tablet pc users?

    1. Re:Filesystems for the laptop user? by hankaholic · · Score: 3, Insightful

      Operations are journalled in such a way that instead of updating the directory entries (or the file allocation table, or whatever applies to that specific FS) and then writing the new file contents, ReiserFS writes the new contents, and _then_ updates the directory information.

      From the impression I have after having read the V4 design docs, in most cases it's a matter of just writing the new data without touching the old, then updating the bookkeeping entries and marking the old information as unneeded. In theory, this method gives journalling and crash safety "for free".

      In practice, there's the matter of increased code complexity, which means that you'll take a hit on CPU usage, at least until the developers can stop their mad rush to get V4 into Linux 2.6 and work on code-level optimization (as opposed to optimizing the algorithms), but with Reiser's tree algorithms you're probably spinning the disk less while waiting for the filesystem code to find the data you need.

      At any rate, full data journalling comes without the burden of "log what I'm going to do, complete with data, then perform the operations, then mark the pending operation as completed in the journal." Basically you get around having to write twice in nearly all cases.

      (If ReiserFS decides that it's a really really good idea to leave the data where it's at, it will write new contents to free space on the disk, update the tree to reflect the changes, write the new contents to the original file's location on the disk, and again update the tree. However, this only occurs in rare circumstances, and is the exception rather than the norm.)

      --
      Somebody get that guy an ambulance!
  19. unfair - but true for me at least in the past by fiddlesticks · · Score: 3, Funny

    DELETE (time in secs to perform action)

    40.13(R4) 0.797 (ext3 journal) 0.837 (ext3)

    woo-hoo! now it corrupts my data even quicker :(

  20. Which is best? by Dodge+This · · Score: 3, Interesting
    OK so there seems to have been a lot of Reiser flaming going on here, so what would people recommend? Taking into account Speed, Reliability and Compatability.

    I know a lot of people will pull their hair out when they hear this, but: Speed is my primary concern. On long compiles of new programs or kernels for example the speed difference on a good FS can be important. I'm not saying that I'm willing to have a FS that corrupts every last file and directory, only that given two FSs which both have seemingly similar stability I would prefer the speed boost.

    I have tried one or two of the FSs but I haven't used them for any length of time to be able to compare one against another.

    1. Re:Which is best? by FlyingDragon · · Score: 2, Informative
      If speed is king, consider a tmpfs filesystem periodically rsync'd back to a drive.

      If that's a little extreme, XFS is probably the way to go. Ext3 is good for /, since you still have an ext2 way out, if you need it. ReiserFS works well with small files -- it makes a great proxy cache. XFS is good all around.

  21. "but you won't need to fsck" by Anonymous Coward · · Score: 2, Informative

    In this note, Hans Reiser clearly states his position: you trade off filesystem metadata performance against data integrity.

    Once you realize that this maintains your filesystem structure but turns your files into Swiss cheese (this happened to me several times) you'll switch to ext3 or XFS.

    Since reiserfs continuously rebalances the block tree, files that are only open for reading can be trashed by having a block exchanged with the last file written before a crash.

    Hans is recklessly pursuing optimization of one aspect of filesystem performance. I honestly believe DARPA should reconsider funding his current track of work.

    1. Re:"but you won't need to fsck" by hansreiser · · Score: 5, Informative

      That was V3. V4 is an atomic filesystem, which means that every filesystem operation is performed as a fully atomic transaction. This is more secure than the guarantees of data journaling, as data journaling doesn't necessarily guarantee that the write will complete.



      The reason we are able to do this in V4 but not in V3 is that V4 uses what I call wandering logs. With wandering logs, instead of copying data first to the journal, and then after commit copying it from the journal to the rest of the filesystem, (thereby writing the data twice), we just change our definition of where the journal is. I don't think that data journaling is worth going half-speed for most users. With V4, we not only don't go half-speed, we go faster than V3 ever went.



      For more details, please take a look at http:www.namesys.com/v4/v4.html

  22. About reiser4 by Fefe · · Score: 5, Informative

    I attended Hans' presentation at Linuxtag.

    Basically, reiser4 is optimized for the case where you unpack a large tarball, say the Linux kernel, and have enough memory to hold it all in cache, which is true for most of us these days. reiser4 will then choose the optimal disk layout for these files and flush them to disk.

    Hans also has aspects of a log structured file system in reiser4, which means you don't write to the file, you write to a log file which basically encompasses the whole disk. The up side is that you mostly write linearly, the down side is that the files get badly fragmented if they are updated at all. Most files are not updated, just written once at installation of the package. The files that are updated frequently tend to be source code from CVS, which are small enough to fit in memory completely and have reiser4 choose an optimal disk layout again.

    The case where this model completely sucks is the case where you update many portions of a large file. For example, running an SQL database with files on a reiser4 file system as backend, or maybe a DNS server with DDNS, or a berkeley db backend for Postfix or qmail to keep the SMTP AUTH users or something. Also, log files will probably be badly fragmented.

    Hans proposes to have something like a transparent defragmenter running in the background, which he calls "repacker". This would run in the kernel space, as part of the file system, and defragment badly fragmented files that are accessed frequently. This would solve most of the down sides of his approach, but this repacker is not finished yet.

    My personal view of reiser4 is: it looks like it is optimized to perform well in benchmarks. It tries to be fastest for updating databases, but buys the performance by being slower when reading the data afterwards. The critical question is whether the repacker can alleviate these concerns, and as long as it is not finished, reiser4 is basically out of the question except for a little testing here and there. I reckon reiser4 would be a great filesystem for keeping your mozilla and gcc CVS checkout handy. But until the repacker is done, I will not even use it for testing, because the repacker really is the crucial component that makes or breaks this.

    By the way: my previous experiences with reiserfs were less than stellar. Some people call it shredderfs instead. The main complaint with reiserfs is and always was that the fsck is not nearly as trustworthy or stable as the one from ext2/ext3. So even if I use reiserfs at all, it's only for data I can afford to lose completely, like my CVS checkouts or the squid cache directories or something like that.

    The benchmarks do look good though, and I am glad that at least someone is still trying major innovations in this area. Since most Unix vendors or divisions are no longer profit centers, file system innovations have largely stalled or moved to specialized companies who regard them as proprietary (Veritas) instead of releasing them as free software like IBM and SGI did.

    1. Re:About reiser4 by hansreiser · · Score: 5, Informative


      The difference between us and an LSF is that we perform well BEFORE you run the repacker, and we merely perform even better after you run it. LSF's required that you run the repacker to get good read performance, we don't. V4 kicks V3's butt without the repacker by a lot (due to dancing trees, allocate on flush, extents, and ending the use of BLOBS, among other things). With the repacker, it will just kick it harder.




      Our approach synthesizes a lot of approaches, rather than considering one technique to be the answer to everything. This makes our performance more robust, as the different approaches each cover over each other's lackings. There are some situations in which using a repacker is higher performance than making lots of little changes while constantly maintaining optimal allocation of files.



      The repacker will be ready in a few weeks.

  23. ext2? by Coneasfast · · Score: 3, Interesting

    I know ext2 isn't a journal fs, but it would still be interesting to see a direct comparison again reiser4.

    --
    Marge, get me your address book, 4 beers, and my conversation hat.
  24. If I'm reading these right.... by Anonymous Coward · · Score: 4, Insightful

    It's still not time to swap it for ext3 for general use.

    The first table with the mixed file sizes is the most compelling. The fact that reiser4's Create and Copy times are less than a third of the ext3s in real_time is impressive.

    But the fact that the CPU consumption on Read is double that for R4 as it is for ext3 is a serious problem. On a 1.3 Ghz machine saturating a generic UDMA 100 60G bus on RH 9.0 it's about 10% of the CPU, so the home user might not care. For a system capable of delivering serious data (like a 4 drive, 15k rpm SCSI RAID array @~3 times the read throughput) going from 30% CPU to 60% CPU usage is a definite problem. Even with a 2.6 Ghz cpu it would still move from chewing up 15% to 30%. I know these numbers don't scale exactly, but they could in fact scale ugly depending on how much CPU is dedicated to communicating with the hardware and how much is in fiddling with the filesystem. My production boxes spend > 80% of their disk activity reading, so I'm not yet inspired to go out and spend the time running benchmarks on highperf. systems just yet.

    Nevertheless, I always admire it when a new version of software comes out and it's noticeably faster than the old

  25. Real comparison? by Quixote · · Score: 3, Insightful
    I'd like to see someone (i.e., not me) do a comparison of ReiserV4 with other heavy hitters like XFS and JFS

    That would be interesting.

  26. Re:ok by Anonymous Coward · · Score: 3, Informative

    that may be true for existing systems, but in the case of running an install from cd, instead of taking the default, add either "reiserfs" or "reiser" (don't remember which - think the "fs" one) to the boot options at the prompt when booting from the cd. it then adds reiserfs to the list of filesysems you can choose to format new filesystems with in the partitioning tool. pre-existing reiser filesystems will be handled correctly if you don't format them.

    rh supports reiser just fine - they simply hide the option, probably because they feel ext3 is a safer choice.

  27. Re:It may be interesting to know... by botzi · · Score: 2, Informative

    ..but however remember that both have different strong points:

    XFS is especially good in dealing with large(and when I say large I mean *LARGE*) files, and IIRC, the all idea behind the Reiser filesystem is to deal with small data objects.

    Anyway, it'd be interesting to see such a benchmark as XFS has the reputation of one of the best(the best???) fs to be used with high hard-disk I/O-s and even higher system loads.....

    --
    1. No sig. 2. ???? 3. Profit!!!
  28. In other news... by wirelessbuzzers · · Score: 4, Interesting

    Apple benchmarked their new G6 processor against the latest 10 GHz Pentium V. They say that despite its lower clock speed, it runs their suite of PhotoShop 8 filters almost four time faster than the Pentium.

    Seriously, Hans Reiser is benchmarking his own file system, and he's using benchmarks that make his system look good. Like the SpriteLFS, his filesystem has a log structure for sequential writing, which makes it look really good in tests like he performed where you write the files once.

    Compare a database load, where you write small chunks of big files all the time. Without the repacker (like the cleaner in LFS), the disk becomes horribly fragmented. With the repacker, you have to include the slowdown of this background process defragging your hard disk. Ick.

    I'll trust his benchmarks when he presents a final, stable release, with the repacker on, and tests it under workloads such as would be encountered on a server. I might use it on my homebox even if it sucks on a server, but it would be nice to know that he considers his structure's impact on other workloads.

    --
    I hereby place the above post in the public domain.
  29. linus by austad · · Score: 2, Interesting

    Didn't linus say no more new features at this point? Didn't Reiser try this same damn thing last time, fighting with the kernel people to get his stuff in after the feature freeze?

    --
    Need Free Juniper/NetScreen Support? JuniperForum
    1. Re:linus by Anonymous Coward · · Score: 2, Informative

      No. Linus stated very clearly that new drivers or filesystems or whatnot weren't affected by the feature freeze. He just didnt want any code going in that touched large portions of the core kernel code.

  30. Should I bother? by anthony_dipierro · · Score: 2, Interesting

    I'm not putting thousands of files in a single directory. I'm not using tons of small files, and my hard drives are more than big enough to hold my data. Is there any reason to use reiserfs instead of ext3?

    1. Re:Should I bother? by Anonymous Coward · · Score: 2, Informative
      If you look at the website, then you will see that their goal is to redefine the role of a filesystem. Some of the new features in version 4:
      • plugin-based architecture
      • new semantics that allow files to also be directories
      When was the last time you remember hearing about new features in a filesystem? Journaling was the last one I can remember and that's a few years old now.
  31. use Pavlovian Conditioning by jonadab · · Score: 3, Insightful

    > > they would just hit the reset button
    >
    > I've found users doing that to my servers before now. I find
    > that hitting them on the nose with a rolled up newspaper and
    > shouting "No! Bad monkey!" in a stern voice tends to stop this
    > behaviour...

    Even better: configure the services that the users use so that
    they don't start up at system start. Write a short script that
    starts them all, and whenever you restart the server ssh in and
    run it. That way, if users cold-reset the server, nothing will
    work until you fix it anyway, so you might be able to break them
    of that habbit then. (Otherwise, they'll just do it behind your
    back and not tell you; this way, they HAVE to come to you.) The
    only internet service that needs to start at system start in such
    a setup is sshd, and with that you can start up everything else
    from anywhere easily.

    If you have any coworkers (or underlings) clueful enough to handle
    a shell prompt, you can train them to ssh in and do a _proper_
    restart, and tell them how to start up all the services by running
    your start-services script.

    One more option: you can disconnect the reset switch. However,
    that won't stop them from just unplugging it, which I've found
    myself doing to Win9x systems when the reset switch does nothing,
    or on some Linux systems when shutdown -h doesn't turn off the
    power when it's finished, if there's no real power switch other
    than the on-only kind a lot of newer cases sport.

    So, just fix it so that doing Bad Things (like power-cycling)
    doesn't achieve perceived positive results.

    --
    Cut that out, or I will ship you to Norilsk in a box.
    1. Re:use Pavlovian Conditioning by dtfinch · · Score: 2, Insightful

      That's a good way to increase job security too, if they don't suspect anything.

      As for the "on only" power switches, most of the computers have bios settings for that, and if not, holding them down for about 4 seconds usually works.

    2. Re:use Pavlovian Conditioning by mystran · · Score: 2, Insightful
      Actually, at couldn't you just disconnect the reset switch and hook the power button such that it runs proper shutdown..

      Windows 2000 and XP do this. No reason why Linux couldn't.

      --
      Software should be free as in speech, but if we also get some free beer, all the better.
    3. Re:use Pavlovian Conditioning by jonadab · · Score: 2, Insightful

      > Actually, at couldn't you just disconnect the reset switch
      > and hook the power button such that it runs proper shutdown..

      You could, but that doesn't stop users from unplugging the AC power
      cord and plugging it back in, which is what they'll do if the reset
      switch doesn't work. You want things set up so that if they do that
      once, they have no motivation to do it again next time (because it
      didn't accomplish what they want, just like you said it wouldn't).

      The problem is, common but unreliable operating systems have people
      conditioned to solve problems by power cycling, because that's often
      the only way and even more often will work. If you want them to not
      do it, it has to not work.

      Another way to accomplish this is to set things up so that fsck will
      require significant (and, to a user, scary) user interaction if the
      filesystem was not cleanly unmounted. That used to be the default,
      but these days the default is probably journaling and no fsck at
      all, or at least for fsck to run with no user interaction. For a
      system end users don't have physical access to, that's good, but if
      end users are present, it tends to give them the wrong idea (namely,
      that unclean shutdowns are no problem).

      This is an axiom every IT person should learn well: what's good for
      a system end users touch directly and what's good for a system whose
      users are powerusers or admins or developers are two different
      things. A server can be in the latter category, but only if end
      users can't walk up to it and use it directly. Lock it in a server
      room and change the lock so that only the IT staff can get in (no
      janitors, no non-IT managers, nobody but IT staff), or colocate it
      in a datacenter, and you can treat it as an admin-type system.
      Set it on a desk in the open office area where Joe Secretary can
      touch it, and you have to treat it differently.

      Though, I have a cgi server in an open office area... but I've
      evaluated the risk. The power switches and stuff are toward a
      wall (and it can't be slid out without being lifted, I'm the one
      all my coworkers call to lift things), and it's headless, and it's
      not the least bit mission-critical, and it backs up everything
      that matters at all over the network daily on a cron job, and even
      so I know what fire I'm playing with, having it where it is. I've
      considered the possibilities, and the most worrisome one is the
      power bar it's plugged into (under the credenza) getting bumped
      inadvertently by someone reaching for a trashcan. I'm prepared
      for the possibility that could happen at any time. The fact that
      it's not the least bit mission-critical is significant to my
      decision to leave it there, rather than trying to make space for
      it in the closet with the T1 router.

      --
      Cut that out, or I will ship you to Norilsk in a box.
  32. Me too� by DarkVein · · Score: 3, Funny
    ...he will be probably sending in a patch within the next couple of weeks to be included in the 2.6/2.5 kernel.

    The first thing that popped into my head when I read that was "uh... feature freeze two weeks ago." I'm sure Reiser's going to have another of his babyish hissy fits. I see another slashdot story in the near future.

    <Reiser> Oh, what about my sponsors! Why are you all playa hat'n on me?

    <EveryoneNotOnTheReiserCheerleadingSquad> You don't get special treatment. You waited too long, and we're not putting your untested code into a frozen code base.

    <Reiser> You're oppressing me! The GPLv3 will fix you all!

    <SomeoneFromFSF> Please shut up, Reiser. You still don't know what the fuck you're talking about.

    <Reiser> Wahh!! My sponsors!

    <Everyone> SHUT UP REISER.

    --

    I'm as mimsy as the next borogove but your mome raths are completely outgrabe.

  33. Upgrading... by SealBeater · · Score: 2, Interesting

    What about upgradablity? The nice thing about ext3 is that upgrading to a new
    version is a reboot away. With ReiserFS, you had to re-format the drive, in
    order to upgrade. Has this changed?

    SealBeater

    --
    -- Its survival of the fittest...and we got the fucking guns!!!
  34. Re:Gee Re:Reliability by Just+Some+Guy · · Score: 2, Informative

    Yep. Root is a prime candidate for a small (100MB should do it) ext2 system mounted "sync". You don't need decent write performance on root; in fact, many sysadmins make it read-only. Journalling is pointless if you're only writing to the filesystem once every 6 months to add a new user.

    --
    Dewey, what part of this looks like authorities should be involved?
  35. Re:Warning by hansreiser · · Score: 4, Insightful


    It is important that you use a distro that bases their kernel on 2.4.18, or later, when using ReiserFS. There exists a distro that bases their enterprise server kernel on 2.4.9, and intentionally declines to add any reiserfs bugfixes since then. (This is the same distro that once shipped their kernel with the ReiserFS debugging code turned on so that we would go slow.) Do NOT use their kernel with ReiserFS. Generally when someone reports that they are having really bad experiences with ReiserFS, it turns out they are using that kernel.



    I generally recommend using the latest official kernel from Marcelo, and not any distro kernels, but the SuSE kernels tend to have effective ReiserFS support also, and not everyone out there shares my non-technical preference for a common community developed kernel.

  36. Re:ok by Isldeur · · Score: 2, Funny

    The simplest way I found to move to reiserfs was to change all the LABEL=??? specifications to actual device files, boot from a recovery disk, move everything around while reformatting the partitions as another filesystem, then finally rebooting.


    Really? The simplest way I found to move to reiserfs was to use Suse.

  37. Re:Gee Re:Reliability by Daniel+Phillips · · Score: 4, Insightful

    Root is a prime candidate for a small (100MB should do it) ext2 system mounted "sync". You don't need decent write performance on root; in fact, many sysadmins make it read-only. Journalling is pointless if you're only writing to the filesystem once every 6 months to add a new user.

    On the contrary, that's exactly the case where you should always journal, and with full data journalling. You don't care about write performance, since you hardly ever do it, but you do care at lot about keeping your root filesystem consistent.

    --
    Have you got your LWN subscription yet?
  38. some questions abou ReiserFS by Pflipp · · Score: 2, Interesting

    I have some questions abou ReiserFS and was wondering whether someone out there would be able to answer them.

    First off, there's this stuff with ReiserFS storing fine-grained data. Does this imply that using ReiserFS (v3 or v4) directly as a database would be efficient? I know RFS doesn't have Relational features, but these might very easily be implemented in userspace if you can store e.g.: .../customers/0001/name .../customers/0001/phone .../customers/0001/agent -> .../personnel/0021
    (...etc.)

    Am I losing this or getting this???

    My other question was about this metadata-as-file thing. Hans can implement whatever he wants, but it just so appears that Linux behaves like Unix. I've just made a ReiserFS partition to check, and there's no way I can "touch foo; ls foo/" to see e.g. permissions etc.

    Now I'm aware that this might be v4 stuff, but I wonder if anything of this is ever to be seen back in Linux userland? E.g., will it be possible for projects to use ReiserFS to change the paradigm used for metadata using a straight Linux kernel?

    See, from time to time I just happen to be quite impressed with the "everything is a file" applied to metadata, and I hope we can make the shift to this future one day, and finally get rid of file extensions, MIME guesses and app association registries in Linux, and store this stuff in metadata space.

    --
    "We can confirm that Debian does *not* ship the version with the trojan horse. Our version predates it." [CA-2002-28]
    1. Re:some questions abou ReiserFS by hansreiser · · Score: 4, Interesting

      I get reports (not verified by me) that ReiserFS V3 is an order of magnitude faster when used as a backend for an XML database than relational databases that were tried. So, if your data happens to have a hierarchical structure, or, you can put it into one, then you are likely to get a performance gain. If your data does not have a hierarchical structure, then you need to wait for V6 where we plan to expand the semantics.

      If you want to be able to "cat filenameX/..owner" to see who owns "filenameX", you need to use V4.

  39. Right. by haraldm · · Score: 2, Informative

    Having used and tested about every file system, I have been using XFS exclusively even for my customers. Ext2 is solid but not good for production machines which need to be able to reboot without a manual fsck (this is why we started discussing journaling file systems in 1997 after all). ReiserFS managed to shred some of my files on root filesystems. Ever had a file which you could not delete without the rm command going haywire? Since then, I've been using XFS even on the largest RAIDs without any problem.

    But let's wait and see how fast and stable reiser4 is once it matures into the stock kernel.

    --
    open (SIG, "</dev/zero"); $sig = <SIG>; close SIG;
  40. In that case... by twoslice · · Score: 2, Funny

    The drivers are crap, and the box dies about every week or so

    You should call the server "Kenny"!

    As in Southpark: Oh my God you killed Kenny!

    --

    From excellent karma to terible karma with a single +5 funny post...
  41. Re:Gee Re:Reliability by SaDan · · Score: 3, Insightful

    It's nice to have your log files mostly intact after a hard reboot, too.

  42. XFS is not mature... not in linux anyway by bani · · Score: 2, Informative

    The core XFS might have been mature in IRIX but it most definitely is not in Linux.

    XFS is still not a fully linux-native fs, it is IRIX code pasted into Linux through a special translation layer. And thats where most of the problems arise, as it has to translate IRIX-isms to Linux-isms.

  43. Re:Warning by Anonymous Coward · · Score: 2, Informative

    No Red Hat isn't afraid of kernel upgrades.

    Red Hat is providing an ABI guarantee that Linux normally doesn't provide though.

    Specifically, other vendors (Oracle, Veritas, IBM, etc) have kernel modules compiled specifically for the Red Hat enterprise 2.4.9 kernel.

    Red Hat has committed to keeping the source and binary interfaces stable and compatible for the duration of the 5 year life cycle of Red Hat Linux.

    Updating the kernel would break all of that.

    BTW, yesterday Red Hat released a beta of the RHEL 3.0

  44. Re:Out of the server world.... by ScSurferDude · · Score: 2, Informative

    For the most part, ext3 should work fine for individual users. If your were doing a lot of sound or video, you might see a performance improvement with reiserfs. The big issue with jounalling is fast reboots after unclean shutdowns. This might happen a lot at home if you have small children (future sysadmins!). ext3 and reiserfs-v3 only journal the meta-data, ie, the structure of the FS, not the actual file data itself. But that is huge, since it essentially prevents an unrecoverable disk, and eliminates the need to completely walk the structure tree when mounting an unclean disk (fsck), very time consuming. But you can lose data on open files.

  45. F**k Pavlov, use Skinnerian Conditioning by duck_prime · · Score: 2, Funny
    So, just fix it so that doing Bad Things (like power-cycling) doesn't achieve perceived positive results.
    None of this namby-pamby "fail to reward unpositive behavior" stuff. You are the sysadmin, and it is better to be feared than loved.

    User: Um, I rebooted the box. Er, was that, like, okay?
    You: Und so... Ve haff earned ourzelfs ein timeout in "The Box", hein?
    User: Aiiieieieieeeie!
    You: Mua-ha-ha!
    [Fade to black]
  46. Re:Patch for FreeBSD request?? by Billly+Gates · · Score: 3, Interesting
    Hans. Thanks for all your cool fs work. I have a request. Would you be interested in porting Reiser to other os's and maybe creating a bsd licensed version for them?

    I like the UFS2 FS for FreeBSD. Its stable but a little sluggish. I think it would be cool to have internal competition but the MS GPL == viral crap has made a dent into the BSD developers. They fear linking to anything gpl would make their kernel gpl as well.

    Anyway this is just a pretty please with a cherry on top. Especially since you are being paid for by grants from DARPA who use Solaris, Linux, FreeBSD, and every Unix os under the sun.

  47. Apologies, couldn't resist by Linux_ho · · Score: 2, Funny

    >> ...we also have some 3Ware cards we could try if the new Promise drivers don't do the trick.
    >>
    >
    > What are you waiting for! You'll be trying the 3ware cards in a couple weeks almost guaranteed ;).


    Is that a promise?

    --
    include $sig;
    1;
  48. Features by einhverfr · · Score: 2, Interesting

    ReiserFS is really good at some things (and I actually recommend it for some things) but for my main system filesystems, I find that Reiserfs lacks some of the things that EXT2/3 has that are very useful--

    File attributes including things like "Append Only" although these can also be ways of screwing up your system (who'da thunk Qmail doesn't like append-only log files?).... See man chattr for more info....

    So I am happy to use it for directories where I don't have special needs for the filesystem, but for others, I have to still use ext3.

    --

    LedgerSMB: Open source Accounting/ERP
    1. Re:Features by hankaholic · · Score: 3, Informative

      I'm interested.

      Are there features other than attributes provided by ext[23] which ReiserFS can't provide?

      Regarding the attributes, ReiserFS can't do the following "useful" things:

      - Excluding a single file from access-time logging.

      - Append-only would be useful in certain cases.

      - 'c' (compress file contents) -- Ext[23] doesn't do this yet, and ReiserFS doesn't either, but they both will support it later.

      - 'D' (dirsync) attribute -- Other than ensuring that filesystem metadata operations are completed successfully before proceeding, I can't imagine a beneficial use of this attribute. ReiserFS already journals all metadata, and V4 journals everything (and does it faster).

      - 'd' (do not back up using dump) -- AFAIK you can't use dump to back up a ReiserFS partition anyways, but tar still exists, and I'm sure there are wrapper scripts for the die-hard dump users.

      - 'E' (error in compression) doesn't seem to apply to general (current) use of ext[23], and doesn't have an analogue in ReiserFS.

      - 'I' (hash-indexed directory) -- ReiserFS' directory algorithms are much (much!) faster than ext[23]. ReiserFS will eventually support directory plugins, which allow one to set a per-directory hashing scheme. I note that ext[23] doesn't allow modification of this attribute, so I'm not sure if/how it allows you to choose a hashing scheme.

      - Immutable ('i') bit -- This might be useful in some circumstances, such as a setuid binary. Not supported by ReiserFS.

      - The 'j' option is meant to get around the fact that ext[23] only journals metadata unless mounted with "data=journal". As mentioned elsewhere, ReiserFS' long-term goal is to provide fully atomic file operations, which provides the benefits of data journalling with none of the burdens. Thus, ReiserFS V3 doesn't support this attribute, as the time spent in adding this feature would have been better spent working on V4. Note that with V4 all operations are atomic, which has the effect of turning this flag on for every file.

      - 's' (zero file contents upon delete) -- ReiserFS doesn't support this.

      - 'S' (write file synchronously) is likely used in cases where preserving file integrity is important. ReiserFS' journalling (especially in V4) provides this "for free".

      - 'T' (top of directory heirarchy) is used to get around an ext[23] weakness. I don't recall details, but I believe it's something related to preventing fragmentation and grouping similar files together for enhanced performance. ReiserFS (especially V4) is already quite fast, and with code-level optimizations will likely get faster.

      - 't' (don't merge tails) does not apply to current ext3 code, as tails aren't yet merged anyways. This will be used as a flag to write the file to disk in such a way as to be compatible with LILO and other boot loaders. At present I'm not aware of any special issues with ReiserFS' tail packing (which does exist, by the way) with respect to LILO. Note that a ReiserFS partition can be mounted with the "notail" option if you want, which is feature-wise equal to ext[23], which don't seem to provide tail packing at all right now (and if they do, someone needs to update the chattr man page).

      - 'u' (back up upon deletion) -- ReiserFS doesn't support this.

      - 'X' and 'Z' relate to compression, and do not apply to ext[23] or ReiserFS at the moment. ReiserFS will eventually support compression plugins, as well as encryption plugins.

      So of the attributes, the only ones which might actually be of some use in the context of ReiserFS would be exclusion from atime logging, append-only, the immutable bit, zero-contents-upon-delete (although future crypto plugins will destroy the need for this feature), and back up upon deletion.

      Anything I've missed?

      --
      Somebody get that guy an ambulance!
  49. Re:Gee Re:Reliability by SnowZero · · Score: 2, Informative

    Yeah, who is the guy anyway. Oh wait, he's a kernel developer who has worked on journaling file systems (Tux2,ext3). Your parent was not bogus, he was right. Just because you flush things to disk aggressively doesn't mean they are always consistent. There are still short times where things would mess up if the machine lost power. With full journaling the file system is *never* inconsistant. The only thing you'd want sync for is the push the latest cache versions out to disk; but it would be best to do that on a journaling FS.

  50. Re:reiserfs 3.6 is also very fast by javamutt · · Score: 2, Informative

    This would be a much mre useful posting if you specified what command was behind that `time` output.

    I've been doing similar testing and have seen significant differences in numbers based on the io load / pattern. Each FS has its own strengths + weaknesses.

    Was it: random or sequential? big files or small files? consecutive io or single reader/writer?

    This stuff makes a BIG difference in the relvance of those numbers. Yuo probably know that if yu're doing testing, but it'w worth psting for those wh may not.