Slashdot Mirror


Ask ReiserFS Project Leader Hans Reiser

Hans Reiser leads a successful Free Software project that has attracted plenty of attention, many users, and even that Holy Grail of so many who have started their own Free or Open Source projects: Big-time funding from DARPA, SuSE, and others. How did he do it? What's his advice for other project leaders? Ask him! And ask him any other question you have in mind. Please stick to one question per post, and avoid questions that can be answered with a few minutes' worth of research. We'll publish Mr. Reiser's answers as soon as he gets them back to us.

83 of 343 comments (clear)

  1. Name... by Anonymous Coward · · Score: 3, Funny

    Where did you come up with the name for your project? :)

    1. Re:Name... by Bill+Privatus · · Score: 3, Funny

      More to the point - and I've checked but not found it - how do you pronounce 'reiserfs'?

      1. riser fs
      2. razor fs
      3. none of the above (please fill in blank): _______
      --
      Redundancy is good; triple redundancy is twice as good! - Me.
  2. What is the future of ReiserFS... by Gortbusters.org · · Score: 5, Interesting

    When you look at your Linux installation scripts today, you see that ext3 is all over popular distrobutions like RedHat. ReiserFS used to be the most popular journaling filesystem for Linux, but how will it shape up given the mass adoption of ext3?

    --
    --------
    Free your mind.
    1. Re:What is the future of ReiserFS... by arth1 · · Score: 5, Informative

      ReiserFS main competitor isn't really EXT3.
      EXT3 is a journaling addition to EXT2, and much more interesting for people who want to change their existing file systems instead of creating new file systems. Note that EXT3 is slower than both ReiserFS and EXT2, but it does have journaling, and provides faster reboots :-)

      The main competitor for performance is SGI's excellent XFS. The latest implementations are quite solid, and the performance likewise are excellent. Even compared to ReiserFS.
      Both ReiserFS and XFS suffer from the potential of data loss on system failures, and XFS probably more so than ReiserFS, as tiny files might not be committed at all. However, for RAID users, I can not see any reason to use ReiserFS instead of XFS, and definitely not EXT3 unless upgrading the file system.

      Regards,
      --
      Arthur Hagen

    2. Re:What is the future of ReiserFS... by volkerdi · · Score: 2, Informative

      SGI's XFS still occasionally hangs my machine under heavy load. Plus, by the time they have a release out for 2.4.20 (they still don't), I'm sure I'll be running 2.4.21. In addition, it's still not part of the standard kernel sources. XFS would have to be considered the least supported choice of the three.

      Even though ext3 is a journaling filesystem, it still does a lengthy (and annoying) filesystem check every 20 mounts or so. To its credit it has never found an error, but still. I thought getting rid of that stuff was why we wanted journaling filesystems.

      ReiserFS has been rock solid for me, and has been the default Slackware filesystem for two releases. I don't forsee something else replacing it as default any time soon. It's still a bit of a moving target, though... if you're thinking of running a few different kernel versions you may run into situations where your filesystem has features that are too new to be mounted. (In those kinds of cases ext2 is still the safe choice)

      There's also IBM's JFS. The one thing I've noticed about that is that a newly formatted partition won't mount cleanly until you've run fsck.jfs on it. This doesn't inspire great trust, but other than that I've had no problems while testing it.

    3. Re:What is the future of ReiserFS... by GrenDel+Fuego · · Score: 3, Informative

      Even though ext3 is a journaling filesystem, it still does a lengthy (and annoying) filesystem check every 20 mounts or so. To its credit it has never found an error, but still. I thought getting rid of that stuff was why we wanted journaling filesystems.

      I personally think that the occasional check is probably a good idea, but if it annoys you then you can always change the interval, or even disable it.

      Just use "tune2fs -c <how many mounts> /dev/PARTITIONNAME"

      -c 0 should cause it to not use that functionality.

    4. Re:What is the future of ReiserFS... by opk · · Score: 2, Informative

      SGI's XFS still occasionally hangs my machine under heavy load. Plus, by the time they have a release out for 2.4.20 (they still don't), I'm sure I'll be running 2.4.21.

      Just go to http://oss.sgi.com/projects/xfs/patchlist.html and pick up the patch against 2.4.20. Works very well for me. All the releases get you is a bunch of release notes and rpms against RedHat kernels. I always get these patches which come out very promptly after the stock kernel release and work very well.

    5. Re:What is the future of ReiserFS... by Surak · · Score: 2, Informative

      Gentoo -- and no, I'm not, Gentoo gives you the choice of vanilla sources, gentoo sources, xfs sources, etc.

  3. Why ReiserFS? by wizzy403 · · Score: 5, Interesting

    What specific benefits would I see by switching my box to ReiserFS as opposed to EXT3 or one of the many other journalling file systems that Linux now supports? At this rate, seems like I could be reformatting my hard drives every week.

    1. Re:Why ReiserFS? by Spacelord · · Score: 3, Interesting

      I chose ReiserFS over ext3 because it allows online resizing of my filesystems (in combination with the excellent LVM of course).

    2. Re:Why ReiserFS? by dasunt · · Score: 2, Informative

      Integrity of data? Er, please read up on ext3 - a journelling filesystem, same as reiserfs, that seems to have the same (or slightly better) filesystem integrity as reiserfs.

      The correct answer would be along the lines that reiserfs is better at handling some files then ext3 - especially small files. I have a ton of text files on an 80 gig shared drive - all small files. Since I'm using ext3, a lot of space is being wasted.

    3. Re:Why ReiserFS? by Anonymous Coward · · Score: 2, Interesting

      He asked why ReiserFS is better than other, similar FSs. You said it was better. Could you clarify why it's "better"? Exactly why is it better/more reliable than, say, ext3? They're both journalling AFAIK. Does one have a "better" journal than the other or something?

      Sorry if I sound trollish or anything, but your post sounded kind of like the circular-logic linux trolls around here. "Why is linux better?" "Because it's linux." "Uh, yeah, but WHY?" "It's better because it's the best!"

    4. Re:Why ReiserFS? by rossjudson · · Score: 2, Informative

      As a Java developer this is what I am interested in...Java produces very large numbers of small files. Any file system that handles this more efficiently is going to make for faster compilation.

  4. Good business planning by mao+che+minh · · Score: 5, Interesting
    Did you embark on this project in hopes of making a profitable business? It certainly seems that way, considering that you went looking for sponsorship and even planned pay-per-incident support, showing that you were prepared to work the whole "support revenue" angle.

    Now you just need to hire someone to desire a modern, more "commercially pleasing" website. =)

    1. Re:Good business planning by Surak · · Score: 2, Informative
      More than just the support revenue angle, Hans has made money off of reiserfs by directly selling the code. From the reiserfs/README file:

      Source code files that contain the phrase "licensing governed by reiserfs/README" are "governed files" throughout this file. Governed files are licensed under the GPL. The portions of them owned by Hans Reiser, or authorized to be licensed by him, have been in the past, and likely will be in the future, licensed to other parties under other licenses.

      Among his customers have been DARPA and BigStorage, which are noted sponsors right on the front page. I think I remember reading that BigStorage is using ReiserFS for some sort SAN.

  5. ReiserFS and laptops by strredwolf · · Score: 5, Interesting

    My main question(s) is this(these): Is ReiserFS tunable to reduce usage of a laptop HD? If not, will it? As is, it's great on low capacity (under 1 gig) drives (I'm not sure how much overhead it imposes in comparision to ext2/ext3). However, it also likes to keep the drive active, which drains laptop batteries.

    --

    --
    # Canmephians for a better Linux Kernel
    $Stalag99{"URL"}="http://stalag99.net";
    1. Re:ReiserFS and laptops by Lukey+Boy · · Score: 5, Informative
      I had the same problem. Disable access time in your fstab file and the drive will not be so frequent - apparently ReiserFS spools and flushes the atime data, keeping everything spun up. Make a line in fstab like this:

      /dev/hda5 / reiserfs noatime,errors=remount-ro 0 1

      In fact, I disable access time tracking on every box I work with. I haven't found a worthwhile reason to ever enable it. And that's my 2 cents!

    2. Re:ReiserFS and laptops by arth1 · · Score: 3, Informative

      atime can be quite useful for caches, like client and proxy web caches and man page caches. It's also used for other services that expire data based on access time, like usenet leaf servers, and log rotating programs.

      Before turning off atime, I advise that an effort is made to identify what data really needs atime, and if possible create separate partitions for those, with atime enabled.

      Regards,
      --
      *Art

    3. Re:ReiserFS and laptops by Utoxin · · Score: 3, Informative

      atime is necessary for one major component of a lot of websites: The PHP Session files.

      The default PHP session handler uses the atime of the files to expire them properly. If they don't have atime, they get expired prematurely. (I think... It's been a while since I made the mistake of noatime on the partition that holds the session files.)

      My solution to this is to use noatime everywhere except the /tmp partition. I also use notail on the /tmp partition, and anywhere that has frequent file IO.

      --
      Matthew Walker
      http://www.tweeterdiet.com/ - My Diet Tracking Tool
    4. Re:ReiserFS and laptops by vofka · · Score: 2, Informative

      Wait until one of your boxes gets r00ted, and you (or some other poor soul dealing with one of your mangled boxes) need to do some fairly in-depth forensic analysis on the box to work out exactly what was happening, to what file, in what order.

      The Access Time attribute can yield some useful clues to what was going on during an attack when you are doing a forensic analysis. Sure, there are plenty of other things to look at before you get that deep into things, but it's still useful to have sometimes!

      --
      Disclaimer: I meant what I thought, not what I wrote! What? You can't read my Mind? Oh dear!
  6. Guideposts? by TopShelf · · Score: 4, Interesting

    Having obtained financing for the project, how does that impact the future direction of development? How do you balance the interests of developers, users and sponsors to choose which updates to pursue?

    --
    Stop by my site where I write about ERP systems & more
  7. So here we go ... by SuperDuG · · Score: 4, Interesting
    As an opensource programmer and a rather large figure in the opensource community...

    You've probably noticed that there are quite a few journaling file systems for Linux and other Open/Free/Unixie operating systems that are fairly open or completely open source. Have you ever thought about getting together and making "THE" journaling filesystem for these free OS's? I know that open source is all about choice, but as you probably also know there are certain things EXT3, XSF, etc. can do that ReiserFS cannot. If their could be a hybrid made from the big journaling filesystems would you be part of it?

    --
    Ignore the "p2p is theft" trolls, they're just uninformed
  8. Another reiser4 by $alex_n42 · · Score: 2, Interesting

    Once Reiser4 comes out, should I switch or just stick to ReiserFS? Why, why not and so forth?

    1. Re:Another reiser4 by Wolfrider · · Score: 2, Informative

      --Reiser4 was planned from the ground up to surpass v3. One of its features is delayed block allocation until write-to-disk, which is expected to make the whole filesystem much more efficient. They will also be making the size of the journal smaller, which should finally enable me to start using Reiserfs on 100-Meg Zip disks. :)

      --Hans has said in the past that he believes filesystems should be re-written from scratch every few years, so they can take advantage of algorithm improvements and new concepts. He's making good on his word. The V4 whitepaper ( http://www.namesys.com/v4/v4.html ) is an interesting read, especially if you're into/understand database design.

      --
      .
      == WolfriderV6 == I'm willing to admit that *I just might* be wrong... Are you??
  9. Hash collisions by gazbo · · Score: 5, Interesting
    It is fairly well known that on a large filesystem the probability of a hash collision becomes unavoidable, and in ReiserFS that results in data-loss as the original data is overwritten.

    Although its speed and (otherwise) good level of data integrity is of great interest to my employers, before I can recommend it to such a large company, I would like to know how and when you intend to put in a fix.

    1. Re:Hash collisions by Anonymous Coward · · Score: 2, Insightful

      Uh, I'm not sure you want to ask a question on which your boss's decision or your job depends in a Slashdot interview thread...

    2. Re:Hash collisions by gazbo · · Score: 2, Informative
      Statistics not your strong point? By your reckoning lightning should never ever strike anywhere because the probability of it striking anywhere is so slim.

      The tricky thing about solving the hash problem (in cryptography) is finding a value that when hashed matches a given string. Here, we are saying that given several hundred thousand keys, what is the probability that any two of them hash to the same value.

      The probability is far enough from zero to be a significant danger. Just because hashtables and one-way encryption both use the hashing algorithms does not mean that you can use the same figures.

    3. Re:Hash collisions by Troed · · Score: 2

      My suggestion still stands - why you're modded informative I can't understand.

      Here, we are saying that given several hundred thousand keys, what is the probability that any two of them hash to the same value.

      Low enough.

    4. Re:Hash collisions by Q+Who · · Score: 2, Informative

      ReiserFS doesn't use cryptographic hash by default.

      Get a clue before you post irrelevant (and incorrect) information.

  10. Comment removed by account_deleted · · Score: 5, Interesting

    Comment removed based on user account deletion

  11. Why did you bother? by BeowulfSchaeffer · · Score: 5, Interesting

    What was so lacking in other file systems that you felt compelled to write your own? How do you think yours compares with the others?

  12. Does Reiser4 have a chance merging into 2.6? by Xpilot · · Score: 5, Interesting

    I've been a fan of Reiserfs ever since I started using it, and I've been very impressed by its speed. I'm really anticipating Reiser4, and I hope it'll get merged into 2.6. However the last I heard about this is that Linus hasn't read your docs yet, and the status page on kernelnewbies shows Reiser4 to be "pending". Can you give us any insight on what's going on?

    --
    "Backups are for wimps. Real men upload their data to an FTP site and have everyone else mirror it." -- Linus Torvalds
  13. Future of File Systems by Simon · · Score: 5, Interesting
    Mr Reiser,

    Can you clarify your long term goals with respect to ReiserFS and file system design in geek lay man's terms?

    I'm refering to your Future Vision paper in particular. How would this kind of system be used on a day to day basis?

    Right now any mention here of adding database like functionality to a file system is met with crude comments about SQL... Some explaination would really help.

    cheers,

    --
    Simon

  14. Undelete by Anonymous Coward · · Score: 2, Interesting

    When will reiserfs have undelete facility (a la) debugfs for ext2/3?

  15. Database-like features by drinkypoo · · Score: 5, Interesting

    As I understand, what ReiserFS offers that "enterprise-class" journaling filesystems like JFS and XFS don't is the ability to treat the filesystem more like a database and less like a tree, but I haven't really heard of anyone getting serious about using this functionality. Why did you bother to add it, who do you think will get the most use out of it, and how would you promote it, given the chance?

    --
    "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  16. Reiserfs vs. ext3 by Rosco+P.+Coltrane · · Score: 4, Interesting

    I've used reiserfs for years and was always very satisfied with its performance and stability. Recently, I convinced myself that ext3 had gotten as stable as reiserfs. So I switched to ext3, mainly because I ext3 == ext2 + journal, so should the need arise, I can mount my ext3 partition as an ext2 one. I haven't run any test on my new ext3 partition, but they look like they're doing okay and ext3 behaves well in case of a crash.

    So fo me, today, in terms of usability ext3 == reiserfs + ext2 backward compatibility. As much as I hate to admit it, I have now given up on reiserfs.

    So my questions are : can you compare reiserfs and ext3 in terms of performance and reliability, i.e. are there good reasons to keep using reiserfs ? Do you think ext3 might become the de-facto standard for Linux journaling filesystems ? How would you feel about it, after all the work you've done on reiserfs ? :-)

    --
    "A door is what a dog is perpetually on the wrong side of" - Ogden Nash
  17. Is it safe? by burgburgburg · · Score: 2, Funny
    Is what safe?

    Let me have a look at that tooth. Oh, you've got a cavity. Here let me ...

    Aaaaaghhhhhh!!!!!!!

    Now, is it safe?

  18. Versioning by tjansen · · Score: 5, Interesting

    Beside the finding and organizing files, the biggest problem for desktop users today is probably that changes on the file system are not recoverable. It is easy to accidentally overwrite a file and lose your work, and the only only sane way to solve these kinds of problems would be to make it possible to revert changes.

    Several research systems have been created, like the Elephant File System, but none of them made it into the mainstream free and commercial operating systems. Are there any specific reasons why nobody offers recovery (high complexity in implementation, very bad effect on performance, etc) or is it just because FS designers don't see the need for it?

  19. speed with many many small files by donkiemaster · · Score: 5, Interesting

    We ran reiserfs for a short period a few years ago because we were saving data to 10's of millions of small files, and reiser seemed like a perfect fit. We had to stop however because as we added more and more files, disk access slowed down more and more. It eventually became almost unusable and we had to switch back to ext2. How much faster is it now than it was then? Have you completely solved this problem?

  20. 3 Questions for Open Source Project Management by Dominic465 · · Score: 5, Interesting

    (1) Software project management techniques seem to vary by discipline. Some groups follow the traditional âoewaterfallâ(TM) model others follow Rapid Programming models â" Do you try to follow a particular style? (2) Classic project management spends a lot of time accounting for resources. I can imagine that an open source project has people signing-on and dropping off. How did you keep assignments & scheduling under control? (3) Have you come up with any tips or tricks that work for managing your project that might be useful to other software development Project Managers?

  21. Filesystems and metadata by androse · · Score: 5, Interesting

    In your Future Vision white paper, last modified in January 2001, you outline several very interesting ideas about metadata.

    Several developements have taken place since : the extensible attributes of BeFS has been burried with BeOS, the database-like metadata of Longhorn (aka Yukon) may actually be a separate layer from the filesystem altogether, and Apple is also moving all metadata out of the filesystem to XML files shared between applications (see iLife package).

    My question : What is your current take on the metadata debate ? Do you still think the filesystem is the right place to handle metadata ? Any predictions ?

  22. Snapshots? by afidel · · Score: 4, Interesting

    As a leading expert on linux filesystems I was wondering whether you were aware of any snapshot implementations for linux similar to those found in Network Appliance filers. Since working in a shop that used Netapp's I have always found myself wishing for the admin time saving feature of snapshots without the high cost of a filer. Veritas's snapshoting is whole volume only as are the other solutions I have seen outside of Netapp, is this a technical hurdle that no one else has been able to jump or is it a patent issue? Thanks for your time.

    --
    There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    1. Re:Snapshots? by Spacelord · · Score: 2, Interesting

      LVM has snapshot capabilities built in, independent of the filesystem (as it should be!).

      I haven't used it in a production environment yet, but I've experimented with it and it seemed to work well.

  23. When do you expect it to be released? by Drinian · · Score: 2, Informative
    From the front page of the website:

    "Reiser4 is due June 30, 2003!"

  24. reiser4 plugins by i_am_pi · · Score: 5, Interesting

    Will dynamically-loadable filesystem plugins be feasible, or practical? What scope will the plugins have? Encryption/decryption is obvious, but what about more interesting things, like scan-on-access Antivirus, or perhaps a mail spool could be mounted with some kind of spamassassin plugin.

  25. A question that's interested me for some time… by shic · · Score: 5, Interesting
    I've often wondered about the most effective ways to lay-out files on a block device. While there are many obvious solutions, it seems, to my mind at least, a hard problem to determine an optimal solution. Assuming file-system goals include:
    • Minimizing Inner/Outer fragmentation
    • Maximising locality of reference
    • Minimizing the cost of file extension
    I'd like to know how Reiser FS approached these conflicting goals. Has Reiser FS used any mathematical models to evaluate long-term stability/performance?
  26. Double-edged sword of DARPA Funding by landoltjp · · Score: 3, Interesting

    Given the problems encountered by Mark West when DARPA pulled its funding on the OpenBSD project, are you concerned about what strings may be attached to such funding for ReiserFS, be they implied or explicit?

  27. where next? by wfmcwalter · · Score: 5, Interesting
    Hans,

    Reiser FS is already a pretty mature, stable, usable product. Once V4 is done, is there really much work left to be done on ReiserFS proper? Do you have a giant to-do list that'll keep you and the guys occupied for years, or do you intent to work in a diffent direction (SAN, networkFS, databases, etc.)?

    (or perhaps you'll just retire to Portugal and play lots and lots of golf)

    --
    ## W.Finlay McWalter ## http://www.mcwalter.org ##
  28. Re:debian-devel by astrashe · · Score: 2, Interesting

    I had the same reaction -- Reiser didn't seem so far out of line to me.

  29. Starting Large Free Software Projects by unsinged+int · · Score: 5, Interesting

    When you began a file system project as a free software project, you must have known that (assuming it worked) it had the potential to turn into a big project. How did you determine how long to work on it as your own project before making the first release? I imagine there must have been a strong temptation to just get it "out there" knowing its potential, yet certainly releasing too soon would make it look unprofessional and thrown together.

  30. Naming by sporty · · Score: 2, Interesting

    Obviously, you named ReiserFS after yourself. Why after yourself? Were there any runner-ups? What will happen when you create another great project? :)

    --

    -
    ping -f 255.255.255.255 # if only

    1. Re:Naming by uhoreg · · Score: 2, Informative
      The information doesn't seem to be in the current kernel, but in an older patch (search for treefs):
      Two other former employees were involved who won't be getting credit here because they tried to kill the project at the end of it, and almost succeeded (they cost me maybe two years). They wanted to force me to sell it to the company they tried to start. They get to keep the money they got from me, and that is it. When their "VC" said that he could get a hundred researchers to swear in Russian Court that I had had nothing to do with the development of treefs, I changed the name to reiserfs and registered the copyright.
      --

      To get something done, a committee should consist of no more than three persons, two of them absent.

  31. Re:debian-devel by TedCheshireAcad · · Score: 2, Funny

    Is it just me, or does that thread remind anyone else of "CRIPPLE FIGHT!!"?

  32. Raising Awareness by blinder · · Score: 4, Interesting

    One question I always have with regards to successful (meaning funded, wide acceptance, large user/developer community etc.) is how did you raise the awareness of your project to get it from just a side project to something that it is today?

    Did you use traditional PR techinques, or just through a community of connections?

  33. OpenBSD by chrysalis · · Score: 2, Interesting

    Is there any effort to port ReiserFS to *BSD systems?

    --
    {{.sig}}
  34. A simple and obvious question. by @madeus · · Score: 4, Interesting

    Apologies in advanced for a simplistic and obvious question but...

    What's the best way to get funding for a project, assuming the project is at some sort of usable prototype stage already (i.e. who should you approach, what ground work should be done)? How should one court potential sponsors? Is it better to wait for them to come to you, or should you present a formal funding request in a format similar to that of a traditional business plan?

  35. Trade offs by Anonymous Coward · · Score: 4, Interesting

    From what I have understood of file systems, data safety is priority number one. Considering that, are there any super speed algorithms you have dismissed because of "unsafe" behaviours?

  36. ReiserFS 4 vs. MS "SQL-FS" by Kernel+Corndog · · Score: 5, Interesting

    On the ReiserFS 4 page you say that the new filesystem uses algorithms allowing the FS to do transactions only databases could do previously. Do you envision ReiserFS 4 being something comparable to MS' "SQL-FS" plans in Longhorn? How would ReiserFS 4 affect regular (oracle, postresql, mysql) database projects/companies?

  37. Extended Attributes on Unix by Anonymous Coward · · Score: 5, Interesting

    Clearly Unix filesystems are moving in the direction of extended attributes. This isn't necessarily a smooth transition for Unix, where many utilities are modeled after streams of bytes.

    I personally have pondered about this a lot. What do you think needs to be adapted and why? Should the VFS be changed? Should traditional Unix tools like "cat" be adapted somehow, or given attribute-aware equivalents? Should file attributes be accessable at the file descriptor level? Will we need a new "tar"? What should be done to standardize these attributes, or make them compatible across filesystems?

  38. Unix filesystem layout by Erik+Hensema · · Score: 5, Interesting

    The directory layout of the typical unix filesystem has grown over the years to what has become the FHS 2.0.

    If you were in the position of designing an entirely new FHS, more suited to ReiserFS (lots of small files, database-like access, etc) and without backward compatibility, what would it look like?

    --

    This is your sig. There are thousands more, but this one is yours.

  39. Rules of thumb by realnowhereman · · Score: 5, Interesting

    In your future visions paper early on you talk about Reiser's Rule of Thumb #2. However, I can't find Reiser's Rule of Thumb #1 -- what is it? Is it a secret? Does it contain the sum of all human knowledge?

    TELL ME!

    --
    Carpe Daemon
    1. Re:Rules of thumb by Monkelectric · · Score: 2, Funny

      Rule #1 is don't talk about rule #1.

      --

      Religion is a gateway psychosis. -- Dave Foley

  40. As a newbie by Alkivar · · Score: 3, Interesting

    this question is aimed at satisfying the linux newbie.

    What features do you feel are most important/would convince a newbie to make ReiserFS the system to use as compared to the many other alternatives (XFS,EXT3,etc...) out there?

  41. Deadlocking? by IamTheRealMike · · Score: 5, Interesting

    I've spoken to a few people (not naming names) who appear convinced that the design of ReiserFS is fundamentally flawed, and that it would lead to inevitable deadlocking. I've read your white papers now several times (took me a while to figure them out), and think the work you're doing is great, but a few smart people seem to think it can't be trusted. How would you reassure those people that their data is safe?

  42. fighting the wrong battle? by Capablanca · · Score: 5, Interesting

    the redmond hegemony lost the internet protocol battle and decided to fight back with semantically rich but proprietary file systems (think 'halloween memo', 'longhorn', 'groove'). are you thinking about how to mutate ReiserFS to support arbitrary object attributes, full-text indexing, callbacks, namespace syntax for queries, etc?

  43. On being one of those "outspoken" people by salmo · · Score: 4, Interesting

    Mr. Reiser, first off I have no complaints about ReiserFS (which is a high compliment), I use it on almost all my machines, except a couple are running EXT3 because they're not heavily used and I'm lazy at times. But thats neither here nor there.

    You fall into an interesting subcategory of project managers or whatever you want to call them. I'll call it the "outspoken genius" category (even though the first word might be understated and the last is probably hyperbole). Basicly your work is technically interesting, applicable, etc. That's a give in. But there are quite a few people who have personal issues with you and your manner and usually cite some exchange or another. Sometimes this is the basis of an argument to reject the use of your work, which I think is somewhat silly. You're not the only one, and certainly not the first to be interviewed here.

    So what do you think about this? ie. Do you think you made interpersonal mistakes that landed you here or do you think you've been misunderstood? Does it bother you? Why do you think people enjoy egging on folks such as yourself and then citing the moment you get annoyed with them? Do you think this question ever has a prayer of being moderated higher than someone following the method of the previous question?

    Jeeze, I realize I just wrote an essay question in the style of one of my old Philosophy professors. You know the kind, here's a statement now write some stuff (I guess I'll give you a few ideas of where to go).

  44. How did you get the funding by josevnz · · Score: 3, Interesting

    I was wondering how complicated is to get funding for an Open Source project from big companies like IBM, SuSe or even RedHat. I mean, probably you have to create a "business plan" kind of document to explain why someone should invest money on your project, but what important things should be included as a part of such proposal? (or this is just a matter of have a proper "networking").

    Also, do you known if there is any company that dedicates itself to help Open Source projects, besides de FSF?

    Thanks in advance :)

    --
    Jose Vicente Nunez Zuleta RHCE, SJCD, SJCP
  45. Distributed Filesystems by LarryRiedel · · Score: 2, Interesting

    Filesystems like Coda and InterMezzo, network RAID applications like ENBD/FR1 and DRBD, and filesystem synchronizers like Unison provide some partial solutions for the need to have a distributed filesystem with replication and disconnected operation. Do you think Reiser4 or its successor will provide features which make it easier to implement a robust usable distributed filesystem?

  46. version control in ReiserFS? by Maimun · · Score: 5, Interesting
    Hans,

    I recall a discussion at lkml about the licensing issues with bitkeeper. Someone said that the future versions of your FS will have some version control built-in, and thus be a sort-of competitor to Larry McVoy's bitkeeper. Larry's comment was that if that happens, then you have to buy bk in order to use it.

    You suggested that indeed version control is going to be in. Can you say more about these plans now, and the eventual conflict with bitkeeper.

    Thanks.

  47. Researching filesystems by ProteusQ · · Score: 5, Interesting
    I'm going back to school this fall, and in a year I hope to be admitted into a Masters of Computer Science program. I'd like my main research focus to be on filesystems.

    I'm preparing by reading everything I can find: I'm working on Tanenbaum & Woodhull's "OS Design & Implementation"; I've read "Design and Implementation of the Second Extended Filesystem"; Steve Pate's "UNIX Filesystems" is waiting on my shelf; and of course, there's the FAQ and ReiserFS v.3 Whitepaper at www.namesys.com.

    Specific questions: what branches of math are useful in this line of research? Any books, articles, etc., that I haven't listed that are a 'must read' or 'should read'? Those who have succeeded in building a better filesystem: what have they done that I should also do? Any mistakes I should avoid? Anything that no one told you about filesystems that you wish you had known up front? And are there any special tricks (above and beyond mastering your subject) to getting hired in this field once a degree is in hand?

    Thanks!

  48. Re:Bad block handling! by markrages · · Score: 2, Informative

    Here's something to try:

    This month, I had two disk-failure on a 1.0 TB software raid5 with ReiserFS.
    I was able to copy most of the data with dd_rescue and myrescue.
    By the time I was finished mucking around, I had dome mkraid -f several times, so there were spots of missing data on the disk. The filesystem would not mount. So I used resierfsck --rebuild-tree, and once it completed five days later, I was able to mount the filesystem, with most of the files intact.

  49. Features vs. reliability by halfelven · · Score: 2, Interesting

    ReiserFS has got a large amount of features in a short time. Also, new features are added regularly, and new, massively changed releases were issued several times. There were concerns regarding how this could affect the reliability of your filesystem. What's your take on that?

  50. Re:Hash collisions misquote by _xeno_ · · Score: 2, Interesting

    Actually, it says by the time you reach a million or more files the chance for hash collision is approximately 100%. However, you can still potentially hit hash collosions way before then. The handling of hash collosions is the question - does it prevent the new file from being created, or does it overwrite the file, losing data. My understanding is the ReiserFS does the latter. I could be wrong though, and I hope I am.

    --
    You are in a maze of twisty little relative jumps, all alike.
  51. One very very important question. by jd · · Score: 2, Informative
    There are now a HUGE number of filesystems out there for Linux. To make the question make sense, I'll need to list a bunch.

    Non-Journalled, or Unsure:

    • Ext2 (Bronze-age tech. Better than stone-age FAT, neolithic VFAT or iron-age UnixWare)
    • v9fs (Plan 9 meets Linux, popular with nuke scientists)
    • Befs (Mmmmm.... Be-fy!)
    • NTFS (Can't write safely, then can't use much)
    • UFS (Same as above)
    • ISO9660 (If the image hasn't been burned, then you might as well make it read/write)

    Journalled:

    • Ext3FS (the XT-architecture of filesystems)
    • XFS (Very fast, but incompatible facilities)
    • JFS (The development is sloooow)
    • LogFS (abandonware, but fully-logging if anyone actually finished it!)
    • ReiserFS (Really nice, and an innovative use of B*-trees!)

    Network file-systems

    • Intermezzo (unreliable)
    • CODA (unreliable)
    • Novell Netware (reliable but ancient)
    • NFS (reliable but stupid)
    • CIFS (as reliable as Microsoft specs get)
    • Lustre (used by Linux supercomputers)

      To summarize: We have a horribly-large number of filesystems, most of which are incompatiable, many of which do not support the Linux security module extensions, one (e2fs) provides defragging under Linux, and none at all provide support for conversions.

      Hey, diversity is good! I -like- diversity! I want MORE diviersity! I also want ways to efficiently move data around.

      Will future versions of ReiserFS include additional userland tools for defrag, fs conversion, scope of logging (eg: none, meta, full), pluggable hashing algorithm, etc?

      Ultimately, all the choice in the world is no choice at all if there's no way to make use of those choices.

    --
    It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
  52. Supporting applications by ttfkam · · Score: 4, Interesting

    Having seen the BeOS development APIs, Reiser4 strikes a familiar cord with its metadata handling and querying. However, while BeOS had a captive audience (there were no BeOS clones/work-alikes), you are making ReiserFS a part of an established operating system and all of the inertia that "establishment" brings with it.

    Where BeOS failed (among other reasons) because of the perceived lack of applications, do you have any thoughts on how to seed development for ReiserFS.

    Case in point, where in other filesystem setups you would have a mail spool and/or mailbox as a custom file format (and indexes) on top of the filesystem, Resier4 allows for custom queryable file attributes (metadata) and the ability to efficiently store each message in the queue as a separate file. The mail handling program becomes proportionally less complex as that complexity has been shifted toward the common filesystem layer.

    That said -- mostly for the sake of folks that haven't been following Reiser4 development -- do you see a way of encouraging Reiser4-aware programs? After all, it's a chicken and egg issue. Without common usage of Reiser4, application developers will be less inclined to commit their time to it (perhaps?). Without applications that take advantage of its unique features, what reason does the user have to put it on their workstation or server?

    Are we looking at custom Linux distributions with this in mind or just piecemeal applications? Personally, I'm looking forward to dropping Ogg Vorbis files into a directory and, through the use of plugins, having their ID3 info automatically searchable. Hmmm... Where was that Fugees song again?

    --

    - I don't need to go outside, my CRT tan'll do me just fine.
  53. Performance metrics by ttfkam · · Score: 3, Interesting

    Do you have any updated performance metrics for reiser4 in comparison to ext3, xfs, jfs, and reiser3?

    I'd be curious to see both small file test, large file tests, and a mixture -- as is the case for websites with movies where the HTML and graphics are 3K but the media files are 50MB and up.

    --

    - I don't need to go outside, my CRT tan'll do me just fine.
  54. Location by 21mhz · · Score: 2, Interesting

    How is/was your business in Moscow?

    --
    My exception safety is -fno-exceptions.
  55. reiserfs fud by bani · · Score: 4, Interesting

    There seems to be a lot of FUD against reiserfs being spread by a small but vocal minority of people.

    Among other claims I have heard by the FUDsters - 'no large production systems (eg terabyte or larger) trust their data to reiserfs', 'xfs is superior to reiserfs because it has been well tested on irix', 'reiserfs has chronic data corruption problems', etc. Spin tactics that would make microsoft's propaganda teams proud.

    How do you respond to the FUDsters? What claims are bogus, and what valid objections to reiserfs are there? (The only legitimate problem I can think of - lack of full data journaling - is shared by xfs and jfs as well).

    There also seems to be a resistance by commercial distros to adopting reiserfs.

    What do you attribute the resistance to?

    1. Re:reiserfs fud by bani · · Score: 2, Insightful

      what fud?

      all i see is a plain, matter-of-fact notification of a license violation.

      reading the thread (which you apparently hoped i wouldn't) I see some less-than-civil replies from debian listmembers, and an apparent admission by other members that debian was indeed in apparent license violation.

  56. Re:please name names. by bani · · Score: 2, Interesting

    ... but you just DID repeat things told to you in private!

    "I've spoken to a few people (not naming names) who appear convinced that the design of ReiserFS is fundamentally flawed, and that it would lead to inevitable deadlocking. I've read your white papers now several times (took me a while to figure them out), and think the work you're doing is great, but a few smart people seem to think it can't be trusted."

    lets have the names behind those specific claims you posted. lets hear who the smart people are.

  57. Re:Hash collisions misquote by Anonymous Coward · · Score: 2, Informative

    Up to 127 filenames per directory (MAX_GENERATION_NUMBER, defined in include/linux/reiserfs.h) can have the same hash value. After this, creating more filenames with this hash is impossible (the EBUSY error code is returned). ReiserFS does NOT blindly overwrite files because of hash collisions.

    You can choose from multiple hash algorithms when you create the filesystem (faster hashes have a greater probability of hash collision). But collisions aren't a reason to avoid ReiserFS - most other filesystems (including ext2/ext3) won't get anywhere near a million files in a directory before suffering huge performance losses.

    The following code was taken from linux-2.4.21-rc1/fs/reiserfs/namei.c and demonstrates the handling of hash collisions.


    gen_number = find_first_zero_bit ((unsigned long *)bit_string, MAX_GENERATION_NUMBER + 1);
    if (gen_number > MAX_GENERATION_NUMBER) { /* there is no free generation number */
    reiserfs_warning ("reiserfs_add_entry: Congratulations! we have got hash function screwed up\n");
    if (buffer != small_buf)
    reiserfs_kfree (buffer, buflen, dir->i_sb);
    pathrelse (&path); /*
    * Trivial changes by Alan Cox to remove EHASHCOLLISION for compatibility
    *
    * Trivial Changes:
    * Rights granted to Hans Reiser to redistribute under other terms providing
    * he accepts all liability including but not limited to patent, fitness
    * for purpose, and direct or indirect claims arising from failure to perform.
    *
    * NO WARRANTY
    * This is one of two lines that this fix consist of.
    */
    return -EBUSY; /* I think it was better to have an error code with a name that says
    what it means, but I choose not to fight over it. Persons porting to
    other operating systems should consider keeping it as it was
    (return -EHASHCOLLISION;). -Hans */
    } /* adjust offset of directory enrty */
    put_deh_offset(deh, SET_GENERATION_NUMBER(deh_offset(deh), gen_number));
    set_cpu_key_k_offset (&entry_key, deh_offset(deh)); /* update max-hash-collisions counter in reiserfs_sb_info */
    PROC_INFO_MAX( th -> t_super, max_hash_collisions, gen_number );

    if (gen_number != 0) { /* we need to re-search for the insertion point */
    if (search_by_entry_key (dir->i_sb, &entry_key, &path, &de) != NAME_NOT_FOUND) {
    reiserfs_warning ("vs-7032: reiserfs_add_entry: "
    "entry with this key (%K) already exists\n", &entry_key);

    if (buffer != small_buf)
    reiserfs_kfree (buffer, buflen, dir->i_sb);
    pathrelse (&path); /* Following line is 2nd line touched by Alan Cox' trivial fix */
    return -EBUSY; /* I think it was better to have an error code with a name that says
    what it means, but I choose not to fight over it. Persons porting to
    other operating systems should consider keeping it as it was
    (return -EHASHCOLLISION;). -Hans */
    }
    }

  58. Why the difference by dorfsmay · · Score: 2, Informative

    Another big reason why a lot of people implement snapshot differently than NetApps, is to avoid shooting yourself in the foot. With NetApps, the snapshot data is kept on the same volume as the data itself, which leads to situation where you jump from say 50% usage to 99% just like that overnight (the snapshot area is allowed to run over the data area). This is quite a delicate situation as deleting files makes things worse (you have to get rid of old snapshots to free up space). I have seen big production database taken to their knees because of this.

    On the other hand, the other implementations are a bit slower because the blocks are copied instead of being just not deleted, but snapshots never take space from data. The implementer has to make the choice, space control and simple understanding of space vs. speed to snapshot and recovery.

  59. Re:Reiser4 by Wolfrider · · Score: 2

    --Hans, why does the ' mkreiserfs ' program not have a switch to check for badblocks? Also, what does reiserfs *do* if a block goes bad or a reiserfs system is created on a device with bad blocks?

    --
    .
    == WolfriderV6 == I'm willing to admit that *I just might* be wrong... Are you??