Reiser4 Filesystem Released

ext3 to reiser4 ? by Anonymous Coward · 2004-08-23 14:23 · Score: 4, Interesting

Will I be able to convert my existing ext3 fs to reiser4 fs without having to reformat?

Re:ext3 to reiser4 ? by Aardpig · 2004-08-23 14:26 · Score: 5, Informative

Will I be able to convert my existing ext3 fs to reiser4 fs without having to reformat?

No, you will have to reformat. However, I recommend the upgrade; I've seen a number of studies showing that the performance of ext3 is awful compared to reiserfs. The only arguable advantage of ext3 is its compatibility with the baseline ext2.

--
Tubal-Cain smokes the white owl.
Re:ext3 to reiser4 ? by Dwonis · 2004-08-23 14:28 · Score: 4, Insightful

I doubt it. In general, that's not necessarily possible (though you can get away with it in special cases). In any case, doing that without a UPS would probably be risky, since there would be a (probably very long) period of time where the filesystem is totally incomprehensible to BOTH filesystem drivers (old and new), and if the system dies during that time, say bye-bye to your data.
Re:ext3 to reiser4 ? by David M. Andersen · 2004-08-23 14:28 · Score: 5, Informative

Possibly using convertfs, but I have no idea if it works or not.

This page seems to have more info about it.
Re:ext3 to reiser4 ? by Aardpig · 2004-08-23 14:38 · Score: 4, Insightful

Would it be possible to copy all data from the ext3 partition to a network mountpoint (nfs, ftp, samba, etc...), format the drive to reiserfs, and then copy all the data back?

Yes! Some advice, however: if possible, make two separate copies of your data on different remote servers. Also, check the integrity of your copies using something like md5sum -- there's nothing worse than moving data to a new location and finding out it's corrupted only after you have deleted the originals.
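
A minimal Python sketch of the integrity check suggested above, assuming both the originals and the copy are reachable as local paths (for example, the remote share mounted somewhere). The names file_md5, tree_checksums and compare_trees are made up for this illustration; only hashlib and pathlib are real library modules.

```python
import hashlib
from pathlib import Path


def file_md5(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of one file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def tree_checksums(root):
    """Map each regular file's path (relative to root) to its MD5 digest."""
    root = Path(root)
    return {
        str(p.relative_to(root)): file_md5(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def compare_trees(original, copy):
    """List relative paths that are missing on one side or whose digests differ."""
    a, b = tree_checksums(original), tree_checksums(copy)
    return sorted(name for name in a.keys() | b.keys() if a.get(name) != b.get(name))
```

An empty list from compare_trees("/mnt/old", "/mnt/backup") (hypothetical paths) would be the signal that it is safe to reformat; anything else names the files to re-copy first.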

--
Tubal-Cain smokes the white owl.
Re:ext3 to reiser4 ? by Aardpig · 2004-08-23 14:40 · Score: 5, Informative

yeah, that, and *stability*. reiserfs has a notable history of people losing their data because of filesystem problems.

Not over the past couple of years -- the original corruption problems with reiserfs, although pretty severe, are well in the past now.

--
Tubal-Cain smokes the white owl.
Re:ext3 to reiser4 ? by Aardpig · 2004-08-23 14:43 · Score: 4, Informative

ext3 has a functioning fsck, reiserfs does not.

I myself have never had any problems with reiserfsck -- what exactly is wrong with it?

--
Tubal-Cain smokes the white owl.
Re:ext3 to reiser4 ? by EMN13 · 2004-08-23 14:46 · Score: 5, Informative

Nope, convertfs won't work... From the horse's mouth:
To upgrade from reiserfs V3 to V4, use tar, or sponsor us to write a convertfs.

The lkml posting is probably cached all kinds of places, but kerneltrap also reproduces it in full.

Then again, reiserfs v4 and v3 have nothing to do with each other (unlike ext2 and ext3 for instance), so there's no quick fix possible probably.
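
For what the "use tar" route amounts to, here is a rough Python sketch using the standard tarfile module: pack the old filesystem's tree into an archive stored elsewhere, reformat, then unpack. The function names archive_tree and restore_tree are invented for this example, and a real migration would more likely use the tar command itself; treat this as an illustration of the idea, not a tested migration tool.

```python
import tarfile
from pathlib import Path


def archive_tree(source_dir, archive_path):
    """Pack everything under source_dir into a gzip-compressed tar archive."""
    with tarfile.open(str(archive_path), "w:gz") as tar:
        # arcname="." stores member names relative to source_dir
        tar.add(str(source_dir), arcname=".")


def restore_tree(archive_path, target_dir):
    """Unpack an archive made by archive_tree into target_dir."""
    Path(target_dir).mkdir(parents=True, exist_ok=True)
    with tarfile.open(str(archive_path), "r:gz") as tar:
        if hasattr(tarfile, "data_filter"):
            # Newer Python versions: use the safer "data" extraction filter
            tar.extractall(str(target_dir), filter="data")
        else:
            # Older Python versions: only do this with archives you created
            tar.extractall(str(target_dir))
```

The archive has to live on a different disk or machine than the partition being reformatted, which is the same point made elsewhere in the thread about copying to a network mount.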

On the other hand - reiser4 is completely untested (compared to reiser v3 and jfs, xfs, ext2, heck even the wine-dll emulation layered ntfs writing driver...), so do yourself a favour and don't do anything quite so crazy as not just using it for a production machine but also trying to convert an existing system to it with 'smart' tricks... Give it a little while... or make a lot of backups...
Re:ext3 to reiser4 ? by Anonymous Coward · 2004-08-23 14:49 · Score: 5, Informative

>MD5 has been proven to have collisions.

Statistically speaking you are more likely to get malaria in Arizona than experience a random MD5 collision.
Re:ext3 to reiser4 ? by gordyf · 2004-08-23 14:59 · Score: 4, Funny

Um, every hashing algorithm has collisions. There are more things to hash than there are resulting hashes.
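
The counting argument can be shown with a toy example in Python. The assumption here is a deliberately tiny "hash" (the first byte of an MD5 digest, so only 256 possible outputs); tiny_hash and find_collision are names invented for this sketch. Feeding it 257 distinct inputs must produce at least one collision, whatever the inputs are.

```python
import hashlib


def tiny_hash(data: bytes) -> int:
    """Toy 8-bit hash: the first byte of the MD5 digest (256 possible outputs)."""
    return hashlib.md5(data).digest()[0]


def find_collision(n_inputs: int = 257):
    """Hash n_inputs distinct messages and return the first colliding pair found."""
    seen = {}
    for i in range(n_inputs):
        msg = str(i).encode()
        h = tiny_hash(msg)
        if h in seen:
            return seen[h], msg, h
        seen[h] = msg
    return None  # only possible when n_inputs <= 256


# The same counting at full scale: 1 KiB files (8192 bits) versus 160-bit digests.
# On average 2**(8192 - 160) distinct files share each digest.
files_per_sha1_digest = 2 ** (8 * 1024 - 160)

pair = find_collision()
```

None of this says collisions are easy to find for a real 128-bit or 160-bit hash; it only shows that they have to exist.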
Re:ext3 to reiser4 ? by EMN13 · 2004-08-23 15:00 · Score: 4, Informative

All hash functions have collisions; that's not the point... It's extremely unlikely for one to occur non-maliciously. The MD5 collision found was only found after some trivial mathematics done as a matter of principle by your network card just to spite you (okay, just kidding).
It's still perfectly fine to use MD5 to check the validity of your files for bit-errors. Then again, so is CRC32.
I do have a question for anyone more knowledgeable about MD5's weakness: although MD5 can now be spoofed, it's not clear to me from reading the news - is it only directly applicable to messages of a certain type/length or to all messages?
Re:ext3 to reiser4 ? by 1000StonedMonkeys · 2004-08-23 15:04 · Score: 4, Informative

Sure, MD5 has been proven to have collisions, but that really isn't news. All hash algorithms have collisions simply because of the nature of what they do.

Suppose you've got a 1K file, which is 8192 bits. There are 2^8192 possible values that file can assume. If you map those 2^8192 values to the 2^160 values a SHA1 hash can assume, you have an average of 2^8032 1K files that collide on any given SHA1 hash.

What differentiates hash algorithms is their ability to prevent people from generating a text that matches a given hash. It is currently not possible to do this for either MD5 or SHA1. It has been speculated that MD5 is nearing the end of its life in this regard though. I don't follow the field closely enough to weigh in on the matter, but I can tell you that the only thing that finding an actual md5 collision will do is demonstrate what was rather easily proved in the previous paragraph.

As far as verifying files is concerned, the cryptographic strength of the hash algorithm is irrelevant. Unless you suspect someone will be tampering with your results, use whatever algorithm you can find a useful tool for, be it md5, sha1, or even crc32.
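
As a small illustration of the point about accidental corruption, this Python sketch flips a single bit in a buffer and checks that both CRC32 and MD5 notice. The helper flip_bit and the test data are made up for the example; zlib.crc32 and hashlib.md5 are the standard library calls.

```python
import hashlib
import zlib


def flip_bit(data: bytes, bit_index: int) -> bytes:
    """Return a copy of data with exactly one bit inverted."""
    out = bytearray(data)
    out[bit_index // 8] ^= 1 << (bit_index % 8)
    return bytes(out)


original = bytes(range(256)) * 16      # 4 KiB of sample data
damaged = flip_bit(original, 12345)    # simulate a single-bit error

# CRC32 is guaranteed to catch any single-bit error; MD5 catches it
# with overwhelming probability.
crc_changed = zlib.crc32(original) != zlib.crc32(damaged)
md5_changed = hashlib.md5(original).digest() != hashlib.md5(damaged).digest()
```

Neither check says anything about deliberate tampering, which is the separate question the cryptographic strength of the hash is about.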
Re:ext3 to reiser4 ? by 13Echo · 2004-08-23 15:16 · Score: 5, Informative

ext3 has fewer bugs and has been through more testing. ext3 has a functioning fsck, reiserfs does not.

"man reiserfsck"

But ReiserFS doesn't need an "fsck" type program in normal circumstances. In power outages, etc., it's rock-solid. But for things like drive failures and the like that tend to actually corrupt the data, then yes; EXT3 is the better choice. The reiserfsck program isn't intended to be run in the event of just any power outage or failed unmount, because those sorts of things don't tend to damage the filesystem.

I've been using ReiserFS 3 for years and I've really been happy with the results. The only times (once or twice) that I've had problems were when I had severe hardware malfunctions (due to failing mobo capacitors and a dying hard drive), and my own carelessness when trying to repair the bad data.
Re:ext3 to reiser4 ? by WindBourne · 2004-08-23 15:23 · Score: 5, Insightful

In any case, if you're looking for a really nice filesystem, use XFS. It was developed by professionals (SGI), is fast and stable, and is now released as open source.
And Reiserfs (and for that matter, Linux kernel) is not developed by professionals? Reiserfs is fully funded and the designers/coders are paid; By definition, PROFESSIONAL. But they are also talented.
I suppose it's just a coincidence that the reiser benchmarks page doesn't compare it to XFS... or maybe they were too embarrassed to show the results?
Please quit being a total twit. XFS has its place, but for now, we are discussing ReiserFS. Just for the record, ReiserFS has been around for years, and does a great job with mixing loads of little to medium files. While XFS does an ok job, it really excels with the large files, in particular, very large sparse files.

For what it is worth, I have used Reiserfs, XFS, JFS, EXT3, EXT2, and minix for linux FSs. I have found that they all have advantages depending on what you are doing. minix works for compatibility (with very OLD linux); Ext2 does a great job with a mostly read only fs (think boot or /usr); Ext3 has the advantage of data journaling, but it is soooooo slllloooowwww; Jfs, XFS, and Reiserfs are my main ones and they always work.

--
I prefer the "u" in honour as it seems to be missing these days.
Re:ext3 to reiser4 ? by Lshmael · 2004-08-23 15:47 · Score: 4, Informative

You must be referring to section 4d in the Gentoo Handbook:
XFS is a filesystem with metadata journaling which comes with a robust feature-set and is optimized for scalability. We only recommend using this filesystem on Linux systems with high-end SCSI and/or fibre channel storage and an uninterruptible power supply. Because XFS aggressively caches in-transit data in RAM, improperly designed programs (those that don't take proper precautions when writing files to disk and there are quite a few of them) can lose a good deal of data if the system goes down unexpectedly.
Incidentally, the Handbook is very pro-ReiserFS (although it also says ext3 is "an excellent filesystem," reiserfs is used as the default filesystem in all of the examples). If you spend time on the Gentoo Forums, you will find a number of people who claimed reiserfs tried to kill their computer, and an equal number who swear by it.
Re:ext3 to reiser4 ? by SaDan · 2004-08-23 15:48 · Score: 5, Informative

I've successfully recovered a trashed array running ReiserFS after losing a CPU.

reiserfsck is there, and does work.

I've had more problems with the Ext filesystems than I care to mention, and we do not use Ext2 or Ext3 on any production machines that run Linux any more. Everything's ReiserFS v3, and once we start testing Reiser4, we'll move to that.

Ext3 was a hack for compatibility with Ext2. It serves its purpose, which is easy upgrades and backwards compatibility.
Re:ext3 to reiser4 ? by Anonymous Coward · 2004-08-23 16:05 · Score: 5, Funny

I myself have never had any problems with reiserfsck -- what exactly is wrong with it?

You don't know? The problem with reiserfsck is that it is invisible to those who are dogmatically anti-Reiser. Hans is currently working on a ReiserDecloak() function to address this.
Re:ext3 to reiser4 ? by minion · 2004-08-23 16:53 · Score: 5, Informative

I suppose it's just a coincidence that the reiser benchmarks page doesn't compare it to XFS... or maybe they were too embarrassed to show the results?
---

Please quit being a total twit. XFS has its place, but for now, we are discussing ReiserFS. Just for the record, ReiserFS has been around for years, and does a great job with mixing loads of little to medium files. While XFS does an ok job, it really excels with the large files, in particular, very large sparse files.

I just wanted to add my two cents to this: We had done internal benchmarks at our company, and found XFS to be the fastest filesystem, and seemed to have a good track record with the community. (We didn't consider reiserfs because of its lack of bad block handling).
Either way, we converted ONE of our 2 Terabyte mount points to XFS. Whenever a file would be created on that mount point that exceeded 4G, bdflush would peg the cpu at 100%, commits to the disk would cease, and file system corruption ensued.

This was with kernel 2.4.23.. The problem was fixed in 2.4.25 (maybe 2.4.24, but we never tested that kernel). When we had this issue, and linked it to XFS (through another test system), we quickly migrated away from XFS, back to ext3.

We never had a problem like that with the ext filesystems. We've lost data with both reiserfs and XFS. And if you grep the changelog for the kernels on XFS, you'll see tons of fixes for "deadlocks, race conditions, oopses", etc. These were all fixes AFTER 2.4.23..

Lesson: Stop playing with something that works, and be happy your servers serve. We never made it to testing JFS, and we probably won't. Ext3 might not be the fastest kid on the street, but it has been the most reliable for us.

--

-- If we don't stand up for our rights, now, there will be no right to stand up for them later.
Re:ext3 to reiser4 ? by boaworm · 2004-08-23 20:53 · Score: 4, Insightful

This is the danger of ext3 journaling (and possibly others as well). It makes people believe that just because the filesystem passed as "clean" during boot, no corruption occurred. Try a full fsck after a year or so of running (with a number of power failures/OOPS'es), and you will probably find a number of ext3 fs corruptions not detected by the "fast" fscking.

As far as reiserfs is concerned, bring me quota and I'll consider it. Until then, it's ext3 with full fsck's at boot.

--
Probable impossibilities are to be preferred to improbable possibilities.
Aristotle

oooooo, dancing trees! by fishbert42 · 2004-08-23 14:24 · Score: 5, Funny

... but can they tango?

Re:oooooo, dancing trees! by peculiarmethod · 2004-08-23 14:35 · Score: 4, Informative

"* Reiser4 uses dancing trees, which obsolete the balanced tree algorithms used in databases (see farther down)"

that's what he meant.

oh, and whoever moderated offtopic didnt rtfa, either. damn, peeps.. what is wrong with this community these days?

pm

--
** "It's not my job to stand between the people talking to me, and the ones listening to me." -- Pego the Jerk
Re:oooooo, dancing trees! by Jeremi · 2004-08-23 15:04 · Score: 4, Funny

I don't know about you, but for me the tfa isn't about algorithms at all, but rather is a very short piece regarding "server timeouts". :^P

--

I don't care if it's 90,000 hectares. That lake was not my doing.

Only one question... by Pathway · 2004-08-23 14:26 · Score: 4, Interesting

I only have one question (And I obviously have not researched an answer...):

Is there an easy and non-destructive way for me to migrate my ReiserFS version 3 to a version 4 Filesystem?

--Pathway

Re:Only one question... by EMN13 · 2004-08-23 14:37 · Score: 5, Informative

Well, Mr. Reiser Dude suggests tar in his posting to lkml which can also be viewed on kerneltrap.org.

In other words,

no.

Re:Windows port? by Coneasfast · 2004-08-23 14:28 · Score: 5, Informative

there is rfstool for reiserfs (afaik not v4)
and many for ext2/3

if OTOH, you are looking for a fully featured driver that can be used for production use, then i wouldn't count on it

--
Marge, get me your address book, 4 beers, and my conversation hat.

Helpful Mirror by Anonymous Coward · 2004-08-23 14:29 · Score: 4, Informative

Reasons why Reiser4 is great for you:

* Reiser4 is the fastest filesystem, and here are the benchmarks.
* Reiser4 is an atomic filesystem, which means that your filesystem operations either entirely occur, or they entirely don't, and they don't corrupt due to half occurring. We do this without significant performance losses, because we invented algorithms to do it without copying the data twice.
* Reiser4 uses dancing trees, which obsolete the balanced tree algorithms used in databases (see farther down). This makes Reiser4 more space efficient than other filesystems because we squish small files together rather than wasting space due to block alignment like they do. It also means that Reiser4 scales better than any other filesystem. Do you want a million files in a directory, and want to create them fast? No problem.
* Reiser4 is based on plugins, which means that it will attract many outside contributors, and you'll be able to upgrade to their innovations without reformatting your disk. If you like to code, you'll really like plugins....
* Reiser4 is architected for military grade security. You'll find it is easy to audit the code, and that assertions guard the entrance to every function.
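
To put rough numbers on the block-alignment waste mentioned in the dancing-trees bullet, here is a back-of-the-envelope Python sketch. It assumes a 4096-byte block size and a made-up list of file sizes, ignores all metadata, and models "squishing small files together" as ideal back-to-back packing, which is a simplification and not a description of Reiser4's actual on-disk layout.

```python
def blocks_needed(size, block_size=4096):
    """Whole blocks required to hold `size` bytes (ceiling division)."""
    return -(-size // block_size)


def slack_bytes(sizes, block_size=4096):
    """Bytes wasted when every file starts on its own block boundary."""
    return sum(blocks_needed(s, block_size) * block_size - s for s in sizes)


def packed_blocks(sizes, block_size=4096):
    """Blocks needed if file bodies were stored back to back (idealised packing)."""
    return blocks_needed(sum(sizes), block_size)


# Hypothetical mix of mostly small files, sizes in bytes
sizes = [100, 700, 1500, 4096, 5000]

aligned_blocks = sum(blocks_needed(s) for s in sizes)
wasted = slack_bytes(sizes)
ideal_blocks = packed_blocks(sizes)
```

With this sample the block-aligned layout uses twice as many blocks as the idealised packed one; how close a real filesystem gets to the ideal depends on details this sketch leaves out.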

V3 of reiserfs is used as the default filesystem for SuSE, Lindows, FTOSX and Gentoo. We don't touch the V3 code except to fix a bug, and as a result we don't get bug reports for the current mainstream kernel version. It shipped before the other journaling filesystems for Linux, and is the most stable of them as a result of having been out the longest. We must caution that just as Linux 2.6 is not yet as stable as Linux 2.4, it will also be some substantial time before V4 is as stable as V3.

Re:Helpful Mirror by PingXao · 2004-08-23 18:12 · Score: 4, Insightful

"Reiser4 is architected for military grade security."

DING * DING * DING * DING

Alarm bells going off here. There is no commonly accepted definition of what constitutes "military grade security". Authors and vendors should avoid this terminology like the plague. It reeks of snake oil and most security professionals will look askance at anything that touts this "feature". Having said that, I've used Reiser3 and I think it's great. There's no reason to think Reiser4 won't be even better. Given its plugin architecture there's also no reason to think that secure plugins can't be developed for it in a transparent way that actually provide good security. Maybe my complaint here is pedantic. Never say never, but no software program should ever use the phrase "military grade security" if it wants to be taken seriously. There is no standard of "military grade security" by which such claims can be measured. Why would you want your software to be grouped with fraudulent security products, even if yours really is secure?
Re:Helpful Mirror by JamesKPolk · 2004-08-23 18:18 · Score: 5, Insightful

When you're being paid by the military and being told what their needs are, you can say military all you want.

Re:Windows port? by Aardpig · 2004-08-23 14:31 · Score: 4, Informative

Will we ever have a Windows port of ReiserFS or any alternative filesystems?

I'm not sure about ReiserFS, but there is already a program -- Explore2fs -- which lets you mess around with Ext2 and Ext3 partitions from Windows. Why you would want to do that is beyond me, but there you go.

Of course, you may be talking about a native Windows implementation of Ext2/3 and/or ReiserFS. Which is a totally different kettle of fish...

--
Tubal-Cain smokes the white owl.

Who's got the balls... by Stevyn · 2004-08-23 14:35 · Score: 4, Interesting

...to use it for a while. I'm sure it's been tested very extensively, but there are always bugs initially in any major release like this. I'm sure nobody running a server will touch this for a while even with the benchmarks.

I'm not trying to spread FUD on reiser at all, I run reiser 3 and I've never had any problems. I'm just raising the question of how long does it take until people will put it in production servers and their main desktops?

Anyone who maintains servers care to shed some light on this?

Re:Who's got the balls... by dtfinch · 2004-08-23 15:06 · Score: 5, Informative

When deciding which filesystem would be best for our first critical samba file servers, this post and other scattered rumors of unreliability scared us away from reiser3 for the time being:

http://www.redhat.com/archives/fedora-list/2004-July/msg00418.html

The date of the post caught my eye. The test was very recent. Ext3 won in this particular case, by a longshot, leading a Red Hat employee to respond "Your investigation proves that we default to the right mode ;)".

I haven't seen ext3 (ordered) lose in any reliability benchmarks versus jfs, xfs, or reiserfs, though it's hard to find many such benchmarks.
Re:Who's got the balls... by SaDan · 2004-08-23 15:44 · Score: 5, Interesting

It hasn't been my experience that ext2 or ext3 filesystems are more reliable than ReiserFS. At least, not where I work (I only run ReiserFS at home).

Over the past year, we've had some fairly serious filesystem failures on some of our DB and large FS servers. Ext3 failed in every instance, Reiser was recoverable (similar RAID/hardware/usage/failure).

We pound the living hell out of our machines, day and night, with billions of small files every year. ReiserFS makes Linux work for us.

There are some instances where ReiserFS v3 is slower than Ext3, but we don't care about that any more. We're finished with Ext2/3, and are looking forward to testing ReiserFS4 now that it's been released.
Re:Who's got the balls... by hansreiser · 2004-08-23 20:40 · Score: 5, Interesting

Keep in mind that redhat kernels are highly patched and they don't apply reiserfs bugfix patches out of a deliberate policy to exclude them (yes, we offered to supply them but were rejected), so we don't recommend the use of redhat kernels for reiserfs, we recommend the official kernel, or the SuSE kernel.

RedHat are the guys that at one point shipped their kernel with REISERFS_DEBUG turned on just to make us look slow.....

I don't know why RedHat regards us as in the enemy SuSE camp just because we took money from SuSE, we would take money from RedHat too if it was offered....;-)

These distro rivalries are distasteful to me.

Hans

His thoughts on NTFS... by blackketter · 2004-08-23 14:43 · Score: 5, Interesting

IF you can get to the site, you'll find this juicy reference at the end:

[NTFS]

"Inside the Windows NT File System" the book is written by Helen Custer, NTFS is architected by Tom Miller with contributions by Gary Kimura, Brian Andrew, and David Goebel, Microsoft Press, 1994, an easy to read little book, they fundamentally disagree with me on adding serialization of I/O not requested by the application programmer, and I note that the performance penalty they pay for their decision is high, especially compared with ext2fs. Their FS design is perhaps optimal for floppies and other hardware eject media beyond OS control. A less serialized higher performance log structured architecture is described in [Rosenblum and Ousterhout]. That said, Microsoft is to be commended for recognizing the importance of attempting to optimize for small files, and leading the OS designer effort to integrate small objects into the file name space. This book is notable for not referencing the work of persons not working for Microsoft, or providing any form of proper attribution to previous authors such as [Rosenblum and Ousterhout]. Though perhaps they really didn't read any of the literature and it explains why theirs is the worst performing filesystem in the industry....

Re:here is the text from namesys.com by Tyir · 2004-08-23 14:48 · Score: 4, Informative

Actually, v3 is NOT the default filesystem of Gentoo. Gentoo has no default filesystem; you pick what you want. They give the easy options of ext2/3, xfs and reiser.

No compelling reason to upgrade by Anonymous Coward · 2004-08-23 14:53 · Score: 5, Funny

I'm going to stick w/ Emacs for my filesystem thank you.

Huh? by Enahs · 2004-08-23 14:57 · Score: 5, Informative

Um, yes, there is an advantage. That's what the journal is for (duh.)

It astounds me that your post was marked as "Informative," because it's downright wrong.

Now, if you're talking about fsck after a certain number of boots, or a full fsck for whatever reason, then no, there's no advantage over ext2. It's ext2 + improvements + journal, for the most part.

For my money, using ext3 without btree hash dirs is stupid nowadays. Go back and bench reiser vs. ext3. ext3 is usually still slower, but the gap is narrower nowadays.

--
Stating on Slashdot that I like cheese since 1997.

Re:Huh? by timeOday · 2004-08-23 17:39 · Score: 5, Interesting

I did a comparo of ReiserFS and Ext3 a while back and these were my main findings:
1) Reiser destroyed Ext3 for directories with many thousands of files in them. However, now you say ext3 has btree hash dirs, probably minimizing the difference
2) Reiser was much more space efficient if the average file size on the filesystem is very small (say, well under 4k). However, no *real* filesystems I found were like this.
3) The two were about the same in speed for large numbers of small reads and writes.
4) Ext3 was a bit faster for big sustained reads/writes. But it wasn't a huge difference and might not apply to Reiser4.
In short, Reiser was more robust to unusual filesystem usage, at a slight penalty to normal usage.
In fairness, this is because Ext has been around for so long, it is optimized for normal usage, and software is tailored not to step on the toes of Ext's deficiencies. For instance, to store huge numbers of small files, people usually use a database of some sort (even if only flat file). Reiser opens the possibility of simplifying life by replacing simple databases of small records with the filesystem; for instance, it might be practical for a Usenet newsreader to store every cached message in a separate file.
But for the most part, I think Reiser will stand on its new gee-whiz features (plugins), rather than raw performance, since there are so many filesystems with roughly comparable performance for normal usage scenarios. As with Longhorn's fancy new filesystem, the question is whether people really want feature-rich files.
Re:Huh? by hansreiser · 2004-08-23 19:05 · Score: 5, Informative

ext3 btrees are not well done performance wise. Most users are best off not using them, because they significantly slow performance unless directories are large, and I think that is why they are not on by default.

V3 of reiserfs paid a performance penalty for saving space and handling large directories efficiently. This irritated the shit out of me, the author, and we fixed it in V4 and then some.:)

V4 is finally to where it is sweet, and works like I fondly imagined earlier version of reiserfs would. We fixed deep design errors, and V4 is a complete rewrite from scratch reflecting all our regrets accumulated over 10 years of learning what the hell we were doing. We were beginners when we started out, as everyone is.

Now, the space savings makes things go faster not slower, and does not add seeks. We learned from XFS also, and allocation on flush works very well. Thanks SGI, for taking the time to explain to me why I should adopt allocation on flush in ReiserFS. XFS is a great filesystem.

Now that the performance advantage is ours for the now, and there aren't irritating flaws bothering me, we should and will move to semantic features not performance as our focus. The post above is right about that. Semantics matter more than performance.

Re:ATOMIC FILE-ING SYSTEM HERE I COME by Geiger581 · 2004-08-23 14:58 · Score: 5, Informative

Err, the point of atomicity w/ journaling in a hierarchical system is that if you lose power during a write, what is lost is data to which no parent i-node or directory points. The data being created or altered is written first, then its updated directory, and then its parent directory on up to the root. Or you have one journal level, where the file is written to the journal and then the journal entry is copied over the original location. If power dies when the journal is being written, data is lost but the FS maintains integrity, or if the power goes during the copy, the journal entry still exists and can be replayed. Atomicity means that a transaction either happens all the way or not at all, and Reiser4 does guarantee this. In-flight data can be lost so long as partially written data does not leave the filesystem, or some other API-level atomic transaction, partially completed.
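
The "one journal level" case can be sketched as a toy in-memory model in Python. ToyJournaledStore and its crash_at parameter are invented for this illustration; a crash is simulated by simply returning early. This is the classic write-to-journal-then-copy scheme described in the comment, not Reiser4's own mechanism (the Reiser4 announcement says it avoids copying the data twice).

```python
class ToyJournaledStore:
    """In-memory stand-in for a disk with a main area and a one-entry journal."""

    def __init__(self):
        self.data = {}        # the "main area"
        self.journal = None   # a committed transaction not yet fully copied

    def write(self, updates, crash_at=None):
        # Phase 1: write the journal entry. A crash here means the entry
        # never became valid, so the transaction is simply lost.
        if crash_at == "journal":
            return
        self.journal = dict(updates)

        # Phase 2: copy the journalled values over the original locations.
        for i, (key, value) in enumerate(self.journal.items()):
            if crash_at == "copy" and i == 1:
                return  # crash with the main area only partly updated
            self.data[key] = value

        # Phase 3: the copy is complete, so the journal entry can be dropped.
        self.journal = None

    def recover(self):
        """After a crash, replay any committed journal entry in full."""
        if self.journal is not None:
            self.data.update(self.journal)
            self.journal = None
```

Either way the store ends up with all of the update or none of it after recover(): a crash in phase 1 leaves the old values, a crash in phase 2 is repaired by replaying the journal.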

Re:here is the text from namesys.com by auzy · 2004-08-23 15:17 · Score: 5, Interesting

Ok, so that's the standard response, but the main benefits will be stuff like encryption plugins (so easy per-directory encryption). Finally, maybe we'll have fully encrypted home directories easily, and possibly stuff like the WinFS system integrated into the filesystem. It's also 2X faster than reiserfs, and 4X faster than NTFS. The big issue though is that until FreeBSD gets these benefits, apps aren't likely to get these capabilities :( so maybe someone should work on porting this; then maybe there's a good chance these technologies will be used extensively.

Transactions? by ceswiedler · 2004-08-23 15:29 · Score: 4, Interesting

How large (and long) can Reiser atomic transactions be?

Can I write an installation program which creates, replaces, moves, and deletes many files and directories, and have it all be under one transaction with a single commit at the end? Do other 'sessions' not see the transaction until it is complete? Are sessions based on processes or threads or something else?

That would be pretty amazing, to be able to roll back large sets of changes in case of an error. I know that database rollbacks can take large amounts of time (they optimize for the commit, which makes perfect sense) but nonetheless having rollback support in applications would be sensational.

Re:Transactions? by hansreiser · 2004-08-23 18:49 · Score: 5, Informative

Our atomicity does not provide isolation or rollback, it is only atomic in the sense of whether it survives a crash. That is, a reiser4 atomic set of operations will either all survive the crash or none of them will.

You can say that this is not really atomic, and by database traditions that is correct, but I believe we have implemented the aspect of atomicity that for sure should be implemented by the file system and not by the layers above.

Later we may support more isolation and rollback, but we started with allowing people to define a set of fs modifying operations that would either all be preserved across a crash or none of them would be preserved. I tried using the term "transcrash" instead of atom, but no one but me loved the term.

I must caution though that the API for defining an atomic set of filesystem operations is still being debugged. The core infrastructure is rock solid though, as it is what we use for atoms defined internal to the FS. We shipped as soon as our core code was rock solid, and plan to incrementally finish the other stuff over the coming year.
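
For comparison, the closest thing most applications do today for a single file is the write-to-temporary-then-rename idiom, sketched below in Python. This is an application-level analog, not the Reiser4 API (which, per the comment above, was still being debugged); atomic_write is a name invented for this example, and it covers one file only, not a set of operations.

```python
import os
import tempfile


def atomic_write(path, data: bytes):
    """Replace the file at `path` so readers see the old or the new content, never a mix."""
    directory = os.path.dirname(os.path.abspath(path))
    # The temporary file must be on the same filesystem for the rename to be atomic
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(data)
            tmp.flush()
            os.fsync(tmp.fileno())   # push the new content to disk first
        os.replace(tmp_path, path)   # atomic rename on POSIX filesystems
    except BaseException:
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise
```

A stricter version would also fsync the containing directory so the rename itself is durable. Doing the same for several files at once is exactly what this idiom cannot express, which is the gap the atoms described above are meant to fill.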

EVERY computer needs a u.p.s. by FrankHaynes · 2004-08-23 15:59 · Score: 5, Insightful

Write on the blackboard 10^10000000 times:

"EVERY computer needs an uninterruptible power supply. EVERY one."

There are so many problems of which you might not be aware, aside from those requiring you to run fsck afterwards, which are solved by a good u.p.s. that you'd be penny-wise, pound-foolish for not putting a u.p.s. on every machine in sight.

My clients think that I can walk on water simply because I eliminated a large share of unexplainable weirdnesses from their machines by installing an inexpensive u.p.s. on every single one.

Solid, clean power is very important to a stable computing system. I cannot stress this enough.

--
slashdot: A failed experiment.

Re:Stability by Wesley Felter · 2004-08-23 16:06 · Score: 4, Insightful

If your filesystem has bugs, no amount of RAID will save you.

Uh, excuse me? by Lethyos · 2004-08-23 16:24 · Score: 4, Informative

Finally, we have a way of having an encrypted FS in Linux that's not an ugly kludge like loopback.

I think you misunderstand, that's the beauty of it. Basically, Linux (and FreeBSD with GBDE) allows you to encrypt a device at the block level. Everything is written to the disk encrypted, including the file system itself and not just the data. This also allows you to abstract the device. It could be a big file sitting on an existing device or the device itself. It's very flexible.

Some of the other advantages of this are fairly important. Here's a few off the top of my head.

It is easier to build a more secure and more reliable encryption system that works with all means of storing to a device rather than an encryption system for every one of those means. (1 versus an arbitrary number.) To simplify to more practical terms, it is better to write one encryption mechanism that can work with 10 file systems rather than 10 encryption mechanisms to work with each of those.
If you want to encrypt data, you might not always be writing a filesystem to a device. If I have a database that makes raw access to a device for its storage, but I want encryption, I need it at the block device level.
You do not want to make the file system any more complicated than it needs to be. Adding encryption would produce a disaster. Aside from making it easier to corrupt data, you lose a great deal of performance and security. How? Let's say you encrypted your data and sorted or indexed it by the plaintext. You are giving lots of clues to a potential attacker regarding the contents. If you do not follow this convention, you have to decrypt every byte to figure out whether or not it's what you want. Horrible! (This may be an over-simplification. Anyone care to check me on this? Still, the basic principle should apply.)
Keeping encryption outside the filesystem makes it easy, even trivial, to choose the cypher, the key size, and even the block size arbitrarily. The filesystem would undoubtedly impose limitations on all these choices if the encryption were built in.
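
To make the layering argument above concrete, here is a minimal sketch of one encryption layer that any consumer of a block device (a filesystem, or a database doing raw access) can sit on unchanged. Everything in it is an assumption for illustration: `RamDevice`, `EncryptedDevice` and `BLOCK_SIZE` are hypothetical names, and the SHA-256 keystream XOR is a toy stand-in for a real cipher, not something secure and not how the Linux loopback or GBDE code works.

```python
import hashlib

BLOCK_SIZE = 512  # hypothetical block size for this sketch


class RamDevice:
    """Stand-in for a raw block device, backed by a list in memory."""

    def __init__(self, blocks):
        self.blocks = [bytes(BLOCK_SIZE)] * blocks

    def read_block(self, n):
        return self.blocks[n]

    def write_block(self, n, data):
        assert len(data) == BLOCK_SIZE
        self.blocks[n] = bytes(data)


class EncryptedDevice:
    """Same read/write interface as the backing device, but data is stored encrypted."""

    def __init__(self, backing, key):
        self.backing = backing
        self.key = key

    def _keystream(self, n):
        # Toy keystream derived from the key and block number. NOT secure.
        out = b""
        counter = 0
        while len(out) < BLOCK_SIZE:
            seed = self.key + n.to_bytes(8, "big") + counter.to_bytes(4, "big")
            out += hashlib.sha256(seed).digest()
            counter += 1
        return out[:BLOCK_SIZE]

    def _xor(self, n, data):
        return bytes(a ^ b for a, b in zip(data, self._keystream(n)))

    def read_block(self, n):
        return self._xor(n, self.backing.read_block(n))

    def write_block(self, n, data):
        self.backing.write_block(n, self._xor(n, data))
```

Whatever sits on top only ever calls `read_block` and `write_block`, so it never needs to know whether encryption is present; swapping the cipher means changing `EncryptedDevice` alone.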

On the plus-side, filesystem level encryption lets you choose to encrypt on an as-needed basis (such as with NTFS), but the uses of this are minimal and questionable at best (what about swap, temporary files, and data that you forget to encrypt?)

all kinds of things without having to go through the trouble of writing an FS from scratch.

I think you may have learned from my previous comments how you accomplish this. Hint: you don't encrypt at the filesystem layer.

It's very disappointing that it took Linux all these years to get something as basic as a secure, encrypted way to store files. Even Windows has had FS encryption for a while.

Using the loopback device to encrypt data has been available for longer than NTFS has had encryption.

--
Why bother.

This looks very cool. by squidinkcalligraphy · 2004-08-23 16:24 · Score: 4, Insightful

This looks very cool.

Using files as both files and directories is really nice - throw ACLs, metadata, whatever in a directory with the same name as the file: access it as a file and it is the file, access it as a directory and it provides access to the metadata. It doesn't break things. Well, not much. As mentioned, this will break things like tar a bit. But the VFS has managed to deal with resource forks from HFS, albeit in a slightly ugly fashion. This is a little nicer, and perhaps with time will be the framework for slowly abandoning outdated filesystem concepts.

How would you modify tar to deal with this? Add a .reiser_meta folder in each directory to store the corresponding file directories? Or is there another way?

--
"I think it would be a good idea" Gandhi, on Western Civilisation

Re:what are you talking about? by sirsnork · 2004-08-23 16:35 · Score: 4, Interesting

I'm hanging out for a SQL plugin. Being able to "SELECT filename FROM filesystem WHERE size > 1000000" would be fantastic (note: very basic example). Not to mention metadata plugins that can index your media files and store that data in the filesystem (again with SQL access).
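
For a feel of what such a query would look like, here is a rough userspace approximation: walk a directory tree, load file names and sizes into an in-memory SQLite table, and run the query from the comment against it. This is only a sketch under stated assumptions; the table name `filesystem` and its columns come from the example query, the helper names `index_tree` and `large_files` are made up, and none of this is a Reiser4 plugin or API.

```python
import os
import sqlite3


def index_tree(root):
    """Walk `root` and load (filename, size) rows into an in-memory SQLite table."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE filesystem (filename TEXT, size INTEGER)")
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            path = os.path.join(dirpath, name)
            try:
                size = os.path.getsize(path)
            except OSError:
                continue  # file vanished or is unreadable; skip it
            db.execute("INSERT INTO filesystem VALUES (?, ?)", (path, size))
    db.commit()
    return db


def large_files(db, min_size=1000000):
    """Run the example query against the index and return the matching paths."""
    rows = db.execute(
        "SELECT filename FROM filesystem WHERE size > ? ORDER BY filename",
        (min_size,),
    )
    return [row[0] for row in rows]
```

Calling `large_files(index_tree("/some/dir"))` lists every file over 1,000,000 bytes under that directory. The index is a snapshot, which is exactly the limitation a filesystem-level plugin would avoid.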

--

Normal people worry me!

Re:Why I use ext3 (Was Re:ext3 to reiser4) by Aardpig · 2004-08-23 17:17 · Score: 4, Informative

This is the single most important factor when it came to deciding what filesystem to run, namely, can reiserfs 4 be upgraded to new versions easily?

Yes; as I understand it, ReiserFS 4 is designed with a plug-in architecture, so that future improvements to the filesystem can be incorporated in a non-destructive manner. You can read more about this functionality in the summary of the new features in v.4.

--
Tubal-Cain smokes the white owl.

Re:atomic v. journaled by Jetson · 2004-08-23 17:42 · Score: 5, Informative

Is Reiser V4 journaled? Is an 'atomic filesystem' the same, or is it better, or just different? If different, what is the difference?

Journaled: The data is written to a temporary queue and then copied to the main storage. If the system dies while writing to the temporary queue then the main storage is unaffected; if the system dies while writing the queue to main storage then the system will notice when it reboots and will resume writing the queue to main storage.
PRO: Safer than non-journaled since you can never end up with half a buffer written to disk.
CON: Writes everything twice, causing delays. Very bad things could happen if data and associated metadata are in separate transactions and the system crashes between them.
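
The journaled scheme described above can be modeled in a few lines. This is a toy sketch, not any real filesystem's code: `JournaledStore`, the `committed` flag (standing in for a commit record) and the `crash_at` parameter used to simulate a power failure are all hypothetical.

```python
class JournaledStore:
    """Toy model: `journal` and `main` stand in for two on-disk regions."""

    def __init__(self):
        self.main = {}          # block number -> data
        self.journal = []       # queued (block, data) pairs
        self.committed = False  # stands in for an on-disk commit record

    def write(self, updates, crash_at=None):
        # Phase 1: queue every update in the journal.
        self.journal = []
        self.committed = False
        for i, item in enumerate(updates.items()):
            if crash_at == ("journal", i):
                return  # simulated crash while filling the queue
            self.journal.append(item)
        self.committed = True
        self._apply(crash_at)

    def _apply(self, crash_at=None):
        # Phase 2: copy the queue to main storage (the second write).
        for i, (block, data) in enumerate(self.journal):
            if crash_at == ("main", i):
                return  # simulated crash while copying to main storage
            self.main[block] = data
        self.journal = []
        self.committed = False

    def recover(self):
        """Run at reboot: replay a committed queue, discard an incomplete one."""
        if self.committed:
            self._apply()
        else:
            self.journal = []
```

A crash in phase 1 leaves `main` untouched, and a crash in phase 2 is repaired by `recover()` replaying the queue. The double write named in the CON line is visible here: every block goes into `journal` and then again into `main`.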

Atomic: The file data is written to unallocated space on the disk. Once that has completed, the directory record is updated by writing a copy of that record to unallocated space. The directory's parent is then updated by writing *it* to a new region of the disk, and so on up the tree. Since each write doesn't take effect until the next has completed, any interruption results in complete reversion.
PRO: Safe. Faster than journaled since there is no double-posting.
CON: More complicated to implement, I suppose. I would expect it to be slightly slower than the journaled method when writing very small changes to existing files, as journaled can optimise the writes in the queue whereas atomic has to finish what it started...
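
The atomic scheme as described in this comment (write new copies up the tree, then switch over) can be sketched as a copy-on-write update of nested dictionaries. This follows the comment's description only; it is an assumption-laden model, not Reiser4's actual wandering-log implementation, and `cow_update`, `Volume` and `crash_before_commit` are hypothetical names.

```python
def cow_update(node, path, data):
    """Return a NEW copy of `node` with `data` stored at `path`; `node` is untouched."""
    name, rest = path[0], path[1:]
    new_node = dict(node)  # fresh copy of this directory record
    if rest:
        # Copy the child directory too, and so on along the path.
        new_node[name] = cow_update(node.get(name, {}), rest, data)
    else:
        new_node[name] = data
    return new_node


class Volume:
    def __init__(self):
        self.root = {}  # the only "pointer" that is ever overwritten

    def write(self, path, data, crash_before_commit=False):
        new_root = cow_update(self.root, path, data)
        if crash_before_commit:
            return  # simulated crash: the new copies are simply orphaned
        self.root = new_root  # single pointer swap is the commit
```

Because nothing reachable from the old root is modified, an interruption at any point before the final assignment leaves the previous state fully intact, and each piece of data is written only once.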

Filesystems seem to be like VWs by mcrbids · 2004-08-23 17:44 · Score: 4, Insightful

There are two kinds of people, when it comes to the original VW Beetle: Those who love them, and those who hate them.

People who do not fall in one of the above two categories have never really used or owned an original VW Beetle.

It seems filesystems are the same way. I'm a long-term Ext2/3 user and have never had any particular issue with it. For the medium-power stuff I work with, it does fine. The filesystem on my laptop has been ext2/3 for almost 5 years now, I still have email, documents, etc. from 5 years ago on it. (It's been copied a few times - it originated on an AMD K6 system, now it's on a Dell Centrino Laptop)

So, I guess I'm in the "Ext3 is all good" camp.

Reading these posts, there are those who love Reiser, and those who hate it. Those in the middle haven't apparently used it.

I've found Ext3 to be slow when you have more than about 5000 files in a directory. If I had a specific need for that, I'd consider Reiser if my particular distro (RedHat migrating to Debian) supported it "out of the box".

Other than that, why bother? I've delivered millions upon millions of email messages and many millions of website hits on servers running Ext3.

So, for me, what filesystem I use is sort of like what tires I use on the car. I might care slightly when installing, but otherwise I wouldn't give even a rat's ass.

--
I have no problem with your religion until you decide it's reason to deprive others of the truth.

Re:Stability by hansreiser · 2004-08-23 19:22 · Score: 4, Informative

Our response is definitely not so what. We might have told you that metadata journaling (what V3 uses) provides a level of service in which, like FFS and many other filesystems before it, if you crash during a write the write gets garbled.

Reiser4 is fully atomic though, and a write will either make it to disk entirely or not at all, with no data garbling. In other words, assuming that metadata journaling was what made you unhappy, we listened, but waited until a deep rewrite could allow us to fix it with no significant performance loss.

We are very happy that the use of wandering logs allowed us to make things atomic without losing any significant performance.

Re:Sweet! by hansreiser · 2004-08-23 19:27 · Score: 4, Informative

If we go for a week in -mm with no bug reports, I send it to Linus.

Re:How reliable is an "unstable 1.0" anyway? by hansreiser · 2004-08-23 19:34 · Score: 4, Informative

Our approach is to first get it to where nobody in the developer team using all known filesystem crashing scripts collected over 10 years can crash it, and then ask our mailing list to crash it, and only then ask real users to try it.

That said, if you have a mission critical server, be sensible, wait a bit.

It is in the -mm kernel; if it goes without a bug report for a week, we send it to Linus. I have been surprised by the lack of bug reports after going into -mm. All we have is one apache 2 bug report that we cannot reproduce yet.

Re:ATOMIC FILE-ING SYSTEM HERE I COME by hansreiser · 2004-08-23 19:38 · Score: 4, Informative

No, atomic is the only option for reiser4, there is no metadata journaling even as an option. So, it both goes faster and keeps your data safer. It is nice when experiments with algorithms work.... :)

Hans

Bad blocks, etc. by warrax_666 · 2004-08-23 20:17 · Score: 4, Informative

(We didn't consider reiserfs because of its lack of bad block handling).

Oh, dear. Bad block handling is not needed on modern drives. All modern drives have automatic remapping of failing blocks, and if you have a drive which actually has bad blocks which are visible to the OS, you should not be storing any data on that drive.

Just to add a data point: I've also had very mixed experiences with XFS. I installed it and it seemed to be chugging along fine for ~1 year (just regular desktop machine, no particular I/O load to speak of) until suddenly the initial root mount showed an empty /. I was never able to track the cause down, but I promptly changed to ReiserFS and haven't looked back since.

--
HAND.

Windows vs Linux: encrypted filesystems by 0x0d0a · 2004-08-23 20:20 · Score: 4, Insightful

There are also some severe disadvantages to block-level encryption -- from a user standpoint, WinNT-style filesystem-level encryption is generally preferable. Among other things:

* Filesystem-level encryption can outperform block-level encryption.

* It's easy for a Windows NTFS user to "start encrypting something" -- they right-click a directory and check a box. Linux requires a new mounted filesystem running through a new loopback device. Since this isn't doable at the user level in any distro that I'm aware of, it pretty much means that each user doesn't have their private files encrypted separately.

* Choosing as-needed performance is not trivial. I currently maintain individual files encrypted with GPG. I don't want to have my P2P software making my kernel blow cycles constantly and unnecessarily encrypting and decrypting software.

* Unless I'm doing something really grotty, like putting a filesystem on block-level encryption on an LVM virtual volume, if I'm using block-level encryption, I'm forced to choose how much space to allocate to each encrypted area -- how much to put towards my ~/.private directory, how much to put in my ~/main/notes/passwords directory, and so forth. If I'm using filesystem-level encryption, I'm taking available space from a shared pool.

* While not strictly a block-level vs filesystem-level encryption issue, no major distro that I'm aware of provides a nice interface for setting up encrypted directories (well, mount points with block-level encryption) and home directories, with a user's login password used to decrypt keys used to access the encrypted filesystems. Windows is significantly more user-friendly (including providing the option of administrative key recovery) here.

The block-level approach is ideologically clean and modular, but has serious drawbacks. It cannot replace filesystem-level encryption.

--
May we never see th

Re:ATOMIC FILE-ING SYSTEM HERE I COME by hansreiser · 2004-08-23 20:48 · Score: 4, Informative

No, actually, waiting before deleting the old copy is not enough. You need to make sure that there are not two copies.

Think of the classic banking example: credit savings and debit checking are a single atomic operation. You must ensure that you don't get the credit preserved and the debit lost by a crash.
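
The banking example can be tried out with any transactional store. The sketch below uses Python's standard `sqlite3` module purely as an illustration of all-or-nothing behavior; the account names, the starting balances and the `fail_midway` switch that simulates a crash are assumptions, and nothing here reflects how Reiser4 implements atomicity.

```python
import sqlite3


def make_bank():
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE accounts (name TEXT PRIMARY KEY, balance INTEGER)")
    db.execute("INSERT INTO accounts VALUES ('checking', 100), ('savings', 0)")
    db.commit()
    return db


def transfer(db, amount, fail_midway=False):
    """Credit savings and debit checking as one atomic unit."""
    try:
        with db:  # commits if the block succeeds, rolls back if it raises
            db.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = 'savings'",
                (amount,),
            )
            if fail_midway:
                raise RuntimeError("simulated crash between credit and debit")
            db.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = 'checking'",
                (amount,),
            )
    except RuntimeError:
        pass  # the rollback has already undone the credit


def balances(db):
    return dict(db.execute("SELECT name, balance FROM accounts"))
```

With `fail_midway=True` the credit is rolled back along with the missing debit, so the balances never show money in both accounts at once.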

The poster above you was right.

Re:Stability by Anonymous Coward · 2004-08-23 21:04 · Score: 4, Funny

Dancing trees, wandering logs - sounds like the ReiserFS development lab is more of an anthropomorphic lumbermill.

Re:On the other hand ... by martinde · 2004-08-24 00:56 · Score: 4, Funny

> The cause was a hardware problem, true, but fs should not be a toast because of it.

When you develop software that is immune to hardware failure, be sure to let us all know ;-)
