Benchmarking Linux Filesystems Part II

← Back to Stories (view on slashdot.org)

Benchmarking Linux Filesystems Part II

Posted by Zonk on Friday January 6, 2006 @05:34AM from the some-of-this-content-may-be-inappropriate-for-young-readers dept.

Anonymous Coward writes "Linux Gazette has a new filesystem benchmarking article, this time using the 2.6 kernel and showing ReiserFS v4. The second round of benchmarks include both the metrics from the first filesystem benchmark and the second in two matrices." From the article: "Instead of a Western Digital 250GB and Promise ATA/100 controller, I am now using a Seagate 400GB and Maxtor ATA/133 Promise controller. The physical machine remains the same, there is an additional 664MB of swap and I am now running Debian Etch. In the previous article, I was running Slackware 9.1 with custom compiled filesystem utilities. I've added a small section in the beginning that shows the filesystem creation and mount time, I've also added a graph showing these new benchmarks." We reported on the original benchmarks in the first half of last year.

21 of 255 comments (clear)

Min score:

Reason:

Sort:

Re:Very interesting article... by CastrTroy · 2006-01-06 05:41 · Score: 2, Insightful

I'd like to see how they perform on a 12 GB Disk on a P2 266. You really start to see the differences when working on older hardware.

--

Anthropic principle: We see the universe the way it is because if it were different we would not be here to see it.
Need to be careful... by Conor+Turton · 2006-01-06 05:42 · Score: 3, Insightful

One thing this does show is that you need to be very careful to match the filesystem type to the main tasks the PC is going to be used for. Personally, there's no real clear winner as all have major gains or deficiencies in some areas. One very interesting point was the vast difference in the amount of available space after a partition and format between the different filesystems.

--
Conor "You're not married,you haven't got a girlfriend and you've never seen Star Trek? Good Lord!" - Patrick Stewart
1. Re:Need to be careful... by Raphael · 2006-01-06 06:19 · Score: 4, Insightful
  
  One very interesting point was the vast difference in the amount of available space after a partition and format between the different filesystems.
  
  Unfortunately, that graph is rather misleading. The ext2 and ext3 filesystems keep some percentage of the disk space as "reserved" and only root can write to this reserved area. This is useful if the disk contains /var or other directories containing log files, mail queues and other stuff. Even if a normal user has filled the disk to 100%, it is still possible for some processes owned by root to store some files until an administrator can fix the problem. On the other hand, if your filesystem contains only /home or other directories in which users are not competing for disk space with processes owned by root, then it does not make much sense to have a lot of disk space reserved for root. That is why you should think about how the filesystem is going to be used when you create it, and set the amount of reserved space accordingly.
  
  The default behavior for both ext2 and ext3 is to reserve 5% of the disk space for root. You can see it in the section Creating the Filesystems from the article:
  4883860 blocks (5.00%) reserved for the super user
  You can change this behavior with the -m option, specifying the percentage of the disk space that is reserved. The article did not mention how the filesystem was supposed to be used if it had been used in production. However, I would guess that the option -m 0 or maybe -m 1 could have been used in this case. This would have provided a fair comparison and suddenly you would have seen all filesystems in the same range (close to 373GB available), except maybe for Reiser3.
  
  --
  -Raphaël
how to lie with statistics by Clover_Kicker · 2006-01-06 06:00 · Score: 4, Insightful

I love the CPU utilization graph for "touch 10,000 files".

A quick glance shows ReiserV4 as much more CPU intensive, you have to look at the scale to realize it only used 0.3% more CPU.
somewhat worthless by aachrisg · 2006-01-06 06:06 · Score: 5, Insightful

His benchmark data is ruined by using a gross unrealtistic piece of hardware - modern fast hard disks coupled with a cpu which is absurdly slower than anything you can buy.
Sample size by rongage · 2006-01-06 06:13 · Score: 2, Insightful

Am I reading this "benchmark" correctly? Did he base his results on a sample size of 1?

At the very least, you run multiple times and average the results to give statistically meaningful numbers. I can't think of ANY time where a sample size of 1 was meaningful for anything.

What would be really interesting is to come up with a reasonable UCL and LCL for each test, and then calculate out a cpK for each test. It's one thing to say "I got these results one time", it's something much more impressive to say "I can achieve this result +-10%".

Of course, if a particular benchmark can't even hit a cpK of 1, then maybe there is room for improvement in the coding of the driver.

For those of you who haven't done much with statistics, cpK is a measure of "capability" in a machine or process. It shows how repeatable the measured process is. A higher number indicates that you have a highly targeted, low deviation process whereas a low number (1 or less) indicates that your process is incapable of repeatability and/or accuracy.

--
Ron Gage - Westland, MI
It would be nice if... by bhirsch · 2006-01-06 06:15 · Score: 4, Insightful

There were some current (recent 2.6 kernel with XFS, JFS, possibly Reiser4, etc) benchmarks done on highend servers (or at least something with drives a few steps up from the CompUSA weekly special), especially if anyone wants to see Linux succeed in the enterprise.
Of course Reiser4 was slow by Anonymous Coward · 2006-01-06 06:17 · Score: 1, Insightful

Everyone knows Reiser4 uses a lot of CPU, and these guys run the test on a 500MHz machine!!
IDE Drives Cause other Overheads by j0ebaker · 2006-01-06 06:20 · Score: 4, Insightful

It would be interesting to see the results of the same tests running against a SCSI drive system where there is less IO overhead to see if the results differ.
There are other considerations here as well. What about the I/O elevator's tuning options.
Yes, I'd much rather see this test occur against a SCSI drive or better yet against a RAM drive for pure software performance.

Cheers fellow slashdoters!
-Joe Baker
Re:I would agree by Anonymous Coward · 2006-01-06 06:26 · Score: 5, Insightful

Ext2/Ext3: Mediocre at almost everything. Distros like Fedora that mandate the initial install ONLY use Ext3 are being stupid. The best fall-back filing systems if you can't find anything better for what you want the partition to do, but should never be used in specialized contexts.

Huh? Sorry, did you read the same graphs or are you just trolling?

This article shows that ext2 and ext3 are close to the top performer in most tests and do not have many "worst-case scenarios" (unlike, e.g. Reiser3 and Reiser4).

If there is anything that you can conclude after reading this study, it is that ext3 is a reasonably good default choice for a filesystem.
Re:Hardware mismatch by Clover_Kicker · 2006-01-06 06:35 · Score: 2, Insightful

> If all you are doing is using samba or netatalk to serve files
> even 500mhz is overkill.

Not for ReiserV4 :)

Seriously though, there's nothing wrong with designing a new filesystem to take advantage of modern CPU horsepower as long as everyone understands the system requirements.
Re:Normalized results by phoenix.bam! · 2006-01-06 06:44 · Score: 5, Insightful

Reiser uses much more CPU for file system tasks. ReiserFS is a modern filesystem meant to run on modern machines. This machine is only 500mhz and therefore Reiser performs poorly. Had this machine been a 2ghz (standard now, 4x faster than the test machine), or even a 1ghz (Outdated and 2x as fast) machine Resier would have performed much better.

If you want to use parts from 1997 to build a computer, Reiser is not for you. 500mhz is at least 8 year old technology if I remember correctly.
Re:Warning by drinkypoo · 2006-01-06 07:01 · Score: 3, Insightful

XFS does things that ext? and Reiser can't do. Reiser does things other FSes don't do as well. It's a true 64-bit filesystem and it supports insanely large filesystems, up to 9 million terabytes in 64 bit mode (with a 64 bit kernel.) It even provides realtime support, although I guess that's still beta in linux? It can be defragged and even dumped while live. It has insanely quick crash recovery. And of course, it does other stuff too; check the project page. XFS may not be the fastest filesystem - it may even be the slowest - but it's got features no other filesystem has. If you need them, XFS is the winner. Hell, if you just trust XFS more than you trust other filesystems, it's the winner. (Sorry, but I wasn't sleeping when reiser was eating everyone's data, and ext3 handles corruption much more poorly than any of the other Journaled options.)

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Re:I would agree by smoker2 · 2006-01-06 07:03 · Score: 2, Insightful

Ext2/Ext3: Mediocre at almost everything. Distros like Fedora that mandate the initial install ONLY use Ext3 are being stupid. The best fall-back filing systems if you can't find anything better for what you want the partition to do, but should never be used in specialized contexts.
How the hell did you come up with that opinion ?
Ext3 came 1st or 2nd in 24 out of the 40 tests done. If you were producing an OS for general purpose computing, would you use a specialist fs or the best performing general purpose one ?
You seem to have good words for JFS and XFS though, and XFS had only 13 1st or 2nd places !
How do you work out that Ext3 is "mediocre" from those figures ?
(you sound like you run debian)
Re:Normalized results by Anonymous Coward · 2006-01-06 07:18 · Score: 0, Insightful

Don't use that software garbage excuse of "there's more cpu lets use it always cause we can".

That's why stock dell's and HP's are so much god damn slower than a much worse specced machine.

If that's the concept for reiser, I can only guess a large portion of the linux population is retarded.
Re:Normalized results by Westley · 2006-01-06 07:27 · Score: 3, Insightful

It's one thing to say "Let's use more CPU because we can."

It's another to say "Let's use more CPU (which is usually relatively idle) in order to improve the normal bottleneck, which is IO."

I don't see what's wrong with that at all. Of course, it's no good if you've got a machine which doesn't represent the "normal" current situation, any more than using a graphics card for "acceleration" makes sense if the graphics card in question is 10 years old but you're using a fast new CPU.

Jon
benchmarks that take less than 1/10 of a second by hansreiser · 2006-01-06 07:40 · Score: 4, Insightful

If someone does not know that filesystem benchmarks that take less than a tenth of a second are meaningless, it makes you wonder if they made errors in other aspects as well. These results are not consistent with the results that we have had. I bet he did not make an effort to ensure that you had to read the disk for these benchmarks, that he did not copy his file set from the same fs as he was measuring (makes a HUGE difference to performance and it is the mistake every beginner makes), etc. You'll note that the way he makes his graphs makes 1% differences look huge, etc.
Re:I would agree by diegocgteleline.es · 2006-01-06 07:49 · Score: 2, Insightful

Distros like Fedora that mandate the initial install ONLY use Ext3 are being stupid

It's amazing that such commentaries are moderated interesting these days. So, uh, fedora developers are stupid and you're smarter than them?. Please take a look at this commentary to understand why such decisions aren't so simple. You can tune your car's engine and it'll be faster, right? But why not everybody tunes their engines?

Let me quote a ext3 paper: "The ext2 and ext3 filesystems on Linux are used by a very large number of users. This is due to its reputation of dependability, robustness, backwards and forwards compatibility, rather than that of being the state of the art in filesystem technology."
Re:Very interesting article... NOT! by hackstraw · 2006-01-06 08:06 · Score: 5, Insightful

I would rather see these benchmarks on a computer less than 5 years old. I would also appreciate an open source version of the tests so they could be reproduced. For ease of reading, I think the article should be on a separate page on the site as well.

I've got a screaming Dell 1.6 GHz P4 to test with and here are my results for a couple of tests it only has ext3 and a whatever cheap harddrive came with the box. I'm not sure if dma is enabled or if I've done any hdparam tunings, but I'm not sure of their test system either:

my touch 10,000 files: 24.314 seconds theirs 48.25

I used a shell script that called /usr/bin/touch

Now if I use a Perl open() call, I get 8.887 seconds
Now with a cheesy C that uses fopen() and fclose() I get 4.639 seconds

my make 10,000 directories: 56.832 seconds theirs 49.87

that is a shell script

If I user perl, I get 35.171 seconds

The /dev/zero stuff is completely bogus. No indication of the blocksize that was used.

The copy kernel stuff to and from a different slower disk with an unknown filesystem on it is useless.

The split tests are not indicative of anything in real life, and they took on order of between 60 seconds and 130 seconds to perform on their 500MHz system with most being in the 130 second range. I got 16.547 seconds.

I do not see how any relevant information can be obtained from this article. I'm disappointed in the Linux Gazette and Slashdot for printing this information.
Old Shitty Machine, Shitty Results by LordMyren · 2006-01-06 08:09 · Score: 2, Insightful

<blink> Test is flawed! </blink>

Checkout the CPU utilizations; reiserfs is pegged at 100% cpu utilization for ~8 tests. For a FS which describes itself as willing to use more CPU in order to achieve better I/O than the competition, running the benches on an antiquated 700 mhz machine is simply not fair.

OTOH, Untarring and tarring are notably NOT cpu limited, and still pretty lackluster for Reisers case. Disappointing, very disappointing. I was extremely impressed in the ext's; I simply had no idea how consistently well performing they were.

I'd also like to see FreeBSD's UFS /w and w/o softupdate benched.

Myren
Re:I think trying on a P2 266 is a bad idea by captain_craptacular · 2006-01-06 08:12 · Score: 3, Insightful

So this benchmark on a 500Mhz machine will of course show Reiser in a bad light, and moving lower down to a 266Mhz will make it even worse.

If you look at the charts, the "editing" doesn't help either. For example one cpu usage chart showed a range starting @ 92% and ending @ 94%. The Rieser4 bar was 3x as long as the next bar, but guess what, it was using something like .7% (ie 93.7% as opposed to 93%) more CPU. If the scale hadn't been jacked up you wouldn't have been able to spot the difference at all, but they way they chose to present the data, it looked like a total smackdown.

--
They who would give up an essential liberty for temporary security, deserve neither liberty nor security