Slashdot Mirror


Linux Kernel Benchmarking: 2.4 vs. 2.6-test

frooyo pastes from kerneltrap: "Cliff White recently posted some re-AIM multiuser benchmark results comparing the stable 2.4.23-pre5 kernel against the 2.6.0-test5 and 2.6.0-test5-mm4 development kernels. In his conclusion he makes reference to earlier scheduler tests posted by Mark Wong saying, "Short summary: we mostly rock.""

57 of 293 comments (clear)

  1. Comment removed by account_deleted · · Score: 5, Funny

    Comment removed based on user account deletion

  2. SMP by Doesn't_Comment_Code · · Score: 5, Interesting

    The SMP code (written by Linux developers by the way) is supposed to be kicked up a notch in the new kernel. That's what I've heard anyway. I'd love to see Linux being the best OS for multiple CPU scaling.

    That will help everyone from the server market, to me when I save up enough for a two processor motherboard.

    --

    Slashdot Syndrome: the sudden, extreme urge to correct someone in order to validate one's self.
    1. Re:SMP by Arker · · Score: 4, Insightful

      I've got nothing against Linux improving at SMP in essence, but there is something very bad going on here it seems to me. Notice that while the new kernel 'kicks ass' on SMP systems, on uniprocessor systems the 2.4 kernel is the one kicking ass. Anyone benchmarked 2.4 against some of the pre-SMP kernels on a uniprocessor machine?

      Face it, the vast majority of users are uniprocessor, and kernel performance is more of an issue on lower-end machines. Improving performance on big multiprocessor boxes is fine by itself, but not when it harms uniprocessor performance. I'm not a kernel hacker, but I've read many people that this would not happen, that the SMP code would not hurt performance on a uniprocessor machine when the kernel is compiled without it, but that's obviously not turning out to be the case. Anecdotal evidence, at least, suggests that this performance degradation has actually been going on for quite some time, at least back to when SMP code first started being added.

      I'm not sure what all the factors here are, so naturally I'm not going to tell you the solution, but it certainly looks like a potential problem that should be discussed. Hopefully someone with more specifics than I have can chime in...

      --
      =-=-=-=-=-=-=-=-=-=-=-=-=-=-
      Friends don't let friends enable ecmascript.
    2. Re:SMP by GooberToo · · Score: 3, Interesting

      Yes, there are. The scheduler is fully HT aware. It seems that many of the SMP and numa optimizations also apply to HT'ing as well. As such, the developers have been working hard to support it.

      Worth noting, however, it's not uncommon, even for a system that fully supports HT, to see a noteworthy performance drop when HT is enabled. Seems many new systems come with HT disabled in the BIOS for this very reason. Granted, I'm not 100% that's not a Window's specific issue rather than a broad-board HT issue, but something to keep in mind, nonetheless.

    3. Re:SMP by blakestah · · Score: 3, Informative

      Notice that while the new kernel 'kicks ass' on SMP systems, on uniprocessor systems the 2.4 kernel is the one kicking ass. Anyone benchmarked 2.4 against some of the pre-SMP kernels on a uniprocessor machine?

      Yeah, they missed an important test - latency for interactive processes. A lot of scheduler work went into improving this, and it makes a huge difference when you have large memory processes working hard.

      This aspect is improved across the board in 2.6, as well as the SMP issues. Sure, the uniprocessor machine may be a little slower, but response latencies in X are a lot better, and this makes more of a difference to users.

    4. Re:SMP by pmz · · Score: 2, Funny

      I've got nothing against Linux improving at SMP in essence, but there is something very bad going on here it seems to me.

      I don't understand...scalability to hundreds of CPUs will provide much penis enhancement for geeks everywhere (even the ladies).

    5. Re:SMP by Arker · · Score: 2, Interesting

      This aspect is improved across the board in 2.6, as well as the SMP issues. Sure, the uniprocessor machine may be a little slower, but response latencies in X are a lot better, and this makes more of a difference to users.

      I beg to differ. I couldn't care less about X, I haven't even used it in over a year. Like the vast majority of Linux boxes, mine runs in console mode only, and on a single processor. I, and the rest of the world that uses it like this, find it hard to see anything to get excited about in 2.6. It could be that misty thing that happens with your memory over time, but it certainly seems like my first linux box (slack 1 on a 386-25) had better performance than I get now with substantially more powerful hardware in fact. At the very least I'm certainly not seeing any dramatic improvements. I used to upgrade kernels as a matter of course, but now I just don't care, unless there is a security update involved. If it weren't that necessary maintainence work (security patches, in particular) were suspended long ago I'd be tempted to swap in a fresh drive and give it a try, in fact.

      Like I say, it's nice to see SMP support getting better, I've got nothing against it, or any of the other features that have been introduced over the years, but when it starts interfering with performance on the machines that need performance the most, and coincidentally the ones that account for the vast majority of boxes running Linux as well, there is obviously something going wrong. Maybe the config scripts need more work, maybe there is nothing to do but fork large sections of the kernel, I don't know, but if Linux keeps getting worse at fulfilling its bread-and-butter role it will eventually have to be replaced with something else.

      Do you really want to see Linux running on Minicomputer class machines (the kind that private citizens usually can't afford) exclusively, at the expense of being usable in its traditional role of turning cheap old boxes on their way out the door into powerful and stable servers? Does Linus?

      I don't, and I predict there are enough programmers out there that agree with me that if the trajectory continues we'll eventually see either a kernel fork, or a mass migration to *BSD, or both.

      --
      =-=-=-=-=-=-=-=-=-=-=-=-=-=-
      Friends don't let friends enable ecmascript.
    6. Re:SMP by Vellmont · · Score: 3, Insightful

      Well, I don't think you can conclude that the SMP changes to the kernel are what's slowing down the 2.5 Uniprocessor performance vs 2.4 kernels. There are many other changes that took place (low latency and an improved scheduler come to mind) that aren't SMP related.

      Obviously the SMP performance has been improved, and there was a lot of potential for improvements looking at the 8x test. Another way to interpret the results would be to say that the other changes decreased performance across the board on SMP and Uniprocessor systems. The SMP improvements in SMP machines more than made up for this added cost and improved raw performance on SMP machines.

      Hopefully the performance loss on Uniprocessor machines can be decreased or eliminated. Even if it's not, I think you need to remember that raw performance isn't the be-all-end-all thing that's important. 7% is pretty small in the grand scheme of things where processing speed is doubling every 18 months. Responsiveness and better scheduling that doesn't starve processes is more important than a 7% performance decrease IMO, and you don't get that from faster processors.

      --
      AccountKiller
    7. Re:SMP by ocelotbob · · Score: 2, Informative

      The issue with hyperthreading's performance drop comes from the fact that both logical threads are contending for the same cache. Thus, code has to be rewritten in an HT-equipped machine to only use half the cache it normally would take. Thus, in your typical 512k cache machine, you've got to profile your loops, etc, so that it only uses half that cache. The typical program is not written with specific requirements on how much cache they use, thus they throw as much data as possible into cache, causing the two logical threads to fight over the cache, degrading performance. Pretty much any program will act this way, unless compilers get smart enough to have compile-time control of a cache model so that one can recompile everything to take advantage of HT.

      --

      Marxism is the opiate of dumbasses

  3. GREAT by proj_2501 · · Score: 4, Funny

    now i need another CPU to increase performance!

    1. Re:GREAT by Anonymous Coward · · Score: 2, Funny

      In Soviet Russia, we had to increase performance in order to get another CPU ...

  4. novel idea. by justin_w_hall · · Score: 4, Funny

    Go figure. An OS that gets faster with each version.

    --

    ---
    "how can the same street intersect with itself? i must be at the nexus of the universe!" - cosmo kramer
    1. Re:novel idea. by stratjakt · · Score: 3, Interesting

      It's only faster if you have 8 CPUs, your single proc desktop box will be slower.

      Which just reaffirms my belief that linux is becoming ever more firmly planted in the server world, and desktop linux is still just a hobby for the most part.

      --
      I don't need no instructions to know how to rock!!!!
    2. Re:novel idea. by norton_I · · Score: 2, Insightful

      2.6 is supposed to be fully preemptable, which should make lots of latencies decrease, leading to better interactive performance on a desktop, even if the overall throughput is lower. What this benchmark shows is that linux kernel 2.6 is a slower uniprocesser server than 2.4. While that is too bad, it doesn't really say much about desktop linux. I just installed 2.6-test5 on my (2 cpu) desktop, but haven't really had time to evaluate its performance relative to 2.4.

    3. Re:novel idea. by GooberToo · · Score: 5, Insightful

      That statement simply is not true. Granted, you can always find some corner case where the workload is going to be slower between releases (2.x or 2.6), however, as a rule of thumb, 2.6 should still be a huge improvement for even uniprocessor users. Best yet, many, many parameters of the kernel and scheduler are tunable, so, you can always adopt the kernel to work best for your specific workload needs.

      While it's true that they are working hard to significantly improve Linux for the server room, by far, they have never lost site of the uniprocessor user. Remember, there is nothing wrong with tuning the kernel for your uniprocessor needs, and specific workloads. They just can't do that when they are benchmarking because it would skew the results, invalidating them. They are not only trying to measure how their improvements effect the overall system, but, what makes for sane initial defaults, which are reflective of a general purpose and broad workload. If you understand what you are doing, there is not a reason to believe that you can't greatly improve things for your specific uses and workloads. It's important to keep all of these in mind when talking about these benchmarks. Furthermore, you should fully expect your favorite distro to come with tuning presents which reflect a targeted workload (file/print server, workstation, database, web server, etc.).

      Keep in mind that the benchmark you looked at represents one category of many different types of workloads. So, for that specific workload, it may of been slower, however, that workload my not represent anything you do with your computer. Remember, other types of workloads are significantly faster. One last note, remember, performance is the classic trade off with lower latencies. It trades responsiveness for raw throughput. If, on a uniprocessor workstation, you only see a -7% drop in performance and latency is greatly reduced, chances are, not only will you never notice the loss in performance, but you'll be praising it for how well it works with your mouse, monitor and keyboard (if feels better and makes you a happier user).

      Just some food for thought.

    4. Re:novel idea. by Karn · · Score: 3, Insightful

      Wrong.

      One benchmark used for Linux kernels is hammering a system while playing an mp3 to see if they can get it to skip. Low latency is mostly a desktop feature, and the 2.6 kernel is going to have much improved latency.

      Other portions of Linux have changed, and may not initially outperform 2.4, but if you think this kernel isn't going to be a dramatic improvement over 2.4 for desktop users and servers, and if you think the kernel developers aren't taking the desktop into consideration, you are mistaken.

      --


      Why do I keep typing pythong?
    5. Re:novel idea. by be-fan · · Score: 2, Informative

      Desktop Linux kicks ass. With 2.6, interactivity on an unloaded system is close to WinXP, and on a heavily loaded one (the steady state of my machine :) kicks XP's ass all over the place.

      --
      A deep unwavering belief is a sure sign you're missing something...
  5. woo by grub · · Score: 5, Funny


    If you thought SCO was mad over 2.4, just wait until they make up evidence for the 2.6 kernel!

    --
    Trolling is a art,
  6. Re:Real world please. by Anonymous Coward · · Score: 2, Funny

    OT, but I'm pretty sure I've never seen "real world" and "instant messenging" in the same sentence. Except maybe with the accompanying phrase "no relation to".

  7. Not to be a n00b... by Visaris · · Score: 2, Insightful

    Not to be a n00b, but I can't make too much sense of the benchmark the story linked to. Could anyone give a short simple little explanation of what it means? Thanks so much!

    --

    I am a viral sig. Please help me spread.
    1. Re:Not to be a n00b... by NtroP · · Score: 5, Funny

      Not to be a n00b, but I can't make too much sense of the benchmark the story linked to

      You actually READ the article?!? Man! You ARE a N00b!
      --
      "terrorism" and "pedophilia" are the root passwords to the Constitution
    2. Re:Not to be a n00b... by gyrovague · · Score: 2, Informative

      The workload simulates a multi-user system by running an increasing number of users. Each user does a list of tasks. We keep adding users, until the load reaches a max. The score shows tasks per minute, and peak user count. Bigger is better. http://www.osdl.org/stp

  8. timeline? by NumLk · · Score: 2, Insightful

    Seriously, its great and all, but when will it be ready for the masses? I.e. the holy 2.6 release? For us, loading a beta (or even alpha) kernel is something that we can do in our sleep, but look at it from this perspective: all of these improvements will only really make an impact once developers can write applications specific for this environment, which requires, at a minimum, an official release.

    --
    Children in the backseats don't cause accidents. Accidents in the back seats cause children.
  9. Rock? by TheLink · · Score: 5, Insightful

    It's only significantly faster if you have 8 processors.

    Whereas it is 7% slower if you have one processor.

    I suppose they'll have uniprocessor version which runs faster? Lots of people have uniprocessor pcs.

    Hyperthreading doesn't really count.

    --
    1. Re:Rock? by kasparov · · Score: 2, Interesting

      Where did you see that it is 7% slower with one proc? What is 7% slower than what with one processor? Not trying to disagree with you or anything, I just didn't notice anything in the article and was hoping for link.

      --
      There's no place I can be, since I found Serenity.
    2. Re:Rock? by arkanes · · Score: 3, Interesting

      It might very well be slower than 2.4, but "feel" faster. The low latency stuff can improve responsiveness at the expense of performance.

  10. User Experience by the_crowbar · · Score: 5, Informative

    I run 2.4.22 at work and 2.6.0-testX at home. The 2.6.0test(vanilla) series feel much more responsive, especially in X. I have not done any real benchmarks of my systems, but after working with 2.4 all day 2.6 seems to fly.

    Just my observation
    -the_crowbar

    --
    Have you read the Moderator Guidelines
    1. Re:User Experience by Spuggy · · Score: 2, Informative

      The option is still there for the SCSI emulation (I'm assuming this will go away in the future), but you should have no problem using your drive as a pure IDE drive now w/ the newest versions of cdrtools & cdrdao.

      I'm running in IDE mode only with no issues.

  11. Re:Real world please. by tomhudson · · Score: 2, Informative

    AIM (now at version 7) is not an instant messanger client. It's a benchmarking tool. Click on the link in the story to see what it is/does/etc.

  12. Better comparision by ajiva · · Score: 2, Informative

    A better comparision would have been against Solaris x86. Solaris scales very linearly with every added processor.

  13. 2.4 vs 2.6 by Doesn't_Comment_Code · · Score: 2, Interesting

    I assume that when they say the 2.4 Kernel outperforms the 2.6 on a uniprocessor computer, but not on a multi processor computer, that they have recompiled the kernel for each hardware environment.

    This struck me as strange, because when the kernel is compiled without SMP support, all that code is left out. So it doesn't seem like the 2.4 should outperform the 2.6 on one cpu.

    Does anyone know why this might be?

    --

    Slashdot Syndrome: the sudden, extreme urge to correct someone in order to validate one's self.
    1. Re:2.4 vs 2.6 by tmasssey · · Score: 3, Informative
      By definition, with the speed of context switches and other overhead the same, a system with "low-latency" switching (switching faster between interactive jobs) will be slower. It switches more often, therefore wasting more cycles with switching overhead.

      Of course, there is the possibility of trimming cycles from the process of switching contexts. Linux, though, already had that pretty low. That's why Linus is so resistant to shared-memory, shared-context threads: the cost of processes is so low that the benefits are small. However, some speed was gained in context switches.

      Overall, though, more switching means slower performance, even though the user feels like the system is faster. It's not faster. It's actually slower. It's just more responsive.

      Confused yet? :)

  14. Thanks SCO. by EDA+Wizard · · Score: 5, Funny

    Looks like that 1970's UNIX code really increases performance for SMP P-III's.

    Now we can appriciate the forsite that our Unix fathers had when developing Xeon SMP code in the late 1970's.

  15. I'm a bit leery. by devphaeton · · Score: 5, Interesting

    "the general trend in the metric indicates everything has been improving, so I think we rock."

    For some reason, the scheduling seems to get more and more choppy (in that i've noticed) with every iteration of 2.4.x kernel. Currently i'm on 2.4.22, and while i don't have any specific tests, numbers or statistics i'm noticing some issues.

    Easiest way to reproduce it is to have the machine do something cpu intensive, such as mkisofs, cdrecord, bzip2 some huge file, cp anything large, installing (via aptitude) or even the "Reading Package Lists...." stage of apt-get update.

    Oftentimes, the machine will become unresponsive for about 3 seconds at a time, then jolt back up to speed, then pause for 3, on and on. Even after the command line returns the prompt, or gkrellm's cpu and proc krells show that everything is all done, i will still see lag in responses from the kb, mouse, or whatnot off and on for about 10-15 seconds.

    I've gone over my kernel config and tweaked a few things here and there but with no change. I can back down to a 2.4.18 kernel and it's not as bad. Going down to a 2.2.x kernel completely solves the problem, but of course will bring its own issues with some of my newer packages (such as gcc) and a few pieces of newer hardware.

    A friend of mine and I have gone over this (on my machine and his) and he experiences a lot of the same issues i do.

    Mind you, i'm not complaining. I'm very grateful to all the developers of the world that i even *have* a linux system to run. But this is something that makes me more excited about the kernel 2.6.x series. I haven't tried one out yet, but from what i've heard and read, it should be awesoe. :o)

    --


    do() || do_not(); // try();
    1. Re:I'm a bit leery. by blonde+rser · · Score: 4, Informative

      Since it seems your running debian and all those cpu intensive operations are also hd intensive operations have you checked hdparm -d /dev/hda . I know it is simple but it is so simple that I forgot to check for about a month. Debian appears to have dma off by default.

    2. Re:I'm a bit leery. by Dacmot · · Score: 2, Interesting

      I've experienced this exact behavior before. It's because you don't have DMA enabled for your hard drive(s). "hdparm -d /dev/hda" will confirm that (off).

      To fix it you need to compile the kernel with the right PCI IDE chipset support. The only way I know of checking which one it is is to compile all the PCI IDE chipset support into the kernel and catch yours by doing "dmesg | more" after rebooting with the new kernel. Find which is yours and recompile without the other ones... that should do it.

      With DMA enabled you should see a very good overall speedup and no more lock-ups. Especially for hard-drive I/O stuff like "mkisofs, cdrecord, bzip2 some huge file, cp anything large, installing (via aptitude) or even the "Reading Package Lists...." stage of apt-get update"

  16. Architeture by HogGeek · · Score: 2, Interesting

    I didn't see anything in the articles to support this, but I'm assuming this is based on x86 architecture. Has 2.6 been ported to other architectures? And if so, have these AIM tests been run ?

  17. Re:I just can't do it by falconed · · Score: 2

    I for one welcome our new karma distributing overlords.

    --
    USE='clever' emerge -u sig
  18. strike one off by metroid+composite · · Score: 2, Informative

    Gnumeric (which I have on KDE at least) is a non-sucky spreadsheet. In fact, in the course I was TAing last spring the prof had to switch to it from Excel because it could handle the operations better. The only complaint I have about it is that I can't (or at least I haven't figured out) how to cut and paste into a text document (and vice versa). ...But that was point #4 as opposed to #3, so you can strike one off.

  19. Re:who cares? by Penguinshit · · Score: 2, Informative

    The fix to #5 is easy.

  20. Re:who cares? by Curien · · Score: 3, Insightful

    I agree that those things are issues, but they have nothing to do with Linux (the kernel). This is a new release of /the kernel/. You should only get excited about it if you care what kernel you're running. Most people don't, and they shouldn't (as long as the kernel supports all their hardware).

    --
    It's always a long day... 86400 doesn't fit into a short.
  21. Linux sorta Scales, but the hardware doesn't... by caveat · · Score: 5, Insightful

    I'd love to see Linux being the best OS for multiple CPU scaling.

    You do need a scalable OS to suport lots of processors, of course, but you also need hardware that scales too (clustering doesn't count). Example - SGI is using Linux with NUMAflex on the Altixes to cluster 64-processor system images, but that kind of hardware isn't commodity in any way, and isn't going to be anytime soon.
    Anyway, Linux doesn't scale THAT well...as of 9/2000, SGI was using IRIX for a 1024-processor single-system-image supercomputer; I've heard they can go to 2048 now, but I don't have anything to back that up. Dunno about Solaris, but I imagine it's pretty scalable as well.

    --

    Facts do not cease to exist because they are ignored. - Aldous Huxley
  22. Re:who cares? by Lumpy · · Score: 2, Insightful

    how you got insightful I have no idea...

    Everything you "want" has nothing to do with linux.

    That's like asking ford to build better tires or make higher octane gas.

    ASK the right people for the features you want... the people writing the software you want improvements. in...

    Because you know, Microsoft Windows Stinks because I can't do live editing in Adobe Premiere, same as I wont get excited about Internet Explorer until www.theonion.com get's better graphics...

    do you understand how you sound now?

    --
    Do not look at laser with remaining good eye.
  23. Half rock by roystgnr · · Score: 2, Insightful

    The "We mostly rock" statement was referring to a different benchmark (the one in the story's second link), in which the scheduler performance on single processor machines more than tripled (and performance on 8-way machines went up ~50%) between 2.5.30 and 2.6.0-test5. The first link's benchmark isn't very impressive, like you point out, but it's also not the same program.

    1. Re:Half rock by GooberToo · · Score: 5, Insightful

      You are correct! The scheduler reacts different to different work loads. This is why the kernel developers try hard to test their changes under a number of different workloads. To top it off with, they attempt to target the benchmarks which behave like real-world work loads rather than contrived and unrealistic workloads. That's not to say that they don't test those too, however, they clearly direct more attention at real-world workloads and corrosponding result sets.

      The 2.6x series kernels will be a big step up for just about everyone that seriously uses their computer. Significant realiability improvements as well as faster thoughput on disks, much, much higher scalability for SMP (hyperthreading and numa and even highly loaded uni-systems) systems, and much lower latencies, all at the same time. Granted, there are still some tests which may not be a win-win all the way around, however, almost everything in general is an improvement with hardly any detracters.

      So, saying, "we mostly rock", really is a true statement!

  24. Re:Real world please. by tmasssey · · Score: 3, Insightful
    It's called Lotus Instant Messaging (nee Sametime). And companies are using it in the real world.

    Just becaues you can't see its use outside of a toy, doesn't mean everybody can't.

  25. Smoother scheduling in 2.6.0 by Dr.+Zowie · · Score: 4, Informative

    Devphaeton, you hit the nail on the head about 2.6.0. Its main advantage over 2.4.x (for this luser anyway) is the smoother multitasking even on a uniprocessor system. I'm running a tweaked 2.6.0-test5 on my laptop, and jobs that would make 2.4.x unusable are barely detectable (from the standpoint of moving the mouse around, typing up slashdot articles, and the like).

    Of course, the ACPI support and swsusp doesn't hurt either :-)

  26. SCO Kernels by Schwartzboy · · Score: 5, Funny
    No, no, no! They don't have to "make up" a shred of evidence, you insensitive clod! Bear with me as I walk you through the intensive fact-finding process that will prove beyond a shadow of a doubt that 2.6 does, in fact, have more proprietary SCO stuff in it than any *nix ever has before! Watch as the scene unfolds...

    DARL: So, um, hey. It looks like there's this new "too-pointe-six colonel" out on the market from those Lenn-ucks people. We own all that too, right?

    SUIT: Well, sir, it's like this. Do you remember how the 2.4 kernel had all of those lines of code in them that are ours, even though they showed up in textbooks before most of our stuff existed?

    DARL: Sure, but how does that help us with this new thing?

    SUIT: Think about it. Most operating systems, according to my extensive research during years of never having looked at a computer before, contain the same code that they always did, plus a couple of lines of new comments and an extra variable or two that shows how much you're able to charge users for the new features. Just think about the Windows 95 and 98 thing. Perfect example there.

    DARL: But...my mansion only has 93 windows. Where is this heading?

    SUIT: *blinks* Errr...yeah. Well, it's all the same code, and even those sneaky Linux commies try to pull a fast one on us and put one of those different codes in there, we can always assert our ownership of these "opened sources" files that I just printed out. I asked this guy, you know, and he said that all of these sources are what's in Linux, and since I printed it on paper and stuff, I figure it must be a textbook. Since we own all the words that show up in textbooks, and this has a lot of words, I think we've found ourselves a new angle here.

    DARL: Smithers, cry havoc and let slip the Lenn-ucks colonel lawsuit monkeys once more!


    I do so hate having to correct you people. *sigh*
    --
    "Linux doesn't exist. Everyone knows Linux is an unlicensed version of Unix"- Kieren O'Shaughnessy
  27. Am I missing something here? by AntiGenX · · Score: 5, Insightful
    If you look at the difference between the outcomes for uniprocessor vs dual. There doesn't seem to be very good scaling.

    linux-2.6.0-test5 - 992.06 - Uni
    linux-2.6.0-test5 - 1017.43 - Dual
    linux-2.6.0-test5 - 5406.68 - Quad

    Does this mean that you only gain 3.49% when adding a 2nd processor? Obviously I don't expect things to scale linear but 3%!? Am I missing something here? And then 81.65% for quad? I'm not trolling, I'm looking for someone to explain what I'm missing.

    1. Re:Am I missing something here? by rakarnik · · Score: 3, Informative

      Yes, the number for dual is not 1017, but more like 1545.

      Here are the actual numbers for 2.6.0-test5 and the compute workload:
      1 - 992.06 - 100%
      2 - 1545.03 - 155%
      4 - 5175.28 - 521%

      Now for why the 4 processor case is actually 5 times better than the single CPU case, I do not know enough about the benchmarks to comment.

  28. The kernel isn't everything by skamp · · Score: 3, Informative

    I've tried the new kernel, and I got more responsiveness issues than improvements. But besides that (I might very well have misconfigured something), I'd like to point out that the kernel itself isn't all that matters: the new drivers that accompany it are just as much important. I noticed a significant increase in X's launch time as well as a whopping 250 FPS with glxgears to be compared to the 150 FPS I got with my 2.4.22 setup. This is probably due to major improvements that were brought to the drivers for my i830M chipset.

  29. Re:Some explain to me in layman terms what the hel by gyrovague · · Score: 2, Informative

    This is a simulation of a database load. Basically, larger numbers are better. The numbers are tasks per minute and peak user count. The load adds users each iteration until a max is reached. See http://developer.osdl.org/cliffw/reaim/index.html for more

  30. Scaling, et al by Anonymous Coward · · Score: 2, Interesting
    First, we need the common elements of bproc/mosix in the kernel. The specialized stuff, for each approach, may need to be kept out, but some level of generic process migration support is important.


    Second, unicasting looks to be slower. Ugh. I don't like that. That suggests to me that there are segments of code which are optimized for multi-processor use - which is great - but either there aren't uniprocessor versions, or the uniprocessor versions are highly non-optimal.


    Third, scaling needs to be improved. I don't know if Linux staggers bus use, to optimize usage, but if it doesn't, it should. Perhaps make use of the existing QoS code, or inject wait states if you know an internal bus is going to be heavily loaded.


    Fourth, the scaling between 2-CPU and 4-CPU suggests that something is kicking in at the 4-CPU level that's seriously good. My guess would be NUMA, which IIRC, was tied to the 4+ CPU level. NUMA developers might want to look and see if there's anything which stipulates more CPUs than it actually requires.


    Fifth, we need to sort out this damn RTOS issue. Linux 2.6 is supposed to go RT when the priority reaches a certain level, but it seems to be more of an rtsched-type scheduler trick than actual RT code. There are lots of approaches out there (RTAI springs to mind) and it might be good if someone added some good hooks into the OS for real-time operation.


    Sixth, and this goes along with the above, HP have a scheduler plugin system. Ok, it seems that pluggable schedulers aren't in vogue, but I do like the concept of a scheduling tree with tunable branches.


    Seventh, either use UGASI or don't. Adding in IPSEC is cool, and all that, but the old IPv6 stack is beginning to get stale. (In fact, there's also a lot in the IPv4 stack that's stale, and needs to be replaced.)


    Last, we've got the filesystems. I've not seen much serious CODA or Intermezzo work, recently, and I've never known the kernel-provided Intermezzo to work without problems. On the other hand, we've got Lustre, which seems to be a whole lot faster, and seems to be under active development. Yes, it's "late in the day" to go adding whole new components, but the impact would be minimal, and it would make a lot of networked users very happy.


    (And happy network users give karma to slashdot freaks! :)

  31. Re:Good Question, Bad Arithmetic by stef49 · · Score: 4, Informative

    a quad cpu more performant than 4 * single cpu?
    Odd but not impossible.

    For example, if in the single cpu config the processes are doing a lot of memory-cache missed then having 4 cpus (with 4 times more the amount cache) could reduce the number a cache misses and so could make the quad configuration more than 4 times faster.

    The same reason could explain why 2 cpus are not faster than one: if 2 caches are not large enough and if the processes have a very bad locality then you may get as much cache misses with the dual cpu system than with the single cpu system.

  32. Re: Debian Zealot by Linus+Sixpack · · Score: 2, Funny

    Why knows if Debian lurks in the hearts of men.... The Penguin knows! Debian redefined Linux for me.

  33. Re:Good Question, Bad Arithmetic by AntiGenX · · Score: 2, Funny
    Yeah, sorry I'm on major meds following the removal of my wisdom teeth!

    It may seem like i was on crack, but I promise they were prescription meds.

  34. You want charts? by gyrovague · · Score: 2, Informative

    I don't post charts when sending to a text-only mailing list such as linux-kernel. Not much point to that. If you'd like charts, see the full reports here: http://developer.osdl.org/cliffw/reaim/index.html