Slashdot Mirror


Putting Linux Reliability to the Test

Frank writes "This paper documents the test results and analysis of the Linux kernel and other core OS components, including everything from libraries and device drivers to file systems and networking, all under some fairly adverse conditions, and over lengthy durations. The IBM Linux Technology Center has just finished this comprehensive testing over a period of more than three months and shares the results of their LTP (Linux Test Project) testing."

33 of 296 comments (clear)

  1. USE BAD HARDWARE! by superpulpsicle · · Score: 5, Insightful

    You want to put any OS to the ultimate test, you should run cheap generic hardware. I swear it's an industry conspiracy that generic parts struggle a boat load. If your parts don't come from the big boys (DELL, gateway, etc), you are likely going to see issues down the line.

    Get some ECS motherboard, generic RAM... bang. You're in for the evening.

    1. Re:USE BAD HARDWARE! by spikev · · Score: 5, Insightful

      IMHO, it's the big boys that have the conspiracy to sell crappy hardware. Try performance testing almost any (PC Chips mobos don't count) custom system against a Dell with a similar hardware configuration, and you'll see what I mean.

      I've done it with my ECS board with generic ram, and I came out on top.

      It's the big computer makers that sell the cheap generic hardware. Try getting anything that's essential and non-OEM, hardware or software, to work with one of those boxes.

    2. Re:USE BAD HARDWARE! by boobsea · · Score: 2, Insightful

      That makes no sense whatsoever

      Are you saying good hardware can compensate for lousy software?

      Good software CAN deal with louse hardware, but only up to a point. Even so, are you going to be running your mission-critical enterprise server on ECS motherboards and knock-off RAM? I hope not.

    3. Re:USE BAD HARDWARE! by inode_buddha · · Score: 2, Insightful
      No conspiracy at all, just profit margins . I don't have to worry about that as an individual, so I do OK and still save with decent parts. The hour or so of labor to assemble it all isn't much to me.
      IOW, I agree - pick decent parts and get *exactly* what you want. I usually pick the previous generation CPU and get the biggest mobo I can for that from trusted brands. Then I stuff the mobo with the most it can handle, which is a *lot* nowdays. Of course, I get it all below retail from local OEM's, cash paid in person.
      It seems the "big boys" have all kinds of custom stuff done that makes life hard, but the "white box" stuff is easier/more flexible.

      To further reinforce your point, I built this machine 2 yrs ago for $1200 USD. The equivalent Dell Poweredge was $2200 USD. This is for SMP with big RAM, and yes it plays very nice with linux.

      'Nuff said.

      --
      C|N>K
    4. Re:USE BAD HARDWARE! by Anonymous Coward · · Score: 2, Insightful

      But then you have a linux on crappy hardware test. Not an OS test. If you want to test the OS, you have to minimize the impact of other factors.
      Otherwise you may have bad OS stability because you have a bad hardware constellation.

  2. You don't trust Microsoft to evaluate Windows... by davidstrauss · · Score: 5, Insightful

    Why do you trust IBM's Linux Technology Center to evaluate Linux?

  3. Not bad by changelingyahoo.com · · Score: 5, Insightful

    This is nice to hear, but it would be even more valuable if the same tests were performed on a variety of operating systems in order to compare the results.

    Brian

  4. Re:You don't trust Microsoft to evaluate Windows.. by DAldredge · · Score: 1, Insightful

    This is based on a quick scan of the paper.

    Because this test appears to be a fully documented test. It doesn't appear to be a test rigged to make one platform look better than the other.

    Why shoudn't we trust this test?

  5. Re:You don't trust Microsoft to evaluate Windows.. by starnix · · Score: 2, Insightful

    Because what would they have to gain by lying? The true power of opensource is that when someone does point out the weaknesses, they are fixed quickly. IBM knows that if they tell the opensource world "Hey, LINUX is pretty good but it kinda struggles in the (foo) area." that the opensource community will redouble their efforts to fix that. Microsoft is only trying to say "Windows rules, Linux sux. See, this evaluation we did proves it. Buy windows"

  6. Needs to be done independantly by ducomputergeek · · Score: 4, Insightful
    Well I have some Karma to burn tonight. I don't mind that its the 2.4 KERNAL even though 2.6 is ready. Why? We never put anything on our production server that hasn't been out for at least 6 months with exception of security upgrades.

    Second off, If this were M$ testing 2k3 and publishing the paper, everyone here would be crying foul. But because its, "Linux" it must be 100% unbais and true.

    I've been using Linux for 8 years now including under high stress enviroments, 3d graphics rendering mainly, and from experiance I have see very good things from Linux. We have had software glitches before, but the core software maybe has caused 3 - 5% of our downtime. Over 70% of our downtime involves human error and about 25% of failures are due to hardware giving out.

    Still what my customers are wanting to see isn't benchmarks as "So easy Grandma could use it" in Linux. While the people in the datacenters want to know how well Linux will bear under a load, most end-users and SMB's don't need to worry about it, they just need something easy to use that works.

    --
    "The problem with socialism is eventually you run out of other people's money" - Thatcher.
    1. Re:Needs to be done independantly by KrispyKringle · · Score: 2, Insightful
      Couple of small points to nitpick. First, you, as so many others, seem to think it's a novel idea to not implement the bleeding edge beta software on your critical hardware. No need to point this out; you presumably have a standard policy of testing all upgrades and patches on development hardware before moving it to your production equipment (or should, if you can afford it).

      Second, you're probably right about the publishers of the paper, but hey, what can you do? The people with the most interest in these studies are those who have some major investment in the results. Then again, IBM (though perhaps the Linux Technology Center are biased) has AIX as well. But you are right, they have a strong Linux agenda.

      Finally, you make the same old criticism that Linux isn't desktop-ready. Fine. Correct. I couldn't agree more; a properly configured desktop may be easy as pie, but that configuration still isn't automated enough for someone totally unskilled to do. But that doesn't matter. That's not what IBM cares about, it's not what large enterprise users care about, and it's not what most home Linux users really care much about, either. That's why it doesn't get done.

      I don't think anything is likely to challenge Windows on the desktop anytime soon (and this isn't the threat MS see from Linux, either). But Linux is gaining ground on Windows in datacenters and on servers; companies are turning from Windows to Linux, looking for stability and security (what this paper tests), and power users rely on Linux for advanced tools and efficiency. That, not desktop usability, is where the future of Linux lies.

    2. Re:Needs to be done independantly by haruchai · · Score: 4, Insightful

      Well, their bias is clearly stated on their web page- this is the Linux Test project - which is dedicated to evaluating the capabilities and limits of Linux.

      They aren't making a comparison to other OSes or saying that Linux is more suitable than such-and-such operating system; just that it is suitable for particular tasks or environments.

      A comparison between different OSes should be carried out by an independent testing facility but, in this particular case, I don't see anything
      wrong with their modus operandi.

      --
      Pain is merely failure leaving the body
    3. Re:Needs to be done independantly by antiMStroll · · Score: 3, Insightful
      Second off, If this were M$ testing 2k3 and publishing the paper, everyone here would be crying foul. But because its, "Linux" it must be 100% unbais and true.

      An ironic assertion regarding bias. IBM isn't the author of Linux or any of its tools, add-ons, servers, etc. as Microsoft is of 2k3 and its support software. Microsoft also has a long and distinguished history of FUD. IBM doesn't have anywhere near the historical attachment to Linux that MS has to Windows, and IBM hasn't been caught lying about it yet. It would be irrational to treat the two equally at their word.

    4. Re:Needs to be done independantly by Samrobb · · Score: 4, Insightful
      Microsoft also has a long and distinguished history of FUD.

      Pretty amusing that you would say that, considering the origin of the term:

      Defined by Gene Amdahl after he left IBM to found his own company: "FUD is the fear, uncertainty, and doubt that IBM sales people instill in the minds of potential customers who might be considering [Amdahl] products."

      Not that I disagree with your assertions - IBM doesn't have near the same ties to Linux that MS has to Windows. But it's amusing to see how much the technological landscape has changed, that a term coined to describe IBM can now be used to (in some sense) defend it.

      --
      "Great men are not always wise: neither do the aged understand judgement." Job 32:9
  7. Re:You don't trust Microsoft to evaluate Windows.. by davidstrauss · · Score: 3, Insightful
    Because what would they have to gain by lying?

    They have much to gain: more corporate customers and more respect and funding by greater IBM. Just because IBM supports Linux doesn't mean its motives are pure (not financially driven). Another reason for bias is the division also stood to have huge setbacks if the tests were unfavorable. How could they justify expansion and better funding if their previous statements about Linux being enterprise-ready were unfounded?

  8. Kernel 2.6 by Jeff+DeMaagd · · Score: 2, Insightful

    I am pretty sure that kernel 2.6 can't be considered "enterprise ready", for one, it hasn't gone through that level of testing.

    Don't knock "yesterday's news". Far be it from some geeks to understand this, but there are times that "tried and true" is more important than having the latest and greatest. This testing started well before 2.6.0 was released! They can probably get started wit 2.6 as soon as an enterprise Linux distribution incorporates it.

  9. Re:You don't trust Microsoft to evaluate Windows.. by DAldredge · · Score: 2, Insightful

    The full hardware/software details of the test are there. If you don't trust it, you have the ability to rerun the tests yourself.

  10. Re:You don't trust Microsoft to evaluate Windows.. by Surazal · · Score: 4, Insightful

    So, ya reply to one point but ignore the rest? I think his (ultimate) point is valid. If the test was rigged, the folks involved with developing the kernel would catch on and take IBM to task for fudging the results. No, I'm not talking about the Slashdot/Fark crowd. I'm talking about REAL developers.

    Also, Linux has weathered some unfavorable (and honost!) critiques before. Linus Torvalds said it best when he said (and I paraphrase since I am too lazy ATM to look up the actual quote) that it doesn't matter if there's negative publicity in the press about Linux. It just meant he got his bug reports from the Wall Street Journal as opposed to the regular kernel mailng list.

    --
    --- Journals are boring; Go to my web page instead
  11. Re:Linux 2.4.19-ull-ppc64-SMP (SLES 8 SP 1) by RALE007 · · Score: 4, Insightful
    You're annoyed that they released the results for a 90 day stress test on a 2.4.x kernel, but not a 90 day stress test on a 2.6.x kernel? The 2.6.x kernel has been out for nine days. How would they have any results on the new kernel? By sending results to us using the Way Back Machine?

    I think IBM used SuSE instead of Redhat because IBM Global Services and SuSE have been partners for almost two years.

    Maybe you should stop hmmmmm'ing about these great mysteries and start googling.

    --
    Beware blue cats moving at .99c
  12. Diagnosing software vs. hardware is easy. by Inoshiro · · Score: 3, Insightful

    Hardware fails in random patterns (bits flipped by beta radiation, for example), software fails the same way in controlled instances (the same flawed logic fails the same way each time).

    So when you run a test 5 times, and you get 5 results, the hardware is broken. When you run the same test 5 times, and it gets to the exact same point before sig11ing, you have a software flaw.

    This is also why you do multiple tests to ensure you're getting an accurate picture of what's going on (flawed or not).

    --
    --
    Internet Explorer (n): Another bug -- that is, a feature that can't be turned off -- in Windows.
    1. Re:Diagnosing software vs. hardware is easy. by jmv · · Score: 4, Insightful

      software fails the same way in controlled instances

      That's true... in theory. In practice, there are many ways software can fail in random (in the weak sense) ways. Many of these are related to timing. For example if you have many threads and fail to lock things properly, the result will depend on when the tasks are preempted. You can also have different results because of the way the interrupts (disk, net, ...) happen. There's also the not-initialize type of problem where the behaviour depends on whatever was there in memory before. There are probably many other ways for software to fail at random, including obscure combinations of events.

      I'd say that the only kind of software that can't fail randomly is single-threaded and doesn't rely on any input other than regular files (and even then I'm not sure it's enough).

  13. Re:You don't trust Microsoft to evaluate Windows.. by davidstrauss · · Score: 2, Insightful
    Either trust the source of information or don't, but don't trust the result of a test based on the result of the test -- circular dependency.

    I think it's more human than that. If a company or group releases results completely against their interests, integrity is the only reason such a group would go forward. Why would anyone skew results to spite themselves?

  14. MOD PARENT DOWN by Anonymous Coward · · Score: 1, Insightful

    I think the moderators can decide for themselves what they find good or bad.
    Thank you though for assuming you know how everyone else in the world feels :)

  15. Re:You don't trust Microsoft to evaluate Windows.. by Unordained · · Score: 2, Insightful

    specifically because you'll now trust their future results?

    "wow, last year, they had to admit their product just wasn't up to the task. but now, dang, look at 'em go!"

    yes, it's quite human indeed. you don't know what all they're up to -- what seems to be self-defeating isn't always. and sometimes, well, you honestly find out that you're doing the job you had hoped you were doing, trouncing the competition. go figure: you might actually manage to not suck! but you don't get to tell anyone? and your only solution is to pay someone else to announce it? oh, wait, that's not allowed either!

    any publicity is good publicity. if you can't get good publicity by announcing your product is good, just say it isn't. close enough. it's not like anyone pays attention anyway.

  16. You have to plan for that. by khasim · · Score: 2, Insightful

    Before you roll something out into production, you hammer on it for a while. The crappy parts will fail or generate errors and you can have them replaced.

    In my experience, most of the no-moving-parts hardware will fail within the first week, or last for years and years.

    The stuff with moving parts will eventually fail. But that's harder to predict.

  17. Here goes by floydman · · Score: 5, Insightful

    FTA
    The tests demonstrate that the Linux system is reliable and stable over long durations and can provide a robust, enterprise-level environment.

    Ok, now i dont mean to troll here, so mod down if you wish, i really dont care.... BUT...
    I am a linux user/programmer/lover for the past few years now, and i wanna see a company that is not SO IN LOVE with linux say what have just been said by IBM above.
    In other words, i dont want to see companies who sell Linux, or who have benefit in selling Linux praise it. Does any one of you know of someone who fills in these criteria. Sun for one is not very fond of Linux, nor is MS ofcorse (despite the fact sometimes i doubt they have code in their stuff from Linux...)...to make a long story short
    It would be really nice if such a judgment came from someone else besides IBM/REDHAT/ORACLE...

    --
    The lunatic is in my head
  18. Scaleable.... really? by tonyr60 · · Score: 5, Insightful

    "The Linux kernel properly scaled to use hardware resources (CPU, memory, disk) on SMP systems."

    Sorry, but how can the scaleability of the CPU resource be proven on a 2 CPU system? Show incremental results on 1, 2, 4, 8, 16, 32 etc. etc. and then CPU scaleability may be proven.

    This is NOT an anti-Linux troll, rather the evaluation needs to justify it's outcomes or it starts to look like something from a company starting with M.

  19. Unbiased research is REALLY hard to get by Sycraft-fu · · Score: 2, Insightful

    Espically for thigns like this. If a company isn't the one performing the research, chances are they are bankrolling the company that is. This means that generally thigns will be stacked in the favour, and unfavourable results will often be suppressed.

    Even true independants are often not unbiased. For example some individual, with no teis to and OS developers or vendor, might decide to test OSes. Of course it might be that they are a huge Linux or Mac or Windows zealot so again stack things in their favour. You see this fairly frequently with independent Mac test sites. They are MAc heads that work to make thigns look good for the Mac.

    YOu see this in research too. I can't count the number of times I've seen articles, in respected journals, where the researchers have glossed over or ignored something that could contest or invalidiate their findings. They want their hypothesis to be true and so are prone to look at the data that supports it.

    So when you deal with something involving money, you are just going to see some biased results. In scientific research, you generally get other labs testing findings, so the truth is eventually revealed, despite biases. However in bussiness, espically something with quick life cycles like software, forget about it. All tests are going to be biased. Read the results and take them for what they are worth, don't use them ot generalize. Do your own testa, and use what works best for you.

  20. So what ? by Alain+Williams · · Score: 3, Insightful

    All that this shows is that a Linux based system works in the way that it should. Would you expect anything else if you ran your: TV, central heating, ... for a long period ?

    The trouble is that, after a period of increased stability in the 1980's, in the last decade people have come to expect that computers fail, and they wonder with amasement if they don't.

    OK: 30 years ago I remember it being a good day if the mainframe stayed up 12 hours. But things have moved on, today you expect your: MVS, VMS, Unix, Linux machine to stay working. The only OS vendor who's products have not matured is the one in Redmond - largely because of rampant infestation with new features.

    The above is not intended to belittle the fantastic efforts of all those involved.

  21. Re:Linux 2.4.19-ull-ppc64-SMP (SLES 8 SP 1) by Anonymous Coward · · Score: 1, Insightful

    Are you locked out of your virtual consoles too? Are you running SATA drives? What are your logs saying after the crash?

    I'm running Suse 9 also, and X has only hung on me once(some other apps hang occasionally, but I'm still able to kill them). The only time it hung, I was still able to switch to a VC and restart the Xserver without a reboot.

    I use the system as my main devel machine, so it gets about 4+ hours of heavy use a day. My latest uptime is over 3 weeks...

    Also, are you running a stock kernel? Recompile and strip things out. I highly recommend people recompile any kernel from any distro before using it regularly. It's just a good idea...

  22. The problem lies not with reliability by Lord+Kano · · Score: 3, Insightful

    but with package and dependency madness.

    I couldn't tell you the number of times I tried to install something and it fails because I was missing "X-Widget-2.41.so.1", so I try to install that "X-Widget-2.41" package and the "X-Widget-2.41-devel" package and they fail because they are missing several other depends as well.

    Linux stability is fine. The GNU software stability is fine. We need a better way to install and maintain software.

    LK

    --
    "Hi. This is my friend, Jack Shit, and you don't know him." - Lord Kano
  23. Because it confirms daily experience? by SmallFurryCreature · · Score: 2, Insightful
    Because it confirms daily experience?

    MS is running ads saying how windows XP is so reliable. It is kinda hard to believe when you hear the ad because you a getting a cup of coffee waiting for XP to reboot. Same with 2k3. It crashes. Not as often as XP same as XP doesn't crash as often as 98 and so on. But it still crashes.

    Now on to my linux machines. Wich don't crash. I only run in total about a dozen of them and not one of them has crashed.

    I also have had some experience with AIX. Typically on machines everybody had forgotten about that ran some app that everyone just used and they only noticed its importance when someone unplugs an old useless cable.

    So from my daily experience I will find any report coming from MS saying that they are reliable suspect. From my experience with AIX and Linux I will be far more willing to believe a report from IBM about reliabilty because THEY HAVEN'T BEEN TELLING ME TO MANY LIES BEFORE.

    You of course may have different experiences. Linux if nothing else seems capable of generating wildly conflicting emotions in people. So does MS software come to think of it. Funny that we can get so worked up over a collection of bits.

    --

    MMO Quests are like orgasms:

    You may solo them, I prefer them in a group.

  24. 2/3 Authors of Linux Paper - Women by Anonymous Coward · · Score: 1, Insightful

    Two of the three authors were women. Time for a new Barbie doll: "Linux is fun!"