Slashdot Mirror


Putting Linux Reliability to the Test

Frank writes "This paper documents the test results and analysis of the Linux kernel and other core OS components, including everything from libraries and device drivers to file systems and networking, all under some fairly adverse conditions, and over lengthy durations. The IBM Linux Technology Center has just finished this comprehensive testing over a period of more than three months and shares the results of their LTP (Linux Test Project) testing."

191 of 296 comments (clear)

  1. Linux Test? by Anonymous Coward · · Score: 5, Funny

    Just put a link to each box on /. and wait 24 hours.

    1. Re:Linux Test? by Zork+the+Almighty · · Score: 2, Funny

      Not just any link. Some, like goatse just don't get the hits whereas others, like geek chicks will down a server in 30 seconds.

      --

      In Soviet America the banks rob you!
    2. Re:Linux Test? by t0ny · · Score: 2, Funny

      IBM: "Oh thank God that went well. Now we can finally ditch AIX...

      --

      Manipulate the moderator system! Mod someone as "overrated" today.

  2. I had a feeling by gooman · · Score: 2, Troll

    Of course, we all knew this in our hearts, nice to see it in writing.

    --
    "Kittens give Morbo gas!"
  3. Almost 1P, but I RTFAd :( by Anonymous Coward · · Score: 4, Interesting

    Anyone know if the test will be repeated with kernel 2.6.x?

    1. Re:Almost 1P, but I RTFAd :( by Wolfrider · · Score: 1

      --I *seriously* doubt 2.6 would do that well, but it would be worth seeing.

      --I'm actually quite surprised the test DID go as well as they said, because the maintainers didn't take the horrible OOM killer out until 2.4.23. (That was the Most Annoying Thing Evar about 2.4.x for me.)

      --
      .
      == WolfriderV6 == I'm willing to admit that *I just might* be wrong... Are you??
  4. USE BAD HARDWARE! by superpulpsicle · · Score: 5, Insightful

    You want to put any OS to the ultimate test, you should run cheap generic hardware. I swear it's an industry conspiracy that generic parts struggle a boat load. If your parts don't come from the big boys (DELL, gateway, etc), you are likely going to see issues down the line.

    Get some ECS motherboard, generic RAM... bang. You're in for the evening.

    1. Re:USE BAD HARDWARE! by Anonymous Coward · · Score: 1, Informative

      Uhh..I have a computer built completely from individual parts that I've bought, and I haven't had any stability problems. Ever.

      Keep in mind that Dell and Gateway's PC's are good because the hardware they choose to include is good. Anyone can buy the same hardware.

    2. Re:USE BAD HARDWARE! by Anonymous Coward · · Score: 1, Informative

      > I swear it's an industry conspiracy that generic
      > parts struggle a boat load.

      Either that, or it's part of the myth of "I can build a box that does XXXX cheaper than *big name here*"

      After experiencing the reality of real world reliability, performance and support hassles, I just have to snigger at ANYONE who comes up with those claims.

      Generic cuts it when I have a server in the next room at home and I'm here all the time that it needs to be working. For anything else, it's just buying extra work for myself.

    3. Re:USE BAD HARDWARE! by spikev · · Score: 5, Insightful

      IMHO, it's the big boys that have the conspiracy to sell crappy hardware. Try performance testing almost any (PC Chips mobos don't count) custom system against a Dell with a similar hardware configuration, and you'll see what I mean.

      I've done it with my ECS board with generic ram, and I came out on top.

      It's the big computer makers that sell the cheap generic hardware. Try getting anything that's essential and non-OEM, hardware or software, to work with one of those boxes.

    4. Re:USE BAD HARDWARE! by mr+i+want+to+go+home · · Score: 1
      I've got to say that I totally agree.

      I almost feel that the results of this test should be more of a tribute to IBM's heavy iron (Power processors...!).

      Although it's also nice to note that this is a PPC flavour of Linux - which *some* people view as being inferior to x86 - being tested for enterprise level performance.

    5. Re:USE BAD HARDWARE! by Duty · · Score: 1

      ...Um, my understanding is that ECS and PC Chips are the same company.

      However, I've never had any problems with my ECS board either, despite what people say about them (and SiS's chipsets.)

    6. Re:USE BAD HARDWARE! by boobsea · · Score: 2, Insightful

      That makes no sense whatsoever

      Are you saying good hardware can compensate for lousy software?

      Good software CAN deal with louse hardware, but only up to a point. Even so, are you going to be running your mission-critical enterprise server on ECS motherboards and knock-off RAM? I hope not.

    7. Re:USE BAD HARDWARE! by router · · Score: 2, Interesting

      Exactly. How the fsck do you think they manage to have a whole corporation hidden in the razor thin margins that exist on commodity hardware? By cutting everything down to the wire. Dell et. al. are like the major automakers, saving a quarter on every piece is 3 million on the bottom line....

      andy

    8. Re:USE BAD HARDWARE! by inode_buddha · · Score: 2, Insightful
      No conspiracy at all, just profit margins . I don't have to worry about that as an individual, so I do OK and still save with decent parts. The hour or so of labor to assemble it all isn't much to me.
      IOW, I agree - pick decent parts and get *exactly* what you want. I usually pick the previous generation CPU and get the biggest mobo I can for that from trusted brands. Then I stuff the mobo with the most it can handle, which is a *lot* nowdays. Of course, I get it all below retail from local OEM's, cash paid in person.
      It seems the "big boys" have all kinds of custom stuff done that makes life hard, but the "white box" stuff is easier/more flexible.

      To further reinforce your point, I built this machine 2 yrs ago for $1200 USD. The equivalent Dell Poweredge was $2200 USD. This is for SMP with big RAM, and yes it plays very nice with linux.

      'Nuff said.

      --
      C|N>K
    9. Re:USE BAD HARDWARE! by BorgDrone · · Score: 1

      I used to own (back in the day) a really cheap motherboard + all shit integrated on it (vga, lan etc.).

      Really, it sucked, windows sometimes crashed on boot on the thing, reinstalling the thing didn't help at all. Reliability was gruesome... until I installed Linux (this machine was one of the reasons I switched). I never had one single crash while the machine was running Linux, running it on Windows (dual-boot) was still as unreliable as ever.

      Offcourse this is only one story.

    10. Re:USE BAD HARDWARE! by Natalie's+Hot+Grits · · Score: 1

      The funny thing is....

      ECS motherboards == PC Chips motherboards

      lol

      Oh well, they still make half decent parts provided you select the correct chipset.

      --
      Two infinite things: your stupidity and mine. But I'm not sure about the latter. If my sig offends you, I'm sorry.
    11. Re:USE BAD HARDWARE! by Anonymous Coward · · Score: 2, Insightful

      But then you have a linux on crappy hardware test. Not an OS test. If you want to test the OS, you have to minimize the impact of other factors.
      Otherwise you may have bad OS stability because you have a bad hardware constellation.

    12. Re:USE BAD HARDWARE! by bhtooefr · · Score: 1

      Thanks for confirming my fears that ECS WAS PC Chips... Looks like ECS is another brand I won't buy...

    13. Re:USE BAD HARDWARE! by bhtooefr · · Score: 2, Informative

      Try running Memtest86 on something like that. It might be the RAM, as Windows handles RAM differently from Linux, and can hit bad parts sooner than Linux (and vice versa).

    14. Re:USE BAD HARDWARE! by silas_moeckel · · Score: 1

      Why whould you ever run a server on some random ECS motherboard (not that they are particualry bad but for arguments sake) As allways if you want reliable get a server motherboard and powersupply with some nice ecc ram and a raid 5 set. Generic PC's are just that junk now granted Linux works extreamly well with generic PC's proably because thats what it gets tested on most. Now MS and IBM general have programmers of workstation class boxes that are closer to servers in there design (Realy the only difference is the AGP slot) I'm assuming on MS's side but know that for a fact on IBM's side.

      --
      No sir I dont like it.
    15. Re:USE BAD HARDWARE! by hey! · · Score: 1

      Well, you'd end up with a different kind of test.

      The value of this test is for people considering Linux for mission critical applications. That is to say the kind of bet-the-company applications that people don't run on a surplus box snatched out of a basement storage room.

      That said, the kind of "hey kids lets put on a show" projects are where open source really shines. I'd swear that if we had to make the decision to pay money up front, my company still wouldn't have an email server, a web server, an internal wiki, or any number of things we now can't live without.

      --
      Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
    16. Re:USE BAD HARDWARE! by ScrewMaster · · Score: 1

      Yes, I know what you mean. We dealt with a mail server application for years because it was really cheap, even after the company that wrote it went under and what little support we had disappeared. Even at its best this program *SUCKED* but hey, it was really cheap. The only reason I was able to push a new system was because the damn program timed out! Can you believe it? Even though it was bought and paid for it timed out anyway. The guy assigned to administer the system had to keep resetting the server's clock back every day: you can imagine the effect that had on the mail stream. I was glad I was only assigned to review and recommend a new package ... the poor admin was inundated with calls from cranky users.

      --
      The higher the technology, the sharper that two-edged sword.
    17. Re:USE BAD HARDWARE! by t0ny · · Score: 1
      Well see, thats the catch. If you ran just about anything on that quality hardware it would run reliably. As for your average Joe Linuxuser, they arent too likely to have a pSeries sitting around their house.

      So essentially this isnt a test of Linux, this is just a test of the pSeries hardware.

      Now IBM can breathe easier, knowing they can just use Linux instead of having to pay people to support AIX. This is a great day for the pointy-haired boss!

      --

      Manipulate the moderator system! Mod someone as "overrated" today.

    18. Re:USE BAD HARDWARE! by Anonymous Coward · · Score: 1, Informative
      Keep in mind that Dell and Gateway's PC's are good because the hardware they choose to include is good. Anyone can buy the same hardware.
      Dell and Gateway tend to use proprietary parts, usually mobo, psu, and RAM. So you can't really make the same computer.
    19. Re:USE BAD HARDWARE! by dsplat · · Score: 1

      I have to second this recommendation. I'm sitting in front of the only system on which I've ever seen Linux crash completely. I'm talking about seeing kernel oops and kernel panic messages regularly. Memtest86 showed that one of my SIMMs was half bad. Until I started using RAM about the 256MB mark, everything worked.

      --
      The net will not be what we demand, but what we make it. Build it well.
    20. Re:USE BAD HARDWARE! by kableh · · Score: 1

      And that mobo is probably an Intel reference design. Same for the RAM and PSU, outsourced to some big manufacturer.

      These aren't mid 90s Packard Hells we're talking about.. =D

    21. Re:USE BAD HARDWARE! by rodgerd · · Score: 1

      Dell and Gateway as exemplars of quality? It is to laugh.

      Dell once shipped me five servers - one was DOA because they didn't stick the processor in the slot, another was just DOA, and the other three failed within a month, with the Dell PERC controllers chewing up all the data on the drives.

      Dell PCs and servers routinely fail within a short timeframe. They use the same supplier of the month policies that any mom and pop outfit might use to keep costs down.

      If you want reliable PC hardware, you're going to have to pay for more than a Dell, or build your own out of decent quality parts.

    22. Re:USE BAD HARDWARE! by BorgDrone · · Score: 1

      I can't do that for the simple reason that the system is no more because it has evolved since that time (as in: every part has been replaced over the years, including the casing, the only original part is the disk drive)

    23. Re:USE BAD HARDWARE! by nyseal · · Score: 1

      Personally, I've had my issues with Dell about a year ago. The very day I received my 'custom built' unit I wanted to replace the hard drive with one of my very own. There were some configuration issues that I had problems with (and I've installed MANY hard drives mind you) and the PC woudn't boot so I called 'technical support'. As it turns out, they apparently don't support 3rd party hardware. I explained I wasn't asking them to 'support' it, just help me install it. Again, no help. After trying the manufacturer, I eventually gave up and re-installed the original HD. Guess what....thing worked like a charm. I got rid of it that day.

      --
      [SIG] Remember Mattel handheld games?
    24. Re:USE BAD HARDWARE! by chthon · · Score: 1
      And that mobo is probably an Intel reference design. Same for the RAM and PSU, outsourced to some big manufacturer

      Yep, probably Jabil for Dell, don't know about Gateway though/.

  5. Re:Linux Reliability? by rjmx · · Score: 5, Funny

    > You're thinking Microsoft Works.

    I'm thinking it doesn't.

  6. You don't trust Microsoft to evaluate Windows... by davidstrauss · · Score: 5, Insightful

    Why do you trust IBM's Linux Technology Center to evaluate Linux?

  7. s/w -vs- h/w failure? by Quixote · · Score: 5, Interesting
    I skimmed over the article (heretic!), and was wondering: how do they distinguish between software failures (the purpose of the test) and hardware failures (for example, random bit errors in the memory that could be caused by higher temperatures due to the stress testing)?

    I seem to recall getting random crashes with cheapo memory, and it was a pain to track down the offending component. Of course, one would assume that IBM wouldn't go for cheapo components, but still: how does one point the finger at the software, instead of hardware? Is it just repeatability?

    1. Re:s/w -vs- h/w failure? by Anonymous Coward · · Score: 5, Informative

      http://www.memtest86.com/ Freeware GPL bootable memory tester for PC platforms... highly recommeded for troubleshooting flaky RAM...

    2. Re:s/w -vs- h/w failure? by jackb_guppy · · Score: 1

      The pSeries has hardware validation built in. It a hard shows failure, it is shutdown (if nessary) and hard tech is called.

      Yes, the box makes the hardware call. It is freak to come to work and have IBM service sitting at your door want into install a hard drive or I/O Card... and the machine is still running.

      And now most of the time, they change it while the machines is active.

    3. Re:s/w -vs- h/w failure? by Mathness · · Score: 1

      If you have the Gentoo Linux 1.4 CD (I haven't checked any of the previous releases, but it might be there), it is included in the boot options.

      --
      Carbon based humanoid in training.
    4. Re:s/w -vs- h/w failure? by kableh · · Score: 1

      Good server hardware, like the IBM systems tested, have mechanisms to report hardware failures. In the case of memory, you would hopefully be using ECC or parity RAM in your server, so if the memory was flaky you'd know. My only experience with this level of kit is Suns, but I know they have dianostics built into their firmware.

  8. Not bad by changelingyahoo.com · · Score: 5, Insightful

    This is nice to hear, but it would be even more valuable if the same tests were performed on a variety of operating systems in order to compare the results.

    Brian

    1. Re:Not bad by ramzak2k · · Score: 1

      good point, and they have to be done under similar test conditions too.

      Stability of a stripped down linux boxen doing just what it was intended to it might be much better than that of windows XP (with all the bells and whistles) but i would really like to see both of them loaded with similar number of apps and look at how their performance match. From my experience, XP has been much better in that instance.

      --

      Siggy Say, Siggy Do
    2. Re:Not bad by BiggerIsBetter · · Score: 1

      In other words, management can see the numbers to back up claims that Linux is reliable, whereas for other OS they must take the word of a salesman.

      --
      Forget thrust, drag, lift and weight. Airplanes fly because of money.
    3. Re:Not bad by router · · Score: 1

      You have tons of experience with both running lots of apps and crashing? Ok, I'll take the bait:

      1. So XP crashes with lots of stuff running?

      2. Linux _has_ lots of stuff to run?

      Stability how? Individual apps failing, or IE crashing and taking down the OS resolver? And then, performance? So, with XP I get remote execution of GUI programs and the reverse, remote display of X on a different workstation?

      Oh, yeah, thought not, try not trolling here. XP has nothing like the "bells and whistles" of Linux, and has no capacity to strip off the crap when its not required. They are not comparable. Run XP on all the platofrms that Linux runs on (hell, run it on an Athlon 64 for one....) and get back to me. Or the hardware SPECIFIED in the article, which was dual and quad Power4 boxes. XP and Linux are not comparable, stop trying. The mere fact that you are trying to do so highlights your bias.

      andy

    4. Re:Not bad by ramzak2k · · Score: 1

      go back to my post and read it again before throwing a tantrum like a baby zealot.

      I wasnt taking away the credit from either of the operating systems. just mentioning the problems i have observed working with linux distros - RH 8.0 , 9.0 and more recently Suse 9.0 pro (dual boot with XP). I do think XP has been much faster to boot/ much more stable when it comes to managing rouge applications (killing one process doesnt affect the rest - freeze the system) on the same PC. Try that with Suse 9.0 pro and watch it lock up.

      --

      Siggy Say, Siggy Do
    5. Re:Not bad by bonehead · · Score: 2, Interesting

      He actually makes some very good points.

      Windows, even the server versions, are not the enterprise class OSs that they are marketed as. This should come as no surprise, because they were not even designed that way in the first place.

      All you have to do to realize this is boot up W2K AS and use it as a desktop machine for awhile. All of the desktop crap is still there sucking up resources. Even Freecell is there, fer cryin' out loud! Try as I might, I can't come up with a good reason for a headless server sitting in a data center to have a copy of Freecell on it.

      I can understand why, with a desktop OS, you would just go ahead and install everything by default, just to make sure that everything works. But why would you do that with an enterprise class server OS? At some level of the chain here, shouldn't MS acknowledge that the intended user of the product actually knows what he's doing?

      At some point in the design process, shouldn't someone have said "Hey, this is going to run on servers in the back room, we could probably ditch Freecell and Solitaire, couldn't we?"

      The fact that they didn't, well... It makes me wonder.

    6. Re:Not bad by bonehead · · Score: 1

      Which resources? All of the media stuff, for one. I don't see much of a reason for a database server to have a copy of Media Player on it.

      And, as you hinted at, I find the notion of a server OS that can't be run without the GUI to be downright stupid. In my mind, a server OS should be able to be administered through nothing but an SSH session, or maybe a simple web interface, making the entire GUI itself a significant and unnecessary drain on resources.

      And, yes, the presence of an app like Freecell on the disk is pretty much irrelevant. It's not the presence of the app that I object to, it's the fact that nobody took 5 minutes to remove it from the default install.

      I'll also concede that Linux isn't perfect in this area, either. However, my recent experiences (which happen to be with RedHat, for no particular reason) have been that when you choose the "server install", you do get a much more reasonable set of apps and services installed than with Windows. But my point wasn't to say that Linux is perfect, I'd switch my servers over to Solaris in a heartbeat if Sun would make a believable commitment to the x86 platform (I work for a small company, the boss doesn't want to pony up for Sparc hardware).

    7. Re:Not bad by bonehead · · Score: 1

      On a server, who cares how fast it boots?

      If your server can't afford to be down for an hour while it boots (hell, I've worked with NetWare servers that took 30 minutes just to spin up all the SCSI disks, one by one), then you need to design it in such a way that it never needs to BE rebooted, except during pre-scheduled maintenance windows.

    8. Re:Not bad by router · · Score: 1

      Rock on dude, I'm sure your viewpoint will be backed up. Or not. Not in my experience, but what the fuck do I know? I have never been unable to kill anything I wanted to on a Linux box, but obviously the syntax of kill (or, xkill) is too much for you. Glad XP works for you. Keep on with it. Love the pause when you put in a CD or floppy, gives you enough time to stuff your face full of doritos. Try not to wipe your hands on your shirt tho, it leaves tell tale orange stains that everyone you see for the next week will recognize. Oh, major hint, try not killing init on Unix boxes. When using ps, make sure you kill the PID that is not 1. Will stop those nasty unix lockups you speak of.

      andy

    9. Re:Not bad by bonehead · · Score: 1

      (until OS X which I think is great).

      OK, now we're venturing WAY off topic, but...

      I've always hated Macs. I've acknowledged that the hardware was superior to most PCs, but what good does that do you when it's running the "OS for Dummies"? No command line at all? Gimme a break...

      As of a month ago, my main computer at home is a Mac. OS X is what Linux wishes it was. (at least on the desktop) All of the unix-ish goodness, plus a polished GUI, and decent availability of commercial apps. I have yet to find a Linux app that has not either already been ported, or couldn't be ported with minimal effort.

      I now have all of the benefits that I used to get from Linux, and using apps like Excel and Quicken is no longer a hassle.

      I'm still running Linux in the back room, but for a desktop unix, OS X is a thing of beauty.

    10. Re:Not bad by bonehead · · Score: 1

      Here's the thing, though, if a machine is taking anywhere in the 30+ minute range to boot, that's probably due to hardware (disks spinning up, etc...), not the OS. Chances are you chose that harware for a good reason, and changing it isn't much of an option.

      I'm not saying you shouldn't investigate ways to decrease the boot time, I'm just saying that switching to a different OS is not likely, in that scenario, to give you the significant boost you're looking for.

      The post that I was replying to was comparing the boot times of OSs. In every case that I've seen where a PC class machine had a long boot time (30 minutes or more) most of that time was spent before the boot partition was ever accessed, and the boot time of the OS itself was a trivial figure.

    11. Re:Not bad by router · · Score: 1

      hey man, if ctrl, alt, and backspace (or alt and F7) aren't on your keyboard, you have bigger problems. At least the box is still up. Moreover, when XFree locks up, blame XFree not Linus (or Linux for that matter). If you don't like it, go fix it yourself. Oh, I forgot, its easier to complain than fix. I'm sorry, what an asshole I am; just take your fucking medicine and pay the stupid computer user tax to MS instead of supporting the XFree developers. Enjoy your dope.

      andy

    12. Re:Not bad by shaitand · · Score: 1

      Andy I am most definately a linux advocate as you'll find reading my previous posts. And killing a process on linux has never locked the system short of kill -9 1, but I have seen gui crashes. And I've certainly seen gui crashes in which keyboard input was ignored and you could not ctrl+alt+anything.

      They certainly aren't an everyday or everyweek or even everymonth event but they do happen. Personally I just walk 3 feet to my wife's computer and ssh into my system and take care of it (the system hasn't crashed after a
      ll).

    13. Re:Not bad by MegaHamsterX · · Score: 1

      Wow, really funny, I have been running SuSE for 5 years now, and never had any of the lockups you mention.

      What are you killing, why are you killing it, and do you know where the problem actually is or are you randomly killing stuff.

      XP does keep the neophyte user from killing anything important, but this also keeps the experienced user's hands tied.

      The only way to properly admin a Windows box is from the command line with a handful of VB or Perl Scripts, the GUI is useless, and isn't this totally anti-windows not to use the GUI for everything.

      A UNIXish machine is easier to properly admin when I need to script to change configurations not accessable from the GUI (I'm actually doing this to remove the inconsistancies of the GUI for reproducable results).

    14. Re:Not bad by bonehead · · Score: 1

      Yep, exactly. And on a server, I'd be a little leery of turning those checks off. If the server really can't be down for an hour, then this is where words like "redundant", "hot swappable", and "generator" start finding there way into your hardware spec. Possibly even mirrored servers with automatic failover.

      If someone's idea of maintaining high uptime is simply "build a machine that boots back up really fast when it crashes", then they're not ready to work on enterprise class stuff yet.

  9. Results... by rmdir+-r+* · · Score: 3, Interesting

    The Linux kernel and other core OS components -- including libraries, device drivers, file systems, networking, IPC, and memory management -- operated consistently and completed all the expected durations of runs with zero critical system failures. Every run generated a high success rate (over 95%), with a very small number of expected intermittent failures that were the result of the concurrent executions of tests that are designed to overload resources. How does that compare with other OS's?

    1. Re:Results... by shaitand · · Score: 1

      dunno, most other OS's have clauses in the EULA which prevent you from publishing benchmarks performed on them.

    2. Re:Results... by rmdir+-r+* · · Score: 1

      They do? What about all those studies paid for by M$ showing windows is faster than linux? Or that the TCO is lower? And how about those hardware benchmarks you see everywhere?

    3. Re:Results... by shaitand · · Score: 1

      "all those studies paid for by M$ showing windows is faster than linux?"

      maybe I'm crazy, but I'm fairly sure (though I haven't seen any) that if M$ paid someone to run benchmarks on the OS then they would also give them permission to do the benchmarks. That's the whole idea, only MS benchmarks on the OS are ever seen by the general public.

      "Or that the TCO is lower?"

      Don't see what that has to do with performance and stability benchmarks so I'm not even sure that would qualify? IANAL so if it does qualify, refer to my answer on the other MS funded studies.

      "hardware benchmarks you see everywhere"

      Those would be umm HARDWARE benchmarks. Not benchmarking the OS itself.

  10. Re:You don't trust Microsoft to evaluate Windows.. by DAldredge · · Score: 1, Insightful

    This is based on a quick scan of the paper.

    Because this test appears to be a fully documented test. It doesn't appear to be a test rigged to make one platform look better than the other.

    Why shoudn't we trust this test?

  11. Re:You don't trust Microsoft to evaluate Windows.. by starnix · · Score: 2, Insightful

    Because what would they have to gain by lying? The true power of opensource is that when someone does point out the weaknesses, they are fixed quickly. IBM knows that if they tell the opensource world "Hey, LINUX is pretty good but it kinda struggles in the (foo) area." that the opensource community will redouble their efforts to fix that. Microsoft is only trying to say "Windows rules, Linux sux. See, this evaluation we did proves it. Buy windows"

  12. Needs to be done independantly by ducomputergeek · · Score: 4, Insightful
    Well I have some Karma to burn tonight. I don't mind that its the 2.4 KERNAL even though 2.6 is ready. Why? We never put anything on our production server that hasn't been out for at least 6 months with exception of security upgrades.

    Second off, If this were M$ testing 2k3 and publishing the paper, everyone here would be crying foul. But because its, "Linux" it must be 100% unbais and true.

    I've been using Linux for 8 years now including under high stress enviroments, 3d graphics rendering mainly, and from experiance I have see very good things from Linux. We have had software glitches before, but the core software maybe has caused 3 - 5% of our downtime. Over 70% of our downtime involves human error and about 25% of failures are due to hardware giving out.

    Still what my customers are wanting to see isn't benchmarks as "So easy Grandma could use it" in Linux. While the people in the datacenters want to know how well Linux will bear under a load, most end-users and SMB's don't need to worry about it, they just need something easy to use that works.

    --
    "The problem with socialism is eventually you run out of other people's money" - Thatcher.
    1. Re:Needs to be done independantly by KrispyKringle · · Score: 2, Insightful
      Couple of small points to nitpick. First, you, as so many others, seem to think it's a novel idea to not implement the bleeding edge beta software on your critical hardware. No need to point this out; you presumably have a standard policy of testing all upgrades and patches on development hardware before moving it to your production equipment (or should, if you can afford it).

      Second, you're probably right about the publishers of the paper, but hey, what can you do? The people with the most interest in these studies are those who have some major investment in the results. Then again, IBM (though perhaps the Linux Technology Center are biased) has AIX as well. But you are right, they have a strong Linux agenda.

      Finally, you make the same old criticism that Linux isn't desktop-ready. Fine. Correct. I couldn't agree more; a properly configured desktop may be easy as pie, but that configuration still isn't automated enough for someone totally unskilled to do. But that doesn't matter. That's not what IBM cares about, it's not what large enterprise users care about, and it's not what most home Linux users really care much about, either. That's why it doesn't get done.

      I don't think anything is likely to challenge Windows on the desktop anytime soon (and this isn't the threat MS see from Linux, either). But Linux is gaining ground on Windows in datacenters and on servers; companies are turning from Windows to Linux, looking for stability and security (what this paper tests), and power users rely on Linux for advanced tools and efficiency. That, not desktop usability, is where the future of Linux lies.

    2. Re:Needs to be done independantly by haruchai · · Score: 4, Insightful

      Well, their bias is clearly stated on their web page- this is the Linux Test project - which is dedicated to evaluating the capabilities and limits of Linux.

      They aren't making a comparison to other OSes or saying that Linux is more suitable than such-and-such operating system; just that it is suitable for particular tasks or environments.

      A comparison between different OSes should be carried out by an independent testing facility but, in this particular case, I don't see anything
      wrong with their modus operandi.

      --
      Pain is merely failure leaving the body
    3. Re:Needs to be done independantly by antiMStroll · · Score: 3, Insightful
      Second off, If this were M$ testing 2k3 and publishing the paper, everyone here would be crying foul. But because its, "Linux" it must be 100% unbais and true.

      An ironic assertion regarding bias. IBM isn't the author of Linux or any of its tools, add-ons, servers, etc. as Microsoft is of 2k3 and its support software. Microsoft also has a long and distinguished history of FUD. IBM doesn't have anywhere near the historical attachment to Linux that MS has to Windows, and IBM hasn't been caught lying about it yet. It would be irrational to treat the two equally at their word.

    4. Re:Needs to be done independantly by Samrobb · · Score: 4, Insightful
      Microsoft also has a long and distinguished history of FUD.

      Pretty amusing that you would say that, considering the origin of the term:

      Defined by Gene Amdahl after he left IBM to found his own company: "FUD is the fear, uncertainty, and doubt that IBM sales people instill in the minds of potential customers who might be considering [Amdahl] products."

      Not that I disagree with your assertions - IBM doesn't have near the same ties to Linux that MS has to Windows. But it's amusing to see how much the technological landscape has changed, that a term coined to describe IBM can now be used to (in some sense) defend it.

      --
      "Great men are not always wise: neither do the aged understand judgement." Job 32:9
    5. Re:Needs to be done independantly by shaitand · · Score: 1

      ok I've heard people say this before like it's an issue. But I've never actually seen ANYONE use windows on the high end servers or datacenters. I've seen windows in small server roles, but I've never heard of someone actually USING clustering capabilities in windows or going beyond what a single system can do. A quad processor x86 box is hardly high end after all.

    6. Re:Needs to be done independantly by KrispyKringle · · Score: 1
      I suppose this doesn't count, since it's run by MS. But they claim they used Windows to serve up the website for the Olympic Games in 2002.

      Not that I think it was really more efficient, reliable, and secure than Unix. But you certainly can implement Windows solutions here, if you really, really want to. It just seems far more trouble than it's worth.

    7. Re:Needs to be done independantly by Trejkaz · · Score: 1

      The definintion of FUD itself did also say "After 1990 the term FUD was associated increasingly frequently with Microsoft"

      --
      Karma: It's all a bunch of tree-huggin' hippy crap!
    8. Re:Needs to be done independantly by Chemicalscum · · Score: 1
      "IBM isn't the author of Linux or any of its tools, add-ons, servers,"

      Large parts of the Linux Kernel code contain the statement "Copyright IBM", these are the parts SCO scum are accusing of being derivatives of System V UNIX. As for servers - IBM may not be the author of Apache but most of its lead developers work for IBM. So I think your statement is not too close to the truth.

      The members of the IBM LTC have a pretty strong attachment to Linux and anyway that secret underground group of anarchists that seems to have been running IBM for the past twenty five years has bet the company on Linux. More power to them

  13. Re:Linux 2.4.19-ull-ppc64-SMP (SLES 8 SP 1) by LurkerXXX · · Score: 3, Interesting

    2.6.0 has only been out for a week. I'm going to want to see someone stress test it for a hell of a lot longer than a week before I call it "enterprise ready".

  14. Re:You don't trust Microsoft to evaluate Windows.. by davidstrauss · · Score: 5, Interesting
    Why shoudn't we trust this test?

    The people performing it have a vested financial interest in having it turn out a specific way, notably positive. If the test resulted showed poor reliability, then I would understand trusting it because it would go against the motives of the people performing it. Since the test affirms their business model, no matter how documented it is, it should be suspect.

    It doesn't appear to be a test rigged to make one platform look better than the other.

    It looks a bit skewed to me. Many of the test results depend on the computer systems meeting expectations of the people testing it, particularly in overload cases. Since the people who tested work in the Linux Technology Center, their expectations stand a greater likelyhood of being consistant with the system.

    Take C/C++ and Java. Someone who regularly works with C/C++ knows certain libraries (notably the character ones) return ints for status in the form 0 being false and not 0 being true. If someone expects that, the system meets expectations and passes. If someone comes from a different background, say Java, he or she may not expect that, and the system would consequently fail the test of meeting expectations. I would like an evaluation from somewhere in-between, not someone whose years of experience allow them to gloss over what might be problems for another person.

  15. Re:You don't trust Microsoft to evaluate Windows.. by davidstrauss · · Score: 3, Informative
    Of course we would not trust IBM to evaluate linux. That's why the used LTP for testing.

    Microsoft commonly hires outside companies to perform their tests. Do you remember the evaluation of Exchange versus Notes/Domino scalability by Ziff-Davis but funded by Microsoft? People justifiably questioned those results, as the company hired (Ziff-Davis) has an interest in pleasing the hiring company (Microsoft) so they get future work.

  16. Re:You don't trust Microsoft to evaluate Windows.. by davidstrauss · · Score: 3, Insightful
    Because what would they have to gain by lying?

    They have much to gain: more corporate customers and more respect and funding by greater IBM. Just because IBM supports Linux doesn't mean its motives are pure (not financially driven). Another reason for bias is the division also stood to have huge setbacks if the tests were unfavorable. How could they justify expansion and better funding if their previous statements about Linux being enterprise-ready were unfounded?

  17. Kernel 2.6 by Jeff+DeMaagd · · Score: 2, Insightful

    I am pretty sure that kernel 2.6 can't be considered "enterprise ready", for one, it hasn't gone through that level of testing.

    Don't knock "yesterday's news". Far be it from some geeks to understand this, but there are times that "tried and true" is more important than having the latest and greatest. This testing started well before 2.6.0 was released! They can probably get started wit 2.6 as soon as an enterprise Linux distribution incorporates it.

    1. Re:Kernel 2.6 by BiggerIsBetter · · Score: 1

      They can probably get started wit 2.6 as soon as an enterprise Linux distribution incorporates it.

      Is it just me, or is that backwards? If SuSE or RedHat wants me to pay $BIGNUMBER for use of their Enterprise distro services, I'd expect the testing to be done before it gets integrated into the distro. In fact, this kind of testing should be part of the doco you get to see before you sign up for the newest Enterprise distro.

      Hopefully this is the start of a trend.

      --
      Forget thrust, drag, lift and weight. Airplanes fly because of money.
  18. Re:Linux 2.4.19-ull-ppc64-SMP (SLES 8 SP 1) by ramzak2k · · Score: 2, Interesting

    I used to use RH8.0 and then RH9.0 and moved on to SUSE 9.0 Pro recently. I noticed that when apps crash RH distros took it much better- You could close the unstable application safely and continue working on the rest. Suse OTOH freezes. Did anyone else notice this ?

    --

    Siggy Say, Siggy Do
  19. Why? Here's why... by Crypto+Gnome · · Score: 5, Interesting
    • because the test methodologies are documented
    • because it's disclosed up-front that it's IBM Linux Team testing Linux (ie no hidden conflict of interest
    As opposed to the usual (ie in the Microsoft World)
    • ZDNet (and/or others) "testing" Microsoft Products (but only vaguely describing how things were configured)
    • Microsoft paying someone to "report" on the quality/performance of a Microsoft product, but the evaluation is worded in such a way as to convince the user that it's an independent review and the "funded by microsoft" fact is never mentioned anywhere in the evaluation
    --
    Visit CryptoGnome in his home.
    1. Re:Why? Here's why... by davidstrauss · · Score: 2
      because the test methodologies are documented

      Yes, they are documented, but some of the evaluation criteria ("expected behavior") depend on the opinions of the team performing the evaluation.

      because it's disclosed up-front that it's IBM Linux Team testing Linux

      Yes, that's better than the Ziff-Davis test (which I'm familiar with and mention in another post on this thread). However, assuming no bias because of a disclosed possibility of bias is illogical. Such disclosure is a necessary but insufficient condition for deeming a test "fair."

    2. Re:Why? Here's why... by Crypto+Gnome · · Score: 5, Informative
      OK, so the Reality Check in this equation amounts to:

      You should not trust this evaluation at all.
      • Go to the site
      • download the testing tools yourself
      • read the test paper
      • use the test methodologies as documented
      • do your best to verify their test results yourself
      • go back to the site
      • post your results for everyone else to see
      (ie follow the good practices of basic science)

      After all... On the internet , nobody knows you're a dog.

      Any JimBOB can write a convinving paper, with all the right buzzwords, that sounds as if X+Y=Z, especially if that was logically a likely/expected outcome in the first place.

      As a well-known TV show once said (several times and loudly) Trust No-One.

      Remember people, YMMV.
      --
      Visit CryptoGnome in his home.
    3. Re:Why? Here's why... by SphynxSR · · Score: 1

      Well I go with the first response post then the test software itself must also be wrong. unless a programer who has no bais looks at it and says it is ok. Oh shit wait I can trust anyone. Now what am I going to do.

      --

      I don't suffer from insanity, I enjoy every minute of it.
  20. Re:You don't trust Microsoft to evaluate Windows.. by DAldredge · · Score: 2, Insightful

    The full hardware/software details of the test are there. If you don't trust it, you have the ability to rerun the tests yourself.

  21. Re:You don't trust Microsoft to evaluate Windows.. by Surazal · · Score: 4, Insightful

    So, ya reply to one point but ignore the rest? I think his (ultimate) point is valid. If the test was rigged, the folks involved with developing the kernel would catch on and take IBM to task for fudging the results. No, I'm not talking about the Slashdot/Fark crowd. I'm talking about REAL developers.

    Also, Linux has weathered some unfavorable (and honost!) critiques before. Linus Torvalds said it best when he said (and I paraphrase since I am too lazy ATM to look up the actual quote) that it doesn't matter if there's negative publicity in the press about Linux. It just meant he got his bug reports from the Wall Street Journal as opposed to the regular kernel mailng list.

    --
    --- Journals are boring; Go to my web page instead
  22. WHAT is the failure? by SharpFang · · Score: 4, Interesting

    95% success ratio... does that mean that 1 in 20 programs I run segfaults or what? What do they mean by "failure"? Not finishing given task in predefined time? Getting the results wrong? Hanging?

    Sorry but that means nothing. Even if there -was- a comparison to other systems, it would still mean nothing. 95% success ratio, 78% happiness factor and 93% user satisfaction.

    --
    45 5F E1 04 22 CA 29 C4 93 3F 95 05 2B 79 2A B2
    1. Re:WHAT is the failure? by be-fan · · Score: 3, Informative

      Its the results from the Linux Text Project suite. 95% success rate, zero critical failures, means that 95% of the 2000 test cases completed successfully, and nothing crashed the kernel. To see what that means, just take a look at what test cases are in the LTP!

      --
      A deep unwavering belief is a sure sign you're missing something...
    2. Re:WHAT is the failure? by Virtex · · Score: 1

      I suspect the 5% failure rate refers to things like dropped packets. This isn't considered a critical error because of the way TCP/IP works -- the client will just resend the dropped packet. If there were incorrect results or processes hanging/segfaulting, that surely would have counted as a critical error.

      --
      For every post, there is an equal and opposite re-post.
  23. Re:Linux 2.4.19-ull-ppc64-SMP (SLES 8 SP 1) by Anonymous Coward · · Score: 1, Informative

    Do a SuSE FTP install of SuSE 9.0.

    It's as close as it's going to get to a SLES release, and it's free. Suse has to many obligations to support propriatory software to make SLES free to download.

    link:
    http://www.suse.com/us/private/download/s use_linux /

    Also you can get a AMD64 bit version from download now, too.

  24. Re:Linux 2.4.19-ull-ppc64-SMP (SLES 8 SP 1) by RALE007 · · Score: 4, Insightful
    You're annoyed that they released the results for a 90 day stress test on a 2.4.x kernel, but not a 90 day stress test on a 2.6.x kernel? The 2.6.x kernel has been out for nine days. How would they have any results on the new kernel? By sending results to us using the Way Back Machine?

    I think IBM used SuSE instead of Redhat because IBM Global Services and SuSE have been partners for almost two years.

    Maybe you should stop hmmmmm'ing about these great mysteries and start googling.

    --
    Beware blue cats moving at .99c
  25. Re:You don't trust Microsoft to evaluate Windows.. by davidstrauss · · Score: 1
    So, ya reply to one point but ignore the rest?

    I think the rest of your post is either biased (assuming IBM only intends this article for a technical audience that won't use it for business decisions) or inflammatory (the comments on Microsoft that mostly have root in the attitude on Slashdot more than Microsoft's actual statements).

    ...and take IBM to task for fudging the results

    I need something more empirical than this assertion. I don't see kernel developers spending weeks duplicating IBM's results just to verify that Linux is as good as IBM says.

    Also, Linux has weathered some unfavorable (and honost! [sic]) critiques before

    This says nothing about IBM's motives. That's like saying the airlines have no ulterior motive in claiming safety because they've whethered problems in the past.

  26. Re:You don't trust Microsoft to evaluate Windows.. by Tony-A · · Score: 1

    Why do you trust IBM's Linux Technology Center to evaluate Linux?

    Because it is to IBM's advantage to find any weakness AND FIX IT before their customers run into the same problems. Any "insider knowledge" would be used to make the tests harder rather than easier. If it were "IBM Linux" rather that "SuSE Linux" that was being tested you'd have at least a chance of making a point.

  27. Re:You don't trust Microsoft to evaluate Windows.. by Unordained · · Score: 1, Interesting

    The people performing it have a vested financial interest in having it turn out a specific way, notably positive. If the test resulted showed poor reliability, then I would understand trusting it because it would go against the motives of the people performing it.

    That may be human instinct, but let's be honest: it's not fair either. Either trust the source of information or don't, but don't trust the result of a test based on the result of the test -- circular dependency.

    It's much simpler to simply say that this is a linux test by IBM, and may therefore be tainted. Did I mention that it's good to have truly independent testing centers, albeit expensive for them? This whole independent/free media thing is rather important ... specifically because of this.

  28. Diagnosing software vs. hardware is easy. by Inoshiro · · Score: 3, Insightful

    Hardware fails in random patterns (bits flipped by beta radiation, for example), software fails the same way in controlled instances (the same flawed logic fails the same way each time).

    So when you run a test 5 times, and you get 5 results, the hardware is broken. When you run the same test 5 times, and it gets to the exact same point before sig11ing, you have a software flaw.

    This is also why you do multiple tests to ensure you're getting an accurate picture of what's going on (flawed or not).

    --
    --
    Internet Explorer (n): Another bug -- that is, a feature that can't be turned off -- in Windows.
    1. Re:Diagnosing software vs. hardware is easy. by dzym · · Score: 1
      So when you run a test 5 times, and you get 5 results, the hardware is broken. When you run the same test 5 times, and it gets to the exact same point before sig11ing, you have a software flaw.

      Wrong. It is perfectly reasonable for busted hardware to break software execution reproducibly ... the exact same way at the exact same moment.

      Your "it's either this or it's the other" approach would never work in the real world of stress testing.

    2. Re:Diagnosing software vs. hardware is easy. by puffing_billy69 · · Score: 2, Informative
      > When you run the same test 5 times, and it gets to the exact same point before sig11ing, you have a software flaw.

      Not necessarily: When uncompressing one of the XFree86 source tarballs, X430src-3.tgz, on my old k6 2-450, gzip would always die with a bad CRC. Nothing else at all seemed to go wrong with the machine, but I couldn't uncompress the file until I downed the memory clock to 66MHz, rather than 100.

      I found one other person with the same motherboard having the same problem in a google search, and also heard there was a problem with that mainboard using ram at 100MHz.

      --
      printf("%s@yahoo.co.uk\n", uid[569754].name);
    3. Re:Diagnosing software vs. hardware is easy. by ImpTech · · Score: 3, Informative

      Bleh, thats not necessarily true at all. A good race condition in a many-threaded program can quite easily look very much like a hardware problem, in that it is difficult to reproduce reliably.

    4. Re:Diagnosing software vs. hardware is easy. by jmv · · Score: 4, Insightful

      software fails the same way in controlled instances

      That's true... in theory. In practice, there are many ways software can fail in random (in the weak sense) ways. Many of these are related to timing. For example if you have many threads and fail to lock things properly, the result will depend on when the tasks are preempted. You can also have different results because of the way the interrupts (disk, net, ...) happen. There's also the not-initialize type of problem where the behaviour depends on whatever was there in memory before. There are probably many other ways for software to fail at random, including obscure combinations of events.

      I'd say that the only kind of software that can't fail randomly is single-threaded and doesn't rely on any input other than regular files (and even then I'm not sure it's enough).

    5. Re:Diagnosing software vs. hardware is easy. by jemfinch · · Score: 2, Informative

      So when you run a test 5 times, and you get 5 results, the hardware is broken. When you run the same test 5 times, and it gets to the exact same point before sig11ing, you have a software flaw.

      This isn't true. If you're running a program that uses a deterministic memory allocation algorithm (a compiler, for instance) and have a segment of bad memory, then you easily could crash at the exact same point (when a pointer in that segment is dereferenced, for instance).

      I know. It's happened to me. I've even had such slightly bad memory that I could compile nearly everything I needed, but one project consistently failed. I took out a bad memory chip (actually it was simply mismatched PC100/PC133) and everything worked fine.

      Jeremy

    6. Re:Diagnosing software vs. hardware is easy. by MoogMan · · Score: 3, Interesting

      You will find that a lot of the trickier bugs can depend on certain [eg. race-] conditions. Such things that are very hard to recreate, even under carefully controlled situations. Then you get the heisen-bug variety etc. Such errors could easily be passed off as hardware failiure. Im sure you can dream up your own examples. (I cant; Im still drunk)

  29. Re:You don't trust Microsoft to evaluate Windows.. by willabr · · Score: 1, Interesting

    IBM only want's to sell it's hardware, I don't think they could care much less about the OS (remember OS/2). As long as they can ride on the back of a free labor force they are happy campers.

    The Wall Street Journal reported last week that IBM had told its managers to plan on moving as many as 4,730 high-tech jobs from the United States, (I wonder if they are the Linux testers)

    Some of whom will be required to train the foreign workers who will replace them.

    Thank's IBM. You will be remembered. I'm happy I have nothing to do with them.

  30. Re:You don't trust Microsoft to evaluate Windows.. by davidstrauss · · Score: 2, Insightful
    Either trust the source of information or don't, but don't trust the result of a test based on the result of the test -- circular dependency.

    I think it's more human than that. If a company or group releases results completely against their interests, integrity is the only reason such a group would go forward. Why would anyone skew results to spite themselves?

  31. Failures needed by Error27 · · Score: 3, Interesting

    This test would have been more interesting if there had been failures. Perhaps they could have tried the test on an older version of Linux, or a different operating system.

    I have been trying to write some tests of my own recently. So far I have found a filesystem OOPs, a ptrace BUG(), and my system locks up on low memory situations. Probably the lockup is because my ethernet driver allocates memory in the interrupt handler (GFP_ATOMIC) and can't handle the result when there is no memory available.

    I need to fix the lock up first of all so the other tests have time to run...

    1. Re:Failures needed by Error27 · · Score: 1

      >>Recently I saw a machine with kernel 2.4.something (SuSE 8.2) which ran out of memory and a lockup happened.

      You could probably hear the drive grinding away in the background right? That's because the kernel is trying to move stuff around to swap space or back. After a while, the kernel just picks a program and kills it to free up memory. I don't know the algorithm for picking which program to kill, but for me it normally kills mozilla.

      Marcelo removed the Out Of Memory killer from 2.4 a couple weeks ago. I'm not sure how it works now.

      When I have low memory, I get messages like this:
      timeoutd: page allocation failure. order:0, mode:0x20
      After that the system locks up completely and the NumLock key stops working. When the NumLock key stops that means the kernel is not responding to interrupts.

  32. MOD PARENT DOWN by Anonymous Coward · · Score: 1, Insightful

    I think the moderators can decide for themselves what they find good or bad.
    Thank you though for assuming you know how everyone else in the world feels :)

  33. Re:Linux 2.4.19-ull-ppc64-SMP (SLES 8 SP 1) by router · · Score: 1

    Or, perhaps because SUSE is qualified as a distro to run on p-series hardware and RedHat (until the recently release RHEL 3.0) is not? Jesus, do some fscking research.

    andy

  34. Re:Linux 2.4.19-ull-ppc64-SMP (SLES 8 SP 1) by RALE007 · · Score: 1

    Think the IGS SuSE partnership had anything to do with SuSE being the only qualified distro to run on p-series hardware? Ow, I think I just ate some bait.

    --
    Beware blue cats moving at .99c
  35. Re:You don't trust Microsoft to evaluate Windows.. by Unordained · · Score: 2, Insightful

    specifically because you'll now trust their future results?

    "wow, last year, they had to admit their product just wasn't up to the task. but now, dang, look at 'em go!"

    yes, it's quite human indeed. you don't know what all they're up to -- what seems to be self-defeating isn't always. and sometimes, well, you honestly find out that you're doing the job you had hoped you were doing, trouncing the competition. go figure: you might actually manage to not suck! but you don't get to tell anyone? and your only solution is to pay someone else to announce it? oh, wait, that's not allowed either!

    any publicity is good publicity. if you can't get good publicity by announcing your product is good, just say it isn't. close enough. it's not like anyone pays attention anyway.

  36. You have to plan for that. by khasim · · Score: 2, Insightful

    Before you roll something out into production, you hammer on it for a while. The crappy parts will fail or generate errors and you can have them replaced.

    In my experience, most of the no-moving-parts hardware will fail within the first week, or last for years and years.

    The stuff with moving parts will eventually fail. But that's harder to predict.

    1. Re:You have to plan for that. by Ozric · · Score: 1

      I was going to mod ... but I wanted to post to this insted.

      YES YES YES. I have seen boards on Million Dollar F15k's fail. It does not matter where the parts came from or who stands behind them, every thing will FAIL. After you turn it on its just MTBF and that is just an adverage. You must build in redundancy. Somtimes it feels like a waste to have a passive clusters and standbys just sitting there, but if you have to have uptime, you must spend the money or you will get burned.

      If you have a say in what goes LIVE in production, make sure you speak up and chime in. Support costs are the most expensive part of any system. Remember when the rubber hits the road, nobody will blame the outtage on the bad design. They will bame you. Get SLA's on crappy systems and CYA. Insurance costs lots of money, you are the policy limiting DR time, but nobody will insure a steamliner with no life boats or a warehouse with no fire systems.

      And my last advice.. IF you must usr MSFT software, build in alot more redundancy!!!!!

      Happy new year and limited downtime to you.

    2. Re:You have to plan for that. by nyseal · · Score: 1

      There were no moving parts on the Pinto failures. It was a static line bolt and a gas tank placement design error. Both could have been fixed by less than $30.00 with a recall. Thanks, Lee.

      --
      [SIG] Remember Mattel handheld games?
  37. Eh by Anonymous Coward · · Score: 1, Offtopic

    The problem is that the moderators act and assume as if their opinions ARE how the rest of the world feels.

    That is the problem inherit in the mob moderation that Slashdot has

  38. Re:Linux 2.4.19-ull-ppc64-SMP (SLES 8 SP 1) by JasonStiletto · · Score: 2, Informative

    for one thing, it would be difficult to run a 3 month stress test on 2.6.0 when 2.6.0 isn't 3 months old, and isn't part of a released enterprise product. If they stress tested one of the betas and it failed, Microsoft would use it for advertising. :)

  39. Re:You don't trust Microsoft to evaluate Windows.. by inode_buddha · · Score: 1

    Maybe because IBM has a few billion invested in Linux since y2k, and a pending lawsuit? Do you think thry would just gamble on that let alone publish the results?

    --
    C|N>K
  40. Maybe.. by Inoshiro · · Score: 1

    But highly unlikely. Plus, with the opensource software, you can always change the controlled variables to encourage the true problem to reveal itself.

    --
    --
    Internet Explorer (n): Another bug -- that is, a feature that can't be turned off -- in Windows.
    1. Re:Maybe.. by shaitand · · Score: 1

      hardly unlikely, it happens all the time. You download the iso, the iso is corrupted, you delete the iso and download it again, the iso is corrupted. Is this is a software failure? by your test yes it is. In reality your hard drive has a few bad sectors and your unallocating them deleting the iso and reusing when you download it again causing the repeated result.

      Or you boot system, start process x, at stage y you get a segfault, you reboot and try it again, and again and again, it always happens at stage y of process x, and doesn't occur when you run other things which appear fine. Is this a software problem? no At stage y process x consumes a massive amount of memory and comes across a bad byte in RAM. Your not running anything else that would require allocating that block of ram and therefore you only see it when running process x at stage y.

      Consistant result, but it's really hardware failure.

      Both of these are so common as to have happened to most every experienced computer user a number of times.

  41. True. by Inoshiro · · Score: 1

    A bad CRC on a really-big gzip file isn't a very granular test. A good way of stressing a system so that it will show you wether it fails reproducibly is to compile a couple different versions of the Linux kernel, and see it if fails in the same spot (or at all). If you ran memtest on it, it'd likely have shown an error. Whenever you aren't sure, run memtest :)

    --
    --
    Internet Explorer (n): Another bug -- that is, a feature that can't be turned off -- in Windows.
    1. Re:True. by puffing_billy69 · · Score: 1
      All true, yes. I ran Memtestx86 for some 30 hours and no errors were detected. Not only did a kernel compile, but Gentoo has gone through bootstrapping on the machine, a couple of times in fact.

      The only point that it would fail is when you emerge xfree. I'm confident I could reproduce it, but I don't want to, of course.

      --
      printf("%s@yahoo.co.uk\n", uid[569754].name);
    2. Re:True. by shaitand · · Score: 1

      You consider multiple kernel compiles and a full blown memory diag proportional troubleshooting when you can't unzip a tarball?

  42. Lies, damn lies... by bersl2 · · Score: 1

    oh, yeah---and statistics. Statistics are the most manipulatable facts. When given with no context, they might as well be lies.

  43. Here goes by floydman · · Score: 5, Insightful

    FTA
    The tests demonstrate that the Linux system is reliable and stable over long durations and can provide a robust, enterprise-level environment.

    Ok, now i dont mean to troll here, so mod down if you wish, i really dont care.... BUT...
    I am a linux user/programmer/lover for the past few years now, and i wanna see a company that is not SO IN LOVE with linux say what have just been said by IBM above.
    In other words, i dont want to see companies who sell Linux, or who have benefit in selling Linux praise it. Does any one of you know of someone who fills in these criteria. Sun for one is not very fond of Linux, nor is MS ofcorse (despite the fact sometimes i doubt they have code in their stuff from Linux...)...to make a long story short
    It would be really nice if such a judgment came from someone else besides IBM/REDHAT/ORACLE...

    --
    The lunatic is in my head
    1. Re:Here goes by Fnkmaster · · Score: 2, Informative
      You mean an "unbiased" industry analyst? The problem is that everybody needs their bills paid by somebody. And these days pretty much everybody in the computer industry has some interests tied up with either Microsoft or Linux (seeing as how most of the old Unix players are becoming Linux players as well - IBM, SGI, (sometimes) HP, Sun...).


      It takes a lot of time and money to do very thorough analyses of operating systems, hardware and enterprise apps. So that money has to come from somewhere. It would be all well and good to say "hi, we're an independent research and analysis lab, we'll write unbiased reports about the state of the industry", but somebody has to fund that shit. And pretty much all that money can be traced back one way or another to some of the big companies in the business who can afford to throw it around for marketing benefits - like Microsoft or IBM.


      In a perfect world, all the customers and potential customers of software would get together and each chip in a little bit of money to fund good, unbiased research. But like in the world of politics, it's easier to get a few special interest groups who have a lot at stake together than to get hundreds or thousands of parties who each have a little at stake to cooperate.

    2. Re:Here goes by sloanster · · Score: 1

      In the spirit of open science, all test programs and scenarios were proivided so that you can duplicate the testing and verify the results for yourself. So run the tests yourself, if you have a suspicion that there is something amiss - that is, unless you'd rather sit on your ass and play armchair quarterback.

  44. Re:You don't trust Microsoft to evaluate Windows.. by inode_buddha · · Score: 1
    OK, lets go through this one step at a time.

    "Just because IBM supports Linux doesn't mean its motives are pure (not financially driven)."
    Last I checked, nobody said anything about pure motives - maybe its just a mutually beneficial relationship.

    "I think the rest of your post is either biased (assuming IBM only intends this article for a technical audience that won't use it for business decisions)"
    Yes. To both technical and business decisions.

    "I need something more empirical than this assertion. I don't see kernel developers spending weeks duplicating IBM's results just to verify that Linux is as good as IBM says."
    Bummer, all that efort being duplicated by HP, NEC, etc. at OSDL. Of course, you could just fund some 3rd party testing yourself.

    SEE SUBJECT: I'd actually be interested in seeing Microsoft publish several evaluations of linux, since I don't trust them with evaluations of Windows. Interestingly, I haven't seen any such thing recently.

    BTW, its "weathered", not "whethered".

    --
    C|N>K
  45. Scaleable.... really? by tonyr60 · · Score: 5, Insightful

    "The Linux kernel properly scaled to use hardware resources (CPU, memory, disk) on SMP systems."

    Sorry, but how can the scaleability of the CPU resource be proven on a 2 CPU system? Show incremental results on 1, 2, 4, 8, 16, 32 etc. etc. and then CPU scaleability may be proven.

    This is NOT an anti-Linux troll, rather the evaluation needs to justify it's outcomes or it starts to look like something from a company starting with M.

  46. Axioms of Christmas Linux Economics by Anonymous Coward · · Score: 3, Funny

    1) Linux is free if your time is worthless
    2) My time is not worthless
    3) Linux is not free
    4) Giving Linux as a Christmas present is not "cheap"
    5) Linux is a good Christmas present.

    1. Re:Axioms of Christmas Linux Economics by sloanster · · Score: 1

      While that's a cute little saying, it has no basis in fact - rather than just posting a silly sound bite, could you explain why you are parroting that tired old shtick?

      I find linux an ideal server bacause it saves time compared to traditional unices - case in point, the most recent openssl, openssh, apache and sendmail vulnerabilities. Our linux systems were all fixed, with no interruptiuon in service, the day the vulnerabilities were published, via the vendors auto-update mechanism. Red Hat and Suse offer such features, and I'm sure other distros do as well. Meanwhile our traditional unix admins were sweating, trying to locate patches, and recompile apps. It took them weeks to get all the HPUX and Solaris systems patched. One of the admins made the comment that it "sucked", and he wished they could be using linux, since it was som much easier to manage.

      I'd love to hear an opposing view -

  47. True in theory, very true in practice. by Inoshiro · · Score: 1

    You can always go and find broken corner cases, but the vast majority of software is single threaded (or behaves that way), and in a testing situation will have normalized input. This leads to fairly regular errors. Obscure combinations of events don't usually happen with software that's mature, since it's been through many combinations and patched to follow them (IE: Apache won't die in a random fashion).

    I've put in many hours of debugging software (my own and others), and all the crashes and other problems that are in software can be reproduced with a test case. The test case is always inputs which travel the path of broken logic. Since most of the computer is a controlled set of variables (IE: library versions won't change during a run, memory bits won't flip, etc), it's very easy to diagnose these -- and very hard for software to fail like you mention unless it's fairly complex. Most software isn't that complex (how complex is Knotes?).

    --
    --
    Internet Explorer (n): Another bug -- that is, a feature that can't be turned off -- in Windows.
    1. Re:True in theory, very true in practice. by Tony-A · · Score: 1

      You can always go and find broken corner cases

      Right. That's where two bugs get together and you notice something.
      Remove either of the bugs and nobody will notice anything.
      Remove both of the bugs and maybe you get a bit closer to having something debugged.

      (IE: Apache won't die in a random fashion)
      One of us severely misunderstands Apache. My understanding is that it is quite possible to run Apache in production with some very broken modules.
      "Try it again. Maybe your browser has some problems." ;-)

      and very hard for software to fail like you mention unless it's fairly complex
      I think it's extremely hard to have anything both interactive and deterministic.

  48. memtest86 on PPC? by Xtifr · · Score: 1

    Minor quibble: I don't think memtest86 is going to do a whole lot of good on the systems that were being tested in this particular case, since the systems that were being tested were PPC, which doesn't run x86 software very well. But yes, it's a nice program to keep around for your x86 hardware. I got my copy from the BBC bootable linux business card CD.

    1. Re:memtest86 on PPC? by Jarnis · · Score: 1

      Shame it crashes and burns on Athlon64 - anyone know a version that would run on the latest hardware?

    2. Re:memtest86 on PPC? by Tony+Hoyle · · Score: 1

      Works here... There's even a native 64bit version I 'm told

    3. Re:memtest86 on PPC? by Jarnis · · Score: 1

      Odd. The floppy-bootable Memtest86 3.0 certifiably instantly reboots the computer that uses MSI KT8 DELTA motherboard and Athlon64 3200+. It flashes the testing screen for about half a second and reboots the computer. Tested on three separate computers using same motherboard+CPU combination.

      Which is kinda teh suck when you try to figure out if you have a bad motherboard or just a bad stick of ram...

      So, could someone point me towards a version (CD/floppy-bootable pre-built one, if possible) that does work?

    4. Re:memtest86 on PPC? by Wolfrider · · Score: 1

      Thanks for that link, whoever you are. :)

      --
      .
      == WolfriderV6 == I'm willing to admit that *I just might* be wrong... Are you??
  49. not much available by Xtifr · · Score: 1

    it would be even more valuable if the same tests were performed on a variety of operating systems

    How many operating systems run on IBM's pSeries machines? AIX and...?

    1. Re:not much available by Kent+Recal · · Score: 1

      Linux?

  50. Unbiased research is REALLY hard to get by Sycraft-fu · · Score: 2, Insightful

    Espically for thigns like this. If a company isn't the one performing the research, chances are they are bankrolling the company that is. This means that generally thigns will be stacked in the favour, and unfavourable results will often be suppressed.

    Even true independants are often not unbiased. For example some individual, with no teis to and OS developers or vendor, might decide to test OSes. Of course it might be that they are a huge Linux or Mac or Windows zealot so again stack things in their favour. You see this fairly frequently with independent Mac test sites. They are MAc heads that work to make thigns look good for the Mac.

    YOu see this in research too. I can't count the number of times I've seen articles, in respected journals, where the researchers have glossed over or ignored something that could contest or invalidiate their findings. They want their hypothesis to be true and so are prone to look at the data that supports it.

    So when you deal with something involving money, you are just going to see some biased results. In scientific research, you generally get other labs testing findings, so the truth is eventually revealed, despite biases. However in bussiness, espically something with quick life cycles like software, forget about it. All tests are going to be biased. Read the results and take them for what they are worth, don't use them ot generalize. Do your own testa, and use what works best for you.

  51. Re:Linux 2.4.19-ull-ppc64-SMP (SLES 8 SP 1) by router · · Score: 1

    Or that they used the distro that was qualified...nevermind. You're right, its a big fscking con-spir-acy to promote SUSE at the expense of all other Linux Distros. My bad.

    andy

  52. So what ? by Alain+Williams · · Score: 3, Insightful

    All that this shows is that a Linux based system works in the way that it should. Would you expect anything else if you ran your: TV, central heating, ... for a long period ?

    The trouble is that, after a period of increased stability in the 1980's, in the last decade people have come to expect that computers fail, and they wonder with amasement if they don't.

    OK: 30 years ago I remember it being a good day if the mainframe stayed up 12 hours. But things have moved on, today you expect your: MVS, VMS, Unix, Linux machine to stay working. The only OS vendor who's products have not matured is the one in Redmond - largely because of rampant infestation with new features.

    The above is not intended to belittle the fantastic efforts of all those involved.

  53. Re:Linux 2.4.19-ull-ppc64-SMP (SLES 8 SP 1) by Anonymous Coward · · Score: 1, Insightful

    Are you locked out of your virtual consoles too? Are you running SATA drives? What are your logs saying after the crash?

    I'm running Suse 9 also, and X has only hung on me once(some other apps hang occasionally, but I'm still able to kill them). The only time it hung, I was still able to switch to a VC and restart the Xserver without a reboot.

    I use the system as my main devel machine, so it gets about 4+ hours of heavy use a day. My latest uptime is over 3 weeks...

    Also, are you running a stock kernel? Recompile and strip things out. I highly recommend people recompile any kernel from any distro before using it regularly. It's just a good idea...

  54. Windows wouldn't last 8 hours! by Anonymous Coward · · Score: 1, Funny

    The reason there's no compare test is because Windows wouldn't last 8 hours without a blue screen crash. So you couldn't even start the test!

    1. Re:Windows wouldn't last 8 hours! by Edogger_in_da_ghetto · · Score: 1

      Windows Xp could!!! Im from the ghetto i know everything so shut up dawg. -_-

    2. Re: Windows wouldn't last 8 hours! by Thundersnatch · · Score: 1

      Oh really?

      Uptime is not really about your OS choice, it's how you manage your systems.

      Here's a report from one machine on my local network, a system which runs SQL 2000 under fairly heavy daytime load (especially disk I/O). The reason uptime is only 64 days is that was the last time we applied a OS-level patch that wasn't covered by other security measures:

      G:\Utilities\PsInfo>psinfo.exe

      PsInfo 1.34 - local and remote system information viewer
      Copyright (C) 2001-2002 Mark Russinovich
      Sysinternals - www.sysinternals.com

      Uptime: 64 days, 0 hours, 35 minutes, 40 seconds
      Kernel version: Microsoft Windows 2000, Multiprocessor Free
      Product type: Server
      Product version: 5.0
      Service pack: 4
      Kernel build number: 2195
      Install date: 1/8/2001, 9:02:44 PM
      Processors: 2
      Processor speed: 1.0 GHz
      Processor type: Intel Pentium III
      Physical memory: 2048 MB
  55. Re:Linux 2.4.19-ull-ppc64-SMP (SLES 8 SP 1) by Spellbinder · · Score: 1

    SUSE 9.0 is a linux based windows
    it is allmost as buggie as windows and almost as easy to use

    --


    stop supporting microsoft with pirating their software!!!!!
  56. Re:Not OT: Graph Software by j0e_average · · Score: 1

    I agree, the graphs do have a very clean appearance. Why don't you pose your question in the comment box at the end of the article? You never know, they (the authors) might actually respond!

  57. Java uses booleans for logic variables by owlstead · · Score: 1

    For those developers that are not so Java aware (see title).

  58. I already knew GNU/Linux was stable by idiotnot · · Score: 3, Interesting

    In fact, it's not much of a question for me anymore -- when there's a problem, it's normally hardware malfunction. I have several machines with 160+ day uptimes, which would be longer if not for an extended power outage at the office.

    IBM just confirmed what I already knew. Guess what, Win2k is pretty stable, too. Sorry, but it's true.

    But, jeeze, isn't anyone else drooling over those systems they tested on? Makes me hate my busted whiteboxes and horrible HP's a little more everyday.

    Repeat after me....."MMMM, dual Power4......MMMM, dual Power4...."

    1. Re:I already knew GNU/Linux was stable by elFarto+the+2nd · · Score: 1

      I have to agree with you there. Win2k IS stable.

      My machine at work had an uptime of 188days+, (it was at 188days before I went on holiday). Then chatting to our sysadmin the other week, he mention that I wouldn't be able to VNC to my machine if I tired, I enquired as to why, and he told me that the PDC had blown up and they stolen my PC as a temporary replacement.

      So, in conclusion, Win2k is super stable, unless another machine in the office goes down, then all bets are off.

      And yes, I do actually do work on my PC.

      Regards
      elFarto
  59. My experience by MadChicken · · Score: 2, Informative

    OK, so I don't have a paper, but I remember my old Linux/P166 running great for a day or so when the CPU fan had died. I only noticed when I rebooted into Windows!

    My notebook has a flaky RAM connection. 32 MB comes and goes depending on how the machine is squeezed. Win 9x products crash it hard, Linux and Win2k don't even notice.

    So in my experience, Linux doesn't mind a hostile platform.

    --
    SYS 64738 NO CARRIER
  60. Look at companies using Linux by JaredOfEuropa · · Score: 2, Interesting
    any one of you know of someone who fills in these criteria.
    The closest you're likely to get is good testimonials from companies using Linux. IBM, SUN etc. all have a stake in Linux, and the 'independant' research outfits are probably funded by them, or by Microsoft (in case Linux needs a good bashing).

    My client is a big megacorp. Their strategy for the coming years is to migrate all Unix systems to Windows/.Net (client side), and to Linux or NT (server side, depending on which OS fits best). This isn't the kind of corporation that makes such a decision after reading a sales brochure or a Gartner article. They research their options, thoroughly. Apparently the conclusion was that Linux is reliable enough to be entrusted with mission-critical stuff.

    The sad thing is that they will (probably) keep the results of this research confidential. Why help the competition with this knowledge?
    --
    If construction was anything like programming, an incorrectly fitted lock would bring down the entire building...
  61. Re:You don't trust Microsoft to evaluate Windows.. by Daengbo · · Score: 1

    Except that this is exactly the test that historians use to guage the accuracy of a primary source and literary deconstructionalists use on the bible to determine who it was really written by.

  62. Re:You don't trust Microsoft to evaluate Windows.. by swillden · · Score: 1

    Why do you trust IBM's Linux Technology Center to evaluate Linux?

    Because the goal of the test is to find out whether or not IBM's customers should feel comfortable using Linux instead of AIX or some other highly reliable OS for mission critical computing applications, and IBM will look really bad in front of its own loyal, big-money customer base if the test results don't hold up in the real world.

    More concisely: Because IBM will lose sales if Linux fails after IBM said it wouldn't.

    --
    Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
  63. unfortunately... by floydman · · Score: 1

    thats why i ask the question, its because i work in a major corp too that migrates all its SOlaris/AIX/IRIX applications, which include major number crunching/visualization to Linux.

    Honestly they were (and i was too despite my years of experience) impressed with the improvement we yield with Linux, and they took their decision based on months of testing, years of experience, and lots of money, and i mean LOTS of it.

    But alaas.....nobody would admit it for the reasons you said.

    But yet its good to know that its gaining ground, and that the this specific marketing hype you hear is true, when it comes to real life experiece you get in the field; though it might not always be, actually it really differes from one env. to another.

    --
    The lunatic is in my head
  64. IMHO by kmichels · · Score: 4, Informative
    Nice to see some number coming in on Linux stability, although, as someone once said: there are lies, damn lies, and statistics. Anyone who has done any stats at University will know that you can prove anything from any result set. And as has already been pointed out by someone, the fact that it has been done by a company who has a not insubstantial vested interest in getting Linux into as many big companies as they can, it carries about as much credability as a Windoze security evaluation paper done by anyone other than Linus.


    The very reason Linux has already made so many inroads into coporations in the first place is because of its reliability and stability, and not because some marketing campaign has churned out the words on header paper.


    Another point is that I personally expect the sytems I administer to run for a darn side longer than 30, 60 or 90 days unless I need to restart them because of a kernel upgrade. When my last bunch I worked for went tits-up, our SAMBA file server had a 790 day uptime, and had run the SAMBA daemons reliably throughout, as well as doing internal DNS and DHCP. That's what your average Linux sysadmin expects from a Linux server box.


    A Linux desktop being used for all manner of things though is completely another story: if I muck around with the Linux install on my laptop, as I do because that's what I do, then I expect to break it from time to time, and so "reliability" is not measured in the same way on a desktop/laptop system, IMHO.


    The ideal environment for Linux is as a networked server, where it can get on with doing what it was setup to do, and will continue doing so until someone pulls the power plug on it. In that context, there are few OS's playing on the same field that can rival it for reliability and stability.

    1. Re:IMHO by bobbuck · · Score: 1
      I agree with most of the above, but I would like to make a point that 90 days at 95% load would be equal to 3 lifetimes on my little server/router at with a loadavg of .02.

      90 days under heavy load should bring out major problems. Even if it doesn't crash you could see performance degradation that indicated something under the surface.

  65. The problem lies not with reliability by Lord+Kano · · Score: 3, Insightful

    but with package and dependency madness.

    I couldn't tell you the number of times I tried to install something and it fails because I was missing "X-Widget-2.41.so.1", so I try to install that "X-Widget-2.41" package and the "X-Widget-2.41-devel" package and they fail because they are missing several other depends as well.

    Linux stability is fine. The GNU software stability is fine. We need a better way to install and maintain software.

    LK

    --
    "Hi. This is my friend, Jack Shit, and you don't know him." - Lord Kano
    1. Re:The problem lies not with reliability by pe1chl · · Score: 1

      Note that they used SuSE Linux. That does not have this problem (like most modern distributions), as it automatically resolves such dependencies and installs the required packages.

    2. Re:The problem lies not with reliability by moZer · · Score: 1

      Holy Dogshit! So there really are people around that haven't heard about automatic dependency resolvers such as apt, yum, up2date, yast, urpmi or portage...even more amazing is that can be moderated insightful.

      --
      Hello, my name is Robert Lerner, and I pronounce Lernux as "99% cpu"
    3. Re:The problem lies not with reliability by Lord+Kano · · Score: 1

      So there really are people around that haven't heard about automatic dependency resolvers such as apt, yum, up2date, yast, urpmi or portage...even more amazing is that can be moderated insightful.

      When is the last time you used urpmi? For me it's been about a week, and it sucks.

      LK

      --
      "Hi. This is my friend, Jack Shit, and you don't know him." - Lord Kano
  66. My experience: Linux survives hard drive crash by crow · · Score: 4, Interesting

    I've been using an old P120 laptop as a firewall/router for my house for the past several years running 2.2.something. I wondered why it rebooted after noticing an uptime of only a day or two, but found that instead I was experiencing the uptime rollover bug (at about 500 days; Windows used to crash on a similar bug after 48 days). About a month ago, it stopped giving out DHCP addresses. I went downstairs to investigate, as I couldn't log in remotely, and found that the hard drive was making that nasty clicking sound. I eventually managed to ssh in (sshd and sh were in ram; I just waited for the logging to time out). I was able to kill syslog and cron, and now dhcp is again giving out addresses.

    It's been running just fine for a month now with a dead hard drive.

    (Yes, I'm getting a replacement because it won't survive an extended power outage on that ancient battery.)

    1. Re:My experience: Linux survives hard drive crash by GoneGaryT · · Score: 1
      I was experiencing the uptime rollover bug (at about 500 days

      Thanks, I'll go Google for this - it happened recently on one of my DNS boxes (SuSE 7.3, kernel 2.4-10GB, old P233 PC). To the best of my calculations, it happened at about 496 days. Generated a bunch of error messages and just kept on chugging away. I still haven't rebooted it - no need.

      Given that the 2.4 kernel wasn't that great around the 10 release, that's pretty impressive. Given that the box was going to be junked and the whole caboodle cost zilch, that's more impressive still.

      Unfortunately, I have no Windows equivalent with which to compare it. Linux is definitely ahead on TCO, though :)

    2. Re:My experience: Linux survives hard drive crash by sloanster · · Score: 1

      Yes, windows had a rollover at 49.7 days - but unlike linux, it would crash hard at 49.7 days, if it ever happened to be up for that long.

      The funny thing is, this situation existed for years, and went totally unnoticed in the microsoft world.

    3. Re:My experience: Linux survives hard drive crash by kbielefe · · Score: 1

      One month with a broken hard drive is nothing. I've been running without a hard drive at all for over 6 months on my laptop (okay, not continuously, but daily). Of course, I've been using Knoppix. I intended to use it only for a couple of weeks until I could replace my hard drive, but I found I didn't miss the hard drive for the type of stuff I use my laptop for. At this point I figure why spend the extra $120 when I've been doing just fine without.

      --
      This space intentionally left blank.
  67. Because it confirms daily experience? by SmallFurryCreature · · Score: 2, Insightful
    Because it confirms daily experience?

    MS is running ads saying how windows XP is so reliable. It is kinda hard to believe when you hear the ad because you a getting a cup of coffee waiting for XP to reboot. Same with 2k3. It crashes. Not as often as XP same as XP doesn't crash as often as 98 and so on. But it still crashes.

    Now on to my linux machines. Wich don't crash. I only run in total about a dozen of them and not one of them has crashed.

    I also have had some experience with AIX. Typically on machines everybody had forgotten about that ran some app that everyone just used and they only noticed its importance when someone unplugs an old useless cable.

    So from my daily experience I will find any report coming from MS saying that they are reliable suspect. From my experience with AIX and Linux I will be far more willing to believe a report from IBM about reliabilty because THEY HAVEN'T BEEN TELLING ME TO MANY LIES BEFORE.

    You of course may have different experiences. Linux if nothing else seems capable of generating wildly conflicting emotions in people. So does MS software come to think of it. Funny that we can get so worked up over a collection of bits.

    --

    MMO Quests are like orgasms:

    You may solo them, I prefer them in a group.

    1. Re:Because it confirms daily experience? by hendridm · · Score: 1

      > Same with 2k3. It crashes. Not as often as XP same as XP doesn't crash as often as 98 and so on. But it still crashes.
      ...
      > Now on to my linux machines. Wich don't crash. I only run in total about a dozen of them and not one of them has crashed.

      Hmmm, maybe you're just better at configuring Linux machines than Windows machines? My two Windows machines never crash. My Linux box doesn't crash either, but it's a vanilla RedHat install and doesn't do a whole lot, so it's no surprise. Both operating systems require some tinkering to get them to work right (and stay working). I don't think Windows is more difficult to keep stable and useful than Linux, but perhaps I've just been lucky.

    2. Re:Because it confirms daily experience? by leonbrooks · · Score: 1
      I don't think Windows is more difficult to keep stable and useful than Linux, but perhaps I've just been lucky.

      You have. (-:

      I have a broad collection of Windows-adminning friends; two of them in particular stand out: they both use HCL equipment, they both stay up to date with updates, one has never had a blue-screen in the office he maintains (maybe 15 desktops, one big Windows server and two [hawk, spit] SCO OpenServer boxes); the other, in a similar office (but one Solaris instead of two OpenServer) schedules his Windows boxes to reboot every night because if he doesn't, several of them blue-screen on him once or twice a week. Somehow, I can't blame the difference on the Sun box...

      I have only ever seen my production Linux boxes crash for two reasons: dodgy hardware (typically failing CPU fans, failing hard drives or invading dust but sometimes marginal RAM or motherboard) or dodgy NVidia drivers.

      --
      Got time? Spend some of it coding or testing
    3. Re:Because it confirms daily experience? by hendridm · · Score: 1

      Yes, I agree Windows is not meant for server use. I've had nothing but problems with Windows services (like IIS and Microsoft's rendition of DNS), but I've had good luck with XP and 2000 on the desktop, for the most part. I run RedHat on most of my servers (both personal and production), although I personally only manage about 3 production servers.

  68. I think you left something out by SmallFurryCreature · · Score: 1
    If you run the test 5 times on 5 different machines. Then you can rule out software vs hardware errors. PRESUMING the hardware is known to be usually good. Remember the old pentiums and their error baked in? Good luck determining that one without some kind of reference material.

    And this is the real way to determine it. Run the suspect software on a piece of hardware known to be good or run a piece of software known to be good on suspect hardware. Testing anything with a single sample is meaningless. If you did this in medicine you would be kicked out so fast you might just discover if mankind can survive lightspeed.

    --

    MMO Quests are like orgasms:

    You may solo them, I prefer them in a group.

  69. 2/3 Authors of Linux Paper - Women by Anonymous Coward · · Score: 1, Insightful

    Two of the three authors were women. Time for a new Barbie doll: "Linux is fun!"

  70. USE GOOD HARDWARE, THEN! by MsGeek · · Score: 1
    Get some ECS motherboard, generic RAM... bang. You're in for the evening.

    There's plenty of good hardware to be had from places like Newegg, Directron and Computer Geeks. Just to name a few. Get yourself an ASUS motherboard, RAM from Crucial or from a reputable manufacturer like Kingston ValueRAM or Viking or Mushkin or Corsair, get a video card from a good manufacturer, and you have a nice solid machine that can handle anything.

    --
    Knowledge is power. Knowledge shared is power multiplied.
  71. Don't Trust IBM to be objective either - failures by Anonymous Coward · · Score: 1, Informative

    The 2.4 kernel has a number of unstable algorithms. Most of the corresponding algorithms in genetic UNIX are stable by design. For example, read the bug postings on RedHat Bugzilla for this critical flaw which has failed to get fixed after more than 6 months: [Bug 89226] (VM)Kernel prefers swapping instead of releasing cache memory

  72. Re:You don't trust Microsoft to evaluate Windows.. by MegaHamsterX · · Score: 1

    IBM has a reputation, so does Microsoft.

    Based on that who would you trust for your backend, must work everyday or we're out of business, needs.

    Every on-site IBM tech I've ever seen is a doctor, can't really say the same for the MCSE set.

    No one ever got fired for buying IBM.

  73. Re:Linux 2.4.19-ull-ppc64-SMP (SLES 8 SP 1) by kiore · · Score: 2, Interesting
    Yes, I do.

    We run SuSE Professional 8.2 on our home machines, one server, two workstations.

    My partner's machine freezes occasionally. Most recent one was yesterday, and even Alt+Ctrl+Backspace wouldn't get control back. I needed to power off!

    I've never been able to work out exactly what causes these freezes (my partner is not very "'puter literate"), but suspect that it may be somehow related to the printing subsystem. One time I did manage to ssh into the machine when it froze and the cups process was feasting on CPU cycles.

    We upgraded to the then latest patches from the SuSE ftp site about two weeks ago. This did not affect the reliability.

    I can't contrast these observations to any other distro. I've only ever used SuSE (since 6.x).

  74. Re:Here is a hint by sloanster · · Score: 1

    lol, gotta love the fantasy world these anonymous linux bashers live in... no idea what they are going on about, hehe...

  75. Re:You don't trust Microsoft to evaluate Windows.. by sloanster · · Score: 1

    why not? IBM is a huge seller of windoze pee cees... If you have a specific concern about the test, feel free to voice it.

  76. Slashdot bias by Overly+Critical+Guy · · Score: 1

    Here is an example of Slashdot bias.

    * Microsoft puts out a Windows reliability test study. Thousands of Slashbots jump on it and accuse it of being biased, maked BSOD and Clippy "jokes," and generally act like lunatics.

    * IBM puts out a Linux reliability test study. Nobody even dreams of calling it biased. It's considered to be a pass of "flying colors," and everyone nods along with the rest of the flock.

    --
    "Sufferin' succotash."
    1. Re:Slashdot bias by chthon · · Score: 1

      This is because we ourselves can rerun the tests if we want to on Linux,with the sources from here. In fact, I intend to do this on my brand new Dual Athlon system.

    2. Re:Slashdot bias by strobert · · Score: 1

      While I will give you that there are a lot of people who act "like lunatics" on /. (I would prefer the term childish -- and I don't think it is limited to /.), I think there is a more practical reason to see that sort of bias in this particular case.

      personal experience. I am fairly sure most people reading slashdot have have far more sucess with linux being stable, reliable, and able to interoperate with things rather than windows.

      I know in my experience most windows services need to be babysat a WHOLE LOT more than my linux services. the base OS can be stable (I generally keep my NT workstation up for weeks at a time without an issue) but combine it with the apps written for it, and probably most importanly the mindset of automation-enabled isn't important.

  77. Obligatory Gentoo... by MarcQuadra · · Score: 1

    You sir, are ready for Gentoo Linux. RPMs are way lame (yes, that's a technical term) compared to Gentoo's 'portage' package management. Seriously give it a try.

    --
    "Sometimes, I think Trent just needs a cup of hot chocolate and a blankie." -Tori Amos on Nine Inch Nails
  78. Re:You don't trust Microsoft to evaluate Windows.. by Wolfrider · · Score: 1

    Parent +2 Insightful

    --
    .
    == WolfriderV6 == I'm willing to admit that *I just might* be wrong... Are you??
  79. Re:You don't trust Microsoft to evaluate Windows.. by Luban+Doyle · · Score: 1

    >The people performing it have a vested financial interest in having it turn out a specific way, notably positive.

    Bad analogy. MS makes and sells their software. This isn't IBM evaluating an IBM software product, i.e. AIX, OS/2, etc., it's a third party product running on their hardware. Positive result doesn't mean IBM sells more hardware or more of their own software, it only shows that they have an offering that you may want to consider among other offerings of hardware with [or even without] a third party OS to meet your needs.

    >If the test results showed poor reliability, then I would understand trusting it because it would go against the motives of the people performing it.

    Only negative results are reliable because no one would fake that? Then why would we ever believe anyone who posts positive results? The "nothing positive is worthwhile" attitude is inherently self-defeating, non-productive and relies on blinding oneself to anything that may be good. Forget to take your meds? And how do you compare all the positive results that are posted with all the negative results that DON'T get posted. There's your real baseline. Comparing all the secret/non-existent data to the public stuff. It's just impossible to do. By your logic that would make it infallible.

    >Since the test affirms their business model, no matter how documented it is, it should be suspect.

    More of the "don't believe the people who have experience, skill and knowledge based on day-to-day operations ; they must be lying because they are more informed than we are" drivel. Documentation of success must be ignored because it proves the point of those who are successful and invalidates my defective criticism of them. This proves that you feel bad, not that they are doing something bad.

    >Many of the test results depend on the computer systems meeting expectations of the people testing it, particularly in overload cases.

    You know of businesses that buy computer systems that they don't expect to meet performance goals that they need to reach? And those unreal-world testing scenarios have more value than using appropriate hardware and software? It's not normally done that way here on planet Earth. Organizations work to succeed, not to fail and then point to someone else as being the responsible party.

    >Take C/C++ and Java.

    I would only take them if I thought they were of some use in the scenario I was going to apply them to. If they don't meet my expectations then I need to stop banging my head against my defective expectations and move on to something more appropriate. If they work, then I have failed because they meet my expectations? Backwards thinking again.

    >I would like an evaluation from somewhere in-between, not someone whose years of experience allow them to gloss over what might be problems for another person.

    Only trust results by those less qualified to evaluate the system or product because their lack of skill proves that they can't be trying to fool you? This isn't skepticism, it is negativism. Anyone can look at proof and say "How do I know you're not fooling me?" The question means that you don't trust yourself to see what is in front of you, not bad intent by the presenter.

  80. Re:You don't trust Microsoft to evaluate Windows.. by Unordained · · Score: 1

    would you care to give us more examples? i've personally thought that the determination of the authorship of texts was based on things like "whoever wrote this writes like whoever wrote that, and we're fairly sure who wrote one of them, but not the other" and so forth. that's more of an analysis of style and similarity of content ...

    if you mean something more along the lines of "was this reporter of events being accurate," in historical terms (religious or not) then yes, that probably happens. should we trust -that- analysis? perhaps not.

    historical accuracy is interesting in that, unlike something like a report about the performance of a newly created piece of equipment, we don't have (exactly) the opportunity to run our own tests. archeology, maybe ... but we aren't actually there, and we can't (usually) recreate the events to experience them again. the same applies to murder investigations, etc.

    the whole point of determining authorship was to attempt to get authority -- agreeing that a text was written by someone "in the know" we could trust. but we don't usually (in the case of biblical texts) know much about the authors other than what's in the text itself, in one way or another, or by tradition (which is highly speculative.)

    with this stuff, it seems to come down to: if you already admit the truth of the text, then it is self-supportive (stand-alone.) if you don't, you may not ever be convinced that it can (actually or theoretically) support itself.

  81. Dell and SoTM by leonbrooks · · Score: 1
    They use the same supplier of the month policies that any mom and pop outfit might use to keep costs down.

    Too true. I've seen the exact same model of Dell laptop with three different video chipsets (NVidia, SiS and one other I can't remember) and two different network chipsets and no external indication that any of them were different to any of the others.
    --
    Got time? Spend some of it coding or testing
  82. I have a funnier tale than that... by leonbrooks · · Score: 1
    they only noticed its importance when someone unplugs an old useless cable

    A local (Perth, WestOz) company decided to go 100% The Microsoft Way when the time came for an office upgrade. On Friday afternoon, the crew of geniuses arrived to rip out their miscellaneous servers an replace them with shiny new (and -ing expensive) Microsoft-running boxen. On Monday morning, the staff arrived to work, and all of their application and file servers worked flawlessly - but no internet.

    During the Sunday afternoon cleanup, Geniuses Inc had collected all of the old hardware, and had they donated it to Computer Angels? No, they'd flung the whole lot into a rubbish skip ("dumpster" in Yankee-land) which had then been carted off at silly o'clock on Monday morning to a waste transfer station, duly separated into recyclable-or-not and the recyclable-ish computers had already been crushed by SimsMetal and carted off by the time Geniuses Inc got around to figuring out what had happened.

    One of the pieces of scrap had operated without intervention as their DSL internet gateway, firewall, name server, intranet and email server for four years. Being Debian Linux, it had given itself security updates, too.

    They had to scan accounting records to even find out who their ISP was (DNS records had of course all well expired after three days and ringing up a known email recipient to ask what the headers said was evidently beyond them), because my friend who had set it up couldn't recall that far back when an ex-employee remembered who he was and Geniuses Inc rang him up. He told me later that his ribs had hurt for several days after he finished laughing. The company had rudely dropped his services after an argument with their then-resident paper tiger about security, and the last-straw trigger for the upgrade was their inability to permanently expunge a particular virus from their LAN.

    Poetic? The justice was damn near epic! (-: They were subsequently taken out by Slammer, too :-)

    --
    Got time? Spend some of it coding or testing
    1. Re:I have a funnier tale than that... by Hobophile · · Score: 1
      One of the pieces of scrap had operated without intervention as their DSL internet gateway, firewall, name server, intranet and email server for four years. Being Debian Linux, it had given itself security updates, too.

      Debian automatically restarts affected services after applying security updates, in every case?

      Does it automatically reboot the machine when it upgrades the kernel also?

      I suspect the answer is "no", so it's entirely possible that this server running unattended for four years was a significant security hole in its own right.

  83. Re:You don't trust Microsoft to evaluate Windows.. by leonbrooks · · Score: 1
    If I had a concern, it would be that IBM sees and understands the control issues behind MS-Windows. This understanding would terrify any competitor with a brain (and since Microsoft must grow or die, sooner or later you will be a competitor to them) and lead to a desire to skew results away from them.

    This is somewhat countered by the observation that, unlike any of Microsoft's alleged benchmarks, anybody who can afford to buy or rent a pSeries can both replicate and publish the results of those benchmarks. They can also read the benchmark code to see exactly what it does, and even tweak it and test the tweaks if they suspect subtle optimisation.

    --
    Got time? Spend some of it coding or testing
  84. Re:You don't trust Microsoft to evaluate Windows.. by Art+Tatum · · Score: 1
    How could they justify expansion and better funding if their previous statements about Linux being enterprise-ready were unfounded?

    No, you've got it all wrong. You get more funding when things are borked. How else is it going to be fixed? :-)

  85. Borderline useless by javamutt · · Score: 1

    I just finished reading the article, and I'm disappointed. It seems to me that the purpose of this article is to provide yet another "Enterprise-ready" stamp of approval for Linux. Now, let me be the first to say that I'm all for Linux in the Enterprise, but this article isn't the way to do it.

    In the Enterprise world reliability equates to availability. This test used high loads over 30 and 60 days. To be Enterprise capable, in my mind, means 2-nines, or 99.0% availability. That's 3.65 days/year or 01:41:00 per week. To be impressive (and worthy of an article) I'd be looking for a minimum of 4 nines (0:52:30/year, 1 minute per week). Most mature sites I work with are closer to the higher 3-nines range on a consistant basis using a commercial *NIX.

    What I'm getting at is that to have a real test give you a sample size that you can trust, 90 days is the bare minimum. I would have expected a 12 month study, and a much better explanation of Linux ability to deal with HW errors. IE - inject disk failures and scenarios which test fault compartmentalization. Doesn't matter what hardware you're running on - a good solar storm can play all kinds of havoc in a data center.

    I'm disappointed that IBM would consider this anything more than a science project. Knowing the resources and talent they have at their disposal I was expecting more.

    1. Re:Borderline useless by mbonar · · Score: 1

      The purpose of the Linux Test Project is not to "prove" anything. The test suite is simply a tool to be used to make linux better. They have been working on the tools and the methodology for a very long time. The tool is ready; the methodology is sound. They have shown that by running a series of tests and releasing them to the community. I am a professional tester and I work for IBM, so you may consider me biased. But, IMHO, the LTP has done a rock-solid series of tests. If I was a kernal developer I would be busy logging enhancement requests from those 5% failures and planning ways to use the suite in my regression tests.

      --
      ... There's no such thing as time; we invented it.
    2. Re:Borderline useless by javamutt · · Score: 1

      I agree that the tools are (LTP) are rock-solid and critical to Linux' growth. I also agree that Linux is ready for prime-time, and aknowledge IBM's significant contributions to advancing Linux in the Enterprise. I even agree with LTP's purpose not being to proove anything.

      My comment was specifically geared towards the article - not the tools, or the product of Linux. I think the article was very soft on details which is unfortunate given the talents IBM has working on Linux. The article seems to have its purpose rooted in adding credibility to Linux reliability, but I don't view the test as showing that. I believe you need a larger sample (test period) to make a conclusive argument about reliability, and a more wide range of tests.

      I work as an availability engineer (studying reasons for downtime and finding ways around them) and the trends I see often take a year to show themselves. We won't even perform an analysis until we have 90 days of data, and that's rarely enough for a trend. An OS' reliability is based on much more than running a load of 90+% for X days. Very few systems actually run with that kind of sustained load in a production environment. Reliabiliyt is also heavily based on it's fault isolation, and resiliance to the unexptected (all of which Linux excells at IMHO).

      I just don't think that this test did what it set out to do, although I'm sure marketing departments think otherwise. I would have covered various types of fault injection, and liklihood of human error for common recovery processes as well. I would also have had 90 days as my low end, not 30 days. Finally, I would have put these comparisons next to other options like Windows and a UNIX or two. I don't think the results would have changed, but I do think the results would be significantly more credible in prooving its capabilities.

    3. Re:Borderline useless by mbonar · · Score: 1

      Thank you, javamutt, for adding that much needed context to your earlier post. When you view the tools from an availability engineering perspective, you can see how they might be used within a larger process of which regression testing is only a part.

      It is unfortunate that they "observed" in their conclusions that their "tests demonstrate that the Linux system is reliable and stable over long durations and can provide a robust, enterprise-level environment." Their tests did show that Linux is "reliable and stable" over the durations specified in their report, but stating that Linux can "provide a robust, enterprise-level environment" is a wee bit too subjective for my tastes, even if I agree with them. I can forgive them for this, though, because when I write my test reports I often try to give my customers some sense of where to go after the completion of the tests, as opposed to just letting the dry facts speak for themselves. In their case, it seems to me that they chose a bit of a marketing slant for their conclusions; not surprising given that they chose to publish it on the internet.

      Cheers!

      --
      ... There's no such thing as time; we invented it.
    4. Re:Borderline useless by chip33550336 · · Score: 1

      Fortuneately this test was just with one server. I am sure if you clustered these servers we would see the availability increase to your expectations. I wonder how many servers it would take?

      It looks like you will need 3 boxes if you desire 4 or more 9's, if you round up. 4 boxes if you don't.

      1 - 95%
      2 - 99.75%
      3 - 99.9875%
      4 - 99.999375%
      5 - 99.99996875%

    5. Re:Borderline useless by javamutt · · Score: 1

      It's not safe to assume that clustering on its own will accomplish this. Most Enterprise cluster environments add significant administrative complexity. Given that human error is responsible for vastly more downtime than actual hardware failures this is significant.

      The most effective means of improving availability is through automation of administrative tasks. For clustering to have a long lasting impact you first need a sufficiently mature IT practice. This alone can get you to a very significant level of "9's".

      The second thing is adequate disk redundancy, and the third is environmentals. By this point you're at 3-4 9's consistantly and clustering is gravy.

  86. Linsucks by Edogger_in_da_ghetto · · Score: 1

    I spilled a soda on my Linsucks and it got all sticky and it broke. And now my Linsucks doesn't turn on. Its a tradegy. :) :) :( :) -_-

  87. You can stop suspecting by leonbrooks · · Score: 1
    Debian automatically restarts affected services after applying security updates, in every case?

    Yes, and the pedant in question would have made absolutely sure, which is the attribute that appears to have most upset the indigent paper tiger (catching said pedant in a mistake is one of those once-in-a-lifetime experiences, very frustrating for someone insecure in their abilities (which pretty much defined said paper tiger)).

    I haven't had much practice on Debian but do remember it restarting services. I know that Mandrake does that, sometimes annoyingly often (e.g. upgrade Apache and all associated modules, you might see a dozen restarts).

    Does it automatically reboot the machine when it upgrades the kernel also?

    Yup. At least, I'm sure that this one does, because all of his other ones do, too (and wait until Sunday morning, and send him email before and after).

    it's entirely possible that this server running unattended for four years was a significant security hole in its own right

    Most of my gateway machines only expose one service (SSH) and that on a non-standard port and sometimes also IP-restricted, so the likelihood of someone stumbling across a service that they can crack even several years after the last update are epsilon. This dude is more paranoid than I. Bet on something that gives you a chance of winning.

    --
    Got time? Spend some of it coding or testing