Lack of Testing Threatening the Stability of Linux

Hmm.. by Vickor · 2005-04-22 01:40 · Score: 5, Funny

I thought good technology required insane developers...

Re:Hmm.. by kfg · 2005-04-22 02:55 · Score: 5, Insightful

I have this tendency to respond to serious posts with a joke, and jokes with a serious post. I tend to come at problems from angles of perception that other people do not see.

This is what the best developers do, otherwise they would simply come up with the same mediocre to bad solutions that everyone else does, no?

They do, however, have this really annoying tendency to see everything from the same "Man from Mars" perspective, not restricting themselves to viewing only code differently than most do. This can make them appear "insane" to the general populace.

In the land of the blind the one eyed man is a paranoid schizophrenic.

Insanity is percieving things not as they really are. If the majority percieve things not as they really are the man who does so will give the perception of being insane when he acts upon his perceptions, those acts being unintelligable to the majority.

And thus is born the image of the "quirky" genius. All will hail his new invention, but titter quietly about how he wears his socks, never for one minute stopping to take the obvious point of view that there just might be something of genius in the way he wears his socks, because he wears his socks differently than the majority do.

And being the same is sanity, right?

Nevermind that we innately wipe out genius in one swell foop with that attitude. It enforces a regression to the median, if it weren't for the fact that half the populace would have to progress to the median somehow, which, trust me, they just ain't gonna do. So instead of a regression to the median we get a regression the "really dumb."

Take the current fad for "Playskool" interfaces. . . .please.

Of course, some "geniuses" really are just insane and "luck into" some discovery through their insane perception of things.

So how do you tell the difference? Well, takes one to know one I'm afraid. It would be nice if it didn't seem as if the people who end up in charge of "mental health" weren't all, themselves dimwitted morons at best, and completely, utterly crackers at worst.

They're coming to take me away, HO,HO! HEE, HEE! HA, HA!

I think it's something about the way I wear my socks.

KFG
Re:Hmm.. by Cthefuture · 2005-04-22 06:24 · Score: 2, Insightful

I have this tendency to respond to serious posts with a joke, and jokes with a serious post. I tend to come at problems from angles of perception that other people do not see.

Heh, I do the same thing. I often consider myself the ultimate Devil's Advocate.

When others are stressed, I'm calm. When others are calm, I'm stressed. Bizarre, but that's the way I work. Unfortunately this causes problems for me even in geek circles because they don't see what I see and often I can't explain why I know the things I do.

As for the socks thing, I think it just depends on what you think is important enough to put energy into caring about.

--
The ratio of people to cake is too big

Insanity by wirah · 2005-04-22 01:40 · Score: 2, Funny

Isn't any developer insane?

Re:Insanity by doppe1 · 2005-04-22 02:16 · Score: 2, Funny

It's a fine line between being a genious and a nutter...
Spelling is usually the first clue though.

Vacation for Linus...? by tquinlan · 2005-04-22 01:43 · Score: 5, Insightful

...does it seem like Linus might need a vacation?

TFA states that he's starting to take as much pride in rejecting patches as he does accepting them, and with this whole BitKeeper thing, it seems to me like he might need a small break.

Of course, I'm not one to really talk, as I don't do nearly as much as he does with Linux...

Also, with regards to testing, those of us who use it daily are testing all the time. I know it's not structured QA, but still, it's a lot of testing.

Also, maybe slowing down the kernel releases a bit might help. I know that I do an emerge world on my Gentoo boxes about once a week, and it seems like there's a new kernel release every week. If there's a need for more testing, perhaps a little less time releasing and more time testing is in order.

--
DBA? Software Engineer? My company is hiring! Click

Re:Vacation for Linus...? by mindstrm · 2005-04-22 01:56 · Score: 3, Informative

Why does it have to be an attitude? Linus has always maintained that it's his kernel tree, and that if you don't like the way he manages it, you are more than free to keep your own tree. The kernel is GPL, after all.
You won't hear linus complaining if someone forks his kernel and attention shifts away.. linus will continue to integrate things he wants to integrate.
Re:Vacation for Linus...? by Anonymous Coward · 2005-04-22 02:00 · Score: 4, Interesting

I don't think that is the issue at hand. The issue is the way he is behaving in public. The flames, the "fuck off" attitude towards people working on the kernel, etc...

The kernel did not get where it is with his current attitude.

As I said, the pressure can get to anyone and the kernel is now a mighty beast of a project to maintain. He just needs to get his head screwed on straight. Either that or risk turning into another Theo.
Re:Vacation for Linus...? by SecurityGuy · 2005-04-22 02:01 · Score: 5, Insightful

Also, with regards to testing, those of us who use it daily are testing all the time. I know it's not structured QA, but still, it's a lot of testing.

Isn't that the essence of Microsoft's QA?

Should we be doing what we rightly criticise them for?
Re:Vacation for Linus...? by bfields · 2005-04-22 02:14 · Score: 4, Informative

The issue is the way he is behaving in public. The flames, the "fuck off" attitude towards people working on the kernel, etc...I don't think that is the issue at hand. The issue is the way he is behaving in public. The flames, the "fuck off" attitude towards people working on the kernel, etc...
The kernel did not get where it is with his current attitude.

Oh, yes it did--go spend a few hours reading the lkml archives. He's always flamed people, and always been happy to drop patches that he thought weren't right for one reason or another. There's no sudden change here.... (But I wouldn't call it a "fuck off" attitude. Even when he flames someone he rarely seems to actually hold a grudge, or be unwilling to work with anyone.)
--Bruce Fields
Re:Vacation for Linus...? by molnarcs · 2005-04-22 02:35 · Score: 5, Interesting

Either that or risk turning into another Theo.
Well, that would really be a problem, but despite Theo's personality (which I think might have its own charms) doesn't necessarily get in the way of development. Just think of the huge contributions OpenBSD made. Common Address Redundancy Protocol (CARP) for instance. Or their excellent firewall, pf (now present in all BSDs). Not to mention OpenSSH. And beside these standalone or highly portable applications, they released a secure and stable OS. Not 'just' a kernel. They write their own libc. They maintain a lot of software in their base system. Apache 1.3.x can be almost considered a fork, with their security/stability related patchset. Which comes down to my main point: The problem is not lack of resources, monetary or otherwise
Currently there are ~100 developers payed fulltime just to work on the kernel (at various organizations). There are none in FreeBSD. There are perhaps a dozen devs whose employers let them work on FreeBSD part-time, or there are various works that are sponsored by companies (pair network comes to mind) from time-to-time. But all in all, FreeBSD, that writes its own kernel, its own C library, and generally speaking maintains an OS (userland apps like their package management and ports system for instance, burncd - the native cd burning app of freebsd, etc.) does that with 1/50 of the resource Linux & co has just to develop the kernel.
This is not about linux vs. freebsd btw. I chose to use the latter, you chose the former, I really don't care, and I'm not willing to engage in yet another linux vs. bsd flamefest. You can argue endlessly about why linux is better, and I can do the same about FreeBSD, but I think we can agree on one point: either way, neither is that much better (lets cut down that figure to 10x - you can't possibly claim that linux is 10x better or something). In other words, my point is that it is not about (monetary) resources. It is a problem of organization imho. Less frequent releases, more API/ABI stability, a controlled release engeneering process might be a solution. Perhaps a branch split like it was done during 2.5.x (current 2.6) development. Pronounce the current 2.6.x branch STABLE, meaning introducing a POLA (policy of least astonishment in freebsd) and forbid API/ABI changes, then continue development in a new, 2.7 branch at the current pace.
I don't mean to imply that there is no release engeneering in linux kernel development whatsoever. But somehow FreeBSD's (and I assume the other BSD's as well) release engeneering seems to me a lot more transparent. Click the first few links at the top of this page to see what I mean by "controlled release engeneering process."
Re:Vacation for Linus...? by ShieldW0lf · 2005-04-22 02:51 · Score: 2, Insightful

Also, with regards to testing, those of us who use it daily are testing all the time. I know it's not structured QA, but still, it's a lot of testing.

Isn't that the essence of Microsoft's QA? Should we be doing what we rightly criticise them for?

Say what you want about Microsoft and the stability/reliability/security of their software, but they have many full time (and paid) people devoted exclusively to testing and trying to break their software so that it can be fixed.

He's saying that they DON'T test their product enough, and that they DO ship it broken and have the community test it, and that the open source projects that are so critical of MS for doing this should aspire to something better.

You totally missed the point.

--
-1 Uncomfortable Truth
Re:Vacation for Linus...? by popeyethesailor · 2005-04-22 03:07 · Score: 3, Interesting

Scott Guthrie describes how they test ASP.NET in this blog posting
Read that, you might learn a thing or two.
Re:Vacation for Linus...? by iabervon · 2005-04-22 03:34 · Score: 2, Interesting

Linus has always felt that his main role was to reject patches. If you take just anything, people won't refine patches to the point where they maintain or improve the overall quality of the code. Andrew and Linus essentially do a good cop/bad cop routine on patches.

Of course, he's essentially been on vacation from Linux work, developing git. I'd guess that writing his own thing has make him feel a lot better about the BitKeeper mess. He certainly seemed to be having fun coming up with brilliant solutions to problems, rather than the current kernel situation of endless refinement.

As for testing, the article is misinterpreting its own quotes. There's no lack of testing, and Andrew didn't say there was. There is a lack of reporting of test results and a lack of credit to people who would provide them. There is a lack of communication infrastructure for getting bug reports to people who might be able to fix them, to other people who may or may not be seeing the same problem (and can provide more details on when the bugs are triggered).

Of course, slowing down releases would make testing more difficult, because people don't test kernels that aren't released, and testing fewer releases just means that more people report the same bugs, because the fixes for those bugs are held up waiting for the next release.

The lack of properly stable releases should be fixed by the 2.6.x.y process; now an effective reporting process is needed to help the maintainers find out about bugs people hit, and determine whether correctness fixes actually deal with cases that happen in practice.
Re:Vacation for Linus...? by ajs · 2005-04-22 03:44 · Score: 3, Interesting

"Currently there are ~100 developers [paid] fulltime just to work on the kernel (at various organizations)."

I would be shocked if the number is that small.

"There are none in FreeBSD. There are perhaps a dozen devs whose employers let them work on FreeBSD part-time, or there are various works that are sponsored by companies (pair network comes to mind) from time-to-time."

That's a shame, but ok...

"[FreeBSD is released] with 1/50 of the resource[s of] Linux [...]"

Right, and so the fact that FreeBSD works well is quite impressive. The fact that it doesn't work at all on certain high-end platforms, obsolete platforms, lots of embeded platforms, etc., is also not shocking nor does it make it a poor platform.

FreeBSD does what it can with the resources it has, and that's a good thing. Let's not try to compare them to Linux. Linux is Linux and BSD is BSD. They are excellent tools for different jobs.
Re:Vacation for Linus...? by ArbitraryConstant · 2005-04-22 04:06 · Score: 2, Insightful

"Also, with regards to testing, those of us who use it daily are testing all the time. I know it's not structured QA, but still, it's a lot of testing."

When having users stumble into bugs is your primary method of finding them, your QA has already failed.

Because they do active development on the 2.6 branch, new bugs are introduced all the time. Even if they're only there for one version, there's always more bugs in the next version, which is a big disincentive for upgrades. And not minor stuff, big things like the ability to burn CDs.

Without proper regression testing stuff like that will continue to haunt users. The assumption is that distros will do it, but the simple fact is that they aren't. The kernel developers must take responsibility for it.

--
I rarely criticize things I don't care about.
Re:Vacation for Linus...? by ajs · 2005-04-22 04:24 · Score: 2, Insightful

The point is that they don't come "remotely close", and there's the fact that these things do not scale linearly. In order to support 12 platforms as shipped, you have to do far more than 2x the work of supporting 6 platforms. Why? Because those first 6 are the ones that are most alike, and used by the largest intersection of developers. The other six are used by niche develpers and tend to be less like the first six.

This, of course, ignores the work that goes into special subsystems for popular platforms, special hardware, obsolete hardware, new protocols and standards, etc.

But again, this does not mean that FreeBSD is bad or poorly organized or useless. It's a fine OS that I recommend to people all the time. It's just that there's a different audience.
Re:Vacation for Linus...? by bit+trollent · 2005-04-22 06:37 · Score: 2, Insightful

He's saying that they DON'T test their product enough, and that they DO ship it broken and have the community test it,

Yes, but he is full of shit. Lots of people are missing the point around here today.
Re:Vacation for Linus...? by ajs · 2005-04-22 07:02 · Score: 2, Insightful

It's different in the Linux world, for the reasons that you pointed out before.

The kernel team tests and releases in one pass, which is roughly akin to unit testing in a large project that has several sub-projects.

Then distributions pick up the changes (it's really not that clean a separation, but let's say it is for sake of simplicity), and incorporate it into their OSes. Each distribution has its own unique QA/release process, so let's look at Red Hat as an example. They take some internal things, some external things, the "official kernel" and start testing it with their system. They make some changes, give some feedback/fix bugs/etc. and eventually they come up with a collection of patches that they feel brings the kernel to the point they want it (they repeat this for hundreds of packages, some larger, most smaller than the kernel).

The original source+patches is packaged in two forms: an SRPM, which contains all of the discreate pieces and an RPM which contains the result of unpacking the SRPM, applying the patches inside it to the base code inside it, building and installing along with any pre- or post-install steps that are required.

That's all shunted into Red Hat's final release process which I know almost nothing about, but I presume it involves test farms which they use to stress the new OS in various ways. This might, of course, result in bugs discovered, and further iterations of the process.

Moderators by ciroknight · 2005-04-22 01:43 · Score: 2

I know it's early, but do we really have to mod everything flamebait, even if it's hilarious??? Come on..

--
"Victory means exit strategy, and it's important for the President to explain to us what the exit strategy is." G.W.Bush

Re:Moderators by RangerRick98 · 2005-04-22 02:00 · Score: 3, Funny

Moderators (Score:1, Flamebait)
by ciroknight (601098)
I know it's early, but do we really have to mod everything flamebait, even if it's hilarious??? Come on..

Evidently.

--
"You're older than you've ever been, and now you're even older."
Re:Moderators by DenDave · 2005-04-22 02:18 · Score: 3, Funny

I thought they suffered from..
Developers

Developers

Developers

Developers

--
-if at first you don't succeed, stay the heck away from paragliding.

In contrast to the MS method... by Dale549 · 2005-04-22 01:44 · Score: 5, Funny

where the testers (a.k.a. users) get to pay $$$ for the privilege of testing OS stability.

Re:In contrast to the MS method... by ciroknight · 2005-04-22 01:47 · Score: 4, Interesting

I'm really surprised no company really has used this as a business model.

I think it'd be awesome to run a software debugging/testing firm, where basically you have a bunch of computers and a bunch of users come in and try their best to break the software. Cheap labor and a good variety in machines, and you could quite quickly clean up even some of the nastiest code.

--
"Victory means exit strategy, and it's important for the President to explain to us what the exit strategy is." G.W.Bush
Re:In contrast to the MS method... by SpaceLifeForm · 2005-04-22 01:55 · Score: 2, Interesting

That may work in userland, but it's not that simple in the kernel where bugs can lurk for years and only appear due to un-related code changes that effect code path execution and timing.
You may be able to 'break it', but can you repeatedly 'break it'?
Can you predictably reproduce the bug?

--
You are being MICROattacked, from various angles, in a SOFT manner.
Re:In contrast to the MS method... by ciroknight · 2005-04-22 01:58 · Score: 3, Interesting

Virtual machines can help with this; running the kernel in a sandbox to get an actual snapshot of the kernel in action. But at the same time, the kernel's going to be running, and userland/kernel-land interaction will cause plenty of bugs to crop up and show themselves. But you are right; it's hard to poke at a kernel to see what's broken, especially when some code paths are very hard to follow and others are almost never used on certain systems.

--
"Victory means exit strategy, and it's important for the President to explain to us what the exit strategy is." G.W.Bush
Re:In contrast to the MS method... by Lumpy · 2005-04-22 04:25 · Score: 2, Insightful

I'm really surprised no company really has used this as a business model.

you don't work with Vertical market software do you.

all of our critical sales apps are considered alpha testing. by the time an app becomes stable and useable they retire it and sell us their next abomination that does not work right, has 1/3rd the features the sales guy sold the CTO on and has stability problems that make any Admin cry (imagine a printer driver on your W2K box causing data corruption in an app... WTF is THAT!

look at applications used by small segments of industry, there is wher eyou will find the untested and crappy code sold for $2000-$6000 per seat.

--
Do not look at laser with remaining good eye.

If chaos is so close at hand now... by frankblack9999 · 2005-04-22 01:46 · Score: 3, Interesting

just imagine what'll happen if Linux actually makes a dent in the non-geek desktop market, and widespread use by "appliance operators" ensues.

Apologies in advance by frankthechicken · 2005-04-22 01:46 · Score: 5, Funny

Must be why there is a huge popularity of mugs at MS bearing the obnoxious logo:-

"You don't have to be a developer to work here, but it helps".

Loose women ??? by noisymime · 2005-04-22 01:46 · Score: 5, Informative

Coming from someone who was at that talk, he specifically said NOT to give money to testers. His words were actually 'give them credit, fame and loose women'.

This drew laughs from the audience.

Re:Loose women ??? by Intron · 2005-04-22 02:34 · Score: 3, Funny

Everyone knows that the loose women are using Apple

--
Intron: the portion of DNA which expresses nothing useful.

Re:Contrapositive by jejones · 2005-04-22 01:47 · Score: 5, Informative

That's the converse. The contrapositive is, after a quick application of de Morgan's law:

"If it doesn't come to tears, then you didn't pick a good technology or your developers are sane."

Do we need something like what Sun do? by Anonymous Coward · 2005-04-22 01:48 · Score: 3, Informative

Osnews had an article a while ago about some of the testing Sun do on Solaris - http://www.osnews.com/story.php?news_id=10178

Does Linux have a built-in crash reporter? by G4from128k · 2005-04-22 01:50 · Score: 3, Interesting

Testing of Linux might be easier if it contained some automated features for sending crash reports back to a central database. Gathering some basic data on the stack trace, thread states, processes, etc. might help troubleshoot the OS in the context of the wide array of systems, configurations, and usage patterns. I know that both Microsoft and Apple have benefited strongly from this feature. Some tin-foil-hat wearers might object to their box phoning home. Tin foil hatters can just disable the feature but it might mean that the types of bug they experience never get fixed.

If developers are going to fix the bugs that occur in the real world, they need data from the real-world.

--
Two wrongs don't make a right, but three lefts do.

Open Source Means More Eyeballs? by xxxJonBoyxxx · 2005-04-22 01:52 · Score: 3, Insightful

I thought the point of Open Source was to allow more people to read through the code. You mean thousands of people aren't really doing that for fun? I'm shocked.

More seriously... I think many of the people who DO eyeball the code are looking for security problems these days (where you do get recognition, etc.). For the record, I know I won't get any HR props for putting OS bugs that I've uncovered on my resume, but the security bugs I've found are always good conversation pieces.

Re:Open Source Means More Eyeballs? by Bostik · 2005-04-22 02:43 · Score: 4, Interesting
Actually, I'd say that giving proper credit and public recognition for bug reports is good enough for most of the end-users. Case in point (interestingly enough, from LKML and by Andrew Morton himself):
- User reports a nasty bug on LKML
- Devs request details and the user provides them
- User complies and is provided with a patch to test. First one does not work but second one does.
- User reports back that the second patch fixes the problem and apologises for not being able to assist better.
- Andrew Morton replies (and I'm quoting from memory so only the context will be correct): "You reported a problem, provided enough details to pinpoint it, tested patches to fix the problem and reported back that a certain patch indeed was the correct fix. What more could we possibly ask for?"
Getting an answer like that should lift anyone's spirits. Not only has the bug been fixed, it was also recorded for posterity that a certain user discovered it and helped to his ability in fixing it. And to top it all off, the reporter was given an honest praise and a thank you. The last part alone is usually enough for most users, to see that the developers actually care.

As for resumes? If you have a verifiable record of reporting back bugs and helping to test their fixes, you should be able to use that for your advantage in CV or at least in an interview. If nothing more, it shows that you can communicate with different kinds of people and have enough technical ability to follow through with their requests for further details. You might have even gotten a better product for yourself to use.
--
There is no such thing as good luck. There is only misfortune and its occasional absence.

Not happy with Bugzilla? by selectspec · 2005-04-22 01:54 · Score: 2, Interesting

Morton also criticised the Bugzilla tool used for tracking problems, saying that it encouraged one-to-one communication, a process which didn't help educate the wider community about potential problems.
"Bugzilla is fine for tracking bugs, but as it's currently set up, it's not very good for resolving bugs."

Hmm... I'd be interested to understand what alternatives to a web-based system he has in mind. Any thoughts?

"This process, where individuals communicate via a Web site, is very bad for the kernel overall."

--

Someone you trust is one of us.

lack of testing? by Cat_Byte · 2005-04-22 01:54 · Score: 4, Insightful

I thought the whole Fedora project WAS mass testing of "cutting edge technology for Linux". Have I been wasting my time submitting bugs? Most have been fixed that I submitted so far.

--
Two roads diverged in a wood, and I - I took the one the bus load of girls just went down.

Brief Answer: No. by ciroknight · 2005-04-22 01:54 · Score: 4, Informative

Long answer; kinda. You can use core dumps and system logs to interpret what's going on, but you can never really know for sure. Besides, the kind of errors that are in the kernel are the kinds of errors that really don't return error codes; they're the kind that crash the computer and make you reboot.

Microsoft's method is for some of the higher up software, and so is Apple's. If there's a bug in the kernel it's very unlikely that their code will catch it. Or at least that's been my experience.

If the problem is that Linux is so buggy, we just need to run it on a bunch more machines, and start randomly poking it as hard as we can until we break something. Once we've broken it, do it again to make sure it's not hardware, and then go to work fixing it. Good old brute force repairs.

--
"Victory means exit strategy, and it's important for the President to explain to us what the exit strategy is." G.W.Bush

Hi Linus! by Anonymous Coward · 2005-04-22 01:57 · Score: 2, Funny

I knew you visited Slashdot. Why not sign up for an account?

QA isn't sexy by ChaoticCoyote · 2005-04-22 02:03 · Score: 5, Informative

Morton is correct.

Even at commercial companies, QA isn't a "sexy" task. People would rather bang out code than write testing harnesses and run benchmarks.

Also, free software is driven by programmers, who tend to hate QA. Like any artist or craftsman, a programmer hates having their work critiqued. They spent hundreds (or thousands) of hours on a program, only to have someone nit-pick the details and point out the flaws. But for art, "quality" is a subjective quality -- and with software, quality and reliability are tangible quantities that can be measured.

My Acovea project demonstrated the problem. Users of GCC love Acovea; many developers of GCC, on the other hand, seem to treat it is an annoying distraction. Acovea identified more than a dozen errors (some quite serious) in the GCC 3.4/4.0 compilers -- and yes, I did report them to bugzilla. Only a couple of GCC's maintainers have said "thanks."

Not that the cool reception deters me. I have a new version of Acovea in the wings, and will be unleashing it on GCC 4.x Real Soon Now. ;)

As a consultant, I've been paid to perform QA work on commercial software packages -- but only one company, and a big one at that, has ever contracted me to QA a free software project.

Right now, free software is about many things, but quality is not job 1. And that needs to change.

--
All about me

Test Driven Development for OS? by Anonymous Coward · 2005-04-22 02:04 · Score: 5, Interesting

I have successfully used Test Driven Development in several of my projects and it is a uniquely satisfying experience. Writing test cases before writing the code then completing each test case one after another in steady progression gives a constant stream of small victories. It also means you can run all test cases at a later time and see that "yep, everything still works" or "doh! that change just broke 10 things I already had working."

There are several other benefits to writing tests first as well. The experts in the link above explain it all better than I could, I'm sure.

Many open source projects are taking this approach already and usually boast the number of unit tests along with the lines of code included in the distribution. Anyone can type in "build test" for example and it will show the program run and pass some odd thousand tests.

Is it time for the Kernel to embrace this methodology? I certainly think it is a genuine best practice. But is it applicable to OS development as well? I don't see any reason why it wouldn't be, but I am not a kernel developer myself.

crashme, etc... by pohl · 2005-04-22 02:04 · Score: 3, Interesting

I remember in the early days there was a program called 'crashme' that threw randomly-generated executables at the system, and it was credited bolstering stability. Do tests like this still hapen frequently by the unappreciated? Is there a good place online to read about these tests and their results for different point-releases? Along similar lines, I recall someone throwing random input at the various gnu utilities, and it was discovered that they were more robust against this sort of abuse than the commercial unix equivalents. Are there any other interesting tests that anybody knows about? Breaking stuff is fun.

--

The "cue the foo posts in 3, 2, 1..." posts will commence with no subsequent foo posts in 3, 2, 1...

already happening? by tverbeek · 2005-04-22 02:06 · Score: 2, Interesting

It may not be a Linux issue per se (more of a distro issue, I think), and it's purely anecodatal, but I've been seeing some QA problems lately in the mainstream distro I use. They include a bug that requires me to hand-edit the X11 config file to get my mouse to work, having to manually rebuild the routing table after every boot, and a so-far baffling total freeze of the system after rand() hours, only when it's serving web pages. I've been using Linux to do this job for six years, and never had these kinds of problems before.

--
http://alternatives.rzero.com/

We're insane? by DrXym · 2005-04-22 02:08 · Score: 5, Funny

When I heard that I nearly fell off my ostrich.

Re:We're insane? by Danuvius · 2005-04-22 02:19 · Score: 2, Funny

You have to lean forward against its neck while you ride.

Hug the ostrich as you go. Show it you care.

That way you *can't* fall off!!

--
Akarsz Magyar Gentoo fórumot? Akkor
Re:We're insane? by Jonathan · 2005-04-22 02:20 · Score: 2, Funny

No, you're a video game character

It's also frustrating to test a moving target. by Richard+Steiner · 2005-04-22 02:14 · Score: 2, Insightful

Linux is constantrly improving, but that means it is also constantly changing, and that makes it a constantly moving target.

That applies to most distros as well as the kernel itself.

It's hard to put a lot of effort into testing something when it's possible those tests will be invalidated a few months down the road...

--
Mainframe/UNIX Bit Twiddler and long time Windows/Linux Hobbyist.
The Theorem Theorem: If If, Then Then.

Want to help? by SenFo · 2005-04-22 02:17 · Score: 3, Interesting

If anybody reading this is interested in participating in the test procedure, check out the Linux Test Project.

--
My lame blog.

Uh, no. by bmajik · 2005-04-22 02:17 · Score: 5, Informative

Software testing (usually) isn't monkeys pounding on keyboards until the box BSOD's.

It is difficult to test software without adequately understanding what it is supposed to do. Varying the underlying machine type is almost irrelevant for binary distributed software unless you're testing an operating system kernel or looking for race conditions in software (which is really just a stab in the dark)

How are you going to have 3rd party people debug software they know nothing about?

Where users help find bugs is by using the software. It honestly takes a certain mentality to be an effective software breaker, and it's not very common. It takes something else entirely to be a software tester; you've got to be a good developer (because software testing is about automation these days unless you're insane) but you've got to not get sucked into the developers way of thinking.

I assure you - letting normal users play with software doesn't clean it up. we can show that this is true in the following way:

- more users use Microsoft software for more hours a day than any other software in the world
- slashdotters say Microsoft software is the buggest software made

clearly if users using software was sufficient to find all the bugs, MS stuff would be bug free, based on its frequency of use alone. I know this isn't the case, because im a software tester at Microsoft.

(The appropriate response is "well then, stop posting and get back to work; you're clearly not done yet!" :)

W.r.t. linux kernel testing: this is something that's always amazed me - linux works surprisingly well for something with so little formal testing. On the other hand, when there are edge case problems my experience has been that nobody is much interested in fixing them. One example i had was at a consulting gig. the client was looking to move his web hosting business onto linux boxes if he could get more sites per box then he could on windows. He had a problem where his linux server would start dying after a few days. I started to look into it and the box would basically panic() in low memory situations. I asked Alan Cox about it (via irc) and the response was "buy more memory". Nice.

Another sore point with me growing up was xserver crashes. The Xserver was 99% reliable, but then you'd get some random crash and lose everything you were doing, and you knew there was no real way of getting it fixed or investigating it.. you just had to hope it magically got better somehow.. maybe when you switched hardware or something.

Then there's the just plain lack of testing of some F/OSS projects in general. When i was in college i had NeXT, Sun, and SGI boxes in my dorm room (but no linux :). I remember dling the Gaim tarball (this was loooong ago) and seeing about getting it built on my SGI machine. IIRC, there were some makefile / #include problems getting it to even build, and once it was built there were some other issues with its runtime. Ultimately i submitted a patch to the gaim folks that more or less "enabled" gaim on IRIX. There is no way anybody had ever used Gaim on an SGI without making these fixes, so it seems reasonable to suggest the authors had never tried it before. This lack of a platform test matrix is pretty common amongst smaller F/OSS apps, even when they say "works on *nix" they mean "works on the distribution of linux i run at home".

Another baby patch i submitted was for the openBSD kernel.. this time for the wdc driver. Back when UDMA 100 was newish, i bought 2 UDMA 100 disks a month or so apart.. so they were different sizes and different vendors, but on the same bus. The UDMA rollback code in openBSD would drop the DMA level from 5 (UDMA100) to 2 (something much slower, i dont remember what) after a certain number of DMA errors. This obviously sucked since you can run UDMA devices at different speeds on the same bus, and you can also fallback to UDMA66 and UDMA33, both of which are better than mode 2.

--
My opinions are my own, and do not necessarily represent those of my employer.

"They get no thanks or credit or money... or anyth by Senor_Programmer · 2005-04-22 02:21 · Score: 3, Insightful

ing," he said.

Wait a minute here...

I thought the whole scheme was structured thusly...

I crank up the latest greatest kernel. I find a bug. I report it. My bug gets fixed. THAT's MY REWARD! The friggin bug gets squashed. What more could one ask for, with a clear conscience and a straight face.

As for those guys who fix the stuff. Well sanity is a relative term as we should all realize in light of the Japanese influence and emergence of cargo cults in WW-2 Niu Guinea. AFAIK, most Linux users view the kernel developers as some mysterious force from which benefit is derived through clever creation of effigy's.

We do exist by Necron69 · 2005-04-22 02:25 · Score: 4, Interesting

With all due respect to Andrew, Linux QA people do exist. After 11 years of being a sysadmin, I'm now entering my fifth month of being paid to test Linux releases. I'm having fun, learning a lot, and generally enjoying life.

BTW, we have not one, but two of my colleagues down under right now listening to Andrew in person. It should be interesting to get a first-hand account of what was said.

- Necron69

That's odd by radiophonic · 2005-04-22 02:32 · Score: 2, Insightful

I was under the impression that by using Linux, I was, in a sense, testing Linux.

--
Whenever you read this sig someone's refrigerator light turns on.

*sigh* by bmajik · 2005-04-22 02:34 · Score: 5, Interesting

that is NOT Microsoft's approach to testing.

Where did you hear or get the impression that that was the MS "approach" to QA ?

I've written test suites for the following Microsoft Products
- Visual Basic Compiler, 7.0
- Microsoft Business Framework 1.0 (unreleased)

None of them involved just using the compiler or the business framework over and over in day to day work to find bugs.

We have a variety of test approaches, including a few that _might_ be construed as what you describe - There are a few ways that we get test coverage via product usage

- stress
- bug bashes
- app weeks

Stress is funnier than it sounds. Did you know we're not allowed to ship windows until the exact build of windows under ship consideration has been running on hundreds (thousands, usually) of machines continuously with no problems while enlisted in a distributed "stress" client... where they're pounded and pounded with automated tests that do things like starve memory whilst performing other work, etc? Same with ASP.NET and the CLR - they have to _survive_ for a pre-determined time period before the build can be considered shippable. We dont think there are any show-stopper bugs at this point - but we just want to be reasonably sure. Note that if we find a bug (even an unrelated one, like the documentation has a typo) and take a fix for it, the stress cycle resets because the bits have changed. Better safe than sorry. In the end game of a product release it can literally be the case that taking a bug fix means delaying ship for another week or more.

- bug bashes
this is probably most like what you're describing. Everyone on the team sits down for a couple of days and really just beats on a specific area of the product. Security Bug Bashes have become popular int he last couple years (wonder why ;) These really dont happen that often during the product cycle, because ad-hoc testing doesn't catch that much stuff if you've got well developed automation suites. However, it's still very worthwhile because it is a good feedback mechanism to explain why your other testing missed something, and it's the best way to notice the odd "that's funny..." sort of issues that are not functinoally incorrect but are still user annoyance type issues.

- app week

For developer tool products (like Microsoft Business Framework) we like to do an app week with each milestone, where everyone on the team builds some sort of end to end application, using as much of the toolchain as possible. This sort of testing really makes the employees better (we're usually pretty compartmentalized on our areas of functionality ownership). It also lets unreleated parties take a look at peices of the product they don't own (so don't have preconceived notinos about). Finally, it lets us simulate the end-to-end customer experience on our product stack. If we can build the sort of apps a customer might build with our tools, then the tools are probably alright. Where we run into problems, we know the tools need help.

bug bashes and app weeeks happen perhaps 1-2 weeks per milestone (which is on the order of 2 months). It is a small part of our testing, time, effort, and results wise. It's still important to do, but it is not the _focus_ of QA at microsoft.

--
My opinions are my own, and do not necessarily represent those of my employer.

Re:*sigh* by Anonymous Coward · 2005-04-22 03:29 · Score: 5, Informative

I was a Microsoft developer for about 6 years, and this guy gets it exactly right.

Most of the really first-class groups at Microsoft (Windows, SQL Server, Developer Tools, lots more) have INCREDIBLY exacting test requirements, and extremely competent and thorough and demanding test teams. The open source community has done well, but it is nowhere near the professionalism and thoroughness of commercial software development. And it's precisely because the testers get *paid* to do the same damn test on every single build -- something open source people won't do, because there's no glory in it.

Slashdotters will no doubt respond, "Well, if it's so good, then what about all those security bugs!" Which is a fair criticism. Commercial software development (such as Microsoft's high testing standards, and similar at Sun, Apple, etc.) only works when the testing priorities you start with are the right ones. For a long time, Microsoft's priorities were 1) features, 2) usability, 3) more features, 4) stability, 5) security, 6) even more features.

This has changed. Microsoft mid-level managers (dev managers, product unit managers, etc.) have internalized the idea that they are literally under attack, and that security must be a high priority from here on. I wish they had STARTED with that as a priority, but at least they get the message now.

But, seriously, the parent poster is right on the money. Microsoft has AMAZING testers and test/developers. The hardware and software matrix that they run code under has to be seen to be appreciated.

And, again. This is not intended as a slight at all to open source development or testing. It's just *very* different.
Re:*sigh* by Anonymous Coward · 2005-04-22 04:49 · Score: 3, Insightful

Don't let the infantile MS bashers here bother you, the people that imply that microsoft has no testing are the same idiots who "hear" that MS products are insecure because NBC nightly news said so. If linux had the same user base as MS there would be 100,000 times as many patches, security holes etc. We may not all agree with MS's business practices, but its evident that they have a very serious testing regime, especially since products often get delayed for years because of minor things.

I thought the OPs article was very interesting. Comparing their testing process with their results is enlightening. I found his post really interesting.

You, on the other hand, are an idiot.

To the best of my knowledge the NBC Nightly News or any other mainstream press outlet have done nothing to help users understand the insecurity in MS products. If anything, their pet analysts have greatly downplayed any weaknesses in MSFTs software or business model leaving millions of uninformed users and investors with their dicks in the wind.

If Linux had the same user base as MS, there would exactly the same number of patches, security holes etc. More developers would slightly increase the number of patches and more users would increase the number of bug reports. The number of security holes is independant of both.

Microsoft does have a very serious testing regime: they need to to stay in business. They have never let fixing bugs interfere with their approach to competition, however. There is ample evidence that MS will ship a critically broken application or even introduce svere bugs rather than allow a competitor to gain market share. There is also abundant evidence that they will not bother to fix a product,no matter how broken, if there is no competitive advantage.

The fact that you still think that most of these "delays" aren't planned from day one suggests that you are watching to much TV and not reading enough (real) IT sites. When MS announced the original ship date for Longhorn, every reputable trade site said "MS claims that they are going to ship X, Y and Z in timeframe T. This is not possible." Do you think that these trade sites (having since been proved right) know that much more than the strategists at MS or do you think that there is another reason that MS might announce a ridiculously optimistic roadmap? How do you think the market would have reacted if, in 2003, MS had said "Longhorn will actually just be SP3 for XP and will ship sometime in 2006."

Re:Bugzilla by Mad+Merlin · 2005-04-22 02:38 · Score: 3, Insightful

If the average Linux user were educated on how to recognize a bug, and file a meaningful bug report it would mean a lot to developers, and likely speed up development and stability. ...and scare away 99.9% of potential new users.

--
Game! - Where the stick is mightier than the sword!

Somewhat alarmist headline by xixax · 2005-04-22 02:38 · Score: 3, Informative

From TFA:
"A lack of commitment to testing by the Linux community may ultimately threaten the stability..."

The content of the article is much better than the headlines and excerpts being quoted. I was there and felt that what he was geting at was that we need to start thinking about updating QA procedures. The ratio of bugs to features is decreasing, but the rate of features is (maybe?) growing that much faster. The point of his talk was to outline a number of options for improving QA, thre are issues, but the sky certinly isn't falling either. It was an excellent follow on from Tridge's keynote the previous day on how to do quality system programming (overshadowed by his very brief coverage of the BK thing).

Xix.

--
"Everything is adjustable, provided you have the right tools"

Kernel testing vs. app testing by jfengel · 2005-04-22 03:03 · Score: 2, Interesting

This article is about kernel development. While I appreciate the development being done to make the kernel faster/better/cheaper (well, it doesn't get any cheaper), it's already a Pretty Damn Good kernel. It sounds to me like the most crucial thing would be to solidify it and test the bejeezus out of it, then largely freeze it, because that's not where the problems are.

When people complain about MS Windows, they're not (usually) complaining about the kernel. They're talking about all of the stuff built on top of it: window manager, IE, networking, configuration. If the Linux kernel is receiving too little testing to be stable, what about the millions of lines of code that go into X windows, Gnome, CUPS (as mentioned the other day), etc.

If MS didn't have to make kernel changes to bettter support security, I suspect they wouldn't be touching it at all. BSODs are still more common than they should be, but most users find them extremely rare, and the kernel is Fast Enough relative to the work that needs to be done. The improvements in Longhorn are largely about changes above the kernel, especially in its spiffy interface.

While I'm grateful to Linus and all of the other developers for the kernel improvements, and while Open Source means never being told what to work on, kernel improvements other than stability are probably a terrible use of manpower. The kernel is a tiny fraction of the lines of code that go into a Linux distro. They are basic, and need to be rock-solid, but while performance improvements there benefit everybody, they don't benefit you at all if X, or KDE, or Konqueror, or any of the hundreds of other higher-level apps crash.

True... by jd · 2005-04-22 03:33 · Score: 3, Informative

However, there are aids. The Linux Test Project doesn't do much real testing, from what I hear, other than some basic standards stuff, but it should be simple enough to bolt on some real heavy-duty code testing routines.

Then, there's the mysterious Stanford Code Validator, used to great effect for a while. I feel certain that a few sweeps of that would uncover many of the more troublesome problems.

For those without SCV (99.9999% of the planet), there are some Open Source code validators out there. It should be possible, at the very least, to use those to identify the more blatant problems.

If you're not sure about using code validators, then it's simple enough to write programs that hammer some section of the kernel. For example, if you have some large number of threads mallocing, filling and freeing random-sized blocks of memory, can you demonstrate memory leaks? How well does the VMM handle fragmented memory? What is the average performance like, as a function of the number of threads?

Likewise, you can write disk-hammering tools, ethernet tests, etc. For the network code, for example, what is the typical latency added by the various optional layers? Those interested in network QoS would undoubtably find it valuable to know the penalties added by things like CBQ, WFQ, RED, etc. Those developing those parts of the code would likely find the numbers valuable, too.

If you don't want to write code, but have a spare machine that isn't doing anything, then throw on a copy of Linux and run Linpack or the HPC Challenge software. (Both are listed on Freshmeat.) The tests will give kernel developers at least some useful information to work with.

If you'd rather not spend the time, but want to do something, map a kernel. There's software for turning any source tree into a circular map, showing the connections within the program. If we had a good set of maps, showing graphically the differences between kernel versions (eg: 2.6.1 through to 2.6.12-pre3) and between kernel variants (eg: standard tree, the -ac version and the -mm version), it would be possible to get a feel for where problems are likely. (Bugs are most likely in knotty code, overly-complex code, etc. Latency is most likely in over-simplified code.) You don't have to do anything, beyond fetch the program, run it over the kernels, and post the images produced somewhere.

None of this is difficult. Those bits that are time-consuming are often mostly time-consuming for the computer - the individual usually doesn't need to put in that much effort. None of this will fix everything, but all of it will be usable in some way to narrow down where the problems really lie.

--
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)

It's Free Software! by Ulrich+Hobelmann · 2005-04-22 03:37 · Score: 2, Insightful

What are they expecting? It's based on voluntary work.

If anybody needs some guaranteed service, or commercial-grade testing, maybe they should hire some programmers to do it?

quoted out of context by mathgenius · 2005-04-22 09:48 · Score: 2, Informative

I was there, and the quote was taken _absolutely_ out of context: 'If you pick a good technology and the developers are insane, it's all going to come to tears.' He was not refering to BK in this instance; he was in fact talking more generally about SCM systems, and how he had noticed that these projects tended to attract "insane" developers (also the ide drivers do this too).
This was all part of a larger, very insightful remark, saying that had Linus chosen a free SCM tool three years ago, we would now have a fantastic SCM in the free software world. In this instance, it is not so much the _tool_ that would need to be good, but that the _team_ behind the tool needs to be solid, responsive etc.

Simon.

Slashdot Mirror

Lack of Testing Threatening the Stability of Linux

62 of 325 comments (clear)