Debugging Expert Wins ACM Dissertation Award

dissertation: by sxtxixtxcxh · 2006-03-21 15:37 · Score: 1, Funny

they're not bugs, they're features.

--
for a minute there, i lost myself...

blargh by MarkPNeyer · 2006-03-21 15:39 · Score: 1, Offtopic

'leveraging'

*tears out own hair and screams*

--

My blog

Re:blargh by Neo-Rio-101 · 2006-03-21 15:56 · Score: 3, Funny

*tears out own hair and screams*

Shouldn't that be "leveraging out own hair and screaming"?

--
READY.
PRINT ""+-0
Re:blargh by Anonymous Coward · 2006-03-21 16:23 · Score: 3, Informative

tr.v. leveraged, leveraging, leverages 1.
a. To provide (a company) with leverage.
b. To supplement (money, for example) with leverage.
2. To improve or enhance: "It makes more sense to be able to leverage what we [public radio stations] do in a more effective way to our listeners" Delano Lewis.

So listen and listen good all you academic paper writers: unless what you really mean by "leverage" is "improve", don't use it.

"Liblit's dissertation proposes a method for leveraging the key strength of user communities - their overwhelming numbers." WRONG. "improving the key strength of communities -their overwhelming ..." does not make sense. The overwhelming numbers are there they do not need improvement. What you may want to say is sthng like: "for using the key strengh of user communities as a leverage to blah blah blah"

Just because a word sounds good it does not mean it should be used as a wild card ...
Re:blargh by Tony-A · 2006-03-21 17:25 · Score: 3, Interesting

The root word is lever and the basic idea is that you use something under your control to effect control over what would normally be outside your control. Like a very long handle on a pipe wrench.

The money aspect you refer to has to do with debt financing whereby you manage to use your equity to finance something larger than your equity. I don't think the article is referring to corporate finance.

In a perfect world you would use a few people who would recognize and fix the bugs. These people would never talk to the users. They would have no need to and neither would gain from the experience.
In the world that I exist in, users are the ones who spot the bugs, specifically the circumstances under which the bugs exhibit themselves. I use my user's eyes to leverage {user's eyes, my skills}.

If all you mean is "improve", you would not use a word which essentially demands a discrepancy in the metrics between cause and effect.

b. To supplement (money, for example) with leverage.
If you add money to an account because of a margin call, does this increase or decrease your leverage? That is a horrible excuse for a definition.
Re:blargh by Anonymous Coward · 2006-03-21 17:40 · Score: 0

"The money aspect you refer to has to do with debt financing whereby you manage to use your equity to finance something larger than your equity. I don't think the article is referring to corporate finance."

Are you retarded? I just listed the dictionary definition which included the usage of leverage in finance. Of course finance has nothing to do with the paper smart ass!!!

As for the rest of your comment, you make no sense at all. Please read posts better before you reply.

Somebody mod that idiot down!

Sounds like Doc Watson by BadAnalogyGuy · 2006-03-21 15:41 · Score: 4, Insightful

No, not the wild-eyed madcap scientist from Back to the Future. Doctor Watson is an OS service present in Windows that monitors the running process list for terminal assertions. When a program hits an exception that it can't handle, it terminates immediately and Doctor Watson is on the scene to read the last gasps of the process before its bits get blasted. Microsoft even came up with a way to harness this to allow users to send real-time feedback to Microsoft HQ whenever a crash occurred in a program. No one I know ever sends that data back, but I'm sure someone must have once.

The current idea seems to be tracking the same termination events in the same way as Doctor Watson and sending the relevant data back to UWisc without informing the user. It sounds like a good idea, but I doubt it is in Liblit's power to fix Windows OS bugs.

Re:Sounds like Doc Watson by Bob54321 · 2006-03-21 15:56 · Score: 1

Microsoft even came up with a way to harness this to allow users to send real-time feedback to Microsoft HQ whenever a crash occurred in a program. No one I know ever sends that data back, but I'm sure someone must have once.

My Grandfather was having trouble running FlightGear under windows and sent one of those to Microsoft. Even got a response telling him it was a driver problem.

I send Firefox errors, just to give them some hope :)

--
:(){ :|:& };:
Re:Sounds like Doc Watson by Benoni · 2006-03-21 15:57 · Score: 5, Informative

sending the relevant data back to UWisc without informing the user.

Informed participation is a really big deal for me. No user should ever find themselves participating in the Cooperative Bug Isolation Project without their knowledge. Opt-in is explicit and revokable, and if the opt-in system runs into trouble of any kind, the fallback position is no data reporting at all.

The whole thing collapses if users don't trust me. So I've taken every measure I can think of to ensure that they can. Please see the relevant project page for more details about privacy matters.

It sounds like a good idea, but I doubt it is in Liblit's power to fix Windows OS bugs.

Working on it! Check back in with me in a few years ... maybe less. :-)
Re:Sounds like Doc Watson by Anonymous Coward · 2006-03-21 16:14 · Score: 5, Interesting

No one I know ever sends that data back, but I'm sure someone must have once.

Plenty of users do. There's a great blog posting by Raymond Chen called There's an awful lot of overclocking out there where he talks about investigating some of these "Watson" crashes.

The crashes were impossible - instructions like

xor eax, eax

Turns out unscrupulous vendors were selling overclocked computers without informing buyers. Pretty cool article.
Re:Sounds like Doc Watson by gzearfoss · 2006-03-21 16:34 · Score: 2, Interesting

My main issue with the good Doctor is that most of the time, when I had a program that crashed and invoked him, it wasn't a Microsoft product. Typically, it's because I was working on a programming assignment and it 'burped.' So unless Microsoft was willing to help me debug my homework, I didn't see much point in sending the data on to Redmond.

Not that I mind sending back data when it can be useful; if someone is going to look at the error logs, memory, etc., and try and make it so that it won't crash again, I'm all for it. I just pity the poor person who accidentally leaves a major bug in the code, and swamps the system with error reports.
Re:Sounds like Doc Watson by Animats · 2006-03-21 19:41 · Score: 2, Interesting

It goes back much further than that. See "The ALCOR Illinois 7090/7094 post mortem dump", a famous paper from 1967.
Automated dump analysis is an old idea in the mainframe world, but almost unknown outside it. The microprocessor world grew up with interactive debuggers and an early user-as-programmer assumption. This hasn't translated well to the modern software world.
In the mainframe world, there have even been mainframes that recorded the last 64 or so branches using dedicated hardware, so that after a crash, the control path could be recovered.
What does the Mozilla project do with the data from their "quality feedback agent", anyway?
Re:Sounds like Doc Watson by poot_rootbeer · 2006-03-22 04:41 · Score: 1

The crashes were impossible - instructions like
xor eax, eax

Watch out -- you may have just reverse-engineered Sony's latest DRM enforcement mechanism!
Re:Sounds like Doc Watson by kamochan · 2006-03-22 05:37 · Score: 1

The crashes were impossible - instructions like

xor eax, eax

How is that impossible? It's a single-byte, single-clock, u/v pipe op to zero eax... How do you (or the original author of the comment) do it? mov eax,0? *snicker* That'd certainly explain some of the Windows bloat...
Re:Sounds like Doc Watson by oaksong · 2006-03-22 08:35 · Score: 1

I always send my error records to microsoft, then either they fix it or they pass it on to the purveyor of the software. I've had a number of issues identified and remediated because of this feedback loop. There's no reason not to send the crash report to MS. It contains no data about your machine or it's contents. If MS passes the bug along, they don't identify the source. So where's the problem?
Re:Sounds like Doc Watson by Anonymous Coward · 2006-03-22 14:00 · Score: 0

What was impossible was that the program was crashing on that instruction. As you say, that shouldn't happen. However, with the CPU cranked up past its tolerances, it wasn't guaranteed that the contents of EAX would be XOR-able with itself to result in zero. Something to do with slow pipelines or something. Recommend reading the linked blog.
Re:Sounds like Doc Watson by Anonymous Coward · 2006-03-24 15:13 · Score: 0

You mean Doctor Emmett Brown from BTTF.
Re:Sounds like Doc Watson by arvindn · 2006-03-30 13:24 · Score: 1

The mad scientist from Back the Future was Dr. Emmett Brown.

Check out FindBugs for finding bugs in Java by licamell · 2006-03-21 15:43 · Score: 5, Informative

This reminded me of work going at at UMD (University of Maryland, College Park). I know it's not quite the same thing, but I feel as though this is a good place to mention it and the slashdot community would appreciate this software. FindBugs is a very cool tool for finding bugs in java code. And no, I am not affiliated with this project, I just saw a talk on it a couple months ago.

http://findbugs.sourceforge.net/

Re:Check out FindBugs for finding bugs in Java by shadow0_0 · 2006-03-21 18:01 · Score: 1

Hey, this is the second time I hear about the FindBugs project today! Thanks, I will have to check out the Eclipse plugin.
Re:Check out FindBugs for finding bugs in Java by cerberusss · 2006-03-21 19:40 · Score: 1

I tried FindBugs as well as PMD. Although the latter only examines sources per file, I found it much easier to use because it had so much less "false positives". Also, I found the PMD Eclipse plugin is better integrated. The author is also a slashdot user, by the way.

--
8 of 13 people found this answer helpful. Did you?
Re:Check out FindBugs for finding bugs in Java by tcopeland · 2006-03-22 03:46 · Score: 1

> The author is also a slashdot user, by the way.

Heh, you're right, and thanks for the mention! :-)

--
The Army reading list
Re:Check out FindBugs for finding bugs in Java by Doctor+Memory · 2006-03-22 05:32 · Score: 1

I really wanted to like PMD, but unfortunately it only finds problems in code that's there, not the code that isn't. That sounds kind of obvious, but consider this: you have a situation where you allocate some resource, and you need to free it within the same routine (a commonly-accepted Good Programming Practice). PMD can't tell you if you forget to make the call to free the resource. It would be nice if it had some way to specify what must be there, as well as what must not.

I guess I'm just grumpy because I really liked what PMD did, and I was crushed that I couldn't recommend it as a standard part of our code review kit.

--
Just junk food for thought...
Re:Check out FindBugs for finding bugs in Java by shadow0_0 · 2006-03-22 12:01 · Score: 1

I know I am being lazy :) but can you install and run both of them as Eclipse plugins? That way maybe you will screen out the false positives?
Re:Check out FindBugs for finding bugs in Java by cerberusss · 2006-03-22 19:25 · Score: 1

Yeah, you can install both. However, they work somewhat different. FindBugs works on the whole codebase. You can run it as a plugin, but it doesn't really integrate. It just starts up in a different window. PMD works on the file level and sits in the context menu of the Java source editor.
Since they have a different field of analysis (directory vs. file), you get completely different findings...

--
8 of 13 people found this answer helpful. Did you?
Re:Check out FindBugs for finding bugs in Java by shadow0_0 · 2006-03-23 14:01 · Score: 1

Thanks :) I actually went and installed both after I posted.
Now I just have to push it to the other people in the company.
Re:Check out FindBugs for finding bugs in Java by cerberusss · 2006-03-23 18:18 · Score: 1

Now I just have to push it to the other people in the company.
You can try, but you'll succeed anyway using it yourself. I used it on a piece of the codebase and in the next teammeeting, said I'd "found some potential bugs with this new tool". People will get interested. That's actually how I found out about FindBugs, because of a colleague who was impressed with me using PMD.

--
8 of 13 people found this answer helpful. Did you?

Standup Fight? by drewzhrodague · 2006-03-21 15:48 · Score: 1

"Sir, is this a stand-up fight, or another bug hunt?"

Seriously, congradulations, Ben!

--
Zhrodague.net - I do projects and stuff too.

Thank you, open source community by Benoni · 2006-03-21 15:49 · Score: 5, Informative

This research has been a wonderful collaborative effort, and many people deserve to share the credit. To quote from part of the Acknowledgements section of my dissertation:

I am indebted to the many members of the open source community who have supported our work. My thanks go out to the many anonymous users of our public deployment, and to the developers of the open source projects used in our public deployment and case studies.

So thanks, Slashdot, for helping me find those users (or helping them find me). The exposure was invaluable. And thanks, open source community, for your participation. I've benefitted greatly from standing on your massed shoulders. This could not have happened without you.

Re:Thank you, open source community by loconet · 2006-03-21 16:27 · Score: 1

Thank you for loving what you do (or so it seems from what I've read).

--
[alk]
Re:Thank you, open source community by SEWilco · 2006-03-21 19:49 · Score: 2, Funny

You're welcome.
We will now debug your dissertation.

Now at the University of Wisconsin-Madison by DrDitto · 2006-03-21 15:50 · Score: 2, Informative

Ben Liblit is now an assistant professor at the University of Wisconsin-Madison. He joins a fantastic Computer Science department. Good luck Ben!

Re: by kindyroot · 2006-03-21 15:51 · Score: 1

i think if people knew there were any probability that their bug reports would not be taken into consideration, maybe they wouldn't post them at all!!

Closed versus open source comparison by Anonymous Coward · 2006-03-21 15:52 · Score: 1, Interesting

It would have been interesting to know the difference in number and type of bugs between a closed source product and an equivalent open source product... let's say the MS Office suite versus OpenOffice.org suite.

Re:Closed versus open source comparison by ROBOKATZ · 2006-03-21 16:36 · Score: 1

Well that will never happen because this system requires the source code. Did you even look at the page?
Re:Closed versus open source comparison by Anonymous Coward · 2006-03-21 16:40 · Score: 0

Yes.

Heh... by the_skywise · 2006-03-21 15:55 · Score: 4, Funny

So somebody went and formalized the theory of "the users are the beta testers"...

Re:Heh... by Benoni · 2006-03-21 16:00 · Score: 5, Informative

Yes, exactly. The users are beta testers; we may as well admit it. I want to make them better beta testers. :-)
Re:Heh... by CodeBuster · 2006-03-21 16:40 · Score: 1

It has been my experience in software development that sharp users can give valuable feedback in the areas of usability and expected behaviors, but this is no substitute for trapping errors and logging failures on the user's machine when it comes to tracking down less visible faults.
Re:Heh... by calyxa · 2006-03-21 18:18 · Score: 1

having wrangled beta testers, I know that I totally spoiled the engineers at my last/psuedo-current gig by writing coherent and complete bug reports... sigh.

(hi, Ben! ;)

--
Decay! Decay! Decay! -Helium
Re:Heh... by Benoni · 2006-03-21 19:00 · Score: 1

(Hi Calyxa!)
Re:Heh... by Anonymous Coward · 2006-03-22 04:05 · Score: 0

So somebody went and formalized the theory of "the users are the beta testers"...

No, someone formalized the theory of the users are the testers. They always were. Using software is testing it.

Thank you, BenGay. by Anonymous Coward · 2006-03-21 16:02 · Score: 0

"I've benefitted greatly from standing on your massed shoulders."

You're welcome. Now could you get down? I'm getting a cramp.

Request for more information by BadAnalogyGuy · 2006-03-21 16:07 · Score: 3, Interesting

The installation of CBI is implicit consent to such monitoring, of course, and I didn't mean to imply that there was no consent involved at all.

However, asking us to read 170-odd pages of your dissertation is a little much. Would it be possible to describe the data collection system, how reports are generated and if the reports are sent automatically or as in the case of Dr. Watson sent with user approval. Also, what types of bugs you found using your statistical methods, as well as what types of bugs you think would be difficult to find using such methods.

A quick comparison to related mainstream debugging techniques would be useful to give us out here in the trenches a firmer grip on the techniques you describe.

And finally, if you wouldn't mind, could you describe a real-world scenario where a generalized product (codename: CBIMax) would be marketable. If such a general product is impossible, is it because each product is different and the methods you describe would need to be revised each time? What is the maximum level of abstraction of these techniques from specific scenarios that is achievable yet still retaining enough so as to not require largescale retooling for each project?

Thanks!

Re:Request for more information by Benoni · 2006-03-21 16:17 · Score: 5, Informative

However, asking us to read 170-odd pages of your dissertation is a little much.

Hey, it's a real page-turner. Well, it has pages and they turn, at least.

The other questions you ask are all good ones, but a bit much to address in a Slashdot comment. Please see the project home page for more information. The "Learn More" page may answer some of your questions, and there are additional drill-down pages from there with even more technical material on selected topics.

Please understand that I don't mean to brush off your insightful questions. They are just questions for which satisfactory answers are hard to give in a sentence or two.
Re:Request for more information by Anonymous Coward · 2006-03-21 16:25 · Score: 0

Then perhaps you could expand on why you'd be able to fix Windows bugs in the future (maybe nearer). Got an interview loop with MSR or WinCore?
Re:Request for more information by jschrod · 2006-03-21 22:02 · Score: 1

asking us to read 170-odd pages of your dissertation is a little much.
Why? If you're really interested in that area, that's not much material and it's important research. I know that I'm going to read it just because it's interesting.
But if you're not deeply interested, you will be able to pick the most interesting bits by looking at the table of contents, won't you? Or is that too much effort, too? Besides, the project has a Web site that is even referenced in the /. blurb -- did you even bother to look there for an answer?
Please note: I'm not connected to the CBI project, don't use the software, and don't know Ben Liblit.

--
Joachim

People don't write Manifestos any more -- what's going on in this world? [Frank Zappa]

Heh...Beta Drugs. by Anonymous Coward · 2006-03-21 16:14 · Score: 0

"Yes, exactly. The users are beta testers; we may as well admit it. I want to make them better beta testers. :-)"

Recreational drugs help.

Almost like infinite monkeys writing Shakespeare by gzearfoss · 2006-03-21 16:24 · Score: 2, Interesting

I know that in one particular http://www.kingdomofloathing.com/game, they tend to follow this approach. Once a new feature is created, and debugged enough so that it's stable and doesn't break anything, the feature is released to the general populace. After all, once all of the important bugs are found, a thousand users will find the minor bugs through general usage faster than a small dedicated team of testers. Also, the time the testers save by not having to verify every single minor detail can be used to work on new material.
Add into the equation that without some elaborate software (such as Mercury LoadRunner, or an open-source equivalent), it's hard to simulate the effect the entire population will have when they start hammering on the server. It can also help track down extremely low-occurance bugs, because with enough people working on it, those one-in-a-million cases will eventually come up.

Kinda reminds me of infinite monkeys eventually producing the works of Shakespeare.

Not really by ROBOKATZ · 2006-03-21 16:32 · Score: 1

If you even just skim the page, you'll see that only the fact that it sends a feedback report is similar. Most of the project consists of 'sparse' weighted sampling of instrumented code, research into what are useful metrics to use to instrument the code, and how to correlate the collected data with patterns to auomate finding bugs.

HI, my name's Dave by threedognit3 · 2006-03-21 16:36 · Score: 0, Flamebait

I have a doctorial thesis up for review and evaluation. Thank you for inviting me here to discuss my thesis. To be honest... This thesis is a composite of what has been offered to me by Microsoft, Oracle, Novell, Oracle, Microsoft, SAP, OpenSource and companies who wish not to be named..cough SUN...cough..SUN. Uh..what? Questions?...please see the index and feel free to look at the appendix. Thank you for the compliments...is that a Oracle teamwear shirt you're wearing?

Re:HI, my name's Dave by Anonymous Coward · 2006-03-21 16:41 · Score: 0

What the hell are you talking about?
Re:HI, my name's Dave by Anonymous Coward · 2006-03-21 19:08 · Score: 0

i think the fact this summary is more of a "my 'friend' thinks 'his' 3 year old dissertation is still doing good stuff and that 'he' is one bad ass mutha!"
Re:HI, my name's Dave by Anonymous Coward · 2006-03-22 03:13 · Score: 0

Ah yes, I see.

Drinking Club by joefish_only_1 · 2006-03-21 16:42 · Score: 1

You mean the ACM is more than just a drinking club? The way they advertised themselves at my university, one wouldn't have thought so...

Re:Drinking Club by ROBOKATZ · 2006-03-21 16:47 · Score: 1

ACM itself is a real professional organization; however most of the student chapters are primarily drinking clubs. I had to disassociate myself after the new officers started running a porn business out of the office.

Alternative (complementary?) approach by Anthony+Boyd · 2006-03-21 16:58 · Score: 1

I think that automated unit testing is the future of killing bugs. In layman's terms, this involves a program trawling your code and automatically trying to break it. If done well, the system can replace some of your QA team, and QA goes a lot faster. I hadn't even heard of such a thing until I did some contract work for Agitar, one of the companies doing this stuff. Here's a link with an overview & screenshots:

http://www.agitar.com/products/20051101-agitator.h tml

They only work with Java. Here is a link to a page where they ran some Open Source products through their tool & published the results:

http://www.agitar.com/openquality/

But Agitar's product isn't Open Source itself. :(

-Tony

--
My Greasemonkey scripts for Digg &

Re:Alternative (complementary?) approach by DaveAtFraud · 2006-03-21 18:21 · Score: 1

Unit testing tends to only confirm that the program under test works as designed. It does not catch design errors or requirements errors. To catch these, you need to design a test that confirms that the system works as its supposed to work. Where "as its supposed to work" is some arbitrary, external criteria. You should also note that I said "system" and not "program" where system is some larger assembly of components including the hardware and user environment.

You do better if you actually force the developers to create and run high quality unit tests so they fix the code before it gets into integration and becomes subject to automated testing. Peer and QA review of such unit tests tends to increase the quality of the software by ensuring that the unit tests actually test the functionality of the code against the requirements, are complete, coverage is complete, off nominal and boundary conditions are exercised, etc. Merely automatically running (and rerunning) poorly designed unit tests only turns the CPU into an inefficient space heater.

Of course this all presupposes testable functional requirements or user stories and corresponding test cases. Then you at best end up with a system that works as specified and you can turn it over to the users who will still manage to break it and/or complain that it doesn't do what they want it to do.

--
They that can give up essential liberty to obtain a little temporary safety deserve neither safety nor liberty.
Ben

Please stop saying 386 by ArcherB · 2006-03-21 17:11 · Score: 0, Offtopic

From the Centos about page:
CentOS-4 supports x86 (i586 and i686),

In other words, it won't run in a 386, I wouldn't want it if it was compiled so low as to be optimized for a 386. Please start using x86 something other than 386.

(Sorry, it just irks me)

--
There is no "I disagree" mod for a reason. Flamebait, Troll, and Overrated are not substitutes.

Dammit, wrong article by ArcherB · 2006-03-21 17:20 · Score: 1

I hate it when that happens.

--
There is no "I disagree" mod for a reason. Flamebait, Troll, and Overrated are not substitutes.

Speaking of debugging through sampling... by Anonymous Coward · 2006-03-21 17:23 · Score: 0

Interesting. I thought I recognized the name Libit.
He's one of the authors of a NIPS paper from three years back: Statistical Debugging of Sampled Programs

Re:Speaking of debugging through sampling... by Benoni · 2006-03-21 20:45 · Score: 1

Liblit. There's an off-by-one-error in your count of L's. (It doesn't matter what they say about you, as long as they spell your name right.)

Yeah, that's me along with a truly fantastic team of collaborators. And there's more where that came from.

reminds me of my rules on reporting list outages by SuperBanana · 2006-03-21 20:40 · Score: 2, Funny

This reminds me of a method on reporting mailing list outages I devised back in 2001 or so.

I told people we were switching to new software (Mailman)- and that if they got an error message or similar, to flip a quarter X times (I forget how many) and ONLY email me if they got all heads. I didn't want to get a couple dozen reports of the same problem, and I figured that if there were any problems, they'd affect a large set of the 1000+ users of the list.

It worked brilliantly.

--
Please help metamoderate.

Doctor Brown, you mean. by Grendel+Drago · 2006-03-21 21:11 · Score: 1

The mad scientist from "Back to the Future" was named Emmett Brown, not Doctor Watson.

--
Laws do not persuade just because they threaten. --Seneca

FP?! by Anonymous Coward · 2006-03-21 21:21 · Score: 0

this FP

You keep using that word. I do not think it means what you think it means.

Has anyone ever failed an FP so badly as you have? How embarrassing for you.

Guardian doesn't think so by Anonymous Coward · 2006-03-21 21:43 · Score: 1, Interesting

'Liblit's dissertation proposes a method for leveraging the key strength of user communities - their overwhelming numbers.`

'Of all the myths that have grown up around open source software, perhaps the most pervasive is Eric Raymond's aphorism that "Many eyes make bugs shallow",`
- Andrew Brown Dec 08 2005

Shaka, when the walls fell by BadAnalogyGuy · 2006-03-21 22:16 · Score: 1

Well, we've got Ben Liblit RIGHT HERE! Right here in the thread! Replying to my question!

Doesn't it make sense to ask him while he's here to discuss the topic in a simplified manner since he's the world's leading expert on the topic? This is a discussion forum, so being able to hear him express the concepts allows us to participate with him in a two way transfer of ideas.

The alternative is to write everything down and simply refer to documents instead of engaging people in conversation.

Re:Shaka, when the walls fell by jschrod · 2006-03-21 22:33 · Score: 1

I don't think that a request to reiterate the main points of his research is a good start for a sensible discussion here. (Not that I expect many sensible discussions on /.) Especially not if some of these questions are answered on the homepage and the `About this project' page. Your telling that the data is sent without user consent -- when the first paragraph on the home page tells that data is sent back -- and then slowly backpedalling is not a good start for a /. discussion either.
If you would have concentrated on one issue (e.g., the relation to Dr.Watson and/or Mozilla's Talkback) you would have had a point, IMHO. Otherwise, it was simply an overbroad request that looked as if you're too lazy to inform yourself before posting.

--
Joachim

People don't write Manifestos any more -- what's going on in this world? [Frank Zappa]
Re:Shaka, when the walls fell by BadAnalogyGuy · 2006-03-21 22:43 · Score: 1

I see. I must inform myself by reading his dissertation so that when I come back in several hours with questions he will be able to field them and we can all have a rousing conversation.

I'll keep that in mind the next time I have an interest in something and have the opportunity to have direct access to an expert and have essentially carte blanche to ask anything. Read up on it first, then hope that the expert hangs around while I'm busy informing myself.

As for the thrust of my main post, it was not about spying, but rather about the similarities of such a remote bug collection system (CBI) with Dr. Watson. The act of installing the software gives implicit permission to retrieve the software errors. Nowhere in the article or on "About this project" page is the exact mechanism for sending this data given. I assume it is somewhere in those 170 pages of dissertation. What's the harm in asking the expert how it works? He's already posting on Slashdot, it's not like his time is very important to him.

Where are the results? by dozer · 2006-03-22 00:40 · Score: 1

I would like to see the data that they're collecting but I can't find it anywhere on their site. Am I just missing it?

I learned my lesson with the cddb disaster: don't submit your own data unless you can mirror it yourself. Otherwise, if someone gets bored or greedy, everybody's hard work gets lost forever. Or, worse, it gets subverted to make profit for Gracenote.

Mirroring is easy enough even if you have almost no bandwidth: use bittorrent. So, where are the submitted results?

Computer Science is dying... by Anonymous Coward · 2006-03-22 02:53 · Score: 0

This just goes to prove how stupid academic computer science has gotten these days. This particular research is cheesy observational crap not even worthy of a degree at ITT Tech.

Sorry, dude, but you're a chode.

pfft..logical conclusion from Adams' 1984 paper by Anonymous Coward · 2006-03-22 03:29 · Score: 0

E. N. Adams, "Optimizing Preventive Service of Software
Products," ZBM Journal of Research and Development
28, No. 1, 2-14 (1984).

Real world bugs are not about a language by roman_mir · 2006-03-22 03:56 · Score: 1

Example: yesterday I had to solve a problem in the application I am developing for Bell that was absolutely independent of the programming platform.

The QA reports that there is an error on the screen while they are trying to save some data. The error is intermittent, it happens for some data sets but not for others. Investigation shows the following:

1. The data records that need to be saved must be first compared to existing data records for the same primary keys, if there are records, then the data is updated, otherwise it is inserted. Each normal record also can have one satellite record. These records share some of the primary key but are also different by one primary key component.

2. When the records are retrieved they are joined with data from another table. The application expects that all records from the other table that are related to the records from the first table have the same value in a particular field. However it happens that another application was used on the second table, which introduced data inconsistency between the records. Thus a work-around was introduced by the first application. The workaround consisted of selecting a Maximum value from the second table and joining that value to the records from the first table. This worked fine, until another application introduced NULL values into the second table. This caused the selection procedure used by my application to not return existing records when trying to save data. Since no records were returned on the select, my application assumed that the data was new and needed to be inserted rather than updated. Thus the application inserted multiple records where only one record had to exist.

3. An update procedure relied on having either 0, 1 or a maximum of 2 records per primary key. If it is 0 records, the data must be inserted as a new record. If it is 1 or 2 records, than one of the existing records is always a primary record, and the second one is the satellite. However, more than 2 records were found by this procedure because of the select/insert problem identified above. This created an error, because the code actually has a hard limiter built in: if there are more than 2 records returned by the database, an Illegal State Exception is raised.

--

What I described here is a software bug that caused another problem that caused another problem. But the bug in the first place was not expected, because it was assumed that other applications would not modify data in the secondary tables in an incosistent manner.

So, here is the actual chain of events: one application modifies some of the data data in the database in an way that is unexpected by the application that is being developed, this causes an incorrect interpretation of results in one of the operation in the new application, which does not actualy raise an exception, but it causes another independent procedure to cause an error.

The error is intermittent because only some of the data is modified in an incorrect manner. The procedures that cause the error to happen are asynchronous and are not necessarily executed in any particular order but rather depend on the user intervention to be executed.
--

The way that I found the problem was by eliminating all impossible logical branches. I was left with an improbabl logical branch: a SELECT statement returning an empty list, when in fact there is data in the database. Since all other logical paths were impossible, I had to assume that the improbable happened, investigated into it and found specific cases when SELECT statement did not return any records. From there I had to figure out what caused the incosistent data. The end solution was to change the SELECT procedure to return records even when joined with an empty set from a different table (the unexpected situation.)

--

Certainly if I had total control of the Data Model, this wouldn't have happened, because the data inconsistency would be impossible in the first place, but in the real world you don't always get to chose what you are working with. I was given an existing data model, and had to adopt the application to it.

Now, is it really of any relevance that the application is written in J2EE/BEA/Oracle? No, it is not.

--
You can't handle the truth.

Scumballs by Anonymous Coward · 2006-03-22 10:04 · Score: 0

ACM are a bunch of scumballs who should not be allowed to operate.
They are no better than spammers.

Re:Heh...users are the Beta by oaksong · 2006-03-22 10:30 · Score: 1

Ahh...the many years I've spent trying to make managment understand.... And why Open Source makes so much sense, at least from a got broke/get fixed perspective. Still haven't figured out the financial incentives. :)

Minor Rants by cant_get_a_good_nick · 2006-03-22 10:41 · Score: 1

RANT #1
I always think it's weird, something about the art and craft of getting better code out gets little notice, but some FireFox alpha (which is feature INcomplete, really only for extension developers) gets 200,000 messages. Mabe a third of them will be flames "(IE/FireFox/Whatever) is so buggy, you suck unless you switch to (whatever)". But tools to debug these get ignored. How much work is going into KDE vs GNOME, and even a group that wants to fork KDE (deity() help us).

RANT #2
I really think we're reaching some of the limits of the current programming models. Think of how many states there are in a 1GB machine? 2^(2^30) is
a lot of states. This is one of the best things about Java, trying to restrict the number of states, pointers massive complicate the state diagram, allow stray code from a totally unrelated segment fuck up yours if there is a bug. But we have the same 1 segment architecture that back with monolithic apps fitting into 10Kb or so. The security threats are radically different (stand alone machines to always on TCP/IP connctions) but we haven't isolated code much better, W^X just now making it's way into modern UNIXen. Why the hell do you have execution state (return address on the stack) next to data that can be overwritten? Maybe back in the old days of register poor architectures, but we shouldn't have that now. But there's so much code out there, can't rewrite it all.

If i link a library in, i should have a defined set of things that it can do. It should have it's own sandbox, and only do the things it specifies. If it does anything different, terminate the app and not let it do something crazy. I'm sure modern MMUs could be programmed to subsegment like this, but we don't.

OK, rant over.

Re:Heh...users are the Beta by Benoni · 2006-03-22 12:27 · Score: 1

Still haven't figured out the financial incentives. :)

If you mean incentives from the users' perspective, I like to pitch it this way. My statistical methods naturally tend to "learn" the most, most quickly, about the failures that happen most often. So the more a user participates, the more the developers' attention will be swayed to the bugs that user cares about. Thus, users can help steer bug triage.

Open source bug trackers can work the same way. When you report bugs in some project's Bugzilla system, you're helping that project see what they need to work on. At the same time, you're being selfish by drawing the developers' attention to your issues. It's a sort of enlightned self-interest.

More than a Drinking Club?? Nope. by Anonymous Coward · 2006-03-23 21:01 · Score: 0

Most recent CS papers I've seen seem to take every measure possible to avoid any math beyond 8th grade...

The fact that this guy actually used things like Statistics and Probability in a CS problem is probably what got him the dissertation in the first place. When some people in ACM couldn't understand a thing, they decided to give him an award.

Do you think this is flamebait?

If so, sit in on a 3D Graphics Course when the frequency domain and "Fourier Transform" are mentioned. I have personally witnessed a class of ~25 CS grad students [at a top 10 university in the USA] fiercely complain that taking a simple integral (to find the 1D fourier transform of a signal) was too hard for a test question. (And average grade for the class was about 20 points lower than previous classes, mainly due to that question).

For those not familar with this, computing a 1D fourier-transform requires little more than the level of a high-school calculus course [and yes, they do teach calculus in high schools!]

[I'm posting AC because many mods are probably CS people]

Re:Almost like infinite monkeys writing Shakespear by Anonymous Coward · 2006-03-29 14:04 · Score: 0

Kinda reminds me of infinite monkeys eventually producing the works of Shakespeare.

Or the kind of comments you get on Slashdot, which is very similar to an infinite number of monkeys typing on the internet, except the monkeys are VERY EGOTISTICAL and tend to engage in mindless FLAME WARS and eventually call one another NAZIs. Just like they'll do to this comment.

Slashdot Mirror

Debugging Expert Wins ACM Dissertation Award

83 comments