Slashdot Mirror


A Rubric for IT Analysis

Aredridel writes "Zed A. Shaw has an insightful article on how analyses of software systems should be performed, and how they're often done wrong. It should be required reading for all IT journalists, and all readers of IT journals."

15 of 86 comments (clear)

  1. Hmmmm.... by extagboy · · Score: 4, Funny

    Even worse it works about as well as pricing soap at $1.95 instead of $2.00 to fool people into thinking it's cheaper.

    I think $1.95 is cheaper, isn't it?

    Better run it through the rubric...

    8. Paper does not use the above terms correctly or calculates them incorrectly. Without the data you won't know the second part, but these 6 statistical concepts are very simple to calculate and get right.

    I think it's broken.

  2. How to Lie with statistics by alanw · · Score: 4, Insightful

    When you have read that article, go and buy a copy of the 1954 classic How to Lie with Statistics by Darrell Huff, ISBN 0393310728.

  3. Lying with statistics by YU+Nicks+NE+Way · · Score: 3, Interesting

    The author of the rubric "carefully" lists examples of things that ought to be seen -- and then carefully extracts two graphs from a long analysis in order to "prove" his claim. Never mind that the things he argues one should look for would be embedded in the materials and metods or results section, not the conclusion or the paper summary. Never mind, either, that his objections are bogus (red versus black ink? Uh, wait -- if the winning system had been shown in red, it would have conveyed how burningly fast the system was.)

    Oh, wait -- it's somewhich which shows that samba 3.0 is slower than w2k3. Never mind. This is slashdot, so the ditors have gotta troll for ad views.

    1. Re:Lying with statistics by Angostura · · Score: 2, Interesting

      Agreed, I'm sorry, the 'red lines are bad' claim is lame. If the Windows line was in blue he would probably have claimed that it "is intended to remind you of the blue screen of death".

      Moreover, he accuses the example graph makers of bad practice by re-scaling the x-axis, without rescaling the y-axis "to compensate".

      Excuse me? As far as I can see the x axis was scaled in order to display the data in the most room available, not to deceive in some way. The y-axes were left alone because the data range depicted were identical.

      A shame. There is some good stuff in the article, but it suffers by exaggerating its case in places.

    2. Re:Lying with statistics by Otter · · Score: 2, Insightful
      1) Yes, the grandparent got the x- and y-axis confused in his post.

      2) The point of the graphs is that the Windows server has a roughly 75% performance advantage over Samba on both systems. The different y-axes are used because one system is twice as fast as the other and using the same scale on both graphs would leave half of one graph empty. I would say the choice of scales is entirely correct.

      3) The x-axis is labelled in numbers, not intervals. Excel graphs place tickmarks between the labels. You can complain about them, but the author didn't place them there. In any case, Samba doesn't look any better no matter where you put them.

      4) Sorry, the whining about the red is just weak. In any case, this is another case of an Excel default being used, not a malevolent anti-Lunix conspiracy...

  4. Re:MetaRubricry by concept10 · · Score: 2, Interesting

    Thats my exact question. When will someone make software that will analyze TFA and tell me if it is worth reading? Think about how much bandwidth could be saved with this app? Sort of like a stumble upon for news.

  5. Red / Green (Bad graph examples) by mister_llah · · Score: 4, Insightful

    The usage of red and green determines the meaning, if the higher statistic was red, it wouldn't be the "bad" effect he is stating.

    The statement that green is good, red is bad, is not really true. Red is an attention getter, Green is an easy, inobtrusive color (relaxing, generally).

    While it is easy enough to make the leap that 'red' is bad because red is often an 'alert' color, the reason red is an alert color is because it is an attention getter, not because it means bad.

    Why else do you think so many people drive red sports cars? If red was bad, why wouldn't they drive green ones? ... and the graphs aren't necessarily misleading in the aspect of spacing, the graph seems to be trying to show the ratio of difference, not the difference amount. ... aside from what looks like a bad example of bad examples... there are some good points in the article...

    --
    MoM++ - A Classic Expanded - [Master of Magic 1.5]
    http://mompp.sourceforge.net/
  6. Good article by bobbis.u · · Score: 4, Interesting
    Perhaps the author of the Openoffice.org vs MS Office comparison should have read it first.

    I hate it when people lie with statistics. Even the BBC did it recently when they were trying to justify 1 million GBP on their new weather program. They said 7/10 people either liked the new system the same as the old one or preferred the new one. Perhaps they could also have said 9/10 liked the new system the same as the old one or preferred the old one? Who knows when you lump categories together like that without providing the raw data?

  7. somewhat obvious? by moz25 · · Score: 2, Interesting

    What he's stating seems rather obvious, but then again I might not be his target audience. One thing he seems to be missing is: who is paying for the test and is the one in whose favour the test turns out to be also the one who paid for it?

  8. Analysis of software systems? by Orion+Blastar · · Score: 5, Informative

    You have to be kidding me. The last three jobs I had, I got dinged if I did analysis of any sort. Most software developers skipped the analysis and design part, because Managers wanted them to start coding on the first day and not stop until it was ready for QA to look it over. I called it "Seat of your pants" programming. Often I had to fix problems in other developers' programs and they did not have proper documentation, source code comments, naming conventions, flow charts, or any sort of documentation to help me figure it out at all.

    Requirements kept comming in, and they changed daily. Often what I started writing at 8am, was useless by 4:45PM when the requirements changed on-the-fly and adhoc and required me to program something else to replace it before I went home for the night. While I could have waited until the requirements were locked in, there was no such thing as that, any idea anyone had was instantly accepted by a manager and given to me to put into the program. Combo boxes became Listvues, then combo boxes again, then a text box, and then a Listvue again, and then a combo box. Database names for tables and columns were always changed, and of the thousands of SQL Queries in my programs that accessed them, they needed to be changed as well.

    Management didn't think anything of it, and kept their "We cannot say no to anyone, no matter how insane the request" attutude.

    Analysis, hooo haaaa! Yeah I wish! Corporate America apparently does not believe in it anymore.

    --
    Remember, Slashdot does not have a -1 disagree moderation, and no, troll, flamebait, and overrated are not substitutes.
  9. INSIGHTFUL?!?! by imsabbel · · Score: 2, Insightful

    "Also look at the axes and their layout. The first graph has the y-axis (left side) going in 50 increments, and the second graph has the y-axis going in 100 increments. This distorts the graphs to make it look like they are the same results, but actually they look very different when graphed properly. What's worse is that the x-axis for both graphs is the same which means they are changing one scale (y-axis) without adjusting the other scale (x-axis). This creates a distorted graph."


    Well, no idiot. When graphed properly, they look the same. Both tests show an absolutely compareable performance ratio. What does it matter that the faster machine runs both OSses faster? How does this skew anything? Is the concept of relative speed increases a new concept for the creator of the article?

    A REAL loaded graph would surpress the y-axis or something to push the lower graph further down, or to skew the proportions.

    Man, is today really shit article day on slashdot?
    --
    HI O WISE PRINCE. WHT TOOK U SO DAM LONG?
  10. A rubric is a bit of red text by Colin+Smith · · Score: 4, Informative

    No, really. That's how it started, usually the title of a section, paragraph or similar.

    Obviously the bit of red text contained something someone thought was important so eventually the word came to mean an important rule or important passage. These days it means an important set of rules.

    http://www.dictionary.com/
    htttp://www.m-w.com/
    http://www.askoxford.com/?view=uk

    --
    Deleted
  11. Re:Now That You Mention It... by Anonymous Coward · · Score: 2, Informative
    Indeed and it's follow by an answer. What exatly is you point?
    That pulling fragments of a text out of its context serves to confuse?
    Well, I'm not getting these requirements from some arbitrary place, but from many books on properly displaying statistical data and graphical information. Read any book by Edward Tufte on displaying information, and just about any book on statistics for giving accurate information.

    In addition, I've developed this list after years of reading, writing, and studying studies with these problems. I've even read entire books on so called "performance tuning" which violate all of these basic things. I'm just blown away how someone can write an entire book or article on performance tuning and not mention standard deviation once or show one run chart with a mean and +/- standard deviation lines.
  12. This is a useful enumeration by eclewis · · Score: 2, Insightful
    Naturally, despite Zed's assertion, not all graphs need to avoid his enumeration of pitfalls. For example, if the target audience is mathematically unsophisticated, most statistics (save perhaps mean) are inappropriate. Or perhaps red is meaningful in some contexts. Or maybe the presenter wants to show a very small delta, so an axis range is chosen to illustrate this.

    Nevertheless, Zed's enumeration can be extremely valuable in helping a discerning reader (who doesn't already know it all!) to critically interpret graphs in order to decide what s/he may conclude. For example, if system X appears to outperforms system Y, but the difference may be within the (unpresented) deviation, one should not accept the assertion that X is superior. Instead, one may conclude that X may be better than or comparable to Y.

    Zed's article can help some of us tell the difference between lies and truth. That's a good thing. The unfortunate weakness of the article is that the example is not particularly compelling. It simply doesn't illustrate the most important pitfalls.

  13. Re:Also the graphs were fine in terms of scaling by T-Ranger · · Score: 2, Informative

    Actually, only having two values it is impossible to know how much difference there is. If you test 100 video cards made in the last 5 years, and their scores range from 500-11000, then these two cards are basicly the same speed. If your range is from 1800-1950 then they are radicly different. Numbers are meaningless without units, and units are meaningless if the user doesnt know what they are.