Slashdot Mirror


The Visual Display of Quantitative Information

danny writes "The Visual Display of Quantitative Information is a recognised classic on statistical graphics; to the 1983 original, this 2001 edition adds some additional graphics, extra colour, and corrections. It is a broad-ranging work, covering history, theory and practice and, despite the formal title and scholarly references, not at all narrowly academic. It assumes only a very basic understanding of statistics." Read on for the rest of Danny's review. The Visual Display of Quantitative Information author Edward R. Tufte pages 197 publisher Graphics Press 2001 rating 10 reviewer Danny Yee ISBN 0961392142 summary the classic work on statistical graphics

Tufte begins with the different kinds of informational graphics (maps, time-series, narratives, and relational graphics), describing their origins and evolution and presenting examples of excellence in their design. Many of these are fascinating in their own right -- two that I particularly appreciated were Minard's depiction of Napoleon's disastrous retreat from Moscow and an 11th century map of China.

"For many people the first word that comes to mind when they think about statistical charts is 'lie.'" Tufte gives examples of different kinds of deceit in graphics, along with some principles for maintaining graphical integrity. He goes on to consider the reasons for the poor quality of many informational graphics: one is the relegation of their design to those with art training but without an understanding of either the substance of the material or of quantitative (statistical) methods.

Part two begins by introducing some terminology and theory for describing graphics. The principle "Above all else show the data" is formalised as maximization of the data-ink ratio, and illustrated with some "before and after" examples of erasure of redundant or non-data-ink. Tufte excoriates various kinds of "chartjunk": moire vibration (the disconcerting effect caused by repeating patterns), the overuse of grids, and the "ducks" created when the design takes precedence over everything else.

Tufte gives specific suggestions for the design of box plots, bar charts, and scattergraphs. He argues for the use of multifunctioning graphical elements -- building data measures or grids out of the data itself, for example, by using labels that also show the end points of the data ranges. And he looks at ways of maximizing data density (within reason) and using "small multiples," or repeated smaller graphics. A final chapter steps back to consider the balance between text, text-tables, tables, semi-graphics, and graphics -- "Given their low data-density and failure to order numbers along a visual dimension, pie charts should never be used" -- and to touch on the aesthetics of proportion and scale.

All of this is liberally illustrated with examples, drawn from across the natural and social sciences. Despite the space devoted to these, The Visual Display of Quantitative Information packs a lot in, avoiding repetition or verbosity. Tufte's own tables and graphs are appropriately effective and the volume as a whole is elegantly put together: though it's more than that, it could be appreciated simply as a work of art. Tufte also finds room to survey publication practices across a select sample of international newspapers and journals, comparing the data density of graphics and the proportion of relational graphics (involving at least two variables that aren't temporal or spatial).

Most obviously, The Visual Display of Quantitative Information should be read by those involved in writing, editing, or designing documents or displays that contain statistical graphics -- from professional editors, technical writers, academics, and journalists right down to high school students. But others may appreciate it too: it has changed the way I look at informational graphics.

Danny has written over 700 book reviews. You can purchase The Visual Display of Quantitative Information from bn.com. Slashdot welcomes readers' book reviews -- to see your own review here, read the (recently updated) book review guidelines, then visit the submission page.

9 of 121 comments (clear)

  1. The Columbia Team could have used this book by twiddlingbits · · Score: 5, Interesting

    The author was interviewed by the CAIB. He stated a lot of the information presented that lead the NASA managers to the "we can't do anything" approach was poorly designed and did NOT get information across, or was slanted. He pointed out several PowerPoint slides that had 10-12 errors on them that led to incorrect interpretations by the audience. This is a GREAT book, and should be on every engineer's shelf if they present data to an audience (including peers). MBA's study some of this in their classes, but (most) Engineers and Scientists and Doctors don't. It's a shame when you have great information that is hidden by poor presentation.

  2. Bad statistical graphics are everywhere by rev063 · · Score: 4, Interesting
    Despite this classic book being available for 20 years, there are still plenty of examples of bad statistical graphics to be found in the mainstream media. Here's just one example from the Seattle Times this year, along with a "non-lying" revision of the chart, but you can find plenty by flipping through just about any regional newspaper. Or any edition of USA Today. The NYT usually has good charts, though.

    I thoroughly recommend this book to anyone who has to produce charts for their job.

    1. Re:Bad statistical graphics are everywhere by B'Trey · · Score: 2, Interesting

      Because that's how the original article (sadly no longer available) divided society.

      Funny, I have it up in another tab right now. I followed your link to the Seattle Times, created a bogus login, and accessed the article in the archives. It does not have the graphic in the archives article, but your page links to that.

      Nowhere in the article does it divide society up as you claim. I only find one mention of different percentiles:

      Seattle Times Quote: The top 5 percent of the nation's taxpayers paid 41 percent of all federal taxes, according to the Joint Committee on Taxation. But that same group paid from 56 to 59 percent of all income taxes.

      There were four groups: lower classes (bottom 60% quantiles) middle classes (not mentioned), upper classes (above 5%) and super-rich (above 1%). Then a graph compared income taxes paid by each. The top two groups were compared IN THE CONTEXT OF the bottom group, but the top two groups overlapped each other! This is deception.

      This is simply false. The article never mentions three or four groups. I posted the only mention of percentages in a quote above. Pull it up and look. (If you dispute that, let me know and I'll paste the entire text of the article in a comment and copyright laws be damned.) The point of the graphic is not to compare the bottom 60% to the top 5% to the top 1%, nor is it to show the distribution of the tax burden across the entire spectrum of the population. The point of the graph is to compare each of the top 5% and top 1% with the bottom 60%. The fact that the top 5% and top 1% overlap is intentional. No attempt is made to have the numbers add up to 100%. THAT ISN'T THE POINT OF THE GRAPH. In effect, the graph says "The top 5% pays much more that the bottom 60%, and even if you restrict it to the top 1% they still pay more!" The point is to show the disparity, and the graph does that admirably.

      Bertin's point regarding multiple pie charts is correct. Also, pie charts with more than four wedges are difficult to comprehend. That doesn't mean pie charts are useless. One would have worked fine for your "corrected" graph. (It would have been a poor choice for the original graph, however, because the original graph was not intended to show a division totaling 100%.)

      But it doesn't make sense to compare the ratios of A+B with C, with the ratio of B with C. That's plain misleading.

      It makes perfect sense to compare the top 5% with the bottom 60%, then to compare the top 1% with the bottom 60%. That's the purpose of the graph.

      And, for the record, IAAS (I am a statistician).

      God but I hope this is a lie. If not, please let me know where you earned your degree so I can make sure my kids don't go there.

      --

      "The legitimate powers of government extend only to such acts as are injurious to others." Thomas Jefferson.

  3. While we're on the subject... by chmod_localhost · · Score: 1, Interesting

    I have recently converted to the Linux way of doing things, after being fed up with M$ for too long.

    In my department, we use proprietary software for all of our data reporting. I would like to use an open source program instead, but since I'm new to Linux, I'm not sure what's out there.

    I'm hoping the slashdot community can help me on this one- what are some good plotting programs that run on Linux?

  4. Filelight by Makarakalax · · Score: 3, Interesting

    I've written a small program for KDE that exhibits what I feel is a fairly novel method for representating hierarchical data graphically.

    Currently it only shows information related to your filesystem, but with the next version it will begin accepting any kind of hierarchical data piped from the cli, via a text file, etc. (method of input as yet unfinalised).

    If anyone's interested, here's a screenshot, and here's the homepage

    I apologise for the plug; usually I'm quite good and wait for at least on-topic opportunities! I'm sure I'll still get the usual ac death threats etc. nothertheless I hope to have interested some people.

  5. Coffee Table Book for PowerPoint Jockies by Anonymous Coward · · Score: 3, Interesting

    I got this book for Xmas a few years back and was a bit disappointed -- it is basically an "old school" version of Jakob Nielsen. The book was pretty and a somewhat interesting read with (as everyone always mentions) good historical examples, but expensive and ultimately not incredibly insightful. ET seems to have carved a good niche for himself making PowerPoint jockies feel part of a broader cultural tradition.

    Sorry to be so negative.

  6. Napoleon map and ClickTracks by Stephen · · Score: 2, Interesting

    Interesting that you should single out the map of Napoleon's retreat from Moscow. That graphic was the inspiration for our web log analysis program ClickTracks. Our CEO saw it and realised that what web log analysis needed was to show data in context, rather than in long lists. We have the poster of the Napoleon map on the wall of our office.

    --
    11.00100100001111110110101010001000100001011010001 1000010001101001100010011
  7. Re:Good Book! by Stephen · · Score: 2, Interesting
    The other two books are called Envisioning Information and Visual Explanations. I recommend all three of them, although there is some overlap.

    Have a look at Tufte's sculptures too.

    --
    11.00100100001111110110101010001000100001011010001 1000010001101001100010011
  8. Re:Rating: 10? by Anonymous Coward · · Score: 1, Interesting

    Ah. I see.

    1
    2
    3
    4
    5
    6
    7 .........
    8 ............
    9 ..............
    10 ..*.........

    and on the second slide, you'd say
    * = this book
    . = other books reviewed here