Busting the MythBusters' Yawn Experiment

← Back to Stories (view on slashdot.org)

Busting the MythBusters' Yawn Experiment

Posted by kdawson on Monday April 23, 2007 @01:22PM from the it's-not-the-company-it's-the-hour dept.

markmcb writes "Most everyone knows and loves the MythBusters, two guys who attempt to set the story straight on things people just take for granted. Well, maybe everyone except Brandon Hansen, who has offered them a taste of their own medicine as he busts the MythBusters' improper use of statistics in their experiment to determine whether yawning is contagious. While the article maintains that the contagion of yawns is still a possibility, Hansen is clearly giving the MythBusters no credit for proving such a claim, 'not with a correlation coefficient of .045835.'"

6 of 397 comments (clear)

Min score:

Reason:

Sort:

Not quite, OmniNerd by Miang · 2007-04-23 13:38 · Score: 5, Informative

TFA's conclusion is correct but their methods are wrong. For these kind of data, correlations aren't the appropriate test; they should have used a chi-square distribution test. Using TFA's assumptions -- total sample size of 50, 4 yawners out of 16 not seeded, 10 yawners out of 34 seeded -- the chi-square value is .10, which pretty strongly misses the critical value of 3.84 for significance. Not that it matters anyway, but it's pretty funny to read an article debunking statistics that employs inappropriate statistics itself...
Doesn't anyone know statistics any more? by Ichoran · 2007-04-23 13:49 · Score: 5, Informative

Not only was MythBusters embarassingly statistics-free, but the "busting" was done using a wholly inappropriate statistical technique. Hansen used a correlation-based test, which assumes that the data follows a Normal distribution (which a bunch of 1s and 0s do not).

There is a very well-known test, the chi-square test, that deals with exactly this case. (Given the small sample sizes, the Fisher exact test may give better results.) Someone should point Hansen to the Wikipedia page on the topic.

For example, if there are 16 non-primed people, with 4 yawning and 12 not (for 25%), and there are 34 primed people, with 10 yawning and 24 not (for 29%), the chi square test gives a p value of 0.74.

The values Hansen supposes are significant 4,12 and 12,24 are not: p = 0.29.

You have to go all the way to 4,12 and 17,19 (i.e. 47% on a sample of 36) to get significance.

MythBusters was wrong to conclude that their results were significant, but Hansen was equally wrong to conclude that he had shown that Mythbusters was wrong.
1. Re:Doesn't anyone know statistics any more? by gumbi+west · 2007-04-23 14:02 · Score: 4, Informative
  
  You were actually right that it's Fisher's exact test that you want, it's similar to doing a complete permutation test which is exact. Because this is a 2x2 table, there's no reason not to use the exact test. The actual result has a p-value of 1.0 in a two-tailed test (whoops!) and even 4,12 and 17,19 has a p-value of 0.22 in the two-tailed test. In deed, it would have to go all the way to 4,12 and 21,15 to be significant at the 5 percent level for the two-tailed test. The two-tailed test is the right one because you had better believe that they would have made a big stink if it had come out the other way!
  But all this aside, I'm not sure I like the experiment. Why bore people? Why have so many in the room. the 4,12 number is way too high, I'd say the were better off looking at narrow time slices and natural yawns (i.e. do yawns happen at random or do they set off avalanches). Then there is only one group and you're just testing the Poisson process assumption of uncorrelatedness.
Re:Science by Excors · 2007-04-23 14:04 · Score: 4, Informative

Unfortunaely I can't find the name of a program that aired in the UK about 6 months ago. It took a team of 4 people to a deserted island and each week they had a task to complete each, they were only allowed to use what was on the island and what was given to them each week (as well as a tool set because, well no tools = screwed). They had to do things like make fireworks, record a song and various other "minor" things which required them to render down various things to achieve the chemicals they needed to complete each task. What they did and what it resulted in was very clearly labeled, having real science explained behind it.

Would that be Rough Science? In particular, it sounds like the second series. I've seen a couple of the series over the past few years, and I believe it did a pretty good job of being a science show – the interest comes from watching people who actually know what they're doing, designing and building ingenious solutions (admittedly with very convenient tools and materials available) to problems that aren't inherently interesting (like making toothpaste or measuring the speed of a glacier), rather than relying on 'interesting' problems that are large/dangerous/explosive and lacking focus on the solution process.
Re:Mythbusters is not scientific by wesmills · 2007-04-23 15:04 · Score: 4, Informative

They have stated both on the show and in other interviews that a lot more testing goes on than just what we see on the show. For the "showcase" experiment on each show (the one that opens and closes the program), the producers have taken to placing video of most or all of the tests on their Discovery website: http://www.discovery.com/mythbusters
Re:Submitter gets an F on this one by martin-boundary · 2007-04-23 18:20 · Score: 5, Informative

Who marked this informative?
The number of significant figures in an answer depends on how the function propagates errors. It's INCORRECT in general to think that if the inputs are given with two significant digits (say), then the output is only good for two significant digits.
The CORRECT way is to perform error analysis on the function being computed. If the function is linear, then the error magnitude is essentially multiplied by a constant. If that constant is close to 1 (and only then) will the output accuracy be close to the input accuracy.
In general, a function being computed is nonlinear, and the resulting number of significant digits can be either more or less than for the input. Examples are chaotic systems (high accuracy in input -> low accuracy in output) or stable attractive systems (low accuracy in input -> high accuracy in output).