Slashdot Mirror


Alternate Baseball Universes

Jamie found a NYTimes op-ed by a grad student and a professor from Cornell, outlining some research they did into alternate baseball universes. The goal was to find out how unlikely in fact was Joe DiMaggio's 56-game hitting streak, played out in the 1941 season. No one since has even come close to that record. The math guys ran simulations of the entire history of baseball from 1885 on — 10,000 of them. For each simulation they put each player up to the plate for each at-bat in each game in each year, just like it happened; and they rolled the dice on him, based on his actual hitting stats for that season. (Their algorithm sounds far simpler than whatever the Strat-O-Matic guys use.) The result: Joltin' Joe's record is not merely likely, it's basically a sure thing. Every alternate universe produced a streak of 39 games or better; one reached 109 games. Joe DiMaggio was not the likeliest player in the history of the game to accomplish the record, not by a long shot.

6 of 229 comments (clear)

  1. If its so likely, they why hasn't it happened? by quanticle · · Score: 4, Interesting

    I know the statisticians among you are going to bash me with a cluestick for such a naive question, but I'll ask anyway - if this event is so likely to occur, then why hasn't it happened again?

    --
    We all know what to do, but we don't know how to get re-elected once we have done it
    1. Re:If its so likely, they why hasn't it happened? by Anonymous Coward · · Score: 5, Interesting

      The early years tended to be batting competitions (in some ways like today's) rather than pitching competitions If by "early years", you mean 1920 and later, yeah.

      Otherwise, buddy, you're way off base.

      NL year-by-year stats.

      Look at those ERAs pre-1920. Before 1920, the ERA on the NL never significantly exceeded 3.00. After 1920, it never dropped below 3.3 or so, with the exception of a 2.99 in 1968, after which MLB made changes to the rules, amongst them lowering the acceptable height of the pitcher's mound.

      The time prior to 1920 was marked by pitchers such as Cy Young, Mordecai Brown, Walther Johnson, Ed Walsh, Christy Mathewson. You've probably heard of most of them.

      Here are the single-season MLB ERA leaders. Outside of Bob Gibson in the aforementioned 1968, you have to go all the way to Greg Maddux in 1994 at #48 all time to find a season after 1920 on the list. Barely 10 of the 100 lowest single-season ERAs in MLB history occurred after 1920. And that's only because Pedro Martinez in 2000 and Ron Guidry in 1978 tied with 9 others for #100 on the list. So only 8 of the best single-season ERAs happened after 1920.

      You need to research "dead ball era", and the response by baseball to "Black Sox". (Hint: just like the response to the 1994 strike, it involves the ball...)

      The fact that you got a +5 out of such a demonstrably incorrect post is a major indictment of the baseball knowledge of the Slashdot faithful.
    2. Re:If its so likely, they why hasn't it happened? by ByteSlicer · · Score: 3, Interesting
      Well, they modeled the batter using random numbers and their player stats. The problem is that real people don't behave deterministically. They might hit better on their birthday, or when it was a clear sky the night before the game, just because they believe that (baseball players are extremely superstitious). The model doesn't take into account that some player might get psyched out by a certain number (and always screw up on the 13th consecutive hit), or just by the pressure of wanting to break the record. It doesn't take into account the pitcher, weather conditions, and a lot of other things that matter to real people but not to computers.

      You might be able to model some long term behavior that way, but never the short term stuff, because the model is too simplified (man versus dice).

  2. Changing game of baseball by __aanonl8035 · · Score: 4, Interesting

    One of the key points mentioned in this article is when does the hitting game streak occur? They mention that it was much more likely to occur during the early 1900's which is known as the deadball era. The baseball wasn't as springy and they tended to use the same ball during the entire game. During that time it was more efficient to try and knock the ball between the holes in the fielders and get a double or single then to try and hit it out of the park.

    I think it would be more impressive to take a subset of the data, and compare from 1930 up until the present. Of course, there have been other major changes to; glove sizes, introduction of the slider for a pitch, steroid use.

  3. Re:Bogus by Miseph · · Score: 3, Interesting

    No, because the probability for ANYTHING, given enough chances, is 1.

    What they are actually saying is that reality appears to follow a probability bell curve.

    You could also say that, in 1,230,000 years of baseball games, we could be almost certain of a hitting streak longer than 56 games.

    --
    Try not to take me more seriously than I take myself.
  4. Re:Nerves by Kjella · · Score: 3, Interesting

    In any case, what you are talking about would affect all players equally, therefore it would cancel itself out in their research. Not when they use it the way they use it, and say streaks of 39 to 109 is to be expected. If the difficulty increases by the length of the streak, 56 could be a far more exceptional streak than their research indicates.
    --
    Live today, because you never know what tomorrow brings