Slashdot Mirror


Computer System Makes Best Sports Bets

schliz writes to tell us that a new computer system using the "Logistic Regression Markov Chain" (LRMC) has proven to be the most efficient system at predicting sporting event outcomes. The system was tested on the 2008 US NCAA basketball season and picked all four of the finalists. "Similar to other rankings systems, LRMC uses the quality of each NCAA team's results and the strength of each team's schedule to rank teams. The method has been designed to use only basic scoreboard data, including which teams played, which team had home court advantage and the margin of victory."

17 of 73 comments (clear)

  1. For the first time in a while... by insertwackynamehere · · Score: 5, Insightful

    The final four were also all #1s in their league. Coincidence? This has never happened before I believe and if the computer calculates odds the way the teams are ranked, then this may not always be so reliable.

  2. Making sports bets by Z00L00K · · Score: 4, Interesting
    Is always a question of statistics with a random noise involved.

    The amount of noise involved strongly depends on which sport that is involved. Basket is a sport where a lot of points is scored, which in turn means that the noise is relatively low while football (what americans call soccer for some strange reason and what americans call football is more like rugby) has a lot of noise since the ability to score a goal there is depending a lot on luck.

    This essentially means that counting points is a good way to score a basketball team while counting goals won't give much clue to how good a given football team is. You must look at other factors on a football team instead. And not all those factors can be as easily measured. Of course - the other factors are also important for a basket team. Other factors involved are the composition of players, individual player mood/health/inspiration, latest matches, history between the teams, referee behavior, weather, spectators, location, timezone etc. Add to this the element of randomness caused by the impact of the ball on a surface, player positions at certain points of the game etc.

    --
    If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
    1. Re:Making sports bets by Mr.+Underbridge · · Score: 4, Insightful

      Is always a question of statistics with a random noise involved.
      The amount of noise involved strongly depends on which sport that is involved. Basket is a sport where a lot of points is scored, which in turn means that the noise is relatively low while football (what americans call soccer for some strange reason and what americans call football is more like rugby) has a lot of noise since the ability to score a goal there is depending a lot on luck.

      American football, over the course of a full game, has coarse scoring jumps (7pts for a touchdown) but luck plays a surprisingly small role. This is why good teams have very high winning percentages and poor ones have such low winning percentages. Not sure how that dynamic works in futbol, but the luck factor isn't as large as you'd think.

      The reason the LRMC method is well-suited to NCAA basketball is that A) there are a lot of games, and B) the good conferences don't play the bad ones much. That means that a high-order Markov model is a good way to determine who would beat whom through a game of "I beat a team that beat a team that beat a team that beat you" sort of thing.

      I came up with a version of this independently before I stumbled over these guys last year. It's pretty fun and works quite well. It's certainly much better than the polls, and in most cases last year my system was within a point or two of the Vegas spread. It's also pretty good at recognizing underdogs early - mine had Davidson and Drake before they were in the polls.

    2. Re:Making sports bets by drooling-dog · · Score: 2, Interesting

      Several years ago I was playing with some iterative and least squares approaches to predicting (American) football scores and rating teams. It worked pretty well, but one thing stood out: When you use only the scores from previous games and home/visiting status as inputs to the model, you hit a pretty hard floor of about 2 touchdowns (13 or 14 points) for your standard error. That error includes the "hidden variables" that you mention, as well as the fundamental randomness of the game.

      It also implies that any statistical predictions you do are going to be off by 7 or more points 62% of the time, 14 or more about 32% of the time, and 28 or more about 5% of the time. That's worth considering when betting against a spread...

  3. My NCAA predictor code had the same result! by doxology · · Score: 2, Funny

    Here's the code I used

    List pickFinalFour(Tournament tourney){
          List finalFour = new ArrayList();
          for (Division d : tourney){
                Team bestTeam = null;
                int minSeed = Integer.MAX_VALUE;
                for (Team t : d){
                      if (t.getSeed()minSeed){
                            minSeed = t.getSeed();
                            minSeed = team;
                      }
                }
                finalFour.add(bestTeam);
          }
          return finalFour;
    }

    --
    sigfault. core dumped.
    1. Re:My NCAA predictor code had the same result! by Anonymous Coward · · Score: 3, Funny

      Really? Here's mine:

      pick :: Ord a => [[a]] -> [a]
      pick = map minimum

  4. Best bet is not to bet... by Bazman · · Score: 4, Interesting

    One of our research assistants started doing something like this about ten years ago, fitting a statistical model to previous soccer match results and the home/away effect. He rounded some of us up to chip in a few pounds each week and off he went to the bookies to bet on the outcome of his model.

    Now, any statistical model (such as this LRMC thing, or the techniques m'colleague used) will only give estimates of the odds. It might say that the probability of team A winning is 0.6. Now, if the bookies are offering you a return of 0.7 then it's worth a bet. If the bookies rate it 50-50 then it's not worth a bet.

      The trouble is that any statistical model worth its salt is going to produce probabilities that add up to 1.0, whereas the bookies' odds can add up to 1.2 or so. That's how they play the game and make their profits.

      So after a season where we made a few pennies profit, and got some press interest (including a team from BBC Tomorrow's World filming us playing football), my friend realised the best thing to do was not to bet at all.

      And instead he went into the business of supplying odds to bookmakers. From where he now sits at the top of a rather large business empire!

      I might pop him an email to see what his current techniques are, but back in the day it was something similar to this LRMC thing.

  5. Re:Why not test it for the past 10 years? by Anonymous Coward · · Score: 2, Insightful

    Why would 10 years be so much better than the 9 years they analyzed?

  6. Re:only one question by TheCreeep · · Score: 2, Funny

    That's ok, becase I don't think that they created the algorithm with you in mind. You're just a negligible quantity.

  7. Wait a minute! by ydra2 · · Score: 2, Funny

    Are you telling me that somebody actually looked at win/loss records and margin of victory and strength of opponents to figure out which team might win? How can this be? Why did nobody ever figure out this simple algorithm before? [slaps forehead with hand] DOH!

    Oh wait, sorry it was patented years ago, and multiple times with minute variations such as going back to strength of opponents opponents, and margin of victory of opponents against common opponents, and strength of opponents opponents opponents, and ....

    But if you add in what they ate for breakfast, then you might have a new patentable algorithm.

  8. I'm not convinced by drsquare · · Score: 3, Insightful

    If I had a computer that could predict sports results, I wouldn't tell anyone about it. I'd take a briefcase full of cash down to the bookmakers.

  9. RTFA by sarahbau · · Score: 5, Informative

    I know this is Slashdot, but why can't people RTFA before commenting? They aren't using the seeds or rankings in the program - only game stats, home quart advantage, etc. They ran it on the last 9 years of data and it picked final four teams 30% more often than analysts. (30/36 vs 23/36).

    The linked article didn't mention it, but from the GA Tech web site, it said that it correctly identified several overrated teams that lost early on (like Georgetown), and underrated teams that went farther than expected (like WVU). The program picks Kansas to win this year.

  10. Data mining by 26199 · · Score: 2, Insightful

    Doesn't say whether the test was done on in-sample or out-of-sample data. That is, did they test using the same data that was used during development?

    If so, the results are worthless. You can make a "system" that says anything you want given enough tweaking. (This is often the problem with apparently successful computer trading models).

  11. Great sample by Idiomatick · · Score: 4, Insightful

    Great sample... They should test the algorithm on maybe 80 historical seasons and maybe we will be able to see something.

  12. Re:Now Hear This! by maxwell+demon · · Score: 2, Funny

    "We want this machine off, and we want it off now!"

    But I can predict which team the machine will predict to win: Team #42

    --
    The Tao of math: The numbers you can count are not the real numbers.
  13. I'm using it by jedijacket · · Score: 2, Interesting

    I heard about this last year and used their picks for this year's bracket. I'm tied for first in my pool, and 93.5% nationally in espn's bracket game. Just for comparison of how good their choices are. They had 100% on the first round day one.

  14. Link to the paper by yo · · Score: 2, Informative

    Here is the paper describing the method: http://www2.isye.gatech.edu/people/faculty/Joel_Sokol/ncaa.pdf