Elo Chess Rating System Topped By Proposed Replacements
databuff writes "About six weeks ago, Slashdot reported a competition to find a chess rating algorithm that performed better than the official Elo rating system. The competition has just reached the halfway mark and the best entries have outperformed Elo by over 8 per cent. The leader is a Portuguese physicist, followed by an Israeli mathematician and then a pair of American computer scientists."
Castle this.
To be fair that owning represents a difference of 0.000629 in the RMSE between the two of them - hardly the sound thrashing those snooty mathematicians rightly deserve.
Yes, I agree. We should also fight amongst professions because we simply do not have enough to fight about.
Long live Physicists and they physicisteries!
This is chess rating algorithm. The goal is to predict given a matchup between two players with known histories how they will likely fare in a game or series of games against each other. Elo is the standard rating system and has been for some time. These algorithms are improvements on that. So they predict better who will win. They have nothing to do with playing actual chess. So the Turk is irrelevant to this discussion (aside from the not minor issue that the operator has been dead for some time.)
Whoah there partner, we don't want a full-scale fight between all professions - some of those guys are pretty buff. Pick off the mathematicians and physicists first because the law of the playground must be respected - the small, weak, bifocaled, or curiously gifted with numbers should be taken down first. Then nap time.
A friend called my on my telephone line and told me out of the blue that the Elo rating system had been bested. I was so stunned I almost turned to stone. I said, "Dude, don't bring me down!". But the news slowly sunk in, and now I can't get it out of my head. But I'll tell you what, the jury is still out. I think there's gonna be a showdown, and then Elo will be back on top.
This is entirely unsurprising. The Elo system was, in a sense, designed to be easily calculable in a time before things like computers or databases or data mining were especially common (after all, it was adopted by US Chess Federation in 1960!), and it hasn't been revised much if at all since then. Of course statisticians using modern methods and number crunching capabilities and huge databases of both game results and game moves are going to be able to beat it by a lot - this isn't like the Netflix prize, where a bunch of teams were competing to improve something that had been in active development up until that very year.
Not any more.
Boffoonery - downloadable Comedy Benefit for Bletchley Park
The french computer scientist Remi Coulom, well-known for the pioneering computer go program Crazy Stone, has published some very interesting research on this issue. He claims not only to beat Elo, but also Glicko, Microsoft's TrueSkill and decayed-history approaches.
I was going to see if I could implement his ideas for the competition, since he's not going to participate himself. But it doesn't look like I have time for it.
Here's the paper in case anyone wants to give it a try. I suspect the approach is a bit more solid than the ad-hoc approaches of the quants.
xkcd is not in the sudoers file. This incident will be reported.
So, how did they rank the entries?
sic transit gloria mundi
No, Portrugal. Between Spairn and the Atlantirc.
Many rating systems seem to assume transitive dominance structures. If you are playing rock/paper/scissors no rating would be sufficient to predict the outcome of a tournament. Many games (using Batttlenet, true skill..) propably are not interested in finding nontransitive structures since players want to be the best and fans want to know who is the best which is kind of pointless with r/p/s.
Yeah, and his name is Él(Lowercase O-double acute), not Elo, but I understand that "hungarian umlauts" causes significant cognitive stress :)
Even for Slashdot it seems...
Actually he was born Él Árpád Imre but changed his name to a more Americanized Arpad Emrick Elo.
Careful, those physicists have arsenals of powerful lasers at their disposal...
But do they have sharks on which to mount them?
"But do they have sharks on which to mount them?"
We must avoid them teaming to Biologists at all costs!