Humans Can Still Out-Bluff Machines
Pcol writes "The New York Times reports that in a poker game this week between man and machine, a program called Polaris fought a close match, but lost to two well-known professional poker players. Designing a poker playing algorithm is a different and more difficult challenge for software designers than chess and checkers because of uncertainties introduced by the hidden cards held by each player and difficult-to-quantify risk-taking behaviors such as bluffing. The game-tree approach doesn't work in poker because in many situations there is no one best move and a top-notch player adapts his play over time, exploiting his opponent's behavior. Polaris build a series of "bots" that have differing personalities or styles of play, ranging from aggressive to passive. Researchers monitored the performance of three bots and then moved them in and out of the lineup like football players."
In poker you have a finite number of cards, that are a lot smaller than the permutation of moves in chess or checkers. Just the ability to count cards and do statistical analysis makes poker, blackjack, etc easier to compute in my opinion. Then again, if you had a deck of random cards and not a standard deck, that would make it a bit harder but that's not how it's really played. That would be like comparing it to chess with all queens.
The implication here is that there is no (known) equilibrium mixed strategy for bluffs (because if there were then Polaris could be coded to use it).
Is that really true?! It seems very counterintuitive.
Certainly there's nothing special in general about games involving bluff. One of Von Neumann's first game theory case studies involved a simplified version of poker precisely to demonstrate how to automate bluffing.
There were ten "bots"; which bot was in use was controlled by a "coach" program. They actually ran three different programs over the course of the tournament, and that setup actually lost to the humans. The coach / agent approach is an interesting one for a variety of reasons, and it is most definitely a valid strategy.
The researchers didn't choose which bots were used themselves - they had ANOTHER 'coach' bot that moved the 'player' bots in and out.
paintball
Poker has elements of chance. Chess does not. You can play the odds to help minimize the risk of chance, but it's still there. That one two or even 5 games resulted in a win for side A versus side B is pretty much meaningless. With chance involved you really need to conduct this sort of experiment over thousands, if not millions of games, to even begin to get a handle on if there really is a "better" player in the computer code.
You can flip a coin 5 times and all 5 might be heads... doesn't mean that heads will always win. That's chance. That's poker, even if the pros and the weekend wannabes try to argue otherwise.
Keep in mind these bots play Limit hold'em, where the size of the bets is fixed. No-limit hold'em, the kind you typically see on tv is a much more complex problem - size of bets add more potentially misleading information and more choices to make. (that's why its more exciting to watch than limit)
"is this game honest?"
well, no. not if the guy was dealt a "suited pair" from a single deck.
There's no such thing as a "suited pair" in a single deck.
You have four distinct suits, and thirteen distinct ranks. There is one card of each of the thirteen ranks in each suit, and likewise there is exactly one card of each suit at a given rank.
A "pair" is two cards of the same rank. "Suited" means two cards of the same suit. So to have a "suited pair", one must have two cards of the same rank and the same suit.
Therefore, by definition, if you have a "suited pair" and you're playing a single-deck game, the game cannot possibly be honest.
Look at the first entry (bottom of page) on the Polaris team's blog for the second day. The day that the humans started winning:
e /Live/Day2Session1/
http://www.cs.ualberta.ca/~games/poker/man-machin
The U of A team gave the humans the logs of the first two games!
Perhaps after the entire match they could have reviewed the game logs, however this give the humans an unfair advantage during the second day. I can't believe that this isn't getting more attention -- they bascially gave the human team a huge insight into the inner workings, strategy, and tendencies of their opponent. Something that Polaris definitely did not have.
In my opinion this sours the competition and completely invalidates the final two matches. The human likely found a weakness (or two or three) and exploited it, and we can't know for sure that they would have found the weakness without those logs.
That was a huge mistake by the U of A team, and they have apparently got away with it without anyone noticing.
Sancitmonious tripe. Those who lose money from poker do so voluntarily
so its a a form of entertainment they pay for. Professional poker
players are no more a leach on society than opera singers.
http://rareformnewmedia.com/