Researchers "Solve" Texas Hold'Em, Create Perfect Robotic Player

← Back to Stories (view on slashdot.org)

Researchers "Solve" Texas Hold'Em, Create Perfect Robotic Player

Posted by samzenpus on Thursday January 8, 2015 @12:01PM from the I'll-raise-you-infinity dept.

Jason Koebler writes The best limit Texas Hold'Em poker player in the world is a robot. Given enough hands, it will never, ever lose, regardless of what its opponent does or which cards it is dealt. Researchers at the University of Alberta essentially "brute forced" the game of limit poker, in which there are roughly 3 x 10^14 possible decisions. Cepheus runs through a massive table of all of these possible permutations of the game—the table itself is 11 terabytes of data—and decides what the best move is, regardless of opponent.

19 of 340 comments (clear)

Min score:

Reason:

Sort:

I guess that means ... by nospam007 · 2015-01-08 12:01 · Score: 4, Funny

...they got banned by 6241 online casinos and bragging here is the only thing left?
1. Re:I guess that means ... by houstonbofh · 2015-01-08 12:20 · Score: 4, Insightful
  
  Well, you would not want to brag here BEFORE getting banned by the 6241 online casinos, would you?
2. Re:I guess that means ... by Opportunist · 2015-01-08 13:24 · Score: 5, Insightful
  
  Maybe so, but a lot of online casinos have some reason to kiss their feet. You see, a lot of countries have laws against "gambling", which is usually defined as playing games for money where luck is the deciding factor.
  And these people just proved that luck plays no role.
  So playing poker is no longer gambling. Scientifically proven.
  
  --
  We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
3. Re:I guess that means ... by runningduck · 2015-01-08 14:25 · Score: 4, Interesting
  
  What happens if someone else creates an identicly perfect robotic player and joins the table? If, as the researches claim, winning limit Texas Hold'em is a directly solvable problem then anybody else who tries to solve the problem will come up with the exact same solution.
  If these two robots played each other wouldn't the winner be determined by pure luck?
  
  --
  -rd
4. Re:I guess that means ... by turbidostato · 2015-01-08 14:38 · Score: 5, Informative
  
  "What happens if someone else creates an identicly perfect robotic player and joins the table?"
  Not that I know so much about Texas Hold'em but, by the look of the text, "...Given enough hands, it will never, ever lose, regardless of what its opponent does or which cards it is dealt." these researchers have discovered the equivalent of a Nash equilibrium in the game.
  "If these two robots played each other wouldn't the winner be determined by pure luck?"
  Key words here are "given enough hands". This means that given enough hands, they would tie.
5. Re:I guess that means ... by greg1104 · 2015-01-08 16:07 · Score: 4, Insightful
  
  Casinos take a small amount of money out of each hand, the "rake". So if two perfectly matches robots play, the casino wins, as they slowly bleed all the money away from them both. These guys could have saved a lot of time. If you want a poker game where you always win, all you have to do was is the casino.
6. Re:I guess that means ... by kamapuaa · 2015-01-08 16:34 · Score: 5, Insightful
  
  This doesn't prove it at all. You shouldn't have been modified +5 Insightful. Over the course of a game, luck is very important. Like you get dealt the card you already have, or you get dealt crap.
  Blackjack also has a perfect strategy, and of course blackjack has a great amount of luck.
  I guess if you planned on playing an infinite amount of games, luck wouldn't be a part of Texas Hold Em. For somebody playing a finite amount, luck is key.
  
  --
  Slashdot: providing anti-social weirdos a soapbox, since 1997.
Perfect? Really? by igny · 2015-01-08 12:05 · Score: 4, Interesting

Wouldn't another robot which knows of all possible decisions of this particular robot be better that this "Perfect Robotic Player"?

--
In theory there is no difference between theory and practice. In practice there is. - Yogi Berra
1. Re:Perfect? Really? by grimmjeeper · 2015-01-08 12:14 · Score: 4, Informative
  
  The thing about this robot is that it only wins over time and many hands. The best you can do with another robot is to tie. But that's all in the long run and assumes you have deep enough pockets to keep playing through the losing hands. The odds don't hold up for individual games played in isolation. Texas Hold'Em is very dependent on the draw of the cards and that randomness makes it impossible to win every time. This robot won't win every single hand so it's maybe not so hard to beat in the short term over a few hands if you get a lucky draw. But in the long run it will win (or tie if it's playing another robot).
2. Re:Perfect? Really? by ais523 · 2015-01-08 12:25 · Score: 5, Informative
  
  Mathematically speaking, all these games which are based around predicting what your opponent might do (and possibly a random factor, like in poker) have a perfect strategy, but that perfect strategy has random factors. For instance, the mathematically perfect strategy for rock/paper/scissors is to pick "rock", "paper", and "scissors" each with 1/3 probability. There is nothing an opponent can do to get more than a 50:50 chance of beating this strategy.
  Rock/paper/scissors is unusual in that the game is symmetrical: a perfect strategy can't get any better than 50:50 against anyone. That's not true of poker, though; in such a case, a perfect strategy will have a better than 50% chance of beating anyone who plays imperfectly, and a 50% chance against a perfect strategy (due to symmetry).
  I'm actually quite interested in the theory of this sort of game (where there are random factors and outguessing opponents involved), and even in simple cases, the calculations can be hard. I'm reasonably interested in whether this poker strategy is a probabilistic one (that can't be outpredicted as long as the random number generator used is sufficiently high-quality), or whether it just takes the best option without randomizing (which is much easier to implement, but which can be outplayed via knowledge of the algorithm like you suggest).
  
  --
  (1)DOCOMEFROM!2~.2'~#1WHILE:1<-"'?.1$.2'~'"':1/.1$.2'~#0"$#65535'"$"'"'&.1$.2'~'#0$#65535'"$#0'~#32767$#1"
3. Re:Perfect? Really? by ShanghaiBill · 2015-01-08 12:30 · Score: 4, Insightful
  
  Wouldn't another robot which knows of all possible decisions of this particular robot be better that this "Perfect Robotic Player"?
  There may still be ways to beat it. For instance, two or more opponents could collaborate. So someone with a poor hand could be running up the pot so his teammate can win. Just knowing the odds isn't enough, because all bets are not for the same amount*. If your opponent suddenly makes a big bet, what does that mean? He could have a hot hand, he could be bluffing, or he could just be trying to run up the pot. Without some knowledge of the guy's history, and his relationship with the other players, it is hard to say. I play very differently against my amateur coworkers on "poker night", than I would against a pro in Vegas. Amateurs are way more likely to bluff, or stay in the game with a mediocre hand.
  *I just RTFA, and I now understand that the robot plays a limited game where only certain fixed amount bets were allowed ... but that really means they didn't solve it, because that is not how real poker is played. Also, it looks like it only plays one opponent, rather than at a table of four. So this is like solving "queens and pawns" and claiming you have "solved chess".
4. Re: Perfect? Really? by Anonymous Coward · 2015-01-08 13:53 · Score: 4, Insightful
  
  Uh, if you can't see my cards, how would you have any idea what my decision tree said?
Bets by gninnor · 2015-01-08 12:11 · Score: 4, Insightful

Recently I noticed that Texas Hold'em is only half of the game. The betting is the real strategic part. Unless the bot can do this well, I don't it will ever really "beat" a human player.
1. Re:Bets by hodet · 2015-01-08 13:15 · Score: 4, Interesting
  
  In limit poker there is more often a correct play. The odds would dictate, in a large pot to call that last bet because it is only a fraction of the pot. As long as your pot odds are better than your card odds it is correct to call, even if you only have one or two outs. In no limit where you can adjust the size of your bet, the correct bet is to give your opponents worse pot odds then their card odds. No bot can ever master no limit, it's not a card game at all. it's a people game played with cards.
Re:No such thing in real gambling by Carewolf · 2015-01-08 12:11 · Score: 4, Informative

That is why it is limit poker. Besides all games have limits acknowledged or not.
Think of the robot as the house, it might not win everytime but it always wins in the long run.
Re:No such thing in real gambling by TedTschopp · 2015-01-08 12:16 · Score: 4, Informative

From the article: "So, is online poker now dead? Destined to be crushed by robots? Not quite: No limit Texas Hold'Em—in which any amount of bet in any dollar amount can be made—is by far the most popular, and while robots can play that game quite well, we're no where close to solving it. Limit poker has roughly 3 x 10^14 permutations; no limit poker has 3 x 10^48, which is many orders of magnitude harder to solve."

--
Fantasy remains a human right; we make in our measure and in our derivative mode... -- JRR Tolkien
There are already 4 computers that beat it easily by TrollstonButterbeans · 2015-01-08 12:32 · Score: 4, Funny

There is another computer that triples the bet every time it loses and leaves when it wins and says something about knowing when to walk away.

And then there is the computer that deals the cards and takes 4% of the pot every round, muttering "House always wins".

The third computer that can beat it keeps giving it complimentary alcoholic beverages until it gets a buffer overrun.

The fourth computer doesn't "believe in no win situations" and reprograms the computer to lose, but then Spock finds out and he gets kicked out of the Academy.

--
Priest: "Universe from nothing, no laws of physics, sped up time"+ huge discrepancies. Creationism? No. Big Bang Theory
Yes, but... by Anonymous Coward · 2015-01-08 12:41 · Score: 5, Informative

I'm a former professional poker player, now semi-pro and working again in the IT industry. In a game like poker, to "solve" the game, from a mathemartcal and game theory point of view, means to develop a strategy that is "unexploitable", which basically means "mistake free". If two game-theory perfect players were to play against each other, then their "expectation" would be zero, as if they were flipping a coin between each other. Neither would make a mistake, so only te randomness of the cards would determine the winner of a given hand. In the long run, both perfect players would win as often as they lose.
But in a real poker game, human players make lots of mistakes. A player who adjusts their strategy to exploit these mistakes will win vastly more than this (formerly theoretical) "perfect player". The game-theory optimal strategy is focused on not losing, rather than exploiting mistakes and winning the most.
So in an actual game, the expert human player will outperform the computer because the other humans in the game are exploitable.
In live play, especially in tournaments, computer solutions are used in poker. In particular, when the game is "heads up" (only two players), and the chips are not deep, which happens at the end of every tournament, then the correct strategy is to "jam or fold" all hands. The solution to this has been determined in a computer and top players have the table memorized.
If this subject interests you, I HIGHLY recommend "The Mathematics of Poker", by Chen and Ankenman.
Rock paper scissors by Michael+Woodhams · 2015-01-08 14:06 · Score: 4, Insightful

You can better understand what is going on by considering the much simpler game Rock paper scissors. 'Perfect' here basically means the strategy gives you the best possible worst case.
For RPS, the perfect strategy (using the term in the same sense as it is used for the poker bot) is to play completely randomly. There is no way to gain an edge over this strategy, no counter-strategy which will give you more than 50% chance of winning, even if you know your opponent's strategy. (In this case, there is also no strategy which will give you less than 50% chance of winning against the 'perfect' strategy.)
For the poker bot, there is no strategy that will give you greater than 50% chance of winning against it in a two player game. If you know its strategy perfectly (but of course you don't know its cards) the best you can do is to equal that 50% chance (which is what happens if it plays itself.) Unlike RPS, you can can lose to the perfect poker bot by playing poorly. Also, as noted in the article, the perfect poker bot always plays as if it were playing against perfect opposition. A good human player will fleece you faster then the perfect bot, because the human player will notice your peculiar imperfections and exploit them, choosing to play in a way which would be suboptimal against a perfect opponent, but superior against you.

--
Quattuor res in hoc mundo sanctae sunt: libri, liberi, libertas et liberalitas.