A Poker-Playing Robot Goes To Work for the Pentagon (wired.com)
In 2017, a poker bot called Libratus made headlines when it roundly defeated four top human players at no-limit Texas Hold 'Em. Now, Libratus' technology is being adapted to take on opponents of a different kind -- in service of the US military.
From a report: Libratus -- Latin for balanced -- was created by researchers from Carnegie Mellon University to test ideas for automated decision making based on game theory. Early last year, the professor who led the project, Tuomas Sandholm, founded a startup called Strategy Robot to adapt his lab's game-playing technology for government use, such as in war games and simulations used to explore military strategy and planning. Late in August, public records show, the company received a two-year contract of up to $10 million with the US Army. It is described as "in support of" a Pentagon agency called the Defense Innovation Unit, created in 2015 to woo Silicon Valley and speed US military adoption of new technology.
[...] Sandholm declines to discuss specifics of Strategy Robot's projects, which include at least one other government contract. He says it can tackle simulations that involve making decisions in a simulated physical space, such as where to place military units. The Defense Innovation Unit declined to comment on the project, and the Army did not respond to requests for comment. Libratus' poker technique suggests Strategy Robot might deliver military personnel some surprising recommendations. Pro players who took on the bot found that it flipped unnervingly between tame and hyperaggressive tactics, all the while relentlessly notching up wins as it calculated paths to victory.
From a report: Libratus -- Latin for balanced -- was created by researchers from Carnegie Mellon University to test ideas for automated decision making based on game theory. Early last year, the professor who led the project, Tuomas Sandholm, founded a startup called Strategy Robot to adapt his lab's game-playing technology for government use, such as in war games and simulations used to explore military strategy and planning. Late in August, public records show, the company received a two-year contract of up to $10 million with the US Army. It is described as "in support of" a Pentagon agency called the Defense Innovation Unit, created in 2015 to woo Silicon Valley and speed US military adoption of new technology.
[...] Sandholm declines to discuss specifics of Strategy Robot's projects, which include at least one other government contract. He says it can tackle simulations that involve making decisions in a simulated physical space, such as where to place military units. The Defense Innovation Unit declined to comment on the project, and the Army did not respond to requests for comment. Libratus' poker technique suggests Strategy Robot might deliver military personnel some surprising recommendations. Pro players who took on the bot found that it flipped unnervingly between tame and hyperaggressive tactics, all the while relentlessly notching up wins as it calculated paths to victory.
Sorry, but that's a misleading summary for technical news. Libratus did some pretty good playing, but saying it beat four top human opponents is extremely misleading.
What it did do was play thousands of rounds one on one. With exceedingly large bankrolls compared to the size of the big blind that were reset after every hand. In other words, it never had to play with short stack, never had to worry that the opponent couldn't cover it's own bets, and that really long shots (which are easier for a computer to calculate) can be made to pay off if hit because of the size of the bankrolls were much larger than usual for the size bets being made. And was only one on one, so it had a minimum of unknown information, betting and bluffing. Hold 'em, so 5 common cars and only two hold cards it doesn't know. And thousands of rounds each, so any small edge would have time to multiply.
Now, it did do this against four top players (each against their own copy of Libratus). It really was quite an accomplishment. But it's not nearing the general poker imperfect-information feint-analyzing multiple-unknowns that the summary makes it out to be. Come on /., be News for Nerds. Get the tech details right.
LITTLE GIRL: But which cookie will you eat FIRST? C. MONSTER: Me think you have misconception of cookie-eating process.