Poker Pros Win Against AI, But Experts Peg Match As Statistical Draw
hypnosec writes with some positive news for Skynet watchers, in that humans still have at least a slight lead against the AIs who might one day imprison us in energy-harvesting goo tanks, or at least beating us in Las Vegas. The two-day poker showdown involving four of the world's top (human) players and a Carnegie Mellon University AI program called Claudico saw the professionals win, after several days of heads-up no-limit Texas Hold'em. "Despite the win, the poker players' $732,713 collective lead over Claudico wasn't quite large enough to attain statistical significance, experts have said. This means that the results can't be accepted as scientifically reliable thereby indicating that the "Brains Vs. Artificial Intelligence" competition effectively ended in a statistical tie." On the other hand, the computers sure got over what looked like a rout by the humans.
Halfway through the competition, the four human pros had a cumulative lead of 626,892 chips. Though much could change in the week remaining, a lead of around 600,000 chips is considered statistically significant.
or
"Despite the win, the poker players' $732,713 collective lead over Claudico wasn't quite large enough to attain statistical significance"
Maybe it means a 600k chip lead per individual head-to-head. Or it could mean FUCK THS COMPUTARZ ROOL
of the massacre of mankind.
So after about 13*8*4=416 man-hours of play, they can't say who's better? Was "no-limit" the reason why? I say the test is badly designed if 99.9% of the time is wasted betting pocket change, and the outcome is determined by just 10 big bets.
I think this is the optimal outcome for the scientists. They can show: 1) We have done something (i.e. the poker bot is not too far from the best human players so our time has not been wasted). 2) There is more to be done (i.e. give us more money to look at this).
Also, I do think it is a quite impressive outcome.
Seems more like applied statistics to me.
The losing team of every competition likes to say things like 'statistical draw'.
Let's see, over two days that's about 400 hands per hour, or one hand every 8.5 seconds, assuming the players take no breaks for sleeping, eating, etc. Yet they still trounced the computer, and it's claiming "no statistical significance"? I think this is one of the greatest achievements of man versus machine in history!
Have you read my blog lately?
Then can I have the $732,713?? I mean really, isn't this the very point of AI? To be less than just a statistical automaton and actually be able to beat a human at real life in something? AI should be about winning the money, that's it.
Laws are rules for the court, but merely a bottom bar to hit for life. Think beyond laws in your actions always.
This is just the most recent paper explaining this once again. The fickle P value generates irreproducible results:
http://www.ncbi.nlm.nih.gov/pubmed/25719825
Let's see, ONE university (Granted, it is one of the top five most prestigious in its field) given a small handful of decades is able to statistically just-about-beat people.
This does nothing to assuage my fears of Skynet. Imagine what a few dozens of universities, combined with the financial and industrial might of the US MIC, could do in a time span 60 or 70 years?
Where the card sharps win by just a small enough margin to con the mark into a comeback game...... Don't fall for this simple ruse Claudico!!!!!!!!!!!!!!!!!!!!!!
How much do you want to bet that if the computer beat the humans by $0.50, the "But Experts Peg Match As Statistical Draw" part would be left out of the headline?
Look, it is two thousand freaking fifteen. This is an article from some site called "Techie News" being re-reported at Slashdot. Can we please get a little ridicule of this supposed binary concept of "statistical significance" ? It would take us one or two sentences to tell us the actual numbers involved--the expected value, expected deviation, margin of error, confidence level, etc.
And then when all's said and done, if indeed the level of significance was too low (e.g. p too high), maybe we could get a Bayesian or two in here to criticize the traditional 5% value is being arbitrary and tell us all a little about the Frequentist vs. Bayesian rivalry in statistics that persists to this day? (Obligatory XKCDs: https://xkcd.com/1132/ , https://xkcd.com/882/)
Wonder what will the results looks like of the match is against novices. Will the AI still lose a bit, lose badly or win a lot?
Comment removed based on user account deletion
Awesome. I want to play poker with these guys... we can just draw right at the beginning, and they'll give me 700 grand, right?
The AI didn't exploit human weakness (failed bluffs) but the humans did. This explains why the AI got better as the weaker human players left the arena -- The winning humans didn't beat the AI, they beat other humans to outcompete the AI.
http://poker.srv.ualberta.ca/