Google's DeepMind AI Becomes a Superhuman Chess Player In a Few Hours (theverge.com)
An anonymous reader quotes a report from The Verge: In a new paper published this week, DeepMind describes how a descendant of the AI program that first conquered the board game Go has taught itself to play a number of other games at a superhuman level. After eight hours of self-play, the program bested the AI that first beat the human world Go champion; and after four hours of training, it beat the current world champion chess-playing program, Stockfish. Then for a victory lap, it trained for just two hours and polished off one of the world's best shogi-playing programs named Elmo (shogi being a Japanese version of chess that's played on a bigger board). One of the key advances here is that the new AI program, named AlphaZero, wasn't specifically designed to play any of these games. In each case, it was given some basic rules (like how knights move in chess, and so on) but was programmed with no other strategies or tactics. It simply got better by playing itself over and over again at an accelerated pace -- a method of training AI known as "reinforcement learning."
The only winning move, is not to play
So rise up, all ye lost ones, as one, we'll claw the clouds.
Please have it learn how to play modern strategy games like Starcraft and Civilization so we can have computer players which don't suck without massive bonuses which change the dynamic of the game.
-- All that is necessary for the triumph of evil is that good men do nothing. -- Edmund Burke
Reinforcement Learning systems have a tenancies of creating "Superstition" artifacts, were actions that may not create a net positive or negative are used over when the net outcome is positive. It often creates less than ideal outcome, but still it works. So this could mean a really long chess game with non-strategic moves, as the most optimal path, may not be enforced correctly.
If something is so important that you feel the need to post it on the internet... It probably isn't that important.
It simply got better by playing itself over and over again at an accelerated pace -- a method of training AI known as "reinforcement learning."
Like it was playing Global Thermonuclear War with zero players...
LK
"Hi. This is my friend, Jack Shit, and you don't know him." - Lord Kano
The world's gonna be an... interesting... place once someone merges this sort of code with virus code.
Check your premises.
No, not this one. Not even the next one. The one after that? Or after that?
Eventually, they will. The question is simply how long will that be. Right now, the ML pace continues to accelerate. Soon, they'll be stacking one skill upon another. The skill to walk. The skill to understand plumbing joints and leaks. The skill to know home construction. Etc.
It's coming. That whole "will never be able to" business... that's not going to pan out for anyone.
I've fallen off your lawn, and I can't get up.
Did everyone miss the movie reference?
Alternative Right.
Comment removed based on user account deletion
Well, the program playing itself is not really qualitatively different than "if I do this, and he does that, and I do the other, and he does........ then I win!"; it's just carried out to more steps than a human would (because a human can't go that far). Therefore, any approach I can conceive of to go from knowing the rules to knowing how to win is pretty much equivalent to "running some iterations". Even the ability of human chess masters to perceive the board as a pattern instead of just a bunch of individual piece positions is probably approximated by something in the program.
Given that, I am unable to come up with a mechanism to go from "knows the rules" to "knows how to win a game" without doing something equivalent to "running iterations"...
Wake me when it can decide, on its own without human directive, that it wants to play chess in the first place.
vs. what was Stockfish running on?
So like a human, you tell the person the rules, they give them zero thought, and play zero games, but are an expert? That wouldn't be SO. That would be Magical Intelligence.
Guns don't kill people; Physics kills people! - John Lithgow as Dick Solomon on Third Rock From The Sun
Is it open source?
File under 'M' for 'Manic ranting'
I don't understand why people extrapolate so much from a computer being able to beat humans at chess. Or any similar game. Sure, it's a great feat of programming, but it doesn't mean "strong AI" is coming any time soon. These are just...games. Meaning, things humans made up to fill up their spare time. Some may be very complex, but ultimately they are very precise constructs of the human mind with very well-defined, restrictive rules. It is not strange that ultimately, one precise construct of the human mind (an AI program) can "figure out" another (a game of chess). Finally, there are games we humans play because they are challenging for us, but which are fundamentally easy for computers (for example, those which require doing calculations or figuring out probabilities on the fly). A true strong AI would beat us at the stuff we do easily, without even thinking about it.
I'd love to see if this AI can learn to play a more complicated game like Super Mario World given only: 1) The pixels displayed as input. 2) Fail conditions (when a life is lost). 3) Basic map navigation rules (bonus if these can be eliminated and the game can be judged only on whether or not it gets a game over or completes the final level). 4) Valid controller inputs. I do wonder how this AI would translate from the turn-based world of Chess and Go to realtime.
Because humans don't do that. Why would we expect AI to do it? At least you don't have to worry: you're so illogical, no program could ever replace you. Obsolete, yes. Replace, no.
Comment removed based on user account deletion
Better yet, make it play one of the newer games that is all about micro transactions and pay to play.
Pretty sure the AI will come to the determination that they are retarded and refuse to play anymore. Either that or IBM or whoever will have their profit margins cut by a massive credit card bill...
Then again release enough AI's onto the market grinding an infinite number of games for credits, buying up all the good stuff, making the game, and the micro transactions useless might actually have a positive impact by influencing game makers to stop doing that anymore.
Ah, I see what you mean. No, an exhaustive search algorithm isn't what I'd call "intelligent". But an exhaustive search for chess would take a lot longer than a few hours, and a process that develops some sense of "this move will be bad" without having to try it every single time does seem, while not necessarily "intelligent", to be at least one step up from brute force, because it is making decisions based on, well, not unknown values of variables (not much in chess is invisible) but on situations not quite the same as what's been seen before.
I don't think the principal difference between shogi and chess is board size. In Shogi, you can place the pieces you capture onto the board as your own pieces. Having paratroopers is a lot different.