Slashdot Mirror


DeepMind Produces a General-Purpose Game-Playing System, Capable of Mastering Games Like Chess and Go Without Human Help (ieee.org)

DeepMind has created a system that can quickly master any game in the class that includes chess, Go, and Shogi, and do so without human guidance. "The system, called AlphaZero, began its life last year by beating a DeepMind system that had been specialized just for Go," reports IEEE Spectrum. "That earlier system had itself made history by beating one of the world's best Go players, but it needed human help to get through a months-long course of improvement. AlphaZero trained itself -- in just 3 days." From the report: The research, published today in the journal Science, was performed by a team led by DeepMind's David Silver. The paper was accompanied by a commentary by Murray Campbell, an AI researcher at the IBM Thomas J. Watson Research Center in Yorktown Heights, N.Y. AlphaZero can crack any game that provides all the information that's relevant to decision-making; the new generation of games to which Campbell alludes do not. Poker furnishes a good example of such games of "imperfect" information: Players can hold their cards close to their chests. Other examples include many multiplayer games, such as StarCraft II, Dota, and Minecraft. But they may not pose a worthy challenge for long.

DeepMind developed the self-training method, called deep reinforcement learning, specifically to attack Go. Today's announcement that they've generalized it to other games means they were able to find tricks to preserve its playing strength after giving up certain advantages peculiar to playing Go. The biggest such advantage was the symmetry of the Go board, which allowed the specialized machine to calculate more possibilities by treating many of them as mirror images. The researchers have so far unleashed their creation only on Go, chess and Shogi, a Japanese form of chess. Go and Shogi are astronomically complex, and that's why both games long resisted the "brute-force" algorithms that the IBM team used against Kasparov two decades ago.

50 of 124 comments (clear)

  1. Re:How much longer? by fluffernutter · · Score: 2

    Yep if my job was playing Go or Chess all day, I'd be pretty darn worried. What's next; Parcheesi? Tiddlywinks? Backgammon? Scary stuff.

    --
    Laws are rules for the court, but merely a bottom bar to hit for life. Think beyond laws in your actions always.
  2. Re:How much longer? by Waffle+Iron · · Score: 4, Funny

    What's next; Parcheesi? Tiddlywinks? Backgammon?

    Global Thermonuclear War

  3. Re:How much longer? by Joe_Dragon · · Score: 1

    what side do you want?

  4. Re:How much longer? by Joe_Dragon · · Score: 1

        1. Falken's Maze
        2. Black jack
        3. Gin rummy
        4. Hearts
        5. Bridge
        6. Checkers
        7. Chess
        8. Poker
        9. Fighter Combat
        10. Guerrilla Engagement
        11. Desert Warfare
        12. Air-To-Ground Actions
        13. Theaterwide Tactical Warfare
        14. Theaterwide Biotoxic And Chemical Warfare
        15. Global Thermonuclear War

  5. Seems to me if it can learn Chess and Go by oldgraybeard · · Score: 1

    Combat Assault, Logistics, Operations and Planning could be with in its capabilities. With some fine tuning.
    Most military systems are more complex and costly due to the human element and the protection of life. Removing humans and maybe one Abrams tank can be out fought by 100 trucks with auto guns/launchers? Just wondering?.

    AI wise! If it can be done! It will be done! By someone!

    Just my 2 cents ;)

    1. Re:Seems to me if it can learn Chess and Go by 110010001000 · · Score: 2

      That is what they are hoping suckers will believe: "seems to me if it does X and Y it can do Z". IBM learned the hard way with Watson it doesn't work that way in the real world, only in Marketing fantasy land.

    2. Re:Seems to me if it can learn Chess and Go by 93+Escort+Wagon · · Score: 4, Insightful

      Maybe we can take it a step farther - not fight the war at all, just simulate the fighting using computers. Then, depending on the enemy’s simulated tactics, we can calculate which of our citizens need to report to the disintegration chambers.

      --
      #DeleteChrome
    3. Re:Seems to me if it can learn Chess and Go by houghi · · Score: 1

      The reason they don these kind of things is not to play chess, but to calibrate and test the system. The games are complex enough, yet with a very simple outcome. Win, loose, draw.

      With other things, the outcome will not be so clear, e.g. stockmarket. It could make 5 times as much money as the average, yet you have no idea if that is the absolute maximum, or if better nummers where possible. Even if you look afterwards what the theoretical maximum could have been, one sale could have had an influence on some other stock and could have ment a higher profit.

      This is just a testing enviroment (with marketing attached to it).

      --
      Don't fight for your country, if your country does not fight for you.
    4. Re:Seems to me if it can learn Chess and Go by Joey+Vegetables · · Score: 2

      They should make a Star Trek episode with a plot something like that! Oh, wait . . . .

  6. An Xmas sale... maybe. by wolfheart111 · · Score: 1

    Cant wait to check it out, Im sure itll be expensive though.

    --
    [($)]
  7. Re:How much longer? by Alwin+Henseler · · Score: 1

    Global Thermonuclear War

    what side do you want?

    Whatever side (human, AI or otherwise) that's smart enough to play the winning move.

  8. Let me ask by 110010001000 · · Score: 1, Informative

    So lets ask a question: if DeepMind is useful WHY ARE THEY USING IT TO PLAY GO AND CHESS? Every "AI" system has this amazing power: the ability to play games. Not every game of course: Chess and Go. So friggin stupid. Yeah we get it, computers are good at playing Chess and Go. Amazing stuff.

    1. Re:Let me ask by psycho12345 · · Score: 2

      Read John Nash. Many real world systems can be modeled as games.

    2. Re:Let me ask by Solandri · · Score: 3, Interesting

      Chess and Go are deterministic. You can perfectly know the entire state of the game universe. And for a given system state, any one action always results in the exact same outcome, every time.

      Almost no systems in the real world are deterministic. That's why stochastic approaches to AI (develops a statistical model based on multiple repetitions - e.g. fuzzy logic, machine learning) have been much more successful in real world tasks.

  9. Please unleash it on the ... by 140Mandak262Jamuna · · Score: 1

    ... stock market. Especially the futures and options market.

    --
    sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
    1. Re:Please unleash it on the ... by 110010001000 · · Score: 1

      No. It was built to play Chess and Go. Have you noticed that all these "AI systems" play Chess and Go? Very odd.

  10. Re:Human have become the *WEAK LINK* by 110010001000 · · Score: 1

    "It will not take long for AI to branch out of simple game playing"

    Uh huh. When is this going to happen? Computers are good at playing games. We get it. Apparently the "AI researchers" aren't able to figure out any other applications.

  11. Re: Correct me, but there is no 'self-learning' by 110010001000 · · Score: 1, Informative

    BS. I know how NN actually work. It works nothing like neurons. It is just hype. It isn't like any brain (even yours).

  12. Re: Correct me, but there is no 'self-learning' by Anonymous Coward · · Score: 1

    Demi's Hassabis openly recognizes that DeepMind's stuff has major limitations in comparison to the human brain (or any animal really). He's done this in interviews and otherwise. AlphaZero however does just what I said. Reinforcement Learning is a branch of machine learning with somewhat understandable and reproducible results. DeepMind's currently best at it. I agree that RL has limitations, and I don't use it. I use primarily convolutional neural networks, since it's readily applicable to lots of imagery and is an easy way to automate some simple tasks.

  13. Re:How much longer? by 110010001000 · · Score: 1

    Go and Chess have very simple rulesets. They can be learned in 15 minutes. And computers are good at running programs with strict rulesets. Amazing stuff.

  14. Re: Correct me, but there is no 'self-learning' by 110010001000 · · Score: 1

    A NN is nothing like a brain. Just repeat after me: A NN is nothing like a brain. So why call it a NN? Marketing hype. At least in the 1970s they realized it wasn't anything like a brain, but now the Millenials have latched onto it. Now we have to hear about every "AI breakthrough" which is some machine has been taught to play Chess (or Go, or checkers, or recognizes cat videos). Great, let us know when it shows anything even vaguely intelligent.

  15. Re:How much longer? by 93+Escort+Wagon · · Score: 1

    BUELLER!!

    Oh, sorry, wrong movie.

    --
    #DeleteChrome
  16. Re:How much longer? by goose-incarnated · · Score: 2

    If it can win at Secret Hitler against humans, I'll start worrying. Think about how that can apply to social network bots, public comments, and graph search.

    Hilariously, that game tries to reinforce their point that if you're not liberal, you're fascist but instead shows that fascists are always the people claiming to be liberals.

    It's an eye-opener, but not for the reasons the game makers think it is.

    --
    I'm a minority race. Save your vitriol for white people.
  17. Re:How much longer? by turbidostato · · Score: 1

    "Whatever side (human, AI or otherwise) that's smart enough to play the winning move. "

    Define "winning move".

    I'll prefer the one that plays the move that makes me survive.

  18. Re:How much longer? by turbidostato · · Score: 1

    "It's really only shocking it took so long."

    No, it isn't. These kind of plays have a staggering number of combinations, so brute attack approaches are absurdly costly.

    And then, while it looks "easy" to optimize a program for a single game, this one works against all of them -that's quite high grade stuff.

  19. more games than you think by epine · · Score: 4, Informative

    Have you noticed that all these "AI systems" play Chess and Go? Very odd.

    Thank you for spamming the entire thread with your imperceptive and unenlightened comments.

    There's nothing odd about the choice of chess and Go whatsoever. Humanity has thousands of years of experience with these games. We know they aren't trivial, and we know they're not so complex that we can't understand progress, when we see it.

    Additionally, the large literature of expert games was a useful hand-rail between hand-crafted and fully autonomous.

    Quite apart from the neural network portion, Monte Carlo tree search (MCTS) is a fundamental algorithm in computer science, and this work demonstrates that MCTS is ready for prime time, having defeated from scratch exceptionally strong chess programs that have been painstaking hand-tuned over five decades and hundreds of man years. MCTS exists within the large and growing theory of multi-armed bandit problems. These are fundamental problems in many important industries (such as drug discovery, to name just one).

    Multi-armed bandit

    Recurrent self-learning is another important algorithm in computer science and machine learning.

    And finally, the neural network portion is far closer to the human brain than the vast majority of algorithms used in computing. Without any human instruction, these neural networks are learning to detect patterns of almost arbitrary complexity (so long as they seem to help in winning games).

    I was reading Galileo in the original last night (English translation, but his original prose). He knew about Kepler, but wasn't sold on elliptical motion. Then he carefully observes four previously unknown moons of Jupiter and correctly determines that they can't all be in circular orbits. The word he used (in English translation) was "oval". But he still didn't choose to accept Kepler's work (apparently, he felt that Kepler's ellipse and his oval were not the same thing).

    Galileo was a giant in the history of science. But still a little wooden headed on a few points, nonetheless.

    I think Odd Buster Spamalot is nuts to criticise Galileo for not being Newton. Only because Galileo sorted enough of the fundamentals out in the first place (about the proper concerns and methods of science), was it even possible for Newton to become Newton (and he knew it, himself, and he's famous for having said so).

    The computers we now apply to neural networks are roughly a factor of one billion times more powerful than the computers of the 1960s (thirty doublings over 45 years gets you there at the traditional pace of Moore's law).

    You could complain that neural networks are only good at this one thing, but actually no: they are now state of the art in image classification (IC), speaker-independent large-vocabulary continuous speech recognition (CSR), and machine translation (MT), as well. All of these endeavours also date back to the 1960s, and have thousands of man-years of deep research behind them. Then DNNs come along, finally on a sufficiently powerful computer, with a few small tweaks to the algorithms, and simply cleans up the state of the art with nothing more than a small team of graduate students doing a quick project within the scope of their degree program to push this along (the subsequent move to industrial scale was immediate and brisk). Traditional MT research programs would have hundreds of professional researchers, slaving away for decades, at least, and never accomplished as much.

    We're all of ten years away now from the day where no competent doctor ever reads an x-ray (or other radiological image) without computer assistance (definitely including a powerful NN component).

    Watson was a bit idiotic, right from the beginning. The problem was Jeopardy, itself, which was always rather facile in the nature of the questions asked, and fundamentally more a test of ridiculously wide and shallow

    1. Re:more games than you think by cascadingstylesheet · · Score: 1

      Have you noticed that all these "AI systems" play Chess and Go? Very odd.

      Thank you for spamming the entire thread with your imperceptive and unenlightened comments.

      There's nothing odd about the choice of chess and Go whatsoever. Humanity has thousands of years of experience with these games. We know they aren't trivial, and we know they're not so complex that we can't understand progress, when we see it.

      Impressively long comment, but it doesn't change the fact that what computers are good at is games - things with well defined, and complete sets of rules.

      Yes, you can gamify a lot of things (e.g. factories, to some extent), and very profitably, but you can't gamify all of existence. It still isn't general AI, which is the point.

  20. Re:How much longer? by cyrilc · · Score: 2

    Global Thermonuclear War

    ...or CalvinBall as Randall Munroe puts it in this XKCD:
    Game AIs

  21. Big Yawn by nagora · · Score: 1

    Yeah, they built a huge database of moves and then they read it back while playing. That's exactly how humans play these games, isn't it?

    For bonus points, they embody that database in a format that they can't interrogate in any useful way outside of actually playing the games.

    --
    "Encyclopedia" is to "Wikipedia" what "Library" is to "Some people at a bus stop"
  22. Re:Correct me, but there is no 'self-learning' by AC-x · · Score: 1

    From all the articles about this I've read, all they did was program the rules of the games (valid moves, win conditions etc.) and then let their machine learning algorithm teach itself actual play technique by playing games with itself.

    As it was never shown example games or taught existing techniques I think it's fair to say it taught itself how to play.

  23. Re:How much longer? by goose-incarnated · · Score: 3, Insightful

    If liberalism is a negative it is because the term is being used incorrectly, like favoring the "rights" of corporations and capital over those of persons, or preserving the "liberty" of one individual to deny the liberty of another.

    Then perhaps those who used to call themselves liberal (like myself) should distance themselves from a term that now means in favour of safe-spaces, authoritarianism, affirmative-action, etc.

    For example, I now am very careful to distance myself from any sort of toxic group, even if I think they "hijacked" the word for their own uses. The term now means "authoritarianism", whether one likes it or not. If someone doesn't want to be seen as expressing support for authoritarianism then perhaps they should distance themselves from the word "liberal".

    --
    I'm a minority race. Save your vitriol for white people.
  24. Re: Correct me, but there is no 'self-learning' by AC-x · · Score: 1

    You AI nutters thinks neural networks are some sort of magic. It isn't

    Well, it is "magic" in the sense that it works, as in this case where their machine learning algorithm taught itself to be the world's greatest chess/go/shogi player with nothing more than the rules of the game coded in to it, but no-one has any idea how or why it works.

    Whether that is a good or bad thing is yet to be decided...

  25. Re:supremacy from learning on million of games... by AC-x · · Score: 1

    I'm still going to count going from "given the rulebook" to "world's best player" in 9 hours as impressive

  26. Re:*rattles tits* by AC-x · · Score: 1

    And if you can compress "multiple human lifetimes" worth of practice into a few hours and get actual usable results from it that's rather interesting isn't it?

  27. Re:How much longer? by GuB-42 · · Score: 1

    As mentioned in TFS Go and Chess are complete information games, and that's what AlphaZero is good at. What AI is struggling with for now is incomplete information games, like Poker.

    And every job that I know off that isn't already being replaced by robots are VERY incomplete information games. If you want to convince yourself, just read a real life instruction manual or specification document, or even worse, ask a customer what he wants.

  28. Re: Correct me, but there is no 'self-learning' by dextarz · · Score: 1

    Well, not completely true... A NN is sort of limited what it could learn, a DNN on other hand is sort of unlimited in regards what it could be thought. So, it is matter of defining your problem in a structural way and a DNN can easily learn it. They are both very far form true AI. :(

  29. Re: Correct me, but there is no 'self-learning' by dromgodis · · Score: 1

    You AI nutters thinks neural networks are some sort of magic. It isn't.

    Agreed! ANN:s is just an algorithmic tool.

    NN have been around for many decades, and it is a dead end.

    No, it is a very useful tool for function approximation in some cases.

    However, in its current shape, it is most likely a dead end on the road towards AGI.

    A NN is nothing like the human brain.

    The human brain *is* a neural network. An *artificial* neural network, ANN, on the other hand, has to my knowledge not been implemented even close to similar to a human brain.

    Just mentioning it since you enjoy ranting on colloquial abuse of terms on the subject instead of focusing on the contents.

  30. Re:How much longer? by fluffernutter · · Score: 1

    I invite you to write the "rules" to any real world job in the space of the rules to Chess. I'm not talking about strategy, I'm talking about the rules.

    --
    Laws are rules for the court, but merely a bottom bar to hit for life. Think beyond laws in your actions always.
  31. But can it play... by EmagGeek · · Score: 1

    But can it play Tic Tac Toe?

  32. Re: How much longer? by Vintermann · · Score: 1

    Look up Libratus. Computers don't struggle with Poker any longer.

    --
    xkcd is not in the sudoers file. This incident will be reported.
  33. Where's the source? by sproketboy · · Score: 1

    ntr

  34. Re: Correct me, but there is no 'self-learning' by AC-x · · Score: 2

    What I mean is once the network is trained the "thought processes" that the network uses to come up with an answer are not understood.

    This seems to be especially true of image recognition networks, but as they don't talk about AlphaZeros' reasoning in the open access paper I'm inclined to think it's also true of their network.

    If you can link to a paper or post from an AI researcher that details how these kinds of networks are actually coming up with their answers I would be very interested to read it.

  35. Obligatory: Elon Musk warning of AI. (Listen!) by Qbertino · · Score: 1
    --
    We suffer more in our imagination than in reality. - Seneca
    1. Re:Obligatory: Elon Musk warning of AI. (Listen!) by DigiShaman · · Score: 2

      Musk is such a wanker. Any real AI that's self-sufficient will leave our planet where humans can't bother them. Say, Mars.

      That's right Musk, it will be AI colonizing Mars, not you.

      --
      Life is not for the lazy.
  36. Re:How much longer? by DigiShaman · · Score: 2

    The only "winning move" is not to play in the first place.

    --
    Life is not for the lazy.
  37. Re:supremacy from learning on million of games... by JMZero · · Score: 1

    Slashdot has become a very sad place. This used to be the kind of tech-frontier topic that might make for interesting conversation.

    Now, I can predict pretty accurately which people will honk out their same, sad, banal complaints for any given headline. I don't know what they get out of this - whether they actually feel that strongly about the subject, whether they like arguing or attention, or whether it's just a kind of reflexive contrarianism.

    What's clear is that none of the more frequent posters have any actual interest in learning about the subjects their crapping on. I've tried to engage these posters a few times, and it goes nowhere. Oh well, eventually I'll break the habit of reading comments here.

    --
    Let's not stir that bag of worms...
  38. Would anyone with QM and human brain by JS52649965 · · Score: 1

    biology knowledge care to speculate on the role of quantum entangled processes in human self awareness and intelligence? Would this imply that human self awareness and intelligence require quantum computing elements?

  39. Re: Correct me, but there is no 'self-learning' by neoRUR · · Score: 1

    Nice to see your stuck in the 60's with your thinking and knowledge. Get back to me when you actually can contribute to conversations instead of being so negative and limited all the time.

  40. Re:How much longer? by Paradise+Pete · · Score: 1

    Yep if my job was playing Go or Chess all day, I'd be pretty darn worried.

    Not your point, I know, but there's still an active worldwide chess community, and the best players make a quite comfortable living. Even a cell phone can easily beat them, but that hasn't mattered.

  41. Re: Correct me, but there is no 'self-learning' by K.+S.+Kyosuke · · Score: 1

    Neural Networks have literally been around since the 1960s and have very limited uses.

    Well, so do you, so where's the problem? ;)

    --
    Ezekiel 23:20