Slashdot Mirror


DeepMind's Go-Playing AI Doesn't Need Human Help To Beat Us Anymore (theverge.com)

An anonymous reader quotes a report from The Verge: Google's AI subsidiary DeepMind has unveiled the latest version of its Go-playing software, AlphaGo Zero. The new program is a significantly better player than the version that beat the game's world champion earlier this year, but, more importantly, it's also entirely self-taught. DeepMind says this means the company is one step closer to creating general purpose algorithms that can intelligently tackle some of the hardest problems in science, from designing new drugs to more accurately modeling the effects of climate change. The original AlphaGo demonstrated superhuman Go-playing ability, but needed the expertise of human players to get there. Namely, it used a dataset of more than 100,000 Go games as a starting point for its own knowledge. AlphaGo Zero, by comparison, has only been programmed with the basic rules of Go. Everything else it learned from scratch. As described in a paper published in Nature today, Zero developed its Go skills by competing against itself. It started with random moves on the board, but every time it won, Zero updated its own system, and played itself again. And again. Millions of times over. After three days of self-play, Zero was strong enough to defeat the version of itself that beat 18-time world champion Lee Se-dol, winning handily -- 100 games to nil. After 40 days, it had a 90 percent win rate against the most advanced version of the original AlphaGo software. DeepMind says this makes it arguably the strongest Go player in history.

23 of 133 comments (clear)

  1. ObSimpsons... by sconeu · · Score: 4, Funny

    I, for one, welcome our new Go-playing robotic overlords.

    --
    General Relativity: Space-time tells matter where to go; Matter tells space-time what shape to be.
  2. Re:so it got dumber? by angel'o'sphere · · Score: 2

    It feels nothing.
    It has no algorithms to 'feel' anything.

    --
    Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
  3. Re:This is cool, but I'll be more interested when. by ShanghaiBill · · Score: 4, Informative

    ... It can deal with hidden information.

    So you mean something like poker? AI beats pros at Texas-Hold'em.

  4. Re:STILL not a real AI by Anonymous Coward · · Score: 4, Funny

    That's just what an AI would say, Mr. Schumann

  5. imagine the possibilities ... by tommeke100 · · Score: 2

    They could have mined tons of bitcoins instead with that computing power.

  6. Re:so it got dumber? by shaitand · · Score: 4, Funny

    Come now, lets be honest, it enjoys playing with itself just like everyone else.

  7. Re:Drug Design and Climate models by MangoCats · · Score: 2

    Sorry, I disagree.

  8. Re:so it got dumber? by shaitand · · Score: 2

    That sentence isn't really clear. But the fact they are specifying "version of itself" suggests that they have a save state of the version that won those games but have not stopped improving AlphaGo AFTER it beat the world champion. It may well be able to beat that older version of AlphaGo 100% of the time but only be able to beat the latest and greatest AlphaGo 90% of the time. This is a fork, it might be so close as to be a major release revision number variation of the same software but it's had completely different nurture and experience and is a completely different mind than the first much like identical twins are different people.. creepy people yes, but distinct.

  9. Re:STILL not a real AI by MrDozR · · Score: 5, Insightful

    Not impressed, doesn't prove anything, and why should anyone even care?

    Maybe because they're not trying to prove anything? Maybe their actual goal is to improve general purpose algorithms by an iterative approach? Like it says in the article. Which you read of course.

  10. Re:Drug Design and Climate models by Anonymous Coward · · Score: 2, Insightful

    Though it's true that we're neural networks, we aren't the same neural networks as these are. Remember folks, "neural network" in the sense of AI is a marketing term, it does not in any way imply that it functions in a manner similar to how our brains work. Fact is, we have no idea how our brains work. We know what certain parts are responsible for, but no idea how they do it. If anybody claims to know, then please ask them to describe in detail how memory is encoded in our brains, and have them demonstrate by altering a memory in a predetermined way.

  11. Re:so it got dumber? by shaitand · · Score: 2

    There is also something to be said for diversity in those games. Beating AlphaGo doesn't automatically translate into beating the people AlphaGo's strategy defeated.

  12. Re:so it got dumber? by negrace · · Score: 2

    Ouch, I misread it. I got dumber instead :).

  13. Re:This is cool, but I'll be more interested when. by shaitand · · Score: 3, Insightful

    If it were all about tells there would be no online poker. Poker IS about reading other players but you can read a player from their play.

  14. Slashdot AI commentary summary... by JMZero · · Score: 5, Insightful

    I've decided that this accomplishment -- a dizzying milestone in artificial intelligence that not long ago was though impossible or at least decades away -- is actually meaningless and doesn't prove anything and they should clearly have been working on some other problem. I have no idea how their system works, but I'm confident that their approach is just "brute force" (or something, I clearly have no idea what even that means) and won't generalize to any "real" problem solving (with my definition of "real problem" subject to change without notice).

    I will only admit that any progress has been made towards artificial intelligence when computers perform exactly equivalent to humans in all tasks with no human intervention. I mean, I won't really, because I have weird quasi-spiritual hangups about believing computers can be intelligent, but that's where I'm putting the goal posts for now. Digital computers can't think, but I can because reasons. Free will or quantum mechanics or something else that I haven't thought about at all, probably.

    Also, cotton gins and blacksmiths, therefore computers will never take our jobs. Amen.

    --
    Let's not stir that bag of worms...
    1. Re:Slashdot AI commentary summary... by philmarcracken · · Score: 2

      > Also, cotton gins and blacksmiths, therefore computers will never take our jobs.

      They don't need to take all our jobs. Humans needs don't increase beyond shelter, security, food and water. The 'need' of entertainment is a bottomless pit in which you can throw money and out comes stories, sports, children and art, to name a few. Humans are definitely bright, but we are also routine and we like it when things at work don't change that much, so we can keep up.

      When the majority of us like average, easy to accomplish problems inside a set framework, and robots(now equipped with functional eyes thanks to neural networks) decimate that form of work, what do that very large number of people do?

      This isn't some off distant future. Since those jobs did vanish, people saw fit to add to their own value by increasing their education, but due to the above limitations on human needs, specialized problems are by nature a rare thing. This leads to a quick saturation of highly educated individuals still without positions. So they go into the only jobs on offer in the service sector, creating massive under-unemployment. All the rest that are still employed are doing the most vile thing I can fathom: creating work where there was none like renaming buildings, companies, projects. Shifting peoples seating. Sitting on old technology.

      At some point you have to ask, what the fuck are we doing?

    2. Re:Slashdot AI commentary summary... by Maxo-Texas · · Score: 4, Interesting

      Never forget that the many of the people who did lose their jobs to industrialization died homeless of exposure and starvation after being put down by the military. They were provided neither training nor jobs on the new machines (which was their true issue).

      Similarly, we could have a very rough 20 years where jobs are destroyed faster than they can be created and where workers over 50 (40?) can't afford train for the new jobs and there are more unemployed than society is willing to pay for (even tho our productivity is 100x what it was a hundred years ago so one worker should be able to completely support 99 unemployed with 200 square feet of living space and basic food).

      It's coming. It could be better but it's probably going to be bad. Possibly even "great depression" or "financial panic of 18xx" bad. (they had a lot of financial panics in the 1800s that were pretty terrible.).

      --
      She was like chocolate when she drank... semi-sweet at first and then increasingly bitter.
  15. Re:Shall We Play A Game? by mugurel · · Score: 2

    Although to me this is clearly a case of A.I., parent has a point that not all types of problems lend themselves to be treated as reinforcement learning tasks, which seems to have been the key to success in this case.

  16. Re:Proof of Global Warming! by HeckRuler · · Score: 2

    Don't be too hard on him, manufacturing this sort of outrage is a time-honored tradition and puts bread on a lot of tables.

    Outrage doesn't just grow on trees you know. Without his efforts the outrage deficit would lead to a world awash with harmony.

  17. Re:Can it be generalized? by phantomfive · · Score: 2

    The answer is no, and provably so, because it is not Turing complete.

    --
    "First they came for the slanderers and i said nothing."
  18. Re:an exciting first step by slew · · Score: 2

    The essence of intelligence is that it enables one to predict the outcome of a unique situation based upon an understanding of its essential elements.

    Starting with only the rules of Go, Zero explored a variety of combinations, learning that some were more likely to give a satisfactory result. It developed a sense of what types of moves are best. Thus, without playing or studying an infinite number of games it could know the type of move that should be best in each unique situation.

    Theoretically, a vast intelligence, given only the facts of the Big Bang, could anticipate most of the resulting evolution of our universe. Zero has taken the first small step.

    You are making quite a few assumptions. One, that somehow a "game" that has a goal (e.g., a "winner") is the same as predicted
    an open ended problem. Two: that somehow AlphaGoZero developed a "sense" of what types of moves are best.

    First, because of the limited rules and state space of the game Go, and the fact that there is a "winner", the Go universe is certainly closed and quite bounded.

    In contrast, the real universe has a much larger state space and the rules are unknown (although some approximate rules are known by the current state of physics, they are known to be somewhat inconsistent) so although perhaps a theoretically vast intelligence, given our *estimate* of the state of the universe at the *assumed* Big Bang could apply our current approximate rules of the universe and potentially extrapolate the evolution of an *idealized* universe, since we know our current rules are inconsistent, this would likely be equivalent to garbage-in, garbage-out.

    Similarly if AlphaGoZero wasn't aware of the full laws of physics of Go (say the 19x19 board size limitation, or perhaps was not taught the rule of Ko) and learned to play using simplified rules, it might not have learned anything essential about playing a "real" game of Go.

    Secondly, it is unclear if a "sense" of what types of moves are best is being learned (it may simply be better at out-computing humans and building/"learning" its own huge dictionary of game outcomes). I think AI will only truly be useful (and trusted) when it also learns how to "teach". Anecdotally, I often find out how much I know about a subject when I try to explain it to someone else. Until, I do that, I might be able to be okay at muddling/faking my way through it, but I know that I don't really know it until I can successfully explain it to someone else.

    FWIW, AI constructions like "GANs" (Generative Adversarial Networks), are a small step in this direction for certain things, but still have the potential loophole that it can "fake-it" as long as the discriminator can't be taught to detect it). Similarly, we can examine the moves that a program like AlphaGoZero makes and look at the probabilities that it assigns as part of it's MonteCarlo tree search before it moves, but that's like looking at an MRI of a Go Master and trying to learn what they are thinking when they made a certain move. By *humans* analysing the games that AlphaGoZero makes (and people have done this already) we can infer what it is trying to teach, but it will be much better when we figure out how to know for sure as integrated as part of its reinforcement learning.

  19. Re:so it got dumber? by mlyle · · Score: 2

    What happened was, Google made a version of AlphaGo that beat Lee Se-dol. Call this AlphaGo One. It won, but it was at least close.

    Then Google updated it and had it play lots and lots of top players, and it trounced them all. Call this AlphaGo Two.

    Then they did this new version, AlphaGo Zero. Zero, early in training, beat AlphaGo One 40-0. Late in training it defeated AlphaGo Two 90% of the time.

  20. Re:so it got dumber? by Kjella · · Score: 2

    This graphic paints a different picture. Three days to beat AlphaGo "Lee Sedol"-version, 21 days to beat AlphaGo "Master"-version that beat Ki Jie + 60 top pros, after 40 days AlphaGo "Zero@40" is now crushing the "Master" version by winning 90% of the time. I think that means the journalist got two things intermingled, "Zero@3" only surpassed the "Lee Sedol" version while "Zero@40" wins 100-0 against it. It's done both, but not at once.

    --
    Live today, because you never know what tomorrow brings
  21. This doesn't show we're winning at AI by thisisauniqueid · · Score: 2

    This doesn't show we are winning at creating AI. It simply shows that the game of Go is more tractable than we previously thought. Claims about the number of positions in Go being vastly greater than the number of atoms in the universe (something like the number of atoms squared) completely miss the point: this is a straw man argument for why algorithms weren't good at Go until recently, since (obviously) humans are not searching the entire space of all possible board positions either. It stands to reason that once a sufficiently flexible fuzzy hierarchical pattern matching algorithm were produced, it would be able to play Go much better than a human.