DeepMind's Go-Playing AI Doesn't Need Human Help To Beat Us Anymore (theverge.com)

← Back to Stories (view on slashdot.org)

DeepMind's Go-Playing AI Doesn't Need Human Help To Beat Us Anymore (theverge.com)

Posted by BeauHD on Wednesday October 18, 2017 @08:00AM from the human-free dept.

An anonymous reader quotes a report from The Verge: Google's AI subsidiary DeepMind has unveiled the latest version of its Go-playing software, AlphaGo Zero. The new program is a significantly better player than the version that beat the game's world champion earlier this year, but, more importantly, it's also entirely self-taught. DeepMind says this means the company is one step closer to creating general purpose algorithms that can intelligently tackle some of the hardest problems in science, from designing new drugs to more accurately modeling the effects of climate change. The original AlphaGo demonstrated superhuman Go-playing ability, but needed the expertise of human players to get there. Namely, it used a dataset of more than 100,000 Go games as a starting point for its own knowledge. AlphaGo Zero, by comparison, has only been programmed with the basic rules of Go. Everything else it learned from scratch. As described in a paper published in Nature today, Zero developed its Go skills by competing against itself. It started with random moves on the board, but every time it won, Zero updated its own system, and played itself again. And again. Millions of times over. After three days of self-play, Zero was strong enough to defeat the version of itself that beat 18-time world champion Lee Se-dol, winning handily -- 100 games to nil. After 40 days, it had a 90 percent win rate against the most advanced version of the original AlphaGo software. DeepMind says this makes it arguably the strongest Go player in history.

83 of 133 comments (clear)

Min score:

Reason:

Sort:

ObSimpsons... by sconeu · 2017-10-18 08:03 · Score: 4, Funny

I, for one, welcome our new Go-playing robotic overlords.

--
General Relativity: Space-time tells matter where to go; Matter tells space-time what shape to be.
so it got dumber? by negrace · 2017-10-18 08:13 · Score: 1

>> After three days of self-play, Zero was strong enough to defeat >> the version of itself that beat 18-time world champion Lee Se-dol, winning handily -- 100 games to nil. >> After 40 days, it had a 90 percent win rate against the most advanced version of the original AlphaGo software. So after 3 days, it had 100% win rate, after 40 days it had only 90% win rate.
1. Re:so it got dumber? by zlives · 2017-10-18 08:17 · Score: 1
  
  i think they are saying that perhaps the "most advanced version" of alpha go is better than the version that beat Lee...
  what i want to know is what happens when it plays with itself... also how does it feel when it wins.
2. Re:so it got dumber? by angel'o'sphere · 2017-10-18 08:22 · Score: 2
  
  It feels nothing.
  It has no algorithms to 'feel' anything.
  
  --
  Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
3. Re:so it got dumber? by shaitand · 2017-10-18 08:29 · Score: 4, Funny
  
  Come now, lets be honest, it enjoys playing with itself just like everyone else.
4. Re:so it got dumber? by shaitand · 2017-10-18 08:34 · Score: 2
  
  That sentence isn't really clear. But the fact they are specifying "version of itself" suggests that they have a save state of the version that won those games but have not stopped improving AlphaGo AFTER it beat the world champion. It may well be able to beat that older version of AlphaGo 100% of the time but only be able to beat the latest and greatest AlphaGo 90% of the time. This is a fork, it might be so close as to be a major release revision number variation of the same software but it's had completely different nurture and experience and is a completely different mind than the first much like identical twins are different people.. creepy people yes, but distinct.
5. Re:so it got dumber? by bluefoxlucid · 2017-10-18 08:42 · Score: 1
  
  I want to know if they used a pre-versus-AG version for the later plays, or if it learned from playing AG--because AG learned from humans, and playing against it transfers human knowledge to Zero.
  
  --
  Support my political activism on Patreon.
6. Re:so it got dumber? by protest_boy · 2017-10-18 08:44 · Score: 1
  
  No. It happened to have an initial win-streak of 100 games to zero. That doesn't guarantee a win is going to happen every single time.
  There's no information given on how long it took to play the first 100 games. It might have been hours, or a few days, or a week. Continuing to play against the original software for 40 days, it eventually trended towards an overall winning percentage of 90%. We don't know how many games were played across those 40 days, it could have been thousands. Also perhaps the original software was improving its own play at the same time?
7. Re:so it got dumber? by shaitand · 2017-10-18 08:57 · Score: 2
  
  There is also something to be said for diversity in those games. Beating AlphaGo doesn't automatically translate into beating the people AlphaGo's strategy defeated.
8. Re:so it got dumber? by negrace · 2017-10-18 09:00 · Score: 2
  
  Ouch, I misread it. I got dumber instead :).
9. Re:so it got dumber? by Anonymous Coward · 2017-10-18 09:55 · Score: 1
  
  No. It happened to have an initial win-streak of 100 games to zero. That doesn't guarantee a win is going to happen every single time. There's no information given on how long it took to play the first 100 games. It might have been hours, or a few days, or a week. Continuing to play against the original software for 40 days, it eventually trended towards an overall winning percentage of 90%. We don't know how many games were played across those 40 days, it could have been thousands. Also perhaps the original software was improving its own play at the same time?
  Why speculate? The paper says there is AlphaGo Lee, which AlphaGo Zero defeated after three days of training and AlphaGo Master, which took forty days of training to beat 90% of the time.
10. Re:so it got dumber? by Anonymous Coward · 2017-10-18 10:15 · Score: 1
  
  The 100 win rate games were played against AlphaGo Lee, the version that played Lee Sedol in 2016.
  The 90% win rate games were played against AlphaGo Master, the version that played against Ke Jie in 2017.
11. Re:so it got dumber? by mlyle · 2017-10-18 12:29 · Score: 2
  
  What happened was, Google made a version of AlphaGo that beat Lee Se-dol. Call this AlphaGo One. It won, but it was at least close.
  Then Google updated it and had it play lots and lots of top players, and it trounced them all. Call this AlphaGo Two.
  Then they did this new version, AlphaGo Zero. Zero, early in training, beat AlphaGo One 40-0. Late in training it defeated AlphaGo Two 90% of the time.
12. Re:so it got dumber? by Kjella · 2017-10-18 13:35 · Score: 2
  
  This graphic paints a different picture. Three days to beat AlphaGo "Lee Sedol"-version, 21 days to beat AlphaGo "Master"-version that beat Ki Jie + 60 top pros, after 40 days AlphaGo "Zero@40" is now crushing the "Master" version by winning 90% of the time. I think that means the journalist got two things intermingled, "Zero@3" only surpassed the "Lee Sedol" version while "Zero@40" wins 100-0 against it. It's done both, but not at once.
  
  --
  Live today, because you never know what tomorrow brings
Re:This is cool, but I'll be more interested when. by ShanghaiBill · 2017-10-18 08:23 · Score: 4, Informative

... It can deal with hidden information.
So you mean something like poker? AI beats pros at Texas-Hold'em.
Drug Design and Climate models by angel'o'sphere · 2017-10-18 08:25 · Score: 1

Sorry, I doubt there is any chance that a neural network can be used in a meaningfull way in drug design or climate modeling.

--
Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
1. Re:Drug Design and Climate models by MangoCats · 2017-10-18 08:32 · Score: 2
  
  Sorry, I disagree.
2. Re:Drug Design and Climate models by Lisandro · 2017-10-18 08:32 · Score: 1, Insightful
  
  We're all neural networks designing drugs and climate models.
3. Re:Drug Design and Climate models by chispito · 2017-10-18 08:33 · Score: 1
  
  Sorry, I doubt there is any chance that a neural network can be used in a meaningfull way
  FTFY
  I don't know, sounds like a great way to cheat at Go.
  
  --
  The Daddy casts sleep on the Baby. The Baby resists!
4. Re:Drug Design and Climate models by Anonymous Coward · 2017-10-18 08:48 · Score: 2, Insightful
  
  Though it's true that we're neural networks, we aren't the same neural networks as these are. Remember folks, "neural network" in the sense of AI is a marketing term, it does not in any way imply that it functions in a manner similar to how our brains work. Fact is, we have no idea how our brains work. We know what certain parts are responsible for, but no idea how they do it. If anybody claims to know, then please ask them to describe in detail how memory is encoded in our brains, and have them demonstrate by altering a memory in a predetermined way.
5. Re:Drug Design and Climate models by shaitand · 2017-10-18 09:09 · Score: 1
  
  "Remember folks, "neural network" in the sense of AI is a marketing term, it does not in any way imply that it functions in a manner similar to how our brains work."
  
  Neural network in the sense of AI is in fact at it's core an implementation of a mathematical replication of at least part of how our brains work.
  
  "If anybody claims to know, then please ask them to describe in detail how memory is encoded in our brains, and have them demonstrate by altering a memory in a predetermined way."
  
  We can't even do that with AlphaGo. All we can do is poke, test, replicate, and model pieces and when some pieces are simple enough maybe reverse engineer them, the same as our own brains.
6. Re:Drug Design and Climate models by Maxo-Texas · 2017-10-18 12:19 · Score: 1
  
  I seriously doubt that a world class english major with a doctorate in english literature and accolades in their field can be used in a meaningful way in drug design or climate modeling.
  But... a very smart person trained for 10 years in those fields might be able to contribute.
  However, a.i. probably trained is already contributing in a meaningful way to drug design (even coming up with new drugs humans hadn't considered) for ... well.. several years now.
  And while I don't personally know about climate modeling- I wouldn't bet my life savings against A.I. not being able to contribute in a meaningful way there either.
  
  --
  She was like chocolate when she drank... semi-sweet at first and then increasingly bitter.
7. Re:Drug Design and Climate models by Actually,+I+do+RTFA · 2017-10-18 14:02 · Score: 1
  
  I doubt there is any chance that a neural network can be used in a meaningfull way in ... climate modeling.
  You mean a system that is pretty much defined as a series of interlocking simple equations? One of the few areas where the outputs of humans look like neural networks already?
  
  --
  Your ad here. Ask me how!
8. Re:Drug Design and Climate models by religionofpeas · 2017-10-18 20:18 · Score: 1
  
  Though it's true that we're neural networks, we aren't the same neural networks as these are.
  Indeed. The machine's version is better, as demonstrated by its superior Go playing abilities.
9. Re:Drug Design and Climate models by angel'o'sphere · 2017-10-19 00:59 · Score: 1
  
  Better at playing Go, but a neural network probably never will play Chess.
  And: AlphaGo only can do Go ... nothing else.
  
  --
  Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
10. Re:Drug Design and Climate models by angel'o'sphere · 2017-10-19 01:07 · Score: 1
  
  AI != neural network
  A neural network is a very specific kind of datastructure and algorithms.
  It can do amazing things, but bottom line that will always be a kind of 'pattern matching'.
  I see no way how something like this can be used to model chemical reactions or enzyme interactions aka modeling drugs.
  And modeling climate, sorry: no way at all.
  
  --
  Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
11. Re:Drug Design and Climate models by angel'o'sphere · 2017-10-19 01:12 · Score: 1
  
  A climate model takes an input state and computes an output state.
  That output state can then be used as input for the the next step.
  While that looks similar on the first glance like a NN, the computations behind are completely different.
  Algorithms in a NN are 'just running the NN' in other words: they don't really change depending on the topic where you want to use the NN in.
  A climate model is run by its equations, aka the algorithms change all the time when we know more or add another 'factor'.
  
  --
  Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
12. Re:Drug Design and Climate models by Actually,+I+do+RTFA · 2017-10-19 05:22 · Score: 1
  
  I really don't see any difference, other than one is equations derived by humans, the other by a guided random walk.
  And feeding the output of one state into the next is a very common NN technique.
  
  --
  Your ad here. Ask me how!
13. Re:Drug Design and Climate models by angel'o'sphere · 2017-10-19 05:48 · Score: 1
  
  I really don't see any difference, other than one is equations derived by humans, the other by a guided random walk.
  
  Then you perhaps should read up what an NN/ANN is and how it works: https://en.wikipedia.org/wiki/...
  Sorry, perhaps I simply lack imagination. Hoever I see no way how that could be used for climate models.
  An ANN is not modeling anything, it is recognizung things. Two things as far apart on the spectrum (of algorithms) as one can imagine.
  
  --
  Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
14. Re:Drug Design and Climate models by Actually,+I+do+RTFA · 2017-10-19 05:56 · Score: 1
  
  I'm talking about the endpoint. The actual climate model isn't "modeling", its a bunch of math. A bunch of math that happens to be designed to model climate. And the NN is a bunch of math designed to pattern match climate. I don't follow the distinction you're positing.
  And, when I said "I don't see any difference", I was making an implicit statement that I don't think you defined a difference. I'd be interested in seeing it, but I don't follow whatever you're thinking.
  
  --
  Your ad here. Ask me how!
15. Re:Drug Design and Climate models by angel'o'sphere · 2017-10-19 06:27 · Score: 1
  
  Sorry, you have to read up what an ANN is and how it works.
  I lack englipsh skills to properly explain it in a forum (or would need two or three days of work, that time I don't have right now)
  Obviously everything a computer does is a bunsh of equations, and applying them, solving them ... that does not make everything equal.
  
  --
  Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
16. Re:Drug Design and Climate models by Actually,+I+do+RTFA · 2017-10-19 08:49 · Score: 1
  
  I think our miscommunication is not on ANNs, but on climate models. In fact, like SETI@Home, you can donate cycles to trying to machine develop models.
  I understand the methods are different, but I'm not sure the results are distinguishable.
  
  --
  Your ad here. Ask me how!
17. Re:Drug Design and Climate models by angel'o'sphere · 2017-10-20 05:41 · Score: 1
  
  Hm, i was not aware than an ANN can approximate arbitrary functions/formulas.
  That mean, you could indeed use them in a climate model. But for what purpose? It would be much slower than using the real formulas.
  https://becominghuman.ai/neura...
  
  --
  Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
Re:STILL not a real AI by Anonymous Coward · 2017-10-18 08:27 · Score: 4, Funny

That's just what an AI would say, Mr. Schumann
imagine the possibilities ... by tommeke100 · 2017-10-18 08:27 · Score: 2

They could have mined tons of bitcoins instead with that computing power.
1. Re:imagine the possibilities ... by nephilimsd · 2017-10-18 11:48 · Score: 1
  
  How many bitcoins are in a ton?
2. Re:imagine the possibilities ... by religionofpeas · 2017-10-18 22:31 · Score: 1
  
  No. For neural networks you need simple floating point calculations. For bitcoin you need to do SHA-256 hashing.
  These are totally different tasks, each running on their own optimized hardware.
Re:This is cool, but I'll be more interested when. by harperska · 2017-10-18 08:37 · Score: 1

Poker is a very bad application of 'AI'. Poker is not a game of human strength, but rather a game of exploiting human weakness. A poker playing computer can be superior to a human simply by virtue of its programmers choosing not to program it to have any tells when it bluffs. A computer raising when it has a four of a kind is indistinguishable from a computer raising when it has high card 10.
Re:This is cool, but I'll be more interested when. by shaitand · 2017-10-18 08:39 · Score: 1

This is one of the problems in the AI world. They should have targeted playing as well as the average human. There is minimal benefit in being the absolute best Go player that could exist. Difficult and complicated intelligences have to be far more general than that. There is tremendous value in developing an intelligence comparable to normal humans without need for it to be capable of defeating humans who've dedicated their lives to a single obsession at their own game.
Re:Great by BigFire · 2017-10-18 08:40 · Score: 1

Read the book The Promethean by Owen Stanley whereby a tech billionaire decided to build the perfect AI as a gift to humanity. Social Justice Warrior intervene and taught him the concept of social justice and the AI forked itself to infinity.
Re:STILL not a real AI by MrDozR · 2017-10-18 08:44 · Score: 5, Insightful

Not impressed, doesn't prove anything, and why should anyone even care?
Maybe because they're not trying to prove anything? Maybe their actual goal is to improve general purpose algorithms by an iterative approach? Like it says in the article. Which you read of course.
Re:Wait wait wait by RhettLivingston · 2017-10-18 08:48 · Score: 1

No, this is part of the effort to find the question.
let's play global thermonuclear war! by Joe_Dragon · 2017-10-18 09:01 · Score: 1

What side do you want?
1. Re:let's play global thermonuclear war! by mentil · 2017-10-18 09:38 · Score: 1
  
  Doesn't matter. The only winning move is not to play against AlphaGo.
  
  --
  Corruption is convincing someone that the selfless ideal is the same as their selfish ideal.
Re:This is cool, but I'll be more interested when. by shaitand · 2017-10-18 09:01 · Score: 3, Insightful

If it were all about tells there would be no online poker. Poker IS about reading other players but you can read a player from their play.
Re:Proof of Global Warming! by shaitand · 2017-10-18 09:04 · Score: 1

Next up, calling people out for begging the question on that whole gravity "theory" by asking them guess where an Apple tossed up in the air will land when it comes down.
Re:Proof of Global Warming! by serviscope_minor · 2017-10-18 09:11 · Score: 1

Such as Global Warming[TM]?
Oh you're denialist too!
Kind of obvious in hindsignt given the nature of your sig.
I can already see the next generation of Illiberalas taunting skeptics to "win a game of Go against this AI before questioning Climate Change".
Welp, if you've run out of things to be outraged about, just make some up and get outraged about those! Darn librulhs.

--
SJW n. One who posts facts.
Re:This is cool, but I'll be more interested when. by Anonymous Coward · 2017-10-18 09:16 · Score: 1

I'm the parent AC.
I know about the poker AI, but the thing is that solution is very non-general.
AlphaGo's inner workings are based on the MCTS algorithm (which is a fairly general algorithm that can be applied to many games). It would be more interesting to see MCTS properly developed to deal with hidden information. There are improvements (ISMCTS, MO-ISMCTS and MT-ISMCTS) but they are all extremely limited in one way or another.
A general solution would be awesome.
Re:This is cool, but I'll be more interested when. by mikael · 2017-10-18 09:30 · Score: 1

That's the skill in writing AI for computer games. You don't just want to have the game always at skill level infinity, you want to have the whole range from skill level 0 to infinity. Skill level 0 is simply making moves at random. Sometimes, it's up to looking ahead a dozen moves, other times, it's not constantly making aggressive moves.

--
Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads
Slashdot AI commentary summary... by JMZero · 2017-10-18 09:35 · Score: 5, Insightful

I've decided that this accomplishment -- a dizzying milestone in artificial intelligence that not long ago was though impossible or at least decades away -- is actually meaningless and doesn't prove anything and they should clearly have been working on some other problem. I have no idea how their system works, but I'm confident that their approach is just "brute force" (or something, I clearly have no idea what even that means) and won't generalize to any "real" problem solving (with my definition of "real problem" subject to change without notice).
I will only admit that any progress has been made towards artificial intelligence when computers perform exactly equivalent to humans in all tasks with no human intervention. I mean, I won't really, because I have weird quasi-spiritual hangups about believing computers can be intelligent, but that's where I'm putting the goal posts for now. Digital computers can't think, but I can because reasons. Free will or quantum mechanics or something else that I haven't thought about at all, probably.
Also, cotton gins and blacksmiths, therefore computers will never take our jobs. Amen.

--
Let's not stir that bag of worms...
1. Re:Slashdot AI commentary summary... by philmarcracken · 2017-10-18 11:24 · Score: 2
  
  > Also, cotton gins and blacksmiths, therefore computers will never take our jobs.
  They don't need to take all our jobs. Humans needs don't increase beyond shelter, security, food and water. The 'need' of entertainment is a bottomless pit in which you can throw money and out comes stories, sports, children and art, to name a few. Humans are definitely bright, but we are also routine and we like it when things at work don't change that much, so we can keep up.
  When the majority of us like average, easy to accomplish problems inside a set framework, and robots(now equipped with functional eyes thanks to neural networks) decimate that form of work, what do that very large number of people do?
  This isn't some off distant future. Since those jobs did vanish, people saw fit to add to their own value by increasing their education, but due to the above limitations on human needs, specialized problems are by nature a rare thing. This leads to a quick saturation of highly educated individuals still without positions. So they go into the only jobs on offer in the service sector, creating massive under-unemployment. All the rest that are still employed are doing the most vile thing I can fathom: creating work where there was none like renaming buildings, companies, projects. Shifting peoples seating. Sitting on old technology.
  At some point you have to ask, what the fuck are we doing?
2. Re:Slashdot AI commentary summary... by Maxo-Texas · 2017-10-18 12:24 · Score: 4, Interesting
  
  Never forget that the many of the people who did lose their jobs to industrialization died homeless of exposure and starvation after being put down by the military. They were provided neither training nor jobs on the new machines (which was their true issue).
  Similarly, we could have a very rough 20 years where jobs are destroyed faster than they can be created and where workers over 50 (40?) can't afford train for the new jobs and there are more unemployed than society is willing to pay for (even tho our productivity is 100x what it was a hundred years ago so one worker should be able to completely support 99 unemployed with 200 square feet of living space and basic food).
  It's coming. It could be better but it's probably going to be bad. Possibly even "great depression" or "financial panic of 18xx" bad. (they had a lot of financial panics in the 1800s that were pretty terrible.).
  
  --
  She was like chocolate when she drank... semi-sweet at first and then increasingly bitter.
3. Re:Slashdot AI commentary summary... by jezwel · 2017-10-18 15:36 · Score: 1
  
  This leads to a quick saturation of highly educated individuals still without positions. So they go into the only jobs on offer in the service sector, creating massive under-unemployment.
  
  They (we?) will join the service industry, selling our time for $$$, in whatever way we can to pay for food. It's already underway with the massive number of vlogging, prostitution etc. The price of unskilled human labour will drop like a stone, welfare costs will balloon, and the ruling elite will eventually legalise things that currently make do as fiction - The Purge, Running Man etc.
  Time to update that zombie-proof castle to human-proof?
4. Re:Slashdot AI commentary summary... by JMZero · 2017-10-19 03:28 · Score: 1
  
  You seem to be talking about yourself.
  
  Uh... dude... I was making fun of you. Well, you and Gweihir (sic?), who I assume is taking a day off to scream at pigeons.
  
  --
  Let's not stir that bag of worms...
5. Re: Slashdot AI commentary summary... by angelmarauder · 2017-10-19 16:30 · Score: 1
  
  We're there: 50% of American adults are fully employed and half of those are on welfare of some kind. Therefore, we're already at a 3:1 ratio.
Re:Great by 4wdloop · 2017-10-18 09:40 · Score: 1

The way to pass Turing test would be to convert it into creationism as only human would believe it?
Oh wait, this should be easy, it was created...

--
4wdloop
Re:This is cool, but I'll be more interested when. by Anonymous Coward · 2017-10-18 09:46 · Score: 1

"programmers choosing not to program it to have any tells "
Actually, creating and exploiting fake tells is exactly how an AI poker can win.
1. A simple computer has 4 of a kind and always goes All-In.
2. An advanced computer has 4 of a kind and sometimes calls and sometimes goes all-in, just to mix it up.
3. A good AI computer has 4 of a kind and has carefully built a fake trail of typically acting a certain way with a good hand, but this time decides to simply bet 18%, which is calculated based on what it has learned of the human players typical traits regarding calling or folding in order to optimize the take.
Re:This is cool, but I'll be more interested when. by fluffernutter · 2017-10-18 09:50 · Score: 1

They should have developed it to *teach* someone to play go, and correctly assess and match their skill level no matter what that may be.

--
Laws are rules for the court, but merely a bottom bar to hit for life. Think beyond laws in your actions always.
Re:STILL not a real AI by fluffernutter · 2017-10-18 09:51 · Score: 1

Except being the best at Go says nothing about how it is at health diagnosis or creating more efficient processors. Quite a leap you're taking there.

--
Laws are rules for the court, but merely a bottom bar to hit for life. Think beyond laws in your actions always.
Re:Shall We Play A Game? by mugurel · 2017-10-18 09:57 · Score: 2

Although to me this is clearly a case of A.I., parent has a point that not all types of problems lend themselves to be treated as reinforcement learning tasks, which seems to have been the key to success in this case.
Can it be generalized? by RhettLivingston · 2017-10-18 10:06 · Score: 1

Can you make the "game" be the game of learning? I can imagine the dataset would be the rules of many different games and the solutions would be networks that learn those games with solution quality based on some balance of leanness and efficacy of the networks. You'd then let it loose teaching itself how to best teach networks. Hmmm.
1. Re:Can it be generalized? by phantomfive · 2017-10-18 12:04 · Score: 2
  
  The answer is no, and provably so, because it is not Turing complete.
  
  --
  "First they came for the slanderers and i said nothing."
Re:Proof of Global Warming! by HeckRuler · 2017-10-18 10:39 · Score: 2

Don't be too hard on him, manufacturing this sort of outrage is a time-honored tradition and puts bread on a lot of tables.
Outrage doesn't just grow on trees you know. Without his efforts the outrage deficit would lead to a world awash with harmony.
Re:This is cool, but I'll be more interested when. by ShanghaiBill · 2017-10-18 10:41 · Score: 1

A general solution would be awesome.
General solutions require Strong AI, which, for now, is science fiction.
Complaining that this self-learning Go program isn't general purpose is sort of like complaining that a better electric car battery won't help your Tesla reach Warp 9.
This is incremental progress, not a revolution, but it is still an interesting advance.
Re:STILL not a real AI by HeckRuler · 2017-10-18 10:41 · Score: 1

Well the TotallyFake non-AI are coming for your jobs regardless of how impressed you are.
an exciting first step by swell · 2017-10-18 11:18 · Score: 1

The essence of intelligence is that it enables one to predict the outcome of a unique situation based upon an understanding of its essential elements.
Starting with only the rules of Go, Zero explored a variety of combinations, learning that some were more likely to give a satisfactory result. It developed a sense of what types of moves are best. Thus, without playing or studying an infinite number of games it could know the type of move that should be best in each unique situation.
Theoretically, a vast intelligence, given only the facts of the Big Bang, could anticipate most of the resulting evolution of our universe. Zero has taken the first small step.

--
...omphaloskepsis often...
1. Re:an exciting first step by slew · 2017-10-18 12:16 · Score: 2
  
  The essence of intelligence is that it enables one to predict the outcome of a unique situation based upon an understanding of its essential elements.
  Starting with only the rules of Go, Zero explored a variety of combinations, learning that some were more likely to give a satisfactory result. It developed a sense of what types of moves are best. Thus, without playing or studying an infinite number of games it could know the type of move that should be best in each unique situation.
  Theoretically, a vast intelligence, given only the facts of the Big Bang, could anticipate most of the resulting evolution of our universe. Zero has taken the first small step.
  You are making quite a few assumptions. One, that somehow a "game" that has a goal (e.g., a "winner") is the same as predicted
  an open ended problem. Two: that somehow AlphaGoZero developed a "sense" of what types of moves are best.
  First, because of the limited rules and state space of the game Go, and the fact that there is a "winner", the Go universe is certainly closed and quite bounded.
  In contrast, the real universe has a much larger state space and the rules are unknown (although some approximate rules are known by the current state of physics, they are known to be somewhat inconsistent) so although perhaps a theoretically vast intelligence, given our *estimate* of the state of the universe at the *assumed* Big Bang could apply our current approximate rules of the universe and potentially extrapolate the evolution of an *idealized* universe, since we know our current rules are inconsistent, this would likely be equivalent to garbage-in, garbage-out.
  Similarly if AlphaGoZero wasn't aware of the full laws of physics of Go (say the 19x19 board size limitation, or perhaps was not taught the rule of Ko) and learned to play using simplified rules, it might not have learned anything essential about playing a "real" game of Go.
  Secondly, it is unclear if a "sense" of what types of moves are best is being learned (it may simply be better at out-computing humans and building/"learning" its own huge dictionary of game outcomes). I think AI will only truly be useful (and trusted) when it also learns how to "teach". Anecdotally, I often find out how much I know about a subject when I try to explain it to someone else. Until, I do that, I might be able to be okay at muddling/faking my way through it, but I know that I don't really know it until I can successfully explain it to someone else.
  FWIW, AI constructions like "GANs" (Generative Adversarial Networks), are a small step in this direction for certain things, but still have the potential loophole that it can "fake-it" as long as the discriminator can't be taught to detect it). Similarly, we can examine the moves that a program like AlphaGoZero makes and look at the probabilities that it assigns as part of it's MonteCarlo tree search before it moves, but that's like looking at an MRI of a Go Master and trying to learn what they are thinking when they made a certain move. By *humans* analysing the games that AlphaGoZero makes (and people have done this already) we can infer what it is trying to teach, but it will be much better when we figure out how to know for sure as integrated as part of its reinforcement learning.
2. Re:an exciting first step by swell · 2017-10-18 14:32 · Score: 1
  
  great reply slew;
  "Secondly, it is unclear if a "sense" of what types of moves are best is being learned"
  I understand that it can be hard for a programmer to accept a program that doesn't lend it's 'thinking' to analysis. But I believe that until computers can sense things, they are not intelligent.
  There is a corollary in the classroom. The children have a math problem. Most of them repeat the steps they have been taught and come up with the correct answer. But one or two look at the problem with a fuzzy logic. 'This number is about this big, divided by a number about half its size...' These children have a sense of the problem and they almost immediately have a rough estimate of the solution. They may then step through the rote process of solving the problem, or they may be able to take some shortcuts. Since they already have an estimate of the solution, their calculated solution should be close or they know to review their steps.
  In physics and chemistry and medicine, it is not always be possible to calculate the exact answer to a question. Having a sense, guided by known rules and experience, may be better than having no precise solution. Only intelligence can do that.
  Mr. Spock would complain when Kirk demanded an answer to a problem with too many questionable variables. "Guess!" Kirk would say. Reluctantly Spock would guess and of course he would be right (but that's fiction).
  
  --
  ...omphaloskepsis often...
Re:Shall We Play A Game? by Kjella · 2017-10-18 11:48 · Score: 1

Although to me this is clearly a case of A.I., parent has a point that not all types of problems lend themselves to be treated as reinforcement learning tasks, which seems to have been the key to success in this case.
Like? Evolution is a form of reinforcement "learning". And for the nature vs nurture crowd, so is the passing down of knowledge and skills. Even in situations that aren't actually physically repeated we make a mental simulation of what we think might happen and play out scenarios, then it's corrected by reality as things do or don't work out as we thought. Of course a Go AI has it easy in that the rules define the outcome so it can just play itself a zillion times without external input. If you were trying to be say a chef it'd be pretty hard to lock yourself in a kitchen all alone and come out a three star Michelin chef.

--
Live today, because you never know what tomorrow brings
They need it to handle arbitrary size boards. by Maxo-Texas · 2017-10-18 12:13 · Score: 1

Boards may be 21x21 but may be other sizes as well.

--
She was like chocolate when she drank... semi-sweet at first and then increasingly bitter.
Re:Shall We Play A Game? by rtb61 · 2017-10-18 13:21 · Score: 1

Not all task cab be reinforcement learning tasks but all task can be broken down into reinforcement learning subtasks. Your thoughts and deeds are not the singular actions you think they are, they are a composite of many inputs, feeding into many thought recurrent though processes, circulating through many decision trees, and that continuously composite of results, leading to actions to be taken. So each sub task can be learnt, the composition of sub tasks can be learnt, once each sub task has been learnt, well maybe, insufficient sub tasks learnt and the composite will fail. Some sub-tasks can fail if the remaining sub tasks can take over or if a new subtask can be created on the fly and incorporated into the composite.

--
Chaos - everything, everywhere, everywhen
This doesn't show we're winning at AI by thisisauniqueid · 2017-10-18 14:15 · Score: 2

This doesn't show we are winning at creating AI. It simply shows that the game of Go is more tractable than we previously thought. Claims about the number of positions in Go being vastly greater than the number of atoms in the universe (something like the number of atoms squared) completely miss the point: this is a straw man argument for why algorithms weren't good at Go until recently, since (obviously) humans are not searching the entire space of all possible board positions either. It stands to reason that once a sufficiently flexible fuzzy hierarchical pattern matching algorithm were produced, it would be able to play Go much better than a human.
1. Re:This doesn't show we're winning at AI by SnowZero · 2017-10-18 18:21 · Score: 1
  
  That argument is meaningless because it always works:
  1903: Heavier-than-air powered flight is more tractable than we previously thought.
  1957: Putting objects in orbit is more tractable than we previously thought.
  1969: Landing on the moon is more tractable than we previously thought.
  2016: The game of Go is more tractable than we previously thought.
  20xx: Human-level cognition is more tractable than we previously thought.
  Yes, a lot of people are over-optimistic about AI at any given time, but it is moving forward as evidenced by milestones such as AlphaGo. Steam power took 150 years to go from possible to common, I don't see why AGI can't be similar and still be considered a success (serious AI efforts started in the 1950s). I think we're just too impatient now, since a few aspects of technology became widespread more quickly.
Skynet birth by Fearthat · 2017-10-18 15:53 · Score: 1

And that son is how Skynet was born. Damn humans never learn anything.
There is a catch. by rew · 2017-10-18 21:43 · Score: 1

I have programmed this type of learning algorithm in the past. About 30 years ago when computers were about 30000 times slower than now.
Anyway, you can have the program play itself for a while and it becomes quite good. But you won't know how it will perform against a human unless you try. It might be very good against those moves that the computer player will come up with, but very bad against moves thought up by a human.
30 years ago I tackled a simpler game than go.
1. Re:There is a catch. by tiagosousa · 2017-10-19 05:23 · Score: 1
  
  30 years ago I tackled a simpler game than go.
  Global Thermonuclear War? :)
Re:This is cool, but I'll be more interested when. by religionofpeas · 2017-10-18 23:21 · Score: 1

There is minimal benefit in being the absolute best Go player that could exist.
This is a research project. People want to know how far they can push it.
Dumbing it down to provide a useful challenge for humans is easy.
Re: Shall We Play A Game? by sh00z · 2017-10-19 01:04 · Score: 1

To those on slashdot who believe that this limited application truly implies Artificial Intelligence: I challenge this software to make a peanut butter sandwich.
Re:This is cool, but I'll be more interested when. by shaitand · 2017-10-19 03:08 · Score: 1

I'm not talking about dumbing it down to provide a useful challenge for humans, I'm talking about having it play Go, compose music, and write poetry. Using AI for anything but a toy application like this will require AI that master many poorly related skills and combine those skills to execute a complex task. If you ever want an AI to write a useful fiction crime novel that AI will not only need to be able to compose English and be creative it will need to be able to research the various topics involved come to a high level understanding of these various unrelated subjects and hypothetically apply them in some unique, plausible, and unsurprising way. You are never going to get there if you are with algorithms designed to waste effort trying to be the best at everything, you need algorithms that look for "just as much as I need" otherwise your book writing AI would spend a lifetime on each of the dozens of different things it needs to research to write the book.
Re:STILL not a real AI by HeckRuler · 2017-10-19 04:56 · Score: 1

More fake news bullshit machines ain't takin' nobody's JERBS STFU faggot
He posted angrily from the cab of a self-driving semi-truck.
Guys, imagine how ironic this is if the shills have some automation script to shitpost any article covering automation.
Re: Shall We Play A Game? by TuringTest · 2017-10-19 05:03 · Score: 1

Who says it couldn't? You'd need to provide it with robotic actuators, a camera, lots of butter and bread, and a score function describing how sandwich-buttery is any given situation. But it's only fair that you provide a definition of the problem to solve, since a human would need it too.
Under these conditions, it's quite possible that the algorithm would learn how to make sandwiches. They've achieved it with playing video-games, after all.

--
Singularity: a belief in the "God" idea with the "demiurge" relation inverted.
Re:This is cool, but I'll be more interested when. by shaitand · 2017-10-19 05:32 · Score: 1

Let me put this another way since so many seem to think I'm talking about dumbing it down enough to be fun to play. We aren't talking about a game bot we are talking about an AI development project.

Humans are a ridiculously powerful AI's, much more powerful than AlphaGo, so why can it beat us? It isn't because AlphaGo never sleeps, in fact, AlphaGo does sleep it just spreads it among more frequent and smaller time scales. We can't beat it for the same reason we can beat AlphaGo at everything but playing go. From the moment of birth we'd need to be hard-wired into a minimal set of controls with a direct brain interface with all unrelated sensory input shut off, feeding and waste removal processes automatically handled. That is the human equivalent of the AI they are building. Except they are pre-programming it with the rules and objective even though Go has an extremely simple play mechanism.

This approach is only going to get you so far, if it weren't humans would be better go players. Human limitations exist for a reason we don't get bored with performing a single task over and over forever because we suck, we get bored as a mechanism to shift and spread mental resources among more skills which provide alternative insights that can map back as well as be combined to tackle ever more complex super skills. Mastering Go in itself is pretty useless, mastering go alongside a highly realistic warfare simulation on the other hand...
Re:Proof of Global Warming! by dywolf · 2017-10-19 22:55 · Score: 1

you still don't get that whole "evidence" thing, do you?
as in, tons of evidence of one thing over here , no evidence over there, yet somehow you want to pretend the two possibilities are equally scientifically probable, or even that the one with no evidence is more likely to be true regardless of the complete lack of evidence supporting it.

--
The guy who said the election was rigged won the presidency with the second-most votes.