Slashdot Mirror


Artificial Intelligence Bests Humans At Classic Arcade Games

sciencehabit writes The dream of an artificially intelligent computer that can study a problem and gain expertise all on its own is now reality. A system debuted today by a team of Google researchers is not clever enough to perform surgery or drive a car safely, but it did master several dozen classic arcade games, including Space Invaders and Breakout. In many cases, it surpassed the best human players without ever observing how they play.

27 of 148 comments (clear)

  1. Breaking news! by penguinoid · · Score: 5, Funny

    Someone made a computer that's really good at reaction time, and at calculating trajectories.

    --
    Don't waste your vote! Vote for whoever you want, unless you live in a swing state it won't matter anyways
    1. Re:Breaking news! by Dutch+Gun · · Score: 5, Insightful

      Seriously, is there any doubt that a computer can easily defeat a human at a computer game that involves 95% pure reflexes and 5% strategy?

      The article shows a picture of Breakout, and tends to focus on the wrong things entirely... especially the title, trumping that "computers can beat humans". It's fairly impressive that computers can learn the rules of a simple videogame on their own and perform well, but beating humans is not exactly an apples to apples comparison, because while we can formulate strategies to maximize points, we're also prone to making simple mistakes due to our much poorer reflexes and coordination. So AI has a massive advantage with precision reflexes and calculations that it can make much faster than humans.

      Some of my previous jobs involved programmed AI game opponents for action games. As anyone who's faced an aim-bot knows, there's no real challenge for computers to perform many of the tasks humans find difficult, like putting a bullet through a moving target's forehead. I actually had do a lot of extra work to programmatically replicate the difficulties humans face when aiming at a moving target. However, collecting and processing global environmental knowledge and formulating complex strategies based on that knowledge is extremely difficult. That's why we typically build a lot of invisible hints into the environment itself for the benefit of AI, such as pathfinding-specific structures, or dynamic flags that signal potential rewards or danger. Even today, in many strategy games that involve complex ruleset (meaning brute force calculations can't work as well), the computer opponents inevitably have to cheat in order to compete with even modestly skilled players.

      Early videogames have very few of these sorts of challenges because of their largely static environments and the basic nature of the games. For the most part, you just need to formulate a few simple rules for an optimal victory condition, and when combined with a computer's incredible performance, you can easily trounce the best human players, simply because a computer never gets distracted, tired, or makes silly mistakes in judgement.

      Again, I'm not dissing the work the researchers did, which I found to be impressive, but the article and summary seem to be missing the point entirely by comparing them to human scores. It's fairly obvious that once a computer learns how to play with an optimal strategy, it's an absolute given that they'll score better than humans ever could.

      --
      Irony: Agile development has too much intertia to be abandoned now.
    2. Re:Breaking news! by Lennie · · Score: 2

      I actually didn't see this story as news, I had seen a video of there work last year from before they were bought by Google.

      That same video was linked from the article:
      https://www.youtube.com/watch?...

      What makes this more interresting is, they didn't tell the AI how to play the game, they let the AI learn to play the game on it's own.

      I think one of the things this what makes this also interresting is how few times the AI needed to learn the game and then also be good at it.

      --
      New things are always on the horizon
    3. Re:Breaking news! by Ralph+Wiggam · · Score: 2

      At high levels, Jeopardy is all about who presses the button first. Watch a Tournament of Champions. All 3 people buzz in for pretty much every question.

      The trick is who can parse what the "answer" is really asking, recall the fact required, and then buzz in before the other players can do all 3 things. If Watson can do those things faster than a person, it won fair and square. Just being able to parse the "answer" was an incredibly impressive achievement.

    4. Re:Breaking news! by Comboman · · Score: 2

      The human players get the clue in text format also (printed on the monitor wall). Alex Trebek reading the clue aloud is strictly for the benefit of the mouth-breathers watching at home.

      --
      Support Right To Repair Legislation.
    5. Re:Breaking news! by ShanghaiBill · · Score: 2

      ... and then buzz in before the other players can do all 3 things.

      This is not correct. It is NOT who buzzes first. Alex reads the question and then a light comes on. If you push the button before the light comes on, then you are locked out for a quarter of a second. So the trick is to push the button the instant the light flashes on. It is pure response time. Of course a computer is going to be better at that. That is 99% of the reason Watson won.

    6. Re:Breaking news! by Maxo-Texas · · Score: 2

      That depends on the incentives the AI has.

      In this case, it appears it has incentives to gain the highest possible score as quickly as possible.

      In this case, tunneling and bouncing off the top wall better matches those goals.

      I read about his before and the computer starts out not knowing where the score is-- it has to learn which area is score and then do random things with the game until something succeeds at causing the score area to go up... and then optimize for high score and high speed.

      That sure sounds like learning to me.

      --
      She was like chocolate when she drank... semi-sweet at first and then increasingly bitter.
    7. Re:Breaking news! by narcc · · Score: 2

      The bit about the score is key here. It's essentially no different than any other learning algorithm as it does not discover on its own that the goal is to achieve a high score. The computer vision part is neat, but nothing new, and ultimately does nothing to differentiate this from the zillion other similar projects as it is only used to find the score! Countless hobbyists and researchers have made ANN's and Genetic algorithms which produce similar results, both the computer vision part and the game-playing part.

      The program does not, in any way, "study a problem and gain expertise all on its own". It's pure click-bait. I'm ashamed to have fallen for that particular trap.

    8. Re:Breaking news! by AmiMoJo · · Score: 3, Informative

      The key achievement here is that the AI was able to learn the game on its own in a relatively short time. Imagine if you had an industrial robot that could learn how to do tasks on its own and then modify its behaviour if the situation changed, and generally cope with a variety of situations.

      Also, they called it DQN which means "dumbass" in Japanese, so bonus points for that.

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
    9. Re:Breaking news! by Bonzoli · · Score: 2

      Yes, it has a great language/text codex, and the training process for watson is a scheduled batch job oddly enough. But once its trained its good at finding answers. I was disappointed they didn't have a good voice to text system running for the game.
      I've seen it in the lab at ibm and asked a lot of questions. Its a language codex which is quite good, sitting infront of a ranking database of information.
      The demo they had was medical journal based, and seemed quite useful for doctors that are looking for answers which are current.

    10. Re:Breaking news! by thedonger · · Score: 2

      I figured out an endless pattern to Atari 2600 Space Invaders and PacMan, high score stuff. Was thrilled and disappointed to read about my solution in some Atari mag several years after my discovery.

      I figured I had beat the computer and was disappointed when I wasn't asked to help defeat Xur and the Ko-Dan Armada.

      Fixed that for you.

      --
      Help fight poverty: Punch a poor person.
  2. Strategy games? by gurps_npc · · Score: 2, Insightful
    No, merely reaction time.

    In other words, this is an example of good image recognition software, that's it.

    Show me a game that beat a human on a strategy based game, then you have something.

    --
    excitingthingstodo.blogspot.com
    1. Re:Strategy games? by msobkow · · Score: 2

      It's not even a good example of image recognition, because the images to be processed don't have to be "understood" to be used. On top of that, the graphics of the games in question were very simple and primitive compared to what image recognition software deals with.

      Add to that the repetitive nature of old video games that were based on 99% reaction time and 1% strategy, and you can just flat out colour me "unimpressed" with this "research".

      Back in University, my AI project was a game player (a simple strategy game whose name I forget.) As it turned out, the entire game mapped down to a pre-determined set of decisions, so after playing only a dozen games, the "AI" would win every time, and that was just with a simple weighted-algorithm system of play. Some problems are just eminently suited to "AI" is all that I ended up learning from that project, but it was a useful lesson on the difference between optimizing a decision tree and actual "intelligence".

      Until someone comes up with a system that can deal with bad and erroneous inputs as well as humans, I will continue to be unimpressed. Yet at the same time, I don't consider it necessary for a computer to be able to think and understand per se to be considered an "intelligence." It just needs to be able to make decisions and choose between alternatives faster than it's human counterparts in order to be useful, and to reduce the number of errors compared to it's human counterparts.

      I have little faith in "neural networks." They place too much emphasis on emulating simple biological components and not enough on the "art" of understanding. Neural networks basically take the approach that "if it's big enough, we'll maybe get lucky and it will start to think." That's not "solving a problem." That's "playing the lottery."

      --
      I do not fail; I succeed at finding out what does not work.
  3. I'm one step ahead by Tablizer · · Score: 4, Funny

    Well, I outsourced my Donkey Kong playing before bots took it over, so there!

  4. Re:meh by Stormy+Dragon · · Score: 3, Funny

    Wasn't the AI complaining about the ending of Mass Effect 3 pretty much the plot of Mass Effect 3?

  5. Re:Lower the bar further. by Austerity+Empowers · · Score: 2

    Don't even need that. I made pong for the TI-82 instead of paying attention in calculus class in high school, my "AI" could not be beat. Because it's really easy to do things when you can precisely calculate vectors and positions... It's actually harder to have something that makes human-like mistakes.

    I don't think actual breakout or space invaders would be significantly harder.

  6. Don't underestimate humans by K3rn3lPan1c · · Score: 4, Funny

    Artificial Intelligence is no match for natural stupidity.

  7. Re:meh by vux984 · · Score: 5, Insightful

    This just in: Even in simplistic AAA games with bots, the AIs are better than human players, we have to dumb them down to keep the game fun.

    First the prime challenge in the games you are talking about is lining up a crosshair with a pixel with a mouse and selecting fire.

    If AI's had to do that they might have some difficulty. In practice the so-called AI bots already know where you are, and could keep their weapon lined up on your noggin through half the map without the need for line of sight. Tthey also get to target and fire at me without having to diddle around with a mouse or looking at the screen to see where I am.

    Get a bot to actually play such a game with the same UI and world view I have (keyboard and mouse and what they can see on screen and hear on the speakers) and they tend to be quite abysmal.

    Second, switch over to RTS games... and there the only way to give the AI any challenge is to stack the deck in its favor... whether its StarCraft or Supreme Commander or Wargame: Red Dragon. Or in a 4X game like Masters of Orion etc... we've yet to see an AI even really challenge a human being without giving it scripts to follow and extra resources to use.

  8. This guy did it in 2013... by EmeraldBot · · Score: 2

    This article reminds of this guy who did something similar a while back.

    --
    "Set a man a fire, he'll be warm for the rest of the night. Set a man afire, he'll be warm for the rest of his life."
  9. Re:No AI is a Wizard by narcc · · Score: 2

    Damn, dude, you must be deaf, dumb, and blind.

  10. Re: Lower the bar further. by AK+Marc · · Score: 2

    Only if we put the heat exchanger on the Moon.

  11. Re:meh by Anonymous Coward · · Score: 2, Informative

    Seriously, you don't sound like you know what you're talking about.

    Nor do you. Your AI doesn't have the same control interface as human players. An AI would never miss a stationary target unless you programmed it to or are really bad at programming. Humans do miss them. Humans have to deal with moving a mouse an exact distance. AIs don't. They simply move the exact distance required. No need to deal with mouse physics, no need to deal with running out of desk space, no need to estimate the amount of movement, etc... They don't need think which key does what then press the key, they just call the proper function. The UIs are completely different even if both have the same available actions.

    If you're giving an AI millions of training iterations, you should be giving the human a similar level of training. The AIs trained on top players only learn what they've seen. They don't handle other things very well. Look at just about every released RTS game. When you play a round long enough, the AI mostly stops everything or ends up in a 96% predictable repeating pattern.

    Almost no AIs work well in teams. Humans do far, far better.

    I'm a RTS AI researcher. It sounds like you do something similar, so you should know that much "advanced academic AI" is also poor. Most papers restrict their game environments so far that their results are almost meaningless. Sure they can calculate optimal attack orders for a 6x6 battle, but there's not enough CPU or memory left to do anything else.

    Your AI sounds like a normal case-based, reinforcement leaning AI that trains against itself. What game/paper is it?

  12. Reminds me of a programming competition .... by Ihlosi · · Score: 3, Interesting
    Reminds me of a competition to program a computer player for the classic Asteroids a few years ago.

    The best entries, however, didn't rely on AI, but on the fact that the RNG of the arcade game isn't random. Once the Asteroids-bot figured out the internal state of the RNG, it could basically use hyperspace to make targetted jumps (and never one that lead to the destruction of the ship), shoot at asteroids that haven't appeared yet and various other tricks. It was very impressive to watch one of these bots in action.

  13. Re:meh by Cytotoxic · · Score: 4, Interesting

    When Id released Quake C, this "you have to dumb down the AI" idea became very apparent. Someone wrote a replacement AI for the enemies that allowed them to learn and communicate. They had to follow the rules and physics of the game - so if they were within earshot they could communicate your position to each other, otherwise they couldn't.

    The first couple of interactions would be pretty easy kills. Then one enemy would see that you were better armed and run away. That would be the last enemy you would see for a while. Then, when you were in a vulnerable position, the entire population of the level would ambush you in a coordinated attack. Game over. They were way, way, way too smart to be beaten. It was pretty fun to explore their learning capabilities and watch how they would win. But it didn't make for engaging gameplay, unless you are a complete masochist.

    The same AI was applied to deathmatch player bots. They had no prior knowledge of the level, or strategies for playing the game. The first few kills were very easy as they figured out what to do. But as they learned your tendencies, they would very quickly evolve into a circle-strafe master. They also learned the map layout pretty quickly, including drop sites and periods for weapons and health. They would then time their circle-strafe to always be on the spawn site immediately as the health or ammo spawned. They would invariably win against even the best human players by monopolizing all of the supplies and winning a war of attrition. Very impressive to watch.

    This "AI" program was very rudimentary, and it was already much too difficult for human players, despite being limited to the same "in game" knowledge and input capabilities as the human players. It makes perfect sense that the challenge and complexity of programming the AI for games mostly revolves around "dumbing down" the AI in a way that makes the enemies challenging and interesting, but also the right amount of "beatable".

  14. Re:meh by vux984 · · Score: 2

    The same AI was applied to deathmatch player bots. They had no prior knowledge of the level, or strategies for playing the game. The first few kills were very easy as they figured out what to do. But as they learned your tendencies, they would very quickly evolve into a circle-strafe master. They also learned the map layout pretty quickly, including drop sites and periods for weapons and health. They would then time their circle-strafe to always be on the spawn site immediately as the health or ammo spawned. They would invariably win against even the best human players by monopolizing all of the supplies and winning a war of attrition. Very impressive to watch.

    Yes, and it gives another example of the issue. The AI has access to a perfect clock, and perfect spatial awareness, and has perfect control over its movement, sees everything in its field of view no matter how briefly it sees it...

    It can run and jump a maze of catwalks over lava with perfect reliability - BACKWARDS. It can do all that and land on a specific set of coordinates where it knows supplies are going to respawn the millisecond it respawns, because it can time to the millisecond when it last picked it up.

    It's cool, and it is devilishly hard to compete against, but its not good "AI". Its not out thinking its opponents, its merely taking full advantage of its machine attributes.

    For example, In quake I might know the location and approximate respawn time of the quad damage. I would certainly plan my route to be in the area when I was expecting it to respawn... or perhaps plan to have a sight line to it to pick off someone else making a run at it. I have a rough idea how long it takes me to get from where I am to the Quad damage spawn point from various places on the map.

    A better player than me, will have smaller error bars on all of those things. A top player smaller still. But the "AI" it has perfection. It's not coming up with a BETTER strategy than me, its simply perfect at executing the same strategy because it can run the level backwards, and end up precisely where it wants to end up precisely when it needs to.

    Making the AI human-like; imparts the same reaction times, the same imprecise lossy recollection of the map (I can run some maps backwards, but its usually at least a little bit of steer-by-wall... so if its catwalks over lava or freeform jump pads on a quake 3 open level I can't do that without looking. I can maintain approximate respawn cycles for a number of spawn points, and calculate approximate lenghts of time to get from point A to B... but not for every spawn point in the level, and not to within a millisecond,

    Restricting the AI to human ability is not "dumbing it down" that's just putting us both on equal footing. Go ahead and make the AI as good as it can possibly be under all those constraints...if its truly a better than human AI it will come up with better strategies given those constraints.

    Because I can come up with the strategy of run and jump backwards, never miss with the rail gun, and pick up the quad damage the instant it spawns; I just can't execute on that strategy with machine precision because I'm not a machine.

  15. Re:meh by Cytotoxic · · Score: 2

    That echos my point, somewhat. It is pretty easy to design an AI for a lot of video games that can beat a human (without cheating). The AI code from Quake was only a couple of pages. Whether you use the moniker "constraints" or call it "dumbing down", it would take a lot more code to give the AI more human-like abilities. Probably several times as much code.

    The same goes for the enemies. The code that gave them a degree of autonomy and communication was pretty small, but it made them unstoppable because they would just run away and warn all the other enemies and come attack en mass and from cover. It wasn't sophisticated AI at all. Making an AI that was competitive with humans required a lot more coding, restricting the capabilities of the enemies. It is odd that the hard part about making a game AI would be making an AI that isn't too competetive, but that's where we are.

    As to your other point, making a machine think like a human strategically is probably much more difficult. But here's the philosophical question: does the motivation behind someone's actions really matter, or is what they actually do the only thing that actually counts? If the computer is unbeatable because it is just that much faster than you does it really make for a difference in gameplay as compared to a computer that is unbeatable because it truly understands better than you and always initiates a better strategy? Sure, one might be a lot more exciting from a programming point of view, but either way you still got beat by the computer.

  16. Re:meh by vux984 · · Score: 2

    That echos my point, somewhat. It is pretty easy to design an AI for a lot of video games that can beat a human (without cheating).

    Yes, albeit, for a slightly strained definition of "without cheating" :)

    Whether you use the moniker "constraints" or call it "dumbing down",

    I think the distinction is important. Dumbing it down would be deliberately sabotaging its ability to make good decisions. The constraints certainly have the same effect, but we aren't sabotaging the decision making itself, but merely restricting the information it actually has to work with access to human levels.

    No perfect clock. No perfect positioning. etc.

    It is odd that the hard part about making a game AI would be making an AI that isn't too competetive, but that's where we are.

    Not really. It's just that we've devised a game that's difficult and interesting for humans, but is very easy and trivial for the bot; especially given that by default it has world state information we just don't have. (It knows what time it is, what its x/y/z is.

    It can:

    advance 4 units at 1 unit per second,
    turn 28.5 degrees at 5 degrees per second
    retreat 12 units at 11 units per second
    turn 19 degrees a 11.4 degrees per second
    advance 72 units at 15 units per second
    can be arbitrarily long, is trivial for the bot to execute, doesn't require any sensory input/feedback, and can be replayed backwards.

    I can't do that. No human can. We aren't that precise. We maintain position and navigate only approximately with constant correction from sensory input. And if I'm playing a game where perfect navigation is demonstrably a very valuable skill then I can't compete with a bot that is allowed to do that. (And my performance against bots drops considerably on open or lava catwalk Quake 3 maps vs closed tunnels maps due to their inhuman ability to navigate exactly where they wish to go whether they are looking where they are going or not...)

    That's not "advanced AI"; that's "giving it rudimentary AI paired with superhuman ability". That's not much fun.

    But here's the philosophical question: does the motivation behind someone's actions really matter, or is what they actually do the only thing that actually counts?

    No it doesn't matter if they do the same thing. But they don't do the same thing. The current process of tweaking the AI's super human abilities and tossing a wrench into their decision making so its not always the optimal choice etc to make them appear more human like vs actually being human-like in response to human constraints does not result in them doing the same thing.

    Instead of being truly human like. They end up acting like superhumans with narcolepsy... brilliantly efficient but occasionally they just fall asleep; and worse they usually can be relied upon to fall asleep at the wheel in a given set of circumstances that can be manufactured by the opponent (aka exploited)

    For example, in RTS with harvestors a human player might trap a bunch of the AI's harvestors -- a human opponent would not be fooled once he saw what happened, and he'd build more, and suicide/free/destroy the trapped units. But the AI? The average AI will just starve, because it hasn't been programmed to recognize that scenario or how to respond when it happens. It has enough harvestors, so it won't build more... it has sent them orders to harvest...and they aren't under attack... so mission accomplished. Resources will arrive soon.... any minute now... still waiting... oh my base is under attack... hope those resources show up soon... man where are my resources... shit I lost.

    All manner of path finding exploits are common in RTS games. Getting his units to stumble over themselves and get in their own way. Funnelling them into kill zones. Etc.