IBM Researchers Teach Pac-Man To Do No Harm (fastcompany.com)

← Back to Stories (view on slashdot.org)

IBM Researchers Teach Pac-Man To Do No Harm (fastcompany.com)

Posted by msmash on Thursday October 25, 2018 @04:06AM from the ideal-pac-man dept.

harrymcc writes: The better AI gets at teaching itself to perform tasks in ways beyond the skills of mere humans, the more likely it is that it may unwittingly behave in ways a human would consider unethical. To explore ways to prevent this from happening, IBM researchers taught AI to play Pac-Man without ever gobbling up the ghosts. And it did so without ever explicitly telling the software that this was the goal. Over at Fast Company, I wrote about this project and what IBM learned from conducting it.

The researchers built a piece of software that could balance the AI's ratio of self-devised, aggressive game play to human-influenced ghost avoidance, and tried different settings to see how they affected its overall approach to the game. By doing so, they found a tipping point -- the setting at which Pac-Man went from seriously chowing down on ghosts to largely avoiding them.

23 of 135 comments (clear)

Min score:

Reason:

Sort:

Precisely. by Anonymous Coward · 2018-10-25 04:13 · Score: 3, Insightful

This is being made up to be something it's not. Just another example of how non-technical people might interpret technical work.
And just another example of using trendy buzzwords to get undeserved attention.
But can you teach Google by ruddk · 2018-10-25 04:16 · Score: 3, Insightful

I don't think so. :*)

--
L'Idiot
Re: I fail to see what this has to do with ethics by Anonymous Coward · 2018-10-25 04:16 · Score: 4, Insightful

Yea. Even sillier when you remember that Pac Man gets points for eating ghosts, exponent style. The ideal Pac Man eats four ghosts per power pellet, otherwise you are leaving points on the table. Because the game has a finite number lf levels, every abandoned ghost is a lower total score.
Re: I fail to see what this has to do with ethics by Anonymous Coward · 2018-10-25 04:21 · Score: 5, Funny

That's the Conservative Pac Man you're describing. The Liberal Pac Man takes care not to eat the ghosts, and uses whatever points he get from eating pellets in a sustainable way to pay for ghost shelters and outreach to better understand them and protect them.
Interesting, but perhaps useless by alvinrod · 2018-10-25 04:21 · Score: 4, Interesting

This is interesting, but I'm not sure how useful it is. You might be able to create an AI that has some desirable characteristics based on human morality, but as soon as you make it compete against other AIs that don't possess those characteristics, it will either adapt to possess them itself in order to remain competitive or it will perish if it's been crippled in a way to prevent it from adjusting. A pacifistic Pac-Man AI might be novel, but if it was made to compete, it wouldn't do as well.
1. Re:Interesting, but perhaps useless by mi · 2018-10-25 04:45 · Score: 2
  
  Why would you make "dog-eat-dog" a trait, when AIs don't need to eat?
  ICBMs don't need to eat either... Putin needs to eat. Guess, how he'll program his AI?
  
  Why should AIs compete at all, is the question.
  Because some of the tasks we may wish to entrust them will have adversarial aspects. If AI is charged with picking out wanted criminals from the crowd, it will need to weight severely harming the criminal — by having him arrested — vs. harming the rest of us a little bit — by letting him slip. Such an AI will need to compete with the criminal's efforts to disguise himself.
  Similar considerations apply in military applications — though the stakes are higher there.
  
  --
  In Soviet Washington the swamp drains you.
2. Re:Interesting, but perhaps useless by mi · 2018-10-25 04:59 · Score: 2
  
  I hope the end state for AI isn't something as mundane as law enforcement or military applications
  End state? Certainly not — if there is an "end" at all. But they'll certainly be used for that — indeed, already are used for that.
  And adversity is not going anywhere, unfortunately — with competing entities (corporations, nations, criminal enterprises, terrorist groups) using AI to their own ends. Just as you can not raise a child unprepared for adversity, you can not develop an AI unprepared to compete...
  
  --
  In Soviet Washington the swamp drains you.
3. Re:Interesting, but perhaps useless by Monster_user · 2018-10-25 05:01 · Score: 2
  
  Early humans couldn't share because they would starve during the winter? That sounds highly illogical. Sure, there are limits, but why would we need a society if we didn't benefit from other members?
  
  I don't think this has ever been different for humans. We are stronger together. United we stand, divided we fall. We pool our resources to produce even more than we can alone. A being who spends his/her time hunting and gathering to survive has little time to spend researching advanced machinery to increase production per individual. That being had to produce excess so that innovators across the ages would have had free time to tip the scales away from a hunting and gathering society. Even the story of "man's best friend" the dog supports that sharing was important for the survival and advancement of the human race. Humans and dogs shared their abilities and spoils, and both would seem to have profited much. Some dogs are quite spoiled by their owners.
4. Re:Interesting, but perhaps useless by grep+-v+'.*'+* · 2018-10-25 06:37 · Score: 2
  
  severely harming the criminal -- by having him arrested
  What? Just being arrested is being severely harmed? No. At best they GOTCHA or at worst it's a waste of everyone's time. But we want to make absolutely sure the bad guys never get away, right?
  
  More like:
  if Is_This_A_Person() then Report(Yes);
  else Report(No);
  
  Of course bringing asset forfeiture into the picture:
  if Is_This_A_Person() then Report(Yes);
  if Is_This_An_Object() then Report(Yes);
  else Report(No);
  
  Or for the latter, a much simpler:
  Report(Yes);
  
  BF would NOT be happy with this. That it is better 100 guilty Persons should escape than that one innocent Person should suffer
  
  So, and I'm guessing here, Honor used to be a much more vaunted concept (hence duels to the death for the insulted) than today, and "random" imprisonment by the King (or their representative) used to be much more of a harsh and permanent thing.
  
  I fear for the day we're 100% efficient, since NO ONE'S innocent of everything. Of course the ones making the rules are exempt, almost by definition. With computer security, computer administrators are usually exceptions so they can help implement, check, and debug (and repair!) the rules.
  
  --
  If the universe is someone's simulation -- does that mean the stars are just stuck pixels?
5. Re:Interesting, but perhaps useless by PopeRatzo · 2018-10-25 09:34 · Score: 2
  
  Indeed, it is — or ought to be — a mind. To be useful in a real world, it has to know about all aspects of it, even if the degree of knowledge can differ between domains.
  You're still thinking like an engineer, anthropomorphizing the AI. Do you think for a robot to be useful it has to be bipedal and have the approximate shape and function of a human? The point of an AI is to exceed humanity, not imitate it.
  Don't make the mistake of thinking an AI has to operate like a human mind. Stretch yourself and think bigger. You are placing artificial restraints on yourself that are akin to thinking that self-driving cars need to have a robot shaped like a human being operating the steering wheel and pedals and sitting in the driver's seat.
  
  --
  You are welcome on my lawn.
They missed the broader ethics problem by kiehlster · 2018-10-25 04:22 · Score: 5, Funny

When the AI begins watching humans play Pac-Man, doing harm to the ghosts, it will consider humans a threat to ghosts and thus eliminate the humans to satisfy its directives. Is IBM's median employee age too young to have seen Robocop?
IBM Researchers Get Bored And Play Pac-Man.. by Rick+Schumann · 2018-10-25 04:23 · Score: 2

..all day long for weeks and weeks.
That's what the title should be.
Re: I fail to see what this has to do with ethics by AmiMoJo · 2018-10-25 04:23 · Score: 4, Interesting

"Pacifist" runs of games are popular with the speedrunning community. Depending on the game it might be an achievement to simply finish it without harming anything, or it may just be another category to get the fastest time in.

--
const int one = 65536; (Silvermoon, Texture.cs)
SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
Slight contradiction? by Daetrin · 2018-10-25 04:38 · Score: 5, Informative

Title: "IBM Researchers Teach Pac-Man To Do No Harm"

Blurb: "they found a tipping point -- the setting at which Pac-Man went from seriously chowing down on ghosts to largely avoiding them."

So companies will presumably use a similar method to design AIs that will maximize corporate profit with only a _small_ amount of acceptable human murdering in the process?

--
This Space Intentionally Left Blank
1. Re:Slight contradiction? by DontBeAMoran · 2018-10-25 05:25 · Score: 3, Insightful
  
  Then the title should be "IBM researchers teach Pac-Man to be mostly harmless".
  
  --
  #DeleteFacebook
Re: I fail to see what this has to do with ethics by sjames · 2018-10-25 04:50 · Score: 5, Interesting

That's the point. There are plenty of real world cases where we could be more efficient if we simply disregarded moral and ethical concerns. One of the concerns with machine learning is that they may find an optimal solution that violates ethical considerations. The problem is even larger when you consider an AI finding locally optimal solutions disregarding externalities.
For a classic example, Ford once determined that paying off expected damages in wrongful death suits would be slightly cheaper than refitting existing Pintos to not explode.
The Pac-Man simulation is a very simplified version of a case where, due to ethical considerations it is necessary to avoid the locally optimal solution.
Er... that's not teaching the AI *ethics*. by hey! · 2018-10-25 05:01 · Score: 5, Insightful

It's having an algorithm grind out a solution to playing the game which meets an additional constraints, which they tweak.
If they'd actually taught the AI ethics, the AI would construct the play constraints for itself starting from ethical principles. At full human levels of ethical self awareness, the AI would be chasing ghosts down the hall and then -- unprompted -- stop and ask itself, "What am I doing?"

--
Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
1. Re:Er... that's not teaching the AI *ethics*. by 110010001000 · 2018-10-25 05:07 · Score: 3, Informative
  
  Exactly. Just more hype from IBM "AI" desperately trying to sell their AI snakeoil. Yeah we get it IBM: you can create NNs and change the parameters to change the output behavior.
Nope, just simplifies optimum. Ghosts are bad. by raymorris · 2018-10-25 05:47 · Score: 4, Interesting

> The Pac-Man simulation is a very simplified version of a case where, due to ethical considerations it is necessary to avoid the locally optimal solution.
The Pac Man doesn't avoid any optimal solution. It simply defines optimum as not to include not touching ghosts - ghosts are bad. In the classic version of the game, touching a ghost is bad. Unless you've eaten a Power Pellet in the last few seconds. They trained the AIto NOT learn the "unless you've eaten a pellet". It just does "touching ghosts is bad".
There's nothing moral, or even interesting, about "in Pac-Man, touching ghosts is bad". Essentially, just one too stupid to know that Power Pellets do anything.
Re: I fail to see what this has to do with ethics by apoc.famine · 2018-10-25 06:46 · Score: 2

Even non-speedrunning gamers can find some real joy in pacifist runs. It's often far more challenging to not kill enemies than it is to just play the game as designed. It requires different strategies and skills, and often radically different gameplay.
I've replayed a fair number of games that way, just because it breathes new life into an old favorite.

--
Velociraptor = Distiraptor / Timeraptor
Re: I fail to see what this has to do with ethics by fahrbot-bot · 2018-10-25 07:06 · Score: 5, Funny

That's the Conservative Pac Man you're describing. The Liberal Pac Man takes care not to eat the ghosts, and uses whatever points he get from eating pellets in a sustainable way to pay for ghost shelters and outreach to better understand them and protect them.
And PAC Pac Man doesn't disclose how many points he has and uses them to get the ghosts to do his bidding.

--
It must have been something you assimilated. . . .
Re: I fail to see what this has to do with ethics by sourcerror · 2018-10-25 07:32 · Score: 2

Meanwhile SuperPac Man just gobbles up all the points and uses them to lobby the game developer.
Re: I fail to see what this has to do with ethics by mysidia · 2018-10-25 07:57 · Score: 2

The Liberal Pac Man takes care not to eat the ghosts, and uses whatever points he get from eating pellets
Not very many, b/c the Liberal Pac Man deliberately allows the ghosts to defeat pac man.
Once by Pinky, then Inky, and Clyde, in that order --- Blinky doesn't get a turn, b/c there are only 3 Pacman Lives, and Blinky has too much privilege.