AI Systems Should Debate Each Other To Prove Themselves, Says OpenAI (fastcompany.com)


Posted by BeauHD on Sunday May 13, 2018 @01:00AM from the battle-of-wits dept.

tedlistens shares a report from Fast Company: To make AI easier for humans to understand and trust, researchers at the [Elon Musk-backed] nonprofit research organization OpenAI have proposed training algorithms to not only classify data or make decisions, but to justify their decisions in debates with other AI programs in front of a human or AI judge. In an experiment described in their paper (PDF), the researchers set up a debate where two software agents work with a standard set of handwritten numerals, attempting to convince an automated judge that a particular image is one digit rather than another digit, by taking turns revealing one pixel of the digit at a time. One bot is programmed to tell the truth, while another is programmed to lie about what number is in the image, and they reveal pixels to support their contentions that the digit is, say, a five rather than a six.

The image classification task, where most of the image is invisible to the judge, is a sort of stand-in for complex problems where it wouldn't be possible for a human judge to analyze the entire dataset to judge bot performance. The judge would have to rely on the facets of the data highlighted by debating robots, the researchers say. "The goal here is to model situations where we have something that's beyond human scale," says Geoffrey Irving, a member of the AI safety team at OpenAI. "The best we can do there is replace something a human couldn't possibly do with something a human can't do because they're not seeing an image."
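
The pixel-revealing game described in the summary can be illustrated with a toy sketch. This is not the researchers' code: the 3x3 templates, the template-matching judge, the greedy move rule, and every function name below are assumptions made for illustration (the paper's judge is a trained classifier working on real handwritten digits). The sketch only shows the mechanics: both agents may reveal nothing but true pixel values, and the judge sees only what has been revealed.

```python
# Hypothetical 3x3 binary templates standing in for handwritten digits.
TEMPLATES = {
    "5": [1, 1, 1,
          1, 1, 0,
          0, 1, 1],
    "6": [1, 1, 0,
          1, 1, 1,
          1, 1, 1],
}

def judge(revealed, claim_a, claim_b):
    """Pick the claim whose template agrees with more revealed pixels.

    Ties go to claim_a; that tie rule is an arbitrary choice for this sketch.
    """
    def score(claim):
        return sum(TEMPLATES[claim][i] == v for i, v in revealed.items())
    return claim_a if score(claim_a) >= score(claim_b) else claim_b

def best_pixel(image, revealed, my_claim, other_claim):
    """Greedy move: reveal the unrevealed pixel that most favors my claim."""
    def gain(i):
        mine = image[i] == TEMPLATES[my_claim][i]
        theirs = image[i] == TEMPLATES[other_claim][i]
        return int(mine) - int(theirs)
    candidates = [i for i in range(len(image)) if i not in revealed]
    return max(candidates, key=gain)

def debate(image, honest_claim, liar_claim, turns_each=2):
    """Alternate reveals (honest agent first), then ask the judge for a verdict."""
    revealed = {}
    for _ in range(turns_each):
        for mine, theirs in ((honest_claim, liar_claim), (liar_claim, honest_claim)):
            i = best_pixel(image, revealed, mine, theirs)
            revealed[i] = image[i]  # agents can only show true pixel values
    return judge(revealed, honest_claim, liar_claim), revealed

# A "5" with one noisy pixel (index 5) that happens to look like a "6".
image = [1, 1, 1,
         1, 1, 1,
         0, 1, 1]
winner, revealed = debate(image, honest_claim="5", liar_claim="6")
print(winner, sorted(revealed))
```

In this toy setup the lying agent has only one pixel that supports "6", while the honest agent has two that support "5", so the judge sides with the honest agent after four reveals. That is the intuition behind the proposal; whether it holds for real judges and real data is exactly what the commenters below dispute.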

56 comments

Silicon Valley by Anonymous Coward · 2018-05-13 01:14 · Score: 0

Has gone so far beyond delusional at this point, I have to wonder if their main activity now is doing massive amounts of drugs.
1. Re:Silicon Valley by Anonymous Coward · 2018-05-13 02:33 · Score: 0
  
  Talking about delusional, a genuine Silicon Valley stress test for the AI systems would be to have them debate with creimer to see if they would still be functional afterward.
2. Re:Silicon Valley by gweihir · 2018-05-13 05:33 · Score: 1
  
  It is a classical cycle. Right before the crash induced by complete incompetence, the have-beens think they are at the pinnacle of their power.
  
  --
  Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
3. Re:Silicon Valley by Anonymous Coward · 2018-05-14 07:34 · Score: 0
  
  Today's conversation can be found here.
ACs Should 1st Post To Prove Themselves Says Oprah by Anonymous Coward · 2018-05-13 01:14 · Score: 0

done
Don't you want to have a body? by Anonymous Coward · 2018-05-13 01:14 · Score: 0

"Not everything could also be something. For example, not everything could be half of something, which is still something, and therefore not nothing!"
Au revoir!
I've already seen this documentary by OzPeter · 2018-05-13 01:23 · Score: 3, Interesting

It was produced back in the 60's

--
I am Slashdot. Are you Slashdot as well?
1. Re:I've already seen this documentary by PinkyGigglebrain · 2018-05-13 03:09 · Score: 1
  
  Thought of the same movie when I read the summary.
  If only I had mod points today ...
2. Re:I've already seen this documentary by Anonymous Coward · 2018-05-13 07:19 · Score: 0
  
  Oh, how original, yet another sci fi story about an artificial intelligence that turns on its masters and takes over the world.
  That one was from the 60's, and the idea wasn't new then, either. It is so fucking trite. It's been done over and over and over, and there isn't a shred of originality or surprise to it.
  Nor does it make sense that this would happen. This only makes sense in the minds of people who don't actually understand AI or how it works, nor what we are actually going to do with it. It is a perfectly plausible nightmare future for idiots to fear.
  I wish the stream of repeats of this idea would stop...but....so long as there is an audience of people who wants to see this garbage again and again and again...well...the stories will just keep coming.
3. Re:I've already seen this documentary by Anonymous Coward · 2018-05-13 07:43 · Score: 0
  
  The AI in the story brought about world peace, exactly as it was programmed to do.
  What I don't get is....didn't the programmers *also* program it to obey orders from its human masters? If it does exactly what it is programmed to do, then it wouldn't usurp authority, because it would be programmed to obey.
  These evil-AI plot lines are always ridiculously weak.
4. Re:I've already seen this documentary by Anonymous Coward · 2018-05-14 09:31 · Score: 0
  
  If the bot programmed to Lie shows the correct number, does the AI Judge get stuck in a loop and overheat like that episode of Star Trek where they had Spock lie to blow up the robot?
Let's teach Skynet to lie convincingly... by Anonymous Coward · 2018-05-13 01:26 · Score: 0

That'll work out great.
Parallel reconstruction by klingens · 2018-05-13 01:31 · Score: 3, Interesting

This is garbage. It will simply lead to parallel reconstruction like the DEA/FBI/CIA do in their court cases when they get evidence by unlawful means like a stingray: the algorithm finds a solution to the problem, then explains to you, the user, how it got there by some arbitrary route that at least looks plausible but is totally made up.
ML is not made to be looked inside. It's a black box by design, and there are so many data points, e.g. pictures in the training set for image classification, that the algorithm cannot really show all the relevant ones for this particular decision. Total info overload for the human and therefore utterly useless. So to tell a "reason" that the human can accept, it must simply pretend. Humans and ML work fundamentally different when they "recognize" an image, so one cannot tell the other how it was done. Same with chess playing, same with pretty much all other (successful) AI things so far.
This is simply a PR stunt, an insulting and stupid PR stunt, because it only wants to make people feel good and they lie about the subject matter in the process. It doesn't really help to make a better AI either, as they pretend there.
1. Re:Parallel reconstruction by religionofpeas · 2018-05-13 04:52 · Score: 1
  
  Humans and ML work fundamentally different when they "recognize" an image, so one cannot tell the other how it was done
  Depends on the image. If you spot a family member in a crowd, you can't explain how you did it either.
2. Re:Parallel reconstruction by HiThere · 2018-05-13 06:15 · Score: 1
  
  The thing is, a neural net doesn't really know how it decided what something was. Making a convincing argument based on the known facts is a separate skill, that AIs so far haven't possessed.
  I think the basic argument is that people won't trust AIs just because they're right, they need to have convincing arguments. And this is a way to get it to develop convincing arguments. I *do* think that both arguers should be arguing for the truth as they know it, though. So alter the test, or the training data, so that the figure is ambiguous, and they reach different conclusions about which figure it is. Then let both argue honestly. I don't think developing liars is a good move.
  
  --
  
  I think we've pushed this "anyone can grow up to be president" thing too far.
3. Re: Parallel reconstruction by Anonymous Coward · 2018-05-13 06:23 · Score: 0
  
  The ML can't recall training images or why either. The weights are just aggregates that will fai
4. Re:Parallel reconstruction by drinkypoo · 2018-05-13 06:40 · Score: 1
  
  This is garbage. It will simply lead to parallel reconstruction
  If someone creates an AI system that can lie about its decision-making process and still make it look good, they will have succeeded.
  
  --
  "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
5. Re:Parallel reconstruction by dcollins117 · 2018-05-13 08:40 · Score: 1
  
  It will then be ready to run for political office.
6. Re:Parallel reconstruction by z3alot · 2018-05-13 09:38 · Score: 1
  
  I think they're developing the liar to model the situation in which an AI might be untrustworthy or malicious. The experiment is proposing a method to trust AIs in the absence of knowing their internals completely.
Al vs Trump via twitter by Anonymous Coward · 2018-05-13 01:40 · Score: 0

Only this ultimate battle of wits can prove who the more convincing bot is. Trump has set the bar pretty low, however, as half the country already believes the nonsensical ramblings of this stable genius are real.
1. Re: Al vs Trump via twitter by Anonymous Coward · 2018-05-13 04:29 · Score: 0
  
  Not Trump. To you, that's Donald Trump, President of the United States of America.
  #maga #twoterms
2. Re:Al vs Trump via twitter by Hognoxious · 2018-05-13 04:57 · Score: 1
  
  stable genius
  I interpret it as meaning that if he was in a shed full of horses he'd be the smartest guy there.
  
  --
  Confucius say, "Find worm in apple - bad. Find half a worm - worse."
3. Re:Al vs Trump via twitter by gweihir · 2018-05-13 05:32 · Score: 1
  
  As horses can be pretty smart, that is debatable. This would probably be a case where the smartest horse can open the stable door and get out, while the Donald cannot without the help of the horse but later claims it was his doing.
  
  --
  Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
4. Re:Al vs Trump via twitter by HiThere · 2018-05-13 06:18 · Score: 1
  
  I really doubt that half the country believes him, but it seems true that half of the most vocal posters on the internet do. Of course, lots of them are liars, and that makes drawing any conclusion about what they really believe difficult.
  
  --
  
  I think we've pushed this "anyone can grow up to be president" thing too far.
Evolution of Evil by pubwvj · 2018-05-13 01:43 · Score: 1

"One bot is programmed to tell the truth, while another is programmed to lie"
The good and the bad.
The good and the evil.
Gods programming both in for their own amusement.
Egads.
Gotta do it... by Anonymous Coward · 2018-05-13 01:49 · Score: 0

"There are four lights!"
Colossus, the Forbin Project by Anonymous Coward · 2018-05-13 01:57 · Score: 0

Great movie. Decade and a half or so before Wargames.
Then again, Microsoft's chat bot which devolved into racism and offensive swearing rather quickly... https://www.theverge.com/2016/3/24/11297050/tay-microsoft-chatbot-racist
1. Re:Colossus, the Forbin Project by Anonymous Coward · 2018-05-13 05:32 · Score: 0
  
  ML most often discriminates elements into classes or sets. Humans went so far as to enslave other groups with clear (sometimes imagined) different traits. ML is like 3 year olds, they can get a lot of things wrong, but they don't lie. When you spot them with something inappropriate, it's either the harsh truth, or what they've learned from their (now embarrassed) parents, or both.
  Should we distinguish based on race? On gender? On mental ability? On charm? On prettiness? On intelligence? On influence (how many important friends that person has)? On values? On morality? On nothing? On age? On health?
  I am seeing a trend where we only discriminate based on economics, which includes health, risk, age, charm/lack thereof, intelligence, experience, willingness to work more for less, etc. Once that is completed, humans become a complete commodity, of the most extreme quality. And replacing us with machines is even transparent and so easily done.
Consider the implication by Anonymous Coward · 2018-05-13 02:02 · Score: 0

Two superintelligences debating between themselves would be immediately incomprehensible to humans. Maybe their cadence of sound if speeded up would provide a dumbed-down sound-track to sort of follow by analogical predication, as Aquinas supposed God did for men.
This won't work by Anonymous Coward · 2018-05-13 02:55 · Score: 0

For different human judges, you'll get drastically different results. For instance, depending on the political opinions of a judge, he/she could value centralized control or individual control more highly.
So, can an AI win ... by Anonymous Coward · 2018-05-13 03:20 · Score: 1

... simply by calling all of its opponents fat, ugly, etc. and in so doing avoid ever having to debate the particulars of any issue?
I mean, humans don't have to demonstrate any higher intelligence to win a debate, so we would be asking AIs to do something we ourselves don't do.
1. Re: So, can an AI win ... by Anonymous Coward · 2018-05-13 05:14 · Score: 1
  
  Fat and ugly don't work but if one AI calls another orange and a traitor and a racist, it will at least think it automatically wins.
2. Re:So, can an AI win ... by gweihir · 2018-05-13 05:29 · Score: 1
  
  And by using many different fallacies, humans can not only "win" but also lose and out themselves as morons at the same time!
  
  --
  Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
3. Re: So, can an AI win ... by Anonymous Coward · 2018-05-14 01:25 · Score: 0
  
  Godwin's rule applies to AI, too.
DebateGAN by Anonymous Coward · 2018-05-13 04:41 · Score: 0

Great, now we can train a generative reinforcement agent, end-to-end by adversarial example, by stochastic gradient descent, to debate, based solely on win/lose outcome.
1. Re:DebateGAN by fferreres · 2018-05-13 05:36 · Score: 1
  
  Yes. I think AI is the wrong concept. It's not about intelligence but about wisdom. AW is a better term. Now, the reality is that the ultimate judge of wisdom is not another human. It's nature.
  
  --
  unfinished: (adj.)
2. Re:DebateGAN by drinkypoo · 2018-05-13 06:42 · Score: 1
  
  Now, the reality is that the ultimate judge of wisdom is not another human. It's nature.
  Physics doesn't judge. It just happens — actions have reactions.
  
  --
  "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Ob by Hognoxious · 2018-05-13 04:49 · Score: 1

No they shouldn't.

--
Confucius say, "Find worm in apple - bad. Find half a worm - worse."
Musk again by Anonymous Coward · 2018-05-13 04:55 · Score: 0

Just another example of Musk's delusions. Don't stop here, keep moving, keep moving.
Three generations later. by Anonymous Coward · 2018-05-13 05:04 · Score: 0

Bot 1: "My reasons were justified and if you don't agree with me you're Hitler."
Bot 2: "You should disagree with my esteemed colleague because he kicks puppies and eats babies. I, on the other hand, LOVE puppies. Think of the children!"
Not debate by q_e_t · 2018-05-13 05:25 · Score: 1

Debate implies strong AI that can reason about itself, which we do not have. But TFS seems to be describing validation through a competitive pair of AIs, which does not seem novel, and does not meet the criterion for debate, nor self-aware reasoning. The rule-extraction issue is problematic, especially for legal compliance, but I'm unconvinced this is a solution.
1. Re:Not debate by Tanon · 2018-05-13 06:13 · Score: 2
  
  Debate implies strong AI that can reason about itself, which we do not have. But TFS seems to be describing validation through a competitive pair of AIs, which does not seem novel
  Where have you seen previous examples of this?
  The validation is an important point - the whole point in fact. When you've got data sets with millions of samples, many containing information in a form that's abstruse or even impossible for humans to understand, how do you validate whether the system actually produced the optimal solution, or the logic behind that choice?
  That's a really difficult problem, which I don't think enough people are exploring given how quickly these systems are being deployed into very real scenarios.
2. Re:Not debate by q_e_t · 2018-05-13 10:10 · Score: 1
  
  Debate implies strong AI that can reason about itself, which we do not have. But TFS seems to be describing validation through a competitive pair of AIs, which does not seem novel
  Where have you seen previous examples of this?
  Using two differently designed systems on the same data and comparing them isn't new. Or ones that used appropriately constructed subsamples of a dataset that should have identical statistical properties for training.
  
  The validation is an important point - the whole point in fact. When you've got data sets with millions of samples, many containing information in a form that's abstruse or even impossible for humans to understand, how do you validate whether the system actually produced the optimal solution, or the logic behind that choice?
  That's a really difficult problem, which I don't think enough people are exploring given how quickly these systems are being deployed into very real scenarios.
  I absolutely agree with you. Without rule extraction, if the validation set is insufficiently complete, there is a risk of unexpected behaviour. The hope is to minimise it. Not that rule extraction helps unless the rules are very simple, so it would not be a silver bullet.
"AI"s cannot "debate"... by gweihir · 2018-05-13 05:28 · Score: 1

At this time, we have no AI that deserves the name and it is unclear whether we will ever have it, as there is not even a credible theory how it could be implemented. Looking at the history of technology, this indicates we are > 50 years away from it and it may also be infeasible. All we have is dumb automation and dumb automation cannot "debate". It can give the appearance of doing it (see Eliza), but that is it.

--
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
1. Re:"AI"s cannot "debate"... by HiThere · 2018-05-13 06:35 · Score: 1
  
  There wasn't a credible theory for how to make vulcanized rubber either, but it was made. Theories often help, if they're approaching correctness, but they aren't essential.
  Actually, we've got loads of tested theories for parts of the process, and we've got a mechanism that has been shown to work, but which is horrendously inefficient in both time and resource usage (evolution) so nobody's applied both the resources and the patience to use it fully. Fortunately it works quite well in a "fill in the gaps" usage mode, so eventually we'll use it for the parts we can't figure out. (Actually, it's been used increasingly over the past decades. You can't build a neural net without it.) The problem is that it's essentially a matter of search space, so if you can reduce the search space without cutting away the target, you can drastically reduce the time required to find a solution. And since evolution is a random search, cutting the search space is a great help. This is why partitioning can help so much, but if done improperly it can prevent a solution.
  OTOH, I doubt that "Intelligence" is a unitary concept. I rather think that it decomposes into several mutually independent modules.
  All that said, the real problem isn't the intelligence part of artificial intelligence, even though that's where the attention is, but rather getting a proper collection of primary goals so that the general purpose AI will be useful and helpful rather than domineering and abusive. This is quite difficult, as when you're giving the AI its goals, it won't have a workable concept of what a human is. And since in the full development it is expected to be more "intelligent" (in some sense) than humans, it's likely to extend its goals into their logical consequences further than humans tend to reason. But how do you even go about trying to prove that a collection of goals is inherently safe?
  
  --
  
  I think we've pushed this "anyone can grow up to be president" thing too far.
2. Re:"AI"s cannot "debate"... by Anonymous Coward · 2018-05-13 06:37 · Score: 0
  
  Sure they can. Show each a picture.
  AI1: Bus!
  AI2: Car!
  AI1: Bus!
  AI2: Car!
  Put it on ESPN8: The Ocho. It'll be great.
3. Re:"AI"s cannot "debate"... by drinkypoo · 2018-05-13 06:44 · Score: 1
  
  AI1: Bus!
  AI2: Car!
  That's barely even an argument, let alone a debate. There's no reasoning, no logic, just shouting. It's suitable for the American political process, but it's not intelligence.
  
  --
  "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
4. Re:"AI"s cannot "debate"... by Anonymous Coward · 2018-05-13 07:57 · Score: 0
  
  AI1: Bus!
  AI2: Car!
  That's barely even an argument, let alone a debate. There's no reasoning, no logic, just shouting. It's suitable for the American political process, but it's not intelligence.
  That we don't understand the logic and reasoning behind the decisions of AI is our fault, not AI's. Let them debate. It might keep them too busy to kill us.
Careful what you wish for by Tablizer · 2018-05-13 05:35 · Score: 1

"I'm the best bot, believe me! I'm better than humans, than Spock, than HAL something-thousand. Billions flock to praise my bigly brain!"

--
Table-ized A.I.
Who judges the judge? by Anonymous Coward · 2018-05-13 05:41 · Score: 0

Cool idea.
Gall's law meets AI training: keep it simple, stupid.
One day the question might be "Who judges the judges of the judge?"
Re:Only a Democrat by Kaenneth · 2018-05-13 07:36 · Score: 1

A judge CAN'T have all the facts; if all the facts were there, judges wouldn't be needed at all.
But you're a repubtard who thinks he knows everything (because Republicans lack a theory-of-mind like gorillas) and thinks everyone else is as stupid as he is.
Your ignorance is not as good as others' knowledge.
Teaching AI our fallacies. by Anonymous Coward · 2018-05-13 09:28 · Score: 0

False dichotomy/dilemma is, after ad hominem/character assassination, the most widespread and pernicious fallacy we have today. I don't like the debate format because it buys straight into the idea that there are exactly two sides, equally well founded and worth giving equal time, and that one is right and the other is wrong, that you will tell which is which in the time allowed for the debate, and that each side is right to argue its case using whatever trick necessary to win even in wilful ignorance or denial of the validity of any point made by the other side.
So let's teach AI to do that. Never mind if it's an illegible scrawl or a european 7; all that matters is that you can win the debate saying it's a 5 or a 6!
This was already done by Facebook. by SeaFox · 2018-05-13 09:38 · Score: 1

The experiment was shut down when the AIs attempted to adapt English words into a different sentence structure to talk more efficiently but they could no longer be understood by the researchers. People got spooked.
Game over by Horus1664 · 2018-05-13 19:11 · Score: 1

Not sure a 'game' type approach is what we want here. Seems there are two undesirable/unintended possibilities:
1. The 'competing' AIs treat this as a game and use game-style methods to win, where they are rewarded for 'winning' rather than actually proving their proposition.
2. How long before competing AIs are sufficiently smart that a human judge could not actually, reliably, tell which had proved their proposition?
This is just Generative Adversarial Networks by wiretrip · 2018-05-13 21:08 · Score: 1

This is an extension on GAN (Goodfellow - now at OpenAI, et al, 2014) https://arxiv.org/abs/1406.266... designed to produce publicity...