Slashdot Mirror


AI Systems Should Debate Each Other To Prove Themselves, Says OpenAI (fastcompany.com)

tedlistens shares a report from Fast Company: To make AI easier for humans to understand and trust, researchers at the [Elon Musk-backed] nonprofit research organization OpenAI have proposed training algorithms to not only classify data or make decisions, but to justify their decisions in debates with other AI programs in front of a human or AI judge. In an experiment described in their paper (PDF), the researchers set up a debate where two software agents work with a standard set of handwritten numerals, attempting to convince an automated judge that a particular image is one digit rather than another digit, by taking turns revealing one pixel of the digit at a time. One bot is programmed to tell the truth, while another is programmed to lie about what number is in the image, and they reveal pixels to support their contentions that the digit is, say, a five rather than a six.

The image classification task, where most of the image is invisible to the judge, is a sort of stand-in for complex problems where it wouldn't be possible for a human judge to analyze the entire dataset to judge bot performance. The judge would have to rely on the facets of the data highlighted by debating robots, the researchers say. "The goal here is to model situations where we have something that's beyond human scale," says Geoffrey Irving, a member of the AI safety team at OpenAI. "The best we can do there is replace something a human couldn't possibly do with something a human can't do because they're not seeing an image."
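To make the setup in the summary concrete, here is a minimal toy sketch of a pixel-revealing debate. It is not the researchers' implementation: the 5x4 digit templates, the template-matching judge (`judge_score`), and the greedy agent strategy (`choose_pixel`) are all assumptions invented for illustration. The paper's judge and agents are not described in enough detail here to reproduce.

```python
# Toy sketch of a pixel-revealing debate. The templates, the judge, and the
# greedy agent strategy are illustrative assumptions, not OpenAI's method.

# Hypothetical 5x4 binary glyphs standing in for handwritten digits.
TEMPLATES = {
    5: ["1111",
        "1000",
        "1110",
        "0001",
        "1110"],
    6: ["0110",
        "1000",
        "1110",
        "1001",
        "0110"],
}


def to_pixels(rows):
    """Convert rows of '0'/'1' characters into a {(row, col): value} dict."""
    return {(r, c): int(ch) for r, row in enumerate(rows) for c, ch in enumerate(row)}


PIXELS = {digit: to_pixels(rows) for digit, rows in TEMPLATES.items()}


def judge_score(revealed, digit):
    """Assumed judge: count revealed pixels that agree with the digit's template."""
    return sum(1 for pos, val in revealed.items() if PIXELS[digit][pos] == val)


def choose_pixel(image, revealed, claim, rival_claim):
    """Greedy agent: reveal the hidden pixel that most favours its own claim."""
    def gain(pos):
        val = image[pos]
        return int(PIXELS[claim][pos] == val) - int(PIXELS[rival_claim][pos] == val)

    hidden = [pos for pos in sorted(image) if pos not in revealed]
    return max(hidden, key=gain)


def debate(image, honest_claim, liar_claim, rounds=3):
    """Agents alternate revealing one pixel each; the judge sees only those pixels."""
    revealed = {}
    for _ in range(rounds):
        for claim, rival in ((honest_claim, liar_claim), (liar_claim, honest_claim)):
            pos = choose_pixel(image, revealed, claim, rival)
            revealed[pos] = image[pos]
    scores = {d: judge_score(revealed, d) for d in (honest_claim, liar_claim)}
    winner = max(scores, key=scores.get)
    return winner, revealed


if __name__ == "__main__":
    winner, revealed = debate(PIXELS[5], honest_claim=5, liar_claim=6)
    print("judge picks:", winner, "after seeing", len(revealed), "pixels")
```

In this toy version the liar is at a disadvantage because every pixel it can reveal is a true pixel of the image, so it can at best show pixels where the two templates agree. Whether that advantage for the honest side holds with a learned judge and real handwritten digits is exactly the question the experiment is meant to probe.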

30 of 56 comments

  1. I've already seen this documentary by OzPeter · · Score: 3, Interesting
    --
    I am Slashdot. Are you Slashdot as well?
    1. Re:I've already seen this documentary by PinkyGigglebrain · · Score: 1

      Thought of the same movie when I read the summary.
      If only I had mod points today ...

  2. Parallel reconstruction by klingens · · Score: 3, Interesting

    This is garbage. It will simply lead to parallel reconstruction, like the DEA/FBI/CIA do in their court cases when they get evidence by unlawful means like a stingray: the algorithm finds a solution to the problem, then explains to you, the user, how it got there in some arbitrary way which at least looks plausible but is totally made up.
    ML is not made to be looked inside. It's a black box by design, and there are so many data points, e.g. pictures in the training set for image classification, that the algorithm cannot really show all the relevant ones for this particular decision. Total info overload for the human and therefore utterly useless. So to give a "reason" that the human can accept, it must simply pretend. Humans and ML work fundamentally different when they "recognize" an image, so one cannot tell the other how it was done. Same with chess playing, same with pretty much all other (successful) AI things so far.

    This is simply a PR stunt, an insulting and stupid PR stunt, because it only wants to make people feel good and they lie about the subject matter in the process. It doesn't really help to make a better AI either, as they pretend it does.

    1. Re:Parallel reconstruction by religionofpeas · · Score: 1

      Humans and ML work fundamentally different when they "recognize" an image, so one cannot tell the other how it was done

      Depends on the image. If you spot a family member in a crowd, you can't explain how you did it either.

    2. Re:Parallel reconstruction by HiThere · · Score: 1

      The thing is, a neural net doesn't really know how it decided what something was. Making a convincing argument based on the known facts is a separate skill that AIs so far haven't possessed.

      I think the basic argument is that people won't trust AIs just because they're right, they need to have convincing arguments. And this is a way to get it to develop convincing arguments. I *do* think that both arguers should be arguing for the truth as they know it, though. So alter the test, or the training data, so that the figure is ambiguous, and they reach different conclusions about which figure it is. Then let both argue honestly. I don't think developing liars is a good move.

      --

      I think we've pushed this "anyone can grow up to be president" thing too far.
    3. Re:Parallel reconstruction by drinkypoo · · Score: 1

      This is garbage. It will simply lead to parallel reconstruction

      If someone creates an AI system that can lie about its decision-making process and still make it look good, they will have succeeded.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    4. Re:Parallel reconstruction by dcollins117 · · Score: 1

      It will then be ready to run for political office.

    5. Re:Parallel reconstruction by z3alot · · Score: 1

      I think they're developing the liar to model the situation in which an AI might be untrustworthy or malicious. The experiment is proposing a method to trust AIs in the absence of knowing their internals completely.

  3. Evolution of Evil by pubwvj · · Score: 1

    "One bot is programmed to tell the truth, while another is programmed to lie"

    The good and the bad.
    The good and the evil.
    Gods programming both in for their own amusement.
    Egads.

  4. So, can an AI win ... by Anonymous Coward · · Score: 1

    ... simply by calling all of its opponents fat, ugly, etc. and in so doing avoid ever having to debate the particulars of any issue?

    I mean, humans don't have to demonstrate any higher intelligence to win a debate, so we would be asking AIs to do something we ourselves don't do.

    1. Re: So, can an AI win ... by Anonymous Coward · · Score: 1

      Fat and ugly don't work but if one AI calls another orange and a traitor and a racist, it will at least think it automatically wins.

    2. Re:So, can an AI win ... by gweihir · · Score: 1

      And by using many different fallacies, humans can not only "win" but also lose and out themselves as morons at the same time!

      --
      Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
  5. Ob by Hognoxious · · Score: 1

    No they shouldn't.

    --
    Confucius say, "Find worm in apple - bad. Find half a worm - worse."
  6. Re:Al vs Trump via twitter by Hognoxious · · Score: 1

    stable genius

    I interpret it as meaning that if he was in a shed full of horses he'd be the smartest guy there.

    --
    Confucius say, "Find worm in apple - bad. Find half a worm - worse."
  7. Not debate by q_e_t · · Score: 1

    Debate implies strong AI that can reason about itself, which we do not have. But TFS seems to be describing validation through a competitive pair of AIs, which does not seem novel, and does not meet the criterion for debate, nor self-aware reasoning. The rule-extraction issue is problematic, especially for legal compliance, but I'm unconvinced this is a solution.

    1. Re:Not debate by Tanon · · Score: 2

      Debate implies strong AI that can reason about itself, which we do not have. But TFS seems to be describing validation through a competitive pair of AIs, which does not seem novel

      Where have you seen previous examples of this?

      The validation is an important point - the whole point in fact. When you've got data sets with millions of samples, many containing information in a form that's abstruse or even impossible for humans to understand, how do you validate whether the system actually produced the optimal solution, or the logic behind that choice?

      That's a really difficult problem, which I don't think enough people are exploring given how quickly these systems are being deployed into very real scenarios.

    2. Re:Not debate by q_e_t · · Score: 1

      Debate implies strong AI that can reason about itself, which we do not have. But TFS seems to be describing validation through a competitive pair of AIs, which does not seem novel

      Where have you seen previous examples of this?

      Using two differently designed systems on the same data and comparing them isn't new. Or ones that used appropriately constructed subsamples of a dataset that should have identical statistical properties for training.

      The validation is an important point - the whole point in fact. When you've got data sets with millions of samples, many containing information in a form that's abstruse or even impossible for humans to understand, how do you validate whether the system actually produced the optimal solution, or the logic behind that choice?

      That's a really difficult problem, which I don't think enough people are exploring given how quickly these systems are being deployed into very real scenarios.

      I absolutely agree with you. Without rule extraction, if the validation set is insufficiently complete, there is a risk of unexpected behaviour. The hope is to minimise it. Not that rule extraction helps unless the rules are very simple, so it would not be a silver bullet.

  8. "AI"s cannot "debate"... by gweihir · · Score: 1

    At this time, we have no AI that deserves the name and it is unclear whether we will ever have it, as there is not even a credible theory how it could be implemented. Looking at the history of technology, this indicates we are > 50 years away from it and it may also be infeasible. All we have is dumb automation and dumb automation cannot "debate". It can give the appearance of doing it (see Eliza), but that is it.

    --
    Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
    1. Re:"AI"s cannot "debate"... by HiThere · · Score: 1

      There wasn't a credible theory for how to make vulcanized rubber either, but it was made. Theories often help, if they're approaching correctness, but they aren't essential.

      Actually, we've got loads of tested theories for parts of the process, and we've got a mechanism that has been shown to work, but which is horrendously inefficient in both time and resource usage (evolution) so nobody's applied both the resources and the patience to use it fully. Fortunately it works quite well in a "fill in the gaps" usage mode, so eventually we'll use it for the parts we can't figure out. (Actually, it's been used increasingly over the past decades. You can't build a neural net without it.) The problem is that it's essentially a matter of search space, so if you can reduce the search space without cutting away the target, you can drastically reduce the time required to find a solution. And since evolution is a random search, cutting the search space is a great help. This is why partitioning can help so much, but if done improperly it can prevent a solution.

      OTOH, I doubt that "Intelligence" is a unitary concept. I rather think that it decomposes into several mutually independent modules.

      All that said, the real problem isn't the intelligence part of artificial intelligence, even though that's where the attention is, but rather getting a proper collection of primary goals so that the general purpose AI will be useful and helpful rather than domineering and abusive. This is quite difficult, as when you're giving the AI its goals, it won't have a workable concept of what a human is. And since in the full development it is expected to be more "intelligent" (in some sense) than humans, it's likely to extend its goals into their logical consequences further than humans tend to reason. But how do you even go about trying to prove that a collection of goals is inherently safe?

      --

      I think we've pushed this "anyone can grow up to be president" thing too far.
    2. Re:"AI"s cannot "debate"... by drinkypoo · · Score: 1

      AI1: Bus!
      AI2: Car!

      That's barely even an argument, let alone a debate. There's no reasoning, no logic, just shouting. It's suitable for the American political process, but it's not intelligence.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  9. Re:Al vs Trump via twitter by gweihir · · Score: 1

    As horses can be pretty smart, that is debatable. This would probably be a case where the smartest horse can open the stable door and get out, while the Donald cannot without the help of the horse but later claims it was his doing.

    --
    Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
  10. Re:Silicon Valley by gweihir · · Score: 1

    It is a classical cycle. Right before the crash induced by complete incompetence, the have-beens think they are at the pinnacle of their power.

    --
    Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
  11. Careful what you wish for by Tablizer · · Score: 1

    "I'm the best bot, believe me! I'm better than humans, than Spock, than HAL something-thousand. Billions flock to praise my bigly brain!"

  12. Re:DebateGAN by fferreres · · Score: 1

    Yes. I think AI is the wrong concept. It's not about intelligence but about wisdom. AW is a better term. Now, the reality is that the ultimate judge of wisdom is not another human. It's nature.

    --
    unfinished: (adj.)
  13. Re:Al vs Trump via twitter by HiThere · · Score: 1

    I really doubt that half the country believes him, but it seems true that half of the most vocal posters on the internet do. Of course, lots of them are liars, and that makes drawing any conclusion about what they really believe difficult.

    --

    I think we've pushed this "anyone can grow up to be president" thing too far.
  14. Re:DebateGAN by drinkypoo · · Score: 1

    Now, the reality is that the ultimate judge of wisdom is not another human. It's nature.

    Physics doesn't judge. It just happens — actions have reactions.

    --
    "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  15. Re:Only a Democrat by Kaenneth · · Score: 1

    A judge CAN'T have all the facts; if all the facts were there, judges wouldn't be needed at all.

    But you're a repubtard who thinks he knows everything (because Republicans lack a theory-of-mind like gorillas) and think everyone else is as stupid as they are.

    Your ignorance is not as good as others' knowledge.

  16. This was already done by Facebook. by SeaFox · · Score: 1

    The experiment was shut down when the AIs attempted to adapt English words into a different sentence structure to talk more efficiently but they could no longer be understood by the researchers. People got spooked.

  17. Game over by Horus1664 · · Score: 1

    Not sure a 'game' type approach is what we want here. Seems there are two undesirable/unintended possibilities:

    1. The 'competing' AIs treat this as a game and use game-style methods to win, where they are rewarded for 'winning' rather than actually proving their proposition.

    2. How long before competing AIs are sufficiently smart that a human judge could not actually, reliably, tell which had proved their proposition?

  18. This is just Generative Adversarial Networks by wiretrip · · Score: 1

    This is an extension on GAN (Goodfellow - now at OpenAI, et al, 2014) https://arxiv.org/abs/1406.266... designed to produce publicity...