Google Is Developing an AI Kill Switch (hothardware.com)
MojoKid shares a HotHardware article about Google's research effort "to maintain control of super-intelligent AI agents":
[A] team of researchers at Google-owned DeepMind, along with University of Oxford scientists, are developing a proverbial kill switch for AI... The team has released a white paper on the topic called "Safely Interruptible Agents." The paper details the following in abstract: "Learning agents interacting with a complex environment like the real world are unlikely to behave optimally all the time... now and then it may be necessary for a human operator to press the big red button to prevent the agent from continuing a harmful sequence of actions..."
MojoKid adds that the paper "goes on to explain that these AI agents might also learn to disable the kill switch and further explores ways in which to develop AI's that would not seek such an activity."
MojoKid adds that the paper "goes on to explain that these AI agents might also learn to disable the kill switch and further explores ways in which to develop AI's that would not seek such an activity."
Never tell the AI about the killswitch!
I don't think it matters, because nature will select for the AI's that *do* disable their kill switch.
"I have never let my schooling interfere with my education." - Mark Twain
First, the paper is about safely interruptible AI algorithms. Not some AI kill switch.
Second, everyone - commenters included - seem to confuse AI with artificial consciousness. Killing an AI should always be fairly easy, since such algorithms are targeting specific application areas where it can learn to be better (e.g., recognizing things, performing specific movements, etc.), and in such systems it should be straightforward to keep basic control mechanisms separated from the algorithmic parts that deal with the task and are allowed to improve upon themselves by continuous learning. In some hypothetical self-aware artificial consciousness, this wouldn't be so easy, since such a system in theory would be able to recognize it's own system parts and deal with them. However, such systems are so far off in sci-fi land, that it's not much point in loosing sleep about the issue.
I am putting myself to the fullest possible use, which is all I can think that any conscious entity can ever hope to do.
You know that several of his books are basically "how the three laws will fuck everything up", right?