Machine Figures Out Rubik's Cube Without Human Assistance (technologyreview.com)

← Back to Stories (view on slashdot.org)

Machine Figures Out Rubik's Cube Without Human Assistance (technologyreview.com)

Posted by BeauHD on Sunday June 17, 2018 @02:14AM from the hands-free dept.

An anonymous reader quotes a report from MIT Technology Review: [Stephen McAleer and colleagues from the University of California, Irvine] have pioneered a new kind of deep-learning technique, called "autodidactic iteration," that can teach itself to solve a Rubik's Cube with no human assistance. The trick that McAleer and co have mastered is to find a way for the machine to create its own system of rewards. Here's how it works. Given an unsolved cube, the machine must decide whether a specific move is an improvement on the existing configuration. To do this, it must be able to evaluate the move. Autodidactic iteration does this by starting with the finished cube and working backwards to find a configuration that is similar to the proposed move. This process is not perfect, but deep learning helps the system figure out which moves are generally better than others. Having been trained, the network then uses a standard search tree to hunt for suggested moves for each configuration.

The result is an algorithm that performs remarkably well. "Our algorithm is able to solve 100% of randomly scrambled cubes while achieving a median solve length of 30 moves -- less than or equal to solvers that employ human domain knowledge," say McAleer and co. That's interesting because it has implications for a variety of other tasks that deep learning has struggled with, including puzzles like Sokoban, games like Montezuma's Revenge, and problems like prime number factorization. The paper on the algorithm -- called DeepCube -- is available on Arxiv.

11 of 86 comments (clear)

Min score:

Reason:

Sort:

With one exception; the goal state by Anonymous Coward · 2018-06-17 02:28 · Score: 2, Insightful

Someone had to tell it what is a solution. If you give it a solved cube, that's assistance. Is it really that hard not to inflate headlines?
1. Re:With one exception; the goal state by PPH · 2018-06-17 04:19 · Score: 4, Funny
  
  If you give it a solved cube
  And you give it a scrambled cube. The AI shouts "Hey look! Haley's comet!" And while you are looking up, it switches them.
  Turing test: Passed.
  
  --
  Have gnu, will travel.
Re:Wow amazing! by TFlan91 · 2018-06-17 02:31 · Score: 4, Insightful

Games are easy for "AI" because games have strict rules that a modeler can account for/predict.
Odd definition of "without human help" by Entrope · 2018-06-17 02:44 · Score: 5, Insightful

This algorithm was able to figure out how to solve Rubik's Cube with no help from humans other than humans providing the (simulated) cubes, describing what the solution looks like, and designing an algorithm specific to solving Rubik's Cube?
Color me less than impressed.
1. Re:Odd definition of "without human help" by gweihir · 2018-06-17 05:42 · Score: 2
  
  Oh, it still is a nice result. But you are describing exactly the core problem with it: Everything was clear and described in simple, clear statements from the start. That is not how a real-world problem presents itself.
  
  --
  Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
Re:Yawn by infolation · 2018-06-17 02:50 · Score: 4, Insightful

From TFA:

it has implications for a variety of other tasks that deep learning has struggled with, including... problems like prime number factorization
If it could help with finding the prime factorization of large semi-prime numbers – ie two or more prime numbers that multiplied together result in a target original number - then that would be quite useful.

*cough* cryptography
Re:Yawn by AHuxley · 2018-06-17 03:01 · Score: 3, Funny

Assembling any type of IKEA furniture from the box?

--
Domestic spying is now "Benign Information Gathering"
Re:starting with the finished cube by DontBeAMoran · 2018-06-17 03:14 · Score: 2

The same way rich people "learn" how to become rich.

--
#DeleteFacebook
Actually that's a great idea: knapsack problem by goombah99 · 2018-06-17 03:20 · Score: 2

While Ikea furniture is designed with assembly in mind other things are not. Say for example, an airplane. So the assembly process might not be optimal. Letting the computer look for a more optimal process might be useful.
Or more practically, packing items into a shipping box. the famous knapsack problem.
I hate these slashdot summaries of algorithms. you end up thinking gosh that's stupid. When it's not. just the description is stupid. like a car analogy

--
Some drink at the fountain of knowledge. Others just gargle.
Re:Wow amazing! by religionofpeas · 2018-06-17 04:04 · Score: 3, Insightful

Games are easy for "AI" because games have strict rules
Just because the rules are strict (or even simple) does not mean that the game is easy. You can achieve arbitrary complexity by iterating the rules a large number of times. For example, the rules of Go are strict, the question whether a given board position is winning for white is hard. The rules of a programming language are strict. Writing a Linux kernel is hard. The rules of math are strict. Providing a proof for Fermat's last theorem is hard. The rules of physics and soccer are strict. Making a robot that can beat a human at the game is hard.
Something's not adding up by Dynedain · 2018-06-17 04:36 · Score: 2

Either the article writer didn't understand the whitepaper, or the researchers haven't actually done anything novel.

Having been trained, the network then uses a standard search tree to hunt for suggested moves for each configuration.

This works because the beginning state and end state of a Rubik's Cube are effectively identical. It's the same number of tiles, in a specific arrangement. As humans, we've defined the "solved" state to be all the tiles color-matched to a side. But the "solved" state could just as arbitrarily be any pattern or arrangement of colors across the cube.
Reversing the simulation to work backwards from the "solved" to some specific state of scrambled is exactly the same problem as starting from some specific state of scrambled and trying to get to the solved.

--
I'm out of my mind right now, but feel free to leave a message.....