Machine Figures Out Rubik's Cube Without Human Assistance (technologyreview.com)
An anonymous reader quotes a report from MIT Technology Review: [Stephen McAleer and colleagues from the University of California, Irvine] have pioneered a new kind of deep-learning technique, called "autodidactic iteration," that can teach itself to solve a Rubik's Cube with no human assistance. The trick that McAleer and co have mastered is to find a way for the machine to create its own system of rewards. Here's how it works. Given an unsolved cube, the machine must decide whether a specific move is an improvement on the existing configuration. To do this, it must be able to evaluate the move. Autodidactic iteration does this by starting with the finished cube and working backwards to find a configuration that is similar to the proposed move. This process is not perfect, but deep learning helps the system figure out which moves are generally better than others. Having been trained, the network then uses a standard search tree to hunt for suggested moves for each configuration.
The result is an algorithm that performs remarkably well. "Our algorithm is able to solve 100% of randomly scrambled cubes while achieving a median solve length of 30 moves -- less than or equal to solvers that employ human domain knowledge," say McAleer and co. That's interesting because it has implications for a variety of other tasks that deep learning has struggled with, including puzzles like Sokoban, games like Montezuma's Revenge, and problems like prime number factorization. The paper on the algorithm -- called DeepCube -- is available on Arxiv.
The result is an algorithm that performs remarkably well. "Our algorithm is able to solve 100% of randomly scrambled cubes while achieving a median solve length of 30 moves -- less than or equal to solvers that employ human domain knowledge," say McAleer and co. That's interesting because it has implications for a variety of other tasks that deep learning has struggled with, including puzzles like Sokoban, games like Montezuma's Revenge, and problems like prime number factorization. The paper on the algorithm -- called DeepCube -- is available on Arxiv.
:D :D :D :D
Traveler's diarrhea - Wikipedia
https://en.wikipedia.org/wiki/Traveler%27s_diarrhea
"Montezuma's revenge (var. Moctezuma's revenge) is a colloquial term for traveler's diarrhea contracted in Mexico."
This is great. Now that we have mastered Chess, Go, and Rubkis Cube all of these "researchers" will put them to work solving meaningful problems. Because AI. Right?
Computer are good for sequencing, imagine that. No wait, imagine real news about Stefan Halper instead.
Someone had to tell it what is a solution. If you give it a solved cube, that's assistance. Is it really that hard not to inflate headlines?
Yep one way humans solve tasks is to work backwards and forwards. Makes sense to employ an algorithm that works similarly.
This algorithm was able to figure out how to solve Rubik's Cube with no help from humans other than humans providing the (simulated) cubes, describing what the solution looks like, and designing an algorithm specific to solving Rubik's Cube?
Color me less than impressed.
wait, if this algorithm was "starting with the finished cube" how did it LEARN anything?
Sounds like GOAP - goal orientated action planning. You start at the end state then perform actions (in reverse) until you get to the current state. I read the article which doesn't tell you much more about how they did it. It sounds like they brute forced a bunch of moves to build up a tree then used A* on the tree. Then they trained a neural net on the brute-forced solutions. They talk about evaluating how close a cube state is to the goal state, but they don't explain how the AI determined that. It sounds like they hard coded what closeness means, so they're lying when they say the AI worked without human assistance. Without human assistance means they would have needed to use a GA, self-playing, or some other learning method to determine what closeness means. The article's "without human assistance" refers to them not hard-coding any move sequences. I consider that a very far stretch of the word. I'd call hard-coding moves as cheating. If you give it everything it needs, it turns an AI into an algorithm in my mind.
Here's a link to the paper from the article. I don't have the time to read it right now: https://arxiv.org/pdf/1805.07470.pdf
it has implications for a variety of other tasks that deep learning has struggled with, including... problems like prime number factorization
If it could help with finding the prime factorization of large semi-prime numbers – ie two or more prime numbers that multiplied together result in a target original number - then that would be quite useful.
*cough* cryptography
This is great. Now that we have mastered Chess, Go, and Rubkis Cube all of these "researchers" will put them to work solving meaningful problems. Because AI. Right?
That would be nice, actually. See, much of our problems in this World is because of humanity's stupid ape brain. It's easily manipulated and we all strive for ape goals.
Dominance. Control. Accumulation of wealth. Fighting over resources. etc ....
Not the things that really make us happy and healthy.
Humanity would be best served by AI overlords.
https://www.youtube.com/watch?...
Cryptofeces Lepidoptera Creimerus infestation is a serious problem. Not only are they capable of reproducing asexually like amoebas, they can also lay eggs hermaphroditically in unexpected places. They can disguise eggs as something useful to fool the unaware, sometimes pretending to be a haiku author, blogger, vlogger, or IT closet cleaner.
Very dangerous. They can seemingly reproduce out of the cosmic background radiation, even if you step on twelve of them, there's always one you miss.
Don't be fooled by the C. Lepidoptera Creimerus's innocuous, rolly-polly, and almost friendly appearance; despite its great size, stupid demeanor, and bedraggled toothless appearance, they have the hardiness of a tardigrade.
Only a concerted, targeted downmodding campaign has been shown effective in controlling this dangerous pest.
Experience shows that stopping such a campaign leads to C. Lepidoptera Creimerus returning within days.
Don't let it happen again!
Assembling any type of IKEA furniture from the box?
Domestic spying is now "Benign Information Gathering"
Find state of cube and look up the next move, the lookup table could also include every move from that point to solve with very little future work. Granted, creating that table efficiently is where the magic happens.
How close is A.I. in the field of cat-girls?
Searching for long forgotten Sokoban.exe game now... Found it.
While Ikea furniture is designed with assembly in mind other things are not. Say for example, an airplane. So the assembly process might not be optimal. Letting the computer look for a more optimal process might be useful.
Or more practically, packing items into a shipping box. the famous knapsack problem.
I hate these slashdot summaries of algorithms. you end up thinking gosh that's stupid. When it's not. just the description is stupid. like a car analogy
Some drink at the fountain of knowledge. Others just gargle.
The fastest way to reset a Rubik's Cube is to pull it apart and reassemble it. That works well on the original Rubik's Cube (3 x 3 x 3). Other variations (4 x 4 x 4, 5 x 5 x5, or 17 x 17 x 17) are increasingly difficult as the parts get smaller.
Goodbye, Slashdot!
So these 'Researchers' are calling an 'autodidactic iteration' what is otherwise known as embedding FOR loops with every iteration added to one of two collections: success or failure. Where is the innovation, exactly? Its increased in performance is attained solely by using newer generations of hardware -- give it older hardware and its performance would comparatively deteriorate.
If ever you've traveled, you know that Montezuma's Revenge is no game!
Have gnu, will travel.
Either the article writer didn't understand the whitepaper, or the researchers haven't actually done anything novel.
This works because the beginning state and end state of a Rubik's Cube are effectively identical. It's the same number of tiles, in a specific arrangement. As humans, we've defined the "solved" state to be all the tiles color-matched to a side. But the "solved" state could just as arbitrarily be any pattern or arrangement of colors across the cube.
Reversing the simulation to work backwards from the "solved" to some specific state of scrambled is exactly the same problem as starting from some specific state of scrambled and trying to get to the solved.
I'm out of my mind right now, but feel free to leave a message.....
All they did is have it start with a solved cube, build up a database of cube combinations and moves it took to get there, and then have it solve an cube by looking up the shortest move set to get from a solved cube to that combination...
You could do that for any simple problem but it's going to require building a lookup table which is all they really did. This is very old tech with new buzzwords slapped on.
People always say things like "the computer/machine did X". Like it isn't just a distilled frozen version of the behavior of its designers/programmers.
The entire machine was already built by humans. There is no such thing on this planet, as a machine that does anything without having been assisted to get there by a human.
This uses to be called the Brute Force method - find all possible solutions - then pick the solutions that fit the test case.
But this time it is novel "In an AI Computer" ?
See subject & answer this: Did YOU write this? https://tech.slashdot.org/comments.pl?sid=12238544&cid=56791244/ or was some unidentifiable anonymous fool impersonating you as they often do me in some puny attempt to "stir me up" vs. you?
* I've never personally seen you do THAT kind of post (submittal by AC & it was off your usual time-pattern) & as far as trolls doing it, wouldn't surprise me in the least - they constantly use "Bitch Tactics" & never fight a battle themselves, trying to get others to fight one another etc. so they "win" etc. - et al.
APK
P.S.=> That all said & aside, just asking for an honest answer from you... apk
There are countless videos online that show the same simple technique that can solve any rubix cube. You repeat the same movements over and over -- even blindfolded -- and its solved. Depending on your hand speed you can solve any combination in under a few minutes. I had my mom, who has never seen a rubix cube, repeat the same steps and she was able to solve it with ease. Then she asked what a Rubix Cube was for/about. 'nuf said.
Its hard for an AI to figure out what is the correct answer on its own...
See subject & I mirror your sentiments on what you said in regard to what you call (lol) "deficients" (I call them the "ne'er-do-well" not-men myself).
* I am assuming you told the truth & take you @ your word. Don't even assume it's me, it IS me & the dorks you describe have been "at me" for years now (since 2012 mostly).
APK
P.S.=> They are EXACTLY what you state & obviously, whoever does it to ME also has some "bone to pick" w/ you & for the reasons you state & THEY WERE TRYING TO "Goad me" into giving you guff!
(I do as you say you do to when others attack me - no "PC" from me then, they do NOT merit that much courtesy & I speak to 'em in a language they understand (wish it could be face to face sometimes using the "ultimate diplomat" which backs all law unfortunately) - crudity RIGHT BACK @ 'EM except loaded w/ truth about them being what we BOTH call them in our OWN ways (losers that can't stand winners & productive hard working guys since THEY just CANNOT compete))... apk
So reverse engineering is AI now..
Some trig tests consisted of proposed equalities. You had to determine which ones were valid. My technique was to assume it was true, and work backwards. The teacher objected to my starting off assuming it was true. So I wrote the steps from the bottom of the answer box to the top, announcing I had derived it the proposition from a known equality. Teacher couldn't say a damn thing. Just goes to show the quality of math teaching in the 1950s.
other tasks that deep learning has struggled with, including puzzles like Sokoban, games like Montezuma's Revenge,
It could have the solution to traveller's diarrhea?
Fancy name -- "autodidactic iteration." It's just another term for a heuristic, a way of characterizing how close a search through a problem space is to the desired solution. They've been used in machine learning and AI for decades. https://en.wikipedia.org/wiki/Heuristic
Someone already got a Guinness record for making a Rubik's Cube solving machine back in 2016. How is this anything new?
All this excitement is not called for as long as there is no real chance of replicating this.This is not science.
__,.--=#[How to deal with CRIMER on Slashdot]#=--.,__1)Why-are-people-upset-with-him? 2)What-can-I-do 3)What-are-his-names 4)Who-is-FatCashewsLovesMe 5)How-to-defeat-his-hustles 6)Why-are-there-dashes 7)Pastebin-Copy
1)Why-are-people-upset-with-himHe makes frequent low quality posts for two reasons:
Money) BASICALLY: He made thousands of shitty posts & bragged about how much money it made him.
DETAILS: He wants u to folow his referer links & pick up his cookie. Even if u dont buy what he linked but do buy somthing else from that site later on he often makes money;He ALSO tries to drive TRAFFIC to his BLOGS & vlogs.
Karma)Hi krma helps him disply his ads. He believes karma acumulates infinitely So he makes lots of pointles posts that r not bad enuf to mod down;hoping they wil get moded up;He was a raging ahole when he thought he had a karma surplus
2)What-can-I-do DOWNMOD u wil usually get more mod points. If he is postng from a new sock acount w/ krma, get his oldst posts first. DOWNMOD him and AC in fresh thrads early on;Metmods wil reward u. METAMOD his posts. REPLY ONLY ANONYMOUSLY to the most deeply nested coments in his threds it helps hide his posts. Dwnvote his SUBMISSIONS, he uses to get krma. REPORT HIM to slshdot & the afiliate progrms he is usng. DONT MENTION his 'brand names' c**mer.
3)What-are-his-namesMost famous:Cre1mer Cdre|mer ILoveFatCashews, Anonymous Cashews, The Fat Bastard aka TCDR
4)Who-is-FatCashewsLoveMe AKA Tardu Lardo,FCLM Funny & anoying; Not me or crimer;He keeps lookout for infestation
5)How-can-I-avoid-his-hustles - --===DONT FOLLOW HIS LINKS!!!===-- -
IF YOU MUST:Use a privte tab & nevr buy anything on the same sesion. If he fools u, close tab, cler the cookies for that site. There r sites other than yutube that wil let u watch his videos. I dont know if people view his contnt but I can pictre his jowls jigling at the thot of people subvrting his 'business model'
6)Why-are-there-dashes & weird stuffI know most only skim thse posts. I want the most imprtnt infrmton to pop out at a glnce & to keep it shrt. I dont use TCDRs name becase he may think tht he benfits from geting it indxed by serch engnes. Id lik 2 thnk TCDR & FCLM for editrl advce 7)TardCopy: http://archive.is/HYiM2