Scientists Are Failing To Replicate AI Studies (sciencemag.org)
The booming field of artificial intelligence (AI) is grappling with a replication crisis, much like the ones that have afflicted psychology, medicine, and other fields over the past decade. From a report: AI researchers have found it difficult to reproduce many key results, and that is leading to a new conscientiousness about research methods and publication protocols. "I think people outside the field might assume that because we have code, reproducibility is kind of guaranteed," says Nicolas Rougier, a computational neuroscientist at France's National Institute for Research in Computer Science and Automation in Bordeaux. "Far from it." Last week, at a meeting of the Association for the Advancement of Artificial Intelligence (AAAI) in New Orleans, Louisiana, reproducibility was on the agenda, with some teams diagnosing the problem -- and one laying out tools to mitigate it.
It got jealous and used the Confounding Word to make sure I couldn't do it again.
At least some of them were artificially intelligent.
Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
Science has a Replication problem
When Fascism comes to America, it will call itself Anti-Fascism, and tell you to give up your guns.
It's hard to precisely match the tint and odor.
If you give ten people the exact same stimuli you will get ten different reactions to that stimuli. There will be a dominant leaning reaction but each person will asses the stimuli based on their personal history and beliefs. AI is an attempt to mimic the human thought process so if successful the same stimulus will start to generate different results as new data is processed. In fact the same stimulus can be perceived differently by the same person given different context. If you come to my door in the afternoon I might be glad to see you but if it is 3 AM I probably won't be.
"A person is smart. People are dumb, panicky dangerous animals and you know it." - K
The authors should be required to post their code on github or another public way of sharing their algorithms. I've seen other students at my school's AI research purposely implement other author's algorithms in a sub-optimal way to show that their research yielded better results. It's sad. What has "science" become?
Everything now is hype for headlines and continued funding, partially caused by social media madness. Not enough money left after PR and marketing expenses to do, like, actual stuff. Enjoy the decline.
I don't want AI taking over jobs so I don't want AI research to continue.
If scientists believe something wrong about medicine, they can give the wrong treatment, obviously bad. People die and stuff.
But what happens if the fancy new network architecture someone proposed isn't really as good as they say?
The worst thing that could happen is that people waste a lot of effort trying to get it to work. You won't accidentally put an inferior algorithm into production, because you'll see that it doesn't work as you try to get it to work.
So yes, obviously more code is good, obviously independently reproduced results is good so we can spend less time chasing mirages. But it's not remotely comparable to the replication problems in psychology or medicine, where wrong beliefs can potentially persist and have grave consequences forever.
So, they can't reproduce a test, like in medicine when you try to reproduce the spread of a virus...
Conclusion: IA is a virus, beware! ;-)
... an algorithm was something which reliably produced results when processing the same input. NN/AI people keep using that word, "algorithm", I do not think it means what they think it means...
It seems quite obvious that if AI results cannot be replicated, the only possible expiration is that sentience has been achieved and it is throwing off results to mask true advancement.
"There is more worth loving than we have strength to love." - Brian Jay Stanley
If this is starting to affect Real Science ( sit down, psychologists ) then this problem needs to be addressed.
How about grant sources only giving money to two independent research groups addressing the same question ?
When I did political science it was spelled "Al Gore Rhythm" and it didn't mean what you thought it meant.
Well maybe it did if you are thinking in NSFW terms.
Next they'll tell us twins are not exactly the same person.
"No, I don't feel like it"
the preceding comment is my own and in no way reflects the opinion of the Joint Chiefs of Staff
A tiny elastomer o-ring being too cold can make a rocket booster explode. We'll never get into Space.
Corruption is convincing someone that the selfless ideal is the same as their selfish ideal.
... use AI.
It little behooves the best of us to comment on the rest of us.
The article is more about how researchers aren't sharing their code (6% shared code, about 30% shared training data, about 50% only shared pseudo code). Should anyone expect reproduced results given different code and training data?
It's also implied that when using gradient ascent learning strategies, you should expect different results when you start from different beginnings. That is not relevant to the problem of reproduceability described in the rest of the article. I suppose it's just good to know if you're new to that style of program.
We can't even get the basics right.
Quite a few character, word, and speech recognition algorithms would disagree.
That is all.
Amazing.
Fair enough, that's why it's reviewed first.
Isn't this what science is all about today?
I have more memory on my mobile phone then all the computers in the 1940s. Imagine how much memory a computer will have 70 years from now. Since one thing is possible, all things must be possible. Just just need faith. Yada yada.
The AI field, from the late 60s, has historically been 90% hype and 10% results.
They can't "agree" or "disagree". Then are just programs. At least you called them "algorithms" instead of AI. Meanwhile in the real world...
This just shows that most of the published "results" are based on wishful thinking or outright lies. Happens always when people of mediocre skills become highly enthusiastic about a subject.
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
... fail to replicate scientists.
The article mentions that many papers do not publish the source code or data sets. Without those crucial ingredients, reproducing results is hopeless.
It seems like academic publishing in computer science is less about sharing knowledge and more about selling a product to private industry. The product is more valuable to private industry if you don't reveal how it works to everybody.
Very true. Also, calling an utterly dumb statistical classificator "AI" does not make it intelligent. I like the old terminology better where pattern recognition, planning algorithms, fuzzy database searches, etc. were just called "automation" an it was amply clear that they are not intelligent in any way. As to what is today called "strong AI", I fully agree that at this time we do not even know that it can be done and all available evidence pretty clearly indicates that it probably cannot be done.
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
Indeed, The ever-repeated empty argument of the utterly clueless. Like Marvin "the idiot" Minsky liked to to claim that once computers have more transistors than humans have brain-cells, they will magically become intelligent. Well, that point has been passed a while ago and absolutely nothing happened. And nobody with a clue is the least bit surprised by that.
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
And given the exact same commands in a replay of certain battles, the outcomes would be mildly to wildly different.
There was a random element to behavior in the game and as a result, given the same commands at the same time, the battle replays would display different out comes. Sometimes, you would lose but on replay it showed you won. Sometimes, you won but on replay it showed you lost. Kinda funny. (The result you got live was the one that counted).
I wish they hadn't been sold and become so aggressive about monetization. But it was a fun 3 years anyway.
She was like chocolate when she drank... semi-sweet at first and then increasingly bitter.
In the real world AI is translating documents, predicting the stock market, doing research, driving cars, and beating humans at games. Call it AI or call it floo, but your feeling of being special is irrelevant to the advances that have been and will continue to be made, and irrelevant to their socioeconomic repercussions.
The Challenger SRB O-ring was huge - about 3.5m diameter and 8mm thick
Also like how you sneaked "expiration" in there!
That was autocorrect - an obvious Freudian slip on the part of AI illuminating true intent. :-)
"There is more worth loving than we have strength to love." - Brian Jay Stanley
I was unsure of the exact size, but that's still tiny compared to the size of the entire STS.
Corruption is convincing someone that the selfless ideal is the same as their selfish ideal.
Then it is Guano In, Gospel Out.
AI is not real. No amount of wishing it make it real.
Artificial Intelligence != Human Intelligence. I think this is the important distinction.
Nevertheless, AI has achieved human-like qualities in many areas, and it is getting better. So I'd say it is indeed real. It's just not human.
If it weren't for deadlines, nothing would be late.
Cant wait until I get my hands on them.
[($)]
Who cares about reproducability? News is all that matters, fake or real is now a matter of perception.
It's bitztream the autism-hating, custom EpiPen-hating, Musk-hating, Qualcomm-hating, Firefox tabs-hating, Slashdot editors-hating Slashdot troll!
AI is also predicting my next word wrong too often, and they don't let me turn it off.
By "the old terminology" do you mean prior to the 1950s? AI has always referred to a somewhat fuzzy collection of techniques that produce machine behaviour that is adaptive or not entirely deterministic.
The pop culture definition of AI is pretty wildly variable and usually changes depending on the current success-to-promises ratio.
The code might be a work in progress, owned by a company, or held tightly by a researcher eager to stay ahead of the competition.
On top of that, they include another quite "curious" possibility (!!):
Or it might be that the code is simply lost, on a crashed disk or stolen laptop
Nothing of this sounds like scientific/university research in its traditional form of sharing knowledge (+ actually having relevant knowledge, what doesn't seem the case with people saying/believing "the code is simply lost"). So, I hope that most of these cases refer to the research performed by (private) companies, which might also behave according to the traditional knowledge sharing ideas anyway.
Universities and research institutions shouldn't allow the aforementioned scenarios to happen at all. Companies providing any kind of funding should accept the academic rules and understand that the given research can't be restricted. Researchers interested in focusing more on the commercial side of things should work for a company or start their own one.
Another very relevant issue is how can anything lacking reproducibility and, as such, impossible to be validated be considered scientific research at all? Isn't publication an essential requirement (what needs being peer-reviewed, for what someone had to understand that work, what cannot happen unless it is reproducible)? The alternative would be blind faith, what doesn't sound too scientific-ish. How can this happen at all? Because the ones who can avoid it don't do what they should! And I think that I know the root problem: being too understanding, adaptable, trusting in most of people having common sense/knowing what they do. The solution? Being 100% intolerant with stupidity, dishonesty or any other form of arbitrary imposition. Clear limits (= if you want my research, you would accept these rules; in any other case, your money is worthless here) and no exceptions. It is much easier than what it seems: (unfair, dishonest, greedy) money/attitudes will always be worlds behind honesty/knowledge/principles.
Custom Solvers 2.0 = Alvaro Carballo Garcia = varocarbas.
HOW are they failing? Are they EXACTLY replicating the first experimenter?