Misleading Results From Widely-Used Machine-Learning Data Analysis Techniques (bbc.com)

← Back to Stories (view on slashdot.org)

Misleading Results From Widely-Used Machine-Learning Data Analysis Techniques (bbc.com)

Posted by EditorDavid on Saturday February 16, 2019 @08:37AM from the I-see-what-you-did-there dept.

Long-time Slashdot reader kbahey writes: The increased reliance on machine-learning techniques used by thousands of scientists to analyze data, is producing results that are misleading and often completely wrong, according to the BBC.

Dr. Genevera Allen from Rice University in Houston said that the increased use of such systems was contributing to a "crisis in science".

She warned scientists that if they didn't improve their techniques they would be wasting both time and money. Her research was presented at the American Association for the Advancement of Science in Washington.

This is the oft-discussed 'reproducibility problem' in modern science.
The BBC writes that this irreproducibility happens when experiments "aren't designed well enough to ensure that the scientists don't fool themselves and see what they want to see in the results." But machine learning now has apparently become part of the problem.

Dr. Allen asks "If we had an additional dataset would we see the same scientific discovery or principle...? Unfortunately the answer is often probably not.â

2 of 23 comments (clear)

Min score:

Reason:

Sort:

Not surprised by Anonymous Coward · 2019-02-16 09:09 · Score: 4, Interesting

I worked as a ML researcher in a science lab. Was often asked for results they wanted rather than good methodology, which I pushed back hard on, but the lab frequently contracted out analysis and then chose which results they liked best for publication. They got a few publications in Nature. Don't trust the ML results of any science paper unless they fully present and you understand their data, methodology, and statistics, and even then take things with a grain of salt.
Re:Black Box by epine · 2019-02-16 14:00 · Score: 3, Interesting

sexy instant gratification
Deep neural networks barely made it through a decade-long siege of Leningrad where it became so unfashionable it was almost left to die in the snow. Is that your definition of "instant gratification"?
Humans are equally terrible at articulating many of our fundamental skills. Even grand master chess players only manage to articulate a pedagogical narrative, and not the real thing.
It does bug me sometimes that people forget that 90% of the reason we like our machines is they provide complementary abilities: massive databases with total recall, blinding fast arithmetic, rarely ever making an error, sub-microseconds reaction times rather than tens of milliseconds. Where we're at now is substituting mechanical systems that overlap key human competences, where the mechanical system is nowhere near as good on many dimensions, but nowhere near as erratic as human performance, either.
Finally, wherever did this idea originate that big messy systems were going to have clean analytic decompositions?
Back in the 1950s the excuse for this view was that when you only have a hammer, everything looks like a nail. When you're limited to a few kilobytes of memory, the computer is applicable to a few classes of extremely analytic systems, where no part is giant and messy. But actually, DNN systems for machine translation require hundreds of megabytes. Because human language is extremely messy. In NLP, the GOFAI agenda was only ever aimed at some kind of highly constrained conlang, which encapsulated a dense, proposition nucleus (completely bereft of metaphor) entirely unlike any human language ever spoken.
At no point in the last forty years have I not regarded GOFAI as some kind of adolescent SF fantasy reified.
Do you look at Winograd's work from 1970 and see a glass half full or a glass half empty? It was cool for its day, but as a software engineer, I always thought to myself "this dog doesn't scale". And I was right. There was no era of SHRDLU 2.0 or SHRDLU 3.0. The analytic complexity in this domain scaled far faster than the analytic ingenuity of Terry Winograd's graduate students.
So much for Lisp. Then along came Prolog: another scaling disaster.
Perhaps once we refine the DNN and invent the first DNN rectifier (mapping a messy world onto a clean, orthogonal conceptual world) maybe we'll finally find a good home for the kind of cleverness we once thought of as the whole AI cheese plate.