Slashdot Mirror


Misleading Results From Widely-Used Machine-Learning Data Analysis Techniques (bbc.com)

Long-time Slashdot reader kbahey writes: The increased reliance on machine-learning techniques used by thousands of scientists to analyze data, is producing results that are misleading and often completely wrong, according to the BBC.

Dr. Genevera Allen from Rice University in Houston said that the increased use of such systems was contributing to a "crisis in science".

She warned scientists that if they didn't improve their techniques they would be wasting both time and money. Her research was presented at the American Association for the Advancement of Science in Washington.


This is the oft-discussed 'reproducibility problem' in modern science.

The BBC writes that this irreproducibility happens when experiments "aren't designed well enough to ensure that the scientists don't fool themselves and see what they want to see in the results." But machine learning now has apparently become part of the problem.

Dr. Allen asks "If we had an additional dataset would we see the same scientific discovery or principle...? Unfortunately the answer is often probably not.â

23 comments

  1. 12 Years Left Anyways by Anonymous Coward · · Score: 0

    The nutter Amerikuks are sure Zombie Jeebus is going to rapture them all away from the consequences of their selfish luves so fuck it! The Earth will go on, humanity.. answer not clear, try again.

    1. Re: 12 Years Left Anyways by Anonymous Coward · · Score: 0

      Oh but who needs CS? I just get this feeling banging away at my KB and mouse.

    2. Re: 12 Years Left Anyways by Anonymous Coward · · Score: 0

      I put on my robe and wizard hat

  2. Is the AI honeymoon over? by Anonymous Coward · · Score: 0

    Seems like a little run on growing distrust with AI. Predictive policing fails, image recognition Tom Foolery, and now basic science falling victim. Please, somebody think of the children!

  3. Black Box by jythie · · Score: 1

    Something I never liked about machine learning, and 'new fangled AI' in general is how opaque it is, you get fast interesting results but you can not explain how you got them or defend them directly. But GOFAI techniques are out of style right now, and it is getting worse as GOFAI systems are so much slower and resource intensive not to mention require so much more domain knowledge to set up and just can not compete with the sexy instant gratification that machine learning can give you.. or give your customers/sponsors.

    1. Re: Black Box by Anonymous Coward · · Score: 0

      In economy full of SJWs, n1ggers, and shitty minorities, you will get shitty n1gger AI.

    2. Re:Black Box by serviscope_minor · · Score: 1

      I had to look up "GOFAI". Turns out it refers to symbolic reasoning type systems (good old-fasioned AI).

      y. But GOFAI techniques are out of style right now, and it is getting worse as GOFAI systems are so much slower and resource intensive not to mention require so much more domain knowledge to set up and just can not compete with the sexy instant gratification that machine learning can give you..

      Well yeah... and symbolic systems haven't delivered the results. Take for example the bit of machine learning which responds to "hey siri" or whatever Alexa responds to (no idea, I don't want either kind of spy-box in my house). It really doesn't matter why or how it works, if it works 99% of the time instead of 100% and so on.

      It's a useful (apparently) product anyway. There are countless examples of things like that. If AI or ML isn't an end in itself, but part of a product, then that explains why GOFIA is not very popular.

      On the other hand Facebook using GANs to hallucinate MRI data for medical purposes? Holy shit that is not going to end well.

      --
      SJW n. One who posts facts.
    3. Re:Black Box by jythie · · Score: 1

      True, they struggle to, as you say, deliver results. But what they do produce, you can explain and validate. ML gives you answers, quickly, which is great as long as the answers don't actually matter. That has made it great for recommendations and toys, but is really alarming when applied to more serious stuff.

      Moving further away from consumer products and services, there is the worrying limitation of ML that, well, you don't really learn anything from it. Seeing it used in research always worries me since it means the researcher doesn't understand the relationships and is hoping a black box will give them an answer, an answer they can't understand since it can't explain, and no one else can validate. Again, great when you want an answer but don't care what it is, if it is right, or leaning anything from it.

    4. Re:Black Box by epine · · Score: 3, Interesting

      sexy instant gratification

      Deep neural networks barely made it through a decade-long siege of Leningrad where it became so unfashionable it was almost left to die in the snow. Is that your definition of "instant gratification"?

      Humans are equally terrible at articulating many of our fundamental skills. Even grand master chess players only manage to articulate a pedagogical narrative, and not the real thing.

      It does bug me sometimes that people forget that 90% of the reason we like our machines is they provide complementary abilities: massive databases with total recall, blinding fast arithmetic, rarely ever making an error, sub-microseconds reaction times rather than tens of milliseconds. Where we're at now is substituting mechanical systems that overlap key human competences, where the mechanical system is nowhere near as good on many dimensions, but nowhere near as erratic as human performance, either.

      Finally, wherever did this idea originate that big messy systems were going to have clean analytic decompositions?

      Back in the 1950s the excuse for this view was that when you only have a hammer, everything looks like a nail. When you're limited to a few kilobytes of memory, the computer is applicable to a few classes of extremely analytic systems, where no part is giant and messy. But actually, DNN systems for machine translation require hundreds of megabytes. Because human language is extremely messy. In NLP, the GOFAI agenda was only ever aimed at some kind of highly constrained conlang, which encapsulated a dense, proposition nucleus (completely bereft of metaphor) entirely unlike any human language ever spoken.

      At no point in the last forty years have I not regarded GOFAI as some kind of adolescent SF fantasy reified.

      Do you look at Winograd's work from 1970 and see a glass half full or a glass half empty? It was cool for its day, but as a software engineer, I always thought to myself "this dog doesn't scale". And I was right. There was no era of SHRDLU 2.0 or SHRDLU 3.0. The analytic complexity in this domain scaled far faster than the analytic ingenuity of Terry Winograd's graduate students.

      So much for Lisp. Then along came Prolog: another scaling disaster.

      Perhaps once we refine the DNN and invent the first DNN rectifier (mapping a messy world onto a clean, orthogonal conceptual world) maybe we'll finally find a good home for the kind of cleverness we once thought of as the whole AI cheese plate.

    5. Re:Black Box by serviscope_minor · · Score: 1

      That has made it great for recommendations and toys, but is really alarming when applied to more serious stuff.

      I don't entirely agree. I mean I accept your point, but I don't agree with the conclusion in all cases. Take self driving cars for example. These are heavily dependent on ML, and certainly important.

      I reckon though we don't *need* to know why exactly it does the things it does, because humans aren't statistically very good drivers. The aim isn't to replace a perfect system with a cheaper one, it's to replace a tired, angry, distracted system with a better one.

      Again, great when you want an answer but don't care what it is, if it is right, or leaning anything from it.

      Or if you only want it to be on average better than another practical way of getting the answer. I think that use case is important.

      --
      SJW n. One who posts facts.
    6. Re: Black Box by Anonymous Coward · · Score: 0

      That was almost an insightful comment were it not so shitty and full of everything he hates bringing it to the forefront and making it present but really contributing nothing overall. Welcome to /. ...

  4. Not surprised by Anonymous Coward · · Score: 4, Interesting

    I worked as a ML researcher in a science lab. Was often asked for results they wanted rather than good methodology, which I pushed back hard on, but the lab frequently contracted out analysis and then chose which results they liked best for publication. They got a few publications in Nature. Don't trust the ML results of any science paper unless they fully present and you understand their data, methodology, and statistics, and even then take things with a grain of salt.

    1. Re:Not surprised by Anonymous Coward · · Score: 0

      "If we had an additional dataset would we see the same scientific discovery or principle...? Unfortunately the answer is often probably not

      Wouuldn't this sentence from the summary suggest overfitting or overtraining the networks?

    2. Re: Not surprised by Anonymous Coward · · Score: 0

      That is exactly the process of ML. It can work well for finite games such as tictactoe, chess and go. It can approximate enough to fool cursory glance. It can procedurally generate a bit more variety than traditional methods. But it ain't science or even sound statistics.

    3. Re: Not surprised by Anonymous Coward · · Score: 0

      It shows that she doesn't have a clue what she's talking about. Pure publicity.

    4. Re: Not surprised by mSparks43 · · Score: 1

      Pretty much my take. This repeatability problem has absolutely nothing to do with ML and everything to do with researchers hammering a dataset into a form that says what they wanted to say when they first started the research. Something that has been happening since before the invention of the transistor.

      Only reason this is coming to light now is the likes of sci hub made the garbage widely accessible, rather than to only those few who already knew it was probably bs before they opened it.

  5. Look Further by fish · · Score: 1

    Yes, wel, that is what one does, look for patterns in the data. But the idea is that the data is a good representation of the real world, and that the patterns you find can be generalised to something useful. If you are just drawing conclusions from whatever your machine learning algorithm finds in the data, you need to look over your method, research questions and evaluation.

    (Discalimer, the article doesn't give any details, and briefly mentions astronomy and biomedical research, areas I am not too familiar with, but I would think that what I mentioned is common practice in all scientific research.)

    1. Re: Look Further by Anonymous Coward · · Score: 0

      Mathematically speaking the percentage of patterns in data that are not an overfit of the real phenomena ought to approach zero as the data set size increases

  6. Nothing changed from when AI was PCA ... by foobsr · · Score: 1

    It was a common phenomenon to be observed ever since a complex methodology existed that researchers, especially the most successfully extraverted, did not understand what they were doing, analysis-wise. But it is a relief, I suppose, for machine-learning metholodists, that they for sure find the find-what-I-want switch easily. Amen ...

    --
    TaijiQuan (Huang, 5 loosenings)
  7. idiocracy by Anonymous Coward · · Score: 0

    This is exactly how a world without education can seemingly function. Idiocracy is not an impossible parody, but one possible future.

  8. Like science doing a Rorschach test. by Anonymous Coward · · Score: 0

    Q: "What do you see here?"
    A: "Ummm... a meerkat!"

    It reveals more about the science community's collective subconscious, I guess.

    1. Re:Like science doing a Rorschach test. by Anonymous Coward · · Score: 1

      More like:

      Q: Do you see a meerkat in this inkblot?
      A: Well, now that you mention it, yea...kinda

      Just tweak the parameters until you prove your hypothesis