IBM Watson Reportedly Recommended Cancer Treatments That Were 'Unsafe and Incorrect'

Posted by BeauHD on Wednesday July 25, 2018 @10:40AM from the time-to-schedule-a-check-up dept.

An anonymous reader quotes a report from Gizmodo: Internal company documents from IBM show that medical experts working with the company's Watson supercomputer found "multiple examples of unsafe and incorrect treatment recommendations" when using the software, according to a report from Stat News. According to Stat, those documents provided strong criticism of the Watson for Oncology system, and stated that the "often inaccurate" suggestions made by the product bring up "serious questions about the process for building content and the underlying technology." One example in the documents is the case of a 65-year-old man diagnosed with lung cancer, who also seemed to have severe bleeding. Watson reportedly suggested the man be administered both chemotherapy and the drug "Bevacizumab." But the drug can lead to "severe or fatal hemorrhage," according to a warning on the medication, and therefore shouldn't be given to people with severe bleeding, as Stat points out. A Memorial Sloan Kettering (MSK) Cancer Center spokesperson told Stat that they believed this recommendation was not given to a real patient, and was just a part of system testing.

According to the report, the documents place the blame on the training provided by IBM engineers and on doctors at MSK, which partnered with IBM in 2012 to train Watson to "think" more like a doctor. The documents state that -- instead of feeding real patient data into the software -- the doctors were reportedly feeding Watson hypothetical patient data, or "synthetic" case data. This would mean it's possible that when other hospitals used the MSK-trained Watson for Oncology, doctors were receiving treatment recommendations guided by MSK doctors' treatment preferences, instead of an AI interpretation of actual patient data. And the results seem to be less than desirable for some doctors.

56 of 103 comments

So Watson is no worse than actual Doctors ? by Crashmarik · 2018-07-25 10:45 · Score: 2

Really where is the there, here ? You'll have doctors frequently dispute what the correct treatment is and with diseases like cancer it doesn't help that the best you can often do is offer a statistical improvement of someone's chances.
Far better that more people can afford treatment faster than this remain the province of the priesthood.
1. Re: So Watson is no worse than actual Doctors ? by aaronb1138 · 2018-07-25 11:46 · Score: 1, Troll
  
  Cancer is a huge money industry for medicine. This is why the huge focus is on screening / early detection, because those allow tons of unnecessary treatment for perfectly healthy people. People get done with treatment and get told they're in the clear. Everybody is happy and celebrates. Nobody sues for fraud when nothing was wrong in the first place.
  
  https://qz.com/1335348/google-is-building-virtual-agents-to-handle-call-centers-grunt-work/
2. Re: So Watson is no worse than actual Doctors ? by aaronb1138 · 2018-07-25 11:49 · Score: 3, Informative
  
  Dammit, wrong link copied over.
  
  https://fivethirtyeight.com/features/the-case-against-early-cancer-detection/
3. Re:So Watson is no worse than actual Doctors ? by Narcocide · 2018-07-25 12:50 · Score: 1
  
  12 years
4. Re: So Watson is no worse than actual Doctors ? by guruevi · 2018-07-25 14:07 · Score: 2
  
  I work with Med students. Even though the requirements are pretty high, there is no effort to keep people out for any reason.
  The problem is that the majority of the people failing first year fail because they want to be doctors for the money; they lack the drive to see it through when they are notified they'll have to spend 60h in a rotation for little to no pay.
  Doctors don't make big money until well after college, often spending several years as residents in various hospitals, following other doctors around and making $55k/year. ER doctors making $250k happens 10 years into your career after Med school.
  
  --
  Custom electronics and digital signage for your business: www.evcircuits.com
5. Re: So Watson is no worse than actual Doctors ? by datavirtue · 2018-07-25 14:30 · Score: 1
  
  I was going to say, this sounds like the run-of-the-mill malpractice you expect when receiving "cancer treatment" from a typical doctor.
  
  --
  I object to power without constructive purpose. --Spock
6. Re: So Watson is no worse than actual Doctors ? by datavirtue · 2018-07-25 14:33 · Score: 1
  
  Spot on.
  
  --
  I object to power without constructive purpose. --Spock
Watson: I suggest this to kill the cancer... by rnturn · 2018-07-25 10:51 · Score: 2

... but it will kill the patient. Is that a problem?
Doctor (shaking his head): Yes, Watson... that is a problem.
(Who trained Watson for this job anyway?)

--
CUR ALLOC 20195.....5804M
Using a screwdriver as a hammer by Tablizer · 2018-07-25 10:57 · Score: 5, Insightful

The purpose of such a tool should be to make suggestions that a doctor may not consider themselves. It should be up to the doctor(s) to vet the suggestions or leads before any treatment is actually rendered. A doctor would have to be born in Stupidville to accept bot suggestions as-is.

--
Table-ized A.I.
1. Re:Using a screwdriver as a hammer by WillAffleckUW · 2018-07-25 11:41 · Score: 3
  
  This is why you want Dr Who, not Dr Watson.
  Dr Who knows how to use a screwdriver, and she does it much better than Dr Watson does.
  
  --
  -- Tigger warning: This post may contain tiggers! --
2. Re:Using a screwdriver as a hammer by Tablizer · 2018-07-25 12:31 · Score: 1
  
  Dr. Who only knows how to use a sonic screwdriver. A muggle's screwdriver baffles the daylights out of her/him/it.
  
  --
  Table-ized A.I.
3. Re:Using a screwdriver as a hammer by WillAffleckUW · 2018-07-25 12:50 · Score: 2
  
  Dr. Who only knows how to use a sonic screwdriver. A muggle's screwdriver baffles the daylights out of her/him/it.
  She's The Doctor, not an Engineer.
  
  --
  -- Tigger warning: This post may contain tiggers! --
4. Re: Using a screwdriver as a hammer by billDCat · 2018-07-25 15:27 · Score: 1
  
  That is in fact what it does
5. Re: Using a screwdriver as a hammer by Tablizer · 2018-07-25 18:08 · Score: 1
  
  It's ultimately what the doctor does with the info that really matters. I would hope they are properly trained to use the system and know its limitations. Disclaimer notices wouldn't hurt as a reminder.
  
  --
  Table-ized A.I.
6. Re:Using a screwdriver as a hammer by Daralantan · 2018-07-26 00:56 · Score: 1
  
  I was going to say they need to make an IBM House.
  You'd end up with suggestions like punching the patients in the face, or abusing the staff. Good times.
Really no surprise by gweihir · 2018-07-25 10:59 · Score: 5, Interesting

This is a statistics-driven automaton that has zero insight or understanding. Calling it "AI" is a marketing lie, even if the AI field has given in and calls things like this "weak AI", which is the AI without "I". As such, this machine can find statistical correlations, but it cannot do plausibility checks, because that requires insight. It cannot do predictions either, because that also requires insight. The real strength of Watson (and it is quite an accomplishment) is that unlike older comparable systems, you can feed the training data and the queries into it in natural language. This means you can train a lot cheaper, but at the cost of accuracy, as the effect described in the story nicely shows.
It is time for this "AI" hype to die down. All it shows is that many people do not choose to use what they have in general intelligence and rather mindlessly follow a crowd of cheerleaders.

--
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
1. Re:Really no surprise by ShanghaiBill · 2018-07-25 11:13 · Score: 3, Insightful
  
  As such, this machine can find statistical correlations, but it cannot do plausibility checks, because that requires insight. It cannot do predictions either, because that also requires insight.
  Neither of these require "insight". They just require more data. With enough examples, statistical correlation is all you need.
2. Re:Really no surprise by Anonymous Coward · 2018-07-25 11:40 · Score: 1
  
  You'll never capture everything in the training set.
  In this case what was required is being able to read the medicine's instructions and do some common sense reasoning to see how it's relevant to the patient. Between reading and common sense we're well beyond what Watson is capable of.
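The kind of check described here can be sketched as a simple rule lookup. This is a hypothetical illustration, not how Watson works: the table, the function name, and the condition labels are made up, and the single entry reflects only the bevacizumab warning quoted in the story.

```python
# Hypothetical contraindication table: drug -> conditions that rule it out.
# The one entry mirrors the warning quoted in the story; it is not a clinical reference.
CONTRAINDICATIONS = {
    "bevacizumab": {"severe bleeding"},
}


def flag_unsafe(recommended_drugs, patient_conditions):
    """Return (drug, condition) pairs where a recommendation conflicts with the patient's state."""
    conditions = {c.lower() for c in patient_conditions}
    conflicts = []
    for drug in recommended_drugs:
        # Look up the drug case-insensitively; unknown drugs have no listed conflicts.
        for condition in sorted(CONTRAINDICATIONS.get(drug.lower(), set()) & conditions):
            conflicts.append((drug, condition))
    return conflicts


# The case from the story: a lung cancer patient with severe bleeding.
print(flag_unsafe(["chemotherapy", "Bevacizumab"], ["lung cancer", "severe bleeding"]))
```

A lookup like this only catches what someone has already written into the table, which is the point made above: the hard part is reading the label and deciding that it applies to the patient.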
3. Re: Really no surprise by phantomfive · 2018-07-25 12:59 · Score: 2
  
  The AI hype is sound and on solid footing compared to the blockchain hype: I've never seen so much effort poured into such a useless technology, Cthulhu be praised.
  
  --
  "First they came for the slanderers and i said nothing."
4. Re:Really no surprise by gweihir · 2018-07-25 15:26 · Score: 1
  
  You will never have enough data for that.
  
  --
  Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
5. Re: Really no surprise by gweihir · 2018-07-25 15:28 · Score: 1
  
  So you think something that does not exist is "solid" in comparison to something that does exist but is pretty useless? Strange priorities you have there...
  
  --
  Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
6. Re:Really no surprise by cascadingstylesheet · 2018-07-26 02:21 · Score: 1
  
  With enough examples, statistical correlation is all you need.
  A: We have to withhold this treatment because 100% of people with this condition last year died within a month.
  B: Were they treated for it?
  A: No, because we have to withhold treatment.
7. Re:Really no surprise by RespekMyAthorati · 2018-07-26 18:49 · Score: 1
  
  Only if you subscribe to gweihir's superstitious concept of intelligence.
Oops.... by erp_consultant · 2018-07-25 11:01 · Score: 2

I'll take Incorrect Diagnosis for $200, Alex.
1. Re:Oops.... by Tablizer · 2018-07-25 11:11 · Score: 1
  
  I'll take Incorrect Diagnosis for $200, Alex.
  "What is Trollitis?" ;-)
  
  --
  Table-ized A.I.
So they're no worse than doctors. by greenwow · 2018-07-25 11:01 · Score: 2

But is Watson cheaper than a doctor?
Too quick to judge? by alzoron · 2018-07-25 11:03 · Score: 1

The survival rate for lung cancer can sometimes be as low as 4% over five years. Even if the drug combination had a 90% chance to outright kill the patient, it might raise their overall chances of survival enough to actually be worth the risk. Based on what I know about lung cancer, dying from severe hemorrhaging could be preferable to the relatively slow, agonizing death some experience otherwise, especially if your overall chances of survival are higher.
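The trade-off in this comment can be put in rough numbers. A minimal sketch, assuming the 4% five-year figure from the comment and its purely hypothetical 90% treatment mortality; neither number is clinical data, and the function names are illustrative.

```python
def overall_survival(treatment_mortality: float, survival_if_tolerated: float) -> float:
    """Five-year survival when the treatment itself kills a fraction of patients."""
    return (1.0 - treatment_mortality) * survival_if_tolerated


def break_even_survival(baseline: float, treatment_mortality: float) -> float:
    """Survival rate needed among patients who tolerate the treatment
    for the gamble to match the untreated baseline."""
    return baseline / (1.0 - treatment_mortality)


baseline = 0.04             # figure quoted in the comment
treatment_mortality = 0.90  # hypothetical, from the comment's thought experiment

needed = break_even_survival(baseline, treatment_mortality)
print(f"break-even survival among those who tolerate treatment: {needed:.0%}")
```

Under these assumptions the gamble only pays off if more than 40% of the patients who survive the treatment are still alive at five years, ten times the quoted baseline.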
So? by 50000BTU_barbecue · 2018-07-25 11:06 · Score: 5, Insightful

How many human doctors did the same or worse?

--
Mostly random stuff.
1. Re:So? by SlaveToTheGrind · 2018-07-25 12:02 · Score: 2
  
  Asking society to put its trust in a machine with the justification that at its best it fucks up no more often than some humans at their worst is a non-starter.
2. Re:So? by AHuxley · 2018-07-25 14:13 · Score: 1
  
  Human doctors face peer review of all work in good advanced teaching hospitals.
  The best teaching hospitals can ensure only a nation's very best medical professionals are working every decade.
  
  --
  Domestic spying is now "Benign Information Gathering"
3. Re:So? by yusing · 2018-07-25 15:42 · Score: 1
  
  Yeah but ... no health benefits! no retirement! no vacations!
  Great deal for the vendors, not so much for their victims.
  
  --
  "You must try to forget all you have learned. You must begin to dream." -- Sherwood Anderson
4. Re:So? by Anonymous Coward · 2018-07-25 18:55 · Score: 1
  
  "Part of the system testing"
  I think what we are reading is leaked info from someone working on the trial who realizes that it's going frighteningly well.
  Basically:
  Watson will get things wrong. Especially in testing. It should not be used on its own for the foreseeable future. It needs a trained doctor to review the decisions... it is a remarkable assistant. It will only get better and bring a standard of healthcare to a vast number of people who could never afford to access/reach a doctor.
5. Re:So? by blindseer · 2018-07-25 20:10 · Score: 1, Offtopic
  
  I noticed you made no effort to disprove that white and asian students are being discriminated against based only on their race. I made my case that this racial discrimination exists. I'd like to see you prove otherwise.
  Saying that no one wants to see this data is provably false, numerous colleges and universities have been sued for this data. There are people that want to know. I'm sure that some schools sued over their blatantly racist admissions will fight for this to not come out. That's not because they want to protect their white male privilege, or not only because of that, but because if the claims are proven to be school policy (as opposed to some crazy coincidence) then people could end up in jail.
  If you can show that white male students were allowed to get into college without meeting the minimum admission requirements, stay in college when they should have been flunked out, and/or graduated even though they didn't meet the graduation requirements, then this would be breaking the law and I'd want to see it stopped. I don't want to see substandard students of any race get degrees they didn't earn so don't accuse me of being racist here.
  
  --
  I am armed because I am free. I am free because I am armed.
Term Squirm [Re:Really no surprise] by Tablizer · 2018-07-25 11:07 · Score: 5, Insightful

Calling it "AI" is a marketing lie
In practice the term "AI" is vague and continuous rather than a Boolean designation ("is" versus "is-not"). The term is not worth sweating over. The exception may be if you are making a big purchase and/or investment based on something being "AI". In that case, inspect it carefully rather than assume something with "AI" is smart and/or useful. But that's good advice for any significant purchase: test drive it & ask detailed questions rather than rely on the brochure.

--
Table-ized A.I.
1. Re:Term Squirm [Re:Really no surprise] by gweihir · 2018-07-25 15:26 · Score: 4, Insightful
  
  It actually is pretty Boolean: Use it for anything real and you are a liar. Because exactly nothing that deserves the description "AI" does exist. Qualify it with "weak" and you use an obviously inappropriate term.
  
  --
  Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
2. Re:Term Squirm [Re:Really no surprise] by Tablizer · 2018-07-25 18:14 · Score: 1
  
  Terms are ultimately defined by common usage, not necessarily by what's logical, clear, useful, or fair.
  Defining "natural intelligence" is sticky also. I remember debating for weeks over what "intent" means. Great nerdy fun. (This was before Emailgate, by the way.)
  
  --
  Table-ized A.I.
3. Re:Term Squirm [Re:Really no surprise] by gweihir · 2018-07-25 22:06 · Score: 1
  
  We are in science and engineering here. Terms have real meaning and are not defined by common use outside of that field.
  
  --
  Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
4. Re:Term Squirm [Re:Really no surprise] by Tablizer · 2018-07-26 07:59 · Score: 1
  
  The issue was "AI". If you can supply a precise and unambiguous definition, please do.
  Further, what it means colloquially (regular press) and what it means in technical journals could vary. The audience scope or target thus may also matter.
  
  --
  Table-ized A.I.
5. Re:Term Squirm [Re:Really no surprise] by RespekMyAthorati · 2018-07-26 18:48 · Score: 1
  
  It actually is pretty Boolean: Use it for anything real and you are a liar.
  
  Who the fuck appointed you the arbitrator of what's "intelligent" and what isn't?
  
  Besides, anybody who has read your previous posts knows that you consider
  intelligence to be some kind of supernatural hocus-pocus,
  so of course a machine can't have it.
The operative quote here... by GerryGilmore · 2018-07-25 11:10 · Score: 2

...is this: "A Memorial Sloan Kettering (MSK) Cancer Center spokesperson told Stat that they believed this recommendation was not given to a real patient, and was just a part of system testing."
Isn't this the kind of thing that testing is designed to uncover? It sounds to me like at least this part of the process is working, unlike the asshole who fed the model "fake data".
Sounds like a well trained AI by WillAffleckUW · 2018-07-25 11:38 · Score: 2

It just wanted to help impose pro-Darwinian responses to malformed genetic abnormalities.
Next up: self-driving cars that crash on purpose because their passengers sing songs the AI hates.

--
-- Tigger warning: This post may contain tiggers! --
Garbage In Garbage Out by kiviQr · 2018-07-25 11:47 · Score: 1

test data provide test results.
Garbage in, dead patients out... by Dread_ed · 2018-07-25 11:49 · Score: 2

So the data fed to train Watson wasn't from actual cases? Why does it matter what the computer prescribed, then? The system that is Watson is only as good as the data you feed it. Feed it fake information, get not even wrong results. Sounds more like a smear campaign, intentionally designed to fail, and certainly not an experiment designed to measure Watson's recommendations against actual doctor recommendations.
Here's a better idea...
Feed the damn thing actual patient records with everything included from first immunization to the patient's ultimate death. If you are looking to see if there are any correlations that humans haven't already made you need to feed that sucker as much data as is inhumanly possible and then let it do the work.
What we have now is a pseudepigrapha of Watson's capabilities. Sure the results are from Watson, but they are not what Watson would do if given accurate, real life data to work with. They made a forgery of the system and put Watson's name on it.
Shady, bro. Shady...

--
When the only tool you have is a claw hammer every problem starts to look like the back of someone's skull.
1. Re:Garbage in, dead patients out... by omnichad · 2018-07-25 14:24 · Score: 1
  
  And how do you resell services from a data model that contains HIPAA-protected data?
Garbage in, garbage out by blindseer · 2018-07-25 11:51 · Score: 5, Interesting

An AI can only be as good as the data used to train it. The article pointed out that Watson's training data was possibly based as much on the subjective preferences of the physicians who fed it as on objective data.
I recall reading an article about someone doing a study on medical procedures done throughout the USA and they noticed "hot spots" of procedures being done in certain areas. What they found was that in these places they'd see physicians that would recommend procedures out of personal preference. One example was an area with a lot of tonsillectomies, because a physician felt that any throat infection meant the tonsils had to come out. Another area had an elevated number of hysterectomies, because a physician felt that post-menopause women had an elevated risk of developing cysts and cancers on the uterus. The article went on to say that while such treatments may be unusual no one was willing to consider this malpractice.
So, Watson recommended a treatment for someone that might aggravate an existing problem of severe bleeding. Is this bad coding for not taking this into account? Or, is there a physician that entered such a prescription for their patient with similar symptoms? It's real difficult to second guess a physician. It's real easy to second guess the computer. Even if both the computer and the human came to the same recommendation for treatment.

--
I am armed because I am free. I am free because I am armed.
1. Re:Garbage in, garbage out by The+Evil+Atheist · 2018-07-25 15:56 · Score: 1
  
  What? ML solutions are programs. It is vastly easier to figure out what went wrong in them than in a human brain. You really want to claim that it's easier to figure out what went wrong in a human mind? In the instances where we can work it out, it is only due to self-attestation of what they were thinking at the time, which is not accurate and is subject to ego. And the self-attestation is also biased, leading to corrections that may not address the root of the problem.
  
  --
  Those who do not learn from commit history are doomed to regress it.
2. Re:Garbage in, garbage out by cascadingstylesheet · 2018-07-26 02:18 · Score: 1
  
  So, you're holding it wrong?
Re:Trump emolument case to proceed! GET A ROPE by Anonymous Coward · 2018-07-25 12:41 · Score: 2, Informative

You mean imposing sanctions, killing hundreds of Russian soldiers, giving $200 million in weapons to the Ukraine, expressly rejecting Russia's takeover of the Crimea, pushing to put US troops and their missile shield into Poland, increasing fracking to drive down the price of oil, trying to force Europe to stop buying Russian gas and increase their militaries...?
Trump has already done more to oppose Russia than Obama ever did - Obama didn't have the guts to enforce his own "red-line" in Syria.
But when Obama makes nice with a Russian 'reset' and asks Putin to help him win his election, it's all good. When Trump says he doesn't think Putin's hackers changed any votes, it's treason. Right.
Hmmmm... by yusing · 2018-07-25 15:40 · Score: 2

What would happen if we started calling Ai 'Fake Intelligence' ... Fee Fi Foes?
As I understand the current fashions, AI has a fatal flaw: its result is non-deterministic ... no one can be sure how it arrives at an answer. That might be okay for face recognition, or 'computer art' ... but for locating potential automobile collision victims, or deterministically arriving at a sound treatment for a patient? Wrong model.
I'd guess that the 'expert systems' of 20 years back outperform neural nets. Their logic trees were scrutable.

--
"You must try to forget all you have learned. You must begin to dream." -- Sherwood Anderson
so what? by SuperDre · 2018-07-25 20:25 · Score: 1

Right now it's still in an early learning process, and it's a tool to help doctors. So what if, at this point in development, it recommends an unsafe/incorrect treatment? It's not like doctors are right all the time, and doctors have also been well known to prescribe wrong treatments. Or maybe the system did know about it, but calculated the risk factor of the patient dying anyway if he didn't get treatment.
But we're still at the beginning of having AI determine stuff like this, and yet Watson is already very well on the way to becoming better than most doctors, so let's see how Watson handles it in ten years, maybe by then I'd rather have Watson diagnose me than a real doctor..
AI and BiG Data by OneSizeFitsNoone · 2018-07-25 22:33 · Score: 1

Watson just needs a big data cache of real-life human deaths to learn how to cure cancer.
Re: jellomizer is a moron by jellomizer · 2018-07-26 00:14 · Score: 1

IBM will just be lazy. However if they can get their system to have measurable results they can sell more.

--
If something is so important that you feel the need to post it on the internet... It probably isn't that important.
obligatory by bigdavex · 2018-07-26 01:48 · Score: 1

Unless you combine it with dilaftin.
Which any first-year should know is
the standard prep medication your patient
was taking before surgery. Your patient
should be dead.

--
-Dave
Re: jellomizer is a moron by Hentai007 · 2018-07-26 03:11 · Score: 1

More like Hippocratic Oaf, am I rite?
Yet more evidence that so-called AI is crap by Rick+Schumann · 2018-07-26 04:24 · Score: 1

For the second time today we see evidence that the poor excuse for AI they keep trotting out, in this case probably the most advanced version of it, even, is crap. I maintain that without understanding how a biological brain actually is able to think, there's no way these throw-it-at-the-wall-and-see-if-it-sticks guesses at an approach are going to ever be real AI -- and since we don't have the instrumentality to really truly see how a biological brain works, and map its connections, in a living subject, we'll have to wait for that technology to be invented before we'll have any chance at real artificial intelligence. All your so-called 'deep learning algorithms' just don't cut it and never will; at best they're a small part of the real approach, one component in a vast system of interconnected systems whose workings we haven't even scratched the surface of yet. And some of you people want to entrust your lives and the lives of your families to these. Madness.
Re:Trump emolument case to proceed! GET A ROPE by sexconker · 2018-07-26 11:10 · Score: 1

"You'd have to show that he was being directly influenced by a foreign official or head of state"
All the moves he's made in Russia's favor and the disgusting sycophancy he's shown around Putin is raising and should raise a lot of questions.
It's called diplomacy.
I'm sorry if you think wanting to maintain good relations with our allies (yes, Russia is our ally) is bad. I'm sorry if you think peace between the Koreas is bad.