Predicting the Risk of Suicide By Analyzing the Text of Clinical Notes
First time accepted submitter J05H writes "Soldier and veteran suicide rates are increasing due to various factors. Critically, the rates have jumped in recent years. Now, Bayesian search experts are using anonymous Veteran's Administration notes to predict suicide risks. A related effort by Chris Poulin is the Durkheim Project which uses opt-in social media data for similar purposes."
That would make some sense if the suicide rate was around 50%. Thankfully it's much lower.
According to the study this is 67% effective. But, once this is applied to the general population you have an issue, because the vast majority of people are not suicidal. In the US, about 122 in 100,000 people attempt suicide a year, and about one in ten are successful. Even with a test that is 99% accurate, you are going to end up with somewhere around seven million false positives every year if you screen everyone.
It's refreshing to see predictive data analysis used for positive efforts, rather than simply selling more ads. Here's a call to action for all you data scientists at Twitter, FB, and other SV startups who think they're changing the world when all they're doing is putting money in their advertisers' pockets. News flash: statistics can be used to benefit society for a change.
Dictionaries are for loosers.
Let's say that they diagnose somebody as "mentally unfit"... what happens then? Do they get locked up "for their own protection" or something?
File under 'M' for 'Manic ranting'
Ironic that Venezuela had their violent crime rate drop by a factor of a thousand by removing guns from the citizenry.
It's worth noting that neither happened. The citizenry still has lots of guns and people are still dying at a pretty high rate (with 2014 starting at an even higher rate than 2013).
Well, if the suicide rate's 10%, just say "no soldiers commit suicide," and bam, you're 90% successful.
Um, this isn't about predicting the suicide rate or how likely someone in the general population is going to commit suicide.
It's about how likely a veteran who writes a suicide note and gives it to someone else is going to follow through and try to commit suicide.
That rate is probably closer to the flip of a coin than it is to 10%.
Sleep your way to a whiter smile...date a dentist!
Critically, the rates have jumped in recent years.
The rates aren't the only thing that've ah screw it.
systemd is Roko's Basilisk.
What they did was this: they identified 100 VA patients who committed suicide and then identified two "matched cohorts" who hadn't committed suicide, consisting of 70 patients each (one cohort had been hospitalized for psych reasons, the other hadn't). Then they gathered up all the doctors' notes and examined the frequency of all of the words and phrases occurring in the notes. Certain words and phrases occurred more frequently in the notes for patients who had committed suicide.
The single word which appeared to predict suicide most strongly was "agitation". Want to know which word was the second-strongest predictor of suicide? "Adequately". That's right, "adequately". Here are some of the other "predictor" words: "swab", "integrated", "Lipitor".
I guess the finding that "agitation" appears more frequently in the suicide cohort is of mild interest. (As the authors themselves point out, it simply confirms a piece of information that has already been well documented-- namely that agitated affect is a risk factor for suicide). The rest of it is obviously statistical noise. I don't know much about genetic algorithms or neural-net learning, but it seems to me that these techniques are being used to provide an end-run around any reasonable test for statistical significance.
One thing that the authors didn't comment on-- was the identity of the clinician a predictor for suicide? Maybe there were one or two clinicians who, for whatever reason, experienced a significantly higher suicide rate among their patients. (This would explain why "adequately" showed up so often-- every doctor has their own writing style with their own collection of pet phrases/words, and my guess is that certain doctors like to use the word "adequately" more often than others).