Researchers Developing An Algorithm That Can Detect Internet Trolls
An anonymous reader writes Researchers at Cornell University claim to be able to identify a forum or comment-thread troll within the first ten posts after the user joins with more than 80% accuracy, leading the way to the possibility of methods to automatically ban persistently anti-social posters. The study observed 10,000 new users at cnn.com, breitbart.com and ign.com, and characterizes an FBU (Future Banned User) as entering a new community with below-average literacy or communications skill, and that the low standard is likely to drop shortly before a permanent ban. It also observes that higher rates of community intolerance are likely to foster the anti-social behavior and speed the ban.
Disqus, and the comment section at The Atlantic.
Best Slashdot Co
Automated censorship. Eh, saves us the trouble, I guess
“He’s not deformed, he’s just drunk!”
Anybody who tells the truth that the scum in power don't want you to hear, apparently...
Trolls are usually above average literacy and trying to skilfully cause a fight. It's easy to identify "illiterate" people and humans are way too quick to judge someone who cannot spell as having nothing to contribute or (worse) malicious, but these are not trolls. This is just another classist meme where the person is judged positively by the overcomplexity of their language and convolution of their sentences, as this must mean they have been educamated right.
BTW I went to a £30k/year British boarding school, so I have no axe to grind, nor insecurity about describing things as they are.
Forum detects you!
I don't want to talk to you no more, you empty-headed animal food trough wiper! I fart in your general direction! Your mother was a hamster and your father smelt of elderberries!
Get free satoshi (Bitcoin) and Dogecoins
A true troll can never be detected :P'''
Good luck with your algorithm.
within the first ten posts after the user joins
So, this algorithm only needs nine more posts than a troll will actually make per throwaway account, then?
That's some mighty fine police work there, Lou!
:-)
I rather like reading all the trolling, I mean slashdot discussions...
What Cornell's researchers found is way to id posters who are likely to get banned.
... at the term "Future Banned User" (FBU).
Isn't that just a nice way of saying "somebody our algorithm flags as a complete asshat"
Your mistake is in using the "classic" definition of "troll" - somebody who sets out to deliberately cause fights on a forum. Trawl through the archives of Slashdot and you will find many instances of this kind of trolling - and yes, the people doing it are often highly literate (and, when they do it right, sometimes very funny with hindsight).
But the term "trolling" has gone political these days and is routinely used to describe any form of online behaviour that the speaker doesn't approve of. So everything from outright criminal behaviour (eg. threats of immediate violence) at one end of the scale through to disagreeing with a forum's established groupthink (however respectfully) at the other.
And yes, it has become a favourite term of the intellectually insecure, whenever they want to shout down an opposing point of view without engaging with it. In fact, conflating those two extremes I mention above under the same term is outright beneficial for the easily offended, as it allows them to group polite dissenters together with the mouth-foaming loons.
Internet pre-crime.
There goes Gawker.
It also observes that higher rates of community intolerance are likely to foster the anti-social behavior and speed the ban.
If automated an intolerant core could try to get users expressing opinions that they don't like banned. The fact that they are subjected to intolerance would make the algorithm more likely to ban them.
It might be useful to inform an admin to look at suspicious postings, especially if they can get the accuracy higher. BUT I hope no one uses such algorithms to automatically stop suspected trolls. This can only lead to unforeseen consequences and stifling of free speech (unless of course stifling is not an unforeseen consequence, but an intended one).
Many Slashdotters already complain about the Lameness-Filter, this has the potential to be a hundred times worse.
The technology will of course be developed, let us hope its impact isn’t too negative.
In a somewhat related note, have you noticed how the automated answering at phone centers is getting more aggressive keeping you from a real representative and wasting huge amounts of your time when it doesn’t know how to process your query? Even hanging up on you when your issue is not resolved. My last experience with Verizon was a nightmare in this respect last time dealing with a technical problem with our phone. The more these things can be automated, the more they will – customer friendly or not </vent spleen>
Letter To Iran
any d00d who sez "fork ewe" obviously can't spel.
int troll;
troll = found("goatse.cx");
OK. There's your algorithm.
I can't wait til the anti-bullying crowd lobby for something like this wherever their children might be on the internet. Sounds like it is early days though, 20% false positives is pretty darn high.
The article defines a troll as someone who has been banned from an online group.
You can be banned from a website such as redstate for being an Obama supporter. People are often banned from websites solely for having minority viewpoints.
Apparently poor grammar and spelling are trolling now.
There goes 60% of Facebook English posters, who type in a bastardized text-speak form of English. (let's not forget Twitter as well)
Let's not forget English as a Second Langauge, or Third, or even more.
Generally poor grammar and spelling for the first 5 years on average until they finally get the hang of it.
Mind you, admittedly, I would only want reasonably decent posts being at the top of news feeds comment sections as well, with lesser quality being hidden behind a scroll similar to Slashdot. (in addition to a similar Slashdot comment rating system on top)
Partial automatic moderation and user moderation is the best of both worlds, because the user moderation can then fix mistakes that the algo might make, and if you add some basic machine learning, it could also learn with time.
Or you could do a mishmash of machine and manual learning and have a file dump sent to the algo writers who can update it with time, rather than directly update the source, which could be abused and/or broken with time. I mean, look at how much fun 4chan has with automated systems, or systems of immediacy. (like Twitter harassment reports system. aha)
You could do some minor learning I guess, so the general community culture would be reflected on said automated rating.
Researchers have detected researchers detecting an algorithm detecting researchers researching.
We play the game with the bravery of being out of range
What they identify isn't people who "troll", it's people who get mobbed and ostracized by a community. There's a big difference between the two. That's not a question of "false positives", it's a question of whether people lose themselves completely in group think.
Of course, in practice, there is little chance this will actually go anywhere. Although content creators and ideologically biased readers frequently denounce as "trolss" anybody who disagrees with them, sites actually like controversy because it increases ad impressions. That's why sites continue to use systems like Disqus, which have rather ineffective moderation.
This is a form of censorship. The very reason speech must be protected is because it offends. The very speech that must be protected most vigorously is the most offensive. The internet is a public space and if people want to embarrass themselves in public, as long as they are not harassing or assaulting anyone or otherwise acting contrary to the law, that's perfectly acceptable. Most sites already allow us to block or hide comments/commentors we don't like. Animating the process is redundant and frightening.
Even the writers.
The ability to be scathing in an erudite fashion, the art of verbally sliding the knife home in such a way that the subject doesn't realize that s/he's been intellectually disemboweled for a full five seconds, that is the hallmark of the masters of rhetoric in the rough and tumble of the internet.
Slashdot should use this new algorithm to detect trolls and automatically ban persistently anti-social posters so faggets, niggers, libtards, e.t.c., can enforce free-speech...
Does the algorithm account for the fact that the Troll designation is applied by some specific person who (a) has mod points, (b) strongly disagrees with a given post and (c) is in many cases part of a group who is looking for antagonists to some cause that group really believes in.
If (internet) then troll_present = true;
Done, just that easy.
Lost at C:>. Found at C.
https://xkcd.com/810/
Seems relevant.
Looks like my days on the internet are numbered.
The word troll is a pointless word which is misused by people who mainly want to villify those who disagree with them, and excuse for people who do not want anyone else to be able to express opinions except for the ones they approve, to censor anyone elses opinions they do not like. Thus, the marking in almost all cases is abused and has no real purpose except for censorship. Obviously, since a message board should be a place for discussion and expressing of differing views and opinions, such is contrary to the purpose of message boards to begin with, to express ones views and to debate subjects.
The fact is, expressing a view someone else disagree with is not something we should censor, and the tr*** accusation is just an excuse for censorship. As long as the poster honestly believes in what they are posting, its not a tro**, their are posting their view to express their position for the sake of the issue itself, rather than to annoy anyone. Maybe, a tr*** might be someone who posts things they do not agree with for the sole purpose to annoy. However, since it is impossible for anyone to know whether or not someone posting a message honestly believes in what they say, it is impossible to determine if a message is a tr***, or not. it is also impossible to know if someone is posting a view just because they are interested in a subject and have a view on it, rather than trying to annoy anyone.
The fact is, if someone is annoyed by something, the person responsible for being annoyed is the person who is annoyed, its all in the eye of the beholder, some people will agree with something and others will disagree, you have to allow for a difference of opinions and views. It is always the case that someone will disagree with someone else says, it does not mean that the message was posted with the sole intent to annoy, but the reader of a message may still misconstrue or assume that even though it is impossible for them to truly know that. It is okay and important for people to be able to post messages they know will annoy others, because, anything can annoy anyone, its impossible to post a view or position on anything if one has to fear annoying someone.
The tr*** thing could only apply to messages written with the sole intent to annoy, But as I said, its impossible for anyone to know if that was the sole intent, to be the sole intent, the person would have to not honestly believe in what they say, otherwise they are posting because they believe in what they say and think that its important.
That is why the marking on a message cannot be used legitimately and fairly, there is impossible for anyone to know if a message is a tr8**. Thats why, we should remove the marking from messaging and bullitin board systems. As I said before, in 100% of cases the marking is abused, it cannot be used in any proper, fair way, because it is a fundamentally flawed feature.
It would be best policy on these matters is that bullitin boards should have a rule against computer generated and mass posted advertising, but thats about it.
Slashdot should use this new algorithm to detect trolls and automatically ban persistently anti-social posters so faggets, niggers, libtards, e.t.c., can enforce free-speech...
Slashdot does not need this algorithm since its faggets, niggers, libtards, e.t.c., crowd moderate it by censoring the truth!
Besides tracking trolls, users without an established identity will be automatically down ranked.
Also, certain keywords associated with negative opinions about the country will cause demotion.
Oh, wait, this has been happening for some time... well, it's SSDD then.
Slashdot should use this new algorithm to detect trolls and automatically ban persistently anti-social posters so faggets, niggers, libtards, e.t.c., can enforce free-speech...
Slashdot does not need this algorithm since its faggets, niggers, libtards, e.t.c., crowd moderate it by censoring the truth!
You trolling Slashdot's faggets, niggers, libtards, etc,, aren't you?
will be wiped off the map
patent and trade mark trolls need to be removed
The 80% figure, which is the AUC (area under the curve), refers to threshold tuning. In order to make that usable in the Real World, you'd have to crank it so that it has nearly zero false positives (and thus very few detected trolls) or else you'd have to make it flag posts non-fatally, perhaps with nearly impossible captchas, which immediately defeats its anti-troll utility (not to mention angering all of the falsely identified trolls!).
The article, like the paper itself, ends on this note:
Use my userscript to add story images to Slashdot. There's no going back.
and volunteer to help test. We have a steady stream of trolls available for review, a truly endless supply.
spacefem.com
If (aligns with my prefered political view)
{not troll}
else
{troll}
I don't understand: "...the low standard is likely to drop shortly before a permanent ban." Who drops the low standards? The trolls drop? Why would the trolls drop the low standards? Most of the trolls already have below-average literacy or communications skills. I'm so confused. This article is too short for me to understand.
So, posters on message boards deemed "anti-social" or that have views that are not tolerated by the community are now the definition of troll? Wow, that's a good way to make sure opposing viewpoints never get heard. The "algorithm" will just drop any message that goes against the "party line".
I'd imagine there are plenty of places where if you stand up for your individual rights and privacy you'd be marked a subversive and the community wouldn't tolerate your presence. How about speaking of the value of equal civil rights for all humans on the kkk.org forums? I'm pretty sure the community would be pretty intolerant to your viewpoint there!
When did the definition of "troll" become someone who has a viewpoint that you don't necessarily want to hear? A troll, for those who don't know, is someone who legitimately goes out of his or her way to get a rise out of someone just for laughs. Someone who is passionate about an issue that may go against popular opinion or a political narrative doesn't make them a troll. Not having the best grammar and writing skills doesn't make you a troll.
This is purely a push by establishment websites to automate the moderators who go through and do the wholesale deletion of any and all viewpoints that go against the "party line" with robots so in the case of any kind of populist backlash against the predetermined narrative and status quo, they won't have to rely on humans that (possibly) have critical thinking skills going outside the box.
The only thing better is if you had mentioned something bad about libertarians in the synopsis.
Why talk to people who act in ways that the majority does not appreciate? It is the majority that counts, the rest should obviously be ignored and die in utter despair and isolation. Any computational tools that further this goal can only be good for society, right?
I can see, how this may defeat (ab)users trolling for fun and not suspecting automated detection before it hits them (though, with only 80% accuracy, I dread the thought of the methods expanding out of the virtual realm).
But what about people "trolling" professionally — paid and/or otherwise compelled into it by a state or corporate actor pretending there to exist some kind of "grass-roots" movement? How would it deal with thousands of fake accounts mounting a coordinated assault, posting (while "liking" and "following" each other)?
Some times you may be able to catch accounts posting identical things at the same exact time (and ban them all in bulk), but Russians seem to have fixed that bug in their bots now...
This is turning into another battle like that, in which spammers have fought the best Information Technology minds into a standstill. I doubt, progress against forum-spammers will be much better than that — not when mere technology, however clever, is up against interests of a reasonably powerful state.
In Soviet Washington the swamp drains you.
... something like this:
int is_troll( const char* username ) {
if( !whitelisted(username) ) {
return 1;
}
}
Log in or piss off.
The study observed 10,000 new users at cnn.com, breitbart.com and ign.com, and characterizes an FBU (Future Banned User) as entering a new community with below-average literacy or communications skill, and that the low standard is likely to drop shortly before a permanent ban.
So, what about communities with above-average literacy and communications skills?
This won't work nearly as well as the authors expect. The moment such system gains adoption, the rules will change and anti-detection and algorithm poisoning techniques will be adopted. For example, proposed approach would likely be completely defeated by first making 10 "constructive" FAQ copy-paste postings. Also, spam is much easier to detect than trolling, since spam is not unique. Still it took years and complicated spam-detecting analytical algorithms to reduce it to manageable levels.
I'm secretly hoping the 80% success rate indicates that everyone else has a 1 in 5 chance of getting randomly banned, troll or not.
I feel this would noticeably improve many online communities.
Anybody who tells the truth that the scum in power don't want you to hear, apparently...
In days when someone can be attested for quoting from a published book by Winston Churchill I have to agree.
He wasn't arrested for his speech. He was arrested for failure to disperse after multiple complaints.
But I was really just trying to disagree with someone's point of view.
Laws are rules for the court, but merely a bottom bar to hit for life. Think beyond laws in your actions always.
you sir, win the internet.
article = new nonsensefilledstory();
article.addStrife();
article.addContraversy();
article.stoketribalisim();
article.allowAnonymousComments(true);
stack_of_trolls *users = article.create();
forall users as user (
if (user.isTroll() == false && user.respondsToTrolls() == true)
(globalBanList.addUser(user));
)
Unfortunately, many people think that if you express a different viewpoint or opinion than the masses that you're trying to start an argument or a fight. Why is society so hell-bent on crushing dissenting opinions? And not merely silencing them, but villifying them?
I've often been tagged as "trolling" because I don't agree with the crowd. If you knew me personally, you'd know very well that I'm not trying to start a fight, just expressing my opinion. Just because it is not the popular viewpoint doesn't mean my views aren't valid.
Here on Slashdot, I often see people flagged as being trolls just because they don't follow the masses. You'd think a site full of outcasts and oddballs like programmers and technologists would be more accepting of alternative views, but the exact opposite seems to be the case.
I do not fail; I succeed at finding out what does not work.
(you mean like that?)
Fukers who cant spell or write good sould not be alowed to make comments on the Internet.
1. Anything positive about Microsoft.
2. Anything positive about a Republican.
3. Question any part of a global warming story.
4. Refer to climate change as global warming.
5. ?
6. Profit!
Wait, there are commenters on breitbart.com that *aren't* trolls?
I am German and i am curious how a single person can "disperse". Explain please.
Trollolololol!
I am German and i am curious how a single person can "disperse". Explain please.
It's the friendly way of saying "get lost" by police enforcement. Please note that by "get lost" we don't actually mean to enter a state of being lost with no direction home but instead to exist somewhere other than the place they are currently in. Rigidly it means to scatter even if it is one person or particle or whatever.
Perhaps it would be better to say "relocate to your own property" in this case but the wording used is usually "disperse".
Researchers have discovered a way to troll Internet news sites by claiming to detect "trolls" (a term having duplicitous meaning).
between trolls and people who disagree with you. I know fourth wave feminists have trouble telling them apart though.
A 80%-accurate, or 20%-inaccurate troll-detection system, could be one which misses 1/5th of trolls, or one which labels one non-troll as a troll for every 4 trolls it correctly identifies (or both).
Mistakenly labeling one non-troll as a troll for every four trolls it actually comes across could be pretty painful.
Well - answer my question in the subject
The best troll is the one which seems to have an honest opinion, and makes non trolls react negatively. As a troll, this just makes me excited to exploit a new attack vector.
Of course neither I read the article, but if researchers are seeking to reduce the terrorist, anti-american, etc. trash comments, wouldn't be better to just permit comments only from registered, white male republicans? So, if someone wants to comment about crackers, racists, cuntservatives, etc, must be a registered white male republican (that way free speech is guaranteed for all) and crackers, racists, cuntservatives, etc, can still rule!
Everyone here knows you're a goddamned astroturfing troll. You use the name Obfuscant. How much more obvious could it be? Just shut up already!
PLEASE do NOT taunt me a second time!
and call it "Minority Retort".
Police officers arrested Mr Weston, mid-speech, for failing to comply with their request to move on under the powers of a dispersal order made against him.
He was further arrested on suspicion of religious or racial harrassment.
So yes, he was arrested for failing to comply with a police order and yes, he was arrested for his speech as well.
the rest of the white race
Oops, what a giveaway, you racist twat.
To have a right to do a thing is not at all the same as to be right in doing it
You call it Troll; They call it Advertisement; Somebody else will call it Spam;
Casteism
aka Flamer? http://www.newser.com/story/19... http://www.nytimes.com/2014/08...