Twitter Should Use Random Sample Voting For Abuse Reports
In August, Twitter user Kristin Puhl made public the fact that another Twitter user had tweeted at her:
f@#king die feminist moron i'm coming after u and raping u.
and when Puhl filed an abuse complaint with Twitter, Twitter responded after two days:
We've investigated the account and reported Tweets for violent threats and abusive behavior, and have found that it's currently not violating the Twitter Rules (https://twitter.com/rules).
(The "rules" linked in the message include the clause "You may not publish or post direct, specific threats of violence against others.") Twitter must have changed their mind eventually, because the account of the user who sent the message is now gone, but why didn't they close it the first time?
Twitter can't effectively adjudicate all the abuse complaints that they get, but I don't blame them. I don't think they publicize numbers for how many abuse complaints they receive every day, but I'm sure that it's more than an internal review panel could handle fairly. Twitter should not be faulted for that. They've created a world-changing tool, and they shouldn't have had to stifle the growth of their platform just because it grew faster than their ability to handle the abuse reports.
But now that they're publicizing their latest tools for handling online harassment, it's fair to ask more of them. And while the tools may streamline the process of categorizing incoming abuse reports, there's always going to be a human review bottleneck, which will get tighter as the Twitter platform continues to grow.
So I'd suggest the same solution that I suggested for Facebook abuse reports: recruit a pool of volunteers from the general public to review "abuse reports". (You would need a "critical mass" of at least tens of thousands of reviewers for my idea to work, but Twitter shouldn't have trouble amassing that many people for a special program.) Then when an abuse report comes in, do the following:
- Some small number of reviewers -- say, ten -- are randomly recruited from the pool of volunteers.
- Each of them looks at the reported content and the category of abuse that it was reported under, and votes Yes or No as to whether the content meets the criteria for abuse.
- If some threshold of users (say, eight) vote that it does, then the report gets bumped up to a higher-level review. This "higher-level review" could mean having a new, larger pool of users (say, twenty) look at the content and vote on it, in case the original eight-out-of-ten vote was a statistical fluke. Or it could mean forwarding the reported tweet to some human review panel at Twitter -- which now has far fewer abuse cases to review, because it only has to look at the reported tweets that cleared the hurdle of getting eight out of ten votes for violating the guidelines.
These numbers are just guesses. I might be over-optimistic about how many reviewers would even respond when Twitter asked them to vote on whether some content was abusive (even though that's what the reviewers signed up to do) -- it might turn out that to get even ten responses, Twitter would have to nag 50 people to come and vote on a piece of content. And the size of the voting initial voting panel should be large enough to avoid statistical flukes most of the time -- if a tweet is inoffensive enough that only 10% of the reviewer population would consider it "abusive", you'd have to be really unlucky to convene a panel of 10 users where 7 out of 10 voted to label the tweet as "abuse".
As long as the size of the reviewer population grows in proportion with the Twitter user base (or, more precisely, as long as it grows in proportion with the volume of abuse reports coming in), this system scales as much as you want it to. (Well, unless the "higher-level review" involves review by an internal panel at Twitter, which still creates a bottleneck.)
Because the voting panel is randomly selected from among the entire pool of volunteers, that means you can't "game the system" by forming a mob with dozens of your friends so that everyone can file an abuse report about the same content at once. As long as your mob only comprises a tiny proportion of the 100,000+ reviewers in the system, there's virtually no change that a randomly selected panel would contain enough of you to swing the vote.
This could also potentially result in an almost-instant turnaround time for handling abuse cases (a matter of reassurance for victims of normal harassment, and a matter of life and death in the case of suicide threats or threats of violence). Twitter could restrict their random sample to only those users who happen to be signed in at the present moment, and who have a minute or two to review a piece of content and vote on whether it violates the guidelines.
Tweets are by definition public, so there wouldn't be any potential privacy violation in taking someone's tweet, putting it before a panel of 10 volunteer reviewers, and asking them to determine if it violated the terms of service. Direct Messages sent via Twitter, on the other hand, are intended only for the recipient, and are not public by default. If a recipient wanted to flag a Direct Message as abusive, they would have to specify whether they want the content to be reviewable by a panel of randomly selected public volunteers. So in the case of the tweet received by Kristin Puhl -- "fucking die feminist moron i'm coming after u and raping u" -- even if she had received it as a Direct Message from someone she was following (you can only receive DMs from someone if you're following them), presumably she would have been OK with showing the tweet to a panel of volunteers, who probably would have voted that it was in fact abusive. On the other hand, sometimes a user might receive abusive DMs where they want to report the abuse, but the DMs might contain sensitive information that they don't want publicized to randomly selected volunteers. So those abuse reports might have to be handled the old-fashioned way at Twitter, by internal review, which still creates a bottleneck. But hopefully the abuse reports about Direct Messages comprise only a small minority of abuse reports that Twitter receives, since most talk about abuse on Twitter comes in the form of public tweets. (If someone is "abusing" you via DMs, you can just unfollow them.)
Twitter could even be completely transparent about the entire voting process: "Your complaint has been reviewed by 10 people. 8 of them agreed that the tweet in question violated our guidelines. This is above our minimum threshold of 7 that triggers a higher-level review of this content." (Twitter presumably wouldn't want to tell the complainer who the voters on the panel were, since the complainer might harass the individual voters if the voting panel as a whole rejected the complaint. But there's no reason not to be transparent about the actual numbers.)
Why would someone sign up to volunteer to review abusive content? Maybe for the glimpse into strangers' lives. Maybe hoping to save copies of some of the porn contained in the tweets that get reported for abuse. (Of course, there are easier ways to get porn online, but maybe they get off on the fact that some particular pornographic image made someone angry and upset enough to report it.) Maybe they altruistically believe it's part of their civic duty towards the Twitter community. Maybe because they're bored.
Whatever people's myriad motivations for signing up, the important thing is that there's still a statistically significant difference between the number of "yes" votes received when content truly is abusive, and when it's not. Even if you have people signing up as reviewers for all kinds of weird reasons, a tweet like "fucking die feminist moron i'm coming after u and raping u" is still going to receive, on average, more "yes" votes than a tweet like "I respectfully disagree, so let's go our separate ways".
If Twitter were nervous about rolling out a system like this, ceding control of the abuse-report-handling process to a pool of volunteers, they could always do their own random sampling of the random-sample-voting system, to see how it was working. An internal auditor could pull 100 of the abuse report cases that have been handled by the random-sample-voting system recently, decide in each case whether the tweet did in fact violate the abuse guidelines, and then look to see if the voting system reached the same answer. As a control in the experiment, look at some abuse reports that were routed to the old-fashioned internal review panel during the same period, see how they handled the reports, and see how they fared in comparison. I would confidently bet money that the random sample voting system would handle the abuse reports more accurately, and faster, as well.
This won't do much to deter abusers who create an endless series of throwaway accounts for harassment purposes, which makes it futile to block or report any particular account. But it would at least get step zero right, which is to correctly adjudicate whether a tweet is abusive or not. And it would do it in a way that is scalable, non-gameable, and transparent. Plus a few volunteers would get an interesting story to tell at dinner.
Not your blog. Go away.
And why would anyone care what he thinks?
if a post only gets 1 report - manual review
if a post gets multiple reports it gets disabled, and the user gets a notification that it was disabled, why, and a change to reply stating why you believe its wrong, for a manual review.
have you seen my sig? there are many others like it but none that are the same
Blah Blah, Bennett is an idiot, blah blah, how did this get to feed, blah blah, there's a seperate section that this loser should post to, blah blah, I don't read his stuff but I'll comment about it.
We don't care what he thinks!!!
We think he should get his own blog!!!
...for flagging Bennett posts.
"I'd just like to emphasise that taking a million years isn't a metaphor here..." -Rich Bradshaw
We don't care!!!
Got your own blog its not that hard!!!
Why should one volunteer to review twitter posts for free for a company that makes 700m$ a year?
How is going to pay rent or feed the kids or pay for the Internets with that?
Just wow.
I'm starting to think that he's a big part of the funding keeping Slashdot afloat, by paying them to publish his stuff.
Yes, Bennett, we know you founded some kind of website that combat Internet censorship and you also contributed to that lawsuit to allow porn to be viewed on library computers in Washington state. You're some kind of Internet Hero, I guess?
But, again, I would like to understand why your blog postings end up on Slashdot.
No
I do my best to overload their system and report every promoted post I see as the spam it is.
Now I may bathe myself in the wisdom of this post.
And I'm out.
below, above or even sideways.
Because his shit is stupid.
Hm? Linda, what do you think?
So twitter should adopt Slashdot's metamod system? (The old system, not the incomprehensible mess in place today)
For example, a majority of US whites think certain cops are innocent, but a vast majority of US black think the cops are murderers.
Which side is right?
Even when there's video of the cops actively killing unarmed people, nobody gets charged.
But if we use your system, someone saying the cops are murderers will be reported and sustained as "abusive", given that Twitter is mostly used by white people.
Same thing goes with global warming. 87 percent of Americans think it's happening now. But the 17 percent that don't are very vocal and have large corporations that make fake twitter (and FB etc) accounts.
I'd like to tell you it's a simple thing, but it's not.
-- Tigger warning: This post may contain tiggers! --
Tattletale mechanics are important for self-moderating systems.
Facetweets and socnets only ring around 0.5-0.7 Giveashits though.
Think of all the problems that could be solved with unpaid labor!
Why should suicide threats be alerted?
Actually, I *always* read Bennett's posts. The comment stream is, by far, the funniest postings on slashdot. I'm delighted by the myriad inventive ways slashdotters have discovered to mock this maroon.
If Bennett did not exist, it would be necessary to invent him.
Much memorizing work to do
The heartbreak of Bennett-itis? Then take a tip from Mr. Paul Anka!
[Singing]
To stop that Bennett, one-two-three
Here's a fresh new way that's trouble-free
It's got Paul Anka's guarantee
Guarantee void in Tennessee!
Just don't look! Just don't look!
Just don't look! Just don't look!
Just don't look! Just don't look!
Who is going to volunteer to do reviews? I sure won't, the only reason I'm willing to moderate Slashdot posts is because I'm reading anyway. Hardly anyone was metamoderating on Slashdot, even when it was well built.
What kind of people are those going to be who volunteer to do a corporation's job?
"First they came for the slanderers and i said nothing."
Seriously. It would fit right in there, and Bennett would probably find a more appreciative audience.
I write sci-fi for metalheads
Filter error: You can type more than that for your comment.
Have I agreed to find this post on slashdot? Have I said "yes"?
Because they are an agenda driven, power mad, gleeful censor who love nothing better than silencing the opinions of people they just don't like. Our freedoms end where their outrage begins.
These people are the cancer that is killing the internet.
What are you going to do, Bennett? Call and ask for your money back? Demand to see life's manager and return the lemons?
What's original about your idea that you don't think the folks at Twitter don't already know about from places like Slashdot, Reddit, Google, etc? At this point, crowdsourcing volunteers by one method or another to help rate the quality of something isn't exactly an unknown way of doing things. Do you have any original ideas?
Just one minor detail.....
I don't believe they can even if they wanted to.
It is very tough for a for-profit company to have volunteers doing the work. One eventually sues claiming they should be paid. Court sees them doing 'work' and agrees. All volunteers dismissed and done. Now they probably have less paid people to do it and twice the backlog. This would certainly be seen as work. I don't remember what we called them in Asheron's Call but the volunteer police force didn't work out. Getting volunteers to do something they are already paying someone to do would never fly legally.
Dude needs a little more research before these grandiose future plans.
Targeted ads are getting creepier, and not helpful showing me an unique eBay item I didn't win and asking if I want to try again....there AREN'T anymore!
Nothing I hate worse than ads for stuff you won't / can't sell me. Talk about the ultimate waste of time. NOR DID IT NEED TO UPDATE 3 TIMES WHILE WRITING ABOUT IT,
gotta go recover my password, the ads and reloading, reloading, reloading as AC are too much.
hurfy
Much needed to report useless bullshit. Fuck off dipshit, you haven't posted a single useful bit of information for years.
That would be those people who already have an agenda that they believe could be furthered by restricting other people's accounts.
Tyranny of the majority.
And that isn't counting hiring people to do that. For just $X a day, you can down-vote post opposing Y and up-vote posts supporting Y. Think about whatever political position you don't like and imagine those people doing that.
Bennett Haselton is an idiot. That's okay.
The fact that Bennett Haselton's idiotic ideas get front page posting on /. is a problem. Why did samzenpus feel that this was worth posting?
Or you could just block the original poster. I'm getting a little sick of people complaining the bad man said scary words to me.
Suit up or get off the internet.
There is a perfectly reasonable tool to block annoying people on twitter....takes all of 2 seconds.
f@#king die feminist moron i'm coming after u and raping u.
From personal eye-witness accounts of vile feminists and SJW's, I think this twitter comment is actually fairly tempered and reasonable. The poster should be given a medal for his restraint.
Dear /.: please enable killing all Bennett posts without completely disabling the site. If I put "Bennet Hasselton" in my list of filtered terms every story disappears.
Blasphemy is a human right. Blasphemophobia kills.
They've created a world-changing tool,
Only if your world is really smally.
and they shouldn't have had to stifle the growth of their platform just because it grew faster than their ability to handle the abuse reports.
So you're rightly observing that a business which stops delivering quality service, focussing instead on quantity, can expand faster than one which has to continue delivering a quality service as it expands. Thanks, Captain MBA. Now why the hell shouldn't they continue delivering quality?
So I'd suggest the same solution that I suggested for Facebook abuse reports: recruit a pool of volunteers from the general public to review "abuse reports".
So, your argument is that the work should be done, but shouldn't have to pay people to do the work. Why? Does Twitter have some inherent right to profit?
What the fuck is wrong with people like you?
I didn't realize somebody other than Francis could pontificate so well. I guess this is now the Slatican City.
This is all a great misunderstanding. He doesn't come here to pontificate: he comes here for getting-hit-on-the-head lessons.
You've got to be a real glutton for punishment in order to want to do UGC moderation for free.
You won't even get Hot Pockets, big guy.
Yep. Which is why /. should require that every down-mod be accompanied by a short explanation of WHY it fit "abusive/trolling/offtopic".
Up-mods don't matter. If you want to mod something up then no explanation is necessary since they don't "bury" unpopular opinions.
Riot has had a system like this for some time, the Tribunal, and they allege it works pretty well. It used to have an in-game based reward (absolutely minuscule amount of IP, the in game currency), but they have since removed it, and last I checked it still had high numbers. I don't know if Riot is the originator, but I know it's a pretty major part of their abuse/harassment control.
I really don't know if a Tribunal style method would carry over to Twitter - I remember that part of the reason that people liked Tribunal was just the absolutely ridiculous stories you could read about players and the crap they pulled, and the in-game jokes made it worth it. Riot also made a mini-game of the system, insomuch that you get ranking based on how often your suggested ruling lines up with the actual ruling made on the case. You don't get anything in game anymore, nor does it affect your game profile, but people seem to like it. Likewise, Riot's punishments aren't just pardon/ban, but a range of punishments which can be administered by the admins there.
One thing that does make me kind of worried is that there's not a lot really holding people to the abuse Twitter accounts; in Riot's case, having a Level 30 account (necessary level to participate in the game in full) takes a bit of time, and while many users have accumulated quite a few spare accounts, eventually those pools run dry -- on top of that, primary accounts tend to have in game purchases tied to them, so loss of the account represents a financial loss. With Twitter, you can make a spoof account in seconds with no penalty, and harassment accounts are able to participate immediately and by necessity for Twitter to work. Without the time commitment or something tying people to the account, I'm not sure that this will have as great of an impact.
On a TF2 server I regularly play on (I only play on other servers if it's empty or on a map I hate), we have about 40-50 regulars that are there at least once a week, if not multiple nights, for many hours. One of these was a guy who was usually getting on at least one person's nerves every night, but he's been absent for a few weeks, likely just busy. In the meantime we have a new person who has taken on the same role. He's a dick, but has some intelligence and is never a *huge* dick (at worst one or two people will try to votekick, but most who find him annoying have just muted him.)
Last week one of the old regulars, also an admin on the server, was on at the same time and was telling the new person that the admin appreciated his presence, because there was universal annoyance, at best, amongst the server population that helped bind them together. It was something like a common cause, but replacing productivity with hate. While it was certainly intended to be a riff on the dick, there was some truth to what the admin was saying.
Perhaps that's what Bennett is to us. The whole Beta thing has really died down (or I've willfully ignored it), and that was a very uniting aspect of millions of /. users. Since it's died down, these "stories" by Bennett Hazelton have begun. Perhaps these aren't intended to be actual stories, but to give a "common distaste" (or detest, if you prefer) amongst /. users that will act as a common ground: /. can act towards a large goal, fueled by our mutual hate for Bennett blog posts. A grand conspiracy by /. editors (and Dice?) for the greater good.
A: "Hey fuck you ignorant conservative"
B: "No fuck you, lazy liberal"
A: "Ah man, it's another Bennett article. I hate that guy's drivel."
B: "Oh, really? Me, too. Wish I could ignore all of his articles."
A: "Heh, yeah. So, hey, about earlier..."
And thus
Oh no it's Rolla..... Bennett.
People these days do some very interesting things with machine learning and with all the user data Twitter collects, can't they create something intelligent that finds and eliminates trolling better than random sampling?
Yup
The introductory example was a false negative. Benett rants on why a false positive will be avoided with his method.