Slashdot Mirror


The 'Scunthorpe Problem' Has Never Really Been Solved (vice.com)

dmoberhaus writes: Yesterday, a writer for SB Nation named Natalie Weiner posted a screenshot of a rejection form she received when she tried to sign up for a website. Her submission was rejected because a spam algorithm considered her last name "offensive." After she posted about this, hundreds of other people with similarly "offensive" last names sounded off about how they had experienced similar issues. As it turns out, this phenomenon is so widespread that it has a name among computer scientists. It's called the Scunthorpe problem and it's been a scourge of the internet since the beginning. Motherboard spoke to content moderation experts about its origins and why it's such a hard problem to solve 20 years later. A big reason why the problem has yet to be solved is "because creating effective obscenity filters depends on the filter's ability to understand a word in context," reports Motherboard. "Despite advances in [AI], this is something that even the most advanced machine-learning algorithms still struggle with today."

"This works both ways around," Michael Veale, a researcher studying responsible machine learning at University College London, told Motherboard. "Cock (a bird) and Dick (the given name) are both harmless in certain contexts, even in children's settings online, but in other cases parents might not want them used. Equally, those wanting to abuse a system can find ways around it."

2 of 382 comments (clear)

  1. Re:DIdn't Know It Had A Name by denzacar · · Score: 5, Funny

    But I sure know the problem... it is really surprising how often (or how many) buried obscenities pass under our eyes

    Tehehe! You said but. And ass.

    --
    Mit der Dummheit kämpfen Götter selbst vergebens
  2. Re:WeightWatchers by niks42 · · Score: 5, Funny

    My favourite was when PowerGen opened a web site for their Italian operation, called powergenitalia.com ..