Slashdot Mirror


Hatebase Tries To Scan For Precursors of Genocide In Language

An anonymous reader writes "Hatebase, a new crowdsourced database of multilingual hate speech from The Sentinel Project, is an attempt to create a repository of words and phrases that researchers can use to detect the early stages of genocide."

3 of 190 comments (clear)

  1. China-Friendly by caiocaiocaio · · Score: 5, Interesting

    1) They are missing most Chinese-language racial slurs, and are apparently not searching for Chinese characters. I think the results would be predominantly Chinese otherwise. I mean, how could they miss "waiguolao"? In China, hearing that word was my red flag to get indoors or to a cop as soon as possible. 2) I could find you literally thousands of websites calling for genocide in China (either against resident minority groups or towards immigrants in China) which don't use any ethnic slurs. Most of the ethnic slurs in China are condescending more than hateful (except those directed at the Japanese), and using more neutral terminology gives pro-genocide Chinese an air of legitimacy. I can remember when "nanlaowai", for example, was quite a popular blog, but their didn't use any racial slurs in spite of the constant demands for the ethnic purification of China. Chalk it up to cultural difference I guess.

  2. Re:Hatebase as in hate speech, as in ... by retchdog · · Score: 4, Interesting

    Oh, calm down.

    This is the same as any NLP crowd-sourcing tool; it's simply designed with a focus on correlating vocabulary with prejudicial sentiment. No one is planning to use it to pass restrictive laws. It's just useful for people who are involved in a country, but are not fluent speakers of $foo or involved in the right subcultures, to know that a certain word has now acquired a negative connotation.

    Maybe those people should butt out, sure, but you're jumping the gun a bit, here. If they could force everyone to use ``political correct speeches", they wouldn't need this app in the first place.

    --
    "They were pure niggers." – Noam Chomsky
  3. Re:Hatebase as in hate speech, as in ... by overlordofmu · · Score: 4, Interesting
    NLP = natural language processing

    This is the same as any NLP crowd-sourcing tool; it's simply designed with a focus on correlating vocabulary with prejudicial sentiment.

    In case some of you were wondering about the acronym. That becomes:

    This is the same as any natural language processing crowd-sourcing tool; it's simply designed with a focus on correlating vocabulary with prejudicial sentiment.

    To take the conjugation one step further it becomes:

    This shit be the same shit as any goddamn shit where we get other motherfuckers to do the fucking dirty work of working out when shit-talkers, shit-talking in some other fucking language, be talking shit is a way that means that those fuckwits mean to start some shit.

    Of course, sometimes you can take conjugation a bit too far.