Slashdot Mirror


Hatebase Tries To Scan For Precursors of Genocide In Language

An anonymous reader writes "Hatebase, a new crowdsourced database of multilingual hate speech from The Sentinel Project, is an attempt to create a repository of words and phrases that researchers can use to detect the early stages of genocide."

25 of 190 comments (clear)

  1. Mostly false positives, will be used for "hate". by Ungrounded+Lightning · · Score: 5, Insightful

    My expectation is that this will be used for political infighting, much like the genocide it purports to try to head off.

    The "crowd" will include activists for one (or more) sides of contentious political disputes, who will feed the database with typical word choices of their enemies, in the hope of branding them as potential genocide perpetrators. The result will be a produce far more false positives than true ones (if it produces any of the latter at all).

    Indeed, the very phrase "hate speech" is such a faction-specific term. It is used by the US left wing to attempt to suppress politically incorrect free speech - especially politicall speech - of those with whom they disagree.

    For an example of what I'm talking about, look at the Southern Poverty Law Center's pronouncements - including especially their advice to law enforcement and the Department of Homeland Security that displaying bumper stickers supporting Ron Paul during the presidential primary, or any of a number of other pro-Constitution or Tea Party political position messages, was a sign that the driver was a terrorist.

    --
    Bantam Dominique roosters crow a four-note song. Once you've heard it as "Happy BIRTHday" you can't NOT hear it that way
  2. Good idea by AlphaWolf_HK · · Score: 3, Insightful

    Now instead of just laws requiring data retention to prevent child pornography, we can now also use genocide prevention as an excuse. And then of course just use it to go after copyright infringers.

    If you want to learn about genocide speech, go to stormfront.org, there's no need to build a new database when somebody has already created one for you.

    --
    Careful with names containing L slashdot.org/~AiphaWolf_HK slashdot.org/~AlphaWoif_HK slashdot.org/~AiphaWoif_HK
    1. Re:Good idea by Intrepid+imaginaut · · Score: 3, Insightful

      Correct, genocidal tendencies are not difficult to spot. For one thing a large segment of the population is regularly talking about killing, imprisoning or deporting a smaller segment of the population. Genocide by its nature requires the involvement of lots and lots of people, and they won't be shy about giving you their opinion. For example pre world war 2 Germany was riddled with anti-semitism, that's a hazardous situation. It's not something that appears from a vacuum nor a spontaneous event.

      I'd be very wary of this initiative as another effort to water down terms which describe truly horrendous crimes by assigning them to lesser actions, like the incessant use of 'rape' by hardline misandrists.

    2. Re:Good idea by Alsee · · Score: 3, Insightful

      Actually stormfront.org is valuable as a huge negative data point.

      The purpose of their project, as I understand it, is to detect when a population is in the early stages of an actual developing genocide situation. Stormfront.org is a sample of the sort of speech that occurs in a group which is grossly failing to to get anywhere. The stormfronters have all sorts of grand fantasies of what they want and believe, but at least on some level they know damn well that they don't have general public support for it. Stormfront.org's rhetoric is filled with an attitude that they are persecuted victims, the feel frustrated and powerless. I expect that is about the last thing you'd find in a genuine developing genocide situation. In a genuine genocide situation the hate speech agitators are not feeling powerless - they are feeling supported and powerful and emboldened... that they can boldly go out there without hiding their intent, without fear or shame, to seize control, to just plain engage in flagrant public violence.

      The stormfronters feel like powerless victims. They may sneak around in the dark and commit pointless vandalism like spiteful children, but they are not anywhere in the same universe as a situation where people go out in broad daylight committing mass violence and inviting their police-buddies to come along with their cop-cars and heavy weapon supplies.

      -

      --
      - - You can't take something off the Internet! That's like trying to take pee out of a swimming pool.
  3. Re:A precursor to genocide? by Anonymous Coward · · Score: 3, Funny

    How many picohitlers is that?

  4. Thought crime by ildon · · Score: 3, Insightful

    Sweet. We finally reached the point where we're just looking for thought crimes.

  5. I'll contribute by srussia · · Score: 4, Funny

    From hatebase.org:

    "Language-based classification, or symbolization, is one of a handful of quantifiable steps toward genocide.
    To support Hatebase, please contribute to our database, either by adding vocabulary or by logging sightings and citations.


    My submission:

    Language: Beltwayspeak
    Vocabulary: "senior operational leader", "enemy belligerent", "imminent threat", "organized armed groups"
    Source: http://msnbcmedia.msn.com/i/msnbc/sections/news/020413_DOJ_White_Paper.pdf

    --
    Set your phasers on "funky"!
  6. China-Friendly by caiocaiocaio · · Score: 5, Interesting

    1) They are missing most Chinese-language racial slurs, and are apparently not searching for Chinese characters. I think the results would be predominantly Chinese otherwise. I mean, how could they miss "waiguolao"? In China, hearing that word was my red flag to get indoors or to a cop as soon as possible. 2) I could find you literally thousands of websites calling for genocide in China (either against resident minority groups or towards immigrants in China) which don't use any ethnic slurs. Most of the ethnic slurs in China are condescending more than hateful (except those directed at the Japanese), and using more neutral terminology gives pro-genocide Chinese an air of legitimacy. I can remember when "nanlaowai", for example, was quite a popular blog, but their didn't use any racial slurs in spite of the constant demands for the ethnic purification of China. Chalk it up to cultural difference I guess.

    1. Re:China-Friendly by Anonymous Coward · · Score: 3, Insightful

      apparently not searching for Chinese characters

      They're not even allowed. In a time when most of the web has switched over to Unicode this is an unbelievable oversight.
      From the FAQ:

      Why doesn't submitted vocabulary retain any accented characters?
      Hatebase is a multilingual platform and accepts all UTF-8 characters (including accented characters), but "latinizes" the main vocabulary field to optimize search performance.
      If you're adding a term with an accent, simply repeat the term somewhere in the "meaning" field, which isn't latinized.

      Why doesn't Hatebase accept non-UTF8 characters?
      At present, Hatebase is architected to display UTF-8 characters only -- basically, the extended Latin alphabet, including accented characters. Further extending Hatebase to logographic character sets is certainly on our roadmap, but may not happen anytime soon unless we hear from our users that it's strongly desired.

      So according to them Chinese characters cannot be encoded in UTF-8? And entries in non-Latin alphabets are excluded from a multilingual database? These people don't know what they're doing.

  7. Re:Hatebase as in hate speech, as in ... by retchdog · · Score: 4, Interesting

    Oh, calm down.

    This is the same as any NLP crowd-sourcing tool; it's simply designed with a focus on correlating vocabulary with prejudicial sentiment. No one is planning to use it to pass restrictive laws. It's just useful for people who are involved in a country, but are not fluent speakers of $foo or involved in the right subcultures, to know that a certain word has now acquired a negative connotation.

    Maybe those people should butt out, sure, but you're jumping the gun a bit, here. If they could force everyone to use ``political correct speeches", they wouldn't need this app in the first place.

    --
    "They were pure niggers." – Noam Chomsky
  8. Re:Mostly false positives, will be used for "hate" by mrbluze · · Score: 5, Insightful

    My expectation is that this will be used for political infighting, much like the genocide it purports to try to head off.

    I rather think this will be used to weed out political dissent among the population.

    --
    Do it yourself, because no one else will do it yourself. [beta blockade 10-17 Feb]
  9. Re:Hatebase as in hate speech, as in ... by overlordofmu · · Score: 4, Insightful
    Instead of "they", please try using the the word "we".

    Here is a FTFY (aka fixed-that-for-you) example. I will now conjugate the following:

    No, they do not believe in the true concept of FREE speech - their only aim is to force everyone in using political correct speeches

    With the FTFY conjugation which takes ownership of all aspects of society by turning all third person plural forms into first person plural forms that quote becomes:

    No, we do not believe in the true concept of FREE speech - our only aim is to force everyone in using political correct speeches (sic)

    My point is that "they" are not the problem. My point is that "we" are the problem. Every last fallible one of us can be a problem or a solution. The difference is often a matter of how compassionate we are combined with how much we are able to take personal responsibility for problems. Even (maybe especially?) the problems which seem to be caused by other people.

    While I may be liberal and you may be conservative, the reality is that our society is comprised of both of us and we are both liberal and conservative. We are all the things we which are. By treating the problem as "our" problem instead of "their" problem we can approach the solution with realism and healing, instead of idealism and revenge.

    Of course we could, instead, go on blaming other people, but look where that has gotten us so far . . .

  10. Re:Hatebase as in hate speech, as in ... by overlordofmu · · Score: 4, Interesting
    NLP = natural language processing

    This is the same as any NLP crowd-sourcing tool; it's simply designed with a focus on correlating vocabulary with prejudicial sentiment.

    In case some of you were wondering about the acronym. That becomes:

    This is the same as any natural language processing crowd-sourcing tool; it's simply designed with a focus on correlating vocabulary with prejudicial sentiment.

    To take the conjugation one step further it becomes:

    This shit be the same shit as any goddamn shit where we get other motherfuckers to do the fucking dirty work of working out when shit-talkers, shit-talking in some other fucking language, be talking shit is a way that means that those fuckwits mean to start some shit.

    Of course, sometimes you can take conjugation a bit too far.

  11. Re:hatebase? by jamesh · · Score: 3, Insightful

    I'm with you. Haterbase would be a much cooler name.

  12. "The 8 stages of Genocide" by nomad-9 · · Score: 4, Insightful

    Just read the part about the 8 stages of genocide of this so-called "Genocide Watch" ( http://www.genocidewatch.org/aboutgenocide/8stagesofgenocide.html )

    Unfortunately they focus mainly on religious and ethnic hatred, which doesn't really account for some of the biggest genocides of the 20th century like in Pol Pot's Cambodia, Stalin's USSR and Mao's China, They do mention Pol Pot a couple of times, for the "blue ribbon" symbolism and the "Denial" stage, but miss the root of the problem. Their view is shallow at best, IMO.

    It is fashionable to focus almost exclusively on race, religion and nationalism, but ironically, the biggest killings in the past century came from ideologies aiming to unite mankind beyond those "hate" barriers.

    "Genocide Watch" would have probably missed those "early stages" of Communism...

  13. no data for big data by stenvar · · Score: 4, Insightful

    The basis for this appears to be pure speculation. There is no actual data (big or otherwise) showing the validity of the assumptions on which this is based.

  14. New ammunition for political correct do-gooders? by Takatata · · Score: 4, Insightful

    Always looking for new ways to feel morally superior and lecture others. 'You don't say this', 'you don't say that'. Everyone says that? No negative connotation? Who cares? We say it is discriminating and if you don't follow us, you are a racist. Fortunately those cannot read thoughts, else they would tell you what to think and what not to think.

  15. Re:Mostly false positives, will be used for "hate" by AlphaWolf_HK · · Score: 3, Insightful

    I think you just proved his point. You are doing exactly that, you are using this as fodder for political infighting. You don't like it so you label it as hate speech. Does that mean we treat it the same way Europe treats it? Somebody makes an anti-Semitic comment on twitter, so France wants to put them in jail?

    Anyways, yes they both do it. 10 seconds of google and I found something that tops yours: Obama makes fun of disabled people:

    http://abcnews.go.com/blogs/politics/2009/03/president-ob-15-3/

    You're one of those people who I commonly rail against when I say we need to stop treating the election like we're rival football teams. Quite possibly one of the ones I rail against for blind voting.

    --
    Careful with names containing L slashdot.org/~AiphaWolf_HK slashdot.org/~AlphaWoif_HK slashdot.org/~AiphaWoif_HK
  16. Re:Hatebase as in hate speech, as in ... by SplashMyBandit · · Score: 5, Informative
    The parent was right. Why do you think the Left and leftist media completely ignore the genocides happening *today*. These are happening and you are completely unaware of them. Why? because the Left don't want you to know. If you care to look you can see:
    • The Islamic genocide against Assyrian Christians in Iraq. Most Assyrians have fled and the rest have been killed. Don't blame this on the US invasion of Iraq. It happened under Saddam. Now under democracy (as in, "mob rule") the pace has accelerated, but the motivation is the same. Sura 9:5 "The Verse of the Sword". See
      http://www.answering-islam.org/Silas/swordverse.htm
      Did you know this?
    • The Sudanese Islamist genocide against animist and Christian Southern Sudanese. Jihadi raids still occur and non-Muslims are taken as *slaves* (as permitted under Sharia). This is happening *today*. See the testimony of Simon Deng (a former slave).
    • The call for genocide against Jews, as commanded by the Qur'an. See the following article for all the reasons the Qur'an and Palestinians give for killing Jews based on *mainstream* Islamic doctrine: http://palwatch.org/main.aspx?fi=584
      Note that anyone who supports the Islamist cause against Israel is unwittingly supporting this genocidal agenda. The Israelis are not the aggressors (as the historically ignorant often believe). The Jews have been living in Palestine *continuously* for over 3000 years (the Roman expulsions were temporary, and only from Jerusalem). The modern State of Israel is trying to *defend* its citizens from the *aggression* and schemes of the 57 Muslim Majority countries in the Organization of Islamic Cooperation (although countries like Azerbaijan get on well with Israelis, and after many attempted genocides Jordan now has working relations). Anyone who thinks the Arabs are the victims simply doesn't know the genocidal agendas of Hamas and Hezbollah (hint, look for the hate speech in the Hamas Charter: http://www.thejerusalemfund.org/www.thejerusalemfund.org/carryover/documents/charter.html). When Hamas talks about "Occupied Territories" they mean killing, expelling, or enslaving all non-Muslims in *all of Israel*.
    • The expulsion, killing, and denial of rights to Serbs in northern Kosovo by Albanian Muslims. This is happening today. Did you know that?
    • The genocides between Sunni and Shia branches of Islam, and the Sunni+Shia attacks on more peaceful Sufis. This is happening *today* in Syria, Iraq, Iran, Turkey (against Kurds), Pakistan as Islamic sect battles Islamic sect (with the aim of expelling the "apostate" sects): See: http://www.thereligionofpeace.com/
      particularly http://www.thereligionofpeace.com/index.html#Attacks
    • We won't even talk about the historical genocides of approximately one million Armenians by Turkish jihadis in 1915, or the hundreds of thousands slaughtered by the two Bosnian Muslim SS divisions in World War II (raised by the Grand Mufti of Jerusalem, Haj Amin al-Husseini, http://en.wikipedia.org/wiki/Mohammad_Amin_al-Husayni - the bastard who told Hitler the idea of the "Final Solution", stating "Who remembers the Armenians").
    • Then we have the genocides soon to come. The persecution and subjugation of Christian Copts in Egypt. Now that Obama installed the Muslim Brotherhod the life of Copts is going from bad (as dhimmis) to worse. Bad stuff is happening to the Copts *every day*. And the media and leftist "human rights" organizations are mostly silent (with a few exceptions). Did you know all the bad stuff happening to Copts today? http://www.amnes
  17. Re:Mostly false positives, will be used for "hate" by RogueyWon · · Score: 5, Insightful

    I'm not in the US, so maybe I'm missing some context here, but...

    How on earth are either of the links you've just posted examples of hate speech? The first is a line on the abortion debate that we've seen many times over the years. I'm not going to pick sides in that one; but if you approach the debate (as some people do) with the starting point that foes "life begins at conception" then abortion is infanticide. I think a lot of the lack of civility around that particular debate stems from the fact that neither side recognises just how high the stakes feel for the other side in it.

    The second link is a fairly silly take on the gun control debate that somehow slides into an odd reductio ab absurdum take on the gay marriage debate. But again, incoherent though it is, is it really hate speech?

    If somebody says "All members of (ethnic group x/social group y) are scum! Let's (kill them/throw them out of our country/deprive them of their property rights)" then that feels like hate speech. That's a hell of a long way from either of the examples you link to.

    As a test, let's take an example from a left-wing perspective of somebody linking a (generally supported - the UK public consistently backs a tougher line on welfare in polls) Government policy to murder. In this case, it's the murder of the disabled rather than the infanticide, but I think that's still pretty emotive. So: from the UK's Guardian newspaper. Is that hate speech?

    If you answer "yes", at least you're consistent. If your answer is no, then it looks more like you're just demonstrating totalitarian instincts to suppress speech that goes against your own values.

  18. How to by chaos_technique · · Score: 3, Interesting
    Filling the search field with a | (pipe) will give you the full listing of their "base".

    Ha, only 729 ethnic or national slurs?

    Come on, even I could do better than that :-)

    The programming seems interesting too: non ascii characters in the Search box will break the site.

    --
    Singe capitulard mangeur de fromage
  19. Re:Mostly false positives, will be used for "hate" by sg_oneill · · Score: 3, Insightful

    Yeah dude. Political scientists don't think like that and tend to be fairly serious minded men. If activists are putting in bogus data, its going to stick out like a sore thumb.

    This seems to be more like some of the research google was doing spotting emerging trends via language use.

    Go read the stuff about the 8 stage genocide model and specifically on the 'symbolization' phase. I suspect its more about looking for trends like where a population for instance stops saying "jews" and starts saying "kikes" or whatever, whith the observation that a population is heading towards the crucial dehumanization phase needed to allow people to sleep at night whilst committing genocide.

    --
    Excuse the Unicode crap in my posts. That's an apostrophe, and slashdot is busted.
  20. No, you're clueless by Su27K · · Score: 4, Informative

    "waiguolao" definitely has a negative tone to it, and no it cannot be interpreted as "foreign elder". "waiguo" is "foreign country", you got this correct; but here "lao" is not the same Chinese character for "old/elder", it has ren (person) radical and it's a slur for a group of people. I think you confused it with "laowai", which is considered a neutral word to describe a foreigner by Chinese but not so much by the foreigners themselves.

  21. Re:Hatebase as in hate speech, as in ... by K.+S.+Kyosuke · · Score: 3, Insightful

    This is the same as any NLP crowd-sourcing tool; it's simply designed with a focus on correlating vocabulary with prejudicial sentiment. No one is planning to use it to pass restrictive laws. It's just useful for people who are involved in a country, but are not fluent speakers of $foo or involved in the right subcultures, to know that a certain word has now acquired a negative connotation.

    Except that it's not going to work. Show me an NLP system that correctly distinguishes statements from quotations and references. You'd need an AI-complete system to divine the actual state of mind of the speaker. Or does the "crowdsourcing" part mean that there are going to be scores of people checking and rechecking flagged texts?

    --
    Ezekiel 23:20
  22. Re: Hatebase as in hate speech, as in ... by Anonymous Coward · · Score: 3, Interesting

    Stupid moron. You are right with your anger at islam. What you don't understand is that it's the right who manfacture this enemy. Your ultra-rightist presidents gave money to islamist fundamentalists to defeat russian communism (which, as an atheist state, was not pro-islam).

    Speaking of "stupid moron",

    All of the American left, right, center, up, and down supported the use of Islam, Christianity, and religious identity in general as a weapon against the Soviets. Islam was not as uniformly hostile and violent back then as it is today, although the CIA made awful choices of which Muslims to partner with.

    The Russians began a large-scale campaign of terrorist funding in 1964. The reason so many Islamic terrorist organizations speak the language of anti-American postcolonialism is because their founders learned it at Patrice Lumumba University.

    It is largely the left today that allies with the Islamic right wing. Look at what happened to Michelle Bachmann when she reasonably suggested that people in sensitive wartime posts with possible Muslim Brotherhood connections should be vetted; not fired, not punished, but simply vetted. She was mocked and condemned for such a thing, and called a racist. Look at what happened to Amnesty International's Gita Saghal when she protested Amnesty's alliance with the Taliban; she was fired and Amnesty shut down its womens' rights division so that no more uppity women would interfere with the great deal Amnesty has with the Taliban. Look at what happened to Eric Allen Bell when he criticized the website Loonwatch as "a terrorist spin control network"; the Daily Kos banned him. Look at what happens whenever anybody references the Muslim Brotherhood or the Hamas charter in the Guardian's comments section; their post is immediately deleted.

    The right has its own alliances with Islamic terrorism. During the 1990s, it was Republicans who got into the news for praising the Taliban. Grover Norquist partnered with Abdulrahman Alamoudi to raise terrorist funds for George W. Bush. More recently, within the past few weeks, the Conservative Political Action Committee (CPAC) banned Jihad Watch founder Robert Spencer from receiving an award for fear that he might criticize Norquist or another Republican fundraiser, Suhail Khan.

    This discussion is not about Democrats or Republicans, it is not about the worker and the capitalist, it is not about the rich and the poor. It is about the double standard of modern society's tolerance of hate speech and actual genocide if done in the name of Islam, and the triple standard of ostracizing anybody who opposes the hate speech and genocide.

    The question is in the air as to whether Hatebase will end up like the Southern Poverty Law Center's Intelligence Project, which has made the decision not to report hate speech by Muslims in its reports on hate speech in the United States. Their excuse is (paraphrasing) "the Jews do that", and evidently the SPLC does not want to risk being associated with the Jews.

    Will people speak up or remain silent?