Geomapping Racism With Twitter
Hugh Pickens writes "Megan Garber writes that in the age of the quantified self, biases are just one more thing that can be measured, analyzed, and publicized. The day after Barack Obama won a second term as president of the United States, a group of geography academics took advantage of the fact that many tweets are geocoded to search Twitter for racism-revealing terms that appeared in the context of tweets that mentioned 'Obama,' 're-elected,' or 'won,' sorting the tweets according to the state they were sent from and comparing the racist tweets to the total number of geocoded tweets coming from that state during the same time period. Their findings? Alabama and Mississippi have the highest measures followed closely by Georgia, Louisiana, and Tennessee forming a fairly distinctive cluster in the southeast. Beyond that cluster North Dakota and Utah both had relatively high scores (3.5 each), as did Missouri, Oregon, and Minnesota. 'These findings support the idea that there are some fairly strong clustering of hate tweets centered in southeastern U.S. which has a much higher rate than the national average,' writes Matthew Zook. 'But lest anyone elsewhere become too complacent, the unfortunate fact is that most states are not immune from this kind of activity. Racist behavior, particularly directed at African Americans in the U.S., is all too easy to find both offline and in information space.'"
The SPLC does in deed execute their honorable mission. Go to http://truthy.indiana.edu/ for other meme propagation and dissemination graphics so you can see that this is one lens to the output of a much larger engine.
---- Teach Peace. It's Cheaper Than War.
How did they account for multiple racists tweets from one "tweeter"?
One racist sending 100 racist tweets is not the same as 100 different racists each sending one racist tweet each.
Reading the article it doesn't look like they bothered. And they only found a total of 395 tweets which will lead to appalling precision in any of their findings. Sadly 'information scientists' don't always appear to be the best statisticians.
Why is it irrelevant? They claim they're measuring the geolocation of racism, but only pick one very specific type.
BTW, I just checked out a sample size calculator. For a 95 percent confidence level with a +- 5% confidence interval, and a population of 400 million, guess what your sample size needs to be.
384.
Now this calculation for a survey is a little different from what the researchers are doing here, but it illustrates my point. You can do a lot with small sample sizes if the differences between groups are large.
Give me Classic Slashdot or give me death!
The floatingsheep page specifically says, "we are measuring tweets rather than users and so one individual could be responsible for many tweets and in some cases (most notably in North Dakota, Utah and Minnesota) the number of hate tweets is small and the high LQ is driven by the relatively low number of overall tweets." It's not their fault that the author of the Atlantic article left out those details.
I hate to break it to you, but the press doesn't understand peer reviewed work any better. Whenever media ever looks at any academic work they completely misrepresent it. That's something you get used to.
You are right but this means that the peer review filter is even more important so that what gets out to the media and beyond has at least some chance of being right. Also, having been through the process a few times I'd say academics are at least as guilty of overstating their findings as journalists. We want the headlines and the 'impact' as much as journalists to.
LOL
You clearly did not look at the website. The guys who did this are in academia, but the website is not an academic publication nor would any of the stuff they post garner funding. Their top articles include such things as "The Beer Belly of America", "The Price of Weed", "Church vs. Beer? on Twitter" and "Church, bowling, guns and strip clubs."
They're doing this as a fun hobby, not serious academic research. There is no funding, no grants, nothing.
395 racist tweets from a 0.05% sample works out to 790,000 racist tweets for the country. Even if you assume each racist only posted one racist tweet max, that's 0.25% of the country overall being racist. For the state with the highest rate (Alabama) that's 2%. In an effort to root out racism, this study is presenting their results in a careless fashion which could be used to justify discrimination and anti-Southerner stereotypes against 98% of Alabama residents because of a small minority of bad apples. It is doing the very thing it is criticizing.
395 that contained search terms that the searcher decided indicated racist intent. There's no indication that any of the tweets that came up were filtered for actual racist intent. For example "Obama won bitch, niggers back" would have came up as a "racist" tweet. However it's likely that such a tweet was not a racist and that the author of it was probably black. If you look at the clustering of where tweets were coming from you see that a high clustering was occurring in cities like Atlanta, Georgia or Montgomery, Alabama which has a majority of a minority population (54% black and 56.6% black respectively).
The study is flawed and was never properly scrubbed, but the results were posted because it fits a preconceived notion (the south is racist) when the majority of the effect that causes that effect to show may originate from the very population which is the victim of racism against blacks.
"Lack of speed can be overcome. In the worst case by patience." --Znork