What The DHS Is Looking For In Your Posts
New submitter lister king of smeg writes "As we all know, the Department of Homeland Security monitors social networks in an attempt to expose 'Items Of Interest.' As it turns out, many terms, including seemingly benign words such as flu, agent, response, cops drill, etc., are on the list of words that set off warning bells for the government spooks. Many of the terms make sense ..., but there are some real stupid ones on the list too, like 'social network' ... [according to a] list of key words provided to a DHS contractor that were released by EPIC."
Of the 4 examples in this post, "flu", "agent", and "cops drill" aren't inappropriate things to monitor (if it's appropriate to monitor at all, which is another story). "Flu" tracking is important for epidemics. Monitoring discussions of the location of cops and agents seems to make sense too. Again, I think it's silly they're trying to monitor social networks to this level, but if they're going to do it, those aren't the worst keywords.
Also, I guess now I'm going to be tracked for discussing the keywords. How very meta.
Developers: We can use your help.
I have noticed the government getting more and more drunk and insane with its control and power, and it is steadily getting worse. My only question is how tyrannical they will get before the citizens of this nation wake up and turn on them...
Politics is Treachery, Religion is Brainwashing
I predict a large percentage of false positives as word of this list spreads across the social networks.
One of our competitors trademarked the term "hypothesis". From now on, we will call them "boneheaded ideas".
Call me a conspiracy nut, but I wouldn't think that the DHS would let their wordlist get released if all they were doing was matching texts on specific terms. That is no better than a really dumb Bayesian spam filter and would easily be defeated with childishly simple methods. When it comes to content filtering and semantic extraction, the science has moved way beyond such simple methods. Actually it is a very interesting research topic, and I would love to have a job developing such models for the DHS if it weren't for such an immoral purpose.
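To make the point concrete, here is a minimal Python sketch of the kind of plain term matching described above. It is an illustration only, not how DHS or its contractor actually processes posts; the WATCHLIST subset and the flag_terms function are hypothetical names, and the terms are just ones quoted in the summary.

```python
import re

# Hypothetical subset of the list: only terms quoted in the story summary.
WATCHLIST = {"flu", "agent", "response", "social network"}

def flag_terms(text, watchlist=WATCHLIST):
    """Return the watchlist terms that occur in text as whole words or phrases."""
    lowered = text.lower()
    hits = set()
    for term in watchlist:
        # \b anchors keep "agent" from matching inside "reagents", for example.
        pattern = r"\b" + re.escape(term) + r"\b"
        if re.search(pattern, lowered):
            hits.add(term)
    return hits
```

An innocent sentence like "My agent says the flu shot is free" gets flagged for two terms, while writing "f1u" instead of "flu" slips past untouched. That is the false-positive and easy-evasion problem in one function.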
Other signals they likely use to extract information are the dates and times when messages are sent and the IP addresses they are sent from. Also how well written they are and what kinds of spelling and grammatical errors they contain. Native speakers of Semitic languages such as Arabic make different kinds of spelling errors than Germanic language speakers. That's just off the top of my head. My point is that government surveillance organisations aren't as dumb as the article seems to suggest.
Football Odds
Throw the words from the DHS list in with the usual spammer's algorithm to generate nonsense text.
The end result could look like it was written by a cross between Sarah Palin and Osama Bin Laden.
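A rough Python sketch of that idea, for the curious. This is plain random word sampling, not an actual spammer's generator (those tend to use something like Markov chains trained on real text). WATCHLIST, FILLER, and the nonsense function are all made-up names for illustration, and the watchlist holds only terms quoted in the summary.

```python
import random

# Hypothetical inputs: a few terms quoted in the summary plus arbitrary filler words.
WATCHLIST = ["flu", "agent", "response"]
FILLER = ["the", "my", "weather", "lunch", "was", "about", "really", "today", "with", "nice"]

def nonsense(n_words, rate=0.3, seed=None):
    """Build n_words of gibberish, drawing from WATCHLIST with probability rate."""
    rng = random.Random(seed)  # a fixed seed gives repeatable output
    words = []
    for _ in range(n_words):
        pool = WATCHLIST if rng.random() < rate else FILLER
        words.append(rng.choice(pool))
    return " ".join(words)
```

Calling nonsense(20) gives twenty words with watchlist terms sprinkled in at roughly the chosen rate, which is about all it takes to trip a matcher that only looks for terms.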
I agree. This list is probably opsec on the DHS side. Disinformation. If it were me, I would technically have a 'keyword list' matching system specifically for release in FOIA situations like this. The actual searching, identification, and tagging is algorithmic and context-based and has very little to do with this list.
How old are you, 103?
When I was a kid in the '70s, it was quite common to insert the phrase "screw you J. Edgar Hoover" into any telephone conversation when odd noises were heard over the line. Most adults I knew did it. People my parents' age still say it on occasion. It seems pretty clear their assumption was that the FBI was listening in on personal phone calls with impunity.