Slashdot Mirror


Algorithms Claimed To Hunt Terrorists While Protecting the Privacy of Others (vice.com)

An anonymous reader sends this report from Motherboard: Computer scientists at the University of Pennsylvania have developed an algorithmic framework for conducting targeted surveillance of individuals within social networks while protecting the privacy of untargeted digital bystanders. ... The algorithms are based on a few basic ideas. The first is that every member of a network (a graph) comes with a sequence of bits indicating their membership in a targeted group. If say, the number two bit was set in your personal privacy register, then you might be part of the “terrorist” target population. For an algorithm searching a network for targets, it doesn’t just get to ask to reveal every network member’s bits. It has a budget of sorts, where it can only reveal so many bits and no more. The algorithms work to optimize this scenario such that as many bits-of-interest are revealed as possible. It does this optimization via a notion known as a statistic of proximity (SOP), which is a quantification of how close a given graph node is to a targeted group of nodes. This is what guides the search algorithms.

45 of 81 comments (clear)

  1. Simpler explanation by sunderland56 · · Score: 4, Funny

    When signing up for Facebook, everyone needs to either check or uncheck the "I am a terrorist" box. That way the Government can do detailed searches on terrorsts only, and not invade the privacy of non-terrorists.

    1. Re:Simpler explanation by Fire_Wraith · · Score: 4, Interesting

      You joke, but this is actually a question on the customs declaration and entry form given to everyone arriving in the United States.

      Of course, they don't actually expect anyone to say 'yes' - the idea (as I understand it) is to give the authorities one more thing to charge an actual terrorist with.

    2. Re:Simpler explanation by phantomfive · · Score: 1

      Are you sure? I just entered the US a couple months ago, and I don't remember seeing it on the customs declaration. I could have just forgotten that part, though.

      --
      "First they came for the slanderers and i said nothing."
    3. Re: Simpler explanation by Anonymous Coward · · Score: 1

      How about: "I own a gun."

    4. Re:Simpler explanation by Knuckles · · Score: 3, Informative

      On the visa application form, https://www.schneier.com/blog/...

      --
      "When I first heard Daydream Nation it quite frankly scared the living shit out of me." -- Matthew Stearns
    5. Re:Simpler explanation by tinkerton · · Score: 1

      The form is being read out loud here. Because it always was funny.
      https://www.youtube.com/watch?...

    6. Re:Simpler explanation by houghi · · Score: 1

      They d not have the option 'I never thought about it, but now I do'. It also should be obvious that 95% of the senators should fill out 'yes' here.

      --
      Don't fight for your country, if your country does not fight for you.
    7. Re:Simpler explanation by Vitriol+Angst · · Score: 1

      It swells my heart with pride to say that the TSA has caught everyone checking YES on the question; "I am a terrorist and thanks for asking!" And exactly ZERO smart assess who can't help themselves by making fun of Homeland Security have gone unpunished.

      To date, they may have saved the planet, or at least dealt with up-armored homeless people before this and the urine smell on subways escalates beyond control. Does all this splendiferous success merit a $1 trillion dollar price tag? Some cynic might say that for $500 billion we could win hearts and minds by building hospitals and schools in the nations that breed terrorism, but those are the same people check "I am a terrorist and thanks for asking!", and we'll take care of them all eventually, as our policies get more invasive and dumber.

      --
      >>"ad space available -- low rates!!!"
    8. Re:Simpler explanation by Hognoxious · · Score: 1

      IIRC the last question used to ask if you'd lied on the other questions. I'm imagining hundreds of criminal masterminds saying "Bah! Foiled again!" over that.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    9. Re:Simpler explanation by david_thornley · · Score: 1

      It also gives them the ability to deport someone immediately if they gave a false answer on the form. (A few years ago, it became known that one of my next-door neighbors had been in the Galician SS, and was the commander of a unit that committed war crimes. Mikhail Karkov was a nice guy and good neighbor, BTW. You know that his entry papers said he hadn't been in the SS, although he had very little command of English at the time, and we were wondering if the State Department would use that to deport him. As it happens, everybody wound up figuring that doing anything to a 94-year-old man who had been a good citizen or legal resident for the last seventy years was pointless.)

      --
      "When you have eliminated the unacceptable, whatever is left, however improbable, must be the truthiness" - Holmes
  2. We're slowly diving into Person of Interest by Lisandro · · Score: 2

    And i thought it was just a cool fun show.

  3. statistic of proximity by fustakrakich · · Score: 2

    Meaning, guilt by association. Yeah, that should work....

    --
    “He’s not deformed, he’s just drunk!”
  4. Re:AKA by davester666 · · Score: 3, Insightful

    requires something that so far the gov't has shown no interest in...not invading peoples privacy.

    --
    Sleep your way to a whiter smile...date a dentist!
  5. Obvious algorithm by penguinoid · · Score: 1

    The obvious algorithm is to vacuum up all data from every citizen, in case your other algorithm gets updated you can re-run it more quickly and without risk of some of the data having been deleted since then.

    --
    Don't waste your vote! Vote for whoever you want, unless you live in a swing state it won't matter anyways
  6. Not guilt, but suspicion by association by drnb · · Score: 2

    Meaning, guilt by association. Yeah, that should work....

    Not guilt, but suspicion by association, and yes it has worked. A while ago the FBI got phone bill type information ("metadata", both phone numbers, date/time, duration) for known organized crime members and built a graph of all the connections these phone calls revealed. The FBI knew most of the nodes in the graph would be innocents; even criminals make restaurant reservations, see if dry cleaning is ready, etc. However analysis of the graph helped discover people actively involved in organized crime who had been completely unknown. This graph analysis was not simply looking at proximity as you suggest, I believe it looked at the number of unique connections from known organized crime members and various other factors.

    1. Re:Not guilt, but suspicion by association by mooterSkooter · · Score: 1

      I dislike crime as much as the next man but this worries the hell out of me.

    2. Re:Not guilt, but suspicion by association by Qzukk · · Score: 1

      I hope you haven't phoned in a pizza order.

      --
      If I have been able to see further than others, it is because I bought a pair of binoculars.
  7. Even more simple by s.petry · · Score: 2

    Everyone will be profiled and rated in a method completely hidden from them.

    Just like the FBI, DOJ, TSA, DHS, etc.. do now in fact. The difference here is that it leaves open the "we outsourced that to Facebook and Google, we didn't have anything to do with their bad decisions." plausible deniability option. Those people can say "We took the algorithm from some professor at some college", so they get the same benefit.

    Yup, I am extremely cynical and have become so after being proven correct way too many times.

    --

    -The wise argue that there are few absolutes, the fool argues that there are no probabilities.

  8. and the no-BS version of this algorithm? by Anonymous Coward · · Score: 1

    Ok, I work in academia publishing (other field, though) and this is obviously an interesting math problem solved before somebody declared "hey, if we named those higher potential nodes as 'terrorist' we could apply to a bunch of military funds!"

    So what was it? I can think of versions of the travelling salesman (where non "terrorist" nodes act as cities that could be visited when designing a path to an "evil" one) or an electronics routing problem (where signals has to pass around heavy loaded blocks disturbing only the minimum components, i.e. "respecting the privacy" is like ignoring the state of some adyacent circuitery instead having to know exactly what is the exact load of the system)

    1. Re:and the no-BS version of this algorithm? by bitchtits · · Score: 2

      I'd fully expect the "algorithmic framework" has commercial uses too, particularly in advertising. And no doubt it could also be used to track networks of assorted "anti-socials". I guess that lacks the public appeal of the spin they actually used: "counterterrorism and the containment of infectious disease".

  9. RFC 3514 has finally been implemented! by Sean · · Score: 4, Funny
    1. Re:RFC 3514 has finally been implemented! by chrism238 · · Score: 2

      Indeed! :-) Alas, I can't find it now, but I can remember a similar 'proposal' for a single bit to be reserved to support a female-only USENET group. One more for the confected *gate community.

  10. The crime of lying to a Federal Agent by Etherwalk · · Score: 2

    You joke, but this is actually a question on the customs declaration and entry form given to everyone arriving in the United States.

    Of course, they don't actually expect anyone to say 'yes' - the idea (as I understand it) is to give the authorities one more thing to charge an actual terrorist with.

    If you're a US Citizen they don't make you sign that on entry, at least not normally. I don't know offhand for foreigners of if you're bringing in a lot of goods. They *do* have that on security clearance applications.

    And you're right, the idea isn't that you'll answer yes, it's that if you answer no and turn out to be a terrorist or have supported terrorism, etc..., then you've committed a felony by having lied to a federal officer. (YES. Lying to feds is a crime. The First Amendment doesn't protect you from that.) So they can arrest you and throw away the key, at least for a while.

    1. Re:The crime of lying to a Federal Agent by Vitriol+Angst · · Score: 1

      The sort of people who use "yes I'm a terrorist" as an excuse to remove your civil rights -- or at least load up charges, are the same douche-bags who would falsify evidence because they KNEW you were guilty.

      Nobody is convinced by "yes I'm a terrorist" but the dishonest and eager. It seems our local PD mentality runs all the way to our HS. If they can't find real terrorists, they keep lowering the bar to call SOMEONE a terrorist.

      I can hear it now; "OK, we didn't find any weapons, but we do know that you lied when you said you packed your own bag. Scum like you will never learn."

      --
      >>"ad space available -- low rates!!!"
    2. Re:The crime of lying to a Federal Agent by tehcyder · · Score: 1

      And you're right, the idea isn't that you'll answer yes, it's that if you answer no and turn out to be a terrorist or have supported terrorism, etc..., then you've committed a felony by having lied to a federal officer. (YES. Lying to feds is a crime. The First Amendment doesn't protect you from that.) So they can arrest you and throw away the key, at least for a while.

      But it's only a lie if you have been convicted of terrorism, surely?

      In which case, serving an extra year or two for lying to the Feds on top of the forty eight thousand years you'll get for actual terrorism seems irrelevant.

      --
      To have a right to do a thing is not at all the same as to be right in doing it
    3. Re:The crime of lying to a Federal Agent by Etherwalk · · Score: 1

      But it's only a lie if you have been convicted of terrorism, surely?

      It depends on the precise question. Usually it's not "are you a terrorist" so much as "Have you ever been a member of, or in any way associated (either directly or indirectly) with a terrorist organization." That's the question on the n-400 application for naturalization. Lying about it can get you arrested even if the way you were associated would not have. Like, I don't know, if you married a girl and then found out she has an uncle in ISIS. It's not a crime to marry a girl who has an uncle in ISIS (unless you invite them to the wedding), but it's a crime to answer no to that question if you married her.

    4. Re:The crime of lying to a Federal Agent by david_thornley · · Score: 1

      I can make a good case that the US has participated in terrorism in my lifetime, so I suppose I'd have to answer "yes".

      --
      "When you have eliminated the unacceptable, whatever is left, however improbable, must be the truthiness" - Holmes
  11. They had this tech by Etherwalk · · Score: 2

    requires something that so far the gov't has shown no interest in...not invading peoples privacy.

    Yes. I remember hearing someone had basically developer very similar, very careful tech for the NSA that did one of the surveillance routines they wanted but was *very* careful about user privacy... and they couldn't care less and decided to completely go a different direction. i.e. the one that didn't care about that. Maybe there was a slashdot article on it a few years ago?

    1. Re:They had this tech by davester666 · · Score: 1

      Oh, yeah, the other thing is that "terrorism" is just the sales pitch trotted out to the public as to why the data is collected and how it is used. But the actual use of the data is not actually limited to that [even if the data has "legally" been permitted to be collected only for one specific use, like say, to catch terrorist], but rather whatever they feel like using the data for.

      --
      Sleep your way to a whiter smile...date a dentist!
    2. Re:They had this tech by GameboyRMH · · Score: 1

      The various three-letter agencies have shown themselves to be at least as interested in peaceful protest groups as they are in terrorists, which is a messed-up set of priorities at the very least.

      --
      "When information is power, privacy is freedom" - Jah-Wren Ryel
    3. Re:They had this tech by david_thornley · · Score: 1

      And the other other thing is that, statistically, terrorists don't exist (meaning that a vanishingly small proportion of the populace is terrorists), so statistical tests to see if someone is a terrorist will either find nobody or have tons and tons of false positives.

      --
      "When you have eliminated the unacceptable, whatever is left, however improbable, must be the truthiness" - Holmes
  12. It's a Data Budget by mentil · · Score: 4, Informative

    Reading the article (gasp!) didn't elucidate things much beyond the summary, although it mentions infectious disease spreading as a possible application while maintaining privacy for unrelated health issues.

    In essence the idea is to use artificial scarcity via technological means to create a 'bit budget', where those who access a database of personal info are only allowed a certain amount of flags to search for; this encourages more efficient searching and thus less retrieval of extraneous data. This could be used so that private entities could try to find suitable targets for medical research or advertising, while revealing as little info about as few people as possible; and it might work in that situation. However, there are two big problems with this idea:

    1) It assumes the data is only accessible through this one database and can't be accessed in another, more privacy-invading way. If any analysts even suspect that the full dataset will be more useful, then they will use the full dataset if they can and this scheme will be useless. "More data better" seems to be the motto of Big Data despite the well-known haystack problem.

    2) Governments are always saying that barriers need to be broken down for their investigators, that they need more/new powers, so there's no way they'll stick to their bit budget. They're gonna ask for more, enough that they have effectively full access to the full dataset, and that's in the unlikely event that they're somehow limited to this access scheme. They're one private 'request', subpoena, or NSL away from full access, anyhow, and political pressure or tax/import/regulatory pressure would make most for-profit entities like Facebook cave in. If this database were maintained by some international nonprofit then it might stand a chance of resisting this.

    --
    Corruption is convincing someone that the selfless ideal is the same as their selfish ideal.
    1. Re:It's a Data Budget by retroworks · · Score: 1

      "The real problem is not whether machines think but whether men do."- B.F. Skinner

      --
      Gently reply
  13. Why limit collection... by AHuxley · · Score: 1

    The face? Others faces in a picture? Linguistic analysis? Terms used? Non english words? Slang? Tattoos or symbols? First hop of friends and their pics? Second hop of friends and their friends, links?
    3rd hop? Getting to the maths and scale of total collection yet? 4th hop?

    What can be found in a front facing web 2.0 site without the ip logs and support from regional ISP providers to ensure the ip range is even from a real persons computer, desktop, phone or tablet? For that deep telco support is needed. Did they use a VPN for all submissions? Access to the original IP is then needed.. local wifi? CCTV is good for that .. A real person or a friendly clandestine service setting another fake account up to fool contacts into joining a converstaion?
    The NSA and GCHQ dropped dictionary and friends of friends as its cheaper and much more useful to just collect it all globally.
    No clandestine service is going to set boundaries with a "target population" when anyone could be interesting to any friendly nation or agency asking for help over the years.

    ie the security services in 5 nations have aspects of every connection, term, scrap of information, ip, image, call collected.
    With no limits anyone found to be of interest can be backtracked over any year given a request by any mil or gov or a tip from an NGO, informant or other collection method.
    Limits on collection at the front end was only an issue to the US and UK in the 1950-70's when hardware could not keep up with early attempts at collect it all.
    Once enough hardware was installed the global telecommunications use was tamed, kept and could be indexed. Putting a filter on what is even considered for collection was of no use.
    Too many total strangers with no connections to anyone of interest got listed as been interesting and having information already collected on them was vital.

    Also note that a lot of easy to find groups are "turned" or total fronts of Western clandestine service as tools for color revolutions, freedom fighters, politically useful moderates mentioned in the press or vast sock puppet networks to contain other advanced nations.
    To bait new members to walk in they have to have all the trappings: slag, flags, music, culture, past glory... that can take years before it becomes a trusted pipeline for the Western clandestine service to collect vast numbers of unique individuals of interest.
    All the West has is signals intelligence over the internet, the internet has to be free and open to get people feeling hidden enough to reach out or create profiles... start chatting.. then collect it all can work its magic

    An algorithmic framework on the "net" will just alert or shut groups been tracked online and they can return to protected community face to face meetings.
    Does the West have a cadre of trusted informants to cover all people of interest in shifts? It takes a few people per shift to watch just one person.
    Dont let a simple rush to do "algorithms" and block accounts make the totally observable internet stop chatting.

    --
    Domestic spying is now "Benign Information Gathering"
  14. The same way it works IRL by Required+Snark · · Score: 1
    This isn't new, it goes back to before the Viet Nam war.

    It's dead simple: communist/terrorist/anti-American scum are anyone you've already killed. Women, children, infants, farm animals, trees, it makes no difference. As soon as a victim joins the corpse club they are automatically guilty. It's what's happening in the Middle East right now. It never went away.

    This will work the same way. That whole constitutional bullshit about "innocent until found guilty" is obsolete. Based on the "certainty" of the infallible computer, the authorities will use "parallel investigation" to find (i.e. fabricate) evidence to charge you with a crime. Then they lay on criminal counts so severe a conviction means your dead body will still be doing hard time in solitary into the next century. If you plead guilty then you will only do 5 or 10 years, so you will have some life on the outside before you die. Everyone rolls over because the courts are a joke, and innocence will not save you. The game is rigged.

    Any questions? There better not be, or you will end up in a Super Max prison under a different name and Social Security number.

    --
    Why is Snark Required?
  15. An automated version of 1984 is here by golodh · · Score: 1
    It's basically the page rank algorithm but with new proximity measures.

    I think we all agree that this "state of the art" represents an automated version of 1984. The tool is there, it just depends on what you use it for, i.e. what your target population is.

    Proximity measures can be derived from anything on the Internet, and that opens the gates to widespread use (and abuse).

    Take e.g. proximity to known mafia members as a distance measure, and you'll find mafia networks (even though most of their connections are offline. Adding cellphone metadata to the mix will soon cure that).

    Take proximity to e.g. farmer Bundy and his son as your measure, intersect that with gun ownership, affinity to guns, right-wing ideological websites, and anti-government activism and you'll find an interesting pool of homegrown "potential terrorirsts".

    Take proximity to known Jews, phonecalls to Israel, Israeli embassy personnel, Israeli citizens or pro-Israel-interest websites, and (sensitive defense information or political power including House membership), and you've got a pool of potential "Pollards" .

    Take proximity to websites like Salon.com, Bernie Sanders, online articles that bash Tea Party politics, Trump, Cruz, Rubbio, Palin, Koch brothers, NFA etc., and you've identified "militant leftists".

    Take visiting of pro-Islam websites, ability the read Arabic, mosque attendance, having a beard, and owning a gun, and you've got potential Jihadi terrorists.

    Great huh? Fully automated pre-screening of undesirables of all stripes. Possibilities are endless. I'm sure that Sen. McCarthy and mr. Hoover would have approved wholeheartedly.

    Who needs plodding old-fashioned intelligence work and old-fashioned police work now?

    All we need now is for someone to relax the standards of evidence needed to prosecute people for suspicious behaviour and we can really get to work on "terrorists". If they're truly innocent the subsequent legal process will clear them, right?

  16. It's so full of BS anyway - What is terrorism? by aliquis · · Score: 1

    It's still dictatorship and lack of freedom of thought, opinion and expression.

    Why shall we be ruled by our nations and governments? Shouldn't they accept that?

    I hate my government, the traitors, the media, the immigrants.
    I'm open with that.
    I haven't attacked shit.

    If anything the problem isn't that we aren't allowed to speak and that people don't listen, the set the foundation for "terrorism" but what is "terrorism" anyhow?
    Wikipedia: "In a narrower sense, terrorism can be understood to feature a political objective."

    My government is clear that my opinion doesn't matter, they won't listen and nothing will change and they ignore people like me.

    In the end what is the difference? The ruling authority use threat of violence against those who are against it too. All governments are terrorists for their own agenda?

    Are Hamas terrorists? Israel? USA? Al-Assad? ISIS? Iran? France?

    1. Re:It's so full of BS anyway - What is terrorism? by aliquis · · Score: 1

      Enjoy your RED threat assessment. Have fun being followed by the police and pulled over for things like not doing a 1 second stop at stop signs.

      I was born in a free country, I expected it to allow people to be free and I'll do my best to keep it free.

      Sure it's got risks - That sucks.
      And for sure I don't know where the right balance is, and I don't really know how to figure that one out. I don't know the laws of my country and I wish I didn't had to because they shouldn't even exist.

      I'm not American. And I totally respect the American society and I wish you all a free country in line with the constitution not the Marxist communist bullshit.

      / Svensk.

  17. Buttle, by ThatsNotPudding · · Score: 1

    Tuttle... CLOSE ENOUGH.

  18. yeah by sociocapitalist · · Score: 1

    Bullshit.

    The whole point of such algorithms is to determine who the terrorists are which means that if you 'associate' in some way (live near? work near? use a shooting range with? take airplane flying lessons in the same school as? share a clothing store with? visit a website with IS newsletters? ) with one or more terrorists your bits are going to flip.

    --
    blindly antisocialist = antisocial
  19. That algorithm was invented centuries ago by king+neckbeard · · Score: 2

    The algorithm for finding criminals while protecting privacy was disclosed in an ancient process called "getting a warrant."

    --
    This is my signature. There are many like it, but this one is mine.
    1. Re:That algorithm was invented centuries ago by oh_my_080980980 · · Score: 1

      Exactly!

      Of course the computer scientists at University of Pennsylvania who are in bed with the NSA, fail to grasp the fundamental problem. It has very little to do with the data mining they suggest they found a solution for. The simple problem is: THEY ARE COLLECTING DATA FROM EVERYONE! Earth to Potsy, that's the problem. The NSA is collecting information from EVERYONE and storing it regardless of value.

    2. Re:That algorithm was invented centuries ago by david_thornley · · Score: 1

      No, a warrant is (according to the Fourth Amendment) issued by a judge where there is sworn probable cause to believe it's, um, warranted. The police have to find some sort of probable cause before getting the warrant (or lie their asses off, which is also done). This means they have to do other things to find who's likely to have committed a particular crime before getting a warrant.

      --
      "When you have eliminated the unacceptable, whatever is left, however improbable, must be the truthiness" - Holmes
  20. Differential Privacy by whh3 · · Score: 1

    This is referred to as differential privacy by Cynthia Dwork. She's an expert on the techniques used to perform data mining without personally identifiable information about "other" people in the dataset. Here's a video of a fascinating talk she gave that outlines her work:

    https://www.youtube.com/watch?v=vh2xfgfymHk

    --
    remove nospam. to email!
  21. Bullshit by allo · · Score: 1

    Even if it would work flawless, the problem is "the targeted group". For the NSA, the target is the suspected terrorist (one wrong word in a mail), his friends and the friends of his friends. As TARGETs. So even when all others are spared, its still the average number of friends to the power of two.