Slashdot Mirror


When Metadata Analytics Goes Awry

jfruh writes "When blogger Dan Tynan started seeing lots of Latvians in his LinkedIn People You May Know list, it was pretty funny, considering he'd never been to Latvia or ever met anyone from there. But now that shadowy spy agencies are using algorithms similar to LinkedIn's to see if we're terrorists, mistakes like this are a lot scarier. From the article: 'More than ever -- and online in particular -- who you know can be more important than who you are. In fact, who somebody thinks you know may be more important than who you are, especially if that somebody is a faceless government bureaucracy with limitless power to izjaukt savu dzvi (mess up your life).'"

18 of 88 comments (clear)

  1. Who is this Steve Lexus? by 140Mandak262Jamuna · · Score: 4, Funny

    I created a new gmail id to get price quotes from auto dealers. And now Google keeps telling me I might now someone named Steve Lexus and wants me to add him to my circles. Well, at least they seem to have filtered out Jane Honda and Palvayantheeswaran Toyota and Poponopoulous Mitsubishi.

    --
    sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
    1. Re:Who is this Steve Lexus? by Anonymous Coward · · Score: 3, Funny

      I don't have a facebook account, but a few months ago someone mistakenly sent me an invitation to join. I didn't think much and deleted that mail, since I didn't knew the person and couldn't even read it since it was in arabic. Ever since then facebook has been spamming once or twice a week with useless stuff about some arab dudes I never heard of. It's really annoying, I have no fucking idea who these people are, I've never set foot anywhere even remotely close to that part of the world, it's absolutely impossible that my name can be mistaken for an arab name (since I'm european). The email address where I get FB spam is lastname.firstname@gmail.com and is quite old, it was actually one of the first gmail accounts ever (I got invited by a friend that had been one of the initial 1000 gmail beta testers, so probably in the first three days or so); besides it's plastered all over the internet, simply googling it will tell you I'm a mid 30's vegetarian guy that wears contacts and has contributed to a number of open source projects, currently has a samsung galaxy note 2, has a 20 year interest in photography, loves fountain pens and geocaching, owns a ten year old beat up yamaha xjr1300 with a headlight patched with duct tape, rides a bike to work some 8 or 9 months of the year, does volunteer work at an animal shelter, plays a fender p bass through a traynor amp and is presently employed by the security division of a major telco in southern europe. None of this is a secret, in fact all of this appears in the first 100 hits on google, yet facebook seems to think I'm arab because of a single mistaken invitation.

      Their incompetence earned them a free procmail rule.

    2. Re:Who is this Steve Lexus? by jeffmflanagan · · Score: 3, Funny

      Your loss. Jane Honda is really cute.

    3. Re:Who is this Steve Lexus? by TarPitt · · Score: 2

      I set up a Facebook account for my dog. Female, 5 years old so I figured that was 35 years old in human years. Neutered, so obviously single and never married.

      Amazing the number of invites she gets from lesbian singles.

      --
      If your children ever found out how lame you are, they'd murder you in your sleep
  2. Unfortunately true... by wbr1 · · Score: 2
    Facebook and google+ may recommend possible people you may know, based off of degrees of separation, contact lists, etc. Most of the time I do not know any of the people they suggest.

    If the NSA just reverses a similar algorithm, what happens when it says that Mahmoud Ahmadinejad may know me? Especially if I have access to centrifuges.

    Then I have to prove a negative, that I do not know this person. All their evidence points to the opposite. "He was in New York at the same time!" (BUT I LIVE THERE) "Doesn't matter". "Your fathe'rs, cousin's, uncle's former roomate went to Iran as an exchange student", etc, etc.

    --
    Silence is a state of mime.
    1. Re:Unfortunately true... by Chrisq · · Score: 5, Interesting

      Then I have to prove a negative, that I do not know this person. All their evidence points to the opposite. "He was in New York at the same time!" (BUT I LIVE THERE) "Doesn't matter". "Your fathe'rs, cousin's, uncle's former roomate went to Iran as an exchange student", etc, etc.

      That's an excellent point - its the classic example of the Prosecutor's fallacy. By definition anyone found through a meta-data search will have strong evidence against them. If someone was caught independently plotting terrorist activities then it would be valid to say that it would be very unlikely for them to have a lot of connections with known terrorists. Trawl through databases and find someone who has a lot of known connections and it doesn't say a lot. Its like if you have evidence that someone tampered with a lottery and won you could say the chances of winning are one in 14 million (or whatever), but if you look for people who have won then it is not valid to say that they must have cheated as the odds against winning are so low - because someone will through chance!

  3. Latvians, Schmatvians by cold+fjord · · Score: 2, Funny

    Time to worry about the real problems affecting people's lives.

    --
    much of left-wing thought is a kind of playing with fire by people who don't even know that fire is hot - George Orwell
  4. LinkedIn is worse than most by Aguazul2 · · Score: 2

    When FB or Amazon recommends something/someone, I can usually see some sense behind it. LinkedIn is just plain random. I don't know 95% of the people it seems to want to connect me with. It is a joke.

    1. Re:LinkedIn is worse than most by auric_dude · · Score: 2

      Please remember it is you that is the product that Linkedin and other services are advertising and selling so play the game and get linking, liking and friending.

  5. There is a simple cure by macbeth66 · · Score: 4, Insightful

    Stop using social media. Some of the crud I've seen on LinkedIn is as bad as Facebook and I do not want to be associated with it.

    Side Note: If you do use LinkedIn; it is not a dating site. Some of my female colleagues have started complaining about unwanted attention. Just because she met you at that training class last month, and accepted your connection, does not mean she is interested in 'knowing' you. Sheesh.

  6. Analysing Degree of separations creates conformism by Coeurderoy · · Score: 4, Interesting

    Originally when the concept of "degree of separation was invented" the idea was that everybody was connected to everybody through 6 degree of separation.
    At the same time people think that are "the good guy" who does not keep "bad company".

    With social media the length of the separation chain has considerably shrunk.

    Add to this that most people who "do something interesting" (like making really nice flower arrangement for instance) will tend to travel and meet a "much smaller" crowd of people who "move around".

    In this "smaller world" you can make "very short chains" to quite shady people. Actually it is trivial to create a chain from any US politician to "big list of officially evil guy" that at most 4 level deep. (For instance Ex HP Head Carly Fiorina went to KSA and met large HP clients including the heads of SBL managed by the brother of that really bad guy who did get some support from the ex President(s) Bush when he was against the Soviets...
    And now comes the "suspicion creep" if you know Fiorina and one or tow of the Bushes, then you know 2 suspicious characters that are 3 or less level away from Really Suspicious guy.
    So "one" could be ok, but 2 humm very bad...

    So unless you take great pain to avoid anybody that might "be out of the ordinary", you imediatelly are 100% sure to become somehow "in contact" with somebody "suspicious".

    Or seen another way, being not completely boring gets you something like 200 contacts, among which you can expect at least 3 "super connectors" who do not really overlap, particularly if you are travelling, so taking in account diminishing returns it is hard to avoid having less the 3M "level 3 contacts"
    or 1/1000 of all adults in the world

    the probability that less than 2 are "bad guys" is quite low.

    so be boring or be afraid, very afraid...

  7. Oh dear, I am in trouble... by Stolpskott · · Score: 2

    As I used to work for an American company who have an office in Dubai (full of people with Arabic names, and lots of Muslims), a working team in India (very close to Pakistan, never mind the fact that the two countries hate each other almost as much as Chicago Bears and Green Bay Packers fans), and a development/support team in the Phillippines (close to China, with a similar relationship to India and Pakistan, and with their own domestic terrorism issues), clients in sub-Saharan Africa, Russia and Texas, my LinkedIn and Facebook profiles are full of people in those areas.
    Given that the NSA does not stop at analyzing your own contacts, I am apparently a person of interest if one of my contacts has any dubious friends, or if one of my contact's contacts has any dubious friends.
    Kevin Bacon is indeed going to be screwed, we might as well just lock him up and start waterboarding him now, and save the NSA the trouble.

  8. real question by beefoot · · Score: 3, Insightful

    If you truly concern about this problem, the real question to ask is why on earth do you sign up with linkedin (or g+ or facebook).

  9. Re:So how big of a problem is it, really? by cold+fjord · · Score: 2

    The real question isn't who is connected to terrorists, but rather, "Who are the terrorists and their support network?"

    The intelligence agencies are not going to be much interested in accumulating every possible association, but rather in narrowing it to the people of interest for the purpose at hand. If I knew the pilot that flies Prime Minister Cameron and his guests, I could be connected to many of his guests with 2 hops. Lets say one of those guests was Angela Merkel. Anyone that cared to look would realize that I did not in fact know or have any influence with Angela Merkel. It would also become obvious pretty quickly that I don't communicate with her. The "connection" may technically exists or be possible, but as a practical matter it is pointless.

    I will also point out that when traversing a hierarchical organization, 6 hops doesn't necessarily get you to the top.

    --
    much of left-wing thought is a kind of playing with fire by people who don't even know that fire is hot - George Orwell
  10. Well.... Duh! by plopez · · Score: 3, Interesting

    This is what I, and a host of others, having been screaming about for years. People are blindly using analytics and "big data" to make important decisions decisions about health care, insurance, credit ratings, terrorist affiliations, etc. I have encountered so much bad data in my career the thought that it is take as "gospel" makes me sick. Bad data are out there and cleaning up a polluted data stream, when possible, is expensive and takes a long time.

    Then you add in the use of NoSQL databases engines such as MongoDB which are not ACID compliant. You are virtually guaranteeing data will be corrupted. But then again, maybe I "just don't get it". But personally I think contributing to bad data is unethical.

    --
    putting the 'B' in LGBTQ+
  11. Re:Analysing Degree of separations creates conform by Anonymous Coward · · Score: 4, Interesting

    My work requires me to keep up to date with the computer industry. This means I must be connected with the hacker sites and ipso facto it also demands I am 1 degree separated from Mr. Snowden and many others who the US Government takes a dim view of. Get real people, mere contact isn't criminality, it is in the case of the investigator necessity. This is why the whole concept of Probable Cause is such a necessity!

  12. Re:Analysing Degree of separations creates conform by OrugTor · · Score: 3, Insightful

    "You" might "want" to "back off" on the "quotes".

  13. Basic problem with big data by mtm10 · · Score: 2

    I have heard it said, perhaps apocryphally - If you look at the birth and death records for the State of Florida, you will conclude that a majority of people in that state are born Latino and die Jewish. Having reams of data is a start; but you must also have an accurate model.