Slashdot Mirror


Google Researchers Warn of Automated Social Info Sharing

holy_calamity writes "Researchers from Google have written a paper about how social networks can undermine privacy. The most interesting scenario they discuss is 'merging social graphs' — when correlating multiple social networks makes it possible to reveal connections that a person has intentionally kept secret (PDF). For example, it may be possible to work out that a certain LinkedIn user is the same person as a MySpace user, despite their attempting to keep their profiles separate. The Google solution is to develop software that screens new data added to a social network, attempting to find out if it could be fodder to such data mining."

35 of 124 comments (clear)

  1. I Wonder... by ITEric · · Score: 4, Insightful

    how many people will be surprised about Google being the champion of privacy?

    --
    The most exciting phrase to hear in science, the one that heralds new discoveries, is not 'Eureka!' but 'That's funny...
    1. Re:I Wonder... by exley · · Score: 2, Funny

      Yes, they are helping protect privacy by developing software that can... Help undermine privacy.

    2. Re:I Wonder... by ITEric · · Score: 4, Informative

      I know most people here don't, but try reading the actual paper. Clearly they were analyzing the risks and suggesting ways to put privacy back in the hands of the people.

      --
      The most exciting phrase to hear in science, the one that heralds new discoveries, is not 'Eureka!' but 'That's funny...
  2. Hmm... by SlashThat · · Score: 2, Funny

    I thought Google would be the ones undermining privacy in this case...

    --
    1's and 0's should be free.
    1. Re:Hmm... by vux984 · · Score: 2, Interesting

      I thought Google would be the ones undermining privacy in this case...

      How do you think they "discovered" this?

  3. Re:Well. by nine-times · · Score: 4, Insightful

    Also in the "what do you expect?" vein, you're putting lots of personal information on various websites that are publicly available worldwide. What kind of privacy are you expecting?

    Hell, I maintain totally different personas on several sites and in many cases have different lies about my identity on each site, and I can still see how people would put the pieces together.

  4. Aggregation and correlation destroy privacy by Estanislao+Mart�nez · · Score: 5, Insightful

    This is a case where multiple pieces of information, that individually do not compromise one's privacy, can actually do so when aggregated and correlated together.

    This sort of pattern is why something like Google Street View subverts the privacy laws that we have. Yes, a photo taken from a public location of things viewable from that location, by itself, does not violate privacy, and privacy law has been developed so that each individual photo that Google takes and publishes does not, on its own, violate anybody's privacy. What the law fails to capture is that putting a vast number of such photos together, correlating them with a geographical information system, yellow page listings, satellite imagery, internet search results, and offering it to the general public to use for free, without any restrictions of purpose, does massively violate privacy. So the standard response to privacy challenges to Street View ("the law allows you to take photos of any public place you want") just massively misses the point.

    1. Re:Aggregation and correlation destroy privacy by 0100010001010011 · · Score: 3, Informative

      There's already a website that aggregates this information. http://www.123people.com/. A friend showed it to me the other day and It's pretty good in what it picks up, since most of them don't even show up on google top 50 together. There were 1 or 2 pieces of information that were off, but these were obvious.

      For people that had more of a web presence it picked up quite a bit more, even photos tagged of them from other people's photobucket and such.

      Obviously the more unique the name, the better the results too.

  5. don't need fancy data-mining tools for that by xmark · · Score: 2, Insightful

    Just look at any long thread here on Slashdot and you see a pattern of convergence to a disgruntled, socially frightened, and overworked IT/programmer guy named Anonymous Coward.

    On second thought, looks like that applies to most other UIDs here as well. ;-)

    On a more serious level, makes me wonder. If such a tool was used to narrow down a suspect in a crime or malfeasance, would constitutional guarantees against self-incrimination come into play? Could one argue that intentional postings of information pseudo-anonymously are implicitly protected from meta-analytical incrimination?

    Not that I have any thing to, umm, hide, mind you....

    1. Re: don't need fancy data-mining tools for that by nine-times · · Score: 4, Insightful

      If such a tool was used to narrow down a suspect in a crime or malfeasance, would constitutional guarantees against self-incrimination come into play?

      No, the right against incriminating yourself basically amounts to "you have the right to remain silent". The police aren't allowed to punish you for not telling them about your crimes. However, anything you do tell them can (and will) be used against you.

      (IANAL, but I'm pretty sure I'm right)

    2. Re: don't need fancy data-mining tools for that by bill_mcgonigle · · Score: 2, Interesting

      However, anything you do tell them can (and will) be used against you.

      Yes, and only against you.

      (bug #8332 has been filed).

      --
      My God, it's Full of Source!
      OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
  6. Other people may publish information about you by Estanislao+Mart�nez · · Score: 5, Insightful

    Also in the "what do you expect?" vein, you're putting lots of personal information on various websites that are publicly available worldwide. What kind of privacy are you expecting?

    While that is a completely fair thing to point out, there is a very important thing that it misses: other people can put information about you online, without your permission, and that information is just as subject to analysis as what you put up.

    The two best examples that come to mind right away:

    1. Facebook allows users to tag photos with the names of the people who appear in them.
    2. Google Street View puts photos of your residence without asking you for permission, and correlates it to a bunch of other stuff like geographic information, satellite images, yellow page listings, web search results, etc..

    Notice that both of these acts are perfectly legal, and while the second arguably should be regulated and restricted by law (the aggregation, correlation and publication parts, not the picture-taking part), the first one ought not to.

    1. Re:Other people may publish information about you by Creepy+Crawler · · Score: 4, Interesting

      --While that is a completely fair thing to point out, there is a very important thing that it misses: other people can put information about you online, without your permission, and that information is just as subject to analysis as what you put up.

      Absolutely true. However, this information also falls under libel laws in cases which it false and harms the subject.

      ---The two best examples that come to mind right away:
      ---1. Facebook allows users to tag photos with the names of the people who appear in them.

      It may be a stretch to call it this, but posting stories and pictures usually is a form or journalism, regardless the content and methodology of dissemination. Also, photographers need a release whenever the photo is not taken in a public area, or accessible from a public area of a person who does not consent. It is not a crime to fail to get a release, but a nice tort claim.

      ---2. Google Street View puts photos of your residence without asking you for permission, and correlates it to a bunch of other stuff like geographic information, satellite images, yellow page listings, web search results, etc..

      Google, as long as they obey the law in terms of public/private property, they have full legal standing, and shouldn't be regulated. However, they did get in trouble when they went down private roads, with nice posted NO TRESPASSING signs peppered all over, which the GooCam dutifully captured.

      ---Notice that both of these acts are perfectly legal, and while the second arguably should be regulated and restricted by law (the aggregation, correlation and publication parts, not the picture-taking part), the first one ought not to.

      Sounds like Google-Hatred. Somehow we should just make a law for Google, because they spy on us!! Guess what: You have as much access to the same pictures as I do. And if there is any sort of law, it's that I'd like posting on who (specific names/residences) viewed some area in high resolution. You know, watch the watchers. Sous-veilance.

      --
    2. Re:Other people may publish information about you by Estanislao+Mart�nez · · Score: 4, Informative

      However, this information also falls under libel laws in cases which it false and harms the subject.

      It may not have come across, but I was assuming that the information in question was true, and, taken on its own, largely trivial and harmless. I'm not thinking of a case where, for example, somebody says on their blog that you committed incest with your aunt. I'm thinking of cases like, for example, a hypothetical social networking application that allows people to list you as somebody they know, without requiring your authorization. An application like that would be just as much subject to the kind of analysis detailed in the article, which could easily uncover lots of information about people who never volunteered any of it.

      It may be a stretch to call it this, but posting stories and pictures usually is a form or journalism, regardless the content and methodology of dissemination. Also, photographers need a release whenever the photo is not taken in a public area, or accessible from a public area of a person who does not consent. It is not a crime to fail to get a release, but a nice tort claim.

      So, what's really the essential difference between these three acts?

      1. Showing your friends, in person, a printed-out photo of some people at your party, and pointing out which of them is Joe Smith.
      2. Uploading the photo to your Facebook page, and tagging it with Joe Smith's name.
      3. Publishing a hard copy book or newsletter that shows the photo, and label the photo with Joe Smith's name.

      The law was created to make a distinction between informal sharing and dissemination of the first sort, and "publication" of the third sort. You don't need any model release for #1, and the journalism or modeling arguments are largely irrelevant.

      The problem here is that case #2 falls squarely between #1 and #2. Should it be subject to the law for informal sharing, the law for printed publications, or some new, yet-to-be-developed body of law? (And would the court system be able to handle the caseload that treating informal online sharing always as "publication" would imply? People share stuff informally all the time, and they increasingly do it online.)

      Google, as long as they obey the law in terms of public/private property, they have full legal standing, and shouldn't be regulated.

      "As long as Google obeys the current law, they're obeying the current law, and therefore, we shouldn't change the law."

      Um, what? That's a transparently bad argument. This is an argument about whether the law should be changed. Whether Google is following the current law correctly is completely irrelevant. You can't defend againt an argument that the law is wrong by saying that the acts allowed by the law are allowed by the law!

      Guess what: You have as much access to the same pictures as I do.

      Transparently bad argument too. "It's ok if I have access to these photos that I shouldn't have access to, because you also do have access to them."

    3. Re:Other people may publish information about you by TaoPhoenix · · Score: 3, Insightful

      Let's go back even more innocently. A friend wanted to put some random picture of me on their page. I declined, and they were shocked. It's Do It Yourself Blackmail.

      I like to clown around, but it's in a context. If you were there, drink's on me. But I don't need some random person following the 6 degrees of separation asking me about it 2 years later.

      Part of the big shift that has to happen is social. We joke about it, but the Slashdot Karma rating is a prototype. We need to quit penalizing people for facebook party pics and watch for other measures of worth.

      --
      My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
  7. For me... by Anonymous Coward · · Score: 5, Interesting

    I don't have a LinkedIn profile. I had a MySpace, but I killed it. My Facebook account does have political rants and opinions, but I don't have any pictures that could prove embarrassing. Google my name, and you'll find political rants I've posted on Yahoo! Groups and similar.

    However, I currently have something that can't hold a candle to being careful on Facebook: an employer that doesn't care what I do off the clock. As long as I stay out of trouble with the law and notify my employer if I'm moonlighting, all they care about is that I show up for work and do my job. Should I get on-call duties in the future, I'll be held to a written SLA, and as long as I can respond to calls within an allotted time, they don't care what I was doing. I can even apply for other jobs without fear.

    I used to be a K-12 teacher--if you ever think you're being spied on off the job, just be glad you're not a teacher.

    - Twice, I was indirectly threatened with termination for looking for other jobs.

    - The local newspaper photographed a teacher with a group of animal-rights protesters when the circus came to town. Parents started complaining to the principal saying they didn't want their children in that teacher's class.

    - Some students found a picture online of a naked woman who closely resembled--but wasn't--a high school teacher. They took the picture to the school's principal, and the teacher wound up having to defend her teacher certification against revocation--fortunately, she won. I'm not even sure if there's a rule against teachers posing naked, as long as it doesn't directly involve students.

    Granted, should I ever find myself looking for a job, a potential future employer might Google me and decide that I'm too liberal (or too something-else) to work there. But the question here is: do I really want to work somewhere they hold employees' personal, legal, off-the-clock lives under a microscope?

    1. Re:For me... by Anonymous Coward · · Score: 4, Informative

      I've got a friend who used to be a great teacher. She loved her job and her students, did lots of volunteer work for students and parents, and volunteered to coach sports. The principal slept with a student, admitted it, resigned and was arrested.

      When the police were interviewing students about the principal, one student said he slept with my friend. She was arrested and her mug shot went up all around the world. He later said he was kidding. The police still investigated, as well they should, and dismissed charges.

      Problem is, she can't teach anymore. She's easily googleable and you find her mug shot.

      Ah, the web.

    2. Re:For me... by Creepy+Crawler · · Score: 2, Informative

      Then she needs to have them "clean" the image off easily found areas. To not do so could be considered libel, and she already can easily show loss of career.

      She just needs a good lawyer, and considering the case, a pro-bono at that. She could draw blood good.

      --
    3. Re:For me... by Jeian · · Score: 2, Funny

      And yet you're posting as AC. :p

  8. Re:Well. by nine-times · · Score: 3, Insightful

    If you are concerned about your identity, do you think your domain registrar would give it up if somebody claimed that massive spam was coming out of that interface?

    Well sure, but that's a bit of a different issue. That's a question of whether companies that you have private transactions can be trusted to keep your information private. My point was that with social networking sites, you're posting information in a public forum and then expecting privacy-- which doesn't make a lot of sense.

  9. Shout FIRE by retech · · Score: 4, Funny

    Google warning of privacy issues is like an arsonist holding a can of gas and a lit match shouting FIRE in an already burning theater.

  10. So when do we give up on privacy? by Toe,+The · · Score: 4, Insightful

    Since the inception of the web, I have been wondering how much longer privacy could last.

    People who have grown up with the web tell everything about themselves freely on sites like MySpace. I don't know if this is because they are just stupid from youth or if it is a different paradigm than the old folks had.

    But in any event it is clear that privacy is diminishing rapidly. Look at cameras. Everyone carries a camera in their pocket now. Anyone can set up a wifi-connected miniature webcam with very little effort or cost. It's not even very difficult to listen through walls (or especially windows) nor to see at least heat traces through walls. And of course, there are satellites watching everything we do at least outside of walls.

    Then think about things like grocery store cards, credit cards, online accounts... And how many people here use a plethora of Google accounts with the blind faith that a mere slogan (Do No Evil) will somehow protect their privacy? Really?

    Then think about how cheap data storage is and how everything is not only logged but archived. It might not be used today, but it can be accessed ten years from now, or twenty, or fifty. After all, computers of a decade from now will be able to eat petabytes like Tic-Tacs.

    Expecting to maintain an old-school sense of privacy is probably not realistic in this, um, brave new world we live in.

    1. Re:So when do we give up on privacy? by martin-boundary · · Score: 3, Interesting
      Not necessarily. While it is true that once some piece of information is on the internet or on someone's hard drive, then it can be assumed to never go away, and eventually be collated, there is in fact a theoretical way to neutralize the big brother. It's known as spamming, or poisoning the well, or hiding in plain view, and I expect those techniques to actually help privacy in the long run, however crazy that sounds.

      The reason that Google and other companies can mine so much useful information about you is because that information is substantially true. If, for example, the information was 90% useless and totally contradictory, then it would be impossible to reassemble a correct picture about you. If this was common, then people would not usually trust what's written about you or anybody on the web, and you would regain some level of privacy.

      As privacy becomes more of an issue, I fully expect SEO companies and spamming outfits to reposition themselves as privacy protection agencies. There's going to be a lot of money to be made in helping people, and the same bunch of scum who are now crucified for spamming may well be hailed as pioneer heroes in, say, 20 years.

  11. Yah, we know... *they* only just realised this... by Klootzak · · Score: 3, Interesting

    They've ALREADY opened Pandora's box, now they're trying to get some publicity on how they can fix the problem they (at least helped) create.

    I don't hate Google, I think they tend to go "Oh, cool, we can do this!" without thinking through the implications of it... much like other clever people in history have done.

    --
    A Man's ethical behavior should be based effectually on sympathy, education, and social ties -- Albert Einstein
  12. You're missing the point by Estanislao+Mart�nez · · Score: 5, Interesting

    I'm not sure I understand why a picture of your house, taken from a public road, constitutes an invasion of privacy.

    But that's the whole point: the picture of your house, taken from a public road, as one isolated token of information, does not posit much risk to your privacy at all. Tons of individual people in our society own cameras, and take photos in public places that depict other people's property, and everybody agrees that the owners of said property should not have a right in general to prevent others from taking such photos. The privacy laws we have are built to protect individuals' rights in that sort of isolated case.

    The problem is when a corporation starts taking such photos systematically, aggregating them all together and correlating them with other systematic data sets. In that situation, a photo that just happens to contain your house is no longer just that; it's a piece of information that can be used to access many other pieces of information that may allow somebody to infer facts about you that you would rather prefer they couldn't.

    Google Street View is only the start. Just wait for the day when digital cameras commonly include GPS units and automatically tag each photo with a precise location and time, which can then be cross-indexed with a geographical information system like Google Maps. I can imagine it already: your wife carelessly forgets to close the window shades one day when they're changing. A neighbor takes a photo of her naked, and posts it to 4chan. Thousands of folks copy the photo all over the web. The photo has GPS information in the EXIF tags. Creepy /b/tards start stalking your wife. You give a resume to a potential employer with your residential address in it; they look up the address in Google Maps, click on the link to show image search results taken nearby, and are treated to a naked picture of your wife.

    That's an example where the photo in question is probably illegal to take, but other examples may be concocted where the picture, by itself, is fine. The point of the example isn't the photo; it's how the technologies that we have today for associating one item of information to others make it too easy for people to find out more about you than they should be able to.

    To sum up, the privacy laws we have today are laws that were designed to protect people's privacy in yesterday's, pre-computer world. Because of this, they primarily address things like whether somebody had the right to take a given individual photo, and not whether somebody is empowering others to infer facts about you by correlating many individually innocuous items of information.

    1. Re:You're missing the point by nine-times · · Score: 2, Insightful

      Ok, but let's just presume for the sake of argument that no one has a picture of your naked wife, and it's just Google with a picture of the outside of your house, latitude and longitude, and address with those pictures. They can also go down the street and see pictures of your neighbors houses in the same way.

      Is that an invasion of privacy, in and of itself? Maybe I am missing the point, but I don't see how having a GPS location and address mapped to a photo of the outside of my house is supposed to scare me. I'll start complaining when someone gets naked pictures of my wife and posts them on the Internet, whether there's GPS data attached or not.

    2. Re:You're missing the point by Estanislao+Mart�nez · · Score: 3, Insightful

      Ok, but let's just presume for the sake of argument that no one has a picture of your naked wife, and it's just Google with a picture of the outside of your house, latitude and longitude, and address with those pictures. They can also go down the street and see pictures of your neighbors houses in the same way. Is that an invasion of privacy, in and of itself?

      I think you're too hung up on this concept of "invasion," which, in my opinion, is too closely linked to existing privacy law. I think the real questions should be: (a) what is privacy, and (b) how should the law protect people's privacy?

      The way I see it, privacy is your ability to control what information other people can learn about you. Privacy is the law's recognition that people have a right to conceal various kinds of information about themselves from others, either because some people may find such information discreditable, or simply because the information would cause embarassment (though deep down I think the latter is just a special case of the first).

      Of course, privacy has never meant that a person is allowed to conceal any facts about themselves. There has always been information about oneself which the law requires to be a matter of public record (e.g., your criminal record, or what real property you own); and also, one person's right to privacy has to be carefully balanced with other people's right to learn and disseminate facts about others. This stuff falls squarely in the intersection of law, ethics and sociology, and is insanely context-sensitive and subtle. For example, if you see my last name and try to find out whether I'm Mexican or some other Hispanic nationality, it can make a big difference whether you're a member of the Mexican-American Students Association trying to recruit people for your association, or somebody who's evaluating my job application. Basically, there are vague, ethical and legal rules of what information various people should be allowed to consider in which contexts, and what information they should not be allowed to consider.

      But I digress. If privacy is about controlling what information others can learn about you, then we can see privacy in terms of these three components:

      1. What information about you other people can obtain directly;
      2. What information about you people can disseminate to others;
      3. What information about you people can infer from other facts they know.

      The idea of an "invasion" of privacy really boils down to acts that violate the rules for (1), and in some cases (2); somebody invades your privacy when they directly obtain a fact about you that they do not have the right to obtain (or example, when they take a picture of your naked wife inside your house, and give it to somebody else.) My argument is that our privacy laws have been built to deal with cases (1) and (2) there, because, historically, (2) was less of a problem than now (there was no Internet), and (3) wasn't really a problem (there were no computers!). So we need to come up with laws to regulate (3); which doesn't mean to forbid (3), but rather, to strike a balance between people's rights to conceal about themselves and to learn about others.

      So, to answer your question: if an "invasion of privacy" means a violation of rules about (1) or (2), then no, I don't believe that Google is "invading" your privacy by taking and publishing a picture of what anybody who goes by my house can see. The thing that concerns me is that Google and other folks are working very actively on technologies that affect my privacy via route (3), and what kinds of laws we should have in other to protect people's privacy in that regard. Basically, I'm worried in general about cases where many individual pieces of information, licitly obtained and disseminated, allow somebody equipped with newer technology to infer facts about me and use them to make decisions that negatively impact me, in a way that is u

    3. Re:You're missing the point by CraftyJack · · Score: 2, Insightful

      Try it this way: What are the legitimate, non-creepy uses of Google StreetView in a residential neighborhood?

  13. Social networking bigamy by Valacosa · · Score: 5, Funny

    Wow. I hope my Facebook girlfriend doesn't find out about my MySpace girlfriend.

    --
    "Live as if you'll die tomorrow." Ridiculous. You could die later today.
    1. Re:Social networking bigamy by Anonymous Coward · · Score: 2, Funny

      Your WHO?!?! - Your's Truly, ur /. gf

    2. Re:Social networking bigamy by value_added · · Score: 2, Funny

      I hope my Facebook girlfriend doesn't find out about my MySpace girlfriend.

      You'll have bigger worries if your wife finds out about either.

  14. Re:go back to the basics by Ethanol-fueled · · Score: 2, Insightful

    A few years ago, Many of us got our first MySpace/Facebook/Friendster etc. Shortly after that, we started to contact and be contacted by all of our old, obscure buddies from high school and whatnot. We plastered our psychological profile and unflattering pictures all over the place as much as we could, with our thousands of "friends" and our "pimped" profiles.

    Then we got the E-mail and the phone numbers of the people who we actually gave a shit about, then we realized the folly of our obnoxious seizure-inducing profiles and toned 'em down. Some of us even said "fuck MySpace" only to come back with a barebones profile just in case. Many of us relalized that 96% of our "friends" didn't even respond to us and so we deleted them to remove cruft. An additional 2% did contact us only to leave trite "happy hump day" posts on our profiles -- they eventually became the ones leaving ads and phish links after they were (heh)"hacked". The remaining 2% of them are buddies who we still talk to using old-fashioned methods like voice and E-mail.

    Whomever still bites into that "social networking" crap is either an attention-starved teen or a pedophile who wants one.

  15. Re:A solution: by idlemachine · · Score: 2, Insightful

    Don't create accounts for all these social sites in the first place.

    But that's not the issue here. The problem is that other people will continue to create social network accounts and can put information in there that violates my privacy.

    There's no point in protecting my identity strongly online if all it takes is one person to attribute one of my email addresses to my real name in their Facebook contacts.

  16. Re: Obfuscation by TaoPhoenix · · Score: 2, Interesting

    This may be the best common ground. The internet made "leveraging the genius" the thing of the 2nd decade (give the world another year to rest & prepare).

    Now all any organization has to do is locate one of some 10,000 people worldwide with the knack at seeing ultra-patterns. Most of us aren't that interesting to bother with, so a little camoflage to avoid 5 second name searches is usually enough. But what the real point of the Mrs. S. effect is, "if you annoy the collective Net, they'll borrow a pattern expert and pulverize you".

    Web 2.0 and 2.1 are about Sharing.

    Web 3.0 and 3.11 will be about privacy.

    --
    My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
  17. Re:Regulation won't solve this problem. by TaoPhoenix · · Score: 2, Interesting

    All. Everyone will have everyone's information. There's no way it can be removed from everywhere. We learned that if you try to nuke the top ten sites, it just floats around the 100 second tier sites you'll never find.

    All we can do is resign ourselves to it and be boring enough not to really be worth the time.

    --
    My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine