Slashdot Mirror


Schneier's Revised Taxonomy of Social Data

Jamie noted that over at Schneier's blog, he has a worthwhile entry on the data in the social networks. He writes "Lately I've been reading about user security and privacy — control, really — on social networking sites. The issues are hard and the solutions harder, but I'm seeing a lot of confusion in even forming the questions. Social networking sites deal with several different types of user data, and it's essential to separate them."

28 comments

  1. Unfortunately by cosm · · Score: 3, Interesting

    The unfortunate thing is that Schneier's taxonomic breakdown of data is most likely known by the majority here, and the folks who really need that information conveyed to them (ie mom and pop, aunt velma and her pic's of fluffy, partying cheerleader squad, drunk frat, etc..) will probably never see it, and if they did, they wouldn't understand it, or take heed to its importance even if they did.

    Not to mention large social sites are not really transparent with their collection and retention practices in the first place.

    Cynical, yes. Realistic, perhaps.

    --
    'We are trying to prove ourselves wrong as quickly as possible, because only in that way can we find progress.' RPF
    1. Re:Unfortunately by lapsed · · Score: 2, Interesting

      I hadn't heard about it until now, but I'm just one data point. Later in the post, Schneier writes that "there are other ways to look at user data," so it's not clear that his proposed taxonomy is the only way of classifying social networking data. What's weird about it is how it assumes and implies ownership. A user owns the page to which other users post, and as a result, the data posted by those other users is of a different type than the data posted by the page's 'owner'.
      Empirically, these types don't exist. In this sense, it's more of a typology than a taxonomy (in the social sciences, conceptually-derived classification systems are called typologies and empirical classification systems are taxonomies). Control over data -- particularly social networking data -- is, to a much greater degree, a function of the underlying protocols, API's, and SLA's.
      I get that the post is normative -- that Schneier is proposing a means of classifying data that will result in a social networking infrastructure that returns the control over data to its creators. But as you say, that change has to take place without the active participation of Facebook's 5 million indifferent users.

  2. Asymmetric Information Availability by gravos · · Score: 1

    Indeed. The real problem with social data today is how asymmetric the availability is... If everyone's lives were 100% public all the time, it would just become the new norm and we would all adapt and deal with it.

    But that's not the way it is at all: some people don't show up on the web, some people who don't know what they're doing end up with all sorts of permanent self-incriminating data, and those of us who know the score work hard to ensure we look good through every available channel.

    It's this asymmetry that creates unfairness. If I know your dirty secrets but you don't know mine, I have more power than you do.

    It's not so much the fact that all the data is stored that matters in this context (whether companies should be allowed to store the stuff without your permission is a topic for another day), it's that some people get it and manage their reputation and many people don't and are exploited. But life isn't fair, and I don't suppose that will never change.

  3. Selling to third parties by whencanistop · · Score: 3, Interesting

    We often don't mind if a site uses it to target advertisements, but are less sanguine when it sells data to third parties.

    Really this is the problem with the whole privacy thing that has caused so much issue in the past. The problem isn't that the company collects the data, it is that they then sell it to third parties to make a profit.

    Similarly if you look at the in depth report that the WSJ published then the real issue isn't the use of cookies or even the collection of the behavioural data - it is that they have then sold out to third parties by either selling the data or allowing them to collect it in the first place (which they can then do whatever they want with).

    1. Re:Selling to third parties by roqetman · · Score: 1

      The bigger question is that if they paid you when they sold that data, would you mind? - I think it's a question of how much is that data worth.

    2. Re:Selling to third parties by VortexCortex · · Score: 1

      We often don't mind if a site uses it to target advertisements, but are less sanguine when it sells data to third parties.

      The problem isn't that the company collects the data, it is that they then sell it to third parties to make a profit.

      Especially when the sale is made without consulting the data itself; Putting data to work without it's consent is data slavery! Information wants to be Free!

    3. Re:Selling to third parties by Fnord666 · · Score: 3, Insightful

      Really this is the problem with the whole privacy thing that has caused so much issue in the past. The problem isn't that the company collects the data, it is that they then sell it to third parties to make a profit.

      One important thing to keep in mind is that the users of these sites are not the customer, they (or at least the data that they generate) are the merchandise.

      --
      'The tyrant will always find pretext for his tyranny.' - Aesop's Fables
  4. social data vs pr0n? by vlm · · Score: 2, Interesting

    Story about a fuzzy taxonomy of social data gets like 4 posts, whereas a taxonomy of Pr0n would probably have about 900 comments by now.

    --
    "Science flies us to the moon. Religion flies us into buildings." - Victor Stenger
    1. Re:social data vs pr0n? by Genwil · · Score: 1

      That's 'cuz we're only human, Jim.

  5. Interesting, timely by fak3r · · Score: 1

    After the release of all of the publicly available facebook data last week there's a real question of who owns your data, and if it's public does that mean it should be publicly accessible via open APIs for anyone to grok? an immature but developing example is things like phasebook http://github.com/philcryer/phasebook

    1. Re:Interesting, timely by John+Hasler · · Score: 1

      > ...who owns your data...

      No one. Data cannot be owned.

      --
      Warning: this article may contain humor, sarcasm, parody, and perhaps even irony. Read at your own risk.
  6. Re: I'm Just... by TaoPhoenix · · Score: 1

    There's something unnerving there too.

    Hi Data Point #141592653589793238462643383279502884197169399375105820494459!

    --
    My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
  7. When did it stop trickling down? by SomePoorSchmuck · · Score: 4, Insightful

    The funny thing about Facebook is that it is similar to television: You've been given the impression you're the customer, but you're actually the product being sold to the real customers - advertisers.

    Facebook just takes it another step, because you're voluntarily giving them extensive data about who you are as an individual, as well as involuntarily (or unknowingly for most people) letting the technology create profiles of your browsing habits. If that weren't enough, there is the further twist - at least on TV there are professional artists and actors and creative types who are producing the content you enjoy. On Facebook, the content is created by you and your friends and given away for free to the website owners.

    So all they have to do is create a database infrastructure, then you and your friends come along and do almost all the Data Entry, and while you're doing it they're watching you and adding meta-data to their private database, then they can turn around and sell all the aggregate data to their customers. Profit!

    At the peak of the show "Friends", Jennifer Aniston was getting paid a million dollars for pretending to be Rachel whatsherface for each 30 minute episode. The million dollars ultimately came from advertisers who bought airtime from the network.

    We are now self-creating global databases with billions of entries and in return are getting... the ability to "poke" someone from your 10th grade Health class. Meanwhile, how much money is Google, Facebook, BlackPlanet, etc. making from both the ads already on their sites, and the immediate/future revenue from the data we are giving them?

    --

    Hollywood, Television, has become the dream machine. We need to take that back; each of us is a Dream Machine
    1. Re:When did it stop trickling down? by Anonymous Coward · · Score: 0

      There are some people in my 10th grade health class that I'd *love* to poke.

    2. Re:When did it stop trickling down? by Nethead · · Score: 1

      There are some people in my 10th grade health class that I'd *love* to poke.

      Please, please tell me you are not the teacher!

      --
      -- I have a private email server in my basement.
    3. Re:When did it stop trickling down? by Anonymous Coward · · Score: 0

      We derive better emotional satisfaction by watching a "Friends type" show filled with our actual friends: on our newsfeed with photos, status updates and the lot. Tv and Movies gave us what we don't need now since we're making our content with real people and real events, we don't care about not getting paid 1 million dollars. However, as student looking into becoming a professional content producer, Facebook scares me a lot, because it might just make professional content irrelevant.

    4. Re:When did it stop trickling down? by Kikuchi · · Score: 1

      The funny thing about Facebook is that it is similar to television: You've been given the impression you're the customer, but you're actually the product being sold to the real customers - advertisers.

      A (now former) french private television network's CEO, Patrick Le Lay, drew criticisms to himself with his now infamous quote "What we sell to Coca-Cola is available human brain time."

      --
      There's no scientific consensus that life is important.
    5. Re:When did it stop trickling down? by YourExperiment · · Score: 1

      We are now self-creating global databases with billions of entries and in return are getting... the ability to "poke" someone from your 10th grade Health class

      We're getting a hell of a lot more back than that. We're getting the best web search and email services available, free of charge. We're getting a service that allows us to stay in touch with an extended group of friends and acquaintances easily, free of charge.

      And what are we giving away in return? A bunch of inane data about our lives, which allows advertisers to perfectly fine tune and target the adverts which never make it past our ad-blockers anyway.

      I can't pretend that I'm not the least bit concerned about the amount of data these companies are racking up. To a certain extent, I'm just playing devil's advocate to your comment. But these things are always a trade-off. Most of my friends are on Facebook. I'm not going to miss out on communicating with them just in case the next advert I see is for a band I despise a little less than usual.

    6. Re:When did it stop trickling down? by SomePoorSchmuck · · Score: 1

      We are now self-creating global databases with billions of entries and in return are getting... the ability to "poke" someone from your 10th grade Health class

      We're getting a hell of a lot more back than that. We're getting the best web search and email services available, free of charge. We're getting a service that allows us to stay in touch with an extended group of friends and acquaintances easily, free of charge.

      And what are we giving away in return? A bunch of inane data about our lives, which allows advertisers to perfectly fine tune and target the adverts which never make it past our ad-blockers anyway.

      I can't pretend that I'm not the least bit concerned about the amount of data these companies are racking up. To a certain extent, I'm just playing devil's advocate to your comment. But these things are always a trade-off. Most of my friends are on Facebook. I'm not going to miss out on communicating with them just in case the next advert I see is for a band I despise a little less than usual

      I hear you. One day about a year and a half ago I came to the sudden realization that I didn't seem to be reading and replying to personal email at the same level as from 1997-2007. After a couple days spent looking over my mail accounts chronologically, and thinking about the situation, I became convinced that the trend was real, and an explanation presented itself -- nobody is on email anymore because they're all on Facebook, and that's where they do most of their daily communicating. (And at this point in 2010 I would say that I've received less personal electronic mail than I did even under FIDOnet in the early 1990s.)

      Picture sharing, email, IM, games, music... the unified social network website has reached a critical mass at which you will be forced to join if you don't want to be left out.

      But long-term, I think it's a little leap-before-you-look to assume that the future uses of extremely detailed and cross-linked data will never progress beyond targeted banner ads. This also applies to the idea that the value of the data is determined during a one-time transaction and then out of your hands, like you would sell old reading material to a used bookstore. Data isn't an object that can be owned, it is information. That is, it is less like a good traded for money, and more like a collection of intellectual property used and re-used simultaneously by multiple interested parties while maintaining its worth due to being permanently linked to YOU, just like a painting or a performance. Therefore it would be far more economically consistent to, instead of giving up your data once in exchange for some shiny web trinkets, you were to allow it to be re-sold and disseminated broadly with the requirement that commercial use of your data result in royalties regularly deposited in your account.

      I assure you that this is how pharmaceutical companies, market research firms, and industrial product development divisions view their accumulated information -- as property to be vigorously protected and only used by other parties on a fee-license basis.

      Shouldn't we, the consumers, get our piece of the pie?

      --

      Hollywood, Television, has become the dream machine. We need to take that back; each of us is a Dream Machine
  8. not just "selling" by paramour · · Score: 2, Insightful

    The problem isn't that the company collects the data, it is that they then sell it to third parties to make a profit.

    No, the problem isn't limited to selling. Data gets lost or stolen with alarming frequency: someone leaves a laptop on a train, with the data unencrypted; a web site permits SQL injection hacks; an employee walks away with with a flash drive; a National Security Letter arrives. You're lucky if you ever hear of any of these happening.

    So even if the company has the best intentions and never sells or misuses the data, if they do not or can not secure it I'd rather they not have it.

  9. The influence bothers me more than the privacy... by Anonymous Coward · · Score: 1, Interesting

    My girlfriend was browsing facebook on a system without adblock and I glanced over at her screen. It was _filled_ up with bridal advertisements. Get married here! Have this cake! Perfect honeymoon! I logged into the site and saw nothing like that. She fits the advertising profile— no doubt. Age, gender, — in a relationship for many years... etc. Lucky for me she's as disinterested as I am or somewhat more.

    This got me thinking though— what influence does this kind of thing have— certainly some number of unlucky dudes are more likely to get the wedding bell nags as a result of this.

    I called up a friend of ours— same age group, married a year ago. I ask her, "What ads do you get on facebook?" Baby supplies, baby showers, pregnancy blah blah. Will these advertising systems promote population growth?

    It seems to me that what we expose people at large scales can have a considerable influence on the our world... and much more so when the messages are so tightly taylored and focused. We hand that power over to whomever has the most money or whomever can best profit from it. Is this a good thing?

  10. Ghostery add-on for firefox by Anonymous Coward · · Score: 0

    "Ghostery sees the "invisible" web, detecting trackers, web bugs, pixels, and beacons placed on web pages by ad networks, behavioral data providers, web publishers, and other companies interested in your activity."

    https://addons.mozilla.org/en-US/firefox/addon/9609/

    I installed it a week ago and have been becoming increasingly unhappy about the state of the web. About the only site that I visit which does have invisible behavioural tracking beacons (usually several of them) is Wikipedia.

    (E.g. At the moment its reporting that Slashdot embeds a piece of doubleclick code)