Slashdot Mirror


Why Anonymized Data Isn't

Ars has a review of recent research, and a summary of the history, in the field of reidentification — identifying people from anonymized data. Paul Ohm's recent paper is an elaboration of what Ohm terms a central reality of data collection: "Data can either be useful or perfectly anonymous but never both." "...in 2000, [researcher Latanya Sweeney] showed that 87 percent of all Americans could be uniquely identified using only three bits of information: ZIP code, birthdate, and sex. ... For almost every person on earth, there is at least one fact about them stored in a computer database that an adversary could use to blackmail, discriminate against, harass, or steal the identity of him or her. I mean more than mere embarrassment or inconvenience; I mean legally cognizable harm. ... Reidentification science disrupts the privacy policy landscape by undermining the faith that we have placed in anonymization."

49 of 280 comments (clear)

  1. Damn voyeurism is all it is by Ethanol-fueled · · Score: 4, Insightful

    For almost every person on earth, there is at least one fact about them stored in a computer database that an adversary could use to blackmail, discriminate against, harass, or steal the identity of him or her. I mean more than mere embarrassment or inconvenience; I mean legally cognizable harm.

    ...And this is the first thing that the author(s) though of regarding data-mining? Okay, but how would this happen? Why go through all the trouble to gather all that data when you could just hire a P.I. or know (or bribe) a law-enforcement official or an ISP employee? It Reminds me of a conversation I had with a guy who bragged that he could get anybody's info because a very good friend of his worked at the DMV. There were a couple semi-profile firings at the State Department because some employees snooped through celebrities' records for no reason other than voyeurism..er..curiosity.

    Those types, the ones with the direct access to the info, are the weakest link. They're only human. "Hey, Bob, there's this guy I really hate. Look up his IP logs and tell me what you see!"

    It all boils down to voyeurism. People would rather bring others down before bring their own lives up. It's the nature of the beast! Pathetic.

    1. Re:Damn voyeurism is all it is by causality · · Score: 5, Insightful

      But the voyeurism slant isn't newsworthy.

      Then how do you explain shows like Entertainment Tonight and all of these magazines and Web sites devoted entirely to completely useless celebrity trivia? Y'know, the ability to obsess over the personal life of someone you have never met and will never personally know, merely because they can sing or act, should be recognized as a pathology. Voyeurism only seems to partly explain it; much of it seems to come from an empty and unsatisfying life that leads to an attempt to live vicariously through some sort of idol which is perceived to be successful, in that sense that "most men lead lives of quiet desperation". However stupid and useless it may be, I can't deny that many do consider it newsworthy and much of "the news" includes such elements.

      --
      It is a miracle that curiosity survives formal education. - Einstein
    2. Re:Damn voyeurism is all it is by mea37 · · Score: 2, Insightful

      Do you mean, you think you could've gotten an individual's medical records in MA for less than $20? Or maybe you can't see why someone would dig up an individual's medical records? (I can think of many... but then my employer was extorted by someone who'd stolen a bunch of medical-related data from them not that long ago.)

      I think I hear a bit of "nobody would go to all that trouble" in your message. If in the early days of WiFi networks I described to you in tedius yet vague terms how to compromise WEP encryption, you probably would've thought the same thing. Today anyone who cares to can break WEP using readily available tools - it's really no bother at all if you're even slightly inclined to do it.

      I've seen companies with contractual and regulatory obligations to protect data privacy make half-gestures to make it look like they're honoring privacy while still engaging in whatever easy-money scheme or shortcut they want. Shedding light on why those half-gestures don't work is a big deal.

    3. Re:Damn voyeurism is all it is by causality · · Score: 2, Insightful

      Except that most people who watch Entertainment Tonight and such aren't "obsessed" with celebrity trivia. Interest =/= obsession.

      Dear AC, perhaps we are using different definitions of "obsession." Here's mine: when something cannot possibly benefit your life in any measurable way whatsoever, and you devote energy to pursuing it anyway, this is something of an obsession. To me, an interest is something different. The RIAA has an interest in strong copyright laws. Why? Because the RIAA is benefitted by strong copyright laws. Therefore, it's not a surprise that the RIAA tries to bring them about. However, it doesn't do a damned thing for me to know that $ACTRESS is thinking of divorcing her husband. I don't benefit from knowing this, therefore I can accurately say that it is not in my interests. Her family and personal friends might have an interest in this, and with good reason, but then they wouldn't need to find out second-hand from a TV show either.

      Think about it this way. If we treated all industries equally, in the sense that all industries were treated just like the entertainment industry, then anytime you bought a car or a computer it would come with a big long list containing the names of all the members of management, designers, and factory workers who produced it as well as the truck drivers who shipped it and the advertisers who marketed it. We would then have TV shows and magazines talking about the personal private lives of those people who produced your cars and computers, whom they marry, how many times they divorce and why, what goes on behind closed doors in their homes, and paparazzi would follow them around and try to get "exclusive" or embarassing photos of them. Additionally, average people who never met any of them would talk about them fondly as though they personally knew them.

      Now if this happened for the automobile or computer industries, and I said it was obsessive behavior, on what grounds would you dispute that? Real question. I'd like to know.

      --
      It is a miracle that curiosity survives formal education. - Einstein
    4. Re:Damn voyeurism is all it is by causality · · Score: 4, Insightful

      Do you mean, you think you could've gotten an individual's medical records in MA for less than $20? Or maybe you can't see why someone would dig up an individual's medical records? (I can think of many... but then my employer was extorted by someone who'd stolen a bunch of medical-related data from them not that long ago.)

      I think I hear a bit of "nobody would go to all that trouble" in your message. If in the early days of WiFi networks I described to you in tedius yet vague terms how to compromise WEP encryption, you probably would've thought the same thing. Today anyone who cares to can break WEP using readily available tools - it's really no bother at all if you're even slightly inclined to do it.

      I've seen companies with contractual and regulatory obligations to protect data privacy make half-gestures to make it look like they're honoring privacy while still engaging in whatever easy-money scheme or shortcut they want. Shedding light on why those half-gestures don't work is a big deal.

      That's the thing that I also think people don't understand. With good reason, I am not satisfied merely that someone probably wouldn't want to abuse my information. I am satisfied only when I know that they cannot do so.

      I think the solution is to have the concept of "intellectual property" work both ways. Obviously your private information has value, otherwise advertisers and other companies wouldn't go to such great lenghts to obtain and use it. The problem is that they obtain it without your consent and without directly compensating you. For example, if I don't actively block web bugs, cookies, HTTP "ping", analytics tools, and other similar attempts, then that data will be gathered whether or not I like it.

      The reason why I actively go out of my way to prevent companies from gathering data on me is simple. No one asked me if I wanted to be data-mined. I refuse to honor agreements in which I did not participate. Why anyone else would do so is a mystery to me.

      So make each individual's private data their personal property. They can set whatever value they like, and if that value is more than a company thinks it is worth, the company is free to decline the sale. Most importantly, any attempt to just take that data will be theft, and anyone who does this can be prosecuted in a criminal court. I mean, think about it: why is it "marketing" when a company helps itself to my information against my will and "piracy" or "industrial espionage" if I helped myself to THEIR zeroes and ones against their will?

      --
      It is a miracle that curiosity survives formal education. - Einstein
    5. Re:Damn voyeurism is all it is by causality · · Score: 2, Insightful

      Dear AC, perhaps we are using different definitions of "obsession." Here's mine: when something cannot possibly benefit your life in any measurable way whatsoever, and you devote energy to pursuing it anyway, this is something of an obsession. Sorry, no; an obsession is just a troubling preoccupation. Benefit has nothing to do with it.

      Arguing semantics is much more useful when you are able to suggest a more suitable word. However, even then it's of little use, because even if I admit that "obsession" was a terrible choice of words, it doesn't do anything to change a single point I have made. Likewise, I can't help but notice you have not addressed any of the points I have made. So, I must conclude that this is the only fault you were able to find with my reasoning, which is pretty good, because it has nothing to do with my reasoning at all.

      --
      It is a miracle that curiosity survives formal education. - Einstein
    6. Re:Damn voyeurism is all it is by andy_t_roo · · Score: 2, Interesting

      i think i found a new sig (a bit too long for /. unfortunately):
        "why is it "marketing" when a company helps itself to my information against my will and "piracy" or "industrial espionage" if I helped myself to THEIR zeroes and ones against their will?"

  2. Paul Ohm? by Yvan256 · · Score: 4, Funny

    Paul Ohm's recent paper is an elaboration of what Ohm terms a central reality of data collection: "Data can either be useful or perfectly anonymous but never both."

    Great, another Ohm's law to learn.

    1. Re:Paul Ohm? by natehoy · · Score: 4, Informative

      Nonsense, it could be a extension of the current Law:

      "In electrical circuits, Ohms' law states that the current through a conductor between two points is directly proportional to the potential difference or voltage across the two points, and inversely proportional to the resistance between them. In data anonymity, the law states that the general usefulness of any set of data that originally contained personally-identifiable information is inversely proportional to the degree of anonymity applied to said data."

      See, on simple law to memorize, and now data analysts learn just a teensy bit about electricity and EEs learn just a teensy bit about data anonymization.

      --
      "This post contains words, known to the State of California to cause thought. Wash brain thoroughly after reading."
    2. Re:Paul Ohm? by 2names · · Score: 2, Funny

      Could you put that in the form of a car analogy so us laymen can understand it please? :)

      --
      "I'm just here to regulate funkiness."
    3. Re:Paul Ohm? by Beardo+the+Bearded · · Score: 4, Informative

      Okay, let's take a road. The speed at which traffic can travel depends on the quality of the surface, gradient, camber, zoning, etc. Let's call this the "road conditions", with a lower number being better roads.

      The number of cars that want to get through that road is a primary unit, which we can refer to as the "volume of traffic".

      The third major criteria is the speed at which the traffic actually flows. This is the "actual flow" of traffic -- in other words, the "influence of other cars" on the traffic congestion.

      In other words:
      volume = influence of traffic * road conditions

      or:
      V = IR

      --

      ---
      ECHELON is a government program to find words like bomb, jihad, plutonium, assassinate, and anarchy.
  3. Duh. by SatanicPuppy · · Score: 3, Informative

    Am I the only one who always gives their birthday as 01/01/1970 and their zip code as 20500?

    I mean, seriously. They don't need to know. Why would I give 'em the right numbers? They're lucky I even allow them to have rough demographic data.

    --
    ad logicam Claiming a proposition is false because it was presented as the conclusion of a fallacious argument.
    1. Re:Duh. by ColdWetDog · · Score: 4, Funny

      I just put "No" under sex. I like to tell the truth. Not sure how it helps on the ID end though.

      --
      Faster! Faster! Faster would be better!
    2. Re:Duh. by garcia · · Score: 3, Insightful

      Am I the only one who always gives their birthday as 01/01/1970 and their zip code as 20500?

      I use 1/1/1979 (it's closer to my real age) and 90210 instead. I get a lot of crosseyed looks and many times the cashier (or whatever human I'm dealing with) will end up entering in a local zip code instead but people are no longer arguing w/me about what I choose to provide them when pressured for information (I always politely reply, "no thanks," when asked for that type of information but will give them false shit when they ask again and whine that they'll be fired).

      Why would I give 'em the right numbers? They're lucky I even allow them to have rough demographic data.

      Because the majority of people have absolutely no problems handing over any and all information they're prompted for up to and including their e-mail address, phone number or even SSN! Because most people don't even blink, those of us that don't feel like it should be anyone's business (like the scanning of IDs at liquor stores or bars to check age--there is a birthdate listed on IDs for a fucking reason people--not that they can scan my rare earth magnet swiped ID anyway) are looked at like assholes when we refuse to provide information that no one really needs anyway.

    3. Re:Duh. by Anonymous Coward · · Score: 4, Funny

      I put "please!" and it doesnt seem to help either.

    4. Re:Duh. by interkin3tic · · Score: 5, Funny

      Yes you are. I always put put 90210. Phone number 867-5309. If anyone tries to find me, they're at least going to have that song stuck in their head and recall with disgust the shows they watched in the early 90's. Hopefully that will demoralize them enough to give up.

    5. Re:Duh. by compro01 · · Score: 3, Funny

      I would think 90210 is a more common choice for zip code. It's probably the most densely populated area on the planet according to dataminers.

      --
      upon the advice of my lawyer, i have no sig at this time
    6. Re:Duh. by syrinx · · Score: 2, Funny

      It identifies you as a Slashdotter...?

      --
      Quidquid latine dictum sit, altum sonatur.
    7. Re:Duh. by mmkkbb · · Score: 2, Informative

      (like the scanning of IDs at liquor stores or bars to check age--there is a birthdate listed on IDs for a fucking reason people--not that they can scan my rare earth magnet swiped ID anyway)

      That's not to check age; that's to check for counterfeits with mismatched mag data, or mismatched 2-D barcode data, or missing UV ink prints, or missing holograms, etc. etc.

      --
      -mkb
    8. Re:Duh. by ACMENEWSLLC · · Score: 2, Funny

      This makes me think of a probably not unique idea. Most places that ask my my phone number are the same places asking over and over again. Radio Shack, Toys-R-Us, and Sears for example. What would be great is to memorize one of their phone numbers from the phone book and always give them that. Perhaps a number from a different store. Let their telemarketers waste time calling their own stores.

    9. Re:Duh. by plague3106 · · Score: 4, Funny

      I once gave a gamestop employee my zip as 12345. He say "its ok if you don't want to give it." My reply was the no, I am from Schenectady, NY.

    10. Re:Duh. by causality · · Score: 3, Funny

      And you wonder why you never get laid when you go to a bar.

      Usually it's better to wait until you leave the bar.

      --
      It is a miracle that curiosity survives formal education. - Einstein
    11. Re:Duh. by RabidMoose · · Score: 3, Funny

      The only bar I go to is the one my parents built in their basement while I was away at college.

      I never pay for drinks, I know the password for the Wi-fi, and it never closes.

      Problem is, the only girl who ever shows up is my sister.

    12. Re:Duh. by causality · · Score: 2

      Your "account" is indexed under your phone number - they are looking it up to know what offers they should let you in on, check to see if you have a store credit card or should have one and of course to build their profile on you.

      They don't care about your phone number other than that it is a unique identifier.

      I have the money, they have the goods, we make an exchange. I like it when it remains that simple. Their mistake is assuming that I want to establish an "account" without first asking me. When it comes to my personal information, everyone is on a need-to-know basis. Almost no one needs to know. If they have an entitlement mentality that prevents them from respecting that, then I have no moral qualms whatsoever about giving them false information.

      --
      It is a miracle that curiosity survives formal education. - Einstein
    13. Re:Duh. by Jah-Wren+Ryel · · Score: 2, Interesting

      An out-of-state fake ID will not necessarily work. There are interstate standards for the content of mag stripes and 2-D barcodes, for example.

      But no where near all states follow those standards. All you gotta do is make a fake-id for one of those states. Even if the state does follow those standards, if you pick a state far enough way you can make up pretty much anything, call it an id card (rather than a driver's license) and the person using the machine will have to make the human decision to accept the id anyway or not. As someone who made such a fake-id for a girl who wanted to appear younger than she was (got tired of the bouncers at the clubs loudly exclaiming "you lookin gooooood for XX years old" and thus informing everyone she was with of her true age) I can say that the card always failed to scan because it was 100% bogus, but the people running the machines always accepted it anyway.

      --
      When information is power, privacy is freedom.
    14. Re:Duh. by Planesdragon · · Score: 3, Funny

      And after that, it's to keep a list of everyone who has entered the bar for the history of it's operation. Much easier to identify "troublemakers" when you have a list of people who like to have fun once in a while.

      You DO know that in many states, a bartender is legally responsible for anything you do while drunk from the moment you take a drink until you're finally sober, right?

  4. I'm perfectly anonymous! by A+beautiful+mind · · Score: 2, Funny

    See!

    -- Anonymous Coward

    --
    It takes a man to suffer ignorance and smile
    Be yourself no matter what they say
  5. Only three bits? by Yvan256 · · Score: 4, Funny

    [researcher Latanya Sweeney] showed that 87 percent of all Americans could be uniquely identified using only three bits of information: ZIP code, birthdate, and sex.

    Holy hell forget about that anonymized data crap, I want to learn how she can compress that much data into three bits!

    1. Re:Only three bits? by steelfood · · Score: 2, Funny

      Why, that's the simple part! You just use very big bits and hope they don't notice!

      --
      "If a nation expects to be ignorant and free in a state of civilization, it expects what never was and never will be."
  6. Mission Impossible by im_thatoneguy · · Score: 5, Insightful

    I've pretty much given up any hope of being anonymous. It's just going to get exponentially more difficult as time goes on.

    I had my credit card stolen once. It was stolen from the CC company. How is a business supposed to entrust me with thousands of dollars in credit if they don't know who I am? How is a credit card company supposed to function without a worldwide network which authorizes transactions.

    If someone wants to find me they'll find me.

    If someone wants to use my identity to frame me for a crime then they're just going to encounter a mountain of evidence from numerous sources which contradict their fabrication.

    "My G1 was on a Starbucks Wifi at the time of the crime. I used my CC to purchase the drink. I received a text from a nearby tower. I posted a comment on breaking news story that is written in my style of writing. I was seen on 8 security cameras walking to the starbucks from my car. I used an automatic toll card 5 miles away from the coffee shop...." Good luck coming up with a large mountain of evidence to put me somewhere else.

    1. Re:Mission Impossible by riqtare · · Score: 2, Insightful

      If access to the evidence you just stated was available to the framer it makes it very easy to find a likely fall guy according to their habits. Makes the alibi of overwhelming evidence evaporate into prime suspicion.
      The best lies are those that are mostly truth.

      --
      42
  7. Bah, humbug. by jdgeorge · · Score: 2, Funny

    Forget anonymity. I'm better off living in a glass house, so it's easier for me to know when I need to yell "Get off my lawn!"

  8. anonymization is bullshit by Anonymous Coward · · Score: 2, Insightful

    Even if the data is completely and unreversably anonymized, it is still invasive. Look at the story yesterday about the marketers data-mining kids' online private conversations for consumer gadget preferences. Even if there's no way from that data to infer the preferences of any particular kid, they should still be able to talk to each other without having their conversation be part of a marketing survey.

    Think also of a cafe that sells two kinds of food: apple pie (eaten by freedom-loving patriots), and felafel (eaten by terrorists and their supporters and sympathizers). Of course it would be invasive for the cafe to disclose which of its customers ordered which kind of food. But even releasing aggragate statistics is bad. An increase in felafel sales can led to a bullshit fbi investigation even if individual customers aren't identified.

    People sitting on private data constantly search for self-searching justifications to disclose as much as they can without getting clobbered by the sources of the data. It is bullshit. Private should mean no disclosure, not anonymized disclosure, not aggregate disclosure, just plain no disclosure period.

    1. Re:anonymization is bullshit by blahplusplus · · Score: 2, Interesting

      "Private should mean no disclosure, not anonymized disclosure, not aggregate disclosure, just plain no disclosure period."

      The profit motive and privacy are at odds, trying to make the most money and sell the most stuff means you want to know everything about everyone so that you can one up you competitors, it's a race to the bottom. Ideals in the real world always submit to the pragmatic concerns of making money in a capitalist society.

  9. Remeber "Mother Earth" and the Espionage Act by RevWaldo · · Score: 2, Informative

    If you ever wonder why people view the privacy of your records in the hand of third parties is important, and don't just hop on the "privacy is dead" bandwagon, this is the sort of scenario they have in mind.

    http://en.wikipedia.org/wiki/Mother_Earth_(magazine)

    Mother Earth was an anarchist journal that described itself as "A Monthly Magazine Devoted to Social Science and Literature," edited by Emma Goldman. Alexander Berkman, another well-known anarchist, was the magazine's editor from 1907 to 1915. It published longer articles on a variety of anarchist topics including the labor movement, education, literature and the arts, state and government control, and women's emancipation, sexual freedom, and was an early supporter of birth control. Its subscribers and supporters formed a virtual "who's who" of the radical left in America in the years prior to 1920.

    In 1917, Mother Earth began to openly call for opposition to American entry into World War I and specifically to disobey government laws on conscription and registration for the military draft. On June 15, 1917, Congress passed the Espionage Act. The law set punishments for acts of interference in foreign policy and espionage. The Act authorized stiff fines and prison terms of up to 20 years for anyone who obstructed the military draft or encouraged "disloyalty" against the U.S. government. After Emma Goldman and Alexander Berkman continued to advocate against conscription, Goldman's offices at Mother Earth were thoroughly searched, and volumes of files and detailed subscription lists from Mother Earth, along with Berkman's journal The Blast, were seized. As a Justice Department news release reported:

    "A wagon load of anarchist records and propaganda material was seized, and included in the lot is what is believed to be a complete registry of anarchy's friends in the United States. A splendidly kept card index was found, which the Federal agents believe will greatly simplify their task of identifying persons mentioned in the various record books and papers. The subscription lists of Mother Earth and The Blast, which contain 10,000 names, were also seized."

    Mother Earth remained in monthly circulation until August 1917.[1] Berkman and Goldman were found guilty of violating the Espionage Act, (imprisoned for two years) and were later deported.

  10. Re:20500 by natehoy · · Score: 5, Insightful

    Because everyone knows that EVERYONE in DC lies.

    --
    "This post contains words, known to the State of California to cause thought. Wash brain thoroughly after reading."
  11. Three things? Really? by Applekid · · Score: 2, Insightful

    So, despite the Birthday Paradox, they can still identify 87% of Americans? For some reason I'm under the impression that there are a lot more zip codes with more than 366 people (heck, even 1000 to call upon 3 or 4 duplicates that should cover gender differences) than there are zip codes under that amount.

    --
    More Twoson than Cupertino
    1. Re:Three things? Really? by Daniel_Staal · · Score: 5, Informative

      That Paradox ignores the year. Add that in and it starts to become harder.

      --
      'Sensible' is a curse word.
    2. Re:Three things? Really? by clone53421 · · Score: 2

      Your birthdate includes the year. Your birthday does not (at least for this discussion).

      The party trick of finding two people with the same birthday (a good probability in any group of 30 people or more) doesn't require them to have the same year of birth (although in most gatherings there's a good chance of this as well since often it's already somewhat segregated by age).

      --
      Alexander Peter Kristopeit bought his basement from his mommy for one dollar.
    3. Re:Three things? Really? by OrigamiMarie · · Score: 2, Informative

      Perhaps they meant zip + 4. Which gets you down to very few households, but most people can't rattle off their zip + 4, so this information wouldn't actually apply to the questions posed by cashiers. On the other hand, I have heard that data mining on web-surfing habits can usually pick up your zip + 4, so yeah, it would be pretty trivial to put that together with birth date (which is asked for a various places to determine that you're of-age -- though of course you can lie) and sex, which can probably be guessed at even if you don't click one of the radio buttons.

  12. Couple of things.. by hansraj · · Score: 5, Insightful

    Potential nitpick, but here goes.

    The summary (not surprisingly for a /. summary) omits a couple of details that give the reader a rather partial picture.

    For one, Paul Ohm is an Assistant Professor of law, and although the summary makes it sounds like the linked article would be from a technical perspective, (mostly) it is not.

    A quote like:

    "Data can either be useful or perfectly anonymous but never both."

    needs a bit of background about the qualification of the person making that claim. Why? Simply because it sounds like a rather technical remark. If some computer science researcher made this claim, I would tend to take it more on the face value, otherwise I would take it with a grain of salt.

    Now obviously this statement was not meant to be taken quite literally because the notion of "useful" is not precise. I can get reasonably useful information like "most of the people in my country like to buy branded stuff" or "most people who rent videos of actor X regularly, also rent the videos of actor Y regularly" without needing the underlying data to contain *any* personally identifiable information. The fact that extra data is store is a different thing.

    I personally believe that instead of claiming that some researcher has argued X, it can be more informative to actually say what kind of researcher it is who made a claim. Not because only researchers in a certain area can be trusted, but because a little bit of background puts the claims in right perspective.

  13. Anonymous can be useful.. by EasyTarget · · Score: 4, Insightful

    Data can either be useful or perfectly anonymous but never both

    What a load of bolaks....

    Supposing you have a list of -just- birth dates for every citizen at the census. You -only- have only been given one piece of data per person, the date, nothing more. Just a huge list of dates, sorted chronologically.
    1) The data has been totally anonymised.
    2) You can do all kinds of meaningful analysis on the age demographics of the population. And make policy decisions based on that.

    Fully anonymous data producing useful results.

    --
    "Oops, I always forget the purpose of competition is to divide people into winners and losers." - Hobbes
  14. Re:At least one fact about them could be used by interkin3tic · · Score: 2, Insightful

    I did think that was an overstatement that undermined the main point. None of my prescriptions would be embarassing to anyone but a holistic medicine believer, I've told some tasteless jokes online. If someone were to send that information to my family along with what porn I looked at, that would be awkward at most. And that's assuming it's credible, which it wouldn't be.

    How exactly would this blackmail work? Bob, the evil co-worker threatens to tell your wife and boss you have had a sex change, a running prescription to anti-psychotic medication, were arrested for something that they don't know about and you weren't legally obligated to inform them of, and look at gay porn aproximately 30% of your waking hours. For this hypothetical situation, assume that information is true. Do you do what he wants? If you don't and he does tell your wife and boss, do they actually believe him?

    I think privacy is good for privacy's sake, overstatements such as this undermine the point.

  15. Some perspective please. by mosb1000 · · Score: 2, Insightful

    How is this any different than articles about rockets and space travel (after all, most of us will never travel into space, or work for NASA)? Or any other in a myriad of technical subjects that most of us are not, and will not be directly involved in or use directly.

    People are curious. They are curious about everything. It's an exercise in futility to pick and chose useful information over non-useful information since none of us knows what tomorrow holds. If someone want's to read celebratory gossip more power to them. In truth, the gossip is more likely to be both true and useful than news about an new process that may produce titanium at half the cost or an article about NASA's next big toy. We on slashdot find the technical news more interesting, normal people who are interested in interpersonal relationships find the gossip more interesting. It's two sides of the same coin.

    1. Re:Some perspective please. by causality · · Score: 2, Interesting

      How is this any different than articles about rockets and space travel (after all, most of us will never travel into space, or work for NASA)? Or any other in a myriad of technical subjects that most of us are not, and will not be directly involved in or use directly.

      That's not really a valid question and I'll explain why. The difference is that rockets and space travel are about the actual technology. If the entertainment industry operated that way, then all of the discussion would be about photography/camera work, lighting, audio recording, film editing, and other such techniques. That information is useful to anyone who wonders how these things are done, or who wants to do such things themselves. That's why your question is like comparing an apple to an orange.

      For a real comparison you would have to ask why we don't have paparazzi following scientists around and invading their privacy and digging up dirt on them for public consumption. You'd have to ask why we talk about Firefox the browser and its features instead of the personal lives of the programmers who created it.

      People are curious. They are curious about everything. It's an exercise in futility to pick and chose useful information over non-useful information since none of us knows what tomorrow holds. If someone want's to read celebratory gossip more power to them. In truth, the gossip is more likely to be both true and useful than news about an new process that may produce titanium at half the cost or an article about NASA's next big toy. We on slashdot find the technical news more interesting, normal people who are interested in interpersonal relationships find the gossip more interesting. It's two sides of the same coin.

      Most of the time that you point out that something makes no sense whatsoever, some (possibly well-meaning) apologist will come out of the woodwork and respond with an attempt to portray all possible choices as equally viable. That way everything is just a personal preference with no objective criteria whatsoever. There needs to be something like "Godwin's law" for this. It's often an effective way to halt all useful discussion, and I'm not buying it. It's fine when you are talking about what kind of music you listen to or what kind of food you like to eat. However, it doesn't apply here.

      Knowing that a scientist named John Doe just married his third wife doesn't do a damned thing for anybody. It's useless trivia. No one can take that information and produce a useful product from it. No one can grow as a person or improve their quality of life by learning this. However, knowing that a scientist named John Doe has invented and published a big improvement on how we produce titanium benefits everyone who produces titanium and everyone who purchases products containing it (anything from white paint to rockets). It is not useless trivia. The two types of information are therefore not on equal footing -- one is objectively useful, while the other is not. Ergo, this is not a mere preference or matter of taste, and it's intellectually dishonest to pretend like it is.

      It's just that when someone can sing or dance or act, we pretend like this trivia is somehow profound or meaningful or useful. There's a certain desperation behind that if you look deeply into it. The people who do it don't want to so much as they need to. They have to have some kind of excitement, to make a big deal out of something, because otherwise they must address the emptiness of their own lives. There's nothing wrong with finding interpersonal relationships interesting, because interpersonal relationships are a big and important part of the whole life experience. However, there is something deeply wrong with such a strong interest in interpersonal relationships that don't involve you and in which you cannot participate, particularly when they involve some of the most immature, unenlightened, and superficial people that our society has to offer (i.e Paris Hilton, Brittney Spears).

      --
      It is a miracle that curiosity survives formal education. - Einstein
  16. Ohm is overwrought by feenberg · · Score: 2, Informative

    I have worked with anonymized government data extensively, and birthdate and zipcode are always considered personally identifiable information. Sometimes birth year is available, and sometimes state or (rarely) county is available, but I have never even heard of a dataset with both. Datasets with month and day of birth are never considered to be anonymized, and are not released. The author of the paper is much overwrought.

  17. Re:The Only Truly Anonymous Data by causality · · Score: 2, Insightful

    Especially if you're outed by a friend who posts RL info without your permission that you can't retract.

    Had this happen to me once.

    That's why you have to be very careful about who your friends are. I am no longer surprised by someone who "suddenly changed" because it is not really sudden at all, it is merely subtle before it becomes bleedin' obvious. Sorry to hear you got screwed.

    And yes, I have trusted people I should not have trusted and gotten screwed. What I learned from it is that I ignored red flags and warning signs that should have tipped me off and so I set myself up for what should have been a predictable outcome. Usually this is because I denied the reality of what that person was telling me about themselves in favor of seeing only what I wanted to see. In that way, the person who meant me harm actually made me sharper, wiser, and more difficult to deceive. That could not have happened if 100% of my reaction was to blame that other person for doing something wrong, because that's for victims. Whether a victim gets screwed or not depends entirely on the other person. Whether I get screwed or not depends entirely on whether I make good decisions.

    Victims have another significant disadvantage: they don't profit from their negative experiences, because when everything is someone else's fault, there is no incentive to examine yourself and find out why you should have known better. Thus, where they should be learning and growing they are stagnant. The popularity of this mentality is really amazing.

    --
    It is a miracle that curiosity survives formal education. - Einstein
  18. Re:The Only Truly Anonymous Data by shentino · · Score: 2, Insightful

    By "outing" I mean the publication of ANY personal information that you'd prefer to keep secret.

    Sexual orientation, which btw doesn't apply in this case to this 24 year old virgin, is only one of many possible "secrets" that can be blown.

    Also, consider if I were an iranian dissident who had hard proof that the elections were rigged? Isn't it foreseeable that, with protesters being deemed to be against Allah by the powers that be, that outing me would put me in grave danger?

    Anonymity is a precious thing that should be respected. Whether it protects your ego or your life.

  19. CT scans by Cajun+Hell · · Score: 2, Insightful

    Have you ever thought about how a "cat" scan works? Forget the 3D aspects and let's just think about how the cross-sectional pictures work.

    Every given reading, is just shooting a ray through the target, and getting a single number out. This is analogous to aggregate summaries are personal details in data. You know the average income of people in zip code 12345, but no specifics. The trick is, later, just as that CT scan is going to shoot a ray through a certain point again from a different direction, your personal details are going to be summarized again by someone else, in a different way.

    A picture will emerge. The CT scan is going to "see" the bone as distinct from the tissue right here at this pixel, and this person's data will be un-summarized. It just takes enough rays, and eventually all ambiguity goes away.

    A long time ago (about 20 years ago, I think?) there was a neato explanation of a cat scan algorithm in Scientific American. I wish I could find it. Because I bet you could show that article to any "database guy" these days, and they'd nod and smile.

    --
    "Believe me!" -- Donald Trump