Slashdot Mirror


Why Anonymized Data Isn't

Ars has a review of recent research, and a summary of the history, in the field of reidentification — identifying people from anonymized data. Paul Ohm's recent paper is an elaboration of what Ohm terms a central reality of data collection: "Data can either be useful or perfectly anonymous but never both." "...in 2000, [researcher Latanya Sweeney] showed that 87 percent of all Americans could be uniquely identified using only three bits of information: ZIP code, birthdate, and sex. ... For almost every person on earth, there is at least one fact about them stored in a computer database that an adversary could use to blackmail, discriminate against, harass, or steal the identity of him or her. I mean more than mere embarrassment or inconvenience; I mean legally cognizable harm. ... Reidentification science disrupts the privacy policy landscape by undermining the faith that we have placed in anonymization."

4 of 280 comments (clear)

  1. Re:anonymization is bullshit by blahplusplus · · Score: 2, Interesting

    "Private should mean no disclosure, not anonymized disclosure, not aggregate disclosure, just plain no disclosure period."

    The profit motive and privacy are at odds, trying to make the most money and sell the most stuff means you want to know everything about everyone so that you can one up you competitors, it's a race to the bottom. Ideals in the real world always submit to the pragmatic concerns of making money in a capitalist society.

  2. Re:Some perspective please. by causality · · Score: 2, Interesting

    How is this any different than articles about rockets and space travel (after all, most of us will never travel into space, or work for NASA)? Or any other in a myriad of technical subjects that most of us are not, and will not be directly involved in or use directly.

    That's not really a valid question and I'll explain why. The difference is that rockets and space travel are about the actual technology. If the entertainment industry operated that way, then all of the discussion would be about photography/camera work, lighting, audio recording, film editing, and other such techniques. That information is useful to anyone who wonders how these things are done, or who wants to do such things themselves. That's why your question is like comparing an apple to an orange.

    For a real comparison you would have to ask why we don't have paparazzi following scientists around and invading their privacy and digging up dirt on them for public consumption. You'd have to ask why we talk about Firefox the browser and its features instead of the personal lives of the programmers who created it.

    People are curious. They are curious about everything. It's an exercise in futility to pick and chose useful information over non-useful information since none of us knows what tomorrow holds. If someone want's to read celebratory gossip more power to them. In truth, the gossip is more likely to be both true and useful than news about an new process that may produce titanium at half the cost or an article about NASA's next big toy. We on slashdot find the technical news more interesting, normal people who are interested in interpersonal relationships find the gossip more interesting. It's two sides of the same coin.

    Most of the time that you point out that something makes no sense whatsoever, some (possibly well-meaning) apologist will come out of the woodwork and respond with an attempt to portray all possible choices as equally viable. That way everything is just a personal preference with no objective criteria whatsoever. There needs to be something like "Godwin's law" for this. It's often an effective way to halt all useful discussion, and I'm not buying it. It's fine when you are talking about what kind of music you listen to or what kind of food you like to eat. However, it doesn't apply here.

    Knowing that a scientist named John Doe just married his third wife doesn't do a damned thing for anybody. It's useless trivia. No one can take that information and produce a useful product from it. No one can grow as a person or improve their quality of life by learning this. However, knowing that a scientist named John Doe has invented and published a big improvement on how we produce titanium benefits everyone who produces titanium and everyone who purchases products containing it (anything from white paint to rockets). It is not useless trivia. The two types of information are therefore not on equal footing -- one is objectively useful, while the other is not. Ergo, this is not a mere preference or matter of taste, and it's intellectually dishonest to pretend like it is.

    It's just that when someone can sing or dance or act, we pretend like this trivia is somehow profound or meaningful or useful. There's a certain desperation behind that if you look deeply into it. The people who do it don't want to so much as they need to. They have to have some kind of excitement, to make a big deal out of something, because otherwise they must address the emptiness of their own lives. There's nothing wrong with finding interpersonal relationships interesting, because interpersonal relationships are a big and important part of the whole life experience. However, there is something deeply wrong with such a strong interest in interpersonal relationships that don't involve you and in which you cannot participate, particularly when they involve some of the most immature, unenlightened, and superficial people that our society has to offer (i.e Paris Hilton, Brittney Spears).

    --
    It is a miracle that curiosity survives formal education. - Einstein
  3. Re:Duh. by Jah-Wren+Ryel · · Score: 2, Interesting

    An out-of-state fake ID will not necessarily work. There are interstate standards for the content of mag stripes and 2-D barcodes, for example.

    But no where near all states follow those standards. All you gotta do is make a fake-id for one of those states. Even if the state does follow those standards, if you pick a state far enough way you can make up pretty much anything, call it an id card (rather than a driver's license) and the person using the machine will have to make the human decision to accept the id anyway or not. As someone who made such a fake-id for a girl who wanted to appear younger than she was (got tired of the bouncers at the clubs loudly exclaiming "you lookin gooooood for XX years old" and thus informing everyone she was with of her true age) I can say that the card always failed to scan because it was 100% bogus, but the people running the machines always accepted it anyway.

    --
    When information is power, privacy is freedom.
  4. Re:Damn voyeurism is all it is by andy_t_roo · · Score: 2, Interesting

    i think i found a new sig (a bit too long for /. unfortunately):
      "why is it "marketing" when a company helps itself to my information against my will and "piracy" or "industrial espionage" if I helped myself to THEIR zeroes and ones against their will?"