Slashdot Mirror


'Anonymized' Credit Card Data Not So Anonymous, MIT Study Shows

schwit1 writes Scientists showed they can identify you with more than 90 percent accuracy by looking at just four purchases, three if the price is included — and this is after companies "anonymized" the transaction records, saying they wiped away names and other personal details. The study out of MIT, published Thursday in the journal Science, examined three months of credit card records for 1.1 million people. "We are showing that the privacy we are told that we have isn't real," study co-author Alex "Sandy" Pentland of the Massachusetts Institute of Technology, said in an email.

20 of 96 comments (clear)

  1. Study by jklovanc · · Score: 2

    Where is the link to the actual study?

    1. Re:Study by Anonymous Coward · · Score: 4, Informative

      http://www.sciencemag.org/content/347/6221/468.full?intcmp=collection-privacy

      The published article the clickbait was based on has much better information. For instance: the transactions for a person all still shared a unique ID#. "All that remained were the metadata: amounts spent, shop type—restaurant, gym, or grocery store, for example—and a code representing each person."

      If you don't cycle the code per person regularly of course correlation attacks will always work.

  2. "the privacy we are told that we have isn't real." by turkeydance · · Score: 4, Funny

    Staff Sergeant Obvious reporting for duty.

  3. This is like... by Ichijo · · Score: 2

    ...using a fingerprint database to show that cash isn't anonymous.

    --
    Any sufficiently unpopular but cohesive argument is indistinguishable from trolling.
  4. Re:women easier to ID by drinkypoo · · Score: 2

    Could it be that men tend to shop a lot less than women!?

    Men are either more likely to buy what everyone else is buying, or more likely to buy based on logic and not emotion. Or both.

    --
    "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  5. Why even 3? by Wycliffe · · Score: 2

    The article says it can identify someone in as few as 3 transactions.
    But they aren't really identifying them, they are just showing that no other person hit the same exact set of shops.
    Well, they also mention that they get a datestamp with the transaction so assuming that datestamp has minutes
    or seconds then it should only take 1 transaction or 2 at the most. That being said, you really haven't identified
    this person as you don't know who they are in the real world just that they have a unique shopping pattern as
    everyone does.

    1. Re:Why even 3? by Courageous · · Score: 3, Insightful

      This article isn't scary. What should be scary is that cell companies cell anonymitized _geolocation_ data. That data can be used to deterimine: A) who you are, B) where you live, C) where you work, and D) who your friends are. Step #1. Look where the phone is, regularly at midnight. Step #2, cross reference with public records databases on property ownership. That get's 65% of Americans right there. Now check where it parks every day at noon. Place of work found. And so forth.

    2. Re:Why even 3? by suutar · · Score: 2

      combine the two and now they know that the person who was at shop A at time X, shop B at time Y, and shop C at time Z also appears to live at address Q and work at address R, and there you go: anyone who can get the "anonymized" data knows where you live, and that you just bought not only new living room electronics but also airline tickets.

    3. Re:Why even 3? by Not_Wiggins · · Score: 4, Informative

      The article is misleading. It talks about how it can be used to "identify someone." And with all the talk about privacy, it implies the identification of an individual.

      But, reading through it closely, they aren't talking about identifying a specific someone; the information isn't enough to say Not_Wiggins made these purchases.
      Instead, it focuses on identifying characteristics of purchasers and then extending it to see what other behavior purchasers in those groups would make.

      In the article example, they talked about someone making a purchase at both a bakery and a restaurant within a short time period. Finding that they had one such instance, named him Scott, then looked to see what other behaviors "Scott" had. By extending that logic, they are saying "look at the group of people who typically shop at a bakery and a restaurant... then you know those people are typically also interested in shoes."

      The example is a bit silly, but that's what they're saying.

      They're talking about documenting patterns of behavior on purchasing decisions.
      This article really isn't about loss of anonymity. It is about using anonymized credit card transactions to develop definitions of "user groups" and predicting their shared behavior pattern.

      To me, it seems more like the equivalent of last.fm... tell us what music you like, we'll compare it against what others who also have the same "likes" have said, and give you options for things that might fit your tastes.

      In this instance, it is: tell us what purchases you've made, we'll compare it against similar purchases that others have made, and we can predict what other purchases you might want/like that you haven't made yet.

      --
      Diplomacy is the art of saying, "Nice doggie!" until you can find a rock.
    4. Re:Why even 3? by NicBenjamin · · Score: 3, Informative

      And this only works if you have a lot of other data in your data set. If you don't know who Scot is, then you can't figure out he's the only person who could go to the bakery on that one exact day and that particular restaurant the next.

      I don't think anyone is particularly sanguine about the future of privacy if big companies manage to figure out a way to profit from combining their multiple massive databases. This is particularly true in the US, where it would be virtually impossible to stop the police from using said databases with our warrants. Or worse, using info that the big companies forwarded them as the basis for warrants.

      If Apple or Google can silence one of it's critics by figuring out he was paying a hooker with his supposedly anonymous Mastercard gift card, that is a really fucking bad thing.

  6. Re:Regular users only by jbgroup1 · · Score: 3, Insightful

    If you don't count my student loans, I'm well off and have plenty of money.

    Of course, "Outside of the killings, DC has one of the lowest crime rates in the country"--Marion S. Barry Jr., 1989

  7. Re:"the privacy we are told that we have isn't rea by ShanghaiBill · · Score: 3, Funny

    I always thought it was anonymized through aggregation.

    Aggregation is not very useful. Much more useful is being able to look for relationships between purchases by the same user. Years ago department stores would have an "accessories" section. Then Wal-Mart crunched their data, and figured out that people don't shop for accessories randomly. They buy a belt when they are buying pants. They buy a necktie when they are buying shirts. So today, the belts are placed by the pants, and the neckties are placed by the shirts. This seems kind of obvious in hindsight, but it took data analysis to make it happen.

    If a woman stops buying condoms and starts buying vitamin supplements, that means you should showing her popup ads for maternity clothes. Nine months later, you can show her a different brand of condom, with ads than emphasize reliability.

  8. Re:Regular users only by mjwx · · Score: 2

    As one who hot tired of high fees, I dropped the use of credit/debit cards. I used a gift card for an online purchase. Nothing annon about it. Has my name and address on the order.

    Its less about the order itself, more about credit card companies selling the data to advertisers and other dodgy organisations. They claim the data is anonymised (which means they remove names from the orders) but its trivial to de-anonymise the data.

    This is one of the reasons I use cash for most purchases.

    --
    Calling someone a "hater" only means you can not rationally rebut their argument.
  9. Re:women easier to ID by NicBenjamin · · Score: 2

    Best guess?

    The number of women buying unique items (i.e.: that one purse that's so cute) is 4-5 points higher then the number of men doing the same thing, which would mean a given data point is 4-5% more effective if the shopper is female.

  10. Meaning of "Anonymous" by eepok · · Score: 2

    I don't know about you, but I think it's pretty fair to say that a record without any information directly identifying the subject is "anonymous".

    The ability to complete an analysis of multiple records and data sources thereby reasonable guess (90% accuracy) of who the subject might be is insufficient to remove the title of anonymous.

  11. Re:Regular users only by mjwx · · Score: 4, Insightful

    Not sure what you're talking about. My credit card has no fees

    It has no fees you know about... And banks want to keep it that way. When you pay for something by credit card, the merchant pays 3% or more for accepting the card. This means they have to pass the cost onto you in the form of higher prices.

    You didn't think the bank gave you free money did you?

    Its Machiavellian in its brilliance, you're robbing yourself of 3% in order to give yourself 1% and you're so enamoured with it, you're trying to do this as much as possible.

    --
    Calling someone a "hater" only means you can not rationally rebut their argument.
  12. Re:Regular users only by AthanasiusKircher · · Score: 2

    When you pay for something by credit card, the merchant pays 3% or more for accepting the card. This means they have to pass the cost onto you in the form of higher prices.

    Yes. But if they're like most merchants in the world (with the exception of some gas stations and a random shop here and there), they pass that cost onto YOU too, even if you don't use a credit card.

    Its Machiavellian in its brilliance, you're robbing yourself of 3% in order to give yourself 1% and you're so enamoured with it, you're trying to do this as much as possible.

    Umm, well again if it's like most merchants in the world, you and I pay the same price if I pay by credit card and you pay by cash.

    The difference is that they're "robbing" 2% from me, while they rob 3% from you.

    Thus, I win if I use the card in the current system.

    Convince more merchants to offer cash discounts or convince so many people to stop using cards that most merchants want to charge a fee. Then we can talk about how people are stupid for using credit cards to get rewards. Until you do so, refusing to use a card is just letting the card companies take MORE money from you.

  13. Re:Regular users only by sjames · · Score: 2

    Actually, many businesses had a credit surcharge for a while. Then the credit cards added a no surcharge clause to the merchant contracts. So they hiked their prices and offered a cash discount. Then the credit cards added a no cash discount clause.

    Yes, services cost money. That's no excuse for hiding how much it costs and forcing it to be paid for by people not using the service (for example, everyone that pays cash).

    Many merchants prefer cash because cash can't be charged back after the fact. For example, if the card turns out to be stolen but not yet reported or the customer files a false claim.

  14. Can only identify you...if they know who you are! by Overzeetop · · Score: 2

    They did NOT show that, from 3-4 transactions, they could provide your name, address and phone number, or even that if you have 3-4 transactions in a million transaction anonymized data set they can find out anything about you personally *unless they know you first*.

    What they did is show that if they know that you, personally, had 3 to 4 types of transactions on specific dates (you went to a grocery store and a gas station today, and a restaurant yesterday), they could identify which anonymized data set you belong to. Their discovery requires specific outside knowledge not contained in the data.

    This only matters if, say, a third party could identify specific purchases and dates - they could then comb the records and find the rest of your transactions on that specific card. IOW, someone has to be looking for you, and know at least something about you, to even start the search.

    --
    Is it just my observation, or are there way too many stupid people in the world?
  15. Of course not ... by gstoddart · · Score: 2

    "We are showing that the privacy we are told that we have isn't real"

    Of course it's not bloody real.

    For us to believe this data has been 'anonymized', we have to assume that a) the company is qualified to do what is required to anonymize the data, b) that they actually give a shit, and c) that they bear any penalty if they do a terrible job.

    Entrusting these companies with this data in the first place is the problem. Allowing them to share it all over the place for profit and with no restriction is a terrible idea.

    This is precisely why sane countries have data protection and privacy laws -- because corporations are greedy, self serving entities, who won't give a crap if the collateral damage of their stuff is to damage the privacy of everybody they deal with.

    And this is precisely why all of those analytics companies in web pages are just parasites and not to be trusted.

    --
    Lost at C:>. Found at C.