Slashdot Mirror


'Anonymized' Credit Card Data Not So Anonymous, MIT Study Shows

schwit1 writes Scientists showed they can identify you with more than 90 percent accuracy by looking at just four purchases, three if the price is included — and this is after companies "anonymized" the transaction records, saying they wiped away names and other personal details. The study out of MIT, published Thursday in the journal Science, examined three months of credit card records for 1.1 million people. "We are showing that the privacy we are told that we have isn't real," study co-author Alex "Sandy" Pentland of the Massachusetts Institute of Technology, said in an email.

96 comments

  1. Regular users only by Technician · · Score: 1

    As one who hot tired of high fees, I dropped the use of credit/debit cards. I used a gift card for an online purchase. Nothing annon about it. Has my name and address on the order.

    --
    The truth shall set you free!
    1. Re:Regular users only by olsmeister · · Score: 1

      Not sure what you're talking about. My credit card has no fees, I pay the balance monthly so no interest, and I get 1.25% of everything I spend back in cash. I try to use it whenever I can.

    2. Re:Regular users only by Anonymous Coward · · Score: 1

      Read between the lines, he got into credit card trouble, can't get anymore credit and has probably gone through or going through bankruptcy or has some sort of agreement with the CC companies to pay them back. So he has to use visa gift cards to purchase online items.

      If you work in retail you learn to recognize this behavior. A customer always paying in cash by digging out of their envelope from the bank after they cash their pay check. It's because that's the only way they can budget their money and their credit is so horrible they can't get a credit card anyway.

      Dropping debit cards for this reason makes absolutely no sense at all. They have no fees, if they do, you're with the wrong bank.

    3. Re:Regular users only by Anonymous Coward · · Score: 0

      This is about data which has people's identity removed. It's not about anonymous credit cards or the inability of some people to manage money.

    4. Re:Regular users only by Anonymous Coward · · Score: 0

      Debit cards lets someone drain your account drop with little to no recourse. Cash doesn't have that problem.

      There are a few people that organize their money using envelopes. They have a specific limit and envelope for each category and when they pay for things they take it out of the proper envelope. When there's no more money in that envelope, no more buying of those types of things. I use GNUCash to do something similar, but I almost always pay with a credit card. If you don't count my student loans, I'm well off and have plenty of money.

    5. Re:Regular users only by Anonymous Coward · · Score: 0

      I'd count your student loans. Shameful. Just shameful. I was debt free out of college.

    6. Re:Regular users only by Anonymous Coward · · Score: 0

      > Dropping debit cards for this reason makes absolutely no sense at all. They have no fees, if they do, you're with the wrong bank.

      They also have no protections. All you get is a promise that is not legally binding. And even the promise is no good when it comes to second-order effects, like your landlord charging you a fee for a bounced check. Sure the bank promises to refund their insufficient funds fee if your balance is zeroed by fraud, but they won't even consider paying that landlord's bounced check fee.

    7. Re:Regular users only by jbgroup1 · · Score: 3, Insightful

      If you don't count my student loans, I'm well off and have plenty of money.

      Of course, "Outside of the killings, DC has one of the lowest crime rates in the country"--Marion S. Barry Jr., 1989

    8. Re:Regular users only by ShanghaiBill · · Score: 1, Informative

      As one who got tired of high fees, I dropped the use of credit/debit cards.

      What? Debit cards don't have fees. Credit cards are usually available with no fee, or with benefits (such as airline miles) that more than compensate for the fee. There may be good reasons to not use credit/debit cards, but "high fees" is not one of them.

    9. Re:Regular users only by Anonymous Coward · · Score: 0

      Where do you get those gift cards?
      I want to get some in the Netherlands, but don't know where to get them.

    10. Re:Regular users only by mjwx · · Score: 2

      As one who hot tired of high fees, I dropped the use of credit/debit cards. I used a gift card for an online purchase. Nothing annon about it. Has my name and address on the order.

      Its less about the order itself, more about credit card companies selling the data to advertisers and other dodgy organisations. They claim the data is anonymised (which means they remove names from the orders) but its trivial to de-anonymise the data.

      This is one of the reasons I use cash for most purchases.

      --
      Calling someone a "hater" only means you can not rationally rebut their argument.
    11. Re:Regular users only by mjwx · · Score: 4, Insightful

      Not sure what you're talking about. My credit card has no fees

      It has no fees you know about... And banks want to keep it that way. When you pay for something by credit card, the merchant pays 3% or more for accepting the card. This means they have to pass the cost onto you in the form of higher prices.

      You didn't think the bank gave you free money did you?

      Its Machiavellian in its brilliance, you're robbing yourself of 3% in order to give yourself 1% and you're so enamoured with it, you're trying to do this as much as possible.

      --
      Calling someone a "hater" only means you can not rationally rebut their argument.
    12. Re:Regular users only by Anonymous Coward · · Score: 1

      That 3% covers processing and fraud prevention. If you pay with a check, a guarantee service costs the same. Services cost money. Besides gas stations, I don't know of any retail establishment that will give you a 3% discount for using cash. Why would they, they would rather pocket the difference.

      If I was running a retail establishment, I would encourage credit and debit transactions. It is much easier to deal with plastic than cash, less exposure to risk for the merchant. After working in retail management 25 years ago, there were cash shortages, robbery in store and during night deposits, employee theft, daily accounting, etc... Yet, settling the credit receipts for the day took 10 seconds.

    13. Re:Regular users only by Lunix+Nutcase · · Score: 1

      Even if I were to spend $10000 on a credit card in a year that 3% doesn't even represent a full day's pay. I think I'll live somehow.

    14. Re:Regular users only by Anonymous Coward · · Score: 0

      Especially a visa debit card. It's all I use. When I travel, it automatically does money conversions, it's great. Use a US credit union. Low bank fees, interest bearing checking, etc. Much more forgiving than a normal bank. I had major problems with my finances until I switched to one, then I was finally able to get everything under control. Now I'm all good. I pay for everything, and all my bills, from my visa debit card. No checks. Unless you want a ton of work, you can do either bank card or checks, but not both. It's when I would mix that I would get into trouble. Now, by paying everything with the card, I can stay pretty up to date on my balance by checking it online. Weekends can be a bit hairy, but I've learned to accommodate that.

    15. Re:Regular users only by AthanasiusKircher · · Score: 2

      When you pay for something by credit card, the merchant pays 3% or more for accepting the card. This means they have to pass the cost onto you in the form of higher prices.

      Yes. But if they're like most merchants in the world (with the exception of some gas stations and a random shop here and there), they pass that cost onto YOU too, even if you don't use a credit card.

      Its Machiavellian in its brilliance, you're robbing yourself of 3% in order to give yourself 1% and you're so enamoured with it, you're trying to do this as much as possible.

      Umm, well again if it's like most merchants in the world, you and I pay the same price if I pay by credit card and you pay by cash.

      The difference is that they're "robbing" 2% from me, while they rob 3% from you.

      Thus, I win if I use the card in the current system.

      Convince more merchants to offer cash discounts or convince so many people to stop using cards that most merchants want to charge a fee. Then we can talk about how people are stupid for using credit cards to get rewards. Until you do so, refusing to use a card is just letting the card companies take MORE money from you.

    16. Re:Regular users only by AthanasiusKircher · · Score: 1

      And by the way, for the three places in town I often go to where they actually offer a cash discount, I bring cash.

    17. Re:Regular users only by Anonymous Coward · · Score: 0

      Then they either don't take Visa, or they didn't get called out on it yet. Mastercard allows merchants to have difference prices (Err sorry, discounts) for non-card using customers.... Visa does not, and generally enforces it pretty swiftly.

    18. Re: Regular users only by Anonymous Coward · · Score: 0

      Albert Heijn

    19. Re:Regular users only by Anonymous Coward · · Score: 0

      Debit cards can only drain money if you put all your money on the account connected to the card. Open tons of accounts; it is free anyway.

    20. Re:Regular users only by sjames · · Score: 2

      Actually, many businesses had a credit surcharge for a while. Then the credit cards added a no surcharge clause to the merchant contracts. So they hiked their prices and offered a cash discount. Then the credit cards added a no cash discount clause.

      Yes, services cost money. That's no excuse for hiding how much it costs and forcing it to be paid for by people not using the service (for example, everyone that pays cash).

      Many merchants prefer cash because cash can't be charged back after the fact. For example, if the card turns out to be stolen but not yet reported or the customer files a false claim.

    21. Re:Regular users only by Anonymous Coward · · Score: 0

      Except when fraud happens, the merchant will not receive the money from the credit card company, plus the merchant will have to pay a fine to the credit card company.

    22. Re:Regular users only by Anonymous Coward · · Score: 0

      They do when you overdraft your account, going along with what you said.

    23. Re:Regular users only by Anonymous Coward · · Score: 0

      Banks set a limit on daily debit card spending.

    24. Re:Regular users only by tehcyder · · Score: 1

      There are a few people that organize their money using envelopes. They have a specific limit and envelope for each category and when they pay for things they take it out of the proper envelope. When there's no more money in that envelope, no more buying of those types of things.

      So if your food envelope's empty, you starve, even if the booze and hookers one is still full up?

      Crazy talk.

      --
      To have a right to do a thing is not at all the same as to be right in doing it
    25. Re:Regular users only by Anonymous Coward · · Score: 0

      So long as the company has followed procedures properly, fraud costs are eaten by the credit card company, not the retailer. For chip'n'pin, pin verified transactions will basically never get charged back. Signature verified, if signature matches that on back of card and you checked card number vs number of the strip (so not cloned), you also won't be charged back.

      Speaking as someone who works in retail, cash is a huge pain. Double count for large transactions into the til. Double count on going into safe. Double count on going to the bank. Have to pay security firm to collect cash and take it to the bank. Bank then count it, and charge a fee for deposits.

    26. Re:Regular users only by GrumpySteen · · Score: 1

      you're robbing yourself of 3% in order to give yourself 1% and you're so enamoured with it, you're trying to do this as much as possible.

      As opposed to your plan of not using credit cards while paying the same price and getting nothing back. You are so SMRT.

    27. Re:Regular users only by cygnwolf · · Score: 1

      Only if you can meet their account 'requirements', which in my experience usually include either an extremly high minimum balance or regular monthly direct deposits from your employer. it's a pain in the backside to get an employer to split direct deposits among multiple accounts, and maintaining the high balance kind of defeats the point of trying to keep someone from cleaning you out. Sure, some small banks and credit unions have lower requirements, but then you run in to issues like lack of atm/branch access, no 24 hour telephone support, etc.

      --
      Free Pie! The Pie is Also Evil!
    28. Re:Regular users only by cygnwolf · · Score: 1

      cool idea, but the no checks thing could be a killer... my landlord has this strange refusal to take any kind of card for the rent, they accept check, money order, or cashier's check only. Which means if my bank account is card only, I'm stuck paying the fees for money order or cashier's check every month, and dealing with the hassle of actually having to go somewhere to get one. Still, this is a personal experience thing. I couldn't get by with no checks, but someone who has a more reasonable landlord can.

      --
      Free Pie! The Pie is Also Evil!
    29. Re:Regular users only by Anonymous Coward · · Score: 0

      Merchant charge backs are not that common. Just because you may ask your bank for a charge back does not mean it comes out of the merchant's pocket. As long as the merchant follows all the procedures in their agreement with the bank and there is no fraud on their end, they have no liability and there will be no charge back for them. However, this does not mean that the account holder will be charged. It really depends on the type of dispute and its value amount. Most of the time the financial institution eats the cost. That is where part of that 3% goes.

      If I pay by credit, I should not be force to pay for the insurance the merchant carries to cover cash losses in the event of a robbery. I should not be forced to pay for the armored car service they use. Maybe I can save a few cents on the next loaf of bread. ;)

    30. Re:Regular users only by ruir · · Score: 1

      Well, forget about being anonymous in the Europe Community, this year saw the passing of a law that any Internet sale has to pass a receipt to the buyer. The real idea is to 1) register who the buyer is 2) cross-reference data for sales evasion 3) cross-reference data for expenditures to catch people spending more than they earn

    31. Re:Regular users only by Anonymous Coward · · Score: 0

      You assume that in the absence of these fees, the savings would be passed on to consumers.

    32. Re:Regular users only by Anonymous Coward · · Score: 0

      I pay by credit to make sure returns are easy. Cash is always a hassle with returns. So now, I am not robbing myself. Those who pay in cash and or debit are getting screwed.

    33. Re:Regular users only by Anonymous Coward · · Score: 0

      i suspect you would also have to not count bribery as a crime for that to be true.

    34. Re:Regular users only by Anonymous Coward · · Score: 0

      no 24 hour telephone support

      who calls their bank anyway? i can count on one hand how many times i have ever called my bank, and the majority of those were just to figure out their lobby hours mostly because it is still difficult to do certain types of transactions online.

    35. Re:Regular users only by Anonymous Coward · · Score: 0

      i guess i may not be understanding this fully.

      if it is an internet sale doesn't the merchant typically already have your full address, because much of the time they will need to ship something to you.

    36. Re:Regular users only by Anonymous Coward · · Score: 0

      Superior assholes are annoying.

    37. Re:Regular users only by Anonymous Coward · · Score: 0

      Yeah like it actually ever works out that way instead of the other way around. And of course the food envelope is fair game for booze and hookers.

    38. Re:Regular users only by sjames · · Score: 1

      Actually, whenever they agree to not charge the customer, the bank charges it back. I know small merchants that have had it happen in spite of following all of the procedures as required.

    39. Re: Regular users only by Anonymous Coward · · Score: 0

      While having seen the behavior you describe i am reasonably well off and choose to make almost all purchases with cash for privacy. Several of my freinds do so as well. Hint, we all have a background in the defense\intel world... Some of my online purchases are also done with disposable credit cards, though is more to avoid advertising connections.

    40. Re:Regular users only by StormUP · · Score: 1

      Does it count if my cash comes from my wallet instead of an envelope? Does the retail setting matter? Perhaps I dislike the high % cut that credit card companies take from retailers and philosophically prefer cash. Cash also works when the credit transaction system goes down, though that is rare.

    41. Re:Regular users only by StormUP · · Score: 1

      For this reason I never use my debit card anywhere other than at my bank. I use either cash or a credit card. If credit card info gets stolen it's still a hassle, but not an all my money is gone (until things get resolved or possibly forever) hassle.

    42. Re:Regular users only by thegarbz · · Score: 1

      And yet a merchant who doesn't accept cash is liky to have lower costs at the end of the day.

      You don't think balancing the till, counting the money, trips to the bank, storing and maintaining a float, and dealing with cash in general were "free" and didn't include a whole boat load of inefficiencies for trading did you?

      Its like when I expense things for work I only ever do so on a company credit card because the end of the day I don't need to keep records, I don't need to fill out paperwork, I don't actually need to do anything other than look at a printed statement and click approve.

      3% is a small price to pay.

  2. It's easier to identify women by Anonymous Coward · · Score: 0

    ...but the research couldn't explain why."

    Afraid of the feminazis, Mr. Scientist?

  3. Credit Card and Big Data scarier than NSA by retroworks · · Score: 1

    When NSA collects 'metadata', it's disturbing but also difficult to see how they benefit from corrupt use of the data. But corporate 'big data' just has many ways to make money off of it. Where is the Snowden of Citibank?

    --
    Gently reply
    1. Re:Credit Card and Big Data scarier than NSA by Anonymous Coward · · Score: 0

      Credit card data, banking data, purchases of cars are reported to the treasury, Medical Information Bureau, Credit bureaus, marketing data, magazine subscriptions, tax information, Lexis Nexus, Choicepoint, .....

      Corporate American has been collecting personal data on all of us for decades.

      Their propaganda says they do it for "your convenience" but the truth is for profit.

      Get enough information on the consumer and you can then exploit them. Demand a SS # from a customer and do a credit check. Well, now, they can get credit for so much money - let's steer them to something more expensive.

      Marketing has become so sophisticated and coupled with marketing psychology, selling becomes easy.

    2. Re:Credit Card and Big Data scarier than NSA by Anonymous Coward · · Score: 0

      even assuming that corporate america is entirely evil and without any redeeming qualities whatsoever, i highly doubt they have any idea how to even use that data they collect.

  4. Study by jklovanc · · Score: 2

    Where is the link to the actual study?

    1. Re:Study by cOle2 · · Score: 1

      I believe this is the study in question.

    2. Re:Study by Anonymous Coward · · Score: 4, Informative

      http://www.sciencemag.org/content/347/6221/468.full?intcmp=collection-privacy

      The published article the clickbait was based on has much better information. For instance: the transactions for a person all still shared a unique ID#. "All that remained were the metadata: amounts spent, shop type—restaurant, gym, or grocery store, for example—and a code representing each person."

      If you don't cycle the code per person regularly of course correlation attacks will always work.

  5. "the privacy we are told that we have isn't real." by turkeydance · · Score: 4, Funny

    Staff Sergeant Obvious reporting for duty.

  6. This is like... by Ichijo · · Score: 2

    ...using a fingerprint database to show that cash isn't anonymous.

    --
    Any sufficiently unpopular but cohesive argument is indistinguishable from trolling.
  7. Re:"the privacy we are told that we have isn't rea by Anonymous Coward · · Score: 0

    Staff Sergeant Obvious reporting for duty.

    Not so obvious actually. But then, I never realized before that anonymizing my data just means "replacing 'John Smith' with 'User 12345'". I always thought it was anonymized through aggregation.

  8. I guess i wasn't worried about this by shadowrat · · Score: 1

    When i make purchases with my credit card, i'm not worried about someone knowing it was me, Shadowrat, who made the purchase. When did people claim that you could anonymously buy anything with a credit card? Obviously that's stored in lots of places. I buy something online, the vendor needs to know where to ship it, my credit card company knows who to bill, amazon knows because they are passing the info on.

    What i worry about is someone stealing my number. This Honestly, i don't even worry about that so much anymore since it's happened enough and i've come away completely unharmed, i'm just kind of numb to it.

    1. Re:I guess i wasn't worried about this by Lunix+Nutcase · · Score: 1

      What you are saying has no relationship to the article. The article is talking about the supposed "anonymized" data given to marketers about your purchases. It's about thinking you were hiding yourself from the business you were buying from.

    2. Re:I guess i wasn't worried about this by Lunix+Nutcase · · Score: 1

      It's *not* about, that is.

  9. women easier to ID by porjo · · Score: 1
    From TA

    It's easier to identify women, but the research couldn't explain why, de Montjoye said.

    Could it be that men tend to shop a lot less than women!?

    1. Re:women easier to ID by drinkypoo · · Score: 2

      Could it be that men tend to shop a lot less than women!?

      Men are either more likely to buy what everyone else is buying, or more likely to buy based on logic and not emotion. Or both.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    2. Re:women easier to ID by kogut · · Score: 1

      Could it be that men tend to shop a lot less than women!?

      Or maybe men are identified by the proxy billing info for porn sites.

    3. Re:women easier to ID by Anonymous Coward · · Score: 0

      Right. Because either everyone they know is buying TruckNutz, or it is entirely logical to want nuts on your truck. No emotion to it.

    4. Re:women easier to ID by drinkypoo · · Score: 1

      Right. Because either everyone they know is buying TruckNutz, or it is entirely logical to want nuts on your truck. No emotion to it.

      I strongly suspect that many of those were purchases as snarky "gifts" for men by women as commentary on their lifestyles. Alas, a google search for "truck nuts buyer gender breakdown" did not result in any useful data. Regardless, such a special case can't prove a point. And anyway, I allowed as how there was an alternate and less generous explanation.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    5. Re:women easier to ID by NicBenjamin · · Score: 2

      Best guess?

      The number of women buying unique items (i.e.: that one purse that's so cute) is 4-5 points higher then the number of men doing the same thing, which would mean a given data point is 4-5% more effective if the shopper is female.

    6. Re:women easier to ID by turp182 · · Score: 1

      And somewhere, in a chilly, highly vented room, a Google search server chuckled when presented with the search phrase: "truck nuts buyer gender breakdown".

      --
      BlameBillCosby.com
  10. Why even 3? by Wycliffe · · Score: 2

    The article says it can identify someone in as few as 3 transactions.
    But they aren't really identifying them, they are just showing that no other person hit the same exact set of shops.
    Well, they also mention that they get a datestamp with the transaction so assuming that datestamp has minutes
    or seconds then it should only take 1 transaction or 2 at the most. That being said, you really haven't identified
    this person as you don't know who they are in the real world just that they have a unique shopping pattern as
    everyone does.

    1. Re:Why even 3? by Courageous · · Score: 3, Insightful

      This article isn't scary. What should be scary is that cell companies cell anonymitized _geolocation_ data. That data can be used to deterimine: A) who you are, B) where you live, C) where you work, and D) who your friends are. Step #1. Look where the phone is, regularly at midnight. Step #2, cross reference with public records databases on property ownership. That get's 65% of Americans right there. Now check where it parks every day at noon. Place of work found. And so forth.

    2. Re:Why even 3? by suutar · · Score: 2

      combine the two and now they know that the person who was at shop A at time X, shop B at time Y, and shop C at time Z also appears to live at address Q and work at address R, and there you go: anyone who can get the "anonymized" data knows where you live, and that you just bought not only new living room electronics but also airline tickets.

    3. Re:Why even 3? by Not_Wiggins · · Score: 4, Informative

      The article is misleading. It talks about how it can be used to "identify someone." And with all the talk about privacy, it implies the identification of an individual.

      But, reading through it closely, they aren't talking about identifying a specific someone; the information isn't enough to say Not_Wiggins made these purchases.
      Instead, it focuses on identifying characteristics of purchasers and then extending it to see what other behavior purchasers in those groups would make.

      In the article example, they talked about someone making a purchase at both a bakery and a restaurant within a short time period. Finding that they had one such instance, named him Scott, then looked to see what other behaviors "Scott" had. By extending that logic, they are saying "look at the group of people who typically shop at a bakery and a restaurant... then you know those people are typically also interested in shoes."

      The example is a bit silly, but that's what they're saying.

      They're talking about documenting patterns of behavior on purchasing decisions.
      This article really isn't about loss of anonymity. It is about using anonymized credit card transactions to develop definitions of "user groups" and predicting their shared behavior pattern.

      To me, it seems more like the equivalent of last.fm... tell us what music you like, we'll compare it against what others who also have the same "likes" have said, and give you options for things that might fit your tastes.

      In this instance, it is: tell us what purchases you've made, we'll compare it against similar purchases that others have made, and we can predict what other purchases you might want/like that you haven't made yet.

      --
      Diplomacy is the art of saying, "Nice doggie!" until you can find a rock.
    4. Re:Why even 3? by Anonymous Coward · · Score: 0

      I would be scared if I were running a program that hid the identities of people that testified in courts. You can change someones name and where they live and what they do for a living, but if those people start making purchases of things that remind them of their old life and those transactions are attainable, well...they probably won't be living a new and secret life for long.

    5. Re:Why even 3? by mjwx · · Score: 1

      The article says it can identify someone in as few as 3 transactions.
      But they aren't really identifying them, they are just showing that no other person hit the same exact set of shops.
      Well, they also mention that they get a datestamp with the transaction so assuming that datestamp has minutes
      or seconds then it should only take 1 transaction or 2 at the most. That being said, you really haven't identified
      this person as you don't know who they are in the real world just that they have a unique shopping pattern as
      everyone does.

      Actually its a lot more in depth than that.

      Also consider the class of store you visit, You hit up a hardware store, then an auto supply store and a Micky D's on the way home. They have a reasonable idea what you ordered at McD's from the price and a good line of where you live from the trail of stores you visited.

      Of course the "as little as 2 or 3 stores" is a bit of a misnomer, same as when a teclo advertises "up to 4 mbs", only a few can be that easily identified but realistically they dont need to work with such a small amount of data when they have 100 or 200 transactions within the space of a month it's easy to narrow down where you live from a heat map of where you shop on a regular basis, and of course they have a well established shopping pattern to help "notify you of products and services that you may find useful".

      --
      Calling someone a "hater" only means you can not rationally rebut their argument.
    6. Re:Why even 3? by NicBenjamin · · Score: 3, Informative

      And this only works if you have a lot of other data in your data set. If you don't know who Scot is, then you can't figure out he's the only person who could go to the bakery on that one exact day and that particular restaurant the next.

      I don't think anyone is particularly sanguine about the future of privacy if big companies manage to figure out a way to profit from combining their multiple massive databases. This is particularly true in the US, where it would be virtually impossible to stop the police from using said databases with our warrants. Or worse, using info that the big companies forwarded them as the basis for warrants.

      If Apple or Google can silence one of it's critics by figuring out he was paying a hooker with his supposedly anonymous Mastercard gift card, that is a really fucking bad thing.

    7. Re:Why even 3? by Anonymous Coward · · Score: 1

      I think you missed the point. The point is that there's enough information in the time and place of our transactions that knowing only 3 places/times a person went shopping is sufficient to identify them. This means that if I saw you at the grocery store on Monday night, had dinner with you on Wednesday, and saw your facebook posting about getting gas on your road trip, and I have access to the "anonymized" data, I can reconstruct your complete credit card statement, and so discover your subscription to catporn.com.

    8. Re:Why even 3? by Imrik · · Score: 1

      Should probably use 3am rather than midnight, far fewer people away from home. 10am or 2pm would probably be better than noon, people tend to eat around then and frequently leave their workplace.

    9. Re:Why even 3? by Wycliffe · · Score: 1

      Wow. Someone mod this up. I didn't get this from the summary or the article but this is dead on
      and kindof scary. It would be fairly simple to find 3 transactions for a real life person and then be
      able to cross reference it especially if you could do it over several months. Ypu could possibly
      even trick someone in doing a couple purchases and then just wait for the data.

    10. Re:Why even 3? by Imrik · · Score: 1

      To do that, however, you need both the information on the transactions and the information on the person, which is twice as much as advertized.

    11. Re:Why even 3? by squiggleslash · · Score: 1

      I'm thinking that some organizations don't even need to scour social media for this.

      Amazon, for example, already has my name and three transactions they can use.

      Or am I missing something?

      --
      You are not alone. This is not normal. None of this is normal.
    12. Re:Why even 3? by neurovish · · Score: 1

      combine the two and now they know that the person who was at shop A at time X, shop B at time Y, and shop C at time Z also appears to live at address Q and work at address R, and there you go: anyone who can get the "anonymized" data knows where you live, and that you just bought not only new living room electronics but also airline tickets.

      and then......?
      They send a salesman to your house from shops A, B, and C trying to sell you something?
      How often do you buy a lot of living room electronics, then go on vacation?

    13. Re:Why even 3? by suutar · · Score: 1

      no, then they burgle your house while you're gone.

    14. Re:Why even 3? by banda · · Score: 1

      In my experience the "timestamp" portion of the transaction date coming from merchant banks is pretty useless - often truncated to midnight, frequently transposed into the timezone of a corporate office... it's kind of garbage, and not nearly as useful as you think.

  11. Re:"the privacy we are told that we have isn't rea by ShanghaiBill · · Score: 3, Funny

    I always thought it was anonymized through aggregation.

    Aggregation is not very useful. Much more useful is being able to look for relationships between purchases by the same user. Years ago department stores would have an "accessories" section. Then Wal-Mart crunched their data, and figured out that people don't shop for accessories randomly. They buy a belt when they are buying pants. They buy a necktie when they are buying shirts. So today, the belts are placed by the pants, and the neckties are placed by the shirts. This seems kind of obvious in hindsight, but it took data analysis to make it happen.

    If a woman stops buying condoms and starts buying vitamin supplements, that means you should showing her popup ads for maternity clothes. Nine months later, you can show her a different brand of condom, with ads than emphasize reliability.

  12. Re:"the privacy we are told that we have isn't rea by Livius · · Score: 1

    This isn't actually privacy, and it's sad that people aren't clearer about what is and isn't privacy.

    Though still a bit troubling.

  13. Re:"the privacy we are told that we have isn't rea by LessThanObvious · · Score: 1

    Spafford, who wasn't part of the study, said it makes "one wonder what our expectation of privacy should be anymore."

    Privacy can't be monetized and retailers can't profit from privacy so therefore we know how much privacy we have; it's the small fraction left after they collect everything useful. This will continue this way until we have laws that make data retention and privacy violation such a legal liability hot potato that businesses will be tripping over themselves to delete data and avoid unnecessary collection and retention.

  14. Meaning of "Anonymous" by eepok · · Score: 2

    I don't know about you, but I think it's pretty fair to say that a record without any information directly identifying the subject is "anonymous".

    The ability to complete an analysis of multiple records and data sources thereby reasonable guess (90% accuracy) of who the subject might be is insufficient to remove the title of anonymous.

  15. It is like proving 2 = 3 by 140Mandak262Jamuna · · Score: 1
    A mathematician could easily prove 2 = 3, for large values 2.

    For loose definitions of "identify" they could find sets of credit card transactions that would meet the given "pieces" of information. If Detective Paul Drake is looking for someone who went to a particular restaurant one night and then bought cake from some bakery next day, and Della Street knows the same person paid for toll the same evening, the super duper algorithm will tell Perry Mason all the sets of transactions that would match the given "pieces". But the data sets will not have any name or address attached to it. But still Ham Burger will make a mistake and his star witness will confess on the stand.

    --
    sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
    1. Re:It is like proving 2 = 3 by Anonymous Coward · · Score: 0

      A mathematician could easily prove 2 = 3, for large values 2.

      No mathematician could do such a thing because it's false by definition.
      The JOKE is "1+1=3, for particularly large values of 1". (And it's not true, it's a JOKE.)

  16. Re:"the privacy we are told that we have isn't rea by fustakrakich · · Score: 1

    Damn! You were demoted?

    --
    “He’s not deformed, he’s just drunk!”
  17. Why the heck by crioca · · Score: 1

    Can't these articles link to the journal's entry for the paper. This is of professional interest to me and I'd like to read the abstract at least, maybe even purchase the damn thing.

    1. Re:Why the heck by crioca · · Score: 1
  18. Why do we care about the nickname? by rebelwarlock · · Score: 1

    The fact that someone calls him/her "Sandy" isn't useful information to me since we're not going to hang out and shoot the shit. Trim useless information from the summary.

  19. Re:"the privacy we are told that we have isn't rea by Anonymous Coward · · Score: 1

    ... show her a different brand of condom ...

    I think her child's needs will consume more of the family budget. The adverts will concentrate on selling goods satisfying that need.

    ... with ads than emphasize reliability ...

    Maybe it was a planned pregnancy. Women go from the pill to condoms so they can choose the month of birth, or so they know when they're ovulation cycle is regular.

  20. Can only identify you...if they know who you are! by Overzeetop · · Score: 2

    They did NOT show that, from 3-4 transactions, they could provide your name, address and phone number, or even that if you have 3-4 transactions in a million transaction anonymized data set they can find out anything about you personally *unless they know you first*.

    What they did is show that if they know that you, personally, had 3 to 4 types of transactions on specific dates (you went to a grocery store and a gas station today, and a restaurant yesterday), they could identify which anonymized data set you belong to. Their discovery requires specific outside knowledge not contained in the data.

    This only matters if, say, a third party could identify specific purchases and dates - they could then comb the records and find the rest of your transactions on that specific card. IOW, someone has to be looking for you, and know at least something about you, to even start the search.

    --
    Is it just my observation, or are there way too many stupid people in the world?
  21. What data was left? by houghi · · Score: 1

    I read the article but I did not find what was left. In Belgium (Perhaps Europe) all that remains is the transaction number and the last 4 numbers of the card. The card company will only see the amount and will have no idea what is bought.

    So if the last 4 digits are 1234 (And about 1 in 10000 will scream) they know if I pump gas, take out some cash, eat in a restaurant and buy at a supermarket that they know who I am?

    I would really, really, really try to test that claim.

    I assume some other data has been left.

    --
    Don't fight for your country, if your country does not fight for you.
    1. Re:What data was left? by moeinvt · · Score: 1

      I *think* what they're saying is that if they know it was you who bought the gas, took out the cash and ate at the restaurant, they can figure out that it was also you who went to the supermarket, and identify all of your other purchases for which they have records.

  22. Of course not ... by gstoddart · · Score: 2

    "We are showing that the privacy we are told that we have isn't real"

    Of course it's not bloody real.

    For us to believe this data has been 'anonymized', we have to assume that a) the company is qualified to do what is required to anonymize the data, b) that they actually give a shit, and c) that they bear any penalty if they do a terrible job.

    Entrusting these companies with this data in the first place is the problem. Allowing them to share it all over the place for profit and with no restriction is a terrible idea.

    This is precisely why sane countries have data protection and privacy laws -- because corporations are greedy, self serving entities, who won't give a crap if the collateral damage of their stuff is to damage the privacy of everybody they deal with.

    And this is precisely why all of those analytics companies in web pages are just parasites and not to be trusted.

    --
    Lost at C:>. Found at C.
  23. Not really that disconcerting. by moeinvt · · Score: 1

    From what I can tell, they first need to know the identity of the individual who made those 3 particular purchases. From that, they can link the individual to the entire set of his/her purchases in the "anonymized" CC data.
    I'm very concerned about privacy issues, but this doesn't really surprise or disturb me. It would be quite a coincidence for another person to engage in transactions at the same three places I did and at approximately the same times.

  24. Well, sure by Anonymous Coward · · Score: 0

    Take a similar example; all that thoroughly scrubbed medical history that is sold wholesale these days. You think the 40 year old female with sickle cell and a glioma removal isn't identifiable? For instance.