Slashdot Mirror


Data Mining Amazon.com Wish Lists

Dr. Webster writes "In his article "Data Mining 101: Finding Subversives with Amazon Wishlists," Tom Owad of Applefritter outlines a way in which one could build detailed personal profiles of hundreds of thousands of U.S. citizens in a matter of hours. Reading habits, personal tastes and even political party affiliation could be inferred from the results, and through the use of Yahoo! People and Google Maps, one could even map out geographically where people with certain interests or affiliations live, down to their address. Most surprisingly, the process of doing this is completely legal, and doesn't even violate Amazon's Conditions of Use."

8 of 183 comments (clear)

  1. Re:This is not a story. This is not news that matt by Reverend+Darkness · · Score: 5, Insightful

    Actually, if you had read the article, you would have seen that Mr. Owad does not reference "invasion of privacy" at all. What he does do is help people understand how information they share online can be used to create a general profile, and even to link them to others. The point of the article is to educate people. But, like others, you were probably just going on the snippet... I mean, why read the whole story when you can see the headlines via RSS, right?

    --
    ... elipses...
  2. Point of the article by terradyn · · Score: 4, Insightful
    Most of the comments seem to be along the lines of: "What use is it to mine wishlists?" You're missing the point of the article. His main idea is from this section of the article:
    This is what's possible with publicly available information, but imagine if one had access to Amazon's entire database - which still contains every sale dating back to 1999 by the way. Under Section 251 of the Patriot Act, the FBI can require Amazon to turn over its records, without probable cause, for an "authorized investigation . . . to protect against international terrorism or clandestine intelligence activities." Amazon is forbidden to disclose that they have turned over any records, so that you would never know that the government is keeping records of your book purchases. And obviously it is quite simple to crossreference this info with data available in other databases. On a final note, the FBI is now hiring computer scientists to implement a project that sounds very similar to what I just did.
  3. Most subversive anarchists... by CupBeEmpty · · Score: 4, Insightful

    ...that I know freely subscribe to Amazon.com wish lists. They are like "lets overthrow the government that wants to jail us" but they are also all over "Lets let everyone know how we feel about corparations and the government by making wish lists that not only incriminate us but play into the hands of the very corporate droogs we hate... makes sense right." Anyone thinking they will get useful information about truly dangerous groups from Google Maps or Amazon Wish Lists needs to take a breather and sit down for a minute.

  4. Just to point out by TubeSteak · · Score: 4, Insightful
    It is one thing to 'mine' information from Amazon, it is another thing entirely, to mine useful information.

    Even his crude filtering techniques can yield worthwhile leads for police/FBI. He says that the first result for bible is "The Cannabis Grow Bible: The Definitive Guide to Growing Marijuana for Recreational and Medical Use".

    Is it so hard to imagine that a certain fraction of people with that book on their wishlist may either be growing weed, or have it in their possesion? Or that a percentage of people 'wishing' for the Improvised Munitions Handbook (printed by our favorite Uncle Sam @ the DoD) aren't chemists or demolitionists?

    /doesn't have an Amazon wishlist and never will

    --
    [Fuck Beta]
    o0t!
  5. Re:Mining voluntary information on a public websit by Rei · · Score: 5, Funny

    Profile for Jaish al Ashurah ] Wishlist

    Wishlist

    This list is for: Jaish Al Ashurah
    Birthday: None Entered
    Shipping Address: Private
    Unique Facts: A shadu la ilaha illah Allah

    Total items: 10

    "The Anarchist's Cookbook" by William Powell
    "Improvised Explosives: How To Make Your Own" by Seymour Lecker
    "Ultimate Sniper: An Advanced Training Manual For Military And Police Snipers" by John Plaster
    "Crusades Through Arab Eyes" by Amin Maalouf
    "The Protocols of the Meetings of the Learned Elders of Zion With Preface and Explanatory Notes" by Sergius Nilus, Henry Ford, and Victor E. Marsden.
    "Explosive Dusts: Advanced Improvised Explosives" by Seymour Lecker
    "Creative Cloth Doll Making: New Approaches for Using Fibers, Beads, Dyes, And Other Exciting Techniques" by Patti Medaris Culea.
    "The Tragedy of Karbala" by M.A. Naquvi
    "51 Documents: Zionist Collaboration With the Nazis" by Lenni Brenner
    "How to Build a Nuclear Bomb: And Other Weapons Of Mass Destruction" by Frank Barbaby

    --
    South Park pokes fun at sacred cows to make a point. Family guy pokes cows to hear them moo.
  6. well by Nutty_Irishman · · Score: 4, Interesting

    In my county in NC, if you want a party affiliation all you need to do is look it up on the public records website:
    http://www.co.durham.nc.us/common/PublRecordsdB.cf m

    You can also figure out how much someone's house is worth, what they paid in taxes, etc.

    It starts to get a little scary though when your search for public records reveals mortage applications with the individual's SS# listed on the sheet. All available online, and provided for by your very own government!

  7. Re:I see by psykocrime · · Score: 4, Funny

    So THAT'S why I'm on the no-fly list

    No, that's because you ordered those Paladin Press, Delta Press, and IMS catalogs.

    --
    // TODO: Insert Cool Sig
  8. Re:WTF is wrong with you? by typical · · Score: 4, Insightful

    Very simple principle. Lots of data is individually acceptable, but when compiled or processed, is unacceptable.

    For example, say you maintain a Slashdot identity that you don't link to your real name. While no one post of yours may be sufficient to tie your identity to your name, the sum total may be sufficient.

    Or security cameras. Most people don't worry about *one* security camera, but a lot of people get concerned when they are constantly being monitored by cameras which are tied together by computer to monitor where they go each day.

    --
    Any program relying on (nontrivial) preemptive multithreading will be buggy.