AOL Releases Search Logs of 657,427 Users
An anonymous reader writes "AOL has released the search logs of over 650,000 users for research purposes. This looks like it may become a public relations disaster for AOL, as well as a privacy nightmare for the users involved as Michael Arrington of TechCrunch notes: "AOL has released very private data about its users without their permission. While the AOL username has been changed to a random ID number, the ability to analyze all searches by a single user will often lead people to easily determine who the user is, and what they are up to. The data includes personal names, addresses, social security numbers and everything else someone might type into a search box." This is also being covered on The Paradigm Shift and Oh My News."
fantomas adds " Looks like they've just taken it down but it's still available on The Pirate Bay; not sure why but some of the academic researchers are going crazy musing the ethical aspects of letting the world know who's searching for how to kill their wives ..."
Update: 08/07 21:32 GMT by T : amromousa writes "AOL is now apologizing for the release ..., calling it a "screw-up," which they're upset and angry about."
personal names, addresses, social security numbers and everything else someone might type into a search box.
Who in their right mind would type their social security number in a search box, in plain text??? I mean, really???
I got nothin'
Since most people search for their own name, this really isn't very private. I imagine law enforcement may use this to track AOL users. I wonder what the legal implications are...
Company calls data posting a mistake.
Hmm, I wonder if this "sorry" will be enough
A friend of mine downloaded this dataset.
A teacher's credit union employee was searching for sexy underwear, how best to conduct a relationship with a co-worker, and have sex in a pickup.
Just before that, she was searching for cars. And appears to have cancer as well, or lives with someone with cancer. Maybe it's her sick husband.
I wonder if that demonstrates why someone wouldn't want their Google searches or AOL info to make it into the public realm. AOL is obviously a bastion of consumer rights.
Saskboy's blog is good. 9 out of 10 dentists agree.
It occurs to me that it would be pretty difficult to trace back to the user who is doing the searching by knowing what they are searching for. Sure I have Googled myself and have entered my address into Google Maps, Map Quest, etc. But I have Googled about a hundred other people and thousands of addresses. It would be an interesting game of what do all these things have in common for someone to triangulate all this information back to who I am. Granted I have never done a search on my or anyone elses Social Security Number, that's just asking for it.
So, how many wives are either not going to be home tonight, or are going to fix hubby his very favorite dish?
You're probably just trying to be funny, but this could be a real problem. I know I have had some seriously bizarre search historys when doing research on possible articles to write in my lame ass vanity site. They could very easily be taken out of context and used to make me look like a sicko instead of a cynic who wanted some of the bizarre material that non fiction can provide.
Maybe this guy is doing some research on a book. Maybe he's an artist doing some death metal band's cover. Hell, maybe they have a socially retarded CS major for a dorm mate and are trying to freak them out.
It's the ridiculous release of this type of data and the sensationalist warping of these smallest elements that allow our privacy to get train wrecked.
Actually, it IS a geographic location in Norway :)
It is 1.5 hours drive from where I live, and a really beautiful place.
More info here.
Furthermore, I just searched for "End of the world" on google...
Are you a grammar Nazi? I'm trying to improve my English; please correct my errors!
I mistyped my credit card information, so they asked me if I could to to them again over email.
I am always amused by people who are concerned about sending their credit card number over email. Credit card numbers are just plain not secure period. The number is even printed right on your card, and also encoded in a machine-readable format! It's sent through the mail on your bill, it's printed on receipts (although things are getting much better here), there are plenty of easy ways to illicitly get credit card numbers that are much easier than email.
If you're not willing to send a credit card number through email, then you probably just shouldn't have a credit card at all.
paintball
There are a couple of lines in those logs that have supposedly led AOL users to my site. However, I can't verify a single one of those with my own logs. Any site owners out there who were more successful? Any explanation for that phenomenon?
Check out Disturbing Search Requests where people search through their logs for interesting HTTP REFERER links from Google and submit the most disturbing. A common reaction is befuddlement from webmasters when Google returns their site in response to certain queries (such as "sweet as food delicious cheap dog fellatio").
Who hasn't typed "how to kill your wife" into a search box by now anyway? (That was a joke! Hi honey!)
Hmm Didnt the govt ask for just this kind of info from google sometime ago? And now aol just accidentally manages to release the same kind of info?
Back in January, related to the story on how the DoJ demands and gets ISP data, AOL had said that "We did not comply with the request made in the subpoena," spokesman Andrew Weinstein said. "Instead, we gave the Department of Justice a list of aggregate anonymous search terms that did not include results or any personally identifiable information."
AOL- you need to rethink that phrase personally identifiable, because it doesn't seem to mean what you think it means. You're hiding behind one technical definition of PII, without concern about whether or not the results actually have PII. If you're releasing results with personally identifying information, then you cannot say you're not releasing PII. I'd written in January I'd writen "I question this assumption by Yahoo, AOL, etc. that search terms, by themselves, have no privacy considerations because they've been separated from personal info. What if the search itself contains personal information? Are the search companies deleting the timestamps and randomizing the order of the search terms themselves? Because otherwise I could see personal info showing up." Obviously, half a year later, they still think that replacing a name with a number takes away the PII. They need to have a talk with, say, the Census Department, about why the department will withhold data about *groups* of businesses in a region. Grouped data can easily become PII data if you can tease out characteristics. AOL didn't even group the data!
As always, relevant quotes from the best.essay.evar on why privacy is a fundamental human right: "If information that is actually about someone else is wrongly applied to us, if wrong facts make it appear that we've done things we haven't, if perfectly innocent behavior is misinterpreted as suspicious because authorities don't know our reasons or our circumstances, we will be at risk of finding ourselves in trouble in a society where everyone is regarded as a suspect. By the time we clear our names and establish our innocence, we may have suffered irreparable financial or social harm..."
"...agents of the state in Canada cannot order Canada Post to photocopy the address on every envelope we send, nor can they order bookstores to keep a record of every book we buy, let alone of every page of every magazine we leaf through. There is no reason why they should be able to exercise such powers with regard to every e-mail someone sends or every Web site he or she visits."
"I do not see any reason why e-mails should be subject to a lower standard of privacy protection than letters or telephone calls. And I do not see why Internet browsing should be subject to a lower standard of protection than book purchasing or researching in a reference library. Canadians should not be subject to greater state monitoring or scrutiny just because they choose to use new communication technologies."