Slashdot Mirror


The Face of One AOL Searcher Exposed

Juha-Matti Laurio writes "No. 4417749 conducted hundreds of searches over a three-month period on topics ranging from "numb fingers" to "60 single men" to "dog that urinates on everything., report NYT journalists Michael Barbaro and Tom Zeller Jr., but with a permission from Mrs. Thelma Arnold, 62. "Those are my searches," she said, after a reporter read part of the list to her, continues the article."

25 of 315 comments (clear)

  1. Nothing we can do! by mgblst · · Score: 5, Insightful

    Asked about Ms. Arnold, an AOL spokesman, Andrew Weinstein, reiterated the companys position that the data release was a mistake. We apologize specifically to her, he said. There is not a whole lot we can do.
     
    What a load... there is plenty you can do AOL. You can promise not to release this data again, you can actively hunt for it on the web. You can promise to delete your copy. You can promise that you won't keep data like this anymore. You can implement better security policies so that you know where your data is, and what is hapenning with it. You can limit the people who have access to posting stuff on your website.

    Useless bastards!

    1. Re:Nothing we can do! by Rob+T+Firefly · · Score: 4, Insightful

      On behalf of AOL, let me clarify... what they meant to say was "there is not a whole lot we could do that wouldn't interfere with the lucrative data-mining business."

    2. Re:Nothing we can do! by Jafafa+Hots · · Score: 4, Insightful

      and they can pay hundreds of miliions of dollars in damages.

      --
      This space available.
    3. Re:Nothing we can do! by cortana · · Score: 4, Insightful

      Why should they? Consumers have shown time and time again that they don't give a shit about how ethically a corporation acts, only about how cheap their products are. :(

    4. Re:Nothing we can do! by ConceptJunkie · · Score: 4, Insightful

      To be fair, there isn't a whole lot AOL can do about the data that's already been released. In fact, nothing. That genie's out of the bottle, and while it is totally their fault for allowing someone to make such an enormously foolish and potentially dangerous decision, they have stated that they are taking steps so that it won't happen again. Believe me, with so many people looking for an excuse to further bash AOL, they won't dare let this kind of thing continue.

      "Not keeping data like this" doesn't make any sense at all and doesn't accomplish any good for customers. Indeed there is great value in understanding what searches are made and how the search process can be improved. Keeping this kind of data secure is sufficient in my mind. The last two sentences are something I would agree with.

      I just have to wonder who would be stupid enough to not realize the ramifications of doing this. It doesn't take "thorough vetting" to figure out that this would cause a firestorm of bad publicity.

      Of course, the real lesson here is: Don't do anything on the Internet you wouldn't want your mother to find out about. There is no anonymity on the Web. It doesn't take a stupid decision by a large company to prove this.

      --
      You are in a maze of twisty little passages, all alike.
    5. Re:Nothing we can do! by rifter · · Score: 5, Insightful

      The data is out there, what exactly could they do? Erase it from peoples hard drives, remove it from all the pipes that its in, drug everyone who has seen it?

      The fact they have this data is one thing, releasing it to the public is another.

      When it is data that they *care* about, corporations seem able to do plenty. If it's their source code, the code to decss, TimeWarnerAol's labels' mp3 files, the latest incriminating memos/emails ... they are positively rabid about protecting it. Cease and desist orders fall like rain, sites get shut down, people get sued for millions and prosecuted to the fullest extent of the law. But if it's their customers' data, like these searches, their email addresses, their credit card numbers, etc. They just shrug and say "Oh well. What canya do?"

      It's typical, frustrating, and complete bullshit. If the privacy laws were enforced and these corporations were punished for such egregious mishandling of our data maybe then they might think they can do something. But unless it directly affects them, they just are not going to care and will continue to take no precautions.

    6. Re:Nothing we can do! by Bob9113 · · Score: 2, Insightful

      they have stated that they are taking steps so that it won't happen again.

      That is not enough. It is one thing when you get caught kicking a dog to say, "I won't kick the dog again." It is another, and far more noble, thing to say, "I will begin actively campaigning for the ASPCA." There has to be some accountability; not necessarily punishment, but retribution. For example, AOL could take steps to prevent any company from doing this again (promoting corporations to have data privacy built into their customer contract, lobbying for data purge laws, lobbying for privacy rights acts). If they do not, then they have done nothing but say, "We will feign remorse when we get caught." That is not good enough.

    7. Re:Nothing we can do! by Lord_Dweomer · · Score: 2, Insightful
      "On behalf of AOL, let me clarify... what they meant to say was "there is not a whole lot we could do that wouldn't interfere with the lucrative data-mining business.""

      More like what they meant to say was, "there is not a whole lot we can do right now because our lawyers are threatening to castrate us if we say anything else to people who have been affected by this".

      --
      Buy Steampunk Clothing Online!
  2. 1 down, 24.9999 million to go... by kafka47 · · Score: 4, Insightful
    What about the one we really need to know?? User 17556639!!!

    /K

    1. Re:1 down, 24.9999 million to go... by Billosaur · · Score: 2, Insightful

      Well, you can look at it one of two ways: User 17556639 is a diseased member of society or User 17556639 is a coroner doing research. Which is it? How do you decide based on just search information? And what does "steak and cheese" suggest?

      Yes, AOL releasing this information was the longest in a series of boneheaded decisions, and when it finally dies, no one will mourn its passing. However, unless you're a card-carrying member of the tin hat brigade, there's not much to fear. Yes, someone can potentially trace you, as they did with this lady, but the fact is so many searches are non-descript that you'd have a hell of a time tracking someone down. Look at User 17556639's searches -- what in that string of searches tells you who this person is, where they live, what they do, etc. If you're smart, you keep your searches general. It's not so much the searches you have to watch out for anyway, it's the links in those searches you click on that can give you away.

      --
      GetOuttaMySpace - The Anti-Social Network
    2. Re:1 down, 24.9999 million to go... by scribblej · · Score: 4, Insightful

      Your comment is marked "insightful"

      That is sad. "Funny" sure. But "Insightful?"

      Here's the person's searches in question:

      17556639 how to kill your wife
        17556639 how to kill your wife
        17556639 wife killer
        17556639 how to kill a wife
        17556639 poop
        17556639 dead people
        17556639 pictures of dead people
        17556639 killed people
        17556639 dead pictures
        17556639 dead pictures
        17556639 dead pictures
        17556639 murder photo
        17556639 steak and cheese
        17556639 photo of death
        17556639 photo of death
        17556639 death
        17556639 dead people photos
        17556639 photo of dead people
        17556639 www.murderdpeople.com
        17556639 decapatated photos
        17556639 decapatated photos
        17556639 car crashes3
        17556639 car crashes3
        17556639 car crash photo

      If you want this person investigated, you are worse than the "thought police." First off, it's clear (to me, at least) that this guy isn't thinking about killing anyone. He just wants to see some gory photos. "steakandcheese" is a site like rotten.com. Even if he is thinking about killing someone, that's OK. There's a comment further down on the site you linked to that I find to be "insightful" about an old twilight zone episode. The main character could read minds and he reads the mind of a bank security guard who is thinking about robbing the bank! He has the man investigated, but nothing comes out of it. In the end, the guard admits he was thinking about robbing the bank... in fact he's thought about it almost every day. It's just a fantasy he has to make the day go faster... not something he'd ever act on.

      And having been a regular visitor to rotten.com in the past myself, I know that just wanting to see some of the reality of death that we tend to keep hidden in American society is not a crime. It's not even thinking of a crime. It's perfectly natural and healthy curiosity. Neither is daydreaming about terrible things you would never do -- or want to have happen -- in real life. Fantasy is normal and healthy.

      In fact, if you've never been to rotten.com or a similar site, I'd recommend you go sometime.

    3. Re:1 down, 24.9999 million to go... by hackstraw · · Score: 3, Insightful

      What about the one we really need to know?? User 17556639!!!

      Hello, I'm user 17556639, and I'm a crime novelist.

      Actually, I'm not but it is simply not up to AOL or the government or anybody to snoop into my business without probable cause. And probable cause is limited to the government, the rest stay the fuck out of my business.

      Anything taken out of context can look completely different, and it simply is NOT the duty of a citizen to chronically prove their innocence.

      A) Its sometimes impossible to prove that I was home alone asleep.

      B) I'm innocent until proven guilty. Even after being charged and possibly jailed until my court time.

      So, yes, I'm one of those "Fuck the children" people. I'm one of those people that respects my privacy. I'm one of those people that believes in free speech. Yes, I vote libertarian too.

    4. Re:1 down, 24.9999 million to go... by jc42 · · Score: 2, Insightful

      The main character could read minds and he reads the mind of a bank security guard who is thinking about robbing the bank! He has the man investigated, ...

      This is really an example of a common failure in logic. If you were the least bit rational, you'd hope that the bank's security people are thinking about how to rob the bank. If not, they're incompetent and should be replaced with people who do think about obvious job-related problems.

      Actually, I've seen this sort of failure in person. I've worked with a couple of software teams that were concerned with network security issues. So of course their searches included phrases related to security violations. This was noticed by the actual company network admins, whose job included spying on employees' outside network accesses. The developers repeatedly faced some pointed questioning by the security people, and basically just kept saying "We're trying to do our jobs." Eventually it would settle down and we'd be left to do our jobs. Then a few months later, the same thing would happen again.

      --
      Those who do study history are doomed to stand helplessly by while everyone else repeats it.
  3. Re:Torpark by FireFury03 · · Score: 4, Insightful

    I guess this just goes to show that you should be using something like Torpark even when merely conducting an online search.

    Whilest protecting your privacy does, on the surface, seem like a good thing, I wonder if it might count against you if you were ever suspected of a crime. We've already seen 'he has some encrypted data' used as evidence (even though the contents of the encrypted file weren't known) in one successful conviction, I suspect 'he's using privacy protection software called Tor' may go down the same way.

    Remember, only people who have something to hide care about protecting their privacy. :)

  4. She should sue the pants off AOL by Anonymous Coward · · Score: 2, Insightful

    Why is it that whenever a big company blatantly violates the law, they get away with a few users boycotting them for a while, but when big business is slightly victimized, all hell breaks lose, laws are changed in their favor and individuals' lives get ruined? Sue AOL. Make them pay. Nothing says sorry like a multi-million dollar cheque.

    1. Re:She should sue the pants off AOL by ObsessiveMathsFreak · · Score: 2, Insightful

      Nothing says sorry like a multi-million dollar cheque.

      I would far prefer AOL executive officers getting jail time.

      --
      May the Maths Be with you!
  5. Re:Torpark by z0idberg · · Score: 5, Insightful

    At the very least do your searching through an engine that is separate to your ISP.

    A customer of AOL searching through AOL has their searches linked to you as an individual. If you search through google then they get your IP address, and your ISP knows which IP address links to which individual at any one time (open Wifi networks aside). But at least the same company doesnt know both.

    The data AOL released was the equivalent of any other search engine releasing its searches with IP addresses, so the same damage could be done by any other search engines logs, but imagine how much a marketing company would pay for that info from AOL with the personal details for each user included (i.e. Age, Sex, location etc.).

  6. And people wonder... by Anonymous Coward · · Score: 1, Insightful

    Why Google made such a fight out of the government's request for similar information, even if anonymized. It isn't a harmless request. I mean, the particular search identified in the article isn't a big deal, but some of the others that are in there are rather scary/personal, to say the least. Out of millions, I expect this pattern is normal.

    As goofs go, this is a biggie, but an instructive one that will hopefully serve as a wakeup call. If the government were requesting something like this, it is as invasive as a library turning over a carefully-tracked list of patron searches that would be one ISP subpoena away from being personally identifiable. It's basically one huge fishing ground.

  7. Anonymity? by CopaceticOpus · · Score: 2, Insightful

    Why is online anonymity so hard to come by? It seems that every service I use on the web keeps logs and statistics, and there always seems to be some trail linking me to whatever I've done online. Perhaps there are searches and discussions I've had online that I don't want a potential employer to come across, for example. No matter how careful I may be, I never feel too confident that I've been successfully shielded by anonymity.

    It would be nice to see more online services that at least make an effort to maintain your anonymity. How about a proxy that will do all your google searches from a set of hundreds of random IP addresses, selecting a new one each time and never connecting the searches to one another? Or how about an ISP that gives you a new, random IP address on request, and keeps NO LOGS of who had which IP in the past?

    There are two obstacles to this - first, the average joe doesn't think too carefully about anonymity, so the demand for such services is low. Second, there are legal issues regarding what information would be recorded. It would be very interesting to see the RIAA come to the ISP in my above example and request the account information of a file trader. What would happen if they literally had no logs and no way of telling which user had been using that IP? It seems like they might get in trouble, but why should they? Grocery stores aren't required to keep careful logs of each person walking through their doors. Don't ISPs have the same right to allow people to come and go?

  8. Re:Torpark by Anonymous Coward · · Score: 3, Insightful
    "But at least the same company doesnt know both."


    That is not completely correct. Remember, your ISP knows both who you are and what you searched for at any of the search engines.

    The next big privacy nightmare may be an ISP (and not a search engine) opening up its logs.
  9. Re:Legal Standing? by muellerr1 · · Score: 2, Insightful

    It seems to me that if you're going to give the guy who wants to kill his wife the benefit of the doubt, then the same benefit should extend to the child pr0n guys. Either it's protected speech or it's not. That's why the ACLU defends the neo-Nazis' right to free speech--we may not like what they say, but they have the right to say whatever they want. Not that I want to protect child pr0n guys in any way, however this is what people are talking about when they say 'slippery slope'. First it's the child pr0n, then it's the terrorism, then it's the abortionists, then it's your political opponents. Then it's you for no really good reason other than that they can.

    Courts rule time and again that if a search is illegal, the fruits of that search may not be used in court. This is the same principle. If we want the expectation of privacy in our web browsing kept as private as in our homes, then we need to find some other way to get the child pr0nsters. On the other hand, if we have no expectation of privacy in our web searches and should know better than to google child pr0n, then by all means nail them and everyone else to the wall. Just be sure to extend that principle to include things we link to on web pages, check out at libraries, and purchase at bookstores. I believe that libraries and bookstores in the US are already required by legislation to report to the government. Just remember that next time you're curious about the Anarchist's Cookbook and the recipes in there. It's all just chemistry anyway, right? I mean, I don't want to blow crap up, but I find it fascinating that horse poop and fuel oil can be that explosive and I want to know why. But ask at the public library and you might find yourself being asked uncomfortable questions by the Feds.

    Not to get too off topic, but do you remember in the wake of 9/11 how one person asked a Post Office clerk if there were any stamps without American flags on them and got detained and questioned? All I'm saying is, just because a web search returns illegal results doesn't mean it isn't a free speech or a privacy issue.

  10. Re:Torpark by jagilbertvt · · Score: 2, Insightful

    If you people RTFA, the reporter was able to find her based on her queries, not her IP Address or anything else. Torpack wouldnt help, nor would using a different search engine (after all, that search engine could be compliling the same data about your searches), unless you want to use a different search engine everytime you make a query. And even then, there are only a limited number of decent search engines out there.

  11. Privacy as evidence of nefarious character by RareButSeriousSideEf · · Score: 4, Insightful

    You raise an important and oft-overlooked point.

    This is exactly why I think it's so critical to evangelize with regard to using privacy measures. I want my mother, Aunt Sally, and 8-year old neice to be using TrueCrypt and Tor at a minimum (or, something providing similar functionality). Privacy / anonymity suites need to become as commonplace as antivirus, firewall and anti-spam software.

    Helping strong privacy measures become the status-quo serves other important goals too. It makes it more politically costly to try to legislate them out of use, and it reduces the usefulness of developing new data mining programs that require person:transaction relationships - both for the government and for private industry.

    In short, when everyone's Aunt Sally can be expected to have countermeasures against activity monitoring running on her home PC, the world will have become a safer place for all of us.

  12. Re:Torpark by Bob9113 · · Score: 3, Insightful

    At the very least do your searching through an engine that is separate to your ISP.

    Your ISP has access to everything you do online unless you're using an encrypted channel like SSL. Your HTTP requests go through your ISPs routers, which see all. Not just search terms, everything. Cox will see this submission when I send it through, and has seen each preview. Cox sees every email I send, including the full content and any attachments. Some ISPs may not be recording it, but for AOL a big part of their business is selling aggregated data to advertisers, and enterprise grade storage costs a few dollars a gig. They'd be stupid to throw away HTTP requests, and I'd lay 20 to 1 odds that they are not. At least until we have laws that require them to. But then, I think we're more like to have laws that require them to keep the data. The EU already does.

    Everything you do online is watched. It's just a question of whether you can trust your ISP. We currently lack any serious accountability for privacy breaches. The public is blissfully ignorant, and the government, far from promoting privacy, actually wants the data. In fact, depending on how far you think Epic/Carnivore/TIA goes, they already have it. Your phone records are protected by federal law, and they have those. What of data that isn't protected? Do you think they don't have it?

  13. Re:Oh those whacky AOL users... by cr0sh · · Score: 2, Insightful
    I am just trying to figure out how someone swaps back and forth between researching "death and violence", and then, right in the middle of such research, decides "yeah, let's look for 'poop'" or "yeah, 'steak and cheese', that'll do it!" - and then continues right on searching for more "death and violence".


    I can only think of a few possibilities as to why this is - either someone else was searching at the same time using the same account (or, hopefully, multiple people, unless the "steak and cheese" caused them troubles with "poop" - eh), or these records are presented in nothing like date/time order.

    Can anybody tell me if the data in the dump has more than two fields (all I have ever seen is an "id" field, and a "search terms" field listed)? Are there other fields in the data dump that indicate a date/time stamp or something so that the searches can be ordered by that?

    If not, then it is very likely that these searches were simply dumped using the equivalent of "SELECT id, terms FROM table", with no ORDER BY (or equivalent) clause tacked on, and the results were returned in a non-defined order (which might be by record insert order, by random order, or by any other possible order - for SQL compliant databases, if you don't specify an ORDER BY clause, the returned order of a recordset is undefined, and could possibly be in a different order each time the query is run by the backend SQL engine). If that is the case, than this data become just a bit more meaningless, as one could not follow a searcher's "train of thought" to determine what they were going after.

    This would have both good and bad consequences for the data as it stands - good in that it obsfuscates the data just a bit more which could conceivably help hide a searcher's intentions, but also bad in that it could make innocent intentions look more non-innocent, depending on how the result set is skewed...

    --
    Reason is the Path to God - Anon