Slashdot Mirror


Google to Anonymize Users' Search Data

Google's official blog states they are on an effort to anonymize their search data after 18-24 months. After previously fighting turning over search data to the feds, it looks like they are striking another blow to the "think of the children" crowd. Any bets on whether MSN or Yahoo! will follow suit?

14 of 151 comments (clear)

  1. Re:right.... by ag0ny · · Score: 4, Insightful

    Why would Google have to comply with EU regulations? :?

    Maybe because they do business in Europe?

  2. Re:0 months? by cdrudge · · Score: 4, Insightful

    My guess is they don't do it immediately is because there is internal business value in mining the data. User patterns, length of stay, etc. After 18 or 24 months, the internal value has dropped significantly as things change quickly. I would have thought that the value would have dropped even quicker then that, say after 6 months or maybe a year.

  3. Re:Uhm by Rakishi · · Score: 4, Insightful

    And anonymous proxies do not need to make money or provide much of a service unlike google, logs are very useful for such things.

  4. According to TFA by ReallyEvilCanine · · Score: 4, Insightful

    Google plan to make it "more anonymous". Like pregnancy, data either ARE anonymous or they ain't. You can't qualify an absolute, and "anonymous" is an absolute condition indicating lack of information.

  5. Re:Uhm by Whiney+Mac+Fanboy · · Score: 4, Insightful

    All they have to do is erase the logs every day or just not keep them. It doesn't "take an effort". Anonymous proxies have been doing this for years.

    I know where you're coming from, but that would kinda fuck with their targetting advertising business model dontcha think?

    --
    There are shills on slashdot. Apparently, I'm one of them.
  6. Re:It's there servers by solevita · · Score: 5, Insightful

    Stop googling for "jihad death to american president" if you're worried about getting caught.
    You're correct. The only people that demand privacy are those up to no good. How about I come over to your house later, sit in your bed for a bit, go through your draws and your phone records, take some pictures of you and your friends, ask the neighbours some pressing questions?

    If you've got nothing to hide, you should have no problem with this.
  7. Re:Mine already is by solevita · · Score: 5, Informative

    Ignore that post above - I'm a moron. I meant to say CustomizeGoogle Firefox plugin .Get it here.

    I guess that's what happens when you Slashdot before caffeine. I'm sorry.

  8. Re:right.... by skrolle2 · · Score: 5, Informative

    http://eur-lex.europa.eu/LexUriServ/LexUriServ.do? uri=CELEX:32006L0024:EN:NOT

    The data retention directive only applies to ISPs, and only deals with who you "communicate" with. It does not explicitly say that a record of which websites you visit should be retained, and it explicitly says that the content of the communication must not be retained.

    However, as for all EU directives, it only contains the baseline of regulation. Directives are never law themselves, but have to be implemented in each respective member state by each respective legislative body. These, in turn, are free to implement whatever they want ABOVE the baseline, so some member states may have longer retention periods for this data, some member states may require ISPs to retain additional data.

    The deadline for this directive is September this year, but if you read it, a few member states have reserved the option to postpone parts of the directive, typically of the internet-related traffic. This basically means that they recognize the difficulties in implementing it, and want more time to think about on how to do it, or possibly obstruct it.

    What all of this boils down to is that maybe, sometime in the future, if you have an European ISP, they may be required to store all the URLs that you access. Google search data is transmitted as querystring parameters that are part of the URL, which means that your search data may be stored by your ISP, in a non-anonymized way. There's nothing in this possible future that Google has to comply with, as long as they are not an European ISP.

  9. Because Google's primarily a media company... by xxxJonBoyxxx · · Score: 4, Informative

    Why not anonymise the data after zero months?
    Because Google's primarily a media company, like NBC, only with much finer detail about what you want to see. Like any media company, Google finds demographic data incredibly valuable because it allows them to "connect" you with the "correct" advertisers. There's no way in hell Google would let people be completely anonymous; it goes against their business plan. (I'd also bet three years from now we'll find through some court case that backup tapes somewhere really extend "anonymous after 18 months" to 4-5 years.)
  10. Re:It's there servers by Dunbal · · Score: 4, Interesting

    Ah, the out of context argument. My house is private by the definition that I have locks on the doors and blinds on the windows.

          Funny - my computer is in my house, behind locks and blinds too. Hey Google's computers also are behind lock and key, and they even have security guards and alarm systems. I don't ever remember giving Google permission to disclose any information shared between them and I - oh and heaven forbid I go around giving away the information Google found for me - I'd get sued!

          Why would the whole world automatically be party to the information Google and I shared one evening? My computer sent that information to a specific internet address, and the answer came back specifically to my computer.

          Not so out of context...

    --
    Seven puppies were harmed during the making of this post.
  11. Re:The real WTF is.. by Dunbal · · Score: 4, Funny

    Not only that, but is the history of searches you made over 2 years ago relevant to your current searches performed today?

          Studies have shown that 43% of all people who search for "Donkey Love" will buy our product within 3 years if they see our ads.

    --
    Seven puppies were harmed during the making of this post.
  12. No Consent by Anonymous Coward · · Score: 4, Interesting

    Exactly, it's to Google's MONETARY benefit that they record this information. The EU Privacy law says THEY CANNOT RECORD MORE PERSONAL INFORMATION THAN IS NEEDED FOR A TRANSACTION. Now that it's clear that search data is personally identifiable, the EU Privacy law should be used to FORCE GOOGLE TO QUIT IT.

    "The moment you sent your request out over the internet in plain text to a third party (that is a corporation out to make money you know) you lost that."

    Not so, the law says we have to consent and we didn't consent!

    And what about when that party isn't Google? Google analytics is not on Google's site, it's embedded on third party sites, Google's adsense is on other people's site too. I didn't consent to handing my data to Google when I surfed to third parties site, Google took that data and recorded it in violation of EU privacy laws.

    This has also been sued for before resulting in Doubleclick backing down over exactly this issue.

    http://archives.cnn.com/2000/TECH/computing/01/28/ double.click.lawsuit.idg/

    "A California woman has filed suit against DoubleClick, accusing the U.S.-based online advertising company of unlawfully obtaining and selling consumers' personal information, according to a statement issued by her attorney's office."

    "Hariett M. Judnick filed the suit in Marin County Superior Court in California, on behalf of the "general public of the state of California," the statement said.
    The suit alleges that DoubleClick employs Internet cookies to identify users and track their movements on the Internet. The company tracks and records the sites an individual visits, as well as the information transmitted on the sites, such as names, ages, addresses, shopping patterns and financial information."

  13. Things That Bit Butts, Part Deux by WED+Fan · · Score: 5, Insightful

    List of nifty little phrases that have bitten their speakers in the ass:

    • They will never bomb Berlin
    • Read my lips, no new taxes
    • I did not have sex with that woman
    • Mission accomplished
    • Don't be evil

    Now Google brings us:

    Let's just be less evil, now that we've been caught.

    --
    Politics is the art of looking for trouble, finding it everywhere, diagnosing it incorrectly and applying the wrong fix.
  14. Re:Hash the IP addresses? by santiago · · Score: 5, Insightful

    There's 2^32 IP addresses under IPv4. If Google is doing the hashing, then they know the hash function. How long do you think it would take them to brute-force break the hash by hashing every possible IP address and creating a map from the hashed values back to the originals? Express your answer in microseconds.

    (If your solution is to increase the space of inputs by adding a variable salt value, please explain how this allows them to use the resulting hashes for aggregation.)