Slashdot Mirror


New Google Tool To Find Trend Correlations

Kilrah_il writes "In 2008 Google found correlation between seasonal flu activity and certain search term, a finding that allowed it to track flu activity better and more rapidly than previous methods. Now, Google is offering a new tool, Google Correlate, that allows researched to do the same for other trends. 'Using Correlate, you can upload your own data series and see a list of search terms whose popularity best corresponds with that real world trend.' Of course, Google reminds us that correlation does not imply causation."

17 of 76 comments (clear)

  1. Never Forget: Heisenberg Rules by NReitzel · · Score: 2

    This is a wonderful tool. In the short term, it should allow a lot of people to track interesting trends.

    In the long term, though, Heisenberg Rules. If I may paraphrase, "Knowledge of the model, invalidates the model."

    Want a real world example today? Stock market. This is why automated make-money tools don't work nearly as well as they should.

    --

    Don't take life too seriously; it isn't permanent.

    1. Re:Never Forget: Heisenberg Rules by Yvan256 · · Score: 2

      Are you suggesting coconuts migrate?

    2. Re:Never Forget: Heisenberg Rules by JWSmythe · · Score: 2

      Sure they migrate. Up to approx 100', mostly vertically.

      --
      Serious? Seriousness is well above my pay grade.
  2. US only? by Albanach · · Score: 2

    Unfortunately the service appears to be limited to US search data. Hopefully this will be extended in the future.

    1. Re:US only? by i+kan+reed · · Score: 2

      Other places have privacy laws that google isn't ready to lawyer about.

  3. Google == free stuff! by cpu6502 · · Score: 2, Interesting

    I'm really starting to like this company. Free web browser, free word processor (and spreadsheet?), free language translation, free nudie pics, free scanned books, free email, free Usenet reader, and now this cool Dataset research tool.

    Still not sure I want to store my documents on the internet though. (1) Not secure. (2) Government can review the documents without having to ask a judge for a warrant.) But overall I guess Google is a decent company. Why pay for stuff you can get for free and legal?

    --
    My AC stalker: " I personally agree with your posts most of the time, but that won't keep me from modding you troll"
    1. Re:Google == free stuff! by Anonymous Coward · · Score: 2, Informative

      “If you are not paying for it, you’re not the customer; you’re the product being sold.”

    2. Re:Google == free stuff! by cpu6502 · · Score: 2

      >>>Those things you mention are not Google's product and you are not Google's customer. YOU are in effect Google's product. They're selling you to advertisers and "paying" you with those things.

      Same is true with free TV and radio.
      Your statement is 100% true, but
      I don't worry about it. (shrug)

      --
      My AC stalker: " I personally agree with your posts most of the time, but that won't keep me from modding you troll"
  4. next product by stormpunk · · Score: 2

    So when do they release the next product: Google Causation?

    1. Re:next product by Chrisq · · Score: 2

      So when do they release the next product: Google Causation?

      There is actually a very strong possibility of this. Because sites are ranked by popularity of selection Google itself could well amplify trends. If you do a search for "family health" and the top results are news reports on increased rates, what would your next search be? If you get sick what are you likely to put it down to?

  5. xkcd by SPBesui · · Score: 2

    Correlation doesn't imply causation, but it does waggle its eyebrows suggestively and gesture furtively while mouthing 'look over there'.

    http://xkcd.com/552/

  6. Assumptions out the window by Palmsie · · Score: 2

    Correlations are one of those simple statistical terms that lots of non-technical people like to throw around without actually knowing what it means. It's a wonderful tool that Google has provided for everyone but people need to remember what the basic assumptions are of correlations, namely a relatively normal distribution of scores and independence of observations. Independence is especially important if you're tracking search engine results because if you were to look at how many times people Google'd Randy Savage's name the day he died it would influence the subsequent day, ultimately biasing whatever other variable you decided to correlate it with.

    --
    Carl Sagan quotes get you an automatic +5 on all posts.
  7. Weird science by cultiv8 · · Score: 3, Insightful
    From the Google Correlate Whitepaper:

    Trends in online web search query data have been shown useful in providing models of real world phenomena. However, many of these results rely on the careful choice of queries that prior knowledge suggests should correspond with the phenomenon.

    Yes, that is how science is done; hypothesis, predict, test, evaluate.

    Here, we present an online, automated method for query selection that does not require such prior knowledge. Instead, given a temporal or spatial pattern of interest, we determine which queries best mimic the data. These search queries can then serve to build an estimate of the true value of the phenomenon.

    So we have a backwards type of science: Evaluate, test, predict, hypothesis. Cuz hey, if there's a correlation, there must be a relation, and if there's a relation, we can build an estimate of the value of the relation, right? The marketing manager is gonna LOVE this....

    --
    sysadmins and parents of newborns get the same amount of sleep.
    1. Re:Weird science by Hatta · · Score: 4, Insightful

      Absolutely correct, this is going to swamp us in false positives. Remember, in order for science to work the way it's supposed to we have to report the negative results as well as the positive results. If 20 groups do the same experiment and only one gets a result significant at p=.05, that "positive" result doesn't mean anything. p=.05 means there's a one in 20 chance of the correlation being random.

      It's the same thing here. If Google goes out looking for positive results, and ignores all the negative results this is going to be so skewed as to be worthless.

      --
      Give me Classic Slashdot or give me death!
    2. Re:Weird science by i+kan+reed · · Score: 4, Insightful

      Yes, that's true, except there's a step before hypothesis. Observe. You're not allowed to use data from your observations that generated the hypothesis to support it, but you are allowed to use data to build a hypothesis in the first place.

      As their comic points out, property values correlate to liposuction searches. That's an interesting fact that you might make a socioeconic hypothesis based on. You could then turn to other avenues of research to validate your hypothesis.

      Not everything in science is a race to conclusions.

  8. Re:What? by cpu6502 · · Score: 2

    >>>Explain how is local storage would be more secure than remote storage?

    HUGE difference. It requires a warrant to enter my house and obtain the files. A warrant requires probable cause (we suspect he's a murderer, because we smell dead bodies), and review by an impartial judge to approve the warrant.

    Remote storage is subject to random snooping by a bored FBI agent browsing through Google's or Apple's or Microsoft's servers. (Thanks to the Patriot Act.)

    --
    My AC stalker: " I personally agree with your posts most of the time, but that won't keep me from modding you troll"
  9. Re:What? by JWSmythe · · Score: 2

      With local storage, you have choices. Who can use my computer? Do I use an encrypted volume? Do I use Windows, Linux/*nix, or Mac? What program(s) do I do it with?

        With Google docs, your spread sheet is in their format. Your letter is in their format. You can export it, print it, and whatever else makes you feel good. They retain your browsing and activity history. They have every email you've sent and received. In theory it's all yours privately. In reality, it's yours, and viewable by everyone at Google, assuming they have the permissions. I won't say it's *all* google employees that have access, but it is greater than 1, which more than you'd want.

        If law enforcement wants to see your remote data, they serve Google with a subpoena or warrant. Google hands the information over, and you may never know. It would likely be done in a court outside of your area anyways.

        If law enforcement wants to see your local data, they serve you with the warrant or subpoena. You can chose to contact an attorney immediately. You know what they took, and what they viewed.

    --
    Serious? Seriousness is well above my pay grade.