New Google Tool To Find Trend Correlations
Kilrah_il writes "In 2008 Google found correlation between seasonal flu activity and certain search term, a finding that allowed it to track flu activity better and more rapidly than previous methods. Now, Google is offering a new tool, Google Correlate, that allows researched to do the same for other trends. 'Using Correlate, you can upload your own data series and see a list of search terms whose popularity best corresponds with that real world trend.' Of course, Google reminds us that correlation does not imply causation."
This is a wonderful tool. In the short term, it should allow a lot of people to track interesting trends.
In the long term, though, Heisenberg Rules. If I may paraphrase, "Knowledge of the model, invalidates the model."
Want a real world example today? Stock market. This is why automated make-money tools don't work nearly as well as they should.
Don't take life too seriously; it isn't permanent.
Unfortunately the service appears to be limited to US search data. Hopefully this will be extended in the future.
I'm really starting to like this company. Free web browser, free word processor (and spreadsheet?), free language translation, free nudie pics, free scanned books, free email, free Usenet reader, and now this cool Dataset research tool.
Still not sure I want to store my documents on the internet though. (1) Not secure. (2) Government can review the documents without having to ask a judge for a warrant.) But overall I guess Google is a decent company. Why pay for stuff you can get for free and legal?
My AC stalker: " I personally agree with your posts most of the time, but that won't keep me from modding you troll"
So when do they release the next product: Google Causation?
Correlation doesn't imply causation, but it does waggle its eyebrows suggestively and gesture furtively while mouthing 'look over there'.
http://xkcd.com/552/
Correlations are one of those simple statistical terms that lots of non-technical people like to throw around without actually knowing what it means. It's a wonderful tool that Google has provided for everyone but people need to remember what the basic assumptions are of correlations, namely a relatively normal distribution of scores and independence of observations. Independence is especially important if you're tracking search engine results because if you were to look at how many times people Google'd Randy Savage's name the day he died it would influence the subsequent day, ultimately biasing whatever other variable you decided to correlate it with.
Carl Sagan quotes get you an automatic +5 on all posts.
Trends in online web search query data have been shown useful in providing models of real world phenomena. However, many of these results rely on the careful choice of queries that prior knowledge suggests should correspond with the phenomenon.
Yes, that is how science is done; hypothesis, predict, test, evaluate.
Here, we present an online, automated method for query selection that does not require such prior knowledge. Instead, given a temporal or spatial pattern of interest, we determine which queries best mimic the data. These search queries can then serve to build an estimate of the true value of the phenomenon.
So we have a backwards type of science: Evaluate, test, predict, hypothesis. Cuz hey, if there's a correlation, there must be a relation, and if there's a relation, we can build an estimate of the value of the relation, right? The marketing manager is gonna LOVE this....
sysadmins and parents of newborns get the same amount of sleep.
>>>Explain how is local storage would be more secure than remote storage?
HUGE difference. It requires a warrant to enter my house and obtain the files. A warrant requires probable cause (we suspect he's a murderer, because we smell dead bodies), and review by an impartial judge to approve the warrant.
Remote storage is subject to random snooping by a bored FBI agent browsing through Google's or Apple's or Microsoft's servers. (Thanks to the Patriot Act.)
My AC stalker: " I personally agree with your posts most of the time, but that won't keep me from modding you troll"
With local storage, you have choices. Who can use my computer? Do I use an encrypted volume? Do I use Windows, Linux/*nix, or Mac? What program(s) do I do it with?
With Google docs, your spread sheet is in their format. Your letter is in their format. You can export it, print it, and whatever else makes you feel good. They retain your browsing and activity history. They have every email you've sent and received. In theory it's all yours privately. In reality, it's yours, and viewable by everyone at Google, assuming they have the permissions. I won't say it's *all* google employees that have access, but it is greater than 1, which more than you'd want.
If law enforcement wants to see your remote data, they serve Google with a subpoena or warrant. Google hands the information over, and you may never know. It would likely be done in a court outside of your area anyways.
If law enforcement wants to see your local data, they serve you with the warrant or subpoena. You can chose to contact an attorney immediately. You know what they took, and what they viewed.
Serious? Seriousness is well above my pay grade.