Slashdot Mirror


Examining Presidential Candidates Via Google Trends

Michael Giuffrida writes "Google Trends is a free application produced by Google that shows how often a given keyword is searched for, over time. After seeing how candidates in the 2008 primaries have done in Google Trends in different states, it's clear that this tool can be very useful for campaigns." Read on below for some of the specifics about how these candidates have fared, Google-wise. "For example, in New Hampshire, in the days leading up to the Jan. 8 primary, Clinton was searched for the most, followed by Obama, followed by Edwards — which was how the primary results turned out. In other words, the candidates most searched for on Google by users in New Hampshire were also the candidates with the most number of votes. This works for many other states as well.

For the first 37 Democratic primaries and caucuses, 32 states had enough data on Google to make a prediction. This method correctly predicted 27 of those 32 sates. Predictions aside, the tool is also useful simply in finding out how popular a candidate is in different states, assuming that the more popular candidates are entered more often as a search term in Google (an assumption that was verified, at least for the Democratic primaries, by the positive association found)."

9 of 119 comments (clear)

  1. 27 out of 32 by TubeSteak · · Score: 4, Insightful

    Interesting, but I don't think it's that hard to beat an 84% accuracy rate with traditional methods.

    --
    [Fuck Beta]
    o0t!
  2. Old farts by Hankapobe · · Score: 4, Insightful
    Folks, especially you statisticians out there, what's wrong with these pictures?

    The sample: folks on the internet.

    Who's missing: folks who aren't on the internet; like old farts, poor people and Amish. As far as the old farts are concerned: they are the most well organized and the most vocal political group in this country. I think these charts reflect nothing.

  3. ridiculous statistical flaws by jeske · · Score: 4, Insightful

    The author is quick to dismiss using "Obama" and "Hilary" as the indicator terms for fear that Obama's huge online campaign popularity is not representative of the true popularity of the two candiates.

    However, he makes no mention of the fact that "Obama" and "Hillary" are the most popularly used terms to refer to the candidates. Almost all candidates are refferred to primarily by their last names. Hilary is a special case where we use her first name because her husband was so recently president.

    His use of "Barack" and "Hilary" is about as statistically accurate as using "Barack" and "Rodham". Fortunatly, this inaccuracy is obviously visible in his numbers, because using his first-name method it quickly looks like Edwards might be a write-in candidate to rival them both.

    Please slashdot, stop posting braindead stories.

  4. Cherrypicking the data to reach false conclusions by MSTCrow5429 · · Score: 5, Insightful
    "For the Republican Primaries, last names could easily be used. Ron Paul was excluded. His last name is too common. Using his full name is not a good solution either, because he had massive popularity on the Internet, becoming a meme of sorts, which did not at all correspond with his actual successes (or lack thereof) in the primaries."

    So, in other words, any data that contradicts the hypothesis will be thrown out.

    --
    Slashdot: Playing Favorites Since 1997
  5. I Know It's Not Statistically Valid. by mgiuffrida · · Score: 5, Insightful

    Hey, guys, You're all right. There are serious statistical flaws here. I chose the keywords "hillary" and "barack" and "edwards" because that was the model that showed the best success. If one were actually predicting this live, he or she would soon see that "hillary" and "barack" do produce better results and would switch to those terms, simply because it fits the model better. Every election will have its outliers (Obama, Ron Paul). But hopefully this brief overview will provoke some campaign analysts to actually try to interpret Google Trends results for themselves, using more advanced models.

  6. Re:In case anyone looks at the pretty graphs... by amccaf1 · · Score: 5, Insightful

    "For the Republican Primaries, last names could easily be used. Ron Paul was excluded. His last name is too common. Using his full name is not a good solution either, because he had massive popularity on the Internet, becoming a meme of sorts, which did not at all correspond with his actual successes (or lack thereof) in the primaries."
    Translation: Including Ron Paul would have indicated our hypothesis was incorrect, so we excluded him.
    --
    "Flag on the moon. How did it get there?"
  7. Re:In case anyone looks at the pretty graphs... by Z34107 · · Score: 4, Insightful

    They're using a candidate's popularity in Google Trends as a measurement of the success of their campaign.

    For Ron Paul, his popularity on the internet has nothing to do with his real-life political success - as grandparent poster said, he's an internet meme. You think there's any correlation between "Ron Paul" jokes on forums and genuine interest in his campaign?

    Their "hypothesis" for the other candidates Google trends measures the success of their campaign. With Ron Paul, it's measuring something else entirely.

    --
    DATABASE WOW WOW
  8. Re:In case anyone looks at the pretty graphs... by laddiebuck · · Score: 4, Insightful

    So how do you know that say, searches for "Barack" are any more meaningful than searches for "Ron Paul"? No, the writer had a neat theory and got so attached to it that he discarded a bad data point instead of admitting it didn't work for Ron Paul.

    By including Ron Paul but at least being complete and honest, the analysis would have been more worthwhile than what it now is -- a pile of rubbish.

  9. Re:Where is Ron Paul? by jasonditz · · Score: 4, Insightful

    TFA says they deliberately excluded him because he was so popular on the internet and his search results don't correspond to actual votes.
    In other words... their examination works great, except when it doesn't. And in that case, we'll just not included that data in the final results.
    Wish I could've gotten away with that in college.