Slashdot Mirror


Mapping Google News

CousinLarry writes "A neat project called Buzztracker.org has been mining Google News for over a year and keeping track of relationships between geographic locations mentioned in articles. The results are some really cool maps that actually seem to reflect the "buzz" of the day - check out the Vatican clusters from earlier this month, or the global New Year's chatter. You can also dig down into the articles from which the maps were generated."

44 of 133 comments

  1. I can't find Slashdot. by Anonymous Coward · · Score: 2, Funny

    Where is Slashdot on the map?

  2. Definitely on the Nifty List by gateman9 · · Score: 2, Interesting

    This is by far one of the most interesting uses of data-mining I've seen in a while. Neat to see what are the hotspots, as far as news goes, in the world.

    The guys at Buzztracker deserve a cookie (edible variety).

    --
    You can't defeat physics.
    1. Re:Definitely on the Nifty List by FuturePastNow · · Score: 5, Informative

      This site has another list, of the sources Google News uses (something Google refuse to publish). Also an interesting use of data mining.

      --
      Give a man fire, and you warm him for the night. Set a man on fire, and you warm him for the rest of his life.
  3. Re:MetaWeb by winkydink · · Score: 4, Insightful

    SEMANTIC WEB!

    Thank you Tim (Berners-Lee). Didn't know you were a /. reader. The question remains, while it's very interesting (and cool), what does one do with the aggregated data?

    --

    "I'd rather be a lightning rod than a seismometer." -Ken Kesey

  4. thats old news.. by ShaniaTwain · · Score: 4, Funny

    ..no, literally. its made up of old news..

  5. Can't say I'm surprised. by CSMastermind · · Score: 3, Insightful

    Well when you think about it aren't those the exact places you'd expect to be hotspots?

    1. Re:Can't say I'm surprised. by alphan · · Score: 4, Insightful
      Well when you think about it aren't those the exact places you'd expect to be hotspots?

      It is good that you could expect that. For me, there are a lot of different factors that add to complexity. Neutrality of Google being one, the fact that Google News is in English being another.

    2. Re:Can't say I'm surprised. by jonno317 · · Score: 4, Insightful

      Well, actually if you look at the bottom of google news you'll notice that it's in 21 languages other than English (counting Canadian English, Australian English, and the like as separate languages...so maybe a few less than that technically). But I'd say that Google is in enough other major languages to not be considered biased (at least as far as languages are concerned). If buzztracker.org is biased toward English, then I would say it's because of their choices and no fault of Google.

    3. Re:Can't say I'm surprised. by alphan · · Score: 2, Insightful

      Even if one considers all languages, there are a lot of English news sources that are located in non-English speaking countries. Plus, I can safely bet that English still has the majority of the news pie.

  6. That's cool by Red Moose · · Score: 3, Insightful
    That is finally some news for nerds. About fucking time.

    What a cool site, and it works very quickly and is not overflowing with advertising crap?

    --

    Acting stupid isn't much fun when there's someone around who knows better

  7. This gives me a great idea... by MisterLawyer · · Score: 4, Funny

    I should start a website, beertracker.org, to keep track of my daily buzz.

    1. Re:This gives me a great idea... by Pfhorrest · · Score: 2, Funny

      It`s nice but you will have to drink a lot off bear in a lot`s off places ;) maybe i`am going to help you :)

      By the way you're writing it looks like you've already started.

      --
      -Forrest Cameranesi, Geek of all Trades
      "I am Sam. Sam I am. I do not like trolls, flames, or spam."
  8. virtual sightseeing by tedtimmons · · Score: 3, Interesting
    not news (pun intended), but here is a compilation of neat google maps I've been collecting:

    http://perljam.net/notes/interesting-google-satellite-maps/

    -ted

  9. Spam tracks current events too by G4from128k · · Score: 2, Interesting

    I've noticed an upsurge in "Living Willing" spam since the Terry Schiavo story and even a few Pope-related offers.

    --
    Two wrongs don't make a right, but three lefts do.
  10. Nelson Mandela != Nelson town by Anonymous Coward · · Score: 5, Interesting

    It looks like the code needs a bit more tuning. http://www.buzztracker.org/index.html lists Nelson, NZ, as one of the hot spots. Clicking on that lists a bunch of articles about apartheid. I think the site code misinterpreted a reference to Nelson Mandela in one of the articles.
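    The misreading described above can be illustrated with a minimal sketch. It assumes the site matches article words against a list of place names, which the thread does not confirm. The gazetteer, the surname list, and the function name `find_places` are all hypothetical and chosen only for illustration.

```python
import re

# Hypothetical gazetteer; the real Buzztracker place list is not shown in the thread.
GAZETTEER = {"Nelson", "Cairo", "Washington"}

# Hypothetical surname list: a place-like word followed by one of these
# is treated as a person's first name rather than a location.
KNOWN_SURNAMES = {"Mandela"}


def find_places(text):
    """Return gazetteer names found in text, skipping ones followed by a known surname."""
    tokens = re.findall(r"[A-Za-z]+", text)
    places = []
    for i, tok in enumerate(tokens):
        if tok not in GAZETTEER:
            continue
        next_tok = tokens[i + 1] if i + 1 < len(tokens) else ""
        if next_tok in KNOWN_SURNAMES:
            continue  # "Nelson Mandela" is a person, not the town
        places.append(tok)
    return places
```

    Without the surname check, an article about Nelson Mandela would count toward Nelson, NZ. A real system would likely need something stronger than a fixed surname list, such as a named-entity tagger.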

    1. Re:Nelson Mandela != Nelson town by Joff_NZ · · Score: 2, Informative

      Also, Nelson is probably one of the least newsworthy places on the planet.

      It is, however, quiet, with stunning weather, awesome beaches, and friendly hippy locals. Many nice holidays spent in and around Nelson :-)

      --
      The revolution will not be televised. It won't be on a friggin blog either
  11. Does Google mind? by IBeatUpNerds · · Score: 4, Interesting

    I remember about a year ago or so, there was a guy who was mining google news to produce an RSS feed. IIRC, google politely demanded that individual stop offering this to people. I can't find the article to cite this, maybe someone can help? At any rate, I wonder how google will feel about this.

  12. New Business Plan! by Fyz · · Score: 4, Funny

    1. Map out the world in x and y coordinates.
    2. Feed google buzz data into huge neural network.
    3. Predict location and magnitude of future events.
    4. ???
    5. Profit!

    1. Re:New Business Plan! by MrAnnoyanceToYou · · Score: 2, Interesting

      4 = Sell your business and services to Google.

      New, much better business model than the old one of sell your business to M$ or Intel. Why better business model? Who would you rather work for?

    2. Re:New Business Plan! by Ingolfke · · Score: 5, Funny

      I actually spent the last 3 months of my life writing something very similar to this for my PhD thesis. My work was slightly different than what you explained though. Basically I take the Google data, use it to prime the network, and then feed in historical data from a particular news site into the neural network. The app processes the data, and predicts which news events the news site will report on in the coming days. I've run this application against Slashdot, since such a wide range of topics are reported on here, and have found that the application can guess 7 stories from the next day 87% of the time. I didn't have nearly this much success with other news sites, so I decided to figure out why I was so successful. I found that the neural network was simply reporting on news events that happened more than 3 weeks ago, contained words like 'Star Wars', would search for anything about Google and then would add the question "Are they becoming evil?", would take all Microsoft and EFF press releases, and somehow managed to pull every 17th email from Linus Torvalds's inbox, would repost every 19th article, and would occasionally take a story about someone being prosecuted and insert "Your Rights Online" in front of the original news source's title. Unfortunately for me, the neural network seemed to learn too much from Google and now requires that I become a member of its club before I can see any more future stories.

  13. BuzzTracker? by Storlek · · Score: 3, Insightful

    Apparently they didn't Google their own name, or else they would've noticed the name was already in use for a fairly popular music composition program.

    --
    Bears don't normally eat things that talk and move backwards.
  14. Other possible topics by 14erCleaner · · Score: 2, Funny
    Googling mapping news

    New Google mappings

    Goo mapping news

    Mapping new Googles

    New mapping goggles

    --
    Have you read my blog lately?
  15. Animations by Doctor O · · Score: 3, Interesting

    Now, take the data and put up some nice animations, archive the first 100 articles or so and put it into some nice database to mine for interesting stuff. Should not be too hard to script together the data gathering, you can already start fetching stuff while developing the functionality and frontend.

    Someone wanna join? This cries 'distributed database'... ;)

    --
    Who is General Failure and why is he reading my hard disk?
    1. Re:Animations by jmc · · Score: 2, Interesting

      Now, take the data and put up some nice animations...

      Indeed, I see lots of interesting possibilities for mapping and animating data like this on the web.

      In fact, after the last election I had a nifty idea to create some sort of animated map of the US showing how political affiliations have geographically shifted over the years (primarily, the North slowly turning blue and the South slowly turning red). Then I started to think about extending that to a generic web app to display and animate various demographic data. Basically, a very dumbed down and animated online GIS.

      It really would've been somewhat trivial to throw together with Flash, pulling in XML data streams. My biggest stumbling block ended up being finding a nice raw map of the US in vector format, either at the county level or even state level.

      Anyone happen to know of a good source for free vector maps that can be easily imported into Flash?

    2. Re:Animations by Chuq · · Score: 2, Insightful

      In fact, after the last election I had a nifty idea to create some sort of animated map of the US showing how political affiliations have geographically shifted over the years (primarily, the North slowly turning blue and the South slowly turning red). Then I started to think about extending that to a generic web app to display and animate various demographic data. Basically, a very dumbed down and animated online GIS.

      This sounds like it would be most easily done with a Worldwind Add-on!

      --
      - Chuq
  16. That's BS! by Fyz · · Score: 2, Funny

    It's just a screenshot from the NORAD command center!

  17. This is pretty nifty by aftk2 · · Score: 4, Interesting
    While mapping the news activity over the whole world is certainly cool, I can see this having an even greater effect when applied to a smaller area. For example, if you're moving somewhere, you could easily see crime news applied to the particular region. It doesn't all have to be depressing news, either: you could use such a "buzz" indication to find out information like the following:
    • find where there are lots of new jobs being generated
    • view up-and-coming areas by their positive "buzz" (new creative hot spots, architecture, etc...)
    • find areas of town with great new restaurants
    I think this is where it starts to get exciting (and more useful). Mapping Google news? Meh. Mapping the northwest, and giving that information to Citysearch? You betcha.
    --
    concrete5: a cms made for marketing, but strong enough for geeks.
  18. Shoulda thought of this earlier by nxtr · · Score: 2, Informative

    Too bad. They have it already.

  19. Through With Buzz by Doc Ruby · · Score: 3, Interesting

    The big circle in the US is called "Washington", which is rated at 03%. It obscures "New York" in the GUI. Boston is available, and the only other US buzz is Grand Rapids, apparently on the strength of a local paper's report 2 days ago of a resident killed in Cairo. I find all that hard to believe, or at least to make into any sense. The GUI is unusable, and the mapping of data to "reality" defies sensibility. I think the buzz has gone to their heads, and they should put the pipe down quick.

    --
    make install -not war

  20. but google news API is not available yet, is it? by Cryofan · · Score: 2, Interesting

    How are they parsing google news content? Google news does not yet offer an API, correct? What are they doing, screen scraping? You can only query google programmatically about 1000 times a day, I think.

    I wish I had more details...

    And this is a REALLY stupid aspect to tackle--connections between cities.
    The real cheese would seem to be in word counts, and connections between words--like "economy" and "recession", etc.
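    As a rough illustration of the word-connection idea, here is a minimal sketch that counts how often pairs of tracked words appear in the same article. The function name `cooccurrence`, the sample vocabulary, and the assumption that articles are available as plain strings are all hypothetical; nothing in the thread says how article text would actually be obtained.

```python
from collections import Counter
from itertools import combinations
import re


def cooccurrence(articles, vocabulary):
    """Count, for each pair of vocabulary words, the articles containing both."""
    counts = Counter()
    for text in articles:
        # Lowercase and split into words; each word counts once per article.
        words = set(re.findall(r"[a-z]+", text.lower()))
        present = sorted(words & vocabulary)
        for pair in combinations(present, 2):
            counts[pair] += 1
    return counts


articles = [
    "The economy slid into recession",
    "Recession fears hit the economy and jobs",
    "Jobs report due",
]
pairs = cooccurrence(articles, {"economy", "recession", "jobs"})
```

    Pairs are stored in alphabetical order, so `pairs[("economy", "recession")]` is the number of sample articles that mention both words.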

    --
    eat shiat and bark at the moon
  21. Wrong question. by Anonymous Coward · · Score: 2, Funny

    Why haven't they done this with porn and geographically linking ip addresses?

  22. Really cool but suffers from a common problem... by terraformer · · Score: 4, Insightful
    That being one level of hierarchy to display complex data. China (Top with .09%) is top dog because all international press refer to China as a whole and fail to reference individual places in China (e.g., Guangdong Province) despite the sheer size of the country. Therefore, China is overrepresented when looking at news. However, in the case of Gaza (the second highest at .08%), the exact opposite occurs where Gaza steals all of the thunder from the larger Palestinian issue (Gaza is one of two territories in question and is not in Palestine, the place where all of the problems in the middle east originate from).

    If they represented this in hierarchical format, the middle east would dominate by picking up points from children Gaza, West Bank and Palestine (not to mention Iraq). Baghdad is probably a good example here. How much actually happens in areas outside of Baghdad proper but gets labeled Baghdad anyhow?
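    The hierarchical roll-up suggested here could look something like the sketch below. The parent mapping, the mention counts, and the function name `roll_up` are made up for illustration and are not Buzztracker's data or code. The sketch also assumes the parent mapping is a tree with no cycles.

```python
from collections import defaultdict

# Hypothetical place hierarchy: child -> parent. Top-level regions have no entry.
PARENT = {
    "Gaza": "Middle East",
    "West Bank": "Middle East",
    "Baghdad": "Iraq",
    "Iraq": "Middle East",
    "Guangdong": "China",
}


def roll_up(counts, parent):
    """Add each place's mention count to itself and to every ancestor."""
    totals = defaultdict(int)
    for place, n in counts.items():
        node = place
        while node is not None:
            totals[node] += n
            node = parent.get(node)  # None once we pass the top of the tree
    return dict(totals)


# Made-up mention counts for illustration only.
totals = roll_up({"Gaza": 8, "West Bank": 3, "Baghdad": 5, "China": 9}, PARENT)
```

    With these sample numbers, the parent region collects the mentions of Gaza, the West Bank and Baghdad, while China keeps only its own count because none of its children were mentioned.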

    --
    Who are you? The new #2 Who is #1? You are #617565. I am not a number, I am a free man! Muhahaha.
  23. Why do we need this? by A Sea and Cake · · Score: 3, Interesting

    Why do we need this?

    A map showing the origins of the stories that get the least attention but contain certain keywords - famine, Schiavo, wobbegong, whatever - would strike me as more interesting.

    We already know where the stories indicated by this map are coming from, because they're taking up ridiculous amounts of space on the front pages of newspapers everywhere.

  24. Chasing the Pack (and running from it) by doom · · Score: 3, Insightful
    What I like about google news is that it's an incredibly easy way of keeping an eye on what has been called the "pack journalism" problem. Just as an example, try doing a google news search on "Count Every Vote Act": that's consistently turned up less than 100 hits since it was announced. Is there some reason it's not newsworthy? Similarly, when the Ohio recount thing was going down last year, it took *forever* for it to punch through as a top-level story. Evidently the pattern is something like a story is dead until the AP Wire runs it, and then a thousand other news "sources" pick it up.

    I've had the thought that it might be cool to implement an anti-news site that would do something like show you links to New York Times stories that have never been referenced by the top page of Google News.

  25. How long until... by jacksonj04 · · Score: 2, Insightful

    How long until Google invites the creators to join the team for coming up with such a great idea? Or failing that, acquire the rights to the concept and implement it.

    Google have a habit of doing great things with software they get hold of, can't wait to see what they do with this.

    --
    How many people can read hex if only you and dead people can read hex?
  26. I thought that was obvious by commodoresloat · · Score: 2, Funny

    One draws maps with red circles on them.

  27. The need for maps that show where news is not from by WillAffleckUW · · Score: 2, Interesting

    We already know where the stories indicated by this map are coming from, because they're taking up ridiculous amounts of space on the front pages of newspapers everywhere.

    Exactly. If it hadn't been for the Tsunami, would we have seen as many stories from adjacent countries, for example?

    Just because it's not reported, doesn't make it not news. It's just that our filters screen out things that aren't the latest thing.

    --
    -- Tigger warning: This post may contain tiggers! --
  28. Another cool google hack by digital bath · · Score: 2, Informative

    www.paulrademacher.com/housing

    A cool combination of Craigslist housing listing and Google maps. Seems to be very well done.

    --
    find / -name "*.sig" | xargs rm
  29. Animate it! by elambi · · Score: 2, Interesting

    It would be interesting to watch an animation of where the Buzz is over a period of time.

    --
    Sig, we don't need no stinking Sig!
  30. Nov 3rd? Dec 26? by mzieg · · Score: 5, Insightful
    I have to question the results a bit. Look at the archive for November 2004, especially around Nov 3rd. Anyone remember any "buzz" about Ohio? Maybe a Florida 2000 reprisal? "Battleground States," anyone? That was a hugely geographic news event, and it doesn't even register on their chart. Likewise, Sumatra barely merits a blip on Dec 26. I'm not sure I'm buying this.

    What we have here is one computer algorithm aggregating another computer algorithm's assessment of "newsworthy," with no provision for hindsight or fluff-vs-historical weighting. It's a neat idea, and the graphics are pretty slick, but I don't see any real value here.

  31. Re:please tell us more! (and that link is dead.... by Ghoser777 · · Score: 2, Interesting

    It's actually both:

    From the text (how did I get marked redundant in my first post, even if I did screw up the url somehow):

    In the hope that these events have resulted from your inadvertence rather than your deliberate actions, we propose the following:

    1. We demand that you cease and desist using our search service in a manner that is not authorized by our Terms of Service. This includes, but is not limited to, (1) no longer sending automated queries to www.google.com, or other affiliated sites, and (2) no longer using search results from www.google.com or other affiliated sites, except in accordance with our terms of service and this letter. This applies to the GoogleNews menubar interface to Google News as well as any other products or sites that you operate or control.

    2. We demand that you cease and desist using the mark GoogleNews or any other mark or name that incorporates our famous GOOGLE mark or any similar marks.

    3. If you remain interested in providing our award-winning search services to your users, we suggest you visit the variety of programs we offer at http://www.google.com/services/.

    --
    James Tiberius Kirk: "Spock, the women on your planet are logical. No other planet in the galaxy can make that claim."
  32. Inaccurate data!!! by Anonymous Coward · · Score: 2, Insightful

    This map isn't accurate. You're not reporting on the news. You're reporting on what made headlines. There's a big, big difference.

    More people are murdered in Detroit than in Baghdad or the surrounding area.

    More Americans are kidnapped in Mexico in 3 days than in Iraq in a month's time.

    Isn't Mexico supposed to be a friendly country?

    Why does the press ONLY focus on Iraq?

    Clinton sent us into Bosnia. In fact, we're still there, and the only improvement was the arrest of Milosevic. Since then, they've had as many troubles as they had before. Why doesn't the press report this?

    The truth is, the press is HEAVILY biased. They all take their lead from the NY Times, and the NY Times is as biased a newspaper as biased can be.

  33. Re:Nov 3rd? Dec 26? by Random Chaos · · Score: 3, Insightful

    I understand your point, however I think it is partially based on a false premise: In regard to Nov 3rd. The site tracks cities, not states.

    After checking Dec 26, 27, 28, and 29th they do have Indonesia, but it doesn't show up until the 28th (and then under Jakarta only). I would guess this is due to them not having Sumatra or Banda Aceh in their keyword search system.

    I also notice that most cities in the US other than Washington and New York seem to almost never show up - could it be that their "selection of articles" is a bit limited (referring to the above's 2nd paragraph)?

  34. Re:MetaWeb by Infinityis · · Score: 2, Funny

    Well, if you want to break into the journalism world, but you don't watch or read the news, this might help you get the gist of where you can go for news.

    Alternatively, if you want to make a big splash with decent news coverage, don't try to do it near one of the big red dots, because there's already too much going on there.