Mapping Google News
CousinLarry writes "A neat project called Buzztracker.org has been mining Google News for over a year and keeping track of relationships between geographic locations mentioned in articles.
The results are some really cool maps that actually seem to reflect the "buzz" of the day - check out the Vatican clusters from earlier this month, or the global New Year's chatter. You can also dig down into the articles from which the maps were generated."
SEMANTIC WEB!
/. reader. The question remains, while it's very interesting (and cool), what does one do with the aggregated data?
Thank you Tim (Berners-Lee) Didn't know you were a
"I'd rather be a lightning rod than a seismometer." -Ken Kesey
..no, literally. its made up of old news..
Starsucks
Well when you think about it aren't those the exact places you'd expect to be hotspots?
What a cool site, and it works very quickly and is not overflowing with advertising crap?
Acting stupid isn't much fun when there's someone around who knows better
I should start a website, beertracker.org, to keep track of my daily buzz.
http://perljam.net/notes/interesting-google-satell ite-maps/
-ted
It looks like the code needs a bit more tuning. http://www.buzztracker.org/index.html lists Nelson, NZ, as one of the hot spots. Clicking on that lists a bunch of articles about apartheid. I think the site code misinterpreted a reference to Nelson Mandela in one of the articles.
I remember about a year ago or so, there was a guy who was mining google news to produce an RSS feed. IIRC, google politely demanded that individual stop offering this to people. I can't find the article to cite this, maybe someone can help? At any rate, I wonder how google will feel about this.
1. Map out the world in x and y coordinates.
2. Feed google buzz data into huge neural network.
3. Predict location and magnitude of future events.
4. ???
5. Profit!
Apparently they didn't Google their own name, or else they would've noticed the name was already in use for a fairly popular music composition program.
Bears don't normally eat things that talk and move backwards.
This site has another list, of the sources Google News uses (something Google refuse to publish). Also an interesting use of data mining.
Give a man fire, and you warm him for the night. Set a man on fire, and you warm him for the rest of his life.
Now, take the data and put up some nice animations, archive the first 100 articles or so and put it into some nice database to mine for interesting stuff. Should not be too hard to script together the data gathering, you can already start fetching stuff while developing the functionality and frontend.
;)
Someone wanna join? This cries 'distributed database'...
Who is General Failure and why is he reading my hard disk?
- find where there are lots of new jobs being generated
- view up-and-coming areas by their positive "buzz" (new creative hot spots, architecture, etc...)
- find areas of town with great new restaurants
I think this is where it starts to get exciting (and more useful). Mapping Google news? Meh. Mapping the northwest, and giving that information to Citysearch? You betcha.concrete5: a cms made for marketing, but strong enough for geeks.
The big circle in the US is called "Washington", which is rated at 03%. It obscures "New York" in the GUI. Boston is available, and the only other US buzz is Grand Rapids, apparently on the strength of a local paper's report 2 days ago of a resident killed in Cairo. I find all that hard to believe, or at least to make into any sense. The GUI is unusable, and the mapping of data to "reality" defies sensibility. I think the buzz has gone to their heads, and they should put the pipe down quick.
--
make install -not war
If, they represented this in hierarchical format, the middle east would dominate by picking up points from children Gaza, West Bank and Palestine (not to mention Iraq). Baghdad is probably a good example here. How much actually happens in areas outside of Baghdad proper but gets labled baghdad anyhow.
Who are you? The new #2 Who is #1? You are #617565. I am not a number, I am a free man! Muhahaha.
Why do we need this?
A map that showed where the stories getting the least attention that contained certain keywords - famine, Schiavo, wobbegong, whatever - came from would strike me as more interesting.
We already know where the stories indicated by this map are coming from, because they're taking up ridiculous amounts of space on the front pages of newspapers everywhere.
I've had the thought that it might be cool to implement an anti-news site that would do something like show you links to New York Times stories that have never been referenced by the top page of Google News.
What we have here is one computer algorithm aggregating another computer algorithm's assessment of "newsworthy," with no provision for hindsight or fluff-vs-historical weighting. It's a neat idea, and the graphics are pretty slick, but I don't see any real value here.
I understand your point, however I think it is partially based on a false premise: In reguard to Nov 3rd. The site tracks cities, not states.
After checking Dec 26, 27, 28, and 29th they do have Indonesia, but it doesn't show up until the 28th (and then under Jakarta only). I would guess this is due to them not having Sumatra or Banda Aceh in their keyword search system.
I also notice that most cities in the US other then Washington and New York seem to almost never show up - could it be that their "selection of articles" is a bit limited (refering to the above's 2nd paragraph)?