Slashdot Mirror


Web Log 'Word Bursts' Could Identify New Crazes

Zorgatron writes "New Scientist reports that a researcher from Cornell University has come up with clever method of identifying what's cool by automatically searching weblogs. Sudden increases or "bursts" in the usage of particular words may reflect a new craze, according to Jon Kleinberg. He has demonstrated the technique by searching through state of the union addresses given since 1790." I wonder how long before this can be done real time enough to really make this useful.

16 of 239 comments (clear)

  1. Google? by irc.goatse.cx+troll · · Score: 4, Insightful

    Could this be what Google wants with Blogger?
    They have the capacity to do this, I don't see why they wouldnt.

    --
    Pain lasts, kid. Its how you know you're alive. Sometimes I think this growing up thing is just pain management-TheMaxx
  2. Blogdex by nob · · Score: 4, Informative

    Theres another "what's popular on blogs" webpage at Blogdex. It tracks links, showing which pages are most linked to.

    --
    daed si luap
  3. Nukular weapons by flokemon · · Score: 5, Funny

    In a simple historical test of the technique, Kleinberg analysed all the annual State of the Union addresses given by US Presidents since 1790. He found that particular word "bursts" could indeed be linked to important events at the time the speeches were delivered.

    Has an important increase of the use of the word "nukular" been reported in the last few weeks then?

  4. Great.... by xtermz · · Score: 4, Funny

    ..Now we're going to see Pepsi add's slinging "in soviet russia, you drink pepsi' , and Nike yelling about "all your sports belong to us..."...

    --


    I lost my concept of community when my community lost all concept of me.
  5. "What's cool"? by ites · · Score: 4, Insightful

    By my definition "cool" is that which most people have not yet discovered. Example: that... ah, but I'm not going to tell you. Perhaps this method can tell you what just became cool, but it's hard to track something that is by definition under the radar. Otherwise, just track Google searches. You'll soon see what's popular.

    --
    Sig for sale or rent. One previous user. Inquire within.
    1. Re:"What's cool"? by deanc · · Score: 4, Insightful

      That's what the researchers seem to track. Not the commonality of a phrase, but the "burstiness" of a certain word or phrase... ie, the delta of the word use over time. High delta values indicate something is starting to take off, though it may not yet have become popular or mainstream. That's a decent metric of "coolness."

    2. Re:"What's cool"? by Fishstick · · Score: 4, Insightful

      Ever see Merchants of Cool on Frontline?

      A Report on the Creators & Marketers of Popular Culture for Teenagers

      Yeah, that's right. Popular Culture is manufactured -- everything the teenies think is "cool" or "hot" is identified months in advance by a highly sophisticated machine that probes the minds of kids to predict what will be the next trend so that the marketing establishment can gear up to take advantage of the short window where the "thing" is "cool" and can be sold to teens in such a way that they don't even realize what is going on.

      --

      There is much cruelty in the universe, John.
      Yeah, we seem to have the tour map.

  6. Useful? by Longjmp · · Score: 4, Insightful

    I wonder how long before this can be done real time enough to really make this useful.

    Yes, I bet the spammers can't wait until they can use it...

    --
    There are fewer illiterates than people who can't read.
  7. Daypop by Apreche · · Score: 4, Informative

    http://www.daypop.com

    Its got the top 40 every day. Doing it some other way would only catch memes sooner. And if the system doesn't catch it until its popular, it really doesn't help. What we need is a large and complete database of all meme type things.

    --
    The GeekNights podcast is going strong. Listen!
  8. Let the webloggers determine what's cool? Heh. by rubberpaw · · Score: 4, Insightful

    Of course, since there is only a very specific socioeconomic subset of the world population weblogging, what real usefulness does this give us? Honestly, even if you did ranking based on the most popular weblogs, that wouldn't help you very much.

    Furthermore, this thing isn't telling me anything I don't know. So it finds the word "Vietnam" during the Vietnam years. Hooray. I bet it finds the word Iraq today, or the phrase "Bin Ladin" last year.

    Whoopdie-do. I'm impressed :P. Unless this thing actually can find out the things that people are excited about that aren't well-known, it's pretty much just another search tool limited to blogs.

    1. Re:Let the webloggers determine what's cool? Heh. by barnaclebarnes · · Score: 4, Insightful
      Unless this thing actually can find out the things that people are excited about that aren't well-known, it's pretty much just another search tool limited to blogs.


      Thats the whole point. Weblogs are not the mainstream media so he is betting that a new craze (or refresh of an old one) will show up there beofore the mainstream sites get a hold of. Face it, once it has hit CNN it is already past its sell by date.


      Take the whole potato gun thing for instance. if this was appearing on peoples weblogs 6 months ago and an underground following had started then it would pick this up. Could be a perfect time for one of the toy companies to start producing a parent friendly version (Not sure how...but hey!). By the time the craze hits CNN Toys 'R Us is stocked with a version that fires water ballons, only uses compressed air and comes in 10 different plastic colours. Then they would have the advantage before the other companies jump on the bandwagon.


      Of course, since there is only a very specific socioeconomic subset of the world population weblogging, what real usefulness does this give us?


      A lot! Let me see, I have a large group of people who are rich, computer owning, and probably middle /Upper Middle Class all saying they want X. Now who is your target audience again? Not low income, no disposible cash types.

      /b

      --
      [Please type your sig here.]
  9. Re:Useful for... by Duds · · Score: 4, Funny

    Seriously, just read /. if you want to know the important stuff of the day. :)

    Twice usually.

  10. Re:Google by ccweigle · · Score: 4, Informative
    Google can do much the same thing, on a real-time basis, by examining what phrases are searched for.

    And they do that much already ... on their Zeitgeist page: http://google.com/zeitgeist

    But this is different. The article is about monitoring the blogs, not the searches. As suggested in another comment, this may be related to Google's acquisition of Blogger.

  11. Feedback loop and dotcom crash by skillet-thief · · Score: 4, Interesting
    It is kind of like the stock market craze and the theory that "all the information you need to know about a stock is contained in the market itself" (ie. in the stock's chart). Enough people start believing that theory, and the stocks quit behaving rationally.

    The analysis only works if your tool doesn't start modifying the data you are analyzing. If this thing ever caught on, it would quickly become meaningless, because everybody wants to be part of whatever craze is going on. Every morning you check which words are hip, you put them on your website... etc. etc.

    You are right about feedback: the buzz would become a terrible din. That said, it is a cool idea.

    --

    Congratulations! Now we are the Evil Empire

  12. Stamp consumer on my forehead... by tazochai · · Score: 4, Interesting

    .... one more time why don't you. And I quote,

    "For example, identifying word bursts in the hundreds of thousands of personal diaries now on the web could help advertisers quickly spot an emerging craze."

    Gonfonit!!! Why does cool new social technology have to be related to ways to help people sell things to Americans! Why is it okay for us to be considered a nation of consumers, otherwise basically useless biological skinsacks?!

    I'll just strap my wallet to my chest with duct tape now and write my social security number in huge numbers on the back of my t-shirt for fast credit checks.

  13. Re:Hopefully they don't read slashdot for this by mshiltonj · · Score: 4, Insightful

    they'll think that goatse.cx is now considered cool.

    Which begs the observation: once poeple know the rules that determine what a "word burst" is and when it's happening, then tools will be developed to artificially inflate desired word burts

    Create a few hundred shill accounts across thousands of blogs, then each accounts on each blob will make a couple posts with the pre-determined phrase, and you have a manufactured word burst.

    Like a few years ago, when poeple sold the ability to seed search engines so your site is in the top of the results list based on certain keywords.

    Google makes that harder now, but it's always a contest between those who develop the rules (or algorithm) and those who seek to manipulate the data or the rules of the game.

    A manufactured word burst I can remember from before the 2000 election was 'gravitas'. That word came out of nowhere, and was suddenly all over the media, used to describe a quality that Dubya was lacking. There was a talking points memo somewhere that was very widely distributed -- which is the analog version of what I am describing.

    Look it up.