Slashdot Mirror


Extracting Meaning From Millions of Pages

freakshowsam writes "Technology Review has an article on a software engine, developed by researchers at the University of Washington, that pulls together facts by combing through more than 500 million Web pages. TextRunner extracts information from billions of lines of text by analyzing basic relationships between words. 'The significance of TextRunner is that it is scalable because it is unsupervised,' says Peter Norvig, director of research at Google, which donated the database of Web pages that TextRunner analyzes. The prototype still has a fairly simple interface and is not meant for public search so much as to demonstrate the automated extraction of information from 500 million Web pages, says Oren Etzioni, a University of Washington computer scientist leading the project." Try the query "Who has Microsoft acquired?"
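A rough sketch of how per-answer counts such as "Egyptians (298), aliens (73)" might be tallied, assuming extracted facts are stored as (subject, relation, object) triples. The triple layout, the sample data, and the function names are illustrative assumptions, not TextRunner's actual design.

```python
from collections import Counter

# Hypothetical extracted triples: (subject, relation, object).
# Made-up illustration data, not real TextRunner output.
TRIPLES = [
    ("Egyptians", "built", "the pyramids"),
    ("Egyptians", "built", "the pyramids"),
    ("aliens", "built", "the pyramids"),
    ("smoking", "causes", "cancer"),
]


def answer_who(relation, obj, triples=TRIPLES):
    """Count how often each subject occurs with the given relation and object."""
    counts = Counter(
        subj for subj, rel, o in triples
        if rel == relation and o == obj
    )
    # most_common() orders subjects by descending count.
    return counts.most_common()


def format_answer(relation, obj, triples=TRIPLES):
    """Render the ranked subjects as 'subject (count), ...' plus the relation and object."""
    ranked = answer_who(relation, obj, triples)
    subjects = ", ".join(f"{subj} ({n})" for subj, n in ranked)
    return f"{subjects} {relation} {obj}"


print(format_answer("built", "the pyramids"))
```

Note that a tally like this only measures how often a statement was extracted, not whether it is true.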

12 of 138 comments

  1. Try the query.... by Finallyjoined!!! · · Score: 3, Funny

    "Who has dumped Vista?"

    --
    If I had an Ass, I'd call it Fanny Bottom, then I could slap my Ass; Fanny Bottom, on the Arse.
    1. Re:Try the query.... by maxume · · Score: 3, Funny

      I tried to read your comment, but I did not attempt to understand it.

      --
      Nerd rage is the funniest rage.
  2. Nascent AI? by Drakkenmensch · · Score: 4, Funny
    I've always viewed intelligence as the ability to take unrelated facts and create new and original ideas from their synthesis. This project may very well lead to new ideas to create the first true AI.

    I'll start stockpiling food and armor-piercing rounds for the moment Skynet goes live.

  3. 500 million web pages can't be wrong by Dunbal · · Score: 4, Funny

    Yet strangely, I get a result of:

    TextRunner took 9 seconds.
    Retrieved 0 results for what is the airspeed velocity of an unladen swallow?.

    Meh, call me when this stuff can answer the really USEFUL questions in life.

    --
    Seven puppies were harmed during the making of this post.
    1. Re:500 million web pages can't be wrong by JDHannan · · Score: 3, Funny

      And even worse:

      Retrieved 0 results for what is the answer to life, the universe and everything?.

    2. Re:500 million web pages can't be wrong by sukotto · · Score: 4, Funny

      Obviously it's not indexing http://www.style.org/unladenswallow/

      estimate that the average cruising airspeed velocity of an unladen European Swallow is roughly 11 meters per second, or 24 miles an hour.

      --
      Come play free flash games on Kongregate!
  4. Re:Not entirely helpful by owlnation · · Score: 4, Funny

    I suppose the major problem with this is that it cannot tell the difference between truth and lies or urban legends; it just repeats what other people have said, even if they are conspiracy theorists. The query "Who killed JFK?" suggests the CIA did it.

    So much like Wikipedia then?

  5. what causes cancer? by umundane · · Score: 5, Funny

    I learned that

    > smoking (387) causes cancer.

    I was also surprised to learn that

    > girls and women (11) cause most cases of cervical cancer

    This is a great resource if you need to cite a reference for a Wikipedia article.

  6. TextRunner confirms it: by guruevi · · Score: 4, Funny

    Who is at Area 51
    aliens (3), Carter (2), Colonel Sanders (2), Hi Group (2) is at Area 51

    Who bombed WTC
    Al Qaeda (5), Bush (5), Clinton (2), 4 more... bombed the WTC

    Who built the pyramids (example on site):
    Egyptians (298), aliens (73), Pharaohs (40), 77 more... built the pyramids

    What contains antioxidants (example on site):
    Coffee (17), Recent scientific research (15), food (6), 5 more... contain significant amounts of antioxidants

    -- man, I gotta get me some more recent scientific research.

    --
    Custom electronics and digital signage for your business: www.evcircuits.com
  7. Re:Not entirely helpful by thedonger · · Score: 2, Funny

    it just repeats what other people have said

    I don't see anything new here; most people have done this since the beginning of time.

    Yeah, TextRunner just repeats what other people have said, like most people since the beginning of time.

    --
    Help fight poverty: Punch a poor person.
  8. Re:Exactly by bxbaser · · Score: 2, Funny

    "The query "Who killed JFK?" suggests the CIA did it"

    Hmmm... And now it's not responding because it's "slashdotted".

  9. Retrieved 1 result for does god exist by ebertx · · Score: 2, Funny
    Retrieved 1 result for does god exist. God DOES exist last night (2).

    Well, that answers that question.