Post-Googleism At IBM With Piquant

← Back to Stories (view on slashdot.org)

Post-Googleism At IBM With Piquant

Posted by ryuzaki0 on Sunday December 26, 2004 @01:03AM from the find-the-prettiest-girl-in-new-jersey dept.

kamesh writes "James Fallows of the New York Times reports an interesting search technology that IBM is developing. IBM demonstrated a system called Piquant, which analyzed the semantic structure of a passage and therefore exposed 'knowledge' that wasn't explicitly there. After scanning a news article about Canadian politics, the system responded correctly to the question, 'Who is Canada's prime minister?' even though those exact words didn't appear in the article. What do you think?"

9 of 159 comments (clear)

Min score:

Reason:

Sort:

Wow by setagllib · 2004-12-26 01:08 · Score: 4, Insightful

That's pretty impressive. It takes quite a clever AI to read between lines and connect concepts, but I have to wonder how much of its 'understanding' was hard-coded rather than purely abstract. Would it be trivial to just stick in another language database and have it read translations of the article the same way?

Nevertheless it makes me feel like all the programming and design I've ever done is pathetic and I will never amount to anything. That's how it is in the software industry - always someone out there who makes you look bad.

--
Sam ty sig.
1. Re:Wow by EpsCylonB · 2004-12-26 01:13 · Score: 2, Insightful
  
  That's how it is in the software industry - always someone out there who makes you look bad.
  
  Thats how it is in Life.
Prolly a hand-picked question by Ancient_Hacker · 2004-12-26 01:22 · Score: 3, Insightful

One example is meaningless. To get a realistic idea of how useful this system is, we'd like to see what it says if you ask several dozen questions. For all we know this was the one question out of 100 that it answered correctly.
Re:Trust Issue by Anonymous Coward · 2004-12-26 01:30 · Score: 1, Insightful

One way of trusting is based on what google currently does for page relevance. Trust a site based on the number of other sites that link to it. In that way you could get a 'rough' idea of how trustworthy the site is.
As important as this tech is for web-searching by Anonymous Coward · 2004-12-26 01:39 · Score: 1, Insightful

...in the long term it may be even more important for translation between languages -- being able to discern both implicit and explicit meaning in a passage will make accurate translations easier -- and perhaps in combination with Cycorps "Cyc" (or similar project) in the extreme long term to create an artificial intelligence capable of understanding human communication.

There are other interesting possibilities. In the tradition of Esperanto and Lojban, it can also be used to gather the common aspects of natural language and create a universal second language (one much easier in grammar and spelling, more compact in expression, and more complete in meaning). This wouldn't have the cultural baggage of English, which is at present the only thing coming close to a universal second language.
Now... by SharpFang · 2004-12-26 01:52 · Score: 5, Insightful

Feed it the news about Iraq. Then ask it what the war was about.
Good bye, new system, too dangerous for "national security".

--
45 5F E1 04 22 CA 29 C4 93 3F 95 05 2B 79 2A B2
citation analysis by jeif1k · 2004-12-26 02:40 · Score: 3, Insightful

The genius being google's success was paying *less* attention to the content of a page when categorizing it, and relying on links *to* the page instead. Why? Because of spammers.

"Genius" would imply some sort of brand new insight, but citation analysis has had a long tradition before Google appeared on the scene as a search engine. Google's biggest achievement is probably in implementing citation analysis on a very large scale, but they didn't break completely new ground in how people search.

And, in the long run, semantics-based analysis, like IBM's Piquant, is probably going to be the better technology: citation analysis for determining relevance to a query is really just a limited substitute for understanding of the content.
Re:Latent Sematic Indexing by MasonMcD · 2004-12-26 04:10 · Score: 2, Insightful

From the article:

MR. CICCOLO, the search strategist, said that in a way his team was trying to match - and reverse - what Google has achieved. "As Google use became widespread, people began asking why it was so much easier to find material on the external Web than it was on their own computers or in their company's Web sites," he said. "Google sets a very high standard for that Web. We would like to set the next standard, so that people will find it so easy to do things at work that they'll wonder why they can't do them on the Internet."

They seem to be explicity targeting intranets or known good databases, so the spammer issue might be moot.

This raises another issue, however. Will this technology become so useful as to lead to the bad old days of proprietary information dbs a la Lexis/Nexis? I'm assuming the indexing will have to take place on company-owned servers.
Re:Google already has an unfair monopoly by Anonymous Coward · 2004-12-26 11:40 · Score: 1, Insightful

Legislation?!? Your kidding, right? Yup, Usenet is in the public domain, but the value added is they bought a company that kept copies of it. And they continue to maintain those archives. That costs money in hardware, software and support and they are entitled to charge for that if they want to.
Anyone who wants to can maintain their own archive of usenet - and could have from the beginning if they wanted to.
Just like LexisNexis® searches - sure I can find information on the Internet, but there is an awful lot of garbage passing itself off as fact (such as many commentaries on this site) whereas the value added for the commercial services is access to verified, targeted and reliable content, delivered much faster than a search on Google can deliver.
Not everything is free, nor should it be. Could be worse - could be back in the MIddle ages when only the clergy were taught to read and write - you couldn't pontificate on a site like Slashdot then :P