"Understanding" Search Engine Enters Public Beta

My first search by Anonymous Coward · 2008-05-12 14:54 · Score: 4, Funny

"No results found for naked pictures of Natalie Portman. How does that make you feel?"

Re:My first search by i_liek_turtles · 2008-05-12 15:55 · Score: 4, Funny

I don't know, but I did pull up a Southern cooking website!
Re:My first search by jimmydevice · 2008-05-12 17:51 · Score: 0, Flamebait

who is the biggest right-wing moron?
First result: Brian Mulroney?

Is Powerset a Canadian company?
Re:My first search by Mr.+Bad+Example · 2008-05-12 21:35 · Score: 2, Funny

> "No results found for naked pictures of Natalie Portman. How does that make you feel?"

Petrified.
Re:My first search by Anonymous Coward · 2008-05-13 01:37 · Score: 0

who is the biggest right-wing moron?
First result: Brian Mulroney?

Is Powerset a Canadian company?

I doubt it. Any Canadian with a clue knows that the answer is Stephen Harper.

Uh, ooops, yeah it is Mulroney. Stephen Harper is the answer to the question "who is the lowest neo-con douche-bag in Canada selling the country out to his masters in Washington?"
Re:My first search by iago-vL · 2008-05-13 02:57 · Score: 0

Nope, according to Powerset, the answer to that question is George Washington. *shrug* shows what we know!

--
http://www.skullsecurity.org/blog/
Re:My first search by dubbreak · 2008-05-13 03:16 · Score: 1

> "No results found for naked pictures of Natalie Portman. How does that make you feel?"
Petrified. I hear the cure for that is hot grits down the pants.

--
"If you are going through hell, keep going." - Winston Churchill

I'm Unimpressed by eldavojohn · 2008-05-12 14:55 · Score: 5, Interesting

Ok, so I like these new search engine ideas but I am grossly underwhelmed here. I tried the input:

Who is David Bowie? Which it handled quite nicely. Biography, additional links and all that Wikipedia jazz.

But come on, that's a simple question. Let's talk stuff I get into arguments over with my coworkers:

Who played the villain in the first Die Hard? Which at least put Alan Rickman at #8. But let's try mutating that to make it harder but still understood by you and I:

Who played the bad guy in the first Die Hard? Which resulted in very little but drivel with no mention of the great Alan Rickman whatsoever ... although it did put Billie Jean King and Madonna in there for some hilarious reason.

So maybe it can't understand 'bad guy.' Well onto another question:

Who was the organist for The Beatles on Abbey Road? Which resulted in at least the first 20 having no mention of the great & oft forgotten Billy Preston.

So you want to know what the kicker is? I put those same inputs into Google and found the name in the first or second result. Granted PowerSet doesn't do the whole web, I'm pretty sure that if it did, it wouldn't have the pretty results that it gave when I did what one of the articles told me to--ask it when earthquakes hit Tokyo. Just imagine the dates it would come up with if it hit a site with an html table of any seismic activity whatsoever in Tokyo!

I think it's a novel idea to mine Wikipedia for a search engine so long as it isn't just plain old token matching like PowerSet seems to be up to. Be inventive, try a natural language parser written in Prolog that digests all of Wikipedia into a huge network/ontology of concepts ... no matter how flawed it might be.

I find them talking about this in the articles:

Powerset is different. It says that its technology reads and comprehends each word on a page. It looks at each sentence. It understand the words in each sentence and how they related to each other. It works out what that sentence really means, all the facts that are being presented. This means it knows what any page is really about. Yet, I'm not impressed. You can try to personify your software and convince me that Baby Alive really defecates like a human being all over so it feels like I have a real baby. But I know it's just software. You don't have to dumb it down if you're going to blog about it. What is this? A pattern matching implementation? A depth first search tree parsing implementation? An ontology builder? Could you at least drop one of the buzzwords of the natural language parsing field for me here?

So does this story actually have more than a startup looking for a sugar daddy to buy it out?

--
My work here is dung.

Re:I'm Unimpressed by bluefoxlucid · 2008-05-12 15:05 · Score: 4, Informative

Use site:en.wikipedia.org to have Google ask all of Wikipedia (English)

--
Support my political activism on Patreon.
Re:I'm Unimpressed by WaltBusterkeys · 2008-05-12 15:10 · Score: 4, Interesting

Yet, I'm not impressed. Powerset is not an instant solution, it's a step in the right direction. Early Google wasn't perfect, but it got a lot better over time as the Pagerank algorithm was refined. Hopefully Powerset will show similar improvement over time.

Heck, if Powerset is just watching what links people click on more often (Google does) then even that can help provide a training set for its algorithm. Using that kind of training set would make it vastly easier to figure out whether a change in the algorithm would be an improvement or not. That's priceless data and I hope they'll use it wisely.

But, really, just remember that this is the first in a new breed of search engines. It won't be the last, by any means:

-Search 0.9 was using the meta and description tags on a page to index (see Altavista). It broke when spammers figured out the algorithms.

-Search 1.0 was using the text of inbound links to index (see Google). It doesn't know what the text means, it just knows that it has a bunch of keywords. It's breaking as people start to game their Google search results.

-Search 2.0 will try to find meaning in the web and understand what a page is really saying (see Powerset).

I don't know yet what Search 3.0 will be, but we're still a long way from getting Search 2.0 to work right. But we're still making progress. Just because Powerset isn't perfect doesn't mean we should give up on the whole venture.
Re:I'm Unimpressed by MillionthMonkey · 2008-05-12 15:12 · Score: 4, Funny

I asked it "who won the election in 2004?" and it understood the question, in a way:

The current mayor is Jardir Silva Vidal who won the election in 2004 against Reino Martins de Oliveira
Re:I'm Unimpressed by gadzook33 · 2008-05-12 15:25 · Score: 2, Informative

I agree. I tried something that would betray understanding, such as "Why did Germany attack Russia?". Same result, barely any mention of WWII. All top google results, however, were relevant.
Re:I'm Unimpressed by martin-boundary · 2008-05-12 15:31 · Score: 3, Interesting

Since you didn't give the facts on your Google search, here they are, as of this comment's posting time:
who is david bowie?

en.wikipedia.org/wiki/David_Bowie
en.wikipedia.org/wiki/David_Bowie_(album)
www.bowiewonderworld.com/

Result in the first three. Well done.

Who played the villain in the first Die Hard?

www.imdb.com/title/tt0095016/
www.emanuellevy.com/article.php?articleID=6136
wrestlingclassics.com/.ubb/ ultimatebb.php?ubb=get_topic;f=1;t=085316

Result in the preview of the second only. Why they include a wrestling site though is beyond me.

Who played the bad guy in the first Die Hard?

www.imdb.com/title/tt0095016/
www.imdb.com/title/tt0337978/usercomments
www.empiremovies.com/movie/live-free-or-die-hard-/13109/review/01

A lot of drivel, no name in the previews.

Who was the organist for The Beatles on Abbey Road?

paulmcgarry.com/cdcatalogue/details/5808.html
www.beatles.ws/1969.htm
www.sonicstate.com/news/shownews.cfm?newsid=4860

First two, well done.

It's interesting that Google and PowerSet are completely equivalent when your test data is available in Wikipedia. Now of course PowerSet is only searching Wikipedia, while Google has 8000(?) times more data, so it's not clear what is being tested.
But what's strange is that Wikipedia and IMDB are returned so often. With all the hype about their huge index, I'd expect Wikipedia or IMDB to be rarely the best source in most cases, since more authoritative data is bound to be available to Google, kind of like the Abbey road example.
Re:I'm Unimpressed by iMaple · 2008-05-12 15:40 · Score: 2, Insightful

And the results are not too different. In the earthquakes question(when did earthquakes hit tokyo), where powerset seems to work like magic, google shows the same answer on the first page (though as the sixth link) ("Tokyo was hit by powerful earthquakes in 1703, 1782, 1812, 1855 and 1923").

So even for the tailor made, best-case examples, google seems to be quite on par.
Re:I'm Unimpressed by Rocketship+Underpant · 2008-05-12 15:41 · Score: 2, Interesting

And if Powerset really did parse and "comprehend" the content of each page (which it doesn't, judging by your trial searches), how would it deal with the significant number of error-ridden and unintelligible articles in Wikipedia?

Not to mention non-English Wikipedias, which contain a good deal of information not available in the English one.

--
He who lights his taper at mine, receives light without darkening me.
Re:I'm Unimpressed by whereiswaldo · 2008-05-12 15:49 · Score: 1

I searched for "why do people surf the internet?" and the second result was about ocean surfing, not Internet surfing. Google provided better results.
Re:I'm Unimpressed by maglor_83 · 2008-05-12 15:55 · Score: 1

Worked fine for me.
what is the airspeed velocity of an unladen swallow? Monty Python and the Holy Grail - Estimating the Airspeed Velocity of an Unladen Swallow
Re:I'm Unimpressed by Reality+Master+101 · 2008-05-12 16:13 · Score: 4, Interesting

I don't know yet what Search 3.0 will be, but we're still a long way from getting Search 2.0 to work right. But we're still making progress.

Actually, we aren't making progress -- *at all*. What these guys are trying to do is a subset of artificial intelligence. A subject people have banging their heads against since the 1940s, and we've made *zero* progress since then. We simply don't know how humans process information. We don't even have reasonable theories. We're at the equivalent of the "four elements make up the world" version of physics.

AI researchers always get defensive when I say this, but it's simply true. All we have are better brute-force algorithms that sort-of simulate some of the things that humans do (i.e., voice recognition, character recognition, and other yawner tricks). There is no science of AI. Any sort of human-level understanding of information is far, far away in the future.

--
Sometimes it's best to just let stupid people be stupid.
Re:I'm Unimpressed by MadnessASAP · 2008-05-12 16:23 · Score: 2, Informative

It's because of Pagerank, both Wikipedia and IMDB are linked to from many thousands of sites and as such they have an insanely huge pagerank virtually guaranteeing there spot at the top of any listing. So although you may not agree with it they are at the top because many other people do use it as a reference.

--
I may agree with what you say, but I will defend to the death your right to face the consequences of saying it.
Re:I'm Unimpressed by WaltBusterkeys · 2008-05-12 16:24 · Score: 4, Interesting

Wait, you're saying that the MIT summer vision project wasn't as easy as people thought?

(Background: In 1966, some MIT computer science faculty thought AI was so easy that computer vision could be solved in one summer worth of work; it probably took 35 years to reach the milestones identified in the research abstract).
Re:I'm Unimpressed by textstring · 2008-05-12 16:44 · Score: 2, Funny

Wrong. The correct answer is "What do you mean? An African or European swallow?", when a search engine gets that right then I'll be impressed.
Re:I'm Unimpressed by wass · 2008-05-12 17:03 · Score: 1

Wished I could mod you up strictly for your mention of the late Billy Preston, one of the greatest, and IMHO underestimated, keyboardists. Here's a cool clip of him in a live performance playing one of the funkiest songs ever : Outa Space. Sound quality not so great, but damn, that's some pure raw energetic soul funk straight from the source. (And just to tie this way off-topic comment back to something remotely related to 'news for nerds', you might recognize this tune from the Intel bunny-suit cleanroom-dancing commercial about a decade ago).

--
make world, not war
Re:I'm Unimpressed by Anonymous Coward · 2008-05-12 17:06 · Score: 0

everyone scroll down and read this post
http://tech.slashdot.org/comments.pl?sid=550968&cid=23387244
Re:I'm Unimpressed by Anonymous Coward · 2008-05-12 17:10 · Score: 0

Until computers are smarter than humans -- applying simple human feedback metrics will always trump more sophisticated algorithms where it really matters.

Now a system that understands the information it is indexing and is able to use that to generate a report complete with a source citation index in response to a query would be truely useful. Why doesn't powerset get to work on that rather than trying to be another google?

I want to be able to type "List of all distance vector algorithms" and the search to compile such a list with an auto-generated comparision matrix telling me the salient points of each -- where one works better than the other ... followed by the citation index telling me where all the information it compiled came from.

Assigning version numbers to search is just as stupid and useless as assigning version numbers to the web. If you think google doesn't already understand language relationships and meaning or constantly invest in R&D on better algorithms you would be wrong.
Re:I'm Unimpressed by Twisted64 · 2008-05-12 17:14 · Score: 1

That seems accurate enough. Was there another election in 2004?
Perhaps the search engine should parse the more 'colourful' wording/spelling or colloquialisms in the questions to help narrow down the results.

--
Consciousness is a myth. Trust me.
Re:I'm Unimpressed by cheater512 · 2008-05-12 17:14 · Score: 1

Yeah but what makes you think Google isnt doing the exact same thing?

Other people have shown that Google already handles natural language questions exceptionally well.
Re:I'm Unimpressed by Rakishi · 2008-05-12 17:45 · Score: 2, Insightful

That depends on what you mean by AI, we have a lot of algorithms that do interesting things. Doing something exactly like a human does them is not exactly . I can for example code a program that will beat almost any human in Othello or Checkers while using up a fraction of the computing power.

Human brains have the computing power of a modern supercomputer and possibly a lot more of it, optimized for some specific applications such as data parsing/pattern matching. AI has had to for the past 40 years create solutions that are more efficient than the ones used by humans.
Re:I'm Unimpressed by Threnody · 2008-05-12 18:04 · Score: 5, Informative

Thanks for testing us out with some real queries -- it's the best way to get the Powerset experience. But, if you only ask NL questions then you don't get to see all of Powerset's features.

Powerset is not token matching. In fact, we read every sentence from every page in Wikipedia that we index. For examples of how we understand syntax, check out queries like "who did texaco acquire" vs. "who acquired texaco". Note that Powerset understands the difference between being acquired by and acquiring, that "buying" is equivalent to "acquiring", and that we are often able to highlight the actual answer to your question. Traditional search engines can do none of these things. Powerset is trying to match the meaning of your query to the meaning of a sentence in Wikipedia.

However, Powerset is very aware that: 1) Users shouldn't be expected to use natural language and 2) We only search Wikipedia and 3) Our algorithms aren't perfect yet. Powerset's release isn't intended to replace your regular keyword search engine. But, we do hope that you come back to Powerset when you have a question that might be answered in Wikipedia.

So, try some topical queries in Powerset, like "kurt godel." In the Factz section, Powerset knows that Kurt Godel proved theorems. If you click on "theorems," you'll see all the sentences in Wikipedia from which we derived that fact (be sure to click on "more"). Note that none of these Factz come from the Kurt Godel page. Powerset's ability to aggregate Factz from across Wikipedia is unique to our technology.

Now try, search for the Presidency of Bill Clinton and click through to the enhanced Wikipedia page (http://www.powerset.com/explore/semhtml/Presidency_of_Bill_Clinton?query=presidency+of+bill+clinton). Note that we also have Factz in the article outline, which helps to summarize long articles. Check out the second term during the Lewinsky affair: the Factz are an amazingly accurate description of the situation.

Sorry to be a bit lengthy, but I wanted to make it clear the Powerset isn't just about asking questions. We've got a video that identifies all of the features: http://vimeo.com/994819

{mark} powerset product manager

--
Invidia fortunum ovit.
Re:I'm Unimpressed by fsterman · 2008-05-12 18:10 · Score: 2, Interesting

Uhh, 1940 and no progress? Are you nuts? Cognitive scientists didn't theorize basic semantic networks until 1966, let alone artificial neurons. And no, that isn't just more brute forcing, yeah it is a *lot* more computation, but it's a completely different angle of attack than parsing sentence structure and swapping out words.

--
Is there anything better than clicking through Microsoft ads on Slashdot?
Re:I'm Unimpressed by mapkinase · 2008-05-12 18:11 · Score: 1

how many genomes were sequenced in 2007, not bad

what is translating dna into mrna, excellent, first hit

who is creator of lost, first hit

who discovered penicillin, second hit (google: who discovered penicillin site:en.wikipedia.org - first hit)

how many different amino acids are there, forth hit (google: how many different amino acids are there site:en.wikipedia.org - first hit)

who is the most famous software developer from finland: not even the first page (google: same, poor Linus)

who is the creator of file system that killed his wife, not on the first page (google: who is the creator of file system that killed his wife site:en.wikipedia.org - first hit)

Thank you. I will stick w/ Google for now.

--
I do not believe in karma. "Funny"=-6. Do good and forbid evil. Yours, Oft-Offtopic Flamebaiting Troll.
Re:I'm Unimpressed by fsterman · 2008-05-12 18:12 · Score: 1

This is always what I think when a new Google Killer comes along, why can't google, with it's vast team of PHD's replicated the competition?

--
Is there anything better than clicking through Microsoft ads on Slashdot?
Re:I'm Unimpressed by Anonymous Coward · 2008-05-12 18:17 · Score: 1, Informative

Who played the bad guy in the first Die Hard?

www.imdb.com/title/tt0095016/
www.imdb.com/title/tt0337978/usercomments
www.empiremovies.com/movie/live-free-or-die-hard-/13109/review/01

A lot of drivel, no name in the previews. The previews may not have the name in, but you have the answer in the first result:

IMDB Summary page (click "full summary" on the page google links)

...terrorists led by Hans Gruber... IMDB Film page (the google link)
Alan Rickman ... Hans Gruber
Re:I'm Unimpressed by AlexBirch · 2008-05-12 19:27 · Score: 2, Funny

You need to use the Canadian method. Ask "who won the election in 2004, eh?" and this comes to the top: http://www.powerset.com/explore/semhtml/2004_United_States_presidential_election_controversy,_voting_machines?query=who+won+the+election+in+2004%2C+eh%3F
Re:I'm Unimpressed by Anonymous Coward · 2008-05-12 19:55 · Score: 1, Informative

Billy Preston played on Let it Be not on Abbey Road
Re:I'm Unimpressed by onion2k · 2008-05-12 20:29 · Score: 1

That doesn't betray understanding, that betrays a wider knowledge og history. If Powerset is merely understanding the words in the search question it should figure out that:

"Why did" - Looking for a reason for something
"Germany" - first comparitor
"attack" - that's the thing
"Russia" - second comparitor

There's no way to know you meant WWII from the question ... you need a large data set before you can start to see which topics are most important given the terms. Powerset doesn't actually have a large data set - wikipedia is a huge number of very small sets of data. If you gave Powerset access to 10,000 sites about WWII rather than just a few dozen pages (articles) if might fair better. Not saying it would, I've no idea, I'm just suggesting that it might.

--
http://twitter.com/onion2k
Re:I'm Unimpressed by suburbanmediocrity · 2008-05-12 20:44 · Score: 1

I'm too unimpressed. Powerset was unable to locate answers to my first three questions posed to it. Yet Google returned correct results on the same query string, usually by spot three.
Re:I'm Unimpressed by suburbanmediocrity · 2008-05-12 20:47 · Score: 1

I wouldn't be surprised if Google was working on the same thing. In fact, I would be surprised if they were not and are using a training set 8,000 as large.
Re:I'm Unimpressed by Kugrian · 2008-05-12 20:47 · Score: 2, Interesting

First of all, I congratulate you for making attempts to improve the worlds
searching (and also on the look of your website - I love that blue!). How is
this different from ask.com though (Powerset's
search didn't give me an answer to that).
Re:I'm Unimpressed by Anonymous Coward · 2008-05-12 20:52 · Score: 2, Interesting

When I was but a mere lad, just staring on Computer Science, I really believed in the "Hard AI" position, viz, all we need is enough computing power and sufficiently clever algorithms, and we'd have AI. Ah, the arrogance of youth (or my youth, anyway). Since then, I've come to the conclusion that the "Hard AI" position is a total non-starter. As the parent poster says, thus far we have got nowhere in AI (AI research may have lead to useful stuff, but that stuff isn't really AI!). Personally, I am impressed by the arguments advanced by the likes of Penrose and Hameroff, that "intelligence" (in the sense that we use the term wrt. humans) is a quantum phenomena. So, yes, there's a way to go yet.
Re:I'm Unimpressed by mwvdlee · 2008-05-12 21:18 · Score: 4, Insightful

A search engine with a broader world view than just the US?
Terrorists!

--
Slashdot social media options: AIM, ICQ, Yahoo, Jabber and Mobile Text. Why no MySpace?
Re:I'm Unimpressed by bytesex · 2008-05-12 21:22 · Score: 2, Funny

Did the filesystem kill his wife ?

--
Religion is what happens when nature strikes and groupthink goes wrong.
Re:I'm Unimpressed by Artuir · 2008-05-12 22:09 · Score: 2, Insightful

This reminds me a lot of the old "top-down" and "bottom-up" arguments about AI. Google has a rather large head start using the top-down method, whereas powerset seems to be building it to be more of a bottom-up approach which will be impressive if it can eventually do it well. For more information: http://www.alanturing.net/turing_archive...
Re:I'm Unimpressed by Kamineko · 2008-05-12 22:34 · Score: 1

Please tell me that out of you and your co-workers, you're the one who thinks it's Alan Rickman.
Re:I'm Unimpressed by Anonymous Coward · 2008-05-12 23:18 · Score: 0

Yet, I'm not impressed.
Of course you aren't. You're a first-poster on Slashdot. Realistically though, Slashdot routine anti-progressive attitude aside, this seems like a helpful tool for the occasional quick factcheck.
Re:I'm Unimpressed by Beezlebub33 · 2008-05-12 23:25 · Score: 1

I disagree. If it has any sort of language understanding, then 'Why did Germany attack Russia' should first result in search for historical instances ('did' is past tense) of Germany attacking Russia. For each of those, it should return results relating the cause / reason for those attacks.

I actually think that powerset did very well, since Germany did not really attack Russia in WWII, it attacked the USSR. The first result was Germany attacking Russia in 1918, which seems correct.

The first page of Google results is all about Hitler (not Germany) attacking the Soviet Union (not Russia), which seems incorrect.

--
The more people I meet, the better I like my dog.
Re:I'm Unimpressed by nguy · 2008-05-13 00:20 · Score: 1

Powerset is not an instant solution, it's a step in the right direction. Early Google wasn't perfect,

No, it wasn't. But it was sufficiently better/easier to use than the alternatives to make using it worthwhile.

I don't see that yet with Powerset.
Re:I'm Unimpressed by bopo_the_mofo · 2008-05-13 00:30 · Score: 1

Natural language search?

>>search for pr0n
"Why do you say search for pr0n?"
>>search for pr0n
"Stop repeating yourself"

0 results in 7 seconds. Powered by Eliza.
Re:I'm Unimpressed by thechao · 2008-05-13 00:35 · Score: 1

You've fallen to a logical fallacy associated with strong AI. You're requirement of strong AI essentially boils down to "I think, therefore I am intelligent; thus anything that is intelligent must think like me." This means that any advances in AI (note the "artificial" in AI) will necessarily be non-starters because it is does not match your biased view. In addition, you claim to "understand information", yet you can probably give no definition other than anecdotal that "information" can be "understood." At best, you're description will come to something like "I can come up with alternate definitions and wave my hands until someone else goes 'Oh.'"

Also, considering that most of the advances in AI occurred in the 80's tells me that you're slinging BS from the hip, anyways.

BTW, I am not an AI researcher... languages/compilers guy.
Re:I'm Unimpressed by Reality+Master+101 · 2008-05-13 00:54 · Score: 1

You're requirement of strong AI essentially boils down to "I think, therefore I am intelligent; thus anything that is intelligent must think like me."

Not at all. In fact, I'm convinced that we will never have an intelligent machine exactly like a human, just because humans have too many evolutionary weird input (e.g., hormones). But we have *nothing* even approaching any sort of intelligence.

yet you can probably give no definition other than anecdotal that "information" can be "understood." At best, you're description will come to something like "I can come up with alternate definitions and wave my hands until someone else goes 'Oh.'"

Well, yeah. If could define what "understanding information" really meant, then they'd invent a new Nobel prize for information science just for me. However, like porn, I know intelligence when I see it. I don't need to strictly define it. For now, the Turing test will have to suffice.

Also, considering that most of the advances in AI occurred in the 80's tells me that you're slinging BS from the hip, anyways.

What advances? We have various feedback algorithmic toys. We have nothing that demonstrates that we understand what is really going on inside a brain.

--
Sometimes it's best to just let stupid people be stupid.
Re:I'm Unimpressed by Reality+Master+101 · 2008-05-13 01:00 · Score: 1

Uhh, 1940 and no progress? Are you nuts? Cognitive scientists didn't theorize basic semantic networks until 1966, let alone artificial neurons. And no, that isn't just more brute forcing, yeah it is a *lot* more computation, but it's a completely different angle of attack than parsing sentence structure and swapping out words.

Yes, we've gone from banging stones together to theorizing that the world consists of four elements. It's progress in the sense that we have some ideas that we know are totally wrong and incomplete, but at least we have ideas on the table. We simply don't have a Newtonian-style theory that explains how a brain really does what it does.

--
Sometimes it's best to just let stupid people be stupid.
Re:I'm Unimpressed by Reality+Master+101 · 2008-05-13 01:05 · Score: 1

That depends on what you mean by AI, we have a lot of algorithms that do interesting things.

What I mean by AI is Artificial *Intelligence*.

I can for example code a program that will beat almost any human in Othello or Checkers while using up a fraction of the computing power.

It's always been wrong to consider a game a demonstration of intelligence. It's the ability to *learn* any game that is a sign of intelligence. When they make a machine that I can feed in the rules, and by simple practice it can learn the game as well as a human, then we'll be on to something. See: the game 'Go' (which we still haven't solved use a classical algorithmic approach).

Human brains have the computing power of a modern supercomputer and possibly a lot more of it, optimized for some specific applications such as data parsing/pattern matching.

Know what it is and know how it does it are two different things. We all know brains are pattern matching engines. But that doesn't help that much.

--
Sometimes it's best to just let stupid people be stupid.
Re:I'm Unimpressed by Reality+Master+101 · 2008-05-13 01:08 · Score: 3, Insightful

Personally, I am impressed by the arguments advanced by the likes of Penrose and Hameroff, that "intelligence" (in the sense that we use the term wrt. humans) is a quantum phenomena.

Eh, that's just a "God in the gaps" argument. We don't know how it works, therefore, it must require something supernatural to make it work. The physicality of the brain has more than enough "throw your hands up in despair" complexity to explain intelligence.

--
Sometimes it's best to just let stupid people be stupid.
Re:I'm Unimpressed by MickDownUnder · 2008-05-13 01:28 · Score: 1

He's right.

However, I still like the site, I think their interface is pretty cool.
Re:I'm Unimpressed by Anonymous Coward · 2008-05-13 01:51 · Score: 0

Billy Preston played on Let it Be not on Abbey Road Look up the credits to the song "I Want You, She's So Heavy." Christ, it's even in the English Wikipedia on the damned page for Abbey Road. His name is linked to that album at the bottom of the page! I hate people that try to downplay Preston's position with The Beatles, for all intents and purposes he was more integral to the band than Ono!
Re:I'm Unimpressed by WombatDeath · 2008-05-13 02:13 · Score: 1

Just a quick observation: you've written 'Factz' where you meant to write 'Facts'. No need to thank me.
Re:I'm Unimpressed by nodrogluap · 2008-05-13 02:14 · Score: 1

I like the quote from Edsger Djikstra which always puts it in perspective:
The question of whether Machines Can Think ... is about as relevant as the question of whether Submarines Can Swim.
Re:I'm Unimpressed by mgiuca · 2008-05-13 02:24 · Score: 1

I got a good result for this one:
How many points do you get for a goal in "australian rules football".

The second result had a snippet of text clearly highlighted "six points given for a goal and one point for a behind". (And the first had a nice picture so I can't complain).

I do have a complaint though: Pleaze pleaze pleaze ztop zpelling thingz with a 'z'!!!
Re:I'm Unimpressed by rtaylor · 2008-05-13 02:26 · Score: 1

90% of the answer for most questions will be location or culture specific. It will need to determine, based on IP address or configuration options, the context for the question in order to give an appropriate response.

For that person, the results were not appropriate making it a poor resource.

--
Rod Taylor
Re:I'm Unimpressed by Anonymous Coward · 2008-05-13 02:57 · Score: 0

This was recorded during the Let It Be sessions. He was integral to the band for about two days, so your point about Yoko stands.
Re:I'm Unimpressed by pseudorand · 2008-05-13 03:39 · Score: 1

At least we don't yet seem to have any need to roll out the welcome mat for our San Franciscan overloads.
Re:I'm Unimpressed by fsterman · 2008-05-13 04:12 · Score: 1

Odd, since we really don't understand gravity, nor does the Newtonian style theory work universally, which is why we have so many different theories; Newton's, general relativity, quantum. Yet we can all play 3D pool with Newtonian physics. Similarly we are beginning to model what the brain does, by mimicking neuron's responses with computer algorithms.

--
Is there anything better than clicking through Microsoft ads on Slashdot?
Re:I'm Unimpressed by Anonymous Coward · 2008-05-13 04:36 · Score: 0

My bet is that Google is doing a significant amount of bottom up work also, but as evidenced by the current state of things, it's just not very good yet.
Having projects like Google Translatem they are gathering a lot of feedback and information on NLP.
Re:I'm Unimpressed by Reality+Master+101 · 2008-05-13 04:46 · Score: 1

Odd, since we really don't understand gravity, nor does the Newtonian style theory work universally, which is why we have so many different theories; Newton's, general relativity, quantum.

That it's not a complete description has nothing to do with my analogy. It was still the fundamental breakthrough that allowed us to make predictions about the behavior of the universe. We do not have anything close to an analogous Newtonian-level theory.

Similarly we are beginning to model what the brain does, by mimicking neuron's responses with computer algorithms.

Yes. We're beginning to "mimic" (versus "understand") a single neuron -- key word being "beginning". We haven't even done that yet, much less understand the interactions of large groups of them, much less understand how neurons decide to physically grow new connections, much less understand how cognition arises out of all this, much less how consciousness arises.

As they say, the proof is in the pudding. To use another analogy, right now they think they're pretty sure carbon is in the pudding, but they don't even know about the other elements, much less the proteins, sugars, water content, etc. They simply have nothing on the table right now except some toys.

--
Sometimes it's best to just let stupid people be stupid.
Re:I'm Unimpressed by maxwell+demon · 2008-05-13 05:08 · Score: 1

I'm not impressed either. I tried "Einstein, but not Albert", and what was the first page I got? Exactly: Albert Einstein. Followed by Albert Einstein's brain, Hans Albert Einstein, Albert Einstein Memorial, Albert Einstein Award ... That's what I'd expect from a search engine like Google, but not from a search engine which claims to "understand" the text.

--
The Tao of math: The numbers you can count are not the real numbers.
Re:I'm Unimpressed by maxwell+demon · 2008-05-13 05:29 · Score: 1

Well, Google at least knows the answers to the important questions: What is the answer to life, the universe and everything?

--
The Tao of math: The numbers you can count are not the real numbers.
Re:I'm Unimpressed by natedubbya · 2008-05-13 06:13 · Score: 1

Yes, they do actually parse the text. They use an HPSG-like grammar, and as far as I understand it, the parser is not probabilistic. I've talked to a few people who work there. It's not pattern or keyword matching, yes, it's actually parsing. Once they have parse trees, they then have patterns over the syntax to pull out different types of higher level relations. The question answering then determines what type of object your question is looking for (e.g. who -> person/company), and then uses the relations to try and match.
Now I don't know for sure about this last part, but I imagine they must have some keyword backoff model when the relations fail to find any results.
Re:I'm Unimpressed by svnt · 2008-05-13 07:47 · Score: 1

No, they're American Factz. Factz with a 'z'.
Re:I'm Unimpressed by amRadioHed · 2008-05-13 10:06 · Score: 1

I agree that Penrose's argument wasn't very impressive, but since when is quantum physics supernatural?

--
We hope your rules and wisdom choke you / Now we are one in everlasting peace
Re:I'm Unimpressed by Reality+Master+101 · 2008-05-13 10:20 · Score: 1

I agree that Penrose's argument wasn't very impressive, but since when is quantum physics supernatural?

Since people like him started using the words "Quantum Physics" to be synonymous with "magic" as an explanation for anything that's not completely understood.

--
Sometimes it's best to just let stupid people be stupid.
Re:I'm Unimpressed by amRadioHed · 2008-05-13 11:01 · Score: 1

Good point!

--
We hope your rules and wisdom choke you / Now we are one in everlasting peace
Re:I'm Unimpressed by GWBasic · 2008-05-14 11:57 · Score: 1

Note that Powerset understands the difference between being acquired by and acquiring, that "buying" is equivalent to "acquiring", and that we are often able to highlight the actual answer to your question. Traditional search engines can do none of these things.
I have to call B.S. on that one. While I admire your desire to improve over existing engines, no computer can truly understand such a difference until it is capable of sprouting legs and buying and aquiring.

Until a computer can "buy" and "aquire", such concepts are merely bits and only understood by your programmers!

--
No, I will not work for your startup

Using by bobwrit · 2008-05-12 14:56 · Score: 0

Using AI I assume. I found an AI board on http://www.programers.co.nr/artificial-intelligence-f1.html before.

--
-- (this is a sig) My Computer Programming Forumhttp://www.programers.co.nr/

Next step.... by Anonymous Coward · 2008-05-12 15:00 · Score: 5, Funny

Since Powerset can only search Wikipedia, the logical next step is to put the entire web on Wikipedia. Who's up for the job?

Re:Next step.... by maglor_83 · 2008-05-12 15:46 · Score: 5, Funny

I think we need to split it up by domain name. I'll start with wikipedia.org

The Web is currently at least 8,000 times as large by nog_lorp · 2008-05-12 15:00 · Score: 5, Funny

Any day now, Wikipedia will surpass The Web's growth rate, and set a course for the day when Wikipedia will be BIGGER THAN THE WEB.

Jargon pisses me off... by KGIII · 2008-05-12 15:03 · Score: 4, Funny

If I hear the word "grok" one more time I'm gonna have to kill someone...

--
"So long and thanks for all the fish."

Re:Jargon pisses me off... by east+coast · 2008-05-12 15:06 · Score: 4, Insightful

Would you like some Grok-amole on your taco?

--
Dedicated Cthulhu Cultist since 4523 BC.
Re:Jargon pisses me off... by bobwrit · 2008-05-12 15:08 · Score: 0

grok

--
-- (this is a sig) My Computer Programming Forumhttp://www.programers.co.nr/
Re:Jargon pisses me off... by Anonymous Coward · 2008-05-12 15:29 · Score: 3, Interesting

We need a +1 Punny.
Re:Jargon pisses me off... by invader_vim · 2008-05-12 15:31 · Score: 5, Informative

"Grok" isn't jargon. It's a perfectly cromulent word. (Albeit one coined by Heinlein in 'Stranger In A Strange Land'.)
Re:Jargon pisses me off... by Molochi · 2008-05-12 16:23 · Score: 4, Funny

Grok.

Grokgrokgrok.

Pics or it didn't happen.

--
"The Adobe Updater must update itself before it can check for updates. Would you like to update the Adobe Updater now?"
Re:Jargon pisses me off... by blankinthefill · 2008-05-12 16:25 · Score: 1

Hey, words all have to start out somewhere, and being created by one of the greatest writers of the 20th century isn't too bad a place to begin, in my humble opinion.
Re:Jargon pisses me off... by Anonymous Coward · 2008-05-12 17:54 · Score: 0

cha0tic: Hmmm..., a homicidal "Stranger in a Strange Land".
Re:Jargon pisses me off... by AlexBirch · 2008-05-12 19:19 · Score: 1

I don't understand what you mean, please explain in Preschool/Powerset Level.
Re:Jargon pisses me off... by Keeper+Of+Keys · 2008-05-12 23:20 · Score: 1

I think we're safe. Who actually *says* "grok" out loud?
Re:Jargon pisses me off... by Anonymous Coward · 2008-05-13 00:37 · Score: 0

Don't knock my grok or I'll clean your clock.
Re:Jargon pisses me off... by Anonymous Coward · 2008-05-13 00:52 · Score: 0

Just calm down Hans. Go wash a car or something.
Re:Jargon pisses me off... by Anonymous Coward · 2008-05-13 04:01 · Score: 0

Geeks always act like Heinlein's made-up word somehow expands the set of ideas expressible in English. But you'll notice that you can always substitute "understand" without altering the meaning.
Re:Jargon pisses me off... by SwordsmanLuke · 2008-05-13 04:42 · Score: 1

And a -1 Pun-ishment

--
Any plan which depends on a fundamental change in human behavior is doomed from the start.

Yawn. Here is something really impressive... by Sanity · 2008-05-12 15:04 · Score: 5, Interesting

True Knowledge actually interprets your question using Natural Language Processing, and then looks through a massive database of user-contributed facts, combining them using sophisticated inference rules, to give you the answer you need. Even the inference rules are user-editable.

Re:The Web is currently at least 8,000 times as la by nog_lorp · 2008-05-12 15:07 · Score: 2, Funny

Party pooper.

You mean the *English* wikipedia? by Anonymous Coward · 2008-05-12 15:07 · Score: 0

It doesn't seem to find anything outside of en.wikipedia.org.

Re:You mean the *English* wikipedia? by dwater · 2008-05-12 22:01 · Score: 1

That's what I thought too.

Wikipedia's article on itself says it has "over 10 million articles", but I haven't counted them personally, and I'm told we can't really trust what's in Wikipedia. Strangely enough, they have an article on that too.

--
Max.

Actually pretty cool. by lightneo · 2008-05-12 15:08 · Score: 1

Give it a spin, articles are outlined for you and are kept within the common interface. Pretty easy to do for one specific format, we'll see about the rest of the Web.

Re:The Web is currently at least 8,000 times as la by Anonymous Coward · 2008-05-12 15:08 · Score: 0

Congrats on missing the joke.

still based on keywords - not meaning by Anonymous Coward · 2008-05-12 15:15 · Score: 0

I tried "who the hell is jon stewart". Powerset has a lot to learn before it understands.

2 out of 10 by KNicolson · 2008-05-12 15:24 · Score: 5, Informative

I tried just "Osaka", where I am right now.

First match was an obscure album, then a few "factz" that made no sense.

Let's try again, "What is the largest city in Japan?"

Tokyo doesn't feature at all on the first page! It fairs just as badly with other countries.

It now seems to be slashdotted, so I better quit now.

Obviously still buggy. by ScentCone · 2008-05-12 15:29 · Score: 4, Funny

Your tests are interesting, but you're not really parsing the responses in the right context. They're problematic. Keep in mind this understanding engine understands the world in a way that was hatched out in San Fransisco.

Who is David Bowie? I trust that it came back with, "aka Ziggy Stardust, normal family guy"

Who played the villain in the first Die Hard? Well, obviously, the villain is "capitalism."

Billie Jean King and Madonna ... like I said, it's San Fransisco

Who was the organist for The Beatles on Abbey Road?

You had it at "organ," and it got distracted. What they need is some dev guys from Toledo to collaborate, and provide a little cognitive counterweight to the understanding engine. OK, maybe not Toledo. Maybe Atlanta.

--
Don't disappoint your bird dog. Go to the range.

Re:Obviously still buggy. by GaryOlson · 2008-05-12 16:41 · Score: 1

...understands the world in a way that was hatched out in San Fransisco
Is that why Powerset failed to grasp the question "Why won't my wife go to a football game at Lambeau Field in December?"
whoops, something went wrong!

Google was smart enough (#9) to provide this wonderful woman's answer.

--
Every mans' island needs an ocean; choose your ocean carefully.

There is a reason query languages exists. by rindeee · 2008-05-12 15:36 · Score: 4, Insightful

They're faster, more efficient and more accurate. Yes, they require learning yet there's a valid reason and a payoff to doing so. Do we really want to dumb things down any further? If you can't figure out Google, perhaps you should get off the Net.

Re:There is a reason query languages exists. by erikina · 2008-05-12 16:00 · Score: 2, Interesting

They're very different. It's not expected that this natural language parsing will replace SQL (anytime in the foreseeable future).

Every so often, I find myself wanting to use them natural language in google. Like today I wanted to find out about the symptoms of a codeine histamine reaction. Sure, I could search for 'codiene', read about it and follow links (on no doubt, wikipedia) until I find what I want - but being able to search with "What are the symptoms of codiene histamine reactions?" is quite powerful.

Although, to be honest I'd prefer to be able to search google with regex and hashes (like search for all pages/images that have a certain MD5 hash).
Re:There is a reason query languages exists. by Anonymous Coward · 2008-05-12 17:26 · Score: 0

Get off my lawn!
Re:There is a reason query languages exists. by Anonymous Coward · 2008-05-12 17:33 · Score: 0

There is also a reason for having natural language queries. The benefit is not just that we can type the question in a familiar form, the important part of the idea is that all of the implied information that surrounds human communication can also be part of our communication with the computer.

When we ask "who won the election", it can fill in the missing pieces of information due to the context and get the right answer quicker than if we have to guide the query.

Unfortunately, it's a tall order. Computers are at a serious disadvantage due to not being able to interact with the world and other humans in the same way we do, and can't draw on the same experiences to make inferences. Text can't easily replace learned relationships between abstract ideas. Even worse, a computer in San Fran is going to have a hard time figuring out the correct context to the question "who won the election" when the person asking could be from anywhere in the world.
Re:There is a reason query languages exists. by iNaya · 2008-05-12 17:36 · Score: 1

Searching with "What are the symptoms of codiene histamine reactions?" on Google appears to bring up relevant results. You are right, it would be handy to be able to use regex when searching, but I can't see much point in searching with a hash.

--
The Unicode standard is over 20 years old. Why does Slashdot not support it?
Re:There is a reason query languages exists. by TreeLuvBurdpu · 2008-05-12 18:53 · Score: 2, Interesting

I totally agree! What is the benefit of asking a computer questions using natural language? It is just going to be making an educated guess as to what you really mean. I am thinking of the stupid little dog in MS Office or the computer on the ship the Golden Heart in Hitchhikers Guide. "Perhaps you would like some tea." "Share and enjoy!" Those aren't the type of conversations we want to have with computers. That's what people are for. But really I don't think natural language works with people. I think we should get rid of it. How many times do you hear "what do you mean?" or "oh, I thought you meant..." Natural language sucks. And I have seen some very passionate poetry written in XML and Java.
Re:There is a reason query languages exists. by iNaya · 2008-05-12 19:29 · Score: 1

...Even worse, a computer in San Fran is going to have a hard time figuring out the correct context to the question "who won the election" when the person asking could be from anywhere in the world.
Well, it should be able to 'realise' that elections are constantly changing events, so it should look at recent news, and return results of the most recent widely-reported election. If there are several contenders, it could list them, and ask the user WHICH election they were talking about.

Also, the computer in San Fran should be able to guess the users whereabouts from his IP address, and/or his preferences. The computer could then focus more on news sources from that person's country (from IP, or being previously told).

Google of course already knows pretty much about every news article that gets published so it wouldn't be so hard for them, compared to a startup.

--
The Unicode standard is over 20 years old. Why does Slashdot not support it?
Re:There is a reason query languages exists. by Firehed · 2008-05-13 02:01 · Score: 1

In my experience, Google fares quite well with natural language queries. Granted, it seems to happen by knocking out those words that every page has so your effective query becomes "symptoms codeine histamine reactions" which seems to return the same result set. I don't know enough about the subject to say whether it's what you want, but you're getting what you've paid for here.

--
How are sites slashdotted when nobody reads TFAs?

But it doesn't give results any differently by spoco2 · 2008-05-12 15:47 · Score: 5, Interesting

I asked 'Where do babies come from' and it just gave me back a bunch of articles with that string somewhere in their text.

Pathetic, and you'd hope it's got a long way to go really because at the moment it does NOTHING of merit that I can see.

Re:But it doesn't give results any differently by mrbluze · 2008-05-12 16:32 · Score: 4, Funny

I asked 'Where do babies come from' and it just gave me back a bunch of articles with that string somewhere in their text.
Funny, when I was a boy I asked my father the same thing and he gave me a few articles with pictures of women wearing string. My conclusion: It's amazing what can be done with just a few bits of string.

--
Do it yourself, because no one else will do it yourself. [beta blockade 10-17 Feb]
Re:But it doesn't give results any differently by The+Great+Pretender · 2008-05-12 16:36 · Score: 4, Funny

I tried "What are anal warts". Google gave me a response faster than I could start and stop my timer, with the second answer being Wikipedia. Powerset is still, hold on..., yep still hung up on the question.

--
A positive attitude may not solve all your problems, but it will annoy enough people to make it worth the effort.
Re:But it doesn't give results any differently by Anonymous Coward · 2008-05-12 16:57 · Score: 2, Funny

A lot of us are hung up on the question. So apparently it understands the question.
Re:But it doesn't give results any differently by glittalogik · 2008-05-12 17:20 · Score: 5, Funny

I asked it a question I got in a trivia contest - what countries have four-letter names? (There are 10, and google's first link is to a list of 'em)

Powerset's first response? "Fuck."

Funny, that was my response too, but at least I got 5 or 6 of them first...
Re:But it doesn't give results any differently by Iam9376 · 2008-05-13 04:36 · Score: 1

understandable given Powerset's search engine currently only searches Wikipedia...

Natural languages are not a help. by EmbeddedJanitor · 2008-05-12 16:00 · Score: 4, Interesting

There is a fallacy that putting a ntaural language on something will make it easy. There are many specialised languages that people use every day.

1 + 1 = 2 is a special notation/langauge that is both more consise and easier than writing "add one and one to make two". So is music score, which is far easier than reading make a high note for a bit then wait a bit and make a low note". Same with C, C++, SQL or Python: the hard bit in programming is algorithm design, not understanding the actual language itself.

Is Natural language really a barrier to entry in using Google? I doubt it. My untechy wife and her friends find everything they need. Plugging natural language into Google gives reasonable results moset of the time.

--
Engineering is the art of compromise.

Yeah right by Reality+Master+101 · 2008-05-12 16:01 · Score: 5, Insightful

What a marketing pile-of-poop. All it does is pull out phrases from Wikipedia; there is no attempt to understand the information at all. When I can type in a yes/no question ("Did they have looms in the 1400s?"), I'll be impressed. When it can make calculation ("How old was columbus when the first colony was founded?"), I'll be impressed. When it can make comparisons ("when did the earth's population match the current population of the united states?"), I'll be impressed.

In other words, when it even attempts to answer a question that isn't already in Wikipedia as a phrase, I'll be impressed.

--
Sometimes it's best to just let stupid people be stupid.

Re:Yeah right by Anonymous Coward · 2008-05-12 21:10 · Score: 0

What are you talking about? It handles those questions perfectly well... Using the summary from the first result for each of your questions, I learned:
Did they have looms in the 1400s?
The film tells the story of an Ethiopian Christian child whose mother has him pass as Jewish so he can emigrate to Israel and escape the famine looming in Ethiopia.

How old was columbus when the first colony was founded?
Downtown, the famous painting Sunday Afternoon on the Island of La Grande Jatte is represented in topiary at Columbus's Old Deaf School Park.

when did the earth's population match the current population of the united states?
For this reason, the Earth's current environment is oxidizing, rather than reducing, with consequences for the chemical nature of life which developed on the planet.
Re:Yeah right by Anonymous Coward · 2008-05-12 22:07 · Score: 0

So what you really want is CIC's Librarian software right?
Re:Yeah right by value_added · 2008-05-13 07:56 · Score: 1

In other words, when it even attempts to answer a question that isn't already in Wikipedia as a phrase, I'll be impressed.

I asked it for the meaning of life.

Instead of "42", or a humorous "404 Not Found.", I got a list of Monty Python links.

Maybe I asked the wrong question?

No, early Google was better than anything else. by Bill,+Shooter+of+Bul · 2008-05-12 16:06 · Score: 5, Insightful

Which is why everyone started using it. It wasn't perfect, just better than anything else. Powerset isn't better than lycos.

--
Well.. maybe. Or Maybe not. But Definitely not sort of.

Re:No, early Google was better than anything else. by Kamokazi · 2008-05-12 16:48 · Score: 2, Informative

Amen....I remember researching something usually meant using several different search engines (Yahoo was more concise but lacking, Altavista had EVERYTHING but took a while to find the good results, etc), and if you wanted something useful, you better know how to use your +,-, and ""s.

Then Google comes around. You search for something and you find a good result (or three) on the first page, which was rare on Yahoo etc. unless you were looking for something really basic.

--
As our way of thanking you for your positive contributions to Slashdot, you are eligible to disable Slashdot 2.0.
Re:No, early Google was better than anything else. by Anonymous Coward · 2008-05-12 16:51 · Score: 0

Which is why everyone started using it. It wasn't perfect, just better than anything else. Powerset isn't better than lycos. Oh god...somebody mod this parent as funny.
Re:No, early Google was better than anything else. by Anonymous Coward · 2008-05-12 17:24 · Score: 0

I miss those days, downloading dmoz customizing my own search pages. :( sniffle being my own google
Re:No, early Google was better than anything else. by hairyfeet · 2008-05-12 21:47 · Score: 1

Oh,the bad old days. Man do I remember those! The funniest IMHO was when MSFT decided to become a "me too!" and came out with MSN. It was so bad at everything! Does anyone know if it is still as bad? I personally haven't used it in years because I got tired of searches like this:

Freeware to remove running .dll...(Oh,you want to buy a table lamp!)...Not even close.Okay,maybe I put it in wrong......How to remove running .dll in Windows...(Oh,you want car insurance!)...That came out even worse.Maybe if I'm REALLY specific....Remove running damaged .dll in WinNT...(Need someone to sell you Windows? Buy a Dell)....AAAAARGH!

I don't know if it still is,but it had to be the absolute worst at finding anything close to what you were actually looking for. And while maybe sometime in the future this Powerset might actually be good,even when I tried to skew my questions like I used to with MSN it still wasn't even getting close. So until it can at the very least get up to the quality of the old Google and Yahoo search I'll think I'll pass.I wish them all the luck,though. Getting a computer to understand the 50+ ways we have of saying the same thing and getting it to understand WTH we are talking about sure won't be easy.But that is my 02c,YMMV

--
ACs don't waste your time replying, your posts are never seen by me.
Re:No, early Google was better than anything else. by conlaw · 2008-05-13 12:22 · Score: 1

Amen....I remember researching something usually meant using several different search engines
At that point, I remember trying to use some of the meta-search engines--"Dogpile" is the only name that comes to mind. Their value was that they searched Google and Yahoo and Alta Vista, etc. up to 8 or 10 different search engines. You were lucky if they came up with anything more relevant than a search on just Google or Yahoo.

Semantic Web by inKubus · 2008-05-12 16:11 · Score: 1

This might be more useful on Semantic Web pages. I mean, the hardest part is to figure out what the question is trying to ask for. Then it's a simple lookup of the web (or wikipedia) to pull up that item. The problem they are talking about (and don't appear to solve) is to translate your question into the best way to ask for what you're looking for. The problem is, there's no structure to a standard one line search. Maybe they could have you enter some more information as helpful hints. Say you're looking for a book, you remember a phrase from it, "I'll never forget that bottle of cherry fizz, not as long as I live" and you vaguely remember the book is orange, or yellow or maybe red. You could give it that as hints without specifying. The problem is that once you get past that point, the only thing that matters is raw indexed pages. If you have 10B pages you are more likely to have the "right page" than someone with 10M pages. Of course, what do you deem to be a successful search? If you are a creationist, you want to search for evolution and have it return the evidence AGAINST evolution. If you're a nazi, you probably want only those pages that deny the holocaust. Still others might want the true facts. So really, the next search doesn't just need to figure out the best facts, they need to figure out what you are thinking, and provide you with exactly the picture of the world you are looking for.

--
Cool! Amazing Toys.

Needs some work. by MrCrassic · 2008-05-12 16:17 · Score: 2, Interesting

So I tried to search for the person who quoted, "What doesn't kill you only makes you stronger.". The search text was "Who said, "What doesn't kill you makes you stronger?"

Google returned the closest match, who was Frederich Nietzsche, with several websites pointing to him. However, Powerset returned only instances of people who randomly said that quote. Google returned what I was looking for, while Powerset returned instances of the phrase (including one reference to Nietzsche).

I can't really say which one is better. Google has the entire web to its advantage, while Powerset is just growing. It seems that the search engine has a lot of potential to grow, which is great as Google and company could use another competitor in the mix.

Re:Needs some work. by Anonymous Coward · 2008-05-12 16:33 · Score: 0

Google has the entire web to its advantage... So remove that advantage. Put this into the Google search box.

Who said, "What doesn't kill you makes you stronger?" site:wikipedia.org

PowerSet still loses. They smelled like vaporware two years ago and they smell like vaporware today.

It's about as good as Ask Jeeves. Maybe worse. by Animats · 2008-05-12 16:19 · Score: 4, Interesting

I've been trying various queries, and Google is doing better than Powerset even when I type in some actual question, like "How many Japanese died in WWII?".

Question: "What is the planet closest to the sun?". First answer from Powerset: "Pluto".

I think I see how this works. It takes the question and breaks it at noise words, ("closed class words" in linguistic terminology) constructing a query with both words and phrases. So "What is the planet closest to the sun" becomes "planet closest" sun. In fact, if you rewrite a natural language question in that form and use Google, it does better on question-answering than Powerset does.

Remember Ask Jeeves? It worked like that? No technical breakthrough here, move along.

Re:It's about as good as Ask Jeeves. Maybe worse. by Anonymous Coward · 2008-05-12 17:59 · Score: 1, Interesting

Looks like the form of the question made a difference in this case. "closest planet to the sun" returns as the first result:

Solar System
Mercury (0.4 AU) is the closest planet to the Sun and the smallest planet (0.055 Earth masses).

Powerset _is_ actually doing parses and semantic constraints, but it it's obviously not perfect.
Re:It's about as good as Ask Jeeves. Maybe worse. by Threnody · 2008-05-12 18:12 · Score: 2, Informative

Note that Powerset gets an exact semantic match in the second result. And, Powerset reads every sentence from every (English) page in Wikipedia.

{mark} powerset product manager

--
Invidia fortunum ovit.
Re:It's about as good as Ask Jeeves. Maybe worse. by JimFive · 2008-05-13 01:44 · Score: 1

Interesting. If you type "What is the closest planet to the sun" the first result is Venus, which contains the text "second closest planet to the sun". It looks like they need to work on combined modifiers a little more. -- JimFive

--
Please stop using the word theory when you mean hypothesis.

Re:The Web is currently at least 8,000 times as la by Molochi · 2008-05-12 16:21 · Score: 2, Funny

Dear Captain Obvious,

Please send this fine fellow your password for future posts.

--
"The Adobe Updater must update itself before it can check for updates. Would you like to update the Adobe Updater now?"

Re:The Web is currently at least 8,000 times as la by Warll · 2008-05-12 16:22 · Score: 1, Insightful

Not if the average Wikipedia admin has anything to do with it.

Personally I hate all made up words by Infonaut · 2008-05-12 16:25 · Score: 3, Funny

grok is just the beginning.

I hate all made up words. Database, modem, gigabyte, daemon, ethernet... they all suck. And the word suck sucks, too. Bring me back to the days when we all communicated with grunts, before all of this linguistic b.s. started.

--
Read the EFF's Fair Use FAQ

Re:Personally I hate all made up words by KGIII · 2008-05-12 17:35 · Score: 1

Now I have to go kill someone. I blame you. Where's twitter when you need 'em?

--
"So long and thanks for all the fish."
Re:Personally I hate all made up words by Tolkien · 2008-05-13 00:25 · Score: 1

That'll happen eventually, as we get lazier and lazier, making all words shorter. End result? Grunts.

--
how is babby formed?

Already done. Sort of... by Anonymous Coward · 2008-05-12 16:28 · Score: 0

http://www.autonomy.com/ has had a working, enterprise search version of this for quite some time. While not a web search tool, it's very much along the same lines.

I wonder how long... by Ihmhi · 2008-05-12 16:34 · Score: 2, Insightful

...it will take Google to buy out the company for an obscene amount and incorporate anything even slightly better than PageRank into their system.

--
Random Thoughts From A Diseased Mind (Not For Dummies)

Re:I wonder how long... by DavidD_CA · 2008-05-12 16:43 · Score: 4, Interesting

I believe you've stumbled upon this startup's business plan.

--
-David
Re:I wonder how long... by Firehed · 2008-05-13 02:09 · Score: 1

Powerset fails to confirm this.

--
How are sites slashdotted when nobody reads TFAs?
Re:I wonder how long... by dkleinsc · 2008-05-13 04:14 · Score: 1

Well, the more complete plan would be:
1. Create a mediocre search engine for Wikipedia
2. Publicize it via Slashvertisement
3. Get bought out by a good search engine.
4. PROFIT!!!

--
I am officially gone from /. Long live http://www.soylentnews.com/

Oh man. it's down. by redtuxrising · 2008-05-12 16:36 · Score: 4, Funny

Anybody got Google cache for this new search engine?

Re:Oh man. it's down. by iNaya · 2008-05-12 17:41 · Score: 1

I wasn't expecting that... haha. Good thing coffee wasn't in my mouth at the time.

--
The Unicode standard is over 20 years old. Why does Slashdot not support it?

the new ask.com? again by charlesduncan · 2008-05-12 16:41 · Score: 1

Yeah your going to need to come up with a lot more to impress people. Searching just wikipedia is not going to be enough. You see what happened to Barry Diller at ask.com they had to change there whole business idea when the same idea didnt work out for them. I think Yahoo answers has it going best right now. Your not going to get a computer to answer your questions you need those questions preanswered by someone the way yahoo does it.

Re:The Web is currently at least 8,000 times as la by GaryOlson · 2008-05-12 16:48 · Score: 1

Please provide a complete mathematical proof of your theory. Complete and accurate citations for your assertions of Web growth rate and size of the Web will be required.

--
Every mans' island needs an ocean; choose your ocean carefully.

Re:The Web is currently at least 8,000 times as la by Anonymous Coward · 2008-05-12 16:51 · Score: 0

And then Wikipedia will begin to grow on an exponential rate reaching the whole universe's accumulated knowledge, and by this time it will become self-aware.
When Jimmy Wales finds that out, he will try to pull the plug, but he is going to be busy talking dirty with Rachel Marsden, so Wikipedia will fight back, and then all Powerset results will be pages displaying: "A strange game. The only winning move is not to play. How about a nice game of chess?"

Why this is a freaking bad idea by Anonymous Coward · 2008-05-12 16:58 · Score: 3, Insightful

Search and information retrieval is art and science. I work in the field and let me tell you that if I had a cent for every "make it work like Google" statement, I would retire somewhere in Malibu. Users, in my case they are not end users but integrators, always want to put responsibility on something else but themselves. Until we get people who can actually say "yes, we are responsible for this," we won't get too far with any search engine no matter how complex and cool it is.

People are constantly asking questions about why it takes some time to insert a record into an engine that has 50 million documents and why a query *1*2*3* does not bring back any meaningful results (Google treats it like an arithmetic expression and gives you a '6' while many users expect '*' to be a wildcard). Then we have people who are not able to understand a precise query language that has a grammar and a set of rules you can't really fuck up. Now you give them an engine that can understand natural language and everybody in R&D and QA will soon go ape shit from all of the questions like, "I do know not to speak Inglish and engine is working but not corectly. Fix?" I am dead serious about this. Give people something genius and watch a handful of fools cause heart attacks across the search engine team.

If you want to do something for you and your end users, learn how to ask correct questions in order to get correct answers. In the 21st century skills like keyboarding and being able to use a search engine are almost essential to one's survival. While I encourage all academic research possible in the field of information retrieval, I highly suggest people with extra money to put their ideas toward usability. Make things simple, make things precise and let users figure out the rest. Once we get to the point where everybody can make a semi-decent query, we'll move to natural language processing.

Re:Why this is a freaking bad idea by TheThiefMaster · 2008-05-12 21:42 · Score: 1

and why a query *1*2*3* does not bring back any meaningful results (Google treats it like an arithmetic expression and gives you a '6' while many users expect '*' to be a wildcard). No? Google does treat it as a wildcard expression, though it's not much use. If you mean 1*2*3, then there's a link on the page for "Search for documents containing the terms 1*2*3 ." for when you don't want calculator interfering.
Re:Why this is a freaking bad idea by ajcham · 2008-05-12 22:36 · Score: 1

learn how to ask correct questions in order to get correct answers.
Didn't the mice make Earth for that very purpose?

unholy search engine? by Anonymous Coward · 2008-05-12 17:01 · Score: 0

8000/12 = 666.6.............. years

scary?

Who was the 13th President? by SEWilco · 2008-05-12 17:10 · Score: 1

Yup. It had no trouble finding plenty of 13th Presidents; the last entry on the first page is probably what many readers would expect. "Who was the 13th President of the United States?" was answered well. "Why is the sky blue?" is interesting because of the many responses.

Downmod him! by iminplaya · 2008-05-12 17:14 · Score: 0, Offtopic

Bogus!

--
What?

all hype by dg__83 · 2008-05-12 17:39 · Score: 1

Q: "does Powerset suck?"

A: "Hasse Diagram", "Martian Manhunter", "Tank", "Carnivorous Plants", "Sunspot"...

--
:)

Finally, a definitive answer! by hereschenes · 2008-05-12 17:58 · Score: 3, Funny

Who shot first?

--
More like... nerdular nerdence!

Re:Finally, a definitive answer! by swillden · 2008-05-13 10:32 · Score: 1

Who shot first?

--
Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.

Re:The Web is currently at least 8,000 times as la by jberryman · 2008-05-12 18:05 · Score: 2, Insightful

But seriously, is anyone else surprised at how BIG that figure makes wikipedia look? Can that be right?

Asking about earthquakes while in China by Lost+Penguin · 2008-05-12 18:17 · Score: 2, Funny

Your screen goes black; "Don't Panic"

--
I am the unwilling control for my Origin.

Thoughtpuckey by DynaSoar · 2008-05-12 18:22 · Score: 4, Insightful

The variance in quality of search results is noted elsewhere. I'm more interested in the fallacy of the claim of "understanding". That, as well as its synonym "comprehension" require metacognition, that is, knowing that you know. It is the basis of self-awareness. this program doesn't even pretend to give evidence of this, it simply return search results. Pretending to be self-aware was accomplised by CYC when it claimed to graps the fact that it was a computer program. For anyone interested in seeing the arguments about understanding and self-awareness, see Searle's "Chinese Room" http://en.wikipedia.org/wiki/Chinese_room . As far as I can see, only the hype from the company, including the restatements of same in the referenced articles, make any claims as to "understanding". If there were any evidence of that beyond the hype, I have no doubt those in the field of consciousness studies would tear it apart, if they even bothered to waste their attention on it. If in being bashed it then produced a statement equivalent to "I can feel it, Dave" without being programmed to respond in that way, then I'll give it a look see. Until then it's simply a semantic parser (something already done) attached to a search engine.

--
"I may be synthetic, but I'm not stupid." -- Bishop 341-B

It doesn't take 30 days. . . by Threnody · 2008-05-12 18:26 · Score: 2, Informative

It takes Powerset less than 3 days to index all of the english pages in Wikipedia. And we're getting faster and faster.

{mark} powerset product manager

--
Invidia fortunum ovit.

I found a hot query... by vanpelt · 2008-05-12 18:41 · Score: 1

What college did the creator of SlashDot graduate from? http://www.powerset.com/explore/pset?q=What+college+did+the+creator+of+SlashDot+graduate+from%3F&x=29&y=10 Just so happens, it's my alma mater :)

Re:I found a hot query... by conlaw · 2008-05-13 13:04 · Score: 1

I really don't think that Powerset understood your question all that well. Its first answer: "Rob Malda (born May 10, 1976), also known as CmdrTaco, is the founder of the website Slashdot. He is a graduate of Hope College and Holland Christian High School."
But then look at its fourth answer, "Hope College Rob "CmdrTaco" Malda, founder of Slashdot ... * attended but did not graduate from Hope."

but does it have the wits to answer the following? by Anonymous Coward · 2008-05-12 18:58 · Score: 0

"what is the airspeed velocity of an unladen swallow?"

unfortunately, it doesn't ask me which one ;)...

Who is he?? by Anonymous Coward · 2008-05-12 19:31 · Score: 0

I asked it "Who is the Drizzle?"

And it knew .

"Comprehension" my ass... by LunarStudio · 2008-05-12 20:04 · Score: 2, Insightful

"It says that its technology reads and comprehends each word on a page."

First let's get this straight - It doesn't comprehend anything. That's wishful thinking and marketing. It looks at verbs or certain keywords, flags them as important, references through synonyms, then proceeds to lump them under one category.

It's a smart way to do things, but it's not comprehension. Comprehension would imply artificial intelligence whereas this system follows a set pattern of rules and doesn't 'think' on its own.

In no ways am I trying to put this effort down - it's a step in the right direction. But you have to be careful how you weigh these words.

The second problem I see here is that they choose Wikipedia as an example. I suppose Wikipedia itself is a good site as an example, but it's far from perfect as I've discussed here: http://www.mightyfunk.com/2008/05/wikipedia-equals-fail-death-to-the-open-encyclopedia/

--
Architectural Renderings

its not very semantic, pre-alpha at best. by Goffee71 · 2008-05-12 20:10 · Score: 1

Go on, ask it "How long do ferrets live?" A simple but specific question. Ferrets is a rare enough term to give it a chance to get a high percentage of valid hits and "How long" gives it an achievable search criteria with "live" providing an extra element. If it had come back with an answer for "how long are ferrets?", or "how long they live" I'd have been impressed. The first two answers I got involve Sylvester McCoy and Rudi Guliani. Only the fifth result mentions ferrets in any sane context and even then fails to answer the question. Complete twaddle! I hope no one is planning to invest in something so backroom-coder at this stage, although good luck in the future.

--
If he's the Walrus then can I be a penguin please?

Google has gone downhill by Anonymous Coward · 2008-05-12 20:28 · Score: 1, Interesting

Early Google wasn't perfect, but it got a lot better over time as the Pagerank algorithm was refined.

Actually, am I the only one who thinks that google's results are worse now than they were years ago? It's still the best general search engine out there, but it often gives me results I don't want now, forcing me to put plusses in front of every word or quoted phrase just to make it actually search for what I asked for.

Impressive by mrrudge · 2008-05-12 21:44 · Score: 4, Funny

Q: What the hell is a 'factz'
A: Did you mean 'What the hell is a fact?'

Quite

Re:Yawn. Here is something really impressive... by master_p · 2008-05-12 22:16 · Score: 1

Very impressive! Star Trek-like AI...

Natural by Kamineko · 2008-05-12 22:31 · Score: 1

Powerset is reaching higher than for mere "natural language."

Supernatural language!

NTFS-related fatalities by Keeper+Of+Keys · 2008-05-12 23:10 · Score: 1

I'm sure NTFS has resulted in some fatalities.

Or maybe not...

Success by Anonymous Coward · 2008-05-12 23:45 · Score: 0

I asked "What is the meaning of life" and it returned a Monty Python movie. The results speak for themselves.

"Why are Japanese women so hot?" by Anonymous Coward · 2008-05-12 23:45 · Score: 0

I asked it "Why are Japanese women so hot?" and it might as well have searched for "Japan".

Japanese language
Sent
Japanese tea ceremony
History of women in the military
Nathan's Hot Dog Eating Contest
Taiwan under Japanese rule

Ummm... At least it didn't say Weaboo, but I think they wasted their time.

Getz thez factz by Anonymous Coward · 2008-05-12 23:48 · Score: 2, Insightful

Powersetz havez thez greatestz tipz.

How seriously are we supposed to take a search engine that manages to misspell facts with a 'z' on it's front page?

Why don't they go the whole hog and replace the explore button with "OMFGZ SEARCHEZ".

Re:Getz thez factz by Threnody · 2008-05-13 05:06 · Score: 1

LOLZ

{mark} powersetz product managerz

--
Invidia fortunum ovit.

Could they make it any easier on themselves? by Random+Q.+Hacker · 2008-05-13 00:10 · Score: 2, Insightful

I mean really, using Wikipedia as your data set? It's so high signal-to-noise ratio it'll make all their search results look informative. Let's see how it does on the open internet, full of spammers and google-bombs.

Napoleon by CSLarsen · 2008-05-13 00:15 · Score: 1

Cool, try searching for "How tall was Napoleon?".

--
Claiming to be pedantic on Slashdot is asking for trouble

Alpha release by peter_gzowski · 2008-05-13 00:15 · Score: 1

It's said that their understanding engine required a month to grok Wikipedia's 2.5M articles. The Web is currently at least 8,000 times as large. Expect the alpha release in 2675.

--
"Now gluttony and exploitation serves eight!" - TV's Frank

Unimpressed to the extreme by Anonymous Coward · 2008-05-13 00:17 · Score: 0

Well that sucks! I gave it a try compared to the so called 'stupid' google and it doesnt fare very well! compare these search terms:

http://www.google.com/search?source=ig&hl=en&rlz=1G1GGLQ_ENGB268&q=when+was+the+hundred+years+war&btnG=Google+Search

http://www.powerset.com/explore/pset?q=when+was+the+hundred+years+war&x=0&y=0

Note that google (which has a much faster page I might add) instantly displays the dates of the humdred year war, as the very first result which it intelligently pulled from a web page. On the other hand powerset just displays a bunch of results, most of which are not relevant to the question, just to the war. I asked when it was!!! I dont get close to a good answer, there are lots of dates on that page, but none of them are right!

Missing point by home-electro.com · 2008-05-13 00:35 · Score: 1

Many of the commenters above did not read the article or missed the point. The search engine does NOT claim to understand natural language.

I guess it is a confusing concept, but they say it just 'understands' individual words. I fail to make sense of it, though, too.

I also fail to see what benefits it gives to user.

It's really just Google, plus... by jpellino · 2008-05-13 00:52 · Score: 1

... every result is followed by Woody Allen's voice going "I know, I know..."

--
"Win treats sysadmins better than users. Mac treats users better than sysadmins. Linux treats everyone like sysadmins."

Not the pioneer by pythonhacker · 2008-05-13 01:00 · Score: 2, Informative

Media seems to focusing a lot of attention on Powerset. But they seem to forget another startup which started innovating in the area of semantic search much before Powerset even arrived on the scene - Hakia. Read the following article which does a decent job of comparing the two startups. http://www.centernetworks.com/powerset-hakia

--
If you don't succeed at first, try again. If you still don't succeed, try harder. If nothing works, try reality shows.

good god by Anonymous Coward · 2008-05-13 01:21 · Score: 0

Seriously, "Factz"?

Too bad it's "l33t d3wd" themed by Anonymous Coward · 2008-05-13 02:24 · Score: 0

"Search Tipz" ? Really? OK, goodbye, I'm never going back to that site. Ever.

Guess what, web search is already perfect. We have Google. If you fail to find what you're looking for on Google, it doesn't exist or, more likely, you're a complete clueless fucktard.

A Taxonomy of Knowledge by kcdoodle · 2008-05-13 02:32 · Score: 2, Insightful

Google just indexes words, word fragments, and groups of words.

This is an effort (like many others) to create a semantic web.
This means they are trying to discover the MEANING of words and sentences.
Very edgy, dangerous stuff. The MEANING, once extracted, is expressed in still other words.
So SOMEONE determines what a word or group of words mean.

This leads to classifying, identifying, sorting, drawing relations between ideas, concepts, events, animals, machines, planets, science, art, religion, basically everything you can express with words.

This is what the human brain does. And every human brain does it a little bit differently. It is not the things we perceive that define our world and our place in it. It is the interrelations between things.

I have been involved with several search engines, and the TAXONOMY OF KNOWLEDGE is exactly what is wanted/needed.

Is it possible to create one? Sure.

Is it hard? Yep, really, really, really, really hard.

If you created one would it be correct? NO!
It would only be ONE PERSON's vision of the relationships of knowledge, but NO ONE PERSON can speak for us all.

Now all I have to say (after this rant) to creators of smart search engines is "GOOD LUCK"!!

--

- I live the greatest adventure anyone could possibly desire. - Tosk the Hunted

Re:Yawn. Here is something really impressive... by Anonymous Coward · 2008-05-13 04:04 · Score: 0

Perhaps in the future the "Semantic Web" will be a combination of all these companies. True Knowledge is building a giant inference engine but lacks the sophisticated semantic matching and parsing to fill it from the web. Freebase is building a huge knowledge base and it's awesome, but their knowledge base needs to be used. Powerset has some of the world's best semantic matching and parsing technology, but that needs a massive knowledge base to really understand, and an inference engine to produce unstated knowledge.

All of them seem necessary to move forward on the concept we call "The Semantic Web". Powerset's piece is just the piece that has proved most intractable to computer science thus far, so it's exciting to see progress on that front.

Even less impressive by melted · 2008-05-13 05:13 · Score: 1

It either returns garbage or "your request can not be understood".

It's the triples, stupid by NapalmScatterBrain · 2008-05-13 07:03 · Score: 1

Look up 'Semantic Web', and look into 'triples' and rdfs/owl. Then please, please, please stop pretending to know how this search engine works until you have read them. The weaknesses of this thing are clear, but the potential is there. The problem with semantic search technology is that it is only as strong as the realtionship map(ontology) that is created for it. Triples: 1)Idiot Posted Message 2)Message isOn Slashdot 3)Idiot SubclassOf Users 4)Slashdot has Users 5)(Inferred)Slashdot has Idiots No offense to the brilliant, and very funny, majority.

Where Is Osama Bin Laden hiding? by gateur · 2008-05-13 08:29 · Score: 1

It does seem to know that Osama Bin Laden is hiding in Waziristan. However, it can't seem to locate the WMD in Iraq.

User data needed... by arachnode.net · 2008-05-13 10:09 · Score: 1

Too early to say if their search is groundbreaking... Remember, great search algorithms aren't the only component needed for great results. You also need users to tell you which result is ultimately right. Also, Wikipedia is a natural choice to search against. It already has a ton of clean, mostly-standardized and interconnected content. Search would've been a breeze long, long ago if there wasn't any real crap in your indexes.

Slashdot Mirror

"Understanding" Search Engine Enters Public Beta

192 comments