Super-Fast RDF Search Engine Developed

Official DERI Website by achillean · 2007-05-04 02:34 · Score: 3, Informative

Here's the link to the official NUIG: DERI (omgwtfbbq) website in Ireland:

DERI

Re:Official DERI Website by PDHoss · 2007-05-04 03:23 · Score: 4, Funny

I tried to access that site, and I got a good look at their DERI Error.

--
======================================
Writers get in shape by pumping irony.
Re:Official DERI Website by Anonymous Coward · 2007-05-04 03:43 · Score: 0

Go to your room.
Re:Official DERI Website by fche · 2007-05-04 03:59 · Score: 1

Some of their software is at http://www.deri.ie/publications/tools/
but not the "yars2" world-record-busting supergadget.
Re:Official DERI Website by YourMotherCalled · 2007-05-04 04:14 · Score: 1

You're welcome. I'm beautiful. You'll be here all week. I should tip my waitress.

This could be huge by $RANDOMLUSER · 2007-05-04 02:35 · Score: 4, Interesting

Except for the minor little problem of getting everyone to agree on the ontologies. Being able to search quickly is important, but until somebody comes up with the Dewey Decimal System for all knowledge, it won't mean much.

--
No folly is more costly than the folly of intolerant idealism. - Winston Churchill

Re:This could be huge by Anonymous Coward · 2007-05-04 02:59 · Score: 0

How about People?
Who cares if a search is so much is faster??
Does this make people process the info any faster ?
Re:This could be huge by Anonymous Coward · 2007-05-04 02:59 · Score: 1, Insightful

Except for the minor little problem of getting everyone to agree on the ontologies. Being able to search quickly is important, but until somebody comes up with the Dewey Decimal System for all knowledge, it won't mean much. How about the Dewey Decimal System?
Re:This could be huge by complete+loony · 2007-05-04 03:00 · Score: 3, Insightful

Ah, but the Dewey Decimal system only works because responsible people are involved in categorizing everything. They let just anyone publish information on the internet these days.

--
09F91102 no, 455FE104 nope, F190A1E8 uh-uh, 7A5F8A09 that's not it, C87294CE no. Ah! 452F6E403CDF10714E41DFAA257D313F.
Re:This could be huge by stevemulligan · 2007-05-04 03:16 · Score: 1

First time poster, long time lurker. Just wanted to say grats grats to the Irish. Way to go. What kind of hardware spit out 7billion records in a second? I guess I'll have to read the article... :(

AND is it public? Can I hook up to it and send some queries just to see for myself how fast it is?
Re:This could be huge by stevemulligan · 2007-05-04 03:31 · Score: 1

Ok so it's not public and someone from the project posted a PDF with more information than the press release. http://www.deri.ie/fileadmin/documents/DERI-TR-200 7-04-20.pdf I'd like to play with RDF someday when I have free time.
Re:This could be huge by StefanDecker · 2007-05-04 03:58 · Score: 1

Fully agreed. But it worked for RSS - and it also seems to work for SIOC (see http://sioc-project.org/ ). Other XML structured formats are also catching on - eg., XBRL. All of them can be (quite easily) translated in a graph and integrated. So there is hope. However, Andreas and Aidans work reported on in the press release enables us to build scalable engines - scalability was a major headache before.
Re:This could be huge by spemen · 2007-05-04 04:07 · Score: 2, Interesting

Actually there is a lot of research being done to get around the need for a 'Dewey Decimal System'. The idea is to analyze relations between terms (names, datatypes, ect.) in an ontology. One could also compare relationships between terms: A child of B, C child of D, and A=B does B==A ?? Please note that these are examples of how terms and ontologies *could* be matched and not necessarily how someone would match terms. http://www.ontologymatching.org/ Also, http://wordnet.princeton.edu/ is a project I think is in the direction of a 'Dewey Decimal System' for knowlege.
Re:This could be huge by angst_ridden_hipster · 2007-05-04 05:19 · Score: 1

It was an admirable attempt in its time, but it's pretty clunky. It's also very biased towards the world-view of one man in the 1870s. While it does get updated, you'll find that there are structural issues. The classic example is religion: everything involving Buddhism, Sikhism, or Jainism is lumped together in a number space which is the same size as the number space reserved for Christian "Parish Government And Administration." Christianity itself gets 88 percent of all the top-level numbers set aside for religion.

Anyone who has ever tried to make a meaningful taxonomy knows that it's really, really hard.

Still, it could be a starting point for the inevitable endless committees.

--
Eloi, Eloi, lema sabachtani?
www.fogbound.net
Re:This could be huge by $RANDOMLUSER · 2007-05-04 06:11 · Score: 1

Congratulations to you, Andreas and Aidan. Scaling is a major accomplishment. I've thought about this (ontology) issue a lot, ever since I realized how brilliant Roget was. As I've said elsewhere in this thread, to me, the issue of things being on a continuum, and the judgementalism of placing them on that continuum is a serious problem. Then there's the issues of getting everyone to standardize, overlapping ontologies and schema drift over time. I think we'll see most of the initial progress in very vertical domains, (medical perhaps) but some (I'm thinking of legal) are so slippery and fraught with peril, I doubt we'll ever make any serious progress. Good luck to you.

--
No folly is more costly than the folly of intolerant idealism. - Winston Churchill
Re:This could be huge by maharg · 2007-05-04 07:27 · Score: 1

until somebody comes up with the Dewey Decimal System for all knowledge, it won't mean much it's coming: http://metadata-stds.org/19763/index.html

Last year I attended an excellent seminar track on content and knowledge at this : http://www.xmlsummerschool.com/ - and one of the speakers had a great example - he wanted to be able to search for a guitar amp speaker cabinet that would handle the 100w (that's RMS) output of his Marshall amp, and fit in the boot/trunk of his car - I forget the make/model, let's say it's a Ford whatever... anyhow, the point is that the semantic search app would need to discover the dimensions of the guy's car's boot/trunk from Ford, then it could search for cabs that would fit in the boot/trunk that could handle 100w (that's RMS remember) so, if the dimensions of the boot/trunk are expressed in inches by Ford and the dimensions of the cab are expressed in various different ways by the purveyers of the cabs, then of course there needs to be a standard scheme to express units of size, and then the conversions are easy. Don't forget, that's 100w RMS...

Probably the only XML seminar in the world where you get to go punting as well... Great thing is that it's been going for several years, and noone has fallen in the river yet, so the sense of anticipation is fantastic ;o)

--

$ strings FTP.EXE | grep Copyright
@(#) Copyright (c) 1983 The Regents of the University of California.
Re:This could be huge by DocDJ · 2007-05-04 09:04 · Score: 1

Absolutely right, and it's not as if these academic chancers don't have form in this area - the importance of this "breakthrough" can indeed be overestimated, just like previous "breakthroughs"). The semantic web is nothing more than snake oil.
Re:This could be huge by Anonymous Coward · 2007-05-04 11:39 · Score: 0

What the semantic web is or is not is our own opinion.
However, the engine built is able to store and retrieve graph structured information, which is a very flexible format and useful if you want to integrate information from various sources.

That's all.

The only RDF I know.... by Anonymous Coward · 2007-05-04 02:36 · Score: 0

Is "Reality Distortion Field."

Sounds about right, in this case.

Why would I want to search... by msauve · 2007-05-04 02:37 · Score: 1

for a Radio Direction Finder?

--
"National Security is the chief cause of national insecurity." - Celine's First Law

Re:Why would I want to search... by Anonymous Coward · 2007-05-04 02:57 · Score: 0

Don't they mean Reality Distortion Field?
Re:Why would I want to search... by Mattintosh · 2007-05-04 03:03 · Score: 1

Yes. And it's always easy to find the largest RDF on the planet... It's wherever Steve Jobs is. I hear it extends into space.
Re:Why would I want to search... by SleepyHappyDoc · 2007-05-04 03:19 · Score: 1

I thought they meant the Robotech Defense Force.

--
Stasis is death. Embrace change.

Links! by SolitaryMan · 2007-05-04 02:37 · Score: 3, Insightful

These results enable us to create web search engines that really deliver answers instead of links.

I need both: answers *and* links! Many times when I search the web, I don't know for sure what am I searching for, let alone being able to ask specific question...

--
May Peace Prevail On Earth

Re:Links! by CastrTroy · 2007-05-04 02:55 · Score: 1

This is probably the biggest problem with searching. Google can return really good results if you know what to search for. Most people I know just type in the first word that pops into their head, and make the search way too generalized, and don't get good results. Knowing what words to type in can save you a lot of time in searching.

--

Anthropic principle: We see the universe the way it is because if it were different we would not be here to see it.
Re:Links! by Andy_R · 2007-05-04 03:08 · Score: 1

The sad thing is that it's so easy to learn how to get good results using current search engines, but people are never taught how to do it.

RDF could do very useful things, like throwing up a disambiguation question at the top os the results page when you've not made it clear what you want, or filtering out the plague of typosquatter/content free price comparison/'be the first to write a review of this item' sites, but so could a bit more intelligence built into Google.

--
A pizza of radius z and thickness a has a volume of pi z z a
Re:Links! by CastrTroy · 2007-05-04 03:17 · Score: 1

People just expect computers to do everything for them, and turn off their brains most of the time. This is why people have so many problems operating computers. Most people when searching for information about Cats (the musical) will probably just type in "cats", and look through all the results. Whereas, a person who understands the concept of feeding the right information to the search engine, will probably type in "cats musical", or if you're looking for something more specific, you may type in "cats musical song list".

--

Anthropic principle: We see the universe the way it is because if it were different we would not be here to see it.
Re:Links! by Red+Flayer · 2007-05-04 03:39 · Score: 2, Funny

RDF could do very useful things, like throwing up a disambiguation question at the top os the results page when you've not made it clear what you want
It looks like you're trying to search for tentacle porn. Would you like help?

No thanks, I don't need Clippy in my search engine.

--
"Trolls they were, but filled with the evil will of their master: a fell race..." -- J.R.R. Tolkien on Olog-hai
Re:Links! by Andy_R · 2007-05-04 04:06 · Score: 1

I'm not suggesting Clippy, I'm just suggesting disambiguation. Google already does this for typos (You searched for "kats musical song list" did you mean "cats musical song list"?). If Google noticed that the cats pages fell into 3 major categories (musical/animal/character who says 'all your base') and offered me those options in the typo line, I'd find that useful in narrowing down which of the 86,500,000 pages it found is the right one.

In your example, I'm guessing you might find the option to filter down by gay/straight or censored/uncensored useful?

--
A pizza of radius z and thickness a has a volume of pi z z a
Re:Links! by Red+Flayer · 2007-05-04 04:14 · Score: 1

I know, I was just making a joke.

However, I think contextural disambiguation questions like what you're suggesting are already served by "search within results" queries. Proposing likely criteria for narrowing down the results would be, I think, a disservice. It pigeonholes sites, but worse than that, pigeonholes searches. This leads to easy gaming of the search system -- SEO would cause pretty much every site to make sure it's associated with the typical disambiguation terms, thus removing the utility of those terms.

--
"Trolls they were, but filled with the evil will of their master: a fell race..." -- J.R.R. Tolkien on Olog-hai
Re:Links! by Andy_R · 2007-05-04 04:46 · Score: 1

I'm not so sure. I'm not suggesting linking to disambiguation pages (which could be gamed by SEO), I'm suggesting Google analyses the text and notices that pages tend to either use the words "Andrew Lloyd Webber" "Kitty-litter" or "set us up the bomb" and that these phrases tend to be mutually exclusive, so they would be good ones to offer as means of disambiguation.

The terms wouldn't be 'typical disambiguation terms', as they would be generated freshly from the content of the pages that appear in the search results. Probably too computationally intensive to do yet, but ontology isn't needed to give useful results when it becomes possible.

"Kitty-litter" isn't a great example, but it shows why I think this would be useful - I spent a while trying to thing of a phrase would occur in pages about the animal but not the musical and I couldn't come up with a good one. Doing a statistical analysis on the text content would probably find a much better term, and that term would be offered to me in the disambiguation line.

Normally I'd use "-musical" as a search term if I was after feline resultd, but that relies on me knowing about the existance of the musical and it does a sort of reverse pigeon-holing by throwing out pages that are 99% about felines but happen mention the musical in passing.

I'm not seeing any way for SEO to game this.

--
A pizza of radius z and thickness a has a volume of pi z z a
Re:Links! by Red+Flayer · 2007-05-04 04:58 · Score: 1

Hmm. I do manually what you're suggesting when searching. Enter my search terms, and if most of the results are for something different than what I'm looking for, I'll add terms to remove the extraneous results. This is based on the couple lines of content info returned by Google (of course, those lines aren't 100% fresh with Google).

So, I think what you're suggesting is that the search engine prompt those terms to help people narrow their search? Didn't Ask Jeeves try this and miserably fail -- and if so, is that because of execution or concept?

--
"Trolls they were, but filled with the evil will of their master: a fell race..." -- J.R.R. Tolkien on Olog-hai
Re:Links! by roman_mir · 2007-05-04 07:42 · Score: 1

let alone being able to ask specific question... - "Computers are useless. They can only give you answers." (Pablo Picasso)

When they come up with a computer that will be able to ask questions for me, then I will be impressed :)

--
You can't handle the truth.

42 by Prysorra · 2007-05-04 02:37 · Score: 1

NO.

The first answer will be 42.

Re:42 by Anonymous Coward · 2007-05-04 02:46 · Score: 0

I was trying to be serious and all you can do is fool around!
Re:42 by sryx · 2007-05-04 04:12 · Score: 1

NO.

The first answer will be 42.

That, it turns out wasn't the hard part, it's figuring out the query!
-Jason :P
Re:42 by VWJedi · 2007-05-04 05:04 · Score: 1

The first answer will be 42.

If that is the answer to "Life, the Universe, and Everything", then ALL the answers should be 42.

Give me a couple minutes and I'll write the code for you search engine.

Great!! by Anonymous Coward · 2007-05-04 02:37 · Score: 0, Insightful

Now all we need to do is get everyone to start using RDF.... wait.. you dont even know what that is??

Re:Great!! by $RANDOMLUSER · 2007-05-04 02:43 · Score: 4, Informative

Now all we need to do is get everyone to start using RDF.... wait.. you dont even know what that is??
It's the Resource Description Framework, which RSS is a subset of.

--
No folly is more costly than the folly of intolerant idealism. - Winston Churchill
Re:Great!! by jrumney · 2007-05-04 03:18 · Score: 2, Informative

Actually, only RSS 1.0 is based on RDF. The only similarity between RDF and the more popular RSS 2.0 and RSS 0.92 is that they are all based on XML.
Re:Great!! by martinmarv · 2007-05-04 04:04 · Score: 1

The article is a bit light on details, but if this can be used to filter my RSS feeds in real-time, I'll be a happy bunny.

Search solved. World hunger next. by 140Mandak262Jamuna · 2007-05-04 02:38 · Score: 3, Funny

Having solved the problem of search, and providing a breakthrough product that has consciousness to what was previously mere series of tubes, now the National University of Ireland announced that it is going to solve world hunger next, may be in three months. Other projects in the pipeline includes cure for cancer and solving full Navier Stokes equation.

--
sed -e 's/Chuck Norris/Rajnikant/g' joke > fact

Re:Search solved. World hunger next. by skidv · 2007-05-04 02:49 · Score: 1

'The importance of this breakthrough cannot be overestimated,' said Professor Stefan Decker, director of DERI.

Having solved the problem of search, and providing a breakthrough product that has consciousness to what was previously mere series of tubes

This breakthrough makes it possible to use the Interweb as a tube of artificial intelligences capable of answering such questions as "Who is Neuromancer?" and "Why is the number 42 so important, anyway?" as well as organize a successful revolution by moon colonists.
Re:Search solved. World hunger next. by Anonymous Coward · 2007-05-04 03:03 · Score: 0

It's obvious that the solution to hunger is whirled peas.

While we are on the topic, what's all the hoopla about endangered feces?
Re:Search solved. World hunger next. by Jessta · 2007-05-04 04:05 · Score: 1

http://www.newscientist.com/article/dn10971-cheap- safe-drug-kills-most-cancers.html
We've already got a cure for cancer.

--
...and that is all I have to say about that.
http://jessta.id.au
Re:Search solved. World hunger next. by BalkanBoy · 2007-05-04 11:07 · Score: 1

That's already being solved - http://www.thp.org/.

And the next one is world peace - and if you want to solve that one, join the Landmark Forum.

--
'A lie if repeated often enough, becomes the truth.' - Goebbels

Hype by gvc · 2007-05-04 02:42 · Score: 4, Insightful

users should get more relevant results

Yet another /. article parroting an uncritical popular press account of a press release.

Re:Hype by Dhalka226 · 2007-05-04 03:00 · Score: 1

Yet another /. post bitching about /. articles, yet adding absolutely no value of their own.
Seriously. Do you have anything to add to the discussion or were you simply karma whoring?
Re:Hype by StefanDecker · 2007-05-04 04:03 · Score: 2, Insightful

We have a Technical Report available at http://www.deri.ie/fileadmin/documents/DERI-TR-200 7-04-20.pdf that should answer most of the technical questions. From the abstract: "We present the architecture of an end-to-end search engine that uses a graph data model to enable interactive query answering over structured and interlinked data collected from many disparate sources on the Web. In particular, we study distributed indexing methods for graph-structured data and parallel query evaluation methods on a cluster of computers. We evaluate the system on a dataset with 430 million statements collected from the Web, and provide scale-up experiments on 7 billion synthetically generated statements."
Re:Hype by Anonymous Coward · 2007-05-04 22:26 · Score: 0

Can you tell me on which page in the paper the actual performance evaluation is described?

"'...this breakthrough cannot be overestimated" by Anonymous Coward · 2007-05-04 02:44 · Score: 0

Cool. It'll end war and bring universal freedom to all people.

Re:"'...this breakthrough cannot be overestimated" by Miseph · 2007-05-04 02:50 · Score: 1

I asked the RDF search engine, and that's what it told me. Maybe if we ask it the right question it could come up with an answer to do that. Now if only we could devise a machine powerful enough to tell us what that question would be...

--
Try not to take me more seriously than I take myself.
Re:"'...this breakthrough cannot be overestimated" by FormOfActionBanana · 2007-05-04 02:58 · Score: 1

...and overclock the sucker until she smokes...

--
Take off every 'sig' !!
Re:"'...this breakthrough cannot be overestimated" by tygerstripes · 2007-05-04 03:18 · Score: 1

Ah, so Global Warming is evidence that the mice are looking to boost their clockspeed...

--
Meta will eat itself

Next up: Ontology spam by G4from128k · 2007-05-04 02:44 · Score: 5, Insightful

Yes, creating a consistent ontology is challenge. But the bigger challenge is the lack of incentive for ontology truthfulness. If this type of search becomes popular, ontology spam and OSEO (Ontology Search Engine Optimization) will become a booming industry.

--
Two wrongs don't make a right, but three lefts do.

Re:Next up: Ontology spam by $RANDOMLUSER · 2007-05-04 02:52 · Score: 1

Of course you're correct. It had never occured to me that there would be ontology spam, but of course there will be. Still, for the pure knowledge aspects (think Wikipedia on RDF) it would be a wonderful thing.

--
No folly is more costly than the folly of intolerant idealism. - Winston Churchill
Re:Next up: Ontology spam by inviolet · 2007-05-04 03:04 · Score: 1

Of course you're correct. It had never occured to me that there would be ontology spam, but of course there will be. Still, for the pure knowledge aspects (think Wikipedia on RDF) it would be a wonderful thing.

For a while, yes. But as long as there is a cash-per-page-view market, the onslaught of adverspam will reach every corner of the web. It can't be stopped as long as there is money to be made there.
Certainly the big "pure knowledge" sites will defend themselves, as Wikipedia does, but that is an arms race that will eventually exhaust the resources of any single organization.

--
FATMOUSE + YOU = FATMOUSE
Re:Next up: Ontology spam by G4from128k · 2007-05-04 03:10 · Score: 1

I agree with you 100% and did not mean to imply that the goal is not worthy. Being able to search semantically or to pull out just the relevant information would be hugely valuable.

And I'm sure that next generation search engines will create clever ways of detecting and punishing ontology spam (e.g., noting the dissonance between the text and the tags)

--
Two wrongs don't make a right, but three lefts do.
Re:Next up: Ontology spam by maxume · 2007-05-04 03:12 · Score: 1

Semantic information still adds value to a given page in isolation. Hence microformats.

--
Nerd rage is the funniest rage.
Re:Next up: Ontology spam by cyphercell · 2007-05-04 03:23 · Score: 1

I was thinking the article kinda indicates a resolution to the ontology. Most definitions are a product of synonymous/antonymous context. For instance a person cannot understand the concept of clear without simultaneously understanding opaque. This level of search would suggest that if you throw enough generic definition at a term then some logic could be used to say "if we find so many synonyms then we have an accurate definition" this is how AIML works at a basic level. RDF would be like AIML on crack and steroids.

--
Under the influence of Post-Cyberpunk Gonzo Journalism
Re:Next up: Ontology spam by Rakshasa+Taisab · 2007-05-04 03:33 · Score: 1

Why do I suddenly get this mental image of spam on increasing the plumage size of birds...?

--
- These characters were randomly selected.
Re:Next up: Ontology spam by $RANDOMLUSER · 2007-05-04 03:52 · Score: 1

Ah, but that's the rub: most things are not binary, neither this nor that. Things live on a continuum, and it's all too often a judgement call where they should lie.
transparency ==> translucency ==> opacity

Or, to put it in website design terms: "It's not blue enough.

--
No folly is more costly than the folly of intolerant idealism. - Winston Churchill
Re:Next up: Ontology spam by DragonWriter · 2007-05-04 05:34 · Score: 1

Yes, creating a consistent ontology is challenge. But the bigger challenge is the lack of incentive for ontology truthfulness.

I'd say consistent ontology is a bigger challenge (though also one that doesn't need to be anywhere near completely solved for all kinds of useful applications to exist.) Trust mechanisms built on RDF aren't really all that big of a challenge: trust relationships are fairly basic, straightforward relationships of exactly the type RDF was designed to express from the outset, after all.
Re:Next up: Ontology spam by CarpetShark · 2007-05-04 05:49 · Score: 1

Agreed, spam could really screw things up. Different ontologies aren't such a big problem, as there are already tools to translate between them -- that's part of the show, rather than being showstopper. However, when a search engine like this is open source, available to install on a lan (or any specific project server), and can be kickstarted with an ontology that says "we trust x, y, and z, and n hops of trust from them, with each trust hop reduced by a m", things may start to look up.
Re:Next up: Ontology spam by PitaBred · 2007-05-04 05:53 · Score: 1

Ornithology, not ontology ;)

But... does that make a parrot an ornithological ontologist?

--
My blog. Good stuff (when I remember to update it). Read it.
Re:Next up: Ontology spam by smittyoneeach · 2007-05-04 06:38 · Score: 1

Only when using certain obscure Perl6 operators.

--
Get thee glass eyes, and, like a scurvy politician, seem to see things thou dost not.--King Lear
Re:Next up: Ontology spam by treeves · 2007-05-04 06:59 · Score: 2, Insightful

Ontology SPAM is OK, but Epistemology Spread is really yummy!

--
...the future crusty old bastards are already drinking the Kool-Aid.
Re:Next up: Ontology spam by grcumb · 2007-05-04 09:06 · Score: 1

But as long as there is a cash-per-page-view market, the onslaught of adverspam will reach every corner of the web. It can't be stopped as long as there is money to be made there.

Agreed. As long as there are bullshit artists in the world, they will find ways of expressing themselves.

Certainly the big "pure knowledge" sites will defend themselves, as Wikipedia does, but that is an arms race that will eventually exhaust the resources of any single organization.

I don't think so. I think that liars work at a deficit in this contest, as they do everywhere in society. They remain a constant irritant, true, but the value of truth is (nearly) universal, whereas the value of deceit derives only to a few. For that reason, if for no other, the majority of the population value truth over deceit, and will support and supplement technical measures to trounce this spam with the one tool computing has never adequately replicated: Human pattern matching.

The really interesting part about the Semantic Web is not what authors says about their content, but what readers say about it. This is a resource that can sometimes be polluted, but never utterly subverted.

--
Crumb's Corollary: Never bring a knife to a bun fight.
Re:Next up: Ontology spam by dkf · 2007-05-04 10:28 · Score: 1

Mod parent up as Interesting. Yeah, I know it's a joke, but it's also an excellent illustration of why writing ontologies is hard: the meaning of a word is a slippery thing, inclined to run away from you.

--
"Little does he know, but there is no 'I' in 'Idiot'!"

RDF? by lancelotlink · 2007-05-04 02:46 · Score: 4, Funny

I didn't realize Steve Jobs' Reality Distortion Field was able to be harnessed and bottled in a search engine, or any software for that matter. His abilities are boundless!

RDF by ZooSpeed · 2007-05-04 02:55 · Score: 0

Why would anyone want to search Steve Jobs Reality Distortion Field?

I'll prove him wrong by Big+Nothing · 2007-05-04 02:57 · Score: 3, Interesting

"'The importance of this breakthrough cannot be overestimated,' said Professor Stefan Decker, director of DERI."

This is without a doubt the greatest invention in the history of time!

There, I just proved the professor wrong. Muahaha.

--
SIG: TAKE OFF EVERY 'CAPTAIN'!!

Re:I'll prove him wrong by StefanDecker · 2007-05-04 07:09 · Score: 2, Funny

OK, I concede. You won.
Some people can overestimate the importance ;-)
Re:I'll prove him wrong by jZnat · 2007-05-04 12:07 · Score: 1

But you didn't cover the possibilities of inventions being even greater than that in the future.

Gotta think logically...

--
'Yes, firefox is indeed greater than women. Can women block pops up for you? No. Can Firefox show you naked women? Yes.'

contradictory by DohnJoe · 2007-05-04 02:58 · Score: 1

The importance of this breakthrough cannot be overestimated I think he just did...

Re:contradictory by Anonymous Coward · 2007-05-04 04:30 · Score: 0

'The importance of this breakthrough cannot be overestimated,' It'll cure us from Aids, malaria, smallpox, close the hole in the ozonlayer, get our climate back under control while allowing for exponential economic growth and will put a man on Mars in nine months time, all by itself without any human intervention.

There. This is all true according to TFA.
Re:contradictory by blooba · 2007-05-04 13:20 · Score: 1

dude that's funny as hell i wish i had mod points

Cannot be overestimated by stevenp · 2007-05-04 02:59 · Score: 4, Insightful

- "The importance of this breakthrough cannot be overestimated"

The importance of any event can be overestimated and quite often is overestimated. It is called hype.
When speaking of XML, XHTML and semantic WEB then the word "overestimated" fits just nice.
If this was not the case then HTML should long have been dead and the whole WEB should have been based on pure XML with meaningful tags.

-- Do not read me, I am a stupid tag

Re:Cannot be overestimated by gaspar+ilom · 2007-05-04 05:39 · Score: 1

How could this possibly be modded insightful? This is actually an uninformed and naive view (however prevalent on Slashdot) of how and why XML are used.

I've seen this sentiment regarding "HTML" vs "XML" on Slashdot so often; let me set the record straight:

Many sites use XML on the back-end, either as an interchange format with a DB, or to store and to generate HTML. I would dare say that *most* web-based applications of XML generate HTML, rather than XML, as the final output format. Outputting HTML is simply a matter of supporting the lowest common denominator, and is no indication that "XML" lacks utility. With minimal effort, these XML data sources could be made directly accessible by search engines. (Or, XMHTML can be outputted instead of, or in addition to, HTML)

I would opine that if you are a developer, and you are storing vast amounts of marked-up data as "HTML" -- rather than X/HTML, then you are doing something wrong. (I'll back this up with one brief point: It is *trivial* to transform XML into HTML using a technology like XSL -- but going from HTML to XML most certainly is not.) Having your data in a machine-readable, micro-addressable format like X/HTML is so much more advantageous than HTML tag soup, I shouldn't have to enumerate all the reasons here. (This remedial info can be sought via a quick google search.)

Just because YOU only see "HTML" in the browser (and don't know how XML is used?), it does not mean that XML is not useful -- or even ideal -- for representing marked-up text and documents in an information system.
Re:Cannot be overestimated by Ant+P. · 2007-05-04 05:47 · Score: 1

Yes, I agree the W3C's vision of everyone surfing the WEB on their Blue-ray equipped MACS is just wishful thinking
Re:Cannot be overestimated by Anonymous Coward · 2007-05-04 07:46 · Score: 0

omgwtfbbq! u can mock researchers! u must have many brains!!1

Look dude.

Since version 1.0, Firefox have had this feature called "live bookmarks" (IE has something similar). Live bookmarks are based on RSS. RSS is based on RDF. RDF is the cornerstone of the Semantic Web. Live bookmarks is the first tiny step unto the Semantic Web.

Firefox 3 will support Microformats. Microformats are to a even larger extent based on RDF, and supports far more applications than RSS feeds. Microformats are the next, slightly bigger step unto the Semantic Web.

The Semantic Web is not hype. The Semantic Web is not a matter of 'if'. Nor is the Semantic Web a matter of 'when'. The Semantic Web is already here! Chances are that you are looking at it right now (this page is RSS enabled).
Re:Cannot be overestimated by Anonymous Coward · 2007-05-04 08:57 · Score: 0

Microformats aren't really based on RDF. But the basic data structure is quite similar and entirely compatible with RDF and the Semantic Web.

TMA: Too Many Acronyms by EccentricAnomaly · 2007-05-04 03:05 · Score: 2, Insightful

Why assume everyone knows your acronyms. To me RDF means "Reality Distortion Field". Zeesh, 7 billion triples or whatever.

--
There are 10 types of people in this world, those who can count in binary and those who can't.

Re:TMA: Too Many Acronyms by QuickFox · 2007-05-04 03:17 · Score: 4, Funny

Why assume everyone knows your acronyms.

OMG: Oh my God!
WTF: What the fuck?
BBQ: Barbecue.

HTH

--
Terrorists can't threaten a country's freedom and democracy. Only lawmakers and voters can do that.
Re:TMA: Too Many Acronyms by BeerCat · 2007-05-04 06:07 · Score: 1

Why assume everyone knows your acronyms. To me RDF means "Reality Distortion Field".

I also wondered why anyone would need a search engine to go through Steve Jobs' notes. But... "RDF" could be "Radio Direction Finding".

Wiki has several more suggestions. The one I think this thread is about is at the bottom of the list.

--
"She's furniture with a pulse"

Mere fractions of a second? by Anonymous Coward · 2007-05-04 03:05 · Score: 0

Since everyone's being pedantic... I notice it takes more than one fraction of a second then.

Re:Mere fractions of a second? by Anonymous Coward · 2007-05-04 06:24 · Score: 0

Yeah. Like 1/2764363467090067 + 1/37362384875235992338. That's two fractions for you.

Web or Database search engine? by Anonymous Coward · 2007-05-04 03:07 · Score: 0

Ok, I might say something very stupid here but even the best search engine still isn't a WEB search engine but a DATABASE search engine (searching in copies/excerpts from websites previously (i.e. recently) acquired).

My question: has someone ever proposed (i.e. written down) ideas/plans/designs for a life-searchable web (altough something like that would seem impossible to me)? It might be a very interesting read however.

Re:Web or Database search engine? by Anonymous Coward · 2007-05-04 06:36 · Score: 0

A web search engine is a database search engine optimized for databases containing data from the Web.

A 'live searchable Web' is essentially the same as a peer-to-peer network, and plenty of research have been and is being done on P2P.

Could be interesting, but missing details by Anonymous Coward · 2007-05-04 03:11 · Score: 5, Interesting

What kind of data set did they use? The structure and contents of the graph that is the data in an RDF database has a huge impact on the performance of query execution, and different applications have different structures.

What kind of queries are they running? There are several different RDF query languages (think of SeRQL, RDQL, N3, SPARQL, etcetera) and some of them support quite complex queries. Quickly finding the answers to a simple query like

SELECT ?name WHERE ?name <http://xmlns.com/foaf/0.1/name> "John Smith"

is just a matter of an indexed lookup and not very special. But, like in SQL, much more complex expressions can be generated that require complex index operations on the query execution level. Having implemented an RDF database that supports SPARQL queries an order of magnitude faster than the software the W3C uses for their experiments (which, admitedly, doesn't have performance as a prime requirement), I know that it's possible to do simple things fast, but the interesting part is handling RDF queries that don't easily map to efficient database operations.

Which brings me to the most important point: where is their detailed report? Can I get the software somewhere and perform my own tests? The article is too vague to draw any conclusions about what their RDF database does, and how good it is. I'd love to read up on it, but I can't seem to find the information.

SUPER Speed by phoric · 2007-05-04 03:13 · Score: 2, Funny

Colonel Sandurz: Prepare ship for light speed. Dark Helmet: No, no, no. Light speed is too slow. Colonel Sandurz: Light speed is too slow? Dark Helmet: Yes. We're gonna have go right to... SUPER speed. [everybody gasps] Colonel Sandurz: SUPER speed? Sir, we've never gone that fast before. I do'nt know if this ship can take it. Dark Helmet: What's the matter Colonel Sandurz? Chicken? Colonel Sandurz: [Wimpering] Prepair ship! [Calms down] Colonel Sandurz: Prepare ship, for Ludicrous speed. Fasten all seat belts. [everybody fastens in their seat belts and locks all of the doors] Colonel Sandurz: Seal all entrances and exits. Lock all stores in the mall. Cancel the 3-ring circus. Secure all animals in the zoo... Dark Helmet: [Takes the intercom from Sandurz] Gimme that, you petty excuse for an officer! [speaks into the intercom as Sandurz puts on his seat belt] Dark Helmet: Now hear this, Ludicrous speed... Colonel Sandurz: [Interrupts] Sir, you better buckle up. Dark Helmet: [to Sandurz] Ah, buckle this. [Into the intercom] Dark Helmet: SUPER speed, go!

Re:SUPER Speed by VWJedi · 2007-05-04 04:59 · Score: 2, Informative

[Wimpering] Prepair ship! [Calms down] Colonel Sandurz: Prepare ship, for Ludicrous speed. Fasten all seat belts.

If you're going to steal a joke, you need to make sure to replace all references to the original. Find / Replace works great for this.
Re:SUPER Speed by phoric · 2007-05-14 10:50 · Score: 1

And how exactly does quoting a movie qualify as 'stealing a joke'?
Re:SUPER Speed by VWJedi · 2007-05-15 00:57 · Score: 1

And how exactly does quoting a movie qualify as 'stealing a joke'?

When you do it well, it's art. When you do it poorly, it's stealing.
Re:SUPER Speed by phoric · 2007-05-15 02:45 · Score: 1

And when you spend that much time caring, you're just a loser.

all about context by Anonymous Coward · 2007-05-04 03:17 · Score: 0

if those RDF statements are tiny and basically pointless, then searching 1 billion entries isn't all that hard. Especially if they are properly indexed. Most RDF engines suck ass at the moment. If they implement an efficient bitmap index for the RDF statements, the query times for complex n-dimensional queries should be basically constant. W3C's specs for semantic web suck ass and their approach is totally impractical.

Here's the Tech Report by aharth · 2007-05-04 03:19 · Score: 5, Informative

Hello, I am one of the main developers of SWSE. True, the press release is vague, but there is only so much you can say in a press release aimed for the general public.

We have a Technical Report available at http://www.deri.ie/fileadmin/documents/DERI-TR-200 7-04-20.pdf that should answer most of the technical questions.

From the abstract:

"We present the architecture of an end-to-end search engine that uses a graph data model to enable interactive query answering over structured and interlinked data collected from many disparate sources on the Web.

In particular, we study distributed indexing methods for graph-structured data and parallel query evaluation methods on a cluster of computers.

We evaluate the system on a dataset with 430 million statements collected from the Web, and provide scale-up experiments on 7 billion synthetically generated statements."

Re:Here's the Tech Report by $RANDOMLUSER · 2007-05-04 03:37 · Score: 3, Insightful

You are too modest. You're the lead author. Congratulations on a first-rate contribution to mankind. And such a young pup, too.

--
No folly is more costly than the folly of intolerant idealism. - Winston Churchill
Re:Here's the Tech Report by Anonymous Coward · 2007-05-04 03:57 · Score: 0

I skimmed the pdf real quick and noticed the index section didn't consider bitmap indexes. Given that RDF data is relatively static and queries are n-dimensional, it seems odd that bitmap indexes weren't considered. I've pointed this out to other RDF storage engines and they also made the same mistake of not considering bitmap indexes. Using an efficient bitmap index means queries would be near constant even for complex queries.
Re:Here's the Tech Report by Grabble · 2007-05-04 10:07 · Score: 1

So how can I tinker with this? What are the licensing options? When and where will the code be made available?

Also, what has your team done with utilizing GPUs as graph-parsing engines?

I know just enough about triple-stores and RDF to know this really is big news. Congrats.
Re:Here's the Tech Report by rvarada · 2007-05-04 16:05 · Score: 1

If I am paying a consultant 125 Euro per hour, I would expect a little more attention to detail from him.

At http://www.harth.org/~andreas/consulting/
[Priolva Technologies, Spain Dec 2005 - now CTO and co-founder] links to http://www.privolva.com/

So did he co-found Priolva Technologies or Privolva Technologies? Not being a grammar nazi or a spelling nazi - but I expect a high priced consultant to at least spell the name of his company right.

Two things... by PornMaster · 2007-05-04 03:31 · Score: 1

First, giving the amount of time and the number of items searched means nothing. Are they doing it on a BlueGene or an Apple II?

Second, the problem with "the semantic web" if you're relying on people providing the metadata themselves, is the reliability (trustworthiness?) of the person creating the metadata. There's a reason the meta name="keywords" tags aren't a significant factor if at all in any of the major search engines' ranking systems.

--
500GB of disk, 5TB of transfer, $5.95/mo

Re:Two things... by StefanDecker · 2007-05-04 04:08 · Score: 1

First: The experiments have been done on a 18 node cluster of cheap servers.
Second: There are other ways to get metadata - eg., via SIOC (see URL:http://sioc-project.org/>. But true, trust is an issue. And some people in DERI Galway are working on ranking algorithms on top of the search engine.

Re:Here's the first query result... by roshanpv · 2007-05-04 03:39 · Score: 0

Don't put slashdot to trouble with such messages

sounds fishy by vga_init · 2007-05-04 03:50 · Score: 2, Interesting

Of course a search based on meta data is going to be faster and more accurate, but only when the meta data is correct. We've had this since the beginning of the interweb; people would load up their pages with bogus meta data just to generate search traffic. Because of this dishonesty, search engines have had to resort to other methods of evaluating and indexing pages (for example, based on actual content).

I don't see any difference between this new RDF and that old stuff.

Re:sounds fishy by CaptSolo · 2007-05-04 04:04 · Score: 1

RDF is just a way to express knowledge. In answer to "any difference between this new RDF and ..." you may take a look at the W3C Semantic Web FAQ (published very recently).

Now, like you said what we find depends on what we feed into search engines and on the engines themselves. To this regard it's work for better search engines and ranking algorithms, and the work described here is an important step in this path. There's a link to a technical report and more details posted (by a developer) in another Slashdot comment.
Re:sounds fishy by bbtom · 2007-05-04 06:25 · Score: 1

Yes, and that is why the key thing in the Semantic Web space is the making of assertions. If I say that your page is about fishing and you say that your page is about fishing, that's no problem. If you say that your page is about fishing and 90,000 people say it's junk, that's when we have a problem. Having a data model like RDF which allows these assertions to be made across the web is what makes such a thing possible without it all becoming too centralised to be useful.

--
catch (HumourFailureException e) { e.user.send("You, sir, are a humourless idiot."); }

Save the hype by broothal · 2007-05-04 03:57 · Score: 1

So now we have a search engine capeable of making a godzillion searches in a data domain that does not exist yet. That's all great and dandy, and we do indeed need new models and architectures for search engines once (if) the web goes all semantic. However, when (if) the semantic web ever becomes a reality, this search engine will long be retired. So, this result is great from a research point of view, but don't expect it to leave the lab.

Re:Save the hype by StefanDecker · 2007-05-04 06:43 · Score: 1

I guess in the early days of the Web many people said the same thing - why bother if nobody is providing HTML pages and nobody is using HTML browsers (in fact, I remember that time very well).
Of course building a web of data is more demanding - the infrastructure is far more complicated.
But we have made tremendous progress over the last years - to the point where currently structured data coming from applications like Wikis, Mailing Lists, Bulletin Boards can, should and will be integrated. And progress is being made - eg., with things like FOAF or SIOC (see http://sioc-project.org/.
The service http://pingthesemanticweb.com/ provides a good overview - progress may be slow, but Metcalfs Law did prevail in the past. Why should it not in the future?
And what is the alternative?

Developer on this project by aidhog · 2007-05-04 04:18 · Score: 3, Informative

As one of the developers on the project (along with user aharth), feel free to ask any specific questions you may have here. The article is quite vague and so I refer you to a technical report at http://www.deri.ie/fileadmin/documents/DERI-TR-200 7-04-20.pdf/.

Re:Developer on this project by kellererik · 2007-05-04 05:24 · Score: 1

could be my browser (Safari), but clicking on your link leads to a 404, going through the main page and clicking my way through works, though. Just in case it happens to other users. Thanks for the link, now my friday evening is officially ruined, I got to read this right away. ;-)
Re:Developer on this project by wintermute42 · 2007-05-04 05:29 · Score: 1

I'm using Firefox under Windoz and I could not access the article either. It's a bad URL.
Re:Developer on this project by kellererik · 2007-05-04 05:44 · Score: 1

This is how I got the PDF:

go to www.deri.ie

click on "World Record 7 Billion Triples"

scroll down on the resulting page click on the word "here" in the last sentence.

get busy reading ;-)

I suspect that this is not a browser-related problem but a server-problem. The link in the OP and the one mentioned here is the same.
Re:Developer on this project by oever · 2007-05-04 06:47 · Score: 1

Does it handle text search with wild cards? What is the search syntax?
Can we download the code (what language)?

--
DNA is the ultimate spaghetti code.
Re:Developer on this project by aidhog · 2007-05-04 23:55 · Score: 2, Informative

Text search on literals is supported through Lucene. Only vanilla keyword searches are currently supported. The query syntax for the indexing is a subset of SPARQL. The code is Java and has yet to be officially released.

This is great and all by rsilvergun · 2007-05-04 04:21 · Score: 1

but why would I want to search several million statements from the Robotech Defense Force? I mean, sure I'm an Anime nerd, but there are limits...

--
Hi! I make Firefox Plug-ins. Check 'em out @ https://addons.mozilla.org/en-US/firefox/addon/youtube-mp3-podcaster/

Beer fund by billcopc · 2007-05-04 04:22 · Score: 0, Troll

Top o'the mornin' laddy! We've got this crackpot idea that doesn't work in real-world scenarios, but you see, we're out of Guinness and me welfare checks be runnin' dry. How's about a nice research grant to refill me beer fridge ?

Tally-ho!

--
-Billco, Fnarg.com

Mod parent up! by smartdreamer · 2007-05-04 04:23 · Score: 1

... for obvious reasons.

Web of Data (not just metadata) by CaptSolo · 2007-05-04 04:33 · Score: 1

Second, the problem with "the semantic web" if you're relying on people providing the metadata themselves, is the reliability (trustworthiness?) of the person creating the metadata.

One of misconceptions about the Semantic Web - that it's only about metadata when in fact it's about a Web of Data, e.g., currently locked in in databases, blog engines or social software sites. (related: SemWeb FAQ entry on "Does the Semantic Web require me to manually markup all the existing web-pages ... ?")

A very, very simple example - if you enable creation of RDF data creation in a WordPress weblog (via a WordPress SIOC plugin), all this information is generated automatically, from the data already inside a database. What you get is every blog post, etc. in a machine-readable form (RDF), ready for query and reuse.

Of course, that is very "light" semantics - expressing what the blog engine knows. As for data / structured content created by people directly - there's always risk for someone writing lies. Then there's a need for the concept of trust (can we trust the source?) and some ranking mechanism.

Boo @ Slashdotters by Anonymous Coward · 2007-05-04 05:04 · Score: 0

Give me a break! Scientists produce a technology that has the potential to change the face of the web and all they get is the usual, automated Slashdot moaning.

Listen, we're not just talking about a technology for searching cookie recipies. We're talking about a technology that will match your favorite ingredients with a recipe, find pastry chefs in your area who knows the recipe, weed out the ones that doesn't meet your quality standards, and, on your command, order the ingredients at the lowest available price, have them sent to the chef of your choice along with the payment for your order.

So, you just take a look around on Web 2.0 while eating your dry, factory produced, hard-as-a-rock cookies. I'll be enjoying my customized inexpensive home-baked quality pastries and getting real answers at the fraction of a second on the Semantic Web in the meantime.

Re:Boo @ Slashdotters by Hyperspite · 2007-05-04 05:55 · Score: 1

...and all they get is the usual, automated Slashdot moaning.
At least you've realized the truth, this being ./, everyone owns a spambot that searches for keywords in stories. You get this particular crapflood when the bots realize that something with the words "revolutionary" and "internet" (and possibly also "semantic")has appeared. Everything here is just markov chain output. Welcome to the future.

Also, there's an intelligent thread further up where the lead researcher posted that no one seems to be responding too :(

Hey, i created RDF (Reality Distorsion Field)
Cease or Desist!

Signed,

Steve Jobs

sounds foxy. by Anonymous Coward · 2007-05-04 05:14 · Score: 0

"RDF is just a way to express knowledge. In answer to "any difference between this new RDF and ..." you may take a look at the W3C Semantic Web FAQ (published very recently)."

Mozilla uses RDF under the hood.

Next up: Murky spam by Anonymous Coward · 2007-05-04 05:25 · Score: 0

Fuzzy Sets

Web 3.0 by Anonymous Coward · 2007-05-04 05:33 · Score: 0

Web 3.0 is near... Seriously - what does this development give us?

Fixed URL by CaptSolo · 2007-05-04 05:36 · Score: 2, Informative

2 all: remove the ending slash '/' from the URL above, it will work then.

Correct link: http://www.deri.ie/fileadmin/documents/DERI-TR-200 7-04-20.pdf

RDF is a bad idea by Zarf · 2007-05-04 06:16 · Score: 1, Insightful

I just read the basics of RDF and I can see that this could be a really really bad idea. If RDF is intended as an internal data representation for a search engine company to use then this is great. The search engine company or your own company's search engine staff can police and audit your RDF data. However, if I'm reading this right RDF is *supposed* to be populated by *volunteered* data. As such you're going to suffer not just the Wikipedia effect but all the problems seen in MetaData from an internet generation ago.

You'll see RDF associations linking the president to a crass picture of a donkey or a goat of some kind. You'll see companies set up to deliberately poison RDF data with false links designed to drive traffic to a site... you'll see sock-puppets and all kinds of other attacks.

This whole effort reminds me of the "this is spam" bit that was proposed to stop spam. You can't expect spammers to say to themselves, "wait, I better flip the this-is-spam but to true before I send this" you also can't expect people to not abuse the RDF system in similar ways.

Don't expect that if you RDF search for Stephen King that everything that comes up was actually posted by him. Imagine the pages that would get attributed to the president or Mr. T as a prank... the information would only be useful if you could verify the document as legitimate first.

The "is part of" feature is the most likely target of abuse I think. I could say that everything I wrote is part of the New York Times or as part of some official document that gets searched for often. The result would be erroneous hits in RDF search and artificial authority for my crack pot theories.

--
[signature]

Re:RDF is a bad idea by StefanDecker · 2007-05-04 06:26 · Score: 1

Zarf, you are absolutly correct that indeed raw RDF data can be polluted if crawled naively. That is exactly the reason why in all newer applications not the simple triple model is used, but actually quads, where the last argument may represent the source of the data. This data model is called named graphs.
So once you have the source recorded one is able to do trust computions with the graph and its source - eg., using pagerank like algorithms. Some sources can be assigned a low trust value, others can get assigned a higher one, based on their spam content and adoption of a web community (just like conventional webapges using pagerank).
Indeed the implementation that DERI reported on is realizing named graphs for exactly that reason, and Aidan is working on a ranking algorithm which is taking the source of the data into account.
Re:RDF is a bad idea by Anonymous Coward · 2007-05-04 08:12 · Score: 0

Unlike Wikipedia and e-mail, RDF is machine-readable. This means that the RDF data can be tagged by other RDF data. Then it only becomes a matter of finding a RDF tag source you trust - if necessary you pay someone to provide such a service.
Re:RDF is a bad idea by aidhog · 2007-05-05 00:08 · Score: 1

There is a workshop paper on a preliminary version of such an algorithm at http://sw.deri.org/2005/07/n3rank/paper/paper.pdf.
Re:RDF is a bad idea by Zarf · 2007-05-07 01:15 · Score: 1

Well, this is good stuff. Once again it seems that the web is saved by a variation of the karma point model. The more valuable data you provide the more likely your data is to be trusted the more heavily you are rated as a source. I guess I should pay more attention to search engines... it may be hard for some of you to believe but I've been ignoring them for the most part.

I did have the opinion that the only way to provide a truly useful search system was to create an engine that could read and understand unstructured text at least well enough to know what a page was really talking about. But then, I don't even work on web pages that much so I don't necessarily think about these things often. Thanks for the link.

--
[signature]
Re:RDF is a bad idea by Zarf · 2007-05-07 01:20 · Score: 1

Indeed the implementation that DERI reported on is realizing named graphs for exactly that reason, and Aidan is working on a ranking algorithm which is taking the source of the data into account.

Then the fast search engine is not really proving its speed on the real problem... only a problem sub-set. I see now I was missing the part about the quad. (None of the linked materials talked about them either I don't think) Thank you for educating me on that point.

--
[signature]

almost boundless by roman_mir · 2007-05-04 07:49 · Score: 1

Job's abilities are bounded by the little pinky of Chuck Norris.

--
You can't handle the truth.

What the hell is RDF? by Blakey+Rat · 2007-05-04 07:51 · Score: 1

Hard to know if this article is worth reading or not when the summary doesn't even tell me WHAT RDF IS!! Criminy.

--
Comment of the year

Reality Distortion Field by Namarrgon · 2007-05-04 10:16 · Score: 1

Yeah, I was wondering why they were so pleased about searching Steve Jobs' keynote addresses (and where they found 7 billion of them too).

--
Why would anyone engrave "Elbereth"?

wonderful by yahurd · 2007-05-04 10:54 · Score: 1

just what the man needs, a good search engine for his mesmerizing skills, im leaving california now.

use google to search their site by cheekyboy · 2007-05-04 13:22 · Score: 1

put this request into google

site:www.deri.ie technical report 2007 4 20

--
Liberty freedom are no1, not dicks in suits.

Interoperability with other RDF stores? by gridinoc · 2007-05-04 13:32 · Score: 1

Hello Andreas,

Does Yars2 provide any interoperability with Jena (or Sesame)?

For example I would like to create a Jena Model that has Yars2 as underlying storage, due to the fact that I can use over this model API from an ontology model to reasoners designed for Jena.

Is Yars2 available for download somewhere, or it is just the same Yars svn repository?

Overestimation by Phleg · 2007-05-04 13:53 · Score: 1

"'The importance of this breakthrough cannot be overestimated,' said Professor Stefan Decker, director of DERI."

I agree. In my estimation, this could well foretell the cure to AIDS, cancer, world hunger, war, and genital warts.

--
No comment.

Sounds like a parallel MonetDB system by Anonymous Coward · 2007-05-04 15:29 · Score: 0

From the paper: The use-case scenario is to find mutual acquaintances between two people. More specifically, the query is as follows: give me a list of people known to both Tim Berners-Lee and Dave Beckett. If that's really all there is to it then there's nothing new here other than some acronyms. It may be interesting research but apart from the RDF syntax there is nothing new here worthty of the PR hype.

Fast queries over 7 billion triples may sound like a breakthrough but systems like http://monetdb.cwi.nl/projects/monetdb/MonetDB/Ver sion4/Documentation/mel/index.html have been able to do that kind of thing blindingly fast for many years. Once you store everything as a BAT (binary association table), 20-way joins that would choke an ordinary RDBMS become easy.

The weakness of MonetDB-like systems is generally that they don't do well in the face of OLTP style transactional update patterns. They are optimized for loading data once and reading many times, not dealing with a steady stream of queries and updates, and typically you have no choice but to resort to http://infolab.stanford.edu/~backrub/google.html style parallel index construction, which is most of what the technical report goes into.

Slashdot Mirror

Super-Fast RDF Search Engine Developed

144 comments