Untangling Web Information
Ostracus writes "The next big stage in the evolution of the Internet, according to many experts and luminaries, will be the advent of the Semantic Web — that is, technologies that let computers process the meaning of Web pages instead of simply downloading or serving them up blindly. Microsoft's acquisition of the semantic search engine Powerset earlier this year shows faith in this vision. But thus far, little Semantic Web technology has been available to the general public. That's why many eyes will be on Twine, a Web organizer based on semantic technology that launches publicly today."
First, before anything even really started, The Semantic Web was merely a pipe dream.
.... twice. And we were happy.
... the Semantic Web went mainstream and started getting real.
...
But that was the long long ago, so let's fast forward a few years. When its future looked most bleak, Sir Tim (who can summon fire and explosions at will) told us what to expect
Then a few years passed and nothing.
Until the 2006 World Wide Web conference made us suspicious of the Semantic Web. We spread rumors about the Semantic Web and told all the cooler technologies that the Semantic Web was just out to rape our privacy. So we challenged the Semantic Web. And claimed it would fail.
Just when I was expecting Sir Tim to get underneath a blanket & release a sobbing YouTube video of everyone being bastards for attacking The Semantic Web right when she was going through really tough times and that we should all just leave her alone
I've got no problem with people pushing technologies but this one sounds more like a soap opera than anything. Has the Semantic Web changed anything for anyone on Slashdot? I haven't seen anything directly if it has
My work here is dung.
will include a digital rights management compliant
cloud based on a service oriented architecture
that will empower my workgroup over the new semantic web 2.0
insert license fee here.
Good people go to bed earlier.
The advertisers and search engine optimizers have already shown that they have absolutely ZERO qualms about providing false or misleading information to search engine robots in the form of page cloaking, hidden frames, false meta tags, etc so what makes anyone believe that they will not play the same games, possibly with even greater result, against the semantic web? There is money to be made by gaming the system and as long as it is possible for website operators to describe themselves on the semantic web then they will describe themselves in any way they have to to drive traffic to their sites and get ad hits, truth be damned.
Twine seems to be just a generic contextual search engine, as opposed to a pure keyword search engine. While it's a step, it's a very tiny step.
What I want to see is more about the correlation between topics. For example, if I'm looking into PHP templating and search twine, I get a few people's bookmarks on the topic. Nothing especially useful, and definitely nothing I couldn't find elsewhere. With real semantics I'd want to see a list of various templating engines, pro and con articles grouped for each, and maybe other sections on related design patterns and frameworks.
In other words, I want to see semantics. Context search isn't going to make anyone turn their head.
Developers: We can use your help.
It works like a charm. I entered "parsing" as an area of interest and it came back with "Beer"...
:-)
I was quite surprised by how quickly it understood me and my interests based on such a limited amount of information
She made the willows dance
...how are they supposed to teach a machine to infer meaning better than they're able to?
I'm seriously wanting to know.
Indeed, <meta content="sex, porn, titties, hotties, clit, cunt, naked, nude">
Of course it's not really a porn site, but the site operator found that it gave him hits. My (admittedly limited) look at the "semantic web" shows no sign that it will be any less suceptable to being gamed.
Free Martian Whores!
In a nutshell, the goal of the Semantic Web is to bring knowledge representation to the Web (using graphs, networks, binary predicates, however you want to call it).
I've been trying to apply data from the Semantic Web for a few years now.
I can see two roadblocks to mainstream adoption:
* Web data is immensely scruffy. If thousands of people contribute to a dataset without any restrictions, you get a mess (e.g. multiple URIs used to denote the same class or individual, which results in fractured data). Having said that, I can see some convergence happening on reusing URIs (for classes that has happened for a while now, for instances this is getting better every day).
* Without proper data, it's hard to show the benefit of having a web-wide knowledge base. Right now, my marketing pitch for our semantic web search engine is to go "from documents to objects", i.e. you want to locate objects (the person CmdrTaco) rather than documents matching keywords.
Once you have achieved a web-wide knowledge base of decent quality, you can start thinking about how to navigate that information space to actually answer questions (and I don't mean natural language understanding, but a point-and-clic, menu-based interface). CmdrTaco's phone number, people he knows, blog posts he's written, and so on.
The chicken-and-egg circle is slowly breaking up. For a demo, our system is online at http://swse.deri.org/.
Buncha anti-semantists on this site.
Life needs more saving throws.
All the semantic web gives you is the ability to layer a logical design over data. It's like a database design, except it's "open world", meaning there can be many different designs, it's up to the agent to pick the one it trusts, and it can't really make assumptions based on what it doesn't know.
The only inferences made are those that have been imagined by some human designer. And they might be very wrong , if the designer was wrong.
The "kinds" of inferences available are also pretty limited, like hierarchy or transitivity, or set membership. Useful, yes, but stepping stones...
-Stu
I worked at a big tech company doing SemWeb, where my experience was exactly the same. Everyone was scratching their head.
Now I've moved into Healthcare IT environment, where SemWeb makes perfect sense. Its like the best tool for the job.
The essential difference is what end of the stick you are picking up. The tech folks who are trying to shoe-horn RDF/OWL onto anything n everything (e.g. search) are failing. On the other hand, Healthcare/Life science folks who have to work with heavy knowledge intensive stuff, its working like a charm.
The SemWeb story is quite similar to Amazon Kindle.. wherein the tech folks are hating it whereas real users are all over it.. So it might seem like a failure to all you tech bozos.. but the domain experts are lovin' it.