Slashdot Mirror


Why the Semantic Web Will Fail

Jack Action writes "A researcher at Canada's National Research Council has a provocative post on his personal blog predicting that the Semantic Web will fail. The researcher notes the rising problems with Web 2.0 — MySpace blocking outside widgets, Yahoo ending Flickr identities, rumors Google will turn off its search API — and predicts these will also cripple Web 3.0." From the post: "The Semantic Web will never work because it depends on businesses working together, on them cooperating. There is no way they: (1) would agree on web standards (hah!) (2) would adopt a common vocabulary (you don't say) (3) would reliably expose their APIs so anyone could use them (as if)."

4 of 179 comments (clear)

  1. One word: SPAM by ngunton · · Score: 5, Insightful

    The thing the academics who push the semantic web fail to consider (most of the time) is that the Real World does not function like their Ideal World. In the Ideal World, everybody cooperates and works together to produce something of value for all mankind. So we get lots of correctly and appropriately marked up pages that give useful information on what's stored therein.

    But in the Real World, any online system that is used by a large enough number of people will eventually become attractive for spammers and scammers to defile and twist to their own purposes. So you'll get a deluge of pages that appear to be useful reviews of digital cameras (and are marked up as such) but in fact simply go to a useless "search" page that has lots of link farm references.

    And if you say "Ok, so we don't trust the author of the page, we have someone else do it"... then who? Who's going to do all the work? Answer: Nobody. AI is nowhere near being smart enough for this. Keyword searching is, unfortunately, here to stay. If you trust the author to do the markup, then the spammers have a field day. If you say "Only trusted authors" then the system will still fail, due to laziness on most people's part - if a system isn't trivial to implement and involves some kind of "authentication" or "authorization" then nobody will use it, period. The Web succeeded in the first place because anybody anywhere could just stick up a Web server and publish pages, and it was immediately visible to the whole world.

    The Semantic Web will fail for the same reason that the "meta" tag failed in HTML: Any system that can be abused by spammers, will be abused.

    So, the Semantic Web, which is all about helping people find stuff, will fail. Not because of any technological shortcomings (it's all very nice in theory), but simply because we as people won't work together to make it work. Well, a small number of people could work together, but as that number got larger, until it reaches the point of being useful, it will automatically get to the tipping point where it becomes worthwhile for the spammers to jump in and foul it all up.

  2. Re:It will fail for other reasons too by tbriggs6 · · Score: 5, Insightful

    Have you ever read the original presentation of work by Codd on relational databases? How about the RFC standards on TCP/IP? How about the original presentation and arguments on the inclusion of Interrupts in a processor? Boy, those were so easy to understand and obvious that they were even published at all. The process of science is to push the state of the art; which by definition is new and novel. This is the job of the computer science researcher. It is left to others to examine the research and reformulate in terms that mere mortals can understand. If you understand the concepts behind the OSI layers, Lambda expressions, or symmetric multi-processing, thank a computer science educator who abstracted and distilled the hell of the science and research and packaged in such a way that you can understand it and maybe even use it. To claim that failure is imminent because the current presentation of the Semantic Web is too complex is nonsense.

  3. Re:Reason #1 the Semantic Web will fail by Wah · · Score: 5, Insightful

    For one thing, Google's results are much too noisy. For another, it relies on keywords occurring on pages, and that's rather primitive

    No it doesn't. The genius of google was that it relies on people linking to pages talking about keywords. And uses various tools to identify and promote good linkers.

    But the most important reason is that it would be much cooler to have a web where you could say "give me a list of all the goals scored by Romario" and have it list them for me.

    That's a curious thing to ask for, since the first google result is a story about how there is a good bit of controversy surrounding Romario's "1,000" goals. The problem is your request is to vague and doesn't define all the words within itself (i.e. does a goal scored as teenager in a different league count?).

    This goal is quite a bit higher than many realize, as you could get 10 people (5 of them experts) in a room and they wouldn't necessarily be able to agree on the "right" answer.

    To ask, or even demand, that computers do the same task as a background function is ludicrious, IMHO (at least when applied to a universal context).

    --
    +&x
  4. RDF promotes interoperability and extensibility by Finin · · Score: 5, Insightful
    Stephen's argument is based on the belief that "The Semantic Web will never work because it depends on businesses working together, on them cooperating." He says:

    "But the big problem is they believed everyone would work together:
    • would agree on web standards (hah!)
    • would adopt a common vocabulary (you don't say)
    • would reliably expose their APIs so anyone could use them (as if)"
    While the argument he makes is grounded in his distrust of corporations, which I share to some degree, his second point above is off the mark, at least for RDF.

    One of the features of the W3C's model (based on RDF) is that it doesn't push the idea that everyone should adopt the same vocabulary (or ontology) for a topic or domain. Instead it offers a way to publish vocabularies with some semantics, including how terms in one vocabulary relate to terms in another. In addition, the framework makes it trivial to publish data in which you mix vocabularies, making statements about a person, for example, using terms drawn from FOAF, Dublin Core and others.

    The RDF approach was designed with interoperability and extensibility in mind, unlike many other approaches. RDF is showing increasing adoption, showing up in products by Oracle, Adobe and Microsoft, for example.

    If this approach doesn't continue to flourish and help realize the envisioned "web of data", and it might not after all, it will have left some key concepts, tested and explored, on the table for the next push. IMHO, the 'semantic web' vision -- a web of data for machines and their users -- is inevitable.