Slashdot Mirror


Tim Berners-Lee and the Semantic Web

An anonymous reader writes "As we all know, Tim Berners-Lee is the hero of the Web's creation story--he conjured up this system and chose not to capitalize on it commercially. It turns out that Sir Tim (he was knighted by Queen Elizabeth II in July) had a much grander plan in mind all along--a little something he calls the Semantic Web that would enable computers to extract meaning from far-flung information as easily as today's Internet links individual documents. In an interview with Technology Review, the Web-maestro explains his vision of 'a single Web of meaning, about everything and for everyone.'"

7 of 250 comments (clear)

  1. What Does 42 Mean for Privacy? by Allen+Zadr · · Score: 3, Interesting
    'a single Web of meaning, about everything and for everyone.'

    So, once this is off the ground, who wants to bet that the answer really is, 42?

    Seriously though, this could be really cool, but I imagine that this could have some very adverse effects on privacy given the amount of information that finds itself on the web. Items that are linked by obscurity in disperate places would be easily linked into a single profile (If the stuff he's talking about isn't primarily smoke and mirrors). Either way, like any powerful technology, it will have both good and bad consequences. Here's hoping for the good...

    --
    Kinetic stupidity has a new brand leader: Allen Zadr.
  2. 'Twas a happy day on SemWebCentral... by tcopeland · · Score: 3, Interesting

    ...when the man himself signed up for a user account. w00t!

  3. Why is a hero? by Gothmolly · · Score: 3, Interesting

    Because he chose not to capitalize commercially on the Web? How is the measure of your altriusm the measure of your heroism? I understand that many people DO feel that way, but nobody has ever really explained WHY heroism is a necessary consequence of altriusm. Why is someone who makes a profit necessarily evil? The man who invented a corrugated-cardboard coffee-cup holder holds a patent on it; every Starbucks coffee sold puts a penny in his pocket. Why is that wrong?

    --
    I want to delete my account but Slashdot doesn't allow it.
  4. Ontology by dodongo · · Score: 5, Interesting

    I want to offer an alternative, as proposed by Victor Raskin at Purdue. I speak for neither Sergei Nirenburg nor Victor (who does enough talking for himself).

    While this idea for more thorough, concise, and accurate searches is a good one, I would question whether embedding semantic tags into web pages is the way to go.

    As outlined in Ontological Smenatics, there is an automated system of semantic processing already underway. Basically, it takes a text, then runs it through a parser, which looks up meanings in a lexicon, then reduces whatever translation it comes up with to a text-meaning representation (TMR), by pushing the concepts from the lexicon through an ontology / onomasticon / world-knowledge library. The TMR is basically the "pulp" of the semantics of the article, web page, book, or whatever it's been fed. It just contains the ideas, the things involved, and other relevant concepts, stripped of all other linguistic information.

    TMR is great, becuase the TMR can be used then, by reversing the process and using the lexicon of another language, to translate a text from one language to another.

    However, it seems to me that with the bits and pieces of the TMR stored in a search engine's index, this could be a huge boon for the search engine.

    Instead of just trying to match keywords, by parsing the TMR of web pages and by parsing TMR of search strings, you no longer search for keywords, but keyconcepts.

    The advantage to semantic searches / indexes by this implementation is manifold:

    -Searches (and the web as a whole) will gain the richness Mr. Berners-Lee is advocating.

    -Web authors will not be able to lie in their semantic tags, or otherwise misinform spiders what the page is about (remember tags?)

    -No extra work is required in the actual construct of the web or *ML standards. The TMR is only generated and stored by the sites / processes that need it.

    -Others?

    Just an alternative solution, for fun :)

  5. Re:The rest of us call this... by j1m+5n0w · · Score: 3, Interesting

    Google identifies relationships between data using only on the links between pages containing the data.

    The Semantic web represents relationships between data based on metadata (i.e. data about data). This is a far more powerful way to describe the meaning of data.

    This is an important point. Google computes the pagerank of a page based on the eigenvector of the web link matrix, which is a clever and usually effective approach. Unfortunately, each link only conveys a little bit of information. A link from page A to page B is assumed to be an endorsement of page B's relevance by page A. But what if you could add extra metadata to the links? Not just a URL and a human readable text label, but a machine readable label as well, like this?

    <a href=http://slashdot.org relevance=0.3 novelty=0.8 accuracy=-0.2 funny=0.2> slashdot </a>

    If you could apply arbitrary attributes to web pages, google would have much better information to go on, and a user could specify the importance of certain attributes depending on what he/she is looking for.

    -jim

  6. Re:The rest of us call this... by JimDabell · · Score: 3, Interesting

    Google's a hack. No, really, it tries to extract meaning from web pages that really aren't engineered to store that kind of information.

    Google is also an application. The Semantic Web is all about building the infrastructure so applications like Google don't have to chase the holy grail of AI to become more than a hack. Think of the Semantic Web as the layer underneath Google.

  7. The need for information management pops up again. by master_p · · Score: 4, Interesting

    If you have followed this little crazy guy that is me, you may have seen that most of today's computer problems are because modern operating systems offer nothing in the information management department.

    Remember the CVS story a couple of days before? it's information management: http://slashdot.org/comments.pl?sid=123076&cid=103 47565

    WinFS is also about information management: http://slashdot.org/comments.pl?sid=121101&cid=101 99083

    The story that the Evolution e-mail client offers the e-mail data as a data model separate from the application? another information management issue.

    The web? information management issue.

    Distributed databases? information management issue.

    Web search engines? information management issue.

    Windows search tool? information management issue.

    The Windows registry? information management issue.

    The unix etc directory? information management issue.

    Enterprise workflows? again, an information management issue. That's why there is no general workflow solution accepted and used worldwide.

    Dynamic web site contents? information management issue.

    The semantic web? another information management issue!

    As you can see, from the numerous examples given above, all that an operating system should do, but no one does, is that it must manage information instead of files. If that is coupled with a distributed networked environment, 90% of the world's software would be considered obsolete overnight and the productivity and fun from using computers will increase 10fold.

    If any open source developer is reading this, you may contact me for a private discussion on the idea. THIS IS OPEN SOURCE'S BIGGEST CHANCE TO LEAD THE TECHNOLOGICAL RACE!