Slashdot Mirror


Welkin: A General-Purpose RDF Browser

Stefano Mazzocchi writes "Many consider the Semantic Web to be vaporware and others believe it's the next big thing. No matter where you stand, a question always pops up: Where is the RDF browser? The SIMILE Project, a joint project between W3C, MIT and HP to implement semantic interoperability of metadata in digital libraries, released today the first beta release of a general purpose graphic and interactive RDF browser named Welkin (see a screenshot), targetted to those who need to get a mental model of any RDF dataset, from a single RSS 1.0 news feed to a collection of digital data."

43 of 189 comments (clear)

  1. Semantic Web Firefox plugin? by otisg · · Score: 5, Funny

    Considering the big 1.0 for Firefox is out, I would think people who wanted their Semantic Web browsing software to be wide-spread would implement it as a Firefox plugin, no?

    --
    Simpy
    1. Re:Semantic Web Firefox plugin? by Paua+Fritter · · Score: 2, Informative

      There is a Mozilla extension called Annozilla which adds the ability to browse and edit RDF annotations to web pages, using the Annotea protocol.

    2. Re:Semantic Web Firefox plugin? by WankersRevenge · · Score: 3, Funny

      I personally won't be satisfied until someone writes a standards compliant browser in HTML.

  2. The question is not about a browser by Dancin_Santa · · Score: 5, Interesting

    The question is about whether we really need a World Wide Web that looks like Wikipedia with links to every word and generally just a jumbled mess of blue and purple text. No matter how you cut it, the problem lies in having too much information immediately available.

    Imagine you are a reading a book, but each word is connected by string to a dictionary reference, and each dictionary reference definition is tied to the definitions of the words in the definition. You'd end up with a huge, eventually circular mess of string and you couldn't realistically get any enjoyment out of the book. The fact of the matter is that if you want to get more information about something, it is easy to go to an outside source to look it up. It does not need to be easier, because by making it easier than it must be you necessarily end up cluttering the thing you want to illuminate.

    There is an old saw, "Make things as simple as possible, but no simpler." The Semantic Web, while an interesting idea, tries to make things too easy, beyond the point of usefulness. The lack of content on the Semantic Web is a testament to the uselessness of such an over-engineered web space.

    1. Re:The question is not about a browser by Moby+Cock · · Score: 2, Interesting

      Hear, hear. Wikipedia is about the most hyperlinking that I could stomach. It is useful in that specific application, but the notion of the Semeantic Web is silly. For crying out loud, people on /. don't RTFA, let alone verify all definitions of the words in the summary.

    2. Re:The question is not about a browser by gl4ss · · Score: 2, Insightful

      make it so that they don't look like different links.. until you press some button or triple click on the word or whatever..

      so.. invisible strings that you can see if you wish.

      --
      world was created 5 seconds before this post as it is.
    3. Re:The question is not about a browser by 0x461FAB0BD7D2 · · Score: 2, Insightful

      Welkin is simply a PoC, IMO. It just attempts to prove that you can link information together in a fairly suitable way. This is always the first step in any new technology. Other products could, and probably should, use it for different purposes.

      Your main objection lies in that it does not filter information, but adds to the mass information overload humans experience daily. However, this can be changed simply. Welkin seems to dump all data at once. The code could be changed so you could traverse ideas. I can already see the usefulness of such a thing for educational purposes.

      The lack of content on the Semantic Web is a testament to its current lack of usefulness. If there was more content on it, it would be inherently more useful.

    4. Re:The question is not about a browser by mat+catastrophe · · Score: 4, Funny

      but about how quickly can Microsoft turn it into a security hole for your friends and family?

      --
      sig not found
    5. Re:The question is not about a browser by sonsonete · · Score: 4, Insightful

      The point of the Semantic Web lies not in making information readily available to people browsing the internet but in providing semantic context with which computers can work. A person reading a document in a browser is not expected to follow links attached to every word. Rather, a computer program is expected to be able to use this information to learn the meaning behind the sting of characters.

      --
      "Folks bent on reinventing the wheel should understand that if it's not round, it ain't a wheel." - Jonah Goldberg
    6. Re:The question is not about a browser by greg_barton · · Score: 2, Insightful

      The fact of the matter is that if you want to get more information about something, it is easy to go to an outside source to look it up.

      The semantic web isn't about human usability. It's about building machine intelligence and knowledge.

    7. Re:The question is not about a browser by Anonymous Coward · · Score: 5, Funny
    8. Re:The question is not about a browser by Anonymous Coward · · Score: 2, Funny

      so.. invisible[FNORD] strings that you[FNORD] can see if you wish[FNORD].

    9. Re:The question is not about a browser by base_chakra · · Score: 2

      The question is about whether we really need a World Wide Web that looks like Wikipedia with links to every word and generally just a jumbled mess of blue and purple text ....
      Imagine you are a reading a book, but each word is connected by string to a dictionary reference, and each dictionary reference definition is tied to the definitions of the words in the definition. You'd end up with a huge, eventually circular mess


      Although your concerns about user interface are well-taken, you seem to be thinking about this strictly in terms of hyperlinks as presented by web browsers, which is a rather limited view. Behaviors could be user-defined, hidden, and abstracted in virtually endless ways.

      For example, even now double-clicking any word in the Opera web browser activates a context-sensitive menu with such options as Dictionary, Encyclopedia, Search, Translate, etc.

    10. Re:The question is not about a browser by Fnkmaster · · Score: 2, Interesting

      The semantic web isn't about human usability. It's about building machine intelligence and knowledge.

      Right, but the problem is if it's unusable for humans to _create_ that content, or to map it from human knowledge-space into machine-parseable format, then it doesn't matter if it's well-engineered from the machine's perspective. That's why adoption of the semantic web has been so poor (outside of applications that could just as well be filled with any ole' XML dialect, like RSS or RDF descriptors used to package Firefox extensions, and so on).

      Nobody wants to hire a team of ontological engineers to map information they already have in human accessible form into some highly structured, machine parseable format, and pay them to keep that information up-to-date. Mind you, companies only started paying people to put stuff up on the web when it became clear there was demand, and the early adopters of the web were individuals and academics, but the web was accessible from day one - I put up my first personal web page when I was 15 years old or so, and it took me about an hour to figure out how to do it.

      Also remember that big companies spend tens of millions of dollars hacking together some HTML for their website. Imagine how much they would have to pay to get people smart enough to construct ontologies and RDF data versions of all of their content. Yowsers!

    11. Re:The question is not about a browser by greg_barton · · Score: 4, Funny

      Nobody wants to hire a team of ontological engineers to map information they already have in human accessible form into some highly structured, machine parseable format, and pay them to keep that information up-to-date.

      Please don't tell that to the company I'm interviewing with on Friday. :)

    12. Re:The question is not about a browser by Earlybird · · Score: 2, Insightful
      • And how is the meaning behind a string of characters given? For example, lets say you want to give the meaning behind a strong of characters that describes to a human the proof of Skolem's Paradox.
      It's given by marking it up. The computer doesn't need to know anything about the proof of anything, just like Google doesn't know anything about porn, and yet when you search for "big boobs", it knows what to return. *wink*

      The point isn't that a computer program will ever "know" what Skolem's paradox is, in the same way a human would "know" what it is. The semantic web isn't about building artificial intelligence into computers, but rather adding knowledge statements to information.

      If you tell a computer than Einstein is a scientist, that Einstein is a German, that Einstein won the Nobel prize in physics in 1921 and that this is an image of Einstein, then a computer will be able to infer that this picture is of a German scientist.

      Based on this information, I could ask the computer for pictures of all the other German scientists who were awarded the Nobel prize in 1921, or some other time. Clearly the computer doesn't need to know about nationalities, or dates, or to understand pictures.

      There are simpler use cases, too. Say there's a product called Paradox (well, there used to be one). People searching for just the word "paradox" might get matches for pages about "Skolem's paradox". But if the pages were appropriately marked up, Google (or whatever) could ask you whether you meant a specific paradox, just the way Google currently asks if you perhaps meant something else.

    13. Re:The question is not about a browser by Jagasian · · Score: 2, Insightful

      Yes, but all this assumes that people agree on exact precise ways of representing everything. Just as in flat UNICODE text, you can describe the same thing in multiple ways, in RDF you can also describe the same thing in multiple ways that still differ in RDF's semantics. For example, there are multiple ways to encode a ternary predicate in terms of binary predicates. Each one of these represenations will not just differ in syntax, but also in RDF's semantics. Nothing is gained!

      RDF and the semantic web assume an ideal situation in which all information is complete and formatted in a uniform way.

  3. Solution space? by Anonymous Coward · · Score: 2, Interesting

    Isn't RDF much like the laser use to be? A solution looking for a problem?

  4. RDF a load of crap by Anonymous Coward · · Score: 4, Insightful

    enough people have said it, but it's worth while saying again. RDF is totally flawed and will never meet the vision of W3C. The whole idea that an RDF resource is true and authorative is just silly. Look at what happened to HTML metadata tag. I got abused instantly and search engines stopped using them. RDF rules is monotonic, which is just totally silly. that basically means any rules written in RDF will timeout if the data isn't already on that particular server. W3C should just give up already on RDF and move on.

    1. Re:RDF a load of crap by Yosi · · Score: 3, Interesting
      There are those who worry about these things.
      Much work on the semantic web has been with n3
      N3 is a superset of rdf, allowing for quoting of groups of triples (known a formulae). In n3, you can say things about groups of n3 triples, including about their trustworthiness.

      For instance, you can say:
      [is log:semantics of <documentURI> ] a :untrustworthyInformation .
      essentially saying that the formula which is the semantics of the given document if of a class :untrustworthyInformation, which your n3 parser may attach special meaning to.

      There are many who are very wary on n3 for precisely the same reasons.

      Note that I will always plug n3, given that I'm heavily involved with cwm.
  5. The question is not about a browser-Paradigm by Anonymous Coward · · Score: 2, Informative

    "There is an old saw, "Make things as simple as possible, but no simpler." The Semantic Web, while an interesting idea, tries to make things too easy, beyond the point of usefulness. The lack of content on the Semantic Web is a testament to the uselessness of such an over-engineered web space."

    Or a testament to people's inability to understand new paradigms.

    1. Re:The question is not about a browser-Paradigm by Anonymous Coward · · Score: 5, Interesting

      Or a testament to people's inability to understand new paradigms.

      Or a testament of the inability of the paradigm's creator to get people to understand it's necessity.

  6. Gee thanks... by Fnkmaster · · Score: 5, Insightful

    After looking at that screenshot, it's sooo clear to me the value that the semantic web brings to us (mirrored here as their server appears to be flaking out a bit). If anything, this makes it crystal clear why the semantic web hasn't really taken off, other than in the much more limited form of RSS feeds.

    A network of random connections of semantic concepts embodied as URIs is just not a friendly form of data for humans to manipulate directly, and I don't think it every will be. That's right, I don't believe this is really an issue that's solvable with slightly better tools. I think ultimately the management of and connection of ontologies is something that computers will have to learn to do themselves.

    It's just too hard to expect normal human beings to describe knowledge in any way other than the way we are used to. The web is only as popular as it is because HTML is a simple, appearance-based way to markup documents (yes, I realize strictly speaking HTML isn't supposed to describe many aspects of appearance per se, but there's no denying that it comes from that root). We understand bold and italics (and even strong and em), but ask somebody to generate two concepts by constructing URIs for them and relating them in subject-predicate form and they are going to look at you and drool.

    Even programmers aren't used to the idea of describing knowledge - it's one thing to tell a computer what to do, it's another thing to tell a computer how to know about something that you know.

    Alright, I know I'm opening myself up to the flames here, so flame away. Anyway, I think the "semantic web" will need to wait for tools like Cyc et. al. to come along far enough to construct and relate their own ontologies out of English text, and until then all we will see is stuff like RSS or RDF files in Firefox extensions to describe deployment conditions (i.e. stuff that can be done with any arbitrary XML dialect that doesn't really qualify as the "semantic web" to me).

    1. Re:Gee thanks... by Fnkmaster · · Score: 2, Informative

      Your comments have no relevance to mine - I have done a fair amount of work with semantic web technologies before (well, compared to most people out there anyway), my comments were a response to my personal experiences, not some random misconceptions formed by reading Slashdot articles.

      1) I never said anything of the sort. RDF/Semantic Web technologies have nothing to do with inserting links into HTML.

      2) I never said it was a replacement for HTML. I just said it wasn't likely to be adopted because of the difficultly of creating content in a properly structured, ontologically connected RDF format.

      Of course your secretary isn't supposed to hand-edit RDF files, but somebody has to not only write code that dumps stuff from a database into RDF (easy - not really any different from dumping into any ole' XML format) but map all the stuff into relevant ontologies (not easy), where "easy" is defined in terms of being comprehensible enough to permit adoption outside of academia.

      3) It's only easier for two corporations to merge databases if all the entities therein are connected by direct or indirect ontological relationships. People have to build these relationships. That was the whole point of my post.

      4) I said nothing about URIs being associated with viewable web pages. Stop inserting random straw man attacks.

      Apparently you are the one who needs to RTFM instead of getting up on your high horse there, buddy. Not everybody on Slashdot is as ignorant as you presume.

  7. RDF browser? by aixou · · Score: 2, Funny

    Niice. I've always wanted to know what's going on in Steve Job's head.

  8. RDF a load of crap-Says an AC. by Anonymous Coward · · Score: 3, Insightful

    "enough people have said it [All related to the OP], but it's worth while saying again. RDF is totally flawed and will never meet the vision of W3C [And that is?]. The whole idea that an RDF resource is true and authorative is just silly [Just like the present web]. Look at what happened to HTML metadata tag. I got abused instantly and search engines stopped using them [They're used, just not alone]. RDF rules is monotonic, which is just totally silly. that basically means any rules written in RDF will timeout if the data isn't already on that particular server [Can you say local, and intranet?]. W3C should just give up already on RDF and move on. [Just like the advice we give those KDE guys]"

    Read this.

    http://www.oreilly.com/catalog/pracrdf/index.html/

    Maybe you'll learn something.

  9. Do you really want web pages that look like this? by xmas2003 · · Score: 3, Funny
    --
    Hulk SMASH Celiac Disease
  10. Why is this funny? by Trejkaz · · Score: 4, Insightful

    Are people marking this Funny just to be cruel? I find this rather interesting. XUL's data model is RDF already, so it's not like Firefox doesn't already have the foundation to do this.

    --
    Karma: It's all a bunch of tree-huggin' hippy crap!
    1. Re:Why is this funny? by otisg · · Score: 2, Interesting

      Yeah, why is this funny? I wasn't trying to be funny - this is a valid question.

      All this talk about GBrowser (Google's browser), for instance.... what do you think it is going to be based on? Firefox, of course! You don't think Google would be crazy not to make use of that powerful, flexible, extensible platform that runs on all major operating systems, has support of geeks and hackers to the point where they contribute $250K for The New York Times advert.

      --
      Simpy
  11. The International Obfuscated C Code Contest by 3770 · · Score: 3, Funny


    Make it the goal of next years International Obfuscated C Code Contest.

    I'm sure we'll get a really cryptic one liner that actually is a fully functional RDF browser.

    --
    The Internet is full. Go Away!!!
  12. This is the future of the web by Nooface · · Score: 4, Informative

    Browsing metadata is the next frontier in the evolution of the web. Some of the other RDF browsers popping up include Gnowsis, MIT Haystack, and Fenfire.

    With the growth of the Internet, the value of data itself is dropping, while the value of metadata (i.e. "data about data") increases, introducing a need for tools that can manipulate metadata. That is what RDF is all about - standardizing a way to represent metadata. It is not a standard for the metadata itself...those standards will be determined the same way everything else is on the Internet: with the best solutions rising to the top.

    The most common objections to this scenario?
    a) "Nobody will bother entering metadata". Wrong...it's already happening. Users are voluntarily generating metadata all the time. Just check out sites like flickr (photo blogging) and del.icio.us (collaborative bookmarks), not to mention Amazon reviews and Ebay ratings.
    b) "RDF tags will just be abused with spam, trolls, and other useless info". A variety of techniques are emerging that are designed to protect the integrity of user-contributed data, including trust metrics like Slashdot's own distributed moderation (PDF) or Advogato.

    --

    Nooface
    In Search of the Post-PC Interface
  13. This business isn't vaporware.. by kidlinux · · Score: 2, Interesting

    Check out Semaview Inc. who's making a business of RDF. They've already got one good product out. They're somewhat OSS friendly, too.

    Personally, I think eventSherpa is pretty neat.

    (Disclaimer: I know the CEO.)

    --
    -kidlinux.
  14. Seriously now by dynamic_cast · · Score: 2, Funny

    Why would the Robotech Defence Force need there own browser?

  15. The wrong answer to the right question by dpm · · Score: 4, Insightful

    I promoted RDF a fair bit back in the late 1990s and even wrote one of the first libraries for it. I think that the idea of machine-readable data on the web is a very good one (and probably more scalable than the whole Web Services thing), but six or so years later, I don't think that RDF is it.

    The trouble is that RDF (and OWL) try to do too much, getting all tangled up in the arcana of knowledge representation, and the Semantic Web thing has only muddied the waters further -- the screenshot is a stunning graphic representation of the mess that RDF has gotten itself into (I'll assume that it's serious, since it's a long time until 1 April).

    All we really need for a data web is a bunch of XML files online that make references to each other for machines to follow, the same way that web pages make links -- in other words, a data web would be a distributed database, the same way that the document web is a distributed hypertext system. RDF reminds me more of the complex pre-HTML hypertext systems of the late 1980s than of the successful, simple formats and protocols that drive the Web.

    1. Re:The wrong answer to the right question by Fnkmaster · · Score: 2, Interesting

      Well, you would certainly know, ease of use is critical to winning developer mindshare and promoting adoption of technologies - and I would point to SAX as a great example of this for promoting use of XML in the Java community.

      It does seem to me that the key thing is to promote ad-hoc use of a relatively standardized mechanism for relating XML document structures to other XML document structures. Forget about waiting for somebody else to build relevant ontologies, reconstructing the entirety of human knowledge from the ground up, or any of that stuff. What people could reasonably do today is relate XML schema one to XML schema two because they need to connect widget A with widget B. Make the adoption of this technology as low cost as possible.

      Just like adding a few anchor tags to a basic HTML document is an easy way to relate some human readable information to other human readable information, relating XML document types to other XML document types should be "easy".

      Then the only big problem is to find a few applications that would actually demonstrate the benefits of doing this clearly. Yes, it is effectively a distributed XML database of sorts, but what is it good for? RSS has real applications for end users, so it has caught on. Without some software to demonstrate the benefits of linking up your XML data structures, people just won't bother with it. It seems specific, realistic use cases are what's needed here (and what seems terribly lacking from all the W3C RDF documentation as well). How does the distributed, semi-structured database that results provide use to me beyond what I have now with lots of disparate XML documents out there, when you cut out the truly grandiose notions behind RDF and the full-fledged semantic web?

      I'm too tired to come up with convincing arguments right now, so hopefully somebody else will fill in the blanks here.

  16. Narcissism by pico303 · · Score: 5, Insightful

    Almost everybody here seems to be missing the point: RDF isn't for you--it's for your computer. The point of RDF and the Semantic Web is to structure knowledge so that programs can interact with one another to perform better, even in some cases simulating intelligent decisions. Unless you're working in developing Semantic Web technology, you should never have to look at an RDF document.

    It's not a wiki. It's not a new way to see metadata. It's your softwares' version of the WWW.

    It's not always about you humans.

  17. The semantic web, in a nutshell by Anonymous Coward · · Score: 5, Funny

    1) people like to masturbate.

    2) some people like to look at pictures of naked girls while masturbating.

    3) some people like to think about graph theory while masturbating.

    The semantic web is the unfortunate result of #3.

    Now, while I have no problem with any of these behaviors, I do ask that people in group #2 to keep their sticky dirty magazines under their bed, not on their coffee tables; and people in group #3 to likewise keep their inventions locked in the closet, and not release them to standards bodies or working groups.

    So when you see someone in a clear frenzy of sexual excitement talking to you about "ontologies" and "reification", simply smile politely, and call the police.

    Remember, these people are the exception, not the norm, in an otherwise healthy society.

  18. Visualization conundrum by SavvyPlayer · · Score: 2, Interesting

    One trouble regarding many semantic visualization techniques involving large datasets is: the more visually appealing a graph is rendered, the less useful it often becomes. Many projects undertaken over the past 6 years (including Welkin) have focused on 2- and 3-dimensional renderings of a dataspace, using lines, proximity, node-shape, fly-over metadata display, etc. to classify and relate nodes, only to find there is no room left for persistent display of the textual metadata that ultimately drives a user toward the content he/she is looking for.

    Marcos Weskamp's Newsmap (slashdot) on the other hand demonstrates an excellent balance of form and function, emphasizing textual metadata over symbolic graphic representation. How might this approach be applied specifically to RDF? One possibility: 5 axes rendered in a 2d visual space: color (category), saturation (relevance), size (interest), x/y position (age) and text (metadata). Just a thought anyway.

  19. BZZZT, RDF is past sell-by date already by Ars-Fartsica · · Score: 2, Informative

    RDF has been collecting mold next to XSL and XML Schema for some time now. Its practically impossible to say anything interesting easily using RDF. Try composing even a moderately interesting RDF document yourself and you will see why no one is using it.

  20. More like answer to a question no one asked by Ars-Fartsica · · Score: 3, Interesting

    Statistical text analysis and link analysis are a superior technique because it presumes the author could be BSing. The entire document must contribute to the corresponding query value, not just keyphrases which could or could not be true. This is why Google is a $50 billion company and no RDF firm ever will be.

  21. Re:There goes those AI-types. by greg_barton · · Score: 2

    d00d, you're getting into a religious fervor here. I refer you back to my blacksmith analogy.

    You're saying to the blacksmith, "HAHA! You haven't made a 747 yet!"

    Blacksmith says, "I don't have the proper tools. Maybe if I had a better hammer..."

    You: "You blacksmiths always say that. HAHA! You'll never make it!"

    Eventually a 747 was made. I'll bet, if you'd been around then, you would have criticised the airplane makers every step of the way...

  22. Dont let poor presentation fool you. by Post · · Score: 2, Insightful

    People who look at these browser screenshots and decide that the semantic web is/will be a mess stop thinking too early.

    This graph-like presentation is just one way to show semantics, and it only works for certain things, like topic maps.

    I'm sometimes using tools like outliners and the Brain (insert pun here) to present ideas and their relationships. This is not the way you would want to e.g. read/present a complex manual.

    Other, more complex forms of presentation are required - and possible. Ted Nelson had a lot of ideas regarding hypertext and presentation of relationships that have never turned into products. I'm working on my own little, Xanadu-ish project that aims to make navigation in structured text easier. The benefit is not presentation "A" or "B" - but the fact that you will be able to tweak the presentation according to what you need to know. This requires semantics, which in turn requires new tools both for the author, not (only) for the reader.

    One day, we will look back and wonder how we could live with an Internet where a search engine had to guess if we are looking for Lotus The Car or Lotus The Flower or Lotus The Software Company, or where separating articles by an author from those about him was nearly impossible. No-one in their right mind can claim this is good enough for the future.

  23. RDF in Mozilla by Quantum+Jim · · Score: 2, Informative

    I would say that XUL is more like HTML than RDF. However, you're right that Mozilla's framework has built-in support for querying RDF datastores (although primitive compared with Jena or Redland). In fact Mozilla internally represents bookmarks through RDF even though they are serialized in a pseudo-html syntax on disk (for compatibility reasons). The history, extension registry, and file system are also RDF-based. Mozilla may very well be the most widely distributed framework for accessing RDF datastores on the planet!

    --
    It is impossible to enjoy idling thoroughly unless one has plenty of work to do.
    - Jerome Klapka Jerome