Welkin: A General-Purpose RDF Browser
Stefano Mazzocchi writes "Many consider the Semantic Web to be vaporware and others believe it's the next big thing. No matter where you stand, a question always pops up: Where is the RDF browser?
The SIMILE Project, a joint project between W3C, MIT and HP to implement semantic interoperability of metadata in digital libraries, released today the first beta release of a general purpose graphic and interactive RDF browser named Welkin (see a screenshot), targetted to those who need to get a mental model of any RDF dataset, from a single RSS 1.0 news feed to a collection of digital data."
Considering the big 1.0 for Firefox is out, I would think people who wanted their Semantic Web browsing software to be wide-spread would implement it as a Firefox plugin, no?
Simpy
The question is about whether we really need a World Wide Web that looks like Wikipedia with links to every word and generally just a jumbled mess of blue and purple text. No matter how you cut it, the problem lies in having too much information immediately available.
Imagine you are a reading a book, but each word is connected by string to a dictionary reference, and each dictionary reference definition is tied to the definitions of the words in the definition. You'd end up with a huge, eventually circular mess of string and you couldn't realistically get any enjoyment out of the book. The fact of the matter is that if you want to get more information about something, it is easy to go to an outside source to look it up. It does not need to be easier, because by making it easier than it must be you necessarily end up cluttering the thing you want to illuminate.
There is an old saw, "Make things as simple as possible, but no simpler." The Semantic Web, while an interesting idea, tries to make things too easy, beyond the point of usefulness. The lack of content on the Semantic Web is a testament to the uselessness of such an over-engineered web space.
Isn't RDF much like the laser use to be? A solution looking for a problem?
enough people have said it, but it's worth while saying again. RDF is totally flawed and will never meet the vision of W3C. The whole idea that an RDF resource is true and authorative is just silly. Look at what happened to HTML metadata tag. I got abused instantly and search engines stopped using them. RDF rules is monotonic, which is just totally silly. that basically means any rules written in RDF will timeout if the data isn't already on that particular server. W3C should just give up already on RDF and move on.
"There is an old saw, "Make things as simple as possible, but no simpler." The Semantic Web, while an interesting idea, tries to make things too easy, beyond the point of usefulness. The lack of content on the Semantic Web is a testament to the uselessness of such an over-engineered web space."
Or a testament to people's inability to understand new paradigms.
After looking at that screenshot, it's sooo clear to me the value that the semantic web brings to us (mirrored here as their server appears to be flaking out a bit). If anything, this makes it crystal clear why the semantic web hasn't really taken off, other than in the much more limited form of RSS feeds.
A network of random connections of semantic concepts embodied as URIs is just not a friendly form of data for humans to manipulate directly, and I don't think it every will be. That's right, I don't believe this is really an issue that's solvable with slightly better tools. I think ultimately the management of and connection of ontologies is something that computers will have to learn to do themselves.
It's just too hard to expect normal human beings to describe knowledge in any way other than the way we are used to. The web is only as popular as it is because HTML is a simple, appearance-based way to markup documents (yes, I realize strictly speaking HTML isn't supposed to describe many aspects of appearance per se, but there's no denying that it comes from that root). We understand bold and italics (and even strong and em), but ask somebody to generate two concepts by constructing URIs for them and relating them in subject-predicate form and they are going to look at you and drool.
Even programmers aren't used to the idea of describing knowledge - it's one thing to tell a computer what to do, it's another thing to tell a computer how to know about something that you know.
Alright, I know I'm opening myself up to the flames here, so flame away. Anyway, I think the "semantic web" will need to wait for tools like Cyc et. al. to come along far enough to construct and relate their own ontologies out of English text, and until then all we will see is stuff like RSS or RDF files in Firefox extensions to describe deployment conditions (i.e. stuff that can be done with any arbitrary XML dialect that doesn't really qualify as the "semantic web" to me).
Niice. I've always wanted to know what's going on in Steve Job's head.
"enough people have said it [All related to the OP], but it's worth while saying again. RDF is totally flawed and will never meet the vision of W3C [And that is?]. The whole idea that an RDF resource is true and authorative is just silly [Just like the present web]. Look at what happened to HTML metadata tag. I got abused instantly and search engines stopped using them [They're used, just not alone]. RDF rules is monotonic, which is just totally silly. that basically means any rules written in RDF will timeout if the data isn't already on that particular server [Can you say local, and intranet?]. W3C should just give up already on RDF and move on. [Just like the advice we give those KDE guys]"
/
Read this.
http://www.oreilly.com/catalog/pracrdf/index.html
Maybe you'll learn something.
Yeah, lord knows proprietary software like MSIE has never had bugs or security issues.
it's nice seeing the first screenshot on a Mac.
Just something friendly about that.
That screenshot is from a Mac! I wonder if it is Mac-only (does not matter, I use Macs :-).
Dashboard Widgets
"Even programmers aren't used to the idea of describing knowledge - it's one thing to tell a computer what to do, it's another thing to tell a computer how to know about something that you know."
Yea. Just try getting a programmer to explain the latest thing they're working on. "Well you see it does this, and if you click on that, something happens. It's all too complicated to explain, sorry."
There is a significant difference between a production level bug and a trivial bug. This program is unusable. MS Excel is useable, but crashes in certain instances. This is nothing less than a failure of open source QA in this respect. Proprietary QA at least tries to make sure that production software works enough so that most people don't have problems.
The Incredible Hulk had fun with his halloween decorations but that's a warmup for his christmas lights where he plays RoShamBo when not helping out Google Compute.
Hulk SMASH Celiac Disease
Are people marking this Funny just to be cruel? I find this rather interesting. XUL's data model is RDF already, so it's not like Firefox doesn't already have the foundation to do this.
Karma: It's all a bunch of tree-huggin' hippy crap!
Marklar?
Ever read Infinite Jest (or anything else) by David Foster Wallace? QED
(2,3-Benzopyrrole)
Due to the changes required for IPV6. In order for it to work under IPV6, with asynchronous packets, all the packets must be bunched together for a total size of 4096 bytes (for RDF type (or child type) files to load dynamically per the standard). Since the header for an IPV6 packet can vary in size from packet to packet (if it keeps a log of its travels), this makes this task a little more difficult. Additionally, since normally the OS controls the size of the packet (for optimization), the IPV6 installed by the operating system can't be used (unlike TCP/IP). So a new IPV6 is implemented which is why it needs server/protocol support. It's a bitch (I have spent about 40 hours trying to find a workaround). Just another example of why IPV6 adds too much new overhead to be viable.
Make it the goal of next years International Obfuscated C Code Contest.
I'm sure we'll get a really cryptic one liner that actually is a fully functional RDF browser.
The Internet is full. Go Away!!!
Browsing metadata is the next frontier in the evolution of the web. Some of the other RDF browsers popping up include Gnowsis, MIT Haystack, and Fenfire.
With the growth of the Internet, the value of data itself is dropping, while the value of metadata (i.e. "data about data") increases, introducing a need for tools that can manipulate metadata. That is what RDF is all about - standardizing a way to represent metadata. It is not a standard for the metadata itself...those standards will be determined the same way everything else is on the Internet: with the best solutions rising to the top.
The most common objections to this scenario?
a) "Nobody will bother entering metadata". Wrong...it's already happening. Users are voluntarily generating metadata all the time. Just check out sites like flickr (photo blogging) and del.icio.us (collaborative bookmarks), not to mention Amazon reviews and Ebay ratings.
b) "RDF tags will just be abused with spam, trolls, and other useless info". A variety of techniques are emerging that are designed to protect the integrity of user-contributed data, including trust metrics like Slashdot's own distributed moderation (PDF) or Advogato.
Nooface
In Search of the Post-PC Interface
Check out Semaview Inc. who's making a business of RDF. They've already got one good product out. They're somewhat OSS friendly, too.
Personally, I think eventSherpa is pretty neat.
(Disclaimer: I know the CEO.)
-kidlinux.
Why would the Robotech Defence Force need there own browser?
load Slashdot faster, or block popups?
j/k
CB
free ipod and free gmail!
Not really, I want to be able to toggle hilighting on and off. Optimally my browser would let me look up any word... hmm, actually, it does with a plugin or two. Hooray for firefox. Personally I think that people who don't make a lot of links are doing the web a disservice, but maybe it's just me.
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
This http://simile.mit.edu/welkin/images/screenshot.gif
screen shot reminds me of my big college sophomore year project. Connecting lots of pretty lines together in hopes of impressing people by calling it a neural network. I have to give props though for getting the lines to be anti-aliased.
"What was I supposed to read? To the mods: if that link was dead from the the get go, you guys aren't doing your jobs."
Take the "/" off the end of the URL.
It'd make the internet look like a Tom's Hardware review... *Scanning a page with pointer... Moves pointer over word that just happens to be "thong"* Window pops up saying... "Thong: Type of underwear, usually stringlike, that gives you a MAJOR wedgie in all the wrong (or right) places. *mini link*BUY THONGS NOW CONDOM SEX SEXY BONDAGE $30.00" Things like those would be all over the page.
I promoted RDF a fair bit back in the late 1990s and even wrote one of the first libraries for it. I think that the idea of machine-readable data on the web is a very good one (and probably more scalable than the whole Web Services thing), but six or so years later, I don't think that RDF is it.
The trouble is that RDF (and OWL) try to do too much, getting all tangled up in the arcana of knowledge representation, and the Semantic Web thing has only muddied the waters further -- the screenshot is a stunning graphic representation of the mess that RDF has gotten itself into (I'll assume that it's serious, since it's a long time until 1 April).
All we really need for a data web is a bunch of XML files online that make references to each other for machines to follow, the same way that web pages make links -- in other words, a data web would be a distributed database, the same way that the document web is a distributed hypertext system. RDF reminds me more of the complex pre-HTML hypertext systems of the late 1980s than of the successful, simple formats and protocols that drive the Web.
Almost everybody here seems to be missing the point: RDF isn't for you--it's for your computer. The point of RDF and the Semantic Web is to structure knowledge so that programs can interact with one another to perform better, even in some cases simulating intelligent decisions. Unless you're working in developing Semantic Web technology, you should never have to look at an RDF document.
It's not a wiki. It's not a new way to see metadata. It's your softwares' version of the WWW.
It's not always about you humans.
1) people like to masturbate.
2) some people like to look at pictures of naked girls while masturbating.
3) some people like to think about graph theory while masturbating.
The semantic web is the unfortunate result of #3.
Now, while I have no problem with any of these behaviors, I do ask that people in group #2 to keep their sticky dirty magazines under their bed, not on their coffee tables; and people in group #3 to likewise keep their inventions locked in the closet, and not release them to standards bodies or working groups.
So when you see someone in a clear frenzy of sexual excitement talking to you about "ontologies" and "reification", simply smile politely, and call the police.
Remember, these people are the exception, not the norm, in an otherwise healthy society.
...semantic interoperability of metadata in digital libraries is a beautiful thing. And it doesn't stink half-bad either.
Let's see, here's a post with a broken link (bad data.. extra slash).
Yet it was modded up "Insightful" by some clueless mods who didn't even bother testing the validity of the data (bad metadata).
I think that sums up why the semantic web will never happen, in one neat example.
Is it just me, or does this post remind anyone of some pages on the Internet that feature ads tightly integrated into the page, where certain keywords are actually hyperlinks to product pages? (Yahoo News is an example, where they feature a link, in parentheses, to related news and Internet searches, for items such as Iraq, President Bush, etc.) If what I'm saying sounds really obvious, then I might have just pointed out a potential marketing advantage/annoying piece of advertising in RDF.
I am sure I will be modded as a troll, but somebody needs to say it, somebody needs to stop these guys.
The "Semantic web" is the latest snake oil being pawned by the AI community.
Nothing is worse than an AI-type. They make big claims and never deliver. They overly anthropomorphize all aspects of computation, fooling themselves into a false understanding of all that is related to computer science. For example, Emacs is "intelligent" because it includes a broken implementation of the lambda-calculus, an implementation that doesn't even properly implement beta-reduction? Since when has breadth-first search, depth-first search, and other search algorithms had anything to do with intelligence? They are just procedures for solving problems. The list goes on and on...
These people regularly try to disprove things such as the undecidability of various problems, the incompleteness of various logics, etc... as these undeniable mathematical proofs point to the fact that computers cannot be intelligent. Sure it is fun for sci-fi movies, but it surely isn't real science.
The semantic web is nothing more than AI-types recycling their same old crap. RDF and OWL, the two most popular scripting languages for the semantic web, are just "semantic nets", an AI concept from the 1970s, rehashed. Yup, when you can't sell any snake oil, just rename it to something else and profit... and profit they do, but does society ever see the AI-types' promises come to fruition?
Give me a break! Give the world a break. Computer science is just that: a science, and so the pseudoscience that is AI must be addressed by more people in the field of computer science.
One trouble regarding many semantic visualization techniques involving large datasets is: the more visually appealing a graph is rendered, the less useful it often becomes. Many projects undertaken over the past 6 years (including Welkin) have focused on 2- and 3-dimensional renderings of a dataspace, using lines, proximity, node-shape, fly-over metadata display, etc. to classify and relate nodes, only to find there is no room left for persistent display of the textual metadata that ultimately drives a user toward the content he/she is looking for.
Marcos Weskamp's Newsmap (slashdot) on the other hand demonstrates an excellent balance of form and function, emphasizing textual metadata over symbolic graphic representation. How might this approach be applied specifically to RDF? One possibility: 5 axes rendered in a 2d visual space: color (category), saturation (relevance), size (interest), x/y position (age) and text (metadata). Just a thought anyway.
That is a good example of why the semantic web just won't work. Here is another example:
j kd fbgskldjfbgwidfuiwdfbgosidfubgsdfgsdfgsjkdfbgskldj fbgwidfuiwdfbgosidfubgsdfgsdfgsjkdfbgskldjfbgwidfu iwdfbgosidfubgsdfgsdfgsjkdfbgskldjfbgwidfuiwdfbgos idfubgsdfgsdfgsjkdfbgskldjfbgwidfuiwdfbgosidfubgsd fgsdfgsjkdfbgskldjfbgwidfuiwdfbgosidfubgsdfgsdfgsj kdfbgskldjfbgwidfuiwdfbgosidfubgsdfgsdfgsjkdfbgskl djfbgwidfuiwdfbgosidfubgsdfgsdfgsjkdfbgskldjfbgwid fuiwdfbgosidfubgsdfgsdfgsjkdfbgskldjfbgwidfuiwdfbg osidfubgsdfgsdfgsjkdfbgskldjfbgwidfuiwdfbgosidfubg sdfgsdfgsjkdfbgskldjfbgwidfuiwdfbgosidfubgsdfgsdfg sjkdfbgskldjfbgwidfuiwdfbgosidfubgsdfgsdfgsjkdfbgs kldjfbgwidfuiwdfbgosidfubgsdfg
PORN, SEX, CUNT, FIST, PENIS, GAY, SEX, TEENS, PORN, SEX, CUNT, FIST, PENIS, GAY, SEX, TEENS, PORN, SEX, CUNT, FIST, PENIS, GAY, SEX, TEENS, PORN, SEX, CUNT, FIST, PENIS, GAY, SEX, TEENS
sdfgsjkdfbgskldjfbgwidfuiwdfbgosidfubgsdfgsdfgs
So this year the AI community claims that their latest snake oil is "the future". What a big surprise. What was it before this? Expert systems? Semantic nets? Oh, better not mention that cuz it is what the semantic web is a recycling of.
A couple more flaws about RDF. Say i go into a library and i look in the card catalog. If I travel to another country, they may have a different card catalog.
OWL is suppose to provide a framework for categorizing data. But who the hell gets to decide? Everyone or one standard body. If everyone defines their own ontology, then the likelihood some other application can use the data and ontologies is very unlikely. If one body defines the master ontology, will people adopt it? Not likely. What's the solution to this problem?
I'd like to think ontologies should be defined by each persion (ie everyone), but there should be some way for users to define who they trust and provide links from their own ontology to someone elses. Without that kind of support, any notion of describing data is going to be rather hard or impractical to implement. W3C needs to be broken up and the money spent on something more useful.
semantic interoperability of metadata
"I know all those words, but that sentence makes no sense."
Seriously though, it sounds like somebody just threw a bunch of buzzwords together to impress people. I understand the technology and have written software in this field, but I have absolutely no clue what "semantic interoperability of metadata" is meant to mean.
of all the things, what we really need is XXXDF. no one gives a rats ass about computers understanding data and using it. Let's do something productive like make it easier to find Pr0n and categorize them better.
anthropomorphization
knowledge, semantics, intelligent, reasoning, think, agent, meta-data, knowledge, semantics
I have been looking for a tool (that's better than Document inspector) to troubleshoot while I'm trying to code in RDF. I was hopeing for a debugger so I wouldn't have to test so many cases through multiple steps, but being able to see the structure may help some.
Too bad it doesn't take the XUL rules into consideration when redering maps like the one shown in the screenshot. Do you know if they are going to open development up anytime soon?
Go Gusties
Well, Slashdot stories are sometimes hyperlinked to death that way...
"In our tactical decisions, we are operating contrary to our strategic interest."
The point of metadata criticisms is that "untrustworthy" would have to be assigned to practically every tag. So, what is the point? In any case its just another opinion. Is dmoz.org trustworthy? Is it if I say so in my n3? Maybe, maybe not.
RDF has been collecting mold next to XSL and XML Schema for some time now. Its practically impossible to say anything interesting easily using RDF. Try composing even a moderately interesting RDF document yourself and you will see why no one is using it.
DJIA/NASDAQ traded firms specializing at least partly in statistical recognition of text as a business model:
GOOG,YHOO,MSFT etc etc
DJIA/NASDAQ firms using metadata as a business model:
When is the future supposed to arrive again?
The AI crowd has moved to statistical text analysis...ie, search engines. The polar opposite of what this crowd (the SGML die-hards) are doing. The SGML die-hards have made a valiant go of it, from the 80s until now...they really thought they had it made with XML until they killed it with standards and committees. Now they hope RDF will validate them, pun intended!
You mean "raises the question". It doesn't beg the question unless it assumes that you should get a cock ring as part of the argument for you getting a cock ring.
Statistical text analysis and link analysis are a superior technique because it presumes the author could be BSing. The entire document must contribute to the corresponding query value, not just keyphrases which could or could not be true. This is why Google is a $50 billion company and no RDF firm ever will be.
It's a Java Swing app, running on a Mac:
Welkin.java
Your hybrid is not saving the environment. Its purpose is to make you feel good about buying something.
Now that is about the worst argument I've heard against the semantic web. Just because the links are there doesn't mean you will be distracted by them. There is nothing that says that links are always blue and underlined. In fact, in your scenario, they would more likely look like regular text, but you could select a word or group of words and use a context menu to get a definition. Remember, the semantic web only provides semantics, not appearance. If and how to display the information it provides is a separate issue.
Please correct me if I got my facts wrong.
Personally, at work, it might be a good idea, unless the owner of the information does NOT want to be "connected" for business reasons. However, after umpteen years of hacking, I don't even open a browser anymore when I get home. I started two new hobbies, nether of which involves computers, just to un-stress and enjoy life some more. Guess that makes me un-RSS'ty. (I will go now.)
I really don't understand what is the point of this rules/proof stuff. What are they trying to do with the semantic web? How will this help my car to fly?
People who look at these browser screenshots and decide that the semantic web is/will be a mess stop thinking too early.
This graph-like presentation is just one way to show semantics, and it only works for certain things, like topic maps.
I'm sometimes using tools like outliners and the Brain (insert pun here) to present ideas and their relationships. This is not the way you would want to e.g. read/present a complex manual.
Other, more complex forms of presentation are required - and possible. Ted Nelson had a lot of ideas regarding hypertext and presentation of relationships that have never turned into products. I'm working on my own little, Xanadu-ish project that aims to make navigation in structured text easier. The benefit is not presentation "A" or "B" - but the fact that you will be able to tweak the presentation according to what you need to know. This requires semantics, which in turn requires new tools both for the author, not (only) for the reader.
One day, we will look back and wonder how we could live with an Internet where a search engine had to guess if we are looking for Lotus The Car or Lotus The Flower or Lotus The Software Company, or where separating articles by an author from those about him was nearly impossible. No-one in their right mind can claim this is good enough for the future.
I would say that XUL is more like HTML than RDF. However, you're right that Mozilla's framework has built-in support for querying RDF datastores (although primitive compared with Jena or Redland). In fact Mozilla internally represents bookmarks through RDF even though they are serialized in a pseudo-html syntax on disk (for compatibility reasons). The history, extension registry, and file system are also RDF-based. Mozilla may very well be the most widely distributed framework for accessing RDF datastores on the planet!
It is impossible to enjoy idling thoroughly unless one has plenty of work to do.
- Jerome Klapka Jerome
Ten years ago, we used to say, "we are angry at this or that issue." Now, we say, "we are frustrated". The entire idea of trying to reduce our language to assertions of logic is silly because as soon as we do so, we immediately try to change the definition of one of the composing elements in order to get out of it.
Here's the biggest problem with the web. Most people that have web sites have them to sell stuff, and they DON'T WANT their stuff to be easily searchable and diced and sliced. All this interoperability and data exchange stuff XML, Web Services, Internet Tools, even old COM Objects and CORBA objects and even older RPC all failed, has failed, and will always fail because people don't want you to make it easy to compare you against someone else. It's not a question of cost.
It's just stupid to do it. What, I'm going to pay someone to make it easier for customers to choose someone else for any product? That's the most retarded thing in the world.
If you wanted to make the next big web, make one where people CAN'T compare content from your site to someone elses and have zones of it be franchized off for exclusivity. Like, I'd make a browser where every page sent is just a giant bitmap, and that way, it couldn't be scraped at all.
This is my sig.