Slashdot Mirror


Challenging the Ideas Behind the Semantic Web

mytrip writes to tell us that after a recent presentation to the American Association for Artificial Intelligence (AAAI) Tim Berners-Lee was challenged by fellow Google exec Peter Norvig citing some of the many problems behind the Semantic Web. From the article: "'What I get a lot is: "Why are you against the Semantic Web?" I am not against the Semantic Web. But from Google's point of view, there are a few things you need to overcome, incompetence being the first,' Norvig said. Norvig clarified that it was not Berners-Lee or his group that he was referring to as incompetent, but the general user."

144 comments

  1. Problems w/ the Semantic Web by CTalkobt · · Score: 4, Insightful

    is the users.

    Not the ones searching but the ones creating the content.

    They'll be some idiot out there (like there is now) that will code his data in a way that guarantees that he gets the most page views etc. So often searched terms will turn up on search indexes and other ilk.

    It's a loosing proposition unless you come up with filters but then they have their own set of problems.

    --
    There's a gorilla from Manilla whose a fella that stinks of vanilla and has salmonella.
    1. Re:Problems w/ the Semantic Web by ErikTheRed · · Score: 1, Funny

      Could be worse. You could try to make it find something useful within the domain "myspace.com".

      --

      Help save the critically endangered Blue Iguana
    2. Re:Problems w/ the Semantic Web by klenwell · · Score: 1

      The answer lies not in filters. All will be revealed when the Great Parser comes forth.

      --
      Innovation makes enemies of all those who prospered under the old regime... -- Machiavelli
    3. Re:Problems w/ the Semantic Web by CRCulver · · Score: 4, Interesting

      ...is the users. Not the ones searching but the ones creating the content.

      Sure, the technical limitations of Joe Public might slow the growth of the Semantic Web on the whole, but what few people realize is that the Semantic Web has already existed for years in in-house or limited-audience networks. Just look at FOAFnaut (an update in a few weeks will return it to full usability) or the very much real-world examples in Geroimenko & Chen's Visualizing the Semantic Web (Springer, 2005).

    4. Re:Problems w/ the Semantic Web by Anonymous Coward · · Score: 0

      Or a worthwhile, non-bitchy comment on /.

    5. Re:Problems w/ the Semantic Web by JiveDog · · Score: 1
      it's a loosing proposition...

      ...as long as it's not a losing proposition everything will just need to be tightened to the point where the "UI" of the semantic web is fine-tuned for Joe SixPack.

    6. Re:Problems w/ the Semantic Web by CaptSolo · · Score: 3, Interesting

      The problem with users (authors) is valid when we consider individual authors creating data (RDF, HTML, ...) "by hand". TimBL has referred to the Semantic Web as a global database of knowledge (as compared to the current web of text content). The problem of incompetent users goes away and higher value of data is achieved when exposing already existing content and databases on the Semantic Web. Think sites like SlashDot, wordpress.com, amazon.com, NY Times, ...

      Authoring of RDF data is not so different from authoring XML or RSS. This means that costs of putting your site on the Semantic Web are quite low. The benefits are a global reuse of information.

      For example: it is easy to install WordPress SIOC plugin to export RDF from any WordPress based weblog. Individual users don't have to care what RDF is or looks like. And the data about all posts and comments are now computer readable and can be reused in a number of ways, e.g., to create a TimeLine of your posts.

      If we take this approach and expose data from existing sites in RDF, the task of authoring quality data can be accomplished. The problem of spam referred in the article can be dealt with by signing the information - since Semantic Web is still young the problems of misuse can be addressed in the architecture right from the beginning.

      I would like to focus your attention in another important area - consumers of Semantic Web data. There is and will be quality data out there. What is interesting now is to find new and useful ways to use this information and add value over what can be done with simple web pages.

    7. Re:Problems w/ the Semantic Web by dbc001 · · Score: 2, Interesting

      We already have that problem without the Semantic Web. Semantic Web coding is not a fix for that problem, it's a fix for other problems.

      This is like saying "Don't use Open Source software because people will do bad things with it". People will do bad things with or without Open Source software, and with or without the Semantic Web.

      Anyway the article isn't very clear... By "Semantic Web", are we talking about using <div>s and <p>s instead of <table>s and <br>s? Or are we talking about microformats? Or something else?

      One more thing... if google doesn't support the "Semantic Web", it will most likely fail.

  2. nifty! by Anonymous Coward · · Score: 0
    Norvig clarified that it was not Berners-Lee or his group that he was referring to as incompetent, but the general user."
    Score one for the rest of us?
    1. Re:nifty! by The_Cheese_Stands_Al · · Score: 1

      Well, I would say the average /.er would classify as competent. Think of your average myspace denizen, or someone of that nature. On another note, Norvig seems to have put his foot in his mouth. Nice recovery, though.

    2. Re:nifty! by nolsen · · Score: 2, Interesting

      I dunno, I'm guessing he knew exactly what he was saying... But I do wonder if he was trying to tease Tim Berners-Lee a little. It would be interesting to see/hear audio/video of that exchange.

  3. Semantics... by Thakandar2 · · Score: 5, Funny

    "Norvig clarified that it was not Berners-Lee or his group that he was referring to as incompetent, but the general user."

    Here I was, thinking we were arguing over Semantics...

  4. Damn by ErikTheRed · · Score: 4, Funny
    "...Norvig clarified that it was not Berners-Lee or his group that he was referring to as incompetent, but the general user."
    Because Norvig vs. Berners-Lee going 10 rounds in a cage is something I'd pay to see.
    --

    Help save the critically endangered Blue Iguana
    1. Re:Damn by salmon_austin · · Score: 2, Informative

      In the U.S. regular cage fights are 3 rounds, with championship fights being 5 rounds. There are no 10 round cage fights that I am aware of anywhere in the world.

  5. Are you just another Anti-Semanticist? by dolo724 · · Score: 2, Interesting

    I'm calling the Anti-Neutrality Web Designers of Amerika!

    Demands of inequality such as this should be allowed!

    (btw, the spelling doctor has "loosing" as in "loosing the hownds for the huhnt")

    --
    But you just gotta have another sigarette
    1. Re:Are you just another Anti-Semanticist? by Crayon+Kid · · Score: 2, Insightful

      I don't think he's "another Anti-Semanticist". He's just saying that the whole semantic Web concept is based on this: that people will classify content properly and in good faith. Let's be fair, what are the chances of it not being abused? And if so, doesn't it mean that the semantic Web is doomed from the start?

      Think of all the things that were fouled by abuse. Email was a very sweet thing until it got perverted by spam. Newsgroups too. If the possibility for abuse exists, it will happen.

      --
      i ate crayons when i was a kid and now i have two braincells and the blue ones taste nicer
  6. Semantic web is currently fragile technology by UR30 · · Score: 5, Interesting

    The current semantic web seems to offer a technology too fragile to use on the global scale. The complexity of various classification and ontological schemes, work needed to provide the metadata etc. Also, semantic web seems to offer great opporturnities for spammers and other mischief makers. Now we already have comment and reference spamming, but semantic web (on the global scale) raises the possibilities enormously.

    1. Re:Semantic web is currently fragile technology by znu · · Score: 4, Insightful

      The full semantic web scheme really ignores a lot of what the Internet has taught us about what technologies succeed. It's not about grand visions and long specifications, it's about simple stuff that solves real problems of limited scope. Look at RSS, for instance; it's about the simplest thing which could do the job it does.

      I think we'll eventually realize most of the benefits of the semantic web, but it won't be a result of a grand vision imposed from the top down and implemented all at once. It'll probably be though increasing adoption of microformats, which don't try to classify and specify everything, and are implemented entirely using existing web standards.

      --
      This space unintentionally left unblank.
    2. Re:Semantic web is currently fragile technology by mgblst · · Score: 1

      At some stage, things get complicated, or you are left with a mess. This is the next step, and that step comes with classifiying everything. It is not such a grand vision, it just means adding extra information about objects on a page. This can be done in a number of ways, not necessarily that complicated. You just have to describe a bit better the page you put on, putting it into contect, so that the computer can interpret better what that page means. It is a good thing.

    3. Re:Semantic web is currently fragile technology by Crayon+Kid · · Score: 2, Insightful
      Look at RSS, for instance; it's about the simplest thing which could do the job it does.
      But may I point out, in addition to your comment, that such technologies have fared well as long as the human element is closely involved with them. RSS, social bookmarks, tags, microformats.

      On the other hand, Tim Berners-Lee seems to stress the fact that the semantic Web is all about AI doing content classification for us. So I think it's time we remember the old joke, "artificial inteligence is no match for natural stupidity". Or for human malice, I should add.

      I see a problem in all this AI involvement. It's a single point of failure of sorts, if you will, similar in a way to the one involved in precisely identifying people's identity: the more you trust an automated system, the more badly you'll be burned if the system is abused into reporting the wrong thing.

      The theory is wonderful, so's the Web, the Internet, computers and so on. But they are used by people. I have a hard time believing people will behave and resist the temptation to abuse this system just like they have abused countless others before.
      --
      i ate crayons when i was a kid and now i have two braincells and the blue ones taste nicer
    4. Re:Semantic web is currently fragile technology by Bogtha · · Score: 3, Insightful

      Tim Berners-Lee seems to stress the fact that the semantic Web is all about AI doing content classification for us.

      I don't think I've seen him stress that in the sense that the users are dissassociated from the process. The Semantic Web is all about representing things like tags, microformats, etc, in a generic way.

      For example, if comment moderation was defined in terms of a relationship between a person, a comment, and an opinion, that doesn't mean a computer would be moderating comments, it just means that the same mechanism could be applied across multiple websites, without having to build moderation into the websites themselves. You could mod Dvorak -1, Troll, and everybody who lists you in their FOAF file using a browser that supports it, would see that moderation.

      Just because the focus is on making the software smarter, it doesn't mean that it's about replacing user opinions with computer opinions. In fact, the majority of Semantic Web stuff I've seen have been all about codifying user opinions to make them more accessible to computers, and thus, more easily exposable to the end-user in a useful way.

      --
      Bogtha Bogtha Bogtha
  7. Googlebombing by QuantumFTL · · Score: 4, Insightful

    The biggest problem with the semantic web is spam. If you can trust the tags, it's a beautiful idea. If you can't, it's worse than useless - it's a waste of time. Google has the right idea, automatic extraction of semantics from content. If there's no real content, then (hopefully) that will be reflected in the semantic analysis.

    Me, I estimate we're 5-10 years away from doing anything terribly useful with all of this stuff, but I can definitely envision the day when an internet without semantics seems as distant as an internet without Google.

    1. Re:Googlebombing by Wastl · · Score: 5, Insightful

      The "Semantic Web" is not about search engines, as you and many other posters seem to believe. It is about representing Web content in a structured, formal way that is more easily accessed by machines, going beyond simple presentation. This can be used for searching, but also for many other applications, e.g. integration, exchange, personalisation, ... .

      Spam content on the Semantic Web is in no way different to spam content on the normal Web (well, except that it is formal). This also means that a search engine that is capable of working with Semantic Web data has exactly the same issues with trust as traditional search engines. Except that on the Semantic Web, trust can be expressed formally as well. Similar to the authorities in Google, whose outgoing links make a statement about the trustworthiness of other sites, an "authority" on the Semantic Web can make statements about the trustworthiness of other sites. However, these statements are explicit, and they could also be used to state that another site is *not* trustworthy.

      Google has the right idea, automatic extraction of semantics from content.

      Google does not extract any semantics from content. It merely analyses the linking between websites and connects that with keywords. No semantics here.

      Sebastian

    2. Re:Googlebombing by QuantumFTL · · Score: 4, Informative

      Google does not extract any semantics from content. It merely analyses the linking between websites and connects that with keywords. No semantics here.

      I believe you are referring to PageRank, which is one of many algorithms used by google to determine search relevance. This article discusses their use of Latent Semantic Indexing, which is a somewhat crude but effective form of sematic inference which is widely used in the field of NLP.

    3. Re:Googlebombing by russellh · · Score: 1
      The biggest problem with the semantic web is spam. If you can trust the tags, it's a beautiful idea. If you can't, it's worse than useless - it's a waste of time. Google has the right idea, automatic extraction of semantics from content. If there's no real content, then (hopefully) that will be reflected in the semantic analysis.
      yes, in theory, nobody needs google in a semantic web populated by lawful good users. But the power of google is in its verification, which will translate to the semantic world. ultimately it may not be all that different from today, except search will be highly nuanced, and may be more like browsing among relevant things rather than wondering what keywords to type.
      --
      must... stay... awake...
    4. Re:Googlebombing by QuantumG · · Score: 1

      I don't get it. Most all of us these days are writing webapps that spit out xml and have a CSS style sheet that makes that stuff pretty. So what's left, standardizing how the xml should be structured? Maybe instead of dictating how it shall be the Semantic Web proponents should go out and look at what xml people are spitting out and do something useful with it. Then people will see that they too can offer users something useful by making their xml more readable by your tools. Why is it that folks like the W3C seem to try very hard to work against network effects instead of working with them?

      --
      How we know is more important than what we know.
    5. Re:Googlebombing by Eivind · · Score: 1
      I agree. By default Google considers any link to a page an implicit endorsement of the page. Which is a problem, you see stuff like comment-spam attempting to increase page-rank by making it appear that slashdot endorses a certain site when that isn't the case.

      There's a extension to disable this, something like rel="nofollow" that says, essentially, the link should not be considered an endorsement.

      But even more useful would be the possibility to explicitly say what relation you have to some site.

    6. Re:Googlebombing by navarroj · · Score: 2, Informative

      > Google has the right idea, automatic extraction of semantics from content.
      >
      > Google does not extract any semantics from content. It merely analyses the linking between
      > websites and connects that with keywords. No semantics here.

      Google does extract semantics from content in a few particular domains: addresses and bussines info for Google maps, show times and additional information on movie searches, dates and appointments from Gmail to Google Calendar, ...

      The semantic web has already started. Now we only have it in a few and simple enough domains but, I agree, this should be the right way to go.

    7. Re:Googlebombing by Yvanhoe · · Score: 1

      Similar to the authorities in Google, whose outgoing links make a statement about the trustworthiness of other sites, an "authority" on the Semantic Web can make statements about the trustworthiness of other sites.

      Want to manage a $10 billion company in ten years ? Here is your plan...

      Just my two cents, soon to be gazillions...

      --
      The Wise adapts himself to the world. The Fool adapts the world to himself. Therefore, all progress depends on the Fool.
    8. Re:Googlebombing by sgt101 · · Score: 1

      And no semantics anywhere (outside of humans) either, just sets of relationships attached to symbols stored in computers. I think it's important to say, the semantics in OWL (for example FOL is the same) come from an agreement between and within communities in humans which are framed in natural language (not maths, the meaning of the maths is what we are agreeing) and are therefore subject to debate.

      As an example some people don't accept constructed proofs as valid. This makes a lot of physics and maths inaccessible or unproven in thier view, but it is a reasonable enough stance, although not necessarily the most useful. The point is that this is about stances and not immutable truth.

      --
      --------------------------------------------- "In the end, we're all just water and old stars."
    9. Re:Googlebombing by Bogtha · · Score: 1

      Most all of us these days are writing webapps that spit out xml and have a CSS style sheet that makes that stuff pretty.

      XML has no semantics whatsoever. None. It's a way of serialising and unserialising a tree of elements and attributes. It's markup languages that are built on top of XML that contain the semantics. Part of the Semantic Web is finding a good representation for the deeper semantics that are pervasive on the web. Think less about "This bit of text is a paragraph" and more about "This is the relationship X has with Y".

      With that infrastructure, you can express all kinds of different things with minimal special-case knowledge from the software you are using. The same code can handle relationships expressing you tagging a Slashdot story as [troll] and relationships expressing your friend network on Friendster. The way you interact with that code will be different - so the UI stuff will be special-case, but underneath the superficial stuff, it can be handled more generically, which means you don't have to come up with a new XML document type every time you want to express a different set of semantics.

      --
      Bogtha Bogtha Bogtha
    10. Re:Googlebombing by radtea · · Score: 2, Interesting

      Google has the right idea, automatic extraction of semantics from content.

      But content has no semantics.

      Meaning is a verb, and "to mean" is an action of a knowing subject. Communication is an attempt to stimulate the same meanings in multiple subjects--kind of a psychological choreography.

      As such, meaning is not extracted from content, ever. Rather, probable meaning is inferred from content, and the basis of inference is fundamentally psychological. What a given word, symbol, sentence, paragraph or page means will depend entirely on who is doing the meaning, and any attempt to infer meaning from content will therefore require a model of who is doing the meaning.

      Building such a model is non-trivial, and multiple models will be required to serve the needs of multiple constituencies. The meaning of "nekkid women with goats" will vary widely depending on who is doing the meaning: a man or a woman, an religious person or a rational person, a child or an adult, and so on.

      Tagging mechanisms that allow anyone who visits a page to classify it will be subject to an enormous range of variation even without spammers, bored teenagers and other malicious entities. People have mentioned the low quality of data in freeDB, and that is within an extremely narrow, specialized area with well-recognized commercially-created categories. Consider the mess that will result from the average American, nearly half of whom believe that god created humans in our current form in the past few thousand years being free to tag pages dealing with evolution and ID.

      None of this is to say that some form of classification wouldn't be a good thing, nor that useful classifications aren't possible. But successful attempts at associating common (within a given culture/sex/age group) meanings with content are going to be vastly harder than most advocates of the semantic web believe, and will be based on a fundamental awareness that content on its own has no sematics. Until that fact is recognized and incorporated into the designs as the deepest level we will get nothing useful out of the semantic web.

      --
      Blasphemy is a human right. Blasphemophobia kills.
    11. Re:Googlebombing by Anonymous Coward · · Score: 0

      The problem is that web companies don't want to present their content in machine readable, structured way. If e-shop would provide information in structured way, it would always be possible to find the cheapest product, and this is not in their interest. Similarly, if IMDB would provide all the data in machine readable way, whom would they sell ads and DVDs? Semantic web is not going to happen for this very reason.

    12. Re:Googlebombing by Wastl · · Score: 1

      And no semantics anywhere (outside of humans) either, just sets of relationships attached to symbols stored in computers.

      That's what I actually tried to propagate the "Symbolic Web" rather than the "Semantic Web". But as people are usually looking very confused at me on these occasions I am now (out of resignation) using the "Semantic Web" as everyone else does.;-)

      Sebastian
    13. Re:Googlebombing by Wastl · · Score: 1
      While I agree with most of what you said, I'd like to comment on your following statement:

      XML has no semantics whatsoever. None. It's a way of serialising and unserialising a tree of elements and attributes. It's markup languages that are built on top of XML that contain the semantics.

      This is not true. XML has no less and no more "semantics" than OWL or RDF. The main difference is that OWL and RDF allow you to build more complicated structures (graphs in RDF, typed graphs in OWL) than XML. The Semantic Web essentially is only about structure, whose symbols are then given "semantics" by the programs processing them and, ultimately, by the humans sitting in front of the machine and looking at the presented content. Therefore I often rather call it the "Symbolic Web". In this respect, I consider XML as a Semantic Web language (and it is also part of the Semantic Web stack).

      The really innovative thing about the Semantic Web is a set of standardised languages and standardised reasoning mechanisms for working with the elements of these languages, and that these languages and tools are "Web-capable".

      Think less about "This bit of text is a paragraph" and more about "This is the relationship X has with Y".

      But what about "This bit of text is the name of the author"? I'd say that this is very well in the scope of the "Semantic Web". It's just a matter of interpreting the structure.

      Sebastian

  8. Incompetence of users such as Slashdot editors... by rsidd · · Score: 4, Insightful

    Thanks for the illustration of what Norvig meant. How is "Google Director of Search and AAAI Fellow Peter Norvig" (original article) semantically equivalent to "fellow Google exec" (Slashdot summary)? The latter suggests that Tim Berners-Lee too is a Google exec, which would be news to him.

  9. Re:Wow, that was confusing by Schraegstrichpunkt · · Score: 0, Offtopic

    What's a semete?

  10. Filtered semantic webs might work by dr_pump95 · · Score: 2, Insightful

    Semantic webs (emphasis on plural) produced by editors such as those at /. or in the consumer-rated style of Digg, Del.icio.us etc might actually work. Trusting authors to do it right is a disaster, as Norvig suggests.

  11. Re:Wow, that was confusing by Anonymous Coward · · Score: 0
    With a name like Schraegstrichpunkt you're clearly an Anti-semete!

  12. Always bet on the million monkeys by IvyMike · · Score: 4, Insightful

    It's really, really difficult to get people to follow rules. We're lazy, we're incompetent (yes), and some of us are evil. I still don't think I truly understand how RDF is supposed to work exactly, and it doesn't even seem like it will be fun to try.

    On the other hand, it's really easy to release a million monkeys and let the create what they will. It's not so easy to sort through what they end up producing, but Google does a surprisingly good job of this.

    It reminds me of the early days of the Web, when companies like CompuServe and AOL wanted to design and own all content. On the other hand, an internet server with httpd let anybody make a ~/public_html directory and put up whatever they wanted to. The million monkeys won that battle. I think they'll win this one, too.

    1. Re:Always bet on the million monkeys by mgblst · · Score: 2, Informative

      People who want to add extra information to there page can, it doesn't all have to change at once. The people who do add semantic information to there page, will be indexed better, or by a different browser which producers only relevant results - this is a huge advantage. Then, this will be popular, and more and more people will add the extra information (which for sure, takes extra time). If people spam the system, or put in incorrect information they are excluded. It is possible, and it is the next step - it is a solution to the problem of spamming.

    2. Re:Always bet on the million monkeys by sco08y · · Score: 1

      It's really, really difficult to get people to follow rules. We're lazy, we're incompetent (yes), and some of us are evil. I still don't think I truly understand how RDF is supposed to work exactly, and it doesn't even seem like it will be fun to try.

      It's not about following rules. It's about offering some kind of incentive. The major disincentives are that RDF is a confusing, poorly engineered spec and that it probably won't provide them any benefit. You can't call someone lazy or evil for having common sense.

    3. Re:Always bet on the million monkeys by hazygin · · Score: 1

      the incentive is simple : good stuctured information, it is the same reason we visit google, wiki, and slashdot. the trick is how to make a system by which the users will help create and organise the information. I am sure that the processing power of 50,000 people is not that shabby. the only problem is realy the validity of the information, but then that is only a small problem since truth to us is relative any way.

    4. Re:Always bet on the million monkeys by TuringTest · · Score: 1

      On the other hand, it's really easy to release a million monkeys and let the create what they will ... and that's how Semantic Web is supposed to work.

      The SemaWeb is all about human-provided content represented in a common format, just like Web 1.0 was! HTML was the format for hyperlinked generic information chunks ("pages"), RDF is the format for hyperlinked metadata-anotated chunks.

      The main difference is that HTML was, at the beginning, a very simple common format (that's not true nowadays, though). Machine-readable semantic is a complex beast, but still it's supposed to be simpler than the current HTML + CSS + Javascript + XMLRequest + EJBs mixture.

      --
      Singularity: a belief in the "God" idea with the "demiurge" relation inverted.
  13. Blaming the user is never right by robolemon · · Score: 3, Insightful

    From http://www.7nights.com/asterisk/archive/2004/03/do nt-blame-the-users

    Blaming the users for anything should raise a huge red flag that you've got some usability problems.

    Maybe the Semantic Web should aim to be useful to people rather than require people to be useful to it. There has to be a better way than trying to educate droves of people to a problematic and vulnerable design.

    --

    I design user interfaces for a free network management application,

    1. Re:Blaming the user is never right by QuantumFTL · · Score: 2, Insightful

      Blaming the users for anything should raise a huge red flag that you've got some usability problems.

      Bollocks! The fact that flying an F22 is probably fatal for untrained grandmothers does not mean it has "usibility problems" - not every task in life is meant to be done by idiots, and the more effort is put into idiot proofing software, the less is put into reliability, functionality, and extensibility for the rest of us. Some things are too hard for a segment of the population to do, and ontologically tagging complex relationships between data entries may simply be beyond the average user. That's not a bug, that's a challenge.

      There's too many generalizations like "blaming the user is always wrong" and "security through obscurity is not useful" that are incorrect under many conditions, and /. posters and moderators seem to be doing their best to propagate these. People have finite time, money, intelligence, knowledge, skill and experience. Not everything can be easy enough for everyone to use.

    2. Re:Blaming the user is never right by Achromatic1978 · · Score: 1
      This is precisely the point. If I had mod points, etc, etc.

      Far too many geeks have far too many ideas about 'socialising' (as in human relationships, not political agendas) the Internet, and their method of doing so is so far from related to normality it's not funny. People don't care about XML DTDs, FOAF, it just needs to work, or else we're just building a big database with odious standards for data normalisation and invasions of privacy that it's no longer a tool for us, but one against us (and I mean that with the least possible amount of tinfoil).

    3. Re:Blaming the user is never right by mgblst · · Score: 1

      Yes, great example. This invalidates his point completely.

      By definition blaming the user is wrong. If your grandmother is a user of a F22, then the machine should not stop her rom trying to fly it. A computer user should be able to use a computer, without getting an infected machine by checking their email, or going to a webpage. And this is what has happened. When you buy a computer today, it will come with a virus checked and spyware checked, and a better browser (hopefully) - why, what would you do. Just blame the user and not create such tools? This is strange reasoning.

    4. Re:Blaming the user is never right by FooBarWidget · · Score: 2, Interesting

      On my website, there are a few links, among which are:
      - Download
      - Forums
      The Download and Forums links are next to each other, and highly visible (48x48 icons with labels). But people go to the forum to ask where they can download my program! When I ask them why they didn't click on the Download link, they don't give an answer.

      If that isn't user incompetence, then what is it? And yes, this happened for real. In fact, it happens all the time, so it's not just 1 or 2 people.

    5. Re:Blaming the user is never right by robolemon · · Score: 1
      Bollocks! The fact that flying an F22 is probably fatal for untrained grandmothers does not mean it has "usibility problems" - not every task in life is meant to be done by idiots, and the more effort is put into idiot proofing software, the less is put into reliability, functionality, and extensibility for the rest of us. Some things are too hard for a segment of the population to do, and ontologically tagging complex relationships between data entries may simply be beyond the average user. That's not a bug, that's a challenge.

      Grandmothers are not users of military jets. One of the major goals of the Semantic Web is a decentralized and contextualized meaningful information store that experts in disparate fields can contribute to. Thus the intended users of the Semantic Web do include the very people being insulted.

      Social software architects are not the sole intended users of social software. Air Force pilots are the intended users of jets. One has to define a user group before applying usability arguments.

      There's too many generalizations like "blaming the user is always wrong" and "security through obscurity is not useful" that are incorrect under many conditions, and /. posters and moderators seem to be doing their best to propagate these. People have finite time, money, intelligence, knowledge, skill and experience. Not everything can be easy enough for everyone to use.

      A red flag is not the same as "always wrong".

      Also, people do not have enough money, intelligence, knowledge, skill, and experience to spend time with a system that doesn't consider them as important. It's a lot easier and more efficient to spend money on making the Semantic Web easier to use than to spend a lot more money training people to use a faulty system.

      --

      I design user interfaces for a free network management application,

    6. Re:Blaming the user is never right by Captain+Perspicuous · · Score: 1

      You know, there are some instances where the general population is right and the professional is wrong, and this is one of them. The normal people know that something that helps organizing a democratic medium needs to be democratic too - it needs to be understood by the masses, or it won't take off.

      As long as you stay in an ivory tower and target the semantic web only to you and your peers, you don't grasp what it is really all about. And it will just stay on the ground.

    7. Re:Blaming the user is never right by Mr.+Hankey · · Score: 1

      There's a difference between a tool written for a task which requires prior knowlege, and insecurely written software. A decent email client wouldn't automatically open attachments by default without asking, and a decent web browser wouldn't run code using greater privileges than the current user in any case. I'll grant you that. However, stating that a CAD program is poorly designed because it's difficult for a new user to grasp would be incorrect. Not everyone is trained in CAD/CAM, so the interface of the software isn't going to make sense for everyone. That doesn't mean there's anything wrong with the software, but an untrained user may well be incompetent. The same could be said about development tools, the aforementioned F22's cockpit, and many other tools which require some training before use.

      --
      GPL: Free as in will
    8. Re:Blaming the user is never right by robolemon · · Score: 1
      If that isn't user incompetence, then what is it? And yes, this happened for real. In fact, it happens all the time, so it's not just 1 or 2 people.

      It's evidence that you should consider changing your layout.

      You know that they have trouble finding your download link, yet you're stubborn enough not to try to improve your site? That's pretty closed-minded.

      I know that it's hard to think that other people could see things differently than you do. Maybe if you want people to download your software more than use your forums, you should separate the Downloads link and make it much larger and prevalent. You're lucky that they even ask where to download, because it's just as easy to close the window.

      People naturally don't act logically. Don't ever expect them to or you will fail. It's easy to change a website, and difficult to change a large population. Pick your battles wisely.

      --

      I design user interfaces for a free network management application,

    9. Re:Blaming the user is never right by FooBarWidget · · Score: 1
      You know that they have trouble finding your download link, yet you're stubborn enough not to try to improve your site? That's pretty closed-minded.


      Why do you think I don't try to improve my site? I do it all the time.

      A few points:
      1. I've already redesigned the website twice, and people still ask at the forum where they can download it.
      2. When I ask those people why they can't find it, they never give an answer! How am I supposed to know what they think when they don't even reply?
      3. I asked a lot of other people who *do* reply, but all of them think that the download link is easy to find, and think that the people who can't find the download link are idiots.
      4. The download link is on the left of the forum link! How is it even possible that one cannot see it?
      5. Even on the forum pages, there are all kinds of links that link back to the download page. When a user enters the forum, he's presented with a huge, orange box that tells the user where he can find the download page. This box is much larger than the "Continue to the forums" link.

      (And FYI, I'm not talking about the Autopackage website.)

      Given all of the above points, what else can you conclude other than that there are indeed incompetent users? Heck, even the users themselves think that incompetent users exist.
    10. Re:Blaming the user is never right by emmadw · · Score: 1

      At a guess, your user was starting in the forum (e.g. a websearch took them right there). They mayn't have noticed that the home page of the forum isn't the home page of the site, and so wondered where to download the apps from. There's a forum that I use regularly (IT support), as the contents are regularly updated - it's forum pages that come up when you google, rather than the home page. A lot of people don't even know that the home page exists. So, rather than the users being dense, or you having designed your site badly, perhaps it's just that search engines do a really good job of the forum, so users arrive there & don't realise that there's more. I can't find your forum - so don't know if you've got a link on the forum pages to "Downloads" - but, have you seen the way that Spyblocker forums work (http://www.spyblocker-software.com) - they're off line at present, but basically, if you've ended up on the home page then there's a link to the forums. If you end up directly on a forum page, then they've got forums that are called "Downloads" - which redirect you back to the home page in case you didn't get to the home page first. At least, that's what they're doing now, not sure about what they'll be doing when they've redone them!

    11. Re:Blaming the user is never right by jotaeleemeese · · Score: 1

      If your forum revolves around your package only, put a link or button in the same place where you put "Send" or "Reply" buttons.

      --
      IANAL but write like a drunk one.
    12. Re:Blaming the user is never right by Anonymous Coward · · Score: 0

      The problem is that there is such an urge to make the web into a public library, but it requires an automated librarian. The legions of content creators certainly are not librarians, and that is what makes them incompetent.

      What we have today is little more than a garbage dump full of books, with pack-rat residents who scurry around and know where to find certain books within their immediate vicinity, because they've tripped over them before. The web is little more than a bizarre telephone system linking these residents. No real indexing happens without a human in the loop to interpret data and find some meaning.

      There doesn't seem to be any motivating force to push us on to a better system, because the majority of the public is satisfied with picking treasures from a garbage dump. I think I agree with Norvig's stance, which seems to be that we have to work on automated filtering to try to incrementally step closer to an automated librarian, without ever trying to enlist librarian skills from the users at large. Anyone who remembers the impact when altavista first came online ought to resonate with this hope for brute-force heuristics to save the day...

    13. Re:Blaming the user is never right by file+terminator · · Score: 2, Insightful

      Are you just trolling, or are you really serious?

      The poster said that the links are next to each other. Unless you have seen the site in question, I don't think you are in any position to bash its layout.

      There are people I seriously think shouldn't be on the Internet. Heck, there are people I think shouldn't even own a computer. Besides IT-related issues, there are also people I don't think should be allowed to drive a car, use a credit card, raise children, have dogs, etc.

      An interesting aspect is that many of these people, especially when it comes to technology but also seen in other areas, somehow think that others are somehow obliged to help them with their activities. Have kids, but expect others to raise them. Get in debt, but expect others to lend you money to deal with the bills. Get a computer, but expect others to hold you by the hand whenever you use it. (Caveat: There are things you can't control, and those you can. If you get in debt because you are between jobs and need a place to live and food to eat, that's one thing. If you get in debt because you must have a better car than your neighbour, that's another.)

      I dunno about others, but around the 20th time I show someone how to copy a file, it already feels old. At some point you start wondering whether the user really cannot learn how to copy a file, or doesn't want to learn. In either case, you're screwed.

      The same goes for userfriendlyness. I'm all for userfriendlyness. If a task inheritedly is so simple that it can be generalized into "click here" and the designer does just that, great! (I'd argue that there is a class of tasks that can't, and also yet another class that can, but only at the expense of configurability--take a set of defaults and assume they'll be good enough for everyone.) But even then, you'll see users that will say it is too difficult--they might not know where they saved the program, they might not yet have figured out how to run any other program than Internet Explorer, they might feel intimidated by the button, they might refuse to run the program because they read somewhere/someone told them that they should not run programs they got from the Internet.

      And that's where userfriendlyness falls apart, and that is when the battle is picked. You can either strive to improve on userfriendlyness forever, aiming to make it automatic and/or intuitive for 100% of your userbase, a goal that will never be reached, essentially wasting all your time for that final fraction... or you can set your percentage lower (exactly how low is up to you) and spend more time on actually developing your program.

      And quite frankly, barring a completely horrid homepage, if a user can find the "Forums" link but not the "Download" link right next to it (or maybe not understanding that "download" means "get program")... that user might not be the kind of user you want to spend time supporting.

    14. Re:Blaming the user is never right by SchwarzeReiter · · Score: 1
      People naturally don't act logically. Don't ever expect them to or you will fail. It's easy to change a website, and difficult to change a large population. Pick your battles wisely.


      And you expect people, who cant even act logically, to classify the content on which you can later rely on? The problem is, quite a big part of the population is a functional analphabet nowadays, if you rely on them to do a good job in for example classifying information they dont really understand, the results will be far from optimal. Take for example slashdot's modding system.
    15. Re:Blaming the user is never right by FooBarWidget · · Score: 1

      Take a look at this. It was posted on a support forum. If that isn't an incompetent user then what is it?

    16. Re:Blaming the user is never right by Anonymous Coward · · Score: 0

      You're an agenda-wearing troll because you take an exaggerated example to counter a subtle point.

    17. Re:Blaming the user is never right by bit01 · · Score: 1

      If you're referring to the autopackage website I think I know why you're getting those questions.

      There's more than a dozen hyperlinks on the main page. None of them say "Download".
      Okay. I'll go to the "Help & Support" section. None of the links there say "Download" either.
      What's left on the main page that seems vaguely relevant? "Packages; various packages"? I don't want various alternative packages, I want autopackage.
      Okay, I'll check the FAQ link on the main page. Do a search for the word "Download"; nothing relevant.
      This is stupid, I'll ask in "Forums".

      You broke a very common standard when you called your download section "Packages; Various packages". Fix it by changing the word "Packages; Various packages" to "Downloads; autopackage" on the main page so the hyperlink is both standard and has the same name as what it is referring to. People aren't mind readers and they have no way of knowing that a link titled "Packages; Various packages" points to a web page titled "Downloads". You could also make that link more prominent (Different color or top-left corner?) because that's what most people visiting your website for the first time probably want to do.

      When you design a web page or a program you need to get into the mind of your intended audiences. Don't assume they know what you do, don't assume they're mind readers and do remember there's a first time for everybody with everything. Standards are one way to to reduce error rates by making sure it's less of a "first time". Also add redundancy so that different members of your intended audience using different thought processes will still get to the right place in the end. e.g. By putting links to the download section in the Help&Support and FAQ sections.

      ---

      Don't be a programmer-bureaucrat; someone who substitutes marketing buzzwords and software bloat for verifiable improvements.

    18. Re:Blaming the user is never right by The-Bus · · Score: 1

      Not sure if you mean this autopackage page or not, but that doesn't have a button that says "downloads"... In fact, it doesn't say downloads anywhere until you click on Packages, then see "Downloads" as the page title. If the question is truly one of the most asked questions, it's not under the "Most Asked Questions" section. It should at least have one of those big buttons on the right... "Download now" etc.

      Again, I don't know if that's the site you're referring to.

      --

      Small potatoes make the steak look bigger.

    19. Re:Blaming the user is never right by FooBarWidget · · Score: 1

      "If you're referring to the autopackage [autopackage.org] website I think I know why you're getting those questions."

      No, I'm not referring to the Autopackage website. In fact, Autopackage is not supposed to be downloaded by end users.

    20. Re:Blaming the user is never right by FooBarWidget · · Score: 1

      No, I'm not referring to the Autopackage website. In fact, Autopackage is not supposed to be downloaded by end users.

    21. Re:Blaming the user is never right by dubl-u · · Score: 1

      There's too many generalizations [...] that are incorrect under many conditions

      Uh, that's the point of a generalization. Generally it's right. Sometimes it isn't.

      Generally, that's fine. Sometimes, it isn't.

    22. Re:Blaming the user is never right by robolemon · · Score: 1

      First off sorry if my tone has been a bit combative. I'm very passionate about this issue and I'll try to tone it down a bit. My observations come from experience in user interaction design involving actual user interviews and watching people interact with sites. You'd be surprised what happens. Really. People aren't logical. There's a lot of good literature on it too if you're interested.

      1. I've already redesigned the website twice, and people still ask at the forum where they can download it.

      I understand your frustration.

      2. When I ask those people why they can't find it, they never give an answer! How am I supposed to know what they think when they don't even reply?

      They probably never come back to the forum. Also, users aren't the best for diagnosing these kind of problems. Try asking someone why they lost their keys. There's not really a good answer to questions like that. These people are going elsewhere anyway.

      3. I asked a lot of other people who *do* reply, but all of them think that the download link is easy to find, and think that the people who can't find the download link are idiots.

      People are entitled to their opinions. You seem personally frustrated that these people don't find your download link, so I think you'd want to help them. If the software isn't designed for them, maybe you can make that more clear on your site by describing or showing it better. If it is, keep on trying, there's got to be a way. Otherwise, just live with it I guess.

      4. The download link is on the left of the forum link! How is it even possible that one cannot see it?

      Who knows? The mind works in mysterious ways that are often unintuitive. I haven't seen your design. Have you tried putting it on the right? Making it bigger? Putting links in multiple places? Putting whitespace around a link so it stands out? All I ask is that you consider other solutions than exclusively linking from within a larger list. I don't even know what your site is, so really I'm just trying to give you some free usability advice.

      5. Even on the forum pages, there are all kinds of links that link back to the download page. When a user enters the forum, he's presented with a huge, orange box that tells the user where he can find the download page. This box is much larger than the "Continue to the forums" link.

      Tweak it maybe. Does the orange make people skip over it because it's too bright and looks like an ad? Is there sufficient spacing around to make it stand out? You might look into eyetracking studies (here's one) to see where people look first on websites. Try experimenting with things other than putting it in a list and putting it next to the Forum link.

      Given all of the above points, what else can you conclude other than that there are indeed incompetent users? Heck, even the users themselves think that incompetent users exist.

      The other side of the coin is that users don't always suggest the best solutions. I'll admit that there's a reason you're the website designer and they're not. Use your intuition, but don't let it blind you to alternatives that better fit how people other than yourself see your website. Probably the best thing you could do is watch a few people use your site who haven't seen it before. Watch what they do.

      Or maybe try out Google Analytics's site overlay feature to see where most people click. Do they try multiple links before hitting the forum? How many leave the front page without downloading the app or going to the forums? All of this information is interesting and useful, whether or not it matches your perceptions.

      --

      I design user interfaces for a free network management application,

    23. Re:Blaming the user is never right by bit01 · · Score: 1

      Fair enough. I see from your other posts you've already done most of my suggestions on the original site.

      Maybe you're simply seeing the statistical outliers in your forums. If you have a large volume of users over a normal curve of abilities it's a statistical certainty you're going to get a few people who are a children, of lower than average mental ability, drunk and/or high on drugs, sleep deprived, distracted by the TV/spouse/kids/etc., having a momentary lapse, non-native English speaker, inexperienced, color-blind (red-green/blue-yellow) and can't see icons obvious to you, unusual thought processes (e.g. comes from a technical field where "download" has a completely different meaning), in a foul mood and not thinking straight, wanting to establish human contact and/or insure there is live support available before investing time in the software etc. etc.

      People are human and even the most intelligent make mistakes. To call all those I've listed stupid (with the exception of "lower than average mental ability") is not correct.

      I think one of the reasons you've pushed a few people's buttons is that it is a common error for programmers to regard these outliers as the "typical user" because they are the most visible squeaky wheels. "Stupid users" are also often used as an excuse by programmers to cover up their own shortcomings. I'm not saying that's the case here, just the reason why some people are wondering what's happening.

      ---

      Marketing talk is not just cheap, it has negative value. Free speech can be compromised just as much by too much noise as too little signal.

  14. Web of Trust by VDM · · Score: 5, Interesting

    In one of the very first papers mentioning the Semantic Web, some paragraph was devoted to something then lost in the hype around the semantic web: the Web of trust, which had to be something like a certification of metadata. This is perhaps to be again regarded as important for the semantic web and the web in general (although not easy to manage).
    By the way, Norvig is not only a Google exec, but also a well known AI researcher, author of one of most important books on that subject.

    1. Re:Web of Trust by cardpuncher · · Score: 2, Interesting

      Indeed. It's noteworthy that a lot of the work being done on "The Semantic Web" is by academics. They come from an environment in which there are peer-review mechanisms and established publishing channels which ensure that "trust" is the norm. Outside that world, information is generally less trustworthy but it may still be relevant. The research challenge is to make use of "trust" where it can be proved to exist but not assume it elsewhere. In commercial terms, though, it may not be worth even trying to do this for the generality of information on the web, only a tiny proportion of which will ever fall within a "web of trust".

      TBL also originally intended that hyperlinks be bi-directional: the target of the link would also link back to the origin. Clearly a useful idea but only practical in a small community with controlled access to documents. It's not unreasonable that there should be more feature-rich versions of the web for use in certain community groups, but it doesn't mean they would necessarily suit the Information Highway's Wacky Racers...

    2. Re:Web of Trust by trawg · · Score: 1

      Google are apparently already on the way to TrustRank. I can't wait to see how this works out.

  15. Norvig's personal project by tfinniga · · Score: 4, Interesting

    Slightly offtopic. Peter Norvig gave a talk at my university on similar topics, and there was a short Q&A afterwards.

    One of the students asked him what he did for his 20% project. He said that he was usually too busy keeping tabs on what the other employees were doing with their 20% time, so he didn't quite get around to working on his. He told us what he wanted to do, as motivation for himself.

    The basic idea is that when he used to work for NASA, it'd always make him upset when people saw faces in random spots on the moon's terrain, and claimed it was aliens that NASA was covering up, or similar. So, he was planning on taking facial recognition software and running it on all of google earth. I think it'd be pretty awesome..
    Any progress yet, Mr. Norvig? I'd love to see the results.. :)

    --
    Powered by Web3.5 RC 2
  16. That:s it gentlemen by Anonymous Coward · · Score: 2, Funny

    The jig is up!

  17. Not a new problem by tubapro12 · · Score: 1

    I've been complaining about this and related issues for a while now. my last journal

  18. Place your bets by Nefarious+Wheel · · Score: 1

    Who would win? (/troll)

    --
    Do not mock my vision of impractical footwear
    1. Re:Place your bets by moyix · · Score: 2, Interesting

      Depends--if Norvig got Russell (co-author with him on Artificial Intelligence - a Modern Approach) to go in with him for a tag-team kind of thing, they'd probably win. On the other hand, Berners-Lee has the W3C on his side, a notoriously large and heavy organization, which could be hard to topple.

      As a side note, I heard from a friend who was attending that Norvig's opening comment about people always asking him "Why are you against the Semantic Web?" was a response to Berners-Lee's opening, 'Poeple always ask me, "Why are you against Artificial Intelligence?"'

    2. Re:Place your bets by inKubus · · Score: 1

      Not to mention that Berners-Lee is a bona-fide KNIGHT--therefore he would win the joust. BYEEEEEEAAAAAAHHHHHHHHH

      --
      Cool! Amazing Toys.
  19. not jsut the general users by Mofaluna · · Score: 3, Interesting

    It's the business users too that are a problem. I'm currently trying to get a project on the rails based on semantic web technology, and I'm confronted with an IT department where some are even struggling with the difference between subtyping and instantiation- let alone more advanced modelling issues... It doesnt help ofcourse that most people never even heard of conceptual modelling languages such as ORM but instead were thought to use uml and ER where it's the modellers' responsibility to make a distinction between what is conceptual, logical and physical which ofcourse most never did.

    In regards to the google issue I think the idea that you should crawl everything is faulty cause you need to be able to trust the source. Most ontologies will simply be restricted to a certain domain and corresponding user group, often in a b2b context. Integrating every man and his dog, the lawnmower and the kitchen sink with some kind of top level ontology is merely a nice-to-have philosophical issue that I dont expect to be solved in the near future, if only cause we havent seen much advances since Aristole started toying around with the idea. In other words, at google they are worried about an issue that's atleast a decade away from now, probably even more.

  20. Hmph... by Jello+B. · · Score: 5, Funny

    That anti-semantic bastard...

    1. Re:Hmph... by BRSQUIRRL · · Score: 1

      No kidding. Next thing you know, they are going to want their own schools!

  21. Sem Web, meet Chicken & Egg by AlXtreme · · Score: 3, Informative
    The semantic web is, in my eyes, a typical chicken & egg problem. You've got loads of content on one side, yet current search engines work well enough to not worry about representing that content in a structured way in a markup language like OWL. On the other side, you've got embarassingly few semantic web applications that use structured content. How is a typical web developer going to justify structuring the content on his side if he can't point to an example how it could improve shareholder value? What would exporting our databases in OWL currently solve?

    True, the web had a similar problem, however creating a webpage is a lot more interesting (you see the results directly, how terrible they might be you do see a result) than structuring data. The latter takes a lot more work, and the direct benefit just isn't there.

    Sem-Web-like standards like RSS, XML and SOAP have become mainstream, but primarily because they fill a gap. The adoption of RDF or OWL simply doesn't solve anything. Yet. It would be cool to let agents loose onto the semantic web and retrieve them together with a summary on a certain subject using a multitude of sources, but as long as it's easier to Google I don't think it would generate any interest outside academia.

    Feel free to prove me wrong though.

    --
    This sig is intentionally left blank
  22. A bad example: FreeDB by h_benderson · · Score: 1, Insightful

    Exactly. Equally worse to the malicious content creator is the incompenent one.

    As an analogy, look at FreeDB. It should be obvious that the CD information database loses much of its worth if entries are not double-checked for errors before submittal. Yet, there is so much crappy entries in FreeDB that it's just not funny anymore.

    Ergo: Don't count on people to adequately tag or label content. It won't work.

    1. Re:A bad example: FreeDB by kthejoker · · Score: 5, Insightful

      Ugh, this is the major misconception of proper Semantic Web implementation.

      There are two user types of Semantic Web materia: the individual user and the group.

      The individual user only cares about context. It's like a Proustian adventure for him. If he tags Slashdot as "blatherscyte" because that's how he views it, then that's valid. If he tags it as "cmdrTaco" because he is stalking Rob, then that's valid, too. And if he tags it as "monkey" because one time he was petting a monkey while he viewed the site, then that's valid, too. It's like the old saying, "Whether you think you can or think you can't, you're right." There are no wrong semantics for the individual user, because it is his context alone which defines the usefulness of a tag.

      For this reason, the individual user should be allowed to tag freely and without limits, and also be able to edit or remove tags later.

      ----

      Now for the group, they have a different goal. Context does them no good, because they don't have the same context. Their goal then is consensus. Take your problem at FreeDB. The simple solution is to let people vote on the accuracy of disputed tags. Or flag ones they view as incorrect, and then review those that meet a certain threshold for flagging. Basically, you want the group to filter out things that don't apply to the group, WHILE maintaining individual context. You don't delete the tags that the group has rejected - you just hide them from the person who has come to view the group tags.

      I think this dichotomy of group vs. individual is what has gotten us into trouble with the Semantic Web. To use one example, I think delicious' big mistake was to show you "popular" tags for a given link. What that does is encourages you not to create your own tags, but instead just piggyback on popularity. Over time, this creates homogeny, which is great for the group, but not for the individual user. Sure, they can probably find that link again in a minimal amount of time, but if an individual tag might help them find it faster, but they shunned individual tags for groupthink, so much the worse for them.

      And on the flipside if you don't provide proper weighting and trust metrics into your tagging system, you are opening yourself up to not only abuse and inappropriate behavior, but also to the "incompetence" mentioned in the article, which is not so much incompetence as a zero-filter. It's like reading Slashdot at -1. It's kind of a touchy-feely way to look at it, but in Web 2.0 thinking, it's bad to delete content; just filter it out instead. It's bad to censor opinions from the software side; let each user do their own stifling. Give the users complete control over the content, and they will find models that work. It's that simple.

      The main problem with the Google guy's point is that philosophically, Google is more groupthink than individual user, because they're a search engine. They value consensus over context. In the future, perhaps they will value context a little bit more than they do. Until then, they have to stand where they stand, because they can't let context into their system. They've tried some clunky mechanisms to do so (Personal Search, anyone?) but until they get it right, the Semantic Web won't have any value to them.

    2. Re:A bad example: FreeDB by berbo · · Score: 1
      To use one example, I think delicious' big mistake was to show you "popular" tags for a given link. What that does is encourages you not to create your own tags, but instead just piggyback on popularity. Over time, this creates homogeny, which is great for the group, but not for the individual user.
      I disagree. Users don't have any problem going along with a standard vocabulary, with no loss of accuracy or precision. Our brains are wired for it. I think this use of popular tags is a great way to develop a de facto standardized vocabulary.
    3. Re:A bad example: FreeDB by coolGuyZak · · Score: 1

      How would you say this philosophy meshes with Google Personalized Search?

    4. Re:A bad example: FreeDB by kthejoker · · Score: 3, Insightful

      That was the entire point of my post! The group benefits from standardization, but the individual suffers. The Semantic Web is an attempt to give power back to the individual user. Subjectivity is a crucial element of the system, and sanitized, standardized, NPOV systems deny the individual subjectivity.

      Delicious is very smart in that it left the *option* for customised tags, but they are clearly saying by implication that the best tags are the ones everyone else is using. My point being that the idea of a "standardized vocabulary" is antithetical to the ideals of the Semantic Web. We don't want a democracy of ideas; we want a free market of ideas!

      Think of the concept "funny." Let's say I asked you to go to 100 different random sites and tag them as funny or not funny. Let's say that of the sites you listed as funny, it was clear you enjoyed witty, New Yorker-style humor, and not fart jokes. But let's say 99 other people did the same thing, and they did the opposite: they clearly enjoyed the fart jokes, and hated the New Yorker wit.

      Now if you asked this seeded engine for a recommendation of a new, 101st site that was funny, should it give you fart jokes, or New Yorker style? This is the power of the Semantic Web. What's funny to you, isn't funny to everyone else. Why should you be punished for that? And if a total n00b comes to our engine for a recommendation, they get the fart jokes page, because it assumes they're like everyone else. But if they start marking those sites as not funny, eventually it'll figure out they're more like you, and start giving them sites that you like.

      Now, will delicious ever do that? Of course not, because it doesn't offer any discrimination to you on the word funny. You get the democratic version of funny. Fart Jokes for all. And that's what "standardization" has to offer. So, no, you can keep that; I want the Internet to understand who I am, and what I like, not what everyone else likes. And if they HAPPEN to coincide, that's fine, so much the better - things are popular because of the people, after all - but they shouldn't have to.

    5. Re:A bad example: FreeDB by kthejoker · · Score: 1

      Well, that was what I was alluding to do as their "clunky approach", in that it offers you a degree of personalization (a plus), but only in the form of past searches (and a weak form of relevance.) It's better than nothing, but just barely.

      I have written up my own theory of the ideal Semantic Web search engine at everything2.com, (another link), (and another) (which of course I encourage everyone to read), but here are some ideas a good Semantic Web search engine would obey:

      1) If I search for "bank", it should find my bank, and not a bunch of other banks.
      2) It should follow the hierachy of personal preference -> local preference -> appeal to authority -> appeal to popularity -> all other results. So if I search for "sofa" it should go:

      a) Has he ever searched furniture before? What sites did he visit from that search? Has he given his seal of approval to any sites that sell furniture?
      b) What furniture stores are within 20 miles of him?
      c) Are there any sites that do serious furniture reviews? Are there any sites that provide consumer reports on furniture?
      d) What are the most popular furniture sites? What are the most popular sofas?
      e) Any other sofa pages out there? Sofa Kingdom, what's that all about anyway?

      And 3) It should have a distinctive "kthejoker" flavor to my searches. How it defines that is entirely up to my history and interaction with it, but the Semantic Web is the easiest way to get this thing done as abstractly and as simply as possible.

    6. Re:A bad example: FreeDB by NetRAVEN5000 · · Score: 1
      What you're proposing is scary to me.

      I would NEVER visit a site if it knew what bank I went to, if I ever searched for furniture before, where I live, etc.

      Yes, I know there are cookies that collect certain data about my search habits, that's fine, but for them to know what bank I go to and where I live?

  23. It's really, really difficult... by pedantic+bore · · Score: 2, Interesting
    It's really, really difficult to get people to follow rules.

    Especially if the rules appear to be an incomprehensible ad-hoc mix of principles taken from a dozen not-quite-fully-baked AI dissertations.

    I still don't think I truly understand how RDF is supposed to work...

    I don't think anyone does.

    I'm not saying that the semantic web is bullshit, but it does trigger my bullshit detector. At least one of them must be broken.

    --
    Am I part of the core demographic for Swedish Fish?
    1. Re:It's really, really difficult... by Bogtha · · Score: 2, Informative

      I still don't think I truly understand how RDF is supposed to work...

      I don't think anyone does.

      RDF's core idea is simple. Give everything a URI. Express relationships as a set of three URIs, (subject, property, value). So you might have (#me, #friend, #bob) expressing the idea that Bob is a friend of mine. Or you might have (#photo, #contains, #me), expressing the idea that I'm in a photo.

      RDF is little more than a mechanism for expressing relationships. It doesn't give software the ability to understand those relationships, you need to build that on top. RDF just helps you solve the relationship problem in a generic way. So, for example, even if you have (#me, #friend, #bob), that's still meaningless until you write software that knows the #friend, #spouse, #employer, #whatever URIs are relationships between people. For instance, you could build a social network like Friendster, only decentralised - and people have, with FOAF, because they've agreed that particular URIs express particular relationships.

      --
      Bogtha Bogtha Bogtha
    2. Re:It's really, really difficult... by pedantic+bore · · Score: 2, Interesting
      This is a little bit like saying "Computer science is easy. It's all just one's and zero's."

      The representation isn't the problem. The problem is agreeing what the the relationships mean. What does "#friend" mean? Does it mean the same thing to program X as it does to program Y? How can you tell? What do you do when there's a conflict -- who gets to decide what #friend means, and whether this is a global or local definition? These are questions that I've never heard answered in any believable manner.

      --
      Am I part of the core demographic for Swedish Fish?
    3. Re:It's really, really difficult... by Bogtha · · Score: 2, Insightful

      The representation isn't the problem. The problem is agreeing what the the relationships mean.

      That problem is not the problem that RDF addresses. It just gives you the tools so that you can concentrate on solving that problem instead of worrying about all the crap underneath. It's like XML doesn't address semantics, it just gives you tools so you can focus on semantics without worrying about parsing.

      What does "#friend" mean? Does it mean the same thing to program X as it does to program Y? How can you tell?

      You read the specification for the vocabulary you are working with. For example, here's the FOAF specification.

      To use the XML analogy again, the XML specification doesn't tell you what particular element types mean, because that's outside XML's scope. You read the specification for the XML document type, e.g. XHTML, to find out what an element type means.

      who gets to decide what #friend means, and whether this is a global or local definition?

      You're forgetting that #friend is just shorthand for a URI. It's not a literal string "friend". If Slashdot choose to expose their friend data with URIs like http://slashdot.org/rdf/#friend that doesn't have any bearing on the meaning of Friendster's data if they use URIs like http://friendster.com/rdf/#friend. They are two separate URIs with two separate meanings that the owners of the domain have chosen.

      I know, the next thing you are wondering is how this is of value if everybody makes up their own URIs. Well the answer is, if they want interoperability, they don't just make up their own URIs. Just like people using XML get together, agree on concrete definitions and write specifications like XHTML, the same things happen with RDF vocabularies, people get together and decide what they think #friend should mean, write a specification like FOAF, and use the same URIs.

      These are questions that I've never heard answered in any believable manner.

      Ignore all the hype from PHBs, this isn't about computers magically understanding arbitrary documents. This is about expressing relationships in a standard way. Of course you need some way of agreeing on what relationships mean, which is why people write specifications. RDF doesn't solve that problem, it's outside RDF's scope. RDF is much smaller and more focused than you think, it's not magic.

      --
      Bogtha Bogtha Bogtha
    4. Re:It's really, really difficult... by pedantic+bore · · Score: 1
      I don't think I'm disagreeing with you.

      Like XML, the notation is just a beginning. It's nice if everyone agrees to use the same syntax to express information (even if it's somewhat gnarly, like XML) but that just saves everyone the effort of writing a bunch of boilerplate code. As someone who has been using IDLs and markup languages for decades, XML and/or RDF doesn't excite me much. It's those other problems -- the ones beyond their scope -- that remain unaddressed.

      Writing the URIs is where all the pain is. I don't see any difference here this and hashing out any other protocol spec. Some SemWeb advocates tell me that they've almost got this solved and pretty soon we won't have to do this any more -- they'll all be written automagically. This is the claim -- that this new notation, if we all used it correctly, will suddenly make the problem easy -- that smells like bullshit to me.

      --
      Am I part of the core demographic for Swedish Fish?
    5. Re:It's really, really difficult... by Bogtha · · Score: 1

      Writing the URIs is where all the pain is. I don't see any difference here this and hashing out any other protocol spec.

      The difference as I see it is simply that the protocol is being specified at a higher level, which means that if you have the right libraries, it's just less work to implement.

      It's those other problems -- the ones beyond their scope -- that remain unaddressed.

      My perspective is that you don't stand a chance of solving the larger problems in a generic way until you solve the smaller problems they are dependent upon in a generic way. The web started out solving the addressability and file transfer problems and bashed out an ad-hoc markup language to tie it all together. Then we got XML to solve the markup syntax problem (yeah, SGML was there beforehand). Now we have RDF to specify relationships. They are all individual steps along the path. The key thing is that they are useful on their own merits apart from the Semantic Web, which lets them be used and refined right now, rather than needing to invent a whole Semantic Web all at once.

      Some SemWeb advocates tell me that they've almost got this solved and pretty soon we won't have to do this any more -- they'll all be written automagically. This is the claim -- that this new notation, if we all used it correctly, will suddenly make the problem easy -- that smells like bullshit to me.

      Me too. There's a long way to go and quite a few components missing before anybody can tie the whole lot together to solve the bigger problems generically, but each step along the way makes the manual work easier.

      --
      Bogtha Bogtha Bogtha
  24. I See Value in the Semantic Web by n+is+prime · · Score: 2, Insightful

    Even if we are inherently lazy, and even though some people seem to be generally against the idea, it doesn't make any sense to me not to employ this and experiment with it. Norvig is an AI guru, and his ideas on the Semantic Web may be interesting, but Google is not against the idea. Google's GData looks to me like a primitive Semantic Web. Even if only 10% of web masters adopt the system, querying to find a set of results that have been tagged as certain meta-data can come up with some interesting results. If the results are interesting enough, more webpages will include meta data tags. Also, being inherently lazy argues for not spending time writing tags all over your code, so why would anyone take the time to sabotage the system. While I understand the difficulties of the spamming problem, there are plenty of cookies on the internet anyway. I think the same inherent problem in the Semantic Web exists with PageRank. In PageRank what happens is a web page will say the same words over and over to acheive a higher ranking in the semantic analysis of the page, and thus the page will be a top result when entering a query with related words. But I think PageRank works pretty well overall. Google's next step with PageRank is to filter all the spam sites that just say the same words. Security in the Semantic Web would also be to filter those sites with obviously spammy RDF or OWL tagging. Overall the Semantic Web is a cool project that could lead to really smart searches, with axioms involving how different meta-tags are related to each other. I'm in favor of the new technology.

    1. Re:I See Value in the Semantic Web by Anonymous Coward · · Score: 0

      Also, being inherently lazy argues for not spending time writing tags all over your code, so why would anyone take the time to sabotage the system.

      If it will make more money for web sleaze, they'll do it.

      This post is brought to you by the captcha 'graded'.

    2. Re:I See Value in the Semantic Web by pisco_sour · · Score: 1

      Just to add some thoughts based on some things I've been thinking about lately.

      I haven't had too much of an approach on semantic web concepts being developed; I certainly will work on that after this thread. My interest, though, comes mostly from philosophy (and other things I want to add into the mix): I want to look into the possibility of our cognitive processes, the way we're dealing with information and transforming that information into knowledge, is either being shaped by our interactions with technology, shaping the technology we're interacting with, or both. This would somehow, I think, require some sort of mutability in the way we learn and the way we know, which is, of course, complicated hypothesis (but then again, who knows?).

      This is all still in its infancy, I don't even know how viable it will be, but it is still fun. Our interactions with technology are transforming our interactions with the world and with other people, and this whole 'tagging' process, bears some resemblance with the tagging of the world that is the construction of language (clearly I still have a lot of research to do on language acquisition and cognitive sciences). And in a similar fashion as words are taken up and used commonly -being, after all, arbitrary pointers towards objets-, perhaps are tags applied to content.

      It's all still pretty blurry to me. I just wanted to add to your comments on the value of the semantic web, as it is probably an indication of how we're perceiving our own minds to be working, but, of course, not all minds are working the same way (for starters, not everyone has access to the technology involved, or the knowledge required to use it), which makes it hard to expect it to be easily adopted. Or something along those lines. I see profoundly interesting possibilities for the concept as a research subject for the time being, but still, to catch on massively, a whole lot still has to happen, particularly on bringing access to everyone.

      Oh well.

      --
      http://castorexmachina.wordpress.com - Filosofía, tecnología y cultura.
  25. RDF Ability vs. RDF Techincal Complexity by Anonymous Coward · · Score: 3, Insightful

    The idea of RDF is applicable to much more than public innerweb content. I've spent the last 7 months researching and developing an RDF backed system for my company's core products. Everyone should think of the value of RDF beyond the scope of trust, and then it becomes easy to realise methods of simple non-web implementation. We can all spend the next 5 years pondering how we're going to figure trusted content providers for RDF web data, or we can just start developing apps for sources which understand themselves as trusted (ie. data input from an individual, employees of a company, and any group where the individual must be accountable for their actions). Whats more important than the blind trust of sources, is data verfication. There are ways to run data input from one user by another user, without doing it in an infringing, demanding way, for validation. I'd like to go into detail of exactly what I mean by all this, but I don't want to violate any portion of my NDA or tip off industry competition (I know that sounds retarded, sorry). If RDF does gain popularity, I can say it will from within the private sector, not the public. Genious implementation may bring RDF to the public sector, but thats not something I would say is guaranteed to happen.

    Current technical obstacles to creating any RDF applcation: The matter of complexity of its integration into DB backed systems (popular methods), and instatiated class marshaling within not-so-object oriented languages. The technical design and implementation of a standards compliant RDF system has been extremely difficult for me. I don't think it would ever be possible to get RDF data represented nearly as minimally as you could with simple relational tables (although formally no more bloated than bloaty XML). RDF also creates many long linked relationships; this tends to create some serious performance issues in querying the data. Lastly, I hate XML, and you can't always correctly export from RDF to XML (capable type to incapable type) in a correct manner.

    1. Re:RDF Ability vs. RDF Techincal Complexity by johnmark · · Score: 1

      Yes, making RDF popular and widespread is really what Tim had in mind when he promotes the semantic web. Its not about users; its about machines understanding (in the AI sense) each other's content. Sorry to hear that in reality RDF isn't there yet. Tim's vision is that someday soon database users would push the RDF button on their web database, and it would make available their data for other machines (not necessarily people) to use. Specifically he made the point that the semantic web is not about manually adding meaning-tags to html. Its about automation.

      Also interesting was the next question at the session, (I was there) by Andrew McCallum, who commented about the semantic web's inability to express a fact's degree of certainty. Andrew is from the probabilistic AI tribe, and rightly characterized the work done to date on the semantic web as coming from old-school AI. This limitation will probably do more to hamper its development in the long run than the ones idenfitified by Novig.

      --
      so much uncertainty, so little time..
  26. Semantic knigth by Anonymous Coward · · Score: 4, Funny

    This remind me of the famous Semantic knigth parody...

  27. Threat to google's business model? by Anonymous Coward · · Score: 2, Interesting

    Do not forget that the semantic Web is not a replacement of the existing technologies: HTML contents will always be there but, What if these little 'metadata' description where added to ALL the Web Pages? In this case, the pages could be categorized, analysed and searched much more easily, and the algorithms related to these operations would be better. In such an scenario, the use or one or another Web search engine would be irrelevant because all of them would have powerful and acurate algorithms. Maybe a threat to google's business model? These would be the perfect world, but we have to assume that Webs would certainly lie or made mistakes in their semantic descriptions. OK, but... would it produce an scenario worst than the actual?. Now, fake webs are quite common; irrelevant sites try to advertise them by using all the available means to attract the most visitors, misleading them. The best web search engine is this who best filters these sites in the searches. In a semantically described Web, the problem will be the same, but there would be another easy-to-use filtering criteria to enhance the results. the Web search engines' algorithm will be better for sure.

  28. Re:Incompetence of users such as Slashdot editors. by coralsaw · · Score: 1

    Bah, semantics...

    --
    <before>now</before>
  29. Too complicated by Chris+Graham · · Score: 1, Insightful

    There is no way that regular people, even the majority of intelligent educated people, are going to be able to use it. It's a ridiculous pipe-dream. Think how hard it is to get people to understand broken-down logical arguments where everything is already layed down for them, and now imagine trying to make them understand how to conceputalise their own data domains and define their own relationships. Maybe 2% of people could do it properly, and then 1% of those would end up in a profession that would use the skills.

    When programmers write software for general use we have to think how to make things easy multiple levels below the level we have to think at. The vast majority of people are not able to think technically, and do not have patience - and that's because most people in this world find it uncomfortable to do anything that isn't centred around a social or emotional act.

    Developers find users can't do programming, so the programming language becomes a graphical interface. The users can't navigate the graphical interface via a structure based on logic, so the screens get built into an icon based organisaion with a well-defined 'workflow'. The user can't think logically about how to use the graphical interface, so help is written to explain how it works and what it can do. The help is too general so specific examples are given. There are too many examples and the user can't be bothered to read them, so a colleague stands next to them and they learn to mimmick their colleague.
    This isn't an extreme situation - it is typical of the vast majority of users. Now think about the inherent technical complexities of OWL and RDF, and imagine people actually using it for real problems? There's no way to hide what is a purely logical and structural framework for organising extensive data, behind pretty pictures and simple examples.

    1. Re:Too complicated by budGibson · · Score: 1

      I agree with the general facts you are outlining, but you're being way too hard on users. What most programmers forget is how hard it was for them to come up with the paradigms embedded in their software to begin with. After months (years) of struggling to code the program, the programmer expects the user to just get it.

    2. Re:Too complicated by TuringTest · · Score: 1

      The OWL and RDFs are not for the people using the system, no more than HTML and CSS.

      The semantic layer could be used to automatically generate goal-oriented workflows, instead of relying on the predefined ones, without making the user think.

      --
      Singularity: a belief in the "God" idea with the "demiurge" relation inverted.
  30. Blame the user by sco08y · · Score: 1

    Brilliant! Blame the user. No, it's not that you don't have a rational data model (you know, so that those "semantic" tags actually *mean* something) or that you haven't done squat to even suggest a proper UI, it's the user's fault.

    And it *certainly* couldn't be that HTML is a piece of fucking garbage and that trying to kludge semantics into the spec is an effort doomed from the beginning.

  31. Re:Incompetence of users such as Slashdot editors. by TrappedByMyself · · Score: 3, Insightful

    Bingo! You've just proven that the incompetence spreads beyond MySpace.
    The problem with the semantic web movement is this: You have the web guys from the W3C who got famous by building kinda crappy, but effective technology (HTTP, HTML, etc...) going goo goo gah gah over PhD Ontologists from the AI community. They team up and build these great things that the average person (including the people who think they are really really smart, like the Slashdot editors), has no chance in hell of using effectively. What'll happen, is that eventually there will be useful Semantic content and Intelligent Agents doing great things, but that work will be done by a select few. The unwashed masses will still be the domain of Google.

    --

    Help me take back Slashdot. When did 'News for Nerds' become 'FUD and Conspiracy Theories for Extremist Nutjobs'?
  32. KISS by Anonymous Coward · · Score: 0

    I don't understand why the semantic web has to start of so complicated.
    Learn from RSS, which is relatively simple, and got adopted quite fast.

    For example: let's say everyone puts a robots.rdf file in the root of their website, which contains some very simple things, a vcard with address info, telephone numbers, gps coordinates and opening hours (if it is bussiness). That would be such a great addition for the web.
    Let's first try to create the equivalent off the yellow pages. Where you can look up some info in a simple structured manner, instead of wading through webpages with impossible designs.

    Google will always be necessary for 'fuzzy' text searches, but if we could put the simple facts in more structured/simple documents, it would help sooo much. I don't really understand why the people in google don't try to push this kind of technology. Maybe because everyone can write a spider for these documents, so it's not that interesting for them, just like (in my view) they never really embraced rss.

    Jasp

  33. Who are your users matey? by jotaeleemeese · · Score: 1

    The context will tell you if you have usability problems or not.

    If an important group of users is grandmothers, trained or otherwise, and they can't use your product or service (call it F22 or a kettle) then you have got usability problems and you have got to address them.

    Insulting the intelligence of your intended audience is a typical no-no for somebody knowledgeable with the rudiments of usability theory and practice.

    --
    IANAL but write like a drunk one.
    1. Re:Who are your users matey? by FooBarWidget · · Score: 1

      What if even the users themselves complain about incompetent users? Would you still say that there is no such thing as incompetent users?

      This is not a hypothetical situation: people on my forum complain all the time about idiotic posts on the forum, despite all the hundreds of man hours I spent into organizing the information in easy to find ways and redesigning the website.

      (FYI, I'm not talking about the Autopackage website)

    2. Re:Who are your users matey? by robolemon · · Score: 1
      What if even the users themselves complain about incompetent users? Would you still say that there is no such thing as incompetent users?

      Then some of your users are making the same mistake that you are. Please look past their opinions and think about whether you want more people to download your software. If you get enjoyment out of making fun of some people and calling them incompetent, then you're set. Otherwise, try to be humble and put yourself in these other people's shoes. How would you like it if the first time you tried to download some new project a bunch of people you didn't know called you incompetent?

      Also, I agree with the post elsewhere that people might be coming in through a search engine straight to your forums. How can you help people understand where they are within your site, and how that relates to the download location? Some sites even go so far as to tell the user 'Hey we see that you came from a Google search for "foo download", are you looking for our DOWNLOAD PAGE (link)?'"

      --

      I design user interfaces for a free network management application,

    3. Re:Who are your users matey? by FooBarWidget · · Score: 1

      Wanting to help people is good and all. In general, I still want to help my users. But after 3 years, things get tired quickly when you read the same question for the 84235823th time (despite massive efforts to redesign the website to make the answer to that question easy to find). Take a look at help desks. Have you ever seen a help desk operator who likes his job? I've never seen one, or even heard of one. Take a look at the administrators/moderators of some major forums where non-technical people go to. Almost all of them (the admins/mods) eventually burn out. I've seen the nicest people turning into frustrated jerks after having to deal with 'n00bs' on forums for more than 2 years.

      99% of the people who tried to adopt the "user is never wrong" mentality eventually burn out, and fail. What do you think that means?
      a) Those people are all wrong and the user is never wrong.
      b) Incompetent users exist.
      In the ideal world, I'd believe in a, but after 3 years it's becoming increasingly hard to not believe in b.

  34. Insult humanity by MADnificent · · Score: 1

    So instead of insulting a very small amount of people, he insults everybody in the world...

    Way to go!

  35. As the tale goes, "The emperor is naked!" by jschrod · · Score: 1
    Some well-known researcher called the emperor naked. Maybe they believe him more than they did the practicioners that pointed the Semantic Web's problems out long before. Here we'll see that fairy tales are not true -- a small child is not sufficient, we need a bigshot to notice.

    News at 11...

    --

    Joachim

    People don't write Manifestos any more -- what's going on in this world? [Frank Zappa]

  36. Complex? Opportunities for spammer? Don't think so by CaptSolo · · Score: 3, Insightful

    As I pointed out in the previous comment authoring data on the semantic web is no more difficult than authoring RSS or XML.

    Yes, figuring out for the first time how to represent your data in RDF (or XML for that matter) can be difficult. Imagine if everyone was trying to come up with an RSS standard on his own instead of using RSS export functionality of his content management tool. That's why we need good guidelines how to publish information on the semantic web. And RDF export functionality (plugins) similar to what RSS plugins are doing.

    As for opportunities for spammers and mischief - don't think so.

    Why? - If you look at the Semantic Web "layer cake" you will notice such technologies as digital signatures, encryption and trust being part of the scheme. They allow to identify the author of data and ensure he is what he claims to be. There is nothing wrong with your application if it only accepts signed and trusted data. And there is nothing preventing authors of the data from signing the contents. Since the semantic web is a new technology and we already know about problems that spam and misuse can present it is more not less prepared to fight spam.

    Note1: Semantic web should be viewed as an integral part of the existing web, not its opposite. Might even be that it can provide an additional layer that will help to combat spam and other problems you mention here. Who knows.

    Note2: Spammers will always try to come up with new exploits. We all have to be prepared for this and think how to close the holes they are using. But saying that newer (a further development of existing [web]) is necessarily more opportunities for spammers in wrong.

  37. Tutorial on the Semantic Web by CaptSolo · · Score: 2, Informative

    Here is a Tutorial on the Semantic Web.

    Pay attention to the slide #22 which shows how data from different sources can be merged together. This is one of key differences between XML and RDF - to merge XML data from a number of different schemas one would need to create an application that processes data in these schemas and generate merged data (possibly inventing a new schema to represent the merged information).

    In RDF that happens "magically" - in order to merge heterogenous data you don't need to do *anything* - just put all the information in an RDF store and it merges. If the data to be merged change no modifications to the store are necessary - it is like a bag that can hold anything.

  38. Re: How is Semantic Web supposed to work by CaptSolo · · Score: 1

    A good point about the million monkeys. :)

    To illustrate one of key differences of how RDF is supposed to work and why it may even be fun (and not to post same info again) check out at this comment: Tutorial on the Semantic Web.

    A simple and concrete example: when a conference publishes its delegate list in RDF it is suddenly very easy for services that use this data to appear. By combining this data with Google Maps we get a FOAFMap of its participants where the application has extracted machine readable data from RDF, used geo-information to put them on the map and has retrieved more data from FOAF RDF profiles of the participants.

    That's a simple mashup, but it shows how machine readable data can help spring new and sometimes unexpected use cases. I don't understand why all conferences do not provide data in RDF yet. ;)

  39. It's the page content, stupid by saddino · · Score: 1

    Although trust is certainly a issue when it comes to the Semantic Web, the real problem is that its design is not a true abstraction, but is nothing more than more metadata. And like the actual textual data in a typical web page, it suffers from all the same problems, save for one: being unstructured (and thus not truly parseable).

    IMHO, the Semantic Web is solving one problem (the lack of structure and descriptive context in textual HTML content) in a very hard way (asking the entire web to implement this new RDF).

    Many companies (disclaimer: like my own) are approaching these problems from a different angle: working on statistical and semantic systems to extract structurue from the textual content that is already there on the web page.

    Now some people will argue that trying to create a system that can understand langauge/content is insanely difficult.

    But what is a more realistic time frame? The one in which an intelligent parser can begin to understand the content that is already on the web, or the one which requires the entire world to implement a solution to a problem they don't even realize is a problem?

  40. pardon my ignorance by plopez · · Score: 2, Interesting

    But what, exactly, is the definition of the 'Semantic Web'? How is it different from what has been done in the past? Is there any agreement of any sort as to what it means? If yes, please let me know. If not, then how can we achieve this goal if we do not know what it is?

    I am confused, I really do not see too many differences in the web in the last few years. Nothing 'Earth Shattering' anyway.

    --
    putting the 'B' in LGBTQ+
    1. Re:pardon my ignorance by TuringTest · · Score: 1

      Do you understand the difference between |-separated configuration files and XML configuration files? Both are equivalent in that they provide constants for a program; but XML files can be processed by a generic parser, while "pipe" files need an ad-hoc parser.

      RDF and the layers on top of it (OWL, DAML...) try to achieve the same for other tasks. Instead of having to build separate applications to achieve the same task again and again for every website, you can reuse a generic code by having all the meanings of the application explicit in the RDF.

      --
      Singularity: a belief in the "God" idea with the "demiurge" relation inverted.
  41. RDF Ability vs. RDF Techincal Complexity-trust by Anonymous Coward · · Score: 0

    "The idea of RDF is applicable to much more than public innerweb content. I've spent the last 7 months researching and developing an RDF backed system for my company's core products."

    You've resolved the trust issue by limiting it's domain. e.g. intranet.* BTW there are ways of getting semantics from content that are "clever".e.g. making it a game.

    *The advogato trust metric may be a way of resolving the issue.

  42. Old AI vs New AI by Alomex · · Score: 1
    The Semantic Web is in the old AI tradition of grand overhyped promises with little results to show for them many years later. AI had managed to moved away from this practice that had led to the crisis in funding in the 80s, when people woke up to the fact that AI did not deliver as promised. Here at AAAI there is a sentiment that the semantic web is a step in the wrong direction and Tim Berners-Lee talk here was presented as such. Here's the abstract from the program:

    The relationship between AI and the semantic web has been something that has provoked a lot of heated corridor discussion over the years. This talk will try to outline what the semantic web is and is not, at a conference where there may be some anniversary reflection on what AI is and is not. It is not always obvious how to transfer existing AI techniques into a fractal weblike space, or what the effect will be. But it is certainly exciting.

    The same chasm appeared during the founders panel, where John McCarthy gave a more sober cautious perspective of where AI is going, while Marvin Minsky issued a call for the old style of over-hyped research such as the "emotion machine" whatever that means.

    There was also a feeling that perhaps some of the grand challenges are too ambitious. "We can't make predictions since in some cases we don't even know what the problems are" a famous panelist noted. It is good to have long term goals, but they must be set within the realm of what is at least vaguely foreseeable. Challenges beyond that boundary are in the realm of science fiction not scientific AI.
  43. Wow, that was confusing-Slot A, Tab B. by Anonymous Coward · · Score: 0

    "What's a semete?"

    It's what you inpregnate computers with.

  44. Re:Incompetence of users such as Slashdot editors. by berbo · · Score: 1

    Google to Berners-Lee: you are PWN3D!

  45. Re:Incompetence of users such as Slashdot editors. by NoOneInParticular · · Score: 1

    Given the track record of Good Old-Fashioned AI in building something genuinely useful, I'm afraid Semantic content and Intelligent Agents will not be involved in great things for a while. Maybe in 50 years ;)

  46. The real core problems of the Semantic Web ... by alispguru · · Score: 1
    ... are still unsolved. The problems of data inconsistency (from bogus or fraudulent data entry) are bad enough, but the semantic web idea has problems even if you assume all the data is valid. There are some theoretical results on inheritance networks (a classic AI predecessor to semantic web representation) from the 1980's and 90's that are rather depressing:
    • Touretzky's dissertation where he shows that if you allow exceptions, it's hard to keep inheritance networks globally consistent
    • Another result I can't locate right now which proved certain basic inferencing techniques in inheritance networks to be NP-complete

    That last one means that straightforward, guaranteed-to-reason-correctly searchers for semantic webs won't scale, which means their use on the global internet is problematic. Failure to scale was one of the major causes of AI winter, guys.
    --

    To a Lisp hacker, XML is S-expressions in drag.
    1. Re:The real core problems of the Semantic Web ... by n+is+prime · · Score: 1

      Agreed.

      The rest of this thread is a little bit misguided; the entire discussion is centered around users and writing the tags. Those things are not a problem. It will be as problematic writing embedded code in HTML as it is now writing CSS, DHTML, PHP, JavaScript, etc. We've never been too lazy to incorporate more code before, and we're not all too lazy to incorporate some more tags now, that's ridiculous.

      Organizing the incorporation of multiple and related ontologies on the Semantic Web is much tougher, and there's a lot of research being done in the area. It's probably quite doable with some global guidelines.

      The problems you discuss here are significantly detramental to the goals of the Semantic Web. I wonder what "basic inference techniques" you speak of. Somehow I still argue in favor of more metadata.

  47. Semantic Web is just backwards by snowwrestler · · Score: 2, Insightful

    It's trying to impose structure on something that is not very structured--human thought. Even the use of the word "semantic" points out the futility of the exercise, as it indicates language and changes in meaning--not structure.

    Semantics is a human discipline--it is focused inward, not outward. Likewise the proper place for semantic technology is in the client, not the content. Building "semantic web sites" makes no sense. Google is absolutely right on this one--Web sites should simply be what they are, and it is up to the client to assign meaning and remember connections. Google provides a variety of tools that help people do just that.

    Why should I have to tag everything I read online? I don't tag things I see in real life. I just remember them and make connections in my mind. If we want computers to be actually useful to us as assistants and not just stupid tools then they will need to begin to operate the same way. That is a very tough problem, yes. But it is the way we are headed, and the "semantic web" is IMO just a bad hack until we get there.

    Furthermore the idea of trustworthyness and authority online is ridiculously complicated. I can't think of a harder problem in all of AI. It's much harder to determine if someone knows what they're talking about, or if they are trustworthy, than it is to simply identify the topic of an article. And we're still struggling with the latter.

    --
    Build a man a fire, he's warm for one night. Set him on fire, and he's warm for the rest of his life.
    1. Re:Semantic Web is just backwards by Wastl · · Score: 1

      It's trying to impose structure on something that is not very structured--human thought.

      True. Unfortunately, "structure" is the only language that machines can work with. If we want the machines to help us, then we need to provide information in a structured way. The current situation (without the "Semantic Web") is in no way different, only that the structure currently only describes the layout and presentation of information. The goal of the Semantic Web is to also be able to provide structure that describes further aspects, like (but not limited to) the content.

      One issue we (the Semantic Web community as a whole and I in particular) are working on is to provide tools that make the creation of structure as easy as possible - ultimately through natural language processing, but that does not work too well at the moment.

      Why should I have to tag everything I read online? I don't tag things I see in real life.

      This is a very valid question, and believe me, it is considered by many researchers. You should, because you get something out of it whose value is higher than the effort that you invested. Why do you mark up text in HTML? I don't do it in "real life", I just put my pen on paper.

      If we want computers to be actually useful to us as assistants and not just stupid tools then they will need to begin to operate the same way. That is a very tough problem, yes.

      One of the biggest failures in computer science history / AI was exactly that - to try to make computers behave in the same way. Computers are not humans, and they never will be. Computer intelligence is different, and has many strengths but also many weaknesses compared to human intelligence. Computers are increadibly fast at computing, faster than any human can be, but they are extremely bad with vague information and intuitive judgements. So rather than trying to behave like humans, computers should complement humans, and they actually do.

      I have read an article about AI (German) in a German newspaper, and it has a very interesting statement: the problem with AI is that all methods that are developed appear rather simple and structured afterwards, and therefore not "intelligent". For this reason, many people claim that AI is a failure. At the same time, we have gone an increadibly long way in AI research.

      Sebastian

  48. HAHAHA good luck! by Anonymous Coward · · Score: 0

    "You can't fix stupid"
    --Ron White.

    Seriously though, web 2.0 is going to have a hard time with thousands of arrogant programmers making claims that it is crap because they can't figure it out. So maybe it will be a niche market ;). That's right, go with that thought. Web 2.0 is garrrrrrbage.

  49. Norvig Too Kind: Problem Is Berners-Lee by goat_roperdillo · · Score: 1
    Berners-Lee's insistence that his Semantic Web ideas can work has kept him in a backwater of the WWW. Berners-Lee developed the SW ideas early, without outside critical thought and the SW remains a pipe dream.

    The basic problem with the SW is that the use of separate ontologies defies any exchange process that does not include human intelligence. IOW to do it properly there must be human intervention. But Berners-Lee keeps thinking that there is a shortcut - there isn't. Better men have trod this path and know what's at the end of the road.

  50. Hear, hear! by TuringTest · · Score: 1

    Fellow moderators, parent is the most Insightful comment I've seen in this thread; the Semantic Web is all about human-provided content represented in a common format, just like Web 1.0 was!

    Someone with points please mod it up!

    --
    Singularity: a belief in the "God" idea with the "demiurge" relation inverted.
  51. [ot: wink] by wild_berry · · Score: 1

    This stalking thing's hard work. I'm glad I could track you down to ask: I'm interested in downloading your software, where might I find a link to it?

  52. Precisely Where TBL Stumbles, SW Falls by goat_roperdillo · · Score: 1
    While the slide's simple FOAF example works, in general that is not true. The ontologies for the FOAF example are well-defined and consistent. But that is not the general case. In general, human intervention is required (here I mean people arguing about definitions). There is no "magic" in SW. The emperor has no clothes.

    The SW doesn't work and won't work. Give it up. The real future is in the nouvelle AI as demonstrated by Luc Steels' research, wherein negotiation of language and negotiation in language in a real-world environment establishes the semantics of communication.

    1. Re:Precisely Where TBL Stumbles, SW Falls by CaptSolo · · Score: 2, Interesting

      I won't be so sure (that it is not a general case).

      A regular user won't be inventing his own ontologies the same way as he is not inventing a new RSS format. There is a set of well-define ontologies that you can use to describe your data. And a regular user won't be hand-crafting RDF data either. Instead RDF data will be exported from his applications the same way as RSS and Atom are exported from his weblog software or as Word saves users files.

      RDF data will still merge together, provided there are "crystalisation points" that are common to data from different sources.

      Regarding Luc Steels' research you are mentioning - could you give some pointers to his work?

  53. Need to make semweb toys by alucinor · · Score: 1

    If we want to get users to enter in metadata, we need to do three main things:

    - create editors that automate the syntactical complexities of RDF/OWL, like what blogs have done for HTML.
    - make entering metadata entertaining somehow.
    - make some killer apps that show to regular users the usefulness of the semantic web.

    Then we'll have a semantic web. Problems like spam can just be addressed as we come to them, but Web of Trust is probably a good start.

    --
    random underscore blankspace at ya know hoo dot comedy.
  54. Re:Incompetence of users such as Slashdot editors. by durdur · · Score: 1

    Specifically, to get these technologies adopted in a business environment, somebody has to be able to explain to executives what the benefits are and the cost, and therefore why they should adopt this technology, and if it's for sale somehow, why they should invest in semantically aware software/hardware. The barrier here is that business guys who write checks may not be technical, or not very, and sales guys who talk to them are also not generally technical. So it doesn't get traction because the benefits are hard to explain and the stuff you do to get the benefits is really inscrutable. There are a few areas where SW is getting significant adoption (pharamaceutical research is one) but in general it is off the radar screen and the vendors providing software that implements this stuff are niche players.

  55. Links to Luc Steels' Research by goat_roperdillo · · Score: 1

    Just click on my SlashDot username (directly above) and read my posts attached to "Robot Dogs Evolve Their Own Language". There you'll find links to Steels' work. Google him, too.

  56. Suggestion: Semantic Web needs a proving grounds by Retired+Replicant · · Score: 1

    All I would ask of the Semantic Web evangelists is that they go off together and build a network of Semantic Web systems that proves the following: 1. It works 2. It is more useful than currently existing practices 3. It is more cost-effective than currently existing practices 4. There is a killer application that is not possible using currently existing practices If they can do #1 and any one of the other 3, then maybe people will see the value and start adopting it in the real world. Until then Semantic Web sounds like the Dvorak keyboard to me: a "solution" that everbody thinks is worse than the problem it is trying to solve, because it requires millions of people to change the way they do things without much proven benefit. What Semantic Web needs to do is prove is that it is actually worth implementing by showing some honest results.

  57. Norvig vs. Berners-Lee by wintermute1974 · · Score: 1

    Do I hear a Googlefight in the making? Why yes, it's Norvig vs. Berners-Lee.

  58. You need to offer insight, not a mash of opinions by dsmatthews · · Score: 2, Funny

    Google seemed to be looking to the web for meaning, but they should be building their own Ontology of Everything, based on what they find in the content. Let Cyc loose on there caches perhaps would be a good start. Then integrate their Ontology of Everything with those Formal Ontologies that already exist. About Intelligent Searching, when a person asks me for advice, I tap into my Personal Ontology, which has overlap with other ontologies in a domain specific way. i.e. I read information, much of it structured, I then fit it into my Personal Ontology and if required expand my ontology to fit the new information. I may even face a paradigm shift that requires a major restructuring of my Ontology, i.e. I need to have a set of new transforms to link the old with the new in a way that lets me sanely access both. At this point I have acquire new Knowledge which I can now share with people that ask me questions. When I'm talking to a Knowledgeable Source I need to find the Transforms that allow me to incorporate "knowledge of shared knowledge" as well as knowledge of our unique knowledge. This is how we are able to communicate and learn from each other. If I am dealing with a Naive Searcher I need to Probe their Personal Ontology or World View until I am able to construct enough domain specific transforms to allow me to know what they are trying to learn and how best to find it and Teach it to them.

  59. Reinventing the wheel... by Anonymous Coward · · Score: 0

    I totally agree with Peter Norvig.

    There has been a lot of enthusiasts but also a bunch of stumbling amateurs in AI all around the Semantic Web project, and this has not been a good thing.

    They are reinventing the wheel. RDF is nothing more than knowledge representation 101.
    Don't forget that Russell & Norvig's book devotes 5 out of its 26 chapters on KR.

    And now with the DAML language they are reinventing Prolog interpreters.

    From their Website - Why use DAML?
    http://www.daml.org/2002/04/why.html

    (motherOf subProperty parentOf)
    (Mary motherOf Bill)

    when stated in DAML, allows you to conclude

    (Mary parentOf Bill)

    Check this from an introductory course in Prolog
    http://www.cs.wisc.edu/~noto/cs540/lecture/14-prol og.html

    fatherOf(keith, duane).
    motherOf(keith, diana).
    adoptedBy(webster, mrPapadapolis).
    parentOf(X, Y) :- adoptedBy(X, Y).
    parentOf(X, Y) :- fatherOf(X, Y).
    parentOf(X, Y) :- motherOf(X, Y).

    Given a query, parentOf(keith, X)., Prolog will:

    ?- parentOf(keith, X).

    X = duane ;

    X = diana ;

    LOL ;-)

    I see, maybe two real innovations : URI (Uniform Resource Identifier) and Digital Signatures and Web of Trusts.