Slashdot Mirror


The Need For A Tagging Standard

John Carmichael writes "Tags are everywhere now. Not just blogs, but famous news sites, corporate press bulletins, forums, and even Slashdot. That's why it's such a shame that they're rendered almost entirely useless by the lack of a tagging standard with which tags from various sites and tag aggregators like Technorati and Del.icio.us can compare and relate tags to one another. Depending on where you go and who you ask, tags are implemented differently, and even defined in their own unique way. Even more importantly, tags were meant to be universal and compatible: a medium of sharing and conveying info across the blogosphere — the very embodiment of a semantic web. Unfortunately, they're not. Far from it, tags create more discord and confusion than they do minimize it. I have to say, it would be nice to just learn one way of tagging content and using it everywhere.""

200 comments

  1. Don't agree by pubjames · · Score: 4, Insightful

    Isn't the power of tags that you can tag stuff however you want? To me a standard for tagging would be a negative thing.

    I don't thing the problem is a standard for tagging, the problem is having a standard for sharing tags between applications. But that's another problem and it doesn't need to be solved to implement tagging itself.

    1. Re:Don't agree by Anonymous Coward · · Score: 0

      I want a standard dogs tags! My dog deserves them!

    2. Re:Don't agree by dsginter · · Score: 2, Funny
      Some ideas for tag standards:

      <yes>
      <no>
      <maybe>
      <haha>
      <evil>
      <spam>
      <cow boyneal>
      <firstpost>
      --
      More
    3. Re:Don't agree by pipatron · · Score: 2, Funny


      <fud>
      <notfud>

      --
      c++; /* this makes c bigger but returns the old value */
    4. Re:Don't agree by Anonymous Coward · · Score: 5, Interesting

      Er, guys?

      Tags are keywords.

      There's a keyword line up in the header that isn't being used for much these days.

      If you want to tag your document in a machine-readable way, put the tags in the keyword field. Problem solved.

    5. Re:Don't agree by lousehr · · Score: 5, Insightful

      Your analysis of "the problem" is exactly the point of TFA. The stated concern is not that the content of the tags has no standard, but that the format of the tags has no standard. If a single tag contains multiple words, should the words be separated by spaces or underscores, or should we use StudlyCaps?

    6. Re:Don't agree by Anonymous Coward · · Score: 2, Interesting

      I agree with you, and would add:

      1. If you establish a "tagging standard", you practically guarantee nobody's going to follow it. What's in it for them?
      2. What normal person care about tags anyway? If we want more info on a topic, we use Google.
      3. The blogosphere is for losers anyway. Most of the time, they just sit around blogging about the blogosphere. Case in point: TFA. This garbage dump of anti-content can remain disorganized, for all I care.
    7. Re:Don't agree by laffer1 · · Score: 2, Interesting

      Case is also an issue. Some sites only allow lowercase tags while others don't care about case.

      This is similar to the problem blogging sites have with cross site scripting. Try to tell a blogger you won't take HTML or bbcode posts (depending on generation of the blogger). Regardless of what you do, there's going to be sites that don't follow the rules and there will also be ways to screw it up for everybody.

      There isn't a standard for many things on the internet which causes validation to be near impossible. Security researchers complain people don't do input validation, but I've never seen a complete webapp that's an example of security at the time its written. You can't validate things like addresses, names, e-mail addresses, or long text entries including blogs and content without leaving out characters that should be valid or flat out blocking some from using your service.

      As for the standard, this reminds me or RDF. Had it taken off, we could tag data with properly defined, shared tags. Defining tags in RDF would allow sites to share the information through RDF and thus solve the problem of transmittal. Of course getting everyone to agree to this is another thing.

    8. Re:Don't agree by ArsonSmith · · Score: 5, Funny
      You forgot to close them all

      </yes>
      </no>
      </maybe>
      </haha>
      </evil>
      </spam>
      </cow boyneal>
      </firstpost>
      --
      Paying taxes to buy civilization is like paying a hooker to buy love.
    9. Re:Don't agree by MoogMan · · Score: 1

      No, the problem with tagging, is that everyone else may have a different interpretation of what the metadata means. The solution to this? Why, tag tagging of course.

    10. Re:Don't agree by radtea · · Score: 4, Insightful

      The stated concern is not that the content of the tags has no standard, but that the format of the tags has no standard.

      The medium, as Marshall Maclluhan said, is the message. As soon as you standardize the format of the tags you will restrict the kind of information people can convey with them. That may be an acceptable limitation to you, but not to others, and they will find workarounds that effectively break the standard.

      For example, if tags were standardized on underscores to separate words you would have to forbid spaces and caps to enforce that standard. And then we would have no way of distinguishing between Polish and polish, which would be bad if you were looking for things to do with Eastern European culture or furniture care products. People would then start doing things like expressing capitalization by some other syntactical hack which would be inconsistently applied and a greater mess would ensue.

      Alternatively, tags could be represented as more complex markup:
      <tag>
      <word order="1">really</word>
      <word order="2">stupid</word>
      </tag>

      But because words and concepts have no general one-to-one correspondence (many words do not convey a unique concept or a concept at all, and many concepts cannot be conveyed in one word) this would be inadequate, and in any case even if the content model of the "word" tag forbade spaces, caps and underscores, people would still create tags that looked like:

      <tag><word>reallystupid</word></tag>

      The basic idea of "semantic markup" is wrong. From the summary:

      the very embodiment of a semantic web. Unfortunately, they're not. Far from it, tags create more discord and confusion than they do minimize it. I have to say, it would be nice to just learn one way of tagging content and using it everywhere.

      Actually, tags as they stand are the very embodiment of the semantic web. The only function of the semantic web is to create confusion and discord, because confusion and discord is the essence of the human epistemological condition. And the call for "one way of doing X" has a nice religious ring to it, history shows that attempts to standardize things relating to human thought are very much misguided.

      --
      Blasphemy is a human right. Blasphemophobia kills.
    11. Re:Don't agree by QuickFox · · Score: 1

      If a single tag contains multiple words, should the words be separated by spaces or underscores, or should we use StudlyCaps? If tags are to be read by people they should be written and separated the way people always separate words and items. Space between words, comma between tags. Why mess things up with weird and unintuitive arrangements?

      Example:

      privacy, Big Brother, government control

      --
      Terrorists can't threaten a country's freedom and democracy. Only lawmakers and voters can do that.
    12. Re:Don't agree by Al+Al+Cool+J · · Score: 1

      Maybe I'm missing the point to this, but shouldn't these variable formats all be parsed and handled by the software in the first place? How hard is it for a computer to be able to convert between, and simultaneously understand, the use of spaces, underscores, dashes, quotes, StudlyCaps, etc? Seems to me that is the right way of handling this, because you surely can't expect all implimentations and all users to know and follow whatever standard may possibly eventually arise.

    13. Re:Don't agree by ravenlock · · Score: 4, Funny

      You closed them in the wrong order, so I guess the rest of the conversation is no longer valid. :)

    14. Re:Don't agree by Zaphod2016 · · Score: 1

      I agree 100%. Expecting the +1,000,000,000 users of the Internet to follow some sort of tagging convention is a pipe dream at best. I'd settle for proper punctuation in forums.

      If any "standard" is proposed, it should be a mild, behind-the-scenes change. Perhaps a META TAGS header, i.e. rss/application or META KEYWORDS. And only to make it easier for scrapers and aggregators (and of course, our beloved Google).

      If anything, we need to be focusing on "concept" mining. Synonyms, slang and syntax must all be considered. For example: "President Bush", "George Bush" and "W" might all be used to categorize the President (#43). However, on some pages, those same tags might be referring to H.W. Bush (#41) AND his son (#43). In other words, on site #1 these tags would refer to one concept, on site #2 they might refer to two distinct concepts.

      On the other hand, it would be fantastic to click on the tag "funny", and immediatley see a list of all "hysterical", "knee-slappers" and "yuks"- all contained in a single "concept" list.

      Knowing when to combine a concept, and when to "split" it into multiple concepts, is what the "intelligent web" is all about. I believe this is not only possible, but inevitable- and it will make the current web look like an Apple IIe v. a MacBook Pro.

      Mastering the grouping of tags will provide a fantastic step forward in this pursuit.

    15. Re:Don't agree by deesine · · Score: 2, Funny
      Why mess things up with weird and unintuitive arrangements?

      Why are trying to take away my job?

      --
      damaged by dogma
    16. Re:Don't agree by marcello_dl · · Score: 1

      I thought that "keyword" meant a word characterizing a document usually coming by the document creators, while "tag" meant something that the user puts on documents according to his needs. So if one user has his own habit he does not need any tag standard. As you say the only problem is to have aggregators for tagged things coming from different apps, but in my experience it's easy to recall if what you were looking for was an email rather than a word document or a webpage link. I'd be more interested in tags encoded in files' metadata, so that a file path is relevant for the OS while the user is free to catalog his hd content however he sees fit. I guess new filesystems allow this?

      --
      ---- MISSING MISCELLANEOUS DATA SEGMENT --- [sigdash] trolololol
    17. Re:Don't agree by Sardaukar86 · · Score: 1

      OK, I'm throwing away an opportunity to throw another log on your '+5 Insightful' fire, but I just had to drop you a quick line to compliment you on the quality of your post. Thank you for a well-referenced, interesting and thought-provoking comment, beautifully executed with flawless spelling and grammar. Perhaps '+5 Made My Day' would be a more appropriate mod!

      --
      ..Mullah or Pope, Preacher or Poet, who was it wrote: "Give any one species too much rope and they'll fuck it up"?
    18. Re:Don't agree by Hyperspite · · Score: 1

      Well Google already has Google Sets, which allows one to generate a list of associations. We are probably almost there.

    19. Re:Don't agree by Anonymous Coward · · Score: 0

      Adding meta data to your own document is an alread-solved problem. Beyond the keywords header, there's also the much-more-evolved Dublin Core standard.

      Adding meta data to someone else's document is much less solved and by no means standardized. It's precisely this problem that sites like del.icio.us and Technorati address. I think anyone familiar with SEO knows how quickly we learned not to trust meta data added by the author of a document, especially one available on the web. External tagging is a much more reliable means of finding things, hence the discussion we're having here.

    20. Re:Don't agree by Ethan+Allison · · Score: 1
      How about XML?

      <?xml version="1.0" encoding="utf-8" ?>
      <tags>
      <tag>yes</tag>
      <tag>no</tag>
      <tag>Cowboy Neal</tag>
      <tag>1&lt;2</tag>
      </tags>
      etc.
    21. Re:Don't agree by Anonymous Coward · · Score: 0

      <dupe>
  2. Automatic tagging by drcoppersmith · · Score: 5, Insightful

    I'm inclined to disagree that 'tags' are the answer here. I wrote my masters thesis on a method automatically generating semantic webs from plaintext. It's a huge problem with about a dozen different stages, but I had backing in all of my research from the psycholinguistics and computer-science field.

    Herein lies the rub: You're never going to get everyone to agree on a set of appropriate tags. Even if you do, you'll never have them uniformly applied (well I find that humorous but you have it tagged as inappropriate).

    There are other solutions here, such as automatic semantic generation. Hey, I never said it was an easy solution, but it's one that I'm certain can be accomplished. Flame away ;-)

    1. Re:Automatic tagging by mangu · · Score: 4, Informative
      I wrote my masters thesis on a method automatically generating semantic webs from plaintext.


      In the end, this could be said to be one of the central problems in AI. Basically, this is dimensionality reduction. People have been trying to do this manually for a long time. The Encyclopaedia Britannica's Propaedica is an example of a tentative semantic web for all human knowledge, but it's so inefficient that it's of very little use by a human, not to mention by automatic mechanisms.


      You're never going to get everyone to agree on a set of appropriate tags ... There are other solutions here, such as automatic semantic generation


      I believe it could be done if it were an automatically generated tag set. If it could be proven mathematically optimal in a certain context, it would be hard for anyone to disagree.

    2. Re:Automatic tagging by remmelt · · Score: 4, Interesting

      Tags are probably very community based, so they would only make sense within that community. (!itsatrap wouldn't work so well on iloveponies.co.ae). That said, why make tags which are meaningless to other communities or have vastly different meanings to other people available as a sorting or searching option? Sure, you could make some pretty mean stats proving any point you'd like (bad grammar in tags up 14.8% from last year! tag "yes" used in 87% of all blogs, world population feeling positive!) but I don't see the point.

      Also, anyone trying to make a serious argument containing the word "blogosphere" should really try and get out more. Come on people, it's not world hunger we're solving here. Viz: http://coolestshop.com/headline-blog.html

    3. Re:Automatic tagging by maxume · · Score: 5, Insightful

      The article is mostly talking about standardizing the envelope, not the message, which is to say, how do you share/create a two word tag, and how to you specify exactly what is supposed to be described by that tag, and how do you share that in a useful way.

      The fact that someone thinks something is funny and someone else thinks it is inappropriate is useful information to gather, if you get 5000 funny and 5 inappropriate, you have a lot more information than if you have nothing at all, but even in you get 10 and 10 you still have more information, which is probably a good thing.

      --
      Nerd rage is the funniest rage.
    4. Re:Automatic tagging by drcoppersmith · · Score: 4, Interesting

      There are a lot of instances of manual tagging, and I agree with you that they're just too cumbersome (as does almost an entire field of psycholinguists [if you think you can get all of them to agree on anything you're sorely mistaken. They'll disagree just because they can]).

      The automatically generated tags are exactly what I was talking about. I didn't get terribly explicit with my ideas, but you seem to be going in the same direction I was. Getting the software to both tag incoming documents and categorize the semantic webs generated by each is the key to some 'universal' tagging sytem. This way we have maximally efficient tags along with a standardized definition for each and (perhaps most importantly) an automatic way of tagging all the documents to be processed. No room for the "13 year old cheerleader tags" as someone so eloquently put before.

      We still have the problem of naming the 'generic' tag categories generated by the software... The solution for that one is a lot hazier, though important. I don't think anyone will go looking for 'category 12233242' to find 'academic humor'.

    5. Re:Automatic tagging by djupedal · · Score: 1

      " I believe it could be done if it were an automatically generated tag set...it would be hard for anyone to disagree."

      Gotta luv those ifs - With English due to be a minority language before ye' know it, and since I already know Chinese/Japanese/Korean, let's just jump right ahead and use strokes. What? You don't have a clue? But what about the math-proven, optimally certain, shit-in-my-pants if it ain't true proposal ya'll just laid out...? Lead by example, ok?

    6. Re:Automatic tagging by drcoppersmith · · Score: 1

      Standardizing the envelope isn't too interesting to me, but the potential of standardizing tagging itself leaves a lot more room for creativity for me. If we were to implement such a 'standardized tagging' scheme, we would need an envelope that made it work, but I'd rather think about the semantics than the technical side... personal preference.

      I wholeheartedly agree with your take on the inappropriate/funny discussion. I can't say that I ever thought in terms of that before, since I've always looked at this from an 'automatic tagging' point of view, where one program is categorizing all of them. Though you bring me to a very interesting idea: We can categorize via machine the way we categorize IRL: consensus. We could make a population of agents tagging on different criteria (as you would expect from a group of people) and take the consensus (or some composite) as the tag. -=scurries off to write code=-

    7. Re:Automatic tagging by Anonymous Coward · · Score: 0

      Do you have a link to your thesis? I'm interested in reading it. Thanks!

    8. Re:Automatic tagging by drcoppersmith · · Score: 2, Interesting

      I agree with your take on tags being community based. I think there's more use for this out there, such as categorizing communities, looking at the underlying semantics of a website, determining the focus of a company, or summarizing the entirety of a body of research (and more interestingly, categorizig what is part of and what is not part of that body of research).

      This is just a problem I've worked on for a few years and have always had a small fascination with, I'm glad to share it (both in the mundane and fantastic applications).

    9. Re:Automatic tagging by jimmyswimmy · · Score: 1

      I think the best demonstration that a problem exists is on this story - Slashdot's tags are currently: tags, tagging, no, tag.

      A good solution would be to find the linguistic similarities between the selected tags and meld them together. I get no extra information by using the three words "tag" "tagging" and "tags" which I would not have received from selecting only "tagging" or perhaps "tag". Find the similar word roots and use the appropriate one (probably the same as the most often-selected of the lot).

      --

      Just my $0.55 (US inflation, 1774-2008, for $0.02)
    10. Re:Automatic tagging by maxume · · Score: 2, Insightful
      --
      Nerd rage is the funniest rage.
    11. Re:Automatic tagging by carpeweb · · Score: 2, Funny

      go looking for 'category 12233242' to find 'academic humor'.

      Isn't this recursive?

    12. Re:Automatic tagging by jfmiller · · Score: 1

      I wrote my masters thesis on a method automatically generating semantic webs from plaintext.

      I would really be interested in reading this if it could be posted somewhere

      --
      Strive to make your client happy, not necessarly give them what they ask for
    13. Re:Automatic tagging by drcoppersmith · · Score: 1

      I agree. Whatever tagging software we're using should account for the fact that 'tag' and 'tags' are essentially the same thing, despite the fact that "tag" != "tags". That's one place where a unified 'tag envelope' or 'tag exchange program' would be very welcomed, I imagine.

    14. Re:Automatic tagging by drcoppersmith · · Score: 1

      Seeing as it has possibility for future software development (of which I plan to pursue) I don't want to put it where the entirety of the internet can get ahold of it, but I'd be quite happy to share it with a few interested parties. Please contact me via email and we'll chat. Here's my Bio so you know I'm for real ;-)... oh yeah, and there's some contact info there too...

    15. Re:Automatic tagging by smallpaul · · Score: 1

      Herein lies the rub: You're never going to get everyone to agree on a set of appropriate tags.

      Yeah. That's not what TFA is about. You should read it.

    16. Re:Automatic tagging by Anonymous Coward · · Score: 0

      I've created a way of accurately tagging blog posts in a specific domain using bayesian classification with a few additional layers.

      The rub is that the classification is highly domain sepecific. I'm currently working on adapting the same approach to a more generic, and its not working out to bad.

    17. Re:Automatic tagging by bunions · · Score: 1

      > I think the best demonstration that a problem exists is on this story - Slashdot's tags are currently: tags, tagging, no, tag.

      Oh, there's a problem, all right. The problem is that tags are worthless. As noted by pretty much everyone, 90% of the tags are 'yes, no, maybe, fud, notfud, itsatrap, thinkofthechildren' and words that are already in the title of the article. This is not helpful, this is noise. I have no idea what the tagging system is supposed to help with, other than providing everyone a chance to be anonymously snarky.

      --
      there is no need to sign your posts. this isn't usenet. your username is right there above your post. stop it.
    18. Re:Automatic tagging by Zaphod2016 · · Score: 1
      Very interesting read, but I am going to have to disagree with the author. Consider this example:
      • Page 1: Cyberduck is an open-source FTP client for Mac OS X.
      • Page 2: In order to upload data to your web host, you can use FTP.
      • Page 3: Our FTP can be used via our JAVA system, or with the client of your choice.
      • Page 4: All current Macs come preloaded with OS X.

      User query=> I use a mac. How do I upload?

      Intelligent web answer=> Download Cyberduck [link]here[/link], it uses FTP to upload data to your web host.

      With information scattered about the way it is, I can see many, many applications for syllogisms in "search 2.0".
    19. Re:Automatic tagging by meta-monkey · · Score: 1
      I believe it could be done if it were an automatically generated tag set. If it could be proven mathematically optimal in a certain context, it would be hard for anyone to disagree.


      An automatically generated set of tags for a document? Didn't the GP just say that was impossible because no one can agree on the tags? Well I thought dead baby jokes were inappropriate, but the computer mathematically proves they're funny. So I think they're funny now! Right. That's going to happen.
      --
      We don't have a state-run media we have a media-run state.
    20. Re:Automatic tagging by maxume · · Score: 1

      Quoting from the article:

      "This example sets the pattern for descriptions of the Semantic Web. First, take some well-known problem. Next, misconstrue it so that the hard part is made to seem trivial and the trivial part hard. Finally, congratulate yourself for solving the trivial part."

      The hard part of using ftp to transfer files to a website isn't finding a program to do it with, it is understanding where to put the files and how to log on and things like that. Note that your page 3 had a preexisting solution to the 'problem'.

      The point isn't that you can't do stuff with it, it is that the level of 'interesting' that applies to the things you can do is way over stated.

      --
      Nerd rage is the funniest rage.
    21. Re:Automatic tagging by sohooo · · Score: 1

      I'm very interested in this topic - is your masters thesis available for interested people? I would love to take a look at it and learn something.

    22. Re:Automatic tagging by Anonymous Coward · · Score: 0

      Then the obvious solution would be to use a mix.

      You can't do manual tagging of a large body of works, and you can't get a meaningful description from your automatic tags, but you can get a correlation between the two.

      You do automatic tagging of your large body of text (blog posts), and let people also tag them manually. You then look for correlations between the auto-tag and the manual-tag.

      If 99% of the posts manually labeled academic_humor are also labeled 'category 12233242', then it's probable that these tags match (you have to do the whole train/test split to avoid over-training, of course).

      Then you start the refining process (Confirm your hypothesis). For example, maybe category 'university' will also match academic_humor, so what you do is, when a new post comes in, you add to the interface a 'vote for category' button for the categories users think it matches. If your categorization doesn't actually match, you get the feedback on false positives and can retrain automatically.

      This also gives you standardization for free. If two tags from different sites match a single category, you can ask users whether it's the same. You can "find similar posts in other blogs", even filtering for/against posts that link at the same article. Also new posts are biased towards your standardized categories, as people will lazily just click the "it's_a_trap!" Yes button rather than type out itsatrap.

      Also note that even non-standardized tags can be useful for classification. One of my favorite program analysis/data mining papers (from ICSE a few years ago), tried to cluster source code by similarity using both grammar (caller/callee are related, etc) and identifiers. Even if you don't have standard variable names, names like 'window' would be more likely to appear in windowing code. They used a PCA-like analysis to help them distinguish between file with both socket and window and those with drawing and window, plus filtering out common variables like i, j, k. They had some interesting results, like finding multiple implementations of linked lists in Mosaic by this method (head, next, etc).

    23. Re:Automatic tagging by illuminatedwax · · Score: 1
      If it could be proven mathematically optimal in a certain context, it would be hard for anyone to disagree.

      You've got a lot to learn about human nature.
      --
      Did you ever notice that *nix doesn't even cover Linux?
    24. Re:Automatic tagging by Nazlfrag · · Score: 1

      You got it in one. A standard is needed, one that appends a slashdot: or iloveponies: before the tag would enable coders to distinguish the meaning and context accurately based on the community it came from, and makes more sense than standardizing the input field.

    25. Re:Automatic tagging by Anonymous Coward · · Score: 0

      You're right. The meaning of the "scat" tag is not the same in Jazz lovers forums as in erotica archives.

  3. The other option by T-Ranger · · Score: 5, Funny

    Is not to tag everything like 13 year old cheerleaders.

    1. Re:The other option by archen · · Score: 4, Funny

      omg.ponies

    2. Re:The other option by _Sharp'r_ · · Score: 2, Interesting

      Speaking of /., it'd be nice if someone would implement negative tags so that the community can remove obviously inappropriate tags. Something like using -itsatrap would work nicely.

      --
      The party of stupid and the party of evil get together and do something both stupid and evil, then call it bipartisan.
    3. Re:The other option by jZnat · · Score: 5, Informative

      !itsatrap

      That's existed since tags started, so problem solved!

      --
      'Yes, firefox is indeed greater than women. Can women block pops up for you? No. Can Firefox show you naked women? Yes.'
    4. Re:The other option by meta-monkey · · Score: 1

      But doesn't !itsatrap imply that it is not a trap?

      --
      We don't have a state-run media we have a media-run state.
    5. Re:The other option by teneighty · · Score: 1

      I am a 13 year old cheerleader you insensitive clod!

    6. Re:The other option by _Sharp'r_ · · Score: 2, Insightful

      I was looking more for something that would actually remove the tag from the list of tags, giving people a chance to clean up the list, not to add additional useless tags....

      --
      The party of stupid and the party of evil get together and do something both stupid and evil, then call it bipartisan.
    7. Re:The other option by ear1grey · · Score: 1

      I agree, tagged as "goslashdotgo"

    8. Re:The other option by jZnat · · Score: 1

      That's what the bang prefix does; it's an anti-tag. It's not like the "notfud" tag; it's the technical opposite of "fud" in order to remove that tag.

      --
      'Yes, firefox is indeed greater than women. Can women block pops up for you? No. Can Firefox show you naked women? Yes.'
  4. One Key Point by Azarael · · Score: 4, Insightful

    How do you standardize something that has not been widely implemented before? It's great to say that it would be good idea to have one standard practice for tagging, but which one? There's no reason to make a huge fuss about this until it a least one clear contender for standardization emerges (which will probably happen on its own).

    1. Re:One Key Point by Qzukk · · Score: 4, Funny

      Obviously we have to find the lowest common denominator between all the different tagging systems.

      I propose that we standardize the following tags:
      thissux
      omgthisrox
      That should cover 100% of the content in a manner that everyone can relate to.

      --
      If I have been able to see further than others, it is because I bought a pair of binoculars.
    2. Re:One Key Point by T-Ranger · · Score: 3, Insightful

      Well, not quite. Reading the blog post the problem lies with two areas: technology and linguistics.

      For technology, as an example, how do you quote things? How do you separate tokens? Do you use StudlyCaps and spaces? "Quoted words", and commas? If the later, what about nested quotes?

      Bullshit question. The question is solved. Use XML. (Yeah, well, it is the web). We don't need Yet Another CSV "standard". Tags may be presented as lists, in spans, or WTF ever. But if you are talking about storage and transmission, then store the tokens separately, and transmit them in an unambiguous format; in 2007, on the web, the solutions are implementation-specific and XML, respectively.

      For linguistics, thats harder. Nouns or verbs? Talk to a librarian, Im sure there are volumes of information on the right way. But I don't care, as I'm still disgusted that the technology problem even exists.

      Right now it seems there is little discussion on the problem. Right now, if implementations are trying to reinvent data encoding schemes either the implementations are totally brain dead (and need a kick in the ass from an outside force), or are completely oblivious to the problems they are encoding into there core features (and thus still need a kick in the ass). This is so bad, its worse then wrong. You have to try to get to the point of being wrong.

      Of course, I don't care because tags are stupid. OTOH, perhaps I would care if they at least were implemented in a potentially useful way.

    3. Re:One Key Point by robotninja · · Score: 3, Insightful

      I can't believe I'm reading this -- it's a sad day for information science (I'm a librarian) when many otherwise knowledgeable, tech-savvy people are blinded by Web-2.0-speak. Let me reiterate another poster's comment:

      Tags are keywords. More specifically, they are subject keywords.

      If you can wrap your head around this idea, then you might realize what the author is talking about is a list of 'standardized' subject headings. You may know this by its common street name: a thesaurus (although some people prefer to use the terms ontology, taxonomy, or controlled vocabulary).

      There are plenty of well-established, long-standing, open thesauri out there - a few examples:
      MeSH
      LCSH (Library of Congress)
      CSH (National Library of Canada)
      ... and hundreds more, in nearly every language.

      There are even ANSI and ISO guideline standards for how to develop monolingual and multilingual thesauri for specific subject areas. This practice has existed well before the advent of the Internet, since before the first libraries even. Over the past hundred years or so, the practice has become highly refined in order to facilitate the practice of indexing.

      That's right, when you "tag" a page, you're actually indexing it. But you can call it "tagging" if that makes you feel cooler.

    4. Re:One Key Point by Azarael · · Score: 1

      I don't doubt that that is the case, but I do think that that approach is over-kill for a problem that should have a quick and dirty solution. Associating short text strings with a particular piece of content is easy, and any programmer who does any coding for the web can implement that (why we have so many approaches in the first place). On the other hand, implementing a standard is usually pretty involved and mostly best left to 'experts'. To use the thesauri that you mention, would you not have to reference whole books full of subject headings to stick to the standard? There is also the issue of keeping up to date with the latest set of headings. Sure these things can all be implemented in libraries or with some kind of web service, but the knowledge required to do those things is beyond many inexperienced programmers. Any standard for tagging should be as easily accessible to novice site developers as to the big players (something like RSS comes to mind).

    5. Re:One Key Point by Anonymous Coward · · Score: 0

      you forgot 'meh.'

  5. itsatrap by Megane · · Score: 1

    Er, I mean notatrap!

    One big problem is that people can just make them up, then you get the "greifers" who put bogus joke tags all over the place.

    (remember, the opposite of "itsatrap" is "!itsatrap", not "notatrap"!)

    --
    #naabhaprzrag, #sverubfr-000, #agi-fcbafberq, negvpyr[pynff*=' negvpyr-ary-'] { qvfcynl: abar !vzcbegnag; }
    1. Re:itsatrap by drcoppersmith · · Score: 1

      I think you'll agree with me that "slashdotters" are not "everyone". I mean, they're clearly the creme de la creme of "everyone", but what about all those lesser beings?

    2. Re:itsatrap by Pink+Tinkletini · · Score: 2, Funny

      Um, no. I don't care what the FAQ says; "!itsatrap" is hard to distinguish from "itsatrap." Maybe it works in monospaced code, but not so well in proportional font.

      People who insist on sticking to the fucking rules are the number one problem facing today's society, methinks.

    3. Re:itsatrap by discord5 · · Score: 1
      People who insist on sticking to the fucking rules are the number one problem facing today's society, methinks.

      Way to go against the machine! Fight the power of the slashdot tag system FAQ, and become a legendary internet hero/freedomfighter. Free yourself from this evil oppressor, and liberate your fellow slashdot (ab)users from these bonds.

      Grow up

    4. Re:itsatrap by ravenlock · · Score: 1

      We'll just tag them all "irrelevant". :)

  6. Didn't they have this problem in Babel? by zappepcs · · Score: 4, Insightful

    How to share and categorize information is an ages old problem. One man's trash is another man's treasure, likewise, one man's bread is another man's dietary problem.

    I'm not sure, but haven't we already figured out that tagging would require more tags than the actual information being tagged to accomplish what the original poster was asking for?

  7. XML? by gravesb · · Score: 1

    Why not use an XML standard? If sites used a or similar, then people could put whatever they wanted inside. It would be simple for automated tools and users to find the tags and search against them. One of the most useful things that are similar to tags is the alt field in img. It allows people to search for photos online. Of course, this is open to abuse like anything else, and weren't search engines based on meta tags in the header at one point until people took advantage of them? Still, if tags are going to be standardized, I think it should be through a mark-up standard, and allows people to be as creative as they want with the actual tag itself.

    --
    http://bgcommonsense.blogspot.com
    1. Re:XML? by Azarael · · Score: 1

      Syntactically that would probably work fine, however, it won't add anything in the way of semantics, which is what the article covers. i.e. What types of text to actually put inside of the tags, spaces/no-spaces, etc.

    2. Re:XML? by T-Ranger · · Score: 2, Insightful

      Words have spaces between them. A tag may have multiple words and be an independent thought. Store it as English demands, with spaces. The Space, NoSpace question is only relevant if you are using an encoding scheme that is broken. Verb/Noun is a different question, but space/nospace, quotes, and BS like that is quickly solved with existing technology.

    3. Re:XML? by Anonymous Coward · · Score: 0

      and separating words/phrases/values with commas is that hard?

  8. Isn't needed by NineNine · · Score: 1, Troll

    A tagging standard isn't needed. Tags are just keyword that describe something. They're *WORDS* for Christs's sake. Just screen scrape them if you have to. Put them in a database. Read them aloud with a British accent, if you'd like. But if you can't parse plain old words, then I don't think that any kind of "standard" is going to help you.

    In the article, this guy is saying that some tags have spaces in them, and some don't, so that makes it hard. How about "where lcase(tags) like '%vista%'? How hard is that?

    This guy is an idiot.

    1. Re:Isn't needed by LiquidCoooled · · Score: 1

      Results 1-36000000 for tag "%vista%"

      VISTA - Visible and Infrared Survey Telescope for Astronomy
      VISTA, Visible & Infrared Survey Telescope for Astronomy.
      www.vista.ac.uk/ - 24k - Cached - Similar pages

      The Blind and Visually Impaired - Local Services provided by Vista
      Offer services and support to the blind and visually impaired in Leicestershire and Rutland. Information about their activities, events, newsletter, ...
      www.vistablind.org.uk/ - 45k - Cached - Similar pages

      Vista
      VISTA - Statistical Analysis and display of Geometric and Other Data. VISTA is an interactive analytical and statistical program. ...
      www.ccdc.cam.ac.uk/products/csd_system/vista/ - 13k - Cached - Similar pages

      VISTA
      The UK ATC is responsible for the design and construction of VISTA, ... More information about VISTA is available from the ATC project pages, the VISTA ...
      www.roe.ac.uk/ukatc/projects/vista/ - 9k - Cached - Similar pages

      Vista is currently under construction
      Thank you for visiting www.vista-cctv.com. Our site is currently under development, but in the meantime information on the full range of Vista CCTV products ...
      www.vista-cctv.com/ - 4k - Cached - Similar pages

      Coventry VISTA - Visually Impaired and Sighted Tandem Association
      Visually Impaired and Tandem Cycling Association. Events, rides diary, ride routes and social diary.
      www.vista.org.uk/ - 3k - Cached - Similar pages

      VISTA - Volunteers In Service To America
      A collection of resources and stories put together by former Americorps members and friends of the VISTA program.
      www.friendsofvista.org/ - 4k - Cached - Similar pages

      Or did you think it only meant one thing?

      --
      liqbase :: faster than paper
    2. Re:Isn't needed by NineNine · · Score: 1

      OK, then.

      where lcase(tag) like '%windows%' and lcase(tag) like '%vista%'

      Whatever. It's not rocket science.

    3. Re:Isn't needed by LiquidCoooled · · Score: 1

      Thats a little better, but context is everything.
      Your algorythm would work like google and would not require tags of any kind.
      Tags are useful to subcategorize content into neat little blocks.
      On a TV site, the tag "24" would be useful for finding information about Jack Bauer, but would be useless when talking about petrol stations open all night.

      Each tag (like your vista example) needs some kind of upversion (related to the originating context) to make the tags specific to the subject and this is the root problem with generic tags.

      --
      liqbase :: faster than paper
  9. People actually pay that much attention to tags? by east+coast · · Score: 4, Insightful

    I don't feel that tags have enough significance behind them to merit a standard. I'd be more concerned with truth in journalism first, for my part.

    --
    Dedicated Cthulhu Cultist since 4523 BC.
  10. Hopeless by bigmouth_strikes · · Score: 4, Insightful

    Trying to standardize tags in the context of standardizing what they are, is hopeless. It'll be like the Unicode standard; too complex to use in its entirety.

    But to standardize the format of tags and to standardize how to exchange tags between systems, is a great idea.

    --
    Oh, I can't help quoting you because everything that you said rings true
    1. Re:Hopeless by Simetrical · · Score: 1

      It'll be like the Unicode standard; too complex to use in its entirety.

      I think the many full-fledged Unicode implementations available would disagree that it's "too complex to use in its entirety". On the font side, certainly current computers don't come packaged with enough decent Unicode fonts, and automatic character substitution can be ugly, but these are problems that will abate with time. Doesn't Vista now have proper support for characters outside the BMP, for instance, with real UTF-16 used for internal encoding?

      --
      MediaWiki developer, Total War Center sysadmin
    2. Re:Hopeless by doom · · Score: 1

      But to standardize the format of tags and to standardize how to exchange tags between systems, is a great idea.

      Great idea. Have you ever heard of XML?

      (Since the problem we'd like to solve is hard, let's solve an easy problem instead. Even though that's been done already.)

  11. I Completely Agree... by setirw · · Score: 5, Funny

    Which is why I tagged this article with "njkewjdkewd."

    --
    This message printed on 100% post-consumer recycled electrons.
    1. Re:I Completely Agree... by KincaidKMF · · Score: 5, Funny

      How random... I was looking through tagged articles for more information about the "New Journal for Keeping Every Word a Just and Defined Kooky Emphasis While Describing" and popped over here. And all this time I thought tags were working.

  12. I'm fired, aren't I? by elzahir · · Score: 4, Insightful

    He said "blogosphere." Instantly, I don't care.

    Only thing worse would be something like, I dunno, "tags should be a Web 2.0 standard" or somesuch.

    Excuse me, but "proactive" and "paradigm"? Aren't these just buzzwords that dumb people use to sound important?

    --
    For a successful technology, reality must take precedence over public relations, for nature cannot be fooled - R Feynman
    1. Re:I'm fired, aren't I? by Anonymous Coward · · Score: 0

      "For a successful technology, reality must take precedence over public relations, for nature cannot be fooled"

      I guess your hatred of neologisms and 'buzzwords' doesn't extend to manager-speak.

    2. Re:I'm fired, aren't I? by Frnknstn · · Score: 5, Funny

      "buzzword" is a term used by cynical people trying to sound important.

      --
      If it's in you sig, it's in your post.
    3. Re:I'm fired, aren't I? by Anonymous Coward · · Score: 2, Funny

      "Cynical" is a term used by anal-retentive people trying to sound wise.

    4. Re:I'm fired, aren't I? by Anonymous Coward · · Score: 0

      "anal-retentive" is a term used by closet gay people gagging for a good hard pounding

    5. Re:I'm fired, aren't I? by Anonymous Coward · · Score: 0

      function putdown (x,y,z) "(" + eval (x) + ") is a term used by " + y + " people trying to sound " + z + ".")
      putdown (lambda { putdown (x,y,z); }, "recursion theorists", "relevant"); }
      Your head asplode. :)
    6. Re:I'm fired, aren't I? by Anonymous Coward · · Score: 0

      Too easy.

    7. Re:I'm fired, aren't I? by hotdiggitydawg · · Score: 1

      I call "bingo"!

  13. A standard for tagging by AlanS2002 · · Score: 2, Insightful

    Would make tagging almost useless. There are many different ways you can view one thing and to limit the expressions used to tag something limits the possibilities of communication. On the other hand leaving the tags available as open ended can turn out to be redundant, you may as well just tag something as its complete description. Perhaps the best way would just be to let people make up their own minds.

    --
    Not all conservatives are stupid,
    but it is true that most stupid people are conservative.
    - Hume
    1. Re:A standard for tagging by SchizoDuckie · · Score: 2, Insightful

      I don't think there is a real problem here... The problem mainly is the displaying of tags. The 'tag cloud' (and the person that invented it) should be banned from the internet and something better will have to be invented in the next couple of years. Tags that work on site x don't have to work on site y and don't even have to have any relevance so why a standard?

      --
      Quack damn you!
  14. Is he talking about HTML tags? by Anonymous Coward · · Score: 0

    ...or is he just another controlfreak whose article was tagged with vocabulary he didn't expect.

    Tagging (and Wikipedia) don't need your rules. You can always close your eyes if you don't like it.

    On the other side I know Zonk would be desperate to censor tag vocabulary like "stupid", "lame", "FUD" and (of course) "No".

  15. Hyphens. by caluml · · Score: 2, Funny

    I must say that the Slashdot way of tagging irks me. I think tags should have hyphens between words, much like they do in their "from the the-slow-down dept". Makes it more readable.
    Any-tagging-stuff-I-have-to-write-will-use-hyphens as who knows what analbum is?

    1. Re:Hyphens. by Lissajous · · Score: 1
      ...who knows what analbum is?
      Umm...tautological?
    2. Re:Hyphens. by Chryana · · Score: 5, Insightful

      I would add to this that slashdot tags tend to be not very useful.

      Most of the time, the tags have little to do with the actual article (eg. yes, no, maybe, fud, notfud, flamebait). I thought the purpose of tags was to be able to find an article easily later on when it has been archived, and the usefulness of the tags I just mentioned for this purpose is dubious at best. I do not pretend to have a solution to this problem, but I think the situation would be improved if the editors or maybe the /.ers who wrote the top rated comment where the only people allowed to set the tags.

    3. Re:Hyphens. by jamie · · Score: 1

      Yes, we'd love to have readers making more use of tags to describe / categorize / save / locate content. That's their best use, more helpful than expressing an opinion or making an in-joke. We think that's the main reason for tags and our FAQ talks about this a bit. I often use tags that way on Slashdot, the way they're used on del.icio.us, bookmarking stuff for later.

      But of course the point of tags is not to have top-down decisions about what words to associate with content, so we're not going to limit who's allowed to tag.

      That said, we are working on systems to identify which Slashdot users tag in the ways that benefit the site (categorizations and value judgments) and to have the site make more use of those users' contributions.

    4. Re:Hyphens. by Anonymous Coward · · Score: 1, Interesting
      I do not pretend to have a solution to this problem, but I think the situation would be improved if the editors or maybe the /.ers who wrote the top rated comment where the only people allowed to set the tags.
      I think a bigger part of the solution is to stop asking (leading/retorical/flaimbait) questions in the submission. When you do that all of the discussion gravitates to that idiotic question rather than the meaningfull content of the blurb/article.
    5. Re:Hyphens. by tehcyder · · Score: 1
      who knows what analbum is?
      anal bum is pretty tautological in the UK.
      --
      To have a right to do a thing is not at all the same as to be right in doing it
  16. Bullshit by Colin+Smith · · Score: 1, Insightful

    Even more importantly, tags were meant to be universal and compatible: a medium of sharing and conveying info across the blogosphere Oh my god what rubbish.

    --
    Deleted
    1. Re:Bullshit by arun_s · · Score: 2, Insightful

      Yeah, I agree. I personally feel tags are hyped way beyond their actual worth. I couldn't care less about 'conveying info across the blogosphere', but I'm genuinely interested in organising my own information neater (e.g. my bookamrks).
      Look at gmail, frinstance. Labels replace folders, and a mail can have more than one label. More importantly, they're predefined, and the interface doesn;t really allow you to be prolific with your tagging.
      Compare this with the crappy way del.icio.us allows you to put a billion tags for each link, and I can see why its such a mess.
      I agree with others here that something like tagging oughtn't to be standardised or they'll lose their whole purpose, but really, there are other reasonable solutions that atleast help in atleast reducing the amount of craptagging going on. I've experimented with Blinklist and del.icio.us, and my bookmarks in the former are far better tagged because I can actually see my existing tags while bookmarking a new site.

      --
      I can explain it for you, but I can't understand it for you.
    2. Re:Bullshit by evanbd · · Score: 1
      Actually, I'd like it if gmail let me be prolific with my tagging. Not infrequently I find that I'm looking for an email that never actually references the obvious search terms. Basically I'd like to use labels to create collections of things I look at a lot (eg a label for each mailing list; a label for work related stuff; a label for family email, etc), but I don't want dozens and dozens of different emails. I don't think it would be hard for me to notice an email coming in, decide "this might be useful to find later," and add a few keywords to it.

      I guess I really want a pair of things -- I want to be able to only have some of the labels / tags show up in the box on the left, and I want a quicker way than the drop down menu to add labels / tags to a message. (Like a text box with autocomplete like slashdot tags in addition to the drop down menu. Perhaps the menu should only show the ones in the box on the left.)

  17. itsatrap by transporter_ii · · Score: 2, Funny

    You're never going to get everyone to agree on a set of appropriate tags.

    Then how come everyone on here has agreed on a handful of standard tags:

    itsatrap fud haha stupid

    ?????

    transporter_ii

    --
    Doctors destroy health, lawyers destroy justice, universities destroy knowledge, religion destroys spirituality
  18. tagging by AcidLacedPenguiN · · Score: 4, Funny

    and here I thought the standard for tagging was for the first person to agree or disagree with the headline, then the next has to immediately disagree with the first person. 5 minutes down the line if no one has added another tag, the third must disgree with BOTH the first and the second poster. Finally, a serious slashdotter will show up to add a relevant tag, followed by the oh so frequent itsatrap and slownewsday tags.

    --
    disclaimer: I've been known to store numbers in my ass for which to dig out when quantities are required.
    1. Re:tagging by AstrumPreliator · · Score: 1

      The parent should be modded insightful, not funny. As it is the tagging system is about as useful as the parent describes; that is to say it's not useful at all.

  19. Tags are for things that AREN'T standardized by vidarh · · Score: 3, Insightful

    Tags are human assigned labels for something that we don't have better meta-data for, or where we don't want to be bothered with formalism. If you want something formal, go use a proper taxonomy/ontology and put bucketloads of OWL or RDF-schema data on your site to define relationships, or use format with well defined semantics to add information. Noone is stopping you, and there are cases where formally defining relationships is worthwhile, such as when you want software agents to be able to infer stuff about the data. But that's not what tagging is used for. Tagging is used for ad-hoc manual classification in situations where it is good enough

    1. Re:Tags are for things that AREN'T standardized by Simetrical · · Score: 1

      Tags are human assigned labels for something that we don't have better meta-data for, or where we don't want to be bothered with formalism. If you want something formal, go use a proper taxonomy/ontology and put bucketloads of OWL or RDF-schema data on your site to define relationships, or use format with well defined semantics to add information.

      You are confusing formality of semantics with formality of syntax. The latter is mostly what's suggested in the article, not the former. He was suggesting that sites agree on how to handle spaces in tag names, what delimiters to use, and whether to prefer nouns or verbs, not whether you should use "Vista" or "Windows Vista".

      --
      MediaWiki developer, Total War Center sysadmin
  20. XSLT for Tags? by null+etc. · · Score: 2, Interesting

    Similar to how XML uses XSLT to transform XML documents from one application to another, it wouldn't be a half-bad idea to have a Tag Transformation Language. Organizations with a lot of market share can define their own tag standards, and then people can optionally specify the transformation between their own local ontologies and the established tag standards. This has the advantage of being participation-driven.

  21. I agree! by tbannist · · Score: 1

    And while we're at it, we should get everyone to agree to speak the same language and believe in the same religion!

    --
    Fanatically anti-fanatical
  22. yes, by Anonymous Coward · · Score: 0

    such a shame we can't tag _everything_ in our everyday life :/ the world is in dischord, because we have no set standard scheme for tagging everything around us with the opinions of OTHERS rather than building our own.. stupid dimwit..

  23. Already done by Anonymous Coward · · Score: 0

    This is what RDF is for. You distinguish between your idea of the "fud" tag and somebody else's idea of the "fud" tag because the relationships are expressed as URIs. You refer to the subjects of the tagging with URIs too, meaning that you don't need special tagging behaviour built into whatever you are tagging - you just need a URI to point to. And because you can put an RDF file anywhere, it's totally decentralised.

    This has already beeen designed and implemented in Amaya and Annotea by the W3C. The correct place to solve this is in the client, not in each individual website.

  24. Too many chefs, etc. by Pope · · Score: 3, Informative

    Tagging, like anything else designed to be helpful, simply won't work if *anything* is allowed. For every person who tags something "correctly" in an effort to do good, how many people will deliberately mis-tag something to produce misleading results?

    Better to get rid of tagging altogether and go back to text searching! :)

    --
    It doesn't mean much now, it's built for the future.
    1. Re:Too many chefs, etc. by Anonymous Coward · · Score: 0

      Yeah, letting people write any tags they want won't work. Just like that wikipedia nonsense, this'll be forgott... oh.

    2. Re:Too many chefs, etc. by eipipuz · · Score: 1

      By your train of thought. Let's hurry for wikipedia and every open source project is gonna crash.

      How about if we stop pretending that people care much about vandalism. Yes, there will always be some trolls under the bridge. But most people don't have a motivation to destroy common wealth.

    3. Re:Too many chefs, etc. by Brummund · · Score: 1

      By your train of thought. Let's hurry for wikipedia and every open source project is gonna crash.

      We often hear about the might of the pen, but the train of thought hits pretty hard as well

      http://observer.guardian.co.uk/uk_news/story/0,,19 90011,00.html

  25. Dateline? by Anonymous Coward · · Score: 0

    ...not to tag everything like 13 year old cheerleaders. Otherwise you may find yourself in a living room Stone Phillips.
  26. No.. and yes by slashmojo · · Score: 2, Interesting

    First of all tags are not exclusive to the blogosphere - they exist on the boardscape (see boardtracker for example) and of course on the many social nets and pretty much everywhere else.

    There are already microformats for defining tags which can and should be used.

    Tags are for building a folksonomy and created 'by the people' so are by their nature, to a certain extent, personalized and flexible.. what makes sense to you may make no sense to everyone else but so what? You made it, its good for you and thats good enough.. however chances are it will make sense to some other people anyway, no matter what or how you tag, so its all good.

    1. Re:No.. and yes by nuzak · · Score: 1

      > Tags are for building a folksonomy

      I can live with Web 2.0 and blogosphere, but you will never catch me saying "folksonomy" in any conversation except to deride it.

      It's keywords. Welcome to usenet. There's nothing to see here. Seriously, internet pop culture is like some sort of weird reverse cargo cult that believes it invented every trivial technology.

      --
      Done with slashdot, done with nerds, getting a life.
    2. Re:No.. and yes by slashmojo · · Score: 1

      Well its folksonomy as opposed to taxonomy.. its a good enough word to describe what it actually refers to. Makes a lot more sense than 'web2.0' anyway.. ;)

    3. Re:No.. and yes by Petrushka · · Score: 1

      its a good enough word to describe what it actually refers to.

      Ah! Based on this, I shall assume it means what it looks like it means. Without reading the WP article you pointed to, I can easily tell that it obviously and unambiguously refers to the naming and identification of ethnic groups -- just as "taxonomy" refers to the naming and identification of different categories.

      If that is not correct, then kindly forgive me if I join with the gp in deriding the term ... :-)

  27. Annotea Project by Lodragandraoidh · · Score: 1

    The Annotea Project is a W3C project that ties together standards that allow attaching metadata to web pages without altering the contents of the object.

    The Amaya web browser/editor is a W3C project that serves as a testbed for the consortium's standards - including an annotea implimentation (the most interesting part of the project imho).

    Basically, you can keep your own local metadata, or have a central shared resource with that implimentation. Of course, you could build your own implimentation that has other properties (merging/sharing annotations/tags etc...)

    --

    Lodragan Draoidh
    The more you explain it, the more I don't understand it. - Mark Twain
  28. Patent Tagging by JediHomer · · Score: 2, Funny

    Someone should patent tagging, license it out for a small cost and enforce a standard...

  29. World's best tagging system by nbauman · · Score: 2, Interesting

    I've talked to librarians and information scientists, and they talk about "controlled vocabulary". They told me one of the best systems was Pubmed http://www.ncbi.nlm.nih.gov/entrez/query.fcgi which is an index of essentially every article published in a peer-reviewed medical journal. Every article is "tagged" with Medical Subject Headings (MeSH) keywords, and you can search the database for those keywords. If they can use "heart" or "cardiac", they have to decide which one to use (they use "cardiac"). They have keywords to separate human studies from animal studies. Here's more explanation http://www.nlm.nih.gov/mesh/meshhome.html It's basically open source.

    A similar system in law is the Westlaw key word system. The New York Times used to have a great keyword index, but I can't find it in the NYT online.

  30. Standards aren't the be-all-end-all by scdeimos · · Score: 1

    Even if there was a tagging standard, the choice of tag(s) will still be up to the people applying the tags. Different people will have different interpretations.
    Just look at how well Genres worked out for MP3-ID3, especially on services like Gracenote where people would just upload any old cruft.

  31. The cycle goes on and on. by musterion · · Score: 1

    Hmmm .... I've got an idea--Why doesn't every one simply use the Library of Congress Subject Headings, or if you are French-speaking, RVM, or if you are Spanish Speaking--BIDEX. What? too restrictive? Not relevant? We don't like no stinkin' control.

    If you allow uncontrolled assignement of "tags" (aka keywords) you will get junk. Some people will be conscientious and try to do a good job, some will be "cute" and tag will the latest slang for their domain; some will take advantage and tag for their own nefarious purposes. You will have to deal with the junk/crud/cruft.

    If you impose some sort of control, you will have to deal with those who will flame about the restrictions, you will have to deal with endless discussion on what the correct "tag" is for any given subject and whether or not it offends some group or not.

    There is a term for this area of endeavour--Library Science! There has been discussion and work going on related to the web and indexing schemes for at least 10 years. try searching for "Network Knowledge Organization Schemes" or some variant.

  32. Social grouping? by Gunark · · Score: 1

    I'm surprised this isn't already common practice, but why not implement tag searching in a LikeMinds scheme. For example, when you're looking for items tagged "funny", you would be shown items that have been tagged as such by people who in the past have tagged things the same way you have. This way you have a better chance of finding things that really are funny to you, based on what you thought was funny in the past.

    Obviously though this would do nothing for Slashdot's tagging system which has (probably unintentionally) become a micro-comment system -- in the way, for example, that people tag articles whose titles are questions with yes/no answer tags.

  33. Evolution of Tags by BlueBoxSW.com · · Score: 1

    The only useful evolution of tags I can see is to give it a dot format, general to specific, such as:

    ->USA.Nevada.LasVegas
    ->Literature.HunterThompson

    Similar to OOP coding or Usenet. But, you wouldn't want to set a fixed standard, you would want it to evolve as it's used.

    1. Re:Evolution of Tags by LiquidCoooled · · Score: 1

      Perfect!

      All you have to do is map the tags

      slashdot.itsatrap = fark.Florida

      or something similar.

      --
      liqbase :: faster than paper
  34. partial hierarchy for tags by fantail · · Score: 1

    Rather than a standard, allowing people to use partial hierarchies might be enough for a sophisticated algorithm to figure out when tags mean approximately the same thing (given a large set of data).

    For example, given the example of Windows Vista from the article, tags indicating partial hierarchies might look like "software/windows vista" or "windows/vista". The first tag could be interpreted as: both the tags "software" and "windows vista", and that the tag "windows vista" is a sub-tag of "software".

    I've heard rumors of software that is capable of analyzing tags for large collections of objects to determine which tags are related and even make guesses as to when different tags mean the same thing. Allowing users to also say that one tag is a sub-tag of another might be enough to create full tag hierarchies and improve the ability to guess when two tags mean the same thing.

  35. What would we use the standard for? by MrBlic · · Score: 1

    The one time that tags were really useful to me were when I was asked to implement rounded corners on the portlets of a web page, and I was looking at what other people had done. I didn't find exactly what I was looking for until I checked del.icio.us for 'rounded' and found lots of other ways of rounding the corners of web content.

    I've been asked to implement tags on an existing discussion site, and I'm afraid it's going to turn out as poorly as the tags on slashdot. Unless the tagging system is used by people in a way that's meaningful to them, the quality of the tags is going to be poor.

    The tags have to fill a need for categorization, and there have to be people interested in those categories, then it will get used widely, and turn into something that even newcomers to the site would use.

    So far, del.icio.us has done well, because the tags are 'mine' and they mean what I want them to mean, and they categorize my bookmarks, I know what to expect when I use them.

    In a huge pile of similar things, like flickr, tags are the only categories that outsiders can use to narrow what they're interested in without text search.

    Tags are truly implemented differently based on the role that they serve. There's no way to standardize yet. If they were standardized, how would one site use the tags on another site? What would all the use-cases be? There's also no way to ensure quality of tags are accurate and thorough. Text search is reliable because it looks at the content itself. Has text search been standardized? I think not.

    Search and tags are web services that are exposed in different ways from different sites, for different purposes and that's about the best we can hope for for the time being.

    --
    Celebrate Excellence!
  36. the solution... by Bazman · · Score: 1, Interesting

    I just RTFA and apparently the biggest problem is whether you type your tags as "Windows Vista","Piece Of Crap" or Windows_Vista,Piece_Of_Crap or WindowsVista,PieceOfCrap, so that people who put tags on D.e.li.cio.us might get confused when putting tags on technofarti.com. Spaces? Quotes? Delimeters? Oh my. What shall we do.

      Basically, people are too dumb/lazy/stupid to read a one-line description of how to format their tags. How confusing can it be? You just show people how to do it in the form, e.g.

      Tags [ ] (eg dogs, "border collies", barking)

    or

      Tags [ ] (eg dogs,border_collies,barking)

    or

      Tags [ ] (eg dogs,borderCollies,barking)

    Now, do we need a standard, OR do we need people to be able to read instructions? Note that one of these choices is a specific, set-in-stone piece of information, the other is a general piece of advice that people would do well to follow for most of their lives (although being able to read instructions is no guarantee that following them is a good idea).

  37. Old problem, and you're not going to solve it. by Anonymous Coward · · Score: 1, Interesting

    Libraries have had a similar problem, for the past hundred years of so.

    Here is a book. Where do you put the book in the library, and how do
    you classify it so as to make it maximally useful for your [*] patrons?

    [*] That's important! Your patrons are distinct from mine!

    You can order all the books in the collection
    by accession date (when you got the book).

    You can order all the books by author's last name.

    You can order all the books by title.

    You can order all the books by subject. If you do this, you can use
    Dewey classification, LC classification, or something else.

    Suppose you just stick with LC classification.

    Even two libraries that have the same book, and both use LC
    classification, say, may classify a book differently. Say you
    have an AI book that is *the* seminal text covering how to do
    clustering via fuzzy logic. Do you put this with all the AI books?
    with all the clustering books? with all the fuzzy logic books?
    (and all three sets of books may be in different places.)

    Tagging content on the web represents a similar situation. If you
    use a 'standard term' to tag a text, different sets of users /
    customers / readers may not associate that 'standard term' with
    the meaning you intended. A given term or phrase can be 'classified'
    (library science term) or placed into different categories of meaning,
    depending on context.

    I think that the original poster's statement ("Tagging was intended to
    be universal and standardized") either shows great naivete or hubris
    on the part of the unstated "intenders". Context is the key. Any
    one and his dog can come up with a standardized tagging scheme, but
    users of it will nonetheless adopt it (or not) based on the scheme's
    ability to classify information in a way that works for the adopter.
    What prospective adopters want, however, is not a straighjacket that
    forces them to classify web pages in a way that the adopter's users
    won't understand and won't use.

    ---a former AI researcher

    1. Re:Old problem, and you're not going to solve it. by UnknownSoldier · · Score: 1
      Exactly, Tags are multi-dimensional.

      What really sucks is that no filesystems properly support tags (aka meta-data), and we're left to software to "flatten" the database. (Think of your favorite mp3 player's library) YES, I _want_ my music placed into MULTIPLE genres.

      File systems links are the fore-runner to this, but the creators didn't realize the general principle.

      i.e.
      For every game in my system, I have a screenshots folder

        \game1\screenshots
        \game2\screenshots
        \game3\screenshots


      I would have to manually link each game screenshots folder into a master one:

        \game1\screenshots -> \screenshots\game1\
        \game2\screenshots -> \screenshots\game2\
        \game3\screenshots -> \screenshots\game3\
        \game4\screenshots -> \screenshots\game4\


      BUT what I really want is to be able to browse screenshots without hunting around tons of directories. Inverting the database as above was the the first step towards solving the problem; if the file was tagged, this would make it easier to browse the "complete library" by allowing us to flatten the database one level.

        \screenshots\[game1] screenshot01.tga
        \screenshots\[game1] screenshot02.tga
        \screenshots\[game2] screenshot01.tga
        \screenshots\[game3] screenshot01.tga


      What also sucks, is that I have to manually add tags _into_ the filename, since P2P don't support searching by tags.
      i.e.

      \audio\[Hemi-Sync,Meditation] foo.flac


      I hope people start realizing that we need a better and easier way to start categorizing all our data.

      Cheers

      --
      "Windows: Why it's file system was designed by idiots?".txt
  38. Hurry, guys.... by Chapter80 · · Score: 1

    Quick, somebody tag this article "Yes".

    1. Re:Hurry, guys.... by berbo · · Score: 1

      And then somebody else tag it 'No' as well.

    2. Re:Hurry, guys.... by MORB · · Score: 1

      I always wanted to add a maybe tag when it happens.

      But really, I love the humor in the slashdot tags: "The Need For A Tagging Standard", tagged with: tag, tagging, and tags.

      The "fuckoff" tag on "Is A Bad Attitude Damaging The IT Profession?" was pretty good, too.

    3. Re:Hurry, guys.... by Ford+Prefect · · Score: 1
      The "fuckoff" tag on "Is A Bad Attitude Damaging The IT Profession?" was pretty good, too.

      I think that was one of mine - I tagged the article as such, and posted a comment suggesting other people do so too. I guess the threshold for tags being displayed is pretty low, as the comment got modded down - but enough people must have seen it, and responded, for it to appear for everyone. It started at the end of the list, but others saw that, and presumably liked the joke and tagged the article accordingly, increasing its strength.

      It's quite easy to poison the tagging thing with jokes - itsacrap was another of mine, although admittedly it's difficult to claim responsibility for any particular tags in any easily confirmed way.

      In part, tagging seems to be an excuse for not writing a decent search facility (interestingly, the 'search' box at the top right of Slashdot appears to ignore tags entirely) - although perhaps you need Google's combined intellect to get all the synonyms and related words parsed and indexed appropriately. But conversely, it's very much a yes-or-no thing - it's easy enough to list all articles with a particular tag, but harder to define precise divisions in an index like Google's. So a 'developers' tag on Slashdot would only relate to software developers, while 'developers/developer/developing/develops/etc.' on Google could also cover photographic sites, embryology papers and everything in between.
      --
      Tedious Bloggy Stuff - hooray?
    4. Re:Hurry, guys.... by Anonymous Coward · · Score: 0

      MORB is a chickenshit unquestioning slashdot groupthink karma whore.

      CrankyOldFart

  39. Meta keywords by Anonymous Coward · · Score: 0



    I think this 'standard' already exists...

  40. Argh by eMbry00s · · Score: 3, Funny

    I think by "blogosphere", you really mean "internet".

  41. Oh yes, Brain. by neimon · · Score: 2, Insightful

    Let's create a committee to discuss the standard, and send out several RFCs, then split off into an angry sub-contingent that insists tags be open-source and then Sun decides to embrace tags, but screws it up, and Microsoft buys its way into tags and engineers a perfect way to pwn your machine through the tag "1337."

    Don't forget to make it structured, with methods and types and blah blah blah.

    It's just words, fer chrissakes. When you can tell me the difference between "its" and "it's" then you can talk about standards for words. Until then, PLEASE let's not have another standards war over something trivial that is supposed to save the world but will only serve to confuse everyone, all over again.

  42. I believe what the article really meant was... by ErGalvao · · Score: 1
    something like:
    • When your tag contains more than one word, separate them with an underscore, rather than a hyphen or a space
    for an example. Of course it would be foolish to standardize which tags are "valid" or not, since this would destroy the whole concept of tagging. For that kind of purpose we have directories, like Yahoo! or DMOZ.
    --
    Er Galvão Abbott - IT Consultant and Developer
  43. RSS is the answer by Paulrothrock · · Score: 1

    RSS readers are ubiquitous. Generating custom RSS feeds from content is trivial on the server side. Building an RSS reader to pull tag information from another site into tagging software is trivial. The only thing that people need to do is to build it into their tagging engines so that their customers can easily find related information. The last thing I want to do, however, is to have things "suggested" to me. Why should del.icio.us and Technorati automatically be integrated? Leave that to the user.

    --
    I'm in the hole of the broadband donut.
  44. Translation by Alec+Eiffel+Jr · · Score: 1

    One big problem is that you need to provide translations for all these tags in a large range of language...

  45. there is a standard by Yonder+Way · · Score: 4, Interesting

    There is a standard but nobody uses it these days. Even the search engines disavow it anymore.

    <META name="keywords" content="foo, bar, baz"/>
    1. Re:there is a standard by dave420 · · Score: 1

      Close, but that only defined keywords in an HTML document. How would that apply to an image, for example? Or an mp3? etc. Having one standard for one file type is not a standard when considered against all other file types.

  46. Better hurry... by supabeast! · · Score: 3, Insightful

    If someone gets started on a tagging standard right now, it might see a little use before the whole silly idea goes out of style next year.

  47. standard tagging API by jefp · · Score: 1

    I don't care so much about the actual tags themselves. Letting people tag however they like is the whole point, isn't it?

    But I would like to see a single standard API for adding tags and searching for tags. The exact same code should be able to connect to every tag-enabled site. A nice simple REST thing, an HTTP GET to send a query and an XML fragment as result.

    I figure there would be two types of query:

    1) Send an object identifier (URL, photo id, whatever) and get back a list of tags.

    2) Send a tag or list of tags, possibly wildcarded, and get back a list of object identifiers.

  48. Different strokes for different sites by shawnmchorse · · Score: 1

    I'm a big fan of tags. I just recently got finished with a redesign of a personal web site at RockyMusic.org, and it uses tags heavily now. All of the content including photos, MP3s, videos, song lyrics, external links, documents, albums, individual tracks, and so on uses the same set of tags (which I applied). I can then use those tags to tie everything on the site together. Examples: Richard O'Brien, Little Nell, Rocky Horror Show. Since I did all the tagging, it's consistent across all the content as well.

    That's a case I think where a tagging "standard" would not be at all useful, since making it non-standard is the primary reason it works well I think. Slashdot and other sites that use user-generated tags will always see a lot of humorous, insulting, and otherwise unuseful tags. I know I've seen photos of girls tagged "butter face", and Slashdot obviously has its own cults of "itsatrap", "slownewsday", and so on. Some that are specific to Slashdot ("slashvertisement") are quite useful though.

  49. Registry by berbo · · Score: 1
    One approach to this is a metadata registry: http://metadataregistry.org/

    That way one group can label their tags ( e.g. Microsoft:embrace_and_extend ) and others can submit their own tags (e.g. slashdot:itsatrap).

    This is labor intensive and is not a universal solution, but its a partial solution

  50. Standard What? by Baavgai · · Score: 1

    Two issues here.

    One is the "standard" of representation. Tags are not like a contact lists or meta data laden resource indexes. They're just words, an array of strings. If your favorite language can figure out how to go from, say, "tag1, tag2" to "array('tag1','tag2')" you have bigger problems than standards.

    The other issue is defining a universal, standard, taxonomy. From Dewey to RDF, we're no closer now than we ever were. You're asking people to all come to an agreement as to how they view the world. Good luck with that.

  51. Um wait... by gillbates · · Score: 1

    You mean XML doesn't solve this problem? Blasphemy!

    --
    The society for a thought-free internet welcomes you.
  52. Confused - where's the problem? by Anonymous Coward · · Score: 0

    Is the author talking about an interoperable API or application specific tagging implementations? For our app, we limit a single alphanumeric tag to be listed via the web front end because we store the tags char delimited in a db field. If the guy is talking about a web API then I'd suggest something with ATOM/Microformats and we'd oblige with a parallel dataset. user-classified content flies in the face of established hierarchical information architecture, if someones going to start applying rules they need to make sure they don't kill the goose!

    Tagging itself is something of a none issue for us, marking up a cloud is a major problem. What do you guys think of using h1 through h6 to mark up a tag cloud? Currently we use spans with CSS font-size decs but it's not possible to convey weighting information for the cloud in an accessible way. The options are to abuse the header element or offer an alternate ordered list.

  53. They're not that useful anyway by AdamHaun · · Score: 3, Insightful

    The only tags I like are my own. The real use of other people's tags is to show how they organize information, not to help me find something. The problems the article brings up are only the beginning -- the natural tendency of a global tagging system is for the number of tags applied to an object to increase without bound. If I'm doing a master's thesis on, say, web design, I might tag any number of sites "thesis". Is that useful to anyone else? Probably not. But it will interfere with someone who's searching for sites about writing theses.

    --
    Visit the
  54. Dublin Core ... a cautionary tale by ab762 · · Score: 1

    So while ago, there was an agreed standard for web metadata, the Dublin Core Metadata Initiative aka ISO Standard 15836-2003. Very few people use it.

  55. Or be like a normal person... by Cylix · · Score: 0, Redundant

    Tagging? Who cares? Gimme my pr0n....

    Seriously, I don't tag, I don't care and I don't want to care.

    I'm not going to change anytime soon and I bet there are a lot more of me then you.

    This is one of the few days where I can tell I look like a troll, but this is exactly how I feel about the subject. I guess I'm an angry person.

    --
    "You should always go to other people's funerals; otherwise, they won't come to yours." -- Yogi Berra
  56. Please engage brain first by Moraelin · · Score: 1
    "For a successful technology, reality must take precedence over public relations, for nature cannot be fooled"

    I guess your hatred of neologisms and 'buzzwords' doesn't extend to manager-speak.


    First of all, the quote is by Feynman, who was a physicist, not a manager.

    Second, I'm not sure where you see buzzwords in it, because I can't. It's a simple idea expressed in plain English, which is something a lot of PHBs and PR drones seem to have forgotten how to do.

    Third, it's a very sane and simple advice to everyone (including scientists, engineers, managers, marketting, etc): you can't make technology by PR announcement. You can run the PR machine as long as you want that water really runs uphill, it won't convince nature to actually behave that way. You can't just rewrite the laws of physics, and if you try, don't be surprised if nature still behaves the old fashioned way. So if you want to build something that works, put your faith in how the world actually works, not in how much PR crap you can churn stating the opposite.

    At any rate, it actually has a meaning.

    Most of us have a beef with buzzwords that really don't have any meaning, or no extra meaning over a simpler everyday word. Whole paragraphs or whole memos, mission statements, etc, get written that don't actually tell you anything. That is the problem.
    --
    A polar bear is a cartesian bear after a coordinate transform.
  57. njkewjdkewd is brilliant by Anonymous Coward · · Score: 0

    "tagging, no, tag, tags, njkewjdkewd (tagging beta)"
    Somebody mod the tags +funny. :) And interesting. And perhaps insightful.

  58. MOD PARENT UP by metamatic · · Score: 3, Insightful

    We've got a standard for keywords in HTML documents. There's no problem there.

    The only issue is what to do when there are multiple sub-documents on a single page, like if Slashdot allowed individual replies to be tagged.

    --
    GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
  59. My moment of realization by wraithgar · · Score: 2, Funny

    I think for me the moment I realized that the idea of tags needed a little bit of work was the day I saw them on Amazon.com. I was viewing a product there, and it had been tagged "Presents for Jim".

  60. hohum by plexium_nerd · · Score: 1

    Herding cats would be eaiser... What about a tagging system for the tags themselves? That would be fun!

    --
    ____ plex
  61. Tag Search by coldtone · · Score: 1

    The only thing missing from tagging is search. The API is standard already, (Flickr -> pictures, del.icio.us -> Resource Sites, News Sites: News)

    I would love to have a tag option in Google.

  62. classification schemes and metadata by khaledh · · Score: 1

    As many people noted in this discussion, it's really hard to get everyone to agree on a standard tagging scheme. There are, however, good schemes for classifying content that can be used to standardize classification. Those include: controlled vocabularies (flat lists), taxonomies (trees) (e.g. Dewey, LC), ontologies, and topic maps[*] (graph-like structures). As you see the tagging model can be as simple as a list, or as complex as a graph of concepts with their associated relationships. Regardless of which scheme you use, you should use it within a metadata standard (e.g. Dublin Core, IEEE LOM), by allowing the subject field of the metadata record to take values only from the used classification scheme.

    If a standard classification scheme is used, coupled with a metadata standard, I say we're much closer to having more consistent distributed content classification than with ad-hoc tagging.

    [*] Check this article on "Metadata? Thesauri? Taxonomies? Topic Maps! Making sense of it all" by Lars Marius Garshol from Ontopia.

  63. Dem tags by Anonymous Coward · · Score: 0

    Tags don't belong on the interweb, and neither does Al Gore.

  64. Opt-in specific tagging! by Tiemen · · Score: 2

    I think there definitely should be a standard of classifying information: we have huge masses of people classifying information they encounter on the web, it's the human equivalent of SETI. And therefore it would be a shame if we didn't allow people to take it one step further and give everyone the ability to do it "right" (for their particular version of right).

    I am not opposing the free-style tagging, as many people wouldn't bother with any formal definitions, but having a standard so you can optionally use ontologies (predefined, or create an ontology yourself!) as a reference for what exactly you mean would be great. The tag could still be displayed as it's string representation, but would store exactly what meaning the original tagger intended. And can therefore be searched for more easily.

    It'd be relatively easy to store a bit of RDF for every tag which specifies whether it's a wild tag or a "precise" reference to a phrase in an ontology. As one poster said especially the content creators might want to use this feature, and the consumers can still do however they want.

    Sounds like a win-win situation to me :)

    -Tiemen

  65. Utter nonsense by owlnation · · Score: 1

    A tagging standard? Never gonna happen. Probably not even possible.

    Some words have more than one meaning in English alone, and different meanings throughout the different versions of English, then you can add in foreign languages too.

    Take for example the words "strip" or "ass" as an example (um, not sure why those came into my head first...). Search for "ass" and you get things about donkeys, amongst a variety of more or less interesting things. My guess is that the few mental patients searching for donkeys are as upset as those of us looking for more interesting things.

    Tags work on trust. For the most part they work 30%-40% of the time as they are on a bad day. There is plenty of tag spamming going on, and that's only going to get worse if standards are "enforced". God knows how anyone could enforce this, gaming tags is a piece of cake.

    I'm not clear why the web 2.0 tagerazzi ever thought tags were going to work in the first place. It is essentially the system that search engines used before Google. It failed miserably in many cases, being the reason why Google was successful - they moved away from tags to search algorithms that worked much much much better.

    Does everyone really have such short memories? Or is it really 1996 and I'm living in my own solipsism?

  66. Already been proposed "XFML" and ignored. by mjlesko · · Score: 1

    See http://xfml.org/.

    Google Facet Classification for a more in depth study of the idea.

  67. Huey, Dewey & Louis by psbrogna · · Score: 1

    I've thought for along time that the Internet (search engines in particular) is screaming for the equiv. of a Dewey Decimal system. I think the blog categorization taxonomy is an extension of that need...

  68. Technorati has a standard... by BovineSpirit · · Score: 3, Interesting

    The rel-tag microformat is an attempt to standardise tagging. It relies on other microformats to define what it is you are tagging. There isn't a 'photo' microformat at the moment, so you can't do a web-wide search for photos tagged 'fireworks' for example. If you're interested in the semantic web it's worth checking out microformats. You can download a plugin for firefox that reads microformats. Go and have a look at Flickr with it, or any other site that implements microformats. If people have tagged something with a 'geo' tag giving long. and lat. then it will bring up a Google Map showing the location. If they've included a 'hCard' around their contact details you can add it to your address book.

  69. automated categorization more consistent by WamBamBoozle · · Score: 1

    If you have a set standard tags, you still need people to apply them consistently. In practice, this can be done much more consistently by using machine learning techniques. You have a set of documents with agreed-upon tagging as your training set.

  70. tagging "standards" for the semantic web by Anonymous Coward · · Score: 2, Informative
    hey guys,

    for those interested in a "tagging standard" for the semantic web (i.e. an ontology describing the concept of tagging) check out:


    -ukio
  71. Why Ontologies (and the Semantic Web) Fail by Anonymous Coward · · Score: 0

    No two people have the same meaning for a given word. It's no surprise that experts disagree on word meanings when defining "ontologies" (dictionaries, really) for the WWW. Herein lies a problem that AI continues to wrestle with.

    Meanwhile the village fool, W3C's Tim Berners-Lee, continues to insist that nothing's wrong and that his Semantic Web effort will save the world and the children.

  72. We already have a book for tagging, its called... by monkeyboythom · · Score: 2, Interesting

    A dictionary.

    There are people who live and die by tagging their information. They build folders and create lists.

    There are people who just go through life serendipitously. They never use the laundry hamper and most people call them slobs.

    Between these two groups are the rest of humanity. Sometimes they make lists and sometimes they don't. And just because the word, "librarian," strikes a fear of boredom, most people ignore library sciences. The science of tagging, if to be used as a global panacea, must be approached or studied to be feasible and usable over generations.

  73. What's the big deal about 'Tags?' by Anonymous Coward · · Score: 0

    Tags and Keywords are the same thing, just different names.

    1. Re:What's the big deal about 'Tags?' by Anonymous Coward · · Score: 0

      We need a standard for this too.

  74. File Systems by mrxoliver · · Score: 1

    Wouldn't it be nice to have tagging as an integral part of a filesystem, filetype unaware. Imagine a picasa-like desktop experience. Lush

  75. Mod the tag by imasu · · Score: 1

    You can't mod tags, but I wish I could mod the tag "njkewjdkewd" that someone added to this article as funny.

  76. Tags are boss! by Gray · · Score: 1

    Unless they figure out that the things you call 'thesis' are useful.
    I do this all the time on del.icio.us.

    - Find a page that is likely to be bookmarked by anyone doing academia on web design

    - Use the "others who have bookmarked this page" function to get a list of other users
    who have tagged the URL in question.

    - See anybody using the tag 'thesis'? You probably want to see what else they've tagged that way.

    If you're lucky, you just saved yourself a butt load of research.

    Tags allow people to usefully keep track of thousands of things with almost no cognitive cost.
    No other system lets you do that. This is why everyones favorites list is a mess.

  77. A standard by kirun · · Score: 1

    Right then, I appoint myself founder and overlord of the Internet Expositor of Tagging Formats, and I here propose a draft standard. Comments are requested.

    1) Don't call them "tags". Tagging is what chavs with three ASBOs do to the back of buildings. They're called "labels".
    2) Underscores and camel case are fine for programming, but not for humans. Therefore, we allow spaces.
    3) Multiple labels give us a list. What do we separate lists with? Commas, of course!
    4) In the future, we may invent some "magic" tags that work as key:value pairs. What words do what magic is out of the scope of this standard.
    5) Implementors MUST NOT use the word "blogosphere" or the phrase "Web 2.0" again ever.

    --
    I'm scared of numbers that can't be written as a fraction. It's an irrational fear.
  78. Tangerine and her Tee Tee Tags by Anonymous Coward · · Score: 0

    Juice, Blythe, Wally, Captain Mancini, Dodger, Garbage Pail kids,

  79. Look into topic maps by vrillusions · · Score: 2, Interesting

    I've started to run into this problem myself from using del.icio.us as my primary bookmark source. One of my current issues is not what tags other people are using, but what tags I am using. Currently I have a lot of overlapping tags. I did some cleanup lately so that 'photos' and 'photo' are in a single tag, etc.

    I started to look around and found there have been a lot of standardizations of topic maps. Although intended more for very large systems (think government sized systems categorizing millions of documents). The UK government has a topic system called the e-Government Metadata Standard (e-GMS). The schema is browsable online. Another good article is The TAO of Topic Maps (also in pdf)

    I think there should be a basic standard to avoid situations like the photo/photos tags above. But I think that should be as far as it goes. The good thing about tagging on most sites is you are not limited. The bad thing about tagging on most sites is you are not limited.

  80. Not so fast... by Gavin86 · · Score: 1
    Well, actually there is a slight problem: most search engines pretty much disregard the keywords meta-data in the header of an HTML file entirely. As stated in wikipedia:
    "Search engines began dropping support for META keywords in 1998, and by the early 2000s, most search engines had veered away from reliance on meta elements, and in July 2002 AltaVista, one of the last major search engines to still offer support, finally stopped including them."

    Furthermore, Tags have a more flexible approach--as you recognize is needed--than simple document-level meta-data; they can be applied to specific entities on a page instead of the entire page itself. Not only that, but tags vs. header meta-data seem to target two different groups: search engines and people. How many times do you View Page Source to get information about a specific object on the page or the page itself? Not to mention that only the maintainer of the document even likely has access to the meta-data, where on the other hand with tags any user can contribute at any time leaving the potential for a much more robust set of information.
    --
    "Progress comes from the intelligent use of experience."
    1. Re:Not so fast... by metamatic · · Score: 1

      Search engines that disregard keywords will disregard tags for exactly the same reasons.

      And I see no purpose in trying to standardize the UI for humans to view and enter tags. That's as unrealistic as hoping to standardize the UI of site navigators.

      --
      GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
    2. Re:Not so fast... by Gavin86 · · Score: 2, Insightful

      folksonomical While I agree with your sentiment, I believe you are incorrect. Search engines specifically disregard keywords from the Meta HTML element found nested in the Head element of a page. Alternatively and because there is no penultimate tagging format, search engines do not discern the difference between normal page content and folksonomical tags. Furthermore, the discussion is not about whether or not to standardize the interface, but rather just to create a common formatting of certain types of tags. I also find it hilarious that the parents of all my posts in this article are ranked high but the actual content of the parents are based on incorrect assumptions as to the purpose of the article. Oh wait, that's right, it's slashdot.

      --
      "Progress comes from the intelligent use of experience."
    3. Re:Not so fast... by metamatic · · Score: 1

      Search engines ignore the HEAD keywords because they were used by spammers. If search engines started paying attention to some standard for tags, spammers would fill their pages with tags, and search engines would go back to ignoring the tags again.

      So there is no point creating a new standard for tags. It'd be redundant with the HEAD keywords standard we already have. Same features, same problems.

      As for common formatting, I can just about see a case for a TAG or KEYWORD element in the next HTML spec, akin to KBD and CITE, so you could then specify formatting of tags in CSS.

      --
      GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
  81. Re: Parent is not a problem by benji+fr · · Score: 1

    No, you can (and certainly should) access each and every comment in its own URL. So you may put a synthesis of all comments tags plus the article tags on the complete article itself, and comment tags only on comments urls.

    --
    -- .rats live on no evil staR
  82. Searching the tag space by Tw00x · · Score: 1

    Some general GUIDELINES for tagging could be helpful for cross-user findability. I don't see a need for a standard at this point but clever search tools for the tag space are definitely missing. For example, to search for items with two words in a defined order users have to guess the tags of other users, e.g. hard drive: hard::drive, hard_drive, hard-drive). Using a small set of main tags and the concept of ordered tags has some advantages (more details: http://tdot.blog-city.com/advanced_tagging_with_or dered_tags.htm).

  83. the standard is the rel-tag microformat by spage · · Score: 2, Insightful

    META keywords provides keywords for a page.

    The better, more relevant standard for tagging is the rel-tag microformat, http://microformats.org/wiki/rel-tag

    You put a rel="tag" attribute on a hyperlink to the page on Del.icio.us or whatever that defines the tag. The microformats.org page succinctly explains the benefits of this approach.

    It even explains how to encode spaces and special characters. So there's NO issue with the envelope or format, except that Del.icio.us (or is it Technorati?) doesn't like spaces in tag names.

    As for the *content* of tags, yeah they're unavoidably a disorganized mess. Eggheads who know about ontology and RDF say they can't work. But they do, sort-of.

    --
    =S

    --
    =S
  84. I'm not saying we need a standard, god dammit :P by Gavin86 · · Score: 1

    That's fine and all, but I really have no idea why you are telling me any of this.
    I am not arguing that there should be a standard for tags!
    I haven't taken a stance one way or another, I was merely pointing out in my original post that the suggestion of, "screw tags, we have the meta HTML element that works with keywords", is an impractical idea that currently serves to solve a different problem.
    Furthermore, I believe you too are confused. The article isn't asking, "Should we create a new system for detailing the contents of an entity on a web-page?". It is asking, "Should we create a standard naming convention for the system that has already been created that details the contents of an entity on a web-page, so as to create interoperability between disparate websites that use tagging systems?" The idea is to keep the meaning of the tag the same, while letting different websites use the same information and collect data under the same headings.
    P.S. I feel like I'm taking crazy pills!

    --
    "Progress comes from the intelligent use of experience."
  85. Is this for real? by /dev/trash · · Score: 1

    A tagging standard???

  86. "Metacrap" by hugg · · Score: 1

    This article makes some succinct points about the fallacy of relying too-much on metadata.

    I find tags useful for personal collections, but it breaks down because my mental map is different from anyone else's. Heck, I'm often inconsistent from day to day (sometimes I use UPPERCASE_CONSTANTS, sometimes camelCaseConstants, etc)

  87. Automatic Ambiance by Anonymous Coward · · Score: 0

    Read the book Ambient Findability by Peter Morville. He addresses tagging and the semantic web.

  88. What's njkewjdkewd? by zobier · · Score: 1

    I guess that's the point.

    BTW: I find it amusing that CmdrTaco uses the slownewsday tag

    --
    Me lost me cookie at the disco.
  89. Re:I'm not saying we need a standard, god dammit : by metamatic · · Score: 1

    I still maintain that the keywords already implemented as part of HTML metadata are no more impractical than a reimplementation of the same thing but named "tags" would be.

    If the original suggestion is that there ought to be a standard ontology for tags, then (a) it would have been nice if it could have been more clearly expressed, and (b) fat chance, even DCMI hasn't attempted that.

    --
    GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
  90. Hyphens vs. Underscore? by pbhj · · Score: 1

    If you use hyphens as a replacement for spaces how do you show a hyphen ... what does mother-in-law-school mean (my mothers gone to law school || a school to teach mother-in-laws how to keep shtum and do the dishes :o)> )

    I prefer underscores — then hyphens can be kept for their proper purpose.