Slashdot Mirror


RDF For Desktop Metadata?

claes writes "There is an article "Metadata for the desktop" that suggests that RDF should be used to describe data in desktop environments. This is an interesting idea. RDF is already used by Creative Commons to attach license metadata to its works. Mozilla also supports it. RDF was designed for the web, but can it also find its way to the desktop? And what metadata is most important to describe?"

38 of 167 comments (clear)

  1. The killer app for metadata on the desktop by foidulus · · Score: 5, Funny

    is porn!
    Suppose today I want to see shaved asian hardcore action. Now provided that metadata searches are integrated into the OS(like they will be in Tiger), all I need to do is a quick metadata search on my hard drive and boom, there is what I am looking for.
    I mean provided there was a decent standard(a porn standards body would rule!) and good regex capabilities built into the OS, I would be willing to pay for porn. I know that there are comments built into the jpeg standard, but there are all sorts of porn file formats, it would be helpful to have a universal standard across them. It saves time, beats trying to search on google and going through a lot of crap just to get to something good. I am a man on the run, I have places to go, I can't be bogged down by my porn. Plus, think of the people that get to catagorize this stuff(well, the fun stuff anyway, not goatse), what an awesome job that would be!
    I should probably post AC, but I figure this post is bound to earn me at least one fan and/or freak.

    1. Re:The killer app for metadata on the desktop by PowerBook2k · · Score: 5, Funny
      Suppose today I want to see shaved asian hardcore action.


      Just check your email. If it's not there now, it will be soon enough.
  2. Definition:...? by bogaboga · · Score: 5, Interesting

    Why don't slashdoters define what meta-data is in the first place? Google's define: metadata lists not less than 20 definitions. Are we talking about "data about data"?

    1. Re:Definition:...? by ResidntGeek · · Score: 2, Informative

      Yes.

      --
      ResidntGeek
    2. Re:Definition:...? by doshell · · Score: 2, Insightful

      So, data describing metadata would be called metametadata?

      --
      Score: i, Imaginary
    3. Re:Definition:...? by Anonymous Coward · · Score: 3, Informative

      In short, Yes.

      Say you have a digital photo. It's from a vacation you took in 2002, to hawaii, and contains photos of you, your partner, one of your children, but not your other kids and no pets. All that info could be kept as metadata of those pictures, and more.

      The same can be done for finance info for the year 1999 for you, or 2001 for your partner, or music files bought from a certain place, by a certain artist and band.

      While each of the filetypes above can have their own metadata (exif for images, comments for excel spreadsheets and mp3 tags for music) not all of it is singularly accessible and searchable by the one mechanism by the OS.

      This is a good goal.

    4. Re:Definition:...? by Jugalator · · Score: 3, Insightful

      Yep, it's called like that.

      I don't see with the thread started wanted a definition by Slashdotters in the first place, since it's already pretty well described and AFAIK the word doesn't have several meanings.

      --
      Beware: In C++, your friends can see your privates!
    5. Re:Definition:...? by zephc · · Score: 2, Funny

      I never Metadata I didn't like.

      --
      "I would say that 99 per cent of what my father has written about his own life is false." - L. Ron Hubbard Jr.
  3. Implicit feedback for filesystem information by PureFiction · · Score: 3, Informative

    I am a big fan of implicit filesystem feedback. This can support all kinds of services from file sharing to most recently accessed search requests. Even fine tuning access controls in an RSBAC security policy.

    The big concern is keeping this data protected and private. You dont want to share all of your metadata with everyone, so security of these systems should be something to look at carefully.

  4. What happened to forked files? by Amiga+Lover · · Score: 4, Insightful

    Are there any filesystems left that use forked files? Resource, Data and Metadata forks? Any at all?

    While MacOS was at a disadvantage being one of the only ones to use it, wouldn't it have been an excellent advantage for ALL filesystems to be forked?

    (I don't know the answer to this - anyone who knows more about filesystems, give your thoughts)

    1. Re:What happened to forked files? by Jugalator · · Score: 4, Informative

      Forks? Would that be the NTFS streams?

      I think the new filesystem WinFS in Longhorn is basically just an evolution of NTFS streams to make them more accessible for the users. They've always been there, just not very accessible besides a limited set of text fields in the file properties dialog box in Windows. (i.e. they've always been able to hold custom data and have custom key names)

      --
      Beware: In C++, your friends can see your privates!
    2. Re:What happened to forked files? by k98sven · · Score: 2, Informative

      While MacOS was at a disadvantage being one of the only ones to use it, wouldn't it have been an excellent advantage for ALL filesystems to be forked?

      Well, one problem immediately springs to mind: The translation between different metadata formats. It's already a pain in the butt when using transferring files of not-so-popular types to the Mac.

      The second gripe I have with the Mac is that it's so friggin' hard to edit the metadata. AFAIK you can't even do it on OS 9 without software. Now assuming the user is too stupid to change this manually is good. But not providing the ability at all, even for people who know what they're doing is just stupid.

      (Windows first hides the extensions, then if you try to change them, it warns you first. That feels about right for me. - Not that extensions isn't a klugde.)

      Apart from that, I agree.. anything is better than file extensions.

    3. Re:What happened to forked files? by Anonymous Coward · · Score: 3, Insightful

      > wouldn't it have been an excellent advantage
      > for ALL filesystems to be forked?

      Yes, but the trouble of compatibility remains. But there is a simple solution for this: fork as dir bundles: Instead of a file with a metadata fork you simply put the metadata file and the datafile into a dir and give that folder the name of the datafile. The current users copy the dir around and use its contents. But modern OSes treat the dir as if it is the datafile when the user interacts with it.

      The metadata file says 'treat this dir as a file, when the user opens it please open the datafile called ... instead'

      This is what Mac OS X does.

      This has some cool advantages for the future of metadata because the metadata file can refere to multiple files inside the dir. Not just point out the datafile but also point out the Mac OS X icon (which is simply a tiff file) and even a custom kde icon. Yes you could have complete container documents like a webpage where individual objects can be individually for the knowledgeable user simply by opening the dir or access them as a whole.

      It gets even better when you look at Applications in Mac OS X. Seemingly a file you can doubleclick to execute but actually a dir you can access with file organized in subdirs. Language dirs with UI files and text files you can translate, executables for different platforms, the required libs. It could even contain the source code yet it looks like, and by default works like, a single file which you can copy to the harddisk to install and drag to the trash to uninstall. That's how simple computing should be.

  5. Integration by mrchaotica · · Score: 4, Interesting

    Why does the document complain about the lack of integration, then mention that Microsoft, Apple, the ReiserFS people, etc. are coming up with solutions, and then adds a completely new one? Shouldn't they just be supporting one Apple's or ReiserFS's efforts?

    --

    "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

  6. What is wrong with you people? by Anonymous Coward · · Score: 2, Insightful

    Sure. I have no objection to a more extensive use of metadata. In fact I crave it - must have it.

    But why oh why do people think that XML-based solutions is the way to go? An RDF solution would be bloat beyond belief. Ok, so it's not that bad for a few files, but when we get down to it - we don't have just a few files. We have plenty of them.

    So why not use something smaler? A simpler protocol?
    We can still have RDF-frontends for those that crave their daily XML-fix. Get real.

  7. This is largely irrelevant if you have experience by Real+Troll+Talk · · Score: 4, Insightful

    Since most of us are advanced computer users or even computer experts, I think we largely know how to search for content.

    For one thing, I always give my filenames relevant titles, not things like document06.doc.

    Also, I already know how to search through files for content using basic grep or advanced Windows searching.

    I mean, sure, meta data like ID3 tags for MP3s that I steal offline are important because my Nomad mp3 player indexes based on that info, but in general I'd say meta data is not quite as important as some may suspect.

    --

    If you liked my post,
  8. Calling Autopr0n! by mrchaotica · · Score: 2, Funny

    If ever there was an appropriate thread for him to post in, this is it! : D

    --

    "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

  9. FS support for metadata by doshell · · Score: 5, Interesting

    I've heard the NTFS file system is designed to allow the system to add any number of properties (besides the obvious filename, last access time and permissions) to any stored file. This is likely to be exploited by Longhorn, which is planned to be capable of appending metadata to newly created files (for example, if you download a file from the Internet, the system would likely append a Originated-From-URL property to it).

    What I wonder is, is there any filesystem in the FOSS world that supports something like this, or are there plans to make it supported before 20??, when Longhorn hits the stores? I see this as a critical feature that must be made available by non-Windows OSes.

    --
    Score: i, Imaginary
    1. Re:FS support for metadata by PhrostyMcByte · · Score: 2, Insightful

      I don't think you can attach metadata to files with NTFS. If you can, I havn't seen the API for it anywhere while coding.

      Longhorn is using WinFS, which afaik is just a metadata layer slapped on top of NTFS.

    2. Re:FS support for metadata by pizzarobot · · Score: 5, Informative

      Actually, you can. To add a metadata item called "hidden.txt" to a file called picture.jpeg, just type on the command line:

      notepad picture.jpeg:hidden.txt

      Notepad should say that it "created the file." You should notice that no new files have been created: just look for them with explorer. But you can later open this "file" and read and edit it.

      You can do this with any file with any metadata name.

    3. Re:FS support for metadata by zsau · · Score: 2, Informative

      I think XFS does; at least, some versions of ROX-Filer are capable of writing additional metadata about the filetype on XFS drives. My understanding was that ReiserFS v 3.x can, but I've never seen anything that uses it. Of course, Reiser4 will be able to, but I think it and Longhorn have joined Duke Nukem Forever in a race to the bottom...

      --
      Look out!
  10. let's keep the Meta data simple... by howman · · Score: 3, Insightful

    Who
    What
    Where
    When
    Why
    and possibly How...

    --
    flinging poop since 1969
  11. Spotlight by Kesh · · Score: 2, Interesting

    I'm mostly wondering if the new Spotlight feature of MacOS X 10.4 is going to be based on this, or a proprietary technology. I've been itching for cross-platform metadata file support for years now...

    1. Re:Spotlight by aristotle-dude · · Score: 2, Informative
      I don't see how considering that Spotlight is a search technology that leverages metadata already existing in files on OSX today and this article talks about tagging files with metadata.

      The search technology in Spotlight probably is inspired by live query from BeOS but first appeared at Apple in iTunes and later Preview for Panther.

      Many former Be Inc. employees work at Apple now and some had worked at Apple before joining Be.

      --
      Jesus was a compassionate social conservative who called individuals to sin no more.
  12. Can't wait.. by bigattichouse · · Score: 2, Funny

    for when I can just throw out the whole desktop in favor of a "cloud" of data... using google-like interfaces to find my stuff. I think it would be interesting to figure out how to tell a compiler where to find stuff...

    --
    meh
  13. discussions about winfs and rdf by scupper · · Score: 3, Interesting

    Danny Ayers has some interesting discussion on his blog about winfs and rdf. There's also discussion of Jon Udell's Questions about Longhorn.

  14. Haystack and Metadata efforts by Knight2K · · Score: 4, Interesting

    A group at MIT is using RDF for an integrated data management system. It's sorta like Outlook (or Kontact, if you prefer ;-) on steroids. It's called Haystack.

    I have to say, their ideas are intriguing, but after using it... I think the big shortcoming is that it's tough to come up with a generalized user interface for manipulating any data thrown at it. Haystack tries at this, and I think, fails at providing any kind of cues or context that tells you what your are dealing with. In Haystack, every task and piece of information you deal with looks very much like every other piece of data, because, as a design choice, Haystack every piece of data has the same rank as every other piece of data.

    Having different applications for different types of data usually make sense, if only to limit the amount of options presented to the user so they can make an intelligent decision about what action they want to perform. See this article on Slashdot about how users need limited since it makes decision-making too difficult psychologically.

    Inevitably, discussions around RDF and metadata always devolve into hand-wavy discussions on how the computer will be able to "magically" do smart things based on the metadata. But it really isn't magic and it isn't automatic at all. Equivalencies and mappings have to be created by humans along with the rules about what to do.

    RDF uses many concepts from AI research. Anybody who has read about this branch of computer science knows that the discipline has pretty much given up on creating AI in the 'sci-fi' sense as an impractical dream. That's what makes the Loebner prize so controversial. I don't expect that computers will be intelligent enough able to relieve users of too much of the burden in assigning metadata.

    RDF is a promising approach, but if you read the article, it makes a lot of assumptions about what needs to happen to make the benefits real. Among them are establishing standards for what metadata fields apply to different types of objects: photos, people, music, etc. That kind of standardization won't happen overnight, if at all.

    The computer also needs to know what to do when it encounters that kind of data. The article mentions MIME and browsers and, in effect, says the browser can make a rational decision even if it hasn't seen a particular MIME type before. That isn't really true.. you have to install a plugin that tells the browser what to do, or have a registry that someone has put together where the browser can install the right plugin at the right time.

    That said, KDE's unification of contact information and passwords does show some of the promise of metadata efforts. And Apple's Spotlight looks like a good solution as far as it goes. I guess I'm just trying to make the point that the magic of metadata needs to be taken with a fairly large hunk of salt.

    --
    ======
    In X-Windows the client serves YOU!
  15. Many community websites don't permit RDF by MichaelCrawford · · Score: 2, Interesting
    I have a couple of articles that have Creative Commons licenses, and I tried at first to include RDF in them.

    But when I tried to publish one article at Kuro5hin, the RDF code, which took the form of HTML comments, was displayed literally in the visible body of my article. That is, all the tags had been turned into entities so the tags appeared literally in the rendered text.

    I think Kuro5hin's Scoop content management system doesn't permit HTML comments. Maybe it's not trying to suppress comments, but it didn't occur to scoop's developers to allow them.

    RDF on the web would likely be much more popular if one could count on publication sites allowing it in the submitted markup.

    Another problem I had is that Creative Commons' recommended way to apply a license to a web page is not permitted by any of the community sites I frequent. CC-licensed web pages usually have a small banner that links to the license text. But for obvious reasons, sites like Slashdot and Kuro5hin don't permit images in article or comment submissions.

    The result is that, even for the copies of my articles on my own website, I use neither RDF nor the CC banner, because I want to make it easy for others to copy my CC-licensed articles to site that don't permit RDF or graphics.

    The way I apply the license is the much-less-cool method recommended for plain text files. I have the following text appear in the body of my articles:

    This work is licensed under the Creative Commons Attribution-NoDerivs License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nd/1.0/ or send a letter to Creative Commons, 559 Nathan Abbott Way, Stanford, California 94305, USA.

    --
    Request your free CD of my piano music.
    1. Re:Many community websites don't permit RDF by mlinksva · · Score: 2, Informative

      You have another option -- put the RDF in a separate file and reference it with a link tag. See http://creativecommons.org/technology/metadata/ext end#link

  16. I have 300,000 files on my Windows box by MichaelCrawford · · Score: 2, Informative
    I know this because ad-aware tells me so when I have it scan all my disks.

    The vast majority are very small files. How much more space would be required to give each one some RDF? And remember disk space is allocate in terms of sectors, or sometimes in blocks of several sectors, so small files waste proportionately more space.

    And that's just on the Windows installation for my PC. I also have Slackware Linux and BeOS on other partitions. Quite likely there are very nearly a million files on my PC alone.

    --
    Request your free CD of my piano music.
  17. Isn't it the same problem? by pyrrhonist · · Score: 3, Insightful
    After reading this article, I'm wondering if metadata is really going to be as effective as the author thinks it is. The author points out that, "the computer makes us do the work of a filing clerk". In other words, when you place a files on your computer, you normally place them into a folders to organize them, which is, "not fun". The author implicitly claims that metadata will solve this situation.

    But that's the problem! If it's not fun to organize items into folders, how is it anymore fun to add metadata to a file? I'm not talking about text files. Text files are easy, because you can pull the metadata out of them automatically (in fact, you can do this now with search tools). I'm talking about files that have to be explicitly tagged with metadata, like pictures. How is adding metadata to each picture file to categorize your vacation pictures any less laborious than placing the vaction pictures into their own directory?

    That's the problem as I see it. You still end up being a filing clerk! If people don't even organize their folders now, are people going to use metadata when it's available? Will improved search capabilities make users want to be clerks?

    In a nutshell, isn't it the same problem?

    --
    Show me on the doll where his noodly appendage touched you.
    1. Re:Isn't it the same problem? by value_added · · Score: 2, Insightful

      When I was a kid and would ask aloud where something was, my mum would say, "Look where you put it." It annoyed me to no end, of course, but years later I find myself "putting things where they belong" and emptying my mind of everything else, much like putting phone numbers in a phone book so one doesn't have to clutter up one's my mind remembering any of them.

      My own opinion is that there is no substitute for "putting things in folders." Boring, but true. Regular expressions and databases can go a long way (even for the average Joe), but it's as brainless as it is fast to look in an appropriately named folder. Not everyone agrees, of course:

      Apple Unveils Faster Searching
      Apple Throws Spotlight on Search

  18. Watching the XML kiddies reinvent the wheel by Animats · · Score: 3, Informative
    It's fun watching the XML kiddies re-invent concepts from LISP. They just re-invented property lists, "is-a" links, and much of the baggage that made SGML painful.

    Knowledge representation via "is-a" links has been tried, and it breaks down rather quickly. Read "Artificial Intelligence meets Natural Stupidity", by Drew McDermott, for a 20 year old critique of this concept. It's overkill for searching, and not powerful enough for reliable automated question answering.

    The Cyc debacle illustrates how much work you have to put into tagging to get very little out. After twenty years of that money sink, it's still useless.

    1. Re:Watching the XML kiddies reinvent the wheel by benson+hedges · · Score: 2, Interesting

      if you had RTFA, or even read anything in the last 20 years, you would probably know that XML != RDF. there is a XML implementation of RDF, called (duh) rdfxml, but that's far from the only way to describe RDF data. I have to agree though that rdfxml is one of the worst ways to do RDF.

      have a look at N3 or ntriples for starters.

      --
      Karma : Soylent Green (Mostly due to eating junk food and mocking religion)
  19. RDF (and OWL) in Pike by janbjurstrom · · Score: 3, Interesting

    I noticed the article made no mention of Pike (also the name of a fish - see language logo). Pike's a fine C-like scripting language ...that I know extremely poorly myself, but anyway..

    From Pike's official homepage (at the University of Linkoping, Sweden):

    The release of Pike 7.6 marks the first results of a long-running project to make Pike the first scripting language for the Semantic Web. The current highlight in that respect is the support for W3C's standard formats RDF and OWL.

    Worth downloading and checking out for other reasons than "just" RDF & OWL. Free software, available under LGPL, GPL, and MPL (Mozilla Public License).

    --
    668.5
  20. NTFS streams by Otto · · Score: 2, Informative

    This "metadata" is actually called an "NTFS stream" and has been around since at least NT4.

    If you move the file around the NTFS drive, or from one NTFS drive to another, then yes, the metadata goes with it. If you move it to a FAT volume though, the metadata is lost forever. Not a huge deal as NTFS is getting more and more users nowadays.

    XP uses these metadata streams to some degree, actually. Some of the things in the properties page for a file are actually NTFS streams.

    Longhorn will make more extensive use of them, I'm certain.

    --
    - Give a man a fire and he's warm for a day, but set him on fire and he's warm for the rest of his life.
  21. RDF by cyberfunk2 · · Score: 2, Funny

    Did anyone else read RDF and think.. Reality Distortion Field ( Steve Jobs)

  22. Another format.. by 12357bd · · Score: 2, Insightful

    Good, yet another format to use/suffer!

    No matter how good those formats are (XML/RDF/etc) they all fail at the simplicity norm, the KISS principle.

    In the example of the article, by not using a simple text oriented format they innecesarily complicates the access by any program to these values, and that leads to the second point.
    The computational cost involved in parsing / validating all those formats; the day that our cpu's can process hundreds or thousands of simultaneous parsings without a noticeable impact on performance, that day it could start to make sense to popularize his usage, until then, they are a luxury and as such restricted to a limited (especialized) usage.

    On the RDF case, metadata is data, the 'meta' part is a human hability and can be used wherever we want, no need for a special format. By pretending to format the 'metadata' concept we are just defining a new stream format, and if we consider how wide the 'meta' concept is, it seems dificult to limit to a simple ontology. The result? the need of another international consortium to stablish a reasonable set of vocabularies, big deal!

    I think there are better ways to spend our cpu cicles than to parse verbose formats, but how knows?

    --
    What's in a sig?