newdocms: Beyond the Hierarchical File System

What's wrong with hierachical systems anyway? by jlanng · 2003-01-03 01:54 · Score: 1, Insightful

They work fine for me

Re:What's wrong with hierachical systems anyway? by Anonymous Coward · 2003-01-03 02:09 · Score: 1, Insightful

Yeah, non-descriptive directory names are poo.

But make those directory names descriptive, and all of a sudden you're not so much of an idiot.
Re:What's wrong with hierachical systems anyway? by archeopterix · 2003-01-03 02:33 · Score: 5, Insightful

Yeah, non-descriptive directory names are poo. But make those directory names descriptive, and all of a sudden you're not so much of an idiot.
There are bigger problems than non-descriptive names:

1. Paths tend to get long.
2. You have to be careful of your "current path". Some apps have weird defaults and if you're not careful, you end up with your file in a strange location.
3. Some items do not fit into the hierarchical structure. Should my porn directory be organized into movies, stills and texts or perhaps perverted, spicy and nice? Whichever atrribute I choose I will have trouble searching on the other.

Of course I can always use locate or find, but these tools only look at preset attributes (filename, last access date, substrings) and the solution from the article lets you specify your own attributes.
Re:What's wrong with hierachical systems anyway? by carlos_benj · 2003-01-03 02:45 · Score: 3, Insightful

If you try to forget everything that you know about computers, and then abstractly think about what a filesystem should be you come to one of the following two conclusions:

Let's see. If I want to retrieve a document that's been filed I go to the bank of file cabinets, select the cabinet that has the drawer I want, open the drawer, scan folders, pull envelope from correct one, extract document.

Cabinet/drawer/folder/envelope/document

Maybe it's because there really is an analog in meatspace for the heirarchical file system.

--
--

As a matter of fact, I am a lawyer. But I play an actor on TV.
Re:What's wrong with hierachical systems anyway? by Directrix1 · 2003-01-03 03:11 · Score: 2, Insightful

What's wrong with HFS? 1) Not optimized for catagorical querying of stored objects which can render views which are far more useful than a standard HFS. This is its primary use. 2) Distance between any two leaves is arbitrarily large and can be quite large (requires you to make sacrifices as to the categorization of your files vs. relative distance between common files or create a myriad of symlinks which is basically what a catagorization system does inherently) 3) Whats hard about setting categories: attribute: movie attribute: porn attribute: Jenna Jameson whooo, hard. And these file attributes eventually can very easily be represented across the web in saved downloads. 4) Is this not open source? patents? give me a break.

--
Occam's razor is the blind faith in the natural selection of least resistance and in universal oversimplification. -- EF
Re:What's wrong with hierachical systems anyway? by b_pretender · 2003-01-03 03:37 · Score: 3, Insightful

Somebody else commented: "When you are working on some (paper and pencil) project, and just stand up and walk away, do you exepect it to be available at the office tomorrow?".
Well, yes. On a computer, I would expect it to available tomorrow *exactly* the way i left it. The only reason that I don't expect this in the real world is because it's not a feasible possibility. If it were, then I would expect it to be as I left it in the real world, too.
You commented: "My biggest concern with this new system is that if you fail to generate good keywords (I suspect this will be a big problem) it is going to be hard to browse through a likely directory to find the file."
Although I will admit that current searching technologies are not very good at determining what I actually want (e.g. misspellings, synonyms), I will say that I don't think that choosing keywords would be a problem. I believe that choosing which directory to place the file in is a more complicated problem, because you can only pick one place (without worrying about shortcuts or links). Many of the searchable keywords would be generated from the document itself: last-edited-today, various project keywords, application-based (e.g. excel-spreadsheet, letter), keywords based upon the content. Ultimately, I believe this system would be *more* tolerant of poor organization, rather than less as you state. I believe that people would adapt to it and learn to use good keywords easier than they did for hierachical filesystems. I will admit, however, that it is a flaw *whenever* people have to adapt to something, and most have alread adapted to the idea of a hierachical filesystem.
You also mention that the PalmOS filesystem implements such a filesystem poorly, but please don't crush the idea based upon one implementation. I see NO reason that application developers would have to worry about implementing a keyword filesystem any more than they would hierachical filesystem. It sounds like Palm's version isn't mature enough to be useful.
The industry is working to remove the hierachical filesystem. It's only a matter of time. Look at WindowsXP Tablet edition's note-taking program. You basically have one *file* for all of your notes... ever. You can subdivide and categorize these notes, but it's all one file.
Re:What's wrong with hierachical systems anyway? by Oculus+Habent · 2003-01-03 03:40 · Score: 3, Insightful

While HFS has a number of drawbacks, it seems better to change or replace the FS instead of abstract its failures.

We still have to deal with file systems on some level. What happens to your abstracted layer when you want to copy something to a disk or burn a CD? You can't perform a file copy without breaking the abstraction, so the abstraction is broken before you begin to use it.

When you insert a Drivers CD in Windows, it may auto-run, sheilding you from the (often arcane) filing of the drivers. But unless there is an agreed format for the meta-data, your computer may not understand what is on the disc.

The system he proposes also breaks down on anything that is not new and made by the user. Document storage. Do we then only abstract the Documents folder?

While document management is a good idea, it needs to be subtle. It may take a user some time to learn the system, but that is better than crippling it to ensure first-time user ease. Macs used to come with several Tutorials on how to use the mouse and interact with the OS. We will probably need tutorials of that type again, soon.

Document management needs to spend very little time taking the user away from work. It must be integrated with the file system to work adequately, or the "switching" people will have to do to move from managed to unmanaged filing will aggravate and confuse them.

--
That what was all this school was for... to teach us how to solve our own problems. -- janeowit
Re:What's wrong with hierachical systems anyway? by Kymermosst · 2003-01-03 07:21 · Score: 3, Insightful

3. Some items do not fit into the hierarchical structure. Should my porn directory be organized into movies, stills and texts or perhaps perverted, spicy and nice? Whichever atrribute I choose I will have trouble searching on the other.

Well, in a good file system, you can make a set of directories like this: (since we're using porn as an example) /porn/movies (contains movies)
/porn/stills (contains stills)
/porn/text (contains text)
/porn/perverted (contains symbolic links to files in the above)
/porn/spicy (ditto)
/porn/nice (ditto)

Some platforms are much better suited to doing this (unix), while others (Win) are not.

Now, having the ability to automatically generate the symbolic links would be nice.

--
"Alcohol, Tobacco, Firearms, and Explosives" should be a convenience store, not a government agency.
Re:What's wrong with hierachical systems anyway? by Anonymous Coward · 2003-01-03 07:47 · Score: 1, Insightful

The problems with hierarchical file systems are that (1) every time you file something, you have to think about where it belongs, and (2) every time you retrieve something, you have to remember where you put it. The former is usually only a minor hassle, but it comes up extremely often; the latter can be a big pain.

To people who say "oh, if you have a good hierarchy set up then you know where everything goes and where you put it", I say get real. At least in my experience, there are lots of things that don't readily fit into a hierarchical system.

Think about how we find information on the web. Which is more convenient, following a Yahoo-like hierarchy, or just typing a few keywords into Google and picking the choice that you like?

I'm not sure the proposed solution is a good one, but there is definitely lots of room for progress!

Interesting... by Akardam · 2003-01-03 01:58 · Score: 5, Insightful

It sounds basically like when you want to find a file, you go type in a few pieces of meta-data, and then hit "search". It's a way to do it, but it seems to me (and it's early, so bear with me) that it's easier for me to remember one piece of meta-data (i.e. the path to the file) than several (as it would seem with this setup, as you would have to present more than one piece of data to differentiate between different documents, let's say, created by the same author on the same day). Maybe I'm just used to a HFS, but I find it simple to open up a command prompt and type "pico /documents/foo/bar/fubar.txt".

Anyway, an interesting concept.

Re:Interesting... by Anonymous Coward · 2003-01-03 02:55 · Score: 1, Insightful

There are several things you are missing:

1) a file path is not a single piece of metadata, but a collation of several pieces of metadata. Of course, you don't always have to remember all the elements and everyone learns quickly to keep similar hierarchies under similar heads, but nonetheless, a path is NOT one piece of metadata, (unless it's / of course)

2) hierarchical file systems force you to use either-or classifications of all your files (unless you have a large number of very similar hierarchies, which becomes difficult to remember and ends up being too sparse)

3) The brain will remember a given chunk of data by different classifications under different circumstances. The hierarchical system gives you only one classification (you can use symlinks to ameliorate this limitation to an extent)

4) Remembering paths of frequently used or archetypical files is easy, but can be very difficult for remembering files that were created, say, 6 months ago and not used since, especially when you have several disparate fields of endeavour that you switch between over time.

5) Although none of these factors make much difference when you are creating hundreds of files, they begin to make a difference when you are creating/dealing with thousands. And now, many people deal with tens of thousands of files.
Re:Interesting... by Elwood+P+Dowd · 2003-01-03 03:00 · Score: 5, Insightful

Except that those users that can't remember where their shortcuts are aren't going to set up good metadata in the first place. So knowing that it's about loans isn't going to help anyway.

When it comes to that, users just need full text indexing of their documents so they can do full text searches more quickly. Iduno about windows, but we've definitely got that in mac os.

--

There are no trails. There are no trees out here.
Re:Interesting... by Theatetus · 2003-01-03 03:36 · Score: 5, Insightful

When it comes to that, users just need full text indexing of their documents so they can do full text searches more quickly. Iduno about windows, but we've definitely got that in mac os.

Great for writers, not so good for graphic artists. I sysadmined for a few years in a graphics/video shop that had tens of thousands of images on the various fileservers. I essentially wrote a very simple version of this "DB on top of FS" idea because I was tired of helping people find their TIFFs.

Yes, /home/projects/DOJ/annual_report/masters is just one piece of metadata, and some people find that easier to remember than several keywords. OTOH, suppose two years later you want to reuse that image of the hispanic male using a computer. Was that in /home/projects/DOJ/annual_report/masters or /home/projects/USDA/website/images ?

My solution (and, it would seem, the article's, though I'm sure that one is a lot more robust), was to keep the users away from the FS completely. Just let them bring up all the images tagged with "hispanic male computer." Most graphics shops I've seen either built a DB file manager or bought one.

Honestly, I think the idea of computers holding a lot of "files" organized into "directories" is a little old. It was great in 1970 but maybe (like this guy is doing) we should rethink it a little. Why not say a computer has certain knowledge ("files") and certain capabilities ("executables")? Rather than naming files, describe the data you want the computer to retain, and retreive it later from that description.

As somebody pointed out, Office2K/XP and W2K/XP have something like this already, but people don't use it because they still have to name files. That's the crucial step, I think, and that's why I took that power out of my users' hands. They never named files; the app did it for them. Instead, they described files and versions. Abstraction and all that...

Anyways, this idea may not help everybody, but it sounds like my old users would have liked it (they, btw, were very good about using specific and accurate keywords... no QWERTY effect here; they just didn't think in terms of files and directories). Plus, it's nice to see somebody trying to move past the "files and directories" mindset we've had for the past 3 decades.

--
All's true that is mistrusted

Well now, hold your horses by Anonymous Coward · 2003-01-03 01:59 · Score: 0, Insightful

This is a testament to the power of free software: this sort of innovation could never happen if it weren't for the free software nature of the underlying systems."

How is an "ugly" beta version of an untested new file management system a testament to the power of free software? And why is this better than a hierarchical system? Hierarchies make sense to Joe user. Normalized databases (you did normalize it, right?) do not. And why on earth would I want to set all kinds of BS attributes on a file instead of just clciking File, Save As, and then hitting the little "My Documents" button in the window that pops up?

Re:I already use a different one: by NineNine · 2003-01-03 02:02 · Score: 4, Insightful

This is a testament to the power of free software: this sort of innovation could never happen if it weren't for the free software nature of the underlying systems.

This is completely untrue. There are lots of other options (like The Brain) that have been out for a while that have nothing to do with "free software". Hell, the fact that other proprietary systems (that are better, in my opinion) came out earlier shows that not only is "free software" irrelevant in this discussion, but it actually lags behind software driven by the profit model.

looks like very high quality work, but... by bartman · 2003-01-03 02:04 · Score: 4, Insightful

While I do think the work presented is a great idea, it seems to me that it's a lot of effort just to setup the system.

It would be ideal if the computer -- the thing that is supposed to make life easier -- did the clasification. Until that happens I cannot see myself even considering such a file access method.

--
-- bartman

Look at the save dialog by codepunk · 2003-01-03 02:04 · Score: 3, Insightful

My father is really going to understand that. Not a bad idea but the implementation appears to need work. Another interesting thing to note is that this is probably coded in C++ and is going to be a bitch once again to interface with scripting libraries. I love KDE but it is a difficult task to integrate other languages with.

--

Got Code?

Re:Interesting...But Why? by Havokmon · 2003-01-03 02:05 · Score: 3, Insightful

It sounds basically like when you want to find a file, you go type in a few pieces of meta-data, and then hit "search".......Maybe I'm just used to a HFS, but I find it simple to open up a command prompt and type "pico /documents/foo/bar/fubar.txt".

Exactly. Users STILL have to create their own type of organization.
/documents contains documents. Duh.
/documents/work contains documents for work.

The problem is people don't want to be organized, so they look to technology to help them be lazy. Plus try explaining 'metadata' to someone. At least now you can use the file cabinet, drawers, folders, papers example to explain the layout to someone.

--
"I can't give you a brain, so I'll give you a diploma" - The Great Oz (blatently stolen sig)

Historical Q by MacAndrew · 2003-01-03 02:09 · Score: 5, Insightful

Who came up with the idea of "folders" anyway? Not hierarchical trees, but the metaphor.

The biggest problem with folders is no one wants to be a file clerk and weed, sort, and file their docs. The act of socking away a doc should as mindless as possible, not because (all) users are mindless but because they have better things to do, and shouldn't spend a minute adding keywords to every doc they might never see again.

You know how it is -- you're searching and coming up with junk, and want to yell at the computer, do what I meant, not what I said! This would be one of my first pics for AI on a personal computer.

I agree folders doesn't cut it, though as a metaphor for explaining the tree it's not bad. The problem is the tree.

Idea by GNOME by Anonymous Coward · 2003-01-03 02:10 · Score: 0, Insightful

This idea was made by GNOME and now being inherited and implemented by KDE. Read here. And again please don't make Linux start to suck with that idea.

Not quite the same thing by TheConfusedOne · 2003-01-03 02:10 · Score: 2, Insightful

The Brain is an interface on top of your current FS. Things like this have been done going back to the days of the Leading Edge Word Processor (separate file to get around the 8.3 naming conventions).

I believe the point that this mad scientist was making was that he's completely replaced the FS with this new database-based one.

It's certainly not innovative, but it's something different I guess.

--
--- I wish I could hear the soundtrack to my life. That way I'd know when to duck.

This system would demand a lot of discipline... by MyNameIsFred · 2003-01-03 02:12 · Score: 4, Insightful

...you define any number of document attributes when saving a document and then query a database of those attributes when trying to retrieve it later on...

The problem I see with this system is that it requires you to be disciplined when you save a document. I could see something like this working for things like MP3s where there is an internet database that could be used to select the appropriate attributes. However, in the work environment where you're cataloging Word files and Excel spreadsheets, I don't see it as useful. From my experience, when I'm searching for an old file, its never for the reason I would have guessed, so I wouldn't have picked the right attributes when I saved it. In fact, I find it best to use features such as the MacOS X find dialog (or grep on the command line) that allows me to search by content.

Re:This system would demand a lot of discipline... by Just+Some+Guy · 2003-01-03 02:42 · Score: 5, Insightful

Furthermore, it's hard enough to get people to give their documents reasonable names. Convincing them to tag their files with accurate meta-data seems like an exercise in futility. I can hear the conversations:
IT staffer: "That's the 3rd quarter financial report? You should click 'Financial', 'Quarterly', 'Company-wide', and 'Public'."
Secretary: "I already named it T42f.doc. Get it? 'T' for third. '4' for quarter. '2' for 2002. 'f' for financial - 'F' is for filing'."
IT staffer: "But noone but you can find it!"
Secretary, with a wink: "Hmmm... I never thought about that."
I'm really not joking. If you can't get people to use filenames like "Prelimary quote to Foo, Inc. for widget sales 2002-12-23.doc", why are they going to bother picking those attributes from a menu?
How about this: Give the users a palette of choices (with the ability to add more as required), and generate the filename based on their choices. Don't even give them the option of whipping up their own personal hash table - make them let the program come up with reasonable names for everything. You could even set a threshold, such as "At least one attribute from each category must be checked", or "every file must have at least 4 attributes".

--
Dewey, what part of this looks like authorities should be involved?

Mom and Dad file system by dmorin · 2003-01-03 02:14 · Score: 2, Insightful

Who needs this? As one poster put it, isnt the path the only real piece of meta data you need to find a file? Think about mom and dad. What do they want to know? "Where are the christmas pictures of the baby from last year?" "What happened to that email I sent my brother last week?" "Where's the latest copy of my resume?" and so on. Natural language aside, these are all metadata-type queries (mostly dealing with time and filetype data, both of which can be extracted without any additional effort by the user). I think that if such a system of searching files is ever perfected, we'd have a serious killer app on our hands. Isn't this part of what the "semantic web" is all about? Isn't it frustrating to everybody that even the best search engine in the world still can't understand "find me all books whose author is mark twain"? It seems like a logical progression to expect that. Just like most of us aren't searching the web for *pages*, but rather particular *informatin* on those pages, I think that Joe User doesn't care about looking for *files*, but rather the information contained in those files. Thus it's only reasonable that if you give a user a way of easily describing those files by something more than just a filepath, that it will then be easier to find the information later.

--
www.HearMySoulSpeak.com

BeOS already did this... by Interfacer · 2003-01-03 02:16 · Score: 2, Insightful

or something very much like it a few years ago.
i used it and it works like a charm.

of course hierarchical file systems are easy to use, you can name folders after categories, and they are easy to backup.

Interfacer.

all your HFS are belong to us.

Thinking of new metaphors by ideonode · 2003-01-03 02:21 · Score: 2, Insightful

The whole desktop/file/filesystems may indeed be ripe for a new metaphor to help conceptualise them. When computers were the principal domain of workers, the idea of a desktop with files and folders allowed them to grasp alien concepts.

But computers are becoming ubiquitous, pervasive. Perhaps a new metaphor could be found. An example could be objects in rooms. Think of different folders as different rooms - all files (or rather, all streams) are objects in those rooms. Navigation between rooms is possible through doors.

Of course, as others have pointed out, the HFS ain't broken, so why fix it? (Answer: why not? PC cases aren't broken, but we still have case-modders, don't we?)

Re:SQL does not cut it by Zeinfeld · 2003-01-03 02:23 · Score: 4, Insightful

What we really need is a really relational, full DBMS (with sane defaults) as the fundamental storage component of an OS.

That was done pre-UNIX with PICK. The whole O/S was a database.

Microsoft has been working on an Object File System for years and it is rumored that it might finaly ship in Yukon.

A database baked file system is a great idea for an O/S. But the relational model is long overdue for the garbage pail. Modern programming languages since C have used pointers or object references. If JOIN and messing arround with tables is so good why don't we all use COBOL?

One of the things that appeared in VMS a while back that was pretty cool (and pretty easy to do on a log based file system) was transactions at the file level. You could take any set of file I/O operations and wrap a transaction arround them. This meant that you could have atomic updates to any file base resource without having to suffer the pain of SQL.

It would be pretty easy to implement this on a Linux log based file system (or windows for that matter). All you do is extend the log structure so you can group operations together and implement some sort of commit flag.

You could then build an object oriented filestore database using XML flat files. OK so maybe the system is not going to be up to storing millions of records without more infrsastructure. However most programming tasks use configuration files that are unlikely to be more than a few tens of Kb and are routinely managed as in memory structures anyway.

--
Looking for an Information Security student project suggestion?
Try http://dotcrimeManifesto.com/

Data volume by jocks · 2003-01-03 02:24 · Score: 2, Insightful

It seems to me that the majority of people who reply with "I use HFS just fine, file-> save as -> my documents works just fine" are also the type of people who don't actually create more than a few documents anyway.

I write a lot of documents and my filing system becomes ever more difficult to manage, without the skills that a librarian or filing secretary has I find that my documents become harder to locate over time. To me this is a potential solution to that problem, I do however appreciate that "Joe Bloggs" will not understand what it is about, but as far as I am concerned "Joe Bloggs" should not be using computers in the first place. Pandering to his ilk has set computing back 10 years.

The potential pitfall of this system could be where many documents have been written about the same subject i.e. testresult001.txt to testresult999.txt. The user would know with the traditional system that he wants testresult823.txt but with the new system would be presented with 1000 choices. I am possibly being myopic here!

Perhaps it is time for a new paradigm and I for one will be looking at this method with great interest.

Plz don't forget E-Mail and Web documents by egghat · 2003-01-03 02:31 · Score: 4, Insightful

I have used "The Brain" while I was in Windows, but it was nearly useless as it didn't support the two most important things:

a) Web browsing

it should now the sites you've visited, know your bookmarks and allow you to open everything you have found with a simple click.

b) E-Mail.

When it finds an E-Mail a simple double-click should be enough to open it in your mail, show you the thread it belongs to, etc.

I guess, that I'm not the only one, who has more important things in mails than in .docs or .xls.

Bye egghat.

--
-- "As a human being I claim the right to be widely inconsistent", John Peel

Re:Didn't BeOS have this years ago by Anonymous Coward · 2003-01-03 02:35 · Score: 3, Insightful

yes BeOS had it, and I think ReiserFS is planning on similar functionality.

But this is the first time I've seen it implemented in userland.

Re: submitter's cockiness about innovation, I think it's simple a pumped up way of saying "if I hadn't have had the source, I couldn't have done this hack". No shit.

Maybe it's just me, but I think it would have been truly more clever if it had been implemented using a stacked filesystem, or even a hacked open(2).

This should be implemented at the FS level by Anonymous Coward · 2003-01-03 02:38 · Score: 4, Insightful

So where do your documents go when you save them with newdocms? As you might have noticed (if you looked at the window titles after saving something), they are stored as ~/Docs/{numeric id}.{ext}.7 All the metadata is stored in a file called ~/newdocms.db. (It is not wise to delete it!) In that file each document's attributes are associated with its unique numeric id (the one which is used as a file name).

Right.

This is astoundingly bad software engineering.

Manuel, when your software fails, and it will, and somehow that db file gets trashed you've rendered that users' files as a huge heap of unsorted data. Effectively it would be 100 times worse than never implementing your system than 10 times better. No matter how bulletproof you think your code is, it probably isn't 100% perfect so having all your eggs in one basket is unwise to say the least.

Even if your code is 100% perfect this is a mistake. What happens when a sector goes bad and this file is trashed? What happens when the first really dangerous linux worm makes it a point to delete *.db from the filesystem?

Give the files names that are coded with human readable attribs! Double up that db file! Jesus, man... build SOME kind of redundancy in your system before you throw away the old way of storing the data.

There's a reason why there is such a scramble to implement a general attribute system at the FS level on many FS projects right now(*). The time has come for OSS to start being smart about this, but cramming all your metadata into a single file and throwing the backup out the window is just a very, very poor idea.

(*) BeOS was, yet again, way ahead of it's time with BeFS.

Re:Folders by Progoth · 2003-01-03 02:43 · Score: 2, Insightful

Consider this: you save your spreadsheet today as "Yearly Report 2002", and two days later you want to call it back your mind just doesn't say "Yearly Report 2002", but more like "Financial Data last year".

um, he took care of this.

try reading the article next time.

(am I feeding a troll if they're marked +2 Interesting?)

agree by ragnar · 2003-01-03 02:46 · Score: 4, Insightful

I believe metadata is a useful additional means to find files, however I would still want heirarchy as the primary storage. For most people the only metadata they ever consider is the name of a file, and this is often poorly named. I applaud the effort of the person who is doing this project though.

--
-- Solaris Central - http://w

If I can't text process it, then I don't want it by Danathar · 2003-01-03 02:47 · Score: 2, Insightful

What is the deal with people wanting EVERYTHING in a SQL/LDAP style databse! Every intern I have to manage out of college seems to have been brainwashed to think that whatever the app, it's data should be in a relational database.

I like datbases, but for somethings they should not be used!

When it comes to the OS, I want to be able to text process data EASILY...with BASH! This road leads to things like binary configuration files and that leads to things like the Microsoft registry which I detest.

Databasizing everything (including the filesystem) IS NOT THE ANSWER

Re:sorry by Anonymous Coward · 2003-01-03 03:16 · Score: 1, Insightful

On slashdot saying the emporer has no clothes is considered "trolling".

Old idea - new platform by khawaga · 2003-01-03 03:24 · Score: 2, Insightful

"it is a layer between the hierarchical file system (HFS) and the user, which provides a radically new way to store and retrieve documents"

The only things that are radically new is that a) it is open source b) it is aimed at individuals.

Commercial EDMS (Electronic Document Management Systems) vendors have been doing this for years - companies like Documentum and Filenet. Like newdocms, they combine the filesystem and a database of attributes for file storage and retrieveal. Documentum itself has gotten very sophisticated; it can read the text of the document and, using an XML taxononmy of your choice, auto-file the document in the appropriate place. Likewise searches can be performed on both user-supplied attributes, computer-generated taxonomic values, or the text of the document itself. Companies like pharmaceutical manufacturers, with millions of complex documents, simply couldn't function without it. And it works with multimedia files as well, actually reading the close-captioning on movie clips to perform it's auto-tagging and auto-filing operations.

That said, it is a great accomplishment, and a welcome addition to this KDE user. While I have worked with EDMS systems off and on for the past 5 years, I could never actually afford one myself, and none of the current EDMS vendors that I'm aware of support Linux.

As work begins on version 1.1, I suggest taking a look at the features available with the commercial EDMS products for ideas - especially for things they haven't thought of!

Good work!

The HFS *is* a database by Alomex · 2003-01-03 03:44 · Score: 3, Insightful

Here's something that is not stressed enough in school: the HFS is a database, with the fully qualified path name as unique ID and basic operations of update, delete, record lock, and retrieve supported across most operating systems.

Other query operations are supported such as wildcard characters and, in large OSes other than Unix, a variety of other attribute queries (a la "/usr/bin/find" but accessible from "ls").

Now the file table itself is a database, which can be readily implemented using a relational database. Microsoft NT an other OSes have had such support for quite a while now.

I'm glad to see the full relational database FS model starting to hit the mainstream. By this time researchers are looking into XML based File Systems (store metadata in XML-like syntax, support any XML query on the files).

Which brings us back to an often overlooked fact. Linux has, in general, not been at the leading edge of OS research (with the possible exception of the beowulf architecture). This is alright as for many years the goal of Linux was to reimplement Unix on the intel x386 architecture. However we must keep in mind that the really advanced OS features out there have yet to make it into Linux, things such as new environment metaphors, persistent data support, and intelligent user interactions.

Re:Remind anyone of something? by Keith+Russell · 2003-01-03 04:47 · Score: 3, Insightful

Standard Slashdot Clue-Slap #4: The Fallacy of Mass Hypocracy

If you walked into PNC Park during a game, and saw a group of 10 people wearing Braves jerseys, would you call the remaining 38,000+ Pirate fans* in the crowd hypocrites? What about a vegetarian eating a salad at a steakhouse?

What you're observing is not hypocracy on the posters' part. They're willing to join the debate, and they deserve credit for that. (You imply that much with your preemptive taunt to anyone who would mod you down.) It's just human nature getting the best of the moderation system. It's too easy to silently and anonymously squelch a valid dissenting opinion. And while meta-moderation can cull out the egos and zealots, it operates too slowly to keep up with short tempers.

*: Jokes about the Pirates selling out a home game > /dev/null :-)

--
This sig intentionally left blank.

Re:Intuitive by rreyelts · 2003-01-03 06:15 · Score: 2, Insightful

I think what you are missing is that human beings find categorization intuituve - not the hierarchy especially. A hierarchy is simply just one form of categorization. This becomes immediately obvious once you try to categorize things by multiple attributes. For example, I'd like to categorize my mp3s by author, album, genre, and year. That is a perfectly natural and intuitive desire, but it can't be achieved with a simple hierarchical file system.

Re:Folders by Progoth · 2003-01-03 06:42 · Score: 2, Insightful

yes, this system does take care of this. you're limited in your thinking, you're still looking at this from a HFS point of view.

you save your spreadsheet today as "Yearly Report 2002"

the thing is, you don't save it as "Yearly Report 2002". that's a file name, which is not what this system does. in newdocms, you give as much information about the document as you want, in any number of categories, and you don't have to remember arcane names, or the difference between "Yearly Report 2002" and "reports/yearly/2002". in your situation, after 2 days have passed, you come back and make a simple query for, say, Reports, and there your document is, right at the top of a list (since it's sorted by last access time).

I'm 21, started with DOS at age 8, so I can handle hierarchical file systems. that doesn't mean I have to like them. in my case, especially; I'm extremely messy (room, car, apartment, desk), and my hard drives are the same way. I have around 180 gigs spread between 7 drives, and I have no idea where anything is. I could find everything and categorize it in a hfs, but it sure would be easier to not worry about details like "where a file is located" or "what a file is named" and just worry about the types and contents of the files.

Re:Folders by Corporate+Troll · 2003-01-03 06:53 · Score: 2, Insightful

I do understand it is not a filename, that doesn't change the problem I describe. If he does not use the right keywords to make the query, he will not find the file.
Oh, and what if he wants to find the "Yearly Report 1976"? Guess that one won't be on top of the list. I guess this system is good for recently used files, but not for files that have been archived for years on your machine. (Yes, I do have files back from 1981, and yes, I know exactly where they are)

And that you are messy is your problem, not mine. Learn to organize, makes your life easier.

Slashdot Mirror

newdocms: Beyond the Hierarchical File System

41 of 650 comments (clear)