Database File System

gnome people... by alphan · 2004-09-06 03:18 · Score: 3, Informative

...seems to have something more interesting: storage

Re:gnome people... by Jellybob · 2004-09-06 03:20 · Score: 3, Informative

Storage has been more or less dead as far as I can see for a while now, however Beagle is showing good progress on the same front, having been demoed at conferences recently.
Re:gnome people... by Anonymous Coward · 2004-09-06 03:47 · Score: 1, Informative

Well if you think 3 weeks is a long time...
One of storage creator Seth Nickell and Marco Pesenti Gritti of epiphany fame seems to still be working on it.
Re:gnome people... by noselasd · 2004-09-06 04:30 · Score: 3, Informative

Not really still activities in the CVS
The current release (0.2) is just some proof of concept, devels
are working on a nice'n'real solution.
Re:gnome people... by kidgenius · 2004-09-06 05:04 · Score: 0

The Beagle and Dashboard projects actually have been around since about Nov/Dec of 2003. That's a little more than 3 weeks.
Re:gnome people... by kwoff · 2004-09-06 05:27 · Score: 3, Informative

Beagle is written in C#? /me clicks the Back button
Re:gnome people... by Anonymous Coward · 2004-09-06 05:42 · Score: 3, Interesting

Here is my opposite hierarchical "document oriented" desktop: it does not have to be based on database file system, but intelligent caching and indexing

1) No "Start" application menu. You never "start" an application. You always open (read:look at) a document by clicking on it. You want to write a new one? you click on "template" document, it immediately asks you for new name (there never exists unnamed one, even on memory - because there is no difference between memory and disk). Do you want to read your email? you look inside mail/inbox dirextory. Everything has a directory/file structure! The same is with web - although you can see (list) only "directories" (sites) where you have been. Otherwise you have to either type URL location dialog or use bookmark. Bookmarks kook just like links in ~/bookmarks/whatever_link_page. The same page you can list in virtual http://whatever.com/ directory (of course only if if you allready bookmarked or visited it)

2) No open/save dialog. "invisible" application flushes updates to the ducument frequently, it appears that you directly instantly to the file. On the other hand you should be able to see a "historic" (you can say undoed) under some history.

3) No application "Close" (x, exit). You just remove (minimise) the document from your view. If you do not look at the document (work with, view url, ...) for some time, intelligent caching system invisibly closes application (of course the document is flushed, probably quite some time ago) and frees resource to the system. There is no difference between memory and disc allocated documents from user point of view.

4) Taskbar is not a list of running applications but a "History" list of last opened documents (URLs, ...) with more-less constant number if items

5) Window name functions also as "location" (URL) input, saves space

6) there is no "all time" visible menu, toolbar, location bar (see (5)) inside the window - those only take space. Right mouse button opens context menu, together with Alt it opens "document" menu (full-blown what is more-less now visible on the top of windows), with Ctrl it opens "shortcut" menu (ala toolbar) under cursor (so you do not need to move too far.

Oh, and did I say no start/close application?

Roman Kantor
Re:gnome people... by yerfatma · 2004-09-06 06:01 · Score: 1

I think the parent was referring to this:
Storage has been more or less dead as far as I can see for a while now,
Re:gnome people... by Anonymous Coward · 2004-09-06 06:12 · Score: 1, Funny

Yes but it wasn't invented by KDE so it's irrelvant.
Re:gnome people... by Ignominious+Cow+Herd · 2004-09-06 07:17 · Score: 1

Yes, yes, yes.

2) Add a history/revision list (DAV?).

--
Lump lingered last in line for brains, and the ones she got were sorta rotten and insane.
Re:gnome people... by Spy+Hunter · 2004-09-06 08:48 · Score: 1

There is nothing wrong with C#. It performs well, is a nice language to program in, and has an excellent standard library. Microsoft can't assert complete control over C# to somehow cripple open source, as some feared. Doing so would simply cause open-source implementations to break from Microsoft's standard and create incompatible versions of C#, fragmenting the C# language, which Microsoft doesn't want. Your bias against C# is unfounded.

--
main(c,r){for(r=32;r;) printf(++c>31?c=!r--,"\n":c<r?" ":~c&r?" `":" #");}
Re:gnome people... by Anonymous Coward · 2004-09-06 09:58 · Score: 0

Sounds interesting, doesn't it?

Write it, and then we can discuss how it works.

By the way, "You never 'start' an application. You always open (read:look at) a document by clicking on it."? What the hell? What about when I sign on to AIM or IRC? What about when I start emacs to read its built-in documentation? What about when I start a performance tracker like ps3 or gkrellm? What about when I start aterm to type a command? None of those are opening a document, they are all starting an aplication. Also, about point number 3: Why can't I just exit an application when I know i'm done with it, freeing up system resources then instead of later? It takes the same amount of work (one click).
Re:gnome people... by schuster · 2004-09-06 10:45 · Score: 1

someone correct me if I'm wrong but I think OpenDoc worked a lot like this. Everything in OpenDoc was an object and they could all work with all the other objects that were available. I used OpenDoc once but I was only like 12 years old at the time so my memory is hazy. I remember it being a fascinating technology though, even if the software was really unstable. To this day, nothing has blown me away quite like OpenDoc did.

--
--- Don't ever trust a woman until she's dead- B.B. King
Re:gnome people... by Anonymous Coward · 2004-09-06 11:44 · Score: 1, Insightful

It's Microsoft Java and it's awful. There may be patent problems, not with C# or CIL but the 'excellent standard library' as you say. How do you know what Microsofts long term strategy regarding .NET is? Why are .NET people .ALWAYS so defensive over vague comments that they interperet as being negative, yet .NEVER respond to questions about patents on the supporting libraries? I don't have a JVM or Flash installed either, is someone going to start jumping up and down about that?
Re:gnome people... by Spy+Hunter · 2004-09-06 12:00 · Score: 0, Troll

They can't patent the libraries, only ways of implementing the libraries. Mono can either work around the patents (assuming microsoft ever tries to enforce any), or they can break with Microsoft compatibility in the few patented places. However, as I argued in my previous post, though you seemed to miss it, it's not in Microsoft's bset interest to do this because it would fragment the C# community and destroy any perception of C# being cross-platform. Since it wouldn't destroy Mono and it would have bad consequences for C#, Microsoft has no incentive to do it. Furthermore it would be very bad press for Microsoft to start suing open-source developers. There's no precedent for that kind of behavior. Mono is safe.

--
main(c,r){for(r=32;r;) printf(++c>31?c=!r--,"\n":c<r?" ":~c&r?" `":" #");}
Re:gnome people... by juhaz · 2004-09-06 12:09 · Score: 1

They can't patent the libraries, only ways of implementing the libraries.

In theory, that's the way patent system should work.

In practice, that is NOT how it works, and you know it. They can, and do, patent ideas, not ways of doing them. Oh, sure, that kind of patents would probably get thrown out in court, but it'll still be heck of an expensive lawsuit to go against Microsoft even if you win, which is why nobody wants to take a change of that happening.
Re:gnome people... by Spy+Hunter · 2004-09-06 12:27 · Score: 1

HAS microsoft patented the libraries? If they haven't the point is moot. I have yet to see any evidence of Microsoft patents on the C# libraries. People were all up in arms about this a few years ago when C# came out, but any patents Microsoft had pending on C# are either granted or rejected by now, so they're open to public scrutiny. So show us the patents you're so worried about already! And even if Microsoft does have direct patents on several parts of the core C# libraries, my argument still stands. It's not in Microsoft's best interest to enforce those patents.

--
main(c,r){for(r=32;r;) printf(++c>31?c=!r--,"\n":c<r?" ":~c&r?" `":" #");}
Re:gnome people... by Spy+Hunter · 2004-09-06 13:47 · Score: 1

That patent is disturbing, but it hasn't been granted. And I can't imagine that it will be, with claims like those. In fact, if this patent is granted as-is, practically every open-source or commercial project under the sun will violate it, not just Mono. Furthermore, Mono was started in 2001; this patent wasn't filed until 2002 so Mono itself counts as prior art. If Microsoft gets anything out of this patent, it is likely to be the parts that deal specifically with web applications. But even if Microsoft does have patents on System.Web.Services or Windows.Forms and the like, that doesn't affect GNOME development since they use GTK#. So this silly bias against everything written in C# is ridiculous.

--
main(c,r){for(r=32;r;) printf(++c>31?c=!r--,"\n":c<r?" ":~c&r?" `":" #");}
Re:gnome people... by rar · 2004-09-06 23:13 · Score: 1

You never 'start' an application. You always open (read:look at) a document by clicking on it.

I am not the original poster, but I have been thinking along similar lines.

What about when I sign on to AIM or IRC?

Then you open the document "my AIM session" which opens your AIM window with your preferences and configs. You could also have a 'my alter-ego AIM session' document to open AIM with a completely different configuration.

What about when I start emacs to read its built-in documentation?

You open an emacs document and pick 'help' from the context menu. If you do not yet have any emacs document you create one from the emacs-document-template.

What about when I start a performance tracker like ps3 or gkrellm?

Same thing as with AIM above.

What about when I start aterm to type a command?

You open a 'terminal' document. The terminal opens in precisely the same state as when you last 'closed' it. If you want a terminal with a fresh state, you create one from the aterem-templates.

Why can't I just exit an application when I know i'm done with it, freeing up system resources then instead of later?

In the system described, there is no difference between "hiding away" and "closing". As for system resources, they will be freed up if they are needed. If they are not needed, you have the benefit of cached access when you re-activate the document.

--
Open Materials Database
Re:gnome people... by timts · 2004-09-07 04:51 · Score: 1

is this really revolutionary over the file system or just a sub system work with the file system to provide index and additional information of files for search?

I think it's easier to implement and use, simple to backup and restore if it's not integrated to the file system core.

just my 2 cents
Re:gnome people... by alphan · 2004-09-07 05:08 · Score: 1

I guess, it is completely user-space , which makes me wonder about performance.
Actually, I would like to see some interface in the kernel level. Here is a pretty recent discussion in kernel's mailing list regarding reiserfs4 (, winfs, mac osx' new search thing) and reiser4's plugin mechanism. Interesting read.
Re:gnome people... by timts · 2004-09-07 05:53 · Score: 1

the file system itself should remain same old speed

just like current M$ indexing service is supposed to use idle time to index stuff

if the search part is not used all that frequent, it doesnot have to be kernel space and it should be fast enough for users

again, just my 2 cents.
Re:gnome people... by Tyball · 2004-09-12 10:53 · Score: 1

Sounds like you're really shoehorning a lot of things in order to make your "everything is a document" model work, especially in regards to the terminal stuff. Sessions might work for AIM and gkrellm, but terminals?

Backups, and being organized in a general way? by manavendra · 2004-09-06 03:19 · Score: 5, Interesting

...I can kind of see this would make it easy to search and locate documents. What about backups though? How would a user be able to group (manually) related files together, so that the whole bunch can be backed up later, without having to search for all seeminly related (or unrelated) keywords to trace all hitherto-unrelated documents?

Secondly, with this mass of files being spread over several disks, surely, this is in a way forcing the user to "search" for everything. Or isnt it? Will the underlying FS layer still be accessible in the general way that it is?

--
http://efil.blogspot.com/

Re:Backups, and being organized in a general way? by manavendra · 2004-09-06 03:21 · Score: 3, Funny

Oh, and forgot to add that I wouldn't be happy if my disk begins to churn away as the "Crawler" updates its indexes, say, while I'm playing doom! Can't afford to have any other spooky, disk grinding noises, while squinting my way through those dark corridors..

--
http://efil.blogspot.com/
Re:Backups, and being organized in a general way? by carnivore302 · 2004-09-06 03:26 · Score: 2, Interesting

Will the underlying FS layer still be accessible in the general way that it is?
Sure, because the dbfs is implemented in userland. All applications will work as before, including ls.

I wonder if this could be made into a plugin for reiserfs 4.

Mark.

--
Please login to access my lawn
Re:Backups, and being organized in a general way? by manavendra · 2004-09-06 03:37 · Score: 4, Funny

Well, you may find it funny, I've already messed up two pair of trousers - once when my phone rang and second when my friend put his hand on my shoulder to get my attention...

--
http://efil.blogspot.com/
Re:Backups, and being organized in a general way? by Aeiri · 2004-09-06 03:46 · Score: 5, Funny

You should have posted that as Anonymous Coward...
Re:Backups, and being organized in a general way? by Myolp · 2004-09-06 05:09 · Score: 1

I can see some areas where a RDBMS-based filesystem could be useful. However, in most cases, I really can't see the point of using a relational structure for your files rather than a directory structure.
Re:Backups, and being organized in a general way? by Ignominious+Cow+Herd · 2004-09-06 07:24 · Score: 3, Insightful

Think of a directory structure as just one instance of a relational structure - a heirarchical or location-based one. Directories (or locations, or relationship to other files), even nested ones, are just another type of metadata. Once you have that concept in mind other things become obvious. For example you may group files by type (Image, Text) or by project, or by author. With a directory structure you either have to make multiple directories and symlink everything, or you're stuck with one view of your files.

In short it gives you multiple, simultaneous groupings of your data.

--
Lump lingered last in line for brains, and the ones she got were sorta rotten and insane.
Re:Backups, and being organized in a general way? by Pharmboy · 2004-09-06 10:26 · Score: 2, Funny

No, I think its funnier that he didn't. My guess is after he hit "submit" he went, "oh crap, now everyone knows I wear Depends when gaming".

--
Tequila: It's not just for breakfast anymore!

Performance? by saden1 · 2004-09-06 03:19 · Score: 2, Interesting

Isn't this thing with DB's getting a little excessive? You're adding another layer and step to storing data which will in all likely hinder performance. I'm not sure the benefit out weight the cost.

--

-----
One is born into aristocracy, but mediocrity can only be achieved through hard work.

Re:Performance? by Anonymous Coward · 2004-09-06 03:31 · Score: 5, Insightful

Depends on what you are using your computer for of course.

You can say the same thing for a GUI, and its correct for certain applications of computers, but wrong in others.
Re:Performance? by bogie · 2004-09-06 03:33 · Score: 2, Informative

Of course your right. But knowing that pretty smart people are working on this I don't think your going to see them go ahead with an implementation that's only half the speed of current linux file systems. I'm sure they'll only go ahead with this and integrate it into KDE when the performance is up to snuff. It's simply way too early to say that the benefits out weigh cost until the code is complete.

--
If you wanna get rich, you know that payback is a bitch
Re:Performance? by psavo · 2004-09-06 03:34 · Score: 4, Insightful

Isn't this thing with DB's getting a little excessive? You're adding another layer and step to storing data which will in all likely hinder performance. I'm not sure the benefit out weight the cost.
Well, if it's only a name-translation thingy, then it shouldn't affect performance of file reading (when operating on sufficiently big files), only file opening/stat:ing.

--
fucktard is a tenderhearted description
Re:Performance? by Anonymous Coward · 2004-09-06 03:40 · Score: 0

That'll take forever, code is never complete ;)
Re:Performance? by kosmosik · 2004-09-06 03:48 · Score: 4, Informative

Nobody is sugesting to use such database FS for entire system. Only for specific data (f.e. user documents) - not entire system (binaries, libraries etc.) where such performance matters. Well in fact it will improve performance since right now applications that need such indexing (best examples are apps for organizing music (like iTunes) or digital pictures colections (like Adobe Photo Album or Google Picasa)) do it themselves which probably is not the fastest way and is not unified across the system. Now for *some* applications such view on files that lets you query for specific files/objects operating on query results rather as directories of files have much benefit. But it is only for organizing data, and in limited scope (as I've said - digital music, photography, probably some other fields). I don't really belive that this would seed up searching for office documents over LAN or smth. - when somebodys documents are in mess DB-FS won't change anything as the documents probably lack metadata, proper naming anyway.
Re:Performance? by Anonymous Coward · 2004-09-06 03:52 · Score: 0

Not only that but someone, somewhere has to manually enter all the meta-data which may consume more time than a search on a traditional file system. I have all the file search tools I need, directly from the shell and metadata is only created once per directory, then becoming a part of the path eg: /home/user/images/raster/logos/companylogo.png

Who cares if a database cross-indexes, a shell script can do that and knowing exactly where a file is will always be faster than searching for a keyword. It may work for google but I don't need or want this, I can't imagine any use for it on the company network either. Blaghhhhhhhh.
Re:Performance? by BenjyD · 2004-09-06 03:54 · Score: 4, Insightful

Why not just run in console mode? All this GUI stuff is just getting in the way of absolute performance.

If it adds 0.5 seconds to every time you save a file, but saves you 20 seconds of filesystem navigating every time you open the file, that's a worthwhile tradeoff. Add to that the fact that copmuters don't get tired or bored, while humans do, and it makes even more sense to shift as much of the burden of working onto the computer as is practical.
Re:Performance? by Anonymous Coward · 2004-09-06 03:56 · Score: 0

ls -l / > /dev/next_life
Re:Performance? by jgardn · 2004-09-06 03:59 · Score: 4, Insightful

Not necessarily. Consider the performance of finding a document you wrote two years ago. How long does it take you to walk through the directory hierarchy browsing file names? How fast is the file search tool? Wouldn't it be faster if you could say "Show me the documents I wrote two years ago" and the refine the search or browse the results?

Storing data in a relational database is natural because it is more like the way we store data in our minds than the hierchical structures of traditional file systems.

Also, we allow a complete abstraction of the underlying database in relational systems. The database can store the data however it sees fit, and can arrange the data on disk without the users noticing a change.

I look forward to experimenting with a relational filesystem. I think it would be a wonderful thing to try out and see if it actually has the advantages I outlined above. I'd also like to see the actual disadvantages.

--
The radical sect of Islam would either see you dead or "reverted" to Islam.
Re:Performance? by Anonymous Coward · 2004-09-06 04:03 · Score: 0

I do run 'in console mode', you insensitive clod!

It takes longer to navigate using a GUI than by using tab-completion, sorry.
Re:Performance? by Glock27 · 2004-09-06 04:17 · Score: 1

Isn't this thing with DB's getting a little excessive? You're adding another layer and step to storing data which will in all likely hinder performance. I'm not sure the benefit out weight the cost.
[sarcasm]Yeah, you sure wouldn't want to gratuitously use the 99.5% idle time your desktop computer is currently wasting.[/sarcasm]
Actually, my sarcastic response isn't even right..this will only affect performance when you're searching for a file, or saving one. How much time do you really spend doing that? Further, how much time will you save because you're able to find things efficiently? Nothing wrong with using the CPU to simplify life for the user...if the user so desires of course.
Looks like an interesting project...

--
Galileo: "The Earth revolves around the Sun!"
Score: -1 100% Flamebait
Re:Performance? by DanielDittmar · 2004-09-06 04:25 · Score: 1

Storing data in a relational database is natural because it is more like the way we store data in our minds than the hierchical structures of traditional file systems.

You'd be surprised how few actually think in predicate logic. Or any kind of logic.
Re:Performance? by Anonymous Coward · 2004-09-06 04:30 · Score: 0

Further, how much time will you save because you're able to find things efficiently?
The cost of manually entering the metadata to enable the search functionality far outways that. It's a pipedream like the semantic web, entering consistant metadata is a PITA.
Re:Performance? by EvilSS · 2004-09-06 04:46 · Score: 1

I think it is worth the cost (as long as it is reasonably well implimented). I'd love to see this at the enterprise level. I can't tell you how many of my clients boast that they have X terabytes of storage on but when I ask them for the install instructions for something, I just get that deer-in-headlights look. Unfortunatly, storing data in an organized fashion takes more discipline than most companies can muster. Then there are the IT departments who see the 26 letter alphabet as a personal challenge when it comes to mapping drives for users. I believe offloading the "where" to the OS so that the user can find the "what" they need to do their jobs will benifit businesses greatly.

--
I browse on +1 so AC's need not respond, I won't see it.
Re:Performance? by EvilSS · 2004-09-06 04:55 · Score: 1

Will it? Would filling out a short form when saving a file take up that much time? What about auto-indexing? One would have to assume that the system would be smart enough to be able to read documents and create indexes for those? Sure, it wouldn't work very well for images and audio files, but for 90% of what most users and businesses use everyday, it would.

--
I browse on +1 so AC's need not respond, I won't see it.
Re:Performance? by hunterx11 · 2004-09-06 05:38 · Score: 1

They're probably not the ones running Linux :)
But seriously, you don't have to be smart to understand computers, you just have to think logically. Most of the time I find when people can't understand how do to something on the computer, it's because they expect the computer to act like a human instead of a machine. I think this is part of the problem with Windows--being aimed at the lowest common denominator, it's more concerned with working when used improperly than excelling at being used properly. Hackers hate the paper clip for all the right reasons, and people like it for all the wrong reasons.
</rant>

--
English is easier said than done.
Re:Performance? by kfg · 2004-09-06 06:23 · Score: 2, Insightful

Why not just run in console mode? All this GUI stuff is just getting in the way of absolute performance.

Although this is a KDE related project the concept itself has nothing to do with whether you use a GUI or not and the performance hit comes at the level of the DB, not the GUI.

As for shifting the burden to the computer it doesn't really do much of that either as a human mind still has to formulate and input the query terms as well as judge the validity of the query result.

The DB as filesystem has a lot of merit, but really only in those situations where you have a massive number of files distributed across many systems. Take Google and the internet for example.

Now imagine having to google your local system to find every damned file.

Now, maybe I'm just different, but I've got 45,000 files spread across 1300 directories in my Home directory, and I can find any one of these by navigation in under your hypothetical 20 seconds saved, but then I'm the sort of person who sorts his laundry by placing it in seperate hampers in the first place instead of later spreading it across the basement floor and sorting it out. I know exactly where all my dirty whites are up front.

And the latter sort of person isn't likely to do a very good job of entering the metadata necessary to make a DB based filesystem work well anyway. The brain is a wonderful DB in and of itself and knows the "meaning" of things as well. I already know what 14t.jpg is all about. I'd have to tell my DB what it means.

Sure, let the computer do the work that it's better at, like recalculating spreadsheets or finding redheads with big ones on the web, but that doesn't replace the brain.

I already know which redheads have big ones on my local system and where to find them no matter what the file is named and the computer can't find all of the redheads with big ones on the web unless they all have the proper metadata attached to them. Some file out there somewhere named 1038754875747.jpg is just as anonymous to the computer as it is to you.

KFG
Re:Performance? by value_added · 2004-09-06 06:25 · Score: 1

"Storing data in a relational database is natural because it is more like the way we store data in our minds than the hierchical structures of traditional file systems."

Interesting, but it does remind of the oft-repeated "There are no straight lines in nature." Seems when it comes to even the most mundane things like mowing the lawn, planting a garden, or vacuuming the rug, we seek out or create a our own straight lines to adhere to a requisite logic imposed to make sense of it all.

Put another way, it's probably just as natural and possibly more efficient (taking into account things like routine administration) to put "letters to grandma" in the folder named "letters."
Re:Performance? by jilles · 2004-09-06 06:33 · Score: 1

Where does this notion that databases are slow come from? Databases are optimized for storing, finding and retrieving data as fast as possible without compromising the so called ACID properties. Filesystems are optimized to retrieve data that you already know the location of. These features are not mutually exclusive. However, implementing the features of one on top of the other is not necessarily an optimal solution (though it should be feasible).

That's why searching existing filesystems is slow, very limited and generally not worth the trouble beyond very simplistic queries. The search mechanisms on top of them are totally inadequate and slow. Similarly, databases are not very good at serving up large amounts of small files or at streaming big datablobs at a high bit/s ratio. That's why we still have hierarchical filesystems.

Hierarchical filesystems (even the ones that can store custom file attributes) do not exploit the enormous amount of meta data that is known (or can be extracted from) data. It would be enormously useful if I could update a version table with a database trigger whenever I hit the save button on some document.
Trivial to implement on a dbfs, only feasible using hacks or custom tooling on a normal fs.

Using triggers, external datasources, manually entered data etc. you can associate very rich metadata with your data very easily. Obviously you don't want to that for system files (as the author of the article points out).

--

Jilles
Re:Performance? by Anonymous Coward · 2004-09-06 06:37 · Score: 0

Would filling out a short form when saving a file take up that much time?
It will if you want any sort of cross indexing. Even if you stick some core taxonomies in there to help generate additional metadata, the user has to understand basic information management and enter relevant and appropriate keywords.

One would have to assume that the system would be smart enough to be able to read documents and create indexes for those?
Automatic indices across a single format can be inconsistant, look at the web with html markup used for presentation for an example.
for 90% of what most users and businesses use everyday, it would
And the remaining 10% can just be saved in predictable places on a traditional file system and bypass the overhead of the database and manual entry of metadata entirely?

A filepath is metadata and if you want cross indexing, there's this thing called a symlink or even a database table referencing a real-world file. I'm really not being a twat, there's the geeky "hey cool" factor but other than that, I just don't see how this improves things for users. Instead of forgetting a filepath, they now forget their metadata. The ultimate goal for WinFS was Microsoft ownership of users data archives, once the file interface becomes more abstracted and works exactly like google/MSN search it becomes far more acceptable to store your home directory remotely!
Re:Performance? by feidaykin · 2004-09-06 07:32 · Score: 1

Now, maybe I'm just different, but I've got 45,000 files spread across 1300 directories in my Home directory
Holy shit! I have two sort of catch all directories for my personal files, the largest containing a paltry 4,992 files in 329 folders. And I thought I was a pack-rat. But perhaps you simply produce a lot of files in your average computing sessions.
It's interesting how different people can perceive a certain number of files in different ways. An example that has stuck in my mind occurred a couple years ago when I was taking a networking course. A group I was working with needed a router configuration file, and I assumed that a PC used for entering in these files would have one stored. I suggested someone sitting there to search for all text files. A mere 90 files showed up and he was visibly agitated, and instructed me to find a config file in those results by abruptly rising from the seat and proclaiming "You do it, Einstein boy." Of course to me 90 files was not a large number to sift through and thankfully someone had clearly titled a router config file, so it didn't take me more than a few seconds.
So yeah, you have a lot of files. But to the disgruntled user in my example 90 text files is a lot and I'm sure his brain would likely explode if he ever had to manage an amount of files nearly as massive as those in your home directory. Different strokes and all that I guess.
But back on topic, the tinfoil hat in me tends to believe that the only logical reason to embed metadata into files on personal computers would be in order to allow third party programs to collect and upload that metadata for marketing purposes. Heh... It seems pretty obvious to me, which means some marketing bastard somewhere is thinking of it right now.

--
"To confine our attention to terrestrial matters would be to limit the human spirit." -Stephen Hawking
Re:Performance? by rthille · 2004-09-06 08:29 · Score: 0, Troll

adds 0.5 seconds to every time you save a fike, but saves you 20 seconds [...] every time you open

Definitely not a good trade off. I'm a former Microsoft Word user; every other keystroke is Ctrl-S

--
Awesome furniture, accessories and cabinetry in Santa Rosa, CA: http://humanity-home.com/
Re:Performance? by arose · 2004-09-06 08:48 · Score: 1

Your mind may store data like in a relational database, but mine seems to work in a rather wiki-like way. :-/

--
Analogies don't equal equalities, they are merely somewhat analogous.
Re:Performance? by chiph · 2004-09-06 09:07 · Score: 1

Storing your files in a DB gets you at least three big advantages:

1) Vastly reduced fragmentation. You can store a couple of small files in a single DB page. Also, when a DB grows, it usually does it as a percentage, or as a large allocation (like a megabyte or more).

2) Finding your files. Like another poster said, finding a file from two years ago is just a matter of building the correct WHERE clause.

3) Arbitrary indexes. If I'm really into my music, I can build an index over my MP3 genre, artist, album, etc. If I'm a salesman, I can build an index (ok, ok, they'll use a wizard) over my contacts, so when I travel to another city, I can easily do a locality search, and make sales calls on people who have expressed interest in my product.

As a further extension of the arbitrary index idea -- imagine doing joins -- I can find the cover art for my MP3 by joining on the artist & album name.

Does all this slow things down? Sure. Everything is a tradeoff -- if you want more flexible searching, reduced fragmentation, etc. you have to pay the cost up front when you insert new rows.

Chip H.
Re:Performance? by EddWo · 2004-09-06 10:05 · Score: 2, Interesting

Why don't people get it? It isn't about entering arbitrary keywords as file properties, it's about forming relationships between items. You shedule a meeting, you create a new meeting item and link in the attendee items, location items etc. You take some notes at the meeting, you link the notes to the meeting item. You link the meeting item to the project you are involved in.
Now you can locate the notes by filetype, time/date, project, meeting, people, location, without entering any text by hand, just linking one or two critical bits of information per new item.

--
"Taligent is still pure vapor. Maybe they'll be the last who jumps up on Openstep... "
Re:Performance? by Anonymous Coward · 2004-09-06 11:00 · Score: 0

1) Fragmentation? You must be a windows user, I backup and recreate /home and /var partitions on all my machines every other month.

2) I use centralized CVS/SVN for things I need to keep archived, plenty of metadata there. At worst it's only a matter of grepping a directory listing. It's much more difficult for me to locate the right machine which is why I started using CVS.

3) I can already auto-build a database by pulling the tags from my ogg/mp3 files. This isn't a difficult application to write, IIRC it's an app in a couple of C for beginners style books. I'll wager freshmeat is full of such stuff.

3b) This is what databases were invented for, the salesman would already have a database or contacts manager that let him do this and wouldn't need a db wrapper around the filesystem.

Your album cover example is simply odd, why would you need to do a join for that (Hello Mr Nail, meet Mr SQL database)? Doesn't ID3v2 allow you to add covers anyway?
Re:Performance? by Anonymous Coward · 2004-09-08 06:06 · Score: 0

Why don't people get it? It isn't about entering arbitrary keywords as file properties, it's about forming relationships between items. You shedule a meeting, you create a new meeting item and link in the attendee items, location items etc. You take some notes at the meeting, you link the notes to the meeting item. You link the meeting item to the project you are involved in.
Now you can locate the notes by filetype, time/date, project, meeting, people, location, without entering any text by hand, just linking one or two critical bits of information per new item.

Yes, you're right! If only UNIX had some way of linking (if only symbolically) data from one logical listing (say, like a telephone directory) into a different logical listing.

Oh, well! I guess we'll just have to hope that one day a new kernel system call gets created to link things in that way. Of course, in typical UNIX fashion, it would probably get abbreviated, to something dumb, like ln.

Still, we can only but hope!
--
AC
Re:Performance? by EddWo · 2004-09-08 07:50 · Score: 1

Yeah very clever, but it's not quite the same thing. NTFS has that too, though it's more difficult to get at. So how to you get person<->organisation<->project<->event<->document <->communication(email) so you can find all the related things given anyone bit of information.

--
"Taligent is still pure vapor. Maybe they'll be the last who jumps up on Openstep... "

"Implementing in GNOME" by kosmosik · 2004-09-06 03:19 · Score: 5, Insightful

Such thing should be implemented at kernel level to be transparent for *any* aplication. Without this it will just lead to a mess (like 4 different implementations) and some apps working with it and most not. As f.e. you can browse SMB network with Nautilus but when you actually try to open a file (from SMB via Nautilus) in OpenOffice.org you will get a info that viewer does not support this method... It must be a standard system routine not another level between system and GUI.

Re:"Implementing in GNOME" by Anonymous Coward · 2004-09-06 03:33 · Score: 1, Informative

the problem is implementing such things on all unix systems. gnome/kde are not just linux desktops, so it is far easier to implement as part of their platform stack than persuade lots of different kernel development teams to implement it.
Re:"Implementing in GNOME" by slimyrubber · 2004-09-06 03:37 · Score: 2, Insightful

Actually database storage should be implemented in the filesystem rather then kernel or window/desktop managers. That would make much more sense and theoratically, it will be faster.

Just my uneducated opinion.

--
[ I can not bring myself to believe that if knowledge presents danger, the solution is ignorance ] -- Isaac Asimov
Re:"Implementing in GNOME" by eludias · 2004-09-06 03:38 · Score: 1

Just because distributions/developers cannot standardize upon one library for all their filesystems needs does not mean it has to be implemented in the kernel. Having kernel support for synchronising an index with the filesystem it indexes makes sense, but not to have a complete implementation in the kernel.

The same reasoning would put Gnome or KDE into the kernel so this won't lead 'to a mess' and different applications working differently.
Re:"Implementing in GNOME" by Henk+Poley · 2004-09-06 03:39 · Score: 2, Informative

No, not in kernel. It should be done as a deamon, like he did.
Re:"Implementing in GNOME" by Anonymous Coward · 2004-09-06 03:48 · Score: 0

please, mod parent up. In the article is a description of the system which explains that the userinterface and the functional layer are seperated (and not dependent on the windowing/widget system).
Re:"Implementing in GNOME" by Anonymous Coward · 2004-09-06 04:06 · Score: 0

No, the real problem is that there's a kernel project and there's DE projects (KDE/Gnome), but nobody's in charge of the stuff in-between.

Therefore if someone has a good idea, the only practical way to get it into the operating environment is either through the kernel or through KDE/Gnome. (which is why the kernel is loaded with filesystems that could be in user-space, and the DEs have way too much low-level functionality)
Re:"Implementing in GNOME" by angel'o'sphere · 2004-09-06 04:38 · Score: 1

That does not need to be in kernel level.

Enough is a standard API so that Gnome and KDE can make a "File Open Dialog" for opening files.

Applications should have an "Open Recent" option anyway, or an option to open a browser with all files created by them.

Bottom line the term Database is IMHO missleading, or not needed. E.G. most iApps on a Mac have a root directory in the user home directory. In that directory they make a sub directory stucture like this: yyyy/mm/dd for each year, month and day where a file gets created.

The final file name is still free to chose. All other informations can be kept in the filesystem, via links.

A application should save without even asking the user in the standard place, except if configured otherwise.

Most higher level organization schema like 'project' can be accomplished with directories and links to the files involved.

Basicly only a good schema, a browser/file open dialog and an API for applications is needed.

angel'o'sphere

--
Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
Re:"Implementing in GNOME" by justsomebody · 2004-09-06 04:40 · Score: 3, Informative

Yeah right. And whole world will use this daemon.

Problem with this logic is that not everybody is gonna use DBFS. For example: Some people would like to use Reiser4.

Proper thing would be dummy kernel (or some higher VFS, but making whole thing independant of wheter graphic mode is present or not) API for this kind of file access. If accessed FS is not DB related then it should convert standard functions (implementing some Metadata index on basic FS). If accessed system is Database FS then it should go trough it's native layer.

Who says that

MOVIES="*.avi *.mpg *.qt"
for ftype in $MOVIES; do
ls /mnt/volume/Personal/$ftype

aka.

select from fs where (path = '/mnt/volume/Personal/*') and ((path = '*.avi') or (path = '*.mpg') or (path = '*.qt'))

is different than

select from fs where ((type='movie') and (location='Personal'))

Second option could pose other options. Like searching by actors and directors, MP3 info, Office tags etc. But all open and save routines could be done trough and with Metadata. While Metadata and data would be connected.

Let's say I copy MP3 files from CD. CD is not DBFS. All my tags go down the drain. Hope you don't expect I will copy all files trough this daemon or in graphic mode at all. Just as on plain system as on DBFS.

Off course, it would help if VFS layer could detect MIME and act accordingly. Example: Old ISO9660 burned CD (or even better ssh mounted drive) with MP3. I copy these files to my computer remotely from terminal console on some other computer, computer being copied to, resolves ID3 and updates Metadata from ID3 tag index on fly. Without having some cron job.
Same thing goes for my Office files. All files have it's creator and it's description. Why wouldn't go in Metadata index when file is received and saved from e-mail. If this problem is not solved before implementation then all you can expect are holes in your Metadata with a lot of non-indexed data.

Well, that example is nothing new. Reiser4 already does that with plugins. The question here is:

Will everybody use Reiser4? NO
Will everybody write metadata plugin for Reiser4? NO
Will Gnome or KDE support Reiser4 directly? NO

Would this have better chance when Universal file access would be defined independent from FS and independant from Graphic mode? YES

--
Signature Pro version 1.13.2-3 release 83.5 beta3try7 after-breakfast edition
Re:"Implementing in GNOME" by Anonymous Coward · 2004-09-06 04:54 · Score: 1, Informative

no, there are many kernel projects, you missed the point, there is not just linux, there is all the bsds, solaris, irix etc. etc.
Re:"Implementing in GNOME" by Flammon · 2004-09-06 07:00 · Score: 1

Such thing should be implemented at kernel level to be transparent for *any* aplication.

Yes, you're right and the best way it can be done at the moment without breaking every app out there is with Reiser4 and it's plugin infrastucture.

I'm looking forward the the FOSS desktop with all the innovative development happening on all fronts like freedesktop.org, reiser4, gnome, openoffice.org, mozilla... The list just goes on and on.

--
ayottesoftware.com
Re:"Implementing in GNOME" by kosmosik · 2004-09-06 07:23 · Score: 1

I know that it does not *need* to be at kernel/system level. But it makes most of sense in that way. I use many different applications, ranging from non interface daemons, via CLI to GUI aplications. If I would ever think about bothering with somekind of batabase driven filesystem I would like it to support every application I use. It only makes sense in this way. F.e. I love ZSH and I do whole of my file management under shell, but I also love Rhythymbox - so for me it is only valuable if I get the same interface (for DB-filesystem) in my CLI app, in my GUI app, and even in my daemon app (which makes something automated for me, or serves something). And only way to go is *one* userspace library/daemon (not posible it is Open Source) or kernel/system level. I'll go system level with ReiserFS4 and plugins that will come soon...
Re:"Implementing in GNOME" by kosmosik · 2004-09-06 07:38 · Score: 1

Yes in filesystem that is obvious. But usualy when you use filesystem it managed by kernel. :-) So it is somehow mounted in / and visible as some directory. :-) It could be handled something as a *special* (not typical like lets say FAT or ext3) filesystem such as proc or sys or devfs and so on... Basicaly so I can use *shell* (if you can use shell you can use GUI on it also it is in standard unix path) to browse this db filesystem. It is just a example but imagine something like this: "/sqlfs/dbname:SQL_QUERY/"... It could be used in KDE, GNOME or whatever that can use files and directories...
Re:"Implementing in GNOME" by spitzak · 2004-09-06 07:44 · Score: 1

Several of the responders seem confused here. What is wanted is not to literally put every detail of this into the kernel, no more than the details of ext2 are in the base kernel code. Nor does this mean that the entire database interface must be in the kernel.

What is needed is the ability to take any selected object, anywhere, identify it with a "filename" that is a null-terminated string of bytes, and send it to open() and get a link to that object. Then when that SMB selection is made, it can be sent to OpenOffice and it will, guaranteed, open it! The filename can be complete gibberish, OpenOffice does not have to understand it, it just has to rely that it can open it and get the same thing the user expected.

I don't understand why anybody thinks we want to use any interface other than open() to get at objects. It seems painfully obvious to me, yet both Gnome and KDE seem to think we need to learn complex, incompatable interfaces.

It also does not help that there are morons out there pushing "case independence" and "wide characters" and i18n using anything other than utf-8. All of these are a serious impediment to getting this done correctly.
Re:"Implementing in GNOME" by kosmosik · 2004-09-06 07:45 · Score: 1

Yes I am exactly talking about this Enough is a standard API so that Gnome and KDE can make a "File Open Dialog" for opening files. OK lets assume that GNOME and KDE standardize on one implementation. But what with dozens of other stuff like OpenOffice.org, Blender, Apache (I've just named some killer apps that you can not deny that are important)? You want Apache to use *GNOME* or *KDE* API's? Files and so on should be handled at kernel level (yes it is kernel level, kernel acts betwen applications and filesystem giving applications consistent interface to *any* (supported) underlying FS).
Re:"Implementing in GNOME" by kosmosik · 2004-09-06 08:16 · Score: 1

Yes but drawing GUI is something different from managing files... Do you think implementations like (check out my other post in this thread) procfs or sysfs are mess? They are very convinient and useful IMHO. Oh and that can be done on various platforms.
Re:"Implementing in GNOME" by Anonymous Coward · 2004-09-06 13:47 · Score: 0

*Which* kernel level? The Linux kernel? BSD? MacOS? Windows? The programs you name run on all of them...
Re:"Implementing in GNOME" by yuri+benjamin · 2004-09-06 17:09 · Score: 1

How about a Gmail-like labels system:
$ ls /dbfs/dbname:label1+label2+label3.../

You could have predefined labels that take arguments, like lasttouch(>2004-03-15) or owner(yuri).

I really like this idea, because any app that can pass a filepath to a system call can use it straight away.

Get coding all you überdevelopers. Give me a call when it's ready :-)

--
You make the mistake of thinking you can educate the fundamental stupidity out of people. You can't.
Re:"Implementing in GNOME" by Anonymous Coward · 2004-09-07 02:09 · Score: 0

All files have it's creator and it's description.

"its", "its". (Actually, "All files have their creators and their descriptions." or "Each file has its creator and its description.".)
Re:"Implementing in GNOME" by kosmosik · 2004-09-07 02:16 · Score: 1

You are right but this are details. Only thing I wanted to state that such implementation should work for open() function as an ordinary unix path.
Re:"Implementing in GNOME" by the_truk_stop · 2004-09-07 06:55 · Score: 1

you can browse SMB network with Nautilus but when you actually try to open a file (from SMB via Nautilus) in OpenOffice.org you will get a info that viewer does not support this method
Ximian has been working on this in their OOo builds, and they're working to get the changes accepted upstream. They're using GnomeVFS to handle the situation.

File system in KDE... by JakeThompson1 · 2004-09-06 03:20 · Score: 1, Insightful

So now, you will lose access to your file system if you use a simple window manager instead of KDE?

Great idea.

Re:File system in KDE... by mn3m05yn3 · 2004-09-06 03:23 · Score: 2, Interesting

Doesn't seem to be true. Article indicates the DBFS will run on top of other filesystems. if you don't use KDE, you don't use DBFS... ext2/3 etc is still there. The real question is whether you can use KDE without using DBFS...

stubborn by mn3m05yn3 · 2004-09-06 03:20 · Score: 3, Insightful

Article doesn't address whether or not we can turn DBFS off and use the more traditional hierarchical method of file placement. Will we be dragged into this kicking and screaming?

Re:stubborn by LegionX · 2004-09-06 03:42 · Score: 1

"This gives an impression that there is no hierarchy, but to applications nothing has changed, the open-file and save-file dialogs have the same APIs."

it does..

Your filesystem is still there.. hidden.. waiting..
Re:stubborn by I+confirm+I'm+not+a · 2004-09-06 03:44 · Score: 1

Article doesn't address whether or not we can turn DBFS off and use the more traditional hierarchical method of file placement.

I would imagine the article author thought it was a given: we currently have choices of various file-systems, with no one FS requiring it be used to the exclusion of all others. I'd be highly surprised if DBFS was any different: people who want to use it will use it, and people who want to use Reiser4 will use that, and people who want to ext2 will use that.

--
This is where the serious fun begins.
Re:stubborn by donkeyboy · 2004-09-06 03:57 · Score: 2, Insightful

Most of these DB File System implementations overlay the existing file system. The standard file access methods are still there. The DB functionality is implemented as extensions.

The DB tables are then implemented as special files that coexist nicely with the existing directory files.

Keep in mind that if a "new" file system breaks heirarchial file access is also breaks every app in existance.
Re:stubborn by demented · 2004-09-06 05:25 · Score: 1

I'd be highly surprised if DBFS was any different: people who want to use it will use it, and people who want to use Reiser4 will use that, and people who want to ext2 will use that.

Actually, DBFS is just an abstraction layer over existing file systems. So people who want to use DBFS are still using reiserfs/ext3/xfs or whatever is the filesystem du jour. Very roughly, DBFS could be implemented like this: there is a daemon and its underlying backend storage in the form of bunch of data files (ie. gdbm files) somewhere on the file system. Whenever there is a request for opening a file, the file selector, instead of presenting the current directory and all the files in it, contacts the DBFS daemon and gets a list of keywords. Then the user picks some keywords and file selector queries the DBFS daemon with the selected keywords and, at the end, gets one or more filenames of the real files to open. Same is with the file-close: file selector present the user with some keywords and the posibility for the user to enter some new ones, as well as the filename for the document. Then, user associates the file with selected keywords. File selector then prepares the query (something like SQL INSERT or UPDATE) and queries the DBFS daemon. If everything is OK and because of the posibility that document could be comprised of several files, the DBFS daemon should reply to the file selector with the possibly altered names for the files. The file selector then saves the files to the underlying file system, with the names obtained from the DBFS daemon. This way, all the files are accessable even when DBFS is turned off and the implementation is platform-independent. The open() and close() system calls are unaffected with this, so no penalties on performance.

That is, as I've said before, just a very rough proposition of the DBFS implementation.

Version control would be nice as well by zero-one · 2004-09-06 03:21 · Score: 5, Interesting

I have always thought that version control (file histories, branching and atomic changes) would be nice to have at the file system level. Instead of storing myessay-firstdraft.doc, myessay-seconddraft.doc, myessay-final.doc, the file system should do the work. Then if I want to make a bunch of changes (perhaps I want to try a new page layout), I should be able to commit them as one atomic change (or throw them all away if I change my mind). Then, when I want to make a set of documents with US spelling, I should be able to branch the whole lot (using no disk space) and make the small changes from UK spelling while still being able to integrate other changes I have made.

Re:Version control would be nice as well by ultrabot · 2004-09-06 03:27 · Score: 4, Interesting

I have always thought that version control (file histories, branching and atomic changes) would be nice to have at the file system level.

Sounds like a job for an SVN plugin for Reiser4 file system. Anyone doing one already?

--
Save your wrists today - switch to Dvorak
Re:Version control would be nice as well by trynis · 2004-09-06 03:41 · Score: 1

I've been told the file system for Tops-20 had built-in version control. I tried to google for a reference, but failed. Maybe someone could enlighten me?

--
This is not a sig.
Re:Version control would be nice as well by Anonymous Coward · 2004-09-06 04:04 · Score: 1, Interesting

There is some non-free software which does this kind of thing, such as Tower Trim Context. We've been implementing this at my office. Users are often less than enthusiastic - because it changes the familiar Windows world, and breaks things like links in Office documents which depend on HFS storage...
Re:Version control would be nice as well by Anonymous Coward · 2004-09-06 04:06 · Score: 0

VMS had something similar: file;1, file;2, file;3, &c...
Re:Version control would be nice as well by Anonymous Coward · 2004-09-06 04:06 · Score: 3, Informative

There are two filesystems I know of, off the top of my head, which support versioned files. The first is VMS and the second is ISO9660. I suspect the VMS file versioning probably came from TOPS so you could be right there.

Yes, ISO9660 really does support versioning. You can create files and add a version number using the ; seperator E.g. FOO.TXT;1 and FOO.TXT;2 etc. A compliant ISO9660 can either show you all of the available versions if your OS can cope with it or just chooses the latest version (E.g. for systems such as MS-DOS). I'm not totally certain how versions are handled if you use RockRidge or Joilet extensions; the original ISO Level 1 names are still there but you'll only see the extended filename if you mount a RockRidge disc with a RockRidge ISO9660 driver, so you probably won't see the different versions. It's been a while since I fiddled with an ISO9660 driver to be honest.
Re:Version control would be nice as well by david+duncan+scott · 2004-09-06 04:27 · Score: 2, Informative

Close. That's a VMS feature.

--
This next song is very sad. Please clap along. -- Robin Zander
Re:Version control would be nice as well by SnprBoB86 · 2004-09-06 05:55 · Score: 1

MS SBS 2003 provides this through a "Previous Versions" addition to all XP computers on the domain. It is very poorly implemented and only affects files on the server, but has proved itself useful many times.

I am definitly in favor of version control everywhere. Undo is arguably the best time saving feature ever, let's make an ULTRA-UNDO.

--
http://brandonbloom.name
Re:Version control would be nice as well by Anonymous Coward · 2004-09-06 06:27 · Score: 0

Rational ClearCase is a proprietary version control system that does just this with it's mvfs filesystem.

Each user has it's own view on the versioned filesystem. File versions visible in the view are selected by the user himself by writing simple rules applied to the view.
Re:Version control would be nice as well by Svennig · 2004-09-06 06:36 · Score: 1

That is an absolutely FANTASTIC idea!
Re:Version control would be nice as well by iabervon · 2004-09-06 07:42 · Score: 1

The problem with this is that revision control actually takes a certain amount of interaction, and the filesystem doesn't have the interface to interact with the user. The filesystem can't tell that some of the things that get written are auto-save backups, others are the user habitually hitting "save", others are the user exporting to PDF to see how it will actually look when printed, and a few are actual drafts of the document.

Furthermore, as soon as you start integrating changes, you need human help. When you add some text to your US version, and you pull those changes into the UK version, you'll obviously have to look over the new text to change the spelling. The filesystem is going to be hopeless at figuring out what tool is needed for this operation, and this isn't even a case where you're resolving conflicts.

The real solution is just to use a filesystem-external version control system, and integrate it with programs that manipulate documents (i.e., things the user thinks about, as opposed to arbitrary files). Ideally, have a standard API for integration with version control (with the commands "update from with ", "commit to ", and "revert to ", having everything else non-integrated) that applies across version control systems and document-manipulators.

The currently practical solution is to avoid binary formats like the plague and put all your documents in CVS (or SVN or arch). This actually works really well, and would probably work even better if diff and patch understood ZIP files (such as .sxw).
Re:Version control would be nice as well by Anonymous Coward · 2004-09-06 08:38 · Score: 0

I have always thought that version control (file histories, branching and atomic changes) would be nice to have at the file system level.

Sounds like a job for an SVN plugin for Reiser4 file system. Anyone doing one already?

SVN has a webdav server, Linux has a webdav filesystem driver, so mounting a SVN rep via webdav on your Linux machine should do the trick.
Re:Version control would be nice as well by Anonymous Coward · 2004-09-06 09:48 · Score: 0

wow! yes i agree. i would love to see automated svn versioning and backup via file system.
Re:Version control would be nice as well by oz_ko · 2004-09-06 10:16 · Score: 1

Unfortunately the current SVN does not have complete support for WEBDAV. The next release will have enhanced WEBDAV support. Oz
Re:Version control would be nice as well by Anonymous Coward · 2004-09-06 11:19 · Score: 0

Not damned likely, since libsvn is incompatable with the GPL!!
Re:Version control would be nice as well by geirt · 2004-09-06 12:11 · Score: 1

There have been a real attemt to build a versioning file system for linux: SnapFS. Think about upgrading a system from one version to the next (eg.redhat7.3 to redhat 9). If you don't like the result, you just roll back to ver 7.3 again. SnapFS does not put a version on every file, it creates "snapshots" of the whole system, on demand. I think this is a very good tradeoff between having the overhead of versioning every write, and no versioning at all. You get the additional benefit of getting a snapshot of the complete system status instead of just one file.
More info

--

RFC1925
Re:Version control would be nice as well by jrutley · 2004-09-06 14:17 · Score: 1

From the FAQ
I use the save button as a super undo, don't throw it away.
You and 65% of America. But really, we have better methods for that. Like versioning systems. Developers used them for years, and a file system should support them natively.
Re:Version control would be nice as well by elodan · 2004-09-06 20:06 · Score: 1

Ditch Linux and use VMS. It's always done this. :-)

Interesting concept by BCW2 · 2004-09-06 03:21 · Score: 1

I like the looks of this and the way it can search the file system. Nice job! This would be a great way to keep track of multiple projects. Blows away the Winfs idea, I will try it out.

--
Professional Politicians are not the solution, they ARE the problem.

Re:Interesting concept by EddWo · 2004-09-06 04:00 · Score: 1

How does it blow away the WinFS idea? It basically is the WinFS idea. It stores metadata and file relationships in a database with a reference to the file location in the traditional hierachy.

WinFS does all that and more with automatic metadata promotion/demotion, synchronisation, and queries generated by virtual paths for legacy applications.

--
"Taligent is still pure vapor. Maybe they'll be the last who jumps up on Openstep... "
Re:Interesting concept by Anonymous Coward · 2004-09-06 04:18 · Score: 0

Well let me be the first to say that if this is even remotely similar to the big WinFS hoopla then MS is in really bad shape because all I can say is --yawn.
I really don't get this. I don't really care about bloat. But I mean if it's just about finding files it's not that big of a deal. By now I think most people have figured out ways to find their files using existing tools such as mnemonic devices and using directories in a reasonable way such as folders with dates and such. I mean if you've ever spent time looking for files that probably taught you to be more responsible about where you put them.
I don't really buy this scenrio about a person with a big media project losing their files and finding them with this kind of tool. I do big media projects with tons of audio, video, text, code all woven together and if I didn't make my directory structures carefully I would never be able to finish a job. So, a media pro probably doesn't really need a tool like this as they already do things such as backups that force them to be somewhat cognizant of what goes where.
That gives us the amateurs. But I think something like this just makes them lazy which is a bad idea. The key idea for this group is that amateur computer users need to understand what a path is and what a tree looks like and how it can help them organize their files. This is basic stuff that people should understand. Hiding it isn't really doing anybody a favor. Anyway, a path in a filesystem is not too different from a URL so even kids are exposed to the idea from very early.
So if pros don't need it and amateurs shouldn't depend on it, its doesn't seem to be such a big deal. I mean if people want it that's cool. But I don't see this as a make or break item.
Re:Interesting concept by M.+Baranczak · 2004-09-06 05:03 · Score: 1

WinFS does all that and more

I notice you use the present tense. I was under the impression that WinFS is still in the vaporware stage?
Re:Interesting concept by BCW2 · 2004-09-06 07:45 · Score: 1

Except that this is up and running, at least in a test mode and Winfs is strictly theoretical at this time. They already removed it from Lonhorn which is 2 years away. I'll bet this is debugged and running in the world before Winfs is in Beta (which is the release version for M$).

--
Professional Politicians are not the solution, they ARE the problem.
Re:Interesting concept by EddWo · 2004-09-06 09:09 · Score: 1

WinFS is up and running in test mode. Just install one of the Longhorn Alpha Previews.
It runs really slowly, uses up tons of resources but it's there and it works. It's going to be a long time coming, but it's way more than just theoretical.

--
"Taligent is still pure vapor. Maybe they'll be the last who jumps up on Openstep... "
Re:Interesting concept by BCW2 · 2004-09-06 11:56 · Score: 1

I wouldn't install an alpha from M$ on a bet. When finally released it will be late and bug filled as always. It's a matter of trust, or a complete lack therof.

Linux is for business, Windows is for games.

--
Professional Politicians are not the solution, they ARE the problem.
Re:Interesting concept by EddWo · 2004-09-06 12:20 · Score: 1

So install it in VMWare or Virtual PC or something. If you don't like it just wipe the disk image.

--
"Taligent is still pure vapor. Maybe they'll be the last who jumps up on Openstep... "
Re:Interesting concept by some+guy+I+know · 2004-09-07 03:33 · Score: 1

I was under the impression that WinFS is still in the vaporware stage?
It is.
A more accurate statement would be "WinFS will do all that and more when Duke Nukem Forever is released.".

--
Those who sacrifice security to condemn liberty deserve to repeat history or something. - Benjamin Santayana

Disadvantages by BHearsum · 2004-09-06 03:21 · Score: 4, Insightful

How much permforance overhead will this cause? The 'Desktop Environments' already eat a lot of RAM and CPU.
How much disk space will you lose over this? All the metadata has to be stored somewhere, and just glancing over the link I read something about a versioning system, which will definently take up quite a bit of space. Will a 20gb hard drive become 15gb with DBFS?

Re:Disadvantages by MemoryDragon · 2004-09-06 03:52 · Score: 1

Performance penalties, hard to tell, I assume for plain searches, the whole thing is faster. Read access might be the same because it goes plainly to file system level. Write access probably add a few miliseonds per file, to have the index updated, probably not if you have atomic commits on filesystem level and can do the indexing multithreaded. (reiser4 comes instantly to my mind)
Re:Disadvantages by aodl · 2004-09-06 04:03 · Score: 4, Insightful

While performance is something that should always be kept in mind, we are a long way away from the days of the original Macintosh where a desk accessory had to weigh in at 600 bytes in order to make the cut and fit into both memory and on a floppy disk. As current desktop machines outperform the high end servers of a few years ago, it would be nice to put a lot of that muscle to use in improving the user experience. I'm not excusing bloated and slow code here, but we don't really need to be counting bytes.

In any case, database based operating systems have been around for decades, from OS/400 to the BeOS. Many BeOS users claimed it was hands down faster than any other shipping OS at the time, and it featured a journaling, database-styled file system. One of the primary developers of that file system is now working at Apple on Mac OS X 10.4's spotlight functionality.

The thing is - as our desktop storage continues to grow at the pace that it does, and as we curiously find ways to fill it up, new ways of looking at and finding the information we store are going to be needed.

DBFS, Gnome Storage, Apple's Spotlight, and WinFS, all take different routes to get there. It's worth looking at all and what they offer and where they differ. WinFS, is a new storage layer that combines file system resources with more structured data in a Relational/XML hybrid system, with the aim (from what I gather) of turning the file system into a global "soup" of data. That sort of soup can be seen in office suites or PDA style applications, and in older Operating Systems like the Newton OS, where everything is a shared and available resource that is stored and available through common structures. Spotlight, on the other hand, combines file system searches and indexes (think 'locate') with full content indexes and a metadata index, which uses 'importers' to parse out other file formats. Spotlight is not a new file system, but an indexing system that acts on files in the file system. From what I remember of Gnome Storage, it is similar, using the VFS layer and Postgres triggers and callbacks, along with plug-ins, to parse and extract relevant metadata and contents out of files. DBFS looks to be like WinFS in that it purely wants to be a new kind of information store. I don't know which style will win out. My theory is that technologies like Spotlight will eventually evolve into a new kind of storage system, while remaining familiar and file based for todays users and developers. But this is an idea whose time has more than come. It's something that's been promised for the desktop for at least a decade, and has been shown to work, albeit in targeted OS's (the Newton) or ones that never achieved mass market penetration (BeOS).

So I think that performance concerns aren't that big of a concern, so long as (like all development) there are good people working on the solution.

i don't have time to reinvent the wheel today by non · 2004-09-06 03:22 · Score: 1, Flamebait

why don't the nice KDE people and the nice Gnome people work on developing a library that sits on top of this and then we can stop all the stupid name calling and use the right tool for the right job

--
...vividly encapsulates that post-Watergate/pre-punk/coked-up moment when you could trust no one, least of all yourself.

Re:i don't have time to reinvent the wheel today by LousyPhreak · 2004-09-06 03:47 · Score: 1

maybe because not everyone wants to use reiser4?

--
-- Karma: beyond good and evil - mostly affected by posting political
Re:i don't have time to reinvent the wheel today by JohnFluxx · 2004-09-06 03:52 · Score: 0, Flamebait

What an arrogant dick you are.
Re:i don't have time to reinvent the wheel today by uncommonlygood · 2004-09-06 05:16 · Score: 2, Insightful

why don't the nice KDE people and the nice Gnome people work on developing a library that sits on top of [reiser 4] and then we can stop all the stupid name calling and use the right tool for the right job
I wouldn't normally jump in except this is modded +4, even though the poster doesn't appear to have read the article.
The article doesn't talk about an actual "file system" (it admits its a bit of a misnomer itself), merely a way of referencing files with "keywords" in database that, according to the author, will make it easier to find your files. The author has written a file browser and file selector dialogs for KDE that use this, then goes on to say that he's planning to make a GNOME implementation soon, but nowhere does anyone start any name calling.
Since this article essentially discusses a GUI and not a file system (which could be implemented at kernel level), it would be a little silly to use the same tool for GNOME and KDE, since they have different look-and-feel. That said, a common library of functions needed by both the GNOME and KDE versions would obviously be sensible, and would speed up porting to other desktops.
Furthermore, this idea claims to work on top of existing heirarchical file systems (removing the idea of a hierarchical FS completely would involve a major restructuring of the whole operating system and everything that works on it), so just saying that it should use reiser 4, I suppose because its just l33t, is a little redundant. This should work on top of just about any FS, ext2, reiser, or even an NFS mount.
Re:i don't have time to reinvent the wheel today by Anonymous Coward · 2004-09-06 05:42 · Score: 1, Interesting

Hans Reiser has a lot of very ambitious ideas. But having ambitious ideas is easy; it's the execution that's difficult. Take a look at the recent threads on lkml discussing ReiserFS 4 (in particular, posts from Linus and Al Viro), and you'll see that while there's at least some interest in ReiserFS 4-like features, there's disappointingly little evidence that the ReiserFS people have given sufficient thought to some fundamental implementation problems.
Also note that while Hans Reiser may have talked about competing with WinFS and including more database-like features in the filesystem, and while ReiserFS may be designed to ease the future implementation of such features, as far as I know none of that is actually done.
Re:i don't have time to reinvent the wheel today by master_p · 2004-09-06 06:56 · Score: 1

I've carefully read the Namesys documentation and I am happy that finally someone reached the same conclusions as me. I have been saying the following for the past 2 years here on Slashdot, and I have been ridiculed, but now the great engineers of ReiserFS have reached to the same conclusions as me:

The 'filesystem' must be an object-oriented database system, where each node is an object, with specific data, and specific code that handles this data. Each object could contain other objects as children nodes. This is what Reiser4 does: each node in the system is an object (they call it plugin), with specific data and layout, containing other nodes.

Now, if the above-described system is mapped to a highlevel programming language that automates the process through attributes, the I/O problem would be solved once and for all. What I mean is by declaring objects and/or members 'persistent', the system already knows the type of plugin that should be instantiated to handle the data. References/pointers to other data will automatically be mapped to the Reiser node system...
Re:i don't have time to reinvent the wheel today by Anonymous Coward · 2004-09-06 08:55 · Score: 0

I've carefully read the Namesys documentation and I am happy that finally someone reached the same conclusions as me. I have been saying the following for the past 2 years here on Slashdot, and I have been ridiculed, but now the great engineers of ReiserFS have reached to the same conclusions as me:

Well, we're quite the misunderstood genius, aren't we?
It's easy to come up with these grandiose schemes:
The 'filesystem' must be an object-oriented database system, where each node is an object, with specific data, and specific code that handles this data. Each object could contain other objects as children nodes.

Now, how do you do provide atomic renames, without compromising SMP performance (no global filesystem lock, please!), without risking creating cycles? What are you going to do about hardlinks? How do you guarantee you're not going to break some 20-year old application that expects to be able to create files with any name, when suddenly some names ("meta") have become reserved, in contradiction to POSIX standards? Etc., etc.
And no, you can't just wave these away as minor details to be cleaned up after you've got a nearly finished implementation. If you can't answer these sorts of questions than you might as well not start--the whole project's dead in the water.
And no, the "great engineers of ReiserFS" don't appear to have worked out answers to these questions yet either, unfortunately.
Someone may yet get this figured out. But they're going to have to spend a lot more time thinking very hard about these kinds of gnarly details. It's really not people with Big Ideas that are missing here....
Re:i don't have time to reinvent the wheel today by master_p · 2004-09-06 22:10 · Score: 1

Well, we're quite the misunderstood genius, aren't we?

No, but I consider the idea very important, so I would like it to be considered by the big heads.

Now, how do you do provide atomic renames , without compromising SMP performance (no global filesystem lock, please!), without risking creating cycles?

In the same way that is done now.

What are you going to do about hardlinks?

Hard links are not needed. They were a hack anyway. Todays' organization schemes based on database principles is a much better way.

How do you guarantee you're not going to break some 20-year old application that expects to be able to create files with any name, when suddenly some names ("meta") have become reserved, in contradiction to POSIX standards?

Old apps can continue to work with what they've got. There is no need to change that. New apps though can be built around the new way of things.

RTFA by 1qa2ws3ed · 2004-09-06 03:23 · Score: 1

"The DBFS does not actually store files, it holds references to files on the underlying hierarchy based file system. The GUI part is implemented in KDE where it replaces all hierarchy based file accesses. This gives an impression that there is no hierarchy, but to applications nothing has changed, the open-file and save-file dialogs have the same APIs."

Re:RTFA by JakeThompson1 · 2004-09-06 03:25 · Score: 1

So... With KDE application, user gets pretty dialog with nice database-organized file system.

With GNOME (until this is implemented), or Motif application... they get the "hierarchy"...

And people thought the GTK Open/Save dialogs were bad before...

Reiserfs, storage and why do you want this? by ACK!! · 2004-09-06 03:25 · Score: 4, Interesting

Perhaps I am more organized than most but I already categorize my files and such in the hierachal file structure.

Isn't Rieserfs planning to do this on the kernel level?

Where does that leave other fs choices and storage and other idea dbfs?

I see more and more people saying look what neat things you can do with these tools.

But really why do you personally want something like this?

Curious to see the response is all.

--
ACK /ak/ interj. 2. [from the comic strip "Bloom County"] An exclamation of surprised disgust, esp. i

Re:Reiserfs, storage and why do you want this? by BenjyD · 2004-09-06 03:45 · Score: 3, Interesting

The problems with the hierachical system are:

- maintenance overhead on part of the user to create hierachy and maintain it. Every time you save a file you have to think "where do I put this?"
- Finding files can be hard. Is that letter about the planning application in Documents/Letters or Documents/Planning App?
- keeping files in two or more places at once is hard (as in the previous example). You can use softlinks, but that's hardly ideal and doesn't survive moving things around.

Basically, the current file system imposes a significant overhead. Most power users have restructured the way they work and use a computer in order to minimise that overhead without really noticing. It's just become one of those things you have to do, like remembering to save documents.

Why not shift the burden of organising the files onto the enormously powerful computer, rather than take up valuable human mental resources.
Re:Reiserfs, storage and why do you want this? by Aeiri · 2004-09-06 03:55 · Score: 2, Funny

- Finding files can be hard. Is that letter about the planning application in Documents/Letters or Documents/Planning App?

find ~/Documents | grep "planning application"

You've just taken a giant step into learning how to search for files in UNIX! In my next tutorial, I'll teach you the power of the "locate" command.
Re:Reiserfs, storage and why do you want this? by BenjyD · 2004-09-06 04:06 · Score: 1

And if I called the file planning_app.sxw? plan_app.sxw? "letter to council.sxw"? "Planning Application" wouldn't even match your example. Yes, a bunch of regexs would probably find it, but that's a fairly involved process just to find a file I only need for a few seconds to look up a name or something.

The point is, for most people the overhead of naming and organising files has become subconcious, and we have a bunch of tools to sort-of work around it. That doesn't mean an attempt to create a different system to remove the overhead completely isn't worthwhile.
Re:Reiserfs, storage and why do you want this? by TheRaven64 · 2004-09-06 04:12 · Score: 2, Insightful

I find that a spacial interface to a hierarchical storage layout makes it very easy when I want to find my files. This kind of thing is more useful when trying to find files you didn't create / save.

--
I am TheRaven on Soylent News
Re:Reiserfs, storage and why do you want this? by Aeiri · 2004-09-06 04:31 · Score: 1

You should know what you named the file, otherwise a database file system wouldn't help you anyway...

For your "Planning Application" example, just use the -i switch.

find ~/Documents | grep -i "planning application"

And for you other people that said it can't search through SXW files, etc, this was for searching for the filename, not through the document. SXW files have plaintext filenames last I checked.
Re:Reiserfs, storage and why do you want this? by BenjyD · 2004-09-06 04:44 · Score: 1

Isn't the whole point of this to use keywords and other meta data to search instead of the filename?
Re:Reiserfs, storage and why do you want this? by HermanAB · 2004-09-06 04:56 · Score: 1

Exactly - doesn't matter how you organize files - sooner or later you have to search for one and searching with ReiserFS is damn fast.
I guess the hoopla is mostly a Windows thing, where it is well nigh impossible to search for anything.
Essentially, in a hirarchical system you sometimes search for stuff, while in a DB system, you *always* search for stuff - big deal...

--
Oh well, what the hell...
Re:Reiserfs, storage and why do you want this? by Anonymous Coward · 2004-09-06 05:01 · Score: 0

I think you're missing the whole point...

People should be able to search for their files in the most intuitive way possible and shouldn't have to be required to have l33t UNIX haXoring skills to be able to find their files. Obviously they shouldn't have to drop to a command prompt either to be able to do this either...

Gotta love slashdot folks.
Re:Reiserfs, storage and why do you want this? by ctr2sprt · 2004-09-06 05:18 · Score: 4, Interesting

OK, let's consider an example other than documents. Joe is a big music fan with a couple hundred CDs. He likes having instant access to any one song when he wants it and has a lot of hard drive space, so he's MP3-ized (or ogg-ized or flac-ized or whatever) his entire collection, plus all the MP3s he's acquired via other means.
Joe is pretty good about organization, so he names every MP3 properly with the group, album, and track names, plus the track number. (Something like The Beatles/White Album/01-Back in the USSR.mp3.) This way if he knows, for example, the track name but not what album it's on, he can find it pretty quickly using a method like yours.
The problem is if he wants to do something like put all the country music he has, for example, in a playlist. How does he do that? It can be done, certainly, but if he has a collection with several thousand MP3s it's so tedious and difficult as to be effectively impossible. What if he wants to listen to 60s rock? What if he wants to find a particular song he has, but all he knows is it's between 3 and 5 minutes long, came out between 2002 and 2003, and is probably categorized as either "pop" or "alternative?" What if he just wants a list of all the songs he never listens to because he's sick of what he's been playing lately? Or maybe he needs to free up disk space and wants to find out what he'll miss the least.
All these things are impossible to do in an efficient and timely manner using our current system. He can certainly use a command-line ID3 tagger to strip out the things he cares about, something like
find mp3 -type f -print0 | xargs -0 id3tag -l | grep 'Genre: Country'
but that's painfully slow: a second or two per file means a big connection will take 15 minutes or longer to scan, and if you typed "Gerne" by mistake you have to do it all over.
Now if you had a filesystem-like object which could be smart and store ID3 metadata in the filesystem, then it would be much faster: the main overhead to using the find/xargs/id3tag/grep approach described above comes from having to seek through the MP3 file to get at the metadata. The reason this needs to be a "core OS component," perhaps even part of the kernel, is that MP3 tags can change at whim and the filesystem needs to know about it or its metadata can get out of sync. It's possible, but impractical, to update this on a periodic basis, like the locatedb; it makes much more sense to have the kernel inform some plugin "Hey, this file just changed, do you care?" And if the plugin does care, it can look at the changes, see if it's affected, and possibly update the metadata to match. It could also go the other way, where Joe updates the filesystem metadata and the OS knows to update the MP3's ID3 tags too.
Re:Reiserfs, storage and why do you want this? by Anonymous Coward · 2004-09-06 05:19 · Score: 0

- maintenance overhead on part of the user to create hierachy and maintain it. Every time you save a file you have to think "where do I put this?"

As opposed to what? "How do I best describe this file using keywords?". Nobody will ever choose sensible keywords/metadata.

- Finding files can be hard. Is that letter about the planning application in Documents/Letters or Documents/Planning App?

You're inventing a problem that doesn't exist. "slocate" solves your problem.

- keeping files in two or more places at once is hard (as in the previous example). You can use softlinks, but that's hardly ideal and doesn't survive moving things around.
This I'll grant you, but you could get much, much further by just having the file system attach unique IDs to all files and then having an indexing daemon or two update symlinks appropriately.

In short: It's a non-solution looking for a problem.
Re:Reiserfs, storage and why do you want this? by SlashDread · 2004-09-06 06:33 · Score: 1

I for one, would like faster searching. Google my filesystem, something like that. Personally.

"/Dread"
Re:Reiserfs, storage and why do you want this? by Anonymous Coward · 2004-09-06 07:42 · Score: 0

Maybe if "Joe" wasn't such a messy bastard, he would have put his Country songs into a folder called "Country" when he first downloaded them :)

Are you Joe?
Re:Reiserfs, storage and why do you want this? by johannesg · 2004-09-06 07:54 · Score: 1

While I would agree that many users couldn't organize themselves out of a wet cardboard box, I don't think a keyword system like this would rescue them from their plight. If they cannot be bothered (or are unable) to organize a good directory structure, what on earth makes you think they would be able to use a keyword system? Instead of "..../documents/letters/grandma" they would now have to specify something like . The same people who are unsufficiently organized for directories will also be unsufficiently organized for keywords.
Preemptively, I'd like to point out that I do not believe in an AI word processor that just happens to know you are writing a letter ("Hey! It looks like...") and magically fills in the required keyword fields. I seriously doubt it could come up with useful search information, and if I hadn't actually entered the search terms myself I don't think I'd ever be able to guess what search terms it had come up with for my letter.
All of this does not mean that I do not believe in meta-data, on the contrary. I think it will be massively useful, but only in the hands of power users (the same people who have no trouble organizing things in directories), only for specific types of data (such as MP3's, mail, etc.), and only in addition to directories rather than instead of.
Re:Reiserfs, storage and why do you want this? by Anonymous Coward · 2004-09-06 08:09 · Score: 0

Winamp and iTunes will do most of that for you. Database support using metadata in relavent files already exists in applications that need it. Why bother adding it at the FS level?
Re:Reiserfs, storage and why do you want this? by pchan- · 2004-09-06 08:57 · Score: 1

this is an excellent example. in fact, i used to do just this on my long gone beos system. oh no, not another beos nut raving like a lunatic, i'm sure is what you're thinking. i'll try to keep it short. most users will not see the value of this until they've had some time to use it. here's the case that i found useful: viewing a list of the most recent songs in my collection (which are spread over multiple directories). sure, [some app] will let me do the same thing, but will it also let me do the same thing for jpegs/movies/documents/[your favorite most used files here]? now, wouldn't it be great if EVERY APP on your system could do the exact same thing, without having to include special id3 parsing code or knowing anything about your xyz files?

another use, one which microsoft seems interested in, is using it for email filtering. bemail used to dump every email you got as a file into your mail directory. there was metadata for the from, to, subject, date fields. you could query your email. list by newest first, find mail sent by this person in this time period, whatever. and every app could do this, with a generic interface to the BeFS, without having to understand anything about email headers. this will effectively be microsoft's M: drive in the far future (or, so they claim). the same thing for contact info (be's People files used to have no content, only metadata).

i look forward to having this implemented in linux. truly a feature i've been missing.
Re:Reiserfs, storage and why do you want this? by kirkjobsluder · 2004-09-06 10:37 · Score: 1

- maintenance overhead on part of the user to create hierachy and maintain it. Every time you save a file you have to think "where do I put this?"

However, for a relational database filesystem to work, every time you save a file, you have to think "what keywords do I want on this file."

The fallacy in this argument is in believing that something magic will happen that will encourage people who have problems using descriptive heirachal directory names will suddenly use descriptive keywords.

- Finding files can be hard. Is that letter about the planning application in Documents/Letters or Documents/Planning App?

There are a fair number of applications that prove that full-text searching is entirely compatible with traditional heirachal file systems. (grep, glimpse and swish-e come immediately to mind.)

- keeping files in two or more places at once is hard (as in the previous example). You can use softlinks, but that's hardly ideal and doesn't survive moving things around.

Hardlinks are even better for this purpose. However, these problems don't magically go away with relational database systems. You replace the problem of having a file live in two or more places with the problem of maintaining the file in two or more categories.

Why not shift the burden of organising the files onto the enormously powerful computer, rather than take up valuable human mental resources.

Any form of contextual indexing that can be done with files located in a relational database, can be done on files in heirachal file system.
Re:Reiserfs, storage and why do you want this? by Tyreth · 2004-09-06 13:46 · Score: 1

Because other applications will almost certainly want these types of features too: movies, text documents, html files, pdf's, images, etc. So why have every application develop their own database and metadata? Make it standard and implement at a core level so that work isn't duplicated.
Re:Reiserfs, storage and why do you want this? by Anonymous Coward · 2004-09-06 15:41 · Score: 0

Preemptively, I'd like to point out that I do not believe in an AI word processor that just happens to know you are writing a letter ("Hey! It looks like...") and magically fills in the required keyword fields. I seriously doubt it could come up with useful search information, and if I hadn't actually entered the search terms myself I don't think I'd ever be able to guess what search terms it had come up with for my letter.

Well, some things are obvious. Your letter probably has starts with something like "Dear ", so that's already a piece of metadata you could search with.
Re:Reiserfs, storage and why do you want this? by Anonymous Coward · 2004-09-07 04:23 · Score: 0

Read the fisrt part of his post, you stupid cunt. He stores by Artist/Album/Song (e.g., "The Beatles/White Album/01-Back in the USSR.mp3"). He can't prefix genre to that path because because some artists may have crossover songs (e.g., "Rocky Racoon" (sp?) from the White Album can be considered sort of country, but the rest of the songs on the record aren't), and putting it anywhere else doesn't make much sense, either, for similar or other reasons.

And what about his other problems? Is he supposed to create a directory named "between 3 and 5 minutes long, came out between 2002 and 2003"? Of course not, you cum-swallowing pedophile.

software? by Alosja · 2004-09-06 03:28 · Score: 1, Interesting

might sligtly offtopic, but is there any open source software for windows that could do the same thing? I produce a lot of notes, and i want to be able to categorize my files.

--
A little stupidity is as unlikely as a little pregnancy

Re:software? by CanadianCrackPot · 2004-09-06 05:23 · Score: 1

Try naming the directories descriptively, such as rants, memos, etc. Oh wait Windoze user I ment file folders *groans*.

--
Good programmers drink beer to relieve job stress.
Great programmers drink hard liquor and work best hungover.

Comments by the Author by data1 · 2004-09-06 03:28 · Score: 5, Interesting

The author is asking for help to move the project to Gnome.

Quote:There is of-course the hard choice of platform. I choose KDE? because I am familiar with QT a bit, and because it is inherently object-oriented, being C++ and all. But in my mind GNOME? is much closer to how I would like a desktop system to function. So I would like to go for the GNOME option. I leave KDE developers to do what they want with this, and I am offering them support. But I would like to focus my efforts on GNOME and implementing the above in GNOME.

Any volunteers?
In the first place we will need developers. Would you like to join, send me an email (o.gorter@student.utwente.nl) with DBFS and JOIN somewhere in the subject. If you are not a developer, but still would like to help, please revisit this page in a few weeks. There will probably be a community website by then somewhere.

What happen to the OODB? by jeanicinq · 2004-09-06 03:31 · Score: 2, Insightful

The database file system originated from the ideas of an object-orientated database. Keywords and references are all part of the orientation objects of the database to index to files or other objects. It does away with the traditional hierarchal view, being rooted at some place. The OODB does not need to be rooted as it is more like a web. The DBFS seems to try to implement part of the concept of the OODB. Good. There are many more features an OODBFS can offer: dynamic organization, classification, and mutliple "skeletal" views to name a few. I hope that this DBFS will give a taste of what an OODBFS offers.

Re:Why bother.... by Anonymous Coward · 2004-09-06 03:36 · Score: 1, Funny

neither gnome nor kde are window managers, and I doubt berman would allow either to appear on enterprise today or tommorow.

Sigh, Andrew Morton seemed to be right... by greppling · 2004-09-06 03:37 · Score: 5, Interesting

...when he said on LKML, slightly paraphrased: "The only reason I see to put filesystem semantic enhancements into the kernel, is that it would be socially hard to get people to agree an a single userspace library."

(In the course of the heated discussion about Reiser4.)

Re:Sigh, Andrew Morton seemed to be right... by Spy+Hunter · 2004-09-06 08:38 · Score: 1

... and it would be practically hard to retrofit every single Linux program from GIMP to grep to support this new library. Why waste all that effort when enhancing the kernel could solve all the same problems much more easily, and in a backwards-compatible manner as well?

--
main(c,r){for(r=32;r;) printf(++c>31?c=!r--,"\n":c<r?" ":~c&r?" `":" #");}
Re:Sigh, Andrew Morton seemed to be right... by orasio · 2004-09-06 15:08 · Score: 1

I think that it would not be a good idea to change the way everyone manages files, just because of an upgrade. A kernel update seems more sensible, and doable.
Of course, there's the problem of having the FS DB in the kernel instead of userland, but maybe that could be solved by taking care of a bigger problem that few people are trying to fix, and that is the monolothic kernel. Maybe some effort should be put in a modular kernel, that lets us plug-in this kind of enhancements, without having to mess with the core of the kernel, but staying behind its frontiers, but that could be too hairy an issue.

Re:Why bother.... by justsomebody · 2004-09-06 03:37 · Score: 1

What a stupid thinking. Both things: Making it for KDE and porting it to Gnome.

In reality DBFS should only provide higher API besides standard one. And if desktop or application supports this API here's all of the functionality. If not plain FS access is all you get. In Gnome case gnome-vfs should detect and use DBFS, and in KDE probably the equivalent is KIOSlave.

Kernel should set some standard API for filesystems like DBFS and Reiser4 if not for other reason then to set starting access point for filesystems like these.

--
Signature Pro version 1.13.2-3 release 83.5 beta3try7 after-breakfast edition

SharePoint anyone? by Petronius · 2004-09-06 03:39 · Score: 2, Informative

While everybody is busy making fun of WinFS, Microsoft is very quietly and successfully letting their customers install SharePoint sites all over the place.
As usual, Xerox came up with the concept years ago (DocuShare). Sigh.

--
there's no place like ~

Re:SharePoint anyone? by Anonymous Coward · 2004-09-06 04:11 · Score: 2, Interesting

It feels like sharing files in the manner that SharePoint does is an afterthought. First, they built a web portal, then they decided to add that feature in, along with a few ActiveX controls. More important to SharePoint was the web portal component. Where I work, everyone has a SharePoint site for a default webpage. This allows for simple announcements and memos without choking down the email system. It's also fairly unobtrusive, which allows users to read said memos if/when they like, rather than forcing it into their inbox where it would inevitably get deleted.
Re:SharePoint anyone? by killjoe · 2004-09-06 06:12 · Score: 1

Our company installed sharepoint. Nobody seems to like it very much.

--
evil is as evil does

This is not a file system by MobyDisk · 2004-09-06 03:40 · Score: 4, Insightful

Maybe we could call it a "filing" system since it indexes files that are on another file system. Really, a file system IS a database, not an add-on that indexes files. Still, perhaps this is a better approach than trying to redo all the file-system internals. Although to be truly useful, this needs to be an API that is GUI-independent, with GUI-bindings as needed.

Re:This is not a file system by Baloo+Ursidae · 2004-09-06 11:12 · Score: 1

Like thrown into a mysql (or whatever) database, which would allow for different frontends right there.

--
Help us build a better map!

Stop letting goof ideas by DarkOx · 2004-09-06 03:41 · Score: 2, Interesting

Stop letting good ideas be victimized by the M$ marketing FUD machine. Someone said "database filesystem" That was a good idea. M$ can along and said gee lets steal that idea. Hey there is no existing implementation to copy how do we do it. The answer is they did not do it. They put a database to keep rather mundane information on top of NTFS(a real filesystem) and called it WINFS(NTFS and a almost completely unrealated database, not a file system). Keeping data on the relations ships between files is a nice idea. Putting it in user space is dumb. Its overhead. Look at most of the example he gave "find a word document I worked on last month". All that info is already in the filesystem. A filesystem really is already a database in the strictest since. It stores whats on which inode assoicated with how many blocks which you could think of as attributes. It also sores attibutes like permissions and dates. Why not just put some more attributes into it like subject and relatedtopicID . If you did that and then added the ability to maintain some other tables where you could put extended descriptions and stuff, and built up the query engine to be able to efficently solve queries users will likely ask then you'd have what your really looking for. Addionally you would lose the overhead to a degree because you'd be storing informaiton once instead of in the FS and in the database.

--
Repeal the 17th Amendment TODAY! Also Please Read http://www.gnu.org/philosophy/right-to-read.html

Re:Stop letting goof ideas by Anonymous Coward · 2004-09-06 04:24 · Score: 0

> Why not just put some more attributes into it like subject and relatedtopicID

This is essentially what NTFS does, but the software support is pretty thin (especially for searching). But software like MS Office does really store metadata (Subject, Author, etc) into the filesystem, and you can programmatically search for it using Index Server. I don't know why they've never put together a good UI for this - the tech is there.

WinFS was an attempt at solving a greater problem of accessing non-filesystem data. Things like Outlook and SQL databases, etc.

Of course, Reiser (etc) also now support the same idea, but it breaks POSIX, so who knows how good the user-space support will be.
Re:Stop letting goof ideas by Jonti · 2004-09-06 06:37 · Score: 1

there is no existing implementation to copy

Take a look at WinFS Vs GNOME Storage? Can you Say OS/400?. Here's a snippet to whet your appetite: ... it's worth bearing in mind that "Database Filesystems" are not a new idea - in fact, they've been in widespread use for many years. For a prime example, one need look no further than to IBM - their OS/400 operating system, which runs on the iSeries (previously known as the AS/400) minicomputers, features what can best be described as a "DB2 filesystem".
Re:Stop letting goof ideas by Henk+Poley · 2004-09-06 06:57 · Score: 1

All that info is already in the filesystem.

Problem is finding it quickly. An indexed database solves that. Though of course you could implement it directly in the filesystem.

But doing it in userspace is easier since you already have the filesystem as foundation to write on, and upgrading your kernel because some fileformat changes doesn't seem ideal to me. Remember you want it to know what you wrote, so it needs to be able to look into your documents.

edit by slimyrubber · 2004-09-06 03:41 · Score: 1

What I ment is.. DFS interface should come as filesystem tool (xfsprogs, e2fsprogs, reiser4progs etc) rather then depending on one desktop or window manager.

--
[ I can not bring myself to believe that if knowledge presents danger, the solution is ignorance ] -- Isaac Asimov

GNOME has Beagle (Re:"Implementing in GNOME") by otisg · 2004-09-06 03:42 · Score: 1

Read about Beagle here. I posted about this on Slashdot a few days ago.

--
Simpy

This looks like BLINKX plus more by lcsjk · 2004-09-06 03:43 · Score: 4, Interesting

I already use blinkx, beta, from http://www.blinkx.com/, to automatically search my files along with internet keywords. It doesn't have the search by date or extension and is not configurable to my liking, but it seems to do a good job of finding things I have misplaced. Integrated with the author's system, this could make a great search system.

Normally I file things in a hierarchial method by year and month and by project name (2004file/9sep/) or (2004file/workfile/projectname), but still I lose track now and then and need keywords. Change the "slash" slant to fit your OS.

AVFS - A Virtual File System by mooobo · 2004-09-06 03:43 · Score: 2, Interesting

Have a look at this for a userspace filesystem http://www.inf.bme.hu/~mszeredi/avfs/

There are a debian package called fuse-source and fuse-utils. I.e. use all these nice kio plugins on filesystem level.

Just add

deb http://www.kalyxo.org/debian/ unstable main
deb-src http://www.kalyxo.org/debian/ unstable main

to your /etc/apt/sources.list

apt-get update
apt-get fuse-source fuse-utils

Actually, I didn't test that yet. Someone else?

Re:AVFS - A Virtual File System by Anonymous Coward · 2004-09-06 06:48 · Score: 0

fuse-source and fuse-utils are also in Debian unstable itself. No need for external repos.

Another one... by tenco · 2004-09-06 03:47 · Score: 1

DoXfs

What About Code Bloat? by reporter · 2004-09-06 03:49 · Score: 3, Interesting

These days, operating systems like both Linux and Windows XP have too many bells and whistles that I simply do not need. Unfortunately, these bells and whistles drastically increase the amount of space that I need on my hard drive.

Adding a database layer to Gnome sounds like using another 300 megabytes of storage on my hard drive. I simply do not need the database.

If the FSF/GNU folks really want to do something revolutionary, they should fork Linux+Gnome into 2 distinct paths: minimalist and maximalist. The maximalist is what we have now. The minimalist is a minimally featured Linus+Gnome distribution. It is the bare minimum in functionality that we need to have a decent operating system and desktop.

Into this minimalist installation, I will then add the applications (e.g. MatLab) that I use daily.

Re:What About Code Bloat? by Anonymous Coward · 2004-09-06 04:05 · Score: 0

There are so many distros that exactly match what you're describing. How small do you want to go? DSL is quite a bit smaller than even the Win98 install. Works fine as a hard drive install. I have one right here it has Blender, Gimp, OpenOffice, Firefox, NX Client, XMMS, Xterms and almost nothing else. Okay, a few Xmame games but I wouldn't call that bloated and yet it has some very high power apps and it is fast even on an old PC.
Re:What About Code Bloat? by Anonymous Coward · 2004-09-06 04:27 · Score: 0

GNU Hippies already own the hardware dumpsters in this town.
Re:What About Code Bloat? by brennz · 2004-09-06 04:35 · Score: 1

Isn't OSS always about a choice?

You can do a minimal install of whatever OS you want. If you don't like it, don't install the DB layer.

apt-get install dbfs
Re:What About Code Bloat? by M.+Baranczak · 2004-09-06 04:43 · Score: 2, Informative

You don't need to do anything to Gnome. There are plenty of existing Linux desktop environments that sound like they'd work for you. Take a look at xfce. If you want it even more stripped down, then try BlackBox, IceWM or WindowMaker. Most distros include all of the above.

higher level of abstractions... by 3seas · 2004-09-06 03:49 · Score: 3, Informative

Keeping the traditional file system is only logical as this database sort of file access is in fact a higher level of abstraction.

Considering there are numerious project of such higher level of file access abstraction going on, it does become a secondary choice for the user if they want to use one of these higher abstraction level file access systems.

To remove the traditional file system altogether would be a mistake, as then it could become a system of babel or keywords.... "what was I thinking when I created that keyword and lets not even get into what crazy joe was thining when he came up with his keywords...

But hey, given how MS based developers would create some obscure dll name and place it in some obscure location in order to copy protect .....

higher level abstractions are useful only to the point that you can, if need be, drop down in abstraction level to get your bearings as to where you are. If you cannot touch physical reality then how do you know you are not floating around aimlessly?

being out of touch with physical reality can evenm be very dangerious and hard to correct.

Not a new thing... by lei7 · 2004-09-06 03:51 · Score: 1

The idea of a database-driven file system is nothing new.. try googling for mysqlfs

Shit.... by Alwin+Henseler · 2004-09-06 03:59 · Score: 1, Offtopic

Link points to ...utwente.nl (University in Enschede, the Netherlands).

My internet connection uses their network, so it may become sluggish for some time, while the /.ing is in effect...

standard filesystems are NOT databases by lkcl · 2004-09-06 03:59 · Score: 3, Interesting

standard filesystems only have ONE index - a hierarchical one that contains a certain amount of real-time-updated indexing (such as the timestamps on a directory)

but it is NOT a relational database: you CANNOT easily create or use an alternative index to your files.

that's what all the fuss is about.

some people mentioned here that they already organise their files. great. fantastic.

HOW LONG DID IT TAKE YOU?

and how long would it take to reorganise?

with a relational database, all your indexes are updated AUTOMATICALLY.

therefore, doing searches on a relational database filesystem (find me all music files with dates between last week and last month: SELECT * from files WHERE files.type = "music" and files.date NOW() - 7days

you _can't_ do that sort of thing on a traditional filesystem.

sure, you can emulate it by creating symbolic links all over the place, but what happens when a file is deleted or moved? you need to manage / relocate the symbolic links...

nah.

databases.

fantastic idea.

now can we have them as a kernel module, pleeease?

Re:standard filesystems are NOT databases by mgkimsal2 · 2004-09-06 04:08 · Score: 2, Interesting

therefore, doing searches on a relational database filesystem (find me all music files with dates between last week and last month: SELECT * from files WHERE files.type = "music" and files.date NOW() - 7days

you _can't_ do that sort of thing on a traditional filesystem.

Argh! Don't say things like that - someone will throw down a shell script which WILL do it (probably in one line) combining find, file, grep, perl/python and some other crap. Which, while it may work for some, entirely misses the real point, is that there's no *easy* way to do this. We've got GUI bindings for opening/saving/editing files, but nothing for harnessing the power of searching. Don't say it can't be done (because it can somehow) just say that it can't be done easily, which is the bigger point. :)

--
creation science book
Re:standard filesystems are NOT databases by pandrijeczko · 2004-09-06 04:08 · Score: 1

therefore, doing searches on a relational database filesystem (find me all music files with dates between last week and last month: SELECT * from files WHERE files.type = "music" and files.date NOW() - 7days
you _can't_ do that sort of thing on a traditional filesystem.
This is honestly where I have a problem with the concept of database filesystems and don't understand all the fuss over them.
I can do exactly what you mention above (and more) using a user utility like "find" (in UNIX/Linux).
As far as I'm concerned, if a file-system always knows the names, dates, sizes and locations (on the hard disk) of all my files and does it's best to avoid corruption, what more can I ask of it?

--
Gentoo Linux - another day, another USE flag.
Re:standard filesystems are NOT databases by Anonymous Coward · 2004-09-06 05:25 · Score: 1, Insightful

The problem with find is that it doesn't scale because it has to do a sequential search. locate is faster than find, but the index is not always up to date. A database could keep various indices always up to date.
Re:standard filesystems are NOT databases by Tablizer · 2004-09-06 07:47 · Score: 1

We need to be careful about database definitions here. There are generally 3 different "kinds" of databases:

* Hierarchical
* Network
* Relational

The existing file systems are indeed a "database", just not a relational database, but a hierarchical one. If you add cross-links between the "children" in the tree, you start to have a "network" database, which is basically a big graph (web) of nodes with pointers all over the place. Combination hierarchical and network DB's are sometimes called "navigational", because of the way one has to "surf pointers" to navigate them.

In theory a navigational database can represent anything a relational database can, but most generally find relational a more disciplined approach. Relational is governed by rules that navigational lacks. I compare navigational to shanty towns: anything (any links) goes. But relational organization is governed by the requirements for unique primary keys and each "node" belonging to a table, among others. By following these rules the results, indexing techniques, and query techniques are more predictable and consistent from designer to designer.

Most complaints against relational are things that are vendor-specific faults, and not an inherent fault of relational. As far as taking up more machine resources, perhaps that is true, but higher abstractions often require more horse-power. We can't live in the 70's forever guys. Moore is your friend (I meant Intel's Moore, not necessarily Michael Moore.)

--
Table-ized A.I.
Re:standard filesystems are NOT databases by otis+wildflower · 2004-09-06 10:14 · Score: 1

Argh! Don't say things like that - someone will throw down a shell script which WILL do it (probably in one line) combining find, file, grep, perl/python and some other crap. Which, while it may work for some, entirely misses the real point, is that there's no *easy* way to do this.

Easy or not, it's the performance that counts. Try doing something like this with traditional 'find' traversing the file tree. FOREVER.

The thing is, having multiple indexes and stuff flabs out the storage requirement: hierarchical DBs are more space-efficient.

WHO CARES ABOUT SPACE EFFICIENCY ANYMORE?

At this point, unless you're designing smartphones, PDAs or other embedded-style applications, you can blissfully ignore any storage efficiency requirement for the sake of better speed, robustness, stability, functionality, etc.
Re:standard filesystems are NOT databases by kirkjobsluder · 2004-09-06 10:16 · Score: 1

therefore, doing searches on a relational database filesystem (find me all music files with dates between last week and last month: SELECT * from files WHERE files.type = "music" and files.date NOW() - 7days

you _can't_ do that sort of thing on a traditional filesystem.

Ahem:

find . -name *.oog -mtime -7

That's just for the bad SQL you posted (I imagine there must be a missing operator in the where clause between "files.date" and "NOW()".)

A better example of where indexing would be would be useful is a case where you want to index the content of files. For example "I want a list of all papers authored by me where I've referenced Doe, Doe, and Doe's seminal paper 'Mental Masturbation About the Need for Relational Database Filesystems on Slashdot'". But even then, this is a type of indexing that can be easily accomodated within a heirarchal database. (For two examples, glimpse and swish-e come immediately to mind.)

To answer your two questions:

some people mentioned here that they already organise their files. great. fantastic.

HOW LONG DID IT TAKE YOU?

A pretty trivial amount of time. Each new project starts with:
mkdir newproject cd newproject
A maximum of 1 minute, even with my gimpy hands. This is about the same amount of time, and perhaps much less, that I would have to spend adding the keyword "newproject" to every file that gets added to the project.

and how long would it take to reorganise?

Well, it depends on the reorganization. But for example, I reorganized one of my key project directories (version controlled in subversion) with.
svn mv proposal-may2004 proposal1 svn mv proposal-aug2004 proposal2 svn cp proposal2 proposal svn commit
(Making two archive "tags" and starting a new branch for the "HEAD" tag.)

A nice thing about heirarchal file systems is that you can reorganize on multiple levels including collections of files. ("proposal" in my case includes about a half-dozen files.) I would argue that it would take me just as much time to recode those files using database keywords as renaming the directories they reside in.

A large part of the argument for relational database filesystems seems to be that the same types of people who are unwilling to do the work necessary to create good heirarchal file trees, will be willing to do the work to attach the metadata needed to replace heirarchal file trees. Switching from a heirachal to relational model is not going to change the GIGO rule, and the solutions around the GIGO problem (such as full-text searching, and journaling) don't depend on either model.

A heirachal model also provides some nice facilities for dealing with related collections of files. It is unclear no me how they would be implemented in a relational model, or how a relational model would deal with stardard filenames like "README", "CHANGELOG" and "index.html".
Re:standard filesystems are NOT databases by kirkjobsluder · 2004-09-06 11:00 · Score: 1

Argh! Don't say things like that - someone will throw down a shell script which WILL do it (probably in one line) combining find, file, grep, perl/python and some other crap.

In this case, all you need is find.

Which, while it may work for some, entirely misses the real point, is that there's no *easy* way to do this.

What is hard about?:
find . -name "*.oog" -mtime -7
It actually seems to be easier given that the expression requires fewer keystrokes, and would return a correct result rather than an SQL error.

We've got GUI bindings for opening/saving/editing files, but nothing for harnessing the power of searching. Don't say it can't be done (because it can somehow) just say that it can't be done easily, which is the bigger point. :)

The need for GUI bindings for searching files is a very different thing from changing the underlying file system. By all means, yes, lets create easy tools that permit users to conduct full-text and metatag searches of file systems.

But, and here is the big issue, there is nothing inherent in heirachal file systems that make full-text searching or keyword searching difficult.
Re:standard filesystems are NOT databases by juhaz · 2004-09-06 12:05 · Score: 1

find . -name "*.oog" -mtime -7
It actually seems to be easier given that the expression requires fewer keystrokes, and would return a correct result rather than an SQL error.

Well, if you want to go pedantic about typos, no, it wouldn't return anything because the file extension is .ogg.

Do that in the root directory and it'll take ten minutes too, whether it returns anything or not, whereas this kind of db thing would be nigh instant (think locate, only with all the filters you can use with find).
Re:standard filesystems are NOT databases by kirkjobsluder · 2004-09-06 12:49 · Score: 1

Well, if you want to go pedantic about typos, no, it wouldn't return anything because the file extension is .ogg.

True, however the point had very little to do with typos. The point was that it was claimed that seaching for files by date was first not possible using hierarchal file systems, and then not easy. My one-line example shows that not only is it possible with existing hierarchal file systems, but it is perhaps even easier than composing an sql query.

Do that in the root directory and it'll take ten minutes too, whether it returns anything or not, whereas this kind of db thing would be nigh instant (think locate, only with all the filters you can use with find).

Perhaps a bit of a stupid question here, why would the search be directed from the root directory? That is, why wouldn't the person conducting the search limit the search to a subset of the database, rather than search through every record of the disk? An advantage of a hierarchal file system is that it is hierarchal. It is easy to limit an operation to nesting subsets of criteria.

Now of course, you bring up a valid point in that from a user's point of view, find runs slowly. However, the problems with find running slowly have nothing to do with hierarchal file systems, and everything to do with file indexes. There is nothing inherent in the hierachal file system that makes fulltext indexes, date indexes, or keyword indexes difficult.

Read the article, read some history by eschasi · 2004-09-06 04:00 · Score: 4, Interesting

To quote from the article (which most folks have not read, as usual):

The DBFS does not actually store files, it holds references to files on the underlying hierarchy based file system.

That line alone should answer many of the questions re backup, speed of FS performance, etc.

At a deeper technical level, nany of the questions asked here have historical answers or clues in The Design and Implementation of the Inversions File System. The abstract reads:

This paper describes the design, implementation, and performance of the Inversion file system. Inversion provides a rich set of services to file system users, and manages a large tertiary data store. Inversion is built on top of the POSTGRES database system, and takes advantage of low-level DBMS services to provide transaction protection, fine-grained time travel, and fast crash recovery for user files and file system metadata. Inversion gets between 30% and 80% of the throughput of ULTRIX NFS backed by a non-volatile RAM cache. In addition, Inversion allows users to provide code for execution directly in the file system manager, yielding performance as much as seven times better than that of ULTRIX NFS.

Note that this paper was published in early 1993. Many of the issues it addresses are relevant to DBFS, and many of DBFS's advantages are foreseen by that paper. IMHO DBFS has chosen a direction that should have better performance than inversion, not to mention lower risk and easier failure recovery.

Inversion was built on POSTGRES, which makes one wonder what happened to the source.

blah blah blah by Anonymous Coward · 2004-09-06 04:01 · Score: 2, Insightful

Sorry, but I have to say it, you are an idiot!
Did you even care to RTFA?
This has nothing to do with the gnome or kde devs. Some developer invested his time to come up with something he thought was useful and all you can do is complain?

And if you look at the project, it is something completley different then a plugin for the admittedly great ReiserFS4 and it is here and usable right now.

So friggin stop your stupid whinig.

And to the mods who modded parent interesting ...

Btw., why don't you whine to the Reiser people that they should stop developing now? It would be as justified as your whining aboutt this project.

Re:blah blah blah by Anonymous Coward · 2004-09-06 04:30 · Score: 0

I think he's "whining" about the guy who wrote the original story saying "hey, they should port it to gnome so we don't have to use KDE" crap straight from the "KDE is too much like windows" camp.

Hell.. if people really were interested in the future of desktops, they should look at ION..

~ion user (with KDE apps :-D)

Sweet - now do it with DOC, PDF and JPG files by mgkimsal2 · 2004-09-06 04:04 · Score: 1

"grep" - why didn't anyone else on the planet think of that one? That's fantastic. Now I just HOPE to goodness I've always managed to always save everything under ~/Documents, never anywhere else, and then make sure I'm only searching for ASCII data inside any files. I'm all set! Thanks. You've made it much easier for me to find all my pictures and audio files related to various topics.

--
creation science book

Re:Sweet - now do it with DOC, PDF and JPG files by chill · 2004-09-06 04:08 · Score: 1

Meta data. All three formats you mentioned support embedded meta data.

And YES, I would expect people to actually remember to enter proper meta data for their documents. If not, then they don't get the benefits of this type of system.

--
Learning HOW to think is more important than learning WHAT to think.
Re:Sweet - now do it with DOC, PDF and JPG files by mgkimsal2 · 2004-09-06 04:13 · Score: 1

I'm not sure grep would always find the metadata in those - maybe it would. As another person pointed out, SXW files wouldn't easily be searchable (haven't looked at metadata for those). Assume that though all those file formats *do* have metadata available, wouldn't having a standard way to search and filter all those various metadata formats in one file dialog be a good thing? If this (or another) project would do that, I'd be all for it. And for those file formats that don't support metadata, this system would allow for the creation of metadata for those files. :)

--
creation science book
Re:Sweet - now do it with DOC, PDF and JPG files by yuri+benjamin · 2004-09-06 18:09 · Score: 1

And YES, I would expect people to actually remember to enter proper meta data for their documents.

The only way to enforce this is in the "Save" dialog.
I'm thinking of knowledge workers, not home users. Unfortunately, in most document management systems I've seen, knowledge "workers" are often too lazy, and just place a dot "." in the mandatory fields in order to get past those "annoying" dialogues.

--
You make the mistake of thinking you can educate the fundamental stupidity out of people. You can't.

Don't hire him! by Doc+Ruby · 2004-09-06 04:06 · Score: 1

"I am currently looking for a job. Interested in employing me? Drop me an email."

Ozy has worked on this in his time available as a student. If he gets a job doing something different, he might drop DBFS, and it might die a lonely orphan. Email him only with DBFS job offers, please!

--

--
make install -not war

Re:Don't hire him! by Anonymous Coward · 2004-09-06 06:41 · Score: 0

Fucking Selfish Bastard !
The man's gota live. If you are so much interested in the project, fund it.
Re:Don't hire him! by Doc+Ruby · 2004-09-06 07:23 · Score: 1

Fuck you, Anonymous fucking Coward. Why do you think I linked my post to his email address? Reactionary moron.

--
--
make install -not war

whatever happened to the google desktop search? by GuyFawkes · 2004-09-06 04:07 · Score: 1

The smallish (windows) app that you downloaded and in made an index of everything on your PC?

(or was it from altavista?)

--
http://slashdot.org/~GuyFawkes/journal

Re:whatever happened to the google desktop search? by yuriismaster · 2004-09-06 04:33 · Score: 0

The "Google Desktop Search" you speak of is a small app that allowed you to google the web without opening your browser.

Of course it just opened a small IE window above the deskbar to view the results. Ate unnessecary memory.
Re:whatever happened to the google desktop search? by Anonymous Coward · 2004-09-06 05:51 · Score: 0

Copernic Desktop Tracker

Easily search your entire hard drive in less than a second to pinpoint the right file, e-mail, music or pictures.
CDS brings the power of a sophisticated, yet easy-to-use search engine right to your PC and allows you instantly to search files, e-mails, and email attachments stored anywhere on your PC hard drive. It executes sub-second searching of Microsoft Word, Excel, and PowerPoint files, Acrobat PDFs, and all popular music, picture and video formats. CDS also searches your browser history, favorites, and contacts.
(Back to Top)

But no Ogg files (check out the FAQ for file types)

Dear ./ crowd by Anonymous Coward · 2004-09-06 04:09 · Score: 0

After reading through the comments to this story I have to say I'm sincerely disgusted of you once again.

About 90% of the posters don't even care to read the article.
Now this fact of course doesn't stop you from bitching and whining, on the contrary, it's so much easier if you don't have a clue about what you are talking about.

So as in any story that reports about something new about half the posts consists of "I don't need this, it suxors!!11!!one!", while the other half is busy telling everyone that yet an other project that does the same is hardly needed. Of course the project does something no other project does, but hey, who am I to actually read the article?
I mean I can remember there being other stories about database and filesystem, so hey, this project must be redundant. And if it isn't, who cares, at least I'll be modded insightful by the always competent /. mod crew.

Disgusted as always,
AC

I can't see the gain. by The+Mighty+Git · 2004-09-06 04:09 · Score: 2, Interesting

A db fs with rich searching of metadata requires the orderly and accurate entry of said metadata.
You can't get organisation out of nothing - you are just asking people to be organised in a slightly different manner.

An organised person can already work effectively in a filesystem with current tools. The fact that they are organised is the key.

A disorganised person is not helped as their metadata will be erratic or absent. In fact might they now have the capability to be even more disorganised?

As I see it this is not solving an organisational problem as much as shifting the interface to the problem. I do not believe it to be either an easier or better way to organise personal data. Conversely I do not believe it to be inherently worse either. It's just different.

Where's the gain?

Re:I can't see the gain. by Anonymous Coward · 2004-09-06 04:17 · Score: 0

"A db fs with rich searching of metadata requires the orderly and accurate entry of said metadata."

Argh! Again! I'm getting a headache.
Did it ever occur to you, that applications can produce this metadata and actually do produce this kind of metadata even today?

Don't you guys have mp3s?
Re:I can't see the gain. by The+Mighty+Git · 2004-09-06 04:18 · Score: 1

That's not metadata - that's data embedded in the file. I already have tools to deal with those and they are more than adequate.

Where's the gain?
Re:I can't see the gain. by eschasi · 2004-09-06 07:26 · Score: 1

Speaking as a person who can find an arbitrary piece of email from the 1980s in minutes if not seconds, I'd be the last to disagree with the idea that an organized person can produce an organized filestore. That said, tho . . .
The gain is that a great deal of organization can be obtained from automated methods. Something as simple as a scan of the file contents with a find is a help -- especially for end users who wouldn't know how to run a command-line if it bit them.
But there's far more you can do. Certian classes of files have a great deal of meta-data that can easily be extraced and used in searches. This meta-data is sometimes at odds with the normal FS metadata, and can be more useful. Sticking with email as an example, you have fields like From:, Date: (which may be radically different from file creation/mod date), Subject:, etc. Tracking things like date of file to a folder can be done in far more useful ways than the metadata now seen in file systems.
Or take mailed attachments of documents, spreadsheets, etc. It looks to me like DBFS could be used to automate many indexing processes that are currently difficult if not impossible (who sent it, when, what dates did I modify it, etc).
It should also be possible to make a versioning file system out of it.
Does DBFS do that now? I dunno, I've not downloaded the code yet. But it ought to let us build browsers on steriods that can maintain far more useful state than the average naive user can manage. It should also let power users become even more powerful.

Apple's Spotlight by DuckWing · 2004-09-06 04:19 · Score: 4, Informative

For those of you that have not yet looked at the Mac OS X (Tiger) preview and WWDC web cast, the new spotlight technology built into the next version of Mac OS X looks very much like a fully integrated database file system. And it's incredibly fast. Go check it out!. Note: QuickTime required. Mplayer may work for us Linux heads but I haven't tried it.

--
-- DuckWing

Re:Apple's Spotlight by douthat · 2004-09-06 10:11 · Score: 1

For people who don't like to wait, fast forward to 42 minutes to see the spotlight section.

--
She loves me: 09F911029D74E35BD84156C5635688C0 She loves me not: 09F911029D74E35BD84156C5635688BF ...

Any innovation is good by Stevyn · 2004-09-06 04:19 · Score: 3, Insightful

People can offer their opinions for or against this, but I think that any innovation benefits linux. I've read about WinFS and it sounds like a good idea, but who knows when it will be ready. If people working in their spare time can get something like this working in linux before Microsoft can get it out, I think that would just be another reason to trust the open source model of developing code and squash Ballmer's FUD.

I don't have too much trouble using a hierarchy file system. I keep my stuff pretty organized, but computers are supposed to save time, not create more problems. If this database can do a good job, I'll give it a shot.

why not .... by butlerdi · 2004-09-06 04:21 · Score: 0, Flamebait

Just give every doc a url ? A bit more restful

--
"If the King's English was good enough for Jesus, it's good enough for me!" -- "Ma" Ferguson, Governor of Texas (circa

Huh? by LuSiDe · 2004-09-06 04:21 · Score: 1

From the FAQ

"KDE, GNOME, make up your mind.

Choice is a good thing. Myself I use Mac OS X daily and love it (much to the irritation of my friends and family). I am not against KDE, or GNOME. Actually the DBFS has two parts in it, a low level part, which can be shared by KDE and GNOME (or Mac OS X or Windows XP) and a GUI part. The DBFS cannot do without a graphical display, and I have to choose a platform. GNOME seems to go the route of instant apply and simplicity for the user, which is more inline to my own ideas. That is why I now want to focus on GNOME."

--
WE DON'T NEED NO BLOG CONTROL.

storage baggage liquidation by Doc+Ruby · 2004-09-06 04:24 · Score: 1

DBFS is a much more accurate model of our stored data, and how we use it, than the hierarchial databases we've used to bootstrap the world into using personal computers. But it's still really a prototype, proof of concept, because its data server calls the hierachical filesystem API of the Linux filesystem model, on which it runs. Underneath DBFS, and above the disk (or other storage media) driver, is an inode database, which is hierarchial. A giant improvement in efficiencies, whether speed, space, or complexity, would come from implementing a SQL-to-inode engine directly, without the intervening layer of even something good like a ReiserFS or something well supported like ext2/3.

Linux's VFS system for modular filesystem installation is a good package. Packaging a DBFS in a VFS API, which operates directly on inodes, while replying to calls from a storage server like DBFS, SAMBA, NFS, or all of them, gives a powerful storage layer that interoperates with current apps, while opening the future for new apps that can use the relations among the data, as well as the meta-/data. A SQL API, as well as familiar data object references, would make such a system complete. Then work could begin on factoring file browsers, file access dialogs, email GUIs, and every other app/feature that will be revolutionized by dragging data out of the 1960s hierachial file cabinet, into the 21st Century of networked relational multimedia objects. Let's get cracking!

--

--
make install -not war

Here's How I Have Done It by Anonymous Coward · 2004-09-06 04:29 · Score: 0

When I got my first PC in 1990, I was faced with a similar problem. I first put all my data (I'm a writer and I used Word Perfect 5.1) in hierarchical directories descriptive of the project., since WP supported long filenames and the ability to search the long names.

I did, however, use the DOS .3 extensions to "keycode" my files: finished edits ended with my initials; works-in-progress were *.raw and partially-done files ended in *.edt

Later I added *.fnl for dead files and *.prt for formatted files that have been or are ready for printing.

Soon I found the limitations to WP's scheme and found XTree-Gold, a DOS tree-based filemanager similar to Norton Commander, but more powerful in that it has a global searchable list function by filename, too, and can be made to sort by date.

So what can any of these new DB filesystems do that XTree-Gold for DOS or its clone ytree (http://www.han.de/~werner/ytree.html) for Linux/Unix can't, given the abillity to use long, descriptive filenames on ordinary fss?

Lessons to be learned from email systems... by MadMorf · 2004-09-06 04:47 · Score: 1

A large part of proprietary email systems (GroupWise and Exchange in particular) is the quick search and retrieval of messages (files).

Email systems do this by storing pointers to messages (files), along with pertinent data (subject, received/sent date, etc.) in the DB. GroupWise also stores small (2kb) messages inside the DB, but IMHO, storing the files themselves in the DB is probably a bad idea.
The messages (files) are still accessable in the event of DB corruption and the DB can be rebuilt by scanning the files.

I don't see any reason the whole FS couldn't work this way...

--
Goofy, Geeky Gifts and More!

from the ... department by Anonymous Coward · 2004-09-06 04:49 · Score: 0

Am I the only one to notice that this article didn't have a from the x-or-y department line?

too stubborn by ciroknight · 2004-09-06 04:53 · Score: 1

Here's the facts:
Database filesystems REQUIRE traditional filesystems to write on top of (unless the SQL server is implemented IN the kernel, which everyone (and I) agree is too much bloat). So, for DESKTOP machines, and STORAGE SERVERS, this technology would rock. It improves the ability for a user to find his/her files effeciently.

Meanwhile, for mission critical systems, for the underlying systems, for EVERYTHING ELSE, they will continue using ordinary hierarchy file systems like Reiser.

--
"Victory means exit strategy, and it's important for the President to explain to us what the exit strategy is." G.W.Bush

Graphical display a must? by Anonymous Coward · 2004-09-06 04:56 · Score: 0

You need a graphical display? Wow, great, send to trash automatic systems. Or blind people.

please! not KDE vs. GNOME again by whovian · 2004-09-06 05:09 · Score: 2, Interesting

This basically seems like giving find the capability to do file and then storing the results with locate. And add some time stamping, sorting capabilities and GUI.

Since all but the GUI are basic commands, it would seem sensible to have an underlying library with hooks for use by your choice of desktop manager.

--
To-do List: Receive telemarketing call during a tornado warning. Check.

Can we? Can we PLEASE? Huh? Can we? by solios · 2004-09-06 05:12 · Score: 1, Interesting

Using a linux desktop is like using Win3.1 or 95 hyped up on really lame raver drugs. I've always found it sad and extremely frustrating that a venue with MASSIVE potentiall for innovation has instead been spending its time reimplimenting Windows Explorer. :|

I'm down with anything that makes the linux desktop experience a real linux desktop experience- not a pissass wannabe win95 experience or a solaris experience. There's so much cool stuff going on under the hood... but the thin candy shell feels like GPX or Coby* slapped onto BMW internals. I know it's possible to do better.... but after years of seeing Windows and MacOS features reimplimented (not nearly as well in most cases) in linux, I'm starting to lose faith. :-(

*GPX and Coby make shit electronics that are cheap and break very, very easily. Coby specifically, in that their products (and LOGO!) are modeled after Sony... and the best they can do is to come across as a cheap ripoff.

Hurd has similar facilities by SWroclawski · 2004-09-06 05:14 · Score: 1

One of the greatest features of GNU/Hurd is the idea of translators. By getting rid of the traditional file system model, you can make many interfaces to systems that "look" like filesystems, and such keyword based indexing/browsing/searching could be impelemented at the user level *as a filesystem*, rather than as a set of calls on top of a traditional filesystem requiring the application to "know" how to use it.

The Hurd documents talk about everything from an "ftp filesystem" to ways to rewrite X. I can imagine an IM program where you could echo and cat to friends like:

echo "Hi there" >> ~/IM/jabber/friend@jabberserver.org

When you think of the filesystem as a set of interfaces rather than a set of files, the possibilities are endless, including the one presented in the project.

It's sad that Hurd hasn't done better in making a working system, and it's also sad that Linux hasn't taken this "Best of Breed" idea.

Re:Hurd has similar facilities by SWroclawski · 2004-09-06 05:16 · Score: 1

Your topic is a bit misleading. Hurd has the possibility to make similar features, but it doesn't as of now..

Not Unix by flossie · 2004-09-06 05:20 · Score: 5, Interesting

the DBFS does not store system files: No shared libraries, no font files or others like that. These are not documents, not files you look up at a day to day basis, and have no place in a file system.

Whether or not you look at system files every day probably depends on what you are doing with your machine and what you consider "system files" to be. Moreover, this idea would seem to go entirely against the whole UNIX "everything is a file" philosophy which is supposed to be one of the great strengths of UNIX.

--
flossie

Write now. Defend liberty

Re:Not Unix by Anonymous Coward · 2004-09-06 05:43 · Score: 0

He's not really talking about a file system, he's talking about his (not actually a) file system. What he means is that these files have no place in a database layer that sits atop the filesystem. Of course fonts and shared libraries have their place in a file system; where else would they go?

Document repository by Anonymous Coward · 2004-09-06 05:22 · Score: 0

It's more a document repository than a filesystem. But sure, it's a good idea. :)

reluctance to change by erroneus · 2004-09-06 05:23 · Score: 1

Reluctance to change is a pretty common public reaction to just about anything especially when it's about something that people don't understand.

Think about something as simple as USB technology. It was a great idea from the beginning, but we were all so used to "parallel for printers, serial for modems" mentality that we couldn't see into how useful and universal it could become. But how about today? Just about anything new can be given a USB interface.

Now I'm reading about all kinds of reservations about changing the way files are accessed.... or even stored. A database filesystem? Now we're playing with our comfortable concept of "how things are done" again. When will we every stop shaking things up like this!? Damnit!!!

Actually, consider this. People are damned afraid of it coming from Microsoft. With MS, it's pretty much an "all one way" kind of thing. Once Microsoft's "viral infection" starts to proliferate and this early adopter causes the next company to upgrade and the next and the next, there's no stopping it.

Now let's look at open source. If an idea doesn't work, you can abandon it with little to no investment lost. Get another computer to run the newest experiments in systems concepts and don't abandon your old way until you're ready.

David Blunkett eat your heart out ... by flossie · 2004-09-06 05:27 · Score: 1

Today computers are dualistic; they have a screen area, this is where you input information, but it is violate. And they have a disk area which is permanent storage, but you need to explicitly save to that storage. There used to be a good reason for this behaviour, but today there is nothing wrong with storing every keystroke you input.

Shhh! Don't let David Blunkett hear you! If he finds out that computers can do this, he will make it illegal not to use keylogging.

--
flossie

Write now. Defend liberty

SVN + DBFS + 10TB Nano Hoozie by KrackHouse · 2004-09-06 05:33 · Score: 4, Interesting

I'm using Subversion for a project and the idea of Atomic Commits seems like an obvious direction for file systems. If that other slashdot story is correct, storage becomes less of an issue and it would be possible to roll back the system to any point in time or to only roll back one file if need be. Now throw an intuitive way to navigate files on top of that and you've got a sure winner.

In the grand scheme of things, only a very small handful of us on earth are aware of Linux or even know what an Operating System is for that matter. File systems seem to be the big stumbling block for new users. Anything that can make computers and therefore access to information easier for the coming waves of new computer users (maybe billions of people?) will be a good thing. Even if the "bloat" slows down the system by 10%.

I hate to preach but that old quote comes to mind "With great power comes great responsibility". I don't think most of the people working on the OS that will soon dominate in developing nations (that's Linux) are aware of the harm they can do by slowing down Linux development with petty personal disputes. Like it or not, Linux is no longer just an edgy hacker tool. It has the potential to change the lives of Billions of people.

--
What if Digg added local news and a Slashdot inspired comment karma system? ---
http://houndwire.com

Re:SVN + DBFS + 10TB Nano Hoozie by Anonymous Coward · 2004-09-06 08:40 · Score: 0

I don't think most of the people working on the OS that will soon dominate in developing nations (that's Linux) are aware of the harm they can do by slowing down Linux development with petty personal disputes.

I don't think most of the people not working on the OS realize how often something that looks from the outside like a "petty personal dispute" has at its core really crucial technical problems.
For example, from a brief glance at the Linux Kernel Mailing List, one might assume that ReiserFS 4 is currently being kept out of the kernel because of petty personal issues. A closer examination will show a few posts that point out critical technical flaws that the ReiserFS people have not yet fixed.
It's easy to look in from the outside and say things like "stop squabbling and get on with your work!", ignoring the fact that the squabbling is sometimes an unfortunately inevitable by-product of people trying to tackle thorny problems.

Backups by Thu25245 · 2004-09-06 05:40 · Score: 3, Informative

"How would a user be able to group (manually) related files together, so that the whole bunch can be backed up later, without having to search for all seeminly related (or unrelated) keywords to trace all hitherto-unrelated documents? "

Search for: Documents
Modified since: last backup

I don't know about the other implementations but Searchlight and WinFS are implemented atop the existing filesystem. (The FS in WinFS supposedly stands for "Future Storage" and not "File System") Sort of like how Google is implemented "on top" of the regular hypertext-linked internet.

Hurd??? by FreeLinux · 2004-09-06 05:46 · Score: 0, Troll

Is this the same Hurd that will ship one year after Duke Nukem Forever?

Frankly, one of the greatest features of Hurd is that people still talk about it, in the present tense no less, despite appearances that it will never exist.

From GNU Hurd: It is not ready for production use, as there are still many bugs and missing features.

On the negative side, the support for character devices (like sound cards) and other hardware is mostly missing.

Re:Hurd??? by arose · 2004-09-06 08:56 · Score: 1

Hurd just lacks the momentum that is needed for quick progress in the free software world. Sad.

--
Analogies don't equal equalities, they are merely somewhat analogous.

Metadata, not DB by Thu25245 · 2004-09-06 05:51 · Score: 1

As I understand it, Searchlight is implemented as indexed metadata in additon to the standard Mac OS HFS+ file system. It's not a true "database," but rather a quick and fairly functional facsimile. (Similarly, WinFS is not a FS at all, anymore.)

Re:Metadata, not DB by TheInternet · 2004-09-06 15:22 · Score: 1

As I understand it, Searchlight [apple.com] is implemented as indexed metadata in additon to the standard Mac OS HFS+ file system. It's not a true "database," but rather a quick and fairly functional facsimile

Based on the description on the site, the "Database file system" appears to be the same thing:

"The current implementation is a server/client that provides the DBFS as a layer above a hierarchy based file system. The DBFS does not actually store files, it holds references to files on the underlying hierarchy based file system. The GUI part is implemented in KDE where it replaces all hierarchy based file accesses."

- Scott

--
Scott Stevenson
Tree House Ideas

This is not a new idea..... by Cnik70 · 2004-09-06 05:51 · Score: 3, Interesting

Take a look at an AS/400 (iSeries). They've been around for more than 10 years. And before that you had the System 38 and System 36... and so on..... Why is this some big revelation now?

--
-Cnik

Re:This is not a new idea..... by Punboy · 2004-09-06 12:59 · Score: 1

because this is bringing it to desktop/personal computing. obviously linux is not OS/400

--
If you like what I've said here, and want to read more, go to http://www.krillrblog.com

mysql filebased access by suso · 2004-09-06 05:53 · Score: 1

A few years ago I remember seeing a filesystem based access gateway to MySQL. That was pretty neat because you could access rows of information using standard Unix tools like grep, sed, awk, numsum and so on.

System 6? by lastmachine · 2004-09-06 06:10 · Score: 1

Surely that is the latest version of MacOS which could have debuted the "Desktop Database".

I know it was there in System 7, which still puts us in the early nineties, I think. Not sure of the year--not sure of the System version--not sure whether it was a database or a Metadata whoozywhatsis. But I am very sure that I never had my operating system complain that "somebody must have (gasp!) *moved* a file!".

What strikes me here is not the ease with which you could find a file (it was not a sure thing), but the rarity in which you had mis-placed one. Even the most elderly Macs you might come across kept pretty tight track of what was going on in the FS.

A resourceful* Mac user could assign one of eight or ten categories to a file, which would then show up COLOR-CODED in your SORTABLE display. Note that these were not, however, fields--just one-shot categories. If you wished to use them as fields, great, but you had a total of eight (or ten) values.

I was just surprised not to see it mentioned here yet. Nevermind OSX. If I could have the MacOS 9 interface (same as System 7, basically) on top of something POSIX, well, then we'd be talking. And yes, I would use my desktop database to find things impossible in other pricey consumer-oriented OSs.

* sorry bout that

What an improvement by Anonymous Coward · 2004-09-06 06:19 · Score: 1, Insightful

Did you work on that file last month? Find all files you worked on last month. Was it a word document? Find all word documents you worked on last month. Was it for a certain project. Find all word documents from that project you worked on last month. That is the thinking the DBFS supports.

What a coincidence. That is the thinking that "find" supports, too. And I don't have to run a bloated desktop environment to use it. Cool.

Re:What an improvement by Risto · 2004-09-06 06:52 · Score: 1

how long does it take to run find / 2 minutes? 10 minutes? ok use locate instead takes 1 second maybe less but the indeces may not be up do date this essentially updates the locate-style db indeces whenever a file is saved brilliant going into a folder and displaying its contents becomes the same thing as show me all files from last week that are images about nature brilliant I do agree though that it should have a command line equivalent so you need to have a real dbfs AND a command line utility akin to find that akes advantage of it, ie make find dbfs aware and have it run faster and capable of searching metadata, but you also need this kind of improvement to KDE so that you can display files by keywords, not by the ANTIQATED notion of FOLDERS
Re:What an improvement by Risto · 2004-09-06 06:56 · Score: 1

how long does it take to run find / 2 minutes? 10 minutes? ok use locate instead takes 1 second maybe less but the indeces may not be up do date this essentially updates the locate-style db indeces whenever a file is saved brilliant going into a folder and displaying its contents becomes the same thing as show me all files from last week that are images about nature brilliant I do agree though that it should have a command line equivalent so you need to have a real dbfs AND a command line utility akin to find that akes advantage of it, ie make find dbfs aware and have it run faster and capable of searching metadata, but you also need this kind of improvement to KDE so that you can display files by keywords, not by the ANTIQUATED notion of FOLDERS

Exec 8 by jefu · 2004-09-06 06:28 · Score: 2, Informative

Univac's Exec 8 (I think), its been quite a while since I used it, had versioning integrated into the files themselves - at least for some files. That is, they marked changes to the files as part of the file content up to 5 (I think) levels deep.

This did mean that you had to use the right tools to get into the files or you had to cope with the changes in programs that worked on them.

VMS also had file revision numbers on files as a couple of posters have noted.

Both of these were nice in some ways, but relatively difficult to deal with in other ways. By comparison unix is straight-jacketed but easy to use.

AMEN, and slimware already exists and is debugged by Ars-Fartsica · 2004-09-06 06:28 · Score: 1

I agree that I want software that starts to exploit my system instead of wanting forsaking features to minimize resource use....particularly when the resources are fairly abundant.

Added to which, if someone wants "slimware", its already out there - it was written in 1995. If you are stuck with a 486, boo-hoo, fortunately there was a time when this was the cutting edge, and during that time people wrote and optimized code like lynx and fvwm and xview. So the code is there if you still need it, stop complaining!

I am tired of the posts on here telling us about the crippled/lameduck hardware some welfare case is running, and how we all need to accomodate him. For Christ's sake folks, you can get a P4 desktop for $500.

"top" nerds go away, we want features by Anonymous Coward · 2004-09-06 06:34 · Score: 0

Let me guess, the first app you fire up is top, and you get hives if the system dips below 95% idle?

Database File System by smi4409 · 2004-09-06 06:36 · Score: 1

If considering file-system as database, then consider application as database. The Run-Time-Access package makes your program's internal data structures appear as tables in a PostgreSQL DB. RTA is to a DB as /proc is to a file-system.
http://www.runtimeaccess.com/

Great.... by DrJonesAC2 · 2004-09-06 07:02 · Score: 1

...that's all us tech folk need is for the users to be EVEN MORE RETARDED! More abstraction is not necessarily a good thing.

BFS by jeti · 2004-09-06 07:13 · Score: 1

The database-like filesystem of BeOS was replaced in the "Advanced Access" release (before PR 9). It was said to have inadequate performance for multimedia work. The old filesystem was replaced with BFS, which has been written by Dominic Giampaolo. It is pretty powerful with its indexable custom attributes. But it is not really a database-like system - as the old file system of BeOS was.

300 MB? by poptones · 2004-09-06 07:22 · Score: 1

Think more like 30GB. Take any person with a digital camera, give them the ability to have at hand any photo ever taken cross referenced by any number of data, and it won't take long at all to amass several GB of thumbnails and tags.

By the time you factor in the near annual cost of replacing DVD-R drives that have failed due to laser burnout it costs about as much today to store data on hard drives as on DVDs. I have 320GB in storage on my desktop that cost all of $200 - why give a fuck if my "search engine" eats up 30GB? These types of accessories are an investment in memory augmentation - buying a bigger hard drive is an incrediby cheap investment for the benefits obtained.

If you want minimal, install gentoo or something and learn to love blackbox. For a long time I used blackbox with mdk8 on 200mhz pentium MMX systems and with only 128MB ram it was pretty damn acceptable and it looked very nice. Minimal is easy - minimal is what linux has been all along.

When I was stuck in windows I used Thumbsplus for this (which, despite its name, will work with just about any MIME type, not just pics). I had several databases, all of them 2GB (for easy backup to DVD). The one thing I have found no acceptable substitute for in linux, ironically, is thumbsplus.

I've been wanting something like this for years. Many times I've stumbled upon a website during a search that seemed mildly interesting for whatever reason at the time but not relevant, only to think a few days later "what was that site where I found...?"

I know that sounds a bit like dashboard, but what I've seen of dashboard falls short of this - it's almost like the two products are the same concepts but built from different directions. When they meet in the middle, many of us will finally have something an order of magnitude closer to our dream desktop.

300MB? Hell, my /tmp is four times that!

Re:300 MB? by ResidntGeek · 2004-09-06 10:19 · Score: 1

He meant the GNOME install is going to be 300 MB larger even though he doesn't want the extra features. He's not going to be using it to store several GB of thumbnails and tags.

--
ResidntGeek
Re:300 MB? by NuclearDog · 2004-09-07 10:57 · Score: 1

"I have 320GB in storage on my desktop that cost all of $200"

That's excellent that you have $200 to spend on storage. Personally, my family can barely get by (total income is about $10000/year), so just deciding to spend $200 on a bigger hard-drive is not something I can do. If it wasn't for being given a two 20GB hard-drives for free, I'd likely still be using a 10GB in my desktop and a 4GB in my server. In fact, I was looking the other day for anything larger, the smallest I could find was 80GB selling for ~$150 (CA), still out of my price range.

My point is that although you may have money, not everyone is as well off.

ND

--
This statement is forty-five characters long.

Set Theory to the rescue by Tablizer · 2004-09-06 07:28 · Score: 2, Interesting

What about backups though? How would a user be able to group (manually) related files together, so that the whole bunch can be backed up later, without having to search for all seeminly related (or unrelated) keywords to trace all hitherto-unrelated documents?

Defining "group" is one of the problems with hierarchical systems. What is the group you want is dependent on your needs at the moment. Relational makes it easier to create ad-hoc groups.

For example, do you want to backup by file age, file type, and/or by topic? And in the real world topics naturally overlap. Set theory has an easier time with this because it is meant to deal with overlaps; but hierarchies get messy beyond about 3 orthogonal factors. Relational is closer to set theory than trees are.

A drawback of sets is that most people are not familiar with non-tree arrangements. There will probably be a "training hump".

More about sets versus trees.

--
Table-ized A.I.

Little research won't hurt by zdzichu · 2004-09-06 07:30 · Score: 2, Informative

Before inventing something you should check if no one did this earlier. Because there you have GNOME Storage. Don't be fooled by screenshots there. Storage isn't only cool search facility with native language parser ("computer, find me all porn I've downloaded yesterday" anyone?).
Storage is, suprisingly, method to store files decomposed to contents. The great searching ability is a side effect.

Imagine collaborating in of group of people over one document. Every one got some paragraphs to edit. With Storage, everyone can edit this document in the same time, seeing other's changes as letters are typed. Store version history and you have revision control. Throw in network transparency (you go to other department, connect laptop and automagically you can work on those department files) with OpenTalk (Zeroconf/Rendezvous) and you got best idea since hierarhical directories.

Be sure to read whitepaper about Storage available on mentioned site. Also check for Storage related entries in Seth's blog (Seth is one architect of GNOME Storage). Now if only KDE people work on compatibility with Storage, freenix desktop would rule the world.

BTW, KDE, don't miss chance of integration! KDE is planning to introduce google-like search in desktop. Don't reinvent wheel! Beagle is here, working. Just integrate Beagle with KDE desktop and we are set.

--
:wq

Backward compatibility via syntactic fluff in cd by Anonymous Coward · 2004-09-06 07:37 · Score: 0

Change directory could be made to handle DB queries by having some bracketing character sequence in the directory names be made reserved and using it to enclose relational queries. On access the db could execute the queries and would need to remove the pseudo path elements from what gets sent to the underlying file system. Then any scripting system that needed to pass a path could take advantage of the DB access without the app really being aware of it and without massive reprogramming.

Jeez, when will these people learn? by Grendel+Drago · 2004-09-06 07:47 · Score: 3, Insightful

Joel on Software said it best:

For example, WinFS, advertised as a way to make searching work by making the file system be a relational database, ignores the fact that the real way to make searching work is by making searching work. Don't make me type metadata for all my files that I can search using a query language. Just do me a favor and search the damned hard drive, quickly, for the string I typed, using full-text indexes and other technologies that were boring in 1973.

--
Laws do not persuade just because they threaten. --Seneca

Why you don't want a DB-based FS by 0x0d0a · 2004-09-06 07:49 · Score: 2, Interesting

I hate GNOME-VFS and KIOSlaves and all that stuff that people try to roll into a higher layer than they should. The idea of combining VFS functionality and a *desktop environment* is *stupid* and exactly the sort of fragmentation that hurts everyone. Want a userspace VFS layer (like, other than the existing transparent systems like LUFS and FUSE, which are much better if you can use them, because existing apps continue to work) Fine. Make one. Call it libvfs, and make KDE and GNOME bindings. But for the love of God, quit trying to roll filesystem functionality into desktop environments. It's ridiculous, and not where it belongs.

That being said, I'm still not a huge fan of this.

There are three main features that people seem to want with a DB-based FS:

* Transaction/views/high-level-DB functionality.

You don't want a FS that tries to do all this. DBMSes have worked for a long time to do this efficiently. If you want this, use a real DBMS, because it's going to smoke what someone tried to hack up into a filesystem.

* Automated index updates. Basically, locate but atomatically updated as changes occur.

Mac OS Classic had synchronous index updates as the filesystem ran. This makes the filesystem slow. It sucks for servers.

It's a much better idea to do asynchronous index updates, where the index approximates the filesystem quickly, but perhaps not right away -- you can do that without killing performance. Basically, you have the kernel notify a userspace daemon when changes occur, which rebuilds the part of the directory hierarchy (possibly waiting a while to see if it can use idle time). If a ton of changes occur to a small portion of the directory hierarchy, instead of trying to keep up with each change (say, a million changes to part of the directory hierarchy), the update daemon rolls all those queued up updates into a singled queued up full update of that entire portion of the directory hierarchy. It does a bit of extra indexing, but doesn't get backlogged.

This *could* be done under Linux, but there is one bit of kernel support that is missing -- currently, it is unusably inefficient for an app using Linux's existing dnotify() directory monitoring mechanism to deal with (a) changes to a directory containing many entries and (b) changes to directories anywhere on a filesystem (currently, dnotify() requires a FD for each directory to be monitored).

There *is* a enhanced dnotify patch out that would improve Linux's dnotify() mechanism to the point where people can write daemons with this, but Linus has not yet rolled it in. Once he does so, we can get the ball rolling on such daemons.

* Indexing of metadata.

People want to be able to search for their files using metadata. Again, this can just as easily be done using such a daemon. Existing apps continue to work, people can choose where to put something in a unique hirarchy so that they can reach it again, but files can also be addressed via metadata. If you want to provide, say, a tabbed file selector containing a tab for selecting files via metadata, that's quite feasible.

A major reasons why you don't want to provide a full DB-based interface is that you lose the existing hierarchical representation, and every app in the world stops working. You don't want people to just "save this file to a filesystem" and have to address it via metadata -- you want the user to give the thing some kind of unique identifier.

--
May we never see th

And as for DBFS by 0x0d0a · 2004-09-06 08:01 · Score: 1

Oh, and as for reasons that I don't like this particular implementation of a DB-based filesystem -- they add the idea of attributes, which is cutesy, kinda of like Nautilus did with emblems (grr...darn desktop-environment-level FS stuff), but then they overload the keyword namespace with attributes, which is just plain silly, and runs the risk of any app using attributes getting into name collisions.

--
May we never see th

The real benefit? by dr_warm · 2004-09-06 08:18 · Score: 1

I can't really see the need for it in a single machine environment - nice touch, but unnecessary.

Where I see the real benefit for this type of file system is when it's networked. Then the whole idea of thin network clients starts to come into it's own. Not only does the end user naturally then have to save all his/her work onto the SAN (easily doable today with other methods), but they transparently do it in a way that means other people can easily find what they've done. They'd also naturally have access to shared resources without having to consciously search through shared drives etc. Just an idea, but it could be an absolute boon for collaborative working, if done correctly...

Bring the network storage right into the opsys!

Does it use QT? by Anonymous Coward · 2004-09-06 08:49 · Score: 0

If it uses the QT library, forget it being used by GNOME people.

Think evolution, not revolution by rcpitt · 2004-09-06 09:03 · Score: 1

Lots of talk about ReiserFS previously but I didn't see anything about the use of its XML plug-in and the possibilities that this might be a way to allow an evolutionary approach to the whole problem of alternative means of finding stuff you know is there but don't know where.

The fact is that XML in some form or other seems to be the direction that many "office" type programs are headed, and it is amenable to being used as the underlying means of indexing other document types too - there being XML extensions and filters for many databases as well as native databases for it.

In fact, Reiser with the XML plug in is supposed to be one of the fastest XML databases out there.

So... if the implementation is split into file store and meta-data store, and the metadata is stored in XML - then it should be possible to cater to both those who can (and want to) use Reiser and those who can't/don't.

Of course I may just be blowing smoke out my ears after the Labor day blowout last night ;)

The other point I'd make is that there should be some way to plug-in tools for after-the-fact analysis and indexing of documents to add to the system - I have e-mail and correspondence going back to 1983 (from Xenix and TRS-Dos days) that I'd love to include in the system but only if the system will find and catalog it for me. I'd certainly be willing to teach it how to deal with Scripsit and Multi-plan files (even Visicalc).

Don't worry if the analysis takes days/weeks/months - shove it off into background and let it vie for time with your screen saver or SETI. I don't care because I don't expect to get around to it manually for another 20+ years ;)

Taking a look at ctime and mtime (on Unix systems) for example to tell when a document was initially created and last modified would go a long way towards filling in the gaps on the timeline. Then categorize by the keywords in the directory tree structure the document is found in (there are several trees from the past that contain the word "customer" for instance - and all items below those trees should be able to be aggregated) and by gross file-type (visicalc, multiplan, excel, etc. are all spreadsheets for example) and things would start looking up.

Use the magic file, not file extensions - look inside the file (using "strings" for instance) for keywords if the file type's major structure isn't (yet) known - and abstract comments and other info (date/time/f-stop, copyright notices, etc.) from the likes of image files too. Even abstract the image files down to a predominent color (show me all the green images - or all the flesh-colored ones). Abstract the image size and track it too - show me all the images except the 160x120 and 640x480 (they are created from the originals)

Think about the fact that the average desktop system is idle a majority of the time and use that CPU horsepower to make my life easier.

Also - don't worry about using more storage for the meta-data than the original document used - storage is now so cheap compared to my time that you can use 10-100 times as much if it will save me an hour looking for something.

I'm watching any/all of these types of projects looking for one that will come even close to what I think I need.

--
Been there, done that, paid for the T-shirt
and didn't get it

Good description - I back Spotlight because... by SuperKendall · 2004-09-06 09:07 · Score: 1

Yes I own a Mac, I admit that bias. But even before I had heard about Spotlight I had doubts about the WinFS approach to file storage.

Why? Because a user should not have to help manage metadata if at all possible. If they do things will just become a mess. I think in the end the only reliable data at all is what is actually in the files themselves - so any approach that parses and extracts file metadata from contents is going to end up with way better searches for a user than one where you have to say "this is a picture of Uncle Fred".

The advantage a Newton or Palm has is they provide all the key apps for the OS. I think that even in Windows Microsoft would have a hard time getting enough programs to get behind the storage of key metadata to make it as useful as they would look - who knows, perhaps acquiring such backing is part of the WinFS delay.

The Spotlight approach seems cool in that is offers the possibility of custom metadata-extraction plugins, that perhaps go the extra mile to provide meta-meta data (like figuring out which pictures were mostly water or sky).

--
"There is more worth loving than we have strength to love." - Brian Jay Stanley

Re:Good description - I back Spotlight because... by EddWo · 2004-09-06 12:46 · Score: 1

WinFS also has metadata extraction, they call it Promotion/Demotion, MS plans to support the major file formats out of the box, Office, PDF, JPEG EXIF, ID3, etc, but it will be fairly easy to implement your own metadata handler for your custom file formats.

File property handler API

As far as meta-meta goes data there are classes in WinFX for facial recognition not just water and sky.

--
"Taligent is still pure vapor. Maybe they'll be the last who jumps up on Openstep... "

Taking filesystems to the limit by PureFiction · 2004-09-06 09:19 · Score: 1

Let's take a step back and consider some fundamentals:

- Filesystems are a way to represent resources in an organized namespace.
- Disk capacity is increasing faster than our ability to utilize it effectively.
- Networks are everywhere, from Internet to WiFi to Lan Party.
- How you use data is more relevant than where you use it.
- Social networks provide context and reputation to resources and conversations.

Lets mix them all together and see what comes out:

- A filesystem that is not "fixed" to a disk, but is a "view" associated with you (securely) that travels wherever you go, using networks and aggressive caching on plentiful disk space to make access fast.

- A filesystem that is a shared collaborative space organized by the way you use resources within it, and the way your peers in your social network use their resources as well. New resources can be quickly brought to your attention as something "interesting". Common resources are more aggressively cached, making them fast to access and distribute, more reliable.

- A filesystem that has rich semantics and robust metadata data associated with the resources contained within. This metadata is built via implicit feedback based on the way you and your peers use its contents as well as explicit feedback / assignment you manually perform.

Microsoft and everyone else is focused on the flexible view of resources but only on local data contained on disks you have in front of you.

Where OSS can beat everyone to the punch is the decentralized, networked view which makes file resources as mobile as the peers who use them. This scares content owners, and breaks DRM models. This is an area where OSS has an advantage, and can be on the leading edge of innovation instead of the trailing copycat angle (WinFS/Spotlight, etc)

Lower level? by SmileR.se · 2004-09-06 09:22 · Score: 1

Why doesnt anyone care to implement this on a lower level so all DEs/filemanagers/whatever can use these kind of features instead of binding it to one or two single enviroments?

A bigger weenie by poptones · 2004-09-06 09:55 · Score: 1

if you don't want it shared, don't share it. Seems pretty simple to me - stay away from Redmond and you got nothing to worry about. But what if you WANT to share that info?

45,000 files is nothing - I know people who easily have ten times that. And if it's just about finding redheads with big ones - well.. ok then let's go there: What if you want to put together a slideshow of redheads with big ones? What if you're an old schooler and want it to include those other redheads with not so big ones as well? And what if you want your slideshow to include only "big ones" plus your favorite shots of Julia Hayes and Angie Everhardt? Now you have to navigate to god knows how many directories, fish through thousands of jpegs and add them to some application's "sideshow" list. All so you can enjoy a thirty minute wank. Sure, you can save the slideshow - but repeating this dozens or hundreds of times sounds to me like a huge waste of wanking time.

Enter your data properly, tag it as you add it (and tag the old stuff as you have time) and there's no need to do any more. That directory of 500 Julia Hayes images might be "anonymous" to the database because they're all just numbers and letters, but you only have to select the "JuliaHayes" directory and enter "Julia Hayes Redhead Penthouse Canada usenet dancer height:66 mea:37C-23-34 zoology elephants ballerina" one time to correlate every single file in that folder to the database with those properties. Then the computer will know which are your favorites - they're the ones you keep clicking on and leaving open for minutes at a time.

So now putting together a good wank is only a matter of clicking on your "favorites" link, typing "redheads ++big ones," and enjoying the show. And if sharing this info means it might give some salesdroid the impetus to offer you even more Julia Hayes (ahem) "data"... well, ain't that the point?

someone clue me in here. by zaqattack911 · 2004-09-06 10:12 · Score: 1

The whole point of the DB file system is to easily search for files in a massive HD. The files you'd likely need to search for are not data used by games or applications but your own media.

such as text/ mp3s / pics /vids.. yadaa.

Thus, forcing all file xactions to take place through a database wouldn't be useful all the time.

You pretty much want something along the lines of what windows media player "Media library" does with your music and videos.

You build a database when you're not using the computer.. and you load the "explorer/searcher" program when you need to find something.

calling it a new filesystem seems a lil much.

God is in the implementation details by Anonymous Coward · 2004-09-06 10:37 · Score: 0

The server and client are implemented using O'Caml, but the client has different APIs: O'Caml, C, C++ and Objective-C.

The only languages I see there of any use to 99% of the world are C and C++. What I really need to improve my support structure is to adopt OSS implemented in OcaML -- give me a break.

Grandma by AnEmbodiedMind · 2004-09-06 10:47 · Score: 1

Or should you put them in the folder named "grandma"?

Better Ideas have Existed by Slicker · 2004-09-06 11:22 · Score: 1

The idea of a database filesystem is, of course, not novell at all. Implementations have existed for years on the command line, include a mysql mounter..

Here's my idea, as I've been telling people for years (and for which I've been awaiting the arrival of Reiser4):

The darn thing needs to start with a filesystem that supports extended attributes and very rapid name/metadata reading. Then, we need to add or extend commands such as the "ls" command to take advantage of SQL querying of filenames.

Next, build a GUI kpart for making queries and the ability to drag and drop a query as an icon on the desktop (stored view).

Imagine a GUI interface organized into rows and columns where the first row

(1) holds checkboxes for (show/don't show) the results.
(2) The next row holds attribute names in dropdown boxes so you can pick what attribute names are shown (or not shown) from left to right...such as name, creation date, project, and document type.
(3) The third row holds dropdown boxes for sorting which the options: Ascending, Descending, or None.
(4) The fourth row are where-clause text entry boxes. For example, one might put "> 06/25/04 and 07/25/04" under the creation date attribute.
(5) And all following rows are basic "Or" statement lines...show anything that matches the criteria on this line or any other line.

This mechanism has been referred to as a QBE (Query By Example) and it translates easily into a SQL statement. SQL statements can be parsed and a list of relevent files quickly produced.

One thing I hadn't thought of was also putting this mechanism into the File Open dialog.....but I guess that's a natural....I just didn't try to think much about it....good idea.

Matthew C. Tedder

Time for the dictionary guys by dbIII · 2004-09-06 11:54 · Score: 1

Surely this thing sits on top of ext2, ext3, nfs, iso9660 and a host of real file systems. Saying this is a file system is like saying the gnome panel is an operating system.

Why is IT dumbing down to the level of those consumners who call the monitor a computer and the beige box under their desks a hard drive?

Installation instructions ... Re:Version control by akbkhome · 2004-09-06 13:13 · Score: 1

I've been using this for over a year - saved my ass so many times..
Subverison and DavFS

restore yesterdays file:

svn cat -r {yesterday} http:/..../myfile.doc > myfile.doc

I really need to look at fixing davfs for 2.6 though.

--
Taking PHP to the next level: phpmole, php codedoc, php-gtk pear installer, DataObjects for php, ldap schema viewer and

Link dead, but found some things... by SuperKendall · 2004-09-06 14:10 · Score: 1

Well, the link doesn't work as of course Microsoft cannot comprehend someone might want to link directly to a section..

I found it after a little looking. The API is mainly for programs that want to add metadata, which is neat and all... but not for the recognizers.

Now I figured they had to have something that would a least pull EXIF data from files (and I was also thinking of facial recognition parsers for meta-meta data). But I can't find how you would write a custom one for the system, or how it would get invoked - not all of the links under the WinFS overview appear to work.

I really think Microsoft is banking more on programs creating rich trees of relationships though, more than having a bunch of file parsers (they jusy need them to essentially pre-load the DB).

--
"There is more worth loving than we have strength to love." - Brian Jay Stanley

Re:Link dead, but found some things... by EddWo · 2004-09-06 20:09 · Score: 1

Link works fine for me with Firefox.

It contains links to the other information you wanted
C# Code for a Sample File Promoter

Registering a File Property Handler in "WinFS"

It is to preload the DB, and place the modified DB information back in the file so it can be moved to a non WinFS system, and to update the DB when the file is modified by a non WinFS aware application.
So if you started editing your ID3 tags with Winamp2 it would catch the file apis and call on the file property handler to keep the DB in sync.

--
"Taligent is still pure vapor. Maybe they'll be the last who jumps up on Openstep... "

Better discription of my disquiet... by SuperKendall · 2004-09-06 14:22 · Score: 1

Ok, related to the last sentence of my last post (Microsoft intends people more to make use of rich API's) I found this blog complaining about WinFS - and describing exactly the biggest problem I see with the whole thing.

If you read the blog entry linked to, you find this paragraph:

No - the "big deal" about WinFS IMHO isn't "search". Like Web Services - it's about the fact that it's an attempt to get a higher level of interoperability between programs through agreement on schemas. Hopefully, toward the goal of bootstrapping network effects, and unintended/innovative consequences, on the client. WinFS defines an extensible object model and persistence mechanism, as well a rich and extensible "relationship" mechanism that can be used to intertwingle objects with that are somehow related to one another. Some kinds of stock relationships are obvious: common Author, common Artist, common Location, common Priority, etc. Some may be more subtle or domain-specific: common Project, common Client, common Contributors, or even manual and thus not-easily-described ties.

Like the first blog says, this hope of "interoperability between applications" is a pipe-dream that may work OK with office but will then proceed to fall flat on its face beyond that realm.

It's pretty telling that Tiger will have Spotlight working next year, while you won't even see a public WinFS is 2006. The domain is more complex, but does it need to be when Spotlight is achieving the same practical effect for 99.9% of use?

--
"There is more worth loving than we have strength to love." - Brian Jay Stanley

FS on Database by Anonymous Coward · 2004-09-06 14:53 · Score: 0

Guys,
Oracle has already created this, its iFS. I don't remember how advanced this has gone on Linux, but I know that its available on Windows platforms..

Beagle is already written by Anonymous Coward · 2004-09-06 17:11 · Score: 0

Perhaps the Beagle folks should have done some research about Gadget, which has been in planning for about a year, and been working for more than four months. Why did they go and implement their own thing when they could have just integrated Gadget into the GNOME desktop.

Re:Beagle is already written by zdzichu · 2004-09-07 00:41 · Score: 1

Well, Beagle began as part of Dashboard nearly two years ago. Only recently indexing engine has been separated and released as stand-alone project.

--
:wq

Cooper wrote about this in 1995 by pepperi · 2004-09-06 18:31 · Score: 1

The original idea is both brilliant and quite old. Alan Cooper wrote about it in his landmark book "About Face: The essentials of interface design" in 1995 and again in "About Face 2.0: The essentials of interaction design". He's got a lot of good suggestions for a decent implementation. I hope that the DBFS developers read that. WinFS developers propably have read it too, but as always, they're trying to add too much spice in it, and end up with an indigestible agglomeration of crap, that won't come out of their colons before sometime around 2007.

I dont get it by Intrinsic · 2004-09-06 21:20 · Score: 1

Ok I understand that serachable databases can be usefull for alot of things, but I still do not see how this can replace hierarctical directory structures?
I mean there are times when I dont know the name of the file, when I created it, or what it was about. But I do know what directory I placed it in?
What if you still want to be able to find a file based on location?
Sorry for the clueless question.

Another field of application by Wolf+nipple+chips · 2004-09-07 01:31 · Score: 1

Anybody who ever tried to gather a significant bibliography, won't need any other example to convince him that a DB tool for dealing with metadata can be darn useful.

Sure, I tried to handle it all with symlinks for a while, but a single paper can have many authors, subjects, versions (ps, pdf), etc. rendering
maintenance painstakingly complex.

Since I work in an IT lab, we ended up building our own set of DB tools to allow access to papers through author, date, journal, URL, lab, subject, title, etc. and to generate bibTeX, html or XML indexes.

But I guess there are many other labs struggling to maintain bibliographies, unaware that such tools even exist. And they would benefit greatly from the possibility to handle metadata through a relational DB at system level.

--
Nothing is foolproof to a sufficiently talented fool.

Re:Another one... and one ... by accensi · 2004-09-07 02:06 · Score: 1

XFS does not already support operations on extended attributes?

OpenBeOS people have already completed the work in the OpenBFS (in fact it one of the only compllleted parts) and they implemented a lot of features not done in the Be implementation, but already planned, like for instance searching on extended attibutes.

Perhaps it is time to porting it to Linux ...

Re: Built-in file version control by some+guy+I+know · 2004-09-07 02:53 · Score: 1

RSX-11 for the PDP-11 also had versioning (the ;1 ;2 etc. that another respondent mentioned).
It was a DEC thing; they had file versioning on many of their OSes.
Every time that I saved a file, say, from a text editor, the system created a new version.
Multiple versions got annoying after a while, because there'd be so many of them.
Fortunately, what passed for their shell supported wildcards (e.g., "DEL FILE.FOR;[10-21]" (or something similar)).

Versioning would be nice, as long as it is not "in your face".
For example, right-click on a file to pop up a menu that, among other things, accesses earlier versions, or (in a hierarchical filesystem) have a special directory ("..."? "..ver.."?) that contains earlier versions.

--
Those who sacrifice security to condemn liberty deserve to repeat history or something. - Benjamin Santayana

Thanks... by SuperKendall · 2004-09-07 03:47 · Score: 1

I was using Safari, not sure why the links would not work right. They do work in Mozilla.

I am a little dissapointed to see that you register file promoters only by extension, and not by other criteria.

--
"There is more worth loving than we have strength to love." - Brian Jay Stanley

Long Live Medusa by sig · 2004-09-07 04:42 · Score: 1

Probably no one remembers this, but there was a project called Medusa in development 4 or 5 years ago by Rebecca Schulman and the other happy hackers over at Eazel. It had some awsome indexing technology but a lot of people eyed it suspiciously because it was somethnig of a resource hog, espceially for its day. Probably, these days no one would flinch at dropping that kind of RAM on something like this. Unfortunately, after Eazel crashed and burned, Medusa was heaped on the pile dead carcases of so many other free software projects that line the road to software Nirvana. It's good that this functionality will finally be come to fruition with projects like Storage and DBFS, but I always thought that had Medusa been saved, we might already have a mature document indexing system by now.

Alas, alack.

296 comments