Database File System

"Implementing in GNOME" by kosmosik · 2004-09-06 03:19 · Score: 5, Insightful

Such a thing should be implemented at the kernel level to be transparent for *any* application. Without this it will just lead to a mess (like 4 different implementations), with some apps working with it and most not. For example, you can browse an SMB network with Nautilus, but when you actually try to open a file (from SMB via Nautilus) in OpenOffice.org you get a message that the viewer does not support this method... It must be a standard system routine, not another level between system and GUI.

Re:"Implementing in GNOME" by slimyrubber · 2004-09-06 03:37 · Score: 2, Insightful

Actually, database storage should be implemented in the filesystem rather than the kernel or window/desktop managers. That would make much more sense and, theoretically, it would be faster.

Just my uneducated opinion.

--
[ I can not bring myself to believe that if knowledge presents danger, the solution is ignorance ] -- Isaac Asimov

File system in KDE... by JakeThompson1 · 2004-09-06 03:20 · Score: 1, Insightful

So now, you will lose access to your file system if you use a simple window manager instead of KDE?

Great idea.

stubborn by mn3m05yn3 · 2004-09-06 03:20 · Score: 3, Insightful

Article doesn't address whether or not we can turn DBFS off and use the more traditional hierarchical method of file placement. Will we be dragged into this kicking and screaming?

Re:stubborn by donkeyboy · 2004-09-06 03:57 · Score: 2, Insightful

Most of these DB File System implementations overlay the existing file system. The standard file access methods are still there. The DB functionality is implemented as extensions.

The DB tables are then implemented as special files that coexist nicely with the existing directory files.

Keep in mind that if a "new" file system breaks hierarchical file access, it also breaks every app in existence.

Disadvantages by BHearsum · 2004-09-06 03:21 · Score: 4, Insightful

How much performance overhead will this cause? The 'Desktop Environments' already eat a lot of RAM and CPU.
How much disk space will you lose over this? All the metadata has to be stored somewhere, and just glancing over the link I read something about a versioning system, which will definitely take up quite a bit of space. Will a 20 GB hard drive become 15 GB with DBFS?

Re:Disadvantages by aodl · 2004-09-06 04:03 · Score: 4, Insightful

While performance is something that should always be kept in mind, we are a long way away from the days of the original Macintosh where a desk accessory had to weigh in at 600 bytes in order to make the cut and fit into both memory and on a floppy disk. As current desktop machines outperform the high end servers of a few years ago, it would be nice to put a lot of that muscle to use in improving the user experience. I'm not excusing bloated and slow code here, but we don't really need to be counting bytes.

In any case, database-based operating systems have been around for decades, from OS/400 to BeOS. Many BeOS users claimed it was hands down faster than any other shipping OS at the time, and it featured a journaling, database-styled file system. One of the primary developers of that file system is now working at Apple on Mac OS X 10.4's Spotlight functionality.

The thing is - as our desktop storage continues to grow at the pace that it does, and as we curiously find ways to fill it up, new ways of looking at and finding the information we store are going to be needed.

DBFS, Gnome Storage, Apple's Spotlight, and WinFS all take different routes to get there. It's worth looking at all of them, what they offer, and where they differ. WinFS is a new storage layer that combines file system resources with more structured data in a Relational/XML hybrid system, with the aim (from what I gather) of turning the file system into a global "soup" of data. That sort of soup can be seen in office suites or PDA-style applications, and in older operating systems like the Newton OS, where everything is a shared resource that is stored and available through common structures. Spotlight, on the other hand, combines file system searches and indexes (think 'locate') with full content indexes and a metadata index, which uses 'importers' to parse out other file formats. Spotlight is not a new file system, but an indexing system that acts on files in the file system. From what I remember of Gnome Storage, it is similar, using the VFS layer and Postgres triggers and callbacks, along with plug-ins, to parse and extract relevant metadata and contents out of files. DBFS looks to be like WinFS in that it purely wants to be a new kind of information store.

I don't know which style will win out. My theory is that technologies like Spotlight will eventually evolve into a new kind of storage system, while remaining familiar and file based for today's users and developers. But this is an idea whose time has more than come. It's something that's been promised for the desktop for at least a decade, and has been shown to work, albeit in targeted OSes (the Newton) or ones that never achieved mass market penetration (BeOS).

So I think that performance concerns aren't that big of a concern, so long as (like all development) there are good people working on the solution.

What happened to the OODB? by jeanicinq · 2004-09-06 03:31 · Score: 2, Insightful

The database file system originated from the ideas of an object-orientated database. Keywords and references are all part of the orientation objects of the database to index to files or other objects. It does away with the traditional hierarchical view, being rooted at some place. The OODB does not need to be rooted as it is more like a web. The DBFS seems to try to implement part of the concept of the OODB. Good. There are many more features an OODBFS can offer: dynamic organization, classification, and multiple "skeletal" views to name a few. I hope that this DBFS will give a taste of what an OODBFS offers.

Re:Performance? by Anonymous Coward · 2004-09-06 03:31 · Score: 5, Insightful

Depends on what you are using your computer for of course.

You can say the same thing for a GUI, and it's correct for certain applications of computers, but wrong in others.

Re:Performance? by psavo · 2004-09-06 03:34 · Score: 4, Insightful

Isn't this thing with DBs getting a little excessive? You're adding another layer and step to storing data which will in all likelihood hinder performance. I'm not sure the benefits outweigh the cost.

Well, if it's only a name-translation thingy, then it shouldn't affect performance of file reading (when operating on sufficiently big files), only file opening/stat:ing.

--
fucktard is a tenderhearted description

This is not a file system by MobyDisk · 2004-09-06 03:40 · Score: 4, Insightful

Maybe we could call it a "filing" system since it indexes files that are on another file system. Really, a file system IS a database, not an add-on that indexes files. Still, perhaps this is a better approach than trying to redo all the file-system internals. Although to be truly useful, this needs to be an API that is GUI-independent, with GUI-bindings as needed.

Re:Performance? by BenjyD · 2004-09-06 03:54 · Score: 4, Insightful

Why not just run in console mode? All this GUI stuff is just getting in the way of absolute performance.

If it adds 0.5 seconds to every time you save a file, but saves you 20 seconds of filesystem navigating every time you open the file, that's a worthwhile tradeoff. Add to that the fact that computers don't get tired or bored, while humans do, and it makes even more sense to shift as much of the burden of working onto the computer as is practical.

Re:Performance? by jgardn · 2004-09-06 03:59 · Score: 4, Insightful

Not necessarily. Consider the performance of finding a document you wrote two years ago. How long does it take you to walk through the directory hierarchy browsing file names? How fast is the file search tool? Wouldn't it be faster if you could say "Show me the documents I wrote two years ago" and then refine the search or browse the results?

Storing data in a relational database is natural because it is more like the way we store data in our minds than the hierarchical structures of traditional file systems.

Also, we allow a complete abstraction of the underlying database in relational systems. The database can store the data however it sees fit, and can arrange the data on disk without the users noticing a change.

I look forward to experimenting with a relational filesystem. I think it would be a wonderful thing to try out and see if it actually has the advantages I outlined above. I'd also like to see the actual disadvantages.

--
The radical sect of Islam would either see you dead or "reverted" to Islam.

blah blah blah by Anonymous Coward · 2004-09-06 04:01 · Score: 2, Insightful

Sorry, but I have to say it, you are an idiot!
Did you even care to RTFA?
This has nothing to do with the gnome or kde devs. Some developer invested his time to come up with something he thought was useful and all you can do is complain?

And if you look at the project, it is something completely different than a plugin for the admittedly great ReiserFS4 and it is here and usable right now.

So friggin stop your stupid whining.

And to the mods who modded parent interesting ...

Btw., why don't you whine to the Reiser people that they should stop developing now? It would be as justified as your whining about this project.

Re:Reiserfs, storage and why do you want this? by TheRaven64 · 2004-09-06 04:12 · Score: 2, Insightful

I find that a spatial interface to a hierarchical storage layout makes it very easy when I want to find my files. This kind of thing is more useful when trying to find files you didn't create / save.

--
I am TheRaven on Soylent News

Any innovation is good by Stevyn · 2004-09-06 04:19 · Score: 3, Insightful

People can offer their opinions for or against this, but I think that any innovation benefits Linux. I've read about WinFS and it sounds like a good idea, but who knows when it will be ready. If people working in their spare time can get something like this working in Linux before Microsoft can get it out, I think that would just be another reason to trust the open source model of developing code and squash Ballmer's FUD.

I don't have too much trouble using a hierarchical file system. I keep my stuff pretty organized, but computers are supposed to save time, not create more problems. If this database can do a good job, I'll give it a shot.

Re:i don't have time to reinvent the wheel today by uncommonlygood · 2004-09-06 05:16 · Score: 2, Insightful

why don't the nice KDE people and the nice Gnome people work on developing a library that sits on top of [reiser 4] and then we can stop all the stupid name calling and use the right tool for the right job

I wouldn't normally jump in except this is modded +4, even though the poster doesn't appear to have read the article.

The article doesn't talk about an actual "file system" (it admits it's a bit of a misnomer itself), merely a way of referencing files with "keywords" in a database that, according to the author, will make it easier to find your files. The author has written a file browser and file selector dialogs for KDE that use this, then goes on to say that he's planning to make a GNOME implementation soon, but nowhere does anyone start any name calling.

Since this article essentially discusses a GUI and not a file system (which could be implemented at kernel level), it would be a little silly to use the same tool for GNOME and KDE, since they have different look-and-feel. That said, a common library of functions needed by both the GNOME and KDE versions would obviously be sensible, and would speed up porting to other desktops.

Furthermore, this idea claims to work on top of existing hierarchical file systems (removing the idea of a hierarchical FS completely would involve a major restructuring of the whole operating system and everything that works on it), so just saying that it should use reiser 4, I suppose because it's just l33t, is a little redundant. This should work on top of just about any FS: ext2, reiser, or even an NFS mount.

Re:standard filesystems are NOT databases by Anonymous Coward · 2004-09-06 05:25 · Score: 1, Insightful

The problem with find is that it doesn't scale because it has to do a sequential search. locate is faster than find, but the index is not always up to date. A database could keep various indices always up to date.
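To make that comparison concrete, here is a minimal sketch of an index that is updated at write time, so lookups never need a full scan and never go stale the way a nightly locate database can. Everything here is an assumption for illustration: the `FileIndex` class, its method names, and the sample paths are hypothetical and not taken from DBFS or any real tool.

```python
import os
import time
from collections import defaultdict


class FileIndex:
    """Toy in-memory index kept current on every write (hypothetical API)."""

    def __init__(self):
        self.by_ext = defaultdict(set)  # extension -> set of paths
        self.meta = {}                  # path -> (extension, mtime)

    def record_write(self, path, mtime=None):
        """Call whenever a file is created or modified."""
        ext = os.path.splitext(path)[1].lstrip(".").lower()
        old = self.meta.get(path)
        if old is not None:
            # Drop the stale entry before re-adding the path.
            self.by_ext[old[0]].discard(path)
        self.meta[path] = (ext, time.time() if mtime is None else mtime)
        self.by_ext[ext].add(path)

    def record_delete(self, path):
        """Call whenever a file is removed."""
        old = self.meta.pop(path, None)
        if old is not None:
            self.by_ext[old[0]].discard(path)

    def find(self, ext, modified_since=0.0):
        """Look up candidates by extension, then filter by mtime."""
        return sorted(p for p in self.by_ext.get(ext, ())
                      if self.meta[p][1] >= modified_since)


index = FileIndex()
index.record_write("/home/me/report.doc", mtime=1000.0)
index.record_write("/home/me/notes.txt", mtime=2000.0)
index.record_write("/home/me/old.doc", mtime=10.0)
index.record_delete("/home/me/notes.txt")
recent_docs = index.find("doc", modified_since=500.0)
```

The cost moves from query time to write time: every create, modify, and delete has to pass through the index, which is exactly the overhead other posters in this thread are worried about.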

What an improvement by Anonymous Coward · 2004-09-06 06:19 · Score: 1, Insightful

Did you work on that file last month? Find all files you worked on last month. Was it a word document? Find all word documents you worked on last month. Was it for a certain project? Find all word documents from that project you worked on last month. That is the thinking the DBFS supports.

What a coincidence. That is the thinking that "find" supports, too. And I don't have to run a bloated desktop environment to use it. Cool.

Re:Performance? by kfg · 2004-09-06 06:23 · Score: 2, Insightful

Why not just run in console mode? All this GUI stuff is just getting in the way of absolute performance.

Although this is a KDE related project the concept itself has nothing to do with whether you use a GUI or not and the performance hit comes at the level of the DB, not the GUI.

As for shifting the burden to the computer it doesn't really do much of that either as a human mind still has to formulate and input the query terms as well as judge the validity of the query result.

The DB as filesystem has a lot of merit, but really only in those situations where you have a massive number of files distributed across many systems. Take Google and the internet for example.

Now imagine having to google your local system to find every damned file.

Now, maybe I'm just different, but I've got 45,000 files spread across 1300 directories in my Home directory, and I can find any one of these by navigation in under your hypothetical 20 seconds saved, but then I'm the sort of person who sorts his laundry by placing it in separate hampers in the first place instead of later spreading it across the basement floor and sorting it out. I know exactly where all my dirty whites are up front.

And the latter sort of person isn't likely to do a very good job of entering the metadata necessary to make a DB based filesystem work well anyway. The brain is a wonderful DB in and of itself and knows the "meaning" of things as well. I already know what 14t.jpg is all about. I'd have to tell my DB what it means.

Sure, let the computer do the work that it's better at, like recalculating spreadsheets or finding redheads with big ones on the web, but that doesn't replace the brain.

I already know which redheads have big ones on my local system and where to find them no matter what the file is named and the computer can't find all of the redheads with big ones on the web unless they all have the proper metadata attached to them. Some file out there somewhere named 1038754875747.jpg is just as anonymous to the computer as it is to you.

KFG

Re:Backups, and being organized in a general way? by Ignominious Cow Herd · 2004-09-06 07:24 · Score: 3, Insightful

Think of a directory structure as just one instance of a relational structure: a hierarchical or location-based one. Directories (or locations, or relationship to other files), even nested ones, are just another type of metadata. Once you have that concept in mind other things become obvious. For example you may group files by type (Image, Text) or by project, or by author. With a directory structure you either have to make multiple directories and symlink everything, or you're stuck with one view of your files.

In short it gives you multiple, simultaneous groupings of your data.
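A rough sketch of that idea, using Python's built-in sqlite3 module: the directory is stored as one attribute among several, and the same files can be grouped by any of them. The table layout, the `add_file` and `group_by` helpers, and the sample paths are all made up for illustration, not the schema of DBFS or any other project.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE files (id INTEGER PRIMARY KEY, path TEXT UNIQUE);
    CREATE TABLE attrs (file_id INTEGER REFERENCES files(id),
                        key TEXT, value TEXT);
""")


def add_file(path, **attrs):
    """Register a file with arbitrary key/value metadata."""
    cur = conn.execute("INSERT INTO files (path) VALUES (?)", (path,))
    file_id = cur.lastrowid
    # The directory is stored as just another attribute.
    attrs["dir"] = path.rsplit("/", 1)[0]
    conn.executemany("INSERT INTO attrs VALUES (?, ?, ?)",
                     [(file_id, k, v) for k, v in attrs.items()])


def group_by(key):
    """Return {attribute value: [paths]} for the given attribute."""
    rows = conn.execute(
        "SELECT a.value, f.path FROM attrs a "
        "JOIN files f ON f.id = a.file_id "
        "WHERE a.key = ? ORDER BY a.value, f.path", (key,))
    groups = {}
    for value, path in rows:
        groups.setdefault(value, []).append(path)
    return groups


add_file("/home/me/site/logo.png", type="Image", project="site", author="me")
add_file("/home/me/site/index.txt", type="Text", project="site", author="me")
add_file("/home/me/thesis/fig1.png", type="Image", project="thesis",
         author="alice")

by_type = group_by("type")        # one view: Image vs. Text
by_project = group_by("project")  # another view of the same files
by_dir = group_by("dir")          # the traditional directory view
```

No symlinks are needed: each call to `group_by` is a different view over the same rows.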

--
Lump lingered last in line for brains, and the ones she got were sorta rotten and insane.

Jeez, when will these people learn? by Grendel Drago · 2004-09-06 07:47 · Score: 3, Insightful

Joel on Software said it best:

For example, WinFS, advertised as a way to make searching work by making the file system be a relational database, ignores the fact that the real way to make searching work is by making searching work. Don't make me type metadata for all my files that I can search using a query language. Just do me a favor and search the damned hard drive, quickly, for the string I typed, using full-text indexes and other technologies that were boring in 1973.
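For anyone who hasn't seen one, the "boring" technology in question is roughly an inverted index: a map from each word to the files that contain it. The sketch below is a toy version under obvious assumptions (the documents are invented strings rather than files on disk, and the function names are my own); it is not how WinFS, Spotlight, or any real indexer is implemented.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Split text into lowercase alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(documents):
    """Map each word to the set of document names containing it."""
    index = defaultdict(set)
    for path, text in documents.items():
        for word in tokenize(text):
            index[word].add(path)
    return index


def search(index, query):
    """Return the documents that contain every word in the query."""
    words = tokenize(query)
    if not words:
        return []
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return sorted(result)


documents = {
    "notes.txt": "Quarterly budget notes for the database project",
    "todo.txt": "Buy milk, fix the budget spreadsheet",
    "readme.txt": "Database file system overview",
}
index = build_index(documents)
hits = search(index, "budget database")
```

No hand-entered metadata is involved; the index is built entirely from file contents.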

--
Laws do not persuade just because they threaten. --Seneca

Re:gnome people... by Anonymous Coward · 2004-09-06 11:44 · Score: 1, Insightful

It's Microsoft Java and it's awful. There may be patent problems, not with C# or CIL but the 'excellent standard library' as you say. How do you know what Microsoft's long term strategy regarding .NET is? Why are .NET people .ALWAYS so defensive over vague comments that they interpret as being negative, yet .NEVER respond to questions about patents on the supporting libraries? I don't have a JVM or Flash installed either, is someone going to start jumping up and down about that?
