Slashdot Mirror


Tom's Hardware Looks At WinFS

Alizarin Erythrosin writes "Tom's Hardware Guide has an article about the new WinFS file system. The article talks first about some of the problems and advantages with FAT[16|32] and NTFS, then talks briefly about WinFS. Here is the summary: 'Microsoft is breaking new ground with Longhorn, successor to XP. The upcoming WinFS file system will be the first to be context-dependent, and promises to make long search times and wasted memory a thing of the past. Today, THG compares it to FAT and NTFS.' Personally, I still have reservations about using a relational database to keep track of files. Unless they can keep the overhead to a minimum, I can't see it being as efficient as a file system should be."

135 of 809 comments (clear)

  1. other FSs are out there by double_plus_ungod · · Score: 3, Interesting

    yeah, but you still get a choice--i don't use mac os x's journaling because of the overhead--you don't hve to use winfs if the performance penalty is too high.

    1. Re:other FSs are out there by localghost · · Score: 4, Interesting

      Journaling doesn't reduce performance much, and at least for me, it's well worth it for the peace of mind, and the lack of fsck. Hard drive space is hardly at a premium, most people can spare the 10%, and without it, I spend 15 minutes scanning my 40GB disk every certain number of boots (or if it's not shut down right). If I used Windows, I'd at least give WinFS a try.

    2. Re:other FSs are out there by double_plus_ungod · · Score: 2, Informative

      i wonder about the performance for increasingly large storage space...

      do you think that a search taking O(log n) [this is assuming similar performance to oracle] time over exponentially large drive space is better in the long run than what we've got now? is hfs+ that bad taking care of my 40gb drive?

      maybe i'm being less open minded about the journaling, especially since the /. consensus is journaling = good...

      and i am running a g4 400 w/ macosx and i really would hate to see that 10 percent get sucked up by disk accesses.

    3. Re:other FSs are out there by Hes+Nikke · · Score: 3, Funny

      Sorta like running XP in 2K mode

      you can get WindowsXP to run in 2 KibiBytes of ram?!

      or

      Sorta like running XP in 2K mode

      not that that's a bad thing....

      --
      Don't call me back. Give me a call back. Bye. So yeah. But bye our, well, but alright we are on a shirt this chill.
    4. Re:other FSs are out there by afidel · · Score: 5, Insightful

      Linux can not safely write to NTFS, not even NTFSv1.2 even though it has been around since 1994. The lack of documentation makes it so that some aspects of updating the metadata and indexes cannot be done safely in all situations.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    5. Re:other FSs are out there by TheNetAvenger · · Score: 5, Informative

      Yeah, but I bet you it'll be like Windows XP's theoretical ability to use FAT 32 - it can read it, but it won't actually let you format in FAT32 when doing an installation

      So when is the last time you actually used WindowsXP? What you said is pretty silly.

      You can install WindowsXP on FAT32, even create FAT32 partitions during setup etc, all the same options you get with NTFS during the install.

      Are you trying to make up stuff or just never used it?

      Geeshâ¦

    6. Re:other FSs are out there by TheNetAvenger · · Score: 2, Informative

      yeah, but you still get a choice--i don't use mac os x's journaling because of the overhead--you don't hve to use winfs if the performance penalty is too high.

      If done right, journalling has little to no performance impact. NTFS has had journalling since its beginning over 10 years ago, and even 486 based machines performed just fine with NTFS.

      WindowsXP and Windows2003 Server of today still use NTFS and it is still fully journalled, and the performance numbers Windows2003 Server is getting shows that NTFS is not slowing the OS down a bit.

      I'm sorry that the Mac Journalling has a noticeable performance hit for you.

    7. Re:other FSs are out there by SweetAndSourJesus · · Score: 5, Informative

      From what I've read and experienced, the performance hit only involves writes. Reads are unaffected.

      Writes are about 10% less efficient, which a pretty good tradeoff for that peace of mind.

      --

      --
      the strongest word is still the word "free"
    8. Re:other FSs are out there by NightSpots · · Score: 5, Insightful

      Right, but in general, journaling is a hack. The BSD softupdates concept is both cleaner and faster, and accomplishes nearly the same thing. To be fair, all of NTFS, ext3, reiserfs, BeOS's FS, and the new OSX FS have some advanced journaling features, but the concept itself should be more cleanly handled. Much like databases should be ACID compliant, filesystem writes can and should occur in such a manner that no external journal is required.

    9. Re:other FSs are out there by Unoriginal+Nick · · Score: 5, Informative
      Let me assure you that while it can read FAT32, the OS as default will not FAT32 format any drive over (I belive) 2Gb.

      The limit is 32GBs.

      FAT32 Formatting in Setup is an option, but is known to frequently fail or have significant errors

      No it doesn't.

      FAT 32 Formatting once you are in the OS requires 3rd Party wares.

      Again, only if creating partitions bigger than 32GBs.

    10. Re:other FSs are out there by akpcep · · Score: 2, Informative

      Falsehoods on all counts, I have 2 machines running FAT32 partitions of up to about 20GB with absolutely no problems. It's not an issue.

      --
      Hmmm.
    11. Re:other FSs are out there by afidel · · Score: 5, Informative

      The sysinternals driver is NOT a reverse engineering, it is basically a DOS mode interface to the windows system dll's. When you use the NTFSDOS disk you have to run it under the version of windows for which you will be accessing the NTFS partition, it then snarfs up the system dll's and puts them on the disk. Then when you load up the disk you run ntfsdos.exe which is basically a wrapper around that code. This could probably be done for linux but it would probably not be redistributable for fear of legal reprecussions from MS (how sysinternals gets away with it I do not know, maybe they have an agreement with MS, or maybe they are just too small of fish and too usefull for MS to swat them)

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    12. Re:other FSs are out there by Unoriginal+Nick · · Score: 4, Informative
      From this KB article:

      You cannot format a volume larger than 32 gigabytes (GB) in size using the FAT32 file system during the Windows XP installation process. Windows XP can mount and support FAT32 volumes larger than 32 GB (subject to the other limits), but you cannot create a FAT32 volume larger than 32 GB by using the Format tool during Setup. If you need to format a volume that is larger than 32 GB, use the NTFS file system to format it. Another option is to start from a Microsoft Windows 98 or Microsoft Windows Millennium Edition (Me) Startup disk and use the Format tool included on the disk.

      I've done it myself, with and without SP1 slip-streamed, and it should have let you do it as well. Are you certain the partition was only 30GBs?

    13. Re:other FSs are out there by Anonymous Coward · · Score: 5, Informative

      It should be noted that the regular fsck should still be performed, even on a journaling filesystem. The fsck after an unclean shutdown is obsoleted by the journal, but the regular fsck is supposed to catch subtle errors in the filesystem code and hardware problems which can cause the filesystem structure to rot slowly - problems which a journal can't and won't fix.

    14. Re:other FSs are out there by TheNetAvenger · · Score: 5, Informative

      I'm posting this from a WinXP machine. Let me assure you that while it can read FAT32, the OS as default will not FAT32 format any drive over (I belive) 2Gb.

      FAT32 Formatting in Setup is an option, but is known to frequently fail or have significant errors.

      FAT 32 Formatting once you are in the OS requires 3rd Party wares.

      SCO has performed an illegal operation and will be shut down.


      I hate to rain on your parade, but the information you are providing you are either mixing with NT4 information or just making it up as you go.

      Nothing I can say is convincing you, so let me reference you to the information you need.

      http://www.microsoft.com/windowsxp/pro/using/how to /gettingstarted/guide/installnew.asp

      It clearly provides information on the only limitations of doing a FAT32 installation of WindowsXP, and that is the inherent limitation of FAT32 only being able to recognize a 32GB partition size.

      As for '(I belive)2Gb', you are referring to the FAT16 installation of NT4. It doesn't apply to WindowsXP.

      FAT 32 Formatting once you are in the OS requires 3rd Party wares.

      Here let me quote from a command prompt for your edificationâ¦

      Format /?
      Formats a disk for use with Windows XP.

      FORMAT volume [/FS:file-system] [/V:label] [/Q] [/A:size] [/C] [/X]
      FORMAT volume [/V:label] [/Q] [/F:size]
      FORMAT volume [/V:label] [/Q] [/T:tracks /N:sectors]
      FORMAT volume [/V:label] [/Q]
      FORMAT volume [/Q]
      volume Specifies the drive letter (followed by a colon), mount point, or volume name. /FS:filesystem Specifies the type of the file system (FAT, FAT32, or NTFS).


      Notice the last line?

      So where do you get your information, tooth-fairy, or just make it up as you go? Third party utilities are NOT needed to format FAT32 in WindowsXP, as you said âonce you are in the OSâ(TM) â" BTW, did you know that even during Setup â" YOU ARE âIN THE OSâ(TM)? It is running a stub version of NT already at that point; you are already in the NT OS.

      FAT32 Formatting in Setup is an option, but is known to frequently fail or have significant errors.

      Since when? Where do you document this at? Funny in all the time our tech team has put into working with WindowsNT, they can find no reference to this, either online or in their own experience. Tooth-fairy again or just making it up as you go?

      There are NO known issues or documented issues of WindowsNT EVER having a problem running on a FAT partition, either FAT16 from NT4 days or FAT32 of current days. The NT core has an installable File System, like most modern OSes, and its interaction with the file system underneath is irrelevant to how it operates in regard to failure or problems.

      Besides, this is really a moot point. If you are running WindowsXP and it is your only OS, then there is NO reason not to use NTFS. It is faster with larger files, faster with large volumes, supports compression, supports encryption and can have partitions up to 16 exabytes in size.

      If you are running WindowsXP on FAT32, you are missing half the benefits of WindowsXP and the security of NTFS as well as its reliability of sitting on a journalled file system, etc, etc.

      Geesh.

    15. Re:other FSs are out there by Hast · · Score: 5, Informative

      I think you are confusing the terms. Or you just got the threads mixed up.

      Journalling makes sure that the state of data, or meta-data is always defined. So even if you turn the computer off while in the middle of a write command the file-system will be in a correct state afterwards. (The write operation will be lost however.) It is basically a way of making HDD operations atomic. Either they succeed or they fail, the in between state is what puts your drive in a corrupted state.

      When talking about 10% it is refering to space on the disk for the log file to make this possible.

      Now storing the meta data in a database, which is essentially what WinFS and such are doing, is not as clear a benfit. Personally I can imagine that it would be a very practiacal FS for keeping movies and MP3's on. I don't really see the benefits of running the OS files on that FS though. A lot of unneccesary overhead. (I don't search for files in my OS partition very often.)

    16. Re:other FSs are out there by platypus · · Score: 5, Insightful

      Much like databases should be ACID compliant, filesystem writes can and should occur in such a manner that no external journal is required.

      And how would you propose to achieve that without performance going down the drain?
      The guys writing these filesystems are not dumb, and the reasons why journals are used are well considered. Another thing is that ACID compliant databases also use something like a journal to achive the atomicity.
      Oh, and forget softupdates, they are _not_ comparable to journaling filesystems, for instance you still need to fsck, it's just faster.
      Compare that with one of the funnies manpages I know.

    17. Re:other FSs are out there by illtud · · Score: 4, Informative
      As for '(I belive)2Gb', you are referring to the FAT16 installation of NT4. It doesn't apply to WindowsXP.


      AFAIK, the NTFS installation of NT4 won't allow you to have a primary (system) volume > 4GB. This is because NT will install a FAT16 volume and later convert it. This may have been fixed in a service pack, but until you install the OS, how are you going to get a SP on there? Sure, you can
      grow the partition later, but you're being a bit disingenuous in your specifying that this problem is confined to FAT16 installation on NT4, since an NTFS installation *uses* the FAT16 installation.

    18. Re:other FSs are out there by WickerChap · · Score: 3, Informative
      Microsoft use the tools themselves... from the upcoming teched 2003 conference in Barcelona:

      ADM291 Tour of the Sysinternals Tools The Sysinternals.com web site is source of dozens of powerful freeware tools that reveal aspects about the internal behavior of the Microsoft® Windows® operating system as well as the activity of applications that run on it that are otherwise not available, making them valuable troubleshooting and administrative tools. Many are used on a daily basis with in Microsoft's product support and development groups. Take a tour, guided by their author, of some of the most popular tools. You'll learn undocumented tips, see examples of how to get the most out of them, and learn about how they work. Speaker(s): Mark Russinovich (Winternals Software)

      --
      "I love deadlines. I love the wooshing sound they make as they fly past" Douglas N Adams
    19. Re:other FSs are out there by TheNetAvenger · · Score: 3, Informative

      AFAIK, the NTFS installation of NT4 won't allow you to have a primary (system) volume > 4GB. This is because NT will install a FAT16 volume and later convert it [is-it-true.org]. This may have been fixed in a service pack, but until you install the OS, how are you going to get a SP on there? Sure, you can
      grow the partition later, but you're being a bit disingenuous in your specifying that this problem is confined to FAT16 installation on NT4, since an NTFS installation *uses* the FAT16 installation.


      Nice information, but it has nothing to do what I was talking about.

      If you read the post, you will see that I was responding to a post that said WindowsXP had a 2gb partition limit for FAT32 installations. Which is NOT true.

      I never said anything about the primary boot partition of NTFS on NT4 or its size.

      In reference to what you said, you are partially correct, SETUP of NT4 and previous versions will not allow for a boot partition larger than 4GB; however, you can pre-partition a Hard Drive with NTFS using another WINNT computer to whatever size you would like and then install WINNT4.0 on the blank larger NTFS boot partition. (This is a well known workaround in the industry.)

      As an additional note the original Pre NT4.0 setup did not fully load the NT Kernel and related drivers to support the drives or the NTFS file system, this is why the installation created a FAT partition for the file copy portion of the install. NT4.0 did fully load the NT Kernel, but the original installation mechanism for partitioning was left in from the NT 3.x days.

      Windows2k and WindowsXP do not have these restrictions and fully load the NT Kernel during the initial file copy portion of the Setup.

      People wanting more information try:
      http://support.microsoft.com/default.aspx?sc id=kb% 3Ben-us%3B119497

      Take Care,
      The Net Avenger

    20. Re:other FSs are out there by illtud · · Score: 3, Interesting
      Nice information, but it has nothing to do what I was talking about.

      The term for that is 'non-sequitur', and you've just posted a lovely example of one. Let's go back to my post and see what I was replying too (hint, nothing to do with XP at all) - it's the bit you snipped:

      As for '(I belive)2Gb', you are referring to the FAT16 installation of NT4. It doesn't apply to WindowsXP.

      That's what I was replying to. I was attempting to clarify that the limit (4GB, not 2GB) also applied to an NT install in which you specified NTFS (your post seemed to imply FAT16 only).

      I don't think we're disagreeing. I was clarifying a point you made which could imply something which wasn't the case.

    21. Re:other FSs are out there by Perseid · · Score: 2, Insightful

      I currently have a 66GB FAT32 partition. I use it to pass data around between XP, Win98 and Linux. Of course, I had to format the thing using Linux, but once I did all was well.

      The 32GB limit in XP is done on purpose by MS to 'encourage people to use NTFS'

      So does that mean Longhorn will only let you format NTFS partitions to 32GB to 'encourage people to use WinFS'? I shudder at the thought.

    22. Re:other FSs are out there by zeugma-amp · · Score: 5, Interesting

      Now storing the meta data in a database, which is essentially what WinFS and such are doing, is not as clear a benfit. Personally I can imagine that it would be a very practiacal FS for keeping movies and MP3's on. I don't really see the benefits of running the OS files on that FS though. A lot of unneccesary overhead. (I don't search for files in my OS partition very often.)

      Indeed. It seems like what they are claiming as an improvement, (i.e., faster searching for files), does not appear to help what people actually do most of the time. It is similar to claims of "boots much faster!" that you used to hear about new versions of windows. I would think the thing that would be important to people is data integrity and access efficiency. I know my primary concern is "how safe is my data".

      I also question the need to include the overhead of a database frontend to the filesystem. Seems like a catastrophe just waiting to happen.

      Also, since the DB is always active, what issues do you have with backups? I'd be concerned about backup and restoral issues with this type of filesystem. I haven't seen that addressed at all.

      --
      This is an ex-parrot!
    23. Re:other FSs are out there by Talez · · Score: 4, Insightful

      Now storing the meta data in a database, which is essentially what WinFS and such are doing, is not as clear a benfit. Personally I can imagine that it would be a very practiacal FS for keeping movies and MP3's on. I don't really see the benefits of running the OS files on that FS though. A lot of unneccesary overhead. (I don't search for files in my OS partition very often.)

      BeOS used a metadata enabled fully journaled multithreaded filesystem and it was the bomb.

      Mail client? What mail client? You email inbox was a directory in the OS with all the email metadata attributes enabled in the file manager. Mail filters literally became shell scripts. Want to index your MP3s according to artist, album, etc all at the same time? Use the shell scripting and metadata, Luke!

      It also does away with the stupid limitation of extensions. When all your data is stored as a datatype in the metadata listing, who needs them?

      I think that Microsoft are going a little overboard embedding a database into the filesystem, proper meatdata enabled filesystems are a GOOD thing in my book.

    24. Re:other FSs are out there by Salamander · · Score: 2, Interesting
      Oh, and forget softupdates, they are _not_ comparable to journaling filesystems, for instance you still need to fsck, it's just faster.

      That's true, but misleading. If soft updates are done right, the only reason to fsck is to reclaim resources (orphaned blocks etc.). It is not necessary to get your filesystem into a usable state, and can therefore be done in the background after you've come up. Journaling filesystems also still need to fsck, it's just faster and it's called a log redo, and that is necessary to make the filesystem usable. I'd say the two are very comparable, and soft updates come out slightly ahead. BTW, I'm one of those guys who writes filesystems, the ones you say are not so dumb. :-P

      --
      Slashdot - News for Herds. Stuff that Splatters.
    25. Re:other FSs are out there by xsecrets · · Score: 2, Insightful
      It clearly provides information on the only limitations of doing a FAT32 installation of WindowsXP, and that is the inherent limitation of FAT32 only being able to recognize a 32GB partition size.

      Actually The 32 GB limit is purely a microsoft thing to get you to use NTFS. FAT32 can format up to 2 TB. And microsoft will read drives that large it just wont format them that large for you. And there's actually something that will expand that to 4 TB but I can't remember what it is off the top of my head. And no I'm not talking about doublespace or any other compression technologies.

      I won't go into the reasons why but I have personally created a 1.2 TB FAT32 partition. Funny thing is that the easiest way I found to format it was in linux.

    26. Re:other FSs are out there by Salamander · · Score: 2, Interesting

      BTW, cool name. If you ever decide to abandon that identity, let me know. ;-)

      Is it right to assume that softupdates are faster, but journaling might save a little bit more data in case of a crash?

      I'd have to say it depends. The beauty of soft updates is that they require exactly zero additional writes beyond what you'd be doing anyway; you're just being careful about the order in which you do them. Performance is fine, but this pretty much does nothing to ensure that data is consistent without some sort of sync/flush. With journals the picture is more complicated. Yes, there are additional writes, but they can be overlapped with the writes you're already doing so they often don't impact performance that much. Also, there are usually more opportunities to combine/strategize the metadata writes. Ultimately, the performance ends up being affected very little. As far as data protection, it's a big tradeoff. Most journaling filesystems only journal metadata, so they provide the exact same non-guarantee regarding data that soft updates would. If you want to journal data as well you get a better guarantee but worse performance, and it's rarely done; if you're heading in that direction you might as well go all the way to a log-structured filesystem.

      There are certainly ways that either journaling or soft-update filesystems can be tweaked to provide guarantees for data or metadata. In either case, you write to a "clean" set of blocks (never write in place) and take care of the metadata updates in such a way that if the metadata makes it the new data automatically comes along for the ride and if it doesn't then the blocks containing new data get reclaimed. This can be useful in certain cases, but it can also suck massively for performance if you have a lot of sub-block updates.

      As you can see, it's an interesting set of tradeoffs. It gets even better when your filesystem is distributed. No matter what, though, I tend to prefer soft updates due to greater storage efficiency and less need for provisioning/tuning.

      --
      Slashdot - News for Herds. Stuff that Splatters.
    27. Re:other FSs are out there by platypus · · Score: 2, Interesting

      BTW, cool name. If you ever decide to abandon that identity, let me know. ;-)

      Hehehe, I forgot to compliment you for the name of your homepage ;).

      Most journaling filesystems only journal metadata, so they provide the exact same non-guarantee regarding data that soft updates would.

      I read once a paper about softupdates (quite old, I think it's the paper presenting the idea of softupdates for the first time, at least it reads that way), where they (completely IIRC) talk about an "update daemon" which writes the dirty in-memory metadata blocks to disc at regular intervals. That lead me to the conclusion about softupdates loosing somewhat more metadata in case of a crash. But OTOH there's a lag in writing to a journal, also.

      As you can see, it's an interesting set of tradeoffs.

      And as if that weren't enough things to think about, I heard that there are these drives which plainly lie about what has been really written to the platters.

      No matter what, though, I tend to prefer soft updates due to greater storage efficiency and less need for provisioning/tuning.

      Oh come on, in reality you prefer softupdates because you are a BSD zealot. ;)))

      Thanks for your explanations, and if I ever decide to sell my slashdot handle and the attached wellness of super positive karma on ebay, I'll make you a special offer ;).

    28. Re:other FSs are out there by ckaminski · · Score: 2, Informative

      There are database products available out in the world that take a snapshot of a filesystem based on a time interval. All changes after this time do not get added to the backup, even if it's the last file to be backed up, and it's brand new.

      This is standard fare in the database universe, where a change to a record after the backup has started can break tables already stored to tape, therefore destroying referential integrity. Many of the problems we have with backups and system restores today could be alleviated if our filesystems had this functionality built into them naturally. In many systems it goes by the acronym MVCC (multi-version concurrency control).

      It would allow me to kick off a backup, and make changes to system settings, and yet the backup system wouldn't pick up any new settings, the filesystem would queue the changes until the backup "transaction" had ended, and then commit the new changes permanently.

      Windows has a product available for it now that does this, I'm not sure of the name for it though. I have to tell you, the ability of linux to do this at the VFS layer would be killer. Of ANY OS really. And it's come to the point where software is so complex, and so interdependant on the OS (Windows particularly) that it SHOULD be a standard feature.

  2. Can you say SQL Slammer x 100? by xanie · · Score: 3, Insightful

    Now that every Windows user is going to be running a SQL variant on their system, imagine the bugs and holes that are going to be in this. Now THIS will be interesting to see.

    --
    Fundamentalism stops a thinking mind.
    1. Re:Can you say SQL Slammer x 100? by NightSpots · · Score: 5, Insightful

      This is completely unfair.

      How many gaping security holes have there been in NTFS?

      By most accounts, NTFS is one of the better filesystems ever written. Journaling, ACLs, decent performance.

      There is talent in Redmond. Ignoring that is as flawed as assuming the entire Linux community is represented by Sendmail and SCO (security holes and bad publicity). Pretty bad, right?

    2. Re:Can you say SQL Slammer x 100? by eidechse · · Score: 4, Informative

      Uhh...no.

      Slammer exploits a buffer overflow in the sql server "named instance" resolution mechanism. It has nothing to do with the storage/querying/indexing/etc of relational data.

    3. Re:Can you say SQL Slammer x 100? by mindbooger · · Score: 5, Informative
      By most accounts, NTFS is one of the better filesystems ever written. Journaling, ACLs, decent performance.

      ... fragments all to heck and back. Ever notice the absence of "defragger" utils in the non-microsoft world? IIRC, I'd read that NTFS was one of the _worst_ fragmenting filesystems.

    4. Re:Can you say SQL Slammer x 100? by rekkanoryo · · Score: 5, Interesting

      NTFS is actually not bad with fragmentation. I'm running a Win2k Pro box and a WinXP Pro box; neither's partition has ever reached more than 12% fragmentation, and even that was after having the partitions 98% full. Most people won't notice NTFS file fragmentation as a problem until it reaches 50% to 60% anyway. FAT32, however, is quite a different story. I can start to notice performance hits around 9% fragmentation or so. Also, according to my MCSE training kit, the main cause of filesystem fragmentation on windows machines is using a page file that does not have a static size. Using a statically-sized page file can decrease defragmentation dramatically. (If you don't believe me, set your paging file to a static size between 1.5 and 3 times the amount of physical RAM you have, reboot, defragment, and test with every FS torture you can think of.)

  3. db filesystem by Horny+Smurf · · Score: 5, Interesting
    Personally, I still have reservations about using a relational database to keep track of files.

    BeOS used indexing for certain attributes, and it is GREAT. Maybe someone is just sour that linux didn't do it first?

    1. Re:db filesystem by Osty · · Score: 5, Interesting

      BeOS used indexing for certain attributes, and it is GREAT. Maybe someone is just sour that linux didn't do it first?

      I gathered that the quote was alluding to the fact that while the BFS did initially use a full relational database backend, it performed very poorly. Be replaced the backend with a more conventional one, but kept the SQL-like interface to it. It increased performance, but just wasn't quite as cool anymore. Maybe now that PCs have increased in power by several magnitudes since Be last tried this, Microsoft may actually be able to pull it off.

    2. Re:db filesystem by n.wegner · · Score: 2, Informative

      >find . -type f -iname "*.mp3

      I think BeOS has plugins for different file types. IIRC it reads the tags in mp3s, so you could do queries on artist, genre, track #, album, etc. that might or might not be in the filename itself.

    3. Re:db filesystem by Osty · · Score: 2, Insightful

      1. Microsoft has yet to pull anything off to my satisfaction.

      That's your problem, not Microsoft's.


      2. This is probably just an excuse to break with NTFS as the kernel developers are nearing compatibility. I suspect WinFS will be completely closed "for security reasons" and anyone using it will be prohibited from writing Linux code as well.

      Paranoia at its purest. If you were truly paranoid, you'd realize that Microsoft wouldn't go this far just to stop Linux from supporting NTFS. They'd just change a few important things about how NTFS works. They wouldn't throw it out unless there was a good reason to do so. And no, Linux having nearly decent support for NTFS doesn't qualify as a "good reason".


    4. Re:db filesystem by mlk · · Score: 2, Insightful

      How do you store, retrive & manipulate META data with find/grep/awk.
      Now you can, by storing it as part of the file name, but it would suck.

      BeOS did this very well, and MS has had time to copy, it should be pritty good.

      --
      Wow, I should not post when knackered.
    5. Re:db filesystem by be-fan · · Score: 2, Insightful

      Actually, in this case, Linux might very well do it first. Reiser4 has all the bits and pieces in there to be a killer DBFS back-end. They're trying to get good file performance out of objects as small as individual XML elements. Throw on a database layer on top (doesn't have to be that featureful for something like this) and you've got yourself a WinFS competitor. I think MS might have done themselves a disservice by throwing these ideas out there this early. Longhorn isn't due out for almost two years. In that time, it may very well be that many of the features touted for Longhorn have been in Linux for months. Remember, this time frame we're talking about is about how long it took the KDE guys to totally rewrite KDE in the 1.0 to 2.0 transition. In the last year, Linux has had several major subsystems totally overhauled. In a year and a half, Linux changes as much as Windows does in a full release cycle (2-3 years). I would not be surprised if Longhorn has some serious competition when it debuts.

      --
      A deep unwavering belief is a sure sign you're missing something...
    6. Re:db filesystem by Malcontent · · Score: 2, Informative

      "you'd realize that Microsoft wouldn't go this far just to stop Linux from supporting NTFS. They'd just change a few important things about how NTFS works. They wouldn't throw it out unless there was a good reason to do so. And no, Linux having nearly decent support for NTFS doesn't qualify as a "good reason"."

      Well according to their past history this is exactly what they would do. Messing with NTFS might break backward compatibility.

      Why do you think MS has changed their stripes? Same management, same philosophy, same monoply. They are the same company as always and there is no reason to believe that they will act differently today.

      --

      War is necrophilia.

    7. Re:db filesystem by a_n_d_e_r_s · · Score: 2, Interesting

      To stop competitor DB software firms like Oracle.

      "You dont need a database when you have our file system." they now can say.

      MS is basically putting everythingn under the sink into the operating system so that no one can compete with them. A practice they started after they could not have hidden systems calls to make MS own applications faster than competitors and could not force others to ship IE.

      This way all other software companies go belly up since they canÃt offer anything that not alreadu is part of the operating system.

      --
      Just saying it like it are.
    8. Re:db filesystem by Jon+Peterson · · Score: 4, Insightful

      That's a very small amount of metadata. Let's say I've got a picture of mountain that has been digitised and sits on my computer. Here's some meta data I want to store against it:

      a) I want to store that it's a jpg file. I am Japanese and see no reason why the file type should be indicated by a small dot shape followed by three symbols left over from the roman empire that stand for some words in a language I don't speak. File name extensions are very archaic technology.

      b) I want to record when I took the photograph that this picture represents.

      c) I want to record that this photograph is going in my family album

      d) I want to record that this is an 'outdoor' photograph

      e) I want to record that this photograph is of Mount Fuji

      f) I want to record that this particular file is the high-resolution version, suitable for output onto my photo-printer.

      g) get the point? I could have a lots of symlinks of this file to folders called 'my photos' and 'outdoor photos' and 'photos taken in July' but that would be a stupid ugly hack. I could call my file:

      mount_fuji_july-hi_res-outdoor.jpg

      but that would be even more stupid.

      Now, at this point someone says "well, if you want all that then just buy a content-management application that sits ontop of the OS and lets you catalog all your stuff."

      I think MS has realised that pretty much every file _should_ be in a content management system of some kind. They are simply adding a lightweight CMS to the OS, which seems perfectly sensible to me. That way, other applications can make use of the meta-data the CMS holds. The file open dialogue on all windows apps will, instead of a directory browser have the ability to query the CMS for, say 'most recent versions of my hi res outdoor photos' which seems good to me.

      I never fail to be baffled at the degree of inertia in the IT world. I sometimes thing every computer person thinks technology should be frozen at whatever point they got tired of learning new stuff. "File name extensions and symbolic links were good enough for me lad!" It's a weird attitude.

      --
      ----- .sig: file not found
    9. Re:db filesystem by aphor · · Score: 2, Interesting
      Maybe now that PCs have increased in power by several magnitudes since Be last tried this, Microsoft may actually be able to pull it off.

      CORRECTION: Now that PCs ... anyone may actually be able to pull it off.

      What I don't see addressed here is the added complexity of the bootstrap required to support RDBMS based files. Where are you going to stick this bootstrap? I see a tightly controlled licensing arrangement between motherboard suppliers and MS. "Thou shalt not boot except through WinFS bootstrap code which is licensed to you for this purpose. We will revoke your license to distribute WinFS bootstrap if you make us cry. We will take OUR ball and go home, and you will not be able to sell any PCs to our captive users."

      --
      --- Nothing clever here: move along now...
  4. SCO by Michael+Crutcher · · Score: 5, Funny

    Wasn't this supposed to be an SCO story? We're falling behind our quota today.

  5. I'll reserve judgement by mao+che+minh · · Score: 4, Insightful
    It sounds like Win FS will operate a lot like a completely closed ReiserFS. Like the author (and probably many here) I don't like the idea of using a relational database as a FS very much. ext2 might be a bitch to bring up after a crash, but I'll be damned if it isn't stable and well documented. Imagine how well refined ext3 will be in another 12 months.

    In any event, Microsoft still has a few years to refine this "Future Storage" file system, so all judgements concerning it's effectiveness are a bit premature on some levels. Then again, it's always good to start planning as early as possible - especially when you consider that it may be introduced into Windows Server 2003 some time during the next 12-18 months. For now, all we can base judegement off of is Microsoft marketing hype and comparisons to existing file systems that operate in a similar way.

    1. Re:I'll reserve judgement by idiotnot · · Score: 4, Insightful

      They mentioned the problems that occurred when moving a volume from one machine to another. Something like this sounds even more risky if there's a problem. With a FAT or NTFS partition, there are many programs that can at least read the partition if the system gets farked to an unbootable state. How will this work with a DB? Furthermore, will you have to have a full OS up and running to be able to query and get data out of the DB?

    2. Re:I'll reserve judgement by blowdart · · Score: 3, Insightful
      Why doesn't Microsoft adopt an open standard like ReiserFS, JFS, or XFS?

      Simple, none of the *nix filesystems out there support the Windows security model. So, even if they based it on an open source file system, they would have to extend it to cope with the NT permissions model.

      However, that would provide lots of scope for slashdot rantings about embrace and extend...

    3. Re:I'll reserve judgement by ComputerSlicer23 · · Score: 4, Interesting
      Hmmm, several things.

      First, the async, means that not all reads and writes are syncronous which is an incredibly good thing for speed. Try putting your UFS/FFS filesystem into fully sync mode and then talk about performance, I'm willing to bet that UFS/FFS isn't sync by default either. However, calling fsync in the mail server (normally sendmail) in Linux will actually make it sync before returning. So no worries about RFC 1123. It's the SMTP server's job to ensure that it tells the filesystem, make sure the bits are on the disk. If Linux didn't have the ability to ensure bits where actually on the disk nobody would use it. That's why in Moshe Bar's series comparing Linux, FreeBSD, and OS X, he always said he recompiled after removing the fsync calls, otherwise you just compared how fast the disks in each system were.

      For goodness sakes, Oracle ships on Linux, if Linux couldn't get the bits on the disk Oracle would have never ported to it. Not a chance. If Linux tells you the bits are on the disk, they are on the disk in my experience.

      I've heard of people losing UFS filesystems while running them under NFS, or losing them due to any number of naferious VM race conditions. So what? Welcome to the real world, people lose data, buy a tape drive, make backups. Knew a guy who got really good at rebuilding filesystems by using dd on Solaris to recover email for customers.

      Oh, and as I recall, async actually affects directories more then files, if you put the sync modifier on the filesystem, it only affects directories, not the file data for ext2/3. In ext3, directory writes are always journaled as I recall, so it shouldn't make much difference.

      Now, from what I've heard of Linux and FreeBSD, is that until the late 2.2.X and early 2.4.X, there we're certain jobs Linux couldn't do like run big Usenet News services, or really disk intensive applications they the filesystem buffering was really hard to get right, and might cause corruption. The guy who ran a local ISP always said FreeBSD never did that when he was running the Usenet server on it, but Linux did with some regularity.

      ext2 hasn't lot any data of mine in my 7 years of using Linux, including running a 120GB Oracle Database for the past 30 months. Ext3's never lost any data since I started using it. I've lost disk drives, I've lost mirrors, I've lost files, never lost a complete ext2 filesystem unless the disk just stopped spinning. Lost a couple of ReiserFS filesystems after installing RedHat7.0. Never tried most of the other journalling filesystems.

      Kirby

    4. Re:I'll reserve judgement by lpontiac · · Score: 5, Informative
      Simple, none of the *nix filesystems out there support the Windows security model.

      Plenty of filesystems are out there that do ACLs. FreeBSD's UFS2 does, I'm pretty sure SGI's XFS and IBM's JFS do also.

    5. Re:I'll reserve judgement by cookd · · Score: 5, Informative

      I think you miss the point of WinFS. It is actually a layer abover NTFS, so the actual files will still be stored the same way they are presently (and NTFS is pretty good about reliability). NTFS is already a lot like ReiserFS even without WinFS.

      What WinFS adds is more powerful indexing, not a new storage system. Whenever you add or modify a file, WinFS adds the file's attributes to its indexes. The attributes stored are customizable, and vary depending on file type (MP3s have their ID3 info indexed, etc.). For example: somerwhere on my 120 GB disk I have a file named "code.txt" but it will take 10 minutes to find it by scanning the directory structure. Instead, I do a "SELECT Path FROM Files WHERE Filename='code.txt'" and WinFS comes up with the answer right away. If I have full text indexing, I could search for a specific phrase. Even more useful, you don't have to make playlists anymore. Just put all of your MP3s in the same directory, and when you want to hear all of your Bon Jovi, just perform the appropriate query. (Obviously you don't have to know SQL to make this useful.)

      Some of this is already present in a more limited form in Unix. For example, I still can't figure out why Windows doesn't have something like the LOCATE database that is set up by default on my FreeBSD box. But WinFS will blow LOCATE away (update is real time, not daily, and it has much more than just the file name).

      However, from what I've heard, so far it is a bit of a dog performance-wise. I hope it gets up to speed by ship time... At least it can be turned off if you can't take the perf hit.

      --
      Time flies like an arrow. Fruit flies like a banana.
    6. Re:I'll reserve judgement by DarkEdgeX · · Score: 2, Interesting

      Yeah, but would they be 100% compatible with NT's internal representation of how ACL's are supposed to work? Highly unlikely, so that would require Microsoft to still embrace and extend, and with the GPL involved, be forced to give out the changes that were necessary to make the open source FS work with an NT-based OS.

      --
      All I know about Bush is I had a good job when Clinton was president.
  6. For lots of files... by chill · · Score: 2, Insightful

    A relational database setup should do wonders for file search and access. Most filesystems today weren't designed with 200 Gb drives and millions of files in mind.

    I keep thinking back to my Amiga when a 40 Mb hard drive was huge. Hell, I have a keyfob with more storage space now! Can you imagine AmigaDOS (not FFS, the old, slow one) on a 200 Gb drive?

    --
    Learning HOW to think is more important than learning WHAT to think.
    1. Re:For lots of files... by cant_get_a_good_nick · · Score: 2, Interesting

      I keep thinking back to my Amiga when a 40 Mb hard drive was huge.

      I ran a Mac lab where a lot of the machines had 20meg drives, and that wasn't all that long ago. They used to sell a 10Mb drive (I forgot how ungodly it cost) for Apple ][s. Apple DOS 3.3 could only recognize floppy size chunks, about 140Kb IIRC, so the thing had to be partitioned into along the lines of 80 pseudo-drives. I never saw one physically, but I can imagine what the P.I.T.A. that was.

    2. Re:For lots of files... by g4dget · · Score: 2, Insightful

      Most filesystems today weren't designed with 200 Gb drives and millions of files in mind.

      I have no idea what "most" is supposed to mean in this context, but systems like NTFS, ext3, ReiserFS, XFS, and JFS were definitely designed for those kinds of applications.

      A relational database setup should do wonders for file search and access.

      Relational databases aren't designed for millions of records with potentially huge BLOBs in them. In fact, the way Oracle and DB2 handle that kind of data is by putting it into the file system; the database just refers to it and tries to keep it consistent.

    3. Re:For lots of files... by Twylite · · Score: 4, Interesting
      It's a hierarchical organization; it doesn't matter how big the drive gets.

      One word: FAT. You are making three assumptions here. The first is that the underlying implementation is capable of supporting near-infinite extension without degradation. Invalid for FAT, valid for the FS types mentioned in the grandparent, and the reason for what I said. The second is that the file system will be used as a hierarchy, which is invalid for most end users. The third is a combination of the first and second, being that the file system extends without unreasonable degradation to a vasst number of files in a single directory, and performing operations (esp. searches) on them quickly. This is invalid for all of these file systems, because of how they store metadata.

      Why would I want to do "a simple name search for a file across an entire drive"? The file name is meaningless outside the context of its hierarchy.

      Again, you're assumiung you, a technically savvy user. End users don't behave like this. By and large they use meaningful file names in a single directory. If you're looking for a document someone else did, it will be in their single directory, not in a common folder for documents relating to that topic. If you don't know who worked on the document, you need to do a broad search based on keywords.

      Yes, and it's fine for what it is. I certainly wouldn't want the overhead of updating the "locate" database every time a file changes somewhere.

      Which shows how little you've thought about the implementation of this system. You only have to make a change if the file metadata changes. In many file systems you already have to write that change in a different location to changes to the file itself (if you don't, your metadata search time goes out the window). If your "locate" database is a relational database, making a change has trivial overhead.

      Well, welcome to the club. For years, Linux has had several implementations, among them FAM, dnotify, and changedfiles, with hooks into indexing systems, and Linux is hardly the first.

      Actually, this isn't what I was meaning. I was referring to the relationship between the data in the FS and in the locate database (or any other metadata search database), and indicating that WinFS (in theory) takes out the step of building a separate database by using the database as the "index" of the file system. Unfortunately in this incarnation of WinFS (the current implementation) MS will not be implementing it quite in that fashion.

      But to answer your point ... Win32 systems have had file change notification in their APIs from day 1 (NT 3.1 / Win95 + have FindFirstChangeNotification; NT 3.51 + have ReadDirectoryChangesW).

      [snip] you are far better off with a real document database

      And that's pretty much what MS is doing by converging a tradition file system with a metadata view.

      Microsoft should focus on creating a robust file system with decent performance, not get side-tracked with gimmicks. NTFS could still stand a lot of improvement.

      Of course, WinFS was intended for client operation systems, not servers. And while NTFS could still be improved, it doesn't make a lot of sense to do so: most high data volume applications store their data in structured files, and don't require much from the file system in any place where performance could be signficantly improved.

      --
      i-name =twylite [http://public.xdi.org/=twylite], see idcommons.net
  7. This article is bullshit by Zork+the+Almighty · · Score: 5, Troll

    This article is bullshit. There isn't a shred of new information in it. It's like watching CNN.

    --

    In Soviet America the banks rob you!
    1. Re:This article is bullshit by Zork+the+Almighty · · Score: 5, Insightful

      Hey did the person who modded me "Troll" READ THE FUCKING ARTICLE ? DID YOU ? Read it. It's 6 pages of how FAT and NTFS work, along with the classic line that VFAT "was the first system that could write long file names". The LAST PAGE is the only page that talks about WinFS. It's titled "Conclusion: WinFS - The Future". It regurgitates that WinFS is based on the upcoming file system for SQL server, which is *modelled on* a relational database, and gives all the usual hype about how you'll be able to search by content, blah blah. There is not a shred of any information, let alone anything new. The whole fucking article is filler.

      --

      In Soviet America the banks rob you!
    2. Re:This article is bullshit by sprag · · Score: 4, Informative

      Hear hear! I paged through the whole thing hoping to find some _actual_ information, but was really disappointed on page 6 which was basically the summary over again.

    3. Re:This article is bullshit by more+fool+you · · Score: 5, Funny

      Welcome to the marketing revolution. A 6-page ad, made mostly of smaller ads. Why do I suddenly have the urge to purchase Longhorn & more memory?

    4. Re:This article is bullshit by Osty · · Score: 2, Informative

      Worse than that, THG can't even get their facts straight. For example, when discussing fsutil.exe on page 4, the caption of the picture calls it a DOS app (it's not) and say it's from Sysinternals (perhaps they meant ntfsinfo, like the picture shows), yet the article text properly calls fsutil a "command line utility" (which it is) from Microsoft (which it is). While they do mention that it works on XP and not Windows 2000, they don't bother to mention that it's also available on Windows Server 2003, and that it's a system utility that's installed with the OS (c:\win[dows|nt]\system32\fsutil.exe). And just to add insult to injury, the "fsutil fsinfo" command they suggest you run is not quite correct. You need something more like "fsutil fsinfo ntfsinfo c:". "fsutil fsinfo" by itself just gives you another help screen, and not "scads of fascinating statistical information on the file system, volume and MFT."


      All this article does is reinforce my dislike for Tom's Hardware Guide, and gives me ammunition I can use to convince others that THG is crap, too. If you want good hardware reviews, go somewhere good like AnandTech or Sharky Extreme. Hell, you could even go to Blue's News for the daily Hardware Reviews and still get better info. (I've not once seen Blue's link to THG from the Hardware Reviews ... I wonder why?)

    5. Re:This article is bullshit by doorbot.com · · Score: 5, Insightful

      Hey did the person who modded me "Troll" READ THE FUCKING ARTICLE ? DID YOU ? Read it. ... The whole fucking article is filler.

      Tom's Hardware must have degenerated while I wasn't looking. The articles used to be full of detail, two printed pages long per "digital article page" (what you see on the screen ;)). Now it looks like the two paragraphs per "article page" are just used to keep the ads from bumping into each other. The whole article could've fit on one page, but I guess that doesn't get very many banner impressions when you know that Slashdot will link to your story, no matter how high the "filler" ratio.

    6. Re:This article is bullshit by Jugalator · · Score: 4, Interesting

      Also, WinFS is no file system like FAT and NTFS. It's just a service running on top of NTFS.

      It's really funny how they try to compare it with a file system, since they're just looking at NTFS with a layer giving the user an easier time to do certain things.

      --
      Beware: In C++, your friends can see your privates!
  8. Mmmm Hmmm Sure by technomancerX · · Score: 3, Insightful

    Haven't they basically been trying to implement this since the days of Cairo? Seems the 'revolutionary new file system' gets announced for every Windows release that is several years away, then vanishes by the time the release takes place.

    --
    .technomancer
  9. Windows Explorer by Scoria · · Score: 3, Funny

    The upcoming WinFS file system will be the first to be context-dependent, and promises to make long search times and wasted memory a thing of the past.

    Well, yes; we must preserve those system resources for the most recent incarnation of explorer.exe.

    --
    Do you like German cars?
  10. nothing new by g4dget · · Score: 2, Interesting

    This will be better than FAT32 and NTFS, but it is hardly "breaking new ground". A number of operating systems have used more-or-less relational databases as their file systems; it's a special purpose technology and has no place in a general purpose OS. I think ReiserFS makes the right kind of compromise here: it uses a little bit of database technology, but it mostly remains a traditional file system.

  11. WinFS is on top of NTFS by xWeston · · Score: 5, Interesting

    As far as i was concerned, WinFS was not actually a real file system but something that just runs on type of an NTFS filesystem.

    This was actually confirmed at WinHEC:

    "Microsoft has scaled back its 'Big Bang', and its Future Storage initiative will build on, rather than supersede the NTFS file system, when the next version of Windows 'Longhorn' appears in 2005."

    "WinFS is not a file system

    NTFS will be the only supported file system in Longhorn, from a setup and deployment standpoint, though the OS will, of course, continue to support legacy file systems like FAT and FAT32 for dual-boot and upgrade purposes. The oft-misunderstood Windows Future Storage (WinFS), which will include technology from the "Yukon" release of SQL Server, is not a file system, Mark Myers told me. Instead, WinFS is a service that runs on top of--and requires--NTFS. "WinFS sits on top of NTFS," he said. "It sits on top of the file system. NTFS will be a requirement."

    Interestingly, when WinFS is enabled, file letters are hidden from the end user, though they're still lurking there under the covers for compatibility with legacy applications. This reminds of when Microsoft added long file name (LFN) support in Windows 95, but kept using short (8.3) file names under the covers so 16-bit applications would still work. Expect this to be the first step toward the wholesale elimination of drive letters in a future Windows version."

    1. Re:WinFS is on top of NTFS by TheNetAvenger · · Score: 2, Insightful

      Thats ok, windows wasn't a real operataing system when it ran on top of DOS. hmmmmm. come to think of it, it still isn't..

      Windows95,98,ME debatable...

      However, WINNT, Win2k, and WindowsXP are very much their OWN OS, and DO NOT HAVE ANYTHING TO DO WITH DOS. Where have you been?

      How can it be that NT is over 10 years old, and people are still think it is based on DOS...

      And how come people still think that Windows9x and the NT based Windows variants are even remotely the same after all these years?

      Geesh...

    2. Re:WinFS is on top of NTFS by xWeston · · Score: 3, Interesting

      I believe they originally planned to have it as an entirely new filesystem but that they wouldnt be able to hit the mark with it...

      The article that i got some of that information from was from The Register: http://www.theregister.co.uk/content/4/30670.html

      Also, there is more information here: http://www.winsupersite.com/showcase/longhorn_prev iew_2003.asp

    3. Re:WinFS is on top of NTFS by be-fan · · Score: 2, Insightful

      Not really. A database FS is one of the few new advances that add real power to systems. If you think about the human thought process, they don't tend to think of things in terms of strict hierarchies. They tend to look at a set of items from multiple viewpoints, possibly creating a hierarchy within a given subset, but rarely subject all items to the same hierarchy. There are lots of places where a hierarchical filesystem doesn't really make sense. Is there any sane method of organizing a documents directory? Whenever I try to do it, I generate huge, complex hierarchies with only a few files in each directory. Organizationally, its elegant, but its a pain to traverse. Instead, if I simply shove all documents into a single store, I can use the database mechanism to view the file set in a way that is most useful for the task at hand. A media file directory is another good example. Its tedious to organize media files into a strict hierarchy, and very limiting to have to deal with that hierarchy every time, especially if you want a different view of the data. For example, usually, I organize music files by artist, then album. But what if I want to quickly access files based on catagory (blues) or period (80's)? Since the artist trait is usually orthogonal to the catagory and period traits, a simple hierarchy fails to adequately describe the situation. This is why jukeboxes like iTunes or MusicMatch are getting so popular: they are very flexible with how songs are presented to the user. The whole idea here is to take the power and efficiency of something like the iTunes interface, and apply it to file management in general. If a simple query language is supported, such a scheme could potentially be very usable via a command line as well.

      --
      A deep unwavering belief is a sure sign you're missing something...
    4. Re:WinFS is on top of NTFS by TheNetAvenger · · Score: 3, Insightful

      Reread the post again, paying special attention to the word 'when'.

      I suspect people confusing DOS kernels and NT kernels is your pet peeve, and it bothers you so much you see it everywhere.


      Maybe you should reread your post, you said that it 'still isn't'. - Implying that Windows still isn't an operating system, and with the previous reference to the DOS underpinnings you set yourself up for that interpretation.

      Besides, now that we are on the subject, Win9x and its successors also were TRUE Operating Systems, as they handled ALL Input/Output of the OS and the only DOS it relied upon was for compatibility with legacy application or drivers. DOS was basically a 'boot loader' for the Win9x Operating System.

      Go look up the terminology of Operting System, and you will find that Win9x easily fits the definition. This is why the Win3.1 box said 'Operating Environment' and Win95 box said 'Operating System'.

      Maybe your pet peeve is that some people notice when you make an ass out of yourself when you are so quick to find a reason to bash Windows and yet you still have to find a way to respond.

      Geesh...

    5. Re:WinFS is on top of NTFS by Waffle+Iron · · Score: 2, Funny
      WinFS is not a file system

      Microsoft has reached new lows in ripping off ideas from the free software world: They've started using recursive acronyms for names. Where is it going to stop?

  12. Re:Again? by AvitarX · · Score: 3, Interesting

    ummm, FAT 32 has enormous cluster sizes for large drives. That does affect the normal user.

    Fat 16 was limited to 2GB partitions, that affects the normal users.

    Now, the fact that if the database file system works the way I imagine it would it will be a bad thing for the normal user's more tech savy friend.

    I have spent years explaining to relatives that the same file name in 2 places is 2 different files.

    Now I must spend time explaining that if you brows, documents, taxes, and edit file blah it will effect important stuff, file blah and what not.

    People will be confused by this I believe. And I also think the the techies saying it is stupid would benifit from this greatly, I know I would love to organize things with tons of logical ways to browse there.

    But I am not some overpaid market researcher so what do I know.

    --
    Wow, sent an e-mail as suggested when clicking on "use classic" banner, and got a fast response that addressed my msg
  13. Maybe it'll work by localghost · · Score: 2, Insightful

    It sounds interesting, to say the least. Maybe Microsoft has finally come up with something innovative. I'd be interested to try it out and see how it feels, and if it really can do everything they say it can. As usual, though, security could be an issue. A virus could wreak havoc if it found a way into the database.

    Also, I'm wondering if they'll finally give up on that stupid drive lettering. I don't see any reason why that ever had to exist, and now that they're doing an overhaul of the whole filesystem, it seems like a good oportunity to get rid of it. You'd think, since they try to be user friendly, that they would want to give devices and partitions names instead of letters. I do still see it in that screenshot, but things could change by the time it's released.

  14. Re:Again? by timeOday · · Score: 2, Funny
    I have spent years explaining to relatives that the same file name in 2 places is 2 different files.
    man ln :)
  15. Terrible article by dbarclay10 · · Score: 5, Informative

    This article is tripe.

    The closest it gets to examining the (possible!) new Windows filesystem is calling it a relational database, and going on for a bit about how paths (ie: directory structures) will be irrelevant. Oh, and yeah, the closest thing he found to the implementation was called "winfs.exe" and did nothing but produce errors.

    The bulk of the article is a (poor) attempt at explaining filesystems in general, and FAT and NTFS in particular. However, it gets a number of things wrong and - at best - garbles a lot of things. If you already know what he's trying to say, you *may* be able to pick out truths, but if you don't you'll walk away with misinformation.

    I would suggest instead perusing arstechnica.com and aceshardware.com. I don't know if they've done any filesystem stuff, but if they have it'll be of reasonable quality.

    --

    Barclay family motto:
    Aut agere aut mori.
    (Either action or death.)
  16. Oh so informative! by BasharTeg · · Score: 5, Interesting

    I read this article hoping for some real information on the WinFS file system, and instead I got an amature's review of Microsoft file systems I grew up with.

    "There has been much speculation"

    Uh huh.

    "Win FS is modeled on the file system of the coming SQL server"

    Uh huh.

    "In its latest build (M4), Longhorn contains few hints of the technology's imminent implementation."

    Uh huh. You're saying you don't know anything, yeah, I'm getting that part.

    "One of those is more than 20 MB in size and bears the name winfs.exe."

    Neat.

    "In the end, Win FS will probably emerge as an optional file system beside FAT and NTFS. It's also possible that Win FS will supersede its predecessors, however."

    So in the end, it'll be A... but it is also possible it'll be B. I see.

    "That would most likely produce problems for multi-boot systems"

    An astounding feat of logic Mr. Spock!

    This is the most uninformative article I've ever had the displeasure of reading on Tom's Hardware. These people know exactly nothing more about WinFS than any of the rest of us have heard in rumors and vague press releases.

  17. Mod the parent up! by fiftyvolts · · Score: 2

    His post is darn insightful, wish I had some mod points :-/

  18. Re:I can almost guarantee you.... by AndroidCat · · Score: 5, Funny
    I'll even get a paperclip to help me manage my files!!!

    How about Clippy? "I see you're looking for your work files. You're fscked."

    --
    One line blog. I hear that they're called Twitters now.
  19. My turn by poptones · · Score: 5, Insightful
    to say how much this article sucks. I should have known when I saw "Tom's" that it would be nothing but a few pages of useless drivel slapped into a directory as an excuse to sell ads, but I thought because this was a /. "cover story" it might be worth checking out.

    Looks like I was wrong - or, actually, right all along. Musta been a slow news day?

  20. I'd be happy if they just let me use write caching by Gldm · · Score: 4, Informative

    It's really irritating to have a RAID that gets crippled to 5-10MB/sec when on other OS and FS combos it can do 80-100 because MS has decided "Oh performance isn't important, reliability is, so we'll force cache off for all SCSI miniport devices even if it says it's on."

    See http://forums.storagereview.net/index.php?act=ST&f =2&t=1758&hl=slow+scsi+performance&s=9f0e65a3ff482 2032e4a63091694cc3f it never got fixed. Non-RAID IDE is unaffected supposedly due to a "bug" in the system where IDE devices ignore the OS commands to switch to write through caching. It's really ridiculous when a 700MB file takes almost 2 minutes to copy under XP and yet under BSD on the same system dual booted on the same array, it takes 11.2 seconds.

    --

    Introducing the new Occam Fusion! Now with sqrt(-1) fewer blades!

  21. Difference between FAT32 and NTFS by mrklin · · Score: 2, Interesting
    Normal user does not triple boot their system nor do they bother with two versions of Windows.

    NTFS has tons more advantage than FATxx. The official list can be found here. Granted, this benefits the corporate user more than home user.

    At the very least, NTFS offers a quicker way to hide porn than FAT32.

    1. Re:Difference between FAT32 and NTFS by yanestra · · Score: 2, Insightful
      At the very least, NTFS offers a quicker way to hide porn than FAT32.

      Of course. Simply put it in a second stream of some other file. Not even MS experts would look there...

  22. Re:I really don't like this idea.... by SWTP_OS9 · · Score: 2, Interesting

    Not the only one!

    With MS database record they have got to be kidding! I know that Windows CE uses a DB format for storage but I want to see it under max load with "n" task accessing it and the planet worth of data to pull from with a good percentage being changing. Then crash it and try to restore the mess. What would the resulting speed be. Recovery time.

    I guess you will need a 4 to 6 ghz system with an insane speed hd array and memory up the wazu!

    Instead of revamping the wrapper why not improve on surviablity of both data/os/programs! When will they get it in their head that the OS does and should not be a swiss army knife with cheep blades that are dull, usless, break and hard to open!

    I cant remember but they was something based on somthing called "tumblers" that was a way to access data. Read somting in a ancient issue of Byte mag. Had to do with Objects and mondering content.

  23. Good idea by Bodrius · · Score: 3, Interesting

    Hopefully this will encourage more competitors (including open source) to go for the RDBMS-based filesystem model.

    I don't understand the concerns of the poster regarding performance (at least without evidence of truly dismal performance): no one is forcing anyone to use the FS if they are not satisfied with performance.

    For most users, they main bottleneck in storage is their own organizational faculties. I used to be exasperated when users didn't know where they put their files, but once you get past the 100GB mark, it becomes very understandable.

    Consider what most people use their massive storage for these days: videos, music, multimedia, games. Not only is this the kind of content that SHOULD be stored in a database, it's the kind of content that is ALREADY being handled through a database because the filesystem is not enough: people are using their media players, P2P programs and other software to handle their files, up to the point they rarely ever interact with the filesystem unless they lost a file.

    For most users, the performance penalty is well worth the price.

    For those for whom it is not, it doesn't take a genius to realize you can use more than a single filesystem, and perhaps rediscover the joy of proper partition organization: keep the OS and applications separate from your data, and you can use your highly efficient filesystem for the first and your metadata-loaded one for the second.

    --
    Freedom is the freedom to say 2+2=4, everything else follows...
  24. For such an in-depth article... by sn00ker · · Score: 5, Informative
    there sure were a large number of glaring errors.
    1) NT4 (certainly from SP3) allows you to make partition alterations without a reboot. Even 2K requires a reboot for alterations to the boot partition, however.
    2) 2K doesn't dispense with the drive letter concept, despite the implication in the article that this is the case. That you can mount partitions under folders doesn't change this.
    3) You can specify the cluster size when formatting a drive under NT4.

    Also, has anyone actually come across a data centre that is making use of multi-hundred-TB NTFS volumes?
    And, will Longhorn finally do away with the whole drive letter concept?

    --
    "God, root, what is difference?" - Pitr, userfriendly
  25. Better, not best by Peaker · · Score: 4, Interesting

    Relational databases are better than conventional file systems in both performance and transaction management/journalling.

    However, the best solution is that used by EROS, which is for the kernel not to provide a file system at all, but instead provide Orthogonal Persistence.

    This is a much simpler layer for applications, since it doesn't require them to explicitly access the memory and disk separately. It is also much simpler to recover from because the entire state of the whole disk is always known to be coherent with itself at all given points in time, without an expensive journal.

    In terms of performance - it beats the hell out of explicit disk access systems (Both conventional and database systems) because it performs big continuous reads and writes (that don't move the head much) rather than small writes on metadata and file data that forcibly jump the disk head around.

    In EROS then, on top of the Orthogonal Persistence, you can create any arbitrary Objects you want easily - because they're just normal processes with normal memory. Conventional File Systems become useless and objects implemented by processes become a much better and more powerful alternative to files.

    A relational database of the user objects is then much more powerful than a string hierarchy, but this is all the user's choice - and not hardcoded into a kernel.

    1. Re:Better, not best by Anonymous Coward · · Score: 5, Insightful
      Relational databases are better than conventional file systems in both performance and transaction management/journalling.
      Relational databases focus on data relationships not physical storage, which is what conventional file systems do. Performance and transactions are completely orthogonal and depend on typical use and fault tolerance requirements. E.g. XFS has both performance and journaling (only metedata though) and is still a file system.
      However, the best solution is that used by EROS, which is for the kernel not to provide a file system at all, but instead provide Orthogonal Persistence.
      Orthogonal Persistence = leave machine on indefinitely. What happens when errors and/or data corruption occur in core? Planned maintenance involving powering down the machine? How about persistent storage (backups, data transference, system upgrades)? And cost: do you think its cheaper to get a 64-bit machine and 500GB of ram to store EROS processes indefinately or a 32-bit machine with 1GB of ram and 500TB of disk space? The whole reason computer systems developed secondary storage is to address these concerns.
      This is a much simpler layer for applications, since it doesn't require them to explicitly access the memory and disk separately. It is also much simpler to recover from because the entire state of the whole disk is always known to be coherent with itself at all given points in time, without an expensive journal.
      Basically turn everything into memory mappings...and that limits your data size to your address sizes. May not be a problem with 64-bit addressing, but storage is growing exponentially.
      In terms of performance - it beats the hell out of explicit disk access systems (Both conventional and database systems) because it performs big continuous reads and writes (that don't move the head much) rather than small writes on metadata and file data that forcibly jump the disk head around.
      Modern filesystem already make such optimizations as do databases. Heard of LFS? This is a filesystem designed to do this for *BSD.
      In EROS then, on top of the Orthogonal Persistence, you can create any arbitrary Objects you want easily - because they're just normal processes with normal memory. Conventional File Systems become useless and objects implemented by processes become a much better and more powerful alternative to files.
      And backups? Seperating data from instructions? Sharing data across heteregeguous systems? Or is the plan EROS everywhere?
      A relational database of the user objects is then much more powerful than a string hierarchy, but this is all the user's choice - and not hardcoded into a kernel.
      Yes, but a string hierarchy can be used to built a relational database by layering code and design. However, the opposite is probably less efficient, if it is at all possible. Basically, flexibility and modularity are better: a user can always buy addons for database features.
    2. Re:Better, not best by sql*kitten · · Score: 2, Insightful

      Orthogonal Persistence = leave machine on indefinitely. What happens when errors and/or data corruption occur in core? Planned maintenance involving powering down the machine?

      No it doesn't - it just means that from the perspective of an application, there is no difference between malloc() and open(). You ask for some storage, you get it (unless something goes wrong, of course). You read and write to your bit of storage. At some point it may be in main memory, in some cases it will be on disk. The OS takes care of moving things around, you never see it. In Unix, there are the concepts of main memory, the file cache, the swap space, the file system. On (say) an AS/400 these distinctions simply don't exist outside of the kernel - all disk is swap space, all memory is cache, all memory is allocated from swap, and all cache is flushed when a program exits.

      Basically turn everything into memory mappings...and that limits your data size to your address sizes. May not be a problem with 64-bit addressing, but storage is growing exponentially.

      64-bit is something new in the PC world, but in the world of professional computing, 64-bit has been around for a long time.

      And backups? Seperating data from instructions? Sharing data across heteregeguous systems? Or is the plan EROS everywhere?

      IBM solved all of these problems decades ago. I always smirk when Unix people in general and Linux people in particular think they're just inventing something that IBM had back in the 80s :-)

  26. Truth be told... by Da+VinMan · · Score: 4, Interesting

    I'm looking forward to this! I personally am sick and tired of filesystems as we know them today. Today's filesystems are a strict hierarchy, the existence of which is only necessary in the systems of yester-year.

    A filesystem based on a relational database will have some characteristics to which today's filesystems can only aspire:

    1. ACID - In every way that the underlying database supports Atomicity, Consistency, Isolation, and Durability, so now will the filesystem. In so far as the database is robust, the filesystem will be robust. Please spare me the comments about the supposed unreliability of SQL Server. Itâ(TM)s certainly more reliable than NTFS; which is itself very good.

    2. As an offshoot of the above - Imagine multiple file updates to a filesystem which is transactional! Imagine that transaction failing and being able to just rollback the changes without touching every file in your program! Imagine being able to make file changes programmatically without having to worry about locking because the engine will do it for you (just handle any exceptions)! Yeah, you could do all that today if you like. But it takes extensive to make it happen.

    3. Operational characteristics - We can run queries against databases. We can index them. We can cluster them. We can replicate them. We can access them easily from any development platform you can imagine. Now your filesystem is a database. The possibilities make me shiver! :+) Maybe the initial implementation wonâ(TM)t get all this right. But at least it stands a chance.

    4. Another offshoot from #3 - Security. Databases are inherently better than filesystems (IMNSHO) at enforcing security and enabling administration of security.

    I only have reservations about one issue with the database as filesystem area: recovery. Currently, all good and low-tech filesystem recovery tools really are based on the filesystem allocation table sort of scheme. Obviously, databases usurp this category of tried and true tools. However, good tools already do exist that allow recovery of relational databases. Itâ(TM)s just a matter of getting easily accessible tools of this sort into the hands of professionals that need them. It's more of a training issue I guess, but it will still need addressing.

    I know many people will have a knee jerk reaction to this idea, and I understand why. But I would encourage people to keep an open mind to this. While there will probably be some issues with the idea, there's so much more that could easily be done with a filesystem on top of a database than could be done easily (or well) with a traditional filesystem.

    And for you hard-core naysayers out there, you have to ask yourself this: If this is such a bad idea, then why did Oracle provide this as a feature too?

    --
    Please mod this post only if you think others should/n't read this. I have enough ego^H^H^Hkarma. Thanks!
  27. Good idea, but ONLY much further out... by Future+Shock · · Score: 4, Interesting

    Near-Term: this thing should be just as stable as every other MS product prior to version 3.0 of it. (In short, damned lousy). To make it worse, it probably also enables DRM at a file system level...

    Mid-Term: FS finally works, and allows easier retrivial by relevance, author, source, etc. in ways that we can just dream of now. It's the kind of thing we didn't realize we needed until we had it...until it inevitably blows up as all MS products must do eventually. But when it works, we will be fairly happy to have it...especially end users, most of whom can't figure out a hierchical file system in the first place.

    Far-Term: FS is finally able to use it's relational roots to distribute filesystems over multiple processors in an cluster or over a network. Such a system would support atomic, distributed file updates by threads of processes on differing processors (including HyperThreaded procs). Imagine a virtual filesystem that can span your whole-house network, with a single file system image...in WINDOWS.

    So I guess my view is: painful in the near-term, but may be cool to have when they get it right.

    1. Re:Good idea, but ONLY much further out... by Nucleon500 · · Score: 2, Funny
      FS is finally able to use it's relational roots to distribute filesystems over multiple processors in an cluster or over a network. Such a system would support atomic, distributed file updates by threads of processes on differing processors (including HyperThreaded procs). Imagine a virtual filesystem that can span your whole-house network, with a single file system image

      Oh, you mean like the Parallel Virtual File System, right?

      ...in WINDOWS.

      Oh, sorry.

  28. Re:Pretty thorough article by Zenki · · Score: 3, Interesting

    Does it matter? HPFS was created at MS. I guess it explains why HPFS hasn't been improved on recent OS/2 beyond HPFS386, and JFS is now an optional FS on OS/2. JFS is probably a much more capable FS than HPFS anyhow.

    http://www.cs.wisc.edu/~bolo/shipyard/hpfs.html

  29. Fast != Fast by WoodstockJeff · · Score: 4, Interesting
    I can see where a more efficient directory structure might be helpful, but... will they continue to sacrifice file access speed for file search speed?

    I recently installed a Win2K server that is blindingly fast at finding documents and such... but horridly slow at serving up portions of files, for things like legacy database programs. Three of the customer's applications started running at 1/4 speed.

    It got so bad, even after all the "fix win2k speed" patches, that we re-introduced the 200MHz NT4 server to feed the database apps, and the dual-processor 2GHz system just serves up documents!

  30. We don't need no steenkin efficiency by rot26 · · Score: 3, Insightful

    Unless they can keep the overhead to a minimum, I can't see it being as efficient as a file system should be.

    They may have goals other than efficiency. Security, probably. But probably also security's perverted uncle, DRM. As DRM becomes more common, and we "pirates" look for more innovative ways to get around it, locking us out of our hard drives would seem to be a logical if not downright necessary step. It's pretty obvious that a lot of entities out there would benefit greatly from a model where we don't really OWN our computers, we just lease the right to use them. I mean, look, they're already floating the notion out there, at least for software and entertainment media, that we don't really OWN ANYTHING, and it's not that much of a stretch for that to become literal truth, aside from the hardware, which will be as impervious to meddling as they can possibly make it. (You decide who "THEY" might be, but I have a long list with a lot of familiar names on it.)

    How technically difficult would it be for, say Microsoft, to "rent" out portions of your hard drive to various media and software providers, using a combination of hardware and software controls to assure those companies that you and I ABSOLUTELY CANNOT meddle with "their" product while we (temporarily) posess it? A database-driven file system provides exactly the access control and accountability that would be required to successfully implement something like that.

    --



    To ensure perfect aim, shoot first and call whatever you hit the target
  31. Re:Again? by AvitarX · · Score: 2, Insightful

    yeah,
    Because these same people will really quickly adapt to a gear or a foot instead of START.

    And they really like to call me with questions when they buy a new printer and the install CD doesn't work, and then I find out that it is incompatable.

    But their favorite thing is when they decide to make the family tree the family tree maker they buy doesn't run.

    Links in Windows (9x anyway) cause all sorts of confuision too, like when copying the whole start menu to a floppy does not allow them to pirate all of their programs to someone else.

    No, that just points to the program it is not the real program. Really the easiest thing to do is buy another copy and give it to them (we are talking playing card games, not $300 office packages).

    I like and use Linux nearly exclusivly at home (I have SanctionII installed at home on Windows for work reasons), but I do not like to foist it upon my relatives. That is just rude, I like having a huge library of decent slightly less pretty slightly harder to use apps for free (in every sense). Most people like having easier to install (if you don't know anything), cheap and ubiquitose apps and hardware. And I can respect that.

    --
    Wow, sent an e-mail as suggested when clicking on "use classic" banner, and got a fast response that addressed my msg
  32. Re:I really don't like this idea.... by Frostalicious · · Score: 2, Insightful

    It just can't be good. Using MS SQL as a database is bad enough, I couldn't imagine depending on it as a file system.

    I've dealt with a lot of bad products from MS, but SQL Server is not one of them. What exactly is wrong with it? It's been rock solid to me, with good features. I suppose the licensing might be brutal, but I don't care so much about that.

  33. Re:Lindus says "Me-Too"!!! by MikeFM · · Score: 2

    Geeks everywhere have been doing this already for a long time. It's nothing new or innovative. It's even been done in the filesystem code by several different groups (and that's still a bad idea). I've been doing this with Linux and MySQL for years. Once again Microsoft is trying to claim innovation on something everyone else was already doing.

    On the other hand I wouldn't mind seeing a standard library for this purpose if one doesn't already exist. I know Gnome has a VFS library. I'm not sure how much of it is reused by other projects or who originated it.

    --
    At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.
  34. Good idea, bad implementation by Nucleon500 · · Score: 5, Interesting
    Most of what we say is guessing, because we don't know the question MS is trying to answer. I can't think of any goals best met by WinFS.

    A directory tree is a very useful structure, at least to the software. Similar stuff is grouped together, and easily cached. It provides a very clean and simple way of putting data somewhere and getting it back later. This should not lightly cast aside.

    So, you want to use a relational database to keep track of files? Go for it, but instead of keeping track of the files themselves, keep track of their paths. Let the filesystem do the efficient storage, and the database do the efficient lookups. The database can be made faster and smaller, the filesystems can remain as fast as they are, and the files are still there even if the database gets corrupted.

    Put hooks wherever necessary to update the database when the filesystem changes. For example, put a database in the root of each filesystem. Use a stacked mount to mount that disk, so when interesting things happen, the kernel tells a userspace process that updates the database. Then, make some standard libraries that use the database. Make file browsers that can query it, but pass the path to programs. Make save dialogs that can also save metadata about the file, and open dialogs that can search for it. Use LUFS or FUSE to make directories that correspond to queries.

    This is just as effective as what MS is doing, but it's more efficient, it's more compatible, and it doesn't reinvent the wheel.

  35. Anyone remember "On Location"? by podperson · · Score: 3, Insightful

    A Mac desk accessory / extension combination, I believe, that came out in 1990 or so. It allowed you to instantly retrieve lists of file on your hard disk based on name and content (by "instant" I mean the list of matching files changed as you typed your query).

    On my IIci it was perfectly fast. Faster than BeOS queries on a dual 603 box.

    It took a little time to build your index, but keeping it up-to-date was pretty painless. Apple's developer CDs used to ship with On Location indices on them.

  36. Re:lol by chill · · Score: 2, Insightful

    and may I ask, how many people have 1000 40kb files on a hard drive let alone one that is 500MB in size?

    Three words...

    Internet Temporary Files

    A thousand small files is nothing with the default IE cache settings if you have a large drive.

    --
    Learning HOW to think is more important than learning WHAT to think.
  37. So where's the article? by Professor+D · · Score: 3, Informative
    The headline says "... Looks at WinFS." I click on the link expecting to find some details about the upcoming file system and how it's going to be implemented. Instead there's just page after page of someone's high school paper on Windows' file system through the ages.

    Then at the end there's a few paragraphs with no real info about the FS at all. What meta info will be stored? How will the files be laid out on disk? Is there going to be journaling? How about file system integrity and recovery?

    The only thing _really_ learned was that there exists a 20MB beta executable that doesn't do anything. What the frell? It's two years before Longhorn is to be released. (As if Microsoft is going to get it right the first time anyway).

    Mod the whole article down. Way down.

  38. New FS by rf0 · · Score: 2, Interesting

    New file systems always worry me esp with regard to data loss. With FAT and NTFS they are old but they are stable. I've never seen the OS lose data, though if there is a sudden crash then yes, but not during normal operation.

    New FS = New corruption?

    Rus

  39. Be proud mate. by hayden · · Score: 4, Funny

    It's every slashdotters dream to get a "Score:5 Troll" post.

    --
    Nerd: Derogatory term typically directed at anybody with a lower Slashdot ID than you.
  40. Once again... by pergamon · · Score: 2, Insightful

    ...MSWindows inches closer to where BeOS was 7 or 8 years ago.

  41. RTFA, /. !!! by NetFu · · Score: 5, Insightful

    I've been reading and contributing responses to Slashdot for years, but this is by far the worst article I've ever seen posted here. I can't believe whoever posted the article on the "front page" of Slashdot actually read the article -- they couldn't have even read the first page.

    Right on the first page of the article, the "journalist" who wrote it describes disk storage as "memory". In the "Summary" of the article posted on every page, current file systems are described as wasting "memory". This reminds me of every-day users who confuse their computer running slowly (or literally telling them they don't have enough memory) with the need to delete files from the hard drive -- two completely separate things in most situations. This is all aside from the fact that the article doesn't actually tell you much that anybody who's used computers for more than six months doesn't already know. This guy sounds like some of the kids who come to me interviewing for I.T. positions thinking they've got a leg-up on everyone else because they've got some basic experience with PC's and Windows.

    The bottom line is that the guy who wrote this article doesn't have any business writing tech articles without heavy supervision from someone who KNOWS tech, and I don't just mean someone who knows enough to rattle off performance numbers for CPU comparisons (read some other articles on the site). Lastly, Slashdot has no business posting amateurish and misinforming articles like this for the rest of us to waste our time on.

  42. 160,000 Files by canowhoopass.com · · Score: 2, Interesting

    I have ~ 160,000 files taking up 55 gigs on my NTFS partitioned hard drives. It took over 5 minutes on my 1.6ghz machine to come up with that.

    To search for a specific file often takes much longer.

    Personally I look forward to a better, faster file system on Windows. Although I'd still hold off judgement of the new system until it becomes available.

    -
    Rod

  43. Re:I like the idea by Arandir · · Score: 2, Insightful

    The concept of naming files, and sorting them in directories isn't a very good concept, and the proof of it is looking at how everyone here uses playlists to handle media files.

    Another poster mentioned phone numbers. An even better analogy is email addresses. The concept of email addresses and typing them into a TO field isn't a very good concept, and the proof of it is looking at how many people use address books to handle email addresses. Why type in an email address when you can just type in (or select) "tom"?

    Thus I have the situation at work where I can't find anyone's email address. I want to send an email to "John Smith" down the hall. But the company exchange server is global, so I have to scroll down an Outlook list poring over 50,000 John Smith entries to make sure I don't accidentally send it to the John Smith in Kuala Lampur instead of the John Smith down the hall. I finally find the right John Smith and I expect to see the email address so I can write it down for later. But no! It's not available! The way my company has Exchange set up there is no actual email to be found. So I can only send email to John Smith from Outlook, because Evolution, KMail, Netscape, etc., keep complaining that "John Smith" isn't a valid address.

    If that's the kind of situation you want for my file system, then may I suggest you take a long walk off a short pier. There are valid reasons for file names just as there are valid reasons for email addresses. Just because you use a playlist does not mean that the filenames for your MP3s are irrelevant.

    This obviously doesn't require a database filesystem, but I think it's gotten to the point where we *need* some way to assign metadata to files and then deal with files *soley* by metadata.

    We (my, myself and the mouse in my pocket) are already dealing with files solely by metadata, because the path of a file is metadata. Maybe it's not metadata that you want or care for, but there's a lot of people that do. I'm all for having a lot of metadata attached to my data. But you haven't explained why we need to eliminate the current metadata in order to get new ones.

    p.s. Have you ever seen how the typical Windows user works? There are fifty icons on the desktop in no particular order and every document they create goes into a single "My Documents" directory. What makes people think they won't just shove everything into the same metadata category of "unfiled"? I know if I had to specify four or five different categories every time I had to save some work, I would probably shove a crowbar through my monitor within the first week.

    Right now I have the choice of using a file system or a database. Why must this choice be eliminated? Does the concept of a file system offend you so much that you have to eliminate everyone else's use of one? Can't you just go use a database and let the rest of use work the way we want to work?

    --
    A Government Is a Body of People, Usually Notably Ungoverned
  44. Backards(w) by pyrrho · · Score: 4, Funny

    I think we'd be better off replacing the relational database with a file system.

    Just a joke SQLiers, just a little joke. I know they are indispensible. Really. I believe you.

    --

    -pyrrho

  45. Re:I'd be happy if they just let me use write cach by Gldm · · Score: 2, Interesting

    I did use command line copy on both OS's. I've also tried tons of "accelerator" programs that claim to be faster at copying. I tried same partition, different partitions, changing cluster size to every possible setting it allows, etc. With Atto, the drive performs well once write size gets over 256K, but the OS's internal copy routines for both command line and drag/drop apparently want to write one 512 byte cluster at a time and confirm it went to disk before writing the next one, which is crippling the write performance because the writes won't cache or stripe.

    Just "right click, turn on write cache etc" (from a previous post to this) DOESN'T WORK. If you'd care to READ what I originally posted I mention that it indicates write cache is enabled when IT ISN'T. It's pretty obvious whether it is or isn't based on the write performance. When read is 80-90MB/s and write is 1/10th of that, there's a problem. It's called the OS is forcing write-through, i.e. confirm all data to physical disk instead of just write back to the cache.

    As for the controller, it's a 3Ware Escalade 7500-4, not one of those POS promise things. The drives are 4 Western Digital 1200JBs in RAID5. My previous Escalade 6400 and 4 75GXPs in RAID0 had the same problem. (this was asked in one of the previous posts)

    --

    Introducing the new Occam Fusion! Now with sqrt(-1) fewer blades!

  46. Performance is a question of whether they care by adamsc · · Score: 5, Interesting

    I think the performance of WinFS will tell us how serious Microsoft is about really changing the way files are used. Performance is just a question of time and engineering resources - OS X's journaling is slow but HFS+ is an antique filesystem; in contrast BeOS had BFS, a journaled filesystem with all of the indexing buzzwords WinFS claims except free-text context searches and it was also extremely fast.

    The difference isn't features - BeFS supported everything HFS+ does and arbitrary attributes, journaling, much larger file/filesystem support, and indexing and it was still faster. Be simply made performance a much higher priority than Apple has so far; fortunately they've hired the BeFS lead developer and perhaps 10.3 will have some surprises.

    Another good example is ReiserFS - while some of their choices reflect overall design goals (e.g. targeting large numbers of small files instead of BFS's massive videos) they've largely passed the traditional filesystems in most areas despite having to do more work to keep all of the extra features going.

    Microsoft has a number of engineers who do understand performance; the question is simply whether it'll be a significant priority for them to make WinFS fast enough that we'll realistically be able to use it.

  47. Exchange 2000, the Beta for WinFS? by jwgoerlich · · Score: 2, Informative

    Personally, I still have reservations about using a relational database to keep track of files. Unless they can keep the overhead to a minimum, I can't see it being as efficient as a file system should be.

    Microsoft already has a file system that is integrated within a relational database. If you install Exchange 2000, you will see an M: volume on your local drive. You can flip thru this using Explorer as if it were an actual drive. In fact, it is a representation of a back-end relational database. The speed seems adequate to me so long as you have less than 8 GB in your mailboxes and public folders.

    jwg

  48. WinFS was originally a desktop usability thing by jamezilla · · Score: 5, Insightful
    The reason why WinFS was slated for the desktop first (and then later considered for server deployment) is because it was considered a usability enhancement -- not a performance enhancer. Most /. readers won't understand this because they think Linux is easy to use (it's not).

    Usability engineers looked at what users were doing in Windows and they saw that tons of people weren't using the filesystem - at least not directly. They were just putting everything on the desktop. If it was on the desktop, they could find it. They kept folder structures to a minimum and organized things visually (or not at all).

    This posed a significant problem, so indexing and searching and abstracting the filesystem was one of the solutions. Instead of having to navigate a filesystem (hard for many users), you just type in what you're looking for and *poof* it appears. Not sure what you're looking for? Start describing it... *poof* it appears.

    I'm not saying this is the right solution, but technology is not always about cluster size and performance - especially if the system isn't usable. It will be interesting to see how user friendly this WinFS thing is...

  49. Microsoft Marketing 101 by stox · · Score: 4, Funny

    3 years before release: Product will do everything for everyone.

    2 years before release: Product will do everything for the majority of users.

    1 year before release: Product will do many things for many users.

    Release: Well it does something.

    --
    "To those who are overly cautious, everything is impossible. "
  50. Alas it doesn't help by iamacat · · Score: 4, Informative

    I use journaling and still get filesystem corruption that is not automatically fixed (like overlapped extent allocation) once in a while. On the other hand, HFS+ seems to already use a B-tree and file search is quite fast.

  51. data recovery nightmare by drdoc · · Score: 5, Insightful

    Microsoft loves to create incredibly complicated, undocumented and fragile data structures. Consider word: the internal structure of a word document as itself a filesystem, it has a root, a FAT and a bad block list. If even 1 block is damaged the file is useless. There are few repair tools because the internal layout is secret and undocumented. Access and SQL files are worse. In data recovery we frequently recover 90% of the files from volumes that have thousands of bad blocks or other damage. That wont work with WinFS. Are you going to store you precious digital photos on your $80 WesternDigital 80gb drive with its new 1 year warranty? Microsofts (and WD's) attitude will be that you should have backed it up. How do you easily back-up 80gb anyway?

  52. Re:Like Win2K search service? by cookd · · Score: 2, Informative

    NTFS is based on attributes. The filename is an attribute of the file. The ACLs are an attribute of the file. The file data is also an attribute. Some attributes are special and are used by the file system itself, but otherwise a file is just a collection of attributes. So NTFS has always been able to act as a database in the way you mention -- the attributes in Win2K are simply stored as NTFS attributes.

    The only thing missing has been indexing. NTFS is not designed to index everything the way a database does. And it doesn't have a query engine. This is what WinFS will add.

    --
    Time flies like an arrow. Fruit flies like a banana.
  53. Yipee! by Realistic_Dragon · · Score: 2, Funny

    Does this mean that finally the Windows file transfer counter will give sensible answers? Right now the only thing you can be sure of is that whatever it says, it's going to be something else.

    --
    Beep beep.
  54. This is not about performance! by Fefe · · Score: 2, Informative

    This is about denying Linux and BSD compatibility to their file system. NTFS is being reverse engineered as we speak, write access is being worked on, so obviously Microsoft needs something new to get rid of them pesky Linux people.

    All the new and great features could be done with a (simple?) layer between NTFS and the application. There is no reason why Microsoft needs to invent a new file system here.

    1. Re:This is not about performance! by EddWo · · Score: 2, Interesting

      I think that is what it will end up being. NTFS underneath and the WinFS running as a service. Most applications will still use normal file system access calls. The winfs seems mainly for shell integration, searches etc. If the files are still saved on ntfs they will still be accessible from linux, minus the fancy queries. The only problem will be you won't be able to update the database structures from linux, at least not right away.

      --
      "Taligent is still pure vapor. Maybe they'll be the last who jumps up on Openstep... "
  55. "Memory"? by bluephone · · Score: 5, Insightful

    Am I the only one really annoyed at this guys use of "memory" for disk space? I _still_ find customers who can't tell the difference between RAM and HDD, and this guy goes and makes it worse for every twit who thinks they know everything. Yes, I know that one can make a theoretical argument for using the term, but really, in practice, memory is RAM. It's just annoying... I'm surprised to see it at Tom's Hardware...

    --
    jX [ Make everything as simple as possible, but no simpler. - Einstein ]
  56. Been done already: PGFS by Moderation+abuser · · Score: 2, Informative

    Postgres as an rdbms backend to a filesystem front end.

    Designed mainly for version control. Could easily be modified for other purposes though.

    http://www.linuxjournal.com/article.php?sid=1383

    --
    Government of the people, by corporate executives, for corporate profits.
  57. We beta test an atomic filesystem next month by hansreiser · · Score: 5, Informative

    Take a look at www.namesys.com/v4/reiser4_the_atomic_fs.html

    and at www.namesys.com/v4/v4.html.

    We will be adding support for semi-structured data querying in the next major release, assuming that we find funding for it. The semantics for it are described at www.namesys.com/whitepaper.html, which also explains why I don't think the relational model is effective for semi-structured data stores such as a general purpose filesystem is normally used for.

    Best,

    Hans

  58. Two words for you, buddy - "filesystem deadlock" by hansreiser · · Score: 2, Informative

    Avoiding deadlock, dealing with transaction timeouts, controlling permissions on who can keep a transaction open for how long, these are all very serious issues that have us first making transactions available to plugins and trusted processes only in the first release. However, there is a LOT you can do with just plugins and trusted processes to make user data more secure.

  59. NTFS Streams by dmaxwell · · Score: 2, Informative

    I noticed right clicking in Win Explorer in Win2K allowed you to apply attributes to files. Not sure if they were limited to MS Office files.(Sorry, I can't check this anymore ;-)

    I believe that NTFS supports multiple streams of data per filename much like HFS. However, Windows doesn't make the pervasive use of the capability that Apple did in OS 9. I think one reason is that pretty much anything can go in these multiple streams meaning that particular files are tied even more closely to particular applications. Another reason may be that multiple filestreams complicate transfering files from machine to machine.

    You're correct about RPM. It is using a more conventional approach by chucking it's data in a database file. Read about Netatalk's .AppleDouble directories for an approach to foreign datastreams...they can get ugly pretty quick.

    1. Re:NTFS Streams by abulafia · · Score: 2, Informative
      I believe that NTFS supports multiple streams of data per filename much like HFS.

      Sort of.

      OS9 suppored two streams, the resource and the data fork (this went way back before OS9, BTW.) The resource fork was usually used via a defined structure that allowed storage of elements like code segments, icons, text strings, etc. in a common format. (ResEdit rocked.) The data fork was freeform.

      NTFS streams are arbitrary in definition and number. MS played with using streams for Office documents, as best I can tell simply to tie the format to the file system, but was forced to write in more sensical ways to legacy file systems and for some forms of legacy network access.

      NTFS streams also enabled some fun hacks, like letting attackers store cracker tools as alternate streams attached to other files. These streams wouldn't show up in most normal methods of viewing files.

      Read about Netatalk's .AppleDouble directories for an approach to foreign datastreams...they can get ugly pretty quick.

      Very true. You see similar things with some databases' implementation of BLOBs - shoved on the file system with an ugly name.

      I think we are moving to a day where more and more, file systems are used for application access instead of user access. Another layer of indirection. Content management packages already point in this direction. Part of me is not looking forward to it - we'll need tools to rebuild sensical filesystem structure from crashed applications on top of the tools we already have for rebuilding filesystems. But some things can be useful, too. Progress?

      --
      I forget what 8 was for.
  60. Re:ReiserFS by hansreiser · · Score: 2, Informative

    I don't think it is possible to configure ReiserFS wrongly and to lose data as the above suggests. What is very possible is to use it with the wrong kernel and obsolete utilities. If you use ReiserFS with RedHat, get an OFFICIAL kernel from Marcelo, not a redhat kernel, and get the latest utilities off of our website.

    I would take a guess that the users complaining about losing data are redhat kernel users and the ones that are happy are SuSE or official kernel users. When you compare stability of ReiserFS and ext3, try to compare them using the same version of the official kernel. We have generally been more stable than ext3 in the official kernels at any given moment in time, in large part because we were about 6 months ahead of them in the development cycle. Remember that RedHat does not tend to keep up with ReiserFS updates in its nonofficial kernels.

    Recent RedHat kernels are probably much more stable than the old ones because they have bugfixes from more recent official kernels, but remember that these are the guys who in one of their releases compiled reiserfs with debugging turned on to make it slower than ext3, so I would go with an official kernel if it was me.

    V3 of reiserfs is very stable currently. We go for months without bug reports and we have a lot of users in Europe. This is in large part because we put all our new features into V4. V4 will be unstable for quite some time.

  61. Reservations? by Junks+Jerzey · · Score: 2, Funny

    Personally, I still have reservations about using a relational database to keep track of files.

    A hugely conservative Slashdot reader? No way!

  62. Exchange File System by kyoko21 · · Score: 2, Interesting

    Not having read Tom's Hardware review of WinFS, but from the sound of things of the post, if WinFS is supposedly using a relational database to keep track of files, this sort of already sounds like what MS is implementing in some of their exisiting software such as Exchange and Sharepoint. Sharepoint utilizes Exchange-like file system where you can store files into the Sharepoint repository. In the first delivery of Sharepoint, there was an upper bound on the size of an individual file to be stored (I believe it was an 4GB limit) but in the current release I believe there is no upper limit. From what someone had told me if i recall right, Sharepoint utilizes the SQL engine to keep track of the files that are stored in Sharepoint. Maybe MS is just taking what they have learned from Sharepoint and making it more 'general purpose' for day to day use.

  63. Re:db filesystem ... will never be used by most by johnlcallaway · · Score: 4, Insightful

    Let's get one thing straight ... many of these features are already available in one form or another and no one uses them.

    How many people click on Properites in M$Word and put in the information?? How many people download files and leave them lying in whatever directory they just happen to fall in.

    Don't take me wrong ... I'm not against having an FS that enables you to annotate files outside of the programs that create them. I remember working many years ago with FSs that would allow you to automatically keep the last n revisions, which was very helpful when coding.

    It's just that I have a hard time getting excited over something that is going to simply bloat a system and the odds are no one will use.

    My girlfriend is fairly smart, but she still downloads all her pictures into the default folder, and uses thumbnails to find the ones she wants. She has about 1000 of them, and it only takes here a few minutes to find the ones she wants. It would take here no extra work than what this new FS is suggesting to rename the file and/or store it in another folder

    Useful feature, bloatware, Linux beater, or disaster waiting to happen. My guess is all of the above, at one time or another. Some people will use it and spend hours cross-linking files. I'm sure the initial releases will have security or data loss issues until the bugs get worked out. It will take 0.10 minutes for some Linux hacker to reverse engineer the ability to at least read it. And it will probably take up gobs more memory.

    It's all a matter of perspective....

    --
    I rarely read replies, it's my opinion and if you thought about your opinion a little more, I'm OK with that.
  64. DMCA protected BIOS code by aphor · · Score: 2, Insightful

    Microsoft can require the bootstrap call routines from a separate DMCA/Patent protected, Microsoft branded, bootstrap ROM, and can control motherboard vendors through licensing for that chip.

    Nasty Microsoft if they try that. Someone better get some prior-art out there and QUICK!

    --
    --- Nothing clever here: move along now...
  65. AS/400 and the relational filesystem by neurojab · · Score: 2, Insightful

    This idea isn't new... Microsoft as long been trying to nip at the heels of IBM's midrange platforms. The system/38 shipped with a relational-only filesystem over 20 years ago. This filesystem is still alive in the AS/400 and iSeries systems. The advantages really are nice... merging the filesystem and the database, combined with a really smart optimizer, allow you to run a high performance database without having a DBA constantly monitor it. You end up with no tablespaces or any of the rest of the cruft that occurs when you try to build a database using files.

    If it were just that, it might be laudable... but what MS is trying to do here is really more nefarious... MS is really trying to rid itself of all non-MS databases on Windows. This is the old IE vs Netscape trick. Why would you buy a real database when you're forced to buy SQL server anyway? Enough people might not know the difference

  66. Great for Porn collecting by zapp · · Score: 2, Funny

    Finally, I can stop naming my folders like:
    cute brunette
    cute brunette in dress
    hot blonde with guy 1
    hot blonde with guy 2
    hot blonde with dildo 1 ...

    Now I can attach meta data like
    hair color,
    props (clothing, toys, surroundings),
    # partners,
    partner gender(s),
    whatever else... you get the idea ;)

    --
    no comment
  67. Re:db filesystem ... will never be used by most by c13v3rm0nk3y · · Score: 4, Interesting
    ...many of these features are already available in one form or another and no one uses them

    For most file data, perhaps.

    I will use this, and to good effect, as well.

    The point to take into consideration is that the context will also change depending on the metadata available. Your view of the aggregate file objects changes, depending on the context. Not to mention that this same metadata will be available, in the same format, to all participating applications. Your apps can have all the same view, if you like.

    What this means in concrete terms is that your carefully sorted directory of MP3's can look like a file library in iTunes. There are searchable, sortable columns for Title, Album, bitrate, Cover Art, year, label, and whatever (note I did not say "filename", which is just another attribute under a modern filesystem). This is possible with only the most basic gestures on the part of the user, and is remembered for the next time you visit this same view.

    Similarly, a tree of photographs appear in any participating file browser with whatever columns you want (bit depth, format, date taken, date published, ICC info). It's important to consider that you can do this with any arbitrary collection of data, even one's you define yourself (to take the BeFS example, anyway).

    So you can take your collection of widgets, define attributes about these widgets, and your file browser applet works the same for the same user in all applications. It should, anyway. This is why we have APIs.

    To cite your example, why visual grep through a bunch of thumbnails looking for a particular photo when you can just indicate with a few gestures the "type" of photo you are looking for? I like the iPhoto interface when I'm browsing photographs, but if I want a particular photo of the GF from a rough date taken at night, I certainly don't want to browse through 1000's of images, especially when some of them can be hard to discern at thumbnail resolutions. I certainly don't want to do this repeatedly when I'm assembling a photo album on a specific subject.

    Let the computer do the grunt work of selecting a result set that matches my criteria, and then I can use my human abilities to select the object I want, or refine the search.

    Most of us already keep our aggregate file types in associated groups on the filesystem already. In most cases, the tree structure of most filesystems is sufficient. All this does is extended the functionality of the filesystem so that you can choose to abstract aggregate file objects and treat them in a a myriad of different ways. In the most basic sense, you tell the OS, "look, when I have the Explorer/Finder open on this directory of MP3's, make sure you change the column view so it shows this, this and that. In icon view, make sure that mouse-over pop-ups (if enabled) display this that and that. Default sort is alphabetically by Artist's Last Name. I don't want to see the filename, as that doesn't contain any useful information."

    That is, you don't have do anything special to make use of the file attributes in this way. You just tell the ultimate app that all of us use the most (the operating system's file browser) to treat certain directories in a different manner.

    --
    -- clvrmnky
  68. I read the article.... by rew · · Score: 2, Insightful

    .... partly..... and barfed.

    I know a thing or two about filesystems: I work in the data-recovery business.

    This article was written by someone who has some basic details, and has fantasized a lot of incorrect info around it. Bunches of terms are used incorrectly etc. etc.

    NTFS is a very well-thought-out filesystem. It can be made to perform well, and has almost no limits. (it does have limits: For example, files are limited to about 16 Billion gigabytes. Something like that....)

    I sure hope they don't throw away the good things about NTFS....

    Microsoft makes very little "good" software. But the NTFS filesystem generates the impression that it's different. They probably didn't design it themselves.