File Organization — How Do You Do It In 2011?

Directories by Anrego · 2011-02-13 04:46 · Score: 4, Informative

.. seriously.. they still work for me.

I’ve got a 12TB file server (~6TB filled). It’s arranged as follows:

documents/
incoming_downloads/ (before you ask.. yes.. _legit_ downloads)
media/
media/video/
media/video/movies/
media/video/tv_shows/
media/video/tv_shows/some_tv_show/
media/video/standup
media/video/etc..
media/music/
media/images/
media/images/various_subfolders/
code/
virtual_machines/
tmp/
backup_links/
backups/

That’s always been enough for me. Never got into all this tagging/meta data stuff. If there’s anything I’d ever want to search on... I put it in the file name. Indexed every night via slocate.

backup_links is part of my hacked together backup system.

The thing is raid6, setup so two drives can fail without loss of data. I see this as adequate “backup” for stuff that is replaceable (the large portion of my media is rips of DVDs I own... so although it would be a huge pain in the ass to re-rip them all... it’s not impossible). Stuff that is irreplaceable, I backup to separate hard drives (via hot swap trays).

I leave one backup drive plugged into the machine, and keep the other elsewhere. I periodically swap these drives. I have a script that just rsyncs the files and directories pointed to in backup_links (the irreplaceable ones) to the currently plugged in drive (and yes I verified that I’m not getting a backup of my links ;p). This way I always have one drive that has a pretty recent backup (runs nightly), and one drive that has at most a month or so old backup if the plugged in one fails for some reason.

backups is backed up files from other machines.

Keeping everything in one place helps with the organization I think. Most of the other machines on this network are basically just OS installs. All the real files are on the file server. My desktop runs of a small SSD, which is not even half filled.

Re:Directories by RuiFerreira · 2011-02-13 04:52 · Score: 5, Interesting

I basically use the same structure as you but I have an extra directory called "attic" where in practice I end up putting everything.
Re:Directories by Anonymous Coward · 2011-02-13 04:52 · Score: 0

I've also always wondered how people get used to tags. With tagging, it feels as if you put a tag on something then just release that something into the cloud-of-disorganization.
Re:Directories by Anonymous Coward · 2011-02-13 04:52 · Score: 0

Same here. Use directories to organize files, and some homebrew to backup files to external drives and other computers. At any one time I have about 7 copies of my home directory less than a month out of date from current.
Re:Directories by G4Cube · 2011-02-13 04:57 · Score: 1

ok looks good enough to steal...
Re:Directories by Anonymous Coward · 2011-02-13 05:07 · Score: 0

Yep, this is still the most straightforward way to do it. I have a similar layout, though I also have "porn" (obvious) and "software" (for installers, applications, utilities, etc). I also have a "holding_tank" that I just drop everything into; when it gets too full, I move everything out of there and into the proper directories. Also have "emulation" to hold all my emulators and ROMs (NES and SNES mostly). Finally, I have a separate "retail" directory that holds anything that I paid for and the related receipts, serial keys, and whatever else.
Re:Directories by quoob · 2011-02-13 05:09 · Score: 1

I also have an "attic" directory and I use it for files that don't need to be backed up in my regular backup schedule. It contains stuff that I've already archived or that wouldn't be badly missed if I lost it. I'm also considering a adding a place for stuff I know won't be of any use or interest after a few years. I do this already with my dead tree files of receipts and such. I keep things in the order filed and throw out some of the oldest when the file drawer gets full. I should be able to do this for similar computer based data by occasionally purging files with old dates.
Re:Directories by dixonpete · 2011-02-13 05:17 · Score: 1

ya, and what happens if you have a fire and all your equipment melts?
Re:Directories by Anonymous Coward · 2011-02-13 05:35 · Score: 0

He easily restores from backups? Organized files are easy to back up. Disorganized files that are "organized" by means of meta tags and scattered all over a hard drive are much much more difficult to back up.
Re:Directories by Anonymous Coward · 2011-02-13 05:39 · Score: 2, Funny

I used to download stuff directly on my desktop (my habits have since then changed). One day, my desktop was completely full (meaning that there was no more iconspace, I know I could've just used an Explorer window but it was too much of a hassle), so I made a folder called "shit", and I dragged all of my stuff in there (most of the stuff was shit, so the folder was aptly named). It took a month or two to refill my desktop, then I made another folder called "more shit", and I again dragged everything on the desktop in, including the previous shit folder. When it was time to move my files to a new machine, all I had to move was about 10 documents in my Documents folder, my Music folder, and a folder called "megashit", which contained six nested levels of folders named after some variation of "shit", and with a total size of 200 gigabytes.
Re:Directories by bazorg · 2011-02-13 05:45 · Score: 1

... and what if your family add their own files and do not observe your directory rules? What if someone joins the household and bring their own 500GB of files with a different directory structure?
Re:Directories by Anonymous Coward · 2011-02-13 05:47 · Score: 0

I completely agree, a proper directory tree will help you find anything in seconds. How hard is it to create
Music/Artist_Name/Album/
Documents/Taxes/Year/
Video/Movies/Movie_Name
Code/Myprojects/Project_Name
This has worked for me with data from a few MB up to a couple of TB. And, for those odd cases where I'm having trouble finding something, there's the locate and find commands. As long as the files and folders are properly named life is easy.
Re:Directories by sb98052 · 2011-02-13 05:48 · Score: 1

I think what you mean is it still doesn't work for you. Some things suck less than they did in 2003, but not this.
Re:Directories by Anonymous Coward · 2011-02-13 05:50 · Score: 0

ya, and what happens if you have a fire and all your equipment melts?
Then he restores his "irreplaceable" stuff from his offsite hard drive. Which he specified.
Jeez, I know people don't read TFA, but at least read the comment you are snarkily replying to.
Re:Directories by isopropanol · 2011-02-13 05:50 · Score: 1

Similar here, but RAID is RAID1 for the partition where the OS, email (IMAP on Maildirs), and must-not-go-down files are, RAID0 where the must-be-recoverable and re-downloadable stuff goes. Backup is via rdiff-backup (plain files) and cp --sparse=always (sparse iSCSI shares) to an eSATA drive cradle.
Also, I ran multiple backup/recovery drills for scenarios such as accidental deletion or overwrite of files, Partial failure (one mirror of the Raid1, all of the RAID0), and complete failure on a VM before I even ordered the hardware.
Re:Directories by TeknoHog · 2011-02-13 05:53 · Score: 1

Same here. I also use symlinks to organize music based on genre, even though all artist directories are under the music dir. This way it is also possible to file one band under multiple genres.
Actually, pathnames are also metadata. Or at least they can be used to provide a lot of metadata, when properly used. Fancy metadata/tagging systems also need some discipline to work, and sloppy people can lose track of their data despite the fancy tools. I choose the old-fashioned way as it works for me, and is readily accessible with a variety of programs.
I also have a simple filing system for music burned to DVDs. Basically, after burning a new disc I do "find > disc#". To find a filename, I simply grep for it in a directory with all these files. Usually it is enough to find just the disc number.

--
Escher was the first MC and Giger invented the HR department.
Re:Directories by timeOday · 2011-02-13 06:02 · Score: 2

I agree, his structure appeared to be for only 1 person. The top level of organization, really, is /home/username, with permissions set so people can't accidentally mis-file things under others' directory. I've still found that useful on my home computers even though there's no concept of enforced 'security' between users (they could all sudo to each other without a password, though they don't know it). This is because I have one desktop with a submenu for each person that launches their apps in their home directory as them. Trying to make them log in and out to run each program would never work, too much hassle, besides it would kill all the previous person's programs which they usually just leave running.
Of course, sharing is where it gets complicated. The music for my wife and I is in one directory structure. I used to just dump kid stuff under a "kids" subdirectory, but as they're developing individual tastes that doesn't really work either.
Re:Directories by Anonymous Coward · 2011-02-13 06:05 · Score: 0

Indeed. can you also post your address please?
Re:Directories by creeront · 2011-02-13 06:18 · Score: 1

Out of curiosity, what hardware are you using for this?
Re:Directories by Anonymous Coward · 2011-02-13 06:22 · Score: 0

I really like this scheme, as well as the one posted by chill below. The one adjustment I would make is to divide my library into two major subsets. The first would be files that are irreplaceable. These tend to be files I create myself or are sent to me by others and include things like family photos, emails, documents, programs I've written, etc. Files in this group tend to be smaller, accessed or, updated regularly, added to frequently and more personal or sensitive in nature. These files are backed up at a higher rate and to more secure, reliable (and more expensive) storage.
The second group is pretty much everything else. Rips of DVDs, CDs, and software I own, music or software I've downloaded (legally of course), temp files, and all kinds of replaceable things would be in this category. This group tends to be much larger in size. I perform a less rigorous backup of these items. They tend to be write-once, read-many use cases. I may write-protect them to prevent unintended modification.
I could envision a third category that includes system-level files, like virtual machine images, OS files, search index files, etc. Although, I haven't done anything about this category yet.
Re:Directories by Anrego · 2011-02-13 06:28 · Score: 1

That other "elsewhere" drive I store with the irreplaceable stuff .. is offsite. As for the DVDs and other replaceable stuff.. if my house burns down that would be pretty low on the list (and could be re-purchased or replaced.. kind of the definition of "replaceable").
Re:Directories by maxwell+demon · 2011-02-13 06:30 · Score: 1

As long as the tagging is available from the command line, shouldn't the following work?

find ~ -exec hastag backup {} \; -print | xargs backup_files

assuming hastag and backup_files have the obvious semantics.

--
The Tao of math: The numbers you can count are not the real numbers.
Re:Directories by Grishnakh · 2011-02-13 06:48 · Score: 1

It's fairly simple: they have their own directory at /home/username, and they can do whatever they want there.

On my system, stuff which might be shared between machines (media) is in a separate top-level directory (and also on a separate RAID volume). But it's fairly simple: /media/TV, /media/music, /media/photos, etc. Anyone new would just have to conform to that, or perhaps have separate subdirs: /media/music_user1, /media/music_user2 for instance.
Re:Directories by Jorl17 · 2011-02-13 06:50 · Score: 1

Back in the day when I was 11 and used Windows, that's what I used to do. I just didn't call it "shit", I called "Old Desktop Stuff". I ended up wasting 5 DVDs to save all those folders when I migrated to another machine.

--
Have you heard about SoylentNews?
Re:Directories by Kilrah_il · 2011-02-13 06:51 · Score: 1

For which we say: Directories: it works, bitches.

--
Whenever in an argument, remember this.
Re:Directories by Anrego · 2011-02-13 06:51 · Score: 1

LIAN LI PC-P80 with 3x of those 5 drive in 3x 5.25" trayless hotswap backplane deals, and one individual tray hotswap bay (with two trays, one for each backup drive). I went with the PC-P80 because it's got fans on the door.. which means you don't lose cooling by filling all your 5.25" bays, and you get cooling on all of them.
Currently 8x 2TB drives (generic semi-cheap sata.. different brands bought at different times) in a raid6 config, giving me 12TB and setup so that 2 drives can fail. (This only fills two of the backplanes, however I figured it was best two buy the third one, as if I went to upgrade it might no longer be available).
Pretty generic motherboard/processor/ram and a couple cheap sata interface cards. Resource usage on the thing is very low, even with the whole thing encrypted.
Linux software raid, xfs file system (though I'm considering moving to something with checksums.. zfs if it didn't suck on Linux). Shared via NFS.
Usual pseudo-server stuff. I use nut to manage all the various UPSes in my network. I have things set up so that everything powers down before the file server (as even with the various allow fail settings, NFS suddenly dropping _still_ totally hangs up machines).
I run gentoo on the thing. Normally I'd never do this on a server.. but it's what I use on everything else, so it just makes things simpler.
I actually am very happy with my setup. This is my third file server, and I've kind of learned something from each one. This time around I actually developed _written procedures_ for all the various recovery and maintenance tasks.. and tested them before I put real data on the thing. This means when I want to add another drive, or a drive fails and I need to replace it, I have a step by step procedure I can follow (vice the usual frantic google searching and "oh man I hope this works").
Re:Directories by mfnickster · 2011-02-13 06:58 · Score: 1

Yep. I set mine up using aliases (shortcuts to you Windows users, soft links to you *nix folks) so that I don't have to remember the order of the path:
Music/Artist/MP3/Album
Music/Artist/Album/MP3
Music/MP3/Artist/Album
They all work fine.

--
"Slow down, Cowboy! It has been 3 years, 7 months and 26 days since you last successfully posted a comment."
Re:Directories by Anonymous Coward · 2011-02-13 07:01 · Score: 0

I have to ask this while I am thinking about it. I have 10 devices, computers and Tivo's that I am going to put together into some kind of organized system this summer. I want to be able to hold all the media files in one place. Pics, movies, audio, documents and the like. Along with the Tivo's in the home I have a media PC that I run also. So my question is what is the best way to back-up all of this. Should I look into Home Server 2011 even though they have scrapped the drive extender? I want a secure way to keep the data and a way to image the OS and programs from the systems so should I ever need to replace a failed drive in any of the computers it's just a matter of putting that image on a replacement drive.
Re:Directories by lormahoykyd2007 · 2011-02-13 07:03 · Score: 1

I have to ask this while I am thinking about it. I have 10 devices, computers and Tivo's that I am going to put together into some kind of organized system this summer. I want to be able to hold all the media files in one place. Pics, movies, audio, documents and the like. Along with the Tivo's in the home I have a media PC that I run also. So my question is what is the best way to back-up all of this. Should I look into Home Server 2011 even though they have scrapped the drive extender? I want a secure way to keep the data and a way to image the OS and programs from the systems so should I ever need to replace a failed drive in any of the computers it's just a matter of putting that image on a replacement drive.
Re:Directories by clang_jangle · 2011-02-13 07:24 · Score: 0

Same here. A logical, well-maintained hierarchical system is pretty hard to beat, assuming a knowledgeable and capable user, of course. I've known a lot of people though who can barely remember their own zip code, so I also understand the clamor for bloaty db-driven solutions. In OS X it didn't take me long to find how to kill Spotlight. Nice feature, Spotlight -- for people with nothing to do and all day to do it. I find it an awful lot easier to just pay attention to where I put things in the first place. :)

--
Caveat Utilitor
Re:Directories by noidentity · 2011-02-13 07:29 · Score: 2

Mine's even simpler than your attic: I just throw absolutely everything the / directory.
Re:Directories by icebraining · 2011-02-13 07:31 · Score: 2

A directory is nothing but a tag attached to a file, which is then saved on random sectors (possibly non-contiguous) on your hard drive.
The only problem I have with file tagging is the lack of integration; all applications know how to interact with the filesystem, but very few have support for a decent tagging backend.
The Nepomuk project seems promising, though.

--
Dilbert RSS feed
Re:Directories by Anonymous Coward · 2011-02-13 08:15 · Score: 0

Keeping everything in one place helps with the organization I think. Most of the other machines on this network are basically just OS installs. All the real files are on the file server. My desktop runs of a small SSD, which is not even half filled.
This is my answer as well.
Data on the NAS device, OS on my computers' internal drives, working set on a portable E-SATA drive (Lightroom catalog) for speed.
Re:Directories by mburns · 2011-02-13 08:25 · Score: 1

Definitely get people their own account; this is basic. Instead of logging out the other person, just use the switch user feature.

--
Michael J. Burns
Re:Directories by timeOday · 2011-02-13 09:05 · Score: 1

They do have their own accounts. It's just more convenient for everybody to let their programs run side by side (as them), but without any switching.
Re:Directories by Ponder+Stibions · 2011-02-13 09:21 · Score: 3, Funny

I find these useful, but for family stuff I can't recommend a simple hard drive crash enough. They will suddenly know where copies of everything important is, and it'll come down to only a few gigabytes....
Re:Directories by TuringTest · 2011-02-13 09:22 · Score: 1

Thatâ(TM)s always been enough for me. Never got into all this tagging/meta data stuff. If thereâ(TM)s anything Iâ(TM)d ever want to search on... I put it in the file name. Indexed every night via slocate.
Do you realize that folders + slocate indexing is logically equivalent to tagging&metadata search?

--
Singularity: a belief in the "God" idea with the "demiurge" relation inverted.
Re:Directories by Maxo-Texas · 2011-02-13 09:27 · Score: 1

Have /softwarestack
which is the installable version of all software installed on the machine.
In media, I have
classic
cool
I distinguish between adult and porn.
So I have
adult/classic (classic nudes from the 70's and 80's.. .when women had hair and tan lines. And looked more real and less like terminators.)
adult/cool (for things like corset piercing pictures)
adult/pinup (no nudity or rare)
And
Pictures/cool (various cool pictures that elicited an emotional response from me)
Pictures/classic (various pictures from the 40's, 30's, 20's and even earlier)
Pictures/(various subtitles).
Documents are divided into various gaming material folders. I rarely keep anything sensitive on computers attached to the net tho so no tax forms, no bank information.

--
She was like chocolate when she drank... semi-sweet at first and then increasingly bitter.
Re:Directories by Anrego · 2011-02-13 09:41 · Score: 1

In the most abstract sense.. sure.
Making a phone call and sending an email are also logically equivalent.
Practically though, file names and tagging/metadata are very different. Information in file names is universally available to every program on the system. Tagging/metadata requires program specific support. The amount of information you can cram into a file name however is considerably limited when compared to tagging/metadata systems.
Re:Directories by Anonymous Coward · 2011-02-13 09:43 · Score: 0

I’ve got a 12TB file server (~6TB filled). It’s arranged as follows:
Interesting. I have: porn/
I don't have anything yet that requires another folder.
Re:Directories by sgage · 2011-02-13 10:16 · Score: 1

I have a somewhat similar setup, though with an "archive" directory for really old stuff. By really old, I mean my 8-bit CP/M days and early DOS stuff - my master's thesis, my first coding in html, etc. I've been at this for so long that my system has evolved, never really planned it out. Let's just say that in 1982 I wasn't thinking in terms of terabytes of storage :-)
Every time I get a new computer, I move everything along, sometimes consolidating and rationalizing the system a bit as I go.
Actually, my system is not quite as rational as yours, but the point is that after all these years I know where everything is. To me, there's a combination of content type and age that needs to be dealt with. The really old stuff is in its own museum, and I know right where to go.
After a while, it feels like curating a museum, really. I don't care about posterity or anything like that, but I don't want to lose anything. I like being able to look up my journals from the 80's and emails from my VAX account at Uni and ancient newsgroup conversations that I filed, and all that.
Man, time flies...
Re:Directories by Hognoxious · 2011-02-13 10:23 · Score: 2

Finally, I have a separate "retail" directory that holds anything that I paid for and the related receipts, serial keys, and whatever else.
Me too - it's on a 3.5" floppy.

--
Confucius say, "Find worm in apple - bad. Find half a worm - worse."
Re:Directories by Anrego · 2011-02-13 10:30 · Score: 1

I can relate to that.
This is actually my third file server. And before that, I had a similar structure on whatever machine I called my desktop. It has kind of evolved.. as you said.. new system is a good time for cleaning up and moving stuff around... but in general this is just a more refined version of what I had in the 90s.
I don't have anything going back as far as you... oldest I have is some qbasic and turbo c projects in my code directory. I also have some old TV shows that they still haven't released on DVD. They were ripped from VHS recordings (back when that wasn't an easy thing to do!).
Re:Directories by Keen+Anthony · 2011-02-13 10:58 · Score: 1

My "attic" is called "current". It used to be called "unindexed".

I suppose I'm still doing things the "wrong" inefficient way although I've dabbled into metatagging, spotlight searches, and data management software over the years. I keep coming back to folders out of fear of relying on a singular monolithic database that I know will just keep growing and growing. Some of my data has a high rotation rate. Stuff in "current/" reflects the wide collection of text files, random images, and links that I've accumulated since my last system refresh. Most of my other stuff remains untouched for months: photography/, music/, software/, etc.

The problem I have is that it is difficult to satisfactorily categorize groups of data that contain a broad context or a lot of contextual overlap using just a system of folders. I hate having to navigate huge directory trees, so my personal rule is not to go more than 3 directories deep. Music and TV recordings are my biggest trouble area. I categorize music by major genre, then by performer name, then by album. This is a constant exercise as I continue to digitize my large record collection. Categorizing by major genre is inadequate. Most of the music artists in my collection fit across multiple genres. My Yngwie Malmsteen collection is searchable across three different major genres: guitar virtuosos, hard rock, and heavy metal. This is because this guitarist is both a soloist composer, played in a heavy metal band "Steeler" and also played in a hard rock band "Alcatrazz" My Deep Purple and Frank Zappa collection get complicated easily too.

My backup is still USB and firewire hard drives. I still have stuff on optical. I no longer create DVD/Rs or CDROMs for my own storage as they are time-consuming and have never been very convenient. Plus, I get a lot of overlap in my data collections, so one group of related data may only take up 1GB but another will take up 5GB or 9GB, so I have to deal with spreading groups of data across discs. I'm actually running into this problem again with my disparate collection of USB and firewire HDDs.

Metatagging would definitely be better. And Mac OS X makes it easy with Spotlight. But it's such a task going back and metatagging the tens of thousands of files on my backups. What I need as a Mac user is a text field in Finder which quietly loads spotlight comments as I arrow through files, or perhaps the ability to press a key when highlighting a file, and having the file's metatags replace the file name so that I can make edits with the same motions that I would rename a file. cmd + I requires me to refocus.
Re:Directories by Bai+jie · 2011-02-13 10:59 · Score: 1

Dude! Don't name it "porn", name it "paint drying documentaries" or "economics lectures". How else are you gonna hide it from your Wife/Mother/Kids?
Re:Directories by Anonymous Coward · 2011-02-13 11:46 · Score: 0

This. I've occasionally mucked around with metadata tagging, but in the end a logical hierarchy of directories is all the organization I need for my terabytes of data. And easier than mucking around with metadata tagging.
Re:Directories by hazem · 2011-02-13 12:37 · Score: 1

My "attic" is called "current". It used to be called "unindexed".
I have a name that reflects both my optimistic outlook and my tendency to procrastinate: "ToBeSorted". Someday, I know I'll have time to sort all that out, even though I may never actually do it.
Re:Directories by Anonymous Coward · 2011-02-13 12:56 · Score: 0

If there’s anything I’d ever want to search on... I put it in the file name.
Likewise, and if I ever put the date in a file or directory name, I use YYYY-MM-DD format.
Re:Directories by Anonymous Coward · 2011-02-13 13:42 · Score: 0

I want to know how many people call their dump directory "temp" or "tmp". Back when I owned a Gibson, I just concat'd everything into a garbage file.
Re:Directories by Tacvek · 2011-02-13 13:59 · Score: 1

I have for many years kept a directory on my desktop named "test" which contains that which does not come with an installer (emulators, useful utilities, and whatnot), or things like PDFs downloaded that don't seem to really belong in a "My Documents" directory, such as the occasional ebook, or programing language standard, etc.
Stuff gets purged each time I get a new computer, as I carefully consider what to copy or not.
Stuff in my documents is almost always worthless old documents, so they don't usually get transferred, but I look through them, and anything work keeping is kept.

--
Stylish sheet to fix many problems in Slashdot's D3: https://gist.github.com/801524
Re:Directories by cayenne8 · 2011-02-13 15:28 · Score: 1

What OS is this tagging available on?
Not familiar with it.....is it built in, or an add-on you buy (if windows or osx)?

--
Light travels faster than sound. This is why some people appear bright until you hear them speak.........
Re:Directories by cayenne8 · 2011-02-13 15:35 · Score: 1

Why would you hide anything...or need to?
Kids? They don't get on YOUR computer (you do have it password protected at least..don't you?
And the girlfriend/wife? Why would you hide that you look at porn to her? She know's your a guy and you look at it..and if she doesn't like it.tough luck for her....geez, grow a pair, eh?

--
Light travels faster than sound. This is why some people appear bright until you hear them speak.........
Re:Directories by buswolley · 2011-02-13 17:31 · Score: 1

Old Desktop inside Old desktop inside Old desktop inside Old desktop. Yep, that's how I've done it. WHen I got new computer, I looked at the folder and said, "to hell with sorting that" so I put it in a folder called "New Desktop".

--
A Good Troll is better than a Bad Human.
Re:Directories by aaarrrgggh · 2011-02-13 17:36 · Score: 1

Simple directory trees are fine when you have a single person with a consistent style managing everything. It becomes much harder when you add a few OCD people in the mix that work from a few different third-level trees with subtle differences in purpose for their own superimposed top-level logic.
Oh, and they create template trees for everything, so the sub-directories could be empty...
Personal information is pretty easy unless you have a spouse that puts all your music in a directory labeled "crap.". Crap/ is for other stuff!
Re:Directories by ThePromenader · 2011-02-13 19:03 · Score: 1

LOL I do the same, but as parent mentioned, my habits have changed: I no longer download to the desktop, I have a specific folder (on an external hard drive) for everything downloaded through a browser. Today my desktop is populated by mostly screenshots and reference images (for web development/design), and periodically I put these into a 'date_putaway' folder that I dump into a parent 'putaway' directory. Today 'putaway' contains about 6go of data through around 20 sub-folders... but spring cleaning time ~is~ just around the corner.

--

No, no sig. Really.

ThePromenader
Re:Directories by Anonymous Coward · 2011-02-13 23:19 · Score: 0

... and what if your family add their own files and do not observe your directory rules? What if someone joins the household and bring their own 500GB of files with a different directory structure?
You make them manage their own backups.
This really isn't hard. Why would you want to manage their backups anyhow? But to address your point, it's pretty simple. Just put it all under one directory, thus preserving whatever structure they already have.
Re:Directories by cbope · 2011-02-14 01:57 · Score: 1

I use a somewhat similar set of directories to organize all the stuff on my new NAS with 2.68TB of RAID 5 storage (3x1.5TB). I still have a lot of organizing to do so it will be interesting to follow this topic and pick up some ideas. I have files going back well over two decades, though most of the old stuff is pretty useless, except mainly for nostalgia.
Long-term, I would like to migrate the local docs/pics/videos folders on a couple of my desktops to the NAS, to simplify backups. I have a gigabit switched network, so speed of access is not really an issue for most tasks.
Re:Directories by Kagetsuki · 2011-02-14 01:59 · Score: 1

I call mine "Junk", but I do organize it occasionally.
Re:Directories by jedidiah · 2011-02-14 02:00 · Score: 1

Things worked for me in 1994 and they still work for me.
The real problem is that some people are just hopeless. Fancy metadata and tools that hide the filesystem aren't the answer because most people that can't be bothered to organize their own stuff also can't be bothered to tag their own stuff.
Stuff doesn't just magically tag itself.

--
A Pirate and a Puritan look the same on a balance sheet.
Re:Directories by jedidiah · 2011-02-14 02:03 · Score: 1

Chances are, the really important stuff isn't any larger than what can fit onto a modern external harddrive or perhaps the storage that comes with your typical desktop PC these days or even a nettop.
All of my nettops hold backups of the "small" files for my media server. ...as far as drive extender goes: there's always software raid. It's not spoon fed to you but that's life sometimes.

--
A Pirate and a Puritan look the same on a balance sheet.
Re:Directories by scotjam · 2011-02-14 02:07 · Score: 1

" I see this as adequate “backup” for stuff that is replaceable (the large portion of my media is rips of DVDs I own... so although it would be a huge pain in the ass to re-rip them all... it’s not impossible)"

RAID is NOT a backup solution. Ever. I have twice (once on RAID 6 and once on RAID 5) had logical device filesystem issues that were completely unrelated to any particular physical drive, and could not therefore be fixed by replacing one or more drives.

You think that your media rips are replaceable, but with "about 6TB of data" (most of which is likely to be video content I assume) I assure you that when your drive fails and you're faced with re-ripping them, you'll find that "theoretically replaceable" is not the same as "feasibly replaceable".

Oh, and BTW if you do decide to continue without backups (or even if you do create backups) I recommend running a torrent client (even if not connected to the internet) as a way of creating and managing checksums for files - torrents are a great way to identify file corruption. I had a RAID failure where about 30% of my files lost random parts of their data, and having torrents for those files really would have helped! (no way of knowing what video file is corrupt and what's not otherwise except by watching it all).

Yes, there are other ways to calculate checksums, but torrents are easy, visual etc. In fact, torrents in general would probably have been a better solution than ripping (provided you own the DVDs!!) because then the torrent client would have just identified any inconsistencies and re-downloaded those parts of those files where corruption had occurred, thereby avoiding having to go through the soul-destroying process of trying to manually check and identify what had been corrupted and re-ripping / converting the entire file in each case.
Re:Directories by tehcyder · 2011-02-14 02:12 · Score: 1

I basically use the same structure as you but I have an extra directory called "attic" where in practice I end up putting everything.
So why not just have one directoy per user called attic ans stop worrying about creating more sub-folders?
To be honest, is there any point in moving all .jpeg files into a directory called /photos? I mean, what else is going to be there? It's simple enough to sort by file type then name if you should wish to browse through your adult fine art picture collection that way.

--
To have a right to do a thing is not at all the same as to be right in doing it
Re:Directories by RuiFerreira · 2011-02-14 02:25 · Score: 1

I'm exaggerating, i try to keep photos organized, folders for code, etc. I just use this attic to put every file that i'm not sure I wan't to keep until I read and analyze it. There are some things people send to my gmail that I just save there. I have it in gmail anyway if I ever what to find it again, and sometime this folder has several copies of the same file.As I do research, I read lots of articles and I'm not really sure if they are useful or not until I read the abstract and the introduction. Someone sends you a pdf where do you put it? I think I wouldn't lose much information if my disk died. I have all my pictures in flickr. All code and important stuff under versioning... I don't even care about backups... My comment was just to illustrate the problem with directories - the information that does not easily fit is the structure.... I would like to have a solution for this but I don't have any...
Re:Directories by tehcyder · 2011-02-14 02:34 · Score: 1

And the girlfriend/wife? Why would you hide that you look at porn to her? She know's your a guy and you look at it..and if she doesn't like it.tough luck for her....geez, grow a pair, eh?
Amazingly enough, not all women like porn, and not all men would prioritise uncensored viewing of porn over their relationship.

--
To have a right to do a thing is not at all the same as to be right in doing it
Re:Directories by Anrego · 2011-02-14 02:37 · Score: 1

I know RAID isn't _really_ a backup.. that's why I put backup in quotes ;p
I do have backups of the ripped DVDs.. namely.. the original DVDs. I probably have somewhere in the neighbourhood of 500. I’ve actually re-ripped my entire collection recently (to a higher quality format.. which I won’t name in an effort to avoid an unrelated flamewar). Done sporadically, it took about a month. Basically a powerful multi processor box with 6 DVD drives, and a frontend with nice queue support. I wouldn't really call this infeasible. Inconvenient, sure but not infeasible.
Ok.. so now time to put you on the spot. How the heck would _you_ backup 6TB (possibly 12TB or even more down the road). The _cheapest_ solution would seem to be a complete mirror of the system, which seems a little overkill to avoid a months work in the unlikely-ish event of a raid failure. Would take forever to remotely transfer somewhere over the internet, and keeping any kind of backup of thise size in sync would be way more inconvenient then re-ripping a bunch of DVDs.
Worth noting... been running raid systems for years (this is my third file sever, and have used raid on my desktops/other servers for quite some time), never had one fail. Yes I know it's possible (this is why I have local and offsite backups of the irreplaceable stuff), but the closest I've ever come is not being able to start up the array a few times due to hardware glitches knocking out the drives uncleanly. A little gentle massaging (and in one case superblock editing) and have always managed to get things going again with no signs of corruption. Maybe I'm just really lucky?
Re:Directories by cayenne8 · 2011-02-14 02:39 · Score: 1

"Amazingly enough, not all women like porn, and not all men would prioritise uncensored viewing of porn over their relationship."
Didn't say SHE had to like it...if you enjoy viewing it, it hurts nothing, and she should have no say in it.
There's plenty of them out there to choose from...if she doesn't fit the bill, go on to the next one.
If for some reason you want to marry one of them...make damned sure about her before you tie that legally binding knot.

--
Light travels faster than sound. This is why some people appear bright until you hear them speak.........
Re:Directories by McGruber · 2011-02-14 04:29 · Score: 1

Back in the day when I was 11 and used Windows
Back when I was 11, all my data and programs had to fit on a couple of cassette tapes...
Re:Directories by marcosdumay · 2011-02-14 07:53 · Score: 1

When I craeted my current backup procedure I figured up that backing up things while I reindex them was a big thoublemaker. Consequently, now I have an "attic" directory too, but I call it "temp". There goes eveything that will still change name or location, and won't be backed-up until that happens. Now I have so much thing there that I fear missing it, so maybe I should make some backup, but without keeping history.
About all that buzz about tags, I say: To the Hell with tags! It is hard enough to give names and put the files at the correct directory, are OS writters realy expecting me to also tag the files?

--
Rethinking email
Re:Directories by ottothecow · 2011-02-14 08:15 · Score: 1

Nah, this system is kind of cool (and he notes it is for one desktop...presumably someones laptop would operate with individual accounts while a shared family unit can do this).
Imagine in a windows environment (if such a thing were actually feasible), basically having a start menu for each of the 4 members of your family. If Johnny wants to check his email while Mom is editing the volunteer newsletter, he just has to launch the email program from the "Johnny" start menu and it will load up with his server settings and files already in place. Then if Suzie wants to show off a website she found, she can just open the browser from the "Suzie" start menu and her bookmarks will be there waiting.

--
Bottles.
Re:Directories by badkarmadayaccount · 2011-02-15 06:45 · Score: 1

Now that's a Ubuntu feature request in the making.

--
I know tobacco is bad for you, so I smoke weed with crack.
Re:Directories by Grishnakh · 2011-02-15 08:23 · Score: 1

To be honest, is there any point in moving all .jpeg files into a directory called /photos? I mean, what else is going to be there? It's simple enough to sort by file type then name if you should wish to browse through your adult fine art picture collection that way.</i>

I don't know about you, but I'm pretty sure most people with adult photos also have other photos too. Maybe they'd like to keep their family photos separate from their adult photos, and also their non-human photos related to their hobby. Sorting by "*.jpg" isn't going to do that for you, but directories make this simple.

What is with this modern-day aversion to directories anyway?

No Porn? by Anonymous Coward · 2011-02-13 04:48 · Score: 4, Funny

I think you left a directory out. ;)

Re:No Porn? by lordandmaker · 2011-02-13 04:53 · Score: 1

What do you think is in media/video/movies/ ?
Joking aside, that's pretty much what I do. I've never had some turning-point moment where I've thought "I need to do this differently", and it does still Just Work.
Though my collection is way under 1TB at the minute, so I suppose i'm still where everybody else was about five years ago.
Re:No Porn? by Anrego · 2011-02-13 04:53 · Score: 4, Funny

media/video/etc..
I figured it didn't even need to be said ;p
Re:No Porn? by Anonymous Coward · 2011-02-13 04:55 · Score: 0

I think you left a directory out. ;)

/documents/incoming_downloads/ (before you ask..yes..._legit_ downloads)
Re:No Porn? by maxwell+demon · 2011-02-13 05:15 · Score: 1

Ever heard about hidden directories? :-)

--
The Tao of math: The numbers you can count are not the real numbers.
Re:No Porn? by j_l_cgull · 2011-02-13 05:24 · Score: 1

Joking aside, that's pretty much what I do. I've never had some turning-point moment where I've thought "I need to do this differently", and it does still Just Work.

Have been in this boat for a while and just out of curiosity tried to move into something more sophisticated. Using Mac OS X, I was unwilling to get into any organizing software that stores the data in propitiatory format - making it difficult/impossible to get the data out. Ended up with EagleFiler, which uses tags that can go into/come out of OpenMeta. Can also use directory structure if that's your style. Since the data is just regular file system contents, you can get to the data using any means.

Text searches are far better than Spotlight's performance.

Using two different external USB drives for backups using TimeMachine ensures some amount of safety from hardware failures (have had to restore once when the internal HD died).
Re:No Porn? by commodore6502 · 2011-02-13 06:37 · Score: 1

I convert my files (Star Trek TNG, Babylon 5, etc) to storebought DVDs, put them in a caselogic notebook, and store them on a bookshelf. Much neater way of keeping those types of things.
Other stuff just keeps its original title - such as OuterLimits and BillboardHot100 on the main F: directory (with appropriate subdirectories like Season1 or 2009).
That leaves a a few random files, like GIF avatars and old resumes, that I just dump in my "stuff" folder. I also have a "download" folder for bittorrent files until I get a chance to watch them and delete them.

--
Information wants to be expensive AND wants to be free. So you have Value vs. Cheap distribution fighting each other.
Re:No Porn? by Anonymous Coward · 2011-02-13 08:02 · Score: 0

documents/

First item on the list, buddy.
Re:No Porn? by Anonymous Coward · 2011-02-13 12:32 · Score: 2, Funny

He keeps his porn in media/music/Nickelback because he knows no one would ever look in there.
Re:No Porn? by dynamo · 2011-02-14 01:43 · Score: 1

Yeah, nice thought, but come back in 5 or 6 years and try to read those DVDs, you're in for a nasty surprise. The shelf life on burned optical media can be surprisingly short - it helps to vacuum seal and keep cool but for long term storage, I have already come to regret making the choice you suggest.

Arcane commands by Anonymous Coward · 2011-02-13 04:50 · Score: 0

What's wrong with typing arcane commands at the terminal? And who said I unleashed that upon my extended family?

Re:Arcane commands by maxwell+demon · 2011-02-13 05:11 · Score: 1

What's wrong with typing arcane commands at the terminal?
It means you still have too few scripts. :-)

--
The Tao of math: The numbers you can count are not the real numbers.
Re:Arcane commands by Arker · 2011-02-13 05:15 · Score: 0

That was the one part of the question that makes no sense to me. I have the same issues and was hoping to find some good answers posted, but then I notice that the question is pre-rigged to exclude the best answers!
Face it, searching a file system for relevant documents is one of those tasks where the best tool for the job is always going to be a command line. It doesnt have to be "arcane" but then that seems to be a word that is (mis)used solely by anti-command-line people to refer to any command line, not just those that are unnecessarily complex or difficult for humans to parse.

--
=-=-=-=-=-=-=-=-=-=-=-=-=-=-
Friends don't let friends enable ecmascript.
Re:Arcane commands by osu-neko · 2011-02-13 08:43 · Score: 1

...It doesnt have to be "arcane"...
Aww, that takes all the fun out of it!

--
"Convictions are more dangerous enemies of truth than lies."

Two, er make that three folders: by Anonymous Coward · 2011-02-13 04:52 · Score: 2, Funny

Fap and non-fap, and must-not-fap.

Re:Two, er make that three folders: by Fysx · 2011-02-13 04:54 · Score: 0

lol

Home Server Solution by Anonymous Coward · 2011-02-13 04:52 · Score: 0

i use windows home server and fill the shares with nicely named subfolders. around 6tb by now.

one could use linux for that, too. just make sure it's something that hides physical disks away from you.

My solution by Average_Joe_Sixpack · 2011-02-13 04:53 · Score: 1

If hasn't been accessed in 3 years then it gets deleted [porn files 6 months].

Re:My solution by Anonymous Coward · 2011-02-13 05:43 · Score: 0

If I adopted your solution then a good 50% of my Photo Archive would be gone forever.
Then I wouldn't have been able to sell that picture of the two puffins colliding in flight that I took in 2003 for $500.
Oh well.
Re:My solution by woboyle · 2011-02-13 05:49 · Score: 1

Well, I have data that I want to keep, but haven't looked at in 5-10 years (or more). Just like a book on the shelf. I may not have read it in a dog's age, but I still want to keep it (when was the last time you read Little Women/Little Men?). So, we need a means to archive, but keep accessible, data that has value over time, but may not be currently relevant.

--
Sometimes, real fast is almost as good as real-time.
Re:my solution by woboyle · 2011-02-13 05:53 · Score: 2

Some savant once said "DON'T TRUST ANYONE!" with your money or your wife (or today, your data). I think that includes Google...

--
Sometimes, real fast is almost as good as real-time.
Re:My solution by skids · 2011-02-13 06:43 · Score: 1

Similar: if I haven't used it in a while, it gets flushed to a backup CD-R (because I still have stacks of them) and thrown in a drawer. If it's important enough to find it later, it will be important enough to slog through the backups.
Except for code. If it's old I check to see if it is still needed. If it is, I modernize it, make sure it still works, and commit it somewhere online. Not working on anything proprietary makes that easy.

--
Someone had to do it.
Re:My solution by santax · 2011-02-13 07:10 · Score: 1

I know how you feel, if I had done it like that the video I took of 2 girls and a cup would never made it big time.
Re:My solution by hedwards · 2011-02-13 07:40 · Score: 1

In the past that was more of a necessity, these days at the rate that storage capacity is increases other priorities tend to arise. For instance the number of files I have is more contingent upon my ability to organize and keep organized the files as well as perform regular backups. Given that a standard DVD is somewhat less than 9gb and that you can get a 1tb disk for under a hundred dollars, that's enough for a fairly typical DVD collection right there, considering most DVDs don't use anywhere near the full capacity of a dual layer disk. Uncompressed Bluray is of course a completely different matter.
Right now I only have about a half terabyte or so of data that I have to keep backed up and something like 3tb of hard disk capacity. My main issue is making sure that I can find my files and ensure that they aren't succumbing to bit rot.
So, personally, I stick to a file system convention where possible and sort those basically by priority for backup. Certain folders like my downloads I tend not to delete until I need space, which shouldn't be for quite a while.
Re:My solution by xaxa · 2011-02-13 09:55 · Score: 1

Similar: if I haven't used it in a while, it gets flushed to a backup CD-R (because I still have stacks of them) and thrown in a drawer.
CD-Rs fail. Just last week a friend mentioned a book, and I remembered I had the audiobook on CD-R (ahem). It was burned in about 2007 onto a decent CD-R (TDK Metallic), used once, then kept in a cool dry place. The disc won't read on any of my drives. That particular disc is no big deal, but I don't feel I can rely on CD-Rs any more.
Re:My solution by Anonymous Coward · 2011-02-13 13:14 · Score: 0

I tought this was just one guy and one jar?
Re:my solution by marcosdumay · 2011-02-14 08:10 · Score: 1

Yeah, don't trust anyone, even if the ones says he does no evil.
Religious folk should already have teached that lesson to people, but people never learn, it seems.

--
Rethinking email

Mac Spotlight by Anonymous Coward · 2011-02-13 04:54 · Score: 0

For personal documents and such, I don't even worry about it any more. Just make sure that I don't put anything important in /tmp, and then let Spotlight find it for me. The only problem is when you have several versions of the same document, and you they all hit on the Spotlight search. So you just need to discipline yourself to not have a zillion copies of them (let Time Machine do that for you).

Directories are fine until you forget what your taxonomy is, plus the "let's build an arbitrary genus on the fly" syndrome.

Nah. Ad hoc organizing when I feel like it, but Spotlight find them for me no matter how bad I mess it up.

Re:Mac Spotlight by sauge · 2011-02-13 05:22 · Score: 1

I second this - I have come to find spotlight (Apple OS X's search tool) to be wonderful. It can search through everything - email entries, pdfs, text files, word processing files, etc. (And I am a find . -name "*" -exec grep -Hi searchterm {} \; user.)

Add to that, Apple's file system allows comments and tags to be added as file attributes that spot light searches on.
Re:Mac Spotlight by newcastlejon · 2011-02-13 05:51 · Score: 1

I think my Spotlight must be broken... It used to work just fine but I typed in "Dracula" before I started this comment: two minutes later I gave up on moaning and decided to moan on slashdot. I store all my rips in either /Volumes/External/Movies or /Volumes/External/TV and "Dracula, Dead and Loving it" is in the former. Oddly, Spotlight doesn't find what I want when I search for "Dracula", but has no problems with "dead".
Frankly, I used to use spotlight all of the time but since realising it was often quicker just to find stuff myself I practically gave up. The problem wasn't a lack of speed per se, more that it randomly refuses to find some search terms.

--
If God forks the Universe every time you roll a die, he'd better have a damned good memory.
Re:Mac Spotlight by clang_jangle · 2011-02-13 07:39 · Score: 0

The Spotlight db gets corrupted. Read this.

--
Caveat Utilitor
Re:Mac Spotlight by 0123456789 · 2011-02-13 20:36 · Score: 1

For me, spotlight became much more useful when I discovered you can use it from the command line - mdfind. The man page is a must to use it well, of course.

Hierarchical Diectories by Dakiraun · 2011-02-13 04:57 · Score: 1

Hierarchical named directory structures is how I organize things. I've actually been relatively conservative with the data I keep around, and have about 600g of it, with maybe 100g being irreplaceable. Everything is organized via an appropriately named directory with appropriately named sub-directories, sub-sub-directories and so on. The files themselves are also named with an appropriate name as to contents. I was doing this long before libraries and "tags" and stuff came along, and I've just always kept doing it that way since I just don't have the time to go back and "tag" thousands if not tens of thousands of files. For me, this named-directory approach has been best due to it's simplicity - the structure is easily transferable to any OS, and easily understood by anyone that sees it. It requires no application to handle it or interpret it either. I can't see myself deviating from this method even with 10 times more data as it would continue to be effective regardless of the amount of data I collected.

Re:Hierarchical Diectories by symes · 2011-02-13 10:24 · Score: 1

Similar here - except I add a year directory... so it is like projects/2009/descriptivename/, projects/2010/descriptivename/, media/2009/..., media/2010/... and so on. so once something is finished or reaches a milestone, it gets moved across from a /working/ folder into this hierarchical structure. if a project forks then everything up to that point is copied across and then the project continues to grow in the /working/ folder. it does mean that there is some duplication, but i just see that as an additional layer to by usual backup routine. i'm also in the habit of saving important email into project folders as .eml files, as an additional and sometimes useful form of backup. although i should say that i am still struggling to find a useful way of archiving email. i also save personal photos by /year/month/ which, for me at lest, makes a lot of sense. it is very interesting to hear what everyone else does though. ever since an engineer replaced the wrong disk in a mirror raid i've been terrified of significant data loss (i didn't lose anything, but it did leave me slightly more anxious).
Re:Hierarchical Diectories by Dakiraun · 2011-02-13 23:56 · Score: 1

Yes, I do that as well for some things like Pictures, videos and some document types. Again, simple, but works well. :)

My approach by mikael_j · 2011-02-13 05:01 · Score: 2

I've tried forcing myself to use various schemes including relying completely on metadata and search. The last couple of years this is how I've ended up setting things up:

"Public" network storage

This is for data that should be accessible to the entire network at home. NFS mounted on all my machines, stored on ZFS volume on my file server.

Software/Applications - Application installers and ISOs.
Software/Games - Game installers and ISOs.
Video/compressed - Download directory.
Video/Movies - Hard links from Video/compressed, naming set to work with Plex (for looking up movie info from imdb).
Video/TV Shows - Hard links from Video/compressed, similar naming as for movies.
Music/Rips - Music I've ripped myself, organized by artist and album name.
Music/Downloads/Singles - Single songs downloaded, organized by genre.
Music/Downloads/Albums - Whole downloaded albums, organized by genre.

Private network storage

I use my home directory on the file server (also on the ZFS volume) for storing personal files and mirroring home directories from client machines in ~/Backup/homes/.

Local storage

On individual client machines I generally try to stick with whatever the operating system tries to make me use with an rsync script that syncs everything to the file server (automatically for desktops, run manually on portable machines).

This is what works for me. I would probably have stuck to the "just use metadata" approach if most user interfaces didn't seem to try and make it a major chore to edit and view metadata...

--
Greylisting is to SMTP as NAT is to IPv4

Ultrafast search and metadata filesystem by Twinbee · 2011-02-13 05:02 · Score: 5, Interesting

I have recently found an incredibly fast search tool called Everything. We're talking about Google-like searching where the results pop up as you type. It must be something on the order of a fifth of a second for my 1.5 million files. This kind of technology should be widespread - it makes searches actually *pleasant* to do. Anyway thanks to Everything, I worry less now about where I store my files, and I also try to pack in keywords into the filename.

Anyway, this kind of program is just a glimpse of what a future OS would look like. Imagine a system where everything is stored in tags and where folders become obsolete or used far less often. What you have then is a database or metadata file-system. The relatively new Haiku OS uses such a system, and I wrote about the massive advantages from this old page:
http://www.skytopia.com/project/articles/filesystem.html

Honestly, we'll all be better off the sooner we switch.

--
Why OpalCalc is the best Windows calc

Re:Ultrafast search and metadata filesystem by JSG · 2011-02-13 05:21 · Score: 2

Everything (just looked at the homepage) looks just like "locate" (slocate, mlocate etc) which is a long standing *nix system tool. Oh and with a GUI frontend. There's plenty of those for locate as well.
As to that sort of metadata based FS, it seems to be really hard to do properly and despite it seeming like a good idea, not many are screaming out for it. If they were we'd all have one by now.
My money is on it being hard to do whilst not sacrificing performance. FSs are bloody hard - watch the development of any of the new breed of FS eg BTRFS, EXT4 etc.
Re:Ultrafast search and metadata filesystem by maxwell+demon · 2011-02-13 05:36 · Score: 2

Reiser was going to go into that direction (at least if I understood the description on the namesys web sites correctly). But then, development stopped because Reiser turned out to be a real killer ...

--
The Tao of math: The numbers you can count are not the real numbers.
Re:Ultrafast search and metadata filesystem by Anonymous Coward · 2011-02-13 05:59 · Score: 0

I have recently found an incredibly fast search tool called Everything. We're talking about Google-like searching where the results pop up as you type...
So... like Spotlight, then?
Re:Ultrafast search and metadata filesystem by timeOday · 2011-02-13 06:11 · Score: 5, Interesting

Imagine a system where everything is stored in tags and where folders become obsolete or used far less often.
It bothers me when people think tags are fundamentally different from folders (directories) in the first place. I'm going to re-introduce directories as "hierarchial tags" and blow everybody's mind.
Maybe it's because people think of directory membership as exclusive? But it isn't. You can link a file into as many directories as you like with the 'ln' command. If that hasn't caught on, and if Windows Folders don't even really support that, it's because most people just don't bother... and the same is/will be true of tags by any other name.
Re:Ultrafast search and metadata filesystem by asvravi · 2011-02-13 06:58 · Score: 1

Vista has it... and I guess so does Win7. Why don't I see it being exposed or people actually using it?
Re:Ultrafast search and metadata filesystem by foobsr · 2011-02-13 07:14 · Score: 1

not many are screaming out for it

Probably since those who use a CMS do not scream?

CC.

--
TaijiQuan (Huang, 5 loosenings)
Re:Ultrafast search and metadata filesystem by 0123456 · 2011-02-13 07:31 · Score: 1

If that hasn't caught on, and if Windows Folders don't even really support that, it's because most people just don't bother... and the same is/will be true of tags by any other name.
Indeed. When I copy 2,000 images off my digital camera, I am not going to go through and add tags to every single one, whereas I might well cut and paste them into three directories for the places that I visited on that trip.
Tagging all files requires a huge amount of effort that most people simply aren't going to do.
Re:Ultrafast search and metadata filesystem by Garble+Snarky · 2011-02-13 07:45 · Score: 1

Thank you.

I've been trying to figure out the best way to organize my data lately, and I've been looking into tagging systems. I have tried using symbolic links before, but didn't like the system I came up with because I was unable to move the original file. I think I've always avoided using hard links because a long time ago, someone told me they were dangerous for some reason (I know enough to get by in Linux, but I'm far from an expert on the details of how things work).

Your post prompted me to quickly reinvestigate hard links, and I realized exactly what you just said - they ARE identical to tagging systems. My problem is now solved.
Re:Ultrafast search and metadata filesystem by Waffle+Iron · 2011-02-13 08:23 · Score: 1

Hard links can be dangerous if you forget about the fact that you have multiple links. You get what appear to be multiple copies of your files, but in reality there's only one copy of the data. If you modify the contents under any of the links, they all get irrevocably changed. I've been burned by this before.
However, that problem may not be much of an issue if you're just making a read-only archival file collection.
Re:Ultrafast search and metadata filesystem by markass530 · 2011-02-13 08:27 · Score: 1

yea it came with a mod'd Win7 install I used. Didn't even know what it was, or use it for several months. It's now my favorite app EVER.
Re:Ultrafast search and metadata filesystem by Twinbee · 2011-02-13 08:48 · Score: 1

It's easy to think of a way to batch tag multiple files at once. At least in theory.

--
Why OpalCalc is the best Windows calc
Re:Ultrafast search and metadata filesystem by timeOday · 2011-02-13 09:02 · Score: 2

You're right, but I think any tagging system is complex enough to create similar issues. Say you divorce "Joan" and decide late one night to delete all pictures tagged with "Joan." Whoops, lots of those pictures had your kids in them, too. Whereas if you put the photo in a directory for each person in the photo (using links, or plain old "copy" on filesystem with data de-duplication), the picture wouldn't be erased unless you deleted the directories for every person in it, which is kind of cool.
My personal method of keeping it straight is having one directory under which all of my (e.g.) music lives, then other directories with different subsets - one for each family member's mp3 player, then one with my "favorite 8gb" of music, etc. But the "all" directory is the authoritative one... I wouldn't buy an mp3 album and save it to an mp3 player directory just because I want to listen to it there first; instead I file it under "all", then make a link under the directory for my mp3 player, then sync the player.
For photos, just name them by date plus the names of the people or places in them, thus packing multiple "tags" into one filename. It works fine.
Re:Ultrafast search and metadata filesystem by yuriks · 2011-02-13 09:04 · Score: 1

There's no need to update any database with Everything, and it is also *fast*, there's no delay between you typing a letter and the results being updated, it's literally instant, over three whole disks. It's the definition of a pleasurable program to use. These days I just use it to open files, even if I already know where to find them, just because it's so fast.
Re:Ultrafast search and metadata filesystem by AmberBlackCat · 2011-02-13 09:54 · Score: 1

What happens when you move your files?
Re:Ultrafast search and metadata filesystem by xaxa · 2011-02-13 10:09 · Score: 1

I use symbolic links to organise my photos.
I have ~4GB "source" folders, Photos_NNN, which I don't change and are backed up to DVD as they reach 4GB: ./photos/Photos_001/2005-01-23 ./photos/Photos_001/2005-04-13 ./photos/Photos_001/2005-08-03 ./photos/Photos_002/2005-12-31
everything is linked into year folders so I can see it: ./photos/2005/2005-01-23 -> ../../Photos_001/2005-01-23 (etc)
I could do places, people, themes etc too, but I haven't bothered yet.
and some stuff is in the public folder, visible on my website. This might be a symlink to the whole directory, or a directory of symlinks to a selection of files (usually I omit pictures of people from my website) ./photos/public/2005/2005-01-23 -> ../../2005/2005-01-23 ./photos/public/2005/2005-04-13/002.jpg -> ../../2005/2005-04-13/02.jpg (etc)
I'm not had a reason to move the files, but if I did I could write a script to update the links. Note that you can make a relative symlink (../example) or a fullly qualified one (/home/xaxa/example). Your choice will affect what you can move (both, or just the link) without the link breaking.
I don't use hard links as they're more trouble to back up (the backup software either wastes loads of space, or has to do something clever).
Re:Ultrafast search and metadata filesystem by Anonymous Coward · 2011-02-13 10:12 · Score: 0

Yeah, because you can't tag multiple files at once...
Re:Ultrafast search and metadata filesystem by lennier · 2011-02-13 10:52 · Score: 3

You can link a file into as many directories as you like with the 'ln' command.
Well, sort of. You can create a hardlink or symlink in the Posix model easily enough, certainly. But the link is only one way - you can't easily find, given a file, where all its links are. So they can tend to get caught up in the bit-rot. And there's enough of a stigma around symlinking - let alone hard-linking - that very few tools can be relied on to support it in all cases.
A true tagged or non-exclusive-directory filesystem would, I assume, have proper two-way linking between a file and and its links, so you could query a file and get a list of its tags/locations. And all the tools, without exception, would fully support it. This would include things like copying a 'folder' to removable media - you would need to standardise what it means. You can't just copy the links and you can't just turn the links into unlinked files.
What you could do, perhaps, is store all the originals (including folders) in a single universal folder as a globally-unique identifier (it can't be just system-unique, because what if you copy a file to someone else's machine?), then make the other folders on a system contain only hardlinks, and have the file-and-folder copy algorithm copy both a subset of the originals folder and all the appropriate tag folders...
It gets messy, is what happens, because things like disk drives fundamentally have a notion of containment (my file is either on this disk or it's not, it doesn't help if it's 'virtually somewhere out there in the cloud' once I've pulled the network plug) while tags don't. I'm sure we could solve these problems, but they need to be solved correctly and with mathematical rigor at the lowest layer of the filesystem. I don't see any serious attempts to do that in any of the tagged filesystem approaches I've seen yet.

--
You are not a brain: http://books.google.com/books?id=2oV61CeDx-YC
Re:Ultrafast search and metadata filesystem by timeOday · 2011-02-13 10:59 · Score: 1

Hard links are just multiple names for a file, so nothing special happens when you change any of the names. But you have a point that if a single file were moved to a different filesystem, it would lose its 'tags' (all its other names). Technically, this is preventable if you use a tool that understands directory structure and links to copy or move the file (e.g. tar), but admittedly that is a lot to ask.
I use links to organize my music, but for photos I simply put the date and names of people and places (tags, effectively) into the filename. This makes it extremely simple to find them using tools like 'find', 'grep,' and file globbing (or Windows "search" button for that matter). I'll take automated tagging such as exif tags when I can get it, of course.
Re:Ultrafast search and metadata filesystem by sulfur · 2011-02-13 12:48 · Score: 1

I think we should stop trying to create One FS To Rule Them All, and instead aim for more specialized file systems. I don't care about performance in my /storage, and similarly I don't care about metadata in my /usr.
Re:Ultrafast search and metadata filesystem by Hognoxious · 2011-02-13 13:12 · Score: 1

Great. I guess a way to automatically decide which tags to apply is the simple bit.

--
Confucius say, "Find worm in apple - bad. Find half a worm - worse."
Re:Ultrafast search and metadata filesystem by Kvasio · 2011-02-13 13:22 · Score: 1

Thanks man!
So far I've been using thing called Lazycat - which was developed until 2000 and apparently dropped around that time. It was also fast, but I had growing discomfort about software not updated any longer.
Re:Ultrafast search and metadata filesystem by syousef · 2011-02-13 14:51 · Score: 1

Everything is nice, but it doesn't do network drives without running it on the network server. That makes it less useful at work.

--
These posts express my own personal views, not those of my employer
Re:Ultrafast search and metadata filesystem by syousef · 2011-02-13 14:55 · Score: 1

You're right, but I think any tagging system is complex enough to create similar issues. Say you divorce "Joan" and decide late one night to delete all pictures tagged with "Joan."
Nothing is idiot proof. The problem with hard links is that they make it look like there are multiple copies at the file system level. You can intend to delete one copy and end up hosing the lot. In contrast, if you set out to delete everything on your filesystem related to a particular thing or person you can't blame the tool and there's nothing tricky about it here. Just don't delete drunk.

--
These posts express my own personal views, not those of my employer
Re:Ultrafast search and metadata filesystem by paultag · 2011-02-13 17:44 · Score: 1

You're not wrong at all, and I agree -- it's just that the source document has to "live" somewhere. If I were to be happy with it, I'd have to be able to create "debian/manual_on_this", then link it to my "ubuntu/" folder. When I add a change that's no longer "debian/" qualified, I should be able to "rm debian/manual_on_this", but have it live in "ubuntu/". What I'm getting at is that tags let the filesystem ( or database, whatever ) manage the documents and present you with "links", rather then having original documents in the tree. It's like saying that C has pointers like Java (in before Java jokes), ignoring the fact you have to malloc / free. Perhaps I'm ranting at this point, but I think it'd be a nice way forward :) Off to implement it in fuse!

--
This is not a viral sig. Copy it at your peril.
Re:Ultrafast search and metadata filesystem by Risen888 · 2011-02-13 18:38 · Score: 1

"Imagine a system where everything is stored in tags and where folders become obsolete or used far less often. What you have then is a database or metadata file-system."
I'd slit my wrists.

--
Hey, I finally got my first freak! Took you long enough!
Re:Ultrafast search and metadata filesystem by Anonymous Coward · 2011-02-13 19:55 · Score: 0

Well, AFAIK what you describe has already been done, actually. The iPod does indeed store the files in the described way. Even more, the files are marked as hidden and if you just browse the iPod as a USB drive you don't immediately see them. Also, once you do find the files, you find that their filenames have been garbled (like asfhasfasfghasg). The iPod then has a database of items, where each piece of content is marked with several tags (from the ID3 tags, mostly, but also with other data). So you can find the same song in a multitude of ways and in several places in the iPods content tree.
Now, don't take me as a fruit-themed-gadget-lover (I only own a fairly old iPod 4G, as it is), but I would love to have a similar content database (preferably content agnostic, so it would fit any kind of document, not just audio/video/image) on my computer too. It wouldn't need to be implemented at the filesystem level. A content database that would let you import a file, assign it an arbitrary number of tags and then let you group/search/organize the content based on these tags would be almost as good.
Strictly in the wishful thinking department would also be the possibility to share the content across several computers, preferably using uPnP/DLNA.
I know, I know. Wishful thinking at the best.
Re:Ultrafast search and metadata filesystem by Tom · 2011-02-13 22:15 · Score: 1

Imagine a system where everything is stored in tags and where folders become obsolete or used far less often.
Folders will never go away.
You need a unique, constant ordering system in order to access your data. And you almost always have groups of data that belong together.
Folders are the simplest solution to this problem, and there is always a context where the simple solution is the best one.
I'd rather expect folders to become more like tags. I've had the opportunity to direct some money towards Punakea so it would support a virtual folder structure. That's one solution I believe has a future. Tags alone are a mess once you approach thousands of documents and/or share your files with other people in a team.
We definitely need more and better use of metadata. OS X has the basic infrastructure in its filesystem, but it isn't used as much as it should.

--
Assorted stuff I do sometimes: Lemuria.org
Re:Ultrafast search and metadata filesystem by Kashgarinn · 2011-02-13 22:22 · Score: 2

Well, that's because they are fundamentally different.
With a hierarchial structure, you have the file in one place, and if the file truly should be in more than one place for ease of finding it through the directory structure, then you need to link to it (or worse) copy it to the other place. Let's say you link to it, then you need to go to the other place and create the shortcut to the file, the more directories your file needs to be under, the more you need to manually set this up, and manually change, if a change is needed. It's worse if you copy it, as you need to update the copies and most would do that manually. Most people don't do this, they put it under one palce, and just remember that it's there. New people who need the data won't know that, and have to traverse any/all trees until they find it.
a 'tag' based hierarchy is fundamentally different in that the information about the file stays with the file itself, if you want to add some parent link to it, you edit the file itself, and you don't have to do anything else, the system recognises the changes and readjusts on the fly.
It makes the data itself self-advertising and self-identifiable, thus organisation becomes quite simple and easy and very future proof regarding any changes because you don't have to recreate a directory structure manually just because you got a new file and you're wondering where to put it, it's all there already within the file itself.
example: a picture of Ma, Pa, brother X, brother Y, and sister Z, taken on location x: 64.163277,y: -21.859124, date:14/02/2011 10:16 relative terms: vacation, family, peacelight, Iceland, Ma, Pa, X, Y, Z, date, location
- Someone managing a directory structure would never do anything more than just put it under "images/2011/vacation feb"
- Someone doing tags on the picture itself adds whatever seems appropriate, and they don't have to worry about where it's kept as they can look up any tag they want and the picture will be found.
There is a crucial difference, a fundamental difference, and I hope going forward that this is the way data in the future will be organised.
Re:Ultrafast search and metadata filesystem by Anonymous Coward · 2011-02-13 23:35 · Score: 0

What I'm worried about when it comes to metadata and tagging is the system-dependence. If I switch operating systems/file systems or the program I'm using is not available anymore, what happens? Directories are pretty much a universal feature, but how system-independent is metadata?
Re:Ultrafast search and metadata filesystem by Twinbee · 2011-02-14 00:01 · Score: 1

Some of the tags would be automatically saved by default for a file, and you can always add tags later, or batch add them. Also, you can add recently used tags from a dropdown list which are updated in realtime as you type single characters, like autocompletion. Unfortunately, it also means programs would have their load/save requesters modified slightly.

--
Why OpalCalc is the best Windows calc
Re:Ultrafast search and metadata filesystem by mlush · 2011-02-14 00:44 · Score: 1

I actually do add tags to every single one of my digital photos. Its not (quite) as hard as it sounds.
The first thing todo is ensure all cameras are synchronised to the same time. That way when you sort the photos by creation time they organize into a single 'virtual filmstrip'. To a first approximation we are wandering round as a family so photos taken at about the same time are in the same place (if not of the same subject).
I've written a webapp to display photos and allows me to apply tags to ranges of the photos. so I take 2000 images can mark the first and last of the 1500 photos we took going on holiday to Scunthorpe, 'Scunthorpe 2010' (we go there Every year). I then call up all photos I've just tagged and mark the ranges for the various places we went (Scunthorpe Crazy Golf-o-rama, Scunthorpe Aquarium, Scunthorpe World of Fishing Theme Park, Our Hotel etc). I Then look through and tag the people in the photos. In all it may take 10-20 minutes. to process a whole holiday (some times it takes a bit longer if we split up). For really deep tagging (adding Golf, Aquarium, Theme Park, the names of the various rides urls to the theme park etc it may take a hour or so)
The reason I said its not quite as hard is that I've had to write my own software to manage the tag database.. and the reason I'm posting this is I'd dearly love to use something off the shelf to do this (I would not want anyone who inherits the system to have to maintain my software;).
Anyone heard of something that does this?
Re:Ultrafast search and metadata filesystem by Grismar · 2011-02-14 01:27 · Score: 1

I'm not going to say I completely disagree here, because I don't - I use folders myself and I find it to be more than adequate.
But the argument doesn't stand, the way you present it. If it did, you could have predicted the iPhone flopping on the premise of the Newton flopping; 3D would have never become a commercial succes based on prior tech; etc.
Instead, it seems to me that *sofar* nobody has come up with a way to implements "tags" or "arbitrarily ordered nesting folders" in a way that makes more sense than strictly hierarchical folders. It's an interface and usability thing, but it's by no means the proof that people "just don't bother".
Re:Ultrafast search and metadata filesystem by jedidiah · 2011-02-14 02:09 · Score: 1

Since my photos get batch renamed based on the directory they ended up in.... nothing.
You can take one of my photos isolated from the rest of the system and have some clue regarding what it is just by looking at the filename.
It's not like the number the camera gave it has any usefulness.

--
A Pirate and a Puritan look the same on a balance sheet.
Re:Ultrafast search and metadata filesystem by jedidiah · 2011-02-14 02:12 · Score: 1

...that's fine as long as someone does the legwork.
If it's not automatic, 99.9% of people won't bother.
The other 0.1% are already just using manual folder heirarchies.

--
A Pirate and a Puritan look the same on a balance sheet.
Re:Ultrafast search and metadata filesystem by jedidiah · 2011-02-14 02:14 · Score: 1

Something needs to be the primary key.
It's better if that primary key is something that actually means something to the person looking at it.
This is why I rename my photos. The file from the camera is unique enough but not at all useful.

--
A Pirate and a Puritan look the same on a balance sheet.
Re:Ultrafast search and metadata filesystem by Risen888 · 2011-02-14 02:26 · Score: 1

Why? For the love of all that's holy, why? That seems like a hell of a lot of trouble to go through rather than just doing something crazy like giving your files and directories sensible names. Databases may come and databases may go, but my directory structure is universal, portable, sensible, and future-proof.

--
Hey, I finally got my first freak! Took you long enough!
Re:Ultrafast search and metadata filesystem by Twinbee · 2011-02-14 03:04 · Score: 1

Did you read the link in my original post?
Worried about portability? The metadata information can be stored as just standard text. I mean, even now, this is happening with the timestamp, comments, and other info stored along with each file.
And why? Time saving obviously - if you value time. Navigating through a dozen folders (and creating those folders and organizing them) is much worse than *instantly* searching for the right file (even one or two letters could easily come up with the right file if it was used recently for example). There are many other benefits too if you read the article at that URL.

--
Why OpalCalc is the best Windows calc
Re:Ultrafast search and metadata filesystem by Anonymous Coward · 2011-02-14 03:29 · Score: 0

You do know that realtime and instantaneous filesystem search is a standard feature of OS X for over 5 years, right?
Re:Ultrafast search and metadata filesystem by Pollardito · 2011-02-14 05:30 · Score: 1

What you could do, perhaps, is store all the originals (including folders) in a single universal folder as a globally-unique identifier
Windows Explorer (and maybe other file browsers) can actually have problems opening folders that have too many items in them, since the time it takes to display the items seems to increase non-linearly. That bleeds over to the file-save dialog when you try to save new items into the folder too
I've thought of putting together something that will encode tags into the folder heirarchy, so that something with the tags "programming" and "Python" will end up in /programming/Python/ and something tagged "programming" and "Ruby" would end up in /programming/Ruby/ (the tags are ordered alphabetically so that the files don't get split between this and /Python/programming/). That would make the list of tags and their assignments easily discoverable from walking the directory tree in the event that you lost your tag database, but saved the actual tree of items. And it should also do a good job of splitting things up, so that hopefully no single folder is packed with items. The biggest problem would be the need for a custom navigator, so that when you search on "Perl" it is able to look in multiple places like /examples/Perl/ and /programming/Perl/ and if you search on "programming" it can list the contents of /programming/Python/ and /programming/Ruby/ and any other subdirectory of programming.
Re:Ultrafast search and metadata filesystem by Anonymous Coward · 2011-02-14 07:09 · Score: 0

Adobe Lightroom.
The only thing I'm not sure about is the multiplexing of multiple camera imports into chronological order.
Re:Ultrafast search and metadata filesystem by Risen888 · 2011-02-14 10:11 · Score: 1

Time saving? Going back and applying metadata to literally hundreds of thousands of files, and keeping that regime going into the future, for time saving?
It takes 1-2 seconds for me to drill half a dozen directories down (hey, guess what else autocompletes? every decent file manager in the world!). Everything's human-readable and human-discoverable right now. I copy that directory structure somewhere else, it's still human-readable and human-discoverable right now. I need another layer of complexity on top of that like I need a burning dose of the clap.

--
Hey, I finally got my first freak! Took you long enough!
Re:Ultrafast search and metadata filesystem by DigitalCrackPipe · 2011-02-14 11:03 · Score: 1

Directories as tags were my first thought when reading the parent post. The only way I'll every be able to convert to a tag-based system is if the existing work I've done to organize my files is leveraged into tags.

That, and some analog needs to be available for linking to a static set of data in a simple manner. "Just search for it" doesn't have the same level of simplicity as clicking on a shortcut to where I put an exact set of files for a particular task.
Re:Ultrafast search and metadata filesystem by maraist · 2011-02-14 15:54 · Score: 1

Good lord, are you people still using PC-centric data models? Automatic Facial recognition and peer-scanning/meta-data-tagging is an awesome form of data-mining of thousands of pictures. I typically have 5 or 6 tags on any picture of interest from a collection and at least 2 tags on all the rest.

--
-Michael
Re:Ultrafast search and metadata filesystem by Hognoxious · 2011-02-14 20:33 · Score: 1

You can't see the faces on most of the shots.

--
Confucius say, "Find worm in apple - bad. Find half a worm - worse."
Re:Ultrafast search and metadata filesystem by frodo+from+middle+ea · 2011-02-16 01:22 · Score: 1

My only problem with Everything is that the at every boot I have to click YES on the UAC prompt. Really annoying.

--
for the last time people, I am "frodo from middle eaRTH", not "middle eaST".
Re:Ultrafast search and metadata filesystem by Twinbee · 2011-02-16 01:41 · Score: 1

Yes, though I think we can blame Microsoft more on that front.

--
Why OpalCalc is the best Windows calc
Re:Ultrafast search and metadata filesystem by Twinbee · 2011-02-16 06:38 · Score: 1

The names from your current folder setup would be automatically generated as tags (i.e. the folder names would become the tags). So either you can leave them 'as is', or for tidiness you would have them all put into a single folder along with future files.
One or two seconds fast? I'm a fairly slow typer, but I can easily type a tag name under half a second. Or you can select the tag from a dropdown of recently used tags. Again, you *don't need to do this for every file you save*, especially as it would often be autogenerated.

I need another layer of complexity on top of that like I need a burning dose of the clap.
Obviously you must be right, so let's get Google to change its search into a giant hierarchical mess where you need to navigate several layers of fat deep to get what you want. Never mind that tons of files belong in multiple folders and all that hassle that brings. How about if you want to rate files for how good or how useful to you they are? Again, folders make this a mess.
Remember that tags can imitate folders anyway. Just batch apply a unique tag to a collection of files, and bam, there's your equivalent to a folder. This will often be done automatically if the required metadata was saved with the file, like MP3s do.
I dare you to try that Everything program I mentioned in my initial post, and think about the possibilities something like this could bring.

--
Why OpalCalc is the best Windows calc
Re:Ultrafast search and metadata filesystem by Twinbee · 2011-02-16 06:51 · Score: 1

Using an exclusively hierarchical system for tags as an imitation for a standard folder system removes some of the inherent advantages you get from the metadata approach. Yes of course, you can still tag in that way if appropriate (for example a Visual Studio solution), but I would tend to encourage a 'flatter' approach generally.
A metadata system in Windows is nowhere near what I would hope for since save requesters from programs don't even *offer* to save along some custom tags. And Windows search is painfully slow, even in Windows 7. So yeah, people don't know what they're missing.

--
Why OpalCalc is the best Windows calc
Re:Ultrafast search and metadata filesystem by Risen888 · 2011-02-18 19:19 · Score: 1

Obviously you must be right, so let's get Google to change its search into a giant hierarchical mess where you need to navigate several layers of fat deep to get what you want.
That's a specious argument and I'm quite sure you know it. I am the one who organized all my files in the first place. I don't have to wade through someone else's subjective judgments to find what I need in my directory structure.
How about if you want to rate files for how good or how useful to you they are?
Why the hell would I want to do that? If I'm looking for it, it's useful to me. If I'm not, it's not. The first rule of tautology club...
Remember that tags can imitate folders anyway. Just batch apply a unique tag to a collection of files, and bam, there's your equivalent to a folder. This will often be done automatically if the required metadata was saved with the file, like MP3s do.
My file manager does that already. I don't use that feature either, because I'm the one who generated the tags in the first place.
I dare you to try that Everything program I mentioned in my initial post
I'll take your dare. Two questions: 1. Does it run on Linux? 2. Does it do anything that Nepomuk/Strigi does not do?

--
Hey, I finally got my first freak! Took you long enough!
Re:Ultrafast search and metadata filesystem by Twinbee · 2011-02-20 13:45 · Score: 1

Why the hell would I want to do that? If I'm looking for it, it's useful to me. If I'm not, it's not.
Well, it would be to filter a list even quicker. Say if the initial filter produced 100 results, then the highest rated/used/valuable would be nearer/at the top.
Well, unless we both go into massive detail, I'm through I think for now. I know you're still against the idea of a metadata filesystem generally, but hopefully I've pulled you towards that direction, or a subset of that direction, even it's just by an inch.

--
Why OpalCalc is the best Windows calc

If you need to manage documents by Anonymous Coward · 2011-02-13 05:03 · Score: 0

consider using software like Zotero, Jabref, BibDesk or others. But from your problem description it sounds like you don't have the problem that you need to refer to documents quickly and keep a lot of pertinent meta data about them. For managing a few videos and photos for other people directories should be enough. And for anything resembling source code and configuration files you can also use directories plus a distributed revision control system like git.

Learn to delete by zwei2stein · 2011-02-13 05:05 · Score: 2

Simple: Delete stuff.

Do you need all those instalation files for 10 year old shareware? Do you really need Gigabytes of movies you will never watch again? Music Collection so big that your playlist is months on lenght? Irrelevant TV shows? More ebooks than you can possibly read?

What you really need to keep are personal files - photos, home video, documents. Those can easily be managed - tag by occasion, file under year/month. done. (they do not take that much space either and people get tired of documenting everything sooner or later.).

--
-- Technology for the sake of technology is as pathetic as eschewing technology because it's technology.

Re:Learn to delete by mikael_j · 2011-02-13 05:11 · Score: 1

I have no problem deleting stuff that's easy to find again, the problem is all that hard-to-find data. The movie you can't find a DVD or Bluray of and just finding a torrent with seeds took days of searching (not to mention downloading it at a blazing 50 kB/s from the one person seeding it). Removing data like that means that if you ever want to watch it again it will take you days or weeks of preparation.
The problem is that data like that adds up. At first it's just a couple of movies, after a while half the movies you have are movies like that.

--
Greylisting is to SMTP as NAT is to IPv4
Re:Learn to delete by Hatta · 2011-02-13 05:18 · Score: 4, Insightful

Do you need all those instalation files for 10 year old shareware?
Sure do. In fact I just installed StuffIt Deluxe on an SE/30 last weekend
Do you really need Gigabytes of movies you will never watch again? Music Collection so big that your playlist is months on lenght? Irrelevant TV shows?
The bigger the collection, the more fun shuffle is.
More ebooks than you can possibly read?
You never know which one you'll need to refer to.

--
Give me Classic Slashdot or give me death!
Re:Learn to delete by maxwell+demon · 2011-02-13 05:21 · Score: 1

Can I get your crystal ball please? Or how do you determine what you'll ever watch/listen to/read again?

--
The Tao of math: The numbers you can count are not the real numbers.
Re:Learn to delete by Anonymous Coward · 2011-02-13 06:44 · Score: 0

this is a prime example of hoarder mentality.
weeellllll i miiiiight need it soommmeeeday.
Re:Learn to delete by zwei2stein · 2011-02-13 06:55 · Score: 1

Well, stuff that is hard to find again is so for reason: it is not that good. or popular. those are not synonyms, but it works out like good indicator or worthiness.
Collections grow because or your mindset - if it is "expensive" to download file, you are unlikely to part with it easily. You just put your own pricetag on file (or link with file emotionally as downloading it again is going to be painfull). Such files just gather dust, seen once and then just considered too precious to delete without taking actual quality of content into account.
It is hard thing to do. I know, It is quite hard ot make yourself delete that mp3 for which you waited for week back in dialup years.

--
-- Technology for the sake of technology is as pathetic as eschewing technology because it's technology.
Re:Learn to delete by maxwell+demon · 2011-02-13 07:02 · Score: 1

Maybe you never look again at files which you didn't touch for years. For me, it's a relatively regular thing.

--
The Tao of math: The numbers you can count are not the real numbers.
Re:Learn to delete by foobsr · 2011-02-13 07:25 · Score: 1

Simple: Delete stuff.

Yes, great, I have done it, but it is difficult to learn.

Even more so if you are suspected to fall for a slogan like 'because we can', I guess.

CC.

--
TaijiQuan (Huang, 5 loosenings)
Re:Learn to delete by Garble+Snarky · 2011-02-13 08:00 · Score: 2

But at $100/TB, there is very little cost to this mentality - except for now, the growing cost of finding the data - but that is still minimal compared to, say, filling your house with every edition of the NYT.
Re:Learn to delete by mikael_j · 2011-02-13 08:21 · Score: 1

Well, stuff that is hard to find again is so for reason: it is not that good. or popular. those are not synonyms, but it works out like good indicator or worthiness.
They are indeed not synonyms. Especially when it comes to things that are no longer part of popular culture it is very possible that those who hold the rights to the original material don't consider it profitable to release it on DVD or Bluray and most pirates have no interest in releasing it either. It can still be quite good though.

Collections grow because or your mindset - if it is "expensive" to download file, you are unlikely to part with it easily. You just put your own pricetag on file (or link with file emotionally as downloading it again is going to be painfull). Such files just gather dust, seen once and then just considered too precious to delete without taking actual quality of content into account.
While I may not watch the same movies from the 1950s over and over again on a weekly basis I do tend to watch them again every now and then and it would suck to have to spend a week re-downloading it (or looking for an ancient VHS release so I can rip it in horrible quality).
You may find that these files "gather dust", I don't. In my experience the files I rarely re-watch are current TV shows, I might watch them twice at most before I end up deleting them (I only download new shows because I don't have a TV and even if I did I really don't want to wait for months before shows make it here to Sweden).

It is hard thing to do. I know, It is quite hard ot make yourself delete that mp3 for which you waited for week back in dialup years.
I don't really have any music I downloaded in the days of Napster or earlier because they were low-quality and often mislabeled or simply missing id3 tags.

--
Greylisting is to SMTP as NAT is to IPv4
Re:Learn to delete by zdepthcharge · 2011-02-13 09:04 · Score: 1

I agree. I store all the movies I've acquired on an older 160 GB drive. If it dies, I really don't care. That stuff never gets backed up. In fact, I should say that I DID store movies on an older drive that never gets backed up because it died last week. And I haven't missed it or the files on it n the slightest. The important stuff is backed-up either to a local drive or to the cloud (for strange values of "the cloud").
Re:Learn to delete by he-sk · 2011-02-13 09:34 · Score: 1

I very much agree. My whole digital life, excluding photos, fits into 6 GB and goes back to to 1997. That includes e-mail, code, stuff for school, uni, and some work and family projects.
I don't count photos, because photography is my hobby and I keep RAW files around. That get's large pretty fast. But I do weed out obvious crap shots and irrelevant duplicates or tests, otherwise it becomes unmanageable. My photos are filed under "year/year-month-day shoot" (e.g. "2011/11-1-2 We Love Food 9+10") which is easy enough.
I also don't count music, because although I do enjoy listening to it, I wouldn't mind losing it all. It's not irreplaceable.

--
Free Manning, jail Obama.
Re:Learn to delete by Tuan121 · 2011-02-13 12:03 · Score: 1

Simple: Delete stuff. ... More ebooks than you can possibly read?
Yeah I agree. I tried to shut down the local library by arguing there were more books than I could possibly ever read and it was a waste of space. That didn't go over so well.
Re:Learn to delete by Anonymous Coward · 2011-02-13 13:28 · Score: 0

Download it again when you need it.
Re:Learn to delete by Anonymous Coward · 2011-02-13 14:41 · Score: 0

i dont have cable tv. so, yes.
Re:Learn to delete by maxwell+demon · 2011-02-13 21:00 · Score: 1

Download it again when you need it.
First, that assumes that all of the files I have are downloads from the net. While it's true for some files, it isn't true for all. There are files I've actually paid for, and there are files I've created, there are files which I created from CDs (and when in doubt, I'd rather get rid of the CDs than of the files, because the CDs take up more physical space), there are recordings from TV, ... indeed, downloaded files make up a minority of my files (if you don't include installed software, which normally comes right from the distribution's repository anyway).
Moreover, it assumes that whatever files I got from the net are still available on the net. Sometimes this is true, sometimes it isn't. Even free software projects can die, and sometimes this also means the code isn't out there any more. Or sometimes there's a limited-time free download offer. If I wanted to get the same files again, I'd have to pay for them.
Third, even if the file is still available on the net, finding it on the net isn't necessarily easier than finding it on the hard disk.

--
The Tao of math: The numbers you can count are not the real numbers.
Re:Learn to delete by Anonymous Coward · 2011-02-13 21:32 · Score: 0

Good to see a fellow data whore whom I can identify with. I look at my data as a collection. Sure, I delete some stuff from time to time (usually whenever I am nearing my storage limits, about every six months or so), or if it's something I think I *may* need at some point in time then I might archive it off to a spare HDD I keep for just such occasions laying on the parts rack. Overall, though, I look at my data (movies/pictures/music/documents/files/code/you-name-it) as a running collection that I should have full access to whenever I need it. And with hard drives as cheap as they are nowadays, why should I really have to delete anything?
I admit that over the years my storage system has gone through a number of changes, and that at this point it would make absolutely no sense to anyone but myself. But I have managed to create a system that works, almost purely by folder structure alone (combined with memory of how those directory trees are organized). I have also managed to keep redundant copies of everything on at least two systems at any given time, just in case one drive or RAID array crashes, or in case an entire system goes down. If my desktop's motherboard goes tits up tomorrow, then all my stuff (within the last day or so) is still on my fileserver, still accessible so long as my ISP does what I pay them for. Server's RAID card decides to shit its pants and wipe everything out, nothing truly irreplaceable has been lost because 99% of this stuff also resides on my desktop. I have had two major crashes at this point where my data was not being backed up quite like I thought, so now I go overkill with the backups/redundant copies. Actually I'm about to get a 2TB ioSafe or similar fireproof/waterproof external storage system just for the purpose of keeping the truly irreplaceable stuff backed up, so then there will be no less than three copies of my "irreplaceables" floating around on the network at any given time.
Just figure out the most simplistic system you can for organizing your data that also allows for enough granularity/control over share permissions to keep your data relatively organized, manageable, in grouped locations where you won't be able to forget (so you don't end up with 17 copies of the same series of ebooks/etc.), and easily set up for backups. And no, I don't care if you do RAID-1, or even something as ridiculous as RAID-6 or RAID-151 (not an official RAID level, though I have set one up before for a client, took an even 20 250GB SATA disks [and getting the RAID card to nest 3 levels deep was fun as well] LOL) RAID does NOT count as a backup plan.
Re:Learn to delete by mlush · 2011-02-14 00:52 · Score: 1

You never know which one you'll need to refer to.
This line is the nub of the matter.. Disk space is so cheap that the cost of storage for any given file is pretty much negligible. The only time is a waste of money is if you can't find it easliy.
Re:Learn to delete by Anonymous Coward · 2011-02-14 04:00 · Score: 0

Like, for example, Kung Fu.torrent. Jack Black < David Caradine.

For better or for worse... by Anonymous Coward · 2011-02-13 05:05 · Score: 0

I have my local folder structure the same as the project server, so that anyone who might need to, can get something off of it, and it's easier for me to make backups.

Keep the irreplaceable stuff in a separate tree by traindirector · 2011-02-13 05:07 · Score: 3, Insightful

I also still use a similar directory structure, but I've made once change in the past few years that makes it much easier to manage: I keep the special, personal, irreplaceable in a separate hierarchy.

This negates the need for something like a backup_links directory, and makes it much easier to just share the "normal" media directory with everyone/thing on my home network and then handle permissions on the personal stuff with more granularity. It's also much easier when I know I'm looking for a photo I've taken or a document I've made that it'll be in the personal hierarchy under those categories rather than the main ones.

It's a small change, but keeping a separation between stuff I've made and the easily replaceable stuff I've acquired has gone a long way to making my personal data and treasures more secure--both from loss and accidental sharing.

Re:Keep the irreplaceable stuff in a separate tree by CptJeanLuc · 2011-02-13 22:17 · Score: 1

I have a similar setup, with three layers - "unprotected", "synchronized", and "full backup".
Synchronized is backed up with regular rsync. I put e.g. my music there to avoid re-ripping hundreds of CDs if the RAID gets corrupted. Full Backup gets an incremental backup, so older versions are retained. Plus running that backup from another PC, so any malware on the home server (however unlikely) would not be able to wipe the backup.
Keeping two external drives with LUKS file system encryption for running backups, and one of them is always in a separate physical location. Periodically swapping the two drives so there is always an off-site backup which is maximum 3-6 months old.
In order to simplify access to these directories, I mount all three read-only to the same mount point via aufs and share that mount point. So when browsing e.g. media, I do not care about which of the three a file is stored, as long as I have kept the same directory structure.
It gets a bit more complex when setting this up for multiple family members with separate permissions etc. and running backup for everyone, but it's still quite manageable. Also, aufs is quite convenient for e.g. joining everyone's privately managed media files into one master read-only media mount point.
Re:Keep the irreplaceable stuff in a separate tree by AmiMoJo · 2011-02-14 04:02 · Score: 1

The archive bit could be used for that, assuming you don't use it for something else. Then you only have one place to look for files and only the important stuff gets backed up.
You can set the archive bit on entire directories too.
Unfortunately Windows seems to abuse the archive bit quite a lot so it isn't very helpful there.

--
const int one = 65536; (Silvermoon, Texture.cs)
SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC

OpenGoo/Fengoffice by IANAAC · 2011-02-13 05:07 · Score: 1

For all my editable docs I create/edit for work, I use OpenGoo/FengOffice. I can keep track of versions, and apply any tagging, etc.

For my media files (on a separate server)I use Rhythmbox for audio and XBMC for video.

And I back up everything somewhere else, just in case. I don't have terabytes of stuff though. Close to a terabyte.

Delegation by rhendershot · 2011-02-13 05:09 · Score: 1, Insightful

I gave my son his own computer and, like many IT strategies, told him I'd back up what he asked me to. I made him responsible for his own collection, as am I. They may duplicate but hardware is so cheap. When we watch recorded TV shows sometimes we are both interested in keeping a copy, and that's ok. A gig here or there really doesn't matter when I can add 2TB for a $100.

That's very different from the scenario we faced when his brothers were kids. A 100MB hard drive was then pretty significant. I had to consider floppies and temp spaces. Now I'm more concerned with the age of the hard drive.

I don't think I'm the best one to decide how he might like to find his information - who knows what innovation might bring. I DO care that the systems are stable and reliable. That means repairable, at least to me.

Organization, Tags and Smart Programs by chill · 2011-02-13 05:10 · Score: 2

My main file server, where anything not in immediate use is stored, is organized mostly for human convenience. That is, a tree-hierarchy of folders.

media
media/video
media/video/movies
media/video/tv
media/video/shorts
media/video/educational
media/audio
media/audio/music
media/audio/drama
media/audio/comedy
media/audio/educational
media/pictures
media/pictures/family (with various subfolders like "zoo", "picnic", "christmas 2010", etc.)
documents
documents/work/[person's name]
documents/school/[person's name]
documents/misc
web/[site name]
programming/[person's name]/project
family history/
misc/

At the end of the year, or when I do a mass data import, I spend more time getting the meta-data and tags correct than anything else. All of my audio and video are properly tagged. Ditto for any documents.

Almost all video is accessed with "smart" programs, like Amarok or XBMC which automatically pull in things like lyrics, trailers, cover art, etc. That stuff is almost never accessed thru the directory tree. The interfaces on the programs are way too good -- assuming the stuff is properly tagged.

The web and programming folders are basically .tar.gz files that are backed up and copied over (drag-n-drop via smb mounted share). They're archives of whatever project someone is working on their local system. I've set up cron/scheduled tasks to update those daily on everyone's PCs, even the kids.

Most media folders are read-only, to prevent accidental deletion. My account is the master and I can upload stuff there, but I don't want accidents from people wanting to just watch a movie. 600+ DVDs/BluRays, including movies, educational & television shows all on a 2 Tb file server in h.264 format. All *music* is FLAC format, with Amarok auto-transcoding if people want to transfer to an iPod. All other audio, like drama/comedy/educational is 128 Kbps MP3 for ease of streaming. And old comedy albums aren't exactly THX-quality to begin with.

--
Learning HOW to think is more important than learning WHAT to think.

Re:Organization, Tags and Smart Programs by dargaud · 2011-02-13 07:21 · Score: 1

media/pictures/family (with various subfolders like "zoo", "picnic", "christmas 2010", etc.)
I'll expand on that. The very best way to sort personal pictures is to put them in dated directories with dated filenames, then add relevant info to the folder/filename. For instance 20100201_SkiEverest/20100201_064554_Ascent.jpg. It's sometimes hard to remember how you named a file, but it's easy to remember more or less the date (or the month if not the year). I use a script to date the images according to the exif info.

--
Non-Linux Penguins ?
Re:Organization, Tags and Smart Programs by chill · 2011-02-13 07:26 · Score: 1

Pictures I'm not overly concerned about, because I tag them properly with EXIF information. Besides date and event, I'll put in people's names and notes.
Tags are fantastic, if you use them.

--
Learning HOW to think is more important than learning WHAT to think.
Re:Organization, Tags and Smart Programs by Anonymous Coward · 2011-02-13 07:29 · Score: 0

My main file server, where anything not in immediate use is stored, is organized mostly for human convenience. That is, a tree-hierarchy of folders.
media
media/video
media/video/.porn
media/video/movies
media/video/tv
media/video/shorts
media/video/educational ...
That’s more like it.
Re:Organization, Tags and Smart Programs by Anonymous Coward · 2011-02-13 09:56 · Score: 0

Have a look at mp3fs.
Re:Organization, Tags and Smart Programs by dargaud · 2011-02-13 19:36 · Score: 1

I don't know what you mean by tags, but if it's some kind of external metadata, it gets lost on the first backup or hard-drive change [hence why I always thought WinFS was doomed fromt he start]. If it's internal EXIF info, then it needs a special program to read and search through them. Filename baby, filename...

--
Non-Linux Penguins ?

It's a mess by SchizoDuckie · 2011-02-13 05:10 · Score: 1

In spite of what MS promised, we still have no SQL filesystems.. I'd love one of those by now. I have terabytes of data, photos, code, php, javascript, movies, chat logs all scatterd throughout different disks backed up when needed, double copies everywhere. I want something to manage this properly! Any advise?

--
Quack damn you!

Re:It's a mess by darkpixel2k · 2011-02-13 05:21 · Score: 1

In spite of what MS promised, we still have no SQL filesystems.. I'd love one of those by now. I have terabytes of data, photos, code, php, javascript, movies, chat logs all scatterd throughout different disks backed up when needed, double copies everywhere. I want something to manage this properly! Any advise?
mysql --user root --password s3cr3t
create database 'my_new_filesystem'
...
Have fun.

--
There's no place like ::1 (I've completed my transition to IPv6)
Re:It's a mess by rhendershot · 2011-02-13 05:40 · Score: 2

I also was intrigued by the idea of a database-oriented file-system. A basic operation though is to get a file. how? by it's name or id. what is it's name? Something you have to define. It could have a category (eg. javascript development library) but that's something the schema would impose upon you. what if you're more interested in the files' Contributor (author, downloader, etc.) ?
By itself a file-system backed by a database engine doesn't make the problem smaller it adds overhead.
There's only one resolution that identifies one from another and that's the explicit bytes contained within its storage. That can be simplified by indexing schemes like mdasum but they all can have collisions. (rare but how much of a chance are you willing to take?)
Is a file of bytes, ended by CR the same as the same file of bytes ended by CRLF? While the system itself might probably use null termination, other files from other systems won't.
the low-hanging fruit for file de-duplication is in backup storage. When you and another person need to retain the same file it can easily be merged into the stream. when you have two files that are byte comparable that's not so easy because you probably have defined some separation criteria (eg different file paths). so on your system they still need to remain discrete.
I've not heard much about how they would integrate this at the OS level but I think that's the trick.
Re:It's a mess by ron_ivi · 2011-02-13 05:48 · Score: 1

Plenty of FUSE to expose a database as a file system projects: http://www.open.hr/~dpavlin/fuse_dbi.html http://yiannnos.com/dumbofs Can't imagine why you'd actually want one. If you want an exotic way of accessing data, Reiser4 is pretty convenient for file-system-like access.
Re:It's a mess by maxwell+demon · 2011-02-13 05:51 · Score: 1

OK, now you "only" have to write the file system driver for it (probably through fuse), so that normal applications can access the files in your SQL file system ...

--
The Tao of math: The numbers you can count are not the real numbers.
Re:It's a mess by present_arms · 2011-02-13 06:09 · Score: 1

mysql --user root --password s3cr3t

How did u know my MySQL password?

--
http://chimpbox.us
Re:It's a mess by Tacvek · 2011-02-13 14:38 · Score: 1

The correct way is to use SHA-1 or similar hash. The odds of your drive and all the backups in your three offsite locations spontaneously combusting are still greater than an accidental collision. For all intents and purposes a SHA-1 hash uniquely identifies a file. So to get a file a program would simply pass that file's SHA-1 value to the filesystem, and the filesystem would return that file. Locating a file could be done by searching tags.
But that sort of system still has some issues. A program would have real problems trying to find its configuration files. Want to find the latest revision of a file? No problem if each file stores the SHA-1 hash of its predecessor in the metadata, except of course that it my have multiple predecessors, and multiple successors so a many-to-many axillary table is still needed. Of and of course the request for the "latest revision of a file" does not uniquely identify a file, since a file may have multiple forks, so you need a branch naming system. (But why should a branch only be able to have one name? Recurse ad nauseam.)
Also keep in mind that most user's are terrible at keeping even their music tagged. I've seen many user's music collections where the file names were more likely to be accurate than the tags. A Word document's metadata is more likely to be incorrect than correct, since in many businesses it is common to take an existing document and use it as a template, (rather than create a real Word template), with the result that the metadata retains the author and title of the original document, not this new one.
Need I go on? Any system that requires the user to tag things, rather than using heuristics is doomed to fail when the average user goes to use them.

--
Stylish sheet to fix many problems in Slashdot's D3: https://gist.github.com/801524

Tags are useless for personal organization by icemaze · 2011-02-13 05:11 · Score: 3, Interesting

Who has the time to hand-pick all the relevant tags for every file they download? Yeah, me neither.
Finding time to put things in their own directory, and not dumping them all in "downloads", is a great accomplishment.

However finding a meaningful, hierarchical structure is non-trivial. I'm still working on it.

Re:Tags are useless for personal organization by darkpixel2k · 2011-02-13 05:23 · Score: 1

Who has the time to hand-pick all the relevant tags for every file they download? Yeah, me neither. Finding time to put things in their own directory, and not dumping them all in "downloads", is a great accomplishment.
However finding a meaningful, hierarchical structure is non-trivial. I'm still working on it.
I'd settle for being able to tag a file/folder with 'temp' and have the folder/file automatically delete $x days after I last touch it.

The reason I can't just 'rm -rf *' in my downloads directory is I don't want to delete the stuff I just downloaded a few minutes/days ago but haven't sorted properly yet.

I support I should just write a damn script.

--
There's no place like ::1 (I've completed my transition to IPv6)
Re:Tags are useless for personal organization by maxwell+demon · 2011-02-13 05:27 · Score: 1

The nice thing about tags is that you don't need to apply all of them right away (yes, it's better if you do, but it's not mandatory). You can easily add tags later if you discover that some specific tag would be useful. In contrast, it's quite hard to retrofit an existing hierarchical structure.
That said, there's unfortunately no way I know to consistently tag arbitrary files without the danger of the tagging getting broken either by file system operations, or by programs to edit the files.

--
The Tao of math: The numbers you can count are not the real numbers.
Re:Tags are useless for personal organization by hedwards · 2011-02-13 07:49 · Score: 1

The solution to that is to create a folder which is for things you've downloaded recently then move, either manually or via script, any of the ones you haven't used in the last X days to say an attic folder. If you use a file from the attic you move it back to the original folder and periodically you create a new attic and delete the old one. Which should work fine as long as you're not really picky or likely to not use something for a long period of time and then suddenly find a use for it.
Alternatively you could sort the files into 3 or 4 categories, purchases, free, trial and personal or similar. The main thing which causes people trouble with your approach is not getting the level of control right, either trying to have too much or failing to have enough.
Re:Tags are useless for personal organization by Anonymous Coward · 2011-02-13 10:51 · Score: 0

Who has the time to hand-pick all the relevant tags for every file they download?
Who has the time to handpick the relevant directory for every file they download? If someone does, picking one tag would be equally time consuming. Of course, if you're only ever going to use one tag, might as well use a directory. But two tags is not much more work than one and could be very much more descriptive than one directory path.

File organization/etc by Anonymous Coward · 2011-02-13 05:12 · Score: 0

For the most part, I use directory structure coupled with hard drive provisions that organize the content based on media type.

I also use http://bulkfilemanager.codeplex.com for managing names/relocation of mass files, makes life a lot easier. Especially in download sets where a uniform naming scheme for the media is non-existant, and having one implemented would be useful.

Same here, Directories are fine. by guidryp · 2011-02-13 05:15 · Score: 2

I have media drives that hold the bulk and they are easily organized into games/pictures/books/movies/tv/music. Smaller document/coding directories are on my C drive for source/text/spreadsheets I make myself.

I don't tag anything. For my pictures. I simply name the directories Year_date_mainContent. (ex 2010_12_25_Xmas). Media names are self evident, but I also run XBMC for video, so I guess that has internal tagging. But still easy to find video outside of XBMC which I only use about 50% of the time.

I almost never even use search to find things, because the layout is very logical and it is pretty much obvious where everything is.

Everything is online and in my computer, multiple TB drives. No raid.

For backup I simply use external esata multiple TB drives and FreeFileSync, that I run once/week.

Re:Same here, Directories are fine. by AmiMoJo · 2011-02-14 04:10 · Score: 1

I am a fan of automated backups because I know I am too lazy to do them all the time (and plus sometimes I am away for more than a week or two but keep the server on).
To that end I built a little relay controller which switches an external HDD on. It actually controls several relays so that every connection is isolated, i.e. all three mains power lines, all four USB lines and the relay controller's RS232 lines. That way even if lightning strikes there is 15kV of air gap isolation to protect the external drive. A script turns the drive on, mounts a TrueCrypt partition on it, runs a backup, unmounts and powers it off again.
I have not finished the web page for it yet otherwise I would provide a link, but it uses an AVR microcontroller. Cost was ~Â£10, mainly due to the number of relays needed.
Naturally I do online backups of really critical stuff too, but since I bulk upload things like photos there is really very little that needs to be synced each day.

--
const int one = 65536; (Silvermoon, Texture.cs)
SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC

Google said it best.. by xtal · 2011-02-13 05:15 · Score: 2

Search, don't sort.

--
..don't panic

Re:Google said it best.. by Rob+the+Bold · 2011-02-13 05:25 · Score: 1

Search, don't sort.
Good idea in theory, but in practice -- at least for me -- most files aren't going to have enough data in the filename and tags (if any) to search for pictures from Uncle Bob's second wedding. Unless, of course, you've sorted all your pics and tagged/named all the pics from that event.
Some kind of picture/video/audio searching that would be effective seems a long way away, at least if you're looking for the aforementioned wedding as opposed to all pictures with blue flowers.

--
I am not a crackpot.
Re:Google said it best.. by Anonymous Coward · 2011-02-13 05:38 · Score: 0

Google created the ilusion that everything can be searched, lots of information are not accesible via Google and other search engines. Google is very similiar to TV, if you are not shown in TV you don't exist.
Re:Google said it best.. by runningduck · 2011-02-13 06:24 · Score: 1

What's not descriptive enough about vid002034.avi? I am sure searching for the vacation or family event will turn this video up in your search results without a problem.

--
-rd
Re:Google said it best.. by maird · 2011-02-13 06:36 · Score: 1

Search, don't sort.
...most files aren't going to have enough data in the filename and tags (if any) to search for pictures from Uncle Bob's second wedding...
How many dates did it take place on and how many other things did you take pictures of on the same date(s)? The date of the wedding should be at least enough to get a search result with a high density of what you're looking for.
Re:Google said it best.. by tiago.bonetti · 2011-02-13 06:55 · Score: 1

Search for pictures with bob in the name at the year of the marriage... DONE!!!
Re:Google said it best.. by hedwards · 2011-02-13 07:52 · Score: 2

That works better if you're not in control of the files. From personal experience searching does precisely bupkiss for backing files up or excluding ones that you don't want backed up. It also doesn't do you much in terms of figuring out which files are duplicates and deciding which files should be deleted.
Right now I've got several folders for the importance of the files under it to me, and I've named them based upon the duration of backups that I feel I could afford to lose without being distraught. Some by the hour and some by the month. It's a little bit of a pain, but it makes it a lot easier to make sure that my backups are done because I don't have to set up rules for individual files all over my HDD in random places.
Re:Google said it best.. by Rob+the+Bold · 2011-02-13 08:29 · Score: 1

Search for pictures with bob in the name at the year of the marriage... DONE!!!
If only . . . Unfortunately, I really haven't put even that much metadata in the file names, like Bob's name, so it'd be a matter of going through all the photos from that year. Assuming also I had the date set right on the digital camera. I've screwed that up for at least a while just about every time I let the thing's batteries completely die -- clock resets. Just about all my pics are named something like DSC0056.jpg and have a date code earlier than the camera was made.
So my lack of planning ahead really screws me over in terms of simple searching.

--
I am not a crackpot.
Re:Google said it best.. by xaxa · 2011-02-13 10:25 · Score: 1

I recently borrowed a friend's camera, and used that and my own camera on a trip (I was experimenting with her better camera). I got home, and realised the clocks were different on the cameras. I needed consistent times so the photographs showed up in time order when I put them in the same directory, so I did
for i in IMG_0*; do exiv2 -a '-1:12' $i; done
which fixed the timestamps (removed 1h12m from the timestamp). exiv2 -T set the file modification time to the EXIF time.
You can always photograph a clock if you don't have time to sort out the EXIF data when you download the photos.
Re:Google said it best.. by Anonymous Coward · 2011-02-13 22:45 · Score: 0

Search, let others sort.
Fixed that for ya
Re:Google said it best.. by Rob+the+Bold · 2011-02-14 03:11 · Score: 1

I recently borrowed a friend's camera, and used that and my own camera on a trip (I was experimenting with her better camera). I got home, and realised the clocks were different on the cameras. I needed consistent times so the photographs showed up in time order when I put them in the same directory, so I did for i in IMG_0*; do exiv2 -a '-1:12' $i; done which fixed the timestamps (removed 1h12m from the timestamp). exiv2 -T set the file modification time to the EXIF time.
That is a good idea with the timestamps to fix that clock offset problem.
Now, I'm honestly not trying to piss on anyone's parade, but all the fixes to make photographs more practically searchable require you to have taken (past tense) a certain degree of care in getting the metadata in the photos (or the files, or the filenames, or directory names) into some sort of organized (and accurate) scheme upon creating/downloading them.
My point is this really only works in a going-forward kind of sense. It's the huge backlog that the original poster of this story seemed to be alluding to. And not just the backlog of a geek's files -- which would probably be better organized than most (which still isn't saying much) -- but also those of family members whose actions are totally out of one's control. And don't get me started on scanned versions of old prints/slides . . .

You can always photograph a clock if you don't have time to sort out the EXIF data when you download the photos.
Now that's a really good idea. Even better if that clock displays date, too. And it's something that I might actually remember to do -- well, from now on. I remember as a kid that I was a little smarter than I am now -- at least I thought so then -- and I would always take some sort of "establishing shot" before each session: the big sign at the entrance to the national park, the program from a wedding, etc. I hadn't done that or even thought about it inapproximately forever.
And as a BTW. I'd mentioned in earlier post that I've had cameras that lost all their time info on total battery discharge. I'm pleased to note that my latest has a backup. That's the good news. It also has a "daylight savings" checkbox in the clock settings. And it was checked -- just now. So I should apply your time offset adjustment to all the photos in my collection taken since last DST switch with that camera. As to last year's photos, I don't know if I had unchecked that box or not. I guess the hour difference isn't that big of a deal compared to my other organizational problems, though.

--
I am not a crackpot.
Re:Google said it best.. by xaxa · 2011-02-14 04:51 · Score: 1

My point is this really only works in a going-forward kind of sense. It's the huge backlog that the original poster of this story seemed to be alluding to. And not just the backlog of a geek's files -- which would probably be better organized than most (which still isn't saying much) -- but also those of family members whose actions are totally out of one's control. And don't get me started on scanned versions of old prints/slides . . .
Some manual sorting of some kind is probably inevitable for old photographs (new ones might have geo data as well as a time, which should make it easy to find sets of photographs). What would be interesting is some automatic face detection. Here's a start -- with a bit more work similar-dated or similar-geotagged photographs could be tagged in common with the ones with people in.

You can always photograph a clock if you don't have time to sort out the EXIF data when you download the photos.
Now that's a really good idea. Even better if that clock displays date, too. And it's something that I might actually remember to do -- well, from now on.
Sometimes I first photograph a handwritten note along the lines of "€20 cash for this memory stick if found, contact [email] or [phone] or post to [work address]. Alternatively €20 to the charity of your choice.". Fortunately I've not needed it myself, but I've returned two separate memory sticks I found on trains to their ungrateful owners in the last year. In both cases I doubt a non-technical person would have been able to find the owner even if they'd tried, so I want it to be really easy.

It also has a "daylight savings" checkbox in the clock settings. And it was checked -- just now. So I should apply your time offset adjustment to all the photos in my collection taken since last DST switch with that camera. As to last year's photos, I don't know if I had unchecked that box or not. I guess the hour difference isn't that big of a deal compared to my other organizational problems, though.
I keep my camera set to UTC, initially through chance (I bought it in winter, I live in the UK) but then later by choice (to prevent photographs appearing out-of-order when I went to France, which is UTC+1.)
I still get confused when I work with timezones, I've always had +0000 for half the year.
$ exiv2 01.jpg
Image timestamp : 2010:07:03 12:55:35
$ stat 01.jpg
Modify: 2010-07-03 13:55:35.000000000 +0100

There is a new project for binary version control. by mekberg · 2011-02-13 05:16 · Score: 1

This problems keeps popping up more and more often, as people collect more and more data... I don't have the answer to the question on how to keep everything indexed and searchable, but I do have the answer to the question on how to safely version control and store/backup such large amounts of data data... A little project I have called Boar. I quote from the project front page:

"BOAR aims to be the perfect way to make sure your most important digital information, like pictures, movies and documents, are stored safely.

* BOAR prevents data loss due to human or machine error
* BOAR makes it possible for you to restore any or all of your files from any point in time.
* BOAR makes it easy to maintain verified backups of your data, including file history.
* BOAR will make it much more likely for your digital heirlooms to reach your grandchildren some day.

If you are familiar with vcs software such as Subversion, you might think of boar as "version control for large binary files". But keep reading, because there is more to it."

Please check it out at google code: http://code.google.com/p/boar/

Please tell us by rossdee · 2011-02-13 05:16 · Score: 1

Where you keep all your valuable data, so if we ever hack into your computer, we know where to steal (or at least make copies of) your pron collection

Re:Please tell us by Anonymous Coward · 2011-02-14 09:52 · Score: 0

Isn't that joke a little old.
Porn isn't really valuable, with the amount freely available it is practically disposable.

I use 'group directories' by david.emery · 2011-02-13 05:18 · Score: 1

I'm working in a Mac OS X environment, but this should work for Linux too: I have groups for the various classes of stuff, e.g. photos, household files (like taxes and Christmas letters), etc. Each group has a group home associated with it, and I mount those from my server as needed. (The server's a RAID 5 box). Irreplaceable stuff like photos are copied a couple of times, once to a disk on a separate machine and periodically to a portable USB drive that I keep at a friend's house. (I have 2 of them and rotate them.) An advantage of the group-based approach is that I can use group privileges to limit access if required (e.g. my work related stuff is not readable by the rest of the family. Photos are updatable by my wife and I and readable by everyone else, etc.)

For sensitive materials, I actually use a Mac OS X encrypted disk image in the group home directory. One of these days I'll work out how to get whole-drive encryption on my Mac OS X Mini Server.

For my photos, I'm experimenting with various keyword Digital Asset Management schemes, inspired by "The DAM Book" http://oreilly.com/catalog/9780596523589/

And as a side note, I'm seeing -50% failure rate- on Seagate 3.5" 1tb drives that are about 1-2 years old. The RAID enclosure is running Toshiba 1tb drives. One of my 2 USB backups (with a Seagate drive) failed, so I'll replace that with a Toshiba or WD drive. I'm really disgusted with Seagate reliability!

my solution by StripedCow · 2011-02-13 05:18 · Score: 1

1. Post it on the web, or run your own apache instance.
2. Use google to find back your data.
3. ?
4. Let others also profit from your data.

--
If Pandora's box is destined to be opened, *I* want to be the one to open it.

google desktop (RIP) by meeotch · 2011-02-13 05:18 · Score: 4, Interesting

I had great success with Google Desktop Search (on windoze) for a while. It would index my mail, files, and web history (if instructed to) - and the best part was hitting one key to get an instant, minimalist search box with auto-preview. From there, you could jump straight to what you were looking for, or open a further page to narrow the search.

Sadly, it doesn't work with Thunderbird 3.0, and Google doesn't appear to care, or even to be supporting it anymore. So now I'm on a hodgepodge of GDS, Windows built-in search, and the sucky T-bird search bar.

I honestly can't believe that nobody has duplicated this Spotlight-esque functionality yet. I realize there are other desktop search options, but none of the ones I've come across have that one-key mini search that goes away as easily as it is called up. For an operation that I'm performing dozens of times daily, that's pretty crucial. It even replaced the file browser for me - much easier to call up the GDS box & type a couple letters than to grab the mouse and drill down into some directory structure - even if I know exactly where I'm going.

Re:google desktop (RIP) by Anonymous Coward · 2011-02-13 06:23 · Score: 0

Have you tried Copernic Desktop Search?
Re:google desktop (RIP) by Anonymous Coward · 2011-02-13 06:49 · Score: 0

Not sure how well it scales to your level of data, but Find And Run Robot may be just what you are looking for.
A couple other comments mention Everything as being quite good, but is not when I have tried out.
Re:google desktop (RIP) by QuantumRiff · 2011-02-13 12:25 · Score: 2

the built in search in windows 7 is actually a drastic improvement over what used to be. It will work with thunderbird as well: http://superuser.com/questions/80848/how-to-have-windows-7-index-thunderbird-3-messages
You should give it a try. Its especially handy when searching network drives. If the server has indexing running, it will hand off the search to the remote server, and you'll get the results back instantly.

--

What are we going to do tonight Brain?
Re:google desktop (RIP) by xtracto · 2011-02-13 20:04 · Score: 1

I second this. Coopernic search is a really good program. I use it in all my desktops.

--
Ubuntu is an African word meaning 'I can't configure Debian'
Re:google desktop (RIP) by Pebby · 2011-02-14 08:56 · Score: 1

Check out Launchy. It is seriously awesome and snappy (especially if you tune it to not show the flashy UI fades and such).
http://www.launchy.net/
Re:google desktop (RIP) by BusyBeeNYC · 2011-02-17 03:58 · Score: 1

In my opinion google desktop search is one of the best http://bbcleaningservice.com/Office_Cleaning_Services.html

--
http://bbcleaningservice.com/

Disagree by IANAAC · 2011-02-13 05:19 · Score: 1

Tags are no less useful than any other form of organization. They're more useful to me when used with software that keeps track of location for me.

Well thought out is well thought out, regardless of whatever system you use.

Colour-coded by mr_lizard13 · 2011-02-13 05:21 · Score: 2, Funny

sticky labels on each floppy disk.

--
"We live in a global world" - Harvey Pitt, former Securities and Exchange Commission Chairman

Re:Colour-coded by marcosdumay · 2011-02-14 08:12 · Score: 1

That's good. With a box of 10 colored floppiess, with 10 colored labels you have enough combinations for 100 disks. I've never used more than 100 floppies...
Altough, nowadays I do use more than 100 DVDs.

--
Rethinking email

It/is/all/junk/then/you/die by B_SharpC · 2011-02-13 05:22 · Score: 0

It/is/all/junk/then/you/die

--
Score & Karma: SASA: Slashdot Approval Seekers Anonymous

File Organization How do you do it in 2011 by Anonymous Coward · 2011-02-13 05:24 · Score: 0

I have a huge NUMBER of files, in a huge NUMBER of formats about a huge NUMBER of topics....

My Secret Technique by Snarf+You · 2011-02-13 05:25 · Score: 1

Here is my highly effective file organization methodology:

I keep multimedia (audio, video, porn) files scattered around 37 or so different folders, each with several subfolders. The majority of these folders have 'temp' or 'download' in their names, except for porn files and folders which consist solely of simple combinations of letters and numbers, such as zzz/tt2.mpg, abc/xyz10.mpg, etc.
Programming projects (some for work, some for hobby, none of which are finished or ever will be) get split up among folders whose names convey increasing seriousness. For example, first there was simply 'Projects', then along came 'New Projects', which begat 'Really New Projects' and eventually 'Even More New Projects'.
Helper files (planning, documentation, spreadsheets, etc.) for each such project have strange filenames relating to whatever thoughts were racing through my mind at the time they were named.
Then, all you have to do is remember where and what everything is.

And that's pretty much all there is to it. This system hasn't failed me yet. Plus, it will stimulate the economy in approximately 0 to 60 years, because the investigator who has the pleasure of snooping through my computers after I croak will have job security for years.

Re:My Secret Technique by woboyle · 2011-02-13 06:13 · Score: 1

And I keep all my tax-relevant papers in a single big cardboard box... For all years, of course! :-)

--
Sometimes, real fast is almost as good as real-time.
Re:My Secret Technique by melikamp · 2011-02-13 11:48 · Score: 1

I agree. Burning paper is way more satisfying than telling a computer to forget.
Re:My Secret Technique by marcosdumay · 2011-02-14 08:14 · Score: 1

SMART disks can't even forget.

--
Rethinking email

Let me give you one word...Volume. by Anonymous Coward · 2011-02-13 05:25 · Score: 0

One big disk, or multiple ones if you want redundancy for your data. It's the only way you'll keep anything.

Switching out media is a pain. There are people who do it. If you were one of those anal retentive types, you'd hardly be asking about it on Slashdot. So I'll assume you're a normal person.

So one big storage space. There are a number of ways to do that, from a USB or eSata drive, to a network share. Whatever choice you make, if you don't like it, change in another year or so. At the rate of data space increases, you won't run out anytime soon.

The real key is to keep your media up to date. Disks decay, drives break down. Yeah, you can buy a 5 1/4" drive, but is it worth the bother? Even 3 1/2" are fading, and it won't be long before some others hit the dust.

Of course, specifics are up to you, different people like different organizations, and who knows what you really have a use for? I remember some old games I've played once or twice. Should I keep them, or just forget about them because if I really cared, I'd still know their names and not just have to look for them.

A word for "lifestreams" and against livelink by rbrander · 2011-02-13 05:27 · Score: 3, Insightful

I'm pretty much a "have a lot of structured directories" guy myself; I don't see your complaint about rising file sizes, or even total number of files. They've pretty much increased linearly in number while the speed of the linux "locate" command has gone up exponentially with Moore's Law. It's the other way around from management trouble - with TB hard drives, I have so much space I leave around TV shows and other media files I'll likely never watch again, "just in case".

At work, the search problems are harder, because I've got quite the multi-tasking job where I may spend just minutes on some problem, then be asked for an update months later, totally skeptical that I ever addressed the issue. And my favourite file-management with that is the most insane-sounding of all: one big directory. I sort it by date and rely on the fact that I take time to write out helpful file names like "downtown_condition_assessment_newmall_4_ernie.xlsx" (not actually that long, I use abbrevs in RL). Only files that have a whole lot of subject-matter friends get their own subdirectory; lonely "one-off" files go in the Big Pile.

The "sort the directory by date" uses the theory behind "lifestreams" promoted by Eric Freeman and David Gelernter at Yale. It really is the best thing I've found (same 30 years) to stimulate the memory - seeing the names of other things you did at the same time; you can actually sense yourself getting close to the file as you remember, "Oh yeah, I worked on that in the spring".

An additional word of Fear & Loathing for "document management systems" like LiveLink by Formark. Required to use this by work (shared directories are strictly for 'short-term' storage), it's awful. Terribly slow, the search function approaches useless, and it's hard (and slow, did I mention slow) to even re-sort a directory (sorry, that's a 'filter down' in Livelink's vocab) by name or date or whatever. After promising that photos would be displayed with thumbnails by the great new Version 4 for two years, it came, broke some stuff that was working, and did not provide thumbnails - all media files are unsearchable in any way. I suspect for long-term archiving, putting documents in a database would have advantages, but for active business usage, it's been crippling.

Re:A word for "lifestreams" and against livelink by ZerdZerd · 2011-02-13 08:10 · Score: 1

This is why I would like a "created" timestamp for files in unix file systems.

--
I'm not insane! My mother had me tested.

Directory orgs I use by bzipitidoo · 2011-02-13 05:31 · Score: 1

These are what I've come up with.

For Windows, I create C:\Software and C:\Hardware. Drivers, DirectX updates, and such all go in Hardware. Any software I install goes in Software. Games are the reason to use Windows, and are huge consumers of hard drive space, so they rate their own subdirectory, C:\Software\Game. (I've also decided to drop plurals from directory names I create. Was getting annoying having "pic", "pics", "pictures", "images", etc.) It doesn't have to be "Software", all it has to be is not C:\Program Files. That way I can tell at a glance what I put on there, and what else is there. Back in the days of dial up BBSes, I used C:\LOAD\DOWN and C:\LOAD\UP. When I installed Windows, I'd have it install into C:\W, figuring that would make various configuration files ever so slightly smaller.

For UNIX, of course I have /home mounted on its own partition. Makes upgrading and backing up a lot easier. I use 'u' (for "user") for my primary user name. (Some distros, such as SuSE, won't allow single char user names, so it's "u1" for those.) Besides keeping it as simple and short as possible, it also heads off any possibility of my real name being easily discovered from my chosen user name. As more and more crap has been stored in the home directories of users (directories like .mozilla, .gnome, .gnome2, Desktop, Documents, Downloads), I've recently taken to putting all my stuff in /home/u/own/, so I can easily tell them apart. I could live with it as long as they kept to hidden names, but when the desktop environments started pushing in with subdirs like /home/u/Documents, I decided to do something. Same idea with C:\HOME\U on Windows, when I have anything there. C:\My Whatever attracts too much junk from programs that take it upon themselves to save their ever so valuable configuration info there.

And lastly, I save configuration tweaks, with full path names, in /home/localconfig/. If I change, say, /etc/hosts.deny, I save the changed copy (not the original) in /home/localconfig/etc/hosts.deny. Really helps when I'm trying to remember what I had to do to get sshd, CUPS, XWindows, or whatever to work, or where the window manager du jour stores its global configuration and menus, or where the heck they moved DIRCOLORS functionality this time. Of course there is no user named "localconfig".

--
Intellectual Property is a monopolistic, selfish, and defective concept. It is "tyranny over the mind of man"

Re:Directory orgs I use by Risen888 · 2011-02-14 03:04 · Score: 1

"And lastly, I save configuration tweaks, with full path names, in /home/localconfig/."
I automate this (appending a datestamp onto the filename) with an emacs hook that is called whenever I edit something with sudo. It has saved my ass many times when, just like you, I've been trying to remember what the hell change I made to rc.conf to make whatever needed to work work (or break, as the case may often be).

--
Hey, I finally got my first freak! Took you long enough!
Re:Directory orgs I use by Anonymous Coward · 2011-02-14 20:46 · Score: 0

i put the config files i edit under RCS

Ubuntu + Google Rainbow by Anonymous Coward · 2011-02-13 05:33 · Score: 0

Nexus One / Nexus S / Folio (Android Tablett = iPad, cheaper)

Then : Chrome / Picasa / Google Power

Because the Invisible Garfield @french_matt ... pushes Innovation instead of just staying @ home ;)

Keep it Spinning by aoeu · 2011-02-13 05:34 · Score: 1

in two places, everything.

--
All your database are belong to U.S.

Everything on PC, Spotlight on Mac by Xian97 · 2011-02-13 05:35 · Score: 4, Interesting

Everything is what I use on the PC to quickly find any file I am looking for.

On the Mac I use Spotlight.

While it would be nice to be completely organized, these tools let me find my files anywhere they are located on my PC. I try to keep things organized into folders, but I am always falling behind so these are what I can use in the interim.

Re:Everything on PC, Spotlight on Mac by Anonymous Coward · 2011-02-15 01:34 · Score: 0

Does "PC" really pass for "Windows machine"... on Slashdot?

Re:Directories / split over machines / crashplan by rvw · 2011-02-13 05:37 · Score: 1

Another option is to split data over machines. Use a media server, and keep media there. Maybe pictures and personal video's can be on your normal laptop as well, but your MP3 collection, movies and tv-shows don't need to be on your laptop.

Then I use one old desktop with Ubuntu on it as backup-machine. I use Crashplan for this. It has a free option to backup to your own machine, or to backup to a friend's machine. I backup several machines (from my parents as well) to this one machine. Then this one machine can be backed up online for $5/month, no limit. (You have unlimited storage, only limited by the upload speed. But as we've seen with Mozy, that can change very quickly.)

Just two for everyone I thought by Anonymous Coward · 2011-02-13 05:45 · Score: 0

porn/
notporn/

Re:Just two for everyone I thought by shikaisi · 2011-02-14 01:14 · Score: 1

I won't even ask about the relative sizes of those two.

--
No left turn unstoned.

A 2-tier backup system would be nice by azgard · 2011-02-13 05:51 · Score: 1

I don't use any tools. I just have all the content on two sets of external disks (copies of each other; I use external disks because I don't have large enough computer and I don't like the idea everything to be under current at all times). It's a pain to manage. I think Linux (or your favorite OS) desperately needs a 2-tier backup system with deduplication (but still making sure you have enough copies for recovery) and a good user interface.

Ideally, I would say, in file manager, unarchive me this file, and he would look for the file, let me mount the proper disks/CDs required to get and then copy it to some cache area on main harddisk. Here I could play with the files (change, sort, rename, tag, whatever), and then he would automatically backup them again when I wouldn't play with them any longer anymore.

On a side note.. by vondiggity · 2011-02-13 05:54 · Score: 2

What happened to Beagle for Linux? It used to work pretty well for me, and now it seems to have been abandoned.

Re:On a side note.. by tomtomtom · 2011-02-13 10:24 · Score: 1

What happened to Beagle for Linux? It used to work pretty well for me, and now it seems to have been abandoned.
Try Recoll. Finding this was a real revelation to me - it was the first desktop search tool I'd really ever used which worked well. It seems to consume almost no resources (runs from a cron job rather than needing to run as a daemon) and search is very fast, intuitive and efficient.
Apart from that, I have to say I go down the hierarchical directory route. Just seems like the most logical way to do things to me, and when searching isn't really possible/practical (e.g. for photos), it does make things easier to find.
Re:On a side note.. by westyvw · 2011-02-14 18:27 · Score: 1

Recoll rocks had to say it.
For images I used Kde's Digikam and the search can be location, like images, a sketch (so so results for now), time period, and tags. Nice.

Easy by Smartcowboy · 2011-02-13 05:55 · Score: 1

I put everything on the desktop. When there is no more place on the desktop, I create a subfolder named "temp" and then put everything in it, including the last "temp" subfolder.

Re:Easy by pythonboy · 2011-02-13 07:14 · Score: 1

I really really wanted you to finish that with...

When there is no more place on the desktop, I bought a bigger monitor.

Do what my boss does by Anonymous Coward · 2011-02-13 05:56 · Score: 0

All files on your desktop with subfolders - along with one or two random virus .exe files.
All emails sent to me have subject lines with "Re:" followed by some completely unrelated subject.
And all VBA code is commented at a ratio of 1 commented line for every 600 lines of code.

That should it do.

*smart* metadata filesystem by itzdandy · 2011-02-13 05:59 · Score: 1

when need a smart metadata filesystem. The system needs to be a simple and automatic system which file extensions and file headers are used to create the base level tags. Other tags could be added for items like music and video but the 'bread and butter' of the tag system needs to come from obvious information in the file and filename.

Re:*smart* metadata filesystem by revscat · 2011-02-13 06:56 · Score: 1

Spotlight.
Re:*smart* metadata filesystem by itzdandy · 2011-02-13 10:20 · Score: 1

uh, mac only. Also, this is an app on top of the filesystem.
We need this built into the filesystem. Then be able to produce views to suite the need of a given interface.
I am a database guy so It is easiest for me to explain in those terms.
select * from files where type = 'media' and format in (select * from types where category = 'audio') and bitrate between 128 and 192
present this as a saved query/folder/favorite search. add a "and owner in ('username','public')" and/or "and location in ('local','network')"
this is just a representation in sql, but storing files in such a way and allowing fast searches via indexes or filesystem equivilent within the filesystem via translating traditional search commands would simplify the user interface for files greatly.
This could be implemented in fuse in a hurry (there have been attempts such as mysqlfs and such) but not only is fuse bound to be quite slow, but a full RDB is overkill and has too much overhead.
I feed like there are really two things that are very important to the average home user. 1)fast access to data, which means very fast search results and very fast browsing. 2) large file thoroughput. Small files are typically not handled in very large volumes for the average user, but 1Mb+ files from pictures and docs and videos are.
for desktop users, you could really sacrifice a lot of performance if you could focus on fast searching and browsing and on large files transfers.
Re:*smart* metadata filesystem by Anonymous Coward · 2011-02-13 11:51 · Score: 0

I am a database guy so It is easiest for me to explain in those terms.

select * from files where type = 'media' and format in (select * from types where category = 'audio') and bitrate between 128 and 192
I assume the "Types" has at least 2 columns, since you're using a where clause to limit by "category". In that case, your query will fail because the subselect returns at least 2 columns and you're using an "in" instead of "where exists" (which is significantly faster in most cases).

You know, if you're going to describe yourself as a "database guy" you need to get your damn SQL right.

Directories, b*tch by tkprit · 2011-02-13 06:01 · Score: 1

(waiting for someone to say 'CLOUD')

Not folders, "libraries", and sure as hell no tags (I tried that w/ Picasa for my pictures a few years ago; made a mess; deleted Picasa -- returned to sensible dir structure "pictures/TOPIC/year/month" and I'm fine). And I separate code by language ("code/4th/TOPIC" or "code/c++/TOPIC" ..I even keep a /fortran dir though I haven't used it since 80s, and a /48sx— some of my favorite code even though it's essentially unusable). Homebrew backup across local network drives and [for pics/video and code] solid state.

evil: I keep an /mp3 dir that's root-accessible only, with no subdirs whatsoever. ONE time I had a problem with this, and for that system alone I made a few subdirs. Learned hard way w/ iTunes. I despise programs that rearrange your files for you, make ridiculous subdirs w/out permissions, etc. I have to use iTunes, but I look forward to the day when I can get rid of anything apple and/or adobe. Hell, not even MS forces directories on you (not incl. the OS itself, I guess).

Re:Directories, b*tch by Anonymous Coward · 2011-02-13 07:11 · Score: 0

iTunes doesn't force directories on you either.
If you looked at your preference settings, you would have found a checkbox that says to allow iTunes to organize your files for you. Uncheck that and you're done.
I happen to leave that checkbox checked, however. I have no problem with certain forms of data being reorganized for me. Music and pictures are better organized by an application designed for organizing this form of data. When I think of music or pictures I don't think of the files they are stored in or what directory they might be in.
Re:Directories, b*tch by Omestes · 2011-02-13 07:40 · Score: 1

Learned hard way w/ iTunes. I despise programs that rearrange your files for you, make ridiculous subdirs w/out permissions, etc. I have to use iTunes, but I look forward to the day when I can get rid of anything apple and/or adobe. Hell, not even MS forces directories on you (not incl. the OS itself, I guess).
That behavior in iTunes is no longer the default, at least. And, last I checked, it could keep a large, unorganized, directory organized within itself, without mucking with the directory.
Personally I love iTunes. I have over 40Gb of music in it, and the idea of manually organizing it sounds like absolute hell. I tried that once, hated every second of it, so I'm a large fan of a program that will do all the boring work for me.
Actually MS does, or at least their sloppy design does. Its one of my peeves, I'm trying to be a good little nerd and keep my /user directory on a separate drive. But a lot of programs won't play nicely with that. Even if all of my home directory is sitting pretty on D:\ everything will still write to C:\Users\[NAME]\Appdata. Yes, you can change it with a bit of registry work, but it then breaks a bunch of programs that keep it hard coded. Same with trying to keep control of your home directory. Every damn program has to make some silly folder in an arbitrary location within it. Games are especially bad, since you often can't edit where they save. So you end up with, in my case:
D:\Personal\[somegamesaves1]\
D:\Personal\[some gamesaves2]\
D:\Personal\My Documents\[company]\[game]\saves\
D:\Personal\My Documents\[company2]\[game]\saves\
D:\Personal\My Documents\save games\
D:\Personal\My Documents\[game]\save\
D:\Personal\My Documents\saved games\[game]\save\
D:\Personal\My Documents\saved games\saved games\[game]\saves\
And then a bunch saved in the immovabe C:\users\[Name]\appdata\ and its various, somewhat arbitrary subdirectories. And then a bunch that insist in saving in their directory in C:\Game\[comapany]\[title]\ or under their Steam directory, at some random place.
It makes backing up saved games a pain.
More on topic; I'm a strict hierarchy type, though I do mostly use search. When I used a Mac I had strict hierarchies, but mostly used Quicksilver to access files. Same on Linux, I use GnomeDo, or the one that comes bundled in KDE (is there anything halfway like Quicksilver/Do for KDE?) 90% of time, but am paranoid enough to know that a good heirarchy withstands most other organization fads. There is only so much tags and metadata can do when the standard evaporates, but folders and logical organization will be around forever.

--
A patriot must always be ready to defend his country against his government. -edward abbey
Re:Directories, b*tch by Anonymous Coward · 2011-02-13 10:17 · Score: 0

What happens when one TOPIC has code in two languages?
Re:Directories, b*tch by lennier · 2011-02-13 12:21 · Score: 1

I despise programs that rearrange your files for you, make ridiculous subdirs w/out permissions, etc.

++this, though I don't actually mind iTunes (actually I have Rhythmbox on Ubuntu for my main music store, and use iTunes on Windows 7 just for purchasing new singles). I'm sorta okay with the 'Artist/Album' structure, but it sometimes makes organising albums of songs by multiple artists annoying.
But for photos... I had a very bad experience with F-Spot wanting to take over the process of importing - and then losing - all my photos. Shotwell seems better behaved. Every few weeks I sync my camera - I do it manually, creating a folder under 'photos' in the form yyyy-mm-dd, copy all the photos since last time, then import that folder into Shotwell. Never ever do I let a dedicated photo management application try to put stuff anywhere. That's what the filesystem is for, a reader application's job is to read it, not to write it.
I don't like the proliferation of all these 'organiser' apps. It seems to be fundamentally wrongheaded to me. If I want to organise my files, that should be a function of the filesystem, not an app.
I wish an operating system had a concept of 'query folders' where I could have honest-to-goodness filesystem folders, indistinguishable from real ones, where membership and even name was the result of a script. Then I could create as many such folders as I wanted to organise my stuff, and let my apps get on with editing and displaying and NOT organising. That way I could use whatever app I wanted to do whatever I wanted and not be in the position of 'darn, I renamed my external hard drive and now Rhythmbox can't find anything'.

--
You are not a brain: http://books.google.com/books?id=2oV61CeDx-YC
Re:Directories, b*tch by Zancarius · 2011-02-13 13:51 · Score: 1

That behavior in iTunes is no longer the default, at least. And, last I checked, it could keep a large, unorganized, directory organized within itself, without mucking with the directory.
As I seem to recall, the Windows version of iTunes (which is terrible, I admit, but that's another rant entirely) would at least ask if you wished for iTunes to manager your collection for you. I can't think of an instance where it automatically took it upon itself to do what it wanted, unlike what the OP suggested. Then again, I don't own a Mac.
Generally, though, I just bounce between using Winamp and a Python-based Amarok clone when I'm under Ubuntu. I loved the daylights out of Amarok when I used to use Gentoo as my primary OS, but the blasphemy that has become the latest version was abhorrent that last time I tried it. But for me, it doesn't really matter. I always have my music under some /mp3s/artists/[artist_name]/[album] structure anyway, and while I have about 40GiB worth of music, I seldom need a tagging system to keep track of what I want to listen to.
Then again, my listening habits are such that I'll listen to one song repeatedly until the neighbors lose their minds, so aside from queuing my entire collection and shuffling at random when I want to listen to something new, there's almost no point in using something like iTunes. Even Amarok was a little bit of overkill.
(As an example, I have listened to Disturbed's "Facade" from Indestructible around 3900 times.)
But you're right, it's insanely difficult to keep things well organized under Windows. I have most of my documents on my Windows install sync'd using Subversion, but since I don't like large OS partitions, I've had to do some odd tricks with junction (from sysinternals) to get it to behave nicely. It makes using a real OS a breath of fresh air compared to the nightmare that is the Windows directory layout, although it has improved slightly since the XP days--SLIGHTLY.

--
He who has no .plan has small finger. ~ Confucius on UNIX
Re:Directories, b*tch by Tacvek · 2011-02-13 14:53 · Score: 1

For iTunes:
If the album is a compilation with no primary artist, You set the compilation flag, and the album with will stored in a compilation folder, rather than an artist name. For albums with a primary artist, you put the actual artist of each song in the artist field and put the overall artist of the album (the name seen on the front or spine of the CD case) in each song as the "Album Artist", and the whole Album will be stored under the directory of the Album Artist.

--
Stylish sheet to fix many problems in Slashdot's D3: https://gist.github.com/801524
Re:Directories, b*tch by Omestes · 2011-02-13 20:30 · Score: 1

I'm not the largest fan of Itunes on Windows either (too much bloat for the sake of products I don't own or want), but it is better for my uses than anything I've found on Linux. I had my HTPC using Ubuntu for awhile and switched it over to Windows7 for lack of an Mp3 player. Amarok was nice, but a bit creepy in modern version, Banshee was nice but died with my library of over 6000 songs, I used Guay-something mostly. I spent a month going through every damn Linux player I could find and none of them were feature rich, stable, or intuitive (pick two). I have a large library, addicted to "constrained shuffle" (like iTunes DJ feature, a random list, weighted by rating, that you can edit), and need something that guests can use with little or no training. Guay-something (forgot its name, and Google is no help) was pretty much perfect, and its "constrained shuffle" was even a bit better than iTunes (in concept, its implementation was a bit off) being able to stick within related songs via Last.fm recommendations. It was hard to explain to random non-nerds during game nights and parties, I never could figure out what the hell some options did despite days of fiddling, and it never was as stable as poor flawed iTunes.
I finally ditched Linux, though, because of Flash. What good is an HTPC that can't handle most streaming video? I feel dirty, and unnerdy about it, but... there comes a time when we have to pick function over ideology. Especially when I expect my girlfriend to be able to handle it. I get the "Flash is Adobe's fault" argument (and not being able to handle HDMI sound is Nvidia's) , but she didn't. "Wait, you have a HTPC that can't handle media? That makes sense how?". Oddly Boxee could handle 1080p streaming fine, but nothing else could, despite rolling back drivers and hand editing, and recompiling random crap. And Boxee become almost useless when Hulu pulled all support from them... again. And Hulu's client fails completely on Linux. None of it is Linux's fault... but damn.

Then again, my listening habits are such that I'll listen to one song repeatedly until the neighbors lose their minds, so aside from queuing my entire collection and shuffling at random when I want to listen to something new, there's almost no point in using something like iTunes. Even Amarok was a little bit of overkill.
We're opposites! I'm a shuffle fanatic. I collect completely random genres and listen to them all mixed on shuffle. Nothing like Mozart, followed by Slayer, followed by Coltrane, followed by Pink Floyd, followed by AC/DC, followed by Tool, followed by Dangermouse. followed by Fantomas, followed by Joanna Newsom, followed by John Zorn covering Ennio Morricone... etc...

It makes using a real OS a breath of fresh air compared to the nightmare that is the Windows directory layout, although it has improved slightly since the XP days--SLIGHTLY.
Sometimes Linux and OS X annoy me too though. Sometimes I ponder whether their glut of directories are a bit antiquated. Yes, there is less redundancy and more specificities and (actually followed) rules, but things can get a bit overwhelming at times. I lean more towards the *nix system though. Better overly specific than sloppy and inconsistant. Nothing is perfect, I suppose.

--
A patriot must always be ready to defend his country against his government. -edward abbey
Re:Directories, b*tch by Zancarius · 2011-02-15 08:28 · Score: 1

I'm not the largest fan of Itunes on Windows either (too much bloat for the sake of products I don't own or want), but it is better for my uses than anything I've found on Linux. I had my HTPC using Ubuntu for awhile and switched it over to Windows7 for lack of an Mp3 player. Amarok was nice, but a bit creepy in modern version, Banshee was nice but died with my library of over 6000 songs, I used Guay-something mostly. I spent a month going through every damn Linux player I could find and none of them were feature rich, stable, or intuitive (pick two).
I have to agree. Though I did appreciate the earlier versions of Amarok, the latest incantation is blasphemy of the highest order. Now, that's not to say it has improved--when I last tried it, their 2.x (was that it?) branch was feature incomplete and still effectively in beta but being pushed to most distros' repositories as shippable software. I ditched it as quickly as I could and reverted back to the v1.4 as soon as possible. Though, that was also around the time I was taking a .NET class and didn't have enough RAM in my box to support running Windows + Visual Studio in a VM, so I ditched it all and returned to Windows. I still dual boot, but the majority of my media consumption (music) is under Windows.

I finally ditched Linux, though, because of Flash. What good is an HTPC that can't handle most streaming video? I feel dirty, and unnerdy about it, but... there comes a time when we have to pick function over ideology. Especially when I expect my girlfriend to be able to handle it. I get the "Flash is Adobe's fault" argument (and not being able to handle HDMI sound is Nvidia's) , but she didn't. "Wait, you have a HTPC that can't handle media? That makes sense how?". Oddly Boxee could handle 1080p streaming fine, but nothing else could, despite rolling back drivers and hand editing, and recompiling random crap. And Boxee become almost useless when Hulu pulled all support from them... again. And Hulu's client fails completely on Linux. None of it is Linux's fault... but damn.
Hear, hear. Flash support under Linux may be OK--if you have a decent gaming rig. The average HTPC can't handle it, though, unless it's a cast off from your previous generation of--*drum roll*--gaming rigs. I built a relatively cheap HTPC using Intel's Atom platform and Ubuntu for my parents in the hopes that I wouldn't have to worry too much about supporting it. I must say that I was rather underwhelmed, and fullscreen Flash is impossible (even with the various Xorg tweaks/Flash tweaks and so forth). It's almost painful to watch.
Admittedly, it improved a bit in recent versions, but I almost regret just not buying a cheap Athlon+mobo and going with that instead. So, really, it's probably more the fault of the Atom, but given my general experiences with Flash being very hit or miss, I think there's plenty of blame to share around.

We're opposites! I'm a shuffle fanatic. I collect completely random genres and listen to them all mixed on shuffle. Nothing like Mozart, followed by Slayer, followed by Coltrane, followed by Pink Floyd, followed by AC/DC, followed by Tool, followed by Dangermouse. followed by Fantomas, followed by Joanna Newsom, followed by John Zorn covering Ennio Morricone... etc...
That's hilarious! My playlist is somewhat schizophrenic, but my listening habits tend to force me to listen to a specific band (or sometimes song, as I pointed out) for a couple days. Then I shift over to something else that's wildly different. Although, it seems that recently I've been shifting between various European metal bands (Unsun seems to be my current favorite) and some weird mix of electronica/trance (Negative Format has one or two songs I've really liked listening to recently, but many are trash). Oh, and maybe a celtic folk song or two.
But, I've always maintained: True musical taste doesn't exist until your playlist is

--
He who has no .plan has small finger. ~ Confucius on UNIX
Re:Directories, b*tch by Omestes · 2011-02-16 06:01 · Score: 1

Admittedly, it improved a bit in recent versions, but I almost regret just not buying a cheap Athlon+mobo and going with that instead. So, really, it's probably more the fault of the Atom, but given my general experiences with Flash being very hit or miss, I think there's plenty of blame to share around.
Naw, a decent Atom (I've got a D510) can handle Flash. I'm running Windows 7 on my HTPC (the same one that had Ubuntu and failed) and it handles streaming 1080p full-screen just fine. Considering the fact that my board maxes out at 2Gb of RAM, its a pretty zippy machine. Also, Boxee, under Ubuntu, ran fullscreen 1080p pretty well (not perfectly, but decently enough to get the job done), but it completely choked in Hulu, Firefox, Chromium, Rekonq (when I was playing with OpenSuse), and any other way you could be inflicted with Flash.
Doing my monthly "essential" back-up is a massive pain in Windows (I know, it should be weekly or daily, but my data doesn't have enough churn to really warrant it) , I get to spend an hour hunting down where some random program might want to stick files. I wouldn't even mind if they just stuck with "/user/name/", at least I could just copy the whole mess and be done with it (with the caveat that it easily lets me keep /user/ on a different partition or disk).
What the hell is up the spacing on these comments now? Go, Go Slashdot 3.0!

--
A patriot must always be ready to defend his country against his government. -edward abbey

Re:There is a new project for binary version contr by woboyle · 2011-02-13 06:09 · Score: 1

Interesting, but most people need (at least I do) more of a data de-duplication tool than anything else. That, and a subject-based whereis tool, would deal with 95+% of the problems most people (and organizations) face in this realm. JMHO, but then I only have about 40 years experience in this field... :-)

--
Sometimes, real fast is almost as good as real-time.

folders + wiki by alexmagni · 2011-02-13 06:10 · Score: 2

I finally dealt with this problem once and for all in the following way. I found the best personal wiki out there (Zim: http://zim-wiki.org/), and wrote a simple python script (http://www.inrim.it/~magni/zimDMS.htm) that scans nightly my folder structure, keeping up-to-date my wiki. My wiki, therefore, is a perfect mirror of my folder structure, with the added bonuses that I can navigate to each folder, comment it, describe its content, insert images, insert links to other folders, and finally by a single click I can open it in the file manager. My ~ 15000 folders are managed perfectly...

Re:folders + wiki by kcbnac · 2011-02-13 06:37 · Score: 1

Interesting...how long does it take to scan the folder structure? (If say, you wanted to run it *right now* because you'd made a bunch of changes to commit notes on; I'm curious on the performance of said system)
I wonder because I like the simplicity of a basic file structure, but the ability to add metadata of some sort without risking the file structure...this seems to be a (relatively) simple way of doing this...
Now only if it could be kept current real-time, without storing the files in a funny system...
Re:folders + wiki by sslayer · 2011-02-13 07:45 · Score: 1

If you read the link from GP you will get these numbers:

example statistics (HP Proliant ML350 quad-core Xeon CPU 1.86GHz)
* n.directories: ~4800.
* initial scan: ~ 14sec.
* initial notebook upgrade (only once): ~ 13min.
* zim folder total dimension: 37MB.
* maintenance scans: ~ 25sec.
So it seems to me that you wouldn't wait more than a few minutes to get the new structure written on the wiki. Sounds pretty good.

ls -lR works. Just add grep. by Anonymous Coward · 2011-02-13 06:12 · Score: 0

Arcane commands?? Terminal window?

Certainly you are confused, grasshopper.

I have a mix of online and offline files. Online files are stored across 20+ machines, but most live on a file server that runs 'updatedb' nightly. That means 'locate' can be used to find any file on that system efficiently.

For media files stored off line, it is all about building a text DB. Those offline files are (usually) stored on numbered optical media and the contents are stored with the equivelant of 'ls -lR' > nnnn.txt. If certain types of files are included in the media, additional information may be pulled from the internet and placed into another text DB with "additional" information. Egrep is used to find anything and the optical media number is shown in the results.

On Windows, you can use locate32 for similar capabilities to the UNIX 'locate' command. I think it will look inside files too, so the egrep command to find which media disc a file is stored would be easy.

I like that it is all TEXT files for efficiency, trivial access, and maintenance.

I've created web interfaces ... never use them. They just get in the way. Wife and kids use those, but only with limited searches based on filename. I search the additional metadata when that is desired.

Remember when you learned how to create a dictionary using a large text file as input?

The old ways are not necessarily bad.

Re:There is a new project for binary version contr by dmbasso · 2011-02-13 06:15 · Score: 1

Your project is very interesting. I will try it.

--
`echo $[0x853204FA81]|tr 0-9 ionbsdeaml`@gmail.com

Two level file structure (maybe three) by Overzeetop · 2011-02-13 06:16 · Score: 1

Everything I have is on an UnRaid box, so it's organized into shares (virtual drives)

Personal Files
Photos (actually, it might be called Images)
Music Archive (all my originals)
iTunes (my working volume/set, compressed to mp4)
Applications
Home Movies (Raw, In Process, & Finished folders)
Files (everything else goes here, organized by what it is - health, rockets, cooking, guitar...etc)
Work Files
My business stuff
Video
Separated by type (DVD, HD, Recorded TV, Youtube)

This lets me backup my Personal and Work volumes with very little fanfare - I use LiveDrive remote storage and local external drives with SyncBack Pro. I don't back up the Video directory. Sure, it'd be a bummer to have to re-rip 400 discs, and lose some old TV shows, but it wouldn't be the end of the word, and it's not worth building a separate box (or two) with 4TB of storage to back up what I own on commercial media. That may change in the next year or two, and when it does, I'll backup with SyncBack just like the others.

The files are organized enough that most files I need can be narrowed down to 2-4 folders, tops. (side note...I have another 2-3 levels under my Files share..but that's just a second level of organization) All my images are cataloged using Picasa. I don't subscribe to the "dump it in a folder and tag it for searching." Even in Evernote I'm relatively organized. I've found that you can never remember the tag you used 4 years ago to search for your stuff. I keep a running log of my work jobs - about 1200+ in the past 8 years - and I still have problems finding specific jobs from too long ago.

Occasionally I'll clean stuff out and reorganize, and having the folders makes it easy. The biggest thing is that the master set is in ONE spot, and all my other machines sync to that spot. Sadly, the LiveDrive engineers are a bunch of useless hacks with an inflated view of their servers, so I don't sync any of my personal stuff there. I use their sync for work because it's the only service that seems to work reliably for less than $200/yr, and I've modified my workflow to use their ass-backwards system. As a result, I have to manually sync things to and from my server if I go remote for a while, but that's rare for my personal stuff; usually if I'm away from home and not on business, I want to be away from technology.

--
Is it just my observation, or are there way too many stupid people in the world?

Proper file structure by Anonymous Coward · 2011-02-13 06:20 · Score: 0

If you don't have a series of nested folders: "Old Laptop," "Desktop," "Laptop Files - to be sorted," "Laptop," "Desktop Documents" with counterparts for pictures, you're not storing files properly.

Re:There is a new project for binary version contr by mekberg · 2011-02-13 06:22 · Score: 1

Yes, I hear that a lot... True deduplications seems to be important for lots of people, and boar will gain "true" data deduplication quite soon, it's on the roadmap. (Even if I doubt that most people actually need it... what do they do? Do they obsessively edit their exif headers all day?)

In the meantime, boar already has a trivial form av deduplication in that it only stores identical files once. This is actually quite useful, as it allows for the cheap creation of "views" of the data. I can keep all my pictures in one place, including the blurry ones, and then have another directory containing only the nice ones, ready for a slideshow. Also, I have often experienced the problem that I have two _almost_ identical copies of some large file tree. Which one to keep? Often, I'd just keep both and wait for a rainy day to sort it out (which never comes). This was one of the things that drove me to create boar. Nowadays, I will just import both trees into my boar repo and delete the originals. Duplicate files will only be stored once in the repo, so the benefit will be almost as good as if I had taken the time to look through all those files, but with zero effort. Maybe I'll merge those similar directories some day, but until then, they'll at least not clutter up my harddrive.

my photo files are the most organized by Anonymous Coward · 2011-02-13 06:26 · Score: 0

All_Photos
raw
2009 (here is a comment you can ignore)
Winter
Spring (echo quarter contains whole months, while seasons do not. Do not be afraid)
Summer
Autum (the postercomment compression filter sucks)
Oct
Nov (I'm afraid of whitespace)
Dec
2009_12_01
DCIM_102345.jpg
2009_12_31
DCIM_103456.JPG

(what the fuck is wrong with whitespace you piece of shit)

All_Phots (repeated for clairty)
2009
jan
feb ...
dec
2009_12_23_solstice_party_01.jpg

backup ...

incoming
apps
firefox_20_install.exe
photoshop_cs5_install.dmg
license_keys
photoshop_cs5.txt

outgoing
(e.g. calendar_2010, slideshow_birthday_party, send_to_parents)

Tags by tiago.bonetti · 2011-02-13 06:36 · Score: 1

Does anyone tried to organize the files in a tag based system? Something like: My C project for a microprocessor class at my university(UFRGS) would have: Programming, C, UFRGS and microprocessors tags. My C projects at work would have: Programming, C, (*company) and (*project) tags. Any idea how to do it?

According to... backup by obarthelemy · 2011-02-13 06:38 · Score: 1

- irreplaceable "live" files, backed up daily (about 200 megs): files I created myself, ie mails, docs, code... It's the "My Documents" folder.
- irreplaceable archives, backed up monthly to dvd (about 4 gigs): mainly photos and home videos: "My Pictures" and "\Archives", plus my "live" files.
- everything, backed up monthly to an external HD: mainly my painstakingly ripped CDs and DVDs, plus all of the above. "Flacs", "MP3s", "Films", "Series", "XXX"

--
The Cloud - because you don't care if your apps and data are up in the air.

SharePoint baby by Anonymous Coward · 2011-02-13 06:41 · Score: 0

Being a system architect specializing in SharePoint tech the choice was rather easy for me; i set up a fileserver and WSS3, created a document library in WSS with an attached event handler that stuck any uploads larger than 500mb or of specific media types (music,video and so on) onto the file server. Everything's nicely tagged and deeply indexed and easily searchable either through the search center or using a connected desktop search (or it will be next week when i get around to upgrading to SharePoint 2010 Foundation), and the videos and music is shared using media sharing.
The family's not aware that there are two separate locations for storage, they just toss their stuff into the library and access it mainly through search or windows media player.

Re:There is a new project for binary version contr by woboyle · 2011-02-13 06:43 · Score: 1

Well, I'll have to check it out in more detail. The name is suckage however! JMHO... Sounds too much like "boaring" (sic).

--
Sometimes, real fast is almost as good as real-time.

OpenAFS by FridayBob · 2011-02-13 06:44 · Score: 1

That's the FOSS version of the venerable Andrew File System (Debian packages available). I use it together with MIT Kerberos V and OpenLDAP. It may not be the easiest system to set up and maintain, but what you get for your efforts -- a distributed file system -- is pretty cool even on a single server.

The problem with NFS and Samba solutions is that the manner in which servers and hard disks are organized has too much influence on directory structure: as disks and servers are changed over time, the structure of the file system usually changes as well. In addition, many users have different drive mappings, which further increases confusion and the risk of files eventually getting lost.

This is not the case with AFS; it's namespace (the AFS file system directory structure) is not influenced by disk structure and all users always see the same directory structure. So, when disks and/or servers are added or removed from a cell, which is what an AFS administrative unit is called, its namespace remains unaffected. In this respect it is less likely that anything will ever be lost.

Re:OpenAFS by Risen888 · 2011-02-14 03:56 · Score: 1

"The problem with NFS and Samba solutions is that the manner in which servers and hard disks are organized has too much influence on directory structure"
Truth. When I first set up my NFS server (back in the back in the back in the day, when I was just learning Linux), I had a public share and a private share, which were (not really, but for instance) /media/public and /media/private on both the server and the clients. Now that's become superfluous to my needs, but so many external programs are pointing at /media/public and /media/private that I'm stuck with it unless I want to really go through some hell to change all those pointers.
And before some asshole says "symlinks!"...no. Not symlinks.

--
Hey, I finally got my first freak! Took you long enough!
Re:OpenAFS by marcosdumay · 2011-02-14 08:29 · Score: 1

Hey, Symlinks!
What do you have against symlinks? You just create 2 dirs anywhere, and put those applications writting on those dirs. Then, when you decide to change again, you just make a couple more of dirs, like the "Desktop/Desktop/Desktop/.../Desktop/old_trash" hierarchy people pointed over and onver on this thread. It beats not changing at all.
Now, seriously, just change the name of the folders and you'll probably see that most of the programs that use them you don't use anymore. Those few that you do still use, reconfigure. It is not that big a deal (Windows users do it all the time, why can't you once?), and beats not being able to organize your things.

--
Rethinking email
Re:OpenAFS by Anonymous Coward · 2011-02-14 11:40 · Score: 0

AFS is good, but AFS takes three competent sysadmins to manage, and still may not work well :)

Infinite monkeys technique... by fahrbot-bot · 2011-02-13 06:51 · Score: 3, Funny

Applying the Infinite Monkey Theorem I put everything into one folder, assigning each file a pseudo-random name. Although there's only one of me, in time, I'm confident that a pattern will emerge...

--
It must have been something you assimilated. . . .

Re:Infinite monkeys technique... by Anonymous Coward · 2011-02-14 07:57 · Score: 0

I had a boss who would save most of his files in whatever directory was handy at the time, with the filename "blah.txt".
If there was already a "blah.txt" in that directory, then it would be named "blahdiblah.txt".
Re:Infinite monkeys technique... by Anonymous Coward · 2011-02-14 07:57 · Score: 0

Applying the Infinite Monkey Theorem I put everything into one folder, assigning each file a pseudo-random name. Although there's only one of me, in time, I'm confident that a pattern will emerge...
Ah. You must be the inventor of the I-pod file system.
Re:Infinite monkeys technique... by marcosdumay · 2011-02-14 08:30 · Score: 1

You are doing it wrong. The infinite monkey theorem only works for really random strings, pseudorandom ones break it.

--
Rethinking email
Re:Infinite monkeys technique... by Anonymous Coward · 2011-02-14 09:50 · Score: 0

This.

Spotlight by revscat · 2011-02-13 06:51 · Score: 1

Spotlight and fuhgeddaboutit.

Basically I don't worry about it any longer. Spotlight let's me search the entire file system, and subsets of it like emails, and OS X is pretty good about automatically generating metadata. Good enough for most circumstances, anyway.

And since you can search the contents of files, this makes looking for that PDF Joe Blow sent you last week dead easy.

Honestly, I have ~/Documents and a few subfolders, and that's about it. Between Spitlight and Quicksilver I don't have to worry about directory hierarchies any more.

Re:Spotlight by Risen888 · 2011-02-14 03:57 · Score: 1

Good luck with that shit in 10-20 years. My directory hierarchies are good for life.

--
Hey, I finally got my first freak! Took you long enough!

They're all over the place by 91degrees · 2011-02-13 07:00 · Score: 1

The hierarchical file system is a nice structure when dealing with floppy disks, was adequately suited for a system where people were creating a few dozen documents, but it's felt a little dated for a while.

I'd just like a new paradigm that doesn't rely on backwards compatibility, and uses metadata as a primary means of associating files with each other.

Just sort your porn alphabetically by Anonymous Coward · 2011-02-13 07:05 · Score: 0

yllacitebahpla nrop ruoy tros tsuJ

Use MKlink or link by teadrop · 2011-02-13 07:19 · Score: 1

The solution is to consolidate all the data your care under one drive, one folder, so whatever you look for you only have one place to look under, one place to backup.

If you are using Windows, consolidate everything under c:\[some_folder]. Even if you run out of space on C Drive, it does not matter. The trick is to use "MkLink" (windows 7 and Vista only, for XP use linkd.exe from resource kit). It is like "link" in Unix, you can create a symbolic link or hard-link to anywhere else on your system, it can be located in an internal/external hard drive, or even network drives. And later if you move some data from d: to e:, your data will still be located in the same location under c:. You don't have to ever reconfigure any app to point to different folders. They will remain at the same location for the next decade or so.... And also, put the command to setup the symbolic links in one batch file, so you can easily recreate all the links when you setup a new computer.

If you are using unix, then you already know link, so no need to say more.

The low level folder name should be tied to your backup strategy. For example....
c:\usr\doc (for all the documents e.g. words, excels, programming files...)
c:\usr\pic (for all your photos...)
c:\usr\mov (for all your movies....)
...

And the backup strategy should be divided into...
CID (Critically Important Data) - Backup daily, create at least 3 copies on 3 different physical media, preferably with one offsite, e.g. the c:\usr\doc in the above example.
KID (Kind of Important Data) - Backup weekly, create at last 2 copies on 2 different physical media. e.g. c:\usr\pic in the above example
IDC (I Don't Care) - No need to backup, I may cry if I lose the copy, but I will get over it....

definitely by Anonymous Coward · 2011-02-13 07:23 · Score: 0

through a wisely ordered directory structure, though indexed with tracker. i have hundred thousands of files on my disks, so tags would drive me crazy and are imho insufficient. backups and syncing with rsync. i tend also to buy drives with the same block count, so i can backup and clone directly with dd. pretty oldschool, nothing fancy, for my data is sacred and should not get lost through insecure technologies like version control, binary diffs or distributed file systems. i really wish there'd be a standard for metadata and everybody using it appropriately.

Mostly the same way I did it in 1993 by Vyse+of+Arcadia · 2011-02-13 07:25 · Score: 1

Directories! Occasionally, desktop search provided by Nepomuk if I can't find it in less than two minutes or so of manual searching. Once in a blue moon, good ol' find piped to sort piped to less.

Duplicates by randallman · 2011-02-13 07:29 · Score: 1

I ran into problems with duplicates over the years from copying files off my laptop before installing a new OS and for other reasons. I used dupmerge, which identifies the duplicate files and uses hardlinks to keep only a single copy. Freed up quite a bit of space for me.

It's all about the file manager ... by Anonymous Coward · 2011-02-13 07:32 · Score: 1

I would lose my mind without Directory Opus. I would call it the swiss army knife of file system tools. Been using it since the Amiga days.

If you have a decent sense of order and something like Directory Opus - it goes a long way in maintaining your sanity.

directories and year by mazda_corolla · 2011-02-13 07:47 · Score: 1

Category and Date.
Overall, I like date-based folders (2010, 2009, etc.).
But first, I use a couple of high-level categories based on the type.

MyFiles
...2010
......ProjectA

MyMedia
...Pics
......2010
...Music
......Artist A

MyFinance
...2010
......BankA

MyFiles
...2010
......ProjectB

Bucket
...random files...

The categories are:

MyMedia / pics -> large files, changes slowly, not replaceable if lost. #1 backup priority
MyFinance -> bank statements, etc. -> small files, old stuff not replaceable, new stuff replaceable . #2 backup priority
MyFiles -> stuff I create / want to keep. -> small files, not replaceable. #3 backup priority
MyMedia / music -> large files, changes slowly, replaceable for $ or time. #4 backup priority
Bucket -> temporary files, downloads, etc. -> mix of large and small files. Not backed up.

This helps me organize my backups.
Nightly backup job copies everything to internal hard disk #2, and an offsite backup (crashplan) backs up offsite categories 1-3.

I don't tag. There's no universal format, so I stick with directories and filenames only.

bunch of hypocrites by XCondE · 2011-02-13 07:48 · Score: 2

Of everyone who posted their fancy choice of directory structure *nobody* told us where they keep their ~/.pr0n

Re:bunch of hypocrites by gmhowell · 2011-02-13 15:42 · Score: 1

Of everyone who posted their fancy choice of directory structure *nobody* told us where they keep their ~/.pr0n
Well, obviously that is the name of the root directory under which these other structures appear. Weird bit of obfuscation, but, whatever.

--
Jesus was all right but his disciples were thick and ordinary. -John Lennon

For Mac people by 605dave · 2011-02-13 07:52 · Score: 1

I don't want to start an argument about which system is best for this task, but rather I have some ideas about how you could do this if you are on a Mac. I have over 20TB of storage in my home studio, with hundreds of thousands of files. Its one thing to have that much data, but its another to have it well organized. To that end, here are the apps I currently use on the Mac to organize my data

iTunes - 2TB of media on NAS drives
iPhoto - family and personal pics
Aperture - handles DSLR semi pro collections
iMovie - all family DV and HDV files
Suitcase Fusion - handles fonts very well
Final Cut Server - Media files for studio
Time Machine - local Mini server running OS X server backs up all local machines

It is also important to organize you data, and these work great for me.

Bento - personal database stores serial numbers, passwords, insurance info, and much more. Local network sharing
Yojimbo - stores and syncs news, photos, notes, and other media.
Things - amazing to do list

After years of trying many choices, these are apps on a regular basis. Good luck...

--
Be kind, for everyone you meet is fighting a difficult battle. - Plato

The logfile is the key by Anonymous Coward · 2011-02-13 07:58 · Score: 0

Everything is on 2 seperate harddrives at the same time. Some of the downloaded things like drivers and things that will be updated are deleted. A logfile where I just write notes and some subdirectories for rough sorting.

Re:There is a new project for binary version contr by maxume · 2011-02-13 08:05 · Score: 1

How does it compare to bup?

--
Nerd rage is the funniest rage.

Those who have not used Usenet... by SPrintF · 2011-02-13 08:12 · Score: 1

... are condemned to repeat it.

I understand the problem. At work and at home, I use my homebrew hierarchy to manage my files. But looking at this thread, I felt a wave of nostalgia for Usenet's alt.barney.dinosaur.die.die.die.

The worst "hierarchies" I've encountered at my work are:

1. Save everything on the Desktop. (Good Heavens, woman! Do you even have wallpaper?!)

2. New Folder. New Folder2. New Folder3. Etc.

--

Honesty. Loyalty. Kindness. Laughter. Generosity. Magic!

Simplicity & Easily readable by Phurge · 2011-02-13 08:12 · Score: 1

Max two directories deep.
Reverse date system
Everything gets a folder (no unfiled files)

eg:
Music/Pink Floyd - Dark Side of the Moon
Photos/2008-11 Europe Vacation

--
I'll see your hokum and raise you a boondoggle.

Re:There is a new project for binary version contr by mekberg · 2011-02-13 08:28 · Score: 1

Bup seems to be a great backup program, while Boar aims to be a full vcs. There is certainly some overlap between these types of applications, but in general, backups will sooner or later be replaced by more recent backups, and it is up to you to make sure that you don't accidentially delete any files, because you will have a limited amount of file history. Now, bup seems to allow for efficient backups, so perhaps you don't need to (cannot?) purge old backups. However, I like vcs-like workdir concept, and a commit and update command to keep workdirs synchronized between different computers. Also, boar allows for efficiently making or updating verified copies of the repository, which makes it easy to maintain good backups. But granted, the distinction in general between a good backup tool and a vcs is blurry.

Whats wrong with 50 duplicates on different media? by Anonymous Coward · 2011-02-13 08:46 · Score: 0

Check the price of data storage.

Redundancy is cheap, data loss is expensive.

Re:Directories / split over machines / crashplan by AliasMarlowe · 2011-02-13 08:52 · Score: 1

Yup, distribute across the LAN, and use frequent backups and sensible write permissions.
I have a tree of directories with mostly sensible names on the server (2TB) for stuff that's worth filing away, with automated nightly backup to one of two external 2TB disks. Another 2TB disk contains videos and movies which are supplied on demand to the TV, and a 1TB disk contains music and audio lectures (we've spent a fortune on stuff from the Teaching Company, for instance). The latter two disks are not in the backup cycle, as they are mirrors of disks on two of the workstations at home.
The kids have been instructed that only deletable stuff should be saved locally on a workstation - anything to be retained should be in their directories on the server. The kids have read access to almost all of the server (a few "special" directories are off-limits) but write access only to their own home directories. They know I'll copy the server's stuff to new disks or a new server when the time comes, but workstation disks are just wiped...
Once upon a time, I used to make file links if a particular file reasonably belonged in more than one directory in our tree. However, I stopped this since it was extra work without tangible benefit: locate has never let me down on finding a file.

--
Those who can make you believe absurdities can make you commit atrocities. - Voltaire

Power Tools for Organization of Files by LBook3 · 2011-02-13 08:54 · Score: 1

Organizing your directories and files are just the beginning. I have found I need to use a couple of tools to have the power I need to find the specific document, phrase, number, image (w/tags), etc. I use the following: (1) Google Desktop (not all file formats supported); (2) A database software to catalog and LINK the document (full document path) for those very special items I KNOW I will need to find again [Software program: DBTextworks by Inmagic]; (3) Indexing Software - similar to Google Desktop - more robust and can handle MOST Professional Software Formats [Software Program: ISYS by Odyssey based in Australia]. Yes, I use all of these methods in combination WITH organizing directories and files. Whether you use multiple hard drives, a server system, or a single pc, these tools can make your file organization much, much easier. I use this power tool combination professionally and at home. I can spend more of my time working on the task at hand, instead of searching for "lost" documents. These programs run on PC (Micosoft operating systems) and were not designed for the MAC OS.

Let's see... by he-sk · 2011-02-13 09:14 · Score: 1

For Media:

- Photos are organized in Lightroom and iPhoto under a ~/Pictures folder.
- Music is organized by iTunes under ~/Music.
- There is a ~/Movies folder, but that's only for downloads. I don't keep movies around after watching them. Waste of space.

I have a ~/unix directory which is the prefix for manually compiled software. My own code also lives under ~/unix/src and is installed into ~/unix/bin. Oh, because of Eclipse there's also a ~/Workspace folder lying around. I should move that somewhere else.

Then there's ~/org for my org-mode system. This keeps track of notes, todos, appointments, and more and more project files, including my own scripts and stuff I do for Uni.

Mail lives under ~/Mail which I access via mutt and search/index with maildir-utils. I dump all mail into an =Archive mailbox and clean out my =Inbox every day. If a mail requires further action I capture it within org-mode to get it into my system and move it to =Archive with one key. mutt is also integrated with the OS X address book. (The integration of the address book with org-mode is an open loop.)

Downloads (except movies) from the internet go straight to ~/Desktop which I also try to clean out every day. Occasionally, I will check out a project folder and keep it around longer, which brings me to...

~/git. In there I keep (bare) git repositories for my org-system and e-mail (checked in daily), my code, and other large projects. I also use these repositories to sync stuff across different machines.

Oh, and then there's an 3-year old, 30-GB-large encrypted disk-image from my last computer lying around which contains stuff that I haven't brought over yet. (20 GB of that are photos. Most of the rest are caches and data cruft from programs.)

Last time I checked, my whole digital life (minus photos) fits into 6 GB. That includes a 15-year-old e-mail archive and all the other stuff I've kept since then. I have around 30 GB in music lying around but I don't count it because I find it completely replaceable and there are good online options available now, such as Soundcloud and others. (I'm also lucky that there are two excellent radio stations available here: Fritz and Motor.FM.)

Backups are handled by TimeMachine and I regularly swap out the drive and take it to my parents' place for offsite storage.

I generally try to avoid folders for organization and prefer to access the files through a software layer using folders only as a backend. For media there's ready-made stuff (iTunes, iPhoto) and Lightroom allows me to impose my preferred structure. My own stuff I usually manage through org and git. I also try to go through my stuff once a year and clean up the cruft. That keeps everything nice and clean.

--
Free Manning, jail Obama.

Simple by snookiex · 2011-02-13 09:17 · Score: 1

docs, img, programs, code, music + grep + find. Definitively everything had become very easy since I don't watch porn anymore.

--
Open Source Network Inventory for the masses! Kuwaiba

Nemo Documents by daserver · 2011-02-13 09:33 · Score: 3, Interesting

If you are on Windows you might want to give Nemo Documents a try. It gives a time based view and allows one to use tags. Disclaimer: author posting ;-)

Directories by Anonymous Coward · 2011-02-13 09:47 · Score: 0

private/backup/
private/documents/
private/pictures/
private/source/
private/videos/
public/books/
public/games/
public/images/
public/movies/
public/music/
public/software/
downloads/complete/
downloads/incomplete/
downloads/torrents/

No dependencies, requirements, maintenance, or bugs. Easy.

Directories + NAS + (specific) Cloud Services by BlackCreek · 2011-02-13 10:29 · Score: 1

- Fotos: Smugmug account + local NAS.
- _General_ personal files: organized by directory with Faubackup backups to local NAS (no offsite backup)
- Music & Video (i.e. entertainment, not personal): local NAS.
- Smaller more important files (code, latex docs, text): git + github account
- Contacts, addresses, agenda: Google cloud
- General notes: Remember The Milk "pro" account

Archive by Anonymous Coward · 2011-02-13 10:37 · Score: 0

I recently got a laptop for the first time since I spend most of my time away from my main computer now. It's easy enough organising files in one location, but keeping them synchronised and secure is another matter.

The solution I came up with for myself is making use of an archive.

I have five main folders: Audio, Documents, Images, Software and Videos. I also have the same hierarchy in my archive folder on my main computer. This allows me to move the bulk of my files into the archive and keep my other folders synchronised with Unison.

Harddrive space is cheap, but:
- file transfers are slow,
- RAID isn't very portable,
- network access isn't always available and
- you don't need access to *everything* all the time.

My Problem by Anonymous Coward · 2011-02-13 10:57 · Score: 0

My wife used to share an iTunes share on the network, but I started hauling my computer to work so our collections diverged and she now has things I don't and vice versa. In some cases, we'll have both imported the same cd, and one will have read better. I have no idea how I'm supposed to figure out exact duplicates or better copies, etc.

My Directory Structure by tomhermann · 2011-02-13 11:00 · Score: 1

I concur with many of the others, directories are still a good way to organize your files :)

I have a three terabyte raid-5 Ubuntu file server I use to serve up the media in my house. I stick with a simple directory structure trying to keep it as shallow as possible:

/anime
/downloads
/e-books
/movies
/music
/pictures
/programming
/tv

This seems to work well for me, but just keep moving stuff around and you'll figure it out. File management is easy with Samba or your favorite SFTP client. I've also found that a bluray burner is good for stuff I want to keep, but in reality am never going to look at again.

Good Luck!

metadata file system, anyone? by hendrikboom · 2011-02-13 11:48 · Score: 1

It's been proposed to have a database whose keys are metadata tags, whether improvised algorithmically, found within files of certain types, or hand-assigned. This would be fine when you want to find a file yourself. But many programs want to use file names. A Linux user-mode file system could query the data base and provide access to the files it finds. Designing the right notation would involve serious insight. The file name will likely need components addressing routing, status (do you want a developer version of a stable one?), content, and other attributes.

I don't... by frank_adrian314159 · 2011-02-13 12:25 · Score: 1

It's really a simple answer. Most stuff you'll never need again. Really. Some guy storing .iso images? Burn it to a CD/DVD and keep the freaking disk. If it's not important enough to keep up with retention, chances are it's not important enough to hold onto. Photos? Do you really need pics of your girlfriend thrice removed (unless she's naked, of course, in which case why haven't you already uploaded it to the internet to share with us)? Old emails? If they're really important, print them out and file them. If not, see above - most of the crap you're hauling about just isn't that important.

The rest of the stuff that you actually do reference frequently will fit just fine in a hierarchical file system. That stuff that you really need and can't, for the life of you, find, you can probably get from one of your friends - take the time to reconnect to them as a person rather than with just their information. Finally, not remembering some things can be more precious than remembering. It brings refound treasures and retold stories back into your world. And, trust me, those are a lot more important than any file you've misplaced...

--
That is all.

Re:I don't... by marcosdumay · 2011-02-14 08:36 · Score: 1

Wow, you must have a pretty big warehouse! While I'm concerned about digitalizing everything, put those .iso in HDs, and making the big versions of them go away, you are advocating more paper and disks.

--
Rethinking email

Re:Whats wrong with 50 duplicates on different med by Anonymous Coward · 2011-02-13 12:32 · Score: 0

Redundancy can be done the wrong way (50 copies on 50 media) or the smart way - one copy on a redundant storage with regular backups off that. You get the benefit of cheap media and no management drawbacks, noob.

To get to the smart way, you need organization.

What I use and problems encountered by Bomarc · 2011-02-13 12:40 · Score: 2

Having just started cleaning my house, this story comes close to my heart. Looking around, I have 6 boxes of old “documents”. What to do with them?

First to cover the common areas:
Video:
I have two TIVO boxes, one is high definition, both recording constantly.
I have one system with 8TB of storage to sort/organize the incoming TIVO recording.
I’m setting up two 60TB servers for my “movies and TV shows”. (Each will handle 26 hard drives). I use the term “setting up” as I’ve run into some issues with these systems.

Binary:
I have a 2TB system set up for binary files. (This would be development, OS, drivers, patches and the like). You never know when you will need a DOS bootable disc.

Music:
I have one system (with 2TB storage) to handle my MP3’s. (Still need to sort/organize/remove duplicates). Currently this one also houses my image collection, important documents and the like. It is acting as kind of a catchall for everything else.

Data:
I’ve recently set up a system to handle “data” (document based); with 130 GB of space. I’m using “Home Document Manager” . Though not mature, they are more amenable to fixing the problems.

And now to the point: Organization.
Overview
The first – glaring issue is lack of a good storage house. Most management systems sort a single file in a single location, sometimes with tags. A good example of the problem that I found: what if I have a Medical Bill, which is being kept for Legal reasons, which I will need at Tax time? What if I have a MP3, Music Video and Movie that I would like to tie together (or heaven forbid multiple playlists)? Or Movie props that I’ve purchased off eBay.
I would not like to keep the medical bill after 3 years, but for legal reason would like to keep it for seven. I don’t want to delete the “item”, but I no longer need to be reminded about the “bill”. I don’t want to have multiple copies of the same item, which makes searching a nightmare. And “tags” are a start, but are not granular enough.

Video organization:
Extreme Movie Manager. Ok, it has some bugs, but it does a VERY good job. With its multiple views, and multiple ways of keeping track of movies, it is the best one that I’ve seen.

Music: Currently I’m (just) using Media Monkey and MS Media Player. Media Money has a severe limitation in that it does not handle video (read music videos-Watch "Vertical Lines" by Leather Hands to get the point). I attempted to use an “automated sorting” system, however it has significant issues, the biggest being it took MPS’s from a known group (1970’s for example), and moved them to “Unknown”, “Unknown”. Can’t use that. I also used Clone Master, and found that I have almost 2500 duplicate (MP3) files. Unfortunately, it “guesses” the wrong one most time for the likely file needing to be deleted.

Binary is actually the most straightforward simple file structure

Other issues:
Video Servers: I’m also running hard drive selection into issues with the video servers. The problem is: Enterprise class SATA drives are expensive, “small” (only 2TB), fast (as such they use a lot more energy). “Green” drives are cheap and plentiful and use a lot less power (and generate a lot less heat) however they are not compatible with the RAID controllers needed.

Video Playback: I have a decent system to handle the Blue-ray, high def requirements. However the software also has problems: In/with high def you can’t read the “default” fonts displayed

Re:What I use and problems encountered by hendrikboom · 2011-02-13 13:39 · Score: 1

Linux has software RAID, which doesn't really care what kind of hard drives you have. But that probably won't help you with Windows.
Re:What I use and problems encountered by Anonymous Coward · 2011-02-13 23:46 · Score: 0

You sir are fairly outdated on that, or trolling. Let me assure you that Windows can do RAID0, which you seem to refer to, just fine. Read http://technet.microsoft.com/en-us/library/cc732422.aspx for assistance on how to do it.
Re:What I use and problems encountered by jedidiah · 2011-02-14 02:39 · Score: 1

That's the problem with "ease of use systems". Everyone is encouraged to be completely clueless and helpless.
So not even the "native users" are completely adept at fully utilizing the system.

--
A Pirate and a Puritan look the same on a balance sheet.
Re:What I use and problems encountered by ewhac · 2011-02-16 10:35 · Score: 1

I have two TIVO boxes, one is high definition, both recording constantly. I have one system with 8TB of storage to sort/organize the incoming TIVO recording...
How are you able to get the hi-def programs off the TiVo and on to external file storage? Our TiVo sniffs derisively at us if we try to do that (depending on the show). Also, that must be achingly slow, since TiVo throttles network transfer rates.

--
Editor, A1-AAA AmeriCaptions
Re:What I use and problems encountered by Anonymous Coward · 2011-02-16 11:20 · Score: 0

Tivo Desktop. Note that some channels won't transfer (copy protection).
As for speed, I have a dedicated system. It’s not gigabit, but it does transfer, and I just let it run at it's own speed.

Hashes and checksums by Anonymous Coward · 2011-02-13 12:48 · Score: 0

It's surprising that no one seems to have mentioned hashing files or writing checksums to file yet. Besides organizing the files it's probably the most important thing you can do. There don't seem to be very many graphical interface tools for it, but here are two commands that work:
Within the top directory:
$ find . -type f -exec sha512sum '{}' + | tee hashes.sha
To verify:
$ sha512sum -c --quiet hashes.sha | tee verify.txt
Strongly recommend running this at least once. It's a good idea to hash before copying to a new hard drive then verify afterward. I was able to detect and prevent more data corruption issues by doing this on a Samsung Spinpoint F4 hard drive which had a (now resolved) firmware bug for example. Verification tasks can be split up over different operations by cutting lines out of the file and placing them elsewhere, so that one directory is checked while others aren't (due to how time-consuming the process is).
Also, while organizing personal data onto many hard drives, keep in mind encryption options that are very convenient (especially on Ubuntu). Truecrypt hard drives look like random binary data initially which makes it harder to know if a drive has the wrong password or is simply uninitialized. Cryptsetup works and prompts for passwords on Ubuntu:
$ sudo cryptsetup luksFormat /dev/sda1
$ sudo cryptsetup luksOpen /dev/sda1 securebackup
$ sudo mkfs.ext4 -m 1 -L securebackup /dev/mapper/securebackup
Now, after unplugging the drive and plugging it back in, Ubuntu should automatically detect the presence of an encrypted volume.
Even with multiple computers it's possible to keep all the files in one location this way. A proper boot hard drive isn't even needed because the Ubuntu live CD, whether on a physical disk or on a (faster) thumb drive includes the cryptsetup software. Or, for added security, install Ubuntu using the alternate installer with encrypted system partitions.

Re:Hashes and checksums by cool_arrow · 2011-02-13 14:12 · Score: 1

This works well on win: http://www.exactfile.com/

...also dates in file names by LongearedBat · 2011-02-13 14:00 · Score: 1

For my personal photos and movies I sub categorise them using international date format in sub folders named like so:
"1998-03-30 - X's birthday party" (A single day, so full date.)
"1999-06 - Travel to Europe" (A few weeks, so just putting the month in.)
etc.

That way, I can sort alphabetically yet still find events in a sort of timeline.

Or for tax records... (submitted quarterly)
"BAS - 2009-Q4"
"BAS - 2010-Q1"

All the BAS sibmissions are now in order, but grouped together.

fdupes by Anonymous Coward · 2011-02-13 14:55 · Score: 1

I use fdupes, http://netdial.caribe.net/~adrian2/fdupes.html

and FSLint looks interesting http://www.pixelbeat.org/fslint/

Of course, it would be nice to be able to replace some of the dupes with links

Huge Amount of FAIL by Anonymous Coward · 2011-02-13 15:52 · Score: 0

"amassed a huge amount of files in [a] huge amount of formats about a huge amount of topics"

Files, formats and topics are all numerable, unlike, say, wheat, so you want "number" rather than "amount".

30 years of attention to English would have been awesome!

My System by Scarletdown · 2011-02-13 16:22 · Score: 1

My filing system is still under development off in the Someday Isles, as in:

Someday I'll get around to sorting and organizing all my stuff, but not right now.

--
This space unintentionally left blank.

Backup vs. Archive by joeyadams · 2011-02-13 16:41 · Score: 1

Although I have by no means figured out the file problem, I've learned what I believe to be the fundamental formula of data management:

Backups != Archives

When I first got a computer with a CD/DVD burner, I was thrilled, as I would finally have limitless and cheap data storage. When I filled up my disk, instead of having to delete files to create new ones, I could "back up" the files that were taking a lot of space.

It came to pass that I literally had hundreds of CDs and DVDs with a variety of backups using multiple indexing schemes. AVI_SET_1 through AVI_SET_5. LS000 through LS038. DVD±RWs 0 through 8. Many of my discs remain unlabeled. Most of them contain a hodgepodge of file types, many of which are outdated.

I have hundreds of gigabytes of data, but most of it is unaccessible. When I want that project file from five years ago, I can't just search my hard drive for it. I have to dig through disc after disc, insert it, look at the contents, and occasionally write a memo of some sort concerning the contents of the disc (often losing track of the memo in mere months). It would be a lot easier if it were all on one monster hard drive (better yet, two hard drives in mirrored RAID configuration). If I actually knew how to organize the files as well, they might not even take that much space, and I wouldn't have needed today's hard drive prices to replace compact discs for the type of storage I was doing.

I failed to understand what a backup is for and what an archive is for. A backup is for disaster recovery. An archive is for shoving files out of sight until I need them in the future. I was using backup technology (compact discs) for archiving.

Let's say that, of all the data one has, 99% of it is junk that you hang on to "just in case", and 1% of it is used routinely (these figures vary wildly in real life). Backups target the 1%, while archives target the 99%. Fortunately, for most people, a DVD is more than sufficient to store the 1% at any given moment in time (operating system files aside).

The bottom line is, an ideal (and cheap) arrangement to keep archives and backups in their respective places is:

Buy two or more big hard drives (e.g. 1TB each), set them up in RAID-1 (mirrored) configuration, and use them to archive junk. When you put a file in the archive, you can delete it from your system. You shouldn't have to worry about losing your junk files, as they are stored redundantly. This assumption is the whole purpose of the junk drive: you can delete files from your system without blotting them out of existence.
Once a month or so, burn a DVD backing up all of your most important files, as if you were grabbing your valuables out of a burning building. However, once you have done so, proceed as though this backup does not exist (i.e. don't delete the corresponding files from your hard drive). Backups are for disaster recovery only. I'll say it again, backups are for disaster recovery only .

Re:Backup vs. Archive by pnutjam · 2011-02-14 03:39 · Score: 1

I'm thinking of launching an annual archiving service, we come to your house and archive your stuff annually, or more often. Store the data in a Truecrypt container on an offline server, and back up to a removable Hard Drive.

--
Cheap storage VM.

Only archive important stuff by Anonymous Coward · 2011-02-13 17:28 · Score: 0

On my computer (Linux), I have gobs of directories full of applications that I use, and I archive shell scripts I create to build more complicated bits (if it takes 20 packages to build a major package like ffmpeg, use a script to do the job, and it gets done much faster, since it builds all packages, configures all pieces, moves all subdirectories, sets all permissions, modifies all configuration files from one place, one command (to rule them all). Other than the script, the source files can be fetched from the net (usually newer versions), so don't archive it. System files are untouched. I use rsync to do backups/incremental backups. It does a first 'big, everything' backup, and every subsequent run is incremental (unless you delete the big one, in which case it makes another big one). Mail, pictures, movies, etc., all gets backed up to a second drive. Every year or two, create a dvd of anything more than 2 years old (keep it secret, keep it safe), and toast it from the drive. If you are nervous about the DVD, make two. 4 GB of data is quite a bit of data. Done.

Simpler Times by davesque · 2011-02-13 17:55 · Score: 1

In my opinion, computing has become much more simple in recent years. Most major operating systems are of good quality and have worked out the major BSOD kinks. Furthermore, advancements in interface design and process flow have made computer use much more intuitive.

From all that, I've found that I actually hoard far fewer files these days. I have one folder that contains my documents (with some subfolders to organize topics, but not more than one layer of subfolders). My music sits in iTunes (yes, I know where it really is). Photos sit in iPhoto (same story). I really don't track that many files all together. I think it could fit all on one 4 GB USB stick.

Has anyone else had this experience lately?

My Planned setup by Blue_Wombat · 2011-02-13 20:22 · Score: 1

Timely article, I am in the process of re-doing my setup at the moment. Here is the plan, most of which I intend to implement over the next 2-3 months:

1. A NAS box, with either 8 or 12 TB of storage in JBOD configuration as a media server. Media files originate with my main PC, and are transferred to the NAS box. As this is done the converted files are duplicated to a USB external drive (stored offsite), and also burned to DVDs (stored in the garage, seperate from the house, with numbered disks and an index file on the PC for recovery). Original disks are stored in a safe cool place - re-rippable in extremis.About 300 CDs and 1,400 DVDs (some purchased, some recorded off cable on a DVD recorder) still to process - tedious! Emulator software, copies of ebooks, my photos etc will also go here. Organised in a logica directory structure (/media/TV/documentaries, /photos/2010_03_15_Fred's_Party/ etc). As well as my main network, a pair of WD TV Live's will be wirelessly linked to this. For photo directories etc, I will include a small text file describing the contents.

2. A second NAS box with 2x2TB drives in RAID 1 configuration. This is for the stuff I am working on and that has value to me & isn't easily replaced. This will be backed up nightly to a USB drive. Two external USB drives will be used for the backing up, one kept at home, the other locked in my drawer at work, and rotated weekly. All three of my machines will be backed up to this box. These will be encrypted.

As disk capacities continue grow exponentially, and my storage needs will as well, I will just keep migrating the whole kit and kaboodle to NAS boxes with ever larger capacity, but with the same structure intact.

Movies by iam34m0 · 2011-02-14 00:31 · Score: 1

Have quite an international taste in movies, so I like the following folder format: $countryOrRegion/$YearOfRelease/$MovieName.$YearOfRelease.avi

Music Genres for me == Colors (e.g. Red, Blue,...) by ginbot462 · 2011-02-14 02:43 · Score: 1

At least I know I have a lot of company with my "stuff", "sort", "move_me", "misc", "etc" folders.
I had the same issue with music genres, but genres translated to user space (i.e. what use does it have to me) as what was I in the mood for. So, I just started organizing music by color. Red and blue being the most obvious. Still, bands that have really speedy songs and ballads? Well, I class them by the "majority".
I just realized, I don't have a "black" folder. I am more optimistic than I previously thought. Wait.. is that right? Maybe black can be for doubt, so Nickelback can go there as it shows there is little hope for humanity.

(As stated in another comment: Nickleback would just be porn)

--
Atlas Shrugged : Thematic Story :: Battlefield Earth : Organized Religion

I use OS X and Spotlight by mario_grgic · 2011-02-14 02:46 · Score: 1

I have over million files and even though I do put them into folder hierarchies, I tend not to browse for files but simply use Spotlight to quickly get to them. Spotlight is amazingly usable, fast and always there (and it's available from the CLI as well). Queries are instantly updated as you type, can be saved as dynamic folders (views that update as more files match criteria) etc.

So for example, interested in all pdf books that mention python in file name? Type kind:pdf filename:pyton

Interested in all pictures you took at aperture f4 and where you used flash? Type kind:image fnumber:4 flash:1

And of course you can always put additional metadata on your files to find interesting binary files.

--
As the island of our knowledge grows, so does the shore of our ignorance.

Directory structures. The Right Way (tm). by Risen888 · 2011-02-14 04:57 · Score: 1

A sensible directory structure never goes out of style. Human-readable, dependent on no external anything, eminently portable and transferable, and altogether future-proof. Metadata's as good as whatever the standard is. If you're pouring time into creating metadata for some gimmicky piece of proprietary garbage, you're only hurting yourself in the long run.

Without going into superfluous detail on my directory structure (lots of other people have discussed that here), I will say that the one area where I do use metadata religiously is in id3 tags for audio files. My audio files are sorted sensibly, a la /audio/artist/year - album/tracknumber - trackname, with all metadata as close to perfect as I can possibly make it. When I've got new music to add, it goes into the unsorted directory outside my audio path, where it sits until I get around to tagging and bagging it (I've been using Easytag for years, but lately I've been experimenting a bit with kid3), getting the names and directories uniform, et cetera, when it gets included in the audio structure. This is good for all kinds of reasons, especially human-readability and the ability to just copy shit over to an external drive or mp3 player (or import into an application database or whatever I want to do today) with no muss, fuss, or bother. It Just Works.

I learned this the hard way about six years ago when I first decided to organize my absolute shitty mess of audio files, which at the time was around 60GB. That process took months, but when I was finally through it it was perfect. Everything's named correctly, tagged correctly, all the cover art's there and named correctly, everything is perfect. I am determined to never have to go through that again, thus the unsorted directory regime.

--
Hey, I finally got my first freak! Took you long enough!

And how do you recognise stuff when you find it... by Rozzin · 2011-02-14 08:37 · Score: 1

The limit to any organisational scheme that you're going to end up using is where, after drilling down through whatever directory-structure you come up with, you end up in a folder with a bunch of files in it, and then the *parcticular* file that you want is somewhere in that folder--sitting right in front of you, along with all of the other files that ended up in the same group, and the question at that point is how long it's going to take you to recognise the one among all the others.

Different types of organisation-schemes basically try to minimise this problem by reducing the number of not-easily-distinguishable items that you have to deal with at any given time.

The obvious approach is to try splitting-up large, flat collections into smaller collections and adding levels of indirection when a given tier has too many items in it to be maneagable, so the first task is to figure out what `too many items to be manageable'--and you want to avoid splitting things up too much beyond the point of `small enough to manage', because there's really a trade-off going on: in order to reduce the complexity at each particular level of your structured collection of stuff, you add some navigational complexity.

So, while that will help you tackle the `number' part of the problem, there actually be some interesting (and useful!) work done toward figuring out ways of making larger collections more manageable without splitting them (i.e.: tackling the `not-easily-distinguishable' part); one of the more notable ones is a scheme for fixing the homogeneity of file-icons--because it's significantly easier to recognise a thing when it actually appears distinct from its surroundings; J.P. Lewis et al. published an essay on this, a while back, called "VisualIDs: Automatic Distinctive Icons for Desktop Interfaces", and included the results of their user testing; Lewis has a website (with the title, "VisualIDs: Scenery for Data Worlds") that's worth looking at:

http://scribblethink.org/Work/VisualIDs/visualids.html

There's even a link, at the bottom of that page, to a reference implementation--and even patches to integrate VisualIDs into Nautilus.

--
-rozzin.

This is a good question! by Anonymous Coward · 2011-02-14 09:13 · Score: 0

All of comments I read put the focus on hardware and folder tree. I simply use the default directory tree of MacOS X ([user]/[folder by type of datas]), the rest comes naturally and I retrieve them through finder or spotlight.

The most interesting thing is how works storage and backup of important files... in fact all datas are backup on my Dropbox account on the cloud.
At least, the next step should be to store all our datas in a personal space on the cloud, more reliable and more accessible.

What do you thing of such a possibility?

organizing your stuff using the OS and basic tools by Anonymous Coward · 2011-02-14 10:56 · Score: 0

This posting inspired me to blog about the topic:

http://scienceblogs.com/gregladen/2011/02/how_to_organize_your_stuff.php#c3298223

Simple Solution: by Anonymous Coward · 2011-02-14 11:04 · Score: 0

Two categories (Pr0n and not-Pr0n)

Re:Music Genres for me == Colors (e.g. Red, Blue,. by Keen+Anthony · 2011-02-14 15:26 · Score: 1

Nickelback goes in my "Canadian Bands that aren't Rush, Loverboy, or Killer Dwarfs" folder. Extra points to anyone who knows who Killer Dwarfs are.

I haven't thought about organizing by color. Does red mean hot and blue mean cool? If so, are your red songs big hits and your blue songs sleepers? I listen to my music on MiniDisc, as such, I'm still basically making mix tapes in 2011. I want to organize my songs in a way that reflects mood, but I don't want to abandon my genre->performer->album format because I find that's still best for casual browsing.

The Music Genome Project's work is really interesting to me, and I would like to take advantage of genetic attributes in my own music searches because "hard rock" isn't a very worthwhile tag. Hard rock can mean everything from Deep Purple to Van Halen to Poison to Evanescence. That's a lot of difference. What is better for my needs are genome attributes like "Feel" and "Roots". I would benefit from odd custom attributes like "miami vice" and "belongs on an '80s sci-fi/horror movie soundtrack"

Does anyone know of a project to produce a quality audio player that has built-in search and allows you to add custom tags? Does FLAC and APE allow custom tags?

In the road... by wolftone · 2011-02-14 23:13 · Score: 1

We do it in the road. I mean, really, no one will be watching us.

Slashdot Mirror

File Organization — How Do You Do It In 2011?

356 comments