Slashdot Mirror


Ask Slashdot: Software To Organise a Heterogeneous Mix of Files?

BertieBaggio writes "I am a medical student at the end of an academic year trying to get my notes organised. I'm looking for a software document organisation system to organise a mix of text notes, journal articles, diagrams and scans. Ideally such a system would permit full-text and metadata search, multiple categorisations (eg tags), preserve the underlying files and be cross-platform (Linux/Windows/OS X). While I'm not averse to paying for such a complex solution, ideally the software would be FOSS so that extension or migration are possible if necessary. Desktop search (eg Google Desktop) probably does 90% of what I want apart from multiple categorisations, which is the feature I'm most interested in. Searching turned up a similar question over at 43folders which pointed me in the direction of Papers and DevonThink, but these are OS X only and seem to be aimed more at academic paper organisation. What recommendations does the Slashdot community have for categorising and organising a heterogeneous mix of files?"

254 comments

  1. Quick Answer by Anonymous Coward · · Score: 5, Informative

    Zotero - is awesome - Firefox plugin

    1. Re:Quick Answer by sl4shd0rk · · Score: 1, Funny

      +1 informative but I have no mod points :(

      --
      Join the Slashcott! Feb 10 thru Feb 17!
    2. Re:Quick Answer by Anonymous Coward · · Score: 0

      +1 Zotero the best ;)

    3. Re:Quick Answer by marklandm · · Score: 1

      I agree that Zotero is awesome. I just wish it wasn't tied so closely to Firefox.

    4. Re:Quick Answer by Anonymous Coward · · Score: 0

      Zotero is very very slow in indexing citation, especially if the style is ordered by number ala Nature journal. If you add a citation, wait 5 minutes to get all your 90 page document gets updated.

    5. Re:Quick Answer by dm42 · · Score: 2

      I really like Zotero for web research - and if everything comes from the web, it can be great. What I don't like is that, aside from Zotero, there's no real good way to access the files from other software (for example, opening Adobe Acrobat and searching through the tagged files in Zotero to find that PDF that I know I have). In order to access stuff, I need to go into Zotero as my file manager first. Then there's the backup issues, data migration issues, and what happens when my Zotero database is over 4 GIG and can't be written to a DVD anymore? Or, what happens if Zotero disappears? Or what about corrupted databases?

      Don't get me wrong, Zotero is an EXCELLENT research tool - I've used it and I like it in many respects. But if I'm thinking about LONG TERM storage, resiliance against corruption and future accessibility - I'm not sure it's the best tool for the job.

      What I would really like to see/have is a system that allows me to tag files within my filesystem either as I write out the file from my application (i.e., FILE-->Save FILENAME: Basefilename:tag1:tag2:tag3.ext ) or by renaming a file already within my filesystem.

      I USED to be in the tech sector as my livelihood and knew about some tools that might help... low and behold, with a combination of FUSE, PERL, and SQLITE, I was able to cobble together something for my purposes... It may or may not be what you're looking for. It's basically a tagging overlay filesystem writen in Perl using FUSE and SQLITE. The difference between this and others (it reuses some code and concepts from StratusFS) is that it is hierarchical. All of your files are not dumped into one big directory, they are tagged within the file system hierarchy underneath.

      Since I'm no longer in the tech sector, my dev time is limited and it certainly isn't a "PRODUCT" in the conventional sense. Rather, it's one of those things that "works for me but your mileage may vary". If you're interested, you can find the project at http://code.google.com/p/htaggingolfs/ . I use it to organize a bunch of stuff. It ain't perfect but is a good start - or at least "proof of concept." If any dev types are interested in contributing... drop me a line.

    6. Re:Quick Answer by spectro · · Score: 2

      dude, you make some good points but DVDs have been obsolete for like 10 years. A 32Gb thumb drive goes for like 30 bucks these days.

      --
      HTML is obsolete. It's time for a new, simpler and richer markup language.
    7. Re:Quick Answer by mrmeval · · Score: 1

      What is the longest you've had a thumb drive last?

      --
      I'd go on a Vegan diet but the delivery time from Vega is too long. --brownkitty
    8. Re:Quick Answer by Anonymous Coward · · Score: 0

      Zotero is exactly what you need.

    9. Re:Quick Answer by c0lo · · Score: 1

      I agree that Zotero is awesome. I just wish it wasn't tied so closely to Firefox.

      Use it as a plugin in your word processor?

      --
      Questions raise, answers kill. Raise questions to stay alive.
    10. Re:Quick Answer by tobiah · · Score: 1

      That was my first thought. It is fantastic for citing references in a paper, and keeping your citations and papers organized. Seems a bit strange to be a plug-in, but it works and makes a lot of sense once you use it. Put a good bit of research into this awhile ago, and I haven't looked back.

      --
      "The ability to delude yourself may be an important survival tool" - Jane Wagner -
    11. Re:Quick Answer by tobiah · · Score: 1

      I haven't had that problem, but I also haven't produced a 90 page document with Zotero's help. Indexing 20 page documents has been acceptably fast (a few seconds).

      --
      "The ability to delude yourself may be an important survival tool" - Jane Wagner -
    12. Re:Quick Answer by johncandale · · Score: 1

      they last longer then a consumer burned dvd, besides you used have managed redundant hard drive backups.

    13. Re:Quick Answer by enoz · · Score: 1

      One dude's "obsolete" is another person's "cost-effective storage medium".

      Flash drives were (and still are) one of the more expensive storage mediums. At 30 bucks for a 32Gb drive today, that's like the price of DVDs 10 years ago. Today, that's 10x the cost of other media.

    14. Re:Quick Answer by tehcyder · · Score: 1

      dude, you make some good points but DVDs have been obsolete for like 10 years. A 32Gb thumb drive goes for like 30 bucks these days.

      But with a cheap DVD, you can just do a daily backup and file the DVD away. Unless you enjoy wasting money, you're not going to do the same with even a $10 thumb drive each day.

      --
      To have a right to do a thing is not at all the same as to be right in doing it
    15. Re:Quick Answer by mrmeval · · Score: 1

      Ok, I've never managed to keep one I've carried working more than a few months. I should design a case out of billet aluminum and cast it in with alumina filled epoxy. ;)

      Maybe three or four even.

      --
      I'd go on a Vegan diet but the delivery time from Vega is too long. --brownkitty
    16. Re:Quick Answer by marklandm · · Score: 1

      Use it as a plugin in your word processor?

      From the Zotero site:

      The word processor plugins are distributed as Firefox extensions, which provide integration support to Zotero and install the necessary components into the word processors. After you have installed a plugin from this page, Firefox will prompt you to install later versions automatically the same way that it does for Zotero.

  2. Evernote! by Anonymous Coward · · Score: 0

    Check out www.evernote.com It isn't FOSS but it is extendable and free up to a point. I use it everyday and love it.

    1. Re:Evernote! by Anonymous Coward · · Score: 0

      Count me as a second vote for evernote. I'm using it for light-duty stuff, so I can't comment on how well it works with a real load of data to sort through, but I've heard good things.

  3. In The Works by Anonymous Coward · · Score: 0

    This is very similar to something I am actively developing. I'm happy to see that someone else is interested.

    1. Re:In The Works by nomadic · · Score: 1

      A document management system? You have a lot of competition there.

    2. Re:In The Works by jd · · Score: 1

      In the Open Source world, DSpace is probably the document management system to beat.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    3. Re:In The Works by nomadic · · Score: 1

      Still, it's a little weird to suggest that this is some brand new type of technology.

    4. Re:In The Works by jd · · Score: 2

      Weird, certainly. You could sort, search and store documents in the way described using Gopher or WAIS long before HTML was even invented.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
  4. Zotero by Lexible · · Score: 1
    1. Re:Zotero by Anonymous Coward · · Score: 1

      >It lives right where you do your workâ"in the web browser itself.
      I do all my work in a whole slew of mysterious and incompatible applications.

    2. Re:Zotero by maxwell+demon · · Score: 2

      Indeed, I found Zotero extremely valuable to manage papers (especially because you can add them directly from the web site with a single click), but I don't use it for anything else (my LaTeX files are under version control and organized using traditional directories, my notes are mostly in Tomboy [except for those more complex which I do in LaTeX, and of course those I do on paper], my mails are on the mail server [which I don't access through the web interface if I don't need to], any self-written programs are of course also on disk and under version control, data produced by those programs also lives on disk with directory organization [but not under version control; a data file is not supposed to be changed after generation], ...)

      --
      The Tao of math: The numbers you can count are not the real numbers.
    3. Re:Zotero by BertieBaggio · · Score: 1

      You seem to have missed Zotero (it's up there in the comments somewhere) which is a FOSS plugin for firefox. It keeps an offline database, and for nearly any site (e.g. journals) you click one button on the URL bar and it downloads the citation including full pdf so you can read it whenever. It will also let you perform full text searches of your database, and can be configured to perform OCR on scanned documents. Best of all, it's trivial to make bibtex (or many other formats) bibliographies.

      I use that in combination with TiddlyWiki for personal typed notes not associated with a journal article/textbook, and Xournal for annotating documents and taking notes with my tablet computer. When annotating documents (textbooks, journal articles) just configure xournal as your pdf viewer and you'll be able to save every annotation you make. TiddlyWiki has a ton of plugins to do whatever you need, including a GTD (Getting Things Done -- it's a book) variant that's probably comparable to Emacs Org-mode, LaTeX math (I wrote that one -- use it every day), and many more.

      The one drawback to all this is that I have no way to automatically organize my handwritten notes from xournal. Though they're computer files, my organization for them is horrendous. I still fantasize about some kind of hybrid mutant of TiddlyWiki, OCR (that can magically read my handwriting and equations), and xournal that would let me do all this on a pen-based tablet...

      You're right, I didn't include Zotero in that comment! Having looked at it before, my impression of Zotero was that it is a (very competent) reference manager / source collection. Is it more than this? TiddlyWiki sounds familiar too - this is a subject I've dallied with before, so perhaps I came across it previously :-). I'll have another look at Zotero and TiddlyWiki and see if I can get it to do the categorisation thing I'm looking for that I mentioned in my other comment. Cheers!

      --
      If all you have is a grenade, pretty soon every problem looks like a foxhole -- MightyYar
    4. Re:Zotero by mcelrath · · Score: 1

      Zotero does categorization. You have a familiar hierarchy of folders (topics). A single reference can appear in multiple folders. (It's your organization, not the filesystem -- each thing appears only once on disk). It does not automatically create categories for you, however. But frankly, I don't think any software could do that in a way I would agree with... You can further attach arbitrary files and notes to things you put in your Zotero database.

      --
      1^2=1; (-1)^2=1; 1^2=(-1)^2; 1=-1; 1=0.
    5. Re:Zotero by BertieBaggio · · Score: 1

      That's useful to know, thanks, I'll look into Zotero further (if for no other reason that I'll probably end up using it as a reference manger!).

      --
      If all you have is a grenade, pretty soon every problem looks like a foxhole -- MightyYar
  5. From The Atlantic's James Fallows by wiredog · · Score: 1
    1. Re:From The Atlantic's James Fallows by TheTrooper · · Score: 1

      I'll second Personal Brain - its mind mapping software that allows uploads of files, etc. I find it a bit easier to get my 'stuff' into Brain than into Evernote ... but I do use both...

      --
      http://andreas.materns.com
    2. Re:From The Atlantic's James Fallows by ryscott · · Score: 1

      PersonalBrain is fantastic but the free version does have limitations - you can only attach URLs to a thought, no file attachments. In order to attach files to a thought you have to buy the program. The Core version ($150) allows you to attach 1 file to each thought, while the Pro version ($250) allows multiple file attachments per thought. Also, a paid version is required to search for keywords within web pages or file attachments.

  6. Gunna hate this BUT by TheRealQuestor · · Score: 1
    Sharepoint Services

    does what you need and then some.

    1. Re:Gunna hate this BUT by X0563511 · · Score: 1

      Sharepoint is utter crap if it cannot understand the file you are putting on it. Which incidentally would be anything not MS Office.

      --
      For large sets, this will be our guide even unto death, for the LORD will work for each type of data it is applied to...
    2. Re:Gunna hate this BUT by Anonymous Coward · · Score: 0

      I have been using onenote for things like this. It is like postit notes on steroids. Usually bundled with office...

    3. Re:Gunna hate this BUT by blair1q · · Score: 1

      I just threw up a little in my series of tubes.

    4. Re:Gunna hate this BUT by oh_my_080980980 · · Score: 2

      What specific file types can't you load? I have used Sharepoint 2007 since it rolled out and you can load any file type you want. There are no restrictions. We have non Office documents loaded in document libraries as well as Office libraries.

      Sharepoint would do what you are looking for but the advanced features will cost you.

    5. Re:Gunna hate this BUT by Anonymous Coward · · Score: 0

      Parent is a FUDster and has never actually used Sharepoint to make such a plainly false statement.

    6. Re:Gunna hate this BUT by TheRealQuestor · · Score: 1

      I'm guessing you have never even used Sharepoint. SP is probably the best package MS has ever made. Way underappreciated.

    7. Re:Gunna hate this BUT by Shatrat · · Score: 2

      But, it can't search any of those files.
      It has a search function, but it's almost completely useless. I can even put the exact file name I'm looking for and it won't even be in the top 10 results.
      The only advantage Sharepoint has over a simple shared file directory is some crude revision control and the ability to create calendars.

      --
      09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0
    8. Re:Gunna hate this BUT by rwa2 · · Score: 2

      Heh, the only way I've ever been able to tolerate having to use any version of Sharepoint is to open a document library in IE, and then click on some dropdown to change it to explorer view, and then create and right click on a folder and select explore in a new window. Then it opens up in File Explorer, where I bookmark/favorite it so I don't have to deal with the atrocious "information blackhole" Sharepoint web interface, and I can easily drag / drop / delete entire folders using the File Explorer interface, and the URLs I send to coworkers are a lot more sane-looking and consistent. (At least in older versions, Sharepoint URLs would seldom get the user to where they wanted to go (way to break the internet there!), leading to long entertaining prose as people attempted to describe how to "navigate" to some random place in Sharepoint.)

      And maybe the search works better now, but I often couldn't find files amidst all the junk that shows up, even if I knew and specified their filenames.

      Makes it much easier to use a local revision control thing too, I've lost work a few times trying to use Sharepoint's revision tracker doodad.

    9. Re:Gunna hate this BUT by jd · · Score: 1

      Sharepoint is hardly FOSS, though. If you want that kind of solution, DSpace is a more logical starting point.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    10. Re:Gunna hate this BUT by Anonymous Coward · · Score: 1

      Lotus Notes client...does all this and then some more. You don't need the server, you can put all of those files into a document library database (template is provided), and enable full-text search.

    11. Re:Gunna hate this BUT by Anonymous Coward · · Score: 4, Interesting

      As a SharePoint admin for three years, I can definitely say, without any kind of reservation, that it is utter crap.

      Now don't get me wrong, the idea of SharePoint is great. But it is badly designed (the users can't find any document they need) and badly implemented (loosing data is unacceptable).

      If you need a document management system, I advice anyone to use SharePoint for a few months and then switch to another system. You'll appreciate your new system so much more that way...

    12. Re:Gunna hate this BUT by LWATCDR · · Score: 1

      How about OWL? http://owl.anytimecomm.com/
      Yes you need to install a webserver on your system but it really does work very well.

      --
      See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
    13. Re:Gunna hate this BUT by jd2112 · · Score: 4, Interesting

      I'm working on a sharepoint add-on to allow paper documents to be processed in the same manner as electronic documents. In it's current form it's a paper shredder with a Sharepoint logo taped to it.

      --
      Any insufficiently advanced magic is indistinguishable from technology.
    14. Re:Gunna hate this BUT by mbenzi · · Score: 2

      I am also a Sharepoint server admin and I would never recommend it to anyone.

      As has been said already, it has a lot of really good ideas, all executed terribly. Search is so important, yet Sharepoint is very bad it. Yes you can drag and drop a whole hierarchy of files to add them to sharepoint, but woe to you if one of those files has a name that Sharepoint does not like http://blogs.msdn.com/b/joelo/archive/2007/06/27/file-name-length-size-and-invalid-character-restrictions-and-recommendations.aspx

    15. Re:Gunna hate this BUT by YrWrstNtmr · · Score: 2

      Interesting. We've run a pretty large SP farm (100,000+ users, 7M+ items in the search index) for about the same time, and i can't recall it actually losing any documents/items, without direct deletion by a user. Ever.
      Can't find something? Poor design of sites/libraries/lists.

      For a simple at home Sp implementation, a couple of doc libraries with appropriate metadata would allow for easy searching.

      Oh, and SP Foundation can run on Win7. Doesn't strictly need Server2008.
      If you're well tied into the whole MS ecosystem, then SharePoint may be the way to go. If not....try somewhere else.

    16. Re:Gunna hate this BUT by Chelloveck · · Score: 1

      Losing data!? How can you say that? Sharepoint comes from the company that brought us Visual Source Safe, after all!

      --
      Chelloveck
      I give up on debugging. From now on, SIGSEGV is a feature.
    17. Re:Gunna hate this BUT by kcitren · · Score: 1

      What do you think of the new MS FAST search additions?

    18. Re:Gunna hate this BUT by Anonymous Coward · · Score: 0

      I've done some consulting work on various SharePoint 2007 and 2010 projects. If you can spare the time to setup a couple of sensible views and put a bit of forethought into how you want to organise your information SharePoint can be decent. However, as others have mentioned, a badly designed structure of Sites/Libraries and not putting some time into customising your views can make it worse than useless.

      I also agree the default web interface is pretty painful and if you are trying to organise a large number of documents for a medium to large scale enterprise there are definite annoyances for many users.

      If anyone is interested the company I work for has made a file explorer style treeview which makes navigating SharePoint much more manageable.

    19. Re:Gunna hate this BUT by Anonymous Coward · · Score: 0

      You have obviously never used it. It can search inside PDF and in fact on any document type that has an IFilter registered (on the sharepoint server computer, obviously).
      Adobe provides the IFilter for free and it installs it when you install adobe reader and such.
      A trivial google search for "sharepoint pdf search" tells you about the IFilter and such. Apparently the only utter crap is your ability to solve issues.

    20. Re:Gunna hate this BUT by YrWrstNtmr · · Score: 1

      Heh, the only way I've ever been able to tolerate having to use any version of Sharepoint is to open a document library in IE, and then click on some dropdown to change it to explorer view, and then create and right click on a folder and select explore in a new window. Then it opens up in File Explorer, where I bookmark/favorite it so I don't have to deal with the atrocious "information blackhole" Sharepoint web interface, and I can easily drag / drop / delete entire folders using the File Explorer interface, and the URLs I send to coworkers are a lot more sane-looking and consistent.

      And that is the precise wrong way to use SP. The concept of 'folder tree within a SP document library' is fuckin useless in SP. Hard to navigate and locate stuff. Metadata (extra columns) on the doc libs, and various views will get you what you want, easier and faster.

    21. Re:Gunna hate this BUT by tehcyder · · Score: 1

      Lotus Notes client...does all this and then some more. You don't need the server, you can put all of those files into a document library database (template is provided), and enable full-text search.

      I think you missed the slashdot memo which forbids any positive mention of Lotus Notes. Ever.

      --
      To have a right to do a thing is not at all the same as to be right in doing it
    22. Re:Gunna hate this BUT by Anonymous Coward · · Score: 0

      As the administrator thats more like your fault... Managing the index metadata and tags are your responsibility. And Sharepoint runs on sql or if you have configured blob on the filesystem... don't really know where you can "loose" data there...
      And with the right file definition (almost every filetype used in business have one) you can search documents on contentlevel...

      The downside to this solution is just that its seems to be to much work for one person^^

    23. Re:Gunna hate this BUT by X0563511 · · Score: 1

      We use it for all our internal documents.

      Yea, it's great for MS Office documents. I can load up a spreadsheet and look at it, check it out for editing, check in etc - that works fine! I love that!

      But try to do anything else? Sharepoint doesn't know what a PDF is - it treats it like a binary blob and can do nothing special with it. Other filetypes (not images, text files, or Office documents) are the same.

      Can't use Firefox in FIPS mode, because the IIS server throws a hissy fit. Can't do much in any non-IE browser at all, actually. Admittedly the first issue is likely the fault of our sharepoint admin, but the latter is the direct fault of MS using ActiveX and saying "screw everyone else" ... Does it still sound like I've never used it before?

      --
      For large sets, this will be our guide even unto death, for the LORD will work for each type of data it is applied to...
  7. A blog maybe? by Anonymous Coward · · Score: 0

    You can post whatever you want in a blog, you can also search the blog for whatever you need.

    I have a wordpress blog, I can post pictures to it, as well as videos.

    Wordpress is both free and opensource, you can host it yourself, or get a free blog.

  8. I have it! by Anonymous Coward · · Score: 0

    I have a heterogeneous mix in my pants...just sayin'...

    1. Re:I have it! by Anonymous Coward · · Score: 0

      We're on to you GW

  9. OS X - Plain old search by FormOfActionBanana · · Score: 2, Interesting

    I have a Mac and it's not the greatest OS, but I love the search. I search all my old emails and a horde of other documents all the time.

    I'm sure other computers can do this just fine, but I was never satisfied with a desktop search implementation until OS X. And I used to be a search index consultant.

    --
    Take off every 'sig' !!
    1. Re:OS X - Plain old search by tibit · · Score: 1, Informative

      I second that. You can add metadata (Spotlight comments) to any file, and that metadata is searchable, so if you're thinking of tagging files, that's easy.

      --
      A successful API design takes a mixture of software design and pedagogy.
    2. Re:OS X - Plain old search by FormOfActionBanana · · Score: 1

      What gets me is how often you see people complaining about how "bad" Spotlight is, and how they really missed the boat. I don't get this at all... I search on my computer the same way I search on Google, and find the stuff I want. I couldn't be happier.

      Now, compare that with an absolute crap search implementation like Ebay, and grrr...

      --
      Take off every 'sig' !!
    3. Re:OS X - Plain old search by Anonymous Coward · · Score: 0

      Third that. Spotlight is how search should work everywhere.

    4. Re:OS X - Plain old search by AHuxley · · Score: 1

      Spotlight was bad on slow systems as it indexed. With better ssd's and updates it seems to be not as bad now.

      --
      Domestic spying is now "Benign Information Gathering"
    5. Re:OS X - Plain old search by angel'o'sphere · · Score: 1

      On a Mac you can also use smart folders (folders which have no content but a search querry).

      --
      Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
  10. or a homogeneous mix by Anonymous Coward · · Score: 0

    or a homogeneous mix of files for your gay porn

  11. an age old solution by Akatosh · · Score: 3

    Sounds like you're describing a directory tree. Search with grep, or any similar utility. Put files in multiple categories (appropriately named directories) using ln. It's cross platform, timeless, and seems to do what you describe. I feel like I'm missing something though.

    1. Re:an age old solution by oh_my_080980980 · · Score: 4, Insightful

      Because that's not a proper document management solution. Directories/Folders are not a substitute for documents tagged with meta data. Not too mention you can't create views.

    2. Re:an age old solution by c0d3g33k · · Score: 1

      No, but Directories/Folders as low level storage (the files have to exist somewhere) and a layer on top of that which can provide metadata tagging, searching, views etc. is. KDE4 with Nepomuk (http://nepomuk.kde.org/) seems to provide much of what the OP asked for.

    3. Re:an age old solution by Hatta · · Score: 2

      Directories/Folders are not a substitute for documents tagged with meta data

      Why not? It works for me. It's pretty easy to have a script parse, e.g., your MP3s ID3 tags and link them to the appropriate directory.

      Not too mention you can't create views

      That's what 'find' is for.

      --
      Give me Classic Slashdot or give me death!
    4. Re:an age old solution by Anonymous Coward · · Score: 0

      The problem is indexing. Using "grep", how long is it going to take to find a word buried somewhere in a collection of 1000000 text files, each 1MB in size? Using a proper solution, the answer would probably be on the order of half a second.

    5. Re:an age old solution by drooling-dog · · Score: 1

      Because that's not a proper document management solution.

      Nevertheless, it can work quite well, depending on your needs. I manage large numbers of files that way, with separate index files handling the metadata. Grep is very fast for simple keyword and pattern searches, even on index files with ~100,000 records, so there's no reason even to use a DBMS (I don't need complex queries). Scripting for special purposes is simple and straightforward, and backups are a breeze, using nothing but rsync. No special-purpose software needed, beyond what comes with the OS.

      To each their own, of course. I recently talked to a guy who runs a MySQL server just to manage a couple hundred phone numbers. Nothing at all wrong with that, especially if you're learning about databases and SQL that way!

    6. Re:an age old solution by jedidiah · · Score: 1

      The file manager may not be great but if it isn't up to the task then that really needs that the file manager needs an update.

      All sorts of sometimes mutually exclusive bits of software that try to organize one bit of information or the other really doesn't cut it.

      If you have a good "primary key" that's generally useful most of the time, you can "get by" even with just a simple file manger. In truth, most n00bs probably aren't industrious enough to really make a good multi-key approach work. It would require far too much data entry and most users are really lazy.

      --
      A Pirate and a Puritan look the same on a balance sheet.
    7. Re:an age old solution by Anonymous Coward · · Score: 1

      No, but Directories/Folders as low level storage (the files have to exist somewhere) and a layer on top of that which can provide metadata tagging, searching, views etc. is. KDE4 with Nepomuk (http://nepomuk.kde.org/) seems to provide much of what the OP asked for.

      The very best thing about Nepomuk is that it can be disabled without having to modify source code.

      The very worst thing about Nepomuk is that kdepimlibs won't build without it. Not only do you have to build kdepimlibs with it, you must build kdelibs with it too. So if you don't actually want this, you get to enable support for something you're never going to use. For a while you could disable "semantic desktop" entirely in KDE and still build it, then it got harder and harder to do that. Now it's not really possible unless you want to make your own fork of KDE and maintain it. Thank you, upstream, for all these options!

      Here's a tip for all would-be developers. If you want to implement some new idea (actually a rehash of very old "desktop search" ideas that never gained traction for a good reason) and add it to a system lots of people already enjoy as-is, make it completely optional. That way if people want it, they will obtain and use it. If it's popular make it a default even, but make it still optional so people who don't follow the crowd can cleanly remove it.

      Forcing someone to support a feature they don't want and can't build the system without anymore is just asinine. Especially when everything else is so minutely configurable. It starts looking like that "We know what's best for you" bullshit that should remain confined to politics.

    8. Re:an age old solution by Anonymous Coward · · Score: 0

      In and of itself, no, but backup is easy and the data DOES exist somewhere -- plus, all the file manipulation tools exist and almost all applications can write to a filesystem. The problem you describe is what led me to write a FUSE Overlay Filesystem that adds tagging to filenames and a way to search them. I'm not a coder and no longer work in the tech sector, so it's one of those FOSS projects that carries the standard disclaimers, "it works for me, but your mileage may vary."

      More info and repository at: http://code.google.com/p/htaggingolfs/

      Anyone who wants to contribute feel free to contact me. The only downside I've found is that certain linux file managers do funky things with a rename - so really I've only found that dolphin works properly allowing you to add tags by simply renaming the file.

      If you find it useful, drop me a line!

    9. Re:an age old solution by Anonymous Coward · · Score: 0

      Trees are just crippled graphs, and only make things harder. Softlinks help a bit. But without reliable directory hardlinking, it's still a mess.

      I, for one, am in the process of developing such a (backwards-compatible via FUSE) graph data system for the exact reason of organizing my data.
      In there, file/directory names, links, tags, categories, or whatever you want to call them, all become relations. You know, associations, just like our brain works. Which gives me strong confidence in it being the right way. And about a billion years more timeless than primitivist crippled tree structures that require search utilities. ;)
      (In a full graph, walking the graph *IS* entering the query terms.)

      Sadly, until I'm done, I know for a fact that there is no acceptable data storage solution on this planet. (I've researched them all. Not a single one fits all the requirements. not even the ones who say they have full ontology support.)

    10. Re:an age old solution by LanMan04 · · Score: 1

      ln -s

      Directory for each meta data tag. Directory for each view.

      Directory FOR EVERYTHING!

      I kid, I kid.

      --
      With the first link, the chain is forged.
    11. Re:an age old solution by TheoMurpse · · Score: 0

      Yeah, hang on and let me put my 3GB movie file in the "sci-fi" "movie" "Peter Jackson" and "closeted-hobbits" folders and take up 12GB of space!

    12. Re:an age old solution by Anonymous Coward · · Score: 0

      Are you dumb or just a troll?
      He said "using ln". That creates hard links in the filesystem, not a copy of the data. It'll take up a few bytes at most to store the extra inodes pointers.

    13. Re:an age old solution by Anonymous Coward · · Score: 0

      Use dotted notation for the file names. Meta-tags are unreliable and dependent on some crappy software that isn't available everywhere.
      category.subcategory.subject.author.title.ext
      Also works wonders for organizing a website.
      Any type of software is still going to require you to remember key points about the data/docs in able to find it. So why bother? Sharepoint is a great example of this conundrum.

  12. Evernote by Anonymous Coward · · Score: 0

    Try Evernote. They have a free or premium service. You can attach pretty much any document type. Everything is searchable, including hand written notes. You can organize your notes using notebooks and tags.

    1. Re:Evernote by Anonymous Coward · · Score: 0

      Evernote gets my vote. I purchased the annual so no worries about space. The Optical Character Recognition (OCR) has been outstanding. I took a photo of a roadside marker, and dang if Evernote didn't have the whole text indexed by the time I got home. Evernote saves to the cloud, no backup worries. You can forward a message with .pdf or .doc file attached and it gets indexed. You can set up folders, share selectively part of your stuff. I send credit card statements to a finance folder. I setup a owners manual folder for the literature on all the gadgets and gizmos I've bought. My budget will include the annual fee (?$45) for the rest of my years and my backup / search worries are over, forever.

    2. Re:Evernote by RickRack · · Score: 1

      Evernote gets my vote. I purchased the annual so no worries about space. The Optical Character Recognition (OCR) has been outstanding. I took a photo of a roadside marker, and dang if Evernote didn't have the whole text indexed by the time I got home. Evernote saves to the cloud, no backup worries. You can forward a message with .pdf or .doc file attached and it gets indexed. You can set up folders, share selectively part of your stuff. I send credit card statements to a finance folder. I setup a owners manual folder for the literature on all the gadgets and gizmos I've bought. My budget will include the annual fee (?$45) for the rest of my years and my backup / search worries are over, forever.

      So I logged in after being tagged 'anonymous coward'. There, That was RickRack talking about Evernote.

    3. Re:Evernote by Ardeaem · · Score: 1

      Evernote saves to the cloud, no backup worries.

      You are foolish if you trust your data to the "cloud". Please understand: THE CLOUD IS NOT, BY ITSELF, A REPLACEMENT FOR BACKUPS.

    4. Re:Evernote by EricTheGreen · · Score: 1

      Everything in your Evernote account is also replicated to your machine, in a set of files you can include in a local backup if you wish.

    5. Re:Evernote by Anonymous Coward · · Score: 0

      I hope you like getting fucked up the ass in prison, HIPAA bitch.

    6. Re:Evernote by mrtwice99 · · Score: 1

      I use Evernote, and so do a lot of my med students. It is cross platform, the free version is quite functional and stores PDFs, rich text and graphics. It is searchable and shareable.

      Evernote does not work on Linux, so "multi-platform" might be more correct than "cross-platform."

    7. Re:Evernote by gobland · · Score: 1

      Evernote does not work on Linux, so "multi-platform" might be more correct than "cross-platform."

      There is an unofficial Evernote client for Linux called "Nevernote" (http://nevernote.sourceforge.net/). Haven't tried it, but apparently it's pretty faithful.

  13. thebrain dot com by superwiz · · Score: 1

    I love that program. You can get it from www.thebrain.com. It may sound like sarcasm, but it isn't. It's allowed me to organize a myriad of loosely-related information many times. I even bought the full version with my own money ($250).

    --
    Any guest worker system is indistinguishable from indentured servitude.
  14. spectate swap desktop search (SSDS) by Anonymous Coward · · Score: 0

    say no more
    http://forums.thedailywtf.com/forums/t/13917.aspx

    1. Re:spectate swap desktop search (SSDS) by tibit · · Score: 1

      Oh, yes, this. Brings back fond memories of a troll extraordinaire.

      --
      A successful API design takes a mixture of software design and pedagogy.
  15. Emacs Org-Mode by he-sk · · Score: 3, Interesting

    Emacs Org-Mode. I've learned a little Emacs syntax just to use that package after I've being a Vim user for over 15 years.

    --
    Free Manning, jail Obama.
    1. Re:Emacs Org-Mode by complex_pi · · Score: 2

      Emacs Org-Mode. I've learned a little Emacs syntax just to use that package after I've being a Vim user for over 15 years.

      A bit more: Org-mode allows to define text documents with smart headings and lists. You can insert links, equations, store file attached to a heading. It is cross-platform and you can export your documents to, among other options, html or latex-pdf. You can flags items as TODO or attribute a "done" time or a "todo" time.

    2. Re:Emacs Org-Mode by he-sk · · Score: 2

      Org-Mode files are plain text, but there is extensive support for inline tables/spreadsheets and images and even code blocks. For example, you can keep data from an experiment in a table, do some analysis for which one would normally use Excel, and then plot and display a graph based on the data in one document. That takes care of a lot of files right there, because many tasks can be incorporated into the org file organically.

      Another great feature is the Org-Agenda which defaults do displaying date-based information (what is due or scheduled for today), but can be used to create filters (stored searches if you will) across any kind of data in all your org files.

      I guess it takes a certain hacker mentality, but the great thing about Org-Mode is that it allows you to organize your files and system organically to exactly suit your needs as you discover them. Of course, the major disadvantage is that you have to buy into using Emacs.

      --
      Free Manning, jail Obama.
  16. Does a good one actually exist? by Jane+Q.+Public · · Score: 2

    My sister has worked for various doctors over the years, and a doctor's office is a prime candidate for something that will organize information: forms, papers, x-rays (photos), scanned documents, etc.

    Many times she has spoken to me of the failures of the information-organization software that they have tried. Some would reach a certain capacity and choke, others had terrible OCR, and so on. In fact she even asked me about building a better application to do that; but I was too busy trying to put food on the table to take on a large project that probably would not pay off for a couple of years at least.

    If there is an application that does this well, I would like to know about it too. (One person has already mentioned Evernote, but that's a "cloud-based" application for some unknown reason, and I would have privacy concerns.)

    1. Re:Does a good one actually exist? by Anonymous Coward · · Score: 0

      OCR is ALWAYS going to be crap. The best output I've ever seen is from tesseract, and it's still at least 20% not readable on average (varying significantly on input quality and typeface, of course.)

      If anybody is relying on OCR to generate metadata for them, they are in for a rude awakening.

      A perfectly laid typeset page will OCR with usually an acceptable number of errors, but as soon as you start looking at photocopies, scanned images, forms, charts... yeah. Good luck separating signal from noise. Assuming there is ANY signal. Even among typeset pages, or documents printed to PDF, there can be significant issues. Typefaces with serifs? Small font size? Hope you like "rn == m" and "ri == n" and a host of other problems.

      So reading between the lines, what your sister's employers are doing is apparently putting garbage in one end, and expecting gold bricks from the other. You can build an expert system smart enough to figure it out most of the time, if the input is a well-defined set of information, but in a way that's more dangerous, because a mistaken classification there means you have misclassified data THAT YOU DON'T KNOW ABOUT.

      Now the capacity problems are a whole other issue. That's just crap software. But nobody - and I mean nobody (currently) - can OCR arbitrary documents and sort them based on the results of the OCR alone. You will always need a human to double check every document, in which case they might just as well be hand-sorting them anyway.

    2. Re:Does a good one actually exist? by mls · · Score: 1

      (One person has already mentioned Evernote, but that's a "cloud-based" application for some unknown reason, and I would have privacy concerns.)

      Evernote notebooks don't have to be synchronized to the cloud, you can have local-only notebooks if you have sensitive data. Just make sure to back it up; I use their script engine to perform regular dumps to a network backup location.

      --
      -mls
    3. Re:Does a good one actually exist? by Anonymous Coward · · Score: 0

      There's a bunch of them, collectively known as Practice Management Software or PMS. These are Australian products, but I'm sure there are a heap of American ones around too.

      Best Practice - http://www.bpsoftware.com.au/
      Medical Director - http://www.hcn.com.au/

  17. Alfresco by Anonymous Coward · · Score: 0

    Alfresco

    1. Re:Alfresco by burisch_research · · Score: 1

      [citation-needed]

      --
      char*f="char*f=%c%s%c;main(){printf(f,34,f,34);}";main(){printf(f,34,f,34);}
  18. Google Docs by Anonymous Coward · · Score: 0

    I can't access the site from work (to see if metadata tags can be added to a document), but it does support most doc types that you would probably use and searching is crazy fast.

  19. Look at FreeMind for next semester by Anonymous Coward · · Score: 0

    This is not necessarily going to help for this semester, but look at FreeMind - http://freemind.sourceforge.net/wiki/index.php/Main_Page.

  20. Project Xanadu ? by alexhs · · Score: 0

    What about Project Xanadu by Ted Nelson ?
    Demo here.

    (And in case you wonder, no, you're not supposed to take that advice seriously.)

    --
    I have discovered a truly marvelous proof of killer sig, which this margin is too narrow to contain.
    1. Re:Project Xanadu ? by jd · · Score: 1

      It's ok to take seriously. It'll be done by 2150, when the Daleks will invade the Earth. (The Daleks are too smart to invade a disorganized planet.)

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
  21. Org-Mode by Anonymous Coward · · Score: 0

    People adapt Org-Mode to manage writing papers, running businesses, organizing files, etc., and it has an active user and development community.

  22. Re:Organise? by icebraining · · Score: 1

    There's nothing wrong with "organise."

  23. Re:Organise? by quink · · Score: 1

    Learn that spelling varies between different countries, and you may find maturity.

  24. Not FOSS, but free by jandersen · · Score: 1

    Have checked out Oracle Text? As I understand it, it is now a standard part of an Oracle database, and it can index text documents - according to rumours, it should be able to index not just words as they occur in the documents, but also their "meaning", whatever that means, and it should understand several doc formats. I haven't used it myself, though.

    You can download it for free for development purposes - get the enterprise edition for your OS plus the very, very (VERY!) comprehensive documentation, and install; now you just need a handy front-end :-)

    1. Re:Not FOSS, but free by jandersen · · Score: 1

      now you just need a handy front-end :-)

      - unless you enjoy the subtle beauty of the SQLplus command line.

    2. Re:Not FOSS, but free by SocPres · · Score: 1

      - unless you enjoy the subtle beauty of the SQLplus command line.

      I do, but for those wierdos who don't, the free-to-use Oracle Application Express ("APEX") works surprisingly very very well.

      APEX requires install in an Oracle database. Happily, the free-to-use Oracle Express ("XE") edition is in beta for v11.2 (should be GA any day now), which moves the maximum user storage to 11GB, up from the 4GB restriction in the current 10.2 version.

      Oracle XE: http://www.oracle.com/technetwork/database/express-edition/index.html
      Oracle APEX: http://apex.oracle.com/

      Be productive. Have fun. Drink beer.

  25. Microsoft Sharepoint by oh_my_080980980 · · Score: 1

    Hate to say it but it works. I have used it for several years and based on your requirements it would be a perfect fit. It can store several types of documents in document libraries tagged with meta data. You can create views on the meta data with the document libraries. You can perform full text and meta data searches. It has out of box workflows and can create complex workflows with developer tools.

    It has alot of what you would need. It also is not cheap. Sharepoint Services or Sharepoint Foundation is free (part of the server software) but the advance features will require licenses.

    1. Re:Microsoft Sharepoint by Anonymous Coward · · Score: 0

      How about Alfresco?

  26. I second Zotero by Anonymous Coward · · Score: 0

    Zotero will do everything you ask (tags, files, notes both associated with an independent from an item, various item types, files of varying formats associated with a parent item if desired, full-text search, cross-platform) and more- including static snapshots of web pages that can be made live on request (just right-click), easy transfer into a bibliography, producing reports and sharing among collaborators.

  27. Evernote by Anonymous Coward · · Score: 0

    Looks like what you're looking for

  28. Software To Organise a Heterogeneous Mix of Files by monoqlith · · Score: 1
  29. Practical Solution by Anonymous Coward · · Score: 0

    At the end of every academic year, I discarded all my personal notes, went to the library and spent some weeks studying good old fashion textbooks.

  30. File system by Dynetrekk · · Score: 1

    It is called a file system. You can put it on Dropbox, Jungledisk, or even better, use unison for synchronization. Use folders to group files together. Use filenames to remind yourself what the content is. Use file suffixes to show the file type. To search, use spotlight or mdfind on OS X, locate on linux, and... go kill yourself on windows (disclaimer: I've only tried the search function on XP and older windozes). Metadata works great with spotlight; I don't know any solutions to that on linux or windows, but someone else probably does.

    1. Re:File system by sirlark · · Score: 1

      Spotlight (specifically the metadata) is not portable/transferable. Quite frankly using grep/locate/find over a directory structure where files are symlinked into multiple categories is a real pain in the ass. POSIX extended attributes mean that the underlying file system is capable of storing and manipulating the metadata necessary, but the userspace tools suck!!

      I've pondered this question before, and voiced some of my thoughts on the issue here. In summary we would need three things:

      • Convenient userspace tool for specifying tags on an individual file. Could be done easily enough with a script wrapper around attr
      • Convenient userspace tool for querying tags on an individual file, Again, simple with a script wrapper around attr, but ideally we would also want to add a new option to 'ls' to display an additional column to containing files/dirs tags.
      • A way to query collections of files matching a binary expression of tags, e.g. patient_name && x-ray; And this is the tricky part. The ideal way to do it imho is to add some form of tag expression syntax to shell globbing patterns, and modify the standard C libraries accordingly. Then the current userspace utilities would automatically fall in line as far as I can tell. Anyone up for this job? It's not the coding that's hard, it's getting the community to agree to it...

      Of course that's just Linux, and possibly Mac is covered if HFS+ does POSIX extended attributes. Once again, windows user's are left out in the cold. However, it would be trivial to write an export script that would create a copy of a collection of files in a hierarchical structure using symlinks ;) One could even write a fuse filesystem that makes the underlying collection appear as a symlinked hierarchy, which would be a quick step to at least get GUI file managers etc to grok the idea, although ideally the various file managers would provided direct methods to query tags, filter by tags, and modify tags.

      I haven't tried thebrain (mentioned above) before, but I'm keen to give it a shot. I've certainly never seen or heard of anything else that does what I want, and what the OP has asked for. I'm especially curios to see how portable is? Can I for example, take my files from my linux laptop, and back them up to my NTFS external drie, preserving the organisation overlay provided by the brain?

    2. Re:File system by sirlark · · Score: 1

      I note that since Mac OS X 10.4 the HFS+ system has supported what look like extended attributes properly.

    3. Re:File system by Dynetrekk · · Score: 1

      You need to be modded up and start a project. I have been thinking along similar lines. I would think you could make a filesystem which would let you "tag browse"; i.e. as an alternative to browsing directories, one would browse tags instead. Probably you could even do it without making a new filesystem implementation. By the way, lots of metadata already exists (that I think are portable): PDFs can have their creator specified, EXIF data on JPEGs and so on. Just creating a general unix/linux/mac(/windows through cygwin?) variant of locate that would search metadata on the command line would be awesome (and could be the foundation of a GUI tool or two).

    4. Re:File system by dm42 · · Score: 1

      Sorry for the shameless plug, but I'm really hoping someone picks up on this idea...

      A Posix extended attribute solution would be optimal.

      I solved it using FUSE + PERL + SQLITE a while ago. More a proof of concept and a tool for myself than a "project" per se. I think it should be portable to other platforms that can use these technologies - and I'm pretty sure most of the majors (including Window$ can).

      In any event, take a look at HTaggingOLFS (http://code.google.com/p/htaggingolfs). "Hierarchical Tagging Overlay File System". It overlays the ability to tag files using a simple syntax as part of the file name. The tags that have been associated with files are treated as directories so drilling down with a logical AND is a matter of descending directories. Tag "virtual" directories are obvious because they begin with an @ character. Tags are separated from the main filename by using a colon (:). Thus, adding tags is as simple as naming the file FILENAME:tag1:tag2:tag3.extension the only potential issue in the current implementation is that the extension is assumed to be only a single field (i.e. in FILE.tar.gz, only the .gz would be considered the extension. It would appear as FILE.tar:tag1:tag2:tag3.gz).

      It'd be great if someone wanted to work more on it. I'm no longer in the tech world, but at least I've gotten it to the point where you can add tags as you write new files, change existing files to add tags, works in most tools including GUI file managers - although several of them have issues in their "rename" function that don't work properly (notably Nautilus and Thunar). Dolphin works, though.

    5. Re:File system by dm42 · · Score: 1

      Note: there's no reason it can't also be modularized to parse the file and look for metadata and add tagging for it as well in some format (assuming there is a perl parser for it)-- although that's beyond what I had needed at the time. Again, I'm no longer a techie in my work-a-day world and merely need/ed a tool for my own purposes which sound very much like the OP.

  31. Alfresco by Anonymous Coward · · Score: 0

    Maybe try Alfresco. It's java based, and you can have the source code if you ever need to make some modifications.

  32. OneNote by StikyPad · · Score: 1

    This is actually what OneNote -- the oft overlooked/maligned offering from MS -- is designed to do, and it does it pretty well believe it or not. Technically it's aimed at collaboration, but there's no reason it can't work equally well for self-organization.

    1. Re:OneNote by Fnord666 · · Score: 1

      This is actually what OneNote -- the oft overlooked/maligned offering from MS -- is designed to do, and it does it pretty well believe it or not. Technically it's aimed at collaboration, but there's no reason it can't work equally well for self-organization.

      Looks like that would be a good fit, except for the requirements that it be open source and cross platform that is.

      --
      'The tyrant will always find pretext for his tyranny.' - Aesop's Fables
    2. Re:OneNote by StikyPad · · Score: 1

      Right, except those are ideals, not requirements.

    3. Re:OneNote by jamesl · · Score: 1

      OneNote is available for Windows, Mac and iPhone. The iPhone version is curretly free.

      OneNote is part of Office which is available for a 60 day free trial.

      OneNote is the best thing since sliced bread for what this guy wants to do. It is especially useful if you need (want) sync'd data on two or more devices.

    4. Re:OneNote by BertieBaggio · · Score: 1

      Why not use OneNote with either a local folder, a personal server or even a cloud-based Skydrive? It will gobble up everything you have and allow you to search, organize, etc.

      Unfortunately OneNote seems to be Windows-only, and I'm spending ever-decreasing amounts of time using Windows. It's undoubtedly a good suggestion for everyone who does though :)

      --
      If all you have is a grenade, pretty soon every problem looks like a foxhole -- MightyYar
    5. Re:OneNote by shhtonghui · · Score: 1

      Get yourself a copy of Microsoft's OneNote. It is exactly what you want - easy to use, allows for searching within documents, and allows for mixed media storage - voice, documents, screen shots, handwriting, all mix freely. It has basic OCR and voice recognition skills that help it search through non-typed material, though sometimes it's a bit hit or miss here.

      See here http://www.shhtonghui.com/

    6. Re:OneNote by rcrath · · Score: 1

      onenote is great if all you ever need what you write in it for is onenote files. if you need to get it out of onenote you are looking at a badly formatted rtf file...you cant even get it into word neatly...or an even less portable mht file ... or what I did to reclaim my stuff, a half day of learning perl oneliners to convert it all to html. I loved onenote for getting stuff in, especially on the fly ocr, (take that, look inside the book feature), but it is worse than useless for actually writing something.

  33. Evernote by dmr001 · · Score: 2

    I use Evernote, and so do a lot of my med students. It is cross platform, the free version is quite functional and stores PDFs, rich text and graphics. It is searchable and shareable.

  34. Wow by Anonymous Coward · · Score: 0

    Before you go and organize all your documents with a management system, look into a spell-checker.

    1. Re:Wow by rbphilip · · Score: 1

      What did he spell incorrectly? You're not getting all twisted over the American/British spellings of organi(s|z)e, are you?

  35. Google had a solution.... by metalmaster · · Score: 1

    Im not sure what happened to it or why they stopped developing it, but Google Wave was an awesome tool for doing just this. It was on the net, so it was cross compatible. It handled a wide array of file formats. It was searchable and it had a collaboration element that held on to revisions quite nicely. The API was open so others could develop their own solutions. Its a shame it went the way of the buggy

    1. Re:Google had a solution.... by burisch_research · · Score: 1

      It was ahead of its time. It's gone the way of the personal flying car, not the buggy ...

      --
      char*f="char*f=%c%s%c;main(){printf(f,34,f,34);}";main(){printf(f,34,f,34);}
  36. Homophobe by Anonymous Coward · · Score: 1

    Why does it have to be a heterogeneous mix of files?

  37. Mendeley Desktop by burningcpu · · Score: 2

    I use Mendeley Desktop for this purpose. It integrates well with Microsoft Word, and provides easy citations and reference organization. It is FOSS, and works under Windows and Linux. http://www.mendeley.com/ It also has an Iphone app, but I've never used it, so I can't vouch for its usefullness.

    1. Re:Mendeley Desktop by Anonymous Coward · · Score: 0

      I'll also chime in that Mendeley is awesome and should work wonderfully. It certainly handles my 'collection' of over 1500 journals articles beautifully!

  38. OpenKM by bill_mcgonigle · · Score: 2

    OpenKM will do most of what you want. We've deployed it for clients who have been happy with it. It does not preserve the underlying filesystem, but you can upload a ZIP file of documents.

    It's a tomcat app - that used to be heavy-duty - if it is today depends on what kind of machine you're using.

    --
    My God, it's Full of Source!
    OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
  39. What about the by Anonymous Coward · · Score: 0

    trash can?

  40. A system to organize files by Hatta · · Score: 1

    Would be a file system. Run something like Beagle for full text/metadata search. Use hard links to keep a single file under multiple folders.

    --
    Give me Classic Slashdot or give me death!
  41. T-Ball Anyone? by Anonymous Coward · · Score: 0

    Seems like the individual who posted the question and the first poster are orchestrating a marketing coup on Slashdot for Zotero

    1. Re:T-Ball Anyone? by mspohr · · Score: 2

      Zotero is free. Where is the profit in marketing that?

      --
      I don't read your sig. Why are you reading mine?
  42. Re:Software To Organise a Heterogeneous Mix of Fil by monoqlith · · Score: 1

    Also, to elaborate on your specific requirement for "multiple categorisations", and so that I might save myself from a "smart-ass" mod, here's a possible suggestion: http://www.tagsistant.net/ It's a tagging file system. You didn't specify which operating system you were on, but this works with Linux/BSD. Not quite mature, but I could see it potentially going places. At the very least, the idea of implementing tags directly in the filesystem might trickle up to extfs or NTFS or hfs+ eventually.

  43. Google Desktop and full-text indexing by gstr · · Score: 1

    Hello, As far as I know Google Desktop doesn't allow full-text indexing of files. Is there a desktop search engine that would allow doing full-indexing of file?

  44. Knowledge Tree by Anonymous Coward · · Score: 0

    I used it for my business a while back. Should have most of what you need. However, as this is medical data, have you checked on HIPAA compliancy requirements for whoever does that for your organization?

    http://www.knowledgetree.org/Main_Page

  45. Google Docs doesn't suck by dnut · · Score: 1

    Google Docs works OK for something like this. Can add as many categories as you like to each docs for easy sorting. Can add descriptions, etc for metadata search ...and of course content search works well for known file types.

  46. Re:Organise? by Anonymous Coward · · Score: 0

    There's nothing wrong with "organise."

    There's definitely something wrong with "feeding the trolls". Quit doing that, you fucks.

  47. EndNote by multimediavt · · Score: 1

    Check out EndNote (http://www.endnote.com/eninfo.asp), also. Your school may already have a site license like mine where it is free to all students.

  48. Wiki by Anonymous Coward · · Score: 0

    The general problem of how to organize arbitrary information for convenient retrieval is fairly hard. Ask any library scientist. All approaches require you to either impose some organizing principle on the information or be prepared to exhaustively search it.

    (1) If the information is inherently hierarchical, you can develop a taxonomy for it and organize it according to that taxonomy. (2) If it's classifiable but not uniquely classifiable, you can tag it. (3) If it can be meaningfully interrelated in a mesh, then you can hyperlink it. A Wiki does all three, plus it can search.

    1. Re:Wiki by Nutria · · Score: 1

      library scientist

      Eh????

      --
      "I don't know, therefore Aliens" Wafflebox1
    2. Re:Wiki by Elendil · · Score: 1

      library scientist

      Eh????

      In case it wasn't sarcasm but genuine surprise, yes, there are such beasts: http://en.wikipedia.org/wiki/Library_science

    3. Re:Wiki by Nutria · · Score: 1

      In case it wasn't sarcasm but genuine surprise

      Having been to College and seen it in the the course book, it was sarcasm.

      Still doesn't explain what the Science is in Library Science.

      --
      "I don't know, therefore Aliens" Wafflebox1
  49. Bibdesk by qvatch · · Score: 1

    OSX only again, but does what you want, including handling all files.

  50. Tim Berners Lee... by biodata · · Score: 1

    invented the web for this didn't he? How about putting your files on a webserver with something like a Lucene index?

    --
    Korma: Good
  51. Hide-ads on slashdot no longer there? by burisch_research · · Score: 0

    [Off topic] (ok so not really so off-topic, because this is slashdot)

    As a long-time user of slashdot (shaddap you 6-digit UID freaks), I've become accustomed to the 'disable ads' checkbox. Now that's suddenly disappeared. Why? Also, these days I have to enable ALL scripts (FireFox 4, NoScript) in order to make comments or perform moderation. Why?! What's going on, slashdot? What happened to the promise that no existing functionality would ever be taken away to be then given to subscribers only?

    What's going on, slashdot?

    --
    char*f="char*f=%c%s%c;main(){printf(f,34,f,34);}";main(){printf(f,34,f,34);}
    1. Re:Hide-ads on slashdot no longer there? by Anonymous Coward · · Score: 0

      your karma went down?

    2. Re:Hide-ads on slashdot no longer there? by burisch_research · · Score: 1

      No actually it went from 'good' to 'excellent'.

      --
      char*f="char*f=%c%s%c;main(){printf(f,34,f,34);}";main(){printf(f,34,f,34);}
    3. Re:Hide-ads on slashdot no longer there? by rbphilip · · Score: 1

      Still there for me. Not that I need it with Adblock Plus, but still..

    4. Re:Hide-ads on slashdot no longer there? by johncandale · · Score: 1

      BAAAWWWW Everything should be free. Everything should run on antique equipment. Bawwwww

  52. KTDMS by Anonymous Coward · · Score: 0

    Opensource/pay version available

    http://www.knowledgetree.com/

  53. Re:Organise? by jd · · Score: 1

    Learn how to spell in English, not just American.

    --
    It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
  54. a good one does exist... by Anonymous Coward · · Score: 0

    Emacs: org-mode is the best choice, and if emacs' control-key powers give you pause, don't forget there is a menu-bar and org mode utilizes it heavily... so you don't need to remember every command key combination, you just pull down the ORG menu list and there it is, baby... YEAH!

  55. Sounds like... by Anonymous Coward · · Score: 0

    ...Google Docs might do the trick for you? I use Docs to organize notes/diagrams/scans/... with multiple tags in a group (the entire Computer Sciences class contributes). This can potentially save a lot of time if you distribute the effort across multiple people; nudge, nudge, wink, wink.

  56. They used to be by NEDHead · · Score: 1

    called secretaries, now they want to be known as administrative assistants.....

  57. I'm old school. by gregarican · · Score: 1

    I use VisiCalc because I'm down like that...

    1. Re:I'm old school. by Anonymous Coward · · Score: 0

      I was going to suggest Windows Cardfile.

    2. Re:I'm old school. by Un+pobre+guey · · Score: 1

      Or just a card file.

  58. Springpad? by Anonymous Coward · · Score: 0

    Springpad is also an option. I use it on PC, Mac, and Android phones.

  59. We're looking at ResourceSpace by lostpuppy · · Score: 1

    We're looking at ResourceSapace http://www.resourcespace.org/ "Free and Open Source Digital Asset Management". Web based and multi-user. Works with images, video, pdfs, .doc and other types of files. So far it looks like it could be pretty handy.

  60. Evernote is your answer by Anonymous Coward · · Score: 0

    Evernote is what you need.

    -Full text searchable.
    -mobile apps to tie into it.
    -Plugins that will automatically sync folder(s) on your PC to your evernote account.
    -and more

  61. Tagged Frog by denpun · · Score: 1

    http://lunarfrog.com/taggedfrog/ Does not do full text searches but can use the tags to categorize and also the favorites function. Its light and easy on resources....

  62. OP here by BertieBaggio · · Score: 3, Interesting

    Many thanks for all the informative replies so far. I've had a quick glance at Evernote, thebrain, Nepomuk (I'm loving KDE4 so far after switching a week or so ago), OpenKM and FreeMind and these seem promising. I've still to look at emacs' org-mode, and when I do I will try to put my vi prejudices aside ;-) Some of the other suggestions are rather good but aren't really what I'm looking for as they are either fully cloud-based (eg Google Docs, Wave) or one platform only (eg Sharepoint) or too expensive (hire a secretary :-P).

    I like the idea of some of the "roll your own" ideas, eg directories + hard links, serving from a web server or wiki. The problem is as I progress though the medical degree, I am likely to have decreasing amounts of time to tinker with things if they have shortcomings; and to be honest they probably will as I am unlikely to have thought through the problem fully! Plus third-party solutions will definitely have substantially more polish than anything hastily dreamt up by me!

    A shared wiki for my cohort / medical school / country may be an option on top of whatever comes out of this discussion, but I'd like something personal as... ah, let's say medical students have wildly varying standards of what is acceptable for notes ;-)

    My supplementary questions for anyone still wanting to chip in:

    • Do Evernote and thebrain/personalbrain have an "offline" mode? That is, can I keep things nicely organised locally instead of having to upload to the cloud? Evernote seems to suggest this is the case but at a very cursory perusal both seem to stop short of actually saying "you can use this to organise the files on your hard drive".
    • Do any other docs / medical students want to chip in and say what they use? I suspect each solution will have practical considerations that may affect my decision or not. Sharing practical experience is very useful, even if it's just "I use Evernote and find it useful" as some have said above.
    • Similarly, does anyone have experience of Nepomuk? Any drawbacks?

    Thanks again for the helpful replies, Slashdot. You continue to impress me - I doubt I would have gotten such a useful variety of responses elsewhere. I hope this discussion is useful to other folk looking for something similar.

    --
    If all you have is a grenade, pretty soon every problem looks like a foxhole -- MightyYar
    1. Re:OP here by Anonymous Coward · · Score: 0

      With Evernote, you can keep as much data as you like on your personal computer, offline. It is only the online notebooks that are limited in size.

      I get good results using Evernote to organize my data on my PC, and using Spideroak to back it up to the cloud and sync it between my two computers. Only for data that I might want to access from someone else's computer do I use the online Evernote storage space.

    2. Re:OP here by supercrisp · · Score: 3, Interesting

      I am a researcher. I want to add my vote for "file system." The less interaction you do with most of this material, the better off you'll be. For me, important or useful material goes into a reference manager. Those files get tagged in the reference manager. At this point in my career--only four years in--that's just under 600 articles with accompanying pages of notes. Other stuff goes into folders based on broad categories. I don't do any tagging on these because find-by-content always does the job just fine. Avoid the extra work. You're not paid to be a secretary. And most of the organizing won't pay off, will become an end in itself.

    3. Re:OP here by mcelrath · · Score: 1

      You seem to have missed Zotero (it's up there in the comments somewhere) which is a FOSS plugin for firefox. It keeps an offline database, and for nearly any site (e.g. journals) you click one button on the URL bar and it downloads the citation including full pdf so you can read it whenever. It will also let you perform full text searches of your database, and can be configured to perform OCR on scanned documents. Best of all, it's trivial to make bibtex (or many other formats) bibliographies.

      I use that in combination with TiddlyWiki for personal typed notes not associated with a journal article/textbook, and Xournal for annotating documents and taking notes with my tablet computer. When annotating documents (textbooks, journal articles) just configure xournal as your pdf viewer and you'll be able to save every annotation you make. TiddlyWiki has a ton of plugins to do whatever you need, including a GTD (Getting Things Done -- it's a book) variant that's probably comparable to Emacs Org-mode, LaTeX math (I wrote that one -- use it every day), and many more.

      The one drawback to all this is that I have no way to automatically organize my handwritten notes from xournal. Though they're computer files, my organization for them is horrendous. I still fantasize about some kind of hybrid mutant of TiddlyWiki, OCR (that can magically read my handwriting and equations), and xournal that would let me do all this on a pen-based tablet...

      --
      1^2=1; (-1)^2=1; 1^2=(-1)^2; 1=-1; 1=0.
    4. Re:OP here by Anonymous Coward · · Score: 0

      How about a wiki (specifically MediaWiki) - provides categorization. Add Semantic Media Wiki (SMW) extension, and you get searchable tags and other metadata. Add Semantic Result Formats, and now you can do all sorts of stuff, from faceted navigation, to charting, to tag clouds and lists.

      And MediaWiki runs on any platform that supports PHP + web server - let's see - that's all major platforms and OSes.

      No matter what solution you choose, there is going to be a learning curve (somewhere).

      MW + SMW + SRF is fairly easy to set up. Adding your documents may be time-consuming, but without Natural Language Processing or similar, you'll have to provide the metadata yourself, and even with NLP, you still probably would.

      Filesystems obviously will work - to an extent. Metadata searching is harder, as is tagging (not all filesystems have support for such capabilities).

    5. Re:OP here by halfnerd · · Score: 1

      http://www.branchable.com/ (or acutally ikiwiki) is a wiki that you can work on offline. And there's vim-outliner if you don't like emacs orgmode.

    6. Re:OP here by Peganthyrus · · Score: 1

      Offline mode is one of the things Evernote uses to distinguish the paid version from the free version

      --
      egypt urnash minimal art.
    7. Re:OP here by ffflala · · Score: 1

      Why are you excluding fully cloud-based solutions? If the only reason is because you want local backup, I think you should reconsider using google docs. It immediately popped into my mind reading your description. If you're concerned about losing your material in the cloud, it's a trivial matter to create a local backup of all your docs. Fwiw, I just finished law school, and wound up using google docs to organize everything I had --text notes, journal articles, diagrams and scanned documents, even a handful of audio clips. It permits full-text & metadata search, tags, preserved the underlying files, and is cross-platform. If you have other reasons why you don't want a fully cloud-based solution, please describe them so we can better understand (and hopefully address) them.

    8. Re:OP here by Anonymous Coward · · Score: 0

      It won't do everything but for general note organization I've yet to find anything better than Wikidpad (http://wikidpad.sourceforge.net/).

      Its kind of hard to explain why but I can safely say that medicine would be a lot less fun without it.

    9. Re:OP here by Anonymous Coward · · Score: 0

      Quick note...

      Use your web browser. If everything you have is categorized in folder/directory structure, point your browser to your 'file:///' location and navigate as appropos. File associations are likely tagged to the appropriate app, so clicking and launching is your only task.

      Better yet, create a profile just for that content. If you get real creative, create some simple html/css interfacing at the top of the directory heirarchy.

      If your repository starts getting fairly large, say 400+ files, you may want to consider a light web server and some database indexing for speed and performance improvements.

    10. Re:OP here by Anonymous Coward · · Score: 0

      +1 on that: I'm another researcher (well, postgrad research student), using the bare file system. In addition to the reasons you listed, I'll add one more: I don't have to worry about installing specialised software on my laptop or home desktop, or persuading IT support at either of the institutions I work at to install it for me.

    11. Re:OP here by ccabanne · · Score: 1

      I use org-mode for notes and highly recommend it; here is a talk about it from the creator: http://www.youtube.com/watch?v=oJTwQvgfgMM

    12. Re:OP here by Anonymous Coward · · Score: 0

      EverNote (notice the capitial N) did save files in a local database before they moved to become a Cloud based service and became Evernote (which was a sad day in my eyes).

      EverNote v2 is a wonderful program that had XML style sheets so that you could create your own forms for capturing specific data and formatting it in a way that was meaningful to you (it had a good community support behind that create all sorts of XML forms). Along with the ability to recognize writing in pictures, diagramming and meta tagging system it was the best software in its class (and still is as far as I am concerned). If you can find anyone willing to let go their copy of it - then grab it, I think you will like it.

    13. Re:OP here by Anonymous Coward · · Score: 0

      Evernote has an "offline" mode, in that a copy of your data is downloaded to your device (PC, Mac, iPad, iPhone, Android, ...) when you sync from that device. However, it's not a set of files - it's Evernote's proprietary format that can only really be accessed by the Evernote client.

      Evernote won't touch your original files - it uploads a copy (in some format) to its cloud, indexes the content there, then you can use the Evernote client on your own box to search for it. You can delete your original files from your hard disk, and Evernote will retain its copy - to get rid of it altogether, you'd have to explicitly delete the copy via the Evernote client.

      I'm very happy with Evernote - I keep all sorts of documents, PDFs, text notes, graphic images, etc. in there and it's all easy to search through. You can tag stuff as you upload it, or after the fact. You can "clip" content straight from Web pages using Chrome or Firefox, which I find particularly useful.

      Spend an hour or so with it, and I suspect you'll be pleasantly surprised at how useful it is.

    14. Re:OP here by mls · · Score: 1

      Do Evernote and thebrain/personalbrain have an "offline" mode? That is, can I keep things nicely organised locally instead of having to upload to the cloud? Evernote seems to suggest this is the case but at a very cursory perusal both seem to stop short of actually saying "you can use this to organise the files on your hard drive".

      You can have local only, as well as synchronized Evernote notebooks.
      The notes are search-able and files are embedded in the notes as attachments of a sort. It is not a file system organizer - it is more of a document database.

      --
      -mls
    15. Re:OP here by Anonymous Coward · · Score: 0

      Check out http://en.wikipedia.org/wiki/Solr and keep your files as you like them.

    16. Re:OP here by Anonymous Coward · · Score: 0

      Evernote definitely works offline on a Mac. The iPhone implementation is limited in how much it can store offline (mostly because the iPhone is itself small), but it's capable of doing a very efficient search of the rest of the info even over a 3G connection.

    17. Re:OP here by MeerCat · · Score: 1

      If you're interested I can drop you an invitation code to

          http://www.mysparebrain.com/

      (hit the button to request a pre-launch invite and then ping me on twitter or similar to identify yourself)

      It's free and runs in the browser without plugins (but you can export your data locally, an offline mode is fairly straightforward, just not a high priority at this point) - it's aimed more at when you move your files onto the web for storage, but it can handle local files within browser sandbox limitations.

      It mixes note-taking, bookmarking, annotation and tagging, reminders, and notifications, but then also features smart integration with other sources, so it knows about things like wikipedia and twitter and actively integrates with such sources via plugins (I'm toying with the idea of a Mendeley plugin).

      The aim is not so much to replace lots of other services, but to sit on top of them and give you a smart integration layer, so rather than building a static "bucket" as a repository, you can build something that is more active, and lets you get on with what you need to do with your resources rather than constantly going back and checking them.

      And I'm UK based so you may find some of the spellings familiar ("organise"!!)

      --
      I spent a lot of money on booze, birds and fast cars. The rest I just squandered. - George Best
    18. Re:OP here by Kashgarinn · · Score: 1

      I'd recommend Tiddlywiki, a wiki in a single html file you can keep on your desktop or on a USB stick or wherever.

    19. Re:OP here by dmr001 · · Score: 1

      Evernote stores notes locally when you use the Mac or PC versions; under Linux, you'd be limited to the browser version which uses the cloud. It will store notes locally on a smartphone if you use the paid version.

      In my experience, on the wards, when some beady-eyed attending starts pimping you, more students come up with more answers quickly if they've got a handheld (or an eidetic memory), since even though computers are prevalent, in teaching hospitals they tend to be busy and not necessarily accessible right outside of a patient's room. Now that we have computers in our clinic exam rooms I don't need to rely on my handheld as much, but that thing was indispensable during 7 years of med school and residency ward rounds. (It was a Palm back then, but those days are over.)

      Having a link to UpToDate on your device is also going to be helpful; if your hospital has wifi, it's likely free. And a Sanford Guide app would be nice, if they ever come out with one; until then you're stuck with the book. (Or the Johns Hopkins app, which I have not tried.)

      I agree that Sharepoint is a great idea with absolutely terrible implementation. At least in our environment, search is horrible. Evernote search is really amazingly good, using fulltext (even in PDF's and scans for crissakes) and multiple tags. While a flat file OS X database would probably work well, unless you're hauling your laptop everywhere (which you won't be) it's not going to be practical. The fact that you're considering suggests you may be a 2nd year. Are any 3rd or 4th years at your school bringing their laptops on the wards? Around here anyway, that thing would get stolen. (So would your iPad.) Some of those poor VA patients need to steal your laptop so they can eat when they get discharged, which might be a good thing; god knows they deserve it.

    20. Re:OP here by Anonymous Coward · · Score: 0

      PersonalBrain is primarily an offline tool with a new optional cloud service WebBrain. I keep links to files orsometimes whole files in it, but don,t feel I keep on top of the "gardening" of the resulting concept maps.

    21. Re:OP here by Anonymous Coward · · Score: 0

      Yes, Evernote has an "offline" mode. You can even have both on and offline "notebooks".

    22. Re:OP here by Anonymous Coward · · Score: 0

      Do Evernote and thebrain/personalbrain have an "offline" mode? That is, can I keep things nicely organised locally instead of having to upload to the cloud? Evernote seems to suggest this is the case but at a very cursory perusal both seem to stop short of actually saying "you can use this to organise the files on your hard drive".

      Yes, you are able to create an offline notebook in Evernote. You can store all documents in this offline notebook just as you would with the default notebook (stored remotely). I personally do this, and have not found any drawbacks. The offline notebook works just as well as the online notebook. Although, you can not set it as the default notebook. The default must be an online notebook.

      I've read that some doctors use the offline notebook in Evernote to store data. Apparently, the online notebook cannot be used due to HIPAA, but the offline notebook is appropriate.

    23. Re:OP here by angel'o'sphere · · Score: 1

      TheBrain usually is used offline. It just stores a single brain in a singel file.

      --
      Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
  63. Try out KnowledgeTree by Binestar · · Score: 1

    http://www.knowledgetree.org/Main_Page Works rather well and has a *LOT* of features. Perhaps a bit too much for what you're looking for?

    --
    Do you Gentoo!?
  64. "Papers" and other metadata thoughts by david.emery · · Score: 1

    A friend recommended this to me: http://www.mekentosj.com/ I've played with it a bit, and it's very academic-focused.

    There's probably a need for a more general metadata-integrated information management tool, that makes use of Mac OS X facilities for metadata definition and management. Do Linux and Windows 7 have similar OS level facilities to support metadata creation and management?

    A key consideration is the ability to store metadata in the file, rather alongside. Some EXIF tools support this for photographs and I think the various Office formats (Microsoft and otherwise) support metadata within the file.

  65. Re:Software To Organise a Heterogeneous Mix of Fil by hedwards · · Score: 1

    The OP specified cross platform compatibility. Linux/BSD is probably not going to be enough. However, I'd imagine that it would be easy enough to get it running under OSX. I'm not sure how easy it would be to get this running on Windows though.

    Personally, this looks like something I'm going to have to keep my eye on as my main OSes are Linux and FreeBSD.

  66. Tagsistant by Anonymous Coward · · Score: 1

    Tagsistant is a tags based (multiple categories) filesystem working on top of a traditional filesystem.

  67. ESRI Geoportal by Anonymous Coward · · Score: 1

    You might want to setup a Metadata server with descriptions of your documents. Think Library card catalog, basic equivalent to a Metadata server. The ESRI Geoportal will do everything you want and it is FOSS.

  68. Zotero is hamstrung by it's limits. by goombah99 · · Score: 1

    I agree that Zotero is awesome. I just wish it wasn't tied so closely to Firefox.

    there is an alpha stand alone version of zotero now but that's not what I really want. it's one that works will all browsers and shares a common database.

    But my major gripe with Zotero is that it does not work with Pages.app. The zotero folks seem to get inexplicably indignant when asked why they don't support footnotes in pages.app (other than the useless RTF conversion method). They say it's because there is no public API, but any doofus can examine the XML and it's obvious how to extend it from a few minutes contemplation. it would be perfectly simple to add new tags that contained the footnotes. I've actually test that for other reasons so I know it can be done from a syntactic point of view. It's the integration of that into Zotero that is beyond me. I wish they didn't have a Stallmanesque purity test for their extensions. After all they support MS Word.

    So i struggle on with the lameness of endnote. And zotero goes unused despite being a better approach.

    --
    Some drink at the fountain of knowledge. Others just gargle.
  69. Zotero is probably your best best by Anonymous Coward · · Score: 0

    I recently went through a similar search and decided that zotero was closest to what I wanted. If you're willing to drop the FOSS requirement, then you might consider Mendeley as well --- it's free as in beer. However, the fact that zotero is FOSS was important to me and probably should be to you, because I don't think you're going to find anything at this time that does everything you want. Zotero should be reasonably future-proof: it's FOSS, it's been around for several years now, it seems to have a large and active user community, and it's development seems to be continuing at a good pace and is supported by several heavyweight organizations (Andrew W Mellon Foundation, Alfred P Sloan Foundation, Institute of Museum and Library Services).

    For emacsians, there is some rudimentary integration of zotero with org-mode. You can find that at http://e6h.org/~egh/hg/zotero-plain/

    I would also strongly recommend the zotfile add-on at http://www.columbia.edu/~jpl2136/zotfile.html

  70. Wuala by aarongadberry · · Score: 1

    Wuala is a really great client for this, and also comes with other nice-to-haves you may not have considered such as sharing, security and previous version access.

    Linux / Windows / Mac / Android / iPhone / Web (Java Applet)
    Cloud storage
    Sync Locally / Backup jobs
    Secure
    -Encrypted and your PW never leaves your computer
    -Files are in cloud, unless synced, so logout and files are inaccessible
    -Files are chunked, meaning the assembled file does not exist anywhere
    Many ways to get more storage space (some free)
    Share files with friends, groups, or public web access
    Previous version access
    File tagging
    File comments
    File search
    No filesize limit
    Explorer integration

  71. the cloud by recharged95 · · Score: 1

    I've been evaluating Amazon Cloud Drive and it's very much like Google Docs but a bit more generic file-wise. That maybe an option for you.

  72. Heterogeneous Mix of Flies by Anonymous Coward · · Score: 0

    Personally I like the long yellow strips of sticky paper hanging from the ceiling.
    It's a bit old school I realize but highly effective.

    >

    Uhh... nevermind...

  73. OCR,pdf annotations,text/html,pdflatex + recoll by drolli · · Score: 2

    If you insist in keeping your notes (think before how often you will look at them, then the above tools may work for you.

    (recoll is based on xapian and works very well if you have big static archives which you don't need to index often.)

  74. Data Crow by Anonymous Coward · · Score: 0

    Try Data Crow:

    http://www.datacrow.net/

  75. Heh by Anonymous Coward · · Score: 0

    Now that is some quality humor, right there! ;-)

  76. blog it by Anonymous Coward · · Score: 0

    iv just recently wanted to do somthing similar. and im sure iv found one of the best ways of doing it. Start a blog, either private or public. Wonderfull way to organize, catagorize and tag either text pictures or links. Then either access it from the web or save an offline copy using httrack or something similar. Its also the most crossplatform way of doing it as any system, fone or device with a web browser would be able to view it.

  77. Shoebox 2.0 by peterofoz · · Score: 2
    For class notes, the shoebox is the perfect bucket sort container for paper you'll probably never reference again. Works especially well for utility bills, bank statements. Mark it by year and possibly by broad topic so you know if you have to shred the contents or just toss it.

    Save yourself - go on a date.

  78. OneNote by kenh · · Score: 1

    Why not use OneNote with either a local folder, a personal server or even a cloud-based Skydrive? It will gobble up everything you have and allow you to search, organize, etc.

    --
    Ken
  79. FreePlane - it meets your requirements by Anonymous Coward · · Score: 0

    Please try / consider FreePlane - it is a fork of FreeMind and contains many / better features.

    I use it for document management, searches etc. - everything you describe and more.

    My recommendation is to use the 1.25 Alpha version; I've not encountered anything more than trivial defects and created many large mindmaps in it.
    You can run formulas or even custom Groovy code in it.

    Paul

  80. Xena by spasm · · Score: 2

    You might also want to look at an xml normalizing tool like Xena - automagically converts all your docs, files, etc into open formats whose content can be searched by open tools.

  81. Finding specifics versus finding sets by BertieBaggio · · Score: 1

    I am a researcher. I want to add my vote for "file system." The less interaction you do with most of this material, the better off you'll be. For me, important or useful material goes into a reference manager. Those files get tagged in the reference manager. At this point in my career--only four years in--that's just under 600 articles with accompanying pages of notes. Other stuff goes into folders based on broad categories. I don't do any tagging on these because find-by-content always does the job just fine. Avoid the extra work. You're not paid to be a secretary. And most of the organizing won't pay off, will become an end in itself.

    Thanks, it's good to be aware of the payoff of pre-organising things (a la putting in frameworks in programs that will never need it) before embarking on something like this. You also reminded me of something I wanted to put in my follow up: The reason I wanted multiple categorisations or tags is that some things do fall under more than one category. I'm a medical student, so I'll give you an example: Malaria. Does that come under:

    1. Parasitic diseases
    2. Blood-borne diseases
    3. Diseases involving the liver
    4. Tropical medicine
    5. Block 3, where we first covered it?

    I'm not sure what your research area is, but you must have come across something similar. I've (very briefly) worked in the area of antioxidants, and even there it would have been useful to pull out say everything on copper chelators in rat models, clinical trials of antioxidants and so forth. Basically, I'm concerned that while a filesystem-based approach is good for retrieving one or a few results for a specific query (ie where was that paper by Doe et al from 2007 on the effects of foo in vivo), it is less good at including things that fall under multiple categories (eg malaria, heart disease). Symlinks go somewhat of a ways towards alleviating this but they are a significant increase in efforts versus tagging and I'm concerned about cross-platform symlink handling.

    Of course, you may ne right in tht I may be putting *too* much thought into this, but then again I'm a medical student and we worry about not knowing things fully ;-)

    (OT but ordered/unordered lists aren't showing up for me in preview. Weird.)

    --
    If all you have is a grenade, pretty soon every problem looks like a foxhole -- MightyYar
    1. Re:Finding specifics versus finding sets by massysett · · Score: 1

      (ie where was that paper by Doe et al from 2007 on the effects of foo in vivo)

      Don't you have a medical librarian for that sort of thing?

      I'm a lawyer and I never once have regretted tossing out any piece of paper that can easily be located in the library or on Westlaw. If you save all that stuff, you will spend more time saving it than you spend using it and, even then, it will be harder to find it when you need it than it would be if you just went back to the library.

      The most important knowledge winds up in your head or can be found in a few important reference works. For everything else, there are professionals who organize it and make it easy to find through databases and catalogs.

      To the extent that you DO need to keep things, do keep it very simple. Anything too complicated goes unused. In this vein, the filesystem works well. It's hard to forget how it works, it's easily backed up, and it doesn't get obsolete.

    2. Re:Finding specifics versus finding sets by colondee · · Score: 1

      I never once have regretted tossing out any piece of paper that can easily be located in the library or on Westlaw. If you save all that stuff, you will spend more time saving it than you spend using it and, even then, it will be harder to find it when you need it than it would be if you just went back to the library.

      The most important knowledge winds up in your head

      THIS.

      From the spelling of "organise" I guessed the OP is a Brit - confirmed from his website. I just finished 2nd year of medicine (=1st clinical) in London and like the OP (and unlike most UK students) did an undergrad degree first. I had exactly the same issue as the OP last year.

      The OP's post about his difficulty deciding how to categorise malaria doesn't really make sense to me. I decided on a filesystem structure by date supplemented by my brain, as I associate topics by when they were taught. So Malaria.pptx comes under Year 1 / Term 2 / Infectious diseases for me. If there are any papers, vidoes, etc. they go in the same folder with a sensible name. I do keep lots of papers, but I have barely had time to read most of them. What I do is summarise the abstract and citation into my own notes.

      As the parent says I have actually spent more time sorting out these files than reading them. I don't think I would ever need to read those papers again except for interest, and I would find which one I wanted by checking the citations in my notes. I haven't even touched last year's notes. I have maybe touched the Year 1 folder twice this year, and had no problem finding the (powerpoint) file I needed. Yes, malaria involves the liver, but I don't see how creating a folder and symlinking things like malaria and alcohol would have helped.

      Year 2 is a different matter. It's mostly hospital based and now medicine becomes about textbooks and your memory and experience, rather than keepnig lots of files. I don't think I would ever need to actually remember a named person's specific research findings (except for eponymous syndromes, but the original document is likely to be in German and of historical interest only). At the moment most of my Y2 notes have been handed down from various people, so I might have a file called Malaria.docx. I store this under M at the moment. I might also have files called List of liver conditions.docx or Causes of splenomegaly.docx which go under Abdominal. Again, while things like Parasitic diseases, Blood-borne diseases, Diseases involving the liver and Tropical medicine are good as tags, I'm not sure what files I would be tagging as these are actually titles of my documents. Look into MeSH if you really want to sort things this way.

      My Y2 notes are not really ideal - but as I said they were mostly handed down. Also, I hate word, yes I can convert them to ODT but I have decided to convert them to HTML over the summer. I'll need these notes for the next few years and so will my girlfriend. They are entirely text files anyway, and HTML loads really fast anywhere (phone, computer room, parents' house). The advantage of this is that when Malaria appears in under Causes of splenomegaly, it can be a link to my malaria page. Also, it gives me a chance to do stuff with PHP, python, ajax, etc. if I decide to put them online (which previous students have failed to do).

    3. Re:Finding specifics versus finding sets by BertieBaggio · · Score: 1

      Your investigations are quite correct, I'm studying up in Glasgow. I don't know how your university organises their course but we're strongly into PBL here, and all the implications that has. We don't fully get to the clinical stage until 4th and 5th year, so I'll need my notes for a while yet! We also don't cover things by system, so malaria may turn up as an example of a fever in the returned traveller, an example of a disease with often poorly-adhered to prophylactic treatment, and so forth; each visit to the subject adding more information. It's perhaps not the best example, but as you say there are other contexts where it would come up, and it wouldn't do to miss out on them. There are a lot of interlinked subjects, but perhaps that's calling out for a hypertext ( / wiki) solution rather than a multiple categorisation system.

      Yes, everything important should end up in my head, but it has to get there first ;-) I've actually also done an undergrad degree as well, so I'd like to incorporate the useful bits from that too. Basically, I'd prefer to be overprepared than underprepared!

      I appreciate you sharing your experiences, it's very useful to hear from another medic. Which London university are you at, if you don't mind my asking?

      --
      If all you have is a grenade, pretty soon every problem looks like a foxhole -- MightyYar
  82. If you're this close to the end of the term... by frank_adrian314159 · · Score: 1

    ... quit dicking around trying to find software and start studying. Use whatever break you have before your classes start again to get your organization system in shape. Frankly, you're about nine months late on this quest and your faith in computers and your own tagging abilities are, if not misguided, pathetically touching. Good luck with your medical career.

    --
    That is all.
    1. Re:If you're this close to the end of the term... by BertieBaggio · · Score: 1

      After the end of term actully, but I appreciate your concern :-) You do make a good point for anyone else reading now or in the future which I'd like to echo - don't leave organising things to the last minute, get organised as early as possible because it takes a long time to do properly, and gets increasingly time consuming and difficult the longer you leave it. Fortunately, I have about 4 months to get it done!

      --
      If all you have is a grenade, pretty soon every problem looks like a foxhole -- MightyYar
  83. ZOTERO by Anonymous Coward · · Score: 0

    ZOTERO will do all this for you and more. It has a Word plug in that will allow you to cite automatically in almost any style.

  84. Two projects to look at by dbosman · · Score: 2
  85. +1 iusethistoo by Peganthyrus · · Score: 1

    +1 iusethistoo

    Evernote has clients for OSX, IOS, Android, Blackberry, Windows Mobile, Palm, and the web. OCRs images you put in it. Syncs between your devices transparently. All clients are free but there's a max of like 60M/mo transfer; if you want to exceed that it's $45/year.

    I haven't put alllll my data in it and probably never will but for reference material and general notes? It's kinda where everything goes now.

    --
    egypt urnash minimal art.
  86. Foldera? by RalphTheWonderLlama · · Score: 1

    Anybody ever heard of Foldera? It sounded like it was going to be great (collaborative workspace software-as-a-service) until they never actually made a public release, oops :)

    --
    simple, fast homepage with your links: http://www.ngumbi.com/
  87. How to use Devonthink Pro by ThousandStars · · Score: 2
    I don't have a perfect answer for you, but I can tell you that I use Devonthink Pro as described here by Steven Berlin Johnson. In addition, I have a large "random" folder that consists mostly of snippets of text found in articles on the Internet.

    This isn't your ideal solution—as you've noted, DTP is currently OS X only—but it does work pretty well for me, especially when I'm thinking about a general topic and need to find information on it. I even wrote a post about the similarities between Joyce's method of composition / finding material and how Johnson uses DTP.

  88. Tracker by mugurel · · Score: 2

    Tracker ( http://projects.gnome.org/tracker/index.html ) is relevant to your needs. It stores relational information about your data. So for example, if you have an id3 tagged mp3 album, it will store the artist as an entity x, and the album as an entity y, and it will store that each of the mp3 files are a part of y, by x. In the same way, it stores authors and publishers as entities for pdf's. You can address this data store in the ordinary `desktop search' way (there are some search GUI's available), but more interestingly, you can use sparql to query the data store (allowing you for example to ask for all documents dated before 2000, by an author whose full name contains "Knuth").

    Tracker comes with some data miners that crawl your data (e.g. using full text-search, and meta-data extractors) to build up the data store, but in your case it might make sense to enter your organization of the data into the data store yourself (using RDF). This would allow you to use the Tracker infra structure to access your data afterwards.

    1. Re:Tracker by tenco · · Score: 1

      Is PDF support through poppler better now? I just remember indexing a bunch of PDF files with tracker but then got no results when searching for characteristic words in them. PDFlib TET PDF Ifilter 4.0b2 had no problems with indexing them properly AFAICS.

  89. FOSS content management system by Anonymous Coward · · Score: 0
  90. Shouldn't Nautilus be able to do this? by toby · · Score: 1

    I'm not saying it does *today* - but isn't this a really common use case for a typical Linux user - geek or not?

    --
    you had me at #!
    1. Re:Shouldn't Nautilus be able to do this? by Greg+Merchan · · Score: 1

      Yes, it should be able to. There's a lot of stuff the graphical shell should make easy, but those we have are going in other directions.

  91. The Brain by zooblethorpe · · Score: 1

    The reason I wanted multiple categorisations or tags is that some things do fall under more than one category. I'm a medical student, so I'll give you an example: Malaria. Does that come under:

    1. Parasitic diseases
    2. Blood-borne diseases
    3. Diseases involving the liver
    4. Tropical medicine
    5. Block 3, where we first covered it?

    I'm a linguist, not in medicine, but linguistics presents similar challenges of multiple categorization. One thing I've quite liked about The Brain is the ability to create both hierarchical (A contains B, C is contained by B) and sideways relationships (D is related to E, but I don't have to say how, or I can notate the link to describe the relation). These can also be redone on the fly, and this flexibility is wonderful when reorganizing things as you learn more about your problem space.

    The Brain also allows many-to-one or many-to-many relationships, something that I don't think FreeMind allows. Using your example of malaria, it would be quite easy to create multiple higher-level topics/nodes/ideas such as "parasitic diseases" / "tropical medicine" / etc. and then have the "malaria" node be a child of all of the relevant ones.

    A number of other mind mapping tools I've looked at in the past enforce a single-parent tree model that doesn't fit various kinds of semantic relationships very well, whereas the relations available in The Brain and the ability to choose any arbitrary node as the "root" node for viewing the model, or even the "root" node for overall organization, allows for a more organic web of association. I seem to recall that you can also tie two independent mind maps together, but I haven't tried that to date.

    NB: I haven't bought The Brain just yet, but the demo impressed me, and this discussion has reminded me, so I think I will later today...

    HTH,

    --
    "What in the name of Fats Waller is that?"
    "A four-foot prune."
  92. Alfresco by Anonymous Coward · · Score: 0

    http://www.alfresco.com/

  93. VUE by eleuthero · · Score: 1

    VUE - from http://vue.tufts.edu/ might be a helpful mix of directory tree and mindmap. It allows for content tagging and linking (local or on the Internet). Search by hierarchical (or not) ontologies is possible.

  94. mac os x lion by Anonymous Coward · · Score: 0

    and spread the word
    hahahaha.....

  95. OneNote by mcbiondi · · Score: 1

    Get yourself a copy of Microsoft's OneNote. It is exactly what you want - easy to use, allows for searching within documents, and allows for mixed media storage - voice, documents, screen shots, handwriting, all mix freely. It has basic OCR and voice recognition skills that help it search through non-typed material, though sometimes it's a bit hit or miss here.

  96. Database by Tablizer · · Score: 2

    Via RDBMS, I've done similar things with info I had to use extensively. I parsed the files, tracked the paths, classified sections (as best a machine could), added category tagging, word indexing, etc. I'm not called "Tablizer" without a reason.

    Note that it was for personal use, but could be extended to others with a fancier UI and less abbreviations.

    1. Re:Database by Tablizer · · Score: 1

      I forgot to mention the advantages of using a database:

      1. Plenty of existing RDBMS products to choose from, free and commercial. Commercial ones tend to have better GUI's.

      2. You can index and cross-reference on just about any field (isolated "item") or combo of.

      3. You don't have to hard-wire in any hierarchy or taxonomy up front and be stuck with that as your only navigation/layering choice. Relational gives you a lot of simultaneous data "navigation" options without changing the underlying model.

      4. Can use SQL, table browsers, and/or Query-By-Example tools to sift, sort, search, summarize, etc. without reinventing a lot of C.R.U.D., querying, and reporting wheels from scratch.

      5. You get multi-user and concurrency with little or no extra effort. (And potentially ACID.)

      6. Benefits I forgot that I'll add later.

  97. Bare Bones Yojimbo (not cross-platform) by Ayanami_Rei_II · · Score: 1

    I think it satisfies all your other requirements, though

  98. KT, Alfresco, OpenDocMan, various other DMSs by QuietRiot · · Score: 2

    You might consider exploring http://www.knowledgetree.com/ (commercial open source) or another DMS like the following:

    Alfresco - Open Source Enterprise Content Management (CMS) ... - alfresco.com
    OpenKM - powerful, easy to use, web-based scalable electronic ... - openkm.com
    Epiware - Document Management Solutions for Everyone. A powerful ... - epiware.com
    Document Management Software - Your Search for Document ... - ademero.com

    http://www.opendocman.com/ (PHP)

    http://en.wikipedia.org/wiki/Category:Document_management_systems

  99. Zoot on Windows by rhover · · Score: 2

    James Fallows from The Atlantic magazine writes periodically about these kinds of software programs, everything from the ancient Lotus Agenda to many of the programs mentioned above. One very powerful tool that appears to be missing from my skimming through the posts is Zoot! (only on Windows) - soon to be Zoot-XT in July: http://www.zootsoftware.com/index.html

    1. Re:Zoot on Windows by rhover · · Score: 1

      To save you all an extra click, I should have included a list of features from their About page (I have only tried Zoot a few times as I am mostly using Macs now.): Zoot XT is a next generation information processor. What's New in Zoot XT > The Zooter - The Zooter in Zoot XT lets you clip a web page and save it as a complete web page with both text and graphics. > E-mail client - Zoot XT lets you create an unlimited number of e-mail accounts. You can create them all in one database or across multiple databases. > Feed Reader - Add a RSS feed to any Zoot database or open the Feeds database and indulge in all the feeds you like. > Dropbox / SugarSync - Keep your databases synchronized across all of your computers effortlessly. You can synchronize all of your databases or select a subset of databases to synchronize. > Field Blaster - Field Blaster recognizes naturally occurring fields and adds them to your database automatically. And that's just the beginning. > Integrated Web Browser - Zoot XT has an integrated web browser so you can quickly find what you're looking for on the web without leaving town. > Cloud services - Zoot lets you post content to popular services like Facebook, Twitter, Blogger, Tumblr, and many more. > Speech Recognition - You can completely control Zoot XT and with voice commands using the Zoot XT Speech Pack and Windows Speech Recognition. > Color Themes - Zoot XT lets you customize the way the app looks from top to bottom. The default theme is Zorro and it's very dark and mysterious, but you have many other options.

  100. Dolphin by dotancohen · · Score: 2

    KDE's Dolphin file manager, coupled with Akonadi and Strigi (built-in, and seamlessly integrated) does everything that you are asking for. It runs best on Linux, but there are KDE Windows and Mac ports. Of course, that means that you must install all of KDE on that Winbox or Mac.

    Note that in the past there was much criticism of Akonadi due to resource usage, but that has been taken care of for at least two major version numbers (KDE 4.5 and 4.6). Let us know how it works out for you, you are really going to enjoy the tools that KDE and Dolphin offer for file management and organization.

    --
    It is dangerous to be right when the government is wrong.
  101. Compendium ? by Anonymous Coward · · Score: 1

    I've only started playing with it, so not sure it would suit your needs, but it is free and cross platform. From their site -
    "About Compendium
    Many people use Compendium to manage their personal digital information resources, since you can drag+drop in any document, website, email, image, etc, organise them visually, and then connect ideas, arguments and decisions to these. Compendium thus becomes the 'glue' that allows you to pool and make sense of disparate material that would otherwise remain fragmented in different software applications. You can assign your own keyword 'tags' to these elements (icons), create your own palettes of icons that have special meanings, overlay maps on top of background images, and place/edit a given icon in many different places at once: things don't always fit neatly into just one box in real life."

    http://compendium.open.ac.uk/institute/about.htm

  102. something like dmoz? by hendrikboom · · Score: 2

    Have a look at www.dmoz.org, the so-called Open Directory. Not that I suggest you post your notes on that site, but it is a good example how a directories-and-links homebrew solution could work. You could even have the actual notes sitting outside the index tree.

    What I'd like on top of this is a mechanism to track files as they get moved around in the file system, which does happen occasionally. Also to keep track of their copyright and confidentiality status, so I can avoid releasing that which shouldn't be.

  103. Wikipedia, "reference management software" by OldHawk777 · · Score: 2
    --
    Unaccountable leaders are masters, and unrepresented people are slaves. How do US and EU fare?
  104. Nepomuk + Zeitgeist by Anonymous Coward · · Score: 0

    Nepomuk + Zeitgeist
    It's libre software and it's being integrated into the KDE stuff pretty well.

  105. Storing/Indexing/Tagging vs. Searching/Finding by TheLoneGundam · · Score: 2

    I've only had time to skim a lot of these comments, so forgive any redundancy, but one of the first questions to ask is: Will you really spend time tagging or organizing things as you add them? Many people think they will, but then don't follow through. Perhaps make sure you have full-text indexing and search - it is costly to implement at times, but might automate some of the work of getting the stuff ready to search.

  106. Two key presses by Anonymous Coward · · Score: 0

    Ctrl-A (Select All) -> DEL (Delete)

  107. tiddlwiki fiddlyiki by rcrath · · Score: 1

    for the personal wiki thing I use my version of tiddlywiki, fiddlywiki. ssynced with dropbox. downside is no wysiwyg yet but it is html underneath so it is cross platform, easy to get stuff out . tags work great and google desktop ppicks it up so it is searchable, unlike zotero which i also use just not so much for content any more. http://way.net/FiddlyWiki

  108. wikidpad is the answer by Anonymous Coward · · Score: 0

    Wikidpad is what I use, it worked well for me in both ubuntu and xp.
    You use wiki style syntax to create and cross link pages in your local db.
    Drag and drop support to embed just media in your wiki is good, even in linux. Combos of mod keys while dragging let you link a file relative to root of wiki folder, copy the file to wiki folder and link relatively, or just link absolutely.
    Plugins for Mimetex/Latex, gnuplot, and others.
    Pages can have a tag, be in a tab, and contain python scripts that get executed on key combo.
    Oh, and you can embed python anywhere
    -qe2eqe