Slashdot Mirror


Document Management For Research With Annotation?

msimm writes "I'm currently looking for a document management system for personal and research-related use. Having looked at Alfresco and KnowledgeTree along with a slew of similar open source document management systems they seem to have a common set of features including version control, archiving, document permission/ownership and search/indexing. What I'd like, in order to help me manage my own continually growing collection of pdf/doc/odf/rtf/txt files, would be something that allowed me to view and annotate documents (and possibly collaborate/share notes) without requiring me to download, edit and re-upload each document. Obviously there are plenty of capable document management systems out there, so I really suspect I've simply missed something and am hoping someone can point me to a better way to index, search, collaborate and keep and share notes on the ever increasing glut of useful information I seem to use and collect."

28 of 122 comments

  1. Just buy EndNote? by Anonymous Coward · · Score: 2, Informative

    Nothing much more to say here... I have found EndNote very useful.

  2. mediawiki by Anonymous Coward · · Score: 4, Informative

    If you want a low-tech approach, just install a wiki. MediaWiki is full-featured, while MoinMoin is easy to install and configure (no separate database needed). I haven't used any others.

    1. Re:mediawiki by Anonymous Coward · · Score: 2, Interesting

      This implies a painfully manual process of copy-pasting quotes, references, etc. While a wiki system would be a good repository, a front-end system is needed to open the documents, select the text and attach a comment to it.

    2. Re:mediawiki by caffeinemessiah · · Score: 5, Informative

      Try Mendeley. They're still pretty new, but very promising, with a desktop client for Linux/Mac/Win in addition to the web interface. They also sync perfectly with Zotero and CiteULike, which makes migration easier. You can annotate PDFs directly in the desktop client, but I think only the latest beta build supports syncing the annotations across multiple computers. I'm hopeful for them -- it's definitely one of the most promising reference manager systems I've seen (oh yes, they also support BibTeX, EndNote and RefWorks formats heavily).

      --
      An old-timer with old-timey ideas.
  3. 'Collaborate' Implies ... by eldavojohn · · Score: 4, Informative

    Collaborate, in my opinion, implies that there is some advanced messaging going on in the background. And the persistence of that messaging (whether on a centralized server or via some P2P/Client routing protocol) is not only complex but often needs to be specific to what you want to collaborate about. Let's look at annotations. Where are they stored? How am I notified if you add an annotation to my document? How do I track my annotations? How do I share my annotations? Where is that stored? Etc. The questions raised are endless.

    A coworker implemented a basic Ruby service for this where I work, and I have to say that before he started he didn't find any open source alternatives that fulfilled anywhere near what we needed. Ruby made it pretty easy (a one- or two-person job), with the emphasis just being JavaScript and DOM coding to get the interface correct. Then we just had a RESTful service for storing these, and from there we'll keep adding features like messaging/e-mail alerts/etc. for the users when we get time. Yes, I'm aware that if I open sourced this you could help me out with that, but I'm sorry, my employer is not on that boat (yet).

    For your reference, even plain document management is a sticky problem to solve in open source; we've talked about it time and time again.
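
    A minimal sketch of the storage questions raised above (where annotations live, how to list them per document, how to track your own). Everything here is hypothetical and is not the Ruby service described: the `Annotation` fields, the `AnnotationStore` class and the JSON shape are assumptions chosen only for illustration, with an in-memory dictionary standing in for whatever a real RESTful backend would persist to.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class Annotation:
    # Hypothetical fields; a real service would likely add timestamps and IDs.
    document_id: str
    author: str
    quote: str
    comment: str


class AnnotationStore:
    """In-memory stand-in for a server-side annotation store."""

    def __init__(self):
        self._by_document = {}

    def add(self, annotation):
        # Group annotations under the document they belong to.
        self._by_document.setdefault(annotation.document_id, []).append(annotation)

    def for_document(self, document_id):
        # "Where are they stored?" Keyed by document, in insertion order.
        return list(self._by_document.get(document_id, []))

    def by_author(self, author):
        # "How do I track my annotations?" Filter across all documents.
        return [
            note
            for notes in self._by_document.values()
            for note in notes
            if note.author == author
        ]

    def to_json(self, document_id):
        # "How do I share my annotations?" Serialize one document's notes.
        return json.dumps([asdict(note) for note in self.for_document(document_id)])


store = AnnotationStore()
store.add(Annotation("paper.pdf", "alice", "We propose a method.", "Compare with prior work."))
store.add(Annotation("paper.pdf", "bob", "Results were mixed.", "Which dataset?"))
store.add(Annotation("notes.txt", "alice", "TODO", "Follow up."))
print(store.to_json("paper.pdf"))
```

    Notification (the "how am I notified" question) is deliberately left out; it would need some kind of subscriber list on top of `add`.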

    --
    My work here is dung.
  4. Jabref? by Anonymous Coward · · Score: 2, Insightful

    Does JabRef suit your purpose? http://jabref.sourceforge.net/

  5. Privacy concerns aside... by ircmaxell · · Score: 2, Informative

    Would Google's Wave work for you? It's real time, centralized, and browser based. I say privacy concerns aside, because the protocol is available, and people could build their own servers (such as http://code.google.com/p/pygowave-server/)...

    --
    If a man isn't willing to take some risk for his opinions, either his opinions are no good or he's no good
  6. It's not a DMS by Anonymous Coward · · Score: 2, Interesting

    I've been searching for something similar for a while but can't really find anything to fit the bill.

    What I'm looking for is a system that will allow you to highlight a particular quote in a PDF and attach a comment to it. When I finish my review I would like to have all my comments organized in a tabular format. The table should have the quote, page number (ideally also chapter and paragraph but this is asking too much) and my comment.

    This way I can attach my comment sheet to the top of the document and inspect it quickly without even having to open the actual document. These review sheets are also "portable" because they can be shared, and anyone, even with no infrastructure, could still identify the comment and quote.
    Adobe Acrobat does only half of this: you can highlight and comment, but you cannot control the export format (CSV or Excel would be cool).

    Does anyone know a system that does this?
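
    For illustration only, the review sheet described above (quote, page number, comment) could be exported to CSV along these lines. This is a sketch under assumptions: the `Annotation` record and the `annotations_to_csv` helper are made-up names, and it presumes the highlights have already been extracted from the PDF by some other tool, which is the hard part this does not solve.

```python
import csv
import io
from dataclasses import dataclass


@dataclass
class Annotation:
    # Hypothetical record for one highlighted quote and its comment.
    quote: str
    page: int
    comment: str


def annotations_to_csv(annotations):
    """Return CSV text with a header row, one annotation per row, sorted by page."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["quote", "page", "comment"])
    for note in sorted(annotations, key=lambda a: a.page):
        writer.writerow([note.quote, note.page, note.comment])
    return buffer.getvalue()


sample = [
    Annotation("Results were inconclusive.", 12, "Check the sample size."),
    Annotation("We propose a new method.", 3, "Compare with prior work."),
]
review_sheet = annotations_to_csv(sample)
print(review_sheet)
```

    The resulting text opens directly in a spreadsheet, so the sheet stays readable by someone who has neither the PDF nor the annotation tool.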

  7. For Mac, I use Papers by pacergh · · Score: 4, Informative

    I use Papers. It does not do everything you want, but it is a nice management tool. It is still growing in features, and the support staff is very responsive. (They provided me, same day, a new NIB file that allowed me to use it on my small hackintoshed Dell Mini 9 screen.)

    The link is here: http://mekentosj.com/papers/

    Otherwise, EndNote works well. I know many who use it. There are a few others out there as well.

    Good luck with it.

    1. Re:For Mac, I use Papers by pacergh · · Score: 2, Informative

      I like it a lot. It makes me sad that Apple is forcing me to install another OS sooner rather than later. Right now I have OS X Tiger, but won't update it further.

      As for docs, I mainly use it for viewing purposes. That's why I got it: a way to carry all my PDFs in a form factor larger than an iPhone. (I had an iPhone, but found it a pain to read and manage my collection on it. 4-hour reading sessions on the small text of the iPhone screen is not ideal.)

      With the iPad coming, I'll probably work to switch to that. I'm confident the Papers folk will have an edition for iPad.

      As for actual typing, I have a bluetooth keyboard (Apple) and Mouse (Apple, old BT mighty mouse). I actually typed 2 significant (25+ page) papers on it last spring for my masters classes.

      My masters dissertation, though, was typed on a 15-inch Macbook Pro. I didn't have to lug it as far, and the extra screen space was nice.

      When I wrote on the Dell Mini 9, I ended up writing drafts in Apple Pages or Scrivener, and then polished them up in Word (formatting and footnotes). I had to have a good Word output file because the school's printers were attached to computers with Word. ;-)

      Altogether, I do like OS X on the Mini 9. Still, I have been tempted, very tempted, to try both Moblin and Google Chrome, just to see what it's like. I'll wait until I get that iPad, though, so I won't regret losing my Apple HackBook.

  8. Zotero by yes+it+is · · Score: 5, Informative

    Zotero may well be what you're looking for. Much better and more open source than EndNote (mentioned above).

    1. Re:Zotero by mmsimanga · · Score: 2, Informative

      +1 Make sure to go to the actual Zotero.org site and install the beta, version 2. It has a whole lot more features than the version available from the Firefox addons site.

    2. Re:Zotero by takowl · · Score: 2, Interesting

      Is there something like Zotero that *isn't* a cloud service?

      Well, you could always use it without the sync feature: giving them your data is very much optional. For most users, their institution is likely only aware of EndNote and won't set up a server for them, so Zotero hosting the server themselves makes sense.

      I'm not sure that it really meets the OP's needs, though. It fits how I work brilliantly--it's designed for indexing web pages, like a highly structured bookmark manager. But the OP specifically talks about a collection of local files, which Zotero handles rather awkwardly. Any notes would be outside the file, for example, not embedded in it. Mendeley comes closer, but AFAIK it only deals with PDFs, not all the other formats.

  9. Zotero by Anonymous Coward · · Score: 3, Informative

    Zotero might be worth a look. It's an open-source Firefox plugin, mainly designed for keeping track of a collection of academic literature. It allows you to organize the papers in folders, tag, annotate, and share the papers and annotations with others, all easily available in the FF GUI. You can export lists of references to Word/OpenOffice/TeX when writing papers, and they can be autoformatted to a wide range of citation styles.

    It works really well (with full-text search) for storing web pages/pdfs. I don't know how well it works for .odt etc. Even if your purpose is not that of the typical university researcher it might be useful. For instance, recently I've liked using it for storing job ads, and my corresponding applications.

  10. iTunes... believe it or don't by Dystopian+Rebel · · Score: 2, Informative

    If you already have it installed, iTunes may be a simple solution.
    http://lifehacker.com/software/pdf/geek-to-live--organize-your-pdf-library-with-itunes-240447.php

    --
    Rich And Stupid is not so bad as Working For Rich And Stupid.
  11. Zotero by CAPSLOCK2000 · · Score: 3, Insightful

    About a year ago I needed a piece of software that matches your requirements. I wanted to be able to do my research from anywhere and keep track of notes and annotations in a very simple but searchable way.

    Zotero is the closest thing. It's not perfect, far from it, but none of the competition came even close.
    Zotero is a Firefox plugin that allows you to link or store information, be it web pages, PDFs or anything else you may see online. It's possible to group and tag your documents in various ways, and there are various options for taking notes and adding annotations.

    All of it is stored online so you don't need to carry anything with you. Just install the firefox plugin, enter your credentials and off you go.

  12. Consider Wikindx3 by pongo000 · · Score: 2, Informative

    Wikindx3 is a full-fledged bibliographic database that can manage *any* type of document, and permits annotations. As an added bonus, you can export the biblio info in any number of formats (including my favorite, .bib for LaTeX).

    I've had good success with OpenDocMan as well, but I'm not sure if that application permits annotation (at least I've never used that feature set).

    1. Re:Consider Wikindx3 by pongo000 · · Score: 2, Informative

      OWL is a nice setup in that it will automatically index all your PDF/RTF/whatever files. Its UI is a bit clunky, and documentation is sparse, but if you have the patience, it might be worth your time.

      I use all three of these apps (see parent also) in various capacities. Which, as you have discovered, indicates that there really doesn't seem to be a "killer" F/OSS app out there that handles everything for a full-fledged document management system.

  13. Zotero by tyroneking · · Score: 2, Insightful

    Zotero is brilliant. I could go on about how I use it every day at work and it makes everything a hell of a lot easier, but instead, just check it out.
    Versioning of documents it doesn't do - but that's what Mercurial is for I guess.

  14. SharePoint or OneNote by KnownIssues · · Score: 2, Informative

    Microsoft Office SharePoint includes the capabilities you mentioned (version control, archiving, document permission/ownership and search/indexing) and is on par price-wise with KnowledgeTree (though not free). They also have a hosted model, SharePoint Online.

    The capabilities you say you actually need--index, search, collaborate and keep and share notes--might be a better fit for Microsoft OneNote. It doesn't do version control or document permission/ownership, but it does what you described. At my place of business, there are two categories of people: those who love OneNote and those who haven't tried it.

  15. TagTeam by Anonymous Coward · · Score: 2, Informative

    For a basic, low-tech solution I'd suggest TagTeam (http://www.andrew-quinney.com/tagteam.html). It's a basic file tagging utility that makes use of filesystem metadata (PC and Mac), so any changes you make to a given file are immediately visible to others with access to the same file. It also includes a powerful searching language.

  16. Dspace? by andresambrois · · Score: 2, Interesting

    Have you tried it? It's quite powerful and free. They have a good tour video here: http://www.dspace.org/about-dspace/DSpace-Video.html

  17. An unmet need in the biotech community by cinnamon+colbert · · Score: 2, Interesting

    I work in a biotech startup with 12 people total. We have several thousand PDFs, mostly scientific publications downloaded from places like PubMed, along with some .ppts, .docs and other files. We use EndNote, a program from the behemoth in this area, Thomson Reuters, which has most of the software in this area. See http://thomsonreuters.com/products_services/science/science_products/a-z/procite Based on what I have seen, there is a huge need for software that meets our needs; the Thomson products are very $$ and awful, a classic case of crappy software with a lot of marketing. Programs like EndNote were created back in the 90s, for DOS machines, and they still look and feel like it once you get past the pretty home page GUI that Thomson has added on. If anyone out there is serious about making a product to compete, give me a holler.

  18. basic solutions are the best by godrik · · Score: 2, Informative

    I use a git repository containing a BibTeX file that tells me where the documents are, with an annote field containing my notes. The documents themselves are put in the git repository too. If I need to annotate the paper itself so that I don't forget something about it, I use Xournal. Then I push everything to the git repository.

    It implies that people update the repository, which is, in my opinion, not really a problem.
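
    As a rough sketch of how such a BibTeX file could be searched by its notes: the sample entry, the `file` field name for the document path, and the `parse_entries` and `find_by_annotation` helpers are all assumptions made for illustration (only the annote field is mentioned above). The regular expressions handle flat `name = {value}` fields only, with no nested braces, so a proper BibTeX parser would be the safer choice for a real collection.

```python
import re

# Hypothetical entry; "file" as the path field is an assumption.
SAMPLE_BIB = """\
@article{doe2009,
  author = {Jane Doe},
  title  = {An Example Paper},
  file   = {papers/doe2009.pdf},
  annote = {Good overview; see section 3 for the proof.}
}
"""

# Matches "@type{key, ... }" where the closing brace sits alone at line start.
ENTRY_RE = re.compile(r"@(\w+)\s*\{\s*([^,\s]+)\s*,(.*?)\n\}", re.DOTALL)
# Matches flat "name = {value}" fields; nested braces are not supported.
FIELD_RE = re.compile(r"(\w+)\s*=\s*\{([^{}]*)\}")


def parse_entries(text):
    """Map each citation key to a dict of its lower-cased field names and values."""
    entries = {}
    for _kind, key, body in ENTRY_RE.findall(text):
        entries[key] = {
            name.lower(): value.strip() for name, value in FIELD_RE.findall(body)
        }
    return entries


def find_by_annotation(entries, term):
    """Return citation keys whose annote field contains term (case-insensitive)."""
    needle = term.lower()
    return [
        key
        for key, fields in entries.items()
        if needle in fields.get("annote", "").lower()
    ]


entries = parse_entries(SAMPLE_BIB)
for key in find_by_annotation(entries, "proof"):
    print(key, entries[key]["file"])
```

    Because the file is plain text, the usual git diff and merge tools keep working on it when several people update the notes.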

  19. From what you describe ... by oneiros27 · · Score: 4, Informative

    You're looking for a reference management system, not a document management system. (although, they might not deal with all of the stuff that you mentioned that a document management system will)

    Zotero should work for a single person, but if you're trying to do this for an office, you might want to take a look at Aigaion.

    If you want to look at others to see what best fits your needs, see:
            http://en.wikipedia.org/wiki/Comparison_of_reference_management_software

    And, if you still can't find anything -- try asking on the Code4Lib mailing list, as you might need one of the 'integrated' library solutions.

    --
    Build it, and they will come^Hplain.
  20. Bibdesk and Mendeley by coaxial · · Score: 2, Insightful

    I use BibDesk on the Mac, and I like it. Specifically, I like that it organizes all my PDFs into folders and stores all the data in a BibTeX file. The only problem I have with it is that it stores the paths as Mac OS X aliases, so instead of getting a nice pathname, you get a hash 1500+ characters long. I'd really like a way to convert those back to paths so I could migrate in the future if I need to.

    I used Mendeley for about 10 minutes, but I was impressed. It looked really good. It's cross-platform and web-based. The only reason I'm not using it is that I had already started with BibDesk, and it just wasn't quite worth converting over (again, the pathname issues). Still, I'd recommend it.

    Anything that doesn't support BibTeX is simply a non-starter.

  21. Great suggestion.. by msimm · · Score: 2, Informative

    So far it's one of the best I've tried, and it does a pretty great job of extracting all the reference/author data. As a desktop application, for my purposes at least, it seems just about perfect. My only current quibbles (only an hour or so into use) are 1) the way its search handles multiple matches within a document (hint: it doesn't) and 2) the way it displays matched documents (matches aren't highlighted and must be manually paged/scrolled to).

    Those 2 points are kind of important issues for an indexing/search/research tool, but overall I'm still really impressed with the project, and features like the folder watch (rather than manually importing new documents) definitely add value.

    Of course it's pretty slick too, which is always nice.

    --
    Quack, quack.
  22. Agorum by Nagilum23 · · Score: 2, Informative

    From what I've heard http://www.agorum.com/ is what you're looking for.