Slashdot Mirror


User: l-carnitine

l-carnitine's activity in the archive.

Stories
0
Comments
1
First seen
Last seen
Profile
(view on slashdot.org)

Comments · 1

  1. Re:Yes it's a dupe, but lets get something straigh on Text-Mining Technique Intelligently Learns Topics · · Score: 1

    The evaluated http://gate.ac.uk/ which is GPL software but ended up using http://search.cpan.org/~acoburn/Lingua-EN-Tagger/. There are several other tools in this space that can be glued together to create this type of software:

    http://www-nlp.stanford.edu/
    http://tcc.itc.it/research/textec/tools-resources/ jinfil.html
    http://wordnet.princeton.edu/
    http://www.alias-i.com/lingpipe/web/faq.html
    http://www.isi.edu/licensed-sw/halogen/index.html

    Not trivial, but if you wanted to DIY, you don't need to start from scratch. Though, having a bunch of hardware to chug through 1000s of documents would still be needed :).