Slashdot Mirror


Open Source Natural Language Processing?

fieldmethods asks: "One area where Open Source and Free Software don't seem to have really taken off is Natural Language Processing (using computers to deal with human languages). There are a few open source projects, such as Festival (a speech synthesis system, now ported to Java), NLTK (a general-purpose NLP system in Python), and the Linguana project (a Perl implementation of a semantic network not unlike WordNet, but better). Generally, though, there doesn't seem to be a lot of Open Source momentum behind the field as a whole. It's a challenging field that would benefit from collaboration, especially given the potential of replacing static corpora with on-the-fly corpora developed by search engines. Is anybody else interested in this?"

6 of 31 comments

  1. Re:Doesn't look like it! by Some Guy · · Score: 2, Informative

    Actually, I am interested in this. I did some computational linguistics work while I was doing my BSc/MSc and really enjoyed it.

    You should also have mentioned your interesting website fieldmethods.net as a good source for exploring all things NLP [which I thought referred to Neuro-Linguistic Programming when I first saw it...].

  2. Did you google? by FeatureBug · · Score: 4, Informative
    Is there a reason you haven't tried answering your question using Google? You're not chasing karma, are you? Last I heard, Google is free.

    There is a huge amount of open-source NLP software, and a great many resources, for many languages on the web.

    Last but not least:

    Will.

  3. two projects by Kunta Kinte · · Score: 3, Informative
    http://freespeech.sourceforge.net/
    http://www.speech.cs.cmu.edu/

    There are probably others (search google.com, freshmeat.net, sourceforge.net).

    --
    Based on upvotes, Ageism is the only "-ism" Slashdotters care about and think isn't SJW
  4. open source NLP (in POESIA) by basiles · · Score: 2, Informative

    The POESIA project (an open-source Internet content filter, partly funded by the European Commission, safer Internet Access Plan IAP2117/27572) will have some open-sourced NLP components (for English, Spanish, Italian...).

    See POESIA site for details.

    POESIA (Public Opensource Environment for a Safer Internet Access) aims to protect European youth (in educational institutions) against harmful or inappropriate Internet content, and uses several techniques (including NLP, image processing, ...) to achieve this goal.

  5. Check out OpenCyc by jungd · · Score: 3, Informative

    One of the best speech understanding systems in existence is OpenCyc - and it is open source!

    --
    /..sig file not found - permission denied.
  6. I've got two links for you by perkr · · Score: 3, Informative
    General open source NLP tools:

    http://opennlp.sourceforge.net

    http://nlpfarm.sourceforge.net

    If you're looking for speech software, there isn't much good open-source software available, since just about every aspect of modern speech processing is patented.