Slashdot Mirror


Open Source Automated Text Summarization?

TrebleJunkie writes "I've spent some time recenting looking for open source projects dealing with Automated Text Summarization -- automatically generating detailed summaries from longer documents -- to no avail. I can find a lot of research papers and several commercial projects, but no open source code or projects? Does anyone out there know of any?"

3 of 38 comments (clear)

  1. Microsoft Summarize by cabalamat2 · · Score: 4, Funny

    I know it's not open source, but have you tried the Summarize feature in Microsoft Word? I fed it the entire contents of the GNU website and it came back with:

    GNU is rubbish. Don't use the viral GPL! Bill is your friend. You love Bill. Microsoft software is the best.
  2. I'll GPL this: by Bazman · · Score: 4, Funny
    perl -ne 'split;foreach(@_){print $_." " if (rand()>.9)}'

    Try it on man pages:

    man awk | perl -ne 'split;foreach(@_){print $_." " if (rand()>.9)}'

    and it still makes sense! :)

  3. Re:Have a look in CPAN by orangesquid · · Score: 4, Interesting

    Lingua::EN::Summarize tested on the GPL v2:

    USA Everyone is permitted to copy and distribute verbatim copies of this license document. Changing it is not allowed. Preamble The licenses for most software are designed to take away your freedom to share and change it. The GNU General Public License is intended to guarantee your freedom to share and change free software. To make sure the software is free for all its users. (Some other Free Software Foundation software is covered by the GNU Library General Public License instead.). We are referring to freedom. Our General Public Licenses are designed to make sure that you have the freedom to distribute copies of free software (and charge for this service if you wish).

    It seems comparable to MS Word..

    --
    --TheOrangeSquid Is it any wonder things seem so awry? We swim in a sea of confusion and don't have to think to survive