IBM Open Sources UIMA
psykocrime writes "Line56 is reporting that IBM has open-sourced core components of their UIMA project. UIMA is (or was) an IBM Research Project
for managing "unstructured information" such as free-text. IBM describes UIMA as: 'an open, industrial-strength, scalable and extensible platform for creating, integrating and deploying unstructured information management solutions from combinations of semantic analysis and search components. [...] and makes the core Java framework available as open source software to provide a common foundation for industry and academia to collaborate and accelerate the world-wide development of technologies critical for discovering the vital knowledge present in the fastest growing sources of information today.' The newly opened source is available at SourceForge."
FTFA: UIMA is about text analytics, which encompasses functions like search and pattern analysis. "It's able to create sophisticated solutions that extract insight from a large amount of unstructured data stored in computers," says Dr. Nelson Mattos, IBM VP of Information and Interaction. "The goal is to create an open standard for analytics."
In other words, it's now an opensource way of picking out themes in text. So, I guess you can (or the company you work for) have a version of Carnivore
Whatever the hell that means...
If you stop and think about it for a moment, this would be stupid for IBM to do, and it's obviously not what they are doing.
Paradoxically, it costs money to open source something (at least for a big company like IBM). Getting the package ready for the light of day, getting all the approvals and sign-offs, and staffing the resource to keep working on it, all take money.
IBM wouldn't do that for something it doesn't consider valuable in some way to its business.
This is actually a perfect example of the kind of thing IBM likes to open source. It's a framework that (hopefully) will actually make unstructured analysis components work together in an enterprise (at least as developers start adopting it). In order to get to "critical mass", open sourcing the framework lets developers know that they can count on it (a) being there and (b) being free. It'll (again hopefully) make the market space larger for all players, allowing IBM to compete on its merits, rather than having to try to displace entrenched proprietary solutions.
That, or maybe you're just trying to get the word out to developers. Since the mainstream press will basically just print anyone's press releases as articles without revision, you have a pretty good chance of getting your word out there verbatim these days. It doesn't matter if they can understand it if they don't even try to understand it.
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"