IBM vs. Content Chaos
ps writes "IBM's Almaden Research Center has been featured for their continued work on "Web Fountain", a huge system to turn all the unstructured info on the web into structured data. (Is "pink" the singer or the color?) IEEE reports that the first commercial use will be to track public opinion for companies. " It looks like its feeding ground is primarily the public Internet, but it can be fed private information as well.
Sounds good. There ought to be something similar under BSD or GPL.
Political dissidents would definitely benefit from this kind of super search system, and so do normal users like kids doing searches for their homework.
We need our own "commie" version.
I wish I was fluent in computer languages or else I'd be the first one to start this up under BSD licence.
Any suggestions as to what language I need to learn to develop this kind of search engine?
Its gotta have a capability like freenet to distribute load on the network and the system while keeping users anonymous, since private users won't have the resource to come up with 1000s of servers. I'm thinking on the lines of XML.