Slashdot Mirror


IBM vs. Content Chaos

ps writes "IBM's Almaden Research Center has been featured for their continued work on "Web Fountain", a huge system to turn all the unstructured info on the web into structured data. (Is "pink" the singer or the color?) IEEE reports that the first commercial use will be to track public opinion for companies. " It looks like its feeding ground is primarily the public Internet, but it can be fed private information as well.

5 of 216 comments (clear)

  1. I think a better question... by bc90021 · · Score: 5, Funny

    ...doesn't concern whether "Pink" is a colour or a singer, but whether "Paris Hilton" is a hotel in France or an oft downloaded video... ;)

  2. structure... by Rhubarb+Crumble · · Score: 5, Funny
    a huge system to turn all the unstructured info on the web into structured data

    In order to do this, they will use a scheme by which each document is referred to by a string including the transfer protocol, the host name, and a file path.

    oh, wait...

  3. Re:What about Existing Data? by Ronald+Dumsfeld · · Score: 5, Funny
    Are you telling me that there are programmers willing to go through [Insert Ludicrously Large Number Here] files and "annotate" them using XML to fit the new system?

    No, they're writing software to put in the XML tags.

    What will be more interesting to see is if it's possible to pollute the database by putting in your own XML. Instead of Google-Bombing we'll have people pissing in the WebFountain.
    --
    Where's the Kaboom?
    There's supposed to be an Earth-shattering Kaboom.
  4. SCO by Zork+the+Almighty · · Score: 4, Funny

    IEEE reports that the first commercial use will be to track public opinion for companies.

    Searching "SCO"
    Found "Slashdot"
    ERROR arithmetic underflow.

    --

    In Soviet America the banks rob you!
  5. CrapFountain by s4m7 · · Score: 4, Funny

    Here's how it works:

    Executive Bob, who's paid IBM $150,000 for his enterprise liscence of webfountain, enters into his webfountain search box: "Pink the musician, not the color"

    IBM's powerful software parses this command into "pink music -color" and passes it to google, retrieves the results, removes Google's paid ads and replaces them with IBM's paid ads. The content is then served to Executive Bob, who shouts: "EUREKA" since within the top ten search results he finds "NUDE PICTURES OF RAPPER PINK!"

    IBM then lands a lucrative support contract with Exectutive Bob to remove all the viruses and spyware from his desktop PC. Rinse and Repeat.

    --
    This comment is fully compliant with RFC 527.