Slashdot Mirror


The Rise of Machine-Written Journalism

Hugh Pickens writes "Peter Kirwan has an interesting article in Wired UK on the emergence of software that automates the collection, evaluation, and even reporting of news events. Thomson Reuters, the world's largest news agency, has started moving down this path, courtesy of an intriguing product with the nondescript name NewsScope, a machine-readable news service designed for financial institutions that make their money from automated, event-driven trading. The latest iteration of NewsScope 'scans and automatically extracts critical pieces of information' from US corporate press releases, eliminating the 'manual processes' that have traditionally kept so many financial journalists in gainful employment. At Northwestern University, a group of computer science and journalism students have developed a program called Stats Monkey that uses statistical data to generate news reports on baseball games. Stats Monkey identifies the players who change the course of games, alongside specific turning points in the action. The rest of the process involves on-the-fly assembly of templated 'narrative arcs' to describe the action in a format recognizable as a news story. 'No doubt Kurt Cagle, editor of XMLToday.org, was engaging in a bit of provocation when he recently suggested that an intelligent agent might win a Pulitzer Prize by 2030,' writes Kirwin. 'Of course, it won't be the software that takes home the prize: it'll be the programmers who wrote the code in the first place, something that Joseph Pultizer could never have anticipated.'"

1 of 134 comments (clear)

  1. Not for human consumption by PCM2 · · Score: 0, Flamebait

    Wired occasionally carries good stories, but this ain't one of them. It sounds portentous and should play well to all the anti-journalism reactionaries and self-styled media pundits, but really this is just flying cars and robot butlers.

    It's important to note here that NewsScope isn't a news service like Reuters; rather, it's a targeted data stream for the finance industry. Its output is not meant to replace the work of human journalists. Its output is not even meant to be read by humans.

    But leave it to Wired to come up with an angle like "NewsScope has started carrying stories written by machines." A writer less enamored with breathless futurism might instead say that NewsScope parses corporate financial statements and extracts relevant data points, which it then summarizes in a machine-readable format, stripping out all the excess verbiage and historical statements that aren't useful to automated trading software. It's somewhat analogous to a search spider, one that builds an index of finance news as it crosses the wire, making the data easier for third-party software to query.

    This isn't the Master Control AI writing news stories, people. It's a product -- and probably a pretty valuable one if you're in that industry.

    Similarly, TFA says the program that generates news stories based on stats was "rigged up" by some college students. Is it useful? Potentially. Is its output capable of replacing human sports journalists? Is it even publishable? There's no evidence that anybody even suggested that. How many of your college projects changed the world?

    TFA goes on to talk about how reporters have been forced to pick through information by hand -- for example, reading volumes of PDFs -- and how much nicer it would be to have machine-readable data to query. Well, no kidding! You're not alone there, brother; I like Google, too.

    And then, like so many breathless Wired article, this one evaporates:

    Further out toward the horizon lies the prospect of intelligent systems that filter vast quantities of unstructured content, drawing inferences that can be formatted according to journalistic norms.

    Uh-huh. Where can we find that horizon, precisely? And "formatted according to journalistic norms" -- what does that even mean? And then:

    Along the way, of course, intelligent systems will need to start coping with the complexities of human language have so far confounded them, including idiom, metaphor and sarcasm.

    "Of course," indeed. As Han Solo once said, "Well that's the real trick, isn't it?"

    --
    Breakfast served all day!