Google Trains AI To Write Wikipedia Articles (theregister.co.uk)
The Register: A team within Google Brain -- the web giant's crack machine-learning research lab -- has taught software to generate Wikipedia-style articles by summarizing information on web pages... to varying degrees of success. As we all know, the internet is a never ending pile of articles, social media posts, memes, joy, hate, and blogs. It's impossible to read and keep up with everything. Using AI to tell pictures of dogs and cats apart is cute and all, but if such computers could condense information down into useful snippets, that would be really be handy. It's not easy, though. A paper, out last month and just accepted for this year's International Conference on Learning Representations (ICLR) in April, describes just how difficult text summarization really is. A few companies have had a crack at it. Salesforce trained a recurrent neural network with reinforcement learning to take information and retell it in a nutshell, and the results weren't bad.
Obligatory XKCD Reference: https://xkcd.com/810/
It might be fun to watch the Google Wikipedia AI Bot get into "turf wars" with existing Wikipedia Bots...
If you want news from today, you have to come back tomorrow.
Can this bot win edit wars, get Wikipedia administrators to side with it, drive n00bs off its pages? Without that, it's not very useful on Wikipedia itself.
Such models have no common sense yet - can't tell if "the use of the umbrella causes the rain or the other way around". They can't think like us, they just copy text and try to hit all the sub-topics with naturally sounding language based on the source material. It's more similar to Google translator than a human Wikipedia editor.
Great. Just what we need. A trained monkey that summarizes the summarizers.
According the the article, "The generated sentences are taken from the earlier extraction phase and aren’t built from scratch, which explains why the structure is pretty repetitive and stiff."
Mohammad Saleh, co-author of the paper and a software engineer in Google AI’s team, told The Register: “The extraction phase is a bottleneck that determines which parts of the input will be fed to the abstraction stage. Ideally, we would like to pass all the input from reference documents. “Designing models and hardware that can support longer input sequences is currently an active area of research that can alleviate these limitations. We are still a very long way off from effective text summarization or generation. And while the Google Brain project is rather interesting, it would probably be unwise to use a system like this to automatically generate Wikipedia entries. For now, anyway.
Also, since it relies on the popularity of the first ten websites on the internet for any particular topic, if those sites aren’t particularly credible, the resulting handiwork probably won’t be very accurate either.
My faux outrage is entirely synthetic.
Insofar as the google understands that knowledge is power, I'm only surprised that they decided to show their hand. Maybe Wikipedia is less naive and less harmless than I thought? The google perceives an actual threat to their Gawd Profit?
I've actually been considering this branch of technology in terms of specific applications, such as (1) Writing assistance to help people tell their hidden stories (in interesting ways, of course), (2) Email analysis for asymmetric celebrity email systems (as a dual of the spam problem), or (3) Aggregation of EPR (Earned Public Reputation) for such places as Wikipedia (and Slashdot), just to focus on the three that I keep banging my head on. Automatic digesting of knowledge for such purposes as Wikipedia articles is actually a more pragmatic branch with high relevance to the corporate cancers as they seek the elimination of their most serious cost: Paying the salaries of all those pesky human beings. Much better to harvest their knowledge and flush them away.
I better confess that yes, I do have a personal axe to grind, having been flushed by one of those large cancers. I still think I have some mental capacity to work, and even a desire to do so, but they can always spot my age with one glance at my resume. Won't even give me a chance to underbid the young whippersnappers. If it ain't the question of how well I've kept up with the technologies, then it's the belief the youngsters have fewer bad habits to unlearn plus the life expectation thing...
Ha ha! Last laugh's on them. The google can't harvest my abundant mistakes to learn anything from them. They were all erased before I turned in my last corporate computer. (Actually, most of my corporate career was fixing OTHER people's mistakes. With my terrible attitude, I was a natural for the work.)
Almost forgot one more laugh. The problem faced by the corporate cancers is fundamentally unsolvable. There is NO largest profit that will solve their desperate need for the infinitely large profit number. There's only the threat of slumping next quarter.
Freedom = (Meaningful - Coerced) Choice != (Speech | Beer^2), and sad sock puppets' bad mods avail them naught.
Most articles I find on the net follow a pretty consistent pattern, using one of two variations on that pattern:
How To Foo a Fizz
Fizz s very popular these days blah blah. First paragraph says nothing useful at all.
Fizz is good for blah blah. Second paragraph also pointless.
Sometimes it helps to Foo your Fizz. Some people like to Foo it because blah blah blah.
You can Foo your Fizz by:
Clicking the tiny menu at the bottom
Choose Preferences
Select "Foo"
Now your Fizz is Foo and blah blah blah.
Share this on Facebook. On Twitter. On Google Plus. MySpace. Yourspace. Farmers only . Black people meet. Stupid people meet.
Pretty standard pattern. The first two or three paragraphs are pointless. Sometimes they forget to actually tell you how to do it, and ONLY have the fluff. That's really annoying.
The "human interest" version is similar:
How to Close a Resume Cover Letter
Debbie Wood, a mother of two from Englewood, Colorado was driving home in her blue Mustang when she stopped for some fries. After eating them, with ketchup, she got a call saying she was fired. Blah blah blah.
Blah blah blah about Debbie.
Debbie worked at Poor Writing Inc for six years, starting out as an eraser. Blah blah blah.
Debbie wrote "I'm looking forward to hearing from you" at the end of her cover letter. It worked great.
Debbie now works at blah blah blah. She enjoys blah at her blah job blah blah blah.
Share this on Facebook. On Twitter. On Google Plus. MySpace. Yourspace. Farmers only . Black people meet. Stupid people meet.
Pretty much the entire useful part of the story is the fourth paragraph.
This is just crying out to be applied to some famous texts to amuse us with what it comes up with.
The Hunting of the Snark. Fox in Socks. We're Going on a Bear Hunt. Ulysses. 50 Shades of Grey. Titus Andronicus. Sonnet 130. Harry Potter and the Portrait of What Looked Like a Large Pile of Ash. The Magna Carta. Genesis. Terms and Conditions for iTunes.
Quattuor res in hoc mundo sanctae sunt: libri, liberi, libertas et liberalitas.
And then just spin your article via word AI https://wordai.com/... Oh thats what the penguin was for was for. ::) Grrrr
[($)]
than saving it. They are anti-information. I gave-up after I created a page for my uncle that had a platinum record and five gold records that was deleted as not being notable.
most of the information within a wikipedia page is spread around on little visited websites in terrible formatting it actually takes someone who understands or wants to understand "the subject" to actually do a half decent job
the fact it simply takes the summary of the summarizers basically makes it pointless... go back to you lisp machines "researchers"
A Google AI could hardly do worse than a large number of Wikipedia entries.
I didn't realise there is a Google Trains subsidiary. But even so, why does it have an AI and why would this AI edit Wikipedia?
"It's such a fine line between stupid and clever" -- David St. Hubbins, Spinal Tap
This guy was already doing that in 2014?
Seriously, if this is done right, it might actually give us 'cliff notes' on businesses, their web sites, ideally, even connect the subsidiaries.
And perhaps they will show the dots WRT family connections.
I prefer the "u" in honour as it seems to be missing these days.