Google Open-Sources SyntaxNet Natural-Language Understanding Library, Parsey McParseface Training Model
Google announced on Thursday that it is open sourcing its new language parsing model called SyntaxNet. It's a piece of natural-language understanding software, Google says, that you can use automatically parse sentences, as part of its TensorFlow open source machine learning library. The company also announced that it is releasing something called Parsey McParseface (Google has a sense of humor), which is a pre-trained model for parsing English-language text. Nate Swanner of The Next Web, attempts to explain it: Combining machine learning and search techniques, Parsey McParseface is 94 percent accurate, according to Google. It also leans on SyntaxNet's neural-network framework for analyzing the linguistic structure of a sentence or statement, which parses the functional role of each word in a sentence. If you're confused, here's the short version: Parsey and SyntaxNet are basically like five year old humans who are learning the nuances of language. In Google's simple example above, 'saw' is the root word (verb) for the sentence, while 'Alice' and 'Bob' are subjects (nouns). Parsey's scope can get a bit broader, too.
It's a piece of natural-language understanding software, Google says, that you can use automatically parse sentences, as part of its TensorFlow open source machine learning library.
YOU CAN USE AUTOMATICALLY PARSE SENTENCES
So, can Parsey McParseface make sense of what manishs posts? Because I generally can't. I assume that the example sentence from the summary probably came from the article, but for some reason the "editor" didn't think to read his summary to make sure that it actually made sense out of context.
How can we continue to believe in a just universe and freedom to eat crackers if we have no ale?
James while John had had had had had had had had had had had a better effect on the teacher.
The company also announced that it is releasing something called Parsey McParseface (Google has a sense of humor)..
If by 'sense of humor' you mean 'a repeat of something that was humorous a while ago under a different context'.
"I like to lick butts!" by MobileTatsu-NJG (#32700246) (Score:5, Informative)
Fruit flies like a banana.
Bison from Buffalo, New York, are known to bully other Buffalo bison, who in turn bully (buffalo) other New York bison. In other words:
Buffalo buffalo buffalo buffalo buffalo buffalo buffalo.
Knowledge workers process natural language inquiries and recall from the 0.01% of human knowledge they have managed to memorize the relevant details to solve the problem, identify where to look for more information, and/or refer the inquiring individual to the correct resource they need to solve the problem themselves.
A computer capable of parsing natural language inquiries can construct an appropriate query of all publicly accessible digitized human knowledge and analyze the contents of that knowledge to identify what information is relevant to the inquiry.
Processing the relevant information in to a solution will still require a capability to generate useful models from that understanding that allow it to identify optimal solutions. Preprocessing and structuring the data for analysis is one of the largest obstacles preventing forward-progress towards that goal.
If the importance of Natural Language Processing is lost on you, that's a reflection of your own ignorance more than it's a reflection of the value of the achievement.
I suppose you thought the important use-case to measure this achievement was beating the Turing test? That's the five-year old you're referencing right?
What if this entire post was written by a Syntaxnet based AI? When the comments sections of websites are flooded with public opinion shaping bots from the NRA, Brady Campaign, and presidential campaigns: will you care then?
A concise version of the Library of Babel expressing every idea if a language?
Not really, because Xy McXface is not funny for any value of X.
Confucius say, "Find worm in apple - bad. Find half a worm - worse."
94% syntax is definitely good, for a machine learning parser. Now if you were to come to the land of rule-based parsers, 94% is the norm.
Google loves machine learning, and it's easy to see why. That's how they made their whole stack. They have the huge amounts of data to train on, and the hardware to do so. It's so seductive to just throw a mathematical model at huge amounts of data and let it run for a few weeks.
Rule-based systems don't need any data to work with - they just need a computational linguist to spend a year writing down the few thousand rules. But the end result is vastly better, fully debuggable, easily updatable, understandable, and domain independent. That last bit is really important. A system trained for legalese won't work on newspapers, but a rule-based system usually works equally well for all domains.
In 2006, VISL had a rule-based parser doing 96% syntax for Spanish (PDF) - our other parsers are also in that range, and naturally improved since then. Google is hopelessly behind the state of the art.
that is precisely why it is not funny. google peons are just parroting something funny to try to be funny.
A two-year-old gelding destined to race in Australia has been saddled with the name Horsey McHorseface. (pun intended by editors)
http://www.bbc.com/news/world-...