Baidu's Voice Recognition Software Is More Accurate Than Typing (thestack.com)
The massive Chinese web services company Baidu has
launched their sophisticated new TalkType 'keyboard' which defaults to voice recognition app. An anonymous reader quotes The Stack: Baidu claims that the app's speech recognition is more accurate than actual typing, having developed and tested the technology alongside speech software experts at Stanford University...The researchers concluded that Baidu's technology was three times faster than a typical user typing in English. The results showed that the TalkType error rate was 20.4% lower than an English texter hunting and tapping for letters. The accuracy was even greater for those typing in Mandarin, with the error rate dropping 63.4% when using TalkType.
Of course, last year Baidu was also accused of gaming the testing for their image-recognition software.
Of course, last year Baidu was also accused of gaming the testing for their image-recognition software.
texter hunting and tapping for letters
your doing it wrong m8
That is like the test where someone claimed they defeated the Turing test by pretending to a retarded foreign boy that didn't speak English.
I guess they also only picked people who had never before typed anything in their life as well.
"20.4% lower than an English texter hunting and tapping"
What about touch typists? How does that compare?
I type at about 40wpm but there are others who can type
above 100wpm (words per minute).
Not at all unusual 'banter' among dudes that I have known.
Now, Billy Bush. I would just never be around that kind.
But for the recording. Whooop! Deee! Doooo! Daaaaa! The wife. Yeah, she can act all pissed off. Imagine the shoe on the other foot. I have. That would be unusual, but funny, and frankly, cool. And you know women do that all the time, too, you just don't get it hear it.
The condemnation, though. That is crocodile tears: makes for good short-time ratings.
And Baidu. Echo Me-too.
On a full English language keboard there is no way speech is faster if you know how to type. Now if you don't know how to type or you're using a touch screen, then yeah. Maybe if you're using Mandarin because it's not as straightforward as the Roman alphabet. But no, I can type considerably faster than I can talk and almost as fast as I can read, which is well over 100 wpm, and with a display and backspace key (since I'm human) my ultimate accuracy is 100%.
Brackets contain world's first nanosig, highly magnified:[.]
When I got my Trash 80 back in the 70s the first program I bought was a typing tutor. I've been touch typing for some 40 years ago and my fingers don't have any trouble keeping up with my brain. My mouth, not so much.
The article talks about speech recognition, not voice recognition. EditorDavid has the two concepts mixed up: speech recognition is all about trying to recognized what you are saying, whereas voice recognition is all about recognizing specific voice, like e.g. for reasons of identifying who is speaking.
The results showed that the TalkType error rate was 20.4% lower than an English texter hunting and tapping for letters.
How many of those errors could have been reliably corrected by some form of autocorrect, or was such already included in the tests?
If I try and type "thw quick rbown fox jump sover the lazy dog" as fast I can... well, that's the result. Autocorrect could have fixed most of those problems.
systemd is Roko's Basilisk.
Oh good, more assholes yelling into their phones while in public spaces. That's exactly what we need.
I ignore Anonymous Coward posts. If you want to discuss something, that's awesome. Log in.
Another thing -- when I'm typing, and there is an error, I'm right there to correct it.
With voice recog, at least right now, editing it after it's been screwed up by Google or whatever is more of a PITA than just typing it out in the first place.
Trying to actually do decent editing (at least on my S7) is seriously annoying. Cursor positioning is flaky as hell, parts of messages disappear above and blow the edit point, I try to drag the edit point and it scrolls up or down so fast there's no chance of actually getting where I meant to go...
I grant you that this kind of thing is the result of bad design at some level in Android or some library most everyone is using, and could be corrected... but right now, it's SN/AFU. That's a big factor in why editing as I go, rather than trying to get "somewhere" in something already containing lots of text, is much easier on my temper.
That said, I would welcome 99.99999% accurate voice recog. Not holding my breath, though.
I've fallen off your lawn, and I can't get up.
Here, have some Trumpy crotch grabbing humor.
I've fallen off your lawn, and I can't get up.
No, it's really not.
One of the things that characterises modern Chinese language is the proliferation of homophones (words that sound alike).
The way that Chinese people cope with this is extreme use of context and of spelling; the homophones don't have the same character. Sometimes Chinese people will clarify meaning by sketching a character in the air, often unconsciously.
If the error rate reduction is so huge based on speech recognition this would suggest that pinyin can replace characters for writing Chinese. And this has been disproved on many occasions; you can literally write an entire story using only the syllable 'ma'. In pinyin it all comes out as 'ma' with the 4 tones. In characters its actually readable. Same with the story of the lion eating poet in the stone den which is all 'shi'.
So a great test of this Baidu software would be to get someone to read this to it and see what it comes up with:
https://chinesepod.com/blog/ho...
https://en.wikipedia.org/wiki/...
and see if it gets it right:
Sh Shì shí sh sh
Shíshì shshì Sh Shì, shì sh, shì shí shí sh.
Shì shíshí shì shì shì sh.
Shí shí, shì shí sh shì shì.
Shì shí, shì Sh Shì shì shì.
Shì shì shì shí sh, shì sh shì, sh shì shí sh shìshì.
Shì shí shì shí sh sh, shì shíshì.
Shíshì sh, Shì sh shì shì shíshì.
Shíshì shì, Shì sh shì shí shì shí sh.
Shí shí, sh shí shì shí sh sh, shí shí shí sh sh.
Shì shì shì shì.
In the free world the media isn't government run; the government is media run.
I don't think the typical person 'hunts' and types anymore. Maybe 30 years ago......
"First they came for the slanderers and i said nothing."
What not a single comment about big brother implication .... specially for a Chinese company where each of them are accused of being an extension of the party.
And I don't care about look what yahoo did or whatever an extension is different from complying with the law in a democracy
I see
NSA is going to announce a competing keyboard. They're already digitizing everything but now they'll be sending you a copy as a courtesy and for proofreading and correction. Win-win!
The problem is the false sense of security and subsequent lack of proofreading and error correction.
Try voice recognition software for a week. You'll likely find that you will read over something that you dictated and not realize that there are errors in it. People are less likely to find errors in something they dictated than in something they typed.
Help! I'm a slashdot refugee.
Not sure where you got your incorrect information from, but the current keyboard was designed for maximum speed of the day, not to slow down typing speeds.
While it is true that there are keyboard layouts that can make typing faster on a computer, the current keyboard was designed to space out the hammers so they would not jam on typewriters thus increasing the speed people could type.
Microsoft, Apple, Google, Amazon what's the difference? All steal money from devs and control with walled gardens.
A character error is easy to read past be a word error changes the meeting.
Some drink at the fountain of knowledge. Others just gargle.
I can type more precisely and quickly and confidently than I can talk. Deciding which words to type is faster and easier. How is voice recognition going to improve on the speed by which I can speak, which is inferior to my typing ability whenever precision is required?
John_Chalisque
...the app is probably keeping track of everything a user speaks/types and sending it back home to China.