Google Wants Your Voice Data
00_NOP writes "Peter Norvig, Google's director of research, has told New Scientist that one of the reasons the search engine launched Google Voice is that it needs more human voice data to perfect the sort of 'big data, simple algorithm' probabilistic approach to translating voices to text that drives Google Translate. Norvig says that no one is listening to your calls on Google Voice — it is simply their servers trying to get the translation right."
I will say that the translation of my voice mails is terrible. Although, how can you tell if it is translated correctly if you don't listen to it? You can look for proper English, but even some of my translations are proper English yet still incorrect. (names, etc come out wrong.) Though most of the time it it's just a jumbled mess that I can't deduce the actual meaning of.
How do servers assess whether they've got the translation correct without having a human-in-the-loop to listen to the conversation and concurrently read what the server translated? Maybe the data is anonymous by the time it gets to a human, but it seems like humans need to interface with the voice data somehow to validate that the server is translating accurately.
>simply their servers trying to the translation right
>trying to the translation right
>the translation right
Nicely done.
First post? Tell it to google, I'm sure they'll listen.
oh shit! Google accidentally my voice data!
I can tell that they want the voice data badly. They make it very difficult to delete call and voicemail history. You can't delete more than 10 records at a time and even then they go into trash and keep piling up over there. You can delete the data from trash but again only 10 at a time. There is no option to empty the trash. Their help section says that the history is purged from trash after 30 days automatically but only that it isn't. My call history sits in the trash indefinitely unless I painstakingly delete all history 10 records at a time.
"it is simply their servers trying to the translation right"?
Slashdot accidentally the whole verb!
Or maybe they just put it through Google Translate... it's 'probably' right.
My voice is my passport. Verify me.
Brent Spiner would want to be paid for it.
They have another server that checks the first server's translation. Part of their work is checking that server's affectiveness, too.
This is the price of "free" services.
I gave up trying to get voice software to work over a decade ago. The reason is that I'm trilingual and use all 3 daily. So the software needs to be able to:
- understand a lousy accent: there are some words I cannot and will never be able to pronounce 'right'
- recognize what language is being spoken (having those 3 and only those 3 preset in the options)
Now I haven't tried Google Voice, but none of the software I've tried or heard about could even remotely do those two basic things.
Non-Linux Penguins ?
it is simply their servers trying to the translation right.
Time to offend someone
And you're surprised why? All voice apps. do this. Always have, always will, and until it's perfected, and we're a long, long way from perfecting it.
Steven
"it is simply their servers trying to the translation right"
I think you a word in your sentence.
"Peter Norvig, Google's director of research, has told New Scientist that one of the reasons the search engine launched Google Voice is that it needs more human voice data to perfect the sort of 'big data, simple algorithm' probabilistic approach to translating voices to text that drives Google Translate. Norvig says that no one is listening to your calls on Google Voice — it is simply their servers trying to the translation right."
I think Google Voice translated the last part of that sentence.
The translation is off pretty far most of the time for my voicemail. But they do end up to be entertaining. Here are two actual translations from google voice:
1) Okay, I don't know why it takes for ever, for your voicemail to pick up. But anyway, I was just calling to tell you that we forgot to while. I will and I told Mrs. Smith and this is best but she signed about it, so I'm gonna shout in the car and have it for her after I pick her up, bye. You Get Out virtual slot is not with us. So, wish me luck. I don't have to run out over there, bye.
2) Hey, it's me. Yeah, I would you like. The has the can't seeing tomorrow night and then there's the car was up for a late night should keep talking about it I'm sure wondering if we could let him get the cantinas left for due to Cardiff blues. I thought I'd run it by you and see what you thought, let me know if any a message or something, so I can out and I will talk to you later. Bye.
Right. Everyone get on Google Voice with funny unnatural accents, unusual intonation and non-native grammar! Let's skew their data.
Comment removed based on user account deletion
as far as conspiratorial fear mongering? who does it better? the truth. other outstanding features; religious crusaderism, chosen ones minionisms, genocidal historisms, & restorisms. then there's mythical magical fictional being promotions, & real sex religious 'trainings'. who do we trust? thanks again for the observable progress by others in the disarming arena. as for the 'weather'? tell the truth.
Except of course the servers, and the people later on who are looking at the translation and also listening to the original perhaps to see if it was correct.
Yes, if you agree to them doing it.
Which means you're a moron for not knowing what you're talking about.
There was a Userfriendly.org strip years ago which pretty much summarizes my experience with voice recognition software for the past 15 years. . .
I can't find the link to the comic anymore, but basically, one of the guys in the office had been trying to use voice recog software. Some of his coworkers come to his office. He's not there, but on the screen, they wonder about the mysterious message, "Cod Am Pizza Ship".
I think I read somewhere that this was the same reason that Google 411 launched. It was a great, free service for getting directory information from a cell phone or home phone. What Google was really doing was collecting and testing voice recognition algorithms.
Unfortunately, they killed the service last year. Now I use a free one sponsored by Bing. Go figure.
The Strip.
Check out the app Communicate for Android - it grabs every word nearly spot on.
Oddly enough, if you use Google Voice Search and opt-out of the 'personalization' feature that records your voice and stores that data with your account - it doesn't work anywhere nearly as well.
What is going on? How can this app, built on Google Voice, be so accurate, while the companies native offering doesn't fare as well unless you 'teach it?'
Tinfoil hat says the company is mis-hearing your voice on purpose, to encourage you to upload your voice samples, which fits nicely in the grand scheme of things.
Y'all kin have mah voce data. Sheeeit! I warn't doin' nutin' wid it anyhows.
Have gnu, will travel.
so they can send higher quality data to the CIA / NSA about what you are saying and who you are saying it to.
I'd be willing to let this happen if google then released the derived heuristics as free open source software. I'll share if you share.
Google is knot an evil umpire. They our hear 2 us with wheel whirled problems. Please stop bash tag google. All your words belong to us.
The next site to slashdot will be ready soon, but subscribers can beat the rush and start slashdotting it early!
Google announces Google Voice, noting that it will be archiving and auto-transcribing subscribers phonecalls.
"But don't worry," Google Voice Product Evangelist Boris Badinov said at the press conference announcing the service's launch. "We promise full interoperability with Google Docs, GMail, Android, and the NSA. Also, the artist who does the daily search engine doodle has promised to come up with a really cool, shiny logo."
And around the world, geeks sign up in droves, many noting that they didn't even realize they needed this, but if Google sez it will be shiny, they'd better get on board with it quickly.
It's interesting that all the comments so far have been about the technology etc.
This breaks just about every federal and state wiretapping law that exists. If a person or organization wishes to record the phone conversation of two individuals, according to federal law, one party must have full knowledge that the call will be recorded and give her consent before the call.
Additionally, according to some states' law, both parties must have full knowledge and give prior consent.
The key here is the prior consent. Google is breaking all of these laws by recording first then telling the people after the call about it. By that time the recording is a criminal act, no matter what under federal laws and ALL state laws.
So.. They are trying to teach GALaDOS (Google Artificial Lifeform and Disk Operative System) to speak?
This is the same as how they put "Closed Captions" on youtube videos.
Google has no interest in crowd-sourcing the translation or transcription of speech, they want it all automated.
Which is why YouTube Closed Captions SUCK!
Cwm, fjord-bank glyphs vext quiz
GV has had an opt-in feature to basically donate each GV transcription to Google with an indicator of whether you thought it was a good or bad transcription. Is there any evidence that Google is delving into your voice data without your consent? Were you expecting a GV transcription *without* a machine at least analyzing the voice data that came in and then discarding it?
Just grab audio from thousands of dialogs or talks on YouTube and test it out.
Except for ending slavery, the Nazis, communism, & securing American independence, war has never solved anything.
Play talk radio or music in the background. Loop recital of you saying alphabet. Baby crying. etc.
There will only be one voice operating System. Google wants to get there first.
I'm sure Google wants to be able to identify people by their voices... I mean in the digital era, where you have pseudonyms, multiple identities, and where portable (micro)phones are proliferating, it would be a mistake not to take advantage of the opportunity to identify or disambiguate people's identity thanks to their voice signature... I'm going to choose my future phone operating systems very carefully...
I love the idea of the feature -- I hate stopping to listen to voice mails. ...but I got the most ludicrous / hilarious translation yesterday. Pure poetry!
"Hi, My name is The bring the Anderson and I was interested in ordering. I'll call. Sarah, Mrs. Kate, on the Hudson birthday. He really liked all here in the for you. So, anyway, I would see it for next Sunday. I'm not sure where they said of. It's Hey Lady, Thank tonight anyway. If you can just give me a back. My phone number is 972."
Good to know about the "helpful? / donate" feature (from a comment above), which I did for this one...
"Norvig says that no one is listening to your calls on Google Voice — it is simply their servers trying to the translation right."
Uh Huh... Suuuuuuuuuuuurrrrrrre they are.
Those two requirements don't exactly strike me as "basic".
Computers do things far from "basic" with a good algorithm.
"Now how did the machines know what Tasty Wheat tasted like, huh? Maybe they got it wrong. Maybe what I think Tasty Wheat tasted like actually tasted like, uh ... oatmeal or tuna fish. That makes you wonder about a lot of things. You take chicken for example. Maybe they couldn't tell what to make chicken taste like which is why chicken tastes like everything!"
$20 says the CIA outsources Echelon to Google in the near future.
FAIL. They can not win. I posted this way for more than one reason. Why the lag on the text box eh?Fail on slashdot too.