Are You Talking to Your PC Yet?

Clippy by Anonymous Coward · 2004-12-09 06:03 · Score: 5, Funny

So if I ask Clippy to STFU, will he?

Re:Clippy by khrtt · 2004-12-09 13:45 · Score: 1

Yes, you just have to know how to ask *evel grin*

Forget talking to PC's by Anonymous Coward · 2004-12-09 06:04 · Score: 0

Where's my transparent aluminum?

Re:Forget talking to PC's by Rei · 2004-12-09 06:08 · Score: 1

Right here.

(and yes, I know that this is not pure aluminum - it is aluminum oxide)

--
The human body can be drained of blood in 8.6 seconds given adequate vacuuming systems.
Re:Forget talking to PC's by IMarvinTPA · 2004-12-09 06:08 · Score: 1

Right Here

--
Trusting software vendors is no smarter than trus
Re:Forget talking to PC's by mhaisley · 2004-12-09 06:22 · Score: 2, Interesting

But then again, you have never seen pure aluminum...well, 99% of the population has never seen pure aluminum, it oxidies instantly. There are some methods for observation, but it's mostly not worth it, besides, pure aluminum looks mostly just like aluminum oxide.
Re:Forget talking to PC's by Zugok · 2004-12-09 06:58 · Score: 1

you can see it on your (older) CDs, or as close it gets

--
"I just can't sit while people are saying nonsense in a meeting without saying it's nonsense" J Watson, Sci Am 288:(4)51
Re:Forget talking to PC's by databyss · 2004-12-09 07:34 · Score: 1

Ruby consists of Aluminum Oxide.... I suppose that could be transparent enough for some people.

--
Hmmm witty sig or funny sig? Maybe elitest techy sig!

I treid with my Mac by the_2nd_coming · 2004-12-09 06:05 · Score: 3, Interesting

OS X has really good system wide integration for Voice commands. and the voice interpreter is pretty good for one that comes with the OS, but I could not get it to work consistently....

other than that I thought it was cool to say "computer give me brad's number" and it would display my buddy brad's phone number on the screen :-) (when it worked)

--

I am the Alpha and the Omega-3

Re:I treid with my Mac by jellomizer · 2004-12-09 06:18 · Score: 1

I just remember while I had it on. Someone behind me droped some books and went "crap" And it reconized it as closed application. Another trick is to adjust the sensitivity of the microphone to the right level if it goes all the way up to the top then your voice starts clipping and it doesn't work well.

--
If something is so important that you feel the need to post it on the internet... It probably isn't that important.
Re:I treid with my Mac by hode · 2004-12-09 06:23 · Score: 0

I built a picture frame mac with an old powerbook and mounted it on the wall. I talk to it frequently because that is the only way to control it. It understands me about 75% of the time, but I've found that I need to pronounce words the way the computer speaks them. For example, I pronounce the word turnip as turn-up while the mac says turn-ip. It won't understand unless I speak with its accent. If you're wondering why I'm talking about turnips with my mac, it was during one if its stupid knock knock jokes. Try saying "tell me a joke" to your mac, you'll see. They're awful ;)
Re:I treid with my Mac by pdiaz · 2004-12-09 06:35 · Score: 2, Informative

I don't know OS X Voice commands system, but I'm guessing that it works on a reduced vocabulary (i.e.: "close", "open", "mail", and such). That's the key there: with reduced vocabularies and a strict syntax speech recognition works pretty good. The challenge is to make it work with a natural lang . such as English (and in real time)

Said in another way: when you issue a voice command to OS X, it has to choose between 40-100 alternatives (this is guesswork). (True) speech recognizers work with +60K vocabularies...

--
Make It Secret . Free JavaScript implementation of AES for your browser
Re:I treid with my Mac by Anonymous Coward · 2004-12-09 06:55 · Score: 1, Funny

My internet browser heard us saying the word Fry and it found a movie about Philip J. Fry for us. It also opened my calendar to Friday and ordered me some french fries."
Re:I treid with my Mac by lowrydr310 · 2004-12-09 07:50 · Score: 1

Is anyone familiar with Canada's telephone directory assistance? Does it use voice recognition or is there an operator that listens to what you say?
I was in Toronto in 2000 and had to call directory assistance. I remember using a pay phone to call, and there was no charge (unlike the US). The system asked me for the listing, and within seconds I was given the number I needed. I don't see how any human could have given me the results that fast.
Re:I treid with my Mac by Anonymous Coward · 2004-12-09 08:48 · Score: 1, Funny

That's because we only have 5 numbers here in Canada. (kidding)

Actually, all it does is generate a random number. Fortunately, in Canada, we're all so polite that whoever you call will look up the number for you. We call it "Distributed Directory Assistance" or DDA for short.

You just got lucky and got the right number the first time. That happened to me once.
Re:I treid with my Mac by the_2nd_coming · 2004-12-09 09:20 · Score: 1

there was no charge because it is government run.

--

I am the Alpha and the Omega-3
Re:I treid with my Mac by compwizrd · 2004-12-09 09:55 · Score: 1

no.. bell canada is not government run..

it was no charge because it was from a payphone, which rarely has a phone book attached
Re:I treid with my Mac by lowrydr310 · 2004-12-09 10:01 · Score: 1

So do they use voice recognition, or is there an operator listening to your request and conducting the search, then letting a computer speak the phone number?

OpenSource Voice recognition projects? by farsideofthemoon · 2004-12-09 06:05 · Score: 5, Informative

Any recommended ones? http://sourceforge.net/projects/cmusphinx/

--
I know what's on your hard dr

Re:OpenSource Voice recognition projects? by Palshife · 2004-12-09 06:15 · Score: 3, Interesting

Sphinx is a great project. I definitely recommend it.

--
Attention deficit disorder is a complicated issue, spanning several major... HEY LET'S GO RIDE BIKES!
Re:OpenSource Voice recognition projects? by Anonymous Coward · 2004-12-09 08:48 · Score: 0

ugh, that webpage is terrible. totaly dev centric, definitely not ready for users.
Re:OpenSource Voice recognition projects? by spdt · 2004-12-09 12:42 · Score: 1

How about this one?

Summary of the article by Ingolfke · 2004-12-09 06:05 · Score: 5, Funny

All though Text two speach is a grape gnu technology it is not red E for the main stream yet.

Re:Summary of the article by Anonymous Coward · 2004-12-09 06:11 · Score: 0

That's because your computer doesn't know how to wreck a nice beach.
Re:Summary of the article by Anonymous Coward · 2004-12-09 06:29 · Score: 0

Fatal error: Call to undefined function: message_die() in /home/httpd/vhosts/pocketpcaddict.com/httpdocs/db/ db.php on line 88
Re:Summary of the article by jayhawk88 · 2004-12-09 06:41 · Score: 3, Funny

Damn, is it 1993 again already? Quick, someone call Scott McNealy up and tell him we're all ready for the thin-client PC, this time we mean it, no really!
Re:Summary of the article by the+grace+of+R'hllor · 2004-12-09 06:45 · Score: 4, Insightful

All though Text two speach is a grape gnu technology it is not red E for the main stream yet.

Now I see the master plan. Rather than improving speech-to-text technologies, the focus is on destroying basic literacy. Apart from the word 'grape', I parsed that sentence without even having to mentally translate it.
Based on current experience, I'd say speech-to-text is going to be a growing market :-(
Re:Summary of the article by Bohemoth2 · 2004-12-09 08:05 · Score: 1

Great link man! I have it bookmarked and wish i had mod points to give.
Re:Summary of the article by ReelOddeeo · 2004-12-09 08:14 · Score: 1

Did you pay sails tacks on that? Did you get a re-seat? Was it all just an op tickle illusion?

--

Those who would give up liberty in exchange for security and DRM should switch to Microsoft Palladium!
Re:Summary of the article by Anonymous Coward · 2004-12-09 08:19 · Score: 0

Speech to text is even further behind...
Re:Summary of the article by Anonymous Coward · 2004-12-09 09:03 · Score: 0

Yeah, you all laugh, but...

Monday comes around once a week
January comes around once a year

1993's got to come around again sometime!
Re:Summary of the article by Anonymous Coward · 2004-12-09 09:37 · Score: 1, Interesting

Aoccdrnig to rscheearch at an Elingsh uinervtisy, it deosn't mttaer in waht oredr the ltteers in a wrod are, olny taht the frist and lsat ltteres are at the rghit pcleas. The rset can be a toatl mses and you can sitll raed it wouthit a porbelm. Tihs is bcuseae we do not raed ervey lteter by ilstef, but the wrod as a wlohe
Re:Summary of the article by fcw · 2004-12-09 12:43 · Score: 1

I think it deserves a pullet surprise.
Re:Summary of the article by Anonymous Coward · 2004-12-17 06:42 · Score: 0

You're thinking of Larry Ellison.

Slashdot - when facts don't matter, and conjecture is proof.

Dohh! by nherc · 2004-12-09 06:06 · Score: 1

Two comments and the link is dead! lol

--
'He was a dreamer, a thinker, a speculative philosopher... or, as his wife would have it, an idiot.' - Douglas Adams

Re:Dohh! by Anonymous Coward · 2004-12-09 06:19 · Score: 0

In Korea, only old people talk.
Re:Dohh! by Anonymous Coward · 2004-12-09 08:38 · Score: 0

YOU MUST BE NEW HERE!!!one!1one!!one1!one!oneoneone

It screws up with my accent by Janitha · 2004-12-09 06:06 · Score: 2, Interesting

My Accent screws up everything. I hate my Accent.

Re:It screws up with my accent by Anonymous Coward · 2004-12-09 06:15 · Score: 0

Juice Lee send two me four enact scent.
Re:It screws up with my accent by Anonymous Coward · 2004-12-09 06:16 · Score: 0

Well, Hyundai is known for the poor speech recognition integrated in its cars, especially the Accent.

Hooray for unnecessary capitalizations!
Re:It screws up with my accent by adeydas · 2004-12-09 06:27 · Score: 1

same here... :( don't they have softwares to understand other accents too?
Re:It screws up with my accent by sjwaste · 2004-12-09 06:43 · Score: 1

Yeah, these things definitely don't understand those of us who speak New Jersey.
Re:It screws up with my accent by Anonymous Coward · 2004-12-09 06:44 · Score: 0

Really? I love my Elantra.

My problem by teiresias · 2004-12-09 06:06 · Score: 5, Funny

The problem I always found with uuhhhhh voice writing was mmmmm filtering out unwanted noises and shhhhh distractions from my posts period return But I uhh guess they've fixed most of those burp problems by now right question mark

--
-Teiresias

Re:My problem by the_2nd_coming · 2004-12-09 06:08 · Score: 1

try using a head set.

--

I am the Alpha and the Omega-3
Re:My problem by Jeff+DeMaagd · 2004-12-09 06:12 · Score: 1

You suggested getting a headset, so the computer can still get a clear feed on the user's belches, uhhhs and ummms?
Re:My problem by the_2nd_coming · 2004-12-09 06:30 · Score: 1

he was complaining about ambient noise.

and if he does not like the ahhs and ums, then perhaps he should take his hand off the signal key to tell the computer to not listen.

--

I am the Alpha and the Omega-3
Re:My problem by superpulpsicle · 2004-12-09 07:18 · Score: 1

Don't mention it, I tried Dragon Naturally speaking way back.

It interpreted "Open Explorer" to "Moment is poor".
Re:My problem by Anonymous Coward · 2004-12-09 07:34 · Score: 0

It interpreted "Open Explorer" to "Moment is poor".
...and damn right it was!
Re:My problem by silverfuck · 2004-12-09 08:04 · Score: 4, Interesting

Exactly. Also, when will these programs start filtering for what the computer is putting out of the speakers (is there a sensible way to do this, to compensate for manual volume controls that the PC doesn't know about/control?)? I nearly always have some music on in the background...

--
You know you've been IMing too long when you almost say 'lol' out loud to a non-geeky friend...
Re:My problem by legirons · 2004-12-09 08:24 · Score: 1

"The problem I always found with uuhhhhh voice writing was mmmmm filtering out unwanted noises and shhhhh distractions"

Yeah, that TRAIN can be ARRIVING annoying AT when PLATFORM you're ONE trying IS to THE use 14:45 speech-recognition in TO SALISBURY public, can't it?
Re:My problem by Anonymous Coward · 2004-12-09 09:00 · Score: 0

OK. Next excuse for not-ready-for-prime-time performance.
Re:My problem by jg_elliott · 2004-12-09 11:34 · Score: 1

One solution would be for the computer to look at the sound output of the computer and take that signal away from the input from the microphone. Although, this wouldn't help things like your dog barking in the back ground, or if your hi-fi plays your music instead of your pc.
Maybe the best solution is just to get a decent microphone that cancels out a lot of back ground noise?
Several years ago on tomorow's world I saw a microphone which worked on vibrations of the jaw for environments like concerts so that the security people could still communicate with each other. Something like that could be ideal.
Re:My problem by Anonymous Coward · 2004-12-09 16:19 · Score: 0

Or even using [some company's] stereo microphones that uses software to cancel out background noises. As long as you are in the cone of the overlapping microphone areas, it'll get recorded.

Of course, now you'll have to play it back into the dictation program or something.
Re:My problem by Anonymous Coward · 2004-12-10 00:32 · Score: 0

The problems I found were

(a) Annoying other people with constant speech.
(b) Getting a sore throat.
(c) Having to use overdeliberate pronounciation.
(Normally I stammer).

I suppose it could be good training for my stammer, but some days the stammer is more severe and it is hard work.

Shoot and Voice Buddy are handy add ons for games (less speech required) especially if you have multiple flight sims.

Does "DAMM WINDOWS" count as talking? by Anonymous Coward · 2004-12-09 06:06 · Score: 0

If so I've been talking to my machine for more then a decade.

Re:Does "DAMM WINDOWS" count as talking? by Anonymous Coward · 2004-12-09 06:31 · Score: 0

My machine talks to me. Should I answer it?

Spoken programming languages by The+G · 2004-12-09 06:07 · Score: 3, Interesting

Is anyone out there giving any thought to how a programming language should be structured to make it easy to code using a speech recognition engine?

If not, why not?

Re:Spoken programming languages by Anonymous Coward · 2004-12-09 06:10 · Score: 0

Your asking us to answer one of your homework problems, aren't you?
Re:Spoken programming languages by vidnet · 2004-12-09 06:15 · Score: 4, Funny

Like an intern, it should accept phrases like "code that %&#@% interface by tomorrow you @#$%@ piece of *&^$&!!!"
Hey, maybe you could base it on perl!
Re:Spoken programming languages by nacturation · 2004-12-09 06:21 · Score: 1

All text and numbers, no punctuation or symbols at all. Hey, isn't that Cobol?

--
Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
Re:Spoken programming languages by Coffee · 2004-12-09 06:30 · Score: 1, Insightful

Has anyone given any thought as to how a programming language should be structured so that it would be easy to code by waving things with particular smells in front of the computer? If not, why not?

Seriously, this is not a design goal for programming languages. Programming languages are meant to a) be not ambiguous, and b) match or impose a way of thinking. Neither of these map particularly well onto spoken languages, because the bloody syntax (to keep it nonambiguous) gets in the way.

This is also why Edsger Dijkstra felt that natural programming was not a good idea.
Re:Spoken programming languages by Anonymous Coward · 2004-12-09 06:34 · Score: 0, Flamebait

Because no one else is a moronic karma-whore like you.
Re:Spoken programming languages by s4m7 · 2004-12-09 06:46 · Score: 4, Funny

"code that %&#@% interface by tomorrow you @#$%@ piece of *&^$&!!!"

Boss, is that you?

--
This comment is fully compliant with RFC 527.
Re:Spoken programming languages by stuffman64 · 2004-12-09 06:49 · Score: 1

Well, why not use this plugin for Dragon NaturallySpeaking? Of course, you'd be condemned to Visual Studio... but of course, if you were paralysed and couldn't type, I'm sure this would be great if you wanted to write programs.

--
--- At my sig, unleash hell.
Re:Spoken programming languages by hackstraw · 2004-12-09 06:59 · Score: 1

Is anyone out there giving any thought to how a programming language should be structured to make it easy to code using a speech recognition engine?

Wow. Never thought of that. (f)lex for spoken language.

Scary.

Soon we will have yalc. Where l = language and c = compiler, so we will have "yet another language compiler" like we have yacc.
Re:Spoken programming languages by gstoddart · 2004-12-09 07:11 · Score: 1

"code that %&#@% interface by tomorrow you @#$%@ piece of *&^$&!!!"

Boss, is that you?

Ashok?! Get back to work!

--
Lost at C:>. Found at C.
Re:Spoken programming languages by thelenm · 2004-12-09 07:19 · Score: 1

I don't know of any work in making programming languages amenable to speech recognition. Not to say there is none, I just don't know about it. But I do know of a prototype system for writing Java code using spoken language. It's called NaturalJava.

--
Use Ctrl-C instead of ESC in Vim!
Re:Spoken programming languages by myowntrueself · 2004-12-09 07:20 · Score: 1

"Hey, maybe you could base it on perl!"

Actually, I thought that sample looked more like a mix of APL and COBOL...

--
In the free world the media isn't government run; the government is media run.
Re:Spoken programming languages by SnapShot · 2004-12-09 07:24 · Score: 1

I've always been more interested in getting my computer to read to me.

From looking at the Festival ( http://www.cstr.ed.ac.uk/projects/festival/ ) documentation it seems that this limiting factor isn't difficulty of programming but the cost and time required to record all of the required diphones (?) and build the required lexicon.

It seems that we (as a profession) have given up trying to synthesize voice so that it sounds real and depend upon recorded speech fragments instead. Is this the end of my dream of having my source code recited back to me in Sophie Marceau's voice?

--
Waltz, nymph, for quick jigs vex Bud.
Re:Spoken programming languages by m50d · 2004-12-09 07:29 · Score: 1

The loglan idea was to have a human language that would be easy to process for computers (based on predicate logic) and easy to interpret from speech, but so far no computer I know of can understand it.

--
I am trolling
Re:Spoken programming languages by alphadingo · 2004-12-09 07:30 · Score: 1

Is anyone out there giving any thought to how a programming language should be structured to make it easy to code using a speech recognition engine.
About 40 years ago they tried to create a language for people with a speech impediment. I believe it was called Lisp :-)
Re:Spoken programming languages by Anonymous Coward · 2004-12-09 07:48 · Score: 0

A better goal is to make an editor or an IDE that lets you write code through speech, so instead of saying "if open parenthesis my-capital-F-ile period is-capital-O-pen open parenthesis close parenthesis close parenthesis space open brace return" you'd say "if my file method call is open then" and it'll figure out what to do, going as far as matching existing variable and function names to what you say.
Re:Spoken programming languages by good-n-nappy · 2004-12-09 08:11 · Score: 1

If not, why not?

While this might be a last resort for the disabled - it's probably a bad idea otherwise. Speech is not the same as typing/writing. Speech uses a part of your brain that interferes with other kinds of reasoning. Please, let's not let yet another generation of engineers waste their time on this.

--
Never underestimate the power of fiber.
Re:Spoken programming languages by Waffle+Iron · 2004-12-09 08:20 · Score: 1

Is anyone out there giving any thought to how a programming language should be structured to make it easy to code using a speech recognition engine?
I can just imagine a printout at a code review:
125 ok... make a function that takes an array 126 of nodes and makes a hash from the node 127 names... umm, ok, not the tree data nodes, 128 I mean the index nodes. oh, and make sure to 129 use a multihash, not a normal hash. good. Now, 130 the thing is we need to make sure the hash is 131 always consistent with the master index, so 132 put a mutex around the operation... not there, 133 it needs to go over the whole function. /* Oh 134 Hey, Joe! How was the hot date last night? 135 Hahaha! Sounds Great! */ Ummm ok, where was 136 I? Oh yeah... mutex... that looks ok. 137 Oh yeah, you need to return the number 138 of nodes that changed. 139 Thanks.
Re:Spoken programming languages by legirons · 2004-12-09 08:27 · Score: 1

OK, I've used speech-recognition system for a "proper" project, and it's barely adequate for replacing push-buttons (for pilots with both hands on the controls, etc.) so trying to program in it would be a frustrating experience, right until you got a sore throat after the first 20 minutes...
Re:Spoken programming languages by legirons · 2004-12-09 08:33 · Score: 1

"code that %&#@% interface by tomorrow you @#$%@ piece of *&^$&!!!"

The Perl compiler says you missed out a quote...
Re:Spoken programming languages by Simonetta · 2004-12-09 08:39 · Score: 1

I don't know about the programming language itself but this would be great for documentation.
Speech to text is big but necessary step to creating a universal translator.
Don't get discouraged. Keep trying to make it work. It is important.
Talking to a machine and expecting a serious response is probably the most unnatural thing that a human can do.

Speech to text is the emblem of the new 21st century information age in the way that the atomic bomb was the emblem of the Super Power - Cold War age of the 20th century.
Re:Spoken programming languages by bill_kress · 2004-12-09 09:07 · Score: 1

What is important is the ability to calculate the next symbol from a limited set of symbols as the code is being generated.

I was thinking along those lines for a development tool for a palm pilot where a list of possible symbols was always presented in a list on the right (preferably one that didn't have to scroll) so you could code an entire program by single-clicks.

This might require restructuring the way variable names and packages work in most languages--at least a little.

Also quite a few programming constructs are difficult to pronounce distinctly (no matter how you abbreviate it, parentheses will get annoying if your language is going to be c or lisp based).

Finally, the ultimate voice interface would be headless, (a watch or something) and it would take a savant to be able to track more than a trivial few lines of code in his head.

Now that I think about it--you might have to create a new language targeted at writing short simple routines.

If I remember correctly, APL can do an awful lot with a very small amount of code, and the code is often considered write-only or throw-away (Very difficult to read).

I'm not saying something based on APL would work, but if I were to define the requirements for a spoken-programming language it would be that it needs to do a lot with a small amount of code and the code could be write-only.
Re:Spoken programming languages by Skjellifetti · 2004-12-10 08:58 · Score: 1

Speech uses a part of your brain that interferes with other kinds of reasoning.

Ah, that explains why politicians all look like statesmen until they open their mouths.

--
FreeSpeech.org

Voice recognition is great by mistersooreams · 2004-12-09 06:07 · Score: 3, Funny

It walks just fin four mee!

--
apterous.org

Re:Voice recognition is great by Lord+Kano · 2004-12-09 06:33 · Score: 1, Funny

Eye thought that eye was a loan.

LK

--
"Hi. This is my friend, Jack Shit, and you don't know him." - Lord Kano
Re:Voice recognition is great by SunPin · 2004-12-09 07:46 · Score: 1

Lord Kano's response was funny. Your post is not.

Errors in voice dictation result in the _wrong_ word. There are no "illiterate fsck" misspellings unless you actually program them into the vocabulary.

--
Laws are for people with no friends.
Re:Voice recognition is great by JavaRob · 2004-12-13 17:09 · Score: 1

It walks just fin four mee!

Errors in voice dictation result in the _wrong_ word. There are no "illiterate fsck" misspellings unless you actually program them into the vocabulary.

Oh, get over it. An extra "e" on one word. Can't you notice that the rest was funny, and ignore the typo? Besides, "mee" actually *would* be added to the dictionary is some places. It's a word I know, because it means "noodles" in Malay (taken from Mandarin Chinese, I think), and the word is used a lot in English conversation thereabouts. My wife has it added to her Word dictionary.

Voice Recognition by Boronx · 2004-12-09 06:07 · Score: 1

In Japan, talking PCs are for schoolgirls.

--
Play Command HQ online

Re:Voice Recognition by frankvl · 2004-12-09 06:39 · Score: 1

What happened to the old people?
Re:Voice Recognition by Fnkmaster · 2004-12-09 07:03 · Score: 1

And in Korea, old people talk to their PCs.

Are You Talking to Your PC Yet? by glsunder · 2004-12-09 06:07 · Score: 5, Funny

I've been talking to my PC for years:

You god damned son of a bitch! F'n Piece of shit!

Re:Are You Talking to Your PC Yet? by nordicfrost · 2004-12-09 06:19 · Score: 0, Offtopic

I've been talking to my PC for years:

You god damned son of a bitch! F'n Piece of shit!

I said the same thing to my PC when I was running Windows. Then I formatted the harddrive and installed Debian. Then the dialouge became like: "Hey, buddy! Nice work there on the nmap job! Let's go have a pr0n surf for a while. Wanna beer?"

Then I bought a Powerbook and now it's like: "Ooooh.... Yeeeeahh... You're so sexy... (bedroom voice) Let me see you prase that file for me... Can I touch your curved aluminium body? Let me just plug in my iPod somewhere I know you'll approve of it."

Strange transformation that.
Re:Are You Talking to Your PC Yet? by UnknowingFool · 2004-12-09 06:26 · Score: 0, Offtopic

Me too. Especially after I got one with Windows ME. I talked to it and Bill Gates all the time
WTF??!! You piece of shit! Damn you too, Bill Gates! If you were here right now, I would shove this computer so far up your ass!

--
Well, there's spam egg sausage and spam, that's not got much spam in it.
Re:Are You Talking to Your PC Yet? by drinkypoo · 2004-12-09 06:29 · Score: 0

I started talking to my PC as soon as I clicked on a story and was taken to the IT section... "Oh, my eyes! My eyes! Curse you!"

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Re:Are You Talking to Your PC Yet? by kaustik · 2004-12-09 06:36 · Score: 1

So, you're basically saying that you went from a normal human being to a normal (lonely and pathetic) slashdot user, and finally devolved into a sick demented pervert?
Are you bragging?
Re:Are You Talking to Your PC Yet? by nordicfrost · 2004-12-09 06:40 · Score: 1

Well, yeah. More or less. I meanb, I have always been a sick and demented pervert but never got it out.

How 'bout you? Has your humour always been so bad even the Amish think your company sucks?
Re:Are You Talking to Your PC Yet? by Anonymous Coward · 2004-12-09 07:01 · Score: 0

I felt the same way, but turns out my PC just can't hold its liquor. After a single beer it was sputtering and sparking, and just generally died after the second. Anyone know where I can get a computer that can drink like a real man?
Re:Are You Talking to Your PC Yet? by virid · 2004-12-09 07:29 · Score: 1

You god damned son of a bitch! F'n Piece of shit!
Curious that you censor yourself on the "f-word" but not the other explatives...

--
"The world only exists in your eyes. You can make it as big or as small as you want." - F Scott Fitzgerald
Re:Are You Talking to Your PC Yet? by SirGeek · 2004-12-09 07:50 · Score: 1

've been talking to my PC for years:
You god damned son of a bitch! F'n Piece of shit!
And the software would translate part of that into Pizza Ship ( from when my wife tried the software a few years back ).

--
UPS Sucks

Fun with Macros by BlueCup · 2004-12-09 06:07 · Score: 2, Insightful

My brother and I work at a company making efficiency programs... for awhile we toyed with the idea of having all of the programs activated by voice... we tested it out for awhile with an open source cantation originally used for games, that would execute a command, or type text based on what you said... for a while, it was awesome, every time we said something, it'd find the word from our list, and activate the program... problem was, when it listened to your voice, it only compared it to the words you had programs assigned to... so if you had four words, no problem, but if you had 60, it started choosing horribly... we eventually had to scrap the program all together... though it was funny watching what programs it would have to run through when I started cursing in frustration... I'm pretty sure the annoyance of people talking to their computers all over the building would have caused problems as well.

--
WANNAWIKI Wannawiki WannaWiki WANNAWIKI!

Re:Fun with Macros by RPI+Geek · 2004-12-09 06:22 · Score: 1

My dad got one of those really primitive speech recognition software programs like 10 years ago, one of the ones that only recognizes volume changes. It worked well for about 20 minutes, and then I coughed into the mic. It opened the calculator, we both laughed about it, and stopped using the software.

--

- "Nobody came out that night, not one was ever seen. But Old Man Stauf is waiting there, crazy sick and mean!"
Re:Fun with Macros by 99BottlesOfBeerInMyF · 2004-12-09 06:29 · Score: 1

though it was funny watching what programs it would have to run through when I started cursing in frustration

Years ago Apple implemented voice recognition built into OS9. The peak for me was when I managed to get it to connect the internet, download my e-mail, and read the messages to me, without having to get out of bed. Unfortunately, the system suffered from just the problem you are mentioning. If it did not understand what you said it would assume you meant "open netscape." Do you know how long it took to open netscape on a 66 Mhz box with 4 meg of RAM? It took way too long.
Re:Fun with Macros by 99BottlesOfBeerInMyF · 2004-12-09 06:32 · Score: 1

Oops, I meant MacOS 7, not 9.
Re:Fun with Macros by drinkypoo · 2004-12-09 06:35 · Score: 2, Interesting

The Macintosh has had this since the Quadra, and it has just the same problem. You drop aliases (or whatever) into your speakable items folder and it matches them to your speech. Problem is, when you get fifteen or twenty things in there (maybe more on a powermac, but I'm talking the quadra experience here) it frequently would match things that made no sense whatsoever.

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Re:Fun with Macros by Anonymous Coward · 2004-12-09 06:41 · Score: 1, Interesting

Some of my fellow classmates at Rice University and I are in the process of finishing up some Digital Signals projects.

One group is trying to do a voice conversion - one person says something and you change it to another person's voice. They're having fair success.... for the phrases they've already recorded.

My group recognizes an instrument playing within a larger context; it's working so long as the context isn't _too_ large. (i.e. picking out a clarinet in the recording of an orchestra).

What we've learned from this... If you match filtered your voice with the samples you saved on your computer, the matching works _extremely_ well (with well-chosen threshold values). We test with on the order of 40 notes and it picks each one out very, very cleanly.

You might be able to do something similar with your voice and the commands on the computer.

Michael Lawrence
Re:Fun with Macros by Anonymous Coward · 2004-12-09 06:51 · Score: 0

What speech recognition api/engine were you using?
Re:Fun with Macros by Anonymous Coward · 2004-12-09 06:58 · Score: 0

We wrote our own code in matlab. To implement matched filters, you fft (fast fourier transform) two signals (we recorded the signals, then load them into Matlab; so this isn't a real-time system yet.... we're looking into making it one), then multiply them in the frequency domain. After inverse ffting them, you get out a signal with large peaks where it recognizes the similarities.

Thus, if you take all the samples and run it through the algorithm and find the one that matches best (has the highest peak), it's your best guess for the word spoken. We take noise into account with various thresholds... and since we're doing this with notes, we have to ignore higher harmonics... so we basically test the sample to find out which note is most likely and then run all our samples of that note against the inputted waveform.

This is equivalent to convolving the two signals in the time domain.

Michael Lawrence
Re:Fun with Macros by pragma_x · 2004-12-09 07:30 · Score: 1

One question: why not have your software expect the vocal equivalent of a command prompt, like saying "Computer!" ala Star Trek?

Seems to me that would eliminate a lot of the trouble.

Link Error by mordors9 · 2004-12-09 06:07 · Score: 1

Fatal error: Call to undefined function: message_die() in /home/httpd/vhosts/pocketpcaddict.com/httpdocs/db/ db.php on line 88

Not Used by reddigitaldragon · 2004-12-09 06:07 · Score: 1

Several campus administrators at the high school I work for barked for Dragon Naturally Speaking. I have yet to see/hear any of them use it.

Re:Not Used by Anonymous Coward · 2004-12-09 06:35 · Score: 0

open slash ee tee see slash shadow
mailed /etc/shadow successfully to users@localhost

Star Trek Encyclopedia by BitwiseX · 2004-12-09 06:07 · Score: 0

I remember way back when, I could talk to my Star Trek Encylopedia. I was actually real disappointed that I didn't have to say "Computer." before every command though. If I find an old Apple mouse, I think I'll wire up a mouse into the bottom of it. "Hello, Computer...." "Just use the keyboard!"

Can't RTFA by Froboz23 · 2004-12-09 06:07 · Score: 1

Only 3 comments, and the article is already hosed.
Mirror anyone?

Fatal error: Call to undefined function: message_die() in /home/httpd/vhosts/pocketpcaddict.com/httpdocs/db/ db.php on line 88

--
Take off every Sig. For great justice.

Re:Can't RTFA by Anonymous Coward · 2004-12-09 06:15 · Score: 2, Informative

Mirror

Re:Article text by Anonymous Coward · 2004-12-09 06:07 · Score: 0

karma whore mother fucker. use ac next time. my next mod points are going towards makin ur comments' scores below 0. have a nice day now.

Note to self by narsiman · 2004-12-09 06:07 · Score: 3, Funny

Do not run webservers on PocketPCs even if you are an addict

Re:Note to self by scupper · 2004-12-09 06:11 · Score: 1

how about a cluster of pocket pcs?
Re:Note to self by RangerRick98 · 2004-12-09 06:17 · Score: 2, Funny

No no, you never actually make a beowulf cluster of things; you just imagine beowulf clusters of things. :)

--
"You're older than you've ever been, and now you're even older."
Re:Note to self by CanadianCrackPot · 2004-12-09 12:06 · Score: 1

Helllfire beeg in dick tate shone captial W captial T capital F is ah bay o wolf clustor of piss aunt systems going to do question mark submit post shut up bee ouch I'm slash dotting not mad

--
Good programmers drink beer to relieve job stress.
Great programmers drink hard liquor and work best hungover.

Via Voice is excellent by SpooForBrains · 2004-12-09 06:07 · Score: 1

... for attempting to dictate message board posts for humerous effect. Gave me many hours of amusement. Plus I got a free mic which I now use with Skype :)

--
"The dew has clearly fallen with a particularly sickening thud this morning"

I think it would be cool... by greatscot · 2004-12-09 06:08 · Score: 0

... to be able to sit and talk to my computer instead of typing, since my typing is so bad.
Does anyone know if there is this type of app for Linux? I would even be willing to pay a reasonable price.

--
Registered Linux User
Registered KDE User

all the time by millahtime · 2004-12-09 06:08 · Score: 1

I talk to my PC all the time...if you consider swearing at it and yelling profanity at it talking to it.

--
Evolution or ID?

That looks promising by FearTheFrail · 2004-12-09 06:08 · Score: 2, Funny

And mine works just fine. Submit. Submit. I said submit. Why isn't this expletivedeleted thing triggering the submit button. Submit! Submit! Damn it, I have to move the mouse.

--
___ In the words of Gen. Douglas McArthur: "I'll be right back."

From The Onion by Anonymous Coward · 2004-12-09 06:09 · Score: 0

Voice Recognition Software Yelled At

NEW YORK--Fidelity Financial Services' Gwen Watson, 33, shouted angrily at her IBM ViaVoice Pro USB voice-recognition software, sources close to the human-resources administrator reported Monday. "No, not Gary Friedman! Barry Friedman, you stupid computer. BARRY!" Watson was heard to scream from her cubicle. "Jesus Christ, I could've typed it in a hundredth of the time." After another minute of yelling, Watson was further incensed upon looking at her screen, which read, "Barely Freedman you God ram plucking pizza ship.

Huh? What? by PornMaster · 2004-12-09 06:09 · Score: 1

My computer would be asking me to repeat anything I tried to communicate by yelling over its own leafblower noise levels.

--
500GB of disk, 5TB of transfer, $5.95/mo

Heh- by IWantMoreSpamPlease · 2004-12-09 06:09 · Score: 1

I recall back in the early 80s I was in a Singer shop (as in sewing machines) and they sold IBM PCs as well.... ...including speech to text recognition software.

I tried it out, and surprise! it didn't work very well.

I see nothing has changed.

--
So rise up, all ye lost ones, as one, we'll claw the clouds.

Re:Heh- by Anonymous Coward · 2004-12-09 06:29 · Score: 0

It's an old association. Singer and IBM both made M-1 carbines for Uncle Sam during WWII.

John Udell has a nice review by aemain · 2004-12-09 06:10 · Score: 1

http://weblog.infoworld.com/udell/2004/11/04.html

You get to watch/listen to him use it, which really gives you a sense of how far the software has come.

the article: by virtualone · 2004-12-09 06:10 · Score: 0

if you want to read TFA, i was lucky i yould grab it early enough.
it reads as follows:

Fatal error: Call to undefined function: message_die() in /home/httpd/vhosts/pocketpcaddict.com/httpdocs/db/ db.php on line 88

--
Only morons moderate based on a sig.

While it might be Interesting, it's more Funny ... by Anonymous Coward · 2004-12-09 06:10 · Score: 0

.. though.

Re:Article text by Rei · 2004-12-09 06:10 · Score: 1

What would you do if your enter key actually worked? Imagine the possibilities!

--
The human body can be drained of blood in 8.6 seconds given adequate vacuuming systems.

Re:Worst Server... by mirko · 2004-12-09 06:11 · Score: 1

The question is whether you talking to YOUR PC yet, not whether you are shouting at THEIR server now ;-)

--
Trolling using another account since 2005.

I WAS getting excited... by rufo · 2004-12-09 06:11 · Score: 1

...until I noticed that the PocketPC version is just a delayed dictation device - it records, then you transfer it to your desktop computer and it's the host computer that actually does all the speech recognition.

No wireless. Less space then a Nomad. Lame.

--
My English teacher once told me that two positives don't make a negative. Two words for her: Yeah, right.

MOD PAIR RENT UP by Anonymous Coward · 2004-12-09 06:11 · Score: 2, Funny

bracket en slash tee close bracket

Re:Article text by Anonymous Coward · 2004-12-09 06:11 · Score: 0

"Enter" key is your best friend.

Re:I tried with my Mac by turtledot · 2004-12-09 06:11 · Score: 1

The highest rated commercial program for the Mac is iListen (not produced by Apple :-)) A visually handicapped friend is very interested in obtained speech to text software - any slash dotters have experience with the iListen or ViaVoice for the Mac?

Tried that by gr8_phk · 2004-12-09 06:12 · Score: 1

I tried Dragon several years ago. It worked, but you really need accuracy to the nines (99.99%) to be productive with it. One mistake in 100 sylables means constant corrections. I did make a little flying demo that took english commands (right, left, up, down, slower, faster) and it was cool to control it via voice commands. There was no distinction between typing commands and speaking them though. I would recommend (if they don't have it already) the Gnome and KDE folks provide a seperate input stream for voice commands to all applications - or something. If it's there, people will code for it even if the free recognition software isn't that great yet. If there are apps that support it, people will improve the recognition software.

Re:Tried that by CoolVibe · 2004-12-09 06:36 · Score: 1

With KDE it's already a possibility with DCOP and all. Every KDE app's function is accessible by it. Dunno about GNOME though, I never use that :)

It helps by baggins2002 · 2004-12-09 06:13 · Score: 1

I have a relative that has severe dyslexia. This program has been instrumental in getting him through college. He's going through a prestigous private business school and is doing well, 3.0+ and his mom says that without that program it probably wouldn't have happened.
I tried to use a version about 5 years ago and found it somewhat frustrating, but the best of breed. I heard that it had improved quite a bit and recommended it.

Re:It helps by Opie812 · 2004-12-09 07:52 · Score: 0

He's going through a prestigous private business school and is doing well, 3.0+...

Of course considering he has severe dyslexia his actual gpa is 0.3

--
I'm not a nerd. Nerds are smart.

Am I talking to my PC yet? by GillBates0 · 2004-12-09 06:14 · Score: 1

No...not yet. It's too early, and I need some time to myself right now.

It was her fault though...crapping out on me like that when I was just past Level 5 in digdug.exe.

And just when I was going to get her a shiny new Windows 3.11 for Christmas too. It sure is a pity. It'll be a while before I'm ready for another relationship.

--
An Indian-American Hindu committed to non-violent thought/speech/action alarmed by the global explosion of radical Islam

Re:Huh? What? by Jeff+DeMaagd · 2004-12-09 06:14 · Score: 1

Maybe you should fix the computer, first by removing the leafblower and fix the reason that it was put there in the first place?

Re:Article text by mmkkbb · 2004-12-09 06:14 · Score: 1

his enter key does work, he probably forgot to specify the correct formatting option for the post.

--
-mkb

is it perfect yet? by miseryinmotion · 2004-12-09 06:14 · Score: 1

As much as I love progress, I'd much rather type than have speech to text capabilities. Why? for the same reason I would rather type than speak on the phone: Someone down the hall can't hear you.

Unless speech recognition tech has become perfect, then there is a distinct possibility that someone else is going to hear you repeating the same word over and over again with increasing volume and anger, in a futile attempt to get the software to recognize a troublesome word.

Re:is it perfect yet? by scupper · 2004-12-09 06:23 · Score: 1

Why? for the same reason I would rather type than speak on the phone: Someone down the hall can't hear you. Absolutely. I wouldn't want to give up my privacy, what little there is. It's not that there are any state secrets I'm passing, but some discussions aren't for public broadcast.

Why I've never liked speech to text by Rude+Turnip · 2004-12-09 06:15 · Score: 5, Interesting

1. It's awkward to talk when you're trying to compose something that requires a lot of thought first. I usually like to talk to myself (either out-loud or in my head) and type out what I'm thinking in a more formal fashion.

2. It is very tedious to go back and edit or make corrections. If I make an error while typing, I'm cognizant of the error very soon after it happens. With voice recognition, techincally "someone else" is typing and it takes more time to see where the mistakes were made.

3. I deal with lots of boilerplate text with original content intermingled. A lot of times working on such a text becomes an editing process where using the keyboard & mouse is more efficient.

4. My voice doesn't last for much longer than 30 minutes for non-stop speaking...and that's with short breaks for water.

Conclusion: Just hire a hot secretary that can type.

--
Bill Clinton: Pimp we can believe in. - The Shirt!!!

Re:Why I've never liked speech to text by smooth+wombat · 2004-12-09 06:34 · Score: 1

Conclusion: Just hire a hot secretary that can type.

Only type?

--
We will bankrupt ourselves in the vain search for absolute security. -- Dwight D. Eisenhower
Re:Why I've never liked speech to text by Anonymous Coward · 2004-12-09 06:44 · Score: 0

I'm morally offended by that statement!

---
Vote W
Re:Why I've never liked speech to text by harrkev · 2004-12-09 06:51 · Score: 1

Well, this is your solution. Obviously, you can type (along with, I would guess, 99.99% of /.). You also can't talk for very long.

I would imagine that there are some people out there who can talk for hours on end, but can't type.

--
"-1 Troll" is the apparently the same as "-1 I disagree with you."
Re:Why I've never liked speech to text by Blakey+Rat · 2004-12-09 07:46 · Score: 1

Yeah, my problem is that IBM ViaVoice doesn't work in the newest MacOS X version and there's no patch available I can find. What a fucking ripoff.

--
Comment of the year
Re:Why I've never liked speech to text by iabervon · 2004-12-09 08:29 · Score: 1

I think it would be helpful to have it record when you talk to yourself, so that, instead of retyping it the way you settle on, you can just rearrange the text that's already available. I think a system where you would generate text from speech in a temporary buffer while you control the program and edit the text with the keyboard would be good. Also, it would be nice to be able to speak while offline and then edit the text into your document while sitting at the computer. This would be really nice for taking notes on papers in a cafe.
Re:Why I've never liked speech to text by Anonymous Coward · 2004-12-09 08:39 · Score: 1, Funny

Yeah, it would be really cool if she kicks ass in Halo 2 so I could have a good wingman on co-op games!

That's what you meant, right?
Re:Why I've never liked speech to text by Vitriolix · 2004-12-09 08:39 · Score: 1

i agree... i've tried several times to use dragon naturally speaking, and while it has gotten a hell of a lot better in recent releases, i think that these days the biggest block is myself. i suppose i could get use to a dictation style of speaking though, lots of people are used to using tape recorders for that.

--

tasty electronic music vittles
Re:Why I've never liked speech to text by JR · 2004-12-09 08:57 · Score: 1

Already available, and essentially a good usage of the reviewed software. Carry the Pocket PC with the recognition software on it and it will record your speach and recognize it. Later, insert that recognized speach into another document. In some cases (Dictaphone?), they have a docking station to recieve a digital voice recorder that retrieves the recorded speach and recognizes it.
Re:Why I've never liked speech to text by Anonymous Coward · 2004-12-09 09:06 · Score: 0

see that's why I keep on coming back to /. - where else would I get my daily dose of fossilised 1960's style misogyny?
</sarcasm>
Re:Why I've never liked speech to text by Anonymous Coward · 2004-12-09 17:44 · Score: 0

where else would I get my daily dose of fossilised 1960's style misogyny?

According to my dictionary, misogyny is a hatred of women. It is your own sexist assumption that a secretary must be a woman, and even if it were appropriate in this case, how is finding a woman attractive equivalent to hatred? If anyone is mentally messed up about gender roles, it's you, buddy!
Re:Why I've never liked speech to text by Blackjax · 2004-12-10 02:16 · Score: 1

Good point, oral skills are important too...after all, I think everyone appreciates a cunning linguist.

I've been swearing at it for years... by bADlOGIN · 2004-12-09 06:16 · Score: 3, Funny

I'll be happy when someone codes a DWIM method (Do What I mean):)

--
*** Sigs are a stupid waste of bandwidth.

Re:I've been swearing at it for years... by k31bang · 2004-12-09 07:30 · Score: 1

Once they code the DWIM method, should they intergrate it with Teledildonics ? :-)

--
-+-=-+-=-+-=-+-=-+-=-+ *** http://www.mountainfort.com *** +-=-+-=-+-=-+-=-+-=-+-

Love that Dilbert Cartoon... by turtledot · 2004-12-09 06:16 · Score: 1

where he's working with speech recog and someone maliciously says over his shoulder " format hard drive "

Laptop Keyboards by BicycloHexane · 2004-12-09 06:16 · Score: 0

I would take the speech to text over those darn laptop keyboards with the mouse pad perfect placed so you brush over it all the time when your typing and erase the last paragraph you were writing.

Biggest problem by retinaburn · 2004-12-09 06:17 · Score: 1

Is I can never start the application:
microphone on
Microphone On
MICROPHONE ON GOD DAMMIT IT !!!

Re:Obligatory by Anonymous Coward · 2004-12-09 06:17 · Score: 0

but do they talk them, or us

Waiting, in vain, for technology to catch up by Anonymous Coward · 2004-12-09 06:18 · Score: 0

I actually find it annoying that the focus of NaturallySpeaking and ViaVoice is to allow users to speak to their computers. Witness how they require users to "train" the software for their particular speech patterns.

As someone who is hearing impaired I've been fervently awaiting the day when I can use speech recognition to transcribe meetings, big or small, or to help me answer the phone.

I'm worried that these companies have essentially given up on writing an app that doesn't require "training" the software, can identify different speakers in a room, can identify and adjust for accents, etc.

great if you have carpel tunnel. by Brigadier · 2004-12-09 06:18 · Score: 1

Though i've never used this personally. I had a co-worker who was strickened by carpel tunnel. We both worked in tech support at the time. In order to accomadate her she was allowed to use DNS. To be honest after training it ( which is the most tedious part) it worked quite well. To an auther or someone who types for aliving this is a great tool. The only other concern I've heard is it does require a reasonable amount of computing power.

Re:great if you have carpel tunnel. by kirkjobsluder · 2004-12-09 07:52 · Score: 1

Dragon NaturallySpeaking is really the only deal killer that keeps me from converting over to FreeBSD. About three years ago my bad typing habits caught up with me, and I have found the dictation is really the only thing that keeps me able to put out a couple thousand words a day. However, there are quite a few annoyances that comes with using speech dictation. Expect to plop down at least $50 for good headset. Also, my choice and editors is limited to notepad and Dragon pad. Editing is a pain in the ass, and correcting what it gets wrong takes quite a bit of time.

To be honest, I would not use it unless I had to. I have been typing since I was 8 years old, and that is still the most natural way for me to write.

As I was saying.... by kidoman · 2004-12-09 06:18 · Score: 1

I hv installed and re-installed Dragon('s) Naturally speaking and yet every time, my mating with the software dosent last much beyond the obligatory training session.... might be my dad howling behind me to go to bed (the software responds -- too much backgroud noise), but the logic coded into the piece of Dragon droppings (literally....) always has failed to impress.... might just be my asian accent.

--
~~bada bing, bada bang, bada bong and voila~~

I am so dirty. by Anonymous Coward · 2004-12-09 06:19 · Score: 0

> Are You Talking to Your PC Yet?

You don't ? I'm doing so since 25 years and all my diskdrives have hair clued around the holes :)

I tried Dragon Dictate... by eno2001 · 2004-12-09 06:19 · Score: 5, Interesting

...it worked OK as long as you trained it properly and you had a nice quite room and a good mic. However, there are issues with "voice typing" that can't be overlooked. Primary is security. If you want to type a document or e-mail that contains sensitive data, make damn sure that no one can hear you. My bank recently moved to a voice activated system. I'm surprised they haven't gotten a ton of complaints from people since it REQUIRES you to say your SS# and PIN out loud. This means I can no longer check my account from my cell phone or at work. If you sit down and think about how many things you type that you would never want to say out loud, you can see why voice typing hasn't taken off. Imagine this emanating from your cubicle in a monotone:

"http://www.goat.cx/ Take that you bukkake loving lunixtards."

Your co-workers would think you were a nutjob if they saw half of what you posted as AC to Slashdot. ;P

--
-"...bad old ideas look confusingly fresh when they are packaged as technology" - Jaron Lanier (Digital Maoism on Edge.o

Re:I tried Dragon Dictate... by Fnkmaster · 2004-12-09 06:59 · Score: 1

Your co-workers would think you were a nutjob if they saw half of what you posted as AC to Slashdot.

As AC? Shit man, most people would think I was nuts if they saw half of what I posted _with_ my username.
Re:I tried Dragon Dictate... by mutterc · 2004-12-09 07:14 · Score: 1

This bothers me as well - many systems that used to be IVR's ("Press 1 for foo, 2 for bar, ...") have went for voice recognition systems instead ("If you want something else, say 'other service'...).
For me, this means instead of choosing from a menu (which I don't mind), now I have to get the @#*%! machine to understand me. I know what is driving it (marketers have heard that people don't like touch-tone IVR's), but IMO they've created a system that's worse.
Better to have "press or say 1", that way people get to talk, the speech recognition works better for the more limited and well-tested vocabulary of numbers, and people like me who don't mind pressing buttons get less frustration.
Re:I tried Dragon Dictate... by bigpat · 2004-12-09 07:17 · Score: 1

"Your co-workers would think you were a nutjob if they saw half of what you posted as AC to Slashdot. ;P"

Your coworkers probably already consider you a nutjob.
Re:I tried Dragon Dictate... by poot_rootbeer · 2004-12-09 07:17 · Score: 1

Have you tried using your touch-tone pad when prompted to say your SSN and PIN? Your bank may very well be different, but every automated phone system I've used has allowed me to do either.

Granted, someone with a tap on your line would still be able to determine your numbers by analyzing the touch tones just as easily as from hearing you say them...
Re:I tried Dragon Dictate... by egghat · 2004-12-09 07:21 · Score: 1

Normally these systems should only ask for *parts* of you PIN, e.g. the second and third number.

If you listen long/often enough, you'll have all the numbers together, but still not in the right order. Safe enough for most applications.

Kind of offtopic though.

Bye egghat.

--
-- "As a human being I claim the right to be widely inconsistent", John Peel
Re:I tried Dragon Dictate... by Anonymous Coward · 2004-12-09 07:36 · Score: 0

If you were really all that worried about people getting your SSN or your PIN, you wouldn't enter them into the keypad of any phone (especially a cell or other cordless). These are so easy to pick out of the air.
Also, this fear of saying your information is based soley on the fact that you think someone actually cares to listen to you :)
Re:I tried Dragon Dictate... by Anonymous Coward · 2004-12-09 08:04 · Score: 0

A person with a trained ear can easily recognize a number typed into a touch-tone phone several feet away. Careful where you check your bank account whether it's with speech recognition or not.
Re:I tried Dragon Dictate... by bynary · 2004-12-09 08:10 · Score: 2, Funny

All right, here's why I have problems with voice-driven phone systems: I have young children. Here's a typical scenario in my house:

I call the bank to check on my balance...

"Thank you for calling Bank WeHaveYourMoney. Please say your 16 digit account num..."

"DAAAAAAAAAAAAAAAAAAAAAAD"

"I'm sorry, I didn't understand your request. Please..."

"AAAAAAAAAAAAAAAHHHHHHHHHHHHHHH...DON'T HIT!"

"It sounds like you're having trouble. Please say one of the following support options for further assistance. For assistance with your account balance say..."

*CRASH* "STOP IT!"

"It sounds like you're having trouble. Please say one of the following support options for further assistance. For assistance with your..."

"Stupid, fucking phone! Just give me an operator!"

"Thank you. Our mailing address is..."

*CLICK*

--
http://www.bynarystudio.com
Re:I tried Dragon Dictate... by rev_sanchez · 2004-12-09 16:02 · Score: 1

I tried playing Pearl Jam into it a few versions ago in a desperate effort to figure out what the hell Eddie Vedder was singing. It just came out in the program as garbled crap and I'm not that sure it wasn't correct.

--
If you didn't come to party don't bother knocking on my door. Prince '1999'

Re:I tried with my Mac by wibs · 2004-12-09 06:20 · Score: 2, Informative

I don't have enough experience with iListen or ViaVoice for my opinion on them to matter, but I found the built-in speech recognition for 10.2 and 10.3 to be more annoying than anything else. It would work just often enough for me to think it was doing its job, but fail just often enough to be a consistent pain in the ass. I'm looking forward to the complete rewrite in 10.4, aka VoiceOver. Here's the apple hype page for it: http://www.apple.com/macosx/tiger/voiceover.html.

--
If you get nervous, just remember that there are a few billion other people who don't really give a damn.

1st experience by gmuslera · 2004-12-09 06:20 · Score: 1

back in '96, buyed OS/2 4 that come with with a headset and ViaVoice. It could be used for dictating or for just commands. But don't liked the training part, and realized that i write faster than talk, and more important, much faster than i can talk correctly.

Looking backward, depending on my mood my voice should be like mmm remember those distorted graphics where people can say what text is in there but not OCR used for confirmations in web sites? well, the same :)

Voice recognition is sooo yesterday by pdiaz · 2004-12-09 06:20 · Score: 1

Music recognition is the way to go (ok, that was a shameless plug ;-)

Anyways, it seems that Dragon Naturally Speaking scans your documents to adapt to your writing style. My question is... how does it know if that docs where written by me?. I mean: imagine what would happen if it finds a copy of the bible on your HD ;-). And BTW: "detailed review"?. It seemed more like a "It worked for me and I liked" thing. No detailed word error rates, a lot of useless screenshots, etc...

--
Make It Secret . Free JavaScript implementation of AES for your browser

Burned in the past by tangledweb · 2004-12-09 06:20 · Score: 1, Interesting

I don't know if they have improved, but I am not too keen to try again.

Dragon comes in accent variants for particular countries but unfortunately, I have a slightly mixed accent.

This means I once spent an entire evening "training" the software, and repeating specific words over and over. It drives a person over the edge very quickly.

I am sure if I ever need to type something that is all obscenities, it will be really well trained. If I need to use any words with an oo sound in them, then I think I will have to type.

Re:Burned in the past by Anonymous Coward · 2004-12-09 06:33 · Score: 0

fook yoo looser coompootoor.
Re:Burned in the past by gmletzkojr · 2004-12-09 06:36 · Score: 1

I also tried Dragon in the past, and it did not support "New York" accent very well either. It seemed to just not get the words unless the speaker pronounced the word with dictionary precision.

--
I for one welcome our new [insert main topic] overlords.
Re:Burned in the past by lack1uster · 2004-12-09 08:33 · Score: 0

Legs: "Shut uppa' you face"
Linguo: "Shut up YOUR face"
(sterotypical italian-mobster english abuses)
Linguo: "Bad...grammer...overload..."
(Linguo explodes, lands across town near Homer)

Homer: "Linguo ... dead?"
Linguo: "Linugo ... IS ... deaaad"

SLEEP!! by Anonymous Coward · 2004-12-09 06:21 · Score: 0

For the last goddamn time I said SLEEP - not delete!

speaktalk =different mental output processes by dotmax · 2004-12-09 06:23 · Score: 1

I find it distincly difficult to speak, or dictate, with the same level of concision i can achieve with single-pass typing. The different cognative processes involved can make STT, in my experience, quite difficult to master, rendering it less useful for long-form composition than one might think. I'm just guessing, but i suspect that the difference is due to the increased processing impedance required to get words out with a keyboard -- the longer time constant gives us more time to think about what we're saying.

Here's an idea by beatdown · 2004-12-09 06:23 · Score: 0

Talk to these!

Dragon NaturallySpeaking. by silicon-pyro · 2004-12-09 06:23 · Score: 1

I use Dragon NaturallySpeaking version 4. It was released about 6 years ago. I have never had any problems with it. Initially training it, I spent about four hours reading the text passages, much more than it suggests that you need to train. I have long been able to dictate entire paragraphs and need only make a single correction. This is still much faster than typing for me, usually around 100wpm (I know, I was amazed too).

I modify my speech when I dictate, exaggerating the phonemes, speaking as clearly as possible while maintaining a reasonable and steady pace. Remember -- this version was from 6+ years ago, and I initially started on a 400Mhz PIII. With advances in hardware and software, I'm sure newer versions are even better these days.

The only drawback is when people make fun of me for speaking so oddly in a room with only a computer.

Hope this helps.

-C-

Commands by BicycloHexane · 2004-12-09 06:24 · Score: 0

I cant stand trying to use speech to text, but I do like the ability to give commands, If nothing more than for fun.

Now we're really in trouble... by Akki · 2004-12-09 06:24 · Score: 1

The link died so fast that even the Coral cache is the error message.

Re:Worst Server... by kidoman · 2004-12-09 06:25 · Score: 1

donch u worry, those dragon people would be shouting all sort of perl insults at /. right about now. but isnt it lunch hour their?? i mean, even the webadmins got to sleep. i gotta thought tho: can /. take down http://www.ibm.com/ or the likes of http://www.microsoft.com/

--
~~bada bing, bada bang, bada bong and voila~~

Let's Say it's Perfect...it's Still Dumb. by Onimaru · 2004-12-09 06:26 · Score: 1

Even if this were perfect, it would still be stupid. Have you ever listened to yourself talk? Do you really want that recorded?

More to the point: have you ever learned a foreign language? Remember how obscenely different the written language is than the spoken one? The same is true of English, we just don't notice it as much. There are more stringent requirements for written speech -- that's why giving dictation is so hard. Complete sentences, no body language or appreciable emphases, paragraph structure, and correct grammar are all expected in writing but almost frowned on in speech (after all, do you look with admiration at someone who corrects your grammar, even in private)?

I suppose if you for some reason were vastly quicker at revising stream of consciousness drivel than you were at writing coherent prose then this might be useful. So maybe for journalists who do a lot of interviews?

Or, I suppose you could be writing the next Sound and the Fury, in which case please let me know so I can come shoot you before you inflict your Emperor's New Clothes on a whole new crop of high school English classes looking for meaning in a book which, ironically, signifies nothing.

--
adam b.

Re:Let's Say it's Perfect...it's Still Dumb. by Anonymous Coward · 2004-12-09 08:46 · Score: 0

I've heard myself speak. Yes, i want it recorded. That's the point.

The Unfinished Revolution by Milo+Fungus · 2004-12-09 06:26 · Score: 1

An important read on this topic is The Unfinished Revolution by the late Michael Dertouzos. In the book he describes the core technologies and approaches of human-centric computing, and speech interface is included as an essential ingredient. It's not just for "slow typists who would like to use their voice to write", it's for the future of computing.

level of intelligence by radarsat1 · 2004-12-09 06:29 · Score: 1

unfortunately, the truth is likely that in order to get decent results out of a speech recognition engine, that is-- in order for it to act like it understands what you're really asking it to do or saying to it, it must actually understand. There is no difference (from the computer's point of view) between acting appropriately and understanding.

So I think we won't really have satisfactory speech recognition until we manage to achieve a higher level of artifical intelligence. And this, sadly, is a long way off...

Until then, you'll have to learn to speak to your computer the same way you program: by specifying precisely what you want, with no ambiguity. For a computer to resolve ambiguity, it must understand "context", and that won't really happen until it is at least as intelligent as a teenage human. Will we ever get there?

Obligatory by Anonymous Coward · 2004-12-09 06:31 · Score: 0

In Soviet Russia, computer listens to YOU!

Oh, wait...

Other "Voice Recognition" by gers0667 · 2004-12-09 06:31 · Score: 1

Net Friedman (of Ximian) fame, just blogged about his experiences with an alternative voice recognition application.

Check it out here

Still waiting for AI by CrazyJim1 · 2004-12-09 06:32 · Score: 1

Voice recognition and translations would be more powerful if the computer could imagine what you're talking about in context. While I bet you'll see grammar rules before AI though, I just like to see how many problems would be easier with AI.

--
God spoke to me.

If by talking you mean.... by ltwally · 2004-12-09 06:33 · Score: 1

If by "talking" you mean verbally abusing and threatening Windows with a loaded gun.... then: YES, YES I DO TALK TO MY PC.

--

/dev/random

Speech built into Mac OS X 10.3 by YetAnotherName · 2004-12-09 06:33 · Score: 1

Just go to System Preferences, click on Speech, choose the Recognition tab, and away you go. How well does it perform? "Naught 2 wheel." Cancel; "Knot 2 veil." Cancel; "Not to L." Cancel; oh forget it, gimme my keyboard!

Re:Speech built into Mac OS X 10.3 by MarcQuadra · 2004-12-09 07:53 · Score: 1

Speech recognition shipped on my 25MHz Mac Quadra 660av back in '93, on 'System 7.1' (wasn't called 'Mac OS' back then). It was cool and useful, worked well, and Windows isn't at that level yet eleven years later.

Also worth nothing that my Dad bought an 8MHz Mac Plus (for $2,500) back in '87 and it could do decent text-to-speech. My friends and I learned a lot about phonetics, A.I., typing, and computers by playing with that little applet.

--
"Sometimes, I think Trent just needs a cup of hot chocolate and a blankie." -Tori Amos on Nine Inch Nails

Dragon NaturallySpeaking is excellent by techmuse · 2004-12-09 06:35 · Score: 2, Informative

I have used Dragon NaturallySpeaking Professional for many years, and ViaVoice before that. ViaVoice's recognition was not so great, and the program crashed constantly. Dragon works very well. I can do around 140-150 wpm with it. I seem to have to make 1-2 corrections per sentence, sometimes less. I am using Dragon 7, but there is a new version (8) out now. I highly recommend this program if you have repetitive stress injuries, or would like to avoid developing them.

Re:Dragon NaturallySpeaking is excellent by anderiv · 2004-12-09 07:13 · Score: 1

I seem to have to make 1-2 corrections per sentence, sometimes less.

Houston, we have a problem. 1-2 corrections per sentance isn't a big deal if you're typing an email. But extrapolate that over a 10-page paper, and you're talking 800-1000 corrections. That's a drag. Whenever I've used voice recognition software in the past, I've found that the overhead induced by having to make corrections completely eliminates the time savings over typing.
Re:Dragon NaturallySpeaking is excellent by Brimstar · 2004-12-09 08:46 · Score: 1

Granted it was ages ago and I believe with either version 3 or 4, but without a Dragon recommended sound card or headset headset I could easly speak in a normal manner (after about 2-3 weeks worth of training and doing corrections) in any window and get maybe one or two errors per page in a document. Had more problems with chatting on IRC and ICQ, but not too many. Did a few multi-page papers at that time, and actually would say it didn't hurt me any. A mute switch on the mic was a godsend though. Kept it in one hand, and reached for the mouse as needed. Main thing is, make sure you do your corrections through the software and it'll catch on. It improved over time. Note this was in a dorm room for the most part so I was greatly pleased.

Mac OS X PlainTalk "Speakable Items" by Speare · 2004-12-09 06:36 · Score: 3, Interesting

I've got a few machines around the house, and one is an eMac in the living room. It's mostly for edutainment titles, games, and to act as a print server, but I played with Mac OS X's "Speakable Items" capabilities.

I use the text-to-speech on several crontab entries. Chip (yes, that's the computer's name) will announce basic daily schedule items, such as the date in the morning, kid's bedtime, and a final signoff at 11pm. I added some checks so it wouldn't talk whenever iDVD or iTunes was running. I used to have it monitor news headlines too, but it would talk too often and we would tune it out.

I also tried some "Speakable Items" for basic tasks. Essentially, there is a special folder with a number of AppleScript files. The filenames are their voice triggers. If the computer hears you say one of those filenames, it runs the AppleScript. There are nested directories with items for specific applications, so you can speak the global commands or the active app's specific commands. Well thought-out.

Some Speakable Items could come in handy, but the eMac microphone is too limited to be able to command the machine from across the room. You also cannot have a set of Speakable Items somewhere which are still active when nobody's logged in. Thus, I need to have a user logged in (and then turned away with user switch). Lastly, for most of the automation tasks I'd like to run, Perl or Bash is a better choice than AppleScript, but Speakable Items must be special text-command files or AppleScript, and I can't imagine making a bunch of AppleScript stubs for each Unix-style script I would write. These each limit the usefulness of the voice-commandable appliance I was hoping for.

On the utility side, speech command would be great for specific queries, "Chip, what day is it?" and generic countdowns: "Chip, give me ten!" and he'll tell you when ten minutes have elapsed.

--
[ .sig file not found ]

Re:Mac OS X PlainTalk "Speakable Items" by DaoudaW · 2004-12-09 09:26 · Score: 1

the eMac microphone is too limited to be able to command the machine from across the room

It'd be a bit more expensive, but you could always use a wireless lapel mike if you knew you needed to control the computer while walking around the room. I'm a teacher and got inspired by all that could be done instructionally with that set-up.
Re:Mac OS X PlainTalk "Speakable Items" by gobbo · 2004-12-09 10:33 · Score: 1

I played with Mac OS X's "Speakable Items"
Interestingly, this is actually the same feature from Mac OS 8 or or even System 7, and the user interface is hardly changed at all (though the guts may be, IDK). It's ancient tech! It was great in its day (really was revolutionary for 1996), but you'd think they would at least upgrade the stinkin' voices a wee bit. In those days (yah I'm an ol' fart), there were external "PlainTalk" microphones that sat on the monitor and did a better job than the tiny laptop mic on the eMac.
With a systemwide macro program like Keyquencer I could stack up a huge amount of everyday tasks with a simple vocal command (computer, do morning routine), while looking for my coffee cup. Try asking it to tell you a joke. There are quite a few knocknock jokes in there.

--
Damn those pesky terrorists
Re:Mac OS X PlainTalk "Speakable Items" by Omestes · 2004-12-09 10:34 · Score: 1

Sacraficing mod points for this. I am a new Mac person, and would like to know how you did this? Is this with the standard software, or did you have to buy something?

Is there any online docs on this?

--
A patriot must always be ready to defend his country against his government. -edward abbey
Re:Mac OS X PlainTalk "Speakable Items" by HitByASquirrel · 2004-12-09 13:21 · Score: 0

To turn on/off Speakable Items

To add/remove Speakable Items

Hope this helps, its fun to play with, but not very useful if you listen to music or have your computer in a noisy environment.
Re:Mac OS X PlainTalk "Speakable Items" by HitByASquirrel · 2004-12-09 13:26 · Score: 0

Yeah, it's a very old tech, since System 7 at least, and I still have a bunch of those ol' plaintalk microphoens... even this cool grey one that you could clip onto your collar.

Speakable Items was _very_ helpful for my dad after he underwent some shoulder surgery. It's something Apple definitely needs to market more, along with a plethora of other things.

What about other types of speech recognition by vespazzari · 2004-12-09 06:36 · Score: 1

Speech to text recognition for dictation is great and all, but it seems that few people can actually speak coherently while they are in deep thought. I would much reather have a program that allows me to control such things as music on my pc, like skipping tracks, volume, etc, or opening new windows, urls. Is there anything out there that is capable of doing this efficiently?

--
"Alcohol, cause of, and solution to, all of life's problems" -Homer Simpson

mirror by demon411 · 2004-12-09 06:36 · Score: 1

Mirror:
http://www.mirrordot.com/stories/a5cbfb6d cb7b3c27c 77b59ea6919a624/index.html

Our programmers loved that by SlashDread · 2004-12-09 06:37 · Score: 1

Shouting to yer neighbour: cee dee slash enter! are em space dash are ef! enter!

I just always imagine a room full... by Infonaut · 2004-12-09 06:38 · Score: 2, Insightful

of people talking to their computers. Some people aren't bothered by noise pollution, but it drives me crazy. The babble of people on the phone in a crowded space is bad enough. Add to that people talking to their computers constantly, and postal employees won't be the only people going off with AKs.

--
Read the EFF's Fair Use FAQ

Re:I just always imagine a room full... by myowntrueself · 2004-12-09 07:27 · Score: 1

And of course, what happens when you have a room full of people all talking is that the overall volume level slowly rises as they try to compensate for not being able to hear themselves by talking louder... and louder... and louder... until everyone in the entire cube farm is screaming at the tops of their voices at their computers.

Might make for a good sitcom scene...

--
In the free world the media isn't government run; the government is media run.
Re:I just always imagine a room full... by ediron2 · 2004-12-09 10:39 · Score: 1

I just always imagine a room full of people talking to their computers. Some people aren't bothered by noise pollution, but it drives me crazy. The babble of people on the phone in a crowded space is bad enough. Add to that people talking to their computers constantly, and postal employees won't be the only people going off with AKs.
Oh, like a Call Center?
Re:I just always imagine a room full... by Infonaut · 2004-12-09 16:06 · Score: 1

Yep, and I'd hate to work in a call center too. It's fucking inhumane treatment, imho.

--
Read the EFF's Fair Use FAQ

Problem with speech to text by Spy+der+Mann · 2004-12-09 06:38 · Score: 3, Interesting

IMHO, the problem with this kind of engines is that they don't make a separation between speech to phoneme / phoneme to text.

If someone designs a good open source speech to phoneme architecture, I'm sure people would start working on phoneme to text AI algorithms.

They say: "Open source? Death!!! Where will our revenues for research go?"

But... what use is patenting/selling something that doesn't work in the first place?

Again, this is only my personal opinion. (I couldn't RTFA because... *slashdotted* :-/ )

Re:Problem with speech to text by LocalFire · 2004-12-09 09:31 · Score: 1

I concur with the idea of translating speech into phonemes, but perhaps it might be easier to do a neural network of some kind.

There was a time when it was thought that Naturally Speaking would run much better on a special purpose unit that had an especially good graphics card, installed in a separate enclosure with isolation from all the RF noise created by the high frequency activities on a regular PC motherboard.

I haven't tested it recently, so I don't know if the additional processing power available from 2+ GHz processors and fast buses are any better than the old 500 mhz machine I used to run it on. My old machine could never get much better than a 20 on the built-in signal to noise checker that comes with the software.

Another problem I had then was that the learning algorithm software seemed to get loaded up with too much information after about a page and the accuracy of the transcription would fall off. This is another thing that I would like to check one of these days. I was worried that if you didn't reboot the package frequently it would fail because its data collection efforts would become overloaded with data.

Interestingly enough almost none of the comments on this thread are modded high enough to be visible at the setting I am running.
Re:Problem with speech to text by Anonymous Coward · 2004-12-09 10:23 · Score: 0

IMHO, the problem with this kind of engines is that they don't make a separation between speech to phoneme / phoneme to text.

Actually, breaking down the problem into two separate sub-problems would impair performance by increasing the risk of compounded error.

For example, if I had a speech recognition system with an integrated parser, I could rule out text that clearly is grammatically incorrect. Maybe the second best phoneme produces text that is grammatically flawless. On the other hand, I commit to certain speech-to-phoneme output, I cannot use the extra information given by the parser to improve performance.

So breaking down the problem into sub-problems is justified only if one weighs the two sources of information simultaneously, without making any intermediate decisions.
Re:Problem with speech to text by Olivier+Galibert · 2004-12-09 12:18 · Score: 1

I'll admit we haven't measured recently, but the speech-to-phone error rate was around 60% not so long ago. OTOH, speech-to-word is closer to 10% by now. Lexical information helps a lot.

Maybe you could do a phone lattice then decode the lattice to words, but there wouldn't really be a point and it would be extremely slower.

OG.
Re:Problem with speech to text by Fwongo · 2004-12-09 13:13 · Score: 1

Um, what's the advantage of having a high-end graphics card in terms of interference? I never knew that was one of the benefits.
Re:Problem with speech to text by LocalFire · 2004-12-10 06:04 · Score: 1

"graphics" was a brain cramp--I meant sound card

Mine works great. by Anonymous Coward · 2004-12-09 06:39 · Score: 0

I'm entering this comment right now using the voice software and a cheap mic. It's very useful when you're doing a couple things at once. Oh, hey honey, how are you this afternoon? Up to anything exciting? Whoa, aren't we frisky this morning? Yeah, just pull those off. We're going to do this until you submit

Re:Article text by LanMan04 · 2004-12-09 06:40 · Score: 0, Offtopic

That's exactly what happened, thanks for the support mmkkbb.

--
With the first link, the chain is forged.

I talk to mine all the time by wiredog · 2004-12-09 06:41 · Score: 1

and it talks back.

Which is weird, since it doesn't have a mic or speakers.

Right now it's telling me that it's time to go home and clean the guns.

--

Best Slashdot Co

Re:I talk to mine all the time by CSG_SurferDude · 2004-12-09 07:00 · Score: 1

WireDog;
Please let us know when your computer tells you to bring those guns to work for "show and tell".
Unfortunately, whatever day that will be is the day I have an off-site meeting. :-(
But let me know the day before, just in case.

--
LongTail SSH Brute Force analysis tool is here!

Voice Software by meatspray · 2004-12-09 06:43 · Score: 1

I played around with Dragon and Kurzweil back in the day and man were they horrible. You practically had to read the thing an entire novel to get it to 95%.

Sometime in the post Pentium revolution, algorithms got a shot in the arm and dictation software started getting significantly better even before training.

The biggest problem I've had is that reading a predetermined text to a computer doesn't sound anything like my causal style speech I'm going to use for voice input. Anything I read over turned out reasonably well (an error a page or so, that's much better than my typing)

Dragon Naturally Speaking by CFBMoo1 · 2004-12-09 06:45 · Score: 1

Back in the days of Windows 98 and all, Dragon Naturally speaking was a very nice product. I got a copy for my mother who has arthritis in her hands and she loved it for when she did stuff on the computer. Once it was trained to her she loved how much typing it saved her.

--
~~ Behold the flying cow with a rail gun! ~~

my (limited) experience by sootman · 2004-12-09 06:45 · Score: 2

I've played with the speech recognition that came with my tablet PC. Works OK if I'm by myself in a quiet room where I can non-self-conciously talk unusually loud-n-clear. Every time I've demo'ed it to people, in an office environment talking normally, the results are laugable.

The good news is, you can play "Telephone" all by yourself! Remember that game where you sat in a circle, and one person says a sentence to the person next to him, and he tells the next person, and so on all around the circle, and then you hear the final version? Just talk to your computer, then when your words are shown (incorrectly) on the screen, read those words back, and so on. Easier and more fun than going from german to french to english to spanish to french to german to english in babelfish. :-)

--
Dear Slashdot: next time you want to mess with the site, add a rich-text editor for comments.

Chip and Pin's even better. (ot) by oliverthered · 2004-12-09 06:48 · Score: 1

We've now moved to 'chip+pin' in the uk for credit card transactions.
This means that you type or pin number into a little box in full view of everyone in the cueue.

What ever happened to never tell anyone you pin number.

I should imagine that you could use the length of the keyboard loops in the pin entry box to work out the pin number remotley.

Then all you have to do is mug the person, when they pretend they don't know their pin it's ok, because you do, so not even a GBH charge for beating it out of them.

I shouldn't imagine they've used anything better than 56bit encrypition on the chips either, so you should be able to brute force in a couple of days or so.

--
thank God the internet isn't a human right.

I used to talk... by 216pi · 2004-12-09 06:48 · Score: 1

...to Opera. The Voice-Feature lets you control the browser easily including zoom-in and -out what is very interesting for those of us that are handicapped.

Using DNS/Natlink/Vocola by wstephens · 2004-12-09 06:48 · Score: 2, Informative

I've been using voice for about 6 months. I had a big issue with right hand pain this year so I talked to a fellow developer who helped me get setup. We've done some custom grammars in python for or dev environment. It's been helpful. It's a long way to go if you want to reduce mouse usage. The mouse has to be the worst peripheral for the PC. I'm considering buying the SmartNav http://www.naturalpoint.com/smartnav/ to get rid of my mouse. I've messed with one on a PC with 2 screens. It was nice.

Re:Using DNS/Natlink/Vocola by Anonymous Coward · 2004-12-09 13:23 · Score: 0

stop masterbating so much and your hand won't hurt

sheesh

Talking to PC's by Anonymous Coward · 2004-12-09 06:50 · Score: 0

If God had meant us to talk to computers he would have spared this guy's site!

Very Convincing Video Demo by ctwxman · 2004-12-09 06:51 · Score: 1

David Pogue in the New York times did a very effective review of Dragon Naturally Speaking by using the product in a split screen video. http://www.nytimes.com/videopages/2004/12/02/techn ology/20041202_STAT_VIDEO.html

I wish I could figure out how to embed a url without printing out the entire url.

Problem with text-to-speech by mcrbids · 2004-12-09 06:53 · Score: 1

As much as I type, and as fast as I do it, can you imagine how hoarse I would get trying to keep up?

Also, how would you say:

$preg_match="/([0-9a-zA-Z\_].*)[Ee]nd/"; if ($a=5) { echo $theTotalAmount; echo SubRoutine($subTotalAmount, $preg_match); echo '<br>There is <B>not enough money</B>, stupid!'; }

Keyboard for me....

--
I have no problem with your religion until you decide it's reason to deprive others of the truth.

Re:Obligatory by duffbeer703 · 2004-12-09 06:55 · Score: 1

In Korea, only old people talk to computers.

--
Conformity is the jailer of freedom and enemy of growth. -JFK

Sphinx 4 by Anonymous Coward · 2004-12-09 06:56 · Score: 0

Anyone ever had any experience with Sphinx 4?
It's Carnegie-Mellon Universities Java "speech recognizer". I plan on incorporating it into a project I'm devising. And along with that, Java's speech API seems to be circling the drain...at least as far as keeping it current and such. Visited the web-site a few weeks back and it doesn't seem to be updated very often.

Re:Article text by RicktheBrick · 2004-12-09 06:56 · Score: 1

I want both speech and noise recognition. I want to put a speaker/microphone is every room of my house. I want the computer to talk to me all the time. I basically want it to keep track of me all the time. When I am awake it should asks me if I am allright several times a day. Whenever I start something like taking a shower it should know how long it takes me and ask me if I take too long. It I start cooking something it should know how long it will take and remind me if I did not tell it I turned it off. When I leave the house it should ask me how long I expect to be gone. It should be able to recognise all noises around the house and than determine if the noise is a problem such as a leaky water pipe or gas line or the noise of a fire or someone unexpectedly being in my house(burglar). It should be able to respond to a problem by notifing someone(me, neighbor, relative, police, hospital, fire department). When I exercise it should know exactly how much I accomplished and encourage me to as much or more the next time I need to exercise. It should be both a mentor and a protector.

Nat's blog by davidkv · 2004-12-09 06:57 · Score: 1

Nat Friedman wrote about speech recognition in his blog today. Some interesting thoughts there.

I believe it has to be said... by jacobcaz · 2004-12-09 06:59 · Score: 1

In Korea, voice recognition is only for old people

No by Mantorp · 2004-12-09 07:07 · Score: 1

but I often swear at it

My Talking PC got me busted for p0rn by bonytony · 2004-12-09 07:09 · Score: 1

I can't remember the name of the song I was trying to play, but I had just installed RealVoice on my PC, and I said, "Open ????" where ??? was the name of the song I wanted to hear. Well, the computer repeated my every command, so it was pretty embarassing when it said back to me, "Open Paris Hilton Blow Job?"

Hope it is better then my Phone by CmdrObvious · 2004-12-09 07:12 · Score: 0

Hopefully the speech recognition software for PC is better than my phone. I have an LG phone, and tried and tried to get the voice dialing to work. it worked great when there two numbers, but was compleatly unable to tell the difference between "dad" and "dan". and that was with me training the phone. I know it will work one day, but i dont see my keyboard going away any time soon.

Translated via MS Office Speech by Anonymous Coward · 2004-12-09 07:21 · Score: 0

Try to use the us know what the law now once you know when to go on from the home page ½ years in the bay view of the the guy is that we're all the evidence in the navy has the one you have one of the year is the home in the movie , a once and one over the CEO is the one I know of a home is the insert its own>one of one San Juan's over the CEO is the one I know of a home. Ors in all know exorcise. It looks like the theory is visited this program. This program is the socks or. Actually the state status as a guest who would happen to somebody else tries to use this disappointed. If they're under standard thinking will.

Of, the ferry. 250 left the ROI. No tax tax wars. The more it's been a textile and yet so we were so we got so we suite its own this is not very sweet area Kennedy Kennedy nine points in the low as the stars write letters are for the new young will visit the move "a great deal of data these will you for the one that all from the on the line and we know there are only are the various the law will live in or near the loan. He saw on the one on the war e-mail address these who I real have only one home the parents who are the way I see no, I one woman will be in jail in this, you do you know that all try (to get one for $1.00 . Of course a public one merely in the other is working again with the one in the one that merely a united Ireland has.

It's everything you could wanted to be in more. $1 billion. It's getting closer and Jill mama. The period. The sentencing said.. It put two periods. But if you. But in monkey. But in button lucky to needs help, and what is that smell. The new and more of the greatest thing since the end of the prison sentence and a half its Germany and we're going on in the state of Nevada site for the new unit are in the training is over Iraq not to spend the night before you as a one. Don't feed the walkie. And the navy in Mexico is creeping 21 lead whatever you can just send it to the lowest reading 111 three and one of the last minute for the lights you a little bit of money goes to all of this is the only way we're still in the eighth and one can only is the only is the one we were of the one on the status quo has over for trial for this year thing we're a long it will of the one that's been an online or the country are ones in the reader's still no thing that if you were a lot of which ones are sooner know better The

Assertive and a year in. business footing on some business footing of your. Who is your daddy. And. That was your daddy? The Sony will get a brand names and the my diet ran so it a further run overnight the topic of Spanish like from the on the Clinton people between the two have all of a lot of the these were the ones you know, for the all are all on one wall will be one for the press has been the staff room one one or more of those who are all over the matter was, as you like or see on one day I'll know, one of the Vietnamese home the new at 101 and you have one of the one and one of the one or the vision like Percy on one day all know one of the enemy's home at the news a lot of work here, I have great but with the island way you live in fear of what got us out of the sweetie to the end of the north Norfolk and a half hour milk and milk killed for the long-awaited on the one thing I can say you were to its translating everything anybody says the middle of the shoe can't find a new.. Nowhere
These,

If you say. It means no on the planet. In the line. If (the city can run the no 11 one citizen. If the for the one at a

Re:Translated via MS Office Speech by Anonymous Coward · 2004-12-09 07:34 · Score: 0

WTF?

Preferred sucks by Rerazor · 2004-12-09 07:22 · Score: 1

I'm wanting to create a hand free environment on my desktop due to RSI but the Preferred version doesn't have the features (macros) needed to manipulate menus and windows. The problem? The Pro version costs nearly ten times the preferred version, apparently they gouge companies looking to avoid workers comp situations but people like me are left out in the cold. Yeah, I'll probably end up paying it but it feels like extortion.

I can't dictate by spaeschke · 2004-12-09 07:26 · Score: 1

My problem is that the way I speak and the way I write encompass two completely different mindsets. I think most people take on a more formal tone when writing, and dictation doesn't seem to lend itself to that clarity of thought. I much prefer the good old fashioned "come up with a draft, revise" approach to writing.

Re:I tried with my Mac by the_2nd_coming · 2004-12-09 07:28 · Score: 1

dude, that application is a spoken interface. meaning it will speak to the user what is on the screen. it might include some enhancements for taking commands, but it did not mention any.

--

I am the Alpha and the Omega-3

Voice recognition, apple and applescript by l4m3z0r · 2004-12-09 07:29 · Score: 1

I've played with voice recog stuff on my mac under OS X. I've written several AppleScripts that I tell to do things like Open FireFox, Open App. Each app i want to open has to have a seperate entry but to be honest using apple script this is an absolute cakewalk. Also the speak like nature of apple script makes me think it is going to one day be something you just starting talking at your computer and it runs what you say.

I must say this stuff is very forgiving and for the kinds of actions like open, close it works great. It however does not take dictation very well, and honestly I don't expect/want it to. What i would like to be able to do is say "computer record until monkey... say my bit monkey.. computer send last message to email friends name" and then that thing just sends that wav or mp3 or aac or whatever to my pal... And you know what with a short(reasonably) applescript I can...

Re:I tried with my Mac by crimbil · 2004-12-09 07:29 · Score: 3, Interesting

I tried iListen when my wife was having difficulty typing. I ran through the training and played with it a bit to get familiar with it prior to having her use it. The accuracy rate was very high for me.

Then she tried to use it. Even the training procedure was difficult for her. She grew up in the midwest and had no discernable accent, so that wasn't the problem. Near as I could determine, she didn't always have the same inflection when saying many words. Without the consistency of pronunciation, the software couldn't learn correctly. She became very frustrated, which led to her over-enunciating the word in question, which just confused the software even more. It became shelfware.

I dragged it out a month ago and started using it again. I've gotta say, the response time on a Dual-G5 is pretty impressive. And for the smartasses out there; no, I'm not using it for this post, I'm at work. Isn't that where everyone reads slashdot?

slow typist? by real+gumby · 2004-12-09 07:38 · Score: 1

It is encouraging for slow typists who would like to use their voice to write.

Apart from the physically handicapped, are there really that many "slow typists" who type so slowly that this would be a speed improvement?

I always assumed that speech recognition was for cases where typing was not possible (e.g. in a car, if you are handicapped, etc).

I get more throughput typing my own letters than dictating to a stenographer, for example.

Re:I tried with my Mac by wibs · 2004-12-09 07:39 · Score: 1

Since keyboard commands can be controlled through speech, and it's adding a whole bunch more for universal access, I'd assume that the new stuff could be controlled by voice commands. So I guess you're right, it's not as cool as I thought it was at first, but it will still be an improvement.

--
If you get nervous, just remember that there are a few billion other people who don't really give a damn.

Discrete, not "natural" speech by SunPin · 2004-12-09 07:40 · Score: 1

Discrete speech works fine for programming. I don't see how natural speech could work since they depend on complex recognition models. Discrete allows for the auditory nonsense of coding.

--
Laws are for people with no friends.

Yes, in fact I am by RealProgrammer · 2004-12-09 07:42 · Score: 2, Interesting

I've been working recently on a language I call 'verbal'. My goal initially was a language I could use in the car, while driving. (I love to code.)

I realized that such a language would be useful for blind people and anyone who couldn't type.

The target is a language that will mimic a subset of English, so that a program might be:

Let number be seven. While number is greater than zero, do print number; number is number minus one; done.

I've written a compiler that translates that kind of thing into C, but I'm not releasing it just yet. It only has the type int, and no functions or objects. As soon as it can handle objects, I'll post it quietly.

(I got stuck for a day doing an elegant itoa.c, but that's done now. All I needed it for was to generate good labels for constants on the symbol table, and sprintf didn't fit right. Of course I found a slightly simpler one after I got it done.)

--
sigs, as if you care.

Re:Yes, in fact I am by sehryan · 2004-12-09 08:11 · Score: 1

You should call it Keyser Soze when you release it.

--
The world moves for love. It kneels before it in awe.
Re:Yes, in fact I am by drinkypoo · 2004-12-09 11:04 · Score: 1

If you're going to be doing it with speech you need to eliminate punctuation, or you need an editor that's designed around programming with the spoken word.

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Re:Yes, in fact I am by Dr.+Sigmund+Freud · 2004-12-09 18:47 · Score: 1

The target is a language that will mimic a subset of English...
That would be cool (uber-cool if it can be extended to an OO framework.) AppleScript does a pretty good job of using a subset of English for vocab and syntax, but it remains a scripting language, not quite suited for number crunching.
Good luck with your project!

--
Dr. Freud
Technology meets Transportation.

Hello? by Napoleon440 · 2004-12-09 07:43 · Score: 1

"Hello? Hello Computer?" "Just use the keyboard."

Need translator for Australian Accents by tonejava · 2004-12-09 07:46 · Score: 1

I got a PowerBook with the speech recognition only thing is you need an American accent to get it to understand you!!!!!

MS Office speech to text by KD7JZ · 2004-12-09 07:49 · Score: 1

I recently had to write about a thousand word public address and thought I would try the speech to text that comes with Office. I loved it. It was not perfect and I still had to use the keyboard for clean up, but for that application it was great.

Re:MS Office speech to text by UNFAIRMAN · 2004-12-09 12:30 · Score: 1

I played around with it too, but I only spent about 10 minutes training it, and I've got a cheesy microphone. So, here's "Peter Piper", spoken fairly slowly and carefully:

Peter piper picked up a bit of a purse it didn't occur to obtain the third in a bitter pill to pick a bit of a person that difficult at first in a paper picked it up ever expect people that first where's the fact that the peppers that your paper bag

Peter piper picked up at the gold that there's a lot of people that are speaking of paper that if your paper picked up at the cult members where are the people peppers that either pay for it

We're paper picked that the people that first attracted to culpeper speeder paper bag that your paper but the backup and that was where of the pickle pepper is that your profession

Peter piper pick a pack of the gold at first but the fact of the gold peppers that your paper that if your paper picked up at the people that first where are the legal matters that you have a perfect

Peter piper take an act of the cold versus. A project of the people that Bruce Weir labor on it. And we're right where they think that the legal matters, where are the people that birds that your paper that.

Basic Commands with XP by Mondrames · 2004-12-09 07:54 · Score: 1

I used some freeware on my XP machine and it worked ok. Granted, I did not dictate documents - I used it for commands. For example, when I wanted to skip to the next song in WMP I assigned the phrase "next song" to CTRL-F. Handy when you are sitting reading something and don't want to get up. You also had to say "computer" to get it to wake up and listen.

Testimonial by ThePyro · 2004-12-09 07:55 · Score: 1

One of my friends uses Dragon Naturally Speaking on a regular basis and loves it. He's a family practice physician, and says the software saves him a lot of time, since he can use it to dictate his "reports" (don't know the technical term) between patients. He's used it for several years now.

What about stuttering? by tmgtmgtmg · 2004-12-09 07:56 · Score: 1

Not that I'm very interested in using software for speech recognition, but if I were, I'd be in trouble. I stutter a bit (how to react when talking to a person who stutters), and, the estimates vary, a lot of people do at some stage in their life.

Most stutterers don't stutter when they're alone, though. But, then again, even when I'm alone, recording a message on an answering machine can be a challenge since I know that someone is going to hear it. Stuttering is a big mindfuck, so I wonder whether I'd experience the same sort of self-awareness when talking to a computer.

As far as I know, software like this doesn't deal well with speech disorders, and it probably should't be expected to.

Re:What about stuttering? by cjrichard · 2004-12-09 08:07 · Score: 1

Heh, I thought no-one had wrote about this so I posted a comment. Directly below yours. =/ I have a stammer and I know what you mean about not stuttering when talking to yourself. What worries me is when speech recognition replaces more forgiving methods (i.e. a call operator). When you have a stammer it takes a bit of courage to pick up the phone, and having to talk to a machine which judges you on every word is too much pressure.

Having a stammer in a speech rec world isn't fun. by cjrichard · 2004-12-09 08:01 · Score: 1

News reports claiming that speech recognition is the future really worry me. I have a stammer, and even when I am speaking my best my voice is dysfluent and interspersed with "ums" and "ahs". At worse I cannot say anything at all - a "block". Although I try to avoid the phone, when I do talk I can at least explain to the operator that I have a stammer. Speech recognition does nothing of the sort.

Now that it is unavoidable - for example, booking a seat at a cinema - I really feel that a significant portion of society is being discriminated against. There are 550,000 people in the UK alone who have speech dysfluency problems, and yet speech recognition cannot deal with the multitude of different manifestations of this; repetition of sounds, blocking, or laboured or breathless speaking. What am I, and everyone else who has these problems, supposed to do? Even if they did put effort into making the system smarter, I can think of nothing more intimidating than having my stammer strutinized by a piece of code.

And nothing can replace the feeling of acceptance when you can tell the operator about your stammer, and what s/he can do to make it easier for you. I place the human virtues of patience and understanding over all software (even Firefox)

Using Dragon NaturallySpeaking by Video_Wizard · 2004-12-09 08:02 · Score: 1

Comments on Dragon NaturallySpeaking (and voice-recognition)

I have used and developed software for Dragon NaturallySpeaking Professional version 7.3 for about one year. Here is a summary of my experiences:

With a good microphone system, I have been able to achieve an accuracy of about 95% with Dragon NaturallySpeaking. This means that about 5% of words are misrecognized, missed, or spurious words. It seems to be acceptable for some command-and-control application on computers. For example, I browse the World Wide Web using Internet Explorer, send and receive electronic mail, play DVDs, and some other tasks fairly comfortably using voice commands. With this error rate, dictating lengthy documents proves tedious because of the frequent need to correct errors -- although it can be done. For example, this posting was dictated using Dragon NaturallySpeaking.

ScanSoft claims a significant improvement in accuracy in Dragon NaturallySpeaking version 8, recently released. I have begun testing Dragon NaturallySpeaking version 8 and cannot yet confirm or deny this claim.

I use Dragon NaturallySpeaking for a voice operated wall display computer system. The computer system consists of a laptop, computer projector, wireless microphone system, powered speakers, and Dragon NaturallySpeaking Professional version 7.3. For greater comfort, I use a directional microphone that clips on my shirt -- I do not need to wear a headset. I use the system as a Home Theater System and a general computer. Some common computer tasks such as browsing the World Wide Web and reading and responding to typical e-mails, which tend to be short, are quite easy and enjoyable. Other tasks, such as dictating lengthy documents, are rather tedious.

I can see the computer display from anywhere in my room. I can move around freely, stand, even walk while operating the computer hands-free. The wall display seems to encourage better posture than sitting at a desktop computer.

In my experience, it is necessary to retrain Dragon NaturallySpeaking frequently during the first few months of use. This seems to be due to natural variations in our voice. Over time, Dragon NaturallySpeaking can learn some of the variations in the speaker's voice and the need to retrain the program drops. Dragon NaturallySpeaking works better for long multisyllable words and complex phrases. Over time, its ability to recognize long words improves. However, Dragon NaturallySpeaking version 7 has problems with homonyms, short one syllable words, singular versus plural forms of nouns, and tenses of verbs. These problems do not go way over time.

Dragon NaturallySpeaking version 7 is sensitive to the rate at which you speak. If you speak faster than the rate that it has learned, it will fail. It is important to learn to speak evenly at the rate that Dragon NaturallySpeaking understands.

Over time, Dragon NaturallySpeaking can adapt to some background sounds -- even some kinds of music. However, it has consistent problems when another speaker is present, especially the same gender as the user.

Regarding security, there are many pieces of information that we do not want to speak aloud, for example our Social Security number or a computer password. In Dragon NaturallySpeaking Professional, one can create commands or new vocabulary entries with a secure spoken form such as "my supersecret password" which the program will transcribe as the actual sensitive value such as "XRILCYREWVP". In this way, an eavesdropper cannot determine the social security number, password, credit card number, or other sensitive information.

Dragon NaturallySpeaking is a heavy user of computer resources. Dragon NaturallySpeaking running on Windows XP seems to slow down significantly after a few days of continuous use. I frequently find it necessary to reboot the computer and to defragment the hard drive.

Dragon NaturallySpeaking version 7.3 has a learning curve. It is like any computer tool. Users need to learn

The speech technology works…the tools for ave by N1ck0 · 2004-12-09 08:05 · Score: 1

Many of the new releases of Dragon Naturally Speaking (version 7 and 8) do a very good job off recognizing speech and filtering out non-speech. The main problem is adequate training...

My company has been working tightly with Scansoft engineering for the past year+ and on most of their recent developer beta releases we can easily achieve 98.5-99.9+ recognition rates (We just got copies of v8 desktop, and haven't messed with it much). However to achieve those rates we use their backend server engine (essentially the same core technology as the desktop, just better server-oriented APIs) to process, at minimum, 4-6 hours of pre-recorded speech from each user.

The big problem with the desktop version is that not many people want to sit down for 6 hours reading random chapters from a book out loud. The easier way is to use pre-recorded audio along with transcribed text. Then the engine can just chug along and create a pretty accurate model with 8-14 hours of audio files.

Another big problem is punctuation...not many people feel like saying 'period', 'semi-colon', 'newline', etc. as they go along. Personally I find speaking formatting clues extremely frustrating, and end up giving up. So you really need learning engines to analyze the inflections and pause in people's speech patterns, and infer this missing information automatically (of course that's what we're working on right now).

All in all, if you spend all day typing large volumes of text (writing novels, instruction manuals, etc.) the current tools could get the job done (if you have the patients to stick with the training). But IMO a keyboard is much more efficient for average users.

Open Explorer? Moment is poor. by cbr2702 · 2004-12-09 08:09 · Score: 2, Funny

How perceptive!

--

This post written under Gentoo-linux with an SCO IP license.

Literacy by cbr2702 · 2004-12-09 08:15 · Score: 1

Basic literacy is not being distroyed, it is being improved. Notice that you are now able to read better. Why fight it?

--

This post written under Gentoo-linux with an SCO IP license.

Re:Literacy by Anonymous Coward · 2004-12-09 19:31 · Score: 0

distroyed

But obviously, not spell better...
Re:Literacy by cbr2702 · 2004-12-10 03:43 · Score: 1

But that's my point: you can read it, so your literacy is good. If you could only read perfect spelling, you wouldn't be as literate. Having "language abusers" helps keep your mind sharp.

--

This post written under Gentoo-linux with an SCO IP license.

The Market I wish this had by mdarksbane · 2004-12-09 08:15 · Score: 1

Is more toward the portable than my desktop.

My desktop has a nice keyboard that I'm quite accustomed to using. As a general rule I can type almost as fast as I can talk, and generally faster than I can think of what to say. Definitely faster than any speech recognition software I've used.

However, my iPod and palm *don't* have nice keyboards. If I could tell my palm to take a note, write down someone's phone number and email, or just start dictating my thoughts, and have it end up in my contacts ready to be synced... well, that'd be pretty nice.

Until then, it's just a novelty to me.

Windows flag key + U on Win XP by Anonymous Coward · 2004-12-09 08:28 · Score: 0

No I don't talk to it but I think I hear it talking to m...

IT'S ALIVE! Run!

Frenzied swearing? by Robber+Baron · 2004-12-09 08:30 · Score: 1

Does frenzied swearing count?

--

You're using her as bait, Master!

You think cell phones are annoying? by Anonymous Coward · 2004-12-09 08:31 · Score: 0

What about an office full of people talking to their computers?

Olfactory programming languages by Mad+Bad+Rabbit · 2004-12-09 08:32 · Score: 1

Has anyone given any thought as to how a programming language should be structured so that it would be easy to code by waving things with particular smells in front of the computer? If not, why not?

I guess I just never thought of it. OK, let's name it SACHET, the olfactory-based programming environment. Can anybody out there with synaesthesia give some suggestions on (for example) what a while-loop should smell like?

--
>;k

Obligatory Simpsons by Cumstien · 2004-12-09 08:33 · Score: 1

Kearny: "Jimbo, take a note on your Newton: Beat Up Martin!"
[Jimbo writes the note on his Newton and reads it back]
Jimbo: "Eat Up Martha? Bah!"
[Newton is thrown at Martin]

One of IBM's projects... by davew2040 · 2004-12-09 08:33 · Score: 1

...that I happen to have worked on a little bit is the following:

http://www-306.ibm.com/software/pervasive/multim od al/

It's basically an attempt to bring voice I/O into the Web application framework, by integrating voice I/O components into the XHTML. The desktop browser version is availabe for free, while the main target market (portable devices) isn't. It uses a version of the ViaVoice engine tailored for embedded systems that requires no training; it's designed for small sets of vocabulary rather than the large set that dictation requires, so it doesn't work all that poorly.

Granted, a programming language structured through XML is a bit of a hassle, but... :|

Not dictation... by Trinition · 2004-12-09 08:36 · Score: 1

I'm not interested in speech recognition necessarily for dictation. as other posters have pointed out, speaking code, or thinking while speaking a large document can be frustrating and unnatural.

Instead, I'd like to use my voice as a supplemental input device. I seem to always have one hand on the mouse, and one on the keyboard, ready to hit ALT-TAB, CTRL+S, or whatever. If I could use my voice for that, it would free my hands for typing (not anything else, you sickos!)
If I could give voice commands like "Save", "Next Window", "Previous Window"... it would be nice.

And perhaps even some things could be done in the background by recognizing speech rather than having to bring an application into the foreground to interact with it visually. Perhaps you could say "dialup" (for those without broadband) instead of having to click on an icon or type a command -- keeping your visiaul context in the application you were already in.

slashdotted by dingfelder · 2004-12-09 08:39 · Score: 1

bah... the site has been down all day.

can you lame-Os stop surfing at work so i can visit the main review?

Not only do we speak to computers,... by michaelpoltorak · 2004-12-09 08:42 · Score: 1

Nass and Moon discovered that: "...Individuals mindlessly apply social rules and expectations to computers. (...) individuals overuse human social categories, applying gender stereotypes to computers and ethnically identifying with computer agents.(...) people exhibit overlearned social behaviors such as politeness and reciprocity toward computers".

See: "Machines and Mindlessness: Social Responses to Computers", Journal of Social Issues, Volume 56 Issue

Short Answer: by CarrionBird · 2004-12-09 08:46 · Score: 1

No.

--
Free Mac Mini Yeah, it's

Nat Friedman's experience by Dammital · 2004-12-09 08:48 · Score: 2, Informative

Nat (the Ximian dude) recently hurt himself and has been reduced to being a one-handed typist. In order to stay connected, he's hired someone to take dictation for him. In today's blog entry he talks about the experience, what it's like for a very competent typist to use a dictation system, and thinks aloud about future intelligent speech-to-text applications.

Any NON-Real-time speech to text tools? by Bluedove · 2004-12-09 08:52 · Score: 1

I have a lot of audio files from my personal digital recorder that i need converted to greppable textfiles. I have absolutely no requirement for realtime function - i'd be very happy to load up a directory of files and leave my computer to think about it for hours/days/weeks for an optimal recognition/transcription.

Has anybody got a tool for something like that? I've been searching for months and found nothing suitable.

--
Demonstrant's Open Source Tools

Installed and running in Major Hospitals by Blitzenn · 2004-12-09 08:53 · Score: 1

I have actually installed a few Speech to Text systems in some major hospitals to reduce the workload of the transcriptionist staff. This technology actually works and works very well. I could not read the slashdotted article, so I am not sure of the direction that it is taking, but I will tell you it works well. The engines we used were from the Dragon software and are wrapped in a complex learning system and a SQL database backend. The doctors have to spend a few hours up front 'talking' to the system so that it can 'learn' their speech patterns. After that it has an accuracy rate of 95%. The last 5% of the errors are caught by a reduced transcriptionist staff. The savings are huge to the hospital(s) and there are no longer delays in getting the medical records filed in a timely fashion. Win Win situation.

I remember Dragon ... by Pathetic+Coward · 2004-12-09 09:02 · Score: 1

as one of the biggest 2001 crash victims. The original owners sold their company to Lernout-Hauspie for what was then several hundred million dollars worth of stock, and is now about ten cents worth of stock. No exaggeration here.

At least, though, they're better off than Messrs. Lernout and Hauspie, who are in jail.

The last I heard of Dragon was that the technology was sold off in the L&H bankruptcy proceedings to some software discounter.

Trial version by Anonymous Coward · 2004-12-09 09:09 · Score: 0

The software is very expensive. Is there a trial version I can try?

MOD UP please by Anonymous Coward · 2004-12-09 09:14 · Score: 0

This guy writes a useful article about specific, real world usuability and it's get's one'd? Whereas some putz makes a stoopid and obvious 'joke' about the mistakes a sloppy dictator can get from voice recognition software and it gets a Funny:5? Feh

Re:Having a stammer in a speech rec world isn't fu by narcc · 2004-12-09 09:28 · Score: 1

I really feel that a significant portion of society is being discriminated against. There are 550,000 people in the UK alone who have speech dysfluency problems, and yet speech recognition cannot deal with the multitude of different manifestations of this; repetition of sounds, blocking, or laboured or breathless speaking.

How, exactly, is your very small minority being discriminated against? That is, quite possibly, the most absurd statement I've heard in some time!

A few things:
1) Learn what discrimination is. You obviously don't know.

2) You can't expect speach recognition, which is just now becoming a viable technology, to be perfected to the point that it can cope with your particular disability.

3) It looks to me like all you're after is sympathy! I find that disgusting. Learn to live in society like the rest of us. Don't expect society to learn to live with you. You're not that important.

--
Required reading for internet skeptics

Works great on some Tablet PCs by Barlo_Mung_42 · 2004-12-09 09:29 · Score: 1

Voice recognition is an often overlooked aspect of the Tablet PC. On the single mic systems it is often confused by background noise. Some systems have an array of three microphones that help compensate. The difference between the single mic and array mic setup is astounding.

Voice is an enhancement, not a substitute by bitspotter · 2004-12-09 09:47 · Score: 1

CDs have replaced floppies, but mice haven't replaced keyboards. There seem to be plenty of good reasons (privacy, for starters) why the Linguistic User Interface won't replace the GUI.

This is not to say LUIs are pointless; There are great advantages to the LUI as well; I'm thinking that voice recognition might allow you to ditch the keyboard, but not the mouse (or other pointing device) - which could be used to designate the context of voice input (iow, "who you're talking to/about").

DNS experiance by evanfrey · 2004-12-09 09:47 · Score: 1

I had to set up dragon naturally speaking for a client about 4 years ago. It worked amazingly well (much better that the experiences that I had heard about or that I had with it personally. The big however was, the client purchased a mic & amp set that cost about $450 dollars. Also I think he just had the right voice for it because when my co-workers and I tried to use it, the software worked better then with a cheaper mic, but still not as well as was touted by the dragon developers.

--
:(){ :|:& };:

Am I talking to my PC? by Dr.+Mu · 2004-12-09 09:52 · Score: 1

Yes, #$%^&* it! But I'm ordering an iMac, so expect to be doing less of that soon. :-)

Not Quite Ready Yet by Anonymous Coward · 2004-12-09 10:06 · Score: 0

My friend, a screenwriting student at USC, developed an awful case of carpal tunnel a couple years back and has been forced to rely on Dragon Naturally Speaking to communicate online and produce his writing... The problem is, Dragon Naturally Speaking, even the latest version, really has a lot of trouble understanding even common words and phrases.

Needless to say, I get lots of weird, garbled messages and eMails from him.

"How about Saturday for the flowerpots?"

Dragon works great - with the right microphone by Silicon+Knight · 2004-12-09 10:19 · Score: 2, Informative

I'm supporting users in a medical environment with Dragon 7. Getting the right microphones made all the difference. We went from an average 96% to 98-99% efficiency just by spending ~$200 on Sennheiser headset microphones and Andrea USB sound pods from these folks.

Not affiliated, just a happy user - thanks for getting the MDs off my back!

Re:Having a stammer in a speech rec world isn't fu by cjrichard · 2004-12-09 10:30 · Score: 1

So what I said was the most absurd thing you've ever heard for some time? Wow.

Regardless of your view, I think that it is important that adequate help is given to people who can't use speech recognition methods; no different to braille on public buildings, or whatever. If a person with a stammer tries to book a theatre ticket, but they can't because the speech recognition picks up on every hesitation in the voice, what else can they do? You can't just expect them to speak better, just as you can't expect a blind person to just "make an effort" and read the signs.

i talk to it all the time! by Anonymous Coward · 2004-12-09 10:51 · Score: 0

... oh, i misread the topic... thought you meant 'PP,' not 'PC'!

[fumbling to keep fingers from going crazy]

product dis-improvement by WTF+Wazzat · 2004-12-09 10:52 · Score: 1

A certain GPS-enabled mapping program I have been using for years has been re-written to eliminate the drop-down menus and substitute badly-done little whatever-they-are at the bottom of the screen. Navigation through this mess is suddenly rendered horrid and just about impossible while driving. The idea, apparently, is to force the user to use the goofy speech regonition business built-in to the program. It reeks, folks.

There's an operator listening by Anonymous Coward · 2004-12-09 11:10 · Score: 0

The thing is that even if the computer did identify the request properly you'd have the problem of turning it into a well-formed query. Many of the requests they get are things like "There's a hairdressing shop on Shelton, or is it Howe? Anyway, it's Southy's or Surly's, I think." Combine that with the vast number of immigrants in Canada, particularly Vancouver, and there's almost no point in trying speech recognition. They do have a primary speech recognition system attacking two parts of the query, though. When it asks you what city you're interested in and whether you want a residence or a business, that's a limited enough range that the computer can just translate it and attach it to the box that appears on the operator's terminal. Keep in mind that the operators are not listening to the whole call or sticking with your call in any way. They're in front of a scheduling computer with a headset on. Your request plays in the operator's ear, the operator types it in if she can understand it and asks you for more detail if she can't. Then she's on to the next one while the computer plays it for you. She can easily take four requests in a minute. Many people are charged for this service. Depends on your plan.

Fatal error: by at_slashdot · 2004-12-09 11:22 · Score: 1

Fatal error: Call to undefined function: message_die() in /home/httpd/vhosts/pocketpcaddict.com/httpdocs/db/ db.php on line 88

Yeah...great piece of information....

--
"It is our choices, Harry, that show what we truly are, far more than our abilities." -- Prof. Dumbledore

Hah by Kanasta · 2004-12-09 11:23 · Score: 1

I've used both, because we used to have one at work and I had the other at home.

If ur not American, Dragon NaturallySpeaking is the way to go. I get good accuracy most of the time, tho there are some words it seems to NEVER learn. It doesn't hang other programs tho.

IBM ViaVoice really sux. First, no matter how hard I try to speak American, I get maybe 70%. It's also much more CPU intensive and slow, plus there are bugs in dictating numerals/dates I have reported for 3 versions and they haven't fixed! Also, it uses this lame copy/paste things to test whether a program supports it. Which means every time you move the cursor with the mouse, it types an 'x' then cuts it out. On a slow PC (600Mhz at the time), goodbye text if you made a range select just before it cuts....

I don't talk to my PC... by raehl · 2004-12-09 11:28 · Score: 1

But my PC talks to me.

For some reason though, it keeps calling me "Gordon".

--
paintball

Re:I don't talk to my PC... by the+angry+liberal · 2004-12-09 15:13 · Score: 1

But my PC talks to me.

Soviet Russia?

Why speech to text is the holy grail for some by Linuxathome · 2004-12-09 12:15 · Score: 1

1. It's awkward to talk when you're trying to compose something that requires a lot of thought first. I usually like to talk to myself (either out-loud or in my head) and type out what I'm thinking in a more formal fashion.

If you're writing a novel with lots of editing while you're writing, speech to text is not for you. If you're a physician who sees patients with just about the same conditions day in and day out, then it's a godsend if you have it working right. Have any of you heard an experienced physician rattle off a patient history into the hospital medical transcription service? Sounds a lot like the Hot Wheels guy.

For critical fields such as healthcare, speech to text will not be implemented until it's perfect. But imagine all the money saved (to help the current healthcare quagmire that we're in) if medical transcriptionists could be supplanted by software.

--
Linux at home

Text-to-speech looped to Speech-Recognition by mikey573 · 2004-12-09 12:29 · Score: 1

Proposed Experiment for Bored Hackers:

Pipe the text output from a chatbot to a text-to-speech program.

Then, use a speech recognition software package to listen to the audio and see how well it picks up the words.

If you are good, you can have two computers, both with talking and listening capability. See what kind of conversation they have. Perhaps it will mimic the conservation of two people with poor hearing.

Insipration for the above experiment: What Happens When Chat Bots Talk to Each Other

punctuation by RealProgrammer · 2004-12-09 12:35 · Score: 1

>[deal with punctuation]

I glossed that. I assume a speech module would have macros, so you'd say 'dot' to mean the end of a sentence and 'semmy' for semicolon. Or, the language could use keywords like 'stop', 'dot', 'semmy', etc., in place of punctuation, and suggest formatting rules to keep things readable.

It's no big deal to alter the grammar to use a word in place of a punctuation mark. That's how I started out, replacing '+' with the word 'plus', and so on.

--
sigs, as if you care.

Been talking to my computer for 2 decades by Anonymous Coward · 2004-12-09 12:50 · Score: 1, Funny

Now if it was only equipped with some sort of speech recognition product, I'd really have something'...

I am definitely screaming at my monitor now! by Anonymous Coward · 2004-12-09 12:54 · Score: 0

"WTF is wrong with you?! You can't be dying on me now!"

"Oh wait, it's not my monitor after all!" :)

http://shit.slashdot.org/article.pl?sid=04/12/09/1 650234

As Duke Nukem would say in the john, "Ahh! That's better!" :)

Is OS2 Warp Back? by Anonymous Coward · 2004-12-09 13:18 · Score: 0

Call me an Old Fart, but IBM had this feature integrated in OS2 before in the mid 90's. M$ said it was a feature no would ever want. Is it possible they were right?

OS/2 Warp v4 by XO · 2004-12-09 13:54 · Score: 1

OS/2 Warp v4 included IBM's full, very expensive voice recognition software, only lacking the very expensive dictionaries that the full version supported. It could still learn quite well on it's own, though.

The only problem I had with it was that a very speedy for the time 80MHz machine with 96MB RAM was quite bogged down by it. I played with it shortly after upping to 128MB on a 166MHz, and it was quite nice.

Unfortunatly, Windows/Linux/most other OS's are virtually impossible to manipulate via voice.

Ahh.. those were the days. Sit thirty feet away from the computer... say "Menu. File. Open. Down. Down. OK." sure, we now have wireless mice that will allow me to achieve the same thing much faster.. but.. the only wireless mice in that era were really awful IR based ones.

And you can't dictate with a mouse.

--
"Champagne for my real friends - and real pain for my sham friends!" http://ericblade.postalboard.com/

windows users by Anonymous Coward · 2004-12-09 14:01 · Score: 0

I know they are...Trying to convince the system not to get viruses.

Re:Bigger trouble by symbolic · 2004-12-09 14:41 · Score: 1

If the software translates exactly what it hears (which is exactly how people talk), then we're even in for a bigger problem. Sometimes I wish I was a teacher - I guarantee that if I saw one instance of a slang in a written assignment I'd mark it down a grade, and make them correct it if they wanted any grade at all. Any mention of "u" instead of "you", and it's an instant F.

Solution by YowzaTheYuzzum · 2004-12-09 15:51 · Score: 1

One word: headphones.

Re:Solution by silverfuck · 2004-12-10 00:36 · Score: 1

No headphones are ever going to replace my Klipsch speakers...

As an aside, would headphones/earphones reduce the feedback you get on your own voice just through hearing yourself, and would this have an affect on how accurate the voice recognition was?

--
You know you've been IMing too long when you almost say 'lol' out loud to a non-geeky friend...

I do NOT have LIP FUNGUS! by jrutley · 2004-12-09 17:26 · Score: 1

I LMAO every time I see this Kyocera commercial.

Girl: "Thanks, Mike. I had fun last night."
Phone: "Thanks, Mike. I have lip fungus."

My experience with Naturally Speaking by nothingx · 2004-12-09 18:09 · Score: 1

Once I tied the Naturally Speaking software together with AT&T's Natural Voices and A.L.I.C.E. bot with some interesting results. Basically, I could just speak to my machine and have it answer me in a real voice. In conversation, I found myself being more forgiving of the shortcomings of the A.I. just because it was speaking to me. When it erred, it was easier to believe that the "person" I was speaking to had just misunderstood me, or I had just misunderstood what was said.

Actually, does anyone know of any research in this area? If adding speech recognition and synthesis to a computer makes the machine seem "smarter"?

Naturally speaking ... sort of works by Netssansfrontieres · 2004-12-09 18:28 · Score: 1

A few months back, I had an accident which made it temporarily impossible for me to type with my dominant hand (left, in my case).
Amazing ... typing then becomes hunt-and-peck.
Unfortunately, I have to write for my work. I have to write LOTS for my work. It's 2004, so there's no 'steno pool'. So: voice recognition, part 1. I used the speech recognition embedded in M$FTXP. I was quite prepared for a slow learning process ... but was pleasantly surprised that it was quick. HOWEVER ... it did not seem to improve much over the next week or so, being particularly bad at really common words. AND correcting it was slow. (To be fair to M$FT: it acknowledges that the embedded speech-rec software is NOT intended for extensive use ... I actually think it's mainly intended for Chinese users to avoid the oddity of entering Han characters using a QWERTY keyboard.)

So, Speech Rec, part 2. It was clearly time to spend $$ and buy Dragon. I was particularly intrigued by the assertion in the ads that one can enter text using Dragon faster than in normal typing. Hmmm ... I bought the professional version and its microphone and settled in to teaching my computer who I am. Again, you can get going pretty quickly. And it gets better and better ...

Some weeks later, my hand is (pretty much) back to normal. I'm typing this using both hands, largely because it does remain a little time-consuming (a minute or two) to get Dragon going, and I just didn't. HOWEVER: when I have to write long documents, I now still fire up Dragon and dictate. It actually CAN BE faster than typing.

And, another thing: when I type, I look at the screen, see what's there, ... more words flow, and what ends up is, in its first draft, stream-of-conscious writing. When I use Dragon, I can concentrate on what I *really* want to say. I can -- and do -- close my eyes, and try hard to create a logical flow. I have found, to my surprise, that what I create by dictation is actually better in grammatical and logical terms than what I get when I type.

Dragon is NOT flawless. Partly, I'm sure, this is my fault (yeah, have I read the manual???) I haven't mastered the art of commands. When I tell it to 'move (the cursor) to the end of the line' ... about half of the time I get text instead of the command. And, it was more complex than it should have been to get Dragon to ignore the tiny, terrible microphone in my laptop. I did something that turned out to be a Bad Idea. I bought another, better microphone -- one for home, one for the office. Bad Idea, because the machine recognizes the Dragon microphone (which plugs into the a/v sockets) as being a different profile from the nifty USB microphone I bought. Different profile = different person.

On the final visit to the hand surgeon to check out my hand's recovery, I talked about this ... and recommended that he look into the Professional version for medics, since -- at least from my experience -- Dragon really can enable much more fluid, lucid, accurate 'writing'. It can be faster than typing And without that dreadful problem of illegible handwriting, as well.

Voice recognition and IRC. by EightMillion · 2004-12-09 18:34 · Score: 1

(www666) this is so cool I'm typing with Dragon NaturallySpeaking in mIrc
(www666) no more typing
(LameLLama) www: try "thlash exit"
*** www666 has quit IRC (Leaving)
*** www666 (baroca@spc-isp-ham-uas-05-11.sprint.ca) has joined #visualbasic
(www666) Hugh Masters
(www666) you basterdes

jesus, you're all fools by RMH101 · 2004-12-09 23:58 · Score: 1

there are *plenty* of enterprise-grade voice recognitions systems out there: both real time (dictation) and offline (digital typing). all are in constant use all over the world in legal practices, large corporations, etc, all working very nicely indeed, thank you.
dragon + decent headset and sound card = very effective.

Directionality? by Anonymous Coward · 2004-12-10 01:08 · Score: 0

With a stereo [or more] mike, the math of implementing directionality is simple. Then you can theoretically identify the direction of your source, and track it, and attenuate all other sources.

Whether anybody is actually implementing that kind of project, in an integratable GPLed package, is another question. Seems like that would be great fun... Any takers? Any google-able word combinations, so we can identify who is already working on this?

Echo canceller by UnConeD · 2004-12-10 04:27 · Score: 1

This is not that hard. The main problem is that usually there is no easy way to get exactly what is coming out of the speakers, due to varying drivers and latency. Once you have the speaker signal, you need to implement an echo canceller which subtracts the signal including reflections. This is usually done by using an LMS or FDAF algorithm to find the impulse response of the speakers relative to the microphone. And you can easily design microphones that are sensitive to (mostly) one direction to get rid of more background noise. This is implemented in every phone that has a speaker-phone function. Without it, the person on the other side would constantly hear themselves talking back with a delay, which is very annoying.

Re:Echo canceller by silverfuck · 2004-12-10 05:32 · Score: 1

I know about directional microphones and the principles of noise suppression. What I want to know is are there ways to cancel out a known (or nearly-known) signal with an unknown volume - the computer has no idea what volume I have the amplifier on my speakers set to...

--
You know you've been IMing too long when you almost say 'lol' out loud to a non-geeky friend...

Re:Having a stammer in a speech rec world isn't fu by narcc · 2004-12-13 11:39 · Score: 1

So what I said was the most absurd thing you've ever heard for some time? Wow.
Yes, it is. Read my origional comment again to gain a clear understanding.

I think that it is important that adequate help is given to people who can't use speech recognition methods ... just as you can't expect a blind person to just "make an effort" and read the signs.

You can't compare a "stammer" to blindness -- no, a blind person can't "make an effort" (sic) to see. Nor should I expect such from a blind person. A person with a "stammer", on the other hand, CAN make an effort. As another poster pointed out, often times a person with a "stammer" can speak very clearly when they're alone. I don't know of any blind people who can see just fine when no one else is around. :)

Yeah, a "stammer" is certainly NOT A DISABILITY. Nor should it be so considered. Some therapy -- both speech and psychiatric -- could do wonders for you, as it has for many others with similar problems.

--
Required reading for internet skeptics

Re:Having a stammer in a speech rec world isn't fu by cjrichard · 2004-12-14 07:30 · Score: 1

You called it a disability in your previous comment >.>

You seem to have very strong views on this. Stammering has basically no effect on my life, apart from speech recognition stuff, so I thought I would like a little article. I go to speech therapy regularly, and it's helped, but in the end it actually doesn't matter. I can still speak, as you point out.

At the same time, I wanted to highlight the difficulties that a significant (yes, it is significant, 550,000 people is 1% of the UK population) may have when trying to use this new techology. Read what I said as a plea for sympathy, a pathetic, ignorant comment, whatever.

Hi Raefer by Anonymous Coward · 2004-12-19 16:07 · Score: 0

Hi Raefer. We're still watching you.

Transcribing Perl by Nurgled · 2004-12-20 12:11 · Score: 1

I was given a copy of ViaVoice for my birthday one year after complaining about wrist pain. Fortunately, the wrist pain turned out to be due to the poor setup of my chair vs. my table, but I still had a copy of ViaVoice around to toy with.

I was writing small articles and comments with it with little problem; it handled English quite well. However, quite a lot of my typing is to produce code in Perl since that is what I do for a living. Perl has so many funny little symbols that I just spent the entire time trying to figure out what ViaVoice calls different symbols, and once that was sorted having to say long descriptions of what I wanted to type rather than just bashing a button on my keyboard. I gave up in the end.

I don't recommend speech-powered programming to anyone without a specially-targetted language. It would be interesting to try it with COBOL or AppleScript! :)

Slashdot Mirror

Are You Talking to Your PC Yet?

333 comments