Slashdot Mirror


Are You Talking to Your PC Yet?

An anonymous reader writes "If you have ever asked "Do those speech-to-text apps like Dragon NaturallySpeaking and IBM ViaVoice really work?" Pocket PC Addict has posted a detailed review of Dragon Naturally Speaking for Pocket PC and Desktop machines. It is written from the perspective of someone who has been burned by speech to text software in the past and had vowed to never try one of these apps again. It is encouraging for slow typists who would like to use their voice to write. Plus it details some valuable tips for using it with Pocket PCs."

40 of 333 comments (clear)

  1. Clippy by Anonymous Coward · · Score: 5, Funny

    So if I ask Clippy to STFU, will he?

  2. I treid with my Mac by the_2nd_coming · · Score: 3, Interesting

    OS X has really good system wide integration for Voice commands. and the voice interpreter is pretty good for one that comes with the OS, but I could not get it to work consistently....

    other than that I thought it was cool to say "computer give me brad's number" and it would display my buddy brad's phone number on the screen :-) (when it worked)

    --



    I am the Alpha and the Omega-3
    1. Re:I treid with my Mac by pdiaz · · Score: 2, Informative
      I don't know OS X Voice commands system, but I'm guessing that it works on a reduced vocabulary (i.e.: "close", "open", "mail", and such). That's the key there: with reduced vocabularies and a strict syntax speech recognition works pretty good. The challenge is to make it work with a natural lang . such as English (and in real time)


      Said in another way: when you issue a voice command to OS X, it has to choose between 40-100 alternatives (this is guesswork). (True) speech recognizers work with +60K vocabularies...

      --
      Make It Secret . Free JavaScript implementation of AES for your browser
  3. OpenSource Voice recognition projects? by farsideofthemoon · · Score: 5, Informative

    Any recommended ones? http://sourceforge.net/projects/cmusphinx/

    --
    I know what's on your hard dr
    1. Re:OpenSource Voice recognition projects? by Palshife · · Score: 3, Interesting

      Sphinx is a great project. I definitely recommend it.

      --
      Attention deficit disorder is a complicated issue, spanning several major... HEY LET'S GO RIDE BIKES!
  4. Summary of the article by Ingolfke · · Score: 5, Funny

    All though Text two speach is a grape gnu technology it is not red E for the main stream yet.

    1. Re:Summary of the article by jayhawk88 · · Score: 3, Funny

      Damn, is it 1993 again already? Quick, someone call Scott McNealy up and tell him we're all ready for the thin-client PC, this time we mean it, no really!

    2. Re:Summary of the article by the+grace+of+R'hllor · · Score: 4, Insightful
      All though Text two speach is a grape gnu technology it is not red E for the main stream yet.

      Now I see the master plan. Rather than improving speech-to-text technologies, the focus is on destroying basic literacy. Apart from the word 'grape', I parsed that sentence without even having to mentally translate it.

      Based on current experience, I'd say speech-to-text is going to be a growing market :-(

  5. It screws up with my accent by Janitha · · Score: 2, Interesting

    My Accent screws up everything. I hate my Accent.

  6. My problem by teiresias · · Score: 5, Funny

    The problem I always found with uuhhhhh voice writing was mmmmm filtering out unwanted noises and shhhhh distractions from my posts period return But I uhh guess they've fixed most of those burp problems by now right question mark

    --
    -Teiresias
    1. Re:My problem by silverfuck · · Score: 4, Interesting

      Exactly. Also, when will these programs start filtering for what the computer is putting out of the speakers (is there a sensible way to do this, to compensate for manual volume controls that the PC doesn't know about/control?)? I nearly always have some music on in the background...

      --
      You know you've been IMing too long when you almost say 'lol' out loud to a non-geeky friend...
  7. Spoken programming languages by The+G · · Score: 3, Interesting

    Is anyone out there giving any thought to how a programming language should be structured to make it easy to code using a speech recognition engine?

    If not, why not?

    1. Re:Spoken programming languages by vidnet · · Score: 4, Funny
      Like an intern, it should accept phrases like "code that %&#@% interface by tomorrow you @#$%@ piece of *&^$&!!!"

      Hey, maybe you could base it on perl!

    2. Re:Spoken programming languages by s4m7 · · Score: 4, Funny

      "code that %&#@% interface by tomorrow you @#$%@ piece of *&^$&!!!"

      Boss, is that you?

      --
      This comment is fully compliant with RFC 527.
  8. Voice recognition is great by mistersooreams · · Score: 3, Funny

    It walks just fin four mee!

  9. Are You Talking to Your PC Yet? by glsunder · · Score: 5, Funny

    I've been talking to my PC for years:

    You god damned son of a bitch! F'n Piece of shit!

  10. Fun with Macros by BlueCup · · Score: 2, Insightful

    My brother and I work at a company making efficiency programs... for awhile we toyed with the idea of having all of the programs activated by voice... we tested it out for awhile with an open source cantation originally used for games, that would execute a command, or type text based on what you said... for a while, it was awesome, every time we said something, it'd find the word from our list, and activate the program... problem was, when it listened to your voice, it only compared it to the words you had programs assigned to... so if you had four words, no problem, but if you had 60, it started choosing horribly... we eventually had to scrap the program all together... though it was funny watching what programs it would have to run through when I started cursing in frustration... I'm pretty sure the annoyance of people talking to their computers all over the building would have caused problems as well.

    --
    WANNAWIKI Wannawiki WannaWiki WANNAWIKI!
    1. Re:Fun with Macros by drinkypoo · · Score: 2, Interesting

      The Macintosh has had this since the Quadra, and it has just the same problem. You drop aliases (or whatever) into your speakable items folder and it matches them to your speech. Problem is, when you get fifteen or twenty things in there (maybe more on a powermac, but I'm talking the quadra experience here) it frequently would match things that made no sense whatsoever.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  11. Note to self by narsiman · · Score: 3, Funny

    Do not run webservers on PocketPCs even if you are an addict

    1. Re:Note to self by RangerRick98 · · Score: 2, Funny

      No no, you never actually make a beowulf cluster of things; you just imagine beowulf clusters of things. :)

      --
      "You're older than you've ever been, and now you're even older."
  12. That looks promising by FearTheFrail · · Score: 2, Funny

    And mine works just fine. Submit. Submit. I said submit. Why isn't this expletivedeleted thing triggering the submit button. Submit! Submit! Damn it, I have to move the mouse.

    --
    ___ In the words of Gen. Douglas McArthur: "I'll be right back."
  13. MOD PAIR RENT UP by Anonymous Coward · · Score: 2, Funny

    bracket en slash tee close bracket

  14. Re:Can't RTFA by Anonymous Coward · · Score: 2, Informative
  15. Why I've never liked speech to text by Rude+Turnip · · Score: 5, Interesting

    1. It's awkward to talk when you're trying to compose something that requires a lot of thought first. I usually like to talk to myself (either out-loud or in my head) and type out what I'm thinking in a more formal fashion.

    2. It is very tedious to go back and edit or make corrections. If I make an error while typing, I'm cognizant of the error very soon after it happens. With voice recognition, techincally "someone else" is typing and it takes more time to see where the mistakes were made.

    3. I deal with lots of boilerplate text with original content intermingled. A lot of times working on such a text becomes an editing process where using the keyboard & mouse is more efficient.

    4. My voice doesn't last for much longer than 30 minutes for non-stop speaking...and that's with short breaks for water.

    Conclusion: Just hire a hot secretary that can type.

  16. I've been swearing at it for years... by bADlOGIN · · Score: 3, Funny

    I'll be happy when someone codes a DWIM method (Do What I mean):)

    --
    *** Sigs are a stupid waste of bandwidth.
  17. I tried Dragon Dictate... by eno2001 · · Score: 5, Interesting

    ...it worked OK as long as you trained it properly and you had a nice quite room and a good mic. However, there are issues with "voice typing" that can't be overlooked. Primary is security. If you want to type a document or e-mail that contains sensitive data, make damn sure that no one can hear you. My bank recently moved to a voice activated system. I'm surprised they haven't gotten a ton of complaints from people since it REQUIRES you to say your SS# and PIN out loud. This means I can no longer check my account from my cell phone or at work. If you sit down and think about how many things you type that you would never want to say out loud, you can see why voice typing hasn't taken off. Imagine this emanating from your cubicle in a monotone:

    "http://www.goat.cx/ Take that you bukkake loving lunixtards."

    Your co-workers would think you were a nutjob if they saw half of what you posted as AC to Slashdot. ;P

    --
    -"...bad old ideas look confusingly fresh when they are packaged as technology" - Jaron Lanier (Digital Maoism on Edge.o
    1. Re:I tried Dragon Dictate... by bynary · · Score: 2, Funny

      All right, here's why I have problems with voice-driven phone systems: I have young children. Here's a typical scenario in my house:

      I call the bank to check on my balance...

      "Thank you for calling Bank WeHaveYourMoney. Please say your 16 digit account num..."

      "DAAAAAAAAAAAAAAAAAAAAAAD"

      "I'm sorry, I didn't understand your request. Please..."

      "AAAAAAAAAAAAAAAHHHHHHHHHHHHHHH...DON'T HIT!"

      "It sounds like you're having trouble. Please say one of the following support options for further assistance. For assistance with your account balance say..."

      *CRASH* "STOP IT!"

      "It sounds like you're having trouble. Please say one of the following support options for further assistance. For assistance with your..."

      "Stupid, fucking phone! Just give me an operator!"

      "Thank you. Our mailing address is..."

      *CLICK*

      --
      http://www.bynarystudio.com
  18. Re:I tried with my Mac by wibs · · Score: 2, Informative

    I don't have enough experience with iListen or ViaVoice for my opinion on them to matter, but I found the built-in speech recognition for 10.2 and 10.3 to be more annoying than anything else. It would work just often enough for me to think it was doing its job, but fail just often enough to be a consistent pain in the ass. I'm looking forward to the complete rewrite in 10.4, aka VoiceOver. Here's the apple hype page for it: http://www.apple.com/macosx/tiger/voiceover.html.

    --
    If you get nervous, just remember that there are a few billion other people who don't really give a damn.
  19. Re:Forget talking to PC's by mhaisley · · Score: 2, Interesting

    But then again, you have never seen pure aluminum...well, 99% of the population has never seen pure aluminum, it oxidies instantly. There are some methods for observation, but it's mostly not worth it, besides, pure aluminum looks mostly just like aluminum oxide.

  20. Dragon NaturallySpeaking is excellent by techmuse · · Score: 2, Informative

    I have used Dragon NaturallySpeaking Professional for many years, and ViaVoice before that. ViaVoice's recognition was not so great, and the program crashed constantly. Dragon works very well. I can do around 140-150 wpm with it. I seem to have to make 1-2 corrections per sentence, sometimes less. I am using Dragon 7, but there is a new version (8) out now. I highly recommend this program if you have repetitive stress injuries, or would like to avoid developing them.

  21. Mac OS X PlainTalk "Speakable Items" by Speare · · Score: 3, Interesting
    I've got a few machines around the house, and one is an eMac in the living room. It's mostly for edutainment titles, games, and to act as a print server, but I played with Mac OS X's "Speakable Items" capabilities.

    I use the text-to-speech on several crontab entries. Chip (yes, that's the computer's name) will announce basic daily schedule items, such as the date in the morning, kid's bedtime, and a final signoff at 11pm. I added some checks so it wouldn't talk whenever iDVD or iTunes was running. I used to have it monitor news headlines too, but it would talk too often and we would tune it out.

    I also tried some "Speakable Items" for basic tasks. Essentially, there is a special folder with a number of AppleScript files. The filenames are their voice triggers. If the computer hears you say one of those filenames, it runs the AppleScript. There are nested directories with items for specific applications, so you can speak the global commands or the active app's specific commands. Well thought-out.

    Some Speakable Items could come in handy, but the eMac microphone is too limited to be able to command the machine from across the room. You also cannot have a set of Speakable Items somewhere which are still active when nobody's logged in. Thus, I need to have a user logged in (and then turned away with user switch). Lastly, for most of the automation tasks I'd like to run, Perl or Bash is a better choice than AppleScript, but Speakable Items must be special text-command files or AppleScript, and I can't imagine making a bunch of AppleScript stubs for each Unix-style script I would write. These each limit the usefulness of the voice-commandable appliance I was hoping for.

    On the utility side, speech command would be great for specific queries, "Chip, what day is it?" and generic countdowns: "Chip, give me ten!" and he'll tell you when ten minutes have elapsed.

    --
    [ .sig file not found ]
  22. I just always imagine a room full... by Infonaut · · Score: 2, Insightful
    of people talking to their computers. Some people aren't bothered by noise pollution, but it drives me crazy. The babble of people on the phone in a crowded space is bad enough. Add to that people talking to their computers constantly, and postal employees won't be the only people going off with AKs.

    --
    Read the EFF's Fair Use FAQ
  23. Problem with speech to text by Spy+der+Mann · · Score: 3, Interesting

    IMHO, the problem with this kind of engines is that they don't make a separation between speech to phoneme / phoneme to text.

    If someone designs a good open source speech to phoneme architecture, I'm sure people would start working on phoneme to text AI algorithms.

    They say: "Open source? Death!!! Where will our revenues for research go?"

    But... what use is patenting/selling something that doesn't work in the first place?

    Again, this is only my personal opinion. (I couldn't RTFA because... *slashdotted* :-/ )

  24. my (limited) experience by sootman · · Score: 2

    I've played with the speech recognition that came with my tablet PC. Works OK if I'm by myself in a quiet room where I can non-self-conciously talk unusually loud-n-clear. Every time I've demo'ed it to people, in an office environment talking normally, the results are laugable.

    The good news is, you can play "Telephone" all by yourself! Remember that game where you sat in a circle, and one person says a sentence to the person next to him, and he tells the next person, and so on all around the circle, and then you hear the final version? Just talk to your computer, then when your words are shown (incorrectly) on the screen, read those words back, and so on. Easier and more fun than going from german to french to english to spanish to french to german to english in babelfish. :-)

    --
    Dear Slashdot: next time you want to mess with the site, add a rich-text editor for comments.
  25. Using DNS/Natlink/Vocola by wstephens · · Score: 2, Informative

    I've been using voice for about 6 months. I had a big issue with right hand pain this year so I talked to a fellow developer who helped me get setup. We've done some custom grammars in python for or dev environment. It's been helpful. It's a long way to go if you want to reduce mouse usage. The mouse has to be the worst peripheral for the PC. I'm considering buying the SmartNav http://www.naturalpoint.com/smartnav/ to get rid of my mouse. I've messed with one on a PC with 2 screens. It was nice.

  26. Re:I tried with my Mac by crimbil · · Score: 3, Interesting

    I tried iListen when my wife was having difficulty typing. I ran through the training and played with it a bit to get familiar with it prior to having her use it. The accuracy rate was very high for me.

    Then she tried to use it. Even the training procedure was difficult for her. She grew up in the midwest and had no discernable accent, so that wasn't the problem. Near as I could determine, she didn't always have the same inflection when saying many words. Without the consistency of pronunciation, the software couldn't learn correctly. She became very frustrated, which led to her over-enunciating the word in question, which just confused the software even more. It became shelfware.

    I dragged it out a month ago and started using it again. I've gotta say, the response time on a Dual-G5 is pretty impressive. And for the smartasses out there; no, I'm not using it for this post, I'm at work. Isn't that where everyone reads slashdot?

  27. Yes, in fact I am by RealProgrammer · · Score: 2, Interesting

    I've been working recently on a language I call 'verbal'. My goal initially was a language I could use in the car, while driving. (I love to code.)

    I realized that such a language would be useful for blind people and anyone who couldn't type.

    The target is a language that will mimic a subset of English, so that a program might be:

    Let number be seven.
    While number is greater than zero, do
    print number;
    number is number minus one;
    done.

    I've written a compiler that translates that kind of thing into C, but I'm not releasing it just yet. It only has the type int, and no functions or objects. As soon as it can handle objects, I'll post it quietly.

    (I got stuck for a day doing an elegant itoa.c, but that's done now. All I needed it for was to generate good labels for constants on the symbol table, and sprintf didn't fit right. Of course I found a slightly simpler one after I got it done.)

    --
    sigs, as if you care.
  28. Open Explorer? Moment is poor. by cbr2702 · · Score: 2, Funny

    How perceptive!

    --


    This post written under Gentoo-linux with an SCO IP license.
  29. Nat Friedman's experience by Dammital · · Score: 2, Informative

    Nat (the Ximian dude) recently hurt himself and has been reduced to being a one-handed typist. In order to stay connected, he's hired someone to take dictation for him. In today's blog entry he talks about the experience, what it's like for a very competent typist to use a dictation system, and thinks aloud about future intelligent speech-to-text applications.

  30. Dragon works great - with the right microphone by Silicon+Knight · · Score: 2, Informative

    I'm supporting users in a medical environment with Dragon 7. Getting the right microphones made all the difference. We went from an average 96% to 98-99% efficiency just by spending ~$200 on Sennheiser headset microphones and Andrea USB sound pods from these folks.

    Not affiliated, just a happy user - thanks for getting the MDs off my back!