Slashdot Mirror


Using PDAs for Dictation?

SunPin asks: "I'm a writer that is 99% dependent, due to fine-motor disabilities, on voice dictation. I've been a dictation user since 1990. My preference is 'discrete' speech because of very low resource consumption and its effectively infinite flexibility. Over the years, my computer use has de-evolved to programming, FTP, email (Mozilla), word processing (OpenOffice) and Ricochet. Drop the game and there's nothing that I shouldn't be allowed to do on the go. The problem is that I can't. Back in 1990, the requirements for IBM VoiceType were: DOS, 8MB RAM, 10MB of drive space with one of those new-fangled scorching 386-16MHz processors... not exactly demanding by today's standards and, unless I'm outright wrong, not demanding by today's PDA standards. Why hasn't it occurred yet?"

"In the disability offices of the hundreds of universities across the US, such software would be a major money saver because not all students need a high-powered laptop. While natural speech is great from a marketing perspective, it is simply impractical for general use and cannot adapt to mildly noisy environments. IBM, L & H and Microsoft have all given me the run-around. IBM refused to entertain the possibility. L & H is on life support, in a deep coma. Only Microsoft had a remotely positive response saying that they were testing natural recognition in Mandarin Chinese in their Beijing research office. Does anyone believe in keeping it simple, anymore?"

54 of 302 comments (clear)

  1. Well... by acehole · · Score: 3, Interesting

    The reason for this can be put down to a couple of reasons.

    First off, buying a dictaphone is still much cheaper than a PDA with software.

    And secondly the whole voice/word recognition program market hasn't really accomplished any great leaps or bounds over the past five years, not to mention it's not popular in the mainstream yet.

    --
    Be you Admins? nay, we are but lusers!
    1. Re:Well... by stratjakt · · Score: 5, Funny

      >> First off, buying a dictaphone ...

      DICTAPHONE? DICTAPHONE?

      re-vulcanize my tires, post-haste. And make sure this post is on the next auto-gyro to Prussia.

      --
      I don't need no instructions to know how to rock!!!!
    2. Re:Well... by banzai51 · · Score: 4, Funny
      Reminds me of what someone at work says about IBM:

      "IBM: Where software goes to die."

  2. Simputer by papasui · · Score: 3, Informative

    http://slashdot.org/article.pl?sid=02/11/19/234216 &mode=thread&tid=100

    1. Re:Simputer by Keighvin · · Score: 3, Insightful

      The Simputer comes with Text-to-Speech out of the box, but not Speech-to-Text. It does have microphone and USB jacks, so loading additional software may be an option. Battery life is in the not-so-great realm as the major downside.

      --
      Any spoon would be too big.
  3. My god by Anonymous Coward · · Score: 5, Funny

    Next thing, you'll be wanting a machine to wash your dishes and clothing, or, heck, let's be crazy, and send moving pictures around the world!

  4. More to do with perception by zanerock · · Score: 5, Interesting

    I think it has more to do with the perception of voice dication as unreliable and resource intensive rather than any actual fact, as the poster points out, it can be done fairly cheaply.

    I have not had much experience, but I think the other thing is that people are averse to any sort of training or teaching required, no matter the long term dividents.

    Like most things, it comes down not to fact, but to perception and prejuidice. Most people base their buying decisions on 30-second spots, not informed research, so the cost of educating people to is too high for producers to incur.

    1. Re:More to do with perception by Locutus · · Score: 5, Interesting

      I met some people at COMDEX who have VR(voice recg) running the the Sharp Zaurus. I've run IBM's VR software and it was pretty good 6 years ago. On the Zaurus, I would imagine that at 256MB CF card could hold a good sized dictionary so dictation appears to be possible. Especially since this guy was doing it on a 16MHz 386 years ago.

      The ability of the Zaurus to take a MIC input makes a big difference since a good MIC is important due to noise cancelling features they have. All the PDA's with no external MIC option are pretty much useless for VR/Dictation.

      LoB

      --
      "Anyone who stands out in the middle of a road looks like roadkill to me." --Linus
    2. Re:More to do with perception by Cy+Guy · · Score: 4, Informative
      PDAStreet has lots of info on PDA Voice Recognition:
      Since it doesn't look too promising I think you may want to expand your search beyond PDAs. I saw several references to the linux based simputer, maybe one of those with Linux based speech-to-text software is the way to go?

  5. It's not just the processor... by gpinzone · · Score: 5, Interesting

    It's the other, most overlooked piece of hardware used in speech recognition, the microphone. The junky headset given away with ViaVoice or the el cheapo unit sold in Radio Shack for under $10 makes most people's experiences with voice recognition software less than favorable. Invest in a $50-$60 professional headset and the ability of the software to accurately detect your speech patterns improves dramatically. How are they going to shoe horn a high fidelity audio sound processor in there? Maybe a USB headset might be the answer assuming the device can accept USB devices.

    I'm also going to assume that the current line of speech recognition products are MUCH better than what ran on your old 386.

    1. Re:It's not just the processor... by nojomofo · · Score: 5, Funny

      My saying car and some one from Boston saying car are drastically different but they are the same word.

      Hey! I resent that remahk! You ah stereotyping heah, and it's not fa-uh. Some of us from Bahston can say cah just like the rest of you. Just jealous, that's what you ah. Come up heah, and you'll be wicked sorry that you did. :-)

    2. Re:It's not just the processor... by CrazyJoel · · Score: 5, Interesting

      I remember seeing a ViaVoice demo a couple of years ago. The guy doing the demo said they use these headmikes that are actually 2 microphones. One mike faces the mouth, the other faces away. The circuitry then filters out any environmental noise from your voice. Don't know how much they cost though.(I'm sure I could look it up)

      --

      Such is the infinite Grace of Popeye.
    3. Re:It's not just the processor... by Zordak · · Score: 5, Funny

      You used your dictaphone to post, didn't you? Somebody mail this guy a keyboard.

      --

      Today's Sesame Street was brought to you by the number e.
    4. Re:It's not just the processor... by stienman · · Score: 3, Informative

      The guy doing the demo was probably dumbing up a basic microphone tactic that's been in use for decades.

      There are not two microphones in that headset - that would just make it worse, since no PC it would run on is real time enough to match the sound samples together, etc, etc, etc.

      Instead they use a dual port microphone. The element lies between the front of the mic (towards the speaker) and the back (towards ambient noise). Sound pressure from ambient noise tends to hit both the front and back simultanously, while sound pressure from the speaker hits only the front. The difference gives mainly the speaker, with muted external sound

      Even cheap mics have that now. The main difference between a good mic and a bad one is its construction and materials, which affect its response characteristics.

      -Adam

  6. Because by drinkypoo · · Score: 5, Informative
    Those speech recognition packages were only really capable of handling a few key phrases. In order to do seamless voice recognition that people will actually want to use it is necessary to recognize any (reasonable) word from any (Reasonable) speaker in a (reasonably) :) short amount of time.

    IBM can't even manage to do this on, for example, a P3 733EB. How they're going to do it on a 300MHz XScale or SH chip or similar (let alone a Motorola Dragonball) is beyond me. I think your head is in the clouds.

    With that said, voice recognition is very much on everyone's minds and it is coming. The limiting factor in handhelds right now is battery technology, which seems to be advancing more rapidly now than it has been in the last decade or so. With more power density comes faster processors and more ram, and the ability to perform these kinds of operations on smaller computers.

    --
    "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    1. Re:Because by photon317 · · Score: 5, Insightful


      Yeah but the author claims he was happy with discrete speech processing on a 386-16 that we had back in the day. He doesn't want continuous speech that doesn't have to be trained and all that jazz - just simple old school voice recognition. Is it so much to ask that someone port the old algorithms to the palm?

      --
      11*43+456^2
    2. Re:Because by tmark · · Score: 5, Insightful

      Yeah but the author claims he was happy with discrete speech processing on a 386-16 that we had back in the day.

      The author might be happy with what he had those days. The rest of the market would not be happy with that. In fact, the market is not happy with what we have now, as witnessed by the very low penetration of voice-recognition software. So why would we expect companies to spend the resources porting the old stuff when the new stuff won't even sell ?

    3. Re:Because by alanh · · Score: 5, Funny
      I disagree, I think voice recognition will (eventually) become the way of interacting with computers. Think Sci-Fi TV; being able to just speak and have the computer respond to your requests.


      I guess you haven't seen 2001: A Space Odyssey....

      "Open the pod bay doors HAL."
      "I'm sorry Dave, I'm afraid I can't do that."

      Maybe it wasn't that Hal was insane, just his speech recognition software failed....
      --
      - AlanH
    4. Re:Because by WatertonMan · · Score: 3, Interesting
      Speech recognition is only a niche market because of the way it is integrated at present. If there was reliable speech recognition on PDAs then I suspect many people would use that instead of the nearly as unreliable handwriting recognition. (I can speak a lot more clearly than I can write) Further it would be a boom for businessmen on the go. You could dictate notes and letters while driving, for instance.

      Sometimes niche markets turn out not to be. Just look at a lot of "desktop publishing" software. Back in 1986 that was still largely a niche market. Now it is indespensible for many, many people.

    5. Re:Because by JanneM · · Score: 5, Insightful

      Voice being the natural way to interact with devices? Think it through: an entire office trying to dictate to their word processing program all at once, with people popping in to each other trying to talk about work; an airplane of road warriors all trying to dictate stuff to their respective laptops at once (without saying anything confidential); support departments trying to make dictation work with fifty other people speaking commands to their respective clients; or programmers trying to spell their way through their creations.

      And have you ever actually tried speaking for eight to ten hours at a stretch? I'm not talking about random, occasional speech acts, but sustained, focused speech. You'd have about three weeks until laryngitis became an occupational hazard among white-collar workers.

      Speech is nice, but it is very much a niche application. Not only now, but ever. A keyboard is faster than speech, and does not contribute to noise level or occupational damage nearly as much as sustained speech would. It's a nice, even essential, mode of operation for those apps when a keyboard just won't do; the disabled, firemen, surgeons and so on will rightly love the interface. For mainstream use, however, it's just not good enough even when it's perfect.

      It could become an accessory input, on the lines of replacing menu commands for an app: mark text, say "cut", mark a place, say "paste" and so on, but it just would never replace keyboard input in any mainstream application.

      --
      Trust the Computer. The Computer is your friend.
    6. Re:Because by fishbowl · · Score: 3, Insightful

      "Think of an office full of people using speach recognition. Not pretty. "

      Almost as frightening as an office full of people all using telephones.

      You don't remember typewriters and adding machines, or for that matter, the dictaphone, do you?

      --
      -fb Everything not expressly forbidden is now mandatory.
  7. Dragon Dictate is portable by Anonymous Coward · · Score: 5, Informative

    Dragon has a portable product that you dock to your PC to do the voice to text. You can bring it with you, then connect it when you're home. A digital recorder is available bundled with the software, or you can use any micro cassett recorder and a Norcom playback and interface device. Seach Google for info!

  8. Storage space? by StandardDeviant · · Score: 5, Insightful

    I'm guessing the storage space requirements for that in terms of the data files the programs would use to map vocalizations to meaning would be the biggest stumbling block... Most mainstream PDAs only have 8mb of ram/storage combined, and Palm is still shipping devices with as little as 2mb. Your best bet might be one of the StrongArm based handhelds combined with a reasonably large CompactFlash/SecureDigital card... (E.g. Sharp Zaurus, Hewlett-ComPackard's iPaq, etc.) Of course, that's probably 300-500, but that's still less than a new laptop...

  9. dictaphone's EXSpeech by Anonymous Coward · · Score: 5, Informative

    With a simple search for dictaphone I was able to find a product called EXSpeech. I think this is what you are looking for.

    1. Re:dictaphone's EXSpeech by M00TP01NT · · Score: 3

      Although I am a fan of Dictaphone, the EXSpeech product is hardly suitable for a PDA or for the general tasks that the original poster is looking for. From the site:

      "EXSpeech(TM) offers a highly accurate continuous speech recognition solution that's fully integrated with Dictaphone's industry-standard Enterprise Express® voice and text management system. This state-of-the-art speech recognition technology, incorporated into a complete patient information workflow management system, can reduce transcription costs by more than 20% while speeding report turnaround."

  10. Re:stephan hawking by ceejayoz · · Score: 3, Informative

    Stephen Hawking uses text --> speech, not speech --> text (considering he can't speak). Text --> speech is easy, speech --> text is not.

  11. Re:Because it does not work. by Wiseazz · · Score: 3, Funny

    Unfortunately, the keyboard is the most accurate way to input data.

    According to my boss, it's actually something called an office administrator.

    --
    My sig sucks.
  12. Re:Because it does not work. by RAMMS+EIN · · Score: 3, Insightful

    I wonder where your claim that it doesn't work comes from. I have some experience (a couple of hours) with Philips FreeSpeech 2000 (I guess it's called that - I don't have it at hand). It recognizes natural speech fairly accurately. My guess would be that discrete speech is easier to recognize, making for better results and requiring less hardware real estate. I am absolutely willing to bet that it works a lot better than typing for people with disabilities. Your turn again.

    --
    Please correct me if I got my facts wrong.
  13. Dependable dictation by t0qer · · Score: 3, Interesting

    Sorry no links....

    There are dictation services availiable on the net, basically you e-mail them an MP3 and they e-mail back a fully typed document.

    As far as the reason for voice recognition not being on a PDA, I think it's space requirements. Of the two packages i've tried (dragon dictate and IBM) both of them require a lot of disk space to contain the recognition engine and your personal voice pattern files. Much more than your average PDA can hold. We're probably only a few years off from PDA's having that type of storage.

  14. voice server by Steven+Rumbalski · · Score: 3, Insightful

    Conjecture: Voice recognition on a PDA could work if you had a separate voice server over a wireless connection. So you have voice sent over a regular phone connection to you home pc (with modem) that does the recognition, it then spits back text (over another connnection?) to your PDA.

    Some might say that this would make VR to slow. I don't see why this would be noticibly slower than doing VR in person. After all, when we talk on the phone the person on the other end hears us almost instananeously.

    On a side note: my brother is doctor who uses VR to do his dictations. It is much cheaper than paying a transcription service. He also does not need to review the transcriptions afterwards for accuracy, because he essentially reviews it as he speaks it.

  15. Re:Text to Speech, why so crappy? by stratjakt · · Score: 5, Insightful

    It's not just the phonetic sounds, but the multitude of various inflections and emphasis' that are lacking, and are pretty hard to reproduce, unless the TTS engine can interpret the meaning of the text.

    Raising the voice at the end of a question may be easy enough. But how much? When? This is a question too, is it not?

    A good orator would read a more 'exciting' passage more quickly, and with more enthusiasm, punctuating key verbs and nouns. How is software to know which passages are more exciting, and which arent?

    It's not just a hard task for computers, but people too.
    Computers read aloud at about the same level as poor orator. Pho-net-i-call-y, in a dull drab monotone. Drop by the local high school, and listen to them reading shakespeare.

    Reading aloud may be simple, reading it well and naturally is a skill.

    --
    I don't need no instructions to know how to rock!!!!
  16. ViaVoice for PocketPC exists! by RevAaron · · Score: 5, Informative

    You can get a version of ViaVoice for the PocketPC. However, it sucks. It's not a real dictation system though- it only allows you do use a pretty small pre-defined group of commands, not general english word dictation. I was pretty disapointed. However, I wouldn't be surprised if eventually there will be a full-blown ViaVoice Embedded version for the PocketPC.

    As usual, there are some results that come up with a simple Google search.

    There was a Dragon Naturally Speaking beta for the Newton OS 2.1, and it works OK. But it's still a beta and is far from perfect.

    If you're looking for voice recognition for other PDAs, including PalmOS or Linux devices, you'll probably have much less luck.

    --

    Working toward a usable PDA environment in the spirit of Newton OS: Dynapad
  17. Research is underway... by Cyclopedian · · Score: 5, Interesting
    This place at the University of Washington is working on different model of speech recognition that could be conducive to PDA use (low-power, filter out extraneous info).

    Basically, they are working to analyze speech in slices (phonemes) instead of the more computationally intensive task of the whole word. This would lead to a higher success rate and could be easily used across multiple accents of the same language (English, engrish, etc).

    I'm excited about what they could accomplish there.

    -Cyc

  18. Lack of a ADC/DAC is a big problem by Myrv · · Score: 4, Informative

    Only recently have PDA's been shipping with anything approaching a good DAC and many PDAs still lack any ADC support. Without a good Analogue to Digital convertor built into the PDA you won't be able to do voice recognition. Remember that your 386 still required a soundcard to work properly. The same is true for PDAs today.

  19. Patience.... It's coming.... by jgrider · · Score: 5, Informative
    (Disclaimer: I am currently consulting for a firm that is developing a Palm cradle with built-in dication/voice recognition capabilities for the medical transcription market...)

    Since the asker wanted to know WHY nobody has done this yet, I'll spell it out:

    Basically the major pitfalls to developing this are:
    1) Crappy algorithms that mangle what you really said into something unrelated :)
    2) Power Consumption
    3) Interfacing to the PDA (not hard to do, but non-trivial)
    4) Limited PDA capabilities (Remember that Palm's DragonBall is a RISC architecture, and things like speech recognition NEED floating point math which must be emulated)

    The solutions:

    1) Somebody (not unlike me...) has to code the already existing better algorithms (check the literature - speech recognition is a mature technology, and publications abound) into a usable chunk of code, instead of simply recycling ViaVoice or NaturallySpeaking's libraries.
    2) Add more battery storage.
    3) Use another processor to do the conversion, then simply write it to the Palm in a serial stream.

    I would just wait about a year, then ask that question again to your physician friends, and see what they whip out of their pockets... :)

  20. It's the battery by PCM2 · · Score: 4, Insightful

    Palm applications, in particular, are designed around the idea of "forms" -- you put a form up on the screen, and then you sit there waiting for the user to do something. You don't run a constant loop listening to a microphone every minute, because that sucks up the battery like crazy. The Palm programming philosophy says that 99% of the machine's time should be, essentially, idle. Voice recognition, on the other hand, is very processor-intensive -- probably too much so for a pair of AAA's.

    --
    Breakfast served all day!
  21. EARS by david.given · · Score: 5, Informative

    Lo, many years ago I had a lot of luck with EARS on my 66MHz 486. It's a very simple discrete trainable recogniser; you have to teach it every word before it would recognise it. But it was fast then, it should be really fast now, and was pretty decent for recognising simple commands.

  22. A different solution by victim · · Score: 3, Insightful

    Tiny devices like cell phones and PDAs don't have the CPU power for sexy, high quality voice recognition. They do however have wireless connectivity. So, solve the problem this way...

    Install voice recognition servers, network connected boxes with powerful CPUs and the best voice recognition software you can get your hands on. A voice recognition client then just needs to send the voice data up to the server and get the translation back, say 100kbps up and some tiny amount back.

    The payback comes because most devices will only use voice recognition for brief periods, so will present a negligible load on the servers. The dictation users will place a higher load on the servers, but even there, I'm guessing there is a lot of pausing involved. I'm also going to guess that some lag is acceptable for dictation. Presumably the person is thinking about what they are saying and proof reading later. This load can be prioritized lower to allow better immediate response for people issuing voice commands on their mobile devices.

    Power consumption on the portable device will probably improve. They will have to operate their transmitter (think "talk time" vs. "on time"), but they won't need 5 watts of CPU doing recognition. (Guessing from a mobile G3 PPC, further validated, considering that the CPU spot of my iBook gets far hotter under solid use than a cellphone.)

    So, just to pick numbers out of the air, a dual processor, high end commodity hardware voice server might serve 500 pda users giving intermittent commands and 6 simultaneous dictation users.

    A company or school could easily justify the hardware cost of this service.

    Now, someone go out and build one.

  23. Whaddaya mean, no market? by ianscot · · Score: 3, Insightful
    People have been saying there's no market, here.

    You don't have to be disabled in some way to think this'd be handy, do you? That's the story for this one person, okay. But if you hadn't heard of a PDA ever before, wouldn't this be one of the most likely functions you'd think of for them? It's a totally natural application for a handheld gadget like that, and one that really would have a natural market among all the middle manager types who made Palms so popular to start with. Right?

    (Are there PDAs that can even read text in the other direction, though -- text to speech?)

    --
    "Fundamentalism" isn't about divine morality. It's about human authority.
  24. Speech-to-text? by thatguywhoiam · · Score: 3, Insightful
    I think the question really is one of processing power, and pattern recognition. I have yet to see any truly impressive speech technology beyond what was available on a Mac in 1994.

    The poster's question brings to mind a thought I've had lately, though, on PDAs and smart mobile phones. I've recently 'switched' from a Visor to just using my Sony Ericsson T68 as an organizer. Works great with iSync, etc.

    The Palm-with-phone always made more sense to me than the phone-with-organizer. It seemed that the phone part could change shape - I could stick it in my ear in the form of a headset, with a connector to the Palm. A phone I need to hold up to my head. I can't surf with something held against my head that way.

    However,

    I've realized that I need a phone more, and more importantly, I only enter very small bits of text into the Palm. Furthermore, I spend much more time looking up things than entering things (as I use the Mac do enter data whever possible).

    This led me to the conclusion -- the one thing we are missing from the organizer/phone landscape, as the poster asked, is some kind of speech-to-text.

    If I could literally hit a button and say "lunch with Dave next Tuesday" and have it enter that as live text... blammo. No more Palm, no more stylus. The phone already listens to voice commands. If it took short notes/appointments, I could literally walk around, call people, make appointments and notes, and not take the thing out of my pocket. Nice dream.

    *sigh*

    --
    If Jesus wants me it knows where to find me.
  25. hello world by _ph1ux_ · · Score: 4, Informative

    and it also has 8k of storage! You could store all of "Hel" on that thing

  26. Re:Dosent Voice work on PDAs. by falzer · · Score: 3, Funny

    Oh well. It happends.

  27. Plantronics makes Sound-Cardless Headphones by scotpurl · · Score: 3, Informative

    Plantronics makes several headsets with microphone that only require a USB connection, but do not require a sound card. They work quite well, and this should lower the hardware requirements for a small, lower-powered device.

    http://www.plantronics.com

    and search for their DSP-*00 series. I picked up their DSP-500 (normally $110) for $40 on a deal.

  28. Distributed Speech Recognition by kylef · · Score: 3, Interesting

    It is interesting that I JUST did a project on this subject for a Ubiquitous Computing class... My project was called "Distributed Speech Recognition." Here is a link:

    Distributed Speech Recognition Project

    I also have heard it through the grapevine that the big voice recognition companies are working on exactly this technology... I wouldn't be surprised if Speech .NET includes support for something like this in the near future. I believe I read on some website that support for Speech API on PocketPC was coming soon...

  29. Re:stephan hawking by Cyn · · Score: 3, Funny

    I think that Mr. Hawking's "speech" -> text would be trivially easy. For one thing, he just typed it - and for another, we have exact samplings of the voice that was generated and know how to regenerate it.

    --
    cyn, free software and *nix operating systems enthusiast.
  30. Hey! by Akardam · · Score: 4, Funny

    If I can find a machine to wash my dishes AND clothing, I'd say that'd be pretty cool!

    1. Re:Hey! by NanoGator · · Score: 4, Funny

      "If I can find a machine to wash my dishes AND clothing, I'd say that'd be pretty cool! "

      Get a wife.

      (I hope you're all happy, that comment cost me an expensive dinner.)

      --
      "Derp de derp."
  31. Re:Patience.... It's coming.... by leandrod · · Score: 3, Informative
    > Palm's DragonBall is a RISC architecture, and things like speech recognition NEED floating point math which must be emulated

    Dragonball's Motorala's, not Palm's. It is a CISC, not RISC, more specifically a M68K. RISC is usually better than CISC at floating point, but both architectures can go without a floating point unit, and that's what Dragonball does.

    --
    Leandro Guimarães Faria Corcete DUTRA
    DA, DBA, SysAdmin, Data Modeller
    GNU Project, Debian GNU/Lin
  32. NC microphones by Andy+Dodd · · Score: 4, Informative

    Some mics do this mechanically also - They have a port on the reverse side of the mic element so it only detects pressure differences between the two sides of the mic, i.e. only nearby sounds coming from one side of the mic (your mouth). Plantronics has plenty of these - Such NC headsets are common thanks to cellular telephone handsfree kits being required by law in some states, and they are quite good. (I love my Plantronics headset.)

    --
    retrorocket.o not found, launch anyway?
  33. ack! by Stoutlimb · · Score: 3, Informative

    Ack it filtered out my URL.

    http://www.dictaphone.com

  34. We're working on that. by davids-world.com · · Score: 3, Interesting
    Well, it two or three years, you will be able to buy something like that. We're working on it (MIT's Media Lab Europe).

    until recently, the PDA processors were not good enough, but that is changing rapidly (even though there is, in my view, little use for so much power except language technology).

    The resulting dictation systems will not replace conventional keyboard input for a while, however, as recognition rates are .97-.98 (accuracy), and that's a wrong word in at least every second sentence. In comparison to low-bandwith input, however, (as in the PDA with the stylus or as in the author's case due to a fine-motor dysfunction), voice recognition is very competitive.

    cheers from dublin.

  35. Move over Harry Potter... by NanoGator · · Score: 3, Funny

    "Over the years, my computer use has de-evolved to programming, FTP, email (Mozilla), word processing (OpenOffice) and Ricochet."

    I'd say this guy found the magic combination of words to get his article posted on Slashdot. Heh.

    --
    "Derp de derp."
  36. I worked on this at MS by rufusdufus · · Score: 5, Interesting

    I worked on dictation and dialogue on a PDA prototype at MS several years ago. It was called MiPad and was pretty cool. Well except that it really had to use a wireless network to a computer to get the recognition done.

    There are a couple of reasons why this hasn't hit the market yet:
    1) the PDAs really are not powerful enough to do decent recognition. Mainly, they don't have good enough audio input systems for reasonable speech quality. Also not enough disk space for dictionary storage. And the cpus are slow and the RAM is too low.

    2) at least at MS it is not a top priority to make speech work for disabled users. Outrageous you say? Not so! Turns out when the speech guys approached the accessability guys on the subject, they learned that speech recognition is not workable in most cases where accessability is needed; that is to say, the market for disabled people who cannot use the keyboard but who CAN use speech input is actually quite small. Most people who don't have the motor function to type (or use some sort of keyed input like Stephen Hawking has) dont have the motor function to speak clearly enough for speech recognition to work. Bottom line: other solutions work better.

  37. Not enough CPU by bluGill · · Score: 3, Interesting

    Sure, a 386 could do vioce recignition, but it required a special card that not only had higher quality sound inputs, but also had some DSPs to do the hard work. When IBM put voice recignition in OS/2 they warned you that a a 486 was not enough. (Several people tried it anyway, and it worked only within narrow limits)

    To emulate a DSP required a lot of floating point math. Most PDAs do not have floating point in the CPU because nothing would use it. The few times it is needed emulation is easy enough, just very slow. No problem though because as I said floating point math isn't much used.

    Don't forget that PDA cpus are not designed for speed above all else. They are designed for low power, which means they have to compromise something and require extra CPU cycles to get something done.

    Finially don't forget power requirements. When doing normal use the CPU is shut down most of the time, and drawing essentially no power. Voice recignition would change that, and your battery life would suffer drasticly.