Slashdot Mirror


Text-to-Speech on a Low-Power Chip

bluephone writes: "The EE Times has a story on a new chip from Winbond that can take ASCII or UNICODE text and convert it to either spoken English or Mandarin (the Chinese language, not the orange). The low-power chip scans the text and translates it into spoken phenomes and outputs it to a filter for smooth analog sound, or can directly output the digital signal. Imagine a cell phone with this, you can have your email read to you, rather than seeing a line at a time on a dinky screen, street directions from a website, or even Slashdot's headlines. :)"

263 comments

  1. wel.. by Psychopax · · Score: 0

    ..probably you won't understand it anyway.

  2. Done that by beretboy · · Score: 4, Interesting

    I have a little bot written in perl and VXML that reads my email. It is far esier than making the phone do the processing, and ita free. see studio.tellme.com

    1. Re:Done that by beretboy · · Score: 1

      And making it read headlines would be trivial

    2. Re:Done that by beretboy · · Score: 1

      If you had actually LOOKed at my refrence to studio.tellme.com you would realize that this will run ON ANY PHONE through a 800 number!

    3. Re:Done that by beretboy · · Score: 1

      LMAO, whatever punk. anyway for all of those interested in the subject matter tellme inc. just released VXML 2.0, what HTML is to the web, VXML is to the web over the phone.IMO I would highly reccomend that those who think this chip is cool get an account. I do however wish they made the server that runs the code GNU.

    4. Re:Done that by Anonymous Coward · · Score: 0

      peddle your wares somewhere else, tellme sellout;
      i remember when HCS used to be cool, before it became all about the money.

    5. Re:Done that by fux0rz · · Score: 0

      actually i did look at your link and you need to lern how to fuvking speel! jeesus... don't link up shit to shit that is so smelly with poormanships... huh? you get it? fuck nut... also, i mean, think about it... i got the ARM specs before the mafia made that poor chaneese bastard take it down and i got a GPS converting astro glyde jelly simulator that runns on chip... now think of that... i dawn the chip during coitus and wammo... fractal fucking magic! that's better than some stooped text to speek shit

    6. Re:Done that by Anonymous Coward · · Score: 0

      I like the tellme service, but it's frustrating sometimes when it picks up environmental noise and tries to interpret it as a command. A 'talk' button like on walkie-talkies would be useful. Also, using voice commands in public feels goofy.

  3. Dr. Sbaitso by ChazeFroy · · Score: 1

    Anybody remember Dr. Sbaitso? This program was great for being written way back when (1994?).

    I believe Radiohead used it as the voice for their track "Fitter Happier" on OK Computer.

    1. Re:Dr. Sbaitso by Scrameustache · · Score: 1

      I'm pretty sure that OK computer was MacIntalk.

      (The bundled voices for text to speech on MacOS 8+)

      --

      You can't take the sky from me...

    2. Re:Dr. Sbaitso by CrazyBrett · · Score: 1

      Yes! Dr. Sbaitso ruled! I'll bet many people here remember it. But...

      How many people know where the name Dr. Sbaitso actually came from?

    3. Re:Dr. Sbaitso by Genom · · Score: 2

      sb = Sound Blaster

      ai = "artificial intelligence"

      tso = ???

      text-to-speech output?

    4. Re:Dr. Sbaitso by generic-man · · Score: 3, Informative

      All you had to do was ask. Sound Blaster Acting Intelligent Text-to-Speech Operator.

      --
      For more information, click here.
    5. Re:Dr. Sbaitso by Verteiron · · Score: 1

      Let's talk about sexy stuffs.

      If you keep using language like that, I will have parity error.

      --
      End of lesson. You may press the button.
    6. Re:Dr. Sbaitso by beerits · · Score: 1

      MacInTalk is much, much older than MacOS 8. It dates back all the way to 1984 and the first macs.
      http://developer.apple.com/technotes/pt/pt_22.ht ml

    7. Re:Dr. Sbaitso by Scrameustache · · Score: 1

      Does it? But I was talking about the specific voice (I think its marvin) that seems to be the one from the OK Computer album by RADIOHEAD.
      AFAIK, that voice was new in os8.
      I could very well be wrong about that though...

      --

      You can't take the sky from me...

    8. Re:Dr. Sbaitso by EvlPenguin · · Score: 4, Funny

      MacIntalk is older than that, and quite franky, it rocks. Man or Astroman (one of the greatest bands ever -- especially live) use it as their lead singer. Fred really can sing.

      In other news, "Man or Astroman wants all the party people.. to say.... yeeeaaaaahhhhhhhhhhh"

      And by the way, the voice on "Fitter, Happier" (Radiohead) was actually Thom during an especially intense episode of innebriation >:P

      --

      --
      #nohup cat /dev/dsp > /dev/hda & killall -9 getty
    9. Re:Dr. Sbaitso by faichai · · Score: 1


      sb> Go fuck yourself
      "I don't know how to fuck"

      Hahahahahahaha....ROFLMAO (when I was 13...)

    10. Re:Dr. Sbaitso by JazzyJ · · Score: 1

      I miss Dr. Sbaitso

      anyone still have a copy of that lying around?

    11. Re:Dr. Sbaitso by Anonymous Coward · · Score: 0

      yup

      http://www.webtribe.net/a/abandonnet/apps.htm

      first one on the list! have fun.

      Aaah memories!

    12. Re:Dr. Sbaitso by Doug+Neal · · Score: 0

      Ahh, this brings back memories of when I was in school ... back then, there were only a few computers in the place that had this new "multimedia" stuff, which where in the library, and Sbaitso was installed on one of them... myself and friends used to have lots of silly immature schoolboy fun getting it to say rude words :-) One of the things that intrigued me was it's ability to read out very long numbers. It not only knew million, billion, trillion, etc but also ocarian, heptarian, hextarian... :)

    13. Re:Dr. Sbaitso by MiTEG · · Score: 1

      I believe the voice used on OK Computer was "Fred" which was the only available voice until OS 8.

      --
      The future isn't what it used to be.
    14. Re:Dr. Sbaitso by posmon · · Score: 1

      there's fuckloads of voices on at least 7.5.3, possibly earlier

      --

      update comments set karma=-1, reason='offtopic' where sid=26315

    15. Re:Dr. Sbaitso by Explo · · Score: 1

      This is still very recent compared to SAM on Commodore 64 or even to the built-in speech support on Amiga (well, until AmigaOS 2.04 or something, when the licence expired). SAM had quite bad quality, but the words could be still made out. On Amiga, the SPEAK: - device was very cool; just copy text to it and it was spoken.

      --
      Everyone who makes generalizations should be shot.
  4. Mac OS has that by homb · · Score: 1

    MacOS has had that for a while. It works ok. In fact, by default in OS 10.1 it speaks modal error dialogs. It surprised me the first time this happened.

    1. Re:Mac OS has that by SnapShot · · Score: 3, Funny

      My Amiga was talking to me 15 years ago.

      Actually, my Timex Sinclair 1000 was talking to me 20 years ago, but I think that was the acid...

      --
      Waltz, nymph, for quick jigs vex Bud.
    2. Re:Mac OS has that by SnapShot · · Score: 2, Insightful

      Actually, on a more serious note, is there anyone working on an open source speech synthesis project?

      --
      Waltz, nymph, for quick jigs vex Bud.
    3. Re:Mac OS has that by cduffy · · Score: 3, Informative

      Actually, on a more serious note, is there anyone working on an open source speech synthesis project?

      Yup; it's called Festival.

    4. Re:Mac OS has that by Chakat · · Score: 2, Informative

      Yep, it's called Festival, and the results are pretty decent. Became free as in speech a couple minor versions back, too.

      --

      If god had intended you to be naked, you would have been born that way.

    5. Re:Mac OS has that by KernelHappy · · Score: 1

      Was that what that was? I always though my Amiga was doing a Stephen Hawkins impersonation....

      But I'm still wondering why this is such a big deal, anyone else remember Speak-n-Spell and its derivatives? If thats not low powered speech I don't know what is.

      --
      -- Button up, your ignorance is showing
    6. Re:Mac OS has that by blair1q · · Score: 2

      My Clark-Nova was talking to me in the '60s, but that was its job.

      --Blair

    7. Re:Mac OS has that by Anonymous Coward · · Score: 0

      Yes, but Amiga went out of business a long time ago, so we couldn't care less.

    8. Re:Mac OS has that by bjb · · Score: 1
      Heck, my Apple ][+ was talking to me in 1980!


      Cool hack.. a program by Muse Software called "The Voice" was a 1-bit sampler that allowed you to plug a microphone into the cassette in port. It worked rather well considering what the Apple had to work with. The technology was used for the sound effects in Castle Wolfenstein, so if you've played that on an Apple, you know what it sounded like.

      --
      Never hit your grandmother with a shovel, for it leaves a bad impression on her mind...
  5. Oh no...... by the_2nd_coming · · Score: 2

    man, if they put simplt text (apples text/scripting/voice filter program) on one of those things, no one will gt there work done!!!

    --



    I am the Alpha and the Omega-3
    1. Re:Oh no...... by the_2nd_coming · · Score: 1

      sorry if I am to esoteric and dry for you, I am a geek what do you expect?

      --



      I am the Alpha and the Omega-3
    2. Re:Oh no...... by SPK · · Score: 2, Funny

      yes, but simple text and Chinese ... that's like comparing apples to oranges, no?

      --
      Regnant populi. (The people rule.) Pregnant ropuli. (The snake will soon lay eggs.)
    3. Re:Oh no...... by jiheison · · Score: 1

      yes, but simple text and Chinese ... that's like comparing apples to oranges, no?

      Not in this case, since the chip translates text in to written phonemes. This just means that the Chinese language text would have to be written phonetically, in simple text. How it handles the tonal aspect of Mandarin is another story, but I suspect that there is some phonetic writing scheme that accounts for this.

    4. Re:Oh no...... by the_2nd_coming · · Score: 1

      HA HA :-)

      --



      I am the Alpha and the Omega-3
    5. Re:Oh no...... by FrankHaynes · · Score: 1
      man, if they put simplt text (apples text/scripting/voice filter program) on one of those things, no one will gt there work done!!!


      With all due respect to the_2nd_coming, the quoted text is a perfect example of why text-to-speech systems will not be very useful for general purpose communications. Unless there is a fool-proof built-in spell checker that includes the latest /usr/dict on the chip.

      ---
      --
      slashdot: A failed experiment.
  6. Great for all sorts of devices. by Negadin · · Score: 3, Interesting

    Cell phones, PDA's, perhaps new tools for people with vision disabilities, where it could pick up plain text via IR near busy intersections or information kiosks. Text is small, broadband wouldnt be required, since its all converted in real time on a chip. Since it is supposed to be low-powered, it would be great for devices that didnt need to be recharged often, like the pagers mentioned in the article.

    I wonder how lifelike the voice is though. I don't think any text-speech tools are going to become very mainstream untill they sound better.

    1. Re:Great for all sorts of devices. by westies-from-hell · · Score: 1
      "I wonder how lifelike the voice is though. I don't think any text-speech tools are going to become very mainstream untill they sound better."

      True. Think of how far the technology has progressed, but Stephen Hawking still sounds like he's doing a screen test for "War Games".

      ....k
      -=-=-=-
      "Just because you're a genius doesn't make you a smart guy!" --Narrator, Powerpuff Girls

      --
      "Just because you're a genius doesn't make you a smart guy!" -- Narrator, Powerpuff Girls
    2. Re:Great for all sorts of devices. by phossie · · Score: 1

      for people with vision disabilities, where it could pick up plain text via IR near busy intersections

      This would perhaps not be the ideal way to do this... last I checked, IR was directional.

      --

      [|]
    3. Re:Great for all sorts of devices. by Negadin · · Score: 1

      Correct, just happened to be the wireless protocol of choice that came out of my fingers as I typed the message. 802.11 or something of that nature perhaps.

    4. Re:Great for all sorts of devices. by dodald · · Score: 1

      Text to speech can sound very lifelike, the first time I used AT&T TTS I was shocked at how realistic it was. check it out here. http://www.research.att.com/~mjm/cgi-bin/ttsdemo

      --
      101010b 2Ah 52o
    5. Re:Great for all sorts of devices. by dodald · · Score: 1

      I forgot to mention that the male voice is the one that sounds the most lifelike.

      --
      101010b 2Ah 52o
    6. Re:Great for all sorts of devices. by IronChef · · Score: 2


      All computers should sound like the Voice of World Control. Audio clips from Colossus: The Forbin Project are here for your enjoyment.

      If Stephen Hawking sounded like this, he would have taken over the world long ago.

    7. Re:Great for all sorts of devices. by Cassandra · · Score: 1

      Impressive! It fails to distinguish between past and present tense of "read" though.

    8. Re:Great for all sorts of devices. by RFC959 · · Score: 1

      That, and it said to me (spelling it out phonetically), "welcome to uh t and t labs"... It does get "ballerina" right, which is nice; that's tripped up a bunch of speech synthesizers I've tried before.

      I can't really blame it for its minor failings, though; English is /very/ inconsistent, and a good speech synthesizer has to have a lot of special-case rules.

  7. wouldn't it be easier.... by CodePoet82 · · Score: 5, Informative

    as the writeup said, this could be used in a cellphone to read what you were looking at, but wouldn't it be simpler, and backwards compatible, to just do text to speach synthisis on a remote computer. every cell phone out there can already just transmit the sound from a remote location, and it wouldn't require any new/expensive chips.

    1. Re:wouldn't it be easier.... by LWolenczak · · Score: 1

      thats what the companies would do, so they can use up your minutes when your getting your email...

    2. Re:wouldn't it be easier.... by CodePoet82 · · Score: 1

      unlimited minutes to local/same provider calls is starting to become more and more standard, under that plan, would this logically be a free call?

    3. Re:wouldn't it be easier.... by Anonymous Coward · · Score: 0

      I think Interactive Intelligence does TTS for faxes and voice mail to any remote phone already.

    4. Re:wouldn't it be easier.... by realdpk · · Score: 1

      Providers are only offering unlimited free minutes to the same provider under competitive pressure. It's not free for them to provide it. They know there is money in things like this, so you can be sure they'd charge for it.

      Anyways, it costs the cell companies $X/kB where X is a very low number (the cost, as I understand it, is mainly in that the network is overloaded, so keeping a line open for you prevents them from keeping a line open to someone else). An e-mail can be 1-2kB, a voice reading that e-mail would be many times more.

  8. Converting text to an orange ... by (void*) · · Score: 2, Funny

    would be something we can all be impressed with.

    1. Re:Converting text to an orange ... by Anonymous Coward · · Score: 0

      Oh, very easy... just get me some paper sheets with orange dna (complete version) on them.

      I'll activate my Mozilla 0.99.9999999999214a Object Composer-Synthetizer and voilà: an authentic orange.

      Just don't merge fruit fly DNA sheets or you'll get a fruit with worms. ;^)

    2. Re:Converting text to an orange ... by Anonymous Coward · · Score: 0

      I saw a demo of that once, long ago, and I sure was impressed. (Tron.)

  9. Great... by Anonymous Coward · · Score: 2, Funny

    That's all I need, Stephen Hawking's voice coming at me from my cell phone:

    "Get Viagra Because You Care !! Watch the Famous Jennifer Lopez p0rn video Swing LOOOW, Sweet Chariot dash dash dash dash dash 8 eff ess arr...."

    Anonymous cowards love the rich meaty taste of spam.

    1. Re:Great... by friscolr · · Score: 0, Redundant
      reminds me of M.C. Hawking - http://www.mchawking.com/music.html

    2. Re:Great... by Anonymous Coward · · Score: 0

      That stuff is pure genius. thanks for the link

    3. Re:Great... by Doc+Hopper · · Score: 2

      Heh, I already have this daily on my cell phone using "Voicelink", a service provided by WorldCom (and my employer, Talk2 Technology). I routinely get emails like this:

      From: root
      Subject: cron kill `ps -ef | grep username | awk '{print $2}'`

      Just imagine how that sounds read back to you over your cell phone. It really beats having to lug a laptop with me just to check my email, but the kinds of email a sysadmin receives often don't translate well into spoken English. However, it's fun to hear this female voice try to get it right. One of these days, I've gotta get together with the programmers here and make sure these things get read right, like "kill back-tick pee-ess dash ee-eff pipe grep user-name"...

  10. pr0n related use? by bludstone · · Score: 2, Funny

    could i set it to a deep erotic female sounding voice and have it read dirty stories to me?

    --

    no .sig
    1. Re:pr0n related use? by Anonymous Coward · · Score: 0

      yeah, you can do that in every state except for Utah...

    2. Re:pr0n related use? by Anonymous Coward · · Score: 0

      Better yet: make it read Slashdot articles.

  11. A Possibility... by VA+Porware · · Score: 2, Funny

    Can you imagine millions of geeks' cell phones with this chip? "First Post" echoing throughout the world...

    1. Re:A Possibility... by Anonymous Coward · · Score: 0

      that isnt funny, you are stupid

  12. Nothing new by CmdrTroll · · Score: 1
    Radio Shack used to sell these chips years ago. I once built an automated model rocket launcher that used the chip to announce the countdown - pretty damn slick, if you ask me. I believe the same chip was also used in the old TI-99/4A speech synthesizers (if anyone else remembers those).

    There's really nothing new about this product, except for its ability to speak Mandarin. And given the state of the Chinese economy, it's not very likely that many citizens over there will be in the market for talking electronic devices anytime soon. Most of them are still trying to get phone service and running water.

    -CT

    1. Re:Nothing new by morcheeba · · Score: 5, Informative

      No, those chips (it was a pair) were power-hungry 5 volt parts made by General Instrument. One was a microcontroller (8051?) with the text-to-phonome algorithm, and the other was the phonome-to-audio processor (GI SP0256). Actually, the SP0256 could accept external roms for specialized words, so it could have spoken in any language you wanted.

      Check out quadravox for boards that emulate the SP0256, using ISD's analog flash memory and a microcontroller.

      (My misadventure with the old GI chip: -12 instead of +5, just for a split second. After that, it developed an stutter!)

    2. Re:Nothing new by discoinferno · · Score: 1

      Rat Shack used to sell a chip 'like' this, not these exact chips. The voice quality of these chips is much improved, and it has MANY MANY new features. Perhaps you should read the article and actually check out the specs of the chip before running it down, hmmm?

      --
      - It's anarchy baby. Suck it up.
    3. Re:Nothing new by AndroidCat · · Score: 1

      Let's see. (I've got a board in front of me.)

      CTS256AL2/P (40 pin CPU?)
      SP0256A-AL2 (GI 28 pin thingy)

      And there's an empty socket beside the 2K RAM chip. Hmm. The board understands a bit of punctuation, takes power from an ISA slot, and converts from a RS-232 input. Speaks Computer Swedish.

      --
      One line blog. I hear that they're called Twitters now.
    4. Re:Nothing new by morcheeba · · Score: 1

      Here's some info on the SP0256. I had forgotten that there was an option for extra RAM -- the processor uses this for a buffer for incoming text; without it, its limited to something on the order of 10-30 characters. So, that socket could be for the serial rom -- check to see if it connects to pins 3-6 of the SP0256. Otherwise, it might be a socket for a 1488/1489 RS232 driver/rec'vr.

    5. Re:Nothing new by Anonymous Coward · · Score: 0

      No this is NOT some syntheizer that has problem emulating even plain English. It store human voice samples, which bring up another problem: memory limitation on small devices.

      Now the whole thing on a single chip for embedding and low-cost enough.

      It depends on where you live, if you want to live in the desert of your own country you will be the one having problem getting water and phone service.

      China has 7% GDP growth while the US has 5% unemployment rate. Take your ignorance home. Dude!

    6. Re:Nothing new by Anonymous Coward · · Score: 0

      I'd be inclined to agree with the original poster. Investing in China is unjustifiably risky during these tough economic times. They've got a backwards government, an unworkable economic system (called Communism), and a dirty, cheap culture. Having visited the apartments of several Chinese citizens who are here in the U.S. on student visas, I know firsthand how grimy these people are and how they value living in squalor. The last thing they are interested in is a text to speech chip.

  13. The only problem with this.... by Teancom · · Score: 2

    would be reading /. headlines. I mean, text-to-speech is great, but can it spell-check at the same time?

  14. applications... by recursiv · · Score: 1
    The chip could enable items such as a teddy bear that lulls a child to sleep by reading a bedtime story with the pre-programmed voice of Winnie the Pooh.

    Think of the applications for blow up dolls and pr0n!

    --
    I used to bulls-eye womp-rats in my pants
  15. Text 2 Speech on Linux by Hornsby · · Score: 1

    I've tried festival on Linux, and it's output is always really fuzzy and hard to understand. Do any of you know of any alternative programs that are more discernable in their delivery of voice? I would love to have my Linux box talk to me like one of those sexy Imac operators...

    --
    A musician without the RIAA, is like a fish without a bicycle.
    1. Re:Text 2 Speech on Linux by Anonymous Coward · · Score: 0

      It's free software... you get what you pay for. Just use a modern commerical OS instead.

    2. Re:Text 2 Speech on Linux by Anonymous Coward · · Score: 0

      ALternative: get a Mac has had it for years, Steven Hawking uses it, it works.

    3. Re:Text 2 Speech on Linux by Anonymous Coward · · Score: 0

      Have a look at IBM's web site. They've got a nice ViaVoice TTS SDK that lets you tune the voice to make it sound older, younger, whisper, etc.

    4. Re:Text 2 Speech on Linux by Nullsmack · · Score: 1

      oh, you mean an anti-innovation product like Windows?

      You do understand where we would be if Mr. Bell had decided that postal mail was enough and he didn't develope on of those fancy new voice boxes.. Or If the DMCA had been in place during the IBM vs (whoever cloned their bios) trial?

      Please. I know I'm not the only one who hates it when someone says "Forget about doing it yourself, go buy something to do it for you".

      Microsoft are the antithesis of innovation.
      Besides, do you REALLY want to run an operating system that takes as much processing power as games did a year or two ago? I'm referring to the minimum sytem requirements for windows xp of course. a p233, 128mb ram, 1.5gb hdd.. the Recommended system requirements are even higher than that.. p2-300, 256mb ram, etc..

      What does an operating system REALLY have to do, that it takes so much processing power?

      .

  16. Why dump more tech than necessary into the phone? by StaticEngine · · Score: 4, Insightful
    Sure, the tech is cheap and relatively disposable, but is moving every feature but the kitchen sink into the cellphone really the way to do it? The phone can already send and transmit voice, so why not keep the text-to-speech synthesis at some central server where the systems can be maintained and upgraded, rather than having to support/manufacture/refurbish thousands of phones out in the field?

    The cellphone may have all the power of an original Palm Pilot these days, but we don't need to make it into a Onyx Server.

  17. Old news man ... by Mr.+Eradicator · · Score: 1

    /me gets out his Speak n Spell.

    --

    That's Mr. Eradicator to you.

    trance-port
  18. Oh the possibilities! by Neutron_F1uX · · Score: 1

    Oh imagine them! TO have one of these babies that understood more languages, and could translate them to one of the others. Need a real translator? Nooope, watch my lil PDA Do it for me!

    Seriously, that could have tremendous bussiness implications for those who are doing bussiness in other countries.

    Their usage of EEPROM is nothing ut ingenious, why hasn't anyone done this before? Or have they? It makes a lot more sense then a flash card, and it's cheaper too.

    1. Re:Oh the possibilities! by jiheison · · Score: 1

      Nooope, watch my lil PDA Do it for me!

      Now hang on a second while I enter the text I want translated at 10 words per minute. . .

  19. No Big Deal by Anonymous Coward · · Score: 0
    In the old days, Radio Shack sold a set of two chips. One contained a small ROM and an 8-bit microprocessor and broke up English text into a string of "phonemes" - 64 vowel and consonant sounds. The second chip would actually synthesize the phonemes.

    This device could easily be connected to a personal computer. Somewhere, I have schematics for a version which plugged into a Commodore 64, and code for wedging the "SAY" command into BASIC - look in the old Ahoy Magazine. I saw a working version at Lawrence Tech.

    1. Re:No Big Deal by dodald · · Score: 1

      At the community college I went to. We used one of those chips in a systems integrations class. We had to wire it up on a bread board and make it talk. The professor gave us a bunch of statments we had to replicate, and we had to make it say them.

      Since it was also a programming class we also had to write a phoneme to audio program (the program only told the ship what to "say")

      I we could have kept the chips :)

      --
      101010b 2Ah 52o
  20. I bet it will choke... by MarkusQ · · Score: 5, Insightful
    I bet it will choke on:

    The lead story read: "Unionized environmental health workers object to new chip that can read un-ionized lead levels."

    Reading english is a lot tougher than most English speaking people think.

    -- MarkusQ

    1. Re:I bet it will choke... by jiheison · · Score: 2, Insightful

      I would assume that it could recognize and handle a dash appropriately.

      You're right though: rough, bough, cough, laughter, slaughter etc. might give it trouble (they certainly gave Ricky Ricardo a headache).

      It would have to do a lot more than simply translate the text to phonemes to be effective with English.

    2. Re:I bet it will choke... by gmarceau · · Score: 1

      It time to come up with a standard encoding for streams of phenomes. A big central server should be doing the english -> phenomes conversion.

      It should do it offline, using the lastest mightiest, cpu-intensive, experimental text-reading AI available. Once that's done, and it only need to be done once, you can stream the phenomes down and expect the end device to render them with the user's favorite voice tone. Neato.

      --
      This post was compiled with `% gec -O`. email me if you need the sources
    3. Re:I bet it will choke... by thelexx · · Score: 2, Informative

      Which is why the other language is Chinese. I remember hearing years ago that Chinese is very well suited for voice recognition due to the fact that it is a tonal language with a total set of only a few hundred distinct sounds. Not sure if this is true just for Mandarin or also Cantonese and the others.

      LEXX

      --
      "Gold still represents the ultimate form of payment in the world." - Alan Greenspan, 1999
    4. Re:I bet it will choke... by thelexx · · Score: 1

      I know it's bad form to a) follow up your own post and b) post before reading the linked stories, but hey. Sue me!

      So, seeing as it is a Taiwanese company developing this, it is even more obvious why they chose Chinese :) and yet would also seem to confirm that maybe Cantonese isn't as easy to deal with as Mandarin since, AFAIK, Taiwan is predominately Cantonese speaking. Unless they are trying to go for a mainland audience with the chip. Any Chinese speakers here that can help out with this?

      LEXX

      --
      "Gold still represents the ultimate form of payment in the world." - Alan Greenspan, 1999
    5. Re:I bet it will choke... by Spankophile · · Score: 2

      Far as I know:

      Taiwan = Mandarin
      Mainland China = Mandarin
      HongKong = Cantonese
      Toronto = Cantonese

    6. Re:I bet it will choke... by singularity · · Score: 1

      For what it is worth, the text-to-speech capabilities built into the MacOS could handle this sentence. A few of the voices had trouble with the hyphened word, but most were understandable to the random people I grabbed to listen.

      I have been playing around with text-to-speech for several years and I have seen it come a long way since then.

      I currently use my computer as an alarm clock. It starts up and checks my mail. While I am sitting in bed, it reads off the sender and the subject line.

      A lot of mornings, the time I get out of bed is determined by who sends me mail.

      --
      - (c) 2018 Hank Zimmerman
    7. Re:I bet it will choke... by Giant+Robot · · Score: 1

      Actually Taiwan is predominately Mandarin..

      Cantonese would be harder, since people don't usually write what they say, or read out loud exactly what was written (though the trend is breaking down). Phrases tend to be converted to common cantonese phrases when orally presented.

    8. Re:I bet it will choke... by jrockway · · Score: 1

      The problem is with things like read and read, as in "I read slashdot twenty times a day", and "I read that article yesterday." How will the computer know the context? (The data that tells you that is later on in the sentence, and if the data isn't buffered...).

      I imagine mandarin has the same problems.

      --
      My other car is first.
    9. Re:I bet it will choke... by jiheison · · Score: 1

      Maybe it will force you to write phonetically. Kind of like "speech recognition" that forces you to adopt an awkward method of speech.

      Hell, as far as english goes, that might not be a bad thing over the long run.

    10. Re:I bet it will choke... by Paul+Lamere · · Score: 1



      The Chaos

      G. Nolst Trenité

      Dearest creature in creation,
      Study English pronunciation.
      I will teach you in my verse
      Sounds like corpse, corps, horse, and worse.
      I will keep you, Suzy, busy,
      Make your head with heat grow dizzy.
      Tear in eye, your dress will tear.
      So shall I! Oh hear my prayer.

      Just compare heart, beard, and heard,
      Dies and diet, lord and word,
      Sword and sward, retain and Britain.
      (Mind the latter, how it's written.)
      Now I surely will not plague you
      With such words as plaque and ague.
      But be careful how you speak:
      Say break and steak, but bleak and streak;
      Cloven, oven, how and low,
      Script, receipt, show, poem, and toe.

      Hear me say, devoid of trickery,
      Daughter, laughter, and Terpsichore,
      Typhoid, measles, topsails, aisles,
      Exiles, similes, and reviles;
      Scholar, vicar, and cigar,
      Solar, mica, war and far;
      One, anemone, Balmoral,
      Kitchen, lichen, laundry, laurel;
      Gertrude, German, wind and mind,
      Scene, Melpomene, mankind.

      Billet does not rhyme with ballet,
      Bouquet, wallet, mallet, chalet.
      Blood and flood are not like food,
      Nor is mould like should and would.
      Viscous, viscount, load and broad,
      Toward, to forward, to reward.
      And your pronunciation's OK
      When you correctly say croquet,
      Rounded, wounded, grieve and sieve,
      Friend and fiend, alive and live.

      Ivy, privy, famous; clamour
      And enamour rhyme with hammer.
      River, rival, tomb, bomb, comb,
      Doll and roll and some and home.
      Stranger does not rhyme with anger,
      Neither does devour with clangour.
      Souls but foul, haunt but aunt,
      Font, front, wont, want, grand, and grant,
      Shoes, goes, does. Now first say finger,
      And then singer, ginger, linger,
      Real, zeal, mauve, gauze, gouge and gauge,
      Marriage, foliage, mirage, and age.

      Query does not rhyme with very,
      Nor does fury sound like bury.
      Dost, lost, post and doth, cloth, loth.
      Job, nob, bosom, transom, oath.
      Though the differences seem little,
      We say actual but victual.
      Refer does not rhyme with deafer.
      Foeffer does, and zephyr, heifer.
      Mint, pint, senate and sedate;
      Dull, bull, and George ate late.
      Scenic, Arabic, Pacific,
      Science, conscience, scientific.

      Liberty, library, heave and heaven,
      Rachel, ache, moustache, eleven.
      We say hallowed, but allowed,
      People, leopard, towed, but vowed.
      Mark the differences, moreover,
      Between mover, cover, clover;
      Leeches, breeches, wise, precise,
      Chalice, but police and lice;
      Camel, constable, unstable,
      Principle, disciple, label.

      Petal, panel, and canal,
      Wait, surprise, plait, promise, pal.
      Worm and storm, chaise, chaos, chair,
      Senator, spectator, mayor.
      Tour, but our and succour, four.
      Gas, alas, and Arkansas.
      Sea, idea, Korea, area,
      Psalm, Maria, but malaria.
      Youth, south, southern, cleanse and clean.
      Doctrine, turpentine, marine.

      Compare alien with Italian,
      Dandelion and battalion.
      Sally with ally, yea, ye,
      Eye, I, ay, aye, whey, and key.
      Say aver, but ever, fever,
      Neither, leisure, skein, deceiver.
      Heron, granary, canary.
      Crevice and device and aerie.

      Face, but preface, not efface.
      Phlegm, phlegmatic, ass, glass, bass.
      Large, but target, gin, give, verging,
      Ought, out, joust and scour, scourging.
      Ear, but earn and wear and tear
      Do not rhyme with here but ere.
      Seven is right, but so is even,
      Hyphen, roughen, nephew Stephen,
      Monkey, donkey, Turk and jerk,
      Ask, grasp, wasp, and cork and work.

      Pronunciation -- think of Psyche!
      Is a paling stout and spikey?
      Won't it make you lose your wits,
      Writing groats and saying grits?
      It's a dark abyss or tunnel:
      Strewn with stones, stowed, solace, gunwale,
      Islington and Isle of Wight,
      Housewife, verdict and indict.

      Finally, which rhymes with enough --
      Though, through, plough, or dough, or cough?
      Hiccough has the sound of cup.
      My advice is to give up!!!
  21. not the orange? by Anonymous Coward · · Score: 0

    either spoken English or Mandarin (the Chinese language, not the orange).

    It's impossible for "Mandarin" to be one or the other. As a modifier, the word is used to describe BOTH a cultural subpopulation in China and a type of orange native to the areas in which that subpopulation resides. So really, it's BOTH the Chinese language AND the orange.

    1. Re:not the orange? by Anonymous Coward · · Score: 0

      So by the same definition...



      moron (môrn, mr-)
      n.
      A stupid person; a dolt.
      Psychology. A person of mild mental retardation having a mental age of from 7 to 12 years and generally having communication and social skills enabling some degree of academic or vocational education. The term belongs to a classification system no longer in use and is now considered offensive.

      you are both as stupid person and a mentally retarded person. And somehow you cannot be either. Truly you are a moron.

  22. Good for the Blind. by jellomizer · · Score: 2, Insightful

    Any technology that can translate text to words is a good thing so the Blind people can have less of a hard time with technology which is mostly sight driven. But of course with my Really bad spelling it could drive people nuts. (Yea Yea Lern to spell and that will fix the problem) But I always want the feature to disable it no matter how low processing power it uses. Speaking is generally slower then reading. Plus there is some times were your concentration dosent need a computer speaking to you.

    --
    If something is so important that you feel the need to post it on the internet... It probably isn't that important.
  23. Kavita Maharaj by blair1q · · Score: 2

    Does it sound like Kavita Maharaj?

    Because I swear, sexy though it is, her voice is synthesized.

    --Blair

  24. Phenomes? by Sodium+Attack · · Score: 4, Insightful
    The low-power chip scans the text and translates it into spoken phenomes and outputs it to a filter for smooth analog sound, or can directly output the digital signal.

    But is it smart enough to pronounce the boldfaced word above as "phonemes"?

    --

    Never take moderation advice from sigs, including this one.

  25. Hrmm... by dasmegabyte · · Score: 2

    Didn't I see this first in "Wargames?"

    Incidentally, a guy I work with has a father who designs for Chrysler. He said that the big D-C was "really interested" in applications of text to speech. Think about it: ebooks that read themselves to you while you drive, driving directions and traffic info read to you rather than displayed on a screen (most nav screens require you to take your eyes entirely off the road and down the dash as much as 18 inches...eep!). You've got a much more useful interface, and with a low cost(though they'll charge you a grand, i'm sure) , easy to interface chip, they'll have no excuse not to bring this much safer system for data interaction to my dash today, and not six years from now.

    --
    Hey freaks: now you're ju
    1. Re:Hrmm... by Anonymous Coward · · Score: 0

      Ebooks that read themselves to you while you drive....

      You mean, books on tape?

    2. Re:Hrmm... by dasmegabyte · · Score: 2

      Yes, but read a lot of books that don't have a book on tape available. Like 95% of the market. The only books available on tape are best sellers or otherwise popular books...if i want to listen to, say, Dirk Gently's Holistic Detective Agency or the Faerie Queene or RedHat Linux Unleashed, books I can get on ebook but not as a book on tape, i'm up a creek.

      Of course, a text to speech reader isn't going to sound anywhere near as nice as an audiobook...but until you can find an audiobook of my email, or my students' papers, or the latest press release from Sun Microsystems, i'll take the coder.

      --
      Hey freaks: now you're ju
  26. read your e-mail outloud? by Peyna · · Score: 1

    I can just see it now. You're sitting on a bus on your way to work, reading your e-mail when all of a sudden you hear: "ENLARGE YOUR PENIS NOW!" "XXX GIRLS INSIDE, CUM JOIN US" "COME INSIDE AND LICK MY..." well you get the idea. I hope you have headphones!

    --
    What?
    1. Re:read your e-mail outloud? by inburito · · Score: 2

      Uh.. who seriously would have their private e-mail read out loud in a bus?

      "As your accountant I need to inform you that..",
      "Here is your divorce settlement proposal..",
      "This is your doctor. Test results came in. You have..",
      etc..

      In comparison some x-rated junk mail might actually make some poor fellows day..

  27. This'll be great for phone spam by saarbruck · · Score: 2, Funny
    Can you imagine getting your Hotmail spam read to you while your significant other is hanging around?


    Phone: "Are you looking for hot [chicks|sex|pussy|love]?"

    Wife: "um... what was that, honey?"

    Phone: "Get your University diploma!"

    Wife: "What, I'm not good enough the way I am?"

    Phone: "Get out of debt now!"

    Wife: "Okay, you know what? That's your birthday present on the Credit card, bucko. That's it. I'm leaving..."

    --
    I am the very model of a modern major general!
  28. Companies in this business by aspillai · · Score: 1

    Right now there are companies exclusively in the business of doing this. There is very little mathematical challenge here. Psychologists and linguists have researched phenomes and related material pretty well.

    But there are some implementation issues here. Example, if you have GNU. How do you say it? What about if you have Jekka Pukka Sarasate? If you were to take the literal English pronounciation you might never even be able to understand what it's trying to say. Figuring out how to solve that is an interesting CS problem.

    But this is a cool invention. Low power wireless research is just taking off. Before we were trying to figure out how to just transmit wireless well. Now we can have fun with it. I truly look forward to a wireless life :)

    Me..

    1. Re:Companies in this business by Anonymous Coward · · Score: 0

      It always amazes me how little people understand this stuff. Speech synthesis is nowhere near a solved problem; intonation, speaking style, even pronunciation variation are still open questions. There is a lot of open research and science to be done.

      Just concatenating one phoneme after another sounds, well, awful. You have to get the prosody right, the text analysis has to be good, the signal processing has to be there, the voice quality is still an issue, and it rarely sounds like the individual that was sampled.

      So "Psychologists and Lignuists have researched phonemes pretty well" is an empty statement. Knowing that we can call something /t/ even though it may have a number of realizations doesn't get us to good synthesis. There are coarticulatory phenomena, energy modulations, variations in spectral tilt, and intonation, just for starters.

      I don't doubt this chip sounds similar to the diphone voices in Festival.

  29. Better in the Next Generation by Murdock037 · · Score: 2, Funny

    Yeah, but you know the power consumption would be a hell of a lot higher on the chip that everybody would really want anyways: the Text-to-Barry-White-Speech chip.

    "You've got mail, baby."

  30. Heh. by Anonymous Coward · · Score: 0

    Somehow I doubt it'd be able to read Slashdot headlines. Can you imagine a chip trying to reproduce Malda's spelling?

  31. Re:Dr. Sbaitso - no it's bruce by Radiohead · · Score: 1

    Actually the "fitter happier" voice is the Bruce MacInTalk Pro voice that is part of MacOS. You can paste the lyrics into SimpleText on a Mac and get your very own live performance.

    The lyrics are here

  32. Great for ordering Chinese food. by 10.0.0.1 · · Score: 1

    Will it translate into broken chinenglish? "Me want pork fried rice. Chop chop."

    --
    forth ?love if honk then
    1. Re:Great for ordering Chinese food. by Uttles · · Score: 1, Funny

      No, no, that's "polk flied lice"

      --

      ~ now you know
    2. Re:Great for ordering Chinese food. by jiheison · · Score: 1

      It is more politically correct to simply request your pork fried rice with extra spit.

  33. Catching up with an Amiga? by DumbSwede · · Score: 1
    My old Amiga had decent text to speech processing in 1984.

    With all the horsepower available in any modern handheld device -- surely much more than an 8mhz 68000 with 512K of memory (of which only a fraction was used I'm sure) -- I don't understand why a dedicated chip would be needed to pull this off.

    1. Re:Catching up with an Amiga? by SnapShot · · Score: 1
      decent text to speech processing

      I don't know what model Amiga you had, but if you define decent as "sounding like a robot that forgot what intonation was but could alter its voice half an octave to simulate slight masculine or slight feminine undertones" then I'll agree with you. Don't get me wrong, I really liked my Amiga, but I never got the speech synthesis confused with, say, the sensual voice of Sophie Marceu.

      On a related note. This chip + BabelFish + web-enabled cell phone = equals ability to order the really complicated things on the menu at the chinese resteraunt (or, from the other point of view, the ability to order the really complicated things at Mom's American Cusine in downtown Beijing).

      --
      Waltz, nymph, for quick jigs vex Bud.
    2. Re:Catching up with an Amiga? by uradu · · Score: 2

      > I don't know what model Amiga you had, but if you define decent as "sounding like a robot that
      > forgot what intonation was but could alter its voice half an octave to simulate slight masculine
      > or slight feminine undertones" then I'll agree with you.

      Well, that was if you fed text to the translation device, which did its best to generate the required phonetic output--also in ASCII--that was fed to the speak device. This translation could be pretty rough, and could be much improved upon if you generated your own raw phonetic output. You could smooth out, lengthen, shorten, or intonate individual phonemes that way, making the output sound much better. Basically, the translation device needed a good rewrite.

  34. Yeah, but... what about everyone else? by cd_Csc · · Score: 1

    We all know how annoying it can be when other people to have their cell phones ring in public places... the last thing we need is people listening to a monotonous computer voice in public. Not to mention the fact that its usually much more convenient to read text which allows for skimming and variable speeds.

    1. Re:Yeah, but... what about everyone else? by czardonic · · Score: 1

      the last thing we need is people listening to a monotonous computer voice in public.

      No the LAST thing we need is another whiny, irrational complaint about cell phones. Unless they were using speaker phone, how would this make a difference?

      --
      Takahashi Rumiko made beats! DON, taku, DON, taku. . .
  35. Re:Why dump more tech than necessary into the phon by Anonymous Coward · · Score: 1, Interesting

    Sure, the tech is cheap and relatively disposable, but is moving every feature but the kitchen sink into the phone really the way to do it? Why put a radio transmitter and receiver in the headset-- if you do that, then you'll need a battery to power it. Stay with the corded phones, I say! And, if you have more than one person in a city talking on these newfangled radiophones, you'll need a computer to set the radio frequency! My Gremlin's 8 track player/AM radio doesn't need a computer to change channels -- it's got those big preset buttons to move the dial for me. The cell phone may have all the power of an trs-80 these days, but we don't need to make it into an IBM-PCjr.

    p.s. and don't even get me started on digital phones... converting analog to digital to analog baseband to RF, and then back again!

  36. Oops, 1986 by DumbSwede · · Score: 1

    Sorry, I was off my 2 years.

  37. So what? by Erik+Fish · · Score: 1

    Why is this a big deal? I was doing text to speech on my Commodore 64 when I was a kid with a program called SAM (which was written in 1979). The C64 had what...1MHz?

    1. Re:So what? by Anonymous Coward · · Score: 0

      Ahhh, SAM. I remember it well. My friends and I can still imitate it, especially they way it would mispronounce "encyclopedia" (en - see - cloe - pedIa) and "hippopotamus" (hip - po - po - tAmus).

      Of course, we were young then (junior high) so we would also have it curse. "Super Dooper Bitch" was our favorite.

  38. Post your Ideas here! (please) by Uttles · · Score: 2

    "We are looking at devices that don't necessarily have a really powerful processor on board," said Hezi Saar, product marketing manager at Winbond. "Usually most of the accessories for handheld devices don't have the power to run text-to-speech algorithms and they don't have the huge memory capacity to support this feature."

    OK, so just imagine that in the near future anything and everything will have one of these small, low cost chips. Now, imagine the possibilities! Everyone I'm sure has their own ideas on how cool this could be, so go ahead and reply with yours!

    --

    ~ now you know
    1. Re:Post your Ideas here! (please) by Uttles · · Score: 3, Interesting

      1) My microwave at home displays "ENJOY YOUR MEAL" when it's finished cooking something, I'd sure love it if instead of cheesy LED's I heard a sexy voice saying "come and get it, baby."

      2) Text messengers for blind people. You know those little IM devices all the kiddies have? Well just put brail on the keys and have one of these chips installed... there you go.

      3) Watches. The next time somebody says "what time is it?" you just press a button and the voice chip in your watch simulating someone who sounds extremely pissed off shouts the time.

      Well, that's it for now...

      --

      ~ now you know
    2. Re:Post your Ideas here! (please) by Anonymous Coward · · Score: 0

      I've always thought that ruggedized computers that begged for mercy and sobbed when beaten would be a great idea.

      These new chips would allow the same functionality to be stuck in laptops and palmtops without grossly reducing battery life.

      Now, if they could only make them develop bruises that "healed" over the course of a few days...

    3. Re:Post your Ideas here! (please) by Anonymous Coward · · Score: 0

      I've always thought that ruggedized computers that begged for mercy and sobbed when beaten would be a great idea.

      "Arghhh! Arghh! Owwwww! O.K, O.K, I'll tell you! You've missed the semicolon from the end of your class declaration! OWWWW!"

    4. Re:Post your Ideas here! (please) by Anonymous Coward · · Score: 0

      The wrist watch exists. I blind friend of mine has one. And yes, it did sound pissed off

  39. Remember when Speak'nSpell was a new thing by iplayfast · · Score: 1

    They used speech parts to make up words in the same way. (Hope this sounds better though)

  40. They've had stuff like this for a while ... by wobblie · · Score: 1

    I remember one of my coworkers had gotten a service where you could email his account, which would forward to a voice mail text -> speech system.

    It was hilarious sending him obscene and or ridiculous emails and listening to the recorded voice play them back ...

  41. Already done. by Doc+Hopper · · Score: 2

    Well, I jumped ship to this little company I work for now called Talk2 Technology (free plug, I guess). We've taken a different tack in voice-enabling applications. I think there are different target markets -- the Talk2 stuff uses servers on the back-end, which go out and fetch your email to read it to you. Putting this on-chip in the cell phone itself is a great step in the right direction.

    Fundamentally it's a different approach than today's "voice portal" technology. Voice Portals retrieve data for you, and read it over standard cell or PSTN network. There are many benefits to this approach, principal among them being improved processing power for additional functionality such as voice-processing (speech to text, or compressing speech for reply email voice attachment). By putting the power into the phone, instead of at an expensive central office, this chip could either be a great advancement for text-to-speech technology, or a "killer app" that puts my company out of business :)

    Regardless, I'm excited to see this happening. I've long envisioned a PDA with the only interface being spoken, rather than requiring any video component. This would bring the power consumption and delicacy of these devices down within reason for extended usage. The downside is that speech is necessarily a rather slow interface to a machine; it will be interesting to see how we adapt speech for greater speed with speech-based devices, and how English as a whole will fare.

    Now that I've used voice-enabled email, it would be really hard to go back to the "old" way. I still do an enormous amount of correspondence every day by typing, but when I'm on the road I don't need to bother with a laptop since I can have my email read to me over the phone *and reply* with a voice message via email. Until you've used it, it's tough to realize how convenient it is.

    I want one of these for my Agenda VR3! Or something...

    1. Re:Already done. by RetroGeek · · Score: 1

      Voice Portals retrieve data for you, and read it over standard cell or PSTN network. There are many benefits to this approach, principal among them being improved processing power for additional functionality such as voice-processing (speech to text, or compressing speech for reply email voice attachment). By putting the power into the phone, instead of at an expensive central office, this chip could either be a great advancement for text-to-speech technology, or a "killer app" that puts my company out of business :)

      No, the place for the voice processor IS on the server. Think "thin client". If the the micro-code in the phone is wrong, then you need to upgrade all the phones, rather than just the server code.

      Let's not go backwards here....

      --

      - - - - - - - - - - -
      I am a programmer. I am paid to produce syntax not grammar. Deal with it.
  42. Don't count on Hearing a Slashdot headline by Erris · · Score: 1
    The blind have gotten a tremendous shaft from the WWW maintainers. Don't think so? Try surfing with images turned off! Most pages start off with a blizard of adverts, banners and other junk that gets in the way of real content. Think about having to listen to the same 12 item banner for each new page that contains one or two images, no real links and something dumb for text like a copyright notice. Many government sites are well designed. Other sites like Slashdot can be customized but many have gone in the exact opposite direction. Using images for navigation might look nice, but it usually is not.

    M$'s Front Page is one of the worst offenders. It's full of useless font adjustments and other needless code. Worse, it lables images crypticaly and encourages all of the worst practices.

    As Bill Gates once said, software is what is lacking in a world full of technology. He aims to keep it that way for those who trust him.

    --
    DMCA, Hollings, Palladium. What might have sounded like paranoia is now common sense.
    1. Re:Don't count on Hearing a Slashdot headline by shokk · · Score: 1

      Don't like ads and banners and pop-ups and pop-unders, etc? Take a look at things like WebWasher and Junkbuster. It can certainly keep the sighted among us from ripping our eyes out in frustration as the whole interface hangs waiting to load that pop-up. No doubt that will help those who do not have the benefit of sight from getting to the core of what they seek on the WWW.

      --
      "Beware of he who would deny you access to information, for in his heart, he dreams himself your master."
  43. Re:Why dump more tech than necessary into the phon by beretboy · · Score: 1

    studio.tellme.com

  44. Its about time... by jasno · · Score: 2

    I remember my first computer - a ti99/4a - had a box I plugged into the side that generated speech. It didn't sound all that good, but you could recognize it well enough. If I remeber correctly, it cost about $100.

    That was... 21 years ago. Its sad that this aspect of human computer interaction has been overlooked for so long. Its nice to finally see some development.

    --

    http://www.masturbateforpeace.com/
  45. New Products Are Key to This by Embedded+Geek · · Score: 1
    As other posters have pointed out, all this has been done before. I really don't see existing applications (EMAIL, etc) as benefiting from this.

    The real potential of this thing is its low power and (presumably) price. Sure, a PDA with this might have a certain "cool" factor, but the inovation would be in other, traditionally "dumb" products:

    • Embed this with a cheap GPS unit in a walkman to give directions while running or biking (or a soldier navigating through the woods).
    • UPS/FedEx tracking tags that can be interrogated without needing a scanner.
    • Tie into a cellphone and timer to do audio data dumps for personal products - your doctor can query your insulin pump.
    • Diagnostics in almost any product - washing machine, furnace, stereo - could be done without adding the cost or space of a visual display.

    Surely, other people have suggestions to add...

    --

    "Prepare for the worst - hope for the best."

    1. Re:New Products Are Key to This by Anonymous Coward · · Score: 0

      hooraaaayyyy, just what I want:

      Dryer: "you have excess lint in the lint screen"
      Washer: "you have an unbalanced washload"

      It will be just as annoying as Steve Jobs' Next machine saying 'you have mail'...it should have said 'you have a real os and not a bloated hack unix one with no decent api'

    2. Re:New Products Are Key to This by Embedded+Geek · · Score: 1
      Agreed - dryer lint would be enough to make me want to reprogram it with a ball peen hammer. But that's a user interface design issue.

      What I was thinking of was you press a recessed button and the pizeo speaker says: "On board timer - OK. Internal fuse check - OK. Motor check - intermittant failures. Possible intermitant short in motor housing or windings. Please contact your Maytag repair professional at (800)555-5555 and please indicate failure code 247."

      General rule: It'd be more specific if only a technician could reach it, less specific (code and phone number) if it was intended for a consumer to self test. Again, user interface issues.

      --

      "Prepare for the worst - hope for the best."

    3. Re:New Products Are Key to This by green1 · · Score: 1

      Imbed this with a cheap GPS unit...
      hate to jump on the bandwagon but... it's been done. Garmin Street Pilot III

  46. No - it IS for pr0n! by Haeleth · · Score: 1

    Silly - that's a mistake for "pheromones"! I'm not sure what a "spoken pheromone" might be, but it sure sounds sexy.

  47. Speeach teakes onleay ninee kayyy of memorray by Unanonymous+Coward · · Score: 0

    soa yoau kann uase iit inn yourr own prooagramez! Thats what i heard in summer 1994, when I got a small program on the commodore 64 called speech. it was pretty good except it sounded like a drunk robot. But it DOES show that you don't need much computing power for text2speach if a programmer can do it on a lowly commodore 64 using BASIC! for crying out loud.

    --
    The Unanonymous Coward
  48. Could do this, too... by Darth+Hubris · · Score: 0, Redundant

    http://www.mchawking.com/

    "Aw Yeah! This be some funky-ass shit I be laying on your ass!"

    --
    The party's over ... the drink ... and the luck ... ran out
  49. Mandarin (not the orange) by gbrandt · · Score: 4, Insightful

    If the you feel that you have to state 'not the orange' when using the word Mandarin in a language context, perhaps you should also state 'not the peoples of the England' when using the word English in the same context.

  50. Just what I need by endersdad · · Score: 2, Funny

    Another female voice calling my cell phone and telling my i'm offtopic...

  51. Apple II by KPU · · Score: 1

    I still have an apple II program that converts text to speech (and outputs to the PC speaker). It fits on a 5.25" floppy and its pronounciation was better than some of the more modern TTS engines. Sure, it choked on words like pneumatic, but it was cool to have a TTS engine run in real time on the 2 MHz processor.

  52. Winbond's Whitepaper by jdclucidly · · Score: 2, Informative

    I haven't seen anyone post a link to Winbond's own web page on the WTS701 Text-to-Speech Processor so, here it is straight from the mouth:

    Winbond

  53. Not many lines either by Unanonymous+Coward · · Score: 0

    And if I can remember correctly, it was less than 100 lines of code!(!)

    --
    The Unanonymous Coward
    1. Re:Not many lines either by Anonymous Coward · · Score: 0

      And if I remember correctly the C64 was about as portable as the entire continent of South America. Recognize this as a step beyond that and not "oooo! I have a C64 back in the day. Remember those? What do you mean by 'what have I accomplished since then?'"



      This new chip sounds more like another one of those modules for those damn handspring PDAs that nobody will buy, but which the military will also produce and have strapped to every soldier's left butt cheek.

  54. Top 5 things I never want my phone to say by uslinux.net · · Score: 1

    5. Your wife is on line 1
    4. Your ex-wife is on line 1
    3. Your proctologist is on line 1
    2. You've got mail... pattern baldness
    1. You've got spam

  55. they have the hard part backwards... by gol64738 · · Score: 1

    it doesn't take tremendous chip power to convert text to speech, it takes tremendous chip power to convert speech to text. well, not tremendous power, but you know what i mean.
    to give you an example, using SAM (Software Automated Mouth) on my Commodore 64 (1984) produced quality speech similar to speech synthesis products of today.

    so, what was the big deal again??

    1. Re:they have the hard part backwards... by t · · Score: 1
      Actually the computation power needed to convert speech to text is unknown because it is an unsolved problem. There are hacks that can guess using probabilites and such but none of them are anywhere near startrek levels.

      t.

  56. Imagine... by Anonymous Coward · · Score: 0

    Imagine a cell phone with this

    Well, that's kinda hard, but I can imagine a Beowolf clu... *slap*

  57. Bandwidth problem -- audio is dead by RobertGraham · · Score: 3, Interesting
    I guess I'm a little skeptical of all technology that attempts to supply "old" paradigms to new problems.

    The most important thing about the Internet is "bandwidth". I'm not talking bits on the wire, I'm talking how fast information flows into my brain. Speech is vastly slower than text as a medium for transfering information into my brain. I'm so accustomed to Internet speeds for information, I can no longer watch TV news -- the bandwidth is too slow. I'm glad I don't go to school anymore -- I could barely stand lectures when I was a kid, I would never be able to sit through them as an adult.

    Five years ago everyone in Japan walked around with their phone to their ears. These days, everyone in Japan walks around looking at their phone (instant messaging, etc.). I'm not sure if people "get" the bandwidth problem. Sound must be multiplexed into half-bandwidth, serialized communication. By this I mean you can only input or output at the same time, but not both. Also, incoming messages must arrive separately, not in parallel. With audio, I can only talk to one person at a time, with messaging, I can carry on multiple text-based conversations simultaneously. I mean, text-to-voice has long been availabe on PCs, but nobody uses it for ICQ/AIM/YahooIM/MSIM.

    As far as I can tell, audio is dead. Maybe somebody will invent some sort of hyperfast language (didn't Heinlein describe something like that in a book?), but I think the next wave is going to be something new that replaces reading text, not something that goes backwards to audio.

    1. Re:Bandwidth problem -- audio is dead by Tryfen · · Score: 1

      Maybe somebody will invent some sort of hyperfast language

      Doubleplus ungood idea.

      --
      If a square is really a rhombus, why aren't all triangles purple?
    2. Re:Bandwidth problem -- audio is dead by Relic+of+the+Future · · Score: 2

      While reading your comment, something occured to me: you're right, when using IM software, I can carry on multiple conversations, i.e. I can get information from multiple sources at once, BECAUSE, I can read faster than someone can type. However, might that be because just about everyone can't type as fast as they can talk? So, those multiple conversations... are you really getting more bandwidth, or is it just comming in burst whenever someone hits the 'send' button? And what about cost for context switching? I think that overall there is some gain on total bandwidth, but that each individual conversation takes longer.

      Consider this: have you ever been IMing w/ several people, but then called one of them because it was _really_ important to get that conversation done _fast_? I have, and it makes it much harder to try to keep all the _other_ conversations going.

      Fast speach-to-text could give you the best of both worlds, but that's, of course, still a long way off.

      --
      Those who fail to understand communication protocols, are doomed to repeat them over port 80.
    3. Re:Bandwidth problem -- audio is dead by epukinsk · · Score: 1

      At first it seems like bandwidth is indeed the issue. But eventually we all get tired of the crap. What it comes down to eventually is not quantity of information but quality. If you are carrying on ten IM conversations, are any of them actually any good? My best IM conversations are not half as satisfying as the most mundane voice conversations I have.

      And I do agree, that for tasks that require high data traffic (stock ticker) text is better than speech. But this hardly means audio is dead. If I am coding, I'd rather hear "Your girfriend is on the phone" than see it on the screen. I can process that information with a different part of my brain without interrupting my coding nearly as much. With a text message, I have to take my eyes off the code, and use the part of my brain that is used for reading.

      In addition, while eating breakfast I'd rather hear "You need to leave in five minutes" than see it on the newspaper I'm reading. I can process that information quickly without interrupting my current task when it is in audio form.

      -Erik

    4. Re:Bandwidth problem -- audio is dead by Anonymous Coward · · Score: 0

      Don't be stupid. The 1984-ish doublespeak was though of by Orwell because it satirically looked stupid to begin with. The fact that we use contractions and verbalized abstractions (da bomb?) means we are speeding up the language, albeit at a pace that isn't quite keeping up with today's hectic pace.

    5. Re:Bandwidth problem -- audio is dead by Cassandra · · Score: 1

      Ehm. Doesn't the content of the message matter even more? Try reading a maths textbook as fast as your IRC :-)


      You can get the chat much faster since it has lower entropy. An other way to put it is that plain talk is more redundant, video is of course an other magnitude more redundant. A completely different matter is ofcourse that we get the impression of a richer content. The amount of information per time unit you can take in is most likely of similar magnitude in radio and TV etc.


      Look for the book Norretranders, Tor - "Mark the World", if you are interested in this topic.

  58. Commodore C64: SAM by twms2h · · Score: 3, Funny

    Great achievement, my Commodore C64 could do that so many years ago that I don't even remember when it was. SAM, the speech synthesizer which could even "sing".

    Has anything new happened lately? ;-)

    1. Re:Commodore C64: SAM by Velex · · Score: 2

      Great achievement, my Commodore C64 could do that so many years ago that I don't even remember when it was. SAM, the speech synthesizer which could even "sing".

      ::nods:: I remember programming my TI-99/4A to read me a menu of games whenver I started it up. Then there was that text-to-speech program for my 386 that came with my soundblaster that enabled me to make my computer announce that it was booted up and ready for his l33tness Velex himself to use dos. Just a few months ago, my roommate download this monkey called Bonzi that talked to him, but Bonzi got annoying so my roommate shot him.

      I'm sure that there's been tons of text-to-speech programs that I've never heard of, no will I ever, because it's been done so many times before, and the AI required to get the computer to talk in an un-Vice Fearless Leader #42 fashion is beyond the grasp of even the most 31337 at the moment. What I would really like for my mobile phone is rudementary speech recognition.

      As it's been pointed out before, text is just simply faster than speech, and who knows, maybe twenty years down the road we'll all carry around little AIM or ICQ devices. What I'd really like to do, though, is skip the minature keyboards or fumbling on a keypad. Speech regonition is the way to go. I'd much rather tell my phone, "call pink" and have it call pink back at the hotel, than have it announcing all my spam to the world.

      No, unfortunatly nothing new's happened lately

      --
      Join the Slashcott! Stay away entirely Feb 10 thru Feb 17! Close all tabs to prevent autorefresh!
  59. Re:Why dump more tech than necessary into the phon by DdJ · · Score: 5, Insightful

    Sure, the tech is cheap and relatively disposable, but is moving every feature but the kitchen sink into the cellphone really the way to do it?

    I've got a co-worker, our Oracle admin, who's blind. As things stand, with most cell phones he can't do anything except dial out and answer calls. He can't use the built-in address book to place calls for example, because all of the info is in text on a tiny screen. With text-to-speech software on the phone, he'd be able to use the address book just like sighted folks, read text messages he received earlier even when he's in an area with no coverage just like sighted folks, and so on. This is a good idea.
  60. English (not the muffin) by Rand+Race · · Score: 1
    And here I was thinking it translated text to scrumy bread products, thusly being one step closer to Talkie Toaster. Crapazola...

    --
    Insanity is the last line of defence for the master diplomat. But you have to lay the groundwork early.
  61. Technology advances... by Znork · · Score: 3, Funny

    ...and everything gets slower. I read between 2-20 times faster than I can comprehend spoken language, depending on the junkfiltering that's possible.

    No way in hell do I want to read email on a cell phone (it's a PHONE. You _talk_ to people in it. If it was a generic mail reader it would have at least a 17 inch monitor and a keyboard that lets you type faster than .2 cps. I know this is a difficult concept to grasp for certain cell phone companies, but a phone, as opposed to a computer, does not have these things, and thus it _sucks_ for email and browsing, and will continue to do so until it has those things, at which point in time you will not want to carry it around because it aint gonna fit in your pocket anymore.). Nor do I want to listen to my email. I dont have the time or the patience for it.

    At least until the phone can give me an (intelligent) summary when I say 'Get to the point'.

  62. Talking Everything by 1alpha7 · · Score: 1

    The chip could enable items such as a teddy bear that lulls a child to sleep by reading a bedtime story with the pre-programmed voice of Winnie the Pooh.

    In the short term, however, Winbond is setting its sites on more sophisticated markets. Topping the list are power-sensitive mobile devices, such as PDAs, cell phone accessories and pagers. Also on the radar are automotive applications such as telematics systems and car stereos.

    Great. Now every damn thing everyone owns will be talking.

    1Alpha7

    --
    Live to be Moderated
  63. You know it's time to be scared when by t0qer · · Score: 1

    Your phone rings and says "Greetings Professor Faulken, Shall we play a game?"

    Sorry mods couldn't resist.

  64. Re:Mac OS has that [forever] by Tokerat · · Score: 1
    System 1.0 had that, at the introduction of the Mac it spoke to the crowd ("Hello, I'm Macintosh"). Mac has always had that.

    It's not a low-power chip, though, it's software, so this is mighty cool. I'm sure we're going to see a billion little "talking devices."

    I can't wait until I press the "Find" button on my TV and my remote control yells "between the couch cushons again, asshole!"

    --
    CAn'T CompreHend SARcaSm?
  65. So much for steganography... by Anonymous Coward · · Score: 0

    The EE Times has a story on a new chip from Winbond that can take ASCII or UNICODE text and convert it to either spoken English or Mandarin (the Chinese language, not the orange).

    Too bad... steganography would have never tasted so sweet...

    (Posted anonymously because, let's face it, that joke sucked.)

  66. What Happened to TI by farrellj · · Score: 2

    Texas Instruments used to have some of the best Speech Synthsis chips out there...I remember the TI/99 computer had a speech module, and one skiing game got both male and female realistic sounding voices out of the speech module. If they could do it in the early 1980's why can't they do it now?

    ttyl
    Farrell

    --
    CAN-CON 2019 - Ottawa's only book oriented Science Fiction Convention! October 18-20, Sheraton Hotel, Ottawa, Canada h
    1. Re:What Happened to TI by Detritus · · Score: 2

      From what I remember about the early speach synthesis systems, they cheated. The input to the synthesis module was a stream of phonemes, not normal text. The programmer had to translate the text into phonemes.

      --
      Mea navis aericumbens anguillis abundat
    2. Re:What Happened to TI by CodeShark · · Score: 2, Informative
      Yes, the programmer had to translate the text into phonemes, but you know what? That wasn't the hard part, because with a decent translation dictionary, you can get about 95% of the words right with a simple one pass "phoneme compiler" and a good set of rules. The hard part is that English is a highly inflected language without a constant set of rules for doing the inflections.

      Still, I was part of the team that made the first Apple II (at least in the State I lived in at the time) that could read from the screen back in 1981 -- to an "Echo II Speech Synthesizer" which IIRC came from Radio Shack.

      We took some of our stuff to the linguistics department at the University across town, and of all things, had the darn machine speaking understandable Japanese (from Romaji, or romanized letters) within a few days because the Japanese language is consistent not only in phonetic translation but also in inflection. It still sounded like a machine, but that was a limitation of the sound chip's internal phoneme library in the Echo II. The same program with one of today's chips would have sounded very near normal.

      Goes to show you how much more difficult spoken English is than most of us native speakers tend to realize, because I have yet to see a low cost implementation of a text to speech translator that was all that much better than what we were doing back in '81. (not that I have seen everything out there by the way -- I do have a life outside the PC world....occasionally :-)

      --
      ...Open Source isn't the only answer -- but it's almost always a better value than the alternatives...
  67. reading /. headlines... oh, the trauma! by digitalmuse · · Score: 1

    I'm just imagining the horrid phonetic contortions the poor chip would produce trying to interpret the acronym-laden jargon that us techie folks throw around so loosely. I mean, we have enough trouble agreeing on how you should pronounce something simple like MMPOG. And then there are people's habits of using code-speak and syntax fragments? the poor punctuation checker will shoot itself.
    The mind reels in horror...

    --
    "If I wanted your input on my pet project, I'd stick my hand up your ass and use you like a sock-puppet." - Muse
  68. Ironic ad coinky dink. by suso · · Score: 1

    While reading the headline for this article I looked up and saw the ThinkGeek add saying "How are you gentlemen....". Of all the possibilities that a chip like this could be used for, it will most likely in th end be used for jokes like sending messages saying "Would you like to play a game." Of cource, then you could send messages through the ikinator and it would say "Would you like to play a god damn game!"

  69. Re:Mac OS has that [forever] by Anonymous Coward · · Score: 0

    Actually, the Macintosh at release had a processor with considerably less power and RAM than even the lowest end Palm Device does now.

  70. language transulation? by josepha48 · · Score: 2
    It would be interesting if this could be used for language transulation. Turn on Spanish mode (when they get it) and type the spanish words you see into your phone and viola.. it tells you what it means in english. Sort of like a speaking babelfish if you will.

    Now all we need are really good speach to text converters....

    --

    Only 'flamers' flame!

  71. Re:Why dump more tech than necessary into the phon by Kraft · · Score: 3, Informative

    Wow! Your Oracle admin is blind? *im baffled*

    I have never worked with blind people, but after reading an article last year about how websites are getting more and more difficult for braille browsers (flash, imagelinks without alt tags etc.), I decided to make a lynx-friendly version of my site - and so should YOU!

    Anyways, how does he do it?? Is it worth it to the company you work for, or does it cause everyone else problems? Is he good? Tell! Hopefuly this could encourage others to take on "disabled" in their company....

    --

    -Kraft
    Live and let live
  72. Re:Why dump more tech than necessary into the phon by t0qer · · Score: 1

    Ok i've seen 2 schools of thought over the years when it concerns what some see as overkill in technology.

    1. Tech should be stripped down and basic I.E. just enough to get the job done.

    2. Tech should be overpowered to show what can be done with tech.

    Considering the price of technology has fallen considerably over the last 5 years. WHY NOT add everything but the kitchen sink. 5 years ago I never dreamed I could afford a gig of ram. Low and behold today's low priced pc-133 can be had for less than 20 cents a megabyte. Do I have a gig of ram? Yes I do! It doesn't matter if you call it an upgrade, addition, or bragging rights it was affordable so I went ahead with it.

    Any aspect of our lives becomes better when you throw more hardware at it. Freezers that keep inventory, centralized house controls, personal area networks. When the benifit > price and price a drop in the bucket it then becomes a necessary feature for the manufacturer to add in order to stay competitive, which in the end is a win for the consumer. If you want to stay in the dark ages with your low power palm pilot go ahead, i'm going to buy my onyx handheld for the same price.

    I can't remember if I saw that on a bugs bunny cartoon or school. I just remember a mouse dressed up like a professor, or was it elmer fudd the shoe maker?

  73. Re:Done that? Do this! by Mentifex · · Score: 1

    The ASCII-to-speech chip and the Perl program are good candidates for inclusion in a Perl-based AI Mind.

    Since is Perl is already a major Web and Internet language, imagine an Internet iMind roaming the 'Net and talking to people in English or Mandarine Chinese with the new speech chip being fed the ASCII output of the AI Speech Module.

  74. Memory density by pogofish · · Score: 2, Funny

    Unlike everybody who posted "big deal, my Commodore 64 used to hold long, sexy conversations with my Speak & Spell about the meaning of Wargames," I actually read the article. Near the end it says "The multilevel storage memory system allows the chip to store up to 256 different voltage levels, or the equivalent of 8 bits, into one EEPROM cell, which is up to 8x the capacity of conventional memories..."


    Being a software geek with my last classes in EE/CE several years safely my sordid past, I'm out of touch. Is this a big deal?

    --

    A man without a God is like a fish without a bicycle.
  75. Re:Why dump more tech than necessary into the phon by tych0 · · Score: 2, Insightful

    You have to remember that economics is what drives these things. If there are yuppies or geeks out there who want to have "every feature but the kitchen sink" in their cellphone, PDA, or whatever, there will be a company out there that will be happy to take their money to implement these technologies.

  76. Babel Fish by Anonymous Coward · · Score: 0

    One step closer to the Babel Fish ...

  77. Where's the application... by Dr.+Cam · · Score: 1

    ...that runs on any system that can do this reliably, i.e. so that it is readily understandable and pleasant to listen to, whose logic has been reduced to this chip. I'm not talking individual words, here. The production of speech from text has been a hot research topic for over 50 years, and every step forward is like Zeno's paradox. English is an extremely complex language, with a very large number of orthographic rules. From the description, it sounds as though the individual phones are calculated, and then smoothed, which isn't exactly how speech works (it's an approximation, but not always a good one).

    On the other hand, this could give rise to an entirely new kind of email virus. ;-)

  78. I'll show you low power... by x136 · · Score: 2

    My Macintosh SE (8MHz 68000, circa 1988) can convert text to speech no problem. It's not necessarily smooth and natural, but it can't be *that* much of a jump...

    --
    SIGFEH
    1. Re:I'll show you low power... by GiMP · · Score: 2

      Yeah, macs have great text->speech.. even the old ones. Although, the older ones had it on a chip.. whereas apple has since moved it to software.

      For obsolete machines, they still pack a punch as a server for text->speech conversion. :)

  79. Put it on the Device for Privacy, Security, and Ba by oznoid · · Score: 1

    Here's some reasons why you want it on the device, and not on the server:

    Privacy: I, for one, don't want to send my personal content through a portal provider. I don't want Microsoft getting all my mail, I don't want TellMe getting it either. And I don't want to have everything that I'm supposed to want available for me at the channel, with my usage stats, habits, and particulars sold to direct marketers or worse.

    Security: The more places you ship the data around to, and the more intermediaries involved, the more possibilities there are for sniffing, bad security, leaks, and misuse. Passing things through a provider means trusting them to maintain security properly, and I for one don't trust many people enough to allow that.

    Bandwidth: Alice in Wonderland can be shipped in full audio at several gigabytes, or shipped as a 100+kb text file and synthesized on the device. Cell connections are terrible, despite what the telcos are pushing in their media campaigns -- coverage even in the Bay Area is spotty; you lose signal as you pass in and out of cells, and there are network overloads and outages. Keeping it down to small text streams and synthesizing on the device means getting one step away from the unreliable, low-bandwidth networks available today, and 3G is a long. long way off.

    Kevin

  80. Re:Done that? Do this! by beretboy · · Score: 1

    interesting

  81. Re:Why dump more tech than necessary into the phon by Glytch · · Score: 2

    It's easier on bandwidth to just send a few hundred bytes of text than streaming audio.

  82. two words by jbellis · · Score: 1

    context sensitivity

  83. Re:Why dump more tech than necessary into the phon by mal0rd · · Score: 1

    Have you looked at your proposed solution? As I see it there are two ways that you could have it: 1.) Put the text to speech in the phone. This allows everything local and remote to be converted into sound. The server needs to do no processing, and much bandwidth is saved. Also, it gives the end user more control over the program (this is important). The disadvantages are that the phone would need a small, cheap chip. More power may not even be used because the wireless transmission doesn't need to be powered. This works when the main server cannot be contacted. 2.) Impliment the tech. at the server. This uses more bandwidth, cpu time (unless they have these dedicated chips) and is under control of the service provider. I think it's clear.

  84. Speech Quality by NachtVorst · · Score: 1
    This sounds really cool in theory, but how will the voice sound?

    I recently tried some of the shareware/freeware TTS engines I found on the web and the quality was pretty lousy. It still sounded like Operation Stealth on my Amiga.

    So if my desktop PIII can't do it, how are they gonna put this into a low-power chip?

    1. Re:Speech Quality by Infosquawk · · Score: 1

      Try "Mary in Hall," by microsoft.

      --


      OoO

      Please do not publish outside of /.
  85. Re:I bet it will choke... Looks not by tandr · · Score: 1

    Well, if algorithm in this chip anyhow recembles Bell Labs text to speech system, then it handles this text pretty well. I'll check home how XP's narrator handles it thou...

  86. And SAM, Atari 800. Note SAM is part of Win2000 by GodWasAnAlien · · Score: 1

    Microsoft bought up what was SAM and put it
    in Windows 2000.

  87. Great by ZaneMcAuley · · Score: 1

    A CPU that TELLS you when its time to upgrade :D

    --
    ----- Whats wrong with this picture? http://www.revoh.org:1234/whatswrong
  88. Re: your sig by Anonymous Coward · · Score: 0

    Hey, I know this question is somewhat off topic, but I've been using /. for a while and am stumped anyway. How in the heck do you get those Chinese characters into your sig?

  89. Let me get this straight... by Anonymous Coward · · Score: 0
    Imagine a cell phone with this, you can have your email read to you, rather than seeing a line at a time on a dinky screen

    Joe wants to tell Bob something. Joe turns on his fancyass cell phone, types a 12 word message on his 8 by 4 character screen using a keyboard the size of his thumb, to send to his buddy who then uses his phone to read the text to him?

    Why in the world would anyone want to do that? JUST TALK INTO THE DAMN THING ALREADY!!!

    Man, we so deserve to get wiped out by a comet...

  90. Re: your sig by Anonymous Coward · · Score: 0

    Unicode. For example, "&#" + 28961 + ";" =

  91. Commodore 64 has that! by bgarcia · · Score: 2
    My Amiga was talking to me 15 years ago.
    I can't remember the name, but I had a program for my Commodore 64 that did text to voice.

    Does anyone remember the name of this program? I think it was something like "Simon Says".

    --
    I'm a leaf on the wind. Watch how I soar.
    1. Re:Commodore 64 has that! by Anonymous Coward · · Score: 0

      The program was S.A.M. I have no idea what it stood for, but it was a very nice little program, did about as well as most of the things I hear today on the PC platform.

    2. Re:Commodore 64 has that! by Anonymous Coward · · Score: 0

      That would be "Sayit Sam". Very clever program, and I think one of the first ones of its kind.

      I actually programmed it to sing happy birthday to my girlfriend (I was in highschool at the time). We broke up not long after that. Draw your own conclusions. :-)

    3. Re:Commodore 64 has that! by Bitsy+Boffin · · Score: 1

      I had this. At the same time I also had a "Voice Recognition" unit that plugged into the user port as I remember. Anyway, I wrote a program in the speech system (it was pretty much basic with a "SAY" command IIRC) that interacted with the voice recognition system so I could have a kinda disjointed "conversation" with the old C64.

      Problem was the voice recognition wasn't that great, and I usually ened up shouting at the unit in some wierd accented voice trying to get it to recognise "Yes" instead of "James" or soemthing like that.

      Ah, good times, good times.

      --
      NZ Electronics Enthusiasts: Check out my Trade Me Listings
    4. Re:Commodore 64 has that! by Anonymous Coward · · Score: 0

      SAM = Software Acoustic Mouth.
      I believe it was on Apple II also.

    5. Re:Commodore 64 has that! by Emil+Brink · · Score: 2

      I'm pretty sure the name was "SAM". I found this page about a C64 speech synthesis program named SAM, but I'm not sure if it's the same one. Sounds right though, I remember that running SAM added a SAY keyword to the C64's built-in BASIC... Aah, nostalgia!

      --
      main(O){10<putchar(4^--O?77-(15&5128 >>4*O):10)&&main(2+O);}
  92. Other langages support ? by Monsieur_Marco · · Score: 1

    The support of TTS software for other languages than english, seems to be quite low. I tried every software I am aware on for french TTS (Mbrola, Euler), and it is not able for me to read a text correctly. This TTS feature is most useful for visually impared people, some blind friends of mine even had to use english TTS with french text, and I sware it sounds horrible

  93. Re:Why dump more tech than necessary into the phon by PoiBoy · · Score: 1
    I've got a co-worker, our Oracle admin, who's blind. As things stand, with most cell phones he can't do anything except dial out and answer calls...

    This situation seems like a case where speech-to-text translation would be more useful. For example, instead of having to lsten to an address book being read to him, he could simply say "Call Bob Smith" and have the phone dial for him. In fact, some wireless carriers already offer this.

    --
    Sig (appended to the end of comments you post, 120 chars)
  94. Re:Why dump more tech than necessary into the phon by DdJ · · Score: 3, Interesting
    Wow! Your Oracle admin is blind? *im baffled*
    ...
    Anyways, how does he do it?? Is it worth it to the company you work for, or does it cause everyone else problems? Is he good? Tell! Hopefuly this could encourage others to take on "disabled" in their company...

    He's got a variety of tools at his disposal. Just the other day, he gave a demo of some of them to a bunch of us.

    He's got an 8-dot braile terminal that gives him enough characters to do C and Perl programming. He's got a hardware speech synthesizer he cranks up to something like 200+ words per minute. I tried, and could only understand a few phrases when it was cranked up to 95 words per minute.

    And when a web site he needs or wants to access is inaccessible, he complains to them, and sometimes things get fixed. He can navigate web sites that use alt tags remarkably well. A good rule of thumb is that if a site makes sense with images turned off (or in lynx), then it'll work for him.
  95. Any one remember... by Pope+Slackman · · Score: 2

    The General Instruments SPO256 chipset?
    The '256 took coded phonemes an outputted audio,
    while the other chip in the set (don't remember the name) took ASCII serial data and
    converted it to phoneme codes the '256 could understand.

    This set has been around for prolly close to 20 years now. (I remember finding a variant of it in
    the Intellivison voice module ["Bee Sevunteen Bahlllllmer"!] that I believe was circa 1984.)
    The '256 has been discontinued for a long time now, and I'm kinda excited to see
    something similar to it show up, it was a cool gadget.

    C-X C-S

  96. Wow a VOTRAX on a chip. by Anonymous Coward · · Score: 0

    Welcome to 20 years years ago.

    it's still gonna like sound azz.

  97. Why its quicker just to read and type by t_allardyce · · Score: 1

    robotic synth voice: eeeeeeuoo hhhaaaaveeee waaaaauuuunnn neeeeeeuwwwwww meeeeeessssaaage. meeeeeessssaaage seeeennnnt tweeeelveee foooorreeteeee tuuuuuueeee. meeeeeessssaaage reeeeeeaaads: heeeeeeeeeloooooou.
    1 hour later/etc...

    [translation - you have one new message, message sent: 12:42 message reads: "hello"]

    and then you respond with voice recognition:

    "hi"
    > high

    "no! HI"
    > go, high

    "HI"
    > bye

    "HIEEEE EEE"
    > i

    [type] "hi"

    --
    This comment does not represent the views or opinions of the user.
  98. Re:Why dump more tech than necessary into the phon by Dr+Caleb · · Score: 2
    I've got a co-worker, our Oracle admin, who's blind. As things stand, with most cell phones he can't do anything except dial out and answer calls

    I hope he practices safe cell phone use and doesn't call out while he's driving.../humour

    --
    "History doesn't repeat itself, but it does rhyme." Mark Twain
  99. Sad state of affairs - Mandarin not an orange? by Anonymous Coward · · Score: 0

    If the Slashdot crowd cannot identify a language spoken by 25% of the world, it shows how ignorant USians are.

  100. Old News by AndroidCat · · Score: 1

    I've had a PC card (based on late '70's technology) for ages that does RS-232 to speech. (© 1986 B.G. Micro)

    Okay, it doesn't do great speech, but it made a dandy talking clock when hooked up to cron. "BONG! BONG! BONG! BONG! BONG! BONG! The time is six o'clock!"

    I'd use it for something, but ISA card slots are rare these days. I'd power up one of my horde of 486/66's to fit it, but that's too silly even for me.

    Is it just me, or is this a solution still looking for a problem to solve?

    --
    One line blog. I hear that they're called Twitters now.
  101. Re: your sig by Anonymous Coward · · Score: 0

    viewing the source html reveals all:

    & # 22825; & # 32; & # 19979; & # 32; & # 28961; & # 32; & # 19978;

    (remove spaces to get characters to show)

  102. Well, it's been a while since the SC-01A went away by SWPadnos · · Score: 1

    This improves upon an old chip by Votrax, called the SC-01A.

    That chip required you to pump it phonemes, rather than text. Phonemes are the actual sounds that are strung together to form syllables, then words - there were 64 of them, including pauses of varying length. So you had to have an external microprocessor with a text-to-phoneme algorithm (not too big - just a few K in 6502 or Z80 assembly), then you would feed the phonemes to the chip. This chip actually sounded pretty good (for its' day) if it was used correctly. You usually had to have some good filtering on the output, and had to be careful about generating phoneme strings.

    The chip in the TI-99/4A worked the same way, except that the speech synthesizer cartridge included the text-to-phoneme algorithm (and a new BASIC command "SAY". It worked ok, but not great. Unfortunately, the implementation of the "say" command in TI BASIC didn't allow you to specify phonemes, so you had to do things like "tayck thuh kawttin owt uv yoar mowth" to get it to pronounce things correctly. It still sounded worse than the computer in War Games. I'm not sure if this chip was the SPO256-AL2 or not, I seem to remember that TI was down on using chips from other companies at the time.

    So, the big deal is that they now have an easy way to dump text in one side of a chip, and get something you can drop into a speaker on the other side. Plus it does it in two languages.

    That's a far cry from the crap we had in the 80's. But, you should note that it's not a speech synthesizer, it's just a fast playback device, which chooses the order of playback based on text input.

    As for all the people mentioning the various software speech synthesizers, don't forget SAM - "Software Automated Mouth" for the Commodore 64. I think that was actually based on something that had been around for the Atari 400/800's since the very early 80's. And none of those computers was on a single chip (though they're about as powerful as a modern watch :)

    A friend and I had considered making a small (EEPROM-sized) board to emulate an SC-01A. We didn't plan to put the text to speech on it, just an EEPROM full of recorded phonemes and a little microcontroller to interface to the outside world. Never got around to it - oh well.

    --
    - The Sigless Wonder
  103. seeking more info by Technodummy · · Score: 2


    Are there any websites where you can get a review by a blind person? or anything similar?

    We can talk about web standards until we are blue in the face, but when we stop certain people from being about to use the web, that's more than a failure of standard.

    1. Re:seeking more info by Anonymous Coward · · Score: 0

      Have a look at http://www.cast.org/bobby/ . We are developing web stuff for a blind organisation, we run everything through this to check they're able to read it properly with their tools.

    2. Re:seeking more info by Kynn · · Score: 1
      --
      Kynn's page: http://kynn.com/
  104. you by Anonymous Coward · · Score: 0

    are a moron

  105. Need assistance ? by Anonymous Coward · · Score: 0

    Speaking of getting assistance from the network, check this out : http://www.logilab.org/narval/ if you've ever been thinking about getting some actually useful AI stuff to get work done.

  106. Compare to ... by Magnusite · · Score: 1
    Forget about the old TI chips and the Intellivision speech module and such. Don't compare a single chip solution to an entire computer such as the Amiga, Mac, or PC. What I want to know is, how does it stack up to the Kurzweil 3000?

    http://www.kurzweiledu.com/kurzweil3000.html

    We had one of these in my public library back in '92. It could scan, OCR, and read a page to you within about 10 seconds. The prosody was actually fairly good too -- you could tell it was artificial, but it didn't sound like crazy pickle face man. <Adam Sandler>Hey, I'm crazy pickle face man! I got a pickle on my face. Now give me some candy!</Adam Sandler>
    Of course, it cost in the tens of thousands back then but hey, we expect that kind of technology advance don't we?

  107. Interesting chip by Looke · · Score: 1

    Perhaps more interesting than the voice technology is the actual chip, which is able to store 8 bits in a single EEPROM cell instead of the usual 1 bit. This is achieved by a multi-voltage technique. 256 voltage levels of course give 8 bit capacity. Voila, the actual size of the chip is potentially reduced to an eight of conventional technology.

  108. Great, now how about a chip for speech-to-text? by ism · · Score: 1

    Mobile phones are already usable by blind people, and while it's certainly nice to have this added functionality in for them, I would like to see something for the deaf and hard of hearing first. As it stands, most mobile phones are completely useless to the deaf. Attaching a teletype machine is only possible on very few models, and lugging a teletype machine defeats the purpose of having a small phone. Considering that many modern digital phones have texting capability through SMS or other technologies, and some mobile networks support 711/relay service, it doesn't seem like too much trouble to add teletype support directly into a mobile phone. A chip that could demodulate the tty codes would be almost all it takes.

  109. Re:Why dump more tech than necessary into the phon by Anonymous Coward · · Score: 0

    i wish i had some mod points so i could mod this off topic.

  110. Re: extended usage by kr4jb · · Score: 1

    I've long envisioned a PDA with the only interface being spoken, rather than requiring any video component. This would bring the power consumption and delicacy of these devices down within reason for extended usage.

    This chip uses 53 milliamps. Explain to me how this voice interface will be less power-hungry than an LCD and a few buttons.

    I am not sure how much power a Palm display draws, but my phone (when not on a call) uses 9 milliamps without a back light, 29 milliamps with a back light. That's TOTAL, including powering the radio components and processor. Meanwhile, your 53 milliamps is just to power the speech chip. Not to mention the fact that audio speakers are very power-hungry.

    Furthermore, an LCD display shows a lot more usable information in a shorter amount of time than any serialized voice interface. And for now, we will ignore the huge amounts of power required for speech recognition.

    So again, explain how this speech interface will bring your PDA power consumption down?

    --
    // Alan Porter
  111. Re:Dr. Sbaitso vs MC Hawking by kr4jb · · Score: 1
    --
    // Alan Porter
  112. No need to imagine... by Anonymous Coward · · Score: 0
    "Imagine a cell phone with this, you can have your email read to you, rather than seeing a line at a time on a dinky screen, street directions from a website, or even Slashdot's headlines. :)"

    It's already here.


    60623

  113. Hey by Infosquawk · · Score: 2, Interesting

    I like to convert text to mp3s for long journeys so I can listen to Dickens on my Rio. Of course, that takes a lot of disk space. I'd much prefer a little handheld device that simply converts the .txt file which is much smaller, to speech.

    I'd pay for it, and I bet a bunch of other people would too.

    --


    OoO

    Please do not publish outside of /.
  114. English to Mandarin? by shokk · · Score: 1

    Isn't this just a step away from an English to Mandarin translator, in itself a step away from being a universal translator for the most common Eurasian languages? You know this has military implications, with a US Tour of Duty over Taiwan. And Carnivore usage is pretty much guaranteed if this isn't already spawned from that project.

    Still, doesn't this type of tech get in the way of learning new langauges? Learning languages is known to foster some good brain wiring in early ages, so this tech throws that out the window if people are no longer going to need to learn the language. Just take out your C-Pen, scan the text, and it will speak the text into your bluetooth connected earpiece.

    Makes me wonder if other tech that we introduce to our kids will affect them, making them too dependent on the tech. Those PalmPCs are great to help you remember things, but I know I start to rely on it and become more forgetful. Or maybe this will be a boon because we have so many things to remember that it no longer fits in our skulls. Time will tell.

    --
    "Beware of he who would deny you access to information, for in his heart, he dreams himself your master."
  115. Re: your sig by shokk · · Score: 1

    Babelfish seems to say the sig means something like "World Above". Which just goes to show that this is nothing new, just that it has become hardware capable so that it becomes portable and fast enough to translate in real time. Definitely has military use. Of course, that's also probably badly translated, showing the limits of the tech.

    --
    "Beware of he who would deny you access to information, for in his heart, he dreams himself your master."
  116. already done? by ironfroggy · · Score: 1

    Forgive me if I am making a mistake, but isn't this old tech? I've seen Text-2-Speech chips for sale in at Radio Shack for like five bucks. Not with the Chinese, however. But, still...

  117. Application Idea by Dolohov · · Score: 1

    If I could get my hands on one of these, I'd hook it up to a PIC and a Compact Flash card in IDE mode, then store Project Gutenberg text files on the card with an index so that I could select a title by having the device read them to me. Sounds like a summer project...

  118. Re: your sig by Anonymous Coward · · Score: 0

    watch out... someone's going to make a unicode goatse.cx

  119. Nice, but... by Anonymous Coward · · Score: 0

    As witten by another poster, the Amiga speech system (translator.library IIRC) did that job very well about 15 years ago. Unfortunately it was never ported to other languages than English, and other platforms when Commodore died, for licensing problems.
    There were speech systems for other microcomputers in the late 70s or early 80s, but the quality of the Amiga system was way superior. The chip used at that time in many toys (SP0256 IIRC) didn't implement inflexions and emphatic characterizing, but a simple allophone->sound conversion table; the Amiga voice did, and it was amazing.

    However, it's a nice product , and being low power it can be added to a great number of small and portable devices too.
    Well, just imagine (what mess would produce) a Beowulf cluster of these things!:)

  120. Not ready for phones quite yet by neile · · Score: 1

    Yeah, it'd be cool to have your phones read your email to you. But, if the chip just supports English (I'm ignoring the Mandarin part, I know, but bear with me), it's a long way from making it into real cellphones.

    A cellphone isn't built to serve just English people. Phone manufacturers cram multiple languages into a single phone, and the chip would have to support all those languages at once. The manufacturers to this so they can make *one* phone and sell it to all the countries in Europe. This is, of course, less of a problem in North America, but let's face it, Europe is a vastly bigger cellphone market. The last thing a phone manufactuer is going to do is throw a chip in that costs them more money but only serves English-speaking people.

  121. Good for accessibility by kimihia · · Score: 2

    Such a device will be very handy for people that have visual impairments. Instead of the current bulky and expensive kits, this will be an improvement, especially for VI users out-and-about.

    What can you do? Make your web pages accessible for a start.

  122. Mbrola and ircha (and festival) by NoP_2k · · Score: 1


    There is a package called IRCHA (a perl plugin) to add text to speech capabilities to xchat, bitchx and MIRC/W32 under Linux and Windows. It uses a program called MBROLA (freeware but restricted for non-militar and non-comercial users).

    In the other hand, anyone knows how to use festival??? I've installed potato packages and:

    [sromero@compiler:~]$ festival
    Festival Speech Synthesis System 1.4.1:release November 1999
    Copyright (C) University of Edinburgh, 1996-1999. All rights reserved.
    For details type `(festival_warranty)'
    festival> (SayText "hola")
    #
    festival> (SayText "hi")
    #

    ARGHHH!! I'm not able to make it running!

    CU!

  123. Phew by pne · · Score: 1

    can take ASCII or UNICODE text

    Phew! For a second I thought it had said "ASCII or EBCDIC text"....

    --
    Esli epei etot cumprenan, shris soa Sfaha.
  124. The State of the Art... by Sam+Lowry · · Score: 2, Informative

    Specialized chips for TTS applications has
    been around for a while... The problem with
    their acception is that they have poor voice
    quality. Actually, ther are tho quite different
    technologies available to produce text nowadays:
    1. Diphone synthesis and its variations. The idea
    is to have one sample of each sound compination
    (diphone) in a speechase and produce the actual
    speech by manipulating those sounds. This is what
    give computer-syntethized, somewhat metallic speech
    that most people have already heard somewhere and
    this is what actually used in low-powered devices,
    handhelds and speaking dictionaries.

    2. Corpus-based synthesis. The idea is to store
    a few hour of the speech of a highly trained
    speaker in the speechbase and select fragments
    of this speech that suit best for the genaration.
    The second approach gives astonishing results with
    the quality of the speech being sometimes
    undistinguishable for the human. However, the size
    of the speechbase is an issue. You can not fit a
    300Mb speechbase onto a handheld hevice yet
    and hardware optimizations dont help much when
    it conserns fetching data from the speechbase
    and performing text-to-phonemes conversion.

    Several companies have corpus-based synthesis
    demos on-line. Check out SpeechWorks' and
    Lernout & Hauspie's sites

  125. Shit Talker on cell phone by __4096 · · Score: 1

    I'll be happy as long as it comes with a copy of Shit Talker built into the phone. I envision a bold new era of prank calls...

  126. Re:Why dump more tech than necessary into the phon by Doc+Hopper · · Score: 2

    Well, not necessarily. The cell and PSTN networks are designed around carrying audio and that is still what they do best. Today, it's a toss-up as to whether it's better to approach text-to-speech from the back-end (where you can have more flexibility) servers, or by embedding pieces into phones which gives you a whole new set of problems and potentially great solutions.

    The problem is, the idea of using this tech in phones is fighting against hundreds and hundreds of millions of deployed telephones without any tech newer than perhaps a microchip for caller ID. Over the long-term, text-to-speech embedded in the device is the more efficient and user-controllable format. Over the short haul, though, we're going to see many years still of central-office-controlled voice apps on your phone.

    Niche applications, like on a Pocket PC, now there something like this would absolutely rock. Get a toehold, and eventually low-power text-to-speech and speech-to-text devices will be all the rage.

    Now if only someone would perfect a speech-to-text engine that didn't require hours of training to recognize my accent...

  127. Re:Why dump more tech than necessary into the phon by mgroeber · · Score: 1
    I've got a co-worker, our Oracle admin, who's blind. As things stand, with most cell phones he can't do anything except dial out and answer calls. He can't use the built-in address book to place calls for example, because all of the info is in text on a tiny screen.
    He may be interested in a product we have been working on for a couple of months now: we added software to produce speech output to an off-the-shelf Nokia 9110 Communicator smartphone, giving access to essentially all the functionality of a "modern" phone (phone book, calendar, SMS, settings, WAP etc.).

    Alternatively, output on a Refreshable Braille display can also be used.

    At the moment, this is GSM900 only (so it is mostly a European thing for now), but with the arrival of the Nokia 9290 in the States by mid-2002, it will also be available in the US.

    You can find more info on the device here.

  128. +1 Funny, +1 Informative on the MQR standard by MarkusQ · · Score: 2
    The Chaos

    G. Nolst Trenité

    Dearest creature in creation,
    Study English pronunciation.

    Spot on! Not only would it have to disambiguate homonyms by semantic context, it would even need to use poetic context. Great poem!

    -- MarkusQ

  129. thanks Kynn by Technodummy · · Score: 2

    thanks Kynn