Slashdot Mirror


Ask Slashdot: Linux and Telephony

This one is a doosy. I've received various submissions from people who were looking for information on how to make their Linux box into an answering machine. I've also received submissions asking about Voice Synthesis and Speech-To-Text. I have to admit I haven't found much information on either while browsing on the net, so I'm turning the question over to you folks. However I wonder if there isn't a issue hidden here? Can Linux be used as an Interractive Voice Response(IVR) platform? If not, why not? First off, let's NOT forget the actual questions:

Metiu and Sri both want to know if a Linux box with a voice modem can be used as an answering machine.

Gextyr is looking for information on Voice Synthesis packages that are available for Linux.

This Clan AC Member wants to know if there are any applications or APIs for Linux that deal with Speech-To-Text or Text-To-Speech.

Lastly, there have been quite a few submissions asking whether or not Linux can be used as a demand fax server. Can it?

If Linux can be used for all of the things above, what's stopping it from performing as an IVR system? IVR systems are simply systems designed to use a telephone as the computer interface (using both touch tones and voice). IVR systems are used everywhere, from your voice mail, to ordering systems, and corporations are adopting more and more IVR systems for various tasks.

I've seen IVR implemented on DOS systems but most of these have moved to NT. What's preventing Linux from operating in this market? Are there existing IVR projects in progress, or is this another area where Linux falls behind?

153 comments

  1. Linux & IVR by Anonymous Coward · · Score: 0

    I have another question to add to the ones that started this all out: Can an entire VMS system potentially be developed on Linux (either enterprise or service provider)? I used to work for a company that used the NMS (- I think) TX2000 boards to do SS7 (MTP1 and 2). We used solaris x86; which NMS had no driver for at the time so we received the source for that and ported it. Trillium is a company that provides source code for various signalling protocols (at a huge price), we used their MTP3-ISUP and wrote the rest from scratch. We used the dialogic boards on top of SCO to do the non-signalling stuff. It seems like an open source ISDN and SS7 stack would be pretty useful here. I've tossed around the idea with some friends but we've never got it together enough to go for it. A company based around providing support for open source signalling stacks (national variant development etc).
    What is the general opinion, could big telephony platforms be run on top of Linux? We did around 100k subscribers on a system running various types of Solaris, SCO and Oracle. It doesn't seem like Linux is much less stable than either SCO or Solaris (SCO where we got lost UDP packets on the loop back interface).
    Thanks
    gid-fu
    (I forgot my password @ work and am sick @ home...)

  2. Completely off-topic non-answer by Anonymous Coward · · Score: 0

    'Why' is such an over-rated question. 'How' is much more interesting. The numbering of buildings around Killian Court (dominated by 'The Dome') is symmetrical: 1 is across from 2, 3 is across from 4, etc. There is local symmetry elsewhere on campus: one side of vassar street are buildings numbered 40's...on the other side the 20's and 30's.. Buildings with an 'E' are on one side of Ames street, buildings with W are on the other side of Mass Ave. Fascinating stuff, but not as interesting as the color symbolism of all the domes and steeples at Harvard. If you can explain that, there's a thesis topic in there somewhere...

    Signed, an overworked yet procrastinating grad student in E15.

  3. KVoice - don't give up! by Anonymous Coward · · Score: 0

    KVoice relies on [mv]getty, Kvoice doesn't actually handle the phone, it looks into the directory where vgetty puts the messages (/var/spool/voice/incoming on my system). With vgetty you can automatically pick up a call coming in (rather than having it recorded) by lifting the handset and pressing the "#" key. Recording stops and you don't have to be logged on for this. You can do this from any phone in the house.

    As far as KVoice being unstable, I started looking into KVoice as a nice front end and, yes, I've noticed its instability. However, in its defense it is just at a 0.3.X release. I've fixed a couple of bugs that've improved things, but I want to add a few features such as:
    * a "play all new messages" button
    * pause/speed up/slow down buttons

    Also, KVoice does conversion from the RMD file format to an intermediate format before playing causing long messages to have awkward delays before sound is heard. I've hacked this code in my version which now does the coversion on the fly in parallel with playing it (pipes are really useful); sound is heard almost immediately even on long messages. Perviously, a 2 minute message would not be heard for 15-30 seconds when pressing play.

    When I get the rest of it put together, I'll forward the changes off to the KVoice developer. So don't give up on KVoice just yet.

  4. Voice modem uses sound card... I don't think so by Anonymous Coward · · Score: 0

    You do not need any type of sound card to use a voice modem. If you want, all the "voice data" that is recorded or sent is through the serial port. (for many voice modems, that is)

    The sound card hookups are not required, they are a "bonus".

  5. ARGH! I have this working by Anonymous Coward · · Score: 0

    I wanted to do exactly what you are wanting to.
    I used vgetty and mvm (i think it is bundled with the latest vgetty source)

    I actually stripped a bunch of stuff out of mvm and just used the script functions.

    I have a system that can be dialed into, someone can leave a voice/fax or make a data call.
    If I dialin I can check my messages, setup ppp to dial back or use my x10 interface to control power to various devices in the house.

    Use vgetty and mvm as a starting point.

    Chuck Moss

  6. Re:Rsynth, festival by Anonymous Coward · · Score: 0

    How to you set up festival to use a female voice?

  7. DDlinux Speech Recognition Mailing List by Anonymous Coward · · Score: 0

    Don't bother with this list. I checked it out for a while. It mostly consisted of a bunch of losers whining about "Why won't Dragon Systems port their Dragon Dictation program to Linux for us?" A lot of time was spent composing letters to Dragon and analysing the resulting replies. The high point of the discussion was when a Dragon employee emailed the list to tell them the their chances were lessening with every email they sent.

    The dd in ddlinux means this ain't a general discussion list. It's a list dedicated to Dragon Dictation for Linux.

  8. Text2Speech by Anonymous Coward · · Score: 0

    This is something I need very badly... I like my
    machine to ask me how I'm doing in the morning,
    and it'd make life alot easier.

    Anyway, I've never found a "good" text2speech
    synth for GNU/Linux. There is a neat project
    though, named "Festival" which is trying to
    create a module text-2-speech solution for
    translations and stuff like that. I got it
    awhile back, and stopped messing with it after 2 days of tinkering couldn't get the thing to talk...there is almost no docs for it. You may want to look on SunSite for some stuff.. There
    use to be a linux/sound/text2speech directory on
    there with some neat, but useless tools. YOu might get something to work!

    Good luck.

  9. Voice over IP+encryption+answering machine by Anonymous Coward · · Score: 0

    Theres also a SpeakFreely/SpeakEasy for
    unix systems i think..it does encryption
    and answering machine stuff..try speakfreely.org
    or speakeasy.org..i forget which.

  10. Rsynth, festival by Anonymous Coward · · Score: 0

    There's always rsynth. The sound quality isn't great, but it's
    small and easy to use. My daughter had fun with it, setting
    up a shell script to say "Later, dude!" when she logged out
    of KDE.

    I also tried Festival a while back, and did eventually get it
    to work. The sound quality was better than rsynth, but to me
    at least, it wasn't THAT much better. The people who did
    Festival may be brilliant researchers, but they could use a
    few lessons in the user-inteface area. If I remember, you
    have to run an iteractive program, and then type Lisp-like
    commands to get it to say stuff.

    1. Re:Rsynth, festival by thomasa · · Score: 1

      If you do "festival --tts" or "festival --tts filename" it bypasses all that lisp stuff.
      It will read out the contents of filename. I have
      set it up with a female voice and use netcat to
      attach it to a port on my client machine, then
      other computers on the network can telnet to it
      (using expect) and send ascii words that get spoken on my workstation. E.g., "I cannot ping
      the mailserver at so and so". Well, I think it
      is interesting. tts of this quality used to cost
      thousands.

  11. Re:I'm using an *ordinary modem* + soundcard for I by Anonymous Coward · · Score: 0

    Me too.. ;-) This would be great for my shop. Thanks!

  12. CTI for Linux by Anonymous Coward · · Score: 0

    I've ben told by the Dialogic corp. rep. that they have Linux support for their cards in the making...

  13. Linux Telephony - some good answers by Anonymous Coward · · Score: 0

    Well, mgetty+sendfax+vgetty also works great with the modern 3COM/USR V90 modems. I have one of these and a Linux box working as an answering machine and set up FAX so it prints directly to my printer. I also use the same PC to connect to Internet using pon/poff and there are no conflicts, everyone is happily sharing the modem.

  14. Voice BBS by Anonymous Coward · · Score: 0

    I have a friend who's written up a complete voicemail BBS, public postings, private mail, online games, the works. It's all written using bourne shell scripts and a program which does the data dumping to and from the modem (similar to vmcp from the mgetty package). It runs well on two phone lines, two voice modems, and a 486. Try contacting him: zcat@coders.net, and/or ring +64-7-855-8334 (Hamilton/New Zealand) for the voice mail system.

  15. Dialogic support?--- Since when? by Anonymous Coward · · Score: 0

    We use dialogic products with SCO as well and the last thing I had heard on the subject was that they were considering it and there is a survey you can fill out.

    A search of their website turns up nothing but that survey:

    http://www.dialogic.com/uk/forms/ossurvey.htm

    I couldn't find anything on their press release section either.


    If you are intersted in IVR's under Linux I suggest that you go fill out this form.

  16. More links. by Anonymous Coward · · Score: 0

    There are some more links to speech- & voice-related stuff at the Adaptation page of Gary's Encyclopedia

  17. Cannot be used - NO DRIVERS! by Anonymous Coward · · Score: 0


    Linux drivers for IVR hardware simply do not exist, from any vendor. As an example, my system has a Dialogic Dialog/4 in it, but I can't use it because of this. Now that Microsoft has invested in Dialogic, I don't expect this to change.

    There is a way to use your Rockwell modem though, from what I have heard.

    If I'm wrong about the IVR hardware, please let me know at mark@knm.org

    Thanks


  18. Timely Mtg (Boston): Voice Recognition on Linux by Anonymous Coward · · Score: 0

    The next meeting of Boston Voice Users will be on
    Tuesday, April 13, 7:30 p.m.,
    in MIT room 2-135 (directions below).

    Topic: Speech recognition for Linux? Can it happen?

    Directions to room: Go in the main door of MIT, at 77 Mass. Ave. Go
    straight down the Infinite Corridor for as long as you can. When forced to
    turn, go right. You'll pass one intersection decorated with murals of
    jungle animals, hallway to library on your left, stairs on your right.
    2-135 is a couple of doors further along on the left. For wheelchair
    access: use the door at 33 Mass Ave, toward the River from the main door.
    There's a lift here.

  19. ACS project by Anonymous Coward · · Score: 0

    There's a project just starting up having to do with this very topic. I don't remember the URL but the ACS project is setup for this. The announcement was on Freshmeat (search for ACS).

    ACS is "The ACS (Adjunct Communication Server) project was founded to deliver a GPL licensed multi-line telephony server built using a class extensible C++ state engine framework capable of delivering commercial applications in voice messaging, automatic call distribution, and IVR/voice response systems. This initial release primarily uses the Pika's Monte Carlo API under Linux and represents the class framework and core design for ACS. Future releases will include support for Natural Microsystems CT Accesss platform. Outside contributors to the ACS project are welcome.

  20. proprietary hardware/ interfaces by Anonymous Coward · · Score: 0

    Other PBXs may have an alternative interface, but the one I worked at for a while used phone jacks and serial connections to do all of the data transfer between the main unit and the voicemail. It shouldn't be too hard to model those interfaces with modems and software. If you start providing voicemail systems for what are normally proprietary systems, though, you are stepping on some really big toes.

    btw the modem gets voice from the sound card, with the same interface as a cdrom...

    d

  21. I use my modem as an answering machine by Anonymous Coward · · Score: 0

    There's a package on MetaLab (SUNSite) called "voice". It is an interface to the voice modem. It comes with an answering machine (written as a bourne shell script). It's pretty messy and required a few changes to the script in order for it to work properly.

    There is vgetty, which hooks into mgetty and is supposedly better, but I haven't had the time to try it out.

    As for text-to-speech on the voice modem, it apparantly can be done, but with the "voice" package, I haven't been able to convert the sound files to/from anything useful. The same would go for speech-to-text. The main obstacle to speech to text over the modem would be the low sampling rate. The phone system only uses a samping rate of 8000Hz 7 bits (Canada/USA) or 8 bits (Europe). I don't know if that resolution's good enough.

    --
    BigJimmy (at school and forgot password)

  22. Telephony on Linux by Anonymous Coward · · Score: 0

    You can also add Natural Microsystems

    www.nmss.com they primarily product high density boards but they are very well engineered and would be a perfect choice for IP telephony projects etc where they could become the gateway.

  23. SNA / Linux by Anonymous Coward · · Score: 0

    well, there's been a considerable amount of work
    on SNA for Linux; see http://www.linux-sna.org./

  24. Completely off-topic reply by Anonymous Coward · · Score: 0

    By the way, why are the building numbers so fscked up at MIT? I used to spend a lot of time going to the various libraries there and got rather good at "habitrail-ing" from bldg to bldg to get to my destination.

    But the numbers are nutty. Is it some kind of neo-Fibonocci series or something?

    signed,
    Graduate of the other Cambridge, Mass. university.

  25. Festival works well for me! by Anonymous Coward · · Score: 0

    I didn't have any problems installing and using, then again I'm a very gifted individual :)

  26. Voxilla is dead... by Anonymous Coward · · Score: 0


    Lots of promise, but with no drivers it is simply impossible for a project like this to get anywhere.

    Mark

  27. Telephony on Linux by Anonymous Coward · · Score: 0

    There are currently two board vendors that I know of that have drivers for their telephony interface boards for Linux. Aculab - a UK company specializing in protocol integration and PIKA - a canadian company specializing in low cost interfaces. I personally was involved in try to get Dialogic - US company to QUICKLY port their drivers to Linux but they are too big and bloated to do anything quickly. Pika just announced Linux support at last months Computer Telephony Expo in LA for their entire line of boards. Low-density analog and high density digital. Aculab will support only digital as far as I can tell. I would expect Dialogic to come online with Linux by Q3 or Q4 99.

    www.aculab.com
    www.pikatech.com
    www.dialogic.com

    cheers,

  28. Anyone know about "Shotgun" modem tech and Linux?? by zonker · · Score: 0

    You know, those new dual 56K modems that you use two phone lines for and call your "Shotgun" enabled ISP and get a throughput of 112K? Can this be used with Linux or FreeBSD? Is it a Microsoft/Windows API only? If it can be used with Linux or FreeBSD, can you used two different kinds of modems, or do they have to be the "Shotgun" type like Diamond puts out? Thanks...

  29. CTI for Linux by DASTAR+COM · · Score: 0

    NMS may have better boards than Dialogic but their API sucks hardcore. As for them releasing Linux drivers...really? Where? All I see is frivolous announcements of them releasing the source to their suckish API, but no driver as of yet.

    They should've listened to me and started on this two years ago when I told them they were foolish for jumping headlong into NT and ignoring Linux. They could've been way ahead of the game already.

    Same thing goes for Rhetorex (now Lucent) but at least they were more receptive than NMS was (and more honest about their decision).

    Screw NMS! They'll never get my business.

    Compliments to Pika for their efforts, although I must also give them a big THHRRT! for not doing this earlier as well.

    All you PC-telephony board vendors have a lot of catching up to do. You all would have been in a much better position to capitalize on the Linux tsunami if you'd listened to my rantings three years ago, including an article prompting you to produce Linux drivers in CTI Magazine in 1996! I wasn't just a kook yammering on about some obscure operating system after all, now was I?

    SCREW YOU ALL!

  30. My Linux box is my answering machine by Anonymous Coward · · Score: 1

    I set up my Linux box as my answering machine several months ago, and it works just fine. If you're using redhat the tools are already there. Just edit voice.conf in /usr/lib/mgetty+vgetty. You'll also need to start the vgetty process on that com port. I have it set to receive faxes and voice. You can also have it do data connections. I didn't need this since I have a cable modem. I also set up a script to convert incoming messages to wav format, and put them on a password protected web page, so I can check my voice messages from anywhere on the net.

    If you want a full voice mail system, search for mwm, which is a bunch of scripts that sit on top of vgetty.

    I use a USR Sporster Voice, and the sound quality is not great (8bit 8hz). Check the vgetty FAQ for better quality modems.

    Linux, it's my router, it's my firewall, it's my web server, it's my development platform, it's my answering machine, it's the brain of my recording studio.

    It doesn't make my coffee, yet...

  31. ISDN and VBOX by Kirth · · Score: 1

    It's very easy to implement an answering machine
    using ISDN and vbox. Vbox can be found at
    http://www.mayn.de/michael/vbox/
    The thing can be configured to listen to arbitrary
    numbers and respond automatically after a certain
    number of rings.

    --
    "The more prohibitions there are, The poorer the people will be" -- Lao Tse
  32. Some speach apps which are out there. by Brett+Viren · · Score: 1
    There is `Festival' which does turn text into pretty normal sounding voice. You can plug in different voices. And there is `say' (part of `rsynth') which turns your multi-kilo-buck PC into a Speak-and-Spell. Don't have URLs but both are Debian packages.

    Browsing Debian's packages also turns up a few others so this like, all Linux solutions, begins with: ``well, first install Debian...''.

    -Brett.

  33. Re: ARGH! by Jeff+Lightfoot · · Score: 1

    If you are looking to get the same results you could use xringd. It runs commands based on incoming ring patterns to your modem. xringd

  34. Coffee (not reading your LDP?) by cduffy · · Score: 1

    There's been a Mini-HOWTO on the subject for quite some time... shame on you for not reading your documentation!

    http://metalab.unc.edu/LDP/HOWTO/mini/Coffee.htm l

  35. CMU's Sphinx voice recognition system by gavinhall · · Score: 1

    Posted by hersh:

    I heard recently that some people at CMU will be porting the Sphinx 2 speech recognition system (developed there) to Linux. Not sure about licensing though.

  36. Telephony by gavinhall · · Score: 1

    Posted by Aven:

    I had been wanting to set up a house answering machine with a WWW gateway for some time. I started to write something myself, but found that there are a few utilities out there that do the job for you. I haven't played with mvm too much, but it looks promising. vgetty seems to work pretty well despite having to write the frontend in shell scripts.

    http://alpha.greenie.net/mgetty/
    http://www-internal.alphanet.ch/~schaefer/mvm/

    You'll also need rsynth or some type of text-to-speech package for mvm to work. Good luck.

  37. I'm using an *ordinary modem* + soundcard for IVR by gavinhall · · Score: 1

    Posted by RichDrewes:

    I have hacked together an IVR/answering machine that uses an ordinary modem (not Zyxel type voice modem) and a SoundBlaster type soundcard in Linux. This requires construction of a simple circuit ($5 in parts from Radio Shack) to interface the sound card to the phone line, and a bit of software. I have coded a fast fourier transform DTMF (touchtone) recognizer and I use an 'expect' script for the call flow. If there is sufficient interest I can make a web page with a circuit description and post the code.

  38. Linux Journal by gavinhall · · Score: 1

    Posted by smich:

    Wasn't there an article in LJ a year or two ago about setup like this? Maybe the guy was just using Linux to control his phone system, but I remember something about voice synthesis.

    If you knew my memory...

  39. doing caller-id (vgetty sucks!) by Jamie+Zawinski · · Score: 1

    I'm amazed that so many people here say they were able to get vgetty working; you guys must all lead charmed lives, because I spent months fighting with it, and couldn't ever get it to correctly answer the phone and record a message twice in a row.

    I settled for having my machine simply use the modem to listen to the caller ID info, and pop up a big old dialog box telling me who's calling, then let my real answering machine take the call.

    Features:

    • I can read the dialog box from across the room;
    • It also checks for the incoming number in my address book ( BBDB);
    • The phone doesn't ring if it's late at night or early in the morning and the screen saver is active;
    • It securely logs calls to my web server, so if I'm at another site, I get asynchronous notification that someone has called me at home, and I know to call in and check my messsage!

    Works pretty good. Get the code and read all about it.

  40. My Linux box is my answering machine by C.Lee · · Score: 1

    Why bother using a computer as an answering machine at all? Just buy an $30.00 digital answering machine instead. Spend a little more ($40-50 dollars) and you'll get multiple mailboxes on some of these things along with caller ID...

  41. We need a TAPI for UNIX. by C.Lee · · Score: 1

    You're right. Who really needs TAPI? Just buy an $30.00 digital answering machine and free up your computer....

  42. ARGH! by robin · · Score: 1

    Look into xringd. This can recognise complex sequences of rings.
    --
    W.A.S.T.E.

    --
    W.A.S.T.E.
    1. Re: ARGH! by landtuna · · Score: 1

      Very cool. I just tried xringd, and it seems to work pretty well. The source is pretty simple, too.

  43. A couple of useful links by ptomblin · · Score: 1

    http://www.linuxtelephony.org/ and
    http://www.opentelecom.org/

    Neither of them answer *my* basic question, which is how to add touch tone response to a web based application I'm working on.

    --
    The next Cmdr Taco duplicate will be ready soon, but subscribers can beat the rush and see it early!
  44. A couple of useful links by ptomblin · · Score: 1

    I'm writing an application where people enter stuff either on the web or through a touch tone phone. The connection is through a database that either application can update.

    --
    The next Cmdr Taco duplicate will be ready soon, but subscribers can beat the rush and see it early!
  45. Simple question by ptomblin · · Score: 1

    What I need is something that can put up an audio menu along the lines of "press 1 for foo, 2 for bar, 3 for qux", and call the appropriate C/perl functions or shell scripts and put up another audio menu. Is this what they mean by IVR? I don't need to recognize verbal or non-touch-tone responses, just touch-tone.

    --
    The next Cmdr Taco duplicate will be ready soon, but subscribers can beat the rush and see it early!
  46. Voxilla by Hans · · Score: 1

    The Voxilla Project (http://www.voxilla.org) is working on some stuff, but the web pages are a bit outdated unfortunatly...

  47. CTI for Linux by asmussen · · Score: 1

    Really? That's good news. The last time I had checked Dialogic was refusing to support Linux at all. I used to work for West Interactive, which is probably the largest VRU company around. They had several hundred VRUs with a couple of single T1 Dialogic boards apiece, and they've only gotten larger since I left. They were using SCO on all of their systems, and their own in house software to drive the calls. At one point I was looking into Linux support for the Dialogic cards because I was toying around with the idea of setting up a few VRUs of my own and going into business dealing with some of the smaller customers that West didn't like to handle because of their smaller size. SCO would have been all right, but I saw Linux as an advantage, particularly with being small, because being open and easily customizable, it would be easier to have the kind of flexability that you often need as the smaller company, but they had no Linux support at all at the time. Ended up deciding against the whole idea anyway, but it's nice to know that Dialogic came around.

    --
    Shawn Asmussen
  48. Linux & IVR by Snapple · · Score: 1

    Well, SOMETHING that I know a bit about...

    First off.. IVR CANNOT be handled by a Voice Modem. Now before you startup the flamethrowers give me a sec... I am dealing with MINIMUM 1 incoming T1, and we have a couple boxes with 8-10 T's... That is up to 230 incoming calls at once, and not one line at a time.

    The current KING of the hill (in hardware) is DIALOGIC. BUT Dialogic is DEEPLY in bed with Microsoft. People have been pressuring Dialogic for about 6 years to come out with drivers for Linux, but nothing yet. There have been a couple times when activity on the Dialogic mailing list where rumors have been flying about (last one was something was supposed to happen March 25th), but again NOTHING!

    There are a couple companies that seem to be embracing Linux... Pika (Ya! Canadian!), Acculab, and NMS. I suspect that Rhetorex will be the next one to throw their hat into the ring, and the only one missing is Dialogic.

    Now for the next important thing... IVR Software. Currently there is nothing out there to handle the IVR back end. BUT there is ample open source languages that could be extended. Database support used to be an issue, now with pretty much every major database on Linux, that is not a problem anymore.

    Once you have your basic "Interface" card working, there are a whole bunch of other cards you can add in to get the extended support that you want. SCSA pretty much the standard for connecting cards. Once you have the card, you need the firmware.. Hopefully they will port it over!

    Anyway, enough ranting... summary:

    ACCULAB, PIKA, and NMS have support for Linux. Dialogic needs to be beaten over the head with a wet fish and have some sense beat into them!

  49. Voxilla is dead... by voxman · · Score: 1

    now, now. there are drivers for the pika line of telephony cards (http://www.pika.ca) that support fax (without artifacts like modems) voice, switching, tode detection, caller id, etc. You can find a Linux based phone system that can be used with these cards on http://www.tycho.com. It's called ACS.

    You can also use the AT+V command set with any voice modem card. I have the chase PCI-RAS4 at home running under linux and it works quite well.

    The reason I haven't had time to work on voxilla is that I have been working at VA Research fo the last 6 months and they keep people busy. I'll update the site tonight with a bunch of links to the developments that have been going in all over the place that almost no-one has asked about.

  50. Linux PBX? by voxman · · Score: 1

    Pika, Aculab, and NMS all have cards that support switching...but they are also all in early beta as far as driver development goes. The best you can do about switching at the moment is to lobby pika and aculab to open source their drivers or work with nms on porting their open source driver to one of their cards that does support switching.

  51. Audio widgt software... by moore · · Score: 1

    I wrote some audio widget software that
    used festival fot text-to-speech and some
    custome hardware for DTMF decoding. It is
    in perl and handels menus quite nicly right
    now I have not realeased it only becouse I
    wanted to make it work with a voice modem
    instead of with my sound card and custom
    hard ware but I do not have a voice modem
    so it hasent happed. I would gladely give it
    out under the GPL if anny one was instread.

  52. Open Speech Project by Rustless+Walter · · Score: 1

    AT&T (was Olivetti) Research Labs in Cambridge used news broadcasts with simultaneous captions to train their voice mail search program. ISTR they found it the cheapest way of getting both text and speech.

  53. Now THAT is cool! by Bwah · · Score: 1

    Wish I'd though of that sooner. You know what the freqs are?

    /dev

    --
    "There's no secret. You just press the accelerator to the floor and keep turning left." -- Bill Vukovich
  54. Re:Speech recognition? by Bwah · · Score: 1

    You ever used the UPS tracking number system over the phone? It's really really good. You can reattle off the number in a natural voice and as fast as you like and it gets it damn near every time. Granted it's going character by character, but I think it demostrates what you can do with the BW that the phone allows.

    BTW, IBM via voice for Linux beta SDK is out for free. I think they said it ships with RH6.0 on the app disc. I downloaded it from IBM, but at 40meg it's quite a hit for a modem.

    /dev

    --
    "There's no secret. You just press the accelerator to the floor and keep turning left." -- Bill Vukovich
  55. PIKA -- Where and how much? by Great_Jehovah · · Score: 1

    Where do you buy the hardware and how much does it cost?

  56. vgetty by vinn · · Score: 1

    A while ago I needed to do some crazy stuff like this. I put some pieces together that actually worked fairly well.

    I used a Multitech 5600 ZDXV voice modem. I
    highly recommend Multitech's if for no reason
    other than their no-hassle/no-RMA return policy
    and 10 year warranty.

    Then I used vgetty to handle answering the phone,
    recording stuff, and decoding DTMF tones.

    It all works, make sure you have ALL the vgetty
    patches. Maybe join the vgetty mail list. If
    you don't like hacking scripts and crap together
    you won't enjoy setting this up. vgetty is
    HIGHLY undocumented.

    --
    ----- obSig
  57. My Linux box is my answering machine by otis+wildflower · · Score: 1

    It doesn't make my coffee, yet...

    Well, slacker, get cracking! You could probably wire a soft power switch to a serial port or something and use your carrier to indicate wheter the switch should be on or off...

    0 8 * * * 1-5 /home/me/coffeemaker -1
    0 10 * * * 1-5 /home/me/coffeemaker -0


    Or go for the whole thing and wire your sockets with X10..

  58. IVR Hardware is the biggest problem by sbreakwater · · Score: 1

    Linux has proved itself to be a very stable OS. That's the main reason I have chosen it as my primary OS. At work, however, we develop our telephony software for Windows NT. This is because major players in the IVR hardware industry have chosen to ignore Linux as a possible platform for their buyers' systems. I would _love_ to be developing for Linux and getting paid, but Dialogic (probably the biggest telephony hardware producer) will not write drivers for Linux nor will they release hardware specs. Not even under an NDA. This has kept Linux out of a market where it has much potential. IVR hotline systems are very commonly mission-critical machines and, as most of you are aware, NT isn't the most stable platform in existence. I have had to develop complicated startup and shutdown processes for our machines in the field so that they may babysit themselves. We also require pcAnywhere to be installed on the systems in order to fix them when NT decides to get ugly. VNC would fill that requirement on a Linux box. Now all we need is the help of some complacent hardware companies and the Linux community will be on its way!

    (Somehow, I'm not too hopeful. Let's see the slashdot effect in action...)

    --
    -- A hacker is a machine for turning caffeine into code. G: GU d-(--) s:- a--- C++++(++)$ UL++(+++) P+(++) L++(+++)
  59. That is cool. Yeah! by Harbinger · · Score: 1

    That is cool. Yeah!

    --
    Be smart and work to create. Don't ride on the backs of others.
  60. CTI & Linux - see Dialogic by gelfling · · Score: 1

    At least Dialogic has started to develop their CTI products for SCO Unixware - can Linux be far behind?

  61. Linux now does phone spam? by OrcSlicer · · Score: 1

    You can get a box which will emit those three, whiney annoying tones that tell a dialling computer that the phone line doesn't exist whenever you pick up the phone. The systems then take you off of thier list.


    Orcslicer

    --
    So, Lone Star, now you see that evil will always triumph because good is dumb.
  62. mgetty and voice mail url by pimp · · Score: 1

    The URL should have read http://www.cis.ohio-state.edu/hypertext/faq/usenet /fax-faq/mgetty+sendfax+vgetty/faq.html

  63. Linux Telephony - some good answers by Fionn · · Score: 1
    There also is a nice graphical frontend for vgetty/sendfax installations named PalMail. It hasnt been developed for ages but its pretty configurable and it worked for me for years.


    Fionn

  64. Linux can be used for an IVR... by IOstream · · Score: 1

    Using Dialogic's D/41E Linux can use the (slightly modified) Dialogic APIs to create a fully functional IVR/phone system... While the boards are not cheap, they do offer performance...

    --
    |0stream
  65. Speech recognition? by coreman · · Score: 1

    I've asked about this before. I wouldn't mind donating some cycles to the project. I have a contract that would work very well with Speech input. Mostly I need a system that will record/recognize stuff for data capture in real time with offline verification/correction. Basically a voice dump/sink. I have also heard that Dictaphone was supporting an offline/after the fact server for speech to text.

    Will just wants to yell at his robots 8^)

  66. Timely Mtg (Boston): Voice Recognition on Linux by coreman · · Score: 1

    Are there notes made available from these meetings? I have a class in central mass tonight that I need to go to. Is there additional info available on BVU?

  67. You can always hack one.... by Peale · · Score: 1

    I was thinking of putting it out on the Net, but due to creeping featurism the code is embarassing. :) (That and I find little time to fix things that aren't quite broken)


    Isn't that the point of the open source initiative? To release the code so others can have a crack at it, to iron out all these bugs? I say 'release the code!'

  68. ARGH! by dario · · Score: 1

    You could avoid DTMF by counting "RING"s from
    the modem. A script which starts ppp when it
    notices, e.g. three rings, pause, four rings,
    would do nicely. That way you can even call from
    another country and not spend a penny for the
    phone call (assuming that the modem will see as
    many "RING"s as you hear on your end).

    --d

  69. ARGH! by Raptor+CK · · Score: 1

    Even when my questions get answered, they don't get answered. A while back, I asked "Ask Slashdot" about a similar application, but all I wanted it to do was to read DMTF codes and complete a set of commands based on those codes.

    Example:
    I call up my linux box (which I can't keep dialed in 24-7)
    It picks up and plays some random audio file, piped into the phone line.
    "Press 1 to start ppp interface"
    I press 1.
    It hangs up. I do the same.
    I wait a minute, and then ssh into my machine, thanks to the wonders of a dynamic DNS service and a remotely mailed ifconfig dump.

    If this is possible, I'd love to know how.

    CK

    --
    Raptor
    "Procrastination is great. It gives me a lot more time to do things that I'm never going to do."
  70. Linux Telephony - some good answers by stefanm · · Score: 1

    I've been using mgetty+sendfax/vgetty for about three years. There has been a flurry of new code recently; the new version of vgetty is highly scriptable, and I've haven't even scratched its potential. Look up:

    http://alpha.greenie.net/mgetty/index.html
    http://www.leo.org/~doering/mgetty/index.html

    I've been using 1.1.20, with the ZyXel 1496E+ .
    I use it for voicemail, incoming logins (i.e. data) and faxes (in and out). Since I work at two locations connected at T1 speed, I like being able to hear my voice messages at each place. And I can also dial it up like a standard answering machine to hear my messages (that I have to re-configure).
    I have a number of users who dial in to read their e-mail and do some (slow...) surfing; ppp works
    well. The current vgetty appears to be quite stable, and doesn't slip into recording white noise instead of voice as much. My PII-350 box runs Slackware 3.6.

    What I would REALLY like to see is a voice-mail system which can connect a voice channel over the LAN to another CPU selected according to the tones punched in by a caller. That sort of thing has been developed for MS ...

  71. Commercial and Open Source offerings for UNIX by GiMP · · Score: 1

    For those of you looking for a commercial application try: http://www.entropic.com/ they specialize in EXACTLY what your looking for, not only speech recogition software but for TELEPHONY systems !

    A nice Open Sourced options is EARS, one of the few.. and probally one of the first UNIX speech -> text applications. Although I personally have had some difficulty compiling it under Linux. http://www.tmt.de/~stephan/ears.html


    Also, there are many output programs, the best one available is called festival and can be found in above comments and on freshmeat, but is somewhat bloated. Alternatives maybe found in the "say" program, which is distributed with "GxEdit" (find it on freshmeat as well), Emacs text->speech: http://www.cs.vassar.edu/mirror/emacspeak/emacspea k.html, and DECtalk at: http://www.ultranet.com/~rongemma/indext.htm


    If none of these are suited for your needs try:
    http://www.bright.net/~dlphilp/linux_soundapps.h tml#speech

    The Linux sound/midi page is a valuable tool ! :)

  72. DDlinux Speech Recognition Mailing List by emrek · · Score: 1


    FYI, there's a mailing list about
    speech recognition on Linux. The
    home page for the list is
    http://leb.net/ddlinux/

    From the "Current Status":

    Discussion on ddlinux is currently on hiatus while we wait for various open-source speech recognition engines to mature (a process which is likely to
    result in something useful to us in mid-1999). Ddlinux therefore operates as an announcement list, so traffic is very low.

    They have a few useful links and discussions.

    Emre |=^)

  73. vgetty can do this. Just add you your voice.conf:

    dtmf_program /usr/local/bin/dtmf.sh

    /usr/local/bin/dtmf.sh will get called with all the numbers you typed.

    --

    -- Don't Tase me, bro!

  74. Check Open Telecom http://www.opentelecom.org/ by tincho · · Score: 1

    A group of medium to big players in the telecommunications business have released some of their software with open source licensing. You still have to buy the hardware though. NMS (Natural MicroSystems) has very recently released software and drivers for their hardware for Linux. For more information please check http://www.opentelecom.org/

  75. Speech recognition? by WillWare · · Score: 1
    Anybody know what's going on with speech recognition? There are now several good, cheap speech rec packages in the commercial shrink-wrap world (at least they claim to be good, I haven't tried any of them myself) in the sub-$100 range.

    Last I heard, the only open-source thing of this sort was something called 'ears'. I've never heard of anybody actually using it, which suggests it might perform underwhelmingly (tho, again, I've never tried it myself so this is just a guess).

    A good OSS speech-rec program would have all kinds of uses. It would be a no-brainer for the wearable-computer people. It would be great for any PDA with a microphone. Maybe speech recognition could ease some peoples' fears of the command line, or avoid wrist injuries.

    There is some relevant stuff in the FAQ for comp.speech. A web search for "speech recognition", "phoneme", and "hidden markov model" turned up a lot of interesting hits.

    --
    WWJD for a Klondike Bar?
  76. Text2Speech by Kamelion · · Score: 1

    I've had pretty good luck with Festival for Text to Speech. I think I had to hack an include file here and there to get it to compile on SuSE, but I was able to get it to compile none the less. I have it read a fortune from my fortune file when ever I open a shell or log in. It sounds a little like an altra advanced Speak and Spell with a slight Scottish lint. Loads of fun.

    One of these days I'm going to see if I can get it to work from my Netwinder.

  77. speech2text by Kludge · · Score: 1

    http://www.gel.usherb.ca/grpetudiants/speechi/

  78. Toe-stepping == good by Weasel+Boy · · Score: 1

    Isn't stepping on big toes what Linux is all about?

  79. Voice systems -- lots of proprietary hardware by swb · · Score: 1

    Not that this answers any of the questions, but we recently looked at adding more storage to our voicemail system. The vendor wanted $6,000 to add what was essentially a 500 MB hard disk.

    We then looked at a PC-based voicemail system that would integrate with our LAN mail package, and the stumbling block there wasn't storage as disk space is so cheap when you're buying computer disks, but all the proprietary interface cards for interconnecting our Nortel switch. We would have needed something like $5k worth of boards (all ISA, no PCI available) to do the interfacing.

    I'm guessing this is going to be a stumbling block for any linux-based system. Most PBX systems are pretty proprietary and so are the interfaces, which means that not only are you single-sourcing interfaces but you're paying a fortune for them as well.

    I don't know how voice modems get the voice part into the computer -- through the serial port or through the sound card -- but if it's through the serial port, you could probably do a standalone voicemail-only system with a multiport serial card and a bunch of voice modems.

    You'd still be screwed when it came to interfacing with PBXs, though. You could do the voicemail, but there would be no access to the PBX itself, which means you could only leave messages. No rollover to operators, calling back into the PBX, etc.

    Most organization that have PBXs have also sunk a ton of money into them, so convincing people that the low-cost linux voicemail system would be a good investment when they'd throw out PBX features might be a tough sell.


  80. Excelent Text-To-Speech by bjwest · · Score: 1

    Take a look at The MBROLA Project.. I played around with text-to-speech a few months ago and this was one of, if not the best I found. "Freely Available multilingual synthesizer!"

    --

    --- Keep the choice with the user..
  81. I'm using an *ordinary modem* + soundcard for IVR by CE@UIC · · Score: 1

    Yes, please do so. I would love more info on how you did this.

  82. For less than $1200 PER YEAR? by pwb · · Score: 1

    If you can put together a reliable Fax on Demand system under linux for less than $3000 up front and less than $1200 a year in support contracts then you have a viable chance to start your own company. And I'm talk about SOFTWARE prices only this excludes the $4000 to $15,000 in hardware costs!!

    Reliable is defined as your software crashes less often than Windows-NT.

  83. re: speech to text by Quickening · · Score: 1

    I played with Abbott Demo a while back. It was only a technology demonstration. It was more amusing than useful. About 20% errors with no speech training.

    --
    tcboo
  84. ISP by Jose · · Score: 1

    can anyone point me to a FAQ, or a HOW-TO on howto set up my box to act as an ISP...what I want to do is to be able to dial in to home, and connect to the net from my box. I have an ethernet connection to the net.

    --
    The basic sleazeware produced in a drunken fury by a bunch of UCBerkeley grad students was still the core of BIND. --PV
  85. Linux & IVR by Yuk · · Score: 1

    I worked for Dialogic HQ a few mounths back as a Co-op. The idea of Linux drivers was always being thrown about the table. There is a small group of people that is currently working on porting the Dialogic current UNIX drivers over to Linux. I have no idea the status of this project. (I left while it was still a twinkle in Howard Bubbs Eye) I know however that some people have gotten Dialogic cards (the low end one mostly D41E, D41d etc) up and running. You may want to toss the question up onto the descusion board on the support web site. http://support.dialogic.com

    --
    "Yuk doesn't Belive in Pleasure......only pain"
  86. Linux IVR in development by Yuk · · Score: 1

    Just for the Record Dialogic was not "gobbled" up by MS. They where commissioned by them to write the new TAPI API

    -Yuk

    --
    "Yuk doesn't Belive in Pleasure......only pain"
  87. CTI & Linux - see Dialogic by Yuk · · Score: 1

    Dialogic has has several UNIX drivers for quite sometime now. Unfotunately they seem to be dragging there feet when it comes to Linux.

    -Yuk

    --
    "Yuk doesn't Belive in Pleasure......only pain"
  88. Speech Synthesis by ibis · · Score: 1

    You might want to start with the comp.speech FAQ:

    http://www.speech.cs.cmu.edu/comp.speech/

    In particular, take a look at:

    http://www.speech.cs.cm u.edu/comp.speech/Section5/Q5.5.html


    Two speech synthesis programs I have played with are:

    rsynth: ftp://svr-ftp.eng.cam.ac.uk/ pub/comp.speech/synthesis/

    Festival: http://www.cstr.ed.ac.uk/projects/f estival.html

  89. text-to-speech answers by Gleepy · · Score: 1

    One can find the tarball "rsynth-2.0.tgz" at the Metalab Linux archives in the directory apps/sound/speech. It "tries" to speak through your existing sound card. I say "tries" because I'm getting buffer overruns that the program isn't accounting for. Those with the Cheapbytes Linux Archive CD set for Winter 1999, the tarball is on the second disk under the same directory.

    For those with the DECtalk card, one might consider "emacspeak" available from the Debian or Slackware distributions.

    --
    Gleepy the Hen. More intelligent than the average hen.
  90. Tele-computer Controller by BrownJ · · Score: 1

    In the June 1999 Popular Electronics magazine they have an article which shows how to build a Tele-computer Controller and it allows you to have ring detection and dtmf detection for only a $57 kit probably around $20 in parts. And it hooks up to a parallel port. I'm going to be using a modified circuit to build a mini pbx system for my house. http://www.geocities.com/sil iconvalley/foothills/1897/

    --
    Eh?
  91. An IVR in an Enterprise System -- Perhaps Not by shri · · Score: 1
    While IBM is making very aggressive moves to fix this gap, Linux currently lacks the software which would connect it to big blue enterprise systems. A large Bank for example might need to make a query using LU 6.2 (SNA) into its mainframe. Many Airlines run their CRS (Customer Reservation systems) on Unisys mainframes which uses some more obscure protocols (even if theey are IP based) to connect into its boxes.

    The second problem is that a lot of the IVR systems are custom built by system integrators, who are usually more familiar with NT or OS/2 (large in the IVR market) than with NT.

    Are there any moves been made by an existing IVR vendor to port their systems over to linux?

  92. CTI on Linux by RichiP · · Score: 1

    I emailed a question to AskSlashdot which hasn't gotten posted. The question goes: are there any combination hardware-API packages available for developing CTI on Linux?

    Our choice platform used to be Dialogic with their nice hardware or Talking Technologies, Inc. with their Powerline II card on MS-DOS (the Windows NT platform was never stable enough and we used to get complaints about that), but I can't seem to find an API/lib for Linux.

    Is there anybody out there doing CTI/Linux right now? Our company is willing to try any hardware that supports a stable and free OS such as Linux.

  93. Help plug Linux on Dialogic Survey by RichiP · · Score: 1
    Dialogic has one of the better cards for CTI development. I asked them casually several weeks back if they would support Linux and they said they were "looking into it". I then asked a friend who worked very closely with Dialogic on MS-DOS and NT (I should mention he tries to avoid this because it necessitates a reboot once a week), and he showed me an email from Dialogic stating they were NOT yet supporting Linux.

    There's a survey at http://www.dialogic.com/uk/forms/ossu rvey.htm which asks what OSes we use (and perhaps would like to use CTI on). It says UK on the URL and Diaogic customer on the page, but I'm sure they won't mind if we showed them our support for our favorite OS. If you would like to see a Dialogic SDK for Linux, please sign up!

  94. Festival Text To Speech (TTS) by __aalomb7276 · · Score: 1

    I am impressed with Festival. RedHat RPMs and Debian packages are available.

    It comes with several British voice. Several American, a Mexican Spanish and a German voice are available from Oregon Graduate Institue.

    Call me a nerd but I like to hear the original voices quotes my favorite lines from Monty Python.

  95. --DTMF...RTFM by cg · · Score: 1

    Dual tone, multi-frequency

  96. You can always hack one.... by WareW01f · · Score: 1

    A roommate and I took an external Cardnal voice modem and hacked a answering machine server for it. The thing could handle caller ID as well. We had great fun recording individual messages for everyone the box could resolve. Multiple mailboxes... Course the playback and interface was pretty primative, some perl scripts and sox.
    I was thinking of putting it out on the Net, but due to creeping featurism the code is embarassing. :) (That and I find little time to fix things that aren't quite broken) I would imagine that each modem would have it's own little method for all the funtions. Everything on the Cardinal was AT commands. I'd recommend the box to anyone.

  97. A couple of useful links by Sleuth · · Score: 1

    Touch tone and web based application? What's the connection? There are DTMF to serial converters available that would allow you to pull touch tone info off the phone line, and some modems ought to be able to do this also. But why web?

  98. Text -> Speech by QuMa · · Score: 1

    I had a reasonable text -> speech app, But I can't get hold of the name right now because I'm in combat with my partition table and the 1024Cyls limit :-(.

    It was somewhere in the sound section of metalabs.unc.edu .

    QuMa

  99. SohoVoice & vgetty by Scott_F · · Score: 1

    i'm writing a complete voice-mail and fax system for linux which uses vgetty. the only problem is that your modem needs to be supported by vgetty.

    currently, vgetty uses hard-coded modem support, and is a little difficult to hack to get working with modems that aren't supported in the release. so if you don't have a modem that vgetty currently supports, it takes a little hacking to get it working. this could be overcome if vgetty moved to a plain-text file for modem configuration, sort of like the (eek) .inf files in win because essentially it is a matter of sending AT commands to the modem to put it into different states (record, playback, etc...) and those could easily be inserted in a text file, like say "Play AT+TX" to play a file, for example (probably not the right command. ;)

    anyways... there are programs out there. mvm, and other vgetty scripts that sit on top of vgetty.

  100. Linux Computer Telephony offerings by Calum · · Score: 1
    Right up front I'll note I'm not unbiased, since I'm an engineer working for Aculab (not Acculab, thats a company that makes cute little balances and scales).

    The best resource I've found for finding out about Linux and Computer Telephony is www.linuxtelephony.org. They even leaked about Aculab's Linux support policy last year (it was only announced in February officially).

    As an engineer working in the CT industry, I'm seeing more and more companies considering real, money-earning projects using Linux. We're starting to see support for Linux from other CT vendors, and I wouldn't be surprised if Microsoft's investment in Dialogic is a reaction to that. My personal prediction is that we'll see Linux and Solaris grabbing an increasing share of these types of applications, so it should get easier and easier to get Linux drivers and software for CT cards. Aculab certainly plans to expand on Linux support from what I can see.

    Solid Compact PCI and hot-swap support under Linux would help a lot in this application area though.

    --- Calum

  101. Drivers are coming soon... by TokyoJimu · · Score: 1
    Most of the hardware manufacturers I've spoken to recently are finally (reluctantly?) acknowledging Linux's popularity and are planning Linux releases.

    My sources at Dialogic say they'll probably have a Linux port of their drivers out sometime early next year, while the good folks at Aculab already have Linux drivers for some of their products and will release more Linux drivers in the coming months.

    Aculab is also pretty good about releasing hardware specs if you're really interested in doing a port yourself. Dialogic has never quite understood that they could boost sales of their products by releasing enough information to allow outside parties to write more drivers.

  102. Linux Voice Mail system by cepler · · Score: 1

    This ISN'T Linux, it's SCO, but read on.

    http://www.nexpath.com

    This is a SCO based phone system which we happen to use here at work and it works quite well. I was browsing their FTP site one day and happened to notice some Linux files...some headers and what looked like a lowlevel driver of some sort. I didn't look into it much. (in /pub/linux) I'm suspecting that they are working on porting their stuff over to Linux to cut down on licensing costs.

  103. CTI under Linux by ColPanic · · Score: 1

    Slowly it is happening. I develop IVR's and have started to see a few packages forming for linux telephony.... If you have the bucks for a decent telephony card Natural Microsystems(www.nmss.com) sells some fairly decent stuff. They also recently released their CTAccess SDK source code for linux, pretty cool! There is also a site (www.linuxtelephony.com) that you can read all about stuff, and I think the CTAccess source is on there..

    --
    -------- I dig Mobile Phones
  104. You don't have to install debian to use .deb's... by austad · · Score: 1

    Just do a "ar x filename.deb" and unzip the resulting data.tar.gz in the root.

    --
    Need Free Juniper/NetScreen Support? JuniperForum
  105. List of Linux speech synthesis / analysis links by divbyzero · · Score: 1

    Not much help on the telephony-specific angle, but for general purpose speech synthesis and analysis info for Linux, check out the Linux Audio Developers' Resource Page.

    Why does it seem like suggesting this link is my answer to many Ask Slashdot questions? Maybe we need a FAQdot!

    Div.


    --
    But my grandest creation,
    As history will tell,
    Was Firefrorefiddle,

    --
    But my grandest creation, as history will tell,
    Was Firefrorefiddle, the Fiend of the Fell.
  106. Hardware? by mustard · · Score: 1

    I've been wanting to do something like this, but finding hardware, inexpensively to do 2 lines hasn't been easy. I know Dialogic is supposed to be the stuff, but about two years back, I saw an ad for some vendors making inexpensive dialogic compatible hardware. Around $400 for a two line board. I'd love to be able to conference two lines together, do voicemail, etc, but need more than one line, and I'm not wild about the voicemodem approach. I think it needs to be a dsp board!

  107. Linux IVR by ToadStool · · Score: 1

    Natural Microsystems announced Linux drivers for their AG cards last week. These include a range of cards from an 8-line loop-start to a dual T1 card. You can read more at http://www.nmss.com/nmss/nmsweb.nsf/n ew/linux
    You might also want to check out http://www.opentelecom.org

    One thing that I found particularly interesting about Natural Microsystems is that rather then developing Linux drivers, they open-sourced their existing drivers and, of course, Linux drivers soon followed.

  108. Commercial and Open Source offerings for UNIX by maw · · Score: 1

    Of course you mentioned emacs. It does everything. :))

    Also be sure to have a look through the LDP howtos -- there is an Emacsspeak howto. Should be handy.

    --
    You're a suburbanite.
  109. Linux IVR by xs4all · · Score: 1

    Any chance that your setup is/will/would be available as a
    HOW-TO ... (hint, hint)

  110. OpenH323 (another link) by juan+large+moose · · Score: 1
    Another link of interest:

    http://www.openH323.org/

    Juan

  111. Kraemer Voice System by juan+large+moose · · Score: 1

    Umm.... Keep the GPL in mind. Even a custom Linux *must* be available under the GNU Public License. (I.e., you could sue *them*.)

    juan

  112. Linux PBX? by juan+large+moose · · Score: 1

    In addition to voice and fax cards, I'm interested in switching functionality.

    Hardware exists that makes it possible to build small and/or large-scale PBX switches using PCs. It should be possible to create a large, distributed (possibly even redundant) telephone switching system from these parts. Such a system could be used as the core of a telephone system for a large company or university campus.

    So, is anyone working on this sort of application?

    Juan

  113. Text2Speech by twitham · · Score: 1
    Uh, Festival works great for me on Linux. Yeah, it was a pain to build, but worth it I think. I have it announce when I have mail waiting, who it's from and the subjects.

    -tim

  114. I use my answering machine as a modem! by TheDullBlade · · Score: 1

    I tried anyway. It didn't work too well...

    Nevermind.

    --
    /.
  115. TTS libraries by company+nuncio · · Score: 1

    Lucent indicates that they're going to release their TTS libs for Linus RSN. The Lucent libs are quite good, and come in US english, European French, German, and Mexican Spanish.

    If you want something really understandable, this is the one.

    The downside is they're pricey ($595 for personal use, and licensed by the copy if you distribute). However, if you want excellent, they are it. Obviously they're not OS. However, having listended to a bunch of really crappy TTS while evaluating them, it's worth it to me.

    As for APIs, there is a half-decent Java API, but no Linux support for the engines - you'd have to roll your own glue.

    --
    Of course I don't speak for my employer. My employer doesn't speak for me, either.
  116. Open Speech Project by Kevin+S.+Van+Horn · · Score: 1

    Take a look at www.openspeech.org in a month or two. I'm working on putting together an open source speech recognition toolkit for Linux. I'd appreciate hearing from people on what particular speech apps they're most interested in, and whether they'd be interested in trading programming for instruction (i.e., if you write code for the project, I'll teach you about probability theory and speech recognition.)

    Also, I'm interested in ideas for collecting training data. At present there are few, if any, *free* corpora of training data. Both text corpora (for training language models, i.e., what kinds of word sequences are plausible) and acoustic corpora (collections of recorded utterances, along with transcriptions) are needed. One possibility, that would be useful for training models to be used in command applications on desktop computers, would be to ask people to record snippets of speech on their own computers, and donate these to the project.

  117. ISP by SendBot · · Score: 1

    I would recommend an o'reilly book call linux network administrator's guide. Or maybe the crab book (tcp/ip). Or dig around the linux howto's. Then put together your knowledge of setting up a ppp server and routing traffic through your various net devices.

  118. Nexpath has linux drivers by Zachary+DeAquila · · Score: 1

    http://www.nexpath.com is a PC-based voicemail/autoattendant/etc system. They base their solution on one of the more commercial unices (SCO, I think), but linux drivers for their boards have recently appeared on their ftp site (ftp://ftp.nexpath.com/pub/linux).

  119. We need a TAPI for UNIX. by udp · · Score: 1


    I think I've said it once, and I'll say it again.... we really need a TAPI abstraction layer for UNIX.

    As TAPI is MS copyright, we can't use that, but
    Siemens sunk a ton of input into the spec. We can
    look at the TAPI, and perhaps produce an open source alternative which is object-based, using CORBA/IDL. Methinks that would rock.

    I currently have one big dream app idea for TAPI, and I'm not telling anyone what it is because I want to write it myself (open source, of course) - all I'm saying is I love music...

    --
    Bruce M. Simpson Unix/Network Bod & Win32 Developer
  120. ARGH! by Kitarra · · Score: 1

    This would not work. Since the ring is generated on the end where you are calling from is indipendant of the ring generated on the receiving end. To see RL example... every had someone pick up the phone even before it rang once on your end?

    I was trying to set up something similar and it kept failing. When I contacted the phone company that is what they gave me as a reason

    --
    -Kit
  121. Speech Synthesis by Sarunas · · Score: 1

    There is a package called rsynth available over at the FreeBSD ports site. It works well enough to understand most of what it says.

  122. We need a TAPI for UNIX. by steveco · · Score: 1

    No we do not need TAPI for UNIX. Have you ever used TAPI - it is terrible far too complex for what initially is a very simple task.

    TAPI doesn't really isolate you from the complexities of the underlying Switch (either 1st Party or 3rd party) it just makes it more difficult to do anything.

    It would be good to develop some CORBA objects that model a call, but you really need to follow some standard (such as CSTA) and model their call scenarios.

    Steve.

  123. Linux IVR in development by syphon · · Score: 1

    I am currently writing the libraries and subsystems necessary for a Linux IVR platform. I've worked with Lucent Technologies and Nortel/Dialogic systems for several years, and know what the base IVR system needs. It's a few months off, even in Alpha form, but it's coming. What is really missing is Voice Card support. Aculab seems to be the only vendor advertising drivers. Since Dialogic was just gobbled up by MS, I doubt that their cards are going to be supported.

    This little project will hopefully become a business for me: hardware and software solutions for IVR. If anyone is interested in details or in contributing code (unpaid but credited, for now), please let me know.

    Syphon

  124. CTI on Linux by sam+i+am · · Score: 1

    See our web site www.pika.ca or contact
    info@pikatech.com

  125. Hardware? by sam+i+am · · Score: 1

    www.pika.ca

  126. Yes and No by eswierk · · Score: 1

    Yes, you can use your Linux box as an answering machine. I am using a USRobotics 56K Internal Voice Faxmodem with vgetty on RedHat 5.2 to answer calls, recognize touch-tones (DTMF), and record and play audio files.

    Yes, you can do text-to-speech/voice synthesis too. I am using Festival to read email over the phone using the above setup.

    I don't think there's any viable speech-to-text telephony solution for Linux, but I'd be happy to be proved wrong on that one.

  127. Kraemer Voice System by aada · · Score: 1

    I developed a Text Email to Voice Mail system on a customized version of Linux. The system includes a Text to Speech Synthesis, ULaw audio to ADPCM converter, voice modem player. Kraemer Voice System has a web page here.

    You can also find information about the customized version of Linux here, I called it Kraemer. It's a distribution designed to be an "All in One" server solution that I started in '96. The company never really sell enough of Kraemers to make it a popular distribution, and I want to make this distribution free for download, however, the company that I work with doesn't like it to be "FREE".

    Maybe I should just release the distribution and get sued by the company. Hmmm, I guess I worked on the distribution for too long, and I am becoming a madman.

  128. voice/fax emailed by willb · · Score: 1

    All except the voice to text I know can be done.

    The text to speech can be done using rsynth.

    As far as answering voice/fax I posted the following in our linux user group on how to get Linux to work as an answering machine/fax machine with caller id and have the messages automatically emailed to you with any associated directory information

    http://www.vlug.org/ezarc/discuss/msg01471.html

  129. We need a TAPI for UNIX. by DASTAR+COM · · Score: 1

    TAPI? TAPI!?

    TAPI is a farging joke. It came from a company that can hardly even make an OS that's stable, let alone understand the fundamentals and proper paradigms of a telephony switching interface.

    A Linux Telephony Library should be created from scratch, by a person or organization that understands telephony (and doesn't just want to impose their own silly ideas to simply achieve domination of a growing and important technology realm).

  130. ISDN and VBOX by BoloMK30 · · Score: 1

    I have it working too.
    Unfortunately the docs are still in german.
    Anyone know how to program those touchstone sequences?

  131. blinux project archive by GuppieBoy · · Score: 1

    ftp://ftp.leb.net/pub/blinux is an ftp archive of the blinux project which wants
    to make linux work for blind folk.
    among other things you will find mirrors of several speech synthesis packages--
    one big difference being the type of hardware they want you to have.

  132. My Linux box is my answering machine by Dmitri+Baughman · · Score: 1

    1. Because it makes for a fun scripting project
    2. Because it makes a good excuse to buy more hardware
    3. Because Linux CAN!

    --
    http://www.darker-side.com
  133. Which voice modems? Was: ARGH! by angelatlarge · · Score: 1

    Which modems does mgetty/vgetty support for voice features? From what I gathered in the posts here, 3COM v90 and old Multitechs work well. I don't know if it is even possible to get those Multitechs now. The list of best modems supported on ohio-state seems to be hopelessly outdated.

    Thanks

    --
    And yet it also pleases me and seems right that what is of value and wisdom to one man seems nonsense to another -Hesse
  134. SCO no NT by Irwin · · Score: 1

    CTi is a vast sunject that covers everything from fax/voicemail on a personal machine to SS7 message handling in CO facilities.

    As reported on the Linux telephony org website, several manufacturers are know supporting Linux. I was recently at the CeBit in Hannover, and visited the Aculab people (www.aculab.co.uk) who are amongst those taking Linux seriously, and one can make comprhensive call centre solutions using thier products.

    BUT, many manufacturers do not yet support :inux, indeed in the past thier has been very much an "NT will be the solution for everything" attitude.

    As with so many other areas that have suffered the NT hype, CTI is beginning to realise that UNIX type systems, including PC as opposed to SPARC based systems, have a role in thier lives.

    Recently, many manufacturers (including Dialogic) have added SCO Unixware to thier supported systems. This is clearly a better option than NT, for people who would like to deploy Linux, as it is highly Linux compatible, and it would be possible to develop solutions that may at a later date be Linux based.

    VAR's may develop solutions that may be shipped as Linux as SCO based, depending on customer attitude to OSS systems.

  135. ARGH! by whoop · · Score: 2

    ringconnectd will allow a simple setup, but not (that I know of, a little tweaking could allow it) the sort of dual counts you suggest.

    I used it for a long time with vgetty to act as an answering machine. If the phone rang once, it dialed up ppp. If it went for 4 or 5, I forget now, vgetty would pick up and record a message.

    My beef with vgetty was that it would not play any message to greet callers. So only family/friends knew that when it beeped (it was quite a loud beep too), just start talking. The many times I tried, it either left the phone on hook until I went back home to reset it, or would just play an empty hiss for the length of the sound file.

  136. Because voice modems suck. by heroine · · Score: 2

    The sound quality coming out of voice modems sucks. At least on my modem, using some voice modem package that was around sunsite in 1997, the modem playback was unintelligable. The modem requires some horrible variation of ulaw compression. If the sound quality on modems was usable, voice modems would make a great touch tone interface.

    To get really intelligable sound, you need some kind of dedicated, expensive, phone hardware.

  137. KVoice by Matts · · Score: 2

    KVoice handles my voice mail, although it's a bit unstable, and the pickup feature is crap (there's no automatic pickup - you have to load kvoice and click on pickup - which is impossible to do in time if you're not logged on!).

    You can do demand fax serving with HylaFax.

    Other than that, I don't know of any text->speech or speech->text projects. Unfortunately it's not something that can be done very easily for free - it requires a huge investment of time, hence why these speech->text systems were originally hugely expensive.

    Matt.

    --

    Matt. Want XML + Apache + Stylesheets? Get AxKit.
  138. Some speach apps which are out there. by tgd · · Score: 2

    I was very impressed with the Festival software. Anyone looking for speech synthesis should definately take a peek at it. Text-to-speech isn't quite as nice in it as the speech synthesis itself, but its not bad.

    Its a system-hog though. I tried to use it to read e-mails to me through my voice system (see my other posting in here about it), but I found it took several minutes per message to put the audio together... Hardly worth it. Hell, my system is so slow, even using say to generate timestamps is too slow. :)

  139. ARGH! by tgd · · Score: 2

    Yes its possible. Pretty easy to set up too, once you've got vgetty working with your voicemodem. You need a voicemodem that works with Linux and vgetty though (most voicemodems these days seem to be winmodems...)

    I shied away from dynamic DNS and just e-mail the number to my pcs phone.

    One tip -- make sure you have an activity timeout on it, so if you dial it up accidently, or you (for whatever reason) don't get the dynamic DNS to update or get the e-mail that you can still cause it to disconnect.

    Throw a secure webserver on there, and just make some simple CGI's to trigger a delay to bring the machine back off the network.

    On my system I've got an X10 automation setup too, so I can remotely turn on other systems in my apartment. (Useful if I'm a bonehead and leave a file I need at home...)

  140. Speech recognition? by cfulmer · · Score: 2

    Having worked on this for a while...

    The main problem with speech recognition over the telephone is that the digital standard currently used by the PSTN samples voice at 8khz, with each sample being 8-bits wide. As a result, the speech recognition engine just doesn't have a whole lot of data to play with -- Speech recognition algorithms typically use a lot of statistics to determine how well a given chunk of speech matches a word stored in its vocabulary. The less data in the incoming speech, the harder it is to be accurate with a match. In fact, it actually gets harder, as many cell phones use various encoders to further reduce the data rate. Add that to interference and background noise, and ASR over the phone is decidedly not easy.

    Many of the shrink-wrapped ASR applications that you see are designed to work through the microphone jack on a computer, which provides much more data than is available over the phone network. IBM, L&H and Dragon are the vendors I'm aware of.

    Now, there are various vendors out there who do ASR for phone applications. Nortel (my employer, but not my project) has one, as does VCS, Nuance Communications and several others. These, however, are not generally priced for the consumer market. In addition, many of these solutions run on Digital Signal Processors, which require additional cards....

    OSS speech rec would be a good thing, but I'm afraid that it's going to be a while before it comes to pass, just because of the complexity of the statistics and the specific knowledge required. Those reasons also mean that it'll probably be a while before a PDA has the juice for it.

    (There's the urban legend of the guy presenting ASR control of his computer at a voice conference, when a voice from the back of the room shouts "Format c: Return" and somebody else chimes in "Yes Return")

  141. Linux Telephony - some good answers by jra · · Score: 2

    There are two major sites corralling telephony projects for Linux:

    linuxtelephony.com is an omnibus site, which has seemed not to have had any updates recently, and

    opentelecom.org which, well, has. :-) These folks are supported by Natural Microsystems, who have released a bunch of their code as open source under some license or another. I mean internal switching and driver code and like that.

    On a lower level front, it's possible to use mgetty+sendfax and Gert Doering's vgetty to build answering machine type stuff and also, possibly, 2-call fax response. I'm not sure about 1 call; switching modes can be messy.

    This stuff works with the old Zyxel 1496+ modems, among others, and _maybe_ with the Rockwell voice chips, but I'm not sure; the Zyxel's ought to be, roughly, free, by now.


    Cheers,

  142. mgetty and voice mail by pimp · · Score: 2

    Ohio-state has a FAQ on using [mv]getty for voice mail.

  143. CTI for Linux by squistle · · Score: 2

    Natural Microsystems has better boards, especially for industrial applications. They have released Linux drivers as well as source to their API at http://www.opentelecom.org.

    --
    There are 10 kinds of people in the world: those who understand binary and those who don't.
  144. Linux now does phone spam? by afniv · · Score: 2

    Just what I need. I reliable system to increase the number of unsolicited calls I get every evening when I'm eating dinner.

    I wonder how long it will be before that happens? I'm not sure what systems are used now, but they can't be cheap.

    Maybe I can set up my box to call them back? Or at least filter out the unsolicited calls or maybe even have preprogrammed answers to use up their time. Now there are some ideas. :)

    ~afniv
    "Man könnte froh sein, wenn die Luft so rein wäre wie das Bier"

    --
    ~afniv
    "Man könnte froh sein, wenn die Luft so rein wäre wie das Bier"
    Richard von Weizs
  145. Reveal's Serial-and-soundcard interface by SEWilco · · Score: 2
    Reveal's VM100 Telesound ($59 list) plugs into a serial port, phone line, and sound card. It is basically just a ring detector, on/off relay, and interface between phone line and sound card. I sometimes see them at electronics sales.

    Some VM100 FreeBSD code here.

    A press mention of the VM100 in Byte

  146. Dialogic support? OK, but too late for me. by SEWilco · · Score: 2

    That's nice. Wish they had not said no two years ago when I could have used it. Too late now for that project.

  147. Yes, I'm doing this now by smart2000 · · Score: 2

    I use some source I ported over from NeXTSTEP called am. IT drives a Zyxel modem, and allows callers to either page me, or leave a message, or recieve a fax. When a fax or voice mail arrives the caller id number is sent to my pager via an email to pager gateway. I then forward the voice and fax mails to myself via email, so that I can get them and store them on my note book on the road.

    I'm also in the middle of using this technology to provide a replacement for an old VRU (Voice Response Unit) from IBM. It grabs data from an AS/400 and provides information to customers on current shipments etc.

    Very easy to write. My next project involved with this is to use ears, or something like it to convert the voice to text (and then send it to my pager)

    --
    To purchase it is not like spending money but rather it is an investment in the future in a blow against the empire
  148. CTI for Linux by sam+i+am · · Score: 2

    At PIKA we already have the API in beta. See www.pika.ca.

  149. Voice systems -- lots of proprietary hardware by sam+i+am · · Score: 2

    At PIKA we have a beta version of our API running on Linux. Supports all basic telephony and fax.
    No text to speech or voice recognition.

    For more on Linux telephony see:
    http://www.linuxtelephony.org/

  150. It's just AT commands for the most part by schwantz · · Score: 3

    There are AT commands to do all this stuff, if you want to roll your own software. You'd have to do the system side (sound, etc) yourself. Rockwell (now Conexant) supports this through the use of what they call "business audio," which uses half-duplex digital PCM audio data from your computer (over the serial port/ISA slot). They also have an analog path to and from the chip, but that would be trickier, as unless you have a speakerphone version, the mic from your PC is probably not hooked up to your modem. Here's a few Rockwell (they're the MOST comman modem chipset manufacturer) AT commands (including fax and CLID)to get you started:

    7.5 CALLER ID COMMANDS
    #CID=0 Disable Caller ID.
    #CID=1 Enable Caller ID with formatted presentation.
    #CID=2 Enable Caller ID with unformatted presentation.
    7.6 FAX CLASS 1 COMMANDS
    +FCLASS=n Service class.
    +FAE=n Data/fax auto answer
    +FRH=n Receive data with HDLC framing.
    +FRM=n Receive data.
    +FRS=n Receive silence.
    +FTH=n Transmit data with HDLC framing.
    +FTM=n Transmit data.
    +FTS=n Stop transmission and wait.
    7.7 FAX CLASS 2 COMMANDS
    +FCLASS=n Service class.
    +FAA=n Adaptive answer.
    +FAXERR Fax error value.
    +FBOR Phase C data bit order.
    +FBUF? Buffer size (read only).
    +FCFR Indicate confirmation to receive.
    +FCLASS= Service class.
    +FCON Facsimile connection response.
    +FCIG Set the polled station identification.
    +FCIG: Report the polled station identification.
    +FCR Capability to receive.
    +FCR= Capability to receive.
    +FCSI: Report the called station ID.
    +FDCC= DCE capabilities parameters.
    +FDCS: Report current session.
    +FDCS= Current session results.
    +FDIS: Report remote capabilities.
    +FDIS= Current sessions parameters.
    +FDR Begin or continue phase C receive data.
    +FDT= Data transmission.
    +FDTC: Report the polled station capabilities.
    +FET: Post page message response.
    +FET=N Transmit page punctuation.
    +FHNG Call termination with status.
    +FK Session termination.
    +FLID= Local ID string.
    +FLPL Document for polling.
    +FMDL? Identify model.
    +FMFR? Identify manufacturer.
    +FPHCTO Phase C time out.
    +FPOLL Indicates polling request.
    +FPTS: Page transfer status.
    +FPTS= Page transfer status.
    +FREV? Identify revision.
    +FSPL Enable polling
    +FTSI: Report the transmit station ID.
    7.8 VOICE COMMANDS
    #BDR Select baud rate (turn off autobaud).
    #CLS Select data, fax, or voice.
    #MDL? Identify model.
    #MFR? Identify manufacturer.
    #REV? Identify revision level.
    #TL Audio output transmit level.
    #VBQ? Query buffer size.
    #VBS Bits per sample.
    #VBT Beep tone timer.
    #VCI? Identify compression method.
    #VGT Set playback volume in the command state.
    #VLS Voice line select.
    #VRA Ringback goes away timer (originate).
    #VRN Ringback never came timer (originate).
    #VRX Voice receive mode.
    #VSD Enable silence deletion (no function, command response only).
    #VSK Buffer skid setting.
    #VSP Silence detection period (voice receive).
    #VSR Sampling rate selection.
    #VSS Silence detection tuner (voice receive).
    #VTD DTMF/tone reporting.
    #VTM Enable timing mark placement.
    #VTS Generate tone signals.
    #VTX Voice transmit mode.
    7.9 VOICEVIEW COMMANDS
    +FCLASS=n Service class
    -SVV Originate VoiceView data mode
    -SAC Accept data mode request
    -SIP Initialize VoiceView parameters
    -SIC Reset capabilities data to default setting
    -SSQ Initiate capabilities query
    -SDA Originate modem data mode
    -SFX Originate FAX data mode
    -SMT Mute telephone
    -SDS Disable switchhook status monitoring
    -SQR Capabilities query response control
    -SCD Capabilities data
    -SER? Error status (read only)
    -DTP VoiceView transmission speed
    -SSR Start sequence response control
    +FLO Flow control select
    +FPR Serial port rate control
    -SSV VoiceView data mode start sequence event
    -SFA Facsimile data node start sequence event
    -SMD Modem data mode start sequence event
    -SRA Receive ADSI response event
    -SRQ Receive capabilities query event
    -SRC: Receive capabilities information event
    -STO Talk-off event
    7.10 DSVD COMMANDS
    -SSE=1 Enable DSVD
    -SSE=0 Disable DSVD

  151. Linux IVR by tgd · · Score: 4

    Its very possible.

    I've currently got an old 486/50 DX running Linux 2.2.5 at home that handles voicemail for me using mgetty and some custom shell scripts. (Unfortunately I was never able to get get vgetty perl module working... its very old and there's almost no docs for it...)

    Its pretty slick. People calling can leave voice messages or faxes. I've got it set up so either one gets packaged up in a mime attachment to my e-mail and queued to send to me. Next time the system is online it sends them off. If they sit there more than two hours it'll dial itself up and send them and get back offline. Also archives them so I can get them through a web browser on any systems in my apartment, or I can just hit the reset switch on the front of the system (which is plugged into the parallel port) and it plays any new messages for me. The turbo light blinks when I've got new messages.

    I can also control all the X10 stuff in my apartment (mostly useful for options #1 -- turn off all the halogen lights, and #2 -- turn of coffee pot, both reducing the chances that my spacing out one morning will result in my apartment burning down) ;)

    Last thing I can do is use it to cause my network to dial up. The system handles my masquerading and internet access as well as voicemail, so when it dials up my entire network is online, then it e-mails the IP address it got to my PCS phone. Secure SLL webpage on that IP address lets me control all those devices directly (especially turning on other PCs), check my messages, or disconnect the network...

    The real limiting factor I'd see in using it as an IVR system is more limited support of multi-line voice products, and the poor documentation and difficult programming for vgetty. I'm not sure there are any options other than vgetty.

    Using vgetty in combination with packages like HylaFAX gives you easy ability to do fax-on-demand and other services like that.

    I also used a system with three 14.4k voicemodems and vgetty as a way of validating information on a system that required the user give their true phone number. User was e-mailed a code to punch in after storing their supposed phone number and that code in a database. The voice system would use caller id and compare the code they entered with the code matching that number in the database. Match? Voila! Flag is set, account is activated.

    Worked great, client never used it though. C'est la vie.