Software Based Echo Cancellation?

Posted by Cliff on Wednesday May 8, 2002 @05:10AM from the muting-the-reverb dept.

tcyun asks: "I am helping to put together a small studio for a project at my workplace which will require some audio mixing. We have been able to find software solutions (often times open source) for almost all of our needs except for echo cancellation. I have done the requisite searches and have found a large number of hardware based echo cancellation devices, but have not found a purely software based solution. Is anybody aware of one?"

"For some more information, my office is trying to get a small system up and running that will allow multiple locations to video conference together. We have some specific requirements and have a fairly good handle on the entire video part of the problem.

However, we are running into problems with parts of our audio mix. The first issue is something that (I believe) is called 'mix minus.' This means that in a group conference, speakers do not have audio sent back to their location. (This is important for various psychology and network latency related issues.) There are several hardware based solutions that are available and we have some software based options.

The larger problem is echo cancellation. As many people may need to speak at once (and to avoid the requirement of having individuals constantly muting their microphones), we would like an echo cancellation component. The ideal would be a software solution that we could run locally, perhaps in conjunction with the same code running on the remote systems. However, most of the solutions we have found are hardware based (DSPs, ASICs, etc.).

The technology used on the studio side as well as the host side will involve various operating systems. We are trying to avoid relying on specific OTC hardware solutions (namely, sound cards), as we would like a solution that keeps working over time, particularly since specific hardware tends to become obsolete quickly. So, having nice code that could be compiled on different systems would be a plus. Ideally, we would like to minimize the amount of hardware necessary, so an echo cancellation algorithm that could run in conjunction with other processes would be nice, but it is not a requirement."
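
The "mix minus" described in the question can be sketched in a few lines of C. This is only an illustration under assumptions: the function name `mix_minus`, the site-major buffer layout and the absence of any gain control or clipping are all made up for the example, not taken from any product mentioned in the thread.

```c
#include <stddef.h>

/* Mix-minus for one block of audio: each site receives the sum of every
 * other site's input, never its own.
 * in:  num_sites * frames samples, site-major (in[s * frames + f])
 * out: same layout, receives the feed sent back to each site.
 * Hypothetical helper for illustration; no clipping or gain control. */
void mix_minus(const float *in, float *out, size_t num_sites, size_t frames)
{
    for (size_t f = 0; f < frames; f++) {
        float total = 0.0f;
        for (size_t s = 0; s < num_sites; s++)
            total += in[s * frames + f];
        /* Subtract each site's own contribution from the full mix. */
        for (size_t s = 0; s < num_sites; s++)
            out[s * frames + f] = total - in[s * frames + f];
    }
}
```

Summing once and subtracting per site keeps the work proportional to the number of sites rather than its square. Note that mix minus only keeps a site's direct signal out of its own feed; it does nothing about sound that leaves a loudspeaker and re-enters a microphone at another site, which is the echo problem proper.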

29 of 211 comments

Hardware Audio Tools by saveth · 2002-05-08 05:16 · Score: 5, Informative

The reason you're finding more hardware tools than software tools for echo cancellation, among other things, is that the telecommunications industry demands these sorts of things with much more fervor than the average consumer. Echo cancellation devices (for example, a codec with echo cancellation built in, running on a DSP) are used extensively in cellular telephones, voice routers, and this sort of thing. Your best bet, in this respect, is to find a company that is willing to release the source code to the software that is running on your hardware.

Alas, I do not know of any software, especially open source or free, that provides a full suite of audio processing utilities. Why is it that you're against using hardware, in the first place? Too expensive? Those are the breaks.
1. Re:Hardware Audio Tools by Ooblek · 2002-05-08 05:57 · Score: 3, Interesting
  
  I believe the echo he is referring to is the echo that happens in the latency from the time their voice is spoken to the time it makes the round trip from the other side.
  The term "Mix Minus" does not apply here. It is generally used in post production where you lay off your audio mix minus the voice over. You probably also don't need echo cancellation software. Just put a mixing board on each side of the conference call. Mix the local side's voice with the remote side's voice on each side. So they hear themselves as they speak and don't hear themselves from the remote side. Use directional microphones so that the loud speaker on the remote side can't be heard in the remote microphones. You could also require everyone to wear headphones I suppose. (Probably wouldn't be popular.)
Noise gate by joshwa · 2002-05-08 05:17 · Score: 3, Informative

As many people may need to speak at once (and to avoid the requirement of having individuals constantly muting their microphones),
Why not just install a noise gate at the microphone inputs?
For the non-audio-inclined slashdotters, a noise gate sets a minimum sound level threshold before the signal is transmitted.
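
For reference, a bare-bones software gate might look like the sketch below. The name `noise_gate` and the per-sample decision are assumptions made for illustration; real gates usually track a smoothed level and add attack, hold and release times so that words are not chopped.

```c
#include <stddef.h>

/* Simplest possible noise gate: any sample whose magnitude is below the
 * threshold is replaced with silence. Hypothetical sketch; it has no
 * attack, hold or release behaviour. */
void noise_gate(float *samples, size_t n, float threshold)
{
    for (size_t i = 0; i < n; i++) {
        float magnitude = samples[i] < 0.0f ? -samples[i] : samples[i];
        if (magnitude < threshold)
            samples[i] = 0.0f;
    }
}
```

The gate decides on level alone, so it has no way to tell a loud echo from a quiet talker.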
1. Re:Noise gate by Anonymous Coward · 2002-05-08 05:40 · Score: 3, Informative
  
  >>For the non-audio-inclined slashdotters, a
  >>noise gate sets a minimum sound level threshold
  >>before the signal is transmitted.
  
  I'm no audio engineer, but it's obvious it wouldn't work.
  
  Person A talks into Microphone A - also picked up with a delay in Mic B.
  
  a) You want to cancel the echo from Mic B, so you use a noise gate. It works as long as I don't talk loud enough to cross the threshold on Mic B. Given that this is in a conference setting, the mic of the person next to me is going to pick me up without that big a difference from the mic right in front of me. What's the difference between me speaking loudly and the person next to me speaking somewhat softly? Not much, and the gate can't tell them apart.
  
  b) To screw things up we only have to get over the threshold. So I'm speaking AND a neighbor starts to speak. The sound at his mic is his voice + my voice. The total is over the threshold. Again, the gate does nothing.
Removing echo by Myshkin · 2002-05-08 05:17 · Score: 5, Funny

sed 's/^echo/#echo/' /etc/inetd.conf >/etc/inetd.new
mv /etc/inetd.new /etc/inetd.conf
kill -HUP $(ps -ef |grep root.*inetd|grep -v grep|awk '{print $2}')

no more echo
1. Re:Removing echo by zbuffered · 2002-05-08 06:09 · Score: 5, Funny
  
  You can do it even easier in DOS:
  echo off
  
  --
  Synergy is your friend
Asterisk PBX by Anonymous Coward · 2002-05-08 05:18 · Score: 5, Informative

There's an excellent open-source PBX called Asterisk. Among other things, it provides an MMX-optimized echo-canceller. Look here
Tough Problem by mellifluous · 2002-05-08 05:18 · Score: 3, Informative

Maybe someone at /. will find an answer for you, but I would be surprised to see this implemented in any kind of stand alone SW package. Because it is a specialized real-time application requiring fast feedback, it makes sense to implement it as an embedded system (i.e. in hardware).
Hack old Modem Drivers! by WndrBr3d · 2002-05-08 05:18 · Score: 5, Interesting

Back in the day when 56k modems were taking off, people were coding a large piece of software into drivers called 'Ring Cancellation'.

These were added because when you send data down an analog line at high speeds, you begin to hear an audible sound which sounds like ringing. The modem drivers needed to be able to tell the difference between this ringing sound and the actual data.

I think a good place to start if you cannot find any software is perhaps hacking these drivers or something along those lines.

It's a good start at least. Hope this helps :-)
1. Re:Hack old Modem Drivers! by mellifluous · 2002-05-08 05:24 · Score: 4, Insightful
  
  I don't think this is quite the same problem. Ring cancellation is looking for a very particular sound with known characteristics. Echo cancellation has to suppress the delayed versions of an arbitrary sound feeding back through the system.
we need more by RealisticWeb.com · 2002-05-08 05:20 · Score: 3, Insightful

This is a great question you are asking, and I would love to see a good answer. The shame of it is, I'm expecting to see a bunch of posts in response to this saying "If you need one then write it yourself".

Is it just me, or does it seem like the open source offerings for things related to audio/video are lacking in general? I wish I had time to make improvements myself, or the money to contribute to the developers, but it seems like we need more in this area to be able to be more competitive with proprietary solutions.

--
Sigs are out of style, so I'm not going to use one...oh wait..
Classic application! by spaceyhackerlady · 2002-05-08 05:25 · Score: 3, Informative

Echo cancellation is a classic application of adaptive filters. Every reference ever published on the subject discusses it. I like Haykin's book myself.
I just did a search on Google and came up with 4000 references.
The underlying theory is pretty hairy, but the implementation of an algorithm like LMS is straightforward.
...laura
1. Re:Classic application! by Erandir · 2002-05-08 07:05 · Score: 3, Informative
  
  Laura's right: you'll find the maths and the algorithms for echo cancellation in most textbooks on adaptive filtering. Check out the July 1999 issue of the IEEE Signal Processing Magazine (it shouldn't be too hard to get hold of it, most university libraries' engineering section should have it) -- it is an issue dedicated to "Adaptive Algorithms and Echo Cancellation". All the maths and algorithms you need are discussed there. Yes, you do need a good background in linear algebra to follow the underlying theory, but the algorithms should be easier to implement, and you're likely to find source code for most of them on the web (LMS filtering is used in many other applications too).
  Echo cancellation is a common design problem in hands-free telephone systems and conference systems; there is lots of literature on the subject. See the references in the articles I mention above.
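
For readers without the textbooks at hand, the LMS recursion they describe is short enough to state here. The notation is the usual textbook convention and is an assumption as far as this thread goes: $\mathbf{x}(n)$ holds the last $N$ far-end (loudspeaker) samples, $d(n)$ is the microphone sample, $\mathbf{w}(n)$ is the current estimate of the echo path and $\mu$ is the step size.

```latex
\begin{aligned}
\mathbf{x}(n) &= \bigl[\,x(n),\; x(n-1),\; \dots,\; x(n-N+1)\,\bigr]^{T} \\
\hat{y}(n) &= \mathbf{w}^{T}(n)\,\mathbf{x}(n)
  && \text{(estimated echo)} \\
e(n) &= d(n) - \hat{y}(n)
  && \text{(echo-cancelled output)} \\
\mathbf{w}(n+1) &= \mathbf{w}(n) + \mu\, e(n)\,\mathbf{x}(n)
  && \text{(coefficient update)}
\end{aligned}
```

A common variant, normalized LMS, replaces $\mu$ with $\mu / (\varepsilon + \mathbf{x}^{T}(n)\,\mathbf{x}(n))$ so that the step size does not depend on how loud the far-end signal is; $\varepsilon$ is a small constant that avoids division by zero.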
The Access Grid uses hardware to do this... by Troy+Baer · 2002-05-08 05:27 · Score: 3, Interesting

The Access Grid is a project started at Argonne National Lab's Math and Computer Science Division to build a mostly open videoconferencing system over the Internet, using multicast audio and video streaming. You may want to take a look at their technology to see if they have ideas you can use.

Anyway, a "node" on the Access Grid consists of a room with at least three computers: a multihead box running Win2k for display to several video projectors, a computer running Linux for audio capture and playback, and another running Linux for video capture. The audio capture machine usually runs into a Gentner AP400, which does echo cancellation as well as phone bridging.

I don't know of anybody who has software that does this; sorry.

--Troy

--
"My life's work has been to prompt others... and be forgotten." --Cyrano de Bergerac
Forget software, get hardware by ttyp0 · 2002-05-08 05:28 · Score: 3, Informative

I remember when I worked at Tellabs we had a product, the EC-8000 Digital Echo Canceller. Might be worth a look.
Searches for echo cancellation software by Seth+Finkelstein · 2002-05-08 05:29 · Score: 5, Informative

Am I misunderstanding the question? A Google search for "echo cancellation" software turns up quite a bit.
Notably, a lead such as: http://www.nist.gov/speech/tests/ctr/h5e_97/echocan.htm
The echo cancelling software (ec_v2.5.tar.gz) that is applied to telephone data, may be obtained from Mississippi State University.
The LDC has provided a perl script (mu_ec.perl) that will take a sphere-headered, 2-channel mu-law waveform file as input, apply the MSU/ISIP echo cancellation software, and produce a sphere-headered, 2-channel mu-law waveform file as output.

Sig: What Happened To The Censorware Project (censorware.org)
1. Re:Searches for echo cancellation software by stilwebm · 2002-05-08 06:15 · Score: 3, Informative
  
  This is a good start. Note that the perl script linked to above only provides raw data to the ec.exe binary, but the source code is linked to on that page. Also, there is more information and the source code at http://www.isip.msstate.edu/projects/speech/software/legacy/fir_echo_canceller/. Nevertheless, consider:
  
  * In running the echo canceller on sparcs (ss20, SPARCserver-1000), it takes between 3 and 4 times realtime to operate.
  
  Now a Pentium III 800 will probably run it in a fraction of the time for an SS20, say 1/2 realtime to 1/4 realtime. But if it is for a mixing project, there will be several streams to process. I wonder if the cost of having to use a dedicated computer for software processing will outweigh the cost of dedicated DSP hardware?
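
The arithmetic behind that worry is simple enough to write down. In the sketch below, `realtime_streams` is a hypothetical helper and "realtime factor" means CPU seconds spent per second of audio for one stream; the 1/2 and 1/4 figures are the poster's guesses, not measurements.

```c
/* How many streams one CPU can process in real time, given the CPU time
 * needed per second of audio for a single stream. A factor of 4.0 means
 * 4 s of CPU per 1 s of audio; 0.25 means a quarter of a second.
 * Returns 0 when even a single stream does not fit. */
int realtime_streams(double realtime_factor)
{
    if (realtime_factor <= 0.0)
        return 0;
    return (int)(1.0 / realtime_factor);
}
```

On those assumptions a machine running at 1/2 realtime handles two streams and one at 1/4 realtime handles four, with nothing left over for mixing or video, while the SPARC figures of 3 to 4 times realtime handle none.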
The Analog devices EZ-Kit (a 2181 demo) has it. by Ludwig668 · 2002-05-08 05:34 · Score: 5, Informative

Check out Analog devices; their 2181 demo has echo cancellation as a part of the included software; source included.
perhaps the reason you can't find it... by ultramk · 2002-05-08 05:40 · Score: 3, Insightful

...is because it doesn't exist.

Realtime processing, AFAIK, be it audio or video, is astonishingly processor-intensive. It doesn't surprise me that DSPs are being used for this reason: they may be the only thing that can cut it in a cost-effective manner.

I.e., you may be able to build a high-end workstation and write some real-time software to handle this task, but since it probably wouldn't be able to do anything else at the same time, doesn't that qualify as a hardware solution?

Instead of going to extreme lengths to remove echoes, perhaps you just need to work harder to prevent them in the first place? Pro audio mags have tons of ways to reduce echo and other unwanted effects in small (usually home) studios. Have you looked into this?

Michael-

--
You catch enchiladas by picking them up behind the head and holding them underwater until they don't kick anymore -VeGas
1. Re:perhaps the reason you can't find it... by blair1q · 2002-05-08 06:52 · Score: 3, Informative
  
  A DSP is just a CPU with one or twelve little two-step and array-math hacks in it. Any CPU that's 2X faster in FLOPs can do the same thing with ordinary arithmetic code.
  
  There are lots of new CPUs that are faster than lots of 5-year-old DSPs.
  
  --Blair
  "But then Microsoft puts the code in a directory somewhere under C:\Windows and kills the market."
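
To make the point concrete, the operation a DSP is typically built around is the multiply-accumulate at the heart of an FIR filter, and it is ordinary arithmetic in C. The function name `fir_sample` and its argument layout are assumptions for illustration only.

```c
#include <stddef.h>

/* One output sample of an FIR filter: a chain of multiply-accumulate
 * steps. history[0] is the newest input sample, history[taps - 1] the
 * oldest. Hypothetical helper for illustration. */
float fir_sample(const float *coef, const float *history, size_t taps)
{
    float acc = 0.0f;
    for (size_t i = 0; i < taps; i++)
        acc += coef[i] * history[i];   /* one multiply-accumulate per tap */
    return acc;
}
```

A general-purpose CPU runs this loop with no special instructions; whether it is fast enough depends only on the number of taps and the sample rate.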
I have echo-cancellation software! by jmv · 2002-05-08 05:41 · Score: 5, Interesting

Look here for my echo-cancellation code:
http://speex.sourceforge.net/audio/sndio.tgz

It's bundled with open-sound calls to read and write audio in real-time, while removing acoustic echo from the input. There's not much doc, but the test2.c program is quite simple. Feel free to contact me at jean-marc.valin@hermes.usherb.ca. Note that there's no real project (sourceforge or other) associated with it, but if you find it useful, I may create one.

--
Opus: the Swiss army knife of audio codec
1. Re:I have echo-cancellation software! by jmv · 2002-05-08 05:54 · Score: 3, Interesting
  
  Two things I forgot to mention:
  1) It works on stereo (or multi-channel) input/output
  2) There's an SSE (float version of MMX) version too
  
  --
  Opus: the Swiss army knife of audio codec
A possible simpler solution by hidden · 2002-05-08 05:58 · Score: 3, Informative

1) Use directional microphones, or else throat mikes. This will make the neighbour's microphone pick someone up only very quietly, if at all.

2) If there is still some echo problem, it should be quiet enough that simple (software) noise gates should solve the problem.
Flash 6 as a possible solution... by Aquaman616 · 2002-05-08 06:03 · Score: 3, Interesting

I've been hearing about some new technology from Macromedia that might make your life a *lot* easier. Apparently the Flash 6 plugin supports hooking into both webcams and mics (after the user OKs it) as well as special socket-based connections to a new piece of server software codenamed TinCan. In addition they've talked about the server supporting shared objects as well.

From what it seems you're able to put code on both the client and server and both are based on ECMAscript. This would let you do a lot more than nearly every other solution I've ever seen. I don't know when the server is supposed to be released, but if you check up on the recent interviews with MM's CTO Jeremy Allaire on C|Net or The Register you'll see that they seem to be hinting that it will be available later this year.

--
A|Q|U|A
You need an acoustic echo canceller by Anonymous Coward · 2002-05-08 06:04 · Score: 5, Informative

Most solutions offered by Ditech, Telogy, etc. cancel the electrical echo caused by an impedance mismatch at the 2-to-4-wire hybrids in the analog part of the Old Telephone Network. You seem to be developing a packet-based videoconferencing system, which has no hybrid in it, so you must want to cancel acoustic echo, caused by the sound from the speakerphone reflecting off the walls of a conference room.
This is a very hard problem, because you have to model the environment of each conference room. You will have to estimate mathematically (with the LMS algorithm, for example) the echo response over a tail of at least 128ms for each room, which would take at least a few minutes to one hour on a P4 2GHz system.
And what if a door is suddenly closed in the conference room? Or what if the speakerphone is moved? You will have to re-model your echo response each time that happens, because the geometry of the room will have changed.
The solution is surely not a software echo cancellation system, at least not before 2010.
Think about a hardware solution, DSPs or ASICs (http://www.octasic.com)
Um..one of each of these... by teamhasnoi · 2002-05-08 06:42 · Score: 3, Informative

One (good) omnidirectional condenser mic in the center of the room; everything will be in phase and mono. Send this signal to a noise gate to cancel out paper rustling, and then a compressor (hard or software). I'd guess a 1:10 (or less), with a threshold of -20dB (give or take) and a soft limiter would do it. This will equalize the volumes between the loud drunk salesguy and the quiet intern. Educate members of the meeting that they need to speak confidently.
I guess I don't see why NOT routing the audio back would be a problem, or maybe I don't understand the question.
Otherwise, save your paper towel rolls, and hand them out before a meeting. I don't do this for a living, so YMMV.
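
What the compressor setting does to levels can be shown with its static curve. This sketch assumes a hard knee and reads the "1:10" above as the ratio usually written 10:1; the function name `compress_db` is made up for the example, and attack and release behaviour are left out.

```c
/* Static curve of a hard-knee compressor, in decibels. Levels at or
 * below the threshold pass unchanged; above it, every `ratio` dB of
 * input produces 1 dB of output. Hypothetical sketch without attack or
 * release smoothing. */
float compress_db(float level_db, float threshold_db, float ratio)
{
    if (level_db <= threshold_db)
        return level_db;
    return threshold_db + (level_db - threshold_db) / ratio;
}
```

With a -20 dB threshold and a ratio of 10, a talker peaking at 0 dB comes out at -18 dB while one at -30 dB is untouched, so a 30 dB gap shrinks to 12 dB before any make-up gain.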
Echo cancellation on 12 lines of code. by Petrus · 2002-05-08 07:20 · Score: 4, Informative

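#define MAX_ECHO_DURATION 10000 // filter length in samples (example: 250 ms at 40 kHz)
#define AdaptationRate 0.5f     // step size; keep it between 0 and 2

// Basic adaptive LMS FIR algorithm, normalized by input power (NLMS) for stability.
// FarEnd is the sample sent to the loudspeaker, Mic the sample from the microphone.
float EchoCancellation(float FarEnd, float Mic)
{
    static float History[MAX_ECHO_DURATION]; // recent far-end samples, newest first
    static float Coef[MAX_ECHO_DURATION];    // estimated echo path, starts at zero
    float EchoAmpl = 0.0f;                   // estimated echo in the mic signal
    float Power = 1e-6f;                     // far-end energy; offset avoids divide by zero
    float Error;
    int i;

    // Shift the history from the oldest end so nothing is overwritten early.
    for (i = MAX_ECHO_DURATION - 1; i > 0; i--)
        History[i] = History[i - 1];
    History[0] = FarEnd;

    for (i = 0; i < MAX_ECHO_DURATION; i++)
    {
        EchoAmpl += History[i] * Coef[i];
        Power += History[i] * History[i];
    }

    // What is left after removing the estimated echo drives the adaptation.
    Error = Mic - EchoAmpl;
    for (i = 0; i < MAX_ECHO_DURATION; i++)
        Coef[i] += AdaptationRate * Error * History[i] / Power;

    return Error;
}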

That's all the "basic" science.
You might find that for 40kHz and 250ms echo this is too computationally intensive for a single Pentium. You may need some 1200 MIPS.

You may then:
1. Use Athlon ;-)
2. Convert it to pointer arithmetic
3. Convert it to integer arithmetic
4. Skip some samples for echo estimation, sometimes
5. Contact me to use more clever algorithm (IIR?)
(Petrus.Vectorius@ied.com)
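
A rough way to sanity-check a figure like that is to count multiply-accumulates. The sketch below assumes one MAC per tap to estimate the echo and one more per tap to update the coefficient, and ignores loads, stores and loop overhead; `lms_macs_per_second` is a hypothetical name.

```c
/* Rough multiply-accumulate rate for an LMS echo canceller.
 * Per input sample the filter spends one MAC per tap estimating the
 * echo and one more per tap updating the coefficient. Memory traffic
 * and loop overhead are not counted. */
double lms_macs_per_second(double sample_rate_hz, double echo_seconds)
{
    double taps = sample_rate_hz * echo_seconds;
    return 2.0 * taps * sample_rate_hz;
}
```

For 40 kHz and 250 ms that is 10,000 taps and 800 million MACs per second, the same order of magnitude as the 1200 MIPS quoted once the remaining instructions are counted. Halving the sample rate cuts the cost by four, since both the tap count and the update rate drop.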
Use low-tech solutions first. by Lumpy · 2002-05-08 07:22 · Score: 3, Informative

If you are creating your studio, then you need to make the studio fix the problem first; don't try to compensate for a crappy studio in the recording hardware/software.

#1- Sonex, sonex, sonex. If you don't have sonex or the crappy sonex copy or even just carpet on the walls (yes, wall carpet looks good) along with the roughest texture ceiling tiles you can buy at the home-depot (or better yet the $90,00 a 2-foot-square city scape audio ceiling tiles) then you are wasting your time. It takes very little to make a room acoustically deadened to the point that properly set up microphones won't pick up any perceptible echo. (Note: if you have your mics set so your artists or voice talent are farther away than 3 inches from the P-popping screen, then you have it set wrong. Also don't let the talent talk quietly; make them talk or sing loud to overcome room acoustics.)

Start with the low tech, then add your high tech bandaid filters.

--
Do not look at laser with remaining good eye.
I've worked in a radio studio by AlaskanUnderachiever · 2002-05-08 07:42 · Score: 3, Informative

Hell I helped build one. And while there is a LOT of noise cancellation and "echo reduction" software on the market (Cool Edit Pro has a few nice plug-ins), the sound quality after applying such a filter could at best be called "fair". Unfortunately your best solution is to find a high quality mic with a bit of noise cancellation (and the higher end ones can be "tuned" with a hardware equalizer) and just suck it up and BUY THE FOAM. I know it's ugly. I know it's a pain in the ass. I know it's only effective if the studio is designed well, but nothing that I have personally seen (well under 40k that is) beats the stuff. Acoustic dampening foam is your cheapest option that will still maintain audio quality to a reasonable degree.

--
Find out about my new childrens book: SS Death Camp Criminal Batallion Go To Monte Carlo For The Massacre