VoIP Calls Double In Quality

Please get the rest of the telcomms to follow. by rob_squared · 2006-07-17 04:06 · Score: 3, Informative

Everything else is stuck at 8khz, so unless your call uses this service end-to-end, there's going to be a downconversion if you're calling someone on a land line. And you'll be stuck with 8khz if you get any calls from someone not on this service.

Still, its a good piece of news, onward and upwards.

*crosses fingers* Please nobody mention video phones. *crosses fingers*

--
I don't get it.

Re:Please get the rest of the telcomms to follow. by joe+155 · 2006-07-17 04:11 · Score: 3, Funny

I'll agree to some extent that this is good news, but my friend, 16khz is a lot of packages which will have to be squeezed through the current pipes which I recieve my internets through; so this will make the speed of internets go down to no faster than the standard post. Did you know the other day I got an internet which had been sent on Friday! *mubbles to self*...

--
*''I can't believe it's not a hyperlink.''
Re:Please get the rest of the telcomms to follow. by a_nonamiss · 2006-07-17 04:13 · Score: 1

It's just a bunch of pipes...

--
-Arthur
Cave ne ante ullas catapultas ambules
Re:Please get the rest of the telcomms to follow. by Anonymous Coward · 2006-07-17 04:22 · Score: 0

Tubes, actually.

And its not like a truck, you can't just dump your voice on it. You have to wait your turn.

I got a telephone on my telephone just yesterday. And it was my friend who called me last week. He was wondering why I didn't answer him.
Re:Please get the rest of the telcomms to follow. by anthm · 2006-07-17 04:22 · Score: 2, Informative

Yes, you are correct. The benefit comes when both ends of the call are using a 16khz device. Situations where you are connecting to the PSTN would obviously be better suited at 8khz. The media description on the pstn gateway would advertise only 8k so the client would know better than to operate at a higher frequency.
Re:Please get the rest of the telcomms to follow. by Jeff+DeMaagd · 2006-07-17 04:30 · Score: 1

Is there a particular reason why it needs to be 16kHz? It's just vocals, what are the upper limit of typical vocals? 8kHz would seem to be plenty unless you are trying to play a violin or something.
Re:Please get the rest of the telcomms to follow. by CastrTroy · 2006-07-17 04:36 · Score: 1

This way once they take away all the other options we can still use the VOIP system to pirate music.

--

Anthropic principle: We see the universe the way it is because if it were different we would not be here to see it.
Re:Please get the rest of the telcomms to follow. by Detritus · 2006-07-17 06:14 · Score: 1

It's also lower distortion and improved signal-to-noise ratio.

--
Mea navis aericumbens anguillis abundat
Re:Please get the rest of the telcomms to follow. by mattavian · 2006-07-17 06:15 · Score: 1

how many kHz does a videophone need to work?
Re:Please get the rest of the telcomms to follow. by cb0nd · 2006-07-17 06:17 · Score: 1

Yes, but then there would be DRM to stop those VOIP using thieves and pirates. Not to mention that, even worse, you can actually use VOIP to share lyrics with the songs.
Re:Please get the rest of the telcomms to follow. by warsql · 2006-07-17 06:25 · Score: 1

Oh, just like 640k ought to be enough for anybody
I know, I know, he never said it.

--
878659 - yep its prime.
Re:Please get the rest of the telcomms to follow. by Anonymous Coward · 2006-07-17 06:27 · Score: 0

The way this is done with Speex you can fit 3 16Khz calls in the same bandwidth of 1 G.711 8Khz call. So it doesnt have to take more data on the internet to accomplish this.
Re:Please get the rest of the telcomms to follow. by amorsen · 2006-07-17 06:32 · Score: 1

The real limit is 4kHz, you need 8kHz sample rate to reproduce that. 4kHz is low enough that it's dissicult to diftinguifh between s and f.

--
Finally! A year of moderation! Ready for 2019?
Re:Please get the rest of the telcomms to follow. by Anonymous Coward · 2006-07-17 06:47 · Score: 0

Off topic, but what's the big deal about the pipes?

It seems to me to be a reasonable approximation to explaining it to laymen.
Granted, you're not going to use that model to configure a router... but I've used a pipes model to describe the difference between modems and cable modems for bandwidth.

Are we just making fun of him for not describing the TCP/IP protocols in details, or am I missing something?
Re:Please get the rest of the telcomms to follow. by denttford · 2006-07-17 07:02 · Score: 1

The kHz here refers to the sampling rate of the audio (and thus an aspect of its quality and does not directly affect the size of the tube needed - only in analogue communications is a direct (and literal) measure of bandwitdh.
Nobody does videeo conferencing over analogue lines anymore - if they aren't sent over the internet, then they use some sort of leased line or ISDN connection, etc. Incidentally, and this probably will confuse the issue, but TV broadcasts use ~6mHz over analogue connections for one way transmission of video and sound.

--

Leben Sie jetzt die Fragen.
Re:Please get the rest of the telcomms to follow. by timeOday · 2006-07-17 07:07 · Score: 1

Is there a particular reason why it needs to be 16kHz?
Maybe it will help me understand consonants better.
Personally I'm disappointed that this is considered impressive. Since it's limited to pure VOIP calls, it should be as simple as selecting a bitrate when you encode an MP3 or record a show on your PVR.
Re:Please get the rest of the telcomms to follow. by Anonymous Coward · 2006-07-17 07:20 · Score: 0

They meant to say "tubes" to make fun of the senator. A lot of us do use "pipes" to describe bandwidth.
Fat pipe, skinny pipe, clogged pipe. :-)
Re:Please get the rest of the telcomms to follow. by SanityInAnarchy · 2006-07-17 08:37 · Score: 1

Stop hitting the pipes, Ted. You want to be sober for this legislation thing!

--
Don't thank God, thank a doctor!
Re:Please get the rest of the telcomms to follow. by SanityInAnarchy · 2006-07-17 08:40 · Score: 1

While we're originally making fun of him for the word "tubes", that's only because it's easily memorable/recognizable, compared to the near-incomprehensible comments about a staffer sending an Internet to him...

The real reason we're making fun of him? In the words of John Stewart, "Maybe it's because you don't know jack-shit about the Internet."

--
Don't thank God, thank a doctor!
Re:Please get the rest of the telcomms to follow. by riflemann · 2006-07-17 08:51 · Score: 1

Everything else is stuck at 8khz, so unless your call uses this service end-to-end, there's going to be a downconversion if you're calling someone on a land line. And you'll be stuck with 8khz if you get any calls from someone not on this service.

Even more worrying is that you will get progressively worse audio quality through the telephony chain as the audio undergoes several up and down conversions in sample rate.

One of the really neat things about the common exiting audio codecs in use for telephony (G711, GSM, G729, etc) is that they are only lossy at the initial conversion. So if you get some device that's encoded the audio in GSM, then decompress it to G711 for the PSTN, and then re-compress it to GSM later on, the resulting GSM will be the same quality as the original GSM.

In other words, converting audio codecs back and forth between PSTN and (the same) compressed codec does not progressively degrade beyond the first compression, no matter how often you do it. This is a big difference to mp3 and other non-voice codecs, and vital for global telephony.

Acheiving them same with sample rate conversion will require developing similarly clever algorithms.

--
Sparks:Gadget:Beer Maker
Re:Please get the rest of the telcomms to follow. by mattavian · 2006-07-17 08:57 · Score: 1

(j/k) re: parent.
Re:Please get the rest of the telcomms to follow. by Andrew+Kismet · 2006-07-17 09:22 · Score: 1

your sarcasm/humor was lost in the internets. Perhaps you need to put your posts at a higher encoding to make sure you can tell the difference between serious questions and jokes?
Re:Please get the rest of the telcomms to follow. by denttford · 2006-07-17 12:18 · Score: 1

heh.

sorry, seen such dumb things of late here (there are eight bits in a byte!) it's hard to tell anymore.

--

Leben Sie jetzt die Fragen.
Re:Please get the rest of the telcomms to follow. by Anonymous Coward · 2006-07-23 17:57 · Score: 0

Skype has 16 kHz sampling.
For this reason the audio is so good.

Good Work by kasgoku · 2006-07-17 04:14 · Score: 3, Insightful

good work there, but all you need is to get the message across. its not like u r singing on the phone and need good voice quality. just do what's needed.

Re:Good Work by Tychon · 2006-07-17 04:30 · Score: 2, Insightful

But for those of us with a bit of trouble hearing, or when speaking with a person that has a thick and or foreign accent, that extra quality is the difference between a conversation and a stream of "What'd you say?"

So what? by Spazmania · 2006-07-17 04:16 · Score: 4, Insightful

So what? If you're going to up the sampling rate why not go directly to 44khz stereo (CD quality audio) and be done with it? Jumping from the telephony industry standard 8khz to 16 khz is thoroughly uninspired.

--
Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.

Re:So what? by Anonymous Coward · 2006-07-17 04:21 · Score: 0

Sure, because voice transmissions need that 16-20 kHz chunk (above the audibility of ~ half the population) to be completely convincing. Especially over those soooo high quality handsets.
Re:So what? by xachen · 2006-07-17 04:21 · Score: 3, Interesting

If you find a codec that does 44kHz stereo, FreeSWITCH will do this. It has no hard limit in it and is variable to any rate! This is just awesome!
Re:So what? by Anonymous Coward · 2006-07-17 04:24 · Score: 0

You mean 8-20 KHz. The maximum frequency is half the sampling rate.
Re:So what? by frequenicity · 2006-07-17 04:24 · Score: 1

What would be the point of going to 44khz stereo for a mono signal?
Re:So what? by treeves · 2006-07-17 04:24 · Score: 1

. . .and I wouldn't call it a doubling of quality. The improvement of adding one octave of high frequency is, subjectively, less than a doubling. A human voice is quite intelligible with no frequencies above 4 kHz or so present.

--
...the future crusty old bastards are already drinking the Kool-Aid.
Re:So what? by anthm · 2006-07-17 04:35 · Score: 1

It's ok if you are not interested but there are some who are. Here is an article about some of the benefits. http://www.analogzone.com/nett0307.pdf as well as a wiki entry from http://www.voip-info.org./ http://www.voip-info.org/wiki/view/Wideband+VoIP
Re:So what? by Vellmont · 2006-07-17 04:36 · Score: 1

So what? If you're going to up the sampling rate why not go directly to 44khz stereo

Because stereo would be a complete waste of bandwidth and processing power (one microphone, one speaker), and the human voice doesn't get anything near 22khz in frequency. Normal speaking voices have an even lower cutoff frequency. The CD standard is great for music, but complete overkill for sending voice.

--
AccountKiller
Re:So what? by slyvren · 2006-07-17 06:41 · Score: 1

YES! While we're at it why don't we all invest in stereo microphones and headsets. That way when we talk to our friends through the internets we can talk around the mic and make it sound like we're swirling in their heads... GENIUS!
Re:So what? by zenslug · 2006-07-17 06:53 · Score: 1

Make it stereo then. Makes video conferencing even better.
Re:So what? by Anonymous Coward · 2006-07-17 07:36 · Score: 0

Have you ever digitized speech (say, for a podcast)? I have. 8khz (phone rate) sucks... we're all familiar with "telephone voice". No data above 4khz (8khz sample)? ya, right.

24khz sample rate is the sweet spot. 95% quality compared to 44.1khz.

16khz is really close (slightly above AM radio quality). I'd give it 80% quality compared to 44.1khz.

32khz is indistinguishable from cd quailty.

16khz is a big jump up in quality from 8khz. I can provide some samples if you like to compare for yourself.
Re:So what? by frequenicity · 2006-07-17 07:45 · Score: 1

I understand that. I guess I am asking what the advantage for this would be. The audio that you are recieving is coming from a single (mono) source...so it seems to me that boosting the quality to 44khz stereo would really be wasted bandwidth.
Re:So what? by Brickwall · 2006-07-17 08:17 · Score: 1

I wish some people here actually KNEW something about the telephone network. First off, there are still hundreds of thousands of miles of copper wire in the network. Much of it is connected to 'loading coils', which are essentially low-pass filters. Any frequencies over 4kHz are attenuated, so your 44kHz is just a dream. Telephone engineers knew that; that's why they picked the 8kHz sampling rate (Nyquist theory). Second, as someone else pointed out, there remains the question of getting every single telco, and switch (PBX), upgraded to support 16kHz sampling. That's not going to happen for a very long time, given that 8kHz sampling works just fine. If you have trouble hearing someone, it's probably due to a lousy local loop at one end, not sampling; on a full digital end-to-end call, you can hear people breathing. This sounds more like a technology in search of a problem.

--
What was once true, is no longer so
Re:So what? by treeves · 2006-07-17 08:52 · Score: 1

I certainly did not say "no data above 4 kHz".
No doubt there is a significant perceptible difference, it's just that in terms of *intelligibility* (what really matters for a voice phone call) there isn't THAT big a difference between 8kHz and 16kHz, certainly not a doubling.
Put it another way, if 95% of listeners can understand a sentence uttered by a speaker at the other end at 8kHz, maybe 96% can understand at 16 kHz. And yes, I just pulled those numbers out of my posterior, but hopefully you get the idea I'm trying to impart. ;-)
For music, I'd agree that the difference is much more important.

--
...the future crusty old bastards are already drinking the Kool-Aid.
Re:So what? by Antony+T+Curtis · 2006-07-17 09:39 · Score: 1

So what? If you're going to up the sampling rate why not go directly to 44khz stereo (CD quality audio) and be done with it? Jumping from the telephony industry standard 8khz to 16 khz is thoroughly uninspired.

16kHz is pretty similar to analog FM radio transmissions and people have been listening to music on that medium for a long time quite satisfactorily. Besides, if you want to have high fidility transmission of music over the internet, there is already pretty decent solutions with streaming ogg/mp3.

IMO 8kHz was fine for basic one to one voice conversations. 16kHz will be much better and less confusing when having conferences as it will be easier to differentiate between peoples voices.

--
No sig. Move along - nothing to see here.
Re:So what? by Anonymous Coward · 2006-07-17 09:52 · Score: 0

> *intelligibility* (what really matters for a voice phone call) there isn't THAT big a difference between 8kHz and 16kHz, certainly not a doubling.

I totally agree with you there.

But, a doubling of the sample rate does add some intelligibility, and a lot more tonal info. 16khz sounds almost natural.
Re:So what? by SirDaShadow · 2006-07-17 12:04 · Score: 1

If you find a codec that does 44kHz stereo, FreeSWITCH will do this. It has no hard limit in it and is variable to any rate! This is just awesome!

Get foobar 2K and grab the free mp4+SBR codec from nero. You can turn a cd quality stereo signal (mp3, whatever) into a svelte, 16kBIT/s 44khzs/stereo (!) signal without much quality loss (well at least compared to current telephony anyway...)
Re:So what? by Spazmania · 2006-07-17 12:38 · Score: 1

You missed the point: with internet connections rapidly reaching video speeds and the telephone network very much tied to 8khz there is no value in having a 16 khz VoIP. If you're going to up the sampling rate only for VoIP, go straight to 44khz and be done with it. Don't brag because you were dumb enough to select a median value.

--
Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.
Re:So what? by cdrudge · 2006-07-17 12:40 · Score: 2, Insightful

Yeah, but the on hold music sounds great!
Re:So what? by billcopc · 2006-07-17 14:33 · Score: 1

Dude.. have you never worked in a call center ? I would kill to have a phone system that runs at 16khz, better yet 32khz. Don't double the bitrate, maybe a 30% increase would be enough, just move the filter cutoff freq higher because not everyone's voice has intelligible transients in the low-khz range, often times those voices get wrecked by the filtering and all you can hear is mumbling, as if the caller were talking with the mouthpiece in their armpit :P

Higher frequency from the source, then less aggressive filtering and shaping to give a more natural sound. That's gotta me the #1 reason why I dislike phones. I have golden ears and phone conversations just don't sound human to me.

--
-Billco, Fnarg.com
Re:So what? by treeves · 2006-07-18 05:35 · Score: 1

No, I haven't. Maybe that's my problem, I don't like talking on the phone to begin with, so I don't do it anymore than I have to, but I don't think it's because of the audio quality. ;-)
Thanks for your comment - whoever designs and buys phones and voice networks should obviously give more weight to your opinion than mine.

--
...the future crusty old bastards are already drinking the Kool-Aid.
Re:So what? by zenslug · 2006-07-25 02:40 · Score: 1

use two mics

Define: IVR by theGreater · 2006-07-17 04:18 · Score: 3, Informative

Google gives the definition of IVR as Interactive Voice Response.

So I knew what one was, I just didn't know there was a TLA for them. This inane personal revelation brought to you by the captcha "accuse".

-theGreater.

I agree in general terms . . . by mmell · 2006-07-17 04:21 · Score: 1

but it needs to be done in a way that won't 1) swamp the internet backbone with huge quantities of digitized voice telephony, and 2) give ISP's a good excuse to insist on multi-tiering the internet.

That said, can video telephony and the kind of communication we've seen portrayed on Star Trek et. al. be far behind?

Doubling? hardly by MacBoy · 2006-07-17 04:23 · Score: 3, Insightful

I fail to see how adding one additional octave of frequency response to the 6 or 7 currently available, can be called "doubling" the quality.

Re:Doubling? hardly by Anonymous Coward · 2006-07-17 04:26 · Score: 0

Cuz an octave is doubling the frequency? I'm wondering who this is for? Are Alvin and the Chipmunks making a lot of voice calls?
Re:Doubling? hardly by jdmicklos · 2006-07-17 04:33 · Score: 5, Informative

The only real advantage to adding in "unused" octaves is in order to transmit overtones. Overtones shape the sound you can hear even though they may not be hear directly. Think about it as if you were to have a G note at 120 dB playing in an octave that you couldn't hear. It would still cause all things around with a fundamental frequency that is a "G" to vibrate as well as color certain audible noises.

--
-Jon
Re:Doubling? hardly by slyvren · 2006-07-17 06:47 · Score: 1

Correct me if I'm wrong, but I'm assuming this means the digtal to analog conversion rate. This means it's sampling the analog audio 16,000 times per second instead of 8,000 times per second. Which in theory is double the "quality".
Re:Doubling? hardly by ejdmoo · 2006-07-17 06:49 · Score: 1

I thought this number referred to the sampling rate...

You're thinking 8bit audio to 16bit audio.

CMIIW
Re:Doubling? hardly by slyvren · 2006-07-17 07:01 · Score: 2, Interesting

Actually 8 bit to 16 bit is far greater than double quality. The quality essentially doubles everytime you add a bit.
Re:Doubling? hardly by timeOday · 2006-07-17 07:13 · Score: 1

What? No. They're only talking about 16 KHz sampling here, which would capture sounds up to 8 KHz. You can hear 8 KHz directly, these are not "unused" octaves.
Re:Doubling? hardly by tverbeek · 2006-07-17 07:15 · Score: 1

I've got a buddy who uses VOIP, and I can assure you: the quality of his phone calls to me has not doubled. It's all the same old "Dude, there's this chick on tv right now, I'm not sure which channel, who is like majorly hot. Turn it on!"

--
http://alternatives.rzero.com/
Re:Doubling? hardly by cfulmer · 2006-07-17 07:25 · Score: 1

The maximum frequency that can be carried is proportional to the sampling rate -- if I recall correctly, the highest frequency that can be carried is half the sample rate. Sample 8000 times per second and you can carry up to 4 kHz. At 16000, it's 8 kHz. People can hear up to about 20 kHz, so this does increase the frequency range. Since 'going up an octave' means doubling the frequency, the previous poster was correct. The end result is only to raise the maximum frequency by an Octave.

The bigger problem, though, is the sample size. The traditional phone system uses an exponential 8-bit sample size. The result of this is that the softer the audio, the finer the reproduction. For example, say that the signal level were linearly measured from 0 to 1000 and mapped to the numbers 0-127. Under this sort of encoding, 1 might map to 1, 2 to 2, 3-4 to 3, 5-7 to 4 . . . 630-800 to 126 and 800-1000 to 127.

The end result is that loud sounds are more distorted by the telephone system than soft sounds.
Re:Doubling? hardly by Jerry+Coffin · 2006-07-17 07:36 · Score: 1

The only real advantage to adding in "unused" octaves is in order to transmit overtones.

I'm pretty sure his point is that it's only one more octave. What the phone companies consider an "ideal" response for a telephone line is a bandwidth from about 180 Hz to 3-4 KHz or so, with a signal to noise ratio of about 45 dB. That means an ideal POTS line starts with about 4.5 octaves of bandwidth, and this increases that to about 5.5 instead. IOW, even though it doubles the maximum frequency, the perceived change is only about 20-25% at most.
In reality it'll usually be less than that. The perceived quality is reduced only to the extent that the original signal started out with energy in that region. Take a look at a spectrum analysis of a voice. This is a recording of a woman (more or less) singing a scale. The highest note is the last one, so let's look at it (though you'll see there's relatively little difference between them). What you're looking for is the graph titled: "This is the eighth utterance." The highest peaks are at about -20 dB. There's only one peak at or above 4 KHz that goes above -50 dB, meaning the signal is at least 30 dB down at that point. By 8 KHz, the signal has dropped even further, to around -65 dB, or about 45 dB below the primary signal -- and the energy drops still further above that point.
That means, even if the normal POTS line actually transmitted signal at higher frequencies, it would be almost entirely buried in noise anyway, unless they also reduced noise levels. It turns out that while low noise levels are generally good, on a phone most people don't like it much.
In fact, on a current phone syste, much of the noise is there intentionally. To maximize bandwidth, especially over the long distance networks, they compress the signals, and when the compressor detects a time that nothing is being said, instead of transmitting compressed noise, it transmits a more compact signal saying there's no real signal right now. On the receiving end, they respond to this by synthesizing and injecting what they call "comfort noise". Without this, many people think the call has been dropped, because it sounds like it has "gone dead".
In case anybody wonders: I talked about 4 KHz and 8 KHz instead of 8 KHz and 16 KHz. The 8 and 16 KHz are the _sampling_ rates. A guy named Nyquist long ago theorized that to sample a signal meaningfully, you have to sample at a frequency at least double that of the signal itself. On the decoding side, you have to do some filtering, and in the process you normally lose at least a little bit of the bandwidth near the limit, so an 8 KHz sampling rate means the theoretical maximum signal is 4 KHz, but in reality it'll normally be a little less than that.

--
The universe is a figment of its own imagination.
Re:Doubling? hardly by tincho_uy · 2006-07-17 23:33 · Score: 1

It won't. While there's a noticeable difference in quality when listening to 8KHz and 16KHz sampled speech, it certainly won't double the perceived quality. Even more so if it's in a VoIP context, where other factors such as the loss rate and distribution, forward error correction and the choice of codec (which tend to be of the non-PCM kind) play such big roles. Just my 2 cents...

What's wrong with the current implementation? by HockeyPuck · 2006-07-17 04:30 · Score: 1

We're a Cisco VOIP shop and phone conversations sound fine. I'm not sure how going from 8->16 would make it any better.

Re:What's wrong with the current implementation? by porkThreeWays · 2006-07-17 09:40 · Score: 1

It does make a difference. 44KHz would be ideal, but 16 is good. The original 8KHz is a carry over from the old telecom days. That's how much uncompressed voice data they could carry over a single copper line. So in essence voice quality really hasn't improved much on telephones since the 80's.

It would make understanding people who mumble, have poor english skills, lispers, etc, etc, significantly easier. 44KHz would be ideal, but 16 would be an improvement. I'm pretty sure however that many VoIP soft switches can do things like this anyway for internal calls. As long as the PSTN still has to cater to relic's from the 80's though nothing's going to really change (and it wouldn't be fair anyway because many poor countries can barely handle 8KHz as it is).

--
If an officer ever threatens to taze you, say you have a pacemaker.

PING Ted Stevens by Rob+T+Firefly · 2006-07-17 04:30 · Score: 4, Funny

This can only mean twice as much material filling up the tubes.

--
Slashdot Burying Stories About Slashdot Media Owned

Re:PING Ted Stevens by mr_flea · 2006-07-17 06:09 · Score: 1

The horrible part is when the tubes get backed up, it makes a horrible mess all over your bathroom floor...

Oh wait...

High-Def Telephony with Open Source Soft-Switch! by evilviper · 2006-07-17 04:34 · Score: 1

I wasn't aware that telephones even HAVE "definition", let alone that they are in HIGH DEFINITION now.

definition 4. a. The clarity of detail in an optically produced image, such as a photograph, effected by a combination of resolution and contrast. b. The degree of clarity with which a televised image or broadcast signal is received.

Of course, what do I know... I didn't realize wireless networking equipment had fidelity, either (ie. WiFi).

--
Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant

Only a slight improvement by riflemann · 2006-07-17 04:36 · Score: 4, Informative

Actually, I've used Asterisk to pass through 24KHz Speex encoded audio - very impressive sound quality, but only works when the SIP channel is client to client.

In theory a SIP server doesn't need to know all of the codecs a client supports - the clients themselves negotiate any compatible protocol.

Of course, if the sip server puts itself in the path (such as when it needs to pass through to PSTN or firewalled clients), then 8KHz is the (till now) maximum supported rate.

--
Sparks:Gadget:Beer Maker

Re:Only a slight improvement by jmv · 2006-07-17 14:07 · Score: 1

Actually, I've used Asterisk to pass through 24KHz Speex encoded audio - very impressive sound quality, but only works when the SIP channel is client to client.

Care to provide more info on this. Speex is *not* optimized for 24 kHz so it would probably sound worse than 16 kHz or 32 kHz. If the devs are indeed using 24 kHz, it's probably a bad idea that would be fixed. (BTW, I know what I'm talking about -- I wrote Speex)

--
Opus: the Swiss army knife of audio codec

Right... by Anonymous Coward · 2006-07-17 04:36 · Score: 0

...just like going from a 32- to 64-bit processor doubles it's processing power.

Re:Right... by Anonymous Coward · 2006-07-17 06:40 · Score: 0

. . . ignorance on SlashDot . . .

64bit is just a larger address space . . . not a faster processor . . .

If your application is written to take advantage of the 64bit addres space at the same speed of processor, you can chew through twice as much data in the same amount of time. Otherwise, you are just running at the same speed as always.

Heck, you have to buy special versions of MS OS to even get 64 bit support on your new "double the processing power" 64 bit processor.

Get off the Internet and turn off your computer before you hurt yourself.
Re:Right... by LocalH · 2006-07-17 10:15 · Score: 1

Wow, you completely missed the sarcasm.

--
FC Closer
Re:Right... by Loonacy · 2006-07-17 11:48 · Score: 1

Heck, you have to buy special versions of MS OS to even get 64 bit support on your new "double the processing power" 64 bit processor.

Umm... I'm running at 64 bits right now, and I don't even HAVE an MS OS (assuming you mean MS == Microsoft). Nor did I have to buy any special OS at all. I just downloaded it. Legally, even.

Get off the Internets and turn off your hard drive before you hurt yourself.

The telemarketers by The+Relentless · 2006-07-17 06:04 · Score: 1

will be able to clearly understand me when I say, "I can't talk now, my leg is on fire."

can we say astroturf? by bferrell · 2006-07-17 06:10 · Score: 1

the submitter is the author of the code.

Move along, nothing to see here yet

Big Whoopie by jmorris42 · 2006-07-17 06:11 · Score: 2, Insightful

The problem isn't making a software based IVR system or even a softswitch run at a better rate. Now find me a SIP phone that runs at anything other than 8Khz. No, I'm not talking about a F/OSS softphone, but a real hardphone. They have the minimum DSP power the manufacturers can get away with to support 8Khz. Now find me a PRI that can interface with it. For now that is still an issue.

Skype has been running their softphones at higher than 8Khz/8bit so their softswitch obviously was the first widely deployed one to leave 64kbit max quality behind.

Yes, someday all telephony (except legacy telco stuff that will never change, which will be a shrinking market) will offer higher quality audio and an option for video. But not for a few more years until the saturation of next gen telephony products gets better.

--
Democrat delenda est

I don't think this is the real problem by Sarusa · 2006-07-17 06:13 · Score: 1

8khz to 16Khz is fine, but that's not usually the problem we encounter with VOIP. It's latency and dropped packets, which this will just make worse. But if you're doing this on your own network only then I can see where this would be neat.

Nothing special here... by Anonymous Coward · 2006-07-17 06:16 · Score: 1, Informative

They're just using a higher quality codec than G.711 (which is the standard for the back-end digital phone system).

The phone people (probabably AT&T) chose that standard since it gave pretty good voice quality given the limitations of current technology.

People are generally happy with the voice quality of the phone system - which is different from the voice quality of the last mile - the analog copper loop to your house, or CDMA/GSM/TDMA to your cell phone.

It's highly unlikely this new codec will catch on - the installed base of G.711 phone systems out there is enormous.

Marketing BS by jheath314 · 2006-07-17 06:17 · Score: 3, Insightful

This "improvement" is idiotic. The thing which most limits the quality of a VoIP call is delay and jitter, NOT the sampling rate. Guaranteeing the quality of a telephone conversation over the internet is tricky because the internet was originally designed for best-effort packet delivery, with no guarantees on packet delay, sequence, or even (at the network layer) delivery.

If anything, this feature reduces end-to-end quality by doubling the amount of data being sent down the pipe, as you'd need to buffer more data at the same transmission speed to correct for jitter. Brillant!

--
Procrastination Man strikes again!

Re:Marketing BS by anthm · 2006-07-17 06:32 · Score: 2, Informative

FYI: 20ms of 16khz audio (the typical size of 1 RTP packet) encoded with the Speex Codec http://www.speex.org/ is 43 bytes. 20ms of 8khz audio encoded with the Speex Codec http://www.speex.org/ is 29 bytes which is only 1.4 times as big as it's 8khz counterpart. 20ms of 8khz g711 is 160 bytes so with speex at 16khz, you can still fit 3 calls in the same amount of bandwidth that it takes for one 8khz call. The biggest overhead in VoIP is the various headers on each RTP packet per level of encapsulation, not the size of the payload.
Re:Marketing BS by amorsen · 2006-07-17 06:35 · Score: 1

Guaranteeing the quality of a telephone conversation over the internet is tricky because the internet was originally designed for best-effort packet delivery

There's more to VoIP than the Internet, you know. Some of us work with lines which are guaranteed big enough or have QoS.

--
Finally! A year of moderation! Ready for 2019?
Re:Marketing BS by Anonymous Coward · 2006-07-17 07:07 · Score: 0

Do you only VoIP with people with the same big lines / QoS?

Nor do I.
Re:Marketing BS by Anonymous Coward · 2006-07-17 07:48 · Score: 0

With Speex high quality VoIP conversations (16bits/16kHz) can occur with just 6kb/s (3kb/s in each direction). For an example check out www.omilyx.com (Pocket PC onwards/Win32) or www.voiperized.com (Win 32 only).
Re:Marketing BS by jmv · 2006-07-17 14:11 · Score: 1

Thanks. That's something a lot of people forget. Actually, the overhead of the headers is usually 16 kbps, i.e. about as much as the codec data itself. That's also why very low bit-rate ( 8kbps) codecs are (almost always) useless in VoIP.

--
Opus: the Swiss army knife of audio codec

Is it.. by bruno.fatia · 2006-07-17 06:20 · Score: 1

Is it even a difference human ear can notice? I mean, VoIP calls today are pretty good..

Re:Is it.. by azurepalm · 2006-07-17 06:38 · Score: 1

Try doing a Skype call to an international country and you'll see the difference in "reception" (quality). Probably not entirely Skype's fault, but any improvements will make a difference.
Re:Is it.. by anthm · 2006-07-17 06:39 · Score: 1

The more quality, the easier it is to perform detection algorithms for things like speech recognition, and yes you can notice the difference as long as the audio was generated digitally by a microphone+soundcard or with something like cepstral that defaults to 16khz for a reason.
Re:Is it.. by Anonymous Coward · 2006-07-17 08:28 · Score: 0

What the hell is an international country?
Re:Is it.. by Anonymous Coward · 2006-07-17 08:58 · Score: 0

Yes. The human ear has a dynamic range of roughly 20 kHz, although it decreases with age.

Bits is Bits by Detritus · 2006-07-17 06:27 · Score: 1

It's more complicated than doubling the sampling rate. Standard PCM telephony uses 8 kHz sampling rate, 8-bit samples, non-linear encoding. It's fairly simple, resulting in 64 kbps.

Speex is a CELP (code excited linear prediction) codec that is far more complex than the simple PCM system used by the telephone company. The resultant bit rate can be fixed or variable, and is not rigidly tied to the sampling rate used for data acquisition.

--
Mea navis aericumbens anguillis abundat

Ugh, Don't get me Started by Greyfox · 2006-07-17 06:32 · Score: 1

I mean sure I can route a call through Enum or DUNDi (Well... my DUNDi peer group only has 2 nodes right now, so that's kind of pointless) and it could be pure digital. I've yet to find softphone I/O solution that doesn't suck (Maybe a bluetooth headset would be OK if it could push that sort of quality) so it's still much easier to dump the call out to an old $10 wireless RadioShak special via the digium FXS card.

The VOIP to PSTN scene kind of sucks at the moment anyway. There are a lot of fly-by-night operators offering a vareity of confusing plans which may or may not leave you stuck on proprietary hardware. I'm more inclined to place my bets on private DUNDi peering networks like fwdout. It might be easier just to find someone who wants to call numbers local to your phone line and trade them service in your area for service in their area. Better yet don't terminate into the PSTN at all, except that in my case neither my parents nor my surviving grandparents have high speed Internet access.

--

I'm trying to teach myself to set people on fire with my mind... Is it hot in here?

My voice bandwith runs at 80 KHz! by wsanders · 2006-07-17 06:43 · Score: 1

So it's 10 times better than the Evil (tm) telcos!

And my software puts a green stripe around the edge of the data too... sucka!

--
Give a man a fish and you have fed him for today. Teach a man to fish, and he'll say "WHERE'S MY FISH, YOU IDIOT?"

Re:High-Def Telephony with Open Source Soft-Switch by anthm · 2006-07-17 06:45 · Score: 1

Well sure, VoIP is digital audio over the internet. It contains all of the same properties as any digital audio. It can vary from CD quality down to unintelligable static.

Because it covers almost all of the human voice by Sycraft-fu · 2006-07-17 07:10 · Score: 2, Informative

Our voices don't have that wide a frequency range, there's little up in the high frequencies. A voice sample recorded at 22kHz (11kHz frequency range) is very hard to distinguish from one recorded at 44kHz (22kHz frequency range). In fact you'd need to be using a fairly good mic to really get much of the higher frequencies anyhow. 8kHz works since F1 and F2 (the frequencies of the first two peaks in the harmonic curve) fall under 4kHz for essentially all speakers. F1 and F2 are what we primarly use to determine vowel sounds and thus are what's realy relivant. Well with an increase to 16kHz you get F3 and even F4 which leads to pretty natural sound as far as most listeners are concerned. Past that, there's just not a whole lot that affects your perception of speech.

The reason for chosing 16kHz is probably simply that it's twice what you have before. Thus if you are interfacing with an old system that doesn't support it, just discard every other sample, no sample rate conversion needed (which is CPU intensive).

Re:High-Def Telephony with Open Source Soft-Switch by DA-MAN · 2006-07-17 07:13 · Score: 1

I wasn't aware that telephones even HAVE "definition", let alone that they are in HIGH DEFINITION now.

Apparently audio can have "definition"...

http://en.wikipedia.org/wiki/High_Definition_Radio

Of course, that's only in the same as networking equipment has fidelity...

--
Can I get an eye poke?
Dog House Forum

Not really free? by Anonymous Coward · 2006-07-17 07:31 · Score: 0

If you have to provide your name, address, phone number, e-mail, etc., just to download it, it should not be called "free"switch; it should be called "spam"switch.

Re:Not really free? by anthm · 2006-07-17 08:20 · Score: 1

Dear entitlist,

Please send us your name and email address and we will
send you instructions on how to download our unquestionably
open source code without having to provide any information.
Re:Not really free? by Anonymous Coward · 2006-07-17 09:22 · Score: 0

UH, how about paying more attention? On the right side of the site where it says "Download Now"

Let me be the first to say... by jheath314 · 2006-07-17 08:18 · Score: 1

16 kHz should be enough for anyone ;)

--
Procrastination Man strikes again!

Will this help the hard-of-hearing? by Anonymous Coward · 2006-07-17 08:58 · Score: 0

I can't hear anything on the phone with my left ear, presumably because that eardrum's heavily scarred. I can hear most people talk with the left ear when they aren't on the telephone, but I get nothing over a phone.

I can't understand Lorena McKennitt's ballads using either ear (the right eardrum has much less scar tissue, and I can hear most things, including phone voices, but not including bats or Lorena, with the right ear).

No, really. I'm serious.

Moore's Law by Goody · 2006-07-17 09:03 · Score: 1

OMG, at this rate, we'll have 64 kHz calls in 6 years, and 128 kHz in 12 years!!!!

(Going from 8 kHz to 16 kHz isn't a "doubling of quality" :-P )

--
Tired of being "punished" by the Slashdot $rtbl since 2002. I'm now over at http://soylentnews.org/ .

Re:Moore's Law by anthm · 2006-07-18 02:00 · Score: 1

OMG, actually, we can actually operate at any sample rate we want! 16khz was just a logical test because the phone we tested it with supported it.

Virtually pointless by Anonymous Coward · 2006-07-17 09:12 · Score: 0

Until the hardware terminating a call to PSTN supports the quality of these shiny new codecs, there's more or less no point to them. Seriously, that's never, EVER going to happen. Telecom' hardware manufacturers are going to stick with G.711 and similar ITU standards.

The change will come when IP -> IP calling comes of age and the legacy hardware players are taken out of the loop entirely.

Re:Virtually pointless by anthm · 2006-07-18 02:12 · Score: 1

You may have a point, we should stick to ITU codecs like perhaps, g722 http://www.umiacs.umd.edu/~desin/Speech1/node3.htm l Oh waddya know! its a wideband codec! yay does that mean we can use it now???? Not exactly pointless, You can do conferencing, ivr, voicemail and media proxy calls from a SIP phone all at 16khz Mostly only client applications have been able to operate at this rate. Now we actually have a switching platform that will allow people to interact with the calls. The goal is not to make the pstn 16khz it's to make devices that are better than legacy phones able to do all of the same things *without* the PSTN.

First, PSTN is a 4 kilohertz bandwidth by Beryllium+Sphere(tm) · 2006-07-17 09:40 · Score: 1

Theoretical maximum, may be as low as 3.

Second, this is enough to capture most of a human voice. Can you hit a high "C"? That is about one kilohertz.

Everything above 1kHz is being used to carry ever-dimishing harmonics that provide resolution for fast-rising sounds like "k" and "p". There's a slight loss of detail at 4kHz and very little at 8kHz. There is no honest way to refer to a move from 8 to 16 as "doubling the quality". Sycraft-fu's post has it right. In fact, if I were designing the system I'd put in a gentle low-pass filter to keep flyback transformers and the like out of the channel.

Besides, look at the response curve of human hearing, and notice how many dB it has dropped by 16 kHz.

16KHz is nothing. by Darth+Android · 2006-07-17 11:12 · Score: 1

For those of you who are not IRC junkies, the IRC client KVirc has built-in support for 44.1 KHz "voice chat" (not sure if it qualifies for "VoIP", but is a simple direct connection between two computers supporting real-time audio transfer). Not only does it support 44.1 KHz, but it has for at least a year (when I started using it). What's the big deal with 16KHz?

--
Do not meddle in the affairs of dragons for you are cruchy and good with ketchup.

Re:16KHz is nothing. by anthm · 2006-07-18 02:26 · Score: 1

If you wish, you are welcome to operate at 44khz. We support any speed we just tested it with 16khz.

Yum by Anonymous Coward · 2006-07-17 11:21 · Score: 0

Wahoo. Now I won't just hear that the slob on the other end is eating while on the phone, but maybe with this higher quality I'll actually be able to tell WHAT he's eating.

16 kHz by jmv · 2006-07-17 12:04 · Score: 1

That's called wideband speech. It's been around for 10+ years and Speex supported it about 4 years ago. About time people actually use it (i.e. why people are still using narrowband in VoIP is beyond me).

--
Opus: the Swiss army knife of audio codec

I need it by r00t · 2006-07-17 13:32 · Score: 1

Think of the hold music.

Now imagine that it responds to button presses so you can change songs.

"Operator... oh won't you help me make this call..."

Multiple codec conversions *are* really bad by billstewart · 2006-07-17 14:09 · Score: 1

Don't know where you got your information, but it's unfortunately incorrect. There are some of the early ADPCM codecs (e.g. 32kbps) that do the same transformations every time, so you can convert from 64 to 32 to 64 to 32 to 64 again without additional damage, but most of the newer high-density codecs take a significant hit if you do them two or more times, e.g. 64kbps to 8 to 64 to 8 to 64. I've forgotten the precise MOS scores, but the standard G.729 codec family goes from "better than a cellphone with a decent headset" at the initial compression to "worse than a cellphone in bad traffic" with two 8kbps compressions to "cellphone in a tunnel full of trucks during a lightning storm" with three.

The one case where several conversions are safe is converting from the original G.711 64kbps format down to almost any 8kbps or 6.3kbps format and back to 64kbps. The reason that mobile-to-mobile calls can work better than that is that most of them use the same codecs, so they can usually avoid conversion if you're going from one GSM phone to another. I don't know if this works when connecting a GSM carrier to a CDMA carrier or not. Also, of course, mobile phones usually have lousy little microphones and tinny little speakers, so much of the audio damage is done at the ends rather than by the codec itself, especially if you're using it in traffic.

--

Bill Stewart
New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks

Speex Wideband Codec by billstewart · 2006-07-17 14:30 · Score: 1

Newsforge has no technical information, and Freeswitch is largely Slashdotted, but there's one sentence that says that they're using the Speex Wideband Codec as their 16kbps codec. One reason Speex is using 16khz sampling is because it's relatively available on PC sound cards, but another reason is that they do a cute sub-band coding technique - instead of representing the 8kHz analog waveform by directly encoding the 16k samples/second, they split the information into two bands - 0-4kHz which they encode using the same encoding they use for their 8kbps codec, and 4-8kHz which they encode (somewhat differently, for complex technical reasons :-) to provide additional depth for receivers that support the wideband format. So if you've got a wideband codec encoding the speech at 16kbps, you can play it on an 8kbps player if that's all you've got. For a live conversation between two software-phones, that's not particularly useful (except for a bit of code reuse), but if you're playing recorded files or setting up a multipoint conference between some 8kbps phones and some 16kbps phones, it's easy to send each phone what it wants.

--

Bill Stewart
New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks

It's for the consonants, not the vowels by tepples · 2006-07-18 04:22 · Score: 1

This is a recording of a woman (more or less) singing a scale.

Of course vowels aren't going to have a lot of content in the upper frequencies. Now try saying "This is the eighth utterance" into a microphone and see what doesn't happen. I did it myself, using a crossover at 4 kHz to split the signal into low-pass left and high-pass right channels. Listen to the Ogg Vorbis file and play with the balance. Notice how the phoneme /s/ comes through three times clearer when you have both speakers on (8 kHz bandwidth) vs. just the left speaker (4 kHz bandwidth).

poor choice of demo by tartley · 2006-07-18 22:38 · Score: 1

That's a bit of a retarded demo for the technology: every techie's instincts are screaming "why not just transmit the RSS and convert to speech at the client?"

Re:High-Def Telephony with Open Source Soft-Switch by evilviper · 2006-07-19 06:12 · Score: 1

Congratulations on being the guy who completely missed the point. Perhaps next time you'll try reading my entire post before replying.

"Definition" is a video term, it has NO application at all to audio. It makes no sense.

--
Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant

116 comments