Xiph Episode 2: Digital Show & Tell

← Back to Stories (view on slashdot.org)

Xiph Episode 2: Digital Show & Tell

Posted by Unknown on Wednesday February 27, 2013 @06:19AM from the very-complicated-beeps dept.

An anonymous reader writes "Continuing a firehose tradition of maximum information density, Xiph.Org's second video on digital media explores multiple facets of digital audio signals and how they really behave in the real world. Demonstrations of sampling, quantization, bit-depth, and dither explore digital audio behavior on real audio equipment using both modern digital analysis and vintage analog bench equipment... just in case we can't trust those newfangled digital gizmos. You can also download the source code for each demo and try it all for yourself!" Plus you get to look at Monty's beard and hear his soothing voice. There's a handy wiki page with further information and a summary of the video if text is your thing.

34 of 50 comments (clear)

Min score:

Reason:

Sort:

Re:soothing? verarschst Du? by Neuroelectronic · 2013-02-27 07:26 · Score: 1, Informative

>transient signals behave differently from the periodic signals he tests in the video
He addresses this. No signal in our rate-limited sample exists in isolation. A transient blip or noise is not detected if it qoesn't fall in a point that is measured, yes, but a real signal does not exist on a single point. An audio codec is at another level above the hardware and doesn't change the fundemental physics of digital-analog implementations that havn't changed much in the last 10-15 years, and are developed with signal theory in mind.
I jus happened to have watched part 1 last week by Mister+Liberty · 2013-02-27 07:36 · Score: 4, Informative

Found it very informative to a non-guru.
Aside from that, the video and its audio, and I'm not kidding here,
were very pleasant and sympathetic to the ears and eyes.
Re:soothing? verarschst Du? by cheater512 · 2013-02-27 07:39 · Score: 4, Insightful

If a 100Mhz analog scope can't detect the stair steps, then there is absolutely no fucking way you have the slightest chance of detecting it.
100,000,000 Hz vs your ears at maybe 20,000 Hz.
See the difference?
Re:soothing? verarschst Du? by Anonymous Coward · 2013-02-27 07:43 · Score: 5, Informative

He has actual hardware there, as he explains quite old consumer grade hardware, which does the conversion from analog to digital to analog, and the result is still for all intents and purposes PERFECT. Yes, the delivery is smug, but rightly so. Talking to "audiophiles" is like talking to people who believe in homeopathy: It is extremely difficult to not just make fun of the fools. When you instead manage to deliver an explanation and a demo that clear, you get to be smug. (Captcha: mockery)
Re:soothing? verarschst Du? by Aguazul2 · 2013-02-27 08:06 · Score: 5, Informative

He knows what he is talking about, he explains things clearly, he is not condescending to the viewer -- I think the apparent smugness is not for real, or maybe it is just how his personality comes over. And if you still don't understand how there is no stair-step, you need to watch the video again! Even though I've done loads of DSP, the nice demos he gives really illustrate well what he is saying, and who can argue with pure-analogue gear proving the point -- not just theory and hand waving, but real experimental evidence. Really nice work.
Re:soothing? verarschst Du? by frank_adrian314159 · 2013-02-27 08:15 · Score: 3, Funny

See the difference?
But, but, but... I'm an audiophile, dammit! I listen with my soul. That's why I can hear it!

--
That is all.
This is good by ceoyoyo · 2013-02-27 08:19 · Score: 3, Informative

This guy knows what he's talking about, and communicates it well. Amateur audiophiles should especially read his article here: http://people.xiph.org/~xiphmont/demo/neil-young.html.
full props to Monty by nadaou · 2013-02-27 08:31 · Score: 5, Informative

... for all the bullshit Blackboard technology mess, videotaped classroom lectures, and .edu buzzwords, this sort of thing is exactly how open education should be done.
congrats Monty, once again you've done well.

--
~.~
I'm a peripheral visionary.
Re:soothing? verarschst Du? by Anonymous Coward · 2013-02-27 09:05 · Score: 2, Informative

Never underestimate the power of self-delusion. Placebo effects have been accepted by science as being very real. The fact that someone, whether a believer in homeopathy, audiophile-quality, sugar pills or whatever scientifically-unsubstantiated nutjobery, actually believes in it is beneficial to them. Trying to take away their self-delusions is just plain mean. I really wish I could convince myself that what amounts to water is as effective as real medicine...I might be a healthier person.
Re:Using real world audio waveforms? by jmv · 2013-02-27 09:12 · Score: 2

Could you please post the specs of the reel to real equipment you're using. To beat a 16-bit digital system, it has to have better than 96 dB SNR and dynamic range. I don't remember having ever seen that.

--
Opus: the Swiss army knife of audio codec
Re:Using real world audio waveforms? by Anonymous Coward · 2013-02-27 09:20 · Score: 1

you're missing the point.. whether it's a pure sine wave, or a combination of sines (all real number samples up to nyquist freq), nothing is lost.
Re:soothing? verarschst Du? by epyT-R · 2013-02-27 09:23 · Score: 1

This ignores the fact that perception doesn't define reality. Stripping delusion away might be 'mean', but sometimes it's necessary. History is rife with examples of delusion driving whole societies over the edge.
Re:Using real world audio waveforms? by Anonymous Coward · 2013-02-27 09:26 · Score: 1

What he should be using is a true audio wave which requires so much more resolution to recreate accurately.
With 44.1kHz, 16bit audio we already have all the data to reconstruct a signal band limited to the range of human hearing. Effectively then you're calling nyquist bullshit without even understanding it.

I've yet to come across a 16bit digital system being able to beat reel to reel, let along anything lesser.
Only via compounding / NR and those solutions are not broadband. How could you use tape and not know this?
Re:Using real world audio waveforms? by ceoyoyo · 2013-02-27 09:37 · Score: 4, Insightful

He says several times that he used sine waves as a simple illustration. Then he switches to square waves. You apparently don't understand sampling theory well enough to understand why your second sentence, in the context of PCM audio, is incorrect. Perhaps reading this will help: http://people.xiph.org/~xiphmont/demo/neil-young.html.
On the other hand, you might just be an AC troll, an "audiophile" or an old enthusiast or sound engineer who might have been an excellent technician but never developed a proper understanding of signals. In any case, anybody tempted to agree with your post should read the article at that link.
Re:soothing? verarschst Du? by ceoyoyo · 2013-02-27 09:48 · Score: 1

No, stair steps don't exist in the music you hear (at least not beyond trivially small ones). You could potentially make an ADC that did produce stair step samples but it would be a stupid thing to do. As he also mentioned, in some cases ADCs to produce stair step samples, but that's an intermediate step in the conversion and is not what is output.
If you want to get really pedantic, any real ADC does take a certain amount of time to complete a sample so the sample does have some finite extent, but it's very, very small, isn't uniformly distributed and is far outside the band limit anyway.
I suppose you could construct a codec in which an in-band transient might be treated differently than a periodic signal but that has very little to do with what actually happens in real codecs that people do, or might, use.
Maybe you think he's smug because he knows what he's talking about and it disagrees with what you think you know?
Re:soothing? verarschst Du? by ceoyoyo · 2013-02-27 10:17 · Score: 1

Stripping away someone's self delusions isn't mean. It prevents him from being an easy mark for snake oil salesmen, whether those are homeopaths or Best Buy employees. In medicine the placebo effect is great, until you get something it doesn't work on and you die because you didn't get real treatment. In stereos it just means you get separated from your money, over and over and over again.
Re:Using real world audio waveforms? by Entropius · 2013-02-27 10:44 · Score: 1

So long as all of the steps involved are linear, does it matter?
If a linear signal processing procedure behaves a certain way for all sine wave signals in a frequency range, then it will behave in the same way for any sum of sine waves in that range as well.
Great lesson, but what's with the audio? by wonkey_monkey · 2013-02-27 11:48 · Score: 2

For a video all about audio, why does the guy's voice keep flapping around from left speaker to right speaker? I found it pretty distracting. Next time, try a clip-on mic and mix it down to mono unless it's necessary to make a point.

--
systemd is Roko's Basilisk.
1. Re:Great lesson, but what's with the audio? by xiphmont8352 · 2013-02-28 05:41 · Score: 2
  
  I intentionally avoid using lavalier mics. Their amplitude and timbre are all over the place depending on the direction you're looking, they pick up clothing noise, and you're either tethered to a wire or have to deal with the complexity and limitations of a wireless system.
  Headsets work better overall, but are highly visible and still add a layer of complexity. They also sound like someone talking directly into your ear. Even with additional postprod [going all the way to room modeling], I've never managed to make a headset sound totally natural. If others have, I'd appreciate some tips.
  In any case, as someone who spent ten years dealing with lavs and headsets in live performance as a sound engineer, I avoid them in my own time as much as humanly possible :-)
  The stereo image was intentional, it's a trick for removing perceived echo/reverb by spreading it out across a stereo image instead of it all piling up right behind the voice in mono. That said, the image was wider than I'd have liked. That was a result of mic placement and angle, another tradeoff to avoid wide amplitude changes as I moved around.
  FWIW, several other video producers wrote me to ask how I got such great sound without a lav, and (like you) others wrote to mention that they found the very wide stereo distracting. I'll tinker with it more in the next vid.
2. Re:Great lesson, but what's with the audio? by wonkey_monkey · 2013-02-28 06:36 · Score: 1
  
  FWIW, several other video producers wrote me to ask how I got such great sound without a lav
  Aside from the stereo thing, it did sound very clear.
  
  The stereo image was intentional, it's a trick for removing perceived echo/reverb by spreading it out across a stereo image instead of it all piling up right behind the voice in mono.
  You just can't stop teaching people stuff, can you? ;)
  
  --
  systemd is Roko's Basilisk.
3. Re:Great lesson, but what's with the audio? by xiphmont · 2013-02-28 07:24 · Score: 1
  
  I think it's more "I want people to know why I do the stupid things I do." Latent fear of being committed.
4. Re:Great lesson, but what's with the audio? by iluvcapra · 2013-02-28 11:18 · Score: 1
  
  The stereo image was intentional, it's a trick for removing perceived echo/reverb by spreading it out across a stereo image instead of it all piling up right behind the voice in mono.
  o_0
  
  FWIW, several other video producers wrote me to ask how I got such great sound without a lav, and (like you) others wrote to mention that they found the very wide stereo distracting. I'll tinker with it more in the next vid.
  
  Nobody records dialogue in X-Y stereo professionally, and recording dialogue in any kind of stereo field is exceedingly rare -- we pan it if we want to position it, but generally speaking it's distracting and breaks convention unless you're trying to emphasize two sources in L-R space. Tom Holman was fond of telling us in class that the only major American feature film to ever shoot stereo dialogue was Bette Midler's The Rose, and the experiment died a quick death in dailies. (That crew did win an Oscar though.)
  If you insist on recording in stereo though, you might do as they did, and record with a Mid-Side array and use a matrix to decode back to L-R, so you can control the stereo spread in post-production.
  
  --
  Don't blame me, I voted for Baltar.
5. Re:Great lesson, but what's with the audio? by xiphmont · 2013-03-02 17:48 · Score: 1
  
  >If you insist on recording in stereo though, you might do as they did, and record with a Mid-Side array and use a matrix to decode back to L-R, so you can control the stereo spread in post-production.
  That would not have controlled the reverb; the space this was recorded in was a concrete floor with concrete walls and no acoustic treatment. Like I said, it was a tradeoff, and one that was successful if not perfect.
Re:soothing? verarschst Du? by iluvcapra · 2013-02-27 14:58 · Score: 1

Yes, sampling *theory* means there are no stair steps, but codecs are real devices.
You're absolutely right, and real devices are more or less prone to some distortion when converting digital signals.
However, this must be determined empirically, and is beyond the scope of a discussion of digital signals. As a rule, the DSP developer gets his algorithm right, and getting those Signed Ints to sound good is strictly the client's responsibility.

--
Don't blame me, I voted for Baltar.
Re:Using real world audio waveforms? by iluvcapra · 2013-02-27 14:59 · Score: 3, Informative

While Monty's presentation is excellent, what we have to remember is we are not listening to pure sine waves.
We listen to nothing more than sums of pure sine waves.

--
Don't blame me, I voted for Baltar.
Re:Using real world audio waveforms? by iluvcapra · 2013-02-27 15:04 · Score: 1

To beat a 16-bit digital system, it has to have better than 96 dB SNR and dynamic range
Not absolutely. You get dithering losses and quantization distortion, so a 16 bit system usually has about 14 ENB thus 84 db dynamic range. SNR is the range from the floor to the reference fluxivity, which per the AES spec would be -18 dbFS or -66 db. I have worked with 24 track machines (a Studer A827 in this case), at 15 ips and with Dolby SR, with a fresh, degaussed roll of Quantegy, that could do 70 decibels of SNR on the first generation.

--
Don't blame me, I voted for Baltar.
Re:soothing? verarschst Du? by wiredlogic · 2013-02-27 16:21 · Score: 2

SACDs are encoded with a 1-bit sigma-delta pulses. Those would be some pretty whopping staircases if sampling theory didn't work to eliminate them.

--
I am becoming gerund, destroyer of verbs.
The value of science by xiphmont8352 · 2013-02-28 05:17 · Score: 1

On that topic, while I was filming the epilogue I started feeling ill and ignored it for a while, but eventually went to the emergency room. Three hours later I had an emergency appendectomy. Medical science also rocks. If I'd gone to a faith healer, I'd be dead. (FTR, the epilogue footage in the final vid was from three days after the appendectomy)
It sure is! by xiphmont8352 · 2013-02-28 05:44 · Score: 1

Extensive testing reveals that no matter the wailing and gnashing of teeth claiming digital sucks, no one can actually tell the difference in a double-blind test.
Theory is indeed useless when reality doesn't agree. Every real scientist knows this. An engineer's job depends on it. I invite audiophiles to add that bit of wisdom to their own thinking.
Re:Using real world audio waveforms? by xiphmont · 2013-02-28 06:25 · Score: 1

As a nitpick, you get dithering losses _or_ quantization distortion, or a linear tradeoff between the two. You don't get the worst case of both on top of each other unless you screw up.
Without dither, worst case, all your 16 bit quant distortion products will be under -100dB regardless of input amplitude. I actually display the worst case in the video to make it easy to see. Quantization distortion aliases, and I chose an integer sample period so the aliased distortion would always land in the same bins after folding. If I hadn't, it would have spread out more and been even lower. If I had chosen a relatively prime frequency, the quantization distortion would have spread out across all bins equally.
Re:Using real world audio waveforms? by iluvcapra · 2013-02-28 07:52 · Score: 1

As a nitpick, you get dithering losses _or_ quantization distortion, or a linear tradeoff between the two.

As you point out though, this depends a lot on the selection of dither spectrum and the dither's probability density function, and these are much more pokey issues and depend on subjective analysis of the signal target -- noise shaped dither is great if you're mastering for delivery but can screw you if your recipient isn't an ISO 226 listener, and expects to be able do pitch shifting or folding with a ring modulator.
Counter nitpick: Monty, as a professional motion picture sound designer, I cannot tell you how distracting it is to hear your voice constantly changing its pan across the stereo field :)

--
Don't blame me, I voted for Baltar.
Re:Using real world audio waveforms? by xiphmont · 2013-02-28 08:02 · Score: 1

Right, and this is why dither is only applied to 'last-mile' audio intended to be consumed. Dither 'screws' you in other ways if you intend to use that audio in production, such as losing all the property of removing the distortion, yet still having the additive noise. But we're still talking about changes 100+dB down.
>Counter nitpick: Monty, as a professional motion picture sound designer, I cannot tell you how distracting it is to hear your voice constantly changing its pan across the stereo field :)
The audio was recorded with a stereo pair. It wasn't panned artificially :-) Look down a few comments for more about this, you weren't the only person to complain.
Re:Using real world audio waveforms? by xiphmont · 2013-02-28 08:06 · Score: 1

Oh! And 'linear' was completely wrong. I don't know how I braino-ed that in there, the [at very least perceived] distortion/noise tradeoff is not linear.
Re:Using real world audio waveforms? by Sigg3.net · 2013-02-28 23:36 · Score: 1

...what we have to remember is we are not listening to pure sine waves.
We listen to nothing more than sums of pure sine waves.
Well I, for one, listen to the radio.

--
Defining Statistics and Social Research