Xiph Episode 2: Digital Show & Tell
An anonymous reader writes "Continuing a firehose tradition of maximum information density, Xiph.Org's second video on digital media explores multiple facets of digital audio signals and how they really behave in the real world. Demonstrations of sampling, quantization, bit-depth, and dither explore digital audio behavior on real audio equipment using both modern digital analysis and vintage analog bench equipment... just in case we can't trust those newfangled digital gizmos. You can also download the source code for each demo and try it all for yourself!"
Plus you get to look at Monty's beard and hear his soothing voice. There's a handy wiki page with further information and a summary of the video if text is your thing.
Soothing, really? I am keenly curious about what he has to say, but I find his delivery smug.
Also, he doesn't address what really happens in the hardware of the audio codecs. Yes, sampling *theory* means there are no stair steps, but codecs are real devices. They run on discrete clocks and depending on the particular codec implementation, there are layers of non-idealities. So, yes, the stair steps do exist in the music you hear. And no, you most likely can not perceive it. I'm honestly not sure if that hardware he uses could detect it. Probably for certain specific signals. Speaking of certain signals, for certain codec architectures, transient signals behave differently from the periodic signals he tests in the video. No, that does not mean he's wrong about transients having to line up with a sampling boundary in order to be properly sampled.
No matter how confident he seems to you, take your grain of salt and caveat emptor. Or as Wu-Tang Clan would put it: "Ain't a damn thing changed boy, protect ya neck."
>transient signals behave differently from the periodic signals he tests in the video
He addresses this. No signal in our rate-limited sample exists in isolation. A transient blip or noise is not detected if it doesn't fall on a point that is measured, yes, but a real signal does not exist on a single point. An audio codec is at another level above the hardware and doesn't change the fundamental physics of digital-analog implementations, which haven't changed much in the last 10-15 years and are developed with signal theory in mind.
Found it very informative to a non-guru.
Aside from that, the video and its audio, and I'm not kidding here, were very pleasant and sympathetic to the ears and eyes.
If a 100 MHz analog scope can't detect the stair steps, then there is absolutely no fucking way you have the slightest chance of detecting them.
100,000,000 Hz vs your ears at maybe 20,000 Hz.
See the difference?
He has actual hardware there, as he explains quite old consumer grade hardware, which does the conversion from analog to digital to analog, and the result is still for all intents and purposes PERFECT. Yes, the delivery is smug, but rightly so. Talking to "audiophiles" is like talking to people who believe in homeopathy: It is extremely difficult to not just make fun of the fools. When you instead manage to deliver an explanation and a demo that clear, you get to be smug. (Captcha: mockery)
If something is barely perceptible in a good way, that's what 'soothing' means.
He knows what he is talking about, he explains things clearly, he is not condescending to the viewer -- I think the apparent smugness is not for real, or maybe it is just how his personality comes over. And if you still don't understand how there is no stair-step, you need to watch the video again! Even though I've done loads of DSP, the nice demos he gives really illustrate well what he is saying, and who can argue with pure-analogue gear proving the point -- not just theory and hand waving, but real experimental evidence. Really nice work.
>See the difference?
But, but, but... I'm an audiophile, dammit! I listen with my soul. That's why I can hear it!
That is all.
This guy knows what he's talking about, and communicates it well. Amateur audiophiles should especially read his article here: http://people.xiph.org/~xiphmont/demo/neil-young.html.
... for all the bullshit Blackboard technology mess, videotaped classroom lectures, and .edu buzzwords, this sort of thing is exactly how open education should be done.
congrats Monty, once again you've done well.
~.~
I'm a peripheral visionary.
While Monty's presentation is excellent, what we have to remember is that we are not listening to pure sine waves. What he should be using is a true audio wave, which requires so much more resolution to recreate accurately. When he says he can match reel to reel in x number of bits, I was surprised. He doesn't say what tape speed and recording source he was using. Having done both reel and digital recordings for two decades, I've yet to come across a 16-bit digital system able to beat reel to reel, let alone anything lesser.
Never underestimate the power of self-delusion. Placebo effects have been accepted by science as being very real. The fact that someone, whether a believer in homeopathy, audiophile-quality, sugar pills or whatever scientifically-unsubstantiated nutjobery, actually believes in it is beneficial to them. Trying to take away their self-delusions is just plain mean. I really wish I could convince myself that what amounts to water is as effective as real medicine...I might be a healthier person.
This ignores the fact that perception doesn't define reality. Stripping delusion away might be 'mean', but sometimes it's necessary. History is rife with examples of delusion driving whole societies over the edge.
No, stair steps don't exist in the music you hear (at least not beyond trivially small ones). You could potentially make an ADC that did produce stair step samples but it would be a stupid thing to do. As he also mentioned, in some cases ADCs do produce stair step samples, but that's an intermediate step in the conversion and is not what is output.
If you want to get really pedantic, any real ADC does take a certain amount of time to complete a sample so the sample does have some finite extent, but it's very, very small, isn't uniformly distributed and is far outside the band limit anyway.
I suppose you could construct a codec in which an in-band transient might be treated differently than a periodic signal but that has very little to do with what actually happens in real codecs that people do, or might, use.
Maybe you think he's smug because he knows what he's talking about and it disagrees with what you think you know?
Stripping away someone's self delusions isn't mean. It prevents him from being an easy mark for snake oil salesmen, whether those are homeopaths or Best Buy employees. In medicine the placebo effect is great, until you get something it doesn't work on and you die because you didn't get real treatment. In stereos it just means you get separated from your money, over and over and over again.
You deleted the part where I said "for certain codec architectures,"
He doesn't tell you what codec architecture is inside the audio interface he uses.
However, for certain types, such as sigma-delta, there are errors which are worse on sharp transients at high-frequencies than on steady-state signals.
Sure, you don't let someone treat their cancer with homeopathy, but why stop them from treating an upset stomach, migraines or some other such non-fatal malady? We take medicine for a lot of problems and very few of those problems can be fatal.
For a video all about audio, why does the guy's voice keep flapping around from left speaker to right speaker? I found it pretty distracting. Next time, try a clip-on mic and mix it down to mono unless it's necessary to make a point.
systemd is Roko's Basilisk.
There are no stair steps, as the analog filter after the DAC removes them.
And yes, I am talking about modern oversampling delta sigma converters here. There is still an analog filter, but it's a very gentle slope at ultrasonic frequencies.
>Also, he doesn't address what really happens in the hardware of the audio codecs. Yes, sampling *theory* means there are no stair steps, but codecs are real devices. They run on discrete clocks and depending on the particular codec implementation, there are layers of non-idealities. So, yes, the stair steps do exist in the music you hear.
No, they don't. Nature abhors stairsteps. A true step function (infinitely fast rise/fall time) expressed in a physical medium would require an infinite series of large-amplitude harmonics of the fundamental (square-wave) frequency. There is no way to force air to do this.
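To put rough numbers on the harmonics argument, here is a minimal sketch using the textbook Fourier series of a unit square wave (odd harmonics with amplitude 4/(pi*k)). The function names, the 1 kHz fundamental and the 20 kHz cutoff are assumptions picked for illustration, not anything from the video.

```python
import math

def harmonics_kept(fundamental_hz, max_hz):
    """Odd harmonic numbers of the square wave that lie at or below max_hz."""
    return list(range(1, int(max_hz // fundamental_hz) + 1, 2))

def square_partial_sum(t, fundamental_hz, max_hz):
    """Band-limited square wave: Fourier series truncated at max_hz."""
    return sum(
        (4.0 / math.pi) * math.sin(2.0 * math.pi * k * fundamental_hz * t) / k
        for k in harmonics_kept(fundamental_hz, max_hz)
    )

def max_slope(fundamental_hz, max_hz):
    """Slope at the zero crossing t = 0, in amplitude units per second.

    Each kept harmonic contributes (4/pi) * (2*pi*k*f) / k = 8*f,
    so the slope grows with the number of harmonics but stays finite.
    """
    return 8.0 * fundamental_hz * len(harmonics_kept(fundamental_hz, max_hz))
```

With a 1 kHz fundamental and a 20 kHz cutoff, only ten harmonics survive (1 kHz through 19 kHz), so the edge has a finite slope. An infinitely steep step would need the list of harmonics to go on forever.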
"Okay", you say, "maybe there's still a crude approximation of a step function, with higher frequency harmonics naturally attenuated away." But it turns out not even that is true. An essential part of every complete DAC is a "brick wall" analog filter placed immediately after the raw digital-to-analog conversion. Ideally, this filter removes all frequency components higher than 1/2 the sampling frequency. This filtering means you won't find a single one of the harmonics required to approximate sharp stairsteps between samples. They'll all have been smoothed away by the filter.
This is also known as a "reconstruction filter". It's an essential part of any complete sampling system, as described by the Shannon-Nyquist Sampling Theorem, which is literally a mathematical proof that it's possible to reconstruct any continuous waveform from discrete samples, within certain limits (the big one: you can only reproduce frequency components less than 1/2 the sampling frequency, and in fact you must filter out anything higher before sampling in the first place).
The final output of a system as described by the theory is not a discrete series of points. Think of it as being more like a mathematical curve-fit through those points. In real systems, we can only approximate the ideal mathematical constructs described by the Sampling Theorem, and the reconstruction filter is what approximates the curve-fitting. Though it's hard to design theoretically ideal reconstruction filters, the engineers who design audio DACs have refined them to the point where they're damn close to ideal over audible frequency ranges. (This is no new development, either. Nigh-perfect audio DACs have been cheap, mass market parts for about 20 years. Just add competent system design.) So you will not see stairsteps in the output waveform.
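The "curve-fit through the points" idea can be sketched with the ideal interpolation formula from the sampling theorem (Whittaker-Shannon, a sum of sinc pulses). This is an illustration under assumptions: the sum is truncated to a finite window, the 10 kHz test tone and all names are made up for the example, and a real DAC approximates this with an analog or oversampled filter rather than computing it directly.

```python
import math

SAMPLE_RATE = 44_100.0  # CD sample rate in Hz

def sinc(x):
    """Normalised sinc: sin(pi*x) / (pi*x), with sinc(0) = 1."""
    return 1.0 if x == 0.0 else math.sin(math.pi * x) / (math.pi * x)

def sample(freq_hz, n):
    """Value of a unit sine of freq_hz at sample index n."""
    return math.sin(2.0 * math.pi * freq_hz * n / SAMPLE_RATE)

def reconstruct(freq_hz, t, half_window=2000):
    """Sinc interpolation at time t (seconds), truncated to +/- half_window samples."""
    position = t * SAMPLE_RATE
    centre = round(position)
    return sum(
        sample(freq_hz, n) * sinc(position - n)
        for n in range(centre - half_window, centre + half_window + 1)
    )

def worst_errors(freq_hz, points=20, offset=0.37):
    """Probe between sample instants; return (sinc error, stair-step error)."""
    worst_sinc = worst_hold = 0.0
    for i in range(points):
        t = (i + offset) / SAMPLE_RATE
        truth = math.sin(2.0 * math.pi * freq_hz * t)
        worst_sinc = max(worst_sinc, abs(reconstruct(freq_hz, t) - truth))
        # A stair-step (zero-order hold) would just repeat the last sample.
        worst_hold = max(worst_hold, abs(sample(freq_hz, i) - truth))
    return worst_sinc, worst_hold
```

Calling `worst_errors(10_000.0)` compares the two pictures for a 10 kHz tone: the sinc reconstruction lands very close to the original sine between samples (limited here only by the truncated window), while holding each sample as a stair step misses by a large fraction of full scale.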
>No matter how confident he seems to you, take your grain of salt and caveat emptor.
Advice people would do well to apply to your words. You're trying to project an image of confidence to prop up an argument which can only be made from ignorance. IMO, the reason you "find his delivery smug" (and so forth) is that he's challenging your presumptions, and you don't like it.
>Sure, you don't let someone treat their cancer with homeopathy, but why stop them from treating an upset stomach, migraines or some other such non-fatal malady? We take medicine for a lot of problems and very few of those problems can be fatal.
Because the people who promote things like homeopathy reliably over-sell their product's efficacy, and thereby discourage people who need real medical attention from getting it.
Because it mightn't be "upset stomach" or a "migraine", it might be internal bleeding or brain cancer, for example.
You're absolutely right, and real devices are more or less prone to some distortion when converting digital signals.
However, this must be determined empirically, and is beyond the scope of a discussion of digital signals. As a rule, the DSP developer gets his algorithm right, and getting those Signed Ints to sound good is strictly the client's responsibility.
Don't blame me, I voted for Baltar.
The "lollipop" as opposed to "stairstep" representation of a digital soundwave really clarifies things.
SACDs are encoded as 1-bit sigma-delta pulses. Those would be some pretty whopping staircases if sampling theory didn't work to eliminate them.
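For anyone curious how a 1-bit stream can carry a smooth tone, here is a minimal sketch of a first-order sigma-delta modulator followed by a plain moving average. Everything here is an assumption for illustration: real converters typically use higher-order loops and proper low-pass filters, the moving average is only a crude stand-in, and the 2.8224 MHz rate is simply 64 times 44.1 kHz.

```python
import math

def sigma_delta(samples):
    """First-order modulator: input in [-1, 1], output a stream of +1.0 / -1.0."""
    bits = []
    integrator = 0.0
    feedback = 0.0  # previous output bit, fed back into the loop
    for x in samples:
        integrator += x - feedback  # accumulate the error so far
        feedback = 1.0 if integrator >= 0.0 else -1.0
        bits.append(feedback)
    return bits

def moving_average(values, window):
    """Crude low-pass filter: mean of the most recent `window` values."""
    out = []
    running = 0.0
    for i, v in enumerate(values):
        running += v
        if i >= window:
            running -= values[i - window]
        if i >= window - 1:
            out.append(running / window)
    return out

def demo(rate_hz=2_822_400, tone_hz=1000.0, amplitude=0.5, window=256):
    """Largest gap between the filtered bitstream and the equally filtered tone."""
    n = int(rate_hz / tone_hz)  # roughly one period of the tone
    tone = [amplitude * math.sin(2.0 * math.pi * tone_hz * k / rate_hz)
            for k in range(n)]
    smooth_bits = moving_average(sigma_delta(tone), window)
    smooth_tone = moving_average(tone, window)
    return max(abs(a - b) for a, b in zip(smooth_bits, smooth_tone))
```

The raw bitstream only ever takes the values +1 and -1, yet after even this simple averaging it tracks the averaged input tone closely, because the integrator keeps the running error bounded. The "staircase" lives entirely above the audio band and is what the filter removes.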
I am becoming gerund, destroyer of verbs.
>You could potentially make an ADC that did produce stair step samples but it would be a stupid thing to do.
That's called a NOS (non-oversampling) DAC and sold for substantial amounts of money to audiophools who have been convinced that "filters are evil".
There's no talking to trolls because those who have heard master tape know what I am saying here. 16/44.1 is definitely not enough to reconstruct a proper analogue signal. The manufacturers themselves knew this, and that standard was a bare minimum. However much one might try to justify this with numbers and figures is irrelevant. The manufacturers themselves knew it, and that's why there was so much work in trying to improve on that standard.
On that topic, while I was filming the epilogue I started feeling ill and ignored it for a while, but eventually went to the emergency room. Three hours later I had an emergency appendectomy. Medical science also rocks. If I'd gone to a faith healer, I'd be dead. (FTR, the epilogue footage in the final vid was from three days after the appendectomy)
Extensive testing reveals that no matter the wailing and gnashing of teeth claiming digital sucks, no one can actually tell the difference in a double-blind test.
Theory is indeed useless when reality doesn't agree. Every real scientist knows this. An engineer's job depends on it. I invite audiophiles to add that bit of wisdom to their own thinking.