Dolby's TrueHD 96K Upsampling To Improve Sound On Blu-Rays
Stowie101 writes in with a story about your Blu-ray audio getting better. "The audio on most Blu-ray discs is sampled at 48kHz. Even the original movie tracks are usually only recorded at 48kHz, so once a movie migrates to disc, there isn't much that can be done. Dolby's new system upsamples that audio signal to 96kHz at the master stage prior to the Dolby TrueHD encoding, so you get lossless audio with fewer digital artifacts. The 'fewer digital artifacts' part comes from a feature of Dolby's upsampling process called de-apodizing, which corrects a prevalent digital artifact known as pre-ringing. Pre-ringing is often introduced in the capture and creation process and adds a digital harshness to the audio. The apodizing filter masks the effect of pre-ringing by placing it behind the source tone — the listener can't hear the pre-ringing because it's behind the more prevalent original signal."
...worthless shit.
44.1khz audio is already transparent to the human ear. Blind studies have been done where a 16 bit 44.1khz ADC-DAC pair was inserted into a high resolution analog audio source. No significant difference was observed.
Don't waste money on the placebo effect.
Give me Classic Slashdot or give me death!
Dumb, dumb, dumb. An ideal sample rate upconversion results in something that *is* identical to the source. Mathematically. It's like re-encoding a 64kbps MP3 to 192kbps. If anything you are going to *lose* quality due to inherent errors in the process.
Make me a friend and I'll mod you up
May I be the first to say this- fuck Bluray, and fuck Cinavia.
I used to buy Bluray disks. Hell, I own a whole shelf full of them (about 80 titles in total). Every single one eventually got ripped to my NAS in two formats- a relatively lossless MKV file containing the original video and audio streams (up to DTS-HD MA), and a lossy x264 version for playing on crappy devices like the PS3 or 360.
Then Cinavia rolled around, which did two things:
1) It purposefully corrupts the audio stream in an attempt to encode digital information into it (go read their patents- the harder you try to pry Cinavia into an audio stream, the more damage is done to the original quality)
2) It prevented me from playing my legally purchased and legally ripped (it's legal in my country to rip disks and things you BUY) disks off my NAS on my PS3
What pisses me off the most though is that Sony is pushing Cinavia on everyone as hard as they can. AFAIK all new BR players need to be equipped with it, and most of the new BR disks are supposed to have it as well. And they're still advertising the disks as "Lossless", when in fact the audio is NOT lossless- it's lossy, the degradation of which is brought about solely by Cinavia's presence.
Before anyone yells [citation needed] at me, here's your proof straight from the Wikipedia page (http://en.wikipedia.org/wiki/Cinavia):
"Cinavia's in-band signaling introduces intentional spread spectrum phase distortion in the frequency domain of each individual audio channel separately, giving a per-channel digital signal that can yield up to 20 kilobits per second—depending on the quantization level available, and the desired trade-off between the required robustness and acceptable levels of psychoacoustic visibility. It is intended to survive analogue distortions such as the wow and flutter and amplitude modulation from magnetic tape sound recording. On playback no additional audio filters are used to cover up the distortions and discontinuities introduced."
So there you have it. Lossless is no longer lossless, because Sony insists on using this stupid fucking DRM on their stupid fucking format (as usual). Dolby's new gimmicky technology might claim to give you better lossless audio, but none of that matters the moment they drive Cinavia into the stream.
-AC
You blinded me with science!
All jokes aside, few if any people can hear the difference between 44.1, 48, or 96khz sample rates. Under the Nyquist limit (half the given sample rate) all are equally precise in recording (an hence rendering the sound). What a higher sample rate does do is make for simpler ADC/DAC chips that sound good, at the expense of more bits. And it allows audio manipulating software (plugins and such) more accurate (but good software design can engineer around that). So aside from mixing and mastering (a little), it makes no difference at all to the ear in the final mixed down track.
Silence is a state of mime.
make a system that amplifies dialog to the same level as everyfucking thing else in the movie so I dont have to constantly fiddle with my remote. Why is it in music we have the loudness wars where all sound is mashed into mindless noise at the peak of volume, but in movies there HAS to be a 100db difference between scenes
Purely a marketing stunt. Audio has been recorded at 44kHz for ages now because a signal sampled at that rate can be accurately reconstructed up to 22kHz (Nyquist theorem). Human hearing peaks out around 20kHz at best. Even 22 kHz is a pretty lenient upper bound, most of us will not be able to hear frequencies in the upper teens kHz (think of those mosquito ringtones). 96 kHz is severe overkill and nothing more than superfluous data.
It sounds like they're using the extra spectrum to do some processing on the signal, but there's no reason not to do the processing and then just downsample back to 44 kHz for storage/streaming/what have you.
The title is misleading if the actual goal of this is to apply an apodizing filter. I suspect the reason it's called "Advanced 96K Upsampling" is because that's much easier to get people to buy into that than a "Apodizing Filter" sticker.
The article explains how the audible benefit comes from the application of the Meridian apodizing filter, which changes the analog signal reproduced from digital data by reducing the pre-ringing. IIRC the trade-off is that post-ringing increases. The claimed benefit is that since the ringing now occurs after the "real" music of larger amplitude and as a result the ringing is masked or could be considered like an acoustic echo that naturally occurs.
The 96K upsampling is just a side-effect of wanting the extra samples when you are applying the filter.
Here's a decent summary of what is supposed to happen to the analog audio signal as a result of the filter application: Technical analysis of the Meridian Apodizing filter.
That being said, from what I've read over the past few years I think people are kind of mixed on whether or not the filter makes things better, worse, or just different but not better.
Next they'll be saying I need Monster cables to give my audio a truely analog-sounding experience.
do you have a cite for that? I don't believe it.
even home recording is laughed at (technically) if you are not using 24/96. recording at 48k is just absurd. playback at 48k is fine, though; but I'm not at all convinced that million dollar (at least) movies capture audio at 48k.
if that really is true, then people have been ripped off on their blue ray purchases. one of the supposed benefits is 'better sound' and if you still get 48k (and likely 16bit audio too; as its not common to use 48/24 mode) at record time, nothing the BD can do will ever make it better than dvd. yes, dvd uses compression on dolby 5.1 or dts but its compression is actually nearly lossless *compared* to most consumer playback (not a huge S/N dac+preamp+amp) systems.
--
"It is now safe to switch off your computer."
The whole reason why there is any industry push for audio over 44.1KHz is to implement watermarking in the frequency ranges above 20KHz.
Same reason they want 32 and 48 bit color.
Unless it was recorded at 96k then this is about as good as upscaling from a dvd player is!
I don't care what filters they use, upsampling is just a gimmick, best to get 96k at the source.
Psst hey, nyquist called and wanted to ask you, what's the frequency, kenneth? //got nothing ///nothing like how your ear can perceive frequencies above 22k or so nothing. ////+3,000$, so your dog can enjoy a TrueHD experience, too! A bargain!
(really, if anyone wants to enlighten me as to why their technique of de-apoizing /requires/ that sample rate, please, let us know)
CS majors know the time/space tradeoff, but they never get taught the 3rd, crucial, tradeoff of the set: comprehension!
Oh, c'mon !!
This is one thing that simple does NOT make any sense
If the thing was recorded in 48KHz, it's at 48KHz, and no matter how one can "un-sampling" that shit and then re-recording it in 96KHz (even at 96MHz or 96GHz), it does not boost _anything_ !!
Muchas Gracias, Señor Edward Snowden !
That is just (not-so-)simple math. You can perfectly represent any signal with a frequency less than half of your sampling frequency. Audiophiles don't like this, but it doesn't change the fact. The greatest reason for confusion is the 'stepped waveform' graphic often used to explain sampling, which is badly misleading.
40Hz is ample. Anything more is overkill. All you get from 96kHz sampling is ultrasonics that need to be filtered out to prevent distortion from your speakers.
Prediction for end of Universe #42: Fencepost error in Quantum_bogosort.cpp
Upsampling is not going to fix shitty sound engineers and their crappy encodes.
April 1st was a while back... Just saying.
Feel free to mod me down, just know that unlike some Anonymous Cowards I'm not afraid to express my views as myself.
This is pure snake oil.
There is absolutely no point to store the up-sampled audio on disk. It is just a waste of space (and more licensing fees for Dolby)
It is extremely common for output DAC HW to do up-sampling and digital filtering these days. This already removes the ringing without the need for storing the up-sampled data, which is completely pointless. I doubt there is any modern DAC HW that is still using native 44.1/48 and analog filters in the output stage.
So this is total redundant nonsense.
pre-ringing
Really? In an uncompressed audio? And the solution not only involves upsampling as a part of the process but requires the signal to stay upsampled?
My eyes are rolling at 15krpm.
Contrary to the popular belief, there indeed is no God.
Not that this whole thing isn't absurd for the reasons already discussed above, but what no one bloody well seems to understand it that an audio stream is not a godamn bitmap picture. You can't improve audio quality of *audible frequencies* by increasing resolution of the horizontal axis (sampling frequency) beyond a rate which surpasses the Nyquist frequency for human hearing. Assuming a high quality anti-aliasing filter is used and excellent quality recording and playback equipment, audio sampled at 48kHz can be unambiguously represented up to about 24kHz. 96kHz is a waste of bits.
Vertical resolution (# of bits) is the only theoretical way to improve actual audio quality further... and beyond about 16-18 bits, it's also beyond the ability of even the most diehard audiophiles to discern (in properly conducted experiments.)
From a human point of view, me owning all kinds of high end analog sound cards and having both a blueray + dts-hd receiver, I can tell you this... When done right there is definately a difference on good quality speakers + receiver when it's in DTS... partly because it only goes through one DAC on the way out (your receiver) so it's much louder and clearer than analog. But this upconverting stuff... this is only good for crap quality mp3's where you can pretty much predict patters of garbage like that crystal garbled digital sounding effect, I could see something filtering that, or maybe enhancing some higher end frequencies by boosting them a bit. This doesn't belong on bluerays...
Now another thing i noticed... you ever have a pair of headphones on, or watching a movie and you hear a sound that makes you turn around because it sounded very real? i've noticed on dts-hd at home that this happens more frequently... especially with things like knocking on doors. Where as before it only happened very rarely. The last movie that did this to me was valkyre, I had my sound up a good deal and me and the other person almost jumped out of our chairs lol... Then again I also have 2 sub-woofers, one sealed 12' for real lows and one ported 10' both properly calibrated to match my main speakers, 2 full range sony towers, and 4 large sattelites (also with pretty wide frequency range) for the rest of the surround (7.1), and one pretty big center speaker with 3 cones.
With the new "wait 10 seconds while we educate you" after waiting for 60 seconds for the BluRay disk / player to "boot", will anyone really care? This is just a waste of effort, there is nothing that is THAT good that can't wait until its on HBO/TNT or Free OnDemand. Let the Movie Companies and Empty Suits/Suitests freeze in the dark. My wife, my family nor I give a crap about some 5% issue that is lost with munching on popcorn, telephones, A/C, heaters, dogs and bad speaker placement.
Anyone have a laser disk they want to use as a fresbie?
Only a tiny proportion of jerkoff audiophiles and home cinema nuts will be able to hear the difference.
I wish they spent all that time, effort and money on improving something that actually needed improving
Now we know what slashdot's parent company meant by looking for ways to further monetize their online properties
You can't improve audio quality of *audible frequencies* by increasing resolution of the horizontal axis (sampling frequency) beyond a rate which surpasses the Nyquist frequency for human hearing.
Nyquist-Shannon notwithstanding, the range of human hearing is wider than 20kHz.
http://www.cco.caltech.edu/~boyk/spectra/spectra.htm (a properly conducted experiment)
That said, doubling the sampling rate isn't going to do anything for a digital signal. At best, the new signal will simply play each of the old signal's samples twice.
After all, I am strangely colored.
You can't improve audio quality of *audible frequencies* by increasing resolution of the horizontal axis (sampling frequency) beyond a rate which surpasses the Nyquist frequency for human hearing.
Nyquist-Shannon notwithstanding, the range of human hearing is wider than 20kHz.
http://www.cco.caltech.edu/~boyk/spectra/spectra.htm (a properly conducted experiment)
That article says nothing about the human hearing range other than making a reference to some other unproven hypothesis. The article does show that instruments produce frequencies will above 20kHz, which was never really in question.
Assuming from the press release that this is an apodizing filter that 'removes' Gibbs effect preringning... how many peer reviewed studies can we compile here below my post that indicate anyone can hear these 'artifacts'?
Ready, on your marks, go!
Be careful of publications from the businesses that are pushing these filters (eg, Meridian audio / J. Robert Stuart). AES papers count only if the results have been independently reproduced (the AES is in the business of publishing 'interesting ideas', and the papers run the gamut of careful science to raving lunacy).
"I listened and _totally_ heard a rounder, fuller sound with better staging" is not data. It's what audiophiles have said of every nonsense breakthrough of the past 40 years. The one true breakthrough, digital audio, they generally still roundly pan and feel the need to 'fix' by sprinkling fertilizer all over it. It's like holy penguin pee, only it smells bad.
'Proofs' and explanations are nice, I indulge in them myself, but the blind listening data is the final authority.
It doesn't matter if there's a mathematical difference, it matters if there's a perceptible one. There's a lot out there that you can prove mathematically is more like the actual original sound wave. None of that shit matters to reproduction for human enjoyment. What matters is if the difference is perceptible to humans. The sound wave could be totally different and if humans can't hear the difference it doesn't matter.
That is the whole thing behind lossy compression. You can do an imperfect deconstruction/reconstruction of a sound wave and humans will have trouble telling the difference, or find it impossible at higher bitrates. Telling the difference as an objective matter isn't hard, you can do it on a scope, FFT, with a diff, whatever. Telling the difference listening to it is impossible (with sufficiently high bitrate, like 256k MP3).
Also don't think that just because the AES is the society for audio engineers they are immune to audio voodoo. The world of pro audio is one I play in as a hobby, and there's voodoo that goes around in it to. A friend of mine is a professional audio engineer, and a good one, but he gets in to the voodoo. He has special cables, paints the edge of his CDs green with a marker, and things like that. He's convinced he can hear a difference.
Some devices also call it dynamic range compression. You, well, compress the dynamic range of the soundtrack. Movies are supposed to have large dynamic range, by design, and they (usually) ship with the full range soundtrack on disc. Dialogue can be 30-40dB below peak (how much is actually encoded in the Dolby stream). Theater reference levels are 105dB on the mains, 115dB on the sub. That is how loud it is allowed to peak for big hits and so on.
You can have that at home too. Get a good receiver with its own mic for measuring and configuring our setup and you can have the volume dial configured to cinema reference, hence the negative numbers. 0dB will mean full theater reference, same levels you'd get at the movies.
If that is too much dynamic range, kick on the compressors your gear has. Most receivers call it "night mode" and it'll squash the shit out of the dynamic range. You can also axe the subwoofer, big hits come from it and when a system doesn't have one, it'll further squash them down (as to not blow out your main speakers).
You can't add information where none exists. If you have only 48,000 samples for a given second of audio, interpolating 96,000 samples from that doesn't make the audio better any more than using an HD camcorder to record an SD television displaying a VHS movie will improve the movie's video quality.
That said, doubling the sampling rate isn't going to do anything for a digital signal. At best, the new signal will simply play each of the old signal's samples twice.
Actually upsampling can be useful when you apply digital filters. There is no such thing as an ideal filter, so if you modify one frequency band (e.g. in a equalizer) you end up modifying all others. The higher is the sample rate the lower is this sideband interference.
`echo $[0x853204FA81]|tr 0-9 ionbsdeaml`@gmail.com
There's no such things as an ideal analog filter. No such fundamental problem exists for digital filters.
I've seen it when the cap was biassed at a few volts, and this was in a cheap measurement instrument using PIC with a 10 bit ADC. I sorted it by designing out the bias, as the product was low cost.
"they're probably marginally worse that"
Let me guess: you're an American, right?
It's "worse THAN"...
lots of comments from people *insisting* that there's no difference.
TV's upscale low-res (standard) broadcasts to 1080 - does the resulting picture look hi-def? No. Does it look better than directly displaying the lower-res? Yes.
At this point, whether the upscaled picture looks as good as 1080 is irrelevant - it just looks better than if you'd displayed the low-res picture without any processing.
Do you then complain endlessly about how it's not actually 1080? Apparently, lots of the posters here would.
A response ignorant of the problems of digital filtering. Go have a look at the frequency response of a digital filter to see why one would want a damn high resolution before it's applied. It's not a lovely smooth 20db/octave rolloff with a flattening of response due to leakage that you get on an analogue system.
If you are approaching digital and analogue filters the same way then please do everyone a favour and leave the industry.
Bullshit. Assuming enough resources (computational time and storage), any 4-bit computer can implement an ideal digital filter with as high resolution and steepness as you wish. Something you can not simply get with an analogue filter due to laws of physics.
In practice, you have limited resources, usually time because you're realtime-constrained, and then you must cheat. If you must cheat, a higher sample rate may or may not benefit you.
c++;
There are far too many BD audio formats already, AC3, DTS, DCA, DTS-master, etc, etc, etc. With a decent ($3000) surround-sound HT setup and 40 year old ears, I cannot tell much difference between any of them. I wish the BD producers focused more on doing better video transfers. I'd much rather they use the space wasted by these new audio formats on higher bitrate video (and the same goes for the useless, space-wasting extra features).
As far as I'm concerned, the only thing these extra audio formats do is make ripping the files & playing them back via an embedded streaming device more complex. My oldest device cannot handle any of these new fancy formats beyond AC3, so I need to remux newer BDs to add an AC3 sound track to the MKV.
Sigh
You forget about the upcoming cochlear implant for ultrasonic localization. I fully expect to realize an return on all my audio "investments". On a side note, all my alien friends already say our movies sound like s&it smells.
You can't improve audio quality of *audible frequencies* by increasing resolution of the horizontal axis (sampling frequency) beyond a rate which surpasses the Nyquist frequency for human hearing.
Nyquist-Shannon notwithstanding, the range of human hearing is wider than 20kHz.
http://www.cco.caltech.edu/~boyk/spectra/spectra.htm (a properly conducted experiment)
That said, doubling the sampling rate isn't going to do anything for a digital signal. At best, the new signal will simply play each of the old signal's samples twice.
Just because musical instruments produce frequencies above 20kHz (as shown in your link), it doesn't mean that the average human can hear them. Younger people can hear frequencies up to ~20kHz, and maybe a bit above, but most middle age adults probably cut off around 15kHz or lower. Here's one study showing 18-24 yr olds who can mostly hear 24kHz, but they're generating the sound at 117 dB -- a very dangerous level for more than just a few seconds. (http://informahealthcare.com/doi/abs/10.3109/00206098409070087?journalCode=ija)
Listening to loud sounds (>85dB) for extended periods of time will decrease the high frequency response of the human ear, so I wonder if high frequency hearing in children and teens of the last decade or two will have even worse hearing that their parents due to the ubiquitous white ear buds.
"X. Significance of the results
Given the existence of musical-instrument energy above 20 kilohertz, it is natural to ask whether the energy matters to human perception or music recording. The common view is that energy above 20 kHz does not matter, but AES preprint 3207 by Oohashi et al. claims that reproduced sound above 26 kHz "induces activation of alpha-EEG (electroencephalogram) rhythms that persist in the absence of high frequency stimulation, and can affect perception of sound quality." [4]
Oohashi and his colleagues recorded gamelan to a bandwidth of 60 kHz, and played back the recording to listeners through a speaker system with an extra tweeter for the range above 26 kHz. This tweeter was driven by its own amplifier, and the 26 kHz electronic crossover before the amplifier used steep filters. The experimenters found that the listeners' EEGs and their subjective ratings of the sound quality were affected by whether this "ultra-tweeter" was on or off, even though the listeners explicitly denied that the reproduced sound was affected by the ultra-tweeter, and also denied, when presented with the ultrasonics alone, that any sound at all was being played.
From the fact that changes in subjects' EEGs "persist in the absence of high frequency stimulation," Oohashi and his colleagues infer that in audio comparisons, a substantial silent period is required between successive samples to avoid the second evaluation's being corrupted by "hangover" of reaction to the first.
The preprint gives photos of EEG results for only three of sixteen subjects. I hope that more will be published.
In a paper published in Science, Lenhardt et al. report that "bone-conducted ultrasonic hearing has been found capable of supporting frequency discrimination and speech detection in normal, older hearing-impaired, and profoundly deaf human subjects." [5] They speculate that the saccule may be involved, this being "an otolithic organ that responds to acceleration and gravity and may be responsible for transduction of sound after destruction of the cochlea," and they further point out that the saccule has neural cross-connections with the cochlea. [6]
Even if we assume that air-conducted ultrasound does not affect direct perception of live sound, it might still affect us indirectly through interfering with the recording process. Every recording engineer knows that speech sibilants (Figure 10), jangling key rings (Figure 15), and muted trumpets (Figures 1 to 3) can expose problems in recording equipment. If the problems come from energy below 20 kHz, then the recording engineer simply needs better equipment. But if the problems prove to come from the energy beyond 20 kHz, then what's needed is either filtering, which is difficult to carry out without sonically harmful side effects; or wider bandwidth in the entire recording chain, including the storage medium; or a combination of the two.
On the other hand, if the assumption of the previous paragraph be wrong â" if it is determined that sound components beyond 20 kHz do matter to human musical perception and pleasure â" then for highest fidelity, the option of filtering would have to be rejected, and recording chains and storage media of wider bandwidth would be needed."
After all, I am strangely colored.
Its probly also a typo. And im probly makeing a few ov them on purpis.
APK quotes people (including myself) without context and should not be trusted. Just thought you should know.
... is that yours aren't the only ears in the room. Even if you don't need 24/96....
Think of the puppies ! ! !
Oh, I'm sorry sir, I thought you were referring to me, Mr. Wensleydale.
The thing I don't get about all these consumer-grade audio products hawking 96k sound are why engineers, who should know better, are attracted to them. It's as though we came out with a new TV that displays past the wavelengths of visible colors into the ultraviolet spectrum. Sold, presumably, with the tag line, "Our new plasma television reproduces all the colors that you CAN'T see... but BEES CAN!" Likewise, dogs in your household may appreciate an upgrade to 96k sound, but if humans cannot physiologically perceive sounds above about 22-23 kHz... then why... why...
When dealing with digital inputs, too much accuracy in the audio stream produces harshness and digital fatigue.
Citations, Please. This sounds very much like analog fan boy bullshit. "Too much accuracy", really? I have seen quite a bit idiotic pseudoscience used to explain why analog is better than digital. If you just like vinyl over CDs that is fine, just say so, but don't try to snow job me with some jargon filled statement in an effort to back up your personal tastes.
HA! I just wasted some of your bandwidth with a frivolous sig!
The sad fact of it is that people don't like high quality audio.
When people cannot tell the sound is coming out of a box, they can't distinguish artificial sounds from other environmental acoustic sounds, and the effect can be quite unnerving. You don't know if a scream is on the recording or happening in the room next to you. People like to be able to tell the sound is coming from a speaker.