Can You Really Hear the Difference Between Lossless, Lossy Audio?
CWmike writes "Lossless audio formats that retain the sound quality of original recordings while also offering some compression for data storage are being championed by musicians like Neil Young and Dave Grohl, who say compressed formats like the MP3s being sold on iTunes rob listeners of the artist's intent. By Young's estimation, CDs can only offer about 15% of the data that was in a master sound track, and when you compress that CD into a lossy MP3 or AAC file format, you lose even more of the depth and quality of a recording. Audiophiles, who have long remained loyal to vinyl albums, are also adopting the lossless formats, some of the most popular of which are FLAC and AIFF, and in some cases can build up terabyte-sized album collections as the formats are still about five times the size of compressed audio files. Even so, digital music sites like HDtracks claim about three hundred thousand people visit each month to purchase hi-def music. And for music purists, some of whom are convinced there's a significant difference in sound quality, listening to lossy file formats in place of lossless is like settling for a Volkswagen instead of a Ferrari."
Usually if the bitrate is above 256kb/s, i dont notice any difference.
Ofcourse it still effects some songs (especially the percussion parts).
We recently discovered that human hearing beats the linear response assumptions used in lossy codecs. So yes, their criticisms are scientifically founded.
Slashdot's rate-of-post filter: Preventing you from posting too many great ideas at once.
The concept of improving consumer listening experience using studio quality recording has been thoroughly debunked, right here on Slashdot...
Why Distributing Music As 24-bit/192kHz Downloads Is Pointless
"Here Lies Philip J. Fry, named for his uncle, to carry on his spirit"
Your preference for 24/96 audio as a listener is entirely due to the placebo effect.
Well, in all fairness, listeners may actually hear perceptible differences between 24/96 and 16/44.1 audio sources due to different mastering, but of course that says nothing about whether they can actually tell the difference between the two bitrates when everything else is equal.
This article is a pretty good explanation of why 16/44.1 is as good as anyone needs for playback.
Momentarily, the need for the construction of new light will no longer exist.
You sure can hear the difference if you stick a 44.1kHz DAQ in a 96kHz signal chain before filtering out ultrasonic high frequency components (if there are enough to make a difference). The advantage of 96kHz recording isn't that it can capture any more human-audible frequencies than 44kHz can, but that you have a lot more leeway to prevent aliasing of signals above the Nyquist limit down into the audible range (a 25kHz tone sampled at 44kHz results in a spurious, highly audible (25-44/2)=3kHz aliasing signal).
It's pretty much impossible to build analog frequency filters with a sharp cutoff (e.g. everything below 20kHz and below gets through, everything above 22kHz is -60dB attenuated), so recording at 44.1kHz sampling requires either being absolutely certain the original sound source has minimal high-frequency harmonics, or heavy analog filtering that cuts well into the audible high frequency range. With 96kHz sampling, it's much easier to build an analog filter that gradually rolls off high frequencies between 20kHz and 40kHz (...producing a >40kHz sound is tricky in the first place), preventing aliasing without the filter cutting into the audible range. Once digitized, it's trivial to make a *digital* filter with a perfect frequency cutoff to downsample the 96kHz to aliasing-free 44.1kHz.
44.1hkz 16bit audio is completely transparent to the human ear. No one has ever been able to detect when a 16bit DAC ADC pair has been placed in a 24/96 audio path.
Your preference for 24/96 audio as a listener is entirely due to the placebo effect. There are good reasons to master audio in high res, but for listening 16 bit 44.1khz audio is as good as anything.
As a former audio professional (specialized in location recording of choirs and orchestras) I must agree. But even my aging ears can hear the difference between 44.1 (or 48)kHz 16 bit uncompressed and a typical MP3. Side note: 24-bit has a few audible advantages for music with extremely wide dynamic range (from ppp to fff, say) where 16 bit will struggle a little at the very soft end.
This is the real point: People are so used to listening to music with no dynamic range, on ear buds, in crappy acoustic environments that they wouldn't know where to start listening for a difference.
No sig today...
Speaking as someone who frequently does recording, your comment suggests that no one has done that test with classical music in a properly controlled listening environment using quality gear while giving the test subject the ability to control the volume arbitrarily. When you crank up the volume, the noise floor difference in soft passages alone should make the difference between 16-bit and 24-bit signal paths a dead giveaway, even for someone with moderate to severe hearing loss. It isn't even subtle. Of course, if the person doesn't turn it down for the loud passages, he/she will likely suffer hearing damage, but perhaps that's why he/she has moderate to severe hearing loss in the first place. :-D
The 44.1 vs. 96 kHz difference is more subtle, requiring someone with top-notch hearing (very rare), headphones that can accurately reproduce frequencies above 20 kHz, and 96 kHz DAC hardware that does not have a bandpass filter starting at 16 kHz. If you fail to verify even one of those requirements, you would expect no one to be able to hear the difference, because there won't be any difference.
Check out my sci-fi/humor trilogy at PatriotsBooks.
No, not at all like 640K.
The point of the equipment is that you have quality in reserve as you go through the process of mastering the tracks. The more quality you have in reserve the more you're able to do before you start having to deal with artifacts and other nastiness. As with all such things, you have to think about the order in which you do things and the order in which you throw out data to get the best results.
The point of buying lossless music isn't so much that it's better for listening to, it's that you can compress it however you like later on without having to worry as much about the sound quality you get. Since you have more data to work with, you can get a better quality at a lower bitrate than if you were starting with an already compressed track.
I will tell you now that the average person cannot hear to 20khz. Young children can. Anybody who has listened to loud music for any length of time have blown away the top couple of khz of their audio range.
If you have ever gone to a rock concert and been near the front or gone to most dance clubs and you will have sustained hearing damage. If you have ever left one of these venues with ringing ears, or been around loud machinery and noticed the same, then you have sustained hearing loss. Your hearing will recover mostly after the trauma and that will be indicated by the subsiding of the ringing of your ears.
If you want to find out how your good/bad hearing is, spend the money and see an audiologist. You will be surprised on to find out what your hearing is really like.
"Every security scheme that is based on secrets eventually fails." - Steve Jobs
Good point. Sadly, my $3k hearing aids don't seem to help either.
Bitrate doesn't matter much if your ears are the lossy part.
Lets start refering to The War Against Terror by it's initials. . .
According to Wikipedia the audible range for human hearing is around 130dB. 16 bits can in best case offer a dynamic range of 96 dB, whereas 24 bits offer 144 dB.
So it should be pretty obvious that you can't fit the entire audible range into 16 bits. This might not be relevant to modern day music. But if you want to record what the ear is actually capable of hearing (not including sound levels above the pain threshold) you will need those 24 bits.
Two nits to pick:
1) You can get arbitrarily close but you can't get "perfect" frequency cutoff.
2) A 25 kHz tone sampled at 44 kHz gives you a 19 kHz tone. Remember the [-pi:0] (or [pi:2*pi]) frequency range comes first.A 41 kHz tone would get you a 3 kHz tone after sampling.
Otherwise all true, which is why most recording devices do exactly that, sample at a high rate and digitally filter before downsampling to 44.1. But none of that has much to do with whether or not, once you've gotten past the aliasing problem as you say, you can tell the difference between a 44.1 kHz playback and a 96 kHz playback.
Caveat: You have to have decent headphones (not Apple earbud BS), and/or good speakers, but that's about it. The difference is negligible once you hit ~320Kbps MP3, in my opinion, but anything under 256Kbps, regardless of lossy format, you can *clearly* hear cymbal hits turning to an underwater splooshy mess.
Highhats are even worse than cymbals. Even at 256 kbps, highhats tend to sound like they're being hit with a bag of broken glass, and is the easiest way to identify lossy compression I can think of. Except, perhaps, some of Mike Oldfield's earlier works.
1) Digitally, yes you can. Take the DFT of the data; zero out all components above your frequency cutoff; reconstruct the signal as the sum of below-cutoff frequencies. Voila, a perfect sharp cutoff. The only subtlety is that you can only choose an exact cutoff corresponding to some integral number of cycles in your sampling window, so you can't cutoff at exactly sqrt(e*pi)kHz --- but you do have plenty of wave numbers from which to select a perfect cutoff (increasing with the size of your DFT window).
2) Untrue: a 44kHz *sampling rate* has a 44/2=22kHz Nyquist cutoff. Frequencies f>22kHz Nyquist limit "wrap around" to f-22kHz difference frequencies.
But yes, I agree, on the playback side there's no audible difference between a (sufficiently well made) 44.1kHz and 96kHz DAC.
When the music gets soft in 16b you have a lot of zeros in front of the number. So you effectively only have a three or four bit signal being fed into the DAC. This is fixed point math, not floating. With 24b you can put all of those zeros in the front and still have eight or more bits to feed into the DAC. This is even more beneficial when the amp implements power supply volume control. PSVC raises the effective noise floor the DAC has to deal with.
But yes, I agree, on the playback side there's no audible difference between a (sufficiently well made) 44.1kHz and 96kHz DAC.
No, but what makes a big difference is when you have a 48 kHz sound card that resamples everything to 48 kHz for an internal DSP stage that cannot be bypassed, and then back again. Yes, Soundblaster Audigy, I'm looking at you.
44.1 -> 48 kHz gives a lot more audible artifacts precisely because they're so close. Think of it as audible moire.
Also, for newer computer audio cards, if you have a choice, use 88.2 kHz for the internal rate instead of 96 kHz. The reason is that most high quality sound is in 44.1 which converts perfectly to 88.2. For 48 kHz, it's less of a problem in the first place, and likely also worse quality sound to start with.
Of course, unless the rest of the audio path is good, it doesn't matter much, but if you like to listen to FLACs with high end headphones, it sure won't hurt to use 88.2 instead of 96 kHz.
In a finite window, *any* signal can be represented as a sum of elements with frequencies corresponding to n=0 (DC offset), 1, 2, 3, ...., infinity integral cycles in the window. A signal corresponding to a non-integral number of cycles, e.g. 100.5, is indistinguishable over the window from some (infinite) combination of integral cycle waves. If you measured in a window twice as long, the 100.5-cycle signal would now be a unique, identifiable 201-cycle component. So, in an important sense, in a finite window the "intermediate" frequencies "don't exist" --- they can't do anything different from the (infinite series) of integral frequencies. Thus, you can create a cutoff that is as "perfect" as is meaningful in a finite window.
I think the real point is that there are known limits to human hearing and many audiophiles fantasize about their hearing being superhuman. It just ain't so. Dynamic range compression is one thing, but perceptual compression, sample rate, and bit depth are a different matter. No audiophile has ever heard the difference between FLAC and 320Kbps mp3 audio in an ABX test at a statistical rate that is better than guessing.
Any time this argument starts, I refer people to this well written article that lays out the limits of human hearing compared to the specifications of recording formats...
Actually, you've proven the GP's point. You can't tell the difference if you are listening to the program. Turning a program up in the "soft sections" is exactly what you should never, ever do when listening to a program. You may as well put on the IR headset with compression that came with your TV so you can watch late night TV without disturbing your wife.
Mastering is an entirely different ball of wax and, yes, you want all the headroom you can get. It's no different than photographers using RAW formats instead of JPGs (even lossless JPGs) out of the camera. You want all the bits you can get. But after your done mastering, dropping to 16bits isn't going to affect the outcome. That's the whole point of mastering - if we didn't want to be that soft, we would have engineered it to be louder.
Is it just my observation, or are there way too many stupid people in the world?
As a live sound engineer dealing with vocalists who do that regularly (sing at normal program levels and then BELT A PHRASE OUT)... let me say... ARGH.
I put a steep compressor on someone who's prone to doing that, and let me tell you, it makes my life much easier. I can't fix the clipping, but I can make sure they don't cause the audience to cover their ears.
That's correct, there is no audible difference to a human between a 22kHz sine wave and a 22kHz any-other-shape periodic wave. Not to mention, no adult human can hear 22kHz anyway. I hear 16kHz. My 9-year-old can hear 19kHz. Get a frequency generator app and test yourself -- it's fascinating.
FLAC is about an order of magnitude simpler than MP3. I once implemented a decoder in about an hour over lunch just because I could. And because many lossless codecs feature error detection, they're much more likely to survive as a long-term archive than something like MP3 which doesn't even have a container or any reliable way to verify that the file's contents are correct.
Ita erat quando hic adveni.