2nd Multi-Format 128kbps Public Listening Test

← Back to Stories (view on slashdot.org)

2nd Multi-Format 128kbps Public Listening Test

Posted by ryuzaki0 on Thursday May 13, 2004 @09:50AM from the music-to-one's-ears dept.

technology is sexy writes "Roberto Amorim has launched his latest public listening test evaluating the performance of different audio codecs at 128kbps, among them Apple's AAC implementation (used in iTunes), LAME, Ogg Vorbis fork auTuV, WMA, Musepack and even Sony's Atrac3 format, which is soon to be used in their own music store. Read more on Hydrogenaudio and check out the results of prior tests. As opposed to most evaluations of audio codecs, this is a scientific test adhering to ITU-R BS.1116-1 as much as possible while still allowing everybody to participate."

24 of 316 comments (clear)

Listening test? by Anonymous Coward · 2004-05-13 09:51 · Score: 5, Funny

Never heard of it.
Ogg! by gekkotron · 2004-05-13 09:52 · Score: 4, Funny

Ogg, ogg ogg. Ogg oggity ogg ogg!

Now that that's out of the way, let the insightful comments begin.
1. Re:Ogg! by MikeXpop · 2004-05-13 09:56 · Score: 4, Informative
  
  Here's insightful. Ogg is a wrapper. It has nothing to do with the quality of the sound. You should be chanting Vorbis.
  
  --
  Etiquette is etiquette. He kills his mother but he can't wear grey trousers.
2. Re:Ogg! by gekkotron · 2004-05-13 09:58 · Score: 4, Funny
  
  Vorbis vorbis vorbis!
  
  Nope, it just doesn't have the same ring to it.
  Plus, vorbisty just doesn't work.
3. Re:Ogg! by Anonymous Coward · 2004-05-13 10:00 · Score: 5, Insightful
  
  I wish there was a filter that scored any post with the words "You're new here, aren't you?" -5 stupid joke.
4. Re:Ogg! by Neil+Blender · 2004-05-13 10:13 · Score: 5, Funny
  
  I wish there was a filter that scored any post with the words "You're new here, aren't you?" -5 stupid joke.
  
  I, for one, would welcome our new filter overlords.
5. Re:Ogg! by morcheeba · 2004-05-13 10:17 · Score: 4, Funny
  
  You're not doing it right. Try this:
  
  Vorb-Vorb Vorbbity Vorb Vorb.
  Bissy Bis... ba bis bis bis.
  Vorbbity Vorbbity va va vorb. bissity bis.
  
  --
  HIV Crosses Species Barrier... into Muppets
Re:Honesty of responders by Per+Wigren · 2004-05-13 09:59 · Score: 5, Insightful

Great, now all the ____ fanboys are going to forge results to make their codec look good. Talk about useless tests.

Not possible. All you will get is a bunch of WAV-files, you have no way to tell which file belong to which codec.

That said, I don't care which codec wins the test because Vorbis is still the only one free from patents and the margins are so incredibly small.
Vorbis will win for me even in the unlikely scenario that it comes out last.

--
My other account has a 3-digit UID.
Performance is only one more factor by rnbc · 2004-05-13 10:00 · Score: 4, Insightful

Yes... certainly this kind of listening test is important to access the capabilities of each codec.

But in the real world other factors may be more important to chose a coded, like for example general acceptance, freely available code and specs, and a large content base available.

You see: performance will increase allways in all codecs with time... so this kind of testing is only a minute factor amongst others.

--
You cannot proceed from the informal to formal by formal means
Re:Objective audio analysis by trentblase · 2004-05-13 10:00 · Score: 5, Interesting

Because "human auditory capacity" is not fully understood. Sure we can give standard frequency response graph, but most of these codecs take advantage of psycho-accoustic hearing models -- where certain frequencies mask other frequencies in our perception. Since this is a developing field, objective listening tests could really help determine what's working and what's not.
Re:Objective audio analysis by The+Clockwork+Troll · 2004-05-13 10:02 · Score: 5, Insightful

That is a great idea in theory, however there is much debate on how psychoacoustics work, i.e. what information really "needs" to be there in music in order to be perceived.
For example, conventional wisdom says that the human ear cannot detect sounds above roughly 20kHz, yet there is at least some anecdotal evidence that higher order harmonics shape what we hear.
If "normal" human auditory capacity was a completely decoded topic, there wouldn't be nearly as much a need for different approaches to music compression (it would be a much simpler problem with fewer possible solutions)

--

There are no karma whores, only moderation johns
Re:Objective audio analysis by j3ll0 · 2004-05-13 10:05 · Score: 4, Insightful

Well I could be wrong, and forgive me if I've misinterpreted your post...but

Don't all of these compression algorithms rely on psychacoustic modeling to remove 'extraneous' information from the bitstream?

If that is correct, and the algorithms are implemented correctly, then really what we are looking for is the best perceived result.

Just because the output meets the algorithm input->output specs, justn't mean it's the best output as perceived by humans.

Maybe think of it as optimizing sort routines? Yep, bubble-sort or b-tree still output a sorted list, but the perceived value is that the b-tree is better because it performs it's function more quickly.

This isn't an exercise in getting the frequencies algorithmically correct - the end result has to be listenable.

Humans are analog devices...
Re:No matter *what* by mrgreen4242 · 2004-05-13 10:06 · Score: 4, Insightful

128kbps doesn't cut it. It's an absolute lossy, disgusting bitrate, no matter what it's in. They should test similar file sizes instead of by bitrate, to determine whether something is good or not- this gives a better impression of quality vs size, instead of a purely comparison based test.
Uh, if the sample is the same length, and the but rate is the same, won't the file size be the same as well? A 10 second sample at 128 Kb Per Second should be 1280Kb regardless of the format, no?
And, just FYI, MOST people, something like 95% of listeners cannot tell the difference between 128kbps sample and the original. I generally can't, even with decent headphones on.
I think that all you compression elitist snobs work for HD manufacturers, trying to get me to buy a 250GB drive to store the same amount of music as my 60GB will hold!
Uh, file size *is* bitrate... by rsidd · 2004-05-13 10:06 · Score: 4, Insightful

a given audio stream, at a given bitrate, for a given length of time, always has the same filesize. What else do you think bitrate measures?
BTW, I think the difference between MP3 and Vorbis at 128 kb/s is perfectly noticeable. MP3 sounds rather bad, vorbis sounds pretty good. And the point is precisely to tell which format sounds best, so you don't want to do 512 kb/s bitrate where all formats sound close to CD quality.
Re:Objective audio analysis by Woogiemonger · 2004-05-13 10:06 · Score: 4, Interesting

Because "human auditory capacity" is not fully understood. Sure we can give standard frequency response graph, but most of these codecs take advantage of psycho-accoustic hearing models -- where certain frequencies mask other frequencies in our perception. Since this is a developing field, objective listening tests could really help determine what's working and what's not.

From my understanding of MP3 compression and others, the compression protocols take advantage of this frequency masking, so if humans can't hear it, it removes it. It also obviously takes into account frequency ranges of hearing. As a side note, I think it might be neat to be able to compress 30-50% better based on your personal hearing characteristics, but it'd stink if you got old and had to not only wear a hearing aid, but also start collecting MP3's all over again.
Re:Objective audio analysis by Anonymous Coward · 2004-05-13 10:11 · Score: 5, Informative

The purpose of a "perceptual" encoder such as MP3 is to remove the frequencies one cannot perceive. The frequency graph therefore need not be the same as the original and yet the encoded version may not be distiguishable from the original.
Also, a frequency plot tells us nothing about the phase or frequency distribution at certain times in the signal. I can make a sine sweep that would match exactly the spectrum of a pop song, but obviously would sound nothing like it.
There are ways of objectively measuring the performance of perceptual encoders, but frequency analysis isn't really one of them.
Re:No matter *what* by Jugalator · 2004-05-13 10:11 · Score: 4, Insightful

No matter *what*?

Not even if it's about average quality speakers?
Not even if it's about some rather cheap speakers?

I can't say I hear much of a difference with modern codecs, and I own some average speakers. Maybe 128 kbps mp3 can sound bad (although that depends a lot on the kind of music), but that's an aging codec anyway. I think encoded files in the 192 - 256 kbps range is the best, and 128 kbps ogg's often acceptable, especially with the DFX plugin (or similar) for Winamp to compensate for shortcomings in compressed formats.

I'd definitely not call 128 kbps in modern codecs "disgusting". In ogg's I've found it to be roughly as 160-192 kbps mp3's and that's perfectfly fine for my ears.

--
Beware: In C++, your friends can see your privates!
Re:Objective audio analysis by tashanna · 2004-05-13 10:13 · Score: 4, Informative

Frequency analysis only gets you part way there. For those who didn't look around at the articles (I'm not refering to you, of course; just some hypothetical /. reader), there are time domain audio effects that are not visible on FFT plots. An example of this is pre-echo. With pre-echo you get a n echo of an upcoming sound (like a drum beat) before the actual sound happens. This can happen when linear-phase FIR filters are used, but is also an artifact of some frequency domain encoder/decoder systems. The FFT is only part of the story.
Re:How about: by Carnildo · 2004-05-13 10:21 · Score: 4, Funny

FLAC! Flac-a-flac-a-flac!

Aflac? What does a silly duck have to do with sound compression?

--
"They redundantly repeated themselves over and over again incessantly without end ad infinitum" -- ibid.
Sound quality is in the speakers by Anonymous Coward · 2004-05-13 10:22 · Score: 5, Insightful

When you listen to compressed audio over inexpensive speakers / headphones, you can't hear the difference. With my Sony Studio Monitor headphones, I lost the difference at about 250k with mp3, so I started using 320K as that was the best at the time. Then I bought $2000 Martin Logan Mosaic Speakers, and the original CD was clearly better than even the 320K bitrate. So now I only do lossless compression. That's fine at home, but in any other environment, there's usually so much noise and distractions that even if you had excellent headphones or speakers, you wouldn't appreciate that little difference lossless brings over 256K or even 128K.
Re:No matter *what* by Gumber · 2004-05-13 10:23 · Score: 4, Insightful

And how do you know what you are asserting? Have you done properly controlled listening tests with 128kbps encoding using a variety of codecs?

The fact is that for a lot of people, knowing the best codec at 128kbps is worth knowing because:

1) They are using portable devices where they are space constrained
2) They are using portable devices that may not have the perfect fidelity of a high-end sound system, but can go anywhere with them.
3) They are using their portable device in a somewhat noisy environment that overshadows any sound quality issues caused by a lower bitrate.
Re:Okay... by sploo22 · 2004-05-13 10:23 · Score: 5, Informative

DON'T CLICK THE LINK!

The sad thing is that somebody went to the trouble of putting together a perfectly reasonable, logical post just to throw in a porn link. *sigh*

--
Karma: Segmentation fault (tried to dereference a null post)
Re:What ever happened to r3mix.net? Any replacemen by DeeKayWon · 2004-05-13 10:51 · Score: 4, Insightful

r3mix.net died because people actually did objective analysis of his recommended LAME settings and found they were crap. IIRC, the main guy behind it wasn't very accepting of criticism. Plus, he was a message board spammer.
The best replacement for r3mix.net in my opinion is HydrogenAudio . The forums are frequented by a lot of professionals, as well as developers of LAME, FLAC, Nero AAC, Musepack, Wavpack, and other codecs.
Re:What ever happened to r3mix.net? Any replacemen by JebusIsLord · 2004-05-13 11:25 · Score: 4, Informative

The r3mix tuning (--r3mix), while a small step forward, was inherently flawed because of his insistance on tuning based on pictures instead of acual listening tests. As a result, the --dm-presets were invented and improved by Dibrom (the HydrogenAudio founder) along with a multitude of testers. eventually those were included in LAME as the --alt-presets (and in the latest version they just replace the normal --presets). In short, Hydrogen Audio is THE place to go for this stuff now.

--
Jeremy