Slashdot Mirror


Audio Format Listening Tests Concluded

Pointing to the conclusions of this listening study, nullity writes: "The results are interesting, and show a high variation in the performance of the various codecs on different musical styles. Ogg seems to work well on dance music, WMA8 on chamber music, etc."

30 of 337 comments (clear)

  1. WMA8 by af_robot · · Score: 5, Funny

    Ogg seems to work well on dance music, WMA8 on chamber music, etc.

    Like requiem...

  2. Who cares about 64 kbps tests? by splorf · · Score: 5, Insightful

    These tests are all at 64 kbps and most people use much higher bitrates for real music. I'd like to see comparisons at 128k bits minimum, and preferably 160k or 192k, which is what most quality mp3's are at, for direct comparison.

    1. Re:Who cares about 64 kbps tests? by keller · · Score: 5, Insightful

      People who wnat to stream audio of course! .K

      --

      Enig? Det alt for hot det smor!

    2. Re:Who cares about 64 kbps tests? by ProtoCat · · Score: 5, Interesting

      64Kbps is where the flaws of a codec are truely exposed. It's a great median between being too high to produce much results and too low where everything completely falls apart. You may not think any of this has any relevance to you as you're encoding above 128Kbps, but it actually does make a difference when you stress your encoder with a difficult piece of music.

      However, if the difference between sounding 'good' and sounding 'accurate' mean little to you, as someone who'd make an argument of 64Kbps tests being worthless would, then you really aren't the intended audiance of such tests. You can merrily use any of those encoders at 128-192Kbps without ever really noticing or caring much.

      I, personally, would like to see OGG1.0, MP3 Pro and WMA8 take on some real tough to beat codecs such as Dolby's AAC High-Complexity Mode (which no AAC freely available encoder supports, including QuickTime) and Sony's ATRAC3. But, that'd be kinda moot, because most people out there do not have access to those toys.

      For now, I'm content to just watch people hop around and proclaim whatever they want as king of audio formats while sticking to 256Kbps Fraunhoffer MP3 (archival purposes) and 192Kbps LAME HQ MP3 (general usage) as something both widely supported and pratically indistinguishable from the source. Even if AAC-HC and ATRAC3 were freely available, it'd take an awful large effort to wean people off of MP3 so far as support base and to migrate them to a new format. New P2P programs, new players/plug-ins (in some cases) and new hardware players. Not gonna happen for a while.

    3. Re:Who cares about 64 kbps tests? by flipflapflopflup · · Score: 5, Insightful

      > Are you pondering what I'm pondering?

      I thought your sig was part of your comment, and was going to agree entirely. 64 kbps tests are pointless because no-ones uses those rates, and at higher rates the differences become negligable.

      At the kind of bit rates that real people actually use (160, 192, and up), it takes a real pro/audiophile/picky git to tell the difference. Which makes the whole thing seem a bit pointless really.

      Chances are then, it's not going to be audio quality that makes or breaks these standards. Look at betamax...

    4. Re:Who cares about 64 kbps tests? by nagora · · Score: 5, Insightful
      64Kbps is where the flaws of a codec are truely exposed.

      Running your car over a cliff is where the flaws in its safety system are truely exposed but I don't tend to drive over cliffs much.

      However, if the difference between sounding 'good' and sounding 'accurate' mean little to you, as someone who'd make an argument of 64Kbps tests being worthless would, then you really aren't the intended audiance of such tests.

      What do you mean by this? 64Kbits is worthless for listening to any music I own while 128 is good enough to not actually annoy me much of the time so why should I be interested in these tests? Are you saying that the intended audience for these tests are people that are not interested in the quality of the music they're listening to?

      TWW

      --
      "Encyclopedia" is to "Wikipedia" what "Library" is to "Some people at a bus stop"
    5. Re:Who cares about 64 kbps tests? by TummyX · · Score: 3, Insightful


      64 kbps tests are pointless because no-ones uses those rates, and at higher rates the differences become negligable.


      With MP3 maybe, but OGG, WMA, MP3Pro and ACC all aim to get MP3@128Kbs quality at 64Kbs. That's what this test is FOR! Try comparing a 128KBps MP3 with a 64KBps OGG file. You'll find the quality to be quite comparable. It's a pity the test didn't include MP3 (at 128KBps) so we could see how good these new codecs really are.

      And BTW, you might as well say it is pointless to test 128, 160 or 192kbps and that we should all be testing uncompressed 1500Kbps audio.

    6. Re:Who cares about 64 kbps tests? by gleam · · Score: 5, Insightful

      You don't often intentionally hurl your car at 45 miles an hour into a steel box, either, but insurance companies do it all the time to see how well a particular car stands up to the abuse.

      Even if you don't knowingly take the insurance institute's results (or federal crash-test ratings) into account, the company selling you insurance does, and your premiums will be higher.

      To say "just because i'll never do something this way it has no merit" is silly. Performance in a 45-mile-per-hour offset crash will tell a car company how well it would stand to you accidentally bumping into the corner of your garage, or into the bumper of another car.

      Tests like this are important because they're indicative of performance at all bitrates. If you want to know WHICH codec will sound the best at 128kbit, you should look at which codec sounds the best at 64kbit--the two are likely to be the same.

      There are two intended audiences for this test: 1) people trying to decide which audio format to use for a stream (which are very often in the 32-64kbit range)

      and

      2) people who realize these tests can tell us much more than simply which codec performs best at 64kbit, and want to know how to maximize the quality-to-diskspace ratio on their own encodings.

      Hope this clears something up for you.

      -gleam

      --
      this .sig is not a .sig.
    7. Re:Who cares about 64 kbps tests? by perlyking · · Score: 3, Funny
      64 kbps tests are pointless because no-ones uses those rates, and at higher rates the differences become negligable.
      Not quite true, I used 64kbps wma files for about a week and listening using really shitty earbuds and walking alongside noisy truck filled roads on the way to work I couldnt tell the difference :-) Now I have some decent portable earbuds though (etymotics) you wont catch me using 64k wma (or I suspect any bitrate of wma!).
      --
      no sig.
    8. Re:Who cares about 64 kbps tests? by Anonvmous+Coward · · Score: 3, Insightful

      Oh but it is CD quality.

      "We took 1,000 people and had them listen to an NSync song off a 64-kbit WMA file, and then off the CD. When asked 'Do you like the song any better?', 999 out of 1,000 people said "No. Can I go now?"

      So yes, there is no quality difference between WMA and CD's, it's just a matter of asking the right question.

    9. Re:Who cares about 64 kbps tests? by Zeinfeld · · Score: 3, Funny
      Everyone is saying 'you would need golden ears to tell the difference' yadda yadda. In my view the whole test is bogus because we don't have figures for the original CD track.

      I would not be at all suprised to see people favor the compressed over the original. The fact is that a lot of so called audiophiles are really pretty ignorant gadget freaks. At university I knew a friend who made money by helping to repackage the components of a bog standard Philips CD player in a pretty box to sell to audiophiles for ten times the price.

      I had a so called audiophile witter on for ages about how this was actually quite rational and how using a more stable motor with reduced wow and flutter dramatically improved the sound. He still does not believe that the quartz crystal controls the sound output rate.

      --
      Looking for an Information Security student project suggestion?
      Try http://dotcrimeManifesto.com/
  3. Hmmmm by cca93014 · · Score: 3, Funny

    Tests confirmed that attempting to encode "Aphex Twin" with any of these codecs caused the PC to tremble at a frequency that, when connected to a refracting laser stuck up Bill Gates' ass, had it spell out "we're all dead" on the nearest wall.

  4. Re:Didnt read the article yet but.... by mccalli · · Score: 5, Informative
    it seems that music you listen closely too sounds better with WMP, and fast, not listened to music sounds ok w/ ogg.

    Well....not quite. There's a different frequency distribution between electronic, pop acoustic and classical music.

    Specifically, electronic music, which most dance stuff is, has a very flat frequency distribution. See this for yourself - load your favourite media player, siwtch on the graphic equaliser graph and watch how basically nothing happens except in the mid-range.

    Now try again with an orchestral piece. There will be much more variation, though in most it will tend towards the top end.

    Now try again with rock. Tends towards the bottom and top, with middle frequencies missing.

    Keep going with any format you feel like mentioning...you'll get the same.

    Actually, this is a striking example of how recording techniques can ruin sound as well. Take a look at the Apollo 440 album - Gettin' High on Your Own Supply. A good mixture of guitars and electronics, right? Well, look at the frequency graph again. See how virtually every guitar frequency variation has been cut out: this music was recorded digitally, mostly using samples by the looks of it. The normal variations you'd associate with having guitars play live are all filtered out, and the graph goes back to the flat digital sound again.

    Cheers,
    Ian

  5. Should compare Ogg as a single entry by Russell+Coker · · Score: 3, Insightful

    Let's assume that anyone who likes Ogg and is seriously into music will compress their music with both Ogg variants and use the best variant for each file.
    Therefore we should also consider taking the best of the two results and comparing it to mp3.
    From a quick look at the results it appears that Ogg will still be edged out by mp3 when analysed in such a fashion, but it's much closer.
    Also a test on several bit rates would be useful.

    --
    See http://etbe.coker.com.au/ for my blog.
  6. So I guess its time for a new frontend by CableModemSniper · · Score: 4, Interesting

    I guess grip will have to use Genre info from CDDB to decide what to encode the the files as now. I wonder if you coudl set up something to optimize individual tracks. Like scan a wav and pick the best codec for the frequencies used in the audio.

    --
    Why not fork?
  7. MP3PRO not MP3 by Anonymous Coward · · Score: 4, Interesting

    I noticed a number of confused posters here... The tested codecs were AAC/MP3PRO/OGG/WMA, not MP3. Had mp3 been tested, it would have lost every round as all of the tested codecs are vastly superior to plain MP3 at this bitrate.

    It also should be noted that the only two samples that WMA beat OGG at (indeed the only ones that it didn't totally flop on) were two very simple samples that are demonstrations of two differnt weaknesses in the current revision of vorbis. Orignally the results page had some very interesting commentary from Monty on this, but it looks like it got pulled.

    With the exception of those two samples, OGG clearly won. Even including those, it was only beat out by MP3PRO by a small margin. When you factor in that MP3PRO isn't available at anything but such low bitrates and that it's substantially more propritary then MP3, it seems like pretty much a no-contest.

  8. Problem Is by BlackGriffen · · Score: 3, Informative

    that these codecs are lossy, and take advantage of the fact that the human ear is better at hearing certain things than others to pair out extraneous info and improve compression. IOW, it doesn't matter how technically different the new files are as long as they still sound the same to the human ear.

    BlackGriffen

  9. Multi-Codec Codec anyone? by BlackGriffen · · Score: 4, Interesting

    Considering that different codecs do better at different music w/ different frequency spreads, who else thinks that the next generation of audio codecs will be multi-modal; in effect, be several codecs in one. Then have each codec specialize on certain types of music. Perhaps even have them run in an advanced mode where they do a frequency analysis of whole songs, rather than just using genre, to automatically select the best codec for the job. Perhaps even use different codecs for different sections of the song. That would definitely help songs like Bohemian Rhapsody and orchestas with movements, etc.

    Would this be too time consuming to implement or what?

    BlackGriffen

  10. Re:Didnt read the article yet but.... by hfranz · · Score: 3, Interesting

    Just a side note about the frequency distribution of different styles of music:

    The reason why classical music generally compresses better is because the frequency distribution of the sound of natural instruments like for instance string instruments (including the human voice) is harmonic. This means that the sound spectrum consists mainly of a superposition of peaks at the base frequencies of the instruments played and their corresponding harmonics at higher frequencies.
    If you were to make a two dimensional spectral analysis of a such sound recording with the time axis to the right, the frequency to the top and the amplitude as the color intensity of the point you would see a lot of wiggling lines at
    regular distances. (BTW: this would make a great visualization plugin for xmms)
    Since audio compression algorithms also make such a spectral analysis and after that discard some of the information below a threshold they can
    reproduce a mainly harmonic spectrum easier than that found in pop or rock music, which is much more complex and more "noisy" because of the
    use of distorting amplification and all kinds of
    percussion.

    Holger

  11. Outiers skewing the results? by Outland+Traveller · · Score: 4, Interesting

    Looking at the data, it looks the two samples where Ogg performed poorly ended up being encoded at a significantly smaller average bitrate than any of the other encoders.

    The table at the end lists LiszBMinor with an average ogg bitrate of 45 and BachS1007 with an average bitrate of 47. Since the other codecs encoded those samples at a bitrate 64 or higher, this may explain the results.

    The results may point to a flaw in Ogg's VBR login rather than in the lossy compression scheme.

    1. Re:Outiers skewing the results? by jonathan_ingram · · Score: 4, Informative
      If you read the thread on HydrogenAudio (which is the message board where most of these tests / codecs are discussed), you'll find the following information from Monty, the lead developer of Vorbis:
      Ogg had a very low bitrate (in the forties) on all the classical samples, which is the way it should have been (Classical solos with their deep noise floors and simple harmonics are relatively easy). But the real reason Ogg scored so low in both (and Beauty Slept as well) was a) the tuning behind noise normalization is still not perfect. This is the very first release of that feature, and the test found flaws b) also the first release of new, more aggressive stereo modes and I think that they too need more analysis infrastructure driving them.

      I expect Ogg's performance on Liszt and Bach to be very subpar NN performance. The poor performance on BeautySlept and Waiting was most likely insufficient stereo analysis. Ogg had the infrastructure to win those four samples, but the encoder didn't know how to do it yet (because I didn't know it would be necessary).
  12. Yes, look at the SCORES by tweakt · · Score: 3, Interesting

    OVERALL RANKINGS (12 SAMPLES)

    mp3pro 49.00
    oggq0 44.00
    ogg64 40.00
    wm8 24.00
    aac 23.00

    The AC above me speaks the truth. mp3PRO has no hope of gaining enough market share to become a worthy competitor. It's a very proprietary extention to MP3. OGG being open source and free (as in beer) has clear advantages for hardware vendors (where it really counts). Lets hope the codec is easy to embed into portable products.

    I want my Portable OGG CD Player! I'll buy the first one that comes out. Could you imagine? Twice the capacity of normal players and it STILL sounds better (or same capacity truly indistinguishable from CD -- at only 128k). Right now I have to encode my mp3's at ~180-220kbit to get something acceptable. =/

    1. Re:Yes, look at the SCORES by micromoog · · Score: 4, Interesting
      mp3PRO has no hope of gaining enough market share to become a worthy competitor. It's a very proprietary extention to MP3.

      mp3PRO has one very specific advantage over all the other formats on the market-share front. It has the characters m, p, and 3 in it. Everyone has heard of mp3, and people who don't care about the open source cause (read: the vast majority of people) will buy an mp3PRO device way before considering an Ogg Vorbis device.

      As I've said before, name is really important when marketing comes into play. And Ogg Vorbis' name simply blows.

    2. Re:Yes, look at the SCORES by Greg+W. · · Score: 3, Interesting

      Twice the capacity of normal players and it STILL sounds better (or same capacity truly indistinguishable from CD -- at only 128k).

      Vorbis 1.0 does sound amazingly good at ca. 128 kbps (VBR -q 4). That's what I've been using lately for CDs that I rip. But it's not "indistinguishable from CD" in all cases. On at least one song ("Feed My Hungry Soul" by the Lords of Acid), I can differentiate Vorbis -q 4 from the original in ABX testing. And I'm not a trained listener, and not using high-end equipment.

      I urge everyone to encode for themselves, using their favorite music CDs, and decide what works best for them. Some people are very sensitive to the lossy stereo separation that Vorbis RC3 and 1.0 employ at low-to-mid-bitrate settings. I was able to hear this clearly on several of the samples in the 64kbps test, though I'd never noticed it at higher quality levels.
  13. Re:No Control by Anonymous Coward · · Score: 5, Informative
    Here, there is no control - crappy experiment.

    The participants could have just been scoring on "this is different to the unencoded track, therefore it must be worse".

    So put a copy of the unencoded track as a test track and see if it gets marked down (and also, of course do NOT tell the participants that it is there).
    Umm.. did you bother to read about the testing methodology before coming to your grand conclusion here?

    ABC/HR.. as in ABC/Hidden Reference... as in, there is a copy of the original track included as a hidden reference on every single trial.

    The users are given 2 sliders per sample laid out on a panel. The samples are loaded in random order. On the sliders for each sample, one slider is for the original sample, and one is for the encoded. These are also randomized per sample. The user does not know which is which. If they happen to rate the original sample less than 5.0 (highest rating, meaning it should be transparent), then their results are disregarded entirely for that sample.
  14. Ogg on Quicktime by heroine · · Score: 3, Insightful

    Been using Ogg/Vorbis/Squish on Quicktime for a year. The Ogg/Vorbis/Squish codec got much better between 1.0rc2 and 1.0. At 128k it's already better than mp3 and the managed bitrate encoding is faster than the hard drive can read. The real value is of course, the ability to read these encoded files as long as there is UNIX. Mp3 is going to die and when it does there won't be any appliance makers interested in paying the $10,000 royalty to support mp3.

  15. Newer algorithms of limited use to many by dh003i · · Score: 3, Insightful

    To thsoe of us who just want to listen to music on a PC, the newest greatest best algorithms are always good (mp3pro, oggs, wma8). But for many, the goal is to put that music on a MP3Player and listen to it anywhere. I'll summarize the support of these various codecs by MP3Players, as well as mention whether or not my MP3Player (RioVolt SP100) supports them.

    MP3PRO -- little support on MP3Players. Not supported by RioVolt SP100.

    Oggs -- little/no support on MP3players. Not supported by RioVolt SP100.

    WMA8 -- little support on MP3players, though many support older WMA's. Not supported by RioVolt.

    So, in summary, all of these new formats are completely useless to me on my MP3Player. The one option they present is if I want to encode something in two formats -- one for my computer, and another for the MP3Player.

    Personally, I think more work should go into fractal endcoding, as most music has fractal patterns in it (especially Bach's music).

  16. Re:This study tells nothing by Greg+W. · · Score: 3, Informative

    I am Karma Man, hear me Whore.

    An honest double-blind listening test is extremely difficult to arrange, and there is no evidence whatsoever on such on the site.

    This is how the test was conducted.

    The test required access to a Windows machine (probably Win95 and up, didn't try with Win3.1) with a sound card. Users were required to download the ABC/HR "practice" Zip file, which includes the ABC/HR program, the Ogg Vorbis 1.0 command-line encoder and decoder, a LAME command-line encoder/decoder (I forget which version), a FLAC command-line decoder program, and a .flac sample file (the instrumental introduction to The Eagles' "New Kid in Town").

    After unzipping this, the user had to run a batch file (encdec_foobar.bat) which un-FLACced the sample file, then encoded it with Ogg Vorbis and LAME, then decoded both of the resulting files back to .wav.

    Then the user executed the ABC/HR program, which is a Win32 GUI application. After loading the sample into the application (pull-down menu and file selector dialog), the interface became a row of double-slider pairs. Below each slider was a "Play" button. Below each slider pair was a "Play Ref" button. Below that was a "Stop" button. There was a pair of sliders for each decoded sample -- so for the practice run, there were two pairs of sliders: one for file #1, and one for file #2. The user did not know which file was Ogg Vorbis, and which was LAME MP3.

    The user then listened to the Reference file by clicking any of the "Play Ref" buttons. After hearing the Reference, the user could then click any of the normal "Play" buttons. The first task was to determine, for each pair of sliders, which one was the original and which one was the encoded file. Having determined that, the user used the slider (which went from 1.0 to 5.0 in increments of 0.1) to "score" the sample on the subjective quality of the result. There were also text labels on the slider: 4.0 was "perceptible but not annoying", 3.0 was "slightly annoying", 2.0 was "annoying" and 1.0 was "very annoying".

    Finally, there was an ABX button, which launched a different window. In the ABX window, the user could select "Original", "Sample 1", or "Sample 2" for the "A" and "B" samples. Normal ABX testing proceeded from that point. (If you don't know what ABX is, go to pcabx.com.) I found that the ABX window sometimes helped me to focus on a specific sample so that I could find its flaws; armed with that knowledge, I was able to make a determination of which of the two sliders, right or left, was the encoded version.

    Once a slider was pulled down from the default 5.0 position, another button became active above that slider. Clicking on it opened a new window with a text box, into which comments could be typed. When the user was finished with the test, the slider positions, the comments, and the ABX results (if any) were written to a plain text file (DOS CR/LF format), which was to be mailed to the test administrator. (Though, of course, you weren't supposed to mail the practice results.)

    Now, that was just the practice session, which was a prerequisite for participation in the actual test. For the actual test, the process was similar, but differed in a few details.

    The actual test samples included copyrighted, patented codecs for which there are no freely distributable decoders. Therefore, the WMA, AAC and MP3Pro samples were distributed as FLAC files, and decoded by the batch file. Since MP3 did not participate in the listening test, the LAME encoder was not used during the actual test. The Vorbis encoder, of course, was used twice: first with -q 0, and then with -b 64 --managed.

    With 5 encodings per audio sample in the actual test, there were 5 pairs of active sliders instead of only 2 pairs. But otherwise, the actual test was exactly like the practice session.

    (Personal note: I did 10 of the 12 samples, skipping the two classical ones. Out of 50 encoded versions of the 10 samples, there was only one case where I couldn't tell right from left -- "The Source", encoded with MP3Pro.)

  17. Re:Sound Artifacts by Jucius+Maximus · · Score: 3, Funny
    "That is because you have crap hearing. Get a very cleanly (over) produced song - I recommend TLC - Unpretty (no, you don't have to like it. [...] If you can't hear this difference having done exactly what I've said then I suggest you are not qualified to ever post a comment to a thread discussing audio quality again."

    And if you log onto kazaa and download the mp3 to and then attempt to do this quality comaprison, you are not qualified to ever post on slashdot again. :P

  18. Test compares codecs, not formats by benwaggoner · · Score: 4, Insightful

    This is an interesting and relatively well done test (although it appears that the listeners knew which format they were listening to, so it wasn't truly double-blind, and a anti-MS and pro-Ogg bias can't be ruled out).

    However, some discussions seem to be focusing on this saying AAC is bad or WMA is bad, when really it refers to the particular implementations in codecs of those formats.

    For example, the Apple MPEG-4 AAC-LC encoder was used for AAC. This is a Low Complexity version of the format. Also, the Apple encoder has a strange limitation where it automatically converts 44.1 stereo to 32 stereo at that data rate. This isn't required by the AAC format. Other AAC encoders yield MUCH better results, and beat MP3 Pro in double-blind testing. I haven't seen any double-blind comparisons between AAC and Ogg.

    Also, the WMA8 encoder is due to be replaced by the backwards-compatible WMA9 in early September. Of course, there may well be improved versions of the other encoders by then as well.