Audio Compression Primer
Hack Jandy writes "For those of you with a little extra time this afternoon, check out Sudhian's primer to all things concerning audio compression. The article details everything from DRM to CRC matrixes (with a healthy dosage of Ogg)."
Given the topics in the audio section (it has an audio section!), the site seems to lean more towards audiophiles.
I don't agree with the dismissal of lossy algorithms either, but I think it makes sense given the context.
I use FLAC because converting from a lossy format to another lossy format can produce crappy results. If I choose a lossy format for all my audio and then I need the audio to be in some other lossy format, I might be screwed.
You might choose Ogg for your audio then sometime in the future, a new lossy format sweeps the industry. Your Ogg files might not convert well to the new format.
and besides...Disk is Cheap!
Shouldn't that be 1200 kb/s? 150 KB/s * 8 = 1200 kb/s, right? Or is the 150 KB/s figure I'm using incorrect (I could have sworn that was the 1x CD speed)?
I did a little googling and found this (http://www.teamcombooks.com/mp3handbook/13.htm):
Because the code is open source, FLAC will be around forever and available on whatever OS/Platform you want to use it on if you feel like compiling the software.
Another reason it's going to be around and much more prevalent as time goes on is that the compression is so good and the speed/resource usage figures are so attractive. When I rip CD's to FLAC I am limited to 40x by my burner (CPU utilization is around 20-25%). When I rip the same CD to ogg, I top out under 30X because the processor has reached 100% utilization.
Fast. Free. Efficient. Frugal with the CPU. What else do you need?
FM Radio is far from CD quality hence there isnt really a need to use very high bitrate MP3s or whatever
;-)
Or consider this; since FM radio has a limited range of frequencies that come across well, songs that are intended to be widely played on FM radio (e.g. Britney Spear's latest "hit" song) are actually engineered to sound best in those frequencies. With the end result that when you hear Britney Spears on the radio, the track sounds just like it does on the CD.
Meanwhile, quality music, lovingly mixed onto CD by people who actually give a damn, sounds like crap on the radio..
In other words; if you can't hear the difference between 128kbps and higher, it might just be that you're listening to mass produced music.
As for musicians preferring 128kbps? Well, sound engineers usually don't sit on stage with zillion Watt speakers right next to their fragile precious ears for a reason..
Me, I have crap taste in music AND I'm tonedeaf, so whatever, 128kbps all the way!
(MPEG artifacts in video drive me nuts, though)
SCO employee? Check out the bounty
While the article is a primer, I was a little disappointed in the algorithmic treatment given in the article itself. Right now I know of two excellent free publications: Introduction to Sound Processing and The Sounding Object, which both treat the theoretical, DSP side of things. Any other resources that Slashdot readers can recommend for those who are interested in the subject of audio compression and representation?
Titus Barik
If it's lossless, you should be able to take digital file A, compress it into compressed file B, and then if you uncompress B to get A', then A' = A.
That is, the checksums for A and A' should match, etc.
That's how I define mathematically lossless.
Whatever this asshat is on about double blind and testing and all that, has more to do with the ability of his FLAC playing equipment to sound the same as his CD player, which is a whole 'nother ball of wax altogether.
I don't need no instructions to know how to rock!!!!
Yes, I noticed the article is 3 PAGES LONG! It makes only passing reference to other codecs. Not much of a primer, and it didn't take the entire afternoon to read, it to 5 minutes.
Did I miss a crucial link or something?
https://www.accountkiller.com/removal-requested
Vorbis decoder is and has been done for a long time. Like other codecs, tweaks can always be made to the encoder to produce better results by using different psychoacoustic models, etc. As long as the output still follows spec, the decoder will still decode just fine. This is why your crappy MP3's from 1997 still play today, and fancy MP3's from today will still play on those old sound players from 1997. As long as the encoder follows spec, the decoder will always be able to decode it properly.
especially when listening to music on hi-quality speakers a la Bose
Bose is doesn't make high-quality speakers, they make expensive speakers that don't perform nearly as well as alternatives (for instance, the Acoustimass satellites use crappy paper cones that perform poorly in the upper frequencies). A $300 pair of B&W DM302's will thrash anything Bose makes soundly for sound quality. Also investigate Hale, Thiel, or Paradigm. If you really want to spend thousands, spend it on Magnepan (Magneplanar 1.6Q) or Vandersteen (2ce signature) or the higher end speakers from the companies I already mentioned. But those DM302's are good enough to be highly rated by places like Stereophile magazine and they're an incredible deal.
If you really want a bunch of little satellite speakers, Energy makes a much better sounding (and somewhat cheaper) system like that. I hear from people I trust that Tannoy makes an incredible one as well, but I haven't heard it.
rage, rage against the dying of the light
(Mod to -3, nitpicking)
The MDCT in itself is actually lossless. Any distortion you notice is most likely introduced by the quantization applied post MDCT during compression.
"There is no dark side of the moon really. Matter of fact it's all dark."
WRONG!
Nyquist's criterion is "You must have at least twice as many samples as the largest BANDWIDTH of the signal in order to correctly reconstruct it."
You can take a 10.7 MHz signal, and sample it at 10000 samples per second, and correctly reconstruct it, so long as the signal is guaranteed to be bandwidth limited to 10.7 MHz +/- 2.5 kHz. This is often done in software defined radio to aquire the signal from the intermediate frequency (IF) of the analog front end.
You also have to have an appropriate reconstruction filter at the output of the system in order to correctly recover the signal - if you don't have the right reconstruction filter, you will NOT reconstruct the signal correctly.
You also have to take into account the effects of any signal modulation - take a 20 kHz sine wave, and burst it for 10 msec, and you widen the bandwidth of the signal by about 100 Hz (depending upon the exact shape of the burst - a perfect square burst will widen the signal as a sinc function and will, in effect, increase the bandwidth to infinity, which is why square bursts are generally Considered Harmful in communications work).
Also, you don't oversample a signal in time to account for "rounding errors" - you oversample in time because the frequency response of sampling a system in time introduces a sinc response in frequency - by moving the sampling rate up you reduce the impact of this response on the recovered signal's frequency response. You also greately ease the requirements on the reconstruction filter - the filter can be wider (have fewer poles in the transfer function - thus fewer parts needed).
www.eFax.com are spammers
- Audio formats supported: AAC (16 to 320 Kbps), MP3 (32 to 320 Kbps), MP3 VBR, Audible, AIFF, Apple Lossless and WAV
- Upgradable firmware enables support for future audio formats
The second bullet leaving the possibility there, but the page lists it as currently (meaning iPod users now, popularity etc) not supporting it.For context, click Parent.
Sampling frequency would typically be 44.1KHz, bitrate would be 128kbps. Also, FM radio quality (with good reception) compares to about 96kbps well-encoded mp3, so there's not much point in them recording higher except for archival purposes.
You should be using LAME to encode, and LAME only goes up to 320kbps (blade for instance goes up to 384kbps, but is much lower quality), ergo you can only have 320kbps CBR, not VBR.
And to everybody else out there who complains about background noise, you should be extracting digitally from the CD!
flac doesn't seem to have come far enough yet for me (500+ albums is a lot of diskspace if it's around 300MB/album), but to my ears on my equipment (Klipsch £250 (pound sterling if that doesn't come out) speakers, cheapo SB Audigy2 soundcard), lame --preset standard (around 200kbps VBR) sounds damn near perceptual transparency.
You know you've been IMing too long when you almost say 'lol' out loud to a non-geeky friend...