Introduction to Linux Sound Systems and APIs

This is the problem by Otter · 2004-08-10 03:17 · Score: 3, Insightful

The fact that users require an explanation illustrates precisely what the problem is with sound on Linux. Do Windows or MacOS have analogous multiple sound servers that are somehow handled better or have those platforms standardized on a single server? I don't have the slightest idea, and as a user (or a hobbyist programmer), that's precisely how it ought to be.

--
What I'm listening to now on Pandora...

Re:This is the problem by Ianoo · 2004-08-10 03:23 · Score: 1

Well, OSS is now deprecated and has been effectively relegated to second place behind ALSA for some time now. However, there are still issues with what people put on top of ALSA, things like esd, aRts, or whatever.
Re:This is the problem by molo · 2004-08-10 04:28 · Score: 2, Insightful

I don't know about MacOS.. but Windows has the standard "multimedia" sound API and the "directSound" API. These are both of course different from the Win16 sound API.

Windows goes through the same type of API revisions.

-molo

--
Using your sig line to advertise for friends is lame.
Re:This is the problem by Otter · 2004-08-10 06:13 · Score: 2, Insightful

I know that there are different APIs you can write to, although not exactly how they compare to the Linux sound servers in terms of low-level access.
But the point is that as a user, it's completely transparent. As a user, it's absurd that I should have to know that sound servers exist, let alone have to kill artsd to hear xmms.

--
What I'm listening to now on Pandora...
Re:This is the problem by molo · 2004-08-10 06:52 · Score: 2, Insightful

Yes, I agree there. There is no reason that any application should ever start up a sound daemon that locks the device. If I wanted to run a sound daemon, I would start it myself.

-molo

--
Using your sig line to advertise for friends is lame.
Re:This is the problem by nathanh · 2004-08-10 09:53 · Score: 1

Do Windows or MacOS have analogous multiple sound servers

Yes.
Re:This is the problem by letalis · 2004-08-10 10:00 · Score: 1

I don't think the article was ment for users but rather developers since it described the different APIs, why would a user need to read about that? And as many other mentioned, yes, windows has different APIs, but exactly as under linux the user do not need to think about them as long as the applications does not require them to.
Re:This is the problem by Anonymous Coward · 2004-08-10 13:16 · Score: 0

The Windows sound APIs I know of at the moment are..

MME, DirectX, WDM, ASIO, GSIF, EASI.

The last three are pro audio, and not in common use.
Re:This is the problem by Anonymous Coward · 2004-08-10 15:49 · Score: 0

Did you have some comment about the actual article, or do you just enjoy bitching?

I run mostly Japanese-built systems by Anonymous Coward · 2004-08-10 03:18 · Score: 2, Interesting

I have never had sound work on any of these machines (NEC, Fujitsu, HPj).

I used to be a team leader back on the initial Unix (read SCO) team, and one thing that we never would have let happen was letting down the Japanese customers by not supporting their hardware.

If there is any one thing holding back Linux uptake, it is the lack of driver support for non-mainstream devices.

Re:I run mostly Japanese-built systems by Ianoo · 2004-08-10 03:21 · Score: 2, Insightful

Unfortunately, it's the chicken-and-egg problem (officially known as "Network Externalities"). Hardware manufacturers won't write drivers until lots of people use Linux, and lots of people won't use Linux until there are drivers. What's really needed is the backing of some major coperations to drive development, like say, IBM, or HP, or Nov.... oh wait...
Re:I run mostly Japanese-built systems by Anonymous Coward · 2004-08-10 05:51 · Score: 0

I used to be a team leader back on the initial Unix (read SCO) team,
I'm trying to figure out what this means.

You're kidding, right?
Re:I run mostly Japanese-built systems by Anonymous Coward · 2004-08-10 09:31 · Score: 0, Flamebait

Initial unix was at Bell Labs, doofus!
Re:I run mostly Japanese-built systems by AliasTheRoot · 2004-08-10 09:53 · Score: 1

Sound on Linux is a mess. On my system, getting sound to run under a 2.4 kernel using ALSA was trivial, but some other niceties in the 2.6 kernel didn't work, so I spent an awful lot of time under various revisions trying to get ALSA to work on 2.6.

One day it did, through seemingly random unrelated combinations of modules and kernel options. Today I grabbed an incremental release of the 2.6.7 and poof, there goes sound again!

I don't see these problems in X or the network subsystems, or disk access so why is it so damn problematic to have sound?

In windows it was detected immediately, and hardware accelerated - and this is on every version of Windows that came out after the chipsets manufacture.

On Freebsd, I did cd /usr/ports/sound/nameofcard && make && make install (i don't remember the exact directory) and bang there it was, with all my applications supporting it.
Re:I run mostly Japanese-built systems by chez69 · 2004-08-10 10:23 · Score: 1

strange, with fedora I booted up and it found the card and kudzu configured it.

that's it.

--
PHP is the solution of choice for relaying mysql errors to web users.
Re:I run mostly Japanese-built systems by AliasTheRoot · 2004-08-11 01:28 · Score: 1

lucky you. that was kindof my point - linux sound is so hit and miss...

if i could boot up and there was sound, and you booted up and there was sound and joe schmoe did the same it would be fine.
Re:I run mostly Japanese-built systems by Anonymous Coward · 2004-08-11 23:34 · Score: 0

That description matches my experience with Windows. The way I got sound to work was by getting a new PC. Of course this was not the reason for getting a new PC, but it was the thing that made me finally have sound in games running on Windows.

Re:Re-inventing the wheel. by Ianoo · 2004-08-10 03:19 · Score: 1

ALSA was released 6 years ago, and OSS was released even earlier. What's your point?

Re:Re-inventing the wheel. by EnglishTim · 2004-08-10 03:30 · Score: 3, Insightful

I think the point is that programs using sound under Windows just Pretty Much Work (TM). Sadly I can't say the same for Linux.

Re:Re-inventing the wheel. by gl4ss · 2004-08-10 03:46 · Score: 2, Insightful

yes.. work like they control wave out volume of the entire system instead of just their own & etc "just workings"(like there being reasons for why it's good for winamp that it has several audio output plugins each outputting through different apis..).

(I think beos had this covered the best.. being able to control volume per application & etc..)

--
world was created 5 seconds before this post as it is.

Simplicity itself by gmhowell · 2004-08-10 04:15 · Score: 1

Linux sound is simplicity itself. How does this article demonstrate that Linux is ready for the desktop again?

--
Jesus was all right but his disciples were thick and ordinary. -John Lennon

Moving programs from OSS to ALSA by linuxkrn · 2004-08-10 04:25 · Score: 4, Informative

Being a Gentoo user, I've been running ALSA for some time. While ALSA has an OSS compat API that you can load, it doesn't allow you to have the full control of more advance cards. (Like the EMU10k1/2 chipsets)

While oss-compat-api will give you basic sound, mixer controls, etc. sometimes you want to do more advanced things. For example, I use a tvtuner app and wanted to be able to control detailed mixer channels (Analog Capture Volume and Analog Playback Volume) that just couldn't be done with OSS. Looking at my app, tvtime, I found it only had OSS mixer controls. So I just took a weekend to learned/wrote the ALSA API version for it. Wasn't too bad and the app works great now. I can configure any control (mixer channel) on any card I want. Hopefully the dev will include the patch I sent it in the 1.0 release this month.

I know that this isn't an option for everyone. But I think as time goes on, more and more apps should have support for ALSA. Especially since it's in the 1.0.x range and the API has become more stable.

Re:Moving programs from OSS to ALSA by Omega1045 · 2004-08-10 05:40 · Score: 2, Interesting

I had a lot of problems installing a new card based on EMU10k. In the end I had to go into the Kernel and turn off ALSA and turn on OSS, and it had the "driver" right there. Compile, reboot, sound works! However, this proves yet again that while Linux is a great desktop (I use it for my home machine), it is not "ready for the desktop". This is not to throw blame at anyone. But sound should be seemless, or at least very easy to configure with a GUI tool.
I would not be against taking some developer resources away from progress on the kernel, etc, and have them work on drivers and configuration applications for sound, video, modem, network, vpn, etc.
Perhaps IBM and the other big players could fund a team that ONLY develops drivers for these standard services, and a plug-and-play type of detection module that really does work. Take every know video card, sound card, network card out there and get them to work?

--
Great ideas often receive violent opposition from mediocre minds. - Albert Einstein
Re:Moving programs from OSS to ALSA by simcop2387 · 2004-08-10 06:02 · Score: 2, Insightful

I would not be against taking some developer resources away from progress on the kernel, etc, and have them work on drivers and configuration applications for sound, video, modem, network, vpn, etc.

the problem there is that lots of people ([i think] including myself) see this as a distro problem and beyond the scope of the kernel where most of this has moved (i think the vpn thing isn't though). And as you said making applications for configuring sound, video, etc. this is usually done at the distro level, i know that suse did it pretty well but i dont think they had autodetection.

Take every know video card, sound card, network card out there and get them to work?
the problem with this is that the manufactures of those cards are typically not very coopritave(sp?) and that means that you have to have the hardware and reverse engineer the whole thing to get it working.

these systems are junk by XO · 2004-08-10 05:11 · Score: 3, Informative

The Audio subsystems are junk. Mixing should be handled intelligently by the drivers, and it should be standard unix systems used to access it. You want to play a file, you dump it to /dev/audio, you want to record something, you open /dev/mic or /dev/linein and and record it.

Additional controls should be handled by ioctls to the special devices.

The sound system in Linux is a nightmare.

--
"Champagne for my real friends - and real pain for my sham friends!" http://ericblade.postalboard.com/

Re:these systems are junk by Anonymous Coward · 2004-08-10 06:06 · Score: 0

You know, I never understood why ALSA abandoned the simple read/write/ioctl method. Nor have I really approved of it. I don't see how 16 different devices in /dev/snd and the requirement that a weird library make sense of it all is an improvement.

I thought the problem with OSS was mostly the quality of the drivers. In writing ALSA they ended up rewriting all the drivers anyway, so why couldn't they have done it right?

And you know, it really does bug me, the number of devices that ALSA puts in /dev.

What's even been going on? by bersl2 · 2004-08-10 05:33 · Score: 1

As far as Linux sound support, I must be 5 years behind the times; and yet, it seems as though nothing has been happening on that front for the past three. What have I missed (besides the obviousness of ALSA)?

Just copy Core Audio and be done with it by tigeba · 2004-08-10 05:46 · Score: 3, Interesting

Perhaps Linux developers should take a whack at emulating/copying OSX Core audio. It might provide an incentive for application developers to port their audio apps to Linux.

Re:Just copy Core Audio and be done with it by Unknown+Lamer · 2004-08-10 06:54 · Score: 4, Informative

JACK uses a callback based API much like Core Audio.

Basically every high-end (e.g. ardour, JAMin, Rosegarden, Hydrogen, etc.) uses it.

You can get really low latency using it if you have good sound hardware (e.g. RME Hammerfall for extremely low latency or even an M-Audio Delta 1010). Something like an SBLive! (what I have) will need a period size of 2048 bytes with two periods to avoid underrunning (I have a Dual AthlonMP 2800+ so I'm pretty sure it's the sound card...). Stuff like QJackCtl and Jack-Rack make controlling Jack easy.

Getting realtime mode working for a normal user can be tricky, but Debian makes it really easy. Just install the realtime-lsm package and build the realtime-lsm-source package for your kernel and all users in the audio group gain the ability to run applications realtime (at least with the default config). It could be made easier (mainly by prebuilding the realtime-lsm modules for the stock kernels) but GNU/Linux pro-audio is still mostly for hackers and adventurous people right now. Stuff like PlanetCCRMA and AGNULA are aiming to make everything work out of the box. I have yet to try either (I use Debian so PlanetCCRMA is useless for me) but it looks like DeMuDi has everything set up for recording out of the box.

--

HAL 7000, fewer features than the HAL 9000, but just as homicidal!
Re:Just copy Core Audio and be done with it by tigeba · 2004-08-10 07:51 · Score: 2, Interesting

I agree that low latency is quite important, and anything that furthers that goal is a good
thing. Even the really good native systems still arent quite up to the task of recording lots of live musicians, which is why for now I use Protools HD on OSX.

I was recommending implementing CoreAudio (or heck Direct Sound) instead of something just similar because it would decrease the level of effort for the developers of the applications (and very importantly plugin developers). It would just be a case of recompile-and-pray vs recode-recompile-pray, which might make it feasible to get some of these great high-end apps on linux.
Re:Just copy Core Audio and be done with it by Unknown+Lamer · 2004-08-10 12:37 · Score: 1

I think CoreAudio could very easily be implemented on top of Jack because the APIs are similarish (well, at least as far as being callback based and realtime capable).

CoreAudio would be more worthwhile than DirectSound because OS X apps are more Unixy than Windows apps and the OpenSTEP/Cocoa stuff for the GUI is mostly implemented by GNUStep. OS X is way closer to GNU/Linux than Windows and I'm betting it would be tons easier getting an OS X Cocoa app working on GNU/Linux than a Windows app.

I don't really see the need for proprietary sound packages on GNU/Linux; I'm a card carrying member of the Free Software Foundation though so my opinions are a bit different from most.

--

HAL 7000, fewer features than the HAL 9000, but just as homicidal!

Re:Re-inventing the wheel. by Anonymous Coward · 2004-08-10 05:55 · Score: 0

Um, the Windows sound APIs are a pretty big mess, actually.

What's happened in the last few years by 0x0d0a · 2004-08-10 07:14 · Score: 4, Informative

Oh, let's see:

* The OpenAL library came around. Does 3d audio, hardware mixing, doppler, etc, etc. Good for games.

* OSS/Free got deprecated.

* The plethora of eight million halfassed sound servers resolved down into just a few -- artsd is probably going away in favor of JACK (if the article is correct), which means that we just have the (icky) esound -- which with any luck will give way to JACK -- and JACK. Finally, applications can avoid having eight million output plugins.

* Hardware mixing in drivers became par for the course. Five years ago, everyone used OSS/Free. Today, you can play audio in xmms and *still* hear your "bong" when an error occurs without having to ram everything through a high-latency sound server.

* Wavetable MIDI is, at long last, reasonably well supported. I remember the early days with my emu10k1-based Sound Blaster Live Value and earlier cards where I had to just use FM synth because I couldn't load soundfonts to my card. Linux was behind for years here.

* Creative Labs is no longer ignoring Linux users.

* At least in theory, I can use the DSP on my emu10k1 chip to do things like adjust bass.

* There are half-decent sound applications out there. Rosegarden doesn't suck, there are synths and trackers and editors. Still not the same as a Windows or MacOS-based sound editing environment, but you can actually do sound work on Linux without coding up your own tools. :-)

I actually really like Linux as a sound-using environment. I can plonk two or three sound cards into a Linux system and (unlike Windows) all my apps let me choose what device to play out of. I can be playing music going to speakers out of Sound Card A for everyone in a room, but still be listening to what someone's saying on VoIP over my headphones connected to Sound Card B.

--
May we never see th

I don't think it's as bad as you make it out to be by 0x0d0a · 2004-08-10 07:21 · Score: 4, Interesting

Ever used a system with multiple sound cards? I have, and I'm not even an audio engineer. That approach wouldn't work very well for it.

You want to "dump a file to /dev/audio"? What format would be used? Linear or logarithmic encoding? What if the sound card does MP3 decompression onboard -- how do you get MP3 data to it? How do you detect whether to use 44.1 or 48kHz? Am I unable to set bass enhancement from the command line? What if I want to play a MIDI? What about cards that have a front and rear stereo channel -- where does what go?

I'm not saying that these are insoluable, just that there's a bit more complexity than you're making out.

How would you implement "mixing should be handled intelligently"? This is something that I've thought and bitched about for a while. The ideal would be to automatically use hardware mixing up to the maximum number of channels (two on an old card I had, 32 on my current Sound Blaster Live), then fall back to software mixing. The problem is that you have to have some buffer space to mix audio, which means adding latency. When you hit 33 channels and that last channel has to be software-mixed, what are you going to do -- suddenly bump up the latency in the audio to add a buffer into the audio output line? Right in the middle of playback?

--
May we never see th

Re:Re-inventing the wheel. by 0x0d0a · 2004-08-10 07:27 · Score: 2, Informative

On the Microsoft side, DirectX came out nearly 10 years ago...

Let's see ... that would be, what, ten years after the initial release of Windows 1.0 (we'll ignore DOS for the moment)?

And DirectX is comparable to SDL, not to ALSA/OSS. How many years after Linux was in a releasable form did SDL come out?

ALSA/OSS is a driver revamp. I do believe Microsoft underwent a pretty thorough throwing out of drivers when they ended the 9x line and moved to the NT line completely.

--
May we never see th

Re:Re-inventing the wheel. by letalis · 2004-08-10 10:05 · Score: 1

The times I have worked with more advanced audio projects on win32 platform it has actually not "pretty much worked" at all. ASIO drivers or Directsound, what driver should I use to make Cubase work good? If you have regular audio applications in mind then I must say that I have not once had a problem with a regular application (music players, soundrecorders, videoplayers etc.) - it Pretty Much Works...

This is really needed by Anonymous Coward · 2004-08-10 21:07 · Score: 0

I have coded using ALSA APIs and some of the APIs are currently really badly documented.

Easiest way to learn it is to read all (outdated, few) documentation and then look for examples and/or API library source code.

If ALSA people want to get their API used (instead of OSS emulation layer) their should bother to write better documentation for it.

Re:This is really needed by Anonymous Coward · 2004-08-11 10:20 · Score: 0

Here's what you get when you follow the link on the ALSA site to latest documentation. Pretty good, hunh?

Against ALSA by RAMMS+EIN · 2004-08-12 01:47 · Score: 1

Two problems with ALSA are that it is Linux specific, and that its drivers have consistently been worse that the OSS ones for me.

--
Please correct me if I got my facts wrong.

Nightmare indeed by Anonymous Coward · 2004-08-12 10:01 · Score: 0

Does it occur to anyone that applications have to produce sounds on terminals (such as an X terminal) rather than on host machines where they are running?

And indeed, on the machine - a terminal - where the sound needs to be played, using /dev/audio etc. would be most appropriate (if it's a Unix machine), because this is standard way of communicating with a device in Unix; on top of that there could be a large number of libraries that facilitate processing of audio data - but they should not be considered a part of Unix (Linux) proper.

Slashdot Mirror

Introduction to Linux Sound Systems and APIs

43 comments