Preemptible Linux Kernel: Interviews and Info

So will that make Linux a superior audio platform? by geekplus · 2001-10-14 06:31 · Score: 3, Interesting

The reductions in latency -- would that include the type of latency that plagues real-time audio applications like sound-on-sound recording?

Public Service Announcement from Brokaw & Torv by waldoj · 2001-10-14 06:33 · Score: 5, Funny

We're sorry, but tonight's "Linux" will not be aired. Normally you would find 2.4.12 or 2.4.13-pre2 on Sunday nights, but not this evening. Now that Linux is fully preemptible, NBC will be airing a four-hour music-and-ice-skating tribute to Bill Gates.

We apologize for any inconvenience, and for the reduced uptime. Enjoy the show.

Finally.. by Renraku · 2001-10-14 06:36 · Score: 2, Interesting

You'd think this would have been one of the first few 'features' of the Linux core. If the latency were high, it would screw programs and things that rely on low latencies to compute. Better late than never.

--
Job? I don't have time to get a job! Who will sit around and bitch about being broke and unemployed then?

Hmm by drinkypoo · 2001-10-14 06:38 · Score: 3, Interesting

I thought the Slack 2.0 release had a 1.1 kernel.

I'm wondering about this paragraph:

We had to modify the interrupt code in entry.S to prevent some situations and to allow preemption on return from an interrupt handler. However, we can't preempt within critical regions for the same reason we can't allow concurrency within them with SMP -- so we prevent preemption while holding a spinlock. The bottom half handler and scheduler were also modified to prevent preemption while they are executing.

Can anyone give a nice layman's description of what he's talking about here?

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"

Re:Hmm by selectspec · 2001-10-14 06:53 · Score: 5, Informative

The interrupt handlers can't allow premption during the context switch of an interrupt because the registers are intransit. Basically, you can't have an interrupt while your in the process of any kind of context switch otherwise you're never sure what registers you were able to flush to and from the CPU to the stack.

Critical Sections (such as access to the IP stack or I/O queues) have to be protected. With the advent of multi-processor systems under the SMP scheme, there is already considerable locking within the kernel to synchronize access of critical resources between processors. Critical regions also need to be protected from interrupt concurrent access as well.

Bottom Half handlers generaly are fast track implementations to quickly deal with the interrupts. To avoid concurrency collisions of reasources used within the bottom half handlers, interrupts (for that particular handler) must be disabled during the handler's execution.

All in all, this is basic non-preemptive stuff. What I don't understand is that this strategy that he is defining is a textbook NON-premtive approach to kernel design. I'm not too sure where he gets off claiming that the kernel is fully-preemptive here.

--
Someone you trust is one of us.
Re:Hmm by Anonymous Coward · 2001-10-14 07:09 · Score: 5, Funny

Only on slashdot would "IP stack", "I/O queues", "interrupt concurrent access" and "SMP" be considered laymans terms.
Re:Hmm by sagei · 2001-10-14 07:54 · Score: 5, Informative

I originally felt I should stay out of any discussion here, but I want to answer some of these questions and clear some of this stuff up. To be honest, it is a little embarrassing having everyone read and comment on the interview. :)

Bottom Half handlers generaly are fast track implementations to quickly deal with the interrupts. To avoid concurrency collisions of reasources used within the bottom half handlers, interrupts (for that particular handler) must be disabled during the handler's execution.

Interrupts, even just the in question, are not disabled during a bottom half, at least in general. The reason we can't preempt bottom halves is that they are guaranteed to be serialized w.r.t CPUs (ie a given BH runs on only one CPU at a time). Because of this, the BHs are designed without a regard reentrancy. So we can't preempt them.

All in all, this is basic non-preemptive stuff. What I don't understand is that this strategy that he is defining is a textbook NON-premtive approach to kernel design. I'm not too sure where he gets off claiming that the kernel is fully-preemptive here.

Hardly. Would you say an SMP system is not SMP if it is non-concurrent inside critical sections? No, you wouldn't, and this is the same situation we have here with preemption. We can't preempt inside critical regions. We have concurrency and reentrancy concerns, just like SMP does. We also can't preempt inside interrupt handlers or bh's because they aren't designed to be preempted (nor would you want to interrupt the top half of an interrupt, anyhow).

The current kernel is not preemptive _anywhere_. The only way, in fact, kernel code ever yields execution is if it explicitly does so or returns. Since with the preempt-kernel patch we can now preempt in 90% of the kernel, I think its safe to say we have a preemptible kernel now.

--

Robert Love
Re:Hmm by sagei · 2001-10-14 08:05 · Score: 4, Informative

I thought the Slack 2.0 release had a 1.1 kernel.

It could of, I just seem to remember a 1.0 kernel.

Can anyone give a nice layman's description of what he is talking about here?

Basically I am explaining the modifications to the kernel we made in order to make it preemptible. To try to put it more for the layman, besides just allowing the kernel to preempt itself as needed, we had to prevent some certain situations from being preempted. This is the same situation with SMP. We use SMP's locks to disallow preemption, for concerns of concurrency and reentrancy. We can't preempt during interrupt or BH handling because those things are not designed for concurrency, either.

To sum it up, we have to prevent preemption in some situations. Those situations are: while locks are held, while handling interrupts and bottom halves, and while inside the scheduler itself.

--

Robert Love
Re:Hmm by selectspec · 2001-10-14 09:48 · Score: 3

I agree with Robert here, and give him full kudos on his work, and appriciate his clarifications. We should all support his work with MontaVista. I apologize as my comments were hastely put together and unfairly characterized this project (which I wholeheartedly think is cool!). I would agree that the kernel with this patch is preemptive (just not "fully" preemptive however). I realize after reading my original comments that I should have choosen some better wording! These guys have done some kick ass work, and I'm sure that it was a considerable amount of work.

-Pete

--
Someone you trust is one of us.

Best line from the interview: by mike_the_kid · 2001-10-14 06:40 · Score: 4, Funny

JA: What tips and inspiration can you offer aspiring kernel hackers?

Robert Love: Read the source, play with the source, and bathe regularly.

All computer science labs should have available eye-wash style emergency showers.

--
Troll Like a Champion Today

Ok, I'm missing something by Quasar1999 · 2001-10-14 06:45 · Score: 2

Can someone fill me in... Hasn't Microsoft been claiming windows has been preemptive since win95??? Is this some other form of 'preemptiveness'?

What is this 'preemptive' thing refering to? Task scheduling?

--

---
Programming is like sex... Make one mistake and support it the rest of your life.

Re:Ok, I'm missing something by jeffy124 · 2001-10-14 07:13 · Score: 4, Informative

pre-emptive is a form of multi-threading. the other form is co-operative.

Co-operative means that threads relinquish control on their own. This meant that a greedy thread could put a serious stranglehold on the OS and lock-up the system, forcing a reboot.

Co-operative was used in every Mac prior to and including OS-9, which made it very unstable should a thread crash.

Pre-empt means the OS decides when the thread loses control. A thread can still voluntarily relinquish control, but the final call still comes down to the OS.

OS-X is fully pre-empt, meaning a crashed thread doesnt crash the entire system, bettering the stability overall as that will usually only crash the program that thread belonged to, not the entire system.

I dont know what MS has for their threading model, they seem to have a very bad hybrid system. The threading in Windows 95/98 tends to cause a good number of BSODs. NT/2000 OTOH, had a better model and crash a lot less often, which is why they have traditionally been the more stable MS OS.

Task scheduling has to do with what thread gets control next. Priority and other factors decide that. Solaris threads have 2^31 possible levels of priority, Windows (all versions, IIRC) has 5 classes and then 5 sub-classes of priority for each (a REALLY screwed up and tough to understand and explain technique, iow not a clear-cut 25 levels), and Java has 10 levels for cross-platform threading. Each model has their plusses and minuses, but that's getting offtopic from preemptive vs. co-operative.

--
The One Rule Of Chess You'll Ever Need: Don't play someone who carries a kit in their bookbag.
Re:Ok, I'm missing something by sagei · 2001-10-14 08:18 · Score: 4, Informative

Can someone fill me in... Hasn't Microsoft been claiming windows has been preemptive since win95??? Is this some other form of 'preemptiveness'?

You are thinking of forms of multitasking. One form is preemptive, in which tasks are given a specific period in which to run (timeslice) and then forcibly preempted by the next runnable task when that quanta ends. Win95, NT, all Unices, and anything decent fit in here

The other form is cooperative, in which tasks run until they yield execution. This is how Win 3.1 is. In 3.1, tasks ran until they finished processing their current Windows Message or called yield().

This article is about a preemptive kernel, where actually the same ideas apply. Inside the kernel, things are currently cooperative in the sense the kernel code runs until it completes or yields control. This patch makes it preemptive -- it will be preempted when something more important needs to happen.

Win95 does not have a preemptive kernel (it isn't even reentrant). NT might. Solaris does. Linux does with this patch.

--

Robert Love
Re:Ok, I'm missing something by joe+user+jr · 2001-10-14 08:18 · Score: 3, Informative

windows has been preemptive since win95??? Is this some other form of 'preemptiveness'?

Windows' "preemptiveness" refers, as explained somewhere else here, to the windows kernel being able to jump in and stop any user process executing to give the next one its term - so (in theory) no user-run program can hog all of the CPU and resources.

Linux has always done this - it's the standard way to write a unix kernel.

In relation to the audio discussion, preemptive in a linux kernel means (as far as I understand it) that the kernel attempts to guarantee a minimum time between an interrupt coming in on some device and control being handed to the driver for that device. It does this by preempting its own tasks in order to hand control over to the driver for the device needing the attention (the driver, of course, runs as a kernel process, also).

Typically, the goal is to get a maximum latency of 10ms or better (less) between the interrupt and the waking up of the driver.

In a professional audio situation, of course, the user can go a long way by stripping all the unnecessary hardware and tasks out of the configuration of the machine, which will mean that (if done properly) the only thing which can get in the way is linux' internal book-keeping. This is a different situation to playing with audio apps on a networked computer while you print out web pages.. ;)

Beyond this, there is real-time linux in which (as I recall) a hard maximum latency of 2ms or so is claimed. But the overheads introduced by all the timing and checking which guarantees this impact the performance to the extent that it's quite a different beast, for specialised applications.

Some audio programmers would like a low-latency patch (either the preemptive one or some other) which has a soft guarantee of "almost all" latencies below 5-10 ms to go into the standard kernel because they would like their userbase not to have to deal with the complexities of kernel recompilation and/or patching, but this is a pretty tall order because Linux will not like having basically ugly fiddly designs with lots of volatile little conditionals which have to be fiddled with everytime something changes going into the beautiful kernel.

Maybe vendors like mandrake should pick up the baton and provide a low-latency alternative kernel installable with their gui tools or at install time, which would keep everyone happy at the cost of not too much effort and space.

--
.sigs: Just Say No!
Re:Ok, I'm missing something by Yokaze · 2001-10-14 08:32 · Score: 3, Informative

Not quite correct.
It's not preemptive vs. cooperative.

But preemptible vs. non-preemptible kernel.

"Pre-empt means the OS decides when the thread loses control."
Yes, that's preemption.

B,ut there is another preemption.
Should a process get a higher priority than the currently running process, then the current process gets preempted.

E.g.
You have a low priority CPU-bound process A(e.g. Seti@home) and you have a high priority I/O-bound process B (e.g. XMMS).
Usually, B does nothing but waiting for I/O (e.g. the soundcard and the harddisk). While waiting, the process is not in the run-queue.
Meanwhile, A hogs the CPU. Usually, when the I/O request is done, the CPU gets an interrupt request (IRQ) which causes the OS to switch in kernel mode and handle the request. B gets active again and has a higher priority than A, so A gets preempted. Usually that works fine, but now A wants to do some I/O (deliver a packet) and calls the kernel, which handles the request. Just this moment is the I/O for B ready. In Linux (as in most other OSs too) B has to wait until A gets its syscall done, since the kernel is not preemptible. This period of time until the B gets the CPU increases the latency.

Windows 95 is preemptive (at least according to A. Silberschatz) as is Linux.

The high amount of crashes of the whole system stem from the resource protection (direct hardware access), not the scheduling.

--
"Between strong and weak, between rich and poor [...], it is freedom which oppresses and the law which sets free"
Re:Ok, I'm missing something by Anonymous Coward · 2001-10-14 08:46 · Score: 2, Informative

As I understand it:
NT has 32 priority levels.

The split into idle (p=0), low, below-normal, normal, above-normal, high and realtime (p>=16) (which I assume is what you were referring to) is just a simple way to name different general priority levels. It's the 32 levels that matter.

Normal priority is 14.

Anything running at 16 or above ('realtime') will never get interupted by threads running at lower priorities. The OS will never change these priorities, though the user can.

Ready to run threads of priority =14 can be given a temporary priority boost to 15 (lasts for a double timeslice which is 40ms normally) if they have been ready to run for about three seconds. Anything at lower than priority 16 shares what time is available, with higher priorities being favored. At priorities lower than 16, no thread will ever be totally starved of CPU time.

Priority 0 is for things which should only run when nothing else needs CPU time, like RC5 or SETI@home (though some such apps actually set themselves to priority 4 and hence slow most things down. folding@home used to do this).
Re:Ok, I'm missing something by nathanh · 2001-10-14 10:13 · Score: 2

Can someone fill me in... Hasn't Microsoft been claiming windows has been preemptive since win95??? Is this some other form of 'preemptiveness'?

What you're thinking of here is userspace preemptiveness. A userspace application can be preempted to make way for another process. The other process could be in userspace OR kernelspace. Linux has always been like this.
The article is describing kernelspace preemptiveness. Basically if the kernel is doing something (eg, reading a block off disk) then the current Linux kernel can't preempt that to do something else in userspace OR kernelspace.
These patches add kernelspace preemptiveness in addition to the already existing userspace preemptiveness. It makes Linux extremely suitable for low-latency applications (eg, professional audio).
Re:Ok, I'm missing something by be-fan · 2001-10-14 13:00 · Score: 2

Also, whenever a thread unblocks on I/O, it gets a priority boost so it can run again, quickly issue another I/O, and go back to sleep. The boost varies depending on the thing that it unblocked from, such as audio or input. Input tends to get large boosts, which is one reason why Windows tends to be "Snappier" than Linux.

--
A deep unwavering belief is a sure sign you're missing something...
Re:Ok, I'm missing something by spitzak · 2001-10-15 06:30 · Score: 2

You are confusing preemptive with multithreading.

I'm not sure... by TheMMaster · 2001-10-14 06:47 · Score: 2, Interesting

I think this is a good short-term solution for the latency problems but I personally wouldn't include it in the main kernel releases. I believe that it *might* be a good idea to fork the kernel releases (temorarily) in two groups: One for servers and one for workstations until the problems have been solved.
I think that (for now) using this patch on workstations is a pretty good idea. And I think that there should be a better solution for the problem witch should THEN be something along the lines of kernel 3.0
I am not a kernel developer or anything, but I am currently reading up on the source and the mailing lists.
Basically all I am trying to say is: Make it work NOW and solve the real problem later. Just make sure that is WILL be solved... (no microsoft coding ways here ;-)). We still need a larger user base...

--
Fighting for peace is like fucking for virginity

Re:I'm not sure... by STSeer · 2001-10-14 07:07 · Score: 3, Informative

Love said that this patch even if added to the main tree would still be a config option.
Re:I'm not sure... by debrain · 2001-10-14 08:13 · Score: 3, Interesting

Actually, given the current state of the vm parameters set almost exclusively for a workstation (since bdflush chokes a server real good), would seem to dictate that you have to tinker with the kernel anyway and that forking the kernel itself wouldn't necessarily help since the number of forks for each configuration of properly scalable high intensity server would be enormous. It works good for a workstation, and perhaps preemption should be default on a workstation (I use Love's patch on mine), but splitting the kernel between workstations and servers is probably not the best way to go about making servers customized to their personal best performance level since the configuration is quite sticky anyway.
Re:I'm not sure... by sagei · 2001-10-14 09:09 · Score: 5, Informative

Disclaimer: It's my patch

I think this is a good short-term solution for the latency problems but I personally wouldn't include it in the main kernel releases. I believe that it *might* be a good idea to fork the kernel releases (temorarily) in two groups: One for servers and one for workstations until the problems have been solved.

I tend to look at this more of a long-term solution, and I think people who see it has a short-term solution or hack are missing the point. First, this is a feature. We aren't kludging kernel code so that we can lower latency by stopping it when needed. We are effectively using the SMP code to multitask better within the kernel.

Second, forking the kernel over this is a terrible idea. Since it is a config setting, this is a non-issue anyhow, but I really don't want to see this thing forked off. In fact, I think the ideal situation is where we can get a preemptible kernel that benefits throughput so that server processes benefit from it as well.

I think that (for now) using this patch on workstations is a pretty good idea

Agreed :)

And I think there should be a better solution for the problem witch should THEN be something along the lines of kernel 3.0

There isn't a better solution that is not a hack. There is a reason Solaris, NT, and all RTOS are preemptible inside the kernel: it is the only way to achieve real-time response. You just _have_ to be able to respond to events when needed.

The "better" solutions in this case are "simpler" -- if we can hack some conditional schedules into places, perhaps simplify some algorithms, etc. then we can perhaps reduce latency without preemption. This is what Andrew Morton's low-latency patches do. But we need more. The point is not that preempt-kernel is a hack, but that it is a whole new high-tech feature, and some people want to find a simpler solution.

Personally, I don't think a simpler solution exists, and I believe the preemptive kernel satisfies other problems (and it also a neat feature:>). Thus I work on it.

--

Robert Love
Re:I'm not sure... by The+Pim · 2001-10-14 11:45 · Score: 2

There is a reason Solaris, NT, and all RTOS are preemptible inside the kernel: it is the only way to achieve real-time response.
I thought that what (certain) kernel hackers really objected to is preemption while locks are held. The complications (eg priority inversion) they talked about seem only to arise in that case.
So, first, does "fully-preemtive" traditionally mean with or without locks? Are Solaris, NT, and RTOS preemtible when locks are held?
Second, observed results aside, what reason do you have to believe that preempting the lock-less parts of the kernel is "good enough". All else equal, one would expect the latency distribution to be similar with and without locks, so you would expect plenty of "worst cases" to occur with locks. Of course, there is already a pressure to reduce the time that critical locks are held, but I wouldn't be surprised to see non-contended locks (especially outside the kernel core) held for long times. So is there a good reason that the important "worst cases" are happen without locks?
IANAKH.

--

The evaluation of an action as 'practical' . . . depends on what it is that one wishes to practice.
Re:I'm not sure... by sagei · 2001-10-14 12:28 · Score: 5, Informative

I thought that what (certain) kernel hackers really objected to is preemption while locks are held. The complications (eg priority inversion) they talked about seem only to arise in that case.

There are a few reasons other hackers complain, although I didn't know this was one of them. Since MontaVista's original preemptive kernel work, I believe, we have never preempted inside of locks. Note that you can, but then you reach the issues with deadlocks and thus the need for priority-inversion that you spoke of.

So, first, does "fully-preemtive" traditionally mean with or without locks? Are Solaris, NT, and RTOS preemtible when locks are held?

I would say it means sans locks. None of the mentioned OS's are preemptive while holding a lock. You always have to respect the lock. Now, you can preempt during the lock and go do other things. If you do this, you are assuming the lock is going to be held long (or else it is favorable to just spin for a cycle or two). In this situation you want to use semaphores, which we _do_ preempt during.

When a process hits a semaphore that is in use, it goes to sleep and something else continues. The process awakes when the resource is available. Now we reach the problem you wrote of above: priority inversion. What if task A holds resource Y and sleeps waiting for resource X and task B holds resource X and sleeps waiting for resource Y? You deadlock.

Thus we need to use a type of semaphore called a priority-inheriting mutex, which inverts the priority of the task holding a resource so it will always complete and release the lock. I know Solaris has these. However, I would consider any kernel that can preempt itself in general a preemptible kernel.

Second, observed results aside, what reason do you have to believe that preempting the lock-less parts of the kernel is "good enough". All else equal, one would expect the latency distribution to be similar with and without locks, so you would expect plenty of "worst cases" to occur with locks. Of course, there is already a pressure to reduce the time that critical locks are held, but I wouldn't be surprised to see non-contended locks (especially outside the kernel core) held for long times. So is there a good reason that the important "worst cases" are happen without locks?

First, before I cast results aside, let me mention that observations show we are already lowering latency a great amount. But, you are right, periods in which locks are held are a problem. This is why I mentioned in the interview the use of things like Andrew Morton's low-latency patch, the preempt-stats patch (for finding the locks), etc.

Some of the problems still occur while locks are held, but thankfully the point of a spinlock is that they are held for a VERY short time. A solution to this may be to replace the spinlocks held for a long time with a priority-inhereting mutex.

--

Robert Love
Re:I'm not sure... by be-fan · 2001-10-14 12:51 · Score: 3, Informative

So, first, does "fully-preemtive" traditionally mean with or without locks? Are Solaris, NT, and RTOS preemtible when locks are held?
>>>>>
I don't know about those, but BeOS isn't preemptible during a spinlock either. BeOS requires you to disable local interrupts before acquiring a spinlock, which means that the scheduler never even gets to run on that CPU because it won't take the timer interrupt. I'd surmise that almost all preemptible kernels work like this. Judging from this doc it would appear QNX does it this way as well. This method shouldn't effect latency, because you are only supposed to hold a spinlock for a very short time.

--
A deep unwavering belief is a sure sign you're missing something...

needed badly by xah · 2001-10-14 06:49 · Score: 4, Insightful

A fully preemptible kernel is important to the future of Linux. Everyone knows that the system will lock up a lot if it's misconfigured, or if a piece of hardware is buggy.

So long as the console driver and the keyboard driver are alive, root should always be able to open a new shell and kill an offending process that is hanging the rest of the system. Right now, this is too frequently a non-option.

--
I am not a lawyer. Do not take my words as legal advice. If you need legal advice, consult an attorney.

Re:needed badly by be-fan · 2001-10-14 09:58 · Score: 2

Actually, there's some truth to his point. Say a process makes a system call and the kernel code for that call hangs in a loop. Since the scheduler won't preempt the kernel code, that process will run forever and the machine will hang. If the kernel can be preempted, the user can get to a shell and kill the stuck process. I have no idea how often this situation would happen in the real world, though. I'd think that infinite loops would be too much of a newbie bug.

--
A deep unwavering belief is a sure sign you're missing something...
Re:needed badly by chabotc · 2001-10-14 10:57 · Score: 5, Insightful

Actualy 9 out of 10 cases when that happens, and the hardware is locked up, it will have locked up the PCI bridge as well (they have to to communicatie), so this wont do anything.

Also if the systeem feels locked up, and its not a hardware lock, there's a good chance its the tty/console subsystem thats killed.

only in a few cases, where a run-away process would deal out so much of a beating to the system, then the better multithreading will help in the way you described.

(ps, telnetting in is always a good work around for a system with a dead keyboard/console :P)
Re:needed badly by Dwonis · 2001-10-14 11:30 · Score: 2

Check your RAM. If it's not a RAM bug, it's probably an X server bug.
Re:needed badly by Elladan · 2001-10-14 12:07 · Score: 2, Interesting

There's a bug in recent 2.4 kernels where a multithreaded app dumping core could livelock the system. You might try setting a hard limit on core file size to zero and see if the crashes go away.

You'd want to do this in /etc/profile, of course.
Re:needed badly by Peter+La+Casse · 2001-10-14 13:27 · Score: 2, Interesting

Actually, there's some truth to his point. Say a process makes a system call and the kernel code for that call hangs in a loop. Since the scheduler won't preempt the kernel code, that process will run forever and the machine will hang. If the kernel can be preempted, the user can get to a shell and kill the stuck process. I have no idea how often this situation would happen in the real world, though.
Will the linux kernel allow a user process to be killed that is blocked in a kernel call? In my experience, Solaris and Tru64 do not: a user program that is blocked in a kernel call will stay blocked until the kernel call returns, regardless of any action (short of rebooting the machine) that a user can take. I assume that there is some well-thought-out reasoning behind this, but sometimes (e.g. during device driver development) I wish it were somehow a configurable behavior.
I'd think that infinite loops would be too much of a newbie bug.
Lots of times, the most junior person gets stuck writing device drivers. And even experienced programmers can have brain farts.
Re:needed badly by Pseudonym · 2001-10-14 15:44 · Score: 2

Xah is correct except for one detail. As far as the scheduler is concerned, pre-emptable threads running inside the kernel should be pretty much the same as pre-emptable user-space threads in a microkernel system. They should be able to be killed and/or restarted if they've hung.

The one mistake I think xah made was using the term "process". Linux's current design encourages the confusion between threads and processes by implementing threads as processes that happen to share "process stuff" (address space, file handles, credentials, rlimits etc).

--
sub f{($f)=@_;print"$f(q{$f});";}f(q{sub f{($f)=@_;print"$f(q{$f});";}f});

What does that mean? by BlackGriffen · 2001-10-14 06:50 · Score: 2, Interesting

I thought giving the Kernel the ability to preemt other programs was important. If you give programs the ability to preempt the kernel, doesn't that just change the system back to cooperative multi-tasking? I could just see programmers abusing the ability to preempt the kernel to squeeze a little more speed out of their app.

Re:What does that mean? by naasking · 2001-10-14 07:05 · Score: 2, Informative

No, I think you're misunderstanding. It's not preempting the kernel, it's preemtping a lower-priority thread that happens to be in the kernel (ie. during a system call). If there is a runnable thread with a higher priority, it should be set running. But as things currently stand, if the low-priority thread is in the kernel it can't be preempted, and so the high priority thread has to wait. That is bad.

--
Higher Logics: where programming meets science.
Re:What does that mean? by Adnans · 2001-10-14 07:05 · Score: 2

I thought giving the Kernel the ability to preemt other programs was important. If you give programs the ability to preempt the kernel, doesn't that just change the system back to cooperative multi-tasking?

Nope, because the kernel is still always in control. In a cooperative multi-tasking enviroment the userspace programs can choose to hold on to the processor as long as they like (i.e. not cooperate nicely with others). This patch simply allows a lower priority process to be interrupted by a higher priority one even if the low priority one is in the kernel, doing a system call for example. However, this preemption is done by the kernel scheduler.

-adnans

--
"In short: just say NO TO DRUGS, and maybe you won't end up like the Hurd people." --Linus Torvalds

Background and a different patch by alewando · 2001-10-14 06:55 · Score: 5, Informative

If you're wondering what the heck a preemptive kernel entails, then here's some background.

Also, if you don't like Robert Love's implementation, then Andrew Morton maintains a patch with a similar low-latency goal.

Links on Spinlocks, etc by Alien54 · 2001-10-14 06:59 · Score: 5, Informative

There are these links:

Linux Devices Article detailed, very nice, on this issue from Sept 6
Kernel Hacking How-to Page on this again, very detailed.
The Kernel SpinLock metering FAQ

All around useful stuff, enough to get you started destro^H^H^H^H^H^H hacking your own kernel

--
"It is a greater offense to steal men's labor, than their clothes"

Re:So will that make Linux a superior audio platfo by Xoro · 2001-10-14 07:05 · Score: 5, Informative

I don't want to sound like I'm contradicting you, but did you happen to read this link from the article? It's specifically about realtime audio. Key paragraph:

*EXCITING* NEWS: things getting almost perfect ! Ingo's lowlatency-2.2.10-N6 patch with the shm.c part backed out and a modification of filemap.c (thanks to Roger Larsson) performs _REALLY_ well, using my usual latencytest parameters (4.3ms buffer), I got NO DROP-OUTS anymore, with sporadic maximum peaks of ONLY 2.9ms This is really exciting because it opens the doors to a whole new class of Realtime applications for Linux, simply using userspace processes scheduled SCHED_FIFO. I heard of comparable low-latencies only from BEOS, Windows can't simply guarantee these kind of latencies, not even using DirectX. Using a soft-synth on Win98 on my BOX I must use 15-20ms audio buffers to get _SOMEWHAT_ reliable audio. This is actually about more than 3-4times the buffer I used for testing under Linux ( 4.35ms).

I don't know much about the field, but the page seems to speak to several of the audio-related concerns mentioned above.

--
Kill, Tux, kill!

Re:Hmm...laymans terms by A_Non_Moose · 2001-10-14 07:12 · Score: 2

The left hand (processor 0) needs to know what the right hand (processor 1) is doing.

Reverse if necessary.

Ambidexterous people...just HUSH! :)

Heh, how about that, computers do have a real life (tm) frame of referrence.
Moose.

--
Have you read the moderator guidelines? Well, have you, PUNK? (and I want a Karma: Gnarly option)

Re:OS-X by JanneM · 2001-10-14 07:29 · Score: 2, Informative

Linux is fully preemptible, and has always been. This is about being preemptible while executing in the kernel. I have no idea if OSX allows this or not - it's BSD based, so probably no, but then Mach is involved someway or other, so maybe. It would be interesting to know.

/Janne

--
Trust the Computer. The Computer is your friend.

Re:So will that make Linux a superior audio platfo by Lando · 2001-10-14 07:41 · Score: 3, Informative

Ummm,
Sorry, just want to note that mutex and semaphore programming is not all that difficult if you do it much. True windows have a few kinks, but the concept is pretty basic. Basically I would have to disagree that mutex and thread programming makes programming hard. It's just programming once you understand it, it's pretty straight forward.

As for the windows problem use startthreadx instead of startthread (Yeah probably not the real api functions, but close enough haven't worked on windows for a while.)

Lando

--
/* TODO: Spawn child process, interest child in technology, have child write a new sig */

yes, but why? by markhahn · 2001-10-14 08:35 · Score: 3, Informative

it's all very well to say that you want to trade 5% of normal performance for a 200% improvement in latency. but why does anyone need better latency? afaikt, the latency here is strictly for people who want to do RT audio effects. this has nothing to do with audio playback, which has no latency sensitivity (because of buffering). this also has nothing to do with "feel", since humans are terribly slow, and cannot possibly feel the difference between 5 and 10ms.

I hope that Linus will look at whether these patches hurt the normal case. "normal" means things like kernel compilation, not just an arbitrary latency measure and dbench (one of the least realistic benchmarks possible!)

there are good reasons to be skeptical of all-out premptiveness: it will unavoidably lower throughput in easy-to-define cases. any intro OS text will talk about optimal scheduling, where 'optimal' requires a definition of throughput or some other metric. preemptive kernels will context switch more, and will probably interfere with the natural 'batching' that happens when a big job runs for a while. think about caches: you never want to switch unless you must. this is not an argument against low-latency! it's an arguement against lowest latency as an absolute; we need to set a target (5ms would be fine imo) and meet it. going beyond such a goal will hurt the normal case.

Re:yes, but why? by Spy+Hunter · 2001-10-14 09:06 · Score: 4, Insightful

this has nothing to do with audio playback, which has no latency sensitivity (because of buffering)
Unless you're writing a game, where sounds have to happen at specific times synchronized with events on-screen. Or you're in KDE and you want a "minimize" sound effect to happen when you press the button, not a second afterward. Or you're writing a media player and you want to have an EQ that responds immediately rather than a second from now, making it a tedious chore to adjust the settings.
Large latency is very noticable in these situations. While it may sound like pointless whining to complain about the "minimize" sound effect being a second late, it really creates a bad perception in the user's mind about the speed of KDE. These things are actually important.

--
main(c,r){for(r=32;r;) printf(++c>31?c=!r--,"\n":c<r?" ":~c&r?" `":" #");}
Re:yes, but why? by sagei · 2001-10-14 09:39 · Score: 5, Informative

Disclaimer: It is my patch

but why does anyone need better latency? afaikt, the latency here is strictly for people who want to do RT audio effects. this has nothing to do with audio playback , which has no latency sensitivity (because of buffering). this also has nothing to do with "feel", since humans are terribly slow, and cannot possibly feel the difference between 5 and 10ms.

You ever have an mp3 skip? Audio become out of sync in a game? That is caused by scheduling latencies becoming greater than the duration of the audio buffer. Ie, audio playback does not just need x units of CPU but it also needs it every y units of time. The preempt-kernel patch helps alleviate this.

I hope that Linus will look at whether these patches hurt the normal case. "normal" means things like kernel compilation, not just an arbitrary latency measure and dbench (one of the least realistic benchmarks possible!)

Not only does preempt not hurt a kernel compile, but it helps it. I and many users have benchmarks. One of my requests from users is to get a lot of benchmarks and "feelings" so I can substantiate the patch. I am _not_ an audio guy. I use my Linux machine to code, go on the net, etc. just like 90% of the people here. Preemption helps me. I don't want to hurt the common case either.

Even so, it is a configure item. Merging it into the kernel does not equate to you having to use it. But I bet you would want to!

there are good reasons to be skeptical of all-out premptiveness: it will unavoidably lower throughput in easy-to-define cases. any intro OS text will talk about optimal scheduling, where 'optimal' requires a definition of throughput or some other metric.

The cases in which we lower throughput are cases in which file I/O is favored since it runs until completition. In this case, you can extend that argument to be that I/O-intense tasks should just be cooperatively scheduled. An I/O task won't be preempted unless its timeslice has run out (ie, it should be preempted, and it would be if it were in userspace). If the I/O is so critical, run it at a higher priority. Hell, maybe we should look into a higher timeslice.

Note that a lot of this is a non-issue, since we don't affect throughput (or actually improve it!) In the cases throughput is decreased, it is just a couple of percent, which could be cost-benefited to the increase in response some other application gets.

we need to set a target (5ms would be fine imo) and meet it. going beyond such a goal will hurt the normal case.

This is very very true, and an insightful point. One of the problems with this whole latency quest is that eventually we are going to reach some point and have to decide if enough-is-enough. We can always keep doing more and eventually the work _is_ going to be detrimental to the common-case. I agree we need to set a threshold and celebrate when we reach it. The super-special situations needing much lower latency can apply super-special solutions.

--

Robert Love
Re:yes, but why? by Gordy · 2001-10-14 12:42 · Score: 2, Funny

Robert,

Admit it, your just a karma hore. 8)

-Gordy

One day, we will find it, the rainbow connection..
Re:yes, but why? by be-fan · 2001-10-14 12:57 · Score: 2

it really creates a bad perception in the user's mind about the speed of KDE
>>>>>>>>
No, I believe it is the ass-slow redraw and glacial startup times that do that.

PS> I use KDE myself, so don't accuse me of being a GNOME-bigot!

--
A deep unwavering belief is a sure sign you're missing something...

Options... by Mike+McTernan · 2001-10-14 08:43 · Score: 2, Informative

Whether this patch is added or not is surely just a question of whether it is stable enough or not.

As it says in the interview, the enablement of the patch is an option in the config... For those that want it (i.e. most desktop users I would expect) it's there. For those that don't, it can be disabled.

It seems that the patch works, as scientifically explored by his benchmarks. If there is a fault in the patch, I'm sure that half of slashdot will email the chap.

In summary, it works, is probably stable and can be enabled/disabled in config if needed. It already does, and probably can, benefit lots of people.

Put it in!
(At worst it can be removed and a new kernel released the day after... hehe)

--
-- Mike

Linux Devices Article by Alien54 · 2001-10-14 08:45 · Score: 4, Informative

The Linux devices article link should be:

http://www.linuxdevices.com/articles/AT4185744181. html

Goofed that up.

Nice discussion, from Sept 6, with related links

[sigh]

--
"It is a greater offense to steal men's labor, than their clothes"

Re:Windows 2000/NT by RelliK · 2001-10-14 08:59 · Score: 4, Interesting

Really? That's the first time I hear about that.

There is a difference between pre-emptive multitasking and pre-emptible kernel.
Pre-emptive multitasking means that the kernel can interrupt any thread and give control to another thread, so that a thread cannot hog the CPU resources. This is what all modern operating systems do, except Windows 3.1/9x/Me and MacOS (pre- X), though it could be argued that Windows 3.1/9x/Me is not an operating system much less a modern one ;-)

Pre-emptible kernel is a different beast. It means that the kernel can interrupt itself (i.e. a thread running in the kernel mode) and give control to another thread running in the kernel mode. This is used in real-time operating systems where you need to have a guaranteed maximum response time (i.e. a thread must not wait longer than a certain amount of time before it gets control). However, this is not all that useful for general-purpose OSes and may even be detrimental to servers, where throughput matter more than response time. So it's good to know that this will be a compile-time option.

--
___
If you think big enough, you'll never have to do it.

Re:Preempt Patches in Kernel by GGardner · 2001-10-14 09:42 · Score: 4, Interesting

Linus himself said, that they should rather fix the CAUSE of those latencies instead of the symptoms.
Isn't that like saying that you'd rather fix all buggy applications instead of providing a protected memory environment?

Re:So will that make Linux a superior audio platfo by be-fan · 2001-10-14 09:50 · Score: 2

BeOS programming really isn't that hard. You just have to get used to the idea of locking every single thing that could possibly be shared.

--
A deep unwavering belief is a sure sign you're missing something...

Re:So will that make Linux a superior audio platfo by sllort · 2001-10-14 10:28 · Score: 3, Troll

The extremely high-resolution threading of the operating system made even the simplest programming tasks near impossible, as mutex locks and thread conditionals needed to be spread throughout the code to ensure proper execution.

Right on! I ran BeOs under VmWare to try developing for it, and the pthreads compatibility was... well let's just be polite and say "extremely non-optimal". The spin locks in the kernel were so tightly placed that any possible race condition you could think of would occur if you didn't mutex lock the hell out of it, and the littany of devices you had to lock to access memory was just unbelievable. I pretty much had to read through the video driver code to get anything done as the documentation got as far as "Hello World" before wishing you luck.

Anyway, DeMuDi looks to be a step in the right direction - maybe if a Linux distro starts shipping with 2 kernels, a standard kernel and a multi-media enhanced kernel, we'll finally have a workable solution.

Re:So will that make Linux a superior audio platfo by Error27 · 2001-10-14 11:15 · Score: 2

"Professional audio processing requires an extremely special form of real-time processing that is pretty much only good for handling audio, and which actually can cause problems with any other types of software."

Special in what way? I'm not really familiar with audio software but I have a hard time picturing what you mean by "special" real-time.

You say that in Windows software handles it's own scheduling and bypasses the kernel. What exactly does that buy you that you couldn't get more elegantly in Linux by creating a kernel patch (A premptable kernel patch for example). The windows way strikes me as not very stable, flexible or good.

Linux let's your program hog the cpu already by setting nice levels. With preemption even if it gives the cpu to a different process it can take it back right away.

The Linux way seems better exactly because it's not a special purpose hack. Why is hogging the cpu for audio processing any different than hogging the cpu for video processing?

Re:So will that make Linux a superior audio platfo by be-fan · 2001-10-14 12:43 · Score: 2

True, also, you can always chicken out and use big global locks if you need something quick.

--
A deep unwavering belief is a sure sign you're missing something...

Just need pcmcia-cs to be updated! by JahToasted · 2001-10-14 12:57 · Score: 2, Informative

Tried this patch before... it works great adds a nice option in the kernel config. But the problem is pcmcia-cs doesn't work with it. Says in the changelog it will be fixed on the next release of pcmcia-cs but I want it now!
It does work nicely... everything is a lot more responsive.
Great work!

A few problems I've noted by bruns · 2001-10-14 13:09 · Score: 3, Informative

After messing with it on several machines, here is what I have found.

* it doesn't work well on a shell server or anything which might have alot of disk activity. The changes seem to do everything at the expense of disk IO and network IO. I do see better speed on interactive stuff though. Its not worth the hit in IO.

* there is no option to turn it off while in operation. Means you have to run different kernels if you want to do some things with the preempt, and other stuff without.

--
Brielle

Re:A few problems I've noted by Lethyos · 2001-10-14 16:55 · Score: 2

* there is no option to turn it off while in operation. Means you have to run different kernels if you want to do some things with the preempt, and other stuff without.

This is something I haven't seen brought up yet. Windows 2000 can change favor from background to foreground processes on the fly (right click 'My Computer' and check out the properties). Now while, this is not the same thing, it's in the ballpark to most users who understand it as something that speeds up their apps. We really need a /proc switch that lets you turn this thing off and on.

--
Why bother.
Re:A few problems I've noted by Lethyos · 2001-10-15 02:48 · Score: 2

My original post: Now while, this is not the same thing, it's in the ballpark to most users who understand it as something that speeds up their apps.

Your post: That doesn't change NT's pre-emption behavior dude.

Actually, you are incorrect, and you back me up with your own words:

NT automatically boosts the priority of the process that owns the topmost window, to make the UI Snappier... that option just turns this on or off. It's not the same thing, but it's very similar.

So basically, you're turning preemption on and off. That's not a priority change - apps keep their priority mode (one of the top level 5 that is). With NT, the active process, top most window, preempts others that are running. That's how they make it seem like it's running faster. For Joe S. Admin, switching to the other setting, causes local procs to not preempt those that are running server software, thus increasing throughput.

Of course, UI speed in NT is a totally different matter from under *nix. The Windows GUI is faster because it's not networked, like X. There's no abstraction layer between the producer (server), and consumer (client).

--
Why bother.
Re:A few problems I've noted by spitzak · 2001-10-15 06:46 · Score: 2

The X interface is not slow due to the network abstraction, it is slow because of incredibly bad design of the Xlib calls themselves, requiring a huge amount of synchronous calls to the server. The network abstraction could actually speed things up enormously by providing a convienent way to batch tens of thousands of calls into a single context switch, if it were not for the enormous number of calls that return a value from the server and thus require synchronization.

As to the original subject, I think there is some confusion on both sides. That NT provides a user-friendly method of controlling things is a point in it's favor over Linux where the designers insist that nothing be easy for the end user. However it is true that the NT switch is equivalent to the Unix "nice" command that existed decades before anybody dreamed of pre-emptive kernels.

Personally I would like to see a system were there were as few controls as possible. Any kind of adjustments like this indicate to me that the designers really don't know how to get performance as good as possible. If games require different scheduling than seti@home, I would like to see a kernel that recongnizes the behavior of the programs and adjusts. This would be way better than requiring the user to do this, no matter how user-friendly the button is!

Re:Preempt Patches in Kernel by richie123 · 2001-10-14 14:07 · Score: 2, Insightful

No, I thinks its more like Linux is saying he would rather make Linux faster, than make it feel faster.

Re:So will that make Linux a superior audio platfo by scrytch · 2001-10-14 14:13 · Score: 2

This is why BeOS ultimately flopped: it was too hard to program for.

True, but in a very different way. The lack of decent developer support, for a platform running on hardware most people use windows on, aimed at the market that people buy macs for, compatible with less actual hardware than either, with no software from vendors anyone had ever heard of, is why it flopped.

I liked BeOS too. I ultimately wiped it off my system because I just didn't have a use for it.

--
I've finally had it: until slashdot gets article moderation, I am not coming back.

Re: PCMCIA-CS works here ... by JahToasted · 2001-10-14 15:00 · Score: 2, Informative

yeah the one I tried was with 2.4.10 didn't work. my card's driver isn't included with the kernel so I have to compile the pcmcia driver that comes with the pcmcia-cs package (why IS there 2 different drivers?). I checked the changelog on the pre-emptive kernel page and it mentions that they are pressing the maintainers to fix the code for pcmcia-cs for the next release which I hope is soon.

Re:So will that make Linux a superior audio platfo by WNight · 2001-10-14 15:16 · Score: 3, Informative

Anywhere that BeOS highlighted your race condition by causing unwanted behaviour is somewhere that you'd get "random" crash bugs from in another OS if you didn't fix the code.

Other OSes don't guarantee much about how long your timeslice is, or how often you'll get time, it's sort of haphazard. That randomness means that while those race conditions don't manifest as much, they're still there to bite you.
Think of it like memory leaks and dangling pointers. Ninety-nine times you can use an element of a linked list after delinking it, one time it will have already been written over. But you don't want to somehow make the bug come up one in a thousand times... you want it to come up EVERY time so that you fix the problem before release.

It might be a bit of a pain to put locks around everything, but after a while it becomes quick and natural and you still have the power of a fast kernel with a very small timeslice for when you need it.

Re:So will that make Linux a superior audio platfo by WNight · 2001-10-14 15:22 · Score: 2

Those locks you're leaving out of your non-BeOS code might make it easier to code, but they mean it'll crash at random years later when some odd combination of load variables causes your program to be yanked out of a critical area before completion and lets the next thread enter too soon. But you'll have run it for months, seen no problems, and shipped it.

You *need* to lock everything that might ever be an issue, even if it's the tiniest operation. That's why there's a "Test & Set" operation in ASM. It might be the tiniest thing, but you need to guarantee it's atomic.

I wish more OSes were hard in the way you describe - if race conditions were more easily shaken out they wouldn't plague "release caliber" software.

And as to BeOS being hard to program for... What?!? It might have enforced better style which could be a pain at first, but it was (is still, I guess) a wonderful OS for programmers.

The truth no one wants to admit by Ukab+the+Great · 2001-10-14 15:23 · Score: 2

The desktop end-user *is* a real-time application

Re:OS-X by jonabbey · 2001-10-14 15:53 · Score: 2

I can't speak to whether or not OS X is kernel preemptible either, but I assume when Apple talks of "fully preemptible" they are just drawing a contrast with the cooperative multitasking that MacOS has had since day one.

--
- jon

Ganymede, a GPL'ed metadirectory for UNIX

Re:So will that make Linux a superior audio platfo by Webmonger · 2001-10-14 16:16 · Score: 2

The pre-emptible kernel patches radically rework the way the kernel functions. So people are being cautious.

Linus is suspicious by steveha · 2001-10-14 16:39 · Score: 3, Informative

In his recent interview on osnews.com, Linus said he was in no hurry to include the kernel preemption patches in the official kernel source. He said:

Some people have been playing with using the [SMP] locks on UP too, creating a fully preemptible kernel. A lot of people are playing around with the patches, and we'll see when/if I'll integrate them into the standard tree. It's not a high priority for me: they don't add performance (like the SMP scalability does), and if they improve latency noticeably I'd really rather look at why the latency is bad in the first place.

So right now as far as I'm concerned it's one of those "cool features" things, and it will need some prodding from the real world to show whether it is worth it.

I was surprised he said this. This isn't a big scary kludge that inserts a bunch of hacks all over the place in the kernel; this is a relatively small patch that simply leverages all the SMP work. It won't make the kernel uglier or harder to maintain, so IMHO it is very worth adding.

I am confident that Linus will get that prodding from the real world he is waiting for, because my own experiences with this patch are overwhelmingly positive. I'm using kernel 2.4.10 with the preemption patch on my desktop Linux boxes, and I love the snappy feel it gives my system. Playing back MP3 music never skips now, and my K6-III/450 system pops up web pages in Galeon so fast it feels like an Athlon system.

Kudos to Robert Love and anyone else who worked on this patch.

steveha

--
lf(1): it's like ls(1) but sorts filenames by extension, tersely

Re:Linus is suspicious by spitzak · 2001-10-15 06:54 · Score: 2

Linus: and if they improve latency noticeably I'd really rather look at why the latency is bad
in the first place.

I don't really agree with this. I would imagine that there are many things in the kernel that could be written much cleaner, smaller, and faster if the writer did not have to worry about latency. Since making the kernel pre-emptable would allow this I could see things actually improving as a result.

PS: I don't know anything about kernel design, so ignore my comment if necessary...

Re:Windows 2000/NT by EnglishTim · 2001-10-14 21:02 · Score: 2

Win 9x/ME *does* have pre-emptive multitasking. However, it doesn't have very effective memory protection which is why it's rather prone to being crashed by errant programs. I believe this was to ensure compatibility with 16bit apps left over from Win 3.1

Re:So will that make Linux a superior audio platfo by bfree · 2001-10-14 21:22 · Score: 2

And are you a Linux user? If so PLEASE go and get demudi up and running on that box and see what latencies you get there for comparable tasks and then come back and tell us! Demudi is spawned from the Hammerfall's ability to offer (Semi-)Professional audio capabilities to Linux. I have a Hammerfall about 5 metres from me at the moment, but it's not mine and I would be beaten severly if I touched it :-(

--

Never underestimate the dark side of the Source

A potential problem.. by Junta · 2001-10-15 00:51 · Score: 2

I don't know how this might manifest itself, but I could see how some existing, highly tuned programs could have problems with such a patch. If software is developed with the current mainstream kernel in mind, it may make certain internal assumptions about not being preempted during certain operations, and some timing getting messed up because of that. One poster mentioned a potential VMware problem, which could be an effect of something like this.
I would like to see this patch in there, but I could see some reasons to be hesitant about putting it in now. I would love to see latency on the level of QNX, that seems very responsev..

--
XML is like violence. If it doesn't solve the problem, use more.

Re:When will the Linux HZ be bumped up to 1000? by ivan256 · 2001-10-15 11:46 · Score: 2

HZ is already 1024 on architectures such as Alpha and IA64. You have the source. Change it before you build; it's only a 2 byte change.

Slashdot Mirror

Preemptible Linux Kernel: Interviews and Info

74 of 238 comments (clear)