ArsTechnica Explains O(1) Scheduler

← Back to Stories (view on slashdot.org)

ArsTechnica Explains O(1) Scheduler

Posted by michael on Thursday December 25, 2003 @11:00AM from the time-to-kill dept.

geogeek writes "The recent release of Linux's 2.6 kernel introduces a number sweeping improvements. These can be hard to understand without a background in programming. This week's Linux.ars examines the improved scheduler for an enthusiast audience, concisely explaining its nuances and the practical effects of an O(1) design."

13 of 295 comments (clear)

Min score:

Reason:

Sort:

Multi core proc's by niko9 · 2003-12-25 11:05 · Score: 2, Insightful

With all the recent talk about multi core processors, will this sort of thing be relegated to hardware; or will there still be a need for software scheduler?
1. Re:Multi core proc's by be-fan · 2003-12-25 11:18 · Score: 2, Insightful
  
  A multi-core processor appears as multiple processors to the software. Even SMT processors appear as multiple processors to the software. So the software scheduler is needed to juggle the available processors among the threads competing for them.
  
  --
  A deep unwavering belief is a sure sign you're missing something...
A question if anyone can answer by abhikhurana · 2003-12-25 11:46 · Score: 2, Insightful

That was kinda interesting though it doesn't really explain how they achieve the O(1)ness, and for example how did kernel 2.4 fare in comparison. Anyway, an O(1) scheduler is nice and all under low to moderate loads, but at high loads, when a lot of threads are running, the O(1) behaviour should enusre that the tasks are switching extremely fast, and as every context switch has some related overhead in terms of saving the context etc., that means that performance of the O(1) schedular should be worse than that of 2.4(where tasks are not switching as fast) in theory atleast. So is this really true?

--
What's under yellowstone?
Re:Hmm. by Anonymous Coward · 2003-12-25 11:49 · Score: 1, Insightful

You have 100 processes. Each has a priority. If you check each process individually every time you do a scheduling, you go into O(n) where n is the number of processes.

In other words, the magic lies in selecting a process based on its priority, without checking each processes priority, while still not leaving low priority processes starved of resources.

Now, I don't know how they did it, I'm not a kernel hacker. But just thinking about it, O(1) is never easy to achieve when dealing with n items .
sidebars??? by mefus · 2003-12-25 12:03 · Score: 3, Insightful

Isn't that what a sidebar is for?

Couldn't they have linked out into more in depth treatments, and saved the complexity for the interested (technically) reader while saving 'readability' for the non-technical ppl?

--
mefus
In Open Society, GPL Software frees YOU!
Ah-ha! by tsanth · 2003-12-25 12:18 · Score: 2, Insightful

Thanks; that third paragraph alone was much, much more informative than the linked article. Perhaps, if the editors can be bothered to edit a bit more (unlikely), could Ars provide links to these more technical articles?
Re:Ars' Piece by moonbender · 2003-12-25 12:28 · Score: 2, Insightful

For a desktop (or otherwise interactive, like my media box) system, it really made a major perceived improvement in performance (while *actual* performance wasn't affected; it was just about priorities and scheduling, making it "feel" more responsive).
It doesn't just "feel" more responsive, it is more responsive! It's a matter of definition, but I'd certainly call that an improvement of performance. Taking from the article (or comp.sci classes), while the throughput remains equal - I suppose it's even reduced - the latency is lower, too. Performance is both factors, not just the former.

--
Switch back to Slashdot's D1 system.
Preemption defined incorrectly, I think by vrt3 · 2003-12-25 12:39 · Score: 4, Insightful

In the article, you write:
It is important to remember that a process can be stopped in the middle of its timeslice; this is the purpose of preemption.
The purpose of preemption is not to stop processes in the middle of their timeslice; the purpose is to stop them when the scheduler decides to, not only when the process itself decides to give another process a chance to run. The latter is called cooperative multitasking, and is used in Windows 3.x and MacOS prior to X.
Timeslices are used in order to implement preemption: a process stops either when it blocks (waiting for I/O) or otherwise voluntarily gives up the rest of its timeslice, or when it's timeslice is used up.
About the article in general: I'm surprised about the dumbing down. Other articles on Ars, including but not limited to the series about processor technologies (e.g. the one about hyperthreading) are much more thorough and detailed.

--
This sig under construction. Please check back later.
Re:Ars got it slightly wrong about O(1)... by qc_dk · 2003-12-25 12:55 · Score: 2, Insightful

that is not entirely true it means that for the worst case input, there exist a constant k and a smallest number of processes np where the following equation is fulfilled: Time_To_Switch_n_proces(p)< k where p>np that is not exactly the same as your definition.
Preemption and disk requests - educate me by ortholattice · 2003-12-25 14:05 · Score: 4, Insightful

I think this may be important:
The benefits of preemption are great enough that the kernel developers decided to make the kernel itself preemptible. This allows a kernel task such as disk I/O to be preempted, for instance, by a keyboard event. This allows the system to be more responsive to the user's demands. The 2.6 kernel is able to efficiently manage its own tasks in addition to user processes.
One of the most time-wasting, counterproductive things for me on Windows, and to a lesser extent on Linux, is that background tasks can essentially take over the computer because of disk thrashing. Sometimes (on Windows XP) I've seen it literally take minutes to bring a window to the front and repaint it while the disk is thrashing doing god-knows-what in the background. It is impossible to get any serious work done when that is happening. Setting my process to highest priority (on Windows) doesn't seem to help when disk thrashing is involved. I can understand how it might take a few seconds to swap another process out and bring mine in, but not minutes.
Now I don't know exactly what's happening behind the scenes, but as soon as I click on a window to bring it in focus, I want everything on the computer to be devoted to doing just that until its done, regardless of what disk blocks from other processes are in the queue. My disk block requests should go in front of all of the others unconditionally. In other words I should see essentially no more latency than if the computer were completely idle in the background, other than just the minimum delay required to bring it back into memory if it's swapped out. For all I care just completely freeze all other processes on the computer until my request is finished, or at least give me the option to specify that it be done that way. My productivity should override everything else on my own desktop/workstation computer.
So, educate me about what I may be overlooking here, and will 2.6 help this? I mean for Linux of course, not Windows, which may be a lost cause (:
Re:Ars' Piece by spectecjr · 2003-12-25 22:02 · Score: 5, Insightful

Windows XP, for example, isn't all that great in this area, and often one process can too easily slow things down. This is even further emphasized when all the processes you're working with are actually the main "explorer.exe" process; eg, you do something in Windows Explorer that blocks (spin up a CD drive) and everything else (the "desktop", task bar, etc) all become unresponsive...

The kernel hasn't slowed down in that case; it's the GUI layer which has slowed down. It hit a bottleneck.

Interestingly, it would appear that Windows has used an O(1) scheduler in Windows NT and up since at least NT 4.0 (which came out in 1996).

It's also interesting to note that it has the same priority-boosting for IO threads that Linux has.

"Windows NT also uses additional priority adjustments based on other events, such as momentarily boosting thread priority when it returns from an I/O call"

Detailed explanation of the NT4.0 scheduler

Easier to read version of the above

Part 1 of a slightly more indepth (but in some places inaccurate)* article

Part 2 of the article

*The article makes it sound like it's O(N) in places; it's not. The scheduler is using a set of priority lists, which are kept in-order at insertion time when a thread exits its context, by the thread inserting itself at the end of the list for its priority.

--
Coming soon - pyrogyra
Re:Hmm. by tigga · 2003-12-25 23:49 · Score: 2, Insightful

I'm just saying, I don't get the concept that merely making an O(1) scheduler is so incredibly difficult, and it's so utterly amazing that they were able to do so.
Well, they (afaik Ingo Molnar) created universal scheduler which should work properly on user workstation and on server, on uni- and multiprocessor systems with small latencies and without performane loss. It may be easy to create a model in your head, but proper implementation takes a lot of effort and keyword is 'proper'.
Re:What was the O value od the 2.4 scheduler? by LordBodak · 2003-12-26 01:29 · Score: 2, Insightful

This is actually a really good question and it's sad to see the trolls hijacked the thread.
Without explaining the O() of the 2.4 kernel, what good is an article explaining that 2.6 has a O(1)?

--
LordBodak's journal.