Multicore Requires OS Rework, Windows Expert Says

This is new?! by DavidRawling · 2010-03-21 11:50 · Score: 4, Insightful

Oh please, this has been coming for years now. Why has it taken so long for the OS designers to get with the program? We've had multi-CPU servers for literally decades.

Re:This is new?! by Sir_Sri · 2010-03-21 12:06 · Score: 2, Insightful

ya but those cases, as he reasonably explains, tend to get specialized development (say scientific computing), or separate processes, or while he doesn't explain it, a lot of server stuff is embarrassingly (or close to) parallel.
I can sort of see them not having a multi-processor OS just waiting for the consumer desktop- server processors are basically cache with some processor attached, whereas desktop processors are architected differently, and who knew for sure what the mutlicore world would look like in detail (or more relevantly what it will look like with 4, 8 or 16 or whatever cores). How will those cores be connected? How symmetric/asymmetric will they be? Right now OS's are built around two big asymmetric processors (cpu and gpu) and several smaller specialized ones (networking sound etc). Some of those architecture things *could* be fairly fundamental to the design you want to use, and there's no point investing huge development time trying to build software for hardware which doesn't exist and may never exist.
I'm not sure about his proposed architecture. It doesn't sound easily backwards compatible (but I might be wrong there), and there's a certain simplicity to 'reserve one core for the OS, application developers can manage the rest of them themselves' sort of model like consoles.
Re:This is new?! by PhunkySchtuff · 2010-03-21 12:07 · Score: 5, Insightful

Since when have OS designers optimised their code to milk every cycle from the available CPUs? They haven't, they just wait for hardware to get faster to keep up with the code.

--
Specialist Mac support for creative pros, Melbourne
Re:This is new?! by Jeremi · 2010-03-21 12:19 · Score: 4, Insightful

Why has it taken so long for the OS designers to get with the program?
Coming up with a new OS paradigm is hard, but doable.
Coming up with a viable new OS that uses that paradigm is much harder; because even once the new OS is working perfectly, you still have to somehow make it compatible with the zillions of existing applications that people depend on. If you can't do that, your shiny new OS will be viewed as an interesting experiment for the propeller-head set, but it won't ever get the critical mass of users necessary to build up its own application base.
So far, I think Apple has had the most successful transition strategy: Come up with the great new OS, bundle the old OS with it, inside an emulator/sandbox, and after a few years, quietly deprecate (and then drop) the old OS. Repeat as necessary.

--

I don't care if it's 90,000 hectares. That lake was not my doing.
Re:This is new?! by Cryacin · 2010-03-21 12:26 · Score: 5, Insightful

For that matter, since when have software vendors been willing to pay architects/designers/engineers etc to optimise their software to milk every cycle from the available CPUs and provide useful output with the minimum of effort? They don't, they just wait for hardware to get faster to keep up with code.

The only company that I have personally been exposed to that gives half a hoot about efficient performance is Google. It annoys me beyond belief that other companies think it's acceptable to make the user wait for minutes whilst the system recalculates data derived from a large data set, and doing those calculations multiple times just because a binding gets invoked.

--
Science advances one funeral at a time- Max Planck
Re:This is new?! by drsmithy · 2010-03-21 12:38 · Score: 2, Interesting

It doesn't sound easily backwards compatible (but I might be wrong there), and there's a certain simplicity to 'reserve one core for the OS, application developers can manage the rest of them themselves' sort of model like consoles.
Those curious about what life would be like with application developers managing system resources, should try firing up an old copy of Windows 3.1 or MacOS and running 10 or so applications at the same time.
I can only assume TFA is an atrociously bad summary of what he's actually proposing, because it sounds way to boneheaded for someone in that position to be seriously suggesting.
Re:This is new?! by Bengie · 2010-03-21 12:44 · Score: 3, Informative

developing server apps to run parallel is easy, client software is hard. Many times, the cost of syncing threads is greater than the work you get from them. So you leave it single threaded. The question is, how may you design a Framework/API that is very thread friendly while making sure everything runs in the order expected all the while making it easy for bad programmers to take advantage of it.
The biggest issue with developing async-threaded programs is logical dependencies that don't allow part to be loaded/processed before another. If from square one, you develop an app to take advantage of extra threads, it may be less efficient, but more responsive. Most programmers I talk to have issues trying to understand the interweaving logic of multi-threaded programing.
I guess it's up to MS to make a easy to use idiot-proof threaded framework for crappy programmers to use.
Re:This is new?! by gig · 2010-03-21 13:09 · Score: 2, Informative

> Since when have OS designers optimised their code to milk every cycle from the available CPUs?
Apple has been doing this for years. This is one of the advantages for the user of buying a complete product. Apple can't pretend that someone else will solve the problem for them through bigger hardware or the magic of open source.
Enabling large scale multiprocessing is one of the fundamental features of Mac OS X v10.6 Snow Leopard. The feature is called "Grand Central" and enables an app developer to make fairly small modifications to their app which cause it to go from pegging 1 CPU to pegging an unlimited number of CPU's. But multiprocessing has been part of OS X since the beginning. They shipped machines with multiple CPU's a long, long time ago compared to PC. Even Mac OS 9 had multiprocessing features.
Apple has also had XGrid going for some time now, which is quick and easy cluster computing.
Then if you look at iPhone OS, that has been highly, highly optimized. An iPhone 3GS with a 600MHz CPU outperforms a Nexus One with a 1000MHz CPU. The iPhone 3G with a 400MHz CPU outperforms a Palm Pre with 600MHz CPU. Those optimizations are part of the reason why Apple is currently undercutting both Android and Palm on price, which is the opposite of what was expected by Palm and Android developers and the entire industry. iPad on a 1000MHz CPU has been described by the people who have used it so far as being incredibly fast.
So you are right if you're talking about Microsoft (and maybe Linux, I don't know) but you're definitely wrong if talking about all OS designers.
Re:This is new?! by jc42 · 2010-03-21 13:19 · Score: 4, Insightful

Since when have OS designers optimised their code to milk every cycle from the available CPUs?
This isn't just an OS-level problem. It's a failure among programmers of all sorts.
I've been involved in software development since the late 1970s, and for the start I've heard the argument "We don't have to worry about code speed or size, because today's machines are so fast and have so much memory. This was just as common back when machines were 1,000 times slower and had 10,000 times less memory than today.
It's the reason for Henry Petroski's famous remark that "The most amazing achievement of the computer software industry is its continuing cancellation of the steady and staggering gains made by the computer hardware industry."
Programmers respond to faster cpu speed and more memory by making their software use more cpu cycles and more memory. They always have, and there's no sign that this is going to change. Being efficient is hard, and you don't get rewarded for it, because managers can't measure it. So it's better to add flashy eye candy and more features, which people can see.
If we want efficient code, we have to figure out ways to reward the programmers that write it. I don't see any sign that people anywhere are interested in doing this. Anyone have suggestions for how it might be done?

--
Those who do study history are doomed to stand helplessly by while everyone else repeats it.
Re:This is new?! by fuzzyfuzzyfungus · 2010-03-21 13:20 · Score: 4, Insightful

I doubt that it's just google. I suspect the following:

There are(in broad strokes, and excluding the embedded market), two basic axes on which you have to place a company or a company's software offering in order to predict its attitude with respect to efficiency.

One is problem scale. If a program is a once-off, or an obscure niche thing, or just isn't expected to have to cope with very large data sets, putting a lot of effort into making it efficient will likely not be a priority. If the program is extremely widely distributed, or is expected to cope with massive datasets, efficiency is much more likely to be considered important(if widely distributed, cost of efficient engineering per unit falls dramatically, if expeced to cope with massive datasets, amount of hardware cost and energy cost avoided becomes significant. Tuning a process that eats 50% of a desktop CPU into one that eats 40% probably isn't worth it. Tuning a process that runs on 50,000 servers into one that runs on 40,000 easily could be).

The second is location: If a company is running their software on their own hardware, and selling access to whatever service it provides(search engine, webmail, whatever), their software's efficiency or inefficiency imposes a direct cost on them. Their customers are paying so much per mailbox, or so much per search query, they have an incentive to use as little computer power as possible to deliver that product. If a company is selling boxed software, to be run on customer machines, their efficiency incentives are indirect. This doesn't mean "nonexistent"(a game that only runs on $2,000 enthusiast boxes is going to lose money, nobody would release such a thing. Among enthusiasts, browser JS benchmarks are a point of contention); but it generally does mean "secondary to other considerations". Customers, as a rule, are more likely to use slow software with the features they want, or slow software that released first and they became accustomed to, than fast software that is missing features or requires substantial adjustment on their part. Shockingly enough, software developers act on this fact.

On these axes, you would strongly suspect that Google would be efficiency oriented. Their software runs on a grand scale, and most of it runs on their own servers, with the rest competing against various desktop incumbents, or not actually all that dramatically efficient(Nothing wrong with Google Earth or Sketchup; but nothing especially heroic, either). However, you would expect roughly the same of any entity similarly placed on those axes.
Re:This is new?! by PhunkySchtuff · 2010-03-21 13:23 · Score: 3, Interesting

I don't know if you noticed my sig, but I'm pretty familiar with what Apple have been up to these past few years ;-)
What I was getting at was that, in general, programmers simply don't have the time or money to really optimise their code and now that computers are, for all intents and purposes, fast enough to not really worry about optimisations.
Apple are doing a lot of good, as you mention, with things like Grand Central Dispatch, but the multiprocessing features in earlier versions of OS X, and even more OS 9, were nothing that was in any major way any better than that offered by, say, Windows or other Unix based OSs. In fact, in the Mac OS 9 days, the multiprocessing capabilities of Mac OS lagged quite far behind that of Windows NT at the time.

--
Specialist Mac support for creative pros, Melbourne
Re:This is new?! by Jane+Q.+Public · 2010-03-21 13:25 · Score: 3, Interesting

Mod parent up!

The fact is, the vast majority of programmers (and their tools) are not going to change virtually everything they do in order to deal with multiple cores. And there's good reason for that: it hugely complicates what could otherwise be fairly simple tasks. As the number of cores expand, it gets worse to the point of simply not being practical. This is a job that properly belongs in the OS or hardware layer.

Is it harder to design a system that decides for itself how to go about threading and multiprocessing, rather than relying on the programmer to know when it is best for that particular program? Yes! But that is irrelevant, because in the long run, that is the way it must be done. There is no other practical choice.

I had to laugh at Intel a few years ago when they called for end-product programmers to start programming for their multicore processors. I say, "No, Intel. It is you who must cater to the programmers. They are your customers, and essential suppliers of your other customers. It is your job to make sure that your processors do what the programmers want, not the other way around!"

Apple's decision to put provision for this in their Snow Leopard OS is a clear demonstration of their forward (and practical) thinking. Where are all the others?
Re:This is new?! by stevew · 2010-03-21 13:25 · Score: 5, Informative

Well - I can tell you that Dave Probert saw his first multi-processor about 28 years ago at Burroughs corporation. It was a dual-processor B1855. I had the pleasure with working with the guy way back then. From what I recall he then went on to work at FPS systems which was an array processor that you could add onto other machines (I think vaxen...but I could be wrong there..)
Anyway - he has been around ALONG time.

--
Have you compiled your kernel today??
Re:This is new?! by Brian+Gordon · 2010-03-21 13:34 · Score: 4, Insightful

Maybe it's not a question of whether the code is efficient. Maybe it's a question of how much you're asking the code to do. It's no surprise that hardware struggles to make gains against performance demands when software developers are adding on nonsense like compositing window managers and sidebar widgets. I'm enjoying Moore's law without any cancellation.. just run a sane environment. Qt or GTK, not both, if youre running an X desktop. Nothing other than IM in the system tray. No "upgrade fever" that makes people itch for Windows Media Player 14 when older versions work fine and mplayer and winamp work better.
Re:This is new?! by PixelSlut · 2010-03-21 13:40 · Score: 2, Interesting

Google? I'm a big Google fan (and despite the rest of my comment, also a big Android fan and totally love my Nexus One).. but if Google was so hardcore into efficiency, why the hell did they develop a new runtime for their Android that's based on Java?
Google didn't seem like the best company to praise for efficiency. I would have picked some sort of video game company like id Software (yeah, I realize this an apples and oranges comparison though).
Re:This is new?! by jo_ham · 2010-03-21 13:59 · Score: 2, Insightful

The beachball of rumination is there to remind you to book your holiday to the coast.
It used to be a feature of 10.2 and earlier - I only see it occasionally in the later versions, but it is still there occasionally.
Re:This is new?! by hitmark · 2010-03-21 14:02 · Score: 2, Funny

now thats a program name that begs to be had fun with. Whoever named it that must have one impressive sense of irony.

--
comment first, facts later. http://chem.tufts.edu/AnswersInScience/RelativityofWrong.htm
Re:This is new?! by skids · 2010-03-21 14:12 · Score: 2, Insightful

No glory in it either. Even when you're doing it for free, nobody seems to care if you produce an optimization.
Plus, there are many more coders who have limited depth of understanding of OS interfacing, than there are coders who would go in after them to optimize. Heck, forget multicore -- how many applications fail to use vector units?
Sometimes optimizations get dropped from code as too difficult to maintain. Rarely, enough of them get collected in one spot to make a library out of them. Even more rarely, those libraries actually get used.
And it will stay that way until the consumer starts showing a preference for performance over features.

--
Someone had to do it.
Re:This is new?! by not+already+in+use · 2010-03-21 14:12 · Score: 2, Insightful

An iPhone 3GS with a 600MHz CPU outperforms a Nexus One with a 1000MHz CPU.
The reason the 3gs "outperforms" the N1 is because the N1 has more than twice the pixels of a 3GS. If the N1 had to drive the iphones resolution, it would wipe the floor with the iphones ass, all while supporting user app multitasking.

--
Similes are like metaphors
Re:This is new?! by Anonymous Coward · 2010-03-21 14:29 · Score: 2, Insightful

Grand Central is not a novel concept, similar libraries like OpenMP have been around for years on *nix/Windows.
Also, why are you being an iPhone shill in a discussion about multicore processing? The iPhone OS doesn't even really support multitasking, and runs on a mobile device with a single CPU.
Re:This is new?! by Mr.+Freeman · 2010-03-21 14:43 · Score: 4, Insightful

Because Google ain't crunching data sets on fucking mobile phones. They're optimizing their servers and the applications that run on those servers because Google is so damn big that a fraction of a percent increase in efficiency translates into huge amounts of money saved through less wasted CPU time. Mobile phones aren't a part of google.

If you phone runs a little less efficient then no one gives a damn. They want to make their phones easy to program for, which generally conflicts with efficiency.

--
-1 disagree is not a modifier for a reason. -1 troll, flaimbait, redundant, overrated are NOT acceptable substitutes.
Re:This is new?! by nine-times · 2010-03-21 14:57 · Score: 2, Informative

I don't know if you had to support Mac users during the years of transition, but it wasn't quite as easy as you made it sound. It was pretty smooth for such a drastic change, but I wouldn't want to repeat it any more than necessary.
Re:This is new?! by johanatan · 2010-03-21 14:59 · Score: 2, Interesting

Umm, I don't know about your organization but at mine we do perf measurements and there are perf specificiations which must be met as much as any of the other specs.
Re:This is new?! by amRadioHed · 2010-03-21 15:00 · Score: 3, Informative

The iPhone certainly doesn't outperform a Nexus One. If you compare browser rendering tests the Nexus One consistently completes loading pages quite a bit faster then the iPhone. You are probably thinking of games performance, and while it's true that the iPhone gets better frame rates, you're forgetting that the Nexus One is pushing around 2.5 times more pixels so that's not exactly an apples to apples comparison.

--
We hope your rules and wisdom choke you / Now we are one in everlasting peace
Re:This is new?! by Anonymous Coward · 2010-03-21 15:42 · Score: 2, Funny

A: Restrict the time of daily torture to shortly before lunch, instead of afterwards or first thing in the morning. This goes slightly against my grateful learning of the Policy, but I believe it will improve appetite, reduce wasted corporate meal provisions, and it can reduce messes caused by bleeding on corporate property by allowing time for wounds to clot before work resumes.
B: Allow the compiler designers to breathe fresh air for an hour a day, in the evening before returning them to their chains in the boiler room. Most compiler designers will use the opportunity to weep quietly to themselves, which will advantageously clean soot from their eyes. Their temporarily improved eyesight will aid in morning productivity. Fresh air in the morning would be misconstrued as a reward for something, but in the evenings they'll be too exhausted from the standard seventy hours work to do anything but quietly babble and faint.
C: Use the whip at random after they've made their comments in our daily herding. Immediately whipping while they talk or right after discourages them from singling themselves out as disbelievers. When they are inevitably whipped, they'll naturally ask themselves why and your lesson will not go unheeded. Others being whipped at random will be that much more aware they shouldn't have listened, to the other lower workers.
D: The rigours of programming often take a toll on programmers eyes and hands, so reducing the pressure of clamps and other productivity devices attached might be more effective if used sparingly.
All hail to the management overlords. We obey, thou almighty ones. As I am but the brown haired guy from sub floor B, whom speaks to you with reverence. I eagerly await and deserve my anticipated punishments.
Humbly,
Systems Minion, B floor, seat 522.
Re:This is new?! by dudpixel · 2010-03-21 15:43 · Score: 3, Informative

Come up with the great new OS...
hang on, this "new" OS you're referring to is basically UNIX (BSD). It was invented before Windows. Sure apple has modified it and put a shiny new layer on top (that works exceptionally smoothly, mind you), but if you wanna get technical, they didn't come up with a new OS, they improved an old one.

--
This seemed like a reasonable sig at the time.
Re:This is new?! by AaronMK · 2010-03-21 15:43 · Score: 2, Interesting

Grand Central Dispatch (GCD) is not some magic bullet that "deals with the cores", as you put it. The big thing it adds is a system wide tasking and scheduling component accessible to individual applications, making it easier to designate blocks of code (ie tasks) that can run in parallel, and spread those tasks among the available CPUs. Programmers still have the burden creating task parallel algorithms to solve their problems, and that is usually the tricky part. Creating a Thread Pool (GCD like functionality) component for an application (or using one someone else built, of which there have been many long before GCD), in both Windows and OS X is very easy in comparison.
Don't get me wrong, GCD is a nice optimization and has some good features, but it is a relatively small and trivial part of the bigger problem.
Re:This is new?! by LtGordon · 2010-03-21 15:44 · Score: 2, Informative

... but if Google was so hardcore into efficiency, why the hell did they develop a new runtime for their Android that's based on Java?
Because the Java gets executed on the user's hardware. Google cares about efficiency insofar as it affects their own hardware requirements.
Re:This is new?! by jc42 · 2010-03-21 15:53 · Score: 3, Insightful

Hey, if you liked programming for a one-byte machine, maybe you should join the quantum computer research effort. They're just now looking forward to the creation of their first 8-bit "computer" in the very near future. ;-)
Of course, you can do a bit more computing with 8 Q-bits than you can with 8 of the more mundane bits that the rest of us are using.

--
Those who do study history are doomed to stand helplessly by while everyone else repeats it.
Re:This is new?! by tsotha · 2010-03-21 17:42 · Score: 2, Insightful

It's not a failure among programmers at all - it's a business decision. The main reason software is less efficient is the costs are so heavily tilted toward software development instead of hardware. For the vast majority of business applications companies are using generalized frameworks to trade CPU cycles and memory for development time.
Even in terms of development style, it just isn't worth it to optimize your code if it's going to substantially increase development time. People are expensive. Time is expensive. Hardware is not.
Now, if you're Microsoft, or Blizzard, or Google, then the equation changes, since your code is running on millions of CPUs. But that's not the normal case. If I'm writing a web service so the accounting software at headquarters can tell how many widgets are in my company's warehouse, it really doesn't matter how inefficient the code is (within reason) as long as it works reliably and is easy to maintain. What my boss really wants is for me to finish as quickly as possible and move on to the next task.
Re:This is new?! by mjwx · 2010-03-21 17:58 · Score: 5, Informative

Then if you look at iPhone OS, that has been highly, highly optimized. An iPhone 3GS with a 600MHz CPU outperforms a Nexus One with a 1000MHz CPU. The iPhone 3G with a 400MHz CPU outperforms a Palm Pre with 600MHz CPU
Citation needed? I think you'll find that Iphone only appears to outperform Android because Android is doing a lot more then the Iphone. Further more many things that work on Android do not work on Iphone, slashdot for instance works fine on my HTC Dream or newer Motorola Milestone with the standard browser, it works even better with Dolphin browser.

This cannot be a fair comparison until the Iphone can do everything that Android phones can, unless you want to compare functionality where Iphone is an epic failure.

Those optimizations are part of the reason why Apple is currently undercutting both Android and Palm on price,
Now I can tell you're full of it. All prices are incl of local taxes, and UK VAT does not apply outside the EU for those in Australia, Canada and the US.

UK Expansys
Motorola Milestone GBP 379
Nexus 1 GBP 599
Iphone 32 GB GBP 799

AU Mobicity
Motorola Milestone A$659
Nexus 1 A$849
Iphone 16 GB A$959

The cheapest Iphone 3GS available is A$100 more expensive then the newer Motorola Milestone (droid for the Yanks) and Google Nexus One. Not to mention that both the Milestone and Nexus One can do more as well as lack the restrictions of the Iphone. But then again I suspect you were merely looking to confirm your quite obvious bias rather then do an accurate comparison.

Apple's operating systems are not very well optimised, not even as much as Windows operating systems, Apple's OS pretend to have optimisation by providing the OS with more hardware then it needs and limiting functionality to prevent any perceived loss of speed. Most people using a Mac or Iphone rarely use the full power of the hardware, ergo an un-optimised OS goes unnoticed by the user. Here is the core of the design (in an engineering perspective) a design does not have to work well, it just has to work. The vast majority of people will ignore tiny flaws if they can get the task done, OTOH if a computer doesn't do the task the user will get annoyed no matter how pretty the interface.

As a good developer friend of mine likes to say, "If given the choice, a user will press the 'I just want it to work today' button". OSX provides this very shiny button but only in a few select places, Windows provides this not so shiny button almost everywhere. This is why Windows is still the number one OS on the planet.

--
Calling someone a "hater" only means you can not rationally rebut their argument.
Re:This is new?! by mjwx · 2010-03-21 18:00 · Score: 2, Insightful

The reason the 3gs "outperforms" the N1 is because the N1 has more than twice the pixels of a 3GS. If the N1 had to drive the iphones resolution, it would wipe the floor with the iphones ass, all while supporting user app multitasking.
What many people are forgetting is that the N1 has no GPU, it requires the CPU to do all the rendering, which makes the rendering a little slower.

We are better off comparing it to the Motorola Milestone (Droid in the US) which has a GPU.

--
Calling someone a "hater" only means you can not rationally rebut their argument.
Re:This is new?! by Anonymous Coward · 2010-03-21 18:06 · Score: 3, Interesting

I don't think you understood the point he was trying to make. Windows has had threading since 1993 and a threadpool API since before OS X was released. The point he was making was not that Windows wasn't good enough for multiple cores, it was that the current paradigm about how OSes and apps relate wasn't good enough.
Back when you only had a single core CPU, the OS had to share the CPU with all the apps. Thus arose the kernel/user model where the OS ran in kernel mode and the apps ran in user mode. When an app needed some system service it would stop running, the CPU would switch to kernel mode, perform the server, and go back to user mode so the app could resume. When multiple CPUs and then multiple cores per CPU became available, this model was simply expanded so that the OS ran on every CPU core. This is called the SMP (Symmetric MultiProcessing) model because every processor core has the same duties as all the others.
I think what he's saying is that having the OS run on every core means that data structures it uses will have to be shared across all the cores in the system, causing problems like contention and false sharing. It sounds like he is considering what would happen if the OS just ran on some cores and apps ran on others. If an app needs a system service it need not stop running, switch into kernel mode, run the OS, etc. Instead it could send a message to one of the cores that the OS is running on and go about its business, hopefully staying more responsive that way. Obviously the app can't have full control of the CPU because it has to share the computer nicely, but it doesn't need a fully-blown kernel either, so the thin supervisor layer is what he related to a hypervisor.
It may be hard to imagine a 256-core computer because Apple doesn't make any, but Windows can already run on 256 cores. Of course those are huge server boxes, but it won't be long before it's common to have desktop boxes with 256 logical CPUs (2 sockets, 32 cores/socket, 4 threads/core), and then you can imagine that a high-end server might have upwards of 2048 cores. At that point does it even make sense to have the OS running on hundreds or thousands of cores simultaneously? Probably not.
I'm not saying that this guy has the right solution, but he has some interesting ideas worth considering.
dom
Re:This is new?! by IamTheRealMike · 2010-03-21 18:35 · Score: 4, Insightful

Why Java for Android? This is a good question. There are several reasons (that the Android team have discussed).
One is that ARM native code is bigger, size-wise, than Dalvik VM bytecode. So it takes up more memory. Unlike the iPhone, Android was designed from the start to multi-task between lots of different (user installed) apps. It's quite feasible to rapidly switch between apps with no delay on Android, and that means keeping multiple running programs in RAM simultaneously. So trading off some CPU time for memory is potentially a good design. Now that said, Java has some design issues that make it more profligate with heap memory than it maybe needs to be (eg utf16 for strings) so I don't have a good feel for whether the savings are cancelled out or not, but it's a justification given by the Android team.
Another is that Java is dramatically easier to program than a C-like language. I mean, incredibly monstrously easier. One problem with languages like C++ or Objective-C is that lots of people think they understand them but very few programmers really do. Case in point - I have an Apple-mad friend who ironically programs C# servers on Windows for his day job. But he figured he'd learn iPad development. I warned him that unmanaged development was a PITA but he wasn't convinced, so I showed him a page that discussed reference counting in ObjC (retain/release). He read it and said "well that seems simple enough" - doh. Another one bites the dust. I walked him through cycle leaks, ref leaks on error paths (no smart pointers in objc!), and some basic thread safety issues. By the end he realized that what looked simple really wasn't at all.
By going with Java, Android devs skip that pain. I'm fluent in C++ and Java, and have used both regularly in the past year. Java is reliably easier to write correct code in. I don't think it's unreasonable to base your OS on it. Microsoft has moved a lot of Windows development to .NET over the last few years for the same reasons.
Fortunately, being based on Java doesn't mean Android is inherently inefficient. Large parts of the runtime are written in C++, and you can write parts of your own app in native code too (eg for 3D graphics). You need to use Java to use most of the OS APIs but you really shouldn't be experiencing perf problems with things like gui layout - if you are, that's a hint you need to simplify your app rather than try to micro-optimize.
Re:This is new?! by julesh · 2010-03-21 20:04 · Score: 3, Informative

One is that ARM native code is bigger, size-wise, than Dalvik VM bytecode.
Citation needed. Dalvik is better than baseline Java bytecode, agreed. But so is ARM native code. [http://portal.acm.org/citation.cfm?id=377837&dl=GUIDE&coll=GUIDE&CFID=82959920&CFTOKEN=24064384 - "[...] the code efficiency of Java turns out to be inferior to that of ARM Thumb"]. I can find no direct comparison of ARM Thumb and Dalvik, so I can't tell you which produces the smaller code size.
So it takes up more memory.
Even if your first statement is true, this doesn't necessarily follow. VMs add overhead, usually using up somewhat more runtime memory to execute, particularly if a JIT is used (the current version of Dalvik doesn't have one, but the next one apparently will).
Re:This is new?! by VulpesFoxnik · 2010-03-21 22:53 · Score: 2, Insightful

I think it's more of a architecture problem. The x86 is a horrible creature, which has an inefficient language. What x86 does in a huge set of instructions, Arm can often do in 2/3 that many. All Mhz does is eat more power and generate more heat. The answer is smarter digital languages.

--
RES PUBLICA NON DOMINETUR
Re:This is new?! by YourExperiment · 2010-03-21 23:49 · Score: 2, Funny

Of course, you can do a bit more computing with 8 Q-bits than you can with 8 of the more mundane bits that the rest of us are using.
So an 8-bit quantum computer is the equivalent of a 9-bit standard computer?
Re:This is new?! by greed · 2010-03-22 03:22 · Score: 2, Interesting

Still, one could hope that people learned from Apple's experience with the Classic VM inside OS X.
The one thing I don't like from the transition is how Carbon/Classic :-separated paths are still hanging around in some interfaces.
But between Classic.app on OS X, Cygwin on Windows, and WINE on anything POSIX-ish (source API compatibility, not binary), there's plenty of work out there to serve as a template.
None of which Microsoft can use, I guess, because it's either Apple's or (L)GPLed.
Re:This is new?! by nine-times · 2010-03-22 04:37 · Score: 2, Interesting

Well it's my understanding that Carbon simply wasn't supposed to stick around this long. Cocoa was supposed to replace it, but there were some major developers (e.g. Adobe and Microsoft) who refused to transition.
There was even a dust up in the last year or so when 10.6 was released, and Apple made it clear that they weren't ever going to update Carbon to support 64-bit applications. Adobe pretty much flipped out, and is only now working on migrating over to Cocoa in CS5. Microsoft is finally releasing a Cocoa version of Office in 2010.
So in essence, we're 10 years out and the transition from OS9 still isn't done.
Don't get me wrong-- in the past 10 years, Apple has transitioned to an entirely new OS and a different chip architecture (PowerPC->x86), and overall both transitions went fine. I still wouldn't want to keep doing it every couple of years.
Re:This is new?! by steelfood · 2010-03-22 06:39 · Score: 2, Insightful

MS did the same during the transition to 32-bit. They included a 16-bit DOS emulator and had it run transparently. They did the same for the transition to 64-bit. It was so successful and so transparent a lot of IT professionals didn't even know it was even happening in the background.
Unlike Apple though, they never removed it. Sure, it resulted in a major security hole, but it also let legacy custom business apps run far longer than they otherwise would have been able to.
I suspect if they were ever to make another large transition, they'd do the same thing they've been doing for years.

--
"If a nation expects to be ignorant and free in a state of civilization, it expects what never was and never will be."

waiting by mirix · 2010-03-21 11:51 · Score: 5, Insightful

'Why should you ever, with all this parallel hardware, ever be waiting for your computer?'

Because I/O is always going to be slow.

--
Sent from my PDP-11

Re:waiting by DavidRawling · 2010-03-21 12:04 · Score: 4, Insightful

Well, with the rise of the SSD, that's no longer as much of a problem. Case in point - I built a system on the weekend with a 40GB Intel SSD. Pretty much the cheapest "known-good" SSD I could get my hands on (ie TRIM support, good controller) at AUD $172, roughly the price of a 1.5TB spinning rust store - and the system only needs 22GB including apps.
Windows boots from end of POST in about 5 seconds. 5 seconds is not even enough for the TV to turn on (it's a Media Center box). Logon is instant. App start is nigh-on instant (I've never seen Explorer appear seemingly before the Win+E key is released). This is the fastest box I've ever seen, and it's the most basic "value" processor Intel offer - the i3-530, on a cheap Asrock board with cheap RAM (true, there's a slightly cheaper "bargain basement" CPU in the G6950 or something). The whole PC cost AUD800 from a reputable supplier, and I could have bought for $650 if I'd wanted to wait in line for an hour or get abused at the cheaper places.
Now, Intel are aiming to saturate SATA-3 (600MBps) with the next generation(s) of SSD, or so I'm told. Based on what I've seen - it's achievable, at reasonable cost, and it's not only true for sequential read access. So if the IO bottleneck disappears - because the SSD can do 30K, 50K, 100K IO operations per second? Yeah, I think it's reasonable to ask why we wait for the computer.
Not that I think a redesign is necessary for the current architectures - Windows, BSD, Linux all scale nicely to at least 8 or 16 logical CPUs in the server world, so the 4, 6 or 8 on the desktop isn't a huge problem. But in 5 years when we have 32 CPUs on the desktop? Maybe. Or maybe we'll just be using the same apps that only need 1 CPU most of the time, and using the other 20 CPUs for real-time stuff (Real voice control? Motion control and recognition?)
Re:waiting by Jimbookis · 2010-03-21 12:17 · Score: 2, Insightful

Nature abhors a vacuum. It seems that no matter how much compute power you have something will always want to snaffle it up. I have a dual PentiumD at work running WinXP and 3GB of RAM. The proprietary 8051 compiler toolset god awful slow (and pegs one of the CPUs) compiling even just a few thousands of lines of code (10's of seconds with lots of GUI seizures) because I think for some reason the compiler and IDE are running a crapload of inefficient python in the backend. Don't even get me started on how long it takes to upload the frickin' binary to the target on JTAG. My debug cycles take far too long. My point is the compilation of my code base should be done literally in the blink of an eye but the developers saw fit to use a framework that depends on brute CPU power to do relatively simple stuff. A colleague writes VB.net apps to and sometimes it's like being back in 1989 watching .net draw all the elements of the GUI on the screen when you open it or change tabs. Fsck knows how this has come to pass in 2010 and why it's acceptable. So really, blame the programmers for making your beast of a PC slow and waiting around. This notion of massive language abstraction and wanting to use scripting languages ('coz it's easier, apparently) and just-in-time this and that is what is slowing computers down. And hard disks. '
Re:waiting by DavidRawling · 2010-03-21 13:16 · Score: 2, Informative

Yes, as does Windows. I think I should have been more clear - the scale curve is nice and flat up to 8, 16, maybe 32 logical CPUs. After that though, doubling CPUs doesn't necessarily double performance (even in heavy compute) - other bottlenecks start to impact, as does scheduler performance and architecture.
Re:waiting by BikeHelmet · 2010-03-21 13:52 · Score: 2, Insightful

Seems pretty good to me.
If true.
Re:waiting by Courageous · 2010-03-21 14:34 · Score: 4, Interesting

Well, with the rise of the SSD, that's no longer as much of a problem.
ORLY!
Let's do some math shall we? Take a simple 4 core Nehalem running at 2.66Ghz. Let's conservatively assume that it can complete a mere *1* double precision floating point number per clock cycle, per core. So. How big is a double? 64 bits, or 8 bytes. Now, that's 2.66 billion * 4 = 10.64 BILLION doubles per second, which is 85 GB/s.
The trick to understanding computing is that all computing really *is* at its heart a throughput problem.
Do you see the asymmetry in throughput b/t the Nehalem and your SSD?
C//
Re:waiting by node+3 · 2010-03-21 16:57 · Score: 2, Insightful

The question wasn't, "why should your CPU have to wait", it was, "why should *you* have to wait". At speeds approaching 3Gb/s, I think it's fair to say, at the person you replied to actually did say, "well, with the rise of the SSD, that's no longer as much of a problem."

The trick to understanding computing is that all computing really *is* at its heart a throughput problem.
The trick to understanding computers is to realize that all computing really is, at its heart, a human problem. It really doesn't matter if the CPU has to wait a trillion cycles in between receiving each byte of data, if the computer responds in an apparently instantaneous manner for the person using it, everything is working just fine.
I only care abstractly about how long my CPU has to wait. I do care directly about how much I have to wait.

The problem isnt even that simple by indrora · 2010-03-21 11:54 · Score: 5, Insightful

The problem is that most (if not all) peripheral hardware is not parallel in many senses. Hardware in today's computers is serial: You access one device, then another, then another. There are some cases (such as a few good emulators) which use muti-threaded emulation (sound in one thread, graphics in another) but fundamentally the biggest performance kill is the final IRQs that get called to process data. The structure of modern day computers must change to take advantage of multicore systems.

Grand Central? by volfreak · 2010-03-21 11:55 · Score: 3, Insightful

Isn't this the reason for Apple to have rolled out GrandCentral in Snow Leopard? If so, it seems it's not THAT hard to do - at least not that hard for a non-Windows OS.

Re:Grand Central? by jonwil · 2010-03-21 12:31 · Score: 2, Insightful

The overhead of systems like .NET is part of WHY we have a problem with excessive CPU usage in the first place.
Re:Grand Central? by jasmusic · 2010-03-21 13:00 · Score: 4, Insightful

I'm thinking you don't have much experience with .NET. During my projects it has always run comparable to native compiled code when I write my code with the mindset of a C++ programmer and not a VB one.

Current architecture flawed but workable BUT.... by syousef · 2010-03-21 11:56 · Score: 4, Interesting

...the implementation sucks.

Why for example does Windows Explorer decide to freeze ALL network connections when a single URN isn't quickly resolved? Why is it that when my USB drive wakes up, all explorer windows freeze? If you are trying to tell me there's no way using the current abstractions to implement this I say you're mad. For that matter when a copy or move fails in Explorer, why can't I simply resume it once I've fixed whatever the problem is. You're left piecing together what has and hasn't been moved. File requests make up a good deal of what we're waiting for. It's not the bus or the drives that are usually the limitation. It's the shitty coding. I can live with a hit at startup. I can live with delays if I have to eat into swap. But I'm sick and tired of basic functionality being missing or broken.

--
These posts express my own personal views, not those of my employer

Re:Because by Anonymous Coward · 2010-03-21 11:56 · Score: 3, Insightful

Why should you ever, with all this parallel hardware, ever be waiting for your computer?' he asked.

Because it might be waiting for I/O.

That's no reason for the entire GUI to freeze on Windows when you insert a CD.

Dumb programmers by Sarten-X · 2010-03-21 11:57 · Score: 2, Insightful

You wait because some programmer thought it was more important to have animated menus than a fast algorithm. You wait because someone was told "computers have lots of disk space." You wait because the engineers never tested their database on a large enough scale. You wait because programmers today are taught to write everything themselves, and to simply expect new hardware to make their mistakes irrelevant.

--
You do not have a moral or legal right to do absolutely anything you want.

Re:Dumb programmers by Anonymous Coward · 2010-03-21 12:10 · Score: 2, Insightful

not true, you wait because management speed tracks stuff out the door without giving developers enough time to code things properly and management ignores developer concerns in order to get something out there now that will make money at the expense of the end user, I have been coding a long time and have seen this over and over. Management doesn't care about customers or let developers code things correctly - they only care about $$$$$$$

Re:Fist post! by SilverEyes · 2010-03-21 11:58 · Score: 4, Funny

Fist post!

I come to /. to read tech news... not so see people fisting.

--
Interesting.

reinventing the wheel by pydev · 2010-03-21 12:09 · Score: 4, Interesting

Microsoft should go back and read some of the literature on parallel computing from 20-30 years ago. Machines with many cores are nothing new. And Microsoft could have designed for it if they hadn't been busy re-implementing a bloated version of VMS.

Re:Current architecture flawed but workable BUT... by Threni · 2010-03-21 12:23 · Score: 5, Insightful

Windows explorer sucks. It always just abandons copies after a fail - even if you're moving thousands of files over a network. Yes, you're left wondering which files did/didn't make it. It's actually easier to sometimes copy all the files you want to shift locally, then move the copy, so that you can resume after a fail. It's laughable you have to do this, however.

But it's not a concurrency issue, and neither, really, are the first 2 problems you mention. They're also down to Windows Explorer sucking.

Re:Fist post! by Jeremi · 2010-03-21 12:23 · Score: 4, Funny

I come to /. to read tech news... not so see people fisting.

Well, I came here to see the fisting. And frankly, so far this site has been a real disappointment.

--

I don't care if it's 90,000 hectares. That lake was not my doing.

Re:I hate to say it, but... by GIL_Dude · 2010-03-21 12:25 · Score: 4, Insightful

Are you running a 9 year old version of OSX too, or are you comparing a two generation old Windows version to a nice new Mac version? It really sounds like you are comparing apples (snicker) to oranges. After all, both Vista and Windows 7 have no problem running for a long, long time between reboots and don't get slow during that time.

Multithreading is the problem, not the answer by Anonymous Coward · 2010-03-21 12:26 · Score: 3, Interesting

The Problem with Threads (UC Berkeley's Prof Edward Lee)
How to Solve the Parallel Programming Crisis
Half a Century of Crappy Computing

The computer industry will have to wake up to reality sooner or later. We must reinvent the computer; there is no getting around this. The old paradigms from the 20th century do not work anymore because they were not designed for parallel processing.

Re:Multithreading is the problem, not the answer by lennier · 2010-03-21 12:59 · Score: 2

++.
In the 1980s there was lots of academic interest in parallel computing. Unfortunately a lot of it seemed to be driven merely by the quest for speed- once single CPUs got fast enough in the early 90s and everyone went 'whee C is good enough also objects are neat!', a whole generation of parallel language work was lost to the new&shiny.
It's depressing.

--
You are not a brain: http://books.google.com/books?id=2oV61CeDx-YC

Re:Luckily OSX is Already Has MultiCore Tech by Sc4Freak · 2010-03-21 12:27 · Score: 2, Insightful

I'm not sure I get it - GCD just looks like a threadpool library. Windows has had a built-in threadpool API that's been available since Windows 2000, and it seems to do pretty much the same thing as GCD.

Duh by Waffle+Iron · 2010-03-21 12:38 · Score: 3, Funny

Why should you ever, with all this parallel hardware, ever be waiting for your computer?'

For a lot of problems, for the same reason that some guy who just married 8 brides will still have to wait for his baby.

Re:Duh by keeboo · 2010-03-21 19:40 · Score: 3, Insightful

Why should you ever, with all this parallel hardware, ever be waiting for your computer?'
For a lot of problems, for the same reason that some guy who just married 8 brides will still have to wait for his baby.
Of course, he'll be able to get 8 babies at once, assuming none of the processes crash during the computation.
That improves bandwidth, but not latency: almost 1 baby/month, but 9 months of latency.
The guy could try interleaving the pregnancies, in order to get the illusion of lower latency.

The problem: the event-driven model by Animats · 2010-03-21 12:40 · Score: 4, Informative

A big problem is the event-driven model of most user interfaces. Almost anything that needs to be done is placed on a serial event queue, which is then processed one event at a time. This prevents race conditions within the GUI, but at a high cost. Both the Mac and Windows started that way, and to a considerable extent, they still work that way. So any event which takes more time than expected stalls the whole event queue. There are attempts to fix this by having "background" processing for events known to be slow, but you have to know which ones are going to be slow in advance. Intermittently slow operations, like an DNS lookup or something which infrequently requires disk I/O, tend to be bottlenecks.

Most languages still handle concurrency very badly. C and C++ are clueless about concurrency. Java and C# know a little about it. Erlang and Go take it more seriously, but are intended for server-side processing. So GUI programmers don't get much help from the language.

In particular, in C and C++, there's locking, but there's no way within the language to even talk about which locks protect which data. Thus, concurrency can't be analyzed automatically. This has become a huge mess in C/C++, as more attributes ("mutable", "volatile", per-thread storage, etc.) have been bolted on to give some hints to the compiler. There's still race condition trouble between compilers and CPUs with long look-ahead and programs with heavy concurrency.

We need better hard-compiled languages that don't punt on concurrency issues. C++ could potentially have been fixed, but the C++ committee is in denial about the problem; they're still in template la-la land, adding features few need and fewer will use correctly, rather than trying to do something about reliability issues. C# is only slightly better; Microsoft Research did some work on "Polyphonic C#", but nobody seems to use that. Yes, there are lots of obscure academic languages that address concurrency. Few are used in the real world.

Game programmers have more of a clue in this area. They're used to designing software that has to keep the GUI not only updated but visually consistent, even if there are delays in getting data from some external source. Game developers think a lot about systems which look consistent at all times, and come gracefully into synchronization with outside data sources as the data catches up. Modern MMORPGs do far better at handling lag than browsers do. Game developers, though, assume they own most of the available compute resources; they're not trying to minimize CPU consumption so that other work can run. (Nor do they worry too much about not running down the battery, the other big constraint today.)

Incidentally, modern tools for hardware design know far more about timing and concurrency than anything in the programming world. It's quite possible to deal with concurrency effectively. But you pay $100,000 per year per seat for the software tools used in modern CPU design.

Re:The problem: the event-driven model by shutdown+-p+now · 2010-03-21 13:22 · Score: 4, Interesting

This has become a huge mess in C/C++, as more attributes ("mutable", "volatile", per-thread storage, etc.) have been bolted on to give some hints to the compiler.
An interesting comment overall, but what relevance does "mutable" have to multi-threaded programming? It is just a way to say that a particular field in a class is never const, even when the object itself is as a whole. There are no optimizations the compiler could possibly derive from that (in fact, if anything, it might make some optimizations non-applicable).
Same goes for "volatile", actually. It forces the code generator to avoid caching values in registers etc, and always do direct memory reads & writes on every access to a given lvalue, but this won't prevent one core from not seeing a write done by another core - you need memory barriers for that, and ISO C++ "volatile" doesn't guarantee any (nor do any existing C++ implementations).

Microsoft Research did some work on "Polyphonic C#" [psu.edu], but nobody seems to use that.
It's a research language, not intended for production use. Microsoft Research does quite a few of those - e.g. Spec# (DbC), or C-omega (this is what Polyphonic C# evolved into), or Axum (the most recent take at concurrency, Erlang-style).
Those projects are used to "cook" some idea to see if it's feasible, what approach is the best, and how it is taken by programmers. Eventually, features from those languages end up integrated into the mainstream ones - C# and VB. For example, X# became LINQ in .NET 3.5, and Spec# became Code Contracts in .NET 4.0. So, give it time.
Re:The problem: the event-driven model by Animats · 2010-03-21 17:51 · Score: 2, Informative

An interesting comment overall, but what relevance does "mutable" have to multi-threaded programming?
A "const" object can be accessed simultaneously from multiple threads without locking, other than against deletion. A "mutable const" object cannot; while it is "logically const", its internal representation may change (it might be cached or compressed) and thus requires locking.
Failure to realize this results in programs with race conditions.
Re:The problem: the event-driven model by thesuperbigfrog · 2010-03-22 03:06 · Score: 3, Interesting

Most languages still handle concurrency very badly. C and C++ are clueless about concurrency. Java and C# know a little about it. Erlang and Go take it more seriously, but are intended for server-side processing. So GUI programmers don't get much help from the language.
In particular, in C and C++, there's locking, but there's no way within the language to even talk about which locks protect which data. Thus, concurrency can't be analyzed automatically. This has become a huge mess in C/C++, as more attributes ("mutable", "volatile", per-thread storage, etc.) have been bolted on to give some hints to the compiler. There's still race condition trouble between compilers and CPUs with long look-ahead and programs with heavy concurrency.
We need better hard-compiled languages that don't punt on concurrency issues. C++ could potentially have been fixed, but the C++ committee is in denial about the problem; they're still in template la-la land, adding features few need and fewer will use correctly, rather than trying to do something about reliability issues. C# is only slightly better; Microsoft Research did some work on "Polyphonic C#", but nobody seems to use that. Yes, there are lots of obscure academic languages that address concurrency. Few are used in the real world.
Ada 2005's task model is a real world, production quality approach to include concurrency in a hard-compiled language. Ada isn't exactly known for its GUI libraries (there is GtkAda), but it could be used as a foundation for an improved concurrent GUI paradigm.
This book covers the subject quite well.

--
42

Re:Current architecture flawed but workable BUT... by Kenz0r · 2010-03-21 12:42 · Score: 4, Insightful

I wish I could mod you higher than +5, you just summed up some of the things that bother me most about the OS that is somehow still the most popular desktop OS in the world.

To anyone using Windows (XP, Vista or 7) right now, go ahead and open up an Explorer window, and type in ftp:// followed by any url.
Even when it's a name that obviously won't resolve, or an ip of your very own local network of a machine that just doesn't exist, this'll hang your Explorer window for a couple of solid seconds. If you're a truly patient person, try doing that with a name that does resolve, like ftp://microsoft.com . Better yet, try stopping it.... say goodbye to your explorer.exe .

This is one of the worst user experiences possible, all for a mundane task like using ftp. And this has been present in Windows for what, a decade?

--
+1 Funny Signature

Re:Microkernel? by Amanieu · 2010-03-21 12:48 · Score: 2, Interesting

Actually most current monolithic kernels are multithreaded, so they can have one thread working on reading that CD, while another threads handles user input, etc. The only difference from microkernels is that it's all in a single address space.

Re:I hate to say it, but... by drsmithy · 2010-03-21 12:50 · Score: 3, Interesting

I noticed the same on my mac. With a set of eight CPU graph meters in the menu bar, they're almost always evenly pitched anywhere from idle to 100%, with a few notable exceptions like second life, some photoshop filters, and firefox of all things.
When booted into Win, more often than not I have two cores pegged high, and the others idle. Getting even use out of all cores is the exception, not the rule.

This is pretty much completely down to the application mix. Windows has no trouble whatsoever scheduling processes and threads to max out 8 (or 16, or whatever) CPUs, but if the applications are only coded to have, say, 1 or 2 "processing" threads, then there's nothing the OS can do to change that.

Re:Current architecture flawed but workable BUT... by drsmithy · 2010-03-21 12:53 · Score: 2, Informative

For that matter when a copy or move fails in Explorer, why can't I simply resume it once I've fixed whatever the problem is.

You can as of Vista.

Re:Current architecture flawed but workable BUT... by hitmark · 2010-03-21 13:07 · Score: 2, Insightful

or basically replaces windows with something else.

--
comment first, facts later. http://chem.tufts.edu/AnswersInScience/RelativityofWrong.htm

Re:Luckily OSX is Already Has MultiCore Tech by shutdown+-p+now · 2010-03-21 13:08 · Score: 3, Insightful

The trick with GCD is that it is somewhat more high-level than a simple thread pool - it operates in terms of tasks, not threads. The difference is that tasks have explicit dependencies on other tasks - this lets scheduler be smarter about allocating cores.

Re:Current architecture flawed but workable BUT... by NatasRevol · 2010-03-21 13:09 · Score: 2, Informative

Transaction is copying some files, failing in the middle, and not rolling back those copied over??

Hint. It's not transaction. It's just a bad piece of software that fails badly at doing it's basic job. Handling files.

--
There are two types of people in the world: Those who crave closure

Re:Luckily OSX is Already Has MultiCore Tech by Smurf · 2010-03-21 13:09 · Score: 4, Interesting

It seems you are severely underestimating what GCD means to the application developer. I strongly suggest you read parts 12 and 13 of John Siracusa's excellent review very carefully. As Siracusa says,

Those with some multithreaded programming experience may be unimpressed with the GCD. So Apple made a thread pool. Big deal. They've been around forever. But the angels are in the details. Yes, the implementation of queues and threads has an elegant simplicity, and baking it into the lowest levels of the OS really helps to lower the perceived barrier to entry, but it's the API built around blocks that makes Grand Central Dispatch so attractive to developers. Just as Time Machine was "the first backup system people will actually use," Grand Central Dispatch is poised to finally spread the heretofore dark art of asynchronous application design to all Mac OS X developers. I can't wait.

OSX is no more responsive than Windows by judeancodersfront · 2010-03-21 13:10 · Score: 2, Insightful

The author is talking about a complete OS redesign, not a new threading system.

Re:OSX is no more responsive than Windows by exomondo · 2010-03-21 18:06 · Score: 2, Insightful

The issue is who does the thread management, the programmer or the OS?
The issue is working out how to break up inherently serial problems into smaller parallel problems. Threading is not difficult, the difficulty comes in parallelising the problem and this must be done regardless of who does the thread management.

Re:Fist post! by omfgnosis · 2010-03-21 13:25 · Score: 4, Funny

That's actually pretty good typing with your fists. Do you have a comically large keyboard?

Microsoft's slowness and Windows 2005 by gig · 2010-03-21 13:31 · Score: 5, Insightful

I love how Microsoft can come along in 2010 and with a straight face say it's about time they took multiprocessing seriously. Or say it's about time we started putting HTML5 features into our browser. And we're finally going to support the ISO audio video standard from 2002. And by the way, it's about time we let you know that our answer to the 2007 iPhone will be shipping in 2011. And look how great it is that we just got 10% of our platform modernized off the 2001 XP version! And our office suite is just about ready to discover that the World Wide Web exists. It's like they are in a time warp.

I know they have product managers instead of product designers, and so have to crib design from the rest of the industry, necessitating them to be years behind, but on engineering stuff like multiprocessing, you expect them to at least have read the memo from Intel in 2005 about single cores not scaling and how the future was going to be 128 core chips before you know it.

I guess when you recognize that Windows Vista was really Windows 2003 and Windows 7 is really Windows 2005 then it makes some sense. It really is time for them to start taking multiprocessing seriously.

I am so glad I stopped using their products in 1999.

Re:Microsoft's slowness and Windows 2005 by RMS+Eats+Toejam · 2010-03-21 16:06 · Score: 2, Funny

I am so glad I stopped using their products in 1999.
But you are still an asshole 11 years later! What gives?

--
Turning to a Linux advocate for thoughts on Microsoft is like asking Hitler how he felt about the Jews.

Re:Current architecture flawed but workable BUT... by hitmark · 2010-03-21 13:45 · Score: 2, Interesting

there is a option, at least as far back as xp that allows explorer windows to run as their own tasks. Why its not enabled by default i have no clue about (except that i have seen some issues with custom icons when doing so).

--
comment first, facts later. http://chem.tufts.edu/AnswersInScience/RelativityofWrong.htm

Re:Luckily OSX is Already Has MultiCore Tech by hitmark · 2010-03-21 13:49 · Score: 2, Insightful

so basically a big pile of C64 wired to a single keyboard and screen, via a BIG kvm switch?

--
comment first, facts later. http://chem.tufts.edu/AnswersInScience/RelativityofWrong.htm

Reminds me of the Cache Kernel. by Grenamier · 2010-03-21 13:53 · Score: 2, Interesting

The part of the article where Probert discusses the operating system becoming something like a hypervisor reminds me of the Cache Kernel from a Stanford University paper back in 1994. http://www-dsg.stanford.edu/papers/cachekernel/main.html

The way I understand it, the cache kernel in kernel mode doesn't really have built-in policy for traditional OS tasks like scheduing or resource management. It just serves as a cache for loading and unloading for things like addresses spaces and threads and making them active. The policy for working with these things comes from separate application kernels in user mode and kernel objects that are loaded by the cache kernel.

There's also a 1997 MIT paper on exokernels (http://pdos.csail.mit.edu/papers/exo-sosp97/exo-sosp97.html). The idea is separating the responsibility of management from the responsibility of protection. The exokernel knows how to protect resources and the application knows how to make them sing. In the paper, they build a webserver on this architecture and it performs very well.

Both of these papers have research operating systems that demonstate specialized "native" applications running alongside unmodified UNIX applications running on UNIX emulators. That would suggest rebuilding an operating system in one of these styles wouldn't entail throwing out all the existing software or immediately forcing a new programming model on developers who aren't ready.

Microsoft used to talk about "personalities" in NT. It had subsystems for OS/2 1.x, WIn16, and Win32 that would allow apps from OS/2 (character mode), Windows 3.1 and Windows NT running as peers on top of the NT kernel. Perhaps someday the subsystems come back, some as OS personalities running traditional apps, and some as whole applications with resource management policy in their own right. Notepad might just run on the Win32 subsystem, but Photoshop might be interested in managing its own memory as well as disk space.

The mid-90s were fun for OS research, weren't they? :)

--
-- John Truong

Energy efficiency will do it by pslam · 2010-03-21 14:13 · Score: 5, Interesting

If we want efficient code, we have to figure out ways to reward the programmers that write it. I don't see any sign that people anywhere are interested in doing this. Anyone have suggestions for how it might be done?

It's happening, from a source people didn't expect: portable devices. Battery life is becoming a primary feature of portable devices, and a large fraction of that comes from software efficiency. Take your average cell phone: it's probably got a half dozen cores running in it. One in the wifi, one in the baseband, maybe one doing voice codec, another doing audio decode, one (or more) doing video decode and/or 3d, and some others hiding away doing odds and ends.

The portable devices industry has been doing multi-core for ages. It's how your average cell phone manages immense power savings: you can power on/off those cores as necessary, switch their frequencies, and so on. They have engineers who understand how to do this. They're rewarded for getting it right: the reward is it lives on battery longer, and it's measurable.

Yes, you can get lazy and say 'next generation CPUs will be more efficient', but you'll be beaten by your competitors for battery life. Or, you fit a bigger battery and you lose in form factor.

The world is going mobile, and that'll be the push we need to get software efficient again.

Re:Current architecture flawed but workable BUT... by duguk · 2010-03-21 14:18 · Score: 2, Informative

For that matter when a copy or move fails in Explorer, why can't I simply resume it once I've fixed whatever the problem

Try TotalCopy which adds a copy/move in the right click menu; or Teracopy commercial (free version available, supports Win7) complete replacement for the sucky Windows copy system.

USB/Network freezes and file copying isn't a fault of CPU cores like you say, Windows is just a sucky OS. Multicore stuff gets complicated, but this isn't going to be a panacea for Microsoft, it's another marketing opportunity.

Put up or shut up. by Anonymous Coward · 2010-03-21 14:22 · Score: 2, Interesting

I'm getting really sick of posting this, but I'll continue to do so until you do.

BUILD A WORKING PROTOTYPE OF THIS "UNIVERSAL BEHAVING MACHINE", OR SHUT THE HELL UP.

Those of us who aren't insane aren't impressed by talk, we're impressed by results. If you spend half as much effort building the thing as you do flapping your damn jaw, you'd be done by now.

(For any uninitiated mods, this fellow is slashdot poster "rebelscience", and maintains a website of the same name. Every time a multiprocessing-related thread comes up, he posts this tripe but has never actually done anything about it. Visit his website, and you'll see why I call him a lunatic)

Re:Fist post! by wiredlogic · 2010-03-21 14:34 · Score: 2, Informative

Well, I came here to see the fisting. And frankly, so far this site has been a real disappointment.

You have to read at -1 to see the goatse trolls.

--
I am becoming gerund, destroyer of verbs.

Re:Oh and which .NET programs are taxing the cpu? by jonwil · 2010-03-21 14:37 · Score: 2, Informative

.NET apps DO use a virtual machine, the Common Language Runtime and support .NET IL. However, the Virtual Machine DOES use just-in-time compiling and precompiling to compile the code to native code before it runs it.

Same as any halfway decent desktop Java Virtual Machine implementation does now (mobile JVMs usually use hardware features like ARM Jazzelle to run the Java code faster)

Re:I hate to say it, but... by ceoyoyo · 2010-03-21 15:12 · Score: 2, Informative

It doesn't make it as different as you seem to think.

I think GCD is a great idea, and a very useful tool, but it's not a magic bullet. GCD can schedule some things more effectively because it has a system-wide view. The closure extensions and GCD interface makes it reasonably easy for novice programmers to get things actually running in parallel.

Of the two, the latter has a MUCH bigger impact in terms of actually getting programs to take advantage of multiple cores. Actually sending

BUT, it's nothing you can't do (and hasn't been done) with various multiprocessing libraries, many of which run on Windows, or with good old threads and processes if you've got a moderate level of skill. In order for it to work effectively the programmer still has to a) structure his program in such a way that the parallelism is exposed and b) actually use GCD.

Contrary to what you seem to suggest, GCD does not really "creates and manages threads on its own, even in applications that are not written to be threaded." It creates threads at the (indirect) request of the application and schedules them appropriately. The application MUST be designed to take advantage of multithreading. The only difference is that GCD makes it easier for the programmer to actually get those threads up and running, and can possibly schedule them more effectively.

Re:Current architecture flawed but workable BUT... by Anonymous Coward · 2010-03-21 15:17 · Score: 2, Informative

Fixed in Vista and 7, you can ignore errors and continue copying.

Question: Does Linux need any retooling? by Taco+Cowboy · 2010-03-21 15:20 · Score: 2, Interesting

The article in question talks about Winblows.

What about Linux?

Does it need retooling as well?

--
Muchas Gracias, Señor Edward Snowden !

Data flow languages by goombah99 · 2010-03-21 15:24 · Score: 5, Interesting

I've always thought that both data flow languages and fortran95 had some innovations for multi-core programming worthy of being copied.

Data flow languages such as "G" which is sold as national instruments "labview" brand are intrinsically parallel at many levels. What they do is look at a function call as a list of unsatisfied inputs. These inputs are waiting for the data to arrive to make the variables valid. Then the subroutine fires. Thus every single function is potenitally a parallel process. it's just waiting on it's data. If you program in a serial fashion then of course those functions get called serially. But with graphic programming in 2D, you almost never are programming serially. You are just wiring outputs of other functions to inputs of others. Serial dependencies do arise but these are asynchronous and localized cliques. everything else is parallel. Yet you never ever ever actually write parallel code. it just happens automatically. Perl data language had a glimpse of this but it's not the same thing since the language is still perl and thus not parallel.

Objective-C with it's "message passing" abstraction is perhaps getting closer to the idea of a data flow. While one might complain that well objective-C message passing is just a different sugar coating of C just like C++ is. This would be true from the user's point of view. But it's not as true from the Operating system's point of view. IN OSX, these messages are passing more like actual socket programming at the kernel level. So there's more to objective C on apple's than meets the eye. But I don't know how far you can push that abstraction.

In fortran there are some rather simple but powerful multi-processor optimizations. First there's loops like "forall" that designate that a loop can be done in any order of the loop index and even in parallel. and then there's vectorized statements as part of the language like matrix multiplies. those are rather simple things so don't solve much but they do show that you can put a lot of compiler hinting into the language itself without re-inventing the language.

--
Some drink at the fountain of knowledge. Others just gargle.

Re:I hate to say it, but... by Nadaka · 2010-03-21 16:43 · Score: 2, Insightful

You may not have to write your code around threading, but you then have to write it around grand central dispatch. Having GCD available is going to do absolutely nothing for a program that was not written with GCD in mind. Its changing one set of problems/features for another. Writing multi-threaded software isn't exceptionally hard. I have done a lot of it. It may take a lot less code with GCD, but you also give up control. Even using GCD with code blocks you still have to deal with the problems that can be a pain in the ass, things like concurrency, blocking and munging data.

Answer: Yes by Bigjeff5 · 2010-03-21 16:44 · Score: 4, Interesting

First, the article in question talks about OS architecture, not Windows specifically. He specifically states that what he is speaking about is not something MS is working on. Quite the opposite, many of his MS colleagues disagree with him.

Second, the fundamental problems with OS design are exactly that: fundamental problems with OS design. Nobody is making an OS that truly takes advantage of multiple cores, it's still single-processor thinking with the ability to use more than one processor, and this leads to a number of inherent problems.

The article talks about what an OS might look like if built from scratch specifically for multiple core processing power, and there is nothing on the market like it at the moment. It's basically a hypervisor-based OS, where instead of giving programs slices of CPU time, the OS gives programs actual CPUs and slices of memory to use.

Something like that would be extremely slick, we already do that for virtual machines and we end up with 8+ full-fledged servers running on the same machine. Why can't you pull that back a little more so it's individual programs assigned to each CPU such that they don't have to interact with the OS at all once they are up and running? Can you imagine?

--
Security is mostly a superstition... Avoiding danger is no safer in the long run than outright exposure. - Helen Keller

Re:Answer: Yes by jesset77 · 2010-03-21 19:15 · Score: 2, Interesting

Looks like it's time for me to update my whitepaper on massively parallel OS design again? I admit, due to lack of interest I have let it fall a bit out of date, recently.
Among other things, I'm going with the name "Ironfluid" now, as I've finally deconflated the terms "cloud computing" and "fluid computing". Cloud really just means "run by somebody else", while "fluid computing" implies parallel processing and fault tolerance; decoupling the software completely from the hardware. Google, for example, offers both: but does not offer the tools for the common sysadmin to form their own clouds.
I think I'd like to.

--
People willing to trade their freedom of expression for temporary entertainment deserve neither and will lose both.
Re:Answer: Yes by jackharrer · 2010-03-21 22:08 · Score: 2, Informative

>>What we need is a "you don't want to use C: right now, trust me" signal. Ever tried to use Firefox while copying something big? Why does it take ages to display a webpage when it does not need to use the disk?
It only works like that on Windows. I think it's mostly about bad system design. I have no such issues on my Linux machine, but lots on my wife's Windows one. Both are the same Thinkpad laptops, so fault can only be on OS side.

--

"an experienced, industrious, ambitious, and often, quite often, picturesque liar" - Mark Twain
Re:Answer: Yes by mario_grgic · 2010-03-22 01:26 · Score: 2, Informative

Firefox doesn't behave like that in OS X. So I don't know if this is OS specific issue?

--
As the island of our knowledge grows, so does the shore of our ignorance.

Apple Grand Central Sucks by Anonymous Coward · 2010-03-21 16:55 · Score: 4, Informative

Apple's grand-central dispatch (GCD) solution is really primitive. It's just a simple thread-pool, where the programmer breaks their program down into tasks that can be executed independently then queues them for execution by the thread-pool.

GCD is not in the slightest innovative, except for a hack that allows "c" programmers to write tasks with slightly more convenience, by adding limited "closure" support to the language.

Similar concepts can be found all over the place; just see the "see also" section on the wikipedia article:
http://en.wikipedia.org/wiki/Grand_Central_Dispatch
Using any of the libs listed in that "see also" section, you can get GCD equivalent behaviour on unix/windows, and have been able to for years.

There are also languages with far superior parallel-processing abilities, where the effort is done by the compiler/environment, not the programmer. See any functional language, eg Haskell or Erlang. Write a program in these languages, and the parallel-processing happens just about automatically.

Adding parallelism to the *OS* is quite a different issue, and not one that Apple's GCD addresses.

Re:Current architecture flawed but workable BUT... by RzUpAnmsCwrds · 2010-03-21 17:04 · Score: 2, Insightful

Windows Explorer no longer kills network transfers after a failure as of Windows Vista.

Maybe some of the people complaining about Windows should stop using a version thats 9 years old (XP). Red Hat 7.2 isn't particularly great by today's standards either.

Nothing to see here by Low+Ranked+Craig · 2010-03-21 18:15 · Score: 2, Funny

Please move along

--
I still cannot find the droids I am looking for...

It's not even about multiple cores by macraig · 2010-03-21 18:35 · Score: 4, Insightful

What's wrong with at least some operating systems doesn't even have anything to do with multiple cores per se. They're simply designing the OS and its UI incorrectly, assigning the wrong priorities to events. No event should EVER supersede the ability of a user to interact and intercede with the operating system (and applications). Nothing should EVER happen to prevent a user being able to move the mouse, access the start menu, etc., yet this still happens in both Windows and Linux distributions. That's a fucked-up set of priorities, when the user sitting in front of the damned box - who probably paid for it - gets second billing when it comes to CPU cycles.

It doesn't matter if there's one CPU core or a hundred. It's the fundamental design priorities that are screwed up. Hell should freeze over before a user is denied the ability to interact, intercede, or override, regardless how many cores are present. Apparently hell has already frozen over and I just didn't get the memo?

Re:It's not even about multiple cores by macraig · 2010-03-22 03:59 · Score: 2

The FS operations simply need to happen at a slowed pace that favors the user at all times; why should that cause loss of data integrity? A file system that demands a minimum effective data rate below which bits are lost or corrupted?! That would certainly be a poorly designed file system or interface to it, wouldn't it? No wonder we need redundant (R)AID and backup solutions!

Looks like Tanenbaum will have been right after al by maweki · 2010-03-21 20:35 · Score: 2, Interesting

Looks like Tanenbaum will have been right after all, I mean, a vast amount of cores and huge parallelism is the advent of the micro- and exokernel, isn't it? This would be the simplest way to harness the multiple cores (instead of modifying a monolithic kernel to use multiple cores)

Re:Fist post! by Whalou · 2010-03-21 23:11 · Score: 3, Funny

I'd browse at -2 if I could.

--
English is not this .sig mother tongue...

Re:I hate to say it, but... by TheRaven64 · 2010-03-22 00:07 · Score: 2, Informative

I am finding it very difficult to believe that you have actually used GCD. I have, and have read most of the code for the implementation. Creating threads is not hard - it is definitely not what makes parallel programming difficult. The difficult bit is splitting your code into parts that have no interdependencies and so can execute concurrently.

When you use libdispatch, you still have to do this. All that it does for you is implement an N:M threading model. It allocates a group of kernel threads and then multiplexes them into work queues. The pthread_workqueue_*_np() family of system calls lets the kernel decide the optimum number of kernel threads for the application, depending on system configuration and load. The libdispatch code then executes blocks on these threads. This saves some thread creation time and saves some context switching and cache churn because it runs blocks sequentially in a small number of threads (ideally one per core), rather than running them all concurrently on a separate thread.

It creates and manages threads on its own, even in applications that are not written to be threaded

No it doesn't. You must get a dispatch_queue_t and then send it blocks to execute concurrently. You must do this explicitly.

--
I am TheRaven on Soylent News

Re:A more basic question by radish · 2010-03-22 03:07 · Score: 2, Informative

iPhone isn't even slightly "instant on" - it takes at least a minute to boot an iPhone from off. What you're seeing most of the time is "screen off" mode. Unsurprisingly, switching the screen on & cranking up the CPU clock doesn't take much time. Likewise, waking my Windows box up from sleep doesn't take very long either. Comparing modern OS software running on modern hardware I see little difference in boot times, or wake time from sleep - which would indicate that if MS are being lazy then so are Apple & all the devs in the Linux & BSD worlds. As for why my ST used to boot so much quicker, well the lack of discs helped, as did the lack of hardware variance (and thus lack of drivers to load & start).

--

---- Den ene knappen er powerknapp, den andre er Bender voice knapp "Bite My Shiny Metal Ass"

Slashdot Mirror

Multicore Requires OS Rework, Windows Expert Says

108 of 631 comments (clear)