Inside the PowerPC 970
daveschroeder writes "Jon "Hannibal" Stokes has posted a long-awaited, very detailed analysis of the IBM PowerPC 970 at Ars Technica. Notable quote: 'The 970 was made for Apple'."
← Back to Stories (view on slashdot.org)
See here More on the power pc posted yesterday
PAGERANK++ Robsell.com
Its like comparing apples with apples... Its a dupe.
It's ok, the last post didn't get enough comments. Please continue discussion.
Fast forward a few months....hmm...a few options:
Sun: Nice hardware, very expensive, CDE.
AMD: Commodity hardware, cheap, WinXP.
HP: Intel hardware, very expensive, CDE or WinXP.
I think I know what I'd buy.
Of course, the Athlon64/Opteron would get quite a bit of consideration due to my hobbies.
But I think it'd end up being the Mac.
... in The Matrix. That strange feeling of deja vu can only mean one thing! Either that or the /. editors are asleep at the wheel again.
Floating point ops, optimized for graphics processing and things like compression (jpeg, mpeg, mp3). If you read the Ars article he waxes on about it's superiority over MMX/SSE/SSE2.
Also: "Water is special stuff that makes stuff float."
"The CPU does important stuff."
For all of your "What is AltiVec?" needs, check this out instead:
http://www.motorola.com/SPS/PowerPC/AltiVec/
Mikey-San
Karma: +Eleventy billion (mostly affected by watching Celebrity Jeopardy)
Apple'd be putting DDR400 on the G4 right now if they could. None of this (well, except the decision to go Moto) was their fault.
My real problem with the current G4e situation, aside from the 167 SDR FSB, is the fact that it's a shared bus topology, which is just ridiculous. To my knowledge, there's nothing stopping Apple from putting out a chipset that gives each G4e a dedicated FSB (even if it's still 167MHz SDR) to the chipset.
As far as the low MHz and SDR situation, I've also never been totally convinced that Apple wasn't partially to blame for this either, unless they just have zero clout with Moto SPS.
I believe that Hannibal mentions that the 970 is designed for SMP.. Clearly CmdrTaco is just testing its newest feature: you click post and the operation gets carried out by both processors.
Tierce
Tierce
Who sponsors your feelings?
Reading through the article, its nice to see some real design going into a processor. Looking through Intel's last few chips, they've been upping ther clock speed and packing in more cache.
Yeah, yeah, they are hog-tied because you can't easily re-compile the entire windows platform to use new instruction sets. Linux users, of course, don't have this problem (muhahahah).
Did anyone else catch the bit on the twin FPU's? I'm just imagining what this thing is going to do with vector operations and frequency transforms.
For most of you non-engineers:
-Most 3d vector operations are affine tranformations. Using a 4x4 array of floating point numbers you can translate, rotate, and scale. Works beautifully, but it's a lot of calculations.
-The Fast Fourier Transform (FFT) is used a lot in signal processing. It's a floating point monster.
But the reality of regular high-end computing is that people don't have the time to optimize their software for the latest oddball hardware platform. And even something like a hand-coded vectorized BLAS library doesn't help because most scientific software still doesn't use such libraries.
ATLAS is a BLAS implementation that is tuned for each system that it runs on. The people at Mathworks use this as the underlying BLAS system in Matlab. Mathematica Maple, etc. use this as well. There is even a G4/AltiVec optimized version available here. This is the whole point of layered software.
AltiVec is nice for somethings.
My iTunes ripping of mp3s nearly tripled when I went from a 466 MHz G3 to a 400 MHz G4 due to iTunes being optimized for AltiVec.
Some Photoshop actions and filters see up to 800% improvments.
Running iMovie exports on a 600 MHz G3 iMac take 2-300% longer than on a 400 or 500 MHz G4
AltiVec is important for Apple marketing because it lets them claim impressive performance figures without actually needing to push the state of the art in terms of processor design further than Intel.
No, AltiVec is important for Apple full stop - in the short term to make up for the anemic bus speeds allowed by the G4, and in the longer term because a SIMD unit is now as expected a component of modern desktop CPUs as an FPU is.
And even something like a hand-coded vectorized BLAS library doesn't help because most scientific software still doesn't use such libraries
The only thing you can really sure about "most" scientific software is that it needs an FPU. Scientists and engineers do a huge variety of simulations, some of which are vectorizable and some of which aren't.
If AltiVec has a weakness in the scientific field, it's the lack of support for double precision. And there's nothing in the instruction set which precludes this, so I wouldn't be surprised to see it appear in some future CPU.
Imagine how much better it would be if Apple could ship systems based on the 970 today, rather than after a few months additional delay due to AltiVec.
If it didn't have AltiVec, it wouldn't be what Apple needs in a desktop CPU - not much point in getting what you don't need a few months early (not like that would happen anyway: this isn't lego: you can't unplug "the AltiVec bits" without any impact on the rest of the design).
And every dollar and watt that is shaved off the AltiVec price makes it a much more viable processor for servers and blades, which would get volume up and prices down.
Except that Apple aren't currently in the blade market at all, and have a fairly small presence in the more general server market. If they can sell a few boxes there, fine, but getting the volume up means targetting consumers - not server farms.
Nae bother
Don't confuse "new" with "state of the art". The former is just something that hasn't been done before. The latter is something that yields "impressive performance figures". If Altivec is competitive with Intel, then it is state of the art, by definition, even if it's 20 years old. The CPU cache is a decades old concept, yet CPUs with caches are still state of the art.
Imagine how much better it would be if Apple could ship systems based on the 970 today, rather than after a few months additional delay due to AltiVec.
Don't underestimate the cost of software. Your idea is expensive, because it requires software vendors to maintain two different versions of their code. This can lead to buggier or more expensive products, or it can lead to the "abandonment" of the G4 installed base. That could easily be worth the few months for Apple.
You'd be surprised how much stuff in Mac OS X is AltiVec optimized. Even memcpy gets a boost from it. It's a lot more than just a "gimmick".
Free Hans!
Altivec is Motorola's name for the vector processing unit. The unit handles SIMD commands. SIMD stands for Single Instruction, Multiple Data. Basically, intead of looping through a list of 50,000 values one by one and multiplying each value by PI for instance, you simply tell the CPU where the list is, and to multiply it by PI.
In a much simplified analogy, it's like lighting 200 candles with a flame thrower instead of one by one with a match.
Article X: The powers not delegated... by the Constitution...are reserved...to the people