Slashdot Mirror


IBM PowerPC 970 Architecture

riclewis writes "Hannibal from Ars Technica offers an explanation of some of the internals of the new IBM chip. It's certainly more powerful than anything on the desktop now, but by the time it's released a year from now, it looks to be middle-of-the-pack (which could still be a step up for Apple...) This excitement over the early release of hardware specs kinda reminds me of all the hype surrounding the Sony's Emotion Engine when it was introduced a couple years ago. In fact, some are suggesting the PPC 970 chip might be closely related to the PS3's 'Cell' processor..."

26 of 262 comments (clear)

  1. Re:IBM? by mccalli · · Score: 5, Funny
    Are they good or evil today?

    Depends....where do you want to go today?

    Cheers,
    Ian

  2. PS3 apple? by barfarf · · Score: 5, Insightful

    "In fact, some are suggesting the PPC 970 chip might be closely related to the PS3's 'Cell' processor..."

    Even though it's really doubtful, it'd be extremely cool to see a PS3 emulator on the mac if the processors are that closely related.

    I remember running Mac OS 6.0.5 on my Atari ST. Because it had the same processor, it didn't need much to make it run.

    Oh well, I can at least dream, can't I?

    1. Re:PS3 apple? by Arcturax · · Score: 5, Interesting

      Oh wow, now that brings back some memories! I remember a friend of mine bringing over this ST he got at a garage sale and had a Mac emulator disk on it. Since I was a Mac person and he wasn't he asked me to help him get it running.

      We managed to get System 7 running on it and even managed to coax AOL 2.x to run on it via the modem, getting it online! It was slow and it was AOL running on a Mac emulator on an Atari, but hey, it was geeky and it was fun to do.

      But back on topic, if they do use the same or similar chip it could possibly work though Sony would DMCA it straight to hell.

      --

      --Won't that be grand? Computers and the programs will start thinking and the people will stop. - Dr. Walter Gibbs
    2. Re:PS3 apple? by Tim12s · · Score: 5, Interesting

      Ouch. I just had a very nasty thought that you might be very Very wrong. Assuming Jobs is that insideous, I'd give it a slim chance. IBM brokers the deal between Sony and Apple to bring economies of scale to the 970 and. . .

      Sony wants to sell less hardware and sell more games with a higher per title margin. Selling hardware at a loss is typical of the console market.

      IBM supplies the goods, Sony & Co supplies the reference PS3 platform and games backing.

      Apple continues building its brand name computing with the ability to run PS3 games. That gives Apple a MAJOR supplier of video games. People who own an Apple typically will be able to afford the various PS3 games. ... now realise that Sony is able to ship a number of exclusive titles, without making a loss on a console AND maintaining their premium on their titles.

      -Tim

  3. SPEC INT and SPEC FP numbers (937 and 1051) by WittyName · · Score: 4, Insightful

    Not that hot NOW! They will have a lot of competition in that space with Opteron/Clawhammer, and the new Sparcs.

    Still, glad to see something other than incremental progress.

    --
    The law is a weapon of the government, not a protection for the likes of you. Surely you understand that.
    1. Re:SPEC INT and SPEC FP numbers (937 and 1051) by Visigothe · · Score: 4, Interesting

      Also keep in mind that SPEC marks are *highly* manipulatable. Here you have a benchmark that is supposed to test the CPU. The problem with this is that is both compiler dependent *and* OS dependent . As has stated many times before, the current G4 machines score in the low 300s in SPEC marks. Does this mean that the G4 is 3 to 5 times slower than the P4? In practice, it isn't. Yes, the P4 2.8 is much faster than the current G4 in most day-to-day activities, but not by *that* much. Anyone can "cook" SPEC marks.

      What you *really* want to do is use the machine, *then* consider whether or not the machine is fast enough for your purposes. Personally, I think that machines with the 970 in them will be quite competitive with the machines that are available at launch.

      .

  4. For the off-the-shelf computer consumer... by crumbz · · Score: 5, Interesting

    This could help push Apple back to a respectable market share over a couple of years. A *nix box with a decent processor and lots of commercial software? Of course, Apple has proved to be just as fierce in protecting their proprietary code as Microsoft, so I wouldn't expect the price to drop significantly for every million sold. But still, alternatives (especially of this caliber) are good.

  5. Chunks of five by Faggot · · Score: 5, Interesting

    Unlike the P4, the 970 does one more trick after it has cracked the PPC instructions down into iops. The 970 divides up the iop stream into "groups" of five iops a piece. So first it cracks the PPC instructions down into iops, then it collects the iops back together into groups. The iops are placed the group's five slots in program order with the stipulation that all branch instructions must go in slot 4 (the last slot). Furthermore, slot 4 can hold only branch instructions and nothing else. It is these groups of five iops that are dispatched in-order to the issue queues. (I haven't yet seen a functional diagram of the 970's core, so I'm not sure how many issue queues there are.)

    computing in chunks... sounds a lot like a Cray. Together with the 900MHz-effective (jesus... that's a lot) FSB, Apple really will be selling supercomputers in the next few years.

    --

    But what do I know. I'm just looking for anonymous gay sex.

  6. will it scale? by vicarina22 · · Score: 5, Interesting

    when the p4 debuted it was sort of average too... the p4's power has come from it's ability to scale to higher mHz ratings pretty quickly. what kind of life are they going to get out of this chip? if it's going to top off at 2gHz then it doesn't really seem worth it, but if they can chip can get up to 3 gHz or so within a year of its release...

  7. It's a shame for apple that IBM announced this. by bluemilker · · Score: 4, Insightful

    I mean... it's great news that Apple won't have to rely on Motorolla's decidedly passive desktop chip development strategy anymore...

    But man. First off, this kills any possibility of a big surprise hit. Second, this dooms apple sales for the next year or so... who wants to buy a stagnating desktop model when the next edition has so much promise?

    Then again, Apple's desktop offerings have been a little stagnant anyway... most people probably won't want to play the waiting game for as long as it'll take for these to come out.

    I just hope that by the time they do, they're worth it.

    1. Re:It's a shame for apple that IBM announced this. by Arcturax · · Score: 5, Interesting

      You are right, most of us can't wait, at least those of use with really old Macs (about to retire my Beige G3 in fact). I just ordered one of the Dual 1.25 GHZ machines and that should be more than enough power for me for some time. I'll move up to a 64 bit Mac in 3-5 years when they've worked out the kinks and about the time most people quit making 32 bit apps.

      I did at least learn my lesson with the Beige G3 when it comes to jumping onto the latest thing just as it first comes out. While my old Beige G3 Rev A box has been a fairly solid machine for the past 5 years, it does have some serious shortcomings (possible voltage regulator blow out if upgraded to a G4, 66 MHZ bus (ick) and Rev A rom means no IDE slave support!).

      I feel fairly confident this possibly last of the line G4 should be fairly solid other than the chips not fully utilizing DDR (at least DMA operations will take advantage of it) and the silly idea of making the second IDE channel only ATA66.

      Once the issues of moving to a 64 bit chip and the new Hypertransport bus and such are worked out and my machine starts to look as slow as my Beige G3 is now compared to the latest machines, then I will start itching to move up.

      --

      --Won't that be grand? Computers and the programs will start thinking and the people will stop. - Dr. Walter Gibbs
  8. Vick's Vapo Rub.... by Anonvmous+Coward · · Score: 4, Funny

    " In fact, some are suggesting the PPC 970 chip might be closely related to the PS3's 'Cell' processor...""

    Ah, so it runs on vapor instead of smoke?

    *wonders if anybody'll get that.*

  9. Re:Apples and oranges by Christopher+Thomas · · Score: 4, Insightful
    As has been well-documented, Macs perform just as well as Windows machines. The slower clock speed of PowerPC compared to Intel is made up by the lack of code bloat is Mac OS compated to Windows.

    A couple of points to throw water on this:
    • Not all x86 machines run Windows.
    • Windows "code bloat" mainly slows down the interface, as opposed to number crunching, which is where you'll be processor limited. It also eats memory for breakfast, but if you're doing anything computation-intensive you'd *better* be throwing memory at the machine no matter what.
    • Apple became allergic to producing useful benchmark comparisons some time ago. Find me SPECmarks on an actual Apple machine submitted within the past 3-4 years. All you find are photoshop benchmarks.


    Apples are certainly wonderful machines, and Windows certainly is icky most of the time, but be prepared to back up any benchmark statements with actual benchmarks.

    Also, PowerPC and Intel/AMD are two different types of processing, so they can't really even be compared.

    Um, no.

    All general-purpose microprocessors perform certain basic tasks upon which everything else is built - integer and FP math, memory access, and control flow operations. Processors take different approaches in how they implement these functions, but the interfaces presented to programmers - even assembly programmers - are very similar [and yes, I've done assembly on multiple platforms].

    You can also completely ignore architecture and take test programs that you think are representative of the kinds of tasks found in different types of application, compile them for both platforms, and measure how long it takes to do the same amount of work on each machine. This is the _foundation_ of benchmarking.

    If the machines were completely different, you wouldn't be able to do the same tasks on them!
  10. Re:Apple Chips by WittyName · · Score: 5, Informative

    The PowerPC 970 triples the length of the PowerPC pipeline

    This will give it the same issues the P4 has. Namely a large penalty for branch mispredicts, etc. Instructions per clock will decrease.

    OTOH, they should be able to crank the speed!

    --
    The law is a weapon of the government, not a protection for the likes of you. Surely you understand that.
  11. This is gobbledegook to me... by Anonvmous+Coward · · Score: 4, Interesting

    ... so could somebody who understands this processor tell me this:

    Would a 3D rendering app such as Lightwave potentially see a huge benefit to this processor? I understand that it's up to the developer to tune it, yadda yadda yadda, I'm concerned with potential not real world numbers.

    I'm trying to get an image in my mind about how the various processor descriptions (32-bit, 64-bit, Altivec, SimD, etc...) can radically change how an app like that would work.

    Us vertex pushers have a substantial interest in machines that excel at that type of work...

    1. Re:This is gobbledegook to me... by Visigothe · · Score: 5, Informative

      Well, I'll try.

      rendering apps like Lightwave, Maya, etc will benefit from this for several reasons:

      The 64bit architecture:
      Lightwave [if rewritten to be 64bit] will be able to use bigger numbers, and use more memory. Bigger numbers means that calculations that would involve making a 64bit word out of 2 32bit words [as it currently stands] needn't be done. Being able to address more memory is *always* a good thing.

      Really good Floating Point Performance:
      3D rendering apps love FP. bigger/faster/more Fp units are a good thing.

      Memory Bandwidth:
      The 900MHz bus will allow a *huge* amount of memory to be shuttled back and forth from the processor *very* quickly. This means your huge scenes will be rendered faster.

      Altivec/Vector Processing unit:
      Because the VPU doesn't do double precision FP, it doesn't help in the final rendering [much]. It *will* help in things like realtime previews, where the math is simplified. Imagine *big* previews of scenes in realtime.

      Multiprocessing:
      This chip is [as implied] MERSI compliant. This means that it is a perfect candidate for multiprocessing, like the current G4.... but the 970 can go many more "ways" than the G4 [the G4 was in an "optimal" multiprocessing stage with 2 procs]. The 970 can go up to 16, IIRC.

      This seems like it'll be a winner.

      .

  12. Biased how? by X_Caffeine · · Score: 5, Interesting
    It's clear that they're somewhat fans of Macs, but they don't buy the Jobs Distortion Field claims that a 1ghz G4 (or even dual 1ghz chips) are as fast as a 2ghz P4. They've been totally up-front and honest about the strengths and weaknesses of OS X, too. Ars Technica has consistently had the most fair and balanced coverage of Apple that I've seen anywhere.


    "Hannibal" also has an incredible knack for making the workings of microprocessors understandable to those with no hardware engineering backgrounds.

    --
    // I will show you fear in a handful of jellybeans.
  13. Re:Apple Chips by Visigothe · · Score: 5, Informative

    > Instructions per clock will decrease.

    Actually, IPC is *increased* from the current G4. It will now fetch 8 instructions per clock, and retire 5 per clock.

    The current G4 IIRC fetches either 3 or 4 per clock. I have no idea how many it can retire at once.

    This coupled with a quick move to a .09 process shows me that this 970 chip has legs. Another thing... IBM has *always* been conservative about what not-quite-ready chips will do as far as clock, and benchmarks. I expect "Real World" [no relation to Peter Gabriel] performance to be quite good. [although I expect Peter Gabriel's performances to be fantastic =)]

  14. Re:Apple Chips by Don+Negro · · Score: 5, Insightful

    Does it?

    The eetimes story linked at the top says it's an 8-stage pipe. That doesn't mean any more or less than the extreme tech statement that the new pipe is triple the length (which would be 21, the current pipe is 7) since we haven't seen any actual reference docs from IBM.

    Can anybody who was at the Microprocessor Forum give us more info?

    --

    Don Negro
    Perl 6 will give you the big knob. -- Larry Wall

  15. Re:Oh look by frankie · · Score: 5, Informative
    if you can't tell the difference between chip architectures by now, there's no point in trying to explain.

    I have no idea who you are, Mononoke, but I'd wager $1000 that Hannibal Stokes knows more about chip architecture than you do. The PPC 970 will have a hard fight (both in marketing and benchmarks) against the 4+GHz x86 chips also due a year from now.

    p.s. How the heck did that get rated as Insightful? I'm as rabid a Mac addict as any of you, but it's just plain wrong to mod someone up for spouting false evangelism.

  16. 64-bit provides many advantages. by Inoshiro · · Score: 4, Informative

    Once you move beyond a 4.5billion, into the realm of 18.5 (two orders of magnitude past trillion), you can address anything for the forseable future (since you can count each year until the heatdeath of the universe this way, for example).

    For vector operations, 64bit words make for some fast math operations, since you can pack more 32-bit integer components into each bus transfer.

    For floating point, it means you have greater precision in hardware (allowing things like real physics and shapes to be modelled without noticable issues caused by subtle number creep). Since most systems use IEE-784 (64bit double precision floating point), it means a speedup to that software since you're not working with it as 2 32-bit operations.

    In terms of storage space, it means you can address more than 2,199,023,255,552 bytes (~2 terabytes) of disk space (assuming a 512-byte sector). This is important for people with big RAID arrays today, and people with ludicrously big Maxtor drives 3-4 years from now.

    For RAM, it means you don't have to worry about your server topping out at 4 gigabytes of RAM. It also means that your VM space has no effective limitation for the forseable future (very useful for people working on large projects, trying memory-intensive algorithmic approachs to traditionally NP-hard problems, or distributed computing problems).

    I'm sure I missed a lot of the benefits even with this list. As you can see, 64-bit is not just a number game. It is 32 orders of magnitude larger than 2^32, meaning our grandchildren will probably still be using 64bit machines with no limitions being apparent (unlike 16-bit to 32-bit, which only moved from 65k to 4.5 billion in terms of addressable amounts of something).

    --
    --
    Internet Explorer (n): Another bug -- that is, a feature that can't be turned off -- in Windows.
  17. Re:Apple Chips by be-fan · · Score: 4, Informative

    Actually, the DDR thing is a little misguided. The real reason DDR had no effect was because the 2.1 GB of memory bandwidth was feeding into 1.3 GB/sec of processor bus bandwidth.

    --
    A deep unwavering belief is a sure sign you're missing something...
  18. Apple is OK, Mac user's aren't performance freaks by AHumbleOpinion · · Score: 5, Insightful

    Contrary to some of the opinions presented recently it is just fine for Apple to use the 970 and be behind the curve with respect to typical performance. Sure there are specialized apps that can leverage a RISC architecture to outperform x86 or leverage Altivec to outperform SSE, but that is a small minority. Typical performance lags behind PC a little but we are in a situation where PCs and Macs have more performance than most people actually use. Most folks out there in the real world will get along very nicely with a 1GHz PC or a 800MHz Mac. Very few people need 2.xGHz machines, and only a few more have enough disposable income to buy those machines for Quake FPS pissing contests :).

    The real Apple problem is that the gap between typical PC and typical Mac performance is starting to grow beyond the range that has historically shown to be viable. Not a problem today, standard dual CPUs counter this to a degree, but it's likely to be a problem in a year or two. While the 970 may only perform like a 3GHz P4 (SPEC), lag whatever Intel/AMD has in a year or two, it will be close enough. Apple will be back to a point where the typical performance gap is small enough. Apple has sold tens of millions of Macs that lagged PC counterparts in performance. They know that their customers are more interested in ease of use. Performance wise close-enough is all they need.

  19. Re:Apple Chips by Shanep · · Score: 4, Interesting

    "Mhz doesn't matter"

    The MHz Myth that Apple talks about is not about trying to say that "Mhz doesn't matter", it's about the fact that MHz cannot be used as a direct comparison between architectures.

    Of course MHz (brute force) matters. But what also matters is smart design.

    I think showing a 333MHz G3 running faster than a 500MHz Pentium III, kinda proves the MHz Myth is just that. Bear in mind, that the G3 is not AltiVec equiped! So not getting a huge vectorized benefit here.

    If you think that's impressive, look at the G4! I can't wait to see what CPU Apple actually unleashes next.

    I'm astonished that there are actually people who think MHz is THE sole number to go by.

    --
    War crimes, torture, lies, illegal spying... Would someone give Bush a blowjob, already, so he can be impeached?
  20. Re:Apple Chips by Shanep · · Score: 5, Informative

    PPC chips can only work on one swing of the computing "cycle", not on the up and down like an Athlon can for example

    It's called positive and negative edge triggering. It's not a new technology either. I was dealing with it in the 80's at the discrete logic level.

    AGP 2x uses this and 4x uses positive, negative, high and low triggering. Certain UDMA modes make use of this clocking technique also.

    Your argument doesn't hold water.

    His arguement DOES hold water. PPC CPU's DO outperform Intel x86 CPU's by a good margin when compared clock for clock (showing the MHz Myth for what it is). Especially the G4 and boy when AltiVec can and is exploited... Wow. There IS more to CPU design than smaller die and deeper piplining for higher MHz.

    As far as I can tell, Apple seem to be in a position where they have to make the best of what they can get, due to Motorolla dropping the ball pretty baddly.

    I hope IBM comes to their rescue. How ironic.

    --
    War crimes, torture, lies, illegal spying... Would someone give Bush a blowjob, already, so he can be impeached?
  21. Show me the money by Andy+Dodd · · Score: 4, Informative

    http://www.heise.de/ct/english/02/05/182/

    SPEC benchmarks for the G4 processors. (Not a synthetic benchmark issued by Apple, but by an unbiased third party, SPEC)

    G4 1 GHz SPECs at 306 integer 187 floating-point
    Interestingly, the 1 GHz G4 was almost neck-and-neck with a 1 GHz PIII (http://www.heise.de/ct/english/02/05/182/qpic02.j pg)

    http://www.spec.org/osg/cpu2000/results/cpu2000. ht ml
    A large archive of SPEC results for many CPUs, including x86.

    A few choice results:
    1.2 GHz Athlon (Ancient by today's standards) - 443 integer, 387 FP
    Athlon XP 1700+ on an Epox EP-8KHA (Happens to be my mobo - Slowst Athlon XP listed for this mobo):
    633 integer, 561 FP
    Dell Precision Workstation 330, 1.3 GHz P4 - 474 integer, 502 FP (The P4 doesn't seem to be taking too much of a branch misprediction hit here)

    So in the case of G4s, while they may be a bit more efficient MHz for MHz (And the P3 vs. G4 benchmarks so that this isn't even necessarily the case), the fact that they're so far behind on the clock speed curve hurts them badly.

    If you want to see a good example of MHz not being everything, check out the benchmarks of Alpha systems - The 750 MHz ones chew even 1.2 GHz Athlons for lunch. But don't look at Apple...

    Also interesting in the case of the SPEC benchmarks run by Heise - MS C pays a 10-15% performance hit over GCC in the SPEC benchmarks.

    --
    retrorocket.o not found, launch anyway?