Slashdot Mirror


Design Philosophy of the IBM PowerPC 970

D.J. Hodge writes "Ars Technica has a very detailed article on the PowerPC 970 up that places the CPU in relation to other desktop CPU offerings, including the G4 and the P4. I think this gets at what IBM is doing: 'If the P4 takes a narrow and deep approach to performance and the G4e takes a wide and shallow approach, the 970's approach could be characterized as wide and deep. In other words, the 970 wants to have it both ways: an extremely wide execution core and a 16-stage (integer) pipeline that, while not as deep as the P4's, is nonetheless built for speed.'"

200 comments

  1. unfotunatly Apple is going with Intel instead... by esome · · Score: 0, Offtopic

    at least according to this amusing article at MacCentral

  2. What is: 2H03? by Malc · · Score: 2, Interesting

    When will the "projected 2H03 release date" be? I'm not familiar with this term.

    1. Re:What is: 2H03? by ajakk · · Score: 2, Informative

      Second half of 2003.

    2. Re:What is: 2H03? by TheViffer · · Score: 5, Funny

      2h = second half
      03 = 2003

      In the case of Blizzard that means Fall 2005.

      --
      -- Knowing too much can get you killed, but knowing who knows too much can make you rich.
    3. Re:What is: 2H03? by Montreal+Geek · · Score: 1

      That would be the second half of 2003, I.e.: somewhere late in 2004. :) -- MG

    4. Re:What is: 2H03? by FreeLinux · · Score: 3, Informative

      Second half 2003. Which almost always slips so the real meaning is assumed to be Q403(4th quarter 2003) or even Q104(First quarter 2004).

    5. Re:What is: 2H03? by teamhasnoi · · Score: 3, Funny

      In the case of Duke Nukem' Forever, that means just after our sun expands to a red giant and swallows the earth and moon.

    6. Re:What is: 2H03? by mholt108 · · Score: 2, Funny

      It means to finger the 2nd fret and then hammer onto the third -- duh!

    7. Re:What is: 2H03? by Anonymous Coward · · Score: 0

      Who cares? Go out and get laid!

      Tarbaby, now going off to hang with her niggahz!
      http://www.askheartbeat.com/cgibin/ultim atebb.cgi

    8. Re:What is: 2H03? by trunc · · Score: 0

      In the case of Halo for the PC and Mac, that means never.

  3. Re:Will this save apple? by Anonymous Coward · · Score: 0

    Says who? Apple is doing just fine and will be without the crap that is Motorola in a year. I see nothing but a bright future for them with some real hardware.

  4. It's got nothing on the Pentium 4 by tcd004 · · Score: 5, Funny
    1. Re:It's got nothing on the Pentium 4 by Anonymous Coward · · Score: 0

      And you are a complete idiot as evidenced by this post

  5. IBM Chip by MadocGwyn · · Score: 2, Troll

    Is this not the chip that was a co venture between apple and ibm? Is this also no the chip that apple intended to use? is this also not the chip thats a 'pc' chip built on RISC? Personally I wanna see what this thing can do before damming it or pronounceing apple dead. Its a year from now, the specs could be COMPLETLY different by then

    --
    Jesus saves, everyone else takes full damage from the fireball.
    1. Re:IBM Chip by MadocGwyn · · Score: 1

      How is this a troll might I ask? Story my info comes from is here, as well as present article Here/a

      --
      Jesus saves, everyone else takes full damage from the fireball.
    2. Re:IBM Chip by Anonymous Coward · · Score: 0

      In my comparison of the design philosophies behind the P4 and G4e, I tried to summarize each processor's overall approach to organizing its execution resources to squeeze the most performance out of today's software. I characterized the G4e's approach as "wide and shallow." The G4e has a relatively "wide" execution core that contains nine functional units, and the processor's front end can issue up to 4 instructions per clock (3 instructions + 1 branch) to these functional units. Each of these functional units has a fairly short pipeline, so the instructions take very few cycles to move through and finish executing.

      Because it focuses on getting a small number of instructions onto the chip at once, spreading them out widely to execute in parallel, and then getting them off the chip in as few cycles as possible, at any given moment the G4e can have at most only 16 instructions spread throughout the chip in various stages of execution. Compared to both the P4 and the PPC 970, this instruction window is quite small.

      The P4, in contrast, takes a "narrow and deep" approach that involves moving a large number of instructions onto the chip and pushing them through fewer functional units in more, faster cycles. The P4's 20-stage pipeline can hold up to 126 instructions in various stages of execution (as opposed to the G4e's 16). When you combine the P4's high clock speed with its very large instruction window, you get a processor that grind through a large volume of instructions in a rapid-fire, serial manner.

      If the P4 takes a "narrow and deep" approach to performance and the G4e takes a "wide and shallow" approach, the 970's approach could be characterized as "wide and deep." In other words, the 970 wants to have it both ways: an extremely wide execution core and a 16-stage (integer) pipeline that, while not as deep as the P4's, is nonetheless built for speed. Using a special technique, which we'll discuss shortly, the 970 can have a whopping 200 instructions on-chip in various stages of execution, a number that dwarfs not only the G4e's 16-instruction window but also the P4's 126-instruction one.

      You can't have everything, though, and the 970 pays a price for its "more is better" design. When we discuss instruction dispatching and out-of-order execution on the 970, we'll see what tradeoffs IBM made in choosing this design.

    3. Re:IBM Chip by Hast · · Score: 2

      It's a scaled down version of the Power4 CPU which IBM use in their mainframes. It's also been designed to be compatible with Motorolas AltiVec vector processing.

      It's very likely that Apple intend to use it in future high end boxes. (They would be rather stupid not to.) Unless Motorola can one-up IBM by then.

      I don't think it has all that much to do with Apples future though. Apple boxes have not really been about high performance lately, and people buy them anyways. Mostly becuase they like the OS/apps and such I assume.

      And the specs are pretty much solid now I assume. Creating hardware is /very/ different from software were you can update and patch it to the last minute. They might be able to tweak the clocks a bit, but don't expect a lot more.

  6. Question by wcbrown · · Score: 5, Interesting

    What's the difference between the Power4 and the PowerPC 970? As a Mac guy, I've been following all of the rumors and announcements with interest but I keep seeing the PPC970 referred to as a scaled-back version of the Power4.

    Why wouldn't Apple go with the Power4 over the PPC970? And I already know that nothing official has been announced by Apple and that this is all probably going to be a lot of sturm und drang signifying nothing, but that's what keeps us Mac guys going I guess.

    1. Re:Question by Faggot · · Score: 3, Informative

      The PowerPC 970's design is adapted from IBM's successful Power4 server processor. Physically smaller, the PowerPC 970 sacrifices some execution units -- including the Power4's second processor core -- for 64-bit compatibility and the SIMD unit.

      While the Power4 core has two processor cores and massive caches for MP implementations, the PowerPC 970 has only one processor core, an SIMD unit and a 512K on-die L2 cache. The cache includes error correction. The PowerPC 970, as described today, has no connectors for an L3 cache.

      --

      But what do I know. I'm just looking for anonymous gay sex.

    2. Re:Question by mfago · · Score: 5, Informative

      the PowerPC 970 sacrifices some execution units -- including the Power4's second processor core -- for 64-bit compatibility and the SIMD unit.

      This implies that the Power4 is not 64 bit -- which is of course wrong.

      I would say that the PowerPC 970 trades the second core and fancier interconnects of the Power4 for lower power, cost, and the SIMD unit.

    3. Re:Question by afidel · · Score: 2

      No, the PPC970 loses a number of execution types (read the recently linked article about the POWER architecture in contrast with PPC), the second core and the extremely large cache of the Power4 and gains SIMD and lower power usage. Basically it's a desktop cpu instead of a big iron cpu.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    4. Re:Question by WatertonMan · · Score: 2, Informative

      As for why Apple wouldn't go for the Power4, it is because it simply is too expensive. Further Apple wants something like the Altivect unit that is in current G4s. Power4 is simply optimized for non-desktop uses and is overkill for what Apple needs. The 970 is a nice balance between Apple's needs and the Power core. Further by moving to IBM Apple is able to get a far better provider than Motorola whose G5 has been missing in action for some time now.

    5. Re:Question by Coz · · Score: 4, Informative
      Another reason (in addition to the excellent ones other folks have listed) - cost. Power4 chips are over-engineered, compared with "consumer" CPUs like the G4, P4, and 970. Hannibal's article mentions that at the same clock speed, some instructions execute faster on the 970 simple because of the thickness of the oxide layers used in the transistor gates. It's a different emphasis - high reliability and expense versus "less" (still acceptable to 80% of the world) reliability and acceptable mass-production cost-per-chip.


      Plus, the Power4 is really designed as a server/Big Iron chip - it's really 2 CPUs on 1 die - and that's just not what an iMac needs.

      --
      I love vegetarians - some of my favorite foods are vegetarians.
    6. Re:Question by mfago · · Score: 5, Informative

      The reason that Apple won't use the Power4:

      It is HUGE.

      The picture at the top right shows the Power4 multichip module as used in the p690. Yes, it is the 5" square thing in the guys hand.

      There are better pictures of the MCM itself, but I couldn't find the close-up showing just the MCM in someone's hand.

      The large size (along with everything it entails: it uses 125W power, and supposedly costs about $3500 to manufacture) is one indication that IBM designed the Power4 for its big-iron. Nevermind that IBM does offer the Power4 (sans MCM) in some of their smaller servers.

      The PowerPC970 is the equivalent processor tweaked for the desktop/low-end servers.

    7. Re:Question by Anonymous Coward · · Score: 0
      Why wouldn't Apple go with the Power4 over the PPC970?

      1. IBM won't sell them.

      2. Even if they did... the current Power4 Multi-chip module (just the CPU cores) likely costs more a high end Mac all by itself. The non multi-chip module versions of Power4 won't be "cheap" either. Apple cannot use a >$1,000 per unit CPU. Boxes of that type just aren't going to sell in the mass market. In a $10,000 server it makes much less of a different if the CPUs alone cost you $3,000. There is still $7000 to buy the rest of the stuff that goes in the box and still eek out a profit.

      For Power4 think very high end Xeon Processors. You don't see those in desktop boxes... let alone notebooks. However, Xeons do share a substantial amount of technology with the plain "everyday" P4s. That's what the 970 is. Power4 package for more mundane usage. Plus IBM will actually sell you one.

    8. Re:Question by firewood · · Score: 1
      Why wouldn't Apple go with the Power4 over the PPC970?

      The Power4 is very expensive device to manufacture. Just the chip packaging alone probably cost more than a high-end PowerMac.

    9. Re:Question by Billly+Gates · · Score: 4, Informative
      As evident in the article, the transistors and logic gates in the power4 are biger and alot more expensive to produce to increase reliability. This is not needed in a desktop and it slows down performance since the gates can't switch as fast. I find this claim hard to believe since any freebsd and linux (2.2 and earlier kernels) can run for months and years without a single reboot.

      The power4 costs 4-5k per cpu. Obviously too expensive for desktop systems because of high end server features and very large caches in the chip that will offer no performance benefit to desktop apps. Only heavily threaded multitasking apps running in parrallel will see the performance improvments by a power4. A web server running servlets and databases are the examples I refer to as heavily threaded multitasking applications. Adobe photoshop will show little performance difference and may even run slower on a power4 vs a powerpc 970 due to the lack of simd instructions.

      IBM did good with this processor and its leaps and bounds ahead of the g4. The main limitation of the g4 is the lack of ddr memory support. In ddr macs the chipset has to slow down memory access to the cpu to 133mhz speeds and it creates a very serious bottleneck. This alone is bottlneckintgthe processor down to half its potential in +1 ghz processors. Expect a %200-300 performance increase with these new processors.

    10. Re:Question by alannon · · Score: 2

      My guess was that he meant "for 32-bit compatability". As far as I know, the Power4 is not binary-compatable with 32-bit PPC executables. IBM would be foolish to release a DESKTOP PowerPC chip that was off-the-bat incompatable with all current software made for Apple's brand new OSX.

    11. Re:Question by Zueski · · Score: 2, Informative

      Actually, its based on the Book-E standard, same as the 601, 603, 604, G3, G4, etc. So, assuming all the execution units are present (i.e., AltiVec), it should run the same code. Book-E is a 64 bit archetechure that can be implemented as a 32 bit version.

      --
      please don't feed the monkey
    12. Re:Question by mfago · · Score: 2

      the Power4 is not binary-compatable with 32-bit PPC

      As long as you don't compile with -arch=pwr4 (etc), xlc (IBM's C compiler) will compile the code to run on any Power or PowerPC.

    13. Re:Question by Alan+Partridge · · Score: 1

      the latest PowerMacs actually use a 167Mhz FSB, but that's still WAY too slow

      --
      That was classic intercourse!
    14. Re:Question by WatertonMan · · Score: 1
      This is actually something I don't understand. On the one hand the 970 is single processor. Yet at the same time, as the Ars article explains, IBM clearly intends it for multi-processor systems. (i.e. like the dual 867 G4e that is popular at the moment)

      If this will likely appear in multiprocessor systems with 2-4 processors, why make each die single processor?

      I'm sure there is a reason - likely heat. Further Apple clearly will be selling single processor systems. (i.e. iBooks and iMacs) However it seems like there may be advantages to a dual processor on a single die as opposed to using the external bus to communicating. Perhaps this will be the next version of the 970?

    15. Re:Question by Anonymous Coward · · Score: 1, Informative

      If you had read the article you had pointed out
      you would have found out about the size and
      other things Quote from article
      "
      Each Power4 chip, with two CPUs, is packaged on a 4-inch-square ceramic multi-chip module with three other Power4 brethren. The multi-chip module, a 60-layer ceramic package, is filled with wiring to join the chips to neighbors, to 128MB modules of cache memory, and to a high-speed switch to reach the more distant CPUs.
      "

    16. Re:Question by ealar+dlanvuli · · Score: 2

      Basically it can be sumed up that the PowerPC 970 is a Power4 minus some "server perks" and manufactured with desktop level reliability in mind (everything becomes majorly cheaper when your chip can stop doing self diagnostics and chips failing in 5 years is slightly acceptable).

      They have a slightly divergent instruction set, but considering neither actually perform the instructions they "support" that's not anything worth noting.

      --
      I live in a giant bucket.
    17. Re:Question by Anonymous Coward · · Score: 0

      Putting two processors on a die isn't the same as putting two processors on a motherboard. PowerPC 970 isn't inherently limited from putting 2 on a die at a later date should it prove the cheapest way to ramp up performance.

      From the impressions I get they are trying to make the cheapest cpu possible, then if you need more power throw two in there (two cpu's that were designed for smp from the start are more expensive than desigining every one cpu as a 2 die chip hacked in half).

    18. Re:Question by Tower · · Score: 1

      Putting both processor cores on a single die also requires the duplication of the L1 caches, and (preferably, though not necessarily) the L2 cache as well. This creates a *much* larger, hotter chip, with lower yields, making it far less attractive to the desktop market.

      Without a duplicated L2 cache on the chip, the cores could easily become starved for bandwidth, and the number of cast-outs and line fills on the L2 would be greatly increased, lowering performance. This can be seen on the P-IV chips (Prestonia) with hyperthreading (Jackson technology, IIRC)... in that case, there isn't a second full core, just a replication of certain execution units and such, but has only one L2 cache. It does provide a solid 25-35% performance gain on a lot of typical workloads, but does add extra heat, etc.. Fun for the whole family.

      --
      "It's tough to be bilingual when you get hit in the head."
    19. Re:Question by softsign · · Score: 1

      Putting two CPU cores on a single die is also more costly, as you are basically doubling your failure rates. So if you were tossing out 5-10% of dies when they were single-core due to defects, it's within the realm of possibility that you'll need to scrap 10-15% of your dual-core dies, ceteris paribus.

      Even a miniscule difference in yields can mean big $$$.

    20. Re:Question by Anonymous Coward · · Score: 0

      That's an MCU module, which are 4 dual core power 4 chips, with a separate bus connecting each of the 4 dies so they don't tie up the memory bus when they go to comminicate to each other. IBM calls it a 'fabric', I think.

      I got to hold one of these things in my hand. Dense as lead.

      Cray also uses these things for their new supercomputers. They cool them by spraying flowinert on the module itself.

    21. Re:Question by Anonymous Coward · · Score: 0

      Looks small compared to some SGI cpus:

      http://mail.rochester.edu/~cm007i/indigo2/inside .3 .small.jpg
      http://mail.rochester.edu/~cm007i/img/ r10000.jpg
      http://www.reputable.com/prodpix/singl ecpu.jpg
      http://www.reputable.com/prodpix/dualcpu .jpg

    22. Re:Question by KowShak · · Score: 0

      That news.com article is over a year old...

      I'm sure die shrinkages have happened since then.

    23. Re:Question by Shuh · · Score: 2
      Putting two CPU cores on a single die is also more costly, as you are basically doubling your failure rates. So if you were tossing out 5-10% of dies when they were single-core due to defects, it's within the realm of possibility that you'll need to scrap 10-15% of your dual-core dies, ceteris paribus.
      But if they did their design right, they wouldn't have to have a dual-core and a single-core fab. They would just be able to cut the dead/slow die off the good one and sell it as a single, or else disable one and mount it in the same package and sell it as a single.
    24. Re:Question by softsign · · Score: 1

      True enough, but it still results in more silicon going to waste...

      Who knows? Maybe they'll do this at some point in the future? Perhaps, once they've worked out the bugs at their new fab, they'll be able to produce multi-core dies with the same yields as they anticipate from single-core dies.

  7. An overview of pipelining by PhysicsScholar · · Score: 3, Informative

    Some of you may have read an extremely wide execution core and a 16-stage (integer) pipeline in this article's write-up and been extremely confused. I took a few computer architecture courses back in my undergrad days, so I can refresh some of your memories as well as teach basic processor design to those of you who never got to attend a 4-year college and study computer chips in-depth.

    Basically, all modern processors are pipelined. This means that they execute various instructions at the same time. Whereas doing a load of wash, waiting for it to finish, putting it into the dryer, waiting to finish, and then folding would take 30 minutes * 3 steps * 3 loads = 4.5 hours, one could PIPELINE such a process, thus removing sequentialism and doing the first load, then while that's drying put the second load into the washer, and so on ... this takes a much shorter amount of time.

    This is all a processor really does. It does a FETCH, an INSTRUCTION DECODE, then an EXECUTION, then perhaps a MEMORY READ/WRITE, and then a WRITE BACK, perhaps. So this 16-stage pipeline can have 16 different instructions executed all at the same time, but just in different points of its execution. The example in CAPS above is a 5-stage pipeline that's similar to those in MIPS processors.

    Hope this was helpful!

    --

    Department of Physics and Atmospheric Science, Dalhousie University, Halifax, N.S., Canada, B3H 3J5
    1. Re:An overview of pipelining by Anonymous Coward · · Score: 0

      Quick someone knowledgeable about processors, is this a troll? It hasn't ended with PS's usual random meaningless senctence that gives it away, so I'm wondering, could he actually NOT be trolling for once?

    2. Re:An overview of pipelining by LumpishGenius · · Score: 1

      It's bona-fide. Even when he trolls he's usually more right-on than 99% of posters; in this case it's 100% informative, no bullshit.

      For a decent intro to pipelining, Google for design notes for the MC6502, the first pipelined processor.

    3. Re:An overview of pipelining by lamp77 · · Score: 3, Funny

      Your not doing my laundry with that kind of attitude!

    4. Re:An overview of pipelining by Anonymous Coward · · Score: 0

      What he wrote is correct, but his attitude bothers me.

      PhysicsScholar typically rushes and posts answers to questions no one asked in the first place, hoping to get modded up. And a lot (most?) of it is offtopic as well, so he's the canonical example for a karma whore.

      Any decent engineering or physics 2nd year undergraduate knows the shit he posts anyway, so I'm not impressed.

    5. Re:An overview of pipelining by joto · · Score: 2
      Hope this was helpful!

      It wasn't. You only explained "deep", not "wide". Since both words occured in the sentence you were trying to explain, you should try to do better than only explaining the one you remembered from college...

    6. Re:An overview of pipelining by khuber · · Score: 1
      That's pretty good. I would explain it a little differently with more emphasis on there being stages to executing one instruction, and contrasting with superscalar.

      First off, "superscalar" is when multiple functional units can do work in parallel. For example, two adds can happen at the same time on most processors. This is what I take "wide" to mean. Lots of true parallelism.

      Pipelining is not really executing various instructions at the same time. It is executing _parts_ of those instructions at the same time. That's important to bear in mind.

      In order to execute an instruction, several steps (stages in the pipeline) must occur, the fetch, decode, execution and memory access he mentions. That's for just one instruction.

      Each "step" happens on a clock cycle before the instruction can move on to the next stage and instructions may take many cycles to complete. This is like putting something into a pipe and waiting for it to come out the other end.

      To speed this up, you don't execute multiple instructions at once, you start processing the next instruction before the previous one is done. When instruction A is on step 2, instruction B can be on step 1 because A is no longer using that part of the processor.

      This is similar to putting several things into a pipe before the first one comes out the other end. They will all come out close together (e.g. 1 clock cycle) even though each instruction took many clock cycles total. You put in 16 things and they still come out one at a time, but much closer together than if you waited for each one (exactly like the good washing example above) That is pipelining. If your processor was 16 adders (a lot!) "wide" you could put in 16 adds and they would complete at the _same_ time. That's superscalar.

      For the curiouser: instruction prefetch is used to keep pipelines busy. Branches are the bane of pipelines because you may have to throw away all the partially finished instructions because you just found out you're going somewhere else. There are ways to deal with that such as branch prediction.

      -Kevin

    7. Re:An overview of pipelining by Drakonian · · Score: 1

      NICE! Is this Patterson and Henessey?? It's a little too familiar.

      --
      Random is the New Order.
    8. Re:An overview of pipelining by SvnLyrBrto · · Score: 1
      > typically rushes and posts answers to questions no
      > one asked in the first place, hoping to get modded
      > up. And a lot (most?) of it is offtopic as well,
      > so he's the canonical example for a karma whore.

      You, obviously, don't remember Signal 11.

      cya,
      john

      --
      Imagine all the people...
    9. Re:An overview of pipelining by Anonymous Coward · · Score: 0

      Just by coincidence I have the book sitting right in front of my right now. Trying to study, but /. is more interesting. Btw/ MIPS rulez

    10. Re:An overview of pipelining by Monkelectric · · Score: 1

      My god man this is the *funniest* thing I've ever read on slashdot. I'm gonna start the lamp77 fanclub.

      --

      Religion is a gateway psychosis. -- Dave Foley

  8. Wow.... by Anonymous Coward · · Score: 0

    Make the spec scores barely better than currently produced P4s and schedule a release a year from now. Interesting design philosophy.

    1. Re:Wow.... by Anonymous Coward · · Score: 0

      SPEC is a very dependent on memory bandwidth, compilers, etc. it's not an adequate metric to compare radically different CPU designs. benchmarks are only good for one thing anyway-starting flame wars.

  9. $$$/performance by teamhasnoi · · Score: 4, Interesting
    That's what I want to see, and missing in the article and links. Anyone have an idea?

    I recall IBM's PPC boards going for over a grand, which is (to me) far too much. Especially when it was a 'G3' chip.

    Even if the new chip is faster, will I be able to buy 2 pentium 4's (5?) for the price of it?

    1. Re:$$$/performance by Anonymous Coward · · Score: 1, Insightful

      Were you by chance looking at PowerPC evaluation boards? Those are low-volume boards intended for prototype developers only and thus, somewhat more expensive than high volume boards.

      Tom

    2. Re:$$$/performance by Anonymous Coward · · Score: 0
      will I be able to buy 2 pentium 4's (5?) for the price of it?

      Rule of thumb.... look at the die size. Smaller is cheaper. (Intel wins some of that back on ecnonomies of scales).

      Given that IBM is likely using larger wafers (If IBM uses its newest fab to do these, very likely.) then the G4e price should be close to the sale price (large wafer compensating partially for the 970 being larger. Intel will be on larger wafers too. So they'll probably stay in place on price year over year..). The G4's are cheaper now than the P4s.

      If AMD doesn't keep the pricing pressure on Intel even more so.

      Couching your question in a different light. Will you be able to buy two of today's mainstream P4s for 1 970 next year? Perhaps; buying last year's bleeding edge technology is always more of a bargin.. You'll have a steeper electric bill though. :-)

    3. Re:$$$/performance by Anonymous Coward · · Score: 0

      Grrr... does anything other than expensive evaluation boards ever hit the market? Bought a PREP or CHRP PC lately?

  10. Every time I readone of these articles... by larien · · Score: 4, Insightful
    ...I realise just how little I know.

    Kudos to the Ars team yet again for going deeper into CPU designs than 99% of the populace need to go :)

    1. Re:Every time I readone of these articles... by selderrr · · Score: 2

      Kudos to the Ars team yet again for going deeper into CPU designs than 99% of the populace need to go :)

      lol... did anyone else read an 'e' too many here ? Surely smells of goatse that way :-):-)

    2. Re:Every time I readone of these articles... by Anonymous Coward · · Score: 0
      They aren't half bad in a pure light weight technical sense. They tend to make a lot of assumptions about some things though. In the conclusion he make statements about the chips ability to keep itself fed that are completely unfounded.

      Half the stuff ars reviews in that manner they never actually have and they pretty much compare datasheets.

    3. Re:Every time I readone of these articles... by renoX · · Score: 2

      If you want to learn more about CPU architecture trade-off, the best book I know is "Computer Architecture: A Quantitative Approach"
      by Hennesy-Patterson.

      A really good book, not too hard to read..

  11. Comparison without AMD? by rsborg · · Score: 4, Interesting

    I think that since this is a 64 bit chip, why not compare it with other 64 bit consumer desktop chips (ie, AMD Clawhammer)? A lot of Intel's questionable moves (12K micro-ops instruction cache?) for the P4 were obviously not copied by AMD, and x86-64 seems to be the 64 bit desktop chip of the future.

    --
    Make sure everyone's vote counts: Verified Voting
    1. Re:Comparison without AMD? by toddhisattva · · Score: 1

      RTFA - Hannibal says why he wrote these comparisons and not vs. Hammer, SPARC, Alpha....

    2. Re:Comparison without AMD? by Coz · · Score: 4, Informative

      The article points out that a comparison with the AMD chip would be appropriate, but it's not "out there" right now as a basis for comparison. Hannibal says he'll probably use the 970 as a reference when he gets hold of the Opteron and does his down-in-the-registers review of it.

      --
      I love vegetarians - some of my favorite foods are vegetarians.
  12. Whoa by MalleusEBHC · · Score: 5, Interesting

    The PowerPC 970 has other potential customers as well, though, not the least of which is IBM itself who, with its large investments in Linux, would love to see a high-performance, 970-based 4-way or 8-way SMP Linux desktop workstation halt the steady flow of former 64-bit *NIX workstation users who began switching to Wintel hardware in the late 90's.

    Before all my fellow Mac users start A) thinking about going to Linux B) drooling C) wondering about Darwin or D) some combination of the above, let me remind you that Darwin scales very well. You can now return to your previous state of awe.

    PS - How much you want to bet good ol Steve is already having wet dreams about doing the traditional Photoshop test at a Macworld with 4-way SMP?

    1. Re:Whoa by Anonymous Coward · · Score: 0

      Darwin scales very well.


      The showstopper on these N-way workstations is a mass producable chipset to do this N-way with.


      Secondly, at 42 Watts per 1.8 GHz CPU...... a 4-way is a server box. That is a 168 Watts just from the CPUs alone.


      Two-way workstations... tops. There are serious thermal issues here if you don't want to sit next to hair dryer all day (and folks complain about todays G4 boxes).



      wet dreams about doing the traditional Photoshop test


      "Wet dreams"... Steve should have nightmares about how much longer he is going to be able to hide behind these "fig leaf" Photoshop benchmarks.


      The CEO of Boeing doesn't get wet dreams when GE comes out with a bigger, badder jet engine. If Boeing can't build a decent plane using the engine there is nothing to be excited about. The engine, in and of itself, is a necessary, but not sufficient piece of the system. The 970 may keep Apple in the ballgame.


      The steady flow of folks off of 64-bit workstations is not because of lack of horsepower... it is more so due to high cost.

    2. Re:Whoa by Anonymous Coward · · Score: 0

      If you clock them down to 1.2 GHz as the article mentions would be usable for laptops, at 19 watts each, then with 4 of those you're barely up to the power levels of a SINGLE high-end Athlon.

      Furthermore, with the wattage spread across much more die area (with 4 separate processors at 121mm^2, as compared to the latest Athlons at, what, 84mm^2?) your heat dissipation doesn't need to be nearly as drastic as the fancy copper-plate heatsink/fan combos companies are selling for Athlons these days. Good case ventilation, of course, would still be a requirement.

      Finally, (and this is just a hunch so anyone feel free to correct me,) I don't think that the heat generated is strictly linear with respect to power draw; the more power drawn, the larger percentage would be wasted as heat simply because the hotter processor would be less efficient, making it a vicious circle.

      4 fast processors, each one with 2 Altivec units - that's some serious number crunching power.

      Of course, who needs to crunch THAT many numbers? :) Most people who want faster machines these days want them to play games, it seems, and those are pretty much all single-threaded anyways. Still, I can't say I'm not drooling over the prospect...
      .

  13. PPC, not just for Apple any more by d3xt3r · · Score: 5, Insightful
    One of the more interesting, non-technical observations made by the author is that IBM most likely has some real consumer products in mind for this chip, not just helping Apple replace it's G4's.

    "The PowerPC 970 has other potential customers as well, though, not the least of which is IBM itself who, with its large investments in Linux, would love to see a high-performance, 970-based 4-way or 8-way SMP Linux desktop workstation..."

    This chip could be the start of something big in the Linux space as well. Think about it, we are now at a point where a few companies other than Intel are now poised to take the center stage in the next gen workstation, most notably AMD, Apple, and now IBM themselves.

    While Linux has run on PPC chips for a long time, it is difficult to come upon a G4 chip without paying the "Apple Tax" for the hardware. If IBM steps up to the plate with this chip, which can then run OS X, Mach, Linux, *BSD, (insert other OS'es here), and can be purchased directly or in a package from IBM, we may see a good set of Windows challengers for the desktop and server room. Obviously OS X will still only run on Apple derivatives.

    These chips will be big, I guarantee it, and not just for Apple. It will be interesting to see if Microsoft ports Win XP to these chips.

    1. Re:PPC, not just for Apple any more by Drakonian · · Score: 3, Informative

      PowerPC hasn't been just for Apple for quite some time, as evidenced by the 3930 hits for "Embedded PowerPC" on Google or the Embedded PowerPC Resources and Information page.

      --
      Random is the New Order.
    2. Re:PPC, not just for Apple any more by Elwood+P+Dowd · · Score: 2

      These chips will be big, I guarantee it, and not just for Apple.

      Don't guarantee it 'till you see the prices for the chips. The G4 is pretty expensive.

      --

      There are no trails. There are no trees out here.
    3. Re:PPC, not just for Apple any more by Sloppy · · Score: 2, Flamebait
      That sounds a lot like the Open-WhateverItWasCalled that IBM/Apple/Motorola were promising a decade ago. We'll run Taligent and OS/2 and Unix and NT and MacOS on it.

      Hey, I'll be delighted if it really happens, but that sounds like the usual "BIG NEWS in two weeks" type of stuff from my Amiga days. I can't believe it anymore, and if I tried, I would go insane. Mock my unrealistic fanboy idealism once, shame on you. Mock it twice, shame on me.

      --
      As copyright owner of this comment, I authorize everyone to defeat any technological measure which limits access to it.
    4. Re:PPC, not just for Apple any more by Christopher+Thomas · · Score: 3, Insightful

      This chip could be the start of something big in the Linux space as well. Think about it, we are now at a point where a few companies other than Intel are now poised to take the center stage in the next gen workstation, most notably AMD, Apple, and now IBM themselves.


      Bear in mind that when IBM says "desktop workstation" they mean a $20k+ machine. Consumer desktop machines these aren't.

    5. Re:PPC, not just for Apple any more by Anonymous Coward · · Score: 0

      Amiga days, eh? Don't you mean more like "Mock it fifteen times, shame on me?" ;)

    6. Re:PPC, not just for Apple any more by Anonymous Coward · · Score: 0


      The irony is that NT did used to run on PowerPC chips (as well as MIPS and Alphas), but Microsoft killed development on all but the x86 platform. Sucks for them.

    7. Re:PPC, not just for Apple any more by Anonymous Coward · · Score: 0

      That's Windows .NET, you dolt!

    8. Re:PPC, not just for Apple any more by Anonymous Coward · · Score: 0

      wtf? The G4 is cheap, check mot.com SPS site for the prices.

    9. Re:PPC, not just for Apple any more by nickos · · Score: 1

      While Linux has run on PPC chips for a long time, it is difficult to come upon a G4 chip without paying the "Apple Tax" for the hardware

      Perhaps I could point you in the direction of the Pegasos , a PPC board aimed at Linux and alternative computer users.

  14. Re:They should make it work three ways by .pentai. · · Score: 5, Insightful

    Why would they want to?
    Intel and AMD have the x86 market pretty well locked down.

    More importantly, why would ANYBODY want to implement the x86 ISA (Instruction Set Architecture or smtn like that). It's the most horrid instruction set in use today.

    Some instruction sets can't really be mapped to others easily, and optimizing for good performance with PPC would probably not have good x86 performance anyways.

    In Pentiums and Athlons, the instruction set isn't really emulated. It's translated to a smaller instruction set (uops, iops, pick whatever term you like and run with it). However, these smaller sets are still made pretty much specifically to cover the overlying ISA (x86 in this case).

  15. Only the FP is slightly better by Anonymous Coward · · Score: 0

    Estimated Power4 scores
    P4 scores at SPEC

    Estimated Power4 scores:
    Est. SPEC INT 937 @ 1.8 GHz
    Est. SPEC FP 1051 @ 1.8 GHz


    Intel Corporation Intel D850EMVR motherboard (2.8 GHz, Pentium 4 processor):
    SPEC INT 1032
    SPEC FP 1034

  16. Re:unfotunatly Apple is going with Intel instead.. by I_am_Rambi · · Score: 5, Insightful
    A few things wrong with this document.

    • This information is from a Giga Information Group perspective. - This is what they think.
    • There has been no "official" annoucement from Apple.
    • Its highly unlikely that Apple will go with two different chips. Story Here.

    I think Apple will stick with a company that it knows, IBM, since they have been working together for years. It doesn't seem that Apple will just jump ship to the x86 platform. This would also mean redoing the Mac OS X code and optimization (not like they will have to do some anyway, but they will have to do more). It is highly unlike that Apple will go with a heat producing, energy wasting x86 Intel chip.
  17. Re:They should make it work three ways by Paul+Neubauer · · Score: 3, Informative

    One of the early planned PPC chips had that idea in mind, by pretty much adding an x86 processor and logic to figure out what instruction set it was getting and where to send it. The idea was that then there would be no barrier to using PPC code - it could run x86 code to replace existing systems and run PPC as well. Thus it would be a transitioning thing.

    The x86 world seemed to move faster than the design for this and it fell away. It made more sense to concentrate on PPC stuff rather than try to do PPC and changing x86 stuff. Also, if it ran x86, why should anyone bother to write for PPC?

    The difference is Pentiums and Athlons are intended to be x86 family upgrades, while the PPC is not. The PPC 970 is meant as an upgrade to earlier PPCs. One could as well ask why AMD doesn't make an Athlon that can run PPC code.

    --
    I don't subscribe to RMS's GNUtopian vision.
  18. Re:unfotunatly Apple is going with Intel instead.. by WatertonMan · · Score: 5, Insightful

    Apple will almost certainly go with 970 and not switch to Intel for the following reasons: 1. It is difficult to emulate PowerPC with Intel (although the reverse isn't *that* difficult). Apple would need a PowerPC emulator so that all that software (including OSX software) isn't lost. 2. Apple wants to differentiate itself somewhat from the PC. 3. IBM appears to be moving up after the several years of problems with Motorola. The downside is that by the time a 970 board is out it will definitely be in the middle of the pack relative to the PC world. That means that Apple still will have computers that are more expensive than the PC world and that aren't as fast. Of course I think OSX is sufficiently better than either Linux or WinXP for a workstation that I'll stick with it. But Apple best hope that IBM gets large yields on time and perhaps with better speeds than expected.

  19. Re:YOU HAVE TONAL DISORDER by Anonymous Coward · · Score: 0

    I modded him up previously based on your linked comment and after reading his journal about rough mods. I struggled since some of his past posts were pretty weak and a chunk of his friends are trolls, but I decided to try giving him the benefit of the doubt since the post seemed genuine and was interesting. I hope this wasn't just the classic troll attempt to get back to +1.

  20. Re:Will this save apple? by Anonymous Coward · · Score: 0

    Did you miss the System Controller present on the Xserve and the new G4s?

  21. They wouldn't have to redo anything... by Chibi+Merrow · · Score: 2, Insightful

    As was mentioned in this previous article, Mac has been maintaining a seperate port of MacOS X for x86 in synch with the PPC version... I still remember some promotional material pre-OS X talking about an x86 version of Mac OS X being released that lacked the functionality to run Mac OS Classic apps (I think it was called Blue Box?).
    Are they going to jump ship to x86? Not likely if they can help it... but they're keeping the option open. Kind of like how Dr. Evil KNOWS his plans will never fail, but he always has that Big Boy rocket hidden in the back--just in case. ;)

    And who ya calling energy wasting? My Palomino keeps my room nice and toasty on those lonely nights and makes great julienne fries! ;)

    --
    Maxim: People cannot follow directions.
    Increases in truth directly with the length of time spent explaining them
    1. Re:They wouldn't have to redo anything... by Zelet · · Score: 1

      I know that keeping your room warm was a joke... but I have an Athlon XP 2000 system that, when running, I have to close the heater vent so that my room doesn't get too hot.

      --
      ...And when they came for me, there was no one left to speak out for me." - Martin Niemoeller (1892-1984)
    2. Re:They wouldn't have to redo anything... by Chibi+Merrow · · Score: 2, Interesting

      Actually not that much of a joke... when I moved into my new house I noticed that the area under my desk stays noticably warmer than the rest of the room and my feet were kept downright toasty if I propped them up on top of my Athlon XP 1800 tower... I think the GeForce 4 and the 400w power supply with all the fans piping heat outta the back add to the effect also. :)

      --
      Maxim: People cannot follow directions.
      Increases in truth directly with the length of time spent explaining them
    3. Re:They wouldn't have to redo anything... by Anonymous Coward · · Score: 0

      There have been a lot of optimations in OS X for Altivec. All those optimizations would have been wasted and might have to be redone for SSE.

  22. Re:They should make it work three ways by puto · · Score: 2

    Because they would have to license the x86 architecture from the respective owners. AMD and INTEL have licensing agreements.

    Puto

    --
    The Revolution Will Not Be Televised
  23. Re:unfotunatly Apple is going with Intel instead.. by DrinkDr.Pepper · · Score: 4, Insightful

    finally they'll see that clock speed does make a difference Clock speed is something Intel uses to bolster their performance claims and give people an excuse to upgrade to the newest model. Clock speed tells very little about the performance of a computer. Look at AMD's athlon. Many reviews like the ones on tom's hardware show that running Windows on a "slower" athlon yeilds better performance than a comparably clocked P4. If you meant that finally, if apple runs on x86, there will be a better benchmark between Windows and MacOS, you would be more accurate. Until that happens you are comparing two different fruits.

    --
    0xfeedface
  24. Error correction? by Anonymous Coward · · Score: 0

    I've been wondering about this also in regards to ECC for RAM. Does the fact that it has built-in error correction mean that the CPU/RAM is actually more prone to errors by design (since they can be corrected? And if so, is there more of a chance for degraded performance?

    1. Re:Error correction? by Anonymous Coward · · Score: 0

      In short, yes. Look up Chipkill memory.

      DRAM technology relies on the capacitive nature of a transistor (each transistor is 1 bit of memory). A radiation hit causes a short, which drains the capacitor and /can/ alter the value. The smaller the capacitor, the more damaging the hit.

      So the denser RAM gets, the more prone it is to a single bit failure. In my opinion, we should be using ECC now....

      As for whether performance will degrade. It can - depends on the implementation. But marginally, an ECC 'correction' hidden from the application layer.

      Tom

  25. Apple selects a winner! by doorbot.com · · Score: 3, Funny

    It is highly unlike[ly] that Apple will go with a heat producing, energy wasting x86 Intel chip.

    ...because PPC chips completely disobey the laws of physics, producing neither heat, nor "waste energy" (perhaps through the production of heat?). Yes, it is PPC, miracle of modern technology, standing up for the common man against the perils of Thermodynamics!

  26. Re:Power4 vs PowerPC 970 by NattyDread · · Score: 5, Informative

    The other responses to your question have pretty much hit it dead-on. I just wanted to comment that the PowerPC has always been the little brother of the Power architecture used originally in the RS6000 ... and now in almost everything IBM makes - AS400, E9000, etc.

    The first generations (601, 603/604 and the ?aborted? 620) of the PowerPC line were scaled-back versions of the Power and Power2 architectures respectively [the original Power architecture was mounted on a 3x5 daughter card with 4-5 separate chips [I'll have to go looking for my tech papers] making-up the core ... because of this the migration of everything into one die for the PowerPC was amazing.

    Additionally, IBM has tended to work-out new capabilities -- such as the move to 64-bit and dual cores -- on the larger scale Power architecture, before attempting to stuff it into the smaller PowerPC pacakge [besides, IBM has to keep something to distinguish its pricier iron from the OEMs. ;)

    Natty

    --
    Maybe the rain Isn't really to blame. So I'll remove the cause, But not the symptom!
  27. Re:unfotunatly Apple is going with Intel instead.. by Anonymous Coward · · Score: 0, Insightful

    i understand that comparing intel and g4's is apples and oranges and that intel uses clock speeds for marketing as well. however, why do you think there is such a huge overclocking market? it's simple: clock speed does make a difference.

  28. are you cmdrtaco? by Lethyos · · Score: 0, Offtopic

    you failed to properly spell "genitals".

    --
    Why bother.
  29. hahaha by Anonymous Coward · · Score: 0

    you hit the wrong checkbox, luz0r

  30. What? by mill5ja · · Score: 2

    How on earth is anyone supposed to know what this part is going to be priced at? I doubt IBM even has anything resembling a firm idea. The price is going to depend on yields, production quantities and a number of other factors.

    -jason m

  31. Re:unfotunatly Apple is going with Intel instead.. by DrinkDr.Pepper · · Score: 2, Insightful

    Sure, clock speed makes a difference, but only on the same chip. The post I was commenting on implied that you could compare the clock speed of an Intel machine to the clock speed of an Apple machine. For such a comparison clock speed is wholly irrelevant.
    P.S. My 333Mhz P2 runs great at 400Mhz, but largely because Intel underclocked the identical core to run at 333Mhz. Intel plays up the importance of clock speed, so they do dumb things like underclocking, and multiplyer locking.

    --
    0xfeedface
  32. G5? by gouldtj · · Score: 2

    I know that the press believes that Apple is going to grab IBM's chip hot off the presses, but does anybody know anything hard core about Motorola's G5?

    1. Re:G5? by Sycraft-fu · · Score: 3, Informative

      Go look on their site, they have specs on the first of their G5 chips:

      http://e-www.motorola.com/webapp/sps/site/prod_s um mary.jsp?code=MPC8540&nodeId=01M98655

    2. Re:G5? by Anonymous Coward · · Score: 0

      anybody know anything hard core about Motorola's G5?

      All I know is I wouldn't count on Motorola to deliver anything not targeted for the embedded market.

      Even that is uncertain: there are rumors they are selling their entire chip business. I don't think the management knows what it's for anyhow...

    3. Re:G5? by Anonymous Coward · · Score: 0

      Their management hasn't known what that division was for for a long time now.

    4. Re:G5? by Toraz+Chryx · · Score: 1

      G5 is an Apple designation, not a Motorola (or IBM) one, I'm firmly convinced at this point that the PowerPC 970 _IS_ "G5"

      And as for Motorola, they have a 64bit Embedded chip (the 8540) that lacks floating point and altivec (amongst other things)..

    5. Re:G5? by Lars+T. · · Score: 2
      --

      Lars T.

      To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

  33. Re:They should make it work three ways by Anonymous Coward · · Score: 0

    make the chip compatible with x86 instructions -- the Hammer does it, why can't the PowerPC chip?

    If that's such a great idea, then why isn't it working so well for transmeta?

  34. 16 stages does not mean 16 times as fast by misterhaan · · Score: 1

    to further clarify, sometimes the instruction you want to push into the pipeline depends on the result of another instruction that's still in the pipeline. when this happens, you have to basically close the entrance to the pipeline and wait for that instruction to come out of the pipeline before opening it up again. there are ways to minimize this effect, but still you will not see a 16-fold improvement over not having the pipeline.

    --

    track7.org has all kinds of interesting stuff!

  35. Die size as an indicator of cost by IIci · · Score: 2, Interesting

    Looking purely at die size, one can expect manufacturing costs of the p4 and ppc970 as being roughly equal: PowerPC 970 1.8 GHz, 0.13um, 121 mm2, 52 million transitors Pentium 4 2.8 GHz, 0.13um, 131 mm2, 55 million transistors As long as IBM is not using the exotic materials of the power4, then the main advantage Intel has for pricing is that their R&D can be spread over many more chips. What the R&D costs of the ppc 970 are is interesting, especially since IBM is trying to position themselves as a maker of custom chips leveraging their ppc ISA and the experience gained through their big bucks power series.

  36. All this talk... by gnuadam · · Score: 1

    All this talk about SMP machines coming nicely from this chip (and IBM's supposed workstation/linux aspirations) has me wondering something. Has anyone thought about adding openMP support to gcc? I'd give my eye-teeth for that. (Well, at least my wisdom teeth.)

    I already know about OmniMP and OdinMP, but I want openmp natively in the compiler. Anybody know more than me?

    --
    You say :wq, I say ZZ. Why can't we all just get along?
    1. Re:All this talk... by Roadmaster · · Score: 5, Informative

      " OpenMP is a specification for a set of compiler directives, library routines, and environment variables that can be used to specify shared memory parallelism in Fortran and C/C++ programs." All that would have to be added to gcc are the "compiler directives", as the "library routines" and "environment variables" aren't directly a part of the compiler.

      Now, openMP is good for programming extremely high-performance shared-memory applications, like scientific computation applications and stuff like that. It really sounds like overkill for a desktop environment where it's probably easier to program a multithreaded application with standard IPC mechanisms where communication is required. And really high-performance applications could also be programmed using MPI and a message passing communication scheme, which is far more widely used (compare the # of people who know about openmp versus those who know about mpi), probably wouldn't be much less efficient, and would quite likely scale much better than a shared memory implementation.

    2. Re:All this talk... by gnuadam · · Score: 2, Interesting

      0. When I say I want openMP added to gcc, that sort of implies that I want the compiler directives added, and the library routines added to the standard libraries that are shipped with the compiler. I realize gcc isn't going to affect any environment variables.

      1. I write scientific code. That's how I know openmp. I think it's great for that.

      2. It's not really overkill, because it's quite easy to program for (in a portable way!) Much easier than fork(), and IMHO easier than pthreads.

      3. Admittedly, I've never needed much in the way of IPC for the codes I've written for smp (scientific codes on smp machines don't need much of it) but you could probably use a pipe, a socket, or whatever else. I do mostly want it for the scientific uses, though.

      4. MPI is good. But openMP is like 1000x easier to write for, and on good hardware is usually better ... for my problems.

      5. If IBM, or apple even, makes affordable, good smp boxes with this processor, openmp would be quite useful.

      6. The same features it's good at in science ought to make it perfect for other processor intensive tasks. Anything that needs a for (do) loop can be scaled quite well. Anything that has chunks that don't need communication can be scaled well as well. I imagine video/sound encoding might be easily parallelized using this...much easier than pthreads, and for this mpi would be overkill, don't you think?

      7. All I'm saying is that openMP is out there, is supported on most commercial compilers, and it's noticably missing from gcc. *I* would use it. I suspect many more people would use it if it was available .... especially if 4 processor boxes become available on the commodity market as people seem to be expecting (dreaming) in this topic.

      --
      You say :wq, I say ZZ. Why can't we all just get along?
    3. Re:All this talk... by Anonymous Coward · · Score: 0

      Bull plop,
      IPC/RPC/DOM yadda yadda all spend heaps of CPU time sending piss poor 4 bit msg to each other, for big f*ck off cpu grunt share the mem, write some pared SMPed code (Bwolf?) booomshanka
      Mind you no good for Word '87

  37. Re:I am tired of 64 bit lies ! (40 bit only, folks by KewlPC · · Score: 3, Interesting

    *Sigh* The size of a CPU's address/data bus does not reflect a processor's "bitness". 64-bit means that it has a 64-bit word size (as opposed to the 32-bit word size on x86 processors), 64-bit registers, etc. Most 64-bit CPUs don't actually have a 64-bit address bus. Like you said, the Alpha's is 48-bit. This is usually done to keep the pin count down to a sane level (if you need all the physical RAM that a 64-bit address bus would provide, you need something bigger than a desktop CPU). You can expect that as 64-bit chips become more common on the desktop to see somebody introduce a 64-bit CPU that has a 64-bit address bus just so that they can say, "Hey, look, we have a 64-bit address bus, the other guys only have a 48-bit one!" and (like you are trying to do) will insinuate that this means the competition's CPUs aren't "true" 64-bit (even though they are).

    I dunno 'bout Macs (I don't know the M68k's "bitness"), but Intel introduced the 386 (their first 32-bit CPU) in 1986. And I certainly don't think the M68k was a RISC processor.

    at current prices and projected prices, 512 gigabytes or RAM will barely cost more than a couple of the fastest processors of this type.

    Really? I would LOVE to be able to buy 512 gigabytes of RAM for the cost of a couple of fast desktop processors. Don't forget that the PowerPC 970 is meant to be a desktop processor.

  38. aww damn by ProppaT · · Score: 1

    So now I have to be wide AND go deep to be competitive?? Damn it...next thing you know the girlfriend will be wanting a faster FPU too....

    --
    Wise men say, "Forgiveness is divine, but never pay full price for late pizza."
  39. Re:Question IT IS ONLY 40 BITS not 64. by Anonymous Coward · · Score: 0, Flamebait

    Why do you keep claiming its 64 bits?

    However, as fast and amazing and AMD crushing as this 970 chip is... it only has a 40 bit data bus.

    This means you cannot physically ever attach more than 1024 gigabytes of ram to it EVER.

    At least in the 1990's and early 1990's some 64 bit risc chips had 48 bits of addressing, not lame 40 bits.

    This is considerred a hack by some engineers.

    The Power4 is astounding and amazing and will help apple trounce Intel benchmarks (Pentium4 cannot be multiprocessor only expensive xeon and
    itanium) but this chip is only 40 bits! The RC5 key benchmark is already over 4 times faster on a mac than on the fastests Pentium 4, and twices as fast as ADM dual processor. Consult rc5 project site if you doubt this.

    True, it the 970 has 64 bit integer instructions that only have value in cryptography, and some relevance in some gaming computation, but the chip is
    NOT 64 bits in my mind uless it can physically attach to more than 1024 gigabytes of mapped locations.

    Heres another "gocha" Apple, like EVERY OTHER OS KNOWN, will steal a bit or two for dividing pci dma space or dividing kernel space
    meaning that this chip will only EVER support 512 gigabytes of RAM, ever. Or even 256 GB if they steal two bits instead of one.

    at current prices and projected prices, 512 gigabytes or RAM will barely cost more than a couple of the fastest processors of this type.

    In linux you cannot have a single application grab and use over 1 gigabyte of GENUINE held and locked RAM, nor in Win2000 regular, nor
    WinMe.... but years ago in 1996 normal mac such as the PowerMac 9500 could actually create and USE over 1 gigabyte of ram dedicated to
    a single user task.

    In 2002, few unix OSses, not even OS X, allow over 1 gigabyte of ram for a single process.

    I want Power4 from Sonnet in my mac, not this 970 inadequacy.

    But then again I also need a way to cram in 512 gigabyte of RAM for three-dimensional finite difference time domain 3D FDTD, for studying energy for
    optics computation of laser circuitry construction.

    FDTD NEEDS RAM!!!! real ram... not emulated virtual ram.

    Why do fanboys mod stuff like this down? Its all facts and not expressed elsewhere. I restate this because evidently modding is based on popularity contests and not logic sometimes. My original was slammed at -1 as this one is as well probably, even though it is informative and factual.

  40. Re:They should make it work three ways by stripes · · Score: 3, Interesting
    More importantly, why would ANYBODY want to implement the x86 ISA (Instruction Set Architecture or smtn like that). It's the most horrid instruction set in use today.

    Because that is where most of the desktop CPU money is going, some of the high end, and frighteningly enough a fair bit of embedded CPU money too.

    In short if you can navigate the patent mine field, the brutal competition mine field, and deal with the instruction set making things a royal bitch doing an x86 CPU is a total no-brainer.

    In Pentiums and Athlons, the instruction set isn't really emulated. It's translated to a smaller instruction set (uops, iops, pick whatever term you like and run with it). However, these smaller sets are still made pretty much specifically to cover the overlying ISA (x86 in this case).

    Other then needing a whole new decoding front end, and being forced to use a trace cache because decoding multiple instructions in x86 land is very hard... the instruction thing isn't a big deal. Handling the odd-ball 80 bit FP format is. So is emulating all of the trap stuff and the other little odd bits close to the instructions set (like the MMU).

    A big pain. But with much of the effort not being where folks think it is!

  41. PG-13 by Daleks · · Score: 0

    If the P4 takes a narrow and deep approach to performance and the G4e takes a wide and shallow approach, the 970's approach could be characterized as wide and deep.

    Watch it, this is a family website.

  42. Finally, my girlfriend can be happy by ndogg · · Score: 3, Funny

    When I had the Pentium 4, she complained about its narrowness, but its was great. With the G4 Mac, it was nicely wide, but too short, she would note. I'm sure that with the 970, I can fulfill all her dreams!

    --
    // file: mice.h
    #include "frickin_lasers.h"
  43. Re:I am tired of 64 bit lies ! (40 bit only, folks by Anonymous Coward · · Score: 0

    Learn your history... APPLE was first beating wintel.

    Apple used 32 bit processors (MC68000) in Lisa in Sept 1982.

    It used them again in January 1984 with the Macintosh computer.

    The 1982 lisa had windows, scolling, dialogs, fonts, buttons, WYSIWYG text editing with graphics, etc.

    The IBM PC in Sept 1982 supported only black and white and used a cassette tape drive standard. It cost 600 dollars and was sold at Sears and had a 4 week backlog of orders. But it was not 32 bits like the Macintosh or Lisa.

    the Apple II had 75% of us market and used floppy drives (fast ones), but as you know, the IBM PC eventually supported Lotus 123 and the rest is history. (Apple used VisiCalc and supported Multiplan using a cheap CPM card, as well as Turbo pascal and other cpm products)

    Also in March 1987 Apple supported choices from million of colors and 6 simultaneous monitors attached and supported.

  44. Re:I am tired of 64 bit lies ! (40 bit only, folks by Anonymous Coward · · Score: 0

    The lisa may have been a 680010 for special purposes. It has no TAS (test and set) and other things that the Macintosh lacked, but the Lisa did have multiple hard drives, and a 15 inch hires printer.

    Sorry for typing MC68000, the Mac used the 680000 for certain, I think the lisa did but I do not want to be wrong. The Lisa was shown publicly in fall of 1982 though but delayed shipping until special new floppy drive and hard drive issues were all settled in late dec 1982.

    Apple OS was generally 8 years ahead of MS in most technologies over the next 15 years. Apples chips were ahead as well for most years too, but economies of scalle and price wars still have not kept the price of a 800 Mhz itanium chip below 7 and a half thousand dollars. (look on Price watch if you doubt me).

    Thats why apple does so well in workstation world.

  45. For those /.ers who will not read the article by Sivar · · Score: 5, Insightful

    ...an extremely wide execution core and a 16-stage (integer) pipeline that, while not as deep as the P4's, is nonetheless built for speed.

    For those not planning to read the article, I wanted to mention the following so you do not get the wrong impression. The speed that the article refers to (of a long integer pipeline, like a 16-stage or like the Pentium IV's 20-stage) is clockspeed, not necessarily actual performance. The P4's super long pipeline, for example, allows it to run at higher clock speeds, but less work gets done in the same number of clock cycles. This is the "braniac" vs "speed demon" philosophy (with a high clock speed but low instructions-per-clock representing "speed demon") and neither is necessarily better than the other (though one is obviously better for the marketing dept.)
    Just don't assume that "built for speed" always means "built to be fast" -- a confusing but important distinction. :-)

    --
    Computer Science is no more about computers than astronomy is about telescopes. --E. W. Dijkstra
  46. Decoded cache in the P4. by Christopher+Thomas · · Score: 3, Insightful

    A lot of Intel's questionable moves (12K micro-ops instruction cache?) for the P4 were obviously not copied by AMD, and x86-64 seems to be the 64 bit desktop chip of the future.

    The P4 has its flaws, but IMO cacheing decoded instructions isn't one of them. It shortens the pipeline, and paves the way for a true trace cache (cache of decoded basic blocks indexed by entry point; very handy for renaming and scheduling).

    1. Re:Decoded cache in the P4. by jason_watkins · · Score: 1

      Indeed, if you do the math, it's hard to say trace caches arn't better for the same die area. I expect AMD will go this way sooner or later as well.

  47. Re:unfotunatly Apple is going with Intel instead.. by dolanh · · Score: 2

    My guess is that we're gonna see "Four brains are better than one" right around the time the 970 arrives, and that this time it will make the Mac competetive (unlike the Daystar days).

  48. Speculate... by skinlayers · · Score: 3, Interesting

    What I really want to know is how much this chip is going to cost. If its cheap for Apple to put 2 or 4 of these in a machine, then how much will it matter that an expensive P4 (P5) out performs it? Hmmm.... The current Wind-Tunnel G4s raised a few eyebrows when it first came out do to the new case design. These things were designed to disapate heat! A HUGE (7 lbs) heat sink w/ matching fan, a small case fan, 2 fans on the power supply, and a ton of ventalation in the back. WAY more cooling that those 2 little G4s require. I think Apple is trying to avoid the fiasco it had with the Sawtooth (1st gen) G4s where they just slapped a G4 onto a G3 mobo. This time around, I believe they're releasing a new mobo first and then put a new proc in it down the road. I've also read stuff in forums suggesting that the power supply for the Wind-Tunnel had way more juice than the system currently demands. Can anyone out there do the math on this? We know how much power the PPC 970 eats. Can we figure out how much heat the Wind-Tunnel case is designed to disapate? What about how much power the power supply is putting? With these numbers, can we figure out how many PPC 970 the Wind-Tunnel case could power and cool? I've been suffering with a 266MHz G3 iMac, and I refuse to upgrade until Apple comes out with a system that really is worth that premium they charge, and a G4 is not it.

    1. Re:Speculate... by Alan+Partridge · · Score: 1

      "I think Apple is trying to avoid the fiasco it had with the Sawtooth (1st gen) G4s where they just slapped a G4 onto a G3 mobo."

      that wasn't the "Sawtooth", but rather the "Yikes!" machine. And the fiasco was entirely due to Motorola not supplying Apple with the number of chips at the speeds that they'd promised. The "Yikes!" machine wasn't bad, it actually had better PCI bandwidth than the later "Sawtooth"...

      --
      That was classic intercourse!
    2. Re:Speculate... by Anonymous Coward · · Score: 0

      1: read the article, the wattages are clearly listed.

      2: your final sentance turns you into a troll or a karma whore. No two ways about it.

  49. Blah blah blah... by KewlPC · · Score: 2, Insightful

    I specifically said, "I dunno 'bout Macs..." because I wasn't trying to make a "See? Intel is better than Mac" post. I was just giving the date at which Intel introduced their first 32-bit processor.

    And now that you mention it, I do remember reading that the M68k was 32-bit, but it only had a 24-bit address bus, which meant the max. amount of RAM it could physically have was 16 megabytes. Again, I'm not trying to bash anybody, I'm trying to point out that your "64-bit CPUs aren't really 64-bit because they only have a 48-bit address bus" argument is flawed.

    Why you went off on the whole Apple vs. Wintel thing is beyond me, but if you want to play ball, OK. For the record, the PC wasn't meant to compete with the Lisa or the Mac, and both of those computers were introduced after the PC. The original IBM PC was a competitor to the Apple II, but more oriented towards business use rather than home use. If you remember, the Apple II also used a cassette tape drive (just like the original PC), but, like the PC's successors, the PC-XT and the PC-AT (all modern PCs are descended from the PC-AT), later had the ability to use floppy drives and hard drives.

    The 1982 lisa had windows, scolling, dialogs, fonts, buttons, WYSIWYG text editing with graphics, etc.

    Which were all "borrowed" from Xerox PARC. The fact that Apple later whined and bitched about Microsoft "borrowing" those ideas from the Lisa and/or Mac (when Apple themselves had stolen those concepts from somebody else in the first place) is too amusing for words. I can't stomach Bill Gates, but I have just as hard of a time putting up with Steve Jobs ("You stole Windows! It's not fair! We stole it first!").

    And as for Mac OS always being 8 years ahead of Windows, well, I'm no lover of Windows, but Windows had preemptive multitasking years before Mac OS (Windows got it in Win95, Mac OS didn't have it until OS X).

    the Apple II had 75% of us market

    Although I don't have any hard facts, I have a hard time with this. It wasn't like IBM and Apple were the only players in the personal computer market. There was Commodore with their highly successful Commodore 64 computer (not to mention the Commodore PET, VIC-20, and Commodore 128), Sinclair, the TRS-80 (from Tandy and RadioShack IIRC), and a whole host of others.

    1. Re:Blah blah blah... by Maxwell · · Score: 2, Insightful

      And as for Mac OS always being 8 years ahead of Windows, well, I'm no lover of Windows, but Windows had preemptive multitasking years before Mac OS (Windows got it in Win95, Mac OS didn't have it until OS X).

      Windows NT/2000XP had real pre-emptive multitasking before the half bit crappy win95 implementation. 1993? Think NT 3.1 and Nt 3.51. Those OS's, much maligned then are now the foundation for MS future OS's...

    2. Re:Blah blah blah... by Anonymous Coward · · Score: 0

      "Which were all 'borrowed' from Xerox PARC..."

      Anyone who thinks that Lisa (and/or Macintosh) was "stolen" from the Xerox Alto or Star obviously hasn't seen the two machines side-by-side. Read on...
      http://www.mackido.com/Interface/ui_history.html

    3. Re:Blah blah blah... by KewlPC · · Score: 2

      I'm not saying they copied the look & feel of the GUI developed at Xerox PARC, just that they copied the concepts and ideas from them.

    4. Re:Blah blah blah... by dke · · Score: 1

      And Xerox borrowed the mouse, overlapping windows, menus, direct manipulation and other things from other places as well. (Many papers, researchers, Apps, etc., that had existed for 5 or 10 years before PARC). They just brought the ideas together and furthered them.... just like Apple did.

  50. No connectors for L3 cache? by Anonymous Coward · · Score: 0

    Who needs L3 cache connectors when you have a 900 mhz system bus? There are only two reasons to have L3 cache: the system is massive multiprocessors system (like the Power4) or the system has a ridiculously slow system bus (like the Motorola G4). The PowerPC 970 has no need for L3 cache because the bus is just that fast. Hell, the bandwidth to the L3 cache on a G4 is only 4.6 Gbps. It is DDR and effectively runs at half the clockspeed just like the main bus on the 970.

    1. Re:No connectors for L3 cache? by Lars+T. · · Score: 2

      Uhh. Cache (SRAM) has far lower latency than DRAM. IOW you already have the data from cache when your RAM hasn't started sending it yet.

      --

      Lars T.

      To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

  51. Re:Question IT IS ONLY 40 BITS not 64. by Anonymous Coward · · Score: 5, Interesting

    IT IS ONLY 40 BITS not 64

    Your desire to use address pins (or is it max pinned space per process?) to measure size puts you in a distinct minority. That doesn't make you wrong. But neither does it help make you right in this particular jungle.

    Systems whose physical addressing match their claimed "bitness" are probably in the minority.
    Some systems provide more physical addressing than register width (later PDP-11s, 8086, S/390), some less (68000, classic CDCs, early POWER). The 970 falls into the less category. Nothing unusual there.

    Apple, like EVERY OTHER OS KNOWN, will steal a bit or two

    Some bits come from physical addresses, some from virtual addresses. These should be addressed [pun slipped in, sorry] separately. AIX, btw, steals less than one bit. Linux can also be configured to steal less than one bit. (Assertions I can get away with no loss of credibility, since AC's have none to start with.) Were you frightened by a VAX in your formative years?

    Why do fanboys mod stuff like this down?

    Because we can't figure out why someone who needs 512GB, or 1TB, or more (which is it?) cares that a Linux process is limited to 1GB and not 2GB or 4GB.

  52. Oh, she'll just wait for you to leave... by mtec · · Score: 1

    and use her Palm and an emulator

    --
    Cake or Death? Cake Please!
  53. Re:Trolls overflowing on K5... by Anonymous Coward · · Score: 0

    Yes, there are trolls at K5 and /., and yes they are damn annoying.

    But the real problem is the number of arrogant bullshitters who are drawn to internet message boards and obsessively post all day long.

  54. Re:unfotunatly Apple is going with Intel instead.. by Alan+Partridge · · Score: 1

    yeah, sure

    coz FOUR PPC 970s will be virtually free, won't they? It's not as if these chips ae gonna cost $500 each or anything...

    --
    That was classic intercourse!
  55. Re: Number Crunching by ghutchis · · Score: 1


    Actually... I wouldn't mind having that many processors, even ignoring that the Altivec uses single-precision floats rather than doubles.

    Sorry, can't resist. Imagine a cluster of PPC970s, each with 4 fast processors and a huge memory bandwidth. Not bad.

    -Geoff

  56. Re:I am tired of 64 bit lies ! (40 bit only, folks by khuber · · Score: 1
    Why can't they just change the address bus / MMU later? They're dumbing down the addressing from the Power 4 anyway - I assume they'll take notes so they can remember how to switch it back ;-).

    -Kevin

  57. Re:They should make it work three ways by Anonymous Coward · · Score: 0

    Not if they scratch-built their own copy of it, then they'd only have to get copyright licenses to be able to say 'x86'. Think about SAMBA.

  58. Re:Question IT IS ONLY 40 BITS not 64. by Anonymous Coward · · Score: 4, Informative

    40 bit address bus. Not a 40 bit data bus. BIG difference there.

    And the 40 bit address bus is most likely a pin packaging limitation. They did not see a need to bring those extra 24 address lines out to the chip package. Internally, it is 64 bit. Much like the venerable MC68000 was 24 bit externally, but 32 bit internally.

    But seriously, in the life-span of THIS processor implementation - do you seriously see ANY desktop manufacturer even thinking about putting that much RAM in their CPUs?? Heck 1GB of RAM is not 'standard' yet. Extrapolating w/ Moore's Law, we'll be approaching 40bits in 8 years. Apple will undoubtedly have another chip before then!

    If you truly need THAT much physical storage today, you'll need to shell out for a SERIOUSLY large server. IBM's high-end p690 currently maxes out at 256GB. The virtual address space is undoubtedly much higher.

    Tom

  59. Re:unfotunatly Apple is going with Intel instead.. by Anonymous Coward · · Score: 0

    I ask you good sir, how do you know they are going to be more expensive?

    Thanks for random speculation based on no facts whatsoever, it really helps the discussion.

  60. Re:unfotunatly Apple is going with Intel instead.. by Anonymous Coward · · Score: 1, Interesting

    probably closer to $400 for the top of the cutting edge at the begining, with the 1.2g versions being around $270 (lower misfab rates). Prices will go down as this is IBM's competition to "comodity x86" hardware they are forced to use on their blades currently, I wouldn't doubt but within 10 years IBM and AMD will be fighting a price war over the desktop.

    Intel seems happy to sit out this round, I'm wondering if they are going to be there next round though.

  61. Re:unfotunatly Apple is going with Intel instead.. by cehardin · · Score: 2

    Ok, apple most likely isn't going to go with Intel CPUs, but one of your reasons is wrong.
    Mac OS X binaries are capable of being FAT. They can contain both PPC and Intel instructions. OS X binaries are actually folders which can contain multiple architecture implementations within them. This also applies to the "libraries". Ok, the Framework bundles.

  62. Re:It WAS DOUBLE POSTED BECAUSE OF CENSORS by Anonymous Coward · · Score: 0

    Why don't you just STFU and buy a http://www.sgi.com/visualization/onyx/3000/

  63. Re:unfotunatly Apple is going with Intel instead.. by Joey7F · · Score: 2

    One time someone asked me "Will this faster processor help me download quicker or what"

    I stared at him and just said "Yes...yes it will"

    --Joey

  64. why by Anonymous Coward · · Score: 0

    isn't this linked on apple.slashdot.org???

    IBM 970 makes us aquadorks drool...

  65. Re:yay! by Anonymous Coward · · Score: 0

    IMHO five minutes of researching a mod decision isn't out of line when a comment is getting a large number of moderations up and down or when half of the replies are from people trying to figure out whether the parent was written by a troll. Providing info to other mods as AC is hardly ruining slashdot, especially if it helps to clarify which posts are accurate and trustworthy.

  66. Re:Question IT IS ONLY 40 BITS not 64. by jason_watkins · · Score: 1

    All 64bit machines are actually narrower, and as long as the architecture is spec'd properly, THERE IS NO REASON A LATER CHIP CAN'T ADD MORE PHYSICAL ADDRESS BITS. See the progression the Dec Alpha went through for example.

    For the time window the 970 will be in market, a 1TB address space is huge, and saving some bus lines will be a bigger impact by saving bottom line $ than bragging rights over being able to address ram capacities that no one will use.

    If you need to address large ram capacities, you're clearly not in the market the 970 is designed for. While geek bragging rights might be fun, apple would have to be as daft as you are to actually bias the price of their product line to achive it.

  67. Re:unfotunatly Apple is going with Intel instead.. by dolanh · · Score: 2

    Nobody ever said premium Mac systems were cheap. Quad proc Daystars were quite expensive in the day.

    I said "competitive", but I meant in performance terms. Price competitiveness is a whole different issue, though as another poster hinted, price/performance might not be as bad as one would think, esp. if IBM isn't plagued by the same fab issues Motorola has come across with the G4 (and history has shown that they probably won't be).

  68. Re:Question IT IS ONLY 40 BITS not 64. by Anonymous Coward · · Score: 0

    Uhh, 40 bits is 1 TERAbyte. Not 1 GB... Yeesh.

  69. Re:unfotunatly Apple is going with Intel instead.. by Anonymous Coward · · Score: 0

    It is difficult to emulate PowerPC with Intel (although the reverse isn't *that* difficult).

    This seems to be a common myth amongst Mac users, but it's completely bogus. There's nothing magical or mysterious about emulating one 32-bit architecture on another, and writing a PowerPC emulator for the x86 would not be significantly different to writing an x86 emulator for the PowerPC. The fact that Intel CPUs are considerably faster, however, does mean the x86 could in all probability do a better job of emulating the PowerPC than vice-versa.

    The main reason for the lack of (modern) Mac emulators is a lack of demand. I mean, virtually anything that runs on a Mac also runs on a PC, so why on earth would any PC user want to emulate a Mac? The reverse is obviously not the case, hence the popularity of PC emulators amongst Mac users.

    Apart from the lack of demand, the tricky thing about emulating a Mac is Apple's closed architecture. The PC architecture is open, so it's relatively easy to implement it in software. Although the Mac has imported a lot of open architectures from the PC (e.g. PCI, AGP, USB), the Mac system-board designs are closely guarded by Apple to prevent clones. That's why not even other PowerPC systems (e.g. IBM RS/6000s) can emulate current Macs. If PC or RS/6000 users really wanted to emulate Macs, someone could probably reverse-engineer one, but the simple fact is they don't.

  70. Re:Question IT IS ONLY 40 BITS not 64. by Anonymous Coward · · Score: 0

    Ignorant rants of this kind get on my nerves. A process in Windows (NT/2000/XP) gets 2GB of address space by default, which can all be backed by physical RAM. With the /3GB switch in the boot.ini, which has been around since NT4 Enterprise Edition, the process address space is increased to 3GB (with the kernel address space correspondingly reduced from 2GB to 1GB). If Linux for some reason can't even back 1GB of the process address space with physical RAM, it must have a really crappy memory manager.

    On the hardware side, modern x86 CPUs actually support 64GB of physical memory, but it can't be directly accessed (since the address space is only 4GB), so it's only available via convoluted mechanisms like the Windows PAE APIs (and I think they can only access the full 64GB on Windows 2000 Datacenter -- not that normal PC motherboards/chipsets support it anyway).

  71. Re:I am tired of 64 bit lies ! (40 bit only, folks by Anonymous Coward · · Score: 0

    Large address space does not have to mean "buy lots of RAM".

    It can also mean "lots of virtual memory", or "ability to mmap extremely large files" without any problems at all.

  72. That's just like me by D+iz+a+n+k+Meister · · Score: 0, Redundant

    If the P4 takes a narrow and deep approach to performance and the G4e takes a wide and shallow approach, the 970's approach could be characterized as wide and deep

    Hey ladies!! My approach could be characterized as wide and deep too.

    --

    He painted a unicorn in outer space. I'm askin' ya, what's it breathin'?
  73. Re:yay! by jweatherley · · Score: 1

    What does a poster's history have to do with it? Read his post and decide - if you still can't figure out if it's a troll or not then don't moderate. In this case PS's post is not a troll as it is factually correct (when he is trolling he is a lame watered down knock off of PhysicsGenius). No, in this case he is whoring for karma - if you think this is informative mod it up - if not then don't bother modding it down - mod someone else up instead. Remember the slashdot janitors have an infinite supply of mod points and are not scared to (ab)use them.

    --

    --
    Reverse outsourcing: it's the future
  74. repost by SHEENmaster · · Score: 1

    just after our sun expands to a red giant and swallows the earth and moon.

    Didn't I see a post about that being emminent? We only have a few million years left. Why waste them on /.!?

    --
    You can't judge a book by the way it wears its hair.
  75. Re:unfotunatly Apple is going with Intel instead.. by Lars+T. · · Score: 2

    Yes, the binaries can be "FAT" - but currently not a single one is. IOW all software would have to be recompiled (if that is possible at all).

    --

    Lars T.

    To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

  76. Re:unfotunatly Apple is going with Intel instead.. by Anonymous Coward · · Score: 0

    I see my post has been modded down. This must be because it links to SPEC results which show Macs are somewhat slower than PCs? They are nevertheless good machines, and unfortunately for the modder, sticking ones head into the sand like an ostrich wont speed them up!

  77. Re:They should make it work three ways by Lars+T. · · Score: 2

    You mean like AMD? With more than 20% marketshare and bleeding money? Yeah, right, I know this is just because of their Flash memory business.

    --

    Lars T.

    To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

  78. Re:Power4 vs PowerPC 970 by chrysrobyn · · Score: 2

    The first generations (601, 603/604 and the ?aborted? 620) of the PowerPC line were scaled-back versions of the Power and Power2 architectures respectively [the original Power architecture was mounted on a 3x5 daughter card with 4-5 separate chips [I'll have to go looking for my tech papers] making-up the core ... because of this the migration of everything into one die for the PowerPC was amazing.

    The PowerPC 601 was not a scaled back version of the Power series. To say this would imply that they took the design and modified it. In fact, they took the Power instruction set, modified that and then designed the processor to support it for the target markets.

    The 64 bit PowerPC 620 was not "aborted" per se (like the PowerPC 615 was), rather IBM decided that its role was filled by the higher clocked 604 series and the then soon-to-come IBM Rochester, MN designed 64 bit PowerPC 630 (aka Power 3).

    To verify my claim that the PowerPC 620 was not aborted, Motorola got suckered into manufacturing them for Bull.

  79. Re:Question IT IS ONLY 40 BITS not 64. by Anonymous Coward · · Score: 0

    If you don't enable HIGHMEM in the Linux kernel, each process is limited to 3 GB, so it steals only 1 GB (wich is less than a bit). If you do, you can have the full 4 GB per process.

  80. Re:Question IT IS ONLY 40 BITS not 64. by Anonymous Coward · · Score: 0

    Windows only recently (last couple of years) got the ability to work with more than 512Mb, and now they're pissed about another OS only being able address a Terabyte? Talk about yer double-standard...

  81. Re:Question IT IS ONLY 40 BITS not 64. by Anonymous Coward · · Score: 0

    Windows only recently (last couple of years) got the ability to work with more than 512Mb, and now they're pissed about another OS only being able address a Terabyte? Talk about yer double-standard...

    Huh? Windows NT's been around since 1993 (a wee bit longer than 'a couple of years'), and has never had any '512MB limit'. Of course Windows NT/2000/XP supports 4GB of RAM (the max a 32-bit system can directly address), and for a few years has even offered an address-space mode that reduced the kernel address space to 1GB (from the usual 2GB), leaving each process with 3GB. The more expensive server versions also support >4GB RAM (up to the CPU limit of 64GB) through PAE, but only extremely pricey server hardware supports that much RAM, and it can't be directly addressed.

    If 64GB RAM and a 3GB process address space (essentially the upper limit for x86 systems) isn't enough, there's Itanium and x86-64.

  82. Re: PPC, Never *just* for Apple? by ebooher · · Score: 1

    What ever happened to the IBM released ATX compliant, socket (PPC) motherboard specs? I used to have a link to them, but I can't find it now.

    All I remember is that a couple of years ago, IBM released an open resource platform that was a standard ATX implementation for their (at the time) top of the line PPC604 chips.

    The thing looked just like an (Intel) board that we've all come to know and love. It had DIMM sockets, a ZIFF based processor socket, PCI bus, AGP video (though for some reason I seem to remember it *not* having USB).

    All of these companies (something like six, which is a lot in this type of industry Mainboard wise, right?) jumped on the build about PowerPC motherboards. "It was like several PC component manufactures cried out in agony, and then were suddenly silenced." Because they all, like, just disappeared off the radar.

    Did Apple pull some kind of "Don't you dare let them build those motherboards or we'll walk" type of attitude? Because it's just *really* odd that I didn't see a *single* physical computer get built out of all these announced and proposed PowerPC Linux workstation systems.

    Anyone know what happened?

    --
    "Genius may shine aloof and alone, like a star, but goodness is social, and it takes two men and God to make a Brother."
  83. SMP by Into+The+White · · Score: 1

    If IBM can offer the PPC 970 at a good price, then Apple will be able to offer dual processor Powermacs, or perhaps even quad processor Macs. With such a design, it is no longer important for the PPC 970 to best the Pentium in terms of performance; the balance between performance and power consumption/heat dissipation becomes more important, and this is where the PPC 970 shines. A dual PPC 970 Mac will undoubtedly be considerably faster than a single CPU Pentium system, and for the first time in years, Apple will offer the highest performance consumer desktops, by a large margin! If one 970 is about as fast as a 2.5 GHz Pentium 4, then two 970s should offer nearly twice the performance in some situations. It will be interesting to see if gamers flock to the mac...

    --
    "If you're half-evil, nothing soothes you more than to think the person you are opposed to is totally evil." N. Mailer
  84. Re:Power4 vs PowerPC 970 by NattyDread · · Score: 1

    You are correct that the 601 was a new engineering effort and implementation of a modified Power instruction set. I did not intend to understate this fact and give the impression that it was a direct adaptation or modification of the original Power architecture (it could also be debated that the Power2 architecture is about the same distance of relation to the Power as is the PowerPC ... though not quite in the same direction ... I remember still receiving biweekly microcode updates for the Power CPU after it has been (prematurely according to some) branded Gold and released ... both the PowerPC and Power2 were much more mature).

    My "?aborted?" statement w/r to the 620 was ment to be tongue-in-cheek as IBM had given a lot of publicity (both internal and external) to the coming of the 620 since about the time it entered the design phase ... you are correct that it is really a case of having missed its window in the family evolution than actually being scrapped.

    Natty

    Nonetheless, thank-you for your clarification.

    --
    Maybe the rain Isn't really to blame. So I'll remove the cause, But not the symptom!
  85. Using Big RAM in the Real World by billstewart · · Score: 2
    There are several different limits you'll encounter, which will have different effects on the programs you write as well as on the machines you can afford to use...:
    • Physical RAM limits - at ~$100/GB, commercial applications can easily afford a TB of RAM, but it's not that simple putting it into one computer - with 4GB RAM boards, that's 256 boards, which won't fit on your average PC ATX motherboard.... Solid-state disks can do the job, with obvious differences in performance from RAM or cache, but they usually lead the CPU-memory market by a couple of years. By the time you can get a normal machine with 1TB of RAM, that Power4/970 will be an antique, and you'll be complaining about the lame 48 bits in your wristwatch's address space.
    • Virtual Address space you can mmap() or put in a single array - this makes a lot more difference, because it affects whether your programs can view the large space as a segmented or unsegmented thing, and whether you need visibility into the memory structure, and whether you can get the visibility if you want it. For most scientific applications, it's not a big deal, because you seldom need it all in a big 1-D array; more commonly, anything that needs a huge address space is working in 2-D or 3-D, so you run an ugly setup thing to get your array pointers initialized and then return to normal programming.
    • Virtual memory for files and databases, though, is more of a problem. It's convenient to be able to treat the disk as one big flat virtual array, even if you do a certain amount of explicit management on the relationship of disk and RAM rather than leaving it all up to the VM system. A terabyte of disk is under $2K these days, though if you want faster disks and faster busses you might pay a bit more, but the costs of disk have been diving faster than CPUs for a few years, and anything below about 47 bits of address space for disk starts to heavily eat into the programming effort for products that are meant to work for a few years.

    Almost two decades ago, I was supporting a network modelling application that needed 12MB, and our computer was a VAX with 4MB of RAM, but fortunately 4.1BSD and System 5.2p were able to give it enough virtual memory to scrape along. Our typical runs took about a week, until a couple years later when the price of RAM dropped to the point that we could afford to upgrade to 16MB, at which time it dropped to an hour per run.

    One thing we found out was that as you approach the limits of a machine's capacity, all the details of the architecture that you were able to ignore on smaller problems become visible, like how the TLBs work and what the memory page sizes are; things became somewhat clunky at 6MB and much more clunky at 22MB (or maybe 24MB), and some of that may have been the OS rather than the hardware. The extra 12MB RAM cost us approximately one person-year's salary, but unfortunately corporate accounting rules made it much harder to buy capital equipment than to make our study take a year longer.

    At this point, I'm supposed to include the obligatory old-geezer rant about walking to the mainframe, five miles in the snow, uphill both ways, carrying punch-cards and hand-winding magtape, but that was back when I was an undergrad, and we didn't actually go to the mainframe, just the card-reader/printer/keypunch room which was half a mile away (still snow and hills), and the reason I handwound the magtape was because the professor's only copy had a cracked reel, though we did use hand-cranked papertape tools to do it, and I really _did_ wind papertapes by hand :-)


    I haven't dealt with these problems lately - Moore's Law has long passed the limits of any problems I solve in practice, laptop diskdrive sizes have gotten two jumps ahead of Microsoft bloatware (though fitting backups onto CD-Rs feels a lot like fitting them onto floppies used to), and the only practical application I've got that could think about pushing the terabyte disk boundary is Tivo, if I'm willing to devote that much resources to something that makes me watch more television... The reason it took so long to get the 120GB drive on my 233MHz Pentium wasn't cost, it was BIOS upgrades :-), and if I decide to rip all my CDs onto the disk, it'll mainly be just because I can.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  86. Re:unfotunatly Apple is going with Intel instead.. by Toraz+Chryx · · Score: 2

    "There's nothing magical or mysterious about emulating one 32-bit architecture on another, and writing a PowerPC emulator for the x86 would not be significantly different to writing an x86 emulator for the PowerPC"

    Go count up the PPC registers.. then the registers in the x86 architecture...

    With JIT "acceptable" speed would be possible, but I wouldn't expect more than 601-66 performance from a high clocked P4 running interpretive PPC emulation..

  87. Last Post! by alpg · · Score: 1

    BOFH excuse #207:

    We are currently trying a new concept of using a live mouse. Unfortuantely, one has yet to survive being hooked up to the computer.....please bear with us.

    - this post brought to you by the Automated Last Post Generator...