Slashdot Mirror


ArsTechnica Compares the P4 and G4e: Part II

Deffexor writes "It looks like Hannibal of ArsTechnica fame has put Part 2 of his original comparison article between Intel's P4 and the Apple/Motorola G4e. In a nutshell, this second article covers the execution core, the AltiVec unit and SSE2, as well as a myriad of other interesting factoids. An interesting read, if not a little technically intense for those of us with less than a CE/EE degree. Have at it boys!"

192 comments

  1. As Always an interesting read by xamfear · · Score: 1

    gotta love AT, they dont publish often but when they do its fantastic stuffs.

    1. Re:As Always an interesting read by tom.allender · · Score: 1

      Shame half the links are to files on the "F:" drive...

  2. Allright! by Ashcrow · · Score: 2, Interesting

    This is exactly what I've been trying to find out for some time now. I've been increasingly upset with the x86 line of chips since it seems that there is hardly any diffrence between 600Mhz and 1.2Ghz.

    1. Re:Allright! by fault0 · · Score: 1

      But.. for the next generation of apps (games, especially), a 600mhz box will not cut it. a 1.2ghz one may. and 1.5-2ghz probably will.

    2. Re:Allright! by JCMay · · Score: 2

      I disagree completely.

      Doubling the clock speed *should* provide noticable differences in performance for *all* applications, not just new games. If these new games are all that benefit, then the hardware has been designed wrong for a general-purpose PC.

      Case in point: here at work my old P2-266 was replaced with a P3-500. I noticed absolutely NO difference in performance. My Linux box (K6-2/400) at home seemed slow with only 64 MB of RAM. Adding 128MB to it (192 MB total) did more for performance than doubling the clock speed of the machine at work!

    3. Re:Allright! by shakamojo · · Score: 1

      Check the bootimes. I upgraded my Celeron 333 to a PIII 500 and it booted in about half the time. Upgraded that PIII to a P4 1.6 and it boots in about 3/4 of the time that the PIII does. In the end, if you aren't going to USE the power, then don't bother upgrading. Don't think that a 2 Gig P4 is going to make your word processor run faster.

    4. Re:Allright! by LinuxAntM · · Score: 1

      Boot times have less to do with proc power and more to do with OS implementation. Case in point may slackware system PII350 boots much faster than my win2k PIII500

    5. Re:Allright! by reflective+recursion · · Score: 2, Informative

      Not exactly. I'm sure you know clock speed is meaningless for determining how fast a CPU really is. Now higher model numbers should mean huge advancements, and they do many times. One reason all applications are not sped up is because the bottleneck is in different places for different applications. If you tried to play those same games that sped up on a newer CPU, but with an older graphics card, well it would be exactly the same. The word processor would then seem faster than games. Another thing, many games take advantage of special CPU instructions which many applications do not need.

      Anyone who has ever upgraded or put together their own PC should know that performance depends on all parts, not just the CPU or memory. Maybe your applications don't speed up because you are still using that old hard drive, which is the bottleneck? Yes, it would be nice to purchase a magic chip that you throw into a computer which speeds everything up dramtically, but the reality is no computer (that I know of) works that way. I'd love it if I could install a new CPU and get 20% more bandwidth, but it just ain't gonna happen. The reason your applications do not seem faster is because they are probably already as fast as possible (to detect by any mere mortal, that is).

      --
      Dijkstra Considered Dead
    6. Re:Allright! by Anonymous Coward · · Score: 0

      from my (limited) tests, 'old' celerons (pre 800mhz) were severly crippled, performance-wise. I used a c400, and it was about half as fast as my k62-333. i havnt used any new ones, so i couldnt tell you more.

      I too have noticed little difference between a 333 and a 550, which is unfortunate. i do make use of the slight improvement with all the graphical apps i use, but it still isnt what i was hoping for.

    7. Re:Allright! by Anonymous Coward · · Score: 0
      Yup, I think that it is only worth upgrading a computer when the cumulative component improvement will lead to a 5-10x performance increase over the old one.

      So, upgrade a 333MHz Celeron to a 1GHz Duron, more memory, and a faster graphics card. Of course, you can interleave the upgrade purchases to effectively get a new toy every 6 months or so, and a new system every 1.5 years to 3 years depending on how often you upgrade, and what you use the system for.

      People don't notice smaller performance increases - especially under 2x the speed, like your upgrade!

      Gaming, that is different, and expensive.

    8. Re:Allright! by Jimmy+Seven+Nads · · Score: 0

      That's because it's Linux.

      The power of an 80's calculator, and 100 times the boot time.

      Fucking shit.

      --
      Luca Turrilli makes me call him Daddy.
    9. Re:Allright! by Anonymous Coward · · Score: 0

      Do they make 1ghz durons? I thought 950 was the limit.

    10. Re:Allright! by Datafage · · Score: 2

      Um, yeah, but he said he just upgraded the system, which means it had the same OS, so that doesn't apply.

      --

      Nicotine free Amish .sig.

  3. Another view by Mik!tAAt · · Score: 5, Funny

    Here's another comparison: Joy Of Tech (and the next 6 pages as well)

    --
    This is the place where you write something that will make you seem like a complete idiot.
    1. Re:Another view by Anonymous Coward · · Score: 0

      LOL!

      Now why on Planet Earth didn't /. link to that earlier?

  4. technically intense.. by smaughster · · Score: 4, Funny

    >An interesting read, if not a little technically intense for those of us with less than a CE/EE degree.

    Tell me about it, I do have more then CE, two letters even, namely MCSE and even I had to stop when they started throwing around the heavy stuff. I mean, A = A + B is supposed to make sense even if B isn't equal to zero.

    --
    I intend to live forever, so far so good.
    1. Re:technically intense.. by Anonymous Coward · · Score: 0

      I mean, A = A + B is supposed to make sense even if B isn't equal to zero.

      Think of A and B as containers. The value of the contents of A plus the value of the contents of B, and the sum is then stored in A.

    2. Re:technically intense.. by Lars+T. · · Score: 1

      He probably used IE's Link Click Wizard ;-)

      --

      Lars T.

      To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

    3. Re:technically intense.. by xbrownx · · Score: 1

      you're a Microsoft Computer Engineer?

      Wow...

    4. Re:technically intense.. by Phrogz · · Score: 1, Offtopic
      ...I do have more then CE, two letters even, namely MCSE...
      A = A + B is supposed to make sense even if B isn't equal to zero

      That doesn't apply. The important formula here is that A>=A+B if and only if B is non-negative. MCSE-CE = MS, and since you didn't understand the full article, this implies that the addition of MS to a title actually reduces knowledge, not increases it.

      :)

  5. conclusion by Scrameustache · · Score: 2, Redundant

    Big article, only had time to glance over it (and I'm not technically qualified to understand it in its full detailed glory), but as far I can see, the dude isn't picking sides. Wich is a rare treat.

    Although he does confirm Steve Job's words of wisdom: Mhz aren't everything :)

    (I, on the other hand, am picking sides)

    --

    You can't take the sky from me...

  6. G5 is coming soon by Lemur+catta · · Score: 3, Interesting
    Its not really fair to compare the G4 and the P4, since the G4 was aimed at competing with the P3.

    A more interesting comparison will be to pit the P4 against the comming G5. According to the Register, Apple has begun seeding early G5's at up to 1.6GHz to key devlopers. Other sources are claiming limited yeilds in the 2.4GHz range already.

    There's still bugs to be worked out before production ramps up for release early next year, and supposedly AltiVec will not be as strong on the G5 as it is in the G4. But at 2.4GHz on an already-superior FPU, who needs it?

    1. Re:G5 is coming soon by Anonymous Coward · · Score: 1, Funny
      Other sources [macosrumors.com] are claiming limited yeilds in the 2.4GHz range already.

      Oh yes, and we all trust MOSR.. especially me, as I type this on my flat-panel iMac. Oh, wait, that's not a product (yet), isn't it? :-)

      Seriously.. rumour sites aren't credible.. don't try to make them be.

    2. Re:G5 is coming soon by Scrameustache · · Score: 1

      Does anybody know if the fabled G5 will be compatible with the existing motherboards of the G4 machines?

      ie: Will I be able to just buy a chip upgrade and keep my pretty tower?

      --

      You can't take the sky from me...

    3. Re:G5 is coming soon by Anonymous Coward · · Score: 0
      Does anybody know if the fabled G5 will be compatible with the existing motherboards of the G4 machines?

      No, it won't. Some third party might build some sort of interface to let a G5 pretend to be a 32-bit PPC for back compatibility, but it would be largely piontless since you'd be cut off from the 64 bit addressing and instructions, and have none of the G5's bandwidth improvements.
    4. Re:G5 is coming soon by Anonymous Coward · · Score: 0
      Yes, but it won't be the cheapest of options. It will consist of a G5 - G4 bus translator, and the memory slots will also have to go on the ZIF socket along with the G5 and bus translator.

      You will however be able to use the G5 as a full 64-bit machine, with a small performance issue when accessing anything on your motherboard. This happens whenever fusing new technology to old technology however.

      Don't expect this for a while either.

    5. Re:G5 is coming soon by tmark · · Score: 3, Insightful

      It *is* fair to make these comparisons since the G4 and the P4 are the best Motorola and Intel have to offer us, right now. It's irrelevant what the G4 was "aimed at competing with"; what is relevant is what we have in our hands now.

    6. Re:G5 is coming soon by Lars+T. · · Score: 1

      You could, if somebody made a upgrade, which is possible, but since non of the upgrade makers have made a G4e/7450 card yet, it's unlikely they will make a G5 card in the near future (and I'm not talking Internet time).

      --

      Lars T.

      To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

    7. Re:G5 is coming soon by horster · · Score: 1

      thank you! isn't that obvious?

    8. Re:G5 is coming soon by Peter+La+Casse · · Score: 1

      "Fair" is an interesting word. If I make a product that is specifically designed to compete with my competitor's Product X, it's not accurate to say that my product is a failure if my competitor's Product X+2 is better than my product. Is it "fair" to judge something using criteria that it was never intended to consider?

      For example, the K6 was designed to compete with Pentium MMX and early Pentium II chips. There should be few surprises presented by a study that compares the performance of the K6 to the performance of the Pentium IV.

    9. Re:G5 is coming soon by Space+Coyote · · Score: 1
      Actually, MOSR is one of the better rumour sites. Their next-gen iMac pieces are worded more like 'what prototypes Apple has in-house at the moment. You'd be stupid if you didn't think they have at least several designs for a flat-panel iMac, they're just sticking with the current one because it's not yet economical to release an LCD-based consumer machine.

      They also seem to be more adept at spotting a bad photoshop job than some sites :)

      --
      ___
      Cogito cogito, ergo cogito sum.
    10. Re:G5 is coming soon by Sloppy · · Score: 2

      Its not really fair to compare the G4 and the P4, since the G4 was aimed at competing with the P3.

      It is fair to compare whatever happens to be on the market at any point in time.

      If Motorola is reacting instead of acting, that's just too bad for them. It doesn't make the comparison unfair.

      A more interesting comparison will be to pit the P4 against the comming G5.

      Yes, that will be a good and interesting comparison when I can buy a machine with a G5 in it.

      --
      As copyright owner of this comment, I authorize everyone to defeat any technological measure which limits access to it.
    11. Re:G5 is coming soon by iceburn · · Score: 1
      Its not really fair to compare the G4 and the P4, since the G4 was aimed at competing with the P3.

      The article wasn't really a shootout of the two processors. The G4e was simply used as a point of reference as to how a normal processor does things compared to the way Intel does things to crank up the Mhz. The G4 seems to me to be a great reference when analyzing things like the P4s insanely deep pipelining and such.

      --
      A sphincter says what?
    12. Re:G5 is coming soon by Anonymous Coward · · Score: 0

      I believe he was compairing the G4e to P4 and it is a fair comparison.

      I wouldn't trust the Register anymore than I would trust cnn.
      The G5 isn't out yet. From what I understand the G5 is a true 64-bit processor. The G4e and P4 are still 32-bit processor class.

  7. ppc power by peachboy · · Score: 4, Insightful

    i personally believe that flexibility of the assembly instructions as well as the number of instructions executed per cycle contribute greatly to the dominant speed (at any given MHz/GHz) of the ppc processor. compare any intel/amd processor to a ppc at the same clock speed, and the ppc will kick its x86 ass.

    the high end ppc desktops are topping out around 900MHz, while the p4's are hitting 2GHz. there has to be another explanation besides the complaint that jobs is ignorantly sitting on his thumbs. i think he knows what he's doing.

    note: i am not a mac zealot.. i don't even own a mac - only 4 x86 pc's (1 athlon, 2 p133, 1 p120). i simply can appreciate the speed of the ppc.

    --
    "I just want to thank my coach Eric a.k.a. Disco for shattering my reality..."
    1. Re:ppc power by tswinzig · · Score: 3, Insightful

      the high end ppc desktops are topping out around 900MHz, while the p4's are hitting 2GHz. there has to be another explanation besides the complaint that jobs is ignorantly sitting on his thumbs. i think he knows what he's doing.

      What a strange statement, considering that neither Jobs nor Apple create the PPC chips!

      --

      "And like that ... he's gone."
    2. Re:ppc power by Anonymous Coward · · Score: 0

      What a strange statement, considering that neither Jobs nor Apple create the PPC chips!

      They are part of an alliance (IBM, Motorola and Apple) that designs the chips so technicaly they do create the chips.

    3. Re:ppc power by autopr0n · · Score: 0

      the high end ppc desktops are topping out around 900MHz, while the p4's are hitting 2GHz. there has to be another explanation besides the complaint that jobs is ignorantly sitting on his thumbs. i think he knows what he's doing.

      Maybe he does, maybe he doesn't. But either way, he has no control over the speed of PPC chips, which are made by Motorola.

      --
      autopr0n is like, down and stuff.
    4. Re:ppc power by Elwood+P+Dowd · · Score: 1, Troll

      Well, it's convenient that we don't have to compare PPCs and x86s at the same clock speed.

      --

      There are no trails. There are no trees out here.
    5. Re:ppc power by Christopher+Thomas · · Score: 5, Interesting

      the high end ppc desktops are topping out around 900MHz, while the p4's are hitting 2GHz. there has to be another explanation besides the complaint that jobs is ignorantly sitting on his thumbs.

      Two factors come into play here.

      The first is that, if I remember correctly, PPC and x86 chips use a different clocking scheme. This means that clock rates between them aren't even directly comparable (what a "clock" is depends on the clocking scheme).

      The second is that it's perfectly possible that the PPC architecture is limited to lower clock rates than the x86 architecte. Signal propagation through gates takes time. If one architecture expects signals to propagate through logic three gates deep per clock, and another architecture expects signals to propagate through logic five gates deep, then of *course* one will have a faster maximum clock rate than the other. They would hopefully still be doing the same amount of work per unit of real time.

      You should already be familiar with this from the Athlon/P4 spin war. A 0.18 micron Athlon core simply cannot be clocked as quickly as a 0.18 micron P4 core - no matter what you do. Does this make the Athlon automatically a poor performer? No, because it can do more per clock. Does this make the Athlon automatically kill the P4, because it "can do more per clock"? No, because the P4 can be clocked faster. Only real benchmarks will tell.

      A point against Apple is that Apple has been allergic to publishing SPECmarks for its processors for the past couple of years (the only PPC-ish benchmarks are IBM's benchmarks of the Power series of chips, which forked after the G3 IIRC). This removes a very consistent (if somewhat flawed) means of comparison.

    6. Re:ppc power by Anonymous Coward · · Score: 0

      the AIM alliance dissolved years ago.

    7. Re:ppc power by Webmonger · · Score: 3, Interesting

      Of course there's another explanation-- namely, it's not Jobs who's sitting on his thumbs-- it's Motorola. (Ouch-- can you imagine Motorola sitting on Jobs' thumbs?)

      Furthermore, Motorola isn't sitting on their collective thumbs-- they're simply targetting a market whose requirements are different from Apple's. The embedded market.

      This is all hugely ironic, because the RISC architecture was s'posed to result in chips that could cycle faster but did less per cycle. Instead, it's the CISC chips that are cycling faster and doing less per cycle.

    8. Re:ppc power by Anonymous Coward · · Score: 0

      Eh. Recent Athlons and Pentiums are RISC inwardly, and use a CISC-wrapper for compatibility reasons. Yeah.

    9. Re:ppc power by Hal-9001 · · Score: 1
      A point against Apple is that Apple has been allergic to publishing SPECmarks for its processors for the past couple of years (the only PPC-ish benchmarks are IBM's benchmarks of the Power series of chips, which forked after the G3 IIRC). This removes a very consistent (if somewhat flawed) means of comparison.
      That, and Apple seems to have this delusion that Photoshop (highly optimized for the Macintosh) is everything. That's akin to arguing that the P4 is infinitely faster than the G4 just because something written for x86 doesn't run on the G4.

      Someone should create a site debunking the Photoshop myth. I would if I weren't so lazy...

      The truth, as usual, probably lies between the megahertz and Photoshop camps and is about what a resaonable person would expect: the G4 is roughly equivalent in performance to the P4 and Athlon, code can be highly optimized for any of those processors, and most users don't need that much computational power anyway.
      --
      "It take 9 months to bear a child, no matter how many women you assign to the job."
    10. Re:ppc power by Arandir · · Score: 2

      It's mainly the internal architecture of the chips. Is the memory address you need in cache or will you have to spend 50 clock cycles to fetch it? What if you want to read an address before a previous instruction finished writing it?

      The Intel chips are really fast at "hurry up and wait", but not nearly as fast at "hurry up and do something".

      --
      A Government Is a Body of People, Usually Notably Ungoverned
    11. Re:ppc power by Webmonger · · Score: 2

      Sorry, try again:

      In the end, I'm not calling the Athlon or P6 "RISC," but I'm also not calling them "CISC" either. The same goes for the G3 and G4, in reverse. Indeed, in light of what we now know about the the historical development of RISC and CISC, and the problems that each approach tried to solve, it should now be apparent that both terms are equally nonsensical when applied to the G3, G4, MIPS, P6, or K7."

      Of course I know that current x86 CPUs have a RISC-like core. They're still the heirs of the 8086, and that's what's ironic.

    12. Re:ppc power by iso · · Score: 3, Insightful

      Someone should create a site debunking the Photoshop myth.

      Actually the Photoshop benchmark is completely valid. Apple's largest market for their G4s is the designer community. The most popular application for those people, and the application they spend the most time waiting for, is Photoshop. Therefore, when they're out to buy a new computer, the most important thing to them is ... the speed of Photoshop!

      Additionally Apple has been using two benchmarks lately: Photoshop and movie compression. High-end video is Apple's second-largest market for their G4s, and this market spends most of their time waiting for video effects, such as compression. This is also a valid benchmark.

      Thirdly, microprocessors are increadibly complex, plus the end-user speed is also dependant on many factors including software and OS optimizations. It is absolutely useless to compare the speed of processors by some numerical benchmark. What's important to anybody who wants to be productive on a computer, is how quickly your key applications run. And lo and behold, this is what Apple is benchmarking.

      Remind me again why this benchmark is invalid? If anything I would suggest that any benchmark besides end-application speed is useless.

      - j

    13. Re:ppc power by Amoeba+Protozoa · · Score: 2
      Remind me again why this benchmark is invalid? If anything I would suggest that any benchmark besides end-application speed is useless.

      Well, that is of course unless you are an applications developer: in which case you would like to know some lower level benchmarks so you can take advantage of the things that the hardware is good at and converely stay away from the stuff that stinks.

      -AP

  8. Nice. by tcc · · Score: 5, Interesting

    But with the G5 around the corner, I think THAT will be THE interresting comparison.. expecially since Intel plans on keeping the P4 for a while (, ramping it up in speed, when you Read adobe saying the G5 are significantly faster than P4 (and if you go read the article, the same people do say that the P4 is faster than a G4 (exept for altivec stuff) so if they say G5 is faster than P4, it probably will be :)...it should be really nice to see something that kills the P4 in raw performance other than AMD).

    --
    --- Metamoderating abusive downgraders since my 300th post.
    1. Re:Nice. by shut_up_man · · Score: 2, Interesting

      Hang on... that article says "we'd caution against taking them as gospel", and that's coming from The Register, a site that has been... uh... less than correct on some issues in the past.

      Plus, the reports are from Adobe, who make Photoshop... an extremely Mac-optimised piece of software usually held up by the Mac brigade whenever they attempt comparative benchmarks.

      Look, I *want* to believe that the G5 makes great coffee, gives fantastic backrubs, cures cancer and runs faster than every P4. I do. I've just heard all these lines before, with the G4.

    2. Re:Nice. by Space+Coyote · · Score: 2, Insightful
      You don't think Adobe optimizes any of their code for x86? Of course they do, the reason why the G4 so soundly whips the P4 in Photoshop is because of the nature of the AltiVec units. If you had read the article, you would know this.

      I'm pretty sure Adobe doesn't care whether you spend your $900 on the Windows or the Mac version of Photoshop.

      --
      ___
      Cogito cogito, ergo cogito sum.
    3. Re:Nice. by Lars+T. · · Score: 1

      Photoshop is at least as optimized for x86 (by Intel engineers no less) as for PPC.

      --

      Lars T.

      To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

    4. Re:Nice. by Anonymous Coward · · Score: 0

      Which platform has the bigger user base (for Photoshop)? That's right! It's the Mac! Probably a 10-1 ratio too. NOW where do you think Adobe is going to concentrate it's efforts?

      So your claims of a level playing field are wrong. The photoshop benchmark is slanted.

    5. Re:Nice. by Arandir · · Score: 3, Informative

      Look, I *want* to believe that the G5 makes great coffee, gives fantastic backrubs, cures cancer and runs faster than every P4. I do. I've just heard all these lines before, with the G4.

      I just got back from a seminar with Motorola and the architecture of their new and upcoming chips. Some nice stuff on the horizon. The did mention some of the less than stellar performance on prior chipsets, and explained that it was due to not taking advantage of the chip features. An operating system or user application that must be backwards compatible will not be able to utilize the chipsets to full advantage.

      You don't judge high end chipsets based on mass market consumer applications.

      --
      A Government Is a Body of People, Usually Notably Ungoverned
    6. Re:Nice. by horster · · Score: 1

      > But with the G5 around the corner, I think THAT will be THE interresting comparison.

      NO it WON'T! the G5 (64bit right?) will be compared against products like amd's hammer, and intel's itanium line. Competition does not stand still, intel _also_ has a next geration chip in the works. So will we hear the same bs about the next chip generation when G5 get's smoked again while still costing twice as much?

    7. Re:Nice. by Lars+T. · · Score: 1

      More like a 50-50 ratio. Now where do you think Adobe concentrates its efforts? Right, where they get fast code the easy way.

      --

      Lars T.

      To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

    8. Re:Nice. by Anonymous Coward · · Score: 0

      Being an owner of two PCs and a Mac I must first say...

      G5 will be OSX only. Adobe needs to get their shit together and release Photoshop before they can compair anything.

      G4e and P4 are 32-bit class. G5 is a 64-bit processor and hasn't been released yet.

      Intel and AMD arn't sitting on their hands. They will have 64-bit processors to compete.

      In the end it all comes down to who has the biggest penis.

      Me!

  9. Maybe pointlessly detailed by Junks+Jerzey · · Score: 5, Interesting

    Note: I have a B.S. in computer science, a solid understanding of hardware issues, and have been programming for 19 years.

    When I read articles like this, there's so much detail that I find myself--even willingly--losing sight of the big picture. Sure, you could read a detailed write-up about Toyota's new engine, but those details don't really matter much unless you've just made a hobby of knowing about engines. Realistically, you'll have a hard time connecting those details to your driving experience. Heck, someone could put in a different engine, tell you that its a Toyota, and you'd be saying things like "Oh, yes, this feels just like a Toyota, I can tell that the designers did blah and blah."

    After the Pentium II generation of CPUs, things have gotten very, very muddled. Amazing features that are supposed to increase performance don't always do so. Sometimes they make things worse. Little compiler tweaks can make one program be twice as fast as another, given the same hardware. Chips with higher clock rates can be significantly slower than chips with 20% slower clocks. Certain applications run much faster than on previous chips, but there are others that show no increase.

    It's all very chaotic and confusing, even for people in the know. I suspect that if you took a program that people claimed to need a P4 or Athlon for--something very performance sensitive--and set yourself the task of making it run faster on a PII than an Athlon, you could do it. But that doesn't matter, as everyone seems to be clamoring for newer chips.

    1. Re:Maybe pointlessly detailed by well_jung · · Score: 0, Offtopic

      Will whomever modded the parent post "Insightful" please cue me into the insight?

      Using paragraphs is not insightful.

      --
      Carl G. Jung
      --
      "With one breath, with one flow, You will know Synchronicity" -La Policia
    2. Re:Maybe pointlessly detailed by Rogerborg · · Score: 2
      • Will whomever modded the parent post "Insightful" please cue me into the insight?

      Feel free to debunk it. Explain why it's better for developers (and the user experience) to have to work out how to optimise to a new pipeline every couple of years, rather than to squeeze every last drop of speed out of one design before moving onto the next on (e.g.) a five year cycle.

      Ever looked at the specification of a Playstation and wondered how on earth developers got it to do what they had it doing by the end of its lifecycle? Ever wondered why early Playstation 2 games bit the weenie?

      I'm not claiming that the original poster was right, or you are wrong (if that's indeed your position), I'm saying that it's debatable. How about debating rather than sneering?

      --
      If you were blocking sigs, you wouldn't have to read this.
    3. Re:Maybe pointlessly detailed by Anonymous Coward · · Score: 0

      If you don't know how to use whomever, and don't want to look foolish, then err on the side of caution and use whoever.

    4. Re:Maybe pointlessly detailed by wct · · Score: 3, Insightful

      Feel free to debunk it. Explain why it's better for developers (and the user experience) to have to work out how to optimise to a new pipeline every couple of years, rather than to squeeze every last drop of speed out of one design before moving onto the next on (e.g.) a five year cycle.

      The whole point of modern processor architectures is for the work of pipeline optimisation be done by either the processor or the compiler. The goal is less work for the application developer. This coincides with the trend towards higher level programming languages - I can't think of any large major applications architected such that the code needs to be hand optimized at instruction level. Sure, after profiling some parts may be tuned, but the size of applications today are just too difficult to design in overview with that scope. A huge amount of the transistor budget these days is taken by microcode that performs these optimizations on the fly, but with architectures such as Itanium you're seeing a move towards compile time ordering. But now I'm getting sidetracked...

      Ever looked at the specification of a Playstation [e-scapegames.co.uk] and wondered how on earth developers got it to do what they had it doing by the end of its lifecycle?

      This is a time-honoured trend in closed-hardware systems. Look at the last generation of games on the SNES. Look at the scene demos being put out for the Amiga in the mid 90's, essentially a decade old hardware platform at that point.

      Ever wondered why early Playstation 2 games bit the weenie?

      All first generation titles do not harness the full capacity of a machine; I think this is your point. But on open systems developers don't have to optimize anything, thanks to Moore's Law. Maybe it doesn't strictly live up to the "small-is-beautiful" aesthetic, but software development is about optimizing results. Time will be spent where the greatest payoff is, and since performance boosts are a natural consequence of progress more resources are devoted to development.

    5. Re:Maybe pointlessly detailed by well_jung · · Score: 1

      I'm not disagreeing with him. But you seem to actually made a point, whereas he pretty much just said "it's confusing". Well yeah, it is confusing. Anyone trying to figure it out realizes that. I don't understand what was insightful about that.

      Maybe you, or someone else read more into his comments than I did. That's why I asked to be clued in.

      --
      Carl G. Jung
      --
      "With one breath, with one flow, You will know Synchronicity" -La Policia
    6. Re:Maybe pointlessly detailed by Rogerborg · · Score: 2
      • The whole point of modern processor architectures is for the work of pipeline optimisation be done by either the processor or the compiler

      How long has Microsoft Visual Studio 6.0 been out for? How many new pipelines have come out in that period. Thanks for making my point for me. ;-)

      --
      If you were blocking sigs, you wouldn't have to read this.
    7. Re:Maybe pointlessly detailed by the_2nd_coming · · Score: 2

      that is why I think the best compilers are the ones that the chip makers create. Intel releases a new compiler or update with every iteration of the pentium because they want to have an optimised compiler for the chip to take full advantage of its abilities. If MS and other groups whould make deals with the hardware vendors
      to use there compiler and have a plugable interface into the IDE so you can place the compiler of your choice into it......but MS would neve do that would they :-)

      --



      I am the Alpha and the Omega-3
    8. Re:Maybe pointlessly detailed by Anonymous Coward · · Score: 1, Insightful

      > After the Pentium II generation of CPUs, things
      > have gotten very, very muddled.

      To continue with your engine analogy, after the fifties things have gotten more complex. Variable valve timing is common. BMW is working on technology (if it's not in production already) that opens/closes valves using electromagents - no more camshaft! All electrical. Probably both more reliable and more efficient.

      Turbocharging is now common - the combination of a small displacement engine and turbo is commonly found and provides a good compromise of power and efficiency/mileage.

      As the technology increases, designs become more complex. This isn't always a bad thing.

    9. Re:Maybe pointlessly detailed by RedGuard · · Score: 1

      Actually the Intel compiler plugs into
      Visual Studio without any difficultly
      at all.

    10. Re:Maybe pointlessly detailed by the_2nd_coming · · Score: 2

      ok, cool, then what is all the fuss about?

      --



      I am the Alpha and the Omega-3
    11. Re:Maybe pointlessly detailed by Brand+X · · Score: 2

      The price tag on that compiler. $499 per seat for the first few developers. For Windows. And another $499 for the Linux version. Added to the V$ cost. (And then there's the $5k for the Forte package to extract the sun C++ compiler, which requires hacks to automake and autoconf to switch on)

      Guess who does the porting and profiling where I work...

      --
      -- Still waiting for the Nike endorsement
    12. Re:Maybe pointlessly detailed by the_2nd_coming · · Score: 1, Offtopic

      ok, ;-)

      --



      I am the Alpha and the Omega-3
    13. Re:Maybe pointlessly detailed by Anonymous Coward · · Score: 0

      It said in about 4000 words what I can say in 4:

      This article oversimplifies everything.

      There. Want me to say how I've been a functional alcoholic for 2 years? How I checked out a book on PHP and haven't returned it in 6 years? How I graduated with 164 credits, when only 128 were needed because I'm a hapless slacker?

      Done.

      Want me to karma whore?
      Not a chance.

    14. Re:Maybe pointlessly detailed by Anonymous Coward · · Score: 0
      Funny enough, as a gearhead, I can identify a modern Toyota engine by the slight lag in their variable timing. Honda's Vtechs are smoother but have less oomph when the high timing kicks in.


      And as a driver of a car with a plant that displaces all of 1.3 liters, has twin turbos, and no camshaft yet produces around 300HP I'm with the new tech. Granted she only gets 19 MPG, but the new rotaries are supposed to be much better ;)

    15. Re:Maybe pointlessly detailed by Anonymous Coward · · Score: 0

      Didn't the old (buggy) Cadillac 8-6-4 engine use electromagents on the valve covers? Of course, it still had a camshaft (or pushrods, I forget).

    16. Re:Maybe pointlessly detailed by dollargonzo · · Score: 1

      i agree... i think the reason is becuase, these days, people are not the ones designing and actually testing new chip architectures-- computers are. the p3 could have been used (although more likely a supercomputer) to design and test the p4, etc. also, no one engineer actually knows all the others parts of chip except the one he is designing. the alu designers @ intel probably couldnt design an fpu nearly as good as the fpu designers, simply because the computer chip is no longer the specialization of an engineer-- a specific, more detailed part is; so, when such articles begin to compare whole chips, and start to go into details, yuo quickly lose track, and easily get lost in the details. from this comes: if a scientist cant explain what he is doing to a 10 year old and have him understand in 100 words, he is no good. same here. most people dont know that much about computer architecture, so shorter explanations would do more good.

      --
      BSD is for people who love UNIX. Linux is for those who hate Microsoft.
    17. Re:Maybe pointlessly detailed by Anonymous Coward · · Score: 0

      Note: I have met assclown KnowItAlls like you in real life.

      You make no point, only vague generalizations which are flippant and pedantic.

      If you're going to reply, say something intelligent ok?

      If I want writing vague and childish, I'll grab harry potter.

  10. Re:4 is 4, right? by Dunall · · Score: 1

    Uhh, in a word, No.

    It'd be easy to use this in the video card realm as well. 2 is 2, right? Voodoo2 and a GeForce2.. I won't touch the microsoft jab.

  11. OT : Non Apple G4 boards? by Anonymous Coward · · Score: 1, Interesting

    Does anyone know if an ATX board with a G4 exists? I just started developing my own little OS and, frankly, x86 assembly stinks, I hadn't touched it for 4 years and didn't remember how crap that ISA indeed is. The 68k series were such a nice development platform, the PPC ISA looks quite cool as well.

    1. Re:OT : Non Apple G4 boards? by Anonymous Coward · · Score: 1, Informative
      bPlan have one called the Pegasos coming out soon (next year) with dual G4 sockets.

      Will cost user $1000 with graphics card, audio, firewire, processor, ethernet, memory, apparently.

    2. Re:OT : Non Apple G4 boards? by Anonymous Coward · · Score: 0

      as the other poster replied, bplan (http://www.bplan-gmbh.de/news.html) is doing a PPC board and entire systems around it in cooperation with merlancia (www.merlancia.com/MISEL/torro.html).

      However IME these PPC projects always turn out to be all smoke, no fire, and it gets frustrating being on the verge of having a solid established platform to build that BSD/ppc box to hack around with, then watching the fucking project go down the fucking john (PPCP/CHRP/PReP, POP, now they're taking their time at bplan...)

      Fuck I hate that. Could you tell? :p

  12. Great Article! by Uttles · · Score: 5, Insightful

    This article is extremely informative and gives you a good insight into how these processors are designed, as well as how they compare. I disagree with the poster though, you don't need a CE or EE degree to get the idea of what's going on. I'm a CE and I had classes on this sort of thing so yes I could follow all the gritty details, but I think the author did a good job of explaining things so that most people could understand. Also, I thought the author summed things up perfectly saying:

    The preceding discussion should make it clear that the overall design approaches I outlined in the first article can be seen in the execution cores of each processor. The G4e continues its "wide and shallow" approach to performance, counting on instruction-level parallelism to allow it to squeeze the most performance out of code. The P4's "narrow and deep" approach, on the other hand, uses fewer execution units, eschewing ILP and betting instead on increases in clock speed to increase performance.

    This is exactly the case. Unfortunately the popular masses don't understand all of this wide vs narrow stuff, so they go for the higher clock speeds. In reality, Intel is really pulling one over on us, charging more money and all we're getting is a higher clock rate, not a whole lot of performance gain. PPC has proven itself time and time again to be the better processor, but unfortunately they aren't used in very popular machines (mostly Macs,) so we don't get to reap the benefits.

    On a related note, this article touches on one of the many reasons why the Gamecube will run circles around the Xbox. GameCube's processor is a 485Mhz PPC designed specifically for video games, while the Xbox just uses a common Pentium running at 733 MHz.

    This all brings up a good question: why haven't Macintosh's or GameCube's marketers come up with a bench mark to put next to the processor speed? Maybe I missed it, but I've never seen a Macintosh commercial saying "comes with a G4 800 MHz, comparable to a P4 1.5 MHz." There might be too many legalities involved to do something like that, but it seems like they need to educate people somehow of the non 1 to 1 relationship between clock speeds of P4s and PPCs.

    --

    ~ now you know
    1. Re:Great Article! by west · · Score: 3, Interesting

      This all brings up a good question: why haven't Macintosh's or GameCube's marketers come up with a bench mark to put next to the processor speed? Maybe I missed it, but I've never seen a Macintosh commercial saying "comes with a G4 800 MHz, comparable to a P4 1.5 MHz."

      The problem is that you can compare processors in far too many ways. Apple likes to use Adobe Altivec-enabled applications when it compares speed (big suprise), but the reality is that comparing two processors is only marginally more useful than comparing two human beings. Computers are far too multi-purpose to be able to come up with a useful comparison.

      Processors are even worse... It's like trying to compare two brains without letting education or experience skew the comparison.

      In the end, manufacturers will choose from the dozens of possible benchmarks to make their processor look the best (or make up their own if none of the others will do).

    2. Re:Great Article! by VA+Software · · Score: 1

      "It's like trying to compare two brains without letting education or experience skew the comparison. "

      That's now easy ... http://news.bbc.co.uk/hi/english/sci/tech/newsid_1 638000/1638755.stm

      --

      ---
      http://slashdot.org/moderation.shtml
    3. Re:Great Article! by Anonymous Coward · · Score: 0

      Back in the Pentium II days Apple had a couple of advertisements like that (the PII on a snail was one, I'm not sure if there were others). They stopped doing them a couple years ago though. Now days the closest they get is a graph of Photoshop benchmarks on their website.

      IIRC Intel has some marketing ethics guidlines saying that it can't compare their CPUs to the compettition, only claim their CPUs are powerful. Which would explain why Apple can get away with dissing the Pentium and proving (at least with benchmarks) its inferiority and Intel just shows more idiots in colored suits (or crappy CGI aliens with the P4).

    4. Re:Great Article! by bmajik · · Score: 4, Informative


      In reality, Intel is really pulling one over on us, charging more money and all we're getting is a higher clock rate, not a whole lot of performance gain


      This is a debatable point. I think it is wrong to conclude intel is "pulling one over on us". It has been demonstrated that as more EU's are added, the effectiveness and utilization of EU's goes down. The quest for ILP comes to a crashing screeching halt before you even get to 4 EU's. IIRC, only one processor-scheduled CPU is designed with more than 4 EUs.

      The necessity of the chip to extract ILP in realtime is what leads us to these big hairy controllers and limited clock speeds. Controller shrink was what led to RISC in the first place, and now that we've had to add in superscalar "goo" there's hardly a difference between the CISC philosophy and the RISC one. Never mind that Intel chips have been re-writing CISC instructions as multi-EU uops forever.

      The point is, adding additional EU's has been desmontrated to be of dubious merit. Right NOW, the P4 speed improvements come from SSE2, just like the G4's speed improvements come from AltiVec. Both do essentially the same thing, although i've read more about AltiVec and it seems "cooler" :)

      The difference is this - When the P4 core hits 3ghz, its retire rate will just destroy anything a G4 or Athlon will do. Intel took the pipeline length hit NOW and will reap the benefits later.

      They also spent the time to get their prediction units as top notch as possible, because iirc statistically there will be > 3 conditional branches in progress in those ridiculous 20 stage pipes :)

      So - the problem with intel's approach - a single instruction takes longer to complete, and the fill/drain penalty for mispredictions is high.
      The retire rate however, is amazing, and the clock rate ramping ability is similarly amazing.

      Your assertion that MOTs approach _relies_ on adding additional EU's is surely incorrect, because "everyone" knows that controller complexity is again dominating cpus, and much of that is dedicated to extracting and managing ILP on 4 or less EUs (and that it just isn't there beyond 4.. i think the Power4 was supposed to have 6 EUs, and the Alpha 364 or 464 was going to have 8 ?)

      Intel has already "side stepped" the SuperScalar risc EU problem with IA64 - Thats what LIW does. LIW is interesting again now because of the reliazation that controller extracted ILP was too expensive and not good enough for the performance increases needed.

      --
      My opinions are my own, and do not necessarily represent those of my employer.
    5. Re:Great Article! by bmajik · · Score: 2

      Oh, I forgot to address this in my other reply:


      On a related note, this article touches on one of the many reasons why the Gamecube will run circles around the Xbox. GameCube's processor is a 485Mhz PPC designed specifically for video games, while the Xbox just uses a common Pentium running at 733 MHz.


      What horseshit. All console CPUs are outright or adapted cores of commodity outdated CPUs.

      the gamecubes cpu is not any more specifically designed for gamecube than the R4300 was "specifically designed" for the nintendo 64.

      Does the fact that XBox has a different memory system, "northbridge", cache size, etc, mean its 733mhz proc was "specially designed" for xbox ? You wont find any of the xbox core components in anything besides an xbox... it must all be CUSTOM ENGINEERED FOR GAMING!

      I try not to be a fan boy, I only ask that others do the same. Have you played anXbox ? What about a gamecube ? Have you done performance testing on their respective processors ? What about their GPUs ?

      You can speculate all you want to, but dont write some article about how "I've taken CE classes, I can tell that gamecube whoops xbox". It just smears your credibility all over the place.

      --
      My opinions are my own, and do not necessarily represent those of my employer.
    6. Re:Great Article! by Uttles · · Score: 2

      OK, well, first of all, I've played both. Like I said, the GameCube runs circles around the XBox. The animation is smoother, the control is more responsive, the load times are shorter, etc, etc.

      Secondly, they could be telling and outright lie, but in a press release by Nintendo which I unfortunately can't find right now, I read that the processor was a modified design that originated from a general purpose PPC and was designed specifically for the GameCube. Now, you may not believe that, but that's not the only time I've heard it, and based on the performance of the thing, I don't doubt it.

      --

      ~ now you know
    7. Re:Great Article! by Lars+T. · · Score: 1

      Intel's marketing ethic guidelines allow them to tell people that their processors make the Internet faster and the pictures more colorfull however. Anyway, as long as they are ahead in MHz, many people will buy them, no matter how fast they actually are.

      --

      Lars T.

      To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

    8. Re:Great Article! by Cheese+Metal+Rulez!! · · Score: 2, Informative

      The Gamecube's CPU is a PPC 603 with its FPU split in two and very fast custom busses for connection to the rest of the console so yes, I'd say it's "custom engineered for gaming".

      At least compared to that of the X-Box.

    9. Re:Great Article! by Anonymous Coward · · Score: 0

      The difference is this - When the P4 core hits 3ghz, its retire rate will just destroy anything a G4 or Athlon will do. Intel took the pipeline length hit NOW and will reap the benefits later.

      A similar argument could have been made by the MIPS designers when they went 'superpipelined' to compete with the superscalar processors that were coming out. As far as I know only HP have managed to pull the fast clock speed trick successfully -- and that wasn't sustainable. High clock speeds is a short term win that strains everything from branch prediction to the cooling fans.

    10. Re:Great Article! by Anonymous Coward · · Score: 0

      As a console developer I can say that playing a game can't determine if a console is better than another. Some games are geometry intensive, others are texture/fillrate intensive.

      PS2 is a monster on geometry/fillrate, XBox has great texturing engine (thanks nvidia), and the GameCube is in between.

      Also, remember that most 3d xforms and graphics effects are done on the GPU/VU. Just becausea console has a smoking CPU, It is still mainly limited by the GPU.

  13. His CPU Articles are like Tootsie Pops by well_jung · · Score: 2
    You start reading one, and pretty soon you're just scrolling down looking for pictures and key statements, too impatient to process or understand his evidence.

    Like with a Tootsie Pop, you start licking, and finally get too impatient and just bite the damn thing.

    Mmm...chocolatey goodness...

    --
    Carl G. Jung
    --
    "With one breath, with one flow, You will know Synchronicity" -La Policia
  14. Huh? by Anonymous Coward · · Score: 0

    "Intel introduced SIMD to the PC market with MMX on the PII."

    No, my dear sir, Intel introduced SIMD in the first generation Pentium, labelled as Pentium MMX which started at 233Mhz IIRC.

    And I've yet to see a single Sparc app using VIS except their crap ShowMeTV.

    1. Re:Huh? by LSD-OBS · · Score: 1

      SIMD, standing for "Single Instruction, Multiple Data" (ph34r my acronym skillz), has been around on x86 CPU's since the PI-166Mhz MMX, for the record. It should also be noted that there were clever ways to do simple SIMD-like operations using the standard registers and instructions.
      --

      --
      Today's weirdness is tomorrow's reason why. -- Hunter S. Thompson
  15. He misses one important difference... by Anonymous Coward · · Score: 4, Informative
    about floating point instructions: on the PPC, both the clsassical FPU and the Altivec unit have fused multiply-add instructions, i.e. a single machine instruction computes: RA=RB*RC+RD where RA, RB, RC and RD are arbitrary floating point registers. This takes the same time as a multiply, basically the add (which can also be a substract) step is free.

    The two operand Intel architecture does not allow the fused multiply add, so that the latency of such an operation is the latency of a multiply plus the latency of an add (and the destination register has to be one of the operands, although the other operand can be in memory, saving you a load). There are plenty of practical algorithms which benefit greatly from the fused multiply-add, for example polynomial evaluations, matrix multiplications, etc, a feature pioneered by IBM in the RS6000 series and that Intel is using in Inanium.

    And people who claim that you can do loop unrolling to hide the latencies should check their math: with only 8 registers, there is no way to hide the latencies of a multiply plus an add on a P4, while it is almost trivial on a G4 (32 registers and shorter latencies between accumulates). Furthermore many transcendental function evaluations are evaluated in libraries through polynomial approximations, which cannot be unrolled nor easily sped up: the number of coefficients is usually large enough to make the routine limited by the latency of the back to back floating point operations, but not large enough to take a divide and conquer approach.

    While the G4 is clearly the better architecture (not having double precision Altivec is not that important, I consider vector processing is only worth if you can do more than 4 elemnts per vector), the memory susbystem of the P4 is far superior. Hopefully the G5 will be comparable in this area (and I can't buy a desktop Power4 system :-().

    1. Re:He misses one important difference... by morbid · · Score: 0

      Would you please tell me what sort of vector sizes you would find useful? I am interested, and am playing about with SIMD code.

      --
      I'm out of my tree just now but please feel free to leave a banana.
    2. Re:He misses one important difference... by Anonymous Coward · · Score: 0

      Um the P4 has 128 FP registers not 8.

    3. Re:He misses one important difference... by Lars+T. · · Score: 1

      No, it doesn't. The Itanium has.

      --

      Lars T.

      To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

    4. Re:He misses one important difference... by Anonymous Coward · · Score: 0

      Something on par with Texas is always nice. In a pinch, you get get by with a Gabon-sized register, but only if you're really hurting.

    5. Re:He misses one important difference... by Anonymous Coward · · Score: 0
      Well, I believe that 4 elements is the strict minimum (I said more than 4 but that's a thinko).
      With two elements per register, which you can do with SSE2 in double precision, the win is not worth the complexity in the code to handle for example the corner case of an odd length vector.


      And yes, there is one case where you win with vector of length 2: complex arithmetic. Unfortunately it seems that Intel miseed the boat
      and that you can't do easily a complex multiplication (I've not found an instruction
      which would allow to perform an add and a substract in parallel or to change the sign of
      only one half of a register).


      For audio, the 8 items given by 128 bit registers are fine (the 4 items given by 64 bit MMX registers were a bit on the low side).

    6. Re:He misses one important difference... by morbid · · Score: 0

      I see.
      What sort of things are you calculating?

      --
      I'm out of my tree just now but please feel free to leave a banana.
  16. Error in the article by LSD-OBS · · Score: 3, Funny

    ----snip----
    add A, B
    mov C, A

    The first command adds the two numbers, and the second command moves the result from A to C. Of course, you still have the potential problem that the original value of A was erased by the add command, so if you wanted preserve A's value then you'd have to insert even more instructions to store A in a temporary register and then restore its value once the addition has been performed.
    ----snip----

    Not quite. I'm sure even people who _dont_ know x86 assembly language will realise all you don't need any extra instructions at all. Simply reorder them:
    mov C, A
    add C, B

    Obviously, the example was being used to show how much nicer it would be to have three or more operands in your instructions, but it was a lousy example.

    On a sidenote, we've been able to specify more than two operands with certain instructions since the 80386. Look up the syntax for the "imul" instruction.

    --
    Today's weirdness is tomorrow's reason why. -- Hunter S. Thompson
    1. Re:Error in the article by Uttles · · Score: 2

      Not quite. I'm sure even people who _dont_ know x86 assembly language will realise all you don't need any extra instructions at all. Simply reorder them:
      mov C, A
      add C, B

      True, but that's going to create some stalls, and is still two commands rather than one:
      add C, A, B
      as with the PPC chip.

      --

      ~ now you know
    2. Re:Error in the article by LSD-OBS · · Score: 1

      It was going to create a stall _anyway_, and it was already two instructions _anyway_. The point is, you don't need any extra instructions....
      --

      --
      Today's weirdness is tomorrow's reason why. -- Hunter S. Thompson
    3. Re:Error in the article by jonnythan · · Score: 1

      Well then couldn't the value of C be erased by the add command? You need extra instructions to retain the values of C and A.

    4. Re:Error in the article by Anonymous Coward · · Score: 0

      Why is this moderated as funny? :)

    5. Re:Error in the article by denzo · · Score: 2
      Why is this moderated as funny? :)
      Because the moderator had a "mov" operation performed on their brain before they could evaluate the content of the message.

      I don't know either, I don't understand. ;)

    6. Re:Error in the article by Anonymous Coward · · Score: 0

      So write:
      lea ecx, [eax+ebx]
      then and you preserve the original values while still doing an addition. Or how about:
      lea ecx, [eax+ebx*4+382479]
      and you have two additions and a multiply. How many instructions would that take in PPC asm?

      By the way, I'm an EE and have been programming asm on both x86 and PPC, and I thought that the article basically sucked.

    7. Re:Error in the article by Klatma · · Score: 1

      WTF article did you read? The article I read said:

      mov C, A

      add C, B
      The first command moves A into C in order to preserve the value of A, and the second command ands the two numbers.


      With a three- or more operand format, like many of the instructions in the PPC ISA, you get a little more flexibility and control. For instance, the PPC ISA has a three-operand add instruction of the format add destination, source 1, source 2, so if you wanted to add A to B and store the result in C without erasing the values in either A or B (i.e. "C = A + B") then you could just do:

      add C, A, B

      First actually read the article and make sure your dyslexia isn't acting up before you post something that makes you look like a complete idiot.


      And yes the article does say that the second instruction ands the two numbers when it should say adds.

    8. Re:Error in the article by LSD-OBS · · Score: 1

      Good call dude :) I wondered whether it was worth getting into detail like that, but you put it across nicely. Like I said, the example used was lousy.

      And I agree - the article basically sucked.

      PS: moderators - the above should be marked "Insightful". He makes a good point.

      --
      Today's weirdness is tomorrow's reason why. -- Hunter S. Thompson
    9. Re:Error in the article by LSD-OBS · · Score: 1

      What are you talking about? The purpose of the author's example was to store the result in C as opposed to overwriting A. Or maybe you just made a joke I didn't get.

      --
      Today's weirdness is tomorrow's reason why. -- Hunter S. Thompson
    10. Re:Error in the article by LSD-OBS · · Score: 1

      Actually, don't be so sure of yourself. The author changed the article after realising his error - what you see in my post was the original, duplicated with nice, healthy cut and paste. Don't call me an idiot just because you're slow on the trigger.

      --
      Today's weirdness is tomorrow's reason why. -- Hunter S. Thompson
    11. Re:Error in the article by Klatma · · Score: 1

      Idoit or not, that doesn't really matter, mainly because if the author changed it then its not your fault that it appeared wrong to me. What I think we can both agree on is that the author really needs someone to proof read his articles for him, since he obviously can not do it himself. I mean correcting one mistake and "anding" another, come on.

    12. Re:Error in the article by Anonymous Coward · · Score: 0

      And you just posted something that makes you look like a complete asshole, so I'd say he's still ahead of you.

  17. Maybe because it doesn't really matter by bill_mcgonigle · · Score: 2, Insightful

    This all brings up a good question: why haven't Macintosh's or GameCube's marketers come up with a bench mark to put next to the processor speed? Maybe I missed it, but I've never seen a Macintosh commercial saying "comes with a G4 800 MHz, comparable to a P4 1.5 MHz." There might be too many legalities involved to do something like that, but it seems like they need to educate people somehow of the non 1 to 1 relationship between clock speeds of P4s and PPCs.

    Cyrix used to sell PR parts, PR133 might have been a 116Mhz chip, but it was as fast as a 133Mhz pentium. So there's precedent, and it's probably legally OK, but I suspect the reason is it doesn't really matter.

    What really matters is that the CPU is fast enough for what you want to do. I run OS9, OSX and linux on my machines. My home machine is a G3/350, and it's plenty fast for running OS9 for everthing but compressing MPEG1 video. It's not fast enough for running OSX. My work machine is a G4/400 and it's just fast enough for running OSX. But it's not fast enough for compressing MPEG1 video. If I had a dual-800 G4 it would be more than fast enough for OSX, but it would still be too slow for compressing MPEG1 video. My linux machine is a Dual-800, P3. It's just fast enough for running linux with all the crap I have running. It's still too slow for compressing MPEG1 video, though. I also use a 1.2GHz Athlon machine occasionally, and I consider that just fast enough to run Windows 2000. I assume XP is similar. But it would still take a long time to compress MPEG1 video.

    So, how would you structure a comparison benchmark? SPECint? BYTEMark? PhotoShop duals? I think the answer is that you don't. It doesn't matter, as long as the computer is fast enough to do what you want it to do. The semi-annual MacWorld Photoshop duals are interesting since they actually show that the computer is too slow for designers but the Windows machines aren't any better. Perhaps they need to enunciate more, but I think their current stand of , "it's fast enough," is the mature one.

    --
    My God, it's Full of Source!
    OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
    1. Re:Maybe because it doesn't really matter by morbid · · Score: 0

      FWIW (probably not much)
      During the War, my father had a Cyrix PR133+ (running at 110MHz) and I had an Intel P100 (running at 100MHz obviously).
      The rest of our hardware was almost identical.
      I was running Linux and he was running Win95.
      I used the old Byte UNIX benchmarks to test my machine. Then I transplanted the hard disk into his machine and ran them again. Overall his machine came out very slightly slower than mine.
      Where it really looked slower was Floating Point, but it was faster at integer. My score was (IIR) 12.9 and his was 12.8, or something like that.

      --
      I'm out of my tree just now but please feel free to leave a banana.
    2. Re:Maybe because it doesn't really matter by Anonymous Coward · · Score: 0

      You consider an Athlon 1.2 "just enough to run Windows 2000?" Dude, I've been running it on a PII 300 *very* comfortably for over a year. It just needs RAM - feed it lots of RAM and burp it afterwards and it'll be a good little baby.

    3. Re:Maybe because it doesn't really matter by Anonymous Coward · · Score: 0

      Back in Da Nang, there were these three hookers who had some nice gitups.

      It was always a chore to multitask them, since it was more cooperative mt than anything else -- they had long fingernails and liked to bite.

      I found the less desireable ones to struggle with integer functions, particularly converting inches to feet, but the fast ones generated way too much heat for the Viet Nam climate.

      Go with what works.

  18. pointless by SpacePunk · · Score: 0, Offtopic

    The whole comparison is completely pointless. The majority of machines running Intel processors are running Windows. The majority running Motorola processors are running some version of the Mac OS. Might as well have Car and Driver running a comparison of a Jaguar S-type and a 10-ton dumptruck. (which processor is the equiv of a Jag is left as an excercise for the reader)

  19. Benchmark tests for beowulf clusters? by Anonymous Coward · · Score: 0

    Come on now! How can /.ers REALLY know which is the best CPU for us if there's no BEOWULF cluster benchmark comparisons between the G4 and the P4.

    I want to imagine.

  20. MOSR is more reliable than /.! by Anonymous Coward · · Score: 1, Funny

    Last month they said the G5 was going to be released with that iPad wireless web tablet (not the iPod, the tablet photoshop job). It scales perfectly to 16GHz, costs $0.13 each, and does 543TFlops.

  21. question: by Anonymous Coward · · Score: 0

    Are you stupid or just idiotic? Or maybe just completely humourless?

  22. Summation for the "Shiney Object" Crowd. by GISboy · · Score: 1

    Of which, I "R" in that group.

    While the G4e has fairly standard, fairly unremarkable floating-point hardware, the PPC ISA does things the way they're supposed to be done
    [snip]
    The P4, on the other hand, has slightly better hardware but is hobbled by the legacy x87 ISA.


    I could have sworn tomshardware stated it best as "Essentially we have a P4(86) 2Ghz".
    I'm paraphrasing, mind you and possibly taking it out of context, *but* instead of increasing the cache (instruction/data/registers) they combined and dropped it down to 8k of instruction and data.

    Oh, and on the P4 vs AMD's XP chip, how would this analogy be changed or overhauled as it stands with the P4 vs G4e?

    I'd really like to know. Or have a better "real world" analogy geared for the newbie user who usually winds up asking me, and I have to be able to explain complex things in simple terms to myself first.

    Thanks.

    GISboy

    --
    If it is not on fire, it is a software problem.
  23. the poster is wrong by Anonymous Coward · · Score: 0

    please would someone re-moderate this wildly incorrect +3 post

  24. caution? by Anonymous Coward · · Score: 0

    How is it more careful to use one word over another? What are the mortal dangers of using 'whomever' incorrectly that don't apply to using 'whoever' incorrectly? Should I use 'he' instead of 'him' incorrectly, too? Or maybe I should just *gasp* dare to make mistakes so that I can learn from them?

    1. Re:caution? by Anonymous Coward · · Score: 0
      Whom and whomever are falling out of use simply because people have difficulty with them. There's no problem, especially in informal settings such as here on Slashdot, with just using who and whoever everywhere -- it's still considered correct. Using whom/whomever makes it appear that you think you know better than the rest of the English-speaking world who are content with who/whoever. That's fine if you get it correct, but if you don't, you set yourself up as a target for grammar trolls like me.

      It's good that you dare to make mistakes so that you can learn from them. More people should be like you. But in this particular case, how would you have known you made a mistake if it were not pointed out?

      Glad to have been of service.
      Your friendly grammar troll :)

  25. Oops, correction by LSD-OBS · · Score: 1

    We're still thinking according to the Pentium I mindset. Stalls like that one aren't really an issue anymore thanks to the out-of-order instruction execution scheme that has been evolving since the P-II.
    --

    --
    Today's weirdness is tomorrow's reason why. -- Hunter S. Thompson
  26. Re:Who let this one out of his cage? by smaughster · · Score: 2

    Next time, I'll include those humour tags for those who do not recognize it :)

    As for the math part, (yes I knew it was about computer instructions) there isn't a single problem with A = A + B with B not equal to 0. Modulo arithmic for example. But then again, I am not sure what grade math level you need to have for that :-)

    --
    I intend to live forever, so far so good.
  27. Um... by autopr0n · · Score: 0

    What does the motherboard have to do with the case? Just pop the board out and replace it with another.

    It always amazes me the way Mac-heads will use all kinds of adaptor cards to upgrade old motherboards, which of course would cripple performance.

    --
    autopr0n is like, down and stuff.
    1. Re:Um... by Matthias+Wiesmann · · Score: 3, Insightful
      The problem is, Apple has stopped offering motherboard upgrades some time ago. So you can pop the board out and... do nothing. Buying a never motherboard is not trivial. Also both motherboard and case design tend to change a lot, there is no such thing as an ATX case.

      The whole market for motherboard upgrades comes from this situation. Apple does not support motherboard that it did not manufacture, and the OS used to check for the precence of genuine ROMs. So third parties could not build replacement boards. By upgrading only the CPU subsystem, the rest of the motherboard would remain genuine Apple and therefore run the system without problems.

      Also remember that Mac hardware tends to be more expensive and last longer than PCs. While the performance boost you win by upgrading only the CPU system is lower, the impact on the workstation is also lower. Changing a motherboard means changing the system, having new drivers, so basically more maintainance work.

      This situation might change with darwin, theoretically, nothing prevents some company from producing PPC motherboards, recompile Darwin for it and then build a installer that instals OS X on top of Darwin. Old machines that Apple does not support can run OS X this way.

    2. Re:Um... by MaxVlast · · Score: 1

      Also, don't forget that the New World (ROM in RAM) architecture further moves the OS from the hardware.

      And you're right -- with the entire bottom half of the OS being open source, anyone can wangle around with it and try and get it working on their hardware.

      --
      There should be a moratorium on the use of the apostrophe.
      Max V.
      NeXTMail/MIME Mail welcome
  28. Overall picture isn't quite that easy to see... by Anonymous Coward · · Score: 1, Interesting

    I'm a CE as well, and I absolutely loved the article. It's nice to have someone fight the technical battles for you to release minute details about their procs. Trying to get information like this is usually like pulling teeth from some companies.

    What I wanted to convey though, for people who may not deal on a hardware level with this stuff. Is that it is very hard to really get a good understanding for the whole processor. These projects are IMMENSE. Trying to keep track of millions of transistors, and lay them out, etc... is a nightmare. I know. So while it is good to talk about the higher level concepts of narrow Vs. Wide on a conceptual level that all falls away when you start looking at transistors. More than anything these projects are all about coordination. You can have a team of engineers working on a specific part that have NO IDEA what the other bits and pieces look like unless they have to interface with them. So just keep that in mind when we're judging these companies. I just think we lose sight of the massive scale of these projects sometime.

  29. Are there any *real* benchmarks out there? by autopr0n · · Score: 0

    Are there any real benchmarks comparing the p4/athlon/g4? (by real, I don't mean a set of 2 or 3 similar Photoshop filters). I hear a lot of people saying that the g4 is a superior performer, but I have a hard time believing a 800mhz chip is really faster then a 2ghz one.

    --
    autopr0n is like, down and stuff.
  30. Re:Who let this one out of his cage? by Buck2 · · Score: 1

    there isn't a single problem with A = A + B with B not equal to 0. Modulo arithmic for example

    What? You just defined 0 in a mathematical sense. What's your example?

    --

    As my father lik@(munch munch)... ....
  31. Re:4 is 4, right? by utdpenguin · · Score: 0

    FYI, the whole thing is meant as a joke :)
    After all, a Pnetium "4" is way less than a 286. 4 286 !! :)

    --
    In Soviet Russia you dant have to put up with these crappy jokes
  32. not pointless by overunderunderdone · · Score: 2, Insightful

    If you are a consumer who wanted a comparison to decide which kind of computer to buy, you are right the article was (mostly) useless.

    BUT, for the audience the article is intended for - geeks, technophiles, nerds & propeller heads it was not pointless at all. On this forum in particular there are a lot of people that use neither Windows nor MacOS but other operating sytems which run happily on either processor. Even if there is no *practical* point there is always sheer geek curiosity - alot of us find such articles entertaining.

    Might as well have Car and Driver running a comparison of a Jaguar S-type and a 10-ton dumptruck.

    I don't think that the difference between a P4 and a G4 is quite as wide as that - and they are being marketed by both sides as roughly equivalent products. Most techie people may know which is the "dumptruck" and which the "Jaguar" but it is still interesting to see a technical explanation of WHY and precisely HOW they are so different.

    1. Re:not pointless by Anonymous Coward · · Score: 0

      one gets work done, the other is a jaguar.

    2. Re:not pointless by Anonymous Coward · · Score: 0

      rofl. just about posted it myself.

    3. Re:not pointless by Anonymous Coward · · Score: 0

      10-ton dumptruck

      Actually, most end dumps hold from 12-15 tons of material.

  33. Be more specific by Pope · · Score: 1
    If I had a dual-800 G4 it would be more than fast enough for OSX, but it would still be too slow for compressing MPEG1 video

    What do you mean by "too slow?!"
    Do you mean in real time? If so, say it.
    Saying it's "too slow to compress MPEG1 video" is bollocks. Look at the recent iDVD2 demo on a dual 800, that's MPEG 2 which is harder to compress.

    --
    It doesn't mean much now, it's built for the future.
    1. Re:Be more specific by bill_mcgonigle · · Score: 2

      Yes, if you have to wait for a computer it's too slow. Realtime is too slow. If anything takes more than 10 seconds it's too slow.

      But MPEG2 implementations typically do *way* less compression than MPEG1. MPEG2's bitrate is typically at least twice as high as MPEG1, sometimes 10x more, and when you're doing discrete cosine transforms that greatly increased size along with motion vectors and variable bitrate encoding allow you to do alot less work during encoding. That's why it's so much faster. You also need alot more space to store the resultant file (DVD vs. CD) Just because 2>1 doesn't mean it's compressing more, it just means it came after.

      --
      My God, it's Full of Source!
      OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
  34. What a Great Read! by Anonymous Coward · · Score: 0

    Man, I've gotta pay more attention to that site. I remember learning about pipelines and such as a CompSci major. It's cool to actually put that part of my education to use. :)

  35. Re:Who let this one out of his cage? by smaughster · · Score: 1

    I'll bite (bit of professional math pride flaring up).
    First of all, I am interested which number would be defined at 0 since I assumed B to not be equal to 0 (unless you are counting module B of course).

    Furthermore, any mathematician who would accept a definition of 0 in a mathematical sense without a uniqueness clause should be fired.

    And if you still do not get it, think clock times, where 1 am + 24 hours equals 1 am.

    --
    I intend to live forever, so far so good.
  36. Re:Who let this one out of his cage? by jeffehobbs · · Score: 1

    Oh, I recognise humor.

    But this was not it.

    ~jeff

  37. Re:Who let this one out of his cage? by smaughster · · Score: 1

    To each his preferences.

    --
    I intend to live forever, so far so good.
  38. Re:4 is 4, right? by Anonymous Coward · · Score: 0
    (cheap shot returned) Okay, that would mean that C# (read as C++++) is twice as good as C++ :-)

  39. What Good is the G4? by Anonymous Coward · · Score: 0

    If it can't run windows XP?

  40. Depends what you use your computer for by _avs_007 · · Score: 1

    I have a bunch of boxes I use for dev/test purposes. My primary is a 2Ghz box, and I also have a 800Mhz P-III next to it. Both with 512mb of ram... My apps sure as heck compiles MUUUUUCH faster on the P4 then the P-III. When I run some of my framework apps, you should see the performance. I have a Celery 800 as well. That thing slows to a crawl when I crank up the number of clients, but the P4 continues to fly. The P-III, the animations will slow down a bit. The P4 runs full steam through it...

  41. me and my P166 MMX had a good laugh over that by dmnic · · Score: 1

    nevermind this post. . .

  42. Re:What Good is the G4? You're mistaken by alfredo · · Score: 2, Informative

    Yes XP will run on it using VPC. The question is why.

    On the G4 you can run OSX, OS9, XP and rootless XWindows all at the same time. The only problem is you have to reboot to run Linux. But then you can run the MacOS from within Linux.

    Flexibility of the Mac is one of its strong suits. Check out the different Gnu Darwin, Darwin, and Xon X sites. That is where the action is

    Yes I am running BSD, you still running Windows?

    --
    photosMy Photostream
  43. P4's & G4e's by Anonymous Coward · · Score: 0

    Imagine a Beowolf Cluster of THESE!!!

  44. Apples to apples by Esoteric+Moniker · · Score: 1

    It would be fair to compare the K6 to the Pentium 4 if the K6 was the best chip AMD had to offer as is the case with the G4 vs P4. If Motorolla had released the G5 before the P4 came out, a comparison of the G5 vs P3 would be fair because it would be the best thing each company had to offer.

    --

    man RTFM
    No manual entry for RTFM.
    1. Re:Apples to apples by horster · · Score: 1

      this reminds me of how Microsoft always tries to avoid judgement by saying, 'don't look here, everything will be fixed and more in the _next_ release!'

      come on, who are they [apple] kidding?

  45. Re:What Good is the G4? You're mistaken by Anonymous Coward · · Score: 0

    Actually I cant get XP to install under VPC (v. 4.0.2). It goes thru the first stage install, but then poops in the 2nd stage (in GUI mode) with some error regarding one of the 'assembly' files (XML linking to DLLs). I think maybe the disc image i downloaded is bad...

  46. Resources by rusti999 · · Score: 2, Informative

    A good general resource for this kind of advanced computer architecture is the book Computer Architecture by David Patterson and John Hennessy. It's quite dense. For the latest in processor architecture, the IEEE Micro magazine is useful.

  47. Re:slightly off topic but I'd like to know by Anonymous Coward · · Score: 0

    Just a quickie. This is the second time that the Power4 has come up on this thread.

    Just how much do the POWER and PowerPC lines have in common. I know the PowerPC was born out of the POWER line but have they now gone down completely different roads?

  48. Re:Who let this one out of his cage? by cc+bcc · · Score: 0, Offtopic

    And I live to intend forever. So far so good...

  49. Re:Who let this one out of his cage? by Anonymous Coward · · Score: 0

    YHBT. HAND.

  50. Re:slightly off topic but I'd like to know by Anonymous Coward · · Score: 0

    Quite a lot. Actually, from an application point of view the instruction set of the Power4 is excatly the 64 bit PPC instruction set. For a system programmer, there are a few differences in the MMU and some exception handling details, but nothing dramatic.

  51. Masking loop latency. by Christopher+Thomas · · Score: 2

    The two operand Intel architecture does not allow the fused multiply add, so that the latency of such an operation is the latency of a multiply plus the latency of an add (and the destination register has to be one of the operands, although the other operand can be in memory, saving you a load). There are plenty of practical algorithms which benefit greatly from the fused multiply-add, for example polynomial evaluations, matrix multiplications, etc, a feature pioneered by IBM in the RS6000 series and that Intel is using in Inanium.

    And people who claim that you can do loop unrolling to hide the latencies should check their math: with only 8 registers, there is no way to hide the latencies of a multiply plus an add on a P4, while it is almost trivial on a G4


    Actually, it turns out that you can still mask the loop latency with a limited register set.

    First, you can use "software pipelining" to mask quite a bit of the loop latency without having to unroll (it's a clever reshuffling of the loop instructions; for brevity, I won't describe it here). This requires one extra FP register over the straightforward implementation of an x86 dot-product loop (four instead of three, because I can no longer re-use scratch registers between steps).

    Second, branch prediction will to a limited extent perform unrolling for you. While the architectural register file has only 8 registers, there are many more internal registers on the chip. Register renaming allows the processor to run several iterations of the loop in parallel without having to worry about namespace conflicts (though true dependencies remain intact). This works as long as the total number of iterations being unrolled fits within the scheduler's window (usually 8-16 instructions; I don't know how big the P4's window is).

    In summary, for something as straightforward as a dot product, it's certainly possible to write x86 code that will avoid the penalty of having separate add and multiply instructions.

    [You'll really be bound by the memory subsystem for both chips, but that's moot point for this discussion.]

  52. OT: Potential Immortals by SteveM · · Score: 1

    From one potential immortal to another, I'm not dead yet!

    Steve M

  53. Re:Who let this one out of his cage? by t · · Score: 1
    A = infinity

    nuff said.

    t.

  54. Re:slightly off topic but I'd like to know by t · · Score: 1
    The Power line is pretty much irrelevant since the cpus cost in the many thousands of bucks. I seem to badly remember that they can be priced as high as 30k.

    A google later, we have: IBM is launching its lowest-priced system, featuring eight processors, at $450,000. ... he new IBM servers have a Power4 processor that contains two processors, a system switch, a large memory and input/output technology -- a design that enables the server to conserve energy and outperform servers that have twice as many processors, IBM said.

    Note that this is the lowest-priced system.

    t.

  55. Re:technically intense.. Z notation by Anonymous Coward · · Score: 0

    A = A + B

    With a MCSE, you don't know what this stands for ?
    x = x + y ?
    x += y ?

    In Z notation,
    let say you have a SoftEnd or CompEng degree,
    you would write this as:

    A' = A + B

    To indicate that A' is the new value of A.

    Wow that's difficult!

    And you got a job on top of that!
    What's the email of your employer?
    You should get fired!

  56. Re:Who let this one out of his cage? by magicslax · · Score: 0

    infinity is not a number, but we should know that shouldn't we. i need my karma back.

  57. Very, very Off Topic by ffatTony · · Score: 3, Informative

    Thats a very amusing comic, much more so than UF. Any idea why its never mentioned on SD?

  58. Re:Who let this one out of his cage? by Anonymous Coward · · Score: 0

    Well any professional mathematician who talks about an alternate number system (not real) without making that clear should be fired.

    A = A + B
    B = 0
    because by not specifying the number system you just have to assume real. The previous poster was not talking about defining a number system but the validity of an equation in some already defined system (presumably real).

    I didn't think mathematicians would have the childish ego of most computer nerds. Then again you probably aren't a mathematician.

    Have a nice day.

  59. Re:Where is Jamie McCarthy by Anonymous Coward · · Score: 0

    Nope...hiatus. He's starting SAMBLA.

  60. Re:What Good is the G4? You're mistaken by Thaidog · · Score: 0

    Does it run OS X? That answer is no.

    --

    ||| I still can't believe Parkay's not butter.