Slashdot Mirror


Inside the PowerPC 970

daveschroeder writes "Jon "Hannibal" Stokes has posted a long-awaited, very detailed analysis of the IBM PowerPC 970 at Ars Technica. Notable quote: 'The 970 was made for Apple'."

163 comments

  1. DUPE by wang33 · · Score: 4, Informative
    --
    PAGERANK++ Robsell.com
    1. Re:DUPE by class_A · · Score: 0, Offtopic

      How the fuck is that off topic?

    2. Re:DUPE by class_A · · Score: 1

      Oh great! Now I'm off-topic :-)

  2. deja vu by groundpig · · Score: 5, Funny

    Its like comparing apples with apples... Its a dupe.

    1. Re:deja vu by Jason1729 · · Score: 0, Offtopic

      Apples to Apples is a great game

  3. Dupe? by krisp · · Score: 5, Funny

    It's ok, the last post didn't get enough comments. Please continue discussion.

  4. One long read... by Thaidog · · Score: 1, Interesting

    Interesting idea to say that the vector units were "hacked" onto the power arcitechture... and this being the reason therefore this chip is designed for apple...

    --

    ||| I still can't believe Parkay's not butter.

    1. Re:One long read... by batboy78 · · Score: 1, Interesting

      I'm more concerned about the release of the PPC 980 the mobile edition of the 970 in a nice 15 inch Al Powerbook. If you are into the rumor mill your should check out MacBidouille's website. He has some speculations about speed and performance against the current line of P4 processors.

    2. Re:One long read... by Thaidog · · Score: 1
      Powerpc chips have always been excellent mobile chips thanks to the little heat the produce


      btw poeple, how is my 1st post a troll?

      --

      ||| I still can't believe Parkay's not butter.

  5. What the heck is 'Altivec' anyway? by Anonymous Coward · · Score: 0

    and where would I want to use it?
    How is it unlike normal CPU opcode processing?

    1. Re:What the heck is 'Altivec' anyway? by Thaidog · · Score: 1, Informative

      altivec units are special 128bit registers that can be used for many different optimizations.

      --

      ||| I still can't believe Parkay's not butter.

    2. Re:What the heck is 'Altivec' anyway? by Anonymous Coward · · Score: 1

      altivec units are special 128bit registers that can be used for many different optimizations.

      for instance?

    3. Re:What the heck is 'Altivec' anyway? by binaryDigit · · Score: 5, Informative

      Floating point ops, optimized for graphics processing and things like compression (jpeg, mpeg, mp3). If you read the Ars article he waxes on about it's superiority over MMX/SSE/SSE2.

    4. Re:What the heck is 'Altivec' anyway? by Mikey-San · · Score: 4, Informative

      Also: "Water is special stuff that makes stuff float."

      "The CPU does important stuff."

      For all of your "What is AltiVec?" needs, check this out instead:

      http://www.motorola.com/SPS/PowerPC/AltiVec/

      --
      Mikey-San
      Karma: +Eleventy billion (mostly affected by watching Celebrity Jeopardy)
    5. Re:What the heck is 'Altivec' anyway? by Anonymous Coward · · Score: 1, Informative

      It's a really fast verctor processing unit. It can do floating point manipulations blazingly quickly.

    6. Re:What the heck is 'Altivec' anyway? by Anonymous Coward · · Score: 0
      Altevec? It is a screwy name for SIMD - Single Instruction, Multiple Data.

      AMD calls its SIMD "3DNow!" and Intel calls its SIMD by the name "MMX". Currently AMD has the fastest commodity SIMD implementation.

      SIMD is used for mathematical operations of taking sums of products and such, which are extensively used in various DSP software (Fourier transforms, wavelets, etc.).

    7. Re:What the heck is 'Altivec' anyway? by Thaidog · · Score: 1

      And why is the blue? Do I really have the floor to post a 50 page post on what is altivec?

      --

      ||| I still can't believe Parkay's not butter.

    8. Re:What the heck is 'Altivec' anyway? by gerardrj · · Score: 4, Informative

      Altivec is Motorola's name for the vector processing unit. The unit handles SIMD commands. SIMD stands for Single Instruction, Multiple Data. Basically, intead of looping through a list of 50,000 values one by one and multiplying each value by PI for instance, you simply tell the CPU where the list is, and to multiply it by PI.

      In a much simplified analogy, it's like lighting 200 candles with a flame thrower instead of one by one with a match.

      --
      Article X: The powers not delegated... by the Constitution...are reserved...to the people
    9. Re:What the heck is 'Altivec' anyway? by Alan+Partridge · · Score: 2, Informative

      "Currently AMD has the fastest commodity SIMD implementation"

      You've not been looking at the distributed.net results, have you? The Altivec/VMX technology currently used by Moto and soon to be used by IBM is LEAGUES ahead.

      --
      That was classic intercourse!
    10. Re:What the heck is 'Altivec' anyway? by jorjun · · Score: 2, Funny

      I iz not very technical but. I think it means faster without a noisy fan or a burnt lap.

    11. Re:What the heck is 'Altivec' anyway? by Anonymous Coward · · Score: 0

      The operative word is commodity -- as in go down to Best Buy today and pick one up.
      AMD is delivering fast SIMD today, not next year.

    12. Re:What the heck is 'Altivec' anyway? by Anonymous Coward · · Score: 0
      Of all of the "big stores" you could have chosen...

      • Circuit City... doesn't sell Macs ---> doesn't sell PowerPCs
      • Wal*Mart... doesn't sell Macs ---> doesn't sell PowerPCs
      • KMart... doesn't sell Macs ---> doesn't sell PowerPCs
      • Office Max... doesn't sell Macs ---> doesn't sell PowerPCs
      • Office Depot... doesn't sell Macs ---> doesn't sell PowerPCs
      • Staples... doesn't sell Macs ---> doesn't sell PowerPCs
      • Best Buy... DOES sell Macs ---> DOES sell PowerPCs with Motorola Altvec implementations
      Better luck next time!
    13. Re:What the heck is 'Altivec' anyway? by Alan+Partridge · · Score: 3, Informative

      "AMD is delivering fast SIMD today, not next year"

      What ARE you blathering about? Pentium 4 has SSE2, PowerPC has Altivec - here's a clue for you, when people code for x86 SIMD, they choose MMX, SSE and SSE2, they don't choose 3D Now!, when people code for SIMD under PowerPc ISA, they choose Altivec. Both SSE2 and Altivec are available to day, both are used in "commodity" CPU families. I think you'll find that it's "x87" FPu strength that typically marks out AMD's current CPUs, not their patchy implementation of SSE2.

      --
      That was classic intercourse!
    14. Re:What the heck is 'Altivec' anyway? by ZigMonty · · Score: 3, Informative

      Basically, intead of looping through a list of 50,000 values one by one and multiplying each value by PI for instance, you simply tell the CPU where the list is, and to multiply it by PI.

      Well, not really, but you're close. You can't just pass the Altivec unit an array of numbers and tell it to do some operation on them. Altivec (and MMX, etc) simply allows you to process the data in bigger chunks that normal.

      Altivec can process 128 bits of data at a time. For example, it can add 16 8-bit integers to another 16 8-bit integers, resulting in yet another vector of 16 8-bit integers with a single instruction, rather than doing them one at a time.

    15. Re:What the heck is 'Altivec' anyway? by Mikey-San · · Score: 2, Funny



      The hue of the sky is determined by a phenomenon known as the "Tyndall Effect", the scattering of light through a colloid by dust or molecules suspended in a transparent medium.

      Note that the light scattering that determines what color you see isn't due to dust in the air, as some think, but rather oxygen and nitrogen molecules.

      However, all we are, as Bill and Ted once pointed out, dust in the wind, dude.

      </t-i-c> ;-)

      --
      Mikey-San
      Karma: +Eleventy billion (mostly affected by watching Celebrity Jeopardy)
    16. Re:What the heck is 'Altivec' anyway? by coolmacdude · · Score: 1

      You've not been looking at the distributed.net results, have you?

      With RC5-64 that was true. Unfortunately for RC5-72, no one has written an optimized Mac core yet so the PC versions are way faster.

      --

      -You may license this sig for only $6.99.
    17. Re:What the heck is 'Altivec' anyway? by cayenne8 · · Score: 1
      Oops...forgot CompUSA. They sell Mac's too...

      :-)

      --
      Light travels faster than sound. This is why some people appear bright until you hear them speak.........
    18. Re:What the heck is 'Altivec' anyway? by Lars+T. · · Score: 1

      Odd. All the Mac scores must be cheated then.

      --

      Lars T.

      To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

    19. Re:What the heck is 'Altivec' anyway? by Alan+Partridge · · Score: 1

      If you use the Mac beta client (final release candidate, actually) you get a proper, Altivec optimised cruncher. Just as fast at RC5-72 as it was at 64.

      --
      That was classic intercourse!
    20. Re:What the heck is 'Altivec' anyway? by coolmacdude · · Score: 1

      My bad. I hadn't checked it in a while. Looks like they fixed that. Horrible Mac performance was a problem when RC5-72 first came out and that was the reason the tech support gave me.

      --

      -You may license this sig for only $6.99.
  6. In the market for a 64-bit workstation? by idiotnot · · Score: 5, Insightful

    Fast forward a few months....hmm...a few options:

    Sun: Nice hardware, very expensive, CDE.
    AMD: Commodity hardware, cheap, WinXP.
    HP: Intel hardware, very expensive, CDE or WinXP.
    I think I know what I'd buy.

    Of course, the Athlon64/Opteron would get quite a bit of consideration due to my hobbies.

    But I think it'd end up being the Mac.

    1. Re:In the market for a 64-bit workstation? by Bold+Marauder · · Score: 4, Insightful

      AMD, NetBSD in place of XP? ;)

    2. Re:In the market for a 64-bit workstation? by Anonymous Coward · · Score: 3, Informative
      Sun: Nice hardware, very expensive, CDE.

      http://wwws.sun.com/software/star/gnome/. Also, you take a shot at CDE rather than Solaris? Wow.

      AMD: Commodity hardware, cheap, WinXP.
      HP: Intel hardware, very expensive, CDE or WinXP.

      Earth-to-poster: Linux runs in 64-bit. Thank you.
    3. Re:In the market for a 64-bit workstation? by Mooncaller · · Score: 5, Interesting

      I did not have much trouble getting GNOME working on a HP B180 with HP-UX 10.20 ( compiled with acc, HP-UXs standard compiler). BTW that is a 180MHz PA-RISC machine. It kicked a 1GHz Pentium based workstations butt, even after I put Gentoo on the Intel box ( it original had only Windows NT, shudder). Fast clock rates can't compensate for a moronic architecture in the hands of heavily multitasking users like me.

    4. Re:In the market for a 64-bit workstation? by pmz · · Score: 1

      Also, you take a shot at CDE rather than Solaris? Wow.

      Well the original post was fair by saying "Nice hardware," and, really, CDE is still the default offering in Solaris 9. Sun's GNOME 2 is very promising and looks great, but they are still refining it until it is worthy of being the default desktop.

      Sun's workstations really are great machines, in spite of what SPEC zealots say. For example, few people mention that typical PC graphics cards look like crap relative to even the elderly Creator 3D (translation: the cost differential between really good PCs and Suns is generally smaller than PC zealots will admit).

      Regardless, Sun had better keep their heads up, because Opteron-based workstations are a genuine threat to the Sun Blade line. Opteron+a gig of RAM+SCSI+a really good OpenGL card=a compelling workstation.

    5. Re:In the market for a 64-bit workstation? by Anonymous Coward · · Score: 0

      "Sun: Nice hardware, very expensive, CDE.
      AMD: Commodity hardware, cheap, WinXP.
      HP: Intel hardware, very expensive, CDE or WinXP.
      I think I know what I'd buy."

      You appear to be mildly confused: CDE is a windowing manager, WinXP is an operating system...they're not equivalent.

      In any case, Solaris (an OS) is moving from CDE to Gnome in the next release or so. And Sun workstations start around $1K, which ain't all that expensive.

  7. It's a dupe by PD · · Score: 1, Redundant

    And I even e-mailed daddypants to notify him that it was a dupe.

    1. Re:It's a dupe by Flounder · · Score: 1
      Same here. Nice to know the on-call editor responds the e-mail.

      On-call doesn't mean "I'm at The Matrix Reloaded and can check e-mail when I get back." And if it did, I certainly wouldn't be here at work right now.

      --

      No boom today. Boom tomorrow. There's always a boom tomorrow. - Cmdr. Susan Ivanova

    2. Re:It's a dupe by descentr · · Score: 1

      Has anyone seen the listed on-duty editor actually CHANGE? Every time I can remember seeing it it's always daddypants, does this person sleep? If so, this must be nap time.

    3. Re:It's a dupe by Anonymous Coward · · Score: 0

      Do what I do. Block the ads and Double Click cookies (yes, that's right, kids, Slashdot uses Double Click to track you!), post trolls, and load each story from the past week several times.

      It puts a hell of a load on the database servers, uses up their bandwidth, and annoys the paying users.

      Anarchy in the internet!

      Or something.

  8. There has been a change ... by binaryDigit · · Score: 4, Funny

    ... in The Matrix. That strange feeling of deja vu can only mean one thing! Either that or the /. editors are asleep at the wheel again.

  9. I've never done this before but... by bdaehlie · · Score: 0, Redundant

    I've never been one of the people that scream when a dupe is posted (the other is still on the main page!), but this is frickin' ridiculous. It happens SO OFTEN. OSDN should dock Taco and them a week of pay every time they post a dupe, two weeks if the other story is still on the main page. I realize it isn't 'pro' news source like NYT or CNN but there are 7th graders geeks that could avoid posting the same story twice in 24 hours...

    1. Re:I've never done this before but... by Anonymous Coward · · Score: 0

      There are also 7th graders who can spell and edit better than these guys. It's really embarrassing.

    2. Re:I've never done this before but... by 0rbit4l · · Score: 1
      OSDN should dock Taco and them a week of pay every time they post a dupe...

      These guys get paid? I always assumed it was a run-from-my-mom's-basement operation - what else could explain the poor spelling, frequent dupes (god help you if it's +-5 days from April Fool's), and the ho'ing out of the site to Microsoft advertisements?

  10. Is this the G5? by localghost · · Score: 1

    I understand that a while ago there was some competition between IBM and Motorola about whose chip would be the G5. Was Motorola ever a serious contender, and if so, has Apple decided on IBM? I haven't heard much about Motorola for some time.

    1. Re:Is this the G5? by binaryDigit · · Score: 3, Informative

      I understand that a while ago there was some competition between IBM and Motorola about whose chip would be the G5. Was Motorola ever a serious contender, and if so, has Apple decided on IBM? I haven't heard much about Motorola for some time.

      Mot actually had a G5 on the roadmap. They apparently got all the way to samples, but then ditched the effort. There never was a competition per se wrt the G5 name. There was a bit of friction over AltiVec, as IBM wanted to focus on clock speed and didn't think AV was worth the complexity (and hence why Mot came out with the G4 while IBM stuck with the G3). Motorola hasn't been serious about the mainstream cpu market for a while as they've been losing money on it. They'd rather focus on things like embedded proccies and cell phones (and related chips).

      I don't know which came first, Mot ditching G5 so Apple pleads with IBM to come out with 970. Or Mot gets whiff of 970, so sees a way out of doing G5. Perhaps others more "in the know" can chime in?

    2. Re:Is this the G5? by Anonymous Coward · · Score: 0

      Yes.

    3. Re:Is this the G5? by Duck_Taffy · · Score: 3, Informative

      Officially, the PowerPC G5 is the Motorola PowerPC 8500 chip. So this would not be it. Apple may or may not call a computer that features IBM's PowerPC 970 the PowerMac G5 or PowerBook G5, but it wouldn't be the actual G5 chip. Although I don't think this chip officially has a G* name, I'd be more inclined to designate it the G6, since the G5 was actually a 32-bit chip.

      --
      Karma: Ran over your dogma.
    4. Re:Is this the G5? by Anonymous Coward · · Score: 0

      There was a story some months ago on macrumors.com that Apple was about to sue Moto for failure to deliver the G5. If true, that, and the fact that Apple has been languishing in the speed stakes for a while, would suggest that the 970 is Apple's "second-choice" strategy.
      Some people at macrumors stated that this is all Moto's revenge for Apple cancelling the cloning programme.

    5. Re:Is this the G5? by afidel · · Score: 1

      I think people look at the wrong area when they finger Motorolla about PowerPC, they always point out that Apple is a small customer and that the R&D is expensive etc. Well as you stated they had the G5 mostly completed when they shelved it. I think it has to do more with the freaking rediculous price of modern fabs. Fabs are a HUGE capital cost for a bleeding edge facility, and other than Apple Motorolla doesn't have a huge need for one. IBM on the other hand needs up to date fabs for their own product lines and so adding additional manufacturing capacity to build parts for Apple isn't such a huge expenditure. Even so the PPC970 will debut at 130nm even though IBM will probably have at least some 90nm capacity at that point, but it will be dedicated to their own needs, then after they have yields up and can spare some capacity for Apple they will transition the PPC970 to the 90nm process.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    6. Re:Is this the G5? by Alan+Partridge · · Score: 2, Insightful

      well, G5 was written on a roadmap I once saw describing the Motorola MPC 85xx series - but seeing as how that series has been on sale for about a year now with no sign of a CPU variant suitable for Apple to use, I guess we can forget about the MPC 85xx being used in PowerMacs. Personally, I'd like Apple to adopt the moniker "G64" for their PPC 970 powered machines - that'd stick it to Intel alright, and the idiotic warez kids would stop comparing clock speed and start comparing word length instead of getting on with their lives.

      --
      That was classic intercourse!
    7. Re:Is this the G5? by MachineShedFred · · Score: 1

      Officially, the G3 and G4 brands are trademarks of Apple. Therefore, the G5 could be a Xeon MP if Apple wanted it to be. I'm glad they don't.

      When Motorola and IBM hear the term "G3", they think PPC75x. When MOT hears the term "G4" they think PPC 74xx / 75xx. Neither of these companies use the G3 / G4 terminology, because that refers to the entire architecture from Apple.

      --
      Slashdot still doesnâ(TM)t support Unicode after it was added to the HTML standard in 1997.
    8. Re:Is this the G5? by pmz · · Score: 1

      Mot actually had a G5 on the roadmap.

      Wow, ditching a whole CPU has got to hurt. At least Motorola, however, knows how to steer around the iceburg to focus on their core business.

    9. Re:Is this the G5? by Tombstone-f · · Score: 1

      Intel can also sell PPC 970 Linux workstations to company's using Power4 servers (or anyone for that matter). Tacking on Altivec would add a second market for the chip, without costing them too much more.

    10. Re:Is this the G5? by Duck_Taffy · · Score: 1

      Except that I've seen a Motorola roadmap that designates the MPC85xx series of processors as the "G5" family.

      --
      Karma: Ran over your dogma.
    11. Re:Is this the G5? by gig · · Score: 1

      Motorola's G5 is smaller and lower-power than it was originally going to be. IBM has basically taken over making Mac CPU's over the past few years. They make G3's and G4's and the PowerPC 970 is going to be Apple's G5.

    12. Re:Is this the G5? by gig · · Score: 1

      Motorola uses "G4" and "G5", but Apple doesn't have to mimic that with its own "G4" and "G5". This is the Mac equivalent to "P3", and "P4" ... as a user you know that you are moving up a generation.

      The Motorola G5 is a much smaller and lower-power chip than it was originally going to be. As a result, Apple obviously asked IBM to make a next-generation PowerPC to be the heart of their G5 computers.

      The G designations are just code-names, or marketing names. The Power Mac G4 has had a number of different CPU models, same with the PowerBook G4. The full model numbers (7500, etc) are only of interest to hardware geeks.

  11. Inaccuracy, Part I by Anonymous Coward · · Score: 4, Informative
    Unfortunately, the vector performance of the G4e has been consistently bottlenecked by Apple's lackluster motherboard and chipset designs--specifically the anemic frontside bus and memory subsystems that Apple has saddled the PowerMac line with.
    This implies that the decision of how much bus bandwidth to give the G4e was up to Apple - which it was not. Motorola designed the processor (for Cisco, depending on who you believe), and Apple made do with the anemic MaxBus at 133mhz that they got from Motorola.

    Apple'd be putting DDR400 on the G4 right now if they could. None of this (well, except the decision to go Moto) was their fault.

    My real problem with the current G4e situation, aside from the 167 SDR FSB, is the fact that it's a shared bus topology, which is just ridiculous. To my knowledge, there's nothing stopping Apple from putting out a chipset that gives each G4e a dedicated FSB (even if it's still 167MHz SDR) to the chipset.

    As far as the low MHz and SDR situation, I've also never been totally convinced that Apple wasn't partially to blame for this either, unless they just have zero clout with Moto SPS.
    1. Re:Inaccuracy, Part I by PetWolverine · · Score: 2, Interesting

      Whoa! A duplicate article, this I've seen before. But this is nuts!

      --
      I found the meaning of life the other day, but I had write-only access.
    2. Re:Inaccuracy, Part I by Mooncaller · · Score: 1

      Not SPS, its MPS ( I think thats the correct acronym for the organization that handels microprocessors, its been awhile). And yes Apple has little if no clout with SPS, or at least they did'nt. Things might have changed. Apple is a very very flacky customer. Most of the time, that I was involved with them, they did not know what they wanted. I personaly had little respect for their engineering. They used to be incredibly myoptic and narrowly focused. This does not work well when they keep changing their minds. They also tended to come up with very specific solutions that had limited applicability, eshewing more general solutions that would have gotten them farther down the road. But their worst characteristic was allowing preconcived attitudes to take presidence over engineering realities. That is they would refuse to use technology that did not like even if it could be proved to be the best solution.

  12. Isn't it obvious... by GrodinTierce · · Score: 5, Funny

    I believe that Hannibal mentions that the 970 is designed for SMP.. Clearly CmdrTaco is just testing its newest feature: you click post and the operation gets carried out by both processors.

    Tierce

    --


    Tierce
    Who sponsors your feelings?
  13. Dual FPUs! by Anonymous Coward · · Score: 4, Insightful

    Reading through the article, its nice to see some real design going into a processor. Looking through Intel's last few chips, they've been upping ther clock speed and packing in more cache.

    Yeah, yeah, they are hog-tied because you can't easily re-compile the entire windows platform to use new instruction sets. Linux users, of course, don't have this problem (muhahahah).

    Did anyone else catch the bit on the twin FPU's? I'm just imagining what this thing is going to do with vector operations and frequency transforms.

    For most of you non-engineers:

    -Most 3d vector operations are affine tranformations. Using a 4x4 array of floating point numbers you can translate, rotate, and scale. Works beautifully, but it's a lot of calculations.

    -The Fast Fourier Transform (FFT) is used a lot in signal processing. It's a floating point monster.

    1. Re:Dual FPUs! by pmz · · Score: 3, Funny

      Did anyone else catch the bit on the twin FPU's?

      Yeah, but they only work when the aftermarket mini-turbochargers are attached and a fiberglass spoiler is added to the heatsink. The resulting turbo lag adds latency that really defeats any advantage of the second FPU. It's really too bad, because the 970 could have pushed Photoshop easily into the 12s.

    2. Re:Dual FPUs! by Drakonian · · Score: 1

      Is this the new way to deal with duplicate articles? Repost highly moderated comments from the last article. Very interesting. At least it was two ACs who posted these two comments. If it was a reg user I'd hope they get torched.

      --
      Random is the New Order.
  14. Re:POWERpc? by Anonymous Coward · · Score: 0

    No, it's not really ironic.

    Also, it's Ford Motor Company, not Ford Motor Works.

  15. drop AltiVec by g4dget · · Score: 2, Insightful
    If the 970 were solely intended as a Linux desktop platform for IBM, they would've preferred to reduce the 970's die size, power consumption, time-to-market, etc. by just leaving out the Altivec unit altogether, instead of shoehorning it into the design the way they did.

    This is probably true and rather unfortunate. AltiVec is important for Apple marketing because it lets them claim impressive performance figures without actually needing to push the state of the art in terms of processor design further than Intel. It's also important for a few special-purpose applications (PhotoShop filters, etc.).

    But the reality of regular high-end computing is that people don't have the time to optimize their software for the latest oddball hardware platform. And even something like a hand-coded vectorized BLAS library doesn't help because most scientific software still doesn't use such libraries.

    I think this tradeoff doesn't even work well for Apple. Imagine how much better it would be if Apple could ship systems based on the 970 today, rather than after a few months additional delay due to AltiVec. And every dollar and watt that is shaved off the AltiVec price makes it a much more viable processor for servers and blades, which would get volume up and prices down. Gimmicks like AltiVec cost much more than they are worth, even for Apple.

    1. Re:drop AltiVec by Anonymous Coward · · Score: 2, Insightful

      A few problems:
      (A) SIMD is really bloody fast if you use it. And Apple does. Heavily. Would you want to rewrite OSX, significantly slowing it down, to create an altivecless version?

      (B) Apple has gone through two major transtions: 68k->PPC, Mac OS Kernel->BSD kernel. Another rewrite requiring transition is possible, but over something this small? That seems unlikely. And I bet users would be THRILLED when some apps just stop working.

      (C) The other option is to just crack those 128 bit instructions down, just like everything else. But if you're gonna make the chip bigger and uglier to do this, why not just add altivec? The only argument I can think of is that this would get rid of the altivec's extra long pipeline and possibly allow lower clocks/operation for some things.

    2. Re:drop AltiVec by Daleks · · Score: 4, Interesting

      But the reality of regular high-end computing is that people don't have the time to optimize their software for the latest oddball hardware platform. And even something like a hand-coded vectorized BLAS library doesn't help because most scientific software still doesn't use such libraries.

      ATLAS is a BLAS implementation that is tuned for each system that it runs on. The people at Mathworks use this as the underlying BLAS system in Matlab. Mathematica Maple, etc. use this as well. There is even a G4/AltiVec optimized version available here. This is the whole point of layered software.

    3. Re:drop AltiVec by Wyatt+Earp · · Score: 5, Interesting

      AltiVec is nice for somethings.

      My iTunes ripping of mp3s nearly tripled when I went from a 466 MHz G3 to a 400 MHz G4 due to iTunes being optimized for AltiVec.

      Some Photoshop actions and filters see up to 800% improvments.

      Running iMovie exports on a 600 MHz G3 iMac take 2-300% longer than on a 400 or 500 MHz G4

    4. Re:drop AltiVec by Textbook+Error · · Score: 5, Informative

      AltiVec is important for Apple marketing because it lets them claim impressive performance figures without actually needing to push the state of the art in terms of processor design further than Intel.

      No, AltiVec is important for Apple full stop - in the short term to make up for the anemic bus speeds allowed by the G4, and in the longer term because a SIMD unit is now as expected a component of modern desktop CPUs as an FPU is.

      And even something like a hand-coded vectorized BLAS library doesn't help because most scientific software still doesn't use such libraries

      The only thing you can really sure about "most" scientific software is that it needs an FPU. Scientists and engineers do a huge variety of simulations, some of which are vectorizable and some of which aren't.

      If AltiVec has a weakness in the scientific field, it's the lack of support for double precision. And there's nothing in the instruction set which precludes this, so I wouldn't be surprised to see it appear in some future CPU.

      Imagine how much better it would be if Apple could ship systems based on the 970 today, rather than after a few months additional delay due to AltiVec.

      If it didn't have AltiVec, it wouldn't be what Apple needs in a desktop CPU - not much point in getting what you don't need a few months early (not like that would happen anyway: this isn't lego: you can't unplug "the AltiVec bits" without any impact on the rest of the design).

      And every dollar and watt that is shaved off the AltiVec price makes it a much more viable processor for servers and blades, which would get volume up and prices down.

      Except that Apple aren't currently in the blade market at all, and have a fairly small presence in the more general server market. If they can sell a few boxes there, fine, but getting the volume up means targetting consumers - not server farms.

      --

      Nae bother
    5. Re:drop AltiVec by GlassHeart · · Score: 4, Interesting
      AltiVec is important for Apple marketing because it lets them claim impressive performance figures without actually needing to push the state of the art in terms of processor design

      Don't confuse "new" with "state of the art". The former is just something that hasn't been done before. The latter is something that yields "impressive performance figures". If Altivec is competitive with Intel, then it is state of the art, by definition, even if it's 20 years old. The CPU cache is a decades old concept, yet CPUs with caches are still state of the art.

      Imagine how much better it would be if Apple could ship systems based on the 970 today, rather than after a few months additional delay due to AltiVec.

      Don't underestimate the cost of software. Your idea is expensive, because it requires software vendors to maintain two different versions of their code. This can lead to buggier or more expensive products, or it can lead to the "abandonment" of the G4 installed base. That could easily be worth the few months for Apple.

    6. Re:drop AltiVec by jovlinger · · Score: 2, Insightful
      Don't forget the oh-so-cool Fastest FFT In The West.

      IIRC, it will partically evaluate the code against the known size of the input, and I think also do some data-driven special-casing.

      Basically, it beats the pants off standard-library FFTs.

      While I'm at it, responding to grandparent:
      most scientific software still doesn't use such libraries

      Would you care to elaborate? I mean, if you're not writing against known ultra-optimized libraries, what business do you have expecting your software to run fast? That's like compiling with gcc instead of intel's compiler. I would expect that most scientists DO care enough to use the fastest libraries at hand, and put some effort into identifying bottlenecks.

      Or perhaps you were implying that most scientific software is too esoteric to benefit from fast linear-algebra? From what I recall from my physics courses, it was pretty much ALL linear algebra: vectors/ matrices/ determinants / eigen values.

      In summarium, this position confuses me.
    7. Re:drop AltiVec by Apotsy · · Score: 4, Insightful

      You'd be surprised how much stuff in Mac OS X is AltiVec optimized. Even memcpy gets a boost from it. It's a lot more than just a "gimmick".

    8. Re:drop AltiVec by g4dget · · Score: 0, Troll
      Would you care to elaborate? I mean, if you're not writing against known ultra-optimized libraries, what business do you have expecting your software to run fast?

      It's a simple cost/benefit analysis. I can spend a week shoe-horning something into the BLAS interface and possibly create obscure bugs in the process, or I can just write a reasonably efficient high-level loop and live with whatever the compiler gives me.

      I would expect that most scientists DO care enough to use the fastest libraries at hand,

      Scientists get paid to do science, not to hand-optimize inner loops or link to cumbersome libraries. If a regular compiler can't compile regular loops into efficient code for a processor, then that's a problem with the processor, not with the scientists using the compiler. The scientists may still decide to eat the dogfood, but that doesn't make it any better.

      In any case, in this case, people can have their cake and eat it, too. For the price of a single 1GHz PPC Macintosh ($1500) or dual 1.25GHz PPC Macintosh ($2000), people can get two 2.4GHz Pentium4 machines, which gives them more compute power even using plain gcc than hand-optimized AltiVec code on the Macintosh. And, frankly, the 970 doesn't look like it's going to change that ratio whenever it may actually come out.

    9. Re:drop AltiVec by afidel · · Score: 3, Informative

      Actually every single application that is actually pushing the PC at this point benifits from vector units. Lets look at what is somewhat strenuous for a modern PC, 3D rendering:yep, media encoding and transcoding:yep, audio processing and creation:yep, 3D gaming:yep, etc. Basically all of the applications that will push a PC are things that process large chunks of data that can be worked on efficiently by a vector processing unit. This isn;t a server processor, it's a PC processor. If you want a server processor get a Power4 or Power5 with the huge cache and multicore chips that are designed for that market.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    10. Re:drop AltiVec by Lars+Arvestad · · Score: 1
      I do science and I have benefitted from optimized BLAS libraries (on x86 chips as it happens) both when using a program for semidefinite programming and running Octave scripts. A colleague of mine gave a talk just yesterday, mentioning how easy it had been setting something up using Perl and PDL, which apparently is built ontop of BLAS. I think the GSL is also using BLAS if it is available.

      You simply do not necessarily need to work directly with BLAS (and LAPACK for that matter) to enjoy the fruits of excellent optimization!

      --
      Reality or nothing.
    11. Re:drop AltiVec by g4dget · · Score: 0, Troll
      You simply do not necessarily need to work directly with BLAS (and LAPACK for that matter) to enjoy the fruits of excellent optimization!

      Let's say you own a home worth $500000. Would you pay another $500000 to own a vacation home in Aspen that you use two weeks out of the year? Aspen is a really nice place to stay, after all, and it's so much more pleasant to stay at a place one owns. Or would it make more sense just to rent a place for the two weeks, and maybe even go to different places to vacation every year?

      Just because something is nice and just because it is high quality doesn't mean that it is worth paying a premium for it.

    12. Re:drop AltiVec by Ikari+Gendo · · Score: 2, Insightful
      SIMD is really bloody fast if you use it. And Apple does. Heavily. Would you want to rewrite OSX, significantly slowing it down, to create an altivecless version? ... Apple has gone through two major transtions: 68k->PPC, Mac OS Kernel->BSD kernel. Another rewrite requiring transition is possible, but over something this small? That seems unlikely. And I bet users would be THRILLED when some apps just stop working.

      Er ... OS X doesn't need to be rewritten. It runs on Altivec-less G3s, and probably will continue to do so for a very long time.

    13. Re:drop AltiVec by cygnus · · Score: 2, Informative
      AltiVec is nice for somethings.
      that's not just AltiVec accounting for that increase. the G3 uses the old 60X bus designed for the original PPC chips. the G4 uses the MaxBus, which offers streaming from RAM. the G3 burns up a lot of it's bus repeatedly asking for the next block of whatever from memory, while the G4 can say "fetch me these next few blocks." that makes a huge difference on a slow bus.
      --
      Just raise the taxes on crack.
    14. Re:drop AltiVec by drunkenbatman · · Score: 1

      (C) The other option is to just crack those 128 bit instructions down, just like everything else. But if you're gonna make the chip bigger and uglier to do this, why not just add altivec? The only argument I can think of is that this would get rid of the altivec's extra long pipeline and possibly allow lower clocks/operation for some things.

      Apple (and most other people benefitting from altievec aren't exactly using it for 128bit or 64bit instructions most of the time... more as an SIMD engine (simple instruction, multiple data). Altivec doesn't have double precision I don't believe (you can hack around it but its often not worth it) so it often isn't suited for a lot of high-precision instructions... but the dual FPU's of the 970 would be, lord.

      In other words, if you have a Photoshop document with 10,000 pixels and wanted to run a fairly simple single precision instruction on all of those pixels (ie, a filter) a G3 would take the same instruction and apply it to each pixel. If you had a G4, and the code was made to use altivec, Altivec could take its 128bit engine, and assuming each pixel was 8bits worth of data could run 16 pixels through the engine at once and output the answer. Obviously in most cases (due to architecture, data needs, etc) it most often isn't going to be 16x faster, or even 2x faster...

      So a case could be made that if you can do the above instruction in 30 seconds on a 1GHz G3, and 15 seconds on a 1GHz G4, having a 2GHz G3 would run it just as fast as the G4 but would run EVERYTHING else that altivec can't be used for or isn't appropriate for or people just haven't taken the time to code for twice as fast too.

      That's always been the old IBM argument- that adding altivec added complexity to the design helping to keep it from scaling, and they would have rather either used the space for other things (cache, more units, etc) or decreased the die size altogether allowing for cheaper and faster chips.

      drunkenbatman

    15. Re:drop AltiVec by macmastery · · Score: 1
      AltiVec is important for Apple marketing

      Note that Apple markets "AlitVec" as "Velocity Engine". http://www.apple.com/powermac/architecture.html

    16. Re:drop AltiVec by Pig+Bodine · · Score: 1
      It's a simple cost/benefit analysis. I can spend a week shoe-horning something into the BLAS interface and possibly create obscure bugs in the process, or I can just write a reasonably efficient high-level loop and live with whatever the compiler gives me.

      Except for the fact that a large number of scientific codes come down to solving linear systems, eigenvalue problems, discretizing and solving a PDE etc. For these sorts of standard problems, the libraries are easier to use than writing your own. They are also more likely to be robust with respect to round-off error. And the people who write such code quite often use BLAS or optimize their code for a particular architecture.

      I do agree that if you have a problem for which there are no higher level libraries then it's probably not worth your time to worry about heavy optimization or use of relatively low-level stuff like BLAS.

    17. Re:drop AltiVec by Lars+T. · · Score: 1
      "But the reality of regular high-end computing is that people don't have the time to optimize their software for the latest oddball hardware platform."

      Yet you keep saying they have nothing better to do than recompile their code whenever Intel blesses them with a ne processor, like SPEC tells them to.

      --

      Lars T.

      To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

    18. Re:drop AltiVec by squiggleslash · · Score: 1
      I dropped a 433MHz G4 into a Beige G3 that previously had a 300MHz CPU. I've gotten a speed up - iTunes, for example, now rips at around 3.5-4.5x as opposed to 1.5x-2.5x previously (and I have another Beige G3 I dropped the 300 into, so I know this isn't a software issue, the figures are still consistant.) Both CPUs have a 1 meg L2 cache.

      Now, that's a speed up, but it's nothing like the "3x-6x for a clocked-slower G4 over a G3". It's more like a "nearly twice as fast for a CPU clocked at about 50% faster". That's impressive, but it's nowhere near the figures above.

      So my question is - would your comments about the bus be responsible for the less impressive speed-up? Or am I missing something bigger?

      (Sonnet are about to release a 1GHz G4 for the Beige G3 machines, so it'd be interesting to see if it's actually going to improve things before I spend the cash on it.)

      --
      You are not alone. This is not normal. None of this is normal.
    19. Re:drop AltiVec by g4dget · · Score: 1
      Yet you keep saying they have nothing better to do than recompile their code whenever Intel blesses them with a ne processor, like SPEC tells them to.

      I have actually consistently argued against using Intel's Pentium 4 compiler (and Itanium as well), for pretty much the same reasons I think AltiVec is a bad idea.

      But, hey, don't let little details like facts get in the way. After all, you are on a crusade, and if the cause is as worthy as Holy Apple, anybody who doesn't agree with you is a mortal enemy, and what do a few little lies matter then anyway?

    20. Re:drop AltiVec by Lars+T. · · Score: 1

      Hey, YOU keep saying that SPEC scores are the bee's knees. But don't let the facts get into YOUR way.

      --

      Lars T.

      To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

    21. Re:drop AltiVec by Anonymous Coward · · Score: 0

      Apple's altivec fft is, they claim, faster than FFTW.



      "If you are a developer who would like to make more use of AltiVec in your application, you should first explore the functionality that is available in Apple's vecLib.framework. This framework can be found in /System/Library/Frameworks on any MacOS X installation. There you will find many highly optimized routines for a diversity of tasks. There are single and double precision FFT's. Some of these are up to five times faster than FFT-W."

    22. Re:drop AltiVec by g4dget · · Score: 1
      Hey, YOU keep saying that SPEC scores are the bee's knees.

      And what does that have to do with anything? SPEC scores can be measured with many different compilers, and, obviously, I prefer to use SPEC scores that don't rely on weird compilers like the Intel compiler.

      But even if people use Intel compilers for benchmarking, that still tells us a lot. For the comparisons you seem to be obsessed with, the P3/G4, the gcc and Intel scores for the P3 are similar. P4 systems beat G4 systems both in terms of top performance and price/performance even using gcc. And the Opteron outperforms the Xeon even if you compare gcc-compiled Opteron code against Intel-compiled Xeon code.

      Face it, PPC is just behind in the performance and in the price/performance curve. I sure wish some manufacturer would step up to the plate and deliver an affordable high performance processor not based on x86. But gimmicks like AltiVec are not the answer and they only delay the release of processors like the 970 to the point where it, too, will be behind the curve.

    23. Re:drop AltiVec by Lars+T. · · Score: 1
      the gcc and Intel scores for the P3 are similar.

      Sure, if you think that 10% higher scores (ICC 7.0 against gcc 3.2.2) or even 20% (against gcc 2.95) are "similar" - and that includes "evil" SSE support on gcc. And if you happen to use MS C...

      Last but not least: even if all you care about is raw, non-SIMD performance - unless you used a 604 based Mac over a Pentium+ PC, why should I (or infact anybody) listen to you?

      --

      Lars T.

      To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

    24. Re:drop AltiVec by gig · · Score: 1

      Altivec is used by hundreds of Mac apps. It is not there for marketing or benchmarks.

      When an app is updated to use Altivec, you typically see double the performance. So you can run 40 effects in real-time and then you get the Altivec version of your app and you can run 80 effects in real-time.

      Photoshop filters are the least of Altivec's uses. Encoding MPEG-2 and MPEG-4, encryption of any kind, Digital Signal Processing ... these are what Altivec is used for.

      Mac users are doing heavy audio, video, encoding ... these are things that can be sped up dramatically by Altivec without having to build a great big chip power-hungry chip to brute force these calculations.

      Slagging Altivec in favor of Intel just makes you look like an idiot. It shows you haven't seen it in action. There are jobs my PowerBook does faster than the biggest, hungriest P4 desktop with a CPU that takes 10x the power.

      I mean, go into an Apple Store and see this stuff in action before you shoot your mouth off.

  16. Based on this, by vlvtelvis · · Score: 0, Offtopic

    I'm assuming that the new procs must have some kind of support for the evil bit.

  17. Re:Another Dupe by the_2nd_coming · · Score: 0, Offtopic

    speaking of the matrix.....things are not as they seem!!!! bad guys are good good guys are bad.

    --



    I am the Alpha and the Omega-3
  18. nope. by netsrek · · Score: 3, Interesting

    Try doing audio signal processing or heavy graphics/video work.

    You're pretty thankful for your Altivec then...

    I saw such an insane improvement in Reaktor when it got Altivec enhanced...

    --

    i don't read slashdot anymore.
    1. Re:nope. by iomud · · Score: 2, Insightful

      But you don't need altivec for a next gen chip that's so much faster than it's predecesor that no one would notice. There is a huge disparity in the performance of current G4's and whatever they're gonna call the 970 based machines.

    2. Re:nope. by jovlinger · · Score: 0, Flamebait

      ya.

      But there's like three people in the world who actually use altivec. Hardly optimizing for the common case... (that said, I do run mplayer which I think relies on similar SIMD instructions on the x86 to provide realtime img post-processing).

      Wouldn't it be cheaper for all concerned for apple to seriously subsidize a PCI photoshop accelerator (basically a card full of DSPs and RAM with a wide bus)?

      I'm thinking back to the days when I would read BYTE and to drool over this black (really cool, in those days) 486/Pentium box filled to the brim with i960 cards. By a company called... erm.. Microway? With their compiler, it could reach a peak of 1 gigaflop. Back in the early ninety's. Do some reverse Moore's law extrapolation to see how impressive those numbers were.

      Now if you're going to cater to heavy number crunching, it seems that's the way to go. Just admit that the CPU is there to deal with the UI, and let the DSPs do the effects.

      Has anyone used one of those machines?

    3. Re:nope. by bnenning · · Score: 2, Insightful
      But there's like three people in the world who actually use altivec.


      More than 3 people have ripped music in iTunes. Then there's the tremendous acceleration it provides for encoding DVDs, Final Cut Pro's real-time effects, BLAST, and plenty more. It's not even close to just Photoshop.

      --
      How to solve most of our problems: 1.Lots of nuclear plants. 2.Cure aging.
    4. Re:nope. by g4dget · · Score: 1, Insightful
      If you spend $3800 on a high-end Mac, you get something that runs a small number of hand-optimized commercial programs really fast. That may be useful for some of Apple's core audience.

      But if you spend the same $3800 on x86 hardware, you get a small compute cluster that runs a lot of software faster and without AltiVec optimizations. For most scientific applications, as well as most video and audio applications, that's probably a better deal in terms of bang-for-the-buck, but, admittedly, it's probably not something your average Mac user wants to set up.

    5. Re:nope. by netsrek · · Score: 1

      It's also something that isn't necessarily feasible for the kind of audio work I do.

      Clustering multiple machines together is still problematic for stuff like this.

      It makes writing music with software instruments/fx bloody annoying as well.

      Scientific stuff is a whole nother ballpark, just pointing out that I know I'm not the only musician who uses the Altivec stuff a hell of a lot...

      --

      i don't read slashdot anymore.
    6. Re:nope. by andrewski · · Score: 2, Informative

      They had a Mac with a DSP built-in back in the day. The Quadra A/V! It blasted the shit out of most any other computer when it came to Photoshop - rivaling modern computers for some tasks.

    7. Re:nope. by g4dget · · Score: 0, Troll
      It makes writing music with software instruments/fx bloody annoying as well.

      If s/w developer spent the same amount of time on clustering as they do on AltiVec or MMX hacks, you wouldn't even notice. And it would be a hell of a lot easier to upgrade--just plug in another box when you need more power.

    8. Re:nope. by TheRaven64 · · Score: 3, Insightful
      Using SIMD is intrinsically easier than using a cluster. To use a SIMD instruction set all you need to do is isolate occurences where you are applying the same operation to multiple units of data and (in the case of AltiVec) call the corresponding vector operation in the standard lib.

      Coding for a cluster introduces all kinds of communication and synchronisation headaches, especially since it takes such a long time to communicate between nodes (1ms is a very long time in terms of a CPU).

      --
      I am TheRaven on Soylent News
    9. Re:nope. by g4dget · · Score: 1, Interesting
      Using SIMD is intrinsically easier than using a cluster.

      They are just different. Clustering allows program-level parallelism, which gives you nearly linear scaling for throughput for arbitrary programs with no programming effort. SIMD and vector processors are very specialized and require a lot of effort to use well, and that effort is usuall only worth it if you need to lower processing latencies.

      (Note, incidentally, that one of the most important SIMD machines, the Connection Machine, was built as a network of processors.)

    10. Re:nope. by drunkenbatman · · Score: 2, Interesting

      They had a Mac with a DSP built-in back in the day. The Quadra A/V! It blasted the shit out of most any other computer when it came to Photoshop - rivaling modern computers for some tasks.

      They actually made two: the Quadra 660AV and the Quadra 840av (there was also a centris line without the fancy stuff). The 660 used a 25MHz 68040, and the 840 used a 40MHz 68040, and had a seperate DSP that you had to write specifically for.

      Apple's thought was kinda ahead of the curve at the time, in that they were introducing speech recognition, were going to ditch the modem in favor of the geopod (just a jack, cost about half as much as a modem and you'd do the same stuff in software using the DSP). After it died (ver quickly) there were rumors that they'd be adding a phillips trimedia card later on, but that was dropped as they learned their lesson hopefully from the AV fiasco.

      At no tasks could those computers hold their own with any "modern computer". The 840 held its own for awhile, but that was due to the fact that it had the fastest released 040 chip, there wasn't much PPC software for awhile and the fastest powermac (80MHz) released later on emulated 68K code at about the same speed as the 840 ran natively.

      But honestly- and I did own one and loved it- they were a bad buy. Since you had to code specifically for the DSP, only a few photoshop filters and one or two 3D programs wrote any code for it... maybe a few things here and there, but nothing really of note. The telephony aspect of it sucked bad and never really worked well, so the DSP just sat there hanging out 99.9999999999% of the time as Apple themselves said they wouldn't be included in future machines a bit later as they weren't needed due to the speed of the PPC when using native code.

      Sorry, just at no time did they rival modern computers- they were cool for what they were, and allowed voice recognition and playback at a time when it was unthinkable... but that's about it.

      drunkenbatman

    11. Re:nope. by gig · · Score: 1

      Altivec is much, much, much faster than you think it is.

      If you're thinking of an Intel/AMD SIMD that gives you a 1% improvement then it would make sense to drop it from a next-gen chip that is 100% faster.

      Altivec often gives double or more performance, though. Since the CPU-intensive Mac apps all use it, it makes sense to keep it in there and just make it faster, too.

      Altivec is also exactly what you'd want if you are doing DSP and encryption and encoding MPEG. We do a lot of that on the Mac. All you have to do is make one DVD Video disc on a Mac and then do the same job on any Intel machine and you'll see the difference. What's encoded in an hour on the Mac will take 8 hours on the PC.

      Many Mac users know and love Altivec because as our apps were updated to use it (a few years ago) we would see tremendous performance gains without burning holes in our laps. So one day you can run 30 effects in your Digital Audio Workstation and then you get a small update that includes Altivec-optimizations and now you can run 60 effects at once. You don't have to get out the meters and testing suites to see the benefit.

    12. Re:nope. by gig · · Score: 1

      A HUGE number of Mac apps use Altivec.

      You benefit especially in PowerBooks where you do these massive computations on a low-power CPU.

      Altivec apps (off the top of my head): Pro Tools, Logic, Cubase, Performer, iTunes, Final Cut, Avid, iMovie, iDVD, iPhoto, QuickTime, Mac OS X (Quartz, CoreAudio, Disk Copy, more), Photoshop, Illustrator, Dreamweaver, Fireworks, Flash, FreeHand.

      You have to run an Intel chip at 70 watts to get it to brute force these computations as fast as a 15 watt chip with Altivec. Get the picture?

      On the Intel side, you guys pay zero attention to power management. Multiple fans in a big noisy box on the desktop, and stripped-down "mobile" CPU's in the notebooks that just don't perform. Put a PowerBook side by side with the best Intel notebook and the Intel stuff is a joke.

  19. yea yea itza dupe. BAMBOOFUCKER. by Anonymous Coward · · Score: 0, Redundant

    Important Stuff: * Please try to keep posts on topic.
    * Try to reply to other people's comments instead of starting new threads.
    * Read other people's messages before posting your own to avoid simply duplicating what has already been said.
    * Use a clear subject that describes what your message is about.
    * Offtopic, Inflammatory, Inappropriate, Illegal, or Offensive comments might be moderated. (You can read everything, even moderated posts, by adjusting your threshold on the User Preferences Page)
    * If you want replies to your comments sent to you, consider logging in or creating an account.

  20. Beat the Karma Whores by Anonymous Coward · · Score: 0

    Unfortunately, the vector performance of the G4e has been consistently bottlenecked by Apple's lackluster motherboard and chipset designs--specifically the anemic frontside bus and memory subsystems that Apple has saddled the PowerMac line with.
    This implies that the decision of how much bus bandwidth to give the G4e was up to Apple - which it was not. Motorola designed the processor (for Cisco, depending on who you believe), and Apple made do with the anemic MaxBus at 133mhz that they got from Motorola.

    Apple'd be putting DDR400 on the G4 right now if they could. None of this (well, except the decision to go Moto) was their fault.

    *****

    Why this had to be posted the morning before my presentation to my supervizor is a clear indication that the universe is against me.

    Time to hide my network cable until the end of the day.

    *****

    Who knows whether it will still be competitive in several months when they actually want to offer it.

    On the other hand Apple users won't have much of a choice, and neither has Apple.

    *****

    The PPC 970 will not really make the Macintosh competitive with modern PC's. It will make it competitive with PC's from the beginning of this year, which are not the fastest available any more, and will be even slower when compared to the machines that are available when the PPC 970 ships, which is the very earliest that Apple machines based on it can ship. It will however go a long way to catching up, and take off a lot of the pressure caused by the abominable performance of today's dual processor G4 machines when compared to even inexpensive PC's.

    The other unkown in this is the price. PPC 970 based Apple computers may be significantly more expensive. Motorola loses hundreds of millions of dollars each year on their semiconductor business, and IBM does as well. Still, IBM may want to look at Apple and the PPC 970 as a PROFIT center, rather than a LOSS center, like Motorola does with Apple and the G4.

    The PPC 970 is great news for Apple, but it is still a bone thrown to them while the x86 PC is feasting on the meat of the Intel and AMD processors.

    *****

    Reading through the article, its nice to see some real design going into a processor. Looking through Intel's last few chips, they've been upping ther clock speed and packing in more cache.
    Yeah, yeah, they are hog-tied because you can't easily re-compile the entire windows platform to use new instruction sets. Linux users, of course, don't have this problem (muhahahah).

    Did anyone else catch the bit on the twin FPU's? I'm just imagining what this thing is going to do with vector operations and frequency transforms.

    For most of you non-engineers:

    -Most 3d vector operations are affine tranformations. Using a 4x4 array of floating point numbers you can translate, rotate, and scale. Works beautifully, but it's a lot of calculations.
    -The Fast Fourier Transform (FFT) is used a lot in signal processing. It's a floating point monster.

    *****

  21. Imagine a Beowulf clu... by Anonymous Coward · · Score: 1, Funny
    ...

    Eh, nevermind.

  22. Re:Another Dupe by hawkear · · Score: 1

    You'd think michael and Taco talked every once in a while - less than 24 hours between duping this one.

  23. Floggings will continue until moral improves! by djupedal · · Score: 2, Funny

    Whining about dupe comments is worse than the whining in the dupe comments, and thus the point....don't bitch about the symptom, lobby to stop the source of the pain, and the whining will cease at the same time.

    "But Mom, I don't want to go to France!" "Shut up and keep rowing!"

    1. Re:Floggings will continue until moral improves! by Namarrgon · · Score: 1

      And the whining about all the whining about dupe comments? Where does that fit in?

      --
      Why would anyone engrave "Elbereth"?
  24. Mod parent up! by Anonymous Coward · · Score: 0

    I fail to see how the parent post was at all "redundant."

  25. Second time over by haxor.dk · · Score: 0

    This is the second time it's posted. Google News could almost do better than the /. editors.

  26. Performance vs P4 sucks by Compunerd · · Score: 0, Troll

    A quick google for SPEC(int|fp)2000 values for Intel P4 (http://www.aceshardware.com/SPECmine/) shows the P4 3000's SPEC numbers are around 1200. So. What's so cool about this?

    --
    Computers are like air conditioners.
    - They stop working when you open Windows.
    1. Re:Performance vs P4 sucks by Alan+Partridge · · Score: 1

      well, I supose that Apple will use two of these chips in their workstations at a time, and coupled with the extensive Altivec optimisation programme that Apple has been conducting, and their ever expanding list of grade 1 in-house applications (Final Cut Pro, DVD Studio Pro, Logic, Shake, iTunes, QuickTime) that really can benefit from those optimisations, the near future looks very encouraging for media creation on the Mac platform. The 64 bit addressing made available by the 970 may yet have profound implications for Apple's viability as a server/high end workstation provider - they could certainly finish off SGI's desktop business.

      --
      That was classic intercourse!
  27. Turning the FFT into an integer monster. by MickLinux · · Score: 2, Interesting

    In response to your FFT being a floating point monster... in a lot of cases, couldn't you turn it into an integer monster? I've been thinking about this, and it occurs to me that the vector can be decomposed into halves (thus the 2^x units in the FFT), but a vector and angle theta it can as easily be decomposed into to vectors half the length, one at angle phi, and the other at angle (2theta-2phi).

    That, where phi is any angle. That being the case, it seems to me that you could pick your values phi to correspond to "perfect" triangles (3-4-5, ~42 degrees, for example), and keep your operations in the integer realm for everything except subtraction of angles.

    I dunno, I haven't checked this out really thoroughly, and this is therefore probably nonsense. Last time I tried to do anything with the DFT, I thought I had something that blew the FFT away in terms of speed... precisely because I didn't understand the full FFT process, and its beautiful simplicity.

    In reality, I got a very modest improvement over the FFT, not worth the extra code in my opinion.

    My method was very different, involving a redefinition of the DFT matrix-vector combination, and had more work on paper, but fewer multiplications. But what I thought was (log2n)^2 instead of the DFT's N^2 order of magnitude multiplications, was really something like 0.87Nlog2N multiplications. FFT gets N*log2N multiplications.

    Essentially, when I understood the FFT well, and applied my lessons to it, I ended up showing that not all the multiplications are neccesary. Some of the FFT multiplications are dupes just like this article, and there is a system for finding them, also just like this article. (Look for the multiplications posted by Taco.)

    But the fact that I can make such errors means that I could be completely wrong about my supposed integer FFT.

    --
    Correct Horse Battery Staple: 72 bits of entropy. Enter "Correct H" into google. When it generates the phrase, that's
    1. Re:Turning the FFT into an integer monster. by Pig+Bodine · · Score: 1

      You can definitely do the FFT in integer arithmetic. I'm not sure of the details. Fixed point arithmetic is something I don't think about very much. It would be more work than floating point and it doesn't sound very fun to me. But then I'm very lazy, as my presence goofing off here proves. Still, I wish you good luck. A good integer FFT is a noble goal.

      Concerning your attempt to get rid of multiplications, the Winograd FFT reduces the number of multiplications to O(n) but the number of additions increases (to O(n^2) if I remember correctly.) For large n this is obviously not useful unless multiplication is much slower than addition (which is not the case on todays computers.) It is however sometimes used for short length modules in mixed radix FFT algorithms.

      If you are interested in efficient FFT, a web search turned up the following, which seems to have a ton of references (including the Winograd paper): burrus-notes There is also a good general reference book by Charles Van Loan published by the Society for Industrial and Applied Mathematics: SIAM It's amazing how much work is done in this area. The FFT is very important.

  28. It's a bug. Patch is available by MickLinux · · Score: 1
    Here's the patch.

    Link here. In your browser, find "CmdrTaco", click on the checkbox next to it, and then go to the bottom and click "submit" (rough translation from swahili: submit = "apply patch".

    [JUUUUST kidding, don't do this or you won't see any more of CmdrTaco's articles.]

    --
    Correct Horse Battery Staple: 72 bits of entropy. Enter "Correct H" into google. When it generates the phrase, that's
  29. Actually, a real suggestion by MickLinux · · Score: 1

    I really don't think it's possible for each of 30 people to be aware of all 30 other articles. Why not assign one person to read *all* articles, and flag dupes? Then everything has to be cleared by him, and we'll eliminate dupes. And if CmdrTaco or someone else has a reason that it should go up again, he can argue it out, and modify the headline accordingly, so the readers will also know why this should go up again.

    --
    Correct Horse Battery Staple: 72 bits of entropy. Enter "Correct H" into google. When it generates the phrase, that's
    1. Re:Actually, a real suggestion by gl4ss · · Score: 1

      it's not possible for slashdot editors to read slashdot?

      i mean, how busy can they be.

      --
      world was created 5 seconds before this post as it is.
  30. But this time in the apple section by cappadocius · · Score: 1
    Sure. But the original wasn't posted to the Apple section, so those of us who read the Apple section daily, but the main page only when time permits are seeing it for the first time.

    If you haven't seen it, it's new to you.

    --

    omnia tua castra sunt nobis

  31. Good question by djupedal · · Score: 1

    Well, after the first round of follow-up whinings (level 3) are logged, there is an 'incident number' (IN) reset, and the next whiner (NW) in line goes to the front of the queue. Unless of course, you have already participated in first-round whining (levels 1;2;3), in which case you have to sit out.

    Whining-by-proxy, substitute whining, pitch-whiners, designated whiners, ghost whiners and stand-in whiners are all permitted (first-round whiners, all levels), but only for Rhode Island, New Jersey, some parts of lower Manhattan and the District of Columbia.

  32. Friend's friend. by the+uNF+cola · · Score: 1

    See, your friend's enemy is your friend. No wait. Your friend's friend can also be your enemy. No...

    Ah frig'..

    Apple is going along /w IBM on their new chips. No more analysis :)

    --

    --
    "I'm not bright. Big words confuse me. But Wanda loves me and that should be enough for you." - Cosmo

  33. FFTW speed claims by Trepidity · · Score: 1

    djb (of qmail fame) has some rather uncomplimentary things to say about the accuracy of FFTW's speed claims.

    1. Re:FFTW speed claims by Anonymous Coward · · Score: 0

      What do you expect? He's a professional Grade-A asshole.

  34. Nonsense by Karl+Cocknozzle · · Score: 2, Interesting
    I really don't think it's possible for each of 30 people to be aware of all 30 other articles.

    Bullshit. When I worked foy the University Daily Paper we had no problem avoiding duplicate stories all over the paper... And we ran FAR MORE THAN 30 STORIES A DAY.

    In my example it was a bunch of drunk/high/rushing out to get laid coward students--Can't professionals who are being paid do their damn job right do AT LEAST as good as the wasted college kids?
    --
    Who did what now?
  35. integer FFTs aren't uncommon by Trepidity · · Score: 3, Insightful

    You just use fixed-point arithmetic instead of floating-point (i.e. a fixed 32 bits of precision, or 16 bits, or whatever). A simple way of doing is is to make INT_MAX/2 = 1.0, -INT_MAX/2 = -1.0, and everything in between scaled appropriately. (/2 to avoid overflow). Then you implement fixed-point addition, multiplication, division, and subtraction (as commonly doing in hardware DSP chips) and you've got yourself an integer-only FFT.

    Some really old C code doing something along these lines is available here.

  36. Re:NOT IBM NO!!! by Anonymous Coward · · Score: 0

    Motorola is much better than IBM as far a chip companies go and I thought you were going to buy MOTOROLA

    If "Makes crap chips like G4e's and doesn't bring them into the 21st century" is much better than "Makes awesome chips like 64bit 970s" then you have a pretty fucked up definition of "much better"

  37. Departments by bjb · · Score: 1

    I like how this duplicate was posted under the 'take-a-closer-look' department.

    --
    Never hit your grandmother with a shovel, for it leaves a bad impression on her mind...
  38. CDE by arthas · · Score: 1

    What is wrong with CDE? It is simple and efficient. I use it on my Alpha and I am quite satisfied with it.

    The best alternatives to CDE would be Indigo Magic (used on SGI Irix systems, perhaps the best desktop environment I have ever used) or Window Maker.

  39. slightly different analysis by Anonymous Coward · · Score: 0

    Sun: crappy hardware; moderately priced; Solaris/ free UNIX
    AMD (multiple vendors): decent hardware; relatively inexpensive; Windows/ free UNIX
    Intel (multiple vendors): good hardware; moderately priced; Windows/ HP/UX/ free UNIX
    IBM: good hardware; expensive; AIX/ free UNIX
    Apple: decent hardware; unknown price (probably inexpensive to moderately priced); OS X/ free UNIX

    It really depends on your goals and your applications. If you just want the cheapest, Apple or AMD hardware would be the best bet. Keep in mind that AMD hardware won't really be cheap until the Athlon 64 is introduced later this year; right now it is still inexpensive only in comparison to other 64-bit workstations. Apple pricing is unknown and will likely depend somewhat upon the prices that IBM gives them. Motorola bleeds lots of cash on their behalf; IBM may not be willing to do the same.

    On the other hand if you were looking for a 64-bit workstation, you would probably consider Sun if your company already has lots of Sun systems, and the other vendors depending upon which one excels in benchmarks related to your application (AMD, IBM and Intel processors all have slightly different strong points; you really need to benchmark to find out which is best for you) and the money available.

    Apple may be a good choice for content creation. It is likely to be less expensive, but also slower, than the competition from AMD, IBM and Intel. Lack of workstation applications may be a barrier for adoption to other workstation tasks.

  40. Altivec - Logic Audio; value of Mac platform by Steve+Cowan · · Score: 1

    Agreed. I use Logic Audio a lot, and my 350 MHz G4 can run about 3 times more simultaneous real-time DSP plug-ins than my 500 MHz iBook. I realize the 66 MHz system bus on the iBook comes into play here (it's 100 MHz on my old G4), but my mom's iMac (400 MHz with 100 MHz system bus) performs about as well as my iBook.

  41. yep by Zergwyn · · Score: 2, Insightful

    I agree with this totally. A surprisingly large, and ever increasing, amount of OS X libraries use altivec, which means that developers using those libraries get some acceleration for free. Altivec is much easier to optimize stuff for then MMX, SSE2, etc.

  42. A thought on how to avoid duplicates by merger · · Score: 0, Offtopic

    I just had a thought that would probably reduce the number of duplicates significantly. At the point where the author hits submit, the story checks all of the links in the article and then does a comparison of those against the complete story archive. At that point if the article has a duplicate it will find it and let them know.

    A problem with this method is what may happen with this story. After the editors typically link directly to the story they also link to the referring domain. A way around this is that if there is more than one link in the story then any link to a base level site (ie. www.arstechnica.com) is ignored and the links focus into the sites pages and folders are paid attention to.

    I may later play around with a couple of searches of the database to see how well this method works, but I think it would improve things significantly.

  43. Yet amazingly... by Infonaut · · Score: 1

    There are also 7th graders who can spell and edit better than these guys. It's really embarrassing.

    In spite of all these horrible shortcomings, we get all of the good things Slashdot provides for free. It's not perfect. They make mistakes.

    Get over it. Anyone can be a sharpshooter, waiting for someone else to screw up. But it takes a lot of hard work and dedication to put something like Slashdot together. Cut the guys some slack, or create your own website and call it Gripeslash.

    --
    Read the EFF's Fair Use FAQ
  44. Up to marketing, not technology by mariox19 · · Score: 3, Interesting

    For me, the most interesting part of the article concerns the pricing of the new machines as the real question. According to the author, the chip will make Apple machines technologically competitive. The question is, will Apple price them to gain market share, or continue to sell to a disappearing niche of luxury computer buyers.

    Maybe Apple's concentration on developing software, and selling that software (rather than giving it away), along with its new business ventures, such as .Mac and the new iTunes online music store, point to a new business model that can afford to cut the margins on hardware.

    If they don't lower the price of their machines -- the top ones, namely -- they will suffer, long-term. I don't think they need to be on par with PC's; I just think they cannot be too much more expensive than the PC's.

    --

    quiquid id est, timeo puellas et oscula dantes.

  45. Apple's done fine with this pricing for decades by Gorimek · · Score: 1

    Yeah, macs are a bit more expensive, but not really by much if you compare to comparable brands and quality, not self assembled garage models.

    And I agree in theory on your pricing opinion, but it's just that in reality Apple have been pricing their machines in pretty much this way for 20 years, and they have made a very successful business out of it. They've also continuously been pronounced on the verge of death for all those 20 years.

    So I don't expect either Apple pricing or their good fortunes to change anytime soon.

  46. troll? by netsrek · · Score: 2, Interesting

    wtf? hey maclots.... Just cause someone is criticising Altivec doesn't necessarily make them a troll....

    I do agree with you that clustering could be far more useful than it currently is, but as you say, anything that requires low latency is kind of problematic...

    As far as clustering goes, you know you're able to put together a PC processing monster and use VST System Link ?

    Been considering this to add to my TiBook...

    --

    i don't read slashdot anymore.
    1. Re:troll? by gig · · Score: 1

      VST system link is garbage. Read up on how it's supposed to work and it will amaze you that they built it. It's an end-run around the lack of clustering in MS Windows. Also, Steinberg are second only to Microsoft in making buggy software.

      On the Mac side, Apple's Shake sends jobs out over the network using Rendezvous to find more compute power. Logic and Final Cut Pro will start doing this soon. This kind of stuff is easy to do right on Mac OS X.

    2. Re:troll? by netsrek · · Score: 1

      Have you actually used it? I do know how it works btw... There's a real difference between needing real time processing power and something like rendering that Shake/FCP does. I'd be really suprised if Logic manages to use Zeroconf over a network to achieve this, as the latency of a network connection isn't really feasible for this kind of work afaik.

      --

      i don't read slashdot anymore.
  47. Someone brought this up on MacSlash today... by Anonymous Coward · · Score: 0

    ...and I reply with the same comment.

    I went to Dell and configured a PWS 450 (I think that's what it was called) with almost all the bells and whistles of the top-end G4. I didn't include a SuperDrive with the G4 and went DVD/CD-R because I couldn't tell which Dell option was closest to the SuperDrive. I always chose the cheapest component. I also left out the monitors and tried to get the same specs for the video card.

    A dual processor 2.0 GHz PWS 450 from Dell rang in around $3200. The Dual 1.42GHz PowerMac G4 rang in at $3500.

    Personally, I don't think 10% ($300) is too much more expensive for a Mac. Do you?

  48. Re:NOT IBM NO!!! by Anonymous Coward · · Score: 0

    Do the math. I have IBM chips from the 601 down to now and the 64 bit vectoring is not as good as altivec 128 bit. Humm 128 or 64 bit. I choose the Motorola built apple. Yes and even the gcc now has very good altivec factoring. The last time I checked 128 bit is twice that of 64 bit you just have to use the fuctionality. If you like IBM so much go work for them and you will see how evil a company they are. Hey but maybe you like clear channel too ( Clear channel is the government radio in USA). No bull Motorola is better.