Slashdot Mirror


New Itanium More Powerful, Power Efficient

Heir Of The Mess writes "Intel have a press release out about their new Itanium 2 Processor. The new processor doubles the performance, and improves performance per watt by 2.5 times compared to the existing single-core versions. The flagship model triples the cache and can execute 4 threads/instructions per processor enhanced by Hyper-Threading. Transistor count is a whopping 1.7 billion. Triples the previous SPEC_int_rate_base_2000 record. Retails for US$3692 for the top of the range.
So yes the Itanium crew are still pushing forward. I wonder if this could help save SGI?"

51 comments

  1. The ship has sailed by powerlord · · Score: 5, Insightful

    Considering how little buzz this article seems to be generating (even on Slashdot), it seems like this might be a case of too little, too late.

    The world has moved on from iTanic to x86-64.

    --
    This space for rent. All reasonable inquiries will be entertained at proprietors discretion.
    1. Re:The ship has sailed by jo42 · · Score: 1

      Someday Intel will admit this product is a failure and stop wasting time, money & effort on it. Like 2 or 3 years ago...

  2. In other (disappointing) Intel news by nxtw · · Score: 1

    Intel's profits fell, according to their earnings report last night.

    Not good for an INTC shareholder.

    1. Re:In other (disappointing) Intel news by tomstdenis · · Score: 1

      It's ok, all Intel needs is people to buy the equivalent spec_int box [a mere half million dollars USD will do] and they'll be smooth sailing. :-)

      Tom

      --
      Someday, I'll have a real sig.
  3. Limited Scope by tomstdenis · · Score: 4, Insightful

    The problem with the IA-64 platform is there are so few applications for which it is well suited. And even for problems it IS well suited it's a matter of figuring out how to extract the desired performance.

    That said, a well tuned IA-64 application can smoke the best offerings from x86 world [on both sides of the fence]. But a $3700 USD price tag may push people away. Specially since processors like the Opteron 285 are nearly half the price and way more flexible. :-)

    Tom

    --
    Someday, I'll have a real sig.
    1. Re:Limited Scope by fatphil · · Score: 1

      The itanium is the ultimate batch processing box. Task switches are expensive with that great fat register set, so you just want to let it work on one thing at a time.

      Superb for numerics, as you can hide all the latencies by using enough parallelism.

      One where the compiler will save you a lot of hassle, but probably not be totally optimal as there are too many possibilities for it to investigate.

      I'd certainly like to play with one. My bet is that a well-tuned FFT could be blisteringly fast.

      FatPhil.

      --
      Also FatPhil on SoylentNews, id 863
    2. Re:Limited Scope by tomstdenis · · Score: 1

      For the price you pay for IA64 boxes you're going to be learing IA64 assembler fairly quick.

      From what I heard handtuned bignum code flies on the older IA64 [doing RSA-1024 private key ops in ~500K cycles].

      But when you pay $3600 for a processor you want to get the maximum benefit of it.

      Tom

      --
      Someday, I'll have a real sig.
    3. Re:Limited Scope by vanyel · · Score: 1

      What could an x86 processor do with a 1.7 billion transistor budget? With a vast cache and/or multiple cores and/or pipelines I would think you'd get into the same performance range or maybe even better, without a (relatively) weird architecture.

    4. Re:Limited Scope by tomstdenis · · Score: 1

      The problem is most architectures are fairly locked down. For instance, putting another MUL engine in would be nice except MUL only reads from two registers and only outputs to two registers. A fourth pipe would raise retire latency, etc, etc, etc. In the AMD world at least most common ALU opcodes are already directpath. Larger caches could help but they raise the latency, so would raising the set-associtivity. More TLB entries are always welcomed, etc...

      As an employee of one of the chip companies I couldn't really go into any other ideas without treading on NDA boundaries. Lets just say both camps have uses for more transistors. Don't worry :-)

      Tom
      [STD disclaimer...]

      --
      Someday, I'll have a real sig.
  4. Working with the hardware directly by drachenstern · · Score: 4, Informative

    I can tell you that the processors are not even in production at the main server manufacturer backing the Itanium family (ie, not Intel, the other one*).

    We have seen a few proto style units roll through, but they have all had serious problems and are not running at full speed. The engineering group either cannot or will not give us a reason why these units are running crippled, but we believe it to be a chipset issue. Hopefully we will see the servers rolling through our manufacturing process within the next 60-90 days, but no management timelines have been released.

    Here's to all those of us who want that raw power and are looking to pay for it!!!

    *Won't disclose the name since I don't know if this info violates my NDA, but screw it, the public should know this stuff. Information does deserve to be known.

    -drach!

    --
    2^3 * 31 * 647
    1. Re:Working with the hardware directly by $RANDOMLUSER · · Score: 1

      Sigh. I guess it's our best hope since your management has killed both Alpha and PA-risc for this albatross.
      Maybe Cell...
      Lots and lots of Cells...

      --
      No folly is more costly than the folly of intolerant idealism. - Winston Churchill
    2. Re:Working with the hardware directly by thebdj · · Score: 1

      Would that server making be HP by any chance? I mean the did help develop Itanium 2. Just about every workstation or server I've seen with an Itanium is an HP...hm....come now, if it is HP you wouldn't be breaking the NDA by revealing common knowledge.

      --
      "Some days you just can't get rid of a bomb."
    3. Re:Working with the hardware directly by nkpatel · · Score: 3, Interesting

      I gotta call BS on this one. I work on the development team for many of these servers. (No, I'm not allowed to tell you when they'll ship, so don't ask). There are NO chipset issues I am aware of, and the units are not crippled. I'd be interested in finding out why you think this is the case. Case in point: ahref=http://www.tpc.org/tpcc/results/tpcc_result_ detail.asp?id=106071802rel=url2html-4549http://www .tpc.org/tpcc/results/tpcc_result_detail.asp?id=10 6071802>

    4. Re:Working with the hardware directly by Anonymous Coward · · Score: 1, Informative

      They are having major problems with flaky frequency shifting. The processor uses an on-silicon ammeter to control its internal clock frequency. So if you are running code that would cause its power usage to go above the rated wattage it would secretly slow the frequency to prevent that from happening. This analog method is a crapy method of power control. It requires almost no RTL and architechture changes, but tends to be falky, limits performance in unknown ways, and causes poor silicon yield. I have a feeling some grad student is going to bust them down the road by doing a detailed performance analysis and finding that while Montecito does well on SPEC, it is not too difficult to generate code that will make the chip perform very poorly since the frequency limiter kicked in.

  5. What's that in football fields? by sco08y · · Score: 1

    They double the performance and lower energy requirements, improving performance per watt by 2.5 times compared to existing, single-core versions.

    So does this mean that performance is 250% of the original or 350%?

    1. Re:What's that in football fields? by codemaster2b · · Score: 1

      The performance is 2.5 times greater per watt, or an increase of efficiency by 250%, or is 350% as efficient as it used to be, or it used to be only 40% as effiencient as it is now.

      Does that answer your question?

      --
      And over there we have the labyrinth guards. One always lies, one always tells the truth, and one stabs people who ask t
  6. Itanic, the theme song, sung by Paul Ottelini by arivanov · · Score: 1
    Near, far, wherever you are
    I believe that EPIC does go on
    Once more, you opened the door
    And you're here in my heart,
    And my heart will go on and on.

    To the tune of 20 billion down the drain until it smashed into a simple AMD hack of the x86 instruction set. Yeah... It goes on... Nothing outlasts the Energiser...

    --
    Baker's Law: Misery no longer loves company. Nowadays it insists on it
    http://www.sigsegv.cx/
  7. Sell off Itanium by FadedTimes · · Score: 1

    Maybe Intel needs to sell off their Itanium division like they did with the ARM division. Concentrate on the area the are best in. Hopefully they learned some lessons from Itanium / IA64; like they did with P4 / Netburst.

    1. Re:Sell off Itanium by GnuPooh · · Score: 2, Interesting

      Intel didn't sell off an ARM division. ARM has always been a separate company. They BOUGHT the StrongARM part of Digital/Compaq years ago and renamed it XScale and still have it.

      Secondly, the current Pentium-M/Core processors are decended from Pentium-III and Pentium Pro, so the thing they learned from P4 was that it was a dead-end architecture. I'm sure there are some elements here and there they "backported" to Pentium-III to make the current Dual Core 2, but you statement is not really accurate there either.

    2. Re:Sell off Itanium by TheRaven64 · · Score: 1
      They BOUGHT the StrongARM part of Digital/Compaq years ago and renamed it XScale and still have it

      I take it you missed this?

      --
      I am TheRaven on Soylent News
  8. Triple the cache by Ant+P. · · Score: 1

    Didn't the old Itanium have something like 9MB of L2? Wow.

    1. Re:Triple the cache by incabulos · · Score: 1

      What amazes me is the 1.7 _billion_ transistor count! Thats over 1000 80486 cpus worth of transistor, or over 500 pentiums. The power ( flops, mips ) of the Itanic is pretty poor by comparison. Hell, even a single 100 MHz 486 DX4 is still decent enough in this day and age to do useful work.

    2. Re:Triple the cache by PetiePooo · · Score: 1

      What amazes me is the 1.7 _billion_ transistor count!

      I'm sure the majority of that is the cache. I don't have figures, but if you want an example, check out the transister count of a large DRAM chip. Sounds impressive, 'til you realize that very little of it is logic.

    3. Re:Triple the cache by Anonymous Coward · · Score: 0

      Yep. 3MB, 6MB, 9MB, even up to 24MB on the new ones.

    4. Re:Triple the cache by tomstdenis · · Score: 2, Informative

      24MiB of 6T SRAM is 1,207,959,552 transistors just for the cells. Not including tags, address decoders, etc...

      By comparison, an Opteron uses 113,246,208 transistors for the 2048+256KB of cache [assuming they use a 6T structure which I don't know for a fact since I'm not privy to the details and technically I couldn't say even if I were, so don't assume what I said is verbatim fact, yada, I hate disclaimers] and the 4MB Duo (total of 4096+128KB of cache) needs 207,618,048 transistors for its cache.

      Tom

      --
      Someday, I'll have a real sig.
  9. Bumber for Intel by sgt+scrub · · Score: 2, Interesting

    I buy machines for our 100M product: 4 way opteron 870's with 8G Ram (2/proc) 2x73G Cheetas. Retail the box would cost about $7000. If one processor cost me $3500 life would suck. I can't see this processor doing the work of 4 duel core opterons.

    --
    Having to work for a living is the root of all evil.
    1. Re:Bumber for Intel by Anonymous Coward · · Score: 0
      dual means 2

      duel is a fight

      Are they fighting??

    2. Re:Bumber for Intel by Anonymous Coward · · Score: 0

      Yes, but just try getting your dual-Opteron to run an enterprise-level database application like a billing system. You're comparing apples and oranges - Opteron is great for it's target market, but it's not the same marketplace. Itanium is aimed a lot higher, against chips like the Power5+, in machines with the capability to scale way beyond just dual-CPUs. Against Opteron you would usually line up a Xeon-based alternative, or moving forward that would be one of the new AMD-whipping Conroe-cored CPUs....

    3. Re:Bumber for Intel by LWATCDR · · Score: 1

      "Yes, but just try getting your dual-Opteron to run an enterprise-level database application like a billing system."
      I think you might be surpised how well a dual Opteron would run a large database system.
      You might want to take a look at one of these. http://www.sun.com/servers/x64/x4600/ It can be expanded to 16 cpus.
      Oh and so far the Conroe CPUs seem to be beating Athlons but the Opteron is a different critter. It still has an advantage in IO thanks to it's four hypertransport channels.

      --
      See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
    4. Re:Bumber for Intel by Nakarti · · Score: 1

      Oh I hate to be a grammar nazi, but...

      If you were AMD's spokesman I would be cheering for Intel by now.
      You can't spell bummer in the subject; Cheetah, the product name; dual for fecks sake; and you capitalize ram, a common word, instead of Opteron, a trademarked word.

      If this is what it is to be an AMD fanboy, count me out. I'd rather be literate.

    5. Re:Bumber for Intel by sgt+scrub · · Score: 1

      but I AM illiterate!

      --
      Having to work for a living is the root of all evil.
  10. Price by jefu · · Score: 3, Interesting
    It seems quite likely that if Intel were to push the Itanium more and get it used in more systems that the demand would rise and the price would therefore drop. Of course, this might require a lot of demand, but if the performance is good enough for the price, the demand will be there.

    It might be worthwhile for Intel to find a way to drop the price enough to put these things into more places. Even give them away to visible web installations (like slashdot, fer'nstance). Get a bit more market penetration, convince some vocal people that its a good buy and it will start to take off. (I'd be glad to take an Itanium system for free for web service - even though my primary web presence is anything but big. Even better a couple of them to let my students use for compute bound projects.)

    1. Re:Price by tomstdenis · · Score: 2, Interesting

      Even with a sufficient market, right now that chip is going to be expensive. It's huge and the yield can't be that high.

      But that's kinda the problem. The cost and the fact that's its a very niche processor will never make it as common place as Xeon or Opteron processors. Specially when both Intel and AMD are pushing SSE you can get a lot of the parallel SIMD like benefits of IA64 on x86.

      Tom

      --
      Someday, I'll have a real sig.
    2. Re:Price by boner · · Score: 3, Interesting

      The Itanium has horrible price/performance for web-serving... the ad-hoc nature of web-serving does not do the architecture justice. The new dual-core x64 Conroe offering would be a much better choice.

      Has anybody noticed that the Conroe effectively kills Itanium for most workloads?

    3. Re:Price by nkpatel · · Score: 2, Interesting

      While x64 may be better-suited for web serving, part of the reason x64 seems to be better than ia64 on most other workloads is that you may not have the OS and software optimized for the ia64 architecture - you're still writing and compiling in a x86-centric view and I don't think compiler technologies have evolved to the point where they can fully utilize ia64 ILP.

    4. Re:Price by blugu64 · · Score: 1

      "demand would rise and the price would therefore drop"

      ya know that when demand rises, price rises as well. Now if they supplied more, or the public demand less then price would fall.

      --
      "Personal ownership is a hallmark of conservative capitalism. And I don't believe I am entitled to anything that I did n
    5. Re:Price by LWATCDR · · Score: 2, Informative

      "While x64 may be better-suited for web serving, part of the reason x64 seems to be better than ia64 on most other workloads is that you may not have the OS and software optimized for the ia64 architecture"
      Or it could be the ia64 really isn't well suited to web serving.
      The ia64 has very good double precision floating point. And seems to be very good at minimal branching / very predictable branching code. In other words not web serving. The ia64 seems to have the same problem that they had with the i860 in fact if you read the wikipedia entry for it it sounds like the exact same chip!
      "One fairly unusual feature of the i860 was that the pipelines into the functional units were program-accessible, requiring the compilers to carefully order instructions in the object code to keep the pipelines filled."
      and this seems to be even more on the mark "As a result of its architecture, the i860 could run certain graphics and floating point algorithms with exceptionally high speed, but its performance in general-purpose applications suffered and it was difficult to program efficiently (see below)."

      My goodness it is hard to believe that Intel would go down this road again. I guess if at first you don't succeed try, try, again.

      --
      See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
    6. Re:Price by tomstdenis · · Score: 1

      As another poster mentioned the IA64 is not a general purpose processor with decent pipeline control, register renaming, branch prediction, etc.

      This means your typical random access applications [e.g. desktop] do not behave as well.

      IA64 is really meant for the case where you can handtune that 5% of the code that takes 95% of the time in assembler. Who cares if your GUI is inefficient if you're pulling 4x the FLOPS as an Opteron in your HPC application? However, for a web server where you are doing less simd and more branching, loops, small ops, the Opteron arch [well and Conroe] helps.

      It's like the traditional Transmeta vs. Intel/AMD. Intel and AMD spend a lot of hardware on runtime optimizations [e.g. prefetching, branch prediction, CISC => RISC decoding, register renaming, etc] so that it can cope with the current runtime situation [e.g. workload] whereas Transmeta optimized your task more statically [with less forethought to branches, cache, etc]. The net result was you could get some performance from Transmeta but the MIPS/MHz just wasn't there. Similarly IA64 *can* really fly if you hand tune your application AND it can take advantage of the architecture.

      It isn't just a matter of having a good compiler [which admitedly GCC isn't there] but having a good problem.

      Tom

      --
      Someday, I'll have a real sig.
    7. Re:Price by fatphil · · Score: 1

      They're unrelated. It's just another VLIW. Same as every other VLIW.
      In the same way that all RISCs are the same.

      FatPhil

      --
      Also FatPhil on SoylentNews, id 863
    8. Re:Price by twitchingbug · · Score: 1

      Well that was the great white hope - compiler technology evolving. It hasn't happened, and I doubt it will in the near future, at least in it's current form. The problem is that most code that we run today is inherently non parallelizable. do 5 ops on dependent data, branch, repeat. There's nothing you can parallelize there. It'll take a major paradigm shift away from serial code for us to take advantage of ia64.

      on the flip size, some media app, that has a small kernel of code that loops, is perfect for IA64, but again, compilers are dumb, and if it's an important loop, you should hand optimize the assembly anyways.

      -don

    9. Re:Price by LWATCDR · · Score: 1

      I know the chips are unrelated but the issues they have seem to be identical.
      You may have a point in that VLIW just isn't practical for general computing tasks.

      --
      See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
  11. Itanium is the Vista of Silicon by Baldrson · · Score: 1
    The same thing has happened to Microsoft with Windows Vista that has happened to the Itanium project:

    Whenever you set out to reengineer a foundation, and you throw money at it in the form of more engineers, you are asking for long delays if not outright failure.

    1. Re:Itanium is the Vista of Silicon by powerlord · · Score: 1

      Yes, but the difference is that MS holds a court verified monopoly (on desktop OS installations).

      Once Vista ships from MS (ready or not, here it comes), it WILL ship on consumer boxes (want it or not, here it comes).

      So that within a year or so after launch, most new PCs purchased will come with Vista pre-installed, especially after they stop selling XP.

      If Intel decided to stop shipping all chips but the Itanium, most PC makers would laugh, and probably abandon them to use AMD.

      --
      This space for rent. All reasonable inquiries will be entertained at proprietors discretion.
    2. Re:Itanium is the Vista of Silicon by Anonymous Coward · · Score: 0

      Wow, imagine Vista running on an Itanium...

  12. Buy one now! by dwalsh · · Score: 1

    These will be very rare collectors items in a few years.

    --
    ${YEAR+1} is going to be the year of Linux on the desktop!
  13. Re:The ship has sailed- DOA by Anonymous Coward · · Score: 0

    Exactly. Montecito is DOA; Over a year late and no where close to its performance goals. This article is just PR fluff.
    LOL at the 10 billion in monopoly money Intel got other companies to commit to Itantic.

  14. Itanium - A Long Term Plan by logicnazi · · Score: 1

    It's interesting the way so many people on slashdot are willing to disregard or even diss intel's Itanium strategy. Admittedly intel mistimed the itanium introduction, trying to move the processor world in a whole new direction just as AMD started producing some good chips was a mistake. Though to be fair to intel it isn't clear they could have foreseen this when they made the decisions. However, just because Itanium isn't going to run the next version of WoW or Quake doesn't mean that it is dead or a bad idea.

    Itanium's big advantage is that it is simply a better ISA. Anyone who has done x86 assembly knows it is clunky and old while the IA64 architecture (except for the x86 emulation part) is well designed. Trying to improve x86 single threaded performance is ultimately a losing battle. Exacting a little bit more parallelism out of x86 instructions requires more and more transistors for smaller and smaller gains.

    In the long run if you need more single thread performance IA64 or similar strategies are the way you have to go. Sure it is taking a long time to work out the bugs in the compiler technology but that just means intel has that much of a leg up when x86 single thread performance hits a brick wall. Also the work intel does on the itanium produces technology and insight that can be transfered to their other processor lines.

    Remember intel needs to keep the big picture in mind and can't just look to the next 2 years. Rather than an indication of error I think the long time it is taking to iron out the IA64 technology is proof of foresight rather than an indication of a bungle.

    --

    If you liked this thought maybe you would find my blog nice too:

    1. Re:Itanium - A Long Term Plan by GWBasic · · Score: 1
      I've been under the impression that Itanium is really intended for "super-computer" type applications, as opposed to traditional desktop and web serving. For example, Itanium is used to search for oil and model areodynamics.

      Comparing Itanium to x86 is like comparing an oil tanker to a personal sailboat, or like comparing apples to oranges. x86 isn't meant for everything.

  15. "Long Term" is an understatement by raftpeople · · Score: 1

    You remember when development of this chip was announced right?

    1994. I remember the day I read the article in computerworld about the HP and Intel alliance and the new CPU, sounded interesting.

    12 years later they are shipping about 40,000 anually.

    This thing is simply not going to take off.

  16. So there's hope for Itanium by Baldrson · · Score: 1
    Since Itanium's team doesn't have a monopoly, as you point out, it may be they'll have sufficient motivation to make it work correctly enough to compete.

    We may yet see an Itanium that competes but I'm not holding my breath for Vista.

  17. Itanium - a niche product by Namarrgon · · Score: 1

    Admittedly intel mistimed the itanium introduction

    Mistimed? Intel had been working fruitlessly on it for 8-10 years when AMD finally realised there was an opportunity to pull a Microsoft on them (i.e. consumers like compatibility more than new tech).

    Itanium's big advantage is that it is simply a better ISA.

    Debatable. It really depends on the application. Itanium does have a nicely designed, regular ISA that is awesome for serious number-crunching, but the VLIW approach really isn't optimal for general-purpose, non-parallel code. It's wasteful, and hard to optimise for. x86 is clunky, inconsistent, badly lacking in registers and a complete pain to code and decode, but it is at least quite compact (fits into small caches). x86-64 improves on this (a little) by adding more registers. PowerPC code is probably better than either, for most jobs.

    I think the long time it is taking to iron out the IA64 technology is proof of foresight rather than an indication of a bungle.

    I don't think it's a question of "ironing out" the technology any more. They've been working on improving the compilers for years now, and it performs really well - for certain jobs, like SPEC ratings and HPC apps. But it's not going to get much better than it is for what most of the world wants. It's a niche product, the market knows that, and it ain't moving out of that niche any time soon.

    --
    Why would anyone engrave "Elbereth"?