Slashdot Mirror


Intel Reveals Itanium 2 Glitch

NeoChichiri writes "News.com is running on an article about glitches in Intel's Itanium 2 chips. Even though it doesn't affect all chips, they have still stopped shipments of the new 450 Servers until the problem is resolved. Apparently it has to be 'a specific set of operations in a specific sequence with specific data.' Intel is saying that affects the 900MHz and 1 GHz Itanium 2 chips and that it will not affect the upcoming 1.5 GHz Itanium 2 6M chips." Until the next iteration of chip arrives though, Oliver Wendell Jones writes, "they recommend working around the problem by underclocking the processor to run at 800 MHz instead of its default 900 MHz or 1 GHz."

249 comments

  1. he he he ;-) by Anonymous Coward · · Score: 1, Funny

    Well done intel ;-)
    Underclocking too...

  2. Aptly named... by hobbesmaster · · Score: 4, Funny

    The Itanic 2 appears to be going down like the first...

    1. Re:Aptly named... by PD · · Score: 2, Funny

      You mean it comes with a little band that plays "Nearer my God to Thee" when the server crashes? Cool!

    2. Re:Aptly named... by Anonymous Coward · · Score: 1, Funny

      imagine having to underclock a beowulf cluster of those... argh.

  3. Itanium "chops"? by Anonymous Coward · · Score: 0

    Intel is now selling cuts of meat?!?!? WTF? Does the grasping greediness of this corporation know no end?

  4. chops? by drfuchs · · Score: 0

    Their chops are busted?

  5. underclocking by Anonymous Coward · · Score: 0

    underclocking the processor to run at 800 MHz

    That'll make the chip more stable anyways. Always underclock your mission critical servers!

    1. Re:underclocking by kupo+zero · · Score: 1

      Yes yes, but some people still feel the need to run it fast, no matter how important it is. Technical machoism, I suppose.

  6. Glitch? by Ramjet350 · · Score: 4, Interesting

    Is it a glitch or did they sell chips that can't run at the rated speed?

    1. Re:Glitch? by PetWolverine · · Score: 1, Redundant

      Good question. And I'd like the moderator who called it redundant to show me a link to the previous post that said the same thing, because I can't seem to find it.

      --
      I found the meaning of life the other day, but I had write-only access.
    2. Re:Glitch? by dalamcd · · Score: 1
      (Score:1, Redundant)

      I wish you could meta-mod as 'funny'.

      dalamcd

      --
      moer liek CELtroid prime!!@1!
  7. are they overclocking the chips? by golan · · Score: 0, Troll

    Looks like they are overclocking their own chips. Maybe they should sell them with a bigger fan on them... ;-)

    1. Re:are they overclocking the chips? by mlk · · Score: 1

      Eap, soon the fans will be bigger than the air con!

      --
      Wow, I should not post when knackered.
  8. Pork or Lamb? by iworm · · Score: 0, Redundant

    Itanium 2 chops?... ?

    Methinks dear Timothy hasn't really grapsed what "editing" really means.

    1. Re:Pork or Lamb? by iworm · · Score: 1

      grapsed? I'm a Slashdot editor already!

    2. Re:Pork or Lamb? by Anonymous Coward · · Score: 0

      Oh the irony.

  9. Mmmm by thebatlab · · Score: 4, Funny

    *in Homer Simpsons voice* 'mmmmmm.....Itanium 2 chops.......glazhzhzhz'

  10. Underclocking..? by fadeaway · · Score: 3, Funny

    Bye-bye fans and thermal paste, hello heaters and insulation!

  11. Microcode? by chill · · Score: 4, Interesting

    Is this something that could be addressed by a microcode update? I've always wondered about exactly what can be done with the Kernel support for microcode updates.

    On a side note -- who exactly didn't expect something like this? Intel has a history of this sort of thing -- from the 80486DX not being able to add properly, and IBM having to halt shipments of PS/2 machines; to the Pentium F00F bug and others. Buying first run Intel chips is like playing dice with your business. Give them a few production runs to work out the bugs...

    --
    Learning HOW to think is more important than learning WHAT to think.
    1. Re:Microcode? by Anonymous Coward · · Score: 2, Informative

      It's not just Intel. How about Motorola leaving out critical instructions in the PPC603 and crippling every machine with one compared to the PPC601? or the G3 floating point debacle where excel spreadsheets would show up errors consistently. What about AMD and their first run overheating problems? running hot is one thing, burning up even with adequate cooling is another.

      Best option is not to restrict yourself to certain "runs" but to just see the performance of a run yourself. The aforementioned PPC601 was a good example. A fantastic CPU in its first incarnation

    2. Re:Microcode? by WndrBr3d · · Score: 3, Insightful

      I have to agree with your side note. I make the technology decisions here at my company and have a strict belief when it comes to upgrading. Microsoft OS's I refuse to deploy on our systems until SP1 is released (because we all know its coming sooner or later). We just last month upgraded to Windows XP.

      I suppose the same argument can be applied to everything in life. Cars, Televisions, DVD players.. you name it. You just need to get a feel for how things age before you invest in them for long term.

    3. Re:Microcode? by Cyno · · Score: 1

      Of course that could be said about anything. I own several AMD systems. The first ones I bought, when the Athlon was initially released, had many problems. The motherboards would only work properly with certain types of video cards, etc. After a couple generations these systems are far more stable and trustworthy.

      I believe all of this is related to our greedy decisions to release products prematurely. Afterall we're only making these products so we can make money. I doubt a single exec at Intel cares about the low level details of their chips.

    4. Re:Microcode? by Cyno · · Score: 1

      QA is what prevents this sort of thing from happening. And in my experience QA should be the buffer between your scientists, engineers and your customers. But most companies don't want to pay for the proper QA and most don't even want to pay for the proper R&D. They just want to sell their products, even if the name is the only thing carrying them.

    5. Re:Microcode? by questionlp · · Score: 0, Redundant
      Buying some Sun UltraSPARC II processors with 8MB eCache is also like playing dice with your business if the processor was equipped with faulty cache chips (or chips that would flake out in certain environments). I believe Sun also had an issue with their UltraSPARC III processor where you could apply a microcode update or a patch which effectively knocked the performance of the chip off by a bit.

      At least Intel has accepted that it's a real issue rather than a customer problem and providing a workaround and a fix to the processors. Anyway, the Itanium 2 is not a first-run chip, rather a re-work of the original Itanium (though a far amount of the design came from HP which was then tuned and produced by Intel).

    6. Re:Microcode? by AxelTorvalds · · Score: 1

      The 601 had extra instructions for compatibility with POWER. It was part of the architecture to remove some instructions, planned and agreed to by Apple, IBM and Motorola before the 601 was made.

    7. Re:Microcode? by Anonymous Coward · · Score: 0

      I think the real question is why did your company choose to go to XP anyway? Why not go to 2k? Do you need any of the new "features" XP offers?

    8. Re:Microcode? by EpsCylonB · · Score: 1

      My creative nomad muvo had a hardware problem where it wouldn't work with my nice new shiny NForce2 Mobo, and five months later creative released a firmware update that solved the problem.

  12. Doesn't affect all chips... by gilesjuk · · Score: 4, Funny

    Perhaps they should put some silver stuff over the serial number. Welcome to the Intel Itanium scratchcard lotto, those with bad chips win a new one :)

    1. Re:Doesn't affect all chips... by buffer-overflowed · · Score: 2, Funny

      Yes... except when you scratch it off you end up winning a free mountain dew and a useless server.

      --
      The key to the enjoyment of pop music is to replace any instance of "love" with "C.H.U.D."
    2. Re:Doesn't affect all chips... by Anonymous Coward · · Score: 1, Funny

      Actully, only 1 in 6 win free Mountain Dew...

  13. In all seriousness... by powerlinekid · · Score: 2, Funny

    What, this is going to affect all 6 people that own this chip?

    --

    can't sleep slashdot will eat me
    1. Re:In all seriousness... by Anonymous Coward · · Score: 1, Funny

      No, only 2 of them. The issue does not affect all of the chips.

    2. Re:In all seriousness... by evilviper · · Score: 1

      The point is that this is the stuff they want consumers and companies to buy... Here ya go, buy our great 64bit chip, that has so many problems, it must be made by Intel...

      I've heard almost nothing about AMD's 64bit chip, and I'd still rather buy *it* than Intel's offerings.

      Actually, I'd still much rather have a new Alpha system. Excuse me while I drool. I guess Intel can't even compare to the chip they based their own new chip on.

      --
      Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
  14. Deja Vu by Jason1729 · · Score: 2, Interesting

    Apparently it has to be 'a specific set of operations in a specific sequence with specific data.

    This sounds similar to the way they described the floating point divide error in the original pentium. How long until they start giving odds on the chances of someone seeing the problem in normal use.

    Jason
    ProfQuotes

    1. Re:Deja Vu by b!arg · · Score: 1

      I don't know how long, but let's hope they don't use one of these processors to figure it out.

      --

      Everybody dies frustrated and sad and that is beautiful
    2. Re:Deja Vu by Michael_Burton · · Score: 5, Funny

      The problem is a sequence of 1s and 0s. Avoid those two numbers, and you'll be fine.

      --
      When all you have is an axe, everything looks like a grindstone.
    3. Re:Deja Vu by Anonymous Coward · · Score: 0

      But trying to force some 2's into the CPU bring up a whole new set of problems.

    4. Re:Deja Vu by TheRaven64 · · Score: 1
      Apparently it has to be 'a specific set of operations in a specific sequence with specific data.'

      I seem to recall that they almost lost the company the last time they made a mistake like this with a popular CPU.

      Oh, sorry, did you say the Itanium? Well, I suppose they'll probably just send replacements out to both customers then.

      --
      I am TheRaven on Soylent News
    5. Re:Deja Vu by Bombcar · · Score: 0, Troll

      Apparently it has to be 'a specific set of operations in a specific sequence with specific data.

      The operation:

      mov eeax, VALUE

      The sequence:

      1. mov eeax, VALUE

      The specific data:

      VALUE > 2^32+1

  15. Re:Heh by Anonymous Coward · · Score: 0

    You deserve to be beaten like a red-headed step-child.

  16. good time for an ad by technoid_ · · Score: 1

    When I clicked to read more of this story I got an Intel ad at the bottom of the story. Gee, what great timing...

    --
    Two wrongs don't make a right, but 3 lefts do - Lew of GO magazine
  17. another chip problem. by overbom · · Score: 3, Funny

    whenever they come out with a new design, they tend to have all sorts of f00fy little problems with it.

  18. could this be it? by mrpuffypants · · Score: 0

    a specific set of operations in a specific sequence with specific data.

    hrm....

    $crash = 1;

    1. Re:could this be it? by kiolbasa · · Score: 1

      $crash = 0xF00F;

      Of course.

      --

      Beer wants to be free
  19. Underclock? by phorm · · Score: 3, Interesting

    "they recommend working around the problem by underclocking the processor to run at 800 MHz instead of it's default 900 MHz or 1 GHz."

    Why not just buy the lower-clocked CPU's then? Will Intel replace the crap chips when a revision with a fix comes around?
    "If the customer feels it's the right solution, we'll exchange processors with ones that aren't affected," she said. Intel has developed a simple software test that can determine whether a chip is affected. Meaning what? Lower-end chips that aren't aaffected, or a fixed version of the same chip. If it's the same chip, who wouldn't think it is the right solution? The article doesn't indicate whether the problem is actually solved either, but that it seems to be somewhat of an anomaly that doesn't affect all chips.

    Not a good day for Intel, and probably another reason why you don't immediately need that "Newest on the shelf" CPU, whether for your home machine or a server. Besides, by the time this chip is assuredly fixed, a faster revision will probably be out at a comparable price.

    1. Re:Underclock? by Dr+Caleb · · Score: 4, Funny
      Why not just buy the lower-clocked CPU's then?

      Your geek membership has been revoked. Hand in your pocket protector at the door. OutOutOut!

      --
      "History doesn't repeat itself, but it does rhyme." Mark Twain
    2. Re:Underclock? by Anonymous Coward · · Score: 0

      "Why not just buy the lower-clocked CPU's then? Will Intel replace the crap chips when a revision with a fix comes around?"

      The ONLY versions of the Itanium 2 are the 900MHz and 1GHz chips. There are no lower clocked versions to buy.

    3. Re:Underclock? by mschoolbus · · Score: 1

      Why not just buy the lower-clocked CPU's then?

      I'm pretty sure you already are...

  20. Ummm, microcode is burned in, dude. by Anonymous Coward · · Score: 0

    It's literally etched in metal, kinda hard to change it at this point.

    1. Re:Ummm, microcode is burned in, dude. by Abcd1234 · · Score: 1

      Umm, no it's not, dude. :) A lot of processors (including, quite possibly, the CPU you're using right now) allow microcode updates. See here for information about the Linux microcode update driver for IA32. Now, I'm not sure how much can be updated, but it's clearly not completely "burned in", which returns us to the question the grandparent poster originally posited.

    2. Re:Ummm, microcode is burned in, dude. by Anonymous Coward · · Score: 0

      I'm fairly certain the microcode update feature is intended for boot-time overlay; that is, the chip must be able to boot stably/reliably up until the point the fresh code can be loaded.

      As an example, I've an Intellistation M Pro (6889-63U, for those who care) - an old duallie BX box - that can and does carry an update payload in its BIOS, which in turn can be toggled on/off like any other BIOS feature.

      I'd assume that Intel's engineers are well-aware of their products' updatability, and would push the fix as a BIOS update from their vendor partners if they could.

      But they couldn't, and thus it's Yet Another Intel Recall.

      It's rather surprising how good AMD's track record has been in comparison- they do have their share of errata, and publish them, but I'm not aware of any 'showstoppers' on the order of Intel's (F00F, FDIV, etc) since the acquisition of NexGen and the dawn of the K5 era. (Whether some sort of HLT-related bug is what keeps the d**n things so warm is another matter...?) That's a pretty impressive track record.

      Anyone know if IDT/Centaur/Via have had any entertaining bugs slip through post-Cyrix?

  21. Re: Truth is - those cooler fans will ... by Anonymous Coward · · Score: 0

    take your CPU down anyways.

    3rd time right here

  22. Alternative to underclocking by ethnocidal · · Score: 5, Informative

    Underclocking is typically necessary if a part needs more voltage than is allowed for with the default configuration. This is why when you overclock, the converse is generally required; you can get better overclocks by increasing voltage.

    Obviously, Intel are not going to encourage people to increase the voltage of their processors in order to run them at the default speeds, as this can run the risk of thermal damage to the chip with insufficient cooling, or overly high voltages. It may however still represent an option for system administrators who are keen to retain the performance of the chip.

    1. Re:Alternative to underclocking by Anonymous Coward · · Score: 0

      But the hardware problem is possibly a circuit speedpath issue, and underclocking is the only thing that's going to solve that short of a new chip.

    2. Re:Alternative to underclocking by ratfynk · · Score: 1

      Better still sell the IA64 chips to porn and spammers servers at a discount.

      --
      OH THE SHAME I fell off the wagon and use sigs again!
    3. Re:Alternative to underclocking by svirre · · Score: 1

      This assumes you do not exceed the technologys safe maximum voltage. High voltage in CMOS (and related) processes have other undesireable effects than just thermal problems.

      Most significantly the drain-source field may become sufficiently strong to open the channel regardless of the gate voltage. When this happens the device will fail regardless of temperature.

  23. I'm actually pretty impressed by Photar · · Score: 4, Interesting

    When you consider all the bugs that come through in higher level programming where everything is object oriented and human readable, it really comes as a surprise that you don't see more bugs in hardware considering the complexity of the problem and low level nature.

    --
    He who knows not and knows he knows not is a wise man. He who knows not and knows not he knows not is a fool.
    1. Re:I'm actually pretty impressed by kindofblue · · Score: 3, Insightful

      I readily agree. as a software person, it boggles my mind that the hardware doesn't fail hourly. If Microsoft (or Oracle or any software company) were held to the same standards as Intel/AMD, etc, they wouldn't exist. Intel and CPU companies have been work against the immutable laws of physics, whereas software companies only have to manage their own incompetence and beat back their business departments, IMO.

    2. Re:I'm actually pretty impressed by Photar · · Score: 1

      There's got to be an "If Microsoft Designed Hardware" joke out there somewhere.

      --
      He who knows not and knows he knows not is a wise man. He who knows not and knows not he knows not is a fool.
    3. Re:I'm actually pretty impressed by Anonymous Coward · · Score: 0

      two words: formal methods.

      Google that, and you'll see what's really
      impressive.

    4. Re:I'm actually pretty impressed by Anonymous Coward · · Score: 0

      If Microsoft (or Oracle or any software company) were held to the same standards as Intel/AMD I guess your wrong on that one, since Microsoft seems to still be here.

    5. Re:I'm actually pretty impressed by AxelTorvalds · · Score: 4, Interesting
      All do respect, but I know how they make chips. They use software to do it and that's why they are so reliable, a human doesn't put each gate in to place. It's also designed with test in mind and there are whole industries and standards surrounding that. Try to name something remotely close to a JTAG interface for software. I believe it's more reliable than software but that's really becuase once you etch a piece of silicon it's pretty damn hard to fix it. Don't get me wrong though, I trust the chip a lot more than the software in most cases, I expect a compiler bug long before I expect to have stumbled on to the magic code stream that doesn't compute correctly and I expect my own errors before that.

      This kind of bug is a little different though, we're not talking about a stuck gate that only gets tickled during a single ALU operation or retiring an instruction too early or bigfooting a register too early or anything like that. We're talking about clocking issues and fundamental timing issues in Intel's "server grade" platform. There are accepted standards and practices for how aggressive to be, some vendors can tell you with amazing detail how reliable their chips are, in what conditions, etc.. With clocks in particular some vendors can be picky, I've seen hard hitters scope up boxes and refuse to support hardware they sold because it was clocked out of spec (think about the edge of a clock and clock quality.. a 1.2 Ghz clock isn't enough, it has to actually achieve the level of the clock before it switches back and it takes time for the clock to transition..) it sounds like Intel is either ignoring them or trying to write their own book or the IA64 is a bigger disaster than any one there wants to even hint at. There are a fairly limited class of errors where underclocking the chip fixes the problem and most of those errors are related to the chip being aggressively clocked to begin with. It's ironic, on IBM's POWER4 line of processors they added extra cache room for parity (at the expense of potential performance) and made the leads more beefy (again at the expense of higher clock speeds) because the platform is a server platform that places reliability at a premium. It sounds like Intel has been making PC chips too long and isn't ready for server grade chips.

      Their party line has been that they will keep working at it until it's ready, they aren't expecting it to move a lot of chips, etc. etc.. Right now they have walked down a road where they have invested billions? (at least hundreds of millions) in an unproven technology. They have crossed the line to the point that there won't be $1500 IA64 products for years and years. They have piped it as a server grade platform. And it underachieves in every area and has't taken the world by storm nearly as much as they said. So bad is it that HP, their blood brother in that mess has continued the PA-RISC and Alpha lines past the point they claimed when they originally adopted the IA64. The only reason I could imagine them to aggressively clock it like that have would be because that's the only way to make it perform remotely like they have claimed it would. I'm not going to guess about Intel's dirty laundry but I'd guess the stakes are little higher than it would look on the surface for the IA64, either that or there are some incompetants running the show.

    6. Re:I'm actually pretty impressed by Anonymous Coward · · Score: 0

      all DUE respect.

      as in, all the respect that is DUE.

      It always scares me when I see mistakes like this, that are due (note the ue) to assumptions of spelling based on an American accent.

    7. Re:I'm actually pretty impressed by AxelTorvalds · · Score: 1
      You got me.

      I really meant that all the people DO respect Intel.. ;-)

    8. Re:I'm actually pretty impressed by Anonymous Coward · · Score: 0

      You REALLY don't know what you're talking about. I love the slashdot crowd, so many answers, so little knowledge.

    9. Re:I'm actually pretty impressed by Bert64 · · Score: 1

      This is because theres no comeback with buggy software, people EXPECT buggy software, and they will buy it anyway and just wait for the patch to come out... So theres no financial reason for a business to spend extra money thoroughly debugging their code..
      However with hardware, you may have to replace thousands of items.. all of which cost money, whereas the cost of distributing a patch is negligible, especially since third parties will often mirror it for you and reduce your bandwidth costs.

      --
      http://spamdecoy.net - free throwaway anonymous email - avoid spam!
  24. remember the 386 multiply bug? by Anonymous Coward · · Score: 0

    Sounds familiar.. crunching certain operands generated flawed results in 32x32 multiplies. Was some interference on chip, layout issue. Double sigma comes to mind. Since this bug can be fixed by lowering the speed it's probably a similar issue?

  25. I wonder if that's what caused this crash by kiwimate · · Score: 1

    Intel disclosed an electrical problem Monday that can cause computers using its flagship Itanium 2 processor to behave erratically or crash.

    Hmmm...wonder if BMW is using these chips?

    1. Re:I wonder if that's what caused this crash by damiam · · Score: 1

      They would have to be out of their minds to use an Itanium in a car. More likely it's a Crusoe, StrongArm, or something similar.

      --
      It's hard to be religious when certain people are never incinerated by bolts of lightning.
    2. Re:I wonder if that's what caused this crash by kiwimate · · Score: 1

      Yeah, I know...I wouldn't even have thought of posting this comment, except as rebellion against the /. editors.

      2003-05-12 15:26:10 Man trapped in car after computer fails (articles,news) (rejected)

      (Okay, so I'm petty. At least I got it in despite the editors rejecting it.)

    3. Re:I wonder if that's what caused this crash by bobbozzo · · Score: 2, Informative

      If it was a 7-series, its has an "I-Drive" computer, which runs Windows CE.

      --
      Nothing to see here; Move along.
    4. Re:I wonder if that's what caused this crash by ElGanzoLoco · · Score: 1

      I've already read quite a lot of stories about this 7-series acting erraticaly (including dangerous "behaviours", assuming a car has a "behaviour"). Is Windows CE really to blame here? (I'd love it: one more reason to bash Microsoft, yeah!)

      --
      Hello! I'm a disaster waiting to happen!
    5. Re:I wonder if that's what caused this crash by bobbozzo · · Score: 1

      Windows probably is to blame, although it has also recieved a lot of criticism for only having one knob to control everything...
      makes it hard if you need to turn down the radio while you're messing with the A/C, as I understand it.
      Also, it means you can't replace the stereo with an aftermarket one, ...

      --
      Nothing to see here; Move along.
    6. Re:I wonder if that's what caused this crash by Anonymous Coward · · Score: 0

      Do you know how hard it is to get a story posted that doesn't bash microsoft, intel or pimp linux if it hasn't already been posted?

    7. Re:I wonder if that's what caused this crash by keller · · Score: 1

      A chip that behaves erotically, that is why the price tag is so high, only big businesse can handle the excessive speed at which the pr0n is processed!

      --

      Enig? Det alt for hot det smor!

  26. Bugs... by FroMan · · Score: 1

    "Intel is saying that affects the 900MHz and 1 GHz Itanium 2 chops and that it will not affect the upcoming 1.5 GHz Itanium 2 6M chips."

    In other news Slashdot is returning their newest server to Intel for a replacement. It appears that the error in in the on chip string handling routines.

    --
    Norris/Palin 2012
    Fact: We deserve leaders who can kick your ass and field dress your carcass.
    1. Re:Bugs... by Anonymous Coward · · Score: 0

      Ok, you made a lame redundant joke about the O instead of I in chips, and fail to point out the rest of the errors (and then make one of your own?) You fail as grammar/spelling nazi.

  27. Ironic? by Jonathan+the+Nerd · · Score: 5, Interesting

    Does anyone else find it ironic that when Intel makes one mistake in a processor, everyone jumps on them for making a bad product, but software companies can sell products with thousands of bugs in them and people accept this as normal? Sure, we complain about buggy software, but I don't think anyone here expects any software to be completely bug-free. Why are Intel and other chip manufacturers held to such a high standard? Or, more importantly, why are software companies not held to the same high standards?. If Intel and AMD can make incredibly complex processors that are (usually) completely bug-free, why can't any software company in the world make any product that even comes close to being free of defects?

    --
    Disclaimer: The opinions expressed are not necessarily my own, as I've not yet had my medication today.
    1. Re:Ironic? by mojowantshappy · · Score: 1
      Yeah, though you can just release an easy patch for software, when a bug in the hardware can possibly require you to return the product or spen a lot of money to replace the product.

      --

      This page was generated by a Barrel of Circus Midgets, and that is the way I like it!!!

    2. Re:Ironic? by Anonymous Coward · · Score: 1, Informative

      because a processor executes a finite set of instructions and states which can be tested fully. the maximum number of states are equivalent to the on-board memory on the processor (8 megs of cache or so usually) and the address/data lines which are also finite. the microcode is usually less than a couple of dozen Kbytes.
      on the other hand a piece of software has the equivalent number of states that fit into the compiled code, any data it accesses from the hard drive, the contents of its memory, any data inputted or outputted from the internet/real world / keyboard / mouse / database / wherever.. ..testing everything 100% would require recreating every piece of data on the internet / keyboard / mouse / database / real world that may potentially feed into the software (usually billions of petabytes if its even possible to recreate it), all possible states of the files on the drive in hundreds of gigabytes, the gigabytes of memory the software has access to and the same goes for the OS it runs on, the database it talks to and everything else... support libraries etc.
      in other words, hardware is simple to test so it SHOULD NOT have bugs, software is hard to test so it WILL have bugs.

    3. Re:Ironic? by Anonymous Coward · · Score: 0

      Hardware bugs tend to be a much more difficult
      to fix, if you have a processor with a bug you
      either find a cheap workaround or you junk it,
      either way your pretty much SOL.

    4. Re:Ironic? by ziggy_zero · · Score: 2, Insightful

      Because software is a fuckload easier to fix - free downloadable patches, etc.

      With hardware like a proccessor, you'd most likely have to actually replace the part that's broken.

      I agree that software companies should be held to a higher standard, but they can get away with it because the bugs are easier to fix.

      --
      I belong to the ______ generation.
    5. Re:Ironic? by Anonymous Coward · · Score: 0

      Because on Slashdot, unless it's a Linus-blessed Transmeta Crusoe processor running some form of Linux, it isn't infallible.

    6. Re:Ironic? by cperciva · · Score: 1

      Right... it's easier to fix software bugs than hardware bugs. So why do we complain about hardware bugs when the vendor offers to ship free replacements to their customers, but not complain about software bugs for which fixes are not provided at all?

    7. Re:Ironic? by Anonymous Coward · · Score: 0

      but not complain about software bugs for which fixes are not provided at all?

      Who is this "we" you're talking about? Everyone I know bitches like crazy when there's a software bug found.

      Of course, /. isn't the place to go with all those Linux complaints, but that's fine since there are plenty of employed people who do hate Linux.

    8. Re:Ironic? by cybercrap · · Score: 2, Informative

      Haha, i can't help buy laught at this. I'm assuming you haven't taken a superscalar computer architecture class. Testing cpus is a bitch. It is just as hard if not harder than testing software. Sure there are only so many instructions in a cpu, but you have to deal with multiple combinations of instructions and what order they occur. To fully test a modern cpu in every possible state with every possible set of input, it would take more than your life time. Testing the cpu is equally as important as design. People who don't know dick about it blow it off as some easy task, but it is very time consuming and can be mentally taxing to create the best set of test vectors.

    9. Re:Ironic? by Unregistered · · Score: 1

      you can't dl a patch for a chip. 'nuff said. And has AMD had even close to as many bugs as intel?

    10. Re:Ironic? by Realistic_Dragon · · Score: 2, Insightful

      It's called managing expectations, and no one excells at the black art more than Microsoft.

      Everyone expected WinXP to be crap, and they were so relieved that it wasn't as bad as they thought they forgot to complain about the problems that do exist, as evidenced by the number of people who say "WinXP is great, compared to Win98 it's very stable and pretty fast, even though I did have to buy a new PC to run it, but that's just progress, isn't it?" when you ask them what they think of it.

      --
      Beep beep.
    11. Re:Ironic? by Anonymous Coward · · Score: 0

      the fact that you can create test vectors to cover at least 80-90% of all possible states in a modern CPU shows how easy the testing is. in a modern piece of software you cant test even 5-10% of it.
      BTW, while we're dropping credentials, im an EE with a masters degree and 8 yrs experience with ASICs. so fuck off.

    12. Re:Ironic? by Anonymous Coward · · Score: 0

      Sorry to put it this way, but the dichotomoy between expectations for hardware and software are the engineering standards that they are held up to.

      Now this may not seem to be much, when you engineer a physical product, like a microchip, a great deal of time and energy is spent on validating and testing the design.

      Fixing physical problems already in the field is more difficult to do than simply patching software. You not only have to identify the problem and fix it, but manufacture new parts and replace the malfuctioning units.

      Then there is the liability issue. While I am sure the custom written software meets higher standards (generally speaking) due to the close business relationship between manufacturer and client, mass produced software is riddled with holes because the companies that produced them are not held liable for defects. After all, "that's not a bug, its a feature."

      Then there is the whole patent issue. Patents on physical objects and microchips are accompanied with patent that leave absolutely NOTHING to the imagination. After all, patent examiners would be very likely to reject any application that included the phraze "a multiplicity of trasistors" in it. On the other hand, software patents can be obtained that are so vague and obtuse that companies like BT think that they can claim ownership over hyperlinks.

      These comments are not meant to be deroggatory to real software engineers, those individuals who follow rigorous procedures that ensure high quality standards. But it is rather to point out that hardware and software are qualitatively evaluated on different scales. Until there can be assurances that software be produced to higher standards (preferrably through legal liability), then we should get used to this situation.

      I for one hope that South Korea gets its is 100 kilograms of flesh out of the 800 lb gorrilla that is Microsoft.

    13. Re:Ironic? by vadim_t · · Score: 1

      Because software is easier to replace?

      Really, if you find a serious bug in your mail server you can do something. Patch it to do a quick and dirty workaround, perhaps disable a feature, maybe leave it running while you install a different server.

      With a bad CPU, especially something like Itanium you're going to take an expensive machine down. You might not have another one available, and it's quite possible that you don't have spare CPUs either. With the price of Itanium I doubt there are many people have spare ones. Besides, even if you have them, they're almost certainly the same model.

    14. Re:Ironic? by Anonymous Coward · · Score: 0

      Maybe its too hard for you because you're an idiot?

    15. Re:Ironic? by cgori · · Score: 4, Informative

      I love posts that are COMPLETELY TOTALLY WRONG.

      The number of states is 2 to the power of the numbers you were talking about. Even if I take the lowest number ("a couple dozen Kbytes") that you mentioned, it's 2^2*12*1024*8 = 2^24000.

      Guess what?

      That's a HUGE number -- way bigger than the "billions of petabytes" you were saying is impossible to recreate for software testing. It's roughly equivalent to 10^7200 (if that somehow makes things easier for you). Of course, the "couple dozen Kbytes" is a massive underestimation of the total state of a modern CPU (100 million transistors, even just making flip-flops will give 2.5M bits of state, and for 6T SRAM more like 16M bits).

      And then you have the nice problem that physics and electrical phenomena play havoc with hardware testing simulations, as opposed to software, which only has to worry about bad boolean logic.

      Come talk to me next time you have to worry about alpha-particle hits changing the state of any of your code or when you care about any event with picosecond granularity (which is just about every day in hardware).

      Yes, software testing has even more states to worry about, but trust me when I tell you that the hardware problem is plenty big enough to prevent exhaustive testing from being applicable. Hardware testing uses a lot of brute-force regression and detailed test planning to find and remove bugs. Software folks would do well to use such methodologies.

    16. Re:Ironic? by Anonymous Coward · · Score: 0

      Yes, i'm sure your fantastic attitude and demonstrated attention to detail will place you head and sholders above the swathes of similarily-qualified people when you start looking for employment...

    17. Re:Ironic? by Bombcar · · Score: 4, Insightful

      The big problem is when something fails SILENTLY! That's what the BSOD and the Kernel oopsies are! If the system has corrupt data, it is very very bad, worse than losing data. So if the hardware has a bug, then it will pass corrupt data around, and then things fail.....google around for what happens with bad ram, and learn about HAppy Fun Bugs!

    18. Re:Ironic? by slashdot_commentator · · Score: 3, Insightful


      While I do understand your sympathy towards hardware manufacturers, there is one obvious difference between accepting software and hardware bugs. The software bug can be fixed with a patch. The $200 software now works; we can accept that. When the CPU is buggy, the only way that gets corrected is if the manufacturer is willing to replace the CPU. BIG difference.

      I agree completely that software products should be set to a higher standard. But we haven't seen integrity in the industry, so all that's left to fix the problem would be to sic the lawyers at them. I don't see that as fixing the problem...

      --
      There is no America. There is no democracy. There is only IBM and AT&T and DuPont, Dow, General Electric, and Exxon
    19. Re:Ironic? by Anonymous Coward · · Score: 0

      I suggest a 1 year term in cpu verification to enlighten your attitude. It's *very* difficult.

    20. Re:Ironic? by Entropy_ah · · Score: 2, Interesting

      Spot on.
      I'd like to add another problem with testing. How do you know if the processor is giving the correct answer??? Work it out by hand??? Test it on another processor that may or may not have the same design flaws??

      --
      my other penis is a vagina
    21. Re:Ironic? by Jeff+DeMaagd · · Score: 2, Informative

      _Every_ CPU design made by anyone has errata documents. AMD, SUN, DEC, HP, Intel and all other CPU and hardware products end up having a flaw that gets out that causes it to behave outside of specs. Even microcontroller chips, those that execute less than a tenth of the opcodes and have less than 1k RAM and 8k ROM have had erratum posted which must be worked around, fixed or replaced.

      Also, true exhaustive testing is not just about testing all opcodes by running all of them, it is about testing all opcodes reading from all possible registers with all possible data permutations and all possible pipeline orderings.

      I think circuitry on a new CPU has long passed the complexity of a city the size of Alaska. Complete exhaustive testing of hardware has long been impossible to do on new computer chips.

    22. Re:Ironic? by Bingo+Foo · · Score: 1

      PhysicsGenius is posting AC these days?

      --
      taken! (by Davidleeroth) Thanks Bingo Foo!
    23. Re:Ironic? by psp · · Score: 1

      ARM doesn't have any errata, in fact it has no errors whatsoever. When an anomaly is found the behaviour is simply marked as 'unspecified' in the ARM architecture reference manual (the ARM ARM, got it?).

      Seriously, they actually do that.

    24. Re:Ironic? by cheezedawg · · Score: 1

      you can't dl a patch for a chip

      I guess you haven't heard of microcode patches...

      And has AMD had even close to as many bugs as intel?

      Yes. Every CPU on the market has bugs. I remember Palimino had a nasty bug with cache coherency that AMD was reluctant to fix. The only difference is that Intel is on a lot more systems than AMD, so it is a lot more noticable.

      --
      "The defense of freedom requires the advance of freedom" - George W Bush
    25. Re:Ironic? by Anonymous Coward · · Score: 0

      When the CPU is buggy, the only way that gets corrected is if the manufacturer is willing to replace the CPU. BIG difference.

      Or just download a new BIOS with a microcode patch. That has worked for a majority of the errata that has been discovered in recent years.

    26. Re:Ironic? by Mooncaller · · Score: 1

      Hey, the first sane post I've read. I might mention that physical phenomina effect on hardware is often not linear, nor monotonic, nor even continuous. In fact it is often catastrophic in 10 to 20 dimensions ( hystrisis is a one dimensional catastrophy, fight or flight responce is a two dimensional catastrophy). Okay so the phenominas effect on a single piece of hardware is nearly impossible with current math. On top of this, modeling an ideal device does no good. All of the possible devise variation that the process will produce also need to be modeled, and the effects of any particular process variation can be decidedly discontinuose, so catching the corner cases or major transitions does no good.

      And one last point. In software a one is a one, a zero is a zero, and a bit can only be a one or zero. In hardware, a one is a range of voltages (that can change), a zero is a range of voltages, and a bit can be effectivly somewhere in between.

    27. Re:Ironic? by Unregistered · · Score: 1

      I guess you haven't heard of microcode patches...

      I've heard they dn't work for a lot. Is this one? They talk about having to provide replacements, so i assumed it wasn't.

      Yes. Every CPU on the market has bugs. I remember Palimino had a nasty bug with cache coherency that AMD was reluctant to fix. The only difference is that Intel is on a lot more systems than AMD, so it is a lot more noticable.
      Ok, thx for the info. I guess this is the downside for intel of getting all the press.

    28. Re:Ironic? by Anonymous Coward · · Score: 0

      Who expected XP to be crap? Your WinME pals at the bathhouse?

      NT Workstation wasn't crap. 2000 Professional wasn't crap. So there was a reasonable expectation that XP would turn out OK.

    29. Re:Ironic? by Anonymous Coward · · Score: 0

      you test to see if the processor gives the same result as a model, typically in verilog. if your model is wrong (e.g. FDIV bug), you're fucked.

    30. Re:Ironic? by Anonymous Coward · · Score: 0
      /.rs have always struck me as equal opportunity bitchers. We do bitch about issues with the rest of the hardware and software out there. Why, we even bitch about Linux, at times.

      /flame deleted/ Damn Windoze IDE 'development environment' ate my day's work, including backups---so I'm generically bitchy right now.

      Anyway, the point about software companies not being held to as high standards as silicon was very good. That is one of the key questions regarding (esp. proprietary) software.

      Have a nice war,
      Mal the Elder

    31. Re:Ironic? by 10am-bedtime · · Score: 1
      the best software designers are hardware verification engineers (so says this wannabe guru ;-).

    32. Re:Ironic? by Anonymous Coward · · Score: 0

      Then why do we still have windows, if it would be so easy to fix?

    33. Re:Ironic? by EpsCylonB · · Score: 1

      as evidenced by the number of people who say "WinXP is great, compared to Win98 it's very stable and pretty fast, even though I did have to buy a new PC to run it, but that's just progress, isn't it?" when you ask them what they think of it.

      going a bit offtopic but that is progress isn't it ?. XP is a desktop OS that used by people at home a lot. people want their pretty colours, games and multimedia features. I'm not saying that MS software isn't a little bit bloated but it is not as bad as linux people like to make out.

      just cause linux can run acceptably on a 500 mhz machine doesn't mean I am going to.

    34. Re:Ironic? by Bert64 · · Score: 0, Troll

      Itanium on the other hand, is on a LOT LESS systems than AMD chips.

      --
      http://spamdecoy.net - free throwaway anonymous email - avoid spam!
    35. Re:Ironic? by pmz · · Score: 1

      NT Workstation wasn't crap. 2000 Professional wasn't crap. So there was a reasonable expectation that XP would turn out OK.

      This is from the point of view of a person who worked with PCs their entire lives.

      From the point of view of mainframes and UNIX, then, yes, Win NT, Win 2000, and even Win XP are pretty darn crappy. Even in my experience Windows 2000 has had the wierdest behaviors that I have yet to understand--or repeat. Windows is getting less and less transparent with each version, which makes the whole situation even worse.

    36. Re:Ironic? by Anonymous Coward · · Score: 0

      Yeah, and by that standard, Linux is pretty crappy too.

      90% of the difference between PCs and "real computers" can be explained by poor quality hardware. The other 10% is the rush-to-market instinct (an area where Linux is even worse than MS).

    37. Re:Ironic? by pmz · · Score: 1

      90% of the difference between PCs and "real computers" can be explained by poor quality hardware.

      I disagree. The percentage is probably nearer 5% of the difference. Windows has some real problems due to unmanaged complexity in the software. The opacity of Windows means that most bugs either go unnoticed or unreported, because no one really understands how it works.

      Yeah, and by that standard, Linux is pretty crappy too.

      Linux really isn't all that bad. It lacks the refinement of Solaris, but it is certainly a useful and reliable OS.

    38. Re:Ironic? by Jeff+DeMaagd · · Score: 1

      Well, that's kind of a "creative" way at best for them to pretend a problem doesn't exist and avoid an errata.

  28. How about others (AMD, Mot, IBM) by binaryDigit · · Score: 5, Interesting

    who exactly didn't expect something like this? Intel has a history of this sort of thing

    Of course when it happens to Intel, then EVERYBODY knows about it. My question is, how prevelant is this sort of thing throughout the cpu industry? Anyone know of other "mistakes" by the other major players? It's hard to imagine that only Intel makes these kinds of goofs, esp. with the complexity of todays chips. As an example, wouldn't Mot's failure to scale up the G4 PPC chips be considered an "error"? They just caught it early enough to not to ship any chips and say "oh, we're sorry, our G4's won't go as fast as we originally stated, wait another year and a half or so and we'll get it all sorted out". Didn't they also do a similar thing with the 68040?

    1. Re:How about others (AMD, Mot, IBM) by Photar · · Score: 2, Interesting

      I agree I'd like to see some stats on "bugs" in hardware.

      I think Motos problem is they're too busy making cell phones to worry about PPC.

      --
      He who knows not and knows he knows not is a wise man. He who knows not and knows not he knows not is a fool.
    2. Re:How about others (AMD, Mot, IBM) by Anonymous Coward · · Score: 0

      There's a difference between, "caught it before it was shipped" and "had to recall a bunch of CPUs"

    3. Re:How about others (AMD, Mot, IBM) by vadim_t · · Score: 5, Informative

      Not very uncommon, really. Here are some AMD bugs, for example. I think the deal is that the Itanium has a rather serious problem that's been undetected for a long time. Itanium based computers can cost about $20000, which is why it's a big deal. If you have such a system you probably are running something important on it.

    4. Re:How about others (AMD, Mot, IBM) by questamor · · Score: 3, Informative

      The 68040 bug affected quite a few LC040 machines, which made running FPU emulation on them horrid. Basically, trapping calls to the FPU in order to emulate them in software doesn't work as it should. It's b0rked, and most Apple 68LC040 machines just cannot fully emulate an FPU. That wasn't such a problem with the MacOS at the time, as it didn't need an FPU for any functions, nor did most apps.

      Running a normal Linux or NetBSD on one of these machines is asking for pain however,.

    5. Re:How about others (AMD, Mot, IBM) by alienw · · Score: 1

      Failure to scale up G4 chips? The question is, does Motorola care about supplying processors to a tiny market with razor-thin margins? You can be pretty damn sure that apple doesn't hand over much of that premium Mac price to Motorola, and development costs for processors are pretty high. Apple should have just went with the program and switched to x86.

    6. Re:How about others (AMD, Mot, IBM) by AlgUSF · · Score: 1

      Remember sun's ECC cache bug? It was front page news (not on /.) for days and days and days....

      --


      I want my rights back. I was actually using them when our government stole them after 9/11.
    7. Re:How about others (AMD, Mot, IBM) by vought · · Score: 1
      I think I might have just stayed passive if you hadn't made a reference to Apple's per-processor price.

      What do you know about how much Apple pays to Mot? Mot's processor business is just that - their own business - They are responsible for what they decide to charge to Apple.

    8. Re:How about others (AMD, Mot, IBM) by Wesley+Felter · · Score: 1

      Every CPU has errata. After all, CPUs are just software. Ever heard of software that doesn't have bugs? (Besides the Space Shuttle.) Sometimes the manufacturer tells you about it; sometimes they don't.

    9. Re:How about others (AMD, Mot, IBM) by Mr.+Piddle · · Score: 2, Informative

      Remember sun's ECC cache bug?

      Fortunately, that was just a supplier issue, where IBM was giving Sun bad cache RAM. This problem certainly caused a lot of unhappy customers, but it was a straight-forward resolution compared to fixing or patching the CPU itself.

      I've read that the UltraSPARC CPUs themselves tend to have very low errata rates, like a half dozen or so for the UltraSPARC II compared to dozens for Intel's Pentium chips. This is probably the result of Sun's long development and testing cycles, which, in turn, cause Sun's apparent lag in recent benchmarks. Everything's a compromise, I guess.

      --
      Vote in November. You won't regret it.
    10. Re:How about others (AMD, Mot, IBM) by SN74S181 · · Score: 2, Informative

      I know that the Sparc processor in my Ultra 1 has some sort of an 64 bit instruction bug that's bad enough that Sun defaulted the firmware in Ultra 1's to 32 bit mode. You have to change a jumper on the motherboard (hard to get to after opening the case) in order to reflash the Firmware and run 64 bits. I believe the bug is an instruction you can call that crashes the system. Someone else can add more details, I just run NetBSD/Sparc64 on the machine and it's not publically accessable.

    11. Re:How about others (AMD, Mot, IBM) by Anonymous Coward · · Score: 0

      Defensive about the Mac and the G4 processors?

      Why jump all over somebody over it?

    12. Re:How about others (AMD, Mot, IBM) by AlgUSF · · Score: 1

      Yeah nothing like building rock solid systems, except now companies are so cheap, they are going with less reliable alternatives.
      Intel (32-bit) systems==Cheap
      Sun Systems==Reliable
      Intel Itanic Systems==?

      --


      I want my rights back. I was actually using them when our government stole them after 9/11.
    13. Re:How about others (AMD, Mot, IBM) by JebusIsLord · · Score: 1

      Sun: There was a thing with the Sparc3s i believe where you had to download a microcode update that turned off certain FP functions (which were busted), resulting in a speed hit.

      --
      Jeremy
    14. Re:How about others (AMD, Mot, IBM) by nathanh · · Score: 1
      Of course when it happens to Intel, then EVERYBODY knows about it. My question is, how prevelant is this sort of thing throughout the cpu industry?

      Very prevalent. A recent /. story spoke of MMU bugs in the 68K series. Ultrasparc CPUs have had cache corruption bugs. I know somebody who was frustrated (for several weeks!) by a register corruption bug in a microcontroller. These bugs are sometimes "fixed" by changing code-generators (eg, compilers) to avoid problematic sequences.

      I'm not very familiar with Intel or x86 but my general feeling is that they are better than average.

    15. Re:How about others (AMD, Mot, IBM) by oscast · · Score: 1

      Because the guy was wrong with the way he phrased his comment

    16. Re:How about others (AMD, Mot, IBM) by PurpleFloyd · · Score: 1
      While Motorola may not have been able to scale up the G4s as fast as they wanted, that's not nearly as bad an error as the Itanium 2 "glitch" referenced in the article, or something like the F00F bug on the original Pentium. Why? Because Mot caught it in-house! They didn't release chips that couldn't reach their rated clockspeed in a normal operating environment, then advise their customers (who may have paid a fortune for their hot new system) to underclock until it works.

      Most chipmakers don't always meet their roadmaps for whatever reason; planning for the future is never totally reliable. Shipping a defective product, on the other hand, means that either the chipmaker is not testing its products well (bad), or it's sacrificing quality for marketshare and performance (even worse, as the chipmaker is knowingly shipping defective goods). While Intel may get more flak in the press for its errors than other companies, as it's the biggest CPU manufacturer, the fact remains that the chips never should have gone out the door. Nobody's perfect, but it seems that Intel is even less perfect than many others.

      --

      That's it. I'm no longer part of Team Sanity.
    17. Re:How about others (AMD, Mot, IBM) by tundog · · Score: 1

      Bugs are the reality of commercial software. This happens to everyone, even with the best quality assurance processes, no tests can ever replace the testing en masse that occurs when you release a product.

      That being said, some companies have better track records than others. You will never be able to catch all the bugs, but some companies have a corporate culture of releasing products whether they are ready or not. In the long run that's bad business.

      The funny thing about all this is that we all learned back in 2000, that the dot bomb model didn't work, but a lot of executive boards are still running their companies thinking only of the short-term and not the long-term. That simply amazes me.

      --
      All your base are belong to us!
    18. Re:How about others (AMD, Mot, IBM) by Luminous+Coward · · Score: 1
      Why did you link to Google's cache?

      AMD Athlon Processor Model 6 Revision Guide
      AMD Athlon Processor Model 8 Revision Guide

      Itanium based computers can cost about $20000 [...]
      An Itanium 2 workstation from HP costs one sixth of $20000.
    19. Re:How about others (AMD, Mot, IBM) by UnknowingFool · · Score: 1
      Apple should have just went with the program and switched to x86.

      What program was that? Where every new generation of chip runs significantly hotter than the last? Or how about settling for the disadvantages of CISC based CPUs over RISC based? Or how about locking themselves to one vendor?

      Although Intels and AMD chips are comparable, they're not always interchangeable. You just can't put a AMD chip on an Intel motherboard. Also the supply problem went away when IBM was contracted to meet the shortfall. In fact, it looks like Apple may use IBM to supply G5s.

      Everybody in the computer industry uses RISC. Even PC makers like Compaq-HP have a line of RISC workstations. Why?

      Until the current generation of Itaniums, there was no way a x86 chip could compete with a RISC chip in high-end computing in one on one situation. Where x86 beats RISC is price. That and the rise of Linux means that x86 machines can compete IF AND ONLY IF you cluster them in huge numbers.

      --
      Well, there's spam egg sausage and spam, that's not got much spam in it.
    20. Re:How about others (AMD, Mot, IBM) by vadim_t · · Score: 1

      I linked to the cache because I didn't have a PDF reader installed at the time, and thought that maybe other people don't as well.

      You're right, Itanium 2 can be obtained for that price. I didn't research it much and found this. Anyway, the point was that if you have an Itanium it's almost certainly there for a reason, and doing some improtant work, so an error in Itanium might have much more serious consequences than a web server crashing.

    21. Re:How about others (AMD, Mot, IBM) by Anonymous Coward · · Score: 0

      Agreed. The server configuration is also probably much more expensive than the workstation configuration.

    22. Re:How about others (AMD, Mot, IBM) by Anonymous Coward · · Score: 0

      Sun's lag in benchmarks is really more of a result of the incredible cost of designing a top of the line microprocessor these days. IBM and Intel are the only ones that can afford it anymore. I wouldn't expect to see more than one more generation of UltraSparcs out of Sun.

      You'll note that AMD's 'new' Opteron is an incremental change to their Athlon core; a magnified photo of the core should be enough to convice anyone of that.

  29. Freakazoid? by jon_eccleston · · Score: 1

    Didn't Dexter Douglas become Freakazoid when his cat punched 'a specific set of keys in a specific sequence' after he installed a CPU with a 'glitch'?

    1. Re:Freakazoid? by Cappy+Red · · Score: 1

      That's the part Intel didn't reveal. They figured the story would send some fun lovin' folks into a frenzy looking for the glitch, but since they didn't tell them the glitch requires being connected to the internet, and following the keys with delete.

      Having a cat named Mr. Chubbikins helps too.

      *honk*

      --
      This is my sig. It's prescription, I swear. I need it for reading things... on the other side of things
  30. Well, the chip is more important by YllabianBitPipe · · Score: 1

    It's the main component of a computer. Besides, for software, it's much easier to update (bug fix). If your processor is messed up, it's a lot worse.

  31. Try a few hundred.... Check these out... by Glasswire · · Score: 1, Informative

    Agilent Technologies
    ChevronTexaco
    Cornell University
    DreamWorks
    Johns Hopkins University
    Liberty Medical
    National Crash Analysis Ctr.
    NCSA
    PNNL
    Rice University
    Sony Pictures ImageWorks
    Wells Fargo
    VeriSign, Inc.
    Airbus
    British Petroleum
    CERN
    Daimler-Chrysler
    Daresbury Laboratory
    Erickson Utvecklings AB
    HLRS
    Philips Semiconductor
    Preussag
    SecFinex
    Triaton
    Univer sity of Oslo
    VTG-Lehnkering AG
    Bio-Informatics Institute
    Fujitsu ISOTEC, Ltd.
    Ibaraki Hitachi Information Service Co., Ltd.
    MarketBoomer
    Mazda
    Mitsubishi Heavy Industries
    Mitsui Chemicals
    Okazaki National Research Institute
    Singapore-MIT Alliance
    Subaru Research
    Toyota Autobody Corp.
    (nad lots of others....)

    1. Re:Try a few hundred.... Check these out... by Eudial · · Score: 2, Funny

      Dear mr. Glasswire. You have now spoiled a perfectly fine joke. Congratulations.

      --
      GAAH! MY PRINTER IS ON FIRE!!! PUT IT OUT! PUT IT OUT!
    2. Re:Try a few hundred.... Check these out... by Anonymous Coward · · Score: 2, Funny

      Naah...

      Itaniums are so powerful, all those companies run off just six chips.

      Oh... apparently this bug means the bottom 4 companies have to wait until Itanium 3 for 64-bit computing, sorry guys.

    3. Re:Try a few hundred.... Check these out... by Anonymous Coward · · Score: 0

      Way to not get the joke, glasshole.

    4. Re:Try a few hundred.... Check these out... by Anonymous Coward · · Score: 0

      So a couple of car companies and a bank wow. Now I know why my car keeps breaking down.

    5. Re:Try a few hundred.... Check these out... by Anonymous Coward · · Score: 0

      Yes, driven it right into the ground by showing *dozens* of customers for Itanium. I'd sure like to see AMD or Motorola come up with a list of customers *that* long.

    6. Re:Try a few hundred.... Check these out... by Glasswire · · Score: 1

      Got the joke - I was just explaining why it wasn't funny. It's only funny if there was some truth to the belief that Itanium hasn't been widely deployed. Instead the joke depends on pandering to a popular misconception.

    7. Re:Try a few hundred.... Check these out... by Anonymous Coward · · Score: 0

      According to analysts, sales volume for Itanium 2 was around 2,000 (two thousand) for all of 2002. This would generally be considered a disaster for any company, but for a volume-oriented company like Intel, there just aren't words to describe it.

      Honest, if you can list all the customers for a product in one Slashdot posting, this is not exactly a raging success.

      "Using simple math, this means that Intel will have to sell 2,498,000 chips, assuming a $2,000 margin per chip, to even pay back development costs."
      -- Gabriel Group, 2003

  32. What about the next chips that prevent clocking? by narratorDan · · Score: 1

    Not too long ago I saw a story that talked about how Intel was working (had) chips that wouldn't work if speed!=Intel settings. The new chips might prevent this kind of work around.

    Oops, I posted.

    --
    "If you're not confused by quantum mechanics, you really don't understand it." - Niels Bohr
  33. Hey Intel! by craenor · · Score: 4, Funny

    I have about 6 years experience in Quality Assurance, with emphasis on electronics, manufacturing processes and attention to detail.

    You know...if you're looking for anyone that is.

    1. Re:Hey Intel! by phsdv · · Score: 1
      And what would you have done differently?

      I think their QA did OK by telling there is a problem, a work around, and that they are working to fix it. What else do you want, from a QA stand point?

      This looks like a very special test escape that is hard to find (not impossible apparently). And their test engineers need to write a test for this before they can ship again.

    2. Re:Hey Intel! by Troll_Kamikaze · · Score: 1

      Six years of QA experience?

      I'm Intel's Human Resources Director, and I must say that your qualifications sound intriguing. There's just one sticking point: how do you propose to travel from the DEPTHS OF HELL to our corporate headquarters for an interview?

      (wink ;)

    3. Re:Hey Intel! by Mooncaller · · Score: 1

      Do you know what 2k equivelent yield is? Did'nt think so. Electronics manufacuring and semiconductor manufacturing are two very different fields. I've worked in both. BTW this is NOT a QA problem. It is a qualification problem.

      Oh, Intel, I have 23 years experience in test development and qualification of new products. I have been involved in everything from DC to light. That is if you are looking for anyone willing to work for cheap.

  34. que? by wisdom_brewing · · Score: 1

    all those gloches with the chops and the topys... or is that tpyos?

    1. Re:que? by Tailhook · · Score: 1

      it's tyops :)

      --
      Maw! Fire up the karma burner!
  35. Bad Joke. by rf0 · · Score: 2, Funny

    How long does it take an Itanium to count to 10?

    I don't know but will let you know when it gets there

    OMG I don't believe I just wrote that

    rus

  36. congrats you're an idiot by Anonymous Coward · · Score: 0

    What you wrote is not a list of instructions, genius.

    1. Re:congrats you're an idiot by Anonymous Coward · · Score: 0

      $mkdir ~/congrats you're an idiot
      $cd /congrats you're an idiot
      $ ./fyouandthehorseyouroadinon -revenge | cat dif "What you wrote is not a list of instructions, genius." //out

      So maybe they're trolls, what would /. be without them?
      How do you feed a troll? Be nasty, its a very American thing to do.
      We are all trolls in some small way.
      Some people just give in and admit it unless you live in Redmond, which is a troll free zone with lots of sheep.
      $ls
      fyou.out
      $./fyou.out +mail
      $sendmail +anon slashdot.org/comments.pl

  37. That something like lamb chops? by Cytlid · · Score: 0, Redundant

    NeoChichiri writes "News.com is running on an article about glitches in Intel's Itanium 2 chips. Even though it doesn't affect all chips, they have still stopped shipments of the new 450 Servers until the problem is resolved. Apparently it has to be 'a specific set of operations in a specific sequence with specific data.' Intel is saying that affects the 900MHz and 1 GHz Itanium 2 chops and that it will not affect the upcoming 1.5 GHz Itanium 2 6M chips." Until the next iteration of chip arrives though, Oliver Wendell Jones writes, "they recommend working around the problem by underclocking the processor to run at 800 MHz instead of its default 900 MHz or 1 GHz."


    Chips... chops... I'm confused...
    --
    FLR
  38. GREAT! by Anonymous Coward · · Score: 0

    We got screwed, we just bought 300 i2 900mhz processors(in dual proc systems) and NOW this comes up. Well at least I was right when I said we should have bought xeon's!!

  39. Pentium FDIV again? by XSforMe · · Score: 2, Funny

    Anybody else had flashbacks of the Pentium FDIV bug and this excelent post?

    --
    My other OS is the MCP!
  40. Re:AMD! by Anonymous Coward · · Score: 0

    AMD STILL SCUCKS ASS

  41. Possibly timing or power related by dprice · · Score: 4, Informative

    There isn't much detailed information about the exact conditions that bring out the bug, but they do state that the bug is electrical, that some unspecified combination of instructions and data pattern are needed, and that reducing the clock frequency avoids the problem. I can think of several things that might cause the bug. These are just guesses.

    One possibility is that there is a slow timing path in the logic that is marginally meeting the 900MHz or 1GHz clock speed. Going to 800 MHz gives the slow path more margin. This is the easy answer.

    Another possibility is that they have some part of the chip that has insufficient metal to deliver power to the logic gates. The right combination of activity might cause enough voltage droop to cause logic errors. Slowing the clock reduces the power consumption in CMOS chips.

    They might have a crosstalk problem between some signals that could flip bits when the right activity and frequency are combined. Slowing the clock can shift the relative positions of signal transitions.

    Eventually more details might surface, but Intel is probably keeping it quiet so that people don't write code to maliciously crash servers.

  42. HAL-9000 on Itanium by handy_vandal · · Score: 4, Funny

    "Open the Itanium register sets, HAL."

    "I'm sorry, Dave. I can't do that ...."

    --
    -kgj
  43. "affects the 900MHz and 1 GHz Itanium 2 *chops*" by jabber01 · · Score: 1

    Obviously, the error is division related.

    --

    The REAL jabber has the user id: 13196
    What you do today will cost you a day of your life

  44. Problem is the Hardware (re:Microcode?) by reporter · · Score: 2, Informative
    The problem that Intel is experiencing is not unique. Within the last 5 years, Sun Microsystems has been experiencing significant problems with its own processors. Please read "Sun suffers UltraSparc II cache crash headache".

    In terms of reliability, the Itanium II is no worse than the UltraSPARC series of chips. Both Itanium and UltraSPARC face the daunting task of debugging 100+ million transistors. Ensuring that the fabricated chip is bug free is virtually impossible. So, both companies have substantial errata sheets.

    The reason that Intel chips "appear" to be more error prone than other companies' chips is that Intel chips are extremely popular. So, people tend to pay far more attention to flaws in Intel chips than they do to flaws in other comapanies' chips. However, since so many people pay attention to the flaws in Intel chips, they are likely to have less bugs than other chips. The economies of scale that, say, the Pentium 4 enjoys means that if the Pentium 4 does have a bug, then it will likely be found by someone among the gazillion users. Then, Intel will fix the problem. Economies of scale help to lower the cost of a product but also help to lower the number of bugs.

    In any event, the performance of the Itanium II is at least 1 order of magnitude greater than the UltraSPARC III and (soon) IV. That performance difference is due to serious architectural mistakes in the UltraSPARC family of processors.

    1. Re:Problem is the Hardware (re:Microcode?) by Anonymous Coward · · Score: 0

      In any event, the performance of the Itanium II is at least 1 order of magnitude greater than the UltraSPARC III and (soon) IV. That performance difference is due to serious architectural mistakes in the UltraSPARC family of processors.

      I would like to see proof of this please.

      I have a feeling you're just doing some anti-Sun trolling but a some facts might enlighten me.

    2. Re:Problem is the Hardware (re:Microcode?) by Mr.+Piddle · · Score: 5, Insightful

      You really are a troll, tonight!

      Please read "Sun suffers UltraSparc II cache crash headache [theregister.co.uk]"

      This was a problem with the cache RAM and not the CPU itself. It was traced to a supplier (IBM), who was selling a defective product.

      In terms of reliability, the Itanium II is no worse than the UltraSPARC series of chips.

      There is no data to back this up. I know you don't have it, and I certainly don't have it. The only people who really have it (Intel and Sun) probably won't give it to us, so this ends here.

      However, since so many people pay attention to the flaws in Intel chips, they are likely to have less bugs than other chips.

      This is not true. Intel is pressured by a time-to-market more than other suppliers, especially with respect to the Pentium line. Sun has obviously decided to delay product launches to work out issues (e.g., UltraSPARC IIIi), because their customers expect reliability over other concerns. Hardware doesn't really follow the "all bugs are shallow" mantra of the Open Source movement, we mainly have to have faith in the manufacturer's simulation and test labs.

      In any event, the performance of the Itanium II is at least 1 order of magnitude greater than the UltraSPARC III and (soon) IV.

      Do you even know what "order of magnitude" means? You are claiming that, if the UltraSPARC III scores 975 on something that the Itanium II would score 9750??? For a given clock, it is true that the Itanium II is faster than the US III, but by a fraction--not a factor of ten!

      Also, the US IV, by definition, will be almost twice as fast as the US III for throughput, because it is two US III chips in one.

      You really don't know what the facts are.

      --
      Vote in November. You won't regret it.
    3. Re:Problem is the Hardware (re:Microcode?) by raxx7 · · Score: 2, Informative

      Also, the US IV, by definition, will be almost twice as fast as the US III for throughput, because it is two US III chips in one.


      First, UltraSparc IV will be a Out-of-Order CPU. Any comparisson with the In-Order UltraSparc III ends here.

      Second, "two chips in one" is misleading. It will be a CMP chip: multiple cores on one die, sharing external interfaces and higher levels of cache.

      Thirdly, the performance gain of doubling the number of cores per die (or the number os CPUs in a system) doesn't mean it can provide twice the throughput.

    4. Re:Problem is the Hardware (re:Microcode?) by hotchai · · Score: 1
      First, UltraSparc IV will be a Out-of-Order CPU. Any comparisson with the In-Order UltraSparc III ends here.

      Read this article at ExtremeTech. It states that UltraSPARC IV will have 2 UltraSPARC III cores. If the UltraSPARC III are in-order, how can UltraSPARC IV be out-of-order?

      Besides how does it matter if the core is in-order or not? All I want is kick-ass app performance - I don't care if you do in-order, out-of-order or even get a group of chimps to run through the instructions.

    5. Re:Problem is the Hardware (re:Microcode?) by pmz · · Score: 1

      Second, "two chips in one" is misleading. It will be a CMP chip: multiple cores on one die...

      Okay, dual cores is more accurate than dual chip.

      Thirdly, the performance gain of doubling the number of cores per die (or the number os CPUs in a system) doesn't mean it can provide twice the throughput.

      For a large number of applications, it can, and the Solaris kernel's fine-grained threading improves the odds greatly. For applications that saturate the processor's external bus, then it is certainly possible that one of the CPU cores would not get utilized efficiently.

      It's definitely a compromise. However, the idea of a 212-core Sun Fire 15K or even a four-core Sun Blade 2000 is quite enticing.

  45. phreakazoid by chillax137 · · Score: 1

    a specific set of operations in a specific sequence with specific data reminds me of that show on the WB called Phreakazoid i'm phreaking out!

    --
    chillax137
  46. Mwhahaha by nate+nice · · Score: 5, Funny

    Finally, The electrical engineers are to blame. I knew my code was correct!

    --
    "If you are a dreamer, a wisher, a liar, A hope-er, a pray-er, a magic bean buyer ..."
  47. zero tolerance for undetected corruption...? by bani · · Score: 4, Funny

    "Until we're sure the issues are 100 percent resolved, we're going to keep holding back shipments with the 450," IBM spokeswoman Lisa Lanspery said. "We have a policy of zero tolerance for undetected data corruption" at a customer site, she said.

    so detected data corruption is just fine, then...? :-)

    1. Re:zero tolerance for undetected corruption...? by Anonymous Coward · · Score: 1, Interesting

      Detected data corruption can possibly be fixed. Such as a single bit error on an ECC protected memory bus. And if the ECC error is a multibit error and unfixable, the OS is notified that the read got corrupted, and the OS is free to try again or shutdown or do whatever it thinks is reasonable. But that's better than say, getting the wrong stock buy order for a financial transaction.

      So, to a certain extent, detected data corruption is fine, because server class chipsets are designed to be able to correct or handle some types of mistakes.

    2. Re:zero tolerance for undetected corruption...? by Anonymous Coward · · Score: 0

      >> "Until we're sure the issues are 100 percent
      >> resolved, we're going to keep holding back
      >> shipments with the 450," IBM spokeswoman Lisa
      >> Lanspery said. "We have a policy of zero
      >> tolerance for undetected data corruption" at a
      >> customer site, she said.

      >so detected data corruption is just fine, then...? :-)

      Basically, yes. If all forms of data corruption are detectable, you can plan checks and balances into your systems. No system will ever be 100% immune to data corruption. One good gamma ray burts going through your RAM and suddenly somebody has a few too many or too few zeroes on the sum in their bank account. If the errors are undetectable, then you are just fucked up the ass with a cucumber until you cry.

      On the other hand, if the system does detect that the working set is in an impossible state, it can at least demand that all data be re-entered, or go to a backup, or at very very least alert the bank manager that he only has a few hours until the FBI shows up to find out where all the money went... :)

      Naturally, uncorrupted data is best, but given the choice between detected and undetected corruptions, I'd much rather be aware of a situation, even if I can't do anything about it, or don't really care (such as if a bit were flipped in a game of quake and a single frame rendered with pixel number 13726 seven shades lighter than it should have been.)

      Thank you.

  48. anti-overclocking patent by smartdreamer · · Score: 3, Funny

    Maybe this is not a bug, maybe this is just Intel's new anti-overclocking technology!

  49. Not suprising considering validation... by Thinkit3 · · Score: 1

    Spent some time setting up their validation tests. Without getting too specific, it was a bit of a mess. They do have good incident response though--lots of people working weekends on that.

    --
    -Libertarian secular transhumanist
  50. Hmm... by Cervantes · · Score: 2, Funny

    Ok, I guess the joke is now no longer:

    Intel Inside: Get 99.98765374% from your PC!

    Instead, it's now:

    Intel Inside: Get 99.98765374% from your ...>>NO CARRIER

    --
    If I knew the wedgies I gave you back in 6th grade would have resulted in this . . . I might have taken a moments pause.
  51. Wow this is bad by glenebob · · Score: 2, Funny
    "Intel is saying that affects the 900MHz and 1 GHz Itanium 2 chops"
    So... is /. an early adopter?
  52. Well... I guess this proves they shouldn't have by Anonymous Coward · · Score: 0

    rushed Itanium to market...

    (ducking)

  53. Re:AMD! by Anonymous Coward · · Score: 0

    AMD also sucks your mother. I know - I was there.

  54. MOD PARENT DOWN by Anonymous Coward · · Score: 0

    COMPLETELY WRONG! See the non-AC reply for an accurate explanation.

    1. Re:MOD PARENT DOWN by Anonymous Coward · · Score: 0

      Uh..thanks cuntfuckle.

  55. Re:Do you like the economy in the toilet? by Anonymous Coward · · Score: 0

    Yeah, hopefully the SlashTrolls will be able to find a janitorial job instead of their usual everyday crapposting.

  56. And at $7000 each... by Anonymous Coward · · Score: 1, Insightful

    ...that's why Itanium is not yet in the mass-market. Regardless of the flaws. It's also why my first 64-bit CPU is likely to be an AMD.

    I mean, come on. I want 64-bit a lot. An awful lot. I even wanted a redesigned instruction set - something I was please to see Intel had the balls to do with Itanium. The old IA-32 has a lot of baggage and bad design choices. But for crying out loud - $7000 for a single chip?

    I kid you not about the $7000 price tag for a single CPU - Itanium 2 is literally 10 times more expensive than AMD's 64-bit Opteron.

    This is why, as a life-long fan of Intel, am planning on defecting to AMD with my next machine purchase. To anyone else buying a new machine in the next 6 months: it makes absolutely no sense to get a 32-bit system when vastly more capable and future-proof 64-bit ones can be had for almost the same price as the old 32-bit ones.

    This time next year, I reckon that Intel will be steadily loosing market share to new system purchases because of their exhorbitant prices and their complete failure to provide a capable 64-bit platform in the same price range as AMD. Intel haven't even announced a consumer 64-bit chip. AMD's announced six - 2 Opterons, 4 Athlon 64's).

    I'm desperately hoping that AMD and Microsoft's marketing machines ramp up to push 64-bit - the sooner the better to punish Intel for not being more pro-active in the marketplace.

    1. Re:And at $7000 each... by You're+All+Wrong · · Score: 1

      If you want 64-bits, buy yourself a 21064-powered DEC from 1993.
      Yes _ten_ years old. Going for a song on E-bay (they're practically worthless, I won't deny that, but they're lovely machines).

      And if you want speed too - looking at the Open SSL benchmarks for 21264 alphas (EV67), it looks like 4-year old Alphas can beat last year's Intel/AMD chips if coded well enough.

      64-bit is 1990s technology. Intel and AMD are just slow to adopt it. (Or so say MIPS, Power, Sparc, HP-PA, and Axp.)

      YAW.

      --
      Your head of state is a corrupt weasel, I hope you're happy.
    2. Re:And at $7000 each... by rugger · · Score: 1

      its easy for 64bit machines to beat 32bit machines in encryption code, especially encryption code that deals with big integers ... like RSA and DSA.

      RSA of a given keysize requires about 25% of the work on a 64bit cpu it requires on a 32bit cpu.

  57. Re:What about the next chips that prevent clocking by ocelotbob · · Score: 1

    Intel probably doesn't care about underclocking as much, so the overclock protection circutry is probably more along the likes of
    if (($clockspeed) > ($specspeed)) shutdown
    than
    if (($clockspeed) != ($specspeed)) shutdown
    After all, faster clocked processors are more expensive, thus, Intel's already made their money off you if you underclock. They're more worried about overclocking because it skims money off their high profit margin chips.

    --

    Marxism is the opiate of dumbasses

  58. Not ppc603s by questamor · · Score: 4, Informative

    ow about Motorola leaving out critical instructions in the PPC603 and crippling every machine with one compared to the PPC601?

    That's a very very big reinterpretation of the facts. ppc603 machines were designed for low cost low heat. One of the ways to do this was to further remove instructions that were not needed, legacy instructions from pre-PPC601, and were never designed to be in the 601. They were not 'critical' and did not cripple anything. ppc603 cpus ended up working just for the purpose they were designed for. cheaper and less energy-hungry cpus.

    the G3 floating point debacle where excel spreadsheets would show up errors consistently

    You made a typo there. "Pentium" is not spelled "G3"

    1. Re:Not ppc603s by Anonymous Coward · · Score: 0

      What a sad sad apple zealot you are, Intel fuck up and you're all over them for shoddy quality yet when Apple do the same "they meant to do it"? You should look hard at yourself

    2. Re:Not ppc603s by questamor · · Score: 1

      Show me where I mentioned "Apple". I mentioned Motorola PPC chips, not Apple.

      Show me where I was "all over them (intel) for shoddy quality".

      You seem quite heated up about some little presumptions you made up in your own mind about what I said. Do you often get in this state of arguing with your own misconceptions?

  59. Geesh, Give Intel a Break by Mooncaller · · Score: 4, Interesting

    Itanium is a very new architecture. It has the potential for kicking i386 chips in the butt once it has a chance to grow up. With anything as radicaly new as the Itanium, there is a high probability of unexpected problems. AMD has not had this sort of problem resently because they don't have any balls. All they ever do basicaly amounts to minor tweeks of a stable design. Even their 64 bit extensions fall into this catagory.

    The type of problem Intel is dealing with could very well be in a new class. I have a hunch that it has to due with either unexpected capacitive coupling ( possibly related to an in-spec extreme of the process variation) or thermal transients causing timing skew. These types of phenomena are nearly impossible to model, especial if its tied to a particular set of process deviations. That is why manufacturer do such extensive qualification testing. Unfortunatly this testing can not be done untill there are enough units to test ( like in the 1000s). This does not happen untill the device is ready for production. Technicaly, this is the Pilot phase of development.

    One needs to give Intel some credit for learning a lesson from the Pentium fiascos ( not just the math error, but also the original ( 5V) 90Mhz burn-up issue). At least they are doing the right thing now. Corporations, like people, sometimes need to learn the hard way. Unfortunatly, though people usually retain their lessons, Corporations sometimes need to relearn them, especialy when being run by greedy BODs ( or board members with hidden agendas). AMD has yet to learn this particular lesson. One of these days, they will try to cover up a problem and its not going to work. They have gotten away with some stuff already because everyone loves to hate Intel ( me included, 68000 and PowerPC for me!)

    Unless your familiar with LSI semiconductor manufacturing, you should not be commenting. Because you don't have a clue as to what is going on. The posts I've read so far, remind me of what a class of 10 year olds would right in criticing Joseph Conrads "Heart of Darkness".

    1. Re:Geesh, Give Intel a Break by purdue_thor · · Score: 3, Insightful

      >>Unfortunatly this testing can not be done untill there are enough units to test

      >>AMD has not had this sort of problem resently

      Does that mean they're jealous of Intel's problems and resent not having them?

      >>The posts I've read so far, remind me of what a class of 10 year olds would right in criticing


      Wow. With your mad spelling and grammar skills you ought to know exactly what 10 year olds are capable of.

      But seriously though, Intel sells these chips to a completely different market than AMD. The customers here demand a box that never falls over. IBM and others test like crazy to make sure this stuff doesn't happen with their big chips. You have to remember that these aren't $800 clones here -- rather, big tin -- 6 figure kind of stuff.

      Oh, and when has it been customary to give credit to companies for making mistakes? It's like everything else, you have to earn trust. Intel's trying to get into the lucrative big tin market and need to earn the trust of people. So far they're not handling this one well... I'd be super ticked if I had to take down my machines (the ones that I was under the assumption would give me 99.999% uptime) so that I could underclock them to 800MHz. I do supercomputing work and I expect the machines to just run. That's what I paid all that money for, right?

    2. Re:Geesh, Give Intel a Break by Mooncaller · · Score: 1

      Actuly I relie on spel chekers alot. I already know that my spelling is atrocious. More so when I am being distracted every few second by my son. He thinks that since I'm home, I'm there for him. My grammar on the other hand, is normally good. The problem comes when I'm in a hurry and unable to complete even one sentence without answering a silly question. I have been dedicating my 9 to 5 to writing my resume. Which has burnt out my language composition circuitry. Unfortunatly, most have my posts are about as bad as this one. Oh well.

      Back to the original subject. I did not intend to imply that AMD does not do appropriate qualification of their new products. I am sure they do. If they did not, they would not be in buisness very long. What I am saying is that the class of problem, Intel is know investigating, is appearently new. The flak from the majority of the /. crowd is silly. It comes from ignorance of semiconductor manufactering and a dislike of Intel. Problems will always crop up when extending technologies reach. In all probability, Intel could have chosen to keep the problem quiet and continue to ship, and no one would have ever seen the problem. It is, after all an exceding rare event. There are indications that AMD has done just this befor. The other thing Intel could have done is to update the Final Test plan to weed out these defects. Intel did the right thing by stopping production. The engineers want to understand the cause and fix the problem. They do not want to apply a bandaid. This is what they should be given credit for.

      Current simulation technology is incapable of modeling all known phenomina. The math is just not there yet. Once there are devices to test, it would literaly take YEARS to go through all instruction sequences, with all possible data, for all devices the qualified process is likely to produce, over the entire speced operating enviroment range. ( How do you like that grammar!?) Once the current problem has been quantified, models will be updated ( if possible). Future designs will minimize the chance of any problems ( i.e. more tollerent of process variation). Process will be modified to yield device with tighter specs for whatever characteristic "turns on" the problem.

      I have worked in semiconductor industry for 15 years. My first eight years involved devices that usualy pushed technology, with no room for error ( all Level S stuff). On top of that, I grew up around IC manufacturing. My dad was an engineer. He sometimes took work home. I asked a lot of questions. I heard a lot of tales. Through my own experiences and what I learned from my dad, I could relate many many horror stories. I've seen combinations of events so unlikely that Mojo Jojo ( Powerpuff Girls) would not have thought of them.
      I would relate a few of the wierder( okay I can't spell, and I'm to tired to look it up) one but it is late, my son just got home, and I want to spend time with him ( after putting him off all day.)

    3. Re:Geesh, Give Intel a Break by EpsCylonB · · Score: 1

      I have a hunch that it has to due with either unexpected capacitive coupling ( possibly related to an in-spec extreme of the process variation) or thermal transients causing timing skew.

      In that case we need to change the gravitonic phase, reduce the tectronic radiation and then increase the nucleonic flux.

    4. Re:Geesh, Give Intel a Break by JCholewa · · Score: 1

      > Itanium is a very new architecture. It has the potential for
      > kicking i386 chips in the butt once it has a chance to grow up.

      Could you please detail what you think the benefits are that place IPF above x86? I do agree there are are such benefits, but there seem to be some downsides.

      For example, my conversations with Intel engineers lead me to believe that IPF's VLIW architecture is designed in a way that frowns upon the use of OOE (out of order execution) and similar cpu-level optimizations. One of the implicit problems here is that cpu-level (microarchitectural) optimizations have shown to transcend differences between instruction set architectures. Heck, even in the Itanium, the performance benefits shown do not appear to have much to do with the ISA. Instead, the high performance of the processor in certain areas is largely due to it having a bloody load of extra functional units, a feature that you can add to any modern ISA, even x86. Can you with certainty say that an x86 chip with eight flat register floating-point pipes (yay, you can do that now in x86!) and five megabytes of on-die cache would not outperform the Itanium II in floating-point applications?

      What does the Itanium have, save for absolutely top-notch compiler writers and the rather interesting but seemingly wasteful feature known as predication?

      -JC

  60. The Law of Leaky Abstractions by btakita · · Score: 1

    The Law of Leaky Abstractions.

    As we abstract more, we lose touch with the lower layers. As more abstractions are introduced, new classes of bugs are also introduced.

    So while it is easier to optimize software architecture and see other "high level" system design bugs, more "low level" bugs will creep in from this abstraction.

    1. Re:The Law of Leaky Abstractions by Anonymous Coward · · Score: 0

      That's a really great article. Saved. Thanks!

  61. Wrong figure by SuperBanana · · Score: 2, Funny
    What, this is going to affect all 6 people that own this chip?

    No, all 6.666666666666666666666 people.

  62. IBM PowerPC errata by Anonymous Coward · · Score: 0
    IBM apparently doesn't publish POWER4 errata, since they don't sell that as a product. Presumably they would publish errata for systems containing them or just publish fixes as patches to the operating systems they run. I'm not sure how many (if any) errata there are for them. They do publish errata for PowerPC microprocessors, though. This page lists their PowerPC microprocessor product information, and has links for information on each product. Those show several errata, such as this one for the PowerPC 750 processor.

    The description is kind of funny: "This document identifies implementation differences between versions of the PowerPC 750- PID8p processor and the description of the processor contained in the User's Manual.

  63. That's why I demand genuine Intel Inside... by Anonymous Coward · · Score: 2, Funny

    Because I truly believe that 1 * 1 == 2.

    1. Re:That's why I demand genuine Intel Inside... by wirelessbuzzers · · Score: 1

      It's 5. Not 2. 5. 5. Rats. 5.

      --
      I hereby place the above post in the public domain.
  64. Performance: Itanium 2 vs. UltraSPARC III by reporter · · Score: 1

    According to "SPEC", the Itanium 2 trounces the UltraSPARC III in performance and beats it by a wide margin. According to the "Transaction Processing Council", the Itanium 2 beats the UltraSPARC III by a wide margin on the most important commercial benchmark: TPC-C. An Itanium-powered server has close to the world record: 660,000 transactions per minute.

  65. Give ME a break by m11533 · · Score: 3, Informative

    While I have no particular animosity toward Intel, other than it is important for there always to be competition to push them, I do not think they need to be let off the hook. Itanium has been around a very long time. You may think of it as new technology, but that is more because of the lack of acceptance in the marketplace, not because it has only recently been released. What was happening all of these years since Itanium was initially launched?

    Additionally, while the Itanium instruction set takes a different approach to those of Intel's competitors, they are not the only company introducing new CPUs. I do not remember such problems when other 64bit CPUs with their own, new, unique instruction sets were launched by Digital, HP, IBM or Sun to name just a few. These days, the competitive landscape has been radically reduced. Digital no longer exists and its Alpha architecture is owned by Intel. HP, while it still owns its PA-RISC architecture, is trying to migrate its customers to Itanium, though it is hard to say what will really happen to PA-RISC since no one seems anxious to adopt Itanium. IBM also has picked up Itanium, so who knows what will happen to their RISC architecture? That leaves Sun, and while SPARC has always been the weaker of the RISC architectures, it seems to be the primary remaining competitor to Itanium and Intel. Of course, who knows how much longer Sun will survive as an independent company? Maybe they are the next to be gobbled up by IBM or HP, both already commited to Itanium, so what happens to SPARC?

    Finally, it is hard to say exactly where AMD fits in all of this. Its 32-bit line provides excellent competition to Intel's 32-bit Pentium family (now at P4), and the AMD 64-bit architecture looks like a nice increment beyond the now very old x86 32-bit architecture. But, in terms of major pressure on future CPU architectures? I just don't know where that competition will be coming from... Maybe China, Russia, Japan, India? Places not noted for their hi-tech prowess, but with lots of experience in fabrication and lots of affordable talent?

    1. Re:Give ME a break by chez69 · · Score: 3, Insightful

      Don't count on IBM abandoning their RISC chips yet, They power some of IBM's most profitable hardware (AS/400 and RS/6000) platforms.

      --
      PHP is the solution of choice for relaying mysql errors to web users.
    2. Re:Give ME a break by Mooncaller · · Score: 1

      In one sentence. You do not know what you are talking about and your "facts" are wrong. Thats what you get for using /. post as your primary source of info. P.S. I worked for Motorola and then for HP. I've been following Microprocessor technology since the release of the 6805. Guess when that was?

    3. Re:Give ME a break by drinkypoo · · Score: 1

      About the last time you experienced any personal growth in the area of maturity?

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    4. Re:Give ME a break by Tower · · Score: 1

      Don't forget the zSeries (mainframes). Plenty of horses under those hoods. The Power4 and Power4+ chips are rather quick, and extremely reliable (due to the needs of server-class hardware).

      A 1-way Power4+ 1.7GHz comes in with a Spec Int of 1113, FP at 1699 (this all with 128MB of L3 cache).

      Yes, I work for IBM, but not the proc design area.

      --
      "It's tough to be bilingual when you get hit in the head."
  66. Re:Performance: Itanium 2 vs. UltraSPARC III by cgori · · Score: 2, Informative

    Er, you do seems to be trolling just a bit. The US-III@ 1.2GHz achieves a base SPECint of 637, and the 1.0GHz Itanium-2 is 807. Yeah, it beats it, but trounces it? err, well, not really.
    And it's a far cry from the "order of magnitude" better performance than the grandparent post's claims.

    What's really funny about this post is that normally I am the one bashing Sun's CPUs... *boggle*

    Obligatory AMD note: the new SPEC update today shows that a 1.8GHz Opteron SPECint base is 1081.
    On a price/performance basis, I would consider that to be the trouncing chip -- maybe even in the order-of-magnitude range.

  67. Can you say Class Action Lawsuit? by Zathras11 · · Score: 1, Insightful

    Are you kidding me! If I paid for a server
    with a 900 Mhz or 1 Ghz processor, and the
    company that produced the processor said that
    I had to underclock the chip for it to work
    properly I'd ask for a refund. And it one
    wasn't forthcoming...

    1. Re:Can you say Class Action Lawsuit? by Anonymous Coward · · Score: 0

      I don't see that as a potential problem at all, they will replace it if you want to (and perhaps get a Madison).

  68. Um, what? by jpmorgan · · Score: 1

    *sigh* Mod parent down....

    1. Re:Um, what? by Anonymous Coward · · Score: 0

      Why should the parent be modded down just because you're too stupid to understand basic assembly?

    2. Re:Um, what? by Anonymous Coward · · Score: 0

      Because, retard, that isn't IA64 assembly. mov is not an instruction and eeax is not a register.

  69. Robert Nambla Malda by Anonymous Coward · · Score: 0

    Robert Nambla Malda

  70. Testing by Anonymous Coward · · Score: 0

    Harware has the advantage of well-defined specifications.

    Riddle me this, what should the output be in Word Processor A when the user presses the 'r' key?

    Answer: It depends. An 'r' should be inserted into the document, unless shift is depressed then is should be a capital r, unless the menus are selected, then a 30x80 pixel menu should scroll down smoothly at the rate of 10px/sec display certain text. . . .

  71. Makes you wonder.... by El+Cubano · · Score: 1

    They recommend working around the problem by underclocking the processor to run at 800 MHz instead of its default 900 MHz or 1 GHz

    I just want to see them recommend this AFTER they start incorporating their new patented anti-clock speed changing technology into all of their chips.

    1. Re:Makes you wonder.... by keller · · Score: 1
      ...patented anti-clock speed changing technology...

      I hope I won't live to see the day when I'm no longer able to set the speed of my anti-clock. I hate when I can't control the time-travel device.

      --

      Enig? Det alt for hot det smor!

    2. Re:Makes you wonder.... by EpsCylonB · · Score: 1

      prolly won't implement it until the Itanium chips are fully grown up like the x86 chips are.

  72. Is this that problem Compaq (IIRC) had? by VortexVertigo · · Score: 1

    I remember Compaq or maybe Dell delayed their Itanic systems while everyone else was going whole hog shipping them. Could this be it?

  73. Intel QA by sharkey · · Score: 2, Funny

    Where quality is job 1.99904274017.

    --

    --
    "Outlook not so good." That magic 8-ball knows everything! I'll ask about Exchange Server next.
  74. Link? by AvengerXP · · Score: 1

    How the hell is downclocking going to protect you from someone sending 'a specific set of operations in a specific sequence with specific data' to the processor? Spindoctors, i choose you!

    --
    Trolls dont like to be Flamebait, because they burn so well. Protect our Troll heritage!
  75. Glad they got the info out by 1nv4d3r · · Score: 3, Funny

    Luckily, all of the Itanium 2 owners have been contacted, and both of them had not yet experienced data corruption.

  76. apples and oranges by pete0t2 · · Score: 1

    Well, for one, a hardware bug is very easy to define. The chips may be complex, but in the end they have relatively few well defined operations. Software, on the other hand, has almost infinite poorly defined operations and it's not even possible to define what all software bugs are.

  77. Intel Itanic Systems== by Anonymous Coward · · Score: 0

    Not cheap, not reliable.

  78. Does the bug fry the hardware? by grimsweep · · Score: 1
    I hate to say it, but after removing the second smoldering AMD processor from my chasis, I don't think I'll be swaying from Intel anytime soon. And I thought I was getting a deal with the KT133A chipset... *shrug*


    Sans the liquid varients, there doesn't seem to be any such thing as 'adequate' cooling on an AMD T-Bird in Texas during the summer. Sure, the last few AMD processor generations seem relatively bug-free, but what's the point of a 'flawless' processor if it only lasts me a year?


    Sigh.....*waits for the new P4 and MB to arrivde*

    1. Re:Does the bug fry the hardware? by Loki_1929 · · Score: 1

      Try the stock retail fan. They last me just fine, and my machine runs in 100+ degree weather a lot. If your CPUs continue to die, there is a different problem. While thermal protection was a missing feature of AMD some time ago, the problem has since been addressed on a number of fronts. Personally, I'd rather go with a fairly bug-free chip that's cheaper and more powerful than one with a corporate logo attached to its 50 gazillion stage pipeline.

      --
      -- "Government is the great fiction through which everybody endeavors to live at the expense of everybody else."
    2. Re:Does the bug fry the hardware? by lscotte · · Score: 1
      I hate to say it, but after removing the second smoldering AMD processor from my chasis, I don't think I'll be swaying from Intel anytime soon.

      That's why you are supposed to put that block of metal and the spinny thing on top. Duh! :-)
      --
      This post is licensed under the Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 License.
    3. Re:Does the bug fry the hardware? by grimsweep · · Score: 1
      I did some research at the time, and the best investment I could come up with was a Dragon Orb 3. It's a two-stage cooling system. The portion that actually attaches to the processor is a passive cooling heatsink with air channels. The second stage is a 7000 RPM fan, with it's own air flow plus the distribution thereof to the lower stage. At the time of it's purchase, it was considered one of the better cooling systems for AMD processors.

      The fan is still functional. I plan to pop a cheap duron in the MB soon enough for a Linux box. Until then, I'm going to find creative ways to use my defunct athlons. Perhaps they'll make a nice pair of earrings, as a belated Mother's Day gift.... : /

  79. Chips and Ketchup by ratfynk · · Score: 1

    Whenever I get a glitch I scratch then bitch.

    How the heck do they put em out so fast and get away with users needing to clock them down?
    Is it to prevent overclockers from doing processor fried chicken to their chips. Keep it up Intel you will make Hienz ketchup millions. Or maybe you can serve them up with big Macs.

    Though somehow I do not thing Steve is going to want his big macs to include intel chips, especially if they are too easy to cook!

    --
    OH THE SHAME I fell off the wagon and use sigs again!
  80. Re:Performance: Itanium 2 vs. UltraSPARC III by StressedEd · · Score: 1

    maybe even in the order-of-magnitude range.

    I agree. Statements involving the phrase "orders of magnitude" are banded around without any thought idea of what it means.

    If you are talking an "order of magnitude" you mean a "factor of ten", if you say "two orders of magnitude" you mean a factor of one hundred (etc).

    http://mathworld.wolfram.com/OrderofMagnitude.ht ml

    So a SPECint of 1081 v.s. a SPECint of 637 does not even differ by one one order of magnitude. 637 ~ 1081, more like a factor of two, significant but not as earth shattering as, say three orders of magnitude (1000)! ;-)

    --
    Be nice to people on the way up. You will meet them again on your way down!
  81. Bugs by tundog · · Score: 1

    Jesus! Talk about a difficult bug to find. I would be interested in how they figured out which instructions in which order with what data.

    --
    All your base are belong to us!
  82. Wait.. by MasTRE · · Score: 1

    Hasn't Intel championed anti-underclocking technology?

    --
    Must-not-watch TV!
  83. Order of Magnitude: TPC-C by reporter · · Score: 1
    The Itanium system does 660,000 transactions per minute on the TPC-C benchmark. The UltraSPARC system does 67,000 transactions per minute. So, yes, the Itanium is about 1 order of magnitude faster than the UltraSPARC. The results can be found at the "Transaction Processing Council".

    The Sun system uses the UltraSPARC II. Since Sun has refused to disclose the TPC-C score for the UltraSPARC III, we can only conclude that the UltraSPARC III does approximately as well as the UltraSPARC II.