Slashdot Mirror


Intel's Atom C2000 Chips Are Bricking Products -- And It's Not Just Cisco Hit (theregister.co.uk)

Thomas Claburn, reporting for The Register: Intel's Atom C2000 processor family has a fault that effectively bricks devices, costing the company a significant amount of money to correct. But the semiconductor giant won't disclose precisely how many chips are affected nor which products are at risk. In its Q4 2016 earnings call earlier this month, chief financial officer Robert Swan said a product issue limited profitability during the quarter, forcing the biz to set aside a pot of cash to deal with the problem. "We were observing a product quality issue in the fourth quarter with slightly higher expected failure rates under certain use and time constraints, and we established a reserve to deal with that," he said. "We think we have it relatively well-bounded with a minor design fix that we're working with our clients to resolve." Coincidentally, Cisco last week issued an advisory warning that several of its routing, optical networking, security and switch products sold prior to November 16, 2016 contain a faulty clock component that is likely to fail at an accelerated rate after 18 months of operation. Cisco at the time declined to name the supplier of that component.

6 of 59 comments (clear)

  1. Intel dropping the ball by Eravnrekaree · · Score: 4, Insightful

    Intel for the past decade has dropped the ball. Its missing the boat on mobile and failing to push x86 chips into mobile phones has weakened their entire platform which really needs to be an "everywhere" platform. It has been clear for a while that mobile would be a majority of CPUs for a decade, why it has not pushed x86 into more phones is beyond me. Its totally incompetent, especially given x86 binary compatability between desktop and mobile could be a selling point

    1. Re:Intel dropping the ball by drinkypoo · · Score: 5, Interesting

      It has been clear for a while that mobile would be a majority of CPUs for a decade, why it has not pushed x86 into more phones is beyond me.

      Because Intel cannot make a really low-power processor. They keep trying, and keep failing. Remember XScale? It was the fastest ARM implementation at the time, but it was also the most power-hungry.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    2. Re:Intel dropping the ball by TheGratefulNet · · Score: 3, Interesting

      check out intel's curie module (their really BAD arduino chip found in the arudino101 board). its a horrible chip and intel actually made that chip the center of their reality tv show 'americas greatest makers'. little known fact: the contestants on that show tried using the chip and all its features and almost everyone failed, even with intel's help. I can't tell you how I know, but I know this for a fact.

      a prime time tv event and intel farked it up.

      only intel could make such a mess and squander such a good chance to get a message out and create some buzz and goodwill. 'makers' still refuse to use intel for many reasons and the biggest: intel still has NO CLUE what the maker market is really about. curie is not a maker chip and so far, no one really is taking that chip seriously.

      that's one example. but it seems typical of intel these days.

      --

      --
      "It is now safe to switch off your computer."
    3. Re:Intel dropping the ball by networkBoy · · Score: 3, Interesting

      As a *former* Intel engineer I can tell you a little more about this bit:

      only intel could make such a mess and squander such a good chance to get a message out and create some buzz and goodwill. 'makers' still refuse to use intel for many reasons and the biggest: intel still has NO CLUE what the maker market is really about. curie is not a maker chip and so far, no one really is taking that chip seriously.

      Of us engineers in the trenches we had *many* makers, hackers, and all around nerds. Problem is very shortly up the food chain the view changes drastically. Marketing and management are generally clueless about it and adamantly refuse to listen to the real hackers in the company.

      As to how to ass up a design? Think of Intel as a medieval feudal system. Each Earl has his dukes, each duke has his lands with the peasants.
      Well, obviously the peasants can move as long as they're not indentured (A.K.A, can't move for a year after moving), but the dukes don't like it, because with less peasants they can't produce enough for the taxmen (from the Earls).

      The solution? put your thumb in other duke's pies and force a design by committee; erstwhile not actually sharing useful information to the product team because a competing duke is the figurehead of the project.

      There is everything from passive aggressive to cloak and dagger interference that happens. (though normally it's just the PA Asshattery.)

      meh, glad I'm out. They offered me money to go away forever. I grabbed it (and my super ergo office chair) with both hands and bolted.

      --
      whois gawk date unzip strip find touch finger mount join nice man top fsck grep eject more yes exit umount sleep dump
  2. Not to worry by Waffle+Iron · · Score: 3, Funny

    Once you get a replacement CPU from Intel, it's easy to upgrade your system.

    Get a small screwdriver, and insert it in the gap under the chip near pin 1. Gently rock the CPU out of its DIP socket; you may have to alternate pulling at each end of the chip.

    The new chip's legs will be slightly splayed for use with automatic pick-and-place machines. You may need to gently bend them inwards before proceeding. Making sure that pin 1 is aligned with the marker on the motherboard silkscreen, gently push the new CPU straight down into the DIP socket. Your system is fixed!

  3. Immediate repair is cost of a virgin motherboard by Anonymous Coward · · Score: 3, Interesting

    Can't post to The Register, since they don't have ACs.

    Anyway, the issue is damage to the LPC (low-pin-count) bus clock line. This is a secondary bus where you hang old ISA-style devices, like the system FLASH. If the FLASH is the only thing in there, it will mostly render the system unbootable (so, stuff that never gets power-cycled would just keep going). But LPC can generate interrupts, and one often hangs other crap to that bus, such as i2c controllers for hot-swap bays, motherboard management controllers, and other sensors. In that case, you can expect severe runtime misbehavior.

    The issue is caused by *continuous degradation due to use*, so repairing it is easy, if costly: replace the motherboard with a new one under warranty (and even if out of warranty period wherever this kind of "stealth" manufacturing defect is not subject to warranty time period limitations, such as in Brazil). It will "reset" the counter. This is your zero-day solution to the issue.

    Depending on time-to-market for the new stepping (hardware revision) B1/C0 of the Atom C2000, you might need an interim solution, which is the "platform-level change", i.e. redesigned board with extra components that work around Intel's hardware design error. As soon as you have these, you start using these to replace any boards returned due to the defect, or start a "recall" to preemptively replace boards.

    Depending on the total cost of the board plus other components, you keep the old boards you replaced around, and when revision B1/C0 of the Atom C2000 is out, you BGA-replace them in a factory (about US$ 25 per board in large volumes, if that much), maybe replace any liquid electrolytic capacitors and other crap that ages badly, and use the boards either as new or as refurbished, depending on your corporate/regulatory ethics. This kind of repair almost always really resets the boards MTBF. If Intel supplies the replacement Atoms at no charge, the cost of repair might well be far less than the cost of the production run for boards you'd want to keep around for warranty services, anyway.

    Mind you, at 1.5 years per failure, it will be rare the legislation/contract that forces more than one replacement... so, let's hope they don't replace a faulty board with a brand-new virgin but-still-timebombed board. You'd have trouble to replace it a second time if it fails after the warranty period.