Slashdot Mirror


Sandy Bridge Chipset Shipments Halted Due To Bug

J. Dzhugashvili writes "Early adopters of Intel's new Sandy Bridge processors, beware. Intel has discovered a flaw in the 6-series chipsets that accompany the new processors. The flaw causes Serial ATA performance to 'degrade over time' in 'some cases.' Although Intel claims 'relatively few' customers are affected, it has stopped shipments of these chipsets and started making a revised version of the silicon, which won't be ready until late February. Intel expects to lose $300 million in revenue because of the problem, and it's bracing for repair and replacement costs of $700 million."

212 comments

  1. How many heads will roll? by Anonymous Coward · · Score: 0

    How many heads will roll?

    1. Re:How many heads will roll? by Andrewkov · · Score: 2

      I heard Will I Am is so disgusted he's canceling his contract..

    2. Re:How many heads will roll? by Anonymous Coward · · Score: 2, Funny

      32 Intel people will be fired.... unless Hyperthreading is enabled, in which case 64.

    3. Re:How many heads will roll? by Anonymous Coward · · Score: 1

      Rumor is that 63.99999999947365 people are going to be fired.

  2. Dell by DoofusOfDeath · · Score: 1

    I'm been anxiously waiting for Dell to release its Mobile Precision Workstation with a Sandy Bridge processor.

    I had been cursing Dell for their slowness, but I guess it was a blessing.

  3. First Stalin, now this. You Georgians, I swear. by falzer · · Score: 0

    Thank you for the "scare quotes." I wasn't sure what slant I should have "read into" this "summary."

    1. Re:First Stalin, now this. You Georgians, I swear. by Chemicles · · Score: 4, Informative

      Or, they could be actual quotes from the company's actual press release.

    2. Re:First Stalin, now this. You Georgians, I swear. by Anonymous Coward · · Score: 0

      Not XOR. Noted.

    3. Re:First Stalin, now this. You Georgians, I swear. by PNutts · · Score: 1

      "LASER" - Alan Parsons Project

  4. Turns out they violated a Microsoft Patent by Anonymous Coward · · Score: 4, Funny

    They patented slowly degrading performance over time many years ago. It's a key feature built into Windows.

    1. Re:Turns out they violated a Microsoft Patent by gstrickler · · Score: 1

      Funny. Actually, hilarious. +1

      However, most of the slowdown over time is due to Internet Explorer. The problem was made worse by integrating IE into Windows rather than making it an app like ever other app. Reduce the IE cache size to 10MB and set "empty temporary items on exit" and Windows performance doesn't degrade nearly as badly. It'll still degrade due to installing crapware and every program installing it's own notification/task bar utility that runs at startup, and it may degrade if you don't defragment the HD periodically, but the IE cache is the biggest culprit. Of course, if you only use IE for Windows Update, then IE's cache isn't as much of a factor.

      --
      make imaginary.friends COUNT=100 VISIBLE=false
    2. Re:Turns out they violated a Microsoft Patent by Anonymous Coward · · Score: 0

      They patented slowly degrading performance over time many years ago. It's a key feature built into Windows.

      So Microsoft will sue them for this in addition the massive support costs generated as hundreds of thousands of end users call near-simultaneously to beg that they be allowed to re-activate Windows after motherboard replacements?

    3. Re:Turns out they violated a Microsoft Patent by denis-The-menace · · Score: 1

      It *IS* a key feature built into Windows to drive sales!

      I have a 5year-old installation of XP-home-Ed. (Compaq Presario ???? laptop)
      -We use FF, not IE
      -have little software on it
      -HD is 80% free and de-fragmented, temp folder is MT
      -Runs 99% of the time with User Account
      -Takes 5 minutes to get to the internet from boot
      XP Sucks

      Thinking Linux but laptop needs drivers to make volume buttons work!

      --
      Obama's legacy: (N)othing (S)ecure (A)nywhere and (T)error (S)imulation (A)dministration
    4. Re:Turns out they violated a Microsoft Patent by Anonymous Coward · · Score: 0

      Maybe while they are at it... they can remove the DRM shit too... and save even more money by not shipping stuff that people don't want?

    5. Re:Turns out they violated a Microsoft Patent by Anonymous Coward · · Score: 0

      Actually they've struck a deal. The new ie hardware accelerators for this degrading performance over time have already been released in beta.

    6. Re:Turns out they violated a Microsoft Patent by d6 · · Score: 1

      After the umpteenth crash on my Toshiba laptop running win7, I upgraded it to slack 13.1.
      The volume/mute controls worked out of the box. - this is on a approx one year old laptop.
      Try googling your particular laptop + linux - you might find it a painless upgrade.

    7. Re:Turns out they violated a Microsoft Patent by purpledinoz · · Score: 1
      You can easily disable this feature with this command:

      format c:

    8. Re:Turns out they violated a Microsoft Patent by Stuarticus · · Score: 1

      Safety: To ensure it works properly afterwards make you use the /s switch! Format c: /s

      --
      If you think someone isn't free to have a different definition of "freedom" you may be a tyrant.
    9. Re:Turns out they violated a Microsoft Patent by toddestan · · Score: 1

      Another huge culprit is the dynamic sizing of the page file. Not only does this fragment the hard drive terribly, you end up with a badly fragmented page file that can't be defragmented using Window's own built-in defraggfer (as it can't defrag files that are in use). Best thing to do with the page file is to set it to some large fixed size and leave it there. And download a boot-time defragmenter like Defraggler to get your page file back into one continuous chunk. You can also try Page Defrag but that apparently doesn't work on anything newer than XP.

    10. Re:Turns out they violated a Microsoft Patent by Fjandr · · Score: 1

      Aside from corporate OEMs, there are system builders who didn't stop using dynamic paging in Windows 10+ years ago?

  5. Given Intel's reaction... by gstrickler · · Score: 1

    ...it looks like they learned from the Pentium FP fiasco and are handling this one correctly.

    --
    make imaginary.friends COUNT=100 VISIBLE=false
    1. Re:Given Intel's reaction... by fuzzyfuzzyfungus · · Score: 4, Insightful

      Or this one is much more serious... The Pentium FP one was a big issue because of how cagey Intel was about it(and was a genuine problem for users who had purchased it for certain FP heavy operations); but it was a deterministic logical bug: as long as you avoided a fairly specific set of trigger conditions, it would stay safely contained(for certain customers, doing so would likely be so onerous as to qualify as unacceptable; but for everybody else not so scary).

      What makes the hair on the back of my neck stand up about this one is the "may gradually degrade" stuff. That makes it sound much less like the "100% of people who do X get bitten/0% of others do" logical bugs and more like the "component degradation in the field can be unpredictable, except at a population level" type of bug that, say, happened to Nvidia not too long back...

    2. Re:Given Intel's reaction... by Zocalo · · Score: 2

      Yeah, it does seem that way, which makes a change. Even though Intel has been here before, it still good to see a company just 'fessing up and dealing with a mistake like this for a change - unlike Dell's blatant denials about their faulty motherboards and Apple's "Antenna Gate".

      It's such a shame that they didn't also learn from the much earlier lesson about building on a foundation of rock instead of sand. If only they'd gone with "Rocky Bridge"... :)

      --
      UNIX? They're not even circumcised! Savages!
    3. Re:Given Intel's reaction... by aiht · · Score: 1

      I thought all computer chips were built on a foundation of sand.

  6. "Relatively Few" customers affected? by ckblackm · · Score: 1

    Isn't that the same line they fed the public about the Pentium FP bug?

    1. Re:"Relatively Few" customers affected? by Anonymous Coward · · Score: 1

      Motherboards with this chipset have been shipping for something like 3 weeks. They got the bug busted early, before large volume shipments began.

      This will obviously delay those large volume shipments and could leave a mark at Intel and various mobo/oem manufacturers.

    2. Re:"Relatively Few" customers affected? by RightSaidFred99 · · Score: 2

      Yes, if by "line" you mean "unarguably true statement".

    3. Re:"Relatively Few" customers affected? by QuantumBeep · · Score: 1

      Replying to highlight.

      Just because everybody says something doesn't mean it isn't true. In fact, true things tend to get said a lot!

  7. Intel caught this one first? by CajunArson · · Score: 4, Insightful

    I don't recall seeing any complaints online about degraded SATA performance, so it looks like Intel caught this internally and took the appropriate action before the issue became widespread in the wild. The bug sucks but it just goes to show how difficult it can be to test complex hardware under all situations. Kudos to Intel for being proactive... they have learned from the FDIV bug fiasco, and some other companies with fruity logos might learn from the example.

    --
    AntiFA: An abbreviation for Anti First Amendment.
    1. Re:Intel caught this one first? by alen · · Score: 2

      there was a rumor that new MacBook Pro's were going to be released tomorrow. if true it could have been Apple QA catching this at the last minute

    2. Re:Intel caught this one first? by Kjella · · Score: 1

      Yep, which makes it sort of odd.... none of the reviewers caught it, none of the early adopters seem to have caught it... yet it's so critical it justifies halting production and starting over with fresh silicon. Granted they don't test the controller that much but at least some of the file tests would. I bet they're all scrambling to find out now though and we'll know in a day or two.

      --
      Live today, because you never know what tomorrow brings
    3. Re:Intel caught this one first? by Anonymous Coward · · Score: 2, Interesting

      If its released tomorrow then Apple QA (early adopters) haven't received the new MacBook Pro yet.

    4. Re:Intel caught this one first? by Anonymous Coward · · Score: 0

      Will you apple fanboys ever get a clue. If apple had caught this bug, apple would not be releasing anything using this chipset. If they did, they might have *gasp* a defective apple product. That is not allowed. All the blame would be on intel (of course apple never does anything wrong), but the bad press would still hurt a little.

    5. Re:Intel caught this one first? by petermgreen · · Score: 3, Interesting

      Someone else linked to a post that claims it only supports ports 2-5 (the 3Gbps ports) not ports 0-1 (the 6Gbps ports). Most systems won't be stressing ports 2-5 that heavilly.

      Plus if this is indeed a gradual degredation issue it may be that most people simply haven't been using the systems long enough for it to become noticable yet.

      --
      note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
    6. Re:Intel caught this one first? by petermgreen · · Score: 1

      That should have said only affects ports 2-5.

      --
      note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
    7. Re:Intel caught this one first? by Ecuador · · Score: 5, Informative

      According to Anand's coverage, Intel said that they started getting customer complaints after they had shipped about 100k units, and their engineers managed to duplicate the problem early last week, the cause of which they figured out in a couple of days.

      Source : http://www.anandtech.com/show/4142/intel-discovers-bug-in-6series-chipset-begins-recall

      --
      Violence is the last refuge of the incompetent. Polar Scope Align for iOS
    8. Re:Intel caught this one first? by alvinrod · · Score: 2

      Yeah, but there's always a rumor about new MacBook Pro's being released tomorrow.

      If there wasn't, some rumor site is probably going to source our posts as possible evidence of a release tomorrow.

    9. Re:Intel caught this one first? by TheGratefulNet · · Score: 1, Insightful

      integration can be evil.

      now, if they had separate chips on the mobo for sata then the damage would have been contained and you simply disable the onboard controller and install a pci-e card instead. intel would make new mobos but NOT chipsets. chipsets are a big undertaking.

      also, I wonder if there were engineers in intel who said 'hey, whats with this too-fast churn of new socket types and chipsets? didn't we JUST release, not long ago, sockets for i3/i5? what wrong with using them again?'

      then some intel product mgr probably spoke up 'but we can get users to rebuy ALL new hardware instead of just a new cpu!'.

      money won out over logic and reason.

      well, intel LOST big-time on this cash grab move. now, since they did NOT leverage the older chipsets like a normal thinking company might, they can't even sell bare cpus right now. haha!

      lesson: don't put all your eggs in one basket. sometimes ultra integration will bite you in the arse.

      --

      --
      "It is now safe to switch off your computer."
    10. Re:Intel caught this one first? by Anonymous Coward · · Score: 0

      proactive isn't supposed to be a word.

      active or passive plz.

    11. Re:Intel caught this one first? by ilsaloving · · Score: 1

      You make it sound like striving to have non-defective products is a *bad* thing...

    12. Re:Intel caught this one first? by Zebedeu · · Score: 2

      Yes, and it was Steve Jobs who personally found and reported the bug to Intel while testing the new MacBook model.

      In fact, he's not even sick, he just needed time to concentrate on finding out why his beloved new mac model was behaving weirdly.

      Look for news of his holy return tomorrow.

    13. Re:Intel caught this one first? by Anonymous Coward · · Score: 0

      Actually, I was going for the fanboys thinking that apple is the greatest thing ever. They think that apple's QA caught this error and that is why intel is doing this.

      As for non defective parts, that is what everyone wants. If you think that apple does not have it's share of defective parts, I would like to show you the box of bad hard drives, RAM modules, and other replaceable parts that I got from apple computers. Unless the vender is making said bad part, everyone gets a bad parts from time to time. Last I checked, apple (dell, hp, acer, [name of computer company here]) assemble computer parts for their laptops and desktops. IBM when they made laptops might have been the exception. But not on all the parts, just some of them. If your 3 month old laptop's hard drive failed, do not blame apple, dell, hp, or who you got the laptop from. Blame the company that made the hard drive.

    14. Re:Intel caught this one first? by Misch · · Score: 1

      According to Anandtech, it wasn't an internal catch, it was external.

      Intel mentioned that after it had built over 100,000 chipsets it started to get some complaints from its customers about failures. Early last week Intel duplicated and confirmed the failure in house.

      --

      --You will rephrase your request for me to go to hell. Goto statements are not acceptable programming constructs
    15. Re:Intel caught this one first? by Anonymous Coward · · Score: 0

      They'll take the 3Gbps ports out, leave just the two 6Gbps ports, and relabel the motherboards "value", "certified for Celeron" or something, and ship them back to the market, cut their losses.

    16. Re:Intel caught this one first? by StikyPad · · Score: 1

      Bravo to Intel for owning up to this an correcting it so quickly. It's a pleasant departure from so many manufacturers these days who either refuse to acknowledge a problem or else acknowledge it and refuse to correct it.

    17. Re:Intel caught this one first? by vivek7006 · · Score: 2

      Mod Parent up. Steve Jobs just replied to my email from his iPad confirming exactly what the parent just said!

    18. Re:Intel caught this one first? by m50d · · Score: 1

      now, if they had separate chips on the mobo for sata then the damage would have been contained and you simply disable the onboard controller and install a pci-e card instead.

      You can still do that.

      And realistically, normal people don't upgrade their computers piecemeal anymore - it's not worth the effort.

      --
      I am trolling
    19. Re:Intel caught this one first? by raftpeople · · Score: 1

      proactive isn't supposed to be a word.

      Sounds like we better go back to 1930 and keep it from being invented then.

    20. Re:Intel caught this one first? by QuantumBeep · · Score: 1

      Intel values its business, government, and enterprise segments. There can be no chronically flaky hardware at all, from one end of Intel's product range to the other.

      To counter the integration issue, heavy integration is a really good thing for reliability and cost control. Remember 1990? Remember failed video cards, failed disk controllers, failed sound cards, failed network adapters? There were a lot more of those, added together, than there are outright motherboard failures today. In fact, I suspect those failures have been absorbed entirely, with motherboard failures becoming no more frequent. That's a good thing.

    21. Re:Intel caught this one first? by QuantumBeep · · Score: 1

      It's a word. Deal with it.

      Or go back to WoW trade chat where you sound like you belong. I don't care.

    22. Re:Intel caught this one first? by Ecuador · · Score: 1

      The last time they had a serious error, they were trying to avoid it by saying that if you were not doing serious number theory research you would never notice. That didn't go well (and they later had to do the recall anyway), so they are not making such a mistake twice!

      --
      Violence is the last refuge of the incompetent. Polar Scope Align for iOS
    23. Re:Intel caught this one first? by fuzzyfuzzyfungus · · Score: 1

      So long as they are upfront about the specs of what they are selling, more power to them... Everyone who was sold a product with both 0-1 and 2-5 has a right to demand that Intel make it up to them; but there is nothing wrong with selling motherboards that only have two SATA ports on them.

      2 SATA ports covers a decent chunk of the "value" segment(1 DVDRW, 1HDD), never mind the people who were going to fill every PCIe slot with SATA controllers anyway, and wouldn't mind a nice discount on the (seriously fast) second gen i5's motherboards...

      It would sure beat having them go to the dump.

    24. Re:Intel caught this one first? by Anonymous Coward · · Score: 0

      Good god, I'm so tired of people whining "they're making me buy a new motherboard." The only type of people who upgrade a cpu independent of the motherboard are people who want cutting edge. If you want cutting edge, expect to pay for it.

      They added a CRAPTON of new stuff to the latest generation. It wasn't possible to add all that, and retain backwards compatibility. It wasn't a cashgrab move, it was a "this is the only way to get all these features in" move.

      The integration you're probably talking about, having the GPU and CPU on the same silicon, made MASSIVE improvements to the GPU. I have been reading that unless you intend to play new games, the onboard GPU will save you from buying a 50$ dedicated graphics card. That and the improvements to encoding / decoding were quite impressive.

      If you don't want to buy the new features, don't buy the new hardware. quit being a whiny bitch.

    25. Re:Intel caught this one first? by Anonymous Coward · · Score: 0

      Indeed. It's nice to see someone at Intel understands that owning up to this early is cheaper in the long run. I'm a bit fucked off as I was intending to purchase one of those within the next couple of weeks, but it now looks like I'll be waiting for another month (maybe longer, depending on how long it takes to get them down to New Zealand...) for my new PC :(

    26. Re:Intel caught this one first? by AmiMoJo · · Score: 1

      This is why buying the latest hardware as soon as it comes out is a bad idea. The people doing early reviews just run a load of benchmarks and don't have time to discover longer term issues. None of the early SSD reviews caught the severe degradation over time issue, and how long did the FDIV bug take to come to light? It took months for nVidia chipsets to start failing in laptops (and despite every HP laptop made for about three years having that fault it still isn't widely known).

      It is good that Intel is fixing the problem, but I wonder what it means for someone who owns an affected motherboard. Sending it back to the manufacturer for exchange leaves you without a computer for possibly a few weeks.

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
  8. Bracing for costs? this is an improvement... by maroonhat · · Score: 2

    At least I don't have to prove I _need_ high speed SATA performance to get a replacement... clearly SATA is more important than _DIVISION_...

    --
    The more I learn about Windows the more I am surprised it runs at all
    1. Re:Bracing for costs? this is an improvement... by mr_mischief · · Score: 1

      Funnily enough, the 6Gb/s ports are reported to be fine. it's the 3 Gb/s ports that degrade. If you only need as many ports as the highest speed the chipset supports you may be fine.

  9. Over time? by Caviller · · Score: 3, Interesting

    I RTFA and I for the life of me can't figure out if it's a "The longer the uptime the worse the degrading...and a reboot will start the process over?" or "You will use this and it will get worse and worse untill the chip burns out..."

    I hope to god it's the first one...If not this might beat the floating point error by a mile!

    1. Re:Over time? by LWATCDR · · Score: 1

      My guess is that it is the second. That is why Intel is going to replace all the motherboards. This is going to be one big PITA for users.
      This is why one should never pay for the latests and greatest if you don't need it.

      --
      See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
    2. Re:Over time? by gstrickler · · Score: 2

      I can't imagine a hardware bug that would manifest only as degraded performance after extended uptime. Anything of that nature could probably be worked around with a software fix that periodically reset the controller. Therefore, I think it's safe to assume it's literally the SATA logic degrading with age, which would require a chip level change.

      --
      make imaginary.friends COUNT=100 VISIBLE=false
    3. Re:Over time? by bluefoxlucid · · Score: 1

      Why replace when you can send a $10 SATA PCI card?

    4. Re:Over time? by jgagnon · · Score: 1

      Maybe it's a "flash cache thrash" that wears out and degrades performance? :p

      --
      Remember to maintain your supply of /facepalm oil to prevent chafing.
    5. Re:Over time? by EvilIdler · · Score: 1

      The chipset is also used in laptops, according to the various articles out there. Make that an ExpressCard or something :P

    6. Re:Over time? by Megane · · Score: 1

      One of the replies in TFA speculates that it might be a timing bug with the differential drivers. If the output transistors in both directions are enabled at the same time, too much current would flow through the transistors, slowly burning them out. It doesn't have to happen all the time, or for very long each time, just often enough to burn out the output transistors over time.

      --
      #naabhaprzrag, #sverubfr-000, #agi-fcbafberq, negvpyr[pynff*=' negvpyr-ary-'] { qvfcynl: abar !vzcbegnag; }
    7. Re:Over time? by LWATCDR · · Score: 2

      Because you paid for X sata connectors and or you do not want to waste a slot and or you have put the motherboard in a 1u rack-mount case or a slim HTC case.
      And of course it could be in a laptop.

      It is broken so Intel is going to do the right thing and fix it. This is a good thing.

      --
      See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
    8. Re:Over time? by bluefoxlucid · · Score: 1

      In those cases you cannot send a $10 SATA PCI card because it is unfit for purpose (and more importantly the customer is not happy with that). Most desktops go to customers who generally won't care, as long as it works.

    9. Re:Over time? by LWATCDR · · Score: 1

      Or just send a motherboard that works like it should to everyone?
      Why offer a "good enough" fix when you can offer a real fix. Suppose the SATA card didn't work with Solaris? Intel is just going to fix it the right way from the start. It is just the right thing to do.

      --
      See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
    10. Re:Over time? by Anonymous Coward · · Score: 0

      My guess would be that this is due to electromigration: http://en.wikipedia.org/wiki/Electromigration

      Even though wires in the chip are metal (a solid), the metal still diffuses ever so slightly. The weak link is usually the vias (the interconnect between metal layers). As lots of electric current, heat, etc stress the metal, the metal starts to diffuse until it either looses electrical contact or shorts itself with another wire... hence complete failure.

      Designers need to follow a set of design rules very closely to ensure that too much current, heat, etc do not impact reliability, and a robust set of design rules need to be created to ensure this never happens.

      If electromigration is to blame, I'm honestly surprised this made it into production. There are lots of stress tests that should have detected this.

      It could also be due to solder bumps, etc causing an electric failure, but the price tag would likely not be as high.

    11. Re:Over time? by ShnowDoggie · · Score: 3, Informative

      The problem in the chipset was traced back to a transistor in the 3Gbps PLL clocking tree. The aforementioned transistor has a very thin gate oxide, which allows you to turn it on with a very low voltage. Unfortunately in this case Intel biased the transistor with too high of a voltage, resulting in higher than expected leakage current. Depending on the physical characteristics of the transistor the leakage current here can increase over time which can ultimately result in this failure on the 3Gbps ports.

      ~http://www.anandtech.com/show/4143/the-source-of-intels-cougar-point-sata-bug

    12. Re:Over time? by fuzzyfuzzyfungus · · Score: 1

      I certainly have nothing against Intel doing the honorable thing; but, in principle, it seems like there could be an optional arrangement that could be mutually beneficial... According to reports, this bug means that ports 2-5, the four 3GB/s ones, have an unacceptable risk of death. As someone who, unluckily, just purchased a system based on this chipset, I'm not looking forward to the mobo swap.

      Were intel to offer me a 4-port SATA card and a check for half of whatever they save by giving me the expansion card instead of the mobo swap, I'd happily pocket the money and save myself the hassle.

      Obviously, as you say, there are a number of situations where such an offer would be either technically implausible(laptops, small form factor stuff) or unacceptable to the customer; but there are likely a lot of basic towers and/or thrifty enthusiasts who might be amenable to a (voluntary) deal.

    13. Re:Over time? by adolf · · Score: 1

      If you're that worried about the pain of swapping a motherboard, just drop a few bucks on an SATA card and be done. They're not particularly expensive. If it were me, however, I'd want the thing fixed -- not patched.

      Meanwhile, I think the whole thing is just really rather very funny. Back in the dark ages of aftermarket 386 motherboards and early 486 VLB boards, it was common to find things that either never quite worked right or got worse over time, requiring various tweaks to work around them. You kids don't know how good you've got it with hardware these days, where stuff generally works fine.

      For that matter: As recently as a few years ago, boards were being made by a whole bunch of manufacturers with bad capacitors that would fail in fairly short order. Only a handful of companies ever even admitted that the problem existed, and extremely few took the extraordinary step of repairing or replacing them for free.

      It's nice to see a company doing the right thing, straight-away -- especially with raw PC components. Perhaps it will become a trend.

    14. Re:Over time? by LWATCDR · · Score: 1

      Maybe but the extra paper work makes it not worth it to Intel. That small percentage just are not worth the hassle and the bad press they would get by offering a "band aid" solution even if some people would want it.

      --
      See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
    15. Re:Over time? by Anonymous Coward · · Score: 0

      It is the second. The final fate of the SATA ports would be that they fail so badly the disk will be lost during the 10 seconds or so it takes to initialize. That implies the timer doesn't restart at reboots.

      It also means this is a physical failure, which I've never heard of before outside thermals. But this wouldn't be thermal or Intel would have provided customers with a (better) cooler for the chip. That's no big loss of face - sorry, usually these chips don't need it, this is a rare exception, here's a free cooler to stick on, etc.

  10. Does anybody else think this sounds ominous? by fuzzyfuzzyfungus · · Score: 3, Interesting

    Obviously, silicon bugs happen, barely anything makes it out of the fab without an 'errata' list as long as your leg; but the "may gradually degrade over time" part kind of freaks me out.

    If it were a "due to a design error, setting register xyz to 0xDEADBEEF causes Serious Badness, chipset drivers are being patched to Never Do That on rev.1 chipsets and future chipsets will be amended" that would be unfortunate; but so it goes. Fully deterministic errors, like the classic division bug, may be problematic; in some cases bad enough to qualify the product as just plain defective; but once known they can be mitigated by not stepping on them. Something that "sometimes" "gradually decreases" performance, on a bus with error correction, though, sounds a lot like a physical problem where some sort of silicon/electrical issue causes error rates to increase and thus retries/corrections to increase in frequency, and user-visible performance to go down. That makes me nervous. It sounds less like a deterministic error problem and more like a certain physical components are actually degrading much faster than expected problem...

    Can anybody think of an explanation for how a hardware bug would cause behavior that gradually changes over time(in a manner that couldn't be dealt with with a driver update) that doesn't involve the alarming possibility of gradually increasing error rates and/or early death of onboard SATA ports?

    1. Re:Does anybody else think this sounds ominous? by Viol8 · · Score: 1

      "Can anybody think of an explanation for how a hardware bug would cause behavior that gradually changes over time(in a manner that couldn't be dealt with with a driver update) that doesn't involve the alarming possibility of gradually increasing error rates and/or early death of onboard SATA ports?"

      Failure to reset some internal timer properly under certain conditions? I'm guessing.

    2. Re:Does anybody else think this sounds ominous? by pclminion · · Score: 5, Interesting

      It sounds less like a deterministic error problem and more like a certain physical components are actually degrading much faster than expected problem...

      Well, obviously. Could be a wire that was made too thin, or some component that overheats and slowly damages itself. I'm not sure why you think it's "ominous" though. It's a physical object that apparently has a design defect that causes it to wear out. I've seen ominous things before, but that usually involves a shadowy figure standing in a doorway with something that looks oddly like a machete, but dammit I can't really see clearly in this low light...

    3. Re:Does anybody else think this sounds ominous? by Jahava · · Score: 1

      Obviously, silicon bugs happen, barely anything makes it out of the fab without an 'errata' list as long as your leg; but the "may gradually degrade over time" part kind of freaks me out. If it were a "due to a design error, setting register xyz to 0xDEADBEEF causes Serious Badness, chipset drivers are being patched to Never Do That on rev.1 chipsets and future chipsets will be amended" that would be unfortunate; but so it goes. Fully deterministic errors, like the classic division bug, may be problematic; in some cases bad enough to qualify the product as just plain defective; but once known they can be mitigated by not stepping on them. Something that "sometimes" "gradually decreases" performance, on a bus with error correction, though, sounds a lot like a physical problem where some sort of silicon/electrical issue causes error rates to increase and thus retries/corrections to increase in frequency, and user-visible performance to go down. That makes me nervous. It sounds less like a deterministic error problem and more like a certain physical components are actually degrading much faster than expected problem... Can anybody think of an explanation for how a hardware bug would cause behavior that gradually changes over time(in a manner that couldn't be dealt with with a driver update) that doesn't involve the alarming possibility of gradually increasing error rates and/or early death of onboard SATA ports?

      Well, the flaw is with the Sandy Bridge chipset that accompanies the CPU, so it's not with the CPU itself. The http://en.wikipedia.org/wiki/Sandy_Bridge#Architecture includes, among other things, the IO and memory controller hub components, which control (between them) DMA and the actual SATA controller hardware and some of the relevant busses. I assume the problem is somewhere in there, and is hardware-related (i.e., not firmware-upgradeable).

    4. Re:Does anybody else think this sounds ominous? by clevelandguru · · Score: 1

      The question is if this problem is fixed with a power recycle? If not, The only way I can think of that this problem is in the digital logic is if the processor can save its states across power recycle. This sounds like a thermal hotspot in the processor that can cause physical damage over time.

    5. Re:Does anybody else think this sounds ominous? by Anonymous Coward · · Score: 1

      I posted over at the Tech Report my theory that it's a timing error in the output driver logic. This could cause--under rare circumstances--the logic to enable the high side and low side drivers at the same time. If it's just for a very small length of time, it can cause a current spike through the output transistors which will cause them to fail over time. It's just a theory.

    6. Re:Does anybody else think this sounds ominous? by Anonymous Coward · · Score: 0

      Something dependent on the thermal throttling feature built-in to Intel chips? Maybe there was a hotspot near a specific part of the chip that can be mitigated by shifting circuits around?

    7. Re:Does anybody else think this sounds ominous? by The+Salamander · · Score: 1

      Guess: Some sort of queue/FIFO with a bug in the read/write pointer logic that causes it to effectively decrease in depth over time.

      The bug is not severe enough to drop/corrupt data, which would have made finding the issue easier, but eventually performance suffers.

    8. Re:Does anybody else think this sounds ominous? by DarthVain · · Score: 1

      I read it could be a timing issue, if triggered sending too much juice too often to the SATA.

      From my own experience and common sense, anything to do with "degrading performance" sounds like a heat issue to me. Heat issues happen because of 3 causes: 1) Poor design of heat dissipation/mitigation in that they don't have a heat sink or enough of one to do the job, 2) Overvoltage in that a part is getting too much current, too often, resulting in heat, that is slowly breaking down the performance of the part, 3) Regulation failure, which basically controls the amount of current which results in overvoltage. All three are sort of the same thing really, bits getting too hot when they aren't designed to handle it.

      I think it is a good thing that Intel is doing the recall and has notified everyone. My fear would be unscrupulous hardware resellers trying to ditch these components on unsuspecting users. I wouldn't want to be buying a sandybridge pre-assembled in the next year or so. For users who buy their components individual, it is less serious, return your MB for a new one. You might be out a computer a few weeks, but at the same time your getting a new MB and you never know Manufacturers may even throw in a few extra features on the new one. Also the issue is about degrading performance, hopefully it isn't so fast that by the time you get your new one, you were never really effected by the issue in the first place. If your buying a new one, you just have to make sure of the version number of the MB you want to buy. Wait for the problem to get fixed and buy the fixed version 2.0 MB. Just be leary of sales of MB without clarifying version number. Some will likely be taken if they don't know about the issue, which is another reason I think it is good Intel came clean about it.

    9. Re:Does anybody else think this sounds ominous? by blair1q · · Score: 1

      that's not necessarily a lesser problem.

      chipsets are typically soldered onto motherboards, while CPUs are clipped into sockets.

      in order to install a free-replacement CPU, you flip a clip and put in the new part. 5-10 minutes for an inexperienced tech (60 seconds for a l33t h4xx0r whose system is never truly buttoned-up), and one new part to check out.

      in order to install a free-replacement chipset, you replace your motherboard. a couple of hours of unplugging, unscrewing, unpacking, re-screwing, chasing screws that fell off the desk, plugging back in, double-checking the plugging-in, going online to find a representative picture to be sure you put the memory in the optimal slots, and closing up the case. and then you have a hundred new parts to shake out.

    10. Re:Does anybody else think this sounds ominous? by Anonymous Coward · · Score: 0

      this is slashdot. a intel bug to them is pretty scary.

    11. Re:Does anybody else think this sounds ominous? by atisss · · Score: 1

      Apparently the 0xDEADBEEF will just become rotten

    12. Re:Does anybody else think this sounds ominous? by Jahava · · Score: 1

      that's not necessarily a lesser problem.

      chipsets are typically soldered onto motherboards, while CPUs are clipped into sockets.

      in order to install a free-replacement CPU, you flip a clip and put in the new part. 5-10 minutes for an inexperienced tech (60 seconds for a l33t h4xx0r whose system is never truly buttoned-up), and one new part to check out.

      in order to install a free-replacement chipset, you replace your motherboard. a couple of hours of unplugging, unscrewing, unpacking, re-screwing, chasing screws that fell off the desk, plugging back in, double-checking the plugging-in, going online to find a representative picture to be sure you put the memory in the optimal slots, and closing up the case. and then you have a hundred new parts to shake out.

      Don't get me wrong, I don't think it's a lesser problem. I got the tone from the OP that he was confused how a CPU issue could cause such a specific and isolated problem with SATA, so I was pointing out how the problem wasn't with the Sandy Bridge CPU, but rather with the supporting chipset.

    13. Re:Does anybody else think this sounds ominous? by Anonymous Coward · · Score: 0

      Can anybody think of an explanation for how a hardware bug would cause behavior that gradually changes over time(in a manner that couldn't be dealt with with a driver update) that doesn't involve the alarming possibility of gradually increasing error rates and/or early death of onboard SATA ports?

      Cases of components failing/glitching prematurely over time are ususally due to overcharging / electron migration.

    14. Re:Does anybody else think this sounds ominous? by sxeraverx · · Score: 1

      That would still be fixable by a reboot. More likely, it's that they made a wire too thin or a transistor (or a couple hundred) too big or too small, or some bad combination thereof. That in itself could make the logic not meet timing under certain conditions, but it wouldn't degrade over time.

      If things were really bad, this could create localized heat pockets which could damage transistors, altering their device parameters over time, causing them to miss timing margins more and more drastically.

    15. Re:Does anybody else think this sounds ominous? by Anonymous Coward · · Score: 0

      It could be electromigration of the metal connecting driver transistors. If the metal track is too tiny in cross-section, the high current density causes metal atoms to migrate over time, eventually opening the circuit.

    16. Re:Does anybody else think this sounds ominous? by yuhong · · Score: 1
    17. Re:Does anybody else think this sounds ominous? by Anonymous Coward · · Score: 0

      Yes, I can explain it. At the request of the Federal Govt, Intel designed into the controller a circuitry and firmware combination that would record certain ATA SECURITY commands. This includes the passwords used to perform SECURITY LOCK/UNLOCK on disk drives. Unfortunately, the implementation has a serious flaw that triggers a set of conditions that triggers a codepath that causes delays when trying to access the disk.

      Intel has a strict NDA on this type of thing.... they won't disclose it outside their elite ranks internally.

    18. Re:Does anybody else think this sounds ominous? by aiht · · Score: 1

      this is slashdot. a intel bug to them is pretty scary.

      What do you mean them?
      Everybody on slashdot except you and the GP?
      What makes you different, and why nobody else?

    19. Re:Does anybody else think this sounds ominous? by AmericanInKiev · · Score: 1

      I think his point is that the chip is non-deterministic. This is a rare defect which appears more mechanical (thermal or electrical "wear") than oops, a transistor is in the wrong place.

    20. Re:Does anybody else think this sounds ominous? by Anonymous Coward · · Score: 0

      Maybe you should have waited a few hours before posting your crackpot theories. The cause has been revealed:The PLL for the 3Gbps ports has transistors that are being over biased, which slowly fries the gate oxide.

      http://www.anandtech.com/show/4143/the-source-of-intels-cougar-point-sata-bug

    21. Re:Does anybody else think this sounds ominous? by CyberDragon777 · · Score: 1

      They are the AMD users.

      Both of them. :P

      --
      We both said a lot of things that you are going to regret.
  11. Hmm... by lostmongoose · · Score: 1

    Seems to me they had issues the last time they rushed a product to beat AMD, as well. Ghosts of the i820 MTH fiasco.

  12. Some additional information by dc29A · · Score: 3, Informative

    Apparently the problem is with SATA ports 2-5, at least for mobile motherboards. Every desktop board is affected.

  13. Re:Sucks to be them! by commodore6502 · · Score: 1

    >>>I'm still running a dual-core athlon 64. Processor/memory upgrades became overrated a few years ago.

    Agreed. I am still running a Pentium 4 at 3000 megahertz. When I experienced slowdown, I just doubled the memory and that eliminated the main problem (hard drive/virtual memory swapping). The only thing my P4 doesn't do is HD video, but I'm okay with that since my 700k connection doesn't do HD either.

    Now my Pentium 3(?) 700 MHz laptop is long in the tooth, and often runs too slow for my taste, but it is just a laptop. I don't use it much except for travel.

    As for Intel:
    1 billion dollar loss is a major suck. I doubt it will end-up costing that much though. When the original Pentium developed a floating-point bug, most users did not upgrade because it was not something they needed. That helped Intel save $$$ and probably the same will happen with this chipset too.

    --
    Information wants to be expensive AND wants to be free. So you have Value vs. Cheap distribution fighting each other.
  14. Re:Sucks to be them! by fuzzyfuzzyfungus · · Score: 2

    Depends on your workload. I'm typing this on a still-just-as-adequate-as-when-I-bought-it A64, plays games and everything; but when I put on my work hat, the fact that we can get more VMs into the same physical volume and power consumption with every generation(and, for annoyingly expensive software that is licensed per-socket, get substantially improved performance for peanuts hardware money) is reason to cheer...

  15. Sandy Bridge Closed Due to Erosion by BondGamer · · Score: 5, Funny

    Should have been article title.

  16. SATA early adoptors by jolyonr · · Score: 4, Funny

    Well, I guess this vindicates my decision to stick with MFM hard disks.

    --


    Please read my Canon EOS tech blog at http://www.everyothershot.com
    1. Re:SATA early adoptors by jolyonr · · Score: 1

      or ST-506 interface drives if you are just about to tell me that MFM is an encoding method rather than an interface. But they were called that back then. I remember!

      --


      Please read my Canon EOS tech blog at http://www.everyothershot.com
    2. Re:SATA early adoptors by Anonymous Coward · · Score: 0

      I wouldn't really call getting a SATA enabled motherboard in 2011 "early adoption", since the SATA standard was created in 2001.

    3. Re:SATA early adoptors by markass530 · · Score: 1

      I actually was a SATA "early adopter" back in early 2004 when I bought a computer. Had taken a "computer sabbatical" for several years in hawaii and was just purchasing a new one finally. Configured it without a floppy disk, which I was glad to never have to use again. Then a few months later when good ole' Windows XP takes a crap on me, I goto reinstall it, and XP requires a floppy drive to install the drivers for my sata drive... No thumb drive, burnt CD, nothing else, just a fucking floppy which I purposely left off my system build. f micorosft. fml

    4. Re:SATA early adoptors by TDO48 · · Score: 1

      Good old days...

      g=c800:5

      and my RLL controller getting me 30MB instead of the MFM-specd 20MB....

    5. Re:SATA early adoptors by Skater · · Score: 2

      I actually was a SATA "early adopter" back in early 2004 when I bought a computer. Had taken a "computer sabbatical" for several years in hawaii and was just purchasing a new one finally. Configured it without a floppy disk, which I was glad to never have to use again. Then a few months later when good ole' Windows XP takes a crap on me, I goto reinstall it, and XP requires a floppy drive to install the drivers for my sata drive... No thumb drive, burnt CD, nothing else, just a fucking floppy which I purposely left off my system build. f micorosft. fml

      I had a similar problem recently. First, I'm surprised you haven't gotten flamed yet like I did when I mentioned it. I said that I was having trouble getting it to work, but Linux was working fine on the machine, and I wasn't missing Windows XP - apparently that's worthy of flames about how I was stupid for trying to use an 8 year old OS on a new machine and I'd have as much trouble with Linux, etc.

      Anyway, it required slipstreaming XP on to a new DVD then reinstalling using that. It's not so bad as long as you have another machine on which to do it (I had my old laptop that I dug out of the closet for this purpose). You can mess with a bunch of options, but the first time I did it, I got a really wacky system setup without things like sound, so I recommend going with the normal settings. Just put SP3 and your drivers on the disc, and configure everything the old-fashioned way.

    6. Re:SATA early adoptors by sxeraverx · · Score: 2

      Successful troll is successful.

    7. Re:SATA early adoptors by greed · · Score: 1

      Yeah, that RLL stuff will never catch on. ST-506 FTW, the ST-412 is never gonna make it!

    8. Re:SATA early adoptors by MarkRose · · Score: 1

      Hard disk? Boy, what if it crashes? Have you thought of that? I keep everything on three separate floppies! Those new 5.25" disks are super compact!

      --
      Be relentless!
    9. Re:SATA early adoptors by Mr+Z · · Score: 1

      That's crazy! How can you read anything from them if your computer goes off line? Paper tape is the only way to fly.

    10. Re:SATA early adoptors by Anonymous Coward · · Score: 0

      That's good news then - you can upgrade your harddisk capacity and performace by 26/17, i.e. over 50% by upgrading to a RLL controller without changing the disks!

  17. Not all chipset affected by Anonymous Coward · · Score: 2, Interesting

    The systems with the affected support chips have only been shipping since January 9th and the company believes that relatively few consumers are impacted by this issue.

    Important details about shipment date lost in transcription.

    1. Re:Not all chipset affected by lowlymarine · · Score: 1, Interesting

      I'm pretty sure that's because Sandy Bridge chipsets have only been shipping since then. I'd assume that the "relatively few" customers affected by this are "everyone who has already purchased a SB board," kind of like how the "relatively few" customers affected by Bumpgate turned out to be "everyone with a G80 derivative" (which was "relatively few" of the set of "all nVidia customers ever," I suppose).

    2. Re:Not all chipset affected by Anonymous Coward · · Score: 1

      Correct me if I'm wrong, but wasn't that chipset released around January 9th. So...that would make it all chipsets.

  18. Excellent! While they're at it... by BLKMGK · · Score: 1

    Could they pretty please make a change that allows me to use the new H.264 encoding instructions without being forced to rely upon their nice but not nice enough video display capabilities? I'd LOVE to use the encoder speedups but if I'm forced to use their CPU as my GPU I may be forced to skip it. Everything I've read says that this is what I'll be forced to do - YUCK!

    --
    Build it, Drive it, Improve it! Hybridz.org
    1. Re:Excellent! While they're at it... by Kalroth · · Score: 1

      I believe this won't be possible before Intel releases their Z68 chipset, so you're out of luck if your motherboard got a P67 chipset.

      It is possible to do it with software on motherboards with the H67 chipset: http://www.anandtech.com/show/4113/lucid-enables-quick-sync-with-discrete-graphics-on-sandy-bridge

    2. Re:Excellent! While they're at it... by BLKMGK · · Score: 1

      Thank you, very helpful!

      I haven't YET purchased one of these - and a good thing it seems - as I've been waiting to figure this out. If I must wait for the Z67 then so be it. My requirements are that I be able to overclock (so I'll buy the unlocked CPU) and I want access to the encoder instructions. As it stands now nothing but commercial encoders are accessing this I believe as Intel came to the table VERY late for the x.264 guys. My fingers are crossed that this is solved by the time I'm ready to upgrade else I'll simply be stuck with a way faster CPU that clocks nearly 5ghz on air - and I run water :-) This bug will be a set back it seems but fingers crossed they get it solved quickly as this sounds like a very nice upgrade from my i7 clocked a bit over 4ghz. My I/O might actually become a bottleneck! :-O

      Anyway, I appreciate the links as this wasn't something I'd yet been able to find myself.

      --
      Build it, Drive it, Improve it! Hybridz.org
    3. Re:Excellent! While they're at it... by PNutts · · Score: 1

      As it stands now nothing but commercial encoders are accessing this I believe as Intel came to the table VERY late for the x.264 guys.

      No and no.

    4. Re:Excellent! While they're at it... by BLKMGK · · Score: 1

      Yes thank you for proving my point - those are both commercial encoders. Now go look at the x.264 mailing list where the devs are approached by an Intel guy about their new hardware (to much disbelief), much trouble ensues as they try to get him into IRC, contact is made but they find the silicon is already sealed and done, and then the guy disappears. Again, is x.264 accessing this yet or is it only commercial encoders? All in all the devs didn't seem too impressed with Intel, apparently there were things that could have been done that would've been a huge help that weren't. I still hold out hope that they will find some acceleration and if nothing the brute clocks on this CPU and better architecture will be helpful.

      BTW of the two you posted Mediaconverter7 is better in my experience. It actually uses a decent chunk of CPU as well as heats the GPU more than the other so it seems to load itmore. It only does base and main profiles but can be tweaked and CAN write to a MKV container. It does insist on shoving an audio track in there for some reason which is annoying. The other product barely uses the CPU at all, cannot handle writing to MKV, and doesn't appear to allow for the same level of tweaks. so, I've stuck to x.264 with meGUI as my primary front-end with some Handbrake as needed - no GPU acceleration. MediaConvert7 was able to compress a movie in about 45mins vs the 3 hours or so with my normal process but since I cannot get the settings aligned I'm not comfortable using it - their support was no help concerning using the High profile either when I asked. I know $40 isn't a ton of cash but I did sort of want some assurance it would give me at least equal results :-(

      --
      Build it, Drive it, Improve it! Hybridz.org
  19. Find a bug... by Anonymous Coward · · Score: 1

    ...as you write the spec: $1.00
    ...after you ship a few: $1,000,000,000.00

    1. Re:Find a bug... by magarity · · Score: 1

      ...as you write the spec: $1.00 ...after you ship a few: $1,000,000,000.00

      That should be:
       
      Even more testing equipment, staff, and time to check for and fix every bug every time: $5B / revision
      Testing enough so this happens only once in a while: $1B / once in a while
       
      You do the math.

    2. Re:Find a bug... by Megane · · Score: 1

      ...the benefit for your competitor: priceless

      --
      #naabhaprzrag, #sverubfr-000, #agi-fcbafberq, negvpyr[pynff*=' negvpyr-ary-'] { qvfcynl: abar !vzcbegnag; }
    3. Re:Find a bug... by nonregistered · · Score: 1

      I trust GP: has empirical data.

  20. That's a Bug? by Greyfox · · Score: 2

    I would have thought Intel would consider that to be a feature. Certainly it seems to describe every system I've worked on for the past two decades...

    --

    I'm trying to teach myself to set people on fire with my mind... Is it hot in here?

  21. Just my luck for switching back to Intel by Kindgott · · Score: 1

    I'd been exclusively building and using AMD systems for my past 2-3 machine builds, but I just built a new computer over the weekend. Just my luck it was a Sandy Bridge CPU with a P67 chipset motherboard. I suppose I'll go fill out my registration for the motherboard tonight and wait for Gigabyte to contact me regarding a recall.

    --
    If there's anything more important than my ego around here, I want it caught and shot immediately.
  22. Re:Sucks to be them! by Daniel+Phillips · · Score: 1

    I'm still running a dual-core athlon 64. Processor/memory upgrades became overrated a few years ago.

    I'm running a four core Phenom II black edition at home. Basically dirt cheap. I don't call that overrated at all, far from it. Oh, and 4 Gig, all of which I happily use. Actually that amount of memory will look small in the future. Say 6 months from now.

    --
    Have you got your LWN subscription yet?
  23. Will I Am by rwise2112 · · Score: 1

    Damn that Will I Am!

    --

    "For every expert, there is an equal and opposite expert"
  24. Ouch by GeekHang · · Score: 1

    $700 million? Talk about taking a hit! Someone's getting fired.

    1. Re:Ouch by blair1q · · Score: 1

      Recall that when they did the replacement program for the Pentium FDIV bug, they wrote-down $400 million in expected costs for returns and repairs.

      A few quarters later, they got to book about $360 million of that as income, as most of the requests for the fixed chips never materialized.

      I'd expect a chargeback of half a $billion or so sometime in the fall or early next year.

  25. Big losses for Intel?! by AndyAndyAndyAndy · · Score: 1

    Alright, competitors, time to shine! Let's go get... uh... guys? ... you there?

    --
    It's always confirmation bias!
    1. Re:Big losses for Intel?! by blair1q · · Score: 1

      Intel fumbles a chipset and AMD fires its CEO.

      Ironically, this makes perfect sense.

    2. Re:Big losses for Intel?! by Anonymous Coward · · Score: 1

      Intel fumbles a chipset and AMD fires its CEO.

      Ironically, this makes perfect sense.

      and AMD stock goes up as the new CEO may just end up being better than the old one.

    3. Re:Big losses for Intel?! by blair1q · · Score: 1

      AMD's stock went up because Intel raised revenue estimates.

    4. Re:Big losses for Intel?! by petermgreen · · Score: 1

      AMD is the closest thing intel has to a competitor but they are still behind nahlem let alone sandy bridge. They have tried to get arround this by throwing cores at the problem but that is of limited usefulness for desktop users. So while embarrasing and expensive for intel I don't see this really changing the balance of power in the processor market much. Especially as intel has caught this early while the ammount of sandy bridge hardware in consumers hands is still relatively small.

      --
      note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
  26. AMD's take? by Anonymous Coward · · Score: 0

    I wonder how AMD will be taking Intel's total loss of a billion dollars?

  27. Chip bugs by nojayuk · · Score: 1

    VIA had a chipset bug on their old KT-series motherboard bridge chips that would lock up the machine if a certain sequence of bytes and a couple of signals on the ATA bus interface hit simultaneously. That condition was legal (if rare) as far as the ATA bus spec was concerned and shouldn't have caused the lockup, but it did. It was one of the conditions we had to insert escape code for when building optical (CD/DVD) drives otherwise people who bought our drives would bitch at us when the inevitable lockups happened. All the other manufacturers of IDE-bus devices did the same sort of workarounds and VIA did eventually fix the bug but it still left millions of motherboards out there with the problem chips on them.

    OTOH we caught the VHDL bug that occasionally switched off the DRAM refresh controller in the testing lab before the design got sent to production, a relief for everyone concerned.

    1. Re:Chip bugs by vadim_t · · Score: 1

      It didn't go unnoticed though. I avoided VIA chipsets like the plague, and so did a lot of people.

  28. Why Sandy Bridge ? by billcopc · · Score: 2

    What's the big appeal of Sandy Bridge anyway ? I still haven't figured out where it fits in the market... mind you, I type this on my dual nehalem, which is still king of the mountain after a year, so I really don't get what the fuss is about. Is Sandy Bridge significantly faster than the original i3/i5 cop-outs ? Or is this a mythical "bang for the buck" platform where everything costs twice as much as AMD ?

    I've been building a lot of systems, and Intel dominates the high end, but in my view they haven't sold a decent value processor since the E2xx0 Core 2's. In the desktop market there's really just 3 segments that matter: sub-$500, 500 to 1000, and balls-to-the-wall nutjobs like myself, and AMD has the bottom two tiers in a fierce headlock.

    --
    -Billco, Fnarg.com
    1. Re:Why Sandy Bridge ? by Anonymous Coward · · Score: 0

      Posting as a lazy AC, but it seems to be a solid step - I'm no engineer so I don't know for sure if it's this gen or the next one that's the game changer, but sandy bridge is first/almost first to eliminating a lot of bottleneck connectors that were way slower than the actual central cpu itself. It's a type of "Answer C - Other" improvement different from speed or quantity boosts.

    2. Re:Why Sandy Bridge ? by Anonymous+Showered · · Score: 3, Informative

      Sandy Bridge is the successor to Nehalem. It uses less power and is more efficient.

      The current P67 boards (LGA 1155) are for the mainstream market, e.g. Best Buy, Futureshop, Fry's, Staples, etc. They're basically "high-end' for the middle-class.

      Wait until LGA 2011 comes out (successor to 1366). You'll be thinking of switching then. :)

    3. Re:Why Sandy Bridge ? by Anonymous Coward · · Score: 0

      I don't get it either.

      I have a two year old Thinkpad T400 with a Core 2 Duo T9400, and Intel's new stuff isn't fast enough to justify an upgrade. It looks to me that Sandy Bridge is just a name, and not as impressive as people have hyped it to be.

    4. Re:Why Sandy Bridge ? by PitaBred · · Score: 4, Informative

      Yes. Sandy Bridge i7-2600K CPUs are approaching the speeds of the i7-980X, while costing 1/3rd as much. You can build an insanely fast machine for under $1000 with Sandy Bridge, including graphics card.

    5. Re:Why Sandy Bridge ? by Kjella · · Score: 1

      Nehalem to Sandy Bridge? Not necessary, particularly since you need another new mobo. Is Sandy Bridge better than Nehlem? Overall yes, but not radically so. It's more a replacement for the 1156 than the 1366 platform, that refresh is coming later this year with a different socket and different processors. You can tell by the pricing, I don't remember the 2100 but the 2500 and 2600 are at $200 and $300 respectively. The "balls to the walls" segment is coming later probably with CPU prices up to $999 for the extreme editions.

      --
      Live today, because you never know what tomorrow brings
    6. Re:Why Sandy Bridge ? by Anonymous Coward · · Score: 0

      Sandybridge is 30%+ faster clock for clock than c2duo. It uses less power, and encodes video close to 8 times faster... It has more advanced power saving features that allow for unprecedented performance in small-scale laptops...

      Sandybridge is ridiculous.

    7. Re:Why Sandy Bridge ? by DoofusOfDeath · · Score: 3, Informative

      What's the big appeal of Sandy Bridge anyway ?

      For some of us (including me), the big deal is that Sandy Bridge adds a new set of instructions called "AVX" intstructions, which let us do more floating-point operations at the same time. For some scientific apps this can nearly double the performance of the overall app.

    8. Re:Why Sandy Bridge ? by Theovon · · Score: 3, Informative

      There are a number of really good articles on the advances in Sandy Bridge. For instance:

      http://www.realworldtech.com/page.cfm?ArticleID=RWT091810191937
      http://www.anandtech.com/show/3922/intels-sandy-bridge-architecture-exposed

      To summarize some of the things I remember off the top of my head:

      The design is basically area-equivalent to the Nehalem designs, but they've made certain structures more space efficient to make room to enlarge others. For instance, they've made the branch predictor use fewer bits for the same prediction accuracy. This and other improvements have allowed them to increase critical structures that affect things like the instruction window size. The instruction window pertains to the number of decoded but not executed instructions out-standing. A larger instruction window allows you to (a) find more instruction-level parallelism because you're more likely to find independent instructions that can be executed simultaneously, and (b) absorb the effect of some high latency operations, like L2 cache misses -- you can effectively hide much of the latency by continuing to look for and perform unrelated work during the stall. In Nehalem and before, they had a structure that unified the reservation station, register file, and reorder buffer. Logically, this makes sense, but it also makes that area very power hungry, and you can never turn it off. In Sandy Bridge, they've split those structures, so they can be clock-gated separately. Also, instead of accumulating dependency results in the reservation station, they're stored in a single centralized physical register file, and pointers are held in the RS. This saves a lot of space, since now instructions traveling around the processor just need to carry the pointer. (This does add some latency and writing required to fetch those results from the RF when they're finally needed.)

      It's explicitly stated that Sandy Bridge is not a major revolution in processor design. Compared to Nehalem, you might think of it representing a large collection of efficiency improvements that work together to make a processor that is faster (clock for clock efficiency) and more power efficient.

      Many of these improvements lead to the larger instruction window. IMHO, this is a critical improvement. A Sun engineer once described modern processing as being a race between last-level cache misses. You have an L2 miss, and you quickly run out of work to do, and the processor stalls until that out-standing read arrives. Meanwhile, you've accumulated a hundred cycles or so of pending work, which gets blasted through, and execution continues perhaps a little while until you have another L2 miss. Processors like Nehalem can execute four or more instructions per cycle (peak), but the effective AVERAGE instructions per clock is less than 1. These high-latency L2 misses are primarily responsible for that. Besides adding on-die memory controllers, which reduces the latency, Sandy Bridge lengthens the instruction window so as to absorb more of that latency, so that stall time is less.

    9. Re:Why Sandy Bridge ? by Nemyst · · Score: 1

      Tom's Hardware recently made an article saying similar notebook performance can be had at half the power consumption in typical use situations.

      I'd say that's a big appeal.

    10. Re:Why Sandy Bridge ? by petermgreen · · Score: 1

      AMD has the bottom two tiers in a fierce headlock.
      Really? care to cite some sources to back up that claim? most times i've done a comparison of similarlly performing chips (note that AMD don't get as high performance per clock as intel) the intel option has been more expensive but not hugely so.

      Where intel does lose out is platform flexibility, you can put AMDs cheapest chips on a top end board or their most expensive on a low end board. With intel you have different platforms for different levels of price and performance so you can't mix and match.

      As for sandy bridge VS AMD afaict the fastest quad core AMD offers is the Phenom II X4 970 BE which costs just under $200. The i5 2500 costs just over $200 and beats it in every test anandtech include in their charts and in most of them it does so by a wide margin.
      http://www.anandtech.com/bench/Product/288?vs=186 (note the 2500K is just a 2500 with an unlocked multiplier and some esoteric features disabled). IMO for the $500-$1000 desktop segment prior to this announcement there was* little reason to choose anything other than a 2500 or 2600 (add K suffix if you plan to overclock)

      As for the AMD hex cores they may be marginally better than a 2500 in some (but not all) highly multithreaded tasks but I would definately have taken the 2500 over them for normal desktop use.

      * until todays announcement

      --
      note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
    11. Re:Why Sandy Bridge ? by citizenr · · Score: 1

      What's the big appeal of Sandy Bridge anyway ?

      5GHz on Air cooling

      --
      Who logs in to gdm? Not I, said the duck.
    12. Re:Why Sandy Bridge ? by Anonymous Coward · · Score: 0

      Not anymore...

    13. Re:Why Sandy Bridge ? by Anonymous Coward · · Score: 0

      AMD has the bottom two tiers in a fierce headlock.

      Really? care to cite some sources to back up that claim? most times i've done a comparison of similarlly performing chips (note that AMD don't get as high performance per clock as intel) the intel option has been more expensive but not hugely so.

      I'd start by citing you, as you just said Intel is more expensive. "performance per clock" is irrelevant, unless .. you happen to supply your own quartz crystals? Performance per dollar is the major yardstick.

      I'd also cite your claim that AMD is more flexible, since in the end that's a synonym for better performance per dollar. If you need feature x, then flexibility lets you cheap out on parts that aren't integral for supplying feature x; or to put it another way, it lets you avoid overkill where you don't need it. Sometimes the expensive CPU+cheap mobo or cheap CPU+expensive mobo gives you what you want, so flexibility means you can buy that, where inflexibility means you'd buy the expensive CPU +expensive mobo. The expensive combo would be really nifty, not a sucky computer or anything, but you just paid for something that doesn't effect your situation's performance.

    14. Re:Why Sandy Bridge ? by MoriT · · Score: 1

      In performance-per-dollar the i7-2500 is the best thing until you get sub-$90, with the other new i7s close behind. There are some consumers who look at marginal value and not just total cost.

    15. Re:Why Sandy Bridge ? by leromarinvit · · Score: 1

      Not anymore...

      Well you can, but it won't be insanely fast for long!

      --
      Proud member of the Ferengi Socialist Party.
    16. Re:Why Sandy Bridge ? by Anonymous Coward · · Score: 0

      Well, I can't speak for anyone else, but I can speak from my own personal experience.

      Unless you plan on playing Minecraft (still have to check with Notch on this one), the graphics processing flies. The processor itself flies. The power consumption stays quite grounded. I seriously can't tell that my desktop is on from my CPU fan or case fan even with both sides of it open. My stupid 5-year-old laptop is far louder. It's probably not worth an upgrade (I think benchmarks put it around ~20% improvement across the board from last gen with a far more significant boost to AES - ~870%), especially if you have a really good processor from the past couple years. But if you're buying new, you'd be a serious fool not to purchase a Sandy Bridge.

      On the Windows Experience Index (which you may find to be a suitable or unsuitable indicator of performance), I get a graphics/gaming graphics score of 6.4 and a processor score of 7.6 with the Core i5 2500K. You may note that this scale ends at 7.9 and that the 6.4 was achieved in the complete absence of a typical graphics card. That score completely trumps the nVidia cards at least until the 9xxx series ends. Considering that the processor will not draw more than 95W under maximum load, I'd say that's a very nice performance:cost point.

      YMMV

    17. Re:Why Sandy Bridge ? by Anonymous Coward · · Score: 0

      Laptops mainly. I'm waiting for the new Macbook pro in April. ^_^

    18. Re:Why Sandy Bridge ? by petermgreen · · Score: 1

      as you just said Intel is more expensive.

      They are but not hugely so. I wouldn't call being slightly cheaper than your much larger competitor to be a "feirce headlock".

      And that only applies to the OP's "lower teir" and the lower end of his "middle tier". Move up to the $750-$1000 systems where the CPU budget is $300 or so and you can get chips that are better for desktop use than any AMD chip especially before the recall but even now afaict a 950 will smoke any quad core AMD has to offer (the AMD hex cores have their uses but I wouldn't choose one for a general purpose desktop).

      I'd also cite your claim that AMD is more flexible

      Which is somewhat useful in special cases but not too important for "ordinary" computers.

      P.S. I noted performance per clock in case the OP was comparing prices on similarlly clocked AMD and intel chips and assuming they were equivilent in performance.

      --
      note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
    19. Re:Why Sandy Bridge ? by billcopc · · Score: 1

      Thanks to everyone who replied :)

      Yeah I see the appeal for laptops, lower power draw + better graphics = excellent. I guess for my desktop needs (wants), I'll be stuck waiting for enthusiast/server LGA 2011 boards. My biggest issue is that I make extensive use of virtual machines on my desktop, for client/server and cluster testing, so I need boatloads of Ram and CPU. Clearly, Sandy Bridge is not for my type of usage.

      --
      -Billco, Fnarg.com
  29. amd HAS sata 6 on all ports intel does not and now by Joe+The+Dragon · · Score: 1

    amd HAS sata 6 on all ports intel does not and now they can't even get sata right?

  30. How many? by Anonymous Coward · · Score: 0

    'relatively few' people are affected, yet it's gonna cost Intel $700 million in replacements ?

    Either these are very expensive replacements, or Intel has a different idea on how many 'few' are compared to the rest of us.

  31. At least they are prompt to deal with it by Timmy+D+Programmer · · Score: 1

    So many companies would still be denying having an issue. I say Kudos to them for owning up and acting fast. Everybody makes mistakes, but few take responsibility.

    --


    (If at first you don't succeed, do it different next time!)
  32. so the mini, macbook, mac bookpros under $1800 by Joe+The+Dragon · · Score: 1

    so the mini, macbook, mac bookpros under $1800 will be stuck on core 2 for most of 2011?

    1. Re:so the mini, macbook, mac bookpros under $1800 by hattig · · Score: 1

      It doesn't affect SATA ports 0 and 1 on the chipset (the new SATA3 ports), only the four SATA2 ports 2,3,4,5. It is likely that a laptop from Apple will only use the first two ports, so there shouldn't be an issue with new MacBooks or the Mac Mini.

  33. DRM? by ivoras · · Score: 1, Interesting

    "Slow performance degradation over time" on SATA controllers? Who wants to bet this is due to some "misapplied security" scheme such as DRM or something related to the TPM?

    --
    -- Sig down
    1. Re:DRM? by Anonymous Coward · · Score: 0

      Your tinfoil hat is lose.

    2. Re:DRM? by markass530 · · Score: 1

      I just commented on something else, so I can't use my mod points, someone else mod parent up, and slap the shit outta whoever labeled it a troll.

    3. Re:DRM? by alvinrod · · Score: 1

      Plenty of reasons to hate DRM/TPM for their own sake. No need to attribute a chipset defect to either of these without any actual proof. Seems that it only affects the 3 Gbps ports and not the 6 Gbps ports so that claim seems somewhat unlikely.

    4. Re:DRM? by Anonymous Coward · · Score: 0

      "Slow performance degradation over time" on SATA controllers? Who wants to bet this is due to some "misapplied security" scheme such as DRM or something related to the TPM?

      Not me...

    5. Re:DRM? by Anonymous Coward · · Score: 0

      It's a stupid comment because there's no evidence or other reason to suspect it. Not that it can't be true, but we have just as much reason to believe that it's caused by a Flying Spaghetti Monster problem as a DRM problem.

  34. AMD's Bulldozer getting a little bit closer :) by RalphTheWonderLlama · · Score: 1

    Any delay to Intel brings AMD's release of their new Bulldozer architecture a little bit closer to Sandy Bridge. Things will be interesting for the CPU market in 2011 to say the least.

    --
    simple, fast homepage with your links: http://www.ngumbi.com/
  35. At least they admitted it. by JustAnotherIdiot · · Score: 1

    If this was Apple, they'd just say you're using it wrong.

    --
    What do I know, I'm just an idiot, right?
    1. Re:At least they admitted it. by gstrickler · · Score: 1

      If it were used in a MacBook or Mac Mini, it probably wouldn't affect any users. To date, Apple hasn't introduced a MB/MBP/mini that uses more than 2 SATA connections. Assuming that remains true for any SB based machines, and that they would use the 2 6Gbps SATA ports, this bug would not affect those machines.

      In short, given what's currently known and likely, "you're using it wrong" would be correct. Of course, that's all speculation since Apple hasn't yet released machines using SB.

      --
      make imaginary.friends COUNT=100 VISIBLE=false
    2. Re:At least they admitted it. by RalphTheWonderLlama · · Score: 1

      Or Nvidia or Seagate :)

      --
      simple, fast homepage with your links: http://www.ngumbi.com/
    3. Re:At least they admitted it. by Anonymous Coward · · Score: 0

      Yeah my 500GB 7200.11 brick in the shed is a testament to that. And yes it broke again eventually even after I fixed it using the serial port on the back and diagnostic instructions.

    4. Re:At least they admitted it. by toddestan · · Score: 1

      I remember the issues with the IDE controllers in some of the early G3 PowerMacs, where everything would be going along just fine, then you'd suddenly get hit with data corruption. Apple's response was to more or less tell their customers to go screw themselves.

  36. Re:amd HAS sata 6 on all ports intel does not and by Ecuador · · Score: 1

    What is interesting is that those 2 SATA 6Gbps ports that the Intel boards have, are the ones unaffected by this! The problem is only with the other 4 ports. I bet they were thinking going with mostly the ol' 3Gbit ports will be safe and save some money... Woops!

    --
    Violence is the last refuge of the incompetent. Polar Scope Align for iOS
  37. MTH deja vu by Anonymous Coward · · Score: 0

    If anyone remembers the i820 MTH (DRAM to RAMBUS translator Hub) fiasco, this is it again. Basicly what happened back then is that as soon as the drivers (INF files) are installed the chipset experiences a high error rate over time. With the i820MTH "over time" was roughly one hour once affected. Initial builds of such systems seem to have no problems, it's not until the customer keeps calling in for the same problem that it's discovered to be a chipset issue.

    I'm glad that Intel caught this early, but it's very likely that affected parts from 3rd party motherboard vendors already went out, so I'd avoid buying anything with Sandy Bridge until late 2011.

  38. Quote from Intel's Director of Creative Media by A+Guy+From+Ottawa · · Score: 2

    Will.I.Am, at his first public Intel press event since being hired, was quoted saying "The problem with the Sandy Bridge Chipset seems to be the dirty bit. BZZZZT BOOM BOOM.... BZZZZT BZZZT BZZZT BOOM BOOOM...." The rest of his comments weren't heard by anyone at the event due to the sudden loud and obnoxious music blaring from all corners...

    --

    using System.Awesome;

  39. Comment removed by account_deleted · · Score: 1

    Comment removed based on user account deletion

  40. Comment removed by account_deleted · · Score: 2

    Comment removed based on user account deletion

  41. Re:Sucks to be them! by level_headed_midwest · · Score: 1

    There have been quite a few products worth spending money on in the last few years if your workload consists of applications that are highly multithreaded and/or use a large amount of memory. Servers especially have gotten phenomenally better since three years ago with Intel retiring their aged, performance-sapping FSB architecture for an IMC + point-to-point link architecture, Intel getting rid of high-latency and hot FB-DIMMs for normal DDR3, AMD releasing the MCM Opteron 6100s with 2-3 times the number of cores per socket as they had in early 2008, and the adoption of 2 Gbit DIMMs. Low-power/embedded users also are in luck as products like AMD's Fusion APUs and dual-core 1 GHz ARM Cortex A9s are much better than the early 945GC + single-core Atoms, 130 nm AMD Geodes, or ~600 MHz single-core ARM CPUs available three years ago. Also, the rise of NAND SSDs has done a lot to increase computer performance over the last three years as well. Other than that, most desktop CPUs and GPUs aren't phenomenally better or faster than those around three years ago. There has been some improvement, but nothing like what we saw during any three years of the 1990s.

    --
    Just "gittin-r-done," day after day.
  42. Re:Sucks to be them! by Belial6 · · Score: 1

    Maybe power savings and the costs associated with running high power consuming machines is not a factor for you, but processor/memory upgrades are very often not overrated. It just depends on what you call an upgrade. The last 5 machines I have bought have more than paid for themselves in power savings. They happen to be faster, but the bigget upgrade was in moving from an average power usage of ~180 watts to an average power usage of ~40 watts, as well as improving the machines ability to go into standby.

  43. Over time == statistically by Weaselmancer · · Score: 1

    By "over time" they mean that every time a set of circumstances crop up for the bug to manifest, roll some dice. Eventually you'll get snake eyes and the bug will bite you. From this article:

    "On its conference call to discuss the issue, Intel told me that it hasn’t been made aware of a single failure seen by end users. Intel expects that over 3 years of use it would see a failure rate of approximately 5 - 15% depending on usage model. Remember this problem isn’t a functional issue but rather one of those nasty statistical issues, so by nature it should take time to show up in large numbers (at the same time there should still be some very isolated incidents of failure early on)."

    So it's not like the chip is dissolving or some such.

    On the good news front, from this article:

    "If you've already built a Sandy Bridge system, fortunately, there are some obvious workarounds available. Most enthusiast-class motherboards these days ship with extra SATA ports driven by auxiliary SATA controller chips from third-party suppliers like Marvell, and those ports aren't at risk for this problem. As we've noted, the two 6Gbps SATA ports on the 6-series chipset aren't, either. For a great many users, sidestepping this problem should be as simple as moving their storage device connections to the other ports. Given the relatively strong performance that we've seen out of Intel's SATA 6Gbps controller, we'd recommending attaching any fast, primary storage devices like SSDs or 7,200-RPM drives to the 6Gbps SATA ports if possible. Other drives, like large and slow-rotating HDDs, should be fine on the third-party controllers. Just be careful to ensure that you have all the right drivers installed and the boot order in the BIOS set correctly before making the move, so you don't cause yourself the headache of an unbootable system."

    So it's not the huge deal that it seemed to be at first. Your 6Gbps ports are fine. It's your 3Gbps ports that are pooched. But if your board has a secondary controller like the Marvell controller - just move your drives to those ports (or plunk down $20 bucks and get an ePCI SATA board) and Bob's your uncle.

    That being said though - dammit. I JUST ordered one of these boards last night from Newegg. I've always been an AMD fan, but I figured just this once I'd try Intel since they've been making some really great cpus lately. Haven't upgraded in five years and BANG - this hits.

    If you'd like to make some quick cash, go to Vegas and place a few bets. Then have me root for the team you'd like to lose.

    --
    Weaselmancer
    rediculous.
    1. Re:Over time == statistically by DarthVain · · Score: 1

      Another cheap easy fix is I know they used to make RAID ATA PCI cards, which you didn't even really have to use the raid, you could just use the extra ports. I have no idea if they make them for SATA or how much they cost, but if they were anything like the ATA ones it wasn't much.

      So if you end up being one of the unlucky 5-15% AND the manufacturer refuses to replace (likely past warranty), perhaps you can go out and just buy a 50$ SATA PCI card that doesn't have that problem, and boom, problem solved. Of course you need a free PCI slot to do it, but these days so much crap is integrated in the motherboard, most people don't use all (or any) of their PCI slots anyway.

      A quick Google confirms that you can buy a 4 port SATA PCI card for 40-80$ online. Sans Problemo.

    2. Re:Over time == statistically by Weaselmancer · · Score: 1

      Yup. That'd do it.

      I just looked up the specs on the motherboard I bought last night and it appears I'm in luck. It's a Sabertooth P67.

      Intel® P67 Express Chipset 2 xSATA 6.0 Gb/s ports (brown) 4 xSATA 3Gb/s ports (black) Intel® Rapid Storage Technology Support RAID 0,1,5,10 Marvell® PCIe SATA 6Gb/s controller 2 xSATA 6Gb/s ports (gray) JMicron® JMB362 SATA controller 1 xPower eSATA 3Gb/s port (green) 1 xExternal SATA 3Gb/s port (red)

      So use the Brown ports for boot drive and CD for install, and Grey ports for data drives once the OS loads the Marvell driver, I'm thinking. Ignore the Black ports.

      No biggie.

      Oh, and in related news Newegg just pulled this motherboard from their shelves. I'm probably the last human being on planet earth to buy one.

      --
      Weaselmancer
      rediculous.
    3. Re:Over time == statistically by mr_mischief · · Score: 1

      The problem is a 133 MHz 64-bit PCI slot will max out at 8 Gb/s and you're looking at replacing the 3 Gb/s ports. You can only get two full-speed 3 Gb/s ports out of an 8 Gb/s slot.

      Your PCI on your motherboard is not 64 bit, either, unless you have a server. It's 32 bit. It's also not going to be 133 MHz, or even 66 MHz on a stock desktop motherboard. It's going to be 33 MHz.

      So you're running four 3 Gb/s ports from your 1 Gb/s slot. That is not without problems.

      You're going to need to step up to PCI Express, abbreviated PCI-E, or at least
      PCI-X or the faster wide/fast PCI found on some servers before PCI-E. Don't confuse PCI-X and PCI-E, either. They are not the same.

      PCI-E 1.0 will transfer about 2 Gb/s per lane, and 4 and 8 lane cards are available for drive controllers. PCI-E 2.0 and 2.1 will transfer 4 Gb/s per lane if both the slot and the card are 2.x compatible. PCI-E 3.0 is recently finalized and when you can get motherboards and add-in cards its 60% higher transfers per second and 90% or so lower overheard per transfer will enable about twice the bandwidth again. So a one-lane PCI-E 3.0 card in the proper slot will transfer about 8 Gb/s per lane.

      Motherboards for desktop PCs often offer PCI Express slots of 16 lanes for a video card (and sometimes two, three, or even four of these) with a few one-lane slots. Sometimes a four-lane or eight-lane slot will be featured on the motherboard as well. I've even seen some boards with two x16, two x4, and two x1 slots or some similar configuration.

      It can take a little care to get a PCI Express card and motherboard combination that properly handles full SATA speeds, and it'd take a little luck if you didn't spec such out beforehand. It'd take a lot more than that to find a PCI combination to do it.

    4. Re:Over time == statistically by Weaselmancer · · Score: 1

      Just wanted to say thank you for the PCI-E schooling. I didn't know very much about it until you posted.

      --
      Weaselmancer
      rediculous.
  44. The most reasonable explaination by slew · · Score: 4, Insightful

    My educated guess is that the SATA Input/Output Pads have a digital timing compensation circuit that tries to center the data sampling window (e.g., the clock edge where data is sampled). Since the appropriate data sampling window that won't cause a setup/hold violation changes with process variation and temperature it needs to have lots of potential settings in a large window and may need automatic tracking.

    Probably someone didn't design that window large enough to center the data sampling timing offset (or the step size isn't small enough or the auto adjustment circuit that tracks temperature and adjusts the window appropriatly has an algorithmic flaw in some cases, etc). It might be okay now (in early production tests), but as the part ages, the required data sampling window can shift significantly, and if the chip can't adjust the data sampling window appropriatly, then data errors are inevitable.

    As a silly example, let's say a hw engineer put in a clock trim circuit that could adjust +-100ps in steps of 10ps. No driver update can make that adjustment -110ps.

    Conversely, if the hw control algorithm that tracks temperature and adjusts the window has a postive temperature coefficient over time (say gets slower), but the actual I/O circuit has a negative coefficient over time (say gets faster), after a while, that feedback algorithm may become unstable, that might not be fixable with a driver update either (if the control algorithm is in hw).

    Of course, I have no real infomation, but it's my guess having designed high speed I/Os in the past...

    1. Re:The most reasonable explaination by Mr+Z · · Score: 1

      Mod parent up! Also, rather appropriate username for the topic.

  45. I'll give an example by Anonymous Coward · · Score: 0

    Yes, I am an ASIC designer. Transistor design varies wildly based on the process type but below is what can happen for a low-power CMOS process. Also SATA uses clock recovery and there are many things that can go wrong there.

    One case that can cause reliability issues is metastability. If you're not familiar, metastability is what happens when more than one input of a register changes within a certain time interval. The usual case is the clock input arrives too close to a change on the data input(called a setup violation), but it can happen with reset pins as well. The internal circuit of the flop isn't designed to handle these cases so the output voltage is not forced to either low or high, but hovers somewhere in between for a time before ultimately drifting one way or the other. Now if your clock and data are such that this case happens every clock, the output may in fact never settle and stay in an undetermined state.

    Besides the logical effect, there is a reliability effect as well. When the flop output is between high and low, the feedback circuit inside causes a direct short to ground that lasts as long as the output is unresolved. This can cause hot-electron effects and electro-migration, although local melting may be possible but that typically causes a hard failure. Both of these cause electrical property changes which can be read about here.

    I should point out that some designs have special registers that can handle metastability. They do this by increasing the register size about 10 times so that the transient current during metastability can be handled reliably.

  46. Re:Sucks to be them! by PNutts · · Score: 1

    Not that it matters to myself. I'm still running a dual-core athlon 64. Processor/memory upgrades became overrated a few years ago.

    There has been nothing new worth spending money on, hardware wise, for the last 3 years at least.

    Agreed, but 2011 changes that for some folks. I waited for Sandy Bridge instead of moving to AMD's six cores (or waiting for Fusion). The i5's Quick Sync decreased my video encode time for a feature length film down to just under 8 minutes at stock speeds.

  47. Re:Sucks to be them! by hairyfeet · · Score: 1

    Actually I'd be surprised if it didn't cost them every penny plus of that money. After all there is a pretty big difference between a certain math type being screwed up in edge corner cases and having to reboot daily like it is Win98 all over again just to keep your nice new SATA from running like a 66MHz IDE.

    So if I was Intel I'd be figuring in on a one to one replacement of EVERY SB board out there, plus disposal costs.

    --
    ACs don't waste your time replying, your posts are never seen by me.
  48. Not a big deal for most... by llZENll · · Score: 1

    First of all, this should be in the summary, the first two 6g controllers on the board are unaffected, its only the 3g controllers that are affected. Most users aren't even going to be using more than 2 ports. Secondly, most motherboards use third party controllers for additional ports (mine use 2 other controllers and 6 non-Intel ports), if you have any of these then unless you are using 10 drives in your computer it isn't going to be a problem and a simple cable switch to other ports and you are done.

    Lastly, it is probably going to be much easier, cheaper, and faster to simply pop in a SATA controller card than getting an RMA.

    1. Re:Not a big deal for most... by MtViewGuy · · Score: 1

      Also, because Intel caught this bug relatively early, it may fortunately NOT affect the rollout of the next-generation Apple iMac, MacBook and MacBook Pro computers that will likely use this chipset--I've heard Apple wouldn't roll them out until April 2011 at earliest anyway.

  49. THERE'S SPAGHETTI ON THE SOUTHBRIDGE?!?! by Anonymous Coward · · Score: 0

    I blame spaghetti.

    http://images.encyclopediadramatica.com/images/0/00/Spagsouth01.jpg

  50. Re:Sucks to be them! by mr_mischief · · Score: 1

    Unfortunately it's not over time during a session. This problem is degradation over the life of the machine, with as high as 15% complete failure of the SATA 3Gb/s ports after three years.

  51. Electromigration by Anonymous Coward · · Score: 0

    http://en.wikipedia.org/wiki/Electromigration

    It's a problem that becomes more and more of an issue as the feature size (nanometer figure) on chips gets smaller and smaller.

    It is as far as I know the only way in which silicon chips degrade over time. It cannot be reversed in software.

    Intel had this problem with their early Core 2 but I don't know that it ever became fully public like this. Intel released a patch to be placed in BIOS which changed a few settings in the chip to prevent the problem. This couldn't reverse it after it failed, but it could prevent or at least make it less likely if applied before the chip failed.

    It is a design and/or chip process (production) problem and cannot be fixed in software after it happens.

  52. cheap refurb boards? by Brit_in_the_USA · · Score: 1

    I hope some of these recalled MB's become available at bargain basement prices. I'd happily buy one and use a PCIe SATA card if it could save me ~75% compared to the present MSRP.

  53. Re:Sucks to be them! by m50d · · Score: 1

    The only thing my P4 doesn't do is HD video, but I'm okay with that since my 700k connection doesn't do HD either.

    You can download videos overnight to watch in the morning, or buy them on bluray. I'm guessing you're not playing recent games either. Nothing wrong with either position of course, but I think you'll find it's not the industry that's changed to make it not worth upgrading, it's you getting older.

    Now my Pentium 3(?) 700 MHz laptop is long in the tooth, and often runs too slow for my taste, but it is just a laptop. I don't use it much except for travel.

    And you don't like to watch video or play games while travelling?

    --
    I am trolling
  54. Re:amd HAS sata 6 on all ports intel does not and by coxymla · · Score: 1

    FWIW Intel's 2x SATA III (6 Gbps) ports are a lot faster than AMD's.

  55. Re:Sucks to be them! by commodore6502 · · Score: 1

    >>>older

    Cheaper. I try to avoid spending money on upgrades when the computer (or car or TV) is still working. And yes I play videogames but usually on my Gamecube or PS2, rather than my laptop.

    As for videos, youtube and hulu don't demand much processing power. The 700 MHz P3 can handle it just fine.

    --
    Information wants to be expensive AND wants to be free. So you have Value vs. Cheap distribution fighting each other.
  56. to bad intels chips have limited pci-e lanes by Joe+The+Dragon · · Score: 1

    to bad intels chips have limited pci-e lanes and you need to go to the high end i7 cpus just to get more then 20+DMI bus speed of 4 pcie lanes. For Sandy Bridge that may be a $400 cpu!

    AMD lets you USE ANY CPU in a AM3 board with chip set choice with better pci-e lane setups. 890FX has 38 + 4 SB link. 890GX and other 800's 22 + 4 SB link.

    790fx has 38 + 4 SB link lanes. 790X and 790GX has 22 + 4 SB link. 785G 20 + 4 SB link. 785E 22 + 4 SB link and most of the other 700 ones have the same.

  57. Re:amd HAS sata 6 on all ports intel does not and by Anonymous Coward · · Score: 0

    I think the AMD systems use a NEC USB 3 controller.. At least my Gigabyte 890FX does.

  58. At least there's an alternative, right? by Anonymous Coward · · Score: 0

    Gee, good thing Intel ran NVIDIA out of the chipset business. And that the feds didn't come down on Intel like a ton of bricks.

  59. Crap! by Anonymous Coward · · Score: 0

    My old PC died yesterday night. Today I did all the research and bought a sandy bridge PC while I was at work, instead of slacking and reading slashdot- so I missed this. Damn it! I hope cyberpowerpc lets you cancel orders . . .

  60. They also learned the hard way last time by Sycraft-fu · · Score: 1

    Remember the FDIV bug? For certain floating point divisions, the original Pentium gave results that were slightly incorrect. Now this really didn't matter to most people. This was back in the day when floating point was not used a lot (many 486s had no FPU at all), the error only happened on certain calculations, and it wasn't a large error. So realistically, most people would have been fine with a defective chip, never would have caused a problem.

    Intel reasoned along those lines and thus decided that to get a replacement chip, you'd need to demonstrate some kind of need for it. Show them you were doing something where it'd actually cause problems, not just that you wanted a new one...

    BIG mistake. People were extremely pissed off at that. It was a major PR disaster for Intel and in the end they offered replacements to all chips, so they had to spend the money anyhow, AND they'd already gotten egg on their face.

    This time they are making sure it isn't a problem. They are replacing things, even if it wouldn't really matter, even if a SATA card would bypass the problem. They are keeping people happy and avoiding bad PR/lawsuits.

  61. Apparently, not the problem by slew · · Score: 2

    Sounds like intel fessed up yesterday and stated it was a problem with a bias circuit in the PLL clocking tree. A bias circuit apparently caused a transistor to remain in a high leakage state (which over time will induce a failure mode). What makes it silly is apparently intel is saying this circuit wasn't in initial designs, added, but not needed in the design and will be disabled in the future... Back to the future!

  62. It's not a bug, it's an oxide layer breaking down. by Anonymous Coward · · Score: 0

    It's not a bug at all. It's a design fault - but a purely hardware one at that. (It's *not* a bug in software and it's not a bug in firmware). Also, performance does not degrade over time either - the ports themselves degrade, but not performance. What it is is one particular transistor in the 3Gbps clocking circuts having very thin gate oxide and being accidentally driven at an higher bias voltage (due to a fault). This gate oxide breaks down over time; leaking more and more current.

    Thus, performance does not degrade (RTFA - it doesn't talk about performance degredation, only port degredation), but apparently error rates do increase over time. Eventually the port itself becomes non-functional.

    .

  63. Someone needs to make a monitoring service by Anonymous Coward · · Score: 0

    Some people are going to be stuck with these boards with no way of knowing when they will die. But the problem is progressive and in theory detectable. So a service that periodicly checks the error rate of the affected sata ports and plots thier degradation would be very helpful. You may have to do a bandwidth test to find this if there is no low level way and there is a problem that testing it helps make it worse so you would have to keep the test short. Would be useful also to confirm if your board has this issue or some other unrelated sata issue.

    Another option to help is to write a modified sata driver that you can set a performance to relibility slide bar for. in performance mode it runs like normal and as you slide it to reliablility it adds delays in accessing the sata devices to reduce any sudden high transfer rate requests from damaging the transistor.