Slashdot Mirror


AMD's Piledriver To Hit 4GHz+ With Resonant Clock Mesh

MojoKid writes about some interesting news from AMD. From the article: "Advanced Micro Devices plans to use resonant clock mesh (PDF) technology developed by Cyclos Semiconductor to push its Piledriver processor architecture to 4GHz and beyond, the company announced at the International Solid State Circuits Conferences (ISSCC) in San Francisco. Cyclos is the only supplier of resonant clock mesh IP, which AMD has licensed and implemented into its x86 Piledriver core for Opteron server processors and Accelerated Processing Units. Resonant clock mesh technology will not only lead to higher clocked processors, but also significant power savings. According to Cyclos, the new technology is capable of reducing power consumption by 10 percent or bumping up clockspeeds by 10 percent without altering the TDP." Unfortunately, aside from a fuzzy whitepaper, actual technical details are all behind IEEE and other paywalls with useless abstracts.

286 comments

  1. vaporware by networkBoy · · Score: 5, Insightful

    it's all vaporware till they ship, and it works.
    if they pull it off though, might give Intel a run for their money again, it's about time!

    --
    whois gawk date unzip strip find touch finger mount join nice man top fsck grep eject more yes exit umount sleep dump
    1. Re:vaporware by Anonymous Coward · · Score: 5, Informative

      This is an ad. What is a "resonant clock mesh"? That's sounds really cool. So I started RTFA (I know, sorry). You don't have to chastise me that much, because I stopped reading soon. Right after

      An average Google search is reported to
      require ~ 0.3 watts, about the same amount of power that it takes for a 100 watt light
      bulb to be lit for 10 seconds.

      Which was obviously not written by anybody who has any clue what they are talking about.

    2. Re:vaporware by TheGoodNamesWereGone · · Score: 2

      Agreed. It's a breathlessly ebullient press release sales pitch. That said, I hope AMD is able to get back into the game to keep Intel honest, and I own an Intel processor (the last four or five machines I built before it were AMD-based).

    3. Re:vaporware by ackthpt · · Score: 1

      it's all vaporware till they ship, and it works.
      if they pull it off though, might give Intel a run for their money again, it's about time!

      Intel is pretty good at catching up, even after Intel said nobody needed 64 bit processors and nobody needed multi core processors, they're right there on top.

      --

      A feeling of having made the same mistake before: Deja Foobar
    4. Re:vaporware by AbRASiON · · Score: 1

      "might give Intel a run for their money"
      I'm sorry to inform you but you're a little (lot) out of the loop on the current state of Intel and AMD processors available. I'm sure most people here don't want to hear this but the little guy is well and truly down on the ground being kicked in the stomach.

      I wouldn't be surprised if one of these CPU's at 5ghz would barely compete with Intels current top shelf items, let alone 4ghz.

    5. Re:vaporware by Daniel+Phillips · · Score: 2, Insightful

      The only workstation class machine with which I have been completely happy is powered by an AMD 4 way Phenom II. Quiet, powerful, cheap, pick all three. And looking around, I would say that its successor is highly likely to be an AMD 6 way, 45 nm process chip. Best value by far for my money.

      Today I can choose slightly less latency with Intel or significantly more value with AMD. Call me cheap, but I will take the value, thank you.

      --
      Have you got your LWN subscription yet?
    6. Re:vaporware by networkBoy · · Score: 1, Interesting

      Is AMD really doing that badly?
      Seriously I am out of the loop from an AMD perspective*, but I assumed they were still rocking the cost/performance on the low end of the CPU ranges, and was hoping this would allow them to push into the mid-range i5 territory.
      -nB

      *all I work on at work & at home is Intel stuff, so I don't have any relevant AMD info.

      --
      whois gawk date unzip strip find touch finger mount join nice man top fsck grep eject more yes exit umount sleep dump
    7. Re:vaporware by the+linux+geek · · Score: 1, Informative

      Bulldozer - their current architecture - was really bad. Slow, mediocre price/performance ratio, and power-hungry. It remains to be seen if Piledriver can make it all better.

    8. Re:vaporware by ifiwereasculptor · · Score: 4, Informative

      Well, here's AMD on a nutshell:

      Brazos, the ultra low power processor, is a success.

      Llano, the A series, is actually a very solid product. For the cost of an i3, you get a quad core that is about 1/4 slower overall, but whose integrated graphics about 3 times faster. Actually selling very well.

      Bulldozer is a disaster unless all you do is video encoding.

      Now, here's the puzzling part: they want to use bulldozer, the failure, as the new core for the A series, the success. I hope they find a way to fix it, otherwise my next rig will have an Intel for the first time in ten years.

    9. Re:vaporware by AbRASiON · · Score: 1

      Yes, they are doing that badly. The bulldozer was a giant dissapointment. They have nothing on the table for the desktop crowd. At almost all price points it's silly to buy AMD at this time unfortunately. Especially for heat / power usage etc.

    10. Re:vaporware by moderatorrater · · Score: 1

      Yes, because everyone knows that having more cycles is the way to win the processor war. That's why the pentium 4 was so dominant.

    11. Re:vaporware by hawguy · · Score: 3, Informative

      This is an ad. What is a "resonant clock mesh"? That's sounds really cool. So I started RTFA (I know, sorry). You don't have to chastise me that much, because I stopped reading soon. Right after

      An average Google search is reported to
      require ~ 0.3 watts, about the same amount of power that it takes for a 100 watt light
      bulb to be lit for 10 seconds.

      Which was obviously not written by anybody who has any clue what they are talking about.

      I think it was a typo (or edit by someone who doesn't know what they are talking about). They should have said 0.3 watt-hours (and should have said "energy" instead of "power")

      Google says they use 0.0003 kWh of energy per search.

      A 100W bulb uses .1 kWh in an hour, or .0000278 kWh in a second, or .000278 kWh in 10 seconds. (or .278 Wh)

      Therefore, a 100W bulb running for 10 seconds uses about the same amount as energy as an average Google search. Which is a lot higher than I thought it would be - since I use 20W CFL's, each time I do a google search, that's the equivalent of 50 seconds of light per Google search. Just while typing this reply, I did enough Google searches to light up my room for about 15 minutes.

    12. Re:vaporware by viperidaenz · · Score: 2

      You first say you picked all three "quiet, "powerful", "cheap". Then you say you dropped the powerful to get the cheap. I'm confused.

    13. Re:vaporware by PitaBred · · Score: 2

      I think they take all the routers, networking, cooling, etc. into account as well. Not just the CPU power.

    14. Re:vaporware by Anonymous Coward · · Score: 0

      even after Intel said nobody needed 64 bit processors and nobody needed multi core processors, they're right there on top

      Ever heard of disinformation?

    15. Re:vaporware by Anonymous Coward · · Score: 2, Interesting

      80% of Intel performance at 12% of the cost.

    16. Re:vaporware by Anonymous Coward · · Score: 0

      The P4 would have done a lot better of it didn't spend so many of its superior cycle count stalled waiting for RAMBUS to fetch data from memory and flushing the pipeline due to incorrectly predicted branches. I still prefer AMD but their current products don't work well for me.

    17. Re:vaporware by lightknight · · Score: 1

      *shrugs*

      AMD's strategy was to switch to milling out 2 cores or so per unit, aka the Bulldozer architecture, and then stitching them together into a processor. I guess it makes the design more compact / easier to fab.

      --
      I am John Hurt.
    18. Re:vaporware by billcopc · · Score: 1

      Yep. This is where AMD lives and dies: the budget segment. That's where they stomp Intel, which prefers to keep its high margins and the mindshare that comes along with having the fastest chip of them all.

      For myself, AMD would have to push out very affordable 4-socket and 8-socket Opteron solutions, like they did in the K8 days. These days, it's a better value for me to spend the big bucks on Intel workstations and ride them out for an extra year.

      And when i say workstation, I'm thinking "server board with GPUs", whereas you seem to be thinking in terms of a standard desktop PC with high quality components. I still think you'd be better served by an Intel i7-2500k, which is also quiet, cheap and powerful (for a single-CPU system).

      --
      -Billco, Fnarg.com
    19. Re:vaporware by Anonymous Coward · · Score: 0

      "Powerful" doesn't necessarily have to mean "absolutely the most powerful you can get anywhere".

    20. Re:vaporware by Daniel+Phillips · · Score: 0, Flamebait

      You first say you picked all three "quiet, "powerful", "cheap". Then you say you dropped the powerful to get the cheap. I'm confused.

      Do not be confused, gentle reader. You may understand that as "powerful enough; very powerful indeed". And please do not put words in my mouth.

      --
      Have you got your LWN subscription yet?
    21. Re:vaporware by tyrione · · Score: 4, Interesting

      You must not work in Parallel Programming, doing any heavy engineering analysis/modeling. Taking advantage of all those threads and cores within Bulldozer and utilizing it with OpenCL along with the GPGPUs is a dream come true. More and more modeling environments are leveraging all that this architecture offers, but to you if your game doesn't presently use it it's worthless. To each their own.

    22. Re:vaporware by Daniel+Phillips · · Score: 1, Insightful

      If you add up the user-affecting latency over the lifetime of the processor, and multiply by your hourly rate, you would have to be a fairly lowly serf to not pay for it several times over.

      Oh indeed, they pay me peanuts. I should send somebody an email.

      Or maybe I should just be thankful that today I have a supercomputer[1] class computer under my desk for which I paid peanuts.

      [1] As of not very long ago at all.

      --
      Have you got your LWN subscription yet?
    23. Re:vaporware by elfprince13 · · Score: 1

      This article has an informative diagram.

    24. Re:vaporware by Anonymous Coward · · Score: 0

      I think he means. .3 watthours which is accurate for the lightbulb part. That much energy in a fraction of a second probably isn't safe in a computer.

    25. Re:vaporware by Johann+Lau · · Score: 2, Insightful

      Nah, it just makes you a tool. You're too stupid to be considered a villain, sorry.

    26. Re:vaporware by Daniel+Phillips · · Score: 1

      These days, it's a better value for me to spend the big bucks on Intel workstations and ride them out for an extra year.

      Your strategy confuses me. In the "extra" year you will lose big.

      --
      Have you got your LWN subscription yet?
    27. Re:vaporware by haruchai · · Score: 2

      There was a time 8-12 years ago where it looked like AMD could have snatched the performance crown. But, without the Fab expertise to match Chipzilla, it just never happened and nothing short of a fantastic screwup by Intel or an astonishing breakthrough by AMD will close the gap. But, AMD has been rock-solid for my personal needs and make it so easy to keep migrating to newer CPUs / Mainboards that I haven't run an Intel desktop, at home, in 10 years.

      --
      Pain is merely failure leaving the body
    28. Re:vaporware by Anthony+Mouse · · Score: 5, Interesting

      Now, here's the puzzling part: they want to use bulldozer, the failure, as the new core for the A series, the success. I hope they find a way to fix it, otherwise my next rig will have an Intel for the first time in ten years.

      I think the people calling bulldozer a failure have the wrong expectations. The core used in the existing A series is a direct descendant of the original Athlon from 1999, which itself was very similar to (and designed by the same people as) the DEC Alpha introduced in 1992, predating even the Pentium Pro. Suffice it to say that there isn't a lot of optimizing left to be done on the design.

      Bulldozer is a clean slate. The current implementation has some obvious shortcomings, not least of which that the cache architecture is lame. (The L1 is too small and the L2 latency is too high. They might actually do pretty well to make a smaller, lower latency, non-exclusive L2 and use the extra transistors for a bigger L3 or even an L4.) But that's not a bad thing. It's something they can fix and make future generations faster than the current generation. Which is the problem with the old K10 -- there are no easy little changes left to be made to make it substantially faster than it is now.

      The other part of the problem is that people want Bulldozer to be something it's not. It isn't designed for first in class single thread performance. It's designed to have adequate single thread performance while reducing the number of transistors per core so that you can have a lot of cores. It's designed for the server market, in other words. And to a lesser extent the workstation market. They designed something that would let them compete in the space that has the highest margins. So now all the high-end gamers who only care about single thread performance are howling at the moon because AMD concluded it couldn't compete with Intel in that sector and stopped trying.

      What you have to realize is that it isn't that the design is flawed. It's that you aren't the target market. They could have built something that achieved 90-100% of Intel's best on single threads instead of 60-80% by doubling the number of transistors per thread and halving the number of threads and cores, but think about who would buy that. PC enthusiasts who comprise about 0% of the market. It wouldn't sell in the server market because the performance per core * number of cores would be lower. It wouldn't sell in the budget market because it would require too many transistors per thread and therefore cost too much to manufacture.

      Instead, with Bulldozer they can use more modules and sell to the server market or anyone else with threaded software and then and use fewer modules in combination with a GPU and sell to the budget market and the midrange gaming market, and leave the six dozen howling high-end PC gamers to Intel.

    29. Re:vaporware by Nursie · · Score: 4, Informative

      Err, there was a time 8-12 years ago when AMD *did* snatch the performance crown.

      Around about the time of the Athlon 64's appearance, when Socket 939 came along, they were actually both faster and cheaper than Intel. Nothing intel had could match the FX range on the desktop, and nothing intel were doing in the server room could match Opteron at the time. Intel was struggling with its netburst architecture (IIRC) which had high clock speeds and performed slightly better under some loads (video encoding IIRC) but markedly worse for pretty much everything else.

      It didn't last long, Intel took back the performance crown, and after a few years made serious inroads into the budget sector as well. But for a brief, shining moment (around the time the FX-55 and 57 were released) AMD held the crown.

    30. Re:vaporware by plonk420 · · Score: 1

      i run AMD on my desktop (Phenom II X3 705e (65 watt)) and i'm quite sure any bottlenecks i encounter are HDD-related. even my ghetto LVM raid server is AMD.

      gaming, however, i'm quite taken with Intel. Bulldozer isn't too bad, tho, depending on the game. it "keeps up with" Intel @ 1080p and beyond on most games ...except--amusingly--some huge AAA titles like Civ 5, Starcraft II, and Skyrim. (tho it's 99.9% identical in performance to Intel on the utterly gorgeous Battlefield 3, and very very close in WoW and Crysis 2)

    31. Re:vaporware by Anonymous Coward · · Score: 1, Insightful

      99.99% of users don't work in parallel programming.

    32. Re:vaporware by plonk420 · · Score: 1

      oh, and it beats the i7-2600 with x264 in the second pass (the encoding pass) ... and presumably if you use --slowfirstpass (i'm a video encoding nerrrrd)

    33. Re:vaporware by JonySuede · · Score: 1

      I got a bulldozer 8250 for 179$ and an motherboard AM3+ for 139$.
      It run everything I want well and I never have to kill anything before starting a demanding game.
      It compiles speedily enough that my vertex 2 is now the bottleneck when I run maven.

      So please, tell us what Intel could have offered me in term of performance with a set price of 318$ ?

      --
      Jehovah be praised, Oracle was not selected
    34. Re:vaporware by nadaou · · Score: 1

      hear, hear

      --
      ~.~
      I'm a peripheral visionary.
    35. Re:vaporware by JonySuede · · Score: 0

      I did not click on the parent button yet, so I do not know what triggered your response. But that being said, I must admit that it was one of the funniest ad hominem witticism I ever saw !

      --
      Jehovah be praised, Oracle was not selected
    36. Re:vaporware by Hal_Porter · · Score: 1

      You mean the one from page 7 of TFA?

      --
      echo -e 'global _start\n _start:\n mov eax, 2\n int 80h\n jmp _start' > a.asm; nasm a.asm -f elf; ld a.o -o a;
    37. Re:vaporware by haruchai · · Score: 1

      That's what I was referring to. Apart from solving the Fab issues, I don't know what else AMD could have done. They had ongoing problems with yields and there were initial problems with power consumption. I vaguely recall that you had to use either ( or both ) certified coolers or power supplies or the warranty was void. Intel took FOREVER to get their version of Hypertransport and the Alpha-derived designs to market but once they did, with Nehalem, they've not looked back. If they ever revive and perfect Larrabee or absorb Nvidia, AMD might be forever consigned to the very bottom of the budget bin. A few months ago, my team at work was looking for a decent dev box as we were tired of shoehorning Win2008 R2 onto older desktop PCs and one of the developers suggested the HP z210 workstation - for $2,000, we got a 3 X 1TB, 16GB, Core i7-2600k. Damn, that box is FAST and QUIET. It sat running, case open, on my desk to a full workday and I couldn't hear the CPU fan. I could have saved quite a bit of cash if I were allowed to buy OEM parts but the PHBs get all nervous if it's not all "HP-approved".

      --
      Pain is merely failure leaving the body
    38. Re:vaporware by TheGoodNamesWereGone · · Score: 2

      The only workstation class machine with which I have been completely happy is powered by an AMD 4 way Phenom II.

      My last box was a quad-core Phenom II. It served me well. There's no denying though, that Intel's current i7s (I have a 2600K) blow everything else out of the water. I fervently hope AMD will come up with something to challenge it. Competition is good.

    39. Re:vaporware by level_headed_midwest · · Score: 3, Informative

      AMD *does* push out affordable 4-socket Opteron setups- the Opteron 6000 series CPUs. They are selling those a whole ton less expensive now than they did in the K8 days. The least-expensive Opteron 6000s sell for $266 each and the most-expensive ones are around $1200-1500, compared to starting around $800 each and going on up to close to $3000 for the K8-era 4-way-capable Opterons. Considering a 4-way-capable Intel Xeon still costs close to $2000 and goes on up to near $5000- and is based on two-year-old technology- the Opterons are that great deal you were wishing for.

      However on the desktop, Intel has gotten much better in their pricing (i.e. they don't cripple lower-end chips as severely as they used to) and is giving AMD a real run for their money.

      --
      Just "gittin-r-done," day after day.
    40. Re:vaporware by Daniel+Phillips · · Score: 0

      You first say you picked all three "quiet, "powerful", "cheap". Then you say you dropped the powerful to get the cheap. I'm confused.

      Do not be confused, gentle reader. You may understand that as "powerful enough; very powerful indeed". And please do not put words in my mouth.

      What was not accurate about that?

      --
      Have you got your LWN subscription yet?
    41. Re:vaporware by Anonymous Coward · · Score: 0

      Who elevated him from failed buffoon to tool?

    42. Re:vaporware by Nursie · · Score: 1

      "They had ongoing problems with yields and there were initial problems with power consumption. I vaguely recall that you had to use either ( or both ) certified coolers or power supplies or the warranty was void."

      I have no idea about Opterons or the server room, but in the land of the desktop that wasn't so. The FX chips may well have needed a good power supply and decent cooling (especially if you were going to take advantage of their clock-unlocked features), but in general the high-end gamer PC world was as free-for-all as ever.

      But you're right, yes, Intel got their act together and took the world back. Core and Core 2 started it, then Nehalem and successors just sealed the deal.

    43. Re:vaporware by Anonymous Coward · · Score: 0

      That's about as sensible as saying 99.99% of users don't work in programming to imply that code execution speed is irrelevant.

    44. Re:vaporware by wanzeo · · Score: 1

      Fantastic explanation.

    45. Re:vaporware by TheGoodNamesWereGone · · Score: 1

      Civ V was slow as cold molasses on my Phenom II 810 (quad core, 2.6GHz) during the late game on large maps. The performance jump in it when I got this i7-2600K (quad core with HT=8, 3.4GHz) I'm using now surprised me. Like you say, other apps run at more similar speeds.

    46. Re:vaporware by Anonymous Coward · · Score: 0

      I sure hope you, others like you and the hopefully still numerous AMD fanboys buy millions of AMD CPUs. Someone has to keep AMD alive while the rest of us buy Intel.

    47. Re:vaporware by Anonymous Coward · · Score: 0

      99.99% of users play video games. But you are the 1% that thinks heavy graphics games are single threaded.

    48. Re:vaporware by Johann+Lau · · Score: 1

      why thank you ^^

    49. Re:vaporware by rrohbeck · · Score: 1

      I'll bet over 50% of programmers run parallel makes.

    50. Re:vaporware by TheLink · · Score: 3, Interesting

      This might be enlightening: http://hardforum.com/showpost.php?p=1037482638&postcount=88

      What did happen is that management decided there SHOULD BE such cross-engineering ,which meant we had to stop hand-crafting our CPU designs and switch to an SoC design style. This results in giving up a lot of performance, chip area, and efficiency. The reason DEC Alphas were always much faster than anything else is they designed each transistor by hand. Intel and AMD had always done so at least for the critical parts of the chip. That changed before I left - they started to rely on synthesis tools, automatic place and route tools, etc. I had been in charge of our design flow in the years before I left, and I had tested these tools by asking the companies who sold them to design blocks (adders, multipliers, etc.) using their tools. I let them take as long as they wanted. They always came back to me with designs that were 20% bigger, and 20% slower than our hand-crafted designs, and which suffered from electromigration and other problems.

      That is now how AMD designs chips. I'm sure it will turn out well for them [/sarcasm]

      And that comment was back in 2010. No surprise now Bulldozer is slower and uses more power, and the only advantage is it has more cores (meh, any idiot can add more cores, at worst case you just add another computer[1]).

      [1] The same embarrassingly parallel tasks that do well on multiple cores will do well on multiple computers.

      --
    51. Re:vaporware by slydder · · Score: 2

      They could have left ATI the fuck alone and concentrated on doing that which they were really good at. Chip design.

      Once they started messing with ATI and GPU's and automated chip design it all started to go downhill.

      Here lately they are moving back to manual chip design which is why it's taking a bit to get back into gear. I only hope they can get it working before it's too late.

    52. Re:vaporware by kyrio · · Score: 2

      You have to ignore the people who go on about AMD not being worth the money (though I have to admit that Bulldozer was a huge flop). Last year I got my 955BE and motherboard for $200 total. Nothing Intel offers can come close to that for a CPU and Mobo. The CPU alone would be at least $150, to match the Phenom II X4 955BE. I got a high quality motherboard and high quality CPU for about the cost of Intel's lower end CPUs.

    53. Re:vaporware by Chrisq · · Score: 3, Funny

      99.99% of users don't work in parallel programming.

      But the 0.01% who do, each write 10,000 programs at a time

    54. Re:vaporware by kyrio · · Score: 1

      Most games are only now adding support for two cores. The gaming world is a joke, even more so than the application development world.

    55. Re:vaporware by Anonymous Coward · · Score: 0

      Except that that article is by someone who is biased, and possibly blindly stuck in their old ways.
      Yes, some things won't get optimal results using tools.
      However AMD also had significant parts of the design that got a lot faster and smaller by not hand-crafting it.
      The argument is very much the same as the one for or against writing a program in assembler.
      You will have exactly the same thing: If you have a carefully hand-crafted assembler routine it's almost certain that any compiler will only produce something at least 20% slower.
      That doesn't mean that writing a whole 10 million lines (if written in C) project in assembler is sensible.
      Btw. ATI has always used those tools, and nobody seems to claim they will never be able to design a fast GPU that way.

    56. Re:vaporware by Anonymous Coward · · Score: 0

      Citation needed.

      I can't remember the last game I played which didn't make use of multiple cores. Of course, I'm disregarding any games which have no need to make use of multiple cores due to low processing requirements. I'm genuinely interested to hear what the most recent game lacking multicore support was.

    57. Re:vaporware by Kjella · · Score: 1

      What you have to realize is that it isn't that the design is flawed. It's that you aren't the target market. They could have built something that achieved 90-100% of Intel's best on single threads instead of 60-80% by doubling the number of transistors per thread and halving the number of threads and cores, but think about who would buy that.

      Do the math here, if Bulldozer's cores were 60-80% of Intel's then their 8 core chip should perform 120-160% of Intel's quad core chips in multithreaded performance. Instead it's stuck somewhere between the 2500K and 2600K and the 2500K doesn't have hyperthreading, which is like driving with the hand brake on in these tests. They reach parity only if you have perfect threading but if you have uneven loads like a server with different VMs they don't look so good, not just for games.

      Look at the pipeline now, they're launching the FX-6200 and FX-4170 which are both 125W TDP parts and will probably be competing with Intel's 65W processors, 95W at best. And soon Ivy Bridge will bring that down even further. Does this remind you of another company that couldn't get their chips performing so they had to turn up the GHz and the wattage, making them run very hot and loud? Bulldozer is AMD's Pentium IV, hopefully it's not quite as terrible as Intel's where they had to scrap it and start over from P3 designs but right now it looks awfully similar.

      --
      Live today, because you never know what tomorrow brings
    58. Re:vaporware by RCL · · Score: 1

      You are implying that only PC enthusiasts are interested in single-thread performance. How come that Intel invests so much in improving it, then, and does not lose its market share to AMD? This is also at odds with AMD's improvements in Turbo Core in Bulldozer.

      I am afraid that either most of desktop workloads remain essentially single-threaded (e.g. compilers are single-threaded if you want to perform global (whole program) optimizations as opposed to trivial - but suboptimal - splitting the program to multiple compile units) or it is peak single threaded performance that affects the feeling of "snappiness" that customers want to pay money for.

    59. Re:vaporware by zigurat667 · · Score: 1

      If you wanted to know what it is, you might want to read this paper.

      I found it quite interesting, despite not understanding that much.
      Maybe it's the 'fuzzy whitepaper' OP mentioned, but we'll never know.

    60. Re:vaporware by hairyfeet · · Score: 1

      THAT is not the problem, the problem is with the chip design my 2.6GHz Thuban will most likely stomp the dogshit out of it in most of the benchmarks. You see basically AMD has their own netburst on their hands. As most remember a 3.6GHz netburst P4 can get the dogshit stomped out of it by a 2.2GHz Intel Core arch simply because of IPC, or instructions per clock.

      Now what AMD did wrong as several things, first they are basically lying out their ass and trying to sell dual cores as quads, quads as octcores, so naturally the comparisons to actual quads and octacores is gonna suck, again my Thuban will stomp it simply because it has 6 ACTUAL cores as opposed to 4 real cores and 4 virtual ones. in fact if one disables every other core in a FX chip then that chip WILL get faster IPC, why? Because all the other core is is hardware assisted hyperthreading and it is sucking the shit out of problem number 2, which is they gimped the living hell out of the cache. As everyone probably knows AMD chips usually can get by with less cache than an intel chip, but only to a point, see how much more lousy a Sempron is than an Athlon simply because the cache is too gimped. because of the hardware assisted HT they have a lousy 16KB! Yes you read that right, 16KB per core of L1 cache. that means older programs or ones that don't hit the L3 are gonna be gimped right out the gate. Look up the reviews for the FX4100 and see how the reviewers are shocked that putting the new core design against the previous Deneb and even with a 1GHz speed boost it was still getting its clock cleaned and only by OCing the living shit out of the cores (over 4GHz) were they able to get it to pull ahead of Deneb, which brings me to 3, they made the pipes too long in search of speed. I guess Intel having the crown as being an OCing monster got to them because they made the pipes MUCH longer than Stars arch in the hopes of high clock speeds, but as we saw with netburst having long pipes means MUCH bigger risk of a stall and much lower IPC as a result. again this is why the BD was stomped by Stars as the longer pipes meant that only in certain specialized jobs could the chip shine.

      In the end as someone who has been selling AMD exclusively since the Intel bribery and compiler rigging came out this chip is junk and i just can't sell it. they are trying to get more than quad core money for a dual core with HT thus killing AMD's "Bang for the buck" and the killing of the AM3 line when only a single chip was needed to keep the entire line shows their incredible shortsightedness and a company with no real leadership. A single 95w Thuban could have kept the entire AM3 line alive while they worked on BD/PD, all that came out perfect would be X6, any with bad cores would be Phenom X4/X3, any with bad cache would have been Athlon X4/X3. this would have kept not only the entire Am3 line alive but also would have given value hunters and the DIYers on board as they could be buying X3s and X4s and hoping to unlock an X6 and the X6 is frankly kick ass for gaming and transcoding thus giving them a good chip to keep the gamers on a budget and AMD fans happy.

      You just can't get around one fundamental fact...the BD/PD core is really a server chip, it excels in server roles. the problem with this is server roles are the complete and polar opposite to what the average user of a desktop or laptop actually DOES with their machine and in the jobs that home users and SMBs have a PC or laptop for the BD/PD design is a real dog, hell I'd compare it to a Pentium 4 but sadly that wouldn't be fair to the P4, its actually more like a netburst Celeron in that lack of cache has gimped the shit out of the chip. Look up the benches for the BD folks and remember they are expecting ONLY about a 15%-20% gain. Now when the chips are having to be OCed like mad to keep up with the last gen offering, much less the Intel chips, do you REALLY think a lousy 20% increase to such a flawed design is gonna help?

      --
      ACs don't waste your time replying, your posts are never seen by me.
    61. Re:vaporware by anyanka · · Score: 1

      Also, experience with the Athlon has probably shown them that it's impossible to get a good foothold in the desktop market, even with a superior product, since the main PC manufacturers are in bed with Intel. The tiny margins don't help either. In the high-end market, the margins are better, and they've already had quite a bit of success in the supercomputer market (Cray, for example), where the CPU/GPU combo is highly desirable.

      What surprises me a bit is how they've completely held off on marketing it so far. At their booth at Supercomputing'11 in November, they had *no one* from AMD who knew anything about the APU, they could only refer to a university partner who had done some experiments. They're probably waiting for the high-end APUs, but still, I would have thought that they'd be interesting in getting people started with experiments, so they have something to show off later.

    62. Re:vaporware by billcopc · · Score: 2

      The Athlon 64 was indeed awesome. I was a full-on raging AMD fan back then, eventually culminating in an 8-way Opteron workstation: the good old Tyan Thunder K8QW. Only problem was, AMD stagnated for way too long. When I upgraded from the A64 to the X2, it was a huge leap (obviously), stomping all over Intel's overpriced Pentium-D. But then, Intel came out with the Core 2 series, and AMD just kept releasing die-shrinks of the same old CPUs. I had nothing to upgrade to. I eventually tired of waiting for a new Athlon to seduce me, and the Phenom was plagued with terrible reviews, so I went with an Intel Q6600 rig when they finally dropped its price in late 2007. Oh, and I overclocked the tits off that thing :)

      Even two years down the road, when I was shopping for yet another PC, there was nothing from AMD that could outpace my C2Q. I kind-of felt like AMD was trying to peddle 5-year old processors. Phenom II was OK for the mid-range, but with Intel now flogging very overclocker-friendly i7's under $300, with far more tricked-out boards to match, AMD just wasn't for me. I did build lots of office/surf machines around the Athlon X3 though, but even that has come to pass. Right now, for a true budget build, the Intel G850 is where it's at. It's a sad day when Intel beats AMD at the bottom end of the market.

      --
      -Billco, Fnarg.com
    63. Re:vaporware by AlienIntelligence · · Score: 1

      My last box was a quad-core Phenom II. It served me well. There's no denying though, that Intel's current i7s (I have a 2600K) blow everything else out of the water. I fervently hope AMD will come up with something to challenge it. Competition is good.

      I'm curious, since you specified a K chip, have you OC'd it... what nbridge and mobo did you choose?

      -AI

      --
      For me, it is far better to grasp the Universe as it really is than to persist in delusion
    64. Re:vaporware by hairyfeet · · Score: 3, Informative

      May I make a suggestion? Tiger has been selling their remaining stocks of 95w Thubans (in case you haven't heard in a serious "WTF are they thinking?" move AMD has killed AM3 for two sockets that have less than a year of life in them, FM1 and AM3+) for around $100. Sign up for their emails, that is where they have been offering it as of late. i got one and with the money i saved upgraded my ECS board to a nicer Asrock and i must say i couldn't be happier, the 1035T is not only around 40% faster than my 925 Deneb but whereas the Deneb would max out at around 139f doing transcodes with the hyper N520 cooler i paired the thuban with i'm getting a MAX of 114F and that's after 7 and a half hours of slamming the CPU with Virtualdub. At idle this baby is literally below room temp, no shit looking at Coretemp my chip is at 67f and the room is 72f. Frankly I've never been happier with a chip upgrade in my life and its just a damned shame AMD has killed AM3 but their loss is your gain if you jump on it and snatch one while they're cheap. I mean 6 cores for $109? How can you beat that? Paired with 8gb of RAM and a CF enabled board i figure this baby will last me until 2020 easy, what a sweet chip.

      But for everyone that wants to save some money and have a nice chip snatch one of the AM3s NOW before the stock runs out because when they are gone, that's it. I went ahead and built my GF a new Athlon X3 box and gave the Deneb to my youngest and as soon as this next batch of laptops gets sold I'll be building the oldest an X3 or X4 before supplies run out. The really nice Am3 boards have never been cheaper and paired with 4-8Gb of DDR 3 and a Hyper212 or hyper N520 they make pretty badass desktops, plenty of OCing headroom if you desire and easy to unlock so that X3 can easily be the cheapest quad you'll ever buy. But for me that X6 so cheap? hell how could you not love getting 6 cores for $109 shipped? That's a no brainer.

      --
      ACs don't waste your time replying, your posts are never seen by me.
    65. Re:vaporware by hairyfeet · · Score: 4, Informative

      Actually I'd say buying ATI was one of the smartest things they ever did. one can argue if they had waited until the market tanked they could have gotten it cheaper but hindsight and all that. But have you tried bobcat? Less than 18w for a dual core with an HD6310 GPU and often runs at less than 12w. hell AMD had to slow down their desktop production simply because they didn't have enough capacity to meet demand for the Brazos platform. If that's failure I'll take two please. Go to someplace like Tiger and see how many units you have with the E350, we are talking netbooks and laptops, HTPCs and all in ones, the OEMs are cranking out new designs to use those chips as fast as they can. I walked into my local Wally World the other day and less than 4 units were Intel, the rest? All AMD Fusion. And don't forget this is still running on VLIW GPUs, the next revs will replace them for vector units which should behave like a hyper powerful FP when not needed for graphics.

      so I'd say while AMD has made some SERIOUS mistakes, killing the AM3 line and Stars arch before getting the bugs fixed (or better yet replacing for the consumer chip) the BD/PD design, trying to push a server chip like BD/PD as a desktop chip, frankly the APUs created thanks to the merger have been one of the few smart moves they've had. With Brazos they have a unit that stomps Intel+ION while often costing less than intel alone and thanks to intel shooting themselves in the face by killing the Nvidia chipsets there won't be any new ION designs. With Brazos you have a unit that sips power, is quiet, low enough heat it can be passively cooled, while still able to do 1080p over HDMI. If you haven't tried one you really should, its a sweet chip.

      --
      ACs don't waste your time replying, your posts are never seen by me.
    66. Re:vaporware by Tough+Love · · Score: 1

      These days, it's a better value for me to spend the big bucks on Intel workstations and ride them out for an extra year.

      Your strategy confuses me. In the "extra" year you will lose big.

      Huh, overrated? Tonight we are infested with Intel fanbois?

      --
      When all you have is a hammer, every problem starts to look like a thumb.
    67. Re:vaporware by Tough+Love · · Score: 2

      You first say you picked all three "quiet, "powerful", "cheap". Then you say you dropped the powerful to get the cheap. I'm confused.

      Do not be confused, gentle reader. You may understand that as "powerful enough; very powerful indeed". And please do not put words in my mouth.

      What was not accurate about that?

      Mod points do not equal reasoning.

      --
      When all you have is a hammer, every problem starts to look like a thumb.
    68. Re:vaporware by Daniel+Phillips · · Score: 1

      The only workstation class machine with which I have been completely happy is powered by an AMD 4 way Phenom II. Quiet, powerful, cheap, pick all three. And looking around, I would say that its successor is highly likely to be an AMD 6 way, 45 nm process chip. Best value by far for my money.

      Today I can choose slightly less latency with Intel or significantly more value with AMD. Call me cheap, but I will take the value, thank you.

      Wow, I am totally unimpressed with Intel tonight... the pattern of downmods for favorable comments about AMD strongly suggests hired astroturfers, right down there on a moral par with Apple.

      --
      Have you got your LWN subscription yet?
    69. Re:vaporware by Daniel+Phillips · · Score: 1

      They could have left ATI the fuck alone and concentrated on doing that which they were really good at. Chip design.

      They improved ATI GPUs a lot by bringing in better process technology and throwing more engineers at it than ATI could afford. And they opened up the register specs, which is more than fine by me.

      --
      Have you got your LWN subscription yet?
    70. Re:vaporware by arogier · · Score: 1

      Agreed. For anything I'd want to do on a 12" laptop an E-350 plus and SSD just kills lag. Especially compared to my previous configuration of Atom and the same (intel) SSD.

    71. Re:vaporware by Rockoon · · Score: 1

      Do the math here, if Bulldozer's cores were 60-80% of Intel's then their 8 core chip should perform 120-160% of Intel's quad core chips in multithreaded performance.

      Before you start doing math, correct your numbers.

      That chip you are calling 8-core is a 4-module chip, closer to Intels HT offerings (4C/8T) than to what you are claiming (8C/8T)

      That is, correct your numbers unless you dont want to let facts get in the way of your inner banboy.

      --
      "His name was James Damore."
    72. Re:vaporware by Anonymous Coward · · Score: 0

      0) Bulldozer is slower.
      1) Nobody was talking about "writing the entire thing in assembler" so you can leave out that strawman argument.
      2) GPUs are a fair bit different from CPUs.

    73. Re:vaporware by Anonymous Coward · · Score: 0

      I'm using a bulldozer with a gtx570 for 3D gaming (with no over clocking), and haven't had any problems so far. I can pump out a steady 120fps on everything at full graphics I've thrown at it, so I can't see what the gamers have to complain about.
      What with it having 8 cores, even if the game *is* only using two of them, that still leaves 6 to handle O/S overheads, physics, etc. which should (in theory) lead to a more stable experience.

    74. Re:vaporware by Rockoon · · Score: 1

      Instead, with Bulldozer they can use more modules and sell to the server market or anyone else with threaded software and then and use fewer modules in combination with a GPU and sell to the budget market and the midrange gaming market, and leave the six dozen howling high-end PC gamers to Intel.

      I wouldnt say that i5 buyers are howling high-end PC gamers.. the i5-2500K is cheaper and performs better on most benchmarks (including multi-threaded) than the FX-8150.

      This is clearly the mid-range market with prices between $225 and $250, not the high end gaming market. AMD has some absolutely incredible budget offerings (the A8 series APU's for instance) that Intel simply cannot touch, but don't kid yourself about the mid-range market.. where Intel is highly competitive even when its not winning outright.

      --
      "His name was James Damore."
    75. Re:vaporware by drinkypoo · · Score: 1

      99.99% of users don't work in parallel programming.

      Today basically all users can benefit from more parallelism, as even the operating system is heavily multithreaded. Disingenous comment is disingenuous (as well as anonymous and cowardly.) It is now ordinary (since the Xbox 360) for games to be designed to take advantage of multiprocessing.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    76. Re:vaporware by ifiwereasculptor · · Score: 1

      I agree with you. AMD can't keep using stars forever. When I called BD a failure, though, I was merely commenting on general performance, power consumption and price of the current incarnation. Even when dealing with highly parallelized workloads, BD's strong suit, all AMD could do was achieve relative parity with the i5, and while drawing much more power and costing a bit more. So the tradeoffs weren't really worth it. I'm hoping they can improve on it, but one wonders why they have released something that obviously wasn't fit for competition yet and then hyped the fuck out of it.

      As for me not being the target market, based on your own definition of what it is, I'll have to disagree. I'm a video editor on a budget, so I'm in on two counts. Still, I have no reason to ditch my Athlon II X3. Not for something like the FX-4100. It's too much of an investment for too little gain. As for servers, I remember Anand running benches on the new Opterons and calling them a "disaster". I don't really know much about server workloads, so I won't comment, but I suspect power consumption plays an even higher role there.

      What I'm saying is I understand what they did. Unfortunately, it didn't work. Not yet. Interestingly enough, the FX-4100 is on par with the A8-3850, CPU-wise on threaded workloads, and they cost about the same. So why would you pass on what amounts to a free discrete-level GPU? Forget about AMD vs Intel - Bulldozer, right now, makes AMD vs AMD hard enough for... huh... both of them.

    77. Re:vaporware by Anonymous Coward · · Score: 0

      So where do you see the big difference between designing a huge CPU completely on transistor level and writing a huge program purely in assembler? Or rather one critical enough to make it a strawman.
      Anyway I am not saying that I know whether changing their process was a good idea or not and whether it affected Bulldozer performance, I don't see much proof either way.
      Bulldozer performance might be almost completely architecturally caused and the tools might not matter (or not). It is possible that using the old processes they might have had something faster, but also 3 years later (or not).

    78. Re:vaporware by Courageous · · Score: 1

      It's designed for the server market, in other words.

      If a parallel chip designed for the server market isn't tearing down the house on VMmark results in this day and age: epic fail.

      C//

    79. Re:vaporware by DarthVain · · Score: 1

      Correction:

      99.99% of programs do not take advantage of parallel programming.

      Though if you are like me and like to do a bunch of things all at the same time, then yes there would be at least some use.

    80. Re:vaporware by Anonymous Coward · · Score: 0

      The only workstation class machine with which I have been completely happy is powered by an AMD 4 way Phenom II. Quiet, powerful, cheap, pick all three.

      Yeah, but I bet your google searches aren't as fast as they could be.

    81. Re:vaporware by Kjella · · Score: 1

      Before you start doing math, correct your numbers. That chip you are calling 8-core (...) correct your numbers unless you dont want to let facts get in the way of your inner banboy.

      Oh, I let facts get in my way like that AMD is calling it an 8-core?

      AMD (NYSE: AMD) today unleashed the AMD FX family of CPUs, delivering a fully unlocked and customizable experience for desktop PC users. The AMD FX series of desktop CPUs includes the first-ever eight-core desktop processor

      There you have it, in their own damn press release. Besides it's not my math, there's no other way to interpret "They could have built something that achieved 90-100% of Intel's best on single threads instead of 60-80% [but] the performance per core * number of cores would be lower." than that he counts each module as two cores. If he didn't, the statement would be meaningless. And if one core is performing 60-80% of an Intel core then two cores (one module) should perform 120-160% for a well threaded application, which we all know it doesn't. Not if we take four modules and four Intel cores to make a FX-8150 and 2500K/2600K either. The truth is one Intel core outperforms one AMD module - or two cores as AMD likes to call it.

      --
      Live today, because you never know what tomorrow brings
    82. Re:vaporware by fast+turtle · · Score: 1

      (in case you haven't heard in a serious "WTF are they thinking?" move AMD has killed AM3 for two sockets that have less than a year of life in them, FM1 and AM3+)

      Please turn in your geek card and don't let the door hit you in the ass on the way out. You forgot that when AMD changes a socket from the AM# to AM#+ that the new socket supports the AM# chips fully. They're adding new features to the socket but that's the advantage. All of those AM3 CPU's will work in the new AM3+ socket without issue, unless the OEM/Bios maker screws something up.

      I will agree it's a WTF is Tiger thinking selling off their stock of 95watt Thubans as they'll work just fine in the new socket. The only reason I can see them making this decision is that the AM3+ Thubans are a better deal and they want to make the most of the stock they've already have before the new ones hit the market at a cheaper price.

      --
      Mod me up/Mod me down: I wont frown as I've no crown
    83. Re:vaporware by WiiVault · · Score: 1

      Turn in your geek card because you held down the shift key and put a # where a 3 is supposed to be.

    84. Re:vaporware by Anthony+Mouse · · Score: 2

      Do the math here, if Bulldozer's cores were 60-80% of Intel's then their 8 core chip should perform 120-160% of Intel's quad core chips in multithreaded performance.

      Only if by "do the math" you mean "ignore the math." Bulldozer modules are neither a complete pair of cores nor a single core with hyperthreading, remember.

      If you run a single thread on a module, it doesn't have to share the FPU or caches with any other threads and will have higher performance -- hence 60-80% of Intel's on single threaded workloads. If you run two threads on the same module than they share some things but not everything as HT would, so instead of having total performance go up by a pittance as it does with HT, it goes up substantially by adding another thread (even though performance per thread goes down somewhat).

      It's actually a pretty slick design if you think about it. If you have a multi-threaded workload then you get better-than-HT scaling and if you have a single-threaded workload you get an extra half a core worth of resources that can be applied to that thread. Clock-for-clock a BD module is pretty competitive with a Westmere-EP core on threaded workloads.

      Bulldozer is AMD's Pentium IV

      It doesn't really look like the Pentium 4 at all. The problem with the P4 was that the pipeline was way, way too long. Prescott had a more than 30 stage pipeline in order to achieve much higher clock speeds, and then it didn't achieve much higher clock speeds.

      Bulldozer isn't about that. The pipeline is about the same length as Sandy Bridge (they're both a little more than half that of Prescott). About the only thing BD has in common with the P4 is that they both run hot. But the P4 ran hot because they were sacrificing everything trying to hit 4GHz on 90nm process technology. Notwithstanding the demos showing BD at 8GHz on liquid helium, AMD isn't chasing GHz here. The heat issues may be attributable to this being the first generation. It remains to be seen whether they can address them in the future, but there is no sense in counting them out this early in the design evolution.

    85. Re:vaporware by Asmodae · · Score: 1

      So where do you see the big difference between designing a huge CPU completely on transistor level and writing a huge program purely in assembler?

      I'm not the AC but one is hardware the other is software. Don't confuse the two.

    86. Re:vaporware by hairyfeet · · Score: 1

      They have already announced that AM3+ gets ONE, count 'em, one more chip, and that Piledriver is FM2 and NOT FM1. Now sure you CAN jam one of the newer chips into the older sockets but....uhhhh....guess what? You LOSE FEATURES. Its like jamming an AM3 into an AM2 socket, goodbye DDR 3, you've just hamstringed your new chip. And the buzz is FM2 is NOT backwards compatible with FM1 due to the new GPU component and AM3+ as I said is getting ONE more chip and then its end of the line. If you look on Tiger or newegg you are paying 40%+ for an AM3+ board over an AM3 and the ONLY advantage you get for the AM3+ board is TWO CHIPS? That's it? Fuck that noise.

      And I'm sorry but the serious WTF was killing AM3 and Stars arch. Check any benchmark you like, you have to buy the FX8150 to compete with that $109 Thuban X6, that's fucking sad man. You have to buy a $289 chip, that BTW is just $15 difference between it and Sandy bridge which stomps the crap out of it BTW, just to beat a chip released two years ago.

      So I'm sorry AMD but if you don't come out with something better priced than bullshoveler and pilecrapper my shop is gonna have NO choice but to go Intel. If I'm gonna have to play socket roulette anyway I might as well get a chip that is more than double the speed, especially when it is less than $20 between it and a lousy AMD quad which BTW just FYI that is EXACTLY what their supposed 8 core CPU is, its just a quad with hardware assisted hyperthreading. For proof look up any of the benchmarks where they turn off every other chip (the hyperthreading) and watch the IPC go UP, by as much as 30%. Their hyperthreading is seriously gimped.

      --
      ACs don't waste your time replying, your posts are never seen by me.
    87. Re:vaporware by hairyfeet · · Score: 1

      Acer os Asus? i got the EEE myself, wonderful little unit, maxed it out with 8Gb of RAM for just $33 and that much RAM with superfetch means the only time the SSD is gonna win is wake and sleep. While the SSDs are nice, don't get me wrong, I have too many media files and the hot/crazy scale with those frankly spooks me as I use my EEE for service calls and the last thing I need is to flip up the lid and have the SSD crap out.But as you've seen the Brazos platform is quite sweet, hell mine feels more like a CULV than a netbook and with an SSD I bet yours IS a CULV as far as performance goes, I just hope the SSD don't crap out when you need it.

      --
      ACs don't waste your time replying, your posts are never seen by me.
    88. Re:vaporware by SiChemist · · Score: 1

      And during that time Intel made sure that it was difficult to buy AMD powered machines from any of their long time partners. (I'm looking at you, Dell.)

    89. Re:vaporware by Anonymous Coward · · Score: 0

      They improved ATI GPUs a lot by bringing in better process technology and throwing more engineers at it than ATI could afford.

      Uh, no, they didn't. AMD can't afford massive waves of engineers any more than former-ATI did. As a matter of fact, overpaying for ATI left AMD with a lot of debt which crippled them for several years, ultimately playing a role in forcing AMD to sell off its fabs. In other words they didn't exactly have a lot of cash to throw around. Now that they're fabless they're turning profits, but they're small profits and they had to lay off a bunch of people just last fall.

      Speaking of those fabs, AMD did not convert ATI GPUs to AMD process tech either. The discrete GPU parts continued to be made at TSMC, even to this day. The only GPUs which have 'moved' to the same process as AMD CPUs are those which are integrated into AMD's "APU" product lines (CPU+GPU on one die).

      And to wrap it all up, the APU product line is the real reason AMD bought ATI in the first place. The iron rule in the semiconductor industry is that you must integrate or perish, because if you don't your competitors will kill you. When AMD bought ATI, it was because they saw the integration of CPU and GPU coming, especially for low end CPUs (the value segment which AMD depends on). However, AMD had no in-house 3D expertise. And it turns out to be very difficult, expensive, and risky to assemble a design team capable of building a GPU HW+SW stack from the ground up, and then executing on that. So, AMD felt the best option was to acquire. They actually tried for NVidia first, but NVidia's CEO wanted to end up running the combined company, and AMD's management didn't want that.

    90. Re:vaporware by Anthony+Mouse · · Score: 1

      You are implying that only PC enthusiasts are interested in single-thread performance. How come that Intel invests so much in improving it, then, and does not lose its market share to AMD? This is also at odds with AMD's improvements in Turbo Core in Bulldozer.

      That's kind of a silly question, isn't it? Why would they lose market share as a result of higher single-thread performance?

      Intel is aiming for the broadest possible market. They want the subset of people who care about maximizing single thread performance in addition to the subset who care about maximizing multi-thread performance and the (by far largest) subset who don't really care about record-breaking performance very much at all and just want the best value for money.

      AMD has concluded that they can't afford to match Intel's R&D budget in order to target every market segment and can get by with only the latter two. And they're not wrong -- they sell as many processors as they can produce.

      I am afraid that either most of desktop workloads remain essentially single-threaded (e.g. compilers are single-threaded if you want to perform global (whole program) optimizations as opposed to trivial - but suboptimal - splitting the program to multiple compile units) or it is peak single threaded performance that affects the feeling of "snappiness" that customers want to pay money for.

      You are apparently under the impression that CPU performance is still a major factor in "snappiness" in modern computers. The fact of the matter is that any K10, Bulldozer or Core processor in the vast majority of desktops spends 99% of its time idle. Moreover, the user without an unlimited budget will find that they experience better performance for the same price by combining an AMD processor with a Raptor or SSD than they would have using a more expensive Core processor with a slower disk. AMD can succeed perfectly well without beating Intel's fastest CPU on single thread performance as long as they can beat several of its slower CPUs at the same or lower price point.

      The example of compiling stuff is particularly artificial because a) the vast majority of users don't ever compile anything (or, really, do anything CPU intensive whatsoever) and b) if you actually care about waiting for compiles to finish then you can turn off the most expensive performance optimizations until you get to deployment for beta testing because in the rare event that you discover it causes a major problem you can in the worst case just turn it back off.

    91. Re:vaporware by hairyfeet · · Score: 1

      How is THAT flamebait? He very politely pointed out that for his needs his AMD quad is "powerful enough" while saving him a LOT of money, what is inaccurate there? I paid $109 for a Thuban 1035T, this baby does a DVD to AVI conversion in less than 15 minutes flat and cost me less than a Pentium Dual would cost, I'd say that's pretty damned powerful for the money. Powerful? Check. Quiet? Can't tell the machine is on with a hyper N520 cooler so check. Cheap? $109 shipped for a 6 core? BIG check there.

      They really need to fix the mod system so that those that are making dozens of accounts can't modbomb as I've noticed this quite a lot lately, especially in chips, anything with Google, Apple, or FOSS. Anyone who checks out the posts on any of the above will find example after example of legitimate views buried by modbombs.

      --
      ACs don't waste your time replying, your posts are never seen by me.
    92. Re:vaporware by Anonymous Coward · · Score: 0

      Your CPU temperature sensor is off. It can't be colder than room temperature unless you have phase-change or peltier cooling.

    93. Re:vaporware by hairyfeet · · Score: 1

      I don't know about the parent but I'd suggest if you want to OC an AMD try the Asrock boards. I have the A770DE+ as I had 8Gb of DDR 2 I wanted to reuse for this build and frankly the Asrock OCing tools are the easiest to use i've ever seen in 20 years of building! paired with a 1035T 6 core i ended up at nearly 3.6GHz before I said "WTF am I doing? What do I need all this extra speed for?" and stopped but frankly if I'd boosted the volts I could have easily hit 4GHz. The Asrock OC Tuner is frankly butt simple, you can save different BIOS profiles and switch on boot to try different OCs and during operation just fire up the Asrock OC Tuner and you can tweak both the OC and the Turbocore to your hearts content. To top off the sweetness if you want to save power it also has a tuner that will turn off phases when in idle thus saving power.

      Give the Asrock boards a try, they are really sweet, and if you sign up for the emails tiger has been selling the 95w Thubans for $109 shipped. put those together with a HyperN520 cooler and you have an OC monster friend, enjoy!

      --
      ACs don't waste your time replying, your posts are never seen by me.
    94. Re:vaporware by AlienIntelligence · · Score: 1

      Most games are only now adding support for two cores. The gaming world is a joke, even more so than the application development world.

      You are old and live in a cave on a desert island.

      Citation needed.

      I can't remember the last game I played which didn't make use of multiple cores. Of course, I'm disregarding any games which have no need to make use of multiple cores due to low processing requirements. I'm genuinely interested to hear what the most recent game lacking multicore support was.

      Me as well... a list I found, compiled by another of minimum CPU recommendations:

      BF3 -2 core
      Skyrim - 2 core
      Rage - 2 core
      Deus Ex - 4 core
      Duke Nukem - 3 core
      Witcher 2 - 4+ core
      Crysis 2 Ultra settings - 3 fast cores
      Bulletstrorm - 3 core

      Here's a better/longer list from early 2010:
      http://www.grandtheftpc.com/2010/03/list-of-quad-core-optimized-games.html

      A Cnet article from 2007:
      http://news.cnet.com/Game-developers-adapt-to-multicore-world/2100-1006_3-6175051.html

      Sooooo... if by "only now", you mean 5 years ago... yeah.

      Citation needed. Or didja forget what year it is, dear ol Rip.

      -AI

      --
      For me, it is far better to grasp the Universe as it really is than to persist in delusion
    95. Re:vaporware by arogier · · Score: 1

      I had a second generation Acer Aspire One and moved to a Lenovo X120e. The SSD I was using in the Acer survived the move and is still stable. If I'd ever moved my video collection from DVDs and streaming I'd run into space trouble, but I can live within the 128 gigs the SSD leaves me with plenty of comfort.

      My three big concerns are battery life, time to wake, and time to launch new applications. I spend a lot of time moving around campus and every second faster the laptop wakes up is more time I have before I have to get my A game on. The biggest speed difference I've noticed with the SSD is using Adobe Lightroom, but its still nice to bring up Firefox, Word, and Power Point faster.

      The thing I love about the E-350 is how well it sleeps. I don't hibernate or use virtual memory due to the write cycle lifespan limit on SSDs, but damn AMD fusion sleeps well. Being a graduate student I've seen my current setup against student's, classmate's, and faculty's setups (overwhelmingly MacBook Airs and iPads this year). It is competitive against anything anyone else in the classroom uses. When the warranty on the SSD quits I might be scared, but right now I am enjoying it.

      Right now I feel more limited by the E-350's ability to only use single channel DDR3. My documents are backed up on a form of media and my documents live somewhere in the cloud. It feels like I have the best of the mobile and desktop worlds for the tasks I need the notebook for. This would change though if I needed to edit video or compile code on the mobile.

    96. Re:vaporware by Anonymous Coward · · Score: 0

      At idle this baby is literally below room temp, no shit looking at Coretemp my chip is at 67f and the room is 72f.

      You're not very bright, are you?

      It is literally impossible for your chip to idle below ambient room temp unless you installed a thermoelectric cooler (TEC) or some other form of refrigeration system. Conventional heatsink + fan coolers can only make the CPU temperature asymptotically approach ambient temp, no matter how large the vapor chamber, fin area, or fan. They can never make it go below. Not even if your CPU uses 0 watts while idling. It's the Law. And I don't mean "law" as in "government", I mean "law" as in "thermodynamics".

      At least one of your temperature sensors (room or CPU) is inaccurate. Not by a small amount, either, since it's also impossible for the CPU core to not rise several degrees above room temp, even while idle. Personally, I'd guess it's mostly the CPU sensor, as there's a long grand tradition of on-die sensors not being very accurate. (Especially AMD's, Intel does them quite a bit better. Probably because the Pentium IV forced Intel to develop mature on-die temperature sensors and thermal control systems much earlier than AMD.)

      Also, reading CPU sensors properly can be... interesting. The raw value from the sensor may need massaging. If the software you're using doesn't implement the right correction calculations for the specific processor you have in hand (the formulas can change with die steppings etc), you'll get a bad readout.

    97. Re:vaporware by Anthony+Mouse · · Score: 1

      I wouldnt say that i5 buyers are howling high-end PC gamers.. the i5-2500K is cheaper and performs better on most benchmarks (including multi-threaded) than the FX-8150.

      This is clearly the mid-range market with prices between $225 and $250, not the high end gaming market. AMD has some absolutely incredible budget offerings (the A8 series APU's for instance) that Intel simply cannot touch, but don't kid yourself about the mid-range market.. where Intel is highly competitive even when its not winning outright.

      I don't think my point was to argue that AMD's offerings are superior to Intel's, but rather that they're competitive. And people are continuously comparing the i5 2500k to the FX-8150 because the comparison makes Intel look good, but try comparing it to something else, like the FX-6200. Because of the way modules work (i.e. they're faster per thread if you only run one thread), I would expect the FX-6200 to match or beat the 2500k at plenty of common workloads: If you're only running one thread (or really up to three threads) then the 2500k can't use all its cores and meanwhile each thread has an entire module to itself and a 500MHz clock speed advantage on the FX-6200, so you wouldn't expect the 2500k to win until you add the fourth thread, and then its advantage would fall off pretty quickly if you add a fifty and sixth thread that allow the FX-6200 to use two threads on all three modules. The 2500k might still be faster with six threads (though it would be pretty close), but then you consider that the FX-6200 is significantly cheaper.

      What I'm saying is that it's a close decision, which means AMD is competitive, which means they'll be selling plenty of chips.

    98. Re:vaporware by Anonymous Coward · · Score: 0

      Learn to read better. The sentence:

      Intel and AMD had always done so at least for the critical parts of the chip.

      does not mean AMD and Intel were doing it for all parts of the chip.

      Therefore "writing a whole 10 million lines (if written in C) project in assembler is sensible." is strawman.

    99. Re:vaporware by TheGoodNamesWereGone · · Score: 1

      It's an ASUS P8 H67-M Pro board. I ran it up from 3.4 to 4 GHZ once but then backed it down to 'automatic' overclocking. I just checked a second ago and at that moment it was running at 3.5. All that was under Win 7-64. You just reminded me I need to set the BIOS. Ordinarily I run Linux, in which case I overclock it at 10%. This is on the stock cooler. These Sandy Bridge chips are pretty cool! (Pun intended).

    100. Re:vaporware by Rudeboy777 · · Score: 1

      I don't put much faith in any what any software that ends in -mark tells me. Show me the real-world performance please! Preferably using DB or server software or my company uses.

      --

      From hell's heart I fstab at /dev/hdc

    101. Re:vaporware by Daniel+Phillips · · Score: 1

      Powerful? Check. Quiet? Can't tell the machine is on with a hyper N520 cooler so check.

      I have a Killawatt meter on my 4 core Phenom II box right now, it shows 80 watts idle or under load. That's from the wall. It makes me have extreme doubts about the numbers showing in some supposedly reputable tech review sites, that show much higher power consumption numbers. By comparison, the HP 4 core workstation I worked with for a year had big noisy coolers in it. I did not get a chance to measure the power from the wall, but obviously more. So I tend to think that Intel talks power efficiency, while AMD walks the walk. Contrary to "popular" opinion, and I also wonder about just how "popular" that opinion really is. Given the pattern of downmods I see on my positive, factual comments on AMD, it makes me feel that Intel has stooped to paying astroturfers to hang out in community sites.

      --
      Have you got your LWN subscription yet?
    102. Re:vaporware by Anonymous Coward · · Score: 0

      The core used in the existing A series is a direct descendant of the original Athlon from 1999, which itself was very similar to (and designed by the same people as) the DEC Alpha introduced in 1992, predating even the Pentium Pro.

      No, sorry, the Athlon core had no significant similarity to any DEC Alpha design. You're conflating the fact that some people left DEC and joined AMD into the cores being the same. As with any other tech industry, people change jobs in the semiconductor industry all the time, but the designs don't walk out the door with them unless they commit a crime! The only time that actually happens legitimately is when the change of employer is due to an acquisition or merger, and that wasn't how ex-DEC people ended up at AMD.

      The only Alpha technology in the original 1999 Athlon was the frontside bus. If I recall correctly, it was licensed to AMD for free because DEC (or whichever post-DEC entity owned Alpha at that point) still harbored dreams of riding Windows NT for Alpha into the PC space, and had troubles getting anybody to build off the shelf systems. Being FSB compatible with the #2 x86 processor vendor was a great way to get their foot in the door of the PC market.

      Suffice it to say that there isn't a lot of optimizing left to be done on the design.

      Who said you can only "optimize" an old design? You can do whatever you like with it, you don't have to stick to just minor tweaks to extract a few more MHz.

      I'll put it this way. Intel cleansheeted the Pentium IV, but that didn't work out so well. When they went back to basing new designs off the venerable Pentium Pro lineage, they started stomping AMD into the ground. They've delivered significant performance increases every generation from Core 2 onwards, even when clocks didn't increase much, because they keep adding new features to the core design (and sometimes replacing significant chunks of it with something better).

      Over time, steady improvement ends up being much the same as a true cleansheet design anyways. The current Sandy Bridge i7 cores may be descended from PPro in some sense, but there's not much left of the original design.

      Bulldozer is a clean slate.

      Pretty sure it isn't actually a true "clean slate". There's lots of signs of bits & pieces carried over from K8 to Bulldozer and modified to fit the new system design. Which is entirely sensible, because reinventing almost everything from scratch ala Pentium IV is too much work for not enough reward.

      The current implementation has some obvious shortcomings, not least of which that the cache architecture is lame. (The L1 is too small and the L2 latency is too high. They might actually do pretty well to make a smaller, lower latency, non-exclusive L2 and use the extra transistors for a bigger L3 or even an L4.) But that's not a bad thing. It's something they can fix and make future generations faster than the current generation. Which is the problem with the old K10 -- there are no easy little changes left to be made to make it substantially faster than it is now.

      There are no easy "little" changes to fix Bulldozer either, because (much like K10) its problems are big problems. The horribly slow caches you mention are a great example. L1 and L2 caches aren't distant things loosely coupled to the core. If you have to gut and replace them, you're going to end up touching a lot of other stuff in the core too.

      What you have to realize is that it isn't that the design is flawed. It's that you aren't the target market. They could have built something that achieved 90-100% of Intel's best on single threads instead of 60-80% by doubling the number of transistors per thread and halving the number of threads and cores, but think about who would buy that. PC enthusiasts who comprise about 0% of the market. It wouldn't sell in the server market because the performance per cor

    103. Re:vaporware by Anonymous Coward · · Score: 0

      What surprises me a bit is how they've completely held off on marketing it so far. At their booth at Supercomputing'11 in November, they had *no one* from AMD who knew anything about the APU, they could only refer to a university partner who had done some experiments.

      I believe Nov. 11 was after AMD gutted their marketing department last year. They needed to cut costs to keep posting profitable quarters, so they pinkslipped almost all the marketers, and consolidated engineering sites (which reduced engineering headcount since not everyone wanted to move). There was probably nobody left to send to SC '11 who knew the market or how AMD planned to serve it in the future.

      They're probably waiting for the high-end APUs, but still, I would have thought that they'd be interesting in getting people started with experiments, so they have something to show off later.

      You're assuming there ever will be "high end" APUs. There probably won't be any.

      AMD has basically conceded the high end to Intel at this point, especially with the tenor of their CEO's recent announcements about the future direction of the company. They know that with a fraction of the resources and (ever since they sold the fabs) not even a ghost of a chance at staying competitive on the process tech side, it's futile to chase Intel's ability to churn out fast high end chips. Instead they're going to be focusing a lot more on integration, low power, etc. Think notebooks and SoCs.

    104. Re:vaporware by Daniel+Phillips · · Score: 1

      Wow, more Intel astroturfers with mod points. There is no shame.

      --
      Have you got your LWN subscription yet?
    105. Re:vaporware by Rockoon · · Score: 1

      Oh, I let facts get in my way like that AMD is calling it an 8-core?

      ..and Boris Bulvik calls it 1000 hickups. What the fuck does what people call it have to do with FACTS? You seem to want to rely on semantics to win your argument.

      You are aware that Bulldozers use modules, which are similar to Intels HT, which you specifically mentioned, right? right? You are aware of the facts, right? Yeah.. you are.. but you do not want to admit them.

      --
      "His name was James Damore."
    106. Re:vaporware by Rockoon · · Score: 1

      And people are continuously comparing the i5 2500k to the FX-8150 because the comparison makes Intel look good, but try comparing it to something else, like the FX-6200.

      Look, I am an AMD user. I dont have a single Intel chip in my house... but lets be honest here..

      The i5-2500K is compared to the FX-8150 because the FX-8150 is $25 more expensive, and when AMD wins in a specific benchmark its only a small advantage going to AMD while when the Intel wins a specific benchmark its usually a significant advantage. The numbers speak for themselves

      Trying to compare to the FX-6200 is sort of laughable thing to suggest. I dont get how you claim that the FX-6200 is significantly cheaper or better performing at anything since you cant even buy one yet, and nobody has benchmarked one.

      --
      "His name was James Damore."
    107. Re:vaporware by AlienIntelligence · · Score: 1

      Thanks for both replies.

      My new build is in the mail... or more specifically on a conveyor
      somewhere, 5 miles away, waiting to be put on a truck. Thanks UPS.

      Went with the i7-2700K (if wondering, because I got it for $50
      less than the lowest price out there, which put it under the 2600K).

      Chose an ASRock Extreme 4, Z68 Nbridge. So, I'm glad to hear
      you like their OC tools. It was one of the reasons I went with them.

      I'm breaking out an old full server tower for this setup, so I can
      put on a Corsair H80 and have room for fans:
      http://www.newegg.com/Product/Product.aspx?Item=N82E16835181016

      I will be saddened if I cannot hit 5GHz

      -AI

      --
      For me, it is far better to grasp the Universe as it really is than to persist in delusion
    108. Re:vaporware by Daniel+Phillips · · Score: 1

      AMD did not convert ATI GPUs to AMD process tech either. The discrete GPU parts continued to be made at TSMC, even to this day. The only GPUs which have 'moved' to the same process as AMD CPUs are those which are integrated into AMD's "APU" product lines (CPU+GPU on one die).

      And what is not massively significant about that? Plus, even in it's current embattled state, AMD is a larger org with more engineers than ATI ever was. And notice that AMD has gained a significant lead in process size reduction for its GPUs over nVidia, including discrete GPUs. nVidia now has to really blow fuses to even create the appearance of matching Radeon in GPU throughput.

      That said, both nVidia and AMD are making great GPU products, no question about it. But I'm strictly AMD until nVidia deigns to reveal its register specs to us little people.

      --
      Have you got your LWN subscription yet?
    109. Re:vaporware by Anthony+Mouse · · Score: 1

      The i5-2500K is compared to the FX-8150 because the FX-8150 is $25 more expensive, and when AMD wins in a specific benchmark its only a small advantage going to AMD while when the Intel wins a specific benchmark its usually a significant advantage. The numbers speak for themselves [anandtech.com]

      Which tells you that you probably ought not to buy the more expensive eight-thread version unless you have a workload that actually has eight threads.

      Trying to compare to the FX-6200 is sort of laughable thing to suggest. I dont get how you claim that the FX-6200 is significantly cheaper or better performing at anything since you cant even buy one yet, and nobody has benchmarked one.

      They seem to be on sale. Moreover, you can pretty well predict how they'll perform: Somewhat better than the FX-8150 on anything with three threads or less (owing to higher clock speed) and somewhat better than 3/4ths as well as the FX-8150 on anything with many threads (owing to higher clock speed but only 3/4ths as many cores).

      That puts it in the same ballpark as the 2500k (at least to the extent that the FX-8150 is) but at a noticeably lower price point.

    110. Re:vaporware by toddestan · · Score: 1

      Yeah, but if you're doing an all-new build, why would you buy a AM3+ motherboard, and then shove an AM3 chip into it? It seems to me that you'd just go AM3+ all the way. The only reason I can see to put an AM3 chip into an AM3+ board is if you already have a Socket AM3 system and you had to replace the motherboard for some reason. It would have been much better if AMD had made the AM3+ chips usable in older socket AM3 boards (such as what they have done in the past) but that's not the direction they went.

    111. Re:vaporware by toddestan · · Score: 1

      Bulldozer is AMD's Pentium IV, hopefully it's not quite as terrible as Intel's where they had to scrap it and start over from P3 designs but right now it looks awfully similar.

      I'd actually say it's worse than the P4. The P4 may not have been much of a performer, but at least it was a solid, reliable chip. AMD's Bulldozer is full of bugs.

    112. Re:vaporware by Courageous · · Score: 1

      There are essentially zero such benchmarks for virtualization environments. The difficulty is the degree of completeness for a benchmark, and how much labor is involved in setting up and testing the requisite number of virtual machines on one of the types of heavy metal servers that are common in virtual workloads today (32 core servers with 256GB of RAM are what I use... a benchmark of that would be hell hard to set up and coordinate).

      You can fail to put your faith in VMmwark. Totally up to you. But my guess is that if you don't do that, you'll just be using faith.

      Being that even medium sized VMware environments run into the hundreds of thousands of dollars, I do hope you can find a better methodology than prayer. :-P

    113. Re:vaporware by hairyfeet · · Score: 1

      What's the max RAM on those? Because for me the selling point of the EEE was being able to hold 8Gb. With that much RAM I can have all my apps preloaded into RAM and as you've noted it sleeps DAMNED nicely. I also love how nice the DXVA support is, using the AMD system monitor I find I'm barely using 15% of the CPU when watching 720p vids thanks to GPU acceleration. Lucky for me though startup speed is not a factor so i just hibernate, if i need to use it right this second i can hit the Expressgate and have Chrome in under 6 seconds, which is nice.

      Either way i think we can both agree that the ATI buyout was a smart idea, frankly i can't stand the Atom netbooks as the GPU is so gimped and the in order CPU makes everything feel like its dragging. Comparing a Atom dual netbook a customer had with my E350 was like a bad joke, no matter what i ran it was fast and smooth while his was "hurry up and wait" constantly. BTW if you like Brazos you'll love Brazos II. While i'll probably set it out a the E350 does all i need to do the rumor is the next gen will not only add dual channel memory but will make the switch from VLIW to Vector for the GPU which means the GPU will behave more like a hyper floating point when not being used for graphics and with OpenCL it should be able to accelerate pretty much anything. Personally i hope they come out with a quad version, can you imagine a sub 20w quad with kick ass graphics AND fast enough to run just about anything? I can see why AMD is spending more on the Bobcat than they are the desktop, mobile does seem to be where the majority of the market is heading.

      --
      ACs don't waste your time replying, your posts are never seen by me.
    114. Re:vaporware by hairyfeet · · Score: 1

      I don't think its astroturfing as much as raging fanbois. there are several here that have more than 40 accounts so they can modbomb on "their" subjects, hell I even had one follow me around the web for half a year just so he could post 'Die you fat fucker die" after every single thing i posted anywhere, all because i dared to counter his total bullshit on Linux with actual citations showing the truth was the opposite. I knew guys that got royally fucked on both Intel by refusing to touch Athlon over Pentium netburst and again when they refused to touch ATI even though Nvidia was in the middle of bumpgate. Once someone is a raging fanboi you can give it up, you can place a hundred citations showing your facts are accurate and they WILL find a way to modbomb.

      As for actual power usage when my Thuban X6 is sitting at 70f idle and I'm not even using the power saving features of my board I'd say yeah, their actual power numbers are bullshit. But you know the reviews are bullshit when they don't even bother to check whether the benchmark is compiled with the Intel Cripple compiler. you can take ANY benchmark that has been compiled with the ICC and take a Via CPU (the only chip that allows you to change CPUID on the fly) and watch as magically that very same chip scores 40% higher by simply changing its CPUID. I had to point this out to one reviewer who had written "I don't understand why all these tests show the Intel Atom dual beating the E350 by a decent margin but my real world tests don't show those results at all, quite the opposite".

      While I don't have a killawatt to measure one thing I do know is that on every C2D and C2Q I've ever built the fans really kicked up and that Heat Sink would get seriously hot but the only AMD I ever saw that behavior with was the 125w Phenom I chips. The 95w and 65w Phenom I and II chips frankly barely get warm unless you slam them, even if you use the crappy cooler. Add a Hyper 212 or my fav hyper n520 and that chip will barely reach 120F after 4 hours of prime95. Personally i'd love to see a test with both chips running prime95 and killawatt because i think there be something fishy friend.

      --
      ACs don't waste your time replying, your posts are never seen by me.
    115. Re:vaporware by hairyfeet · · Score: 1

      AM3+ is a dead end anyway as AMD has announced they get ONE, count 'em ONE new chip and that's it. You pay 40%+ more for socket AM3+ and there is exactly TWO chips for the thing, the current Liano and the one more BD based chip they are gonna release. And have you seen the benches? AM3+ sucks serious ass dude! My $109 Thuban frankly stomps all the new AMDs except for the highest chip and its within $30 of Sandy, that's just nuts. oh and they are lying their asses off, the new supposed "quad" Bulldozers are nothing but DUAL CORES with hyperthreading, yet they are charging higher than an AM3 Deneb quad? yeah right AMD, you're full of crap! trust me, grab one of the 95w Thuban X6 chips while you can, they are transcoding monsters and if you want they OC like mad.

      Oh and for the anon cowards that say its impossible to get below room temp? its called heatpipes and dual 120mm fans on the heatsink. look up the hyper N520, she's a cooling monster and while quiet as quiet can be those 120mm move some serious air so dropping below room temp is not only possible, its quite easy. i'm not even using the phase dropping energy saver features of my new Asrock and I'm sitting at 70 degrees F and the room is 72f. If I were going passive cooling you would be correct but with 2 120mm fans pushing and pulling air over those heatpipes (The cooler is only $33 on Amazon BTW, great deal and works on ALL Intel and AMD boards) it drops the living hell out of the temps, especially with arctic silver.

      --
      ACs don't waste your time replying, your posts are never seen by me.
    116. Re:vaporware by Anthony+Mouse · · Score: 1

      No, sorry, the Athlon core had no significant similarity to any DEC Alpha design. You're conflating the fact that some people left DEC and joined AMD into the cores being the same. As with any other tech industry, people change jobs in the semiconductor industry all the time, but the designs don't walk out the door with them unless they commit a crime! The only time that actually happens legitimately is when the change of employer is due to an acquisition or merger, and that wasn't how ex-DEC people ended up at AMD.

      Only if you've never heard of a license agreement. The amount of cross-licensing that happens between microprocessor vendors is fairly prolific.

      The only Alpha technology in the original 1999 Athlon was the frontside bus. If I recall correctly, it was licensed to AMD for free because DEC (or whichever post-DEC entity owned Alpha at that point) still harbored dreams of riding Windows NT for Alpha into the PC space, and had troubles getting anybody to build off the shelf systems. Being FSB compatible with the #2 x86 processor vendor was a great way to get their foot in the door of the PC market.

      First of all, the bus is not any kind of small detail having no effect on processor architecture. It was a significant contributing factor to the Athlon being substantially faster than the Pentium III.

      But then you look at the rest of it. The multiprocessor configuration is very similar, in part because of the shared bus. The Athlon and then-contemporary Alphas both have 64KB+64KB L1 instruction and data cache, exclusive 2-way associative. In later iterations (AMD64), they adopted even more Alpha characteristics, like the on-die memory controller. You can hardly say that they have "no significant similarity."

      Who said you can only "optimize" an old design? You can do whatever you like with it, you don't have to stick to just minor tweaks to extract a few more MHz.

      Which is what they've already done to get it from 500MHz to up near 4GHz over the course of more than a decade. But at some point you've done all you can with small and medium changes and you have to make big changes, which is what they're doing now.

      I'll put it this way. Intel cleansheeted the Pentium IV, but that didn't work out so well. When they went back to basing new designs off the venerable Pentium Pro lineage, they started stomping AMD into the ground.

      Core 2 was catch up. It was only somewhat faster than K8 and is about the same speed as K10. The subsequent improvements have been in large part adopting the things they learned from the Pentium 4 into the Core architecture. Sandy Bridge looks more like a modified Pentium 4 with a shorter pipeline than it does a Pentium Pro.

      You seem to be making some nonsense argument about how you get from point A to point B, as if replacing pieces of an architecture one at a time gets you to a different place than replacing several things at once. The fact of the matter is that they needed to make some significant changes and they did. That is very hard to get right on the first try, so now they've got some bugs to work out -- why are you so sure that they can't?

      Pretty sure it isn't actually a true "clean slate". There's lots of signs of bits & pieces carried over from K8 to Bulldozer and modified to fit the new system design. Which is entirely sensible, because reinventing almost everything from scratch ala Pentium IV is too much work for not enough reward.

      Of course you don't have to reinvent the wheel. That isn't what I'm saying. What I'm saying is that they couldn't just keep making small changes and expect to stay ahead like they had been for the previous decade. They had to do a major update. There was no viable option that says "keep existing architecture with minimal changes."

      Have you ever heard this parable? If you ask a coach to observe two runners who each make the same time around the tra

    117. Re:vaporware by Anonymous Coward · · Score: 0

      AMD did not convert ATI GPUs to AMD process tech either. The discrete GPU parts continued to be made at TSMC, even to this day. The only GPUs which have 'moved' to the same process as AMD CPUs are those which are integrated into AMD's "APU" product lines (CPU+GPU on one die).

      And what is not massively significant about that?

      Er, the claim was that joining AMD improved ATI GPUs by moving them to better process tech. And I'm pointing out that this actually wasn't true, because to date the only AMD GPUs built on "better" process tech (i.e. at what is now known as GlobalFoundries, the former AMD fabs, on the SOI process which was AMD's hallmark) have been low end integrated GPUs. Furthermore, this happened only very recently. Are you dense or something? You cannot understand that these facts are not congruent with "ATI GPUs got super advanced process tech because AMD bought them"?

      Plus, even in it's current embattled state, AMD is a larger org with more engineers than ATI ever was.

      What of it? Do you think this made some kind of huge difference in how many engineers work on GPUs, compared to the old independent ATI? You're a fool if you think it did. ATI was not some kind of pissant two-bit operation incapable of hiring enough people to develop its own products before AMD bought them. The reason the combined company is a larger org is because... drumroll... it's larger, with a more diverse product line.

      And notice that AMD has gained a significant lead in process size reduction for its GPUs over nVidia, including discrete GPUs. nVidia now has to really blow fuses to even create the appearance of matching Radeon in GPU throughput.

      You're ignorant. AMD still, to this day, makes discrete GPUs at TSMC. So does NVidia. It's TSMC's process size reduction, not AMD's or NVidia's. In this generation, it looks like AMD shipped a TSMC 28nm GPU before Nvidia. Before the Radeon 7000 series debuted at the beginning of this year, both AMD and NVidia were on TSMC 40nm. All congratulations to AMD, but you don't get to claim that as something which happened just because AMD bought ATI. If ATI was still independent, they'd have been every bit as capable of taping out and shipping a TSMC 28nm chip before NVidia.

      The problem is that even after you've had it explained to you, you still don't get it. AMD didn't buy ATI because AMD thought AMD could add value to ATI's products (chipsets, GPUs). ATI was doing just fine making those things on its own. AMD bought ATI because AMD knew that acquiring a GPU team was essential for developing future AMD products (processors with integrated GPUs).

      That said, both nVidia and AMD are making great GPU products, no question about it. But I'm strictly AMD until nVidia deigns to reveal its register specs to us little people.

      Yeah, and that's why you're strictly irrational about AMD products. I skimmed your posting history, you're that kind of single issue idiot. If company XYZ tosses a few bones to open source, you're their devoted fanboy, and XYZ's competition is Bad. You become completely incapable of comparing the two objectively, even in areas unrelated to open source.

    118. Re:vaporware by Daniel+Phillips · · Score: 1

      Whoa, you have issues my friend. Curious: does posting such tripe anonymously make you feel like a bellycrawling scumbag, or did you just get used to it?

      --
      Have you got your LWN subscription yet?
    119. Re:vaporware by Anonymous Coward · · Score: 0

      As with any other tech industry, people change jobs in the semiconductor industry all the time, but the designs don't walk out the door with them unless they commit a crime! The only time that actually happens legitimately is when the change of employer is due to an acquisition or merger, and that wasn't how ex-DEC people ended up at AMD.

      Only if you've never heard of a license agreement. The amount of cross-licensing that happens between microprocessor vendors is fairly prolific.

      Please. First, I told you AMD licensed the bus, so it's not like I'm unaware of the practice. Second, the fact that cross-licensing exists does not imply that it actually took place for what you're talking about!

      They were very different processor designs. Go look at some block diagrams of any Alpha and a block diagram of K7. K7 is pretty clearly an independent design, outside of the bus interface (and even that was probably designed from scratch by AMD employed engineers, since what they licensed was probably the bus protocol, not an implementation of it).

      But then you look at the rest of it. The multiprocessor configuration is very similar, in part because of the shared bus. The Athlon and then-contemporary Alphas both have 64KB+64KB L1 instruction and data cache, exclusive 2-way associative. In later iterations (AMD64), they adopted even more Alpha characteristics, like the on-die memory controller. You can hardly say that they have "no significant similarity."

      Dude. Caches being similar in size for contemporary processors is generally a function of the limits of process technology and/or die size. It doesn't mean anything about where the design originated. You're constructing a fictional narrative in your head about the origins of Athlon based on nothing more than some of the same people being involved and coincidental similarities. It does a great disservice to the people who actually created K7, including not only those who came over from DEC, but also those who had been at AMD for K5 and/or K6.

      And on-die memory controllers are an "Alpha characteristic" now? Facepalm. The vast majority of Alphas ever made had external memory controllers.

      Core 2 was catch up. It was only somewhat faster than K8

      Okay, so we know you're a delusional AMD fanboy now.

      The subsequent improvements have been in large part adopting the things they learned from the Pentium 4 into the Core architecture. Sandy Bridge looks more like a modified Pentium 4 with a shorter pipeline than it does a Pentium Pro.

      No, no it doesn't. It looks like a Pentium Pro descendant where the major changes from the previous generation are that the classic ROBs have finally been replaced with a physical register file (PRF), and a post-decode "L0" icache has been added to cache short sequences of decoded instructions. (Note that no matter what you may have read about it, this is a further refinement of a feature first introduced in Core 2, not a reincarnation of PIV trace cache. The PIV trace cache was quite different in both principle and implementation detail.)

      You seem to be making some nonsense argument about how you get from point A to point B, as if replacing pieces of an architecture one at a time gets you to a different place than replacing several things at once.

      No, I'm making the point that if you keep replacing pieces one or two at a time, you may still be able to get where you want to go. You don't always have to blow everything up. You only need to do that if there's something fundamentally wrong with the design philosophy, as there was with Pentium IV.

      But AMD wasn't starting with such a fatally flawed core design. It was one which had served them well. Had AMD chosen a different path, they could've been steadily improving K8 to keep pace with Intel.

      It was an either-or choice because AMD has ne

    120. Re:vaporware by Anthony+Mouse · · Score: 1

      I'm going to go ahead and concede that I don't design microprocessors for a living, and I'll give you the heritage of the various architectures because I'm primarily just reiterating what others have told me and I feel like at this point unless you have some kind of non-public information we're both just speculating about it anyway.

      So if you want to be the expert, let's see if we can learn anything here instead of just having a flame war.

      You're arguing that Bulldozer uses too many transistors for its performance, and never mind that most of those transistors are wasted on an unfortunate cache architecture because it would be too difficult to change that. At the same time, you seem to think nothing of Intel basically starting over entirely with the L2 between Katmai and Coppermine. So let's see if you can help me understand why that is.

      If you look at the cache latencies for Bulldozer, they're terrible. The L2 is 21 cycles compared to 11 for Core i5. The L3 is 65 cycles compared to 25 for Core i5. Combined with the 16KB L1D which means that a huge amount of data will (at best) be hitting the L2 and you've got something terrible going on.

      OK, so why are the latencies so high? Here's what I think, you correct me if I'm wrong. Cache latencies are higher when caches are larger, when they have higher associativity, and when they're exclusive rather than inclusive. There is a trade-off between latency and hit rate. The L2 on BD is 2MB instead of 256KB on SB, is 16-way associative instead of 8, and is exclusive instead of inclusive. Hence, the latency is nearly double, and that's very bad.

      And I can see why they did it that way. If you want to keep eight threads happy, you have to keep them out of main memory, and the way to do that is with large caches. They didn't want to have a tiny little L2 because there would be too much contention for the L3. They didn't want to have a big L2 which is inclusive with the L3 because it would be a huge waste of transistors to pay for 16MB of cache and only be able to put 8MB in it before you hit main memory. So they went for 8MB of L2 exclusive with the 8MB of L3 -- 16MB total, a good use of transistors, great.

      Except that it causes the L2 latency to be terrible, and with a tiny L1 that turns into a catastrophe. I don't know if they didn't notice this or what, but it happened.

      Now let's think about some alternatives here. Suppose they adopt the exact same L2 and L3 caches that SB uses: 256KB of 8-way, 11 cycle L2 inclusive with 8MB of 25 cycle L3. That would certainly be faster on anything with a working set that fits in the 256KB L2 regardless of the number of threads, and probably be faster on anything single-threaded because the L3 latency would then be almost as low a the L2 latency is now. And it would save them exactly those transistors that you're claiming disadvantages BD over SB cost-wise. (The disadvantage would be that they would then have a total of ~8MB of cache before they hit main memory, just like SB, and that could impact some highly threaded workloads with large working set sizes. But it seems like a small price.) Adopting this kind of cache architecture for a prospective A-series and selling it into the consumer market along with an integrated GPU sounds like a plan to me, so please explain why it would be unworkable.

      Another alternative would be the one I described already: You use the same number of transistors for the cache but you arrange them differently: 512KB as inclusive L2 on each module (or 256MB on each thread), then a 1.5MB exclusive L3 per module, and 8MB of exclusive L4. You get the L2 latency way down for the first 256KB (by making it inclusive and 4- or 8-way associative instead of 16), but you get to keep 14MB of the 16MB of cache as exclusive so that you don't hit main memory on server workloads and you keep anything with a working set smaller than 1.5MB on a single module instead of contending with other m

    121. Re:vaporware by Anonymous Coward · · Score: 0

      I'm going to go ahead and concede that I don't design microprocessors for a living,

      Neither do I, for the record, but I work in a closely related field (System-on-Chip design, using ARM cores), and I've always been interested personally and professionally in microprocessor technology. So take what I say with however many grains of salt you deem necessary.

      and I'll give you the heritage of the various architectures because I'm primarily just reiterating what others have told me and I feel like at this point unless you have some kind of non-public information we're both just speculating about it anyway.

      I don't have non-public information, but there are also a lot of incorrect memes which spread around in the public sphere, because it's a deeply technical topic and hardly anyone in the press has enough understanding to accurately report on it. If you don't have the right technical and/or social background (by social I mean basically having worked in some part of the chip industry for long enough to get a good idea of how all the various companies interact), you probably aren't going to be well equipped to separate the good information from the bad.

      You're arguing that Bulldozer uses too many transistors for its performance, and never mind that most of those transistors are wasted on an unfortunate cache architecture because it would be too difficult to change that. At the same time, you seem to think nothing of Intel basically starting over entirely with the L2 between Katmai and Coppermine.

      No, you're still not quite getting what I'm saying. I sometimes have a hard time explaining myself, so let me try again, unpacking a little more.

      You seem to think the Katmai->Coppermine cache change proves there should be large cache performance gains (latency and/or clock speed) to be had by reducing cache size. While it's true that larger caches are slower, you're greatly overestimating the impact of a single halving. Coppermine is misleading you, because its gain in cache performance was not about the size of the cache (more on that below).

      It's worth glossing over some of the fundamentals. The time it takes to asynchronously read one location in a SRAM memory array scales in proportion to the log (base 2) of N, where N is the number of addressable locations in the array. So already you can see that halving the size of a memory isn't going to double its speed. For example, if you go from a 1K entry memory to a 512 entry memory, log2(1024)=10 and log2(512)=9, so the time it takes to read from the smaller memory is only 10% faster. Note that because it's a log function, doubling size hurts performance a lot more at small N than it does at large N, meaning that for large N (i.e. L2 and L3 caches), performance becomes much more insensitive to size.

      I'd also like to explain more, about how this relates to clock cycles. In caches, the O(log2(N)) access time is typically much too long to happen inside of 1 CPU cycle. To work around that, you can pipeline the output multiplexer which is largely responsible for the access time. By choosing how many pipeline stages to use, you can get different tradeoffs between frequency and the number of cycles of latency, but no matter what, the clock period (1/frequency) multiplied by the number of clocks of latency has to be at least as much as the O(log2(N)) time it would take to access the memory if it was built as an asynchronous memory. For this reason, when comparing caches, it's often informative to compute how many nanoseconds of latency each has instead of using cycle counts, since the processors in question might have very different target clock frequencies.

      All of the above completely ignores the workings of anything other than the data storage part of a cache, but IIRC most other cache structures also have O(log2(N)) performance scaling.

      So what happened with Coppermine? Something else entirely. That was a transition from

    122. Re:vaporware by Anthony+Mouse · · Score: 1

      Does that help recalibrate your understanding of where I'm coming from? Even a 12x reduction in size, 3x reduction in associativity, a modest "process" advantage, and fewer access ports couldn't give a 2x speedup. You're never going to get even close to halving L2 latency by halving its size. That needs a far bigger hammer.

      I understand that. My thinking was that if Intel could manage 11 cycles at 32nm then AMD could too, if no other way than by adopting a cache design similar to the one Intel uses.

      Bringing up the idea that Intel's process technology is superior to Global Foundries' rather explains that though, and it gives me another thought: It may be that the reason for the high latencies is intentional in that AMD designed Bulldozer to hit significantly higher clock speeds than it has and high leakage on GF's process is thermally limiting it (and is likewise a significant contributor to the high power consumption).

      If that's the case then I think it bodes well for the design, because it means that if GF (or, failing that, TSMC et al) can improve their process technology to reduce leakage, the chips may clock much higher and both performance and performance-per-watt will improve. It also puts an interesting spin on the subject of the article: "Resonant clock mesh" (whatever that is) ostensibly reduces power consumption and heat. If they're successful in implementing it, the design could end up looking much more attractive.

      Actually, there is a major benefit to a processor which is 50% faster, even if it's idle much of the time. Power efficiency. There is increasing focus on total energy consumption, to improve the battery life of portable computers. CPU usage profiles for light loads look like "wake up, do something, go back to sleep". If you have two CPUs which both use the same power, but one is 2x as fast, the faster CPU's time spent in wake is literally half as much as the slower CPU. Now throw power gating into the mix -- when idle, modern CPUs can literally be shut down almost completely, consuming no power. This means the 2x faster CPU is literally 2x as energy efficient, even for light loads, because they both consume the same amount of power while active, but the faster CPU is active for much less time.

      That is true, though I'm not sure how large a loss that would be in practice: It has minimal effect on the desktop market and portable computers of the "slower CPU / faster GPU / low watts" variety are the wheelhouse of the Bobcat rather than Bulldozer, which takes care of anyone whose primary concern is battery life. Then AMD can still get anyone who prioritizes purchase price over battery life, as well as anyone who prioritizes battery life and better-than-Bobcat performance over weight (since you can use the money you save buying AMD over Intel to install a bigger battery that more than makes up for the power consumption). All it really excludes them from is those who prioritize performance, battery life and weight over price, which has a pretty high overlap with the market that demands the highest single-thread performance at any price (which they had already lost).

    123. Re:vaporware by hairyfeet · · Score: 1

      Nice choice in board, BAD choice in cooler. Frankly many air coolers can't beat the sealed WC units. if you find your unit getting hot look into a Hyper N520, they are less than $40 on Amazon and are so quiet you wouldn't believe it has 2 120mm fans but man does that baby drop the heat! My 1035T Thuban idles at just 70F and after 5 hours of all 6 cores full bore video transcoding I BARELY hit 120f, and less than 40 seconds after dropping the load i was back below 80f, just an incredible cooler. Works on just about every socket out there as well, so you can simply move it with you on your next build.

      You are gonna love the Asrock OCing tools, they are the easiest I've ever seen and i've been building boxes since before there even was a Windows. If its anything like mine not only will it have an excellent GUI for OCing but it also allows you to automatically drop phases when idle to sip power AND lets you save multiple BIOS setting and switch on the fly at boot, its a REALLY sweet board.

      a word of warning though: If you have trouble booting when you first put it together? Check your RAM timings. Sometimes Asrock boards get a little too aggressive on their RAM timings so you have to back off a bit from the defaults. But once you do that she'll purr like a big old kitty and the Asrock boards are real OCers boards, they just love to crank. With 6 cores I don't really need to OC but just for the heck of it when i first got it I decided to see what she'd do, i had gotten up to nearly 3.4GHz (from 2.6GHz default) with 3.7GHz on Turbo before i decided I didn't want to push my luck. if your chip will hit 5GHz the Asrock will get it there, enjoy!

      --
      ACs don't waste your time replying, your posts are never seen by me.
  2. That's nice by iamwhoiamtoday · · Score: 1

    But how will it scale? How many FLOPS can it pull? GHz doesn't mean squat.....

    1. Re:That's nice by networkBoy · · Score: 3, Insightful

      for a single executing thread of a specific bit width GHz means everything.
      The trick is can they scale it to multiple cores/threads, while lowering their power to match Intel's performance/Watt at the high end of the compute arena. If they can do that they will once again pull in DC customers.
      -nB

      --
      whois gawk date unzip strip find touch finger mount join nice man top fsck grep eject more yes exit umount sleep dump
    2. Re:That's nice by mhajicek · · Score: 1

      Single core performance is all that matters when processing a toolpath for CNC machining. I don't care about power consumption, just higher clock speed and fast memory access (large cache).

    3. Re:That's nice by DJRumpy · · Score: 1

      Agree. The multi-core trend was more to address inefficiency in CPU design, as well as technological limitations in clock speed. In short, GHz is important, as long as it's efficient.

    4. Re:That's nice by Anonymous Coward · · Score: 0

      Is branch prediction/cache misses no longer an issue?
      Is cycles per instruction no longer an issue?
      If yes to both, is it because high-end processors are pretty much on par with those issues?

      It's been awhile since I knew anything about anything, but I assumed that there's still a market for processors whose throughput depends more on other things than on clock speed.

    5. Re:That's nice by Kaenneth · · Score: 1

      And for Dwarf Fortress.

    6. Re:That's nice by networkBoy · · Score: 1

      sure they are still issues, but in addressing the GGP of GHz not meaning squat, GHz still matters. If cost/power/threadcount/and per clock average performance are all the same which would you rather have? 3.67GHx or 4Ghz (~10% higher)?
      -nB

      --
      whois gawk date unzip strip find touch finger mount join nice man top fsck grep eject more yes exit umount sleep dump
    7. Re:That's nice by aepurniet · · Score: 1

      for a single executing thread of a specific bit width GHz means everything.

      if the pipeline length of a processor is 2x as long, then it will be equivalent in performance to a processor 1/2 the speed. combine this with the fact that different instructions have different pipeline lengths, and you have a recipe for the GHZ to tell you squat. would you rather have a 2008 netburst pentium 3 or a brand new intel chip? both were available clocked at 3ghz.

    8. Re:That's nice by Daniel+Phillips · · Score: 4, Insightful

      Single core performance is all that matters when processing a toolpath for CNC machining.

      Rubbish. There is no way your CNC machining app will even get close to the minimum latency that a single AMD core is capable of. What you are really saying is that your vendor is slow to get a clue about parallel programming.

       

      --
      Have you got your LWN subscription yet?
    9. Re:That's nice by Anonymous Coward · · Score: 0

      for a single executing thread of a specific bit width GHz means everything.
      The trick is can they scale it to multiple cores/threads, while lowering their power to match Intel's performance/Watt at the high end of the compute arena. If they can do that they will once again pull in DC customers.
      -nB

      Without knowing CPI, the clock means nothing. This isn't nitpicking, CPI varies GREATLY.
      It's like talking about an engine's RPM without knowing torque.

      4GHz is like your car doing 10k RPM
      Interesting... but useless by itself.

    10. Re:That's nice by Daniel+Phillips · · Score: 1

      Agree. The multi-core trend was more to address inefficiency in CPU design, as well as technological limitations in clock speed.

      More precisely, it is about seeking the best tradeoff in the Latency*Heat*Cost equation.

      In short, GHz is important, as long as it's efficient.

      Interesting proposition. I think its a little more complex than that. The main use of GHz today is to paper over the inefficiencies of current-generation single threaded software.

      --
      Have you got your LWN subscription yet?
    11. Re:That's nice by viperidaenz · · Score: 1

      Who told you that? It may be true if every n/2 instruction is a branch (where n = pipeline length).

      You'll probably find netburst was p4, not p3.

    12. Re:That's nice by Jeremi · · Score: 2

      What you are really saying is that your vendor is slow to get a clue about parallel programming.

      Maybe there are CNC algorithms that aren't easily parallelizable. Or (more likely) they can be paralellized, but the CNC development teams haven't got around to doing that yet. It doesn't really matter which as far as the consumer is concerned -- in either case, they will want a chip that maximizes single-threaded performance. Finger-pointing doesn't help them one bit, but fast CPUs might.

      --


      I don't care if it's 90,000 hectares. That lake was not my doing.
    13. Re:That's nice by Shark · · Score: 3, Informative

      He's not talking about running the g-code, he's talking about generating it from a model. Most CAM software are very CPU intensive for toolpath generation.

      --
      Mind the frickin' laser...
    14. Re:That's nice by Anonymous Coward · · Score: 0

      my processor design class at an elite east coast ivy league school. clock speed just determines the length of a pipeline stage. if a processor has twice the pipeline length, it also needs twice the speed. this is why instructions per clock are also an important metric. unfortunately for AMD Intel has been winning both battles.

    15. Re:That's nice by Daniel+Phillips · · Score: 1

      Maybe there are CNC algorithms that aren't easily parallelizable.

      I doubt that, being somewhat familiar with the problem space.

      It doesn't really matter which as far as the consumer is concerned -- in either case, they will want a chip that maximizes single-threaded performance.

      Speak for yourself. I prefer to keep the money in my pocket, and spend it on more frequent full-box upgrades. This keeps me ahead of the curve on average. Example: in a past gig where money was no object I started life with a Core2 class desktop which was state of the art at the time, but no, even when money is no object the beancounters will reject the idea of a new box every six months. In short order my onetime shiny Intel box was being smoked by your bog standard mail order AMD box.

      Or (more likely) they can be paralellized, but the CNC development teams haven't got around to doing that yet.

      Yes: slow to get a clue.

      --
      Have you got your LWN subscription yet?
    16. Re:That's nice by Daniel+Phillips · · Score: 1

      Most CAM software are very CPU intensive for toolpath generation.

      All the more reason to parallelize it, cutting latency drastically in the process.

      --
      Have you got your LWN subscription yet?
    17. Re:That's nice by Anonymous Coward · · Score: 0

      Wow, you should get your money back. Increasing pipeline depth increases latency linearly, sure, but has no effect on throughput in the absence of branch mispredictions.

      Bringing up IPC shows that you really don't understand the point of a pipeline.

    18. Re:That's nice by Anonymous Coward · · Score: 0

      When are those ever the same, they differ generation to generation of the same manufacturer, not every processor is even super scalar for christs sake.

    19. Re:That's nice by Anonymous Coward · · Score: 0

      It can't be made parallel. Each pass depends on the previous one. It's dumb that people think everything can be made parallel. Nine women can't have a baby in a month. That's been known for quite a while.

    20. Re:That's nice by Daniel+Phillips · · Score: 1

      It can't be made parallel. Each pass depends on the previous one. It's dumb that people think everything can be made parallel. Nine women can't have a baby in a month. That's been known for quite a while.

      Perhaps you overstate the difficulty. There are many methods of making things parallel. While serializing constraints may in fact exist, it is rare that a problem cannot be factored in such a way as to make most of an algorithm parallelizable in spite of them. Or to put it simply, Amdahl's law was wrong, proved by example many times over. Or to put it equally simply: if you don't try, you can be sure of not succeeding.

      --
      Have you got your LWN subscription yet?
    21. Re:That's nice by Anonymous Coward · · Score: 0

      MSC Patran is an aeronautical industry standard $30,000 piece of software that is occasionally CPU limited for 10 minutes at a time when I use it on an overclocked 2600k. It uses 1 core.

      You're welcome to try to change this shitty circumstance in some manner, but in the mean time, single core performance is very important (for many apps).

    22. Re:That's nice by Score+Whore · · Score: 1

      Or to put it simply, Amdahl's law was wrong, proved by example many times over.

      Perhaps you are merely trying to be outré but what exactly is wrong about:

      "The speedup of a program using multiple processors in parallel computing is limited by the time needed for the sequential fraction of the program."

    23. Re:That's nice by scsirob · · Score: 1

      So you have a team of engineers working on a design for weeks or even months. It's finally done, and now you are arguing that the CPU to translate to G-code takes an hour extra? Rrright...

      --
      To Terminate, or not to Terminate, that's the question - SCSIROB
    24. Re:That's nice by Daniel+Phillips · · Score: 1

      Or to put it simply, Amdahl's law was wrong, proved by example many times over.

      Perhaps you are merely trying to be outré but what exactly is wrong about:

      "The speedup of a program using multiple processors in parallel computing is limited by the time needed for the sequential fraction of the program."

      The flaw is the presumption that the "sequential fraction" of a program will always dominate as processor count increases. Here is a pretty good discussion. Basically, an embarrassingly large number of algorithms that Amdahl would have regarded as having limited scalability, turned out to have alternative formulations that scale out the yinyang. Ambahl would have been embarrassed by all the embarrassingly parallel parallelism going on, contrary to his dire and arguably self serving prediction.

      --
      Have you got your LWN subscription yet?
    25. Re:That's nice by Daniel+Phillips · · Score: 1

      You're welcome to try to change this shitty circumstance in some manner, but in the mean time, single core performance is very important (for many apps).

      Sure, just none that I run. What do I care about mainly? Yes, make -j. For that you need cores. Lots of them. Lots of integer cores.

      --
      Have you got your LWN subscription yet?
    26. Re:That's nice by Anonymous Coward · · Score: 0

      Well, the open source pycam has no problem working in parallel for 3d gcode generation. It even works with multiple computers over a network.

      (anon because I moderated in this thread)

    27. Re:That's nice by Anonymous Coward · · Score: 0

      More specifically single threaded performance is where bulldozer is getting undisputedly creamed by just about every Intel chip out there. GHz does not always directly translate into better single threaded performance, but at least they're working on their biggest glaring weakness.

    28. Re:That's nice by Score+Whore · · Score: 1

      If I wipe away all the flim-flam and falsely claimed presumptions, what you -- and the linked blog post -- are saying is that Amdahl's law is 100% correct?

      Rather than continuing to debate and having to point out the logical flaws in your reasoning and the wackiness of your approach of telepathy, channeling and otherwise trying to read Dr. Amdahl's mind in order to state what he regards and is embarrassed by, I will simply present you with a bet:

      I will give you 100:1 odds that you cannot present me with an algorithm that violates Amdahl's law. Your minimum bet must be $10,000 USD. Since you are sure that he is wrong I'm going to assume you have such an algorithm in hand and set the time frame for the bet at 1 week just to get it wrapped up. An algorithm is a specific solution, not a specific problem. Therefore for you to be considered the winner you must present a single algorithm that does not obey the law, not two contrasting algorithms in which one is more parallelizable than the other. What say you? It should be an easy $1,000,000, right?

      If you don't take my bet than I will conclude that you concede the argument and you acknowledge that you are utterly and completely wrong.

    29. Re:That's nice by marcosdumay · · Score: 1

      Eh... The serial path of the program do always dominate as processor count goes to infinite. That's math. Next you'll come arguing that 1 + 1 = 3.

      If you haven't seen this effect on your daily life, it is because you don't have enough cores.

    30. Re:That's nice by mhajicek · · Score: 1

      I'm using Mastercam. While it will use multiple cores to simultaneously generate multiple toolpath operations, each operation is limited to one or two cores. If generating only one operation it will use little over 25% capacity on my four core system. While I can't honestly weigh in on the argument of whether or not parallelisation of the algorithms is theoretically possible, I would be that it's certainly not easy given that each cutter position is dependent on the previous cutter position. The practical question is whether or not the developer can paralellise the algorithms without having to charge the users an exorbitant fee to cover the development. Since the answer to that is "no", I focus on single core performance when building a system with which to run the software.

    31. Re:That's nice by mhajicek · · Score: 1

      I'm not quite sure what your point is. Are you familiar at all with CNC programming? You don't just take a model and "translate" it to gcode. Machining centers are not like 3D printers. Fixturing and machining a complicated part in a time efficient manner while holding tolerance and not destroying your cutters is an art that takes decades to master. I would say it's NP-Hard. On top of that, some of the toolpath algorithms can take half an hour or even multiple hours to run once you've set all the parameters and selected all the geometry. Once it's done, you determine if you like the results or not, then either tweak the parameters and regenerate or move on to the next one. Any time savings in the processing turns into dollars pretty quickly; conversely any time wasted can be hugely expensive, especially when a $300,000 (or more) machine is sitting idle waiting for your code and the customer is on the phone asking when they will get their part.

    32. Re:That's nice by AlienIntelligence · · Score: 1

      MSC Patran is an aeronautical industry standard $30,000 piece of software that is occasionally CPU limited for 10 minutes at a time when I use it on an overclocked 2600k. It uses 1 core.

      You're welcome to try to change this shitty circumstance in some manner, but in the mean time, single core performance is very important (for many apps).

      What's it doin...? Seriously... I'm curious. It looks like awesome
      software, wish my "cheap" race engines warranted a $30,000
      software appy. I'd love to make my own H-beam con rods and
      custom knifed crankshafts.

      -AI

      --
      For me, it is far better to grasp the Universe as it really is than to persist in delusion
    33. Re:That's nice by AlienIntelligence · · Score: 1

      Interesting proposition. I think its a little more complex than that. The main use of GHz today is to paper over the inefficiencies of current-generation single threaded software.

      Lol... I wish u were modded higher so more would see that line.

      -AI

      --
      For me, it is far better to grasp the Universe as it really is than to persist in delusion
    34. Re:That's nice by Daniel+Phillips · · Score: 1

      OK, identify yourself, then we will discuss the question of a bet. Until then you are just some random schmuck who feels brave and clever behind a keyboard in his underwear.

      --
      Have you got your LWN subscription yet?
    35. Re:That's nice by Daniel+Phillips · · Score: 1

      Eh... The serial path of the program do always dominate as processor count goes to infinite. That's math. Next you'll come arguing that 1 + 1 = 3.

      If you haven't seen this effect on your daily life, it is because you don't have enough cores.

      Oh wow, you are basically saying your brain doesn't work. Do you know how many parallel processors you have between your ears? If Amdahl's law was actually good computer science, as opposed to merely good math, then your neurons would be spending 100% of their time firing and 0% thinking.

      As a useful tool, Amdahl's law occupies roughly the same position in computer science as Li's Copula does in high finance: both mathematically sound and stupidly misguided.

      --
      Have you got your LWN subscription yet?
    36. Re:That's nice by Score+Whore · · Score: 1

      A) I'm clever pretty much everywhere.
      B) I feel brave regardless of whether I'm behind a keyboard, and whether I'm wearing underwear or not.
      C) Nice to see you've gotten over your straw man arguments and have moved on to ad hominem.

      So we're agreed then that you're wrong. Let me give you a suggestion for the future. If you are going to dispute someone's statements then you need to actually dispute their statements, not something you've made up, something easily rebutted, and claimed is their statements. What you and the gentleman who wrote the blog post you linked to have done is what is called lying.

      Amdahl's Law says nothing about the relative proportions of parallelizable:sequential. It says nothing about alternative formulations. It says nothing about embarrassing parallelism. It is merely a true observation as to the limits of parallelism in any particular algorithm. Even when the algorithm is 100% parallel it is still true.

      So get over yourself and stop trying to be more important than you really are with your juvenile attempts at being scandalous.

    37. Re:That's nice by toddestan · · Score: 1

      I've been wondering is the case of single-threaded apps on a mulit-core, Hyperthreading chip if it is better to turn Hyperthreading off. Theory being that giving the thread one whole core may be better than giving it one of two threads on a core where it may have to share the load with something else. On your 2600k you would still have three other cores/threads to handle other processes. I guess I could try it at work, where the single-threaded application I have to use runs pretty well on the Hyperthreaded i5-680 (at 3.6 Ghz and up to 3.86 Ghz turbo boost it's one of the faster clocked chips out there, even though it's "only" a dual core).

    38. Re:That's nice by Daniel+Phillips · · Score: 1

      I'm actually going to call you a cynical self important bellycrawler and this is not rhetoric, it's a fact you went out of your way to demonstrate to the world. Please slither back to somebody who can stand you. Should you deign to provide your real name then I still will not respect you or your unilateral claim to cleverness, but we can take it from there.

      --
      Have you got your LWN subscription yet?
  3. Misread by Anonymous Coward · · Score: 1

    Misread the title as "AMD's Piledriver To Hit 4GHz+ With Resonant Cloak Mesh." Must say, thats a lot cooler than the reality.

  4. but really by arbiter1 · · Score: 0

    Less it can provide competition for Intel's cpu's at same price level and not use a ton more power like they have can't say its any point to care.

    1. Re:but really by Anonymous Coward · · Score: 2, Insightful

      Man, I had to read that 4 times and I'm still not quite exactly sure what you're saying.

      Let me give it a stab.

      Unless it can provide competition for Intel's CPUs at the same price level, and not use a ton more power to do it (as they have been doing recently), I don't think there is any point in caring.

      Communication isn't just about belching words, but actually putting them down so people can understand them.

    2. Re:but really by arbiter1 · · Score: 3, Interesting

      The bulldozer and i7-2600k were about same performance wise but that is 8 core cpu vs 4 cores + HT. Powerusage of both machines at wall was like 250watts under load. When you overclocked both the bulldozer to 4.8ghz and i7 to 5ghz, i7 used 80 more watts, the bulldozer doubled its draw to over 500 watts, i think it was 550 watts.

    3. Re:but really by MartinSchou · · Score: 1

      So you're saying that the bulldozer CPU upped its power draw to no less than 300 watts?

      What the hell kind of cooling were you using in that machine in order for it not to power throttle, shut down or wreck itself?

    4. Re:but really by arbiter1 · · Score: 1

      The Draw was @ the wall for the computer so it was the whole computer, hard drive, video card, board.

  5. details? by rudy_wayne · · Score: 5, Insightful

    Unfortunately, aside from a fuzzy whitepaper, actual technical details are all behind IEEE and other paywalls with useless abstracts.

    So why post an article that contains no meaningful information?

    Oh wait . . . never mind. I forgot where I was.

    1. Re:details? by Namlak · · Score: 1

      Oh wait . . . never mind. I forgot where I was.

      You're here, too, you know.

  6. ghz after amd says clock speed irrelevant by Anonymous Coward · · Score: 0

    Whatever makes a better processor is a good thing, but I find it ironic AMD promoting higher clock speeds after renaming their processors due to the clock speed wars.

    1. Re:ghz after amd says clock speed irrelevant by Daniel+Phillips · · Score: 1

      Whatever makes a better processor is a good thing, but I find it ironic AMD promoting higher clock speeds after renaming their processors due to the clock speed wars.

      It is not ironic, rather it is because returns from superscalar design are diminishing while feature size keeps shrinking and other incremental technology improvements keep delivering higher practical clock rates.

      --
      Have you got your LWN subscription yet?
  7. Not really by Sycraft-fu · · Score: 4, Insightful

    Intel is already running at 4GHz+. Ok not officially, but it is almost impossible to find a Sandy Bridge K series that won't easily overclock to 4Ghz or more. I bumped my 2600k to 4GHz. No voltage increase, no messing around, just turned the multiplier up. Zero stability issues, doesn't even draw a ton more power. Basically they are just being conservative for thermal reasons.

    The 22nm Ivy Bridge is soon to launch as well. Never mind any potential better OCing, it is faster per clock than SB. Well SB is a good bit faster than Bulldozer (who's architecture Piledriver uses) per clock, sometimes more than a bit (depends on what you are doing).

    So no, they'd need way more speed to give Intel any kind of run for their money, unfortunately. What they really need is a better design, something that does better per clock, but of course new designs take a long time and BD itself was quite delayed.

    Remember the one and only time AMD did eclipse Intel was during Intel's P4 phase. Intel had decided to go for low work per clock, high clock speed. Well speeds didn't scale as they'd hoped and the P4 was not as powerful for it. AMD chips were tops. However the Core architecture turned all that around. It was very efficient per clock, and each generation just gets better. Meanwhile AMD stagnated on new architectures, and then released Bulldozer which is not that great.

    Also they have to fight the losing fab battle. They spun off their fabs and as such aren't investing tons of R&D in it. Well Intel is, and thus are nearly a node ahead of everyone else. Other companies are just in the last few months getting their 32nm node and 28nm half-node production lines rolling out products to retail channels. Intel has their 22nm node process complete and is fabbing chips for retail release in a couple months. So they've got that over AMD, until other fabs catch up, by which time Intel will probably have their 14nm half-node process online in Chandler (the plant construction is in full swing).

    Sadly, things are just not good in the x86 competition arena. AMD competes only in a few markets, and Intel seems to edge in more and more. Servers with lots of cores for reasonable prices seems to be the last place they really have an edge, and that is a small market.

    I don't want to see a one player game, but AMD has to step it up and this unfortunately is probably not it. If they make it work, expect Intel to just release faster Core i chips with higher TDP specs. The massive OCing success shows they could do so with no problem.

    1. Re:Not really by networkBoy · · Score: 4, Insightful

      I'm a diehard Intel Fanboi. My last AMD was an 80286, I owned an AMD80386DX40, but never used it (acquired it at a swap meet after the P60's had just launched).
      Prescott had a use case where it outperformed AMD, but it was very narrow, if your load was highly predictive and did not cause cache misses or branch prediction failures, it owned the AMD. Sadly every workload except straight up numerical number crunching was not so good. I used my 3.6GHz P4 for transscoding video. It was the first machine that I owned where I could encode faster than real time (i.e. movie is 60 min, I could encode in 50).

      I really hope this pans out for AMD and brings them a little up into Intel's game. While as you said there has only been one time where AMD flat out bested Intel, there have been several cases where AMD has nailed a particular segment:
      * Low cost many cores (data compute clusters).
      * Low cost reasonable performance for most end user loads.
      * Downright cheap CPU for entry machines.
      Every time they've done something they have forced Intel to step up to that segment and improve.
      In this case I hope to see not the high spec CPU improvement, but rather the mid-range CPU segment get a very low power option. Somewhere in the i5 equivalent range, but giving desktop performance while sipping mobile levels of power.
      It would make building a poor man's compute cluster more feasible from a power and cooling standpoint.
      -nB

      --
      whois gawk date unzip strip find touch finger mount join nice man top fsck grep eject more yes exit umount sleep dump
    2. Re:Not really by Intropy · · Score: 3, Insightful

      It's not true that the AMD lead was that short. The Athlon came out and was immediately on par with or better than Intel's Pentium IIIs. By the time it was thunderbird vs coppermine/tualatin the lead was pretty sizable. That lasted throughout the Athlon64/Pentium 4 period and into the Core's run until the Core 2 duos arrived. The gap was close for a while with Inte's multi-core processors generally superior, but as little as about a year and a half ago, AMD had the better offering in the X3 than Intel's Core i3. Competition is tight, which has been good for the rest of us.

    3. Re:Not really by PitaBred · · Score: 1

      You can probably go higher. I've got a 2500K that's running 4.8GHz on stock voltages. Basically all K series chips can reliably hit 4.5GHz on stock voltages with adequate cooling.

    4. Re:Not really by lightknight · · Score: 1

      And, as always, so can many of AMD's latest offerings (exceed 4Ghz).

      --
      I am John Hurt.
    5. Re:Not really by Nemyst · · Score: 1

      It's even more ridiculous than that. My motherboard automatically overclocked my 2500K to 4.3GHz. From what I can tell, that 1GHz increase over the stock value isn't even pushing it (temperatures are still ridiculously low, with a 7-Zip benchmark hitting 55C). Granted, aftermarket coolers probably help, but I believe a 0.5-0.75GHz bump on a stock cooler is entirely reasonable.

      I have a feeling that Intel might actually be downplaying their default clocks; even under the most terrible conditions, I can't see a 2500K not hitting at least 3.5-3.6GHz. In many ways Intel needs AMD to exist, so perhaps they're limiting the stock clock (that the majority of people will end up using) to give them a running chance.

      It's unfortunate, but considering the relative sizes of the two rivals, Intel could easily crush AMD if they so desired. They have the R&D advantage, the fab advantage and the pressure advantage.

    6. Re:Not really by Anonymous Coward · · Score: 0

      Intel already shipped 4GHz+ processors. Xeon X5698 dual-core 4.4 GHz, for high-frequency trading systems.

    7. Re:Not really by dbIII · · Score: 2

      Yes, but can a Xeon do it and how many cores can you have in a box at a sane price?
      Currently there are AMD systems with 64 cores going for under US$10k. For some CPU bound tasks such things are wonderful. Any speed increase makes it even better.
      Intel are catching up with 10 core CPUs that are faster than the currently available Opterons but for tasks with a LOT of threads the AMD CPUs still outperform for the same number of sockets. It may look like a "small market" to you but there's still a huge number of nodes out there with a lot of CPUs.

    8. Re:Not really by Anthony+Mouse · · Score: 1

      Hasn't AMD already shipped 4GHz+ processors?

    9. Re:Not really by Anonymous Coward · · Score: 0

      And my buddy just got a Bulldozer clocked at 3.6Ghz up to 4.8Ghz on air.

      Overclocking isn't Intel-exclusive.

    10. Re:Not really by Cajun+Hell · · Score: 4, Interesting

      It's even more ridiculous than that. My motherboard automatically overclocked my 2500K to 4.3GHz.

      It's even more ridiculous than that. Tom's boys say that by overclocking it a little, you might even make it more efficient.

      --
      "Believe me!" -- Donald Trump
    11. Re:Not really by Anonymous Coward · · Score: 0

      Where's the 8170? I want...

    12. Re:Not really by Ungrounded+Lightning · · Score: 1

      Also they have to fight the losing fab battle. ... Other companies are just in the last few months getting their 32nm node and 28nm half-node production lines rolling out products to retail channels. Intel has their 22nm node process complete and is fabbing chips for retail release in a couple months.

      However this technology lets AMD get rid of most of the clock drivers and most of their power consumption and waste heat. That means the rest of the logic can be pulled closer together in a given technology, speeding it up. It also means you can put more total logic on a chip before the heat limits you.

      It wouldn't surprise me if this matches or beats being one generation behind on fabs.

      Also: There's no substitute for low power drain in portable applications. Further, smaller feature size means higher leakage. This leaves Intel with TWO dings for the portable market.

      --
      Bantam Dominique roosters crow a four-note song. Once you've heard it as "Happy BIRTHday" you can't NOT hear it that way
    13. Re:Not really by Anonymous Coward · · Score: 0

      But you are wrong amd chips can run 4ghz+ today to. But neither intel or amd can run 4ghz +today while consuming less than 125w. Neither intel or amd wants to realese a 200w+ cpu in todays market
      . And That is what this tech is about.

    14. Re:Not really by Anonymous Coward · · Score: 0

      Similarly, Bulldozer has such a load – if your task is based around integer processing, is very parallel, and just streams data in without doing anything much interesting to cause the instruction fetch/decode bottle neck to kick in, bulldozer will kick sandy bridge's butt. Unfortunately, everywhere else it sucks.

      Your assertion about low cost reasonable performance and downright cheap for entry level machines though turn out to be false. Most people forget that the Pentium G630 and G850 exist when they make this assertion.

    15. Re:Not really by kyrio · · Score: 1

      That's cool. My Phenom II X4 955BE is running at 4GHz with basic cooling.

    16. Re:Not really by Lonewolf666 · · Score: 1

      In this case I hope to see not the high spec CPU improvement, but rather the mid-range CPU segment get a very low power option. Somewhere in the i5 equivalent range, but giving desktop performance while sipping mobile levels of power.

      I agree. A CPU with mid-range performance and low power consumption is exactly what I want in my PCs. A few months ago I upgraded an old machine, using a Phenom II 910e (not quite as fast as an i5, but AMD supports ECC RAM in most of its CPUs). But the ECC feature was the only one that kept me on AMD this time. Intel does support it in the Xeon line for servers, but that was a little too expensive for my taste.

      If AMD can make their CPUs more attractive with the resonant mesh, it can only be good for end users. Even if they don't want an AMD, competition might force Intel to lower its prices.

      --
      C - the footgun of programming languages
    17. Re:Not really by dave420 · · Score: 1

      No-one said it was.

    18. Re:Not really by TheRaven64 · · Score: 2

      If you're factoring cost in, AMD's lead dates back to the K6-2. Clock for clock they were slower, but I could get a 400MHz K6-2 and motherboard for less than a 266MHz Pentium 2 and motherboard back then and the K6-2 was a lot faster - especially since it ran the memory 50% faster than the P2.

      --
      I am TheRaven on Soylent News
    19. Re:Not really by Rockoon · · Score: 1

      Granted, aftermarket coolers probably help, but I believe a 0.5-0.75GHz bump on a stock cooler is entirely reasonable.

      Both Intel and AMD have had offerings able to run at 4ghz and beyond, with simple air cooling, for many years.... but you cannot cool a chip running at these speeds using the build-up of hot air found in a cramped OEM case that has only a single 80mm exhaust fan.

      More important than the CPU cooler these days is the case ventilation. This is something OEM's get "wrong" (*) which is why Intel (and AMD) cannot market inexpensive CPU's that are advertised to run even near 4ghz without significantly altering either their warranty or their price. If you build your own system, or are upgrading an existing OEM system, you can add on a pair of 120mm fans (one in front, one in back, intake and exhaust) that is sure to allow even a stock 45nm 2.8ghz AMD Thuban (**) to run at 4ghz.

      (*) Its wrong for Intel and AMD, but not for the OEM's.
      (**) I have never heard anyone that builds their own system say that a Thuban or i7 wasn't able to hit 4ghz on air cooling.

      --
      "His name was James Damore."
    20. Re:Not really by drinkypoo · · Score: 1

      Don't forget mobile CPU, the smallest, most mobile Athlon 64 chips (like the L110) provide substantially superior performance to intel's offerings, dollar for dollar, and are coupled with a superior GPU to boot. While I am still beyond pissed at AMD for abandoning R690 and L110 with poor driver support, you have to admit it beat the living shit out of the single-core atoms it ran against, just as their dual-core mobile Athlon 64 does today.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    21. Re:Not really by drinkypoo · · Score: 1

      Or you could, like I did, get a 233 MHz K6-2 and overclock to 266 for less than half the price of actually building a 266 MHz P2 system. No lie. Granted, there were driver issues, but at the time it took my entire computing budget and got me to the point where I could play modern games with my friends.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    22. Re:Not really by TheRaven64 · · Score: 1

      The other place where AMD won was upgrades. I had a 133MHz Cyrix Pentium-compatible CPU (which did well clock-for-clock compared to Intel, but they insisted on branding it as a P166 and it was definitely slower than a 166MHz Pentium). I could drop a K6-2 into it for a cheap upgrade. If I wanted a Pentium 2, I needed a new motherboard. A lot of the later Pentium-class systems came with motherboards that supported a 100 or even 133MHz FSB, so you could go right up to a 450MHz K6-3 without having to change anything else.

      --
      I am TheRaven on Soylent News
    23. Re:Not really by fast+turtle · · Score: 1

      This is why I tend to stick with AMD over Intel. I can replace a board 5 years later with the same socket and my current CPU investment should simply drop right in and work while giving me access to newer features such as USB 2 and PCIe. Intel though wants you to buy a new motherboard and CPU every couple of years, which is why they change socket design. It also allows them to force segregate the market. A prime example is ECC memory support. The damn memory controller is now built into each and every damn Pentium/Xeon chip yet they don't support ECC on anything except the Server class Xeons. Why? Because it allows them to force you to buy the more expensive chip if you need ECC support unlike AMD who includes ECC support in damn near all of their desktop CPU's. The issue is the motherboard needs the extra traces to properly support ECC, which is the expensive part unlike including the support on the Die as AMD does.

      --
      Mod me up/Mod me down: I wont frown as I've no crown
    24. Re:Not really by Anonymous Coward · · Score: 0

      Also: There's no substitute for low power drain in portable applications. Further, smaller feature size means higher leakage. This leaves Intel with TWO dings for the portable market.

      You're really reaching to try to turn this into a win/win/win for AMD.

      If resonant clock meshes turn out to be a really good thing, there's nothing which is going to stop Intel from adopting them too. Did you not notice that this isn't an AMD technology? The story is about AMD licensing it, not AMD inventing it. The inventors are operating a tiny patent licensing outfit. I'm sure they'd be overjoyed to license it to Intel too, and anyone else who wants to use it. The more they license it, the more money they make.

      Second, no, smaller feature size does not automatically equate to higher leakage, especially when there are confounding factors. And over the last several process generations, there have been tons of those when comparing Intel process tech to the rest of the industry. Intel's been rolling out key new technologies before anyone else. Of particular relevance here, since you're bringing up leakage current, is that Intel was the first (at 45nm) to introduce the combination of a Hi-K gate dielectric material and a metal gate, which dramatically reduced leakage compared to Intel 65nm. Intel's competition still hasn't fully rolled out HKMG at 32nm or 28nm, and nobody seems to have done it quite so well as Intel. At 22nm, Intel is shortly to become the first to ship non-planar transistors (FinFET) for revenue, and once again this reduces leakage (or, if they choose, increases performance at the same leakage as 32nm).

      Finally, AMD is not merely a couple generations behind on fabs any more. As in, they don't own fabs now. They're at the mercy of their partners to try to keep up with Intel. And their partners don't really want to try, because trying to be on the bleeding edge of logic performance isn't a profitable model in the foundry industry.

    25. Re:Not really by Ungrounded+Lightning · · Score: 1

      I could really care less about AMD vs Intel. I have no horses in that race - and haven't for over a decade. (The only issue I have with either of them is Intel's inclusion of hardware remote administration backdoors in their chipsets, which they tout as a "feature" - and I haven't looked into whether AMD does the equivalent.)

      My only point was that using this particular technology is good for about a generation's worth of density without a process upgrade and a substantial power and heat savings.

      If Intel wants to do it, too, the more the merrier. Meanwhile, while AMD has it deployed and Intel doesn't yet, it may put AMD back into the race for one round.

      --
      Bantam Dominique roosters crow a four-note song. Once you've heard it as "Happy BIRTHday" you can't NOT hear it that way
  8. Resonant Clock Mesh? by Anonymous Coward · · Score: 0, Funny

    Sounds fake. Like Hyperthreading. Sounds like they are doing some trick to make the numbers better, but not really improve any performance.

    But I'm just an armchair slashfag know-it-all.

    1. Re:Resonant Clock Mesh? by MobileTatsu-NJG · · Score: 1

      But I'm just an armchair slashfag know-it-all.

      Hah. If I were a bettin man I'd say you caught the recent story about CERN and the loose cable.

      --

      "I like to lick butts!" by MobileTatsu-NJG (#32700246) (Score:5, Informative)

  9. Even for single thread it doesn't by Sycraft-fu · · Score: 1

    See Athlon vs P4. Both were best for single threaded stuff, owing to a single core. However the Athlon did more with less, got better performance at lower clocks. Why? It could do more per clock, or more properly took less clocks to execute an instruction.

    IPC matters and the Core i series is really good at it. Bulldozer, not as good. What that means is that all other things being equal, BD needs to be clocked higher than SB to do the same calculations in the same time.

    Well that is also a problem because the Core i series are beasts with regards to clock speed. You more or less cannot find a k series part that won't overclock to 4GHz on stock cooling at stock voltages with no stability issues.

    You need both good IPC and good clock speed for bitchin' single threaded performance. Really the only thing Bulldozer has going for it is that it isn't a true 4 core system, in the classic way of thinking about it. It isn't a full 8 cores, but it is more than just 4 cores with 2 threads per core. So that can help for highly parallel stuff. Unfortunately, usually even in those cases SB wins out, and there is plenty that is not so highly parallel.

    1. Re:Even for single thread it doesn't by PitaBred · · Score: 1

      If you go aftermarket cooling, you can almost certainly hit 4.5GHz on stock voltages. Right now I'm doing 4.8GHz on stock voltage with a 2500K.

  10. There are no technical details. by game+kid · · Score: 4, Funny

    Cyclos is the only supplier of resonant clock mesh IP.

    There are no technical details. It's intellectual property, so it's powered by pixie dust, mana potions, and lawyers. Can't get more meaningful than that.

    --
    You can hold down the "B" button for continuous firing.
    1. Re:There are no technical details. by wbr1 · · Score: 1

      You forgot unicorn farts. Fail. :)

      --
      Silence is a state of mime.
    2. Re:There are no technical details. by Anonymous Coward · · Score: 0

      It's intellectual property, so it's powered by pixie dust, mana potions, and lawyers.

      You forgot unicorn farts. Fail. :)

      He did not. He said layers.

    3. Re:There are no technical details. by PatPending · · Score: 2

      There's plenty of technical details here:

      Resonant clock and interconnect architecture for digital devices with multiple clock networks (full text only; i.e., no images)

      If you want images also, here's the PDF.

      --
      What one fool can do, another can. (Ancient Simian Proverb)
    4. Re:There are no technical details. by Anonymous Coward · · Score: 0

      You forgot wit. Fail. :(

  11. Real Stuff by Anonymous Coward · · Score: 0

    This is a real thing. I'm sure anyone who has been reading the research literature knows that it works, but it's just very difficult to do well.

  12. IBM POWER 6 by Anonymous Coward · · Score: 1

    IBM was selling POWER 6 processors running at 5Ghz years ago.

    1. Re:IBM POWER 6 by the+linux+geek · · Score: 1

      And they still sell Power7 with 8 cores and issuing 6 instructions per cycle at 4GHz+. They're obscenely fast, but they're also not cheap unless you're comparing them to Itanium, SPARC, or Intel's -EX series Xeons.

    2. Re:IBM POWER 6 by dbIII · · Score: 2

      Except they won't sell them to you unless you are Sony or a reseller that's used to Defence pork contracts. The last time I finally got a price on a POWER CPU system (after two annyoing weeks of the salesguy building up a "relationship" and carefully weighing my wallet) I gave up and got four Xeon systems that were almost as good each for a lower price than the single POWER CPU system.

    3. Re:IBM POWER 6 by Anonymous Coward · · Score: 0

      You are honestly telling me that you could not buy a POWER, even a used power6 system from ebay?
      I'm running SUSE on a power5+ 520 alongside an AIX and an IBM i partition... the box costs less than 2k usd

  13. awesome by smash · · Score: 3, Insightful

    Maybe it will catch up to the Sandy Bridge Core i5 now?

    --
    I run: Windows, OS X, Linux, FreeBSD. Just because you have a hammer, doesn't mean everything is a nail.
  14. Synopsys by Anonymous Coward · · Score: 1

    -Each logic gate in a chip needs a clock signal to get to it.

    -This is normally done via a wavy, wormy mesh of clock wires.

    -Clock skew (When clock is sent, it takes x microseconds to traverse the chip) scales exponentially. 10% at 1ghz is 100mhz skew; 10% at 3ghz is 300mhz skew. And so on.

    -Clock skew = VERY BAD, big limiting factor in making faster chips.

    -Cyloes has solution.

    A: Replace the clock with a simple "Tank Circuit" clock, to reduce the [possibility of the clock not working.

    B: Replace a massive mesh of interweaving wires with a "clock plane". Most PCB's have a voltage and grounding plane, why not a clock plane?

    This design principal has some advantages:

    +Less length of wire in the processor = more savings. As hz increases, the savings in power are exponential. As you increase hz the entire name of the game becomes figuring out how to put fewer electrons through the die. Less wiring = less resistance = less power = less heat = more potential for speed.

    +No more designing parts of the processor to time around each other; the whole plate loads and unloads very predictably now and superconductive materials can be inserted between the clock and plate to increase saturation speed.

    Realworld:
    10-35% decrease in power usage.
    Scaling the processor to more transistors or more ghz is now much less problematic.

    1. Re:Synopsys by Thiez · · Score: 1

      > Clock skew (When clock is sent, it takes x microseconds to traverse the chip) scales exponentially. 10% at 1ghz is 100mhz skew; 10% at 3ghz is 300mhz skew.

      So when you triple the clockspeed, you triple the skew? That sounds... linear, and not exponential at all.

    2. Re:Synopsys by windwalkr · · Score: 1

      I also thought that at first, but on a second glance.. skew per clock cycle is increasing exponentially. Which is probably what they care about here?

  15. Sorry AMD by Osgeld · · Score: 0

    I am currently currently specing out my next two new machines, and for the first time since 1999 I am going intel. The X2 and X3 that are being replaced were just a pain in the ass, and directly out of the box felt unimpressive.

    And its not like they were a great deal or anything, less than 20$ difference, so whats the angle?

    1. Re:Sorry AMD by EmagGeek · · Score: 0

      Enjoy all those legacy SATA2 ports that won't support modern SSDs or hard disks.

    2. Re:Sorry AMD by Anonymous Coward · · Score: 0

      Really I have the 720 x3 and have had no problems with it. Also it overclocks great i can get 3.3ghz stable with no problems.

    3. Re:Sorry AMD by Osgeld · · Score: 1

      yea cause its a huge problem now with my SATA2 ports and no SSD's cause it really doesn't bother me to wait an extra 3 seconds for the thing to boot once or twice a month

      I need more brain power, and could not care less about loading screens

    4. Re:Sorry AMD by Anonymous Coward · · Score: 0

      >> I need more brain power

      Well that much is certain.

    5. Re:Sorry AMD by Osgeld · · Score: 1

      I have no problem with my720, I never stated it was unstable and crashing

    6. Re:Sorry AMD by PitaBred · · Score: 1

      They don't? Since when? There's not a single hard disk out there that will saturate an SATA2 port. And all modern Intel boards have at least 2 SATA3 ports.

    7. Re:Sorry AMD by viperidaenz · · Score: 1

      Thats why the intel Q67 chipset has 6 sata3 ports?

    8. Re:Sorry AMD by Anonymous Coward · · Score: 0

      "yea cause its a huge problem now with my SATA2 ports and no SSD's"

      While SATA3.0 would be nice, even my 550MB/s(80k IO) read 400MB/s(36k IO) write SSD rarely breaks 60MB/sec except when doing heavy multi-tasking. Doing benchmarks of sequential or random IO both pegs my SATA2.0 port, but those are synthetic.

      My guess is that any single given app that only reads a single file at a time, typically only has 1 outstanding IO at any one moment. One would have to write code differently to thread IO requests properly. Like reading from multiple files at a given time and possibly making multiple sequential IOs at the same time instead of waiting for the last one to come back.

    9. Re:Sorry AMD by Anonymous Coward · · Score: 1

      The change from SATA2 to SATA3 involved a whole lot more than just increasing the data rate.

      The 3.0 specification contains the following changes:

              6 Gbit/s for scalable performance
              Continued compatibility with SAS, including SAS 6 Gbit/s. "A SAS domain may support attachment to and control of unmodified SATA devices connected directly into the SAS domain using the Serial ATA Tunneled Protocol (STP)" from the SATA_Revision_3_0_Gold specification.
              Isochronous Native Command Queuing (NCQ) streaming command to enable isochronous quality of service data transfers for streaming digital content applications.
              An NCQ Management feature that helps optimize performance by enabling host processing and management of outstanding NCQ commands.
              Improved power management capabilities.
              A small low insertion force (LIF) connector for more compact 1.8-inch storage devices.
              A connector designed to accommodate 7 mm optical disk drives for thinner and lighter notebooks.
              Alignment with the INCITS ATA8-ACS standard.

    10. Re:Sorry AMD by EmagGeek · · Score: 1

      Um, no, it doesn't. It supports two SATA3 ports and four SATA2 ports.

    11. Re:Sorry AMD by RightSaidFred99 · · Score: 1

      Uhh, he will. Along with the SATA3 ports that will. What in the world are you talking about?

    12. Re:Sorry AMD by drsmithy · · Score: 1

      Enjoy all those legacy SATA2 ports that won't support modern SSDs or hard disks.

      SATA 3 devices work fine on SATA2 ports, all but the fastest spinning disks barely stretch SATA1, and you'd struggle to find many situations even high-end SSDs are meaningfully constrained by SATA2.

    13. Re:Sorry AMD by Pentium100 · · Score: 1

      6 Gbit/s for scalable performance

      I don't really think there are SATA (not SAS) hard drives that can saturate SATA2, so not really a big deal.

      Continued compatibility with SAS, including SAS 6 Gbit/s.

      Since you can only connect a SATA drive to a SAS controller (not SAS drive to SATA controller), not a big deal if you are using a SATA controller.

      Isochronous Native Command Queuing (NCQ) streaming command to enable isochronous quality of service data transfers for streaming digital content applications.

      Kinda neat, though I don't know how useful in real life.

      Improved power management capabilities.

      Does not really matter for desktops - a hard drive does not use a lot of power compared to the rest of the PC.

      A small low insertion force (LIF) connector for more compact 1.8-inch storage devices.

      Does not matter for 3.5" desktop drives.

      A connector designed to accommodate 7 mm optical disk drives for thinner and lighter notebooks.

      Does not matter for desktops.

      Alignment with the INCITS ATA8-ACS standard.

      Whatever.

      So, SATA3 is not really useful, or rather, it is not something that is worth replacing the motherboard if you have a motherboard with SATA2 and do not plan to use SSDs.

  16. amd boards have better pci-e I/o and lanes by Joe_Dragon · · Score: 0

    amd boards have better pci-e I/o and lanes then Intel boards

  17. Really? by stazeii · · Score: 1

    So, are they going to give their chips porn names? Piledriver? Someone needs to clue these guys in... Just calling your stuff by construction equipment names doesn't make it cool. I'm a Mac user, and I still hate the fact Apple has latched onto this "let's call our products by their code names" crap. Guess I should look forward to the days of AMD's Cleveland Steamer processors.

  18. Maybe its their choice of marketing words by viperidaenz · · Score: 2

    but clock resonance sounds like it wouldn't play well with changing the clock frequency.

    1. Re:Maybe its their choice of marketing words by Anonymous Coward · · Score: 0

      All clocks are based on some fundamental resonant frequency. Their output can be skewed a bit with the use of inductors and capacitor. (Not to mention the harmonic frequencies.)

    2. Re:Maybe its their choice of marketing words by Anonymous Coward · · Score: 0

      Wow, I've never seen anyone so confidentially fail basic linear systems theory.

    3. Re:Maybe its their choice of marketing words by Anonymous Coward · · Score: 0

      Your snarky reply is annoying and serves no purpose other than to stroke your own ego.

      Anyway good luck in the job market being such a condescending person.

    4. Re:Maybe its their choice of marketing words by Anonymous Coward · · Score: 0

      "Confidentially?"

      You meant "confidently," right?

      (And would you consider yourself as having failed basic English?)

    5. Re:Maybe its their choice of marketing words by rrohbeck · · Score: 1

      You can double it :)

  19. resonate clock mesh by slew · · Score: 5, Informative

    Quick background: Currently clocks on most generic chips today are structured as trees. As you can imagine the fan-out of the clock trees is pretty large and thus require clock buffers/driver circuits which need to be balanced so that clock signal gets to the leaves at about the same time (in a typical design where you don't use a lot of physical design tricks). To ease balancing the propagation delay, the clock tree is often physically looks like a fractalized "H" (just imagine the root clock driving in the center of the crossbar out towards the leaves at the corners of the "H", the wire lengths of the clock tree segments are the same, then the corners the big H driving the center of a smaller "H", etc, etc). Of course at the leaves, there can be some residual imbalance due to small manufacturing variations and wire loading and that has to be accounted for in closing the timing for the chip (to avoid short paths), and ultimatly these imbalances limit the upper frequencies achievable by the chip.

    Additional background: In any electrical circuit, there are some so-called resonant frequencies because of the distributed (or lumped) inductance and capacitances in the network. That is some frequencies experience a lot less energy loss than average (for the car analogy buffs, you can get your car to "bounce" quite easily if you bounce it at it's resonant frequency).

    The basic idea of the Cyclos technology is to "short-circuit" the middle of the clock tree on the chip with a mesh to make sure all the middle of the clock tree is coordinated to be the same clock (as oppposed to a typical H tree clock, in every stage the jitter builds up from the root). That way you avoid some of the imbalances the limit the upper frequencies achievable by the chip. The reason I say "short-circuit" is that it really isn't a "short circuit". If you just arbitrarily put in a mesh in the middle of a clock tree, although it would tend to get the clocks aligned, it would presents a very large capacitive and inductive load to drive and would likely increase power greatly. **Except** if that mesh was designed so that it resonated at the frequency that you were going to drive the clock, then you can get the benefit of jitter reduction w/o the power cost. Since you get to pick the physical design parameters of the mesh (wire width, length, and grid spacing, and external tank circuit inductance) and the target frequency, theoretically you can design that mesh to be resonant (well, that remains to be seen).

    The reason this idea hasn't been used to date is that it's a hard problem to create the mesh with the proper parameters and now the processor really has to just run at that frequency all the time (well, you can do clock cycle eating to approximate lower frequencies). Designers have gotten better at these things now and the area budgets for these types of things have gotten in the affordable range as transistors have gotten smaller.

    FWIW, In a pipeline design (like a cpu), sometimes it's advantagous to have a clock-follows-signal clocking topology or even an async strategy instead of a clock tree, but there of course is a complication if there is a loop or cycle in the pipeline (often this happens at say a register file or a bypass path in the pipeline), so that trick is limited in appliciablity, where the mesh idea is really a more general solution to clock network jjitter problems.

    Here's a white paper that describes this idea... http://www.cyclos-semi.com/pdfs/time_to_change_the_clocks.pdf

    1. Re:resonate clock mesh by TheSync · · Score: 3, Interesting

      How can the mesh be resonant to a square wave (with lots of high frequency harmonics over a huge band)?

      I can imagine it being resonant to a single frequency sine wave.

      But if the clock mesh is powered by a sine wave, you have to turn it back into a square wave to drive gates, and to do that you have to compare the clock voltage level with some known voltage levels, and there you may have process inaccuracies.

    2. Re:resonate clock mesh by kermidge · · Score: 1

      Thanks to you and PatPending, I'm now read into something that seems mighty interesting and far over my head. I'm wondering if, or how, this might be applied to the "3d" chips that IBM is/was working on. (Btw, I re-read James P. Hogan's "Inherit the Stars" on Saturday - he mentions stacked chips with internal cooling channels, in 1978.)

    3. Re:resonate clock mesh by subreality · · Score: 4, Interesting

      [quote]How can the mesh be resonant to a square wave (with lots of high frequency harmonics over a huge band)?[/quote]

      There's no such thing as a square wave at 4GHz. You can draw them like that on paper, but in reality the edges smear into a pretty good approximation of a sine wave.

      Regardless, it will still have some higher frequency components, but you don't have to worry about them. The resonance won't help generate nice sharp edges, but that's the line driver's job. The resonance is just to save energy by helping pump the voltage at the fundamental frequency.

      (Disclaimer, not an EE, but I've looked over their shoulders a bunch of times)

    4. Re:resonate clock mesh by Anonymous Coward · · Score: 0

      All the higher harmonics does is to increase the slew rate of the clock signal. Only the region around the logic threshold is important to a digital circuit.

      A square wave is a weighted sum of the odd harmonics. The weights of the nth harmonics is 1/n. So if you can take advantage of the power saving at the fundamental frequency via resonant which is a big chunk. Even if you have to throw some power to take care of 3rd and 5th so that you can have a faster slew rate in your clock edge, you still win.

      See here for the visualization of what the higher harmonics: http://www.allaboutcircuits.com/vol_2/chpt_7/2.html

    5. Re:resonate clock mesh by Anonymous Coward · · Score: 0

      You would obviously tune it to the highest powered harmonic of the square wave.

    6. Re:resonate clock mesh by lexman098 · · Score: 1

      What strikes me as the biggest problem with this isn't finding a strong resonant frequency, but building a small enough inductor with high Q. The inductor has to be small since the resonant frequency is fixed and the capacitance of an entire metal layer is huge (mesh or not). Q scales with inductance, and if your Q is too low then it's going to require a lot more energy to keep the thing resonating and you're not going to get much out of your "resonator".

      This is also completely disregarding the added expense of using up a metal layer or two (or three) to support a clock mesh. This directly impacts placement density (transistor count) due to the decreased routability. TFA seems to just blow this off with the claim of an IC being pad limited (so what has this Moore's law business been about?).

    7. Re:resonate clock mesh by Anonymous Coward · · Score: 0

      The trick applies to any type of chip arrangement, so long as there's a means of fabricating and linking the necessary components. Planar or 3d is irrelevant to the discussion.

    8. Re:resonate clock mesh by Anonymous Coward · · Score: 0

      Thank you, very informative

    9. Re:resonate clock mesh by drinkypoo · · Score: 1

      The reason this idea hasn't been used to date is that it's a hard problem to create the mesh with the proper parameters and now the processor really has to just run at that frequency all the time (well, you can do clock cycle eating to approximate lower frequencies). Designers have gotten better at these things now and the area budgets for these types of things have gotten in the affordable range as transistors have gotten smaller.

      Indeed, changing the clock rate has very little effect on power consumption compared to changing the voltage. If they're going to vary core voltages, too, they will probably have better power saving without clocking.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    10. Re:resonate clock mesh by Asmodae · · Score: 1

      I am an EE, and pretty much this. To get something that looks like a square wave, the bandwidth of your overall circuitry has to be MUCH higher than the frequency of the clock, since the square edge has infinite frequency content. Now that's not to say a good approximation of a square wave could NOT exist at 4 GHz or higher, but there's no need and no point since that would complicate and hinder other aspects of the design.

  20. A tuned circuit ... by PPH · · Score: 4, Insightful

    ... inside the processor? Sounds like the end of overclocking.

    --
    Have gnu, will travel.
  21. Desktop CPU space is a big yawnfest by Gordo_1 · · Score: 2, Insightful

    and has been for at least 5 years. A theoretical 10% performance boost? Gimme a break. I upgraded from a Core2Duo E6600 @ 2.4GHz to a quad core i5 2600k which runs at an overclocked 4.5GHz on air... Day to day, the new rig delivers a *mostly* perceptible performance advantage, but nothing earth shattering... I give you several recent changes that felt bigger:

    1. Moving from hard drive to SSD
    2. Moving from a DirectX9 class GPU to a DirectX 11 GPU (at least in games).
    3. Move from pre-JIT JS browser engine to a JIT-engined browser.

    As far as desktop CPU development goes, I think the future is largely about optimizing software for the multi-core architectures, not adding Gigahertz.

    1. Re:Desktop CPU space is a big yawnfest by Anonymous Coward · · Score: 0

      Gordo_1 meet Amdahl's Law. Amdahl's Law meet Gordo_1. I'll leave you two to get acquainted.

      Peak single core performance will always matter.

    2. Re:Desktop CPU space is a big yawnfest by Anonymous Coward · · Score: 0

      I bet dinosaurs were also bored before they died out.

  22. AMD isn't going to die for a long time by Anonymous Coward · · Score: 1

    AMD isn't going to die for a long time. I am honestly an AMD fanboy and use them in all of my computers. But the fact of the matter is that even if Intel could beat them to a pulp (and most likely they could if they wanted to due to their power and warchest) but the fact of the matter is that Intel will not allow them to die even if they have to start sabotaging themselves to do it. AMD is the only thing keeping them from Monopoly status and a LOT more scrutiny and regulation. So long as they can point to AMD as what they are competing against they can keep from that level of pressure.

    Now I really do hope that one day AMD becomes a serious threat to Intel but as it stands right now, they are only there cause Intel needs them. They do have stuff they are good at and have a price point I love and great stability and they do force Intel step up their game on the performance and prices though.

    They are Intels best friend and worst enemy at the same time. Without AMD, Intel would be a Monopoly and regulated as such. But with AMD, Intel has to actually watch them some and compete with them some or they risk actually becoming a sizable threat.

  23. Cyclos explains that they are using inductors by DrJimbo · · Score: 2

    link

    Cyclos resonant clock mesh technology employs on-chip inductors to create an electric pendulum, or "tank circuit", formed by the large capacitance of the clock mesh in parallel with the Cyclos inductors. The Cyclos inductors and clock control circuits "recycle" the clock power instead of dissipating it on every clock cycle like in a clock tree implementation, which results in a reduction in total IC power consumption of up to 10%.

    Inductors save power because unlike most other circuit elements, inductors are able to store energy in a magnetic field so it can be used later on. This is part of how switching power supplies get their efficiency.

    --
    We don't see the world as it is, we see it as we are.
    -- Anais Nin
    1. Re:Cyclos explains that they are using inductors by Anonymous Coward · · Score: 0

      Inductors have a large footprint on a semiconductor and that magnetic field storing the energy creates cross-talk (noise) on adjacent features. Those may not be show stoppers, but they actively work against increasing transistor density and shrinking chip features.

    2. Re:Cyclos explains that they are using inductors by Anonymous Coward · · Score: 0

      Your argument is flawed. I could say the same about capacitors:

      Capacitors save power because unlike most other circuit elements, capacitors are able to store energy in an electric field so it can be used later on. This is part of how switching power supplies get their efficiency.

      The critical point is that when you have an L and C together, energy will slosh back and forth between them. It's the interaction that counts, not one element or another.

  24. It is one of many techniques... by Anonymous Coward · · Score: 0

    that trades of space for lower power. It might be an easy trade to make when you need enough area for the thousands of bumps anyway. Its a good thing to do because other wise the power per area gets really unmanageable. ARM and Intel have been using many approaches to keep down power this is just the next lowest hanging fruit. I doubt that AMD has a lock on this technology. Hopefully they can make a few bucks out of it. It is great to see AMD keeping Intel and ARM on their toes. This may put a crimp in the plans of the OC crowd. I don't care much about OC, it's a lot of noise and expense when I rarely tax the processor anyway.

  25. Resonant Clock Mesh by Lord+of+the+Fries · · Score: 1

    I don't even know what one is. And I haven't even glanced at the fine article. I just know I want one of those. Sounds so shiny. Just wanna say it over again and again and again...

    Resonant Clock Mesh

    Resonant Clock Mesh

    Resonant Clock Mesh...

    --
    One man's pink plane is another man's blue plane.
  26. Neat and not vaporware at all. Explanation: by Ungrounded+Lightning · · Score: 5, Interesting

    Agreed [that it looks like vaporware]. It's a breathlessly ebullient press release sales pitch.

    Agreed it's a sales pitch. But not vaporware at all. Very neat solution. (I saw another with similar properties a couple years ago but this one is 'way better.)

    The issue is the power consumption of the clocking of the chip. Modern designs are primarily layers of D-type flip-flop registers separated by small amounts of random logic and all the flip flops are clocked simultaneously, all the time. The clock signal is input to ALL the flipflops and a bit of the random logic. I'm guessing somewhere between one in five and one in ten gate inputs are driven about equally by CLK or ~CLK. Further, the other signals flip between one and zero once, sometimes, on each cycle. ALL the CLK signals flip from zero to one and back to zero EVERY cycle. So there's a lot of activity on the clock.

    In CMOS the load on the clock is primarily capacitave - the stray capacitance of the CMOS gates and wiring - plus some losses, mainly due to the resistance of the wiring. The stray capacitance has to be charged and discharged every cycle. The charge represents energy. In a conventional design the clock drivers are essentially the same thing as logic gates (inverters). New energy is supplied from the power supply (and about half of it, excluding signal-line resistive losses, dumped as heat in the pullup transistors of the drivers) every cycle as the lines are charged. Then the charge is dumped to ground (and the rest of the energy dumped as heat in the pulldown transistors). All that energy gets lost as heat every cycle, and it represents about 30% of the power consumed by the chip. It would be nice to scavenge it and reuse most of it for the next tick.

    A previous invention used a half-wave transmission line looped around the chip and connected plus-to-minus. A big mobius strip. The CLK and ~CLK loads acted as distributed capacitance around the transmission line. A clock waveform circulated continuously, twice per cycle. Instead of a sea of drivers providing new energy and then throwing it away every cycle, the transmission ring had a few drivers distributed around it, keeping the wave circulating and correctly formed, and pumping in enough energy to replace the resistive losses while the bulk of the energy went round-and-round. Result: Most of the clock power requirements and heating load go away.

    Unfortunately, the circulating clock wave meant the region completing a computation ALSO went round-and-round, rather than everything switching at the same time. Stock design tools assume CLK/~CLK is simultaneous (except for minor variations) across the whole chip. So using that earlier system would require a major rewrite on the stock tools and new design methodologies.

    THIS system does a similar hack energetically, but with everything in sync. Instead of a sea of drivers driven by a carefully-balanced tree of pre-drivers, the CLK and ~CLK are constructed as a pair of heavy-conductor meshes - like two stacked layers of flattened-out window screens. These form two plates of a capacitor. These plates are connected by an inductor, forming a resonant "tank circuit". When this is "pumped up" by a few drivers and is "ringing", energy alternates between being an electric field between the screens and a magnetic field in the inductor coil, twice (once for each polarity) each cycle. Again the bulk of the energy is reused over and over while the drivers only have to replace the (mostly) resistive losses (and pump it up initially, over a number of cycles). Again the bulk of the clock power and heating is gone. But this time the whole chip is switching essentially simultaneously, so the stock design tools just work.

    Neat!

    Downside (of both inventions): You can't quickly start and stop the clock in a given area or run it more than a few percent off the speed set by the resonance of the tank circuit or transmission line. No overclocking. Also no clock gating to save power on quiesc

    --
    Bantam Dominique roosters crow a four-note song. Once you've heard it as "Happy BIRTHday" you can't NOT hear it that way
  27. When the ones using the cell architecture came out by dbIII · · Score: 1

    At the time nothing used was available.
    It was about the time people were building clusters with Playstation 2 consoles because it was the only cost effective way to get a platform with that CPU (pity about the lack of memory). I haven't even bothered to check since. A couple of weeks with daily calls from a very slimy salesman put me off and gave me an idea of what it would be like to be a good looking woman in a bar full of desperate horny lowlifes.

  28. The anecdote was about IBM selling by dbIII · · Score: 1

    The anecdote was about IBM selling the things, but instead of making an effort selling them they were looking almost like they were deliberately setting them up to fail.

  29. 4ghz? No problem! by Anonymous Coward · · Score: 0

    I have something like this, it uses water cooling and I call it "hydromesh technology"(Patent Pending)!

  30. Re:Neat and not vaporware at all. Explanation: by gjscott332 · · Score: 1

    I've seen the cyclos public presentations a few years ago. A somewhat simpler explanation would be: On a chip such as a big processor a large part of the power is used distributing the clock signal. This is for two reasons. 1) It takes more power to get a clock distribution network closer to zero skew, and processor design is done zero skew. 2) Ratio of Clock power vs data power goes up as you put less data stages between your flops. (double clock speed, double data activity vs double clock activity x twice the flops (or local clock gates) load). Making an advance processor the most suitable chip type for a technology that's potentially difficult to use. These chips already use clock meshes for their distribution. The mesh looks to it's driver like a big capacitor. The driver does work charging the mesh each cycle. (imagine a child on a swing, lift the swing up once and let go, child annoyingly stops at bottom of swing, repeat). In this technology the driver is replaced by a 'kicker' and a big inductor is matched to the capacitance of the mesh. Now you have a resonant circuit (child is swinging without stopping, just needs a small nudge each time). Less power at the cost of either on chip or off chip inductor, lots of non-standard design flow and $$$ to cyclos for the clever bits (the kicker and tool knowledge).

  31. Re:Neat and not vaporware at all. Explanation: by elrond1999 · · Score: 1

    You are missing one point of the clock mesh. The mesh is only the middle part of the clock tree. This part serves to spread the clock globally on a single metal mesh with zero skew. Then the lower part of the clock tree is built normally and can be gated. This means that you can certainly turn off the clock of quiescent sections of the chip. You will still have to oscillate the mesh, but the registers and detailed distribution can be turned off.

    Also you can probably run this mesh off the resonant frequency, but at a power disadvantage. Or you could dynamically vary the L if that was possible.

  32. AMD vs.Intel by Lonewolf666 · · Score: 1

    A few months ago, I have upgraded an older PC to an AMD 4 way Phenom II as well. I chose the 910e for its low TDP of 65 watts and I'm so far quite happy with it.

    But Intel has similar parts, like the Core i5-2400S. If it wasn't for the ECC RAM support the AMD offers (but Intel only in expensive Xeons), I might have gone with Intel this time. In most reviews, the i5-2400S wins clearly on performance.

    So it will be a good thing if AMD can boost its performance/power ration and become more competitive.

    --
    C - the footgun of programming languages
  33. Resonance clock mesh? by jfbilodeau · · Score: 2

    I don't know about you, but I would be concerned about the effects of a resonance clock mesh cascade failure.

    I know a guy who had to deal with a resonance cascade and it wasn't pretty.

    --
    Goodbye Slashdot. You've changed.
    1. Re:Resonance clock mesh? by Anonymous Coward · · Score: 0

      Half-Life?

      Anyone?

    2. Re:Resonance clock mesh? by Anonymous Coward · · Score: 0

      I laughed. Ignore the people who didn't get it.

  34. Re:OMG an Intel killer! by Anonymous Coward · · Score: 0

    Eh, you're better buying stock in a telecom. They all pay dividends.

  35. Only 4 GHz? by BBCWatcher · · Score: 1

    IBM hit 5.2 GHz in 2010. That's with all cores active and constant operating speed, too.

  36. Simple exemple by DrYak · · Score: 1

    Specially, sometime the best algortihme isn't the most efficient one.
    It might be better to use some less efficient algorithm, but that can be better paralized.
    A O(n log n) algorithme sounds better than a O(n ^ 2). Except when the first is sequential, and the second could be parallelise along N, having n separate thread of O(n) complexity each.

    Concrete exemple :
    - For sorting big datasets, quick sort is among the best known algorithms... ...except that it's rather serial and as such is best algorithm *for single thread* usage.
    - Sorting nets have a lot of exchange/compare operation. ...except most are completely independant and thus can be done in parallel fashion.
    End result: when running in parallel, sorting nets out perform quicksort, even if they rely on more operations.

    Yes, nine women can't do 1 baby in one month.
    But if you want 9 babies, it's better to ask 9 different women. Not to wait 81 months on 1 single lady.

    --
    "Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
    1. Re:Simple exemple by Daniel+Phillips · · Score: 1

      I think you'd be talking minimum 150 months for the single lady, it takes a few months to reset the equipment. Oh, but there are indeed parallel parallel babies in some cases, so maybe that could be compressed.

      --
      Have you got your LWN subscription yet?
  37. Re:Neat and not vaporware at all. Explanation: by fast+turtle · · Score: 1

    The main problem that AMD has with both the Brazzos and Llano designs is production. Simply put, demand is so high that they were unable to meet the demand for all of their chips. Because of this and yeild issues with Global Foundaries, they jetisoned GF and worked with TSMC to get it out. From what the 3 prior posts (great explanations and why I still read /.) said, My take is that AMD now has a method of cutting the power consumption of chips like the A350 from 12 watts to 1/3 - 1/2 (4 - 6 watts) while staying on the same process size. This opens the production floodgates across the board because there are plenty of 45nm foundaries with available capacity.

    Remember that one of the problems they had with Bulldozer was getting the yeilds up and this may be a solution to some of that problem. Another issue that comes to mind is that Global Foundaries simply didn't have enough capacity available, so AMD was going to have to farm some of their chip production out. So what chips make sense? Those still using the 45nm process and if this offers anything near what I see in power reduction, then they would finally be able to meet Intel's Performance per Watt and start regaining market share.

    --
    Mod me up/Mod me down: I wont frown as I've no crown
  38. This is about energy efficiency, not GHz by warrior · · Score: 1

    Too bad the article is lacking on the technical details. This is about energy efficiency, not GHz. They have hit 4GHz and higher with a traditional clock mesh. The point here is that they hit 4GHz with a resonant clock mesh. What this means is that instead of charging and discharging the huge capacitor that is the clock grid every cycle using only FETs connected to VDD and VSS (traditional digital logic), there is an LC tank circuit that is resonating with the clock grid. The power rails still do some of the charging and discharging of the grid, but now some of the energy comes from the oscillator. I have seen the paper, the distributed LC tank is pretty cool. The technical achievement is that they got this to run at 4GHz while keeping the skew between clock grid points as low as a traditional mesh (had the skew increased, the max frequency of the processor would go down). They claim reduced clock power of 25%. Given that clock power is roughly half the power of a core, that's a 12% power reduction overall, pretty impressive. It's also really cool that the whole thing is on-die - they made the inductors out of back-end wires on the CPU die itself, no additional components on the package so no increased cost.

    --
    Intel transfer the difficult from Hadware to software, for get more power, programmer need more technology. -- chinaitn
  39. AMD, learn to make decent CPUs! by Anonymous Coward · · Score: 0

    Sandy Bridge already runs at 4 GHz just fine without this bullshit stuff.

  40. Re:OMG an Intel killer! by AlienIntelligence · · Score: 1

    I'm going to pour my entire savings into AMD stock - anyone else with me?

    Yup, I will too... on a short, let me know how many you will be buying.

    -AI

    --
    For me, it is far better to grasp the Universe as it really is than to persist in delusion
  41. Fertility by DrYak · · Score: 1

    I think you'd be talking minimum 150 months for the single lady, it takes a few months to reset the equipment.

    The fertility in human mothers is linked to breast feedin / weaning. No breast feeding mean shorter period of time until reset. Longer delay until weaning mean longer time until reset.
    And the 81 months were an image anyway.

    --
    "Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
    1. Re:Fertility by Daniel+Phillips · · Score: 1

      Great example, by the way.

      --
      Have you got your LWN subscription yet?