Slashdot Mirror


AMD Delays Hammer

TeJarz writes "C|Net reports that their next processor (Hammer) has been rescheduled from its original Q4 release to Q1 2003. To quote C|Net: 'The delays are occurring to accommodate the release of a new version of Athlon with a 333MHz bus, said Crank. Current Athlons come with a 200MHz bus and 256KB of secondary cache.' Let's hope this doesn't get moved again."

112 of 346 comments (clear)

  1. Current Athlons by Anonymous Coward · · Score: 3, Insightful

    ...have a 266MBz bus

    1. Re:Current Athlons by Anonymous Coward · · Score: 3, Funny

      What is that, MegaBizzatch?

    2. Re:Current Athlons by kir · · Score: 2

      I laughed so friggin hard at that! That got passed around the office REALLY fast!

      MegaBizzatch. Wonderful!

      --
      3cx.org - A truly bad website.
    3. Re:Current Athlons by parkanoid · · Score: 2, Informative

      Technically it's 133, just double-pumped (it's obvious that the 533 p4s don't actually have 533 mhz fsb, that would be jsut silly). I hope they are reffering to the base speed in this case as well.

    4. Re:Current Athlons by packeteer · · Score: 4, Informative

      In case everyone doesn't know what "double pumped" or "DDR FSB" mean let me explain. The clock that sets how often data is transfered clicks over and over to keepo the pace. On an Athlon it transfers data twice for every click. On a Pentium 4 its 4 times a click. Right now most Athlons run at 133mhz "DDR FSB". Mine already runs at 166mhz (overclocked of course) and let me tell you its sweet. I cant wait to see everyone have access to 166 mhz FSB Athlons.

      --
      unzip; strip; touch; finger; mount; fsck; more; yes; unmount; sleep
    5. Re:Current Athlons by Yokaze · · Score: 2
      > I have no idea how a one line comment with a single fact can be moderated as 'Insightful'. I don't consider you to be a great thinker just because you can transpose a number.

      More interesting is, that it is explicitly stated in the article:

      Current Athlons come with a 266MHz bus and 256KB of secondary cache.
      --
      "Between strong and weak, between rich and poor [...], it is freedom which oppresses and the law which sets free"
    6. Re:Current Athlons by Dun+Malg · · Score: 2

      He's referring to the /. blurb which incorrectly stated "200MHz bus"

      --
      If a job's not worth doing, it's not worth doing right.
    7. Re:Current Athlons by jmorris42 · · Score: 2

      You couldn't patent dual clocks due to prior art. Motorola was doing that trick with the M6809 ages ago. Two clocks (P & Q) 90 degrees out of phase with the CPU active on each of the four phase combos.

      It allowed a nominal 1.89Mhz Tandy Color Computer 3 to often outrun a stock AT because while the AT was throwing wait states by the half dozen anytime it wanted to actually DO anything the 6809 was actually reading or writing from memory on better than half of it's cycles.

      And since the CoCo's memory system never imposed wait states, run times were 100% deterministic. Made lots of programming jobs a LOT simpler to implement. Of course I really doubt it would be possible to attain today's speeds and remain deterministic, what with all of the cacheing that now goes on. Guess times change.

      --
      Democrat delenda est
  2. Not surprising by Anonymous Coward · · Score: 4, Funny

    AMD = All Microprocessors Delayed

    1. Re:Not surprising by Jugalator · · Score: 2

      ITEPL? You need to get some sleep!

      --
      Beware: In C++, your friends can see your privates!
  3. 266 Bus by clinko · · Score: 3, Interesting

    Current Athlons have 266 bus. You can still get the older 200 bus, but it died out about a year ago. Sorted in price on pricewatch

  4. Re:Who wants to bet... by bogie · · Score: 2

    Your joking right?

    Do you think its called Wintel because they couldn't figure out how to spell AMD?

    --
    If you wanna get rich, you know that payback is a bitch
  5. Comment removed by account_deleted · · Score: 4, Insightful

    Comment removed based on user account deletion

  6. Invested in AMD. by mjuarez · · Score: 2

    Anybody here have stock in AMD? I've been long on the company for like two years now, but it never seems to finally launch the Hammer!!! I was hoping for a christmas release, but that's not gonna happen now...... my stocks will get beaten tomorrow!!! :(

    1. Re:Invested in AMD. by Skyshadow · · Score: 2
      Hold onto your AMD.

      Really, there are only two companies positioned to provide CPUs for PCs. Intel is showing more and more signs of losing their grip on bits of the market, and AMD only really needs to gain small bits of Intel's narket share to be wildly successful.

      Besides, you should never sell during a recession. Now, if you want risky products, join me and buy some Abiomed (makers of the Abicor heart). Might not work out long term, but if they work out the kinks people will pay any price...

      --
      Every year during my review, I just pray the words "slashdot.org" aren't mentioned.
    2. Re:Invested in AMD. by Perdo · · Score: 3, Interesting

      They got beaten today.

      Down 7% on Intel's 2 cent per share dividend.

      They'll get beaten again tomorrow.

      They'll get beaten at Christmas.

      They'll get beaten until Sledgehammer is released.. not Claw hammer which will have no x86-64 desktop software support right off the bat, and will have to rely solely on it's pure x86 performance.

      Microsoft shafted them on the X-box because Intel paid Microsoft 200 million to use the Pentium III. Nvidia was stuck with an unused AMD integrated chipset for X-box and Nforce was born.

      Intel will pay Microsoft to shaft them again. No x86-64 Windows XP for AMD despite AMD testifying on Microsoft's behalf in exchange for anti-trust testimony. AMD made an unenforceable agreement with Microsoft. To enforce it would perjure themselves.

      So Intel wins again.

      Until Sledgehammer arrives. Sledgehammer is a server/workstation chip and will have full support of the dominant server operating system, Linux. Microsoft must support Sledgehammer or risk losing more of their already weak server market share.

      Long after Microsoft has done the work to get Windows running on the Server, Microsoft will incorporate x86-64 support into their desktop OS.

      Probably about the same time as they support Hyper threading and SSE3 for Intel.

      --

      If voting were effective, it would be illegal by now.

    3. Re:Invested in AMD. by cheezedawg · · Score: 2

      Where is Intel losing their grip? They increased their market share by 3% this year...

      --
      "The defense of freedom requires the advance of freedom" - George W Bush
    4. Re:Invested in AMD. by mjuarez · · Score: 2

      That's very interesting..... AMD is currently about 50% of my portfolio...... NVDA has another 25% or so. I'm thinking on adding to my NVDA and AMD positions, since they're so beaten down..... I'm betting on Hammer becoming a real blockbuster for AMD, so I'm staying put with my shares... :)

    5. Re:Invested in AMD. by mjuarez · · Score: 2

      I totally agree with you..... I only think they shouldn't start behaving like Intel, delaying their products by years..... Hammer was initially slated to come out first half of 2002. :(

      What you're saying is true..... AMD only has to get double their current market share (about 40% of the market) to start raking in the millions...
      and it's wildly undervalued, in my opinion.

      I might look into this Abiomed company..... sounds interesting.... I might be a customer in the not-too-distant future! :)

    6. Re:Invested in AMD. by EastCoastSurfer · · Score: 2

      Wow! To the parent and the next parent you two sure are betting a lot on AMD and the technology sector as a whole. Having a stock be 50% of your portfolio is generally too much unless you have direct influence in the company. Even then it is usually still excesive.

      My question is why invest in either of them at all right now? The tech sector as a whole sucks and doesn't look like it is going to improve anytime soon. Intel released its forward outlook the other day and said that they don't see PC demand even picking up for Christmas. Notice they didn't say Intel procs, but PC's as a whole. With the recent news of the delay, a current EPS of -1.15, and many analysts pointing out that AMD is losing market share to Intel it would seem that there are better places to put your money right now other than in the chip makers.

      NVDA is a slightly different story, but still suffers from the same lack of PC demand that the chipmakers suffer from. Financially they look better (although there have been some questions to that which is what triggered the massive drop a while back), and they are the leader of their segment(always a good thing). Additionally, NVDA really needs the xbox to sell well. That was what made them run up like they did(when MS announced them) and probably part of their backtrace(when MS said the Xbox wasn't selling that great). Also, with the console price wars in full swing you have to think that MS is passing some of their losses back to NVDA.

      Another thing to keep in mind is that you are invested in companies that generally tech minded people end up buying from. There are many techies out of work right now (with no real end in sight), thus they don't buy any new toys.

      I guess I just fail to see how either of these companies can be a good "investment" when on the paranoia down days we have been having, you can find good companies that have a dividend yield of 5+%. Then again it is your money ;-)

    7. Re:Invested in AMD. by EastCoastSurfer · · Score: 2

      Buy low, sell high?

      I agree that you should buy low and sell high of course :) I personally look for companies that are trading lower not because they are struggling(as AMD is), but because some external factor has caused them to go down. These days where the market just tumbles down because of the color of Greenspans breifcase leaves many opportunities to buy good stock in good companies at bargain prices. Look for companies with a distressed stock price and not a distressed balance sheet -- those are the gems.

    8. Re:Invested in AMD. by Hoser+McMoose · · Score: 2

      They'll get beaten until Sledgehammer is released.. not Claw hammer which will have no x86-64 desktop software support right off the bat, and will have to rely solely on it's pure x86 performance.

      64-bit performance is unlikely to be significantly different from 32-bit performance. Actually, all else being equal, 64-bit software is SLOWER then 32-bit software (twice as much data to read/write from memory), however the ability to properly access more then 4GB of memory can and does offset that penalty for many applications. In the specific case of the AMD Hammer and x86-64, making use of the 64-bit mode is expected to boost performance by about 5%, due primarily to the fact that AMD doubled the number of registers for x86-64 over IA-32.

      Microsoft shafted them on the X-box because Intel paid Microsoft 200 million to use the Pentium III

      I don't know any details of the deal between Intel and Microsoft, and in fact, I highly doubt that ANYONE reading and posting here does, just rumors heard from rumor mills. It is clear that Intel gave MS a VERY good deal, though Intel is probably not making any money at all off this deal, especailly given that X-Box sales have been well bellow expectations and profit from this sort of chip would be driven purely by volume.

      Nvidia was stuck with an unused AMD integrated chipset for X-box and Nforce was born.

      The chipset market is a market that nVidia HAD to get into at one time or another. Integrated video chipsets have grown from being ~5% of the market to being ~60% of the market in just over 3 years time. With an integrated video chipset, nVidia's potential market was more then cut in half. That being said, the nForce seems like a good thing for both AMD and nVidia. For AMD it has given them a stable platform that OEMs trust (as indicated by HPaq using it in their business PCs, something that is VERY rare for non-Intel chipsets). For nVidia, they have a successful product that is likely close to if not the top selling chipset for AMD processors (VIA mostly owns the retail channel, but nVidia has taken almost all the OEM sales).

      Intel will pay Microsoft to shaft them again. No x86-64 Windows XP for AMD despite AMD testifying on Microsoft's behalf in exchange for anti-trust testimony

      You seem to think that somehow Intel and Microsoft LIKE one another, or that by beating down AMD it would somehow HELP Microsoft? Perhaps you even believe in the whole "Wintel" alliance?! Well I hate to break it to you, but Microsoft wants/needs a company like AMD to give Intel competition. This lowers the prices of Windows PCs as comapred to the Unix boxes, it helps increase their performance and push things forward. Helping AMD makes GOOD business sense for MS, particularly if it doesn't cost them much money. And the x86-64 port of WinXP apparently didn't cost too much money, and they have already demonstrated it on Clawhammer systems.

      Probably about the same time as they support Hyper threading and SSE3 for Intel.

      WinXP already supports Hyperthreading. Even Win2K will work with Hyperthreading, though it appears like it might have been a touch of a hack (ie the chips show up as two separate processors, not a single processor which can run two threads at once).

      As for SSE3, I'm really not quite sure where Intel is going with that, I'll probably have to read up on it a bit more. It doesn't seem to make much sense to me though, they barely got people started on writing SSE code when they released SSE2 and told people to start again. Now they're planning on doing the same thing with SSE3? I'm not even sure that SSE3 will add all that much with is actually useful.. I guess we'll just wait and see.

  7. I heard from a guy I know by Anonymous Coward · · Score: 2, Interesting

    who works at AMD, we were talking about this tuesday, that the Hammer chips will be released next year, and I told him I thought late this year. Well, looks like he was right.

  8. Re:Comment non-sense by be-fan · · Score: 2

    Because the sooner it comes out, the sooner I get to play with a 64-bit OS development on a machine that gets top performance and doesn't cost $20,000. That alone is reason enough for AMD to ship it sooner.

    --
    A deep unwavering belief is a sure sign you're missing something...
  9. Re:Comment non-sense by Paul+Komarek · · Score: 5, Insightful

    I, for one, am hoping to replace our Alphas with cpus from the AMD Hammer series. We're about to buy a bunch of P4-based machines despite the problems we've had with certain tight loops in scientific code performing 80 times slower than a similarly clocked Athlon (according to Athlon advertised "speed", not actual clock). No, I'm not exaggerating, and this has been verified independently -- the P4 cpu has some huge weak spots that really suck if you hit them. If Hammer were out and working properly, we probably wouldn't buy the P4 machines to hold us over.

    We need 64 bit machines to accomodate massive memory for our research. I'm really hoping the Hammer can provide a relatively inexpensive and *commoditized* 64 bit platform for us to work on, compared to existing 64 bit (workstation/server) platforms. And I want it yesterday. Actually, I want it last year.

    I have no idea what the editors or submitter meant, of course.

    -Paul Komarek

  10. Re:Comment non-sense by Yoweigh116 · · Score: 2, Interesting

    "Let's hope this doesn't get moved again."

    There's a damn good reason I want this to come out soon. The sooner AMD comes out with Hammer the sooner Intel has some extremely serious competition. If Hammer can stand up to its hype the P4 won't look so hot, especially if Hammer ramps well in clock speed. Strong competition = lowering of prices. Also, Athlon XPs would then be pushed into the value market. So not only would Intel be forced to drop prices on their desktop and server CPUs, but AMD's old lineup would become and absolute steal. Sounds good for the average consumer, eh? Lets hope for no more delays.

    -Yoweigh

  11. Honesty? by wray · · Score: 3, Interesting

    What is the reason for the delay? Can it really be that it's just a business decision (as they seem to say) rather than a technological problem? It seems that AMD _needs_ this jump in 64 bit computing, the sse2 registers, and boost in performance on Intel. So to me, if it is a business decision, it is a poor one.

    Everything I have seen shows that Intel is doing much better in performance and climbing. AMD claims there is no real technological reason, yet there must be. Anyone have insights? It seems that it would be prudent for AMD to issue better explanations -- how could it hurt to be honest? I want to see competition, if they are going to lag in performance, then they present no reason for people to buy. (A similarly performing Intel chip is close in price right now)

    --
    Guess what? I got a fever! And the only prescription.. is more cowbell!
    1. Re:Honesty? by nelsonal · · Score: 2

      The only reason I can think of it being a business reason is to milk the Athlon design a little longer before it goes into the value market. They would only do this if they thought Intel wasn't going to make many speed bumps, or they have silicon that ramps better than it currently overclocks. Once you finish the design work for a chip, the more you produce the lower your costs are, since you don't have to do a major revision to the design for some time, new processor roll outs are a balance between lowering fixed costs and keeping up with the competition.

      --
      Degaussing scares the bad magnetism out of the monitor and fills it with good karma.
  12. Good by Billly+Gates · · Score: 3, Informative

    A delay from palladium which will be included by default starting with the Hammer. It was probably delayed because longhorn aka drm-Windows was delayed and its needed to actually use the cyptography in the cpu.

    1. Re:Good by cheezedawg · · Score: 2

      While its true that Palladium hardware needs the OS to enter trusted mode, Longhorn is in no way needed to run Hammer or any other Palladium enabled hardware. Remember that Palladium is not involved in the boot process, and when it is enabled it runs parallel to the kernel. But we have already been over this, haven't we Billly...

      --
      "The defense of freedom requires the advance of freedom" - George W Bush
    2. Re:Good by Billly+Gates · · Score: 2
      If the user wants to use palladium to secure his/her documents then the OS has to support it. Palladium is not required for the boot process unlike TCPA but other complications or bugs could arise if palladium is enabled by default and a non supported OS. Palladium's own faq states ..." we have defined the "Palladium" initiative as a "new set of features in a forthcoming version of Windows that, when combined with new hardware and software, enable . . ." What we refer to as "Palladium" incorporates a Microsoft operating system.". So Longhorn is needed or a special version of XP in other words. Go do your homework next time cheezedawg.



      I bet it would be easy to disable it by default in the bios to boot XP or Linux. Microsoft should of picked TCPA which is more open and already has XP drivers for it by IBM.

    3. Re:Good by Billly+Gates · · Score: 5, Informative
      Oops I forgot to include this from the faq.



      Q: Can Linux, FreeBSD or another open source OS run on "Palladium" hardware?

      A: Virtually anything that runs on a Windows-based machine today will still run on a "Palladium" machine (there are some esoteric exceptions[1]). If you currently have a machine that runs both Linux and Windows, you would be able to have that same functionality on a "Palladium" machine.

      The exceptions are here



      [1] These exceptions include the following:

      1.)Some debuggers may need to be updated to work in the "Palladium" environment, but they can still work.

      2.)Some special performance tools may need to be updated.

      3.)Software that writes directly to TCPA hardware will need to be updated.

      4.)Memory scrub routines (at the hardware level) will need attention.

      5.)Third-party crash dump software may need to be updated.

      6.)BIOS mode hibernation features will need to be updated to work with "Palladium."



      Its these 6 reasons why palladium is still beta and why AMD is probably waiting before releasing Hammer.

    4. Re:Good by swillden · · Score: 2
      Q: Can Linux, FreeBSD or another open source OS run on "Palladium" hardware? A: Virtually anything that runs on a Windows-based machine...

      Since when [do] OPERATING SYSTEMS run on Windows? They are completly independent of software platform, all they need is hardware.

      Keep in mind, that FAQ was from Microsoft. I shall explain.

      From their point of view, any computer that can run Windows is a "Windows-based machine". They're aware that some oddballs insist on perverting these machines by running other experimental operating systems on them, but that doesn't change the fact that the machine was designed, built and sold to run Windows. It is, therefore, a "Windows-based machine", even if it happens to have some OSS crud running on it.

      When used properly, as God and Bill intended, it's just a "Windows machine", of course. Or, in MS common parlance, a "computer".

      --
      Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
  13. Re:Comment non-sense by Paul+Komarek · · Score: 2

    Incidently, you can get a nice new dual Alpha 21264 667 4u rackmount with 4GB ram and 18GB scsi (64 bit) for = $14,000 these days. With educational discount, you can buy a Compaq ES40 (with single cpu to start) for $20K. I have no idea what the used 21164 machines are selling for these days.

    I don't have the same motivation for 64 bit machines (I need them for cycle servers with big memory), but I'm just as anxious for a commoditized 64 bit platform to emerge.

    -Paul Komarek

  14. Re:Comment non-sense by Archfeld · · Score: 3, Interesting

    Any place I can look for some doc on that issue ?
    We are migrating from our Alphas to dual P4's and seeing a serious drop in performance that should not exist :( The fingers have all been pointed at software optimization and we are doing some heavy duty examinations but it sounds all too pat to me...

    --
    errr....umm...*whooosh* *whoosh* Is this thing on ?
  15. the other side... by tanveer1979 · · Score: 3, Informative
    Lot of posts are screaming "again, again"... but the fact is a 64 bit processor is one devil to design.
    The biggest problem with current processors is that to design such devices we *have* to use dynamic logic. Ask any VLSI design engineer.. that is no joke. Infact many multipliers and dividers have to be hand edited! So delays are expected and it does reflect upon the desigers and companiesd in any way.

    Before you ask.. I do now work for AMD, i work in another VLSI company, thats why i say.. its tough. Millions of gates thousands to be hand edited its a bitch.. but as they say the fruits of labour are sweet... and for AMD hammer is going to be the sweetest

    --
    My Aurora : http://www.youtube.com/watch?v=o91ZsGwJYyg
    FB : https://www.facebook.com/TanveersPhotography
    1. Re:the other side... by Christopher+Thomas · · Score: 2

      Lot of posts are screaming "again, again"... but the fact is a 64 bit processor is one devil to design. The biggest problem with current processors is that to design such devices we *have* to use dynamic logic. Ask any VLSI design engineer.. that is no joke. Infact many multipliers and dividers have to be hand edited!

      This is true for any processor, 64-bit, 32-bit, or otherwise. If you want the last 20%-30% of the performance, it will involve hand-optimization and an ungodly amount of work.

      Designing a 64-bit chip vs. a 32-bit chip, OTOH, is mainly just replicating elements, though you do need design tweaking for a few pieces that don't scale well.

      Re. dynamic logic, it really depends on the process you're using and what your design goals are. There are a lot of "gotchas" that you're undoubtedly already aware of that can degrade performance in a dynamic logic system, and other tradeoffs that are made when adding dynamic components to an otherwise-static system.

      As with other design choices, there's no _requirement_ to do this. You just get a performance boost for certain specialized types of structure, which can justify the headaches if that kind of structure is on your critical path.

      For a 64-bit processor that doesn't use dynamic logic (last I checked), just look at the MIPS line.

  16. Re:Prefection by Christopher+Thomas · · Score: 2

    As for the 333mhz bus, I heard somewhere that the memory bus speed isn't the bottleneck for the Athlon processors...

    Depends on what you're doing.

    The P4, especially configured expensively, has a kickass memory subsystem on the motherboard (dual-channel anything will score high on bandwidth-bound tests). The fact that the Athlon doesn't has hurt its relative benchmark results even more than the speed war has.

    I still love the Althlon, and I still avoid the P4 on (personal) principle, but a faster memory bus is a Good Thing for AMD.

    The Hammer will live or die on this too. We don't have a real-world test of how well its memory subsystem works yet. The NUMA scheme for multiprocessor systems also gives me pause (without migration/copying of non-local pages, it'll bog down like crazy under certain conditions).

    A well-performing Hammer will push AMD back into prominence. I strongly suspect that they're at least partly buying time to tune the core.

  17. Gain this, lose that by Sp4c3+C4d3t · · Score: 2, Interesting

    I'm betting they're adding Palladium. It seems likely, since these days you must make sacrifices to gain things. XP Service Pack 1 will fix a few security holes, but at the cost of your privacy. Hammer will be 64-bit and more powerful than anything you've got now, but will probably be Palladium-enabled. Or maybe I'm being a pessimist and they're not adding Palladium. Lets hope not :|

    --
    Happy New Year, it's 1984!
    1. Re:Gain this, lose that by cheezedawg · · Score: 2

      Um, AMD announced support for Palladium long before Intel did. I'm not sure if it will ship with the first generation Hammer, but it will ship eventually.

      --
      "The defense of freedom requires the advance of freedom" - George W Bush
    2. Re:Gain this, lose that by evilviper · · Score: 2

      If by "more powerful than anything you've got now" you mean "more powerful than any other AMD processor", you might be right.

      On the other hand, if not, there are a lot of processors out there now that would leave AMD and Intel's new offerings in the dust.

      --
      Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
    3. Re:Gain this, lose that by Hoser+McMoose · · Score: 2

      Actually, no there aren't.

      Take a look at SPEC CPU2000 sometime. Ok, maybe it's not the end-all-be-all of processor benchmarks, but it's the best that we've got right now. In terms of raw processing power, we get the following rankings:

      CINT2000:
      1. Intel P4 2.8GHz - 984
      2. HPaq 1.25GHz Alpha EV68 - 928
      3. AMD AthlonXP 2600+ - 839
      3. (tied) IBM Power4 1.3GHz - 839
      5. HP/Intel Itanium 2 - 810

      As you can see, AMD and Intel are doing quite well competitng with the rest of the world here, and even beating them. Add an extra 20% to the performance here, and AMD would have a VERY fast little chip on their hands.

      Now, in CFP, things look somewhat worse for the P4 and esepcially for the Athlon. This is largely due to two things. First, CFP is a very bandwidth-intensive benchmark, and bandwidth is one of those things that can easily be bought by throwing money at it. IBM threw a LOT of money at bandwidth, and it performs very well.

      The second thing with CFP is Sun's new compiler, which magically transformed the rather lackluster performance of the UltraSparc III into a real performer in CFP due almost entirely to a HUGE (8000%) gain in performance on a single sub-test. In other words, their results are not really accurately reflecting the performance of the chip relative to other chips at this time, because they will either perform quite a bit worse then their CFP numbers would suggest, or WAY better if your application happens to exactly match that one sub-test (99+ times out of 100 it would be the former).

      Still even here AMD and Intel are reasonably competative. The P4, with a score of 929 is behind only the Power4, the Itanium2 and the Alpha EV68 in CFP. Even the AthlonXP's score of 710 isn't too far off the pace, albeit it is trailing the leading score of 1356 (Itanium2) by a reasonable margin.

      As for the Hammer? It's integrated memory controller should REALLY help for CFP, particularly with regards to the Sledgehammer, with a 128-bit wide memory controller. CINT will benefit somewhat less, probably only the 20-30%, clock for clock, over what the AthlonXP (this is what AMD is saying would be typical).

      Anyway, the point I'm trying to get at here is that the AthlonXP and the P4 are actually VERY powerful chips, and really there isn't anything that is "leaving them in the dust" at this time.

  18. Comment removed by account_deleted · · Score: 2

    Comment removed based on user account deletion

  19. conspiracy theory.... by tanveer1979 · · Score: 2

    struct conspiracy theory {
    real : MS;
    int : palladium;
    int * : hammer;

    hmmm is it for integrating palladuim support!
    };//end of struct

    Sorry... couldnt help it ;-)

    --
    My Aurora : http://www.youtube.com/watch?v=o91ZsGwJYyg
    FB : https://www.facebook.com/TanveersPhotography
  20. This is very bad news... by dtjohnson · · Score: 2, Informative

    The hammer is a critical product for AMD that they would never delay unless there were *major* problems with it.

    1) AMD is currently losing huge amounts of money. The hammer would have allowed them to sell at the high-performance end of the market again where the sales prices are higher and might have helped them reduce the flow of red ink.

    2) The delay will badly hurt AMD partners such as motherboard and chipset vendors who have developed supporting products for hammer.

    3) The hammer had a potential performance lead over Intel that will be greatly eroded by the time it finally appears.

    4) Critical software development for hammer will be slowed which will slow eventual market acceptance of hammer.

    5) The delay will build momentum for Itanium.

    6) The delay will greatly reduce the pressure on Microsoft to support hammer and will give Microsoft the opportunity to also build momentum for Itanium. Depending on market conditions when the hammer finally appears, it is now even possible that Microsoft will never need to support hammer.

    7) This delay is so serious that it creates real doubts that hammer will *ever* be a viable product.

    1. Re:This is very bad news... by EastCoastSurfer · · Score: 2

      I like AMD stuff as much as the next /.er, but as a business this delay IS really bad news for them. They need this new chip so that they can break into the server market where they can actually make some money. Their other businesses(flash memory, memory, etc...) are particularly weak right now because of lack of demand. Using their numbers for the first quarter of this year taken off of their 8-k they had:

      $902,073,000 sales
      $684,000,000 from procs
      $160,000,000 from other

      That means that their processor business makes up nearly 76% of their revenue. If their proc business truly went south, AMD would be done. Since they do have a lot of assets(fabs, IP, etc) INTC or IBM or another big chip player would just buy them before they ever truly went under.

  21. Re:200 MHz? by Anonymous+DWord · · Score: 2

    LOL. Busi.
    Best... comment... ever.

    --
    "If he thinks he can hide and run from the United States and our allies, he's sorely mistaken." Bush on bin Laden
  22. Re:Who wants to bet... by foonf · · Score: 2
    Do you think its called Wintel because they couldn't figure out how to spell AMD?
    AMD has always used a close relationship with Microsoft to their marketing benefit. Back in the bad old days, right after Intel had begun to create their consumer brand identity with the whole "Intel Inside" campaign, the public reputation of clone x86 CPU makers was not very good, and non-Intel CPUs were presumed to be unreliable and have compatbility problems by many people. Correspondingly, you will find on every AMD CPU since the 486 days the text "Designed for Microsoft Windows" and a Windows logo, in order to reassure the buyer that even though they are not buying an Intel CPU, their computer is fully compatible with Industry-standard Microsoft software.
    --

    "(Man) tries to live his own life as if he were telling a story. But you have to choose: live or tell." --Sartre
  23. some people use these for work, you know by g4dget · · Score: 3, Insightful
    Why should we hope it gets released now instead of later? Do you have anything riding on it?

    Hard as that may be to believe, some people use their computers for real work. And some of those people run into that dreaded 4G limit--4G is not a lot of memory anymore these days. And many of these people would love to have the choice of a Hammer over Itanium.

  24. Delays!?!?!?!? by ackthpt · · Score: 2, Funny
    NooooooooooooOOOOOOooooOO!!!!

    The poster dies in a fit of agony...

    --

    A feeling of having made the same mistake before: Deja Foobar
  25. The real reason by PaxTech · · Score: 5, Funny

    They're waiting so they can ship the new chip bundled with Duke Nukem Forever. ;)

    --
    All movements for social change begin as missions, evolve into businesses, and end up as rackets.
  26. Re:Comment non-sense by Billly+Gates · · Score: 2
    Yep.



    This guy is quite rude, offensive and disrespectfull to others and is very arrogant. Ignore him and add him to your foe list like me. Also he has been very supportive of drm in a troll like way and feels free to flame other people who actually like their fair use rights.

  27. Comment removed by account_deleted · · Score: 2

    Comment removed based on user account deletion

  28. Re:To anyone complaing because they have old syste by pezpunk · · Score: 2, Troll

    Now, after Athlons have been out for a long, LONG time, I have an Athlon XP 2000+ sitting around, patiently waiting for me to get a motherboard and case to go along with it's RAM and 80GB hard drive.

    dude - if you've got an Athlon2000+ that's "patiently waiting around" then you must have bought the thing when it was brand new -- and paid a hell of a premium to let it sit doing nothing. same chip's probably half price now. i can almost buy your "my 550 celeron runs everything i need!" story .. but then you blow your entire "i don't need to waste money" premise by buying a state-of-the-art processor and then sitting on it until it's middle of the road? i don't get it.

    --
    i could live a little longer in this prison
  29. Re:Comment non-sense by Paul+Komarek · · Score: 4, Interesting

    I can probably send you some test code (same for anyone else who asks), but I'll have to check with my advisor first. The smallest I've made the test code is a bit under 300 lines. It's been run on Alpha 21264 EV67, Athlon C, Athlon XP, P4, and P-III, and one other Pentium-ish platform. At least two (I believe it's actually three) profilers have been run to find the bottleneck; it appears to be the floating point unit stalling for data.

    Here are the timings. Note that these are just via "time" on GNU/Linux or a wall clock on Windows (or something -- I didn't do the Windows tests).

    P4 dual Xeon 1.7GHz/gcc: 82 seconds
    P3 1000/msvc: 18 seconds
    Athlon C 600/msvc: 2 seconds
    P3 1000/msvc, using floats and sse:
    2 seconds
    Alpha 667/gcc: 2 seconds
    Athlon XP 1900+ 0.88 seconds

    I guess the Athlon's clock was closer to the P4's clock than I recalled in my original post. Either way, the slowdown on the Pentiums can be easily seen.

    -Paul Komarek

  30. we believed it was hammer time by deft · · Score: 5, Funny

    but it turns out you can't touch this.

    --

    There's nothing Intelligent about Intelligent Design.
    1. Re:we believed it was hammer time by Hadlock · · Score: 3, Funny

      and my case will be cleverly constructed entirly of parachute pants...

      --
      moox. for a new generation.
  31. Re:Comment non-sense by evilviper · · Score: 2

    May I ask why you are going to P4s instead of just getting more Alphas? You yourself said you are loosing quite a bit of performance with the P4 compared with the Alpha, but you don't say why more Alphas aren't an option.

    --
    Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
  32. Re:Prefection by cheezedawg · · Score: 2

    I think you are getting confused- memory bandwidth has been Athlon's biggest bottleneck by far for a long time now. If you don't believe me, check out the Memory bench results. So far AMD just hasn't been able to compete with the 400 and 533 MHz FSB.

    --
    "The defense of freedom requires the advance of freedom" - George W Bush
  33. Re:Comment non-sense by steveha · · Score: 2

    Any place I can look for some doc on that issue ?

    Darek Mihocka of emulators.com has written a whole bunch of stuff about the Pentium 4. He has examples of code that performs badly on Pentium 4, although I'm not sure how the most recent versions of the P4 would work on his code samples.

    http://www.emulators.com/pentium4.htm

    steveha

    --
    lf(1): it's like ls(1) but sorts filenames by extension, tersely
  34. Re:Comment non-sense by evilviper · · Score: 2

    Hell, I got a 500MHz Alpha system for $300, used.

    You are thinking about this all wrong. You seem to believe that a thousand dollar AMD chip is going to perform like a mainframe. You may be seriously disappointed when you figure out that these new 64-bit chips aren't going to make your current systems obsolete at all.

    --
    Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
  35. I was expecting this. by Anonymous Coward · · Score: 2, Informative

    I live in Austin, and have friends who work at AMD. AMD may make a great processor, but their motherboards suck because the motherboard testing department's manager tries very hard not to find any bugs. (Test stuff that you know will work. Never install an OS, just use a ghost image of preinstalled windows XP copied through the network onto the hard drive. Testing with linux is a no-no, because you actually find reproducible bugs in the hardware! We can't have that, we're a testing department...)

    At least one woman was fired for making a Linux test CD and distributing it internally around the company, against that manager's wishes. Her name's on the test CD, and it was still being used inside AMD last week, but she answered too many Linux questions for people outside her department and as such was labeled "not a team player" in the internal politics. As far as I can tell, that was the most knowledgeable linux person they had anywhere near that area.

    AMD makes great processors, but until they get a new motherboard testing department, they'll have nothing to put them in.

  36. Re:Who wants to bet... by packeteer · · Score: 2

    And of course the AthlonXP. I personally dont like the name but they are a company who is out to get money so i dont oppose them naming it this way. It just means more people are comfortable buying a budget chip.

    --
    unzip; strip; touch; finger; mount; fsck; more; yes; unmount; sleep
  37. Re:Who wants to bet... by ncc74656 · · Score: 2
    Correspondingly, you will find on every AMD CPU since the 486 days the text "Designed for Microsoft Windows" and a Windows logo, in order to reassure the buyer that even though they are not buying an Intel CPU, their computer is fully compatible with Industry-standard Microsoft software.

    FWIW, that went out with the K6-III at the latest. None of the Athlons or Durons I've installed have had the Windows logo on them in any manner--not printed, not engraved.

    --
    20 January 2017: the End of an Error.
  38. Re:Comment non-sense by Paul+Komarek · · Score: 2

    Price/performance on the Alphas is low for most of our applications, making the only Alpha selling point it's 64-bitness for big memory. Many of our apps don't need that much space and can run on x86. The few apps that do need 64-bitness will be run on our existing Alphas. If we could get dual Alpha 1GHz machines for the same price as dual P4 Xeons, we would.

    There's also the issue that finding replacement sysadmins for the Alphas isn't as easy as it is for the x86 machines. Alphas aren't much different to admin, but it can be a bit of a speedbump.

    -Paul Komarek

  39. Re:To anyone complaing because they have old syste by bogie · · Score: 3, Insightful

    I'm glad your happy with your slow Celeron, but don't assume that for the rest of us we don't need the fastest CPU possible. Time is money and the faster myself or someone else can get their work done the better. There are a ton of apps out there right now where that speed CPU is just no longer a viable solution.

    I do happen to agree with not freaking out about a processor release date. But do realize that people are excited about this cpu for a reason.

    BTW many of us here were using computers and programming before you were born. Your only 20 for Pete's sake.

    --
    If you wanna get rich, you know that payback is a bitch
  40. Re:Comment non-sense by Paul+Komarek · · Score: 2

    We've sort of considered IA-64, but don't really want to make that expensive of leap into that performance mess. We're not going to buy into a *radically* new architecture weeks after its release, either. And we can't afford to spend lots of time tuning our code for one platform or another -- portability is key. gcc is our compiler of choice because we don't have to screw with (as many) platform-specific issues.

    "But judging from the benchmarks you posted further below, I question your know-how. You compare a GCC-compiled program running on a Pentium 4 to MSVC-compiled programs running on Athlons?"

    I could snottily retort that I question your reading know-how, since msvc was for P-III and Athlon C, while gcc was for all the rest; but I won't. ;-) The Windows tests were done by a different person, on his own time, 2500 miles from me. I did the P4, Athlon XP, and Alpha tests using gcc. You can at least compare those numbers. And in the end, the compiler (in general) should not make a 200% to 8000% performance difference.

    For the tests, I used the same compilers we use for development and distrobution. We don't have time to screw with the industry's popularity contests. We do algorithmic and data structure work, aiming for 10000% speed-ups that just aren't available through compiler cleverness. The Intel compiler won't help us when compiling on Alpha, MIPS, or PA-Risc.

    -Paul Komarek

  41. Re:Comment non-sense by Paul+Komarek · · Score: 2

    If you weren't anonymous, I might take you more seriously. =-) I'm very interested in price/performance and maintainability. I think the Itaniums will lose on price/performance for a long time. Since Itanium(2) is so far from being a commodity processor (like our Alphas), we'd have to expend extra effort to maintain them and train others to maintain them.

    The bottom line is that Itaniums only seem to make sense for people who get them for free.

    -Paul Komarek

  42. Re:Comment non-sense by Paul+Komarek · · Score: 2

    Your repsonse is not particularly well-related to my post. I hope for the Hammer to replace our Alphas in providing big memory per process. I didn't claim that the P4 performs 80 times slower. In fact, the P4 Xeon 1.7GHz generally outperforms my Athlon XP 1900+; of course, it also costs more. But on a bit of code I wrote for my research, it goes 80 times slower.

    I can't spend all my research time optimizing for one silly cpu. My code is run on about 4 different cpus (only two different instruction sets, though) at present. Another cpu (with another instruction set) will be added soon. If Intel wants my code to run fast on the P4 Xeon, they can contribute to GCC; but I don't care. I'm happy to recommend that users of my code buy Athlons or Alphas.

    -Paul Komarek

  43. short rant and a question by Erpo · · Score: 5, Interesting

    Everyone always makes the same really annoying mistake when it comes to athlon fsbs. Athlon front side busses do not run at 200MHz and 266MHz. They offer bandwidth equivalent to 200MHz and 266MHz by using both sides of the clock (DDR) on 100MHz and 133MHz fsbs. All new athlons use 133MHz DDR fsbs. The hammers will support 166MHz DDR memory busses, offering performance equivalent to 333MHz SDR memory.

    However, the notion of "fsb" is a little blurred with the hammer. Hammers will be directly connected to dimm banks and have integrated memory controllers, so the speed of the fsb will no longer be a determining factor in memory bandwidth. (* see mp note below) The traditional fsb to the traditional northbridge will be replaced by a "high speed" hypertransport link to a chip that connects to the agp slot, and has another (slower) hypertransport link to what could be called the south bridge. This "south bridge" will then connect the pci bus, serial ports, hard drives, usb ports, and any other devices that need to talk to the processor or main memory.

    *What does this mean for MP systems? Well, that's actually the really cool part. By moving the memory controller onto the processor and providing communication between processors over a hypertransport link (3.2GB/sec for dual, 6.4GB/sec for quad and above), memory bandwidth actually increases as more cpus are added! This is in contrast to a normal MP system where as more cpus are added, there is increased competition for a fixed resource (main memory) which is already the bottleneck in many single processor applications.

    That's my rant on terminology. Here's the question:

    I'm no kernel hacker, and I certainly don't know anything about writing schedulers, but it seems like this would require a change in how processes are handled in hammer mp systems. In traditional mp systems, every processor has equal access to main memory. If a process gets moved from one cpu to another, there's initial overhead to do the moving, but after that it can still get to its areas in memory without any problems. On a hammer mp system, migrating a process from one cpu to another would mean that in order to access its memory it would have to reach out of its cpu's hypertransport link, into another cpu's memory controller (which may or may not be busy) and into the attached ram. Considering there would not be enough bandwidth available on the 3.2GB/sec hypertransport bus (in the case of a dp system) for both processors to reach into eachothers 166MHz DDR memory at the same time without suffering a performance hit, it seems like there would definitely be an advantage to keeping processes close to their data.

    What changes would this require to scheduling and process management code, if any? Has this already been addressed, or are there people working on it in the linux kernel?

    1. Re:short rant and a question by platypus · · Score: 2

      IANAKH, too, but I occasionaly lurk on linux-kernel. As far as I have understood, there are things to be done before it makes even sense to think about scheduling and process mgmt. policies, because there's simply not the information for the scheduler etc. to know about something like "local" or "remote" memory.

      Simple Topology API is one thread about this stuff.

    2. Re:short rant and a question by Erik+Hensema · · Score: 4, Informative

      Essentially this would be a NUMA system (non-uniform memory architecture). As far as I know Linux 2.6 will have support for these systems.

      In a real NUMA machine there would be a hierarchy of clusters of processors. Each cluster functions a bit like a traditional SMP system, but the clusters are interconnected over "low"-bandwidth busses. This makes memory accesses across clusters slower than direct accesses into the clusters' memory.

      Both the VM and the scheduler will have to know about this.

      Another point with NUMA systems is the possibility of gaps in the main memory (discontinues memory). Kernel hackers are currently working on support for that (discontigmem patch, merged in 2.5.34).

      --

      This is your sig. There are thousands more, but this one is yours.

    3. Re:short rant and a question by drinkypoo · · Score: 2

      I was under the impression that current (new) athlon chips ran a 200MHz FSB, double-clocked, to keep up with the DDR400 memory.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    4. Re:short rant and a question by gmarceau · · Score: 2

      You are not the first one to point out that 200mhz and 266mhz fsb buses are misnomers. I fail to understand the significance that you attach to it.

      Say I had two systems, the first one with one of these two-clock-side 133mhz fsb and there other with an honest-to-goodness 266mhz. Since you mention their speed would be identical, is there anyway I could tell them appart? If there is no distinguishing factor transpearing (to the programmer of the machine, say), why would anyone care to tell them appart?

      Is there a difference in stability? In extensibility? Or is my question just silly?

      --
      This post was compiled with `% gec -O`. email me if you need the sources
  44. Why Pentium IVs are slow by stewartjm · · Score: 4, Informative

    The P4's x87 FPU and x86 ALU are just plain slow compared to P3s and Athlons. Though I am surprised your code is running 82x slower. I'd expect more like 2-8x slower for compute bound code. You can get a somewhat sensationalistic overview of why it's so slow at this link.
    If you want more in-depth numbers you can compare appendix C of the Intel Pentium 4 Optimaztion Manual with chapter 29 of Agner Fog's Pentium/II/III Optimization Manual. You can see the Athlon numbers in Appendix F of AMD's Athlon Optimization Manual.
    If you want to do number crunching with Pentium 4s your best bet is to use the SSE2 instructions/registers. You should be able to get a noticable speedup by using the Intel C++ compiler and telling it to use SSE2 instructions. If you want to eek out max performance you'll have to use assembly language. Though you can probably get most of the way there using the Intel C++ Compiler's SSE2 intrinsics.
    I'm curious as to why your code is so much slower on a P4 than on an Athlon. The best way to find out would be to look at the assembly code that gcc is producing. You can do that by using gcc's -S option. If you'd like send me the C code and the output from -S and I'll see if I see anything obvious.
    I'm somewhat paranoid about posting my email address. My paranoia seems to work, as I've received no more than the occasional spam in the last few years. My email address is my slashdot user name at woh.rr.com.

    1. Re:Why Pentium IVs are slow by Paul+Komarek · · Score: 2

      If you'd like my test code, email me (my address is in my user info). We've already compared gcc's asm on Alpha, Athlon, and P4, and found nothing particularly strange. The stall seems to come from memory fetches. It's possible that blocking our matrix could really improve cache behavior, but it would be fairly painful to implement in this case.

      -Paul Komarek

    2. Re:Why Pentium IVs are slow by buysse · · Score: 2

      Aren't SSE ops lower precision? I'm guessing that the original poster was more interested in performance for scientific computing of some kind, where precision matters.

      --
      -30-
  45. They're having clock speed issues with Hammers... by Heretic2 · · Score: 5, Interesting

    You ever notice how all the Hammers are clock speed locked at 800MHz? Yea, there's a reason for that. They're having problems cranking the clock speed up. For 800MHz they're fast as hell, beating P4 with twice the frequency, but they're not gonna release them until they clock faster than current Athlons so they're trying different types of transitors and what not.

    How the hell do I know that??? Look where I live, take a guess...The birds outside my window know things.

  46. Re:Comment non-sense by MSG · · Score: 3, Interesting

    it appears to be the floating point unit stalling for data.

    Well, if it's stalling for data, your problem is probably that the P4 has a *tiny* L1 data cache compared to... uh... anything. It's only 8K, compared to the Athlons 64K. See the following URLs:
    http://www.tomshardware.com/cpu/02q2/020402 /p4_240 0-01.html
    http://www.geek.com/procspec/intel/nort hwood.htm
    http://www.geek.com/procspec/amd/k7sele ct.htm

    It's probably also worth noting that Intel does NOT list the P4 as a "server processor". The P4 is listed as a desktop or workstation processor. Only P3, Xeon, and Itanium chips are recommended for server use:
    http://www.intel.com/products/browse/process or.htm ?iid=Homepage+Find_Products_Processors&

    You might want to show that to management and reconsider your purchase of P4 equipment. Even a P3 is likely to perform better.

  47. Re:Comment non-sense by MSG · · Score: 2

    Even a P3 is likely to perform better.

    And by saying that, I don't mean to imply that I think the P3 is a good choice, (I like the Athlons :-) I just mean that if the P4 is performing like crap for your applications, then you shouldn't use that processor.

  48. is there a real difference? by Trepidity · · Score: 3, Interesting

    Something I've never seen a good explanation of -- is there performance-wise any difference between a 266 MHz clock with data transferred once per clock and a 133 MHz clock with data transferred twice per clock (despite the actual clock ticking rate of course)?

    1. Re:is there a real difference? by kimmo · · Score: 5, Informative

      Latency.

      With single data rate a new address can be sent every clock for all memory requests.

      With double data rate a new address can be send with every other "clock", but while data transmission rate stays the same. Effectively this means transferring double data for each request, while the amount of requests doesn't change.

      This isn't very serious problem, since single bytes/bus wide data aren't usually transferred, but whole cachelines of 32/64 bytes. They will generate 4/8 sequential burst requests nullifying much of the "halfclocked" address generation potential latency problems.

      Ok, so why can't the addresses be sent like the data is another question which someone else with more knowledge might explain.. Maybe it would complicate things too much since the request-answer mechanism should be pipelined to accept new requests until previous requests are served. Or maybe the physical bus has some limitations, like using the same pins for address/data, which would simply make it impossible to send new addresses simultaneously (on falling edge of clock) while receiving data.

    2. Re:is there a real difference? by megalomang · · Score: 2, Informative

      The additional latency required to synchronize the address with the rising edge rather than either edge is negligible when you consider the total amount of time required to perform the fetch from L3 or L4. Therefore there is no need to endure the more complex design to implement this.

      Most data is fetched in bursts. So there are typically 4 or more data phases per request. Consequently, there is no need for as high bandwidth for the address bus as for data.

      Plus, as another post said, it reduces the power requirements. This, combined with the fact that there are typically 4 or 8 data transfers per address is why P4 has gone to QDR buses. This way, there is one address per cycle, and an entire 4-unit burst can be completed in a cycle so the address bus could theoretically be completely saturated. Once you pass QDR (to Octal DR?), you may start requiring a higher data rate on the address bus as well for performing two 4-unit bursts per cycle.

  49. C|net is making processors? ;) by jukal · · Score: 2

    that's much bigger news than some delay - even bigger news is that it is not the first processor they produced :) "C|Net reports that their next processor (Hammer)". How low can you go :)

  50. Re:Comment non-sense by forgoil · · Score: 2

    iNTEL does have their own compiler, that they sell, I guess they rather have you running their compiler than gcc.

    That said, which version of gcc did you use? There seems to be vast differences between them (and certain companies seems to like 2.96.x which is NOT a valid gcc version. If gcc -v gives you the 2.96.x version, get a new gcc) and there are reports about speen increases in the 3.x series.

    I was mostly curious, I really would like to see that code of yours, but I realise that you wouldn't wanna give it away. Any chance you could write some dummy code that gives the same results (as far as the P4 being slow that is)?

  51. Re:Comment non-sense by Paul+Komarek · · Score: 2

    I may be able to give the testing code away; I'm waiting for a response from my advisor (GMT-5, so it will be a few more hours before he's even awake =-). Send me an email if you'd like the code.

    The P4 used gcc 3x, while the Alpha and Athlon XP used gcc 2.96. =-) If anything, this should give the P4 an advantage.

    The Athlon C and P-III results were all msvc. I don't know which version was used, because I didn't do the tests.

    -Paul Komarek

  52. Seems smart to me... by DeathPenguin · · Score: 3, Interesting

    Why should they rush the Hammer when the Itanium is failing as is? They know they can't push people to use their 64-bit capabilities, just like people didn't switch to Alphas. Squeeze every ounce of strength from the Athlon as they possibly can for now. Let Intel push the IA64 standard on everyone first to create a demand to migrate from 32-bit to 64-bit. That's where AMD plans to make their killing.

    I would imagine it would be better to release Hammer ASAP and create the 64-bit market themselves. Then again, I don't know the logistics required for such a launch, nor do I know exactly how much better, if any better, x86-64 would perform. Let's face it, not many people care about 64-bit versus 32-bit, they only know what the dork at CompUSA tells them. And if Hammers can't outscore P4's in the 32-bit apps that very short-sighted people care about, then there is really no place for Hammer in the consumer market.

    From what I've heard, mostly from internet gossip, is that AMD is having problems making Hammer scale high enough to beat the P4 in 32-bit apps, although it only requires roughly 1 Hammer MHz to beat 3 P4 MHz. I've also heard that AMD is having problems making Hammers run above 800MHz. With the expected debut of the P4 at clock speeds above 3GHz, the Hammer doesn't stand much of a chance in 32-bit apps.

    In short, don't expect to see Hammers until Intel manages to salvage the Itanic.

  53. Re:Prefection by Darren+Winsper · · Score: 2

    The memory system wasn't a problem, but it's becoming a bottleneck now. When I look at the latest benchmarks, it looks more and more like the P4 catching up to the Athlon in terms of IPC. This is probably due to the memory bandwidth holding the Athlon XP back.

  54. hammer mp: consequences for kernel arch? by mkh · · Score: 2, Interesting
    Actually the part of the kernel which would be affected most by this kind of architecture is the memory management code. While allocating pages to processes you probably want to make sure most of a process' pages "belong" to the same CPU. If they don't, nothing you do to the scheduler will gain you anything. (See below why this is stupid.)

    This isn't a particulary new requirement. You have to be careful about selecting pages for processes today even on single CPU systems to avoid cache thrashing. Because of the way first or second-level CPU-caches map to physical memory, certain memory-access pattern lead to constand reloading of the cache, making it pretty ineffective, even worse if it wasn't there in the first place. By carefully mapping physical pages to virtual memory the OS can avoid this problem. Solaris does this, I don't know about Linux. Probably.

    So, this is one new requirement for the memory management code. No problem, we just make sure all process pages belong to one particular CPU and schedule this process to this CPU only. Everything is fast and nice. Intel is doomed. Or is it? Not so fast, all this is probably a bad idea:

    We can't make sure pages on the right CPU a even available. What if they are not? Give out wrong pages? This would lead to results in running time which are not reproducable. This is really bad. It gets worse. What it the right CPU is not available because it's running some other process?

    Probably it's best to allocate evenly distributed pages (some fast, some not so fast) to processes and not schedule them special in any way.

    Easy ;)

    1. Re:hammer mp: consequences for kernel arch? by iabervon · · Score: 2

      Running times have never been particularly reproducable, and they're only going to get less so. Pretty soon, we'll have processors whose clock speed is dependant continuously on temperature; they run as fast as they can do accurately without damaging the chip. Of course, that means that your computer is faster during the winter.

      As far as existing chips go, hyperthreading also messes a lot with running times, because when you get processor time depends on when another process has cache misses.

  55. Excellent! by DarkHelmet · · Score: 2
    I can wait another few months. That'll give me the extra time I needed to figure out how to get linux running on my 64-bit Atari Jaguar.

    Anything with 64-bit doom is good enough for me :)

    --
    /^[A-Z0-9._%+-]+@[A-Z0-9.-]+\.[A-Z]{2,4}$/i
  56. Not big a deal as people think by MtViewGuy · · Score: 2

    Folks,

    While AMD works out the bugs of their Hammer line of CPU's, don't forget that AMD still has a card to play in terms of CPU competition with Intel: the Barton-core Athlon CPU due later this fall.

    Unlike the Athlon CPU core designs since the original Thunderbird-core Athlon's, the Barton-core Athlon sports a larger 512 KB L2 cache on the CPU die, which will offer dramatic performance increases, especially with memory-intensive programs. Remember, the current Thoroughbred-core Athlon CPU rated at 2600+ already has reached parity with the Intel Pentium 4 2.53 GHz part, and that's with only 256 KB of L2 cache on the CPU die and using DDR266 DDR-SDRAM! What will the Barton-core Athlon do?

  57. Good news to me, kinda. by Zathrus · · Score: 2

    If this means Barton sooner rather than later, I'm happy... although from what I've read Barton (166 MHz FSB, 512k cache) is still slated for Q1'03. Sigh.

    Why? Because I'd like to get a Barton CPU for my next computer. I'm already in the waiting game for the NV30 and (to a much lesser extent) Serial ATA, so putting a better CPU on the list isn't a big deal.

    Why not Hammer? Because I know better than to buy a first generation CPU with first generation motherboards. Barton is just a mild revision to a 4 year old CPU core, and the motherboards are now hitting their 6th generation (KT133, KT133A, KT266, KT266A, KT333, KT400).

    For those who need the speed, power, and addressibility of a 64-bit chip this announcement sucks, but for those just looking for a faster current generation chip it's not entirely bad.

  58. Re:Comment non-sense by be-fan · · Score: 2

    Err, I have no such illusions. I expect the Hammer to be about 20-30% faster at a given clock than existing chips, which is somewhat optimistic, but entirely within the expectations for the chip. I want a 64-bit machine because there are some things in OS development that are more fun when you have 64-bits of address space. Things like single-address space operating systems and persistant virtual memory stores become feasible when with 64-bits of address space while they aren't so nice to implement with only 32 bits.

    --
    A deep unwavering belief is a sure sign you're missing something...
  59. I was afraid of this -- the Osborne 2 effect by geoswan · · Score: 2
    Two weeks ago, there was a thread on the AMD 2700+. Several slashdotters were suggesting we hold of on purchasing an AMD processor until the K8 was released. I suggested that if we weren't careful, AMD might suffer in the same way Osborne Computer's sales slumped when they announced the Osborne 2.

    If too many people hold off purchasing an AMD now, because they want to wait for the newest, whiz-bang thing, then the possibility exists that AMD will not be able to finance the development of the K8 on time, or even that AMD will go bust.
  60. Re:Comment non-sense by pmz · · Score: 2

    It's probably also worth noting that Intel does NOT list the P4 as a "server processor". The P4 is listed as a desktop or workstation processor.

    Quite honestly, I think workstations tend to be more floating-point intensive than servers. For example, how many floating-point calculations does 3D CAD software do vs. Sendmail or LDAP?

    So, new PC customers should be buying "servers" for any graphics, mathematics, or scientific work. This only increases my dislike of Intel's marketing tactics.

    Perhaps Intel should market the P4 as an administrative assistant's toy, and let the engineers and scientists go to Sun, SGI, HP, and IBM for real workstations?

  61. Re:Real reason for delay by cioxx · · Score: 2

    Could somebody give some insight why AMD is after adequacy.org?

    Was it because of the humor article "Is Your Son a Computer Hacker?" where it said:

    "If your son has requested a new "processor" from a company called "AMD", this is genuine cause for alarm. AMD is a third-world based company who make inferior, "knock-off" copies of American processor chips. They use child labor extensively in their third world sweatshops, and they deliberately disable the security features that American processor makers, such as Intel, use to prevent hacking. AMD chips are never sold in stores, and you will most likely be told that you have to order them from internet sites. Do not buy this chip!"

    Any info would be appreciated, since adequacy.org redicects to kiro5in. What's AMD's beef?

  62. Hammer memory isn't so rosy. by Christopher+Thomas · · Score: 2

    By moving the memory controller onto the processor and providing communication between processors over a hypertransport link (3.2GB/sec for bandwidth actually increases as more cpus are added! This is in contrast to a normal MP system where as more cpus are added, there is increased competition for a fixed resource (main memory) which is already the bottleneck in many single processor applications.

    This is true only if the processors are running tasks with unrelated working sets (and if the data for each task is in that processor's memory).

    If you have tasks that require memory managed by another processor, you have to go through the hypertransport link and the other processor's memory controller to get it. This will be _slow_. HT is decent, but nowhere near as good as a direct connection to memory, and there _will_ be delays due to arbitration on the second chip and the various buffering stages the data transfer has to go through.

    So, for multiple processors working on a shared workload, you're screwed.

    The only way to ameliorate this is to have very smart OS-level memory management that can duplicate shared-but-not-modified pages across multiple memory banks, and both OS and processor support for update-based coherence between the banks. The hardware support for this is a bit tricky, and the OS support will be a nightmare if the OS wasn't NUMA-friendly to begin with.

    And under some cases - like tasks on multiple processors competing for access to a lock or all heavily modifying the same data page - you're screwed no matter what you do.

    So, don't rejoice yet. We'll only know for sure how well this will work when we have Hammer systems on our desks.

    1. Re:Hammer memory isn't so rosy. by Wesley+Felter · · Score: 2

      If you have tasks that require memory managed by another processor, you have to go through the hypertransport link and the other processor's memory controller to get it. This will be _slow_. HT is decent, but nowhere near as good as a direct connection to memory, and there _will_ be delays due to arbitration on the second chip and the various buffering stages the data transfer has to go through.

      So, for multiple processors working on a shared workload, you're screwed.


      From the Hammer presentations I've seen, this is not true at all. The HT link between CPUs is 6.4GB/s, which is actually faster than the direct-attached memory (~5.3GB/s). Since the HT controllers are running at >2GHz, they introduce minimal latency.

      And under some cases - like tasks on multiple processors competing for access to a lock or all heavily modifying the same data page - you're screwed no matter what you do.

      I don't think this is true either. Contention for a cache line will simply bounce the line between caches, which is much faster on Hammer than on a 400MHz shared-bus SMP.

    2. Re:Hammer memory isn't so rosy. by Christopher+Thomas · · Score: 2

      So, for multiple processors working on a shared workload, you're screwed.

      From the Hammer presentations I've seen, this is not true at all. The HT link between CPUs is 6.4GB/s, which is actually faster than the direct-attached memory (~5.3GB/s). Since the HT controllers are running at >2GHz, they introduce minimal latency.

      I'll believe this when I see it, I'm afraid. You're trying to burst, at minimum, a 64-byte cache line over a HT link 32 bits wide, over a motherboard (the HT link can't do that bridge at 2 GHz, period). So you have at least 16 bus cycles for the data alone. The full overhead is request->proc2ctl->proc2mem/cache->long data burst. Even worse than that if you have to go through more than one processor (the Hammer boards we saw pictures of had processors daisy-chained. Ugh.). Latency from this will be very substantial. Migrating/copying pages is the only way to substantially reduce this in a NUMA system.

      If I'm proven wrong when the system ships, I'll be the first to cheer, believe me.

      For workloads that have shared data but that *aren't* saturating the memory bus, a shared-bus system would be faster. For workloads that do saturate the bus, NUMA is faster even with the delay, except under pathological conditions (i.e. all data in *one* processor's memory bank, with no migration).

      And under some cases - like tasks on multiple processors competing for access to a lock or all heavily modifying the same data page - you're screwed no matter what you do.

      I don't think this is true either. Contention for a cache line will simply bounce the line between caches, which is much faster on Hammer than on a 400MHz shared-bus SMP.

      Depends on the coherence mechanism, and on how many steps through the daisy-chain you have to take for AMD's system.

      In general, performance won't be much worse with NUMA (and could easily be better), but that doesn't change the fact that with contention it'll suck hard no matter what you do. HT bandwidth doesn't save you, which was the implication of the original post that I took issue with.

  63. Re:Comment non-sense by Paul+Komarek · · Score: 2

    I haven't tried the ppc yes, but we've considered ppc machines. Vector processing could really improve parts of our algorithms. However we can't afford to spend time on things like altivec and sse, and must remain portable.

    -Paul Komarek

  64. Re:Comment non-sense by Paul+Komarek · · Score: 2

    You can't believe how much I agree with you. I'm very tired of having to wade through "server" literature just to find a good workstation. OTOH, if you think of the workstation as a "cycle server", well .... =-)

    -Paul Komarek

  65. Re:Comment non-sense by Paul+Komarek · · Score: 2

    I now have permission to send the code, but I don't have an email address for you.

    -Paul Komarek

  66. Re:FUK AMD by sheepab · · Score: 2

    Patience my troll, good things come to those who wait. Better things come to those who wait longer. Besides, not like you never delayed anything in your life. No one is perfect...

  67. Re:Comment non-sense by Paul+Komarek · · Score: 2

    I've received a lot of email about my test code. I obtained permission to distribute my test code, and have made a web page with it. For those who are interested, please see http://www.andrew.cmu.edu/~komarek/work/RobustChol eskyPerf/RobustCholeskyPerf.html.

    -Paul Komarek

  68. Hammer does not compete against Itanic by Wesley+Felter · · Score: 2

    It competes against the Pentium 4. If the K7 tops out at "3000+" as I've read, AMD is going to need Hammer to compete with 3GHz+ Pentium 4s.

  69. Wintel is a myth by Hoser+McMoose · · Score: 2

    The whole "Wintel" relationship is mostly just a huge myth. At best, Intel and Microsoft are forced-friends.

    Both Intel and Microsoft have realized for quite some time that they need each other to survive, at least for the time being, but neither of them are happy about that fact, and both are striving to change it. Microsoft has been a fairly strong supporter of AMD for some time now, while Intel was one of the first major hardware companies to jump on the Linux bandwagon.

    Think about it, Intel is pretty much the dominant force in PC hardware, being not only the primary supplier of processors, but also the #1 supplier of motherboard chipsets, video solutions, and being well up there if not #1 when it comes to audio and NICs (most of this is now integrated into motherboards). They are also the driving force behind most buses and interconnects, ie PCI and AGP (AMD's Hypertransport is a notable exception here), not to mention the fact that they defined the ATX form factor that virtually all current PCs make use of (albeit in a somewhat basterdized format for some of the big OEMs).

    Microsft, on the other hand, is the dominant force in software, having the most common operating system, office software, web browser, e-mail program, etc. etc. ad nauseum.

    The end result is that Microsoft is the only company in the PC world that Intel doesn't have a fair degree of control over (AMD fights, but ends up adopting Intel-compatible technology in the end, ie MMX/SSE/SSE2). Similarly Intel is the only company that Microsoft doesn't exert a signicant amount of control over in the PC world, though even there I'd say that MS mostly won that battle a while back.

    Anyway, long story short, MS and Intel, not so much allies as simply two companies forced together by circumstance.

  70. Re:Prefection by Christopher+Thomas · · Score: 2

    I'm concerned by this too. The central HyperTransport links in an 8-way system will be heavily contended for many workloads. (Not to mention the latency even in the absence of contention.) Something has to give.

    If I understand correctly, the Sledgehammer has an extra HT link port, which will let them add extra processors as a mesh instead of a chain. The problem still occurs, but it's less crippling for large processor counts.

    I'd have to doublecheck this feature, of course.

    They may also be banking on fewer processors being used for most applications. I'm actually kind of impressed with their four-processor demo. It's large enough to be impressive, and for shared-bus schemes to bog down, but small enough that even with randomly-distributed data, really long hops will be uncommon. I don't see them competing in the Starfire/Sunfire's market any time soon, so larger systems might not be a problem.

    It'll be fun to see what happens.

  71. Implications of NUMA by Animats · · Score: 2
    SGI has had this sort of thing for a while, but at a considerably larger physical scale: rackmount units connected by cables. So there's some experience with the concept, and how SGI deals with it in IRIX needs to be looked at.

    How does this work architecturally? Does each CPU have its own (fast) address space and can access the address spaces of other CPUs more slowly? Or is there an intermediate translation layer, so that all of memory now looks like a cache of one big address space? Or, most likely, does the OS have to manage the logical address/CPU mapping problem as if it were virtual memory? It's not quite like paging, though, because you can still read memory on the other CPU; it's just slow. So only big or frequently accessed stuff needs to be copied. I think. The VM system is going to have to be very, very clever to manage this beast well.

    Some early mainframes had memory arrangements like this. The CDC 6600, the IBM 360/90, and the UNIVAC 1110 all had both "fast" and "slow" memory. But they didn't have paged virtual memory, so the fast/slow memory thing had to be managed explicitly. Mostly, this didn't work, and as a result, mixed speed memory got a bad name.

    But now, we've probably got to go there, simply because speed of light lag and crossbar bottlenecking limits the speed of multiprocessors.

    The notion of machines with really fast memory to memory copying hardware has some interesting implications. It may lead to some new OS architectures for large systems. But I'm too tired to think this through.

  72. Re:Comment non-sense by Paul+Komarek · · Score: 2

    Thanks for that link! I'm going to try to watch that thread; there seems to be at least a handful of sharp people posting there.

    -Paul Komarek

  73. Re:NUMA by GooberToo · · Score: 2

    I read on the lkhml that SUMO is used on 8 or less CPUs while NUMA is used for 9+ CPUs.

    If that's true, I don't think the majority of us have to worry about NUMA concerns.

  74. Re:HyperThreading.... by GooberToo · · Score: 2

    Effecient use of hyper threading requires proper kernel schedular support regardless of what OS you use. Without schedular support, you can slow your self down bigtime.