Slashdot Mirror


AMD Takes Opteron To 2.4GHz

EconolineCrush writes "AMD has added a series of Opteron x50 processors to its workstation and server line that push the K8 core up to 2.4GHz. The Tech Report has tested the latest single and dual-processor Opterons against more than 20 other processors, including exotic Pentim 4 Extreme Edition chips, affordable Athlon 64s, and everything in between. Even if you have no interest in AMD's latest workstation chips, the review is worth checking out to see how two dozen of the fastest workstation and PC processors stack up in rendering, scientific computing, speech recognition, and even gaming tests."

178 of 258 comments (clear)

  1. 2.4ghz? by p00p+at+instable.net · · Score: 3, Funny

    So what is that, 4000+?

    1. Re:2.4ghz? by stephenisu · · Score: 1

      Well, they didn't want all the case modders out there who use bad EM shielding to know that that is what is screwing up thier cordless phone.

      --
      Sigs? We don't need no stinking sigs!
    2. Re:2.4ghz? by Skater · · Score: 3, Funny

      Prepare for ... ludicrous speed!

      --RJ

    3. Re:2.4ghz? by irokitt · · Score: 1

      Phhhht. My processor went plaid years ago.

      --
      If my answers frighten you, stop asking scary questions.
  2. The Conclusion... by Mz6 · · Score: 5, Informative

    From the article to save everyone the 16 pages of boring charts and graphs.. Conclusions "If I were building (or, implausibly perhaps, buying) my ultimate workstation right now, I'd want a pair of Opteron 250s beating at the heart of it. The benchmarks speak volumes. For single-processor systems, the Opteron 150 looks like the fastest x86 CPU on the planet. In a multiprocessor configuration, the Opteron 250 scales up very well, even without the benefit of an optimal memory configuration, a NUMA-aware OS, or 64-bit extensions. By contrast, Intel's dual Xeons are a little bit disappointing. They perform relatively well in CPU-bound apps like 3D rendering programs, which are also largely well optimized for SSE2. But in memory-bound applications where dual Xeons ought to do well, like video encoding, the Xeons' slow bus and RAM hold them back. One has to wonder what Intel is hoping to accomplish by saddling its workstation-class processors with older, slower technology. Even a single Pentium 4 benefits greatly from additional bus and memory bandwidth. Surely a pair of Xeons on shared bus ought to have this same advantage. Intel's apparent willingness to forego such enhancements in favor of adding ever-larger on-chip caches to the Xeon is puzzling"

    --
    Hmmm.
    1. Re:The Conclusion... by Timesprout · · Score: 4, Insightful

      From the article to save everyone the 16 pages of boring charts and graphs

      Thanks for the summary but can I just say I appreciate the level of detail and information provided. Way too many 'benchmarks' these days, especially those dealing with language performance are from some loser with their crappy home PC which they usually dont even know how to configure properly.

      Its a relief to see a benchmark from someone competent and aware of the various factors affecting the results obtained.

      --
      Do not try to read the dupe, thats impossible. Instead, only try to realize the truth
      What truth?
      There is no dupe
    2. Re:The Conclusion... by ishark · · Score: 2, Interesting

      Thanks for the summary. I quickly browsed the article looking at some graphs, and I'm suprised by the bad performance of the Athlon 64s compared to the Athlon XPs in many of the tests.... Is any of those programs running in 64 bit mode, or it's just a test of 32-bit applications running on 64-bit CPUS?

    3. Re:The Conclusion... by bhtooefr · · Score: 2, Informative

      BTW, the 150 is a top-locked AFX53 (or, the AFX53 is an unlocked 150). Also, EconolineCrush's comment about the K8 core just now hitting 2.4GHz is wrong - the FX53's been out about a month, I think.

    4. Re:The Conclusion... by Jeff+DeMaagd · · Score: 4, Interesting

      The reason is that a three-drop bus used for Xeon DP (533MHz bus), five-drop for Xeon MP (400MHz bus), can't operate as fast signalling-wise as a point-to-point bus used for Pentium 4 and all Athlon systems, 1 and 2 processor. Terminmation was just too difficult, I think. Before Hypertransport, the wiring for multiprocessing with only a point-to-point bus was rediculously expensive, particularly on the chip that connects the CPUs to the rest of the system.

      AMD got a little unconventional and this time it paid off on Opteron. It didn't work so well with the Athlon MP because of this wiring problem, too many wires, too expensive of a core chip, it was 1000+ pins when 600 pins was thought to be expensive.

    5. Re:The Conclusion... by Uncle_Al · · Score: 1

      As far as I could gather, they were running windows XP (normal)...so 32bit it is..

      ..the dual opterons where also running with the memory only on one of the processors...

    6. Re:The Conclusion... by Anonymous Coward · · Score: 1, Informative

      The 64 bit xeons due out within a month or so have 800MHz FSB at last. Still dont have the serious memory bandwidth of real dual opteron boards (as opposed to the one tested here with only memory on one CPU).

      I get v. annoyed by all these crappy benchmarks under 32 bit emulation though. Linux gives you 64 bit and NUMA aware for amd64 already... they could at least try it. Much faster in 64 bit mode for many apps in my experience.

  3. AMD are back by RoderickMcDougall · · Score: 5, Interesting

    They were lagging there for a while but the benchmarks depict a good story. Looks like the opteron is going to be yet another AMD chip that is great for gaming (and most other things). Hopefully a cheaper price than the p4's will really contribute to yet another dominating year for AMD.

    1. Re:AMD are back by dealsites · · Score: 4, Insightful

      I agree. AMD has some really great products. We always hear about Intel's huge R&D budget. I'm not sure how much of that is alloted to processor design, but it doesn't seem like they've been able to outrun AMD. AMD might not always have the fastest chips at any given moment, but they are always close behind. Since I don't buy bleeding edge equipment due to the high cost premiums, AMD is always a solid choice for me.

      --
      New deal processing engine online: http://www.dealsites.net/livedeals.html

    2. Re:AMD are back by Deliveranc3 · · Score: 1

      AMD processor design is run in conjunction with Sun and Apple and several other companies. Basically they have several sets of labs all trying diffrent things.

      Hence, hypertransport + On Chip Memory + efficient design all comming at once as opposed to intels slow and steady approach.

      $0.1768 (hey I live in Canada.)

    3. Re:AMD are back by hrieke · · Score: 1

      And that has nothing to do with the fact they AMD hired most of the Alpha design team when DEC stopped do their chip development?

      --
      III.IIVIVIXIIVIVIIIVVIIIIXVIIIXIIIIIIIIVIIIIVVIIIV IIVIIIIIIVIII...
    4. Re:AMD are back by Gr8Apes · · Score: 2, Insightful

      I think that's an awesome strategic move! AMD had the insight to see that hey, here's a group of highly technical people that have some great ideas, and gee, we can hire them all.

      Would rather have had AMD go "hmm, naaah, we don't need to hire guys with really creative and proven ideas, let's go reinvent the wheel"?

      Thanks to their insight, hypertransport did not go the way of the dung heap, and superior processors design resulting in better performance, especially in multi-processor machines have resulted that are actually being accepted in the market place. They are also causing the former obvious monopoly to take notice. Note that the P4 will be no more, and the PIII core is coming back. Now that's eating some crow. If not for AMD, how likely would the retirement of the P4 have been in this time frame? I doubt it would have happened, because Intel would not have had to push their processors much and could probably have coasted for another 2 or 3 years with this chip.

      --
      The cesspool just got a check and balance.
  4. 2.4 Rates as around PR3900+ by �nertia · · Score: 5, Informative

    I have been running my Opteron 248 at 2400Mhz. Sisoft seems to equate this to a PR rating of 3900+. I have no idea how it calculates this so please take that with a measure of salt.

    --

    AEnertia
    Witty, tag line goes here

  5. I don't think it is puzzling at all by peterdaly · · Score: 5, Interesting

    Intel's apparent willingness to forego such enhancements in favor of adding ever-larger on-chip caches to the Xeon is puzzling"

    Why is it puzzling? In their historic "Intel Inside" world, they were basically competing against themselves. Adding a bigger cache is not only easy, but a cheap way to rake in more cash without doing much R&D work.

    It's not until recently that AMD has starting "schooling them" on what improvement really means. Just look at how Intel is going to use the AMD x86-64 method in the upcoming Intel 64bit platform. And now "If I were building (or, implausibly perhaps, buying) my ultimate workstation right now, I'd want a pair of Opteron 250s beating at the heart of it. The benchmarks speak volumes. For single-processor systems, the Opteron 150 looks like the fastest x86 CPU on the planet..." And this is at much lower mhz!

    I believe Intel had thought they had reached monopoly status, which really they had, and the culture had become complacent. This did not happen at the underdog AMD, who has recently been able to quickly leapfrog Intel's offerings.

    -Pete

    1. Re:I don't think it is puzzling at all by Yartrebo · · Score: 5, Insightful

      Adding a bigger cache is not only easy, but a cheap way to rake in more cash without doing much R&D work.

      It's might be easy, but it isn't cheap to add more cache. Cache accounts for something like 50% of the die surface of a modern chip, and a larger die means a lower yield and less chips per wafer.

    2. Re:I don't think it is puzzling at all by dpilot · · Score: 1

      Not necessarily so.

      L2 cache is also memory, and any memory of that size will have redundant elements. Therefore, even though it's a lot of area, it's fixable area, and will yield higher than an equivalent area of logic. Plus, though SRAM design may not be 'easy', expanding a cache is one of the easier things you can do to enhance performance.

      For some problems, cache will never be big enough, for others it will. Unfortunately for the P4, it's architected as a bandwidth hog, and the core excels at streaming media processing - precisely where a bigger cache tends to fall down.

      --
      The living have better things to do than to continue hating the dead.
    3. Re:I don't think it is puzzling at all by ivan256 · · Score: 2, Interesting

      The sentence before your quoted line there speaks volumes to the issue.

      Surely a pair of Xeons on shared bus ought to have this same advantage.

      It's way easier to ramp up the bus speed for a single processor, since it only has to interact with one other device. It's considerably harder to increase the speed when there are three devices on the bus instead of only two. Since the Opteron uses point to point connections they don't have this same problem. In that sense it's not really puzzling at all. They probably just can't get it to work.

    4. Re:I don't think it is puzzling at all by Vaystrem · · Score: 5, Insightful

      "Why is it puzzling? In their historic "Intel Inside" world, they were basically competing against themselves. Adding a bigger cache is not only easy, but a cheap way to rake in more cash without doing much R&D work."

      You are forgetting a key deficiency of the P4 "netburst" architecture. Its incredibly long pipeline which makes it very susceptable to cache misses. Therefore the larger the L2 cache the less of a performance hit the processor will take if it misses an instruction or two.

      It is possible that adding a bigger cache is 'cheap' but if that were the case we'd see a dramatic reduction in the price of P4EEs as they are getting schooled by AMDs. L2 Cache is not cheap to implement. And significantly adds to the manufacturing cost of the processor.

      Additionally the lack of a FSB upgrade on the Xeons is troubling, but that apparantly is coming later this year, and this may reduce the advantage of the Opteron's to SOME degree. However, in the current architecture the Xeon's FSB bandwidth will always be shared - while the Opteron's get dedicated bandwidth for every processor. This is really the most remarkable advantage of 'Hammer' family of AMD CPUs over the Netburst generation of P4s/Xeons.

      "I believe Intel had thought they had reached monopoly status, which really they had, and the culture had become complacent. This did not happen at the underdog AMD, who has recently been able to quickly leapfrog Intel's offerings."

      Intel put a lot of money and R&D into a product line (P4 NetBurst) and honestly - even with AMD making inroads - they still do not have that large a share of the CPU market. Intel has however observed their lead eroding and have canned Tejas - successor to the Prescott. So Intel is able to step up and make the big changes even when it has sacrificed large amounts of R&D money.

      Intel to formally confirm Tejas canned
      Intel may have canned Tejas...

    5. Re:I don't think it is puzzling at all by Michael+Spencer+Jr. · · Score: 2, Interesting

      I hate to sound like I'm being contrary, but I don't really know enough about the subject:

      You are forgetting a key deficiency of the P4 "netburst" architecture. Its incredibly long pipeline which makes it very susceptable to cache misses. Therefore the larger the L2 cache the less of a performance hit the processor will take if it misses an instruction or two.

      I just finished a Computer Architecture class at the local university. While I'll probably forget 90% of what we learned in that class in another year, I'll ask while it's fresh on my mind:

      What does a long pipeline have to do with the cache hit/miss ratio?

      We learned about some hypothetical five-stage-pipeline CPU in class, which is childs' play compared to the superpipelined monsters of today. However, if the same concepts still hold, a longer pipeline just increases the stall penalty.

      For those who haven't yet had their heads pumped full of Computer Architecture trivia, I'll recap what little I learned in class, so the question makes sense:

      A CPU is like a big assembly line. Its job is to read a bunch of instructions and execute them in order as they come down the assembly line. In an ideal world, with a program that never loops and never branches, it works JUST LIKE an assembly line, munching on instruction after instruction.

      CPUs operate at a clock speed, and receive a clock pulse at regular intervals. They are supposed to be able to complete whatever work they're working on each clock cycle, so a really simple one-stage CPU would need to have a clock speed slow enough that any instruction can be completed in that length of time.

      People figured out that instructions can be split into little pieces, such that these little pieces are each simpler than the whole instruction. That lets them build faster but more complex pipelined CPUs. Each pipeline stage might have some work to work on, but all pipeline stages can work at the same time.

      So this means that if the pipeline is full of instructions, and every instruction uses every stage, then the CPU is performing one instruction per clock cycle. This is better than before though: these clock cycles are tiny because they just have to be big enough for these tiny fractions of an instruction. So we get the speed benefit of quick clockcycles, but we're still performing a full instruction each cycle. That's something like a 5x speedup if you have five pipeline stages!

      It doesn't always really work this way though. See, instructions can depend on each other, and that causes problems. There can be dependencies like Read-After-Write, where instruction 100 does some math and puts the result in a register, like A, and instruction 101 uses the result (in A) in its own calculation. Normally that would be fine, but a pipelined CPU tries to do things at the same time.

      For CPUs as simple as we talked about in class, there are two solutions: stalling and forwarding.

      In forwarding, the CPU is smart and looks ahead and figures out "this instruction needs something that the previous instruction is providing", and just short-circuits the whole formal writing and reading process, and just kinda passes the answer under the table between pipeline stages. "Psst, hey M-stage, this is E-stage. I've got the answer to A if you want it..."

      In stalling, the CPU realizes it NEEDS the answer to one instruction before it can do the next, so it starts wasting work units. Stages start getting commands saying "do nothing", which wastes CPU cycles. So in the example above, where instruction 101 needs something instruction 100 is still creating...suppose instruction 100 is multiplying two 256-bit floating point numbers together. Instruction 100 is going to take TONS of time to finish, so instruction 101 just gets stuck at the decode pipeline stage, sitting there tapping its feet and executing an "are we there yet" check every clock cycle. The rest of the pipeline goes unused.

      For my next trick, I'll tie

    6. Re:I don't think it is puzzling at all by boots@work · · Score: 1

      I think he meant that deeply pipelined machines suffer a worse penalty when they get a cache miss. So it's worth making both L1 and L2 as large and fast as you can. It doesn't directly effect the hit ratio as far as I know.

    7. Re:I don't think it is puzzling at all by rainman_bc · · Score: 1

      Why is it puzzling? In their historic "Intel Inside" world


      I think this now serves as a warning instead of a reassurance :)

      Intel Inside: Less value for your $$

      Way to go AMD! Since the Athlon came out, they sure have come a long way! They've gone from being a budget proc to a real competitor. Wonder where our processors would be without the competition.
      --
      09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0
  6. Glad to see AMD coming to the party. by WordODD · · Score: 4, Insightful

    I work for a medium sized school divison and this year happens to be the year when my school will get new equipment. When the meetings about what to buy have occured every single time someone has mentioned getting AMD chips instead of Intel those in managerial positions have been quick to say, " No, AMD chips are slow and run very hot. They wouldn't be a good choice for what we are looking for." Now this insight is coming from people who..

    A. Are mainly concerned about the bottom line as far as price goes.
    Which makes zero sense being AMD chips are more then competitivly priced compared to Intel.

    B. Are supposedly in the know about technology.
    Which is obviously not the case as many of them still think AMDs have the same cooling problems they did 5 years ago.
    These chips are cheaper now then their Intel conterparts and from my experience run at the same speed if not faster. AMD is finally getting on the ball as far as putting the clock speed measured in Ghz to provide direct comparission which really needed to be done in order to compete. Combined with their dedication to inovation, i.e. the 64 bit processor that Intel has still yet to bring to market make me really suppport what the company is trying to do. I really hope to see more reviews like this that I can pass on to those in charge in hopes of getting away from an Intel only environment.

    Just to clarify I do not hate Intel I just think that between the two there Intel does not always win outright and AMD should be considered before any purchases are ever made.

    --
    Please do not let scientific accuracy interfere with the intended humourous/interesting/insightful value of this comment
    1. Re:Glad to see AMD coming to the party. by Deliveranc3 · · Score: 1

      Well I'm not a senior manager but my philosophy at this point would be something like.

      "You will definitly be fired for buying Intel"

      But perhaps word just hasn't hit the market yet.

    2. Re:Glad to see AMD coming to the party. by dave420 · · Score: 1

      Seeing as most /.ers still say that windows crashes all the time and is rubbish, your point about people speaking out of their asses might go over a few peoples' heads :-P

  7. OK so it has a nice metal cap by Timesprout · · Score: 1, Funny

    but where are the 940 pins? I count only 939.

    --
    Do not try to read the dupe, thats impossible. Instead, only try to realize the truth
    What truth?
    There is no dupe
    1. Re:OK so it has a nice metal cap by mobby_6kl · · Score: 1

      did you count like "0, 1, 2,..." ??
      The last pin might be #939, but there are(at least should be) 940 total pins.

    2. Re:OK so it has a nice metal cap by Ianoo · · Score: 1

      I know the parent was trying to be funny, but if people are interested, just take a look at the grid on the bottom of the chip in the article:

      31 rows * 31 cols = 961

      Pins missing at the four corners total 13 (3 at each corner plus an extra keying pin at the bottom right). There are 4 'gaps' of 2 pins missing within the grid itself. This means:

      961 - 13 - 8 = 940 pins

    3. Re:OK so it has a nice metal cap by irokitt · · Score: 1

      Your geek pass is hereby revoked. You forgot to start counting at zero!

      --
      If my answers frighten you, stop asking scary questions.
    4. Re:OK so it has a nice metal cap by Junta · · Score: 1

      No, just the opposite, he *was* counting from zero, hence the 939... Counting from 1 instead of zero as an error would give too high a number.

      --
      XML is like violence. If it doesn't solve the problem, use more.
  8. Compiling by IceFox · · Score: 2, Insightful

    Has anyone done any compiling tests? If so PLEASE share! -Benjamin Meyer

    --
    Do you changes clothes while making the "chee-chee-cha-cha-choh" transformation sound?
    1. Re:Compiling by Illissius · · Score: 3, Informative

      AnandTech usually does them in their processor reviews, lemme dig one up.
      Here's one, for example.
      (Of note, the Athlon FX-51 and -53 are identical to Opteron 148 and 150 processors, respectively. The Athlon 64s are similar as well, difference is they use a different socket, have only single-channel memory controllers, and use unbuffered/unregistered memory.)

      Basically, the Hammers are godlike at compilation.
      The lowest-rated (at the time; a 2800+ has since been released) A64 3000+ beats the fastest P4 3.4GHz Extreme Edition.

      --
      Work is punishment for failing to procrastinate effectively.
    2. Re:Compiling by vicotnik · · Score: 1

      Informative? If you look at the test you linked it doesn't seem like the A64 3000+ beats the P4 3.4EE in general, in fact it looks like it's slower in most tests.

    3. Re:Compiling by vicotnik · · Score: 1

      It's just a compilation of a single program with a single compiler in a test where it is not faster in general.

    4. Re:Compiling by KirkH · · Score: 2, Informative

      He was addressing the compilation test specifically, which is the one he linked to and the one where the slowest Athlon does indeed beat out the fastest P4.

      So, yes, I would say his post was informative.

    5. Re:Compiling by vicotnik · · Score: 1

      Yes, I missed that he was replying to a question about compilation...I thought he just picked one good score in a test and stated that it showed that it was extremely good in that area when it could have been specific for that test (since it wasn't better in general). It would be interesting to see more compiler related tests to determine how good it really is in that area. Anyway, sorry about about the lazy reading.

  9. Waste of time... by Anonymous Coward · · Score: 5, Insightful

    Can somebody please benchmark a dual AMD opteron against a dual PPC 970 (MAC G5), using Linux in 64bit mode. What is with all these kids benchmarking opterons in 32bit mode?

    1. Re:Waste of time... by adzoox · · Score: 1

      I agree - not everyone who wants a computer at an ad agency wants a Mac and not everyone who owns a Mac wants/needs/can afford another.

      I wish all Intel/AMD chip benchmarks would include how they rate against the PowerPC.

      --
      Yell & scream & rant & rave... it's no use... you need a shaaaave ~ Bugs Bunny
    2. Re:Waste of time... by ValourX · · Score: 3, Informative

      64-bit vs. 32 bit using FreeBSD

      Don't you ever read the BSD section?

      -Jem

    3. Re:Waste of time... by DarkSarin · · Score: 1

      I suspect that the real reason that this does not occur is because most review type sites are a) not using linux at all, b) cater to a specific audience and c) want to use all the same test suites and benchies that everyone else does, and therefore can't use linux.

      Personally, I think its a huge mistake. IF I had the money to start doing these types of reviews, that's exactly what I would do...use linux and bench against ppc as well as x86.

      The reason is that linux is multi-architectural, unlike windows, and could give some added insight to the tech field. Furthermore, we could start seeing some tests of 64-bit goodness, and really see how these procs are doing. Even better, we could throw in some other chips, such as the UltraSparc and the Alpha to get a really cool look at how they stack up.

      Unfortunately, I don't have the cash. I would take donations, and/or equipment, however...

      --
      "We don't know what we are doing, but we are doing it very carefully,..." Wherry, R.J. Personnel Psychology (1995)
    4. Re:Waste of time... by phoxix · · Score: 2, Interesting

      Can somebody please benchmark a dual AMD opteron against a dual PPC 970 (MAC G5)

      Not so fast, a significant problem in such a comparison is that gcc has *much* better support for x86-64 than it does for PPC64. If there was even a chance that a dual PPC970 machine was faster than a dual x86-64 machine, the likes of Yellow Dog, and Momentum Computer would have been all over it.

      Sunny Dubey

    5. Re:Waste of time... by Jeff+DeMaagd · · Score: 1

      That depends on how well the software fits the architecture, how well optimized GCC is, etc. It would also depend on testers getting G5s and the other systems.

    6. Re:Waste of time... by dave_f1m · · Score: 1

      Sounds perfectly fair to me. I don't care which machine is theoretically faster, I don't care which does 'X' faster, when 'X' is something I don't do. I primarily run Linux on the workstations, and the question I would like answered is, which is faster for me? If the PPC is a faster chip, but anything I run on it is slower, then I don't really care if the chip is 'faster', it would a slower platform. Yes, some people would use it to compare just the chips, but it would not be a 'waste of time'.

    7. Re:Waste of time... by joib · · Score: 1


      Not so fast, a significant problem in such a comparison is that gcc has *much* better support for x86-64 than it does for PPC64.


      Well, lots of reviews trashing ppc64 sounds like a good incentive for apple to help out the gcc project then. BTW, doesn't apple use gcc as the system compiler for OS X? In that case, you can't even dismiss gcc performance as only something those dirty GNU/hippies care about.

    8. Re:Waste of time... by ValourX · · Score: 1

      It would be better to wait for 5.3 before trying an SMP comparison. It's not too far off. I just need to get an SMP system for testing... shouldn't be too hard at this point.

      I could also test with GNU/Linux. SuSE makes a fine testing platform for 64 vs. 32.

      -Jem

    9. Re:Waste of time... by Wesley+Felter · · Score: 1

      So use XLC on the 970 and the PathScale compiler on the Opteron.

    10. Re:Waste of time... by JBv · · Score: 1

      Well, IMHO, bechmarking with the same gcc would give valuable information for the number cruncher in me.

      I couldn't care less about the maximum CPU performance, but I do care a lot about how fast my programs run on each machine.

      Since gcc comes with linux and macosx, in my view, it would be a very pertinent test.

    11. Re:Waste of time... by troc · · Score: 1

      Hmm, seems BSD supporters have no sense of humour either..... :)

      Troc

      --
      Troc's dubious podcast and blog: http://www.trocnet.net
  10. Re:Huh? Pentim 4 Extreme by AgntOrnge · · Score: 1

    I'll refrain from mentioning how you are horribly off-topic but at any rate, give this a try for answers somtime. www.google.com In short the Extreme editions are a Northwood core with 2MB of L3 cache.

  11. A word of caution by GigsVT · · Score: 5, Informative

    I had an AMD64 chip with the heat spreader.

    I went to take the heat sink off the other day, and the vacuum that formed between the heat spreader and heat sink caused the chip to get yanked right out of the closed ZIF socket when I tried to get the heat sink off.

    Then, after reinstalling the chip, apparently the heat spreader has become disconnected from the core internally, because the CMOS reports rising temperature up to 120C, but even the heat spreader isn't warm if I turn the system off and get the heat sink off again.

    So be very careful. It takes about 10 minutes to take the heat sink off the heat spreader if you used a coating of grease that covers the whole top of the chip, even if you used a thin coat. You have to wiggle the heat sink and gently pull up for quite a while before that vacuum is broken. It doesn't help that the heat sink design makes it impossible to see the chip or slide the heat sink to the side.

    And be aware that it doesn't take a whole lot of force to yank the chip right from the ZIF, possibly damaging things in the process.

    --
    I've had enough abrasive sigs. Kittens are cute and fuzzy.
    1. Re:A word of caution by GigsVT · · Score: 1

      What?

      I can't believe I'm being modded down as troll, and getting silly replies like yours.

      This really happened to me. I don't buy Intel chips anymore, and haven't for years, but I'm not too happy about having to replace a chip that I bought a few months ago, probably out of my own pocket since newegg won't take an OEM CPU back that is over 30 days out, and they claim to not take any back that have physical damage either (though I don't know if this would count, I don't see how a design flaw is my fault).

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    2. Re:A word of caution by MoronGames · · Score: 5, Informative

      The problem is that you're using too much thermal compound. Generally, you use no more than an amount the size of a grain of rice, and spread it out to cover the entire heatspreader, and the bottom of the heatsink. Not only will this give a lower chance of damaging something, but it will also get better heat transfer and drop your temperatures.

      --
      hey!
    3. Re:A word of caution by Anonymous Coward · · Score: 1, Informative

      Even that is too much.

      If your CPU contact area and heatsink have a mirror finish then you really don't even need heatsink compound. Of course it's very difficult to make them stay together without using something so the compound is good for that.

      But you only need a thin film. And I mean thin . As thin as you can possibly make it.

      The goal is metal-to-metal contact which provides the best heat transfer, the heatsink compound is only there to fill in any tiny imperfections and hold the pieces together. The best heat transfer is metal-to-metal though.

    4. Re:A word of caution by Callitrax · · Score: 1

      I got a new heat sink the other day and it had a little gray patch stuck on it right where the processor would touch the heat sink.

      This is a "Thermal pad" - meant to doe the job of a paste and/or good lapping but in general they suck. You are best off removing it completely and using the grease.

    5. Re:A word of caution by necro2607 · · Score: 1

      This is why you never take the heatsink off while the CPU is still in your motherboard... I thought everyone knew that?

    6. Re:A word of caution by GigsVT · · Score: 1

      It's impossible to do it any other way, with AMD64 at least (which has a shroud around the base of the heatsink), and most chips for the last 5 years... the heatsink overhangs the ZIF lever.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    7. Re:A word of caution by clacke · · Score: 2, Funny

      And be aware that it doesn't take a whole lot of force to yank the chip right from the ZIF, possibly damaging things in the process.


      I see. So it doubles as a ZEF, then?
    8. Re:A word of caution by necro2607 · · Score: 1

      Ahh I wasn't aware of that with the AMD 64-bit CPUs. I've got a 32-bit AMD CPU which is easily detatched from the motherboard with the heatsink attached. Actually... nevermind. I have no clue what I was talking about. The heatsink *has* to be removed to even get the CPU off, considering the heatsink clips onto the socket on the motherboard... I have no idea why I thought that the heatsink was supposed to somehow stay on... hmm, maybe I should try and get more than 4 hours of sleep a night...

    9. Re:A word of caution by delus10n0 · · Score: 2, Insightful

      Guess next time you'll buy RETAIL instead of OEM?

      --
      Not All Who Wander Are Lost
    10. Re:A word of caution by GigsVT · · Score: 1

      It might be worth the warantee.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    11. Re:A word of caution by Anarke_Incarnate · · Score: 2, Informative

      Mirror finishes are bad. You need something with minute pockets (because even your mirror finish will have them) to retain the thermal compound. The purpose of lapping is not to cause the surface to be perfectly smooth, but rather to make the surfaces flat and allow for them to rest on each other so as to not have any gaps due to uneven layers.

    12. Re:A word of caution by Strange+Ranger · · Score: 1


      If you run the CPU very hot for awhile (e.g. play a game) the goo will soften quite a bit and the heatsink will come off pretty easily. Or if it really is a vacuum, the heat of the CPU will expand everything including the air and make the vacuum much weaker or gone altogether. Just run it hot for 10 minutes then shut right down and do the replacement while it's still pretty warm.

      I can attest this works nicely for P4's. Before learning this trick I got quite good at straightening P4 pins. groan.

      --

      Operator, give me the number for 911!
    13. Re:A word of caution by GigsVT · · Score: 1

      Oh, it really was a vacuum. Once I got it off and cleaned it, and put a tiny amount of grease back on, it formed the vacuum again instantly, and required several minutes of coaxing to come off again.

      I don't think it's air trapped in there per se, it was warm when I originally took it off. You don't need air bubbles to have a vacuum.

      Have you ever seen a machinists gauge blocks? They are very polished and flat metal blocks. If you stick two of them together, the lack of air between them actually makes them stick together. I think it's a similar effect here. If air can't get in, then it's a vacuum. :)

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    14. Re:A word of caution by TheLink · · Score: 1

      I doubt you are right. The thermal conductivity of the thermal compound is much worse than the two surfaces. The compound's thermal conductivity is better than air, which is why you use the compound, but if you get a mirror finish, then you have a lot less air and compound between the two surfaces on average.

      It's just things are designed so that a mirror surface isn't necessary - there's enough margin for how most people use it.

      --
  12. AMD's Cool 'n Quiet by niko9 · · Score: 4, Interesting

    ...the Opteron 150 looks like the fastest x86 CPU on the planet.

    I know I might be nitpicking here, but I really wish the Opteron series of chips incorporated AMD's Cool 'n Quite technology.
    From what I read on their website, with a supporting motherboard and driver (2.6.5 has a native driver) the Athlon 64 can scale down to 800Mhz, cool enough for the system to shut the HSF and case fans completely offoff.

    One demo I saw online had a Athlon 64 SFF computer playing a DVD while the AMD cool 'n quite app was shoing the the CPU at 80hz and the system was totally silent.

    Coudn't server rooms benfit from the reduced electricuty bill also?

    1. Re:AMD's Cool 'n Quiet by thorgil · · Score: 1

      ...ite app was shoing the the CPU at 80hz and the system was totally..

      80Hz ???? ... man thats slow.

      --
      Warning: This sig contains a small bug. ==> *
    2. Re:AMD's Cool 'n Quiet by niko9 · · Score: 1

      Correction. That would be 800Mhz.

      Sorry for the typo.

      Think of the power savings for those who like to leave their PC's on for filesharing or whatnot.
      Would also make for a better nights sleep, if the ATX case is in your bedroom.

    3. Re:AMD's Cool 'n Quiet by -tji · · Score: 1

      Based on the reports I have read, the Opterons can't do CnQ because of the ECC registered memory they use. The registered memory cannot handle the voltage variations that CnQ uses.

      This is the reason I went with the A64. In trying to make the quietest system I could, the CnQ feature is great.

    4. Re:AMD's Cool 'n Quiet by Wesley+Felter · · Score: 1

      Yes, servers can benefit from dynamic voltage/frequency scaling (aka Cool'n'quiet). My colleagues wrote several papers on this topic.

    5. Re:AMD's Cool 'n Quiet by Dravik · · Score: 1

      Sleep!! without the comforting whirring of my computer fans? The pain, the agony! I can't stand to think of such a thing any longer.

      --
      The purpose of language is communication, If the idea is clear the grammar ain't important
  13. Re:Think it's time to seriously consider AMD by turgid · · Score: 1

    So you've been living under a rock for the last four years?

  14. Great, but how fast can they compile the kernel? by freelunch · · Score: 4, Funny

    Nice article, but we need more Linux-centric bench and test sites.

  15. Hmm.. Pushing the top end... by Forge · · Score: 3, Insightful

    .. means cheaper CPUs at the bottom end.

    My needs are simple, Most of my systems would do just fine with a Duron 800MHz or even slower CPUs. With the advent of new high end chips heralds lower prices at the low end.

    It's gotten to the point where only a few popular niches need to even bother with anything but the absolute bottom end chips. I.e. Gaming, video encoding and servers (Faster chips mean more users on a server).

    Scientific Computing clusters, Compiling lots of code everyday etc.. are other niches worth noteing. For Web browsing, Office productivity, educational apps and old games I advise you to buy the chip so far behind the curve it won't be available in a few weeks.

    --
    --= Isn't it surprising how badly I spell ?
    1. Re:Hmm.. Pushing the top end... by Forge · · Score: 1

      Quoting myself :) 'I advise you to buy the chip so far behind the curve it won't be available in a few weeks'

      --
      --= Isn't it surprising how badly I spell ?
    2. Re:Hmm.. Pushing the top end... by necro2607 · · Score: 1

      That's generally true, although with things like RAM that's totally not the case - PC133 ram is stupidly expensive for its slow and outdated performance - it still costs $50 for a 128mb DIMM. The same amount DDR ram is actually cheaper!

      I realize it's partly due to the fact that far fewer companies manufacture the older RAM, but either way, paying high prices for outdated hardware seems kind of odd if you ask me...

      Getting back to your original topic, another good thing about new CPUs getting faster is the "old" computers that get thrown out are getting faster, too. I found a 400mhz Celeron computer thrown out behind a computer store, as well as numerous 166mhz Pentium systems. Most were stripped of their parts, but the motherboards/CPUs were still there.. awesome :)

  16. Re:Think it's time to seriously consider AMD by frozenray · · Score: 5, Informative
    Indeed, the outlook for AMD looks good given their roadmap and Intel's problems:
    • The "Prescott" P4 got mostly negative reviews, c't Magazine mentions in their current issue that Tejas (Prescott's planned successor) is being scrapped
    • For all the money they invested into Itanium, they seem to be lacking a coherent strategy for the architecture (or if they have one, they're not talking about it)
    • Their new processor rating and the power requirements of the Prescott line may be an indication that they cannot continue to crank up the clock on the CPUs any longer (GHz being the primary sales generator) without running into major problems with leak currents
    • AMD has processors which are looking more interesting to businesses than the previous CPUs, plus they have taken the lead in 64-bit processors for desktops and have dual core CPUs in the pipeline (heh)

    It will be interesting to see how Intel responds to these challenges - c't speculates that the future Pentiums will use the architecture they have in the Pentium M line (developed in Israel). If they're smart they'll introduce a dual core CPU based on the Pentium M architecture, if AMD is smart they'll modify their existing designs and beat intel to the punch again.

    Speaking as a business user, I'd welcome an emphasis on ergonomics and environmental concerns over raw speed. I'd rather have silent systems that do not overload the air conditioning with enormous amounts of heat than screamers which spend 99.9 % of their time waiting for the user to press a key anyway.
    --
    "There are already a million monkeys on a million typewriters, and Usenet is NOTHING like Shakespeare." - Blair Houghton
  17. Re:Ho Hum by digitalunity · · Score: 4, Insightful

    I'm sorry, but you do not truly understand how modern x86 chips work. You don't like them because they are 'CISC-mired'? The funny thing is, underneath they really aren't CISC. They are RISC to the bone. Each and every x86 instruction you feed a modern processor is deconstructed into many smaller RISC-like ops and they are processed independantly. Small register set you say? You don't get to address them directly, but both the Athlon and the P4 have had many more registers than the x86 ISA would lead one to believe for a really long time. The x86-64 is nice because now you get many more registers of larger size, directly addressable.

    In a purely dollar/performance comparison, nothing beats x86.

    --
    You can't legislate goodness. Let each to his own destiny, by will of his freely made choices.
  18. Re:Ho Hum by 10Ghz · · Score: 3, Insightful

    And still, that "technology of the 70's" is the fastest thing there is. Sure, you might have some CPU's that are even faster, but they are also alot more expensive. Those CPU's usually get better performance by adding lots and lots of cache to the CPU.

    If PowerPC (for example) is SOOOO much better, why doesn't it wipe the floor with x86? Sure, it's competetive, but it does not annihilate x86

    As to being register-starved... Again, that doesn't seem to hurt the performance of these chips that much. And if you use Opteron/Athlon64 with an 64bit OS, you get double the number of GP/SSE-registers (instead of 8, you get 16).

    As to CISC... Modern x86-CPU's are very much RISCue in the inside. And being RISC does not automatically mean that it's somehow better. You can have kick-ass CISC-CPU's, and you can have crappy RISC-CPU's.

    --
    Lesbian Nazi Hookers Abducted by UFOs and Forced Into Weight Loss Programs - -all next week on Town Talk.
  19. Re:Ho Hum by brap999 · · Score: 1

    Seeing as how Linux runs on just about every popular chip out there, your comment has no basis behind it. As a matter of fact, we can thank Windows for our dependancy on the x86 architecture, as it is the only platform it CURRENTLY runs on, until they finally release the 64bit version. You are an idiot who has only been first once in your life, in the shit for brains line. I do feel sorry for you however, it must be difficult living life with an IQ of 12 and a hockey helmet on head.

  20. Re:Ho Hum by dave420 · · Score: 1
    Why on earth does everyone blame everything on microsoft?? I'm all for some ragging if it's even plausible, but the amount of anti-microsoft tripe spewed out on this forum is just ridiculous. You used to be able to surf through it, but now everyone's blaming microsoft for global warming, outsourcing to india, Bush, Kerry, Nader, and the near-extinction of the giant panda.

    You admitted it yourself - it's AMD/Intels fault. They make the chips. Blame them. Microsoft are as much to blame as Linus is in this one. No amount of fanboy paranoid fascism is gonna change that :-P

  21. Re:Huh? Pentim 4 Extreme by AgntOrnge · · Score: 1

    That's a lot of animosity for an anonymous coward. Someone needs a good solid bitch slap

  22. Re:Ho Hum by _|()|\| · · Score: 2, Insightful
    take this register-starved, CISC-mired turkey out back and give it a proper burial

    This aging architecture has maintained an incredible price/performance ratio. At this price level, the only thing that compares is the G5. A comparable UltraSPARC, Itanium, POWER, or PA-RISC system will cost much more.

    As for registers, AMD64 doubled the number of general-purpose registers, which are already subject to register renaming.

  23. All I can say is... by Azureflare · · Score: 1
    Thank GOD for competition! I couldn't stand living with Intel as a monopoly if they took that kind of lazy attitude.

    It's my hope that AMD and Intel will be eternal rivals, and continue advancing and coming up with better products.

  24. Re:Ho Hum by Jeff+DeMaagd · · Score: 4, Insightful

    x86-64 only doubles the number of registers.

    Something tells me if the billions of dollars per year in R&D were spent on a fully-RISC system, externally and internally, it would be much faster, saving a stage or two of decoding and other internal mangagement, saving a lot of design and testing hassles.

    For over half a decade, DEC held its own against Intel with $70M / year CPU development budget, when Intel was spending $2B. They only got tripped up with poor marketing and problems and delays in fabbing the EV6 and EV7.

    For one, being fully RISC made it far easier to validate the chip design because it didn't involve lots of work disassembling instructions and keeping track of the results, predicting properly and so on.

  25. Re:Ho Hum by cosmo7 · · Score: 1

    As to being register-starved... Again, that doesn't seem to hurt the performance of these chips that much.

    I am pretty sure that in any modern x86 design the actual operation of the processor's registers is virtualized and accessing them is no faster or slower than accessing cached stack memory.

  26. Re:Ho Hum by Short+Circuit · · Score: 1

    Since Microsoft has been the dominant OS vendor for a long time, people have written their software for Microsoft products. And since the cheapest Microsoft solution was on PCs, people wrote their code for Wintel systems.

    Since the software was compiled to architecture-native code, it wasn't easily runnable on other architectures. As a result, even as Microsoft products were available for Alpha, third-party applications weren't. So people stuck with x86, 'cause that's where the software was.

    And the reason the PC architecture was cheap? IBM built it that way. They liked the 8086, but it was too expensive to implement (16 bit data bus) ... So Intel provided them with the 8088, which was cheaper (16 bits internal data, 8 bit external data bus. 20 bit adress bus.).

    I don't know if the PowerPC architecture was available at the time.

  27. Re:Think it's time to seriously consider AMD by wolf31o2 · · Score: 2, Informative

    It will be interesting to see how Intel responds to these challenges - c't speculates that the future Pentiums will use the architecture they have in the Pentium M line (developed in Israel). If they're smart they'll introduce a dual core CPU based on the Pentium M architecture, if AMD is smart they'll modify their existing designs and beat intel to the punch again.

    Funny enough, that is exactly what Intel has planned. They will also be shooting for dual-core, and then quad-core CPUs in the next 2-3 years. On the flip side, AMD has announced that they are already capable of producing dual-core Opterons, and are simply waiting for the market demand to meet their capabilities. After all, it doesn't make much sense to introduce something now that can wait until later. It extends the life of the current line and increases the return on R&D.

  28. Re:Ho Hum by Anonymous Coward · · Score: 1, Interesting

    > x86-64 only doubles the number of registers.

    Yes, and modern compilers make quite good use of what they have. Why build/buy more when the returns diminish so rapidly?

    > Something tells me if the billions of dollars per year in R&D were spent on a fully-RISC system, externally and internally, it would be much faster, saving a stage or two of decoding and other internal mangagement, saving a lot of design and testing hassles.

    RISC requires higher memory bandwidth, and more memory, to hold the small function instructions. Modern "CISC" cpus are RISC inside - they load a small "powerful" instruction and execute multiple RISC instructions within themselves - without loading all those instruction words over the memory bus.

    > For over half a decade, DEC held its own against Intel with $70M / year CPU development budget, when Intel was spending $2B. They only got tripped up with poor marketing and problems and delays in fabbing the EV6 and EV7.

    DEC had a radically different cultural ethic than Intel. While modern American Corporate Management would like to delude themselves otherwise, poor corporate ethics drive costs higher in many ways.

    > For one, being fully RISC made it far easier to validate the chip design because it didn't involve lots of work disassembling instructions and keeping track of the results, predicting properly and so on.

    These days CPUs are basic software design. Build and test a risk core. Then, build and test an interpreter. Layers my boy, layers. A technique as old, and testable, as the hills.

  29. Maple/Mathematica benchies while at it... by Anonymous Coward · · Score: 2, Informative

    http://smc.vnet.net/timings50.html is a start.

    Sad times when a

    Dell Precision 650, 4X3.06GHz Xeon, 512KB L2, 4GB, Win XP Pro V5.1 [35]:

    is slower than a

    Athlon 2800+, 512 KB cache, 333 MHz FSB, Win XP Pro

    1. Re:Maple/Mathematica benchies while at it... by Quikah · · Score: 1

      dell precision only supports 2 processors.

      --
      Q.
    2. Re:Maple/Mathematica benchies while at it... by sirsnork · · Score: 1

      Not to mention XeonMP processors aren't up to 3.06GHz, and thats what you need for 4 way

      --

      Normal people worry me!
  30. Question about itanium2 - Opteron by cazzazullu · · Score: 2, Interesting
    Here at the physics lab we are doing research about neural networks. This involves simulations that require a lot of memory and cpu-cycles. A problem we have encountered numerous times when building phase-diagrams is that the mathematical routines chrash when we reach critical parametervalues. This is caused by the fact that certain matrices become 'singular'. This problem does not arise however when we use 'double long' formats, or 64-bit floats, because these are way more precise and still can go a long way when 32-bit doubles already jump to zero, thus causing the problems.

    We have decided to buy/construct a fast 64-bit workstation where we can run our simulations without chrashes. Now my question to you fellow slashdotters is:

    The budget is a few thousand euros, not over 10 000 (this is comparable in dollars). What would the best bang-for-the-euro be? Single-Dual? Xeon-Opteron-Itanium2? It must at least contain 4 gig of RAM.

    Thanks for your suggestions, looking at several "comparison-websites" has only made us more confused.

    --
    int main(void) {while(1) fork(); return 0;}
    1. Re:Question about itanium2 - Opteron by painehope · · Score: 1

      Best bang for (euro)buck? For 10K you can probably build a 4 way Opteron system w/ 8 GB of RAM, especially if you are getting any kind of educational discount. I would definitely suggest Opteron, as from experience w/ vendors, they seem to be priced about the same as Xeons ( which are the current cheapest ), and do 64 bit and are also a bit faster for some apps. I would probably look at a 2-way Opteron system w/ 8GB of main memory, and then after that begin looking at either a 2nd workstation/server, or 4-way, if you have any money left.

      --
      PC moderators can suck my White pierced, tattooed dick. If you think pride == hate, s/dick/Aryan meat mallet/g.
    2. Re:Question about itanium2 - Opteron by flex941 · · Score: 1

      I think with that kind of money you should buy Dual-Opteron machine (with 244 or 246 cpu) and let it be built by some local not-so-expensive company (or even yourself). If corporate support is really not so important, then don't buy from IBM/HP/you-name-it (you will end up paying up to two times more than neccessary).

      Itanium2 is out of your league and there is no point spending money on half-baked (and cloned from AMD) 64bit Xeon. Opteron's hypertransport and on-die memory controller is what makes real difference to intel's old architecture.

    3. Re:Question about itanium2 - Opteron by mjuarez · · Score: 4, Informative

      The budget is a few thousand euros, not over 10 000 (this is comparable in dollars). What would the best bang-for-the-euro be? Single-Dual? Xeon-Opteron-Itanium2? It must at least contain 4 gig of RAM.

      Itanium servers are out of your league. A decent 1.5Ghz Itanium chip with 3Mb of on-die cache will set you back around $3,000. Not including memory, hard disks, etc. Just for ONE chip.

      Xeon are way cheaper, but in most cases are more expensive than Opterons, do not scale very well when used in 2-way or higher configurations, and can only use 4Gb in flat mode. To access above 4Gb, you need to use PAE, which greatly hampers the performance (PAE is akin to the "high-memory" window trick they used back in the DOS days).

      Opterons, on the other hand, are usually cheaper than Xeons, much cheaper than Itanium, almost always have better performance that Xeons, scale much better (in fact, a 2-way server performs better than a 1-way times 2!) and are only beat by Itanium in floating point performance, and then only barely.

      There's another thing. Opterons are going to become dual-core in less than 2 years, with the same pinout as today. That means that if you have a lowly 2-way server that you're thinking about dumping, you can buy new dual-core Opterons and instantly get a 4-way out of your old 2-way server. Also, Opterons can access linearly up to 1Tb of physical RAM (that's 1,024 Gb), and up to 256Tb of virtual memory. And, finally, it's the only 64-bit processor you can get today that works with all your 32-bit x86 software. Finally, Opterons consume less energy than equivalent Xeons or Itaniums, and this becomes very important when thinking about A/C, UPS, standby power generators, etc.

      I'd recommend you go with Opteron. Check out some well known tier-2 vendors such as Angstrom, Appro or Verari. They all make excellent quality Opteron servers and workstations. If you want brand names (and are willing to pay for it), check out Sun, Hewlett Packard or IBM for 2-way servers, or HP for a 4-way. IBM even has a dual Opteron workstation, if that's what you want.

      Good Luck,

      Marcos

    4. Re:Question about itanium2 - Opteron by joib · · Score: 4, Informative


      This problem does not arise however when we use 'double long' formats, or 64-bit floats, because these are way more precise and still can go a long way when 32-bit doubles already jump to zero, thus causing the problems.


      On the x86 architecture, "long double" is 80-bit, and not 64-bit, which is plain "double". "float" is 32-bit.

      However, note that the x86 does all floating point operations with 80-bit precision. So you don't get any performance advantage from using only single precision variables (other than lower memory bandwidth usage). Thus, a good rule of thumb is to always use double (long double might be better but isn't portable, and SSE doesn't support it if you want to use that). Single precision is mainly useful when you want to store large amounts of data (remember to cast the part of the data you're working on to double before calculating).

      As others have pointed out, currently the Opteron is quite unbeatable in price/performance. 10000 EUR should certainly get you a 2 cpu system. Probably not 4 cpu:s though? Given that you need lots of memory, especially avoid the Xeon (or some other 32-bit architecture). Linux can only give 3 GB to one process with it's default configuration (I guess windows is similar?). With the so-called 4g/4g patch you can allow 4 GB for each process, but the price is lower performance. With a 64-bit architecture all those problems disappear.

    5. Re:Question about itanium2 - Opteron by smurf975 · · Score: 1

      You could use a Apple dual g5 but I don't own any Apple machines so I can't tell how good they really are.

      Look at this site that another slashdotter posted several posts before you: http://smc.vnet.net/timings50.html (mathimatica benchmarks)

      --
      -- I don't buy it, I grow it.
    6. Re:Question about itanium2 - Opteron by Spoing · · Score: 4, Interesting
      1. There's another thing. Opterons are going to become dual-core in less than 2 years, with the same pinout as today. That means that if you have a lowly 2-way server that you're thinking about dumping, you can buy new dual-core Opterons and instantly get a 4-way out of your old 2-way server.

      Even if the pinouts stay the same, the system boards you can buy now might not support the processors being sold in 2 years.

      Why upgrade the CPU only in 2 years and skip the other improvements available at that time?

      I have very infrequently had a CPU upgrade that was worth it, while updating other components (disk, network, added RAM, video, ...) usually do give a reasonable boost. Most of the time the modest real performance increase from swapping in a new CPU -- one that is bound to the limits of the existing system board-bound -- isn't worth the time or money.

      The only exception I can think of is if you buy behind the bleeding edge and upgrade every 6-9 months to a processor that is substantially better (2x) but not bleading edge.

      To do this properly usually requires getting an advanced system board that can handle the higher end components and then turning around and being cheap on the CPU. While this can be a good idea, it usually isn't and the situation is very specific to the system board.

      1. IMNSHO:
      2. Always buy what you need today and do not look over 6-9 months in the future for upgrades.

      If you expect a payoff in a future upgrade, make sure that the hardware you buy now is also what you need today and do not depend on a future promise. If it works out, HOO-RA! If not, you haven't lost a thing.

      --
      A firewall can not protect you from yourself. Turn off what you do not need. Do not use the firewall to do your work.
    7. Re:Question about itanium2 - Opteron by Mikkeles · · Score: 1

      Have you thought about getting a mathematician/numerical analyst (e.g. grad student) to take a look at your system? Brute forcing singular matrices usually doesn't scale well. Being able to identify the critical points would probably give you more bang for your euro.

      --
      Great minds think alike; fools seldom differ.
    8. Re:Question about itanium2 - Opteron by lukesl · · Score: 1

      I work in a neuroscience lab, where we were running up against the wall of 32-bit address space. We ended up getting a quad opteron system with 16GB of RAM, and it's been great. AMD releases a highly-tuned math library called the "AMD Core Math Library" that has a full implementation of LAPACK, as well as a bunch of FFT routines. If you download octave (a matlab-compatible analysis package), you can recompile it and link it against the ACML, resulting in some serious speed improvements over matlab or C code linked to generic math libraries. I strongly recommend looking into this.

    9. Re:Question about itanium2 - Opteron by hoof · · Score: 1

      "However, note that the x86 does all floating point operations with 80-bit precision. So you don't get any performance advantage from using only single precision variables (other than lower memory bandwidth usage)."

      Actually, division and square root are faster in single precision mode. FDIV is 18 cycles for single precision vs. 38 for extended precision on a PIII.

    10. Re:Question about itanium2 - Opteron by cgori · · Score: 1

      If you care about floating-point performance, regardless of how much I dislike the boxes, Itanium has the best SPECfp results (by far) -- look at the SPEC website
      if you want to see what I mean.

      Opteron has the best overall integer performance at the moment, and probably the best price/performance ratio. I believe it will thrash a Xeon on integer or FP, due to the doubling of the internal register file for both. But for max-floating-point ops per second, Itanium is still king. It's true for hspice in my line of work (which mostly consists of FP matrix math), it should hold true for you too.

      I would try to get a set of eval systems from someone like HP, if I was you. They sell all 3 architectures -- tell them you will buy whichever one runs your research app the fastest and they will probably be willing to do the demo (or could get you some time ssh'd into one to at least try it out on your code).

      -c

    11. Re:Question about itanium2 - Opteron by Psychotext · · Score: 1

      Just as a quick reply to this. I just built a rather nice dual Opteron 248 workstation with 2gb ram, five 72gb 10k SATA raptors (Raid 0 / 5) and a Winnov Videum Duo capture card in it. It's used for video capture / editing / encoding / streaming.

      Total cost, 3200GBP / 4750EUR. Could I be any happier? Not a chance, this thing has been able to handle pretty much everything I've thrown at it.

      --
      People that believe in their opinions don't post AC.
  31. Re:Ho Hum by Anonymous Coward · · Score: 1, Informative

    Nice troll...

    Modern "x86" chips have a very advanced RISC-like architecture, x86 is only the instruction set - nothing more.

    It takes a little bit of extra circuitry to translate the x86 ops instead of using a brand new ISA, but it's well worth it for the backward compatibilty IMHO.

  32. 64bit AMD only benched with 32bit OS and software? by stock · · Score: 3, Insightful
    One shouldn't compare apple with pears. So a shootout between 32bit only Xeon's and 64bit AMD's (ok which do 32bit) is a weird exercise. The testers only ran a 32bit version of Windows XP. That should be obvious. Still the Opteron 150 and 250 seem to win many shooutouts.

    Robert
    For real 64bit performance visit VooDoo software tuning and download the 64bit 2004 Longsword Gamez Demo. The Download of UT2004 64-bit English Linux Demo is around 200Mb.

  33. XP-M 2500+ @ 2.4ghz? by Anonymous Coward · · Score: 1, Insightful

    Am I the only one who finds it funny/intresting that they would think to include an XP-M overclocked to 2.4ghz in this test?

    It's pretty funny that even on a benchmark like this, the mobile is known to be the best option for Athlon Barton XP's.

    Anyone looking to spend $100 on a cpu the choice is clear. Pretty solid performance in comparison too. Usually fitting right in around the P4 3.4ghz level of performance. AND ONLY $300 cheaper haha.

    1. Re:XP-M 2500+ @ 2.4ghz? by SFBwian · · Score: 1

      Yep. It's almost-certain overclocking abilities have something to do with the lower voltage requirements for mobile chips and the testing methods used to handpick them. TechReport did a mini-article on that too, I think.

      --
      I'm looking to get rich. I've got steps #2 (????) and #3 (PROFIT!) planned out, but am having trouble coming up with #1.
  34. Re:The Conclusion... Pimping by SFBwian · · Score: 1
    I gathered that they tested on Windows 32-bit because that is the OS with the most desktop workstation user base. To compare existing products to new ones on what people actually use, and more importantly, what is actually available (have you bought your copy of Windows XP 64-bit edition yet?), you benchmark what they did. Another reason to test a 64-bit-capable CPU on a non-64-bit OS is simply because they can, as it's not limited to solely 64-bit applications.

    They test on games because the benchmarks are easy for them to perform (they've been doing the same set of tests for quite a while) and because they cater mostly to a game-playing audience. AMD almost always compares favorably to Intel in game tests.

    Still, I agree that it would be nice to see benchmarks of 64-bit software running on a 64-bit operating system. Is UT2kX 64-bit ready for Linux?

    --
    I'm looking to get rich. I've got steps #2 (????) and #3 (PROFIT!) planned out, but am having trouble coming up with #1.
  35. Re:Ho Hum by haggar · · Score: 1

    I am neither a PPC or a x86 pundit (microcontrollers here), but as I understand it, the PPC has much more registers than the x86, and this, in my experience, does give a speed advantage.

    --
    Sigged!
  36. Re:Great, but how fast can they compile the kernel by Anonymous Coward · · Score: 1, Informative

    On my dual opteron 244 (2 x 1.8 GHz), compiling a 2.4.26 kernel and its modules:

    $ make bzImage && make modules

    takes 96 secs.

  37. Re:Think it's time to seriously consider AMD by haggar · · Score: 1

    Did the c't article mention that the Pentium M was developed in Israel, or did you just know otherwise? I had the impression that the fact that the Centrino was developed in Haifa, was kept relatively secret, as Israel seems to always attract so much controversy.

    --
    Sigged!
  38. Re:Ho Hum by fmorgan · · Score: 1

    This isn't a valid argument "If PowerPC (for example) is SOOOO much better, why doesn't it wipe the floor with x86? Sure, it's competetive, but it does not annihilate x86
    ". Unfortunately, the best tech solution doesn't always win.

    Otherwise a pseudo-OS from a company I won't name wouldn't have conquered monopoly status....

  39. But the price... by Dezer · · Score: 5, Informative

    Has anyone actually checked on the price? Take a gander over at http://www.amd.com/us-en/Processors/ProductInforma tion/0,,30_118_609,00.html?redir=CPT301 The new 150/250/850 models are $637/$851/$1514 comparatively. Compare that to the *48 models, which are still expensive. Does AMDs increased market share herald a a new strategy from AMD? Back "in the day" we all used to love AMDs more than Intels because of the great performance/cost ratio. I would love to have a pair of opterons, but the prices are ridiculous. I miss the old AMD...

    1. Re:But the price... by philthedrill · · Score: 1

      I would love to have a pair of opterons, but the prices are ridiculous.

      Well, there are a couple of factors to consider. First, AMD is the only company right now that offers 64-bit x86. Second, the price is going to be pretty high since these were just released. Models that are a couple of notches down from top-of-the-line are better values. Third, these are server chips, so you should probably be wondering when dual processor Athlon64's will be coming out.

    2. Re:But the price... by Illissius · · Score: 2, Informative

      The Opteron 1xx-s cost very nearly the same as an Athlon 64 of the same speed (which in turn cost the same as a Pentium 4 with MHz equal to the A64's rating), so those are fairly priced. The 2xx-s naturally cost somewhat more, while the 8xx-s cost a /lot/ more, which is reasonable seeing as only enterprise customers would think of wanting them, and iirc they're still a lot cheaper than Xeon MPs.

      --
      Work is punishment for failing to procrastinate effectively.
    3. Re:But the price... by juhaz · · Score: 1

      Well, these are not meant for you. Opteron is first and foremost a workstation/server CPU, it's obviously marketed and priced for companies.

      The regular A64 is much better bang for the buck, quite competive with P4. Not to mention the "old" Athlon XP's still here and still beats the living daylights of any other CPU on price/performance ratio.

    4. Re:But the price... by mjuarez · · Score: 1

      I would love to have a pair of opterons, but the prices are ridiculous. I miss the old AMD...

      That's like saying you'd love to have a new BMW 545i, but only want to pay $10K for it. These are one the best price/performance chips out there. Not to mention that they can run anything you're running right now (assuming you're on an x86 machine, as 99% of the population), unlike Itanium. Oh, and don't forget the NX-bit, which will block buffer-overflow viruses, when coupled with WindowsXP SP2.

      Even then, if you really want to test dual-Opteron performance, you can get a couple of 240 Opterons for under $200 each. That's very inexpensive, less than the price of a SINGLE Xeon 3.0Ghz.

      Marcos

    5. Re:But the price... by ducomputergeek · · Score: 2, Insightful
      Opterons are designed to be a server/high end work station chip. You need to comapare its price with like typed chips like the Xeon and SPARC from SUN.

      I've worked in the rendering graphics world where dropping $15k on an SGI, SUN, or ALPHA workstation in the past was nothing. The renderfarm alone was $3.2M of ALPHA servers back in the day.

      Funny thing now is it was replaced last summer with $750k worth of IBM blade servers that nearly quadrupled the number of processors in half the space.

      Anyway, AMD really isn't targeting gamers or the average users with these chips. If you want to compare price/performance, need to take the Athlon* series vs. the P4's.

      --
      "The problem with socialism is eventually you run out of other people's money" - Thatcher.
  40. Re:Ho Hum by 10Ghz · · Score: 1
    This isn't a valid argument "If PowerPC (for example) is SOOOO much better, why doesn't it wipe the floor with x86? Sure, it's competetive, but it does not annihilate x86
    ". Unfortunately, the best tech solution doesn't always win.


    Well, apparently x86 is crappy. It's "70's technology". If it's so damn crappy, why aren't any of the "better" CPU's wiping the floor with x86? I'm not talking about popularity here, I'm talking about performance. If x86 is so damn crappy, how come it performs so well with so little money when compared to some of those "better" CPU's? How come those "better" CPU's are only suitable for running Photoshop-filters [flamesuit = ON]?

    Like I said, there ARE faster CPU's out there. But they also cost alot more and they usually get their performance by having lots and lots of cache, and not by being "better CPU-architecture". How well would Opteron 150 (for example) perform if it had 8 megs of L2-cache?
    --
    Lesbian Nazi Hookers Abducted by UFOs and Forced Into Weight Loss Programs - -all next week on Town Talk.
  41. Exactly. by Cyno01 · · Score: 1

    If you need the cooling and are overclocking or whatever you should lap your heatsink to a mirror ffinish, and if possible the proceessor too. Your correct, thermal compound is only there to fill in microscopic imperfections. Check out this for an interesting comparison of thermal compund effectivness.

    --
    "Sic Semper Tyrannosaurus Rex."
  42. Lots of performance variables.. by -tji · · Score: 1

    The board they used only had memory off of one of the CPU's in the dual Opteron systems. I wonder how much this effected performance in the compute intensive benchmarks, or even memory bandwidth benchmarks.

    A benchmark with a Quad Opteron, like the Tyan Thunder K8QS would be interesting. It has memory hanging off of each of the four CPU's. (Which opens a lot of questions about how that memory is managed.. Is there some sort of memory affinity per processor? What is the performance hit if one CPU needs memory off another?)

    Also, you have the 32 bit vs 64 bit stuff. It would be interesting to compare the best 32 bit optimized app against a 64 bit optimized version. Maybe something like MPEG2 compression of an HD stream.

    1. Re:Lots of performance variables.. by joib · · Score: 1


      Which opens a lot of questions about how that memory is managed.. Is there some sort of memory affinity per processor?


      It's called a NUMA. Google for the term and ye shall find.

      The memory manager of the kernel has to take NUMA into account, or performance will suck.

    2. Re:Lots of performance variables.. by tiptone · · Score: 1

      buried all the way down on page 2:

      "The Thunder K8W showed us its eye-popping memory bandwidth potential in our recent Opteron workstation motherboard round-up, but it was only able to realize that potential with the 64-bit beta of Windows XP. Right now, there is no workstation-class version of Windows available, outside of early beta versions, that understands the non-uniform memory access (NUMA) configuration of Opteron systems, so much of the K8W's memory bandwidth goes to waste. This reality, combined with the fact that the K8W doesn't tolerate aggressive memory timings well, caused the Thunder K8W to lose out to the K8T Master2 in many of our performance tests. So I decided to go with the K8T Master2 for our purposes."

      --
      Please don't read my sig.
    3. Re:Lots of performance variables.. by -tji · · Score: 1

      But, it's not a typical NUMA architecture, is it?

      I think of NUMA more like the BBN Butterfly I worked on in college, where many CPU boards are connected by a relatively slow bus, and each CPU board has its own locally addressed memory. So, each CPU is a node, and for any decent performance it must use its own local memory.

      In the Opteron case, the memory is still relatively local, and based on the benchmarks in the article the performance hit for accessing another CPU's memory is not huge. In general, I think these systems are treated as SMP, with one large memory space. My question is how difficult is a memory affinity implementation, and is it worth the overhead/complexity?

    4. Re:Lots of performance variables.. by joib · · Score: 1


      But, it's not a typical NUMA architecture, is it?


      I'm no expert on this, but I guess you're right.


      In the Opteron case, the memory is still relatively local, and based on the benchmarks in the article the performance hit for accessing another CPU's memory is not huge.


      Remember that the Opteron has an on-chip memory manager (MMU). Consider a system with two cpus, CPU#0 and CPU#1. The architecture, AFAIK, is that each CPU has one HyperTransport link to its own memory, and another HyperTransport link to the other CPU. So, if CPU#0 finds that it needs to access memory local to CPU#1 it needs to go through

      CPU#0 -> MMU#0 -> HyperTransport -> MMU#1 -> HyperTransport -> Memory

      instead of the direct route which is

      CPU#0 -> MMU#0 -> HyperTransport -> Memory

      This will certainly increase latency as well as reducing the memory bandwidth available to CPU#1 (since CPU#0 is also using MMU#1).


      In general, I think these systems are treated as SMP, with one large memory space.


      Well, having a shared memory is the basic idea behind using a NUMA in the first place..


      My question is how difficult is a memory affinity implementation, and is it worth the overhead/complexity?


      Linux, at least, already has NUMA support, originally for the big SGI machines. So the "complexity price" has already been paid. Obviously, cycles are used executing the NUMA logic in the kernel (when NUMA support is enabled, that is), but I guess that overhead is vanishingly small compared to the overhead of accessing potentially Gigabytes of non-local memory.

    5. Re:Lots of performance variables.. by RzUpAnmsCwrds · · Score: 1

      "CPU#0 -> MMU#0 -> HyperTransport -> Memory"

      Not quite.

      MC = Memory Controller

      Think CPU0 > FSB0 > MC0 > DDR

      This is the same path as any system with a northbridge, except that the FSB runs at the same clock as the CPU, and is physically much shorter.

      Connecting to another CPU is as follows:

      MC = Memory/HyperTransport Controller
      HT = HyperTransport

      CPU0 > FSB0 > MC0 > HT > MC1 > DDR

      Not as low-latency as a typical northbridge arrangement, but not bad.

      Remember that memory bandwidth scales as you add more CPUs in an Opteron system. The HyperTransport link will eventually become a limiting factor, so it is critical to have an OS that knows the difference between "near" and "far" memory.

  43. Re:Ho Hum by Zog+The+Undeniable · · Score: 1

    So you have a Mac, I assume? Well, the x86 architecture *is* a crock, but modern chips only emulate it. They're RISC, superscalar and all kinds of other things I don't fully understand, but certainly weren't featured in the Intel 8086 you might be thinking of ;-)

    --
    When I am king, you will be first against the wall.
  44. Re:Think it's time to seriously consider AMD by toddestan · · Score: 1

    Maybe he was refering to the K5?

  45. Re:Ho Hum by jejones · · Score: 1

    Well, apparently x86 is crappy. It's "70's technology". If it's so damn crappy, why aren't any of the "better" CPU's wiping the floor with x86? I'm not talking about popularity here, I'm talking about performance.

    For some time, better CPUs did wipe the floor with the x86 family CPUs of their time, but didn't have the advantage of the Wintel monopoly. As for now... if you gave Intel's R&D budget to Motorola to spend on developing the PowerPC, I bet it would improve considerably.

  46. Power, Heat, Noise by timeOday · · Score: 2, Informative

    I wish reviewers would start including a section on how much power the systems take. I'd like to replace my home server box and would like to minimze power consumption since it runs 24/7. I'd also like to replace my 'desktop' PC and would like to minimize fans because I like to listen to music on it.

    1. Re:Power, Heat, Noise by alienw · · Score: 2, Informative

      I have a much lower-end Opteron 140 in my box, and for all I know, it runs about 50% cooler than a comparable Athlon XP. I wouldn't worry too much about cooling them, just don't go for the super-high-end stuff. If you are looking for low power consumption, get the slowest chip that's made with the smallest process size.

    2. Re:Power, Heat, Noise by Illissius · · Score: 1

      Here's the lowdown: Athlon 64s and Pentium 4 C (Northwood) processors are comparable in power consumption (or heat dissipation? one of the two, but they are rather closely related), around 80-90W iirc. The newer Pentium 4 E (Prescott) processors consume rather more, over 100W. So there's that.

      --
      Work is punishment for failing to procrastinate effectively.
    3. Re:Power, Heat, Noise by timeOday · · Score: 1
      I have found that the ThermalTake Xaser III case to be very quiet while still cooling very well.
      Holy cow those look nice... and expensive! (Couldn't find a price list, but it's pretty obvious).
  47. Re:Ho Hum by fmorgan · · Score: 1

    Well, it's very hard to create benchmarks that effectively compare architectures. And vendors (all vendors) muddle the waters.
    But actually some x86 (Xeons) are more expensive then PPCs. Some other great chip architectures like Alpha died (or dieing) because of neglect from their corporate owners;
    The x86 succeed in part because of the Wintel duopoly. There was a Windows NT port for the Alpha (and PPC, but I think this was only in beta) but it didn't went anywhere.

    As for your "photoshop" comments, probably you are thinking about Apple's benchmarks; they use photoshop because that's what a big part of their marketshare cares about.

    AS for cache, I don't know about the 8Mb for the Opteron, but the G5 has 512k of L2 and 64Kb of L1. I guess that with 8Mb it might improve a lot also.

    As for things like the most used CPUs, probably the x86 don't have that claim. Things like the Z80 probably outnumber them a lot; and they are much cheaper than x86s.

  48. Re:Ho Hum by flex941 · · Score: 2, Interesting

    Is it possible for Intel/AMD to make those chips so I can turn off the x86 emulation crap and use internal RISC directly ... so everyone could slowly migrate away from x86 and CISC?

  49. Re:Think it's time to seriously consider AMD by frozenray · · Score: 1
    Here's an online version of the c't article I mentioned (sans pictures). Quoting from it:
    "Definitiv bestätigt hat Intel jedoch nur, dass die Entwicklung des geplanten Prescott-Nachfolgers mit Codenamen Tejas sowie seines Workstation-Kollegen Jawhawk eingestellt wurde. Stattdessen soll ein echter Doppelprozessor, also ein Prozessor mit zwei physischen Kernen auf dem Chip, im nächsten Jahr die Desktops bevölkern. Allerdings wollten die Kalifornier noch nicht zugeben, was alle Internet-Spatzen von den Websites zwitschern, nämlich dass es sich hierbei nicht um einen Nachfahr der in Oregon entwickelten Pentium-4-Linie, sondern um einen Spross aus der israelischen Design-Schule in Haifa handeln soll."
    Rough translation (excuse my English):
    "Intel has only acknowledged that development of the Prescott successor Tejas and its workstation-brethren Jayhawk has been discontinued. Instead, a true dual processor - a processor with two physical cores on the chip - will populate desktop PCs next year. The californians were reluctant to admit what many are speculating about - namely, that this probably won't be a descendant of the Pentium-4 line developed in Oregon, but a child of the israelian design school in Haifa."
    Further down in the article they mention Pentium-M designer Shmuel "Mooly" Eden, who has been transferred to Oregon some time ago.
    --
    "There are already a million monkeys on a million typewriters, and Usenet is NOTHING like Shakespeare." - Blair Houghton
  50. Re:Think it's time to seriously consider AMD by daniel23 · · Score: 1

    The article mentioned it and it was no news for any regular reader of that magazine.

    There seems to be a consensus, at least among those with a technical interest that the quality of many Israelian products is not affected by controversial aspects of what religious fundamentalists, narrowminded politicians, government and military do.

    And the same is true for products of other countries that always attract so much controversy, like i.e the US of A.

    --
    605413? Yes, it's a prime.
  51. Re:Ho Hum by alienw · · Score: 1

    It doesn't matter. Every computationally intensive program makes use of SSE/MMX/3Dnow instructions, which are pretty much meant to bypass the CISC part of the processor.

  52. Re:Ho Hum by alienw · · Score: 1

    Well, it's very hard to create benchmarks that effectively compare architectures.

    Why? Take a Linux distro (Gentoo), recompile it for both architectures, run some application benchmarks that represent what you typically do with your computer (i.e. an MP3/Ogg/Divx encoder, Mozilla, OpenOffice...). The thing is, x86 usually wins those types of comparisons. I think that was the point the parent was trying to make.

  53. Re:AMD is back by ckaminski · · Score: 1

    I imagine they're not going to have much of a choice if Sun does decide to move to Opteron over the next 5-7 years. With no more Ultrasparc V coming, it's common guess/knowledge that the multicore Ultra IV's are the end of the road for Sparc.

  54. Re:Think it's time to seriously consider AMD by robnauta · · Score: 1
    Um... HUH?? The K6-2 and especially later the K6-3 ran RINGS around the same speed Pentium and P-II chips of the day?! What the heck are you talking about?!

    Well maybe the K6-2 450 did, but by then super 7 boards like the Asus P5A-B were already disappearing from stores. And it wasn't overclock friendly, a 450 would more often fail than succeed to run at 500. The K6-3 was better, but even more on the edge, and was just too expensive. Sure, the K6-2+ was a whole lot better, but it was way too late, and much too rare. It was nearly impossible to buy one of those.

  55. Re:The Conclusion... Pimping by SFBwian · · Score: 1
    Sounds to me like it's more the case that the article in question didn't pertain to your specific wants in a new processor article. It doesn't seem like the submitter hid the intents of the article:

    Even if you have no interest in AMD's latest workstation chips, the review is worth checking out to see how two dozen of the fastest workstation and PC processors stack up in rendering, scientific computing, speech recognition, and even gaming tests."

    In fact, nowhere do I see the implication that this would be a suite of tests regarding the Opteron's capability as a server processor. To quote the article itself:

    First and foremost, we tested the Opteron 150 and 250 as workstation processors, not as server chips. Perhaps next time out we'll consider doing a suite of server-oriented benchmarks, but PC-based workstations are a more common part of our coverage around here.

    Overall, I just don't think that you're the audience they intend to read the article. Still, you're not all Slashdotters, so I would assume (as the submitter probably did) that some people might benefit from it anyway. Hence, on Slashdot.

    Who knows? If I was into gaming, I'd be following the gamers sites, not reading Slashdot.

    No, you wouldn't, because gaming sites don't do comprehensive benchmarks of hardware on multiple processors and video cards. They do game reviews. If you're wondering which new processor or video card is going to give you the most bang for your buck on a game you are already planning to buy (or already have), that doesn't help much.

    And if I'm not mistaken, Slashdot has a games section. I could say the same thing about music (re: every single iTunes- and Rio-oriented article). Slashdot is not limited to just server administrators and programmers (whether or not this is a bad thing in the first place, isn't the point). So, if an article doesn't cater to some people, is it really fair to bitch and moan that it doesn't do that for niche X?

    Finally (and back to the main topic), TechReport is an enthusiast site. It doesn't handle every practice that products could be intended for, and doesn't handle all products for a given practice. Take it with a grain of salt.

    --
    I'm looking to get rich. I've got steps #2 (????) and #3 (PROFIT!) planned out, but am having trouble coming up with #1.
  56. Re:Great, but how fast can they compile the kernel by ckaminski · · Score: 1

    Sigh...

    I remember the days when I could at least drink a good cup of coffee and catch up with the local newspaper while waiting for my kernel compile to finish...

    Sigh...

  57. Re:Ho Hum by necro2607 · · Score: 1

    Actually, when that earlier parent post said "70's technology" I believe it was referring to UNIX, not the x86 architecture. Was x86 even around in the 70's?

    Also the PPC isn't just suitable for running PS filters... it was always known as the best CPU for running emulators, for reasons I'll never know (hey I'm not a programmer nor did I even care to emulate old video games), plus it's always been a pretty respected CPU. I doubt the cost of the PPC is any higher than x86, but the rest of the Apple-unique hardware is what makes the costs of Macs so damn high.

    I agree though about the cache thing. It seems like x86 CPU manafacturers just load on more cache and claim the architecture is highly improved, when there are only minor differences in actual CPU architecture.

    Oh well, it's not like it makes a difference because software just gets more bloated and inefficient as CPU speed increases. It's not like I can go to a web page any faster or load up a game or type an essay any faster than I could 4 years ago, yet I was using a 225mhz computer back then, and now I'm using a 1666mhz computer. Does that make sense do you? It doesn't to me.

  58. So does this mean by gnu-sucks · · Score: 1

    ...that it supports WiFi now?

    I'm so confused...

  59. Re:The Conclusion... Pimping by cixelsyd · · Score: 1

    You know you can compile with GCC and tell it use BOTH the 387 FPU and the SIMD units? Is that what happened? Or was it only using one set the whole time in both cases? You can use both the FPU and SIMD at the same time? Last thing I heard, any sort of SIMD instructions used the FPU registers, thus making it impossible to use both simultaneously. If I'm wrong (all I'm really familiar with is MMX, which is kind of old at this point), please correct me, because that sounds like it could definitely come in handy in some circumstances. Let me know.

    --
    Take a dollar, divide it by 100, take two and call me in the morning.
  60. AMD also climbing back regarding total Sales by pillageplunder · · Score: 1

    Article from Businessweek http://www.businessweek.com/technology/content/may 2004/tc20040519_3643_tc024.htm Points out that AMD is regaining market share. Nice to see. It helps that HP is an AMD shop. Also points out AMD 64 bit chip has the gamers all agog, and Intel so far with nothing to compare.

    --
    "Work is the curse of the drinking class" Oscar Wilde
  61. Go back... by Kjella · · Score: 2, Informative

    ...and read some of the papers on x86-64. AMD has a lot more than 16 registers *internally*. But it turned out the performance got WORSE when they were exposed to the compiler, instead of managed internally. If they can't even manage such a trivial change well, it's likely the RISC compilers would do worse than a CISC-RISC decoder stage.

    If you want to make a computer performing anything close to modern standards, you're going to have to deal with interdependency of the RISC instructions anyway (pipelining, hyperthreading, multiple cores etc.) Don't you think Intel or AMD would provide a "native" interface if the decoder stage was really holding them back?

    In short, I'm sure the engineers at AMD and Intel have picked apart x86 code and said "With perfect compilation to our internal structure, how much faster would it be?" and found that it simply isn't the way you describe it.

    Kjella

    --
    Live today, because you never know what tomorrow brings
  62. Re:Ho Hum by fmorgan · · Score: 1

    you are assuming that compilers like GCC are equaly optimized for all platforms?

  63. Re:Think it's time to seriously consider AMD by juhaz · · Score: 1

    K6-2 was pretty fast in integer math but it had a very lousy FPU.

    It was faster than P2 in some things, but due to that FPU, even Celeron badly beat it in the one area that values speeds over anything - games.

    K6-3 was fast, but they never managed to run it beyond 450MHz, and even that was too late.

    And Pentium wasn't "same day", but previous generation, so of course it was slower. Nor were there any that were same speed, the last pentium was 233MHz and K6-2 started from 300.

  64. Actually by Dimensio · · Score: 1

    My HSF (which came standard with the AMD 64 processor) actually had a layer of grease on it. Not a pad, actual thermal grease. Seemed kind of thick, too.

  65. Re:Ho Hum by Glock27 · · Score: 2, Informative
    AS for cache, I don't know about the 8Mb for the Opteron, but the G5 has 512k of L2 and 64Kb of L1. I guess that with 8Mb it might improve a lot also.

    The Opterons have 1 MB (8 Mb) L2 cache where the G5 has .5 MB (4 Mb) L2.

    At similar clockspeeds I think the performance is fairly similar, though the Opterons may do better in a dual-CPU configuration since they have on-chip memory controllers and thus more total memory bandwidth.

    I'd like to see a head-to-head shootout using top compilers (an often overlooked issue) for both.

    --
    Galileo: "The Earth revolves around the Sun!"
    Score: -1 100% Flamebait
  66. Re:The Conclusion... Pimping by WhiteDragon · · Score: 1

    I was just reading this in the gcc info today (trying to discover the absolutely optimal compiler options for my box):

    `-mfpmath=UNIT'
    Generate floating point arithmetics for selected unit UNIT. The
    choices for UNIT are:

    `387'
    Use the standard 387 floating point coprocessor present
    majority of chips and emulated otherwise. Code compiled with
    this option will run almost everywhere. The temporary
    results are computed in 80bit precision instead of precision
    specified by the type resulting in slightly different results
    compared to most of other chips. See `-ffloat-store' for more
    detailed description.

    This is the default choice for i386 compiler.

    `sse'
    Use scalar floating point instructions present in the SSE
    instruction set. This instruction set is supported by
    Pentium3 and newer chips, in the AMD line by Athlon-4,
    Athlon-xp and Athlon-mp chips. The earlier version of SSE
    instruction set supports only single precision arithmetics,
    thus the double and extended precision arithmetics is still
    done using 387. Later version, present only in Pentium4 and
    the future AMD x86-64 chips supports double precision
    arithmetics too.

    For i387 you need to use `-march=CPU-TYPE', `-msse' or
    `-msse2' switches to enable SSE extensions and make this
    option effective. For x86-64 compiler, these extensions are
    enabled by default.

    The resulting code should be considerably faster in the
    majority of cases and avoid the numerical instability
    problems of 387 code, but may break some existing code that
    expects temporaries to be 80bit.

    This is the default choice for the x86-64 compiler.

    `sse,387'
    Attempt to utilize both instruction sets at once. This
    effectively double the amount of available registers and on
    chips with separate execution units for 387 and SSE the
    execution resources too. Use this option with care, as it is
    still experimental, because the gcc register allocator does
    not model separate functional units well.

    So it looks like you can use the FPU and the SIMD registers at the same time! I would love to see if this actually makes an improvement in the execution speed.

    --
    Did you mount a military-grade, variable-focus MASER on an unlicensed artificial intelligence?
  67. Re:The Conclusion... Pimping by BiggerIsBetter · · Score: 1
    --
    Forget thrust, drag, lift and weight. Airplanes fly because of money.
  68. Quad Opterons? by ele7ven · · Score: 2, Interesting

    Seldom do you get to see the performance of quad opterons in benchmarks. With amd's hypertransport technology, the 800 series decimates even the newest 4mb L3 cache xeons. Perhaps, however, it's that reviewers realize they don't need to show the complete scaling potential of the opteron to make the point that it's a superior workstation cpu.

    1. Re:Quad Opterons? by bersl2 · · Score: 1

      Tyan's got a good-looking quad Opteron board (if only it had an AGP slot!). One of these days, I'm gonna get my hands on one of their 2P boards.

      Hopefully, this announcement will drive down the price of the 246 and 248, so that 1GB RAM per proc. will cost less than $1,000...

      Of course, I've considered waiting for the dual cored Opterons to come out in 2005. Then again, AFAIK the dual core should still look like a single core to the motherboard.

      Gee... all this talk is just making me want all of this stuff more.

  69. Marketing is wierd by James+Lewis · · Score: 1

    According to the article, and FX-51 is the same as an Opteron 148. Looking at newegg, you can see that the Opteron 148 is $404 while the FX-51 is $689. Interestingly, you can see that the FX-51 has a LOT more reviews (and I would assume more sales) so their marketing strategy seems to be working, but I sure would feel cheated to find out I had paid ~300 more than I could have for a product.

  70. Why should AMD have bothered with faster K6s? by bani · · Score: 1

    By then the Athlons were out. The profit margins on Athlons were far better, so why bother pumping money into the older designs?

  71. No. by StarKruzr · · Score: 1

    Only in integer calculations, iirc. Sorry, try again.

    --

    +++ATH0
  72. Re:The Conclusion... Pimping by SFBwian · · Score: 1
    Yes, AMD has released them for those purposes. The fact that TechReport didn't test them using database query benchmarks is their perrogative. If you didn't see anything you liked on the testing methods page, you probably shouldn't have bothered reading it any more.

    I can see from your previous posts that you're a TechReport fanboi and that's fine, but don't try and justify their consumer level report as something useful for IT professionals on Slashdot.

    Actually, I think I was trying to point out that it's useful for non- IT professionals on Slashdot, rather than people such as yourself. Still, even in my short term experience with Slashdot itself, IT professionals seem to be only one type of Slashdot reader, among many.

    I personally find TechReport useful for good first impressions of new (to me) technology, hardware, and links to the same sorts of things at other sites. Joe Blow benchmarks on typical end-user hardware are good for that. This isn't what you expected (ie, you must be a new visitor there), and I guess my suggestion to you is to either use it as it is, or just not read TechReport. You're don't seem to be particularly interested in what they have to say.

    --
    I'm looking to get rich. I've got steps #2 (????) and #3 (PROFIT!) planned out, but am having trouble coming up with #1.
  73. Intel is competing as best they can (ie., poorly) by branchingfactor · · Score: 3, Interesting

    I respectfully disagree that Intel was ever competing with itself. They've been competing with AMD in the desktop/workgroup market for a long time now, and with Sparc/MIPS/Alpha in the enterprise market as well. Intel developed the high-clock rate Pentium 4 to compete directly with AMD's Athlon, after the Athlon whooped the Pentium 3. The Intel marketing people saw how much leverage AMD got from being the first to 1GHz with their Athlon and they didn't want that to happen again. Intel was *severely* embarrassed by loosing the race to 1Ghz. The Intel marketing people incorrectly concluded that the market was buying clock rate rather than performance. So they mandated a CPU that would have the highest possible clock rate, irrespective of performance. That's the P4/Netburst. Now they are getting burned on performance because AMD has shifted the dialog from clock rate to benchmarks. Intel also saw with the success of the Pentium M that benchmarks can triumph over clock rate. So now Intel has finally realized that they misread the market and they have to change their entire product strategy.

  74. Not necessarily by neye_eve · · Score: 1

    IIRC, the FX-5x series is unlocked for the purposes of overclocking. Don't know if the Opterons are or not, but you know for a fact that overclocking is available for the FX series. So, maybe you're paying a bit more for this aspect of it?

    neye

    1. Re:Not necessarily by Wesley+Felter · · Score: 1

      After I paid $700 for an Athlon 64 FX, I won't dare to burn it up by overclocking...

  75. Re:Ho Hum by Ninja+Programmer · · Score: 1

    x86-64 only doubles the number of registers.

    Which is more than plenty for practical purposes. Having 32 registers with a 32-bit opcode size just means that 16 of them will be rarely used, and you run out of opcode space, and therefore operate in fewer modes requiring more instructions to perform the same work.

  76. Re:Ho Hum by fmorgan · · Score: 1

    That you are VERY, VERY wrong.

    Of course a $200 WalMartPC is cheaper than a dual G5, but a dual G5 is cheaper than a Dell 2xXeon (not even talking about an Itanium).

    And I heard a senior officer at Wolfram (Mathematica) saying that they couldn't really compare speeds because the Dell 2xXeon was so inferior.

    I have no idea about the new AMDs, but this was last year (July 2003), so there might be some news next month at WWDC.

  77. Re:Ho Hum by Sj0 · · Score: 1

    Don't bother to let a little thing like the facts get in your way! GO ANTI-x86!!

    (really. Ignore those extra registers on the a64. They're just for show....or something......really.)

    --
    It's been a long time.
  78. Re:Think it's time to seriously consider AMD by haggar · · Score: 1

    Thanks. I bought a copy of c't once when I visited Rheinland Pfalz, but my command of German was woefully unadequate to appreciate it. Still, I could see that it's an excellent magazine, written by people who know what the heck they're talking about.

    BTW, your English is excellent.

    --
    Sigged!
  79. Re:Ho Hum by Anonymous Coward · · Score: 1, Informative

    > No they don't. Ever compared a compiler outputted innerloop with a hand coded one?

    Yea, they do, and yea, I have. Absolute Perfection(tm) of an instruction rarely performance of a system makes. Compilers may not be perfect, but they're usually well beyond good enough. Good compilers are have been really quite good at register allocation for some quite time.

    > Which RISC needs more than that?

    Those that need to accomplish the various options offered. Ever compared assembly instructions sets by using them in actual practice? I have. In many cases you use 1 Risc or 1 Cisc. Often you need 1-5 risc to 1 cisc. Rarely, if ever, do you need 2+ cisc to perform a risc. Ever wonder why rule of thumb for sizing a RISC system generally requires twice the memory as a CISC (for similar loading)?

    Perhaps the CPU will, ultimately, be left accessing that memory? Would that imply a possible rough doubling (on average) of CPU-Memory bandwidth?

    Yea, it does.

    > I'm sure Intel knows very well that they could perform better if they broke x86 compatibility.

    Why would they know this? (Seeing how it's just not true.) There have been, and are now, lots of processors - and exactly NONE have flat out outcompeted the x86. We have fractional advantages here, or there, but NOTHING out there that HAS "broke x86 compatability" has screamingly outperformed it.

    Linux supports LOTS of CPUs. And, x86 still pretty much sits at the top of the performance heap.

  80. Re:Great, but how fast can they compile the kernel by mcpkaaos · · Score: 1

    Well there's always the Win2K source...

    --
    It goes from God, to Jerry, to me.
  81. Re:Ho Hum by 10Ghz · · Score: 1

    Well, Apple uses GCC as their compiler, they obviously believe that it's well enough optimized for G4/G5

    --
    Lesbian Nazi Hookers Abducted by UFOs and Forced Into Weight Loss Programs - -all next week on Town Talk.
  82. Re:Ho Hum by alienw · · Score: 1

    Use whatever is the standard compiler for that platform. For G5, that would be GCC. After all, how good a compiler is largely determines how fast the processor is, so if a really fast processor doesn't have a good compiler, then it's worthless for nearly all practical purposes.

  83. Re:Ho Hum by alienw · · Score: 1

    Well, for one, if nobody optimizes their applications for Altivec, then that technology is not very useful, at least for desktop users. Therefore, if your objective is to use the system for, say, running Gentoo, then it is a fair benchmark.

    Synthetic benchmarks are almost never fair, simply because every processor has its own advantages. For instance, any benchmark which heavily relies on fast system memory would run circles around the G5 when run on an Opteron, since the Opteron connects to memory directly (and uses NUMA in a dual-proc configuration). If your benchmark merely does computations, then the G5 may very well be faster. If I try it out with a real-world application, I will know for sure that it is either slower or faster at that application.

  84. It's wireless processor by MMHere · · Score: 1

    So it's a wireless processor, huh.

    They musta replaced all those tiny little interconnects on the chip surface with billions of WiFi access points, then?

  85. Re:Think it's time to seriously consider AMD by sumdumass · · Score: 1

    would a dual core proccesor count as 2 procesors like the hyperthreaded intels? if so i woulnd think they would make a quad core because of microsoft licensing. You would need a server product to run a single quad on one machine. Intel already has a server line of processors they charge way more for. maybe that were we would see it happen at.

  86. Re:Ho Hum by TheLink · · Score: 1

    5 bytes? Do a search on Google. The average x86 instruction is about 3.2 to 3.8 bytes in length (in practical usage).

    Most RISC have fixed length instructions - 4 bytes is common.

    In my experience most x86 programs have a smaller footprint on disk and more importantly in memory compared to SPARC versions of the same program.

    Probably SPARC sucks, but ok pick the RISC of your choice and get back to me with the associated binary executable sizes (e.g apache executable, gzip executable) compared to equiv x86 binaries.

    Think of x86 CISC in modern CPUs as compressed RISC instructions, decompressed on the fly at the CPU core after being squished through narrow bandwidth channels ( memory/cache ).

    Nowadays given the huge difference between CPU core speeds and memory speeds, "compression" is actually useful. A more compressed CISC could be even better (but I doubt there's a market for one).

    "I'm sure Intel knows very well that they could perform better if they broke x86 compatibility. They just cannot afford to do that."

    BTW Intel has this processor called the Itanium. And in _practice_ it's not really x86 compatible[1] (nor is it RISC). It performs quite well, FP is much better than Opteron (and other RISC processors). But actually FP is easy to improve - just add more FP units.

    So they've already tried to drop x86. Been trying that for the past 10 years.

    [1] Itanium Windows compatibility
    Itanium RHL compatibility

    --