AMD Athlon 64 Performance Preview
k-hell writes "It seems like X-bit Labs have gotten their hands on an 'engineering sample of the AMD Athlon 64 2800+ processor'. Damage at Tech Report is writing that 'This is really fun, but I am a little concerned about their memory latency numbers.'"
I don't think that word means what they think it does.
One line blog. I hear that they're called Twitters now.
temporary mirror here
You should include the full quote of Damage, because just quoting out of context can be misleading. Here's the full paragraph (emphasis is mine):
This is really fun, but I am a little concerned about their memory latency numbers. They haven't specified what units those numbers are in, but latency numbers come out of programs like cachemem in CPU cycles. Obviously, processors with higher clock speeds will see more clock cycles pass per second than processors with lower clock speeds. One must convert those numbers into comparable units, such as nanoseconds, in order to compare CPUs at different clock speeds. I do expect the Athlon 64 to have low memory access latencies because of its integrated memory controller, but I don't think the gap will be so great as the X-bit numbers would seem to indicate.
So, the worry is about the units the latency numbers are expressed in. And when you'd see the numbers below, you get an idea why it is so:
Athlon 64 2800+
Athlon XP 1.6GHz
Pentium 4 2.8C
See it for yourself.
--
Error 500: Internal sig error
Did they burn their fingers?
On the picture of the chip, it's stamped "Copyright 2001".
One line blog. I hear that they're called Twitters now.
So, what could be the problem? The problem is that AMD is trying to support too many instruction sets. AMD should have axed 3DNow! and swallowed its pride. Supporting MMX, SSE, and SSE2 is sufficient.
When you try to put everything and the chicken sink into a chip, you inevitably pay for it with a slower clock speed.
The processor doesn't take to many benchmarks, but you can't fault it too much. It's nice to see some numbers are the CPU in 32 bit mode, but let's not forget that EVERYTHING here is 32 bit (OS, programs, etc). I'd LOVE to see a comparison between 32 bit programs running under and identicle OS versions that do and don't support 64 bits (Win XP vs Win XP for x86-64 for example). I'd suspect that the performance would go up with a 64 bit OS (especially on the games, where drivers and such play such a big part). Considering it's clockspeed, it holds up very well. The fact that it's almost never far behind a current athlon with an identicle performance rating (which is actually like 400 mhz faster) shows that it can definatly run things well. This isn't the horridly crippled performance that we've all heard about with the Itanic.
So what's my take on all of this? I think that this shows that the x86-64 can really become a success. I know some of you out there are thinking "Why would I buy one? I've got a 2.4 ghz Octium 7 and my PC is faster than that thing." That may be true, but many people aren't like you. My fastest computer is a PIII 933, so even at 1.6 ghz that Athlon64 can run circles around my best PC. If you are using a PC that's even a year or so old, you can probably benefit alot if you were to move up to an Athlon64 when it comes out.
My notes on some specific benchmarks:
Now my objections to the benchmarking
My final thoughts are this: it looks quite promising, and I can't wait to see more. More and more people with comeout with benchmarks as time goes on, and with the Opteron released now, we'll soon see benchmarks of it in SMP mode against other chips in both 32 and 64 bit OSes with 32 and 64 bit code. Either way, it looks like it's more successful than the Itanic.
Comment forecast: Bits of genius surrounded by a sea of mediocrity.
'This is really fun, but I am a little concerned about their memory latency numbers.'
Note, this is not because they're bad numbers, but rather because the units aren't specified, and may be clock cycles, which wouldn't be a fair comparison to the other processors.
couldnt they have used any one of the 64bit linuxes? this sounds like a bs review to me;
I want 2D games back.
At $500, you have just limited yourself to a mid-ranged Athlon or P4.
More like very high range. A quick glance at Pricewatch will show you that currently, the fastest Intel, 3.06 ghz, costs $388. The fasted Athlon, the 3000+, costs $320. Even the 2.8 Xeon is $425.
If you don't understand any of my sayings, come to me in private and I shall take you in my German mouth.
Printable version here.
Escher was the first MC and Giger invented the HR department.
Really? I could have sworn...
Sig.i>
Note this is for Motherboard/Processor/RAM, so, if you get a $425 processor, you won't be finding a board and good ram for it for $75
lemme sum up the article:
- WAHOO! CHECK dis shit OUT! we got an athlon 64 chipz0rz!
- it's beefcake, dood. memory controller insIIIIDE!
- we're just gonna test it with 32 bit shizzle.
- it's like, good at some things, not so good at others.
anyway, here's something to consider: the sample they tested is 2800+ per AMD's performance rating spec, and it runs at 1.6gHz. yeah. most of the tests and graphs n stuff show it running around the level of a P4@2.53gHz. alright, so it doesn't exactly match the P4@2.8gHz. but think about this:
it's running at 1.6gHz!
nevermind the fact that it doesn't squash the fastest P4 they tested it against into the ground. it's just amusing to see how good the architecture is of the A64. i dunno. i think it's pretty cool, anyway.
anyway, seriously speaking: what use is testing a processor touted as being a 32-bit compatible 64-bit chip, when _NO_ 64 bit apps were used in testing?!
"uh. well. it ran the 32 bit stuff fine. and uh. it didn't fry."
Remember that this "preview" probably violates one or more NDA's, and it is of a desktop x86-64 chip that is scheduled for September release. In the meantime, it's bigger brother, the Opteron, who has more memory bandwidth, (usually) more cache, and multiple processor support will be released in less than a week (Tuesday to be exact).
Now the reviews that out in 4 days time should be much more interesting reads. I expect to see someone do a solid x86-32 vs. x86-64 comparison using Linux, maybe other OS's too. And yes, probably even Quake frame rate results. =)
or, for the more intuitive of you: simply click the "11" at the top. it takes you to the 11th page without you having to click through all the pages.
here's a graphic:
Pages:1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11
click here^^
AMAZING! MODERATORS: MOD ME UP AS +10000, INFORMATIVE! 8P
To add more fuel to the "its only an engineering sample", check out the date on the
processor itself.
Imagine, with nearly two years of time to improve on this piece of silicon just what is in store for the Clawhammer. Personally, i'm waiting for it so I can finally upgrade my Athlon 600.
I always get the shakes before a drop.
Being that I am an AMD employee, I know for a fact that B0 (the version of clawhammer that they are benchmarking) is a very early rev, and should not even be considered when thinking about final benchmarks. Geez.
Well, one thing is for sure, xbit labs just blew their chances of ever getting their hands on another upcoming bleeding edge technology again.
Where is the 1.6 ghz P4 in this?
That's irrelevant. The proper way to square off chips is based on money. In other words a $200 dollar chip should go head to head with another $200 dollar chip, and an $800 chip goes against another $800 chip.
That's the only way to get fair results that are independant of implementation details. Clock rate the chip runs at is an implementation detail. It's not important. What's important is WORK per DOLLAR. That's the only thing that matters. Period.
But here's another way to look at it - Itanium also has an x86 layer, but because it's really just an emulation, its performance sucks.
So I view this as a huge success. Why ? Because an Athlon-64 will be able to run "legacy" 32 apps at the same speed, while 64 apps will run faster.
You'd probably wonder why this is the case. Well, IMNSHO it's not because of the wider registers/ALUs, etc, but because of other improvements to the Instruction Set Architecture, like the 8 extra registers (16 total). Because you only have 8 registers on a regular x86, compilers can register-allocate very little. Adding 8 more registers means that you can keep more stuff in the register file, and you don't have to go to the stack (data cache) every single time.
The Raven
Umm...please tell me I wasn't the only person who tried to click on that. :-o
And the masses cried out, "09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0!"
Wasn't that designed with 64-bit processors in mind?
Now this doesn't make much sense, because how can you run that in 64-bit mode even though you have a 64-bit processor, when the OS is running in 32-bit mode?
Or am I dreaming?
3DNow! is essentially a subset of SSE. Removing it would not save a signifigant number of transistors.
You mean this one coming out next week?
The other enhancements to the chip more than make up for it. In fact, it should typically perform anywhere from 5-25% faster than an equally clocked AthlonXP.
The big downside to a longer pipeline is that it increase the performance penalty of flushing the pipeline after a mispredicted branch. However, AMD did two things here. First the Athlon64 has a better branch predictor than the AthlonXP, which should reduce the number of mispredicted branches. The second thing that AMD did was to change the pipeline slightly so that the extra pipeline stages don't need to be flushed on a missed branch.
FWIW AMD is kind of following Intel's lead in this regard. While the P4 has a VERY long pipeline (twice the length of the Athlons), it doesn't take all that much of a performance hit for a missed branch because they don't need to flush all of their pipeline. The P4 also has excellent branch predictors (second only to the old AMD K6 in the x86 world from what I've seen). That being said, the P4 still has a very long pipeline and very small L1 caches, as well as a number of odd quirks as to what instructions can be where in the pipeline, so the end result tends to be a reduction in the average IPC of the chip vs. the AthlonXP/64.
As for clock speeds, my understanding is that AMD is aiming for the 2.0GHz range when they bring the chip to market.