AMD Takes Opteron To 2.4GHz
EconolineCrush writes "AMD has added a series of Opteron x50 processors to its workstation and server line that push the K8 core up to 2.4GHz. The Tech Report has tested the latest single and dual-processor Opterons against more than 20 other processors, including exotic Pentim 4 Extreme Edition chips, affordable Athlon 64s, and everything in between. Even if you have no interest in AMD's latest workstation chips, the review is worth checking out to see how two dozen of the fastest workstation and PC processors stack up in rendering, scientific computing, speech recognition, and even gaming tests."
So what is that, 4000+?
From the article to save everyone the 16 pages of boring charts and graphs.. Conclusions "If I were building (or, implausibly perhaps, buying) my ultimate workstation right now, I'd want a pair of Opteron 250s beating at the heart of it. The benchmarks speak volumes. For single-processor systems, the Opteron 150 looks like the fastest x86 CPU on the planet. In a multiprocessor configuration, the Opteron 250 scales up very well, even without the benefit of an optimal memory configuration, a NUMA-aware OS, or 64-bit extensions. By contrast, Intel's dual Xeons are a little bit disappointing. They perform relatively well in CPU-bound apps like 3D rendering programs, which are also largely well optimized for SSE2. But in memory-bound applications where dual Xeons ought to do well, like video encoding, the Xeons' slow bus and RAM hold them back. One has to wonder what Intel is hoping to accomplish by saddling its workstation-class processors with older, slower technology. Even a single Pentium 4 benefits greatly from additional bus and memory bandwidth. Surely a pair of Xeons on shared bus ought to have this same advantage. Intel's apparent willingness to forego such enhancements in favor of adding ever-larger on-chip caches to the Xeon is puzzling"
Hmmm.
They were lagging there for a while but the benchmarks depict a good story. Looks like the opteron is going to be yet another AMD chip that is great for gaming (and most other things). Hopefully a cheaper price than the p4's will really contribute to yet another dominating year for AMD.
I have been running my Opteron 248 at 2400Mhz. Sisoft seems to equate this to a PR rating of 3900+. I have no idea how it calculates this so please take that with a measure of salt.
AEnertia
Witty, tag line goes here
Intel's apparent willingness to forego such enhancements in favor of adding ever-larger on-chip caches to the Xeon is puzzling"
Why is it puzzling? In their historic "Intel Inside" world, they were basically competing against themselves. Adding a bigger cache is not only easy, but a cheap way to rake in more cash without doing much R&D work.
It's not until recently that AMD has starting "schooling them" on what improvement really means. Just look at how Intel is going to use the AMD x86-64 method in the upcoming Intel 64bit platform. And now "If I were building (or, implausibly perhaps, buying) my ultimate workstation right now, I'd want a pair of Opteron 250s beating at the heart of it. The benchmarks speak volumes. For single-processor systems, the Opteron 150 looks like the fastest x86 CPU on the planet..." And this is at much lower mhz!
I believe Intel had thought they had reached monopoly status, which really they had, and the culture had become complacent. This did not happen at the underdog AMD, who has recently been able to quickly leapfrog Intel's offerings.
-Pete
Soccer Goal Plans
I work for a medium sized school divison and this year happens to be the year when my school will get new equipment. When the meetings about what to buy have occured every single time someone has mentioned getting AMD chips instead of Intel those in managerial positions have been quick to say, " No, AMD chips are slow and run very hot. They wouldn't be a good choice for what we are looking for." Now this insight is coming from people who..
A. Are mainly concerned about the bottom line as far as price goes.
Which makes zero sense being AMD chips are more then competitivly priced compared to Intel.
B. Are supposedly in the know about technology.
Which is obviously not the case as many of them still think AMDs have the same cooling problems they did 5 years ago.
These chips are cheaper now then their Intel conterparts and from my experience run at the same speed if not faster. AMD is finally getting on the ball as far as putting the clock speed measured in Ghz to provide direct comparission which really needed to be done in order to compete. Combined with their dedication to inovation, i.e. the 64 bit processor that Intel has still yet to bring to market make me really suppport what the company is trying to do. I really hope to see more reviews like this that I can pass on to those in charge in hopes of getting away from an Intel only environment.
Just to clarify I do not hate Intel I just think that between the two there Intel does not always win outright and AMD should be considered before any purchases are ever made.
Please do not let scientific accuracy interfere with the intended humourous/interesting/insightful value of this comment
but where are the 940 pins? I count only 939.
Do not try to read the dupe, thats impossible. Instead, only try to realize the truth
What truth?
There is no dupe
Has anyone done any compiling tests? If so PLEASE share! -Benjamin Meyer
Do you changes clothes while making the "chee-chee-cha-cha-choh" transformation sound?
Can somebody please benchmark a dual AMD opteron against a dual PPC 970 (MAC G5), using Linux in 64bit mode. What is with all these kids benchmarking opterons in 32bit mode?
I'll refrain from mentioning how you are horribly off-topic but at any rate, give this a try for answers somtime. www.google.com In short the Extreme editions are a Northwood core with 2MB of L3 cache.
I had an AMD64 chip with the heat spreader.
I went to take the heat sink off the other day, and the vacuum that formed between the heat spreader and heat sink caused the chip to get yanked right out of the closed ZIF socket when I tried to get the heat sink off.
Then, after reinstalling the chip, apparently the heat spreader has become disconnected from the core internally, because the CMOS reports rising temperature up to 120C, but even the heat spreader isn't warm if I turn the system off and get the heat sink off again.
So be very careful. It takes about 10 minutes to take the heat sink off the heat spreader if you used a coating of grease that covers the whole top of the chip, even if you used a thin coat. You have to wiggle the heat sink and gently pull up for quite a while before that vacuum is broken. It doesn't help that the heat sink design makes it impossible to see the chip or slide the heat sink to the side.
And be aware that it doesn't take a whole lot of force to yank the chip right from the ZIF, possibly damaging things in the process.
I've had enough abrasive sigs. Kittens are cute and fuzzy.
I know I might be nitpicking here, but I really wish the Opteron series of chips incorporated AMD's Cool 'n Quite technology.
From what I read on their website, with a supporting motherboard and driver (2.6.5 has a native driver) the Athlon 64 can scale down to 800Mhz, cool enough for the system to shut the HSF and case fans completely offoff.
One demo I saw online had a Athlon 64 SFF computer playing a DVD while the AMD cool 'n quite app was shoing the the CPU at 80hz and the system was totally silent.
Coudn't server rooms benfit from the reduced electricuty bill also?
So you've been living under a rock for the last four years?
Stick Men
Nice article, but we need more Linux-centric bench and test sites.
.. means cheaper CPUs at the bottom end.
My needs are simple, Most of my systems would do just fine with a Duron 800MHz or even slower CPUs. With the advent of new high end chips heralds lower prices at the low end.
It's gotten to the point where only a few popular niches need to even bother with anything but the absolute bottom end chips. I.e. Gaming, video encoding and servers (Faster chips mean more users on a server).
Scientific Computing clusters, Compiling lots of code everyday etc.. are other niches worth noteing. For Web browsing, Office productivity, educational apps and old games I advise you to buy the chip so far behind the curve it won't be available in a few weeks.
--= Isn't it surprising how badly I spell ?
It will be interesting to see how Intel responds to these challenges - c't speculates that the future Pentiums will use the architecture they have in the Pentium M line (developed in Israel). If they're smart they'll introduce a dual core CPU based on the Pentium M architecture, if AMD is smart they'll modify their existing designs and beat intel to the punch again.
Speaking as a business user, I'd welcome an emphasis on ergonomics and environmental concerns over raw speed. I'd rather have silent systems that do not overload the air conditioning with enormous amounts of heat than screamers which spend 99.9 % of their time waiting for the user to press a key anyway.
"There are already a million monkeys on a million typewriters, and Usenet is NOTHING like Shakespeare." - Blair Houghton
I'm sorry, but you do not truly understand how modern x86 chips work. You don't like them because they are 'CISC-mired'? The funny thing is, underneath they really aren't CISC. They are RISC to the bone. Each and every x86 instruction you feed a modern processor is deconstructed into many smaller RISC-like ops and they are processed independantly. Small register set you say? You don't get to address them directly, but both the Athlon and the P4 have had many more registers than the x86 ISA would lead one to believe for a really long time. The x86-64 is nice because now you get many more registers of larger size, directly addressable.
In a purely dollar/performance comparison, nothing beats x86.
You can't legislate goodness. Let each to his own destiny, by will of his freely made choices.
And still, that "technology of the 70's" is the fastest thing there is. Sure, you might have some CPU's that are even faster, but they are also alot more expensive. Those CPU's usually get better performance by adding lots and lots of cache to the CPU.
If PowerPC (for example) is SOOOO much better, why doesn't it wipe the floor with x86? Sure, it's competetive, but it does not annihilate x86
As to being register-starved... Again, that doesn't seem to hurt the performance of these chips that much. And if you use Opteron/Athlon64 with an 64bit OS, you get double the number of GP/SSE-registers (instead of 8, you get 16).
As to CISC... Modern x86-CPU's are very much RISCue in the inside. And being RISC does not automatically mean that it's somehow better. You can have kick-ass CISC-CPU's, and you can have crappy RISC-CPU's.
Lesbian Nazi Hookers Abducted by UFOs and Forced Into Weight Loss Programs - -all next week on Town Talk.
Seeing as how Linux runs on just about every popular chip out there, your comment has no basis behind it. As a matter of fact, we can thank Windows for our dependancy on the x86 architecture, as it is the only platform it CURRENTLY runs on, until they finally release the 64bit version. You are an idiot who has only been first once in your life, in the shit for brains line. I do feel sorry for you however, it must be difficult living life with an IQ of 12 and a hockey helmet on head.
College-Pages.com - Online Colleges and Degrees
You admitted it yourself - it's AMD/Intels fault. They make the chips. Blame them. Microsoft are as much to blame as Linus is in this one. No amount of fanboy paranoid fascism is gonna change that :-P
That's a lot of animosity for an anonymous coward. Someone needs a good solid bitch slap
This aging architecture has maintained an incredible price/performance ratio. At this price level, the only thing that compares is the G5. A comparable UltraSPARC, Itanium, POWER, or PA-RISC system will cost much more.
As for registers, AMD64 doubled the number of general-purpose registers, which are already subject to register renaming.
It's my hope that AMD and Intel will be eternal rivals, and continue advancing and coming up with better products.
x86-64 only doubles the number of registers.
Something tells me if the billions of dollars per year in R&D were spent on a fully-RISC system, externally and internally, it would be much faster, saving a stage or two of decoding and other internal mangagement, saving a lot of design and testing hassles.
For over half a decade, DEC held its own against Intel with $70M / year CPU development budget, when Intel was spending $2B. They only got tripped up with poor marketing and problems and delays in fabbing the EV6 and EV7.
For one, being fully RISC made it far easier to validate the chip design because it didn't involve lots of work disassembling instructions and keeping track of the results, predicting properly and so on.
As to being register-starved... Again, that doesn't seem to hurt the performance of these chips that much.
I am pretty sure that in any modern x86 design the actual operation of the processor's registers is virtualized and accessing them is no faster or slower than accessing cached stack memory.
Since Microsoft has been the dominant OS vendor for a long time, people have written their software for Microsoft products. And since the cheapest Microsoft solution was on PCs, people wrote their code for Wintel systems.
... So Intel provided them with the 8088, which was cheaper (16 bits internal data, 8 bit external data bus. 20 bit adress bus.).
Since the software was compiled to architecture-native code, it wasn't easily runnable on other architectures. As a result, even as Microsoft products were available for Alpha, third-party applications weren't. So people stuck with x86, 'cause that's where the software was.
And the reason the PC architecture was cheap? IBM built it that way. They liked the 8086, but it was too expensive to implement (16 bit data bus)
I don't know if the PowerPC architecture was available at the time.
tasks(723) drafts(105) languages(484) examples(29106)
Funny enough, that is exactly what Intel has planned. They will also be shooting for dual-core, and then quad-core CPUs in the next 2-3 years. On the flip side, AMD has announced that they are already capable of producing dual-core Opterons, and are simply waiting for the market demand to meet their capabilities. After all, it doesn't make much sense to introduce something now that can wait until later. It extends the life of the current line and increases the return on R&D.
> x86-64 only doubles the number of registers.
Yes, and modern compilers make quite good use of what they have. Why build/buy more when the returns diminish so rapidly?
> Something tells me if the billions of dollars per year in R&D were spent on a fully-RISC system, externally and internally, it would be much faster, saving a stage or two of decoding and other internal mangagement, saving a lot of design and testing hassles.
RISC requires higher memory bandwidth, and more memory, to hold the small function instructions. Modern "CISC" cpus are RISC inside - they load a small "powerful" instruction and execute multiple RISC instructions within themselves - without loading all those instruction words over the memory bus.
> For over half a decade, DEC held its own against Intel with $70M / year CPU development budget, when Intel was spending $2B. They only got tripped up with poor marketing and problems and delays in fabbing the EV6 and EV7.
DEC had a radically different cultural ethic than Intel. While modern American Corporate Management would like to delude themselves otherwise, poor corporate ethics drive costs higher in many ways.
> For one, being fully RISC made it far easier to validate the chip design because it didn't involve lots of work disassembling instructions and keeping track of the results, predicting properly and so on.
These days CPUs are basic software design. Build and test a risk core. Then, build and test an interpreter. Layers my boy, layers. A technique as old, and testable, as the hills.
http://smc.vnet.net/timings50.html is a start.
Sad times when a
Dell Precision 650, 4X3.06GHz Xeon, 512KB L2, 4GB, Win XP Pro V5.1 [35]:
is slower than a
Athlon 2800+, 512 KB cache, 333 MHz FSB, Win XP Pro
We have decided to buy/construct a fast 64-bit workstation where we can run our simulations without chrashes. Now my question to you fellow slashdotters is:
The budget is a few thousand euros, not over 10 000 (this is comparable in dollars). What would the best bang-for-the-euro be? Single-Dual? Xeon-Opteron-Itanium2? It must at least contain 4 gig of RAM.
Thanks for your suggestions, looking at several "comparison-websites" has only made us more confused.
int main(void) {while(1) fork(); return 0;}
Nice troll...
Modern "x86" chips have a very advanced RISC-like architecture, x86 is only the instruction set - nothing more.
It takes a little bit of extra circuitry to translate the x86 ops instead of using a brand new ISA, but it's well worth it for the backward compatibilty IMHO.
Robert
For real 64bit performance visit VooDoo software tuning and download the 64bit 2004 Longsword Gamez Demo. The Download of UT2004 64-bit English Linux Demo is around 200Mb.
Am I the only one who finds it funny/intresting that they would think to include an XP-M overclocked to 2.4ghz in this test?
It's pretty funny that even on a benchmark like this, the mobile is known to be the best option for Athlon Barton XP's.
Anyone looking to spend $100 on a cpu the choice is clear. Pretty solid performance in comparison too. Usually fitting right in around the P4 3.4ghz level of performance. AND ONLY $300 cheaper haha.
They test on games because the benchmarks are easy for them to perform (they've been doing the same set of tests for quite a while) and because they cater mostly to a game-playing audience. AMD almost always compares favorably to Intel in game tests.
Still, I agree that it would be nice to see benchmarks of 64-bit software running on a 64-bit operating system. Is UT2kX 64-bit ready for Linux?
I'm looking to get rich. I've got steps #2 (????) and #3 (PROFIT!) planned out, but am having trouble coming up with #1.
I am neither a PPC or a x86 pundit (microcontrollers here), but as I understand it, the PPC has much more registers than the x86, and this, in my experience, does give a speed advantage.
Sigged!
On my dual opteron 244 (2 x 1.8 GHz), compiling a 2.4.26 kernel and its modules:
$ make bzImage && make modules
takes 96 secs.
Did the c't article mention that the Pentium M was developed in Israel, or did you just know otherwise? I had the impression that the fact that the Centrino was developed in Haifa, was kept relatively secret, as Israel seems to always attract so much controversy.
Sigged!
This isn't a valid argument "If PowerPC (for example) is SOOOO much better, why doesn't it wipe the floor with x86? Sure, it's competetive, but it does not annihilate x86
". Unfortunately, the best tech solution doesn't always win.
Otherwise a pseudo-OS from a company I won't name wouldn't have conquered monopoly status....
Has anyone actually checked on the price? Take a gander over at http://www.amd.com/us-en/Processors/ProductInforma tion/0,,30_118_609,00.html?redir=CPT301 The new 150/250/850 models are $637/$851/$1514 comparatively. Compare that to the *48 models, which are still expensive.
Does AMDs increased market share herald a a new strategy from AMD? Back "in the day" we all used to love AMDs more than Intels because of the great performance/cost ratio.
I would love to have a pair of opterons, but the prices are ridiculous. I miss the old AMD...
Well, apparently x86 is crappy. It's "70's technology". If it's so damn crappy, why aren't any of the "better" CPU's wiping the floor with x86? I'm not talking about popularity here, I'm talking about performance. If x86 is so damn crappy, how come it performs so well with so little money when compared to some of those "better" CPU's? How come those "better" CPU's are only suitable for running Photoshop-filters [flamesuit = ON]?
Like I said, there ARE faster CPU's out there. But they also cost alot more and they usually get their performance by having lots and lots of cache, and not by being "better CPU-architecture". How well would Opteron 150 (for example) perform if it had 8 megs of L2-cache?
Lesbian Nazi Hookers Abducted by UFOs and Forced Into Weight Loss Programs - -all next week on Town Talk.
If you need the cooling and are overclocking or whatever you should lap your heatsink to a mirror ffinish, and if possible the proceessor too. Your correct, thermal compound is only there to fill in microscopic imperfections. Check out this for an interesting comparison of thermal compund effectivness.
"Sic Semper Tyrannosaurus Rex."
The board they used only had memory off of one of the CPU's in the dual Opteron systems. I wonder how much this effected performance in the compute intensive benchmarks, or even memory bandwidth benchmarks.
A benchmark with a Quad Opteron, like the Tyan Thunder K8QS would be interesting. It has memory hanging off of each of the four CPU's. (Which opens a lot of questions about how that memory is managed.. Is there some sort of memory affinity per processor? What is the performance hit if one CPU needs memory off another?)
Also, you have the 32 bit vs 64 bit stuff. It would be interesting to compare the best 32 bit optimized app against a 64 bit optimized version. Maybe something like MPEG2 compression of an HD stream.
So you have a Mac, I assume? Well, the x86 architecture *is* a crock, but modern chips only emulate it. They're RISC, superscalar and all kinds of other things I don't fully understand, but certainly weren't featured in the Intel 8086 you might be thinking of ;-)
When I am king, you will be first against the wall.
Maybe he was refering to the K5?
Well, apparently x86 is crappy. It's "70's technology". If it's so damn crappy, why aren't any of the "better" CPU's wiping the floor with x86? I'm not talking about popularity here, I'm talking about performance.
For some time, better CPUs did wipe the floor with the x86 family CPUs of their time, but didn't have the advantage of the Wintel monopoly. As for now... if you gave Intel's R&D budget to Motorola to spend on developing the PowerPC, I bet it would improve considerably.
I wish reviewers would start including a section on how much power the systems take. I'd like to replace my home server box and would like to minimze power consumption since it runs 24/7. I'd also like to replace my 'desktop' PC and would like to minimize fans because I like to listen to music on it.
Well, it's very hard to create benchmarks that effectively compare architectures. And vendors (all vendors) muddle the waters.
But actually some x86 (Xeons) are more expensive then PPCs. Some other great chip architectures like Alpha died (or dieing) because of neglect from their corporate owners;
The x86 succeed in part because of the Wintel duopoly. There was a Windows NT port for the Alpha (and PPC, but I think this was only in beta) but it didn't went anywhere.
As for your "photoshop" comments, probably you are thinking about Apple's benchmarks; they use photoshop because that's what a big part of their marketshare cares about.
AS for cache, I don't know about the 8Mb for the Opteron, but the G5 has 512k of L2 and 64Kb of L1. I guess that with 8Mb it might improve a lot also.
As for things like the most used CPUs, probably the x86 don't have that claim. Things like the Z80 probably outnumber them a lot; and they are much cheaper than x86s.
Is it possible for Intel/AMD to make those chips so I can turn off the x86 emulation crap and use internal RISC directly ... so everyone could slowly migrate away from x86 and CISC?
"There are already a million monkeys on a million typewriters, and Usenet is NOTHING like Shakespeare." - Blair Houghton
The article mentioned it and it was no news for any regular reader of that magazine.
There seems to be a consensus, at least among those with a technical interest that the quality of many Israelian products is not affected by controversial aspects of what religious fundamentalists, narrowminded politicians, government and military do.
And the same is true for products of other countries that always attract so much controversy, like i.e the US of A.
605413? Yes, it's a prime.
It doesn't matter. Every computationally intensive program makes use of SSE/MMX/3Dnow instructions, which are pretty much meant to bypass the CISC part of the processor.
Well, it's very hard to create benchmarks that effectively compare architectures.
Why? Take a Linux distro (Gentoo), recompile it for both architectures, run some application benchmarks that represent what you typically do with your computer (i.e. an MP3/Ogg/Divx encoder, Mozilla, OpenOffice...). The thing is, x86 usually wins those types of comparisons. I think that was the point the parent was trying to make.
I imagine they're not going to have much of a choice if Sun does decide to move to Opteron over the next 5-7 years. With no more Ultrasparc V coming, it's common guess/knowledge that the multicore Ultra IV's are the end of the road for Sparc.
Well maybe the K6-2 450 did, but by then super 7 boards like the Asus P5A-B were already disappearing from stores. And it wasn't overclock friendly, a 450 would more often fail than succeed to run at 500. The K6-3 was better, but even more on the edge, and was just too expensive. Sure, the K6-2+ was a whole lot better, but it was way too late, and much too rare. It was nearly impossible to buy one of those.
Even if you have no interest in AMD's latest workstation chips, the review is worth checking out to see how two dozen of the fastest workstation and PC processors stack up in rendering, scientific computing, speech recognition, and even gaming tests."
In fact, nowhere do I see the implication that this would be a suite of tests regarding the Opteron's capability as a server processor. To quote the article itself:
First and foremost, we tested the Opteron 150 and 250 as workstation processors, not as server chips. Perhaps next time out we'll consider doing a suite of server-oriented benchmarks, but PC-based workstations are a more common part of our coverage around here.
Overall, I just don't think that you're the audience they intend to read the article. Still, you're not all Slashdotters, so I would assume (as the submitter probably did) that some people might benefit from it anyway. Hence, on Slashdot.
Who knows? If I was into gaming, I'd be following the gamers sites, not reading Slashdot.
No, you wouldn't, because gaming sites don't do comprehensive benchmarks of hardware on multiple processors and video cards. They do game reviews. If you're wondering which new processor or video card is going to give you the most bang for your buck on a game you are already planning to buy (or already have), that doesn't help much.
And if I'm not mistaken, Slashdot has a games section. I could say the same thing about music (re: every single iTunes- and Rio-oriented article). Slashdot is not limited to just server administrators and programmers (whether or not this is a bad thing in the first place, isn't the point). So, if an article doesn't cater to some people, is it really fair to bitch and moan that it doesn't do that for niche X?
Finally (and back to the main topic), TechReport is an enthusiast site. It doesn't handle every practice that products could be intended for, and doesn't handle all products for a given practice. Take it with a grain of salt.
I'm looking to get rich. I've got steps #2 (????) and #3 (PROFIT!) planned out, but am having trouble coming up with #1.
Sigh...
I remember the days when I could at least drink a good cup of coffee and catch up with the local newspaper while waiting for my kernel compile to finish...
Sigh...
Actually, when that earlier parent post said "70's technology" I believe it was referring to UNIX, not the x86 architecture. Was x86 even around in the 70's?
Also the PPC isn't just suitable for running PS filters... it was always known as the best CPU for running emulators, for reasons I'll never know (hey I'm not a programmer nor did I even care to emulate old video games), plus it's always been a pretty respected CPU. I doubt the cost of the PPC is any higher than x86, but the rest of the Apple-unique hardware is what makes the costs of Macs so damn high.
I agree though about the cache thing. It seems like x86 CPU manafacturers just load on more cache and claim the architecture is highly improved, when there are only minor differences in actual CPU architecture.
Oh well, it's not like it makes a difference because software just gets more bloated and inefficient as CPU speed increases. It's not like I can go to a web page any faster or load up a game or type an essay any faster than I could 4 years ago, yet I was using a 225mhz computer back then, and now I'm using a 1666mhz computer. Does that make sense do you? It doesn't to me.
...that it supports WiFi now?
I'm so confused...
You know you can compile with GCC and tell it use BOTH the 387 FPU and the SIMD units? Is that what happened? Or was it only using one set the whole time in both cases? You can use both the FPU and SIMD at the same time? Last thing I heard, any sort of SIMD instructions used the FPU registers, thus making it impossible to use both simultaneously. If I'm wrong (all I'm really familiar with is MMX, which is kind of old at this point), please correct me, because that sounds like it could definitely come in handy in some circumstances. Let me know.
Take a dollar, divide it by 100, take two and call me in the morning.
Article from Businessweek http://www.businessweek.com/technology/content/may 2004/tc20040519_3643_tc024.htm Points out that AMD is regaining market share. Nice to see. It helps that HP is an AMD shop. Also points out AMD 64 bit chip has the gamers all agog, and Intel so far with nothing to compare.
"Work is the curse of the drinking class" Oscar Wilde
...and read some of the papers on x86-64. AMD has a lot more than 16 registers *internally*. But it turned out the performance got WORSE when they were exposed to the compiler, instead of managed internally. If they can't even manage such a trivial change well, it's likely the RISC compilers would do worse than a CISC-RISC decoder stage.
If you want to make a computer performing anything close to modern standards, you're going to have to deal with interdependency of the RISC instructions anyway (pipelining, hyperthreading, multiple cores etc.) Don't you think Intel or AMD would provide a "native" interface if the decoder stage was really holding them back?
In short, I'm sure the engineers at AMD and Intel have picked apart x86 code and said "With perfect compilation to our internal structure, how much faster would it be?" and found that it simply isn't the way you describe it.
Kjella
Live today, because you never know what tomorrow brings
you are assuming that compilers like GCC are equaly optimized for all platforms?
K6-2 was pretty fast in integer math but it had a very lousy FPU.
It was faster than P2 in some things, but due to that FPU, even Celeron badly beat it in the one area that values speeds over anything - games.
K6-3 was fast, but they never managed to run it beyond 450MHz, and even that was too late.
And Pentium wasn't "same day", but previous generation, so of course it was slower. Nor were there any that were same speed, the last pentium was 233MHz and K6-2 started from 300.
My HSF (which came standard with the AMD 64 processor) actually had a layer of grease on it. Not a pad, actual thermal grease. Seemed kind of thick, too.
STOP MISUSING APOSTROPHES, YOU MORONS!!!
The Opterons have 1 MB (8 Mb) L2 cache where the G5 has .5 MB (4 Mb) L2.
At similar clockspeeds I think the performance is fairly similar, though the Opterons may do better in a dual-CPU configuration since they have on-chip memory controllers and thus more total memory bandwidth.
I'd like to see a head-to-head shootout using top compilers (an often overlooked issue) for both.
Galileo: "The Earth revolves around the Sun!"
Score: -1 100% Flamebait
I was just reading this in the gcc info today (trying to discover the absolutely optimal compiler options for my box):
`-mfpmath=UNIT'
Generate floating point arithmetics for selected unit UNIT. The
choices for UNIT are:
`387'
Use the standard 387 floating point coprocessor present
majority of chips and emulated otherwise. Code compiled with
this option will run almost everywhere. The temporary
results are computed in 80bit precision instead of precision
specified by the type resulting in slightly different results
compared to most of other chips. See `-ffloat-store' for more
detailed description.
This is the default choice for i386 compiler.
`sse'
Use scalar floating point instructions present in the SSE
instruction set. This instruction set is supported by
Pentium3 and newer chips, in the AMD line by Athlon-4,
Athlon-xp and Athlon-mp chips. The earlier version of SSE
instruction set supports only single precision arithmetics,
thus the double and extended precision arithmetics is still
done using 387. Later version, present only in Pentium4 and
the future AMD x86-64 chips supports double precision
arithmetics too.
For i387 you need to use `-march=CPU-TYPE', `-msse' or
`-msse2' switches to enable SSE extensions and make this
option effective. For x86-64 compiler, these extensions are
enabled by default.
The resulting code should be considerably faster in the
majority of cases and avoid the numerical instability
problems of 387 code, but may break some existing code that
expects temporaries to be 80bit.
This is the default choice for the x86-64 compiler.
`sse,387'
Attempt to utilize both instruction sets at once. This
effectively double the amount of available registers and on
chips with separate execution units for 387 and SSE the
execution resources too. Use this option with care, as it is
still experimental, because the gcc register allocator does
not model separate functional units well.
So it looks like you can use the FPU and the SIMD registers at the same time! I would love to see if this actually makes an improvement in the execution speed.
Did you mount a military-grade, variable-focus MASER on an unlicensed artificial intelligence?
I was thinking of the -mfpmath=sse,387 option in this instance. http://gcc.gnu.org/onlinedocs/gcc-3.4.0/gcc/i386-a nd-x86-64-Options.html#i386%20and%20x86-64%20Optio ns
Forget thrust, drag, lift and weight. Airplanes fly because of money.
Seldom do you get to see the performance of quad opterons in benchmarks. With amd's hypertransport technology, the 800 series decimates even the newest 4mb L3 cache xeons. Perhaps, however, it's that reviewers realize they don't need to show the complete scaling potential of the opteron to make the point that it's a superior workstation cpu.
According to the article, and FX-51 is the same as an Opteron 148. Looking at newegg, you can see that the Opteron 148 is $404 while the FX-51 is $689. Interestingly, you can see that the FX-51 has a LOT more reviews (and I would assume more sales) so their marketing strategy seems to be working, but I sure would feel cheated to find out I had paid ~300 more than I could have for a product.
By then the Athlons were out. The profit margins on Athlons were far better, so why bother pumping money into the older designs?
Only in integer calculations, iirc. Sorry, try again.
+++ATH0
I can see from your previous posts that you're a TechReport fanboi and that's fine, but don't try and justify their consumer level report as something useful for IT professionals on Slashdot.
Actually, I think I was trying to point out that it's useful for non- IT professionals on Slashdot, rather than people such as yourself. Still, even in my short term experience with Slashdot itself, IT professionals seem to be only one type of Slashdot reader, among many.
I personally find TechReport useful for good first impressions of new (to me) technology, hardware, and links to the same sorts of things at other sites. Joe Blow benchmarks on typical end-user hardware are good for that. This isn't what you expected (ie, you must be a new visitor there), and I guess my suggestion to you is to either use it as it is, or just not read TechReport. You're don't seem to be particularly interested in what they have to say.
I'm looking to get rich. I've got steps #2 (????) and #3 (PROFIT!) planned out, but am having trouble coming up with #1.
I respectfully disagree that Intel was ever competing with itself. They've been competing with AMD in the desktop/workgroup market for a long time now, and with Sparc/MIPS/Alpha in the enterprise market as well. Intel developed the high-clock rate Pentium 4 to compete directly with AMD's Athlon, after the Athlon whooped the Pentium 3. The Intel marketing people saw how much leverage AMD got from being the first to 1GHz with their Athlon and they didn't want that to happen again. Intel was *severely* embarrassed by loosing the race to 1Ghz. The Intel marketing people incorrectly concluded that the market was buying clock rate rather than performance. So they mandated a CPU that would have the highest possible clock rate, irrespective of performance. That's the P4/Netburst. Now they are getting burned on performance because AMD has shifted the dialog from clock rate to benchmarks. Intel also saw with the success of the Pentium M that benchmarks can triumph over clock rate. So now Intel has finally realized that they misread the market and they have to change their entire product strategy.
IIRC, the FX-5x series is unlocked for the purposes of overclocking. Don't know if the Opterons are or not, but you know for a fact that overclocking is available for the FX series. So, maybe you're paying a bit more for this aspect of it?
neye
x86-64 only doubles the number of registers.
Which is more than plenty for practical purposes. Having 32 registers with a 32-bit opcode size just means that 16 of them will be rarely used, and you run out of opcode space, and therefore operate in fewer modes requiring more instructions to perform the same work.
That you are VERY, VERY wrong.
Of course a $200 WalMartPC is cheaper than a dual G5, but a dual G5 is cheaper than a Dell 2xXeon (not even talking about an Itanium).
And I heard a senior officer at Wolfram (Mathematica) saying that they couldn't really compare speeds because the Dell 2xXeon was so inferior.
I have no idea about the new AMDs, but this was last year (July 2003), so there might be some news next month at WWDC.
Don't bother to let a little thing like the facts get in your way! GO ANTI-x86!!
(really. Ignore those extra registers on the a64. They're just for show....or something......really.)
It's been a long time.
Thanks. I bought a copy of c't once when I visited Rheinland Pfalz, but my command of German was woefully unadequate to appreciate it. Still, I could see that it's an excellent magazine, written by people who know what the heck they're talking about.
BTW, your English is excellent.
Sigged!
> No they don't. Ever compared a compiler outputted innerloop with a hand coded one?
Yea, they do, and yea, I have. Absolute Perfection(tm) of an instruction rarely performance of a system makes. Compilers may not be perfect, but they're usually well beyond good enough. Good compilers are have been really quite good at register allocation for some quite time.
> Which RISC needs more than that?
Those that need to accomplish the various options offered. Ever compared assembly instructions sets by using them in actual practice? I have. In many cases you use 1 Risc or 1 Cisc. Often you need 1-5 risc to 1 cisc. Rarely, if ever, do you need 2+ cisc to perform a risc. Ever wonder why rule of thumb for sizing a RISC system generally requires twice the memory as a CISC (for similar loading)?
Perhaps the CPU will, ultimately, be left accessing that memory? Would that imply a possible rough doubling (on average) of CPU-Memory bandwidth?
Yea, it does.
> I'm sure Intel knows very well that they could perform better if they broke x86 compatibility.
Why would they know this? (Seeing how it's just not true.) There have been, and are now, lots of processors - and exactly NONE have flat out outcompeted the x86. We have fractional advantages here, or there, but NOTHING out there that HAS "broke x86 compatability" has screamingly outperformed it.
Linux supports LOTS of CPUs. And, x86 still pretty much sits at the top of the performance heap.
Well there's always the Win2K source...
It goes from God, to Jerry, to me.
Well, Apple uses GCC as their compiler, they obviously believe that it's well enough optimized for G4/G5
Lesbian Nazi Hookers Abducted by UFOs and Forced Into Weight Loss Programs - -all next week on Town Talk.
Use whatever is the standard compiler for that platform. For G5, that would be GCC. After all, how good a compiler is largely determines how fast the processor is, so if a really fast processor doesn't have a good compiler, then it's worthless for nearly all practical purposes.
Well, for one, if nobody optimizes their applications for Altivec, then that technology is not very useful, at least for desktop users. Therefore, if your objective is to use the system for, say, running Gentoo, then it is a fair benchmark.
Synthetic benchmarks are almost never fair, simply because every processor has its own advantages. For instance, any benchmark which heavily relies on fast system memory would run circles around the G5 when run on an Opteron, since the Opteron connects to memory directly (and uses NUMA in a dual-proc configuration). If your benchmark merely does computations, then the G5 may very well be faster. If I try it out with a real-world application, I will know for sure that it is either slower or faster at that application.
So it's a wireless processor, huh.
They musta replaced all those tiny little interconnects on the chip surface with billions of WiFi access points, then?
would a dual core proccesor count as 2 procesors like the hyperthreaded intels? if so i woulnd think they would make a quad core because of microsoft licensing. You would need a server product to run a single quad on one machine. Intel already has a server line of processors they charge way more for. maybe that were we would see it happen at.
5 bytes? Do a search on Google. The average x86 instruction is about 3.2 to 3.8 bytes in length (in practical usage).
Most RISC have fixed length instructions - 4 bytes is common.
In my experience most x86 programs have a smaller footprint on disk and more importantly in memory compared to SPARC versions of the same program.
Probably SPARC sucks, but ok pick the RISC of your choice and get back to me with the associated binary executable sizes (e.g apache executable, gzip executable) compared to equiv x86 binaries.
Think of x86 CISC in modern CPUs as compressed RISC instructions, decompressed on the fly at the CPU core after being squished through narrow bandwidth channels ( memory/cache ).
Nowadays given the huge difference between CPU core speeds and memory speeds, "compression" is actually useful. A more compressed CISC could be even better (but I doubt there's a market for one).
"I'm sure Intel knows very well that they could perform better if they broke x86 compatibility. They just cannot afford to do that."
BTW Intel has this processor called the Itanium. And in _practice_ it's not really x86 compatible[1] (nor is it RISC). It performs quite well, FP is much better than Opteron (and other RISC processors). But actually FP is easy to improve - just add more FP units.
So they've already tried to drop x86. Been trying that for the past 10 years.
[1] Itanium Windows compatibility
Itanium RHL compatibility