Linux Shootout: Opteron 150 vs. Xeon 3.6GHz Nocona

Short version: Xeon RIP. by Anonymous Coward · 2004-08-15 00:39 · Score: 5, Interesting

No message here. Oh, did you know that an Athlon64 3000+ is within 2fps of a P4 3.4 Extreme Edition in Doom 3?

Look up the prices for those two items.

Re:Short version: Xeon RIP. by hmniq · 2004-08-15 00:43 · Score: 0

There seems to be a rather heated debate on a gaming forum elsewhere regarding whether there are any video cards out there capable of handling Doom 3 in Ultra Quality mode. Any info?
Re:Short version: Xeon RIP. by Anonymous Coward · 2004-08-15 00:47 · Score: 5, Informative

Athlon64 3000+ (2GHz): $167
Pentium 4 3.4GHz Extreme Edition: $1025
Re:Short version: Xeon RIP. by PhotoBoy · 2004-08-15 01:08 · Score: 1

I've got an X800 XT (a Pro modded to XT with a BIOS flash) and I've clocked it to 540/571.

I can run in Ultra mode in 1280x1024 with 2xAA and 16xAF and it's perfectly playable. I get 58.9FPS on timedemo 1 with those settings.

I think part of the secret is my FSB is at 265 because I've overclocked my P4 to 3.74Ghz, the extra memory bandwidth prevents any of the sudden jerking Ultra mode might give on stock systems. I'd expect an Athlon 64 939 to be even better with the onboard controller.
Re:Short version: Xeon RIP. by hmniq · 2004-08-15 01:43 · Score: 0

Is your Pro-modded-to-XT a 512 or 256 MB card? It sounds like the latter from your comment about the FSB. How would one go about confirming that Doom 3 isn't compressing the textures even though one has set it to Ultra mode?
Re:Short version: Xeon RIP. by tuba_dude · 2004-08-15 02:15 · Score: 1

Assuming the console variables aren't lying to you, you can set a few options by bringing up the console CTRL-ALT-~. I don't have the actual names at the moment, but you should be able to google for them with very little trouble.

--
"The government of the United States is not, in any sense, founded on the Christian religion."
Re:Short version: Xeon RIP. by PhotoBoy · 2004-08-15 02:25 · Score: 1

It's a 256Mb card, the 512Mb cards will probably be out during the autumn "refresh" of nVidia's and ATi's ranges.

I'm not sure how one could confirm that there is no compression except by what John Carmack has said ultra mode does.

Also when you compare Ultra mode to High detail mode the image quality is definitely higher suggesting that there is no compression (or less compression at least).
Re:Short version: Xeon RIP. by kesuki · 2004-08-15 03:24 · Score: 0

The pentium 4 has pretty much been pushed as hard as it can go, adding 64-bit mode just extends the product life of the chips being sold and amd's product which was planned around 64-bit still has room to grow. Intel would not be in any kind of dire straights except for Microsoft sinking thre Itanic by saying retain x86 compatability or find your own os. Thast being said the pentium 5 is in works, and it will run between 6-10 ghz and absolutely smoke everything the opteron can do, except asm code. (because amd is designing it's chips around the needs of everyione including asm programmers, and intel is just designing faster mhz around the same basic capabilities)
yeah, intel it hurting now, but they were hurting when amd was the first to 1 ghz too,

--
https://www.gnu.org/philosophy/free-sw.html
Re:Short version: Xeon RIP. by Anonymous Coward · 2004-08-15 04:09 · Score: 0

Thast being said the pentium 5 is in works, and it will run between 6-10 ghz and absolutely smoke everything the opteron can do, except asm code.

Why don't you put down that pipe and stop talking about things you don't know anything about, m'kay?
Re:Short version: Xeon RIP. by Jeff+DeMaagd · 2004-08-15 06:51 · Score: 1

I think it should go without saying that Doom 3 has nothing to do with the typical market for Xeon.

I'd also hesitate to make a choice based on performance on a single program unless that is the only program I ever plan to use.
Re:Short version: Xeon RIP. by Ben+Hutchings · 2004-08-15 07:06 · Score: 5, Informative

Thast being said the pentium 5 is in works, and it will run between 6-10 ghz and absolutely smoke everything the opteron can do, except asm code.

The design intended to become the Pentium 5 (Tejas) was cancelled in favour of Pentium M derivatives. Intel basically had to give up on the Netburst micro-architecture and is now concentrating on increased parallelism (multiple cores) rather than extreme clock rates.
Re:Short version: Xeon RIP. by Anonymous Coward · 2004-08-15 07:22 · Score: 0

well, the opteron crushed the zeon, if YOU BOTHERED TO READ THE FUCKING ARTICLE.

across the board.

let me repeat that again.

ACROSS THE BOARD.

sheesh.

typing this from a pentium extreme, power notebook.

(powernotebooks.com)
Re:Short version: Xeon RIP. by Anonymous Coward · 2004-08-15 10:10 · Score: 0

Just a note, the person who mentioned doom3, was referring to the Pentium EE, which is meant for the market that plays doom3.
Re:Short version: Xeon RIP. by TheLink · 2004-08-15 19:23 · Score: 2, Funny

Summary: the cancelled P5 chips would have smoked everything including themselves.
--
- Too many replies beneath your current threshold

opteron by the+linux+geek · 2004-08-15 00:41 · Score: 1

i thought it was Athlon 64 now?

Re:opteron by lachlan76 · 2004-08-15 00:43 · Score: 5, Informative

Athlon 64 is the name used for the desktop line, and Opteron is the name used for the server/workstation processors.
Re:opteron by NoMercy · 2004-08-15 02:39 · Score: 2, Informative

The critical point being that Opterons unlike there Athlon 64 cousins have more hyper-transport interfaces, allowing them to be used in a multy-processor enviroment, depending on the seriese number up to 8-way systems can be built, though I think the largest Tyan's only carry 4 at present.

There's other minor diferences but *goes off dreaming about a 4-way processor in a database server*

Memory by lachlan76 · 2004-08-15 00:41 · Score: 4, Insightful

To be able to show the real potential of the Opteron, you need to have more than one processor.

This lets you take advantage of the on-die memory controller, by letting each processor do it's own memory work, rather than making the Northbrige do all the work.

If you want to use a single processor, you might as well use an FX-Whatever, since they are just an Opteron without MP capability and only one HT bus.

Re:Memory by Ianoo · 2004-08-15 00:53 · Score: 5, Informative

Provided you have a NUMA-aware operating system, that is. The OS needs to know which memory is attached to which processor, since access to memory attached to the same processor on which a thread is running will obviously be faster and lower latency than going across hypertransport to a different processor and waiting for an answer.
Re:Memory by Anonymous Coward · 2004-08-15 00:54 · Score: 1, Interesting

your argument is pretty fucking retarded. it is NOT always better to move away from a BUS architecture.

simple proof: consider an array of 486 processors connected by an ultra-fast, ultra-low-latency network.

long boring argument for idiots like you: suppose you have a single value in memory, called 'GO!!', it has the value 0 or 1. If it's ever set to 1, all the CPUs have to do some work and write an answer or something. If it's set to 0, the CPUs have to stay idle, otherwise the electricity man comes and shoots you.

On a Xeon (or any other bus) system: *THE INSTANT ANY CPU WRITES A ONE TO GO!!, ALL THE CPUS KNOW IT!! AND CAN START WORKING INSTANTLY!* No other messages need to be sent. Nothing. This is called "bus snooping" and you can only do it on, well, a bus, or other _shared_ communications medium.

On the Opteron architecture (we call this NUMA, or "point-to-point"), as soon as one CPU writes a value to the 'GO!!' area, well, that's _just the beginning_. It has to tell another CPU in the system that it just did that. etc etc.

Do you get it now?? When designing a NUMA system, you can choose _at which point_ you want to make things point-to-point, below which you want things to remain shared, so that tightly coupled (shared) data can be thrashed around efficiently, without blasting millions of messages here and there, saturating router buffers etc.

For example, the now-famous SGI Altix systems are point-to-point, just like Opterons, but only in two-CPU blocks, each of which has a shared (Itanium) bus. Even though this bus protocol explicitly supports up to 4 CPUs on the same bus. (this is unlike, say, HP's 4-CPU systems that as far as I'm aware have all 4 Itaniums on a bus, and only go point-to-point beyond that)
Re:Memory by sbennett · 2004-08-15 01:00 · Score: 1

Provided you have a NUMA-aware operating system, that is.

Like Linux 2.6, you mean?
Re:Memory by Anonymous Coward · 2004-08-15 01:01 · Score: 0

You need to have more than 2 processors and each must have it's own memory bank to show the advantage of the on die memory controller. Otherwise they are all talking through chip 1 and it is acting as a northbridge.

I get the impression that this is an Athlon-64 vs what the New P4 with 64bit extensions will be rather than a real server test.

Most the time it appears the AMD wins, which is fine by me. I still wish everybody would dump the backward compatability with the 8086 though. The CPU still has to bootstrap in 16bit real mode FFS!!!!
Re:Memory by Ianoo · 2004-08-15 01:01 · Score: 1

Like Linux 2.6, you mean?
Yes. Yes I do.
Re:Memory by Waffle+Iron · 2004-08-15 01:27 · Score: 4, Informative

THE INSTANT ANY CPU WRITES A ONE TO GO!!, ALL THE CPUS KNOW IT!! AND CAN START WORKING INSTANTLY!
How do they know? By cache coherence signals transferred between the CPUs. This isn't free and consumes bus bandwidth.
The first CPU can't "instantly" write the value either, because it must first obtain exclusive ownership of that cache line by checking with the other CPUs.
On the Opteron architecture (we call this NUMA, or "point-to-point"), as soon as one CPU writes a value to the 'GO!!' area, well, that's _just the beginning_. It has to tell another CPU in the system that it just did that. etc etc
It has to use some communication resource to update the other CPUs on the state of that cache line. Just like the bus-based situation.
Re:Memory by lachlan76 · 2004-08-15 02:10 · Score: 1

This is assuming that the CPUs are all working on the same data set.

If (theoretically) you got all of the CPUs working on an independant dataset then the CPUs would scale much better.
Re:Memory by kasperd · 2004-08-15 02:47 · Score: 2, Insightful

I still wish everybody would dump the backward compatability with the 8086 though. The CPU still has to bootstrap in 16bit real mode FFS!!!!

I agree. But at least AMD did something right when designing the AMD64 architecture. Virtual 86 mode and segmentation was eliminated from the 64 bit mode, but they still exist in 32 bit mode of course. Completely eliminating 8086 compatibility was not really so much of an option. Backward compatibility is part of the reason for the AMD64 success. But it would have been nice, if they had at least offered a way to boot the CPU in 64 bit mode. As it is now, the CPU boots in 16 bit real mode, then you switch to 32 bit protected mode, and only after that have been done you can switch the CPU into 64 bit mode.

--

Do you care about the security of your wireless mouse?
Re:Memory by Anonymous Coward · 2004-08-15 02:49 · Score: 0

IIRC, theres no real nead for NUMA software on a 2-way Opteron system -- the gains are minimal at best. I don't think any Linux distro (or Win2003) enables NUMA on small Opteron boxes.
Re:Memory by Anonymous Coward · 2004-08-15 03:07 · Score: 0

It is also assuming the CPUs don't have cache... and we all know that's terribly common in high-end server CPUs. :-)
Re:Memory by upsidedown_duck · 2004-08-15 04:04 · Score: 1, Informative

What about Solaris, IRIX, AIX, etc.?

UltraSPARCs have been running with memory local to CPUs for quite a while now, for example.

--
-- "Makes Little Debbie look like a pile of puke!" - Moe Szyslak
Re:Memory by Anonymous Coward · 2004-08-15 04:12 · Score: 0

So let me get this right. When it was "Athlon 64 vs. P4 Xeon" it was unfair because it should have been Opteron. Now that it's Opteron it's unfair because it has to be TWO opterons? Give me a break.
Re:Memory by Serveert · 2004-08-15 05:16 · Score: 1

You still must give CPU affinity explicitely but yes it's numa aware.

--
2 years and no mod points. Join reddit. Because openness is good.
Re:Memory by Anonymous Coward · 2004-08-15 05:48 · Score: 0

Indeed, Amd called it SUMO (Sufficiently Uniform Memory Organization). The NUMA factor is so low that using a NUMA unaware OS doesn't really hurt your performance that badly. IIRC benchmarks have shown about 5% difference.
Re:Memory by ottffssent · 2004-08-15 06:32 · Score: 4, Insightful

Actually, even without a NUMA-aware OS, the worst-case dual (and almost quad) memory latency is less than a Xeon's.

What really sets the Opteron apart in MP scenarios is the bandwidth. Each chip gets 6.4G/sec to memory: add more chips, get more bandwidth. The Xeon on the other hand has to share its 6.4G/sec with all the chips in the system, which severely limits its scaling. A quad Opteron has over 25G/sec of aggregate memory bandwidth, while a quad Xeon is stuck with its 6.4G shared 4 ways. That's half the bandwidth of a 400MHz P4 - no wonder the quad Xeons are often barely faster than the duals.

Add to this that cache snoops and other bus traffic all have to share the same FSB on the Xeon whereas on the Opteron local memory accesses don't touch the HT links at all. For a standard 2P system, this frees up 3.2G/sec of HT link bandwidth, and a NUMA-aware OS only increases the efficiency of the system.

Despite Intel's recent marketing push, they really don't have the best CPU, and don't have the best system either. There are still considerable advantages to choosing a Xeon system but these days they have little to do with the chip or the board and a lot to do with Intel backing. That's an advantage that will quickly evaporate as industry gets comfortable with non-Intel parts.

--
High-speed Road Trip (18.000KPH)
Re:Memory by andreyw · 2004-08-15 07:18 · Score: 2, Informative

Well, while Solaris is *technically* going to be ported to x86-64 eventually... IRIX and AIX won't for sure.
Re:Memory by EMN13 · 2004-08-15 08:41 · Score: 1

If I remember correctly, the speedup of turning on this numa-awareness on an opteron multiprocessor system is negligible even on benchmarks which should stress it. It's a neat feature but doesn't appearantly really help the opteron (partially because the HT bus is very fast and the number of CPU's involved isn't so great).

--Eamon
Re:Memory by Anonymous Coward · 2004-08-15 09:24 · Score: 0

How do they know? By cache coherence signals transferred between the CPUs This isn't free and consumes bus bandwidth.

Tell me then - how does the cost of these 'signals' scale with the number of CPUs on the bus?

It has to use some communication resource to update the other CPUs on the state of that cache line. Just like the bus-based situation.

Of course! All the CPUs we're talking about here cache memory, so of course there must be some scheme whereby they can share memory and yet use their caches appropriately. The point is that the way in which the communications is done is different.

I can't believe you've been modded 4, informative.

Look, I'll rephrase it again. Consider the following two situations.

1) There are 16 people in a room that's on fire, but the only way they can escape is if each of them enters a 10-digit password, in the correct order, into the strange lock on the door.

2) There are 16 people in a building that's on fire, but the fire's on another floor. The fire exit is unlocked - but they won't leave until someone tells them there's a fire in a building. It would be good if they got out soon.

There are two 'optimal' solutions here, and they're different. For 1), the fastest thing to do would be for the people to spontaneously arrange into tree-like communication: pairs of people share their codes, then pairs of pairs, etc, until the last guy receives two sorted sets of password that he can 'riffle' together as he enters them into the lock on the door. It would presumably NOT be fast for everyone to start yelling "I'm person number X, and my password is Y!".

Consider 2). The fastest thing to do is for someone to simply yell, as loudly as possible, "FIRE!!". It would _not_ be faster for people to gather into a tree, with each guy telling two others that there is a fire...

The simple example I'm giving is simply trying to point out that something I'm afraid you are going to try denying until you die: BROADCASTING IS FASTER ON BUSES!

Or in other words - yes, communication on a bus is not free (what a stupid concept) but no matter how much AMD stock you own, the additional penalty you pay for adding agents who LISTEN to bus broadcasts is ZERO.

Do you get it now?

Sure, many real-world computing situations don't involve anything like broadcasting (and would gain a heck of a lot more from e.g. the additional memory bandwidth likely to be present in a decent NUMA system) but some DO - if you don't believe me, try getting your logic analyzer out and watching what happens on a 4-way Xeon system bus as a JVM runs through some tightly threaded code, or tries to do a distributed garbage collect. It's quite elegant - you'll be surprised.
Re:Memory by Waffle+Iron · 2004-08-15 09:30 · Score: 1

If busses scale so well, then why isn't the Internet on a single electrical bus?
Re:Memory by Anonymous Coward · 2004-08-15 10:11 · Score: 0

Because, happily, the Internet does not involve x million people trying to share read/write access to the same piece of data.
Re:Memory by Anonymous Coward · 2004-08-15 10:45 · Score: 0

Except in /. incidents...
Re:Memory by Anonymous Coward · 2004-08-15 14:07 · Score: 1, Informative

> As it is now, the CPU boots in 16 bit real mode, then you switch to 32 bit protected mode, and only after that have been done you can switch the CPU into 64 bit mode.

This is no different from any other CPU existing in both 32-bit and 64-bit versions. PowerPC, SPARC and MIPS all boot into 32-bit physically addressed mode, then have to switch into 64-bit paged mode once the OS starts.

The only chips that don't are pure 64-bit devices like Alpha and IA-64.
Re:Memory by Anonymous Coward · 2004-08-16 10:09 · Score: 0

Err, no.

These in-architecture tests are OK, but... by Amiga+Lover · 2004-08-15 00:47 · Score: 4, Interesting

It's good to see benchmarks between processors in the same family, but is there anywhere that regularly tests CPUs across families? x86, PPC, Sparc, VIA etc. I'd like to see comparisons like that to see how various architectures strengths & weaknesses stack up

Re:These in-architecture tests are OK, but... by Anonymous Coward · 2004-08-15 00:49 · Score: 0

C'mon, admit it.

You just want to see the Opterons slaughter everything that come in its way :-)
Re:These in-architecture tests are OK, but... by Anonymous Coward · 2004-08-15 01:01 · Score: 0

It's good to see benchmarks between processors in the same family, but is there anywhere that regularly tests CPUs across families? x86, PPC, Sparc, VIA etc. I'd like to see comparisons like that to see how various architectures strengths & weaknesses stack up

That would be dull. It would be Opteron vs everyone else, and at the end the only chip left standing would be an opteron.
Re:These in-architecture tests are OK, but... by Technonotice_Dom · 2004-08-15 01:02 · Score: 1

I'm interested too.

The closest I've found is SPEC CPU2000 results (warning, large page). Loads of various systems, put against a common benchmark, CPU2000. Should be details on the website about how it tests them.

There are a few links at this link to various CPU2000 results for specific tests - integer tests, floating point tests and throughput.
Re:These in-architecture tests are OK, but... by DarkMantle · 2004-08-15 01:19 · Score: 2, Interesting

It would not be possible to get an accurate reading from this. Because the operating system would have to be compiled for each hardware platform. Also to keep it fair we would have to do the same for the benchmark software. I am aware of no OS that will go accross all platforms. BSD comes close (since OS-X is BSD based) but doesn't quite get all architectures.

--
DarkMantle I been bored, so I started a blog.
Re:These in-architecture tests are OK, but... by slayer99 · 2004-08-15 04:00 · Score: 1

I am aware of no OS that will go accross all platforms

Debian runs on at least 10 architectures. That'd do for a start, no?

--
Martin Brooks / Slayer99 #linux / UIN 2178117
Re:These in-architecture tests are OK, but... by mdamaged · 2004-08-15 06:33 · Score: 0

Debian is a distribution, it is not an OS.
Debian is for these purposes, Linux.

--
Someone asked me the difference between ignorance and apathy, I told them I don't know and I don't care.
Re:These in-architecture tests are OK, but... by gunpowder · 2004-08-15 06:59 · Score: 3, Informative

Perhaps this is what you are looking for
Re:These in-architecture tests are OK, but... by DAldredge · 2004-08-15 12:11 · Score: 1

OSX is Mach Based. The userland of OSX is FBSD based.
Re:These in-architecture tests are OK, but... by andreyw · 2004-08-15 16:58 · Score: 1

http://www.netbsd.org/ covers all relevant platforms.

Of course it runs NetBSD! ;-)
Re:These in-architecture tests are OK, but... by mgcarley · 2004-08-22 22:30 · Score: 1

Part of the problem there is just that: they are all based on (is some cases) completely seperate architecture... Hell, for most people, it's hard enough trying to distinguish the "MHz ratings" differences between Intel (P4) and AMD (Athlon) or Intel (Xeon) and AMD (Opteron)... and so on and so forth.

I suppose one way to do it is, money providing, get absolute, top of the line everything...

So, the best PPC chip, best SPARC chip, best x86 from Intel, best x86 from AMD, best x86 from transmeta if you really want, best alpha if you wanted to include it and best mips if you wanted it...

OR: another thing you could do: get the most amount of CPU you can for a given amount of money... for example, up to $500 on P4, up to $500 on Xeon or if you wanted, Itanium, up to $500 on Athlon, up to $500 on Athlon64, up to $500 on opteron... and if you can get a SPARC/PPC/whatever for $500 then go for it (and of course, spend how ever much is needed on the rest of the system, so that you can get equivalent/same motherboards, ram, power supplies, graphics etc where possible, because of the proprietary nature of certain configurations - such as Mac and to an extent PPC or Sun SPARC processors)... so like, try and spend the same amount of money on each given system...

That would be an easy way for AMD to win, in most cases, though...

--
Founder & COO, Hayai India (hayai.in) / USA (hayaibroadband.com) // t: @mgcarley

set-up benchmarks? by Metteyya · 2004-08-15 00:48 · Score: 1, Interesting

There's been some controversy about benchmarks being set up in favor of Intel, thus allowing it's processors - even having lower amount of MHz - to win most.

But I've seen some computers which, having only switched from AMD to comparable (in clock frequency) Intel processor, got some boost in speed. Especially in games. And I've seen some changing from Intel to AMD, suffering loss of speed - mainly in games.

I don't know if recent games (I've seen these effects mainly with Neverwinter Nights) are compiled with optimization for Intel line of 686, but that's the fact - AMD performed worse. In these particular computers, with some particular games of course.

Re:set-up benchmarks? by Anonymous Coward · 2004-08-15 01:05 · Score: 1, Informative

that's interesting that you say this, because just about every gaming beckmark and time demo clearly shows that the athlon64 is far superior to the pentium4 in gaming, and that's with comparing a 2ghz a64 and a 3.2ghz p4...
Re:set-up benchmarks? by Anonymous Coward · 2004-08-15 01:12 · Score: 0

yeah really. wheres the evidence? liars.
Re:set-up benchmarks? by Anonymous Coward · 2004-08-15 01:20 · Score: 0

>yeah really. wheres the evidence?
liars.
right here.

>liars.

Actually, he's speaking the truth. The 3.4GHz P4 is getting its performance/price-ass kicked by a much cheaper AMD part.
Re:set-up benchmarks? by Anonymous Coward · 2004-08-15 01:34 · Score: 0

but I've seen some computers which, having only switched from AMD to comparable (in clock frequency) Intel processor

Dood, how exactly did you plug an Intel processor into an AMD motherboard?? If you are for real and the only thing you changed was the processor we are talking Pentium 2/K5 levels here.
Re:set-up benchmarks? by Anonymous Coward · 2004-08-15 02:25 · Score: 0

You're being annoyingly vague. Which AMD processor? Which Intel processor? These comparisons aren't independent of the processors involved.

It sounds like you're talking about K6 vs Pentium on socket 7, which were the last socket-compatible AMD and Intel processors. It would make sense too, since K6 had a worse FPU than the Pentium (though it generally did better per clock on integer) and so suffered on FPU intensive games like Quake despite being a good desktop CPU. If so, your point, while true, is not very relevant to a comparison of K8 and P4/Nocona Xeon.
Re:set-up benchmarks? by hattig · 2004-08-15 02:27 · Score: 1

That's very odd, because if there is anything that anybody (i.e., reviews on most hardware sites) agrees on, it is that AMD is better than Intel for games, especially the Athlon 64 which whips Intel's butt.

Maybe Quake III ... that is very Intel optimised.
Re:set-up benchmarks? by mirror_dude · 2004-08-15 03:47 · Score: 0

Except your forgetting clockspead doesnt mean a lot now days.
What is important is comparable cost:
IE for $300 I can get an Athaln XXX and a P4 XXX , which one will perform better.

--
Note to Mods: When I post mirrors, it's a best guess. I don't know for certain whether or not the site will go down!
Re:set-up benchmarks? by sigaar · 2004-08-15 05:35 · Score: 2, Interesting

It really depends on what the rest of the hardware in the box is. AMD's (especially K6-II/III and Duron) CPUs tend to be seen as the low cost alternative and put in a box with a cheapo mobo, cheap mem and everything that goes with it, more often than Intel's CPUs. This is just my observation in dealing with a lot of SMEs, some who go all out and some who try to save where ever possible.

Shining example. We run an Astaro firewall for one of our clients. At first they didn't have machine available, when we wanted to start it as a proof of concept. We used one of our own boxes standing around the office, a Duron 800mhz on a PC-Chips board with SiS everything onboard, 512MB SD-RAM running at 100mhz. This PC worked quite nicely, and load never went past about 0.90

Later they retired one of their desktops to be the Astaro box. It's a P4 core 2Ghz Celeron, Intel board, 512MB SD-RAM (at 133Mhz). Load is constantly on 5.0. We've swapped out everything on that box, except the CPU. Even with a DDR board, it still running at an excessively high load.

Another example. I have an AthlonXP 2400+ on a SD-RAM board. A friend of mine has a 3ghz HT P4 with DDR333. He helped me once make ogg files of various quality of a movie's sound to compare. The P4 was only a fraction faster per file than the Athlon. Encoding two files at a time, we expected the P4 to be much quicker overall, but despite the HT, the Atlon was actually quicker per file. The encoding time per file stayed the same (time devided by two files), while on the P4 it took longer per file if we did two at a time.

This doesn't mean that the Athlon is always a faster CPU. My friend's gaming is a bit smoother, and he compiles KDE for example quite a bit quicker too. It's just that the performance depends entirely on what you do, and what quality hardware you use. If you put an Athlon on a good motherboard, it will kick arse. If you put a P4 on a crab board, it will suck.

--
sigaar
Re:set-up benchmarks? by nmos · 2004-08-15 06:11 · Score: 1

But I've seen some computers which, having only switched from AMD to comparable (in clock frequency) Intel processor, got some boost in speed.

That doesn't make any sense. Current AMD chips ALLWAYS perform better at the same clock speed. If you really mean that AMD chips with a performance rating (or whatever their calling it this week) similar to the Intel tends to perform a bit worse then that might make some sense. In addition recent Intel and AMD chips won't work in the same motherboards so clearly they didn't just switch the CPUs. They at least switched out the motherboard, cpu and probably memory and got different results.

How exactly are you running into all these people switching out CPUs for other equivilant CPUs? In practice almost noone does this. Instead people buy a system and keep it for a while then buy something else or upgrade but by then there is almost always something much faster for the same money and they buy that rather than something just like what they already had.
Re:set-up benchmarks? by Anonymous Coward · 2004-08-15 06:47 · Score: 1, Insightful

I expect the OP was comparing Athlon XP and P4.. (note that there was no mention of A64) Now, the main difference between XP and A64 is memory bandwidth/latency and guess where it shows the most? Yup. Games.
Re:set-up benchmarks? by Anonymous Coward · 2004-08-15 07:34 · Score: 0

anecdotal evidence.

nice.

move along little troll.
Re:set-up benchmarks? by Anonymous Coward · 2004-08-15 13:48 · Score: 0

Intel makes a compiler. AMD does not. Intel's compiler HAMMERS gcc and MSVC. Like 20% in some cases.

You can try it for 30 days free here. Note Linux is there too.

Opteron cpu hacked by GuyFawkes · 2004-08-15 00:49 · Score: 5, Interesting

I submitted this story an hour or two ago, but thinking about it it will be rejected just like everything else, and then pop up under someone else's name.

so what the hell.

Opteron Exposed: Reverse Engineering AMD K8 Microcode Updates

Summary

This document details the procedure for performing microcode updates on the AMD K8 processors. It also gives background information on the K8 microcode design and provides information on altering the microcode and loading the altered update for those who are interested in microcode hacking.

Source code is included for a simple Linux microcode update driver for those who want to update their K8's microcode without waiting for the motherboard vendor to add it to the BIOS. The latest microcode update blocks are included in the driver.

Background

Modern x86 microprocessors from Intel and AMD contain a feature known as "microcode update", or as the vendors prefer to call it, "BIOS update". Essentially the processor can reconfigure parts of its own hardware to fix bugs ("errata") in the silicon that would normally require a recall.

This is done by loading a block of "patch data" created by the CPU vendor into the processor using special control registers. Microcode updates essentially override hardware features with sequences of the internal RISC-like micro-ops (uops) actually executed by the processor. They can also replace the implementations of microcoded instructions already handled by hard-wired sequences in an on-die microcode ROM.

AMD's U.S. Patent 6438664 ("Microcode patch device and method for patching microcode using match registers and patch routines") goes into substantial detail on this.

Typically microcode update blocks are stored in the BIOS flash ROM and loaded into the processor as the system boots. They can also be loaded by the operating system; for instance, Linux contains a microcode device driver for Intel chips.

AMD recently released a "BIOS fix" to motherboard makers to address Errata 109, in which REP MOVS instructions caused subsequent instructions to be skipped under specific pipeline conditions.

Previously it was not clear if and how AMD even supported microcode updates in the K8 family until this announcement. After analyzing a number of BIOS images, it appears that AMD has secretly used the microcode update facility on several occasions over the past few years, but obviously avoided publicly disclosing that it actually had bugs patchable in this manner.

Early K7 (Athlon) cores initially supported microcode updates as well, until ironically the microcode update mechanism itself was found to be broken and subsequently listed as an errata!

The following sections describe the microcode update procedure, obtained by clean room reverse engineering various vendors' BIOS code. The actual microcode update blocks are embedded in the BIOS image; the most recent updates (created June 2004) have been included in the Linux driver source code attached to this description.

Microcode Update Procedure

The update procedure expects the 64-bit virtual address of the update data, including the 64 byte header, to be in edx:eax:

edx = high 32 bits of 64-bit virtual address
eax = low 32 bits of 64-bit virtual address
ecx = 0xc0010020 (MSR to trigger update)

Execute wrmsr with these register values. If the address and update block data are valid, wrmsr completes successfully. Otherwise, a GP fault is taken.

The microcode does not appear to update MSR 0x8B with the new update signature as it does on Intel processors, despite the fact that some BIOS code I have analyzed does seem to check this field. It is possible the MSR is only updated under certain conditions, for instance when microcode is loaded before initializing the cache controller. Nonetheless, as we shall see below, the processor is clearly doing something internally when it claims to accept an update in this manner.

The update generally takes around 5500 clock

--
http://slashdot.org/~GuyFawkes/journal

Re:Opteron cpu hacked by Anonymous Coward · 2004-08-15 00:53 · Score: 0

Ancient: credz nuked.
Re:Opteron cpu hacked by Ianoo · 2004-08-15 01:17 · Score: 4, Insightful

Microcode updates aren't permanent though - you need to reload them every time the machine boots. So clearly you would need to reload these "hacks" using a piece of software during the boot process.

Also, the article admits that it's "very unlikely" that a particular processor could be fried using a dodgy microcode update, so why even mention it? It would be much easier to write a BIOS flashing virus, I believe a few of these did exist at one point (although the old memory is failing). I doubt the hoops you'd need to jump through to write such a thing for Intel processors are no higher than for AMD processors, and as such, this is just FUD.
Re:Opteron cpu hacked by arose · 2004-08-15 01:20 · Score: 2, Insightful

It would be much easier to write a BIOS flashing virus, I believe a few of these did exist at one point
Chernobyl.

--
Analogies don't equal equalities, they are merely somewhat analogous.
Re:Opteron cpu hacked by Kugelfang · 2004-08-15 02:41 · Score: 1

I have rewritten the code that came with the article you qouted here. You can find my kernel module plus the necessary user application at:

http://dev.gentoo.org/~kugelfang/k8-ucode/
Re:Opteron cpu hacked by kasperd · 2004-08-15 03:06 · Score: 1

For instance, by patching the appropriate microcode lines, it may be possible to catch an opcode that would normally be illegal, and instead handle it by tricking the TLB into thinking we're in kernel mode when in fact the attacker has only compromised a userspace process.

But you could make some very similar backdoors by only changing the kernel. For instance somebody wrote a patch for Linux, where some illegal combination of parameters for the wait system call would result in the process getting root permissions.

--

Do you care about the security of your wireless mouse?
Re:Opteron cpu hacked by Sunspire · 2004-08-15 03:09 · Score: 4, Insightful

I'm not sure if other Linux distributions do this already, but at least Fedora Core 1 and 2 both come with a processor microcode update service that runs in the bootup sequence. It's even enabled by default out of the box.

Linux has for a long time already mostly ignored the system BIOS since they're notoriously broken because of legacy reasons. Supplying known good microcode is simply another step in eliminating variables that make system testing needlessly complex, I predict we'll see more developments along these lines in general.

--
It's like deja vu all over again.
Re:Opteron cpu hacked by Ramses0 · 2004-08-15 03:35 · Score: 1

Submit this stuff to Kuro5hin, it would be welcomed with open arms (mostly).

Was an interesting read!

--Robert
Re:Opteron cpu hacked by GuyFawkes · 2004-08-15 03:46 · Score: 1

Original discussion at

http://www.realworldtech.com/forums/index.cfm?ac ti on=detail&PostNum=2527&Thread=1&entryID=35446&room ID=11

I only saw it because bruce scheiner pointed it out.

--
http://slashdot.org/~GuyFawkes/journal
Re:Opteron cpu hacked by kermit6306 · 2004-08-15 11:23 · Score: 1

It would be much easier to write a BIOS flashing virus,
Thats kind of the point, you would flash the BIOS so your microcode changes would be used on every subsequent reboot. As processors become more feature rich, grow in complexity and have more of a need to be field reprogrammable, these types of hacks could be inconspicuously devastating to the security of a a system.
Re:Opteron cpu hacked by DAldredge · 2004-08-15 13:33 · Score: 1

Most K5 discussion have a habit of degenerating into an argument on American VS usian.
Re:Opteron cpu hacked by Anonymous Coward · 2004-08-15 14:10 · Score: 0

The microcode update driver currently only supports Intel CPUs. I'm working on getting the sample AMD driver into the Linux kernel.
Re:Opteron cpu hacked by Anonymous Coward · 2004-08-15 14:14 · Score: 0

Linux ignoring the BIOS has little relevance wrt microcode updates. When the system boots, the BIOS applies the microcode update (stored in the BIOS image). The update stays in effect until a reboot (or power-down).
Re:Opteron cpu hacked by Anonymous Coward · 2004-08-15 14:21 · Score: 0

Nice job! Please post this to LKML. I think the kernel maintainers would like to integrate it.
Re:Opteron cpu hacked by Sunspire · 2004-08-15 23:50 · Score: 1

The Linux microcode update service overrides the BIOS on this issue too (as well as BIOS functions in general), since it runs in the normal Linux startup sequence i.e. it always overwrites the BIOS applied microcode. Even if the Linux supplied microde is older than the BIOS's it might be advantageous to "downgrade" since the distribution toolchain might only be certified for a particular microcode revision.

--
It's like deja vu all over again.

This doesn't look good for Intel by GreatDrok · 2004-08-15 00:52 · Score: 4, Interesting

I recently installed Fedora 2 on a dual Opteron 248 system (Sun V20Z) and was amazed at the sheer grunt of the thing. Why anyone would even consider buying a Xeon just amazes me. I ran one of my own integer and memory heavy benchmark programs (single threaded) against my Athlon XP 2200+ and a single Opteron processor was 3x faster than the XP for only 400Mhz higher clock speed. These things are amazing, Intel should be crapping themselves and I am sure they would be if it wasn't for the cozy deal with Dell and the number of sites that have a Dell only policy. In a true free market they would be toast.

--
"I have the attention span of a strobe lit goldfish, please get to the point quickly!"

Re:This doesn't look good for Intel by aldoman · 2004-08-15 01:27 · Score: 1

Dell will move back to AMD if they think it's worth it, and I think they are starting to feel the heat. The opteron has went down very well in nearly all sectors, and I think we should see opteron servers soon. Hopefully.

--
IntechHosting - Free domain, 2GB, PHP, £4.95/$8.95
Re:This doesn't look good for Intel by mobby_6kl · 2004-08-15 02:23 · Score: 1

In a true free market they can sell their products at whatever prices and conditions both sides agree on. And imagine if Intel goes out of business now. AMD's a monopoly on x86 processors, there goes competition.
Re:This doesn't look good for Intel by Anonymous Coward · 2004-08-15 03:09 · Score: 0

And imagine if Intel goes out of business now. AMD's a monopoly on x86 processors, there goes competition.

Considering their war-chest, that may take a while.
Re:This doesn't look good for Intel by Unregistered · 2004-08-15 04:18 · Score: 1

Actually. it is a free market. Intel stil has the Intel name going for them. It'll be a long time (in tech time) before that changes.
Re:This doesn't look good for Intel by teg · 2004-08-15 04:55 · Score: 0
When launching Nocona, Intel focused on the platform, not the processor. After all, a processor is just part of something bigger... and the platform is AMDs weak point.
- I wouldn't touch VIA and SIS with a ten foot pole for my own systems, even less for servers. Plenty of bad experiences.
- NVIDIA does currently not go above 1 CPU, and has a strange proprietary ethernet card. I've also yet to see NVIDIA platforms support desirable enterprise features, like remote management/monitoring, other than the basic PCI slots. And while not proven bad, unlike VIA/SIS, they don't have a long track record for server chipsets either
- AMDs chipset is rather dated.
Re:This doesn't look good for Intel by iCEBaLM · 2004-08-15 05:06 · Score: 4, Informative

I wouldn't touch VIA and SIS with a ten foot pole for my own systems, even less for servers. Plenty of bad experiences.

While in the past VIA and SIS have been really bad chipsets, modern VIA chipsets (KT266A and up) are rock stable and perform well. I have had both KT333 and KT600 boards which have never failed. SIS, while I have no first hand experience, I am told is similar.
Re:This doesn't look good for Intel by Anonymous Coward · 2004-08-15 06:00 · Score: 0

When launching Nocona, Intel focused on the platform

Isn't this the same platform on which PCI-Express is broken? Yeah, there's Intel Quality for you.
Re:This doesn't look good for Intel by drinkypoo · 2004-08-15 07:11 · Score: 3, Informative

I bought a board with a SiS 745 chipset and it has been perfect, at least under windows. They provide a nice dual-channel PCI bus, even. SiS used to be a horrible joke but they've come a long, long way.

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Re:This doesn't look good for Intel by sammcj2000 · 2004-08-15 08:16 · Score: 0

its only now that i have finaly moved away from VIA, ive moved to nforce2, the chipset is SOOOOOOO much better, i didnt expect this much of a difference to be honnest, but VIA really is Nforce2
Re:This doesn't look good for Intel by Veridium · 2004-08-15 10:40 · Score: 1

I've had no problems my via chipset in my dual opteron 248(MSI motherboard), short of I was a seriously early adopter and couldn't utilize my SATA controller right away under Linux. But hey, thems the breaks for early adopters.

Other than that, which is not a weakness of the chipset, everything has hummed along smoothly. This was the first high end system I'd purchased in over 8 years, and it was well worth it. The only thing weak about this platform IMO, is the cost, which is still less than a dual Xeon equivalent. But you know, it all comes down to personal choice and personal experience. I understand if you've been in the burned in the past not wanting to try it again. I'm like that with American brand cars.

--
Think for yourself, destroy your television.
Re:This doesn't look good for Intel by sp0rk173 · 2004-08-15 11:08 · Score: 1

I don't doubt your experience, but my KT333 system just died yesturday. It won't even POST after I fidded with some AGP-related BIOS settings, even after clearing the CMOS. It lived a good life though, three years of intense tri- to quad-booting service, gaming, heavy compiling with freebsd/dragonfly/gentoo. It was retired as my windows box, and was replaced with a opteron 144 based system using the nforce 3 pro chipset. It'll be interesting to see how long this thing lives.

WHile it was alive, though, it was plagued by USB-related issues. I have never had a via chipsetted box that did not have a USB issue.
Re:This doesn't look good for Intel by Anonymous Coward · 2004-08-15 11:35 · Score: 0

Sorry, but KT266a has a lot of same errata as previous VIA shitsets (like the infamous low PCI bandwidth bug). VIA based machines may run, but they have lower PCI performance, IDE bugs and other issues.

For server class machines, steer clear of VIA, SiS and ALi and stick with an AMD chipset or Intel.
Re:This doesn't look good for Intel by iCEBaLM · 2004-08-15 15:09 · Score: 1

Right, as if Intel has never had any problems.
Re:This doesn't look good for Intel by mgcarley · 2004-08-22 22:35 · Score: 1

YAY! Does that mean we should all buy Jetway(r), ECS(r), Chaintech(r) and AsRock (AKA shitty thrown out Asus) motherboards again?

--
Founder & COO, Hayai India (hayai.in) / USA (hayaibroadband.com) // t: @mgcarley
Re:This doesn't look good for Intel by drinkypoo · 2004-08-23 04:12 · Score: 1

Just because the chipset is good doesn't mean that it was successfully implemented on a specific motherboard. I personally buy motherboards only from manufacturers with a good reputation; abit, asus, giga-byte (their RMA turnaround time is atrocious though), tyan, and so on.

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Re:This doesn't look good for Intel by mgcarley · 2004-08-26 09:25 · Score: 1

I was being sarcastic :)

--
Founder & COO, Hayai India (hayai.in) / USA (hayaibroadband.com) // t: @mgcarley

Difficult to trust? by arose · 2004-08-15 00:55 · Score: 4, Insightful

As we can see above, the difference between the two CPUs seems exaggerated and difficult to trust.

Maybe it's because Intel still makes processors for MHz and not performance? Maybe because unlike some comercial vendors the POV-Ray Team doesn't feel the need to make processor specific optimizations and leave that to job to the compiler (where it belongs)?

--
Analogies don't equal equalities, they are merely somewhat analogous.

Re:Difficult to trust? by ezzzD55J · 2004-08-15 01:12 · Score: 4, Insightful

This review struck me as a bit clueless, or unfinished. The above quote is a good example of why i think so. They do some measurements, but aren't sure they're doing it fairly (compiler flags), and don't know what to do with the results. There is little in the way of analysis or conclusions. With the openssl measurements they don't even give any conclusions. But analysis and conclusions are the whole point of the review, and a remark like "As we can see above, the difference between the two CPUs seems exaggerated and difficult to trust." really devaluates this review - they're just showing us measurements they're not sure are correct ('difficult to trust') and let us figure out what they're worth?
Well, the conclusion that the opteron kicks the xeon's ass is pretty inescapable to me, finding out opteron is available and the xeon isn't quite yet and more expensive, really closes the deal to me. But the review isn't very scientific, and didn't go very deep.
Re:Difficult to trust? by BlowChunx · 2004-08-15 01:20 · Score: 2, Insightful

It says to me that compiler flags matter a lot! ...oops, maybe those Gentoo zealots are onto something!
Re:Difficult to trust? by aldoman · 2004-08-15 01:37 · Score: 1

But they are not making just for MHz now. They are switching all CPU lines over to a similar system of numbering that AMD uses for Opetron CPUs, that is a 3 number ID code. The Prescott based Celeron is now known as the '320D' for example. The Pentium-M line is moving over soon (if they haven't totally done that yet).

--
IntechHosting - Free domain, 2GB, PHP, £4.95/$8.95
Re:Difficult to trust? by alienw · 2004-08-15 02:43 · Score: 4, Insightful

That's because it is nearly impossible to do a scientific comparison of two different processors. Anyone who tells you otherwise is a moron.

You have to evaluate performance (possibly vs price) for your particular application. If you need a faster processor for Doom 3, look at Doom 3 benchmarks. If you need to encode video, look at video benchmarks. If you need to do integer computations, look at integer benchmarks. Xeons probably kick AMD's ass at some applications, and AMD might beat the Xeon at others. You can't just say that one is "better" than the other in general.
Re:Difficult to trust? by Jah-Wren+Ryel · 2004-08-15 04:09 · Score: 0, Troll

This review struck me as a bit clueless, or unfinished.

Unfortunately, that is the case with so many of these "review" sites like anandtech and tom's hardware. For the most part, they are all just a bunch of kids or hobbiests with little to no industry experience. If they are lucky, they've got a CS degree but that's a bare minimum requirement not a qualification. More often than not the people writing these "reviews" are the same guys wearing the blue shirts down at the local best buy smurf cave.

If you think about it, it isn't too surprising. People with a real understanding of modern cpu architecture and system design are still rare enough, even in this depressed and outsourced micro-economy, that they can earn one or two orders of magnitude more money working in the labs of the big boys like Intel, AMD, HP, IBM, etc. Thus there is very little incentive for them to go do evaluations and write meaningful reports for hobbiest websites. The end result is you've got the blind leading the blind and at the same time these hobbiest websites are able to generate a hugely loyal following (whenever I post a critical message like this, there is usually at least on adherent to a website who replies with vitriolic denial).

That is not to say that there are no good sources of discussuion of modern computer architecture - the comp.arch usenet group has a few distinguished regulars like Andy Glew, a cpu architect who has worked on Intel and AMD microprocessors, for example. Though comp.arch has apparently become populare enough over the last year or so that the signal-to-noise ratio has significantly decreased. As for websites, of those with any regular activity, I think realwordltech has the highest level of knowledgable articles and community discussion going on, although many of the participants are just comp.arch regulars in a different venue.

--
When information is power, privacy is freedom.
Re:Difficult to trust? by upsidedown_duck · 2004-08-15 04:11 · Score: 1

It says to me that compiler flags matter a lot!

They do, but some performance-oriented flags can either cause instability or mistaken results. I've seen single flags cause a program to core dump or not, and even any optimization at all can cause some programs to crash (probably a very obscure bug regarding program and compiler assumptions...I never found it).

--
-- "Makes Little Debbie look like a pile of puke!" - Moe Szyslak
Re:Difficult to trust? by mrchaotica · 2004-08-15 04:38 · Score: 1

Well, actually you can if one beats the other in almost all the benchmarks : )

--
"[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz
Re:Difficult to trust? by Anonymous Coward · 2004-08-15 05:40 · Score: 0

You, like most average users, seem to want a reviewer to oversimplify and distill everything in a review down to a single tagline; like "ooh, it's the ultimate driving machine" or "coke ads life".
A better review for technical people, like those at anandtech and tomshardware, will be more transpartent - showing what their results are regardless of if they understand them - pointing out places where resutls indicate compiler issues, rather than CPU issues. You might like to say this makes the review sound "unifinished" and "hobbiest". I'd say it makes them sound honest.
Readers with industry experience appreciate this transparency; since it lets them focus their own analysis on the parts of the review that were of interest to their industry.
A "finished" review that gives a one liner "XXX is good" that doesn't highlight the rough edges and inconistant results may play well to the Time Magazine crowd, who like to be told exactly what to think.
Re:Difficult to trust? by Halfbaked+Plan · 2004-08-15 05:41 · Score: 1

You can if Mom will only allow you to have one computer in your 'room' so you have to choose which machine you buy veeeery carefully.

--
resigned
Re:Difficult to trust? by Halfbaked+Plan · 2004-08-15 05:44 · Score: 1

You're on track with your comment about hobbyists and enthusiasts.

I sold my first Pentium 75 chip (at the point I upgraded the system it was in) to a Mechanical Engineer at work. He had become an overclocking enthusiast, and was looking forward to 'seeing what he could get' out of that old chip.

That crowd is sorta the computer-hardware equivalent of the NASCAR fans. There are even fanboy bumper stickers for AMD.

--
resigned
Re:Difficult to trust? by drinkypoo · 2004-08-15 07:14 · Score: 2, Insightful

Yes, yes we are. :)

You definitely have to be careful about which ones you pick, though, and if you're really worried about performance you have to do what they did in this review, and try different settings with different programs because different flags will produce different results on different programs.

In general I use -O3 on older processors, like pentium2 cores, and -O2 on newer ones. I don't know if it's still true but -O3 was known to cause problems (errors, not just a performance hit) with GCC3 and Athlon processors, as well as MIPS processors, though that may have changed in 3.3 or 3.4.

Remember, gentoo's strategy of compiling everything is about more than cflags, it's also about the USE flags that ensure that you get support for the things you want, and skip the support for things you don't need. There's more than one kind of optimization.

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Re:Difficult to trust? by ezzzD55J · 2004-08-15 07:28 · Score: 1

You, like most average users, seem to want a reviewer to oversimplify and distill everything in a review down to a single tagline; like "ooh, it's the ultimate driving machine" or "coke ads life".
Not at all; I'm not saying I'm more advanced than an average user, however, I in fact hate oversimplified and highly distilled articles, boiled down to taglines (our soundbites). These taglines and soundbites often find themselves back into slashdot blurbs, I've noticed, and that irritates me..
I have no problem with doing my own analysis and draw my own conclusions, in so far as I can.
What I was trying to say with my post is that I go to hardware/architecture sites for informed opinion and analysis on hardware performance, and that is not what I saw in this article.
Re:Difficult to trust? by Anonymous Coward · 2004-08-15 08:41 · Score: 0

Nah. This is really because P4 is DOG SLOW on x87 code. It's actually much slower than P3 in it while Athlon on the other hand is even slightly faster. This hasn't changed since the first P4 but nowadays POV-Ray is one of the few programs where it shows because it doesn't make any use of processor-specific trickery.

If you want to get even halfway decent speed out of P4, you need to use SSE. With SSE the ops/cycle you get is much closer to the same number regardless of architecture. Although the vector unit in Athlon is also more per-clock-powerful than in P4 (i can get 3 flops/cycle on some pieces of code on my XP while p4 has a hard limit on 2), it isn't enough so to take care of the GHz difference.
Re:Difficult to trust? by Anonymous Coward · 2004-08-15 10:13 · Score: 0

whenever I post a critical message like this, there is usually at least on adherent to a website who replies with vitriolic denial
-1 Troll -- Looks like you hit the nerve of an "adherent" with mod points today instead. Is that anything like crazy glue or maybe crazy glew?
Re:Difficult to trust? by Alsee · 2004-08-15 18:50 · Score: 1

You have to evaluate performance (possibly vs price) for your particular application.

Well considering that that Intel chip is more than 6 times the price of the AMD chip, if $858 matters to you at all, then AMD appears to win hands down.

Considering that the AMD chip is considerably faster on the vast majority of tasks, if you run multiple kinds of software, or even one peice of software with code that does multiple things, then on average the AMD appears to win hands down.

But yeah, you're right. *IF* you are running a single peice of software *AND* that software severely stresses a single portion of the system *AND* price is no object, then yes, there is a small chance that the Intel chip will be better for you. But AMD won amost every test, so odds are the ADM will still be the one you want.

I mean seriously, unless Anandtech intentionally stacked the deck, it was an absolute blowout. It would be quite a stretch to suggest Anandtech is intentionally biased in favor of AMD. If anything their last article was unfairly biased against AMD.

It seems that Intel is indeed slipping. Intel invested a lot of work in an ultra-high megahtz P5 core and then had to abandon that entire development line. That's a hell of a lot of work down the drain. Intel has also been investing a lot of work on embedding a Trusted CPU core inside of the main CPU. In my oppinion that is a lot of work down the drain as well, not only do I *not* want a second bullshit Trusted CPU inside of my CPU, not only does this work *not* speed up the CPU, not only does it result in a *more expensive* CPU, but it in fact results in a *slower* CPU because the second Trusted CPU eats up a large chunk of transistors and CPU realestate. I don't know whether there is an inert Trusted core hidden inside the P4 or or not, but I have seen the P5 micrograph and the Trusetd Core eats up about 20% of the chip, around 25 million transistors. That's HALF of an entire P3 CPU!

It seems that Intel really has stumbled lately. Their main advantage now is their trademark. But in this business a strong trademark isn't going to prop up an inferior more expensive product very well or very long. My current PC has an Intel CPU, but time to put this old hunk out to pasture and buy a new system. Looks like my money is going to AMD for price *and* performance.

-

--
- - You can't take something off the Internet! That's like trying to take pee out of a swimming pool.
Re:Difficult to trust? by alienw · 2004-08-16 03:51 · Score: 1

Well considering that that Intel chip is more than 6 times the price of the AMD chip, if $858 matters to you at all, then AMD appears to win hands down.

This is a SERVER processor. If you are running a server, cost matters very little. Hell, you probably pay more than $800 to your janitor. Reliability, performance, and compatibility are what matters. AMD may be a better value, but Intel has a lot more experience with servers. Do you want to buy a $250K cluster to find out it doesn't work reliably with the application you want to run? All the machines I own are AMD, but I can see why somebody would buy Intel.

But yeah, you're right. *IF* you are running a single peice of software *AND* that software severely stresses a single portion of the system *AND* price is no object, then yes, there is a small chance that the Intel chip will be better for you.

Guess what: that's what a server needs to do, 24/7. Also, keep in mind that you need to evaluate the whole SYSTEM, not just a processor. You can't pick and choose parts, you have to either go with an Intel vendor or with an AMD one. So if AMD has a dodgy motherboard, it's out of the question. And that's where experience really counts.
Re:Difficult to trust? by alienw · 2004-08-16 03:54 · Score: 1

In an earlier test, it LOST to the other in almost all the benchmarks. Reinforces my point about the usefulness of benchmarks quite nicely, doesn't it?
Re:Difficult to trust? by mrchaotica · 2004-08-16 05:28 · Score: 1

Err, that was a different processor... and it was obvious they screwed up that set of benchmarks anyway.

--
"[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

Lame conclusion? by Quixote · 2004-08-15 01:07 · Score: 4, Informative

The comparison concludes with the wishy-washy statement:
After all is said and done it became difficult (nearly impossible?) to justify the Xeon processor in a UP configuration over the Opteron 150,

Huh? Here are some numbers:

POV-Ray 3.50c: Opteron is 40% faster
Crafty v19.15: Opteron is 70% faster
TSCP: 10% faster
PostgresSQL test-insert and test-select: Opteron takes 60% of the time it takes Xeon
MySQL test-insert: Opteron takes 80% of the time it takes Xeon.

In almost every benchmark, where proper optimizations are used (and why shouldn't they be? Who in his/her right mind would not use proper optimizations??), the Opteron destroys the Xeon.

Re:Lame conclusion? by Anonymous Coward · 2004-08-15 01:17 · Score: 0, Troll

Who in his/her right mind would not use proper optimizations??

Good question. So, by visiting SPEC, where _AMD_ could choose the optimizations on the Opteron, and _Intel_ could choose the optimizations on the Xeon, and BOTH companies had to play by SPEC's result reporting rules (available on their website for you to read in detail), HERE ARE THE CRAFTY SCORES:

Opteron 150: 63.8 seconds
Xeon: 74.9 second

This makes the Opteron 17% faster, not 70% faster.

Conclusion: ANANDTECH is a piece of shit website staffed by fucking morons like you.

Proof: figures are from HERE:

http://www.spec.org/osg/cpu2000/results/res2004q 3/ cpu2000-20040727-03285.html
and HERE:
http://www.spec.org/osg/cpu2000/results/res 2004q2/ cpu2000-20040503-03003.html
Re:Lame conclusion? by Anonymous Coward · 2004-08-15 03:51 · Score: 0

Conclusion: ANANDTECH is a piece of shit website staffed by fucking morons like you.

Error: Conclusion does not follow. Replace user and press any key to continue.
Re:Lame conclusion? by mabinogi · 2004-08-15 08:06 · Score: 1

And I bet Microsoft is going to release two versions of XP AMD64 one optomised for EMT64 and one for AMD 64, right?

And all the linux distributions and third party software providers (Oracle, etc) are going to do the same, right?

It's kinda irrelevant how well either processor works when compiled with opomisations that real world software is probably never going to use.

--
Advanced users are users too!
Re:Lame conclusion? by Veridium · 2004-08-15 11:19 · Score: 1

And all the linux distributions and third party software providers (Oracle, etc) are going to do the same, right? It's kinda irrelevant how well either processor works when compiled with opomisations that real world software is probably never going to use.

Hold on a sec, you do have a valid point within the context of typical windows machines, but when you're talking about high end workstations and servers, which both chips are targeted for, you have a whole different ball game.

I have never relied on precompiled binaries in a production environment when dealing with high performance apps. Hell, on my own Linux boxes, and I can't believe I'm alone or in a vast minority in the Linux community when I say this, I always play around with squeezing the most performance out of the apps through compiler optimizations.

If this were a Windows review, I wouldn't object so much. But we have to recognize that a Linux review is targeting an audience that includes people who do in fact utilize compiler optimizations in real world software. Any Linux admin worth his salt is going to do everything he can to make his machines and the apps that run on them, perform as fast as possible for his users.

--
Think for yourself, destroy your television.
Re:Lame conclusion? by mabinogi · 2004-08-15 11:28 · Score: 1

Ok, so how do you recompile Oracle?

Do you also usualy recompile libc? - though I could forsee a distro providing two versions of Libc if it was going to make a significant difference...

Obviously if you're using software that can be recompiled, and doing so will make a significant difference then it should be considered, but personally I think having a well tested, stable, and easy to recover in an emergency setup is far more important than tweaking the best performance. So for me, when it came to considering performance, I'd be looking for the platform that performed the best in the general case, with non optimised binaries, over one that _could_ perform much better, but only if you recompiled everything with specific optimisations and a different compiler.

I guess it probably comes down to the difference between scientific applications, where the software is specialised, and recompiling and tweaking for speed is pretty much expected, and database driven business applications - which are usualy built on closed source 3rd party platforms (and are probably IO bound anyway)

--
Advanced users are users too!
Re:Lame conclusion? by Veridium · 2004-08-15 11:44 · Score: 1

I guess it probably comes down to the difference between scientific applications, where the software is specialised, and recompiling and tweaking for speed is pretty much expected, and database driven business applications - which are usualy built on closed source 3rd party platforms (and are probably IO bound anyway)

That sounds about right. Yeah, I can't recompile oracle and there's some things I don't, but I would recompile MySQL, or APACHE(to name some more well known apps), and it is a trivial task to setup your optimized applications for fast recovery in an emergency. But, I can see some scenarios where it's probably better for the admin to go with precompiled binaries.

My real objection was, I didn't think you were recognizing a valid segment of the target audience who is very much interested in that. We want to be recognized ya know. ;)

--
Think for yourself, destroy your television.
Re:Lame conclusion? by Alsee · 2004-08-15 19:07 · Score: 1

Actually it seems pretty moot to me. As far as I see they aren't report a case where single-exe vs optimizing for each chip is going to reverse which chip is faster. It seems it would merely be tweeking how much the same winner is going to win by.

-

--
- - You can't take something off the Internet! That's like trying to take pee out of a swimming pool.

very little grey area by Anonymous Coward · 2004-08-15 01:08 · Score: 1, Interesting

"As we can see above, the difference between the two CPUs seems exaggerated and difficult to trust."

---

The opteron seems to trounce the Xeon in most of the tests. The tests where it doesn't win outright, it is barely edged out. And this is from an Intel processor that mere mortals can't even buy right now against an Opteron I can order today and have in my hands tomorrow. That doesn't bode well for Intel.

Given the wide performance delta in AMD's favour in the database tests in this uniprocessor setting, I can't wait to see how things stack up with dual or quad processors. I know what I'LL be using next time I need to spec a departmental or enterprise database box.

Cheers,

Re:very little grey area by eddy · 2004-08-15 01:16 · Score: 5, Informative

There are older dual and quad Opteron vs Xeon reviews around.

When it comes to (Java) webservers and/or MySQL, the Opteron definitely has the advantage. In some cases, the Opteron simply annihilates the Xeon, but luckily for Intel the latter offers some resistance in our GZIP dominated benchmarks.

Humorously, the also say this:

The Opteron will probably remain the fastest CPU for the server tasks tested here until Intel introduces Nocona, the Xeon Prescott at 3.4-3.6 GHz (1 MB L2, 800 MHz FSB) at the end of the 2nd quarter of 2004.

Now we know that the Nocona is here, and it's getting slaughtered at the Altar of The Opteron.

--
Belief is the currency of delusion.

Opterons and Xeons and Prescotts, oh my... by Ianoo · 2004-08-15 01:10 · Score: 1

Anandtech originally posted an article which was a comparison of the Intel Xeon Nocona ("3.6F") and the AMD Athlon 64 3500+. The Xeon "won" most of the benchmarks by a good amount.

The criticisms were that the Xeon is not a desktop CPU, or vice-versa, the Athlon is not a workstation/server CPU. But are they so different? The Xeon has 1MB L2 cache, and so does the P4 Prescott (and presumably Prescott with x86-64 enabled), and both run at the same speed.

Similarly, the 3500+ runs at 2.2GHz and has a 512KB L2 cache, whilst the Opteron 250 runs at 2.4GHz and has a 1MB L2 cache.

With this in mind, can anyone explain to me why the Opteron seems to perform much better? The benchmarks appear to show the Xeon trouncing the A64 and the Opteron 250 trouncing the very same Xeon in 64-bit.

So what's the deal? Are AMD's desktop chips choked on 512KB of L2 cache (and yet the new Socket 939 A64s seem to be dropping back to 512KB L2 cache, whereas a few of the Socket 754 chips had a full 1MB).

I'm no processor expert, I just wondered if anyone could explain the "big difference".

Re:Opterons and Xeons and Prescotts, oh my... by Anonymous Coward · 2004-08-15 01:30 · Score: 0

The benchmarks appear to show the Xeon trouncing the A64 and the Opteron 250 trouncing the very same Xeon in 64-bit.

The only way a Xeon can "trounce" an Opteron is if the benchmarketers lie and cheat to make it so. This is especially true when processors you can't even buy(!) are "trouncing" proven technology.
Re:Opterons and Xeons and Prescotts, oh my... by Anonymous Coward · 2004-08-15 01:38 · Score: 0

You need to re-read the grandparent, this time more carefully.
Re:Opterons and Xeons and Prescotts, oh my... by ahmetaa · 2004-08-15 02:00 · Score: 1

Because first review was badly flawed. read the reader's comments on the first review, you will see why Anandtech come aout with his review. The configurations were wrong, compiler settings were wrong, and the chosen synthetic benchmarks were just trash. i am sure if they make the second test with the Athlon 64 againg, they will have similar results with opteron test..

Re:I have a great idea by NeedleSurfer · 2004-08-15 01:11 · Score: 1, Offtopic

when racism affects your judgement of an article it's the equivalent of having no judgement in the first place. Racism is the single most evocative measure of a person lack of judgement or mental abstraction capabilities.

As the proud owner of an Opteron 150... by kikensei · 2004-08-15 01:15 · Score: 1

I can say it kicks all kinds of ass in Doom3. :) I see that all sors of business/content creation benchmarks were used in the Anandtech review, but when a Baron of Hell corners you an ancient underground Mars excavation site... Well, lets just say your death animation looks all the more sweet with those extra fps.

games are poor example for Intel performance ... by Anonymous Coward · 2004-08-15 01:16 · Score: 1, Informative

You seem to be confused about the MHz thing ... an Opteron with the same clock frequency would win every benchmark against a P4: An Athlon 64 at 2 Ghz is faster than a 3.4 Ghz P4 in many benchmarks (and slower in others). Intel's design makes very high clock frequencies possible, but at the expense of a very long pipeline, which hurts applications with many branches or unplanned memory accesses.

If you really need high performance, you'll have to try and figure out the performance for the specific application you need.

A rough summary (when comparing processors with the same rating, not same clock frequency):
The Athlon 64 wins most gaming benchmarks against the P4.
The Athlon (and especially the Opteron with its larger chache) dominates database benchmarks.
Things are very different with video encoding - there Intel usually wins.

Opteron vs. A64 by jonobp · 2004-08-15 01:24 · Score: 1

The main difference between the two is rather simple. The Opteron uses a dual-channel memory controller. You'll see most mobo's for the Opteron require registered memory. The A64 on the other hand being a consumer based chip uses a single channel memory controller. All in all, the Opteron has double the theoretical memory bandwidth.

On the other hand, Intel's version of 64 bit computing still does not have a memory controller built into the chip. While using dual channel memory, they are still having the Northbridge bottleneck that AMD wanted to avoid from the beginning.

What would be nice to see, as others have commented would be a scalled series of benchmarks. You think the Opteron is impressive now, try 2 and 8 processor configurations vs. the Xeon's. You'll see how much the memory controller plays a role. In the end, add up the GHZ and price points and you'll be able to see how much bang you'll get for your server $.

Re:Opteron vs. A64 by Ianoo · 2004-08-15 01:27 · Score: 3, Interesting

Actually, your information is out-of-date. The new Socket 939 Athlon 64's (both the + and FX series) feature a dual-channel memory controller for unregistered DDR SDRAM (this is one of the big reasons for introducing the new socket in the first place).

This still leaves me wondering why an Opteron 250 (2.4GHz, 1MB L2 cache) seems to so seriously outperform an Athlon 64 3500+ (2.2GHz, 512KB L2 cache).
Re:Opteron vs. A64 by at_18 · 2004-08-15 01:45 · Score: 5, Informative

This still leaves me wondering why an Opteron 250 (2.4GHz, 1MB L2 cache) seems to so seriously outperform an Athlon 64 3500+ (2.2GHz, 512KB L2 cache).

When people says that the first article was bad, it's because it was really bad: 64-bit binaries for Intel vs. 32-bit binaries for AMD, copy&pasted benchmark results from previous 32-bit benchmarks, tests (PI digit computation) that measured the libc optimization instead of the actual benchmark (when removing the printf() it got about a 10x boost). People on aceshardware forums were posting TSCP scores about 2x what Anandtech got, on the same processor. So the A64 3500+ scores you saw in that article are trash. Forget them.
Re:Opteron vs. A64 by Ianoo · 2004-08-15 01:52 · Score: 1

Thanks for the info. I had been following the reaction at Anand's but didn't realise quite how bad these figures were. I'd give you mod points if I had them ;)
Re:Opteron vs. A64 by jonobp · 2004-08-15 01:57 · Score: 1

The new 939's do have the dual controller. But depending on the board a 200Mhz slower HT link and 200Mhz slower processor. This isn't Intel, it's AMD...200Mhz actually means something with chips that only use 12 stages. Now you can OC the 3500 and probably get closer results. The cache also might play a role depending on the benchmarks. Remember the FX-51 used to trounce everything in games and most everything else. All it had was a dual mem. controller and a little more clock frequency and cache. How did Intel answer...they put a mess load of cache on a P4 and sold it for a grand...that's all the EE was and it still can't keep up.

You mention the Opteron 250, I'm assuming that you compared benchmarks with the 3500 with the 150 in the article as the 250 is a dual setup.

Oh, also as someone mentioned below the benchmarks in the first article are not very well down. But if you want to go by them or even any others, the Opteron usually inches out the 64's and sometimes the FX. After all, servers usually have better constructed motherboards with lower tolerance for variation. I'll tell you what though, my 150 plays a hell of a game of Doom 3 :P
Re:Opteron vs. A64 by hirschma · 2004-08-15 03:23 · Score: 1

In some cases benchmarks were run with some executables (like the chess thingy) that would fit in the Xeon's cache, but not in the A64's. This would give the Xeon an extremely unfair advantage, and it came out that way.

Same benches with the same cache, the Xeon gets killed.

Jonathan
Re:Opteron vs. A64 by Anonymous Coward · 2004-08-15 03:25 · Score: 0

But depending on the board a 200Mhz slower HT link

There were some benchmarks done a while back when the 1GHz HT links were introduced. It showed that basically, the HT speed didn't change the benchmark scores unless you dropped the speed down to 200MHz or so (from the 800MHz or 1000MHz of the normal running systems). In fact, it showed that you could run the HT down at even 600MHz and not see any significant performance decreases.

Re:Am siebten Tage sollst du ruhen! by mycroft_rayok · 2004-08-15 01:24 · Score: 1

With the crap stories that have been posted today, I think perhaps the slashdot editors ARE taking it easy on the seventh day

Hyperthreading is not good for these benchmarks by DJStealth · 2004-08-15 01:24 · Score: 4, Insightful

As hyperthreading cuts the L2 cache in HALF, it should be disabled before doing any of these benchmarks. Hyperthreading only seems to improve the multithreading ability. These benchmarks being run on a single process are not realistic.

Re:Hyperthreading is not good for these benchmarks by mczak · 2004-08-15 02:14 · Score: 2, Informative

Hypterthreading doesn't cut the L2 cache in half. At least not statically, it is possible that if you're unlucky it will indeed look like each process will only have half the cache.
Re:Hyperthreading is not good for these benchmarks by langarto · 2004-08-15 02:34 · Score: 1

Hyperthreading does not cut the L2 in half if there is no other thread running. The cache is shared dinamically. However, there are other resources that are actually divided in two halves when hyperthreading is used (some internal queues).
Re:Hyperthreading is not good for these benchmarks by Glock27 · 2004-08-15 04:01 · Score: 2, Insightful

As hyperthreading cuts the L2 cache in HALF, it should be disabled before doing any of these benchmarks. Hyperthreading only seems to improve the multithreading ability. These benchmarks being run on a single process are not realistic.
Since it isn't possible to dynamically turn hyperthreading on and off while the system is running, the benchmarks should be run in the mode most systems will use - with (highly touted) HYPErthreading turned on. After all, it is supposed to be a useful feature...
Personally, if I really need to run two threads with top efficiency, I'll invest in a dual 2xx Opteron box - with two real processors rather than an extra sorta-processor. That way if I have to pay a dual CPU license fee at least I'll get my money's worth. :-)

--
Galileo: "The Earth revolves around the Sun!"
Score: -1 100% Flamebait
Re:Hyperthreading is not good for these benchmarks by Unregistered · 2004-08-15 04:26 · Score: 1

But in the real world, wouldn't you have HT enabled, even doing this kind of stuff?
Re:Hyperthreading is not good for these benchmarks by Fweeky · 2004-08-15 05:05 · Score: 1

For things like databases, sure; it's no good saying "This Xeon executes a single threaded set of db ops slower than this Opteron" when the usual workload of a database is to serve multiple clients at once; each thread might be quite significantly slower, but if you can run two of them per CPU at that speed you're not as behind as a single threaded benchmark might suggest.
Re:Hyperthreading is not good for these benchmarks by DJStealth · 2004-08-15 05:12 · Score: 1

I've tested this on various machines. Benchmarks for single processes are almost always higher with HT turned off. However, in multitasking system with only 1 processor, you are correct in stating that it should be turned on; yet the benchmarks shown on this site are meaningless unless they are tested in a multitasking environment.

However, we may be getting a few multiprocessor Xeon's for research use; since there are multiple processors, I'm debating whether HT should be turned OFF, as HT slows things down for single processes, and as there are multiple processors to deal with multiple processes, HT may not serve any advantage whatsoever.
Re:Hyperthreading is not good for these benchmarks by luguvalium2 · 2004-08-15 05:48 · Score: 1

I seem to recall that you are not supposed to turn off hyperthreading on a single cpu p4/xeon. I'll see if I can find a reference. It is optional on smp implementations.
Re:Hyperthreading is not good for these benchmarks by Glock27 · 2004-08-15 11:40 · Score: 1

However, we may be getting a few multiprocessor Xeon's for research use; since there are multiple processors, I'm debating whether HT should be turned OFF, as HT slows things down for single processes, and as there are multiple processors to deal with multiple processes, HT may not serve any advantage whatsoever.
Well, you should really look at the number of compute-bound threads you have active on average. If that number is around 4 or greater, and you have a dual-processor box, then you should probably turn on HT.
Just be aware that during the time that you have fewer than that number of threads going, you'll be taking a performance hit...

--
Galileo: "The Earth revolves around the Sun!"
Score: -1 100% Flamebait
Re:Hyperthreading is not good for these benchmarks by SuperQ · 2004-08-15 16:04 · Score: 1

Would you please post documenation (intel.com links maybe?) where you found this? I would like to see some hard evidence. just curious
Re:Hyperthreading is not good for these benchmarks by TheLink · 2004-08-15 19:01 · Score: 1

HT is not always a win even for multiple clients/threads:
See the following:
page4

page5
--
- Too many replies beneath your current threshold

Xeon lose at SPEC too. by Anonymous Coward · 2004-08-15 01:26 · Score: 0

>This makes the Opteron 17% faster, not 70% faster.

So Xeons get creamed at SPEC too, you troll. Where's the beef?

Re:Xeon lose at SPEC too. by barneyfoo · 2004-08-15 06:54 · Score: 1

He was commenting on the crafty scores anandtech got, and the crafty section of spec. Same test different results. Good to question anandtech it seems, but intel probably used INtel C Compiler.
Re:Xeon lose at SPEC too. by turgid · 2004-08-15 07:16 · Score: 2, Informative

Synthetic benchmarks like SPEC often give very different results to real world applications. The fact that the Opteron is faster in the SPEC benchmarks and many real-world tests speaks for itself.

--
Stick Men
Re:Xeon lose at SPEC too. by Anonymous Coward · 2004-08-15 09:30 · Score: 0

Actually, both AMD and Intel used the Intel C compiler.

AnandTech Biased by ironwill96 · 2004-08-15 01:36 · Score: 2, Interesting

It seems to me that AnandTech seems to be biased in Intel's favor for some odd reason. Either that, or that particular reviewer happens to be. Last week in their other review they said the Intel Xeon processor was way better - even when the results were about the same skewed in Intel's favor. Now that the results are skewed toward AMD the reviewer still refuses to see that the Opteron is a better processor, is available NOW, and is $250 cheaper than the Xeon-yet-to-be-released that they are comparing it too.

*Sigh* I've lost all faith in reviews by some of these hardware sites lately - they seem to be getting paid by someone to make invalid conclusions (or none at all) from fairly conclusive data.

--
"To strive, to seek, to find, and not to yield." - Tennyson

Re:AnandTech Biased by Anonymous Coward · 2004-08-15 01:56 · Score: 0

Last week they compared a server CPU vs a desktop CPU. They finally corrected that "bias" and compared apples to apples (server CPU vs server CPU).
Re:AnandTech Biased by ArbitraryConstant · 2004-08-15 03:03 · Score: 1

"I've lost all faith in reviews by some of these hardware sites lately - they seem to be getting paid by someone to make invalid conclusions (or none at all) from fairly conclusive data."

We still have Ars. :)

--
I rarely criticize things I don't care about.
Re:AnandTech Biased by swordgeek · 2004-08-15 03:08 · Score: 2, Insightful

Anandtech isn't biased, it's incompetent.

Don't get me wrong--I've liked Anand and company since they first hit the internet. They don't generally have an axe to grind or an ego to boost (both of which TomsHardware suffers from terribly), but they don't have the slightest bloody clue about statistics, or significance.

Fun to read, and not consistently biased, but not a great source of actual benchmarking or review information. (techreport.com is better for that)

--

"People who do stupid things with hazardous materials often die." -- Jim Davidson on alt.folklore.urban
Re:AnandTech Biased by Anonymous Coward · 2004-08-15 03:29 · Score: 0

I don't find them overtly biased. They're just not terribly competent. Consider that anandtech has morphed from a kid doing neato-whizbang reviews in his garage to a site that gets some small amount of media attention. The reviewers have "grown up" over the last few years (most are in college now, I think), but their methods have not grown up with them. The site is geared more towards teenage gamers with a significant hardware budget rather than your typical POB or technology professional.

I've pretty much ignored Anand for the last couple of years except for the occasional video card review (which they DO seem competent at since they are all diehard gamers).

Cheers,
Re:AnandTech Biased by Anonymous Coward · 2004-08-15 03:29 · Score: 0

Until you realize that the "server" cpu and the "desktop" cpu will be identical in every way that will matter. The only difference being that the Prescott 3.6F won't have multi-processor capability that the Xeon has. Every other thing is supposed to be identical - L1 and L2 cache sizes, the core, clock speed, etc.

People using this argument of "server" vs "desktop" cpus in this case have no clue to what they are talking about.
Re:AnandTech Biased by Sj0 · 2004-08-15 17:33 · Score: 1

Wrong processor.

The last review by anandtech was "EM64T Xeon vs. Athlon 64 under Linux".

I'll spare you the condescending jab at your last remark.

--
It's been a long time.

Those results speak for themselves by Stevyn · 2004-08-15 01:38 · Score: 2, Informative

Other than a few benchmarks that were either synthetic or not compiled specifically for the processor, AMD whooped Intel's ass. Some of the gains were quite significant.

However, this speed increase seems to depend on being able to compile your software from scratch which is generally unknown in the windows world. That should change in the future, but for now it's still a tough call whether or not to buy one now. But if you're running gentoo, let the funroll-loops begin!

Re:Those results speak for themselves by BCW2 · 2004-08-15 03:52 · Score: 1

The decision is simple: bang for the buck. AMD always wins that choice. Check other reviews for the best motherboard to go with it and build a real hot rod gamer, or a fast enough web server to need more bandwidth.

--
Professional Politicians are not the solution, they ARE the problem.
Re:Those results speak for themselves by SharkDiver · 2004-08-15 05:04 · Score: 0

Having had to buy larger numbers of dedicated application and db systems, performance matters a lot. Price not as much, but is a factor. Name brand, with technical support factors perhaps above price.

When both performance and price are on the same side, it's hard not to buy those servers. If the Operton 150's were HP/Compaq's or IBM server with a support contract, the Opteron would be a no-brainer.

Maybe it's time for Dell to realize that the advertising kickback isn't worth the loyalty required. "Intel inside" maybe become synonymous with Cadillac: Overpriced and underpowered.

Debian Sarge by Anonymous Coward · 2004-08-15 02:01 · Score: 0

http://www.debian.org/ports/
http://www.debian.org/devel/debian-installer/ports -status

Sarge in September?
http://debianplanet.org/node.php?id=1131

Re:Debian Sarge by Anonymous Coward · 2004-08-15 03:12 · Score: 0

Sarge in September?

Gentoo now. ;-)

Re:I have a great idea by Anonymous Coward · 2004-08-15 02:04 · Score: 0

What?!

Cigarettes are mentally diseased?

My God!

Other Ideas for benchmarks by johnhennessy · 2004-08-15 02:07 · Score: 4, Interesting

How about profiling bytecode interpreters for the new breed 64 bit processors.

Both Sun (the original innovators) and now Microsoft are putting their money on their bytecode (rather than binary) executables to try and avoid the whole backwards compatibilty problems when moving architectures. To get to grips with how important this is - Microsoft has only just recently managed to escape from the 16 bit code hell that it lived in for years (need proof - check out the Win16Lock you needed to get access to the video memory in DirectX).

That said, I can't imagine that many (someone might enlighten us here) performance benchmarks that a 64 bit bytecode interpreter could do better in when compared to its 32 bit smaller brother.

What would be interesting here would be to see how Javas bytecode and CIL scale to 64 bit. My first guess would be that Java should scale better (with Suns heritage of 64 bit platforms) but I wouldn't be surprised if MSFT weren't too far behind, as they were always keeping their eye on this test when designing the CIL. This would also be a good chance for the Mono project to try a "ours is better than yours" benchmark for their interpreterrs.

--
[ Monday is a terrible way to spend one seventh of your life. ]

Re:Other Ideas for benchmarks by Anonymous Coward · 2004-08-15 03:17 · Score: 1

My first guess would be that Java should scale better (with Suns heritage of 64 bit platforms)

Actually Sun's version of Java for Solaris is pretty much widely considered to be the worst Java platform implementation available, not the least in the speed department. Those who've had the pleasure to work with said version know what I'm talking about and its quirks.
Re:Other Ideas for benchmarks by drinkypoo · 2004-08-15 07:19 · Score: 1

Sun is the original innovator? I don't know if you can even give the credit to IBM, but bytecode translation has long been a strategy used by the AS/400.

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Re:Other Ideas for benchmarks by andreyw · 2004-08-15 10:59 · Score: 1

... and IBM/360 with its VM/360.

Think about it... you can run all those OS/360 (as well as OS/390, etc, etc) programs on your zSeries...

Damn I love Big Blue.

Re:I have a great idea by Anonymous Coward · 2004-08-15 02:25 · Score: 0

When you don't have the material nor the time to validate an article, you have to judge its validity based on reputation. If you don't know the person who is making the article, then you have to use general knowledge, like cultural difference, to determine the value of the article.

For example, I won't trust an article praising Intel over AMD if it was made by a jewish person. I'd think there is a possibility he may have a political agenda. The opposite is also true. I won't trust someone praising AMD if he is openly against Israel. You may call this racism if you like but it's certainly not a lack judgment.

Also, most people use racist insults while not really being racist. It's a way to differenciate themselves from the other. It's like the brunette who bitches the blondes.

The parent poster also said the guy was only a kid. But you didn't reply to this. My guess is you're the kind of person who feels good about himself by "fighting" racism. My guess is you're the kind of person who refuse to see cultural difference. And to me it's the most evocative measure of a person lack of judgement.

wtf by SmallFurryCreature · 2004-08-15 02:56 · Score: 1

Ehm, try both linux and bsd? The processors named are not exactly that rare. Both are supported by linux and maybe bsd.

--

MMO Quests are like orgasms:

You may solo them, I prefer them in a group.

Original source? by j1m+5n0w · 2004-08-15 02:57 · Score: 1

Does the above article have an original source? I'm guessing it didn't just spontaneously appear on half a dozen weblogs, it was probably written by someone who would like credit for his/her work? Perhaps this is why the story was rejected?

-jim

Re:Original source? by GuyFawkes · 2004-08-15 03:43 · Score: 2, Informative

quoted in the submission
http://www.realworldtech.com/forums/in dex.cfm?acti on=detail&PostNum=2527&Thread=1&entryID=35446&room ID=11

hasn't been rejected yet, still pending, but no doubt will be.

--
http://slashdot.org/~GuyFawkes/journal

Why this is still a joke by Anonymous Coward · 2004-08-15 03:01 · Score: 1, Insightful

Why don't they just summarize each test/graph with the PERCENT difference?

Instead, we get gems like this:

This becomes our first real world test where we see Intel come out ahead. This coincides with what we saw on the previous page with the synthetic benchmark.

On a result where the Xeon is 55.894 and the Opteron is 56.26... Since when is half a percent significant?

And where is the kernel compile benchmark?

Re:Why this is still a joke by SharkDiver · 2004-08-15 05:09 · Score: 0

So you can look at older reviews and compare raw numbers. What's wrong, no feel for raw numbers?
Re:Why this is still a joke by Anonymous Coward · 2004-08-15 12:10 · Score: 0

So you can look at older reviews and compare raw numbers. What's wrong, no feel for raw numbers?

Sorry. As a performance engineer, I sometimes expect too much from people who aren't accustomed to focusing on what is important when looking at data.

I didn't say drop the raw data, I emphasized the importance of percentages.
Re:Why this is still a joke by TheLink · 2004-08-15 13:45 · Score: 1

Yawn. Those who find percentages important will calculate them.

I prefer raw numbers because that's one less thing for the reviewers to get wrong - they may screw up the calculation or be tempted to draw stupid graphs that show nothing.

And I'd prefer to see the time taken to convert a 700MB wave file compared in seconds rather than percentages. Gives me a better feel of how long a system will take to do other sized files, and whether I should fork out the X bucks for the next faster system. Percentages aren't so useful in this case.
--
- Too many replies beneath your current threshold

?!?! nocona ??! by rmr00t-f · 2004-08-15 03:03 · Score: 2, Funny

cona in portuguese jargon, is a vagina... :D

Re:?!?! nocona ??! by Anonymous Coward · 2004-08-15 05:28 · Score: 0

So in other words, what we can really draw from all of this is that the type of people that would buy a nocona processor get novagina. Makes sense to me.
Re:?!?! nocona ??! by Anonymous Coward · 2004-08-15 06:47 · Score: 0

Your years of studying Portugese has finally paid off in a very climactic post here on Slashdot.

How do you feel?
Re:?!?! nocona ??! by Anonymous Coward · 2004-08-15 08:32 · Score: 0

Sure. It's got BALLS, man!

Re:I have a great idea by Anonymous Coward · 2004-08-15 03:08 · Score: 0

But the problem is that many of those shortcut decisions aren't really accurate. The other problem is that you don't need to trust the article if the facts are supported by a methodology. If you still don't trust that, then you've got to believe that rather than bias, actual lies are taking place. And you can bet that people will check the facts if the methodology is included. It's just silly to use cultural stereotypes when there are more fine grained indicators available (such as the style, presentation of methodology, etc).

And the sort of differentiate you refer to (blond vs. brunnette, for instance), is rather insidious. It sounds harmless, but it's amazing how quickly that sort of us versus them mentality can balloon. You may recall a psychology study in which the experimenter taught young children to form groups based on eye color -- after a while, the groups started to display shockingly divisive behavior. Another is the Zimbardo prison experiment (Google "Stanford Prison Experiment"), which demonstrated that when two groups were randomly formed -- one of prisoners, one of guards -- and they were asked to role play, the role play went beyond simple acting to actual maliciousness. The point is that these distinctions people make are self-reinforcing and can lead to excessive bias. Yes, you might not trust an article that has no methodology, but if you're going to read the article and it seems up-front about its methodology and you can evaluate it, then you should evaluate it on the merits: it doesn't matter who wrote it. If you really just want an opinion without doing any analysis yourself, then that's your perrogative, and you might want to seek out the best sources possible. But I think cultural distinctions are far less important than the concrete evidence provided by the article.

Is there somewhere that details all the opterons? by Anonymous Coward · 2004-08-15 03:13 · Score: 0

There seem to be a million of them.

A 240 is 1/3 the price of a 250, is it still a great processor? I haven't found a site that goes into all of them and of course the reason for favoring one over another

Heute ist erste, nicht siebente Tag, dummkopf by Anonymous Coward · 2004-08-15 03:17 · Score: 0

Friday sundown till Saturday sundown was the 7th day

I don't know, the world is full of religious fundamentalists and they don't even know simple facts about their religions.

Genesis: And the evening and the morning was the first day.

Re:Heute ist erste, nicht siebente Tag, dummkopf by joerg · 2004-08-16 00:05 · Score: 1

Genesis started at day zero, not day one!

If you want controversy... by YU+Nicks+NE+Way · 2004-08-15 03:38 · Score: 1

go read this Infoworld review. They don't list any real figures that I could track -- do other reviews replicate their results?

... and to make it better ... by NerveGas · 2004-08-15 06:04 · Score: 1

These benchmarks show the Opteron 150, a $600, real-world-available chip handily beating a $850, non-real-world-available chip. But that's still not the half of it.

Wait until tests are run on multi-CPU machines. Because the Opterons scale so much better than Xeons, the performance advantage of the Opteron will be even greater.

When I've bought a quad Xeon machine, I've never been at all impressed with the scalability. When I bought a quad Opteron, I was blown away.

steve

--
Oh, you're not stuck, you're just unable to let go of the onion rings.

Re:Is there somewhere that details all the opteron by Fweeky · 2004-08-15 06:21 · Score: 3, Interesting

240 = 1.4GHz, £145
242 = 1.6GHz, +£15 / +14% faster clock
244 = 1.8GHz, +£90 / +28%
246 = 2.0GHz, +£190 / +43%
248 = 2.2GHz, +£345 / +57%
250 = 2.4GHz, +£465 / + 71%

First step's a no-brainer; next one isn't too bad, after that you're hitting significant diminishing returns, with each 200MHz gap being a smaller proportion of the total clock, not to mention other things becoming more likely to bottleneck (IO; memory bandwidth, disk latency, network, PCI bus, etc).

Core differences are going to be minimal, and hypertransport's remained at 800MHz across the S940 range afaik, so the clocks *should* be a pretty accurate upper bound on the performance differences within each range.

My own results by davros74 · 2004-08-15 06:23 · Score: 2, Interesting

We have been benchmarking several loaner boxes at work to determine what will be our next purchases for our compute farm. We do primarily ASIC and FPGA design, simulation and verification. We have been in dire need of >4GB boxes, and until just recently, we had been forced to run on Solaris machine to get 8GB.

The day of the Opteron, however, has come at last:
All these were run with stock tools in 32-bit mode, no fancy compiler optimizations. These are the same programs that we run on 2GHz P4s.

Agilent 3070 VCL vector conversion Perl program (which I wrote, this is very typical of the Perl programs we run to process large vector files - the benchmark only times data processing in memory, no file IO on read/write):
Sun Blade-1000 750MHz: 103.08 sec
P4 3.06GHz: 36.93 sec
Opt 148 (2.2GHz): 27.01 sec
Quad Opt 848: 27.42 sec
Quad Xeon64 (3.6GHz): 31.17 sec

Modelsim 5.8c simulation of LogicBIST simulation on 50K Flop ASIC:
P4 3.06GHz: 5955 sec
Opt 148: 3798 sec
4x Opt 848: 5985 sec (See note below)
4x Xeon64: 4858 sec

Mentor Flextest fault grading using make -j1, -j2 and -j4 (parallel runs, results combined in later step that is not benchmarked):
Sun Blade-1000: 7362 sec(-j1)
P4 3.06GHz: 2188 sec(-j1)
Dual P4 3.06GHz: 2189 sec/1333 sec (j2)
Opt 2.2GHz 128: 1493 sec
4x Opt 2.2GHz 848: 1562 sec(j1)/ 779 sec(j2)/ 393 sec (j4)
4x Xeon64 3.6GHz: 1465 sec(j1)/796 sec(j2)/ 879 sec(j4)

Mentor LbistArchitect on 50K ASIC:
Sun Blade-1000: 15698 sec
P4 3.06GHz: 3877 sec
Opt 148: 2845 sec
4x Opt 848: 3534 sec (See note below)
4x Xeon-64 3.6GHz: 2604 sec

Note - the poor performance of the quad opteron box was done on RedHat Enterprise Linux 3 AS-6, and I noticed that the SMP kernel did NOT have CONFIG_K8_NUMA set to y, so it's not fair to judge those numbers until we get a new kernel with ccNUMA support. I have run synthetic benchmarks on them too, and the memory performance on the Quad Opteron was indeed hurt by the lack of CONFIG_K8_NUMA in the linux kernel.

Clearly though, the HyperTransport makes the Quad Opteron box scale very well, whereas the Quad Xeon box choked on 4 threads, probably beacuse the memory bus became saturated and the processors starved for data.

Also, any serious optimizations need to use gcc-3.4.1 - which has specific optimizations for both Opteron and Nocona cores. gcc-3.4.0 does not have specific optimizations for Nocona ("Xeon64") cores. gcc-3.x does not have specific optimizations for Opteron.

Anyway, our decision has been made - we are buying Opteron 150s for all our new compute farm boxes.

Re:My own results by Anonymous Coward · 2004-08-15 14:04 · Score: 1, Informative

Very interesting. When I was in grad school, my research group did full custom designs with Cadance. When they finally got a copy of the tools for x86, they tested a Xeon 2.4 against a $60,000 Sun V880. Sure enough, 100% improvement on x86(5 minute DRCs down from 11 minutes). They only use the Suns now for the really large design that needs more than 4GB memory.

intel fails YET AGAIN by sammcj2000 · 2004-08-15 08:12 · Score: 0

what i find interesting this that on the benchmarks, the opteron 150 kicks the pX to hell and back, but what is interesting is that the 32/64bit Gflops of the pX are higher, suspecting to me that they want to belive that they will perform in real life situations, when in actual fact they dont...

Re:I have a great idea by Anonymous Coward · 2004-08-15 09:28 · Score: 0

Well you can't win em all. Thank god you didn't accuse me of ageism too, or my post would have started to look REALLY bad!

Heh! Subscribers first, dude! by Anonymous Coward · 2004-08-15 10:06 · Score: 0

Especially Australian ones on weekends! ;-)

Fair comparison? by illumina+us · 2004-08-15 10:24 · Score: 2, Insightful

This is comparing Intel's latest chip to a very old Opteron.

First of all, AMD's Opteron 150 is the highest performing AMD workstation CPU money can buy

What about the AMD Opteron 850?

--
-illumina+us "I put on my robe and wizard hat..."

Re:Fair comparison? by Lucretian · 2004-08-15 10:47 · Score: 2, Informative

The way the opteron numbering system works is the first number is the amount of CPUs you can run SMP. The 150 is the fastest single CPU you can buy right now, the 850 is running at same clock rate as the 150 but can run in an 8way opteron system (if the boards ever become available). Right now you'd mainly find the 850s in a 4way system. The 850 would definitely be the most expensive opteron, but as a single chip would perform the same as the 150.
Re:Fair comparison? by davros74 · 2004-08-16 02:26 · Score: 1

Because of the ccNUMA architecture for the 2xx and 8xx Opterons, if what you need is a really fast single processor box, it's best to stick with the Opteron 1xx - since then all the RAM you install is local to that processor, and you do not need a NUMA aware kernel/OS.

However, if you do want multiprocessing, the quad 850s are very very nice - but you need to make sure your OS/kernel supports ccNUMA correctly or the memory performance will hurt. Even with the correct support, however, an 8GB 4-way Opteron 850 will only have 2GB local to each processor - so if you have a big job that requires 4GB of RAM, it will run just slightly slower than a single Opteron 150 with local access to all 4GB of RAM.

The penalty isn't too bad however - I believe the HT links are 3.2GB/sec 1-way (6.4GB/sec total) and the memory bandwidth at the processor is 5.4GB/sec, IIRC.

Re:games are poor example for Intel performance .. by Anonymous Coward · 2004-08-15 10:51 · Score: 0

Things are very different with video encoding - there Intel usually wins.
When it comes to a standard a64 under windows, yes. Opterons are pretty close(and in some cases faster), and when you move into dual, Opterons dominate dual Xeons, even in media encoding, according to the reviews I've read and the tests I've done on my own system.

I know you said "usually", but I wanted to add a bit more specifity to it. ;)

Re:I have a great idea by Anonymous Coward · 2004-08-15 12:00 · Score: 0

For example, I won't trust an article praising Intel over AMD if it was made by a jewish person. I'd think there is a possibility he may have a political agenda. The opposite is also true. I won't trust someone praising AMD if he is openly against Israel. You may call this racism if you like but it's certainly not a lack judgment.

Okay, here's my big fat ignorance here, but why do you say this? I have to know. The thing is, I am strongly critical of Israel(I don't see myself as against them, so much as against some of the things they've to do as a matter of policy) and I happen to favor AMD, BUT, my choice of AMD had nothing to do with this. Is there something about AMD being anti-Israel I'm not aware of?

Re:I have a great idea by Bedouin+X · 2004-08-15 12:44 · Score: 1

I think that it has to do with Intel having a (very good) corps of engineers in Israel. Some people can make the massive leap from this tidbit to "Intel supports the murder of Palestenians!!!"

Or it could be something else.

--
Dissolve... Resolve... Evolve...

Re:I have a great idea by Anonymous Coward · 2004-08-15 14:07 · Score: 0

That group in Israel designed a fully asynchronous instruction decoder. They were able to get a 50% improvmnent over synchronous designs. This is very, very, very hard to do in digital logic design.

Re:Original source? (from the author) by Anonymous Coward · 2004-08-15 14:18 · Score: 0

I originally wrote it for Real World Tech as a forum post. It was just a technical memo, not a full article like Crusoe Exposed was. I'm actually surprised neither of these made it as a Slashdot story (I wasn't aware they rejected unsigned articles.)

Obviously I cannot reveal my identity. That's why it was posted anonymously, like all my chip reverse engineering work.

Suffice it to say that I've been in contact with top level engineers at AMD and am working with them to fix the problem ASAP. An exploit at this level could be nasty.

Price/performance still the same? by edxwelch · 2004-08-15 22:30 · Score: 1

One year ago I bought a Athelon XP 2600 based PC. At the time the chip itself cost 65 euros. I thought it was good value for a relatively speedy processor at the time. I just checked in the shop where I bought the PC (http://www.softworld.es/micro_amd/). It seems the 2600 is still the best value for price/performance - the cheapest Athelon 64 is 253.95 , three times as expensive and only 15% faster. So much for a years worth of technological adavancement.

Re:Is there somewhere that details all the opteron by holgie · 2004-08-16 04:23 · Score: 0

Shouldn't it be?:
47 65 65 6b 21 20 3a 70

Sun: making systems fly by mgcarley · 2004-08-22 22:38 · Score: 1

Remembering that Sun generally uses pretty good quality componentry, as well as possibly some enhancements to make these systems fly, of course... The opteron has a higher bus speed and memory control and 64-bit goodness and... well... the list goes on, no?

--
Founder & COO, Hayai India (hayai.in) / USA (hayaibroadband.com) // t: @mgcarley

Slashdot Mirror

Linux Shootout: Opteron 150 vs. Xeon 3.6GHz Nocona

217 comments