Slashdot Mirror


Opteron Gaming Benchmarks

bishop writes "Ace's Hardware has published some Unreal Tournament 2003 benchmark results on a 1.8 GHz Opteron 244. The Opteron servers out right now don't have AGP, but this issue has been nullified, literally, through the use of the 'null renderer' option in UT2003 to bypass the display output. At 1.8 GHz, the Opteron manages to outpace all previous Athlon models, though it does still fall behind the 3 GHz Pentium 4 by about 8%." Only 8% slower in performance with a 40% slower clock speed. Not too shabby.

26 of 39 comments (clear)

  1. Nice Stats! by mungeh · · Score: 3, Insightful

    I can't wait till the Athlon-64's come out! Maybe they'll be able to surpass a P4 3ghz?

    1. Re:Nice Stats! by The_K4 · · Score: 1

      However at that point the question is at that time (Sept) what will Intel be shipping? P4 3.5 ghz? P4 4.0 ghz? Sure it should beat a P4 3.0, but that's avaliable today. It's gunna be interesting to see how it compares to other procs avaliable when it ships.

  2. Um Null Driver? by vandel405 · · Score: 3, Funny

    One Function explains it all ...
    bltMegsOfTextureMaps(...) {
    return;
    }

    "hum, nah, that shouldn't mess up the results..."

    1. Re:Um Null Driver? by ChadN · · Score: 2, Informative

      But seriously. The graph indicates that they used the "null" driver for all setups (and even tested with graphics boards of VASTLY different performance to make sure they all performed roughly the same with the "null" driver). So one can infer it does test CPU and memory speed (primarily).

      --
      "It's overkill, of course. But you can never have too much overkill." - Anonymous Slashdot Coward
  3. Athlon versus P4 performance.. by Vector7 · · Score: 4, Informative

    You know, I find it pretty interesting how vastly AMD's chips outpace the P4 clock-per-clock.It's widely acknowledged that the P4 gets less done per clock than the P3 did. Some people have said that the P4's memory architecture is a disaster, or that it is pipelined TOO deep, but I've got a sort of conspiracy theory about this. Granted I'm not a computer engineer, and know just enough to hang myself with, but here goes:

    I think they've manipulated the design so they can deliberately increase the clock rate for marketing reasons, without getting proportionally more performance out. Basically I suppose they've taken the longest paths through the chip and stuck latches all over the place so that the overall cycle time can be reduced, but operations that used to take one clock cycle now may take two. When 1.8 GHz AMDs can nearly match the speed of a 3 GHz P4, I don't think this is an entirely unplausible theory.

    On the other hand, maybe the P4 really just is a pig. There was some discussion on usenet a while back about how it takes upward to 2000 clock cycles to enter and exit an interrupt handler on the P4, something which an old 486 could do in ~45 cycle IIRC (and I don't recall exactly, but I think the Athlon today can do an INT/IRET pair in a few hundred cycles). Curious how in hyper-optimizing these chips for the most common cases of execution, performance of these sorts of periphery operations goes all to hell.

    That said, I'm really looking forward to having 64-bits on the desktop. =)

    1. Re:Athlon versus P4 performance.. by Sevn · · Score: 3, Informative

      I know my bud with a p4 3.04 gets steaming pissed
      when my athlon 1700+ running a 184 bus times 12
      crushes his machine by 10 minutes on 'make world'
      on FreeBSD. Even using a -j flag to take advantage
      of the HT goodness doesn't seem to help him much.
      I thought it might be I/O, but his drive is faster
      than mine too with hdparm -t and -T.
      My UT2003 benchmarks are faster than his even
      though he has a ti4200 with 128mb and I'm running
      a ti500. All the supposed memory bandwidth just
      doesn't seem to be there for him. Nforce2 plus
      Athlon TbredB overclocked is a great way to fly.

      --
      For every annoying gentoo user, are three even more annoying anti-gentoo crybabies. Take Yosh from #Gimp for example.
    2. Re:Athlon versus P4 performance.. by madprof · · Score: 1

      Everyone says they upped the depth of teh pipeline (20 stages) so they coudl push the MHz up too, but the engineers think this is so laughable as to be pathetic.
      As much as it'd be a great story if it were true, Intel are not going to deliberately put out a chip that they know can be outclassed so easily.

    3. Re:Athlon versus P4 performance.. by PD · · Score: 1

      You have just posted the most useful benchmark in the world to me. I don't care about Unreal, or winstones, or whatever. I run a compiler on my machine and that's it. The make world test is the one I care about.

      Can you post what the total make world time on your machine is? You only mentioned that it was 10 minutes faster.

    4. Re:Athlon versus P4 performance.. by Sevn · · Score: 1

      Ok, I'm running 4.x stable. I'm also dual booting
      Gentoo at the moment. I was running Gentoo for
      a while waiting for issues with the nforce2
      support to sort themselves out. I ran hdparm
      under Gentoo. Steve's IBM drive is faster than
      my WD800JB. Pre 3.2.2 GCC was pulling 23 minutes
      for everything BUT I whack my /usr/obj tree
      before running make buildworld && make installworld
      && make buildkernel && make installkernel.
      If something dies in any of those, I want it to
      stop dead. At this point it's taking 27 minutes
      for those 4 steps. I have 768 megs of ram clocked
      at the same speed as the processor. If you are
      going to overclock an Athlon, make sure you
      get a 1700+. Make very sure it's a TBredB. When
      I picked mine up I bought three for 129. I cherry
      picked the one that overclocked the best and sold
      the rest on eBay. Since you are overclocking,
      there is little point in getting the Boxed CPU
      unless you want to intentionally void a warrantee.
      I HIGHLY recommend my cooling setup.
      I have a thermalright SLK800 heatsink, and a
      80mm to 120mm fan adapter. I have an AOC 120mm
      aluminum fan to match the aluminum fan adapter.
      You can run the Thermalright's wire fan harness
      through the holes in the fan adapter backwards to
      fasten it very securely. The AOC 120mm fan rocks
      because it pushs 80 feet of air, but only makes
      29db of noise. It's also cheap as sin. The only
      downside is it has a 4 pin molex connector, so
      no speed sensing. Even if you can only get
      166 x 11 out of your 1700+, that's still
      comparable to the 2600+. You don't want to go
      much higher than 1.85 volts to stabilize the
      processor. I'd suggest starting with the voltage
      there, then seeing what you can get. Bus speed
      is a ton more important than multiplier. Also,
      the Nforce2 chipset is the ONLY choice at this
      point because overclocking your bus speed does not
      overclock your other system buses like it does
      on a kt266, kt333, or kt400. I also recommend
      either corsair, or kingmax brand ram. The UT2003
      benchmarks are great for noticing the different
      between clocked and nonclocked. Messing with the
      nVidia driver settings is about pointless unless
      3 or 4 frames per second difference is worth
      the instability potential. I was able to get
      SBA and AGP 4x running, but no fast writes. It's
      rocks solid at 2x. You see more performance
      using nvclock to push your card than you will
      going above agp 2x. Don't forget to kick a tiny
      bit of voltage to your ram. 0.2 volts should be
      enough. Make sure your case has very good
      cooling characteristics. I'm a big fan of generic
      Cheiftek aluminum cases with thermaltake
      powersupplies and Vantec stealth fans. Don't go
      crazy with fan filters, you'll just cut off your
      airflow. Keep a couple cans of air around. I
      set up a cron job to remind me to dust the case
      out once a month. It's not a bad idea to make
      a leka bootdisk with memtest86 on it so you can
      run a thorough memtest on your system if you get
      a successful high clock. It's really crappy what
      pissed off ram will do to your system. My rule is
      that if the system doesn't lock up, explode, or
      have a single failure with all 16 of the tests
      ten times through, it's good. And last but not
      least, get some arctic silver 3. Nothing else
      even comes close, it's not just hype. At 72F
      room temp my system never gets above 47C under
      extreme load and idles at 38C. Hope this helps.

      --
      For every annoying gentoo user, are three even more annoying anti-gentoo crybabies. Take Yosh from #Gimp for example.
    5. Re:Athlon versus P4 performance.. by AvitarX · · Score: 2, Insightful

      I have a therory.

      They realized that if they made a design that allowed them to ramp up the clock speed to profane levels, they would have the best overall performance.

      That p4 3GHz is faster then the fasted AMD has to offer. Considering that not too lang ago AMD had the fastest offering for quite a while, I would say the stradegy worked.

      There is nothing wrong with designing a chip for ultra high clock sppeds then attaining them. The only problem I had with the p4 was that right when it came out it was slower overall then the best pIII's.

      --
      Wow, sent an e-mail as suggested when clicking on "use classic" banner, and got a fast response that addressed my msg
    6. Re:Athlon versus P4 performance.. by spongman · · Score: 1
      You know, I find it pretty interesting how vastly AMD's chips outpace the P4 clock-per-clock.
      Not that interesting. The Opteron's pipeline is 40% shorter than the P4's. That's why the P4 can and must run at a higher clock speed to get equivalent performance. The same amount of work is being done, it's just split into different sized chunks of time.
  4. cool by Sevn · · Score: 3, Funny

    I'd pay 50 dollars!

    I'll run out and buy one for my blind neighbor.
    so I can cream him on tokara forest.
    Maybe I can hook up a more advanced version of
    that cool braille output doohickey from the
    movie sneakers so he can feel the pain with
    his fingers.

    --
    For every annoying gentoo user, are three even more annoying anti-gentoo crybabies. Take Yosh from #Gimp for example.
  5. tsarkon reports - opteron - good to rock the boat by Anonymous Coward · · Score: 2, Interesting

    I think the Opteron has the potential to do a lot more than be a gaming spectacle, but clearly, between the preliminary SPEC marks, various performance metrics on database machines, coupled with a real bus (hyper transport / alpha - like bus) and being reasonably competitive at a modest clock rate is impressive to say the least. And its ability to chew through legacy un-optimized code is leagues better than Itanium2, which is considering an FX!32 like on the fly "re-compiler" to help with horrific performance.
    The novelty here is "I can run junk binaries and be very good at big applications at the same time."

    Since when has a potential piece of server iron ever run a gamer benchmark well? The versatility of the Opteron is apparent right out of the floodgate.

    This industry needs a bit of boat rocking right now. I will sit back and watch the competition, but this Opteron is quite possibly the first step in de-cheesing and de-crudding the horribly "consumer" oriented PC design.

    Sure, this isn't a SPARC,HP-PA, Power4 or a MIPS. But this is so less "gay" than Itanic2. This gives serious power to the masses at a great price. People, lets consider rewarding AMD.

    Sun and IBM have both indicated that they will be coming up with designs around the Opteron recently. I remember wanting that Power4 a while back. I remember hoping Apple de-vaporized the 970. Now the Opteron is here, and well, I don't want to wait for a Power4 or a 970, lets take this baby and run with it.

  6. 1 CPU - I want 8-way !! by DrSkwid · · Score: 2, Interesting

    You know, it's always saddened me that there aren't more multi-cpu motherboards for general consumption.

    There's plenty of us that would stick another £200 CPU in the box every now and then as the cash came in.

    I mean, I've already got more memory than I ever use and too much HD storage, I want CPUs I never use either.

    I like my DUAL 1.2 Ghz p3 mb. I put it together for £300 or so and it's done sterling service, surpassing my normal 18 month upgrade timetable and not looking like it's about to be retired any time soon. (until DoomIII & HL2 I guess 8).

    I'll be looking to buy a nice 4-8 way Opteron system and populate it as I go along.

    I think Intel missed an opportunity crippling their processors to be non SMP. I hope there's an engineering reason why. I can't really think of a good reason why otherwise, surely they don't make *that much* on gambling that I'll buy a expensive Pentium if I can't have a 4 way Celeron set-up.

    --
    There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
    1. Re:1 CPU - I want 8-way !! by NerveGas · · Score: 3, Interesting

      surpassing my normal 18 month upgrade timetable and not looking like it's about to be retired any time soon

      That is precisely the reason why dual-CPU machines are much more useful as desktops that the pundits would have you believe.

      You'll hear over and over that "you won't use the other CPU", or "your apps won't take advantage of it." In reality, dual-CPU desktops are so much more responsive under load that they still feel "quick" much, much longer than their single-CPU counterparts will.

      I have a dual Pentium 133. With NT4 on it, it's just as quick and usable as a P3/650 with Win98. CPU-intensive tasks do take longer, but the machine is still so responsive that you really don't notice it.

      Here are the reasons: First, you *are* running more than one program at once, even if you don't notice it. On the Windows side, even with one app, you still have at least 30-40 different threads and/or programs running that you're not aware of - logging daemons, mouse daemons, graphics drivers, timers, and the copious other programs that X and/or Windows will launch. Then, if you're doing net or disk access, that's even more there.

      The second reason is because of the interrupts. In a dual-CPU machine, one CPU can be getting hammered by interrupts, and you still have another to run other code, such as your GUI.

      After having used dual-CPU workstations, I'll never build myself another single-CPU setup again. If that 2xP133 is still a nicely usable machine, I really can't imagine how long my 2xAthlonMP 1800 is going to last!

      steve

      --
      Oh, you're not stuck, you're just unable to let go of the onion rings.
  7. I don't trust the numbers. by Guspaz · · Score: 2, Informative

    Oddly enough, Anandtech's numbers show the complete opposite: The Opteron beat ALL other CPUs at ALL games. Including the P4 3.0C in UT2K3. Take a look:

    http://anandtech.com/cpu/showdoc.html?i=1818&p =6

  8. hmm by strider44 · · Score: 1

    It's interesting to see what's almost a disclaimer after the benchmark results saying that it is not a final benchmark result basically because it wasn't using the full capabilities of the chip.

    There has been no benchmarks so far except perhaps for a few Linux benchmarks where the Opterion has been nearly as optimised by the software as the Pentium, Athalon or Xeon simply because the software producers and developers haven't caught up to the chip, and it's a big transfer as well.

    In my opinion, we won't see any true representative benchmarks for months yet, perhaps when the A64 has been released.

  9. performance:price? by niall111 · · Score: 1

    Only 8% slower in performance with a 40% slower clock speed. Not too shabby.

    but what is the price difference?

    1. Re:performance:price? by Zuph · · Score: 1

      At Newegg, you can get a 1.4 opteron for $315 for shipping. Assuming the Equivalent to that is about a 2.7 or so P4, the 2.66 is $219 at newegg, and the 1.8 is $319. This is just from one vendor though, so these may no be the best prices. Also, the opteron is designed for server applications. A Better comparison might be a Xeon, which at 2.66 is $308. So judging by performance, about the same cost. The Opteron can only come down in price :)

  10. Re:1 CPU - I want 8-way !! tsarkon reports. by Anonymous Coward · · Score: 2, Interesting

    The second reason is because of the interrupts. In a dual-CPU machine, one CPU can be getting hammered by interrupts, and you still have another to run other code, such as your GUI.

    True to a point. There must be a working/efficient APIC and the OS has to properly manipulate this for both CPUs to service interrupts. While I strongly agree with a 2-way system being leagues better than "faster" single boxes, there are big-time problems with various OSes including Linux where say, putting a GigE card in, then putting it in promiscuous mode, then sending 200,000 packets per second at the box causing complete and total livelock. The problem comes with the ability for interrupts to very easily monopolize the whole system. Other architectures have solved this problem with things like crossbars, more efficient busses, etc.

    I am hoping the Opteron will handle IO better - a lot better. We have a very nice Serverworks dual P3 implementation right now with 64bit 66MHz PCI cards (E1000, 3C996). It is fairly easy to send enough traffic to livelock the box. The CPU is cold to the touch, its not doing anything, but it is live locked. So, with the APIC turned off, 1 CPU is 100% used up by interrupts. With it on, 2 CPUs are 100% used up by interrupts. Interrupts steering isn't an availability guarantee is my point.

    I have solved problems with a polling feature available in FreeBSD, where you can set the maximum amount of CPU time (interrupts fall under "system") that the network driver can take up. The point is to have the ability to set a fraction of the CPU aside for userland. This is very useful for network applications that require seeing lots of traffic, like Snort, but need CPU to do other things.

    The point is that the PC architecture is seriously flawed in many ways. If you crack open the hood of a Juniper you'll see a "PC brain" running FreeBSD, but they solved these crazy architectural flaws by having network processors. Everything is co-processed in Juniper land.

    Unfortunately, even the best NICs with interrupts coalescing and all the CPU cycle saving junk turned on doesn't help for network applications that need to see it all.

    I'm hoping for better IO and better 2/4 way performance from Opteron, and I think it will deliver. Given that UT2003 is completely un-optimized for Opteron yet Opteron can play ball with what it considers "junk binaries" is impressive.

    All in all, I must say, the minimum number of CPUs for anyone who is in the slightest technically inclined is 2.

  11. Re:Athlon versus P4 performance.. tsarkon reports by spongman · · Score: 1
    And this explains the SPEC marks kicking the P4's ass how?
    well, it doesn't explain it mainly because i wasn't trying to. i was trying to explain why the clock-for-clock argument is invalid, but there are plenty of other reasons why the two processors are different, the pipeline length is just one of the most significant.

    I'm just going to ignore the rest of your rant - I didn't even read it.

  12. Re:Athlon versus P4 performance.. tsarkon reports by spongman · · Score: 1
    I think the 2 year-old kid is the guy that gets upset at being ignored.

    Oh yeah, and why the AC?

  13. Re:Athlon versus P4 performance.. tsarkon reports by spongman · · Score: 1
    I have a karma capped account for other things.
    How very schizophrenic of you.
  14. Re:Athlon versus P4 performance.. tsarkon reports by spongman · · Score: 1

    You're hillarious. Please, continue...

  15. Re:Athlon versus P4 performance.. tsarkon reports by spongman · · Score: 1
    But not quite adequate enough that you could resist one more time, eh?

    I don't see that you've said much of anything, especially concerning the original point - CPU instruction pipeline lengths. In fact, most of what you've said has been a reflection of your character. Maybe that's why you've chosen to remain anonymous? Surely your karma rating isn't that important to you?

  16. Re:1 CPU - I want 8-way !! tsarkon reports. by NerveGas · · Score: 2, Interesting


    The situation with the gigE cards isn't entirely the fault of the OS (although the OS certainly isn't blameless). GigE cards simply generate *swamps* of interrupts. However, the nicer gigE cards give you the ability to turn on interrupt coallescing, where the card will queue up several packets before it sends an interrupt.

    That does, of course, increase the latency, but unless you're in a clustering environment, you can generally handle a small hit in latency much better than you can handle a swarm of interrupts.

    On the note of embedded PC architecture, yesterday I helped my boss crack open an old GPS to try and replace the interal battery. What did I see staring back at me? A 25 Mhz i386! It was, of course, in a *much* smaller package than the "regular" 386's were in, I suppose that Intel makes (or made) them on a smaller process for a while for projects like that. And I recall $15,000 Livingston PM3's which used an AMD 486. : )

    steve

    --
    Oh, you're not stuck, you're just unable to let go of the onion rings.