NASA Benchmarks the New G5 Powermac

Single Processor Mode by CptChipJew · 2003-07-04 09:30 · Score: 5, Informative

Because I have a strong feeling this is going to be asked:

For those of you who were wondering, you too can switch off one of your Mac's dual CPU's with the Apple CHUD Tools. Look near the bottom of the page. It'll make you appreciate your second processor ;)

Personally though, I want to see how well it runs Seti@Home.

--
Vonal Declosion

Re:Single Processor Mode by jbm · 2003-07-04 09:50 · Score: 5, Informative

you ... can switch off one of your Mac's dual CPU's with the Apple CHUD Tools.

You can also do this simply with the cpus= boot argument; here's a reference.
Re:Single Processor Mode by Andy_R · 2003-07-04 11:08 · Score: 2, Insightful

But why on earth would you want to? Unless you are an intel/amd fanatic trying to prove a rather dubious point by crippling the Mac or an imb/apple devotee trying to skew the field by giving 1 cpu the entire 1Ghz bus bandwidth. Either way it's not going to give a useful result.

If you can think of a good reason to turn off 50% of your processing, why not save a lot of money and go for one of the two mid-range single cpu G5 configurations that Apple will happily sell you?

By testing in this ludicrous 'half a machine' mode, we don't get any of the potentially really interesting information about the efficiency of apple's dual cpu hardware and OS support, and how much difference their 1Ghz bus makes compared to intel/amd offerings. It's this area which will make or break the claims of 'fastest PC'.

--
A pizza of radius z and thickness a has a volume of pi z z a
Re:Single Processor Mode by Goldfinger7400 · 2003-07-04 11:13 · Score: 3, Insightful

I believe this is useful to allow developers to better understand their applications performance on a single cpu system, without the need to buy yet another mac. I believe that's why the tool is offered from the Developers portion of the site.
I also remember a few games having issues on dual processor computers, a while back. Though most of these issues have been resolved now, it'd be interesting to see if the problems cleared up if you disabled a cpu.
Re:Single Processor Mode by Spyritus · 2003-07-04 11:15 · Score: 2, Informative

In the Power Mac G5 Dual Processor, there is a DUAL 1GHz bus, 1 for each processor.
Turning off a processor does not give the other one any performance increase at all.
Re:Single Processor Mode by furballphat · 2003-07-04 11:25 · Score: 5, Informative

You only get the CPU tab if you install the CHUD tools, so you'll have to install no matter what.
Re:Single Processor Mode by Andy_R · 2003-07-04 11:29 · Score: 4, Insightful

Well, that is exactly the sort of claim that needs to be verified by testing.

Given that the 2 processors are (presumably) sharing 1 video card, one set of RAM chips, one I/O system, and one disk drive, surely there must be *some* kind of performance hit from sharing the rest of the machine with a second processor? Or some performance drain from putting all the housekeeping talks onto just one of the 2 processors?

I think this adds weight to my original point - that benchmarking a dual G5 against dual amd or intel systems is the only way we'll settle this 'fastest PC' question, and that arbitrarily shutting half the sysem off isn;t a sensible way to test things.

--
A pizza of radius z and thickness a has a volume of pi z z a
Re:Single Processor Mode by The+Infamous+Grimace · 2003-07-04 11:34 · Score: 4, Informative

"No shit. You can also not be a total ramrod and do it form System Preferences. See the CPU icon? Check it out."
No shit. You can also not be a total ramrod and realize that one needs CHUD installed for that particular System Pref. (fyi : CHUD = Computer Hardware Understanding Developement Tools)

You should check your facts before you flame.

(tig)
"We do not inherit the land from our ancestors"
"We borrow it from our children"

--
Ignorance and prejudice and fear
Walk hand in hand
Re:Single Processor Mode by artur9 · 2003-07-04 12:25 · Score: 4, Insightful

My take.
Since everything inside the G5 is "point to point" then unless both CPUs need to talk to exactly the same thing at exactly the same time then the performance hit will be almost impossible to detect. (I wonder how RAIDing drives could make this overhead even more improbable?)
Since there is a lot of Mach-ness in the OS X kernel I would also think that there aren't too many critical regions there that force the use of only one CPU. In other words, I think the situation is better than that kernel (was it an early Solaris?) that had one mutex around the whole kernel.
As usual, anyone willing to pay me to benchmark one of these puppies for them? :-) The machine is suitable payment.

--
------- MacOS X, WebObjects, Apple (G5) hardware triply tied
Re:Single Processor Mode by Chazmati · 2003-07-04 13:08 · Score: 4, Funny

Man, and all this time I was thinking C.H.U.D. stood for Cannibalistic Humanoid Underground Dwellers!
Re:Single Processor Mode by Knife_Edge · 2003-07-04 13:46 · Score: 3, Interesting

Well, there is one I/O system, the bus, but from what I understand it is a point to point system that runs at 1Ghz in the 2GHz dual processor machines. That is a heck of a lot of bus bandwidth. The G5 is designed to do 2, 4, or even 8 way multiprocessing without too much competition between the chips for system resources.

I agree totally that we need to see benchmarks of one dual system against another. But bear in mind that the use of dual processors depends a lot upon the specific code being run and also the operating system. I think the general problem here is that the G5, while fast, is not like 10X faster than the competition and at this point it is difficult to tell what scores where.
Re:Single Processor Mode by TheCrazyFinn · 2003-07-04 15:41 · Score: 4, Informative

well, don't forget they were testing a SINGLE 2GHz G5 (They turned off the second CPU), running G4 code. That's really equivalent to the $2400 G5 (Single 1.8GHz) than the actual performance of the Dual G5 (Which should be at least 50% higher, if not more, due to the G5's optimized SMP design, which is similar to the Athlon MP's, barring the vastly faster system bus (Point-to-point, unlike the P3's [and the G4's] shared architecture. I think the P4 Xeon might use a ptp bus, not sure).

Also, they were getting baseline tests on performance, against the G4. They also broke it down to performance per MHz, which the G5 took a huge lead in.

I suspect a dual G5, with an optimized compiler, would prove more than a match for the dual Xeon setup (That would cost significantly more, similar spec dual-xeon dell's are in the $4000 range), at least for this application (which heavily benefits from Altivec, and Altivec is still king of the SIMD world, SSE2 isn't even close in performance)

--
"You've got an invalid haircut" -Warren Zevon - Life'll Kill Ya
Re:Single Processor Mode by Have+Blue · 2003-07-04 16:42 · Score: 4, Informative

You can't "give 1 CPU the entire 1Ghz bus bandwidth". The two CPUs have independent 1Ghz connections to the system controller. They only share the 400Mhz RAM bus and the rest of the system devices.
Re:Single Processor Mode by NetCurl · 2003-07-04 18:13 · Score: 3, Insightful

Video isn't going to be affected. The calculations that put it into affect are done by the CPUs, but the CPU NEVER talks directly to the video card or the disk drive. The RAM, well, if you access it once, it's gonna be in cache, so there was a previous poster that mentioned the only time this would affect anything would be absolutely simultaneous access of the same region in RAM. That is most likely to happen with the same task or process, and the OS gives priority to the last CPU the task or process ran on (check linux sched.c for a similar implementation). Your situation, that was mod'ed +5 is likely to occur about once in a computer's lifetime....

--
It's only when we've lost everything, that we are free to do anything...
Re:Single Processor Mode by Billly+Gates · 2003-07-04 18:27 · Score: 2, Informative

Well benchmarking is a strange and controversial thing.

A benchmark will only show what is benchmarked. How many here care about how fast the cpu is in signle percentage points? Today's cpus are not as much as a bottleneck as they once were. Hardware has advanced beyond software for most apps.
What really matters is what you plan to do with it.

A disagree on the smp thing if your testing out chip performance. The reason being is that the OS and the particular benchmark have a huge influence on what the numbers will look like.

If your running a single threaded app that requires alot of cpu time, the smp benchmark would obfuscate the result and not tell you which cpu is the fastest.

For running a web server or testing i/o on the new motherboard then yes smp should be included. Alot factors change performance including which OS has the best i/o scheduler.

I use pc's as workstations for coding and browsing the web. A benchmark for me would be bootime and latency. MacOSX has been rumoured to be quite sluggish in disk latency. A faster processor certainly would not help this. My current athlonX+ takes awhile to load W2k with mysqladmin, interbase, OpenOffice, and Mozilla at startup. Would a twice as fast G5 mac help? Or would a new Serial ATA controller and drive help?

MaximumPC has bootimes in their benchmarks and I would like to see this and i/o tested on these new macs. They have hypertransport pci buses besides Serial ATA support. Sweet! This should help make the system alot faster overall and more responsive.

--
http://saveie6.com/
Re:Single Processor Mode by sadtrev · 2003-07-04 19:48 · Score: 3, Informative

This test was for large fluid dynamics computations. These involve lots of difficult sums (unsteady Naver-Stokes) on large discretised grids.
For this type of work the CPU usually runs flat out and the bottlenecks that apply to things like opening MSWord documents hardly come into play.
If it's properly written, then HDD access speed is irrelevant, and even main memory access is hardly ever the bottleneck.
This is one of those applications where the system speed is determined by the speed and efficiency of the CPU.
Re:Single Processor Mode by Andre+Breton · 2003-07-04 20:21 · Score: 2, Informative

"Personally though, I want to see how well it runs Seti@Home."
Not well, IIRC the seti folks were reluctant to do certain optimizations for the Mac. The distributed.net stuff works much better.
Re:Single Processor Mode by Anonymous Coward · 2003-07-05 02:20 · Score: 2, Insightful

Ummm, nice try butt chunk, but the G5 for $3k is DUAL PROC. Please try again, this time with a dual proc intel machine. The one you spec'd out would get trounced by the 2x2GHz G5. Give it another go.
And BTW - those pricewatch "lowest prices" things are usually a crock - they usually have like $20 shipping fees to make up for the lower prices. Please factor those in too. Thanks. Still want to do this exercise? I didn't think so...
Re:Single Processor Mode by WhiteBandit · 2003-07-05 07:51 · Score: 2, Informative

Wow, I got modded as a troll for admitting I am possibly wrong but answering a question anyway. Nice! Anyway.

You're right. However the poster didn't ask for a dual-proc system ;) Sure that would substantially increase the price though and I would concede that the Mac system would perform better. But regardless, you can still put together a similiar system for less money using a PC setup. :P

And if you did read my post, I noted that you would have to factor in shipping costs, so I'm not claiming ignorance to the prices.

Whatever though!

IHBT.
Re:Single Processor Mode by ChadN · 2003-07-05 16:36 · Score: 2, Informative

(which heavily benefits from Altivec, and Altivec is still king of the SIMD world, SSE2 isn't even close in performance.

Altivec is single-precision, SSE2 is double precision. The latter is invaluable for scientific computations of many types of matrix problems, and being wrong twice as fast is of little use.

Altivec is nice, for what it is meant for (mainly media type calculations, signal processing, etc.) But scientists will prefer SSE2.

--
"It's overkill, of course. But you can never have too much overkill." - Anonymous Slashdot Coward
Re:Single Processor Mode by afantee · 2003-07-05 23:23 · Score: 2, Interesting

>> Altivec is single-precision, SSE2 is double precision. The latter is invaluable for scientific computations of many types of matrix problems, and being wrong twice as fast is of little use. Actually, according to the author of the article, Jet3D is 99% double precision, but he was able to reformulate 10 lines of code to take advantage of AltiVec and gained 10 to 13 times in speed. SSE2 may handle double precision, but it's not a true vector processor like AltiVec, and there is absolutely no evidence that it can even double the speed.

NASA Verifies Apple Benchmarks? by Dak+RIT · 2003-07-04 09:31 · Score: 5, Interesting

Though dual processor benchmarks are not presented in detail here, it is worth noting that the G5 system benchmarked at 498 MFLOPS and 0.125 MFLOPS/MHz for scalar Jet3D performance when two processors were used.

By adding a second processor, the MFLOPS/Mhz output only dropped from 0.127 to 0.125 MFLOPS/Mhz. This chip can definitely perform in a multi-processor environment. The P4 scored 0.096 MFLOPS/MHz with a single processor.

Apple's benchmarks which were highly criticized by some, gave the Dual 2GHz Power Mac G5 a 194.5% performance advantage over a 3GHz P4 in SPECfp_rate_base2000. The G5 getting a score of 15.7, and the P4 getting an 8.07.

NASA's study found the Dual 2GHz Power Mac G5 to score 498 MFLOPS for their Jet3D performance. A P4 running at 2.66GHz scored 255 MFLOPS: a 195.3% performance advantage for the G5 in this test. If we assume a direct correlation between MHz and MFLOPS for the P4 (which would actually overstate the performance of the P4) and increase the P4's score by 12.782% this would give the 3GHz P4 a score of 287.594 MFLOPS. This is still a 173.16% performance advantage for the G5, and NASA states that a 20% increase in performance for the G5 would be reasonable "when G5-aware compiler tools become available."

So it would seem NASA's benchmarks go a long way in validating the benchmarks for the G5 that Apple released last month at the WWDC. In fact, NASA appears to be giving the G5 even better scores than Apple and Veritest did.

The vector tests that NASA performed to test the G5's AltiVec instruction set produce some even more impressive results, and would be a good indication for why the G5 outpaced the Xeon and P4 by such dramatic amounts on real world tests (at times more than 700% faster than a 3GHz P4). "The vector version of Jet3D runs an order of magnitude faster than the scalar version (speedups of 10X-13X are typical)." The dual 2GHz G5 was benchmarked at 5177 MFLOPS (a 1040% increase over the scalar test) and 1.29 MFLOPS/MHz. This also seems accurate considering Ars Technica's claim that the AltiVec engine wasn't as well integrated into the G5 as it was in the G4. The 2GHz G5 (single cpu) scored 2755 MFLOPS, or 1.378 MFLOPS/MHz, which shows a slightly larger performance hit for vector operations than floating point operations when moving to a dual G5.

Dak

Re:NASA Verifies Apple Benchmarks? by Anonymous Coward · 2003-07-04 09:37 · Score: 4, Insightful

"NASA's study found the Dual 2GHz Power Mac G5 to score 498 MFLOPS for their Jet3D performance. A P4 running at 2.66GHz scored 255 MFLOPS: a 195.3% performance advantage for the G5 in this test."

First, 498 vs 255 is 95.3% (instead of 195.3%) advantage.

Second, why compare dual fastest G5 vs single mid-range P4? Singe 2GHz G5 scored 254 MFLOPS, it quite fast, but then again equally fast 2.66GHz P4 is available at $188.

It would be much more interesting to compare dual 2GHz G5 against dual Opterons and Xeons.
Re:NASA Verifies Apple Benchmarks? by baseinfinity · 2003-07-04 10:07 · Score: 5, Insightful

Either way they're still benchmarking Apple hardware to be released in September versus Intel hardware that was available months ago. A quick look at Tejas and Athlon 64 specs will prove that both Intel and AMD have some whoopass coming down the line in Q4. At least this means Apple will be competitive still when the storm hits... but I don't think this hardware is revolutionary when compared to it's future peers.
Re:NASA Verifies Apple Benchmarks? by andreMA · 2003-07-04 10:50 · Score: 5, Informative

The 498 figure was presented strictly as an aside:
Though dual processor benchmarks are not presented in detail here, it is worth noting that the G5 system benchmarked at 498 MFLOPS...

More relevant, perhaps, are the figures in the raw MFLOPS graph:
254: PowerMac G5, 2x2GHz, (single CPU only)
255: Pentium 4, 1x2.66 GHz
Alas, difficulties in cross-platform benchmarking rear their ugly head:
Scalar Code:
G4 using Absoft F90 v8: f90 -s -O -lU77 -N11
P4 using Portland Group F90 v4.0-3: pgf90 -byteswapio -tp p7 -O1
The author did apparently make an effort to use the compiler and flags best suited for each architecture if I read this correctly....
Note that the higher level of optimization (-O2) and SSE/SSE2 options in the Portland compiler degraded Jet3D performance on the P4 system, and were therefore not used.

I don't know how much I trust NASA tho. Afterall, they only do RealMedia and WindowsMedia streaming media. Perhaps there's some bias there in favor of Windows (yes, I realize that the testbed P4 system ran Red Hat. Lighten up)
Re:NASA Verifies Apple Benchmarks? by nsrbrake · 2003-07-04 11:41 · Score: 5, Informative

Go back and read the article. You have no idea what you are saying.

1) One of the Mac's processors was disabled
2) 195.3% advantage was on an MFLOP/MHz basis

That is how they are comparing the architechture of the chip and it's performance outside of a MHz pissing race. They are in the same ballpark now MHz wise so why shouldn't they take a look at how the actual chip performs. Not to mention how much more will likely come out of the chip with maturing compilers to take advantage of the arch.

--

Bah!
Re:NASA Verifies Apple Benchmarks? by Mr.+Hankey · 2003-07-04 11:51 · Score: 4, Interesting

It would be interesting to see how it compared to an Athlon at MFLOP/MHz, which has a higher performance to clock cycle rate (and much better floating point) than the P4. I wouldn't be surprised if the Athlon outperforms the G5 at similar clock rates. The P4 is specifically designed to achieve higher clock rates at the expense of instructions per clock cycle.

Obviously a comparison against the Opteron or Itanium is not fair at this time, as they're not intended for the desktop but rather workstation audience. When the Athlon 64 comes out though, benchmarks of those vs. the G5 would be of interest as well.

--
GPL: Free as in will
Re:NASA Verifies Apple Benchmarks? by artur9 · 2003-07-04 12:31 · Score: 2, Insightful

Cost per MIP is a different metric is why.
If you read the article the benchmarker wanted to know the performance levels of the different systems (note: not CPUs).
He did say that Pentium were probably more cost effective.
PS: Where can I buy a complete system with a 2.66GHz P4 for $188? is that U.S. or some other type of dollar?

--
------- MacOS X, WebObjects, Apple (G5) hardware triply tied
Re:NASA Verifies Apple Benchmarks? by the+argonaut · 2003-07-04 13:08 · Score: 3, Interesting

Actually, both 195.3% and 95.3% are correct, depending on how it is phrased:

1.) it is 95.3% greater than...
2.) it is 195.3% of... ...or something like that.

--
fuck you.
Re:NASA Verifies Apple Benchmarks? by mentin · 2003-07-04 13:58 · Score: 4, Insightful

Either way they're still benchmarking Apple hardware to be released in September versus Intel hardware that was available months ago.
You wanted to say year ago? Month ago Intel released 3 GHz, 800 MGh FSB P4 and new chipset for dual DDR 400 motherboards. Comparing G5 against older 533 MGh FSB processors on old chipset is hypocrisy.
Even 2.6 GHz 800 MGh FSU processor with i875 chipset-based motherboard and dual DDR 400 memory behaves much better than that 2.66 GHz processor.

--
MSDOS: 20+ years without remote hole in the default install
Re:NASA Verifies Apple Benchmarks? by gTsiros · 2003-07-04 20:32 · Score: 2, Funny

What's MGh?

Me-Gaga-hertz?

--
Looking for people to chat about multicopters, coding, music. skype: gtsiros
Re:NASA Verifies Apple Benchmarks? by John+Harrison · 2003-07-05 01:23 · Score: 2, Informative

This is MGH.

--
Lasers Controlled Games!

Summary by LordOfYourPants · 2003-07-04 09:33 · Score: 3, Insightful

I love the wording:

"Benchmarks from the scalar version of Jet3D are shown in Figure 1 (MFLOPS) and Figure 2 (MFLOPS normalized by MHz). In terms of raw MFLOPS, the 2GHz G5 is about 32% faster than the 2GHz P4, 97% faster than the 1.25GHz G4, 142% faster than the 1GHz G4, and within 1 MFLOP of the 2.66GHz P4."

Translation: Slower than the P4 for anyone who didn't look at the grid. And M stands for million. Not one.

Re:Summary by Anonymous Coward · 2003-07-04 09:43 · Score: 3, Informative

Translation: Slower than the P4 for anyone who didn't look at the grid.

Real Translation: 0.4% slower, at 75% of the clock speed.
Re:Summary by dbrutus · 2003-07-04 12:11 · Score: 2, Interesting

One thing to remember is that the scaling problems that plagued the G4 where a Motorola problem. IBM's G3's were clocked so fast they had to be kept out of Apple's line because they threatened to outrace the G4s. Also, you might not be aware but IBM was originally scheduled to be at 1.8Ghz not 2.0 Ghz. that's about 10% faster than originally projected. A 10% faster scaling G5 chip would be 3.3Ghz when Intel is at 3.6Ghz according to their roadmaps assuming current trends continue (Intel on schedule, IBM exceeding expectations).

Like you said, time will tell.

Curious... by Realistic_Dragon · 2003-07-04 09:35 · Score: 2, Funny

I wonder why none of the NASA boxes were running Windows? I mean, if it's an all sinigng all dancing solution to everything...

(It's ok, you can mod me -1 Troll now. I'm just bitter about an edict on a project I'm working on.)

--
Beep beep.

MFLOPS/MHz? No AMD, Old P4, Old Redhat. by jbridges · 2003-07-04 09:36 · Score: 3, Interesting

What the heck is that a benchmark of? Divide by the Mhz to create a higher number on their nice little graph for the G5?

Why a Pentium 4 2.66mhz?

Why no Athlon?

Why no Opteron?

Why an old old version of RedHat 7.1?

and so on....

Portland compiler by PineGreen · 2003-07-04 09:38 · Score: 2, Interesting

I have very bad experience with Portland compiler; I wonder what the results would have been if they had used Intel Fortran Compiler....

fortran compiler by mz001b · 2003-07-04 09:38 · Score: 4, Informative

It is interesting to note that they used the Portland group compiler instead of the intel compiler. For the CFD code that I work on (which is mostly Fortran), the Intel compiler produces much faster code than the Portland group compiler (as much as 50%).

Re:fortran compiler by Jah-Wren+Ryel · 2003-07-04 10:31 · Score: 2, Insightful

Furthermore, check out this footnote from the paper:

"Note that the higher level of optimization (-O2) and SSE/SSE2 options in the Portland compiler degraded Jet3D performance on the P4 system, and were therefore not used."

Thus they are comparing the FP performance of the old x87 style, stack-based interface on the P4 - not to mention they are just using minimal optimization. When they start talking about how great vector peformance is, they need to be using SSE2, which is both modern and far more efficient than x87, plus SSE2 is essentially an implementation of a short-length vector processor, somewhat like altivec is on the G4/G5 chips.

This "report" sure smacks of the same bogosity that the original Apple benchmarks did - essentially hamstringing the competition before making a comparison. Since you can't *buy* a dual G5 system today, one must assume that Apple gave it to NASA and you can bet that early access comes with an NDA. So, Apple must have either approved or at least been involved in the generation of these results. Essentially making them just as biased as Apple's own.

--
When information is power, privacy is freedom.
Re:fortran compiler by wagnerer · 2003-07-04 11:53 · Score: 5, Insightful

You'll want to be careful with the code the intel compiler spits out. It may be fast but some scientific codes compiled with it give incorrect answers. Compile the same code on any other f90 compiler and it gives the correct answer. Not something to inspire confience in your answers.

Interesting choice of processors by binaryDigit · 2003-07-04 09:39 · Score: 4, Interesting

If there budget is such that dualie 2Ghz G5's are a possibility, then it's somewhat surprising that they A) used such low powered P4's B) that they didn't include Itanic2 systems. Seems that their report really just pointed out stuff that we already knew, the PPC is typcially faster per mhz than the P4 (hell, just about anything is better per mhz than the P4). Interesting to note that on the vector test, the G5 outperformed the G4 is a fashion that is almost purely based on the increase in Mhz (i.e. other system improvements didn't really seem to help much). Compiler perhaps, though some of the architectural improvements would seem to be not dependant on that?

Re:Interesting choice of processors by Phroggy · 2003-07-04 10:00 · Score: 3, Informative

If there budget is such that dualie 2Ghz G5's are a possibility...

Budget had nothing to do with it; the PowerMac G5 isn't shipping yet. NASA had to have obtained theirs through a special arrangement with Apple.

--
$x='S24;r)>63/* h@<5+oZ)32"5cz';$me='phroggy'x$];
$x=~y+ -xz+\0-Tx+;print$_^chop$me for split'',$x;
Re:Interesting choice of processors by markomarko · 2003-07-04 10:40 · Score: 2, Insightful

Certain NASA labs use a large amount of Apple hardware, and I'm sure the machine is a demo unit. We may get a demo here at the UofA ECE dept, too. ie. the machine wasn't purchased.

really, do you think NASA's in on some conspiracy to hype Apple hardware? Good grief. I couldn't imagine any business, but a benchmarking business, approving an order for a bunch of new hardware solely for the purpose of conducting some mythical-the-truth-is-out-there, guaranteed, unbiased benchmark. There's nothing interesting about the choice of processors. It's what was lying around when the demo came in, fool.

G5 is really a full-blown workstation by reporter · 2003-07-04 09:40 · Score: 4, Interesting

The G5 by Apple is really a workstation but has been mis-labeled a "PC". The floating-point performance of the G5 crushes any workstation by Sun. In fact, the heart of the G5 is a Power4 server-based microprocessor. At "SPEC", you can easily find the performance of a Power4 @ 1.45 GHz. Its SPEC2000 rating for floating point is 1097. When you scale that to the 2.0 GHz processor in the G5, you conclude that it has a SPEC score of about 1500.

Apple has just created a new market for itself among the hardcore engineers who use workstations for numerical simulations like HSPICE, etc. Steve Jobs lucked out -- again.

By the way, the bell tolls. It tolls ominously for Sun Microsystems.

Re:G5 is really a full-blown workstation by Phroggy · 2003-07-04 10:03 · Score: 4, Interesting

Steve Jobs lucked out -- again.

Um, yeah, sure is lucky Apple found the G5. I'm sure they had nothing to do with its development. It's not like Apple has been involved with development of the whole PowerPC architecture since the early 90s.

--
$x='S24;r)>63/* h@<5+oZ)32"5cz';$me='phroggy'x$];
$x=~y+ -xz+\0-Tx+;print$_^chop$me for split'',$x;
Re:G5 is really a full-blown workstation by afidel · 2003-07-04 10:31 · Score: 2, Informative

The Power4 has 128MB of L3 cache, to get comparable performance to a Power4 the G5 needs considerably higher clock speeds. Also Sun hasn't been competitive in the single and dual CPU workstation market for some time. The only things they had going for them were large memory support and large CPU scalability, now everyone is getting large memory support with the transition to 64bit so they only have large cpu counts to fall back on.

--
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
Re:G5 is really a full-blown workstation by Josuah · 2003-07-04 18:29 · Score: 2, Informative

About as many as are optimised for "OS X SMP". IOW, not many.

Mac OS X automatically splits execution threads among multiple CPUs. Even something as basic as a progress bar or a network service daemon will run in separate threads. The Mach microkernel makes heavy use of threads so the basic OS itself will experience noticeable improvements in message passing between tasks (e.g. crossing the kernel boundary), especially given the dedicated CPU buses.

Also, there are a whole bunch of tasks running at any given time. I've got 64 right now on my Beige G3. Each of those tasks is running one or more threads (most only one). But if iTunes can run on one CPU while my compiler runs on the other, that's going to be a big performance gain. Not to mention how the new XCode is going to benefit from SMP.

Windows is probably in the same boat as far as distributing threads, but Mac OS X makes heavier use of threads/tasks (at least, we think so because Windows is too proprietary for us to know for sure). Plus there are a bunch of very important applications where SMP really matters, e.g. Adobe products, scientific research.

Only benchmarks that matter are.... by FooGoo · 2003-07-04 09:43 · Score: 5, Insightful

The only benchmarks that matter is my impression of the system while using the apps I use. Everything else is opinion.

--
People who bite the hand that feeds them usually lick the boot that kicks them

Costs by mgkimsal2 · 2003-07-04 09:44 · Score: 3, Insightful

Something which seems to get lost in the Mac/PC debates *sometimes* is the cost factor. I looked those graphs and thought "Wow - mac is faster at this benchmark". Then I looked up pricing - minimum I can get that mac for would be $1999. An equivalent PC system with the P4 2.66ghz is probably under $900 (didn't spec it out entirely, just did a rough lookup on Dell). Great - Mac is faster. But I can apparently get within reasonable range on PC hardware for probably 50% less cost.

I'd read some thread a while back on another board saying that "Macs are cheaper than PCs". I still can't believe anyone would make that argument. Doesn't being really good in a few areas satisfy the mac people? Do they have to try to spin higher costs as 'lower' (craziest thing I'd ever heard...)

--
creation science book

Re:Costs by nordicfrost · 2003-07-04 10:01 · Score: 4, Insightful

Mac is cheaper than PCs, even in laptops. It all depends on what you'll use it for. I'm trying to convince a pal that the iBook would be a better purchase for his mother than a Fujitsu Siemens PC. The iBook is rougly 9500 NOK and the Siemens 11500 NOK.

The difference in speed mHz, RAM etc. is irrelevant for a person just getting used to computers and the net. When she is having weekly problems with the Siemens / Windows machine, it will be lost money in time. While the Mac is cheaper in usage, because of less "frustration time" and less hassle.

This is an argument I would strongly disagree with, if you asked me two years ago. But since then, I have come to the conclusion that the Mac simply work better for the lay people. It does the work, and faster because there are less frustrations and less hassle.
Re:Costs by MalleusEBHC · 2003-07-04 10:20 · Score: 4, Interesting

I'd read some thread a while back on another board saying that "Macs are cheaper than PCs". I still can't believe anyone would make that argument.

I'll make that argument any day of the week if you want to consider TCO. My family got a Powermac G4 in 1999, and it is still the daily use computer for them. (I have my own Cube, which is basically the same for performance comparisons.) That thing still does everything that they can ask of it and then some. Hell, it can still play all the games that I want to play, save UT2k3. The great part is that it is still humming along perfectly, and I don't see any reason why it won't last two or three more years. Find me a PC that you will still be using daily 6 or 7 years later.

This doesn't even take into account all the time and headaches that have been saved from using a Mac. Taking out the "Did you accidentally kick the power cord out?" type phone calls I've gotten to help them, I can think of maybe twice that they have had to call me and troubleshoot. There is no pricetag on this peace of mind.
Re:Costs by autopr0n · 2003-07-04 10:56 · Score: 2, Insightful

Didja notice that overall the G4s were faster, too? Therefore the price of entry for a slower machine if $799, and it includes way more than that $900 PC.

Are you honestly claming that 129 is a higher number then 255? (fastest g4 to the fastest p4) Yeah, the g4 had a slightly higher flop/cycle score (that would have been much lower using the intel fortran compiler) but the p4's over all score was much higher.

When comparing the price of a 32bit system to that of a 64bit, it's a bit pointless...as you're getting so much more with the 64bit chip.

Um, no, you're not. For one thing these benchmarks were all done with floating point numbers, the Intel architecture has supported 64 bit floats since the 287. A 64 bit CPU won't have much of an advantage over a 32 bit CPU at all. Secondly, a 64 bit CPU can't do anything that a 32 bit CPU can't. It just needs to take more cycles. But that only happens in the rare case when you have to deal with a non floating point number which is larger then 2^32, or ~4.3 billion. That doesn't happen very often.

--
autopr0n is like, down and stuff.
Re:Costs by Anonymous Coward · 2003-07-04 11:16 · Score: 2, Insightful

IMNSHO While the initial purchase cost of the Mac is marginally more expensive for a top end machine

Dual 2GHz PowerPC G5
2GB DDR400 SDRAM (PC3200) - 2x1GB
250GB Serial ATA - 7200rpm
ATI Radeon 9800 Pro
Apple Cinema Display (20" flat panel)
SuperDrive (DVD-R/CD-RW)
Apple Keyboard & Apple Mouse - U.S. English
Mac OS X - U.S. English
APP for Power Mac (w/ or w/o display) - Enrollment Kit
$5693.00

versus

Dell Dimension XPS Series
Pentium® 4 Processor at 3.2GHz with 800MHz front side busPrice: $4,268.00
SAVE $190! (Savings included in price) 2GB DDR SDRAM at 400MHz
Dell ® Quietkey ® Keyboard
SAVE $160! (savings included in price) 20.1 in Digital Flat Panel Display
New 128MB DDR ATI RADEON(TM) 9800 Graphics Card
200GB Ultra ATA/100 Hard Drive (7200 RPM) with DataBurst Cache(TM)
3.5 in Floppy Drive
Microsoft® Windows® XP Professional
Dell(TM) Optical USB Mouse
New Dell Gigabit Ethernet
16 Max DVD-ROM Drive
Sound Blaster Audigy 2(TM) sound card with DVD Audio
SAVE $60 (system price shown before rebate) 4 Yr Ltd Warr plus 4 Yr At-Home
Dell Movie Studio Plus with Roxio VideoWave Movie Creator(TM)
FREE UPGRADE! New 4x DVD+RW/+R Drive w/CD-RW

If we first add back in all of the "free" stuff DELL is giving by ordering online....$190 + $220 +$170 = $580 +$4268 = $4848 which is a more realistic total considering that I doubt NASA does their ordering online.

Now if we compare the 2 prices ($4848.00 DELL and $5693.00 APPLE), unless I am doing math poorly these days, we see a 15% savings by purchasing the DELL system.

HOWEVER....
What always seems to be forgotten is the cost of SUPPORTING these machines. The support costs is what makes the Mac shine when it comes to TCO. I cannot tell you how many times I have seen IT departments in large corporations that have 10+ Windows support people in Department X that are ALWAYS busy running around fixing Windows machines, compared to their Mac support group (generally about 1/2 or less the people) with an equal size department to support.

The support costs are where the Mac just totals Windows boxes over the useful life of the machine....

What most people FAIL to realize is that the upfront costs, != the total cost
Re:Costs by benh57 · 2003-07-04 12:18 · Score: 2, Informative

No, autopr0n - the G4 was not slower, in fact it was faster than the P4 by a factor of 10 in the vector benchmarks (scroll down).

The G5 by Eric(b0mb)Dennis · 2003-07-04 09:44 · Score: 3, Insightful

The G5 really is a powerful machine, and i wouldn't mind owning one... if only the price for maintaining a mac wasn't so high...

Their scheme for OS X is the equivalent of Microsoft charging $100+ for a service pack, I just don't understand it.

I've used OS X, and it blows everything else out of the water in elequency and it seems the perfect balance between productive and 'cool factor'

But until I win the lottery, I'll stick with my cheap x86 machines

--
Excuse me, I don't mean to impose, but I am the ocean

Re:The G5 by Dak+RIT · 2003-07-04 09:53 · Score: 5, Insightful

Microsoft Windows XP Pro: Full price: $299
Microsoft Windows XP Pro Upgrapde: $199
http://shop.microsoft.com/Referral/Productinfo.asp ?siteID=10798
MacOS X 10.3/2/1 Full price: $129
http://www.apple.com/macosx/
Microsoft Windows XP Pro (5 Users): $1315.60
MacOS X 10.3/2/1 (5 Users): $199
If you bought Windows XP ($299), and then can upgrapde to Longhorn for $199, you paid $498. If you bought MacOS X 10.1, 10.2, 10.3, and 10.4, you paid $516. Pretty similar, and that's assuming you only have to pay $199 for Longhorn. In the meantime, Apple users enjoy continued advance, while Windows stagnates for 4+ years.
Do the same with a family licence of 5. Buy Windows XP for $1315.60, then upgrade for $875.60: $2191.20 (over 4 years, for 5 people: $109.56/user/year).
Buy MacOS 10.1, 10.2, 10.3, 10.4 (5 User Licence): $796 (over 4 years, for 5 people: $39.80/user/year).
Using http://shopper.cnet.com I found a copy of Windows XP Pro for $207, and an upgrade for Windows XP Pro for $177. I found a copy of MacOS X 10.2 for $98.
If these prices hold over to the newer Operating Systems these companies release, then Windows would cost $384 (23% savings), and MacOS X would cost $196 (24% savings). If you bought every point upgrade Apple released it would cost $392.
Dak
Re:The G5 by dbrutus · 2003-07-04 13:03 · Score: 3, Interesting

Microsoft does this every time they want to juice sales figures in a disappointing product. They even condone piracy by telling their distributors to stop checking for student IDs when selling educational versions of Office apps (that's straight from an interview with the Office division VP).

You might have been able to do it, but you also might have just fed into the MS myth of "everybody pirates our stuff".
Re:The G5 by dbrutus · 2003-07-04 13:15 · Score: 2, Informative

The price on the client side is generally the same until you add in the server side. Then clients get more expensive. An Apple server generally is CAL free for $999 (you *can* get a 10 client version for half that). A Windows server serving file and print makes you get a CAL for each machine that accesses it. If you have a server application like Exchange on the box, you need a second CAL (priced differently) to access that program. Each server has their own CAL and the money keeps rolling into MS.
Re:The G5 by SoTuA · 2003-07-04 13:30 · Score: 2, Interesting

The problem is, people tend to see Apple computers and cheap DIY x86 computers as equivalent. I wouldn't sneer at a guy that has a 2003 BMW because his car cost 10x more than my '99 Hyundai. I would think "damn it's nice to be able to afford a luxury car!". Likewise, it would be nice to be able to buy a luxury computer.
Apple offers top-notch hardware and a great OS, and is priced with that in mind. My el cheapo homemade computer has "somewhat decent" hardware, and a great OS that is a bit "not all that" when it comes to desktop (debian unstable).

*I* for one wouldn't mind owning one of those G5's. :)

Re:Costs - correction by mgkimsal2 · 2003-07-04 09:47 · Score: 4, Informative

$2999 for the mac 2x2ghz

--
creation science book

Re:MFLOPS/MHz? No AMD, Old P4, Old Redhat. by MBCook · 2003-07-04 09:53 · Score: 2, Insightful

Why a P4 2.66? That's probably what they had that was closest in clockspeed to 2ghz, maybe while still using the newest core. Plus this is still pretty close to 3ghz, so you know the P4 scores wouldn't double if you used one or anything.

Why no Athlon? They probably didn't have one and with the P4 at 3ghz and climbing, the old althon is becomming less and less significant for these pure number crunching apps. Plus maybe they've done previous tests that said that P4 they uesd was faster than an equivelent athlon, so it didn't need to be tested.

Why no Opteron? They probably didn't have one. This is the most valid question you ask.

Why RH 7.1? That's probably what they use. They are benchmarking PURE CPU so the OS doesn't matter too much for this kind of thing.

--
Comment forecast: Bits of genius surrounded by a sea of mediocrity.

Wrong. by Tokerat · 2003-07-04 09:54 · Score: 3, Insightful

Their scheme for OS X is the equivalent of Microsoft charging $100+ for a service pack, I just don't understand it.

Apple regularly issues Security Updates, Bug Fixes, and Mac OS X Updatres through the Software Update system. This, the equivilant of a Service Pack, is free.

The major OS version updates are when new features are added, etc. That is the equivelant of upgrading 98 to XP. The cost of buying a Mac is high. The cost of maint. is probably less than a Windows box.

--
CAn'T CompreHend SARcaSm?

Re:Wrong. by dmarcoot · 2003-07-04 11:10 · Score: 2, Interesting

also, to be fair, get your facts straight. the browser, chat program and media players, Apples equivalents (safari, ichat, quicktime) have are and have always been FREE and backwards compatible for all versions of the apple os x.
furthermore, you can add on other FREE apps such as imovie, itunes, idvd, iphoto, ical. all are excellent and in some cases have no PC equivalent of similar high quality at the same cost, FREE. nor are any of them (besides QT) built into the OS, your free to delete any of them with no harm to your system.

That policy has the added advantage for the users that they rarely have to upgrade their OS to get new apps, in contrast to the situation on windows were the os is locked out if you dont call Microsoft. to register it.

MFLOPS/Mhz. - Useless Metric by Cordath · 2003-07-04 09:55 · Score: 4, Insightful

It was interesting to see this paper devote so much effort to the completely useless metric of MFLOPS/MHz. This measurement has absolutely nothing to do with performance, but rather, with the approach taken by the chip manufacturer. You can do more in one clock cycle, as AMD historitcally has done, or less, but optimize for faster clock speeds, as Intel has in recent flavors of the Pentium.

One might be tempted to design a chip that does more in one cycle and then clock it as fast as a chip that does less in one cycle. Unfortunately, while reality is a little more complex than this, the basic reason is that the more a chip does per cycle, the more heat it generates per cycle. If you try to squeeze too many cycles through it in a second it will fry.

So showing that the G5 has better performance per clock cycles is no more useful than showing that an AMD chip has better performance per clock cycle than an Intel chip. All that matters is how much performance you can get from a chip before it cannot be clocked any faster without requiring unreasonable cooling methods.

All this paper shows is that, while the G5 is designed to do more in a clock cycle than a P4 is, the chip tested is ultimately not any faster than the P4 they benchmarked it against. It remains to be seen how the G5 will do at higher clock speeds. With this in mind, it would be *far* more useful to see heat dissipation stats on the G5 since that might give us some idea how close to it's design limits. If it is cranking out high-end P4 performance and running cool *then* I will be impressed.

Why No Cray? by Detritus · 2003-07-04 09:55 · Score: 4, Insightful

Maybe because that was the hardware and software available to the testers. Contrary to popular belief, government employees do not have unlimited budgets to buy stuff. The last time I worked in a government office, some of the furniture was older than I was and my PC was built from scrounged parts.

--
Mea navis aericumbens anguillis abundat

Re:Just to add to the real world translation by Anonymous Coward · 2003-07-04 09:55 · Score: 2, Funny

I guess this makes me the 10th fan...oh wait...

Re:Linux is Dying by WuWarrior · 2003-07-04 09:57 · Score: 2, Insightful

Linux is almost as popular as Windows 95! Seriously, 1% of that pie is probably tens of millions of people.

Interesting Thought by ciroknight · 2003-07-04 09:57 · Score: 5, Interesting

Apple has been making better software for years, everyone agrees; They just never had the hardware to back them up. Then every time they do crop up with better hardware, everyone criticizes them and says that it's just not possible, PC hardware is always better they say. But now they've proven you wrong... TWICE, and some trolls STILL don't believe them. It's a sad world. I just wish Apple would open up at least their motherboards a little more, make Macs more customizable, more like PC's so they can start dominating again.

There's also one benchmark I'de love to see. Power Mac G5 vs Sun UltraSPARC III. It's fair: they're both 64-bit procs, and it would really make people look at it in businesses that only look at supercomputers as viable. Then maybe people would start giving Apple and IBM some credit.

My 2 cents (Canadian). Thanks.

--
"Victory means exit strategy, and it's important for the President to explain to us what the exit strategy is." G.W.Bush

Re:Interesting Thought by vi-rocks · 2003-07-04 11:34 · Score: 2, Insightful

Who modded this crap up!!!
Summary... The 2GHz G5 is *very* slightly slower than the older 2.66GHz. (3GHz machines are out now). As far as the second bar graph goes who gives a crap about how fast something is per MHz? Seriously moderators... read the article... that goes for you too ciroknight. You Apple fanboys are ridiculous sometimes.
The DUAL 2GHz G5s peform much better than the single 3GHz. I guess most people are too busy holding onto their desks while Bill is ramming them up that ass to pay attention to the article.

Frankly, us Mac "fanboys" could care less if someone could pick up a $499 4 Ghz P4. The people touting these systems are driving rusted out Ford Escorts with the bumpers held on by duct tape (or drooling over the Ford Escorts while looking out of the windows of the loser-cruiser)

Seriously now, Mac/OSX fanboys like their computers. They don't care if it is the absolute fastest or the most/least expensive. The hardware has some style and the OS simply rocks.

Re:Turn the optimizations on first. by Phroggy · 2003-07-04 09:58 · Score: 5, Informative

I hope they didn't use gcc (the yet-another free and hopeless compiler).

It should be noted that Apple uses gcc to compile Mac OS X and most of their applications, so it would be appropriate to use gcc on the G5. Intel's compiler might be a more appropriate choice for the Xeon.

--
$x='S24;r)>63/* h@<5+oZ)32"5cz';$me='phroggy'x$];
$x=~y+ -xz+\0-Tx+;print$_^chop$me for split'',$x;

MFLOPS per $ by mgkimsal2 · 2003-07-04 10:02 · Score: 5, Interesting

Why not show an mflops/$ chart? Related to my 'cost' post as well, but I felt it deserved its own post. :)

This seems to confirm my belief that most mac people don't buy their own hardware, but get it through work or school.

--
creation science book

Re:MFLOPS per $ by Trurl's+Machine · 2003-07-04 11:13 · Score: 2, Interesting

Why not show an mflops/$ chart?

Why not show a "just works"/$ chart?

Vector Performance by Japer+Lamar+Crabb · 2003-07-04 10:08 · Score: 5, Insightful

>>Vector performance of the G5 remains excellent, and is inline with current G4 systems on a per clock cycle >>basis. As a result, raw vector performance of the G5 will be boosted simply by its higher clock speeds relative >>to current G4 systems.

This would seem to be one of the more interesting points made, actually. Prior to the announcement of the G5s, speculation on the PPC 970 suggested that it would be stellar with FP & so-so with integer; the real question surrounded how well IBM would implement SIMD. Many were pessimistic. Given that it seems like they've managed to add it efficiently a scaled-down POWER4 core, future refinements could make this series of chips (PPC 9X0s) real monsters.

But the future viability of that roadmap (given how ruthless the company as a whole tends to be when faced with departmental money losses) depends as much upon the success of IBM's Linux strategy as it does on its success in the PowerMac line.

[With apologies to BadAndy of the Ars Technica boards; thanks for sharing your insights.]

--
Habit is the ballast that chains the dog to his vomit - Samuel Beckett, "Proust"

In all fairness... by Znonymous+Coward · 2003-07-04 10:09 · Score: 5, Insightful

It's not Apple's G5, it's IBM's 970 and it's the shizzle.

--

Karma: The shiznight, mostly because I am the Drizzle.

Re:In all fairness... by dynayellow · 2003-07-04 11:40 · Score: 2, Funny

From Crazy Apple Rumors:

Apple Rocked By "G5" Scandal.
Just one week after the "G5"'s introduction, Apple has been hit by shocking allegations that the name of the chip powering the computer is nothing more than a tawdry attempt at huckstering, worthy of the lowest flim-flam artist.
According to highly placed industry sources, the "G5" is actually the IBM PowerPC 970, and Apple has been using the "G5" name simply to sell more computers.
Through a systematic application of underhanded techniques known as "marketing," Apple attempted to convince customers that the "G5" was newer and a step above its previous computers.
"Apple has plaster the 'G5' moniker all over its promotional materials," said InfoWorld's Tom Yager. "And it's all a lie. They made that name up."

If I remember right... by LordOfYourPants · 2003-07-04 10:14 · Score: 3, Informative

If one thing is 80 dB and one is 90 dB, the second object is twice as "loud." Each 10 dB jump either doubles of halves "loudness." ie: If you're at 1000 dB vs 1010 dB, the 2nd object is twice as loud.

So, based on what was said at the keynote (and my interpretation), the G5s are 10dB quieter. Twice as quiet sounds more impressive. Note that saying "half as loud" still implies "loud" so psychologically it's not as impressive.

If I'm wrong, I'm sure someone will jump on me soon enough. I'm holding on tight.

Re:If I remember right... by Filarion · 2003-07-04 11:09 · Score: 5, Informative

actually you only need 3 db to double the volume (which, btw, has little to do with loudness). and 1000 dBa is, I hope, impossible.

--
--[Nothing important]--
Re:If I remember right... by Mac+Degger · 2003-07-04 11:23 · Score: 2, Interesting

Uh...that's +3 dB for every doubling of sound energy.

--
-- Waht? Tehr's a preveiw buottn?

NASA never benchmarked the G5! by Anonymous Coward · 2003-07-04 10:14 · Score: 5, Funny

It was a hardware simulator running on a soundstage.

Re:MFLOPS/MHz? No AMD, Old P4, Old Redhat. by Jeffrey+Baker · 2003-07-04 10:15 · Score: 3, Informative

I think it's pretty obvious why they tested the G5: their Altivec program is 13X faster than their scalar program. They don't mention the SSE2 so I assume they have an investment in Altivec programs. Therefore they would naturally be interested in comparing the G5 versus the XServe and G4. Until Intel releases the 34.5GHz P4 (13X 2.66GHz), there doesn't seem to be any reason to run out and buy a latest P4 just for this comparison.

And surely the version of RH Linux hardly matters. Maybe they benchmarked using this OS because (shock, horror) it is what they use daily.

OS X 10.2.7 by Greedo · 2003-07-04 10:15 · Score: 4, Interesting

I was RTFA, and this caught my eye:

Additional Notes: The G5 system was running Mac OS X 10.2.7 and ...

I'm only running 10.2.6, and Software Update says nothing new is available. What's up with that?

--
Tuus crepidae innexilis sunt.

Re:OS X 10.2.7 by Trurl's+Machine · 2003-07-04 11:08 · Score: 3, Informative

I'm only running 10.2.6, and Software Update says nothing new is available.

MacOS X 10.2.7 - codename Smeagol - is a stop-gap solution to provide JUST ANY working OS to the G5's until Panther is ready for prime time. Your Sofware Update is right not to install it on your machine, as most probably it is not a G5, sir.
Re:OS X 10.2.7 by OgGreeb · 2003-07-04 14:47 · Score: 2, Funny

You probably noticed you weren't running on a G5 either. I'm confident that if you can score a G5 system this early, getting 10.2.7 to go with it isn't much of a challenge.

--
-- Gary Goldberg KA3ZYW 301/249-6501 AIM:OgGreeb Digital Marketing Inc., Bowie, MD //www.digimark.net/

5177 MFLOPS 288 MFLOPS by vitaboy · 2003-07-04 10:22 · Score: 5, Insightful

This is the killer statement in the whole article:

"The vector version of Jet3D runs an order of magnitude faster than the scalar version (speedups of 10X-13X are typical)." The dual 2GHz G5 was benchmarked at 5177 MFLOPS (a 1040% increase over the scalar test) and 1.29 MFLOPS/MHz."

5177 MFLOPS when running a Velocity Engine optimized version of Jet3D.

Now, how much does an P4 extrapolated to 3.2 GHz get? Like 288 MFLOPS?

Someone please explain to me how 5177 MFLOPS and ~300 MFLOPS are even comparable.

As the Mathematica guy said, the competition is no longer high-end PCs, it's now $10,000 UNIX workstations...and the G5 is still faster than any of them.

No wonder the G5s smoke the dual Xeon in the Photoshop, Mathematica, Logic, and Luxology app bake-off. All these apps would have been optimized to use the Velocity Engine.

If I were a scientist doing lots of image processing and vector calculations, I'd need a cluster of about 18 or so 3.2 GHz P4 machines to keep up with the dual 2 GHz G5 PowerMac running a typical Velocity Engine optimized app.

That's a sweet 5177 MFLOPS for you - evidence the G5s rock as hard as Apple has been indicating.

Re:I'll wait for a real comparison. by JonathanBoyd · 2003-07-04 10:35 · Score: 4, Informative

Hyperthreading isn't a magic double-the-speed-of-your-processor feature. In fact, ti can slow a computer down. What it is nice for is for running multiple threads or programs more efficiently.

Re:I'll wait for a real comparison. by autopr0n · 2003-07-04 10:35 · Score: 2, Informative

I'll be more impressed if the folks at the Langley Research Center compared the Apple Power Macintosh G5 with the 2.2 GHz PowerPC 970 CPU against a system running the Pentium 4 3.2 GHz CPU (which has Hyper threading instruction registers to have almost dual-CPU performance).

Hyper threading does not give you the performance of a multi-processor setup. Hyper threading speeds things up when you have lots of independent threads. Lets say You get into a situation where you have, say, a cache miss and the CPU has to wait like 100 cycles to read crap out of ram. With HT, the CPU can use those cycles to run programs running in other threads.

It's like having two CPUs, but only one can run at a time, so while one waits, the other runs.

It just lets the chip come closer to it's theoretical maximum speed.

--
autopr0n is like, down and stuff.

SETI by elliotj · 2003-07-04 10:35 · Score: 4, Funny

Personally though, I want to see how well it runs Seti@Home

My bet is you still won't find any signs of Alien life. So it won't be any better than my old crappy ass P1 166.

But good luck to ya.

Re:SETI by I+Like+Swords!!! · 2003-07-04 15:53 · Score: 5, Funny

Yes, but just think of how much MORE static you can have SETI run computations on... It's like watching a blank tape in fast-play.

--
.unsigged

Because ... by Mooncaller · 2003-07-04 10:37 · Score: 3, Interesting

Redhat 7.1 and 7.2 are the standard versions used by industry and research. They are concidered more stable then newer releases. Most GNU/Linux hosting in the US uses RH7.2. It works and does not have some of the issues of OSs that use the kernel of pain. I do all of my development to target RH7.1. After the 2.6 kernel gains some maturity, I will probably change this. BTW, the 2.4 kernel realy has little to offer in the way of improving performance for the type of application NASA is intersted in. The same will not be true of the 2.6 series as can be seen in the current 2.5 tree.

Damn Dude, RTFA by Mindcry · 2003-07-04 10:44 · Score: 4, Interesting

The 498 MFLOPS figure was WITH 2 G5s!!!!
With a single G5, the 2ghz got a 254, and the 2.66ghz P4 got 255 MFLOPs...
Please read the article more clearly, this DOES NOT IN ANY WAY validate apple's earlier claims... here's the quote that was misread

"Though dual processor benchmarks are not presented in detail here, it is worth noting that the G5 system benchmarked at 498 MFLOPS and 0.125 MFLOPS/MHz for scalar Jet3D performance when two processors were used."

Followed by a chart showing the P4 2.66ghz with 255MFLOPS at the top and a G5 2ghz with 254MFLOPS at the bottom...

So you could guess that a dual 2.66ghz would get about 499-500MFLOPS which would be a 0% performance advantage to the G5, and the P4 3.2ghz would be even faster...

Re:Damn Dude, RTFA by dhogaza · 2003-07-04 11:07 · Score: 5, Insightful

Except that

1. They weren't using a compiler optimized for the G5, and expect performance to increase when they have that opportunity.

2. Dual G5s appear to scale better than dual P4s. They're getting close to 2x performance with the dual G5s, much better than most folks are used to with SMP systems.
Re:Damn Dude, RTFA by ChadN · 2003-07-06 00:37 · Score: 2, Informative

Also, even with HyperThreading, the chips do NOT have 2 separate cores (although I suppose that could be done, in principle). It just allows software to think of them as two cores, and to schedule more than one process at a time.

The thing is, this usually works best when you have two separate types of task running concurrently on the processor, because they can (hopeully) take advantage of more execution units at one time. But, if you run two of the same scientific number crunching programs, on the hyper-threaded processor, they will both be competing for the floating point, and other, execution units at the same time. That competition has to be serialized by the chip, and the result may well be slower than if you just ran one process on the chip.

So, for this kind of computation, the hyper-threading may not be such a win (and it is probably marketed more for things like web-servers, where you might have a database program, and web content delivery program, both executing, and they may parallelize with each other a bit.

--
"It's overkill, of course. But you can never have too much overkill." - Anonymous Slashdot Coward

Interesting by Skyleth · 2003-07-04 10:46 · Score: 2, Interesting

If this was done by NASA, why isn't the report on a NASA web server? (as opposed to some guy's personal homepage) kinda limits the creditability of this whole thing...

It's spelled A-M-D by autopr0n · 2003-07-04 10:48 · Score: 2, Interesting

You can easly build a dual Athlon system that could trounce an equivilant g5 for less cost. Xeon's arnt the only MP capable x86 chips out there...

--
autopr0n is like, down and stuff.

Another message from the Benchmark author by jbridges · 2003-07-04 10:49 · Score: 4, Informative

Found this from last Jan:

Date: Mon, 13 Jan 2003 23:29:38 -0500
From: Craig Hunter
Subject: G4 vs. P4 performance

I have been following the discussion of Rob Galbraith's benchmarks with much interest, as I have spent a good deal of time testing, optimizing, and benchmarking software for the G4 (OS X) and P4 (Linux).

The first thing to realize is that there are numerous benchmarks that show the P4 is faster, and there are numerous benchmarks that show the G4 is faster. What matters? Well, probably the benchmarks that apply to the kind of work you do. For people doing photo processing with the software Rob tested, his results are extremely relevant. But, someone working with a program optimized for AltiVec and dual processors might have a completely opposite experience.

Just to give an example of a benchmark that goes the other way, see this chart.

(You're welcome to mirror this benchmark image, since my web site may not handle a lot of traffic). These real-world results come from the Jet3D computational fluid dynamics noise prediction software, which I developed for my doctoral thesis and currently use in my work at NASA. Jet3D is written in a combination of FORTRAN 77, FORTRAN 90, and C, and is optimized for AltiVec and dual processors on G4 hardware. When compiled on Linux using Intel's ifc compiler tools, Jet3D also becomes optimized for the P4 (using the various SIMD extensions available on the P4).

As you can see, the G4 does quite well here. A dual processor 1.25GHz G4 system is more than 3.5X faster than a single processor 2GHz P4 system. Though it's not shown on the chart, a single 1.25GHz G4 processor benchmarks at about 1589 MFLOPS, 1.9X faster than the P4. If you look at MFLOPS per MHz for a single processor, the G4 comes in at 1.27 MFLOPS/MHz, while the P4 comes in at 0.42 MFLOPS/MHz. If you want a good example of the MHz myth, look at the Cray, which comes in at 1.78 MFLOPS/MHz with only a 500MHz processor, beating both the G4 and P4.

Without AltiVec, the Jet3D benchmark would be about 794 MFLOPS on the dual-1.25GHz G4, which erases the performance lead over the P4. And then, using only a single processor, the 1.25GHz G4 benchmarks at about 418 MFLOPS, which is about half as fast as the P4. And all of a sudden, the G4 doesn't look very compelling. For the Jet3D benchmark, AltiVec and dual processors are key (AltiVec more so than dual procs). This is true for most benchmarks I have looked at; thus numerically intensive applications that can't use AltiVec and/or dual processors are likely to suffer on the G4.

In the case of Jet3D, it was easy to optimize for AltiVec. I was able to hand-vectorize about 10 lines of code within the guts of the FORTRAN algorithm and convert the computations to C for easy access to AltiVec hardware instructions. It had a huge effect for not a lot of work. For other more complicated cases, it may be possible to use the VAST compiler tools to automatically vectorize and tie in with AltiVec (VAST has parallel tools also). But in some cases, vectorization is not possible or feasible. In those instances, you're stuck with the processor's scalar performance, and the P4 generally has better scalar performance than the G4 in my experience. One final note: these are my personal views, and do not represent the views of NASA Langley Research Center, NASA, or the United States Government, nor do they constitute an endorsement by NASA Langley Research Center, NASA, or the United States Government

Re:And before anyone asks... by Llywelyn · 2003-07-04 10:50 · Score: 2, Informative

Oh boy.

Those systems aren't *exactly* what I would call comparable. A HD that is 4x bigger, a superdrive, and thats just the stats you posted (I'm sure I could draw it out with things like the Airport Antenna).

"I still haven't seen reliable benchmarks with the dual 2.0GHz facing a P4/3.2GHz with Hyperthreading on"

Veritest disabled HT for tests where the system would be slower, left it on where it was faster. They also enabled SSE2. You can check all of that in their report off of their website.

"if they are three times the cost of a PC, buyers will have a hard time justifiying it."

Apple doesn't sell in the low-range (exempting iBooks), they sell mid-range and up. For those of us who purchase Apple systems, we don't want the cheapest system we can get, we want a system that /just works/.

--
Integrate Keynote and LaTeX

Re:NASA Verifies Apple Benchmarks? The Future by Nom+du+Keyboard · 2003-07-04 10:50 · Score: 3, Insightful

Apple will be competitive still when the storm hits... but I don't think this hardware is revolutionary when compared to it's future peers.

If, as widely reported, the PPC 970 goes from 2GHz to 3GHz in the next 12 months it will definitely be more than competative.

It has been along time since I've seen performance increase by near 50% in a year. Takes me back to the 486DX2-66 days.

--
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."

Re:Wha? by angst7 · 2003-07-04 10:55 · Score: 3, Funny

Though dual processor benchmarks are not presented in detail here, it is worth noting that the G5 system benchmarked at 498 MFLOPS and 0.125 MFLOPS/MHz for scalar Jet3D performance when two processors were used.

That was the above poster's point. Mkay?
</karma burn>

--
StrategyTalk.com, PC Game Forums

Re:5177 MFLOPS 288 MFLOPS by WasterDave · 2003-07-04 10:59 · Score: 4, Interesting

Someone please explain to me how 5177 MFLOPS and ~300 MFLOPS are even comparable.

They're not, which is what makes this whole benchmark so entirely useless.

Look at it: The conclusion, basically, is that there's no point in running CFD code using scalar FP. So why didn't they port their code to SSE2? P4's, and particularly the new 800MHz FSB P4 get data through SSE2 code like there's no tomorrow.

Nah, I'll listen when someone compares SSE2 and AltiVec properly. Until then it's just more blah. Don't get me wrong, I'm rapidly turning into the biggest Mac fanboy you've ever seen (Cocoa, since you ask) but the G5's are not the quantum leap Apple are making them out to be. Back in contention? Sure, but I promise you a dual Opteron 2GHz will blow the doors off a dual G5.

Dave

--
I write a blog now, you should be afraid.

NASA + Apple = national conspiracy? by JB72 · 2003-07-04 11:05 · Score: 2, Funny

Well this story really makes me nervous. After taking in the comments from fellow /. readers, I am now convinced that Apple has conspired with the NASA Langley Research Center to make the G5 CPU look fast. The real clever bit was how they only used one CPU, and and unoptimized compiler, just to make it look "authentic" I suppose. Clever bastards. But they'll never fool me. They have my crappy ass Wintel box when they pry it from my numb, pale fingers.

Damn Dude, Read What I Wrote by Dak+RIT · 2003-07-04 11:06 · Score: 4, Informative

If you follow that little link I gave you for Apple's benchmark claims, you'll see that the performance advantage Apple is claiming for its Dual 2Ghz G5 (I wrote Dual if you reread) is almost identical to what NASA is claiming for the Dual 2GHz G5 against a 2.66MHz P4.

Apple claims 15.7 for the Dual 2GHz G5, and the 3GHz P4 getting an 8.07. NASA gives the Dual 2GHz G5 498MFLOPS and the 2.66GHz P4 255MFLOPS.

If you use your math skills: 15.7 / 8.07 about equals 498 / 255. So therefore we can draw the conclusion that they have similar results.

Now, NASA only used a 2.66MHz P4 while Apple used a 3GHz P4. Although remember NASA's figure that the P4 had 0.096 MFLOPS/MHz? Give the P4 333 more MHz, and you find it has about 286.968 MFLOPS. NASA also suggests a 20% performance increase can be expected with compilers that take advantage of the G5.

Although, even without this increase Apple's benchmark and NASA's benchmarks are very close. Which would lead one to draw the conclusion that Apple's benchmarks were in fact valid.

I should also note that a P4 would not perform as well in a dual system as the G5 does. So your 500 MFLOPS number is a little rediculous. The G5 which is an amazing dual proc chip saw it's 254 MFLOPS for a single processor (508 when doubled) drop to 498 MFLOPS in a dual system. And the P4 isn't designed for a dual system, doesn't support HyperTransport, etc.

Dak

Re:Damn Dude, Read What I Wrote by Jmstuckman · 2003-07-04 12:39 · Score: 2, Funny

>

A 2.66 MHz P4? I imagine that one of those would be tough to beat!

Re:And before anyone asks... by Mononoke · 2003-07-04 11:10 · Score: 3, Interesting

Just thought I'd point out that there's a bit more difference here than meets the eye.

There's more difference between your two comparison systems than you seemed to notice also:

The Mac has a DVD-R/CDR drive
120 Gig HD is just a little bit bigger than the 40 Gig
The "free" monitor is only worth $100 to Dell
The Mac includes video editing software
The Mac's video card is dual output with ADV and VGA connectors, dual-monitor capable

An "equivalent" Dell system is nearly $1400, according to their website, if I start with your example as a basis.

Sorry, but it's worth $1000 to me to have a computer with a better ROI and no Windows.

--
NetInfo connection failed for server 127.0.0.1/local

Re:And before anyone asks... by mamer-retrogamer · 2003-07-04 11:25 · Score: 5, Informative

The G5s are Apple's flagship product line. Comparing el-cheapo Dell's to high-end Apple's is like comparing... well you know where I'm going with this train of thought (something about oranges, I think).
How about a more fair comparison? Namely, between similarly configured high-end single-processor systems:

Apple PowerMac G5:
1.8GHz PowerPC G5
250GB Serial ATA - 7200rpm
SuperDrive (DVD-R/CD-RW)
512MB DDR400 SDRAM (PC3200)
Mac OS X
AppleWorks
ATI Radeon 9800 Pro
56k V.92 internal modem
No Monitor
$2874

Dell Dimension XPS:
3.2GHz Pentium 4
200GB Ultra ATA - 7200rpm
DVD+RW/DVD+R/CD-RW
512MB DDR400 SDRAM
Microsoft® Windows® XP Professional w/ Microsoft® Plus!
Microsoft® Works Suite 2003
ATI Radeon 9800 pro
No Monitor
$3062

And if you are to believe the benchmarks, it seems that Apple is selling the faster system for a lesser price than a similarly configured Dell.
Apple has never competed at the low end. It is not starting now.

-Mike

--
Schrödinger's cat is not amused—maybe.

Re:MFLOPS/MHz? No AMD, Old P4, Old Redhat. by localghost · 2003-07-04 11:26 · Score: 2, Informative

The unit MFLOPS/MHz is a little weird. Let's simplify. / MFLOP \ | ----- | \ S / ----------- / MCYCLE \ | ------ | \ S / Multiply by the reciprocal... MFLOP S ----- * ------ S MCYCLE Millions cancel, seconds cancel... FLOP ----- CYCLE So it seems that this unit is equal to 1 floating point operation per CPU cycle. That makes a little bit more sense as a unit.

Re:Turn the optimizations on first. by dhovis · 2003-07-04 11:29 · Score: 5, Interesting

I think the thing that most people on /. seem to keep missing is this: MacOS X and Linux both use GCC as their primary compiler. The Linux kernel is compiled with GCC, as is Darwin. Most software for each platform is compiled with GCC.

Now, with all these Linux-heads around here insisting that Linux is faster than Windows on x86, you'd think GCC for x86 might be a good compiler. Certainly the SPEC tests Apple (and Veritest) did with GCC on the G5 with OS X and the dual Xeon Dell with Red Hat had to have been a valid comparison between those two situations.

I also keep seeing all these comparisons to Dell computers without full specs of the Dell. The base configurations for the PowerMac G5 is positively loaded. How many $500 Dells come with Gigabit Ethernet? How many have the same level of engineering into the thermal managment?

Only time will tell for sure. In the mean time, remember that IBM will be producing blade systems with the 970. We'll get a chance to compare those as well eventually.

--

--
The internet is the greatest source of biased information in the history of mankind.

Re:5177 MFLOPS 288 MFLOPS by Anonymous Coward · 2003-07-04 11:30 · Score: 5, Informative

I recently got the chance to do a testrun, doing some airflow simulation on a G5 1.8GHz demo machine, and with altivec optimizations it clocked in at roughly 2100MFLOPS average for 5 runs(I could probably get better results with a better compiler though), while the dual Opteron 1.8(which the place where I did the testrun has bought 10 boxes of for their renderfarm), running Suse Linux, and my program re-compiled for x86-64 and SSE2 performed at about 2960MFLOPS average, but that could probably be improved with a better compiler too, but I had to use GCC at this time. Both machines had 4GB RAM btw.

My Hyundai is faster than your Porsche... by today · 2003-07-04 11:32 · Score: 2, Interesting

...because it gets more miles to the gallon.

Re:Costs (system, not processor...) by hc00jw · 2003-07-04 11:50 · Score: 2, Insightful

I'd read some thread a while back on another board saying that "Macs are cheaper than PCs". I still can't believe anyone would make that argument. Doesn't being really good in a few areas satisfy the mac people? Do they have to try to spin higher costs as 'lower' (craziest thing I'd ever heard...)

I think that's fair, for what you will be getting!

160GB Serial ATA, SuperDrive (CD-RW, DVD-R), Firewire 800, USB 2, ATI Radeon 9600 Pro...

Plus all the standard things that apple don't actually mention in their specs any more of course. 16 bit sound, gigabit ethernet, and so on.

And that's without the free software. iMovie, iCal, iPhoto, iDvd, etc., (which you may or may not want, so adjust value of system accordingly).

The thing with your comparision, is that you are comparing the G5 processor with the P4 processor. Now try comparing the G5 system to a PC system with a P4 with equivilant specs. Hard drive, DVD writer, latest firewire, gigabit ethernet, etc, and you soon see the PC cost more.

And that's without touting the advantages of Mac OS X and all the other points that have already been made in this thread...

This is Apples advantage and disadvantage. Because they won't sell you anything less than top quality in their pro line... And that costs!

People seem to miss the big picture some times!

Re:5177 MFLOPS 288 MFLOPS by slithytove · 2003-07-04 11:58 · Score: 3, Interesting

a single 970@1.8Ghz scored 2100MFLOPS and a DUAL opteron@1.8Ghz scored 2960 !? I assume your code was multithreaded/multiprocess? Your's sounds like a much more interesting test than nasa's - could you give any more info?

Re:Wha? by bnenning · 2003-07-04 12:21 · Score: 4, Insightful

A midrange p4 beat the highest end g5 available.

By a whopping 0.4%, and with one of the G5's processors disabled. You can spin it any way you want, but the clear fact is that with the G5 Macs are competitive in CPU performance again. I don't see why this disturbs you so; competition is good.

--
How to solve most of our problems: 1.Lots of nuclear plants. 2.Cure aging.

Re:MFLOPS/MHz? No AMD, Old P4, Old Redhat. by charnov · 2003-07-04 12:32 · Score: 2, Insightful

They couldn't get a hold of a super-cheap Athlon XP or even a sub $1500 Opteron server, but they got a hold of an Apple G5 which isn't even available for sale???

Puh-lease... The gripe with the SPEC benchmarks was that Apples numbers for the competition were WAY below the OFFICIAL numbers, ot that Apples numbers for their own equipment was crap.

Jeeze...let's at least wait till these things are SHIPPING.

--
[RIAA] says its concern is artists. That's true, in just the sense that a cattle rancher is concerned about its cattle.

Re:MFLOPS/Mhz. - Useless Metric by maraist · 2003-07-04 12:47 · Score: 4, Informative

Actually, you're both correct, and you're both missing something.

Originally, the pipelining was segmented based on the I-Fetch, D-Fetch (register/etc), Exec, Reg-Write-Back, with expensive floating point doing with different timing considerations (externalized or delay-locking multi-stage execution). Then they started sub-dividing each of those stages (especially in CISC archetectures). Now its common to see 15 integer execution pipeline stages - either with shared resources, such that you can only have one divide occuring at any given time (early P-I, P-II, P-III), or with fully independent/concurrent resources (AMD's Athlon).

The addition of the pipelinable-stages between I-Fetch, D-Fetch, exec, and WB was somewhat trivial, because prior to pipelining, there were still seperate events on seperate clock-ticks with inter-stage latching. However, in CPU's with exec-stages that are pipelined, you are introducing additional latches that cause additional undesirable propagation delays.

So a 15 stage integer multiply unit (excluding fetch/WB) has 15 x [guestimating] 4 propagations of additional latency over a single-stage I-unit. If there are resource-based stage-interlocks, or worse data-dependencies, then the pipelining is useless and you're totally hit by the excess propagation delays.

Still, marketing being what it is these days, adding more stages means less propagations per stage, thus less worst-case propagation time, and thus higher clockability (all else being equal; temperature, etc).

The P4, however, compensated by double-clocking the core integer stages, so the number of advertised stages is somewhat misleading.

On a side note, due to the latching in pipelining, you're definately doing more total work for a given instruction. And more importantly, the designers have to think of totally different logic-algorithms to efficiently pipeline than to single-stage. My guess is that the pipelined versions will always be less efficient (especially considering that not all stages will fully utilize their allotted clock-time), and thus there's an additional loss.

Ok, so this supports your post, but here's the part about power/heat.

There are two types of transistors used in modern CPUs (everything past the Pentium). BiPolar and Field-Effect. The CMOS-FET refers to Complementary Metal-Oxide-Semiconductor Field-Effect-Transistor. This acts similarly to a capacitor in that there is a charge and discharge time with little waste current, and power dessipation is typical V=IR, Pwr = I^2 * R. The gate capacitor charge-time is the killer, and what limits switching speed (and thereby clock-speed). Shrinking the area of the capacitor (related to the micron-size stated, .18u, .15u, .13u, etc) means there's less time necessary to charge the capacitors, and thus speeds are increased. (This is only one aspect, and I'm no expert here).

There's another way of reducing switching speed.. Increasing the amount of current running through the wires that ultimately charge/discharge the gate-capacitors. FETs are poor amplifiers, but BiPolar (while more complex and harder to make small) are phenominal. In addition to their complexity, Bipolar also are power hogs. While a FET only consumes power while turning on or off, BiPolars are always on, consuming power (there is current bleeding from the switch). So what often happens is that designers sprinkle BJT's here and there to amplify the current (at the expense of cost/complexity and power-dessipation), and continue using FETs everywhere else.

The bigger and greater number of BJTs that are used, the faster some heavily loaded FET gates will charge and the quicker their switching time will be.

If you up the voltage on a CPU, you're enhancing the amplifier's ability to charge the capacitors and thus gives you more safety-room to up the external clock-speed.

Again, this deviates somewhat from my knowledge domain, but you can often merely co

--
-Michael

+3db Doubles Power, +10dB Doubles Loudness by DonnarsHmr · 2003-07-04 12:47 · Score: 3, Informative

Plus or minus 3dB represents a doubling or halving of the power, respectively. However, quietness or loudness is a subjective quality. Most statistically normal humans seem to agree that plus or minus 10dB doubles or halves the apparrent loudness. Psychoaccoustics bears no relation to math or physics.

Re:No by Jeremy+Erwin · 2003-07-04 13:13 · Score: 2, Insightful

It's still useful (although marginally so) to have a statistic which allows you to compare a 3.4 Ghz Pentium IV with a 2.16 Ghz G5. In theory, you could multiply the clockspeed by the "efficiency" and get a rough idea of which machine will run your code faster, without having to redo the benchmarks every time the manufacturers release a new PC.

It's unAmerican, I tell you... by qtp · 2003-07-04 14:10 · Score: 2, Funny

Damn liberal editors caving int those artsy-fartsy mac users again...

How dare you print a story that derides the paragon of the IT hardware industry, Intel, as being second best to a processor installed in a machine maniufactured by those long-haired elitist at Apple.

And to think that you would accept the word from those commie "scientist" down at NASA, who sit around doing nothing all day except "thinking" and playing with thier toys at taxpayer expense.

Shame, shame, next thing youl be tryin' to convince me that Linux OS is more secure than the hallowed work that our hero in capitolism, Bill Gates, does at microsoft. It's bad for the economy, I tell you. All these long haired, smelly, weirdos keep messing with our good old American way of life. What, you say you want a choice? You have a choice! You'll choose from what choices we here at the Central Office tell you to choose from!

Take your Open Source, MacIntosh, OS-X (You don't think I know whaty OS-X really means, do you, but I do, smarty pants, I do indeed..), and whatever else it is you keep trying to give us and go back to Russia, or wherever it is that you came from. Who invited you to our red-blooded American industral party anyway? Nobody! That's who!

Why you pnko, beatnik, hippy, givin' it all away, don't think you need the establishment faggo...

and so on,

and so on,

and so...

--
Read, L

Whats the difference? Distinction died long ago by Ars-Fartsica · 2003-07-04 14:18 · Score: 2, Interesting

What is the difference between a PC and a workstation? I think its pretty clear the term workstation as a distinguishing label died long ago, some time after "PC" performance on the desktop destroyed "workstation" performance.

G5 has much slower SpecFP than Power4 (same clock) by vmp17 · 2003-07-04 14:28 · Score: 3, Informative

At "SPEC", you can easily find the performance of a Power4 @ 1.45 GHz. Its SPEC2000 rating for floating point is 1097. When you scale that to the 2.0 GHz processor in the G5, you conclude that it has a SPEC score of about 1500.

Wrong Conclusion!

According to this pdf (page 13) G5 @ 1.8GHz has 1051 SpecFP.
At the same time Power4 @ 1.7GHz has 1598 SpecFP !!!

It is very clear that Power 970 (G5) is much-much slower in floating point than it's Power4.

Re:Turn the optimizations on first. by agent+dero · 2003-07-04 14:55 · Score: 2, Interesting

Please note,
gcc + g++ for MacOS X on G5 platform = $0

Intel's C++ compiler for pentium based platforms = $399
prices

Now factor in Operating Systems prices, and general software.
MacOS X 10.2 $129 Windows XP Professional $143
iTunes $0 MusicMatch Jukebox Plus $19.99
AppleWorks $0 WindowsXP Office $297

Software Total: MacOS X: $129 WinXP: $459.99
(prices taken from www.newegg.com)

--
Error 407 - No creative sig found

Bow wow wow yippie yo yippie yay... by MsGeek · 2003-07-04 15:36 · Score: 2, Informative

The reason why there are no RH 8 or 9 ISOs for PPC is that basically they are leaving development of the PPC branch of Red Hat to the makers of Yellow Dog Linux. Which, right now, is G5 ready and is basically RH9 for PPC. http://www.yellowdoglinux.com/

--
Knowledge is power. Knowledge shared is power multiplied.

Re:They left out an important graph. by dmarcoot · 2003-07-04 15:37 · Score: 2, Insightful

no, most important graph of all is time being productive versus time making shit work the way it should so i can be productive. Apple has always won that hands down.

Bebox by fredistheking · 2003-07-04 16:13 · Score: 3, Funny

Does anyone remember the dual-ppc that the BeOS people were originally selling? Anyway, you could turn off one of the processors but if you wanted to, you could turn them both off. I think they evenrtually removed this feature. =)

-

Re:Apple had little say in the Power4 CPU by thatguywhoiam · 2003-07-04 16:54 · Score: 2, Informative

As much as people would like to believe that Apple created the G5, that's simply not true. IBM was the one that created it, about 2.5 years ago for their servers.

Ah, but Power4 != G5. The G5 was called the GigaProcessor UltraLite (i know) in development and is quite a different beast from a full-blown Power4; it is scaled down to 'desktop tolerances', has not as many cores, and an AltiVec unit. Apple probably had a hand in what is now known as the 'G5' since the very beginning.

--
If Jesus wants me it knows where to find me.

Intel, oh Intel, where art thou? by bninja_penguin · 2003-07-04 17:09 · Score: 2, Interesting

Is it just me, or do other people notice the growing trend of Intel really sucking hind tit? Sure, they are over 3Ghz, but AMD and now even Apple are sporting processors that are more than one thousand mhz slower, yet, over all, perform the same or better. I figure by the time Intel hits 4 or 5 Ghz, Via's Cyrix processor will be just about 1.8Ghz, but will have been worked to where it's performance will be on par with the Intel offering. Ol' Intel had better double their advertising budget, or even Dell will have to start offering non-intel systems.

By the way, no, I don't benchmark systems I use, as that, in my mind is like putting a car on a dynometer. I mean, who realy cares how much horsepower a machine has, if it's gearing is completely mis-calculated? Based upon actually using systems, I find that G3s seem very slow to use, G4s not bad, P4s a little better, and AMD Athlons wicked quick. Yes, all systems were slightly different clock speeds, but all had 256MB of RAM, except the P4, it has 512MB of RAM. In just normal usage, nothing I've come across can touch the Athlons for performance. However, I also do NOT do video editing, sound editing, etc. I just play a few games, do programming for school, the internet, and such. So no, I do NOT care about how fast something can open Photoshop, or if these systems can do real time video editing. I didn't build/purchase them for that.

There's my two cents, could I use that for a down payment on a new dual G5?

--
For those who describe their systems as 'boxen', do you order multiple 'boxen' of corn flakes also?

Re:Still not the same comparisons... by Billly+Gates · 2003-07-04 18:14 · Score: 2, Insightful

It has been stated here before and I will state again for the millionith dam time Steve Jobs specifically states "..first 64-bit personal computer"!

He did not say the first 64-bit computer or 64-bit server but 64-bit personal computer.

Sheesh.

Alpha's and Sparcs are not cheap and are considered servers . You do not go to CompUSA and buy a digital Alpha server or Ultra 10x workstation. THey are certainly not designed for the average joe and the price reflects it.

Their is no software for them that is not specialized anyway. They are specialized workstations for running a particular set of apps. No photoshop or quakeIII for the kids.

FYI I am not a mac user.

--
http://saveie6.com/

Re:Costs - correction by Phroggy · 2003-07-05 05:40 · Score: 2, Insightful

Was that the 3.06GHz Xeon that Apple demonstrated, or a slower dual-Xeon system?

Does that include most of the hardware features of the G5? Gigabit Ethernet, FireWire, optical 5.1 audio, CD-RW/DVD-R drive?

A friend of mine ran the numbers on Pricewatch last week and didn't have the same results you did. I don't recall what he found, but for similar specs it wasn't cheaper than the G5.

--
$x='S24;r)>63/* h@<5+oZ)32"5cz';$me='phroggy'x$];
$x=~y+ -xz+\0-Tx+;print$_^chop$me for split'',$x;

Straight forward by MarkCollette · 2003-07-05 07:22 · Score: 2, Insightful

I see a lot of people arguing, so here's my summary/explaination:

- A single G5/970 top of the line is probably 20% slower than P4 top of the line, on unoptimized software. By that I mean regular integer and floating point (no AltiVec), and with unoptimized compilers. The compilters may or may not improve significantly over time, so that shouldn't be afactor on release date, but may pan out later.
- The G5/970 is not designed/optimized to ship as a single CPU, instead the bus, etc. are designed for SMP of between 2 and 8 CPUs (might technically work with more, I don't know). So, you'll see that, in a dual CPU comparison that the G5/970 will scale better than the P4 (xeon) duals, narrowing the performance gap, and in some applications, putting the G5/970 on top.
- With software that is recoded to work with AltiVec, compared to SSE(2) optimized software, the AltiVec performs noticably better, but that depends on the application. When comparing AltiVec to regular (non-vector optimized code), there can be as much as an order of magnitude increase in speed, again depending on t6he application.

So, expect benchmarks to vary by a tremendous amount, depending on the number of CPUs, and how optimized for AltiVec the software is, and how mature the compiler that was used. Since a lot of software can be vectorised, I would expect software on the G5 to initially lag P4 performance, but then to dramatically speed up in some areas.

Also, expect a lot of people who don't grasp this to be getting their panties in a knot!

Re:Single vs Double Precision by Ahaldra · 2003-07-05 23:06 · Score: 2, Informative

k, I'm loosing the moderator points I already spend on this thread but what the heck-

You said:
Altivec is nice, for what it is meant for (mainly media type calculations, signal processing, etc.) But scientists will prefer SSE2.

Well I was under the impression, that if I need double-precision floating point arithmetic I use the FPU (which is quite fast with the G5 as you can see in Fig. 1 of the Article).
Concerning fast vector operations - some problems can be "dumbed down" to take advantage of the faster single precision units and lastly you can do an additional Newton-Raphson refinement step where double precision is needed.
Your claim "scientists will prefer SSE2" was proven wrong by the article, where the Jet3D test suite was only ported to take advantage of the Altivec SIMD unit, not the SSE2 SIMD unit - unless of course you argue that NASA engineers are not scientists ;-)

IMHO real scientists use whatever they think is best suited to solve their specific problem.

--
Code is Speech. No to Censorship.

Re:Single vs Double Precision by ChadN · 2003-07-06 00:21 · Score: 3, Informative

Well, first of all, I work at NASA as a researcher, doing numerical work (although I'm not a civil servant, and don't speak for them).

I agree that AltiVec is superior to SSE (ie. single precision), but you compared it to to SSE2, which is a bit apple-to-oranges (no pun intended, btw). If the G5 FPU is faster than current SSE2 at double precision, it just proves the well thought out design of the PowerPC architecture (and the unfortunate legacy of Intel's FPU instruction set, which is still a handicap even with SSE/SSE2, due to the need to mode switch).
But SSE2 is still immature, and I expect compilers to improve, as well as chip implementations. Once they do, a more meaningful comparision can be made.

The Intel chips NEED stuff like SSE/SSE2 to achieve faster floating point speeds, whereas the PowerPC can get by without it, thanks to a much better FPU design, and thus, PowerPC makers will probably not spend the silicon to make a double precision SIMD instruction set anytime soon.

I stand by my claim that while most consumer and media software can get by with single precision, scientific computing (ie. large matrix calculations, to be blunt) quite often needs double precision (hell, you can get libraries that use 128 bit long doubles, these days), and will ultimately prefer SSE2. Scientists fuss with single precision SIMD simply because many of their applications can benefit so much from SIMD that it is worth the pain to use single precision (with proper conditioning and verification, etc.) Now that double precision SIMD is available, I can only predict they will want to jump to it, once tools for using it are there.

Granted, if Intel can't make a double precision SIMD unit that outperforms a double precision general FPU like the G5's, for matrix problems, then they don't deserve to design chips for scientific computing. :)

--
"It's overkill, of course. But you can never have too much overkill." - Anonymous Slashdot Coward

Slashdot Mirror

NASA Benchmarks the New G5 Powermac

130 of 751 comments (clear)