NASA Benchmarks the New G5 Powermac
sockit2me9000 writes "Well NASA's Langley Research Center recently benchmarked the new G5 dual 2ghz Powermac against a dual 1ghz Xserve, a dual 1.25 ghz Powermac, a Pentium4 2 ghz, and a Pentium4 2.66 ghz. To make things fair, the second processor in the G5 was switched off, as well as the other dual sysytems. Then, they all ran Jet3d. Even with un-optimized code and one processor, the G5 performance is impressive."
Because I have a strong feeling this is going to be asked:
;)
For those of you who were wondering, you too can switch off one of your Mac's dual CPU's with the Apple CHUD Tools. Look near the bottom of the page. It'll make you appreciate your second processor
Personally though, I want to see how well it runs Seti@Home.
Vonal Declosion
By adding a second processor, the MFLOPS/Mhz output only dropped from 0.127 to 0.125 MFLOPS/Mhz. This chip can definitely perform in a multi-processor environment. The P4 scored 0.096 MFLOPS/MHz with a single processor.
Apple's benchmarks which were highly criticized by some, gave the Dual 2GHz Power Mac G5 a 194.5% performance advantage over a 3GHz P4 in SPECfp_rate_base2000. The G5 getting a score of 15.7, and the P4 getting an 8.07.
NASA's study found the Dual 2GHz Power Mac G5 to score 498 MFLOPS for their Jet3D performance. A P4 running at 2.66GHz scored 255 MFLOPS: a 195.3% performance advantage for the G5 in this test. If we assume a direct correlation between MHz and MFLOPS for the P4 (which would actually overstate the performance of the P4) and increase the P4's score by 12.782% this would give the 3GHz P4 a score of 287.594 MFLOPS. This is still a 173.16% performance advantage for the G5, and NASA states that a 20% increase in performance for the G5 would be reasonable "when G5-aware compiler tools become available."
So it would seem NASA's benchmarks go a long way in validating the benchmarks for the G5 that Apple released last month at the WWDC. In fact, NASA appears to be giving the G5 even better scores than Apple and Veritest did.
The vector tests that NASA performed to test the G5's AltiVec instruction set produce some even more impressive results, and would be a good indication for why the G5 outpaced the Xeon and P4 by such dramatic amounts on real world tests (at times more than 700% faster than a 3GHz P4). "The vector version of Jet3D runs an order of magnitude faster than the scalar version (speedups of 10X-13X are typical)." The dual 2GHz G5 was benchmarked at 5177 MFLOPS (a 1040% increase over the scalar test) and 1.29 MFLOPS/MHz. This also seems accurate considering Ars Technica's claim that the AltiVec engine wasn't as well integrated into the G5 as it was in the G4. The 2GHz G5 (single cpu) scored 2755 MFLOPS, or 1.378 MFLOPS/MHz, which shows a slightly larger performance hit for vector operations than floating point operations when moving to a dual G5.
Dak
Why not 8 or 9? Certainly there must be some performance increases in 8 or 9!
Either way, RedHat still won.. 255 to 254, YEAH!!!!
--------
Free your mind.
I love the wording:
"Benchmarks from the scalar version of Jet3D are shown in Figure 1 (MFLOPS) and Figure 2 (MFLOPS normalized by MHz). In terms of raw MFLOPS, the 2GHz G5 is about 32% faster than the 2GHz P4, 97% faster than the 1.25GHz G4, 142% faster than the 1GHz G4, and within 1 MFLOP of the 2.66GHz P4."
Translation: Slower than the P4 for anyone who didn't look at the grid. And M stands for million. Not one.
Then why not simply go with an IBM pSeries server that is based on the same POWER chip? It runs real Unix (at least until SCO wins their lawsuit) and you can get it in rack-sized units today and it supports SMP sizes higher than just 2. Oh right, this is McSlashdot, where worship of all things Apple is practiced. Sorry.
I wonder why none of the NASA boxes were running Windows? I mean, if it's an all sinigng all dancing solution to everything...
(It's ok, you can mod me -1 Troll now. I'm just bitter about an edict on a project I'm working on.)
Beep beep.
What the heck is that a benchmark of? Divide by the Mhz to create a higher number on their nice little graph for the G5?
Why a Pentium 4 2.66mhz?
Why no Athlon?
Why no Opteron?
Why an old old version of RedHat 7.1?
and so on....
I guess Apple was legit in its marketing of calling it the fastest PC ever..
(\_/)
(O.o) This is Bunny. (> <)
I have very bad experience with Portland compiler; I wonder what the results would have been if they had used Intel Fortran Compiler....
It is interesting to note that they used the Portland group compiler instead of the intel compiler. For the CFD code that I work on (which is mostly Fortran), the Intel compiler produces much faster code than the Portland group compiler (as much as 50%).
If there budget is such that dualie 2Ghz G5's are a possibility, then it's somewhat surprising that they A) used such low powered P4's B) that they didn't include Itanic2 systems. Seems that their report really just pointed out stuff that we already knew, the PPC is typcially faster per mhz than the P4 (hell, just about anything is better per mhz than the P4). Interesting to note that on the vector test, the G5 outperformed the G4 is a fashion that is almost purely based on the increase in Mhz (i.e. other system improvements didn't really seem to help much). Compiler perhaps, though some of the architectural improvements would seem to be not dependant on that?
Apple has just created a new market for itself among the hardcore engineers who use workstations for numerical simulations like HSPICE, etc. Steve Jobs lucked out -- again.
By the way, the bell tolls. It tolls ominously for Sun Microsystems.
Hahaha, FreeBSD a distro of Linux?! Hahahaha, good one!
A _bit_ US centric? It's July 5th over here you clever boy.
The only benchmarks that matter is my impression of the system while using the apps I use. Everything else is opinion.
People who bite the hand that feeds them usually lick the boot that kicks them
Something which seems to get lost in the Mac/PC debates *sometimes* is the cost factor. I looked those graphs and thought "Wow - mac is faster at this benchmark". Then I looked up pricing - minimum I can get that mac for would be $1999. An equivalent PC system with the P4 2.66ghz is probably under $900 (didn't spec it out entirely, just did a rough lookup on Dell). Great - Mac is faster. But I can apparently get within reasonable range on PC hardware for probably 50% less cost.
I'd read some thread a while back on another board saying that "Macs are cheaper than PCs". I still can't believe anyone would make that argument. Doesn't being really good in a few areas satisfy the mac people? Do they have to try to spin higher costs as 'lower' (craziest thing I'd ever heard...)
creation science book
The G5 really is a powerful machine, and i wouldn't mind owning one... if only the price for maintaining a mac wasn't so high...
Their scheme for OS X is the equivalent of Microsoft charging $100+ for a service pack, I just don't understand it.
I've used OS X, and it blows everything else out of the water in elequency and it seems the perfect balance between productive and 'cool factor'
But until I win the lottery, I'll stick with my cheap x86 machines
Excuse me, I don't mean to impose, but I am the ocean
I wonder how SCO will affect Mac OS X ? I heard that Mac OS is based on FreeBSD which is a distro of Linux right? I hope Mac doesn't get suied.
Tell me this is a troll. Please!
As you can see I don't care about my karma.
$2999 for the mac 2x2ghz
creation science book
You got modded offtopic, but I checked the link
OK, I guess I missed it.
Care to explain to the rest of us what relevance Barry White has to an Apple benchmark?
Maybe he got modded offtopic because he WAS offtopic?
Jet3D was compiled and optimized for the best performance on each platform
;)
and yet he didn't use the Intel compiler (a free download) on the P4?
It's people like him that keep losing our space probes.
And at what % of the cost of a Pentium 4 2.66 GHz?
Barry White used a Mac.
What's so big about July 4th?
Why no Athlon? They probably didn't have one and with the P4 at 3ghz and climbing, the old althon is becomming less and less significant for these pure number crunching apps. Plus maybe they've done previous tests that said that P4 they uesd was faster than an equivelent athlon, so it didn't need to be tested.
Why no Opteron? They probably didn't have one. This is the most valid question you ask.
Why RH 7.1? That's probably what they use. They are benchmarking PURE CPU so the OS doesn't matter too much for this kind of thing.
Comment forecast: Bits of genius surrounded by a sea of mediocrity.
The major OS version updates are when new features are added, etc. That is the equivelant of upgrading 98 to XP. The cost of buying a Mac is high. The cost of maint. is probably less than a Windows box.
CAn'T CompreHend SARcaSm?
Wait, let me get this straight, your assertion is that the benchmark is invalid because they "probably compared the i386 generated code against the one fot he G5". "Probably" IOW, you have no freakin' clue if this is the case and are just spouting off. Brilliant! Worse yet, you got a +1 Insightful...
The G5s have 9(?) fans I believe.. Someone correct me if I'm wrong, but I remember it being a ridiculous number to that effect. If an equivalent P4 had 9 fans, how much higher could you clock it?
It was interesting to see this paper devote so much effort to the completely useless metric of MFLOPS/MHz. This measurement has absolutely nothing to do with performance, but rather, with the approach taken by the chip manufacturer. You can do more in one clock cycle, as AMD historitcally has done, or less, but optimize for faster clock speeds, as Intel has in recent flavors of the Pentium.
One might be tempted to design a chip that does more in one cycle and then clock it as fast as a chip that does less in one cycle. Unfortunately, while reality is a little more complex than this, the basic reason is that the more a chip does per cycle, the more heat it generates per cycle. If you try to squeeze too many cycles through it in a second it will fry.
So showing that the G5 has better performance per clock cycles is no more useful than showing that an AMD chip has better performance per clock cycle than an Intel chip. All that matters is how much performance you can get from a chip before it cannot be clocked any faster without requiring unreasonable cooling methods.
All this paper shows is that, while the G5 is designed to do more in a clock cycle than a P4 is, the chip tested is ultimately not any faster than the P4 they benchmarked it against. It remains to be seen how the G5 will do at higher clock speeds. With this in mind, it would be *far* more useful to see heat dissipation stats on the G5 since that might give us some idea how close to it's design limits. If it is cranking out high-end P4 performance and running cool *then* I will be impressed.
Maybe because that was the hardware and software available to the testers. Contrary to popular belief, government employees do not have unlimited budgets to buy stuff. The last time I worked in a government office, some of the furniture was older than I was and my PC was built from scrounged parts.
Mea navis aericumbens anguillis abundat
but deep down inside . . everything is linux right?=)
How Now Brown Cow
Linux is almost as popular as Windows 95! Seriously, 1% of that pie is probably tens of millions of people.
Apple has been making better software for years, everyone agrees; They just never had the hardware to back them up. Then every time they do crop up with better hardware, everyone criticizes them and says that it's just not possible, PC hardware is always better they say. But now they've proven you wrong... TWICE, and some trolls STILL don't believe them. It's a sad world. I just wish Apple would open up at least their motherboards a little more, make Macs more customizable, more like PC's so they can start dominating again.
There's also one benchmark I'de love to see. Power Mac G5 vs Sun UltraSPARC III. It's fair: they're both 64-bit procs, and it would really make people look at it in businesses that only look at supercomputers as viable. Then maybe people would start giving Apple and IBM some credit.
My 2 cents (Canadian). Thanks.
"Victory means exit strategy, and it's important for the President to explain to us what the exit strategy is." G.W.Bush
I hope they didn't use gcc (the yet-another free and hopeless compiler).
It should be noted that Apple uses gcc to compile Mac OS X and most of their applications, so it would be appropriate to use gcc on the G5. Intel's compiler might be a more appropriate choice for the Xeon.
$x='S24;r)>63/* h@<5+oZ)32"5cz';$me='phroggy'x$];
$x=~y+ -xz+\0-Tx+;print$_^chop$me for split'',$x;
head up the ass Mac users. Macs offer shiny objects and that's all. They look great, have an easy to use interface and perform second best to PCs.
FreeBSD and Linux have no correlation besides the fact that they are both opensource products. FreeBSD is a kernel _and_ an Operating System. While linux distros are the linux kernel and the set of tools that goes with it to make it an OS (GNU tools, operating scripts, packaging system, etc).
Um...no.
BSD and Linux are two different code bases.
-twb
> Tell me this is a troll. Please!
Yes, your posting is a troll.
Do you feel any better now?
k2r
You got to be joking. Hopeless? Have you even tried the gcc 3.2.x series?
Why not show an mflops/$ chart? Related to my 'cost' post as well, but I felt it deserved its own post. :)
This seems to confirm my belief that most mac people don't buy their own hardware, but get it through work or school.
creation science book
What a pointless test, I hate to say it but the new G5's are just stupidly expensive. I can't believe anyone in a government environment would put together a farm of G5's instead of a farm of Intel processors...
The REAL metric should be MFLOP/$
Are they making use of transcendental functions? How about using multithreading? If they are using either of those two, then they'd see a big improvement if they were running RH9 instead.
Hows that construction gig going?
Just get your Compaq and got online I assume, you common loser?
Suck my collective dick after you are done beating your wife or your wiener, whichever dies first.
This article says 9 fans. Say what you like about the source. :)
"The new PowerMacs feature an all aluminum enclosure featuring 9 fans but with be up to twice as quiet as current Power Mac G4's. The new Power Macs will be shipping in August."
That's nice that they're twice as quiet. But 9 speicalised fans can be expensive to replace over time. They are not immortal.
RTFA.
Why isn't it hosted on NASA servers if it's a NASA report?
Barry White used a Mac.
And? Lots of people die who use PCs - should they be mentioned in every Pentium benchmark story?
Perhaps if he designed the Mac, then it would have some relevance.
$2999 for the mac 2x2ghz
How much for the dual Xeon system they were comparing that too? Yeah, you can build a P4 for $900, but not a dual Xeon.
However, yes, the $1999 low-end G5 is definitely more expensive than a P4 of similar performance, if the only consideration is raw speed. Macs are cheaper than PCs at the high end, not the low end.
$x='S24;r)>63/* h@<5+oZ)32"5cz';$me='phroggy'x$];
$x=~y+ -xz+\0-Tx+;print$_^chop$me for split'',$x;
>>Vector performance of the G5 remains excellent, and is inline with current G4 systems on a per clock cycle >>basis. As a result, raw vector performance of the G5 will be boosted simply by its higher clock speeds relative >>to current G4 systems.
This would seem to be one of the more interesting points made, actually. Prior to the announcement of the G5s, speculation on the PPC 970 suggested that it would be stellar with FP & so-so with integer; the real question surrounded how well IBM would implement SIMD. Many were pessimistic. Given that it seems like they've managed to add it efficiently a scaled-down POWER4 core, future refinements could make this series of chips (PPC 9X0s) real monsters.
But the future viability of that roadmap (given how ruthless the company as a whole tends to be when faced with departmental money losses) depends as much upon the success of IBM's Linux strategy as it does on its success in the PowerMac line.
[With apologies to BadAndy of the Ars Technica boards; thanks for sharing your insights.]
Habit is the ballast that chains the dog to his vomit - Samuel Beckett, "Proust"
It's not Apple's G5, it's IBM's 970 and it's the shizzle.
Karma: The shiznight, mostly because I am the Drizzle.
Yes, I see the article talks a lot about performance per MHz, but that's a useless measurement. That would be like comparing cars by looking at speed/RPM. If you want to get somewhere, car A that goes 60 MPH and runs at 6000 RPM will get you there faster than car B at 50 MPH and 2500 RPM, even though car B has a better speed/RPM ratio.
New G5 Powermac Benchmarks NASA
G5 compared to Itanium and Opteron as soon as someone actually writes a G5 compiler, news at 11:00.
Benchmarking a system thats not out yet with unoptimized code with compilers written for systems from 3 years ago, old news.
Good thing we have some more bogus benchmarks, it's a slow nerdy news day.
- Adam L. Beberg - The Cosm Project - http://www.mithral.com/
i just wanna see a dual 2ghz G5 vs a dual 1.8ghz Opteron
If one thing is 80 dB and one is 90 dB, the second object is twice as "loud." Each 10 dB jump either doubles of halves "loudness." ie: If you're at 1000 dB vs 1010 dB, the 2nd object is twice as loud.
So, based on what was said at the keynote (and my interpretation), the G5s are 10dB quieter. Twice as quiet sounds more impressive. Note that saying "half as loud" still implies "loud" so psychologically it's not as impressive.
If I'm wrong, I'm sure someone will jump on me soon enough. I'm holding on tight.
It was a hardware simulator running on a soundstage.
...if this recent Ask Slashdot post was related?
SCO (noun.)- A Slimy Corporate Ogre. Often seeks free money.
I think it's pretty obvious why they tested the G5: their Altivec program is 13X faster than their scalar program. They don't mention the SSE2 so I assume they have an investment in Altivec programs. Therefore they would naturally be interested in comparing the G5 versus the XServe and G4. Until Intel releases the 34.5GHz P4 (13X 2.66GHz), there doesn't seem to be any reason to run out and buy a latest P4 just for this comparison.
And surely the version of RH Linux hardly matters. Maybe they benchmarked using this OS because (shock, horror) it is what they use daily.
I was RTFA, and this caught my eye:
...
Additional Notes: The G5 system was running Mac OS X 10.2.7 and
I'm only running 10.2.6, and Software Update says nothing new is available. What's up with that?
Tuus crepidae innexilis sunt.
There are fewer Mac users. Every single death hurts them quite a bit.
I'll be more impressed if the folks at the Langley Research Center compared the Apple Power Macintosh G5 with the 2.2 GHz PowerPC 970 CPU against a system running the Pentium 4 3.2 GHz CPU (which has Hyperthreading instruction registers to have almost dual-CPU performance).
My guess is under Jet3D the P4 3.2 GHz machine will likely outpace the 2.2 GHz G5 PowerMac handily in single CPU mode.
The apples to apples (no pun intended) comparisons made to old P4s (not the vector plots) are not all that impressive, as other posts have pointed out. If anything, Apples latest and greatest is slower than an old P4.
What we need is a comparison using a code that is written to use both Altivec (on G4/5s) and sse/sse2/3dnow! (on pentiums/athlons) enhancements. Anyone got something like that?
Actually, it is a little known fact that Barry White was the primary technician behind the development of the Macintosh. He was also the chief GUI architect.
"The vector version of Jet3D runs an order of magnitude faster than the scalar version (speedups of 10X-13X are typical)." The dual 2GHz G5 was benchmarked at 5177 MFLOPS (a 1040% increase over the scalar test) and 1.29 MFLOPS/MHz."
5177 MFLOPS when running a Velocity Engine optimized version of Jet3D.
Now, how much does an P4 extrapolated to 3.2 GHz get? Like 288 MFLOPS?
Someone please explain to me how 5177 MFLOPS and ~300 MFLOPS are even comparable.
As the Mathematica guy said, the competition is no longer high-end PCs, it's now $10,000 UNIX workstations...and the G5 is still faster than any of them.
No wonder the G5s smoke the dual Xeon in the Photoshop, Mathematica, Logic, and Luxology app bake-off. All these apps would have been optimized to use the Velocity Engine.
If I were a scientist doing lots of image processing and vector calculations, I'd need a cluster of about 18 or so 3.2 GHz P4 machines to keep up with the dual 2 GHz G5 PowerMac running a typical Velocity Engine optimized app.
That's a sweet 5177 MFLOPS for you - evidence the G5s rock as hard as Apple has been indicating.
Um, yeah, sure is lucky Apple found the G5. I'm sure they had nothing to do with its development. It's not like Apple has been involved with development of the whole PowerPC architecture since the early 90s.
But Jobs wasn't at Apple during that time. What's the timeline:
1983: Scully joins Apple.
1985: Scully fires Jobs. Jobs, no longer at Apple , begins Next.
1991: PowerPC alliance between Apple and IBM.
1993: Scully leaves Apple, Spindler becomes CEO.
1996: Gil Amelio becomes CEO. Jobs approaches Amelio with idea for Apple to buy Next. Apple buys Next.
1997: Amelio fired from Apple.
actually what you'll find is that when you buy that shiney new G5, it actually has this little piece of paper inside that offers you future OS upgrades for free, or $30 per. (i don't remember offhand).
also, last i checked the number of a release was rather arbitrary relative to what was added or not added. the differences between 10.1, 10.2 and 10.3 are pretty hefty. well worth $30 per.
but i suppose it's much better to assume you have to pay full price for each.
This serves to only debunk the usefulness of benchmarking...
Tom
Someday, I'll have a real sig.
this is really the lamest comment possible.
so you don't expect someone to do comparisons before beefing out your precious tax-dollars on the best systems available for doing his job?
oh I know, let's put those dollars into something useful like rockets to blast away people not on our buddy list.
i think they used redhat 7.x as it is the last in the redhat range supporting the ppc chipset. isos are not available for 8 or 9; anyone know any different?
Any benchmark that's not against another real 64bit chip with a flat memory space is completely and totally irrelivant. Comparing optimized (no gcc need apply) code against optimized code like in the real world.
Has anyone seen any such relivant benchmarks?
- Adam L. Beberg - The Cosm Project - http://www.mithral.com/
Will it be enough to run Doom III? :)
*ducks*
Hahah, so the P4 does, on average 'less work' per cycle then the g5, so much for the g5 being superior "RISC" and the p4 being inferior "CISC"
Okay, I'm joking. But seriously, that information is a little useful. We can uses it as a baseline to compare new CPUs. So we'll know that a 3ghz g5 is still slower then a 5ghz p4.
This also tels you how many floating point ops per CYCLE the chip can do, which is always intresting.
autopr0n is like, down and stuff.
Looks good coming from a place like NASA, an agencey, a federal agencey, deailng in things of Great Speed and Power. The Article is a Nice little White Paper looking Official Document too.
Wonder if it was an evaluation machine, or an actual bonafied in triplicate carbon paper purchase order duly completed and fully authorized aquisition?
They were only using one of the G5 processors, RTFA
Why an old old version of RedHat 7.1?
:D
This IS Nasa we are talking about. Its like an oligation they have to get so many of their boxes hacked by leaving default installs of old distro's on instead of keeping them up to date
Just thought I'd point out that there's a bit more difference here than meets the eye.
Dell Dimension 4600
Pentium 4 2.66GHz
512MB RAM
40GB hard drive
48x CD-ROM
XP Home
$749. Ships today. Oh, and that comes with a free 15" flat-panel monitor. (Link)
Configured with a CD-RW and a 120GB hard drive, the above system is $919.
Apple Power Macintosh G5
1.8GHz (I'll be fair and not pick the dual 2GHz model)
512MB RAM
160GB hard drive
Superdrive
$2399. Not shipping until September. (By then, the above Dell will be $500 or less.) No monitor included.
Even if the G5 has better performance (and honestly, I still haven't seen reliable benchmarks with the dual 2.0GHz facing a P4/3.2GHz with Hyperthreading on), it still can't compete on price.
No matter how cool Mac OS X is... no matter how awesome these new G5s will be... if they are three times the cost of a PC, buyers will have a hard time justifiying it. In this economy, Apple will be hard-pressed to sell those $2300-$3000 desktops when people can get an equivalent-or-just-slightly-slower Dell for $800... including a flat-panel monitor.
Simpli - Your source for San Jose dedicated servers and colocation!
You mean this wasn't a Tom's Hardware review?
I especiallly love the way G4s used to "smoke a Pentium" in Apple magazines and advertisements. You know what else "smokes" a Pentium? A PII.
Personally though, I want to see how well it runs Seti@Home
My bet is you still won't find any signs of Alien life. So it won't be any better than my old crappy ass P1 166.
But good luck to ya.
And a 2.66GHz P4 barely edged out the 2GHz G5 in scalar floating-point operations, a task it isn't even particularly good at per-clock. If you read the great-grandparent, I think you'll notice that he mentioned the dual CPU score of the G5. So while you're reading the fucking article, you can consider reading the fucking comments, too.
You have exactly 314 seconds to come up with a less retarded plot.
Redhat 7.1 and 7.2 are the standard versions used by industry and research. They are concidered more stable then newer releases. Most GNU/Linux hosting in the US uses RH7.2. It works and does not have some of the issues of OSs that use the kernel of pain. I do all of my development to target RH7.1. After the 2.6 kernel gains some maturity, I will probably change this. BTW, the 2.4 kernel realy has little to offer in the way of improving performance for the type of application NASA is intersted in. The same will not be true of the 2.6 series as can be seen in the current 2.5 tree.
The apple scored 0.39% less then a 2.6ghz p4. Given that there are 3.2ghz p4s out there, as well as dual athlons, I doubt this is the 'fastest ever'
autopr0n is like, down and stuff.
Yes! A store for Slashdot Trolls!
http://nero-online.org/store.php
The Pentium comparison was just a fun side show.
Moderators should have to take a reading comprehension test.
3.06GHz with 512k cache? Dell has been selling these for a while.
What about Dual Opteron?
Why do people tests Macs that aren't even out yet against CPUs from the competition that are several speed increments below the top?
According to some of the comments on this board, the intel fortran compiler is about 50% faster then the portland compiler that was used. If the intel compiler had been used, the p4 would have scored 0.144 FLOPs/clock cycle*, which is much better then the g5.
(*which is exactly the same thing as MFLOPs/Mhz. And they wonder why NASA has so much trouble converting things...)
autopr0n is like, down and stuff.
Well then! Happy July 5th to you and your family!!
Let's all come together and rejoice in this special day!
And please allow a most sincere Happy July 6th as well.
The 498 MFLOPS figure was WITH 2 G5s!!!!
With a single G5, the 2ghz got a 254, and the 2.66ghz P4 got 255 MFLOPs...
Please read the article more clearly, this DOES NOT IN ANY WAY validate apple's earlier claims... here's the quote that was misread
"Though dual processor benchmarks are not presented in detail here, it is worth noting that the G5 system benchmarked at 498 MFLOPS and 0.125 MFLOPS/MHz for scalar Jet3D performance when two processors were used."
Followed by a chart showing the P4 2.66ghz with 255MFLOPS at the top and a G5 2ghz with 254MFLOPS at the bottom...
So you could guess that a dual 2.66ghz would get about 499-500MFLOPS which would be a 0% performance advantage to the G5, and the P4 3.2ghz would be even faster...
If this was done by NASA, why isn't the report on a NASA web server? (as opposed to some guy's personal homepage) kinda limits the creditability of this whole thing...
They used RedHat 7.1 only on the faster P4, the slower P4 was with RedHat 7.3.
You can easly build a dual Athlon system that could trounce an equivilant g5 for less cost. Xeon's arnt the only MP capable x86 chips out there...
autopr0n is like, down and stuff.
Hehe.
Found this from last Jan:
Date: Mon, 13 Jan 2003 23:29:38 -0500
From: Craig Hunter
Subject: G4 vs. P4 performance
I have been following the discussion of Rob Galbraith's benchmarks with much interest, as I have spent a good deal of time testing, optimizing, and benchmarking software for the G4 (OS X) and P4 (Linux).
The first thing to realize is that there are numerous benchmarks that show the P4 is faster, and there are numerous benchmarks that show the G4 is faster. What matters? Well, probably the benchmarks that apply to the kind of work you do. For people doing photo processing with the software Rob tested, his results are extremely relevant. But, someone working with a program optimized for AltiVec and dual processors might have a completely opposite experience.
Just to give an example of a benchmark that goes the other way, see this chart.
(You're welcome to mirror this benchmark image, since my web site may not handle a lot of traffic). These real-world results come from the Jet3D computational fluid dynamics noise prediction software, which I developed for my doctoral thesis and currently use in my work at NASA. Jet3D is written in a combination of FORTRAN 77, FORTRAN 90, and C, and is optimized for AltiVec and dual processors on G4 hardware. When compiled on Linux using Intel's ifc compiler tools, Jet3D also becomes optimized for the P4 (using the various SIMD extensions available on the P4).
As you can see, the G4 does quite well here. A dual processor 1.25GHz G4 system is more than 3.5X faster than a single processor 2GHz P4 system. Though it's not shown on the chart, a single 1.25GHz G4 processor benchmarks at about 1589 MFLOPS, 1.9X faster than the P4. If you look at MFLOPS per MHz for a single processor, the G4 comes in at 1.27 MFLOPS/MHz, while the P4 comes in at 0.42 MFLOPS/MHz. If you want a good example of the MHz myth, look at the Cray, which comes in at 1.78 MFLOPS/MHz with only a 500MHz processor, beating both the G4 and P4.
Without AltiVec, the Jet3D benchmark would be about 794 MFLOPS on the dual-1.25GHz G4, which erases the performance lead over the P4. And then, using only a single processor, the 1.25GHz G4 benchmarks at about 418 MFLOPS, which is about half as fast as the P4. And all of a sudden, the G4 doesn't look very compelling. For the Jet3D benchmark, AltiVec and dual processors are key (AltiVec more so than dual procs). This is true for most benchmarks I have looked at; thus numerically intensive applications that can't use AltiVec and/or dual processors are likely to suffer on the G4.
In the case of Jet3D, it was easy to optimize for AltiVec. I was able to hand-vectorize about 10 lines of code within the guts of the FORTRAN algorithm and convert the computations to C for easy access to AltiVec hardware instructions. It had a huge effect for not a lot of work. For other more complicated cases, it may be possible to use the VAST compiler tools to automatically vectorize and tie in with AltiVec (VAST has parallel tools also). But in some cases, vectorization is not possible or feasible. In those instances, you're stuck with the processor's scalar performance, and the P4 generally has better scalar performance than the G4 in my experience. One final note: these are my personal views, and do not represent the views of NASA Langley Research Center, NASA, or the United States Government, nor do they constitute an endorsement by NASA Langley Research Center, NASA, or the United States Government
Why a Pentium 4 2.66mhz?
Why no Athlon?
Why no Opteron?
Why an old old version of RedHat 7.1?
and so on....
RTFA:
"The primary purpose of this test is to determine how G5 performance compares to G4 performance in CFD applications"
If, as widely reported, the PPC 970 goes from 2GHz to 3GHz in the next 12 months it will definitely be more than competative.
It has been along time since I've seen performance increase by near 50% in a year. Takes me back to the 486DX2-66 days.
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
The point, however, is that it gives you a basis to compare any G5 system to any P4 system, and get a rough idea of how they will perform relative to each other MHz for MHz.
Sinced the stated goal of the benchmarking was to be able to gauge G5 performance relative to G4 and P4 performance in fluid-dynamic-number-crunching, this seems a reasonable unit to use. One can easily use it to do cost/performance analyses on current or future configurations of Macs and PCs.
Did they just use whatever they had lying around? Seems to me that if you were gonna benchmark apples fastest, you'd pit it against intel (or amd)'s fastest. I would have expected to see some 3ghz processors (which are still cheaper than G5 2 ghz processors) in the test.
It's nice to see that Apple will finally have some decent power to back them up, but I have STRONG doubts about their "Fastest PC in the world" claim. If a single 2 ghz g5 more or less keeps pace with a 2.66 ghz intel chip, I fail to see how dual 2 ghz system would outperform a dual 3 ghz (or faster)intel-based system.
Nevertheless, compared to the G4, it does seem that apple is making strides. Congratulations on that.
Though dual processor benchmarks are not presented in detail here, it is worth noting that the G5 system benchmarked at 498 MFLOPS and 0.125 MFLOPS/MHz for scalar Jet3D performance when two processors were used.
That was the above poster's point. Mkay?
</karma burn>
StrategyTalk.com, PC Game Forums
And you should try to use your eyes and read the article, even if they are of the hopeless kind.
Compare this to your crappy $900 PC. End of discussion.
I'm glad I use Macs.
I guess the NASA won't care about the price difference, and will care a lot about performance/herz (you know, mission critical CPUs that would rather run at slower clocks due to radiation issues).
I don't know about you, but if a P4 at 2.66Ghz is cheaper and better (even marginally) than a G5 at 2Ghz, I won't give a rat's ass about the clock speed.
Better scalability? maybe. Just show me the whole computer's performance and price tag.
Talk about performance/price and stop the rhetoric.
Someone please explain to me how 5177 MFLOPS and ~300 MFLOPS are even comparable.
They're not, which is what makes this whole benchmark so entirely useless.
Look at it: The conclusion, basically, is that there's no point in running CFD code using scalar FP. So why didn't they port their code to SSE2? P4's, and particularly the new 800MHz FSB P4 get data through SSE2 code like there's no tomorrow.
Nah, I'll listen when someone compares SSE2 and AltiVec properly. Until then it's just more blah. Don't get me wrong, I'm rapidly turning into the biggest Mac fanboy you've ever seen (Cocoa, since you ask) but the G5's are not the quantum leap Apple are making them out to be. Back in contention? Sure, but I promise you a dual Opteron 2GHz will blow the doors off a dual G5.
Dave
I write a blog now, you should be afraid.
"The code used was developed on a G4, and the vector tests use Altivec, so these results are what you can expect for an optmized application."
They give both the optimized and the unoptimized results.
Integrate Keynote and LaTeX
you should change the theme pic from the old g4 to the new g5
What are you talking about? Are you looking at the same graphs I am? A midrange p4 beat the highest end g5 available. How can you say that the g5 'won'?
autopr0n is like, down and stuff.
Actually they did mention SSE2, and weirdly enough, said it made things slower, and so they didn't use it
Well this story really makes me nervous. After taking in the comments from fellow /. readers, I am now convinced that Apple has conspired with the NASA Langley Research Center to make the G5 CPU look fast. The real clever bit was how they only used one CPU, and and unoptimized compiler, just to make it look "authentic" I suppose. Clever bastards. But they'll never fool me. They have my crappy ass Wintel box when they pry it from my numb, pale fingers.
... that the p4 is still faster! :P
Even if Apple makes those lofty goals, they're still going to be competing with 4ghz+ enhanced tejas Intel cpus by that time (have you seen the low capacitence packaging Intel is introducing to allow higher FSBs?).
http://www.hexus.net/feature.php?feature=587
With the probable ease of manufacturing dual cpu Athlon 64's due to the low trace count HT connections, AMD is going to be able to offer consumer priced multiprocessing, coupled with PCI express video cards, SATA, iSCSI etc. Apple has a lot of technologies to worry about bringing to their platform besides cranking clock rate.
Apple claims 15.7 for the Dual 2GHz G5, and the 3GHz P4 getting an 8.07. NASA gives the Dual 2GHz G5 498MFLOPS and the 2.66GHz P4 255MFLOPS.
If you use your math skills: 15.7 / 8.07 about equals 498 / 255. So therefore we can draw the conclusion that they have similar results.
Now, NASA only used a 2.66MHz P4 while Apple used a 3GHz P4. Although remember NASA's figure that the P4 had 0.096 MFLOPS/MHz? Give the P4 333 more MHz, and you find it has about 286.968 MFLOPS. NASA also suggests a 20% performance increase can be expected with compilers that take advantage of the G5.
Although, even without this increase Apple's benchmark and NASA's benchmarks are very close. Which would lead one to draw the conclusion that Apple's benchmarks were in fact valid.
I should also note that a P4 would not perform as well in a dual system as the G5 does. So your 500 MFLOPS number is a little rediculous. The G5 which is an amazing dual proc chip saw it's 254 MFLOPS for a single processor (508 when doubled) drop to 498 MFLOPS in a dual system. And the P4 isn't designed for a dual system, doesn't support HyperTransport, etc.
Dak
Who on earth would spend thousands and thousands of dolars on high-end hardware and then not even bother to use the compiler's optimization flags? Sheesh.
autopr0n is like, down and stuff.
> I heard that Mac OS is based on FreeBSD which is a distro of Linux right?
BSDi4.4lite --> FreeBSD
minix --> linux
You are aware of this system where people make comments, and then other people can comment on their comments. My reply was to the comment above, not the article.
autopr0n is like, down and stuff.
NASA's study found the Dual 2GHz Power Mac G5 to score 498 MFLOPS for their Jet3D performance. A P4 running at 2.66GHz scored 255 MFLOPS: a 195.3% performance advantage for the G5 in this test. If we assume a direct correlation between MHz and MFLOPS for the P4 (which would actually overstate the performance of the P4) and increase the P4's score by 12.782% this would give the 3GHz P4 a score of 287.594 MFLOPS. This is still a 173.16% performance advantage for the G5, and NASA states that a 20% increase in performance for the G5 would be reasonable "when G5-aware compiler tools become available."
etc. Please try not to jump into discussions without knowing what people are talking about.
autopr0n is like, down and stuff.
Real Translation: 0.4% slower, at 75% of the clock speed.
Real translation fastest G5 0.4% then mid-range p4. Don't forget it was you Mac zealots who screamed 'clock speed doesn't matter'! You could probably make custom hardware that would chew through this application at 100mhz if you really wanted to.
autopr0n is like, down and stuff.
Tell you what, let's split up Intel's market share 2:1 Athlon:PPC, how's that for fair?
Some applications are going to do better with the G5, no doubt others will be speedier on the Athlon. From what I can gather, over the next 12 months, the G5 will go from 2Ghz->3Ghz, Intel will go from 3.2Ghz->3.6Ghz but I don't know what the Athlon's are roadmapped for. If the 2Ghz is competitive with today's Athlon and the roadmap (IBM's roadmap this time, not Motorola's) has the G5 ramping up speed significantly faster than the Athlon, it makes sense for people in a certain part of their upgrade cycle to go to G5 even though today's Athlon might be faster.
it seems from that information the g5 had a lot more l1 and l2 cache. perhaps that's responsible for a good deal of the speed increase over the p4?
Single CPU scalar benchmark
P4 2.66 Ghz --> 255 MFLOPS
G5 2.00 Ghz --> 254 MFLOPS
Then the guy basically switches on the part of his code (developed on a G4) that use Altivec (Apples SIMD instruction/pipeline), and wammo, reasonably good MFLOPS numbers (2755 MFLOPS) ensue (any modern processor can get over 1000 MFLOPS with suitably written code).
This guys code is parallelized (nice), so with the 2nd processor on he gets more done, hence 5177 MFLOPS.
But there isn't a intel or AMD comparison for the vector tests, cos his code doesn't seem to have been written to use their SIMD instruction sets. The Intel and AMD pundits claim their SIMD architecture is superior to Apple's (e.g. see discussion in this article).
As the author of these benchmarks himself states
Based on an extrapolation of current P4 results, the 2GHz G5 would lag newly announced 3.2GHz P4 systems in Jet3D scalar floating point performance by about 20%, but this kind of comparison is best deferred until G5-aware compiler tools become available (since a 20% performance gain is well within the potential of compiler optimization).
Finally, it is important to note that the current test does not factor machine cost or intended use into the picture, and that can have a large impact, especially in clustering applications.
I.e. a G5 is probably comparable to a top-end desktop 32bit PC for joe blogs applications once the small changes from G4-to-G5 specific optimizations take effect. A comparison of suitably SIMD-aware number crunching applications is still needed to evaluate absolute speed. Processor power per unit dollar cost is another issue that the author hints at, which is probably not in Apple's favor.
The unit MFLOPS/MHz is a little weird. Let's simplify.
/ /
/ MFLOP \
| ----- |
\ S
-----------
/ MCYCLE \
| ------ |
\ S
Multiply by the reciprocal...
MFLOP S
----- * ------
S MCYCLE
Millions cancel, seconds cancel...
FLOP
-----
CYCLE
So it seems that this unit is equal to 1 floating point operation per CPU cycle. That makes a little bit more sense as a unit.
For those who hate to wade through...
The G5 is about equal to a P4 at the same clock speed.
Unfortunately, P4's have much higher clock speed.
I think the thing that most people on /. seem to keep missing is this: MacOS X and Linux both use GCC as their primary compiler. The Linux kernel is compiled with GCC, as is Darwin. Most software for each platform is compiled with GCC.
Now, with all these Linux-heads around here insisting that Linux is faster than Windows on x86, you'd think GCC for x86 might be a good compiler. Certainly the SPEC tests Apple (and Veritest) did with GCC on the G5 with OS X and the dual Xeon Dell with Red Hat had to have been a valid comparison between those two situations.
I also keep seeing all these comparisons to Dell computers without full specs of the Dell. The base configurations for the PowerMac G5 is positively loaded. How many $500 Dells come with Gigabit Ethernet? How many have the same level of engineering into the thermal managment?
Only time will tell for sure. In the mean time, remember that IBM will be producing blade systems with the 970. We'll get a chance to compare those as well eventually.
--
The internet is the greatest source of biased information in the history of mankind.
I found it interesting that the G4 and P4 have similar MFLOP/MHz. This argues that (at least for this kind of test) the G4 is not more efficient per clock cycle than the P4, as some have tried to argue. On the other hand, it looks like the G5 is.
This is also a useful metric if you want to extrapolate performance of anticipated systems with higher clock speeds.
I recently got the chance to do a testrun, doing some airflow simulation on a G5 1.8GHz demo machine, and with altivec optimizations it clocked in at roughly 2100MFLOPS average for 5 runs(I could probably get better results with a better compiler though), while the dual Opteron 1.8(which the place where I did the testrun has bought 10 boxes of for their renderfarm), running Suse Linux, and my program re-compiled for x86-64 and SSE2 performed at about 2960MFLOPS average, but that could probably be improved with a better compiler too, but I had to use GCC at this time. Both machines had 4GB RAM btw.
what i'd like to see is some G5 versus AMD XP+ 1.5GHz to 3.2GHz reviews, since P4 seems to be a dog compared to P3.
No, let's make your "summary" a little more accurate. The 2Ghz G5 is *very* slightly slower than the older 2.66Ghz P4 -- when running unoptimized code on an unoptimized version of OS X.
As far as I'm concerned, all of this benchmarking is meaningless at this point in time. The G5 is simply too immature of a CPU to benchmark until the software catches up with it.
This same problem came up when Intel first released the Pentium 75Mhz CPUs. Nearly everyone said "This thing isn't even quite as fast as my 486 that costs a lot less!" This also happened when Intel first added "MMX" support to their chips, and when the PIII's first came out as successor to their PII.
Any time you believe the early benchmark results on a new line of CPU, all you find out is that you're not going to get immediate gratification as an "early adopter". So what? This is common knowledge.
Apple fans have every reason to celebrate with the G5's release. It gives Apple a whole new starting point on CPUs they can "ramp up" from, instead of struggling to squeeze some meager improvement from the maxxed-out G4 chip. If you want to use Apple's OS and Apple compatible software - your only option is an Apple computer. Ultimately, that's the key, and explains why some people (myself included!) shelled out thousands for a system (dual CPU G4) that doesn't outperform Intel's best offerings. I want to do things I can't do with anything Intel sells, such as run OS X.
...because it gets more miles to the gallon.
160GB Serial ATA, SuperDrive (CD-RW, DVD-R), Firewire 800, USB 2, ATI Radeon 9600 Pro...
Plus all the standard things that apple don't actually mention in their specs any more of course. 16 bit sound, gigabit ethernet, and so on.
And that's without the free software. iMovie, iCal, iPhoto, iDvd, etc., (which you may or may not want, so adjust value of system accordingly).
The thing with your comparision, is that you are comparing the G5 processor with the P4 processor. Now try comparing the G5 system to a PC system with a P4 with equivilant specs. Hard drive, DVD writer, latest firewire, gigabit ethernet, etc, and you soon see the PC cost more.
And that's without touting the advantages of Mac OS X and all the other points that have already been made in this thread...
This is Apples advantage and disadvantage. Because they won't sell you anything less than top quality in their pro line... And that costs!
People seem to miss the big picture some times!
a single 970@1.8Ghz scored 2100MFLOPS and a DUAL opteron@1.8Ghz scored 2960 !? I assume your code was multithreaded/multiprocess? Your's sounds like a much more interesting test than nasa's - could you give any more info?
Actually, if you would read the link I posted, and the sentence I mentioned below that where I added a CD-RW and a 120GB hard drive, I did configure a similar system for $919.
So you what the whole enchilada, eh?
Dell Dimension 4600
Pentium 4 2.66GHz
512MB RAM
200GB hard drive
4x DVD+RW/CD-RW
16x DVD-ROM
128MB ATI Radeon 9800 (which is higher-end than the video card in the $3000 Mac, and MUCH nicer than the one in this Mac)
XP Home
$1448 (And that's still including the free 15" flat-panel monitor. To those who said this monitor is worth "$50", please tell me where I can buy a 15" flat-panel that is even half as nice as this one for $50.)
Now, if you want to say that Macs are worth twice the price, that's fine. I don't necessarily agree, but that's a value judgement (opinion). But to argue that the systems aren't comparable -- not only are they comparable, but the PC is 2-3x cheaper with a monitor. This particular one also has a larger hard drive and a better video card, and it's still averaging half the cost of a Mac.
On a purely cost basis, the PC wins, hands down. Is the Mac worth twice as much? That's up to individual buyers to decide. I'm just pointing out the facts.
Simpli - Your source for San Jose dedicated servers and colocation!
"...a testament to the quality of the code several lines are still in use ..."
What percentage of the original lines?
Funny, I just ran this at the Dell site:
Dell Dimension XPS:
3.2GHz Pentium 4
200GB Ultra ATA - 7200rpm
DVD+RW/DVD+R/CD-RW
16x DVD-ROM
1024MB DDR400 SDRAM
Microsoft® Windows® XP Professional
Wordperfect
ATI Radeon 9800 pro
No Monitor
$2638
You're off by over $400 and you didn't include the DVD-ROM drive or the extra memory.
You're forgetting the G5 is dual CPU, these test are on single CPU mode.
(\_/)
(O.o) This is Bunny. (> <)
You can get dual-proc Xeon and Athlon PCs if you want.
autopr0n is like, down and stuff.
So this proves the Mac is an :-)
excellent computer/Mhz
I would like to know the price per Mhz, and
then the price of a Mflop in all systems.
Z.
By a whopping 0.4%, and with one of the G5's processors disabled. You can spin it any way you want, but the clear fact is that with the G5 Macs are competitive in CPU performance again. I don't see why this disturbs you so; competition is good.
How to solve most of our problems: 1.Lots of nuclear plants. 2.Cure aging.
If Apple had release at least a dozen G5's to various internet review sites that specialized on hardware, and specified which benchmarks they MUST include, none of this anger would be directed at Apple.
Instead, well see the true prejudice these so called "unbiased" hardware review sites have towards all things NOT within their interests.
I would have given the first box to amdzone.com. Then everyone could blast them for their subjective view on all things not AMD, and leave Apple alone, or even console them for being the victim.
They couldn't get a hold of a super-cheap Athlon XP or even a sub $1500 Opteron server, but they got a hold of an Apple G5 which isn't even available for sale???
Puh-lease... The gripe with the SPEC benchmarks was that Apples numbers for the competition were WAY below the OFFICIAL numbers, ot that Apples numbers for their own equipment was crap.
Jeeze...let's at least wait till these things are SHIPPING.
[RIAA] says its concern is artists. That's true, in just the sense that a cattle rancher is concerned about its cattle.
*BSD went belly-up in 1999. Mac OSX is actaully based on VMS.
One future, two choices. Oppose them or let them destroy us.
The Jet 3d website was /.'d ....
It's Christmas everyday with BitTorrent.
The g5 is 64 bit and can address a shitload of memory. Can we please see comparison against 64 bit apps ? Compare to t5he Itanium or Opteron please.
Apps were not optimized for the g5, in real world of 3d graphics, engineering, and science, optimized apps would be used.
It's unfair to apple to have people benchmark it without utilizing the full capabilites and feature set of the processor.
"...what you'll find is that when you buy that shiney new G5, it actually has this little piece of paper inside..."
Which is how it REALLY smokes the P4.
(tig)
"We do not inherit the land from our ancestors"
"We borrow it from our children"
Ignorance and prejudice and fear
Walk hand in hand
The Portland compiler traditionally underperforms the Intel compiler, and often gcc as well.
Does it hurt to hear them lying? Was this the only world you had?
Not as hard as Bill Gates' mom on an IBM exec.
Actually, you're both correct, and you're both missing something.
.18u, .15u, .13u, etc) means there's less time necessary to charge the capacitors, and thus speeds are increased. (This is only one aspect, and I'm no expert here).
Originally, the pipelining was segmented based on the I-Fetch, D-Fetch (register/etc), Exec, Reg-Write-Back, with expensive floating point doing with different timing considerations (externalized or delay-locking multi-stage execution). Then they started sub-dividing each of those stages (especially in CISC archetectures). Now its common to see 15 integer execution pipeline stages - either with shared resources, such that you can only have one divide occuring at any given time (early P-I, P-II, P-III), or with fully independent/concurrent resources (AMD's Athlon).
The addition of the pipelinable-stages between I-Fetch, D-Fetch, exec, and WB was somewhat trivial, because prior to pipelining, there were still seperate events on seperate clock-ticks with inter-stage latching. However, in CPU's with exec-stages that are pipelined, you are introducing additional latches that cause additional undesirable propagation delays.
So a 15 stage integer multiply unit (excluding fetch/WB) has 15 x [guestimating] 4 propagations of additional latency over a single-stage I-unit. If there are resource-based stage-interlocks, or worse data-dependencies, then the pipelining is useless and you're totally hit by the excess propagation delays.
Still, marketing being what it is these days, adding more stages means less propagations per stage, thus less worst-case propagation time, and thus higher clockability (all else being equal; temperature, etc).
The P4, however, compensated by double-clocking the core integer stages, so the number of advertised stages is somewhat misleading.
On a side note, due to the latching in pipelining, you're definately doing more total work for a given instruction. And more importantly, the designers have to think of totally different logic-algorithms to efficiently pipeline than to single-stage. My guess is that the pipelined versions will always be less efficient (especially considering that not all stages will fully utilize their allotted clock-time), and thus there's an additional loss.
Ok, so this supports your post, but here's the part about power/heat.
There are two types of transistors used in modern CPUs (everything past the Pentium). BiPolar and Field-Effect. The CMOS-FET refers to Complementary Metal-Oxide-Semiconductor Field-Effect-Transistor. This acts similarly to a capacitor in that there is a charge and discharge time with little waste current, and power dessipation is typical V=IR, Pwr = I^2 * R. The gate capacitor charge-time is the killer, and what limits switching speed (and thereby clock-speed). Shrinking the area of the capacitor (related to the micron-size stated,
There's another way of reducing switching speed.. Increasing the amount of current running through the wires that ultimately charge/discharge the gate-capacitors. FETs are poor amplifiers, but BiPolar (while more complex and harder to make small) are phenominal. In addition to their complexity, Bipolar also are power hogs. While a FET only consumes power while turning on or off, BiPolars are always on, consuming power (there is current bleeding from the switch). So what often happens is that designers sprinkle BJT's here and there to amplify the current (at the expense of cost/complexity and power-dessipation), and continue using FETs everywhere else.
The bigger and greater number of BJTs that are used, the faster some heavily loaded FET gates will charge and the quicker their switching time will be.
If you up the voltage on a CPU, you're enhancing the amplifier's ability to charge the capacitors and thus gives you more safety-room to up the external clock-speed.
Again, this deviates somewhat from my knowledge domain, but you can often merely co
-Michael
Plus or minus 3dB represents a doubling or halving of the power, respectively. However, quietness or loudness is a subjective quality. Most statistically normal humans seem to agree that plus or minus 10dB doubles or halves the apparrent loudness. Psychoaccoustics bears no relation to math or physics.
3dB is only a factor of 2 for power. It takes 10dB to get a factor of 2 on loudness/quietness.
Close, but no cigar. The G5 has a 16-stage integer pipeline.
16 bit sound
Considering my G4 PowerMac has 24bit audio, I think it is safe to say that the G5 have at least that too.
sin(6cos(r)+5A)
I think the operative questions are:
Is it easy to optimize for Altivec?
Is it easy to optimize for SSE2?
Suppose you have a year to complete a series of simulation runs. After spending a few months writing code, you discover the intel machine, using code compiled by icc is 20% faster than a G5 coupled with gcc.
Now, you could start your runs. Or you could spend another month or so hand vectorizing code for a tenfold increase in speed (at least on the altivec platform). Is it as easy to produce such speed gains by hand optimizing SSE2 code?
We'll only stop running benchmarks until Intel wins.
We'll only stop counting until Bush wins.
Buncha damn liberal hippies anyway...
What are you talking about?
Single CPU Jet3D Vector Benchmarks - MFLOPS/MHz
G4 1.25 -> 1.29
Single CPU Jet3D Scalar Benchmarks - MFLOPS/MHz
P4 2.60 -> 0.96
Sure, the G4 using the scalar benchmark is just a bit faster, but as soon as you use Altivec the G4 beats the P4 on a per clock basis soundly. That is what all those arguments are refering to. Thanks to Altivec the processor is more efficient.
sin(6cos(r)+5A)
The G5s uses dual processors more efficiently than pentium systems use dual processors. The load balancing tends to be more even, from what I can tell. In fact, the architecture was, to some extent, optimized FOR dual processors. Additionally, the G5 is designed to be a good floating point/vector processor.
====
Crudely Drawn Games
I could not find these benchmarks anywhere on NASA's site. The benchmarks are only posted on this persons personal home page.
Or could someone reply with a direct link to the NASA page instead of the guys ISP homepage.
Otherwise I can't take these benchmarks seriously at all.
http://www.kubuntu.org/
Big difference. screw the graphics if you want to crunch.
you could spend another month or so hand vectorizing code for a tenfold increase in speed
You could do, or you could spend a month being scared of it, then actually get it done in a couple of days (when push comes to shove), which is what I did.
I recently managed to get MMX code into some software I'm wirting using GCC instrinsics and, y'know, it wasn't that hard. The hard part is architecting the application as such that the data just happens to be hanging around in exactly the right structures in the first place.
Dave
I write a blog now, you should be afraid.
...if they are never hard pressed to sell their stuff, then why is Apple perinnially in the proverbial economic crapper? Not trying to troll, but it just shows that the lowest common denominator (or a broader range of product classes) will always win over catering in to an exclusive high end crowd.
How is a dual Athalon going to "trounce" a G5 when the 970's trounce a dual Xeon, which will trounce a dual Athalon? I wont argue that you can build a cheaper dual Athalon than buy a dual 970 from Apple, or that you might get more price/performance by going AMD, but trouncing the new Macs? ...bullshit.
The big deal with the Athalon will be the upcoming Athalon-64 which will likely compete more with the G5. From everything I've seen the Opteron is already much faster than the G5. Intel, I don't know what they are going to do. Certainly the P4 will continue to improve. I've heard some exciting things about upcoming chips but the Itanium clearly isn't aiming at the same market as Apple or AMD. (Well, some overlap, but I think Apple has more work to do to get the workstation/server market)
Damn liberal editors caving int those artsy-fartsy mac users again...
How dare you print a story that derides the paragon of the IT hardware industry, Intel, as being second best to a processor installed in a machine maniufactured by those long-haired elitist at Apple.
And to think that you would accept the word from those commie "scientist" down at NASA, who sit around doing nothing all day except "thinking" and playing with thier toys at taxpayer expense.
Shame, shame, next thing youl be tryin' to convince me that Linux OS is more secure than the hallowed work that our hero in capitolism, Bill Gates, does at microsoft. It's bad for the economy, I tell you. All these long haired, smelly, weirdos keep messing with our good old American way of life. What, you say you want a choice? You have a choice! You'll choose from what choices we here at the Central Office tell you to choose from!
Take your Open Source, MacIntosh, OS-X (You don't think I know whaty OS-X really means, do you, but I do, smarty pants, I do indeed..), and whatever else it is you keep trying to give us and go back to Russia, or wherever it is that you came from. Who invited you to our red-blooded American industral party anyway? Nobody! That's who!
Why you pnko, beatnik, hippy, givin' it all away, don't think you need the establishment faggo...
and so on,
and so on,
and so...
Read, L
What is the difference between a PC and a workstation? I think its pretty clear the term workstation as a distinguishing label died long ago, some time after "PC" performance on the desktop destroyed "workstation" performance.
I they wanted they could *easily* have got one.
How? (you might ask)
If someone from NASA had sent an email to marketing@amd.com; basically asking*:
Howdy!
I work at NASA and do some testing of some new hardware, among it the new G5 Powermac.
Could i borrow a Opteron for some testing?
AMD would have sent it on the day.
*Or a phonecall to any of these numbers.
Disclaimer: I don't work for AMD or any other related companies
Melius mori in libertate quam vivere in servitute.
Wrong Conclusion!
According to this pdf (page 13) G5 @ 1.8GHz has 1051 SpecFP.
At the same time Power4 @ 1.7GHz has 1598 SpecFP !!!
It is very clear that Power 970 (G5) is much-much slower in floating point than it's Power4.
Also if you had a previous version of windows then you simply purchase the upgrade which is around $200 and if you find some decent dealers it can be had for a lot less.
:)
OS X costs around $100 for almost each upgrade. I started out with 10.0 which was extremely buggy. It came with my G4 which also had os 9. At the time i just used os 9. 10.1 was free i think and it should be as it's just some bug fixes. 10.2 costs another $120 something but it was the first really useable versions. Almost crash free for the first time. I haven't bothered with 10.3, yet... is it free
But the software cost really isn't big because if you go amd then you'll get huge savings over apple no matter what. XP would have to cost at least $500 for macs to be a better price value.
Hmmm... Pie...
Modded as offtopic because the Slashdot mods hate niggers!!!
I'm sorry if i seem a bit agitated but i'm getting tired of all this misinterpretation. If you read that article it basically says that the p4 2.66 ghz (old gen as the 3.0 ghz 800 mhz fsb stuff came out some time ago) is still beating a new chip that will come from apple. I think there were some tests in which apple won but most were when the dual cpus were turned on.
EVERYONE should know that by now that intel makes processors that do less per mhz but to reach a higher mhz level. Infact even p3's do more per mhz then p4's not to mention all of amd's cpu's since the original athlon. This benchmarks clearly shows that apple doesn't have a chance again the current 3.2 ghz p4 cpu's with 800 mhz buses. If you take overclocking into the picture (can't oc macs) then apples are just blown out of the water.
Besides in the next few months the biggest cpu launches from both amd and intel since a long time ago will happen.
Hmmm... Pie...
If you payed attention to any hardware comparisons of the latest macs (g4's verses p4's) then you'd notice that it was always getting trounced.
Macs are great but hardware performance isn't the reason.
Hmmm... Pie...
Do not forget, the G5 has 64 bit processing while everything else has 32 bit processing. You cannot draw the same comparisons. Wait until there is a 64 bit processing for the P4 or AMD processor, then compare.
Look at what nasa and what apples uses. Low end dells with slow pc2100 ram for comparison. NASA compared that with a product not even out yet. The PC you built for $3000 will beat the mac that won't be out for months. The 3.2 ghz p4 has the new 800 mhz fsb that the 2.66 ghz one compared in the article didn't. In reality by sept. we'll have the athlon64 and possibly intel's prescott. So no matter how you look at it, this benchmark is irrelevant unless and pc's are still cheaper.
Hmmm... Pie...
Although I know it was asked, but why aren't there AMD processors? What about (DEC, Compaq, HP, or whatever the hell it is now) Alphas? Sparcs? Also, why rate in MFLOPS/MHz? CpF is a lot more easier to understand. CpF is Cycles per Floating Point Opteration. As far as I know, the Itaniums are the most efficient, while running in native mode. Whatever the G5 has to offer, doesn't even compare.
Now, getting back to AMD processors, I think that they should have been included in there. Although, since the Athlons ARE more efficient in pretty much everythan that the Pentiums, they could have easly gotten high scores. Perhaps Langley doesn't want to show how equal, or worse even, the G5 is compared to a sub-100 dollar USD processor. No processor in existance compares in performance for price to Athlons.
Well, even though there is another, perhaps even credible to some, source for benchmarks, but I still say I don't trust them one bit. Call me old fashioned, or a stubborn 16 year old, but I don't trust any benchmark done by anyone. I like to test things before I buy them. Why doesn't anyone else? Why trust people that are funded by the companies that make the product they benchmark? Well, anyway...
This great article discusses the general specs of the CPU and it's architecture.
A really well written (but highly technical) article. As you can see, IBM designed it to beat Sun and Alpha (still alive at the time), not Intel. As far as Apple is "involved" in the development, it's in the latter stages of implementation rather than design itself.
Please note,
gcc + g++ for MacOS X on G5 platform = $0
Intel's C++ compiler for pentium based platforms = $399
prices
Now factor in Operating Systems prices, and general software.
MacOS X 10.2 $129 Windows XP Professional $143
iTunes $0 MusicMatch Jukebox Plus $19.99
AppleWorks $0 WindowsXP Office $297
Software Total: MacOS X: $129 WinXP: $459.99
(prices taken from www.newegg.com)
Error 407 - No creative sig found
But misses the point.
The P4 2.6ghz was about as fast as the G5 at 2ghz.
The trouble is, the 2.6ghz part they tested is not the fastest Intel chip in 2 respects, 1st, 3.2Ghz is the current speed 2nd, this has the old 533mhz FSB.
So the point is that Intel is reining supreme.
THe trouble for apple is three fold:
1) These G5 machines won't be available for several months
2) By the time they are, the Intel roadmap will be close to 4ghz
3) These G5 machines are just too damned expensive. The right price is $1700 for these machines on the low-end, with $2300 getting you a fully tricked out G5. Instead, the prices will probably be $1,000 more than these price points.
Meanwhile, You can get a fully tricked out Dell 8300 3.2ghz machine for around just under $2K shipped.
The only thing going for the Mac is OS X. But frankly, XP is okay to use, and if you want, you can run any flavor of Linux, BSD, or hell, if you're pathetic, you can run SCO.
I showed a photo of the new G5 to my dual 1.25 GHz G4, and said ...on your mark....get set....GO!!!, and neither of them moved an inch.
I had a stopwatch, start/finish line, champagne girls on the podium and everything.
I have to admit, after all the talk recently, I was pretty surprised at the results.
The G5 has requested another match, later in the summer, so we'll see if things come out differently then.
Seriously.
They included the MFLOPS/Mhz graph that makes the G5 look great, but they forgot the MOST IMPORTANT graph of all: MFLOPS/Dollar.
"Comparing el-cheapo Dell's to high-end Apple's is like comparing.."
Is like comparing a ferrari (dell) to a milk truck (apple).
Nobody except a few die-hard macs-heads will claim the 2Ghz G5 is anywhere close to the 3.2Ghz P4.
Plus, the Dells are cheaper.
Your price is a complete misstatement; you probably configured it with a 14 year warranty and the kennedy girls come to your house as hookers for an evening.
You can't get a dell desktop machine this expensive; they don't make one.
So we're back to you using a Mac because you like it: that's a very defensible statement. But careful with benchmarks, because the P4 kicks the PPC line's ass, and its only getting faster.
What you get inside the box are these three little coupons that can later be redeemed if and when Apple decides to have a discount for previous OS owners. If Apple decides to charge full for it, the coupons are useless (on rare occasions they're also used for certain rebates). I have some of them from my new iBook purchase, and they're essentially scrap because they're charging for Panther. (Not that I necessarily have a problem with that; I'm more than willing to pay for an OS upgrade of this magnitude [not to mention I get educational pricing])
Want Slashdot headlines on your site? Try SlashHead
And a 2.66GHz P4 barely edged out the 2GHz G5 in scalar floating-point operations, a task it isn't even particularly good at per-clock. If you read the great-grandparent, I think you'll notice that he mentioned the dual CPU score of the G5. So while you're reading the fucking article, you can consider reading the fucking comments, too.
Apple zealots, I have all the time in the world.
You have exactly 314 seconds to come up with a less retarded plot.
Apple already has FireWire 800, SATA, and PCI-X. I don't think Apple has any worries about bringing tech to its platform.
The reason why there are no RH 8 or 9 ISOs for PPC is that basically they are leaving development of the PPC branch of Red Hat to the makers of Yellow Dog Linux. Which, right now, is G5 ready and is basically RH9 for PPC. http://www.yellowdoglinux.com/
Knowledge is power. Knowledge shared is power multiplied.
Just want to share a story, There is once a father and his son who owns a donkey. They were walking towards a village with the son riding the donkey and the father towing it. When they entered the village, the people starts critizing the son for being a useless son. They decided to go into the village again later but this time the father is riding the donkey and the son tow them. In the village, the villagers start commenting that the father is a cruel father for letting the son tow the donkey while he rides it. They thought for a while and when into the village again; but this time both tow the donkey behind them. Guess what? The villagers call them stupid. Donkeys are to be ride. In the end, both of them carried the donkey. Think about it..
Are these 1 or 2 LSB errors or are the answers flat-out wrong? References please.
Contribute to civilization: ari.aynrand.org/donate
Yes, I know. I was trying to be humorous. As in, when the little piece o' paper inside the G5 catches fire etc etc. Guess I failed. Again. *sob sniff sniff sob*
(tig)
"We do not inherit the land from our ancestors"
"We borrow it from our children"
Ignorance and prejudice and fear
Walk hand in hand
Does anyone remember the dual-ppc that the BeOS people were originally selling? Anyway, you could turn off one of the processors but if you wanted to, you could turn them both off. I think they evenrtually removed this feature. =)
-
Insightful posts by people like you. I also came to the same conclusion. I'm Glad to see that some people can figure things out without it being presented to them on a silver platter.
I used to think that my brain was my favorite organ, but then I remebered who was telling me this. --Jack Handey.
Well.. maybe. Or Maybe not. But Definitely not sort of.
I'm talking about the non-vectorized benchmarks:
Numbers were 0.096 P4 vs 0.105 G4 MFLOP/MHz for the non-vector comparison
As stated in NASA study,
In other words, for the many uses that don't involve Altivec, the G4 and P4 actually are comparable on a MHz basis, despite the substantially different designsIs it just me, or do other people notice the growing trend of Intel really sucking hind tit? Sure, they are over 3Ghz, but AMD and now even Apple are sporting processors that are more than one thousand mhz slower, yet, over all, perform the same or better. I figure by the time Intel hits 4 or 5 Ghz, Via's Cyrix processor will be just about 1.8Ghz, but will have been worked to where it's performance will be on par with the Intel offering. Ol' Intel had better double their advertising budget, or even Dell will have to start offering non-intel systems.
By the way, no, I don't benchmark systems I use, as that, in my mind is like putting a car on a dynometer. I mean, who realy cares how much horsepower a machine has, if it's gearing is completely mis-calculated? Based upon actually using systems, I find that G3s seem very slow to use, G4s not bad, P4s a little better, and AMD Athlons wicked quick. Yes, all systems were slightly different clock speeds, but all had 256MB of RAM, except the P4, it has 512MB of RAM. In just normal usage, nothing I've come across can touch the Athlons for performance. However, I also do NOT do video editing, sound editing, etc. I just play a few games, do programming for school, the internet, and such. So no, I do NOT care about how fast something can open Photoshop, or if these systems can do real time video editing. I didn't build/purchase them for that.
There's my two cents, could I use that for a down payment on a new dual G5?
For those who describe their systems as 'boxen', do you order multiple 'boxen' of corn flakes also?
Will this give me porn a million times faster?
RTFM; please, I beg you.
Somebody help me with this. How much different should the speed numbers be if an application is written with 64 bit in mind and compiled on a 64 bit aware compiler versus a 32 bit version on a 32 bit compiler? Is it twice as fast? Would this be the real comparison between G5 and P4?
Would this invalidate the current discussions?
What is the point of comparing a 64 bit machine to a 32 bit machine at the same clock rate?
I wish even half of the people on this site posted as well as this. Really, really nice explanation; thanks for taking the time to explain it to us laymen.
I don't understand why the new G5 processor is constantly benchmarked against the P4 and Xeon processors. The G5 is a 64-bit processor and it damn well better be faster than any 32-bit processor. I would be interested to see benchmarks comparing it to an Opteron or Itaniam2. With the already mediocre results against the 32-bit processors I think that Apple will fall short this winter when Intel breaks 4GHz.
Is it as easy to produce such speed gains by hand optimizing SSE2 code?
Yes. Both gcc and the intel compiler supports intrinsic
operations of the same type as the vec_XXX() altivec extensions.
I've coded both SSE & Altivec. In general, Altivec is a "cleaner" vector instruction set for better and worse. If your data is perfectly aligned on 16-byte cache boundaries, if you always load 4 elements at a time, and traverse memory linearly it rocks. However, as soon as you need to do some manipulation of the data (e.g. if it isn't properly aligned, and the algorithm makes random data access necessary) SSE wins by a more flexible instruction set.
For the same reason, Altivec has higher theoretical throughput, but in most practical applications SSE does just as well.
Now, you could start your runs. Or you could spend another month or so hand vectorizing code for a tenfold increase in speed (at least on the altivec platform).
Ah, but in the real world programmers don't come free. Instead of doing something else while the machines are crunching you would spend one month of time on coding. And you didn't include the time necessary to learn altivec. Then you'll have to add the time necessary to document the changes, and maintaining a non-portable module in the code - each new programmer on the project will have to be trained on altivec.
In other words, for the many uses that don't involve Altivec, the G4 and P4 actually are comparable on a MHz basis, despite the substantially different designs.
Yes, and on a pure MHz basis, both the Z80, commodore-64 and motorola 68000 would be even better per MHz.
The whole idea with the P4 is that it currently runs at 3.2GHz, which is significantly higher than the 2GHz of the PPC970. If they are comparable per clock, the P4 is obviously a better chip (sadly).
Whoa, it can play the other four games for the Mac?? Hook me up!!
That's why I'd never buy a computer from Apple (or a high-end computer from Dell, apparently); I could buy or build more computer than I'd ever want for approximately $2,000 and it would be comparable to those systems (better in some ways and worse in others, most likely).
However, I've never spent more than $1,100 on a computer; in fact, lately I've spent less. And I've still been quite happy with what I've gotten for my money. For me, it's all about getting a computer that fits my needs, and maxing out that magical price/performance ratio. Therefore, my computer is invariably a non-Intel x86 built from parts from (or by) my local generic computer shop.
I actually considered getting a Mac when there were PowerPC clone vendors; I thought it was very cool that Apple was finally opening up its hardware. But even the clones were pricey, and lo and behold Apple squashed them so fast that it didn't even matter. Also, Rhapsody didn't deliver on its promises at the time, so I'm glad I didn't get stuck with an old, overpriced G3, and I'm certainly not going to end up with a new, overpriced G4 or G5.
But hey, no hard feelings; I'm just not in their target market of suckers with too much cash lying around. If I had that much cash and I wanted a PowerPC of some sort, I'd probably look into getting something from IBM instead.
pb Reply or e-mail; don't vaguely moderate.
These prices apply if you're doing your own development; the issue most people are raising has to do with the operating system and off-the-shelf applications which are compiled with whatever the developer happened to be using. For Mac OS X, that's often gcc. For Windows, that may be Intel's compiler. How much money it costs the developer really isn't at issue here.
By the way, AppleWorks is not included with the G5 as far as I know (it's only bundled with consumer Macs, not pro Macs); it retails for $79. In my opinion, AppleWorks is really not that good, compared to MSOffice, so if you're comparing MS Office to MS Office, you should notice that MSOffice for Mac is $499.95, or Word and Excel are $369.95 each.
$x='S24;r)>63/* h@<5+oZ)32"5cz';$me='phroggy'x$];
$x=~y+ -xz+\0-Tx+;print$_^chop$me for split'',$x;
These benchmarks prove once again that while high-end RISC CPUs outperform commodity CISC CPUs per clock cycle, it really doesn't matter, because the CISC CPU runs so many more clock cycles that the CISC CPUs can blow away the RISC CPUs.
;)
Some people will point out that the Intel chip barely beat out the G5. But the intel chip is available *now*, while the G5 won't actually be shipping in quantity until late August, at which point the intel CPU will have dropped in price and been replaced by faster models.
Of course, I would still rather have a new Apple over any more Lintel boxes
Just to point a few things out:
1) The Dell has a DVD+R
2) 120G HD's are worth about $80 these days.
3) Monitor is a monitor. Too bad if Apple charges too much
4) The Dell includes video editing software
5) Since the Dell was configured with the 9800, it too is dual capable.
As to your use of "ROI", you apparently have no idea what that means. Why don't you misuse TCO to cap off a completely wrong post?
Finally, the movie was just awful. Like "Poke my eyes out with a stick" awful.
"Why not show a "just works"/$ chart?"
When backed into a corner, Mac fanatics always throw out Mac promotional ditties acting as if they're kryptonite to the Superman that is the Intel P4.
Just works? Yeah as long as you just work the Mac way.
I'd rather have a choice.
Dude. Think Different.
I can't believe so many people read the benchmarks and concluded the G4/G5 is faster than the P4.
At best, it shows the latest chip from IBM is equivalent to a year-old P4 without the latest FSB improvements.
At BEST its equal.
The trouble is, the P4 is already significantly faster in clock speed, and worse for apple, the P4 core is designed to be scaled up with faster clock speeds very rapidly.
You're like a drowning man who sees a floating board drift by and you think you're seeing the QE2.
Enjoy your Mac for the OS, but please stay out of the performance area. Apple gets spanked there.
And yes, I AM a mac user. I've got 6 of them in my house that I and my family use all the time.
"They don't care if it is the absolute fastest or the most/least expensive"
Every Apple fan-boy thinks this benchmark is the bomb without understanding what its really saying and what it implies.
It means Apple is already behind before they've even released the new 970 CPU.
So in conclusion, Apple fans are desperate for something other than "it just works" as a way to prove their PC is better than an Intel based PC.
And yes, Apple fan-boys are *happy* their machines cost more; its their red badge of courage.
"G5 Macs are competitive in CPU performance again."
Against a 2.6 P4?
The latest CPU's are 3.2 with a 33% faster FSB.
It means a Mac not yet available is probably 1/3 slower than a currently shipping Dell/Gateway/HP/No-name brand.
And to top it off, it costs 100-200% more!
Is this how Mac people "think different"?
CPU for CPU, P4's are 33% faster, and the clock speeds are rising RAPIDLY on the P4's.
Meanwhile, the 970 will have to go to a new smaller process to get the clock speeds up; something IBM may not be willing to do just to support Apple.
No, while this is good news that apple is in the ball-park, the Intel juggernaut is moving ahead, and IBM will not be upgrading nearly as quickly.
Better bring up the wonderful new service pack "10.3" that we'll be charged another $130.
"On top of that, we have no serial numbers, no product activation, and the install discs no matter which apple machine they were bundled with will install on any other apple machine (so long as that particular OS revision runs on the hardware)."
Sure, but nothing all that appealing in software.
Hate windows all you want, but there's lots of good software out there for Windows.
Oh wait...someday Quark and Photoshop will be OS X native. I'm sure.
"I'm glad I use Macs."
Mostly everybody is glad you use a Mac. Well, except for Apple, that is.
I see you're using TCO in a way that is new and interesting.
As far as the games you play, I'm sure it can play all of them for the Mac... there aren't that many out there.
Peace of mind with a Mac? Presumably this conversation does not involve OS 9, perhaps the worst modern operating system ever foisted on users.
Thank god its gone.
"The difference in speed mHz, RAM etc. is irrelevant for a person just getting used to computers and the net"
That's a nice rationalization for paying more for less.
It doesn't explain why Apples cost too much, but it does make you feel better about it.
This is a great marketing term to hook you into paying 100% too much.
//WORKSTATION\\"
"This Mac costs a lot of money"
"No, its cheaper than a PC"
"Really? How do you figure? An equivalent PC is 50% less!"
"Yes, but those are PC's. The Mac is a
"Oh, in that case, you're right. In fact, they should charge MORE"
and filthy jews too!
note to moderators: this is NOT a troll!
yes, but the article tells us that the G5 does more work per cycle than an equivalent P4. However, considering that the 2GHz (top of the line) G5 does less work than the 2.66GHz (yesterday's news) P4, in terms of raw MFLOPS (according to the article) your point is completely invalid. The fastest P4 is over 3GHz. The fastest G5 costs more and doesn't do as much work per second. Obviously the P4 is the faster chip, and the greater efficiency of the G5 has nothing to do with this argument.
So, in this case, megahertz actually *do* matter, since the only difference between the 2.66-P4 and the 3.2-P4 is clock speed. The factor that kills the G5 in the real world competition actually *is* clock speed.
Oh, and RTFA.
I'm running 450 mhz on my desktop, 600mhz on my laptop. When I upgrade, sure, I'll be able to run some of the latest and greatest software (not). But the most productive software I use is 99% command-line tools. And those tools worked fine when I was running 60mhz. No matter what speed of hardware I use, I'll still be making $120,000+ a year programming. The speed nowadays is really irrelevant. My computer wastes more clock cycles than I actually need.
the article tells us that the G5 does more work per cycle than an equivalent P4. However, considering that the 2GHz (top of the line) G5 does less work than the 2.66GHz (yesterday's news) P4, in terms of raw MFLOPS, (according to the article) your argument is invalid and misleading. The fastest P4 is over 3GHz. The fastest G5 doesn't do as much work per second. The G5 does more work per cycle, which is completely different. The P4 is the faster chip, and the greater efficiency of the G5 has nothing to do with this argument because the P4 ends up doing MORE WORK per second.
So, in this case, megahertz actually *do* matter, since the only difference between the 2.66-P4 and the 3.2-P4 is clock speed. The factor that kills the G5 in the real world competition actually *is* clock speed. The analogy would be better stated by saying that increasing the MHz is like boring out the engine; it increases max output up to a point.
Oh, and RTFA.
G5 is a small step for a computing but quantum leap for Apple.
Emacs is good operating system, but it has one flaw: Its text editor could be better.
Please try reading the summary at least if not the article.
Romans 8:38-39
Why doesn't my Bible have this? Sounds like a very practical translation of the scriptures!
Actually it's Apple's G5 too. G5 stands for generation five of Apple's Power PC architecture. The machine they tested was built by Apple. These test test not only the processors but the various busses and IO chips at least two of which were designed totally by Apple (The Memory controller & PCI-X bridge).
IBM plays a huge part but you can't run Nasa's software on a 970 alone. You need the whole platform. (Imagine if the 970 was still tied to a 180Mhz bus)
The real test I expect is MFLOPS/$ where $ is a retail price of the desktop being tested.
Less is more !
I remember how Apple was ridiculed when they said their G4s were "super computers". But in fact they were right. There was a government standard for those and they met that. Compared to the ENIAC any old 386 box would pass as a supercomputer. And compared to my iMac DV the G5 very well goes as workstation. But we have to see that standards shift. And for a workstation it just misses features. I mean, come on: No second 8X AGP slot, you must be kiddin' me... ;-P
SSE2 has the same limitation with data having to be aligned in 16-byte segments. And, as stated in the Intel developer docs when discussing the intrinsics: "Some of these operations may require several instructions", especially when talking about the array instructions. Ooops, there goes the whole SIMD principle..... Also, working with arrays in SSE2 is pure hell, compared to Altivec, and learning Altivec is easier than learning SSE2, at least how to use it properly.
And the VAST tools are good too.
::Smacks head off table:: Doh. Must have missed that bit. My bad.
I missed the bit in the article where it talks about dual processors. That's what I get for doing everything in a hurry. My bad. Sorry. Guess I was a little overzealous in trying to stamp out the usual FUD against Apple which was a little unjustified on my part in this case.
Apple is a piece of shit. They lie and attract know nothing zealots for the most part.
And Bush doesnt fucking care what you think. Politicizing propagandist is what you are.
And if you are so fucking obsessed with Bush, vote him out asshole, dont bitch and piss about the irreversible. Too bad Wesley Clark doesnt run for the crats instead of the fucking morons they have now.
Yes, my program is multi-threaded.
The dataset I ran the test on was about 500MB in size, thus easily fitting into the RAM.
The simulation does two major things: It gives a simple visualization that it outputs to high-resolution but graphically simple pictures, for easy analysis. The second is that it dumps all the stuff to a log-file, saving coordinates, motion curves, dynamics states etc, that can be imported into Maya for prettier visualization.
Thus it's fairly dependant upon HD too, which is one place where the speed problems could be, since they are using MSI motherboards(bleurgh).
Another thing is the compiler, GCC. I compiled for x86-64 with the switches -m64 -O2 -msse2 -mpreferred-stack-boundary=4(Which sets it to 16 bytes which SSE/SSE2 needs) -mno-push-args -maccumulate-outgoing-args -mtthreads, but I don't think GCC properly read the stack boundary size. The Opterons should be faster than the result I got.
For the G5, the most important switches I used were -mpowerpc -O2 -maltivec -mnew-mnemonics -mabi=altivec.
Each increase of 3 dB is a doubling in volume. Every 10 dB increase means the sound is 10x louder.
;-)
That means the G5s are 1/10 as loud, not half as loud.
Back to Physics 101 for you.
BOO HOO HOO! But Macs really aren't overpriced pieces of shit! See, look at this benchmerk!
You're quite wrong. MFLOPS/MHz (the "H" is always captalized in honor of Mr. Hertz) can be used to project performance.
For instance, NASA tested a 2.66 GHz P4 system. By using MFLOPS/MHz, they can estimate the performance of a 3.06 or 3.2 GHz P4 system.
Likewise, IBM has promised a 3 GHz G5 within 12 months. The MFLOPS/MHz metric allows you to project the potential performance of such a system as well.
That is why the NASA test doesn't simply look at raw performance. They want some idea where things are going as the G5 gets faster and Intel tries to eke another 0.167 GHz out for a "next generation" Pentium4. (How long did it take to jump from 3.06 to 3.2 GHz?)
I respectfully disagree with ciroknight. There have been times when Apple has had the hardware to take on the Intel world.
The Mac 512K of 1984 at 8 MHz held its own against the 6 MHz IBM AT. Too bad it didn't have a decent hard drive.
The 40 MHz Mac IIfx outperformed the 33 MHz 386 and 20 MHz 486 systems of its era (1990), especially with Apple's accelerated video card.
MHz for MHz, the better Quadras ran circles around Windows PCs (back in the Windows 3.1 era).
In 1994 Apple went PowerPC, which didn't result in an major immediate improvements. Then again, they didn't have the Pentium math bug, which was probably responsible for the biggest CPU recall in computer history.
When the Power Mac G3 was introduced, it was the most powerful CPU on any desktop system. Remember the snail ads? The bunny suits? The BYTEmark results -- and BYTE magazine getting gobbled up by a pro-Windows company and the magazine being discontinued?
The G4 was a breakthrough as well, although the folks at Motorola seemed to believe that Moore's Law was an unatainable goal.
And now the G5 offers more power. About 25% (ballpark) more than the G4, MHz for MHz. About 25-50% more than the P4, MHz for MHz (we'll know a whole lot more when production units ship with a full release version of OS X 10.2.7 -- the NASA and other tests are all using prerelease hardware and a prerelease OS).
Not only has Apple has Intel-beating performance several times in its history, but the original Mac OS was a far more efficient OS than Microsoft's bloated consumer Windows releases (3.1, 95, 98, Me).
Unfortunately, OS X trades a lot of that efficiency for stability and eye candy, making it harder for modern Macs to feel perky compared with the old Mac OS or 2-3 GHz Windows systems. The fast G4s helped, but the G5s are going to take the OS Xperience to the next level.
On top of that, Macs are simply more productive than Unix workstations. You can run high end software, but also AppleWorks, Microsoft Office, SimCity 4, Mathematica, and a host of other apps not available for Solaris, Linux, BSD, and the like.
so you use the EDU shit BFD. I know you felt other boys in the boy scouts. You felt the crotch, You FELT them. Rubbed the penal region.
you freak. you know where to look when you want to get some boy meat. you just gotta know how to seduce them with gi joes, tonkas and candy? isnt that right?
Sure, along with a 10 page NDA requring not to publish unfavorable benchamrk results.
US-UK-Israel: The real Axis of Evil
Project Builder and Interface Builder and all Apple programming tools = $0. MS Visual Studio .NET Enterprise > $3000.
OS X - FreeBSD 3-ish userland which is available as Darwin, a hybrid micro/monolithic kernel related to mach and abstracted from NeXT/Rhapsody. The UI and many pieces of the OS are still proprietary, but the underpinnings, Darwin, is basically an 'inferior' "free OS" (WRT Linux, FreeBSD)
Services range from Apache to OpenSSH and other GNUish and BSD licensed software.
NT is mode closely related to VMS (Dave Cutler/DEC).
*BSD is doing just fine. I suggest you take a look at FreeBSD 5.1. BSD is UNdying.
I asked this myself earlier today and hit pricewatch. I came up with $1800 for an E7505 based dual Xeon box in an aluminum case that would be the x86 equivalent of Apple's flagship G5 at $3,300 (I had to bump the stock video card up as I couldn't find 64MB 9600s on pricewatch). Run the numbers yourself. What you believe about the lower expense of high end Macs just isn't true.
chip except a larger memory address space.
Really? It seems unlikely to me that cpus without any floating point processors at all would have higher MFLOPS/MHz than a P4 or G4. Can you direct me to some benchmarks that show this?
This is totally OT, since the FA was about benchmarks done by NASA, not by Apple and not by IBM and not by Intel.
You want to know what a Mac is really worth? Buy any Mac Laptop or Desktop and buy any Brand PC Lapptop or desktop. Use your machine for three years. Then put it up for sale on ebay. See which one fetches a higher percentage of the origional price.
It's called an investment. Case closed.
Intel-based PCs have all these things.
Yes, even Firewire 800 - the PCI card from Swann, FirewireDirect and others has been available for a while.
SATA still isn't available from Apple. Not until they ship it, and it has been in PCs oh, 6 months?
This is coming from an organisation that can't land a shuttle... NASA, Need Another Seven Astronauts... Although I use a mac :)
Hear here. I gotta say, 512KB of L2 on a 0.13u, 64-bit CPU is some sort of sick joke...
Workstation and PC aren't meaningful terms... they're marketing terms.
So saying "this is a workstation" is like claiming "my PC thinks different". Its gibberish. Nonsense. Dumb stuff.
oh and you added the OS and warranty and everything to that PCs price too?
Elegy For *BSD
I am a *BSD user
and I try hard to be brave
That is a tall order
*BSD's foot is in the grave.
I tap at my toy keyboard
and whistle a happy tune
but keeping happy's so hard,
*BSD died so soon.
Each day I wake and softly sob
Nightfall finds me crying
Not only am I a zit faced slob
but *BSD is dying.
Elegy For *BSD
I am a *BSD user
and I try hard to be brave
That is a tall order
*BSD's foot is in the grave.
I tap at my toy keyboard
and whistle a happy tune
but keeping happy's so hard,
*BSD died so soon.
Each day I wake and softly sob
Nightfall finds me crying
Not only am I a zit faced slob
but *BSD is dying.
This exact comment has already been posted. Try to be more original...
I can't believe some people. First they say Apple need better hardware... then Apple comes out and delieves on that... and now they're yelling "it can't be true - it just can't". Give me a break.
Ok it's better than a lot of stuff. But a few questions
Who's idiotic idea was it to make it syncrynous? (as aposed to asyncrinous), why use silocon instead of optical cuplers?
And who thought: Gee instead of hacing a futuristic LOOKING computer lets ask RubberMaid to make the damn think look like a trash bin.
"You do not go to CompUSA and buy a digital Alpha server"
YOu do not go to CompUSA and buy G5's yet, either.
"I'm more than willing to pay for an OS upgrade of this magnitude [not to mention I get educational pricing])"
Apple throws in a few features, and you pay them $130. Microsoft throws out a bunch of new features for free, and they're evil.
(Note that I'm primarily a Mac user, but I'm tired of paying for service packs).
Paying $130 every year for OS X is turning out to be iMoronic.
I don't think it's fair to compare the system performance of the Mac with one of its processors off to the P4. The highest end Mac has two processors for a reason.
"No!" you cry. "That's not a fair comparison! The P4 only has one processor, so to make the benchmark equal, we have to make sure the systems are on equal footing!"
Well, too bad. It's a system limitation that Intel put in. If I buy a TOP END system, I want to see top end performance. The fact that the P4 was so (badly) designed isn't my problem, or Apple's. Their top end machine spanks ANY P4 system you can put to the test because they decided that a good way to make the system faster would be to make sure it supported dual processors.
Now, if you're stricly comparing PPC970s to P4s, and you want strict PROCESSOR benchmarks, then fine, just test the processor. However, even results like this benefit from the architecture of the system. A fast, data starved processor is useless.
I'm uninterested in raw CPU results. I can't do anything with JUST a CPU on my desk. I need a whole system.
This is the highest performing, lowest cost Dell that anyone posted to this discussion:
Dell Dimension 8300: (same as XPS except not blue)
3.2GHz Pentium 4
200GB Ultra ATA - 7200rpm
DVD+RW/DVD+R/CD-RW
512MB DDR400 SDRAM
Microsoft® Windows® XP Professional
Wordperfect
ATI Radeon 9800 pro
No Monitor
$2239
This is the base 2GHz Apple. So, the cheapest, highest performing Apple.
$2,999.00
Dual 2GHz PowerPC G5
1GHz frontside bus
512K L2 cache/processor
512MB DDR400 128-bit SDRAM
Expandable to 8GB SDRAM
160GB Serial ATA
SuperDrive
Three PCI-X Slots
ATI Radeon 9600 Pro
64MB DDR video memory
56K internal modem
And that doesn't list the computer controlled cooling or expensive case, the fact that the Apple is quiet, the gigabit ethernet, the optical digital audio and analogue audio in and out, or any of the engineering that went into making all of those things fit together. For only $700 more, you get an extra processor, superior performance, and an operating system that's actually worth using.
Remember, you get what you pay for...even with Apple. Dell puts togher fine PCs, and they're certainly cheaper. However, (for the first time in a while) the Mac is BETTER.
Apple might have shot itself in the proverbial foot with the nomenclature they chose for OS X.
People equate the "point one", "point two" names as small increases (0.1 of an increase in fact), not the names of the operating system. The small increase comes with the second point - the point releases.
10.1 is to Windows 95 as 10.2 is to Windows 98 (well, if we're being fair, 10.2 = Windows 3257, but there we go, I'm biased).
There were probably as many changes between Windows 95 and 98 as there were between 10.1 and 10.2, and people weren't up in arms about having to pay for 98 were they?
Well spotted, it's an FP IPC (or CPI) test.
If you want high IPC (instructions per clock) then look at Alphas. DEC were the first company to hit 1.0 IPC mean on SPEC, and even had 2.0 IPC before anyone else. Having said that, now Intel has bought vast chunks of Alpha tech, the Itanic is pretty darn high IPC too.
YAW.
Your head of state is a corrupt weasel, I hope you're happy.
610 posts, and only one AC mentioned the cache size difference in the machines? Here are some interesting numbers:
Program use 1024 KB of memory -- that's unusally low.
G5 data cache:
hw.l1dcachesize = 32 KB
hw.l2cachesize = 512 KB
G4 cache:
hw.l1dcachesize = 32 KB
hw.l2cachesize = 256 KB
hw.l3cachesize = 2048 KB
P4:
cache size: 512 KB
It looks like the G4 should walk all over these other processors- the whole dataset fits in cache. One really interesting thing about the dual G5 is that each processor can access data in the other's cache... Since the 2nd processor was still installed, I wonder if its cache was still operating! If so, then this might boost the G5's cache to roughly the size of the data set.
HIV Crosses Species Barrier... into Muppets
Actually they did mention SSE2, and weirdly enough, said it made things slower, and so they didn't use it
It's pretty important to separate the intel or portland compiler's ability to autovectorize code for SSE2 and manually coding SSE2 code just like the altivec code...
Was that the 3.06GHz Xeon that Apple demonstrated, or a slower dual-Xeon system?
Does that include most of the hardware features of the G5? Gigabit Ethernet, FireWire, optical 5.1 audio, CD-RW/DVD-R drive?
A friend of mine ran the numbers on Pricewatch last week and didn't have the same results you did. I don't recall what he found, but for similar specs it wasn't cheaper than the G5.
$x='S24;r)>63/* h@<5+oZ)32"5cz';$me='phroggy'x$];
$x=~y+ -xz+\0-Tx+;print$_^chop$me for split'',$x;
You dumb fuck!
Have you ever even seen copper Gb to an Apple? I have, once, it was fast but Ive gotta be honest with you. that Gb connector really doesnt measure up, your argument is useless similar to you. Bottom line is this, if Apple wants to sell these in big numbers, they need to drop the prices. But they wont, they like their client base being the VW driving, Terrier walking yuppies. They like it a lot. And the other thing, personally I do not think they have the means to make enough of these machines if their market share jumped to say 10%. Suddenly they would be behind the 8-ball and quality would go down.
As far as thermal management, have you felt the back of any of the new G4s? Jesus it is like a hair dryer back there. Put 3 of these in a room together and you better have airconditioning.
I see a lot of people arguing, so here's my summary/explaination:
- A single G5/970 top of the line is probably 20% slower than P4 top of the line, on unoptimized software. By that I mean regular integer and floating point (no AltiVec), and with unoptimized compilers. The compilters may or may not improve significantly over time, so that shouldn't be afactor on release date, but may pan out later.
- The G5/970 is not designed/optimized to ship as a single CPU, instead the bus, etc. are designed for SMP of between 2 and 8 CPUs (might technically work with more, I don't know). So, you'll see that, in a dual CPU comparison that the G5/970 will scale better than the P4 (xeon) duals, narrowing the performance gap, and in some applications, putting the G5/970 on top.
- With software that is recoded to work with AltiVec, compared to SSE(2) optimized software, the AltiVec performs noticably better, but that depends on the application. When comparing AltiVec to regular (non-vector optimized code), there can be as much as an order of magnitude increase in speed, again depending on t6he application.
So, expect benchmarks to vary by a tremendous amount, depending on the number of CPUs, and how optimized for AltiVec the software is, and how mature the compiler that was used. Since a lot of software can be vectorised, I would expect software on the G5 to initially lag P4 performance, but then to dramatically speed up in some areas.
Also, expect a lot of people who don't grasp this to be getting their panties in a knot!
...will the G5 cruch SETI@home faster than an x86 based solution? Because you know I want to be the first to establish formal contact with the Zeta Riticulans, the Nordic Aliens, or the Time Lords... :)
"Right now, somewhere in this world, Scott Baio is plowing a woman he doesn't love," - Peter Griffin, *Family Guy*
Actually, I tried that a few years ago. And they don't even make good anchors.
I don't consider my self being a Terrier walking yuppie, though i have a mac.... I wouldn't buy a Toyota to drive in, if i could get an BMW... Given both cars would take me from PointA to PointB, one would do it with comfort and style.... I'll leave you to figure out which of the cars im talking about here... However its an incredible view you have, even for a AnonCoward, i must say...
I'll speak some about having actually used SSE and Altivec. I've used Apple's gcc altivec flags, and SSE (and SSE2, but since Altivec doesn't have 2x64bit FP vectors, it's unimportant) assembly.
Having written some code, and hacking it up to do 4x4 matrix multiplication, I set it up to multiply about a million times. I tested standard unvectorized floating point (on both a G4, and P4), altivec on a G4, and SSE on a P4.
My results? Altivec provided a nearly 4x speed boost (even though I was writing results out to memory nearly every single operation), and SSE performed horribly, running almost 2x as slow. I figured out that this was because I was using "movups" instead of "movaps" (unaligned, vs aligned) but I couldn't get aligned moves to work, because GCC wouldn't align my vector to a 128-bit boundary. (you can aligned the stack to a 128-bit boundary, but then the %ebp register goes on, and it craps out everything else)
After trying numerous things, I just outputted the C code to assembly, and hacked it so my vector was alligned, and then SSE showed minimal performance gains at all.
I'm working on rewriting my library to make it so that I don't have to write out all the vectors every single add/mul/madd. But this thread will be old by then, and no one would probably read it.
I am unamerican, and proud of it!
You can switch the second processor off permanently by simply taking a hammer to it.
10.2.7 is only available to slashdot subscribers.
Tyan Thunder i7505 motherboard with PCI-X, 8X AGP, Gigabit Ether, Firewire, USB 2.0, and 6-channel Dolby audio (including SPDIF out) = $415.
Dual Xeon 2.66 GHz @ $315 X 2 = $630.
512MB Kingston CL2 ECC DDR = $80.
ATi Radeon 9800 Pro with 128MB = $305
Seagate Barracuda 7200.7 Plus - 160GB/8MB Cache = $165
Pioneer DVR-A06 DVD burner (Supports DVD-R, DVD+R, DVD-RW, DVD+RW) = $250
Lian-Li Aluminum Case = $115
TruePower 550 Watt power supply (nice and quiet!) = $110
Add more cash if you want to skip stock cooling to lower noise or overclock by buying a pair of Zalmans or Swiftechs = $100.
Oh yeah, throw more cash at a modem, keyboard, and mouse = $100.
Now that totals $2,270 right now for the x86 equivalent of a $3,300 PowerMac available in September. One could stick with the stock 64MB Radeon 9600 that will come in the top PowerMac and drop the extra $300 off it's price. That would allow you to drop $160 off the PC and it would still leave you with a 128MB card, a fast, quiet drive (8MB cache which does add a boost -- see storagereview.com), CAS 2 ECC memory, HyperThreading for that 4-way feel, an optical drive compatible with all formats, and a cool grand to spend as you please. If you don't care about PCI-X or integrated Firewire you could get the Tiger i7505 instead and save another $150 to boot. If you want this PC to be even cheaper you can wait until the G5 PowerMacs are actually available before buying this dual Xeon box.
I work with os x standard and server almost daily. I meant the server edition rather then addition... OS X has several version is all I was pointing out in my previous post. Yes it costs tons more... but thats not related to anything as does windows server.
Hmmm... Pie...
it's been a while but when i was trying out os x installs and version i remember entering serial numbers. Might have been the server edition...
In any case buy xp pro cds and it will work on all pc's that can run xp as long as you have the correct number of licenses.
Hmmm... Pie...
>> So why didn't they port their code to SSE2?
Probably because it's not worth it. While Altivec speeds up Jet3D by a factor of 10 to 13, there is no evidence that SSE2 can even double the speed.
I'll take the word of the NASA guy over the slashdot rants.
Ok, state your arguments, and I'll state mine. After all, I seriously doubt you can conduct a civil discussion in the first place, based on your preference for ad-hominem attacks instead.
:)
Apple and Microsoft both make piles of money; that doesn't mean I like how they conduct their businesses.
As for the Xerox bit, I admit I never looked into it, but after a little googling, it sounds like Apple did have some sort of agreement with Xerox, or at least "In the end, Xerox got a large block of Apple stock for sharing the technology." (according to Woz himself). That doesn't make it any more innovative, however.
pb Reply or e-mail; don't vaguely moderate.
Apple's benchmarks... gave the Dual 2GHz Power Mac G5 a 194.5% performance advantage over a 3GHz P4 in SPECfp_rate_base2000. The G5 getting a score of 15.7, and the P4 getting an 8.07.
15.7 is 94.5% greater than 8.07 -- not 194.5% greater.
That that is is that that that that is not is not.
I'm no expert in anything, but I remember reading that the G5 is the only 64-bit chip that will run 64-bit and 32-bit natively at the same time. AMD's Opteron will only run both if you reboot between the 64-bit and 32-bit sessions. The Itanium will run 64-bit and only run 32-bit in emulation. It seems to me that all of them are fast, so I don't care about relative speed in that regard, but the G5 seems more flexible as far as running 32-bit and 64-bit applications in the same session side-by-side. That's why I'm buying it.
You said:
Altivec is nice, for what it is meant for (mainly media type calculations, signal processing, etc.) But scientists will prefer SSE2.
Well I was under the impression, that if I need double-precision floating point arithmetic I use the FPU (which is quite fast with the G5 as you can see in Fig. 1 of the Article). ;-)
Concerning fast vector operations - some problems can be "dumbed down" to take advantage of the faster single precision units and lastly you can do an additional Newton-Raphson refinement step where double precision is needed.
Your claim "scientists will prefer SSE2" was proven wrong by the article, where the Jet3D test suite was only ported to take advantage of the Altivec SIMD unit, not the SSE2 SIMD unit - unless of course you argue that NASA engineers are not scientists
IMHO real scientists use whatever they think is best suited to solve their specific problem.
Code is Speech. No to Censorship.
Well, first of all, I work at NASA as a researcher, doing numerical work (although I'm not a civil servant, and don't speak for them).
:)
I agree that AltiVec is superior to SSE (ie. single precision), but you compared it to to SSE2, which is a bit apple-to-oranges (no pun intended, btw). If the G5 FPU is faster than current SSE2 at double precision, it just proves the well thought out design of the PowerPC architecture (and the unfortunate legacy of Intel's FPU instruction set, which is still a handicap even with SSE/SSE2, due to the need to mode switch).
But SSE2 is still immature, and I expect compilers to improve, as well as chip implementations. Once they do, a more meaningful comparision can be made.
The Intel chips NEED stuff like SSE/SSE2 to achieve faster floating point speeds, whereas the PowerPC can get by without it, thanks to a much better FPU design, and thus, PowerPC makers will probably not spend the silicon to make a double precision SIMD instruction set anytime soon.
I stand by my claim that while most consumer and media software can get by with single precision, scientific computing (ie. large matrix calculations, to be blunt) quite often needs double precision (hell, you can get libraries that use 128 bit long doubles, these days), and will ultimately prefer SSE2. Scientists fuss with single precision SIMD simply because many of their applications can benefit so much from SIMD that it is worth the pain to use single precision (with proper conditioning and verification, etc.) Now that double precision SIMD is available, I can only predict they will want to jump to it, once tools for using it are there.
Granted, if Intel can't make a double precision SIMD unit that outperforms a double precision general FPU like the G5's, for matrix problems, then they don't deserve to design chips for scientific computing.
"It's overkill, of course. But you can never have too much overkill." - Anonymous Slashdot Coward
MFLOPS = (million floating point operations) / second
.
MHz = (million cycles) / second
MFLOPS / MHz = (million floating point operations) / (million cycles)
=> (floating point operations) / cycle
So this unit is deliberately chosen to disfavor the P4 since it measures only number of operations per cycle and and is not a measure of speed or how fast. To measure speed it has to be per second
We all know that the P4 has more cycles per second than the G5, this benchmark is the other extreme of the MHz myth. More floating point operations per cycle does not mean much without considering the number of cycles per second.
A computer that does 10 ops per cycle, and 1 cycle in 10 seconds only does 1 op per second.
Another computer that does 1 op per cycle and does 10 cycles in 1 second, would do 10 ops per second.
Which is faster? I'm sure you don't need a research institution to tell you that.
Nasa compares them only on the ops/cycle metric. And some G5 zealots here tend to agree. What a disgrace to common sense.
"Fighting terrorists with millitary might is like killing a mosquitor on your Dad's forehead with a rifle."
I find it deeply depressing that this is the second week in a row I have had moderator points... only to find that everything is moderated at 5, and, what is even worse, that is exactly how I would have moderated!!! "Use 'em or lose 'em" eh? I think it's the /. Bayesian filter that has a loose screw! I mean, how many people are moderating, really? And how come we are all in agreement??? Like Senator Joe "MAC" Carthy, brilliantly immitated by the previous poster, this is, no doubt, a communist plot to overthrow /.
I'm sure you'd agree that ops/sec is also not a very useful metric either, since two CPU's will have vastly different efficiencies of work accomplished for an average instruction.
A CISC "alu [dstMem], [srcMem], [srcMem]" instruction that (when efficiently using a cache) can complete in fewer cycles then the explicitly defined RISC instrs:
"ld [src1]",
"ld [src2]",
"alu",
"st [dst]"
due to a reduced load on the register set is an example.
More critically, there are the vector operations, cache-sensative prefetching instructions, etc.
Ultimately, all that you can do is have comparable top-tier compilers render the same application on different platforms and compare the aggregate outputs. Individual enhancements don't generally make or break a CPU; it's the whole shibang; platform, compiler, OS, language-choice, etc.
-Michael
Bull. It is still the best metric. Compilers don't matter. User the best available for each platform and compare systems of equivalent price. The one with the better ops/second wins -- simple.
If there is a vector unit present, it should enable the proc to do more ops/sec so it is included in the metric. No of procs don't matter either, we're talking systems here. They should learn from the TPC benchmarks. What matters should be how fast and how chep, not what compiler or how many CPUs or how many MHz etc.
"Fighting terrorists with millitary might is like killing a mosquitor on your Dad's forehead with a rifle."
>> So why didn't they port their code to SSE2?
Probably because it's not worth it. While Altivec speeds up Jet3D by a factor of 10 to 13, there is no evidence that SSE2 can even double the speed.
Yeah, especially if no one tries to port something to SSE2 to see.
Cleary, you've been reading about the Opteron in Mac fan sites. With the Opteron (which was released sveral months ago and gets almost twice the G5's SPEC scores), you can even have a 64-bit OS with 32-bit drivers. Go read the articles in Ars Technica or Anandtech and stop propagating stupidity.
No one is yelling it can't be true. Did you read the article? The G5 loses to a 2.6 GHz P4. Obviously, the 3.2 GHz would be even faster, and the opteron probably faster still.
you guys are hilarious. lol. fighting like cats and dogs. when it boils down to it, computer geeks use pcs and artsy types use macs.
I love it I quote a troll and I get modded down as a troll because it was a STUPID troll?
;)
WTF...
Oh well the moderation is all screwed up anyhow. They gave ME mod point.
As you can see I don't care about my karma.
Yea it was a troll that went -1 so no one saw it and assumed you were trolling. Funny huh.
All thoses people who have their setting to show nothing below a 1 shouldn't be allowed to mod unless they have read the parent.
becuase my experience has been so horrible!! ; .. just play yahoo pool on your stupid mac with some one on a pc, and then talk ;
if a computer that is supposed to be used for video editing and what not is not good even to play online pool, i am sure not to use it to edit my home videos or pictures
all those dumb mac users out there
btw, it was a G4 15 inch and whatever ram it came with;
Apple's Cannibalistic Humanoid Underground Dweller Tools? Now I'm truely frightened!
Regarding the less hassle, updates are just a click. Remote administration can be done via SSH.
My anecdote about easy to use regards burning CDs. Some one asked how I did it and only then I realized that it was just a matter of popping in a blank CD and dragging the ISO image over to it. Even the correct speed was figured out automatically. It was really so simple, especially compared to all the Windows based CD burners I've seen.
I'll sing you a song
It won't take long
G5 users are poofters!
That looks pretty similar to what you already wrote up; I'm not convinced, but maybe you're right. In any case, I obviously won't be buying a Mac, and at least for those reasons. Their additions--much like Microsoft's--generally annoy me more than anything.
However, you should probably collect those comments together and distill them into a stock reply on the subject; the topic does come up often enough, after all.
pb Reply or e-mail; don't vaguely moderate.
Now, with all these Linux-heads around here insisting that Linux is faster than Windows on x86, you'd think GCC for x86 might be a good compiler.
I believe the kernel design is a more important factor than how fast code the compiler generates.
Do you care about the security of your wireless mouse?
Octuple-precision floating point on Apple G4.
Lars T.
To the guy who modded me down from perfect to terrible Karma - Apple haters still suck
SETI@Home is not vectorized. Even on a G4, they could get a factor of almost 10 on throughput, were they to vectorize the client.
When this was pointed out to them by Apple employees, the SETI@Home people did nothing about it, preferring to keep the client code the same on all platforms, rather than achieve their workload numbers in a fraction of the time (apparently one Apple employee by himself is sitting at about 15th place on number of workunits processed).
It's actually really annoying, but from what I hear, SETI@Home is wrapping up, and they are working on new clients for different problem sets (also not vectorized).
-- Terry