Quick and Dirty Penryn Benchmarks
An anonymous reader writes "So Intel has their quad-core Penryn processors all set and ready to launch in November. There are benchmarks for the dual-core Wolfdale all over the place, but this seems to be the first article to put the quad-core Yorkfield to the test. It looks like the Yorkfield is only about 7-8% faster than the Kentsfield with similar clock speeds and front-side bus."
My recent experience with quad-CPU Xeon machines is that multithread performance for a single is VERY poor, even with great care in coding, presumably because of cache-sloshing between these physically-separate CPUs dropped onto one die.
(I compare with Niagara and even Core Duo which seem much better for threaded apps.)
Has anyone else tested threadability of these CPUs, and power efficiency, sleep states, etc?
Rgds
Damon
http://m.earth.org.uk/
I would think that AMD would be providing Barcelona benchmarks hand over fist, at this point, if they had something...
There are two possible situations here:
a) Barcelona is faster than Intel's current line-up and does not want to see Intel up the pace more by releasing such numbers.
b) Barcelona is slower than Intel's current line-up and does not want its shares hit a new low, or perhaps buy some time to speed it up.
Full Tilt
"Intel expects SSE4 optimizations to deliver performance improvements in video authoring, imaging, graphics, video search, off-chip accelerators, gaming and physics applications. Early benchmarks with an SSE4 optimized version of DivX 6.6 Alpha yielded a 116 percent performance improvement due to SSE4 optimizations." Not bad...
Depends on what you do "at home". Grandma who only sends email and orders flowers will see zero benefits.
But the rest of "normal" home users who own things like camcorders, make DVDs, rip movies, etc all see a huge benefit. I just put together a Q6600 system and couldn't be happier, but I've been a dual CPU workstation user since the PII days.
They could probably make better use of the die space of the 4th, 3rd, or even 2nd CPU core by putting things like cache there instead. And in another direction, go with SoC (system on a chip) or certain subsets thereof. Combined with serialized bus technologies, this should work while also reducing pin counts.
now we need to go OSS in diesel cars
"And of course.. 4 core CPU has no use at homes unless you are content creator. I'm software engineer, I don't think that any of my colleagues I work with knows how to write app that will take advantage of 2 cores; let alone 4.
Conclusion? 4 cores right now need much software support."
Well, you're talking about cutting-edge CPU:s which typically co-exist with cutting-edge software. If you're getting a quad core setup, it's probably because you're going beyond Word processing.
Of course quad cores will need more software support before they become a more viable option, but it's hardly a bad thing to prepare for the future when you're purchasing a computer.
Full Tilt
I was debating between Q6600 (2.4GHz, 4 cores) and E6700 (2.67GHz, 2 cores), and I have chosen second option, because of limited advantage of more cores, but always present advantage of higher clock speed.
Seriously (partly, at-least) : How many penguins I will see during the boot-up? 4?
Read and Comment at my BLOG
!!!
'it's hardly a bad thing to prepare for the future when you're purchasing a computer.'
yes it is, it costs you extra money & hardware comes down in price quickly, if you buy a high end cpu now that you wont use for another year, you're wasting your money. in a years time your high end cpu will be mid range & a lot cheaper, so it'd probably be cheaper to buy a mid range or budget cpu now, then another one in a years time. then you can get a bit of money back for the old one on ebay too.
Web Design
I read a comparison/benchmark someplace (Ars? Who can remember..) that showed the E6700 only a touch better at a narrow range of applications and getting its hat handed to it on media encoding applications, so I went with the Q6600 since that accounts for my "heavy" computing.
I see MPEG-2 renders running better than real time on single pass encodes in TMPGEnc.
Penryn? Wolfdale? Yorkfield? Kentsfield? What are they doing here, making processors, or naming streets in a new upscale subdivision?
AMD rose to this position primarily because they didn't make Intel's mistakes - trying to force a new CPU architecture on the market (Itanium) instead of incrementally developing the X86 line, and focusing on clock-speed (P4) at the expense of performance per watt. Now that Intel is focused on performance per watt, AMD needs to find a new differentiator for their chips.
Perhaps they should start thinking about how to integrate a high quality Vista-capable GPU into their processors? (afterall they acquired ATI). How about sound cards, USB ports, et cetera. If they can fit 90% of a typical motherboard into the processor and usher in a new era of affordable and efficient computers while intel is busy playing with 64-core chips, why not?
Or if you use Linux... Because that support has been standard for quite some time now. They even rotate which CPU gets priority so that heat and usage gets distributed evenly.
Yay for processes that make sense!
You're turning a discussion about buying high-end hardware into "best bang for the buck". Where in my quoted statement can you see me saying anything about buying the top of the line hardware and that it is the best choice?
Full Tilt
yeah.
that's why they call it chip real estate.
(Hyuck. Get it?)
"Don't mind him - he's from Barcelona."
Barcelona is faster than Intel's current line-up and does not want to see Intel up the pace more by releasing such numbers.
That may have been true 6 months ago, but the K10 is supposed to be officially announced in about 16 days on September 10 (since AMD claims not to do paper launches it is supposed to be widely available then too... ymmv). AMD is not going to be able to stop benchmarks after it is released, and while Intel can adapt quickly, it can't turn on a dime in 2 weeks time. AMD has not been doing well in the PR and benchmarking battles since Core 2 came out, if K10 really was that amazing you would be seeing all the usual suspects putting out full reviews right now in order to generate hype. I'm leaning towards your second theory, and most analysts are too.
AntiFA: An abbreviation for Anti First Amendment.
Not fake. Browse at -1 and you'd see the post he responded to.
(rot13) rpbzbab@tznvy.pbz
Intel tends to do a release of a new architecture, then some refinements on that. While it would be cool to do a whole new architecture each time around, there's just not really money for that. This is one of the refinements. The chips are not likely to be all that much faster then their previous chips at the same clock speed because they are largely the same architecture. Mostly they are just a die shrink (which means lower power and probably better scaling and cost) and some new instructions, that aren't really used yet. They are still Core 2s.
However that doesn't mean that the next generation will be the same. Indeed, if Intel keeps with their plans it will be a new architecture and thus hopefully bring new speed increases.
As to using multiple cores, well if you don't know how, perhaps you'd best learn then? You not knowing how doesn't mean it can't be done, indeed it can be done and IS being done. Multi-core is just the way things are going, at least for now. Not only are desktops and servers headed that way, but even things like the Xbox 360 and PS3 are as well. It's simply time to start thinking about software in a different way. No longer is a big while loop the way to go.
Already that's happening. The number of games (and games are interesting to watch since they often ride the leading edge in terms of requirements) that makes use of two cores has risen dramatically. We are also seeing a couple games, with more on the horizon, that will support 4 cores. Things like AI and physics get executed in parallel, which makes it possible for them to be much more complex.
Finally, there HAVE been some cool developments on processors, just not ones that most hardware sites like to cover. Some time back Intel introduced a technology they call VT, which is basically instructions to allow you to virtualize the protection rings on a processor. Supposed to make for faster VMs. Currently the implementation is somewhat lacking, VMware claims it is slower than a well optimised software solution, though others dispute that claim (Xen likes VT). The new 45nm Core 2s add to the existing VT technology with what Intel calls VT-d. Basically the idea is to allow VM software to pass DMA access to their guests, but in a safe manner that can't hurt the host. This may not be exciting to everyone, but these advances are worthwhile, given that virtual computing is getting more and more use.
Processors may not be getting huge gains in single thread performance any more, but that doesn't mean they aren't advancing.
You can't rule AMD out but Intel sure has done just about EVERYTHING right since Conroe. They've been more open with the press (both financial and enthusiast) and have (for lack of a better expression) engineered the shit out of AMD. That said, it looked like Intel would never catch up during the prime of Athlon 64...of course Intel's resources and size give it an astronomical advantage. AMD will again have to be innovative in some way if it ever wants to catch up...Fusion maybe?
You hit the nail on the head. Parallel programming is now required if you want to take advantage of multiple cores.
NEVER underestimate the huge number of virus / trojan / spyware and pop-up generating crapware that are running in parallel on average joe's computer.
Just think about the number of users who come into stores to buy "faster computers because the old one is getting too slow" when the old computer is crawling under an impressive amount of crapware.
They are the perfect target for those new multi-core processors :
- 1 core for running the OS, Internet Explorer and Microsoft Word.
- All other core for running SPAM-spitting zombies.
Now, if you add Vista in the equation...
"Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
"Conclusion? 4 cores right now need much software support."
It goes beyond just that IMHO, right now the PC industry needs to get it's act together as a PLATFORM. And also for applications that don't break. One of the big things that is pissing me off right now is closed-source programs who's compatability breaks and because it's closed source no one can fix it/update it, etc to get it running when OS's and other technologies change. I think there really needs to be a legal framework for people (end users) who own software and in the Open souce community for accessing closed-source code (that they technically own/ have invested in or have some ownersip stake in really) especially when those applications are long past their sell-date to get them fixed and up and running. You don't buy a car and expect to be prevented from fixing it when something goes wrong.
Next, growth IMHO for certain industries like the game industry is being held back by not subsidizing the cost of some kind of mid-range performance standard graphics *for everyone*. I find it ironic that companies like Nintendo, Sony, and MS can subsidize their consoles, but when it comes to the PC, MS just sit's there.
I think one of the big reasons PC gaming is flagging was in large part due to the incessant march of the graphics card industry. Starcraft and Diablo 1 & 2 were both 2D games, it makes sense that these games got as widespread as they did because they'd run everywhere.
Well, fortunately, some of this software has already been written just for you and your colleagues. Check out make(1) manual page — look for the -j option...
And no, it is not only for software engineering either. Every time I come back from vacation, I use make to convert my digital pictures from the lossless "raw" format of the camera to the lower resolution JPEG for the web-pages. Having four CPUs makes that process four times faster. Great idea, uhm?..
Your colleagues may be doofusen, but people, who will finally bring us reliable speech-generation and parsing (as an example) will certainly be smart enough to take full advantage of the multiple processors.
Meanwhile, you can schedule a meeting to discuss using OpenMP in your company's software... Compilers (including Visual Studio's and gcc) have been supporting this standard for some years now.
In Soviet Washington the swamp drains you.
I am running a Q6600 on an eVGA 680i motherboard, should have just gone with the Q6600, I have this sucker clocked at 3.3Ghz, am pushing for 3.4 but I can't get it stable yet.
When decoding "full HD" h264, i.e. 40 Mbit/s BluRay or 30 MBit/s HD-DVD, with 1080p resolution, current cpus start to trash the L2 cache:
Each 1080p frame consist of approximately 2 M pixels, which means that the luminance info will need 2 MB, right?
Since the normal way to encode most of the frames is to have two source frames and one target, motion compensation (which can access any 4x4, 8x8 og 16x16 sub-block from either or both of the source frames), will need to have up to 2+2+2=6MB as the working set.
Terje
"almost all programming can be viewed as an exercise in caching"
It really wasn't funny regardless..
Who makes an SSE version of a function w/o a regular x86 version to fall back on when SSE isn't available?
bite my glorious golden ass.
Just a totally random comment, but Penryn, sounds a lot like "Pianren" in Chinese, which means to Cheat People.
That's pretty funny. Hope the name catches on. It'd certainly explain poor sales in China :-)
:-/
Oh wait her comes another humor impaired mod to further 'Offtopic' us.
Just junk food for thought...
Conclusion? 4 cores right now need much software support. But shouldn't it be improvements in hardware that make software run faster as opposed to the other way around? For instance, my 3 year old 2GHz Athlon64 is way faster than my 1.6GHz dualcore Athlon64 for all the games I play. Why is it that something that uses twice as much space and is on a smaller process node (90nm) and has twice the memory channel width (dual channel vs. single channel) runs slower? It's newer hardware, and it's running my old software slower. That is destroying the value of the product to customers. Something is wrong here. I bet you a 3GHz core 2 duo would run circles around a 2.4GHz Penryn in any game, despite having 2x the potential processing power.