IBM Unveils Fastest Microprocessor Ever
adeelarshad82 writes "IBM revealed details of its 5.2-GHz chip, the fastest microprocessor ever announced. Costing hundreds of thousands of dollars, IBM described the z196, which will power its Z-series of mainframes. The z196 contains 1.4 billion transistors on a chip measuring 512 square millimeters fabricated on 45-nm PD SOI technology. It contains a 64KB L1 instruction cache, a 128KB L1 data cache, a 1.5MB private L2 cache per core, plus a pair of co-processors used for cryptographic operations. IBM is set to ship the chip in September."
But will it run ... a Beowolf cluster of ...
[Comment terminated : memelock detected]
So what is this beast supposed to be, a 64 core machine?
Didn't we retire the Ghz wars 5 years ago? I know, AMD style "more done per cycle", but isn't a quad core 3.1 Ghz per chip with 20% logistic overhead faster?
My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
The Z-series mainframes cost hundreds of thousands (or even over a million) dollars, not the chips. As it says in the article.
I can't wait to get a PowerMac G6 with this CPU, in your face Dell users with your commodity Intel-based desi... oh, wait.
You are not alone. This is not normal. None of this is normal.
Fark is consistently a whole 1-2 days faster than Slash-D lately. And their "Idle" section is better.
My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
But it will be obsolete by the end of the month.
The simple truth is that interstellar distances will not fit into the human imagination
- Douglas Adams
They need to work on crunchin more cores on 1 dye then get programs up to speed where they can utilize the extra cores.
IBM defines the z196 as one of the few remaining CISC chips, which allows for bulky, large programs that can require much more memory to execute in than RISC chips, including the PowerPC and ARM embeddded processors, among others.
For CISC you need more bytes per instruction, because there are more instructions. With RISC your executable has more instructions but they each use less storage.
I am not sure I believe their implication that CISC is better for humungus commercial applications. Sounds like marketing speak to management to me.
http://michaelsmith.id.au
Can't even imagine writing in assembly code for this monster. I miss dinking around with a nice 6502 system.
__ Someday, but not this morning, I'll finally learn to use the preview button.
Yes, but their article comments are much closer to Youtube than Slashdot.
That term is so 90's. Why are we still calling it that? Shouldn't it be 'nanochip' or something like that?
At least, not private L2 caches per core. The L1 caches seem a little small, especially if this is supposed to be used in mainframe type situations.
But i'm not too in-the-know when it comes to their mainframes, the cache might not even be needed that much if they have some fast pipes between those circuits.
Either that or the system depends more on the private caches.
Quite liking those 2 co-processors for crypto. T'is an important function that gets ignored far too often these days. Having it in hardware is even better.
Will it run Crysis at full settings though?
Announcing a 5.2 GHz, 1.4 billion-transistor processor at "Hot Chips 2010" just makes sense. Strangely, no power numbers were given...
Intel's netburst architecture (of pentium 4 fame) featured the 'Rapid Execution Engine', which consisted of two ALU's running double the clock speed, on 3.8 GHz Pentium 4's, that would be 7.6 GHz
Granted, that is not the entire cpu, but still..
People, what a bunch of bastards
except possibly in clock speed. I'm fairly sure than an 8-core 4.25GHz Power7 is probably as fast or faster if the workload is properly threaded, which any enterprise server or mainframe should be. On the other hand, on single-thread or few-thread workloads, the z196 probably has a bit of an edge, despite a large portion of its instruction set being microcoded.
Yes, but you'll have to disable "Aero".
Caveat Utilitor
Those slackers, wheres my 3GHZ G5? Huh?
*sigh* FINE, begin the switch back....
-Steve
Sent from my iPad 2
---
If I could mod you insightful, I would.
My 386DX has an external Maths Coprocessor, => can only do floating point functions :( ... your one does how 5.2 Ghz -> Sure my M series superseeds your G series..right?.... ....
However mine's now a bit faster overclocked it from 33Mhz to 52Mhz
right?
It's crazy that an architecture developed in the '60s lives on in the System Z today. IBM bet the company on the S/360 product line. I think the investment has paid off-- and still does!
It contains a 64KB L1 instruction cache, a 128KB L1 data cache, a 1.5MB private L2 cache per core, plus a pair of co-processors used for cryptographic operations. In a four-node system, 19.5 MB of SRAM are used for L1 private cache, 144MB for L2 private cache, 576MB of eDRAM for L3 cache, and a whopping 768MB of eDRAM for a level-four cache. All this is used to ensure that the processor finds and executes its instructions before searching for them in main memory, a task which can force the system to essentially wait for the data to be found--dramatically slowing a system that is designed to be as fast as possible.
I'm assuming the cache referred to in the second paragraph is off-chip cache, otherwise it would sort of negate the first sentence.... Would be nice if the article would have actually said that though.
Monstar L
Essentially all desktop and laptop computers use CISC chips and they are fast and cheap. RISC is a neat theory, but these days it seems that as the processors get decoupled from their ISAs anyhow, for various reasons, that it doesn't matter much. You choose the ISA for reasons of binary compatibility or features or the like, and it'll work just fine with the chip.
Also it is not true that CISC needs more bytes per instruction, at least not all implementations. With x86 you find instructions are variable length. They can be as little one byte and as many as 11 bytes. In actual practice, you find that a lot of 1 and 2 byte instructions are used in code. CISC can be extremely pithy in some respects. Then of course many CISC instructions do more. The idea with RISC is that each instruction does only one thing (that isn't really true with all the vector math stuff these days). So you end up having to issue instructions to load values to registers, operate on them, then store them back. Not necessary in CISC, there are instructions that can take a register and a memory location as values, and sometimes even two memory locations.
That ought to be enough instruction cache for anybody.
The codename for this processor, was "Ming Mecca".
-Laxitive
These days, compilers take care of almost everything. It has gotten complex to the extent that a programmer trying to do things all in assembly will probably do a worse job than a good compiler. Chips have many, many tools to solve their problems.
That isn't to say it is never done, in some programs there may be some hand optimized assembly for various super speed critical functions. However even then it is most likely written in a high level language, compiled to assembly (you can order most compilers to do that), tuned and then put back in the program.
Memory is cheap and compilers are powerful so assembly is just not as needed as it once was, at least on desktops/servers where you see these massive chips.
Really this article kind of makes all of last week's comments on the speed of light limiting the speed of processors to 3GHz a bit pointless doesn't it? Now I know in principle the discussions were correct, but this just goes to show that problems can be engineered around.
No news here. Everyone knows the only innovation going on in the big companies is in Apple nowadays. Move along...
The summary makes it sound like it's merely the one with the greatest clock frequency. Me RTFA is out of the question, this being Slashdot and all.
If IBM licensed by the Core, at least there would be some business justification for developing this chip.
If you direct to the IBM announcement, which mentions the system in more detail then this linked article - http://www-03.ibm.com/press/us/en/pressrelease/32414.wss The New zEnterprise 196 " From a performance standpoint, the zEnterprise System is the most powerful commercial IBM system ever. The core server in the zEnterprise System -- called zEnterprise 196 -- contains 96 of the world's fastest, most powerful microprocessors, capable of executing more than 50 billion instructions per second. That's roughly 17,000 times more instructions than the Model 91, the high-end of IBM's popular System/360 family, could execute in 1970." 17k x improvement in performance in 40 years? I suppose that is about right...
How much would it cost for me to put together a system with the same computing power, using off-the-shelf products, like a Xeon chip, or something? How long would it take for me to save $1 million in electricity, or whatever?
Given that the Z architecture doesn't even have PCI, that would be a no.
There's no failure quite as dissatisfying as a complete and total solution to the wrong problem.
lets see X2 oh ya 6GHZ at 300$
go IBM
Can we still enable "Coffee Crisp"?
in b4 beowulf cluster
When can we move on from microprocessor? Is there a definition?
Let's get things moving, how about a kiloprocessor--or at least a milliprocessor.
yeah shitty IBM utilities on uber hardware. I'll take uber utilities on great hardware.. Good ol DEC
...my trusty VIC-20 was clocked at a mere 1.02 MHz.
Its truly amazing how far we've come in so short a time.
(Well, maybe not so short for you whippersnappers...)
My mom always said, "Jim, you're 1 in a million." Given the current population, there are 7000 of me. God help us all!
Be that as it may, there are also a few of us who simply enjoy the art of assembler. Sue me; I'm a romantic.
Reply to That ||
Windows EVER?
Yours In Osh,
Kilgore Trout
Lots of people have the impression that IBM is some kind of old yesteryear has-been dinosaur of a company that hasn't dies yet just because it was so huge and market-dominating at one time.
They're not. They're still doing real *computer science* at IBM. They might not be the 800 lb gorilla of the market that they once were anymore, but they're still quite relevant, and as to the S/360 architecture, there's nothing wrong with building a solid foundation and sticking to it. Especially if you keep applying computer science to it instead of some marketing guy's ideas of "innovation" to ran down customers' throats.
Actually, I believe it does now. 3D driver support is even worse than Linux, though. :)
You could run Linux on it. Then QEMU/WINE combo on that. Then Crysis on that. It'd still probably only get you back to the 2.5-ish-GHz of your average desktop, though.
I could never... Aero is... the best thing ever... it's so amazing and useful. It improves the look and feel so much.
Wait, do I have it turned on or off right now?
If I could mod you flamebait, I would.
What I'd love to know is how they overcame the 3 GHz overheat barrier that has prevented this sort of processor speed until now.
I guess I'm a counterexample then:
I'm 53.
I believe (hope?) most people who know me would say that I'm still a pretty good programmer.
Terje
"almost all programming can be viewed as an exercise in caching"
I'll hazard a guess that clock speed excels in one particular case: tight-loop iteration. You can't do that with parallelism (ignoring some fancy pipelining to get part-way there). The fastest way to get the 1 millionth result in a no-shortcut iterative sequence is to get the loop processing at the highest frequency possible.
The brain is estimated to have about 60 Trillion synapses (bio-electrical connections between neurons). Neuroscientists have suggested that to simulate a human brain you would need a computer with at least that many neurons. It seems that ~43 of these chips would do the trick!
Had a golf game ended differently, would we be seeing these in power macs?
Seems like there should be more.
I killed da wabbit -Elmer Fudd
Way to break the combo, jerk.
I read TFA and all I got was this lousy cookie
they are very much needed.
We have deployed two IBM 570 series minis and 2 595s. These range from 24 to 64 processor systems featuring 30 to 50tb of disk storage. We will deploy a 770? with nearly 100tb of storage backed by I think 96 processors. There are no Intel/AMD servers that reach our needs with the software and reliability we need. Coworkers in other departments might make jokes about our "old school" hardware and such but even they get shocked by our downtime, which was twice in twelve years excluding new machine upgrades; of course during which we were swapped to offsites. First failure was because the install wrongly wired power connections and site UPS failure dropped the box, the second time was site cooling failure and our machines were the last to turn off, the intel/amd heat furnaces; no other way to be polite about them; quickly could not deal with the hot computer room.
While I have great respect for the server group, their hardware just cannot compare. They can spend similar money but once you reach certain sizes minis (AIX or iSeries) become much easier to deal with, hell with 72 iSeries we only had 3 Admins for the whole lot, we have more than that just for one AIX machine, let alone that "department" that supports the servers (think company wide data sharing) or business intelligence setups.
At the same time the one telling point, we are held to a standard which the server groups are not. We have to schedule outages; even with a offsite mirror; just to do upgrades. The other groups get one weekend per quarter, we can't even get that. We serve too many and 24x7 really means 24x365. Lose a file server and people get mad but if we have a slow down we get calls from up high.
So, don't just focus on CPUs, that is the game of PC geeks. We bench mark on work performed and downtime. Every component is intelligent, we don't lose processing power to driving peoples sessions, disk access, and such. FWIW, we host Linux too, so there is some other value for us.
* Winners compare their achievements to their goals, losers compare theirs to that of others.
As a matter of fact Crysis is one of the killer applications for the zSeries has to offer... Financial Crisis that is...
The z196 contains 1.4 billion transistors on a chip measuring 512 square millimeters fabricated on 45-nm PD SOI technology.
At what point would we stop calling it a microprocessor?
This is a Mainframe CPU not a Server/Desktop/Mobile Chip. This means it's purpose is to be as expensive as possible while costing an Arm/Leg in maintenance fees.
Mod me up/Mod me down: I wont frown as I've no crown
IBM also previously claimed the title of fastest microprocessor with the POWER6 chip, which ran at speeds of up to 4.6 to 4.7 GHz, and its own z10, a 2008 chip which ran at speeds of up to 4.4 GHz.
I seem to recall that one of the official reasons Apple gave for the switch from Power to Intel was that IBM couldn't/wouldn't deliver a fast enough processor.
Mainframes are designed to do huge amounts of I/O supporting lots of processing in parallel and running big databases with whatever the current generation's definition of huge quantities of disk storage is. That's not the kind of computer you put on top of your desk, either in a workstation or a laptop. It's the kind of computer you put in a big air-conditioned sound-proofed room down the hall. If you want to run graphics-oriented applications using it to do the data processing, that's fine - do your graphics on a graphics-oriented workstation on your desk, and crunch the data on the mainframe, using some appropriate protocol to connect the two.
Bill Stewart
New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
Wow...the author of the article thinks that the difference between RISC and CISC is how much memory the programs can take up? He needs to get a serious education in computers before writing about them.
Virtualization is a RECENT mainframe trend? Only if you count 1970 as recent. Virtualization is a RECENT trend in toy computers, not mainframes. They did it first. And as to whether it's faster or slower than a PC...it's really workload dependent. You have 5000 employees who need to access the same data, say your airline reservation system? Try that on a farm of Intel servers and you're out of business! You wanna play Crisys? I wouldn't try that on your z10 anytime soon.
Is Slash-D some cool hip way of saying slashdot today?
-- This space for lease, low setup fee, inquire within!
in two years, that's a new revision every 3 days. I'd wait a couple of weeks until z200 comes out.
Nullius in verba
Finally... a processor that can power snoop's pimped out machine with norton installed
You build the new one using the old one. Visual Studio 2010 was written in Visual Studio 2008 originally. Now it is written in itself, the production compiler is used to update the code for that compiler.
I sure hope one of them instructions is 'Insert Characters Under Mask'.
I'm not a coward by any name.
"The fastest X ever" sounds so final, as in "for ever", as if, hey, this might be IT. And of course this kind of record will inevitably be broken, and probably sooner rather than later.
Maybe these kinds of stories should have headlines like, "IBM Unveils the Fastest Processor . . . for the Time Being." or " . . . So Far."
I would have thought it would be "Slash-dizzle."
:-P
And remember, kids: "Hack is Wack!"
Yeah, but will it Overclock?
It would get you nowhere near that. A substantial fraction of any mainframe architecture's instruction set is emulated in software. The actual MIPS ratings are way below the MHz ratings, whereas on most superscalar architectures, MIPS exceeds MHz.
Once you've paid that penalty as well as the qemu penalty, you're getting down to somewhere in the Doom/Quake I range, with no hardware acceleration.
There's no failure quite as dissatisfying as a complete and total solution to the wrong problem.