More Cell Processor Details And First Pictures
slashflood writes "After reading two articles on slashdot about the Cell architecture and another one that criticizes the extensive roundup of the STI patents, I found the first pictures of the Cell core. It seems that at least some predictions were true. Seeing is believing." mtgarden points to this ZDNet article which says that the "first version of the chip will run at speeds faster than 4GHz. Engineers were vague on how much faster, but reports from design partners say 4.6GHz is likely. By comparison, the fastest current Pentium PC processor tops out at 3.8GHz." (More below.)
Hack Jandy writes "Anand Shimpi has some details about the upcoming Cell processor (PS3) in his personal blog. According to Anand, "Rambus announced that the new Cell processor uses both Rambus XDR memory and their FlexIO processor bus. Because Rambus designed the interface for both the memory controller(s) and the processor interface, the vast majority of signaling pins are using Rambus interfaces - a total of 90% according to Rambus." Hasn't Rambus been showing up a lot again recently? The fact that Cell uses XDR has been widely speculated, but the fact that it will also use the Rambus bus signalling is something completely new."
How about HL2 benchmarks?
Cell processors could really dominate. With how cheap they arespeculated to be, their distributed processing, and their all around speed, the could take over a significant part of the computer marketshare. If Cell processors also have the Power4 processors in them, this could be a replacement for x86. Could be. As other articles have pointed out, x86 has had superior competition in the past, and has been able to weather it. We shall wait and see. Cheers
Hurricane Ivan: A 17th century prison collapsed. All of the inmates escaped.
The Cell is going to be in the PS3, so does that mean that the PS3 will be clocked at 4.6 GHz+? That seems like a big leap considering consoles are normally running a little slower than "good" computers at the time they come out.
Sigs are for the weak.
I understand the chip will be used in Playstation 3, but it will also likely be used in future Apple computers, of which, the G5 is already based on the Power architecture. Maybe IT would've been a better section to put this under?
Best. Webhost. Ever. Dreamhost.
I'm waiting to see how much work it can actually do before making a judgement. At the least it always exciting to have another option. I wonder how difficult it will be to take advantage of the new architecture.
While 4.6 GHz sounds impressive, I thought we were getting away from the notion that clock speed = performance. The Pentium 4 killed off clock speed comparisons.
I must admit the specs are impressive, but show me the benchmarks!
For all intensive porpoises your a bunch of rediculous loosers
"Nucular". It's pronounced "nucular".
I believe Sony and IBM and Toshiba are going to produce this thing as a joint venture, calling it "Cyberdyne" also naming the PS3 online game network Skynet, sounds promising...
I don't think the final PS3 part will be clocked much higher than 3.5GHz. Otherwise it would probably involve downclocking parts of the CPU to maintain a sane thermal profile, thus making overall performance rather unpredictable. This would especially impact games, where it's all about sustainable framerates at 100% CPU utilization.
If IBM can produce these things in enough volume this could be the ideal linux platform for the future.
PS3 is expected to sell very well so the chip production might be soaked up by the game consoles but you never know.
I bet apple engineers are salivating right now too.
Promises to be interesting for intel and AMD next year.
Get a free Mac Mini
http://www.scee.presscentre.com/imagelibrary/detai l.asp?MediaDetailsID=25555
:
CELL...bringing supercomputer power to everyday life with latest technology optimized for compute-intensive and broadband rich media applications
SUMMARY:
Cell is a breakthrough architectural design -- featuring 8 Synergistic Processing Units (SPU) with Power-based core, with top clock speeds exceeding 4 GHz (as measured during initial laboratory testing).
Cell is OS neutral - supporting multiple operating systems simultaneously
Cell is a multicore chip comprising 8 SPUs and a 64-bit Power processor core capable of massive floating point processing
Special circuit techniques, rules for modularity and reuse, customized clocking structures, and unique power and thermal management concepts were applied to optimize the design
CELL is a Multi-Core Architecture
Contains 8 SPUs each containing a 128 entry 128-bit register file and 256KB Local Store
Contains 64-bit Power ArchitectureTM with VMX that is a dual thread SMT design - views system memory as a 10-way coherent threaded machine
2.5MB of on Chip memory (512KB L2 and 8 * 256KB)
234 million transistors
Prototype die size of 221mm2
Fabricated with 90nanometer (nm) SOI process technology
Cell is a modular architecture and floating point calculation capabilities can be adjusted by increasing or reducing the number of SPUs
CELL is a Broadband Architecture
Compatible with 64b Power Architecture(TM)
SPU is a RISC architecture with SIMD organization and Local Store
128+ concurrent transactions to memory per processor
High speed internal element interconnect bus performing at 96B/cycle
CELL is a Real-Time Architecture
Resource allocation (for Bandwidth Management)
Locking caches (via Replacement Management Tables)
Virtualization support with real time response characteristics across multiple operating systems running simultaneously
CELL is Security Enabled Architecture
SPUs dynamically configurable as secure processors for flexible security programming
CELL is a Confluence of New Technologies
Virtualization techniques to support conventional and real time applications
Autonomic power management features
Resource management for real time human interaction
Smart memory flow controllers (DMA) to sustain bandwidth
I think more likely, Microsoft will just end up putting in more Cells in their next XBox if the PS3 proves successful.
Thing is, the next Xbox will be using a PowerPC 970. So it will share a common ancestor (POWER) with the Cell.
I wonder, how compatible are the two CPUs' instruction sets? Will Microsoft be able to drop a Cell into a future revision of the Xbox2 and maintain backward compatibility? Could someone theoretically hack a PlayStation3 to run Xbox2 games?
They zoomed in on this press photo of an engineer holding a die.
Help save the critically endangered Blue Iguana
No chance that Rambus learned a lesson, then?
The Spoon
Updated 6/28/2011
Remember how the Emotion Engine worked us all into a lather five years ago? And when it came out, it was just merely competitive with contemporary processors? Sony is great at churning out nerd fetish tech, but they have a terrible track record of living up to their promises. Let's hope it's different this time.
My friend just called me from the ISSCC. He got a couple more bits of info, including that STI plan to recoup their R&D expenses largely from other consumer multimedia devices and NOT from selling Cell processors or Cell processor based computers, meaning they will be surprisingly inexpensive. Yeah!!!
with IBM is pushing linux, I can't see microsoft ditching their good buddy Intel, and throwing money at IBM by porting windows to run on the Cell processor. I would guess that this would allow Intel to retain their market share, even if the Cell is infact a far superior processor.
- a regular CPU (good for program flow/logic and interdependant operations),
- a vector unit (good for large arrays with no conditionals),
- and 8 stream processors (good for applying the same operations plus flow control to lots of independant chunks of data).
w00t!what does your oscilloscope need a D/A converter for? Do you mean an A/D converter? And why 1,024 bits? That's 128 channels worth of 8-bit A/D.
What scope is this?
Ce n'est pas un vrai mouvement de robot!
fact #1 Apple and Sony have been awfully close for the past few years- with some dialogue between the two CEOs.
fact #2 Apple has signed up to display at E3 this year- but hasn't published any official info on their site.
fact #3 The Mac is somewhat deficient when it comes to gaming when compared to the Windows PC.
So my speculation is that it is possible that Apple intends to build a new Mac aimed at the gaming market that will be compatible and play Sony's PS3 games- Apple in turn could publish games for the PS3.
Apple's Connection to the Cell Processor
For those of you wondering about the power consumption of this thing, perhaps you should note that Sony just licensed LongRun2 from Transmeta. It is a dynamic solution for power consumption and leakage that will probably end up in the 65nm versions coming out next year. google transmeta sony for more.
Once touted as the Intel killer, perhaps Transmeta will finally have its day.
"If you build it, he will come."
If you create a machine so powerful that there's nothing that fully utilizes its capacities, that merely spurs all sorts of geeks to dream about how they can push that machine to its limits, then overclock it, then put it all in a case made of Legos.
- Greg
Start a happiness pandemic
It is, and receantly with developments in chip design and compiler design, the architecture of a chip has become much less of a big deal.
Back in the day, RISC was important because it allowed pipelining, the ability for a chip to be doing multiple things at once. Like old MIPS chips used to have 8 parallel piplines that took 8 cycles to execute an instruction, giving an effective rate of one instruction per cycle. Couldn't do that with CISC. Well now processors are decoupled from their ISAs. Each of those instructions is translated into a number of micro operations, which are actually what get handled by the processing section. Likewise it means there can be more registers than are exposed by the ISA.
The upshot is that it doesn't matter as much it used to.
However, there are still plenty of people who like to villify Intel for sticking with x86. They declare it to be an olde kludge of an architecture that needs to die and makes things all slow. However when AMD decided to stick with it, rather than hop on the EPIC bandwagon, they are suddenly heros for maintaining backwards compatibility, which is the whole reason Intel has stuck with x86 for so long.
What's I'm pointing out is the bashing is done against Intel, regardless of what they do. Intel is in the "bad" position, no matter what that is. Like with the cell chips and speed. Slasdotters have been long raging on Intel for making a design that has higher MHz but less performance per MHz (as opposed to AMD). They declare it to be a marketing gimick, etc. Now here we have an article talking about cell chips that are designed to cycle even faster, and taking shots at how slow Intel chips cycle by comparison.
It's not that these people actually have good reasons to like or dislike the decisions, they just dislike Intel and so slam on them.
"plan to recoup their R&D expenses largely from other consumer multimedia devices and NOT from selling Cell processors or Cell processor based computers, meaning they will be surprisingly inexpensive. Yeah!!!"
However, from the press release:
Prototype die size of 221mm2
When it comes to chip manufacturing, the cost of a chip is basically a direct function of the area. A 221 mm^2 chip size is pretty damn big; this thing isn't going to be cheap. Even considering IBM's extensive fabrication experience, Sony will probably have to sell this at a significant loss to make the PS3 palatable to gamers.
Granted, this is a prototype, so they can probably shrink it further by production, but it still won't be something cheap. Don't count on being able to buy these cheaply to make your own parallel supercomputer.
Prescott has 125M transistors, while the GeForce 6800 has 222M transistors. And on a tangent: this is typical Slashdot. IBM and Sony announce a 256 gigaflop chip, and Slashdotters' first reaction is to bitch about how hot and noisy it will be! Where are the real nerds in the audience?
A deep unwavering belief is a sure sign you're missing something...
Before everyone starts having digital orgasms over this chip we should take a sit back and watch approach. This all sounds amazing on paper as did PS2 and look how that turned out in terms of hardware power.
We have no idea if developers will be able to easily adapt and get any real performance out of this thing above and beyond what they get from CPU's now. Almost nobody uses the vector units in PS2, who says they will start now? In terms of just gaming I wanna see some games and examples of this thing running in real time before i start taking my wallet out of my pocket, Sony burned me last time with underpowered bug ridden hardware, ill be damned if i let them do it again.
Did you RTFA? From the second article:
Die size: 221mm^2
Transistor count: 234m
SPE Size: 2.5x5.81mm
SPE Interconnect: 4x128bit ring bus
SPE local memory: 256KB
SPE decode rate: 2 insns/cycle
SPE resources: 7 execution units (unspecified type)
They also mention the core voltage of the CPU (1.3V), the fact that the memory has been tested to 5.4GHz, detail the temperature monitoring scheme, and the fact that the SPEs are in-order chips. This is all new information.
A deep unwavering belief is a sure sign you're missing something...
I wonder what M$ has to beat back a server processor with essentially hyper threading, running at 4.6 ghz, attached to 8 vector processors, each with a lot of registers and cache, which are using extremely fast memory, that can connect to other, similar processors nearby.
Microsoft has consistently overwhelmed the fastest processors on the market and I am confident that with the right bloatware they will continue to do so.
I'd be far more worried about malcode (virii, worms, etc) taking hold in the cross-platform evironment the parent mentions than worrying about games.
There seems to be alot of confusion surrounding the Cell chip. This is not "just another processor", and it certainly has little to do with clock frequencies - the Cell is a whole new architecture, which might just be a glimpse into the future of computing.
To begin with, it might be useful with some background on the ps2 architecture - there are a couple of really great in-depth articles at Ars Technica; Sound and Vision: A Technical Overview of the Emotion Engine and The PlayStation2 vs. the PC: a system-level comparison of two 3D platforms.
What made the ps2 so awesome was that it was custom-built specifically for multimedia-processing, which requires completely different processing environments than general-purpose computing. Normal PCs are made for computing where you have a large number of instructions working on a small data-set (such as a spreadsheet) - this requires large data-caches close to the CPU, while instructions are streamed continually from RAM. Media-processing is the other way around; you have "simple" operations (like doing the calculations for a single pixel), which are run on a large set of data - so you wouldn't really need any data-caches. The ps2 did exactly this; it removed almost all the caches (only a few tiny ones were left), but it had a totally insane bus bandwidth. To borrow an analogy from the mentioned Ars Technica article:
"Here's a goofy example to help you visualize what I'm talking about: imagine a series of large buckets, connected by pipes to a main tank, with a cow lapping water out of each bucket. Since cows don't drink too fast, the pipes don't have to be too large to keep the buckets full and the cows happy. Now imagine that same setup, except with elephants on the other end instead of cows. The elephants are sucking water out so fast that you've got to do something drastic to keep them happy. One option would be to enlarge the pipes just a little (*cough* AGP *cough*), and stick insanely large buckets on the ends of them (*cough* 64MB GeForce *cough*). You then fill the buckets up to the top every morning, leave the water on all day, and pray to God that the elephants don't get too thirsty. This only works to a certain extent though, because a really thirsty elephant would still end up draining the bucket faster than you can fill it. And what happens when the elephants have kids, and the kids are even thirstier? You're only delaying the inevitable with this solution, because the problem isn't with the buckets, it's with the pipes (assuming an infinite supply of water). A better approach would be to just ditch the buckets altogether and make the pipes really, really large. You'd also want to stick some pans on the ends of the pipes as a place to collect the water before it gets consumed, but the pans don't have to be that big because the water isn't staying in them very long."
So, what does this have to do with the Cell? The Cell takes this concept even further. Cell systems are made up of multiple processors, called APUs (Attached Processing Units), which are connected using an insanely fast data bus. Each APU can be programmed to handle one specific task, and then pass the data on to the next APU for a different task. By doing this, you can just put in more processors to increase the throughput of the system. This works especially good for multimedia processing, which can be pipelined like this pretty easily. Here are a couple of snippets from the Wikipedia entry:
"While the Cell chip can have a number of different configurations, the workstation and PlayStation 3 version of Cell consists of one "Processing Element" ("PE"), and eight "Attached Processing Units" ("APU"). The PE is based on the POWER Architecture, basis of their existing POWER line and related to the PowerPC used by Apple
If you can help clarify some of this for me, thanks.
Irritable, left-wing and possibly humorous bumper stickers and t-shirts
They already tried this for the PS2.
So this will just be like the last time Rambus had their hooks into a product, it will die a very expensive and slow death.
You mean like the PS2 did (given that is uses RDRAM)?
Dave
I write a blog now, you should be afraid.
You mean like the Itanic? Shoe's on the other foot now, eh?
If I were to villify Intel, it wouldn't be for sticking with the x86 instruction set.
:)
I'd villify them for...
Building the 8086 down to such a super low cost, by crippling them with a pitifully small number of interrupts, registers and DMA channels, they knew it was a bad idea but went ahead with the design anyway.
Making a chip (the 286) with an enhanced 16-bit mode, but no way to switch back to real mode (fixed in the 386) which resulted in the 16 bit mode of the chip being nearly useless.
Failing to properly address the I/O problems of the first generation processor until the fifth generation CPU.
Forever upping x86 core speeds in an attempt to convince punters that GHz == instructions executed, despite it meaning that the chips would fail more, and make computers noisier and hotter.
And finally, copying AMDs 64bit architecture without giving any credit.
As I understand it, the APUs can act semi-independantly but the controlling processor has overall control. If that's true, if the processor wanted to (say) switch to some other process would it have to save all that state to somewhere else before continuing, just as standard processors do now?
As all the APUs have lots of big registers and significant amounts of private memory, wouldn't that be painful?
I rarely criticize things I don't care about.
You weren't supposed to switch back to real mode, which was obsolete. They underestimated the PC industry's fixation on backwards compatibility and overestimated the ability of Microsoft and other software vendors to produce advanced operating systems. It was a decent chip, but it wasn't what most of their customers wanted.
Mea navis aericumbens anguillis abundat
and I bet it still wouldn't be able to run Longhorn.
Warning: Could be fatal if taken seriously
as Mr. T would say.
How about some actual SPECint and SPECfp?
Oh, nothing like that was released? Hmm. makes you wonder. Sort of like the Itanium flop where the excuse, going on for about 10 years now, is that the compiler isn't quite optimised yet.
Any nerd over 15 ought to have heard far to many claims of "revolutionary cpu design" to know better.
actually according to the reports ive read, the cell is supposed to debut in servers long before ps3 arrives, and i remember marketing estimates of 16 terraflops for a 1 rack server!!!!!!
watch "the money masters" on google video
The New Xbox 2 (or Xbox 360?) is using the PowerPC , if fact Microsoft is currently using Apple G5 as the development platform. So they will have experiance on the Power architecture. I seem to remeber them doing some work in with NT on PCC in 98? but it was killed.
Don't forget KDE bloatware.
I was at the Cell event today, and quoted in some of the news stories. I also have the ISSCC technical papers.
The PowerPC core in the Cell prototype chip is NOT a Power5, as speculated here. According to IBM, this core was designed from scratch for this application. One critical difference is that the new pipeline executes instructions in strict program order rather than reordering instructions to improve throughput as is done with Power5.
Also, IBM has not described the core as "simultaneous multithreaded", just "multithreaded." I presume from this that the multithreading is coarse-grained-- only one thread is active at a time, unlike Power5 which can execute instructions from two different threads in the same cycle.
The logic design for the Cell CPU was optimized for higher clock speeds in a given process than Power5 can achieve. This is a good tradeoff for more linear multimedia algorithms, but reduces effective throughput on other types of code.
I think it's reasonable to suppose that if Apple were interested in using the Cell architecture, it would prefer to use a version of the design that includes a Power5 core in place of the one in the Cell prototype.
. png
Hey man, if $34 Billion in revenue is getting your butt spanked, then line me up and get the paddle.
As for graphics, those have been a huge success, to the point that nVidia and ATi began copying the idea. Intel's integrated chipsets are a huge hit with business. They keep costs and space down, and high performance grapihcs aren't necessary for office work. The integrated low-end graphics chip is getting to be quite popular.
Networking would be another huge non-processor area that they excell in. If you ask me what kind of NIC I want in a server, Windows, Linux, BSD, whatever, the answer is Intel. Nobody else I know makes cards of the same quality. 3com used to, but not anymore.
Now the x86-64 thing is an interesting one to pick on, because the reverse is true. AMD was being the uninnovative one. They decided that innovation, in this case, was unnecessary and counter productive. They decided to just whack on 64-bit extensions to the x86 architecture, as was done with the 32-bit conversion years ago, and call it good. It offered nothing new in terms of ISA, but that meant backward compatibility.
Intel tried to be radical. EPIC is a neat idea that's been messed with for years and never made practical. You have the compiler do all the work of deciding what runs in parallel, rather than the chip. Makes for helaciously complex assembly, but that's ok, you just need a good compiler, and Intel makes the best.
Well, total non-starter in the desktop market, that's gone to x86-64 and it's not changing. However seems to be working in the high end computation market. We just got in 2 racks of SGI Itanium coputers for one of the research labs. From what I hear, they are badass number crunchers.
Now if you want to talk some major failures, let's have a look at AMD's motherboard situation. When the Athlon came out it was abysmal. AMD couldn't produce a reasonable chipset to support their own processors. It was slow and incomplete, and couldn't deal with basics like AGP 2x. VIA had a full featured chipset, that was full of bugs and couldn't handle hardware like the GeForce in many configurations. ACPI problems plauged all boards.
Now the point here isn't to try and say Intel's better than AMD. The point is, both companies have hits and misses. Some products can be both a hit in one way, and a miss in another. However there's a lot of fanboyism about AMD and hate towards Intel and its not productive.
You should pick your platform based off of informed choices about what performs better for you, and gives you that performance at the best price. If you find yourself having to justify it by attacking the other company, you probably made it for the wrong reasons.
This goes extra for doublespeak like hating on Intel for focusing on MHz, then hating on them again when someone else does so.
Cut an 80x80 hole in the top plastic about the middle of the unit, facing upwards. Mount a lo-flow panaflo over the hole, facing up. Wire the panaflo to an external 9v wall wart. Cut the wire to the original fan. For an added bonus, undervolt the panaflo to 6 volts, and add an undervolted rotary fan into the HDD drive bay. This should be silent overall and have a great airflow.
The default internal fan is a 6v 50mm screamer, narrow and loud. Ultimately it moves less air than any undervolted panaflo. Plus it is pretty terribly positioned. A fan on top of your case may prevent you from stacking other equipment on top, but it is perfect for sucking heat off of the gargantuan main heat sink, and gives the fan enough space that the blades don't make that choppy noise. Thanks to a ludicrous heat sink the PS2 doesn't really require that much cooling, though it does require some. It is usually about 20 minutes before a PS2 with no fan will overheat, while a mostly symbolic fan is usually enough.
Of course, the PS2's are engineered to work in hot climates, so if you're near the sahara you may want to run your fan faster. But overall the systems don't need to be noisy.
The ______ Agenda
And the power supply it needs outputs 1.21 Gigawatts.
It is a radeon/geforce competitor. Or something like that.
The cell processor is only really fast when the spus are in use, which means 32-bit non-branching floating-point arithmatic. For anything involving integer math, flow control, or uneven memory access, the SPUs defer to the main processor. I'm sure IBM put a decent processor in there, but it doesn't sound like it's anything revolutionary, and there's only the one.
What does this get you? -- A processor that is really good at decoding mpeg, rendering graphics, maybe approximating the physics of flying dragons. It is not a fast general purpose processor. Operating systems, word processors, databases, these are all integer tasks, and much more-so they are branch tasks. Scientific computation - this requires double-precision floating point. Photoshop is about the only piece of non-multimedia software that might be able to take advantage of this.
The end result is that this will likely be a great chip for set-top boxes of all sorts, maybe even for video-editing workstations. A G5/pentium replacement it isn't; that's a different ball game.
I hear you though. The Power5 is designed to handle large multi-process loads. This new Cell architecture, or at least this particular Cell chip, is designed for real time processing of large piles of data.
I'm not reliving computer architecture class... I'm not reliving computer architecture class... (open's eyes) ... Whew
"Learning is not compulsory... neither is survival."
--Dr.W.Edwards Deming