IBM Claims World's Smallest SRAM Memory Cell
nokiator writes "IBM issued a press release today claiming that it has built an SRAM memory cell that is ten times smaller than those currently available.
My interpretation of the PRese in this release is that IBM will be able to build 256Mb or 512Mb SRAM chips or integrate 32MB or more SRAM into processor dies for cache applications in the future. Of course, showing some SRAM cell prototypes is a long ways from being able to manufacture this technology in a cost effective way.
There is no information in this PR about the speed or power consumption of SRAM blocks that can be built with this new cell technology. This is not likely to be a potential DRAM replacement for mainstream applications as DRAM already offers more than ten times density compared to SRAM at much better cost."
...they can only store zeros. They are working feverishly on scaling them up to store ones.
And people keep asking me why we don't buy servers from Dell. Maybe I'm old-fashioned in thinking that a company that continuously innovates the technology and not the distribution or marketing channel is whom I should get my technology from...
This is not likely to be a potential DRAM replacement for mainstream applications as DRAM already offers more than ten times density compared to SRAM at much better cost."
I thought the PR implied "Although not as dense, SRAM is many times faster than dynamic random access memory (DRAM).", density is like a also-run.
Rock that crushes, Paper & Scissors that don't matter.
There's so many things wrong with this post...
a) the 64-bit CPUs that exist today have 32-bit instructions.
2) just because the width of the registers is 64-bit does not mean that all data that is processed is now 64-bit wide, 64-bit CPUs are certainly capable of processing 32-bit values.
D) somehow having a value stored as 64-bits doesn't mean you can use it for twice as many things as the same value stored as 32-bits (assuming it fits) or somehow all of your programs/algorithms can use the value twice as much as before (implying twice the work)
The memory modules you put in your computer are composed of DRAM chips. DRAM uses a capacitor and a transistor per cell (plus sense amps, decoders, etc.). DRAM requires refresh (the charge on the cap leaks off) but is relatively low-power and very dense. SRAM uses no capacitors, but more transistors (4 or 6) per bit; it's higher-power but faster and doesn't require refresh.
So, SRAM density has nothing to do with DIMMs you put in a computer. It's used for on-chip caches (and off-chip caches), but is too expensive for main memory. Denser SRAM means that Opteron you've got with 1M L2 cache could have 4M or 8M if IBM can mass-produce the stuff.
DIMMs usually have 16 chips (18 for ECC modules). So, if you have 512Mbit DRAMs, you put 16 of them on a module and you get 8Gbit = 1Gbyte. Gigabit DRAMs can make 2GB DIMMs. 2Gbit DRAMs are needed to make 4GB DIMMs; they cost hundreds of dollars each (and you need 16!), which is why 4GB DIMMs are so amazingly expensive.
High-speed Road Trip (18.000KPH)
That is not true at all. The size of instructions is still 32-bits for all 64-bit RISC CPUs and the instruction size is the same in x86-64 as that of regular x86 (CISC). Now 64-bit CPUs give you the *option* to use 64-bit data/pointers, but you are free to continue using 32/16/8 bit data as you deem appropriate. Code/data size doesn't automatically increase.
Also, you may not necessarily be doing twice the work with a 64-bit CPU - it again depends on your compiler. In theory you *can* do two 32-bit operations in parallel using a 64-bit EX/FP unit if the ISA provides for packed operations (also known as SIMD instructions).
Of course, showing some SRAM cell prototypes is a long ways from being able to manufacture this technology in a cost effective way.
Well, were still talking about IBM here? Do you really think that a few hundred dollars more would even get noticed at clients that buy a server in the 100K range?
The main advantage of buying high-end gear from IBM, Cisco and the like isn't that you get cheap hardware ('cause you simply don't). You buy the gear from that company because you get 10 years in-house service including remote failure detection if you pay for it. That means, THEY call YOU before you even notice one of your tripple-redunant drives has problems. At this point in time, the technician is probably already on the way up to your office.
Sure, it's very expensive. But you save quite a lot by not having any significant downtime...
Seen in that context, 500 bucks more for RAM is IMHO just irrelevant to even think of...
Look, this thing is totally safe! Built it myself, you know. You just press that button like this and then turn that lev
However, I wonder if the additional implementation requirements justify the benefits. Static typing is only found in certain computer languages, and programmers have come to rely on dynamic memory allocation offered by malloc() or similar routines. I suspect with careful design one could fully exploit the advantages present -- with software being cheaper than hardware, it could easily be well worth it in embedded or pre-fabricated devices.
The type of implementor that uses (dynamic) extreme programming methodologies may be left out in the cold, although I would like to suggest that would occur anyway to a person working without a blueprint. Regardless, it will be exciting to see how this develops from the embedded perspective...
Try not. Do or do not, there is no try.
-- Dr. Spock, stardate 2822-3.
IBM just decided they wanted a new measurement system, the number at end of a human hair, or naeoahh. Everyone just needs to get used to this, much as our loc data size measurement, number of volkswagons for volume, and football feilds for length. We now have naeohh for density.
WikiAfterDark.com It's a sex wiki, go now!
Comment removed based on user account deletion
The problem with E-beam is that it is a serial process. In effect you have to draw every single transistor using a megnetic field to move the electron beam.
What makes IC manufacturing in general manufacturable is the fact that you are using photo lithography with an optical mask through which you expose the wafer. This is a parallel process whereas E-beam, which can be good for engineering samples, sucks for manufacturability.
Huh? What you say makes little sense. While it is true that smaller caches with less associativity respond faster, there is no "search through them more quickly" as it is all done in parallel. What makes it faster/slower is the depth of match logic required when you have many way set associativity (all the way up to fully associative which is the most expensive).
Secondly, why in the *world* would you not want to cache more data if you could? Ideally, all of your main memory would be just a large one-way set associative cache for best performance. (Compaq once made a 386 machine that had all SRAM as its main memory. No wait states at all and it was *fast* *fast* but it was also *expensive* *expensive* and upgrades for the memory were incredibly expensive.)
I'm thinking that this post must have been a failed attempt at humor.
They say 50.000 at the end of a human hair. Do anybody know the actual size of this cell?
Not only that, but they don't even mention how many Libraries of Congress those cells store! How am I supposed to compare press releases from different companies if they don't use the complete set of PR specs? Sheesh.
---
Intel released a press release four years ago claiming to have built SRAM memory cells which (to my calculation) work out about three times as small as IBM's. IBM are clearly claiming to have invented the smallest commercially viable memory cells, but not the smallest outside of that context. Maybe /. should try to keep up a bit more.
I remmber back in the days of the release of the original Macintosh II, a lot of articles (In Byte, MacUser, etc.) about the new 68020 architecture stated that main memory in the Mac would eventually be transitioned to SRAM because of SRAM's speed and power-consumption advantages. Cheap and dense SRAM was coming, "real soon", so that extra wait states or caches would not have to be implemented when the 68020 was scaled past 25 MHz.
Then the original Mac Portable came out, with 1 MB of zero-wait-state SRAM as main memory. It cost $7300 with a 40 MB hard disk, and pretty much sucked in every imaginable way.
Excuse me, but isn't the cost of any feature directly related to its size, making the above statement self-redundant?
I mean, a wafer-start is a fixed cost, divided by the number of processors it yields. That makes the area of the processor die directly relate to its cost, and the size of any feature relates to its subcost portion of the overall processor cost.
Or in simple terms: Smaller features should always cost less because you get more of them per fixed cost wafer.
Am I missing something major here?
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
If you can shrink SRAM down to a size equivalent to DRAM, then it CAN serve as an effective replacement, and here is why:
1) No fancy control logic. DRAM needs to be refreshed on a regular basis. SRAM is a straight "chip select, read/write" type of ram.
2) low power. Because it is not being constantly refreshed, it can hold those bits with far less power. Thats why you see NVSRAM and don't see NVDRAM. Imagine having 1 gig of RAM that is battery backed up?
3) One can argue that without the control logic, it will be theoretically faster.
The OP is mistaken when they say that it will never serve as an effective replacement to DRAM. On the contrary, it will be an awesome repleacement.
Feed the need: Digitaladdiction.net
I'd be happy to design a circuit that will store an unlimited both ones and zeros in 19 sq nanometers for free.
If you want to read them back, well that's gonna cost you.
myke
Mimetics Inc. Twitter
And, to add just a little to the parent, more cache is one of the best ways to improve your performance.
If you're working on a 2 Meg file and you only have a 256K cache, then your CPU has to work on a little bit of it, then swap the cache to main memory to work on some more, then again, then again. Each of those swaps takes time, and main memory is way slower than cache. So if you can store most of your work in cache and save the trips to memory, your get much faster speed.
So a webserver with a 2 GHz processor and 8 MB of cache is likely to outperform a 3 GHz processor with 512 KB of cache. Bottom line: bigger, smaller cache is a very good thing.
You, sir, are the man.
You convinced at least five posters that you were serious while simultaneously spouting an illogical collection of computer-related jargon and utterly false statements. I expect to see you here again on April 1st.
Mod me down and I will become more powerful than you can possibly imagine!
Actually, it used to be based solely on the width of the general purpose registers in the CPU, implying that the CPU could process data in those sizes with one ALU operation (the ALU was as wide as the general purpose registers).
The 68000 you mention was considered a 32-bit CPU, or at least a 32/16 (32-bit internally, 16-bit data bus) as was the 68010 and it wasn't until the 68020 when it had a 32-bit data bus as well. There was even a variant that was the 68008 that was identically equivalent to the 68000 except it had an 8-bit external data bus. I have occassionally seen the 68000 referred to as a 16-bit processor, but the vast majority of times I've seen it discussed call it a 32-bit CPU.
As far as I remember, no processor has ever been 'named' relative to the address bus width. Even in the PC world I can't remember any time when the CPUs were 'named' relative to the address bus. Many 32-bit CPUs didn't have the full 32-bit address bus externally accessible. I can't think of any 64-bit CPU that has the full 64-bit address bus externally accessible.
As far as cache storage of data, the data formats are independent of the pointer size. The blanket statement that a 64-bit CPU effectivly 'halves the cache' is not true. If you have a program that deals only with pointers, then you'd have a point but 'int' on a 64-bit CPU is the same as 'int' on a 32-bit CPU (both being 4 bytes or 32-bits). It *is* true that a program compiled for 64-bit ISA will use *some* more cache, but that is entirely dependent upon the program itself. Again, a program that uses pointers a lot may use more cache than the same code on a 32-bit architecture, but a program that uses few pointers may not use much more (if any more) than the 32-bit counterpart, depending upon the datatypes used.
Nope, think a pendulum locked at the maximum swing position. Ok, I'm being cerebral so think a stat ram cell like two inverters interlocked so that the output of one forces the other in a state whose output forces the first one into the same state it was before like an oscillator that dosen't oscillate (it would if the second's output would force the first inverter to change state rather than maintain the current).
It's like a flipflop but wired differently. It's wicked fast because you don't have to refresh the ram pages and also reading isn't destructive (DRAM reading on the other had is because the charge in the MOS transistor has to be driven to the sense amp in order to measure it and decide whether it's a 0 or a 1). DRAMS are slow because you have to wait for the correct time slot while the chip isn't refreshing.
Mi domando chi à il mandante di tutte le cazzate che faccio - Altan
They say 50.000 at the end of a human hair. Do anybody know the actual size of this cell?
First match on Google for diameter of human hair is:l
http://hypertextbook.com/facts/1999/BrianLey.shtm
Quote: "In my research, I have found the diameter of human hair to range from 17 to 181 [micrometer/microns]."
Assume a circular hair of 100 microns diameter, and assume the end of it is a flat circle of area Pi*Rad^2, or 7854 micron^2, divide this by 50000 and you get 0.157 micron^2 per SRAM cell.
The article mentions how IBM's SRAM cell is 10 times smaller than the current smallest. A Google for smallest SRAM cell gets you the Intel press release in March 2002 (too old?) that claims a 1 micron^2 SRAM cell.
Sounds about right to me. Given the range of hair diameter from 17 micron to 181 micron, the corresponding SRAM sizes would range from 0.0045 micron^2 to 0.51 micron^2. For exactly 0.1 micron^2 (a tenth of Intel's 2002 record), the hair diameter should be 80 micron.
Also, looks like the hair width varies too much from person to person to make it a realiable metric!
And explain: What are you saying you like, Dell, or IBM? Because I can't figure it out.
- It's not the Macs I hate. It's Digg users. -