Reduce Transistor Power Consumption

Woohoo by matr0x_x · 2005-12-06 18:46 · Score: 4, Interesting

This may not sound like that big a deal, but let me assure you this is very significant to wireless infrastructure enhancement. One of the biggest limiting factors in wireless devices is power consumptions, so this is great news for the industry!

--
LINUX ONLINE POKER: Linux Poker

Re:Woohoo by 0racle · 2005-12-06 19:09 · Score: 2, Informative

How could this not sound significant to the general users of this site. The latest Athalons and Pentiums use how much power again? If these guys patent this idea, I'm guessing it could make them quite rich.

--
"I use a Mac because I'm just better than you are."
Re:Woohoo by Anonymous Coward · 2005-12-06 20:03 · Score: 0

If these guys patent this idea, I'm guessing it could make them quite rich.
They better hope creative labs doesn't beat them to it.
Re:Woohoo by Anonymous Coward · 2005-12-06 20:05 · Score: 0

Pentiums? Don't you mean Pentaliums?
Re:Woohoo by Anonymous Coward · 2005-12-06 21:36 · Score: 0

Wireless devices produce electromagnetic radiation ("signal"), for that reason they consume much power.

This technique could enable another speed step of CPU's if it can be quickly adopted to industrial production process.

if their claim about improvement of 10^4 - 10^5 is true, it will surely remove this kind of bottleneck for at least another decade!
Re:Woohoo by SlashSquatch · 2005-12-06 23:05 · Score: 0, Troll

Maybe I missed something but all I found was an unsupported claim. Nothing on how they arrive at their results, nothing on what the results are, in short, nothing. A bunch of nurds speculating on an empty claim doesn't change that.

--
Autonomous Retard -- Is your camp safe? UnsafeCamp.com
Re:Woohoo by tedgyz · 2005-12-07 02:37 · Score: 1

W00t! Now they can put those low-power tracking devices in our necks.

--
"No matter where you go, there you are." -- Buckaroo Banzai
Re:Woohoo by oringo · 2005-12-07 03:06 · Score: 1

It may not make a big difference with the high-frequency chips like gigahertz CPUs. Semiconductor power consumption is mainly two components: static DC consumption and dynamic capacitance load. The gate leakage current is what makes static DC consumption high, but at high frequency, dynamic capacitive load dominates the power consumption, because you are charging/discharging billions of tiny capacitors at the speed of 1 trillion times a second. It's kinda like switching to drive and reverse gears in a car 1 thousand times a minute and pushing the gas pedal at the same time!
Re:Woohoo by corcoranp · 2005-12-07 03:32 · Score: 0

Here is an interesting article about comparing the Athlon & Pentium's Power consumption at 90nm.

This is an incredible breakthrough, transistors are the most important invention of the 20th century!

--
Peter Corcoran
Re:Woohoo by Anonymous Coward · 2005-12-07 05:55 · Score: 0

Warning: Talking out of my rectum here.

My impression of the significance of this is that the gate leakage was an obstacle to further transistor size reductions. Smaller is less capacitance, which is less dynamic power.

Though you are correct, leakage has the direct effect on static power, which is pretty much insignificant in general purpose microprocessors.
Re:Woohoo by Sigy · 2005-12-07 13:57 · Score: 1

There are three reasons why this will not have a big impact on wireless devices.

First - other than the digital parts of the device usually BJTs are used which have a leaky base as it is so improving the MOSFETs will not do anything for the BJT parts of the circuit.

Second - even if you make the whole device with MOSFETs the real power consumption is in parts like the transmitter and receiver which have orders of magnitude more current going through them than the leakage current.

Third - most wireless devices do not use cutting edge speed digital parts. So it would be a while before they would use processes with extreme leakage currents anyway.

I do not mean to take away from this accomplishment - it is very interesting and will help reduce power consumption in many applications, but wireless is not where I would expect the big gains.

How do you reduce tunneling current? by Beryllium+Sphere(tm) · 2005-12-06 18:49 · Score: 4, Interesting

The press release says they're getting several orders of magnitude less tunneling current through gate insulators. But tunneling happens because some portion of the electron's wavefunction extends to the other side of the insulator. Whst are they changing that would affect the physics? Or are they fixing a different kind of leakage and getting the press release wrong?

Re:How do you reduce tunneling current? by cagle_.25 · 2005-12-06 19:08 · Score: 1

Tunneling is a function of both the physical width and potential energy height of the energy barrier. Obviously, they don't want to increase the width. I don't know the details, but I would surmise Warning ... he's making this up ... that their process simply increases the (height of the) potential barrier from drain to gate.

--
Human being (n.): A genetically human, genetically distinct, functioning organism.
Re:How do you reduce tunneling current? by soundsop · 2005-12-06 19:29 · Score: 5, Informative
It's easy to reduce the tunneling current through the gate. All you have to do is increase the thickness of the insulator. Unfortunately, this has the detrimental effect of reducing the effective capacitance of the gate, which in turn lowers the amount of current conducted by atransistor of a given size (lowering the current also lowers the speed). To make up for the lowered gate capacitance, researchers have been trying to increase the dielectric constant of the insulator. I'm guessing that they're proposing a method to increase the dielectric constant of the gate insulator. The devil is in the details of improving the dielectric constant without screwing up later processing steps or reducing the mechanical integrity of the wafer, etc.

Summary:
1. To lower gate leakage simply increase dielectric thickness.
2. To make up for lost speed due to higher dielectric thickness, increase dielectric constant.
3. Profit.
Re:How do you reduce tunneling current? by hopey · 2005-12-06 19:56 · Score: 1

The tunneling goes through different kind of defects in the oxide material (crystal imperfections impurities etc). I guess in this case the RTA (rapid thermal annealing) improves the oxide quality by removing the crystal imperfections ( by rearraging the oxide in high temperature) and maybe by gathering the impurities by diffusion...
Re:How do you reduce tunneling current? by saifatlast · 2005-12-06 20:38 · Score: 5, Funny
I'm no expert, but even I can see that you missed a step here. Here, I'll fix it for you.
1. To lower gate leakage simply increase dielectric thickness.
2. To make up for lost speed due to higher dielectric thickness, increase dielectric constant.
3. ????
4. Profit
Hope that clears things up for everyone.
--
note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't regist
Re:How do you reduce tunneling current? by HateBreeder · 2005-12-06 21:21 · Score: 1

I thought that using CMOS tech. the larger the gate capacitance is, the lower the speed, since you have to "charge" that "capacitor" in order to deliver a signal.

Increasing the gate insulator would in turn, perhaps affect the electric field strength required to create or deplete the SourceDrain Tunnel - which in turn, requires more current -> higher power consumption.

So these guys found a way to reduce leakage current without increasing the insulator size...?

Or Am I getting this wrong?

--
Sigs are for the weak.
Re:How do you reduce tunneling current? by Antique+Geekmeister · 2005-12-07 00:55 · Score: 1

This is basically true. But preventing leakage is not just due to the thickness of the insulator. The inevitable flaws in the insulator, and the unevenness of its formation, are a big source of leakage. Just as eggs stick to the pan and burn where you failed to put enough cooking oil, you can fix it by adding more oil (which you may not want to do on your diet), or by heating the pan first and swirling the oil around to spread it evenly before you add the eggs.

The process these gentleman describe involves spreading the insulator more effectively, and does seem reasonable. Whether it's cost effective and works in production is another story.
Re:How do you reduce tunneling current? by rufty_tufty · 2005-12-07 01:41 · Score: 1

Actually to get faster people are trying to reduce the capacitance with low-K dielectrics. Not increase (hint: more capacinance = more charge to transfer = slower switch; if you can transfer less current to turn the switch on, it'll work faster). The reason to thin the gate is partially to do with Vp (propagation delay) issues, partially to do with just making the device physically smaller and partally (if I remember this bit of semiconductor physics correctly) to do with getting the gate polysilicon itself closer to the conduction channel causing a more severe depletion layer(and therefore higher conuctivity and faster turn on and turn off).

First google link on Low-K:
http://nepp.nasa.gov/index_nasa.cfm/934/

--
"The weirdest thing about a mind, is that every answer that you find, is the basis of a brand new cliche" -
Re:How do you reduce tunneling current? by jank1887 · 2005-12-07 01:46 · Score: 1

Sorry to nitpick, but the SiO2 formed for typical gate oxides would be fused silica (semi-amorphous) not crystalline SiO2 (quartz crystal). Although they don't mention the specifics, typical RTA schemes only take the wafer to about 1100C, and the divitrification process of going from silica to quartz drops to an almost zero rate below that. So, assuming they don't super-heat the wafer (there would be a lot of resulting issues with any previous Silicon diffusions, and I thought quartz had worse gate-oxide properties anyway, but I'm not certain) there is no long-term order of the SiO4 groups, so removing crystal imperfections probably isn't the correct descriptor to use.
Now, gate oxide quality is directly related to the density of the oxide, as higher densities indicate better uniformity and less "empty space" for impurities to creep in. I would think the key would be to either drive out impurities, or keep them from finding a home in the oxide in the first place. Gathering the impurities would be a worse condition. If you previously had lower density evenly distributed impurities, but then somehow gathered them to a bunch of local spots, you've basically introduced local charge and field asperities that could serve as points of initiating failure. Then again, if things are below the failure threshold, I'm not sure if having a few really bad spots is better or worse than having a lot of mildly bad spots.
Re:How do you reduce tunneling current? by marcosdumay · 2005-12-07 03:17 · Score: 1

You can also increse the energy neded to transpass the insulator. that will reduce the leakage current maintaining its thickness. Of course, you need a better insulator for that. And discovering what is this "better insulator" is a huge problem, and buiding it on a chip, another problem as big as the first one.

--
Rethinking email
Re:How do you reduce tunneling current? by soundsop · 2005-12-07 05:30 · Score: 1

Low-k dielectrics are for the insulator between wires. High-k dielectrics are for the insulator between the gate and channel.
Re:How do you reduce tunneling current? by rufty_tufty · 2005-12-07 05:57 · Score: 1

Ah it's comming back to me now...
Isn't it something like the increased capacitance causes a greater charge to accumulate which enhances the depletion region thus increasing the conductivity of the silicon?

--
"The weirdest thing about a mind, is that every answer that you find, is the basis of a brand new cliche" -
Re:How do you reduce tunneling current? by soundsop · 2005-12-07 18:08 · Score: 1

My apologies. You are indeed correct.

What does this mean? by BikeRacer · 2005-12-06 18:51 · Score: 1

If Intel could apply this technique to existing P4 chips that burn ~150 watts what would the savings be? 10,000 - 100,000x less leakage current is how much of the equation?

Re:What does this mean? by crashelite · 2005-12-06 20:14 · Score: 1

haha why would they apply it to the P4 wouldnt that be a waist of time... look at their new processor line power consumption 108 watts @ peak

--
(yes i know i suck at spelling fell free to correct my grammar and/or spellin i dont care, im still not going to change
Re:What does this mean? by bbrack · 2005-12-07 01:45 · Score: 2, Insightful

considering the fact that static (leakage) power should be
the real killer in microprocessors is the dynamic power - for wireless/dsp/ucontroller type applications, it could be pretty huge

honestly, the article has so few details it's impossible to tell what they're really doing, but i am pretty sure that most companies out there already use RTP on there gate oxide...
Re:What does this mean? by pyser · 2005-12-07 08:52 · Score: 1

Actually, Intel has their own solution to this problem - redesign transistors out of different materials that switch faster and consume less power.

On on U of K... by tom8658 · 2005-12-06 18:51 · Score: 0, Offtopic

Seriously, we have some really good programs. Hank Dietz, Bill Dieter, and Tim Mattox have some exceptional results in parallel computing. Until recently, their $40,000 home-made cluster beat UK's million dollar HP Superdome cluster in Linpack ratings. The $40k even factors in the cost of student labor (in the form of pizza) to wire the cluster.

I just wish I could say the same for our CS department... It's been getting steadily better since the College of Engineering adopted it, but they switched to M$ Visual Studio .NET this year, and that really worries me... program internals shouldn't be hidden from the student at lower levels of computer science.

I hope I didn't /. aggregate.org too badly...

Re:On on U of K... by kramthegram · 2005-12-06 18:58 · Score: 0, Offtopic

This makes me feel so much better about my Grad School choice. I will be starting my TA position at UK in January, less than a month away!
Re:On on U of K... by tom8658 · 2005-12-06 18:59 · Score: 1

Cool... what department are you in?
Re:On on U of K... by kramthegram · 2005-12-06 19:32 · Score: 1

Computer Science, I don't know the specifics of my position yet, all I know is the stipend amount basically.
Re:On on U of K... by stonefoz · 2005-12-06 19:36 · Score: 1

I'm just glad someone mentioned Kentucky on /. and I don't have to be offended.

--
I think I just cashed out all my cool points.
Re:On on U of K... by Orgazmus · 2005-12-06 22:27 · Score: 2, Insightful

C# _is_ a bit more high-level than C, or for that matter, assembly, which is the language they should be learning.

--
The system had the verbosity of HTML combined with all the readability of compiled assembly viewed as bitmap images
Re:On on U of K... by Anonymous Coward · 2005-12-06 22:36 · Score: 0

The trouble is that Linpack isn't hard to make parallel!

Hopefully the expensive HP Superdome isn't running too many linpack stuff!

Would be interesting to get a ratio of wall clock time spent running benchmarks agains wall clock time running real code!
Re:On on U of K... by Anonymous Coward · 2005-12-06 22:55 · Score: 0

If you go back and write on the articles that you modded, you will get your points back. Give it a go. Idiot.
Re:On on U of K... by Anonymous Coward · 2005-12-07 00:07 · Score: 0

What a relief that Visual Studio ships with a C/C++ compiler!

Not to mention that assembly is probably the worst language to reason about algorithms in, which is what introductory computer science should focus on. Assembly should be learned in a computer architecture class, and then utilized in advanced compiler classes. Supplemental education of assembly language implementation can be provided by means of The Art of Computer Programming.
Re:On on U of K... by vcbumg2 · 2005-12-07 00:32 · Score: 1

I can assure you that 99% of the time the Superdome is used for research and has nothing to do with benchmarks. Dr. Dietz clusters spend 99% of there time calculating ways to make his clusters faster.

--
projects @ http://spectechnologies.net
Re:On on U of K... by Anonymous Coward · 2005-12-07 05:41 · Score: 0

If you're going to grad school at UK:
Studying EE: Be fluent in Chinese
Studying CS: Be fluent in Polish

Otherwise you're screwed, have fun!

Moore's Law by Anonymous Coward · 2005-12-06 18:53 · Score: 0

Moore's law still not dead.

zach.

Prior art? by Dirtside · 2005-12-06 18:53 · Score: 5, Funny

University of Kentucky researchers have discovered a way to reduce the overall power consumption of transistors

Wayyyy ahead of you.

--
"Destroy science and religion. Science would re-emerge exactly the same; but not religion." - Penn Jillette, paraphrased

This is super awesome by Anonymous Coward · 2005-12-06 18:53 · Score: 0, Funny

The whole world will change when computers run faster.

Where's the news? by Anonymous Coward · 2005-12-06 18:58 · Score: 5, Interesting

As probably one of the few semiconductor geeks on /., I have to say: Where's the news? Gate dielectrics are always made with rapid thermal processing on current technologies. Basically, stick a wafer in a chamber, flow some gas, turn on some super-high intensity
lamps, heat the wafer to >1000C for a very brief time, grow a few atomic layers of silicon dioxide (or some variant that includes nitrogen), turn off lamps, cool wafer, take it out of chamber.

From what little info is in the press release, it doesn't sound like they're doing anything revolutionary, so I'm curious why they claim they can reduce gate leakage by so much.

Re:Where's the news? by Jesus_666 · 2005-12-06 22:01 · Score: 0, Troll

From what little info is in the press release, it doesn't sound like they're doing anything revolutionary, so I'm curious why they claim they can reduce gate leakage by so much.

Potty training?

--
USE HOT GRITS WITH STATUE OF NATALIE PORTMAN (NAKED AND PETRIFIED)
Re:Where's the news? by tedgyz · 2005-12-07 02:40 · Score: 1

Aw! C'mon! Let all us unqualified geeks pontificate in ignorance.

--
"No matter where you go, there you are." -- Buckaroo Banzai

It is already done, old news by karvind · 2005-12-06 19:00 · Score: 4, Interesting

Gate oxides in current microprocessors are around 1.2-2 nm and are grown using RTP (rapid thermal process). A furnace oxidation is too fast. So yes industry already uses rapid thermal anneal (as suggested in TFA) for their gate oxides. Can anyone tell how is the new ?

Re:It is already done, old news by aminorex · 2005-12-07 11:06 · Score: 1

> Can anyone tell how is the new ?

Sure. The new is how this: It's a press release! Whee! Your life will never be the same!

--
-I like my women like I like my tea: green-

Cuts 75% of power usage in current generation by Markus+Registrada · 2005-12-06 19:02 · Score: 4, Insightful

In the latest generation of processors, 50% - 75% of the power consumption is this gate current leakage. In the next generation, it was looking to go over 90%.

What this really means is that the next generation has just become possible. As an incidental side benefit, current-generation laptops will be able to run cooler.

Re:Cuts 75% of power usage in current generation by eechuah · 2005-12-06 19:32 · Score: 1

This is bullshit. Most of the leakage is currently subthreshold.
Re:Cuts 75% of power usage in current generation by curious.corn · 2005-12-06 22:27 · Score: 1

what about switching current? with gates flipping @ GHz frequency it's a still a major problem.

--
Mi domando chi Ã il mandante di tutte le cazzate che faccio - Altan
Re:Cuts 75% of power usage in current generation by Markus+Registrada · 2005-12-07 01:14 · Score: 3, Informative

Switching current runs only when a signal changes. Most signals don't change in most cycles, but the transistor gates leak continuously. Furthermore, as the transistors get smaller, the capacitance per transistor goes down, but the gate current leakage goes up inverse-exponentially -- or did. So, the switching current really goes up only linearly with the clock rate, but there was no upper limit for the leakage current.
Re:Cuts 75% of power usage in current generation by dmatos · 2005-12-07 02:42 · Score: 1

Are you sure you're talking about the same type of leakage? Actual dielectric leakage? There is also the leakage from S to D in a CMOS circuit, which you get when one of the gates does not turn completely off. I can see that as a much more likely source of power usage than dielectric leakage.

--

It may look like I'm doing nothing, but I'm actively waiting for my problems to go away.
--Scott Adams
Re:Cuts 75% of power usage in current generation by Anonymous Coward · 2005-12-07 04:41 · Score: 0

Wait a minute with the following assumptions:
V supply=3V
leakage per gate 1 nA dc
100,000,000 transistors

power dissipation is 0.3W not 75% of 150W (112W)

Gate current is the total current in a gate, the dc leakage and the ac switching current.
Leakage current is the current that flows with dc bias on the gate only.

Gate current with the clock running is much greater than the dc leakage current. It is probably 1000 times the dc leakage current.

I haven't looked at the leakage currents lately, but most devices I work with have leakage in the 1 to 10nA range. The real dissipation problem is the number of devices that must switch and how often they switch. The leakage dissipation is probably only significant when the chip is not connected to a clock.
Re:Cuts 75% of power usage in current generation by scatterbrained · 2005-12-07 05:43 · Score: 1

Maybe not the next gen, but maybe the one after it.

It's all well and good in a research lab, but a lot has to happen before this
mystery technology gets rolled out into a mainstream fab. Not knowing much about
it (the linked article was really uninformative) you would have to look at how
much equipment is necessary for these extra steps, how many more steps does it
add to the wafer fab process, what immediate yield impact it has, and are there
any long term effects, does it impact the design rules you work under, etc.

After the research gets presented, if it's really anything to get excited about
I would expect someone like EE Times to have an artcle with enough detail to
at least figure out roughly what they're doing.

--
-- All that's left of me, is slight insanity, whats on the right, I don't know. -- Bob Mould
Re:Cuts 75% of power usage in current generation by evangellydonut · 2005-12-07 07:23 · Score: 1

I guess your processor sits around doing nothing all day... 'cuz if you look at pure statistics, at 90nm, leakage current is something like 2% of active current...

Must be late... by cagle_.25 · 2005-12-06 19:03 · Score: 2, Funny

Heh...for just a second as my eyes hit the headline, I thought that the researchers had discovered some "direct tunneling" from Kentucky to the United Kingdom.

--
Human being (n.): A genetically human, genetically distinct, functioning organism.

Re:Must be late... by Anonymous Coward · 2005-12-06 19:10 · Score: 0

Not all of Mammoth Cave has been explored, you never know!
Re:Must be late... by Duhavid · 2005-12-06 19:17 · Score: 1

They did. Shhhh.

We had to change the title quick like after the truth was printed.

Cant have that getting out.

--
emt 377 emt 4
Re:Must be late... by Fred_A · 2005-12-06 21:51 · Score: 1

Ah so they lower consumption by plugging into 110V instead of 240 ?

I'm not sure that's the best potential application for their tunneling system. :)

--

May contain traces of nut.
Made from the freshest electrons.

they have it right by Anonymous Coward · 2005-12-06 19:04 · Score: 0

leakage current happens when the transistor is in "off" state, in other words no channel is formed between the source and drain (for your run-of-mill MOS). at this time the only way for electrons to go through is by tunnelling, and as you noted, the wavefunction extends to the other side easier with each shrinkage of the gate width.*

that said, i am much more interested in what exactly is the "thermal process" they are talking about.

*wavefunction's extention can be affected by several things, AFAIK: e.g. voltage across the gate, dielectric constant.

size vs heat by esac17 · 2005-12-06 19:09 · Score: 5, Informative

What you have to remember about heat is that electronics only get hot because they are never perfect conductors nor perfect insulators {though we can make nearer-perfect insulators than we can conductors}. A perfect conductor will never get hot, no matter how much current you put through it, because the voltage drop across it will be nil and power = voltage * current. Nor will a perfect insulator, because this time, the current through it will be nil.

CMOS is based around two transistors, a P-channel FET which goes conductive when the gate is driven low, and an N-channel FET which goes conductive when the gate is driven high. The P-FET is trying to pull the output high and the N-FET is trying to pull it low. Both the gates are joined together, and this is the input. This is a simple NOT gate.

For a NAND gate, where any input 0 will drive the output to a 1, we have several P-FETs in parallel trying to drive the output high, and so many N-FETs in series trying to drive the output low. Each P-FET gate joined to an N-FET gate is one input. When they are all high, all the N-FETs turn on allowing the output to go low; when any one is low, the chain of N-FETs is broken, one or more P-FETs turn on, and the output goes high. For a NOR gate, where any input 1 will drive the output to a 0, we put the Ns in parallel and the Ps in series. You can make AND gates from NAND+NOT, OR gates from NOR+NOT, and any other combination you like. In fact you really don't need both NAND and NOR, because you can make either one out of the other; but it turns out they're equally as easy to make as each other in CMOS {not like many other technologies}.

In an ideal world this would never dissipate any power, since the input cannot be high and low at the same time so only one of the transistors will ever be on. In practice what happens is that the gates act like capacitors which take a finite time to charge and discharge. They do not switch instantaneously from conductive to non-conductive. So one stops conducting while the other is starting to conduct, and for a brief instant while the inputs are changing state both transistors are conducting a little. It's not a dead short circuit of course, otherwise something would give way ..... hopefully a fuse.

Now every time something changes state, you get a little pulse of heat. Which is why fast processors need cooling. Additionally, to make sure that the logic gate output has changed state before the next clock pulse, you need to make the gate capacitances charge up quickly -- which means using a higher voltage than you could get away with at lower speeds. But 2x more volts means 2x more amps means 4x more watts.

Smaller transistors should have less gate capacitance, and so be capable of switching more quickly.

Re:size vs heat by soundsop · 2005-12-06 20:32 · Score: 4, Interesting

Some clarifications:

Short-circuit current is only responsible for 10-20% of switching power. The rest is dissipated in the transistor through charging and discharing all the nodal capacitances (due to transistor gates, transistor diffusions and wiring capacitance). Since typical circuit styles are non-adiabatic, this charge/discharge power component would not go away even if we could completely eliminate short-circuit currents.

Making transistors smaller certainly reduces their gate capacitance but it also reduces their current drive by the same proportion. These two effects cancel each other out! So how can transistors get faster from generation to generation?

Transistors get faster by increasing electron mobility and/or increasing gate capacitance per unit area and/or reducing diffusion junction/sidewall capacitance per unit area/perimiter and/or reducing (local) interconnect capacitance since smaller transistors are closer together.
Re:size vs heat by william_w_bush · 2005-12-06 21:29 · Score: 4, Insightful

Great comment.
The unfortunate corrolary to:
Smaller transistors should have less gate capacitance, and so be capable of switching more quickly.
is:
Smaller transistors will have less resistance, and so will dissipate more power.

Which is why the P4 prescott, while a marvel on the drawing board, is pretty crappy in reality. 90nm technology has largely been an attempt to find a happy medium between higher capacitance and lower resistance, both of which limit speed. The current "nucular age" of chips is a direct by-product of ignoring the drop in resistance until it was too late.

Also, at 4+ Ghz an current-induced EM field has many of the properties of a microwave beam, which can resonate, and essentially self-focus on any imperfections in the semi-conductor structure, essentially burning small holes in the chip, or causing signal noise unless perfectly grounded (which in itself causes inductive leakage). This is why intel and amd have speed-bins, because the chips with the fewest imperfections are able to perform at the highest clockspeeds without thermal or electric failure.

My point is, the mega-hurts race, even assuming one or more miracles of metal-oxide chemistry, is ending. I look forward to the multi-proccessing race which seems to be heating up, as a long-postponed, but neccessary next step. The sad obstacle holding back the day of 1000-thread chips has been programmers complete lack of willingness to move beyond the single-threaded debugging paradigm. As one myself, I understand why it's seen as hard as it is, but consider it more of a viewpoint shift, rather than an insurmountable increase in complexity. New languages/language changes will happen to simplify threaded programming, and new mechanisms like auto-synchronized data structures, self-unrolling iterands, and integrated message-passing stacks will replace old-standbys. The mega-threading doomsday scenario will fall along the wayside with other past programmer nightmares such as the death of the goto loop and the loss of direct memory access in java and higher level languages, left only as subjects of nostalgia.

Clockspeed is dead, long-live multi-threading.

--
The first rule of USENET is you do not talk about USENET.
Re:size vs heat by elgatozorbas · 2005-12-07 00:04 · Score: 2, Interesting

They do not switch instantaneously from conductive to non-conductive. So one stops conducting while the other is starting to conduct, and for a brief instant while the inputs are changing state both transistors are conducting a little.
Even if they don't EVER conduct (even a little) at the same time there will be dissipation because the capacitance is charged and discharged all the time. Each of these cycles implies that some positive charge moves between the power supply and ground with the capacitor as an intermediate step. This is why the dissipation is proportional to the clock frequency.
Re:size vs heat by rufty_tufty · 2005-12-07 02:13 · Score: 1

Making transistors smaller certainly reduces their gate capacitance but it also reduces their current drive by the same proportion.
Are you sure? As I remember, let's say we take a transistor and halve every dimension. By C=(8.8542 x 10-12 K A)/D this will halve the capacitance. Now we've halved the conduction channel though, so at constant ohms/square (as is a fair model for most FETs in the conduction region) then the resistance of the channel stays constant. Assuming we keep the same drive voltage this equates to a higher drive current and therefore faster transistor.
Whether we can do that part though is admitadly doubtful, but I don't see a direct link between a smaller transistor and a lower drive current. Clearly you would want to reduce the drive current some ammount otherwise you power/area goes through the roof! But certainly if you just reduce the gate length, but keep the width constant you can get an increased drive current...
But all of this isn't taking into account the effect a reduced size will have on the gain of the transisto - I'll have to dig out my notes for that bit...

--
"The weirdest thing about a mind, is that every answer that you find, is the basis of a brand new cliche" -
Re:size vs heat by zopf · 2005-12-07 17:55 · Score: 1

Clockspeed is dead, long-live multi-threading.

So true. Nature "figured this out" long ago. For proof, check out the massively parallel machine that is your brain. It would be interesting to make a simple processor that could be tiled ala VLSI and interconnected to its neighbors or a center controller. Massive multi-threading will work best when the hardware matches the software concepts.

--
Did you see the pool? They flipped the bitch!

uh... bad name for a university? yeah by Mashdar · 2005-12-06 19:43 · Score: 0, Offtopic

University of Kansas uses the acronym "UK"... This was slightly confusing during football season, but I figured our school must have been playing a team from the USA. In the title of science article, however, I sure as hell don't assume Kansas origion.

How did they do it? by penguinoid · 2005-12-06 19:48 · Score: 2, Funny

"It was really simple," said the researchers at the University of Kentucky. "We siply asked the Flying Spaghetti Monster to fix the problem for us, and He did." </flamebait>

--
Don't waste your vote! Vote for whoever you want, unless you live in a swing state it won't matter anyways

Re:How did they do it? by Anonymous Coward · 2005-12-06 21:16 · Score: 0

kentucky != kansas

Chip insulator sas capacitors? by MsWillow · 2005-12-06 19:54 · Score: 1

As speeds increase, won't leakage also increase because the insulators are, in effect, capacitors? At RF speeds, power flows through capicitors.

I'm not a chip designer, just a ham radio bug, so I don't know if this problem has already been found to be a non-issue. Maybe one of you bright guys knows the answer?

--

Lemon curry?

Re:Chip insulator sas capacitors? by Anonymous Coward · 2005-12-06 20:04 · Score: 1, Informative

You can use a high-k dielectric and get the same capacitance as a SiO2 layer. A good looking material is HfO2. Keep in mind that as transistors are scaled, the capacitance is scaled too -- thus your natural increase in speed.

The leakage is normally due to direct tunneling. Basically the silicon dioxide layer is so thin and the electric field is so high that electrons tunnel between the gate and the source/drain.

I think all this paper is about using some sort of thermal processing to make a higher quality gate and or oxide. Doesn't seem too Earth shattering to me.
Re:Chip insulator sas capacitors? by aXis100 · 2005-12-06 20:44 · Score: 1

Power doesnt really flow through capacitors.....they just support a charge being moved either side of them. When that moving charge does work like speakers or antennas, then yes, there is power consuption.

MOS transistors have acted like capacitors for ages. There's no power flow in the sense of work done, there is just the regular expense of charging/discharging the gate.
Re:Chip insulator sas capacitors? by Antique+Geekmeister · 2005-12-07 00:49 · Score: 2, Informative

Of course power flows through capacitors! You've got charges, you've got voltages, you have that charges flowing from one side to the other with a voltage on them in a certain amount of time, charge * voltage / time = power. If you couldn't transmit or modulate some power, you couldn't transmit or modify a signal. That's basic thermodynamics: if you can't transmit power, you can't transmit a signal.

The "work done" is, to some extent, recoverable when you change the state of the MOS transistor by discharging them. But that work is usually wasted and thrown away by simply discharging it to ground, then recharging it from the power supply, and that energy has to go somewhere and come from somewhere. But that can happen more in the power supply, and to some extent happens as electromagnetic radiation. You can't get rid of those two problems until you start playing with superconductors.

The heat and loss issues these researchers try to deal with are wasted work, trickle currents of keeping the transistor gate signals charged up as electrons leak away from the gate part of the transistor into the signal part of the transistor. They're nasty problems, taking constant current to keep even static signals active and wasting power as heat that has to be dealt with by some means.
Re:Chip insulator sas capacitors? by rufty_tufty · 2005-12-07 02:54 · Score: 1

Is this not a case of terminology problems?
Power "flowing" is a horrid concept. Current flows, power is a measure of work done.
So no a capacitor will do no work (well, capacitors do get hot, but that's the resistive component of their physical construction). If you attatch a perfect capacitor to your AC supply most power meters will register power consumed (after all they're doing V*I) even though the transistor isn't dissapating any heat itself. The point being that the current is out of phase with the voltage so over an entire cycle all energy consumed is returned.

But this is all the field of power factor correction, and I feel I'm preaching to the choir...

--
"The weirdest thing about a mind, is that every answer that you find, is the basis of a brand new cliche" -

Your sig: by temojen · 2005-12-06 19:58 · Score: 1

ummm... twins!?!?

Re:Your sig: by cagle_.25 · 2005-12-07 06:21 · Score: 1

Twins are separate organisms; there's no point of confusion *unless* they are conjoined, which I would consider to be a boundary case. Fair?

--
Human being (n.): A genetically human, genetically distinct, functioning organism.
Re:Your sig: by powerlord · 2005-12-07 09:30 · Score: 1

Twins are separate organisms; there's no point of confusion *unless* they are conjoined, which I would consider to be a boundary case. Fair?

Ouch! What the GP was mentioning with twins though, is that your sig "Human being (n.): A genetically human, genetically distinct, functioning organism." (emphisis mine)

According to that definition, maternal twins (triplets, etc) would not qualify as a Human Being since each one is not genetically distinct ... on the other hand perhaps we can look at it as a "Human Being" with Reduntant Backup. :)

--
This space for rent. All reasonable inquiries will be entertained at proprietors discretion.
Re:Your sig: by cagle_.25 · 2005-12-07 12:42 · Score: 1

I wasn't trying to be cold-hearted about conjoined twins; I was just imagining a situation in which it might be difficult to discern how many organisms are present. In practice, it may be that all such cases are lethal abnormalities, so that all living cases of conjoined twins are clearly two organisms.
With regard to normal twins, I agree that the language doesn't parse nicely. I'm not intending "distinct" to mean "unique", but simply "distinct from other organisms which might be present and attached." So for example, tapeworms living in me are not a part of me, because they are genetically distinct from me -- they are separate organisms.
I haven't found a pithy way to word it so that the impression of uniqueness is not conveyed. Any suggestions?

--
Human being (n.): A genetically human, genetically distinct, functioning organism.
Re:Your sig: by powerlord · 2005-12-20 06:20 · Score: 1

Didn't think you were trying to be cold hearted. I actually thought you were trying to make a pun about co-joined twins being a 'boundry case'. :)

--
This space for rent. All reasonable inquiries will be entertained at proprietors discretion.

WTF is rapid thermal processing? by arrrrg · 2005-12-06 20:08 · Score: 2, Interesting

Anyone know? For once Wikipedia isn't much help.

Re:WTF is rapid thermal processing? by nsaspook · 2005-12-07 05:54 · Score: 1

http://www.keyassets.com/preview.taf?itemNr=4286&_ UserReference=DDFE42B205D4B85A42F986FE

--
In GOD we trust, all others we monitor.

PlayfullyClever, eh? by Ospeovedizer · 2005-12-06 20:22 · Score: 5, Interesting

So, did ScuttleMonkey not notice that the submitter's name was PlayfullyClever backwards? The one whose website says that the vast majority of /. posts are "blatantly plagiarized"? Although the news seems real enough to me, the submitter's name and website raised some pretty big alarm bells, especially since their site now says:

"okay, so we are going to win slashdot again, this time with a different game plan, keep your eye out for our new name.. it is VERY playfully clever."

Hmm... As I said, the news seems real enough, but the submitter is a fake.

--
"We demand rigidly defined areas of doubt and uncertainty!" - Vroomfondel, H2G2

Re:PlayfullyClever, eh? by Anonymous Coward · 2005-12-07 00:48 · Score: 2, Informative

i checked the program at the 2005 International Semiconductor Device Research Symposium (http://www.ece.umd.edu/isdrs2005/program.html/) and sure enough, today at 4:35 PM, their paper is being presented: Dramatic Reduction of Gate Leakage Current of Ultrathin Oxides Through Oxide Structure Modification, Zhi Chen et. al, University of Kentucky
Re:PlayfullyClever, eh? by g0at · 2005-12-07 02:58 · Score: 1

The submitter is not a "fake". On their site they are quite up-front and honest with their tactic. The real problem is that the slashdot editors are morons and don't give a shit.

-b

--
myselfmusic
Re:PlayfullyClever, eh? by Surt · 2005-12-07 03:02 · Score: 3, Informative

I don't really see how it's possible for the submitter to be fake. Either he submitted the story or he didn't. Apparently, he submitted the story.

Now, he might think the joke is that he's posting 'news' from a news aggregation site to a news aggregation site, but meta news is the only kind of news slashdot gets anyway, and that's what we come here for.

All in all, if he's scamming slashdot, he can only be doing it if EurekAlert is a fake, which it certainly doesn't look like at first glance, though I notice that in an unusual move for a meta-news site, it doesn't have links to originating information. That is somewhat suspicious. Still, if true, it's an incredible effort he's putting in just to scam slashdot stories.

Further, it would have to be a long term scam plan, since the UKY story in particular is real:
http://news.uky.edu/news/display_article.php?artid =844

So at best he's trying to build credibility as an article submitter for a later scam.

--
"Who is the Journal of Quantum Physics going to believe?" --Stephen Hawking
Re:PlayfullyClever, eh? by Jambon · 2005-12-07 16:41 · Score: 1

I don't really see how it's possible for the submitter to be fake. Either he submitted the story or he didn't. Apparently, he submitted the story.
He submitted the story until you view it, at which point, he didn't.

Physics by kf6auf · 2005-12-06 21:01 · Score: 4, Informative

A quick lesson in quantum physics:
Basically, tunnelling occurs because an electron can get from one side of a potential barrier to the other without ever being in the forbidden region (the width of the barrier, where the potential energy exceeds the total energy of the electron) due to it existing as a wavefunction that does not collapse until you observe it. Anyway, the chance of an electron penetrating a simple potential barrier like the gate of a transistor is a function of the height of the barrier (voltage applied to the gate), the width of the barrier (gate length), and the energy of the electron (voltage across transistor + electron thermal energy).

So ways to decrease tunnelling include:

Longer gate, but slower. Wanting smaller transistors and faster speeds is the whole reason we're having this problem.
Increase gate voltage or decrease transistor voltage. Unfortunately these two are coupled. They might not exactly cancel each other out, but they make things difficult.
Decrease the thermal energy of the electons. There are a couple ways to do this. One involves liquid nitrogen; the other involves something like making electrons climb further out of their holes to become free (fairly easy by introducing impurities into the silicon), resulting in less electron energy and so less tunnelling. Also less current in general though, so this might be prohibitive for some other reason.

Just my $0.02 since if I knew for sure I'd be making 6 figures somewhere and not applying to grad schools...

Re:Physics by DaCapn · 2005-12-07 01:50 · Score: 1

Nice post (generally a breath of fresh air when we see one of our own).
Something I'm a bit surprised that no one has mentioned yet is the capacitance of the p-n junctions in the transistor. Don't forget changing the dielectric constant and the potential has an effect on this aspect of the component as well. To people asking "how important is such a capacitance?" Well, the capacitance causes frequency dependent behavior (you have just introduced an RC time constant).
An area where I have dealt with this personaly was with a Si quadrant photodetector (basically involved pointing a laser at 2 photodiodes and letting the beam spot oscillate between the two at increasing speeds. An output signal essentially shows which detector has more optical power incident on its photodiode, equally shared power yields zero signal). When the beam switches too quickly, the detector can't tell that the beam is actually moving (the detector mistakes it for a smear of light across both photodetectors). Just one real world (hah... to some) example that I happened to have in my arsenal.
Bringing my point back to transistors... you can clearly see how this would be important if it has an effect on the rate at which information can be exchanged and processed!
Thermal/dark/reverse leakage current will always remain a concern. In the physics world, we're generally only concerned when it comes to a single diode detector that we're using to take measurements with and at that point the liquid nitrogen cooling is the simple and obvious solution. Furthermore we have that luxury because we only have one diode... It isn't practical to have a reservoir of liquid nitrogen in our home computers that needs to be filled regularly to cool (or else damage) the hundreds of transistors contained within. Doping is an interesting thought. I just wonder, as you said, what effect it will have on the current gain of the transistors if this many impurities are introduced and how well it can still be used for its basic function.

Final solution: Beat the... by Anonymous Coward · 2005-12-06 22:24 · Score: 0

Final solution: Beat the speed of light into the ground. Anything else is a baby step. Since a pico-second is , you either have to make shrink everything, or make a pico-second . Class dismissed.

Other Applications by EBFoxbat · 2005-12-06 22:35 · Score: 3, Insightful

Assuming they really have discovered a way to lower power consumption (forgive me for not understanding semi-conductor principles) would it not be applicable to other semiconductors? I immediately thought about cell phone/mp3 player battery life and other such things. Even so far as to think about laptops. I (roughly) understand the not-so-much-wasted-power train of thought, and heat reduction from a CPU core and all, but wouldn't this have just as much effect on battery-powered devices? Or am I just being an ass again?

Re:Other Applications by Botty · 2005-12-07 05:30 · Score: 0

Yes.

Almost right. by dtmos · 2005-12-06 22:45 · Score: 0

The leakage path relevant to tunneling is through the gate oxide, from the gate to the channel below it. In this case, the width of the barrier is the gate oxide thickness, not the gate length. So the ways to decrease tunneling include having a thicker gate oxide, but of course it'll still be slower (less capacitive coupling of the gate to the charge in the channel). A representative paper reviewing gate tunneling and its effects on logic gate performance is this one (in pdf).

Also, the height of the barrier is determined by the material properties, not the gate voltage. With that said, I still don't understand how the authors can do what the press release says they say they do. How does RTA affect the material properies enough to affect tunneling significantly? MOS gate oxides are one of the most studied materials known to man, with uncounted man-millenia devoted to eliminating any defects therein. What did they miss?

A final thought--if this was such a fundamental breakthrough one would think it would be presented at the International Electron Devices Meeting itself, rather than at the small conference associated with it held later in the week. But maybe not.

Making a CPU fan that lasts more than a year... by leonbrooks · 2005-12-07 01:24 · Score: 1

...would be a great start. Most modern COTS CPU fans crap out in 2-3 years, tops, but I have a dual PentiumPro 200 box under my desk which is showing no sign of wearing out a fan after ~10 years of continuous service.

Let's not even discuss my underengineered AOpen laptop with its hopelessly inefficient (and now defunct) fan, other than to say that bypassing 75% of its heat generation would be -ing marvellous; the hard disks might also last more than a year, and the battery be worth something again (maybe with some help from white LEDs to replace the LCD screen's illuminating flouros).

Fixing much less glamorous issues might actually have more of an impact on power consumption than pushing the boundaries of physics, but I'm all in favour of doing that, too.

--
Got time? Spend some of it coding or testing

Re:Making a CPU fan that lasts more than a year... by bigtrike · 2005-12-07 05:19 · Score: 1

Have you tried buying ball bearing fans? You'll pay a very small amount extra, but they'll last several times as long as sleeve bearing fans. Your current CPU fan is also trying to dissipate a lot more heat than your ppro fan, so it's going to wear faster.

"University of Kentucky researchers" by rjfan · 2005-12-07 01:53 · Score: 1, Flamebait

Although I'm sure these guys are top-notch, research isn't what comes to mind when I think of Kentucky.

Re:"University of Kentucky researchers" by Anonymous Coward · 2005-12-07 02:42 · Score: 0

Maybe thats why they are in the research lab and you're sitting behind your computer making retarded crass statements like that. Some people (i.e. you) are not cut out for doing intellectual tasks.
Re:"University of Kentucky researchers" by Anonymous Coward · 2005-12-07 05:44 · Score: 0

this is *not* a flamebait, this is funny!!!
mod parent up please

I am NOT the owner of the parent message

That's Gate Leakage, but what about SD Leakage? by trigeek · 2005-12-07 04:01 · Score: 2, Interesting

From the little information provided in the article, it appears that this takes care of the gate leakage problem, which is great! However, it doesn't address the Source-Drain leakage, which is a larger issue for current process technologies. Gate leakage isn't forseen to be a significant problem until 45nm.

This just tells us that future technologies are not going to have twice the leakage power as current technologies. This doesn't mean that future process technologies are going to have less leakage power than the current ones.

--
Sometimes I doubt your committment to SparkleMotion!

Kentucky, kentucky... by Spy+der+Mann · 2005-12-07 04:05 · Score: 0

so KFC now means Kentucky Fried Chip?

*ducks*

bigger processors? by lazychris317 · 2005-12-07 04:11 · Score: 1

Out of curiousity and because I have been trying to figure this out ofr some time now....why don't they just make the processors bigger? Looking at an old PIII I have laying around, it's 2 in^2. I'm sure P4's aren't that much smaller. What if they bumped it up to 2.5in^2 or 3 in^2? then they wouldnt have to worry about making the transistors smaller because there would be all that extra space. If there is any flaw to my theory, please let me know because in my mind, it seems like a very good solution....

Re:bigger processors? by bigtrike · 2005-12-07 05:24 · Score: 1

The processor would have to be slowed in order to give the electrical signals extra time to travel farther.
Re:bigger processors? by lazychris317 · 2005-12-07 05:29 · Score: 1

didnt even think about that part thanks for the info
Re:bigger processors? by Bad_Feeling · 2005-12-07 05:39 · Score: 1

The problem with making them bigger is money. A bigger die means less yield per wafer, plus a higher chance that there will be a defect in it (such as a spec of dust) and then a larger amount of material will have to be chucked.

--
Disclaimer: On the other hand, I am kind of a psycho...
Re:bigger processors? by Botty · 2005-12-07 07:23 · Score: 0

Economics. They make a certain number of processor cores per silicon wafer now and throw out some due to manufacturing defects. If you are making the cores bigger you are making less chips per wafer but probably still throwing the same number out, thus you have to raise the price per chip. Will the market bear such an increase?

In addition, propogation delay is significant at nanosecond speeds. In fact, the reason that AMD has put the memory controllers on the die is that a signal CANT move from the CPU down to the northbridge fast enough. Larger dies would mean that you have to be careful what you put at the opposite end of the chip because you wont be getting at it all that fast.

On a side note, I think the signals can go about 3 inches in a single clock cycle at current Ghz speeds.
Re:bigger processors? by lazychris317 · 2005-12-07 07:56 · Score: 1

well.....so much for my idea....it sounded good in my head, tho

that is so very not right... by YesIAmAScript · 2005-12-07 04:24 · Score: 2, Interesting

Yes, power dissipated is V*V/R or VI And yeah, smaller transistors have lower resistance. But smaller gates mean less power, not more. You need less current to move the charge in and out of a smaller transistor (since the charge is smaller). So the "I" in the "VI" can go down. Well, that "I" is really a "V/R" (current across a resistance), so lowering that I really means you can reduce the "V". And since the total power is V*V/R, that means the total power used drops drastically.

Let me explain it a little better because I think I even confused myself.

Power is V*I. The I is V/R. Lowering this R means the V/R value does get bigger (current goes up). But also, since the I only needs to be sufficient to fill or drain a gate in a given amount of time (one cycle), you can reduce the V until V/R is a more reasonable value. And when you lower that V you're also reducing the other V in the power formula (V*V/R), so in fact instead of power going up, it goes down greatly.

For a much easier corollary, look at AMD's 130nm CPUs against their directly equivalent 90nm versions. The 90nm versions take half as much power.

Today's nuclear CPUs are mainly because there are so many transistors switching so fast in such a small space. If you built an old-type CPU using 90nm technology (like an Z80 or something) it would take far far less power than the old ones, which ran off of +5V (plug that into V*V/R!). Additionally, current CPUs have a lot of leakage current, something that CMOS didn't have a problem with until we got to sub 180nm processes. Compare a current CPU to an old NMOS or even ECL processor. You'll see how leakage was a problem before and how much of a savior CMOS was.

Additionally, the megahertz race is not over. It may not be the current concentration of vendors, but as chips go to smaller and smaller feature sizes, they naturally get faster. So even with little concentration on speed, we'll still see a rise in individual core speed.

A 1000-thread (simultaneous) chip is a ridiculous idea. That means you have to duplicate every transistor in the chip (like registers) 1000 times. That makes no sense. You will never reach the same speed as current single processor chips with a 1000-thread CPU (at least not right now). A small number of cores is a better idea at the moment.

--
http://lkml.org/lkml/2005/8/20/95

Re:that is so very not right... by william_w_bush · 2005-12-07 09:57 · Score: 2, Interesting

a: I think I misspoke. When I said that at smaller feature size, resistance goes down, I was also considering leakage as a failure of resistance. You're right, the current 90nm designs do use less power, the amd design in particular because it uses SOI to compensate for increased leakage, while I don't believe intel has soi on its chip line yet. At these scales, execution tends to matter as much as size, and my earlier point of blindly scaling down in hope of finding more gains from the magical process shrink becomes much more of a challenge.

2: Also, in a 1000-thread chip, not nearly all the parts of such a chip would have to be replicated. Much of current superscaler design relies on scheduling for as efficient usage of pipeline segments as possible, to allow for the maximum chip usage. This same approach could work, by turning l1 cache into an effective register file, and simply having a part of the chip whose role is to schedule prioritized register sets to the requisite resources. At the extreme this could even allow decoupled pipelines, such that the decode, memory load and IO intructions are executed in a primary stage chip dedicated to preprocessing and high wait state instructions, then passed on via HS interconnect to 1 or more secondary processors to perform the actual arithmetic, even allowing whole chips to perform the ancilliary functions such as the sound or GPU today, while being tightly coupled to the memory and processor state.

My point is that from this point on, threading will scale much faster than clockspeed. Even now, it is easier to make a second core than to double the clockspeed for the faster processors, and unless the leakage issue from tfm is solved soon this won't change.

Plus, once you have the right programming language support, with auto-parallelizing iterands, and self-dispatching async subroutines, who needs a fast chip, the actual instruction pointer, the current limiting factor, becomes meaningless, with all actual work being done parallel automatically, dependencies and resource sharing solving themselves by language design. A 1Ghz multithreaded chip can be as effective as a 4.0 Ghz prescott today, without even including the specialized instructions (SIMD, fast fp) that can boost actual efficiency through the roof, and be compiled in as static asynchronous subroutines dispatched parallel to the "main" thread.

--
The first rule of USENET is you do not talk about USENET.
Re:that is so very not right... by aminorex · 2005-12-07 11:19 · Score: 2

> A 1000-thread (simultaneous) chip is a ridiculous idea.

Strong claim. One must ask: Why?

> That means you have to duplicate every transistor in the chip (like registers) 1000 times.

And this is bad... how?

> That makes no sense.

It makes enormous sense when you're running 1000 threads!

> You will never reach the same speed as current single processor chips with a 1000-thread CPU
> (at least not right now).

Naturlich, since the latter is a figment. But supposing it were real, one would have to ask: Why not? You can't seriously intend that it would be impossible to run each thread at 4 or 5 MHz, as you seem to be saying, so I can only infer that your meaning is not clear to me.

Since threads block so often, by the way, there is an optimal ratio of register files (thread states) to execution units, which will vary with load characteristics and degree of parallelism. System architectures that allow threads to avoid blocking will get more execution per transistor. One good way to do this is to provide hardware call/cc support, but it fails the goal of reducing register realestate.

--
-I like my women like I like my tea: green-

Yeah (unsupported claim) by hackwrench · 2005-12-07 04:39 · Score: 1

I wonder if the article uses UK, which is easily misinterpeted as "United Kingdom" to give the missive more credence. I also note that the actual researcher is of oriental heritage.

As an aside, I'm reminded of the joke which is probably rewritten for numerous groups.

A person on the Indiana side of the Ohio river tells a person on the Kentucky side that he will shine a flashlight across the river and he can walk across the beam. The person on the other side says, "No way! You'll shut the flashlight off when I get halfway across!"

Re:Yeah (unsupported claim) by SlashSquatch · 2005-12-07 07:29 · Score: 1

Hey Ma, put them thar potatoes in the rapid thermal processing unit! When theys done, be shore not to grab thems without your gate insulators.
"some folk'll never eat a skunk, then again some folk'll"

--
Autonomous Retard -- Is your camp safe? UnsafeCamp.com

The UK researcher's secret technique: by ctnp · 2005-12-07 07:09 · Score: 1

The article was purposefully mum on the technique these guys are using, so I'll try to elaborate:

We all know that the fab process for turning silicon into chips is *way* too complex to be explained by ordinary science, so the UK researchers instead brought in 7 Christian ministers to sanctify the process. Prior to etching, the wafer (pun intended) is doped with a mixture of holy water & oil. As the etching process takes place, the ministers intone the "Reverent Petition for Holy Quantumness" in quiet solitude, while reflecting on the needs for fast silicon to spread the Good News of our Lord and Savior Jebus Christ, Son of Man.

---
And if I offended you, oh I'm sorry but maybe you needed to be offended. - Muir

They put a french fry in the microwave by SlashSquatch · 2005-12-07 07:40 · Score: 1

in the UK they call them chips!

--
Autonomous Retard -- Is your camp safe? UnsafeCamp.com

Terrible news for cooks by petantik+f00l · 2005-12-07 09:00 · Score: 3, Funny

This news has made me very depressed.

how can I now be a cook at the same time as programming?

Before my Pentium 4 generated heat enough for frying eggs and I'm sure in a few years I would be looking for recipes suitable for heat generated by nuclear reactor( charcoal egg comes to mind) but now me dream is gone. Damn you University of Kentucky researchers. I hope we never meet

don't compare specialized chips to general purpose by YesIAmAScript · 2005-12-07 11:54 · Score: 1

Even a 1GHz non-multithreaded chip can be much faster than a 4.0GHz Prescott today. It doesn't take threads to best it. All you have to do is jettison all the transistors that you absolutely can do without (16-bit mode, out of order execution) and then replace them with transistors you can use.

If you want to run code across a family of processors, you'll have some wastage of transistors. This isn't avoidable. But it is also critical. You can't just make one CPU and throw it away, you need a family to compete.

As to not replicating the parts 1,000 times, you're correct, you wouldn't have to replicate them all 1,000 times. But if you don't, you now have to arbitrate for those cells, and they are physically farther away (distance is latency) because they aren't co-located anymore. I would suggest that non-replicating thread support is a steeply falling slope. 2 threads is good. 4 not as much, 1000 isn't useful. All that you need to do is keep your transistors all working as much as possible. Having a 2nd thread to run when the first is stuck on memory adds a lot of performance, if you have some compute-bound threads to run. But do you have thousands eligibile to run at any time? No.

Multi-chip CPUs don't really work anymore (your idea of high-speed interconnects to other chips). CPUs are too fast. It takes too long to drive that signal out to the other chip, it's not worth the trouble.

Anyway, I would suggest that to make 1,000 threads run well, most modules would have to be replicated at least 250 times. And if you're gonna do that, don't bother redesigning, just make a 2 or 4 thread chip and buy more of them (or replicate the area of the die).

BTW, you're right, Intel doesn't have SOI. But honestly it appears it is so because they don't need it. And don't use P4 as an example, everyone knows it sucks including Intel. Look at the new (non-SOI) 2-core Yonah that is roughly AMD A64 X2 3800+ speed and uses half the power.

Honestly, Intel are incredible at process technology. They have a lot of secrets and are way ahead of most of the industry. When the stories about the idea of using "strained silicon" came into the news, Intel had already SHIPPED chips using it. I had one in my PC it turned out!

If Intel switches tracks and gets as good at chip design as AMD is (and it appears they are at least trying now), we're going to see some spectacular stuff.

Finally, note that not nearly all tasks cannot be parallelized effectively. CPU speed will still have its place because of this. Don't write off higher clock speeds. When I started with computers, chips ran at 500KHz or 1MHz. Now even decent chips (say, AMD or Intel P-M) run at 2400x that. There's no reason it won't go up 10x again over the next couple years. And given that even a slow PC right now is very fast for all nearly all normal tasks (Excel, anyone), what percentage of the market going to need a gaggle of cores then?

BTW, I find this discussion interesting. Too often things on slashdot turn into name calling and hate-fests. I may not agree with you on this topic, but it's pretty clear that we both at least recognize there are two legitimate sides to discuss.

--
http://lkml.org/lkml/2005/8/20/95

I saw this presented at ISDRS today... by qwertykid · 2005-12-07 15:08 · Score: 1

I'm attending ISDRS in Bethesda this week and presenting a paper tomorrow. It was interesting but I'm not entirely sure how well it would work. I'd have to see the actual data (which usually isn't shown in such presentations) and try it. It had to do with passivating the oxide-silicon interface with deuterium instead to reduce the number of hot carriers injected through the oxide. My review: interesting but I want to repeat it and see it in action.

Re:I saw this presented at ISDRS today... by qwertykid · 2005-12-07 15:11 · Score: 1

edit: deuterium instead of hydrogen... I'm obviously tired.

Re:don't compare specialized chips to general purp by william_w_bush · 2005-12-07 15:58 · Score: 1

Good reply.
I'm not trying to write-off chip speeds, but as a programmer I'm aware we can't just count on the fact that this bloated, single-threaded program/game which slugs around now will work fine 1 year from now when the processors catch up.

Second, multi-chip cpu's do work, the gpu is essentially a graphics slave to the primary, specialized for its tasks, with access to primary memory, and using specialized (tho with need of improvement) mechanisms to offload work from the cpu. TCP offload adapters are similar if not the same. How are multi-chip processors dying? Even the new physics addons are attached via high speed interface. The new line of high speed busses and interconnects are easily capable of handling the data and latency. How do Opteron's share each other's memory and IO otherwise via a simple HT link?

Ok, Yonah blows my intel idea, but my point was that Yonah wasn't a blind "design it first, then just try to keep shrinking it" like prescott.

Anyway, I would suggest that to make 1,000 threads run well, most modules would have to be replicated at least 250 times.

Or, make a version of an ALU that can SIMD 16-32 similar operations at once, and schedule accordingly.

Finally, no, there is almost no way we could keep a 1000-thread chip full today. That is not a flaw of the chip, or concept, but of our current programming methodology. Are you honestly telling me there is no way to change the language, or design guidelines to unroll loops in parallel? Assuming only 200 are going to be running at any time due to memory/IO/waits, you have less reason to be so focused on branch prediction or cache misses. So a thread misses a couple cycles waiting on memory, the penalty is miniscule. Have 2 branches to choose from? Choose both, and discard the unused one. For loops can be coded as dependant, or independant, or in the case of DSP like functionality, unrolled with the pointers tracked automatically, parallelizing the operation completely. All this could be done if not today, than soon, without waiting for a 200x increase in clock speeds, which may or may not happen, and with few or none of the drawbacks. All the eye candy in modern os's? Free of charge, because that is just processor power that would be wasted otherwise. Who needs a gpu, your cpu can do that as well or better with an internal geometry unit, with no need for seperate memory.

The point is once the individual operation cost drops because of massive parallelization, the entire way of looking at programming and computers in general can be changed so the current high-revving, but stop for roadbumps viewpoint can be replaced with a steady convoy of trucks metaphor. Every instruction your computer executes is not dependent on every instruction executed before it, that is a restriction placed on computers by us, for debugging purposes and because multi-threaded chips were more expensive than high speed chips until recently, a product of CMOS fabrication technology. We still use processor interrupts for similar reasons, even though they are a huge performance hit unless done properly, and have been removed as far as possible from modern software and hardware designs.

Single threading is just a legacy frame of mind, because debugging dozens of threads without the right tools is hard. Once we develop the right tools, why in god's name would we want to run 1 thread at a time? Just imagine massive multithreading as the next generation of the superscalar architecture used in current chips. Take it to the next level, and blur the lines between current threads and a superscalar scheduler instead of handicapping yourself by forcing dependency scheduling on chip by dedicated circuits that have very little idea of the actual dependencies and execution flow, and less ability to flexibly reschedule for efficiency.

--
The first rule of USENET is you do not talk about USENET.

multi-chip CPUs by YesIAmAScript · 2005-12-07 18:22 · Score: 1

I probably wasn't clear enough about what I meant by multi-chip CPUs. And I probably interpreted your original description too rigidly to mean the thing that isn't viable anymore.

Let me explain.

It used to be that you might have multiple chips involved directly in the execution of the instruction stream. For example, the AMD 2900 series was bit-sliced. When an instruction was fetched and executed, multiple chips worked on it in parallel. I don't know the restrictions, but I believe each chip operated on 8 bits. If you had two chips in parallel you had a 16-bit processor, 4 of them, 32-bit.

A more common organization had some instructions go to different chips, very commonly FPUs. In the Intel 8086/8087, the 8087 watched the CPU fetches and when it saw a floating point instruction, it executed it itself. With the 80286/80287 and later, the main CPU would fetch the instruction and send it to the FPU for exeuction, then get the results back and put them were they were supposed to go. The Motorla 68020 and 68030 supported external FPUs also (68881/68882). The MIPS R2000 and R3000 also had external FPUs.

The original Motorola 88000 was actually two chips, the 88100 and the 88200. One chip did the operations, the other was the bus-interface (load/store and cache) chip.

This all ended at the end of the 80s, when new chips like the Motorola 68040 (except LC), the Intel 486 (except SX), Motorola 88110, and MIPS R4000 series came out. These chips were too fast to effectively use external FPUs, moving the data out to that other chip and back was too slow. Even though the on-chip 68040 FPU wasn't as capable as the external 68882 (or even 68881 in many ways), the lower latency made the performance much much higher.

So, what I meant was that off-chip processing of instructions in the stream just became unworkable at that point, and it still is.

But you're right, a processing system that includes multiple chips executing different instruction streams (the GPU) is quite viable and effective today.

HT is very effective on AMD's processors, but you have to remember that HT only comes into play for load/store instructions. So it has to be fast, but it has to be fast compared to the bus speeds in order to be effective. "calling out" to other chips for the execution of single-cycle math operations wouldn't work nearly as well.

For example, my understanding is that the HT links on current AMD processors run "only" at 1GHz. Since chips like mine run instructions at 2.2GHz or higher, using HT would mean no instruction could be executed in less than 2 clocks. This would be a big hit for simple register-register transfers or adds or such.

A lot of the other stuff you describe sounds remarkably like Intel's EPIC (Itanium). I am not convinced at this time of the value of EPIC in a processor family. I agree it sounds great on paper, but the results so far have been poor, and I know that Intel hasn't be completely successful in making compilers as effective as CPUs at scheduling instructions to run. And you can see why, right? Can you really statically schedule a loop perfectly which contains loads when some of those loads will come from cache (very fast) and some from SDRAM (very slow)? A CPU selecting instructions at runtime will be able to make the decisions on the fly, the compiler doesn't know what to do.

I do know that EPIC isn't incompatible with your idea of lots of threads though, so I know that rejection of one doesn't put a black mark against the other.

--
http://lkml.org/lkml/2005/8/20/95

Kentucky University - more famous than Britain by Ignominious · 2005-12-08 06:35 · Score: 1

UK researchers - damn! Just when I thought us Brits might be making scientific progress again!

Still, great tagline for the UoK though.

Slashdot Mirror

Reduce Transistor Power Consumption

124 comments