Slashdot Mirror


Sun Unveils Direct chip-to-chip Interconnect

mfago writes "On Tuesday September 23, Sun researchers R. Drost, R. Hopkins and I. Sutherland will present the paper "Proximity Communication" at the CICC conference in San Jose. According to an article published in the NYTimes, this breakthrough may eventually allow chips arranged in a checkerboard pattern to communicate directly with each other at over a Terabit per second using arrays of capacitively coupled transmitters and recievers located on the chip edges. Perhaps the beginning of a solution to the lag between memory and interconnect speed versus cpu frequency?"

185 comments

  1. It will be running java by holzp · · Score: 5, Funny

    therefore the speeed increase will be unnoticable.

    1. Re:It will be running java by aphor · · Score: 1

      Actually, you might notice a difference: Java might peg all of your CPUs seemingly for no good reason.
      Depending on what patch level your Solaris is at, your JVM might be using one OS thread-handling model or another. One apparently makes Java go nuts on the spinlocks, which is less noticable on slower machines. True story of a Solaris support case at work...

      If you want to sell more hardware, why would you make a framework that scales well down to smaller, slower hardware? The idea is that the more hardware you throw at it, the better it will run. That is offensive to people whose computing effort isn't directly pulling in revenue. A little bit of cleverness can go a long way, but people who only care about fat paychecks don't understand that kind of long-term thinking. Java is definitely for business: soaking up budgets!!!

      --
      --- Nothing clever here: move along now...
  2. Timing? by afidel · · Score: 3, Interesting

    I wonder if this release might have been pressed forward a bit to squelch some of the talk about Sun losing their will to innovate after Bill Joy left.

    --
    There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    1. Re:Timing? by Usagi_yo · · Score: 5, Insightful

      No. What you don't understand or realize is that Bill Joy actually left 2 years ago, when he "retired" into distinguished senior engineer, from CTO. This latest move by Bill Joy, full retirement is merely a continuation of that. At least thats how I see it.

    2. Re:Timing? by cayle+clark · · Score: 1

      Nobody will notice - since the NYT persists in crediting Joy with every innovation Sun ever made, including being "developer of the Java programming language."

      Does the actual inventor of Java get no respect because his name makes people think of little fuzzy birds instead of nerdish ecstasy?

  3. Hmm by Mysticalfruit · · Score: 1

    This sounds like a sweet technology. Hopefully we'll see this in a real product in the near future.

    Though the way people talk about SUN, were more likely to see it licensed to some other company...

    --
    Yes Francis, the world has gone crazy.
    1. Re:Hmm by Daniel+Dvorkin · · Score: 1, Insightful

      If IBM has it, it will run Linux.

      --
      The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
  4. terrabit by lanswitch · · Score: 2, Funny

    Does "terrabit" mean that it will be made of pieces of the earth?

  5. No registration by Anonymous Coward · · Score: 5, Informative
  6. Replacing Network-on-Chip/System-on-Chip by KarmaPolice · · Score: 4, Interesting

    This could prove very interesting as the speed usually drops when "leaving the chip" to do communications. There has been alot of research to develop protocols to ease on-chip communication when several ICs are combined on a single chip. If Suns technology can stand the test, NoC/SoC products could reduce it's time-to-marked dramatically...smaller and faster devices for everyone!

    BTW: I didn't RTFA since it requires (free) reg.

  7. I suppose this will be patented... by Thinkit3 · · Score: 3, Funny

    Or maybe Rambus is already fixing to sue them.

    --
    -Libertarian secular transhumanist
    1. Re:I suppose this will be patented... by Anonymous Coward · · Score: 0

      Ramb...who? Havn't they died in a flaming ball of shit yet?

      I havn't seen a RIMM in over a year now. Do Intel even make Rambus Northbridges (Sorry, Memory Interconect Hubs) anymore?

  8. Fast today Slow Tomorrow by Anonymous Coward · · Score: 5, Interesting

    That is the nature of the beast.

    Remember how excited you were to get your hands
    on a 386 machine?

    The thrill of your first encounter with a 286 screamer?

    Upgrading to 16k from 4k on your TRS-80?

    Your first disk drive for your Apple 2?

    It's all relative.

    So enjoy

    1. Re:Fast today Slow Tomorrow by FrostedWheat · · Score: 1

      Spoilsport :p

      Just because you didn't get that 486 you wanted all those years ago, dosen't mean we can't enjoy the nice fast computers!

      ;)

    2. Re:Fast today Slow Tomorrow by ccp · · Score: 1

      If you weren't already been moderated to 5, I would spend a point with your post.

      What you're stating is obvious but true: at this point, speed improvements are meaningless except for a fistful of aplications.

      For the common user, a fasterr chip means nothing: the application is the choking point.

    3. Re:Fast today Slow Tomorrow by pjt48108 · · Score: 1

      Hey, now, don't forget the TI-994/A! Insert the Extended Basic cartridge, and, WHAMMO! You just doubled your memory to 32k!

      Cool beans, dude.

      --
      Mmmmmm... Bold, yet refreshing!
    4. Re:Fast today Slow Tomorrow by DickBreath · · Score: 1
      What you're stating is obvious but true: at this point, speed improvements are meaningless except for a fistful of aplications.

      You're right of course. Only a fistful of applications need ever increasing cpu speed.

      And that small fistful of applications all begin with one word.
      • Microsoft Word
      • Microsoft Excel
      • Microsoft PowerPoint
      • Microsoft Outlook
      • Microsoft FrontPage
      • etc...

      For the common user, a fasterr chip means nothing: the application is the choking point.

      I don't believe that I need to add anything to this statement about choking on certian applications. You're right. It's obviously true. :-)
      --

      I'll see your senator, and I'll raise you two judges.
    5. Re:Fast today Slow Tomorrow by Anonymous Coward · · Score: 0

      You're right, damn you!
      I remember all of that...
      *sniff of nostalgia*

  9. SUV of chip interconnects? by Atomizer · · Score: 3, Funny

    Whatever, I think this will end up being the SUV of chip to chip conections. ;)

    1. Re:SUV of chip interconnects? by ikkonoishi · · Score: 0

      More likely it will be the model T of chip to chip connections since it is a prototype.

    2. Re:SUV of chip interconnects? by stratjakt · · Score: 1

      The Model T was a production car.

      --
      I don't need no instructions to know how to rock!!!!
    3. Re:SUV of chip interconnects? by Slightly+Askew · · Score: 1

      And the Obscure Analogy Award goes to...

      --
      Public use of any portable music system is a virtually guaranteed indicator of sociopathic tendencies. -- Zoso
    4. Re:SUV of chip interconnects? by AsylumWraith · · Score: 1

      You mean the Model A...

  10. Link via Google (no Reg. Required) by chrestomanci · · Score: 4, Informative
    1. Re:Link via Google (no Reg. Required) by ConceptJunkie · · Score: 1

      That's nothing... thermal expansion and vibration have been unseating my circuit boards for years.

      --
      You are in a maze of twisty little passages, all alike.
  11. IANAEE (I am not an electrical engineer) by Peridriga · · Score: 4, Insightful

    This might be the obvious question but, why hasn't anyone done this before?

    It seems obvious, the end of chip has pins. The chip it will eventually connect to has pins. Instead of having 20 trace lines to the next chip why not redesign them so the out/inputs of both line up to reduce the complexity of the design.

    Anyone wanna fill in my mental gap for me?

    1. Re:IANAEE (I am not an electrical engineer) by Anonymous Coward · · Score: 0

      The main bottleneck in CPU performance is memory access. It doesn't matter if you have a grid of super processors if getting data from memory takes a trillion cycles. So i guess it hasn't been done before because it wouldn't help much. And yes, that memory wall is still an issue.

    2. Re:IANAEE (I am not an electrical engineer) by ikkonoishi · · Score: 1, Insightful

      The main bottle neck is memory access because of the system bus.

      A direct CPU to RAM connection would improve things dramatically.

      Why do you think L1 cache is so important?

    3. Re:IANAEE (I am not an electrical engineer) by Jah-Wren+Ryel · · Score: 5, Insightful

      It has been done before, probably the most recent incarnation is hypertransport from AMD. The only difference at the 50,000ft view is that the speeds and feeds are faster. This is an evolutionary step, not revolutionary or innovationary,

      --
      When information is power, privacy is freedom.
    4. Re:IANAEE (I am not an electrical engineer) by proj_2501 · · Score: 0, Flamebait

      Yeah, YOU try to coordinate redesigns for 4 different chip vendors, who are also redesigning their chips for EACH OF THEIR CUSTOMERS.

    5. Re:IANAEE (I am not an electrical engineer) by jhines · · Score: 2, Informative

      It has been done.

      The DEC PDP11/03 aka LSI-11 was implemented as a multi chip (4 + 1 rom) CPU. The 5 chips were placed right next to each other.

      This chip set was also setup by others with the UCSD Pascal "p-code" as the instruction set.

      Other CPU in the series had MMU, and additional instructions in additional chips.

    6. Re:IANAEE (I am not an electrical engineer) by Usagi_yo · · Score: 2, Interesting
      Alot of reasons. We'll start with large complex chips. No, the pins aren't at the end of the chip, they are underneath the chip. BGA, CGA, LGA, ball grid, colume grid or land grid array.

      Then we'll go to ... well, they sorta did. They just enclosed it into one big chip.

      Then of course Heat and cooling and power requirments.

      Then onto manufacturability and repairability. Don't want to have a $15k board that has to be thrown away whenever there are problems with it. You do want to be able to repair it.

      Next on to glue logic or glue componants such as current limiting resisters, pull ups, pull downs, bypass and decoupling capacitance.

    7. Re:IANAEE (I am not an electrical engineer) by proj_2501 · · Score: 1, Informative

      HyperTransport is more than AMD. In fact, it includes Sun!

      from the HyperTransport FAQ
      "6. What is the current specification release?
      The current HyperTransport Technology Specification is Release 1.05. It is backward compatible to previous releases (1.01, 1.03, and 1.04) and adds 64-bit addressing, defines the HyperTransport switch function, increases the number of outstanding concurrent transactions, and enhances support for PCI-X 2.0 internetworking."

    8. Re:IANAEE (I am not an electrical engineer) by Loconut1389 · · Score: 1

      I think sun is talking about having -no- trace lines, the chips have transmitters and receivers rather than traces between chips.

    9. Re:IANAEE (I am not an electrical engineer) by Anonymous Coward · · Score: 5, Informative

      You can't simply just remove the circuit board to achieve better speeds, you need to eliminate the need for the pad that converts internal logic to what we currently use externally. That is what Sun is claiming they have done.

      Sun's technology is not simply soldering to pins directly together (as you suggest), which is effectively the same thing as wiring through a circuit board. The high speed, low drive strength, low-voltage drivers have to go through pads that convert the internal signal to a slower, high drive strength, high voltage driver, that will yield a reliable connection to the next chip. I'm not an expert in this area, but Physics just gets in the way. There are capacitive issues, and interconnect delay issues.

      Sun is claiming to use capacitive coupling (put the pins really close together, but don't physically connect them.) This way they don't have to drive the external load of the pin/board connection, and are claiming they will be able to scale this down to a pad that will be able to switch faster than existing physical wire connected pins. Which means they believe they can make this technology work with lower drive stengths.

      They still have a ways to go. Notice that the P4 has faster connections using existing techology. Sun did a proof of concept, and claim they can speed it up 100x. So they haven't _proved_ that this will operate faster yet. They still have many things to overcome to make this viable, including how to make a mass production/assembly process. It's going to be a few years. At least.

    10. Re:IANAEE (I am not an electrical engineer) by fitten · · Score: 2, Informative

      So you think DRAM access time is 40ns (or so) because of the system bus?

      L1 cache typically found on today's processors and DRAM are two different things with different design targets. Pick up a VLSI book.

    11. Re:IANAEE (I am not an electrical engineer) by eXtro · · Score: 5, Informative
      There are two seperate metrics that define the speed of memory. Latency, which is what your 40 ns refers to, and bandwidth. Large caches address the latency problem as you stated. If you want to transfer more bits per cycle you're restricted due to signal integrity issues related to the bus, so the parent post is also correct. You can increase the width of the bus, up to a point, and get a small scalar increase in bandwidth. To go beyond this you need to address signal integrity problems.


      Sending fast edges over a bus is difficult because the signal degrades:

      • inter-signal interference: Each parsel of information spreads due to the RC nature of the bus so that it takes up more than a period, thus interfering with the next packet.
      • cross-talk: Each wire on the bus is fairly tightly coupled with it's neighbours, so switching activity on one wire affects it's neighbours.
      • transmission line effects: Package connectors, bends in circuit traces etc all create impedance mismatches. This causes reflections which degrade the signal.


      If your dataset fits into the cache well, which is often the case for PCs, then a cache can fix most of your problems. If you're dealing with datasets that span gigabytes or terabytes and your application can't be subdivided such that processing and memory can be constrained per cpu then your cache doesn't assist you very much.
    12. Re:IANAEE (I am not an electrical engineer) by Anonymous Coward · · Score: 0

      HyperTransport is derived from the interconnects in the Sun E15k which Sun got when it bought the remnants of Cray. The other part of what was left of Cray went on to become Tera which has changed its name to Cray again...
      Hypertransport is a completely different thing to this new type of interconnect.

    13. Re:IANAEE (I am not an electrical engineer) by proj_2501 · · Score: 1

      wow, that's confusing. there's a trademark lawsuit waiting to happen.

    14. Re:IANAEE (I am not an electrical engineer) by yppiz · · Score: 1
      Prior art: a four-pin CPU.

      --Pat / zippy@cs.brandeis.edu

    15. Re:IANAEE (I am not an electrical engineer) by Anonymous Coward · · Score: 0

      In school EEs are taught that wires conduct electricity, and it's hard for them to think outside of the box.

    16. Re:IANAEE (I am not an electrical engineer) by glassesmonkey · · Score: 1

      Well I am an EE and this is sort of interesting, but capacitively coupled is still driving a large capacitive load. There are a few packaging ideas that are better for this idea of reducing chip-chip speed and wire widths.

      flip-chip bonding of two chips basically puts the silicon directly on top of each other. Problem with this is getting the heat out. There are also ideas about tiny printed conductive ink lines used to connect two silicon chips in a package, etc.

      The interesting idea you'll hear about at the same conference involves 'pin modems' and the idea of instead of a signal coming out of your chip and driving another chip, but to make each pin into a modem and achive higher bandwidth than a simple analog signal. All the ideas of traditional modems apply, carrier freq, compression, connecting at the highest speed available. This idea of coupled chips will be VERY sensitive to the environment and PCB your systems sits on. But modems, like bad phone lines still work.

    17. Re:IANAEE (I am not an electrical engineer) by randyest · · Score: 1

      This is MISinformative. Perhaps the noble moderator misread the Informative option as Misinformative? Or perhaps the gentle moderator doesn't know the difference between Hypertransport(TM) (a bus standard like, but faster than, ISA, PCI, etc. using plain old PCB traces as chip interconnect) and the new PCB-less chip interconnect discussed in the fine article? If this is the case (and I suspect it is), I must note that the moderator had no business moderating this particular post.

      The original poster is ignorant and should be excused. By accepting a position of responsibility and authority, the moderator should only moderate posts about which he or she has at least a moderate understanding, and even then, in moderation.

      --
      everything in moderation
    18. Re:IANAEE (I am not an electrical engineer) by proj_2501 · · Score: 1

      Perhaps the responder misread the post to which he is responding, as did the Anonymous Coward before.

      The HyperTransport consortium has Sun Microsystems as a member. HyperTransport is used in AMD systems. Nothing more, nothing less.

    19. Re:IANAEE (I am not an electrical engineer) by randyest · · Score: 1

      Nope, I read it. I guess you misread, or simply ignored, the original post to which you replied, which clearly confused Hyperchip and the new technology, to wit:

      [This new technology] has been done before, probably the most recent incarnation is hypertransport from AMD. The only difference at the 50,000ft view is that the speeds and feeds are faster. This is an evolutionary step, not revolutionary or innovationary,

      Although you didn't directly re-state the posters' false claim, you did continue that thread, which was based soley on an incorrect assertion, and fueled the fire of misinformation by adding you own comment to it, and perhaps even confusing matters more by bringing up the (wholly Offtopic) point that "HyperTransport is more than AMD. In fact, it includes Sun!"

      It's not that big of a deal, so it's OK. As I said, you're excused for your ignorance. It's the moron moderators who modded you Informative that should be caned.

      --
      everything in moderation
    20. Re:IANAEE (I am not an electrical engineer) by Anonymous Coward · · Score: 0

      What part of 50,000ft view do you fail to comprehend?

    21. Re:IANAEE (I am not an electrical engineer) by Anonymous Coward · · Score: 0

      None. You're wrong at any altitude, friend.

  12. Re:terrabit by Anonymous Coward · · Score: 0

    You mean those silicon chips with ceramic packaging? They sure will!

  13. Imagine a Beowulf Cluster... by tekiegreg · · Score: 2, Funny

    of these, well that's kind of the point actually :-)

    --
    ...in bed
  14. Perhaps a physical base for Neural Network? by BlankStare · · Score: 3, Interesting

    I wonder if this hardware computing model could provide the first real base for Neural Network computing? As far as I know, any neural network is currently emulated on linear processing machines.

    1. Re:Perhaps a physical base for Neural Network? by kinnell · · Score: 1
      I wonder if this hardware computing model could provide the first real base for Neural Network computing?

      This is not a hardware computing model, it's a new interconnect technology. So no.

      As far as I know, any neural network is currently emulated on linear processing machines

      The neural network group at Edinburgh University has been developing parallel neural network chips using analog technology for some time now. Because neural networks are very fault tolerant, the errors introduced by analog adders and multipliers is not important.

      --
      If I seem short sighted, it is because I stand on the shoulders of midgets
    2. Re:Perhaps a physical base for Neural Network? by gkramer · · Score: 1

      >As far as I know, any neural network is >currently emulated on linear processing >machines. Not entirely true. A lot of people simulate neural nets on computers because its cheaper and easier. But my lab, and a lot of other people implement them in hardware for speed and compactness. NN hardware implementations have been in the literature for a long time.

    3. Re:Perhaps a physical base for Neural Network? by BlankStare · · Score: 1

      Could you point me to public postings for NN hardware implementations? Thanks.

    4. Re:Perhaps a physical base for Neural Network? by eggoeater · · Score: 0

      Sorry but it would still be a linear processor.
      This seems to be just a communication technology, with little effect as to how things are 'computed' . Using multiple processors to simulate neural nets has certainly been done before... but that doesn't really help with the core problem.
      Any function performed on multiple processors can be duplicated (albeit slower) on a single processor. Alan Turing proved that before ENIAC was built.
      The real solution for AI (in my humble opinion) will come when we figure out how to solve problems that are 'uncomputable' on a Turing Machine.
      -Steve

    5. Re:Perhaps a physical base for Neural Network? by gkramer · · Score: 1

      Check out Carver Mead. I believe that he was one of the first to write about implementing NNs in hardware. You'll find a lot of hits for him on Google. Don't remember the name of the book off the top of my head. You can also check out the papers section of my lab for more refs. http://gozer.cs.wright.edu.

    6. Re:Perhaps a physical base for Neural Network? by Anonymous Coward · · Score: 0

      Wow, what are you smoking? I want some.

  15. FINALLY! by JoeLinux · · Score: 5, Interesting

    Someone gets it. As an Electrical Engineer-in-training, I was always frustrated with people who got these big bad processors and wondered why their improvement was minimal.

    They never quite grasped that the biggest bottleneck is between the processor and memory.

    My EE instructor always said that they could improve performance by doing one simple thing: make the interconnects on the motherboard between the motherboard and RAM rounded instead of cornered. You could then increase bus speed as you wouldn't have magnetic loss at the corners like you do now.

    You fix that, and you can see a SUBSTANTIAL improvement in performance. The only thing that can be done beyond that is to get a Platypus drive (Solid state "Hard Drive" made from Quikdata made from DDR RAM). Then you reduce your access time to your hard drive from milliseconds to nano/microseconds.

    1. Re:FINALLY! by javatips · · Score: 0, Flamebait
      My EE instructor always said that they could improve performance by doing one simple thing: make the interconnects on the motherboard between the motherboard and RAM rounded instead of cornered. You could then increase bus speed as you wouldn't have magnetic loss at the corners like you do now.


      I hope that you instructor applied for a patent for this... If he did, he will be rich soon just be licensing this incedible breakthrough in interconnecting chips.


      The actual good consequence is that he will stop teaching soon. I hate crappy instructor who think they have great idea to make stuff better, but do nothing with it... Especially, easy to implement improvements. If what he said to you was not full of crap, you can be sure that someone would have implemented this!

    2. Re:FINALLY! by afidel · · Score: 1

      Obviously an EE in training =)
      Have you ever tried to route a complex multilayer PCB design? If you have then you will know that it would be basically impossible to guarentee all straight paths between the CPU and RAM, or any other component. Besides if you want fast ram you put it on or near the CPU die. Hence processors like the Xeon, Itanium, HP PA-8800, etc which derive most of their performance gains over their desktop competitors by having large L2 and huge L3 caches.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    3. Re:FINALLY! by Jeff+DeMaagd · · Score: 2, Informative

      I think the fourty five degree corners is about as much of a compromise as one can get without being too expensive with routing.

      Another problem is that the speed of memory itself isn't that great unless you want to spend a _lot_ of money, to the tune of $50-$100 per megabyte as we see in advanced processor caches, and the faster it is, the more very power inefficient it becomes, maybe to a sizeable fraction of a watt per megabyte.

    4. Re:FINALLY! by hackstraw · · Score: 2, Interesting

      Other people "get it". If you go to UVA, you might want to talk with Dr. McCalpin, and take a look at the stream memory benchmark.

      Memory bandwidth is a bottleneck, not the biggest. It depends on the application. Sometimes an app is CPU bound, disk bound, network bound, or memory bound (or graphics card bound if 130FPS is too slow for your eyes). Also, chip-to-chip interconnects will not change the memory bandidth issue, because if the data does not fit on the chips or thier cache, then its going in memory.

      Also, this is for scaling. Think beowulf. For those machines, the data must go from the cpu -> memory -> network interface -> switch -> network interface -> memory -> cpu. Yes, bandiwidth is an issue, but there is also latency, which would be very low with these kinds of chips.

      A side note, I work with a guy that soldered chips together like this which had native parallel processing instructions about 20 years ago.

    5. Re:FINALLY! by Ella+the+Cat · · Score: 1

      It must be horrible to live life so cynically and literally. His instructor was saying something to make him =think= about what happens. There's a post a few down from yours about using 45 degree bends. Go check out High Speed Signal Propagation (Advanced Black Magic) by Johnson and Graham ISBN 0-13-084408-X. You might learn something.

    6. Re:FINALLY! by javatips · · Score: 0, Offtopic
      It must be horrible to live life so cynically and literally.


      It is actually quite pleasant to live a life like this. I really enjoy it... You made my day!

    7. Re:FINALLY! by Anonymous Coward · · Score: 0

      McCalpin is at IBM, was at SGI before that, STREAM is just hosted at UVA out of tradition.

    8. Re:FINALLY! by chrysrobyn · · Score: 5, Insightful
      Someone gets it. As an Electrical Engineer-in-training, I was always frustrated with people who got these big bad processors and wondered why their improvement was minimal.
      They never quite grasped that the biggest bottleneck is between the processor and memory.

      Don't get too frustrated with this. There will always be people who don't understand something fundamental to your training. That's why you're trained, to understand these non-obvious fundamentals. Now that you understand a CPU has to be fed data in order to process it, it's obvious, but a PHB wouldn't necessarily come to that conclusion on his own.

      My EE instructor always said that they could improve performance by doing one simple thing: make the interconnects on the motherboard between the motherboard and RAM rounded instead of cornered. You could then increase bus speed as you wouldn't have magnetic loss at the corners like you do now.
      You fix that, and you can see a SUBSTANTIAL improvement in performance. The only thing that can be done beyond that is to get a Platypus drive (Solid state "Hard Drive" made from Quikdata made from DDR RAM). Then you reduce your access time to your hard drive from milliseconds to nano/microseconds.

      Your EE instructor will tell you lots of things that can help performance. For example, making the L2 cache be the size of main memory. Just because it helps performance doesn't make it worth the price. Rounded edges on the PCB are not easy to accomplish and their benefits may not be outweighed by the added price-- even for exceptionally high end servers. Without looking at the math, I would like to toss "10% performance adder, 50% cost adder" out into the air, and say that most people would rather save the dough. Another factor to consider is reliability. Intuition suggests to me that reliability would go up without sharp edges, but intuition also tells me that modelling board coupling on a 4 layer board would be a real pain in the ass, to say nothing of a server class 6 or 8 (or higher) layer board if you have to model curved structures. You might not find an easy way to capitalize on your wonderful curved wire performance. Not only do you have to worry about your slowest path, but your quickest one can't arrive so quickly that the other chip can't sample the previous output.

      Take care in your classes when you use the word "only". Taking advantage of our wonderful next generation 64 bit processors and multiple gigs of RAM, we could conceivably copy the contents of the hard drive to main memory (especially if we are only concerned with 1-4 gigs of data in a low cost solution). Here, we get the enhanced bandwidth of main memory instead of having to kludge through the southbridge, PCI controller, IDE/SCSI to RAM interface and back.

      There are many things that improve system performance-- and the system is the only thing that matters-- rounded wires and SSDs (solid state drives) are only the beginning. Depending on the application, a larger L3 cache may make more difference, or a wider faster CPU to CPU interface, or a pair of PCI controllers hanging off the southbridge for twice the bandwidth, or integrating the northbridge onto the CPU, or ...

      The best engineering advice I can give you is that the answer is always, "It depends". You'll spend the next 5-30 years of your life learning how to answer the followon question "Depends on what?". Almost everything has advantages and disadvantages and there are few absolutes.

      The "Someone gets it" and "They never quite grasped" attitude may get you in trouble. Being proactive and explaining and educating instead will likely be more effective.

    9. Re:FINALLY! by ingenthr · · Score: 2, Insightful

      Quite right. Sun gets this quite well. Look for the articles on the Niagara processor. People always look at it as an SMP on a chip and try to compare it to hyperthreading or the stuff that IBM has done. However, what Sun is doing is 'fast switching' between threads based on stalls for memory access. This is another way of solving the problem you mention.

      If the software and programming model are capable (and most software run on Solaris is) of exploiting this, you effecively trade off bandwidth (easy to obtain) for latency (very, very hard to obtain).

      It's cool stuff-- I'm looking forward to seeing it released as a product.

    10. Re:FINALLY! by sakarada · · Score: 1
      For example, making the L2 cache be the size of main memory.

      I was of the understanding that to double the size of a cache would effectivly half its speed. Mainly due to the way caches are designed. By removing the addressing overhead. Hence the reason we dont have whopping great level 2 caches, and only a very small level 1 cache. So a L2 cache the size of main memory could actually cause performance loss.

    11. Re:FINALLY! by chrysrobyn · · Score: 1
      I was of the understanding that to double the size of a cache would effectivly half its speed. Mainly due to the way caches are designed. By removing the addressing overhead. Hence the reason we dont have whopping great level 2 caches, and only a very small level 1 cache. So a L2 cache the size of main memory could actually cause performance loss.

      To the first order, yes, doubling the cache size will likely double the latency. A skilled SRAM designer knows a few tricks which may mitigate this some. However, the CPU designer needs to know what tradeoffs the area expensive and high off current SRAM array will have against the application advantages of not having to go hundreds of CPU cycles out to main memory. Don't forget that doubling the access time of an SRAM array to 20 cycles is far less than multiple hundred to get to main memory.

      If one were maximizing performance of RC5 brute forcing, Distributed.net found that 256k of cache was plenty. On the other hand, SETI found they needed 1 Meg of cache (IIRC)-- the RC5 application would suffer from the effect you point out-- an L2 too large to take advantage of. A database program, on the other hand, would really probably prefer no datacache penalties if the application is truly random access. Cache misses cost cycles too.

    12. Re:FINALLY! by Via_Patrino · · Score: 1

      The memory speed is the same but the latency is much lower.
      The latency has been partialy remedy by the prefetch instructions (pentium 4 and xp 1800+) but it's still a useful invention since you can't prefetch everything.

    13. Re:FINALLY! by dmiller · · Score: 1

      The "Someone gets it" and "They never quite grasped" attitude may get you in trouble. Being proactive and explaining and educating instead will likely be more effective

      Not on Slashdot, alas.

  16. Is this new? by 4im · · Score: 3, Insightful

    Sounds a lot like the ol' Transputer (was from INMOS), of course faster. One could also think of AMD's HyperTransport. So, again, except maybe for the speed, I don't see much innovation here.

    If only people could remember that "terra" has something to do with earth, "tera" is the unit...

    1. Re:Is this new? by TwistedSquare · · Score: 1

      I wouldn't say it was innovation but its a step in the right direction. Its very close to the CSP model as dealt with before (http://www.wotug.org/), which should allow for efficient use of multiple processors.

    2. Re:Is this new? by Ella+the+Cat · · Score: 2, Insightful

      It doesn't sound like the Transputer to me. Sure, they resemble each other in that you can build a 2D array of chips from them by design, but you miss (or inadvertently downplay) that the innovation occurs in the fundamental electronic engineering issue of what happens in the bits of circuitry that drive the pins/pads - the transputer used asynchronous links and conventional pins and a nice but conventional memory interface, the Sun chip is doing something new, or if not new, seldom seen and highly promising. Ivor Sutherland knows his stuff.

    3. Re:Is this new? by randyest · · Score: 1

      Yes, it is. And no, it doesn't. The Transputer has a printed-circuit board. This doesn't. Any more questions?

      --
      everything in moderation
  17. or it might not by penguin7of9 · · Score: 3, Insightful

    Placing large numbers of chips adjacent to one another has obvious problems with heat and power, in particular when running at those speeds. That, rather than interconnect technology, is probably the main reason we still package up chips in large packages.

    This might be useful for placing a small number of chips close together, in particular chips that may require different manufacturing processes.

    1. Re:or it might not by Derivin · · Score: 2, Insightful

      Heat will definatly be an issue, but much less power will be required. The majority of the power required by chips is used to push data on and off the chip. It takes alot of poser to move a signal from a 25 micron PCB path.

      This technology (if it pans out) will mostlikly enter teh private sector in cell phones, DVD players and other small consumer electronics that have a very large number of units produced.

      Silicon wafer production has always had one major problem. Impurities. The ability to use more of the waffer to produce smaller chips that can later be 'put back together' in arrays that may not be any larger than the origional single chip solution has the potential to be much cheaper to manufacture in mass quantity.

      Granted this is part of the theory behind 6 Sigma, which does not always work out.

  18. Burning Slashdotter Questions by teamhasnoi · · Score: 0, Troll
    What effect will this have on the dip to chip ratio? Will this prevent breakage under load? Has anyone measured the performance benefits of salsa compared to sour cream and onion?

    Most importantly, will I still need my ThinkGeek 'I am teh Chip Haxx0R' bib?

  19. Re:terrabit by lanswitch · · Score: 0

    And they call that news? Most of the stuff around me is made of bits of the earth. Some bits are more processed than others, but they are still b.o.t.e.

  20. Yes, sounds like the Transputer reinvented by Angostura · · Score: 1
    Ah, the Inmos Transputer, ideal for parallel applications. Now as dead as a doornail.

    Transputer background

  21. Bah, this is old! by ecki · · Score: 1, Funny

    Anybody remember the viruses which could travel from floppy to floppy back in the C64 days? You would put an infected floppy next to a clean floppy, and the virus would just hop over! Don't know about the speed though...

    (No kidding, there were people back then who told and believed this nonsense ;)

    1. Re:Bah, this is old! by Anonymous Coward · · Score: 0

      That was why you have the paper covers you floppys, didn't you know :-)
      Later it was plastic for the Amiga that was because that was a better protection from Viruses, Amiga still had more!!

  22. GridComputing by Bluelive · · Score: 1

    Great potential for gridcomputing, just keep adding chips.

  23. OT: Re:SUV of chip interconnects? by Slightly+Askew · · Score: 1

    Arrgh! That's twice. I'm gonna have to start reading at the bottom of the page so I can catch the references to other articles.

    --
    Public use of any portable music system is a virtually guaranteed indicator of sociopathic tendencies. -- Zoso
  24. This may be of interest to some... by Anonymous Coward · · Score: 0
  25. Hard to say what's new here by kent.dickey · · Score: 2, Informative

    The article is a bit vague as to what the innovation really is.

    The article immediately made me think of multi-chip modules. Multi-chip modules is an idea which never really caught on in the industry (except for IBM), and I'm not sure how Sun's innovation isn't just a take-off along that idea. Multi-chip modules have failed due to costs since much has to go right to get a multi-chip module that works.

    Any practical chip-to-chip connectivity scheme had better have a good rework scheme. If it doesn't, it's just boutique technology that will not affect the industry overall.

    Having worked on chips with multi-gigabit pins, a huge problem is resynchonizing the signals. Creating a receiver to align one pin's data with 15 neighbors at 3GHz takes a whole lot more logic space on the die than a small driver (or receiver). The auxiliary logic basically makes shrinking the final driver FET almost meaningless.

    Modern chip design is a constant trade-off between features and cost. And what's cheap is what everyone has been doing for years (or is an evolution of that).

    1. Re:Hard to say what's new here by cybermace5 · · Score: 1

      Well, one benefit I can see is that you don't have to drive a huge (in silicon terms) chunk of copper over to the next chip. I guess they have the distances and other parameters figured out so that this capacitive coupling is actually an advantage compared to copper traces.

      They could probably do something similar with arrays of laser diodes beaming out the edges of the chips. Then again, maybe the capacitive coupling is better than that in terms of power consumption and speed.

      --
      ...
    2. Re:Hard to say what's new here by William+Tanksley · · Score: 2, Informative

      They could probably do something similar with arrays of laser diodes beaming out the edges of the chips

      Definitely. That would be electromagnetic coupling. Sun's using capacitive coupling, using only the E field. Last week we saw an article on a company using inductive coupling (magnetism) for short-distance data links (in their first product, a wireless earset).

      EM is long-range (drops according to the inverse square) but very hard to convert to and from electricity.

      M is short-range (inverse sixth power), relatively easy to convert, not easy to interfere with, but bulky and directional.

      E is short-range (inverse sixth), slightly harder to convert, not bulky, but easily interfered with.

      Sun's choice here is perfect: this application doesn't need (or want) the range of EM, and can't afford the mass and volume of an inductor. OTOH, the ease of interference is easily dealt with because once we know the geometry and composition of the board, we know the shapes the e-fields will have.

      I really like the fact that we've had two nicely orthogonal stories just so close together.

      -Billy

  26. Chip to Chip technology? by TWX · · Score: 1

    Isn't that called a trace? Or another fancy name would be a lead? I think that there are people with prior art...

    --
    Do not look into laser with remaining eye.
    1. Re:Chip to Chip technology? by timeOday · · Score: 1

      I don't get it either. You want to make memory access faster and faster, so you put it closer and closer to the cpu. Eventually the bus length reaches 0, as the two chips are physically adjacent. So what?

    2. Re:Chip to Chip technology? by randyest · · Score: 2, Informative

      Isn't that called a trace? Or another fancy name would be a lead? I think that there are people with prior art...

      No, a trace is a flat wire stuck to (or etched from) a printed circuit board. This invention (process, really, see below) obviates the need for PCB's between (at least some of the) chips. A lead is a wire, not stuck to a PCB, such as the input connections to most oscilloscopes and test equipment.

      I don't get it either. You want to make memory access faster and faster, so you put it closer and closer to the cpu. Eventually the bus length reaches 0, as the two chips are physically adjacent. So what?

      As with many great inventions, the difficulty is not so much thinking of what needs to be done, but in actually doing it cost-effectively. System designers have been trying to use the idea of optimized interconnect (sometimes called "integration", as in LSI, VLSI, etc.) but it has remained cost-prohibitive in most cases (notable exceptions include the Pentium Pro and some ATI mobility products, but these are more desperation moves than anything, since margins drop on multi-chip "chips", and they had to do it to get the needed result even though the costs were higher than normally tolerable.)

      So, sure, light bulbs are obvious, as are cars, space shuttles, computers, etc. The hard part is making them possible technically and economically.

      Hope that helps you two understand why the ASIC-design industry is pretty damn excited and anxious to license this technology (if we really can do this as cheaply as they claim).

      --
      everything in moderation
  27. Yuor speling is teribal, Hemus. by Anonymous Coward · · Score: 0

    It's terabit and receivers, Hemos. Not terrabit or recievers.

  28. Actually, you mispelled them.... by Anonymous Coward · · Score: 0
    1. Re:Actually, you mispelled them.... by lanswitch · · Score: 0

      so please pell them again?

    2. Re:Actually, you mispelled them.... by Anonymous Coward · · Score: 0

      die.

  29. And its name? by GlassUser · · Score: 1

    Shall we call it Prime Intellect?

    (actually, by the story naming convention, it would be closer to intellect 1, but oh well)

    1. Re:And its name? by zapp · · Score: 1

      shoot, you beat me to it. Good story though.

      --
      no comment
    2. Re:And its name? by GlassUser · · Score: 1

      I was late too, concerned that someone else would beat me. I enjoyed it though.

  30. Bad math? by Quixote · · Score: 4, Insightful
    I hate it when the hype overshadows the technical details. Here's a snippet from the article:

    By comparison, an Intel Pentium 4 processor, the fastest desktop chip, can transmit about 50 billion bits a second. But when the technology is used in complete products, the researchers say, they expect to reach speeds in excess of a trillion bits a second, which would be about 100 times the limits of today's technology.

    If a P4 is already doing 50 Gbps (as they say), and this uber-technology will allow 1Tbps (which is 20x a P4's 50Gbps), then how is that "100x the limits of today's technology" ?

    <shakes head>

    1. Re:Bad math? by babyrat · · Score: 1

      Nope the math checks out...got my slide rule out and did some calculations...

      50 billion * 100 = 5 trillion

      5 trillion is DEFINITELY in excess of 1 trillion

      So the statement is true.

  31. that's because... by Anonymous Coward · · Score: 0
  32. Sun may be ahead in other areas, too by Mr.+Ophidian+Jones · · Score: 4, Interesting

    Normally I don't pimp Sun, but here's something that makes me think they still have a finger on the pulse of things:
    Read about plans for Sun's "Niagra" core

    I understand they hope to create blade systems using high densities of these multiscalar cores for incredible throughput.

    There's your parallel/grid computing. ;-)

  33. This reminds me of by BackSpace · · Score: 2

    the Transputer. It had 4 available hardware connections and the description of the way the different processors communicate is very similar to what is described by the article.

    Of course to take maximum effect of this communication speed in general parallel applications, main memory access would have to be improved. I'd guess these things will have huge on-chip caches.

  34. More on the broader project by leery · · Score: 4, Informative

    IANAEE either, but this made a little more sense to me after I read this Inforworld article, which talks about two other aspects of Sun's DARPA-funded project: clockless "asynchronous logic", and building processors with interchangeable and upgradable modules. They absolutely need these busless "proximity" interconnects for the processor modules to communicate at close to on-chip speeds, and the clockless architecture lets them get rid of the bus. Or vice versa... or something like that.

    Working prototype computer about six years away, according to the article.

    --
    "This is not a sig." -- R.
    1. Re:More on the broader project by leery · · Score: 1

      [please strike the "absolutely" from above--i'm not qualified to use that adverb] ...Obviously, announcing this kind of concrete breakthrough is also good for PR, stock price, and future DARPA funding.

      --
      "This is not a sig." -- R.
  35. Transputer dusted off and presented as new? by Anonymous Coward · · Score: 2, Informative

    As usual with alot of Computer Science, this appears to be just an old idea reinvented...the Transputer...and about time too!

    1. Re:Transputer dusted off and presented as new? by Anonymous Coward · · Score: 0

      smith.edu? This chick went to smith: http://www.geocities.com/ferretstuff/Jan2003

  36. ibm storage bricks... by simpl3x · · Score: 1

    ibm has layed out similar plans for modular storage blocks (http://www.google.com/search?sourceid=navclient&i e=UTF-8&oe=UTF-8&q=ibm+storage+bricks) connected by pads on the surfaces. good luck patenting that, unless the application was made a couple of years ago.

  37. Reactive Power by snatchitup · · Score: 1

    Now there's something we EE's know about. (Or not...) We got it wrong in the upper North East with the huge black out.

    Looks like it's even used in the tiny chip to chip communications. Basically, to overcome the impotence caused by the little bit of impedance between the chips, we'll add some capacitance (CAPs). Adding the cap's to ground provides reactive power.

  38. BORING by Mooncaller · · Score: 1

    There is nothing new under the Sun. This concept, along with several others like it have been around for at least 15 years.

  39. And then... by Combuchan · · Score: 1

    Ralston Purina will sue for copyright infringement.

    --
    "[T]he single essential element on which all discoveries will be dependent is human freedom." -- Barry Goldwater
  40. This is not new. by Anonymous Coward · · Score: 1, Funny

    "Perhaps the beginning of a solution to the lag between memory and interconnect speed versus cpu frequency?"

    You mean the problem that everyone outside the PC world already solved? Please people, for your own sake go learn about the alpha architecture. Where all the CPUs connect to other cpus via north, south, east and west. They can all communicate that way, even routing around failed cpus. Then you can start crying when you realize crapaq threw it away.

  41. OT: Why register for /. but not NYT? by Mignon · · Score: 0, Offtopic
    BTW: I didn't RTFA since it requires (free) reg.

    You feel strongly enough about registering at the NYT to mention it in your interesting post, but you registered for Slashdot where registration isn't required for reading articles or even posting. Just curious - what's the difference?

    Respectfully,
    "Mignon"

    1. Re:OT: Why register for /. but not NYT? by Anonymous Coward · · Score: 0
      respect...

      This respect is earned by providing valuable content for me to see _without_ registration. If a site at least gives me some value to prove they have something to say, I might be inclined to register. NYT hasn't.

      For companies that do provide me value, I often _am_ inclined to not only register, but also buy their products (i.e. Red Hat).

  42. Re:check this shizle my nizle by Anonymous Coward · · Score: 0
    malloc allocates off the heap, not the stack, MORON!

    You Java programmers are one step above VB programmers!

  43. ob: Carl Sagan was a hypocritical pothead quote! by Anonymous Coward · · Score: 0
    You are WRONG.

    They're actually made of bits of exploded stars.

  44. Cooling with dielectric? by SkiItIfYouCan · · Score: 1

    I bet they could bath the board in a dielectric liquid that would increase the capacative coupling, while also removing heat more efficiently. Not sure what this liquid would be, but those guys at Sun are smart, they can figure it out.

  45. Heat? by GearheadX · · Score: 1

    I cna't help but wonder just how bad the heat problems could eventually get on a system designed like this. I mean, the Northbridge on a typical PC these days can burn your hand...

    1. Re:Heat? by Firehawke · · Score: 1

      Well, with the speed increase naturally comes heat increases. I'm thinking we're rapidly reaching the point where we're going to be having to use mandatory water cooling or air conditioning-- the whole design will be generating too much heat to be usable otherwise.

      Of course, that'll require a complete redesign of the case scheme for airflow, but BTX might just be up to the task.

  46. good gawd by Anonymous Coward · · Score: 0

    for those of us ignorantly curious enuff to click on the above link... pray for us...

  47. Eerm... weren't they called Transputers back then? by Qbertino · · Score: 1, Informative

    Nothing new here. No?
    I remember seeing the first Transputers on my very first Cebit visit sometime in the early 90s. The Transputer workstations would crunch full screen fractal grafics in seconds, which was an amazing feat back then. Just plain *everybody* was convinced they would put the then ruling Amiga to rest or - also a popular theory back then - would be adapted by Commodore. There is this Transputing PL Ocam that, as far as I can tell, makes Java, C# and all the rest look like kiddiecrap. Everyone who I know who knows Ocam says it rules and usually also has the skills to prove it.
    The overall concept - very much like the one Sun is talking about now - was to stick in a CPU, or 2 or 10 and make the box faster with nearly no decline in perfomance/processor ratio. It actually did work that way.

    Transputers never made it though, to expensive and the required software developement was to esotheric back then. It would be really nice to see this concept rise again. Maybe now they actually would be affordable.

    --
    We suffer more in our imagination than in reality. - Seneca
  48. Already been done with SERDES by StandardCell · · Score: 4, Informative

    If you look at a modern evaluation board with gigabit SERDES or SERialization-DESerialization (e.g. the 3.125Gbit/s differential signal pair per channel), the trace routes are typically rounded, with no square corners. This is done to reduce the effective impedance along the line which needs to be carefully controlled. They also run in parallel closely-routed pairs because it's typically a differential signal. Actually looks a bit like a set of minature train tracks without the railroad ties.

    In fact, multichannel SERDES is the next real interconnect technology. It's used in Infiniband, HyperTransport, PCI Express, Rambus RDRAM and in 10 Gb/s Ethernet (usually as 4x3.125Gbit/s channels as a XAUI interface between optical module and switch fabric silicon with 8b/10b conversion). There are even variants, such as LSI Logic's HyperPHY, that are deployed specifically for numerous high-bandwidth chip-to-chip interconnections. The problem that is cropping up is that the traditional laminate PCBs are becoming the limiting factor in increasing per-channel connectivity, to the extent that 10Gbit/s per channel speeds are next to impossible on these boards due to the lack of signal integrity. There has been some experimentation for very short hops on regular boards, as well as using PTFE resins to manufacture the boards themselves, but it's precarious at best.

    As for Sun's technology, it's interesting but I don't know how much it will catch on or how feasible it will be. It creates packaging issues and requires good thermal modelling and 3-D field modelling to account for expansion and contraction through the operating temperature range and the presence of nearby signals, which could affect the integrity of the signals.

  49. Bad Idea by fatboy · · Score: 1

    Seems to me that not only would such a design cause RF interference, but it would be susceptible to strong RF fields as well. I doubt it could pass part 15 compliance.

    --
    --fatboy
    1. Re:Bad Idea by Anonymous Coward · · Score: 0

      This idea has been floated before and has been shot down before. At the end of the day it is $$$. It is just too expensive. It is cheaper to have a wired interconnect than a wireless one.

    2. Re:Bad Idea by h8sg8s · · Score: 1

      It's capacitive - not inductive. Should have good resistance to stray EMF, etc. Do you get problems with capacitors working in high EMF environments? Nope.

      Good to see some original ideas out there. Chatter aside, I'm happy to see Sun doing *new* R&D as opposed to so many others (Dell, etc) just doing "D."

      --
      Organization? You must be joking..
  50. Not the same interconnect, but... by mwood · · Score: 1

    ...did anyone else remember the MicroJ-11, a PDP11-on-TWO-chips implementation in a single DIP? Two chips wired together on one carrier. (IIRC the floating-point unit was one chip and everything else was on the other.) It got used in cluster storage controllers (HSC70/90) and all sorts of interesting gadgets.

  51. Patent by Anonymous Coward · · Score: 0

    As any EE will tell you, time varying signals can be transferred across a capacitor. This U.S. patent describes an isolation circuit using capacitive coupling to couple signals between two IC chips within the same IC package. Sun seems to be using this type of technology to communicate between IC chips in separate packages.

  52. Re:Eerm... weren't they called Transputers back th by AtrN · · Score: 1
    Firstly its transputer and occam. No capitalization. And occam didn't rule. In many ways it sucked. No data structuring (until occam 2.5 by which time it was too late) and numerous painful limitations to ensure appropriate semantics for checking for parallelism mistakes. occam's variant of CSP was cool but others (Rob Pike) had done it before the Inmos crowd and in some repsects in a better way. The reason the transputer was popular was its floating-point unit which at the time was the dog's balls, the links and relatively high integration (add crystal and away you go). From a s/w point of view they weren't all they were cracked up to be. The T9000 would've been good with the routing done for you but the others had caught up by the time the bugs got ironed out and inmos dropped the ball.

    Oh, BTW this Sun stuff is nothing like transputers and their links. It's chip-level interconnect to avoid pads.

  53. Re:check this shizle my nizle by Directrix1 · · Score: 1

    Maybe its inside a function with parameters, which recursively calls itself, unconditionally.

    --
    Occam's razor is the blind faith in the natural selection of least resistance and in universal oversimplification. -- EF
  54. Computers without Clocks by rpiquepa · · Score: 1

    Ivan E. Sutherland has always been a great thinker. An article about asynchronous computers fascinated me last year. You can find more details here. And you can count on him for real products to come.

  55. Re:check this shizle my nizle by Directrix1 · · Score: 1

    You C programmers are one step above the end of the buffer.

    --
    Occam's razor is the blind faith in the natural selection of least resistance and in universal oversimplification. -- EF
  56. Re:It will be running java - The Best Comment ! by SlashingComments · · Score: 2, Funny

    Great ! I am so happy to see that there are some real programmers exist who see the truth. I have seen our Sun E3500 with 8 CPUs felt like a pentium pro with java shit running on it. But it was management's vision ... what we can do. I just procured the servers and pretended that I am doing social work by giving Sun more money.

    --

    - People who believe other people have no right to live, got no right to live ...

  57. Magnifying Glass? by nullman · · Score: 1

    If that's a magnifying glass, it's not a very good one. Their hands look like the size of... their hands!

  58. Future Applications by Anonymous Coward · · Score: 1, Informative

    This is very, very interesting.

    I/O limitations of traditional chip architectures prevent us from building truly large-scale hardware neural network systems. To achieve the connectivity required to model a net as complex as the human brain's, it's not enough to link up an array of small neural chips,. because you hit a bandwidth bottleneck as soon as you try to go off-chip. This limits neural architectures to simple, regular block-structured models.

    These chips of Sun's only meet up at the edges, but (assuming advances in reduced power usage and heat dissipation technologies) imagine if this was extended to provide connectivity on all exterior surfaces of the package? You could build neural networks of arbitrary size that weren't I/O bound.

    This would enable truly "brute force" approaches to connectionist AI, and quite possibly something capable of human-level intelligence in real time.

  59. Actually, SUN Micro provides ... by quarkscat · · Score: 0

    compliance with the US Patriot Act
    and "Pentagon John" Poindexter's
    TIA (Total Information Awareness).

    WTF, who would even need a "wiretap"
    when a loop antenna and a LNA will do?

  60. Re:check this shizle my nizle by Anonymous Coward · · Score: 0

    there's no "heap". learn some assembly. (heap is only a concept)

  61. What about crosstalk? by quarkscat · · Score: 0

    and the next new SUN Micro innovation will
    be a wireless mind-machine brain implant?

    1. Re:What about crosstalk? by EmbeddedJanitor · · Score: 1
      The crosstalk is likely to be quite low since capacitive effects drop off with the square or cube of distance (I forget which).

      The limitation of this technologyy is really how to place the correct components adjacent to each other to get the capacitive coupling working. ie, you wont be able to build buses with these devices.

      As people have pointed out, a major power consumer in most chips (excluding Pentiums!) is charging the pin driving circuit that has to charge the pins and pcb tracks. These currents are also the largest source of electomagnetic interference etc. The capacitive coupling could help to reduce both these effects. I say "could", since used incorrectly, these could also get worse!

      --
      Engineering is the art of compromise.
    2. Re:What about crosstalk? by William+Tanksley · · Score: 1

      The crosstalk is likely to be quite low since capacitive effects drop off with the square or cube of distance (I forget which).

      I always thought it was the cube -- but the recent article on inductive coupling claimed that it's the inverse _sixth_ power. Wow.

      I find it utterly facinating that we've seen two companies release exploits for E and M fields so close together :-). And they're entirely appropriate, given the drawbacks of each:

      E fields are very easy to generate and tap, but very hard to control in the presence of foreign objects (i.e. almost anything can interfere with them. So Sun uses them in an environment containing no foreign objects -- problem solved!

      M fields are easy to generate and control, but hard to tap due to their alignment -- so we use them where we can afford the bulk of a generator.

      E and M fields are wonderfully complimentary. I would say that they serve orthogonal purposes, but if I did that I would be making a physics pun, and I would never do that.

      -Billy

  62. you don't grasp what a hard disk is by Anonymous Coward · · Score: 2, Insightful

    You need more training. Or less ego.

    Look at a recent P4 motherboard for 45 degree traces. Look at any previous motherboard with RAMBUS (even an Nintendo 64 from November 1999) for curved traces.

    It's not so much a question as knowing about something as it is implementing it. If it isn't affordable, it isn't worth it. Because if it isn't affordable, you might be able to buy two affordable ones for the same price. And you're going to have trouble beating the performance of two systems with one.

    Finally, to make a hard drive from RAM is to totally lose track of the idea of what a hard drive is. Hard drives are supposed to be slower but they make it for it with lower cost per megabyte. Instead of a RAM drive, just put more RAM in your machine, it will use it as a disk cache/backing store and get you all the performance you want.

    Also, at 120us per command register access, you really cannot initiate any transfer over ATA in under 0.75ms.

  63. Been there, done that. by Anonymous Coward · · Score: 0

    www.xanoptix.com: making Hybrid ICs with optical interconnects. (Their proof-of-concept is a hybridized parallel optics transceiver that runs 72 channels at 3 1/8 gb/s.) There had been a handful of other competitors -- check out lightreading.com for some of them -- but most lost funding and/or momentum, while Xanoptix has been going pretty much at-speed.

  64. Sun chips by Slime-dogg · · Score: 1

    With all this talk of Sun Chips, is anyone else hungry? I wonder if they'll produce a ranch version.

    --
    You need to restart your computer. Hold down the Power button for several seconds or press the Restart button.
  65. You may also be interested in by dmelomed · · Score: 1

    Chuck Moore's 25x MISC stack machine technology.

    See www.colorforth.com, and www.ultratechnology.com for more information on this overlooked, and underrated stuff.

  66. Surveillance? by mmol_6453 · · Score: 0, Troll

    I'm concerned about new applications in surveillance.

    While I'm sure remote gap-dropping is a long ways away, what if someone were able to place a sensing device on top of the gap between the chips? Who needs to decrypt someone's hard drive if they can just log memory transfers?

    It's not like similar techniques aren't already in place. For example, I'm certain I heard about a case where someone put a keylogger on a mobster's computer. I know someone's used keyloggers at my college campus.

    --
    What's this Submit thingy do?
    1. Re:Surveillance? by randyest · · Score: 1

      Psst -- tin-foil hat guy: I hate to break it to you, but these connections between the chips you're worried about already exist in all the electronics you already own. They're even bigger (especially longer) and easier to tap, and unlike this new technology, they have nice big output drivers with big, easy to read signal swing voltage. They're called traces, sometimes even simple wires, and you're gonna need a lot of tinfoil to cover up all that surface area -- please be careful around the powere supply. ;)

      In any case, if you're going to do anything with whatever you might glean by eavesdropping between two chips, you're going to need a fast processor and lots of storage nearby (at least 2x max data transfer between chips) plus some very big, very expensive, and very delicate test leads. I think I'd notice if all 20-lbs. of a HP83K test-harness was hanging out of my box in all of its nightmare-spagetti glory.

      --
      everything in moderation
    2. Re:Surveillance? by g!sys1 · · Score: 1
      There was an interesting talk by IBM researchers Agrawal, Rao and Rohatgi at the CHES 2003 workshop two weeks back in Cologne.

      They presented multi-channel attacks on just about any computing device that is not properly shielded, simply using an ordinary short wave radio receiver. They recorded the electromagnetic emissions on different frequency bands to determine the internal operations, and hence, bits of a secret encryption key.

      For the demonstration they used a Palm handheld computing an Elliptic Curve signature and held the microphone next to the radio receiver. You could easily make out the different bleeps which correspond to the different EC Point operations.

      They were also able to listen in on an SSL acceleration chip in one of their servers, from several feet away (outside on the parking lot). What worries me the most is, that they say that proper shielding is non-trivial, since EM escapes along the power lines.

      Side-Channel attacks have been at the focus of research in Cryptography since 1996.

      Gunnar

  67. Tera Or Tibi? by henele · · Score: 1

    Is that...

    1,000,000,000,000 bits per second

    or

    2^40 bits per second?

    Theres a whole bunch of bits per second difference there...

  68. THank God! by Ken+Broadfoot · · Score: 1

    "Perhaps the beginning of a solution to the lag between memory and interconnect speed versus cpu frequency?"

    Thank God! This was something that used to keep me up at all hours of the night. A solution to this problem will change the world. It may even stop the RIAA from suing young girls and stop global warming. Thank you for letting me sleep at night once again.

    WTF?

    --ken

    --
    Bitcoin pyramid: Join here: http://www.bitcoinpyramid.com/r/1427 it's FREE!
  69. Increase yield? by mmol_6453 · · Score: 2, Interesting

    ...in particular chips that may require different manufacturing processes.

    Or at least portions of more complex circuits where part of the circuit may not warrant the added cost of SOI, 90nm, or strained silicon.

    But then, those divisions are already made. AMD, for one, is working on recombining those parts. As an example, consider AMD's putting the memory controller on the CPU die.

    I am curious, however, as to whether you could have more than one silicon die in the same ceramic casing. This would let you combine different parts made using different techniques. The CPU and L1/L2 caches could be on a high-cost process, with the L3 cache and memory controller being built with cheaper processes.

    Placing more emphasis on cost savings than on performance, you could build the L1 instruction cache with a slower process than the L1 data cache. Or you could leave all of L1 on the same process and split L2 into instruction and data caches, with different processes.

    If you wanted to ramp up CPU speed without a major hit to your performance, you could reduce the feature size of the core and L1 caches, and put L2 on a slower, more reliable process. That way, you could ramp up core speeds without having as much worry about yield loss from cache failure.

    Helluva way to increase yield, and it gives the designer a LOT of options.

    --
    What's this Submit thingy do?
  70. We should all give up now ... by Augusto · · Score: 1

    ... speed doesn't matter anymore.

    I wonder why these dummies in these corporations spend so much money in high performance computing research, man if only they would listen to Carlos.

    After all, it's not like this will make Mine Sweeper or the Windows Calculator program any faster. Speed doesn't matter people, give it up. Let's just dedicate ourselves to farming.

    --

    - sigs are for wimps.
    1. Re:We should all give up now ... by ccp · · Score: 1


      You wonder... let me explain to you:

      The dummies are the people that think a 2 MHz CPU will be ppreferable to the 1.4 MHz they have now, and upgrade.

      The corporations aren't dummies, they are very smart in fleecing the market.

      Maybe you need some super-duper machine, but for 99% of people it just doesn't make sense.

      Cheers,

  71. HOLY FUCKING SHIT by pr0ntab · · Score: 0, Offtopic

    YOU GET +5 FOR STEALING MY COMMENT FROM AN OLD ARTICLE!!!

    (thanks rei)

    http://slashdot.org/comments.pl?sid=78540&cid=6962 967

    How many other +5 informatives did you steal, asswipe?

    --
    Fuck Beta. Fuck Dice
  72. Object OCCAM by PurpleWizard · · Score: 1

    Sounds like I should finish my Object Occam language ready for Suns work to be on general release:-)

  73. Evans & Sutherland by Bowling+Moses · · Score: 1

    The lab next door to me has an old Evans & Sutherland ESV graphics processor. It's the size of a small dorm fridge and it was built in '90 or '91, and is still working...as an end table, after being retired in '98. But I understand that a few are still doing graphics work today, despite (I think) not being Y2K compliant. Anyway, I like knowing that the age of 65 Dr. Sutherland's still out there working since back in the day apparently he did a lot of cool work.

  74. Mr. Ophidian Jones... by Anonymous Coward · · Score: 0

    n./t.

  75. ...is a huge fag and... by Anonymous Coward · · Score: 0

    n over t

  76. Feasible to incorporate DRAM with CPU? by BlueBiker · · Score: 1

    A possibly naive question: Would it make any sense to include some substantial amount of main [non-cache] memory on the CPU die or in the same package? If it were say 128MB to 256MB or so, that would likely be enough to cover the working set of a typical consumer desktop machine. A NUMA-aware OS could be smart enough to migrate data structures to this faster 'on-chip memory' from external DIMMs.

    In other words, if it's too difficult or expensive to improve latency of all memory, is it cost-effective to do so for the system's most frequently or LRU data?

  77. Only talking to its neighbor? by Pork-Chopper · · Score: 1

    As I figure it, this causes more problems then it solves. I mean you can arrange chips in a checkerboard patter or star, or whatever and they can capacitively couple to each other, but you have no control over which chip capacitively couples to which. I guess this would work for a synchronous system wide bus where everyone only has to know about what's on the data bus, but most current architectures have more than one bus. How do you direct who reads what bus if the bus is just 'there'.

  78. ... he plagarizes for karma. by Anonymous Coward · · Score: 0

    [n/t]

  79. Still +5 Informative? MOD THIS DOWN!!!! by pr0ntab · · Score: 0, Offtopic

    Christ, are you mods on crack?

    Who the frick modded it up AFTER I posted the links the prove he plagarized that comment word for word from an article I replied to a week ago THAT HAS LITTLE TO DO WITH THIS ARTICLE.

    Mary, Mother and Joseph are you dumbfucks retarded?!?

    --
    Fuck Beta. Fuck Dice
  80. Re:Eerm... weren't they called Transputers back th by Anonymous Coward · · Score: 0
    Oh, BTW this Sun stuff is nothing like transputers and their links. It's chip-level interconnect to avoid pads.

    Which has already been done to death. I myself am doing this at the circuit level for my thesis (On the circuit level is has been done for 10 years). It's calle neuMOS.

  81. Re:It will be running java - The Best Comment ! by Anonymous Coward · · Score: 0

    No end to the line of excuses for writing bad code...