Slashdot Mirror


Clearspeed Makes Tall Claims for Future Chip

Josuah writes "ClearSpeed Technology announced today a new multithreaded array processor named the CS301. Their press release states the chip can achieve 25Gflops for only 3W of power. New Scientist and TechNewsWorld have articles on this chip, each with more information about the chip. I wondering if this is too good to be true." The key phrase is in the Wired story: "Soon to be in prototype, the chip...". "Soon to be in prototype" is synonymous with "does not exist".

254 comments

  1. Who needs that much speed anyway? by Mycroft_514 · · Score: 0, Redundant

    Besides, it isn't real until it is used in some computer somewhere

    1. Re:Who needs that much speed anyway? by sixdotoh · · Score: 0

      "Who needs that much speed anyway?"

      Remember the days when people in the industry made statements such as "512kb of RAM should be more that enough for anyone"? That was about a decade ago. Just wait a few years.

      --

      This post was brought to you by the number 584811 and the characters / and .

    2. Re:Who needs that much speed anyway? by Anonymous Coward · · Score: 0
      Who needs that much speed anyway?

      Just yesterday it was, "Who needs a 300GB disk drive."

      Me. We do SAR radar processing. Computers does a lot more that most Slashdotters seem to reallize.

    3. Re:Who needs that much speed anyway? by Carnildo · · Score: 1

      No, it isn't real until someone's started the Linux port.

      --
      "They redundantly repeated themselves over and over again incessantly without end ad infinitum" -- ibid.
    4. Re:Who needs that much speed anyway? by aliens · · Score: 1

      You're right, I will gladly trade you my Packard Bell with the way too much 2megs of RAM and the too fast CPU 286 12MHz for whatever way too fast new computer you're using.

      No need for multitasking when you got DOS 3.0

      --
      -- taking over the world, we are.
    5. Re:Who needs that much speed anyway? by tinrobot · · Score: 1

      We do animation for film and video. I'll be patiently wating for the Mental Ray port. Real-time global illumination sounds like a great use for these puppies.

    6. Re:Who needs that much speed anyway? by elphkotm · · Score: 1

      What? The only purpose of computers is to process HTML, Web Graphics, and MP3s.... and for compiling the latest l33t k3rn3l p4tch3z.

      --

      <Amanda`> I just went out to the parking lot in my bathrobe to exchange warez CDs.
    7. Re:Who needs that much speed anyway? by Anonymous Coward · · Score: 0

      Anyone who uses that stupid fucking quote immediately goes to my foes/ignore list. Its taken out of context, you got the quote wrong, and it doesn't prove SHIT ALL.

  2. In other news... by Ikeya · · Score: 3, Funny

    Today it was announced that Duke Nukem Forever would be optimized to run on the new CS301 processor develpoed by a new firm called ClearSpeed Technology. It is said that with this newfound processing speed, Duke Nukem Forever will be the most realistic game ever realeased.

    --
    ---- Move SIG...For great justice!
    1. Re:In other news... by Anonymous Coward · · Score: 0

      >Duke Nukem Forever will be the most realistic game ever realeased.

      I've said it once and I'll say it again : it's called Duke Nukem Forever because it's the amount of time it'll require to code the damn thing. There's no release date for that same reason!

    2. Re:In other news... by sixdotoh · · Score: 2, Funny

      In a similar article, Microsoft released a statement saying they are pushing back the release date of "Longhorn" until the CS301 is ready for home desktop use.

      --

      This post was brought to you by the number 584811 and the characters / and .

    3. Re:In other news... by Anonymous Coward · · Score: 0

      Ahh, well you're obviously a very witty individual.

    4. Re:In other news... by Anonymous Coward · · Score: 0

      Gee, that joke didn't sound forced. Forgive me for not laughing.

    5. Re:In other news... by insanecarbonbasedlif · · Score: 1

      >Duke Nukem Forever will be the most realistic game ever realeased.

      I've said it once and I'll say it again : it's called Duke Nukem Forever because it's the amount of time it'll require to code the damn thing. There's no release date for that same reason!


      But will you say it...

      ...Forever?

      --
      Just because I doubt myself does not mean I find your position compelling.
    6. Re:In other news... by Paradise+Pete · · Score: 1
      it's called Duke Nukem Forever because it's the amount of time it'll require to code the damn thing.

      Wait. I don't get it.

    7. Re:In other news... by Anonymous Coward · · Score: 0
      You can take the AOL user out of AOL, but you can't take the AOL out of the user.

      However, I do have to commend you on your exceptional use of one punctuation mark. Your frugality is impressive and should serve as an example for future generations.

    8. Re:In other news... by NanoGator · · Score: 1

      "In a similar article, Microsoft released a statement saying they are pushing back the release date of "Longhorn" until the CS301 is ready for home desktop use."

      And, in yet another article, the Mozilla team announced that version 2 of...

      --
      "Derp de derp."
    9. Re:In other news... by CaseyB · · Score: 1

      The game will surely be even better when you run it on the upcoming Bitboys Oy graphics cards!

  3. Soon to be in PROTOTYPE?? by Deflagro · · Score: 1

    Why would they release a story on something that isn't even in prototype? Seems silly to me. I have plans for a 200GHZ chip, but I still have to make a prototype, film at 11!
    It would be interesting though.

    --
    Der Tod ist der einzige Weg hier raus!
    1. Re:Soon to be in PROTOTYPE?? by Anonymous Coward · · Score: 0

      Yeah..

      The design seems to be different from previous ones, but four months ago I saw a presentation for a design that used the same techniques (ok, I _think_ the two has a lot in common, but the ClearSpeed info is pretty sparse).
      That design had working silicon back then! And a thick portfolio of pending patents to protect it (probably the reason they dared present it)...

      Same stuff; lotsa small cores with local storage using an advanced crossbar for communications. They even had a similar thread concept.

      /B

  4. Soon to be in prototype. by Gutboy_Barrelhouse · · Score: 1
    "Soon to be in prototype" is synonymous with "does not exist"

    Oh, right on. It's about time someone started developing a mass-market Loch Ness monster.

  5. So theoretically... by captain+igor · · Score: 1

    We could put 32 or so of these in a computer and generate the same ammount of heat as, say a Pentium IV but with almost a Terflop of performance? This strikes me as too good to be true...

    1. Re:So theoretically... by Anonymous Coward · · Score: 0

      You can already achieve more computing power for the same watts by simply using VIA C3 processors (yes, they may be under-powered compared to a P4, but if you can use, say, 20 or 30 C3's in place of a single P4, the P4 has no chance in hell).

    2. Re:So theoretically... by pmz · · Score: 1

      We could put 32 or so of these in a computer and generate the same ammount of heat as, say a Pentium IV but with almost a Terflop of performance? This strikes me as too good to be true...

      Some of the special-purpose GPUs can probably make this claim. "We can do 1TFlop...as long as it consists only of function X." Sun's MAJC was advertised as doing 6GFlops for two cores four years ago, so scaling something similar to 1TFlop today doesn't seem to be totally out of reach (40 CPUs would probably do it).

  6. "Soon to be in prototype" by psyconaut · · Score: 4, Informative

    Chips are virtually fabricated and tested well before the first bit of silicon is etched....you can actually be pretty sure of both a chips performance and reliability just from simulations these days. Also, having to etch development chips constantly is both expensive and time consuming....so the longer you can leave a design in virtual space, the better.

    -psy

    1. Re:"Soon to be in prototype" by Wyatt+Earp · · Score: 1

      Like those Russian CPUs that were "soon to be in prototype" back in 98-99 that were going to kick Intel and AMDs ass in the 2000-2002 time-frame.

    2. Re:"Soon to be in prototype" by Anonymous Coward · · Score: 0

      One of these days have the chip in 'virtual space' will be sufficient. You will be able to apply data to the virtual chip and get processed data out, in high speed because millions of copies of these virtual chips will be all over the net. WHo needs hardware then?

    3. Re:"Soon to be in prototype" by psyconaut · · Score: 1

      I was just explaining that it's not unusual to finish a design and even know it's performance before you fab a single piece of silicon.

      I'm not justifying their claims ;-)

      -psy

    4. Re:"Soon to be in prototype" by Anonymous Coward · · Score: 0
      Here's a clue : those kinds of simulations never run at anywhere near 100%.

      The only way for a simulation to outpace the acutal physical reality is by simplification, and therefore, loss of detail. That's not gonna do you any good simulating digital devices.

    5. Re:"Soon to be in prototype" by Canthros · · Score: 1

      Um .... If you're doing grid computing on virtual microchips over the Internet, remind me again what you're processing and transferring data with? Or have you managed to get a net-capable port of NetBSD using an abacus, two tin cans and some string?

      --
      Canthros
    6. Re:"Soon to be in prototype" by PD · · Score: 1

      A few more posts like that and I may just find my Buddha nature.

    7. Re:"Soon to be in prototype" by fireboy1919 · · Score: 1

      This is not exclusively true. When writing stuff in Verilog for use with Synopsys or Cadence, for example, you don't get to specify quantum effects (though CPU chip designers wouldn't be using something so high-level for most of the design).

      Quantum effects are what you get with such small transistors, and the interactions are a lot harder to predict than is, for instance, the adherence of a transistor to it's response curve.

      You can do an okay job, yes, but that doesn't mean that the thing is going to positively always work as well as you want it to.

      And don't forget the problems with the fabrication process. How sensitive to errors is your chip to subtle flaws in the fabrication? This is a question that people are still looking for a very reliable way to simulate.

      --
      Mod me down and I will become more powerful than you can possibly imagine!
    8. Re:"Soon to be in prototype" by borum · · Score: 1

      I've seen one of those in silicon.
      They appear to aim for low power.

      http://www.synputer.com/

    9. Re:"Soon to be in prototype" by Carnildo · · Score: 1

      I have, but it's a damned expensive abacus!

      --
      "They redundantly repeated themselves over and over again incessantly without end ad infinitum" -- ibid.
    10. Re:"Soon to be in prototype" by Anonymous Coward · · Score: 0
      I'm not justifying their claims ;-)

      Then I fail to see what point you were trying to make.

    11. Re:"Soon to be in prototype" by Epistax · · Score: 1

      I agree. I would wonder, however, what percent yield they expect to get on these chips.

    12. Re:"Soon to be in prototype" by zx75 · · Score: 1

      Yes, its the honesty of the company we should be questioning, not the reliability of their tests.

      --
      This is not a sig.
    13. Re:"Soon to be in prototype" by psyconaut · · Score: 1

      I'd be more worried about the fact WiReD article talks about slapping these puppies all on a PCI bus... ;-)

      -psy

    14. Re:"Soon to be in prototype" by Anonymous Coward · · Score: 0

      Thus you are stupid.

      //EOF

  7. Key phrase? by slipgun · · Score: 1

    The key phrase is in the Wired story...

    No, the key phrase is this is too good to be true

    --
    SpamNet - a spam blocker that really works
  8. This weekend! by Anonymous Coward · · Score: 0


    Saturday.. Saturday..Saturday!!

    Clearspeed will tear through the competition with their awesome megachip the CS301!!

    3 WATTS of ThrowYouBackInYourSeat Power! Twice the speed! Twice Twice! Twice!

  9. Co processor by key134 · · Score: 1, Insightful

    When it comes to market, the chip will likely be sold to consumers as a co-processor -- an add-on PCI card that works in parallel with a PC's main processor

    It's not replacing our current processors. It is just helping them with intensive floating-point calculations. Is that really going to be helpful to the average user? Keith

    1. Re:Co processor by Arker · · Score: 2, Insightful

      Everything old is new again... eventually.

      From reading the articles, it seems it is indeed designed to be a math coprocessor. Since the Pentium came out, those have been out of style. The Pentium effectively included a 80487 on the same die, and on other architectures that was done even earlier. But now it comes back - only now the idea is a far more powerful coprocessor for scientific functions.

      No, it's not going to be very helpful to the average users. But for those of us that spend a lot of time using our computers to do complex mathematical calculations, it could be damn helpful, if it turns out to be anywhere near as powerful as they claim it will be.

      --
      =-=-=-=-=-=-=-=-=-=-=-=-=-=-
      Friends don't let friends enable ecmascript.
    2. Re:Co processor by grub · · Score: 1


      It's not replacing our current processors. It is just helping them with intensive floating-point calculations. Is that really going to be helpful to the average user?

      Considering their initial chips (if and when actually made) will be selling for $16,500 I don't think the "average user" is their initial intended market. They'll go to the clustering people and research folks first.

      --
      Trolling is a art,
    3. Re:Co processor by Josuah · · Score: 1

      From reading the articles, it seems it is indeed designed to be a math coprocessor.

      ClearSpeed is saying that it can run as a coprocessor, but also standalone. From their press release:

      "The CS301 can serve either as a co-processor alongside an Intel or AMD CPU within a high performance workstation, blade server or cluster configuration, or as a standalone processor for embedded DSP applications like radar pulse compression or image processing."

    4. Re:Co processor by Vaginal+Discharge · · Score: 1

      Hmm... A PCI board you say? If it does do Gigaflops of calculation, then pray say what kind of bandwidth do you think it's going to take to pipe this massive amounts of data to the chip in the first place? What about memory? Cache misses? Page faults? on a PC? I can make a jet engine that has 100 kN of thrust, but that doesn't mean I should strap it on a Toyota Camry.

      I vote hype.

      --
      "Glory is fleeting but obscurity is forever" - Napoleon Bonapart.
    5. Re:Co processor by cgb8176 · · Score: 1
      No, it's not going to be very helpful to the average users.

      Sort of like the Nvidia/ATI co-processor I use to play video games?

      When one reads "scientific computation", one automatically thinks, "not average user". But try reading it as signal processing, i.e. voice recognition. Or image processing, i.e. face/object/target recognition.

      The computers promised to us by decades of sci-fi aren't going to have a single processor. They'll have one for sound synthesis (Sound Blaster), one for image synthesis (ATI/Nvidia), one for voice recognition (special purpose DSP). And then they'll have one on the side, for wimpy stuff like your OS and your web browser.

    6. Re:Co processor by 33degrees · · Score: 1

      It's not replacing our current processors. It is just helping them with intensive floating-point calculations. Is that really going to be helpful to the average user?

      Hey, this is slashdot, we don't care about the average user.

    7. Re:Co processor by Thomas+A.+Anderson · · Score: 1

      For the last couple of years I've felt that voice recognition was the next "killer app" that would push computer hardware. This chip, if it's possible to make, could give us the computational ability to have verbal conversations with our computers.....

      Either that - or it will help me become #1 on seti@home's list of data crunchers....

      --
      Personally its not God I dislike, its his fan club I cant stand (bash.org)
  10. Seems obvious. by Prince_Ali · · Score: 1
    I wondering if this is too good to be true.

    I thinking it is!

    1. Re:Seems obvious. by FyRE666 · · Score: 1

      How you know this?

    2. Re:Seems obvious. by Anonymous Coward · · Score: 0

      It's a story on Slashdot.

  11. Skeptical by cybermace5 · · Score: 4, Funny

    As well as the fact that I've seen this press release trolled by AC's on Slashdot.

    25Gflops on 3W? That must be some unorthodox technology at work there. Anyone hear anything about some research corporation finding an amazing processor in a robot from the future?

    --
    ...
    1. Re:Skeptical by grub · · Score: 1


      25Gflops on 3W?

      Clearspeed forgets to mention the several hundred litres of expensive liquid helium needed to make this possible. :)

      --
      Trolling is a art,
    2. Re:Skeptical by Anonymous Coward · · Score: 0

      >Anyone hear anything about some research corporation finding an amazing processor in a robot from the future?

      We didn't find anything in our factory, we're not hiding anything, we're just able to innovate more than our competitors.

      We do have a robotic arm for sale though.

    3. Re:Skeptical by fussman · · Score: 0

      Well, that basic plot starter has been in many, MANY scf fi films and tv shows. It's about time it actually happened in real life.

      --
      Support Israeli punk bands. Man Alive.
    4. Re:Skeptical by ComputerSlicer23 · · Score: 1
      Uhhh, if you had liquid helium, that means you'd be sucking off a lot of heat, which means lots of heat is generated, which means lots of energy was put into the system. Which means it used more then 3Watts.

      More likely, it only uses 3W while in the low power mode, making no calculations or some such... :-)

      Kirby

    5. Re:Skeptical by gl4ss · · Score: 1

      well..

      if you have only one possible flop(floating point op? i dunno, i've never bothered to check on these usually fairy tale figures beyond bogomips).

      for example, you can only add 0.001 to 0.001, but you can do that 25 000 000 000 times a second..

      --
      world was created 5 seconds before this post as it is.
    6. Re:Skeptical by cybermace5 · · Score: 1

      No, they forgot to mention that it removes any stain known to man, restores rough and damaged skin, makes an old corroded penny bright and shiny as new, reverses aging AND tones your abs, what would you expect to pay for this? $199? no. $99? no. All for the low low price of $19.95 per month, but WAIT, call now and receive a second ClearSpeed absolutely free, that's right, an $80 value at no additional cost, and THAT'S NOT ALL, mention that you saw this program and you will get a complimentary terry-cloth bathrobe COMPLETELY FREE + shipping and handling, call now to receive this amazing one-time offer, operators are standing by!

      --
      ...
    7. Re:Skeptical by Anonymous Coward · · Score: 0

      More likely, it only uses 3W while in the low power mode, making no calculations or some such... :-)

      You mean, when they have the power turned off?

    8. Re:Skeptical by grasshoppa · · Score: 1

      I'm betting it's more along the lines of entropy calculations.

      You know, the numbers that can be derived from the numbers that the chip produces? Yeah.

      ( :) - For the humor-impaired )

      --
      Mod me down with all of your hatred and your journey towards the dark side will be complete!
    9. Re:Skeptical by the_consumer · · Score: 1

      Or, the chip might be manufactured using low-temperature superconductors.

      --
      "If you're thinking what I'm thinking, you're right." -
    10. Re:Skeptical by sageFool · · Score: 1

      > Anyone hear anything about some research
      > corporation finding an amazing processor in a
      > robot from the future?

      Where have you been? The robot is currently running the state of California, which needs cash to ease it's budget crisis. See, it all fits!!

    11. Re:Skeptical by bill_mcgonigle · · Score: 1

      That must be some unorthodox technology at work there. Anyone hear anything about some research corporation finding an amazing processor in a robot from the future?

      Don't be silly, that's just a movie. It was found at Roswell.

      (or the MiB need more budget - oops I'm guilty too)

      --
      My God, it's Full of Source!
      OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
    12. Re:Skeptical by ed333 · · Score: 0

      Anyone hear anything about some research corporation finding an amazing processor in a robot from the future? It's been donated from that spammer who was looking for a AMD time warp generator. Seems he ahd to trade some of his uber future technology in order to get back home.

    13. Re:Skeptical by GreyPoopon · · Score: 1
      No, they forgot to mention that it removes any stain known to man, restores rough and damaged skin, makes an old corroded penny bright and shiny as new, reverses aging AND tones your abs...

      What, it's not a dessert topping, too? Well then, I'm not interested.

      --

      GreyPoopon
      --
      Why is it I can write insightful comments but can't come up with a clever signature?

    14. Re:Skeptical by akuma(x86) · · Score: 1

      There is no magic here. This processor will not run general-purpose code very well, but it will scream on regular, repetetive code with very predictable memory access patterns. DSP kernels, some types of encryption algorithms, blah blah blah...

      It's a very power efficient way to run these kinds of applications.

      Do a google search on the Stanford Imagine project for some academic background.

    15. Re:Skeptical by Anonymous Coward · · Score: 0

      Geeze, at least give 'em a bloody link!

    16. Re:Skeptical by 24-bit+Voxel · · Score: 2, Funny

      Sitting somewhere in infinite isolation, Marvin the Robot sits and sighs in abject misery. He ponders the loss of his right arm; parts from it used to spit out nothing but 1's and 0's in a small beige box. 1's and 0's, 0's and 1's. Marvin let's out a small mechanical sigh of solitude and begins counting backwards from infinity to 0, in binary.

    17. Re:Skeptical by You're+All+Wrong · · Score: 1

      Thy mention 64 processing elements, so 25GFlops is only 390MHz.
      390MHz is a pretty slow clock speed. Maybe there's _also_ a DMA
      controller moving data around, and every number that gets moved
      into a PE or moved out of a PE counts as a FlOp too.

      It doesn't sound unreasonable. I remember in the 90s working with TI c80MVP chips which had an integer DSP, 4 multiple-issue (OK, LIW) FP DSPs, and one DMA controller. For only a 66MHz piece it was cranking out nearly 1GOps.

      YAW.

      --
      Your head of state is a corrupt weasel, I hope you're happy.
    18. Re:Skeptical by You're+All+Wrong · · Score: 1

      Possibly fantastic for the sieving stages for factoring the RSA challenges. No number of nodes is too many for a task like that.
      (where each has comparatively low processing power)

      YAW.

      --
      Your head of state is a corrupt weasel, I hope you're happy.
    19. Re:Skeptical by be-fan · · Score: 1

      Actually, you can expect to pay about $17,000. Doesn't sound so unrealistic now, do it?

      --
      A deep unwavering belief is a sure sign you're missing something...
    20. Re:Skeptical by Weirsbaski · · Score: 1

      I wondering if this is too good to be true

      If someone has to ask, it probably is.

      --

      I am not a sig.
  12. Coprocessor Price by ChrisBrown1 · · Score: 1

    Only $16,000! I'll take two!
    But where's the desktop bus bandwidth supposed to come from? I Think it'll choke on my PC133 RAM. Whatever desktop machine they're targeting is what I want for Christmas.

    1. Re:Coprocessor Price by Lieutenant_Dan · · Score: 1

      Another company, called "ClutterTech" has conceptual designs for a 4.543 TeraFlop RAM module. Unfortunately it comes only in 8,192 byte SIMMs. They're still working on it.

      --
      Wearing pants should always be optional.
    2. Re:Coprocessor Price by ChrisBrown1 · · Score: 1

      8,192 Bytes? Yeah, That's what my RCA COSMAC ELF 1802 Hex Keypad had. It sure could have used that upgrade. Maybe faster RAM would have sped it up.

    3. Re:Coprocessor Price by bahamat · · Score: 1

      Here's my favorite line from the article:

      Putting around 20 ClearSpeed chips into a few personal computers could potentially provide the sort of power normally only found in a supercomputer built from hundreds of parallel processors or specialised hardware.

      Yea, that's right. A $16,000 comodity processor.

  13. I remember the quotes of 256KB of ram and 10MB(+) by Mycroft_514 · · Score: 1

    of disk space will be all anyone would ever need. But really, what requires 25G flops?

    Maybe if we decide to model "Life, the Universe and Everything?"

  14. Reminds me.... by Jaysyn · · Score: 1

    .... of Starbridge systems....

    Jaysyn

    --
    There is a war going on for your mind.
    1. Re:Reminds me.... by Jaysyn · · Score: 1

      I take that back, apparently they aren't vapor like I thought. NASA's bought a couple...

      Jaysyn

      --
      There is a war going on for your mind.
  15. echoes of Transmeta by mblase · · Score: 3, Insightful

    I'm reminded of all the promises we heard for the Transmeta chip, only a fraction of which are being realized. And they have an actual product to demonstrate, mind you.

    Yeah, it sounds like wishful thinking. I have little faith in processors from unknown companies that claim to do what Intel, AMD and IBM combined haven't yet been able to achieve.

    1. Re:echoes of Transmeta by GooberToo · · Score: 1

      I'm reminded of all the promises we heard for the Transmeta chip, only a fraction of which are being realized. And they have an actual product to demonstrate, mind you.

      Really? From what I've read they delivered on everything they said they would/could do. What didn't they deliver on and where was it said they did something that didn't materialize?

      I'm not trolling here, I just curious.

    2. Re:echoes of Transmeta by Anonymous Coward · · Score: 0
      I just curious.

      I think there something missing.

    3. Re:echoes of Transmeta by fuqqer · · Score: 1

      As for unknown companies doing stuff that IBM, Intel, and AMD aren't...Xilinx anybody? Check out Star Bridge Systems and their computer at NASA's Langeley Research Facility.

      I've yet to see a Transmeta box in action, but I know their chips were in some of the first blade systems a year or two ahead of HP's or any other major manufacturers. You definitely can't deny that they've been successful.

      Maybe I'm a tech optimist, but I'd be willing to put money on Clearspeed's technology. It sounds cool to boot.

      -non sig here-

    4. Re:echoes of Transmeta by Colin+Douglas+Howell · · Score: 1
      I have little faith in processors from unknown companies that claim to do what Intel, AMD and IBM combined haven't yet been able to achieve.

      Well, Intel, AMD, and IBM haven't really tried. This chip isn't a normal microprocessor, it's an "array processor", meaning that it's designed specifically to execute operations on large floating-point arrays. The market for such processors has been rather small in the past. It's hard to write code for them with traditional programming tools, and they're only applicable to a restricted class of problems. Most high-end supercomputing these days uses massively parallel sets of conventional processors. So there hasn't been an opportunity large enough to be worthy of the attention of the big guys.

      Once you take a closer look at ClearSpeed's PR, this thing seems interesting, but not worth making a huge fuss about. The ClearSpeed press release shows that 25.6 GFLOPS is the peak performance. That's the absolute maximum the chip can do for the easiest kind of problem, which is why some people call peak performance "guaranteed never to exceed". With 64 processing elements, 25.6 GFLOPS means 400 MFLOPS per element. According to ClearSpeed's Microprocessor Forum presentation (warning: this is a slow download), the chip runs at 200 MHz, so this implies each element can do up to 2 floating-point operations per cycle. In fact, ClearSpeed's press release claims only "more than twice the processing speed of competitive products". So this isn't exactly an earth-shattering advance.

      The power usage figure is more eye-opening, of course. Here ClearSpeed's press release claims they are twenty times more efficient than their competitors. But it turns out they're comparing the thing with conventional high-speed processors like the Pentium 4, hardly paragons of power efficiency. And ClearSpeed's presentation says they are using IBM's 0.13-micron process, so IBM should get some credit for providing the semiconductor technology to make this possible. 2-3 watts for modern 200 MHz logic using that kind of process doesn't sound outside the realm of possibility. (Remember, this isn't a conventional superscalar processor which requires huge amounts of logic for instruction issuing and control. This thing is mostly a simple mass of ALUs.)

      In any case, ClearCase's presentation says they'll be sampling in the 4th quarter of 2003, so they'll have to demonstrate real hardware soon.

    5. Re:echoes of Transmeta by Anonymous Coward · · Score: 0

      Ditzel was going on about post-RISC architecture burying RISC back in 1996 or so. He wouldn't get specific about what post-RISC meant back then, to avoid losing the "stealth-mode" hype.

      First chips hit the wire, and the architecture was interesting, but performance wasn't there. Suddenly all the marketing materials were modified to high light power consumption.

      No one designs a new chip unless they firmly believe that it will be a world-beater; it's just too expensive. Obviously Transmeta's chips are not, and have never been, world-beaters.

  16. Diamonds... by Anonymous Coward · · Score: 0

    I can't wait until they come out with diamond based processors. They can provide peformance in the 200Ghz range, and now that that fabrication methods to produce flawless diamonds has been perfected. Read the article in Wired a few months ago for more info about this.

  17. Who's got the thermodynamics degree? by Sean80 · · Score: 1
    Way back when, when I was reading that classic crytographic book whose name I can't remember by that guy whose name I also can't remember, he was saying that a 256 bit symmetric key would be practically unbreakable since you'd need the total energy output of the Sun for a year to make that many phase changes in the computer.

    So, in that kind of light, can anybody here with thermodynamic knowledge compare the total number of phase changes required for this speed versus the energy which has been claimed it needs?

    1. Re:Who's got the thermodynamics degree? by hobit · · Score: 1

      Don't have the numbers, but I do know we aren't even close to them. It was something around 10^-18 Watts for a 32 bit add (I could be off by a huge amount but it was very very small). I remember thinking "that isn't going to be important for a long long time."

      I couldn't find any number on the web.

      Mark

      --
      As Nietsche famously said, "If you stare too long into the Abyss, 1d4 Tanar'ri of random type will attack you."
    2. Re:Who's got the thermodynamics degree? by InfiniteWisdom · · Score: 1

      THe classic Cryptograpy book you're talking about is Applied Cryptography by Bruce Schneier.

      I believe he was talking about 4096 bit RSA, not 256-bit symmetric keys, but I could be wrong.

      Anyway... he'd assumed that you could do one operation with one electron-volt of energy, which is about 1.6 * 10^-19 joules.

      This processor can do 25*10^9 operations with 3 W of energy. Thats 3 joules of energy for 25G operations or about 1.2 * 10^-10.

      So to come anywhere near 1 eV per operation requires a further reduction in power by a factor of a billion

    3. Re:Who's got the thermodynamics degree? by geekoid · · Score: 1

      enough to fill the library of congress!

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    4. Re:Who's got the thermodynamics degree? by You're+All+Wrong · · Score: 1

      You're maybe thinking of Ron Rivest, of Rivest Shamir Aldleman (RSA) fame. The magic words to google for are "40 quadrillion years", if I remember correctly. (Sometimes numbers just stick in your head when they're so embarassingly wrong!)

      YAW.

      --
      Your head of state is a corrupt weasel, I hope you're happy.
  18. Admitted Vapour ware... by jm91509 · · Score: 1

    And its still an article?

    Slow news day I guess...

    1. Re:Admitted Vapour ware... by Josuah · · Score: 1

      The details of the CS301 are supposed to be given at Microprocessor Forum 2003, this week. I think this would indicate a little more substance to the chip. Especially since ClearSpeed wil have to defend their claims to experts in the industry.

  19. It's a math coprocessor by Anonymous Coward · · Score: 0

    It looks like a math coprocessor, to be used in combination with a regular CPU. Remember how, back in the day, math coprocessors were an option for your Macintosh or 286 PC? Kind of like that.

  20. Good news for OpenOffice.org users. by Anonymous Coward · · Score: 0

    No more having to wait a minute for it to load!

    1. Re:Good news for OpenOffice.org users. by Anonymous Coward · · Score: 0

      Hey, OpenOffice.org Ver 1.1 is out, It's way faster, loading in only 50 secs now!

    2. Re:Good news for OpenOffice.org users. by Anonymous Coward · · Score: 0

      If you just bought a decent computer with all the money you saved by not buying MS, you might have the uber-elite 23 second load time.

  21. sounds familiar by Anonymous Coward · · Score: 0

    now they just need to hire Linus Torvalds and we'll be good to go!

  22. Great, a NEW kind of vaporware by Anonymous Coward · · Score: 0

    Not just any old vaporware, HARD vaporware!

  23. Re:Press Release: ClearSpeed web site is getting s by herrvinny · · Score: 0, Redundant

    Sorry, got a 500 Server error when posting that, and didn't Plain Text it.

    ClearSpeed Announces CS301 Multi-Threaded Array Processor to Deliver High Performance Computing and Power Efficiency
    October 14th, 2003

    Highest Floating Point Performance Chip Executes Complex Mathematical Applications in a Fraction of the Power and Time.

    SAN JOSE, Calif., October 14th, 2003 -- ClearSpeed Technology, a leading provider of high performance, low power chip-based solutions, today announced the ClearSpeed CS301, a multi-threaded array processor that enables dramatic improvements in performance and power consumption for intensive floating point applications. At over 25 GFLOPS peak performance, the new chip provides more than twice the processing speed of competitive products. At 10 GFLOPS per Watt, power consumption is also twenty times more efficient. As a result, the CS301 delivers up to a ninety percent reduction in purchase price and running costs, making high performance computing affordable and available to companies of all sizes.

    "With conventional processor design, increasing performance has tended to come with real penalties in power consumption and heat dissipation, to the point where computing cannot keep up with the demands of today's emerging applications and rapidly increasing volumes of data," said Tom Beese, CEO of ClearSpeed Technology. "The CS301 is designed specifically to meet those needs with high performance, power efficiency and full programmability in C
    combined into a single chip. The CS301 is the first in a family of ClearSpeed microprocessors that we believe will challenge present day thinking by creating a world where scientists, bio-informaticians, engineers and content creators alike can have access to high performance computing anywhere, anytime."

    The CS301 is based on a multi-threaded array processing (MTAP) architecture and includes 64 processing elements, 384 Kbytes of on-chip SRAM and I/O ports interconnecting through ClearSpeed's ClearConnect(R) bus. Each processing element has its own floating point units, local memory and I/O capability, making the CS301 ideally suited for applications which have high processing or bandwidth requirements. The ClearConnect bus is a packet switched network that provides high bandwidth and low power consumption, supporting multiple concurrent transfers giving even higher aggregate bandwidth.

    As a result, complex mathematically based applications such as, computational biology and drug discovery, digital content creation, nanotechnology development, scientific research and financial modelling can now be executed in a fraction of the time.

    "We are gratified to see the immediate high level of interest displayed by OEM's in the overall system improvements enabled by the CS301," said Mike Calise, president of ClearSpeed U.S. "The dual benefit of performance and efficiency is empowering companies to accelerate existing applications as well as inspiring them to explore new applications that were previously inaccessible."

    The CS301 can serve either as a co-processor alongside an Intel or AMD CPU within a high performance workstation, blade server or cluster configuration, or as a standalone processor for embedded DSP applications like radar pulse compression or image processing. In applications where the CS301 is acting as a co-processor, dynamic libraries offload an application's inner loops to the CS301. Although these inner loops only make up a small portion of the source code, these loops are responsible for the vast majority of the application's running time. By offloading the inner loops, the CS301 can bypass the traditional bottleneck caused by a CPU's limited mathematical capability, executing the core of the application more than twice as fast as anything else in the marketplace.

    "To deliver such high levels of performance with full programmability and outstanding gains in power efficiency is a very significant achievement," said Chris Piercy, president and chairman of the Northern Californ

  24. Pretty good but not amazing..... by dracken · · Score: 1

    ....The announcement might be describing vaporware but 3W / 25 Gflops isnt too amazing to definitely indicate vaporware. ARM VFP9-S co-processor is about 0.4 Gflops for about 0.8 watt (about 1.5 gflops for 3 watt). Keep in mind that it was introduced in 2001. 4 years and 15 fold improvement seems possible.....

    1. Re:Pretty good but not amazing..... by Carnildo · · Score: 1

      And modern desktop CPUs are what -- 3-6 Gflops for 80+ watts?

      --
      "They redundantly repeated themselves over and over again incessantly without end ad infinitum" -- ibid.
    2. Re:Pretty good but not amazing..... by platypus · · Score: 1

      Go, look up for TMS320C6701 on google. This is a DSP which gets to 1 GFLOP at 2 Watts (I think).

      A DSP is just not comparable to a desktop CPU, and I didn't see anything which suggests this thing can do more.

  25. this kind of thing is un-heard of by compubomb · · Score: 0

    So they now think we can go from the NASA Space shuttle to StarTrek Next generations in terms of computational speed, i find this very hard to believe that they can bring us this far in technology in such a short jump. To me it's like when they build the first jet, now we go from builting the first jet to reaching the speed of a bullet via the SR71 Blackbird. Maybe i haven't done enough research into their methodologies in this technology, but i'm very doubtful.

    1. Re:this kind of thing is un-heard of by rkww · · Score: 1
      Hmm, the first jet was the German He-178 (first flew Aug. 1939); the SR71 first flew Dec. 1964. I make that 25 years.

      What were vector processors like 25 years ago? The Cray-1 (1977) peaked at 240 MFlops.

      Admittedly the Cray needed its own power station to run; but speedwise, a one-hundred fold increase in 25 years seems quite reasonable.

    2. Re:this kind of thing is un-heard of by ericspinder · · Score: 1

      the first jet planes were flown in the mid 40's the sound barrier was broken in 1947, the SR-71 first flew in 1959. The 747 was a product of the 60's. Not a great deal of time.

      However I am taking a "wait and see" attitude about chips which haven't even been produced yet.

      --
      The grass is only greener, if you don't take care of your own lawn.
  26. 25 GFLOPs of performance and 2 x 1.6 GB/sec bus by baseinfinity · · Score: 2, Insightful

    ... best case, and 128 K of cache.

    Unless this thing is working on highly specialized data sets, it doesn't matter how much data the core can mow through if it can't get the data fast enough. Why do you think AMD and Intel are so obsessed with their memory interfaces? There's little difference between the Athlon and the Athlon 64 besides large data width and fancy memory / SMP interfaces.

    1. Re:25 GFLOPs of performance and 2 x 1.6 GB/sec bus by LxDengar · · Score: 1

      But there must be thousands of applications for this chip which can be sent in small amounts of data. Considering this chip is focusing on discreet floating point math, and contains its own internal registers (64 of them), I would think that there are many calculations that could be complete done on chip - data in, answer out.

    2. Re:25 GFLOPs of performance and 2 x 1.6 GB/sec bus by baseinfinity · · Score: 1

      data in, data out is precisely what will kill this chip. What this thing needs to work in giant parallelism is a whole bunch of stuff (like arrays, matrix, etc...) that can be broken down "blocked" such that the amortized run time spends far more time in the 128k than in the whole dataset. Also, these things don't run at full speed unless it's chewing on something 4k in size for a pretty long time relative to the 128k time, so it's pretty tight for anything besides specifically engineered algorithms. This is what their front page says, and chips like this arn't that revolutionary, although neat. Quoting those numbers in the context of PC applications (a PC centric website) is rather deceptive.

      The thing is specified to the layout level based on their pictures, so that's pretty much enough to verify that it'll work in silicon. Computer CAD tools extract how the circuit work based on the masks, and use tools like spice that can model these things to rediculous accuracy (24th or so order effects).

    3. Re:25 GFLOPs of performance and 2 x 1.6 GB/sec bus by LxDengar · · Score: 1

      I agree that its deceptive putting this on a PC centric website. However, I still think there are applications that could be wildly sucessful for this type of chip. Specifically, I wonder if 3D rendering - which can spend a great deal time within a 128K set, might not be greatly effected by such a chip. I could easily be wrong, but it seems that discreet calulations of MC/QMC sampling would be one of those algorithms that this chip could be useful for.

    4. Re:25 GFLOPs of performance and 2 x 1.6 GB/sec bus by statusbar · · Score: 1
      I know I would love to have this chip for digital audio processing. Currently the system I work on expands to up to 32 DSP's, but that is still not enough for some customers.

      The individual throughput this chip has is good enough for each unit, and the more processing we can do on the audio, the better. We've already looked at this product, however, and determined that until it is available and selling well for 1 year we can't trust the company to not disappear.

      Because of that, we usually end up with a design that has a slower DSP with a longer shelf life and from a company such as Motorola or Texas Instruments that documents errata's very well...

      --jeff++

      --
      ipv6 is my vpn
  27. What about IO, RAM? by king_ramen · · Score: 1

    it would be quite difficult to get high performance IO through a PCMCIA bus. I can see its use for large matrix computations but not generally useful as an "OS" CPU.

    Unless they have some monster NUMA architecture RAM access will also suffer dramatically. RAM contention on a 64-way system would be AWFUL and there is NO WAY in hell they would access system RAM through PCMCIA. It is certainly an interesting idea but I do not see a way for this technology to be useful without lots of changes to existing software and program design. This is NOT like adding 64 CPUs to your home machine.

    --
    ----- Refactoring is the reason why man does not mistake himself for a god.
  28. It IS going to be used, real soon too!! by t0ny · · Score: 1

    I just heard it is going to be used in the Infinium Labs Phantom Console!

    --

    Manipulate the moderator system! Mod someone as "overrated" today.

  29. The other key words by pestilence4hr · · Score: 1
    Strauss warned that writing software for the chip's complex architecture might be a stumbling block, but the company has assured him that its compiler makes it easy to program.
    Putting aside for a second the fact that their claim is not backed by a working prototype, this sentence should make you weary that even if they did you would only see this performance on a benchmark, and not on any real applications.

    If, on the other hand, they do in fact have a chip that is not too hard to program and can pound crunch numbers that quickly, it will certainly bring with it a revolution in high performance computing, and probably change the world as we know it.
    1. Re:The other key words by Anonymous Coward · · Score: 0

      Change the world one $16G at a time...

  30. 64 parallel FPUs by flaming-opus · · Score: 1

    sounds like it might make a very nice DSP chip. However lots of simple, non-contentious, non-overlapping floatingpoint computation is really not a problem most desktop or notebook users are struggling with. In fact it's really not a problem that super-computers are struggling with. There have been pci cards with power-pc chips on them for years. Curiously enough these cards haven't ended up being used in many top500 supercomputers.

    A fast low-voltage DSP chip is interesting for a lot of applications, but not in the way that this press release describes the product.

    1. Re:64 parallel FPUs by Sneftel · · Score: 1

      sounds like it might make a very nice DSP chip. However lots of simple, non-contentious, non-overlapping floatingpoint computation is really not a problem most desktop or notebook users are struggling with.
      You've obviously never done graphics or sound pipeline programming.

      --
      The opinions stated herein do not necessarily represent those of anybody at all. Deal with it.
  31. possible synergetic approach? by hkfczrqj · · Score: 1

    you can actually be pretty sure of both a chips performance and reliability just from simulations these days

    yes... its true... imagine: they build the first bunch of prototypes, and with all that power, even if they dont achieve the 25 Gflops at that stage, those prototypes can be used to simulate and tune the second generation of chips and so on ... just a dream/thought

    Cheers...


    PD: I was tempted to say 'imagine a beowulf cluster of these'... yikes... i said it

  32. I don't know... by GoNINzo · · Score: 1
    I don't think it's completely is synonymous.

    Knight Rider, a shadowy flight into the dangerous world of a man who will soon be in prototype.

    --
    Gonzo Granzeau
    "Nothing the god of biomechanics wouldn't let you into heaven for.." -Roy Batty
    1. Re:I don't know... by insanecarbonbasedlif · · Score: 1

      Our favorite scottish dinosaur will soon be in prototype.

      --
      Just because I doubt myself does not mean I find your position compelling.
  33. Let me guess by GreenCrackBaby · · Score: 0, Redundant

    It's going to power the phantom.

    --

    "The market alone cannot provide sufficient constraints on corporation's penchant to cause harm." -- Joel Bakan
  34. Speed of access to memory will be limiting factor. by shocking · · Score: 1

    As always, these things will have to be fed data at a high rate in order to be completely utilised. I don't see current PC memory subsystems being able to do it, and as for a PCMCIA card, well forget it!
    You'll probably only ever see a tiny fraction of their claimed performance.

  35. The race is on........ by jube_fl · · Score: 1

    to see if clearspeed can develop a CPU that can survive /.

  36. Re:Press Release: ClearSpeed web site is getting s by omarques · · Score: 0

    In other news, slashdot.org announces that now allows the use of the html tag "
    " in the comments and introduced a "Preview" button that shows the user how the comment will appear before the actual posting.

  37. Great idea! Vector processor chip . by zymano · · Score: 1

    Graphics cards used for general computing

    The chip is a vector processor like major 3d graphics cards.

    It's a great idea and something that people on slash ,including me,have talked about everytime a powerful graphics card has come out.

    The compiler will be key.

    Anything that can steal the thunder of INTEL & AMD is fine by me. Their processors are heat emitting loads of crap. If you noticed the amount of wattage that this new chip uses , it's because of the architecture and not fancy transistor technology.

    Now you all know that Intel , Sun, Ibm, and Amd do not make efficient processors.

  38. Pricing confusion by Phiz · · Score: 1

    There seems to be wild confusion from ClearSpeed about pricing. In the New Scientist article they quote a starting price of $16,500/chip, but in the Wired article they state you could get a PC with 24 of these chips in it for $25,000.

    But either of those prices are pretty high for your average home user. Hopefully someone can give them strong competition without violating their patents.

  39. Overinflated claims? by Carnildo · · Score: 1

    From their press release:
    ClearSpeed Technology, a leading provider of high performance, low power chip-based solutions, today announced the ClearSpeed CS301, a multi-threaded array processor that enables dramatic improvements in performance and power consumption for intensive floating point applications.

    But what are these other "high-performance, low-power solutions"? Looking at their web site, the CS301 is the company's only product.

    --
    "They redundantly repeated themselves over and over again incessantly without end ad infinitum" -- ibid.
  40. Thing is by Sycraft-fu · · Score: 2, Insightful

    You can make theoritical things on a VHDL simulator that you'll never be able to make into actual silicon. The real magic of companies like Intel, IBM, AMD, etc isn't designing an uber powerful chip, it's designing an uber powerful chip that can actually be realizied in silicon, and at a cost that makes it worth selling.

    There has been more than one firm that has suffered from simulator disease. They get all caught up in making an awesome, ass-kicking theoritical design that will eclipse everything and everybody that they forget about physical limits of actual silicon. They then find, when they try to really implement the chip, it just can't be done.

    1. Re:Thing is by psyconaut · · Score: 1

      Yup, valid points....

      I'm lucky if I can get my VHDL for FPGA and CPLD designs working to start with, let alone create silicon I can't actually pseudo-fab ;-)

      -psy

  41. Maspar by hobit · · Score: 4, Interesting
    For the last 10 years or so I've been thinking about how to do just this. What I'm 99% sure they are doing is SIMD on a massive scale. The Maspar (and especially the Maspar-2) were computers along this line.

    The basic idea is to have lots of "processing elements" that are basically ALUs with a bit of additional smarts (for branches mainly). Each PE has its own memory. The main processor (probably not the PC CPU) tells each PE what to do. Thus the Single Instruction Multiple Data. Things are a bit more complex then this (branches, pointers, and a few other things cause some problems.) but not too much worse. PE to PE communication is also interesting (the Maspar was a toroid as I recall).

    The two basic problems with this type of a design are:

    • You either need a special programming language (and someone who understands the language and understands the problem really well) or a very very good compiler to get anything out of it.
    • The application range is quite limited. Not as limited as supercomputer people seem to think (I mean I've written genetic algorithm code for the Maspar that scales wonderfully.) but still quite limited.

    There are also a huge number of other problems. Caches don't generally do a darn thing for massive SIMD computers (if one processing element misses, they all do.) The memory usually has two types of pointers (one to the PE memory and one to global memory). I may contact the company to see if they want to hire a short-term consultant. hummm.... Have PhD will travel?

    --
    As Nietsche famously said, "If you stare too long into the Abyss, 1d4 Tanar'ri of random type will attack you."
    1. Re:Maspar by alakon · · Score: 1

      What a great post! This type of post is the only reason I read Slashdot.

    2. Re:Maspar by hughperkins · · Score: 1

      Today's applications are optimized for single-processor environments, but there's no particular reason why many parts of applications couldn't be parellelized. You only have to look at Word, which is running real-time spell-checking, grammar-checking, paragraph alignment and so on (at least 3 independent threads straight off, and there's no reason why you couldn't have multiple threads for the spell-checking for example). As far as games, there are a LOT of things going on at the same time. It would be child's play to make each particular module (AI, rendering, physics...) run as it's own thread; and just a little harder to run multiple threads for each module (eg use one thread for each unit in a strategy game). Parallelization is the future. It's only a question of time before it becomes the norm rather than the exception. Here's to the day when we have 1 Gig of processors on our main-boards!

    3. Re:Maspar by hobit · · Score: 1

      thanks.

      Mark

      --
      As Nietsche famously said, "If you stare too long into the Abyss, 1d4 Tanar'ri of random type will attack you."
  42. Join #ranma on irc.darkaxis.com by pope1 · · Score: 1

    For a rousing discussion of what applications
    this chip could possibly have. Its being sold as
    a CO-processor, but what kind of bus will it be used
    on? It would seem that all variants of PCI/SBUS would choke.. we really need more information.

    --
    /* * pope1 */
  43. Re:Press Release: ClearSpeed web site is getting s by Anonymous Coward · · Score: 0

    The CS301 can serve either as a co-processor alongside an Intel or AMD CPU within a high performance workstation, blade server or cluster configuration, or as a standalone processor for embedded DSP applications like radar pulse compression or image processing.

    so there, its a dsp. lets do some simple fictional estimates ..

    strip a 3ghz p3 down to the XMMS core, make the ops 1 cycle/instruction, restructure the pipeline to loop only (no branches) ... lets see

    3ghz minus some overhead.. lets say 2ghz effectivly, 2ghz*4 operands per cycle.. 8 gflops, and this strange beast wouldnt have been designed for it from the ground up anyway.

    move along, nothing special to see here, dsps like that were able to encode/decode mpeg1 constrained in realtime when 386s were state of the art.

  44. Hyperlink for Graphic Cards used for computing by zymano · · Score: 1
  45. unfortunately for them... by dustmote · · Score: 2, Funny

    unfortunately for them, the proof is too big for them to fit in this margin...

    --


    -1, "1337" speak
    1. Re:unfortunately for them... by Anonymous Coward · · Score: 0

      Right. Last time sombody said that it took only 350 years to proof it (again?). Looks as if we are looking at the same timscale here.

  46. kind of slow? by Anonymous Coward · · Score: 0

    it doesn't appear that the felonious wons are planning to delete themselves/surrender to the light? how long until the planet/population rescue is complete, & we're free (from the hostage taking scams of the corepirate nazis) again?

  47. If I have the physics right... by Spamalamadingdong · · Score: 3, Interesting

    ... parallel processing units may perform a lot more ops/sec/watt than one single unit. The speed of a processor depends on the time required to charge and discharge the stray capacitances of its connects, and the impedance of its transistors increases as the drive voltage decreases so the RC time constant goes up and the speed goes down. However, the energy required to charge the capacitance scales as voltage squared, so by accepting a hit on the speed (due to the voltage drop) you can do the same calculation with less energy. Clearspeed seems to be taking parallelism to the sub-processor level in order to reduce heat loads; their operations may take longer to complete, but they can do more operations in the same time as long as the code can use the processors in parallel. Thus the emphasis on "multi-threaded", because it wouldn't work otherwise.

    1. Re:If I have the physics right... by John+Courtland · · Score: 1

      Sounds a lot like the "Cell" architecture that the next PlayStation will have. IIRC they wanted an order of magnitude or two speed increase before releasing it.

      --
      Slashdot is proof that Sturgeon's Law applies to mankind.
    2. Re:If I have the physics right... by Anonymous Coward · · Score: 0

      Sounds like the end result of the SOC hype that TSMC was pushing about two years back.

    3. Re:If I have the physics right... by arkanes · · Score: 1

      You'd need a compiler (and an OS, for that matter) that can optimize things into parrallel actions to get any performance out of this thing, then. I wonder if they're planning on having one of those, and if they're going to release it for free or not - they'll probably have a hard time getting any sort of bite into the market without one.

    4. Re:If I have the physics right... by Urkki · · Score: 1

      So basically any server would benefit greatly as they have multiple processes in execution at one time, even with existing software. But most desktop applications would only be slower without major re-design of entire software to heavily utilize multpile threads all the time and for everything.

    5. Re:If I have the physics right... by Zzootnik · · Score: 1

      That sounds about right...That Also sounds like what BeOS was like...Which is why it ran so damn fast on puny hardware... All the organization work was done at the WAY-TO-THE-FRONT end...in coding...

      It did work pretty well, though...

      --
      Sig currently under construction. Mind the gap....
    6. Re:If I have the physics right... by igodard · · Score: 1

      The ClearSpeed processor is actually a co-processor (see the ClearSpeed site for details). A co-processor is a special purpose functional unit that you bolt onto the side of a regular processor that runs the actual program. The ClearSpeed chip is a floating-point banger; other co-processors do other things. To use a co-processor, the main program wanders along until it hits something that the c-p can do - say it wants a 1000 point FFT done (ClearSpeed would be good at that). It loads the data into the c-p, pushes the button, and then idles until the c-p is done, usually signaled by an interrupt (boxes vary). It then unloads the result and continues. Making some reasonable guesses (their site has little tech detail) I'm guessing that the c-p has pipelined floating point but no scoreboarding to keep things straight between instructions. An easy way to get more throughput in such a case is to give every nth instruction to a different thread of control i.e. multi threading on a cycle by cycle basis, rather than on an interrupt by interrupt basis (c-ps don't have interrupts). This keeps the multipliers busy at the cost of requiring software to keep things straight. Speaking of software, to use a c-p you have to rewrite (and often redesign) your app so as to put in the necessary handshaking. This isn't just a matter of recompiling - think assembler level timing issues, even if you are writing in nominal C. C-ps do 10 line loops (real fast), not applications. You will never run gcc, or Apache, or Oracle, or Hello World on a ClearSpeed - but if you need a pile of FFTs and are willing to rethink/rewrite your code then it should brush your teeth. C-ps are a very crowded market. Most are grid, stream, or vector machines, most often used for embedded codes like your friendly local MRI machine. ClearSpeed looks to be a reasonably workmanlike design, but no breakthrough IMO. Ivan

    7. Re:If I have the physics right... by Anonymous Coward · · Score: 0
      If you compare BeOS to OSX you will find there is no comparisson. BeOS is fast, lean, new, and nimble. OSX is bloated, slow, and creaky. BeOS is a clean modern OS built from the ground up to support multi-threading and high performance multimedia. OSX is an old street walker with years of wear who hides her "experience" beneath layers of Maybeline. OSX supports a paint job that would do Earl Scheib proud.

      When you need a 3/4 GHz processor just to make a GUI respond to your clicks, something is wrong--OSX. BeOS on the other hand is spritely and works magnificently well on commodity hardware, a achievement beyond the realistic hopes of even the most zealous of Apple fanatics.

  48. Doesn't the Phantom... by Xaoswolf · · Score: 1

    Aren't these used in the Phantom Game console?

  49. Works for Apple by zymano · · Score: 1


    <a href="http://www.gpgpu.org/">Graphics cards used for general computing</a>

    Apple = altivec =vector processor.

    1. Re:Works for Apple by Anonymous Coward · · Score: 0

      By the way, links work much better if you don't post them as extrans...

    2. Re:Works for Apple by You're+All+Wrong · · Score: 1

      Altivec's SIMD, which is not vector.

      YAW.

      --
      Your head of state is a corrupt weasel, I hope you're happy.
  50. Re:I remember the quotes of 256KB of ram and 10MB( by mangu · · Score: 2, Insightful
    But really, what requires 25G flops?
    Maybe if we decide to model "Life, the Universe and Everything?"


    No, just modelling the surf breaking on a beach would need several beowulf clusters of these chips. Or the flow of gas through an airplane turbine. Or the weather in a small region of planet Earth. There are many simulations of non-linear systems whose simulation require a lot more CPU power than is likely to be available on the near future.

    And what about the human brain itself? Our current computers are still so far from the data processing capability in our brains that many people doubt it will be possible at all. Assume we have about 100 billion (10^11) neurons, and each neuron has about a thousand synapses. Assume the simulation of each synapse would need one hundred floating point operations per second. Therefore, to simulate the operation of a typical human brain one would need ten million Gflops, equivalent to a Beowulf cluster of 400000 of these chips. That's what it'll take to do the AI in Duke Nukem Whenever...

  51. Hey, a-hole(+) by Mycroft_514 · · Score: 1

    The 256K limit was in DOS 1.0 and the quoted person was Bill Gates. And I am that old. The first serious machine I worked with had 16K of usable memory.

    1. Re:Hey, a-hole(+) by xNoLaNx · · Score: 1

      The quote was still supposed to be 640k you guys. And though I doubt Gates said it, it really doesn't matter.

    2. Re:Hey, a-hole(+) by Anonymous Coward · · Score: 0

      Hey, moron, go kill yourself already. If you're going to insult someone's intelligence, the least you could do is try not to look like a fucking dipshit yourself.

      The average IQ is 100 and will always be 100, because that's the definition of an average IQ within a society. The meaning of a 100 IQ in this country might improve slightly when you die, but the number won't change.

      You know what the average IQ is in Japan? 100. Canada? 100. Japan's average IQ on our scale, however, is 107. Canada's? 104.

      It's your fault, and the fault of people like you. You ignorant fucks. You're dragging us all down.

  52. Duke Nukem 4-0 - balder, fatter by Anonymous Coward · · Score: 0
    Garland, TX --

    Looks like middle age hasn't been kind to action hero Duke Nukem. In a prerelease press preview, presented by Joe Siegler, the studly hero is bald with a huge beer-gut.

    "We wanted to flesh out the character of Duke", Siegler said, "we want to make him more a character that his fans can directly relate to".

    In the new title, Duke is in a custody dispute with his ex-wife. Apparently, since he lost his job, he's in arrears on his child-support payments. When his (alien) wife kidnaps their kids and leaves for her mothers' on Alogl II, it's butt-kicking time!"

  53. BitBoys anyone? by vasqzr · · Score: 1

    http://techzone.pcvsconsole.com/news.php?tzd=1973

  54. faster than you might bulleave by Anonymous Coward · · Score: 0

    that's right. as the lights come up, you might note the creator's interaction with yOUR environment, as well as interventions on the planet/population life0cide perpetrated by the phonIE felons/walking dead.

    what happens next, is ALL about yOUR intentions/motives/behaviours.

    you can pretend some more if you want/need to, but that doesn't really help. see you there.

  55. Two words by geekoid · · Score: 1

    Stock. Price.

    --
    The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    1. Re:Two words by jerde · · Score: 1

      Stock. Price.

      Except that they're privately owned.

      - Peter

      --
      INsigNIFICANT
    2. Re:Two words by CrayHill · · Score: 1

      Two different words then: venture capital

  56. Old US GOVT Black Ops Technology by SirMack · · Score: 1

    This technology has been around for years, US GOVERNMENT individuals have used this to crack codes and such. The add on expansion card pops into a normal desktop slot or a laptop slot and gives them Super Computer Power. Although it was about half as powerful as these, but its been around... ;)

    1. Re:Old US GOVT Black Ops Technology by KaiserZoze_860 · · Score: 2, Funny

      Yes, I see...

      Ordinary x86 by day, but SuperComputer by night. I wonder what the theme song will be...

  57. Onyxruby's law by onyxruby · · Score: 3, Funny

    Onyxruby's law:

    The amount of hype per inch produced by marketing doubles every 18 months.

    With apologies to Moore ;) /me reminded of when apple tried claiming the imac as supercomputer.

    1. Re:Onyxruby's law by Jeff+DeMaagd · · Score: 1

      Applie didn't try claiming the iMac to be a supercomputer that I remember. They did claim that for the PowerMac G4 at 500MHz, their claim was a super computer was > 1GFLOP. That performance rating was not too unrealistic as Alpha chips two years before that clocked around 0.95 GFlops without SIMD, it isn't hard to exceed that with SIMD.

  58. This will be great for gaming! by Anonymous Coward · · Score: 0

    Microsoft's next game engine is said to be based on Excel.

  59. clear speed array processor by Anonymous Coward · · Score: 0

    Not actually all that impressive. Texas Memory Systems has been selling comparable chips for several years. There are others.

    http://www.superdsp.com/products/tm44.asp

  60. Chip Description -Multi-Threaded Array Processor by zymano · · Score: 1

    Multi-Threaded Array Processor
    ClearSpeed's multi-threaded array processor can be applied to any high performance computing application where large volumes of data can be processed in parallel.

    The performance of conventional processors has been primarily driven by higher clock speeds. This inevitably means higher power dissipation and the resulting problems of system cost and reliability. These designs can no longer meet the requirements of emerging applications or keep up with the demands presented by the explosive growth of data volumes.

    The multi-threaded array processor architecture provides an exceptionally powerful and scalable processing solution, based on an array of tens to thousands of Processing Elements (PEs). Each PE has its own local memory and I/O capability, making the architecture ideally suited for applications which have high processing and/or bandwidth requirements. The inherently scalable array architecture is also highly area and power efficient.

    The processor can be used either as a co-processor sitting alongside an Intel or AMD CPU within a high performance workstation, blade server or cluster configuration, or as a standalone processor for embedded DSP applications such as radar pulse compression or image processing. In applications where it is acting as a co-processor, dynamic libraries off-load an application's inner loops to the processor. Although these inner loops only make up a small portion of the source code, these loops are responsible for the vast majority of the application's running time.

  61. Burning some karma, here.... by mcrbids · · Score: 0, Troll

    Imagine a beowulf cluster of these things!

    Hasta la vista, Karma!

    --
    I have no problem with your religion until you decide it's reason to deprive others of the truth.
  62. not very impressive by dutky · · Score: 1
    The chip will have 64 threads of execution, which means that each thread only needs to deliver about 400 MFLOPS. Since a standard floating point instruction has a latency (from issue to retire) of about 5 or 6 cycles, this is easily achievable in current technology (2-2.5 GHz system clock) without even using pipelining. If the thread units are pipelined, you can expect the clock to be in 400-800 MHz range.

    When they have a device that delivers 200 GFLOPS with 64 threads, then I'll be interested.

    1. Re:not very impressive by kinnell · · Score: 1

      According to this presentation, it runs at 200MHz. It's refreshing to see someone taking this approach, rather than insane clock frequency/power dissipation. I'll be impressed, though, if real application software can use it efficiently.

      --
      If I seem short sighted, it is because I stand on the shoulders of midgets
  63. Computation bottleneck by vlad_petric · · Score: 1
    Currently, the main computational bottleneck is memory speed & bandwidth. Processors - it's still relatively easy to stay on Moore's curve. But memory speed improves by only a couple of percents per year. Yes, you can throw caches at them (it's not uncommon these days to have 6M or even 8M on a server chip), but those caches are very unlikely to consume 3W ...

    You can certainly throw a bunch of ALUs on a grid (it's not so difficult) and claim GIPS, GFLOPS or whatever ... but you won't get similar speedups on real-life benchmarks, because no program will be able to use them all at the same time (btw, only professional bullshiters - read marketing people - use MIPS/GIPS/GFLOPS these days. The computer architecture has realized for a LOONG time that they're not a good indicator of performance).

    --

    The Raven

  64. We NEED it. Some uses - Biotech, Aerospace.... by zymano · · Score: 1

    There are too many uses for large scale computing which is plagued by huge power hungry systems.

  65. "enables nanotech research" by irritating+environme · · Score: 1

    technewsworld had this as their last paragraph. If anything indicates the complete bullshit smell of this announcement, attaching it to a similarly wildyly overhyped fad tech would be it.

    I bet it might hit 25 gigaflops with an "optimized demonstration algorithm" with no cache misses, no branch misses, and heck, all the data is in the registers at all times, so it doesn't even wait for the cache.

    --


    Hey, I'm just your average shit and piss factory.
    1. Re:"enables nanotech research" by elvum · · Score: 1

      I'm also more than a little doubtful that this wonder-processor is going to hit anything like 25Gflops when separated from its memory by the PCI bus, let alone when sharing that bus with five of its kind (as suggested in the article).

      Let's see - a 64-bit 66Mhz PCI bus (the maximum the PCI standard allows and, well, not exactly common on PC motherboards, shall we say?) will supply a maximum of 133megawords per second. So a 25Gflop processor will receive one word of data from the system memory every four *hundred* clock cycles. And that's assuming the data is suitable for transferring in burst mode. Once the processor starts requesting words from non-contiguous parts of the memory that figure plummets still further, thanks to the latency caused by passing all your memory access requests through the motherboard chipset.

      Other PCI devices will also reduce the performance; notably other coprocessor boards will divide it equally between them, cancelling out any advantage of having more than one. Maybe you can pull something back with cacheing, but only in rare circumstances are you going to see anything approaching *one tenth* of a gigaflop, let alone 25.

      Maybe in a very very few specialised applications there might be some point to this plan. (You could probably write a prime factoring algorithm and run it from a small amount of on-card cache memory, for example.)

  66. A point to consider by beldraen · · Score: 1

    Getting high performance out of a chip really isn't that difficult (I know I'm understating a lot of the real knowledge underneath); however, the trick is doing it reliably. An Intel or AMD processor must be able change from a wide variety of states (fixed to floating to OS commands) and be able to recover from any invalid state, so a lot of the chip is tied up in ensuring consistant operations. As I gather, they're just basically making an optimized floating point coprocessor (can you say 387? I knew you could) and I can bet that feeding it bad instructions would do bad things. This is why videocards are capable doing such tremendous amounts of calculations far and above CPU's--consistant types of instructions and the attitude "screw up and you lock up the video card, so don't do that."

    --
    Bel, the mostly sane.. "Of course I can't see anything! I'm standing on the shoulders of idiots." -- Me
  67. Your moment of cynicism for the day. by rk · · Score: 1

    "Soon to be in prototype" is synonymous with "does not exist"

    Ah, but what the hell, let's post it to the front page anyway.

  68. Three words by H.G.+Pennypacker · · Score: 1

    Computational Fluid Dynamics.

    --
    -- HG Pennypacker, wealthy industrialist and philanthropist
  69. Venture Bullshit Company by Anonymous Coward · · Score: 0

    This looks to me like a venture backed bullshit company. Here are some clues:

    Management team includes 2 people who are nobody's, just investors "putting together advisory boards..." and whatnot...

    They are not hiring. WHAT!!! If they are really doing this, there is no frigging way they wouldn't be hiring.

    The "about this company" says that they are a "leading provider of..." - a dead ringer give-away that the company is a tale spinner, since they have never provided a thing.

    These news stories have been seeded in order to attract attention. The attention will be used to try to leverage a buyout (of patents, probably) or large-scale investment. I doubt they even have the silicon designed. Just BS.

    Nothing to see here. Move along.

  70. Typo In Story Title by gadders · · Score: 2, Funny

    It should read: "Clearspeed Makes Tall Claims for Fictional Chip"

  71. What's the matter, michael? by Anonymous Coward · · Score: 0

    Been reading the Inquirer again too much? You smell a little bit cynical across the pond, you see.

  72. Because by michaelhood · · Score: 1

    michael tell me!

  73. Vapor... or not? by John+Whitley · · Score: 1
    From ClearSpeed's website
    HPEC 2003
    Lexington, MA
    September, 2003

    Lockheed-Martin and Worldscape Defense presented the results of their work using ClearSpeed's processing solutions.

    They benchmarked FFT and pulse compression algorithms and found between 20 and 30 times improvement in performance per watt against competitive solutions.

    That page also has a PDF of their presentation at the 2003 Microprocessor Forum. Whether this technology will pan out is a matter for the markets, but ClearSpeed isn't looking very vaporous.
  74. sneak peek at CS308 by Anonymous Coward · · Score: 0
  75. in other news by trb · · Score: 1

    My dad is the smartest person in the world.

  76. Oi! by supabeast! · · Score: 1

    Does anyone know if this company employees the same marketing/PR firm that handled the BitBoys?

  77. If you build it, will they come? by Animats · · Score: 1
    It's certainly feasible to build a machine with 64 FPUs that can operate in parallel. Finding an application for it is tougher.

    Getting data in and out fast enough to feed the thing will be a problem. It will probably only achieve its rated speed when it's working intensively on small data sets. That's a typical DSP application. This might be a useful part for a software radio. They mention radar applications, which are basically software radios.

    That ratio of MFLOPS/watt would help for graphics processors, but they need to be architected differently. Still, if they've figured out a way to get FPU power consumption down, that's helpful.

    Thus, it's not a useful device for desktops.

    As for it being vaporware, it sounds like they have it running in simulation but haven't had it fabbed yet. They're offering it as "intellectual property", which means "buy the VHDL file". If they have good VHDL, and are honest about the simulation results, they could have valid performance numbers. But from the information give, we don't know that.

  78. I'll be back(ing... over my dog) by Spyral999 · · Score: 2, Funny

    I'm sure that cyberdyne chip is working out well for them... But what are they going to do with the arm? Juggling? Labyrinth-esque sphere stuff? I kinda shudder to think...

    --
    The big print giveth and the small print taketh away - Tom Waits
  79. MS Windows announced 1984; works 1993 by peter303 · · Score: 1

    MicroSoft the master of pre-announcing vaporware. Sometimes it eventually does work!

  80. Plausible by saha · · Score: 3, Interesting
    Clearspeed formerly known as Pixelfusion was a promising graphics chip company that developed these scalable SIMD processors a few years ago. They put 24Mbits of RAM directly on to the chip, to have the enormous memory bandwidth that was and still is unheard of in the industry. After the industry attention shifted towards Nvidia, ATI, 3DLabs the board of directors reorganized the company to focus on high speed network switchers and routers.

    Some of the hardware design came from from engineers in Bristol, UK. Companies like Division and INMOS (anyone remember the T800 and T9000 transputer and a Microway board for parallel computing on a PC board more than a decade ago?). The other half of the design team came from UNC computer graphics lab in Chapel Hill. From the well known PixelFlow and PixelPlane machines. That along with a Taiwanese fab plant that would produce these SIMD processors with extra PE (SIMD Processor Engines) that would compensate for the manufacturing errors. eg. Lets say the chip would have 100 PEs so they would manufacture it 120 PEs. Those that didn't work they'd switch off and they wouldn't have to throw away the entire chip.

    The story of PixelFusion was unfortunate. They could have rocked the computer graphics world with their scalable tile based rendering technology and efficient manufacturing methods. The programmable PEs would be able to handle both Direct X and Open GL. I suppose now they are trying to focus their investment and IP into more generic applications. I find their claims to be plausible because they have demonstrated innovative chips in the past.

    My 2 cents

  81. Um, guys? It's 25GFLOPS, not 25GHZ! by Moekandu · · Score: 2, Informative

    The chip will have 64 parallel FPU's. If it can complete one floating point operation per cycle, it will only need to run at about 350 to 400Mhz to reach 25GFLOPS (latency and pipeline issues aside, of course). Even if it requires 2 clock cycles, or the first 32 FPU's feed the second, we're talking about 700 to 800Mhz.

    I'm not certain, but I thought I ran across similar number crunching capabilities in Integer OPS. It seems to me to have been in regards to fibre fabric and switching.

    Or I could be on crack.

    Hm.

    Moekandu

    --
    Mediocrity knows nothing higher than itself; but talent instantly recognizes genius. -- Sir Arthur Conan Doyle
  82. For 16G it better be fast! by mysterious_mark · · Score: 1

    Article also gives an srp of $16,500, for that much it better be fast! (Think of the linux boxen cluster you could build for 16G). It would be nice to be able to a full Navier Stokes simulation on my PC, mayber Santa we send me one...

  83. 200 MHz is slow in a 3 GHz world by peter303 · · Score: 1

    Since you can get 6 GFLOP in a conventional x86 compatible CPU, why go to incompatible technology for a 4X speed improvement?

    1. Re:200 MHz is slow in a 3 GHz world by Johan+Veenstra · · Score: 1

      The difference is not the 4x speed increase. The difference is 10x-30x power decrease.

  84. You sir, are an idiot. by pr0ntab · · Score: 1

    I will tell you in confidence that the US GOVERMENT (as you so capitalized, would you care to be more specific?) is the sole entity keeping SGI alive.

    No one would pop it into their desktop. Most spooks with desktops wouldn't be allowed to open their damn computers, lest they violate the service contract with whomever their superiors pork-rolled.

    No, that kind of thing stays in the data center, where it can bought by writing a few short purchase orders for inordinate amounts of budget padding money.

    --
    Fuck Beta. Fuck Dice
    1. Re:You sir, are an idiot. by SirMack · · Score: 1

      Interesting. The version that the government used for years used 2 cards that plugged in conjunction with each other and easily increased the system performance 500%. This allowed them to use virtually ANY available computer anywhere in the world for HIGH END computing tasks. All they packed with them were the 'super computing' upgrade cards that were also encrypted with protection and used a propreitary programming language that i'm sure wasn't easily decoded. Specialized software was also written for them. Everyone from image analysis, voice analysis, to code cracking could be done through specialized suites of applications made for the 'portable super computer." These items were used by just anyone, as you mentioned, they stayed MAINLY in the nucleus of a data center or were used by covert agents world-wide. From appearances as well, from what I have heard, the cards lookd like huge TV tuner cards (and functioned as one) to mask their real function.

  85. Unusual Scepticism, and Transmeta by istartedi · · Score: 1

    I wonder how many people work for Slashdot and own shares in Transmeta, which is coming out with the TM8000 right now, and is announcing earnings tomorrow. Full disclosure: I own Transmeta shares too.

    Now, usually Slashdot greets these RSN products with glee and neglects to mention that they are vapor. Not this time, nosiree. Why? Because if it were true it would compete with Transmeta.

    Not accusing anybody of anything wrong here... just... well... I've drawn my conclusions. You draw yours.

    --
    For all intensive purposes, "whom" is no longer a word. That begs the question, "who cares"?
  86. Price per flop VS Mac G5... by gbulmash · · Score: 1
    In large-scale supercomputing applications, if the $16,500 holds, you're still better off with VT's G5 supercomputer as it's less than half the price of a comparably speedy machine based on these chips.

    I don't have a question on how these chips will plug in. Most likely their card will contain 2-8 of these chips, plus a controller and specialized RAM, all interconnected by their proprietary bus (mentioned in the press release). It will do a large chunk of the processing in isolation from the CPU and other system components, sending back aggregate results through the PCI-X or Hypertransport system bus.

    I can't say they'll build a world-beating supercomputer out of them. They seem like a way of bringing "sort-of supercomputer" power to environments where having a few $65k 100 gflop workstations running specialized apps makes sense.

    That's, of course, if they're not vaporware designed to generate another round of VC funding before they slip the projected ship date.

    - Greg

  87. Good points ! Link for you. by zymano · · Score: 1

    I remember the transputer.Good points. Didn't know they were a former graphics card company.

    Explains there vector parallel processing knowlege.

    But there could be a developing problem for them.
    New Nvidia and ATI graphics cards can compete with this chip.

  88. not too good to be true by phr1 · · Score: 1

    This chip sounds like a big parallel DSP. All those transistors on a Pentium 4 that go into the virtual memory system or the branch prediction or out of order pipeline juggling, in the DSP are dedicated to number crunching. I don't know how much the crunch power of this chip exceeds those of a current high end graphics chip (NVidia, etc). but it's probably not that big a ratio. The graphics chip also beats the heck out of a Pentium 4 in raw parallel arithmetic speed. The graphic chip is of course very specialized (for crunching display lists) while the DSP is a little more general (crunching 64-element vectors, like a Cray-1). It's likely that some multimedia applications like MPEG encoders can be sped up a lot by this chip, but don't expect it to make MS Word or GCC run any faster.

  89. Re:Good points ! Link for you. by zymano · · Score: 2, Informative
  90. easy by riqnevala · · Score: 1

    You only have to cool it down to -1 Kelvin temperature and send it to sun's orbit with 1.1 times the speed of light.

    --
    love slashdot. populate it. use it. abuse it. hate it. kill it. miss it. stop following links, they only kill servers.
  91. Chip Multithreading is all the rage now... by maitas · · Score: 1

    Intel has it on its P4, IBM has it on its RS64III and POWER4, Sun will have it on its Niagara... everyone is going the same way.. now, none has find a way to transform serial code to parallel. There's a simple Touring machine that converts parallel graphs to serial, no one has ever came up with a Touring machine that takes serial code and makes it a "perfect" (every posible paralelism degree) parallel one. Thas the saint-grial that seems unrecheable... up to then, only some apps can scale with those CMT processors.

    1. Re:Chip Multithreading is all the rage now... by joto · · Score: 1
      now, none has find a way to transform serial code to parallel. There's a simple Touring machine that converts parallel graphs to serial, no one has ever came up with a Touring machine that takes serial code and makes it a "perfect" (every posible paralelism degree) parallel one. Thas the saint-grial that seems unrecheable... up to then, only some apps can scale with those CMT processors.

      While obviously true, it means very little in practice. Mostly, tasks that take a lot of time are easily converted to paralell subtasks by a human. And we certainly don't need "perfect" paralellism. For some tasks, it's sufficient with "faster", and for all tasks it's sufficient with "faster at same or lower cost".

    2. Re:Chip Multithreading is all the rage now... by maitas · · Score: 1


      Actually you will hit Amdall law's very soon. With a code that is 95% parallel and 5% serial (100 minutes run), depending on the processor count, you've got:
      1- processor: 100 minutes, 0 speed-up
      2- processors: 52.5 minutes 1.9 speed-up
      5- processors: 24 minutes 4.17 speed-up
      15- processors: 11.4 minutes 8.77 speed-up
      50- processors: 6.9 minutes 14.49 speed-up
      100- processors: 5.95 minutes 16.8 speed-up
      300- processors: 5.32 minutes 18.8 speed-up

      You can see that if you have 5% of your code that can't be parallelized by any human, you won't gain almost nothing at all going 50 to 100 or even 300 processors (or independent, non-blocking threads, if you like).

    3. Re:Chip Multithreading is all the rage now... by joto · · Score: 1
      Actually you will hit Amdall law's very soon. [snip]

      For the current generation of typical computers, 50, 100, or 300 processors is still a lot more than what is common. Mostly, it is 1. Meaning we still have quite a lot of speedup to gain before Amdahls law makes it all worthless.

      Systems with more than 1 processor today are usually servers or running other kinds of easily paralellizable processes.

      The main reason most of todays systems avoid having too many processors is probably not Amdahls law. My guess would be a combination of history, market demand (will it run my software (and better?)), and the obvious problem of memory bandwidth which makes multiple processors cost more than just the processor cores, if you want to keep them busy.

  92. Bit Boys by Zep1a · · Score: 0

    Maybe they can incorporate the "Not Yet in Prototype" Bit Boys Oy! Advanced graphics chip technology....

    That will be one awesome "Does Not Yet Exist" chip!

    :)

  93. Do we really wanna trust these guys? by y2imm · · Score: 1

    ClearSpeed Technology Inc. 20 North Santa Cruz Ave, Suite C Los Gatos CA 95030-5917

  94. The key phrase is in the Wired story... by Anonymous Coward · · Score: 0

    The key phrase is in the Wired story...

    Ah the key phrase here is that it came from wired magizine, the abhored wanna be magazine that has wanna be tech articles.

  95. Math intensive... ? by alakon · · Score: 1

    In otherwords ... cutting edge is going to be expensive forever? I am very happy that today I can buy a great machine for doing CFD calculations for only a few hundred dollars, even cheaper if I put it together myself. Considered the length of the grant-cycle I usually have outdated technology... I suppose if this sort of technology becomes the leader, I would still have to have the resources of a large research university to get anything done. At least the way it is now, my home machine is faster then the one at work!

    1. Re:Math intensive... ? by joto · · Score: 1
      In otherwords ... cutting edge is going to be expensive forever?

      Uh, what would you expect. When "cutting edge" eventually gets affordable, don't you suspect that someone would offer something a little bit faster that costs just a few hundred times more?

      Considered the length of the grant-cycle I usually have outdated technology... I suppose if this sort of technology becomes the leader, I would still have to have the resources of a large research university to get anything done. At least the way it is now, my home machine is faster then the one at work!

      Then rent some computer time from somebody that has bought a decent machine. It's still possible to do that, right? :-)

  96. see also: Pixelfusion any realware yet? by Anonymous Coward · · Score: 0

    These guys have been around before.
    I don't remember anything that wasn't vaporware..

    http://www.model.com/news_events/pr/pixel.asp

    What happened to this thing anyway??

  97. I am skeptical by mykepredko · · Score: 1

    Hey psy,

    This is true iff the chip is using standard/existing fabrication tools, processes and development/layout tools. Looking at the articles, it seems like the chip is designed using traditional methods, so except for the "ClearConnect Bus", there doesn't seem to be any ground breaking technology. I would be interested in seeing how a packet based network linking 64, 32 bit processors would be implemented on a standard piece of silicon.

    From this chip's perspective I would like to understand how data will be shoveled in and out of the chip to allow it to run at full 25 GFLOPS performance - will the performance of the multiple "processing elements" be hindered by the lack of bus bandwidth? 384k ain't a whole bunch of memory for data AND programs for the 64 "processing elements" (only 6k per processor).

    myke

    1. Re:I am skeptical by psyconaut · · Score: 1

      Myke...

      Are you questioning the PCI bus's ability to shovel enough data to these chips? ;-)

      (For those who didn't read the WiReD article, the company mentions "PCI supercomputer" cards).

      -psy

      P.S: You still in Toronto?

    2. Re:I am skeptical by mykepredko · · Score: 1

      Yes, I would love to see how they expect to do something meaningful with 25GFLOPS while shovelling it through a single PCI bus. I guess you could calculate pi or Napier's constant to whatever decimal place and have enough bandwidth, but I just went through a finite element analysis of a board heating up and I can't believe that you would see a significant jump in performance for an application like this even if you are relying on PCI-X.

      Yessir, still in Toronto, survived the provincial election, prepared to get bored through the municipal election and completely ignore the Federal one.

      myke

  98. A BLITTER CHIP BY ANY OTHER NAME.... by The+Lynxpro · · Score: 1

    Sounds to me like a Blitter chip! I better check and see if this'll work with the TOS 2.5 ROM upgrade for my Atari 1040ST! Bonus points if it'll plug into the cartridge slot! I can't wait for the ST to beat the G5 using Cubase Audio now... :)

    --
    "Right now, somewhere in this world, Scott Baio is plowing a woman he doesn't love," - Peter Griffin, *Family Guy*
  99. Premature... by gweihir · · Score: 2, Interesting

    Without a working prototype they have nothing.

    With a working prototype they still have not much.

    With a working, and cost-efficient manufacturing process, they have something.

    When there are compilers that actually can use this kind of thing, it starts to be somthing that is real.

    My guess is they are about a decade from a reliable, usable and cheap product. Suddenly these numbers do not sound impressive at all...

    --
    Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
  100. Sounds real to me by oji+eisai · · Score: 1
    Some of you get it some of you don't.

    It aint going to run windows or even linux. But it will accelerate some difficult real world problems. There's probably even money to be made there - only a tiny percentage of CPUs sold go into PCs after all.

    The key thing is low power. And low cost. OK. The two key things are low power and low cost. And scalability. And a compiler. And...

    Who needs supercomputers when you can have this on your desktop.

    And... I hear they have working silicon already....

  101. What about memory bandwidth? by illumin8 · · Score: 2, Interesting

    If I understand the article correctly, it looks like they're implementing a much more powerful version of Apple's Altivec SIMD technology. My question is, if computing power increases 500x using this technology, doesn't memory bandwidth and system bus speed have to increase exponentially as well just to realize any gains?

    It seems like putting one of these cards in a PC with today's technology would be like sticking a mainframe behind a 300 baud connection: sure it can handle millions of transactions a second, but you'll never actually see that kind of throughput because memory is so slow.

    --
    "When the president does it, that means it's not illegal." - Richard M. Nixon
  102. Re:yes by Anonymous Coward · · Score: 0

    you sure know alot about gay

  103. prototype by h4x0r-3l337 · · Score: 1

    I get the impression that they may have chips, but not ready-to-use boards for PCs, and that *those* are the prototypes they speak of. Other than that, this just sounds like the old "Transputer" idea recycled.

  104. So what, Floating Point Systems already did this.. by linuxbikr · · Score: 1
    Sounds like just the latest generation of processors geared for number crunching. Not a new idea. Floating Point Systems built array processors that were tied to IBM mainframes and fed computations and data for them to chew on years ago. Since they were dedicated number crunchers, they blew away the mainframe by a factor of 10. Oil companies and other consumers of raw numerical power bought these things by the dozens because they were cheap ($100K vs. millions for the mainframe they were attached to) and could be moved from task to task (disconnect it and roll it to the workstation for that geology analysis...).

    Heck, when the NSF built the five National Centers for Supercomputing Applications, one of the facilities used a bunch of FPS array processors to give them the number crunching power instead of a traditional IBM, CDC or Cray supercomputer.

    Nothing special about these guys and I'd take what they have with a grain of salt. Unless they're offering half a teraflop for $500K out-the-door in real world workloads (fluid dynamics, oil exploration, weather simulations, finite element analysis, etc), forget it. 25GFlops peak? Big deal. It's what it does on real work that matters, not the amount of useless calculations you can perform. Plus, they don't even have pre-production silicon. The problem is the compiling and the programming. If you can get those tasks down and be able to write code that can harness that power efficiently, their nothing but expensive serial computers at best and boat anchors at worst.

  105. Supercomputer Performance by Anonymous Coward · · Score: 0

    Coincidentally, I have been building my own computing cluster at home. It will involve 7 P4 2.4GHz processors. Extrapolating from benchmarks on other clusters, I estimate the 7 nodes to be capable of about 30 gigaFLOPs.

    Now they claim (2nd article above - TechNewsWorld) 64 processing elements to get an aggregate of 25 gigaFLOPs. A little quick math shows that each processing element has to be about equivalent to a 220 Mhz P4 FPU. Definitely doable.

    But 3 watts? Hmmm, that might be stretching technology a little. VIAs processor chips, the lowest power general purpose CPUs I know of, achieve (they claim) about 7 watts for 900Mhz operation. Now, granted, the FPU seems about 1/2 as fast as P3 or P4 FPUs, clock for clock, but there's a helluva lot more than just a FPU on any of these chips. Maybe doable.

    But the price! Jeeez, Louise! from the NewScientist article:
    Beese says a single chip will initially cost around $16,500.
    for 25 gigaFLOPs! My cluster is coming in at around $1500 for about the same performance. Admittedly, there are classes of problems that will perform better on such chips (anything that requires tightly coupled data, usually solved with shared memory among many processors) but I think they are off by a factor of 10!

  106. Not a lot of RTFA happening... by TexVex · · Score: 2, Informative

    So I'll summarize some interesting key points:

    1. The chip is fully programmable and an SDK invluding C compiler is available now.
    2. The chip will be marketed as a coprocessor.
    3. They expect to start selling them for around $16,000 in a few months.

    --
    Fun with Anagarams! LADS HOST, SHALT DOS. HAS DOLTS. AD SLOTHS, HATS SOLD. ASS HO, LTD.
  107. ExtremeTech had the story since this AM by writertype · · Score: 1
  108. Array processors are becoming popular by Anonymous Coward · · Score: 0

    Building multiprocessor chips, or chips from arrays of processors has become a fairly hot design approach. There are a number of companies using it. It seems to be especially popular in the reconfigurable computing area. There is an interesting paper here. These processors go well beyond the current crop of dual CPU core chips like the P4, Power 5, and Ultrasparc IV.

    Clearspeed's chip is a static 64 processor array chip aimed at FPU intensive applications, but there are many more things that you can do with array designs.

    Mathstar is building a reconfigurable chip with hundreds of elements availble in various mixes of processors, memory blocks and other components. They are trying to replace ASICs and FPGAs as a platform for some part needs. There was a story on their architecture in EE Times a couple of months ago.

    Intel is wokring on an array based processor aimed at the radio / communications market. I will be interested to see if their work with these chips ends up being used in other Intel chips. That could be deadly. So, the Pentium-X sucks at that task today? [Morph] Not now!

    Phillips has what they call Silicon Hive technology which is another reconfigurable processor of functional blocks.

    There have been plenty of companies using arrays and reconfigurable techniques too, like Altera and Chameleon.

    Sun bought up a start up and is developing massivly multithreaded processors based on the start-up's technology. They call it Throughput Computing. They claim that in about two years they will have a chip 30x faster than todays designs. I'll be very interested to see if they can do that.

    The next couple of years should be very interesting on the processor front.

  109. "T800" explains it all. by Anonymous Coward · · Score: 0

    The T-800 transputer, you say? This must be proof that somebody indeed has found a neural-net CPU from the future.

    Wait, wait, based on what we've seen of it running, that means the only effective language to program it in is gonna be a division of COBOL. Noooo!

  110. Re:I HAVE AN E-R-E-C-T-I-O-N!!!! by Anonymous Coward · · Score: 0

    Suicide.

  111. Not surprising by be-fan · · Score: 1

    Actually, this isn't terribly surprising if you look at the specs. Its a vector processor with 64 processing elements. Each PE has an FPU. The 25 gigaflop theoretical rating probably comes from FPUs * Clock_Speed, so the thing probably runs about 400 MHz. You have to understand that this isn't a general purpose processor --- you just send it some numbers to crunch, and it sends numbers back to you.

    --
    A deep unwavering belief is a sure sign you're missing something...
  112. Incredibly powerful giga-peta-petaflop machine by Anonymous Coward · · Score: 0

    I just witnessed a machine that could simulate the motions of trillions of trillions of water molecules under different temperature configurations. It can do this at speeds so incredibly fast - billions of samples per second can accurately be made against any of the particles. Currently, the scientists are now working on a way of reading this data as fast as possible.

    Let me describe the machine:

    It's a plastic cube filled with water.

  113. What I'm saying is... by pr0ntab · · Score: 1

    I work for the government.

    Well, actually I work for the contractor that provides information systems to the DoD, IRS, Customs, Coast Guard, etc.

    Honest to god... it's bunk. Paranoid ramblings.

    COTS is the big push, or big iron. You have to realize that the end users are not that bright. They pay us to figure out what can make it simplest.

    Put a card in each persons' machine? SUPPORT NIGHTMARE. Not going to happen. They want us to deliver "something", a nice packaged-up solution. Usually it's based off commodity parts, so they can drop and switch vendors on a dime.

    The NSA is the only group that might have been using something like that so secretly, but a little birdie tells me that they are basically the only people that buy the 9 micron Alpha EV7 chips, and the sole reason HP keeps that contract alive.

    Those monsters would eat your "tv card" alive. ^_^;;;

    Oh, and speaking of proprietary programming languages: who's going to teach them to learn it?

    You might be confusing that with Ada (which the US Gov still likes for some reason), or Smalltalk. Powerful languages that only still exist in the government nowadays.

    Did you know that Microsoft has it's own programming language that looks like Objective C that they program all their operating systems in? And that every developers' computer is used in parallel when making daily builds of software!?
    THAT's probably the closest thing to what you are implying.

    Guys with the money in government are suspicious of technology, and costs. That kind of fantasy story is NOT characterstic of even crazy, tech-happy places like ONR or JPL.

    Sorry. If you worked for SAIC or some place like that for a few weeks, you'd get the idea.

    Take off the tin foil hat, man. I'm not saying the US Government DOESN'T have huge computing power, but it's not so silly sounding. Kind of boring, really. BUt very expensive, don't kid yourself.

    The thing you might want to start worrying about is "netted sensors". I keep hearing that word thrown around a lot, and it's going to become real hard to sneak out of the country in a few years, I GUARANTEE THAT.

    --
    Fuck Beta. Fuck Dice
  114. Slide show from ClearSpeed by hobit · · Score: 1
    For whatever reason I was having problems downloading the slide show from my home computer, though I had no problem from work. I've mirrored it if anyone wants to look at it.

    It is a SIMD machine. It looks like they've put some real thought into the software, which is the hard part in something like this. The debugger certainly looks pretty.

    The ported C code on slide 13 is a bit scary. The intermediate language appears to rely on the compiler to distribute the workload to the PEs (otherwise why is the loop the same in both.) I'd much prefer the intermediate language give you complete control of the PEs rather than letting the compiler do it for you.

    It does look like there are actual dies out there. Maybe not functional, but built. Also, it looks like the PE communication is more limited than I'd like. There are only two communcation ports. I'd expect four if they want this architecture to scale past 64 PEs.

    Other comments.

    • "multi-treaded" is as close to a false description as you can get. Only one thread (instruction stream) runs at a time. Sure, you get 64 data streams but that is SIMD, not MT.

    • The peak GFLOPS numbers are actually really really poor compared to a P4. 25 vs. 12? And while both are much higher than actually achievable on real programs, the P4 is probably easier to come close on.

    • 64 PEs on chip? It looks like there only 12 million logic transistors. Which after the main core means that only slightly more than 100,000 transistors are spent per PE. I'd really expect to see a whole lot more logic transistors. The Itanium 2 (which uses a newer process) has about 75 million logic transistors ((see here). Doing the same here would give space for more than 256 PEs.

    Given the above issues I don't think this thing is going to take off anytime soon for "super computer" purposes. The big win is high FLOPS per Watt, which isn't all that important for SCs (well not that important). As part of a graphics processor or DSP I could see potential. I still think that in a few years (say 5 to 10) this type of thing will be a coprossesor on a fair number of those Linux clusters used for scientific computing. But this one isn't there yet.

    I've seen people toss around $16,000 as a price point, but I can't find that anywhere. I assume I'm missing something obvious. At that price it is useless. It needs to be under $1,000, and really wants to be a lot cheaper than that to be interesting.

    --
    As Nietsche famously said, "If you stare too long into the Abyss, 1d4 Tanar'ri of random type will attack you."
    1. Re:Slide show from ClearSpeed by wroceng · · Score: 1
      The machine is really multithreaded with special hardware to support threads *as well* as being SIMD. Like a normal processor only one thread can run at a time but pre-emption, priorities etc are supported. So its multithreaded in the same way a normal processor is, not a MIMD machine.


      Power/performance is key in some fields. This chip is 3W whereas a P4 is much more.


      As you point out only 12Mil transistors which means there is room for a lot more even at low power and some articles talk of 128 or 256 PEs coming next. At 256, assuming power is roughly linear, it would be 12 watts for 2xP4 performance. Not bad. Buit you make small chips to begin with and then iron out process issues with bigger chips.

    2. Re:Slide show from ClearSpeed by hobit · · Score: 1
      The machine is really multithreaded with special hardware to support threads *as well* as being SIMD.


      Really? I didn't see anything about such special hardware. Could you point it out to me?

      --
      As Nietsche famously said, "If you stare too long into the Abyss, 1d4 Tanar'ri of random type will attack you."
  115. Used to work there by wroceng · · Score: 1
    Hi all. I used to work at Clearspeed. Their claims are valid, not hype. The article in wired needs rewired a little as the chips _are_ real, not just in simulation. Another article about them points out that manufacturers will have samples this year, ie once a full run, not a prototype run is done at the fab.