Slashdot Mirror


Cringely Wants A Supercomputer in Every Garage

Nate LaCourse writes: "Real good one from Cringely this month. It's on building his own supercomputer, but with some twists." You'll probably also want to check out the KLAT2 homepage to learn more about their Flat Neighborhood Network. And since KLAT2 has been around for nearly a year (check out the poster on this page!), perhaps a 3rd generation is in the works?

17 of 277 comments (clear)

  1. Now is the time for all good men.... by BlueJay465 · · Score: 5, Interesting

    This is a very interesting concept that he is putting forth, but at the same time, how many geeks out there are going to really make use of such a clustering farm? Not everyone I know does video compression projects, and it would seem kinda prohibitive for a black-hat to set one up to break encryption codes. Could someone please tell this naive soul what useful everyday application all these CPU cycles could be used for? (if you say SETI@Home, I am going to bitch-slap you)

    Secondly, UWB seems to be the holy grail of wireless networking, yes, however is this something that the agencies of the world are going to let out of the bag so easily as he says, I can think of the CIA and the NSA having a few choice words about such "undetectable signals" being used by commonfolk after September 11th...

    Just my two cents

    1. Re:Now is the time for all good men.... by An+Onerous+Coward · · Score: 4, Interesting

      I am now telling the computer exactly what it can do with a lifetime supply of chocolate.

      Okay, we need to burn some spare cycles. Lots of them, in fact. I have some ideas. There may even be a couple in here that can be taken semi-seriously.

      * SETI@H. . . Why are you looking at me like that? Admittedly, it's cliched, and I'm the impatient type who figured I'd find my first LGM within a week. Or by the end of the year at the very latest. But I still think that it's a pretty cool thing to be doing. Or load up one of the alternatives like Folding@Home.

      * Find a buddy with a similar supercomputer, and have them play chess. Or tic-tac-toe billions of times every second (sorry, War Games flashback).

      * There are lots of mathematical problems out there just begging to have a few supercomputers thrown at them. I'm not aware of what they are, so consult your local Mathematics department and offer your services.

      * If you're not interested in doing video compression or complex scene rendering, you might be able to find someone who was. Some indie film maker who wants to play with the big kids is going to become your new best friend. Be sure to ask for a walk-on.

      * Some sort of AI project could be interesting, providing you have some specialized training. Or you just give someone at MIT telnet access.

      --

      You want the truthiness? You can't handle the truthiness!

  2. HP Did This Too by MathJMendl · · Score: 5, Interesting

    ZDNet has an article of HP building a supercomputer like this as well, called the "I-Cluster." It has 225 networked computers running Linux Mandrake (so changes could be easily made) on 733 MHZ out of the box PCs. The only catch is that is is slightly more expensive- $210,000 (minus network cabling). On the other hand, they plan to release the open source tools they made as well, so that people can repeat this.

    --


    "I have not failed. I've simply found 10,000 ways that won't work." --Thomas Edison
  3. Re:genetic algorithms by An+Onerous+Coward · · Score: 5, Funny
    "Ironically, they needed a supercomputer to design a supercomputer."


    And it shall be called. . . Earth!
    --

    You want the truthiness? You can't handle the truthiness!

  4. /. needs a Cringley icon, any suggestions? by John+Harrison · · Score: 4, Funny
    I think a fake Stanford degree would do nicely.

    Maybe they could set things up so that ALL his articles hit the main page as soon as he posts them.

    If this were the case he could put a "discuss this article" link on his page and simply link to /.

  5. Old news by SumDeusExMachina · · Score: 4, Offtopic
    Sorry to say it guys, but this is a repeat of an old Slashdot post that linked to an ArsTechnica article more than a year old.

    Still though, after having to wallow through Cringley's painful lack of comprehension of basic technical knowledge, reading the ArsTechnica piece again was quite refreshing.

    --

    Is your company running tools written by ma
  6. the ignorant are easily amused by markj02 · · Score: 5, Insightful

    Cringely is completely missing the point. KLAT2 uses multiple routes and switches, not channel bonding. And what the project contributes is not the basic idea of using multiple network interfaces (which is decades old), but a specific approach: using genetic algorithms to optimize the network topology. More traditionally, such clusters have used manually designed topologies with known performance bounds.

    1. Re:the ignorant are easily amused by funnyguy · · Score: 4, Insightful

      the FNN which was created for KLAT2, is not a speed increase of ethernet by using multiple network cards. It basically allows full speed (100mb full-duplex) without a 64+ port, full wire speed switch. If such a thing even existed. Cringley's network is just 4 channel bonded network layers. Channel bonding actually has slightly more overhead than FNN. With KLAT2's FNN, each machine is on 4 seperate networks. No matter what other machine a single machine needs to communicate with, they each share one common network. Each network is held together with one switch, so there is always a full speed route to every other computer in the cluster. The OS handles this directly by using /etc/ethers to hard code the hardware addresses of every computer. different networks are different subnets, and the network routes are layed out accordingly.... blah blah... I could go on and on, but aggregate.org has more info.

      As for the algorithm everyone is talking about. there are some versions which can return a pattern in a second or two on a slow celeron. then there are some version which are designed optimized for certain datasets which take time to run. but generally, you don't need a supercomputer to design a fnn. even with 64+ nodes.

    2. Re:the ignorant are easily amused by Zeinfeld · · Score: 4, Insightful
      Quite, the problem with measuring super-computer performance is that every single machine in the class is highly optimised to a particular niche. That is the main rason they are so expensive compared to the components - large machines sell in the tens rather than the tens of thousands.

      Anyone can build a machine with a really high processing performance. Just by a few thousand X boxes and plug them into the same ethernet cable. The real issue is how much communications bandwidth you have between the CPUs. Some problems require almost none - the 'trivial parallelism' problems like DEScrack and the mandelbrot set. In the 1980s we had a machine that had 1000 20MHz processors that could bang out mandelbrot sets like anything (using the goofy algorithm, not the modern optimizations). But is wasn't much use for anything else.

      The problem with competitions for supercomputers is that they rarely measure the communication bandwidth because (a) its hard to do and (b) the effect on performance is highly algorithm dependent.

      As for the KLAT's ingenious topology, I once did some research in the area myself when it was the fashion. I tried using minimum diameter graphs which should in theory have been better than a plain taurus. However as with Bill Dally at Cal Tech I concluded that the additional cost of exotic topology (more than double the price) was not really justified by the performance advantage (about 10-30% on a good day).

      Certainly the many companies that set up to build transputer based processing clusters with high performance switches inside did not seem to go anywhere much.

      Using a high performance router at the core of a processing cluster might be interesting. They are pretty cheap these days and are headed cheaper.

      --
      Looking for an Information Security student project suggestion?
      Try http://dotcrimeManifesto.com/
  7. Supercomputing? Why bother. by Bowie+J.+Poag · · Score: 5, Insightful



    Speaking as someone who, yes, has actually worked with the big iron...

    Why bother. Remember, Moore's Law is still in effect. Recently, we've hit the point in the curve where supercomputers are no longer needed, nor cost-effective. That is, the time it takes for the industry to deliver a far superior product has eclipsed the average lifespan of your typical supercomputer.

    We're living in an age where a single graphing calculator you can buy at Walgreens has more horsepower under the hood than what got us to the moon 30 years ago. Your $2700 PC will be worth $150 within 3 years.

    Having a supercomputer in every garage makes about as much sense as taking a rocket fuel-powered dragster to the supermarket for a gallon of milk.

    Cheers,

    --
    Bowie J. Poag

    1. Re:Supercomputing? Why bother. by Rasta+Prefect · · Score: 4, Interesting

      I don't know about every garage, but as someone who is currently working on a research project at a University, I can say we'd find something like this very interesting, as would a number of other departments on the campus. We've got a couple of Crays sitting around, but can't afford the cost of maintaining the things. Something like this would be way more affordable to buy and maintain for educational/research purposes where traditional supercomputers aren't even vaugely an option.

      --
      Why?
    2. Re:Supercomputing? Why bother. by Zeinfeld · · Score: 5, Interesting
      Speaking as someone who, yes, has actually worked with the big iron...

      The machine I worked on in the early 90s is still in the top 100 of the supercomputer charts (or would be if the compilers knew about it).

      While a desktop Cray-1 can now be had at commodity prices the machine is now two decades old. The obsolescence rate is nowhere near as giddy as some would claim.

      The really big iron tends to have a lifespan of about five years and is typically retired because the power consumption and maintenance costs favor a move to newer hardware. True supercomputers rarely fall victim to Moore's law. Even the KLAC machine discussed only barely qualifies as a supercomputer, 64 processors is at the low end of the scale. People have Web servers with that number of CPUs. True big iron starts with a few hundred processors and goes up to the tens of thousand.

      If by working on the big iron you merely mean you used to use IBM 3090 class machines, then the joke is on you, those machines were often obsolete before they were manufactured. When I worked at one lab I had a desktop machine (first production run Alpha) that was considerably more powerful than the CPUs of the just-installed campus mainframe.

      Fact is that many of the people buying 'big iron' in the 1980s and 1990s were incompetent. They bought machines that ran the O/S they knew, which often meant they bought obsolete IBM mainframes for applications where a ntwork of IBM PCs would have served far better. I spent quite a bit of time in institutions where wrestling control of the computing budget from an incompetent IT dept was a major issue. In fact the World Wide Web began at CERN in part as a result of such a struggle. Tim, bless him wanted the physicists to switch from the IBM mainframe CERN VM to use NeXTStep machines. One of the schemes that the CERN CN division had cooked up to force people to use the mainframe was to only make information such as the address book available on the IBM mainframe. Attempts to make it more widely available were treated much the same way that Napster was treated by the RIAA. The Web took off at CERN initially because you could access the address book from a workstation or from the VAX.

      Very few mainframes were actually designed to provide fast processing. The IBM 3090 series was actually designed to perform transaction processing for banks. As a scientific CPU it offered tepid performance at a price arround 100 to 500 times the price of a high power workstation.

      There are certain applications in which CPU cycles are still the limiting factor. Admittedly they are much smaller as a proportion of the whole than they were 10 years ago.

      --
      Looking for an Information Security student project suggestion?
      Try http://dotcrimeManifesto.com/
  8. My Hank Dietz (creator of KLAT2) story by IvyMike · · Score: 4, Interesting

    Dr. Dietz used to teach at Purdue, and I had the good fortune to take a compiler course taught by him. On the first day, when introducing himself, he came to the part where he was describing how to get into contact with him. When giving out his phone number (at Purdue, on-campus numbers were 5 digits long) he mentioed that his phone number was "GEEKS". He added, "No, I didn't ask for GEEKS, but when I figured it out, I thought it was pretty cool."

    Needless to say, it was a pretty cool course.

  9. A real supercomputer? Not exactly by fgodfrey · · Score: 5, Insightful
    The article would have people believe that all a supercomputer is is a collection of not-quite-modern processors, memory, and an interconnect of some sort. This is simply not the case. If it were, why do many (granted a smaller number than before) people still buy real big iron? The answer is that Cringely's (sp?) collection of processors is not a real supercomputer for the kinds of applications that are associated with traditional machines. Traditional vector supercomputers still have processors that are faster than Pentium 4 class systems. Traditional massively parallel supercomputers (which are the most similar to a cluster) have a number of features not found in your average garage built cluster like a truely low-latency interconnect, gang scheduling of entire jobs, single system image for users/administrators/processes.

    Clusters are great for embarassingly parallel applications (ie ones that have threads which don't communicate with each other much. This includes things like SETI@home and batch rendering of images. What they don't compare on is applications that communicate a lot like nuclear physics simulations. This is not to say that that will never change in the future, but for the time being it's still true.


    Last, and certainly not least, real supercomputers have memory bandwidth that can match the speed of the processor. A Cray or an SGI Origin has an absolutely massive amount of bandwith from the processor to local memory compared to a PC. That allwos a traditional supercomputer to actually *achieve* the fantastic peak performance numbers. On many applications, the working sets are huge and don't fit in cache so you end up relying on memory being fast. On a PC, it's not and I've heard from sources I consider reliable (though I have no actual numbers to back this up so it may be rumor only) that one large cluster site sees around 10% or less of peak on a cluster for a nuclear physics simulation, whereas, on a vector Cray, you can hit ~80% of peak. This means that the cluster has to be 8 times more powerful and when you start multiplying the costs by 8, they start looking like the same price as a real supercomputer.


    So my point is that building a real supercomputer does not mean grabbing a bunch of off-the-shelf components, slapping them together with a decent network and running Beowulf (or a similar product).

    --
    Go Badgers! -- #include "std/disclaimer.h"
  10. Imagine... by Tsar · · Score: 5, Funny

    ...a single-CPU version of this!

    1. Re:Imagine... by nuintari · · Score: 5, Funny

      and a cluster of those!..... oh wait, nuts....... never mind.

      --

      --Nuintari

      slashdot : where an opinion can be wrong.

  11. A real supercomputer? Yes, exactly by Multics · · Score: 4, Interesting
    Your comments are true for a 486. They are not true for anything much newer. An IBM SP machine, which owns half of the top 10 on the top500 list, is basically a commodity parts built system.

    Yes, these systems are not sometimes the best for handling vectorizable jobs, but they are so inexpensive compared to the old specialized hardware that it is easier to waste cycles than build special hardware.

    As to memory bandwidth. Modern CPU caches make the question nearly moot.

    If all of this were not true, then people wouldn't be building clusters and the majority of the top500 list wouldn't be dominated by clusters. Instead there are 3 traditional architecture machines in the top 20. This is the reason that Cray (etal) no longer dominates the marketplace... commodity systems have overtaken nearly all of the specialized hardware world.

    -- Multics