Slashdot Mirror


FASTRA II Puts 13 GPUs In a Desktop Supercomputer

An anonymous reader writes "Last year tomography researchers of the ASTRA group at the University of Antwerp developed a desktop supercomputer with four NVIDIA GeForce 9800 GX2 graphics cards. The performance of the FASTRA GPGPU system was amazing; it was slightly faster than the university's 512-core supercomputer and cost less than 4000EUR. Today the researchers announce FASTRA II, a new 6000EUR GPGPU computing beast with six dual-GPU NVIDIA GeForce GTX 295 graphics cards and one GeForce GTX 275. The development of the new system was more complicated and there are still some stability issues, but tests reveal the 13 GPUs deliver 3.75x more performance than the old system. For the tomography reconstruction calculations these researchers need to do, the compact FASTRA II is four times faster than the university's supercomputer cluster, while being roughly 300 times more energy efficient."

127 comments

  1. Easy money to be made? by Darkness404 · · Score: 0, Redundant

    It sounds like there might be easy money to be made buying these components, putting them in a computer case and then reselling them for profit at various universities. Just wait for the "Dell" of supercomputers.

    --
    Taxation is legalized theft, no more, no less.
    1. Re:Easy money to be made? by Hatta · · Score: 1, Flamebait

      Where do you get a motherboard that can accept 5 graphics cards?

      --
      Give me Classic Slashdot or give me death!
    2. Re:Easy money to be made? by LordKaT · · Score: 0, Redundant

      7 graphics cards. Plus 4 power supplies.

      Methink "easy" in the GP's context means "easier than building a supercomputer from the ground up like IBM currently does"

    3. Re:Easy money to be made? by Hatta · · Score: 1, Interesting

      Oh, I read that wrong, it's 7 graphics cards. Who makes such a motherboard?

      --
      Give me Classic Slashdot or give me death!
    4. Re:Easy money to be made? by Icegryphon · · Score: 1

      Best I could find sofar.
      I am sure someone else can come up with some goodies.

    5. Re:Easy money to be made? by rockNme2349 · · Score: 1

      ASUS.

      I didn't even RTFA, i just WTFV

      --
      Sewage Treatment Facilities - "Our duty is clear."
    6. Re:Easy money to be made? by Chirs · · Score: 4, Informative

      Um...read the article?

      The motherboard is a ASUS P6T7 WS Supercomputer.

    7. Re:Easy money to be made? by daVinci1980 · · Score: 3, Funny

      You must be new here... ;)

      --
      I currently have no clever signature witicism to add here.
    8. Re:Easy money to be made? by joocemann · · Score: 1

      ASUS.

      I didn't even RTFA, i just WTFV

      x2

      lets see the video.

    9. Re:Easy money to be made? by skirtsteak_asshat · · Score: 1

      > Furthermore, the researchers believe the performance benefit will be even greater once they solve the remaining stability problems...

      Hah. Hope they can write BIOS code from scratch... can you imagine trying to get mobo vendor support?

    10. Re:Easy money to be made? by citizenr · · Score: 1

      Where do you get a motherboard that can accept 5 graphics cards?

      msi 890FX-GD70 6x PCIE 2.0 x16

      --
      Who logs in to gdm? Not I, said the duck.
    11. Re:Easy money to be made? by petermgreen · · Score: 1

      umm where in TFA does it say that?!

      --
      note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
    12. Re:Easy money to be made? by Nutria · · Score: 1

      Hah. Hope they can write BIOS code from scratch... can you imagine trying to get mobo vendor support?

      Yet another RTFA (or, in this case, WTFV).

      --
      "I don't know, therefore Aliens" Wafflebox1
    13. Re:Easy money to be made? by jo_ham · · Score: 1

      In the huge bullet point list, in bold, by product code, with a further text explanation for each piece.

      It's halfway down the page underneath the photograph of the machine and the bold face, all caps title "FASTRA II".

      Do you need a screenshot also?

    14. Re:Easy money to be made? by hairyfeet · · Score: 1

      Yeah and it is surprisingly cheap for a board that crazy powerful at $400. I bet we'll see more colleges cooking up their own supercomputers for specialized tasks with the price THAT low.

      --
      ACs don't waste your time replying, your posts are never seen by me.
    15. Re:Easy money to be made? by nurb432 · · Score: 2, Funny

      Does that come in a picoATX version?

      --
      ---- Booth was a patriot ----
    16. Re:Easy money to be made? by Korin43 · · Score: 1

      What's so hard about 6 graphics cards and 4 power supplies? It's not like you have to hook them up differently. The only hard part would be finding a case they fit in.

    17. Re:Easy money to be made? by CityZen · · Score: 1

      There were several difficulties. The most obvious is that they fit 7 double-wide cards into 7 single-wide slots. The next was that the motherboard BIOS crashes when more than 5? boards are installed. The next was that in order to allocate enough I/O space, all unnecessary devices had to be disabled, and even still the Linux kernel needed to be hacked to reduce the space allocated to various resources. After all that, it was a piece of cake.

  2. Awesome by enderjsv · · Score: 5, Funny

    Almost meets the minimum requirements for Crysis 2

    1. Re:Awesome by sadness203 · · Score: 4, Funny

      Only if you imagine a beowolf cluster of these
      Here goes the redundant and offtopic mod.

    2. Re:Awesome by TejWC · · Score: 1

      Sadly it doesn't. Why? Because it appears to be running Linux.

    3. Re:Awesome by Firehed · · Score: 1

      And in Soviet Russia, 13 GPUs supercompute using you!

      (Is that the smell of burning Karma?)

      --
      How are sites slashdotted when nobody reads TFAs?
    4. Re:Awesome by wirelessbuzzers · · Score: 1

      (Is that the smell of burning Karma?)

      No, that's petrified grits you're smelling.

      --
      I hereby place the above post in the public domain.
  3. More Awesome by copponex · · Score: 3, Funny

    This was post #2 and already modded -1, Redundant.

    1. Re:More Awesome by sadness203 · · Score: 1

      Must be redundant in the long run.
      This is sad, since this one was clever.

    2. Re:More Awesome by joocemann · · Score: 3, Funny

      slashdot mods are often, as I observe, sour and pissy skeptics. even if it is humorous to them they will knock it for lack of something else to bash.

    3. Re:More Awesome by joocemann · · Score: 2, Funny

      slashdot mods are often, as I observe, sour and pissy skeptics. even if it is humorous to them they will knock it for lack of something else to bash.

      -1 troll

      lol. exactly

    4. Re:More Awesome by mgblst · · Score: 1

      Oh yeah, the fact that we get exactly the same comment everytime a fast computer + GPU is mentioned shouldn't stop the next moron from posting it.

    5. Re:More Awesome by joocemann · · Score: 1

      Oh yeah, the fact that we get exactly the same comment everytime a fast computer + GPU is mentioned shouldn't stop the next moron from posting it.

      Its so horrible. Oh god they must pay. MOD THEM DOWN! MOD THEM DOWN!

      hows those lemons?

  4. News Flash by RandomUsr · · Score: 2, Funny

    Blazing Fast Pron Machine running Windows Vista. Don't forget to pick a copy of the latest memory intensive Anti-Virus, as this machine will handle it just fine.

    1. Re:News Flash by jd2112 · · Score: 1

      Never underestimate the capacity of Symantec AV to suck CPU cycles...

      --
      Any insufficiently advanced magic is indistinguishable from technology.
  5. How fast is this really? by Ziekheid · · Score: 3, Insightful

    "the compact FASTRA II is four times faster than the university's supercomputer cluster, while consuming 300 times less power" And the original supercomputer was how fast? 512 cores doesn't say THAT much. I could compare my computer to supercomputers from the past and they'd say the performance of my system was amazing too.

    1. Re:How fast is this really? by jandrese · · Score: 5, Informative

      If you read the article it tells you that the supercomputer has 256 Opteron 250s (2.4Ghz) and was built 3 years ago. If you have a parallizable problem that can be solved with CUDA, you can get absolutely incredible performance out of off-of-the-shelf GPUs these days.

      --

      I read the internet for the articles.
    2. Re:How fast is this really? by Ziekheid · · Score: 2, Interesting

      I'll admit that, thanks for the info, you'd think this was crucial information for the summary too though. Everything put in perspective, it will only outperform the cluster on specific calculations so overall it's not faster right?

    3. Re:How fast is this really? by Anonymous Coward · · Score: 0

      Don't you mean Massively Parallel?

    4. Re:How fast is this really? by raftpeople · · Score: 2, Interesting

      It's all a continuum and depends on the problem. For problems with enough parallelism that the GPU's are a good choice, then they are faster. For a completely serial problem, then the current fastest single core is faster than the both the supercomputer and the GPU's.

    5. Re:How fast is this really? by jstults · · Score: 2, Informative

      you can get absolutely incredible performance out of off-of-the-shelf GPUs these days.

      I had heard this from folks, but didn't really buy it until I read this paper today. They get a speed-up (wall clock) using the GPU even though they have to go to a worse algorithm (Jacobi instead of SSOR). Pretty amazing.

    6. Re:How fast is this really? by cheesybagel · · Score: 1

      At least a CPU program, when it crashes, does not bring down the whole OS. Memory protection? Pah, who needs such things... After all you never make coding mistakes. Right?

      It is like MS-DOS programming all over again. Except the computer takes longer to reboot.

      They use a worse algorithmic complexity algorithm in the paper because it actually performs better in the GPU than the other one. This happens in CPUs in several cases as well. When was the last time you saw someone using a Fibonacci heap? Memory footprint matters and taking advantage of the CPU caches matters. The paper also says nothing about CPU SIMD optimizations, which can make a program 3x faster if applied. That would make the performance the same as for the GPU system. Note that I am being generous here and actually ignoring the program setup time when they need to copy the data to the GPU. Because if I did not the pure CPU version would probably actually be faster.

    7. Re:How fast is this really? by jstults · · Score: 1
      Well, I'm not sure about most of your criticisms, but they use Jacobi instead of Gauss-Seidel because SSOR is not data parallel, but Jacobi is.

      That would make the performance the same as for the GPU system.

      Really? Care to share any results that support that? I'm quite sure the peak flops you can achieve on the GPU are much higher than the limited SIMD capability of the CPU.

      Note that I am being generous here and actually ignoring the program setup time when they need to copy the data to the GPU.

      Sure there's communications overhead, but that's true of any parallel processing problem, the trick is to find problems that have a big computation to communication ratio (which happens to be most of computational physics and these tomographic reconstruction problems that TFA mentions as well).

    8. Re:How fast is this really? by cheesybagel · · Score: 2, Informative

      Really? Care to share any results that support that? I'm quite sure the peak flops you can achieve on the GPU are much higher than the limited SIMD capability of the CPU.

      IIRC they claim 2.5-3x times more performance using a Tesla than using the CPUs in their workstation. Ignoring load time.

      SSE enables a theoretical peak performance enhancement of 4x for SIMD amenable codes (e.g. you can do 4 parallel adds using vector SSE, in the time it takes to make 1 add using scalar SSE). In practice however you usually get like 3x more performance.

      Theoretical SIMD performance for the GPU is very fine and nice, but in practice the small caches in current GPUs limit performance. CPUs also often have out-of-order execution support and other hardware which is too expensive in terms of transistors to implement in a GPU.

      IMO the main problem here is that the programming model for the CPU is too complex since you need to use several different ways to express parallelism (SIMD/Multicore/Cluster) to get top performance.

    9. Re:How fast is this really? by BronsCon · · Score: 1

      Only for problems that can be described as Massively Multithreaded, Oratorical, Redundant, Periphrastic, and Gratuitous

      Like WoW and Second Life.

      [Citation: http://thesaurus.reference.com/browse/redundant%5D

      --
      APK quotes people (including myself) without context and should not be trusted. Just thought you should know.
    10. Re:How fast is this really? by wirelessbuzzers · · Score: 1

      IIRC they claim 2.5-3x times more performance using a Tesla than using the CPUs in their workstation. Ignoring load time.

      Their CPU numbers almost certainly take SIMD into account.

      I'm doing cryptography research, and some of my colleagues have been considering building a similar "desktop supercomputer". The speedup there looks more reasonable: a single high-end GPU should be worth maybe 5-10 quad-core CPUs; it costs double and uses double the power, but it's easier to put a dozen of them in a single PC. The numbers aren't as good as for big matrices of floats, but that's because we're doing integer operations and GPUs aren't optimized for those. (But then again, crypto problems tend to set the standard for embarrassingly parallel problems.)

      Anyway, the new "box-fulla-GPUs" supercomputers sure beat the heck out of the previous generation of cheap scientific compute cluster: a hundred PS3s running linux.

      --
      I hereby place the above post in the public domain.
  6. Swordfish by Globally+Mobile · · Score: 1

    Why does the computer from Swordfish?

    Get Animated
    *Drools*

    1. Re:Swordfish by Anonymous Coward · · Score: 0

      was there supposed to be a question here?

    2. Re:Swordfish by Globally+Mobile · · Score: 1

      was there supposed to be a question here?

      oh wow. Yeah, there was. And it had to do with this new computer. And I failed. Epic. Brain. Fart.

      What I was trying to ask was a bit of a wandering thought, so I don't even know if it needs to be repeated. But I am always up for answering questions I know the answer to, so here goes... I wanted to say... Why dos this piece of tech-hardware remind me of the computer/monitor setup in that cheesy (though who doesn't love cheese cept those that are lactose intolerant, and maybe a few anti-cheeseheads... [Though I doubt they exsist])yet fun/silly holowood flick Swordfish. (What with it's multi-GPU's, I am assuming (another good way to make an arse out myself, woot) it can run that many screens, and pump-crunch through the needs of the program in record time.

      Also just was a bit in awe of the tech-specs I saw, and was drooling, so I had to let it out, somewhere.
      Wow that was a rambling... *crawls back under his warm lizard-style hovel, and hides*

  7. GPU accuracy by tbischel · · Score: 1

    It used to be that GPUs would sacrifice accuracy for speed in floating point calculations, making them unsuitable for scientific computing. Is this still the case?

    1. Re:GPU accuracy by kpesler · · Score: 5, Informative

      Presently the G200 GPUs in this machine support double-precision, but at about 1/8 the peak rate of single-precision. In practice, since most codes tend to be bandwidth limited, and pointer arithmetic is the same for single and double precision, double-precision performance is usually closer to 1/2 that of single-precision performance, but not always. With the Fermi GPUs to be released early next year, double-precision peak FLOPS will be 1/2 of single-precision peak, just like on present X86 processors. Also note that many scientific research groups, such as my own, have found that contrary to dogma, single-precision is good enough for most of the computation, and that a judicious mix of single and double-precision arithmetic gives high-performance with sufficient accuracy. This is true for some, but not all, computational methods.

    2. Re:GPU accuracy by Beardo+the+Bearded · · Score: 3, Interesting

      First, a gaming card is going to get fast firmware. A workstation card is going to get accurate firmware. I imagine that supercomputer cards would get specialized firmware. (I only skimmed the summary.)

      GPUs are excellent at solving certain types of problems and excel at solving matrices. (That's what your video card is doing while it's rendering.) The best part of that is that most, if not all, mathematical problems can be expressed as a matrix, meaning that your super-fast GPU can solve most math problems super-fast.

      Next, GPUs love working together since they don't care about what the OS is doing. All they do is take raw data and respond with an answer. Usually we're putting that answer onto the display, since otherwise wtf are we doing with a GPU? In this case, the results are returned instead of using the flashy display. So what you end up with is a set of really fast, specialized, parallel engines solving broken down matrices.

      They're also not subject to the marketing whims of Moore's Law, so you can often get faster cards sooner than faster CPUs. To break down a supercomputer so that you get this kind of performance for 4000 EURO is a fantastic achievement. It's almost, but not quite, hobby range. (I'd still put money on someone trying to evolve this into a gaming rig...)

      --

      ---
      ECHELON is a government program to find words like bomb, jihad, plutonium, assassinate, and anarchy.
    3. Re:GPU accuracy by hairyfeet · · Score: 2, Interesting

      Question: Since you seem to be pretty knowledgeable on the subject, have you or any of your colleagues used or tried the AMD Stream SDK? Because those ATi 5870s look to be pretty scary as far as raw power, and since the AMD SDK supports OpenCL on both the CPU and GPU, and AMD has opened up their code as well as supporting both Windows and Linux 32/64 bit I was just curious if you or anyone else here has tried it?

      --
      ACs don't waste your time replying, your posts are never seen by me.
    4. Re:GPU accuracy by kpesler · · Score: 2, Interesting

      I have not tried it for two reasons. First, to my knowledge there are no large public machines in the US being planned using AMD GPUs, so there is relatively little incentive to port the code to OpenCL. We run on large clusters and it appears for the moment that NVIDIA has the HPC cluster market tied up. Second, while OpenCL is quite similar to CUDA in many respects, it's also significantly less convenient from a coding perspective. NVIDIA added a few language extensions that makes launching kernels nearly as simple as a function call. As a pure C library, OpenCL requires much more setup code for each kernel invocation. If there was a strong incentive, such as the construction of a large NSF or DOE machine with AMD GPUs, I'd probably port it anyway, but without such a machine, it's not worth the time and effort. It's important to note that on GPUs, peak performance data often doesn't translate into actual performance numbers. The 4870 had a higher peak floating point rate than the G200, but in graphics and some other benchmarks, the G200 usually came out ahead. I don't know if this will also be the case with Fermi vs. 5870's. Finally, another large consideration is that AMD is pretty far behind on the software end. Besides mature compilers for both CUDA and OpenCL, NVIDIA provides profilers and debuggers that can debug GPU execution in hardware, and there is a growing ecosystem of CUDA libraries. For the sake of competition, I hope AMD adoption grows, but I've gotten the impression they are just not investing that much in general-purpose GPU computing.

    5. Re:GPU accuracy by LeadSongDog · · Score: 1

      Sounds great, but where can I buy a matching 4000 Euro MRI machine to go with it? Seriously, spending less than 1% of the system price on processing power for the images is daft, especially when in most cases the bottleneck in getting patients in and out is the limited supply of radiologists. These guys need to think a little more deeply about their architecture.

      --
      Oh, I'm sorry sir, I thought you were referring to me, Mr. Wensleydale.
  8. times less by Tubal-Cain · · Score: 4, Funny

    ...consuming 300 times less power.

    *sigh*

    1. Re:times less by RandomUsr · · Score: 1

      Cost of seeing your Boss' face when he realizes how much you save the company on the new spam platform? Priceless. Oops, that's not a happy face!

    2. Re:times less by CityZen · · Score: 1

      It makes perfect sense, given appropriate units, such as (1/watt)'s. Okay, maybe not.

    3. Re:times less by timeOday · · Score: 4, Insightful

      Can we please just officially define "n times less" as "1/n" and not feel bad about it anymore?

    4. Re:times less by Nalgas+D.+Lemur · · Score: 2, Funny

      ...consuming 300 times less power. *sigh*

      Oops. Sorry. 300 times fewer.

  9. Not sure how fast it is, but I know it is hot... by (H)elix1 · · Score: 2, Interesting

    I've got a pair of 9800gx2 in my rig. The cards turn room temperature air into ~46C air. Without proper ventilation, these things will turn a chassis into an easy bake oven.

    For those not familiar with the 9800gx2 cards, it essentially is two 8800gts video cards linked together to act as a single card - something called SLI on the NVidia side of marketing. SLI typically required a mainboard/chipset that would allow you to plug in two cards and link them together. This model allowed any mainboard to have two 'internal' cards linked together, with the option of linking another 9800gx2 if your board actually supported SLI.

    The pictures did not show any SLI bridge, so it looks like they are just taking advantage of multiple GPUs per card.

  10. Yeah but... by definate · · Score: 0, Redundant

    Can it play Crysis with a high frame rate on maximum?

    --
    This is my footer. There are many like it, but this one is mine.
    1. Re:Yeah but... by Anonymous Coward · · Score: 0

      No. At least until Crysis runs on BeOS they get no gaming. And they are not using SLI at all.

    2. Re:Yeah but... by Anonymous Coward · · Score: 0

      whoosh

    3. Re:Yeah but... by Anonymous Coward · · Score: 0

      Does it run Linux?

  11. Stability problem solved... by gsgriffin · · Score: 2, Funny

    Duh! Look at the number of GPU's...13...try 12 or 14 and your luck will change.

    --
    jsut athnoer menagiensls ltitle psrhae for you to dcoede. Why do we wtsae our tmie dnoig tihs?
    1. Re:Stability problem solved... by selven · · Score: 1

      It's not even that hard. Just number them starting from 0 so the last one is only 12. Then when you add another make it 14. Problem solved.

  12. lol by MacGyver2210 · · Score: 1

    The guy in the video on that page looks exactly like the stereotype of the guy I'd expect to do this sort of thing.

    --
    If the only way you can accept an assertion is by faith, then you are conceding that it can't be taken on its own merits
  13. Silly by jpmorgan · · Score: 1

    This isn't a huge achievement. Nobody else has done it because it's silly.

    There are two major reasons... the first is they use GeForce cards. That's not a good idea, since GeForces are held to much lower quality standards than Teslas and Quadros. They're intended for gaming graphics, where a minor error here or there isn't the end of the world. "Sorry we missed your cancer, since our supercomputer miscalculated that region of the reconstruction." The second problem is, that's one bandwidth starved machine. It's based on a pretty nice motherboard, but with 13 GPUs that's not a lot of bandwidth to go around.

    The more popular layout for a GPU supercomputer of that size is a small cluster of 2-GPU blades, with a hypertransport interconnect. It's a little bit trickier to work with, but there are fewer bottlenecks.

    1. Re:Silly by modemboy · · Score: 2, Informative

      The difference between GeForce and Quadro cards is almost always completely driver based, it is the exact same hw, different sw.
      This basically a roll your own Tesla, and considering the Teslas connect to the host system via an 8x or 16x PCI-e add in card, I'm gonna say you are wrong when it comes to the bandwidth issue as well...

    2. Re:Silly by jpmorgan · · Score: 2, Informative

      The hardware is the same, but the quality control is different. Teslas and Quadros are held to rigorous standards. GeForces have an acceptable error rate. That's fine for gaming, but falls flat in scientific computing.

    3. Re:Silly by CityZen · · Score: 2, Insightful

      It's not silly: (1) this is a research project, not production medical equipment, meaning that the funds to buy Tesla cards were probably not available, and they aren't particularly worried about occasional bit errors. (2) Their particular application doesn't need much inter-GPU communication, if any, so that bandwidth is not an issue. They just need for each GPU to load datasets, chew on them, and spit out the results.

      How much does your proposed GPU supercomputer cost for 13 GPUs?

    4. Re:Silly by DeKO · · Score: 1

      Uh... no, you are wrong. Quadros and GeForces have a lot of differences in the internal hardware. Just because they "do the same thing" (they draw triangles really, really fast) it doesn't mean they are the same. GeForces, for example, don't have optimizations for drawing points and lines, nor assume you are abusing of obsolete APIs, like immediate mode drawing; both are common in CAD applications, and almost useless in games.

    5. Re:Silly by jpmorgan · · Score: 1

      No, the chips are almost exactly the same (except Quadros have 100% unbroken chips). You're thinking driver differences.

    6. Re:Silly by Khyber · · Score: 1

      There is NO difference between Quadro and GeForce besides Geforce basically being a laser-locked defective quadro with a different firmware.

      In fact, you can flash most GeForce cards with the equivalent Quadro firmware and in some applications (not gaming) get better performance.

      Been tooling around with nVidia cards since NV4. They've pretty much used this same strategy for the past decade+.

      --
      Still waiting on Serviscope_minor to wake up to fucking reality and realize that Jessica Price isn't going to fuck him.
    7. Re:Silly by cheier · · Score: 1

      This has been the argument that I've seen to justify getting a GeForce over a Quadro in CGI. A few points:

      1) The memory system on the Tesla/Quadro is much more rigorously tested and held to a much higher standard of quality than the GeForce. There is plenty of research evidence to prove this, and I have had plenty of anecdotal evidence to prove this point as well. NVIDIA doesn't give a crap about the memory in a GeForce because a miscolored pixel in 1/60th of a second doesn't matter. A soft/hard error in GPU memory for scientific calculations can be catastrophic. This is also a reason that NVIDIA is applying ECC memory to the Tesla C2050 and C2070 GPUs.

      2) Some GeForce GPUs will have major threading errors after a few minutes of hard running. I've experienced this with a dgemm torture test with a Tesla and 2 GeForce GTX 285 GPUs in a single system. Give the test about 5 minutes on all 3 GPUs simultaneously and the GeForce GPUs will crash out at nearly the same time. The Tesla will continue the test until completion (which can be about a day or so)

      3) Bandwidth starvation is a term to indicate that the cards are getting less bandwidth than they should be getting. On this FASTRA machine, only a few slots are full x16 Gen2, which end up being shared across 2 GPUs, making it effectively x8 Gen2 to each GPU. For other slots, it is even worse when you have a x8 Gen2 link going in. That has to be shared between a pair of GPUs. Technically, you can run the Tesla GPU in a x1 Gen1 slot if you had the right adapter, but the time it will take to transfer data from host memory to GPU memory may end up negating any performance benefits you might see out of using the GPU, unless you are using very heavy computational algorithms that are almost completely compute bound.

      I couple years ago, I had a compute rig using 6 Tesla C870 GPUs, and even that setup was starting to get bandwidth starved as all GPUs were using a single x8 Gen1 link being aggregated to 6 x4 Gen1 slots (using adapters). I had to up the output data frame size on an MD simulation in order to have all cards performing equally. With smaller frame sizes, the first 4 GPUs were finishing their computations before the last 2 GPUs got their data.

    8. Re:Silly by drinkypoo · · Score: 1

      There are two major reasons... the first is they use GeForce cards. That's not a good idea, since GeForces are held to much lower quality standards than Teslas and Quadros.

      Tell that to the Quadro FX1500M that was in my HP/Compaq "professional workstation" laptop, that had a well-known die bonding problem that caused overheat failures across an entire production line. Neither HP nor nVidia recalled the defective parts and I ended up spending literally days on the phone with HP support before they sent me a new laptop. Higher quality, my ass. Quadro chips are marked differently, period the end.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  14. Can't be to impressed: Folding@home guys did more. by blind+biker · · Score: 1

    Folding@home enthusiasts and academic contributors did more than that, and a long time ago, too. Just check this thread at foldingforums for one example.

    --
    "The agriculture ministry is not in charge of Gundam" - Japanese ministry official.
  15. double precession needed for matrix by peter303 · · Score: 1

    Carefule about equating tesellation processing with matices. Many matrice operations have N^3 or higher operations. And they may be close to singular (ill-conditioned). Single point precession is poor for both.

  16. Re:Not sure how fast it is, but I know it is hot.. by The+Archon+V2.0 · · Score: 1

    The pictures did not show any SLI bridge, so it looks like they are just taking advantage of multiple GPUs per card.

    There's no seven-way SLI anyway. Since the GPUs are being used for processing and not graphics, there's no need for them to work together via SLI or Crossfire or what have you as long as the OS and programs treat 'em like any other multiprocessor setup.

  17. Naming Scheme by lymond01 · · Score: 1

    Wouldn't it be nice if the FASTRA II, which is 3.75 times faster than the FASTRA I, was actually called the FASTRA 375. Then I wouldn't have to ask.

    1. Re:Naming Scheme by slew · · Score: 1

      If it's really 3.75 times faster maybe they could call it the FASTRA System 360 Model 96 (or the Fastra 360/96) for short ;^)

  18. That's why I have a problem with the comparisons by Sycraft-fu · · Score: 3, Informative

    Because it only applies to the kind of problems that CUDA is good at solving. Now while there are plenty of those, there are plenty that it isn't good for. Take a problem that is all 64-bit integer math and has a branch every couple hundred instructions and GPUs will do for crap on it. However a supercomputer with general purpose CPUs will do as well on it as basically anything else.

    That's why I find these comparisons stupid. "Oh this is so much faster than our supercomputer!" No it isn't. It is so much faster for some things. Now if you are doing those things wonderful, please use GPUs. However don't then try to pretend you have a "supercomputer in a desktop." You don't. You have a specialized computer with a bunch of single precision stream processors. That's great so long as your problem is 32-bit fp, highly parallel, doesn't branch much, and fits within the memory on a GPU. However not all problems are hence they are NOT a general replacement for a supercomputer.

  19. Re:Not sure how fast it is, but I know it is hot.. by thedarknite · · Score: 2, Funny

    I've got a pair of 9800gx2 in my rig. The cards turn room temperature air into ~46C air. Without proper ventilation, these things will turn a chassis into an easy bake oven.

    That's a brilliant idea, now people can make snacks without ever leaving the computer.

    --
    A game has objectives and is competitive, anything else is just play
  20. What the hell is up with the clothing? by tyrione · · Score: 1

    The Brady Bunch called they want their set clothes back!

  21. Re:Can't be to impressed: Folding@home guys did mo by CityZen · · Score: 1

    Did more what, exactly? None of the Folding setups listed have more than 4 GPU cards per motherboard.

  22. but does it run.. by Anonymous Coward · · Score: 0

    ..hyper linux

  23. Generic statements FAIL! by Hurricane78 · · Score: 1

    it was slightly faster than the university's 512-core supercomputer and cost less than 4000EUR.

    but tests reveal the 13 GPUs deliver 3.75x more performance than the old system.

    It is impossible, to make such general statements about the performance, for something that is still very much specialized on long pipelines and streams of repetitive data (vector processing).

    They may be much faster for tasks that fit that scheme. But slower for those that don’t.

    --
    Any sufficiently advanced intelligence is indistinguishable from stupidity.
    1. Re:Generic statements FAIL! by ceoyoyo · · Score: 1

      The performance of a standard cluster, or even a SIMD machine will vary tremendously depending on your application as well. The only reasonable way is to pick a problem and compare performance on that problem.

      They just forgot a phrase at the end of that statement: "it was slightly faster than the university's 512-core supercomputer... in this application."

  24. Why it's 13, not 14 GPUs by CityZen · · Score: 2, Interesting

    Apparently, the regular BIOS can't boot with more than 5? graphics cards installed due to the amount of resources (memory & I/O space) that each one requires. So the researchers asked ASUS to make a special BIOS for them which doesn't set up the graphics card resources. However, the BIOS still needs to initialize at least one video card, so they agreed that the boot video card would be the one with only a single GPU. Presumably, they could have also chosen a dual GPU card that happened to be different from the others in some way.

  25. Re:That's why I have a problem with the comparison by jstults · · Score: 1

    Take a problem that is all 64-bit integer math and has a branch every couple hundred instructions and GPUs will do for crap on it.

    So would a Cray; supercomputers and GPUs are made for the same sorts of problems (exploiting data parallelism). Now if by 'supercomputer' you mean 'a cluster of commodity hardware', then ok, you've got a point, that heap of cpus will handle branches plenty fast.

  26. Re:That's why I have a problem with the comparison by wagnerrp · · Score: 1

    Except that a 'supercomputers' and a 'cluster of commodity hardware' are effectively synonymous these days. They all use the same Power/Xeon/Opteron/Itanium chips, with several cores and a several GB of memory to a compute node. The only real difference left is the interconnect. Commercially built systems tend to have far beefier and more complex interconnects. Homebrew systems more often than not just use gigabit ethernet, with the larger ones rarely using anything better than a 'fat tree' with channel bonding or 10gbps ethernet.

  27. Re:Not sure how fast it is, but I know it is hot.. by (H)elix1 · · Score: 1

    I did not try baking anything, but it did turn the top of the computer into a nice coffee cup warmer.

  28. humor on /. by snooo53 · · Score: 1

    I've found that lately on Slashdot, I agree with them that highly moderated humorous posts seem to far outnumber the interesting ones. I've actually ratcheted down all funny comments to -4 or -5, and browse at 2, to catch the more interesting discussions which get passed over. But I've never seen any reason to moderate them down now that we have control when logged in... I dunno, maybe others think that people who come here looking for facetious comments should have to browse at funny +5 instead of us sourpusses :)

    --
    The sending of this message pretty much inconveniences everyone involved.
  29. Re:That's why I have a problem with the comparison by Retric · · Score: 1

    There are also a fair number of Cell based supercomputers and even one hybrid out there. And even some pure custom solutions used by the NSA. (There is a reason they have their own chip fab.) And, if you include folding at home type applications, then GPU's represent a reasonable percentage of the worlds supper computing infrastructure.

  30. Re:Not sure how fast it is, but I know it is hot.. by Anonymous Coward · · Score: 0

    Not only is there no seven-way SLI, it tends to work poorly with CUDA applications no matter what sort of SLI you're using. Before running any BOINC Cuda apps, SLI needs to be disable or the app only sees "1" gpu.

  31. Cramped cases... by Dusthead+Jr. · · Score: 1

    Maybe there's a really good reason for it that I'm not fully aware of, but why are PC cases, motherboards, add-on cards etc. all seen to be designed around such limited amounts of space? Is there a such thing a s PC case that size of a mini-fridge or bigger? A motherboard with freaking 10 or 12 slots with enough space between them? A video card the size of a motherboard? Anything but a cramped little box with limited expansion? Is that such a bizarre thing to want?

    1. Re:Cramped cases... by CityZen · · Score: 1

      It's known as "market forces". In case you haven't noticed, the computing needs of most people can be crammed into something the size of a paperback book or so. Larger computing devices are available, but the bigger you go, the smaller the market, and thus the larger the price. If you want something big, you might take look at a computer named "Jaguar". It has a big price, too.

      As far as personal computers go, they tend to be designed around CPU strengths & limitations. Intel and AMD have figured out that the most efficient way to increase computing power is to put more and more processing power into a single chip, and have systems designed around a single CPU chip, as opposed to systems that put multiple CPU chips on the motherboard. Because of this approach, it became unnecessary to build systems larger than your typical ATX desktop.

      If you needed more computing power than that, your best bet was to get multiple machines. Indeed, you can fill refrigerator-sized racks full of ATX (or other form factor) motherboards. For instance, check out: http://www.cse.illinois.edu/turing/Images/FrontView.html

      Only recently have GPUs become recognized as an efficient way of adding lots of computing power to a desktop machine. As evidenced by the motherboard that made Fastra II possible, hardware is slowly becoming available to embrace this new computing paradigm. Perhaps in a few years, you'll get your 12-double-wide-slot motherboard and you'll be able to populate it with GeForce 28000 boards. But more than likely, it still won't be cheap, since few people seem to need this kind of performance.

    2. Re:Cramped cases... by CityZen · · Score: 1

      Oh, and by the way, I'm wondering quite the opposite: why do we still see so many over-sized full ATX size cases being offered, when microATX motherboards have everything we (most of us) need? Indeed, even mini-ITX motherboards are often adequate for so many needs, and yet mini-ITX cases still seem to command a premium because they are relatively rare. It's easy (and boring) to design a big rectangular ATX box. It's an engineering challenge to make a good-looking small box that does everything you need and is still practical to work with.

    3. Re:Cramped cases... by CityZen · · Score: 1

      Oh, and here's your mini-fridge size case with 10 slots:

      http://www.mountainmods.com/computer-cases-c-21.html

  32. Re:Not sure how fast it is, but I know it is hot.. by Anonymous Coward · · Score: 0

    I don't think anyone on /. actually "leaves" their computer...

  33. Too right it's redundant by BertieBaggio · · Score: 1

    It's redundant because some smartass mentions Crysis in response to *every fucking article* about someone doing something using powerful GPUs*.

    Of course, if it was about CPUs, the post would be about what will be needed to run Windows 8, or 'finally meeting the minimum system requirements for Vista'.

    Mostly, you can predict these posts from the title of the article. Doesn't stop crotchety people people like me coming to complain about it though...

    * Footnote: When someone equally-crotchety complained about this before, a poster made the good point that Crysis draws this derision as it *still* taxes high-end systems. Myabe it's because CryEngine2 is bloated and ineffecient, maybe it's because it tries to do too much. All I know is we keep getting these inane posts.

    --
    If all you have is a grenade, pretty soon every problem looks like a foxhole -- MightyYar
    1. Re:Too right it's redundant by BertieBaggio · · Score: 1

      Sorry to reply to myself, but I've just noticed two comments:

      mgblast's, which says the same thing as mine more succinctly and bluntly. Embarrassingly, it was in the same god-damn thread.

      Further down, we have this comment by RandomUsr, who actually does mention Vista. Woo! In fact, he (and the person that responded to him) also mentions anitvirus software. Never mind that this is a GPGPU system, just post crap about *something* bloated and wait for the '+1 Funny' mods to roll in.

      Gods, reading these two posts made me realise that I need to stop reading and posting to Slashdot when it's late and I'm in a bad mood and feeling misanthropic. *grumbles*

      --
      If all you have is a grenade, pretty soon every problem looks like a foxhole -- MightyYar
    2. Re:Too right it's redundant by enderjsv · · Score: 1

      Gods, reading these two posts made me realise that I need to stop reading and posting to Slashdot when it's late and I'm in a bad mood and feeling misanthropic. *grumbles*

      Ya think? Honestly I'm not on Slashdot very often. Maybe it is an overused joke, but I don't post a lot in hardware related forums so how would I know. Check my history if you don't believe me. If you think it's overused, not funny or redundant, do what I do when I come across such posts. Roll your eyes, then move onto the next post. Don't get your panties in a twist over it. I'm sorry if not every post on Slashdot conforms to your standards, but if you're that worried about it, go start your own forum. Your tagline could be "News for people who take themselves way too seriously." I thought it was funny. I guess so did a few others. Relax. Some guy you don't know getting modded up for an overused joke REALLY isn't going to affect your life.

    3. Re:Too right it's redundant by BertieBaggio · · Score: 1

      Whoa, whoa. Chill, we're all friends. Don't take it personally, my comments weren't directed solely at you. It isn't a big deal (frankly, are any posts on Slashdot a 'big deal'?), so there's no need to make a mountain out of it. I don't expect it's you that malevolently thinks "Aha! Another hardware article! Just what I need to get a sardonic, lambasting response from BertieBaggio." Equally, I don't look for these jokes just so I can grumble about how people always post them. Generally, the only systematic comments I make are to gently remind people not to respond to trolls in any way or form, since any response (positive or negative) just encourages them. Most of the time when I read a comment or a thread I could have just as easily had in my head, I just sigh and keep scrolling for the good stuff. Every once in a while to maintain my sanity I write a post like the one or two I did. It's not to be taken 100% seriously.

      Besides, I once did create a forum, just as you suggest; but it turned out I forgot about the forum and concentrated on the blackjack and hookers I had also brought along*.

      (tl;dr version of the rest: Slashdot needs the xkcd ROBOT9000 bot that tempbans people who say something that has already been said. Except we'd need it to tempban anyone tthat brings up a joke/meme/concept/argument that's been done to death)

      Anyway, in the spirit of fairness and explanation for my diatribe... I guess there is bigger rant that I didn't post because in the first place it would have been seriously OT, and in the second it was 5 AM. But fuck it, this time it's 6:15 AM so we're okay. It goes like this: Slashdot is getting more predictable. Yes, yes, it's always been that way with memes, old jokes and older trolls and so forth; but it's getting to the stage where you can guess what comments are going to appear on an article with reasonable accuracy. Would that this great power of mine extended to lottery numbers. Ah well, can't have everything.

      Like I say, with tech spec articles we get Vista or Crysis brought up, with anything Apple related, we get price brought up. With anything in the politics section, it's how the Dems/GOP/megacorps/cops/'stupid sheeple' are killing the country. Oh, and at last one 1984 reference. With anything market related, it's how there is too much regulation, or how the free market has "failed". Anything to do with CS, or education will have one comment/thread about how the degrees are worthless and that you can be a great programmer without them. Relatedly, there will be another post, thread or reply about how the doctors/lawyers/other professionals are artificially keeping people out of the profession to drive up prices.

      With Microsoft articles we have chairs. With anything GNU/FSF-y we have the "GPL/BSD IS THE ONLY WAY TO LICENSE CODE" crowd. There are also those quoting Linus as gospel, and those saying "well, he's a good engineer, but he doesn't get politics". Similarly, there are those saying how RMS is an ideological nutjob, and others countering with "yes but look at how accurate he's been".

      We have naysayers on 90% of articles. They say "This won't work because of trivial thing X", or "Why bother doing Y?", with optional "FooWidget already solves problem Z, which is nearly the same".

      Any Ask Slashdot has a response asking "Why didn't you just Google X?", or "Slashdot isn't good for legal advice, ask a lawyer". And of course those that remark that anyone who asks a question like the one posed must be an idiot.

      And of course, there are always the ones who complain that Slashdot is going downhill. Oh yeah, and the "I'm-so-clever" meta-posts.

      --

      Like I say, it's no big deal. Like you say, posts really aren't important. Like the FAQ says, Slashdot isn't here for me or for you, it's here for us. Well, for CmdrTaco anyway.

      And Slashdot still has the unique diamond-in-the-rough posts, saying "I came up with a cool impleme

      --
      If all you have is a grenade, pretty soon every problem looks like a foxhole -- MightyYar
  34. Re:That's why I have a problem with the comparison by timeOday · · Score: 4, Insightful

    Take a problem that is all 64-bit integer math and has a branch every couple hundred instructions and GPUs will do for crap on it. However a supercomputer with general purpose CPUs will do as well on it as basically anything else.

    That was always true of supercomputers. In fact the stuff that runs well on CUDA now is almost precisely the same stuff that ran well on Cray vector machines - the classic stereotype of "Supercomputer"! Thus I do not see your point. The best computer for any particular task will always be one specialized for that task, and thus compromised for other tasks.

    BTW, newer GPUs support double precision.

  35. +1 by toby · · Score: 1

    Au contraire, I clicked the article link JUST to find this comment. Thankyou for maintaining a cherished /. tradition!

    --
    you had me at #!
  36. It really runs Linux by Anonymous Coward · · Score: 0

    Nice try at humor. But as almost always with these types of multi-processor machines, it runs Linux.

  37. Re:That's why I have a problem with the comparison by Anonymous Coward · · Score: 1, Insightful

    E X A C T L Y ! ! ! I always read about how fast the Cell Broadband Processor(tm) is and how anyone is a FOOL for not using it. No. They suck hard when it comes to branch prediction. Their memory access is limited to fast, but very small memory. Out of branch execution performance is awful. You have to rewrite code massively to avoid it. For embarassingly parallel problems, they are a dream. For problems not parallel, they are quite slow. An old supercomputer isn't as fast as a new one. If ordinary processors especially multi-core ones had two or four stream processors for every core, parallel operations would be much faster too, the processors themselves would be faster and its likely one of the improvements that are being looked at by Intel and AMD (and others). Something like this would make general purpose processors much more like the Cell Broadband Engine(tm), and would make them somewhat obsolete. Certainly the Cell processor suffers from being able to deal with problems that can only use 256 MB of memory (the cell BE uses proprietary memory, very fast, but only available up to 256 MB, no one else makes this kind of memory, and they don't make chip sizes bigger than what winds up being 256 MB. GPU's are limited by memory size too (although 1GB is bigger than 256 MB), but it still suffers all of the problems of a specialty processor. If you can use it, great. I can't get any performance boost out of them, because my programs have out-of-order branches, and I get better performance from a general purpose CPU.

  38. Re:That's why I have a problem with the comparison by mcrbids · · Score: 1

    That's why I find these comparisons stupid. "Oh this is so much faster than our supercomputer!" No it isn't. It is so much faster for some things. Now if you are doing those things wonderful, please use GPUs. However don't then try to pretend you have a "supercomputer in a desktop." You don't. You have a specialized computer with a bunch of single precision stream processors. That's great so long as your problem is 32-bit fp, highly parallel, doesn't branch much, and fits within the memory on a GPU. However not all problems are hence they are NOT a general replacement for a supercomputer.

    For that matter, which is faster: A two-ton flatbed truck, or a Maserati? Kinda depends on what you are trying to do, doesn't it? Want to move 3,000 pounds of Hay? You probably DON'T want the Maserati!

    And all machines are like this. Some machines are better at some tasks than others. And presumably, the comparison to the University Supercomputer was because of a task that they *needed* to perform, and the pittance cost of the GPGPU-based supercomputer favored very well against the cost of leasing University supercomputer time.

    Even different people are better at some things than others.... Some people are better a maths than others. Some people can take a bit of vinegar and coffee grounds, and make an artistic masterpiece.

    Because I'm a jogger, I can run long distances faster than most people. But I suck at sprints, and I take long showers. I type over 100 WPM.

    See?

    --
    I have no problem with your religion until you decide it's reason to deprive others of the truth.
  39. Re:Not sure how fast it is, but I know it is hot.. by Anonymous Coward · · Score: 0

    I also have two GX2s in a box for use with CUDA programming. (You don't use SLI with CUDA, in fact it's a disadvantage in that if you do you'll only be able to actually use one of the GPUs, so you won't see SLI bridges in a CUDA box.) The power consumption of the 9800gx2s is indeed fearsome even at idle. I measured it, but don't have the numbers on hand. BUT: Newer nvidia cards apparently use *much* less power at idle, and probably less at full blast as well (like a 45nm CPU vs. a 90nm CPU at the same GHz will use less power for the exact same work).

    Since I only need one GX2 to test most programs, I keep the power unplugged to the second one most of the time to keep from wasting so much energy (and producing so much heat).

  40. Re:That's why I have a problem with the comparison by Kjella · · Score: 1

    Sure,,but if you look at it from their perspective - before we needed time on a supercomputer and now we don't. Either you redefine supercomputers to include that or it's another task where we don't need one, even better if you ask me. So it doesn't do everything, well running an embarrassingly parallel problem on a supercomputer would also "terrible" performance now compared to this.

    That's great so long as your problem is 32-bit fp, highly parallel, doesn't branch much, and fits within the memory on a GPU.

    As far as I know the Teslas will be doing double precision, and we certainly could put GPUs on a better backplane for GPU-GPU communication with NUMA. What's left is being highly parallel and doesn't branch much - aren't those two sides of the same coin? - and usually that's about finding a reasonable way to break it down like a finite element model or something, there's many ways you can do that and approach the right result.

    Trying to solve one megastate usually has tons of cache coherency issues to let all CPUs do useful work too. If you have to rely on a single stream of calculations performance will suck one way or the other, so really you only get decent performance if you can divide it into blocks of work and yet each block is branching enough that a supercomputer is better than a GPU. So it won't be a general purpose supercomputer true but these aren't computers used for running a million different desktop apps. There's some highly specialized simulations you run, if you can do better on highly specialized hardware that's what will happen.

    --
    Live today, because you never know what tomorrow brings
  41. Re:That's why I have a problem with the comparison by wagnerrp · · Score: 1

    Aside from a few homebrew PS3 clusters, I don't know of any large scale Cell installations. The Roadrunner is a fairly standard (if very large) Opteron based cluster, with PowerXCell co-processors. The latest Cray XT5 is is a fairly standard (if very large) Opteron based cluster, with PowerXCell or FPGA co-processors.

    The NSAs ASIC systems don't count, by definition, they are not general purpose. A modern 3GHz quad-core processor will manage an exhaustive DES search in about 600 years. Deep crack in 1998 could manage that feat in 9 days, but you wouldn't consider it comparable with a 25k chip cluster.

    Similarly the @Home applications don't count because the interconnect bandwidth and latency is so abysmally low. It holds a huge amount of power (on the order of several PF), but it cannot be used for anything but relatively small Monte Carlo simulations.

  42. Re:That's why I have a problem with the comparison by Anonymous Coward · · Score: 0

    I think the TESLA GPUs actually go up to 4GB, and that is per GPU. However for NVidia at least 4GB unfortunately is an architectural limit. (and due to "bad" memory management you won't be able to use nearly all of those 4GB, particularly for long-running stuff with many allocations/deallocations).

  43. Re:Can't be to impressed: Folding@home guys did mo by blind+biker · · Score: 1

    They have more powerful GPUs, and have had them since a long time.

    --
    "The agriculture ministry is not in charge of Gundam" - Japanese ministry official.
  44. Re:That's why I have a problem with the comparison by Anonymous Coward · · Score: 0

    Well isn't a super computer specially build for one task in the first run?

  45. I wants? by ewersj · · Score: 1

    My lab will soon be building a computer or cluster for bioinformatics. Would something like this be appropriate / scaleable for gene microarray analysis, clustering algorithm tasks, etc? We need the capability to work with datasets in the 400 GB range, and with many permutations, but the specific datapoints are not large. Any suggestions or input would be much appreciated...

  46. Lets See by Zygamorph · · Score: 1

    Mobo Manufacturer

    Lets see, I can help these guys develop a new use for my line of wonky mobo's, get favourable mention all over the world on places like slashdot and reap the benefit of every geek with excess cash and a yen for a super computer or I can stand back and maybe they find someone else who has a "better" board or they develop their own.

    Hmm lets think on this one

  47. Would be nice if it was finished by kegon · · Score: 1

    Would be nice if it actually worked. It's not much good having the fastest desktop computer in the world if it isn't stable. Or are they using the Dilbert definition of a PC upgrade ?

    Next time, make the fancy video when it's finished guys.

  48. My Karma is too bad by NaughtyNimitz · · Score: 0

    What the fuck is WRONG with you guys at Slashdot. I posted this on Tuesday and my submission get's turned down. Is it because i am not Anglosaxon? Whose **** do i need to suck?

  49. Re:My Karma is too bad and you're wrong by NaughtyNimitz · · Score: 0

    It was posted on Monday, not Tuesday

  50. Re:That's why I have a problem with the comparison by cheier · · Score: 1

    It is kind of unfair to generalize commercial clusters vs homebrew in that manner. Many institutions that purchase commercial clusters from HP/Dell/SGI/etc opt out of the use of InfiniBand or 10GbE. The logic behind it is when the vender says that for $100,000 they can upgrade to IB, the purchaser goes back and says, "For $100,000, I'll just get more cluster nodes instead." This is probably a big reason that gigabit takes up 52% of the Top500 list of supercomputers.

  51. Re:Can't be to impressed: Folding@home guys did mo by CityZen · · Score: 1

    The main accomplishment of Fastra II is that they put 7 GPU cards into the machine. Fastra I already had 4 GPU cards, which is what the FOH machines top out at. The GPU cards are the same cards all around (Nvidia GTX 295). The linked article & video point out the difficulties they faced & overcame in trying to make 7 GPU cards work simultaneously in the same box.

  52. Re:Can't be to impressed: Folding@home guys did mo by blind+biker · · Score: 1

    OK then. I'm raising an eyebrow in somewhat heightened interest.

    --
    "The agriculture ministry is not in charge of Gundam" - Japanese ministry official.
  53. Erm.... They did. by itomato · · Score: 1

    They had support from the mainboard manufacturer.

    Read the fucking article.