How To Build a Homebrew PS3 Cluster Supercomputer
eldavojohn writes "UMass Dartmouth Physics Professor Gaurav Khanna and UMass Dartmouth Principal Investigator Chris Poulin have created a step-by-step guide designed to show you how to build your own supercomputer for about $4,000. They are also hoping that by publishing this guide they will bring about a new kind of software development targeting this architecture & grid (I know a few failed NLP projects of my own that could use some new hardware). If this catches on for research institutions it may increase Sony's sales, but they might not be seeing the corresponding sale of games spike (where they make the most profit)."
Why would you want to use PS3s for a homebrew supercomputing cluster if it means you have to write and optimize code for the SPEs to get benefit out of it? The PS3's linux environment doesn't let you utilize the GPU or all of the built-in SPEs and it doesn't have a lot of RAM available either. It seems like it would be cheaper to build a cluster out of commodity PC parts, and maybe use GPUs+CUDA to get more muscle without having to completely hand-roll your own accelerated computation code (since CUDA is roughly C). I can't imagine that the PS3 would end up cheaper for these purposes, considering it includes a Blu-Ray player along with a bunch of other things you're not going to be using.
using namespace slashdot;
troll::post();
Couple issues with this as an alternative to the garden-variety x86 cluster connected with InfiniBand:
Slow network interconnect. For problems that are not trivially parallel, network latency is usually a big deal. Ethernet doesn't cut it.
Lack of RAM. 'Nuff said.
Have to care about Cell and PS3 architecture. The codes ("codes" has a slightly different meaning in the context of supercomputing) have to be modified to take advantage of this very specific architecture. Software always outlives hardware, so in the long run the effort may not be worth it.
That said, it's really cheap. If your application isn't held back too much by these issues then enjoy your insanely cheap cluster!
This isn't really the place to start criticizing grammar and spelling, unless you REALLY want to live a life full of frustration and torment....?
(Though it could be worse, I suppose - you could go to digg etc. intsead...:p)
A) Although the cell is a pain to code for, it is much better than whatever PC you can get for ~$400 which will probably contain a mid-to-low-range dual core x86 CPU, whereas the PS3 gives you a Cell CPU which is much, much, faster than the x86 CPU.
B) PS3s are uniform. Other than HD differences, a PS3 built in 2008 will be the same PS3 built in 2012 (assuming the PS3 lasts that long) this allows for a uniform cluster without worrying about differing parts (for example, the Core i7 built in 2008 will not be the same as the Core i7 built in 2012 and getting a 2008 Core i7 is going to be a pain)
C) PS3s are the new fad. It isn't going to be hard to set up a supercomputer cluster with PS3s compared to using a mismatch of older computers because again, the PS3 is uniform.
Taxation is legalized theft, no more, no less.
I'm not trying to be a smartass, but why did he mention in TFA that his supercomputer cost $4000 if the 8 consoles were "Sony-donated"?
Oh come on, you are being pedantic. Clearly what he meant was "$4000 worth of consoles", never mind that they were donated. $X worth of consoles is a useful number if someone is considering buying PS3s and setting up a supercomputer; it's also a fun number to compare to the cost of renting time on some large supercomputer.
The original Wired article is informative:
http://www.wired.com/techbiz/it/news/2007/10/ps3_supercomputer
He asked for Sony to donate the PS3s because he didn't think the NSF would give him grant money to buy video game systems. Now that he has actually built the supercomputer and it does everything he hoped it would do, perhaps other researchers will be able to justify the money to set up their own clusters (without donations from Sony).
The numbers are a no-brainer: he used to spend $5000 to do a single simulation run using rented supercomputer time. For less than the cost of a single simulation run, you can set up your own supercomputer and make simulation runs whenever you feel like it.
ALso, like the iPod example at the top of the post, most research use of the technology won't come from actual iPods or consoles
Um, he is using actual PS3 consoles to do actual research.
If one wanted to build their own home "super" computer then why not just use CUDA and a few Nvidia cards?
If you think that is a good way to make a super computer, why don't you go ahead and do it, and make a web site explaining how it is done?
Meanwhile, he thought he had a good way to go with the PS3, and it did in fact work as he expected, so what's the problem?
Anyway, here's why he thought it was a good idea. From the above linked Wired article:
steveha
lf(1): it's like ls(1) but sorts filenames by extension, tersely
Old CPU's have a much lower MIPS/Watt and a lower MIPS/interconnect so they have a higher cost. Many organizations have found it's cheaper to retire an old supercomputer and add a few nodes to the new one even if it is more capital outlay to get the same performance. Basically a Cell does many times as much useful work than a P4 at a fraction of the power budget.
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
The number of PS3's sold will never be enough to hurt Sony's bottom line, but will boost the image of the console. Having credible scientists call your product a "supercomputer" is worth something. Does Ferrari's Forumla 1 racing team pay for itself? Nah, it's an investment to promote an image.
How is it useless, when the guy who built it, used it already for a month? And it has replaced 200 supercomputer nodes, for his purpose? I'd say that's very fucking useful.
But you know what, maybe you should send him an e-mail and try to convince him how his cluster is useless. Make it a nice, insightful and intelligent e-mail, like your post.
"The agriculture ministry is not in charge of Gundam" - Japanese ministry official.
IBM is probably banking on the existence of people who want Cell processors in systems with more than 256megs of RAM. Other IBM value-adds would presumably include rack mountability, support for netbooting and other convenient management stuff, and so forth.
If your application leans almost entirely on the CPU with very little need for RAM, and you have an army of screwdriver monkeys(or grad students) to do all the legwork, the PS3 is an excellent deal. If you need something with RAM capacity that wasn't a joke in 2001, and/or management features that won't have you tearing your eyes out when you have 10,000 of them, then IBM smells opportunity.
sorry, but that's stupid -how many pop culture references from 1923 are relevant to TODAY's pop culture:
seeya snookums, me and the squeeze are the bees knees in our raccoon coats, we're gonna get jazzed up in our hupmobile on hootch and go check out Mary Astor's horse after we hit the blind pig.
I agree its unfortunate that this stuff is non free, but pre 1923 means that most talkies would be out of bounds as well -including stuff you can see on tv all the time.
I'm just sayin'
Researchers pay not only for the initial capital outlay required to install a supercomputer, but also for its power, cooling, the building it resides in, and its maintenance. This PS3 cluster might be cheap from the researchers' standpoint if they don't pay for any of these things directly, but I imagine their departments won't be real thrilled if a bunch of researchers start building their own individual "cheap" supercomputers! Those issues aside, it sounds like they're doing pretty cool stuff with those machines, so maybe more supercomputers should be cell-based!
http://www.physorg.com/news92674403.html
http://dgl.com/itinfo/2003/it030528.html
http://www.lbl.gov/Science-Articles/Archive/sabl/2006/Jul/06.html
http://folding.stanford.edu/English/FAQ-PS3
see afidel's posting.
basically it comes down to the costs of having your own personal power station in the TCO to run a cluster.
this started (well, really hit it off) a few years back, when the pentium M and centrino tech became widespread. basically, to my knowledge, it was the first time you could actually have more processors with less jiggahertz, that consumed less power in total and still had more flops than the others. it swayed everyone from "more powerful cpus plz" train of thought to the "more cpus, less power-consumption". (also cpus/chips in general will eventually hit an upper barrier, making parallel computing a necessity)
I haven't checked the facts on the ps3, but seeing how much nether-region sucking is going on, ps3s probably fit into this scheme.
It's not about fate, it's about character.
there be no shelter here, the frontline is everywhere!
it's not that simple. sure you can make up for a lack of per-CPU processing power through cluster computing, but at some point it becomes more practical or even cheaper to go with a smaller cluster using a better processor architecture.
you could use hundreds of P3s or even P4s and still not achieve the same real-world performance as a couple dozen cell processors or modern GPGPU stream processors. that's because P3s & P4s are general-purpose CPUs designed for SISD/scalar processing. they're great for the bulk of general-purpose commodity computing applications like running an OS, web browser, word processor, etc., but high-performance computing problems typically involve processing very large data sets that greatly benefit from data parallelism. so if you had two processors, one scalar and one vector, each with the same power consumption and clock rate, the vector processor would be an order of magnitude faster at performing HPC tasks than the processor with the scalar architecture.
and the combined use of parallelization at multiple levels will always be more efficient than relying solely on a single form of parallelism. blindly adding more cheap 32-bit scalar CPUs won't get you as good of results as building a smaller cluster comprised of 64-bit fully-pipelined stream processors with multithreaded superscalar cores that support VLIW. in the former case, you're only employing task-level parallelism, whereas in the later case you're taking advantage of bit-level, instruction-level (pipelining + superscalar + VLIW), data, and task-level (multiprocessing + multithreading) parallelism. you'd not only save power by using fewer (more power-efficient) processors, but you'd also reduce memory coherence & bandwidth problems, not to mention the space savings.
or bank reclaimed assets from a sunken business?
What type of processor do Woolworth's POS tills use?