Impressive GPU Numbers From Folding@Home

← Back to Stories (view on slashdot.org)

Impressive GPU Numbers From Folding@Home

Posted by ryuzaki0 on Friday October 13, 2006 @10:20AM from the that's-fast dept.

ludd1t3 writes, "The Folding@Home project has put forth some impressive performance numbers with the GPU client that's designed to work with the ATI X1900. According to the client statistics, there are 448 registered GPUs that produce 29 TFLOPS. Those 448 GPUs outperform the combined 25,050 CPUs registered by the Linux and Mac OS clients. Ouch! Are ASICs really that much better than general-purpose circuits? If so, does that mean that IBM was right all along with their AS/400, iSeries product which makes heavy use of ASICs?"

15 of 201 comments (clear)

Min score:

Reason:

Sort:

Are ASICs really that much better? by Anonymous Coward · 2006-10-13 10:23 · Score: 5, Funny

Are ASICs really that much better than general-purpose circuits?

Generally ASICs are much better than general-purpose circuits except in general cases.
1. Re: Are ASICs really that much better? by doublebackslash · 2006-10-13 13:11 · Score: 4, Interesting
  
  Partially true. The GPUs of today now have some general purpose circuits, but they are far from optimized and the execution unit count is skewed to the point that these processors would never, ever be able to run, say, an OS with anything approaching efficiency. FAH benefits from the insane amount of Floating point power because FAH is nothing but a pure FP stress test. They had to heavily modify the code to run on these babies, basically tuning the problems into vector information and letting the GPU do its thing, throwing. Only a few areas involve a need for CPU style processor, which is functionality provided only on these new cards. So please, please realize that even though these cards do not a contain a "protein folding circuit", they did modify the program to run on what it does have: 4x4 matrix operation units for multiplication and addiction.
  
  --
  md5sum /boot/vmlinuz
  d41d8cd98f00b204e9800998ecf8427e /boot/vmlinuz
Re:So obvious... by Iron+Condor · 2006-10-13 10:24 · Score: 5, Funny

ASCII silly question, get a silly ANSI.

--
We're all born with nothing.
If you die in debt, you're ahead.
Distributed amongst home users by Skevin · 2006-10-13 10:24 · Score: 4, Funny

So, will someone please create a really pretty 3D screensaver representing the folding calculation process? I'd love to see a represention with hi-res lighting and texturing, full transforms, and user-scalable views at 400 million triangles/sec.. Thanks.

Solomon

--
"Twice half-assed makes an ass whole." --Solomon K. Chang
1. Re:Distributed amongst home users by Enderandrew · 2006-10-13 10:30 · Score: 4, Interesting
  
  The folding team has done this, and it will be a free download for the PS3 version. The Cell processor runs the Folding application itself, and the graphical representation of the protein folding calculations will be handled by the GPU with a pretty display.
  
  --
  http://blindscribblings.com - Tasty pop-culture in conceptual fashion.
GPUs are Specialized Parallel Computers by ThinkFr33ly · 2006-10-13 10:34 · Score: 4, Insightful

GPUs are, for the most part, highly specialized parallel computers. Virtually all modern CPUs are serial computers. They do essentially one thing at a time. Because of this, most modern programming languages are taylored to this serial processing.

Making a general purpose parallel computer is very, very hard. It just so happens that you can use things like shaders for more than just graphics processing, and so via OpenGL and DirectX you can make GPUs do some nifty things.

In theory, and indeed often in practice, parallel computers are much, much faster than their serial counterparts. Hence the reason a GPU that costs $200 can render incredible 3D scenes that a $1000 CPU wouldn't have a prayer trying to render.
1. Re:GPUs are Specialized Parallel Computers by dslauson · 2006-10-13 10:56 · Score: 4, Informative
  
  Yes. That's basically right.
  
  Here's a Wikipedia article on general purpose GPU processing.
  
  Folding is what's know as a rediculously parallel problem. That is, it can be broken up in to small subproblems that can be distributed among many processors with a minimal amount of communication among processors. It also benefits from not requiring a lot of branching (if/switch statements and such), which GPUs generally do not handle well.
  
  Many problems, (I'd argue MOST problems) do not cater well to these kinds of restrictions. So, while a GPU is well suited to crunching away on pieces of the folding problem, it's going to be lousy at doing the day-to-day stuff you do with your computer.
2. Re:GPUs are Specialized Parallel Computers by Goner · 2006-10-13 13:20 · Score: 4, Informative
  
  The technical term (jargon) is embarrassingly parallel.
Re:This is the perfect time... by Murphy+Murph · 2006-10-13 10:56 · Score: 4, Insightful

I actually installed boinc with seti on several of my machines last night and it worked quite well to heat part of the house (us Canadians need to turn the heater on earlier). Took a bit of time to get started, but it was nice and toasty in the morning. Does anyone know if this method is less efficient in generating heat than using a apace heater? Slower perhaps..

Using your CPU as a space heater is not a bad idea. It is 100% efficient. Every watt it consumes gets turned into heat. Before someone says "but the cooling fans are wasteful" let me remind you that the air moved by those cooling fans will eventually come to a stop (inside your house) as a result of friction, releasing its energy as heat in the process.

Depending on what type of space heater you use, and the construction of your house, your computer can be more efficient than many other electric space heaters. Since none of the energy "consumed" by your CPU/GPU is converted to visible light, none of it has the opportunity to leave your house through your window panes (assuming you have IR reflective glass). Contrast this to quartz and halogen space heaters which produce a fair amount of visible light.

In much the same way, incandescent bulbs match the efficiency of compact fluorescents during the winter months. Every watt "wasted" as heat during the summer is now performing useful work heating your house. (Before someone says "you called a quartz/halogen space heater inefficient because of its waste light, and now an incandescent efficient because of its waste heat!' let me say that the space heater's light is not useful light, while the bulb's heat is useful heat (during the cool months.))

--
I dub thee... Sir Phobos, Knight of Mars, Beater of Ass.
Re:Lopsided Alright.. by throx · 2006-10-13 11:01 · Score: 4, Informative

It has nothing to do with memory bandwidth or use. The ASIC is about 1000 times faster than the CPU because it is using dedicated hardware designed to run very fast and parallel in 3D image processing, which is almost exactly the same problem as folding protiens.

Unless you are saying all CPUs are pegged at 99.9% use, or the GPU has memory three orders of magnitude faster then you're just looking at a effects that make a few percent difference here and there. The simple fact is the GPU is insanely faster at solving specific problems (3D processing) while it simply cannot ever run an operating system.

--

Fear: When you see B8 00 4C CD 21 and know what it means
Move the vector processor on-board? by Zygfryd · 2006-10-13 11:33 · Score: 4, Interesting

So when are we going to see (x86/64) motherboards with a socket for a standard processor and a socket for a vector processor?
Couldn't we finally have graphics cards that only give output to the screen and separate vector processors with a standardized interface / instruction set?
Remember: 1 GPU has more than one processor. by nick_davison · 2006-10-13 11:42 · Score: 4, Informative

X1900 - 48 pixel shader processors plus 8 vertex shaders. Assuming you manage to run them all equally in parallel: 56 processors.

Standard CPU - 1 core (assuming dual cores get read as 2 CPUs).

448 GPUs x 56 = 25,088 effective processors all with on card memory.

25,050 CPUs x 1 core = 25,050 effective processors all dealing with system busses etc.

In short, if you're performing one simple task trillions of times, many very simple, highly optimized processors with dedicated memory do the job better than even a similar number of much more capable processors that have to play nice across a whole system.

And this ignores the number of old couple of hundred megahertz systems that people don't use anymore so hand over to the task vs. X1900s being the very high end of ATIs most recent line.

For massively parallel tasks like rendering pixels, folding proteins, compressing frames of a movie, etc. I'd absolutely love large quantities of a simple processor. For most other tasks, given present technology, I'd still side with fewer more able processors. Either way comparing 448 of something with 56 processors within it to 25,000 single processors and saying, "But 448 is SO much less than 25,000!" is an unfair comparrison.
Re:So obvious... by Jesus_666 · 2006-10-13 12:04 · Score: 4, Funny

UTF are you talking about? I'm quite sure the mods are not latin-1 post like this go unmoderated.

--
USE HOT GRITS WITH STATUE OF NATALIE PORTMAN (NAKED AND PETRIFIED)
Re:Not a mystery by tkittel · 2006-10-13 12:56 · Score: 4, Insightful

Your logic is fine, but you are overestimating the effect you mention if you really think that it "solves the mystery".

500 users out of 25000 means that you have at most taken the 2 percent highest performers out of the CPU pool. If we assume that those 2 percent have computers that are 5 times as powerful as the average computer, then we have lowered the average performance of the CPU pool by roughly 9%.

This 9% systematic effect will lower the reported performance superiority of around 5000% of the GPU vs. the CPU to something like 4500%. I.e. it doesnt change the result at all (which seems to be that GPUs kick ass for these applications).
Re:Lopsided Alright.. by Majik+Sheff · 2006-10-13 13:03 · Score: 5, Insightful

Look at the first two letters of the acronym: Application Specific. A screwdriver and a swiss army knife will both turn a screw, but the screwdriver is going to be much more efficient at it. GPUs are finely tuned to rip through massive volumes of floating point vectors and not much else. It just so happens that the folding project also fits this desctiption and as such is an excellent use of an otherwise wasted resource.

--
Women are like electronics: you don't know how damaged they are until you try to turn them on.