Slashdot Mirror


Impressive GPU Numbers From Folding@Home

ludd1t3 writes, "The Folding@Home project has put forth some impressive performance numbers with the GPU client that's designed to work with the ATI X1900. According to the client statistics, there are 448 registered GPUs that produce 29 TFLOPS. Those 448 GPUs outperform the combined 25,050 CPUs registered by the Linux and Mac OS clients. Ouch! Are ASICs really that much better than general-purpose circuits? If so, does that mean that IBM was right all along with their AS/400, iSeries product which makes heavy use of ASICs?"

37 of 201 comments (clear)

  1. Lopsided Alright.. by ackthpt · · Score: 2, Interesting

    Those 448 GPUs outperform the combined 282,111 CPUs registered by the Linux and Mac OS clients. Ouch! Are ASICs really that much better than general-purpose circuits? If so, does that mean that IBM was right all along with their AS/400, iSeries product which makes heavy use of ASICs?"

    That's pretty lopsided, but I suppose some of it could be explained away by GPU's not chewing through OS code and having to play nice for memory, so they'd be a bit more efficient. Could be most of those Linux and MacOS systems are long of tooth, but suspect someone's missed a few decimal places somewhere. I do love how quick a theory is posed and the OP starts to run with it. e.g. I look at the balance of my checking account and see there's $1,000 more in there than I expect there to be and immediately form the hypothesis that it's money to spend, without considering whether my rent check has gone through yet. Could be a rough time ahead if I went shopping with it. Either that or the GPU computers are on more than the others.

    Whoops, used an old pentium for the math, never mind.

    --

    A feeling of having made the same mistake before: Deja Foobar
    1. Re:Lopsided Alright.. by blahplusplus · · Score: 3, Insightful

      I have a feeling this is memory bandwidth related, modern GPU's have insane amounts of memory bandwidth compared to the wide range of desktops. Not to mention the parallelism.

    2. Re:Lopsided Alright.. by throx · · Score: 4, Informative

      It has nothing to do with memory bandwidth or use. The ASIC is about 1000 times faster than the CPU because it is using dedicated hardware designed to run very fast and parallel in 3D image processing, which is almost exactly the same problem as folding protiens.

      Unless you are saying all CPUs are pegged at 99.9% use, or the GPU has memory three orders of magnitude faster then you're just looking at a effects that make a few percent difference here and there. The simple fact is the GPU is insanely faster at solving specific problems (3D processing) while it simply cannot ever run an operating system.

      --

      Fear: When you see B8 00 4C CD 21 and know what it means

    3. Re:Lopsided Alright.. by oringo · · Score: 3, Interesting

      You can look at the statistics many ways. Here's the GFLOP/CPU catagorized by OS:

      1. GPU: 65.463
      2. Linux: 1.219
      3. Windows: 0.948
      4. Mac: 0.511

      Of course, GPU beating the hell out of CPU in such tests is no surprise. It's pretty much a massive parallel vector engine. I'm more interested in seeing how PS3 holds agains all other guys when it comes out. They have a folding client for PS3 already.

    4. Re:Lopsided Alright.. by Majik+Sheff · · Score: 5, Insightful

      Look at the first two letters of the acronym: Application Specific. A screwdriver and a swiss army knife will both turn a screw, but the screwdriver is going to be much more efficient at it. GPUs are finely tuned to rip through massive volumes of floating point vectors and not much else. It just so happens that the folding project also fits this desctiption and as such is an excellent use of an otherwise wasted resource.

      --
      Women are like electronics: you don't know how damaged they are until you try to turn them on.
  2. Are ASICs really that much better? by Anonymous Coward · · Score: 5, Funny

    Are ASICs really that much better than general-purpose circuits?

    Generally ASICs are much better than general-purpose circuits except in general cases.

    1. Re: Are ASICs really that much better? by Jeffrey+Baker · · Score: 3, Insightful

      Good one ... but I also wonder why anyone is throwing around the term "ASIC" in this article. A GPU is obviously not an application-specific circuit, which is clearly shown by the fact that it can be programmed to process graphics, or protein folding, or numerous other tasks. A GPU is a general-purpose processor like a CPU, it just happens to have different numbers and kinds of execution units.

    2. Re: Are ASICs really that much better? by doublebackslash · · Score: 4, Interesting

      Partially true. The GPUs of today now have some general purpose circuits, but they are far from optimized and the execution unit count is skewed to the point that these processors would never, ever be able to run, say, an OS with anything approaching efficiency. FAH benefits from the insane amount of Floating point power because FAH is nothing but a pure FP stress test. They had to heavily modify the code to run on these babies, basically tuning the problems into vector information and letting the GPU do its thing, throwing. Only a few areas involve a need for CPU style processor, which is functionality provided only on these new cards. So please, please realize that even though these cards do not a contain a "protein folding circuit", they did modify the program to run on what it does have: 4x4 matrix operation units for multiplication and addiction.

      --
      md5sum /boot/vmlinuz
      d41d8cd98f00b204e9800998ecf8427e /boot/vmlinuz
    3. Re: Are ASICs really that much better? by Jeremi · · Score: 2, Funny
      An ASIC can be orders of magnitude faster for an fft, but you can't write a word processor for it.


      You realize that somebody is bound to write a word processor that runs on a GPU now, just to prove it can be done?

      --


      I don't care if it's 90,000 hectares. That lake was not my doing.
    4. Re: Are ASICs really that much better? by hritcu · · Score: 2, Funny
      they did modify the program to run on what it does have: 4x4 matrix operation units for multiplication and addiction.
      This is also what my mother was always telling me when I was a boy: "if you play video games you'll become an addict". She was so right.
      --
      If you don't fail at least 90 percent of the time, you're not aiming high enough. (Alan Kay)
  3. IBM right all along, or obvious? by Gothmolly · · Score: 2, Funny

    Custom app written to run on hardware specifically designed to run apps like it, outperforming general purpose CPUs? Newsflash from Ric Romero!!1!

    --
    I want to delete my account but Slashdot doesn't allow it.
  4. Re:So obvious... by Iron+Condor · · Score: 5, Funny

    ASCII silly question, get a silly ANSI.

    --
    We're all born with nothing.
    If you die in debt, you're ahead.
  5. Distributed amongst home users by Skevin · · Score: 4, Funny

    So, will someone please create a really pretty 3D screensaver representing the folding calculation process? I'd love to see a represention with hi-res lighting and texturing, full transforms, and user-scalable views at 400 million triangles/sec.. Thanks.

    Solomon

    --
    "Twice half-assed makes an ass whole." --Solomon K. Chang
    1. Re:Distributed amongst home users by Enderandrew · · Score: 4, Interesting

      The folding team has done this, and it will be a free download for the PS3 version. The Cell processor runs the Folding application itself, and the graphical representation of the protein folding calculations will be handled by the GPU with a pretty display.

      --
      http://blindscribblings.com - Tasty pop-culture in conceptual fashion.
    2. Re:Distributed amongst home users by holysin · · Score: 3, Funny

      There is this lovely feature called a power button. There's also this really handy feature on most tvs since the mid-late 90s called multiple inputs.

    3. Re:Distributed amongst home users by tukkayoot · · Score: 2, Insightful

      It may be useful in getting people to actually download and run the client.

  6. GPUs are Specialized Parallel Computers by ThinkFr33ly · · Score: 4, Insightful

    GPUs are, for the most part, highly specialized parallel computers. Virtually all modern CPUs are serial computers. They do essentially one thing at a time. Because of this, most modern programming languages are taylored to this serial processing.

    Making a general purpose parallel computer is very, very hard. It just so happens that you can use things like shaders for more than just graphics processing, and so via OpenGL and DirectX you can make GPUs do some nifty things.

    In theory, and indeed often in practice, parallel computers are much, much faster than their serial counterparts. Hence the reason a GPU that costs $200 can render incredible 3D scenes that a $1000 CPU wouldn't have a prayer trying to render.

    1. Re:GPUs are Specialized Parallel Computers by dslauson · · Score: 4, Informative

      Yes. That's basically right.

      Here's a Wikipedia article on general purpose GPU processing.

      Folding is what's know as a rediculously parallel problem. That is, it can be broken up in to small subproblems that can be distributed among many processors with a minimal amount of communication among processors. It also benefits from not requiring a lot of branching (if/switch statements and such), which GPUs generally do not handle well.

      Many problems, (I'd argue MOST problems) do not cater well to these kinds of restrictions. So, while a GPU is well suited to crunching away on pieces of the folding problem, it's going to be lousy at doing the day-to-day stuff you do with your computer.

    2. Re:GPUs are Specialized Parallel Computers by Goner · · Score: 4, Informative

      The technical term (jargon) is embarrassingly parallel.

  7. This is the perfect time... by loraksus · · Score: 2, Interesting

    ... to start heating your house with your computers ;)

    I actually installed boinc with seti on several of my machines last night and it worked quite well to heat part of the house (us Canadians need to turn the heater on earlier). Took a bit of time to get started, but it was nice and toasty in the morning.

    Does anyone know if this method is less efficient in generating heat than using a apace heater? Slower perhaps...
    If you're going to use energy by turning on the wall heater anyways, why not use it to crunch some numbers?

    --
    1q2w3e4r5t6y7u8i9o0pqawsedrftgthyjukilo;p'azsxdcfv gbhnjmk,l.;/
    1. Re:This is the perfect time... by loraksus · · Score: 2, Funny

      Well, I really, really hate the "heater that hasn't been used for 6 months" smell, so that was sort of my primary focus.
      That makes it a little better... right?
      Please?

      --
      1q2w3e4r5t6y7u8i9o0pqawsedrftgthyjukilo;p'azsxdcfv gbhnjmk,l.;/
    2. Re:This is the perfect time... by Murphy+Murph · · Score: 4, Insightful

      I actually installed boinc with seti on several of my machines last night and it worked quite well to heat part of the house (us Canadians need to turn the heater on earlier). Took a bit of time to get started, but it was nice and toasty in the morning. Does anyone know if this method is less efficient in generating heat than using a apace heater? Slower perhaps..


      Using your CPU as a space heater is not a bad idea. It is 100% efficient. Every watt it consumes gets turned into heat. Before someone says "but the cooling fans are wasteful" let me remind you that the air moved by those cooling fans will eventually come to a stop (inside your house) as a result of friction, releasing its energy as heat in the process.

      Depending on what type of space heater you use, and the construction of your house, your computer can be more efficient than many other electric space heaters. Since none of the energy "consumed" by your CPU/GPU is converted to visible light, none of it has the opportunity to leave your house through your window panes (assuming you have IR reflective glass). Contrast this to quartz and halogen space heaters which produce a fair amount of visible light.

      In much the same way, incandescent bulbs match the efficiency of compact fluorescents during the winter months. Every watt "wasted" as heat during the summer is now performing useful work heating your house. (Before someone says "you called a quartz/halogen space heater inefficient because of its waste light, and now an incandescent efficient because of its waste heat!' let me say that the space heater's light is not useful light, while the bulb's heat is useful heat (during the cool months.))
      --
      I dub thee... Sir Phobos, Knight of Mars, Beater of Ass.
    3. Re:This is the perfect time... by olddoc · · Score: 2, Insightful

      If my computer idles at 150w and runs FAH at 100% cpu at 200w and I need 20h to generate 1 unit,
      I am spending $.10 for the extra kw hour roughly. In the summer I waste money on AC in the winter I save
      gas money on heat. If I put my computer in 4watt S3 standby for 15 of those 20 hours, I can save a lot more.
      FAH calculations do not depend on "free" "idle" computer power, they depend on users spending money to generate
      the results.

      --
      Power tends to corrupt, and absolute power corrupts absolutely.
    4. Re:This is the perfect time... by ameline · · Score: 3, Insightful

      Ok, I went and did the math (assuming, on average 1035 btu/cubic foot of natural gas) Looking at my bills,

      Natural gas is (cdn)$0.278287 per cubic meter, and electricity is 0.058 /kwh.

      At 96% efficiency, natural gas works out to 0.027331 / kwh, (3413 btu in 1 kwh) or 47% of the cost of electricity at today's prices in Toronto.

      So 1/3 was a bit of hyperbole, but not too much.

      --
      Ian Ameline
    5. Re:This is the perfect time... by evilviper · · Score: 3, Informative
      It is 100% efficient.

      Not true. You aren't taking into account Power Factor at all... Not that I'm surprised, as most people don't understand it.

      With switching power supplies, it's common to see PF in the range of 0.4, as opposed to fully-resistive electric space-heaters (and incandesent lightbulbs) with a perfect 1.0 PF.

      Residential customers are lucky, in that they don't get charged for PF losses by the power company, while companies certainly do. However, it's still highly ineffecient, even if you aren't paying for it directly.

      And besides that, electric heating is almost always more expensive than conventional heating, like natural gas, or electric heatpumps.
      --
      Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
  8. Re:So obvious... by Anonymous Coward · · Score: 2, Funny

    - Cool, can I play too?
    - No, that's what consoles are for.

  9. Not a mystery by DaveJay · · Score: 2, Insightful

    Take one hundred people with computers, and who have an interest in Folding@Home. Offer them a CPU-driven version of the app, and 100 computers will be running the CPU-driven app, regardless of the age/performance of the machine.

    Now, offer them a GPU-driven alternative. For the most part, the only people that will install and run it are those with a fancy-schmancy video card capable of running it, and for the most part, the only people that have a fancy-schmancy video card capable of running it have high-performance computers as well (or at least more recent computers that came with compatible cards.)

    So let's say that's ten out of the hundred, and those ten are statistically likely to have had the highest-performing CPUs as well; so you've pulled the top ten performers out of the CPU-client pool, and thrown them in the GPU-client pool. Even if you didn't switch those ten people over to the GPU, you could probably isolate those computers' CPU-client performance numbers from the other 90 and find that they're disproportionally faster than a larger number of the slower computers.

    There's still more to the story, of course, but you really are taking the highest-performing computers out of the CPU pool and into the GPU pool. The exception would be high-performance servers with lousy/no graphic cards, but those are likely working so hard to perform their normal business that Folding@Home isn't a priority.

    1. Re:Not a mystery by tkittel · · Score: 4, Insightful

      Your logic is fine, but you are overestimating the effect you mention if you really think that it "solves the mystery".

      500 users out of 25000 means that you have at most taken the 2 percent highest performers out of the CPU pool. If we assume that those 2 percent have computers that are 5 times as powerful as the average computer, then we have lowered the average performance of the CPU pool by roughly 9%.

      This 9% systematic effect will lower the reported performance superiority of around 5000% of the GPU vs. the CPU to something like 4500%. I.e. it doesnt change the result at all (which seems to be that GPUs kick ass for these applications).

  10. Move the vector processor on-board? by Zygfryd · · Score: 4, Interesting

    So when are we going to see (x86/64) motherboards with a socket for a standard processor and a socket for a vector processor?
    Couldn't we finally have graphics cards that only give output to the screen and separate vector processors with a standardized interface / instruction set?

  11. Remember: 1 GPU has more than one processor. by nick_davison · · Score: 4, Informative

    X1900 - 48 pixel shader processors plus 8 vertex shaders. Assuming you manage to run them all equally in parallel: 56 processors.

    Standard CPU - 1 core (assuming dual cores get read as 2 CPUs).

    448 GPUs x 56 = 25,088 effective processors all with on card memory.

    25,050 CPUs x 1 core = 25,050 effective processors all dealing with system busses etc.

    In short, if you're performing one simple task trillions of times, many very simple, highly optimized processors with dedicated memory do the job better than even a similar number of much more capable processors that have to play nice across a whole system.

    And this ignores the number of old couple of hundred megahertz systems that people don't use anymore so hand over to the task vs. X1900s being the very high end of ATIs most recent line.

    For massively parallel tasks like rendering pixels, folding proteins, compressing frames of a movie, etc. I'd absolutely love large quantities of a simple processor. For most other tasks, given present technology, I'd still side with fewer more able processors. Either way comparing 448 of something with 56 processors within it to 25,000 single processors and saying, "But 448 is SO much less than 25,000!" is an unfair comparrison.

  12. Re:So obvious... by Jesus_666 · · Score: 4, Funny

    UTF are you talking about? I'm quite sure the mods are not latin-1 post like this go unmoderated.

    --
    USE HOT GRITS WITH STATUE OF NATALIE PORTMAN (NAKED AND PETRIFIED)
  13. Golly Beav.... by certain+death · · Score: 2, Funny

    Imagine if they had developed this application for NVidia video cards, probably 2x the speed!!1! Go ahead, mod me a troll....I will appologize tomorrow :o)

    --
    "My immediate reaction is "WTF? What kind of moron doesn't make things 64-bit safe to begin with?" Linus
  14. Not really. by TerranFury · · Score: 3, Insightful

    >Using your CPU as a space heater is not a bad idea. It is 100% efficient.

    Not really. Consider exergy. Yes, your CPU is just as efficient as any electric space heater. However, consider that the alternative is probably burning natural gas or oil in a furnace. If you burn fuel for heat, 90%+ of the chemical energy goes to producing heat (the rest is lost as unburnt hydrocarbons in the exhaust). If you burn fuel to spin a turbine at a power plant, only about 40% goes to electrical energy, and unless it's a cogeneration plant which uses the waste heat for industrial purposes, the rest is lost as heat up the smokestacks. So, starting from the fossil-fuel source, electrical heating is less than half as efficient as burning fuel for heat. If you do need to heat using electric power, it's much more efficient to use that electricity to pump heat in from a lower temperature outside than it is to turn that electricity itself into heat.

    If you are stuck with electric (non-heat-pump) heating in your house, however, you are correct: There is absolutely no reason not to run your CPU or any other electrical appliance full tilt.

  15. Actually Linux users have good hardware: by olddoc · · Score: 3, Informative

    Look at the number of Tflops per active cpu by OS.
    I took (TFlop/active cpu)*1000 to get a readable number --
    or Gflops/cpu
    Windows is .948
    Mac is .51
    Linux is 1.21
    And GPU is 65!

    The source:
    http://fah-web.stanford.edu/cgi-bin/main.py?qtype= osstats

    The average Linux user proably has a decent AMD Athlon,
    The average Windoze user has a P4 Dell.
    Athlons just crunch the math better.

    --
    Power tends to corrupt, and absolute power corrupts absolutely.
  16. Re:Lopsided Alright..ASIC and yea shall receive. by Baddas · · Score: 2, Funny

    Man, imagine how simple-minded and yet weak superhuman an AI that ran solely on GPUs would be?

    "I can think of many, many things at once. As long as they are the same type of thing."

  17. Re:PF losses are not losses in any real sense by evilviper · · Score: 2, Informative
    PF "losses" are not losses, it is power that is in effect returned back to the source. One can simply treat it as power that isn't delivered at all.

    Electricity isn't water, you can't return it to the source.

    With a lower power factor, you're either forcing the power company to install huge banks of capacitors, or making the generators work that much harder for fewer watts actually delivered/used. That's practically the definition of "inefficent".
    --
    Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
  18. Re:PF losses are not losses in any real sense by Agripa · · Score: 2, Informative

    In the case of computer power supplies that use a rectifier and capacitor combination for AC to DC conversion which is almost all of them, they do not look like an inductive or capacitive load but have a lower power factor caused by drawing current in pulses instead of a sine wave. The result is a higher RMS current then necessary for the load which causes increased line losses and requires higher current capacity for a given power. In extreme cases, distribution transformers can go into saturation causing additional losses.

    A power factor of 0.6 does not mean 0.6 watts available for every watt sent. It means the capacity of the line is reduced to 60% of normal do to excessive circulating current. This is easy to see when you look at the rated output power of a typical wall socket, 120 VAC x 15 A = 1800 W, of which you can only use 1100 watts (actually 1080) for a computer load. 1100 watts is the largest common size for inexpensive uninterruptible power supplies for this very reason.

    A 0.6 power factor should cause about a 66% increase in line losses for a given load.