Slashdot Mirror


Nvidia's Chief Scientist on the Future of the GPU

teh bigz writes "There's been a lot of talk about integrating the GPU into the CPU, but David Kirk believes that the two will continue to co-exist. Bit-tech got to sit down with Nvidia's Chief Scientist for an interview that discusses the changing roles of CPUs and GPUs, GPU computing (CUDA), Larrabee, and what he thinks about Intel's and AMD's futures. From the article: 'What would happen if multi-core processors increase core counts further though, does David believe that this will give consumers enough power to deliver what most of them need and, as a result of that, would it erode away at Nvidia's consumer installed base? "No, that's ridiculous — it would be at least a thousand times too slow [for graphics]," he said. "Adding four more cores, for example, is not going anywhere near close to what is required.""

143 comments

  1. Nah... by Anonymous Coward · · Score: 0, Troll

    It won't work, since linux will never get the drivers for it going.

    1. Re:Nah... by pato101 · · Score: 1

      It won't work, since linux will never get the drivers for it going. stop trolling! NVidia's drivers for Linux do the CUDA stuff!
    2. Re:Nah... by witherstaff · · Score: 1

      On the nVidia driver bashing, I'm still waiting for a driver for FreeBSD on amd64. It's a back and forth blame game between Nvidia and FreeBSD devs and it has been going on for years.

    3. Re:Nah... by Anonymous Coward · · Score: 0

      Dude, if you are willing to come down here into my parents basement and get it working I'll pay you a consulting fee.

  2. NV on the war path? by Vigile · · Score: 3, Interesting

    Pretty good read; interesting that this guy is talking to press a lot more:

    http://www.pcper.com/article.php?aid=530

    Must be part of the "attack Intel" strategy?

    1. Re:NV on the war path? by pato101 · · Score: 1

      interesting that this guy is talking to press a lot more: I was a couple of weeks ago in a conference given by him at Barcelona (Spain). He is a nice speaker. He seems at a round of conferences around the world universities showing the CUDA technology. By the way, CUDA technology seems to be an interesting thing.
    2. Re:NV on the war path? by Anonymous Coward · · Score: 0

      NV is pushing FUD.

      The SIMD processing on CPUs will eventually overtake GPUs.

      The End-Game:
      4 Sockets on the mobo, 8 Cores/CPU, only plug in/buy as many CPUs that you need. GPUs are dead.

  3. At some point in the future... by Anonymous Coward · · Score: 4, Funny

    Everything will be integrated into one chip, and we will call it the PU.

    1. Re:At some point in the future... by UID30 · · Score: 1

      Pew pew pew pew.

      (quad core)

      --
      "Glory is fleeting, but obscurity is forever." - Napoleon Bonaparte
    2. Re:At some point in the future... by werelnon · · Score: 1

      and then it will be broken up into pieces again. The wheel of reincarnation always seems to keep turning.

  4. Very surprising by athdemo · · Score: 1

    I would never have expected nVidia's chief scientist to say that nVidia's products would not soon be obsolete.

    1. Re:Very surprising by AKAImBatman · · Score: 2, Interesting

      I would never have expected nVidia's chief scientist to say that nVidia's products would not soon be obsolete.

      Moving to a combined CPU/GPU wouldn't obsolete NVidia's product-line. Quite the opposite, in fact. NVidia would get to become something called a Fabless semiconductor company. Basically, companies like Intel could license the GPU designs from NVidia and integrate them into their own CPU dies. This means that Intel would handle the manufacturing and NVidia would see larger profit margins. NVidia (IIRC) already does this with their 3D chips targeted at ARM chips and cell phones.

      The problem is that the GPU chipset looks nothing like the CPU chipset. The GPU is designed for massive parallelism, while CPUs have traditionally been designed for single-threaded operation. While CPUs are definitely moving in the multithreaded direction and GPUs are moving in the general-purpose direction, it's still too early to combine them. Attempting to do so would get you the worst of both worlds rather than the best. (i.e. Shared Memory Architecture )

      So I don't think NVidia's chief scientist is off on this. (If he was, we'd probably already see GPU integration in the current generation of game consoles; all of which are custom chips.) The time will come, but it's not here yet. :-)
    2. Re:Very surprising by Hatta · · Score: 1

      I would. After all, why buy nVidia's next product if their current product isn't obsolete?

      --
      Give me Classic Slashdot or give me death!
    3. Re:Very surprising by Rockoon · · Score: 1

      "Moving to a combined CPU/GPU" leaves NVidia out of the loop because NVidia does not control the API which makes its hardware work. The standard is DirectX, not an NV part. NVidia will never become a "fabless gpu manufacturer" in the PC market because of this. The technical details of how and why a GPU can accomplish so much work in so lttle time is exactly that, a technical detail. There is no Intellectual Property here that NVidia gets to bargain with. Raster algorithms are more or less simple things. Additionally, most of the agree-with-nvidia camp that have posted here seem to misunderstand what is comming, using arguements that are only justified for symetric multicores. The near future is asymetric multicores, with 4 or 8 general purpose processors and 32+ additional programmable stream processors. Why do you think AMD was so keen to buy ATI? Those stream processors don't just fullfill the GPU requirement. They work for all large scale highy parallel number crunching tasks, such as those being pioneered by the GPGPU guys.

      --
      "His name was James Damore."
    4. Re:Very surprising by default+luser · · Score: 1

      Yes, actually what Nvidia fears most is being cut-out of the chipset business. Right now, most PCs ship with an integrated graphics chipset (even more true in the notebook/ultraporatble market). If the GPU is integrated on the CPU die, then Nvidia loses out completely in the integrated graphics chipset market.

      They may also lose market share because new APIs besides Direct3D may surface to control these new hybrid processors (think CUDA). If Nvidia is not there on the ground floor, then their own API (CUDA) will not gain traction in the general-purpose stream computing market. While big players like Microsoft may eventually release a standard, this may take years (remember how long DirextX took to "get it right," and how Glide initially stole the show).

      I've actually been waiting for Nvidia to buy Via; it's made sense from the moment that AMD announced Fusion and Intel announced similar plans. Nvidia has got to create a hybrid product, and unfortunately for them it's got to be x86-compatible. Right now, they're playing around with an Arm-based hybrid processor setup, but they need to move to x86 if they inten to be taken seriously.

      --

      Man is the animal that laughs.
      And occasionally whores for Karma.

    5. Re:Very surprising by Rockoon · · Score: 1

      For what its worth, Microsoft Research has played around with leveraging these new technologies.

      They are currently calling it Accelerator and in its current form, it interfaces with GPU's to perform what I would call "Super-SIMD." You can perform integer work very very fast using it. Development on Accelerator stopped several years ago, probably because the Microsoft Research guys are waiting for something a little more concrete to start planning for.

      --
      "His name was James Damore."
  5. CPU based GPU will not work as good as long as the by Joe+The+Dragon · · Score: 1

    CPU based GPU will not work as good as long as they have to use the main system ram also heat will limit there power. NVIDAI should start working HTX video card so you can the video card on the cpu bus but it is on a card so you put ram and big heat sinks on it.

  6. Ugh. by Anonymous Coward · · Score: 2, Insightful

    From TFA> The ability to do one thing really quickly doesn't help you that much when you have a lot of things, but the ability to do a lot of things doesn't help that much when you have just one thing to do. However, if you modify the CPU so that it's doing multiple things, then when you're only doing one thing it's not going to be any faster.

    David Kirk takes 2 minutes to get ready for work every morning because he can shit, shower and shave at the same time.

    1. Re:Ugh. by Ethanol-fueled · · Score: 0, Offtopic

      But can he sneeze, shit, and cum at the same time? Damn, that would feel good!

    2. Re:Ugh. by Eternal+Vigilance · · Score: 1

      I feel such pity for your significant other....

      ( Oh, wait, this is /. Your drycleaner, then. OTOH, you probably enjoy hay fever season more than anyone. ;-) )

  7. FOR NOW by Relic+of+the+Future · · Score: 2, Interesting

    There wasn't a horizon given on his predictions. What he said about the important numbers being "1" and "12,000" means consumer CPUs have about, what, 9 to 12 years to go before we get there? At which point it'd be foolish /not/ to have the GPU be part of the CPU. Personally, I think it'll be a bit sooner than that. Not next year, or the year after; but soon.

    --
    Those who fail to understand communication protocols, are doomed to repeat them over port 80.
    1. Re:FOR NOW by Bastard+of+Subhumani · · Score: 1

      Personally, I think it'll be a bit sooner than that. Not next year, or the year after; but soon.
      You mean it'll coincide with the release of Duke Nukem Forever?
      --
      Only three things are certain; death, taxes, and apocryphal quotations - Ben Franklin.
    2. Re:FOR NOW by Keith+Russell · · Score: 1

      You mean it'll coincide with the release of Duke Nukem Forever?

      Nope. Duke Nukem Forever will be delayed so the engine can maximize the potential of the new combined GPU/CPU tech.

      --
      This sig intentionally left blank.
    3. Re:FOR NOW by Dolda2000 · · Score: 2, Insightful
      Why would one even want to have a GPU on the same die as the CPU? Maybe I'm just being dense here, but I don't see the advantage.

      On the other hand, I certainly do see possibly disadvantages with it. For one thing, they would reasonably be sharing one bus interface in that case, which would lead to possibly less parallelism in the system.

      I absolutely love your sig, though. :)

    4. Re:FOR NOW by renoX · · Score: 2, Interesting

      >Why would one even want to have a GPU on the same die as the CPU?

      Think about low end computers, IMHO putting the GPU in the same die as the CPU will provide better performance/cost than embedded in the motherboard.

      And a huge number of computers have integrated video so this is an important market too.

    5. Re:FOR NOW by smallfries · · Score: 1

      It can do vast amounts of linear algebra really quickly. That makes it useful for a lot of applications if you decrease the latency between the processor and the vector pipelines.

      Sharing one bus would hamper bandwidth per core (or parallelism as you've phrased it) - but look at the memory interface designs in mini-computers/mainframes over the past ten years for some guesses on how that will end up. Probably splitting the single bus into may point-to-point links, or at least that is where AMD's money was.

      --
      Slashdot: where don knuth is an idiot because he cant grasp the awesome power of php
    6. Re:FOR NOW by Dolda2000 · · Score: 1

      Think about low end computers, IMHO putting the GPU in the same die as the CPU will provide better performance/cost than embedded in the motherboard. Oh? I thought I always heard about this CPU/GPU combo chip in the context of high-performance graphics, but I may just have mistaken the intent, then. If it's about economics, I can understand it. Thanks for the explanation!
    7. Re:FOR NOW by bluefoxlucid · · Score: 1

      The GPU doesn't care about CPU cache, the CPU doesn't care about VRAM. You'll create a heat problem and need an extended memory bus to access video memory. Graphics without dedicated VRAM causes a huge CPU performance hit due to rapid and repeated north bridge/memory bus access.

  8. Consider the source by dj245 · · Score: 1

    Graphics card man says that CPU's not a threat to his businees. I'm shocked!

    --
    Even those who arrange and design shrubberies are under considerable economic stress at this period in history.
    1. Re:Consider the source by Ngarrang · · Score: 1

      This just in, Slashdotters think slashdot is the best web site!

      --
      Bearded Dragon
    2. Re:Consider the source by Anonymous Coward · · Score: 0

      This just in, Slashdotters think slashdot is the best web site! You must be new here.
    3. Re:Consider the source by lnjasdpppun · · Score: 1

      I like how he says Intel can't see the complexity of designing GPU's because they build CPU's, then turns around and says nVidia could build a CPU easily.

    4. Re:Consider the source by Anonymous Coward · · Score: 0

      I suspect many Slashdotters prefer porn sites, but need to take a break once in a while...

  9. On the high-end... by Kjella · · Score: 1

    ..there's discrete chips, but on the low end there's already integrated chipsets and I think the future is heading towards systems on a chip. A basic desktop with hardware HD decoding and 3D enough to run Aero (but not games) can be made in one package by Intel.

    --
    Live today, because you never know what tomorrow brings
    1. Re:On the high-end... by Telvin_3d · · Score: 1

      Aero takes more graphics support than some games. Even some new games if you look at some smaller niche titles.

    2. Re:On the high-end... by Anonymous Coward · · Score: 0

      And that is why it is failing

    3. Re:On the high-end... by billcopc · · Score: 1

      Minesweeper is a game. It's about as hard on a GPU as most "niche" titles you speak of, because the great majority of low-budget titles are built by glorified VB coders.

      I'm not saying a game needs to pound the crap out of my machine to be considered entertaining. What I'm saying is the game industry is full of junk in all segments. Great ideas with crap developers, and crap ideas with great devs. Once in a blue moon, both kinds of geniuses meet up and produce gaming nirvana, the other 99% isn't even worth a screenshot.

      --
      -Billco, Fnarg.com
    4. Re:On the high-end... by bluefoxlucid · · Score: 1

      John Carmack usually goes, "Your ideas are stupid. We're going to make something cool now unless you fire all of us."

  10. And later.... by Chyeld · · Score: 1

    "No, that's ridiculous -- it would be at least a thousand times too slow [for graphics]," he said. "Adding four more cores, for example, is not going anywhere near close to what is required."

    He then quipped, "Go away kid, ya bother me!"

  11. Summary by InvisblePinkUnicorn · · Score: 1

    "If the market wants to move away from GPU integration, let it, but we're not going to help it along..."

  12. More interested in open drivers by pembo13 · · Score: 1

    So I am going which ever manufacturer has the best drivers for my platform of choice, Linux. So if the future doesn't hold this for Nvidia, it doesn't really interest me.

    --
    "Thanks for all the money you paid to us. We've used it to buy off ISO among other things" -Microsoft
    1. Re:More interested in open drivers by Telvin_3d · · Score: 1

      And if your platform of choice doesn't hold much future/value for Nvidia, you will continue to not really interest them.

      The only people who run Linux without access to a Windows/OSX box tend to be the ones who are only willing to run/support Open Source/Free software. This is also the group least likely to buy commercial games, even if they were released for Linux.

      No games -> No market share for high end graphics cards with big margin -> The graphics cards companies don't care

    2. Re:More interested in open drivers by pembo13 · · Score: 1

      Does Nvidia make commercial games? I thought they made hardware. I can't (yet) download hardware for free.

      --
      "Thanks for all the money you paid to us. We've used it to buy off ISO among other things" -Microsoft
    3. Re:More interested in open drivers by Anonymous Coward · · Score: 0

      >And if your platform of choice doesn't hold much future/value for Nvidia, you will continue to not really interest them.

      Yet ATI is very interested...

      >The only people who run Linux without access to a Windows/OSX box tend to be the ones who are only willing to run/support Open Source/Free software. This is also the group least likely to buy commercial games, even if they were released for Linux.

      That would explain why we buy the windows games then run them in WINE.

      You can't broadly generalize FOSS users. Not all of us are Stallman clones.

      As an earlier story today pointed out, if nVidia wants to keep the interest of OEMs like Dell, Lenovo, Asus, HP etc., they'll provide redistributable linux drivers or else the OEMs will give preference to ATI, which does provide them.

      As of now, the only place to legitimately get the linux drivers for nVidia chips is nVidia. If you want to bundle them in your distro, you taint your license and incur legal risk.

      Say "There aren't enough linux users to interest nVidia and game companies". Don't say the FOSS users that exist won't buy games, because it's completely wrong.

      I buy an average of 6 games a year. Sometimes more.

      -AC

    4. Re:More interested in open drivers by Anonymous Coward · · Score: 0

      To be fair to NVidia, they do have reasonable binary drivers. But yeah, I toy with 3D very little these days and buy intel purely because of the driver situation.

      Give me either companies products over what AMD/ATI are churning out though... pfft!

    5. Re:More interested in open drivers by IKnwThePiecesFt · · Score: 1

      He never implied they did. He was saying though that NVidia doesn't care about everyday productivity users, they care about gamers since gamers are the ones spending $500 for the top video cards. Since games are typically Windows exclusive (aside from less-than-perfect emulation) gamers tend to be Windows users. Thus, Linux is not their market and they don't care.

    6. Re:More interested in open drivers by jonaskoelker · · Score: 1

      This is also the group least likely to buy commercial games, even if they were released for Linux.

      No games => ... Ever played Nexuiz? Tremulous? Sauerbraten? Warsow? OpenArena? There are high-quality* free software (non-commercial) games...

      (*) Quality is defined as entertaining me. I think contemporary commercial non-free games entertain me about as well, and are slightly prettier while doing it; I haven't heard of any revolutions in game design. However, my play experience of contemporary commercial non-free games is limited to Wii Sports, Twilight Princess and Super Mario Galaxy.
  13. Why not.... by Anonymous Coward · · Score: 0

    Instead of 4 CPU cores on a quad-core chip, why not put 2xCPU cores and 2xGPU cores?

    1. Re:Why not.... by sexconker · · Score: 1

      Because the design of a "CPU" core is vastly different than that of a "GPU" core.

      The whole "OMG let's integrate everything!" routine is old. It is quickly followed by the realization (due to programmers getting frustrated with stupid quirks/implementation requirements, and hitting the always annoying performance wall) that things work better when they're designed for a specific purpose, and then we work to separate them out again, creating new buses and slots and form factors and power connectors.

      A while later, CPU people begin to envy the raw performance of the dedicated hardware, while the GPU (and other dedicated hardware) people see the untapped potential of a CPU sitting mostly idle, with a giant instruction set.

      They both then work on gearing their hardware to the other side of things (designing memory access and pipelines and extra instruction sets and registers to allow for more specialized tasks, for the CPU folks, and making everything programmable and more generic for the dedicated hardware folks).

      We eventually get to the point where people realize "Hey, they're both just processors doing some basic logic and math. Why do we need multiple things again?" and "Hey, if we got rid of this stupid bus, and put everything on one chip, we could save on latency, heat, power, and cost!"

      Then the merge everything, and the cycle repeats.

    2. Re:Why not.... by nick_davison · · Score: 2, Insightful

      Instead of 4 CPU cores on a quad-core chip, why not put 2xCPU cores and 2xGPU cores? Because now they have to make [number of CPU options] x [number of GPU options] variants rather than [number of CPU options] + [number of GPU options].

      Even taking a small subset of the market:
      8600GT, 8800GT, 8800GTS, 6600, 6700, 6800

      Six products sit on shelves. Users buy what they want. As a competitor to say the 8600GT comes out, Best Buy has to discount one product line.

      To give users the same choices as an integrated solution, that'd be 9 variants:

      8600GT/6600 - Budget
      8600GT/6700 - Typical desktop user
      8600GT/6800 - Photoshop user/media encoder
      8800GT/6600 - Poor gamer
      8800GT/6700 - Mid range gamer
      8800GT/6800 - Serious desktop user who likes to game
      8800GTS/6600 - Exclusive but somewhat poor gamer
      8800GTS/6700 - Gaming enthusiast
      8800GTS/6800 - Hardcore power gamer/3D Modeller

      Most users are now left scratching their heads as to whether the similarly priced 8600GT/6800 or the 8800GTS/6600 is better or worse for them than the also similarly priced 8800GT/6700.

      Plus, every time one part of the market is perceived as less valuable, the stores have to price many different skus.

      Now add in the gamer who bought a $200 GPU and a $300 CPU a little while before a great new mid range GPU option turns up. They can toss their $200 investment which sucks but that's probably it when it comes to upgrading. Or the guy who bought the $450 (we'll grant a small discount for single purchases) combined unit now has to toss both. Plus he most likely has to buy a new motherboard and memory because memory speed requirements and processor sockets change faster than Britney Spears' moods can swing.
  14. Re:CPU based GPU will not work as good as long as by Anonymous Coward · · Score: 1, Interesting

    Right, come back in 5 years when we have multi core processors with integrated spe-style cores, GPU and multiple memory controllers.

    NVidia are putting a brave face on it but they're not fooling anybody.

  15. Correction by InvisblePinkUnicorn · · Score: 1

    I was using the Old English sense of the phrase "away from", which actually means "toward".

  16. Why wouldn't you have a gpu core in a multiple ... by Cedric+Tsui · · Score: 3, Interesting

    ... core processor? I don't understand the author's logic. Now, suppose it's 2012 or so and multiple core processors have gotten past their initial growing pains and computers are finally able to use any number of cores each to their maximum potential at the same time.

    A logical improvement at this point would be to start specializing cores to specific types of jobs. As the processor assigns jobs to particular cores, it would preferentially assign tasks to the cores best suited for that type of processing.

  17. Re:CPU based GPU will not work as good as long as by arbiter1 · · Score: 2, Insightful

    truthfully only real application for the gpu/cpu hybrid would be in laptop use where they can get away with using lower end gpu chips

  18. VIA by StreetStealth · · Score: 2, Interesting

    The more Nvidia gets sassy with Intel, the closer they seem to inch toward VIA.

    This has been in the back of my mind for awhile... Could NV be looking at the integrated roadmap of ATI/AMD and thinking, long term, that perhaps they should consider more than a simple business relationship with VIA?

    --
    Your mind is clear / The things that you fear / Will fade with how much you / Believe what you hear
    1. Re:VIA by Retric · · Score: 2, Informative

      The real limitation on a CPU/GPU hybrid is memory bandwidth. A GPU is happy with .5 to 1 GB of FAST RAM but CPU running vista works best with 4-8GB of CHEEP ram and a large L2 cash. Think of it this way a GPU needs to access every bit of ram 60+ times per second but a CPU tends to work with a small section of a much larger pool of ram which is why L2 cash size/speed is so important.

      Now at the low end there is little need for a GPU but as soon as you want to start 3D gaming and working with Photoshop on the same system you are going to want both video and normal ram.

      PS: This is also why people don't use DDR3 memory for system RAM it's just not worth the cost for a 1-2% increase over cheep DDR2 ram.

    2. Re:VIA by JoshHeitzman · · Score: 2, Interesting

      Don't see why a hybrid couldn't have two memory controllers included right on the chip and then mobos could have slot(s) for the fast RAM nearest to the CPU socket and the slots for the slower RAM further away.

      --
      Software Inventor
    3. Re:VIA by m50d · · Score: 1

      Modern systems are already having trouble fitting everything that has to go near the CPU near the CPU; ordinary system RAM still wants latency as low as you can. I don't see this happening until after we start putting main memory on the same chip as the CPU.

      --
      I am trolling
    4. Re:VIA by smallfries · · Score: 1

      The simple answer is you use a memory hierarchy same as people do now. The L2 cache on a CPU is large enough to contain the working set for most problems. The working set for GPU-type problems tends to be accessed differently. You need some sort of caching for data but for lots of the memory you access it will be a really large pretty sequential stream. The memory locking in CUDA reflects this.

      So going back to your comment about memory mismatch. Some of your cores in a hybrid would have large L2 caches like a conventional CPU. Some of your cores would have almost no L2 cache but would share a really large pool of L3 (probably the same 1/2 GB of DDR3) and the rest would be system memory. If the large L3 pool is in use then the cpu type cores wouldn't see any benefit from this layer, .... but when the gpu parts are idle this would be a large speed boost for the cpu-type parts.

      --
      Slashdot: where don knuth is an idiot because he cant grasp the awesome power of php
    5. Re:VIA by Khyber · · Score: 1

      It's called R&D costs. They know they can do it, but right now, they're too busy milkiing the current cash cow to spend money on any decent R&D advances. I'm willing to bet they (nVidia) could have had SLI out the year after buying 3Dfx, but they were too busy working off the money and paying employees and bribes to be able to absolutely dominate the video industry and put ATi and any other company that made video cards (Matrox, Trident, etc) out of business.

      --
      Still waiting on Serviscope_minor to wake up to fucking reality and realize that Jessica Price isn't going to fuck him.
    6. Re:VIA by aliquis · · Score: 1

      On the Amiga this was somewhat solved by giving the graphics chips prioritized access over the chip ram.

      But back then you WANTED "fast mem", as in cpu specific ram, because it made the cpu work faster instead =P

      But at current memory prices and if production was moved to faster ram I guess it may be possible to just have a bunch of very fast memory and let the GPU have priority over it once again.

      Or as someone else said have both kinds even thought both gpus and cpus are within the same chip (why you would want that with such a design, why not just have them split as they are.)

      I guess combining them only makes sense once they are very similair, and once they are the memory bandwidth difference maybe aren't that huge any longer?

    7. Re:VIA by aliquis · · Score: 1

      Except ATI aren't out of business...

      And I don't see how they could bribe themself to dominence either, it's more likely that they just did the best product, again and again. Though luck for the companies which didn't had as competent crew and engineers.

    8. Re:VIA by Anonymous Coward · · Score: 0

      Patriot Dual Channel Viper 2048MB PC15000 DDR3 1866MHz Memory (2 x 1024MB)
      $269.99 x 2 = 4gb of DDR3 RAM for 539.88$ And you only get 3GB of ram for the system as the GPU takes 1 gig of that.
      vs.
      The new 1GB GeForce 8800 GT model has 112 Stream Processors set to 1500MHz, a GPU clocked at 600MHz and GDDR3 memory at 1800MHz. The card is in stock now for $299.99. + G.SKILL 2GB (2 x 1GB) 240-Pin DDR2 SDRAM DDR2 800 (PC2 6400) Dual Channel Kit Desktop Memory @ $48.99 * 2 = 4gb of DDR2 RAM for 90$ + 300$ graphics card saves you 140$ (plus the cost of the GPU on the CPU.)

      You could give the graphics card it's own memory but that's basicly a graphics card minus the GPU.

    9. Re:VIA by aliquis · · Score: 1

      And how much faster are DDR3 in reality? How much would you lose from using only DDR2? How much cheaper are one step slower DDR3?

      Not to mention if DDR3 are much faster you do get much faster memory for those $140 more for your CPU (Thought often occupied, and you don't need to transport data between system ram and graphics memory.

      Personally I don't think it's a good idea to combine them either, I was just reasoning =P

    10. Re:VIA by msromike · · Score: 1

      Does the Vista friendly RAM make birdlike sounds as it is being accessed?

  19. Qualified... by fitten · · Score: 1
    From TFA:

    "Sure," acknowledged David. "I think that if you look at any kind of computational problem that has a lot of parallelism and a lot of data, the GPU is an architecture that is better suited than that. It's possible that you could make the CPUs more GPU-like, but then you run the risk of them being less good at what they're good at now - that's one of the challenges ahead [for the CPU guys].


    Yeah... so all you have to do is turn every problem into one that GPUs are good at... lots of parallelism and lots of data... but not all problems are like that (heck, the majority of problems aren't like that). GPU stream processors do fairly simple jobs compared to what a (general purpose) CPU does *and* what they do is extremely parallel (embarassingly parallel). All that OOOE, branch prediction, memory management, and all those other features take silicon to make fast. That's the reason general purpose CPUs have few cores per die.

    Stream processors are very simple in comparison and don't require nearly as much silicon to implement, which is why we have over 100 of them on some chips. When you add the complexity that the general purpose CPU has to deal with to the GPU processors, you will eventually be in the same boat.

    Or maybe perhaps NVIDIA has been showing their graphics cards running a database engine? or even an OS as we are used to using (memory protection, etc.) What about compiling source code?

    The future is asymmetric cores on a single die. The DSPs and Cell are early forms of this but still too hard to deal with. OS kernels and compilers have to become smarter: the OS knows which cores can do what and the compiler can tell what kinds of things a program expects to do and puts that into the executable, the OS matchs the executable with the cores that best satisfy what the program needs (closest minimal match), perhaps even dynamically as different sections of a program are 'marked' by the compiler to let an OS know when to schedule the process for a different type of core.

    Today, they are explicitly programmed... the 'main' CPU makes library calls, basically, that use the other cores to do stuff, more like coprocessors. All this stuff will eventually need to be done automatically.
  20. And it will probably... by Nick+Driver · · Score: 1

    ...stink.

  21. Every time I walk out to my car I see raytracing. by argent · · Score: 2, Interesting

    There's the sun reflecting off the cars, there's the cars reflecting off each other, there's me reflecting off the cars. There's the whole parking lot reflecting off the building. Inside, there's this long covered walkway, and the reflections of the cars on one side and the trees on the other and the multiple internal reflections between the two banks of windows is part of what makes reality look real. AND it also tells me that there's someone running down the hall just around the corner inside the building, so I can move out of the way before I see them directly.

    You can't do that without raytracing, you just can't, and if you don't do it it looks fake. You get "shiny effect" windows with scenery painted on them, and that tells you "that's a window" but it doesn't make it look like one. It's like putting stick figures in and saying that's how you model humans.

    And if Professor Slusallek could do that in realtime with a hardwired raytracer... in 2005, I don't see how nVidia's going to do it with even 100,000 GPU cores in a cost-effective fashion. Raytracing is something that hardware does very well, and that's highly parallelizable, but both Intel and nVidia are attacking it in far too brute-force a fashion using the wrong kinds of tools.

  22. Re:Why wouldn't you have a gpu core in a multiple by Narpak · · Score: 1

    Because if processing power goes up way past what you generally need for even heavy apps, Nvidia still want you to believe that you need a separate graphics card. If that model were to change at some point it would be death for graphics card manufacturers. Of course, they could very well be right. What the hell do I know :P

  23. Future is set by Archangel+Michael · · Score: 3, Insightful

    The pattern set by the whole CPU / Math Co-Processor integration showed the way. For those old enough to remember, once upon a time the CPU and Math Co-Processor were separate socketed chips. Specifically you had to add the chip to the MOBO to get math functions integrated.

    The argument back then is eerily similar to the same as proposed by NV chief, namely the average user wouldn't "need" a Math Co-Processor. Then came along the Spreadsheet, and suddenly that point was moot.

    Fast forward today, if we had a dedicated GPU integrated with the CPU, it would eventually simplify things so that the next "killer app" could make use of commonly available GPU.

    Sorry, NV, but AMD and INTEL will be integrating GPU into the chip, bypassing bus issues and streamlining the timing. I suspect that VIDEO processing will be the next "Killer App". YouTube is just a precursor to what will become shortly.

    --
    Agent K: A *person* is smart. People are dumb, stupid, panicky animals, and you know it.
    1. Re:Future is set by TopSpin · · Score: 1

      CPUs, GPUs... in the end they're all ICs. Bets against integration inevitably lose. The history of computation is marked by integration.

      NVidia already makes good GPUs and tolerable chipsets. They should expand to make CPUs and build their own integrated platform. AMD has already proven there is room in the market for entirely non-Intel platforms.

      It's that or wait till the competition puts out cheap, low power integrated equivalents that annihilate NVidia's market share. I think they have the credibility and could leverage the necessary capital. The question is whether NVidia has the vision to act. Probably not; they've been very successful for a long period and may have weeded out any risky leadership.

      They'll probably just get bought by HP, long after their relevance has faded.

      --
      Lurking at the bottom of the gravity well, getting old
    2. Re:Future is set by Anonymous Coward · · Score: 0

      It's preposterous to say that spreadsheets drove adoption of FPUs.
      Even for a huge spreadsheet at the time (say 10000 cells), each cell containing a complex formula (say 5 multiplications, 5 divisions and 10 additions/subtractions), would only use approx. 0.1 seconds to recalculate on a 25mhz 386sx..

    3. Re:Future is set by TheRaven64 · · Score: 1

      nVidia would be foolish to think that the desktop graphics market won't follow the same trends as the workstation graphics market, since their founders were at SGI when that trend started and were the ones that noticed it. I suspect this is why nVidia have licensed the ARM11 MPCore from ARM. They are using it in the APX 2500, which has one to four ARM CPU cores at up to 750MHz, and an nVidia-developed GPU, which supports OpenGL 2.0 ES and 720p encoding and decoding of H.264, in a small enough (and low enough power) package for handheld devices.

      --
      I am TheRaven on Soylent News
    4. Re:Future is set by blahplusplus · · Score: 1

      "The pattern set by the whole CPU / Math Co-Processor integration showed the way. For those old enough to remember, once upon a time the CPU and Math Co-Processor were separate socketed chips"

      Math co-processors did not have massive bandwidth requirements that modern GPU's need in order to pump out frames. Everyone in this discussion seeing the merging of CPU and GPU haven't been around long enough, I remember many times back in the 80's and 90's the same people predicting the 'end of the graphics card' it NEVER HAPPENED, even with 2D cards, and this was well before the advent of 3D accelerators.

    5. Re:Future is set by Anonymous Coward · · Score: 0

      I remember many times back in the 80's and 90's the same people predicting the 'end of the graphics card' it NEVER HAPPENED, even with 2D cards, and this was well before the advent of 3D accelerators.
      You've seen it go from an enormous ISA card that did simple VESA graphics, to a tiny IC on the motherboard capable of decent 3D acceleration. We're pretty clearly moving in the direction where even many gamers will not want to buy a discrete GPU card. Will that mean CPU/GPU integration? I have no idea, but AMD seems to think so.
    6. Re:Future is set by Archangel+Michael · · Score: 1

      Your error is that I'm not suggesting the "end" of a graphics chip (or better, graphics core). Bringing the GPU to the CPU Core will INCREASE bandwidth, not decrease it, because it will not be limited by whatever bus you're running.

      The bus between the CPU cores, Memory, GPU and whatnot could be ultimately tuned in ways you might not be able to do with a standardized bus (PCIe, AGP etc).

      And in fact, the old CPU /Math CoProcessor was limited bus speed/bandwidth, which ONE of the reasons they brought it to the CPU chip itself.

      There were other reasons as well, so don't get me wrong, but bus limitations was one of them. Additionally many of the other reasons for putting the Math Co-Processor on the same die as the rest of the CPU also applies here as well.

      --
      Agent K: A *person* is smart. People are dumb, stupid, panicky animals, and you know it.
    7. Re:Future is set by blahplusplus · · Score: 1

      The point you keep missing of course is bandwidth and memory, the math co-processor is NOTHING LIKE a modern gpu, it did not have nor need what GPU's require in order to function, even GPU's are under tremendous strain, moving that strain to a different place and centralizing it with general purpose processor is not going to pan out.

      And integrated graphics still suck, the fact is no one wants to subsidize integrating high end GPU chips onto motherboards to standardize the platform because of the costs associated with that. This is where markets pervert and hamper progress.

    8. Re:Future is set by Anonymous Coward · · Score: 0

      Well, yes.

      But no. The reason for not integrating the FPU on the CPU was that, one, they had better things to spend transistors on (getting rid of the microcode, for starters), and two, there was just no way to put both on the same die. For a long time the FPU had a higher transistor count than the CPU. The 8087 had 45000 against the 8086's 29000. Even then, the 80387 was the first of Intel's to implement the whole of IEEE754.

      Then came along Moore's law and suddenly that point was moot.

      I agree that the GPU will eventually end up on the CPU, but the reason it's not right now is, I think, different, and that's memory bandwidth.

  24. Re:CPU based GPU will not work as good as long as by maxume · · Score: 1

    How many more pixels do you think you need? I'm glad they are looking ahead to the point when graphics is sitting on chip.

    (current high end boards will push an awful lot of pixels. Intel is a generation or two away from single chip solutions that will push an awful lot of pixels. Shiny only needs to progress to the point where it is better than our eyes, and it isn't a factor of 100 away, it is closer to a factor of 20 away, less on smaller screens)

    --
    Nerd rage is the funniest rage.
  25. SMP not the answer by Anonymous Coward · · Score: 0

    We already have multi-core systems and they rarely improve gaming. Why? Because almost all games are not coded to take advantage of a SMP environment.

    Even Carmack himself will tell you that it is very challenging to develop a truly multi-threaded app, especially when a real-time component exists.

    Most games operate in a synchronous state machine type fashion with rendering just another step in the cycle. This does not lend itself to parallelism. In order to truly take advantage of SMP, most of the big game engines in use would have to be re-written from the ground up.

    Unless we move the engine to the GPU, then Intel would really be in trouble. Already moving that direction piecemeal... shaders (well sort of, dynamic code on the GPU counts in my book), aegia physx anyone?

  26. Realtime Ray Tracing and Multicore CPU's by nherc · · Score: 4, Interesting

    Despite what some major 3D game engine creators have to say if real-time ray tracing comes sooner than later, at about the time an eight core CPU is common, I think we might be able to do away with the graphics card especially considering the improved floating point units going in next gen. cores. Consider Intel's QuakeIV Raytraced running at 99fps at 720P on a dual quad-core Intel rig at IDF 2007. This set-up did not use any graphic card processing power and scales up and down. So, if you think 1280x720 is a decent resolution AND 50fps is fine you can play this now with a single quad-core processor. Now imagine it with dual octo-cores which should be available when? Next year? I hazard 120fps at 1080P on your (granted) above average rig doing real time ray tracing some time next year IF developers went that route AND still playable resolutions and decent fps with "old" (by that time) quad-cores.

    --
    'He was a dreamer, a thinker, a speculative philosopher... or, as his wife would have it, an idiot.' - Douglas Adams
    1. Re:Realtime Ray Tracing and Multicore CPU's by Sancho · · Score: 1

      It seems like you could still have specialized ray-tracing hardware. Whether that's integrated into the main CPU as a specialized core, or as an expansion card really isn't relevant, though.

      I think the best thing about heading in this direction is that "accelerated" graphics no longer becomes limited by your OS--assuming your OS supports the full instruction set of the CPU. No more whining that Mac Minis have crappy graphics cards, no more whining that Linux has crappy GPU driver support....

      The downside is that an easy upgrade path gets lost. Right now, you can breathe new life into your aging system by upgrading the graphics card (if you're wanting to play newer games, of course.) Upgrading the CPU is a little more intimidating.

    2. Re:Realtime Ray Tracing and Multicore CPU's by nabasu · · Score: 1

      Problem is just that ray-tracing isn't all that. You want control when you design a game environment. Control over how everything looks and all the lights. I think that there will be lighting solutions to take advantage of more CPU horsepower, like Geomeric's Enlighten. But a full raytracing solution would just plain suck. If we look at the film business, you will see that the most commonly used render solution is Pixar's Renderman. It's a mix between raytracing and more traditional methods. I think that's where gaming is headed now, a mix between traditional graphics and a touch of raytracing.

    3. Re:Realtime Ray Tracing and Multicore CPU's by nabasu · · Score: 1

      Woops. Pardon for using the "bold"-tag...After ten years of using html, I'm still a newbie...

    4. Re:Realtime Ray Tracing and Multicore CPU's by Anonymous Coward · · Score: 0

      Consider that Quake IV uses at least two orders of magnitude less complex scene geometry than current-gen games, lower-resolution textures, and almost no shader effects...

    5. Re:Realtime Ray Tracing and Multicore CPU's by Anonymous Coward · · Score: 0

      Good. I'm sick of lousy support and unstable drivers for high end graphics cards. I would like to see all the complexity of graphics cards moved into standardized software which can run on a variety of multi-core general purpose processors. Then we can actually get some better stability.

    6. Re:Realtime Ray Tracing and Multicore CPU's by G00F · · Score: 1

      You can rarely upgrade CPU now days unless you bought what is considered a low end CPU. And even then you don't see the performance jump from say a Geforce 440MX to a Geforce 8500. (easy under $100)

      Most people would spend ~100+ to upgrade a CPU for small increases and then their mobo is locked with PCI or AGP? Just spend ~150-200 for new CPU/RAM/Mobo, upgrade video card later. (I've been upgrading people to AMD 690V chipset mobo, and it has given them a large enough increase they didn't need the new card and the ones that did just cost another $50-150)

      SO what I am saying is upgrading CPU is worthless unless upgrading the whole system, the video card is great, but at a certain point have to meak the leap because why continue to upgrade an aging and out of date system?

      --
      The spirit of resistance to government is so valuable on certain occasions that I wish it to be always kept alive
    7. Re:Realtime Ray Tracing and Multicore CPU's by Sancho · · Score: 1

      Well, I was talking about a world where we've moved on from off-CPU GPUs. Right now, yes, it's rare for people to upgrade the CPU without also upgrading many other components--but it's not always as dark an outlook as you suggest. The Core Duo, for example, is pin-compatible with the Core2Duo, and the performance difference is noticeable (at least on Macs.)

    8. Re:Realtime Ray Tracing and Multicore CPU's by jpmorgan · · Score: 1

      And how big were the textures? Raw computational performance is one thing but the often overlooked issue is memory bandwidth. GPUs don't access memory the same was as CPUs. It's harder to work with and not as efficient for general purpose computing as a CPU's multiple levels of caches, but it has much higher bandwidth.

      NVIDIA already sells cards with >1GB of memory. Try rendering a scene at 60FPS when you have 1GB of textures and geometry data.

    9. Re:Realtime Ray Tracing and Multicore CPU's by Anonymous Coward · · Score: 0

      Not really true.

      It uses shader effects on almost every surface (per-pixel lighting), uses stencil shadows (which raise geometric complexity by about 2 orders of a magnitude due to rendering shadow volumes) and still can max out a 512mb card on it's highest texture settings.

      Quake IV is not a current-gen game for sure, but you have to remember, it can actually use more resources already than current generation Xbox 360 or PS3 games.

  27. Drivers? by Anonymous Coward · · Score: 0

    I'm more interested in the future of open gpu drivers... that fancy hardware is next to useless without them unless you are pretty technical and don't mind poisoning your system, or use that other OS.

    ATI will win if nVidia doesn't follow suit. My next card will be ATI on OSS drivers unless nVidia does something similar to keep me interested.

    -AC

  28. How bout this by kurt555gs · · Score: 1

    Why not make one of the multiple cores a GPU, then the speed at which it communicates with the CPU will be at clock speed.

    Problem solved.

    Of course Nvidia will need to come up with a CPU.

    Cheers

    --
    * Carthago Delenda Est *
    1. Re:How bout this by BiggerIsBetter · · Score: 1

      Of course Nvidia will need to come up with a CPU. No they wouldn't. They could easily use a licensed PowerPC core, or one of the open Sparc cores.

      Just means they'd have to forgo the Windows market...
      --
      Forget thrust, drag, lift and weight. Airplanes fly because of money.
  29. Re:Why wouldn't you have a gpu core in a multiple by Anonymous Coward · · Score: 1, Interesting

    I don't think you understand the difference between GPUs and CPUs. The number of parallel processes that a modern GPU can run is massively more than what a modern multi-core CPU can handle. What you're talking about sounds like just mashing a CPU core and GPU core together on the same die. Which would be horrible for all kinds of reasons (heat, bus bottlenecks and yields!).

    Intel has already figured out that for the vast majority of home users have finally caught on that they don't NEED more processing power. Intel knows they have to find some other way to keep people buying more in the future. How many home users need more than a C2D E4500? Will MS Word, web browser and an email client change that much in the next 3-5 years that will demand more horsepower that is available today?

    Then again, you might need 32 CPU cores on a single die if you want to run that AT&T browser ;)

  30. And as we all knew by aliquis · · Score: 2, Insightful

    Only Amiga made it possible! (Thanks to custom chips, not in spite of them.)

    It doesn't seem likely that one generic item would be better at something than many specific ones. Sure CPU+GPU would just be all in one chip but why would that be better than many chips? Maybe if it had RAM inside aswell and that enabled faster FSB.

    1. Re:And as we all knew by AKAImBatman · · Score: 1

      It doesn't seem likely that one generic item would be better at something than many specific ones.

      Combined items rarely are. However, they do provide a great deal of convenience as well as cost savings. If the difference between dedicated items and combined items is negligent, then the combined item is a better deal. The problem is, you can't shortcut the economic process by which two items become similar enough to combine.

      e.g.
      Combining VCR and Cassette Tape Player: Not very effective
      Combining DVD Player and CD Player: Very effective

      CPUs and GPUs are moving in the right direction to eventually merge (in much the same way as FPUs and SSE units merged with CPUs), but they simply aren't there yet. :-)
  31. Re:Why wouldn't you have a gpu core in a multiple by holophrastic · · Score: 1

    well, yeah, for sure. But I see that as only the first step. It's like the math-coprocessor step. My 32-core cpu has six graphics cores, four math cores, two HD video cores, an audio core, 3 physics, ten AI, and 6 general cores. But even that only lasts long enough to reach the point where mass production benefits exceed the specialized production benefits.

    It'll also be the case that development will start to adjust back towards the cpu. Keep in mind, I don't think even one game exists now that is actually built to be dependent on even two cores. We're still dropping video frames as a preference. I await the day when other things get dropped. Imagine where AI gets dumber on a slower machine. Or sound FX are reduced. Or any number of other code paths are eliminated. Hey, no ones even reducing the video quality for 10 of 20 frames. All things that become possible with poly-core machines. Obviously raytracing takes that concept even further.

    The trickle-down of core programming for many cores -- heh -- is the leaps-and-bounds concept that moves industries.

    Either way, I'm back to my tried-and-true statement that what brought about the computer world in the first place was the concept of shared resources -- which includes the cpu. The same thing will happen again. Because the alternative is rediculous. Do you want to play a game on your GPU for graphics, your Xonar for sound, your physX for physics, your AIntelligence for AI (I made that up), and have your cpu do nothing but handle keyboard input? That's just silly. And it reminds me of the days when music was produced differently than sound fx. A computer is not a whole bunch of individual components in one box. It's about the box that can do anything. And when it can't, it gets a little expansion card. That expansion card usually handles some external component, or does something particularly unusual.

    A GPU doesn't handle anything external, and certainly not something unusual. Every machine, always, at every moment, produces high-quality display elements. It's silly to make that a separate component.

    Also, look at the prices. A decent CPU is $100 - $300, and a decent GPU is $150 - $400. There's a lot of money there when combined into a $250 - $700 device. It'll also be great to spend more on my CPU that on my hard drives. What a concept.

  32. Algorithms for graphics don't need Pentium cores by Donkey+Kong+Cluster · · Score: 1

    It is very ridiculous, because if you can put 8 cores in a single dye, then you can put a lot more Multiprocessors then a current GPU already have. And this GPUs are very scalable and the software the runs in it are very simple, so you need simpler threads.
    And this is what happens. Current GPUs can run 512 threas in parallel. Suppose you have 8 core with Hyperthreading, you could run, squeezing everything, 16 threads top. And there isn't any 8 core for sale, isn't?

  33. Re:CPU based GPU will not work as good as long as by Culture20 · · Score: 1

    If you have Eight or X Cores, Couldn't one or two (or X-1) be dedicated to run MESA (or a newer, better, software GL implementation)? IIRC, SGI's linux/NT workstation 350's had their graphics tied into system RAM (which you could dedicate huge amounts of RAM for), and they worked fine.

  34. SIMD vs. MIMD by MOBE2001 · · Score: 1

    Nvidia makes SIMD (single instruction, multiple data) multicore processors while Intel, AMD and the other players make MIMD (multiple instructions, multiple data) multicore processors. These two architectures are incompatible, requiring different programming models. The former uses a fine grain approach to parallelism while the latter is coarse-grained. This makes for an extremely complex programming environment, something that is sure to negatively affect productivity. The idea that the industry must somehow resign itself to an uneasy marriage between the two approaches is nonsense. Logic dictates that universality should be the main goal of multicore research. The market is crying for a super fast, fine-grain and easy to program, MIMD multicore architecture that can handle any kind of parallel computing task. Neither Nvidia, Intel, AMD or the others even come close to delivering what the market wants. And as we all know, what the market wants, the market will get. So my point is that Nvidia should not rest on its laurels because their technology is bound to become obsolete as soon as someone figures out how to make the right multicore processor and kicks everybody's ass in the process. Read Nightmare on Core Street for a good analysis of where the changing multicore landscape is going. In the meantime, I advise everybody in the multicore business to thread carefully. Big money is in the balance. And I mean, BIG MONEY.

    1. Re:SIMD vs. MIMD by hackstraw · · Score: 2, Informative

      Nvidia makes SIMD (single instruction, multiple data) multicore processors...

      That is untrue. The Nvidia cuda environment can do MIMD. I don't know the granularity, or much about it, but you don't have to run in complete SIMD mode.

    2. Re:SIMD vs. MIMD by Anonymous Coward · · Score: 0
      For anybody that followed the link to the COSA bullshit, the following quote from the same author should put it in perspective...

      My goal is to use my understanding of the metaphorical texts to design and build a true artificial intelligence. The Christian AI! It is only a matter of time. When that happens, the Darwinian walls will come crumbling down like the old walls of Jericho. Sweet revenge.


      HTH.
    3. Re:SIMD vs. MIMD by Anonymous Coward · · Score: 0

      The 8800 GTX has 16 "multiprocessors", each containing 8 "stream processors". Each multiprocessor has an independent instruction decoder. Within a multiprocessor, the same instruction is executed for 32 threads at a time, which are pipelined into the 8 stream processors. A multiprocessor rapidly switches between different groups of 32 threads, each running a different instruction. That grouping into chunks of 32 is common across all CUDA-capable cards, so that is probably the best measure of granularity.

    4. Re:SIMD vs. MIMD by Anonymous Coward · · Score: 0

      The problem is poorly phrased in terms of SIMD versus MIMD. CUDA is currently *very* SIMD oriented, but (AFAIK) it's not the SIMD of old, where I stood for Instruction, not Instructions. With an 's'. Anyway, I'm sure it will turn more MIMD in the future. But that's not the real problem.

      The real problem is the GPU architecture is *extremely* stream oriented, i.e. based on what data you're processing now, the hardware can fetch the next set on time to feed the processing units. When you start doing general purpose MIMD, you get pointers, and all that goes out the window. So you need a big fast cache and hope for the best. If you don't, it doesn't matter if you got a 256 bits wide bus, or that you have super fast GDDR3; the latency will kill you.

      So they can just add cache, you say? Well, yes. But cache is expensive, transistor wise. 6 per bit. A 9800 GTX has 128 stream processors. To give them 256k of cache each (a rather pitiful amount by today's standards), that works out to 1,610,612,736 transistors. As in more than 1.5 billions. That's more than twice as much as the 9800 GTX itself, and I'm not even counting the plumbing, which can easily add 10-20%. Moore's law will come to the rescue of course. 3 years from now. They can also reduce the number of PUs. But there's catch: all that is of zero use for, well, what GPUs do: rasterisation.

      So that's the trouble nVidia is in: they have a processor that's totally optimized for, and very very good at, a (very) small number of tasks, and absolute cr*p at everything else. If they try to change it, they loose to whoever puts out a GPU that doesn't and use the transistors for things useful to rasterisation. But if they don't, there's Intel and AMD breathing down their neck, adding cores like there's no tomorrow because that's the only way they can increase the performance of their CPUs and spend their transistor budgets. Sooner or later somebody's gonna decide, hey, let's use a smarter rendering algorithm, so we don't need that much bandwidth, but does use the 16 (?) smarter cores we have, and we get better quality to boot, and screw the GPU. It might be raytracing. It might be something else. But it will happen. And then nVidia's f*cked.

      Cheers,

      Matthieu

  35. Re:Why wouldn't you have a gpu core in a multiple by iabervon · · Score: 1

    I think the interviewer wasn't asking the right questions. His answer was for why you can't replace a GPU with an N-core CPU, not why you wouldn't put a GPU on the same die with your CPUs. I think his answers in general imply that it's more likely that people will want GPU cores that aren't attached to graphics output at all in the future, in addition to the usual hardware that connects to a monitor. I wouldn't be surprised if it became common to have a processor chip with 4 CPU cores and 2 GPU cores, and also have a graphics card with another GPU or 2 in addition to video output.

    He is right that having a 16-core CPU won't do a number of common tasks efficiently, compared to a single massively-SIMD core.

  36. Re:Why wouldn't you have a gpu core in a multiple by bhima · · Score: 1

    I was under the impression that optimal bus design used to different but that was sort of going away with the move to multi core designs.

    --
    Nothing in the world is more dangerous than sincere ignorance and conscientious stupidity.
  37. Re:Why wouldn't you have a gpu core in a multiple by svnt · · Score: 1

    The five year window might not be in the cards, but I've got two words for you: ray tracing.

    Pretty much the only way to continue Moore's Law that I can see is via additional cores. If you had 128 cores, you would no longer care about polygons. Polygons = approximations for ray tracing. Nvidia = polygons.

  38. Re:Why wouldn't you have a gpu core in a multiple by gbjbaanb · · Score: 1

    Supreme Commander is the game that requires 2 cores (well, ok you can drop the frame rate, polygon levels and other fidelity settings of course. Nobody would ever release a game that couldn't be played on a single core machine)(not yet at least).

    I think, considering the diminishing returns from adding cores, that adding specialised units on die would make sense. Look at how good a GPU version of folding@home is, and think how that kind of specialised processign could be farmed off to a specialised core. Not necessarily for graphics as I think Nvidia will continue to sell better and better graphics cards.

    If the die had the co-processor on it, and CPU extensions to support it, then compiler writers would use it and some processing tasks could fly along.

    And the reason why wouldn't they put these things into the existing CPU cores is probably complexity. A dedicated core must be easier to design and develop that bloating existing ones with added features and extensions.

  39. Re:Why wouldn't you have a gpu core in a multiple by Anonymous Coward · · Score: 2, Insightful

    I think a better question is "Why wouldn't we have a separate multi-core GPU along with the multi-core CPU?" While I agree that nVidia is obviously going to protect it's own best interests, I don't see the GPU/CPU separation going away completely. Obviously there will be combination-core boards in the future for lower-end graphics, but the demand on GPU cycles is only going to increase as desktops/games/apps get better. However, one of the huge reasons that video cards are a productive industry is that there are plenty of high-end graphical demands out there, from hardcore gamers to Autocad applications. Ever seen the number of cycles/graphical processesing power it takes to run a digital 911 map? Unbelievable!

    Seriously, if there is anything that history has taught us, it's that there's room for the integrated (low-end) and dedicated (high-end) graphics at the same time, as they server different niches.

    Oh, and never get involved in a land war in Asia ;-)

  40. Re:Why wouldn't you have a gpu core in a multiple by drinkypoo · · Score: 1

    I think it's fairly clear that GPUs will stick around until we either have so much processing power and bandwidth we can't figure out what to do with it all, at which point it makes more sense to use the CPU(s) for everything, or until we have three-dimensional reconfigurable logic (optical?) that we can make into big grids of whatever we want. A computer that was just one big giant FPGA with some voltage converters and switches on it would make whatever kind of cores (and buses!) it needed on demand. Since we're not Buck Rogers and this ain't the 24th century, GPUs will probably be here for a while.

    The real question becomes how all these cores will be connected to one another. The processes are getting finer all the time but clock rates are rising only gradually as are word lengths, and it seems highly likely that basically all computers will go multicore before we experience another quantum leap in performance that makes uniprocessor systems powerful again. So then, why not have two CPU cores and a GPU core on the same die?

    I would assume that he's guessing that the integrated systems will continue to be mid-range at best, and that those systems will continue to have only one core. I disagree on both counts, especially since mid-range systems with crappy onboard graphics are here around $500-600 today.

    --
    "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  41. Whole System Design by Archangel+Michael · · Score: 1

    I've actually been suggesting to my friends for a while, that you'll end up with about four or five different major vendors of computers, each similar to what Apple is today, selling whole systems.

    Imagine Microsoft buying Intel, AMD buying RedHat, NVidia using Ubuntu(or whatever) and IBM launching OS/3 on Powerchips, and Apple.

    If the Document formats are set (ISO) then why not?

    There will be those few that continue to mod their cars, but for the most part, things will be mostly sealed and only a qualified mechanic er technician would ever need to crack the case.

    I suspect that in the next 15 years or so, this is what you're gonna end up seeing.

    --
    Agent K: A *person* is smart. People are dumb, stupid, panicky animals, and you know it.
  42. overestimating the cost of ray tracing by j1m+5n0w · · Score: 2, Informative

    During the Analyst's Day, Jen-Hsun showed a rendering of an Audi R8 that used a hybrid rasterisation and ray tracing renderer. Jen-Hsun said that it ran at 15 frames per second, which isn't all that far away from being real-time. So I asked David when we're likely to see ray tracing appearing in 3D graphics engines where it can actually be real-time?

    "15 frames per second was with our professional cards I think. That would have been with 16 GPUs and at least that many multi-core CPUs â" that's what that is. Just vaguely extrapolating that into our progress, it'll be some number of years before you'll see that in real-time," explained Kirk. "If you take a 2x generational increase in performance, you're looking at least four or five years for the GPU part to have enough power to render that scene in real-time.

    Modern real-time ray tracers can get respectable performance without doing any sort of GPU-hybrid trickery, or requiring any hardware other than a fast CPU. For instance, try out the Arauna demo. (Dedicated ray-tracing hardware would be nice, but I'm not aware of any hardware implementation that has significantly outperformed a well-optimized CPU ray tracer. With the resources of a major chip manufacturer I don't doubt it could be done, though.) Arauna and OpenRT and the like might still be a little too slow to run a modern game at high resolution, but they're getting there fast.

    "People use ray tracing for real effects as well though. Things like shiny chains and for ambient occlusion (global illumination), which is an offline rendering process that is many thousands of times too slow for real-time," said Kirk. "Using ray tracing to calculate the light going from every surface to every other surface is a process that takes hundreds of hours."

    This is just plain ignorant. Naive, O(n^2) radiosity may take that long, or path tracing with a lot of samples per pixel, but a decent photon mapping algorithm shouldn't be anywhere near that slow to produce a rendering quality acceptable for games. Maybe "hundreds of seconds" might be a more plausible number. (Or less, if you're willing to accept a less accurate approximation.) Metropolis Light Transport is another algorithm, but I don't have a good notion of how fast it is.

    1. Re:overestimating the cost of ray tracing by argent · · Score: 1

      Not to mention that Philipp Slusallek was getting 15 FPS in 2005, with an FPGA that had maybe 1% the gates of a modern GPU, and ran at 1/8th the clock rate. It might not have been beating the best conventional raytracers in 2005, but it was doing them with a chip that had the clock rate and gate count of a processor from 1995.

    2. Re:overestimating the cost of ray tracing by j1m+5n0w · · Score: 1

      Dedicated ray tracing hardware would be nice. Unfortunately, I don't think any big hardware company is going to invest in the technology until they start feeling the competitive pressure from software renderers.

    3. Re:overestimating the cost of ray tracing by argent · · Score: 1

      Certainly not so long as David Kirk is in charge.

      Kirk vs Slusallek

  43. Re:Every time I walk out to my car I see raytracin by synth7 · · Score: 1

    Every time I walk out to my car I see raytracing.

    Actually, you don't. Raytracing is a mathematical model that attempts to simulate light behavior in reality. And, as is true for most simulations, it is a gross simplification of reality. The mathematical model used for approaching realism is irrelevant, just so long as the result is closer to the perceived goal.

    And, of course, we are assumng that modeling visual reality is the perceived goal, which it is not in many cases.

  44. Comments on AMD technology misleading by Anonymous Coward · · Score: 0

    He said that current ATI hardware cannot run C code, so the question is: has Nvidia talked with competitors (like ATI) about running C on their hardware? You can indeed run C on AMD's firestream processor using the Brook+ compiler http://ati.amd.com/products/streamprocessor/specs.html. There are issues, however. For instance, the firestream 9170 appears dedicated to computing. I'm not sure they have a processor that can do both graphics and general purpose computing with C. Makes me wonder about his other comments.

    It would be sad if his comments about AMD folding did pan out. It would have been wonderful to have a CPU and GPU chip communicating by HT.
  45. Re:Why wouldn't you have a gpu core in a multiple by Cedric+Tsui · · Score: 1

    Hmmm. That's interesting.
    You're right. Perhaps the CPU and the GPU are too different to play nicely on the same die.

    A little simpler then. If CPU processing power does continue to increase exponentially (regardless of need) then one clever way to speed up a processor may be to introduce specialized processing cores. The differences might be small at first. Maybe some cores could be optimized for 64bit applications while others are still backwards compatible with 32bit. (No. I have no idea what sort of logistical nightmare this would be. )

  46. Re:Every time I walk out to my car I see raytracin by argent · · Score: 1

    Actually, you don't.

    What, you're one of these heretics who doesn't realize that we're in an elaborate computer simulation?

  47. SoC is the future by davido42 · · Score: 0

    I think you will see nVidia licensing their IP to other chip companies, because like it or not, the push is always going to be toward cost reduction, power reduction, smaller form factors, and so on. This is true at all performance levels. For low-end systems, the multi-core CPUs may eat their lunch. The only thing that saves them is that graphics and video are data pigs, so the issue is more managing high bandwidth data flow than overall horsepower. They will still be around, but they may go the way of SGI.

    --

    BitWorksMusic.com -- odd tunes for odd times

  48. Re:CPU based GPU will not work as good as long as by Anonymous Coward · · Score: 0

    MESA is slow, and main RAM access is slow. More general purpose cores isn't the solution. That said, you could build onto the same silicon wafer one or more GPU cores with access to on chip graphics ram (which could double as extended cache or ram specifically for double/triple buffering).

  49. Re:Why wouldn't you have a gpu core in a multiple by blahplusplus · · Score: 1

    "Intel has already figured out that for the vast majority of home users have finally caught on that they don't NEED more processing power."

    I think the real big issue is that there are no killer apps yet (apps so convenient to ones life that they require more processing power).

    I think there are a lot of killer apps out there simply waiting for processing power to make its move, the next big move IMHO is in AUTOMATING the OS, automating programming, and the creation of AI's that do what people can't.

    I've been experimenting with automatic content generation for games and whatnot, and over time these same principles will spread into other areas, I doubt I'll see it in my lifetime but smart-systems are coming.

  50. Re:CPU based GPU will not work as good as long as by billcopc · · Score: 1

    As many pixels as they can possibly throw at me, that's how many.

    There are people who are perfectly happy with resolutions like 1024x768, good for them! Me, I was running that rez in the 486 days, and gaming it in the late 90's with Voodoo2 and the first GeForce.

    The fact that GPUs have scaled faster and larger than CPUs is proof to me that GPGPU is a good idea. I have a beefy PC, and the bulk of what I do involves image processing. If the GPU can do it 10 times faster for less money, that's an epic win and I say bring it on!

    --
    -Billco, Fnarg.com
  51. Eventually it will be all one processor by fluxburn · · Score: 0

    Eventually processors will have hundreds of cores, and will be the polar opposite of today's world of processor design.

  52. Re:CPU based GPU will not work as good as long as by maxume · · Score: 1

    I'm not saying that they should stick everything on one chip anytime soon, just that there is actually a limit somewhere in the medium term future where you start spending improvements somewhere other than raw performance. For casual users, that's really soon(because dpi isn't that important 3 or 4 feet away from your face, most people's eyes don't have the resolution for it to matter).

    --
    Nerd rage is the funniest rage.
  53. Re:Why wouldn't you have a gpu core in a multiple by Rockoon · · Score: 1

    Indeed. Procedural content is the future of gaming. That is so because the primary bottleneck for GPU's has always been memory bandwidth and will always be memory bandwidth unless a fundamentaly different (goodbye Mr Texture Bitmap) methodology is advanced. Cache misses got you down? No problem. Nextgen games wont be missing the cache on texture reads because nextgen games wont have textures. They will have shaders which sample a procedural function on a per-texel bases.

    --
    "His name was James Damore."
  54. Integrated Central Unit Processor by SpeedyDX · · Score: 1

    ICUP

    My inner child needed release. Sorry.

    1. Re:Integrated Central Unit Processor by fractoid · · Score: 1

      I prefer the Distributed Central Unit Processor, it's got a better interface (once you get past the logon screen, damn those little clippy things :P )

      --
      Rampant carbon sequestration destroyed the Dinosaurs' tropical paradise. I'm here to help repair the damage.
  55. Re:Why wouldn't you have a gpu core in a multiple by complete+loony · · Score: 1

    I think by 2012 or 2020 or so, it would be far more likely that all code will be compiled to an abstract representation like LLVM. With a JIT engine that will continuously analyse your code, refactoring into the longest execution pipeline it can manage, examine each step of that pipeline and assign each step to the single threaded CPU style or stream processing GPU style core that seems most appropriate.

    I don't think this will be done at a raw hardware level. I imagine the optimisation process will be far too complex, and be continually improved by researchers. I can't imagine being able to implement anything like this in silicon.

    --
    09F91102 no, 455FE104 nope, F190A1E8 uh-uh, 7A5F8A09 that's not it, C87294CE no. Ah! 452F6E403CDF10714E41DFAA257D313F.
  56. Re:Why wouldn't you have a gpu core in a multiple by mrbluze · · Score: 1

    A logical improvement at this point would be to start specializing cores to specific types of jobs. As the processor assigns jobs to particular cores, it would preferentially assign tasks to the cores best suited for that type of processing. .. and call it the Amiga.
    --
    Do it yourself, because no one else will do it yourself. [beta blockade 10-17 Feb]
  57. CUPCHICKS by vikstar · · Score: 1

    Central Unit Processor - Complex Homogeneous Integrated Component Kernal System. ... so does mine.

    --
    The question of whether a computer can think is no more interesting than the question of whether a submarine can swim.
  58. how about "kinda" transmeta? by Z80a · · Score: 1

    well,there is a LOT of computing power on that shader units,much more than on a intel cpu for example

    but using it is the main problem,as they cant quite execute x86 code

    so how about using a specializated Dynamic recompilation cpu that do all the conversion/branch prediction/out of order execution/loop unrolling work by software,but with a highly specialized opcode set to do the job instead of a generic cpu?

    even if you lose half of the raw power of the units,you will still have much more processing power than the current cpus on the market

    1. Re:how about "kinda" transmeta? by t_little · · Score: 1

      That sort of dynamic recompiler, branch predictor, and out-of-order executor is what makes up about 90% of the complexity of a CPU - and you'd need to devote silicon to that, as well as a full-speed cache. So you'd have to replace vertex processors to make room for all that.

      What you'd end up with would be pretty similar to a current multi-core CPU, only possibly with a different trade-off between number of cores and average processing power per core per cycle. Definitely not as good for typical sequential, dependent CPU tasks as a current CPU. Probably better for parallel tasks, but definitely not as good at massively parallel tasks as a current GPU.

      --

      -- Tim Little

    2. Re:how about "kinda" transmeta? by SuiteSisterMary · · Score: 1

      Didn't you just re-invent the Itanium? You know, more or less.

      --
      Vintage computer games and RPG books available. Email me if you're interested.
  59. Re:CPU based GPU will not work as good as long as by TheNetAvenger · · Score: 1

    truthfully only real application for the gpu/cpu hybrid would be in laptop use where they can get away with using lower end gpu chips

    These kind of comments scare me, is everyone new, or just not paying attention.

    The PCI Express 16x has amazing overhead even on the most hardcore gaming today, expecially when utilizing SLI/Crossfire configurations.

    As for this ONLY BEING for LOW END? Did you ever read the PCI/AGP/PCI Express specifications?

    Just because RAM sharing was ONLY used in low end on board GPUs doesn't mean that if system RAM is fast enough (or managed properly) that it can't be use in gaming.

    This is where a normal Vista plug would come in, as people don't realize this is ONE OF THE GOOD things of the Vista WDDM (besides being the first OS driver model providing GPU preemptive multitasking). Vista also intelligently manages and virtualizes VRAM to system RAM, and uses AGP/PCI Express techniques so that the memory space is seen by all applications as available VRAM.

    Vista then uses the new memory prioritizers for VRAM operations as well and will swap low performance needs textures between system and VRAM transparently. This not only works, but with the AGP/PCI Express concepts is fast enough that it gives games more VRAM room for better texture quality at the same performance.

    This is EXACTLY how Vista can take a HIGH PERFORMANCE DEDICATED Video Card with 128mb or 256mb of VRAM and allow applications to use textures that exceed the VRAM of the card transparently with no performance loss. In addition, because of the multi-core/multi-GPU tasking nature of Vista, SEVERAL 3D applications can run at once expecting full control the Video card and VRAM at the same time, and not even realize Vista is making this happen.

    The old 'shared system' RAM concepts were SLOW, but mainly because the low end GPUs paired with the mainboards providing this were SLOW.

    Vista does this even with new NVidia 9800s and gives it higher texture quality and performance over XP, so this is NO LONGER a low end concept.

    So I ask again, does anyone even pay attention to what is being done in the GPU/OS realm, or do you just ignore crap because MS created it? If so, no wonder the Linux composers and even the OS X composer and driver technologies are half a decade behind Vista, and people don't even 'get it' until the market notices this and Linux and OS X are screwed and trying to catch up... (This is such a sad pattern, as Linux could have been a 'leader' instead of copy follower of OS X and NT technologies, which is all that seems to happen beyond basic kernel optimizations.)

  60. Re:CPU based GPU will not work as good as long as by TheNetAvenger · · Score: 1

    CPU based GPU will not work as good as long as they have to use the main system ram also heat will limit there power. NVIDIA should start working HTX video card so you can the video card on the cpu bus but it is on a card so you put ram and big heat sinks on it.

    I agree that GPU/CPU will need to be integrated at a lower level than current technologies, but not in the near future as PCI Expres 2.0 doesn't even benefit yet.

    However, don't discount System and VRAM becoming a unified concept. This has already happened with Vista, and is one of the things that gave NVidia and ATI such trouble with the WDDM, as OS handled VRAM virtualization was not something either company was use to coding for in high performance instances. (Also the preemptive multitasking GPU nature of Vista is another shiney new bell they had to hurdle.)

    Vista (now with current drivers) shows that the WDDM and using memory prioritization even for Gaming and System/VRAM virtualization works, and works well. This allows users with low VRAM amounts to crank up texture sizes that exceed the Video card's RAM, without performance loss.

    Additionally as multi-3D application concepts become more standard, system RAM virtualization will be as necessary as HD or other virtualization/paging techniques are in other aspects of computing.

    Not a lot of users run more than one game on the screen at once, but with new application UI concepts like WPF.NET in Vista that inherently doest 3D, this will become common in all Oses. Even running a game in a Window on Vista while Aero is active, is using the multi-3D paradigm, and with the shared texture nature of the Vista composer it can do this with no FPS loss, even with more than one 3D game on screen active at a time (and even Exposed or Flip3D'd on the screen, something that would bring other composers or VIdeo subystem/driver models to a crawl in other consumer OSes.

    Microsoft somehow realized this (actually the XBOX team realized this, and shoved to get these features to be a core of WDDM in Vista.)

    I'm surprised others don't get this, and don't realize that besides the media/SlashDot Vista slam, Vista has done this right, and does it well. Putting it several years ahead of OS X, Linux, etc.

    If people think NVidia or ATI are looking to the OS model of *nix, or OSX for future directions, they are fooling themselves. Microsoft with Vista and especially Microsoft's XBox graphic researchers are mapping the road of the future. The XBox 360 team already has done this with Vista, and the VRAM virtualization concepts, and the unified shader push, and the next big thing will be an extention of this and from them.

  61. Re:Why wouldn't you have a gpu core in a multiple by sweet_petunias_full_ · · Score: 1

    "What you're talking about sounds like just mashing a CPU core and GPU core together on the same die. Which would be horrible for all kinds of reasons (heat, bus bottlenecks and yields!)."

    One would hope that bus bottlenecks would not be a problem if the cores don't have to go off-chip to access a common bus. The bus could be a wider, dedicated video bus if kept on on-chip, and would enjoy vastly reduced latency. It would use less overall power than having an external mobo chip coordinate the handshaking, not to mention the signals that have to be sent to a physical video card. The die itself would be larger than a 1 CPU die, but that's not a problem anymore.

    The problem now is what the hell to do with all of the transistors we can cram onto a chip. At some point people won't notice a difference between a 16 core unit and a 32 core unit connected to the same memory. Might as well stick a GPU in there. Heck, throw in two for good measure. If there's a defect in one, you can disable that and still sell the part as the economy version. If that means 2-4 fewer CPUs per die then so be it.

    It's not all as horribly problematic as you make it sound. There are some significant savings that can be reaped by putting the two together. Right now they claim to be doing it as a way of producing a cheaper system with slightly slower video-card performance, but there are some opportunities to actually improve performance by doing this, and the benefit of lower power consumption for the overall system is definitely in the plus column.

    Actually, if anybody can find the "right" way of doing this, it will probably be AMD/ATI... and then everybody else will just copy what they did, tweak it slightly and give it a different name.

    --
    You can't send a takedown notice to an already printed newspaper.
  62. Re:Why wouldn't you have a gpu core in a multiple by dreamchaser · · Score: 1

    "computers are finally able to use any number of cores each to their maximum potential at the same time."

    The main problem is the software can't use an arbitrarily high number of cores, not the 'computers'. We could put out 64 core PC's (say 16 quad cores) but software just isn't written to take advantage of that level of parallelism.

  63. Building CPUs isn't that hard, but GPUs now... by Intelista · · Score: 1

    Our friend from Nvidia mocks Intel for thinking it GPUs are easy even as he makes statements like "making a core isn't all that hard". And it's true: making a core isn't all that hard. Unless it has to perform well, be proven compatible to all kinds of bizarre existing legacy, be cheap, be reliable, be on time... and in short satisfy all the other requirements on a CPU design team for a mainstream product. This guy is completely talking out of his ass on that front. That said, it will be interesting to see whether Larrabee proves to be an innovative new take on how to do graphics, or in fact a CPU architect's warped view of how to solve a graphics problem. Either is quite possible. I think the conjecture that a Larrabee die is going to cost $1000 to make is probably ridiculous... and even if he's talking about a sale price it's pushing it. I think the observation that CPU and GPU will probably remain pretty separate except in low cost implementations is probably right. At the flop level we're talking about, memory latency and bandwidth as well as sheer transistor count makes it difficult to believe that partitioning the CPU and GPU logic in the same die makes sense. The dominant interactions are the memory-compute bandwidth, not the CPU-GPU logic bandwidth, so the current partitioning makes sense as far as I can see.

    --
    And then there were none.
  64. Re:CPU based GPU will not work as good as long as by TheLink · · Score: 1

    "Vista also intelligently manages and virtualizes VRAM to system RAM"

    It's still going to be than the real thing. Show me how fast Vista runs Crysis on a fast 256MB/512MB card compared to a fast 1GB card at high res with AA on.

    And that virtual video RAM seems to mean that if you have 2GB of real RAM, Vista takes 1GB for the O/S, and 512-1GB for the vidcard and that leaves you with nothing much left over for the game.

    As long as the O/S is still 32bit you'll also have the problem of only 4GB of easily addressable space. So no point installing 4GB of system RAM on your board. You might as well stick to 2 or 3GB system RAM and spend the extra money on a video card with real video RAM.

    So it's a good idea to ignore that crap till 64 bit O/Ses become mainstream.

    Lastly, I don't seem to have problems running multiple 3D apps at the same time on Win2K. So what's the big deal?

    --
  65. Yes and no. by DrYak · · Score: 1

    Yes, nVidia's binary drivers are available on linux and the recent one include CUDA support (even the Beta 2.0 is available on Linux).

    And in fact Linux is a much better environment for developing CUDA. (Ability to setup headless server, numerous way to interact with said server).
    That's what I'm doing at work currently.

    *BUT*
    No, there are still no decent open source drivers for nVidia yet. Thanks to the lack of collaboration from nVidia, Project Nouveau has to go through the difficulties of reverse engineering everything and because of that colossal amount of work, still hasn't produced drivers that support all nvidia chips + 2D acceleration + 3D acceleration + GPGPU support.

    So there's no CUDA for you on linux if you aren't using anything beside the few supported architecture, and even then you may have to go through some patching if your compiler *minor version* isn't supported yet (CUDA 1.0 doesn't support GCC 4.2.x out of the box).

    Want to play with CUDA on a Sun Niagara II or on a Tilera's TileExpress64 based computer ? Sorry. CUDA only supports x86 and x86_64 architectures. No SPARC and no MIPS for you.

    * On the other hand *
    Concurrents' GPU have limitations too.
    Although Brook is open source, the CAL technology used by ATI as a backend isn't yet, and you have to rely on the OpenGL back-end. And not all ATI radeons have good open-source drivers supporting enough 3D functionality to get good support on Brook. (r300 driver fails with latest GLSL-based brook. radeonhd driver doesn't have 3D acceleration yet).
    But at least, ATI/AMD is (slowly) trying to help the development of open source drivers.

    Haven't tried Brook on Intel GPU yet.

    --
    "Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
  66. Re:CPU based GPU will not work as good as long as by default+luser · · Score: 1

    truthfully only real application for the gpu/cpu hybrid would be in laptop use where they can get away with using lower end gpu chips

    You only need so much power for %95 of users. And thanks to the introduction of PCIe, most desktop systems come with an x16 expansion port, even if the chipset has integrated graphics. Further, there's a push from ATI and Nvidia to support switching to the IGP and turning off the discrete chip when you're not playing games, which cuts down on power used when you're at the desktop. Isn't technology great?

    And further, there's more out there in the land of IGPs than just crappy Intel. AMD's new 780 chipset actually packs a real HD 3450 core onboard (not cut-down at all), and Nvidia has a new IGP on the horizon that's even faster. Both of these will be more than enough power to run Aeroglass, and you can even run recent games at low settings.

    Now, what's the difference between affordably integrating graphics in the chipset and affordably integrating it in the same die as the CPU? Maybe one process revision, tops. And that's the beauty of it: integrated graphics will improve in performance over the years just like performance improves in IGP and discrete chipsets now; it's only a matter of time before ALL users have PCs that come with the power of quad-Crossfire/SLI ultra setups.

    --

    Man is the animal that laughs.
    And occasionally whores for Karma.

  67. Re:CPU based GPU will not work as good as long as by TheNetAvenger · · Score: 1

    It's still going to be than the real thing. Show me how fast Vista runs Crysis on a fast 256MB/512MB card compared to a fast 1GB card at high res with AA on.

    Of course more VRAM gives games more room and Vista more room, who said it didn't? AA isn't always the best example though, as most implementations use selective AA, instead of full image sub rendering that requires large chunks of RAM.

    (PS Crysis isn't a full DX10 game. When the game says DX10 only, then you will see the performance benefits of DX10, for things like 16X Full Image AA at resolutions higher than 1920x1200.)


    And that virtual video RAM seems to mean that if you have 2GB of real RAM, Vista takes 1GB for the O/S, and 512-1GB for the vidcard and that leaves you with nothing much left over for the game.


    Vista takes 1GB for the OS? Are you high or just painfully misinformed? Vista manages RAM between the OS/Applications/VRAM very well, and if the game is requesting more system RAM, Vista gives it up, no questions asked. You act like Vista is choking applications to do what it does. Wrong.

    Also Vista don't chunk 1GB for the OS, and when running games virtually anything non-essential is paged out if necessary as the game/application requests RAM. Go read up on NT memory management, then read the Vista kernel additions on memory prioritization and additional scheduling.


    As long as the O/S is still 32bit you'll also have the problem of only 4GB of easily addressable space. So no point installing 4GB of system RAM on your board. You might as well stick to 2 or 3GB system RAM and spend the extra money on a video card with real video RAM.


    Again, who SAID not to buy a video card with the most VRAM you can? This is NOT THE POINT, and NOT THE SUBJECT HERE. Vista's ability to virtualize VRAM, is like OSes virtualizing System RAM to a pagefile. It is nice to have if needed, as the applications don't fail with a lack of memory. The only difference here is Vista's VRAM virtualization performance (if used) is very tiny, as it writes directly to the GPU via AGP/PCI Express concepts.


    So it's a good idea to ignore that crap till 64 bit O/Ses become mainstream.


    Um, most everyone I know runs Vista x64. It has more driver support than Windows XP 32bit, and is a virtual carbon copy of x32 Vista unlike x64 XP where several features were never added to the x64 version.

    And this list of people includes everyone from large clients deploying Vista x64 on desktops to any serious gamer or 'tech minded' home user.

    Do you not realize that there are more copies of Vista x64 running than all versions of Linux and OS X combined? (This isn't even touching x32 Vista.) So I would make a good argument that x64 Vista is a mainstream OS. Also unlike OS X that still has hybrid 32bit aspects, Vista x64 is a ground up 64bit OS, with all aspects of the WinXX subsystem being 64bit as well.

    Also if you happen to google Vista x64, it outperforms even XP 32bit easily, and is the OS of choise for gamers, even running 32bit applications on the OS, as the 64bit OS, that can use the 64bit registers, address space, and driver memory transfers does out perform the x32bit version of Vista, even with 32bit applications.


    Lastly, I don't seem to have problems running multiple 3D apps at the same time on Win2K. So what's the big deal?


    Really? Ok, open 2, 3, or even 5 3D applications in a Window, so they are running side by side concurrently. Make sure a couple are hard hitting games, heck even Crysis. How well does that work for you? And use your choice of 3D technology, DirectX or OpenGL... Heck even set 2 of the 3D applications to 50% transparency using the GDI+ layers introduced in Win2K. How is performance now?

    On any OS prior to Vista, 3D applications will starve for GPU time and GPU RAM. PERIOD. OpenGL TRIES to cooperative multi-task, but it all up to the applications to yield RAM and resources. DirectX also tries to yeild in a cooperat