Nvidia Working on a CPU+GPU Combo

Heard This One Before by eldavojohn · 2006-10-20 04:26 · Score: 4, Interesting

Sounds like Nvidia is just firing back at the ATI-AMD claim from two months ago. Oh, you say that you're integrating GPUs and CPUs? "Well, we can say that too!"

What I don't understand is that I thought GPUs were made to offload a lot of graphics computations from the CPU. So why are we merging them again? Isn't a GPU supposed to be an auxillary CPU only for graphics? I'm so confused.

What I'm not confused about is the sentence from the above article:

DAAMIT engineers will be looking to shift to 65 nanometre if not even to 45 nanometre to make such a complex chip as a CPU/GPU possible.

Oh, I've worked with my fair share of DAAMIT engineers. They're the ones that go, "Yeah, it's pretty good but ... DAAMIT, we just need more power!"

--
My work here is dung.

Re:Heard This One Before by Anonymous Coward · 2006-10-20 04:29 · Score: 0

I think it is because of the distance and to speed up shared memory or cache or something.
Re:Heard This One Before by everphilski · 2006-10-20 04:32 · Score: 4, Interesting

What I don't understand is that I thought GPUs were made to offload a lot of graphics computations from the CPU. So why are we merging them again?

a really, really fast pipe. It is a lot quicker to push stuff from CPU->GPU when they are on the same piece of silicon, versus the PCIe or AGP bus. Speed is what matters, it doesn't look like they are moving the load one way or another (although moving some load from CPU->GPU for vector based stuff would be cool if they had a general purpose toolkit, which I'd imagine one of these three companies will think about).
Re:Heard This One Before by LWATCDR · 2006-10-20 04:39 · Score: 3, Insightful

Well think of it like floating point.
At one time floating point was done by software it still is one some cpus.
Then floating point co-processors became available. For some applications you really needed to speed up floating point so it was worth shelling out the big bucks for a chip to speed it up. This is very similar to what we have now with graphics cards.
Finally CPUs had floating point units put right on the die. Later DSP like instructions where added to CPUs.

We are getting to the point where 3d graphics are mandatory. Tying it closer to the CPU is now a logical choice.

--
See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
Re:Heard This One Before by Ryan+Amos · 2006-10-20 04:40 · Score: 2, Interesting

It is; but if you combine them on the same die with a large shared cache and the on-chip memory controller... you can see where I'm going with this. Think of it as a separate CPU, just printed on the same silicon wafer. That means you only need 1 fan to cool it and you can lose a lot of heat producing power management circuitry on the video card.

Obviously this is not going to be ideal for high end gaming rigs; but it will improve the quality of integrated video chipsets on lower end and even mid range PCs.
Re:Heard This One Before by novus+ordo · 2006-10-20 04:41 · Score: 1

They are merging them, but I doubt one cpu will do both. This will be more like 2 cores where 1 core does sequential instruction processing, and other is a more vector processor where you have to do the same operation over a lot of data. Sounds pretty interesting since you will be able to use the vector processor as a more general purpose cpu to do other things. The challenge will be solving the problem of cores competing for the bus bandwidth.

--
"You're everywhere. You're omnivorous."
Re:Heard This One Before by Ironsides · 2006-10-20 04:46 · Score: 2, Insightful

Then why not just have some connections that come straight out of the CPU and go directly to a graphics card, bypassing any bus entirely?

--
Fly me to the moon Let me sing among those stars Let me see what spring is like On jupiter and mars
Re:Heard This One Before by drinkypoo · 2006-10-20 04:49 · Score: 3, Interesting

ATI/AMD is working on that right now. I think it comes after the next rev of hypertransport.

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Re:Heard This One Before by purpledinoz · 2006-10-20 04:51 · Score: 4, Interesting

It seems like this type of product would be marketed towards the budget segment, which really doesn't care about graphics performance. However, the huge advantage of having a GPU on the same silicon as the CPU would be a big boost in performance. The low cost advantage has already been attained with the integrated graphics chipsets (like nForce). So that would mean this might be marketed towards the high-performance crowd.

But I highly doubt that nVidia will be able to get a CPU out that out-performs an Intel or AMD, which the high-performance junkies would want. Intel and AMD put a HUGE amount of money into research, development, and fabrication to attain their performance. This is going to be interesting to watch. Hopefully nVidia doesn't dig themselves into a hole with this attempt.
Re:Heard This One Before by mikael · 2006-10-20 04:53 · Score: 1

What I don't understand is that I thought GPUs were made to offload a lot of graphics computations from the CPU. So why are we merging them again? Isn't a GPU supposed to be an auxillary CPU only for graphics? I'm so confused.

GPU's are so powerful now, that some of the latest high-end scientific visualisation applications will actually do calculations on a supercomputer, transfer the data across to a PC, and then use the CPU to process the data so it can be visualised on the GPU in real-time. Similarly for game software (the physics engine will run on the CPU or physics card, then send the data over to the GPU). Engineers will always try and remove the bottleneck in performance whether its in the network, CPU, data bus, or GPU.

--
Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads
Re:Heard This One Before by TheRaven64 · 2006-10-20 04:55 · Score: 5, Informative
It's not just floating point. Originally, CPUs did integer ops and comparisons/branches. Some of the things that were external chips and are now found on (some) CPU dies include:
1. Memory Management Units. Even in microcomputers there are some (old m68k machines) that have an off-chip MMU (and some, like the 8086 that just don't have one).
2. Floating Point Units. The 80486 was the first x86 chip to put one of these on-die.
3. SIMD units. Formerly only found in high-end machines as dedicated chips, now on a lot of CPUs.
4. DSPs. Again, formerly dedicated hardware, now found on-die in a few of TI's ARM-based cores.
A GPU these days is very programmable. It's basically a highly parallel stream processor. Integrating it onto the CPU makes a lot of sense.
--
I am TheRaven on Soylent News
Re:Heard This One Before by Do+You+Smell+That · 2006-10-20 05:12 · Score: 3, Informative

What I don't understand is that I thought GPUs were made to offload a lot of graphics computations from the CPU. So why are we merging them again? Isn't a GPU supposed to be an auxillary CPU only for graphics? I'm so confused.
You're partially right. GPUs were made to execute the algorithms developed for graphically-intensive programs directly in silicon... thus avoiding the need to run compiled code within an operating system, which entails LOTS of overhead. Being able to do this directly on dedicated hardware (with entirely different processor designs optimized for graphical computations)makes it possible to execute ALOT more calculations per second. You can really see the difference if you, for instance, use DirectX on two nearly identical video cards; one with hardware based DirectX, the other with it running as software.

Moving it right up next to the CPU will allow the data to flow between the two alot faster than currently where it has to go over a bus... they can finally get rid of the bottlenecks that have been around since the two were seperated.

--
I'm not good at making signatures...
Re:Heard This One Before by *weasel · 2006-10-20 05:16 · Score: 1

Having a specialized GPU made sense when processors were single-core.

Now that processors have multiple cores, many of which are left looking for a job to do - it makes sense to bring the GPU back to the main die.

The result will produce an immediate performance boost for Joe Sixpack, at lower manufacturer cost.

--
// "Can't clowns and pirates just -try- to get along?"
Re:Heard This One Before by arth1 · 2006-10-20 05:24 · Score: 4, Interesting

At one time floating point was done by software it still is one some cpus.
Then floating point co-processors became available. For some applications you really needed to speed up floating point so it was worth shelling out the big bucks for a chip to speed it up.

Then people started using floats for the convenience, not because the accuracy was needed, and performance suffered greatly as a result. Granted, there are a lot of situations where accuracy is needed in 3D, but many of the calculations that are done could be better done in integer math and table lookups.
Does it often matter whether a pixel has position (542,396) or (542.0518434,395.97862456)?
Using a lookup table of twice the resolution (or two tables where there's non-square pixels) will give you enough precision for pixel-perfect placement, and can quite often speed up things remarkably. Alas, this, and many other techniques have been mostly forgotten, and it's easier to leave it to the MMU or graphics card, even if you compute the same unneccessary calculations and conversions a million times.

Fast MMUs, CPU extensions and 3D graphics routines are good, but I'm not too sure they're always used correctly. Does a new game that's fifty times as graphically advanced as a game from six years ago really need a thousand times the processing power, or is it just easier to throw hardware at a problem?
Re:Heard This One Before by Neil+Hodges · 2006-10-20 05:25 · Score: 1

Wasn't that what the old VL Bus was for? It connected the i486 to the video card, ne?
Re:Heard This One Before by NerveGas · 2006-10-20 05:32 · Score: 4, Interesting

I don't think that the CPU->GPU pipe is any limitation. Going from AGP 4x->8X gave very little speed benefit, and on PCI-E connections, you have to go from the normal 16x down to a 4x before you see any slowdown.

Memory size and bandwidth are the usual limitations. Remember that if you want 2x AA, you double your memory usage, and if you want 4x AA, you quadruple it. So, that game that needed 128 megs on the video card, with 4x AA, can suddenly need 512.

steve

--
Oh, you're not stuck, you're just unable to let go of the onion rings.
Re:Heard This One Before by ultranova · 2006-10-20 05:32 · Score: 1

Then why not just have some connections that come straight out of the CPU and go directly to a graphics card, bypassing any bus entirely?

That's the whole point here: put the CPU and GPU right next to each other and wire them together. You see, the nearer they are to each other, the less time it takes for electric impulses to travel from one to the other, and the faster the communication is.

And, of course, the reason number one: you get a guaranteed GPU sale for each CPU sale - goodbye pesky competition ;).

--
Forget magic. Any technology distinguishable from divine power is insufficiently advanced.
Re:Heard This One Before by LWATCDR · 2006-10-20 05:39 · Score: 4, Interesting

Exactly.
I was using floating point as an example.
I don't know if Nvidia can pull this off without a partner. Too build a really good X86 core isn't easy. I wonder if they may not do a PPC or Arm core instead. That could make nVidia a big player in the cell phone and mobile video market. At some point there will be portable HD-DVD players.

My crystal ball says.
AMD will produce these products.
1. A low end CPU it integrated GPU for the Vista market. This will be a nice inexpensive option for home and corporate users. It might also end up in some set-top boxes. This will the next generation Geode.
2 A family of medium and high end video products that use Hyperchannel to interface with Opteron and Athlon64 line.

Intel will
Adopt Hyperchannel or reinvent it. Once we hit four cores Intel will hit a front bus wall.
Intel will produce a replacement for the Celeron that is Duo2Core with integrated graphics on one die. This is to compete with AMD new integrated solution.
Intel will not go in to the high end graphics line.

nVidia will fail to produce an X86+GPU to compete with AMD and Intel.
nVidia produces an integrated ARM+GPU and dominates the embedded market. Soon every cellphone and media player has an nVidia chipset at it's heart. ARM and nVidia merge.

Of course I am just making all this up but so what, electrons are cheap.

--
See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
Re:Heard This One Before by Anonymous Coward · 2006-10-20 05:40 · Score: 1, Interesting

Except for the bottleneck of memory bandwidth. Video cards have tremendous bandwidth on external cards, several times that of main memory in a computer. Putting them next to cpus and having to share main memory bandwidth may not be that great for certain workloads.
Re:Heard This One Before by LWATCDR · 2006-10-20 05:45 · Score: 5, Informative

I am an old school programmer so I tend to use ints a lot. The sad truth if that float using SSE are as fast and sometimes faster than the old tricks we used to avoid floats!
Yes we live in an upside down world where floats are faster than ints some times.

--
See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
Re:Heard This One Before by daVinci1980 · 2006-10-20 05:48 · Score: 2, Informative

Does it often matter whether a pixel has position (542,396) or (542.0518434,395.97862456)?

Yes. It absolutely matters. It makes a huge difference in image quality.

It matters when we go to sample textures, it matters when we enable AA, it matters.

--
I currently have no clever signature witicism to add here.
Re:Heard This One Before by Ironsides · 2006-10-20 05:49 · Score: 4, Insightful

And, of course, the reason number one: you get a guaranteed GPU sale for each CPU sale - goodbye pesky competition ;).

And vice versa. This might work where someone wants an embeded GPU for low to medium end graphics. However, I doubt gamers would like the idea of having to purchase a new CPU evertime a new GPU comes out and vice versa.

There's something to be said for physically discrete components.

--
Fly me to the moon Let me sing among those stars Let me see what spring is like On jupiter and mars
Re:Heard This One Before by nine-times · 2006-10-20 05:54 · Score: 2, Interesting

What I don't understand is that I thought GPUs were made to offload a lot of graphics computations from the CPU. So why are we merging them again? Isn't a GPU supposed to be an auxillary CPU only for graphics? I'm so confused.
You've already gotten some good answers here, but I'll throw in something that I haven't seen anyone else mention explicitly: GPUs aren't only being used for 3D animation anymore. GPUs started because, in order to make graphics for games better, you needed a specialized processor to handle the 3D calculations. However, GPUs have become, in some ways, more complex and powerful than the CPU, and as that has happened, other uses have been found for all that power. turns out that there are lots of mathematical transformations that are more efficient on the specialized graphics processors, including audio/video processing and some data analysis. Some clever programmers have already started offloading some of their complex calculations from the CPU to the GPU.
This has lead many people to wonder, why don't we bring some of the GPU advancements back to the CPU somehow, so that we aren't swapping data back and forth between the CPU and GPU, the system RAM and video RAM? Apparently, it's not a stupid question.
Re:Heard This One Before by FlyingGuy · 2006-10-20 06:02 · Score: 2, Insightful

As other have replied its all about the bus speed. The amount of time it takes to move data from chip to chip can insert enormous overhead.

Think back a little to the DEC Alpha. Now the ALPHA chip in and of itself was not really that remarkable. What was so VERY remarkable about the Alpha system was its bus switching. It was blazingly fast and could handle monster amounts of data from main memory to CPU, GPU, etc. The reason ( mostly ) that its now sitting in HP's vault is that the bus switch was/is really expensive to manufacture.

So the way you do this without haveing to build this very expensice bus architecture is to just put the GPU on the die with the CPU. Everything runs at the internal speed of the processor and its fairly inexpensive, comparatively, to build.

--
Hey KID! Yeah you, get the fuck off my lawn!
Re:Heard This One Before by arth1 · 2006-10-20 06:09 · Score: 2, Informative

Yes. It absolutely matters. It makes a huge difference in image quality.

No, it doesn't. Note that I said pixel, not coordinate.
The coordinates should be as accurate as possible, but having a pixel more accurate than twice the resolution of the display serves very little purpose.
Re:Heard This One Before by Anonymous Coward · 2006-10-20 06:13 · Score: 0

Remember to add a Physics engine, too! And an AI engine! And maybe even a kitchen sink.
Re:Heard This One Before by NerveGas · 2006-10-20 06:18 · Score: 1

I'm not sure if it does. Modern GPUs occupy a large die, run at slow clock speeds, and have an enormous transistor count. CPUs, on the other hand, have fewer transistors, smaller dies, and higher clock speeds.

Your CPU isn't going to work well at the 200-400 MHz of a GPU, and you're not going to make a huge GPU die run at 2 GHz to get your CPU to work well. I think that their CPUs will be closer to Via's C3 than a P4/Athlon 64.

--
Oh, you're not stuck, you're just unable to let go of the onion rings.
Re:Heard This One Before by dargaud · 2006-10-20 06:28 · Score: 1

I have a general question relating to this. How can you compile a program that stays compatible with all those kinds of processor 'options' ? It's been a while since I last did some compiler work (okay, 15 years), but how can you have a program that uses FPU instructions if there's an FPU coprocessor or on-die available and and still work if not, and so on for GPU, DSP, SIMD, etc... Do you have tests and branches each time one of those instructions should be used that uses a library if not available ? In that case it gives a lot of sense to use a distro like Gentoo to compile specifically for your processor (saves a lot of test/branches during program execution and a lot of space in the executable). Or I'm missing something.

--
Non-Linux Penguins ?
Re:Heard This One Before by Anonymous Coward · 2006-10-20 06:31 · Score: 0

And vice versa. This might work where someone wants an embeded GPU for low to medium end graphics. However, I doubt gamers would like the idea of having to purchase a new CPU evertime a new GPU comes out and vice versa.

There's something to be said for physically discrete components.

I'm sure the company is smart enough to do some simple marketing and pair up the right CPU with the right GPU for a particular buyer/budget.
Re:Heard This One Before by stevenm86 · 2006-10-20 06:38 · Score: 4, Informative

That's sort of the point of building them on the same die. You can't just run a wire to it, as it would be quite slow. Wires tend to have parasitic inductances and capacitances, so the setup and hold times on the lines would be too large to provide a benefit.
Re:Heard This One Before by julesh · 2006-10-20 06:40 · Score: 3, Interesting

Memory size and bandwidth are the usual limitations. Remember that if you want 2x AA, you double your memory usage, and if you want 4x AA, you quadruple it. So, that game that needed 128 megs on the video card, with 4x AA, can suddenly need 512.

Of course with the GPU integrated into the CPU you wouldn't need card-based RAM at all. You'd process your video on system RAM, and it would be as fast as the GPU accessing its own RAM at the moment is (not shit like shared-memory video cards are at the moment). This results in flexibility: if you're only using 128MB of RAM for your graphics, you can reuse the other 384MB as additional system RAM.
Re:Heard This One Before by coleblak · 2006-10-20 06:45 · Score: 1

You know, with a four core processor, that's not a bad idea. AMD GAMEX 3800+ You got your CPU, GPU, AI specific core, and your physics core. Just slap a peltier on that and a fucking ginormous HS/F.

--
77 HITS
Really Long Off Topic Combo
Re:Heard This One Before by Xichekolas · 2006-10-20 06:50 · Score: 0

While we're at it, I wouldn't mind if they threw in a Fruit Fucker ... you know, in case of Zombie Infestation.

--
Self-referential Sigs are cool on /. these days...

54
Re:Heard This One Before by Intron · 2006-10-20 06:58 · Score: 3, Informative

Typically, unimplemented instructions cause an exception. The operation can then be emulated in software.

--
Intron: the portion of DNA which expresses nothing useful.
Re:Heard This One Before by joto · 2006-10-20 07:07 · Score: 1

Now, I'm not a hardware engineer, but I fail to see why having different parts of a chip running at different clock-speeds is such a big issue. After all, what you need in your example above, is a circuit that counts to somewhere between 5 and 10 before you send out a clock-pulse. I don't think that's to complicated to put on a chip that is intended to include both a modern CPU and GPU.
Re:Heard This One Before by joto · 2006-10-20 07:25 · Score: 3, Informative
There are a couple of strategies:
1. Write a specialized program that will only run at a single computer, the one the programmer owns, as everything is specialized and optimized for his/her hardware. If other people needs to run the program, write a new one, or at the very least use some other compiler options.
2. Don't use non-portable features. Always go for the lowest common denominator.
3. Manually testing for existence of coprocessor at each FPU instruction, branch to emulator if FPU doesn't exist.
4. Same as above, but tests are inserted automatically by the compiler.
5. Test for existence of coprocessor at start of program execution. If FPU doesn't exist, dynamically replace all FPU instructions with branches to emulator routines
6. Same as above, but done automatically by the OS program loader
7. Make it mandatory for CPUs to: either support the FPU instructions (with a coprocessor if necessary); or to issue some sort of trap/interrupt that can be used by software such as the OS kernel/libc to use an emulator routine instead.
I believe the last option (option 7) is what x86/87 CPU/FPU combo actually used. That's why there is a coprocessor-prefix in front of the FPU instructions. They are not just unused opcodes.
Option 5 (and sometimes even 3) is commonly used for MMX/3dNOW/SSE/SSE2/SSE3/whatever instructions.
Unless they *really* need nonportable features, most programmers tend to go with option 2.
Re:Heard This One Before by joto · 2006-10-20 07:42 · Score: 1

Does it often matter whether a pixel has position (542,396) or (542.0518434,395.97862456)?

Per definition, no.
But does it matter whether the programmer has to create and populate lookup-tables, with lots of manual tweaking to find the right compromize between memory usage (cache issues), few branches (lookup must be fast), accuracy (lookup must be correct), etc... when the alternative is to simply call sin(x)?
Alas, this, and many other techniques have been mostly forgotten, and it's easier to leave it to the MMU or graphics card, even if you compute the same unneccessary calculations and conversions a million times.

Obviously, doing unnecessary calculations is unnecessary. But if the hardware has already circuits capable of calculating sin(x), I don't consider using those circuits as unnecessary calculations, even though it's possible to avoid the calculations by using a lookup-table. We are not in a competition to build programs the most suitable for the machines of 1960 here. If you want to compete, use real-world metrics like speed (wall-clock), memory usage, and perhaps even power usage. Your "unnecessary calculations" doesn't help reduce this.
Fast MMUs, CPU extensions and 3D graphics routines are good, but I'm not too sure they're always used correctly. Does a new game that's fifty times as graphically advanced as a game from six years ago really need a thousand times the processing power, or is it just easier to throw hardware at a problem?

Fifty times as graphically advanced? What does that mean? That there are 50 times as many pixels? That you need to read 50 times as many books about graphics programming to write a similar game? That the graphic library you use has 50 times as many source lines? Or that there are 50 times as many colours? Obviously the techniques that helped you write pong in the 1960s isn't what you are going to use to write a 3d-engine today. And obviously, if your computer has hardware to speed up graphics, if you need fast graphics, you are going to use them.
Re:Heard This One Before by 644bd346996 · 2006-10-20 07:44 · Score: 1

No. The 486 did not have any graphics-specific capabilities. VLB was just a faster bus than ISA, so the cpu could push more data to the card. VLB was quickly replaced by PCI and then the increasingly fast iterations of the AGP spec.
Re:Heard This One Before by MojoStan · 2006-10-20 07:47 · Score: 2, Informative

... if you combine them on the same die with a large shared cache and the on-chip memory controller... you can see where I'm going with this. Think of it as a separate CPU, just printed on the same silicon wafer. That means you only need 1 fan to cool it and you can lose a lot of heat producing power management circuitry on the video card.
Obviously this is not going to be ideal for high end gaming rigs; but it will improve the quality of integrated video chipsets on lower end and even mid range PCs.
Do you remember how Intel tried to do this with their (code name) Timna processor in 2000? Timna was supposed to be a low cost solution that integrated a CPU, GPU, and memory controller on the same silicon wafer. The CPU was a Celeron CPU (Pentium III based), the GPU was based on Intel's new i740, and the memory controller used RAMBUS (yes, RAMBUS) memory. At the same time, Intel was also developing the first chipset with integrated graphics (i810 chipset) and the first RAMBUS chipset (i820 chipset). RAMBUS was supposed to be the successor to PC100 SDRAM.
When Timna was initially finished, RAMBUS was still so expensive that Timna's release had to be delayed so that a (PC100-to-RAMBUS) memory translator could be added. Those of us who followed chipsets back then know how badly RAMBUS and memory translators bombed. The integrated RAMBUS memory controller had to be the biggest reason Timna was cancelled. This might also be a reason Intel doesn't integrate a memory controller onto their current CPUs.
Interestingly, Timna was the first project of Intel's new Israeli design team. Not a great start, but their second project was pretty darned good (Pentium M/Centrino).

--
TO START PRESS ANY KEY
Where's the 'ANY' key? I see Esk, Kitarl, and Pig-Up...
Re:Heard This One Before by TeknoHog · 2006-10-20 08:09 · Score: 1

Then why not just have some connections that come straight out of the CPU and go directly to a graphics card, bypassing any bus entirely?
I think it was done sometime in the past and it was called AGP.

--
Escher was the first MC and Giger invented the HR department.
Re:Heard This One Before by 644bd346996 · 2006-10-20 08:13 · Score: 3, Informative

Having the GPU on the same chip or die as the CPU would reduce the latency by several orders of magnitude and allow a much higher clock for the bus between the two. The memory access could also be improved dramatically, depending on how it was implemented.

I think the first example of this integration we see will use the HyperTransport bus and a single package with CPU and GPU on different dies, though fabbed on the same process. This could be done with an existing AMD socket and motherboard.

Before this happens, though, I think we will see graphics cards on HTX slots. For those who do not know, HTX slots were introduced in a recent revision of the HyperTransport standard. They allow an add-in card to communicate with the CPU with much lower latency and higher bandwidth than PCIe, and no controller in between. The add-in card could even have another CPU on it, and the performance would be comparable to current AMD SMP systems. A GPU on an HTX card could have its own RAM, and be able to access system RAM much faster than PCIe allows. The neat thing is that with HT, the CPU would probably be able to use the graphics RAM as though it were system RAM.

Note that Nvidia is a member of the HyperTransport Consortium due to their chipset business, and they could easily have HTX cards in their labs right now.
Re:Heard This One Before by NerveGas · 2006-10-20 09:47 · Score: 1

Even if the different clock speeds aren't an issue, look at die size. Building a die large enough to hold a good GPU and CPU, on a process sufficiently advanced to make the CPU perform adequately, would be MUCH more expensive than just building them seperately.

--
Oh, you're not stuck, you're just unable to let go of the onion rings.
Re:Heard This One Before by Creepy · 2006-10-20 10:25 · Score: 1

What I don't understand is that I thought GPUs were made to offload a lot of graphics computations from the CPU. So why are we merging them again? Isn't a GPU supposed to be an auxillary CPU only for graphics? I'm so confused.

A very similar argument is FPUs (floating point units) 20 years ago. Separating the floating point and integer processors was more expensive if you wanted both (it was cheaper to put them on a single die) but since the majority of machines didn't need an FPU, entry-level (the majority) PCs without them were much cheaper and if FPU operations were required they could be done with much slower software methods (e.g. Newton's Method).

Essentially, I think this is signaling that we're getting to the point where it may be more cost effective to put the GPU, or at least a base-level GPU on the die. There may be alternative reasons like sharing unused high speed GPU memory as a L2 or L3 cache to speed up non-graphics operations, as well, but I'm speculating.
Re:Heard This One Before by TheRaven64 · 2006-10-20 10:26 · Score: 1

In case you missed the Moore's Law lesson, the number of transistors you can put on a die per dollar doubles every 18 months or so. Adding a GPU is likely to give a bigger performance boost (and especially a performance per watt boost, which is where the real money is these days) than doubling the cache size (we already hit diminishing returns there) or the number of cores in the near future.

--
I am TheRaven on Soylent News
Re:Heard This One Before by daVinci1980 · 2006-10-20 10:28 · Score: 2, Informative

You'd be mistaken. See the slide on Texture Mapping.

Perspective divide is performed before texture sampling. This is necessary to get proper texture step sizes, for correct sampling of the texture onto the pixel.

Fractional pixel locations are also used in antialiasing.

--
I currently have no clever signature witicism to add here.
Re:Heard This One Before by TheRaven64 · 2006-10-20 10:30 · Score: 1

Unless they *really* need nonportable features, most programmers tend to go with option 2.
For anything that taxes the CPU, they often go with a combination of 1 and 2. Write something for the lowers denominator, and provide optimised versions for various platforms as options (often in the style of option 5). In the real world, some combination of several of these options is common.
It's worth noting that few people stray outside 2 except at the library level, however. A lot of people will use things like OpenSSL or OpenGL, which will hide the complexity from them, using dedicated hardware when it's available and falling back to general codepaths when it isn't.

--
I am TheRaven on Soylent News
Re:Heard This One Before by Anonymous Coward · 2006-10-20 11:31 · Score: 0

Since ARM cores are usually integrated with other components on the same piece of silicon, I'm not sure there's anything new to do there. Besides, ATI's already working with Qualcomm to have their 3D tech in CDMA chipsets. (March 2004!)
Re:Heard This One Before by Surt · 2006-10-20 12:07 · Score: 1

Then why not just have some connections that come straight out of the CPU and go directly to a graphics card, bypassing any bus entirely?

Those 'connections' that run from one device to another? They're known as a 'bus' in computer design lingo.

--
"Who is the Journal of Quantum Physics going to believe?" --Stephen Hawking
Re:Heard This One Before by Doctor+Memory · 2006-10-20 13:59 · Score: 2, Interesting

But I highly doubt that nVidia will be able to get a CPU out that out-performs an Intel or AMD
Maybe they don't have to. If they can just make something that can accelerate MMX/3D Now (sort of a graphics pre-processor) and plug that into a Socket F slot, it'd be like a two-stage accelerator: first accelerate the calculations that produce the graphics, then accelerate the display. Maybe they could find a way to do a micro-op translation of MMX instructions into something more RISC-like, and run them on a RISC core.

At least, I hope it's something like that, because I agree with PP: nVidia doesn't have much of a chance to beat both Intel and AMD at the x86 game.

--
Just junk food for thought...
Re:Heard This One Before by GWBasic · 2006-10-20 14:49 · Score: 1

What I don't understand is that I thought GPUs were made to offload a lot of graphics computations from the CPU. So why are we merging them again? Isn't a GPU supposed to be an auxillary CPU only for graphics? I'm so confused.

A couple of reasons. The primary reason is that it's cheaper to build a computer with less chips. In addition, a single chip design might be more power efficent and allow for a smaller computer.

Besides, if you're building a cheapo computer to act as a Jetson's-style picture phone running Skype, wouldn't it make more sense to use a combo CPU/GPU?

What if you want an UMPC (Ultra-mobile PC) in your pocket to play games? Why try and cram two chips into it? (I think the original GameBoy had a combined CPU/GPU.)

--
No, I will not work for your startup
Re:Heard This One Before by Ironsides · 2006-10-20 14:53 · Score: 1

Those 'connections' that run from one device to another? They're known as a 'bus' in computer design lingo.

Er. No. A Bus has several devices connecting to it, not just two.

--
Fly me to the moon Let me sing among those stars Let me see what spring is like On jupiter and mars
Re:Heard This One Before by Ironsides · 2006-10-20 14:56 · Score: 1

That's sort of the point of building them on the same die. You can't just run a wire to it, as it would be quite slow. Wires tend to have parasitic inductances and capacitances, so the setup and hold times on the lines would be too large to provide a benefit.

Given that I'm an EE, I know a bit about this and that there are many ways to get around it. For one thing, if they can provide 10Gbit connections over copper, they shure as hell can do a direct connection with a video card.

--
Fly me to the moon Let me sing among those stars Let me see what spring is like On jupiter and mars
Re:Heard This One Before by Fred_A · 2006-10-21 00:52 · Score: 1

Or you could have a Pentium class chip on a daughter card plugged into the PCI-E slot...

Umm...

--

May contain traces of nut.
Made from the freshest electrons.
Re:Heard This One Before by Edward+Kmett · 2006-10-21 04:19 · Score: 1

Floating point has sped up to the point where I would rather perform the calculation on the GPU than expend the memory bandwidth looking it up in the table, since the memory bandwidth is my current bottleneck. The GPUs are largely memory bound these days; your solution was sound six years ago, but now would just exacerbate speed issues.

--
Sanity is a sandbox. I prefer the swings.
Re:Heard This One Before by dargaud · 2006-10-30 04:59 · Score: 1

Thanks Joto. THere were several techniques I had never heard of. I'll be doing do bare metal stuff soon again in my new job, after 6 years doing sysadmin and data analysis stuff.

--
Non-Linux Penguins ?

Should Slashdot really insult other news outlets? by Anonymous Coward · 2006-10-20 04:26 · Score: 0, Offtopic

Because they're never wrong and never sensationalize a story for a few clicks.

My outlandish idea for this by Anonymous Coward · 2006-10-20 04:28 · Score: 0

Perhaps something like this could be used in a general-purpose computer. Many technical hurdles will have to be overcome, but it may be possible after several decades of research.

Re:My outlandish idea for this by Teilo · 2006-10-20 05:06 · Score: 1

I am hoping this is a joke.

"decades" ago (assuming 20 years to match your plural), most computing was still 8-bit and most personal computers had either monochrome text displays or EGA (16 color 640x480) graphics at best? Make it 30 years and there were no personal computers of note at all.

--
Mir tut es leid, Menschen daß Einfältigfehlersuchenbaumfolgendenaffen sind.

nVidia don't have a good chance with this. by purpledinoz · 2006-10-20 04:30 · Score: 2, Interesting

AMD and Intel have their own fabs that are at the leading edge of semiconductor technology. I highly doubt that nVidia will open up a fab for their chips. But who knows, IBM may produce their chips for them.

I think the better option would be to have a graphics chip fit into a Socket 939 on a dual socket motherboard, with an AMD chip. It will have a high-speed link through hyper-transport, and would act just like a co-processor. I'm no chip designer, so I have no idea what the pros/cons of this are, or if it's even possible.

Re:nVidia don't have a good chance with this. by drinkypoo · 2006-10-20 04:52 · Score: 1

You'd also need more hardware on the board, like the video memory and the RAMDAC, not to mention the ports to plug in your display. Those could be on a riser, though, and expressed like any slot card, including the DAC.

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Re:nVidia don't have a good chance with this. by Joe+The+Dragon · 2006-10-20 06:07 · Score: 1

No you put on a hypertransport htx card
Re:nVidia don't have a good chance with this. by stevesliva · 2006-10-20 06:38 · Score: 1

AMD and Intel have their own fabs that are at the leading edge of semiconductor technology. I highly doubt that nVidia will open up a fab for their chips. But who knows, IBM may produce their chips for them.
Why not just stick with TSMC? It's not like TSMC doesn't also have a few gagillion dollars to spend on process development. Is there something wrong with Nvidia's current chips?

--
Who do you get to be an expert to tell you something's not obvious? The least insightful person you can find? -J Roberts
Re:nVidia don't have a good chance with this. by archen · 2006-10-20 06:50 · Score: 1

Correct me if I'm wrong, but aren't co-pressors already on AMD's map? I think that was announced a few months ago. Maybe this is what they've been up to with ATI.

All I hope is that it's an open standard that anybody can plug their chip in as a GPU coprocessor. Because (and I say this as an AMD fanboy) if it comes down to a choice of either Nvidia+Intel or AMD+ATI, I will drop AMD. I wan't absolutely nothing to do with ATI or their junk.
Re:nVidia don't have a good chance with this. by Anonymous Coward · 2006-10-20 08:08 · Score: 0

You do realize that Socket 939 is old? AMD's cutting edge socket is AM2 now. Why would someone build a Socket 939 graphics chip? They are supposed to stay ahead of the curve, not lag behind.
Re:nVidia don't have a good chance with this. by purpledinoz · 2006-10-20 08:39 · Score: 1

You missed the point... replace "939" with "AM2"...

One Question: by TubeSteak · 2006-10-20 04:30 · Score: 1

How much heat do integrated graphics solutions put out?

I can't imagine it is that much
(since they mostly suck)

--
[Fuck Beta]
o0t!

With integration.. by Hangin10 · 2006-10-20 04:30 · Score: 3, Interesting

With this integration, does that mean a standard for 3-d? No more Nvidia/ATI drivers. The OSDEV guys would love this if it came to that. But how would this integration work? A co-processor space like MIPS? If so, does that mean that graphics calculations have somewhat been moved back to the CPU? And what about the actual workings itself, I'm guessing the actual registers would still be memory mapped in someway (or I/O ports for x86, whatever).

I'm thinking way too much. It did alleviate boredom for about a minute though...

Re:With integration.. by ozamosi · 2006-10-20 05:28 · Score: 0, Flamebait

You DID see that Nvidia was behind this, right? Which means you'll have to run som broken, bloated and insecure binary blob just to get the bloody CPU working.

Out of their league? by Salvance · 2006-10-20 04:32 · Score: 4, Insightful

Unless Nvidia is partnering with Intel to release this chip, I think they're getting too far out of their confort zone to be successful. Sure, a dual or even quad core chip with half of the cores handling graphics would be great, but can Nvidia deliver? I doubt it ... look how many companies have gone down the tubes after spending millions/billions after trying to make an x86 compatible chip, let alone trying to integrate top end graphics as well.

Nvidia is a fantastic graphics card company - they should continue to innovate focus on what they're good at rather than try to play follow the leader in an arena they know nothing about.

--
Crack - Free with every butt and set of boobs

Re:Out of their league? by TheRaven64 · 2006-10-20 05:07 · Score: 4, Insightful

The thing is, it doesn't need to be a very good x86 chip. Something like a VIA C7 is enough for most uses, if coupled with a reasonable GPU. I can imagine something like this being very popular in the sub-notebook (which used to mean smaller-than-letter-paper but now means not-as-fat-as-our-other-products) arena where power usage is king. If the CPU and GPU are on the same die then this gives some serious potential for good power management, especially if the memory controller is also on-die. This makes for very simple motherboard designs (and simple = cheap), so it could be very popular.

--
I am TheRaven on Soylent News
Re:Out of their league? by BecomingLumberg · 2006-10-20 05:17 · Score: 0

Did they go down the wrong tube? If they did, maybe thats why my internets are not getting through...

--
If a nation expects to be ignorant and free, in a state of civilization, it expects what never was and never will be.-TJ
Re:Out of their league? by Doctor+Faustus · 2006-10-20 05:57 · Score: 1

The thing is, it doesn't need to be a very good x86 chip. Something like a VIA C7 is enough for most uses, if coupled with a reasonable GPU.
Especially if they just shove several of them together onto the die. Everyone is going to be focusing more on software that can take advantage of multiple CPUs for the next couple of years, and nVidia can ride the coattails of that with a nice, simple in-order-execution design. Put 16 or so of those onto your chip with a good GPU, and you might get pretty good performance without a whole lot of design work (relatively speaking, of course). The power management system could shut down entire CPUs, too.
Re:Out of their league? by archen · 2006-10-20 06:53 · Score: 1

The problem is that even if they don't know anything about it, the ATI/AMD merger is troubling news for them. I wouldn't say this is a solid roadmap, but more like an R&D project to keep their bases covered "just in case".
Re:Out of their league? by mgblst · 2006-10-22 21:25 · Score: 1

Who else has moved out of there comfort zone? Apple with the ipod, Microsoft with office, Sony with the playstation!

It is not like they are not going to keep producing graphics cards, I am sure they will concentrate most of their work on this. But, they have the ability to innovate in another area, probably have a lot of money and skilled people, so why shouldn't they? What is wrong with taking a risk?

I smell a pattern by doti · 2006-10-20 04:32 · Score: 2, Interesting

There seems to be a cycle of integrating and decoupling things.
We had separated math co-processors, that later were integrated in the CPU.
Then the separated GPU, which will soon be integrated back too.

--
factor 966971: 966971

Re:I smell a pattern by katz · 2006-10-20 05:33 · Score: 1

The pattern you describe has a name, it's called the "Wheel of Reincarnation". The Jargon File specifically mentions graphics equipment, even:

"[coined in a paper by T.H. Myer and I.E. Sutherland On the Design of Display Processors, Comm. ACM, Vol. 11, no. 6, June 1968)] Term used to refer to a well-known effect whereby function in a computing system family is migrated out to special-purpose peripheral hardware for speed, then the peripheral evolves toward more computing power as it does its job, then somebody notices that it is inefficient to support two asymmetrical processors in the architecture and folds the function back into the main CPU, at which point the cycle begins again.
Several iterations of this cycle have been observed in graphics-processor design, and at least one or two in communications and floating-point processors. Also known as the Wheel of Life, the Wheel of Samsara, and other variations of the basic Hindu/Buddhist theological idea. See also blitter."

see http://catb.org/jargon/html/W/wheel-of-reincarnati on.html.

- Roey
Re:I smell a pattern by Doctor+Faustus · 2006-10-20 06:07 · Score: 1

My understanding of The Wheel of Reincarnation is that separate hardware is eliminated entirely, and the main CPU just has more processes running. If I've got that right, floating point doesn't really fit, because it wasn't really integrated into the main CPU, just moved onto the same die.

When you look at the 80387, it was a lot harder to get enough onto a single chip, and the bus speed between chips was a lot faster, relative to the CPU speed, than it is now (you might have a couple cycles of latency, but the CPU and the bus were both 33mhz and 32 bit). This is back when the CPU cache was on separate chips. Even a generation later, the 486 had an 8kb cache built-in, and motherboard manufacturers were adding external L2 caches of 32-128kb.
Re:I smell a pattern by Anonymous Coward · 2006-10-20 09:48 · Score: 0

The 486DX was just a CPU with FP unit on the same die; the 486SX was originally a regular 486 with the FP disabled (due to it malfunctioning); the 487 was essentially just a 486DX.

However after that, the FP functionality was fully integrated into the CPU's pipeline. Once MMX came out, the FP registers were even used for an alternate purpose.

dom

Math co-processors, anyone? by cplusplus · 2006-10-20 04:35 · Score: 4, Insightful

GPUs are going the way of the math co-processor. I think it's inevitable.

--
"False hope is why we'll never run out of natural resources!" - Lewis Black

Re:Math co-processors, anyone? by SirKron · 2006-10-20 05:08 · Score: 1

Exactly, why else do you feel we need an 8 core processor? The GPU will just be extra clock cycles on one of the cores.
Re:Math co-processors, anyone? by holistah · 2006-10-20 15:59 · Score: 1

this is the 20th time I've read this same exact comment (10th for this article), and we all thought the same thing when we saw the headline, no need to say it out loud! why is this modded +5 insightful every single time? perhaps you should read before commenting, so should the mods

patents by chipace · 2006-10-20 04:36 · Score: 2, Insightful

It's quite clear that the AMD-ATI merger was to aquire the IP and expertise necessary to integrate gpu core(s) on the same die as cpu core(s). Nvidia does not have to actually market a design, but rather patent some key concepts, and this could provide revenue or protection.

I would very much doubt that they could compete with AMD and Intel who have already patented many x86 cpu concepts.

It's a shame that Intel has decided not to buy nvidia, and go it alone with it's own design staff.

Nvidia by Anonymous Coward · 2006-10-20 04:42 · Score: 1, Funny

Why doesn't google buy Nvidia?

Re:Nvidia by SevenHands · 2006-10-20 05:30 · Score: 1

Google is just software, not hardware. Or am I wrong on this one?
Re:Nvidia by Anonymous Coward · 2006-10-20 05:46 · Score: 1, Funny

maybe for the same reason ford doesn't.
Re:Nvidia by Anonymous Coward · 2006-10-20 07:49 · Score: 0

Why doesn't google buy ford then?
Re:Nvidia by daverabbitz · 2006-10-20 09:58 · Score: 1

Because Ford is going down the tubes.

--
What could be better than a jet powered motorcycle? http://www.youtube.com/watch?v=u8l6GTHLSWE
Re:Nvidia by uujjj · 2006-10-20 10:14 · Score: 1

Google is just software, not hardware. Or am I wrong on this one?

You are indeed wrong. Google is actually a major hardware manufacturer since they build their own servers plus the Google Search Machines. There's even rumors of them making custom chipsets for their servers.

Thank MicroSoft by powerlord · 2006-10-20 04:43 · Score: 5, Interesting

Okay, I admit, I haven't RTFA yet, but if GPUs do get folded back into CPUs, I think we need to thank MS.

No. ... Seriously. Think for a minute.

The major driving force right now in GPU development and purchase are games.

The major factor that they have to contend with is DirectX.

As of DirectXv10. A card either IS, or IS NOT compliant. None of this "We are 67.3% compliant".

This provides a known target that can be reached. I wouldn't be surprised if the DirectX10 (video) featureset becomes synonymous with 'VGA Graphics' given enough time.

Yeah, sure, MS will come out with DX11, and those CPUs won't be compatible, but so what?, If you upgrade your CPU and GPU regularly anyway to maintain the 'killer rig', why not just upgrade them together? :)

--
This space for rent. All reasonable inquiries will be entertained at proprietors discretion.

Re:Thank MicroSoft by Anonymous Coward · 2006-10-20 04:57 · Score: 0

It will take a long time for DX 10 to be fully accepted if it's going to require vista, as rumoured...
Re:Thank MicroSoft by powerlord · 2006-10-20 05:04 · Score: 1

I wasn't thinking so much DX10 being accepted, as the featureset that it requires to be there.

--
This space for rent. All reasonable inquiries will be entertained at proprietors discretion.
Re:Thank MicroSoft by Anonymous Coward · 2006-10-20 05:10 · Score: 0

I dunno, maybe due to cost/system instability issues? You want to try swapping out the CPU every 4 months, along with all the headaches that can cause (like optimizing the kernel)? I sure as hell don't. Also, what about OpenGL? As a CPU manufacturer, I'd prefer to keep the number of standards I have to comply with and get certified for to a minimum, and as a Linux user, I don't like the idea of my CPU of all thinsg being beholden to Microsoft.
Re:Thank MicroSoft by powerlord · 2006-10-20 05:28 · Score: 1

OpenGL should be able to take advantage of the same features a chip can provide, as long as there are drivers to support it.

My comment about DirectX10 was just that it was being presented as an absolute standard to adhere to, unlike its predecessors, if you want to get certified, you have to have certain features. This provides a much more concrete standard that people can look at and say 'can that graphic Card/Chip do X?'.

I'm confused how this would make the CPU beholden to MS. Somehow the CPUs currently in our computers, primarily sold to run DOS and Windows, have managed to run SCO (wether you like it or not), Novell, Minux, Linux, BSD, Solaris, and even (shock) OS X. A remarkable number of those computers even contain graphics cards that are certified for DirectX (in one flavor or another).

I do have one other question though: As a linux user, do you usually find the need to swap out your GPU every 4 months?

--
This space for rent. All reasonable inquiries will be entertained at proprietors discretion.
Re:Thank MicroSoft by Kjella · 2006-10-20 06:31 · Score: 1

If you upgrade your CPU and GPU regularly anyway to maintain the 'killer rig', why not just upgrade them together? :)

Don't know about you, but for me graphics and CPU still aren't in sync. I imagine for most gamers it's still that way. More often than not you want to swap your graphics card, and your CPU *only* if it's holding back the GPU, which is far from always. I'd say you can at least live two generations of GPU cards on the same CPU. Still, for the "all-in-one" market that wouldn't go off buying a separate card anyway, go for it.

--
Live today, because you never know what tomorrow brings
Re:Thank MicroSoft by asuffield · 2006-10-20 06:38 · Score: 1

As of DirectXv10. A card either IS, or IS NOT compliant. None of this "We are 67.3% compliant".

While this is a deliberate feature of DirectX, it's nowhere near as useful as you suggest. What is specified and complied with is the set of instructions which the card will accept. What is not specified is what it will do with those instructions - a card is considered to be DirectX compliant even if it has many rendering errors in the output. GPU makers can and do take a great many liberties with this, and that's just the deliberate part - this is before we start considering the generally abysmal quality of drivers from nVidia and ATI.

Video cards are not "compatible" with DirectX. Games are deliberately made to be compatible with video cards. Every (and I do mean 'every') big-money game out there has many workarounds written into its code to handle ATI and nVidia cards, and probably the two or three minor GPU makers as well. This works because the number of variations of video cards on the market being considered at any one time is quite small, due to the absence of any attempt at compatibility with older hardware. This is not 'standardisation', it's just lip-service.

If we did not have DirectX and each manufacturer had their own proprietary interface, there would be no visible difference to gamers. Games would be made in exactly the same way they are today and they would work about as well as they currently do.

a dash? by racebit · 2006-10-20 04:45 · Score: 1

The article is from the Inquirer, so a dash of salt might make this more palatable.
A dash?..hell! better use the whole damn shaker

A cyclic process? by Kadin2048 · 2006-10-20 04:49 · Score: 4, Insightful

A while ago -- and maybe it was in the Slashdot discussion about ATI, I'm not sure -- somebody described a cycle in computer design, where various components are built-in monolithically, then broken out as separate components, and then swallowed back up into monolithic designs again.

Graphics chips seem to have done this cycle at least once; perhaps now we're just looking at the next stage in the cycle? We've had graphics as a separate component from the processor for a while, perhaps the next stage in the cycle is for them to combine together into a G/CPU, to take advantage of the design gains in general-purpose processors.

Then at some point down the road, the GPU (or more likely, various GPU-like functional units) might get separated back out onto their own silicon, as more application-specific processors become advantageous once again.

--
"Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."

Re:A cyclic process? by shizzle · 2006-10-20 04:56 · Score: 5, Informative

Yup, the idea is pushing 30 years old now, and came out of the earliest work on graphics processors. The term "wheel of reincarnation" came from "On the Design of Display Processors", T.H. Myer and I. E. Sutherland, Communications of the ACM, Vol 11, No. 6, June 1968.
http://www.cap-lore.com/Hardware/Wheel.html
Re:A cyclic process? by levork · 2006-10-20 04:58 · Score: 4, Informative

This is known as the wheel of reincarnation, and has come up several times in the last forty years of graphics hardware.
Re:A cyclic process? by gstoddart · 2006-10-20 05:40 · Score: 1

The term "wheel of reincarnation" came from "On the Design of Display Processors"

I won't dispute that the term in a technical usage was coined by them. But, it's basically a borrowed term from Hindu/Buddhist stuff who have believed in reincarnation and the wheel of life for a few thousand years.

Cheers

--
Lost at C:>. Found at C.
Re:A cyclic process? by shizzle · 2006-10-20 05:56 · Score: 1

True! Didn't mean to imply otherwise. I should have been clearer:
The use of the term "wheel of reincarnation" to refer to this phenomenon came from [...]
And of course the main contribution of this paper was the recognition of that phenomenon, not just the appropriation of a catchy phrase to describe it.
Re:A cyclic process? by gstoddart · 2006-10-20 06:02 · Score: 1

True! Didn't mean to imply otherwise. I should have been clearer

*laugh* That, or I should be less of a pedant. ;-)

Cheers

--
Lost at C:>. Found at C.
Re:A cyclic process? by kylerimkus · 2006-10-20 06:45 · Score: 0

Diapers do this too. They move in a cycle of unisex-> sex specific -> unisex. But with little improvement in quality, it seems to be just driven by marketing. Especially since each shift comes with a noticable bump in price.
Re:A cyclic process? by asuffield · 2006-10-20 06:46 · Score: 1
A while ago -- and maybe it was in the Slashdot discussion about ATI, I'm not sure -- somebody described a cycle in computer design, where various components are built-in monolithically, then broken out as separate components, and then swallowed back up into monolithic designs again.

Graphics chips seem to have done this cycle at least once; perhaps now we're just looking at the next stage in the cycle?

Frankly, I hope this is the last stage in the cycle - at least until we have some radical changes in how processors are fabricated. The problem is that we've more or less reached the physical limits of possible miniaturisation in semiconductor technology (there's some more space for improvement, but we won't get another factor of ten out of it). Unless we invent something smarter, in five years our options are going to be:
- Systems split out into many components
- A processor unit that is the size of a house brick, before the heatsink is attached
I'm not going to be foolish enough to claim that things won't continue to get faster - but sort of some revolutionary new method for building these things, they're going to be getting bigger from here on out. I don't even want to think about what the sockets will look like.
Re:A cyclic process? by Rockinsockindune · 2006-10-20 08:25 · Score: 2, Funny

Ohh... do you think that I could get a trademark on "G/CPU" and then try and auction it off to the two compaines?

--
I abuse commas, I cannot help myself.
Re:A cyclic process? by funfail · 2006-10-20 08:48 · Score: 1

No baby needs a diaper that long to be affected by the cycle, how's that marketing driven?

--
Search RapidShare and MegaUpload!
Re:A cyclic process? by Anonymous Coward · 2006-10-20 22:22 · Score: 0

the fact that people have more than one child seems to have escaped you... sherlock

I don't think it's gonna happen by tehSpork · 2006-10-20 04:53 · Score: 1

With nVidia making CPUs, and of course Intel and AMD/ATI (DAAMIT) making CPUs, how could nVidia expect to grab any market share? No offence to the nVidian engineers, but their product would have to be miles above the Intel/DAAMIT offerings in order to make most people even consider a system with an nVidia CPU. I think they would be better off if they attempted to enter into a contract with Intel for their CPU/GPU combo ideas, maybe Intel could get a few nice server chipsets out of the deal? :)

Re:I don't think it's gonna happen by DragonWriter · 2006-10-20 09:22 · Score: 1

With nVidia making CPUs, and of course Intel and AMD/ATI (DAAMIT) making CPUs, how could nVidia expect to grab any market share? No offence to the nVidian engineers, but their product would have to be miles above the Intel/DAAMIT offerings in order to make most people even consider a system with an nVidia CPU.

No, it wouldn't. It would have to be either cheaper for OEM's to consider it, or offer some other selling point that AMD or Intel wasn't offering, or, preferably, both; which is how AMD (and Cyrix and others) got marketshare competing with Intel in making PC processors in the first place, years ago.

A CPU-GPU combination would certainly meet the second qualification, and its not inconceivable that such a combination could also outperform a similar-cost separate CPU and video subsystem setup.

Most people don't care who makes the CPU in their computer, the idea is getting it to sell to OEMs, who are the decision-makers that matter.

Pointless without documentation. by sudog · 2006-10-20 04:56 · Score: 3, Insightful

Why is everyone getting excited about this? Now we're going to have a CPU that's only partially documented, and we lose even moreso to closed-source blobs.

This isn't a good thing unless they also release documentation for it!

Just my preference . . . by Orange+Crush · 2006-10-20 05:04 · Score: 3, Funny

The article is from the Inquirer, so a dash of salt might make this more palatable.

I prefer my articles with a dash of accuracy.

Niche market? by gstoddart · 2006-10-20 05:08 · Score: 3, Insightful

Will dedicated graphics cards become a niche product for enthusiasts and pros, like audio cards already largely have?

Haven't they already???

The vast majority of machines (at least, from my experience, which could not be broad enough) seem to be shipping with integrated graphics on the motherboard. Certainly, my last 3 computers have had this.

Granted, I buy on the trailing edge since I don't need gamer performance, but I kind of thought most consumer machines were no longer using a separate graphics card.

Anyone have any meaningful statistics as opposed to my purely anecdotal observations?

--
Lost at C:>. Found at C.

Re:Niche market? by gbjbaanb · 2006-10-20 05:18 · Score: 2, Insightful

Its hardly a niche market - every server wants onboard graphics, mainly because they don't need to be superpowerful. I imagine this is similar - a low-powered CPU on the same chipset as the graphics chip (and no doubt the network chip) would make making motherboards a bit cheaper, or give them more capabilities that they currently have to have managed with software installed as a driver.

I really doubt the CPU part is going to compete with the latest super-quadcore chips from AMD or Intel, so no-one will use it for a mainstream computer. Possibly it'd have a market for embedded products but I thought they were already well catered for.
Re:Niche market? by gstoddart · 2006-10-20 05:30 · Score: 1

I really doubt the CPU part is going to compete with the latest super-quadcore chips from AMD or Intel, so no-one will use it for a mainstream computer.

What about the latest quad-core chips are mainstream??? Those are specialty products if there ever was a specialty product. Except for high-end gamers and people doing really specialized tasks, who actually needs one of them? I bet I couldn't tax one for anything I do.

Nowadays, so many motherboards have video, lan, sound, IDE controller, possibly RAID, and USB all on them that I would think for most people that would suffice. Your average home user isn't going to overwork that.

A separate video card might not yet be niche, but I can forsee it becoming so.

Cheers

--
Lost at C:>. Found at C.

It's a logical extension of the NVidia NForce line by Animats · 2006-10-20 05:10 · Score: 4, Interesting

I've been expecting this for a while, ever since the transistor count of the GPU passed that of the CPU. Actually, I thought it would happen sooner. It's certainly time. Putting more transistors into a single CPU doesn't help any more, which is why we now have "multicore" machines. So it's time to put more of the computer into a single part.

NVidia already makes the nForce line, the "everything but the CPU" part, with graphics, Ethernet, disk interface, etc. If they stick a CPU in there, they have a whole computer.

Chip designers can license x86 implementations; they don't have to be redesigned from scratch. This isn't going to be a tough job for NVidia.

What we're headed for is the one-chip "value PC", the one that sits on every corporate desk. That's where the best price/performance is.

Re:Should Slashdot really insult other news outlet by vondo · 2006-10-20 05:11 · Score: 4, Insightful

The Inquirer is more of a rumor site than a news site. They have a pretty good track record for their rumors, but they don't have people on record backing this one up.

What NVidia eventually does may not bear much resemblance to the story.

we need more than that by Anonymous Coward · 2006-10-20 05:11 · Score: 0

http://www.antsbitemylegs.com/mountain.jpg

Why multiprocessor units suddenly most efficient? by Anonymous Coward · 2006-10-20 05:18 · Score: 0

From someone who has mainly been involved in computers as a semi-competent user:

Why are the multiprocessor units suddenly so popular, relative to why e.g. the Voodoo graphics cards failed? I remember them being ridiculed and ending up in the performance backwaters through their 2-4-8(-16) multiprocessor cards, but it seems that there are engineering reasons why multiple processors are now suddenly coming into favour, or?

CELL by Anonymous Coward · 2006-10-20 05:18 · Score: 0

This is happening because all the big vendors know that once consumers really start using stuff with the cell processor they won't need all the other stuff for most purposes. Not all, but or most normal consumer desktops. It's sneaking in everywhere because it was a good idea and it works. You have multiple cores that go do different stuff and talk to each other really really fast, makes building a computer a lot simpler and in theory can be a lot smaller. I know which one I would pick say at a low end, a 300 buck computer with a normal even duocore processor and a cheap amount of ram and cheap vid and sound, or something for the same price with the cell processor where different cores did didfferent stuff and 4 times the ram. No contest. There will always be a market for higher end and specialised cards and chips, but once you can whip out one thing that fits in one slot and takes the place of 3/4ths of the current mobo you will be making some cash on that badboy. Pretty soon you'll be able to just plop your cellphone (which will act as your main unit) down on your desk in front of a wireless enabled screen, keyboard and mouse and do basically most stuff you do today with a normal big machine, and it is all because of the multiple core chips, they will make it possible and cheap.

Jen-Hsun Huang: A True Asskicker by DeathPenguin · 2006-10-20 05:23 · Score: 2, Informative

When I saw this headline I immediately thought of this article, an interview with Jen-Hsun Huang (CEO: nVidia) by Wired dated July '02. In it, the intention of overthrowing Intel is made quite clear, and ironically enough they even mention the speculation from a time when it was rumored that nVidia and AMD would merge.

It's actually a very good article for those interested in nVidia's history and Huang's mentality. Paul Otellini ought to be afraid. Very afraid.

Niche by Etyenne · 2006-10-20 05:23 · Score: 1

Will dedicated graphics cards become a niche product for enthusiasts and pros, like audio cards already largely have?

It already is.

--
:wq

Other uses? by NerveGas · 2006-10-20 05:29 · Score: 1

"what other prospects could a solution like this have?"

Duh. Gaming consoles. Add memory, a sound controller, and some sort of storage, and you're in business.

--
Oh, you're not stuck, you're just unable to let go of the onion rings.

best of both worlds..? by discojohnson · 2006-10-20 05:30 · Score: 1

"back in the day" when my 80387 (7?..coprocessor) was sitting to the side with it's own instructions to complete, the commands never had to traverse up and down a much slower ISA bus. why not apply the same idea to upgrading your CPU and GPU separately through a slot/socket design? they can still share memory (not necessarily cache--sorry), but the agp/pcie bus is removed. communication between a single die g/cpu will still have some sort of bus, albeit a very tiny bus with tiny pathways; why not just make the pathways a bit bigger and build them into the mobo?

Re:best of both worlds..? by BoberFett · 2006-10-20 06:41 · Score: 1

You mean like AMD is positioning itself with HyperTransport?

Here's the bottom line. by smug_lisp_weenie · 2006-10-20 05:31 · Score: 1

There's certain advantages to having (A) the GPU functionality integrated in the CPU (cost, certain performance aspects, others) and some to having it (B) in a separate GPU (more easily upgraded, more real estate, less heat problems)...

Every once in a while an unrelated tech innovation comes around that benefits (A) or (B) in some indirect fashion. It could be faster bus speeds, more sophisticated GPU instruction sets, etc etc etc. Doesn't really matter what they are, but they happen all the time and each may benefit (A) or (B) more than the other.

So currently we are at a time when (B) is the standard design- GPU and CPU remain separate. Whenever an innovation has come along that benefits (B) more it has been integrated into the latest NVIDIA or ATI design. Whenever something that benefits (A) has come along it has been ignored in the last few years... this probably includes things such as the balooning costs of the GPUs, difficulty in getting GPUs into now popular laptop form factors, texture latency, etc. etc. ...so finally the innovations for design (A) are reaching a critical mass... all those innovations that recently couldn't be pounced upon because of the separation of GPU/CPU are now making (A) looking pretty damn nice again.

This CPU/GPU cycle happens every few years... We've been in an (A) cycle many times before- Remember what MMX was originally for? It was because we didn't have GPUs and couldn't do efficient block operations for video at the time... Remember the IBM PCjr? That was arguably another (A) cycle because IBM wanted to save money on circuitry/memory for the video subsystem, which is arguably just a primitive form of a GPU.

This GPU->CPU oscillation has been, and will continue to be, going on forever.

Re:It's a logical extension of the NVidia NForce l by LesPaul75 · 2006-10-20 05:34 · Score: 1

Well, the thing about a high-end CPU is that it's something like 80% custom logic, where a GPU is much more "standard cell" design. So the fact that NVIDIA is good at GPUs with lots of transistors doesn't mean that it will be easy for them to build a CPU. It will be very difficult to build something competitive with Intel and AMD. But if anyone out there right now has a shot at it, it's NVIDIA. Licensing of the x86 architecture is going to be a sticky issue.

Something that's interesting about this, if true, is that Intel might be the one playing catch-up. AMD will have ATI graphics, NVIDIA will have NVIDIA graphics, and Intel will have Intel graphics, which have always been pretty horrible.

Nothing New In The World by NekoXP · 2006-10-20 05:35 · Score: 1

Integration is the key to cost reduction, performance improvement and power efficiency.

L2 cache used to be external. Then they integrated it when technology and performance allowed. L3 cache then became external while L2 was integrated; now you can buy processors with all this inside. Put the memory controller inside the CPU and you no longer need to spread out high (er than CPU core voltage) IO lines with nasty length requirements between Northbridge and CPU, and can clock the bus faster. Put the ethernet and so on inside the Northbridge and you no longer need discrete chips and buses for them, and can run them faster with tighter integration to a DMA controller and embedded RAM.

Integrate the graphics hardware into the CPU and you can have most of the high-bandwidth devices on the fastest possible bus.

Take Freescale's nearly-done 8641D Power Architecture processor. It's 2 G4s, 4 gigabit ethernet, USB2, PCI Express and RapidIO, DMA, interrupt and memory controller, I2C, serial. This chip is priced LOWER than equivalently specced Core Duo 2 combinations (CPU, i975MCH/ICH combination), and the performance.. is about the same. However board implementation will be much easier, and lower power. All you need for a system is to add your peripherals; a SATA chip, perhaps. I can't think of anything else that is missing besides graphics.

http://www.freescale.com/files/32bit/doc/fact_shee t/MPC8641DDLCRFS.pdf

Eventually SATA will go in there, you can bet on that. Then graphics. Then one chip per board is a possibility. You thought NanoITX was small..?

Mod parent up +1 Sad by Tei · 2006-10-20 05:35 · Score: 1

yes, indeeed, but is a sad state of the world. I hope OpenGL return again and force the Neverwinter Nights to develop Nwn 3 with a OpenGL path again :I

--

-Woof woof woof!

Re:Mod parent up +1 Sad by ThePhilips · 2006-10-20 09:27 · Score: 1

hope OpenGL return again and force the Neverwinter Nights to develop Nwn 3 with a OpenGL path again :I

Not until M$ looses its grip it holds with every game developer as hostage to cheap development tools M$DevStudio for M$Widn0ze and Xbox.
As of now, they are obediently do what M$ tells them. Earlier Xbox release? - Pay them promotional fees and consider it done. Poor/late MacOS/X support? - No problems. Non-existent Linux support? - Give them another rebate for M$DevStudio and they would forget about such OS.
DirectX wasn't technical thing. It's a thing to control companies developing for Windows and effectively locking them in Windows, so they couldn't run away from it.

--
All hope abandon ye who enter here.

Despite doubters, this seems a good idea to me by zappepcs · 2006-10-20 05:40 · Score: 1

If you could preload half the code you run on the CPU over to a CPU/GPU chip, and cut down buss use by >50% by utilizing a 'smart' GPU chip, this should enhance overall system performance by tons in graphic intensive applications. Not to mention that it simplifies simpler system needs (say embedded or wireless) for smaller systems that require less space but provide required functionality with high graphics performance. Just my thought

--
Support NYCountryLawyer RIAA vs People

Just what we needed by Anonymous Coward · 2006-10-20 05:45 · Score: 0

A proprietary CPU that you can't use with your free operating system.

Considering the recent security problems with their proprietary piece of shit driver I'd rather stay away from anything manufactured by NVIDIA.

What if you could put multiple chips like these in by stunt_penguin · 2006-10-20 05:53 · Score: 1

What if you could put multiple chips like these in one machine?

They'd probably be obsolete in three months, as opposed to one month ;)

--
When the posters fear their moderators, there is tyranny; when the moderators fears the posters, there is liberty.

Thin Clients by Beefslaya · 2006-10-20 05:53 · Score: 1

Thinner thin clients perhaps?

Sounds like the CELL by Anonymous Coward · 2006-10-20 05:53 · Score: 0

Maybe it is also a move to compete with the Cell Broadband Engine, where the SPU's can act similar to a GPU.

Re:It's a logical extension of the NVidia NForce l by Anonymous Coward · 2006-10-20 05:53 · Score: 0

Wow, a systerm on a chip, never heard of those before...

Nvidia is the odd-man out by WoTG · 2006-10-20 06:01 · Score: 1

Nvidia doesn't really have a choice in this matter. They need to at least explore the option of creating a CPU or be relegated to niche status.

It's pretty clear that both Intel and AMD are intent on swallowing up the lower 3/4 (hand-waving guess) of the GPU market over the next few years. And I believe that ATI will still be fighting it out at the top end over that remaining 25%.

That would leave Nvidea as a niche player in the uber high end, making GPUs strictly for graphics professionals and gamers with too much money. You can survive as a niche player in technology... but it's a tenuous life. Just ask Cray, SGI, Transmeta, Sun, Cisco, and hundreds of others*. Only the latter two of resemble themselves in their primes -- but even then, they've lost influence.

* Whatever happened to Gravis and the Gravis Ultrasound?

Re:Nvidia is the odd-man out by spadefoot · 2006-10-20 07:36 · Score: 1

I had a Gravis Ultrasound... What a cool soundcard, light years ahead of Creative at the time, and a cool purple PCB, IIRC.

Re:It's a logical extension of the NVidia NForce l by ceoyoyo · 2006-10-20 06:05 · Score: 1

The problem with gluing the GPU and CPU together is that it'll be a humongous chip, with low yields and therefore very expensive -- more expensive than a comparable CPU and GPU separately.

Cyrix Media GX, anyone? by spedrosa · 2006-10-20 06:13 · Score: 1

This integration is nothing new. "Cyrix did it!"

First, processors now have integrated memory controllers. Now this.

Seems like Cyrix was way ahead of its time.

Re:Cyrix Media GX, anyone? by dbfruth · 2006-10-20 06:55 · Score: 1

I actually owned a Compaq laptop with a 180mhz MediaGX chip. It wasn't the fastest thing in the world but it worked well for 2D stuff. http://en.wikipedia.org/wiki/MediaGX

Re:It's a logical extension of the NVidia NForce l by Joe+The+Dragon · 2006-10-20 06:13 · Score: 1

and do you really want to take away system ram for video ram?
with vista high ram use 128+ just for video whould make it even worse.

Looking at this from the wrong perspective by Anonymous Coward · 2006-10-20 06:19 · Score: 0

What folding@home has shown us is that specialized hardware can be put to other novel uses, in some cases yielding huge performance gains. In this case, using graphics circuitry for the protein folding computations. All Nvidia proposes is to reduce latency by moving the GPU on-chip (and perhaps have the added benefit of locking you into a GPU of Nvidia's manufacture...).

However, there are many algorithms that are not optimally solved by throwing EITHER a CPU or a GPU at them. A more interesting idea would be in the form of on-chip programmable hardware a la FPGAs, in conjunction with a nice low-latency on-ship setup. Then, developers could push more interesting and advanced circuits onto CPUs, such as crypto engines or crackers. This would allow for readily incorporating new high performance codecs for crypto, graphics, audio, protein-folding, etc.

Intel is interested in something similar by Anonymous Coward · 2006-10-20 06:21 · Score: 1, Informative

A guy from Intel recently presented at a seminar at my university. He is working with a group that is pushing for a CPU architecture that looks kind of like a GPU, when you look at it at a very high level (and perhaps your eyes squinted just a bit).

The unofficial title of his talk was 'the war going on inside your PC'. He argued that the design of future CPUs and GPUs will eventually converge, with future architectures being comprised of a sea of small and efficient but tightly interconnected processors (no superscalar), and that it is basically a race to see who will get there first - the CPU manufacturers or the GPU manufacturers.

One of his main points was that with increased compiler effort, potentially many computational workloads can be made to run on the tiled architecture of simple processors, much in the way that the process of graphic rendering has been able to be shifted into the type of workload that can leverage the 'tiles of simple processors' found in a graphics card today, even though the nature of graphic rendering was originally better suited for execution in a typical CPU, where control dependent loads run efficiently. When the workload cannot be mapped to the 'tiles of simple processors' architecture, just slap a superscalar processor in the corner of your die (like nvidia seems to be doing) to take care of those small corner cases.

So, we will likely be seeing a lot more of this in the future. Especially now that AMD and ATI are together.

(More details on the abstract of the presentation I mentioned can be found here)

Re:Intel is interested in something similar by SillyNickName4me · 2006-10-20 07:28 · Score: 1

One of his main points was that with increased compiler effort, potentially many computational workloads can be made to run on the tiled architecture of simple processors, much in the way that the process of graphic rendering has been able to be shifted into the type of workload that can leverage the 'tiles of simple processors' found in a graphics card today, even though the nature of graphic rendering was originally better suited for execution in a typical CPU, where control dependent loads run efficiently. When the workload cannot be mapped to the 'tiles of simple processors' architecture, just slap a superscalar processor in the corner of your die (like nvidia seems to be doing) to take care of those small corner cases.

Where to start..

Intel has been saying this at least since they launched the i860 cpu over 15 years ago, yet any cpu platform actually counting on this happening has met with little success so far.

Counting on compilers to get parallelism and extremely complex instruction ordering right is unworkable for the simple reason that it depends on runtime conditions that are unpredictable. It is possible to do it for relatively simple and predictable situations however, but those happen to be rare corner cases.

For example, you cannot predict which data is going to be in cache at runtime. Because of not knowing this, you cannot predict how many cycles a memory access will take and consequentely you are unable to order instructions that access memory correctly. That is, unless you know exactly what code is going to run when, or when all the code and data that will be accessed fit in cache.

Just getting instruction ordering right for a relatively well defined case like the Ithanium seems pretty difficult already and this is contributing to the relative lack of popularity of that architecture I believe.

Interestingly, while it did not have video output capabilities or a framebuffer or such, the i860 cpu had a vector unit that has some similarity to todays GPUs... Seems Intel has been there before.

Re:It's a logical extension of the NVidia NForce l by ceoyoyo · 2006-10-20 06:21 · Score: 1

Presumably you'd still need separate graphics memory. We all know how well integrated memory works now.

Re:It's a logical extension of the NVidia NForce l by Doctor+Faustus · 2006-10-20 06:21 · Score: 2, Interesting

do you really want to take away system ram for video ram?
If using larger chips means I can get 2GB combined RAM for the price of 1GB system RAM and 256MB video RAM? Absolutely.

Re:Why multiprocessor units suddenly most efficien by SillyNickName4me · 2006-10-20 06:22 · Score: 2, Informative

Why are the multiprocessor units suddenly so popular, relative to why e.g. the Voodoo graphics cards failed? I remember them being ridiculed and ending up in the performance backwaters through their 2-4-8(-16) multiprocessor cards, but it seems that there are engineering reasons why multiple processors are now suddenly coming into favour, or?

multiple processors (CPU, GPU or otherwise) are a way to add more 'cycles' based on current technology. This has the advantage of being able to get more out of your current designs and manufacturing technology, but comes at the cost of increased complexity in both the supporting hardware, and in software.

Getting a single core implementation faster is always the more efficient way to add processing capacity, but it is very impractical beyond a certain point due to power and heat considerations (where that point is exactly depends on the state of technology at any given moment but in the end is limited by the physical size of molecules, at least for as far as current technology goes)

So, multiple processors is not directly better from an engineering point of view, rather, it is a solution to overcome the speed limits of current technology, provided you can deal with the extra complexity (moving much of the hardware complexity into the chip itself like AMD and Intel are doing now removes the burden from systemboard designers, but the complexity itself is still there, esp. on the software side of the picture).

With regards to 3dfx, it seems to me that:
1. They failed to manage the additional complexity
2. As their competition showed, limits of technology at that time were much higher then what 3dfx managed, which indicates there were problems with either their design or manufacturing technology, or more likely, with both.

Re:Why multiprocessor units suddenly most efficien by Doctor+Faustus · 2006-10-20 06:27 · Score: 1

it seems that there are engineering reasons why multiple processors are now suddenly coming into favour

Traditionaly, when you move to a more compact production process, your parts are closer together, so it takes less time for the electric signals to move through them (propogate), so you can get faster clock speeds without really changing the design much. When Intel reached the 90nm process (or maybe the one before -- 130nm?), they were startled to discover that that effect just didn't work anymore. The chips were smaller, and thus cheaper to make, but they didn't work any faster than the old ones. AMD apparently ran into the same thing when they reached that size.

With their easy gains in clock speeds at a dead end, they went looking for something else to improve, and more CPUs was it.

Re:Why multiprocessor units suddenly most efficien by mrkh · 2006-10-20 06:27 · Score: 1

Mostly because it's a different point in the life cycle. Many of the easy performance gains in CPU's have now been taken - it wasn't so long ago that a 100MHz clock increase was sensational, but now it's expected. It's getting tough to eke out big performance wins in each generation now, so it's easier to move sideways to multiple cores. Parallelism is the future (witness the supercomputers).

I don't know the 3dfx history all that well, but I'd *guess* that their cards were getting hard to dig more performance from, and that they went two ways - one to work on a new core, one for multiprocessor. They could at least keep some market share if they got something out quickly. Unfortunately, the new core probably took too long, and the MP wasn't enough ; there were still new things to try for single GPU's, and they ended up with an expensive, slow card.

Re:It's a logical extension of the NVidia NForce l by julesh · 2006-10-20 06:47 · Score: 1

Well, yeah, but you've never heard of one that can run Windows Vista before, I'll bet. :)

Not so much by Sycraft-fu · 2006-10-20 06:56 · Score: 4, Informative

System RAM is SLOW compared to GPU RAM. PCIe actually allows very high speed access to system RAM, but the RAM itself is too slow for GPUs. That's one of the reasons their RAM amounts are so small, they use higher speed and thus more expensive RAM. Also because of the speed you end up dealing with cooling and signal issues which makes it impractical (or perhaps impossible) to simply stick it in addon slots to allow for upgrades.

Even fast as it is, it's still slower than the GPU would really like.

What you've suggested is already done by low end accelerators like the Intel GMA 950. Works ok, but as I said, slow.

Unless you are willing to start dropping serious amounts of cash on system RAM, we'll be needing to stick with dedicated video RAM here for some time.

Re:Not so much by julesh · 2006-10-20 07:08 · Score: 1, Troll

System RAM is SLOW compared to GPU RAM.

Huh? All the systems on this lineup use standard PC3200 (DDR400) RAM. Which is the same RAM that you could use as system RAM with many motherboards (e.g. this one). I don't see why the RAM would be faster on the video card than in the main system...?

Also, a GPU inside the CPU would get to benefit from the CPU's cache, which would usually contain any data that had recently been modified by the main redraw thread, thus eliminating the need to go out to get data from that memory. I wouldn't be surprised to see something like a 25% reduction in the amount of memory access the GPU needs to perform.

What you've suggested is already done by low end accelerators like the Intel GMA 950. Works ok, but as I said, slow.

The reason this is slow is (a) that it's an intel GPU, and they've never been any good anyway, and (b) that it has to contend with the CPU for access. If you integrate the two into a single component with an nVidia designed GPU, I don't see any reason it couldn't perform as well as current separate systems do. And cost significantly less, and use less power.
Re:Not so much by ZachPruckowski · 2006-10-20 08:25 · Score: 1

Huh? All the systems on this lineup use standard PC3200 (DDR400) RAM. Which is the same RAM that you could use as system RAM with many motherboards (e.g. this one). I don't see why the RAM would be faster on the video card than in the main system...?

You are correct that those systems are using DDR RAM. But graphics cards (including the cards in those machines) use other, more expensive, faster RAM, like GDDR3.
Re:Not so much by NerveGas · 2006-10-20 09:44 · Score: 1

My lowly GeForce 6800 GT has a 256-bit, 1GHz (500x2) memory interface. As a comparison, an Athlon64 has a 128-bit, 400 MHz (200x2) memory interface, giving my GPU five times more memory bandwidth than the CPU.

I've used GPUs that utilize system RAM from Intel and NVidia, and they're always slower compared to their counterparts with dedicated memory. The 6200 TC does a fair job, but that's because it has 64-256 megs of dedicated memory AND THEN uses system ram as well.

Believe me, if video card manufacturers could make their chips work as fast by accessing system RAM, they'd jump at the chance. RAM has been one of the greatest hinderences in the GPU field. They finally abandoned the JEDEC and started their own standards (the GDDR series), and the industry has devoted a LOT of money to solving the problems. If they would just utilize system RAM, it would greatly benefit everyone, but unfortunately, it just doesn't work as well.

Steve

--
Oh, you're not stuck, you're just unable to let go of the onion rings.

Some things you forget by Sycraft-fu · 2006-10-20 07:05 · Score: 3, Informative

1) Processors are wicked fast at floating point these days. Have a look at the benchmarks a modern chip using SSE2 can do some time. Integer doesn't inherently mean faster, and chips these days have badass FPUs.

2) For many things, it DOES make a difference. You might ask why do we need more than 24-bit (or 32-bit if you consider the alpha channel) integer colour? After all, it's enough to look highly realistic. Yes well that's fine for a final image, but you don't want to do the computation like that. Why? Rounding errors. You find that with iterative things like shaders doing them integer adds up to nasty errors which equals nasty colours and jaggies. There's a reason why pro software does it as 128-bit FP (32-bits per colour channel) and why cards are now going that way as well.

3) In modern games, everything is handled in the GPU anyhow. The CPU sends over the the data and the GPU does all the transform, lighting, texturing and rasterizing. The CPU really is responsible for very little. With vertex shaders the GPU even handles a good deal of the animation these days. The reason is that not only is it more functional but it's waaaaay faster. You can spend all the time you like trying to make a nice optimised integer T&L path in the CPU, the GPU will blow it away. You actually find that some older games run slower than new ones because they rely on the CPU to do the initial rendering phases like T&L before handing it off, whereas newer games let the GPU handle it and thus run faster even though having higher detail.

Actually... by ichigo+2.0 · 2006-10-20 07:05 · Score: 1

Actually only the size of the frame buffer is multiplied, i.e. a scene at 1024x768x32 bit would take 3 MB without anti-aliasing, and at 1600x1200x32 bit 7.3 MB. With 4xFSAA those scenes would take 12 MB and 29.2 MB respectively. The rest of the memory is used for texture & vertex data, and other miscellaneous stuff like shaders. Those don't grow in size with the display resolution.

No not so much by Sycraft-fu · 2006-10-20 07:11 · Score: 1

CPUs are very important for good games, and only getting more so. You think MS paid for a 3 core PPC CPU in the 360 for fun? If a low power cheap part would do the trick, they'd have jumped on it.

Even after you've offloaded all the graphics, there's still a ton to be done. AI, physics, and sound to name just three. Fire up a copy of Oblivion some time and you'll be amazed how hard it hits the CPU. It even makes use of dual cores, if they are available. Yes, a prime factor in game speed is having a GPU that can keep up with the shiny graphics, but equally important is a CPU that can keep up with the game data. If you stuck a 7900GTX on a P4 1.6GHz the game would look beautiful, but run slow (if it ran at all) because the GPU would be sitting waiting for the CPU to get it stuff to render.

For cool, latest-gen games, you need both a good GPU and CPU.

Re:No not so much by The+Warlock · 2006-10-20 07:14 · Score: 1

Not every computer is for games, you know. Not even more than like 10%. The cheap Wal*Mart Special, the mass produced home/corporate desktop, is where the real money is.

--
I've upped my standards, so up yours.

advantage of seperate companies by peter303 · 2006-10-20 07:16 · Score: 1

Intel has tried to integrate specialized I/O on their chips more than once, a nd so have other companies. Then they end up with something neither best in either and become commercial failures.

What's that got to do with anything? by Sycraft-fu · 2006-10-20 07:34 · Score: 5, Informative

Yes the SYSTEMS Tom used to test have normal speed ram for systems. Duh. The graphics cards, however, have much faster RAM. For example my system at home has DDR2-667 RAM. That's spec'd to run at 333MHz which is 667MHz is DDR RAM speak. My graphics card, a 7800GT, on the other hand has RAM clocked at 600MHz, or 1200MHz in RAM speak.

Not a small difference, really. My system RAM is rated to somewhere around 10GB/second max bandwidth (it gets like 6 in actuality). The graphics card? 54GB/sec.

Video cards have fast RAM subsystems. They use fast, expensive chips and they have controllers designed for blazing fast (and exclusive) access. You can't just throw normal, slow, system RAM at it and expect it to perform the same.

Re:What's that got to do with anything? by DragonWriter · 2006-10-20 09:13 · Score: 1

Video cards have fast RAM subsystems. They use fast, expensive chips and they have controllers designed for blazing fast (and exclusive) access. You can't just throw normal, slow, system RAM at it and expect it to perform the same.

Of course, Nvidia could plan to use blazingly fast RAM like that used on video cards now as the system RAM on motherboards supporting their CPU-GPU hybrid, which would solve the problem nicely, though it might drive the price up quite a bit. (Then again, it would improve system performance in other ways, and perhaps Nvidia sees the future as demanding faster RAM access for more and more applications besides traditional GPU-heavy ones, in which case this kind of thing would make a lot of sense.)
Re:What's that got to do with anything? by julesh · 2006-10-27 20:24 · Score: 1

OK, I misread the specs. However, if you look at the cost of video cards with varying amounts of memory, you can start to get an idea of how expensive the memory on them is. For instance, I can get a 7800GTX-based card with 256MB for £336.00. The next card up is a 7950GTX, but let's ignore that difference and concentrate on its 512MB. It costs £355.16. That's a difference of just under £20 for 1/4 gig. This suggests that the RAM used on these cards costs about £80/GB. My best price for DDR667 RAM is £64/GB. It's not going to drive the cost of a system up by much if this RAM is used on the main system. Or if the processor has a 512MB cache of the stuff right next to it, or something.

Re:It's a logical extension of the NVidia NForce l by Tillmann · 2006-10-20 07:35 · Score: 1

They don't even need to license it. Today, GPUs are more complex than standard x86 CPUs. So if nVidia can handle GPUs, they can handle CPUs. They have the tools, the know-how, and the engineers. Developing their own x86 CPU shouldn't be too hard for them.

Yes but those by Sycraft-fu · 2006-10-20 07:42 · Score: 1

Don't need good graphics. An Intel GMA950 does just fine for desktop use. Hell, it actually can play older games, and is enough to handle AeroGlass in Vista.

The problem is the GP seems to think that nVidia would have a viable market combining their excellent graphics chips with shitty processors. Nope, not so much. If you want a good graphics card, you pretty much by definition also want a good processor. There are actually plenty of people who can use a good processor and have no need for a good video card, but the reverse is very rarely true. Games are the big use, but any other app I can think of, like HFSS or Solidworks, where good 3D card is a benefit, you also need a good processor.

So ya, nVidia could try the low end market, but there's some real stiff competition there in the form of Intel and their GMA series. That's what the cheap computers now feature, a cheap Intel cheap and their cheap accelerator. However those that think they could combine a good GPU with a cheap CPU as a gaming system are off base.

Re:Yes but those by TheRaven64 · 2006-10-20 10:24 · Score: 1

The two biggest growth areas at the moment are high-density servers and portables. Servers don't really need a GPU, unless they are for HPC (in which case, maybe). Portables, at the extreme portable end, don't need a great CPU or a great GPU, but having a so-so CPU and GPU on the same die is a recipe for a cheap system that's not very power-hungry. In the sub-notebook arena, this easily translates to several million sales.

--
I am TheRaven on Soylent News

Re:It's a logical extension of the NVidia NForce l by chipace · 2006-10-20 07:43 · Score: 1

The one-chip PC is a falicy... if the integration of all blocks produced higher performance, then you can charge more for that integration. The slower blocks (ATA, USB, PCI) don't perform significantly better when integrated, thus consumers don't value integration (board manufactures will, but only by a fixed amount).

If you integrated the gpu/ethernet/sdram/cpu, then all the high-speed blocks can talk with each other at die level speeds. It's clear as day that performance will significantly increase.

The southbridge IO (ATA USB PCI) requires many pins, and moving them to a single chip doesn't save enough money at board level to justify the added expense at die level. The power cost of running a bus to the southbridge is tiny compared to the total system budget.

Re:It's a logical extension of the NVidia NForce l by Joe+The+Dragon · 2006-10-20 08:05 · Score: 1

But the combined RAM is slower then video ram.

Re:It's a logical extension of the NVidia NForce l by hasdikarlsam · 2006-10-20 08:21 · Score: 1

GPUs are larger, but not more complex. It's a fairly homogenous array of execution units - massive SIMD, in other words.

It doesn't take anywhere *near* as much knowhow to produce a GPU as it takes to produce a CPU - and it's an entirely different sort of knowhow, as well.

nVidia might have a chance at producing a (special-purpose, GPU-like) supercomputer chip; they'd have serious trouble producing an x86/x86_64 CPU competitive with what Intel and AMD make. Of course, they know that themselves, so they're not doing that.

I want CPU + heap-o-FPGAs by said_captain_said_wo · 2006-10-20 08:29 · Score: 1

Are we just waiting for patents to expire to see more FPGAs in general purpose systems?

How about a Xeon-socket GPU? by Maury+Markowitz · 2006-10-20 08:44 · Score: 1

Imagine a GPU that plugs into the existing Xeon (Core 2 versions) socket. Now the four-core CPU in the other socket can talk to it over the memory bus. Put a separate bus on the "far side" for the GPU's local memory and/or framebuffer.

Re:It's a logical extension of the NVidia NForce l by Doctor+Faustus · 2006-10-20 09:05 · Score: 1

Historically, yes, but RAM's been getting faster lately, and I expect that will continue for the next couple of years before these chips are expected to come out.

It's a cycle by mr_mischief · 2006-10-20 09:27 · Score: 1

Once upon a time, there were mainframes, and computing was good. Only it was only good for large universities, governments, and a handful of corporations.

Then, there came minicomputers. Fewer components meant smaller machine with smaller prices. Banks, small colleges, and even some high schools got into the game. My high school had an AS/400.

Then came the micros. These are what we call PCs now. They started as hobbyist toys. Then, the chips got more powerful and more memory was put in them, and they started to be more powerful than the minis of a few years before.

There were math coprocessors, but those got absorbed into the CPU. Then came accelerated 2d video, and later accelerated 3d video. Sound processing appeared on expansion cards, and then on the system board.

Then came bigger mainframes, faster minis, and high-end workstations. Micro users wanted more power. Micros were made with more instructions, some that did things with more bits at a time. Then the internal scheduling was made so that multiple instructions could be in different stages of being processed simultaneously. It was called pipelining, and the users saw that it was good.

Mainframes grew again, these things called servers that had a brain like a micro but a body like a pizza box but with lots of IO strength and more mouths and ears sprung up in the Data Center, which had been just the closet next to the coffe pot. People saw that servers were good, because the processing went on in the Data Center instead of on their Micros. And the servers sometimes joined together into great hordes called Clusters that could challenge any mainframe.

Then there were vector instructions added to the PUs. And some PUs could do a vector instruction and regular instructions both at once. And the vector unit stole some of the glory of the graphics accelerator.

People put two CPUs (or PUs, since they're not really central at that point) into one system, then four or eight. Then someone put two logical PUs on one chip.

Someone figured out that lots of the processing for graphics was still done on the CPU -- and more so now with vector instructions. The graphics accelerators grew their own vector units, and become very wise in the ways of the vector and in pipelining. More graphics work was placed with the accelerators than ever before.

A Major Designer of PUs determined that what was good and Intel-ligent for math processors in the beginnings of the micro was still good for other types of processing on the micro. The path had widened again, and two of these wider PUs per chip was becoming the norm. The die had shrunk yet again, meaning that now a stronger PU was now smaller than ever. Even more vector units could be fit on a chip, despite the fact that the chip held two whole PUs. In Embedded Land, in the valley of Console, a strange company called So Knee had put one PU on a chip with a massive vector unit, and called it Sell. A Major Designer determined that this was good, and noticed that it looked something like a PU and something like a graphics accelerator.

Now, graphics accelerators had become known as GPUs, because they were doing their own fair share of Processing. They had started to team up as well, sometimes with more than one GPU inside one Micro. Not since the Friendly days of The Great Chicken Head had so many chips gone into one machine. Most micros had at least two PUs, even if they were on one chip. They had one or more GPUs, even though the "G" was questionable since they could do other kinds of Processing, too. They had sound, often from a DSP. They had Networking, which is those extra mouths and ears of which the Servers have in such abundance.

And so, the die was shrinking again, and A Major Designer was swallowing a maker of GPUs. The socket for a different type of processor, which had been so Intel-ligent so long before, started to look like a good place for a GPU. And with the die shrinking, making more room on a chip, the thought of a CPU and a GPU together started look

What's really creepy is by Solr_Flare · 2006-10-20 11:03 · Score: 1

Many years ago, when the 3d video card revolution was taking place under the helm of companies like 3dfx, Intel announced that eventually video cards would become obsolete. I forget the exact time frame but I believe they said by 2006 to 2008 cpus would be handling the majority of video processing.

Now, this statement was made before the cpu manufacturers started hitting the brick wall, and 3d video technology was still in relative infancy. So, to be fair, we should probably tack on another 5 years onto that estimate. Once we do so, the market seems to be heading in the very direction Intel predicted between Cell and GPU/CPU integration, we aren't too far off from this becoming common place at all.

--
You are who you are, let no one tell you different. But, never close your mind to a new point of view.

Why isn't a console maker doing this too by SnarfQuest · 2006-10-20 11:16 · Score: 1

Why doesn't one of the game console makers take their favorite processor, and add 8 GPU's to it. Maybe join forces with a large cpu maker, like IBM. They could give it one of those silly names, like "cell processor" and make a game console based on it. If it could run Linux, and folding@home, that would really be a bonus.

Everyone on slashdot would surely want one of these, even if it were a little overpriced.

--
Who would win this election: Andrew Weiner vs Andrew Weiner's weiner.

Re:Should Slashdot really insult other news outlet by Bingung+Balik · 2006-10-20 11:23 · Score: 1

The Inquirer is more of a rumor site than a news site

News is just rumours that have come true.

The New AMD + nVidia World of Desktops? by seamus5 · 2006-10-20 12:24 · Score: 1

I was just having some crazy thoughts, how about this; AMD liceses HT, cHT and various x86 technologies to nVidia to help them get competetive in the x86 market. AMD/ATi and nVidia manufacture all of their discrete graphics chips on the cHT platform effectively locking intel out of the CPU + discrete graphics market segment. Intel is confined to their own integrated platform for low in Desktop and and Laptop configurations. AMD/ATi and nVidia reap the spoils of an enlarged market now that intel is locked out. AMD becomes the new intel, nVidia becomes the new AMD (in terms of the current x86 landscape)

AI by Anonymous Coward · 2006-10-20 21:09 · Score: 0

If they were to do it I'd suspect that this also has something to do with that talk of using GPUs for AI processing. The main problem they are having at the moment is that graphics cards are designed to send data back to the CPU which would be needing for any AI processing which wasn't simply graphical in nature. If the CPU and GPU were on the same piece of silicon (or more likely completely integrated together into a single or dual core microprocessor), information could easily be passed back and forth to allow for not only very fast graphics, but also AI. The question is though whether nVidia would be able to make a cpu which could compete with offereings from Intel and AMD.

Slashdot Mirror

Nvidia Working on a CPU+GPU Combo

178 comments