NVIDIA Unveils (And Tom's Reviews) The GeForce4

Can't stand it by darketernal · 2002-02-06 03:08 · Score: 3, Insightful

I almost can't stand it when I buy a new flashy graphics card that is praised by every magazine, and then a NEWER card comes out, that supports DX8 pixel shaders, etc., etc. (IE I bought a Radeon 64MB DDR card....two weeks later, hello GeForce3)

I hope if I buy a GeForce4, it'll last, in both speed and 3D technology.

Re:Can't stand it by Zathrus · 2002-02-06 03:34 · Score: 3, Interesting

Which merely proves that you haven't read the article, or pretty much ANY article on nVidia cards.

The MX isn't a stripped down GeForce3/4 - it's a totally different chip without nearly any of the features that make the GF3/4 powerful and a good match for today's and tomorrow's games.

The MX chips lack any vertex or pixel shaders. Yes, the GF4 MX has limited vertex shader support, but it's more akin to the GF2 shader than anything else.

Go look at the benchmarks. There's a reason that the MX line score so far below the regular ones. And a reason why they're performing abysmally in DX8 games - they aren't DX8 compliant. It's about like getting a 2D card and trying to run Quake with it - it simply doesn't have the guts needed to do it.

If you want to go on the cheap, pick up a full fledged GF3, GF3 Ti200, or the as-yet-unreleased GF4 4200 (I think that's the designation). All have the hardware needed for DX8 games (and contrary to the articles and to what some would have you believe, there are games out right now that make use of DX8 and these cards - one of them is Everquest), and they're cheap - under $200. I suspect the GF3 Ti200 will be heading toward $100 very soon now.

Personally I bought a GF2 the 2nd day it was out. I paid $350 for it. I would've liked to wait for a bit of a price drop, but my new computer wouldn't work with my old cards (dual Voodoo2 at the time). That was two years ago, and my GF2 is still perfectly acceptable for playing games. It's a bit slow in EQ, but I'll live. It won't handle the upcoming games though.
Re:Can't stand it by Geek+In+Training · 2002-02-06 03:37 · Score: 5, Insightful

"I hope if I buy a GeForce4, it'll last, in both speed and 3D technology. "

Hey buddy, cope. If you're not realizing by now that you're going to be able to run the latest and greatest games with all the eye candy without shelling out $200-$400 every 12 months, that's YOUR problem, not the industry's.

This is like my neighbors who were mad at *ME* for telling them they could not load WindowsXP on their 486 DX2/66. But we paid $4000 for this machine ten years ago! That was almost half the cost of a car! And it still works! They ended up going to WalMart and buying an HP, with monitor and CD burner, for $699. Now they quit whining... until 5 years from now then it won't run Microsoft AOL version 15.2.

As for me, I have too many other interests to shell out $400 for a video card. I buy games 18-24 months after they come out, at the $19.95 (or lower) price. I *NEVER* pay more than $130 for a video card, and I'm extremely pleased with my price/performance return. Go look at newegg.com for the GeForce2 GTS-V for $49 and you'll see what card I'm running; it gives me 70 frames per second at high quality in Quake3.

If that isn't enough for you, well, I'm sorry, you're just going to have to pay more for the Cadillac.

--
SlashSigTheorem: Humorous, Political, Critical, Constructive- If you have a .sig, someone WILL complai
Re:Can't stand it by Tim+Browse · 2002-02-06 04:08 · Score: 3, Funny

It's about like getting a 2D card and trying to run Quake with it - it simply doesn't have the guts needed to do it.
Um, IIRC, Quake was software only. Having a 3D card didn't help you any until GLQuake was released. In any case, Quake has always run fine on 2D-only cards - that was the target market, after all.
Don't get me wrong - the point you're making is partially correct (although I don't agree that pixel/vertex shaders are in such widespread use that not having hw support means all your games look bad) but at least get your back-up facts correct :-)
Tim

apple by SlamMan · 2002-02-06 03:08 · Score: 3, Insightful

And in an almsot suprising move, apple's offering as a build to order option in their towers (announced yesterday. For a company that almsot always has hidiously slow graphics cards, its kind of a nice change tosee them ahead of the game for once in this department.

--
Mod point free since 2001

Re:apple by Anixamander · 2002-02-06 03:16 · Score: 3, Informative

Actually, when they announced the speed bumped towers a few weeks ago, they noted that the higher end ones included the GeForce4. Of course, nVidia had not announced the existence of such a product yet, leading to some speculation here on slashdot.

As far as Apple having a history of slow graphics cards, they have done pretty well in the towers for the last year or two. They were the first (by a couple of days) to have the GeForce 3 even.

--
Do not taunt Happy Fun Ball(TM)
Re:apple by gamgee5273 · 2002-02-06 03:18 · Score: 4, Informative

I'm curious as to what you mean here. Apple has had the GeForce3 in the Power Macs for the past nine months (roughly), and since the blue & white G3 they've had the top of the line (or close to) ATI cards in the Power Macs (at least as an option). And, while Apple had little to do with it, 3dfx was supporting Apple with Voodoo cards from Voodoo 3 on up (I'm using a Voodoo 3 in my PM 6500).
You can't go off of the chipsets in the iMacs - the iMac was essentially a laptop with a CRT on it (now it's a laptop in a bigger package). But the G3s (after the beige boxes) and G4s (G4s especially) have always had strong card options, both at Apple and outside of it.

Another article by SILIZIUMM · 2002-02-06 03:09 · Score: 5, Interesting

There is another article at Anandtech too, it's quite a good read. Contains pictures, benchmarks, etc.

http://www.anandtech.com/video/showdoc.html?i=1583

... by lexcyber · 2002-02-06 03:10 · Score: 4, Funny

And to everyone's suprise. Geforce4 is faster then
the previous chipsets. Has more pipelines and
bigger memory bandwidth. When will someone try
the new and fresh marketing trick and announce
hardwarre that is slower then the old hardware.
(I hope MS didnt hear this and starts making hardware)

--
- To understand recursion, we must first understand recursion -

cheap geforce3 by belterone · 2002-02-06 03:21 · Score: 3, Interesting

LeadTek has a Geforce3 Ti200 with 128M of memory
for under $200. I just got one of these a
couple of days ago. Heaviest video card I've ever
owned. Looks great in windows. (I did windows
first because I knew it would take longer). If anybody's curious, mail me; I should have it
working under linux tonite if nothing comes up
after work.

funny story: I upgraded my mobo as well to
a soyo dragon+... That thing does NOT turn off
power to the keyboard or ps/2 mouse port when it
powers down. I finally had to unsolder that idiot
taillight on my MS optical mouse so I could get
some sleep.

--
I can't find my car keys. (no a's in email)

Tom's is going downhill. by joshsisk · 2002-02-06 03:22 · Score: 5, Insightful

After this article and yesterday's overly-glowing review of the Xbox, it seems to me that Tom's has fallen on hard times. Consider the following sentence:

"The test guys who aught [sic] to have caught this driver bug seem to be busy selling their stock our [sic] counting their money instead."

All their articles now seem to have been written in five minutes and sent though to door without the slightest bit of editing- or even spell checking!

I don't mean to nitpick, but Tom's used to be a very reliable source- and a great read. Not so much anymore.

Re:Tom's is going downhill. by mrphrtq · 2002-02-06 03:27 · Score: 3, Funny

You mean there are words on Tom's Hardware? Man, I thought it was just benchmark graphs. I heard a rumor that Playboy had "words," and now this.

--

"Life has improved immeasurably since I have been forced to stop taking it seriously." - Hunter S. Thompson

What's the point? by filtersweep · 2002-02-06 03:28 · Score: 4, Insightful

Does anyone use these cards for anything other than games?

These cards cost as much as a decent CPU... or a console game system- yet are the fraction the cost of a CAD card. Their shelf life seems pretty limited as well. In a year or two they will all have a half gig of Rambus or DDR and we'll have 16X AGP? Then we'll all need high definition monitors because today's pixels will all look "blocky" by comparison. Then we'll be right back to unusable framerates at higher resolutions... it all goes full circle.

I've never been able to justify the cost, but then again I don't game. The ironic thing is that "fun and games" arguably stress the hardware more than any other apps for most general home users.

--

Those that suggest you "dance like no one is watching" really want to see you make a complete fool of yourself.

A new watershed in (c/g)pu history by yoink! · 2002-02-06 03:36 · Score: 4, Insightful

The THG article indicates that for all intents and purposes, the average home-computer user still has enough power in his 700-1000MHz machine that upgrading to the rediculously overpowered 2GHz P4s and Athlon XP 2000+ etc, just isn't worth it for them (unless of course their livelihood is dependant upon computing time). I believe the same is starting to happen in the GPU field as well. A brother of mine recently bought a GeForce 3 card, just after the introduction of the whole Ti 500/200 updates. To this day it's still more power than he needs and should be able to outlast the TNT2 Ultra card he replaced it with. The main point being that except for those people that crave "the fastest," and there's nothing wrong with that ;-) , these incremental increases in performance are going to mean less and less to the consumer, most of whom go to the biggest electronics store around and say "my kid needs a special 3d thingy to play this new game." Although I honestly believe people would be happier if they informed themselves a little, it's impossible to think that they will and in the end it doesn't matter. We've been years away from any new device that shows real promise, instead the best some people can come up with is an integrated cell-phone / PDA. Hmmm... who would have thought... until something does show up... I'll be playing Quake on an 8MB single-head graphics card. Humiliation!

Anandtech's review by GweeDo · 2002-02-06 03:42 · Score: 3, Informative

Anandtech has quite a good review here. They also have benchmarks from the lastest build of the unreal engine here. Enjoy :)

--
Unstable Apps: Our Android Apps Don't Suck

A question for John Carmack by DG · 2002-02-06 03:47 · Score: 3, Interesting

I know you're out there John. :)

Lemme ask you this: it seems that with the previous generation of 3D cards, the technology had reached the point where any game with a reasonable game engine could be run at 1024X768x32bit with all the detail goodies turned on at framerates that were completely playable.

(Perhaps this is a mistaken assumption?)

If so, then what does this card bring to the table from a game designer/coder's perspective?

If there's no point in driving a Quake3 style engine any faster (because it's already fast enough) then what will you be able to do with this new hardware that you couldn't do with older stuff?

Or to rephrase, what hardware feature do you most wish was availible on the current generation of 3D cards, and does this new card have that feature?

DG

--
Want to learn about race cars? Read my Book

Re:A question for John Carmack by Geek+In+Training · 2002-02-06 04:43 · Score: 3, Interesting

If there's no point in driving a Quake3 style engine any faster (because it's already fast enough) then what will you be able to do with this new hardware that you couldn't do with older stuff?

IANJC, but I think I can try an answer.

I play Quake3 online about 2 hours a night. At 1024x768x32, no less, on a TFT (which effectively limits famerates since the refresh is so much slower than modern tubes).

70 fps is plenty good for me in Quake3, and I don't really have much desire to go higher. But my GF2 won't do 2xAA over about 9 frames per second. This helps smooth out the picture considerably. So the newer cards will support the same old engines running with full-scene anti-aliasing at 4x at a "usable" framerate. No big change for coding there.

Another thing John talked about in his last remarks, though, was poly count. Your models and scenes can have beaucoup more polys when you juice up the core speed on a new processor, making the whole gaming experience a lot more realistic-looking.

John also talked in previous .plan files about vertex and pixel-shading, and how applying multiple lighting effects on single pixels can make things a lot cooler in actual gameplay. The eyecandy factor for this is hella-big.

As a side note, the one disappoitning thing is that while the GF4Ti cards (NV25 chipset) include a second Vertex Shading Unit in teh chip, there is *NO* dedicated pixel shading unit at all, as there was in the GF3. Why is this??

They go into this in the Tom's article, and it sounds as though the NV25 still supports pixel shading versions 1.1 and 1.3 (whatever that means), but won't support 1.4 until the next chipset. And *THEN* they should be fully DirectX 8.1-compliant.

On the other hand, why should that matter, as John Carmack uses OpenGL, not Direct3D. ;D

--
SlashSigTheorem: Humorous, Political, Critical, Constructive- If you have a .sig, someone WILL complai
Re:A question for John Carmack by MisterBlister · 2002-02-06 05:24 · Score: 3, Insightful

He meant that the Quake 3 engine runs fast enough on the lastest cards. Ie, buying a GeForce4 is not going to improve your Quake experience if you already have a GeForce2 GTS or so.
Of course the Quake 3 engine runs fast enough on the latest cards, it was released last millenium! (1999)
All of this changes this year when the new Doom engine is out -- as has been repeated to death, it will generally require a GeForce 3 or similiar card to run at a reasonable speed.
The hardware makers are refreshing their products a lot faster than the game development houses can keep up, as these days its starting to take nearly 3 years (on average) to develop a top quality game (in part this is the fault of the newest hardware, more polygons == more complexity for the artist, etc). What this means for the average user is that getting the latest, greatest videocard is not a wise idea unless you have lots of money to burn and an itch to have the 'fastest' hardware. It will be months if not years until games catch up with the hardware, this delta between games and the hardware is likely to get even larger as we move forward; nature of the beast.
And to answer the original question, though I'm not John Carmack what I'd like to see is more polygons, more fillrate and a more general programmable GPU interface allowing for really interesting code to be running on the video card.

What else is there? by Galvatron · 2002-02-06 03:49 · Score: 3, Interesting

Seriously, are there any competitive alternatives to NVidia these days? Personally, I'm starting to think about replacing my TNT2, but I'd kind of like to get something with open source linux drivers. At the same time, I don't want to have to go back to a Voodoo 5 or some shit like that just because it is open.

So, does any company make good graphics cards with open specs?

--
"The question of whether a computer can think is no more interesting than that of whether a submarine can swim" -EWD

Re:What else is there? by Odinson · 2002-02-06 05:23 · Score: 4, Insightful

Seriously, are there any competitive alternatives to NVidia these days?
Strangly few slashdoters want to talk about this.
Personally, I'm starting to think about replacing my TNT2, but I'd kind of like to get something with open source linux drivers. At the same time, I don't want to have to go back to a Voodoo 5 or some shit like that just because it is open.
I totally agree. Not only would I buy such a card myself but I would advertise it to everybody I know as the best(most flexable) solution.
So, does any company make good graphics cards with open specs?
The Raedon 7500 (AIW as well?) is the best non-nvidia card in xfree (4.2) right now.
The Xfree guys are working on the 8500, but who knows.
The problem is a one-two punch
Nobody bothers to try with Linux since good free closed source drivers are made availible.
Nvidia bought one the players and shrunk it two a two way race.
I would care less if Nvidia had bought 3dfx or released their own closed drivers, but both.....

--
Novel theory: Modern Man evolved from psychopath

Best Buy by antisocial77 · 2002-02-06 03:51 · Score: 3, Redundant

Um... this has to be a mistake, but apparently Best Buy is letting you Pre-Order these little slices of heaven for $129.00
Check it out.

Re:Best Buy by Julius+X · 2002-02-06 05:10 · Score: 3, Informative

Prices and availability are subject to change without notice. Errors will be corrected where discovered, and Best Buy reserves the right to revoke any stated offer and to correct any errors, inaccuracies or omissions (including after an order has been submitted). Best Buy may, at its own discretion, limit or cancel quantities purchased per person, per household or per order. These restrictions may include orders placed by the same BestBuy.com account, credit card, and also orders which use the same billing and/or shipping address. Notification will be sent to the e-mail and/or billing address provided should such change occur.
-From the BestBuy website.

So this means that this probably won't be honored. Bummer.

--

-Julius X
remove "-whatkindofspamdoyoutakemefor-" from email to send
Re:Best Buy by moonsammy · 2002-02-06 06:29 · Score: 3, Informative

No, Best Buy's cost is most likely somewhere in the neighborhood of $350-$375. They make a *huge* profit on things like cables, but computer parts are usually pretty reasonably priced. I bought a Radeon 8500 from them a few months ago using an employee discount, and the price dropped from about $290-$260. I imagine the ti4600 is quite similar in markup.

Re:Some respect, please by Don+Negro · 2002-02-06 03:56 · Score: 3, Informative

Well, there is the normal average-joe meaning of interesting and there is the understated-all-to-hell meaning of interesting.

An example of the latter: at the University of Texas at Austin Hans Mark - former Director or NASA Ames, Deputy Administrator of NASA, ect., ect. - used to teach a class in which the Airborne Laser system used to become a topic of conversation. When asked about its range (since he'd seen the classified testing documents), all he'd say was that it was effective at a 'militarily interesting distance'.

Now, that's a far cry from Tom's Hardware and the GeForce4, but maybe they're trying to get a little reflected glory rather than simply grossly underusing the language.

We can hope, right?

--

Don Negro
Perl 6 will give you the big knob. -- Larry Wall

Here are the links to the RealVideo movies by fluor2 · 2002-02-06 03:59 · Score: 3, Informative

here are the links for the gf4 in action. i think the resolution is pretty high. I can't wait for the Doom3 on this card.

Squid
Wolfman (i guess this is the best)
Tidepool

Looks like they had some spelling errors on some of the videos (they spelled content as contnent).

the $150 video card rule... by HomerJ · 2002-02-06 04:39 · Score: 4, Informative

If you're sick of all these senseless video card upgrades, just follow the $150 video card rule. No game is really going to take full advantage of a card less then $150. If you're paying more then that, you're wasting money.

Your money would be better spend putting the extra money towards a better monitor for instance. Be surprised the number of people that spend $400 on a video card to play on a $150 montior. Then wonder why things are still jumpy. A nice subwoofer and new speakers would also enhance your gaming experience.

Re:Some respect, please by hyoo · 2002-02-06 04:40 · Score: 3, Funny

I've noticed that /. uses the word 'interesting' when an article/review/benchmark doesn't show the community's favoured product (linux/AMD/ATI) as a superior one.

Most slashdotters see nVidia as an evil corporation because they don't open source their drivers for linux. This leaves ATI as the favourite. The benchmarking shows that in almost every test (except aniso) the GF4 smokes the 8500, therefore the results are summarized as 'interesting'.

If the ATI card actually did outperform the nVidia one, then the post would contain something like "ATI crushes the evil nVidia, we are 1337".

I'm not the one to look up previous articles, but I do recal some benchmarks (biased or not) where NT/2000 did something better than linux. The poster stated that the results were "interesting".

I think this is slashdot's attempt to hide the truth that it is possible for the 'evil' corporation to do something good.

On another note, who else thinks that it is pointless to use Q3 as a benchmark. Start using RTCW or another game that actually makes modern cards break a sweat.

Re:Is this really needed? by NonSequor · 2002-02-06 04:42 · Score: 3, Insightful

This is Slashdot. Any time any program, whether it's a game, a word processor, or a weather simulation, runs too slow it's due to bloat.

The average slashdotter thinks that any program could be reduced to the following if it were written by "skilled" programmers:

int main(int argc, char *argv[]) { return 0; }

Basically it's better to do nothing quickly than to actually accomplish something more slowly.

--
My only political goal is to see to it that no political party achieves its goals.

Everybody's missing the point! by epepke · 2002-02-06 05:31 · Score: 4, Interesting

The exciting thing about the GeForce 4 is not that it's faster or cheaper, it's that finally the programmability is at an appropriate level.

Uh-huh. 15%. Yawn. Don' need that. I can play Deus Ex just fine. Well, guess what. Even if you think that games are the entire universe, some day you might just need an MRI and need someone to be able to look at it and find something that will keep you from dying. Medical imaging is one of the things that the GeForce 4 will be good enough to do. Scientific visualization, volumetric rendering, that sort of stuff.

Why is this? About a decade ago, everything was basically SGI. These were big, expensive machines, suitable for vertical markets. It was possible to get the engineers to work with the microcode for the sales of a small number of units.

Then various card companies came along (NVidea has a lot of ex-SGI engineers) and started making cards for the horizontal gaming market. They concentrated, of course, on satisfying the needs of their biggest customers/promoters, which were the gaming people. Many of these cards were customizable, but at a level of abstruseness that made it so that maybe three people in the world could really hack them up the wazoo.

In the mean time, SGI suffered, because even people who should know better make decisions on the basis of "gee whiz." No magazine is going to benchmark a card on how accurately it shows a tumor from real data. A perception rose that the graphics problem had been solved for cheap, when it really hadn't been.

The GeForce 4 finally brings little-card graphics up to the point where mere mortals can actually do customization for vertical markets.

So uhh. Who modded this guys post as "Interesting" by PeelBoy · 2002-02-06 05:44 · Score: 3, Funny

;-)

Re:Geforce4... Wowee... by LinuxParanoid · 2002-02-06 07:26 · Score: 3, Interesting

First of all, nobody uses scanline rendering. Maybe NEC PowerVR if they're still around. 'Scanline' as most graphics guys use the term means you do hidden surface removal with something like Brezenham's algorithm rather than a Z-buffer. But everybody uses Z buffers and, as far as I can tell, a 'sort-middle' approach.

Second, tile-based rendering has been tried many many times, both by high-end graphics companies (HP's PixelFlow effort a few years back) and by low-end companies (PowerVR's scanline approach, Dynamic Pictures did tiles under the covers IIRC, MS Talisman, PixelFusion, Gigapixel, and others I'm no doubt forgetting of the 40+ PC 3D companies that were around 5 years ago...). Basically it's a loser. It doesn't fit well with DirectX and OpenGL APIs, it creates almost as many problems as it solves (e.g. load-balancing among tiles, bandwidth-sucking data overlap/duplication among tiles), and the marginal improvements it might generate in theory in speed are outweighed by the retraining time required for graphics developers worldwide to learn programming techniques oriented around tile-based hardware. I could describe these problems in more detail if you indicate interest in a follow-up posting, but I don't have the time now in the middle of the day.

Pixel and vertex shaders are at least relatively innovative. If they can figure out how to tie together not just 2 or 4, but 8 or 32 together in a simple, yet flexible and comprehensible way (I saw Pat Hanrahan give a proposal on how to do this at Eurographics a couple years ago) that makes it easier for developers to use them, that'd be an innovation in parallelism that really pays off IMHO.

--LP

Disclaimer: Any 3D expertise I have is a bit rusty. Feel free to correct any technical misstatements.

Re:Geforce4... Wowee... by Performer+Guy · 2002-02-06 10:02 · Score: 3, Insightful

Nonsense, who moded this to 5?? This guy doesn't have a clue. This card is the fastest, the policy of whatever works should apply, and will ultimately win in the market, people have tried deferred shading and tiled approaches, and while the NVIDIA system is not a scanline approach, it is not the scheme you probably envision that's WHY it's the fastest. The other approaches failed, and many of the people who worked on them now work for NVIDIA. There are hundreds of engineers at NVIDIA who make these design decisions based on what will work in terms of power requirements, implementation, programmability, speed and a host of other reasons. NVIDIA leads in performance because they get this right. Programmers DO know how to use the programmable shaders, but there are other more traditional ways to use this hardware, and the other pixel pipeline will help even simple multitexture applications too. Even scanline systems can scale very nicely, so the scalability of the tiled approach is just not true, you seem to have forgotten Voodoo SLI, but there are other ways to scale graphics systems too. Your post is a plea to support your pet favourite graphics scheme, but there are detailed technical issues to be considered beyone the glib appeal to emotion. The facts and NVIDIAs performance speaks for itself, and your post is the graphics equivalent of complaining that Ford doesn't make water powered cars.

Re:Geforce4... Wowee... by ToLu+the+Happy+Furby · 2002-02-06 11:19 · Score: 5, Informative

More shaders, More pixel pipelines, More memory bandwidth... whoopee...

When the hell are they going to ditch the antiquated scanline rendering method and go work on some tile based rendering methods?

Probably never, and for very good reason. Tile-based rendering is a very efficient architecture whose time has already come and gone.

For those who don't know, tile-based rendering divides an image up into a number of smaller squares ("tiles") and renders them independently, as opposed to the traditional method ("immediate-mode rendering") of rendering an image one polygon at a time. The major benefits claimed for tile-based renderers are that the process is more parallelizable (no risk of two chips rendering to the same area if they are working on different tiles) and that it is an easy modification to check each polygon's z-buffer (its distance from the camera) as you add it to the poly-list for its tile, and then to only texturize those polygons which are not occluded (i.e. actually visible). This is in contrast to the traditional immediate-mode rendering algorithm, where polygons are textured more or less in random order, leading to situations where a polygon will go through the entire process of being textured and rendered, only to later be completely covered up by a later poly--a situation which wastes a lot of (especially) memory bandwidth, fetching all those useless textures and such.

Cool! Sounds great! Let's hear it for tile-based rendering! Too bad ATI and NVIDIA have clearly never ever heard of this miracle technique! After all, it's not like they would ever make (gasp!) an informed choice not to use it!

Well...not so fast. Basically what we've seen is that tile-based rendering offers two potential benefits: it eliminates *some* of the complexity of enabling multi-GPU implmentations, and it uses quite a bit less memory bandwidth in the base case. The problem is that both of these supposed benefits really buy you very little when designing a consumer-level graphics card today.

First, the problem of "dividing up the work" isn't really what's preventing multi-chip graphics cards these days. Indeed, it's really a rather easy problem. Here's a clue: have alternate chips render alternate frames. Gee...that wasn't so tough, now was it? Well, no. But the other problems of implementing a multi-chip card for the consumer market sure are. For example, we have our choice of implementing an (expensive, performance gating) point-to-point bus to handle memory traffic (and have memory bandwidth/chip cut in half anyways), or of completely mirroring the memory, using twice as much for the same capacity (expensive). Then there's the cost of a second chip (expensive), the cost of packaging the second chip and connecting it to memory (expensive), and the cost of the extra power and cooling, the cost of trying to squeeze it all onto one card (results in a bigger, more expensive card; may gate clockability). And this is without mentioning the extra development and debugging time that goes into getting a multi-chip solution to work correctly. (In general this is one of the most difficult issues design engineers face.) Golly, it's almost enough to make you remember how when 3dfx tried to make a multi-chip product it was 6 months late, the single-chip card was far too slow, the double-chip (and cancelled quad-chip) card too expensive, and, due to the release delay, no longer competitive. (OTOH John C has hinted that a scalable multi-chip architecture might be on the way from one of the major players. Tie that in with the fact that Anand reports the GF4 will be the last to use the GF name, and that NVIDIA owns the remnants of 3dfx, and I start scratching my head...)

Second, the problem of memory bandwidth. Or rather, the former problem of memory bandwidth. Yes, the traditional rendering pipeline is very inefficient with memory bandwidth. Thing is, the prices on high-speed DDR have been coming down so fast that it hardly matters. You can find a Radeon 7500 with 64MB of 128-bit-wide DDR running at 2x230 MHz (i.e. 7.4GB/s bandwidth) for as low as $85 on pricewatch.com. (Actually there's one for $79 but it may be mislabeled.) The memory is probably less than $30 of the cost. Or maybe even less--the 64MB and 32MB GF2Pros (6.4GB/s bandwidth) only differ by $6. And the new GF4 MX460 hits the street with 64MB of 2x275 MHz DDR (8.8GB/s) for $179, list, on a brand new card.

As for the price premium of using relatively high-speed DDR instead of the same amount of SDRAM, it's pretty neglibible. Even for the highest speed DDR it's not such a big deal. Sure NVIDIA charges an extra $100 for another 25MHz on the GPU and an extra 1.6GB/s from the memroy (GF4 Ti4600 vs. Ti4400), but that doesn't mean it costs them anywhere near that much. (depending on GPU yields) It just means they like to bilk those in the $400-for-a-video-card crowd for the full $400. So how much does the stuff cost? Well...Hynix recently announced samples and volume production of 2x375 MHz x32 DDR selling at $10 for 128Mbit chips. That means $40 for 64MB of 128-bit-wide DDR with 12GB/s bandwidth. Not too shabby.

Ok, ok...so maybe the benefits of tile-based rendering don't really mean all that much in today's consumer GPU market. But better is better: why wouldn't ATI and NVIDIA use tile-based architectures for the benfits it does provide. After all, it's not like there might be some (gasp!) downsides to tile-based rendering!

Well, actually, there are. For one thing, it's more difficult to design a tile-based GPU and get it running at high speeds. For another both NVIDIA and ATI have years and years of research and experience with implementation techniques and algorithms for immediate-mode renderers, much of which wouldn't apply to tile-based designs.

For another, neither ATI nor NVIDIA really uses traditional immediate-mode rendering anymore. Instead they use modified immediate-mode rendering, with lots of algorithmic tricks and tweaks to lessen the memory bandwidth inefficiencies of traditional immediate-mode rendering. Things like lossless z-buffer compression and various early polygon-culling algorithms. No they aren't quite as effective in reducing overdraw as tile-based rendering, but they provide quite a significant benefit. Indeed, the GF4 Ti4600 has more or less caught up with the (tile-based) KyroII in Kyro's own villagemark benchmark, which is contrived entirely to test massive overdraw of the sort which is never encountered in a game. The KyroII is only 8 months old. Sure it's much much cheaper than a Ti4600, but if Kyro can barely keep the lead in the one benchmark specially designed to make the case for tile-based rendering then something is wrong here.

Meanwhile there are very serious issues with the ability of tile-based rendering to scale to meet future challenges. In particular, the tile-based rendering algorithm works very naturally so long as there are no polygons which find themselves spread into more than one tile, and so long as you don't use transparent or translucent textures. Of course it's not that tile-based chips can't handle these situations--the KyroII is here and works just fine, after all--but just that they require complicated workarounds which are more inefficient than for immediate-mode rendering, which handles these cases naturally.

The problem is that both cases are going to be more and more likely as graphics continue to improve. As tile-based rendering tries to scale with increasing scene polygon counts and resolutions, you get more tiles per scene and many more polygons crossing tile boundries. And as graphical effects get more realistic, the alpha channel (i.e. transparency) starts coming into play more and more. Indeed much of the recent research in non-real-time computer graphics has focused on adding translucent "subsurface" reflections to the ray-tracing algorithm. This (and approximations of it) is the sort of thing that future pixel shaders are going to be called on to do, and tile-based rendering is a bad match for it.

Indeed, most of the recent advances in graphics are pointing towards a world in which the assumptions which tile-based rendering is based on no longer hold. How, for example, does tile-based rendering handle cubic environment mapping across tile boundries, or cast dynamic shadows across tile boundries? What happens if a dot3 bump map extends a texture from one tile into another? I'm sure clever solutions can be found to these and all the other dozens and dozens of issues that will arise when you try to mix DX8-style effects and tile boundries, but the main point is that tile-based rendering was an algorithm developed under two assumptions which increasingly do not hold:

1) If one polygon occludes another, the other's texture will never be visible to the camera;

2) Objects in one section in the screen can be rendered without reference to any other parts of the screen.

Of course, we may never know the difficulties of trying to make a DX8-compliant tile-based renderer; after all, the KyroII hasn't even made it to DX7, since it is still missing integrated T&L. I have no idea whether this is because of any difficulties integrating T&L with a tile-based rendering pipeline (can't think of why it would be a problem, but it may be), or just because the Kyro doesn't have the money or manpower behind it to keep up with 3 year old technology, but this lack is already preventing the KyroII from competing effectively with the cheaper GF2MX on modern high-poly games. I am pretty sure that integrating a programmable pixel shader into a tile-based architecture would be pretty tough, if not pretty impossible.

Which brings me to the main point: you started out writing "More shaders, More pixel pipelines, More memory bandwidth... whoopee..." and in a sense, this is the right attitude. To which we should very quickly add "tile-based screen division...deferred rendering algorithm...whoopee..." All these technical details only mean something insofar as they give us the capability for more realistic graphics--this means high FPS, high color depth, higher resolutions, lack of aliasing problems, high-quality mip-mapping/anisotropic filtering, realistic--or even dynamic--lighting and shadows, realistic and/or impressive pixel effects, high polygon counts, useful and realistic vertex effects, etc.--for a reasonable price. It is pretty damn hard to argue that the last few years, under NVIDIA's leadership (and ATI's pursuit) have not resulted in huge improvements on these measures. Again, the new GF4 Ti4600 may be ridiculously expensive and may not change your experience with today's games very much (besides enabling 1600x1200x32 with 4xAA at playable framerates), but when the new Doom game comes out, a card with similar specs and selling for ~$100 will bring you decent performance on an engine which offers a totally new level of graphical realism. Same thing when Unreal Warfare, Unreal 2, Deus Ex 2, and all the other Unreal 2-engine games start coming out. Believe me, a GF4 caliber card will improve the experience of playing those and later games significantly over a GF3 and especially a non-DX8 compliant card like a GF2 (and, sadly, a GF4MX). And, believe me, those games are going to provide significantly more realistic graphical experiences than those of today.

Immediate-mode rendering is doing just fine, and the GF4 marks an evolutionary but very significant improvement to the state-of-the-art. A switch to tile-based would require significant retreading to reach the same level, and might form a poorer basis for future improvements. But, if I'm wrong, then ATI and NVIDIA will make the switch. Believe me, they know all about tile-based rendering, and NVIDIA even owns Gigapixel (via 3dfx) and their tile-based rendering engine. I think they'll stick to modifications of immediate-based rendering, but no matter what they do it will be whatever they think offers the best graphics performance at the lowest cost to them.

And now to correct some minor misconceptions in your post:

Hell, the reason why the Geforce line has to keep doubling its fill rates every generation is because its architechture is so god damn ineffecient. Look at the memory bandwidth requirements for the cards!

The reason the GeForce line increases its texel fill rates continually is because consumers want to run new games which have higher multi-texturing requirements (Carmack has said Doom3 will have something like ~8 textures/pixel), and to run existing games in higher resolutions and at higher FPS.

The memory bandwidth "requirements" for the cards don't matter, only the prices. If a recent card with 7.4GB/s only costs $85 (Radeon 7500) and a brand new card with 8.8GB/s lists for $179, then the costs of increasing memory bandwidth are obviously not so terrible. Today's $400 card is next year's $80 card. Similarly, immediate-mode rendering's inefficiencies need to be measured according to their dollar costs, not their bandwidth costs.

Instead of using the relatively limited bandwidth of AGP for streaming textures from main memory (where it should god damn be) to the texture cache, the card is busy wasting bandwidth on the damn Z-buffer (which would be eliminated if they implemented hidden surface removal like the PowerVR chipsets).

???

First off, textures most certainly should not "god damn be" in main memory! The AGP bus is there to stream vertex data from the CPU (pre- or post-transformation, it's the same amount of data). That's all it's there to do, and good thing, too, because today's high-poly games can already generate enough vertex data to make AGP 2x a bottleneck, and those of a couple years will do the same to AGP 4x. (Which is why AGP 8x is on the horizon.) Increasing the bandwidth of a bus from the northbridge across the motherboard through a slot to an add-on card is a whole lot harder than increasing the bandwidth from soldered DDR to a soldered GPU a few centimeters away. AGP should only carry the data which it absolutely is forced to--namely initial vertex data from the game's engine running on the CPU.

Z-buffer lookups only waste bandwidth between the GPU and the on-card memory. Technically, you don't eliminate z-buffer lookups with a tile-based architecture; you eliminate texture lookups (and texture application) on occluded polygons. However, by dealing with a small tile at a time, you can read all the z-buffer data for the tile in from memory all at once, and store it in an on-chip cache until you're done with that tile. (This is essentially why higher poly-count games mean smaller and smaller tiles.)

And last, they do implement hidden surface removal techniques, like I pointed out before, even though they are less effective than with a tile-based architecture.

Slashdot Mirror

NVIDIA Unveils (And Tom's Reviews) The GeForce4

33 of 386 comments (clear)