Slashdot Mirror


Carmack on NV30 vs R300

Nexxpert writes "John Carmack has posted his thoughts on the NV30 vs R300 (featured via www.bluesnews.com. Highlights some of the shortcomings of Nvidia's next step as well as pointing out what they've done right. Interesting read." In particular the arb2 vs nv30 path differences mean that it's not as simple as saying "ATI roX0rs nVidia" or vice versa.(update: sorry bout the misspelling, don't know how I missed that)

47 of 226 comments (clear)

  1. Re:Who the fuck is that guy? by mao+che+minh · · Score: 2, Funny

    His contribution to the world was good enough to force at least 80% of us to consume unimaginable amounts of our college's T3 bandwidth. It almost cost at least one of us (wink wink) our job. To speak the name of this soul sucking horror....no, tis too dangerous, it makes me "quake".

  2. We know that... by LordYUK · · Score: 4, Informative

    We already know that the Radeon 9700 pro and the GeForce FX will run Doom III, and they are both going to look good doing it.

    My concern is what is it going to look like on, lets say, a G4Ti4200 w/64 megs of ram, or the G4Ti4600 w/128 megs of ram. Are those of us not willing to spend 400 bucks on a new vid card (or for those of us stuck with a 4x AGP board, that plus a new mobo) going to have to turn 90% of the features off to run it at a good looking frame rate?

    Thanks goes out to Carmack, its nice that he takes the time to give us a run down of the two cards that are battling for supremecy, especially since I like many of you are thinking about D3 when I evaluate my system and its need for being upgraded.

    --
    This is my sig. Its pathetic.
    1. Re:We know that... by nomadicGeek · · Score: 4, Insightful
      Are those of us not willing to spend 400 bucks on a new vid card (or for those of us stuck with a 4x AGP board, that plus a new mobo) going to have to turn 90% of the features off to run it at a good looking frame rate?

      Probably, but only for about 6 months until that $400 video card turns into a $200 video card and the Mobo becomes the $99 special.

      God I love this business.

    2. Re:We know that... by Camulus · · Score: 4, Informative

      Doom 3 will be playable with a GF4 ti4200 and you won't have to turn every thing off to get a decent frame rate, if by decent you mean at least 60 or above. A friend of mine got a hold of the leaked alpha and on a GF2 GTS it was running around some where at 15 fps with all options turned on at 1280x1024 (I think) on an unoptimized release of the program. So, yeah, of course you will go get more performance if you plop down some money, but you should be pretty good to go I would imagine.

  3. OK, I feel a little bit stupider. by 3.5+stripes · · Score: 5, Funny

    I was doing ok for the first paragraph or so.

    Then my brain went *beep* *beep* *beep*

    And I lost everything.

    About the only thing I came away with is, if you do it the way a specific vendor wants, it kicks the crap outa the other one, otherwise the ATI may be a wee bit faster.

    --


    He tried to kill me with a forklift!
    1. Re:OK, I feel a little bit stupider. by tsetem · · Score: 3, Informative

      About the only thing I came away with is, if you do it the way a specific vendor wants, it kicks the crap outa the other one, otherwise the ATI may be a wee bit faster.

      Close. I think the big thing he was trying to say was that ATI & NVidia in the ARB2 mode, ATI kicks the snot out of NVidia. But it's because ATI renders with less precision than NVidia.

      So when both cards are in ARB2 mode, NVidia looks better, but ATI is faster.

    2. Re:OK, I feel a little bit stupider. by TheCrazyFinn · · Score: 2, Informative

      He never indicated which looked better, all he said was that ATI was faster in ARB2 mode, which may or may not be due to the differences in precision. Every review I've read indicates that ATI has either a slight or a major advantage in visual quality, depending on settings (ATI aquires a much bigger lead in quality as the settings are lowered.) and that ATI aquired a lead in speed as quality settings go up. Notably, the Radeon 9700 Pro also looks as good at 6x AA as the NV30 does at 8x AA.

      --
      "You've got an invalid haircut" -Warren Zevon - Life'll Kill Ya
    3. Re:OK, I feel a little bit stupider. by barawn · · Score: 3, Insightful

      No - this is decidedly not what he said. What he said was that the ATI precision mode that is used doesn't correspond to a precision that NVIDIA uses on the NV30, and the ARB2 precision mode corresponds to what ATI is using, but is "between" two for NV30. So the NV30 has to render it at its highest precision (rather than rendering it at a lower precision and artifacting the hell out of the thing) which slows it down.

      The thing is that the renderer didn't want the higher precision, so the excess precision is mostly wasted. It's like calculating out 1.5000 * 1.5000 and getting 2.2500, and then truncating it back to 2.2. You could've just truncated it to 1.5 and 1.5 initially, and gotten 2.2 as well.

      So, I doubt that NVIDIA actually looks better in the ARB2 mode. It just runs slower, because it's calculating things that don't need insane precision to insane precision.

      Hence the reason that Carmack said that NVIDIA is confident there's a lot of room to improve, if it can realize which of the three precision modes is most ideal and shift to it.

    4. Re:OK, I feel a little bit stupider. by Mac+Degger · · Score: 2, Informative

      Actually, it breaks down more like this: ATI does 96 bit precision, Nvidia can do 32, 64 and 128.

      Your example with the truncation is a gross simplification, which goes into ATI's favour, but in reality there's way more precision than just 2 decimals...more like 6 for ATI and 8 for Nvidia...which makes a reasonably big difference in quality.

      The numbers above are not entirely spot on (except for Nvidia's 128 bit precision, and I can't be arsed to find the exact other numbers), but the principle remains the same: grandparent had it right; ATI for speed, Nvidia for quality (on a ARB2 path...vendor specific paths lead to different results, but are you really going to play the game in anything other than true OpenGL?

      --
      -- Waht? Tehr's a preveiw buottn?
    5. Re:OK, I feel a little bit stupider. by barawn · · Score: 2, Insightful

      I don't think you understood what I said. The thing is that the rendering path only asked for 96 bit precision - however the NV30 had to render it in 128 bit because it doesn't have a 96 bit mode. You're not going to get "free" improved image quality simply by calculating things out to more precision.

      It's like in freshman-level science classes - you don't take the numbers out to more significant figures than you start with, because the added precision is meaningless. Carmack was talking about fragment path programs, for which added precision probably wouldn't pan out to added image quality.

  4. Re:Who the fuck is that guy? by Anonymous Coward · · Score: 2, Funny

    This thread was "Doomed" from the start.

  5. Whatver works best for Doom3... by grub · · Score: 3, Funny


    ...is what I'm buying! Blood, gore, death, mutilation, horror... that's what I want splattered on my monitor.

    --
    Trolling is a art,
    1. Re:Whatver works best for Doom3... by paulcammish · · Score: 2, Funny
      Blood, gore, death, mutilation, horror... that's what I want splattered on my monitor.

      Considered a chainsaw? Or a shotgun?

      You should be able to get hold of them a bit cheaper than one of those videocards, and it even looks better.

  6. R300 path by Anonymous Coward · · Score: 5, Interesting

    Notice that Carmack has no R300 path. Why is that? I can think of two possible explanations instantly.

    One is that ATI has optimized for the standard ARB2 path, and a specific R300 path wouldn't make much difference. In that case, my response would be, kudos ATI for promoting the standard, but speak positively of the performance of NV30.

    The other possibility I can think of is that the lack of an R300 path is punishment for ATI leaking the Doom III alpha version. In that case I wonder how much the Radeon 9700 Pro would gain from an R300 specific path.

    It certainly isn't a lack of time to develop the ATI path; there is an R200 path for older Radeon cards, and the Radeon 9700 has been available to developers for quite a bit longer than the GeforceFX has.

    1. Re:R300 path by jpaana · · Score: 5, Informative

      ATI doesn't have any proprietary extensions for exposing the R300 shader functionality, only "ARB2" (ARB_vertex_program and ARB_fragment_program) so there's no way to do "specific" R300 path.

      The leaked alpha does support ATI's two-sided stencil extension (ATI_separate_stencil) which is only implemented on R300.

  7. Re:Who the fuck is that guy? by CaseyB · · Score: 4, Funny

    I'm "Keen" on continuing it.

  8. thought on noise by Anonymous Coward · · Score: 5, Interesting

    "The current NV30 cards do have some other disadvantages: They take up two
    slots, and when the cooling fan fires up they are VERY LOUD. I'm not usually
    one to care about fan noise, but the NV30 does annoy me.


    Noise is becoming a big problem. I now putting zalman stuff in all my computers. My guide is a zalman heatsink, a zalman powersupply, athlon 1.2, seagate 40 Gig HD, ATi radeon 9000(whitout fan) and a good motherboard without fan on the chipset.

    thank you
    louis

    1. Re:thought on noise by frozenray · · Score: 2, Informative

      > My guide is a zalman heatsink, a zalman powersupply, athlon 1.2, seagate 40 Gig HD, ATi radeon 9000(whitout fan) and a good motherboard without fan on the chipset.

      Good advice, but all this will be futile if the system case doesn't reduce the noise that is generated by the internal components (or even amplifies it because of vibrations). Watch out for expensive aluminum cases which look cool but have no provisions against noise. Acoustically well-designed cases, such as this one from silentmaxx use several types of dampening materials with different noise absorption properties.

      --
      "There are already a million monkeys on a million typewriters, and Usenet is NOTHING like Shakespeare." - Blair Houghton
  9. The answer by wantedman · · Score: 3, Funny

    Carmak is the equivelent of Einstein for the computing age. He is a genius. His work with BSP, Ray Tracing, AI is usually regarded as the "Standard" or best real life way of doing game engine design in real time.

    In other words, he is a god...

    While you are a smuck.

    1. Re:The answer by Omkar · · Score: 2, Funny

      You misspelled the guy's name after ~20 comments about the very same error were posted? It's Carmack, dammit!

    2. Re:The answer by 21mhz · · Score: 4, Informative

      In addition, Carmack is, perhaps, singlehandedly responsible for preserving OpenGL as a viable alternative on PC. Remember when Microsoft was aggressively pushing Direct3D and steam was running out of OpenGL drivers for Windows 95, he said "no shit" and put out the 2nd installment of The 3D Game (and, subsequently, the engine) using OpenGL as the sole rendering backend. The manufacturers couldn't stand the pressure and rushed to update drivers.

      --
      My exception safety is -fno-exceptions.
    3. Re:The answer by BWJones · · Score: 2, Informative

      n addition, Carmack is, perhaps, singlehandedly responsible for preserving OpenGL as a viable alternative on PC.

      Well, I don't know about that. However, I would be willing to give him kudos though for implementing it in his games. We owe SGI a certain debt for making IRIS GL and documenting the api, then making the OpenGL "open". I suppose we could also give kudos to Evans and Sutherland for creating frame buffers, Postscript and "GL" as well.

      Anyway, the whole reason SGI decided to open it up was pressure from the software developers. In order to be viable, they needed to have a wider variety of hardware choices and told SGI they would drop them if they did not open up the standard.

      Currently we also have companies like Apple to thank for aggressively supporting it, integrating and improving OpenGL.

      --
      Visit Jonesblog and say hello.
    4. Re:The answer by CaseyB · · Score: 3, Informative
      Quake was the *only* reason that any PC graphics companies went to the trouble of implementing OpenGL on consumer hardware. If Carmack hadn't decided to boycott it, Direct3D/DirectX would be the only 3D video API used in PC games today. Period.

      Currently we also have companies like Apple to thank for aggressively supporting it, integrating and improving OpenGL.

      That goes back to Carmack again. He went to Apple more than once evangelizing OpenGL, and his games have always been the Macworld showpieces on new hardware in the iMac era.

  10. You don't need bluesnews; just standard finger by Anonymous Coward · · Score: 4, Informative

    $ finger johnc@idsoftware.com
    [idsoftware.com]
    Welcome to id Software's Finger Service V1.5!

    Name: John Carmack
    Email:
    Description: Programmer
    Project:
    Last Updated: 01/29/2003 18:53:43 (Central Standard Time)

    Jan 29, 2003

    NV30 vs R300, current developments, etc

    At the moment, the NV30 is slightly faster on most scenes in Doom than the
    R300, but I can still find some scenes where the R300 pulls a little bit
    ahead. The issue is complicated because of the different ways the cards can
    choose to run the game.

    The R300 can run Doom in three different modes: ARB (minimum extensions, no
    specular highlights, no vertex programs), R200 (full featured, almost always
    single pass interaction rendering), ARB2 (floating point fragment shaders,
    minor quality improvements, always single pass).

    The NV30 can run DOOM in five different modes: ARB, NV10 (full featured, five
    rendering passes, no vertex programs), NV20 (full featured, two or three
    rendering passes), NV30 ( full featured, single pass), and ARB2.

    The R200 path has a slight speed advantage over the ARB2 path on the R300, but
    only by a small margin, so it defaults to using the ARB2 path for the quality
    improvements. The NV30 runs the ARB2 path MUCH slower than the NV30 path.
    Half the speed at the moment. This is unfortunate, because when you do an
    exact, apples-to-apples comparison using exactly the same API, the R300 looks
    twice as fast, but when you use the vendor-specific paths, the NV30 wins.

    The reason for this is that ATI does everything at high precision all the
    time, while Nvidia internally supports three different precisions with
    different performances. To make it even more complicated, the exact
    precision that ATI uses is in between the floating point precisions offered by
    Nvidia, so when Nvidia runs fragment programs, they are at a higher precision
    than ATI's, which is some justification for the slower speed. Nvidia assures
    me that there is a lot of room for improving the fragment program performance
    with improved driver compiler technology.

    The current NV30 cards do have some other disadvantages: They take up two
    slots, and when the cooling fan fires up they are VERY LOUD. I'm not usually
    one to care about fan noise, but the NV30 does annoy me.

    I am using an NV30 in my primary work system now, largely so I can test more
    of the rendering paths on one system, and because I feel Nvidia still has
    somewhat better driver quality (ATI continues to improve, though). For a
    typical consumer, I don't think the decision is at all clear cut at the
    moment.

    For developers doing forward looking work, there is a different tradeoff --
    the NV30 runs fragment programs much slower, but it has a huge maximum
    instruction count. I have bumped into program limits on the R300 already.

    As always, better cards are coming soon.

    --------

    Doom has dropped support for vendor-specific vertex programs
    (NV_vertex_program and EXT_vertex_shader), in favor of using
    ARB_vertex_program for all rendering paths. This has been a pleasant thing to
    do, and both ATI and Nvidia supported the move. The standardization process
    for ARB_vertex_program was pretty drawn out and arduous, but in the end, it is
    a just-plain-better API than either of the vendor specific ones that it
    replaced. I fretted for a while over whether I should leave in support for
    the older APIs for broader driver compatibility, but the final decision was
    that we are going to require a modern driver for the game to run in the
    advanced modes. Older drivers can still fall back to either the ARB or NV10
    paths.

    The newly-ratified ARB_vertex_buffer_object extension will probably let me do
    the same thing for NV_vertex_array_range and ATI_vertex_array_object.

    Reasonable arguments can be made for and against the OpenGL or Direct-X style
    of API evolution. With vendor extensions, you get immediate access to new
    functionality, but then there is often a period of squabbling about exact
    feature support from different vendors before an industry standard settles
    down. With central planning, you can have "phasing problems" between
    hardware and software releases, and there is a real danger of bad decisions
    hampering the entire industry, but enforced commonality does make life easier
    for developers. Trying to keep boneheaded-ideas-that-will-haunt-us-for-years
    out of Direct-X is the primary reason I have been attending the Windows
    Graphics Summit for the past three years, even though I still code for OpenGL.

    The most significant functionality in the new crop of cards is the truly
    flexible fragment programming, as exposed with ARB_fragment_program. Moving
    from the "switches and dials" style of discrete functional graphics
    programming to generally flexible programming with indirection and high
    precision is what is going to enable the next major step in graphics engines.

    It is going to require fairly deep, non-backwards-compatible modifications to
    an engine to take real advantage of the new features, but working with
    ARB_fragment_program is really a lot of fun, so I have added a few little
    tweaks to the current codebase on the ARB2 path:

    High dynamic color ranges are supported internally, rather than with
    post-blending. This gives a few more bits of color precision in the final
    image, but it isn't something that you really notice.

    Per-pixel environment mapping, rather than per-vertex. This fixes a pet-peeve
    of mine, which is large panes of environment mapped glass that aren't
    tessellated enough, giving that awful warping-around-the-triangulation effect
    as you move past them.

    Light and view vectors normalized with math, rather than a cube map. On
    future hardware this will likely be a performance improvement due to the
    decrease in bandwidth, but current hardware has the computation and bandwidth
    balanced such that it is pretty much a wash. What it does (in conjunction
    with floating point math) give you is a perfectly smooth specular highlight,
    instead of the pixelish blob that we get on older generations of cards.

    There are some more things I am playing around with, that will probably remain
    in the engine as novelties, but not supported features:

    Per-pixel reflection vector calculations for specular, instead of an
    interpolated half-angle. The only remaining effect that has any visual
    dependency on the underlying geometry is the shape of the specular highlight.
    Ideally, you want the same final image for a surface regardless of if it is
    two giant triangles, or a mesh of 1024 triangles. This will not be true if
    any calculation done at a vertex involves anything other than linear math
    operations. The specular half-angle calculation involves normalizations, so
    the interpolation across triangles on a surface will be dependent on exactly
    where the vertexes are located. The most visible end result of this is that
    on large, flat, shiny surfaces where you expect a clean highlight circle
    moving across it, you wind up with a highlight that distorts into an L shape
    around the triangulation line.

    The extra instructions to implement this did have a noticeable performance
    hit, and I was a little surprised to see that the highlights not only
    stabilized in shape, but also sharpened up quite a bit, changing the scene
    more than I expected. This probably isn't a good tradeoff today for a gamer,
    but it is nice for any kind of high-fidelity rendering.

    Renormalization of surface normal map samples makes significant quality
    improvements in magnified textures, turning tight, blurred corners into shiny,
    smooth pockets, but it introduces a huge amount of aliasing on minimized
    textures. Blending between the cases is possible with fragment programs, but
    the performance overhead does start piling up, and it may require stashing
    some information in the normal map alpha channel that varies with mip level.
    Doing good filtering of a specularly lit normal map texture is a fairly
    interesting problem, with lots of subtle issues.

    Bump mapped ambient lighting will give much better looking outdoor and
    well-lit scenes. This only became possible with dependent texture reads, and
    it requires new designer and tool-chain support to implement well, so it isn't
    easy to test globally with the current Doom datasets, but isolated demos are
    promising.

    The future is in floating point framebuffers. One of the most noticeable
    thing this will get you without fundamental algorithm changes is the ability
    to use a correct display gamma ramp without destroying the dark color
    precision. Unfortunately, using a floating point framebuffer on the current
    generation of cards is pretty difficult, because no blending operations are
    supported, and the primary thing we need to do is add light contributions
    together in the framebuffer. The workaround is to copy the part of the
    framebuffer you are going to reference to a texture, and have your fragment
    program explicitly add that texture, instead of having the separate blend unit
    do it. This is intrusive enough that I probably won't hack up the current
    codebase, instead playing around on a forked version.

    Floating point framebuffers and complex fragment shaders will also allow much
    better volumetric effects, like volumetric illumination of fogged areas with
    shadows and additive/subtractive eddy currents.

    John Carmack

    1. Re:You don't need bluesnews; just standard finger by Anonymous Coward · · Score: 2, Funny

      "The R300 can run Doom.." "The NV30 can run DOOM"

      Heh, even teb Carmack isn't sure what the capitalization is.

  11. Carmak: Negative by SirNarfsALot · · Score: 2, Funny

    (mostly due to the egregious misspelling)

  12. I like my cards quiet by Wag · · Score: 5, Interesting

    I was holding out on nVidia's new card, but now I've given up on that idea. With more and more people using PCs as multimedia devices (watch DVDs, listen to music, etc), a fan that puts out almost 60Db of noise is unacceptable.

    I really wanted to go away from ATI this time around, but it appears I'll have to wait a little longer. I'm sure nVidia will [eventually] release a fanless, 1-slot version. I just wonder if it will be too little too late.

    1. Re:I like my cards quiet by cca93014 · · Score: 2, Interesting

      The fan only runs at full RPM when the card is doing a lot of 3d work. 2D stuff causes the fan to run a lot slower (not sure if it ever turns off completely tho)...

    2. Re:I like my cards quiet by Elledan · · Score: 5, Informative

      "The fan only runs at full RPM when the card is doing a lot of 3d work. 2D stuff causes the fan to run a lot slower (not sure if it ever turns off completely tho)..."

      From the [H]ard|OCP review:

      "Using a decibel meter we tested the sound level of the GFFX at three feet away, directly in front of the exhaust vent. In 2D mode, the reading was 56dB."

      I don't know about you, but I find 56 dB to be very noisy.

      --
      Site & blog: http://www.mayaposch.com
    3. Re:I like my cards quiet by haloscan · · Score: 2, Informative
      The fan only runs at full RPM when the card is doing a lot of 3d work.

      Not quite. The FX card actually turns the fans full blast the second 3d is detected in software (ie: a DirectX/OpenGL call is made). I thought (hoped) it would be based on the running temperature of the card (hardware RPM control) but unfortunately, it isn't.
  13. MIB 2 reference by BigBir3d · · Score: 4, Funny

    These are both the "new hotness" but with the noise of the nVidia I forsee it becoming "old and busted" quite soon.

    Not to mention ATi's next card...

  14. Cowboy Kneal and everyone at Slachdot... by mbbac · · Score: 5, Funny

    ...really need to concentrate on spelling people's names correctly. When they can't even spell John Carmack's name right, something is seriously wrong.

    --

    mbbac

  15. More Kudos to ATI by scotay · · Score: 5, Insightful

    I've been running ATI cards on my desktop since the mach64 chip days. When I got my 9700 in August, I NEVER thought I'd still have a chip that was competitive with nVidia's best offering in 3D. I never bought ATI cards because they were best in 3D or driver quality - they never were better. ATI did have superior 2D quality (to my eyes) and Video/DVD playback. Given I spend 90% on my time on a desktop, ATI had the right mix of features. Now they finally are competitive with nVidia's 3D.

    After we started to get benchmarks showing matched performance, the remaining questions were left to DX9 and the more complex shaders. From Carmack's comments and the shadermark tests that are showing up, it appears that ATI is anywhere from competitive to superior in the DX9 2.0 shaders, as well. It does look like NV30 can indeed run deeper/higher precision shaders, but we will have to wait to see if games ever do show with shaders deeper than the LCD between NV30 and R300.

    Carmack does mention that nVidia promised that "compiler improvements" will increase the NV30 shader performance. (Better scheduling of parallel pipes?)

    The astounding bottom line is that as of Jan 2003, the 9700 is not shown to be inferior in any way to an as-yet unreleased flagship product from the king of 3d on the mainstream desktop.3 Cheers for ATI.

  16. I worry about NVIDIA by szquirrel · · Score: 5, Informative

    NVIDIA got where they are today by beating 3dfx on their own turf: high-end gaming performance. Remember when 3dfx released the Voodoo 4 & 5? More expensive than the GeForce256 but not decisively better performance. Now I'm hearing similar things about the GeForceFX vs. ATI's three month old Radeons. NVIDIA is getting bigger but they still aren't a huge company. Can they really afford to lose the lucrative high-end sales right now?

    One thing NVIDIA does seem to have going well is their motherboard chipsets. The new nForce2 really kicks ass by all accounts. I remember a while back hearing about an ATI mobo chipset based on tech they acquired from ArtX, but apparently end-user mobo chipsets aren't ATI's plan.

    Good luck, NVIDIA. Hope y'all can keep up the pace.

    --
    Never approach a vast undertaking with a half-vast plan.
    1. Re:I worry about NVIDIA by kurokaze · · Score: 2, Insightful

      high end isn't where the money is at.. its the
      mid to low end that generates most of the revenue.

      You need not worry until Nvidia starts losing
      OEM business to ATI.

  17. NV30 needed its own path to compete by scotay · · Score: 4, Informative

    The NV30 runs the ARB2 path MUCH slower than the NV30 path.
    Half the speed at the moment. This is unfortunate, because when you do an
    exact, apples-to-apples comparison using exactly the same API, the R300 looks
    twice as fast, but when you use the vendor-specific paths, the NV30 wins.


    I'm betting that Carmack assumed NV30 would also use the ARB2 path with NV10/20 R200 for the older cards. When he found ARB2 ran like shit on NV30, he had to do a special NV30 path.

    He's already dumped vendor-specific vertex programs. I bet ARB2 would have been the only next-generation fragment processor if the NV30 could have run it fast enough.

  18. Driver differences by daVinci1980 · · Score: 4, Informative

    Carmack mentioned this, and its important not to gloss over...

    There's a big difference between the drivers theoretical output, and the actual acheived output.

    In testing at my job, we found that the ATI drivers typically performed very poorly in comparison to those released by nVidia on similar hardware. In addition, we often had more serious issues with bugs in ATI drivers than nVidia. Although the next great thing from nVidia isn't likely to outright dethrone the 9700, nVidia is constantly improving their driver technology, constantly making the layer between software and hardware thinner and thinner.

    --
    I currently have no clever signature witicism to add here.
    1. Re:Driver differences by bogie · · Score: 4, Interesting

      He's right, ATI has really only gotten its act together with drivers in the past3-6 months. Up till then it was one buggy driver after another. For most people the nvidia drivers just worked while it seemed that with every ATI driver release you ended up needing patches for every game to work right. That said at this point it does seem like ATI finally got things right with the 9700. Up till now I honestly wouldn't even consider an ATI card, but compared to the initial FX I don't see why you choose it over the 9700. The more I read about it, it seems like Cost, noise, and loss of a pci slot will be keeping me away from the FX. Right now I have a 4200 and by the summer I'll be ready for a new card. If Nvidia doesn't get its FX in line by then there's no doubt I'll jump to ATI.

      --
      If you wanna get rich, you know that payback is a bitch
  19. Slashdot heros. by Cannelbrae · · Score: 3, Funny

    Its wonderful to see slashdot celebrating its heros like John Carmak, Richard Stalman and Linus Torvads. I mean, with names like these contributing, who needs editors?

  20. leaked alpha != anything useful by DrSkwid · · Score: 3, Interesting

    3 rooms and 4 mobs does not a stress test make

    okay I agree with the premise that older cards will do just fine but the Doom3 alpha wasn't pushing the limits like I expect the ifnal release to do.

    A couple of dynamic lights and some bump mapping.

    You can still get q3 to reach the limits of the Ti4600 128Mb. Heck, even the original Unreal still poses a test with eveything turned up!

    --
    There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
    1. Re:leaked alpha != anything useful by Fulcrum+of+Evil · · Score: 3, Interesting

      3 rooms and 4 mobs does not a stress test make

      I thought that doom 3 was supposed to be more of a suspense game than a gorefest - i.e. there won't be hordes of demons after your butt. Instead, maybe you have to sneak around and avoid most creatures.

      --
      "We returned the General to El Salvador, or maybe Guatemala, it's difficult to tell from 10,000 feet"
  21. Spelling by doc_traig · · Score: 4, Funny

    update: sorry bout the misspelling, don't know how I missed that

    psst... it's spelled about...

    --
    So long, michael. Don't let the door hit you...
  22. Re:Shader program limits by Rolphus · · Score: 2, Interesting

    To my knowledge, all you'd have to do in order to use a longer shader is to break it down into separate passes. What the nVIDIA card does well, is extremely long pixel-shaders in once pass (1024 vs. 255 instructions, I think?), and also insanely long vertex shaders in one pass (1024x64 loops vs. 255x1 loop).

    If you do more passes, as I understand it you have to upload all the scene geometry again, which is stressful on bus bandwidth and wasteful of processing resources.

    Of course, it's entirely possible I'm misinterpreting everything, and I apologise in advance if that's the case.

    --
    ~Rolphus
  23. I call B.S... by XaXXon · · Score: 2, Insightful

    on getting 15 FPS at 1280x1024 with all options turned on with a gf2.

    *MAYBE* in an empty room with very few shadows and no combat..

    On my 9700 Pro running on a 2.4 GHz Xeon, at that resolution, I normally got ~40-45 fps, but when anything happened, it dipped under 10. Sometimes in the 1-2 fps range -- when I was shooting, and being attacked.

    No way in hell any gf2 is going to push 15 fps when anything is going on. It'd be a freaking slideshow slower than the powerpoint presentation at the meeting I just got out of.

  24. Re:Um - can anyone explain this? by cthulhubob · · Score: 5, Informative

    OK, I did some 3D imaging math about 10 years ago (when you had to code your own drivers to get SuperVGA mode under DOS), so I think I get what he's talking about: the problem of how to show the reflection of one object (or light source) off another object. I've never heard of "interpolated half-angle" or "specular highlights", or the "triangulation line". Anyone know what he is talking about?

    You didn't get much beyond Gouraud shading, did you? :)

    Of course, depending on your hardware ten years ago, specularity might not have been feaasible if you were doing something big and real-time. Certainly not with the standard PC of that era.

    • Specular highlights: the white (or colored, if a non-white light) spot you see reflected on a non-matte surface from an incoming light. If you've got a standard office-black telephone at your desk like I do right now, looking at the corner edge of the handset you should see a bit of yellowish or bluish white which is a reflection from the overhead lights. If you move your head (change the viewing angle) you can see it shift over the surface of the phone. This is a specular highlight. The strength of the highlight depends on the "Shininess" of the surface - the less shiny, the more diffuse, until you reach sheet-of-paper-matte, which has little to no visible highlight - only shading.
    • Triangulation line: Unfortunately, the state of 3d graphics not being ideal when compared to the telephone handset on your desk, the surface of that phone will be composed of triangles rather than molded plastic. Under the traditional Gouraud shading plus specular highlight model, to conserve computing resources the angle of the incoming light is only calculated at each vertex of each triangle on the surface of each object, and then the angle between it and the reflection toward the viewer is interpolated linearly along the edges of the polygon. Thus, when triangles get too large, instead of a nice highlight resembling the shape of the light source (usually spherical in computer graphics, regardless of the actual object the light is supposedly emanating from), you see a highlight along the lines making up the triangle, that quickly fades toward the outside edges. Very big ugly obvious rendering mistake.
    • Interpolated half-angle: you can probably figure out what this is based on my explanation of "triangulation line", but just in case -- this is the interpolated angle (actually interpolated cosine of two angles, which is why it's referred to here as a half-angle) used to figure out the strength of the highlight at any given fragment (new word for pixel) of the triangle being rendered.

    Hope that helps!

    --

    In post-9/11 America, the CIA interrogates YOU!
  25. Re:Once again... by John+Carmack · · Score: 5, Informative

    >But he mentioned something about next gen cards having less bandwidth. Does that make sense to anyone?

    The RATIO of bandwidth to calculation speed is going to decrease. It is nothing short of miraculous that ram bandwidth has made the progress is has, but adding gates is cheaper than adding more pins or increasing the clock on external lines.

    Bandwidth will continue to increase, but calculation will likely get faster at an even better pace. If all calculations were still done in 8 bit, we would clearly be there with this generation, but bumping to 24/32 bit calculations while keeping the textures and framebuffer at 8 bit put the pressure on the calculations.

    John Carmack

  26. GeforceFX non Ultra by Ratchet · · Score: 2, Informative

    There'a also a slower version of the GeforceFX in the works, sans leafblower. It's reported to run at 400MHz core/800MHz memory (as opposed to 500/1000 for the "Ultra" leafblower version). It will likely get trounced by the 9700 and 9700 Pro in most performance and IQ areas, but it will be an alternate solution for you guys that want one of those nV30 based cards but don't want to risk having your cat sucked into the back of your computer.