The Impact of Memory Latency Explored

cache by Quixote · 2005-11-02 04:02 · Score: 1, Informative

Cached link, in case original gets Slashdotted

Re:cache by Unnamed+Chickenheart · 2005-11-02 06:29 · Score: 1, Funny

Is that a L1 or L2 cache? o.O

--
urd
Re:cache by Anonymous Coward · 2005-11-02 07:44 · Score: 0

What's the latency on that cache?

FP! by Fordiman · 2005-11-02 04:02 · Score: 0

The article seems to finish off with, "Don't bother with these unless you want to overclock"

Which makes sense... or not. I'd just buy 533 ram and be done with it.

The real question is: can I buy 533MHz ram and run it slower with lower latencies?

--
110100 1101000 1101000 1100110 0 1101111 1101000 1100011 1

Re:FP! by Anonymous Coward · 2005-11-02 04:16 · Score: 0

I find myself questioning some of the benchmarks. It seems to me that you get an overall system performance boost of anything from 3% to 13% by simply upgrading the RAM. For a $50 price raise this is pretty reasonable.

On a number of the tests there was no performance benefit from the faster RAM but these were typically operations that were bound by the capabilities of another part of the system, Winzip - Harddisk, Games@high resolutions - Graphics card or bus bandwidth. So in these cases the system was waiting around for the other components rather than the memory. It would be interesting to do the same test with a far far lower clocked memory speed wher the rest of the system was still high specked so we could reduce the chance of the bottleneck moving elsewhere. If the graphics card had been a 7800GT and the hard disks had been uber raided I suspect more of the benchmarks would have shown benefits.

However, my take on the whole thing is that for $55 you can increase the performance of your entire system by 3% to 13% by reducing the memory bottleneck. As a proportion of the system cost I actually think that is a good deal.

-Anon
Re:FP! by rrhal · 2005-11-02 04:21 · Score: 1

Low latency DDR is basically the same as high speed RAM. Its all about how much time it takes for the module to do its thing. IIRC its a little more difficult to hit 2-2-2 than it is to run at high speed so if you want really fast RAM low latency may be your best bet.

--
All generalizations are false, including this one. Mark Twain
Re:FP! by stinerman · 2005-11-02 04:25 · Score: 4, Informative

The real question is: can I buy 533MHz ram and run it slower with lower latencies?

Yes. I regularly by high speed RAM and downclock it, but run it at lower latency. For instance if I wanted to run my RAM at 400MHz, I'd buy 433/466/500MHz VAL-U-RAM and run it as a stick of semi-premium 400MHz.

Link crashed Firefox by ziggamon2.0 · 2005-11-02 04:03 · Score: 1

The link seems to crash my Firefox...

A bug to be reported, or what is happening?

Re:Link crashed Firefox by Maljin+Jolt · 2005-11-02 04:10 · Score: 5, Interesting

Beware, one of the banner advertiser on that page (netshelter.net) is trying to buffer overflow with strangely crafted cookie. Hope you do not run your Firefox on Windows...

--
There you are, staring at me again.
Re:Link crashed Firefox by Anonymous+Custard · 2005-11-02 04:16 · Score: 1

Or if you do, can you just block that cookie?

--
$8.95/mo web hosting
Re:Link crashed Firefox by EFGearman · 2005-11-02 04:24 · Score: 1

Hmmmm... that may be why my system freezes up on certain websites...

--
Atomic batteries to power! Turbines to speed!
Re:Link crashed Firefox by failure-man · 2005-11-02 04:29 · Score: 1

So what is this overflow then, and can it do anything but cause Firefox to segfault on *nix? (You mention Windows. Working exploit there?)
Re:Link crashed Firefox by isometrick · 2005-11-02 04:35 · Score: 2, Informative

How do you figure? Here's the response:
greg@yak ~ $ wget -U "Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7.10) Gecko/20050922 Firefox/1.0.6" --save-headers -O- "http://servedby.netshelter.net/serve.cgi?pid=TECH -REPORT&g=1&m=9&j=1&k=1&id=1016381969816&d=iframe" 2>/dev/null

HTTP/1.1 200 OK Date: Wed, 02 Nov 2005 16:05:08 GMT Server: Apache/2.0.46 (Red Hat) Set-Cookie: ls=1; path=/; domain=.servedby.netshelter.net Set-Cookie: FCDEFAULT_TECH-REPORT.TECH-REPORT=1|051102160508+1 |cf4bbb50bcb0d4ac; path=/; domain=.servedby.netshelter.net; expires=Wed, 09 Nov 2005 16:05:08 GMT P3P: CP='NOI NID PSAa PSDa OUR IND COM NAV', policyref="/w3c/p3p.xml" Expires: Fri, 20 Mar 1998 01:00:00 GMT Cache-Control: no-cache Pragma: no-cache X-Adtrix-Debug: -ac=7 VONCAN728Q405HEITNOV=GT ITTQ405TECHENTHUS728=DD VON728Q405HEISITNOV=DD CRUUKNOV05GEEK728=GT CRUUKNOV05VIPER728=GT VERZTECHENTH728NOV05=DD -ec=1 DEFAULT_TECH-REPORT=#1 Connection: close Content-Type: text/html; charset=UTF-8 <script language='JavaScript' type='text/javascript' src='http://techreport.com/phpads/adx.js'></script > <script language='JavaScript' type='text/javascript'>  </script><noscript><a href='http://techreport.com/phpads/adclick.php?n=a 1feedc2' target='_blank'><img src='http://techreport.com/phpads/adview.php?what= zone:24&n=a1feedc2' border='0' alt=''></a></noscript>
Re:Link crashed Firefox by Anonymous Coward · 2005-11-02 04:37 · Score: 0

Works fine with konqueror
Re:Link crashed Firefox by Daveznet · 2005-11-02 04:37 · Score: 1

Here is the contents of the cookie CA%7CNA%7C%7C%7C%7C%7C%7C%7C%7C%7C%7C.

--
GL HF!
Re:Link crashed Firefox by NanoGator · 2005-11-02 04:43 · Score: 4, Funny

"Beware, one of the banner advertiser on that page (netshelter.net) is trying to buffer overflow with strangely crafted cookie. Hope you do not run your Firefox on Windows..."

Just another reason to switch to IE!

--
"Derp de derp."
Re:Link crashed Firefox by ChrisMaple · 2005-11-02 05:19 · Score: 1

I turn off javascript when this happens. This is a nuisance, since a lot of good things require javascript.

--
Contribute to civilization: ari.aynrand.org/donate
Re:Link crashed Firefox by Anonymous Coward · 2005-11-02 05:27 · Score: 0

Or turn on your DEP (Data Execution Protection), like I have on my Athlon 64.
Re:Link crashed Firefox by Hillgiant · 2005-11-02 05:49 · Score: 1

Dynamically turn on and off javascript with prefbar.

--
-
Re:Link crashed Firefox by Intron · 2005-11-02 06:00 · Score: 1

%7C = vertical bar, so that's one weird cookie:

"CA|NA||||||||||"

--
Intron: the portion of DNA which expresses nothing useful.
Re:Link crashed Firefox by Janek+Kozicki · 2005-11-02 07:12 · Score: 1

well I run sarge :) but if we are speaking about ads, I'm using custom hosts file that is pointing to my local install of apache2. So every request for an ad shows 404 ErrorDocument from apache2. I don't like to see "Not found" all the time, so I placed in 404 a reference to this. It says that only 142 days are left for our second child to be born. :) Hosts file from my wife's computer is also pointing to it, and she was delighted when she saw it.

--
# #\ @ ? Colonize Mars #
Re:Link crashed Firefox by Anonymous Coward · 2005-11-02 08:22 · Score: 0

Beware, one of the banner advertiser on that page (netshelter.net) is trying to buffer overflow with strangely crafted cookie. Hope you do not run your Firefox on Windows...

I do just that, and had no issues. Then again, I am also using Mike Skallas' hosts file, so the banner got blocked.
Re:Link crashed Firefox by Randolpho · 2005-11-02 08:40 · Score: 2, Informative

It's not really that wierd. Vertical bars are a popular list separation item, and I've used them in cookies for web applications I've designed many times. In your example, you have a twelve-item list, with the first two items equal to "CA" and "NA" respectively, and the remaining equal to "".

What they're doing with the list is anybody's guess. :)

--
"Times have not become more violent. They have just become more televised."
-Marilyn Manson
Re:Link crashed Firefox by giant_toaster · 2005-11-02 09:28 · Score: 1

Didn't happen on Fx 1.5... did anyone else notice it was released yesterday?
Re:Link crashed Firefox by Lord+Kestrel · 2005-11-03 10:38 · Score: 1

Take a look at NoScript. It allows you to set up acls, allowing for instance javascript for gmail, but disabled for everything else. It'll also block java, flash, other plugins, etc.

The only problem I've seen with it is a tendency to wipe my acls upon upgrading to a new version, but I imagine that'll be fixed.

apply this to picking a wife by 0110011001110101 · 2005-11-02 04:04 · Score: 5, Funny

FTFA - Lowering memory latencies is a good thing, of course, but low-latency modules typically cost twice as much as standard DIMMs.

I'd have to say this is right on when applied picking a woman to spend your life with... low-latency memory is a BAD BAD thing, and VERY expensive. My next time around, I'm going with the "CHEAPER", high-latency model that can't immediately recall everything I've ever said while arguing her point... Roses and jewelry can cost you over the long run friends...

--
Don't anthropomorphize computers: they hate that.

Re:apply this to picking a wife by JakusMinimus · 2005-11-02 04:55 · Score: 1

I'm going with the "CHEAPER", high-latency model that can't immediately recall everything I've ever said while arguing her point...

Fret thee not, for such a beast exists solely in myth!

--

You can be an atheist and still not want to succumb to some weird cross-over sheep disease -- AC
Re:apply this to picking a wife by Hymer · 2005-11-02 06:08 · Score: 1

a married geek ??
What have I done wrong ? there are pizza boxes on the floor and AlphaServers in the rack but no wife in my house...
Re:apply this to picking a wife by cloudmaster · 2005-11-02 07:40 · Score: 1

there are pizza boxes on the floor and AlphaServers in the rack but no wife in my house.

Matter and anti-matter, my friend. They just don't mix.
Re:apply this to picking a wife by Bulmakau · 2005-11-02 08:39 · Score: 1

:)
Well, the comparison to women is interesting..
However, when you have a server that is not preforming well enough, you can get another one :) twice the "horse power".. Now... what would that translate to in the computers-women analogy?

--
"From the moment I could talk, I was ordered to listen" - Cat Stevens
Re:apply this to picking a wife by mrselfdestrukt · 2005-11-02 19:36 · Score: 0

Mind over matter dude!
If you don't mind, it doesn't matter!
And having a wife / gf /so is nice for the sex, mental stimulation, companionship etc.. but what they don't tell you about is the constant complaining, moaning, bitching , putting on weight, then sleeping around when you're not interested anymore, trying to control you etc. I'll take your pizzas and alpha's anyday.

--
"I used to have that really cool,funny sig ,but it got stolen."
Re:apply this to picking a wife by Nykon · 2005-11-04 06:29 · Score: 1

Probably much like hitting your mid life crisis and dumping your wife (trading in a 40 for 2 20's) ;) Though I imagine that isn't really quite the same, just moving from 1 server to a "distributed model" instead :)

--
"It's better to be a pirate then join the Navy"

Just stick a few blue LEDs on it... by phpm0nkey · 2005-11-02 04:09 · Score: 5, Funny

I have no doubt that hardcore PC gamers will shell out the cash for these, regardless of the cost/performance ratio. Once you start paying $500+ for a graphics card, all rational decision making skills are lost.

--
domain combinatorics

Re:Just stick a few blue LEDs on it... by the_loon · 2005-11-02 04:18 · Score: 0, Redundant

phpm0nkey said -- "I have no doubt that hardcore PC gamers will shell out the cash for these, regardless of the cost/performance ratio. Once you start paying $500+ for a graphics card, all rational decision making skills are lost." This is probably the funniest thing I've read all morning...okay, I'll give you all week, its only Wednesday. The sad part is particularly true...heh.
Re:Just stick a few blue LEDs on it... by mindaktiviti · 2005-11-02 04:20 · Score: 1

One of my friend's friend had this obsession with having the bleedy edge gaming rig and would always upgrade his video card. I think his upgrade cycle compared to mine was something along the lines of 3:1. I'd have a TNT2 Ultra (hand me down), then grabbed a GeForce3Ti200 (second hand from cousin for $100CDN), and just recently I bought a GeForce6600GT when the 7000-series just came out. I'm sure this guy has one of the top of the line ATi cards or something in the $400USD+ range.
Re:Just stick a few blue LEDs on it... by Anonymous Coward · 2005-11-02 04:45 · Score: 0

dude you do realize that Coolermaster sells ram with lighted heatsinks already applied.

they are super special "MATCHED" low latency ram.

I alwasy considered people that buy that marketingspeak a bunch of fools.
Re:Just stick a few blue LEDs on it... by timeOday · 2005-11-02 04:50 · Score: 4, Insightful

I wouldn't buy a $500 card either but, sheesh, at least they're faster than the cheap ones. This low-latency memory is twice the price for a ~3% boost... I think not.
Re:Just stick a few blue LEDs on it... by Iriel · 2005-11-02 04:52 · Score: 3, Insightful

This isn't all that funny. I mean, it does make me laugh, but it's far more true than humorous. I constantly get berated by the 'hardcore' gamers for not having the fastest CPU/RAM/GPU/HD when I can still run a lot of games just as well as anyone else. The problem with hardcore gaming equipment is that it has become something like MTV selling you 'cool'.

Guess what? That wicked dual-core CPU actually runs games slower than its single core cousin. That brand-spankin' new video card that cost you $400(or more)? I pay that much once every several years on my video card. The difference is that I don't care if I squeeze out my maximum frames per second because most people can't even detect the difference if the game didn't have an option to show the number in the corner of the screen like some veritable rating of thier manhood (sorry for my gender bias on that). And that super ultra OHMYFUCKINGGODITMAKESMYEXPLODEITSSOFAST low-latency RAM is giving you a performance boost of 2% of what I've got now.

I find it educational to read these reports so I can make educated purchasing choices. For that, I'm quite grateful. However, I find it kind of sad that the parent post is unsettlingly accurate in that the 'hardcore pc gamers' will shove this to the side for the ATI SXL 10G Super Elite XTRME Pro card next week. Witness what happens when PC gaming meets MTV-esque marketing.

--
Perfecting Discordia
www.stevenvansickle.com
Re:Just stick a few blue LEDs on it... by DrSkwid · 2005-11-02 05:04 · Score: 1

> when I can still run a lot of games just as well as anyone else.

is that right ? What FPS are you getting in Quake 4 at 1280x1024x32 with tri-linear and models/textures set to HIGH ?

--
There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
Re:Just stick a few blue LEDs on it... by uacheesehead · 2005-11-02 05:06 · Score: 1

Haha, indeed..
Re:Just stick a few blue LEDs on it... by Iriel · 2005-11-02 05:12 · Score: 3, Insightful

You seemed to have missed the point that 'a lot of games' does not mean 'all games', 'any games', or any derivative thereof. And honestly, the point of my post is that I'm willing to sacrifice some detail and put my settings at 75-80% instead of maxed-out if it'll save me from spending close to a thousand dollars a year in upgrades.

--
Perfecting Discordia
www.stevenvansickle.com
Re:Just stick a few blue LEDs on it... by Anonymous Coward · 2005-11-02 05:13 · Score: 0

85, 95 when I stick my peepee into the Zip drive slut. I mean "slot."
Re:Just stick a few blue LEDs on it... by Anonymous Coward · 2005-11-02 05:16 · Score: 0

Hello, McFly. Your super-duper graphics are nothing more than a high-res skin on a game that's going to play exactly the same regardless of what machine it's on (provided the machine meets reasonable specs, of course). I mean, it's not like the go-fast stripes and Kanji stickers on your '89 Civic suddenly turn it into a Ferrari.
Re:Just stick a few blue LEDs on it... by (A)*(B)!0_- · 2005-11-02 05:23 · Score: 1

"I constantly get berated by the 'hardcore' gamers for not having the fastest CPU/RAM/GPU/HD when I can still run a lot of games just as well as anyone else."
Why are you associating with such people?
"However, I find it kind of sad that the parent post is unsettlingly accurate in that the 'hardcore pc gamers' will shove this to the side for the ATI SXL 10G Super Elite XTRME Pro card next week. Witness what happens when PC gaming meets MTV-esque marketing."
Sad? Who cares? Let people spend their money the way they want to. If they insult you because you don't spend your money on high-end video cards, don't associate with them. But just as you don't like being insulted for not spending your money that way, don't call people "sad" who do. Just let people live.
And I have never spent more than $200 on a video card.
Re:Just stick a few blue LEDs on it... by Anonymous Coward · 2005-11-02 05:29 · Score: 0

Thank you for that entirely useless view into your pathetic Canuck life. Have you considered killing yourself to spare the rest of the world? You'd go down in history as a hero.

HAHAHAHA, I'm just kidding, no one will ever remember you or care about you being gone. You stupid fucktard.
Re:Just stick a few blue LEDs on it... by vmcto · 2005-11-02 05:43 · Score: 3, Interesting

Hey don't knock gamers that spend tons of money on computer gear.

It's thanks to them that the rest of us can get normal gear at such reasonable prices...
Re:Just stick a few blue LEDs on it... by DrSkwid · 2005-11-02 05:44 · Score: 1

and you think < $10 a week is too much to pay for that ?

FPS makes a difference to playability, anyone that says otherwise has dull senses.

--
There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
Re:Just stick a few blue LEDs on it... by Anonymous Coward · 2005-11-02 06:00 · Score: 0

Hey Cheer up Emo Kid!

I don't know what reviews you been reading but every review I have see thus far shows an AMD 64 X2 4800+ (2.4 GHz Toledo) beats an AMD 64 4000+ (2.4 GHz Clawhammer) in just about every benchmark I have seen?
Re:Just stick a few blue LEDs on it... by Wonko · 2005-11-02 06:14 · Score: 2, Informative

Guess what? That wicked dual-core CPU actually runs games slower than its single core cousin.

Is this actually a true statement? I can't do any current testing since I don't have a reasonable 3D card in my machine, but I remember testing Quake 3 on my old dual Celeron machine with a TNT2 card. top showed Quake was using 95% or more of one CPU, and the X server was using 30% or more of the other CPU.

I don't expect the numbers to be the same today, but shouldn't there be at least some slight increase in speed if the GUI is running in a separate process? I am not saying that the increase would justify the price, but I haven't run a single CPU desktop in something like six years. I am not about to start now :).
Re:Just stick a few blue LEDs on it... by HavokDevNull · 2005-11-02 06:22 · Score: 4, Interesting

Have to agree with AC on the cpu issue, taken from the http://techreport.com/reviews/2005q2/athlon64-x2/i ndex.x?pg=16

Conclusions
Let's start by talking about the Athlon 64 X2 4200+. This CPU generally offers better performance than its direct competitor from Intel, the Pentium D 840. Most notably, the X2 4200+ doesn't share the Pentium D's relatively weak performance in single-threaded tasks like our 3D gaming benchmarks. The Athlon 64 X2 4200+ also consumes less power, at the system level, than the Pentium D 840--just a little bit at idle (even without Cool'n'Quiet) but over 100W under load. That's a very potent combo, all told.

In fact, the X2 4200+ frequently outperforms the Pentium Extreme Edition 840, which costs nearly twice as much. Thanks to its dual-core config, the X2 4200+ also embarrasses some expensive single-core processors, like the Athlon 64 FX-55 and the Pentium 4 Extreme Edition 3.73GHz. Personally, I don't think there's any reason to pay any more for a CPU than the $531 that AMD will be asking for the Athlon 64 X2 4200+.

If you must pay more for some reason, the Athlon 64 X2 4800+ will give you the best all-around performance we've ever seen from a "single" CPU. The X2 4800+ beats out the Pentium Extreme Edition 840 virtually across the board, even in tests that use four threads to take best advantage of the Extreme Edition 840's Hyper-Threading capabilities. The difference becomes even more pronounced in single-threaded applications, including games, where the Pentium XE 840 is near the bottom of the pack and the X2 4800+ is constantly near the top. The X2 4800+ also consumes considerably less power, both at idle and under load.

The X2 4800+ gives up 200MHz to its fastest single-core competitor, the Athlon 64 FX-55, but gains most of the performance back in single-threaded apps thanks to AMD's latest round of core enhancements, included in the X2 chips. The X2 4800+ also matches the Opteron 152 in many cases thanks to Socket 939's faster memory subsystem. Remarkably, our test system consumes the same amount of power under load with an X2 4800+ in its socket as it does with an Athlon 64 FX-55, even though the X2 is running two rendering threads and doing nearly twice the work. Amazing.

There's not much to complain about here, but that won't stop me from trying. I would like to see AMD extend the X2 line down two more notches by offering a couple of Athlon 64 X2 variants at 2GHz clock speeds and lower prices. I realize that by asking for this, I may sound like a bit of a freeloader or something, but hey--Intel's doing it. No, the performance picture for Intel's dual-core chips isn't quite so rosy, but the lower-end Pentium D models will make the sometimes-substantial benefits of dual-core CPU technology more widely accessible. If AMD doesn't follow suit, lots of folks will be forced to choose between one fast AMD core or two relatively slower Intel cores. I'm not so sure I won't end up recommending the latter more often than the former.

Beyond that, the giant question looming over the Athlon 64 X2 is about availability, as in, "When can I get one?" Let's hope the answer is sooner rather than later, because these things are sweet.

--
Sig
Re:Just stick a few blue LEDs on it... by StarvingSE · 2005-11-02 07:52 · Score: 1

Thats why I am always looking for best "bang for my buck." The only thing that can usually tell the difference between my $200 vid card and the $500 is FRAPS.

--
I got nothin'
Re:Just stick a few blue LEDs on it... by CRiMSON · 2005-11-02 08:11 · Score: 1

For me it's about having the game look as beautiful as it can.

a $150 card won't do that.

So once every 3 years, I go out and buy a $600 video card.

And it's worth it. But then again $600 isn't that much money.

--
oogly boogly!
Re:Just stick a few blue LEDs on it... by Iriel · 2005-11-02 08:31 · Score: 1

I understand fully. Which is why I do get a 300-400 dollar card every several years. But I've seen people sink money into 2-3 new cards a year or more just for SLI. Seriously, beyond a certain point, the human eye can't detect the framerates that some 'hardcore' fans tell me looks so much better. They think they see a difference because a higher number must mean better quality, and so their mind rationalizes with them. So in the end, they wouldn't notice the difference if you took their FRPS down by 10, but they will convince themselves that they can see the difference between 70 and 80. It's like someone telling you that they hear the difference between two sounds because they assume that you're trying to test them when they really just heard the same thing twice.

And depending on how much your monthly bills are, $600 can be a lot of money.

--
Perfecting Discordia
www.stevenvansickle.com
Re:Just stick a few blue LEDs on it... by Anonymous Coward · 2005-11-02 09:45 · Score: 0

This low-latency memory is twice the price for a ~3% boost... I think not.
Been nice if they had compared low latency RAM with the larger amount of high latency RAM you could buy for the same $$$, 1GB low latency to 2GB high latency. Of course, I'm a multi-tasking freak, so its a no brainer for me, but I'm curious...
Re:Just stick a few blue LEDs on it... by Glonoinha · 2005-11-02 13:35 · Score: 1

It's a bogus question - no real gamer that isn't a complete waste of oxygen and silicon plays at 1280x1024.
Now if he had said 1600x1200, then maybe we could have taken the question seriously - but I assure you guys that he is just yanking your chains (and you fell for it!)

That said, I second the guy that requested a roundup where they put 512M of the uber1337 memory with chrome heat-spreaders and blinkenlichten in one machine, and 2G of Crucial/Kingston budget line memory in the other - and do a full suite of testing to tell us which runs faster overall.

--
Glonoinha the MebiByte Slayer
Re:Just stick a few blue LEDs on it... by DrSkwid · 2005-11-02 21:52 · Score: 1

I said 1280x1024 to give him a chance =)

--
There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter

ZOMG by Anonymous Coward · 2005-11-02 04:09 · Score: 0

Teh LATENCIES MAKE ALL THE DIFFernence I run with my uber memory timings on my gentoo system => http://funroll-loops.org/

Anandtech did this months ago by Anonymous Coward · 2005-11-02 04:10 · Score: 4, Informative

http://anandtech.com/memory/showdoc.aspx?i=2392

You'll basically find that the performance of value memory is very on par with the high end stuff. You basically pay for the ability to overclock on a more consistent basis.

Re:Anandtech did this months ago by bersl2 · 2005-11-02 06:29 · Score: 2, Informative

Also, see this thread from the Anandtech forums.
Re:Anandtech did this months ago by Anonymous Coward · 2005-11-02 06:47 · Score: 0

Some people buy high-end RAM, not because they want to overclock it, but because it's more stable at any given frequency. People that run servers.
Re:Anandtech did this months ago by dave_f1m · 2005-11-02 12:16 · Score: 1

People that run servers that care about stability run Registered/ECC memory. They also usually run mainboards that don't give them a choice about it.

Can't Read the Article by Ed+Almos · 2005-11-02 04:11 · Score: 1, Interesting

I'm running Firefox 1.0.7 under Ubuntu. When I click on the link firefox exits, am I the only one having this problem?

Ed Almos

--
The more corrupt the state, the more numerous the laws. - Tacitus, 56-120 A.D.

Re:Can't Read the Article by 0110011001110101 · 2005-11-02 04:16 · Score: 1

I'd have to say no...
http://hardware.slashdot.org/comments.pl?sid=16710 1&cid=13932623
considering just a handful of posts above you, another user complained of the same thing, and yet another user provided a useful reason why:
Beware, one of the banner advertiser on that page (netshelter.net) is trying to buffer overflow with strangely crafted cookie. Hope you do not run your Firefox on Windows...

--
Don't anthropomorphize computers: they hate that.
Re:Can't Read the Article by Anonymous Coward · 2005-11-02 04:18 · Score: 0

The link crashes Mozilla 1.7 also.
Re:Can't Read the Article by Progressive4Peace · 2005-11-02 04:58 · Score: 1

I'm running Firefox 1.07 on Ubuntu as well - having no problems. I also have adblock running, however, and I heard that site contains a malicious cookie on it so I might not be seeing it because of that.
Re:Can't Read the Article by DrWhizBang · 2005-11-02 05:00 · Score: 4, Funny

Sounds like a memory timing issue - you should upgrade to some OCZ low-latency RAM!

(I'm sorry, that's not helpful at all, is it?)

--
Schrodinger's cat is either dead or really pissed off...
Re:Can't Read the Article by 1336 · 2005-11-02 05:21 · Score: 1

Same here; details for comparison:

OS: Ubuntu 5.10
Browser: Firefox 1.0.7-0ubuntu20
Extension: Adblock 0.5.2.039
Filter: Filterset.G 2005-10-31a
Re:Can't Read the Article by trparky · 2005-11-02 05:39 · Score: 1

It don't crash on my Windows XP machine. Then again, I have damn good blacklist loaded into my FireFox AdBlock extension. It ends up blocking close to 98% of all Internet ads. Also, having DEP (Data Execution Protection) enabled helps too.
Re:Can't Read the Article by Cannedbread · 2005-11-02 09:52 · Score: 0

im on the 5.10RC with firefox 1.0.7 and all is well.

Insightful article by mindaktiviti · 2005-11-02 04:15 · Score: 2, Informative

Although I didn't read all the text (about 50% of it), the benchmarks were what I was interested in, as well as the conclusion. So to sum it up:

2-2-2-5 timings at 400MHz t1 memory is the fastest but costs twice as much and the performance gains are almost non-existant except in lower resolution games (i.e. 800x600 you may see an increase in 20 fps, which I think is a lot!), and of course the cost of the ram in this case would not be justified because putting that extra money into a better video card would be the better thing to do.

Only if you're an overclocker is this worth it, at least from their benchmarking and perspective, which I'll accept.

Oh yes, and that website also crashed my Firefox.

Re:Insightful article by Anonymous Coward · 2005-11-02 04:24 · Score: 0

800x600? Won't you already be getting >100 FPS in most games anyway?
Re:Insightful article by mindaktiviti · 2005-11-02 04:32 · Score: 2, Insightful

800x600? Won't you already be getting >100 FPS in most games anyway?
Perhaps, that particular benchmark was for Far Cry at 800x600 w/ medium settings, and the lowest fps was around 168, and the highest was 188, so a 20fps difference.

They were using this video card: NVIDIA GeForce 6800 GT with ForceWare 77.77 drivers

However if you look at the opportunity cost of buying this ram because you have a bad video card and play at those resolutions, then it would still be more worth it to just get a better video card. Even if you want to upgrade your ram, it would be wiser to just save that extra money to put into a 6600GT or something.

It would have been interesting if they did the test with an older video card as well, like a GeForce3 series.
Re:Insightful article by Anonymous Coward · 2005-11-02 04:45 · Score: 0

Once again, a worthless benchmark on /.

Why not do a benchmark that actually tests memory bandwidth and throughput - not the video card, the video card bus, or any other subsystem to an extent that the memory system is not the bottleneck in the test.

A good rule of thumb is that if the benchmark post contains *any* type of MS Office software benchmark, you can discount the findings as worthless, and the creators of the test as idiots if they try to pass it off as meaningful.

Then again, I guess the only real use of a windows box these days is gaming...
Re:Insightful article by Heembo · 2005-11-02 05:52 · Score: 1

> Oh yes, and that website also crashed my Firefox. DAMN YOU BILL GATES!

--
Horns are really just a broken halo.
Re:Insightful article by Zathrus · 2005-11-02 07:11 · Score: 1

It would have been interesting if they did the test with an older video card as well, like a GeForce3 series.

Not really. The video card was irrelevant to the test -- all the high vs low benchmarks showed was that if you're GPU bound then your memory latency has virtually no impact whatsoever. An older video card would've just been GPU bound at lower resolutions (for most of the games they tested, the lower resolution test would've been GPU bound by itself; BF2 wouldn't even run on a GF3).

If you're a gamer, you really should try to be GPU bound when possible -- as long as the lowest frame rate you get is high enough to avoid hitching and stutter (generally 45-50 fps). Otherwise you probably spent too much on your video card.

Note that some games simply don't become GPU bound. But most do.

one word - SuperPi by Anonymous Coward · 2005-11-02 04:19 · Score: 0

If you wanna show off your uber-low SuperPi times you need to be at or below 2.5-3-3-7 T1 ;-)
http://superpi.radeonx.com/

and yes we all know PiFast is much faster than SuperPi:
http://home.istar.ca/~lyster/chart.html

If you can afford a cup of coffee a day... by MLopat · 2005-11-02 04:21 · Score: 1

By the same logic as those cheesy insurance commercials, where you can afford the policy if you can afford a cup of coffee a day, if you can afford to spend 5 minutes reading slashdot each day, then you can afford not to drop twice the amount of money on ram for the 2% time savings it offers in most programs.

Re:If you can afford a cup of coffee a day... by merdaccia · 2005-11-02 05:22 · Score: 4, Funny

Your analogy does not hold. Slashdot is a high latency site. By the time I've read a few comments, I've usually forgotten what the story was about.
Wait, why am I posting this comment again?

--
*blinking cursor*

Conclusions by Anonymous Coward · 2005-11-02 04:23 · Score: 0

Although tighter memory timings and a 1T command rate can certainly improve the performance of the Athlon 64's memory subsystem, that improvement doesn't always translate to better application performance. In fact, with the exception of the Sphinx speech recognition engine, moving to tighter memory timings or a more aggressive command rate generally didn't improve performance by more than a few percentage points, if at all, in our tests. Lower latencies only improved WorldBench's overall score by a single point, and performance gains in games were generally limited to lower resolutions and detail levels.

At the end of the day, the appeal of low-latency memory modules may be limited to overclockers and enthusiasts intent on squeezing every last drop of performance from a system. More pedestrian "value" memory should be plenty fast enough for everyone else, especially since you can practically afford twice as much.

But surely flashing LEDS make it go faster! by Cr0w+T.+Trollbot · 2005-11-02 04:25 · Score: 3, Funny

After all, that's the main feature of Crucial Ballistix Tracer Memory. I'm sure those LEDS must be worth at least 10 fps in Doom 3...

Crow T. Trollbot

Re:But surely flashing LEDS make it go faster! by DigiShaman · 2005-11-02 19:45 · Score: 1

I'm sure those LEDs will end up costing players next to you a few frag points. The moment they set eyes on your "bling bling" is another moment they fail to pay attention to the game at hand.

"Dude, no fair. Your bling was in my eye!!!"

Hey, you can't blame loss of frags on LPBs at a LAN party anymore. ;)

--
Life is not for the lazy.

Ask a builder by Dragoon412 · 2005-11-02 04:30 · Score: 2, Insightful

Seriously, this has been known very well amongst the gaming PC builder crowd for a long time. Most of them, anyways; there's unfortunately still that level at which people know enough to put the PC together, but don't know enough to tell you what any of the numbers mean.

The difference between, say, Corsair Value Select memory, and Corsair 1337 Ultra X2000 - the memory equipped with LCDs, heat spreaders, and a spoiler with metal-flake yellow paint that add at least 10 horsepower - is going to be absolutely unnoticeable in the real world. Even benchmark scores will show little to no improvement.

Ricer RAM - you know, the PC equivalent of this crap - is for overclocking. If you're not planning on overclocking it, you're paying too damned much.

Re:Ask a builder by Evil_Timmy · 2005-11-02 04:42 · Score: 1

Compared to the people who are willing to cool their CPUs with liquid nitrogen for five minutes and risk cracking the core, hyper-accelerated chip creep, shorting from rapid condensation, and a number of other potential issues, just to turn out a few extra 3dmarks (thankfully, Futuremark's killed off that crowd with ridiculous releases time and time again) or gain a few more MHz than the next hardcore overclocker down the line...paying near double the price for a 2-5% increase in performance (on a good day) seems downright reasonable. In my case, I've only bought Corsair for their high-performance EEC memory for a critical server; hardcore memory at average (rated) speeds is rock-solid, and their tech support department and RMA policies are the best you can get without a $10k+ support contract.
Re:Ask a builder by theantipop · 2005-11-02 04:49 · Score: 3, Insightful

What most people don't realize is that the only way to improve your performance at the top end of the performance spectrum is through a combination of small tweaks such as this. Sure spending twice the money for 103% of the performance sounds dumb, but when you combine that with small tweaks to your processor, graphics card and a 10,000rpm hard drive they add up.
These products are not for people who want to achieve a useable level of performance and as such are not marketed at those crowds. They are for people who have already fast equipment but want more. I won't say this is a good or bad thing as it is simply a hobby for most of these people. Just like import tuners: they may drive funny-looking cars, but it's their choice of hobby.
Re:Ask a builder by WhiteWolf666 · 2005-11-02 05:25 · Score: 1

Reminds me of :
Welcome to Gentoo is Rice, the Volume goes to 11 here

--
WhiteWolf666 an exBush supporter. All you new-school,compassionate,save the children Republicans can rot in hell
Re:Ask a builder by mobets · 2005-11-02 06:43 · Score: 1

Ok, I agree with you about all the flashy stuff, but I think you underestimate the power of low latency ram. My machine is a P4 3.0gz HT with a Gig of standard PC3200 (512 for each chanel). My bother's machine is an early AMD64 2800+ish socket 700 whataver (sorry, don't know the specifics) with 512 MB of low latency ram (223 or something similar). Except for the ram, I would expect us to be an reletively eaven footing. Some differece sure, but not a lot. However, his install of Gentoo from stage 1 took half the time mine did. GCC does almost all of it's work in ram, rarely hitting the harddrive. Just my own experiance, but the ram does make a difference.

--

It was me, I did it, I moved your cheese
Re:Ask a builder by Anonymous Coward · 2005-11-02 06:54 · Score: 0

You have no idea how great overclocking realy is. I have been running a 4ghz P4 with 1,000 FSB for over 2 years now. The computer is on 24/7/365 with 100% CPU usage running Seti@home. It gets gamed on all the time .. it gets taken to LAN parties and has shiny blue lights that puts a smile on everyones face when ever they see it. Also it has a water pump a radiator with a bunch of tigon tubes that run all over. It only cost me $300 for something that it seems for now is not even on the drawing board for Intel. Thanks to Corsair XMS memory its been running crash free at those speed.
Re:Ask a builder by StarvingSE · 2005-11-02 08:05 · Score: 1

The key to increasing performance on a machine is to identify and solve the bottleneck on speed (ie your system is only as fast as the slowest component). For most people, this is most likely their hard disks. Seeing as the premium memory is only 3% faster than the regular stuff shows that its not much of a bottleneck. I think most people will see a better speed improvement by upgrading to 10,000RPM HD's in a RAID-0 configuration. After doing this in a system, the speed difference was like night and day.

--
I got nothin'
Re:Ask a builder by Guysmiley777 · 2005-11-02 08:38 · Score: 1

Like everything else in computing, It Depends On What You're Doing. Do you start off every day installing Gentoo?

For gaming (which is what inspires most insane PC builds) this article proves you'd be better off putting that $100 extra you'd spend on RAM into an even more eye wateringly expensive video card.

--
Coding with assembly is like playing with Legos. Coding an application in assembly is like building a car with Legos.
Re:Ask a builder by Glonoinha · 2005-11-02 14:04 · Score: 1

Want in on a few secrets?
If you have to use a stopwatch to tell which system is faster, they are the same speed.
If you calculate that one system is 7% faster than another system, they are the same speed.
If you have one system getting 127 frames per second, and another system getting 136 fps - they are the same speed.
103% for memory tweaks, 102% for OCing the CPU, 104% for a different tweak all add up to : same speed.

There are two magnificent pieces of equipment that are going to make your computer faster : a calendar, and a credit card.
A year from now when you can buy a new box that is THREE TIMES FASTER than your current (six month old) machine, now THAT is faster.

To use the cars analogy :
If I'm in a regular car and someone else is in a ricer, and he accelerates 7.1% faster when the light turns red - a block later if he and I are both waiting at the next red light together, we are going the same speed. 34mph = 37.5mph. Now if his vehicle can launch with authority and do 0-60 in under three seconds (mine can), can get to over 100mph before getting to the next traffic light and gets through the next light before it turns red (keeping me from getting through) - then he is going faster than I am.

3x faster = faster.
1.05381% faster = same speed.

PS - in case you are wondering, it's a stock GSXR-1100, and 0-60mph in under 3 seconds isn't that big of a deal in my world.

--
Glonoinha the MebiByte Slayer
Re:Ask a builder by theantipop · 2005-11-03 02:28 · Score: 1

Tell the NHRA that the two top-fuel dragsters that are 0.001 seconds apart are the same speed. Tell the baseball team that loses the World Series by 1 run that they're equally as good. Look, you may not understand that some people benchmark as a form of recreational competition, but it's reality whether you like it or not.
Re:Ask a builder by theantipop · 2005-11-03 02:31 · Score: 1

There's one important thing you fail to consider. Not all computer tasks are limited by your hard disk. Sure plenty are: bootup, unzipping, file copy. But there are just as many that aren't. Image rendering, video encoding, 3d games, math programs. Those are all limited by your CPU frequency and/or memory speed and latency. Once your application is started up and loaded into memory, say goodbye to your hard disk bottlenecks.
Re:Ask a builder by Glonoinha · 2005-11-03 10:47 · Score: 1

Two dragsters are 0.001 seconds apart ARE the same speed.
From run to run if they ran 20 runs and in all 20 runs the same car was 0.001 seconds faster each and every single time, letting the drivers swap cars a few times so each driver got 10 runs in each car, I would be willing to budge a little. But you and I know that that isn't the case. The machines are the same, but one lane had a little more rubber on the ground, or the driver was a little better (or a few lbs lighter, or had on his lucky shoes, or caught the light a little better), or one lane was one degree cooler due to the cross-wind.

Compared to my ride, though, those machines are faster. Twice as fast, easily.

I can appreciate the modders hand tweaking their boxes for fun - I used to do it myself. Hell I have hand polished and deburred the blades of a ceiling fan just to make it quieter, so if anybody can relate, I can. I'm just saying that it is stupid, and that comparing two machines that are the same effective speed and trying to say one is faster than the other (however imperceivably) is also stupid. I will put my home system up against the fastest, freakiest liquid nitrogen cooled computer you can possibly put together using an unlimited budget today. And I will be significantly faster, applying those two magnificent pieces of hardware (a calendar and a credit card, by which I imply I will be using the home system I buy a year from now.)

Not 4.3% faster. Not 7.1 FPS faster. I'm talking TWICE as fast, and that is what faster really means.

--
Glonoinha the MebiByte Slayer

Not to harp on the obvious by Anonymous Coward · 2005-11-02 04:31 · Score: 1, Interesting

But they're doing this on an AMD-64 platform...

Re:Not to harp on the obvious by Anonymous Coward · 2005-11-02 11:04 · Score: 0

...whose integrated memory controller will be implemented by Intel in roughly 1-2 generations due to DDR-2's inefficiencies.

Seriously, the integrated memory controller coupled with slower RAM yields as fast a box as a seperate memory controller with faster RAM. The performance is there, measured in the real world. The absolute most you'd see with a DDR2 test is slightly different numbers, because of the extra latency introduced between the CPU & memory controller eats up most of the performance advantage.

In any case, the test is just as valid, since the principles they're testing are the same.

The underestimated impact of latency. by G4from128k · 2005-11-02 04:33 · Score: 1, Insightful

These tests underestimate the performance impact of latency because they are conducted using software optimized over the years for the high-latency realities of current-day memory architectures. CPU clock speeds have been outstripping RAM clock speeds for about 15 years. Software developers have spent years optimizing their code to mitigate the impacts of latency.

In the short-run, these tests help a person decide whether to buy low-latency RAM. But they provide little long-term insight into how much faster the entire system could be if low-latency were the norm and compilers, libraries, operating systems, and applications were re-optimized for low-latency, not high-latency, architectures.

--
Two wrongs don't make a right, but three lefts do.

Re:The underestimated impact of latency. by Zathrus · 2005-11-02 05:05 · Score: 5, Interesting

Sorry, I call BS on your entire post. The difference in latencies here is miniscule -- it's not like we're talking about having the CPU wait 2 clock cycles vs 30 clock cycles. It's closer to 13 vs 25 (not exact, but the magnitude of difference is close). That just doesn't matter that much -- the reality is that if you have a cache miss then you're looking at 20-30 cycles (or, more likely, 40-60 cycles) of stall while you fetch the data from main memory.

The kind of changes you're talking about require vastly faster memory. Not the kind of latency differences being discussed here at all. Both of these are "high latency" compared to what would be needed for your theoretical redesign of the entire software stack. And even then, you just become utterly and completely screwed if you have to hit virtual memory, possibly more so than you are now because you've re-orchestrated everything around the idea that latency is a non issue.

Oh, and latency is getting worse, not better, and has been for a long, long time. CPU speeds long ago outstripped the speeds of our fastest memory (well, fastest while still not costing absurd amounts of money...), and the newer memory formats (DDR, DDR2, DDR3, RDRAM, etc) have higher latencies in exchange for greater bandwidth.
Re:The underestimated impact of latency. by twiddlingbits · 2005-11-02 05:12 · Score: 2, Informative

The software knows nothing about memory latency, the software only knows it needs to move a block of data from point A to point B. That Java/C/C++ Move_Memory function translates at the lowest level to machine code instructions which are implemented in the logic of the silicon. The coder or the compiler may optimize the ORDER of execution of the instructions, or use different instructions (such as BlockMoves) to speed things up, but the basic underlying machine instructions execute the same way every time (either they hit the cache and load from there, or it misses and a memory fetch is executed across the memory bus). On-chip caches were a design to minimize memory fetch and it's associated latency. On-chip caches are small and fast and are a different design than the external memory.

What you would want is to eliminate the wait states from CPU to RAM (or get more cache hits) and that is NOT something a compiler or OS can do for you, that is done in the algorithms that run the CPU. You can change that to some extent in the BIOS settings, to tell the CPU that memory wait states are zero, or the clock is higher but IIRC the CPU and Memory and Bus Controller have to agree on all this setting and must be able to implement its' timing. Overclocking the CPU won't fix this when the Bus and Memory can't run any faster.
Re:The underestimated impact of latency. by ChrisMaple · 2005-11-02 05:45 · Score: 1

RAM not on the CPU chip has been and will continue to be a burden on performance. Low latency RAM will reduce this trend but not reverse it. The physical distance between the CPU and RAM adds to latency due to speed-of-light delays, and it's hard to avoid that.

--
Contribute to civilization: ari.aynrand.org/donate
Re:The underestimated impact of latency. by Anonymous Coward · 2005-11-02 06:32 · Score: 0

To emphasize on this point: In the world of the 8-bit home computers of the 80's, just about any algorithm more complicated than an addition could be made blazingly fast by using look-up tables and jump tables -- by sacrificing memory for processing speed. Memory at the time was scarce, but so was processing speed, and finding the right balance often involved 10 to 50 % of your memory in look-up tables.

Today, all this has changed: to avoid cache misses, you try to avoid using memory as much as possible, jump tables are a disaster because they cannot be handled by the CPU branch predictor, and it can make sense to repeat comparatively complex computations simply because the CPU is so fast and keeping old results in memory is so slow.
Re:The underestimated impact of latency. by mce · 2005-11-02 07:08 · Score: 1

Nonsense. The latency problem has done nothing but getting worse for the past x years, as CPUs have been speeding up way faster than memories. In the future this is only going to get worse.
Yes there are software developers that have spent years optimizing code to mitigate the impacts of latency. In fact I and my team earn a living by researching methods and producing tools that help in doing this. We do this (and can afford to do so) only because it is very clear that we do not run any risk of being obsoleted simply by the SAMSUNGs of this world designing lower latency modules. In fact, SAMSUNG is so much aware of this themselves, that they licensed our tools for use in their own end-user product design flow and that we have SAMSUNG engineers permanently posted within our company so as to learn how all of our stuff works.

--
Linux user since early January 1992.
Re:The underestimated impact of latency. by OS24Ever · 2005-11-02 07:34 · Score: 1

In Other words -

It's like putting 93 octane gas in a 87 octante tuned car. You waste your money and get nothing out of it but maybe a check engine gas light.

--
As a rock-in-roll Physicist once said, No matter where you go, there you are.
Re:The underestimated impact of latency. by twiddlingbits · 2005-11-02 08:12 · Score: 1

Yep, good analogy. The Law of Diminishing Marginal Returns kicks in Immediately. There are some interesting "solutions" being tried (such as the NUMA Architecture from SGI) and other shared memory approaches to reduce the memory hit.
Re:The underestimated impact of latency. by Woody77 · 2005-11-02 10:26 · Score: 2, Interesting

However, if you have algorithmicly intensive software (spending lots of time in the same loops or crunching large amounts of data), it's worthwhile to instrument your code and see how you're doing for cache hits/misses. You might discover that by tweaking the inner-most loops or the size blocks you crunch, you can better fit the cache of the target processor.

Word/Excel isn't going to bother, but a game might be worth stuffing a few versions of tweaked loops in that are selected by a loop invariant, or by feeding the functions some data ahead of time to help guide them to use the best sizes of data that they can.

This isn't unlike memory alignment for structures, and taking a massive performance hit for the data not being "easy" for the assembly instructions to process.

One example is the ability to loop-unroll the innermost butterflies of an FFT on the x86-64 extension using the extra registers that are available there. That WILL get you a noticeable increase in performance.

But these are always the last 20% kinds of increases...
Re:The underestimated impact of latency. by twiddlingbits · 2005-11-02 16:46 · Score: 1

If you are doing a lot of FFTs on an x86 processor you probably have either the wrong processor (DSPs are much better at FFTs) or the wrong algorithm. Estimating things (or pre-calculating the common values and interpolating the rest) and using table lookups is much faster even with using the extra registers. Memory alignement on structures USED to be an issue, but not anymore. I did some research about 4-5 yrs ago that showed on a PowerPC processor alignment of structures made no difference, the tiny bit of lost time was greatly offset in other areas. Sure, this tweak can squeeze out the last 1/2 of 1% performance and that is sometimes needed but for 99.9% of the applications it doesnt matter.

Umm, how is tweaking the innermost loops going to result in more cache hits? With the size of caches today you can bet that loop is already in cache after 1 iteration. Instrumenting code is old stuff, a good High level language debugger can tell you so much more, and if you need to know more than that learn to read circuit schematics and how to operate an oscilliscope. Data size might be an issue, but some CPUs are smart enough to see you are "walking" thru memory and do pre-fetches to help you out. Most CPUs when they DO have to reload the cache are not grabbing a few instructions or bytes of data, they go grab 4K or 8K blocks.
Re:The underestimated impact of latency. by Woody77 · 2005-11-02 17:06 · Score: 1

Agreed that FFTs are best done in DSPs, but when writing an app that runs on x86 platforms, that's not much of an option. And for a scientific (or design) application that heavily uses FFTs (speaker design, for instance) you want the precision of doing the match instead of estimating things.

From a cpu-cycle approach, how sensitive was the processor to byte alignment in structures? I'm not familiar with PPC assembly, and haven't done the calculations on x86, but am familiar with other CPUs (mostly older) that were sensitive to the tune of an extra couple cycles per instruction to do the internal shifting of data. Granted, these were also non-superscalar chips that didn't have the kind of pipelining and out of order execution available to a more modern processor.

By tweaking the innermost loops for the cache hits, I was thinking data-size and not code-size. Although if you overly unroll your loops to the point that parts of the loop fall out of cache... I was also thinking of large blocks of data, with semi-random accesses (as opposed to sequential), say when logically processing array of data vs. arrays of numbers. Again, this might be moot with megs of cache on-chip.

I wish I had the time to go craft up some scenarios and really sit down and look at the results...
Re:The underestimated impact of latency. by twiddlingbits · 2005-11-03 04:46 · Score: 1

I wish I had the time to go craft up some scenarios and really sit down and look at the results...

I agree, I did some benchmarking similar to this about 5-7 years ago for some avionics architecture but was not funded to get into this level of detail. I was looking for best CPU, best network, best OS combinations. Somedays I miss the embedded world,as now I work in Enterprise IT where such stuff is WAY below the level of concern.

What about cache? by antifoidulus · 2005-11-02 04:35 · Score: 3, Interesting

Improvements in memory speed crawl compared to improvements in CPU speed, however larger caches can mitigate this problem to a certain extent, so why is it that growth in cache size continues to crawl? The Apple G5 updates FINALLY gave us 1mb l2 cache per core(and of course the industry standard 64k L1 cache per core) and whil the Intel/AMD world is slightly better in this regard, it's not by much. So why is it so hard to increase cache size?(of course you will need good cache allocation/replacement policies to go with them)? I'm not trolling, I honestly want to know. I realize that the people that design these chips are a lot smarter than I, but so far I haven't really seen a good reason why they don't increase cache size.
Also, outside of the HPC world, it seems very few programmers optimize their cache usage. Are there any tools(open source or otherwise) that can actually help you locate/fix inefficient uses of cache?

--
Monstar L

Re:What about cache? by tradiuz · 2005-11-02 04:40 · Score: 1

Why only a 64k L1 and 1mb L2? Why not a 1mb L1 and 4mb L2?
Re:What about cache? by prefect42 · 2005-11-02 04:43 · Score: 1

They've moved towards on die cache, and that makes it expensive. A typical CPU uses a hell of a lot of chip area for cache already. Why do you think P4 EE costs so much?

--
jh
Re:What about cache? by gnasher719 · 2005-11-02 04:44 · Score: 1

>> The Apple G5 updates FINALLY gave us 1mb l2 cache per core(and of course the industry standard 64k L1 cache per core) and whil the Intel/AMD world is slightly better in this regard, it's not by much.

Remind me again, how much L1 cache exactly does a Pentium 4 have? Wasn't it something like 8KB fast cache on the older ones and 16KB at half the speed on the newer ones?
Re:What about cache? by Anonymous Coward · 2005-11-02 04:46 · Score: 0

So why is it so hard to increase cache size?

Because L2 cache resides on-die. Take a look at CPU's with 1 or 2MB of L2 cache. Over half the die is cache. More cache makes chips bigger, hotter, and more expensive.

L3 cache, which is usually not on-die, is usually a lot bigger. I recall IBM p-series of servers support something like upto 128MB of L3 cache per MCM (multi-chip module of 4 CPUs).
Re:What about cache? by sznupi · 2005-11-02 04:48 · Score: 1

Obvious reason - much more expensive (you can't get as good yields at manufacturing fab)

--
One that hath name thou can not otter
Re:What about cache? by harrkev · 2005-11-02 04:48 · Score: 2, Informative

For one simple reason -- die size. Cache eats up a lot of real estate. A 1MB (B as in byte) is 8 million bits. If the cache uses DRAM-style cells, that is at least 8 million transistors. If the cache is more like SRAM, then you can count on a lot more This increases the size of the die, which decreases both the number of chips per wafer, and also increases the percentage of defective dies.

So, the bottom line is that cache is the most expensive type of memory in a computer. Some methods have been made to get around this -- like the Intel "Slot-1" architecture where the L2 cache was on a separate chip. But this idea faded into the museum of bad ideas.

--
"-1 Troll" is the apparently the same as "-1 I disagree with you."
Re:What about cache? by timeOday · 2005-11-02 04:53 · Score: 2, Informative

Because you get diminishing returns for more and more cache. At some point it's better to use all those transistors as a second core instead.
Re:What about cache? by Anonymous Coward · 2005-11-02 06:17 · Score: 0

"Because you get diminishing returns for more and more cache. At some point it's better to use all those transistors as a second core instead."

Pretty much, and the reason why you get diminishing returns is due to program locality. If you take a program (sequence of instructions), you'll notice that the program tends to stay in one area for a long time before moving to a completely different area. This is true for most programs except some programs that are particularly branch-intensive. Modern compilers now offer algorithms to gain more locality in your code without you explicitely programming for it. The OS and hardware also have techniques in place for optimizing reducing cache misses. At some point you're going to have fewer and fewer cache misses that increasing the cache size won't help much if at all.

Lots of research has been poured into creating techniques or algorithms for dealing with slow memory and the result is the memory hierarchy. At the bottom you have your hard disk and at the top you have CPU registers. All of the technologies in both OS, compilers, and hardware all account for dealing with this architecture these days.

It makes sense though: in the ideal situation you want to have a massive amount of cheap memory, not expensive memory.
Re:What about cache? by toleraen · 2005-11-02 06:30 · Score: 1

I don't recall for sure, but i seem to remember it being about fitting it onto the chip. Since its L1, it needs to be accessed incredibly quickly, so it needs to reside near the gates/pipelines/however you call it. Adding more registers to hold the data would take more physical space, would need to be seperated further, and would cause more latency in retrieving data. That's just my vague recollection from a lesson about the 80386...so it could be way off heh.
Re:What about cache? by Sycraft-fu · 2005-11-02 06:45 · Score: 2, Insightful

Cache is SRAM, since SRAM is much faster. Ok, except that SRAM takes 6 transistors per bit to make. So for 1 megabyte of cache, that's 48 million transistors to implement. That's a major budget of silicon. As transistor count goes up, so does die size, heat, cost, failure rate, etc. So putting large caches on just isn't feasable. A 8MB cache would use more transistors than most processors today do in total between core and cache.

Ok, you say, so move it off the chip. Well the problem is that part of the reason the cache can be so fast and low latency is that it's located on die. If you move it off, you start to run in to a lot harder speed limitations. Intel discovered that with their Celerons back in the PII days.

Real PIIs had 512k of cache, but on seperate chips. Because it was off die, half chip speed was the best they could do. The Celerons only had 128k of cache, but it was on the chip die and thus ran at full speed. Now what you found was if you overclocked a Celeron to the same bus and speed as a PII (for example a 300mhz Celeron ran on a 66mhz bus, cranking that to 100mhz made it run at 450mhz, the same as a PII) it ran at least as fast, sometimes faster, despite the lower cache.

Thus these days, on-die cache is what it's all about. Generally the value the pick is where diminishing returns start to seriously kick in. You discover that throwing more cache at things generally doesn't result in that big a speed increase (servers are a little different).

So, unless someone figures out a better kind of RAM to use, we are stuck. DRAM is what we use for main memory already, and SRAM is too expensive to use very much of.
Re:What about cache? by Detritus · 2005-11-02 07:02 · Score: 1

It's very hard to do fast and large. Fast and small or slow and large are easy. Increasing the size of the cache also increases the delays in the circuits that manage the cache. Signals have to travel longer distances and drive more loads.

--
Mea navis aericumbens anguillis abundat

Save money? by PhilHibbs · 2005-11-02 04:38 · Score: 1

They could even save you some money

How so? The article indicates that the benefits are marginal. How can this RAM save money?

Re:Save money? by BushCheney08 · 2005-11-02 04:42 · Score: 1

The submitter is saying that this Tech Report article could save you money, in that if you read it, you will be informed that low latency memory isn't necessarily worth the extra cost. C'mon, rub those brain cells together...

--
Be a real patriot: Question authority. Think for yourself. Formulate your own conclusions.
Re:Save money? by Anonymous Coward · 2005-11-02 04:45 · Score: 0

How so? The article indicates that the benefits are marginal. How can this RAM save money?

Hmmmm...let me think, let me think....Oh, maybe by NOT BUYING IT!

Did that breeze over your head feel nice and refreshing?
Re:Save money? by CountBrass · 2005-11-02 05:08 · Score: 3, Funny

I'm sorry, but you are too stupid to post on /.

--
Bad analogies are like waxing a monkey with a rainbow.
Re:Save money? by PhilHibbs · 2005-11-02 05:31 · Score: 1

Great, I'll not buy a hundred then. Wow! I just saved a ton of cash!
Re:Save money? by Anonymous Coward · 2005-11-02 05:36 · Score: 0

Great, I'll not buy a hundred then. Wow! I just saved a ton of cash!
You must be pretty smart, huh? I've got a bridge you can not buy and you'll save even more money! You'll be rich! Then you'll be able to afford that expensive RAM to impress chicks!
Re:Save money? by freeweed · 2005-11-02 07:32 · Score: 1

By not buying it.

--
Endless arguments over trivial contradictions in books written by ignorant savages to explain thunder in the dark.
Re:Save money? by PhilHibbs · 2005-11-02 07:43 · Score: 0, Redundant

Yes yes thank you I know now!

I think the idea that you are saving money by not buying something that you weren't going to buy in the first place may be more common in the US than it is over here.
Re:Save money? by Anonymous Coward · 2005-11-02 10:01 · Score: 0

Um, YOU are the one who couldn't comprehend the extremely (perhaps mind-numbingly) simple idea presented by this article. Why are you trying to disparage others based on supposed location? Why are you trying to cover up your own stupidity by taking aim at a culturally convenient target (The Stupid and Evil Americans), and how do you know that the poster you are replying to is an American? Slashdot, as you know, is read and commented on by people across the planet.

I assume your original post was written quickly without reading or comprehending the article. Unlike your knee-jerk response to the criticism you received I won't assume anything about you or the culture you reside in or associate with.

This was a simple story that didn't need anyone to prove how 1337 they are by posting without reading or understanding the point. You posted a less than intelligent comment and got called out on it. Instead of accusing others for your mistake, perhaps just accept it and move on.
Re:Save money? by Anonymous Coward · 2005-11-02 12:47 · Score: 0

And you're too rude. The problem isn't the quantity of either trait, but a lack of balance. With your powers combined, you could be the ultimate /. poster!

So did ExtremeTech - and they included A64 and P4 by freidog · 2005-11-02 04:43 · Score: 4, Interesting

ExtremeTech Article

Importants of clock speeds by springbox · 2005-11-02 04:46 · Score: 1

If people are concerned about the speed of their memory, then having fast DDR SDRAM running on an equally fast FSB is what really makes a difference. This is especially true on P4 Celeron based systems where the L2 cache isn't huge and cache misses are common. While memory latency is important to consider, it isn't critical that your modules have the absolute fastest timings ever. I think that the importance of the other components that connect to your memory like the FSB are underestimated. You can have fast memory, but still have it traveling over a slow or congested bus.

Re:Importants of clock speeds by Chirs · 2005-11-02 10:51 · Score: 1

If you're using an Athlon or better, your memory connects directly to the cpu, so the speed of the FSB is *utterly and completely* irrelevent to memory speed.

The real issue ... by TheCrig · 2005-11-02 04:48 · Score: 4, Insightful

... Is not memory performance as such, but system performance. If a 5 percent increase in system performance increases the cost of your system by 10 percent, you have to want it pretty badly or be on the edge of required performance or just be in a schoolyard comparison. But if it's reversed, and a 10 percent increase in system performance can be had for a 5 percent increase in system price, then if you can afford the 5 percent (say $100 for a $2000 system), go for it.

--
-- Jim Crigler In 1937, I began, like Lazarus, the impossible return. -- Whittaker Chambers

Re:The real issue ... by Nic-o-demus · 2005-11-02 10:24 · Score: 1

And the real way to increase system performance is to focus on the current bottleneck. Just like optimizing code, you look for the weakest link in the chain and work on that first. I wish someone would put a performance review together where they really put things into perspective. Something like:
1st best leveraged enhancement: Disk access- once you get something to such-and-such speed, then the next important step is:
Bus speed. Once you get this to speed x:
Video card.
then RAM latency...
(and of course CPU clock speed is in there somewhere)

At some point if the graphics card and the HDD access are fast enough and maybe some other things, low latency memory will give very big system improvements (percentage-wise. Of course there is always the law of diminishing returns).

well Duh? by thebdj · 2005-11-02 04:49 · Score: 1

I mean did anyone seriously think that these memory latencies were going to have a great impact on anything that the most common users care about? I mean game performance is barely touched at all, which is another Duh! I think their conclusion is probably right, the people buying these things are the idiots who want to post how they have the ultimate system with great RAM and everything, where they probably only could afford 1 GB of their stuff, my performance and load times are better because I could afford the 2 GB of my 'slower' RAM.

Geeze, must be a slow week for hardware...

--
"Some days you just can't get rid of a bomb."

You really think so? by Gothmolly · 2005-11-02 05:05 · Score: 1

"Software developers have spent years optimizing their code to mitigate the impacts of latency."

Really? MS hand-tunes the ASM code generated when they do a build of winword.exe ? Maybe thats why OO.o is so slow?

If I sound sarcastic, I suppose I am. With a few exceptions, almost every coder I've worked with in multiple jobs, has been of the 'throw CPU cycles' at the problem. I can count on one hand those who actually design for a HW architecture, since most of the coders these days are VBScript and Java kiddies.

--
I want to delete my account but Slashdot doesn't allow it.

server db/compile performance? by Anonymous Coward · 2005-11-02 05:09 · Score: 0

Anyone have any benchmarks on how much faster a compile, DB or app server gets with better memory like this?

What does this mean? by Flying+pig · 2005-11-02 05:27 · Score: 2, Interesting

All memory has an access time, and the further you get from the CPU the longer it is going to be. CPU registers have the shortest access time, with (nowadays) subnanosecond access. L1 cache comes next, then L2, then external RAM, then HDD, and finally the slow backing store represented nowadays by CD and DVD. This heirarchical memory architecture changes with time mostly in that the caches grow bigger, so the 640K of RAM from DOS days now fits into the cache of each processor in a pentium-D with room to spare, and a Pentium-M could in theory run DOS with extended and expanded memory without needing any external RAM at all. (I'd almost like to try that.)

So talking about optimisation for low-latency RAM is, I suspect, nonsense. What we are surely seeing here is that the actual limitation on memory bandwidth is somewhere else - in the memory controller,in the cache controller, in the CPU fetch rate, in the rate at which stuff is being fetched from hard disk, in bus contention. Overclocking - speeding up memory controllers and buses - will have an effect. Reducing the number of wait states on the memory bus will not have much effect on performance if the total number of active memory cycles in a given period is largely unchanged.

If you had a need for real speed in an application which was not dependent on the graphics subsystem or access to network and HDD, I am sure you could get much more performance out of low-wait state RAM, but you would do it by HARDWARE design, not by software optimisation.

As a simple example from the dim and distant past when I was building hardware, TI used to have a microcontroller called the TMS9995 which ran at, for the day, a hefty 12MHz. With the slow DRAM of the time, it always needed a wait state and this meant that it could manage, as I recall, two memory accesses per microsecond. With static RAM, it could manage 3. The 9995 actually stored its working registers in external memory and so this meant a real world speedup of nearly 30%. The 8088, on the other hand, kept its working registers on-chip and had a limited instruction pipeline. As a result, the equivalent speedup was nothing like 30%. This was due to hardware differences not software differences.

In fact, the applications which really test out the memory subsystem are not games - they are databases and webservers, which hardly use the graphics system at all. And in these cases, for low end systems, the big beast in the equation is cache. It's quite astonishing how a Pentium-M can churn through a badly designed join while a low end AMD 64 struggles, simply because one has 2Mbytes of cache and the other has only 512K. As a result, for ordinary technical laptop and desktop work, I now specify Pentium-M, the AMD 64 with 1Mbyte cache, or pentium-D with 1Mbyte per core. You know it makes sense.(And now everyone can explain why I'm wrong, in my turn)

--
Pining for the fjords

Memory Timing Analysis by __aawavt7683 · 2005-11-02 05:52 · Score: 1

I remember an article published on Slashdot a year or two (or three or four?) ago about a memory timing analysis. A copy is available here:

http://www.techwarelabs.com/reviews/memory/memory_ timings/index_3.shtml

The net impact of the CAS latency: 0-0.002%. Almost _nothing_.

The great thing about this article is that it goes into just about every aspect available in the bios, giving you a good idea of what _does_ work (a brief scan-through reveals clock speed as the primary contributor, dram command rate and Active to Command (Trcd?)).

Happy reading.

-DrkShadow

No, that's why prices are high. by Anonymous Coward · 2005-11-02 06:04 · Score: 0

In an interview nvidia's chief blatantly said such. He sees no end in sight to how much they can gouge. The tools who spend tons of money for pocket lining havn't balked at the price hikes so far so nvidia et al intend to keep hiking.

Comment removed by account_deleted · 2005-11-02 06:06 · Score: 1

Comment removed based on user account deletion

This is older than just a few months back at anand by Anonymous Coward · 2005-11-02 06:23 · Score: 0

How's about this one? 2004, ExtremeTech
http://www.extremetech.com/article2/0%2C1558%2C163 7762%2C00.asp?kc=ETRSS03039TX1K0000564

My LEDs are blinkier than yours! by cerebis · 2005-11-02 06:35 · Score: 1

I find Corsair Xpert to be the funniest of the Gam3r memory products.

Google Images
PDF from company

Note, due to their width, you can only put in one per bank. :)

Ostentation doesn't work so well when inside an opaque case.

Real world testing by MaTriXxx1 · 2005-11-02 06:42 · Score: 1

Ok, heres my stance on ram (considering my wallet just went up in flames while testing ram... i think i have some things to say about it)

This is on an AMD Sempron 757 socket, 3000+ AMD 64 bit cpu, and a gigabyte k8u motherboard. running windows xp pro 64. nvidia geforce 6600 (agp 8x)

I used Performance Mark(64) to benchmark, and FEAR, and Sims2 for real world testing.

Ok onto the results,

config 1 (2x512) total of 1 gig of memory
value ram 266mhz through tweaking with the motherboard, i scored 380 to 400
there was no cas timings or anything available to be read

fear chokes every few minutes, but overall an awsome game that only lags a little in complex scenes

sims 2 chokes every minute or so... and full fledge crashes the system every hour or so.

config 2 (3x512) total of 1.5 gig of memory
value ram 266mhz through tweaking i scored anywhere from 390 to 405, considering the large jump in memory, i woulda expected a bigger difference to be made

fear completly stopped choking
sims 2, crashed lest often, but still choked every minute or 2

config 3 (1x512, 1x256) total of ~784 megs of ram
value ram 333mhz through tweaking i scored anywhere from 398 to 415, the memory speed seemed to make a large difference, but the timings were HORRIBLE... cheap ass ram 3-3-3-8

fear ran like crap, barely playable.
sims 2, didnt crash but still choked alot

config 4 (2x512) total of 1 gb
patriot dual channel memory kit 400mhz, (2-3-2-5) heres when i got nuts, dropped 109 bux
ram would NOT overclock, at all.... even if i only increased it a few mhz it would crash the whole system, i scored 440 to 450 by overclocking just the cpu speed.

fear was smooth as hell, and an awesome play i was even able to increase the options of the game to allow more detail, without choking

sims 2, choked twice within an hour... but never crashed

config 5 (3x512) total of 1.5 gb
patriot dual channel memory kit 400mhz, (2-3-2-5), and a cheap value stick of geil memory (3-3-3-8 @ 400mhz) again, ram would NOT overclock, at all.... i scored 430 to 435 by overclocking just the cpu speed. as expected... the timing made more difference than the ammount of ram.

didnt even try fear or sims 2... so i cant tell ya

So, whats my conclusion? try it before you buy it!!!! ask a buddy if you can borrow his ram for a few minutes, so you can benchmark your system, and try out some games. see if the difference is worth it to you. if you dont have friends, which hey.... if your reading /. theres a high chance of it, lol... darn anti social nerds... but anyway... my recommendation is as follows...

266mhz to 400mhz makes a HUGE difference, that right there is the biggest boost you will see, thats worth an extra 40/50 bux (to me)

3-3-3-8 is pure and utter crap.... dont touch it with a ten foot pole, a full 1.5 gig of it is far WORSE than 1 gig of decent ram.

2-3-2-5 is pretty good, if you sit n wait for a decent sale you can get a good price... i searched for a month before i dropped my bux on it.

2-2-2-5 is the best you'll see (in my price range anyway), but its expensive, and i honestly dont think its worth the extra 70 bux difference from the 2-3-2-5 price tag.... but i also didnt test it.

all of the equipment i use was bought at frys, they sell stuff on www.outpost.com, but i warn you, if you goto the physical store.... expect them to try to rip you off.... they tried selling me returned ram, brokem motherboards, bad hardrives, a copy of xp pro 64 with a frickin scratch on the cd!!! (it was obviusly used), i havent heard much bad things about the online policies... but i havent tried them

--
Do NOT goto this URL http://www.forthesims.com

Scientific computing benefits from this by Orp · 2005-11-02 06:49 · Score: 2, Interesting

I do large 3D thunderstorm simulations. With some of the larger simulations I am integrating lots of things, contained in 3D floating point arrays, over 1 billion or more gridpoints (using distributed computing, such as a beowulf cluster made up of dual Xeons or an SGI Altix system). Each scientific calculation requires accessing floating point values stored in these arrays, doing some math, and updating another array.

Memory latency, and memory bandwidth, both impact how long it takes my simulations to complete. Let's say it is the difference between a simulation taking a week vs. five days... this is significant to me and how much I can get done. With these heavy duty scientific models and such, you really can see a noticable benefit with the fancier hardware, and clock speed is certainly not the the only factor to consider by a long shot.

--
A squid eating dough in a polyethylene bag is fast and bulbous, got me?

ExtremeTech did this like a year ago by mikemuch · 2005-11-02 06:54 · Score: 1

Does RAM Latency Matter?

Re:Real world testingc (typo fixing) by MaTriXxx1 · 2005-11-02 06:57 · Score: 1

This is on an AMD Sempron 757 should be This is on an AMD Sempron 754 .... i went with the 109 bux for the (2x512, 2-3-3-5, 400mhz, patriot) from frys. should be .... i went with the 109 bux for the (2x512, 2-3-2-5, 400mhz, patriot) from frys. also, id like to say... if your planning on overclocking your memory, do NOT buy patriot ram.... go with the centon stuff

--
Do NOT goto this URL http://www.forthesims.com

Low latency memory is for overclockers by Anonymous Coward · 2005-11-02 07:01 · Score: 0

I admit, I only read the conclusion (it's the only thing I read on articles like that), but I'm rather dissappointed that the author didn't mention overclocking. Low latency memory is almost exclusively marketed to enthusiasts--and that's overclockers.

The reason low latency memory is interesting is it's overclockability. So comparing performance at stock CPU clock is just a silly waste of time. The whole point is now you can mess with your multipliers, bus, and clock and get the whole machine running faster.

I saw a couple other people mention this but I wanted to emphasize that this article really missed the point, I think.

More test - memory frequency included by Anonymous Coward · 2005-11-02 07:02 · Score: 0

If you can read some Czech (or numbers and colours, at least :) ), try this article. Interesting part begins here - throughput and some benchmarks with constant timing and increasing frequency (two pages). Then there are test with two frequencies and three timings.
Tests are done on A64 X2 3800+.

(can't have a subject that starts with $) $ by freeweed · 2005-11-02 07:29 · Score: 2, Insightful

Price. Well, price and size, but mostly price.

Cache isn't some magical thing. It's simply RAM. SRAM, usually, which is why it's so fast (don't have to waste power/time refreshing your contents). At the end of the day, it's just some very fast RAM. It sits between your CPU and the rest of your RAM, and uses its increased speed to "trick" the CPU into performing as if your main RAM is much faster than it is.

In my computer arch course a while back, someone asked why, if cache is so fast, we don't just build computers 100% SRAM memory. Our professor did some back-of-the-napkin calculations for fun. Major $. Have to include the extra space and cooling requirements, of course :)

The other thing, of course, is the good old law of diminishing returns. Cache actually solves the problem VERY nicely. For most people/computers/applications, cache misses aren't that great of a problem, because most computer code lends itself to cache hits (a phenomenon called "locality"). Locality is WHY we have cache in the first place. In general most computing works very well with a tiny amount of very fast cache and a small amount of fast cache. Adding more eventually gets you to the point where you're not seeing much if any improvement. On most modern systems, we're at that point - at least as far as the market will bear.

Oh, and outside of the HPC world, there's no NEED for programmers to worry about memory caching issues. This isn't where most bottlenecks show up, and again, most general-purpose code lends itself very nicely to small amounts of cache. Compilers often help here, too. Most of your average programmers would get better use of their time analyzing the data structures and algorithms they use.

--
Endless arguments over trivial contradictions in books written by ignorant savages to explain thunder in the dark.

Re:(can't have a subject that starts with $) $ by SillyNickName4me · 2005-11-02 10:12 · Score: 1

Heh, I have an i860 based machine here that uses 16mb sram for its main memory. When it was new in the late 80s, just the memory made it cost a small fortune.
Re:(can't have a subject that starts with $) $ by Anonymous Coward · 2005-11-02 11:33 · Score: 0

"we don't just build computers 100% SRAM memory"

Well, you can get 1 or 2 mb of cache nowadays - that's enough to run an entire MS-DOS system REALLY fast :)

Bling in the box by Otto · 2005-11-02 07:40 · Score: 1

the memory equipped with LCDs

I actually picked up a pair of these when I bought my latest machine's parts. I didn't buy it because the memory was any better, I bought it for the geek factor of having a couple of LED displays on which I could put whatever I liked. And I still like it. It's nice to be able to glance at the box to see the temperature the unit is running at. It's fun to watch it go from 95 to 130 when I'm playing some major 3d game. And it was only $30 more than the cheap stuff.

Yes, if you pay list price, you're paying too much. So don't pay list. :)

--
- Give a man a fire and he's warm for a day, but set him on fire and he's warm for the rest of his life.

Re:Cost/Benefit Joke by SSalvatore · 2005-11-02 07:55 · Score: 1

This is a Cost / Benefit joke. Those ridiculous percentage points that you gain by adding this memory are nothing compared to what the average computer spends on swapping. Swapping is in my view the worst performance problem that I see in the average office or home computer these days.

Instead of buying one of this modules at twice the cost, I'd say go and buy twice as much memory. Plese note that I have the average computer in mind, not those steroid pumped mutants that some o' my geek friedns have (you have to be geek and have the cash!).

OT, but by ionpro · 2005-11-02 07:57 · Score: 1

Intel uses an inclusive cache archetecture, so you actually don't get the 640K you were looking for, and even so it'd have to be backed by DRAM (AFAIK, that cache isn't programmer or even OS accessible). AMD uses an exclusive-cache, so the L1 and L2 (and any L3) would all be additive in which data they could store.

JOC, why don't you specify Athlon X2 4400+ or 4800+s? They all have 1MB L2 per core, as well.

You made my point. by G4from128k · 2005-11-02 08:20 · Score: 1

The kind of changes you're talking about require vastly faster memory. Not the kind of latency differences being discussed here at all. Both of these are "high latency" compared to what would be needed for your theoretical redesign of the entire software stack.

Exactly!

Oh, and latency is getting worse, not better, and has been for a long, long time.

Very true my first full-sized computer had a 8 MHz processor and 150 ns RAM in 1985. Now there's more than an 8:1 ratio between CPU and RAM clocks (and the RAM requires several cycles of wait states even at its slow speed). The question is: is this totally irreversible? Could future memory technologies based on paramagnetic hyperoptical nanodots (there's no such thing, but you never know) provide CPU-equalling clock speeds or allow significant amounts of on-die RAM for future CPUs?

My point is that these tests suggest that low-latency RAM wouldn't help in most of the applications tested in TFA. But I wonder if most of those applications have been written to cope with high-latency memory.

--
Two wrongs don't make a right, but three lefts do.

Re:You made my point. by Vellmont · 2005-11-02 08:43 · Score: 1

You seem to misunderstand the question this article is addressing. The question is "what performance benefits should I expect from buying low-latency RAM?". The question is not "Should all computers be designed for lower latency?" Me buying lower latency RAM doesn't make anyone design games or the majority of software for lower latency.

--
AccountKiller

There are a lot of good reasons... by MarcQuadra · 2005-11-02 08:22 · Score: 1

The Apple G5 updates FINALLY gave us 1mb l2 cache per core

What are you talking about? My G3 running at 450MHz has a 1MB L2 cache, and it has since 1999. Pentium Pros and various workstation/server class chips had multimegabyte caches a decade ago.

The reason you've seen less cache is that it didn't make sense to have a slow CPU with a 4MB ache that had to dissipate 100+ watts to operate. on-die cache is expensive in terms of heat, die space, and clock speed.

There's also the marketing factor, Intel would have you believe (at least until last year) that GHz were the benchmark, to get them they had to strip out cache. Everyone has to follow Intel to some extent, or they'll get mowed down (read: Apple filing to sell 500MHz G4s when 1.7GHz P4s were out). If you made a chip today that ran at 400MHz but kicked a Pentium 4 830's ass, you'd have a lot of trouble selling it.

And as for cache optimization, CHUD is an excellent tool to profile such things. You can get INSANE performance benefits by keeping an oft-repeated loop or randomly charged dataset inside the CPU's cache.

--
"Sometimes, I think Trent just needs a cup of hot chocolate and a blankie." -Tori Amos on Nine Inch Nails

As we say in Psychology by frank249 · 2005-11-02 10:25 · Score: 2, Funny

Better latent than never.

--

Today's vices may be tomorrow's virtues.

Ran bandwidth tests instead of latency tests! by pslam · 2005-11-02 23:08 · Score: 1

Bah. Games benchmarks? Doom 3? Cinebench? Those are all primarily tests of memory bandwidth not latency. Not bloody wonder they found very little difference.

That's the funny thing about all this 2-2-2-5 stuff being sold to gamers - it makes very little difference for them. It is however fantastic for code compilation, simulations, and anything else that has unavoidable random access patterns.

Thankfully they do market it for gamers, because if they did market it for scientists, software developers and generally in servers, it would be even more uber expensive than it is now.

Slashdot Mirror

The Impact of Memory Latency Explored

162 comments