Slashdot Mirror


Reliability of Computer Memory?

olddoc writes "In the days of 512MB systems, I remember reading about cosmic rays causing memory errors and how errors become more frequent with more RAM. Now, home PCs are stuffed with 6GB or 8GB and no one uses ECC memory in them. Recently I had consistent BSODs with Vista64 on a PC with 4GB; I tried memtest86 and it always failed within hours. Yet when I ran 64-bit Ubuntu at 100% load and using all memory, it ran fine for days. I have two questions: 1) Do people trust a memtest86 error to mean a bad memory module or motherboard or CPU? 2) When I check my email on my desktop 16GB PC next year, should I be running ECC memory?"

123 of 724 comments (clear)

  1. Surprise? by Anonymous Coward · · Score: 4, Funny

    Recently I had consistent BSODs with Vista64 on a PC with 4GB...

    This was a surprise?

    1. Re:Surprise? by Erik+Hensema · · Score: 5, Informative

      Yes. Vista is rock solid on solid hardware. Seriously. Vista is as reliable as Linux. Some people wreck their vista installation, some people wreck their Linux installation.

      --

      This is your sig. There are thousands more, but this one is yours.

    2. Re:Surprise? by Starayo · · Score: 2, Insightful

      It won't crash often if at all, it's true, but vista is way too slow for my, and many other's tastes.

      --
      Ezekiel 23:20
    3. Re:Surprise? by bigstrat2003 · · Score: 5, Informative

      Agreed. People who will sit and tell me with a straight face that Vista, in their experience, is unstable are either very unlucky, or liars. Windows stopped being generally unstable years ago. Get with the times.

      --
      "16MB (fuck off, MiB fascists)" - The Mighty Buzzard
    4. Re:Surprise? by c0p0n · · Score: 3, Informative

      I fail to see how the parent is a troll, regardless of whether he is right or not.

      Nevertheless my experience with Vista is the same, I run home premium on a newish laptop I use for music production and haven't had a glitch on it for months. My first intention was to wipe out the drive and install XP, but I abandoned the idea some time ago.

      --

      Your head a splode
    5. Re:Surprise? by alexandre_ganso · · Score: 5, Funny

      ... vista is way too slow for my, and many other's tastes.

      Now you got what he meant with "rock solid"....

    6. Re:Surprise? by Erik+Hensema · · Score: 3, Informative

      I fail to see how the parent is a troll, regardless of whether he is right or not.

      That's because I wasn't trolling. Yes, I do know people here on slashdot don't like to hear positive opinions on Vista, but in fact Vista isn't all that bad.

      I use Linux exclusively on my desktop pc at home and at work. I've been using Linux for over a decade. When I bought a laptop a year and a half ago, it came with Vista. Vista is IMHO a great improvement over XP. It's not even slow on decent hardware.ÂI have yet to receive my first BSOD since SP1 was released. SP0 gave me a few BSODs, maybe 5 in total.

      That being said, I use Linux for work and Vista for play. So the comparison may not be entirely fair.

      --

      This is your sig. There are thousands more, but this one is yours.

    7. Re:Surprise? by evan_arrrr! · · Score: 3, Funny

      So what you're saying is that you've figured out how to keep your PC running WITHOUT power? Why are we still driving gas-powered vehicles? Somebody get this man to Washington!

    8. Re:Surprise? by Dan+East · · Score: 4, Funny

      In reference to the parent, gp, ggp, etc. Either I'm reading the alternate-reality edition of Slashdot, or y'all are warming up for Wednesday.

      --
      Better known as 318230.
    9. Re:Surprise? by pdusen · · Score: 2, Insightful

      It bears repeating because of all the stupid people who tried to run Vista on hardware that even XP would shudder at, then complained that it didn't work or was too slow. Ugh.

    10. Re:Surprise? by erroneus · · Score: 5, Insightful

      I find that when a Windows machine, from Windows 2000 on up, when taken care not to install too many programs and/or immature or junk-ware, then Windows remains quite stable and usable. The trouble with Windows is the culture. It seems everything wants to install and run a background process or a quick-launcher or a taskbar icon. It seems many don't care about loading old DLLs over newer ones. There is a lot of software misbehavior in Windows-world. (To be fair, there is software misbehavior in MacOS and Linux as well, but I see it far less often.) But Windows by itself is typically just fine.

      Since the problem is Windows culture and not Windows itself, one has to educate one's self in order to avoid the pitfalls that people tend to associate with Windows itself.

    11. Re:Surprise? by isorox · · Score: 4, Insightful

      Agreed. People who will sit and tell me with a straight face that Vista, in their experience, is unstable are either very unlucky, or liars. Windows stopped being generally unstable years ago. Get with the times.

      I'm not convinced, I have a fairly old desktop at work I keep for Outlook use only. After a few days outlook's toolbar becomes unresponsive, and whenever I shut it down it stalls and requires a poweroff. Task manager doesn't say I'm using that much memory (still got cached files in physical ram).

      I don't use windows much, I'm not used to the tricks that keep it running, where I probably use those tricks subconciously to keep my linux workstation and laptop running.

      I wonder if Windows continued increase in stability is, at least partly, people subconciously learning how to adapt to it.

    12. Re:Surprise? by saintm · · Score: 2, Insightful

      You must be unlucky or the cause.

    13. Re:Surprise? by jamesh · · Score: 2, Insightful

      A guy at work got his laptop with Vista on it. Explorer would hang often (Explorer, not IE), and if he tried to arrange his second monitor to the left of his laptop screen, the system would BSoD. (pretty funny, he had his monitor on the left, due to physical desk constraints, but he had to move his mouse off of the right side of his laptop screen where it would appear on the left of his second monitor...). We updated all the latest drivers from HP but to no avail.

      Since putting Vista SP1 on though it has been fine - all those problems went away.

      I have never seen another Vista machine do that though, so obviously something got broken during the install. If it was Linux I would have been able to fix it myself, but with Vista all we could do was wait for the magic hotfix or sp that might fix the problem.

      Anyway, just because you haven't come across an unstable Vista install doesn't mean they don't exist. (or you're a troll and I just got sucked in horribly :)

    14. Re:Surprise? by andy9o · · Score: 5, Funny

      PEBKAC

    15. Re:Surprise? by Daimanta · · Score: 4, Insightful

      To all the posters who think the parent is a bad mechanic I will tell you my anecdote: I have never had a harddrive fail. Never. Not on a fresh computer and not on a decade old one.

      Either I have magic hands, harddrives don't fail that often or /.ers can't handle harddrives.

      Or people can beat the odds. Chances, sometimes you win in a casino.

      --
      Knowledge is power. Knowledge shared is power lost.
    16. Re:Surprise? by robthebloke · · Score: 2, Interesting

      I have never in my lifetime managed to banjax an install of firefox/Safari/IE to the point it wouldn't work or un-install, which begs the question: what the hell are you doing to it? (If i didn't know any better, I'd suspect the cause maybe downloading too much pron?)

    17. Re:Surprise? by MobyDisk · · Score: 5, Insightful

      People who will sit and tell me with a straight face that Vista, in their experience, is stable are either very lucky, or Microsoft shills.

      See? I can say the opposite, and provide just as much evidence? Do I get modded to 5 as well? Where's your statistics on the stability of Vista? Did it work well for you, therefore, it works well for everyone else?

      I worked for a company that bought a laptop of every brand, so that when the higher-ups went into meetings with Dell, HP, Apple, etc. they had laptops that weren't made by a competitor. They have had problems like laptops not starting-up the first time due to incompatible software. That was a recent as 6 months ago. My mother-in-law bought a machine that has plenty of Vista-related problems (audio cutting out, USB devices not working, random crashes in explorer) on new mid-range hardware that came with Vista. But I have a neighbor who found it fixed lots of problems with gaming under XP.

      There's plenty of issues. Vista's problems weren't just made-up because you didn't experience them.

      Everybody's experience is different. Quit making blanket statements based on nothing.

    18. Re:Surprise? by robthebloke · · Score: 2, Insightful

      It's slower than XP in any case and requires more memory.

      Not true. It uses more memory than XP, but it doesn't require it. In exactly the same way that linux uses more memory than XP, but doesn't require it (it's used for system cache if you bother to check). If you actually install the 64bit version, you'll see where MS's development budget has been spent (The 32bit version of vista feels a bit like Win ME in comparison). In every test I've done, 64bit vista has crapped all over XP from quite a big height.

      The problem is I don't consider decent hardware to be something an IT'er would buy

      Dual core machine + 2Gb ram + integrated ATI/Nvidia/Intel X4500 GPU is more than adequate. These are pretty basic machine specs by anyone's standard, and tbh you'd be hard pressed to find a brand new machine for sale with lower specs than that.

      The worst machine I've installed vista on was an old 1.6Ghz Athalon XP. It was more than happy playing blu-ray disks, and didn't perform any worse than XP. (though I did add an ATI 3650 AGP card to help out with the blu-ray decoding). That's what, a 5year old machine and a £50 upgraded graphics card.... Not exactly high end spec I must say.

    19. Re:Surprise? by Anonymous Coward · · Score: 5, Informative

      I have to send the machine to corporate headquarters so they can do the reinstall, leaving us without one for a week.

      Well, if it takes your corporate IT staff that long to rebuild a computer, they're probably doing it by hand while putting out other fires, which is foolish. Better IT departments have standard images that have been made for and tested upon the computer models that they've standardized upon. Barring hardware failure, the result is a stable Windows environment with few software problems that aren't user-inflicted. In addition, rebuilding a system takes less than an hour: Gigabit Ethernet drops to the benches make backing up a system and restoring a clean image to it go very quickly. Rebuilds for purely remote users are a priority as well. They have access to their email and calendar via OWA, but not to any corporate systems that require VPN access, so getting their laptop repaired and back to them quickly is important: We try to get them repaired and sent out the day we receive them, and have been known to work Saturdays as well to get a system back to someone by the next Monday. We also maintain a hot spare pool: One laptop of every model that we support is on hand to overnight to someone whose laptop is broken. So, in all cases except where the hard drive is broken or the software on it borked, we can have a person up and running the next day. They then send us their computer, and we handle the warranty issues and return it to them.

      We also don't permit anyone (ourselves included), to run Windows as Administrator or equivalent except for purposes of installing software or patching. While the computers are joined to our domain, remote/traveling users also have a local user account that is Administrator-equivalent whose name is "[their domain login name].local". They are given the password to it (which is different than their domain password) and told not to use it except to install software or in emergencies (but if they get to that point, they're expected to call: We have a person whose main job is to support remote/traveling users, and she's very good - not only is she an intelligent person, she's a skilled technician and knows our systems inside and out).

      It sounds to me as though there are number of things going on: First, you're getting poor Windows installations. Secondly, there's probably a degree of PEBKAC going on as well. You say that you use Macs at home, so there's almost certainly more than a little resistance to using Windows stemming from attitude: "Macs are better and so I don't have to/won't learn how to use Windows". I've seen this more than once in our company: People that have Macs at home tend to be smug about them and pounce upon every problem (whether real or perceived) with their Windows computers at work. That's OK: After awhile you learn which people are your "problem children", and accommodate them as best you can.

      In any event, I am sorry for your difficulties, and hope that they are remedied soon.

    20. Re:Surprise? by kentrel · · Score: 3, Insightful

      Looks like he posted his opinion based on his experience, and you posted your opinion based on your experience. So you should quit making blanket statements based on nothing too.

      Neither of you posted statistics. Where are yours?

    21. Re:Surprise? by Zero__Kelvin · · Score: 2, Interesting

      "Vista is as reliable as Linux."

      I can definately attest to this fact! The family computer has dual boot with Vista (It shipped with the 64 bit machine, and is 32 bit of course) and Mandriva Linux 2009 x86_64. Vista has been used to view Oprah's website with it's proprietary garbage, but other than that is unused and unmolested. It is a stock install. No third party stuff has been added other than iTunes. I recently had to install iTunes to restore my ipod after trashing the filesystem, and I can tell you Vista was very reliable. I could rely on it to apply updates in the background without my knowledge and interrupt the install process to reboot. Whenever I wanted to take a coffee break I could set it off to start installing iTunes, go off and have my coffee, and rely on it either having failed with some obscure error or to still be busy with the task! I reliably had iTunes installed in just 1 hour and a few minutes!

      Compare and contrast that to Linux. I can't rely on it to still be busy doing a trivial task after a coffee break. I cannot rely on it to apply updates without my knowledge. I can't rely on the occaissonal opportunity to see my pretty splash boot screen when I am forced to reboot, since I never am. Worse still, I usually don't even get to see my pretty splash screen for OS level updates unless the kernel or glibc is updated! I'm damn lucky I have been using it on my Laptop for years even though it still isn't ready for the desktop yet, or I'd never get to see what this thing looks like when it boots!

      So yes, they are each very reliable in their own special way. Frankly, I learned how to drink coffee at the computer without spilling it quite some time ago, so I far prefer the kind of reliability Linux offers, but clearly your experience differs. I'm going to go out on a limb and say you don't use both Vista and Linux. You use Vista, tried Linux, had no idea what you were doing and "wrecked" your Linux installation. Conclusion: There is no difference between Vista and Linux! I can fsck 'em both up!

      If you don't want to take my word for it, take the word of every other competent systems level software engineer out there. You won't find anyone saying Vista is great unless they are making money off of it, yet Linux has dedicated developers working on it who make NO money for doing so. Ask yourself this question: If M$ Open Sourced the code tomorrow, do you think they would have any more skilled people working on it than they they do today? Absolutely not. Nobody competent and in their right mind would invest their time and effort in developing that garbage without getting paid for it. It really is that simple.

      --
      Guns don't kill people; Physics kills people! - John Lithgow as Dick Solomon on Third Rock From The Sun
    22. Re:Surprise? by Zero__Kelvin · · Score: 2, Interesting

      "Windows stopped being generally unstable years ago."

      Agreed. They have moved away from generalization to specialization now, and Vista is much more specific about how, when, and where it is unstable. Essentially, they pushed the crashes out of the kernel, and all the applications now act funny or crash instead of crashing the kernel.

      "People who will sit and tell me with a straight face that Vista, in their experience, is unstable are either very unlucky ..."

      Saying they are unlucky, when they are unfortunate enough to be stuck using Vista, is redundant.

      --
      Guns don't kill people; Physics kills people! - John Lithgow as Dick Solomon on Third Rock From The Sun
    23. Re:Surprise? by inasity_rules · · Score: 5, Insightful

      Dude! Take a chill pill. This is not FUD. The gp is just relating his experience, and here's a shock, YMMV! So just sit back and have another beer.

      BTW, I've also had major hassles with windows - mostly related to viruses. As it happens this forced me to switch 100% to linux and I'm happy here, but not everyone who switches is. Personally I like the bandwidth I save from not constantly downloading AV updates, and the speed increase from not running AV. But hey, where you are computing power and bandwidth are probably cheap. Again, YMMV.

      --
      I have determined that my sig is indeterminate.
    24. Re:Surprise? by perlchild · · Score: 3, Insightful

      And some of us actually expect an OS with a certification logo program to send lawyer letters to Marvell telling them to recall that driver. Sheesh, get with the program, badly written, certified drivers make Microsoft look bad, deservedly.

    25. Re:Surprise? by poetmatt · · Score: 2, Interesting

      What?

      Vista is not 100% stable, never has been, obviously never will be. Do you think it's magically immune to its own BSOD's? I run Vista 64bit myself, and it's "better than XP", but not stable. Apps still get random errors, etc.

      Windows is as stable as it will ever be; at least with Ubuntu you can have a month's uptime and be fine. Now if only Wine was 100% there for gaming (it's getting there).

    26. Re:Surprise? by mdwh2 · · Score: 2, Insightful

      Same experience I've had with Linux; meanwhile I can't remember the last time I saw Windows crash. Perhaps we shouldn't generalise from anecdotes.

    27. Re:Surprise? by bigstrat2003 · · Score: 2, Informative

      Nothing is 100% stable. That's an awfully high standard to reach. And I get uptimes of a month on my Vista machine too, so I fail to see how you're demonstrating a point of how Windows is so far behind.

      --
      "16MB (fuck off, MiB fascists)" - The Mighty Buzzard
    28. Re:Surprise? by mdwh2 · · Score: 2, Informative

      Again this is a culture issue - there's nothing stopping Windows applications from running from a single folder (and indeed, plenty of them do). Conversely, I don't see why one couldn't make a Linux or OS X application that installed some system files (they do have shared libraries, right?)

      And indeed, it's worth noting that Quicktime on Windows is as bad as an offender as any other application when it comes to installing background rubbish and insisting on running all the time, so Apple don't get off lightly here.

    29. Re:Surprise? by unfunk · · Score: 2, Funny

      "61 days, rebooting only for updates" still flies in the face of everybody that claims that Vista will crash on a weekly/daily/hourly basis. His point is still valid without you needing to wave your epenis.

    30. Re:Surprise? by adamjaskie · · Score: 4, Insightful

      Not only that, but PC manufacturers install all sorts of crapware. Just because you have a new PC doesn't mean it has a "clean install" of Windows on it.

      --
      /usr/games/fortune
    31. Re:Surprise? by Angus+McNitt · · Score: 2, Informative

      I think you hit on it there. Windows, it's self, has gotten better over the years. It's buggy, but I've never seen an OS that isn't. From my years of working on customer walk in and corporate contract machines, Windows "buggyness" usually comes from 3 vectors (in order of severity): flaky drivers, flaky software, or PEBKAC. Mac has less "crashes" only due to a controlled hardware pool. Start attaching lost of 3rd party hardware and see how your mileage goes. Linux has the advantage of having mostly open drivers, so you get geeks tinkering and putting back. But you still need those geeks to have the hardware and time to fix it. Windows does not have those advantages, because of the market they want to participate in. If this bugs you that much, use another OS. God knows there are plenty. No OS is perfect. I personally use 3 on a daily basis (XP, OS X, Ubuntu Linux). And yes, they all crash occasionally.

      --
      "To Do Is To Be" - Socrates, "To Be Is To Do" - Sartre, "Do Be Do Be Do" - Sinatra
    32. Re:Surprise? by Thornburg · · Score: 2, Insightful

      It's slower than XP in any case and requires more memory.

      Not true. It uses more memory than XP, but it doesn't require it. In exactly the same way that linux uses more memory than XP, but doesn't require it (it's used for system cache if you bother to check).

      Umm, yes, it is true, many benchmarks were done of XP SP3 vs Vista SP1, and XP SP3 is definately faster than Vista SP1, and it definitely _requires_ less memory. I can run an XP machine with 512MB of RAM, and it will be OK. Not great, but OK. Put Vista on the exact same machine (or even on a more modern, faster machine, but still with only 512MB of RAM), and it will be a total dog. Vista really needs a bare minimum of 1GB of RAM to be usable, whereas XP will run acceptably on 512MB... you could probably get away with 320MB if you don't run any memory-itense applications.

      If you actually install the 64bit version, you'll see where MS's development budget has been spent (The 32bit version of vista feels a bit like Win ME in comparison). In every test I've done, 64bit vista has crapped all over XP from quite a big height.

      The problem is I don't consider decent hardware to be something an IT'er would buy

      Dual core machine + 2Gb ram + integrated ATI/Nvidia/Intel X4500 GPU is more than adequate. These are pretty basic machine specs by anyone's standard, and tbh you'd be hard pressed to find a brand new machine for sale with lower specs than that.

      Yeah, that should be enough to run Vista, and many new machines are spec'd like that, but businesses need to use a uniform platform across all machines... so are they supposed to throw away all their old machines and buy new ones just so they can use Vista? No, they'll wait until they have replaced all their machines with ones that can run Vista through the same update schedule as they usually use, and then they'll use Windows 7, since it is supposed to be faster/leaner than Vista anyway.

      The worst machine I've installed vista on was an old 1.6Ghz Athalon XP. It was more than happy playing blu-ray disks, and didn't perform any worse than XP. (though I did add an ATI 3650 AGP card to help out with the blu-ray decoding). That's what, a 5year old machine and a £50 upgraded graphics card.... Not exactly high end spec I must say.

      Actually, for consumer hardware, an ATI 3650 is rather high-spec. The most common integrated graphics chips are about 5-10% as powerful as a 3650.

      All that said, I have nothing against Vista for home users, but in the business world, it just doesn't add up (unless you replace your hardware on a 2-year cycle).

    33. Re:Surprise? by unoengborg · · Score: 5, Insightful

      You are right and you are wrong. Yes, it's true that Vista, XP or even Windows 2k are rock solid, but only as long as you don't add third party hardware driveres of dubious quality. Unfortunately many hardware venders don't spend as much effort as they should to develop good drivers. Just using the drivers that comes with windows leaves you with a rather small set of supported hardware, so people install whatever drivers that comes with the hardware they buy, and as a result they get BSOD if they are unlucky, and then they blame Microsoft.

      --
      God is REAL! Unless explicitly declared INTEGER
    34. Re:Surprise? by Lumpy · · Score: 4, Interesting

      Vista can hose it's user profiles easily and they get the white scrteen loading bug that causes lots of problems and even networking to fail for that user.

      It's a profile problem that can be fixed easily by creating a new profile and deleting the old one, but that is way out of the ability of most users.

      This happens a LOT with home users. Out of the last 30 vista support calls I got 6 were this problem of corrupt user profiles.

      Honestly user profiles under Windows have sucked cince the 2000 days.

      --
      Do not look at laser with remaining good eye.
    35. Re:Surprise? by jonbryce · · Score: 2, Informative

      I get uptimes of 4-5 weeks on Vista. I have to reboot on the Wednesday after the second Tuesday every month for updates.

      I have an uptime of about 6 months on Ubuntu since the last time I rebooted to put an extra hard drive in. I don't have to reboot for updates.

    36. Re:Surprise? by travbrad · · Score: 2, Insightful

      I'm curious why you need your OS to be on non-stop for more than 60 days. Even for servers I don't think having a brief downtime every couple months would be a serious issue for the vast majority of users. Besides, who runs servers on Vista anyway? At least use Windows Server.

      P.S. I don't run Vista and I never will, but I just don't understand what you would need years of up-time for on a home PC. I power down my PC most nights (unless I'm downloading/uploading something), just because there's no sense wasting electricity.

    37. Re:Surprise? by stuartkahler · · Score: 4, Insightful

      Or they're running crappy hardware. Most people blame Windows when their hardware is constantly running on the edge of failure. They have a computer that works fine out of the box, but crashes when the PSU can't keep up with the fifth USB device plugged in. Maybe some heat sinks are clogged with dust.

      The OS running on the cheapest hardware with the most clueless user base has the highest failure rate? You don't say!

    38. Re:Surprise? by mcvos · · Score: 4, Funny

      I have never had a harddrive fail. Never. Not on a fresh computer and not on a decade old one.

      Can I hire you as admin for our raid-0 disk server?

    39. Re:Surprise? by Zero__Kelvin · · Score: 4, Insightful

      They use Linux, so the kernel is stable and updates aren't needed unless new features are implemented that the older kernel doesn't support. Since it is already a production machine that scenario never arises. If only you weren't an Anonymous Coward I'd go on (but kudos to at least being smart enough to hide).

      And for those who will go to the security well here, we call it a trade-off. For many systems uptime is more important. It generally isn't a very big risk to run an older Linux kernel though it is more risky than not updating. In a world of blind men, the one-eyed man is king. We can sacrifice a modicum of security, exchanging our plate mail for chain mail, and still feel confident because we are surrounded with weaponless peasants ;-0

      --
      Guns don't kill people; Physics kills people! - John Lithgow as Dick Solomon on Third Rock From The Sun
    40. Re:Surprise? by Zero__Kelvin · · Score: 4, Insightful

      "Well, if it takes your corporate IT staff that long to rebuild a computer, they're probably doing it by hand while putting out other fires, which is foolish."

      Or could it be that they have a queue full of machines waiting for reinstalls, etc? No. It couldn't be that, since we all know that the thousands of people saying they have had major problems are liars, and we have as evidence a few people who claim that they haven't had major problems, or don't know that they have problems ;-)

      --
      Guns don't kill people; Physics kills people! - John Lithgow as Dick Solomon on Third Rock From The Sun
    41. Re:Surprise? by j-turkey · · Score: 2, Insightful

      Holy Shit! 61 Days! And you had to reboot for updates, so who could complain about that??!!! Oh I don't know, how about the Linux users who use an OS that has uptimes measured in years. What's that you say? How incompetent are they if they don't update for years? You see that is the thing. They did apply updates daily ! Linux uses some kind of voodoo magic to allow updates without downtime! (it's scary to think about, I know). Now go back to your stickball game kid. The adults have some real computing to do. You are the epitome of the idea that people who use Windows simply don't know any better.

      So; you ran daily updates on your system and had uptime measured in years? How did you manage to patch/update your kernel? Did you apply those patches/updates without rebooting? How?

      From your post, you sound like the epitome of an arrogant Linux user who throws half-truths around while looking down your nose at everyone who isn't just like you. When you use that tone, do you expect people to actually listen to you or are you just trolling for an argument? Sheesh.

      --

      -Turkey

    42. Re:Surprise? by Simetrical · · Score: 5, Insightful

      I worked for a company that bought a laptop of every brand, so that when the higher-ups went into meetings with Dell, HP, Apple, etc. they had laptops that weren't made by a competitor. They have had problems like laptops not starting-up the first time due to incompatible software. That was a recent as 6 months ago. My mother-in-law bought a machine that has plenty of Vista-related problems (audio cutting out, USB devices not working, random crashes in explorer) on new mid-range hardware that came with Vista. But I have a neighbor who found it fixed lots of problems with gaming under XP.

      On the other hand, my Linux server freezes up and needs to be reset (sometimes even reboot -f doesn't work) every few days due to a kernel bug, probably some unfortunate interaction with the hardware or BIOS. (I'm using no third-party drivers, only stock Ubuntu 8.04.) And hey, in the ext4 discussions that popped up recently, it emerged that some people had their Linux box freeze every time they quit their game of World of Goo. Just yesterday I had to kill X via SSH on my desktop because the GUI became totally unresponsive, and even the magic SysRq keys didn't seem to work. Computers screw up sometimes.

      What's definitely true is that Windows 9x was drastically less stable any Unix. Nobody could use it and claim otherwise with a straight face. Blue screens were a regular experience for everyone, and even Bill Gates once blue-screened Windows during a freaking tech demo.

      This is just not true of NT. I don't know if it's quite as stable as Linux, but reasonably stable, sure. Nowhere near the hell of 9x. I used XP for several years and now Linux for about two years, and in my experience, they're comparable in stability. The only unexpected reboots I had on a regular basis in XP was Windows Update forcing a reboot without permission. Of course there were some random screwups, as with Linux. And of course some configurations showed particularly nasty behavior, as with Linux (see above). But they weren't common.

      Of course, you're right that none of us have statistics on any of this, but we all have a pretty decent amount of personal experience. Add together enough personal experience and you get something approaching reality, with any luck.

      --
      MediaWiki developer, Total War Center sysadmin
    43. Re:Surprise? by David+Chappell · · Score: 2, Insightful

      Looks like he posted his opinion based on his experience, and you posted your opinion based on your experience. So you should quit making blanket statements based on nothing too.

      Neither of you posted statistics. Where are yours?

      I think you misunderstood his posting. He is saying, "Look, if I use your loose methods of argument, I can 'prove' the opposite! Watch!"

    44. Re:Surprise? by ebh · · Score: 3, Insightful

      Even for servers I don't think having a brief downtime every couple months would be a serious issue for the vast majority of users.

      My servers run non-stop because my users are non-stop.

      No, they're not chained to their desks 24/7, but the user base is large enough that at any given moment, it's crunch time for *someone*.

      In order to find those rare times when bringing a server down won't cause someone to miss a critical deadline, downtime has to be planned months in advance. To do that I have to be able to rely on my systems being stable for months or years on end.

      As the other poster said, rebooting just to interrupt the degradation process is not an option for me. If a system degrades on its own, I have to find out why and fix it.

      Keep in mind, though, that this is an OS-independent issue. Some of the applications I support only run on certain OSes, so I have to build stable Linux systems, stable Windows systems, etc. Nobody cares but me that that's harder to do with some OSes than with others.

    45. Re:Surprise? by Zero__Kelvin · · Score: 2, Insightful

      You missed the point. This is a discussion about an operating system, not a gaming platform. You don't use Vista as a computer , you use it as a gaming platform. Your posting in the wrong story. Look for the Mario Brothers logo, or whatever it is on Slashdot these days that indicates games, and your posts might matter.

      --
      Guns don't kill people; Physics kills people! - John Lithgow as Dick Solomon on Third Rock From The Sun
  2. Memtest not perfect. by Galactic+Dominator · · Score: 5, Informative

    My experience with memtest is you can trust the results if it says the memory is bad, however if the memory passed it could still be bad. Troubleshooting your scenario should involve replacing the DIMM's in questions with known good modules while running Windows.

    --
    brandelf -t FreeBSD /brain
    1. Re:Memtest not perfect. by 0100010001010011 · · Score: 5, Funny

      I bet Windows will love you replacing the DIMM's while running.

    2. Re:Memtest not perfect. by mrmeval · · Score: 3, Informative

      I've yet to see memtest86 find an error even though replacing the ram fixed the problem. This has been on several builds.

      --
      I'd go on a Vegan diet but the delivery time from Vega is too long. --brownkitty
    3. Re:Memtest not perfect. by Anonymous Coward · · Score: 5, Interesting

      Another nice tool is prime95. I've used it when doing memory overclocking and it seemed to find the threshold fairly quickly. Of course your comment still stands - even if a software tool says the memory is good, it might not necessarily be true.

    4. Re:Memtest not perfect. by Anthony_Cargile · · Score: 4, Funny

      I bet Windows will love you replacing the DIMM's while running.

      Yeah wait until it starts to sleep first, or even better if you catch it while hibernating

    5. Re:Memtest not perfect. by Antidamage · · Score: 5, Funny

      I've often had it pick up bad ram, usually within the first five minutes. One time, the memory in question had been through a number of unprotected power surges. The motherboard and power supply were dead too.

      You can reliably replicate my results by removing the ram, snapping it in half and putting it back in. No need to wait for a power surge to see memtest86 shine.

    6. Re:Memtest not perfect. by Hal_Porter · · Score: 5, Funny

      Memtestx86 is bögus. My machine alwayS generated errors when I run the test but it works fOne otherwise ÿ

      --
      echo -e 'global _start\n _start:\n mov eax, 2\n int 80h\n jmp _start' > a.asm; nasm a.asm -f elf; ld a.o -o a;
    7. Re:Memtest not perfect. by NerveGas · · Score: 4, Informative

      +1. I once had a pair of DIMMs which would intermittently throw errors in whichever machine they were placed, but Memtest would never detect anything wrong with them - even if used for weeks.

      I called Micron, and they said "Yes, we do see sticks that go bad and Memtest won't detect it." They replaced them for free, the problem went away, and I was happy.

      --
      Oh, you're not stuck, you're just unable to let go of the onion rings.
    8. Re:Memtest not perfect. by Idaho · · Score: 5, Interesting

      My experience with memtest is you can trust the results if it says the memory is bad, however if the memory passed it could still be bad.

      I wonder how strongly RAM stability depends on power fluctuations. While you're testing memory using Memtest, the GPU is not used at all, for example. When playing a game and/or running some heavy compile-jobs, on the other hand, overall power usage will be much higher. I wonder if this may reflect on RAM stability, especially if the power supply is not really up to par?

      If so, you might never find out about such a problem by using (only) memtest.

      --
      Every expression is true, for a given value of 'true'
    9. Re:Memtest not perfect. by Znork · · Score: 4, Informative

      A lot. When AM2 boards were new I went through a bunch of bad RAM (memory manufacturers hadn't quite gotten their act together yet) and RAM voltage would significantly change the number of bits that were 'bad'. 1.9 V and there were a few bits bad, 1.85, some more, 1.8 and memtest would light up all over.

      So certainly, if any component is subpar, even a slight power fluctuation could trigger a borderline bad bit.

    10. Re:Memtest not perfect. by Sen.NullProcPntr · · Score: 4, Interesting

      While you're testing memory using Memtest, the GPU is not used at all, for example. When playing a game and/or running some heavy compile-jobs, on the other hand, overall power usage will be much higher.

      I think memtest is a good first level test - it will pinpoint gross errors in memory. But probably won't detect more subtle problems. For me the best extended test is to enable all the opengl screen savers and let the system run overnight cycling through each of them. If the system doesn't crash with this it will probably be solid under a normal load. For me this has been the best test of overall system stability. Unfortunately if it fails won't know exactly what is wrong.

    11. Re:Memtest not perfect. by blackest_k · · Score: 4, Informative

      i've seen memtest find an error and yes the ram was bad.

      There is a bit of a difference between ram use on linux and windows desktops, Linux tends to require less ram than a windows system to run, windows is far more likely to use all your ram and page out. In day to day use rarely do my linux systems need to use the swap file. If some of your ram is faulty and never gets used then you will not see crashes. I'm sure most of us have juggled ram about finding swapping slots cures the problem although reseating ram can fix problems anyway. If memtest is showing problems then the ram has problems bare in mind that some tests performed can pass with later tests failing.
      memtest is to prove ram to be bad, not good. At higher temperatures than the testing was performed at the ram may become unreliable. It might be the case that the ram is ok in some systems but not in others, I've seen that too.

    12. Re:Memtest not perfect. by machine321 · · Score: 3, Funny

      That's impressive. Most memory tester software so I've tried requires a working power supply and motherboard.

    13. Re:Memtest not perfect. by rant64 · · Score: 2, Interesting

      Which will work, as a matter of fact, given the proper hardware: http://www.microsoft.com/whdc/system/pnppwr/hotadd/hotaddmem.mspx

    14. Re:Memtest not perfect. by Joce640k · · Score: 4, Interesting

      I've had a lot more success with Microsoft's RAM tester, free download here: http://oca.microsoft.com/en/windiag.asp

      See, good things do come out of Redmond!

      --
      No sig today...
    15. Re:Memtest not perfect. by mysticgoat · · Score: 2, Interesting

      I run memtest86 overnight (12+ hrs) as a routine part of the initial evaluation of a sick machine. Occasionally it finds errors after several hours that were not present on a single pass test. The last instance was a few months ago: a single stuck bit in one of the progressive pattern memory tests that only showed up after 4+ hours of repetitive testing. Replacing that mem module cured WinXP of a lot of weird flakey behavior involving IEv7 and Word.

      The overnight memtest86 runs have only kicked out errors that were not found on single pass testing maybe 3 or 4 times in the last 10 years. But it happens.

    16. Re:Memtest not perfect. by MikeBabcock · · Score: 2, Interesting

      Actually its worth noting that several motherboards on the market automatically over-clock the timings on the board under high load situations to improve performance. These same situations may not happen while simply running memtest86[+].

      I've often thought that throwing in a copy of Folding@Home or Distributed.NET running in the background would be fun while memory testing, to juice the CPU and test the system under a heavier load.

      Unfortunately, isolating the memory to run said software and relocating it periodically could be a pain.

      --
      - Michael T. Babcock (Yes, I blog)
  3. tinfoil is the answer by Anonymous Coward · · Score: 5, Funny

    wrap your _whole_ computer in tinfoil to deflect those pesky cosmic rays. it also works to keep them out of your head too.

    1. Re:tinfoil is the answer by platypussrex · · Score: 4, Funny

      I have an even better idea. You know how water cooling makes your computer run better? Well my theory is that water cooling would work the same way for the OP. He needs to get a large tank, fill it with ice water, and be sure to keep his head fully submerged while doing all his computer work. I'm sure he'll be amazed at his increased productivity.

    2. Re:tinfoil is the answer by JWSmythe · · Score: 3, Funny

      ... or so they've made you believe.

          The tin foil hat works. We can't read your mind. Feel safe wearing the tin foil hat. You've protected yourself against our evil plot to control your mind. :)

      --
      Serious? Seriousness is well above my pay grade.
    3. Re:tinfoil is the answer by Anonymous Coward · · Score: 2, Informative

      That's absolutely true. As Samuel Johnson remarked, "Depend upon it, sir, when a man knows he is to be drowned in ice water, it concentrates his mind wonderfully." Of course, Boswell made a few errors in transcription.

    4. Re:tinfoil is the answer by Waccoon · · Score: 2, Funny

      Meanwhile, aren't some people wrapping their WiFi antennas with tin foil to boost reception?

    5. Re:tinfoil is the answer by flyingfsck · · Score: 2, Funny

      That will only work if you use heavy water - Deuterium. ;

      --
      Excuse me, but please get off my Pennisetum Clandestinum, eh!
  4. Error response by greg1104 · · Score: 4, Informative

    If a system gives memtest86 errors, I break it down and swap components until it doesn't. The test pattern it uses can find subtle errors you're unlikely to run into with any application-based testing even when run for a few days. Any failures it reports should be taken seriously. Also: you should pay a attention to the memory speed value it reports, that's a surprisingly effective simple benchmark for figuring out if you've setup your RAM optimally. The last system I built, I ended up purchasing 4 different sets of RAM, and there was about a 30% delta between how well the best and worst performed on the memtest86 results--correlated extremely well with other benchmarks I ran too.

    At the same time, I've had memory that memtest86 said was fine, but the system itself still crashed under a heavy Linux-based test. I consider both a full memtest86 test and a moderate workload Linux test to be necessary before I consider a new system to have baseline usable reliability.

    There are a few separate problems here that are worthwhile to distinguish among. A significant amount of RAM doesn't work reliably when tested fully. Once you've culled those out, only using the good stuff, some of that will degrade over time to where it will no longer pass a repeat of the initial tests; I recently had a perfectly good set of RAM degrade to useless in only 3 months here. After you take out those two problematic sources for bad RAM, is the remainder likely enough to have problems that it's worth upgrading to ECC RAM? I don't think it is for my home systems, because I'm OK with initial and periodic culling to kick out borderline modules. And things like power reliability cause me more downtime than RAM issues do. If you don't know how or have the time to do that sort of thing yourself though, you could easily be better off buying more redundant RAM.

    1. Re:Error response by gabebear · · Score: 2, Interesting

      Anyone else have RAM modules degrade over time? I've never seen this.

      I always buy faster modules than I'm actually using. I usually test the system with memtest at a higher frequency than what it's going to run. My last build overclocked to [2.7Ghz CPU, 1066 FSB, 1066/CL7 DDR3] with memtest still reporting no errors; I run it at [2.1Ghz, 800, 800/CL7](a.k.a stock speed).

    2. Re:Error response by nmos · · Score: 2, Informative

      Anyone else have RAM modules degrade over time? I've never seen this.

      I've seen a few known good modules fail later on but it's pretty rare. I'd say I've seen fewer than 5 in 15 years. Most times when a previously good module suddenly appears bad there's something else going on such as a failing power supply etc.

    3. Re:Error response by Anthony_Cargile · · Score: 2, Funny

      Anyone else have RAM modules degrade over time? I've never seen this.

      I don't know if this is from degraded RAM, or rats pissing on the motherboard, but an olde IBM PC running DOS (upgraded to 3?) started having little blips on-screen and other strange characters appear in the output of programs and the shell itself, and in addition to this it would randomly lock up occasionally displaying a stack error.

      I know the floppy is alright, because it boots fine without any of these symptoms occurring from other machines it boots from. The video cardish component appears fine to the naked eye, but does not explain the random stack errors and unexplained lockups. I've always wondered what the hell was wrong with this thing (and a certain someone won't stop nagging me to throw it away, already) but it could very well be degraded RAM. Can't boot up memtest86 because (CPU i386) but the symptoms seem to all point to bad RAM.

    4. Re:Error response by cowbutt · · Score: 2, Insightful

      Anyone know, why PC133 memory would have an issue on a bus overclocked from 100MHz to 133? It should be able to handle it just fine, so I'd like to think :-/

      It's probably not the RAM as such; the 440BX on the P2B is only officially rated for 100MHz. Overclocking the chipset can have any number of side-effects.

  5. Answers by jawtheshark · · Score: 5, Interesting

    1) Yes

    2) No

    Now to be serious. Home PC do not come yet with 6GB or 8GB. Most new home PC still seem to have between 1GB and 4GB. Where the 4GB variety is rare because of the fact that most home PCs still come with a 32-bit operating system. 3GB seems to be the sweet spot for higher-end-home-pcs. Your home PC will most likely not have 16GB next year. Your workstation at work, perhaps, but then even perhaps.

    At the risk of sounding like "640KByte is enough for everyone", I have to ask why you think why you need 16GB to check your email next year. I'm typing this on a 6 year old computer, I'm running quite a few applications at the same time and I know a second user is logged in. Current memory usage: 764Meg RAM. As a general rule, I know that Windows XP runs fine on 512Meg RAM and is comfortable with 1GB RAM. The same is true for GNU/Linux running Gnome.

    Now, at work with Eclipse loaded, a couple of application servers, a database and a few VMs... Yeah, there indeed you get memory starved quickly. You have to keep in mind that such usage pattern is not that of a typical office worker. I can imagine that a heavy Photoshop user would want every bit of RAM he can get too. The Word-wielding-office-worker? I don't think so.

    Now, I can't speak for Vista. I heard it runs well on 2GB systems, but I can't say. I got a new work laptop last week and booted briefly in Vista. It felt extremely sluggish and my machine does have 4Gig RAM. Anyway, I didn't bother and put Debian Lenny/amd64 on it and didn't look back.

    I my idea, you have quite a twisted sense of reality regarding to the computers people actually use.

    Oh, and frankly... If cosmic rays would be a big issue by now with huge memories, don't you think that more people would be complaining? I can't say why Ubuntu/amd64 ran fine on your machine. Perhaps GNU/Linux has built-in error correction and marks bad RAM as "bad".

    --
    Ahhh...the great dumpster continuum. Many a free computer will be found there. -- sowth (748135)
    1. Re:Answers by bdsesq · · Score: 4, Informative

      ... 3GB seems to be the sweet spot for higher-end-home-pcs.

      3GB is not so much a "sweet spot" as it is a limitation based on a 32 bit OS.
      You can address 4GB max using 32 bits. Now take out the address space needed for your video card and any other cards you may put on the bus and you are looking at a 3GB max for useable memory.
      So instead of "sweet spot" you really mean "maximum that can be used by Windows XP 32 Bit (the most commonly used OS today).

    2. Re:Answers by megabeck42 · · Score: 5, Informative

      Just FYI, 32bit Intel processors from the Pentium Pro generation and forward (with the exception of most, if not all of the Pentium-M's) have 36 physical address pins or more?

      Many, but not all, chipsets have a facility for breaking the physical address presentation of the system RAM into a configurably-sized contiguous block below the 4GB limit and then making the rest available above the 4GB limit. If you're curious, the register (in intel parlance) is often called TOLUD (Top of Low Useable DRAM).

      Yes, furthermore, given modern OS designs on x86 architecture, a process cannot utilize more than 2gb (windows without /3gb boot option) or 3gb (linux, most BSDs, windows with /3gb and apps specially built to use the 3/1 instead of 2/2 split.)

      However, that limitation does not preclude you from having a machine running eight processes using 2GB of physical memory each.

      The processor feature is called PAE (Physical Address Extension). It works, basically, by adding an extra level of processor pagetable indirection.

      Incidentally, I have a quad P3-700 (It's a Dell PowerEdge 6450) propping a door open that could support 8GB of RAM if you had enough registered, ECC PC-133 SDRAM to populate the sixteen dimm slots.

      Anyways, here's a snippet from the beginning of a 32 bit machine running Linux which has 4GB of RAM:
      [ 0.000000] BIOS-provided physical RAM map:
      [ 0.000000] BIOS-e820: 0000000000000000 - 0000000000097c00 (usable)
      [ 0.000000] BIOS-e820: 0000000000097c00 - 00000000000a0000 (reserved)
      [ 0.000000] BIOS-e820: 00000000000e8000 - 0000000000100000 (reserved)
      [ 0.000000] BIOS-e820: 0000000000100000 - 00000000defafe00 (usable)
      [ 0.000000] BIOS-e820: 00000000defb1e00 - 00000000defb1ea0 (ACPI NVS)
      [ 0.000000] BIOS-e820: 00000000defb1ea0 - 00000000e0000000 (reserved)
      [ 0.000000] BIOS-e820: 00000000f4000000 - 00000000f8000000 (reserved)
      [ 0.000000] BIOS-e820: 00000000fec00000 - 00000000fed40000 (reserved)
      [ 0.000000] BIOS-e820: 00000000fed45000 - 0000000100000000 (reserved)
      [ 0.000000] BIOS-e820: 0000000100000000 - 000000011c000000 (usable)

      The title of that list should really be "Physical Address Space map." Either way, notice that the majority of the RAM is available up until 0xDEFAFE00 and the rest is available from 0x100000000 to 0x11c000000 - a range that's clearly above the 4GB limit.

      Yes, it's running a bigmem kernel... But that's what bigmem kernels are for.

      Oh, incidentally, even windows 2000 supported PAE. The bigger problem is the chipset. Not all of them support remapping a portion of RAM above 4GB.

      --
      fnord.
    3. Re:Answers by alexandre_ganso · · Score: 2, Informative

      3GB is not so much a "sweet spot" as it is a limitation based on a 32 bit OS.
      You can address 4GB max using 32 bits.

      I beg your pardon, but this is limited by your version of windows, not by hardware nor even by 32bit systems.

      http://www.microsoft.com/whdc/system/platform/server/PAE/PAEdrv.mspx

  6. Cost benefit analysis by Logic+Worshipper · · Score: 2, Insightful

    Is ECC memory worth the money in a machine you use to check your E-mail? Can't you just reboot and/or replace the memory if errors occur?

    I could see it happening when the cost of ECC memory is no higher than normal memory, and using ECC memory has no or minimal impact on performance, until then, I won't expect to start seeing it desktop machines.

    If you want ECC memory on your desktop, feel free to build your own machine with a motherboard that supports ECC memory. Some high end desktops do support ECC memory already.

    1. Re:Cost benefit analysis by a09bdb811a · · Score: 2, Interesting

      Is ECC memory worth the money in a machine you use to check your E-mail?

      Unbuffered ECC is only a few $ more than unbuffered non-ECC. It's only 9 chips per side instead of 8, after all. The performance impact is marginal.

      I see no reason not to use ECC except that Intel doesn't want you to. It seems they want to keep ECC as a 'server' feature (as if your desktop at home isn't 'serving' you your data). So all their consumer chipsets don't support it, and the i7's memory controller doesn't either. AMD doesn't play that game with their chips, but it seems only ASUS actually implements the ECC support on most of their boards.

  7. The truth by mcrbids · · Score: 4, Insightful

    My first computer was a 80286 with 1 MB of RAM. That RAM was all parity memory. Cheaper than ECC, but still good enough to positively identify a genuine bit flip with great accuracy. My 80386SX had parity RAM, so did my 486DX4 120. I ran a computer shop for some years, so I went through at least a dozen machines ranging from the 386 era through the Pentium II era, at which point I sold the shop and settled on a AMDK62 450. And right about the time that the Pentium was giving way to the Pentium II, non-parity memory started to take hold.

    What protection did parity memory provide, anyway? Not much, really. It would detect with 99.99...? % accuracy when a memory bit had flipped, but provided no answer as to which one. The result was that if parity failed, you'd see a generic "MEMORY FAILURE" message and the system would instantly lock up.

    I saw this message perhaps three times - it didn't really help much. I had other problems, but when I've had problems with memory, it's usually been due to mismatched sticks, or sticks that are strangely incompatible with a specific motherboard, etc. none of which caused a parity error. So, if it matters, spend the money and get ECC RAM to eliminate the small risk of parity error. If it doesn't, don't bother, at least not now.

    Note: having more memory increases your error rate assuming a constant rate of error (per megabyte) in the memory. However, if the error rate drops as technology advances, adding more memory does not necessarily result in a higher system error rate. And based on what I've seen, this most definitely seems to be the case.

    Remember this blog article about the end of RAID 5 in 2009? Come on... are you really going to think that Western Digital is going to be OK with near 100% failure of their drives in a RAID 5 array? They'll do whatever it takes to keep it working because they have to - if the error rate became anywhere near that high, their good name would be trashed because some other company (Seagate, Hitachi, etc) would do the research and pwn3rz the marketplace.

    --
    I have no problem with your religion until you decide it's reason to deprive others of the truth.
    1. Re:The truth by Mr+Z · · Score: 5, Interesting

      Note: having more memory increases your error rate assuming a constant rate of error (per megabyte) in the memory. However, if the error rate drops as technology advances, adding more memory does not necessarily result in a higher system error rate. And based on what I've seen, this most definitely seems to be the case.

      Actually, error rates per bit are increasing, because bits are getting smaller and fewer electrons are holding the value for your bit. An alpha particle whizzing through your RAM will take out several bits if it hits the memory array at the right angle. Previously, the bits were so large that there was a good chance the bit wouldn't flip. Now they're small enough that multiple bits might flip.

      This is why I run my systems with ECC memory and background scrubbing enabled. Scrubbing is where the system actively picks up lines and proactively fixes bit-flips as a background activity. I've actually had a bitflip translate into persistent corruption on the hard drive. I don't want that again.

      FWIW, I work in the embedded space architecting chips with large amounts of on-chip RAM. These chips go into various infrastructure pieces, such as cell phone towers. These days we can't sell such a part without ECC, and customers are always wanting more. We actually characterize our chip's RAM's bit-flip behavior by actively trying to cause bit-flips in a radiation-filled environment. Serious business.

      Now, other errors that parity/ECC used to catch, such as signal integrity issues from mismatched components or devices pushed beyond their margins... Yeah, I can see improved technology helping that.

    2. Re:The truth by tagno25 · · Score: 2, Informative

      An alpha particle whizzing through your RAM will take out several bits if it hits the memory array at the right angle.

      Did you figure out how alpha particles cannot travel through paper but can travel through RAM?

    3. Re:The truth by X0563511 · · Score: 4, Insightful

      Yes. A higher energy particle hits something in the RAM, and alpha/beta particles scatter from the impact point... which is inside the memory cell.

      That's why higher energy radiation is dangerous. It doesn't cause the damage itself, the products of the collision do. Radiation shrapnel, if you will.

      --
      For large sets, this will be our guide even unto death, for the LORD will work for each type of data it is applied to...
  8. RAID(?) for RAM by Xyde · · Score: 5, Interesting

    With memory becoming so plentiful these days (I haven't seen many home PC's with 6 or 8GB granted, but we're getting there) it seems that a single error on a large capacity chip is getting more and more trivial. Isn't it a waste to throw away a whole DIMM? Why isn't it possible to "remap" this known-bad address, or allocate some amount of RAM for parity the way software like PAR2 works? Hard drive manufacturers already remap bad blocks on new drives. Also it seems to me that, being a solid state device, small failures in RAM aren't necessarily indicative of a failing component like bad sectors on a hard drive are. Am I missing something really obvious here or is it really just easier/cheaper to throw it away?

    1. Re:RAID(?) for RAM by Rufus211 · · Score: 2, Interesting

      You just described ECC scrubbing and Chipkill. The technology's been around for a while, but it costs >$0 to implement so most people don't bother. As with most RAS features most people don't know anything about it, so would rather pay $50 less than have a strange feature that could end up saving them hours of downtime. At the same time if you actually know what these features are and you need them, you're probably going to be willing to shell out the money to pay for them.

  9. Joking aside... by BabaChazz · · Score: 5, Informative

    First, it was not cosmic rays; memory was tested in a lead vault and showed the same error rate. Turns out to have been alpha particles emitted by the epoxy / ceramic that the memory chips were encapsulated in.

    That said: Quite clearly given your experience, Vista and Ubuntu load the memory subsystem quite differently. It is possible that Vista, with its all-over-the-map program flow, is missing cache a lot more often and so is hitting DRAM harder; I don't have the background to really know. I believe that Memtest86, in order to put the most strain on memory and thus test it in the most pessimal conditions, tries to access memory in patterns that equally hit physical memory hardest. But, what I have found is that some OSs, apparently including Ubuntu, will run on memory that is marginal, memory that Memtest86 picks up as bad.

    As for ECC in memory... The problem is that ECC carries a heavy performance hit on write. If you only want to write 1 byte, you still have to read in the whole QWord, change the byte, and write it back to get the ECC to recalculate correctly. It is because of that performance hit that ECC was deprecated. The problem goes away to a large extent if your cache is write-back rather than write-through; though there will be still a significant number of cases where you have to write a set of bytes that has not yet been read into cache and does not comprise a whole ECC word.

    That said, it is still used on servers...

    But I don't expect it will reappear on desktops any time soon. Apparently they have managed to control the alpha radiation to a great extent, and so the actual radiation-caused errors are now occurring at a much lower rate, significantly lower than software-induced BSODs.

    1. Re:Joking aside... by Ron+Bennett · · Score: 2, Informative

      It is possible that Vista, with its all-over-the-map program flow, is missing cache a lot more often and so is hitting DRAM harder...

      Perhaps that's another "feature" of Windows - no need for Memtest86 ... just leave Windows running for a few days with some applications running ... and if nothing crashes, the RAM is probably good.

    2. Re:Joking aside... by bertok · · Score: 5, Insightful

      As for ECC in memory... The problem is that ECC carries a heavy performance hit on write. If you only want to write 1 byte, you still have to read in the whole QWord, change the byte, and write it back to get the ECC to recalculate correctly. It is because of that performance hit that ECC was deprecated. The problem goes away to a large extent if your cache is write-back rather than write-through; though there will be still a significant number of cases where you have to write a set of bytes that has not yet been read into cache and does not comprise a whole ECC word.

      AFAIK, on modern computer systems all memory is always written in chunks larger than a byte. I seriously doubt there's any system out there that can perform single-bit writes either in the instruction set, or physically down the bus. ECC is most certainly not "depreciated" -- all standard server memory is always ECC, I've certainly never seen anything else in practice from any major vendor.

      The real issue is that ECC costs a little bit more than standard memory, including additional traces and logic in the motherboard and memory controller. The differential cost of the memory is some fixed percentage (it needs extra storage for the check bits), but the additional cost in the motherboard is some tiny fixed $ amount. Apparently for most desktop motherboard and memory controllers that few $ extra is far too much, so consumers don't really have a choice. Even if you want to pay the premium for ECC memory, you can't plug it into your desktop, because virtually none of them support it. This results in a situation where the "next step up" is a server class sytem, which is usually at least 2x the cost of the equivalent speed desktop part for reasons unrelated to the memory controller. Also, because no desktop manufacturers are buying ECC memory in bulk, it's a "rare" part, so instead of, say, 20% more expensive, it's 150% more expensive.

      I've asked around for ECC motherboards before, and the answer I got was: "ECC memory is too expensive for end-users, it's an 'enterprise' part, that's why we don't support it." - Of course, it's an expensive 'enterprise' part BECAUSE the desktop manufacturers don't support it. If they did, it'd be only 20% more expensive. This is the kind of circular marketing logic that makes my brain hurt.

    3. Re:Joking aside... by philipgar · · Score: 2, Insightful

      Just because the ISA supports reading and writing memory in one byte increments doesn't mean the chip physically reads or writes a single byte to the DRAMs. Data is generally read and written to the caches, which will later get written back to memory. Most caches don't bother to contain the state needed to know which bytes in a cache line were modified. Additionally, most memories themselves aren't really good at single bye reads/writes. They operate much faster when reading and writing larger data chunks at a time.

      Phil

    4. Re:Joking aside... by bosef1 · · Score: 2, Funny

      I remember when my Dad and I picked up our first bits at the store. We had a hatchback, so it was pretty easy to just slide them in sideways, but then we had to bungie-cord everything down so they wouldn't shift too much on the ride home. Darn kids today with their multi-gigabyte memory chips.

  10. Depends by gweihir · · Score: 5, Interesting

    My experience with a server that recorded about 15TB of data is something like 6 bit-errors per year that could not be traced to any source. This was a server with ECC RAM, so the problem likely occured in busses, network cards, and the like, not in RAM.

    For non-ECC memory, I would strongly syggest running memtest86+ at least a day before using the system and if it gives you errors, replace the memory. I had one very persistend bit-error in a PC in a cluster, that actually reqired 2 days of memtest86+ to show up once, but did occure about once per hour for some computations. I also had one other bit-error that memtest86+ did not find, but the Linux commandline memory tester found after about 12 hours.

    The problem here is that different testing/usage patterns result in different occurence probability for weak bits, i.e. bits that only sometimes fail. Any failure in memtest86+ or any other RAM tester indicates a serious problem. The absence of errors in a RAM test does not indicate the memory is necessarily fine.

    That said, I do not believe memory errors have become more common on a per computer basis. RAM has become larger, but also more reliable. Of course, people participating in the stupidity called "overclocking" will see a lot more memory errors and other errors as well. But a well-designed system with quality hardware and a thourough initial test should typically not have memory issues.

    However there is "quality" hardware, that gets it wrong. My ASUS board sets the timing for 2 and 4 memory modules to the values for 1 module. This resulted in stable 1 and 2 module operation, but got flaky for 4 modules. Finally I moved to ECC memory before I figuerd out that I had to manually set the correct timings. (No BIOS upgrade available that fixed this...) This board has a "professional" in its name, but apparently, "professional" does not include use of generic (Kingston, no less) memory modules. Other people have memory issues with this board as well that they could not fix this way, seems that somethimes a design just is bad or even reputed manufacturers do not spend a lot of effort to fix issues in some cases.In can only advise you to do a thourough forum-search before buying a specific mainboard.

     

    --
    Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
  11. If it was really a cosmic ray by circletimessquare · · Score: 4, Funny

    Then it would proba%ly alter not just one byte, b%t a chain of them. The cha%n of modified bytes would be stru%g out, in a regular patter%. Now if only there were so%e way to read memory in%a chain of bytes, as if it w%re a string, to visu%lize the cosmic ray mod%fication. hmmm...

    --
    intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
  12. Settings matter too by Max+Romantschuk · · Score: 5, Informative

    Not all memory is created equal. Memory can be bad if Memtest detects errors, or you can simply be running it at the wrong settings. Usually there are both "normal" and "performance" settings for memory on higher end motherboards, or sometimes you can tweak all sorts of cycle-level stuff manually (CAS latency etc.).

    Try running your memory with the most conservative settings before you assume it's bad.

    --
    .: Max Romantschuk :: http://max.romantschuk.fi/
  13. Workaround bad memory howto (linux only) by gQuigs · · Score: 4, Informative

    Depending on where it fails (if it fails in a the same spot) you can relatively easily work around it and not throw out the remaining good portion of the stick. I wrote a howto..

    http://gquigs.blogspot.com/2009/01/bad-memory-howto.html

    I've been running on Option 3 for quite some time now. No, it's not as good as ECC, but it doesn't cost you anything.

    1. Re:Workaround bad memory howto (linux only) by m_pll · · Score: 4, Informative

      On vista you can do the same thing using bcdedit:

      bcdedit /set badmemorylist 0x12345 0x23456

      Parameters are page frame numbers.

  14. Re:(Sensible) People do use ECC RAM by Mr+Z · · Score: 2, Informative

    With today's wide buses, parity RAM is ECC RAM. It's worth paying the extra couple dollars.

    Several years back I experienced disk corruption that seemed to be due to a bitflip that had happened in RAM and got committed to disk. That machine didn't have ECC RAM. I went to ECC for everything after that. That was back in the 128MB days, and no I don't overclock.

    (Well, not aggressively. My machine is overclocked by about 1%.)

  15. Trust Memtest86 by nmos · · Score: 4, Informative

    1) Do people trust a memtest86 error to mean a bad memory module or motherboard or CPU?

    Well, I'd add some other possibilities such as:

    Bad power supply,
    Memory isn't seated properly in it's socket.
    Incorrect timing set in bios.
    Memory is incompatable with your motherboard.
    etc..

    But yeah, if memtest86 says there's a problem then there really is something wrong.

  16. Best practice by swehack · · Score: 2, Informative

    is to swap the memory modules to find out which is causing the problem, if not motherboard. Also i don't see how memory tests running inside an OS can be effective, i'd much rather boot off of a smaller system on a DVD, USB-stick or floppy to run a memory test. Dell servers have those Dell Diagnostics CDs that are very small in memory footprint just in order to run diagnostics on memory. But even they're not perfect so you often have to take memory out and see if you can reproduce errors.

  17. Re:Paranoia? by dgatwood · · Score: 4, Interesting

    The probability of a cosmic ray at precisely the right angle and speed to cause a single bit error and cause an app to crash is somewhere on the same order as your chances of getting hit by a car, getting struck by lightning, getting torn apart by rabid wolves, and having sex in the back of a red 1948 Buick convertible at a drive-in movie theater on Tuesday night, Feb. 29th under a blue moon... all at the same time.... Sure, given enough bits, it's bound to happen sooner or later, but it isn't something I'd worry about. :-)

    The probability of RAM just plain being defective---failing to operate correctly due to bugs in handling of certain low power states, having actual bad bits, having insufficient decoupling capacitance to work correctly in the presence of power supply rail noise, etc---is probably several hundred thousand orders of magnitude greater (probably on the order of a one in several thousand chance of a given part being bad versus happening to a given part a few times before the heat death of the universe).

    Memory test failures (other than mapping errors) are pretty much always caused by hardware failing. If running memtest86 in Linux works correctly for days, this probably means one of three things:

    • A. Linux is detecting the bad part and is mapping out the RAM in question.
    • B. The Linux VM system doesn't move things around RAM as much as Windows. Thus, random chunks of code don't end up there, and the few that do are in rarely used parts of background daemons or unused kernel modules so you don't notice the problem.
    • C. Linux power management isn't as rough on the RAM or CPU as Windows. Dodgy RAM/CPUs are most likely to fail when you take them through power state changes like putting the machine to sleep or switching the CPU into or out of an idle state. If Linux is making power state changes less frequently, is not using some of the lowest power states, is not stepping clock speeds, is not dropping the RAM refresh rate in sleep mode, etc., then you are less likely to see memory corruption. Similarly, power state changes can increase the rate of crashes due to a defective CPU or memory controller (northbridge).

    I couldn't tell you which of these is the case without swapping out parts, of course. You should definitely take the time to replace whatever is bad even if it seems to be "working" in Linux. In the worst case, you have a few bad bits of RAM, they're somewhere in the middle of your disk cache in Linux, and you are slowly and silently corrupting data periodically on its way out to disk.... You definitely need to figure out what's wrong with the hardware and why it is only failing in Windows, and it sounds like the only way to do that is to swap out parts, boot into Windows, and see if the problem is still reproducible in under a couple of days, repeating with different part swaps until the problem goes away. Don't forget to try a different power supply.

    --

    Check out my sci-fi/humor trilogy at PatriotsBooks.

  18. Re:Paranoia? by Workaphobia · · Score: 3, Insightful

    several hundred thousand orders of magnitude

    We've crossed beyond the realm of the astronomical and into something else entirely. Surely you meant several orders of magnitude, aka, hundreds of thousands of times? Let's keep things on this side of the googol.

    --
    Evidently, the key to understanding recursion is to begin by understanding recursion. The rest is easy.
  19. Use ECC in anything you care about by LukeCrawford · · Score: 2, Insightful

    really, it's not that much more expensive. Search newegg for unbuffered ecc, if you are using a desktop class system that can't handle registered ram.

    You wouldn't put data you care about on a hard drive without raid, would you?

  20. Was it cosmic rays, or...? by Nom+du+Keyboard · · Score: 2, Informative

    Was it cosmic rays, or Alpha particle decay from impure materials that was going to do in our memory soon? IIRC it was the latter.

    --
    "It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
  21. OK by Runefox · · Score: 2, Insightful

    1) Do people trust a memtest86 error to mean a bad memory module or motherboard or CPU?

    Yes. I do, anyway; I've never had it report a false-positive, and it's always been one of the three (and even if it was cosmic rays, it wouldn't consistently come up bad, then, would it?). Then again, it could also mean that you could be using RAM requiring a higher voltage than what your motherboard is giving it. If it's brand-name RAM, you should look up the model number and see what voltage the RAM requires. Things like Crucial Ballistix and Corsair Dominator usually require around 2.1v.

    2) When I check my email on my desktop 16GB PC next year, should I be running ECC memory?

    Depends. If you're doing really important stuff then sure. ECC memory is quite a boon in that case. If you're just using your desktop for word processing and web browsing, it's a waste of money.

    --
    Screw the rules, I have green hair!
  22. Re:Paranoia? by Anonymous Coward · · Score: 5, Funny

    and having sex in the back of a red 1948 Buick convertible at a drive-in movie theater on Tuesday night, Feb. 29th under a blue moon... all at the same time....

    Mom?

  23. metal armour is the answer by captainpanic · · Score: 4, Funny

    I usually wear medieval armour. Not only does that work as efficient as tinfoil, it's also very fashionable.

  24. Re:Paranoia? by DamonHD · · Score: 2

    When rudely swiping at other people, at least stop dribbling nonsense like "several hundred thousand orders of magnitude greater". I don't think you know what you are talking about. >>10^100000?

    So I discount the rest of your "contribution" accordingly. Actually, several other parts of your answer are independently rubbish too: have you considered a career in tabloid journalism? Wish I had mod points...

    Rgds

    Damon

    --
    http://m.earth.org.uk/
  25. Re:Paranoia? by fractoid · · Score: 4, Informative

    Not only silicon... NASA astronauts consistently observe bright flashes in orbit, whether their eyes are open or closed. It is believed that these flashes are the result of cosmic rays interacting with the astronauts' retinas.

    --
    Rampant carbon sequestration destroyed the Dinosaurs' tropical paradise. I'm here to help repair the damage.
  26. Re:Paranoia? by dgatwood · · Score: 2, Interesting

    You're right that I've never run memtest86 at all. I hadn't regularly worked with any hardware based on an Intel architecture until about two years ago, and haven't experienced any RAM problems in that relatively short period. That is the sole valid criticism in your post, and even that was redundant. The rest of your post consists of you putting words in my mouth that I did not say.

    Regarding point A., many Linux systems do perform at least rudimentary RAM checks. What I said was that it is remotely possible that it got lucky and detected the problem during such screening, then flagged that page of physical RAM as defective. I never said anything about checking every write to RAM. That was you putting words in my mouth, and completely ludicrous words that I'd have to know almost nothing about hardware to say, at that. NIce straw man.

    Regarding point B., that's not a baseless troll argument by any stretch of the imagination. First, running a lean Linux distro will almost certainly thrash pages around far less than 64-bit Vista simply because the OS uses far less RAM. Second, last time I used it, Linux wired down a -lot- of pages down in the kernel. All of those pages are just going to sit there. If anything, this was a criticism of Linux's tendency to wire too many pages, not any sort of "pro-Linux" comment. Maybe it might be taken to mean that Linux is less likely to eject pages belonging to one process in favor of another process---indeed, my experience has been that it does seem to do so less frequently than some other operating systems, though this can either be good or bad depending on the workload in question---but that was in no way implied by my previous comment, nor certainly was there any value judgment on my part as to whether such behavior is good or bad.

    Likewise on point C., I was actually being harshly critical of Linux's power management, albeit without coming right out and saying it. Nowhere in my statement did I in ANY way insinuate that failing to switch into the lowest power states was in any way a good thing. It isn't. Poor power management leads to diminished battery life in portables and increased electric bills from computers of all types.

    Before you go painting me as a pro-Linux troll, you need to learn some reading comprehension skills and stop trying to put words in my mouth. It only makes you look like a troll yourself.

    --

    Check out my sci-fi/humor trilogy at PatriotsBooks.

  27. Here's the article I remember RE alpha particles. by Jay+Tarbox · · Score: 5, Informative

    http://www.ida.liu.se/~abdmo/SNDFT/docs/ram-soft.html

    This references an IBM study, which is what I think I actually remember but could not find quickly this morning.

    "In a study by IBM, it was noted that errors in cache memory were twice as common above an altitude of 2600 feet as at sea level. The soft error rate of cache memory above 2600 feet was five times the rate at sea level, and the soft error rate in Denver (5280 feet) was ten times the rate at sea level."

  28. Re:Paranoia? by redirect+'slash'+nil · · Score: 5, Informative

    My bet is that it is cerenkov radiaton as a high speed charged particle breaks the speed of light in the fluid in the eyeball.

    Indeed, these flashes have pretty much already been identified as the result of Cerenkov radiation.

    --
    Looks like these truths are not so self-evident after all...
  29. Re:(Sensible) People do use ECC RAM by swilver · · Score: 2, Informative

    I think it occurs quite a bit more often than once every few days. It is however rare that you'll notice since the data corrupted is often not in code that would lead to a crash, actual program code being such a small percentage of RAM usage these days. A flip in graphical data, sound or text data will very likely go unnoticed. Same goes for flips in code-paths that are rarely used.

  30. My experience dictates it... by kdawson+(3715) · · Score: 4, Insightful

    I was recently running a server that archived about 2 terabytes of data and got something like one or two bit errors per year that could not be traced to any known source. This server boasted ECC RAM, so the problem didn't occur in the ram, and it was unlikely for it to have occurred in the FSB.

    If you go with non-ECC, I would suggest running memtest86+. If you get errors, swap the memory. If swapping the memory still doesn't take care of it, swap motherboards! I recently had a memory problem in one of my customers' racks, and running memtest86+ got nothing until I had it running on my bench for over a week. There may be some problems with memtest86+...I even had another bit-error that memtest86+ did not find, but a Linux commandline memory tester found a problem almost immediately

    The problem here is that different testing/usage patterns result in different probabilities of finding potentially bad words, e.g. words that may only be bad if you read from them a hundred cycles consecutively. But, if you do see a failure in memtest86+ or the CLI tester, you got yourself a serious problem. The point to take from this is that if you don't see errors, that doesn't mean you don't have errors!

    Having said this, I still don't think memory errors among PCs are that common. We have more RAM on machines these days, but at the same time, the manufacturing processes have become better. I have a personal conviction in believing that though the likelihood of word error due to the increased amount of words in memory has increased, the RAM itself has become so much more "solid" that the increase of memory is negligible. Now, if you do dumb things with your computer like running it without a case or not giving it ventilation( learned this the hard way) or overclocking it, you *WILL* still run into problems. But if you design a system with quality and integrity, you typically shouldn't have these issues with memory!

    One last thing to point out: there is quality hardware, and there is cheap hardware. My PC-Chips motherboard ran for three months and two days, and I didn't have a problem. Two days out of warrant. Now, take my MSI motherboard. It sets the timing for all memory modules to have the values of a single module. This resulted in stable single module operation, but got flaky for all four modules. I Finally moved to ECC before I figuerd out that I had to manually set the correct timings. This board is an ultra board, but apparently, it does not include use of generic (Micron, Corsair, etc!! - tried 'em all) memory modules. People on the Newegg reviews board have memory issues with this board as well that they could not fix with a BIOS update, and it appears that sometimes a design just is bad! Even the "good" manufacturers do not spend a lot of effort to fix issues in some cases.

    My words of advice: Do your homework. Read through the reviews. AND DON'T BUY HARDWARE AS SOON AS IT COMES OUT!

    1. Re:My experience dictates it... by frieko · · Score: 2, Informative

      I had a box once that kept crashing randomly. I thought it was the OS, the memory, couldn't figure it out. Finally realized that one of the memory slots was bad. Kept that one empty and it was solid.

      Granted, it was a $30 motherboard.

  31. Re:(Sensible) People do use ECC RAM by Thundersnatch · · Score: 4, Funny

    actual program code being such a small percentage of RAM usage these days

    I see you've never experienced the joys of J2EE.

  32. One eye open! by camperdave · · Score: 2, Informative

    ...if you catch it while hibernating

    Be careful. Vista hibernates with one eye open. It can wake itself up from hibernation to do updates. I dual boot my laptop with Linux Mint (an Ubuntu variant). Every week, I'd go to turn on my computer only to find that the battery was dead. Checking the startup logs showed that linux was starting up at about 3:00 in the morning. After googling, I found out that many people were having that problem. The suggested solution was to turn off Vista automatic updates. I checked my Vista, and sure enough, it was set to update at 3am. I turned that setting off: no battery issue. I turned it back on: battery drained.

    My CMOS settings pages do not have any facility for waking the laptop at a specific time, so I don't know how Vista manages it. I only know that it can. So beware! Vista hibernates with one eye open.

    --
    When our name is on the back of your car, we're behind you all the way!
  33. Re:Mod Parent Up by somersault · · Score: 4, Insightful

    if you look at the username it's not him at all, it's someone with ID 1344097 pretending to be him. Still, what he says is sensible, and what's wrong with this piece? If it doesn't interest you, why are you reading the comments?

    --
    which is totally what she said
  34. Re:Paranoia? by pz · · Score: 2, Interesting

    The real issue with memory cells flipping is not cosmic rays -- at least not with terrestrially deployed memory, it's alpha particle emissions from radioactive decay of the plastics in the memory package. Yes, the plastics surrounding the silicon.

    A lot of work has been done to reduce the radioactivity of plastics used in IC packaging from normal background levels that you don't worry about in day-to-day life, to as quiet as possible, by carefully selecting source materials that have few naturally occurring radioisotopes.

    From my chip-designer days I recall that the minimum charge required on a dynamic memory cell (like the ones in your computer's DRAM) to prevent spurious bit flips is one million electrons, give-or-take. The various designs back then were coming up with ways to reduce the footprint of the elements used to store that charge.

    That said, it's been about 10 years since I've been in that line of work, and things have probably changed -- strike that, they've definitely changed -- substantially.

    --

    Put my fist through my alarm clock with its ding-dong death inside my ear. - The Blackjacks.
  35. Re:Quit trolling by somersault · · Score: 2, Insightful

    I do get it, but some people (like me) automatically skip past sigs, and the guy created his frickin name as KDawson's name plus his userID, what about that don't you get?

    --
    which is totally what she said
  36. New Microsoft ad slogan by mkcmkc · · Score: 5, Funny

    You must be unlucky or the cause.

    This would make a great slogan for Microsoft's new ad campaign:

    • Windows: if it doesn't work perfectly, it's your fault.
    --
    "Not an actor, but he plays one on TV."
    1. Re:New Microsoft ad slogan by Goaway · · Score: 3, Insightful

      This is truly a sign that Windows has caught up with Linux: It used to be only Linux users saying that, but now Windows users are, too!

  37. Re:Here's the article I remember RE alpha particle by Westmalle · · Score: 2, Interesting

    http://www.ida.liu.se/~abdmo/SNDFT/docs/ram-soft.html

    This references an IBM study, which is what I think I actually remember but could not find quickly this morning.

    "In a study by IBM, it was noted that errors in cache memory were twice as common above an altitude of 2600 feet as at sea level. The soft error rate of cache memory above 2600 feet was five times the rate at sea level, and the soft error rate in Denver (5280 feet) was ten times the rate at sea level."

    IBM research is a wonderful resource in the area of soft errors. I do remember exactly reading your quote, I didn't bother to track the exact article, but it should be part of this special issue http://www.research.ibm.com/journal/rd40-1.html, the banner article mentions Denver but doesn't have the exact quote. The web shows it would be "Terrestrial Cosmic Rays", the second article in that issue. They have a more recent special issue on the same subject http://www.research.ibm.com/journal/rd52-3.html

  38. To The OP: by multimediavt · · Score: 2, Informative

    Some simple tests:

    1. Does the machine have ECC RAM?
    2. Do you know, for a fact not a presumption, that your hardware is 100% compatible with Vista64?
    3. Do you know, for a fact not a presumption, that the Ubuntu version you are using supports ECC?
    4. What manufacturer of RAM are you using?
    5. What is the configuration of your RAM, GB per stick?

    Being one who has maintained an 1100 node cluster with 8800 pieces of ECC RAM I can tell you we chase bad RAM sticks ALL THE TIME! It's not necessarily due to cosmic activity, the RAM just exhibits bad behavior as the circuits get older and things start to separate and break down due to thermal load over time. Even a small defect that would let the RAM pass the manufacturers tests will eventually lead to a DIMM failure down the road. Most average human beings will never determine why their machine crashes every few days if it is a RAM issue. Some power users will even overlook it because they have too much faith in RAM that *was* good when they bought it, but now that it's two or three years old ...

    I wouldn't trust a single app to verify your RAM. Run a couple different tests and see if you can nail down the problem. I can look and see how we're tracking that and get back to you.