Slashdot Mirror


Software To Diagnose Faulty PC Hardware?

Etylowy writes "Over the years I have repaired my own PC and those belonging to family and friends many, many times. While in most cases it turned out to be restoring a system after malware/the user/Windows made a mess, or simple cases of 'follow the smell of smoke and molten plastic,' there were some nasty ones where the computer mostly works. By 'mostly,' I mean: you can boot it up, it might even work for a while, but will crash way too often to blame it all on Microsoft — what do you do then? Once you strip it of any extra hardware (which, with today's motherboards that have pretty much everything integrated, might not be an option) you are left with the CPU, motherboard, graphics card, RAM and HDD. You can test the HDD, you can run memtest86+ to check the RAM, but how do you go about testing the CPU, motherboard and graphics card trio to find which is to blame? Replacing them one by one isn't really an option. Do you know of any software that would help the way memtest helps with RAM?"

7 of 274 comments (clear)

  1. OCCT by PFAK · · Score: 4, Interesting

    It will stress your RAM, CPU, and GPU or all at once with pretty temperature and utilization graphs (for Windows only): http://www.ocbase.com/perestroika_en/

    --

    Free means no restrictions, ironic the FSF's GPL forces restrictions, isn't it? What's your definition of free?
  2. prime 95 by LordKronos · · Score: 2, Interesting

    Prime 95 is a good test of CPU/RAM, as well as to see if the system remains stable under peak temperature. It's often used to burn in overclocked machines.

  3. Re:Preventative Medicine - get a UPS by a09bdb811a · · Score: 2, Interesting

    If you notice that your lights dim a little bit when your fridge compressor or AirCon comes on, that is a recipe for a computer failure.

    Why? Doesn't the computer's PSU have enough juice in it to survive a quick dip in voltage? Besides, almost all PSUs are rated ~90-260V, so I always assumed if it dips from 230V, it won't matter.

    Occasionally my lights dim but I don't seem to have had problems. I'm still waiting for my decade-old P3 to die so it can be replaced by an Atom board, but the darn thing keeps on running.

  4. Re:Built in Self Test by gd2shoe · · Score: 2, Interesting

    It is a good start, but no more than that. Those tests are certainly not comprehensive (and should be). On the plus side, they often have your specific hardware in mind, and might possibly catch something that other tools wont. (doesn't happen often, but sometimes...)

    SMART is also not the end-all of hard drive indicators. A drive can pass SMART, and still be on the way out. I've found (for those familiar with Linux) that a dd from the hd to /dev/null will often spit out errors on a drive that's getting ready to fail. A linear read is far faster than a read/write surface scan, albeit not as thorough. (can be run from knoppix live CD)

    --
    I won't join Slashcott. OTOH, If Beta goes live, I just won't be back until it's fixed. Sorry Dice.
  5. Re:Just replace it. by Trahloc · · Score: 2, Interesting

    So your solution is to pawn off the problem to someone else? Either your an ancient tech near retirement who is bored to tears with what he does or you never really loved this stuff to begin with. Unless I take glee in the idea that a particular individuals machine is broken because I despise them I'll help utter strangers fix their stuff just because its *fun* to figure out a problem. Helping the other guy and gaining their gratitude is a bonus. And yes I've been doing this for long enough that it isn't something I'll "grow out of".

    --
    The Goal: A long simple life filled with many complex toys.
  6. Power supply by Alioth · · Score: 2, Interesting

    You didn't mention the power supply.

    In my experience, a "crashy machine" is almost always down to the PSU. Out of the dozens of "crashy machines" I've had to fix, only one was due to bad memory. The rest were *all* down to faulty power supplies, and all of those were due to capacitors that had failed.

    I have an oscilloscope so I can easily test for ripple without needing to open up the power supply and look for the obvious signs (bulging capacitors, maybe ones that have leaked). We've had dozens of machines at work with supplies that have gone bad this way. Bad capacitors have been a real problem in recent years. Four years ago, it wasn't just in power supplies either - we had to return 70 machines to Hewlett-Packard under warranty after the capacitors on the motherboard began failing after 3 months of use. We've not seen anything on that scale on motherboards since, but we still have frequent problems with power supplies failing from "capacitor plague".

    A machine of mine was actually killed by a sudden power supply failure - the PSU let the magic smoke out with a loud "bang", and there was the sound of stuff richocheting around the computer's case. That sound turned out to be bits of exploding chips on the motherboard. The only thing that survived that incident was the CD-ROM drive - all other components were destroyed.

  7. QuickTech or QT by iq+in+binary · · Score: 2, Interesting

    My shop uses it, works pretty well. A full scan can take up to 6 or 8 hours (we set up hardware diags before leaving for the night, and in the morning on a 24-channel KVM), but it is THOROUGH. VRAM, RAM, HDD, CPU, everything is tested and thoroughly. First step should be testing the PSU, then running QT.

    --
    Of all the Universal Constants, here's one I know: Nice guys finish last ;)