Lessons In Hardware / OS Troubleshooting
Esther Schindler writes "We like to imagine that every Microsoft OS installation will work just as well as the company promises. When things don't work out, identifying and remedying the case of failure can be time-consuming and frustrating. This lesson in how to determine why Windows 7 didn't install may help you troubleshoot a problem of your own, and save you from a Lost Weekend. Maybe you'll find this account useful all on its own. But the real key here is that the author is Ed Tittel — who's written over 100 books. If this hardware geek spends days solving a CPU-meets-Windows 7 problem, what chance do mere mortals have?"
He has issues with an "unsupported and unwarranted engineering sample CPU from Intel" with Windows 7... and Windows 7 is of course to blame according to the OP.... *roll eyes*
We like to imagine that every Microsoft OS installation will work just as well as the company promises.
Actually around here people like to imagine that every MS OS installation will miserably crash, because then they strut around feeling good about using Linux.
This is front-page news for Slashdot now? Here's the sum total of TFA:
Wow, color me impressed!
How are "mortals" supposed to figure it out? I guess they buy a PC from Dell because everything in that article qualifies as "no duh" for system builders.
"What do you despise? By this are you truly known." --Princess Irulan, Manual of Muad'Dib
/)
just rolls right on past the fact that, if what he was installing was -- oh, say -- a Linux distribution, he wouldn't have an opaque "I'm uncompressing files" thermometer, he'd have real progress status messages, with, y'know, *parameters* and stuff, and -- unlike me this morning with my boss's iPhone -- a hope of actually figuring out what's broken.
But he's apparently completely blind to the fact that that's the *real* problem here.
"We'll just make fault-tolerant users", indeed
Sounds like this system would have crashed unzipping ANY large file under ANY OS because it was the application that the CPU couldn't properly execute.
I'm a little suspicious; how much of an expert can you be writing 100 books on a variety of subjects.
.net development classes, php, etc..... Yeah he couldn't answer any basic questions that strayed from the text book in front of us.
Reminds me of a tech instructor I had who proudly informed the class he teaches oracle classes, mysql classes, sql server classes, cisco classes, juniper classes,
We know perfectly well that when we boot from a Ubuntu CD that we will end up with Ubuntu and don't think a second of why Windows 7 didn't install.
If crapware won't install on your computer, don't install the crapware. I'm sure there's plenty of other choices out there.
If this hardware geek spends days solving a CPU-meets-Windows 7 problem, what chance do mere mortals have?"
You need first to show me a "mere mortal" who has, and uses, an engineering sample CPU. There is a very good reason why -ES parts are marked as such - because they have bugs. And those bugs will be a problem sooner or later.
So the whole sob story can be reduced to this. The guy runs software on a prototype hardware, and the software crashes. In other breaking news, dog bites man.
Also, he was using an unsupported engineering sample CPU which was, in fact, the culprit for his issues. In other words, he ran into a Hardware Issue NO end user will, or most power users.
[X] Psst ... it's not your imagination, honey :-)
... oh, wait a sec ...
... but at least it blends ...
[X] I use BSD, you insensitive clod!
[X] In Soviet Russia, Windows crashes YOU!
[X] CowboyNeal is my runtime environment (Ewww!)
[X]
[X] If someone says "There's an app for that" one more time I'll throw a chair at them!
[X] Steve Ballmer posts on slashdot!!!
If you have never had a hardware issue when installing Linux on a machine you must be very lucky.
"Most things work fine" people tell me, which is true. The trouble is that the chances of you owning something that doesn't work is relatively high. (There's probably something from my statistics course that explains why that is, but I have so far managed to suppress that memory.)
After having rebuilt a Mac with OS X, and rebuilt a laptop with Ubuntu 9.04, I was surprised at how smooth and the Ubuntu install was. Of course that was until I wanted to use my webcam with Ubuntu. These kinds of problems get very difficult very fast in Linux. When 9.04 first came out there was a dependency problem that meant that you couldn't easily get some webcams working.
To be fair, that problem is most likely sorted out now, and a non-Apple webcam would have needed a (very easy to install) driver on OS X as well. The point is, Windows and hardware generally work very well.
"If this hardware geek spends days solving a CPU-meets-Windows 7 problem, what chance do mere mortals have?"
Simple really. WE ARE SPARTANS!
I have mod points and I am not afraid to use them.
I probably installed Windows 7 over 200 times [..]. Since then, I’ve installed the OS at least another 200 times [...]Until a couple of weeks ago, I never encountered a single problem that stopped me from installing Windows 7 itself.
400 installs w/o a problem and then a problem that has nothing to do with Windows 7. Now read the summary of the article, 'If this hardware geek spends days solving a CPU-meets-Windows 7 problem, what chance do mere mortals have?'
Really? That summary blurb is what you got from the article.
Gotta love /. people.
And those work because they have the drivers built in without the need to swap hardware or anything.
Attention... all grammer nazi"s! Is they're anything; wrong with: my post,
Man installs Windows 400 times on 30 different processors without a hitch. On the 401st time, it fails when trying to install on a machine with an old engineering sample CPU. What on Earth is a mortal to do?
So, after swapping out every component of his PC for a new one, the installation was successful. Did he not notice that it was no longer his old PC? That it was a now new PC in the old case?
This is not unlike the story of the guy with an axe that he claimed used to belong to Abraham Lincoln. The handle had been replaced three times and the head replaced twice, but it was still Abe's old axe.
Ultimately this is in Microsofts best interestes to ensure people can install Windows SHINYNEW(tm). There scattered offerings are the cause of this problem, they should offer one OS install disk with maybe a few kernels. Disclaimer I've never installed windows 7, its on mums cheap laptop and works fine for her much improved on previous offerings and for once MS makes XP look like the dated and cumbersome giant it is.
I didn't read the article, but judging by the summary, I think it is more about troubleshooting than assigning blame.
He had a running PC, but he couldn't figure out how to install a different OS on it (using the barely-supported bootable flash drive method)... so he threw parts at it until it worked, essentially installing Windows on a completely different PC than the one he started with.
I fix computers for a living, and I often reinstall Windows for my customers... the difference is that if I fail, I don't get paid.
He should have tried installing Windows from the DVD, or from the hard drive (using WIndows PE to kick off the installation).
I am unimpressed. I award him no points, and may Dog have mercy on his credit card bills.
Actually I had the exact same problem a few months ago upgrading a Dell server from Win2003 x86 to Win2008 x64, I suspected the CPU from the beginning, but I spent a few hours before the Dell Tech agreed with me. They sent a replacement and it worked like a champ.
This proves it has happened to a production Intel Core2Duo CPU at least once, I can't believe I was the only one.
iRepairIT - iPhone, Mac, & PC Repair
Dear self important guy who isn't near as good at computers as he thinks he is:
This may surprise you to learn, but all those defaults out these, all those specified values, all that kind of stuff, that isn't just arbitrary. See many smart engineers and other folks worked on designing and creating all the hardware for your computer. A lot of extremely complex stuff went in to it, modern computers are quite a marvel of engineering. As such, they discovered that certain tolerances, certain ranges work well. Outside of that, there can be problems. Thus the defaults because, well, default. They set them so that things are very likely to work in all cases.
As with most things, they aren't absolutes. They aren't things you can never exceed. In various circumstances you can go outside those normal ranges, sometimes by a little, sometimes by a lot. However, problems can potentially result. What problems those are and when they happen is not predictable. A system can appear stable but only crash on one app, or it can be stable for awhile then develop an instability.
Regardless, the first step to troubleshooting should be to USE THE FUCKING DEFAULTS, you idiot!
Seriously, I'm supposed to take someone seriously who is running overclocked settings of some sort or another (RAM timings, FSB, etc) and an engineering sample CPU and has problems? Ummm, duh. That right there is asking for problems. When you OC, you go in to it knowing you may have some difficulties. You understand this is the tradeoff for something that runs faster than spec. If you start having problems, the first step is to back off the OCing and see if that fixes it.
This is true even of OC'd systems that were fine but aren't now. I had a Celeron 300 that I OC'd to 450 back in the day and it worked well for about a year, then started to burn out. System started crashing randomly, and so on.
To me, it sounds like he's being whiny because he didn't bother to troubleshoot his setup properly. Come talk to me when you've got a retail CPU running at stock spec and FSB, RAM running per it's JEDEC spec at standard voltage and so on. Oh, what's that? You did that and it stopped having problems? Well there you go then. Don't bitch that your i7 920 "should" run at 3.8GHz. I don't care if others have done it, doesn't mean it'll work in your case. If it does, wonderful. It if doesn't well tough shit. Don't get mad at the software. It has pretty much no way to know if the CPU is going crazy as it runs on the CPU. About the only way software can indicate a CPU problem is by inducing a problem and thus a crash.
I found this very useful. I have been having the exact same problem, Asus Mobo w/ AI Tweaker enabled. Swapped out some parts, came to the conclusion it must be the mobo and decided to install ubuntu instead.
Not all Windows 7 installs are painless. When I installed mine, I found out the hard way just how dependent a Win7 installation is on a network connection. My network connection wasn't supported so the install failed miserably (seriously - I haven't seen a desktop like that since Windows 95). I eventually worked my way around it by installing a wireless card and connecting through it. Great fun.
The guy does 400+ successful installs, then runs into a decidedly obscure hardware problem, and people flame him? And Windows 7?
Yee Gods. Get a life folks. I read this as a success story, both for the author and for Microsoft.
Three Squirrels
may help you troubleshoot a problem of your own, and save you from a Lost Weekend
I thought people really liked that show. What's wrong with watching the episodes on friday night, saturday, and sunday?
Nowhere in the original article did I get the sense that the author was blaming Windows for his issues. In fact, he starts out by stating that he's installed Windows 7 hundreds of times without a single incident, but this was a "problem PC". So, how did this turn into an anti-Windows rant? Oh, right, it's Slashdot...
who's written over 100 books
Michael Behe's written dozens of books trying to debunk evolution. It does not make him an expert in evolution. He installs Windows, copies down what he sees on the screen and writes it down. That does NOT translate into "he knows what he's doing". I'm not saying he's not an expert, just that it's not a valid qualification.
If this hardware geek spends days solving a CPU-meets-Windows 7 problem, what chance do mere mortals have?"
They wouldn't be installing an OS. Very few non-geeks do so. They buy a computer from a vendor like Dell, it comes with an OS. When it's time to upgrade, they buy a new PC and give the old one to their kids or grandparents. They also, as has been stated numerous times in the comments, wouldn't be installing on machines that had an engineering sample for a CPU. Actually, this debunks the claim that because he's written books, he's an expert. He knew he had a machine with an unsupported processor in it and still replaced everything in the machine first. Um....duh!
Which is more painful? Going to work or gouging your eye out with a spoon? Find out!
http://www.workorspoon.com
60 to 90 minutes to replace a motherboard?
The Gregory House M.D. of OS installation.
Sig this!
Standard approach for solving installation issues:
This isn't even geeky; it's rote procedure for anyone that's been in the tech support business long enough to see enough weird crap happen. Glad he learned, though; you need to start somewhere!
For those that didn't read the article, he apparently fiddled with BIOS settings and replaced the mobo before even looking at the components.
(Obligatory nostalgic moment: I remember when Windows was much younger and error messages were oh-so-much crueler. Those were the times. Then again, blue screens nowadays are still pretty cruel, though way more informative. [STOP codes, memdumps, and even culprit FILES sometimes are great] Remember the completely useless VxD blue screen errors from W9x?)
that's how the corepirate nazis do it. just fire randomly/blindly into the 'audience', then demand applause. anything to keep your eye off the 'ball'.
as bs is not usually a fatally effective weapon, there's lots of repeat (daily) performances.
I did read the article. The buggy part he found after swapping everything is the CPU... An engineering sample. The kind of part that should be the first on the potentially defective part.
Most component called engineering sample are called that for a reason. Usually it really mean : Wow I can't believe it kinda work!
If he's supposed to be good debugger, I fear for everyone who need is PC fixed.
Just last night I fixed my parents computer in one of those long fixes that turns out to be the most fundamentally trivial things. This is why this is not my main occupation.
Basicly they had a reccently built custom Windows 7 + Ubuntu PC that had begun randomly shutting down, often minutes after it had been powered up.
Ok first thing, any obvious errors or cicumstances? No, it would just randomly power off. Windows event logs showed kernel power events, no specific driver, service or app crashing anywhere. Linux was the same. Not a thermal issue cpu + gpu temps nominal and stress test din't immediatley cause a crash.
Suspecting a power or a motherboard issue, first checked and re-seated things internally. It still occured.
Removed extraneous cards, connectors and drives. No result. It would even happen sitting in BIOS setup. Have ruled out a number of problems.
Checked for electrical shorts, poor voltage etc.
Dying power supply? Overloading or shorting? Nope, all voltages nominal, and it was brand new.
I was about to try a spare power supply and a thought occured to me..
It's almost as if the reset switch was being hit, but it wasn't even close to being knocked at any point and the switch otherwise worked fine. Then I knocked the case and the system reset. Yep, the reset switch was faulty, jolting it even slightly would reset. Who needs a reset switch since Vista anyway? Unplugged it from mainboard. Solved.
I decided not to even joke about charging my Dad for two hours of my time.
Chances are if he paid someone to do it they wouldn't necessarily have found the fault that quickly, and he'd be hundreds of dollars out of pocket.
The lesson in troubleshooting? Um... I'm not sure.
After logging in slashdot still does not take you back to the page you were on. It's been that way for 20 years.
...
How many people had the same impression I had: "Why, this sounds exactly like one of the 'Chaos Manor' columns Jerry Pournelle used to write in BYTE!"
All it needs is a few of Jerry Pournelle's favorite stock phrases. "The disk trundled for a while..." "I tried swapping out the hard disk, but no joy..." "I called up Bill Godbout..."
"How to Do Nothing," kids activities, back in print!
The OP needs to be careful, there are sharp corners present.
That's exactly why I wouldn't trust him much. Takes time to write a book. More time to write it well.
...and it's a good thing, because then I can afford a Mac and don't have to do shit like that on my personal time.
http://xkcd.com/627/
"I zero-index my hamsters" - Willtor (147206)
Nice article but good god it was an "engineering sample". Thats the equiv. of "Beta" in the chip world. They are unfinished chips, or those that are not yet properly characterized so you should expect it to fail. Not really a storey here, nothing is at fault other than someone should have warned you that things may be very unstable.
That was not a strong article in troubleshooting.
He basically used the parts blast technique to isolate the problem. No where in the article did I read any actual troubleshooting steps.
In situations like this there are two methodologies that can be applied. In a system with many components its usually easier to use a reduction technique. It's just a play on the isolation technique, but removing a large number of components.
For home troubleshooting you can still get fairly far without a lab full of equipment or spare parts. However, you don't need to rush to the store to replace every component. Memory, hard drives, processors and even some testing can be targeted against the motherboard in a limited extent.
If you are very lucky you may even have some debug information available from the motherboard itself.
A good linux rescue cd with the right tools for testing and diagnostics can do a world of good. Unfortunately, I don't have time to write a really good article and apparently the TFA didn't either.
"You should always go to other people's funerals; otherwise, they won't come to yours." -- Yogi Berra
but I wasn't using m$ winblows, I was installing Ubuntu from a 16gb flash drive on to an asus eee. That might have worked if I knew what I was doing...
What were those 100 books on? Think about that - how many years has be been writing books. Say it's been 10 years. 10 books a year? A book every 5.2 weeks? WTF
Here's what I posted as a reply to this "expert's" article. It's now awaiting admin approval to appear as a comment. We'll see if it makes it.
==================
While reading, I was thinking this was a well-written detective story. Then I got to the end and found out it's a story about a massive waste of time because you didn't follow standard procedures.
Here's how to save a few days next time: go to the motherboard manufacturer's website, get the list of supported CPUs for the motherboard you're trying to install. Then download and install the BIOS that supports that CPU. It really is that simple.
Asus is particularly good at providing a CPU support list for their motherboards. It took me entire minutes to find the lists for the P5Q3 and P5E3 Deluxe (not P5E3 Pro, as you wrote). The QX9650 is listed for both motherboards -- and in both cases, it is supported only as of a recent BIOS revision.
So all you had to do was download and install BIOS version 0204 or later for the first motherboard, the P5Q3, and I bet Win 7 would have installed correctly the first time.
As for the motherboard automatically making BIOS changes to match the fast DIMMs you installed, Asus motherboards do NOT do this by default. You must have left the BIOS in some sort of overclocker's mode.
Next time, look up and download the BIOS that supports the CPU you're trying to use. After installing it, use the BIOS setting that restores all other BIOS settings to their defaults. Then install the OS. THEN and only then, can you start tweaking BIOS settings.
Once again, the article was well written. But it's also an inadvertent confession.
==================
A test CPU would be the first culprit to come into my mind on a new OS, but then again I guess that's why he gets paid for writing books and not solving real world problems with a boss or customer breathing down your neck.
The gjy is using an engineering sample CPU, he has a motherboard set up for overclocking, and he doesn't have a set of CPU diagnostics. Clueless.
.
With Microsoft and all of it's bullshit, I can hardly remember a weekend spent fixing it - didn't run from Sunday to Saturday - that being Sunday the first of the month and Saturday being the first day of the next month.
.
Voting up, Voting down - If I really gave a fuck about your approval or not, I'd come and ask you.
One of the reasons I use Linux is that, currently, it is much more secure than Windows, given my personal use scenario.
Yes, if I were a specialist in securing Windows that might not be the case, but I'm not. Yes, if equivalent amount of effort was invested to break the security of casual users of Linux compared to that invested in breaking Windows, again, Linux might not be any more secure than Windows (well, with Linux, there are distros where I can always boot off of USB and then not save any changes, so until Microsoft offers me the same functionality there's little chance that I could use it in as secure a fashion as I can use Linux).
Running Linux in a VM under Windows just wouldn't "cut it" for me. Sorry.
what chance do mere mortals have?
Yes, what chance do mere mortals have of being in the particular use case this super-geek is in, using an engineering sample CPU?
Answer: 0. Well, not exactly 0 --- you never know exactly what you're buying nowadays!
What kind of expert is he if he can't figure this out? I mean I'll give normal users a pass on this kind of stuff. It is complicated, no question about that. To the extent I'll razz normal users it is to not mess with it. However, if you are a supposed expert, you should understand it. While complicated, it isn't really all that hard. Learn about what the different kinds of settings are, learn the defaults (or know where to look them up) etc. This is the kind of thing to check when you are troubleshooting a system.
This is also why OEMs turn them off in their BIOSes, because they don't want you screwing with them and then lying and saying it is all default.
At any rate I don't cut this guy any slack. He wants to pretend to be an expert, yet clearly knows fuck-all about the BIOS. Anyone who has mucked around with computers ought to understand it.
Heck, you run in to this stuff even if you don't mess around from time to time. When I upgraded my desktop to 8GB of RAM, it suddenly started having some random reboot problems very occasionally. Going back to 4GB fixed it. Odd, board supports 8GB. So I had a look see at the settings. The RAM was "high end" RAM that had it's SPD set above JEDEC timings, however it wanted 1.9v to do that. My board supplies 1.8v only. Ok, so I went in to the BIOS and set the RAM down to the JEDEC timings (1 more CL). All problems cleared up.
At work we had a system that we got 8GB of RAM for. Older system that uses 533 or 667MHz RAM. We got 800MHz since that's what they sell these days and it functions fine at lower speeds. Well, turns out the board will try to support 800MHz, however with all 8GB in there, again reboots are a problem. The memory controller just can't handle it. Unfortunately, you can't change the timing in the BIOS. So instead put in a set of the 800MHz and a set of 667MHz, which forces the system to 667MHz. All works good then.
While I won't call this basic knowledge, the sort of thing a user ought to know, it is sure as hell part of doing hardware support.
Well, for some kind of "expert" I find his component-swapping "diagnosis" a bit dubious to be honest. Yeah, a lot of the time you can just swap out a suspect component but swapping a hard drive for a installation-hang? It's towards the bottom of my list, especially if that drive has handled other OS installations without any problem and doesn't have SMART errors (Did he check? Did he clean-format in between? Even a corrupt NTFS can exhibit those same symptoms... nary a mention of things like that). I'd actually be suspecting a Windows bug before most of the stuff he replaces, considering that he suggests he was already using that hardware fine on the 32-bit editions of Windows - I'd be making an up-to-the-minute Windows 7 installation CD with every update I could find.
And the graphics card replacement is just completely illogical unless there was some hint that the graphics card was the fault - bad imagery, hangs happening on a resolution change or reboot, etc. The graphics cards use doesn't change during the installation except at those points and so unless there was a temperature problem or something related, it would just be *weird* to suspect that. Power supplies can present odd problems certainly but it makes me wonder how reliable his testing really is normally if he's powering things up on the wrong wattage of power supply without even noticing and then possibly publishing benchmarks / stability reports etc.
By the time you get to the casing, he's just clutching at straws rather than thinking. I'm not saying that I'd suspect the CPU immediately, or even at all, but the logical processes he's using are just dubious. I mean, he changes the memory before he remembers that he has some stupid BIOS option for upping the memory speeds enabled. And doesn't think to do a simple memtest at *ANY* stage? Surely a memtest would have picked up the same memory errors before all the testing, and that would lead him to check the BIOS and then replace the RAM way before he does? It saves a lot of hurt to do *checks* like that, rather than blindly assuming, especially if you're replacing one set of RAM with another that is "older, and well-tried and trusted" (with memory, there's no such thing as tried and trusted... they can die overnight, don't take kindly to handling, etc.). He was using a non-standard CPU cooler, too, which doesn't get mentioned until later in the article. I'd be suspecting temperature problems with his setup which he doesn't even consider.
But at no point does he actually suspect the *only* engineering *sample* that he's ever used and that's been in the machine all along. He just blindly replaces it eventually but he never suspects it until he "finds" the problem.
Hell, must be nice to just be able to order random components until your problem goes away. Other people have to do the same job, with cheaper, shittier hardware, no budget, no diagnostic tools, useless vendor support, while under pressure, to earn their living. But then according to the Wiki he's a "freelance writer", "trainer", "Internet consultant" (Yeurk...), "Director of Technical Marketing" (for a year), "Technical Evangelist", "Director of Training", "Senior Researcher", "series editor", "currently writes for... TechTarget.com Web sites... Tom's Hardware and Tom's Guide... the American Institute of Certified Public Accountants (AICPA)... InformIT.com., "content development and book publishing projects". To be absolutely honest, the glaring thing that sticks out like a sore thumb in that list is writing for Tom's Hardware - everything else is book/article-writing and sales. This guy isn't a tech - he's a marketer. Why his "diagnosis" should be interesting at all is beyond me, except possibly as a humour article.
Oh, and the article seems to just be a way to show off brandnames, model names, and his hardware. I expect as much from a marketing guy.
Real trouble shooters pull RAM first* because mis-seated and bad RAM is the cause of most hardware errors. After that you pull the PSU and then the GPU. If that still doesn't find the culprit you test those parts in known good machines and then swap out the CPU to isolate it or the MB. This guy didn't even have a serious problem, a real hardware gremlin is when you have combination issues and can't isolate a single faulty component. A month in the trenches of front line support teaches you these lessons that you won't learn from a lifetime of building the occasional machine for yourself.
That said if I had a frigging ENGINEERING SAMPLE anything I would pull that first. Anyone who knows anything about hardware production knows engineering samples are for troubleshooting and not freebies for journalists. It is like over clocking, far too many people expect that OCing is guaranteed and stable because most gear these days has some head room. I have seen people whining and blaming the manufacturers because their hardware didn't get the OC that everyone else got.
*And if the user has been OCing or fiddling with BIOS you set that to default as well.
========
CINC, 4th Penguin Legion
If you have a quad core processor you need a 257 bit power supply.
That extra bit is for the power switch.
When you crank out a lot of stuff, it is extremely hard to make all of that stuff be high quality. Quality usually takes time, it takes research, it takes refinement. It is possible, in some rare cases, to have someone that produces a vast quantity of work, all of which is top quality. However it is far more common to see someone produce a vast amount of mediocre to bad quality work.
As an example: Dr. Mark Russinovich has written a grand total of three technical books to date. So, clearly a man who doesn't know what he's talking about right? Wrong. Those three books are "Inside Microsoft Windows 2000," "Windows Internals Fourth Edition," and "Windows Internals Fifth Edition." He has, literally, written the book (along with David Solomon) on the recent versions of Windows, published by MS themselves. These are extremely accurate, comprehensive, technical documents of Windows down to its very fundamental levels. He also has written a suite of tools, the Sysinternals tools, so good that MS bought them, and hired him on as a technical fellow.
So while he's produced only three books, they are all of the highest quality of technical information. There haven't been more because he hasn't had the time to write hundreds of books, nor the need to issue revisions to correct problems with the ones he has (each new edition covers a new version of Windows).
Thus when I hear someone talk about how good they are because of the quantity of they works, I am skeptical. The only way you get a vast quantity of high quality work is either laboring an entire lifetime (and even then often not), being a prodigy, or both.
..a clue about how computers work. Even experienced windows professionals.
I mean this guy has 32 bit OS working and moves to 64 bit OS...am I following this ok. The 32 bit install presumably went well on the hardware and the 64 bit install fails.
So I grok his first attempts which are replacing the install media once. Seems like a reasonable assumption (some bit out of the billions on the DVD image just happened to be flipped the wrong way). From there though he starts to lose me. The motherboard is perhaps plausible but you would have to be assuming some rather significant difference in hardware support between the 64bit and 32bit systems. From there? RAM is 64bit how? Or even my HD?
I think the most significant thing to learn here is twofold.
i) People - even experienced computer professionals - treat computers like they are magic. Like there is no real science behind how they work. Clearly this guy was replacing parts based on some "experiential weighted average" with regard to how likely they are to cause a "weird" problem.
ii) When A. C. Doyle said "When you have excluded the impossible" he neglected to state that the *order* in which one does so is significant. Eliminating things in order of their apparent relation to the problem (i.e. all the things for which 64 bits makes a difference) and (in a business environment) with respect to cost (i.e. Replacing a CPU is often a cheaper test than replacing a motherboard wrt labour) will likely fix your problem sooner than just going for the "usual suspects".
Aside: I've had two cases where I found a CPU issue. One was very similar to this - crashing during a Windows 2000 install - often at the same place. The problem I had was actually thermal - the heatsink was reversed leaving the thermal patch making minimal contact with the heat spreader. Somehow I figured that out without replacing everything else first.
Once again I am reminded why I buy Apple. Back in the day I used to love tinkering with this sort of thing. Then I got bored of that. Now I buy Apple and it just works.
INstall linux and run Windows in a VM. When your windows install gets infected/hosed with a virus/malware/whatever it could well mess up your linux VM machine and make it inrecoverable but if you install Windows in a VM and run on top of linux the worst that can happen is the VM gets hosed.
I'll go a little easier on the fellow and his 100 books. It's hard to not learn at least one nifty trick per book you write. So I'm sure the guy knows a few hundred nifty tricks.
I like your last sentence better. The article tone is wrong. Instead of "the world's hardest install", it should have been a "Friday-At-7PM-at-the-bar" story over a triple beer.
"Okay guys, I know, I know, but when my special spec system crashed on install, I was kinda fried and my method was like going to New York via Alaska..."
My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
The overclocking WAS in default!!! The BIOS mucked around with FSB, memory and clocking due to a DEFAULT "optimize" setting. It only worked AFTER he changed the settings AWAY from default!
On top of which, he had been running the CPU with stock specs and 32-bit Win7.
Meanwhile we're supposed to take someone seriously who either doesn't read or misreads the article then launches into a tirade. Sheesh.
the Year of the Windows Desktop?
I'm not a lawyer, but I play one on the Internet. Blog
So this guy is writing a book about Windows 7 and in his blog at least makes no mention of the Windows 7 Upgrade Advisor? http://www.microsoft.com/downloads/details.aspx?FamilyID=1b544e90-7659-4bd9-9e51-2497c146af15&displaylang=en Granted he is installing a different version (x86 vs. x64) and edition (ultimate vs. pro) of the same OS, it is definitely not a given that driver support is the same with different versions of Windows.
Epic FAIL - not bothering to bench burn-in any change in system hardware.
I've been building PC's for well over 25 years, starting with my empty case IBM PC 5150 (no drives, cards - nothing but 64k soldered on the system board).
First test - TEST! Then, test it again.
I currently use PC-Check to run a diag for a minimum of 24 hours. This does nothing more than to check if you're going to release any "magic smoke" because of an assembly failure.
After that, I load the OS.
I had an old Novell-brand 386 server (yes - they rebranded hardware back in the 2.12 SFT days) that had a bad CPU, but, only intermittently. Worked great when cold, would lock up dead after 10 - 15 minutes, but not all of the time. We replaced the server, I took the old one home, popped in another 386-16MHz (yes - the days of unbridled power!) and it worked flawlessly 24x7 for 8 years until it got too old to bother with and chucked into the shitcan.
Test and test again. That helps ensure your 5 9's of availability!
People that prefer linux will like to believe that windows will not ever be reliable, and people who prefer windows tend to believe that linux won't be.
I've seen this time and time again, both through myself, and my friends/acquaintances.
Just two recent examples of this:
A friend of mine bought a really nice Asus i5 gaming laptop with win7 home premium preloaded on Thur last week. I envy his laptop. It's awesome. Anyway, on Friday, after only running the laptop 2 times, he turned it on, only to find that win7 wouldn't boot. Luckily Asus provided a recovery disk (some OEMS require you to burn your own, he wouldn't have done that that soon). My very first reaction (as I do prefer linux, and I myself happen to have many issues just like this one with any and all windows oses, my friends will confirm that I do for some reason, whereas they do not usually...) was to start laughing, and I remember saying "When will MS get their ^(&# straight?". Now, we all know that it's possible it was his fault, not win7, but it wasn't my first reaction.
Likewise, a different friend of mine has a win7/ubuntu dual boot setup on his desktop, and at one time had xp and ubuntu via wubi on his laptop. He very very rarely uses ubuntu, and I honestly don't even know why he insists on having it installed. His antivirus program thought the wubi virtual disk of ubuntu was a virus, and deleted/broke it. He instantly blamed ubuntu for being unstable (who wouldn't). It was only later, after it happened again and we caught his antivirus in the act that we figured out what was happening. This same friend had a sata cable fail on his desktop which was hooked up to the drive with ubuntu on it. Because of this, his computer wouldn't boot, although oddly enough the drive did show up in bios. He blamed ubuntu instantly for the grub error that appeared when grub couldn't find that drive, and had me come over and help him "fix ubuntu". Well, we figured out pretty quick that he just had a bad sata cable, and after replacing it, everything was perfect.
In both cases, the operating system might not have been at fault. (ok, so ubuntu wasn't in these cases, but I've seen it become unbootable on a netbook after a regular "apt-get upgrade" too..., and the win7 issue might have been my friends fault too, I don't know.) However, it just goes to show how our personal feelings about our favorite operating systems get in the way of factual information about them, and many times people say things and bash them before they think about it. The same goes for both windows and linux.