OpenSUSE Beta Can Brick Intel e1000e Network Cards
An anonymous reader writes "Some Intel cards don't just not work with the new OpenSUSE beta, they can get bricked as well. Check your hardware before you install!" The only card mentioned as affected is the Intel e1000e, and it's not just OpenSUSE for which this card is a problem, according to this short article: "Bug reports for Fedora 9 and 10 and Linux Kernel 2.6.27rc1 match the symptoms reported by SUSE users."
Any decent firmware for a device should not allow the user to accidentally destroy the device. Looks like Intel skipped on Q&A.
1% APY, No fees, Online Bank https://captl1.co/2uIErYq Don't let your $$$ sit in a no-interest acct.
Why won't people stop using the word brick to mean things that aren't bricked! All you have to do is use a quasi-negative reverse transponder linked to your flux capacitor to generate an inverse tachyon field, connect it to the JTAG while chanting Siaynoq and it will come right up. Sheesh!
I hate it when people keep incorrectly using brick . . . . wait, what? They used it right? Oh . . . my bad, carry on.
Your hair look like poop, Bob! - Wanker.
Kernel 2.6.27-rc7 has a changelog entry that reads:
Christopher Li (1):
e1000: prevent corruption of EEPROM/NVM
That's because it a closed driver.
Only Intel knows what it does.
Looks like it write the NIC's firmware and cannot reset just the NIC. So it asks for a reboot.
IOW: the NIC needs the reboot, not Linux.
Almost all drivers in Windows are closed.
Obama's legacy: (N)othing (S)ecure (A)nywhere and (T)error (S)imulation (A)dministration
Gun and air-conditioning aside, devices should not allow accidental bricking or physical damage unless it is inherent in the function of the hardware.
For cases of loading bad firmware, the "load new firmware" instruction should have a few failsafes like magic words or what-not so it isn't accidentally invoked.
Even better, hardware devices should have a failsafe firmware burned on silicon that can be reactivated by flipping a switch, setting a jumper, or some other hardware-action-required setting. This "failsafe firmware" may be nothing more than a stub that prepares the device to accept a new "real" firmware, but at least it will allow de-bricking.
You don't really want this debrick/failsafe-mode to be triggerable through software alone, it's too much of an opportunity for malicious use.
Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
Remember how everyone on /. called bullshit?
This doesn't look good for our cause.
THL phish sticks
I work on the e1000 team (including the e1000e driver) and here is what we know. A panic in another driver (believed to be the gfx driver but uncertain) which scribbles over the NIC/LOM non-volatile memory (NVM). This is only happening with the 2.6.27-rc kernels on ICHx systems. Since the NIC/LOM VNM is part of the whole BIOS image other things in the system could be effected by this driver panic as well. An update of the system BIOS will restore the NIC/LOM to be operational. We have some patches under test right now that we will be releasing later today to protect the NIC/LOM NVM. That should help narrow down who is scribbling over NVM.
There should be a jumper somewhere which either physically disables writes to nonvolatile memory. That applies to all hardware. The current incident is a bug, but this could also be exploited by malware for embedding itself in the mainboard firmware or the PXE firmware on a network card. Also, bring back write-protection switches on USB sticks.
It is a beta using an early release candidate kernel.
Isn't this the sort of issue that testing releases are meant to catch? It is unfortunate that some users got bit by the bug but it probably isn't very widespread.
Of course, if this was Windows the blame wouldn't be aimed at the hardware...
What do we learn from this incident?
1. Beta is not for the common people.
2. Programmers are humans are erroneous.
3. "This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; [...]"
Unfortunately, due to inferior materials used in the chip's casing, exposing the device to a sufficiently strong inverse tachyon field will cause protonium breakdown which will in turn cause an endothermic reaction, which in turn will fracture the silicon along the sharp drop-offs in the resulting thermal gradient. As a side-effect of the presence of the inverse tachyons, the failure will happen in the near future rather than immediately. In other words, your device will work on the testbench but by the time you put it into production then *crack* there it goes.
Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
I've heard of BSD, NetBSD, OpenBSD, and FreeBSD, and others, but what is this BSDM of which you speak?
Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
This with a suse bata not a finale release with Mandrake
A cooling system should have a build in over ride that trun it back on after it its too hot. There was a movie that got this part right
Are you talking about the Linux driver? e1000e isn't at all a closed driver, it's in the kernel.org source, which is published under the GPL v2. That's about as un-closed as you can get.
Chris "Ng" Jones
cmsj@tenshu.net
www.tenshu.net
It's by no means limited to a SuSE beta. Ubuntu's Alphas of 8.10 (Intrepid) are also using 2.6.27-rc kernels.
SuSE are taking the heat because they were first to put out a warning about the issue, but it is not their fault, or specific to them.
Chris "Ng" Jones
cmsj@tenshu.net
www.tenshu.net
1. Beta is not for the common people.
Is that why Betamax (consumer video tape format) died and Betacam (professional video tape format family) lived?
For example, it needs 2 layers of control. One layer should be minimal and not subject to any possible reprogramming. It might even be implemented in pure hardware (not firmware). But if it is implemented in firmware, it needs to be in "read ONLY not physically possible to write" memory (e.g. legacy ROM). This layer is the one that implements the function to write the flash that controls the next layer of the device. And this first layer needs to function even if the next layer is hosed or doing strange stuff. Or alternatively, the first layer would have functions to stop, reset, and restart the next layer. This would allow the flash to be reloaded even if it is completely fouled up now. It should not be necessary to have a JTAG port for this.
now we need to go OSS in diesel cars
No, as one of the previous posters stated, it should shut down. A system should always go into the safest mode of operation if a problem is encountered. If your reaching a critical temperature and the operator has turned off the fans, you shut down. The system doesn't know why the fans were disabled and so it should not try to start them. (What if the operator shut them down because of a fire?)
Which definition of "bricked" is this? The real "bricked" meaning "doesn't work anymore" or the apple iphone definition of "bricked" which has come to mean "doesn't work unless you reboot it" ?
One of the vista betas bricked on old RTL 8029 clone card I was using. Vista lasted about 2 hours installed before I decided it was rubbish and went fully GNU/Linux instead.
... to reverse the polarity.
Mit der Dummheit kämpfen Götter selbst vergebens
Dear Editors,
I am writing today specifically to request that the phrase "don't just not work" be banned from this site. This phrase is ridiculous and utterly confusing. Even expanding it to the root words of "do not just not work" does little to help.
The English language is mutilated just fine without needing extra help by poor editing.
Thanks,
Concerned Reader
Indeed. https://lists.ubuntu.com/archives/ubuntu-devel/2008-September/026559.html
"When I first heard Daydream Nation it quite frankly scared the living shit out of me." -- Matthew Stearns
I once had to literally throw away 18 of 24 e1000's, because they could not be made to work reliable under Linux. Unfortunately among those working were my two test cards that I got beforehand. I will not get any networking equipment from Intel ever again. It seems to me that just because of their well-known name they believe they can get away with massively substandard quality.
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
It is pretty much THE gold standard. There are one or two other vendors that have good chipsets as well, but probably half the servers in existence are using Intel NICs, and 3/4 of the rest are Broadcomm.
When you pay 20 bucks for an ethernet card, you get what you pay for. 95% of the time it is fine for everyday use in a PC, but they aren't at the same quality level (and not even close to the same performance level) as the enterprise class products. Same is true of all the other vendors.
"Malo periculosam, libertatem quam quietam servitutem." -- Jefferson
This can also affect Mandriva Linux 2009 pre-releases. To be clear, the bug is in the upstream kernel itself, not in any code specific to any distribution.
It affects any 2.6.27rc kernel, whether it's in a distribution or a clean upstream build.
We have posted a full, detailed notification of the issue for Mandriva users.
That causes a fire to start 5 minutes ago.
Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
Some Intel cards don't just not work
with the new OpenSUSE beta, they can get bricked as well.
Get it now?