Data Storm Caused Nuclear Plant To Shut Down
rs232 writes to let us know that the US House of Representatives Committee on Homeland Security called this week for the Nuclear Regulatory Commission to further investigate the cause of excessive network traffic that shut down an Alabama nuclear plant. Investigators want to know whether the data storm could have been initiated from outside the plant.
the NPG electrode was replaced with carbon blac
All of the plant employees were looking up Starcraft 2 news.
I am on the road crew. This is my stop sign.
>Investigators want to know whether the data storm could have been initiated from outside the plant.
Do invesigators also want to know how a "data storm" could have caused a nuclear plant to shut down?
I'm missing the "haha" tag so much! Was it probably a pr0n related issue? Or just another case for the **AAs ?
The employees used YouTube and MySpace
Some choice quotes, emphasis added:
An investigation into the failure found that the controllers for the pumps locked up following a spike in data traffic -- referred to as a "data storm" in the NRC notice -- on the power plant's internal control system network. The deluge of data was apparently caused by a separate malfunctioning control device, known as a programmable logic controller (PLC).
"Conversations between the Homeland Security Committee staff and the NRC representatives suggest that it is possible that this incident could have come from outside the plant," Committee Chairman Bennie G. Thompson (D-Miss.) and Subcommittee Chairman James R. Langevin (D-RI) stated in the letter. "Unless and until the cause of the excessive network load can be explained, there is no way for either the licensee (power company) or the NRC to know that this was not an external distributed denial-of-service attack."
Wow. Just...wow. As if you needed more proof that this wasn't a hacking attempt:
"The integrated control system (ICS) network is not connected to the network outside the plant, but it is connected to a very large number of controllers and devices in the plant," Johnson said. "You can end up with a lot of information, and it appears to be more than it could handle."
Seriously, how stupid do you have to be to think "OMG, Haxxors?" Answer: work at Homeland inSecurity, or be a Congresscritter. They already figured it out. It was a controller for a specific piece of equipment that flooded the network and triggered a bug in the variable-frequency-drive controllers for pumps.
Please help metamoderate.
You'd hope that in something as critical as a nuclear power plant the answer would be, very quickly, "no, it didn't come from an external source because that's impossible". Followed by detailed analysis of the logs to determine which internal system screwed up.
That said, the article is a bit sparse on actual technical details, so my derision may be unwarranted.
If this did come from The Outside, then this is the first domestic instance of nuclear terrorism (that we know of).
It is a darn good thing we have the DMCA to have their AOL accounts pulled, and prosecutorial power over their actions.
I feel quite relieved and secure now.
---
On a serious note, I really hope the problem was stupidity, or malfunction instead of malicious intent, no matter what the agenda of the attacker.
Sounds to me that the vendors under-engineered their network and still charged mega-bucks for it. The auditors, I'm sure, are making the most out of this to justify their fee.
Nothing to see, move along - I'll say!
I prefer Flambe as apposed flamebait.
musta been the slashdot effect..
MABASPLOOM!
Firstly I would re-design that entire infrastructure and rid that power plant of incompetent IT people. Secondly I would hold those in power responsible for 1) not having failover measures in place 2) not having a stable and robust enough infrastructure in place 3) obviously not being SCADA compliant. If they can't pass IT security implement simplistic measures such as a properly designed network, it makes me wonder about the physical security aspects of it. What am I paying higher taxes for everytime the gov cries about strenghtening infrastructure when they couldn't even avoid something as stupid and as simple as a 1) safe 2) stable network. Why wasn't there any failover who knows. Insanity when three different agencies can all come down on one agency instead of WORKING with that agency to take corrective measures. US Tax dollars at work. We need to redesign infrastructure and some of these idiots in office.
Infiltrated dot Net
As usual, the American government is looking to extend its control over things. "Oh noes, look what terrorists might have done. Homeland security needs more funding and less oversight to prevent this in the future." When will people learn to assume the government is lying first, then wait for them to prove themselves right later?
- I voted for Nintendo and against Bush
Do investigators also want to know how a "data storm" could have caused a nuclear plant to shut down?
The two questions, where and how, will be answered at the same time.
Friends don't help friends install M$ junk.
Isn't it a bit odd that they were using a non-deterministic network - something like Ethernet, by the sound of it. Back in the early 90s, I was always told that networks like Ethernet were great for office apps, but not where you wanted guaranteed times for message delivery. For that token ring, FDDI and the like were better. What is the network infrastructure of choice in a nuclear power station?
After yet re-reading, I find this government even more insanely stupider than I would have hoped for... Such failures are common among PLC and supervisory control and data acquisition (SCADA) systems, because the manufacturers do not test the devices' handling of bad data, said Dale Peterson, CEO of industrial system security firm DigitalBond.
"What is happening in this marketplace is that vendors will build their own (network) stacks to make it cheaper," Peterson said. "And it works, but when (the device) gets anything that it didn't expect, it will gag." So you mean to tell me pretty much there is no enforcement for manufacturers to maintain compliance on their products even if those products are going into a nuclear *ANYTHING... Which on the worst case scenario could cause catastrophe, yet we have regulatory commissions on the flow of ketchup, regulatory commissions/directions/etc., on weight loss products, lipsticks, etc. (FDA), but this place is not concerned with nuclear plants. Sinful.
Infiltrated dot Net
At least their reactor failed to "off" this time...
Schwab
Editor, A1-AAA AmeriCaptions
...nuclear viagra
Table-ized A.I.
Behold, the power of Slashdot!
A cat fell asleep on a keyboard
Seriously, how stupid do you have to be to think "OMG, Haxxors?" Answer: work at Homeland inSecurity, or be a Congresscritter. They already figured it out. It was a controller for a specific piece of equipment that flooded the network and triggered a bug in the variable-frequency-drive controllers for pumps.
As someone who used to work in system's engineering for a sister BWR, I think the inspection is a good idea. Oh, there's dumb and there's nuclear dumb but this is not a case of either. Nuclear dumb involves putting machine guns nests inside the plant. Finding the root cause of the accident is a good idea.
Handwaving about a PLC device won't do. What ultimately caused the PLC malfunction needs to be answered at a component level. There's going to be something wrong with it and that should be reported and every other device like it needs to be ripped out and trashed. If there is not component failure, there's a software problem which also must be understood.
Yes, it could have been hackers. The "internal control network" might at some point hits a desk that's connected to the wider world. It could be something mundane and unintentional, like an operator's virused up laptop.
An outage like that is something that's going to have both NRC and corporate ass-chewers looking at everything. Corporate might want to paint a nice picture for the NRC, but the poor devil that lies to them goes to jail. In either case, the problem will be identified and eliminated.
You might also have noted in the article that this is not the first plant to go thumbs down over some winblows born virus. In 2003, the slammer worm caused havoc at an offline Ohio plant. Yes, that was hackers. They did not mean to do it, but the plant's systems were open to it and failed. That's not acceptable from any standpoint.
Despite the better advice of the computer people at the plants, Entergy is a big M$ Partner. They take the big dogs out fishing and sell them the works. Ten years ago, M$ had something worth while and interesting. It was used in places it should not have been. Worse, the flaws from ten years ago have not been addressed or fixed. A good clean up is in order.
Friends don't help friends install M$ junk.
Firstly I would re-design that entire infrastructure and rid that power plant of incompetent IT people.
You need to find the root cause. You don't know it yet, so you don't really know what to do.
Chances are, the cause has been written up by the four or five systems engineering people in charge of the plant. They ARE competent, but they are never given the resources they need.
Why wasn't there any failover who knows.
There was a failover - they overrode the broken thing. Had the operators been gassed, the plant would have turned itself off when the water level got too high or low. This is a big deal but ultimately the plant was safely shut down and no one got hurt. It's designed to do that even if you could shear the feed water pipe off and they did not let the new fangled control network mess with that.
Friends don't help friends install M$ junk.
Wow, someone who's worked in a BWR down modded in less than ten minutes. Nice work, trolls.
DMCA, Hollings, Palladium. What might have sounded like paranoia is now common sense.
> data storm
Is that a nice way of saying they were downloading pr0n?
> US House of Representative's Committee on Homeland Security called further investigate
Boss: "So we don't have the backups for the first two weeks in April"
Employee: "Yes Boss. They were obviously misplaced by terrorists"
When Homeland Security is done, my refrigerator door was left ajar last night. I think it was terrorists too. Think I'll phone this one in.
"Ok, techie, give me the jist of it."
"It seems the problem was with the NC9828A chip"
"Oh? And what was the problem?"
"It melted, basically. It went bonkers."
"Ah, and then what happend?"
"Err... it caused the shutdown."
"But how?"
"Well, I presume the AH-982's got deluged with data, so they shut off."
"Ah, so it was some sort of data thing."
"Kind of, the failing chip would start sending data in the network t--"
"Hey, it's like a storm of data! Hah! I get it!"
"Umm, basically."
"Oh man. A data storm! I better tell the NRC"
"Ok, sure."
Later...
"Sir, I have the cause of the shutdown, it was caused what the tech guys here would call a data storm."
"A data storm? Wow. So your reactors got a bunch of bad datas, right?"
"Errr.. kind of, the microchips melted."
"Data can do that?"
"Yeah, it's like a storm on our, uh, logic networks. I guess that can melt the microchips"
"Uh oh. Maybe this storm came from outside the plant! One of those hacker attacks!"
"Hmmmm, the guy said it melted, but I suppo--"
"Oh crap I better inform Homeland Security!"
"Ok, sure."
Later still...
"Yeah, we had a data storm and it melted the reactor networks."
"How did this data storm happen?"
"I don't think they know yet, but it messed up big time."
"My God. Do you realize this could be Al Qaeda?!!"
"Could realize wha--"
"Al Qaeda! Terrorists. Internets terrorists."
"I don't know if the reactors are hooked up to the Interne--"
"Listen. Keep this quiet, but make sure you tell everyone you know. These reactors are not safe! No one is safe from the terror!"
"Well, it was a data storm. Can terrorists make data storms?"
"Yes. They caused your meltdown."
"No, no, the microchips melted down because of the storm. A meltdow--"
"In the terror business, there's more than one type of meltdown, you just let us handle this."
"Ok, sure."
insightful. He's worked in a nuclear power station and seems to be clueful.
1) They can't describe what happened
2) They can't tell if outside interference, whatever the nature occurred
3) That this might have an internal/design cause
/. again.
In the land of the blind, the one-eyed man is king.
Erris is Twitter's goddamned motherfucking sockpuppet.
This sig intentionally left blank.
Wow, shilling your own posts does work!
Web2.0: I love when people Flickr my cuil and digg my boingboing until my google is reddit and I start to yahoo
They device manufacturers create their own stacks and then don't test them? Are we just asking for a nuclear accident, or does this strike anyone else as stupidity?
At least it wasn't connected to the public internet. Can you imagine the havoc THAT would create. But I wonder why they're treating this as a criminal thing. Did someone modify a device?
There's an excellent summary on Risks.
Great news, guys. This is going to be a non-issue. People are freaking out because a digital device is involved, and freaking out because a nuclear power plant was involved, but I do industrial control system and DCS design for a living, and I'll tell you right now, that you simply can't access control networks from the outside. There are seperate, often redundant networks, and even then, depending on the way the plant was designed, we're talking modbus plus or something that PCs don't normally access.
It's been a long time.
Why, exactly, are the failsafe systems of a nuclear power plant hooked up to the Internet? Stuff like this needs to be completely sealed off from outside intrusion. And I mean completely. You don't need an internet connection to operate a power plant.
Granted, there's probably valid uses for it, but the computers with a 'net connection need to be isolated from the ones that actually keep the plant operating.
Who in God's name connects a plant's coolant regulation systems to the Internet? How could it be an outside agent when the "data storm" happened on the plant's INTERNAL network.
The article says that explicitly. "Internal network." The DHD is worried about outside agents penetrating the plant personnel, not someone with a laptop uploading a virus like Jeff Goldblum in "Independence Day."
If there *was* such a "data storm" attack, it would _have_ to be caused by an inside saboteur. The plant needs to focus on HUMAN security, not computer security. Either that or they need to reconsider a faulty design.
But can we try, just try, not to write completely hysterical baloney? Hysterical baloney is a tradmark of "Homeland Security," and they might see fit to sue.
--
Toro
Maybe someone inside the complex are using some p2p network...
From article
:)
---
Such failures are common among PLC and supervisory control and data acquisition (SCADA) systems, because the manufacturers do not test the devices' handling of bad data, said Dale Peterson, CEO of industrial system security firm DigitalBond.
"What is happening in this marketplace is that vendors will build their own (network) stacks to make it cheaper," Peterson said. "And it works, but when (the device) gets anything that it didn't expect, it will gag."
In many cases, a simple vulnerability scan will even cause the devices to crash, Peterson said. During tests in an electrical substation, Nessus running in safe scan mode crashed devices, he said. In some cases, sending out broadcast data on the network will crash several of connected devices, he added.
---
Scary, and really strange.
All pipes, pumps and other physical components have to pass multiple safety and quality checks before they are allowed as plant constructions materials.
While it seems that computing components don't have even rudimentary interoperability checks.
It seems that US nuclear safety board (or whatever is the name) is dangerously computer illiterate. Not a good thing.
One of my programming teachers was a retired system architect for Finnish nuclear power plant, and he had bit different stories to tell how they tested computer systems.
In one lecture that turned into a story of how they did things taught me more about software testing than I have gotten on any other course after that
So fortunately it is not that bad everywhere.
"The power plant doesn't work"
First link in google.
list of incident at brown ferry (click on candle)
C. Sagan : A demon haunted world:
http://www.amazon.com/gp/product/0345409469/
visit randi.org
Russian nuclear engineers flood you of the lack of your "Research & Development" ...
I have actually seen such a problem myself: Controllers crashing because someone was testing the network. The problem was, ofcourse, that the CPU spent a lot of time to handle the amount of packages on the network and therefore didn't have time enough for it's real-time application. (It didn't help that the platform didn't support DMA.)
Solution: Make the network interrupt handler threaded and prioritize it below the real-time application. Sure, that doesn't help the SCADA performance, but you have to make sure that the real-time application meets it's deadlines no matter what is going on on the network. I simply don't buy that you can secure a network stretching over more than 1 meter against "data storms."
In the letter from the Committe on Homeland security to Nookoolur Regulatory Commission about it,
In accord with current regulations, NRC staff decided against investigating the failure as a "cybersecurity incident" because 1) the failing system was a "non-safety" system rather than a "safety" system,
If failure of the component is dangerous enough to force a system shutdown due to lack of cooling, that IS a safety system. It's like saying there's no need for bracing the BOTTOM of the ladder, cuz it's such a short fall.
It wasn't the power plant that made this determination. It was the NRC.
Pavlov wouldn't be so famous if he'd used a can opener instead of a bell.
"I don't know what's more disturbing, the fact that it happened or that fact that it happens so often we have a name for it."
Strength through redundancy and over-design
"They were also using candles to determine whether or not the leaks had been successfully plugged .. The electrical engineer put the candle too close to the foam rubber, and it burst into flame"
..
..
Don't try and find sources of draft using inflammable foam and a candle
Don't route the backup system through the same conduits as the primary one
was Brown's Ferry *AGAIN!?!??!* (Score:4, AGhaaaaaa !!!, ohh God, make me a believer )
davecb5620@gmail.com
In this case it was a rogue device pumping out too much garbled date. The solution being to design nodes that isolate the network from such occurances. The devices communicate through some high level protocol that is validated by the nodes before getting on the network, not as seems to be in this case standard TCP/IP over Ethernet. The loss of two recirculation pumps because of a network event is not trivial.
Network stack has too high priority (Score:1)
davecb5620@gmail.com
If you've worked in IT for a while, you would remember Procomm. It was probably the best PC based terminal emulation/RS-232 scripting programs. I know a few folks who still use it for automated embedded equipment telemetry & command applications. Procomm was written by none other than "Datastorm Technologies corp.".
Now back on topic: If you RTFA it says that there was an embedded networked controller that was "babbling" (flooding the internal network with unwanted traffic). Unless some hacker from outside penetrated their firewall and reprogrammed the embedded controller, this incident is probably due to a simple hardware failure. They should be looking at setting up redundant network interfaces for their "mission critical" systems to prevent this sort of thing from happening in the future.
JSL
http://kb.iu.edu/data/aibq.html
There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
Q.What caused the data storm?
A.When cold bits and hot bits collide a storm like formation is produced.
sometimes its just rain bits, other times it can be hails bit(or even bytes!) and also light bits(normally occur in fiber optic networks)
People here are talking about ethernet and ip, but all the PLCs i've dealt with use 4 wire rs-485. Some do allow access to the serial network via ip, using a windows app. I don't know if the plant uses ethernet (using ip, or another protocol), but it's possible, if they have many devices over a large area. However, you shouldn't make that assumption.
A "data storm" can be caused by lots of things, even an unstable driver causing a NIC to spew garbage packets. Or an application that hits a bug and begins spewing to the network. Or a failure of Spanning Tree causing network loops to arise (which can really mess up an Ethernet).
The wierdest I ever saw was a situation at a school where the entire network (built around high-end Cisco switches) crashed hard. It took 3 hours of troubleshooting and disconnecting various segments to finally pin down the cause. It was a little mini-switch that some teacher attached to the LAN that somehow had a meltdown and began spewing "valid" Ethernet packets with all kinds of random garbage source and destination MAC addresses, random payload, and valid checksums. No hosts were attached to the mini switch, so it had to be something in its microcontroller going haywire. This cause every switch to go nuts trying to maintain its forwarding tables ("show cpu" was 100% utilization) and resulted in no traffic going anywhere. It even crossed VLAN boundaries since all the switches had "trunk" ports using tagged VLANS, so the garbage packets still made it through the entire LAN.
These things happen sometimes. Network gear is generally pretty robust, but can still fail in wierd ways.
Not so, FUDster. The follow up report to the blackout by CERT was published in 2004 (and Slashdot also linked to it).
On page 133-134 of the report itself we find:
The Cyber Analysis sub-team was led by the CERT(R) Coordination Center (CERT/CC) at Carnegie Mellon University and the Royal Canadian Mounted Police (RCMP). This team was focused on analyzing and reviewing electronic media of computer networks in which online communications take place. The sub-team examined these networks to determine if they were maliciously used to cause, or contribute to the August 14, 2003, outage. Specifically, the SWG reviewed materials created on behalf of DHS's National Communication System (NCS). These materials covered the analysis and conclusions of their Internet Protocol (IP) modeling correlation study of Blaster (a malicious Internet worm first noticed on August 11, 2003) and the power outage. This NCS analysis supports the SWG's finding that viruses and worms prevalent across the Internet at the time of the outage did not have any significant impact on power generation and delivery systems. The team also conducted interviews with vendors to identify known system flaws and vulnerabilities.
Bold emphasis mine.
Just more of the same sad, tired bullshit FUD you're famous for. And the mods fell for it because you shilled your own bullshit with your sockpuppet account.
Jebus, you'd think someome would have the forethought to attach a passive sniffer if the whole plant depends on on a working network.
The CERT report you quote is about Yet Another incident where Winblows was fingered, the 2003 Blackout, that has nothing to do with Davis-Besse besides M$.
The fine article is rather clear:
They go on to mention three other incidents, including the later whitewashed Blackout account.
Just more of the same sad, tired bullshit FUD you're famous for.
Your love of M$ has blinded you again. Why do you feel so much for a big dumb company and their software? How many screw ups does it take to convince you that M$ does not belong everywhere and they have serious issues to resolve before they can be trusted anywhere.
Friends don't help friends install M$ junk.
Here I'm thinking this is a massive DOS situation, but these people could have been easily nailed by that phpBB exploit on Linux for all practical purposes. That's even more fucking pathetic.
Likely they're talking about some sort of 'Traffic Storm' (which is some type of data). I have seen and heard of a lot of devices that are very poorly designed and don't expect a lot of extraneous data on their lan. Most commonly these are things like PBX'es and small 'appliance' devices that have some simple SNMP or web mgmt capabilities. You stick them on an internal lan with lots of broadcast traffic, where there may be other interesting things going on and i've seen them either die under the interrupt load (insufficent cpu for the 10Mb or 100Mb they negotiate) or just lock-up because of what it thinks is a corrupted frame.
Do not meddle in the affairs of wizards, for it makes them soggy and hard to light.
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
There is a lot of ignorance and selective reading of the article going on here! I've worked in power stations, both nuclear and conventional, for nearly 20 years designing, building and commissioning control systems.
1. The control system failure did not trip the reactors - the operators did manually as per their operating instructions which dictate the action in the event of both of the affected pumps failing.
2. The PLCs and drives will not have been using Ethernet for controlling - they will simply have been connected to the site-wide LAN for monitoring.
3. Using a PLC on a non-nuclear safety piece of equipment is common practice.
4. Network/Ethernet/Data storms are a known phenomenon, and are acknowledged by the PLC manufacturers.
I first came across this phenomenon about 8 or 9 years ago. Several PLCs connected to the Ethernet network (the Ethernet connection only for remote monitoring/programming - not for control), and all had faulted, shutting down a hydro-electric power station. When investigated, I discovered all the PLCs had erased their application software, looking like they'd just come out of the box.
An IT engineer had been on site at the time replacing a blade in a hub.
On discussion with the manufacturer, this was a known problem, due to what they called an 'Ethernet Storm,' but not a problem they thought was a serious issue and needed publicising. They even had a fix, but wanted £2000 per processor to upgrade the firmware.
I pointed out that there were serious implications especially for Chemical/Nuclear plants etc. and that they should be proactively addressing the problem. They eventually agreed to upgrade the problem to something called a 'code 10' and that way all our processors would be upgraded for free. In the new revision, there was a new register called an 'Ethernet Storm counter'.
Since then, the problem has re-occured and we are now uprevving all our processors to the latest revision of firmware - which they say is now definitely Ethernet storm resistant (we'll see!).
In critical applications where control system components need to communicate with each other, we do use Controlnet, Modbus, etc., monitor for device failures and all the other good practice that Ranjan advocates, but Ethernet is widely used for non critical connections for MIS, Remote Monitoring etc. Who would foresee that a non-critical connection to an Ethernet network could erase the memory of a processor? Possibly someone who designed the equipment, but not your experience Control Engineer who would expect a piece of kit to be 'fit for purpose'.