Slashdot Mirror


Software Glitch Caused 911 Outage For 11 Million People

HughPickens.com writes: Brian Fung reports at the Washington Post that earlier this year emergency services went dark for over six hours for more than 11 million people across seven states. "The outage may have gone unnoticed by some, but for the more than 6,000 people trying to reach help, April 9 may well have been the scariest time of their lives." In a 40-page report (PDF), the FCC found that an entirely preventable software error was responsible for causing 911 service to drop. "It could have been prevented. But it was not," the FCC's report reads. "The causes of this outage highlight vulnerabilities of networks as they transition from the long-familiar methods of reaching 911 to [Internet Protocol]-supported technologies."

On April 9, the software responsible for assigning the identifying code to each incoming 911 call maxed out at a pre-set limit; the counter literally stopped counting at 40 million calls. As a result, the routing system stopped accepting new calls, leading to a bottleneck and a series of cascading failures elsewhere in the 911 infrastructure. Adm. David Simpson, the FCC's chief of public safety and homeland security, says having a single backup does not provide the kind of reliability that is ideal for 911. "Miami is kind of prone to hurricanes. Had a hurricane come at the same time [as the multi-state outage], we would not have had that failover, perhaps. So I think there needs to be more [distribution of 911 capabilities]."

115 comments

  1. backup for 911 by iggymanz · · Score: 5, Informative

    have your local police and fire phone numbers in your cell phone and posted next to your land line.

    1. Re:backup for 911 by Archangel+Michael · · Score: 0, Troll

      Have a gun. In a real emergency, the police are too busy to help everyone.

      Additionally, do you trust the police to actually not shoot you?

      --
      Agent K: A *person* is smart. People are dumb, stupid, panicky animals, and you know it.
    2. Re:backup for 911 by aaarrrgggh · · Score: 1

      Which is kind of ok as long as your problem happens in that area, and there is no actual natural or man-made disaster in play.

      A 911 dispatch center has much better resources than a local police or fire department main desk. If you are lucky, they can operate at 10% of the capacity as a proper dispatch center.

      The system should be more robust. It has improved dramatically since 9/11 with absurd amounts of cash poured into many facilities, but it can't do everything. An alternative solution is to not expect 9/11 to be robust, but hard to tell if that is a good approach for most communities.

    3. Re:backup for 911 by Anonymous Coward · · Score: 2, Funny

      What are you going to do? Shoot the cardiac arrest?

    4. Re:backup for 911 by Anonymous Coward · · Score: 0

      What landline? And every local police department says call 911 for emergencies. The number listed in the phone book is for administration tasks. You call that, and who knows how long it'll take the person who answers to accept that 911 is down (and then get you to the right person).

    5. Re:backup for 911 by Anonymous Coward · · Score: 0

      No silly, he's going to shoot the tornado.

    6. Re:backup for 911 by Anonymous Coward · · Score: 0

      Well, it does speed things up a bit...

    7. Re:backup for 911 by i+kan+reed · · Score: 2

      Have you ever actually called those?

      They say "If this is an emergency, please hang up and dial 911".

    8. Re:backup for 911 by Wookact · · Score: 1

      The police department in my area has both Emergency and Non-Emergency numbers. It is listed that way in the phone book.

    9. Re:backup for 911 by geantvert · · Score: 1

      This is a recording of Archangel calling 911:

      Operator: What can I do for you?
      Archangel: My friend ... he ... he has a cardiac arrest. I think he is dead.
      Operator: Calm down! Go back to your friend and check that he is dead.
      Archangel: Ok ... ... ... PAN ... ...
      Archangel: I confirm! He is dead now.

    10. Re:backup for 911 by Noah+Haders · · Score: 0

      whats a landline. also, you can just google the numbers or ask siri.

    11. Re:backup for 911 by Anonymous Coward · · Score: 0

      So you continue to hold, they pick up and you say "911 isn't working but I have an emergency! My dick is stuck in my vacuum cleaner.".

    12. Re:backup for 911 by iggymanz · · Score: 1

      use your brain, you have no point

    13. Re:backup for 911 by iggymanz · · Score: 1

      wow, do tell how your gun delivers babies, provides oxygen to elderly, puts out fires. For that matter, it won't even stop a crime at your vacationing neighbor's house without likely sending you to prison.

      I love firearms but brainless people with guns are a big problem.

    14. Re:backup for 911 by Russ1642 · · Score: 1

      They also list their dispatch numbers. That's what you call when you want to report a non-emergency. For example, I called it to report a completely wasted drunk lady causing a disturbance. Not quite enough to call 911 but I wanted to report her before she wandered into traffic.

    15. Re:backup for 911 by tlhIngan · · Score: 1

      It actually is as part of 911-NG (next generation) which is meant to better handle next-generation telephony systems as well. Right now cellphones and VoIP 911 is more of a hack than anything in the current system, while the next-gen system switches to a completely VoIP (over a private network) system with failover and tagging and all that.

      So the next-gen system will allow 911 to be contacted in many ways, including texts and SMS, VoIP (both private and internet), POTS, cellphones and all sorts of other mechanisms. And support for metadata is better, so GPS location information is available to everything (right now positional information is a hack for cellphones and non-existent for VoIP which often relies on just sending the subscriber's address info).

      And load balancing as well - a center can go down and its calls get switched transparently to another center to take the excess with all the information the local center had being displayed as well (it's metadata is forwarded with the call).

      Of course, it's supposed to only be deployed in a few year's time because it's a massive overhaul - it's going from circuit switched (POTS primary) to a packet switched (POTS secondary, but compatible) network and all that.

      http://en.wikipedia.org/wiki/N...

    16. Re:backup for 911 by David_W · · Score: 1

      Or failing that, do what we used to do before 911 was implemented in our town: Call the operator, tell them who you want (police, fire, ambulance) and that it is an emergency. Not quite as effective since it lacks the central control 911 has, but it generally works. (And in some areas the operator may have the back-end number for 911 and can transfer you to that, which could also work.)

    17. Re:backup for 911 by Shadowhawk · · Score: 1

      What is this land line you speak of?

      --
      My mind works like lightning. One brilliant flash and it is gone.
    18. Re:backup for 911 by Charliemopps · · Score: 1

      have your local police and fire phone numbers in your cell phone and posted next to your land line.

      That is a great idea.
      But, I used to handle 911 outages. Most 911 outages are due to cable cuts, which would often leave those facilities unreachable as well.

      I'd say that if your phone works, and you can't call 911 or the local hospital, you should assume the trunk leading to those services (foolishly all usually located next to each other) is cut or damaged. So your next best bet would be to call a NON-LOCAL ER. i.e. Call the next town over. Just because downtown is broken doesn't mean the trunk leading to the next exchange is as well. We'd often route that way ourselves until it was fixed. So if you can call there, then they can radio to your local EMTs.

      Also, a lot of times the local network is made up of all of these trunks, but your internet connection heads strait out of town. You might have better luck making a voip call or sending an email. A relative may be able to reach someone when you can't, etc... Text messages might be a good route as well, they are handled entirely different (though I've never dealt with that tech myself so take that with a grain of salt.)

    19. Re:backup for 911 by i+kan+reed · · Score: 1

      I think I specifically have a point that they're not a substitute for emergency calls.

    20. Re:backup for 911 by Anonymous Coward · · Score: 0

      You must have the wrong kind of gun if it doesn't do all that for you. You need a Glock. They do everything.

    21. Re:backup for 911 by Anonymous Coward · · Score: 0

      I've even heard that they are plastic so they can go through airport security w/o problems.

    22. Re:backup for 911 by Anonymous Coward · · Score: 0

      Look around your home. Somewhere you will probably see a tin can with a string attached at one end that goes into the wall. That's your land line (I believe it's called a land line because the string goes to a grounding rod in the land).

    23. Re:backup for 911 by sremick · · Score: 2

      What landline?

      If you care enough about 911 and emergency situations to be reading this article, and you don't have a landline, then that's on you for being irresponsible. People spend more on texting than it costs to have a landline. No excuses.

      The monthly cost of a landline is cheap insurance in the event of an emergency. Cell towers go down, fail, become over-congested, and cell phone batteries die.

    24. Re:backup for 911 by sremick · · Score: 0

      whats a landline.

      If you care enough about 911 and emergency situations to be reading this article, and you don't have a landline, then that's on you for being irresponsible. People spend more on texting than it costs to have a landline. No excuses.

      The monthly cost of a landline is cheap insurance in the event of an emergency. Cell towers go down, fail, become over-congested, and cell phone batteries die.

    25. Re:backup for 911 by davester666 · · Score: 1

      No, you shoot AGAINST the wind, not with it.

      --
      Sleep your way to a whiter smile...date a dentist!
    26. Re:backup for 911 by sremick · · Score: 0

      What is this land line you speak of?

      If you care enough about 911 and emergency situations to be reading this article, and you don't have a landline, then that's on you for being irresponsible. People spend more on texting than it costs to have a landline. No excuses.

      The monthly cost of a landline is cheap insurance in the event of an emergency. Cell towers go down, fail, become over-congested, and cell phone batteries die.

    27. Re:backup for 911 by Archangel+Michael · · Score: 1

      For the idiots that don't comprehend because their rose tinted liberal glasses don't work, here is the relevant portion highlighted:

      Have a gun. In a real emergency, the police are too busy to help everyone.

      You see, you're deliberately changing the parameters to make fun of me. I get it, you're too stupid to have a valid argument against what I actually said, in context, so you change the context. The reality is, it makes you look stupid.

      But okay, lets go with the hypothetical "cardiac arrest" mentioned below. Okay, you're cardiac arrest is because your shop is being overrun by a mob of thugs on a rampage, do you think that calling 911 for an ambulance is going to help you while they are rioting in the street outside your shop?

      If you're going to change the parameters, so can I.

      --
      Agent K: A *person* is smart. People are dumb, stupid, panicky animals, and you know it.
    28. Re: backup for 911 by Anonymous Coward · · Score: 0

      You do realize that the Plain Old Telephony System is independently powered, and people who switched to VoIP and dropped their landline gave up the security that came with paying the phone company to provide the power, right?

      Additionally, cell networks are NEVER provisioned with sufficient capacity to handle all customers, so in a general emergency situation, your odds of making a cell call are diminished severely. I learned just how difficult it is to make a call during the earthquake of 2000 in Seattle.

      Furthermore, TFA mentioned a seemingly arbitray limit set at 40 million emergency call ID serial numbers without any explanation of how or why this quantity was selected. Obviously the individuals involved failed to design, monitor or maintain this system properly, and there are at least 6000 people who can attest to that. I have to wonder if this was the same 911 outage previously blamed on Centurylink's interstate routing errors.

      At any rate, if this is a culmination of the digital information age, then I shall rest assured that the long term emergency management concerns arising from the complexities of anthropogenic industrial climate modification are so far beyond the capacity of human attentions as to be insurmountable. All too often, our leadership seems so easily convinced that implementing new solutions can be responsibly or effectively achieved without better testing small scale proving before large scale implementation.

    29. Re:backup for 911 by umghhh · · Score: 1

      if you shoot a fire maniac before he manages to start the fire then a gun helps indeed. Not sure how that goes for delivering babies - shoot a man before he ejaculates? but you would have to know this in advance, hmm

    30. Re: backup for 911 by umghhh · · Score: 1

      but interwebs were designed to survive nuclear holocaust not some small emergency. If a nuclear holocaust occurs they will still be operating I am sure....

    31. Re:backup for 911 by umghhh · · Score: 1

      if you had a gun as one proposed you could have shoot her which kinds of proves the point of rather having population all armed instead of spending money on 911 and some other commie crap.

    32. Re:backup for 911 by Anonymous Coward · · Score: 0

      Have a gun. In a real emergency, the police are too busy to help everyone.

      Additionally, do you trust the police to actually not shoot you?

      OMG He's having a heart attack! Quick! Shoot Em!

    33. Re:backup for 911 by umghhh · · Score: 1
      Now seriously. For years already my landline is an IP device that does some fancy SIP messaging. It does not even have its power supply coming from the line - it is all coming from the power supply of a DLS modem. I am not even sure if I could get this old fashion phone line anywhere in Europe except maybe in UK but this is not Europe so it does not count. I recall there was some research (or exercise) on this last year (in Germany) where they found out that switch to fully digital communication of emergency services made their communication capabilities extend for maybe 10h or so in case of complete blackout. It used to be days - If memory serves well, then old digital telephony exchange had to have batteries for 2 days of service or so, but I am an old fart so maybe that is a result of my Alzheimer. Actually that is the trend in all other areas (alzheimer and short term abilities to sustain service under duress). We assume that the rest of the world stays online and is able to help. If disaster strikes province in which I live and all surrounding ones and quarantine is assumed - we will have no food in two days or so - thanx to JIT delivery. The same with fuel and water. VW had to stop production after a few days of stopped air traffic (Island vulcano with odd and lengthy name exploded at the wrong moment). UK gov had to call in army after 100 or 200 hauliers went on strike few years back - no fuel and JIT delivery of food etc. This has something to do with computers and interwebs indeed.

      The problem like the one in TFA has become a big problem because you do not you have a dispatcher in your county and each county but one for 7 states so any small glitch that happens will cause significant misery. Redundancy has to be built in. Sadly everytime I hear redundancy I see all the L2 loops of the past.

    34. Re:backup for 911 by umghhh · · Score: 1

      so if there is no emergency calls what then? we all go suck your dick because we have to follow some rule that assumes emergency calls work? let us say that 30% of those 6k people that could not reach emergency number came to your dick for sucking you will be terribly sorry for expressing silly views on how the rules are superior to reality.

    35. Re:backup for 911 by K.+S.+Kyosuke · · Score: 1

      I've heard of some cases of crimes in progress where the 911 refused to send anyone saying that there were no units available, so the guy called again and said he was going to shoot the people in question, and suddenly, a unit appeared within five minutes! Apparently, it works like magic...

      --
      Ezekiel 23:20
    36. Re:backup for 911 by Cramer · · Score: 1

      Right, so I can pay $30+ per month for something I might use once in my lifetime, or $20/mo for unlimited text that I [i]will[/i] use every day. You failed elementary school math, didn't you?

      (Also factor in the number of LECs (read: Verizon, and AT&T) doing everything they can to do away with POTS, and it's even more a waste of money. FYI, if you have FiOS, your copper loop was removed, and is forever more not an option.)

    37. Re:backup for 911 by Cramer · · Score: 1

      30 years ago, MAYBE. Today, "0" could be answered by a call center anywhere. Your call a) might not be answered by a human at all, or b) might not be answered by someone on the same continent.

    38. Re:backup for 911 by Anonymous Coward · · Score: 0

      Because emergencies can only happen at home.

    39. Re:backup for 911 by dgatwood · · Score: 1

      Sounds like a variant of a famous joke.

      Operator: 911. What is your emergency?
      Hunter: My hunting partner just had a heart attack. I think he's dead.
      Operator: Go make sure.
      [sound of a gunshot]
      Hunter: Okay. Now what?

      --

      Check out my sci-fi/humor trilogy at PatriotsBooks.

    40. Re:backup for 911 by dgatwood · · Score: 1

      The monthly cost of a landline is cheap insurance in the event of an emergency. Cell towers go down, fail, become over-congested, and cell phone batteries die.

      Not around here. I'm paying about $40 per month for a nearly bare-bones land line (only Caller ID). Even if I were on a $0.35 per text plan, I'd spend more money on that land line every month than I would on texting for ten years. Cheap, it ain't.

      --

      Check out my sci-fi/humor trilogy at PatriotsBooks.

    41. Re:backup for 911 by Anonymous Coward · · Score: 0

      You're welcome to suck my big black dick Mr. Umghhh.

    42. Re:backup for 911 by antdude · · Score: 1

      What if one travels a lot in USA? 911 does local ones. Can 411 go to 911?

      --
      Ant(Dude) @ Quality Foraged Links (AQFL.net) & The Ant Farm (antfarm.ma.cx / antfarm.home.dhs.org).
    43. Re:backup for 911 by cyberchondriac · · Score: 1

      Sounds like a variant of a famous joke.

      Operator: 911. What is your emergency? Hunter: My hunting partner just had a heart attack. I think he's dead. Operator: Go make sure. [sound of a gunshot] Hunter: Okay. Now what?

      Yep, this is the right way to phrase it, makes more sense this way.

      --

      Look back up at my post, now look back down, you're on the Internet. Now look back up. I'm a signature.
    44. Re:backup for 911 by cheater512 · · Score: 1

      And the brain size seems inversely proportional to the number of guns. At least it seems that way viewing from a safe distance (Australia).

    45. Re:backup for 911 by ShaunC · · Score: 1

      If you care enough about 911 and emergency situations to be reading this article, and you don't have a landline, then that's on you for being irresponsible. People spend more on texting than it costs to have a landline.

      Every line in the US is required by law to be able to dial 911, even if you aren't paying for any service at all. This applies to landlines and cellphones. I often keep the police scanner on as background noise, and I can say at least a quarter of the 911 calls in Memphis originate from disconnected cell phones. If your landline gets a dial tone or your cellphone is charged and has a signal, you can dial 911 whether you have a phone plan or not.

      Of course that doesn't help in the event of a 911 outage but those are, fortunately, quite rare. I can't justify $50 a month to pay for landline service in the extremely unlikely event that both the cell network and the local 911 network go down. If something like that occurs, there's been a catastrophic event on a scale where emergency officials aren't going to be much help anyway.

      --
      Thanks to the War on Drugs, it's easier to buy meth than it is to buy cold medicine!
    46. Re:backup for 911 by evilviper · · Score: 1

      In FIOS areas, it's no longer possible to get a POTS landline. You can get a phone service over FIOS, but it's subject to wall-power being available, and you're using the same E-911 system as normal VoIP or cell phone services, anyhow. It's the FCC that's to blame for me not having a landline.

      Also, there's no reason cellular 911 service shouldn't be ultra-reliable. There are 4 different nationwide carriers in the US. What are the odds that all 4 of them will have ALL their overlapping cell towers in an area knocked-out? That does happen, today, but ONLY because the FCC pussied-out on requiring them to have backup generators in each cell tower, and lets them just keep a few backup batteries in there for short power outages.

      And if some event damages the fiber-optic line to my house, there's no chance I'm fixing it... At least with a cell phone I have the option of climbing onto higher-ground and trying to get a signal from a more remote tower, or even just SMS texting emergency services (coming real-soon-now) and hoping.

      With ad-hoc WiFi in cell phones, people may soon be able to self-assemble into their own wireless network that spans whole cities, after a disaster knocks-out all other local service. Try that with your land-line.

      --
      Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
    47. Re:backup for 911 by sremick · · Score: 1

      Also, there's no reason cellular 911 service shouldn't be ultra-reliable.

      http://www.mychamplainvalley.c...

      What are the odds that all 4 of them will have ALL their overlapping cell towers in an area knocked-out?

      What are the odds your family isn't all on a single cellular carrier, making you unable to take advantage of such redundancy?

    48. Re:backup for 911 by dcw3 · · Score: 1

      In FIOS areas, it's no longer possible to get a POTS landline.

      Hmmm. My home, built in 2001, originally had a Verizon landline. FIOS has been available here for several years, but I never signed up for that (we use cable for everything else). When our power drops out, the old landline still works just fine. Are you saying that just new installations are affected?

      --
      Just another day in Paradise
    49. Re:backup for 911 by iggymanz · · Score: 1

      Wrong, in the USA there is very high per capita ownership of guns in nice areas with no crime. It takes a lot of money to own and maintain firearms.

      Fear the inner city punk with just one. Most the gun crime (and rape, armed robbery, etc.) is done in inner cities by a couple of subcultures with no respect for life and property.

    50. Re:backup for 911 by kmoser · · Score: 1

      And the brain size seems inversely proportional to the number of guns. At least it seems that way viewing from a safe distance (Australia).

      That ratio is usually more closely related to an organ located a bit closer to the middle of the body. (The same organ whose size is inversely proportional to the size of one's SUV or to the speed and price of one's sports car.)

    51. Re:backup for 911 by iggymanz · · Score: 1

      This is my rifle (raise weapon),
      this is my GUN (grab crotch),
      this is for fighting,
      THIS is for fun

    52. Re:backup for 911 by evilviper · · Score: 1

      What are the odds your family isn't all on a single cellular carrier, making you unable to take advantage of such redundancy?

      Verizon and Sprint are compatible, while AT&T and T-Mobile are compatible. And with them all switching to LTE, it's likely they will all be mutually compatible in a few more years, when manufacturers start selling multi-band LTE phones.

      Most every post-paid cellular plan includes voice roaming. Even if you're not paying for roaming normally, when you dial 911, all restrictions are dropped, and your cell will connect to any available tower from any provider that it can.

      --
      Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
  2. Why 40 millions? by AqD · · Score: 1

    40 millions doesn't seem to cross any boundary?

    1. Re:Why 40 millions? by Anonymous Coward · · Score: 2, Insightful

      Probably crosses a licensing boundary. Beats me!

      "We're sorry, your 911 call centre didn't pay their software licensing fee this month. Please call 1-800-RU-LEGIT and report this instance."

    2. Re:Why 40 millions? by Anonymous Coward · · Score: 0

      Hmm, might be rounded up from the 2^25th (33554432) But even that seams arbitrary unless they're encoding other data into some of a bits of 32 bit integer which they might have done depending on how old the systems is or it might be designed for use over slower data links.

      However, This is a fairly common software problem, not having a plan for running into the out bits and not considering that your software we'll probably be used longer then was originally intended. On some software projects I've worked our plan for running out of space to store an incremental number was simply to be begin logging warns when there was only 10% of the range left.

    3. Re:Why 40 millions? by Archtech · · Score: 1

      The word "architecture" is bandied around a lot, partly because it sounds so important. But if architecture means anything, it should include scoping out ALL limits embedded in the software or adjustable through a UI. At the very least the limits should be documented in such a way that those responsible for managing and maintaining the system are fully aware of them at all times. Because they are just as important as the speed at which your car will come off the road when you drive round a tight bend.

      Ideally, resources permitting, a better solution should be systematically adopted. Such as having the software itself warn (in good time) that a built-in limit is being approached. Or simply allocating a type that can store numbers vastly greater than could ever conceivably arise. This, of course, is one of the useful aspects of strong typing: before using any variable, you MUST specify its type, and a good programmer will learn to stop at that point and find out what the requirement is.

      --
      I am sure that there are many other solipsists out there.
    4. Re:Why 40 millions? by Anonymous Coward · · Score: 0

      40 millions doesn't seem to cross any boundary?

      It crosses the clear boundary of 40 million. Limits in systems are not always based in values of power of 2, if that's what you meant.

    5. Re:Why 40 millions? by pushing-robot · · Score: 1

      I think I found the offending code:

      static unsigned int counter[640] // ought to be enough

      ...

      void countcall() {
          int i;
          for (i = 0; counter[i] = 65535; i++) {} // skip the ones we filled up
          counter[i]++;
      }

      --
      How can I believe you when you tell me what I don't want to hear?
    6. Re:Why 40 millions? by Anonymous Coward · · Score: 0

      counter[i] == 65535

    7. Re:Why 40 millions? by umghhh · · Score: 1

      this is OT but this and other SW faults of a kid are getting more and more common because we value working software over comprehensive documentation without understanding what the hell that means thus we do not have either.

    8. Re:Why 40 millions? by umghhh · · Score: 1
      Sir you are damned naive. We have no money for shit like reason and qa. potentially shippable product will be shipped as soon as it compiles and sometimes even without fulfilling this requirement. Actually in a project I work for we reached release and discuss happily faults on a fault list while a customer on which our industrial application got installed for testing gave us another very short but not empty list with faults that it sees as a show stoppers. The lists do not overlap on any single point. Which of the lists would you start discussing at the release meeting? Yes bravo - the ones on the little problems list. I actually had some designers denying that the customer list existed, that the faults on the list existed and one stubborn guy claimed that the problems on the list have been fixed albeit question about how that was verified was left without answer. The funny thing is - 2 of these faults I have seen before even the preliminary software version our customer got was packaged. Now you say this is anecdotal and has no bearing on wider production practice - I dare to differ on that based on experience I had over last 10y.

      I agree with you at least as far as to say that if need be such considerations have to be made at some point during development. Often enough they are not because people live their fantasy of craftsmanship or being an artisan doing some fancy stuff where hard rules of reality do not apply.

      In other words: there are no requirements in the jungle. There is only you and your agile team in search of holy Grail of software development practice (BTW: I think agile manifesto is actually nicely organized set of principles of what is important but none of the agilists I worked for over the years, did actually read it).

  3. killbots have a preset kill limit by Anonymous Coward · · Score: 1

    You see, killbots have a preset kill limit. Knowing their weakness, I sent wave after wave of my own men at them until they reached their limit

    1. Re:killbots have a preset kill limit by Anonymous Coward · · Score: 0

      Good job stealing the comment made by AC earlier.

    2. Re:killbots have a preset kill limit by Anonymous Coward · · Score: 0

      And in reposting, you have received negative mods. It's not showing yet, but if I had to guess, it would be "Redundant." Smooth move, Jedi. /s

  4. This is why you use unsigned integers by Anonymous Coward · · Score: 0

    Then we would've just rolled to -39,999,999 calls. :D

    1. Re:This is why you use unsigned integers by Anonymous Coward · · Score: 0

      Derp. I meant to type signed, not unsigned. Brb, committing seppuku for my crimes against Interwebs.

    2. Re:This is why you use unsigned integers by mrchaotica · · Score: 1

      Brb, committing seppuku

      Wait, what?

      --

      "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

    3. Re:This is why you use unsigned integers by Anonymous Coward · · Score: 0

      I think he's off to figure out how to put some numbers into a grid.

  5. Wouldn't the operators notice this? by Anonymous Coward · · Score: 0

    Did the 911 operators just think it was a slow day for emergencies?

  6. OMG, we ran out of numbers by alen · · Score: 1

    sounds like some dummy read a best practices and conserve the resources book and made a column an interger data type instead of big interger. or whatever the corresponding names are for Oracle or non-MSSQL. some auto process or identity column creates the keys and it reached the max amount. and it wasn't set up to use negative numbers either

    1. Re:OMG, we ran out of numbers by Anonymous Coward · · Score: 0

      Unsigned byte: 255
      Unsigned short: 65535
      Unsigned long: ~4.2 billion

      It was not a data type error, but definitely a designer error of some sort.

  7. cities themselves have limited service. by nimbius · · Score: 4, Informative

    While you might find 911 service operable and efficient in the burbs, cash strapped cities with large populations like Miami run out of operators before they run out of capacity. dialing 911 in Cincinnati for example, or any other major city in the rust belt, results in a pre-recorded message instructing you to stay on the line and wait for the next available operator. Its a fun joke to make on sitcoms, but when you've actually in danger its not. Having been backed over on a motorcycle by a truck, I was at the mercy of this hold system for nearly 10 minutes in a busy downtown intersection.

    --
    Good people go to bed earlier.
  8. Re:Hint by Russ1642 · · Score: 1

    That's just about the most ridiculous thing I've heard here in a while.

  9. possible solution, fallback to voicemail by Anonymous Coward · · Score: 1

    Thank you for calling Springfield RescuePhone!
    if you know the name of the crime being committed, press one!
    To choose from a list of felonies press two!
    If you are being murdered or are calling from a rotary phone please stay on the line!

  10. Entirely preventable software error by Thanshin · · Score: 1

    An entirely preventable software error was responsible for causing 911 service to drop. "It could have been prevented. But it was not,"

    So, let us be clear. The error, was not simply preventable but absolutely and completely preventable in all cases. There was no impediment to prevent it. Its prevention was not only possible but also within the reach of any error prevention effort or action. It could have been prevented.

    The preventability of the error was absolute. No situation, fictive or factual, in this or other world, would allow a situation in which this error was not preventable.

    Finally, it's important to note that the eventual series of events that would lead to the fully avoidable non-prevention of this error, would be unfortunate.

    1. Re: Entirely preventable software error by cdrudge · · Score: 1

      All kidding aside, when I read that line I wondered what type of software error isn't preventable. There's things that are easily preventable and should be thought of, but ANYTHING is "entirely preventable".

    2. Re: Entirely preventable software error by danlip · · Score: 1

      Bad wording. Perhaps "obvious and preventable" would have been better (and why does the counter not go to at least max int, e.g. 2^31-1 = 2,147,483,647).

    3. Re: Entirely preventable software error by Anonymous Coward · · Score: 0

      We had this once. The problem was put on the yellow sticker but these kept falling off the wall. Problem that I reported to our scrum master many times, when we realized, that our finished user stories, were in fact, not finished. The problem was analysed and put on impediments list. We tried many things but after a year or two I lost interest and moved on to some other duties. Now it is all peace and flowers. The white dressed personal gives us all these nice little pills and we are all happy. Long gone are stress and violence of never ending two days demos followed by hours long retros and sprint planning with only hours left for actual development and test.

  11. preventable software error by tomhath · · Score: 1

    the FCC found that an entirely preventable software error was responsible for causing 911 service to drop

    As opposed to what? An entirely unpreventable software error? Sounds more like a configuration issue than a software error anyway.

  12. Did they try turning it on and off? by Anonymous Coward · · Score: 0

    always works for my Windows computers...

  13. Failure started at the Administrative level.... by Lumpy · · Score: 2

    It's the fault of the administrators to begin with. I am friends with one of the technical advisors for the midwest EOC and the problem is that the administrators dont know their ass from a hole in the ground and ignore their tech guys and listen to the vendors.

    He has been screaming for all call centers to have analog failover, but the administrators refuse to hear it.

    So who is to blame for the failures? That top moron of Homeland security. IT would have been in place if he would realize that he is not an expert and to actually LISTEN to the experts in the field.

    --
    Do not look at laser with remaining good eye.
    1. Re:Failure started at the Administrative level.... by uksv29 · · Score: 1

      I'm not convinced you need an analogue failover but you do need fully duplicated systems right down to the power subsystems and cables which you periodically switch between. There is no point having a backup if you don't use it on a regular schedule to be sure it is working properly.

      The solutions are not all technical, you have to be monitoring them properly with the right people who are motivated and properly trained. You also need the proper organisational processes .

      I've seen NOCs on emergency service networks where the staff on duty have been asleep or out of the room for long periods. Motivation, training and accountability are frequently not given the importance that they deserve.

    2. Re:Failure started at the Administrative level.... by uksv29 · · Score: 2

      I've been specifying emergency service systems for over 10 years, duplication, monitoring, management and processes are always at the top of the list.

    3. Re:Failure started at the Administrative level.... by Anonymous Coward · · Score: 0

      The all digital radio systems are 100% worthless in an full failure emergency. NYC hurricane, emergency towers went out and all fire and police comms dropped. using an analog failover they could have talked to each other over a simplex analog channel.

      Digital systems that require a central tower to operate are a failure waiting to happen.

    4. Re:Failure started at the Administrative level.... by Hognoxious · · Score: 1

      they could have talked to each other over a simplex analog channel.

      10-1 to that good buddy, got me a smoky on my six, what's your 20 come back?

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
  14. Typical government waste and inefficiency. by 140Mandak262Jamuna · · Score: 1
    11 million people, 24/7/365 etc works out to 96.360 billion citizen-hours per year. Assuming these 40 million calls all happened this year and each call took place one full hour each, it works out to a load factor of 0.04%. In other words citizens are NOT calling 911, 99.96% of their time. Since most 911 calls do not last an hour the load factor is even lower than that. This is how over built and inefficient government services are.

    If a private railroad owns rolling stock that would occupy, say 10 miles of track, but actually owns 2000 miles of track, it is no skin off your nose. Your taxes are not funding it. But if the government is running that railroad, we should restrict the total track length owned by the government to the actual track required by those rail cars and not an inch more.

    That is how we reduce the size of the government, reduce deficits and reduce taxes. When will America see the logic here?

    --
    sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
    1. Re:Typical government waste and inefficiency. by sribe · · Score: 1

      This is how over built and inefficient government services are.

      That was one of the most stupid nonsensical posts I have ever seen here. You calculated the "load factor" based on each of 11,000,000 people instead of on the number of 911 operators.

      And of course that's not even counting the fact that 911 services pretty much need to be provisioned to handle *peak* loads, not average (nor even median).

    2. Re:Typical government waste and inefficiency. by 140Mandak262Jamuna · · Score: 1

      I have failed miserably looks like. Even adding the bit about railway rolling stock did not help. Well, that is the problem when you speak with a tongue in the cheek. You end up chewing your own tongue.

      --
      sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
    3. Re:Typical government waste and inefficiency. by sribe · · Score: 1

      I have failed miserably looks like. Even adding the bit about railway rolling stock did not help. Well, that is the problem when you speak with a tongue in the cheek. You end up chewing your own tongue.

      That was meant to be tongue in cheek? Oh, OK then ;-)

      Problem is, it's election season, and what you said there was really not much different than some of the bullshit that we're inundated with nightly on TV commercials, and flyers in the mail. My favorite so far is the one accusing a Democrat of attempting to "replace Medicare with a completely government-run system". Uhhhmmm, excuse me???

    4. Re:Typical government waste and inefficiency. by Anonymous Coward · · Score: 0

      I had this before. You made a joke but ignorant take it for the words of wisdom. I think I take precautions ever since I made some references to Stalingrad while on the phone with German customer.

    5. Re:Typical government waste and inefficiency. by 140Mandak262Jamuna · · Score: 1

      I don't blame you, politicians, especially the Republicans have gone so far out, it is nearly impossible to parody them by exaggerating anything they say or do. No wonder Jon Stewart and Steven Colbert are reduce playing back their footage unmodified and make faces.

      --
      sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
  15. Re:Hint by Archtech · · Score: 1

    Don't do critical things in hastily-written, poorly designed software. Instead, take sufficient time and make the design and implementation robust. Tried and tested methods exist for all of this. (Consider avionics, for example).

    --
    I am sure that there are many other solipsists out there.
  16. This will have impacted the outcome of incidents by uksv29 · · Score: 1

    If calls are lost then help is delayed. This impacts the outcome of incidents.

    I'm not saying that people died because of this but I'm absolutely certain that there were some who suffered worse injury and losses because of the delays. Loss of 6,000 calls will result in a lot of hurt.

    Like so many other issues, it wasn't a single fault but a chain of events. In this case there was a software failure but the fault monitoring systems and support services failed to immediately note that there were no calls going through the affected systems. A change from 1,000 calls per hour to zero should be pretty obvious.

    They didn't appear to have a credible mitigation process to handle this sort of failure like diverting calls to another location. This could have been automated or manually initiated by the NOC operators.

    Shit happens in all systems, the important thing is how you deal with problems.

  17. I used to handle this... by Charliemopps · · Score: 2

    I used to work in the NOC for a large Telco and we'd handle 911 outages. Usually 911 goes down because the entire networks down. Like the switch failed, or the trunk from one area that leads to the area the 911 center is in would get cut. Most of this stuff is in a ring so there's usually an alternate route, but in some areas that's not physically possible. For example a remote mountain town with a single road in, would likely have its only trunk running along that same road and it'd get cut all the time as the road constantly needed repair. Chose where you live wisely.

    We'd handle this in different ways depending on the situation. For example, if we had 4 trunks that could handle 4X number of calls, and 3 got cut so it could only handle 1X, we could actually prioritize certain numbers so 911 and emergency services would get priority. If the trunk leading to the 911 center failed, we could do something like re-route the calls to the local police dispatcher who literally had no warning and would suddenly have their phone ringing off the hook. You may say "you should warn them!" but our policy was "Get it done" because who's dieing while you're arguing with the dispatcher about how her days going to suck?

    The most important skill you can have in any NOC is your ability to triage problems. That term comes from the medical world but it's just networking equipment... until you get into the situation I was in. And you're making triage decisions that could actually result in death. These were real engineers that really cared and did what they could. But when you have an area ravaged by hurricane and you tell the tech to put gas in generator 1 instead of 2, because you've been up for 30hrs strait... and a remote goes down so they can't call 911? I just couldn't detach myself from that. I took a pay cut to leave. A lot of people floated through that job, it wasn't just me. It takes a special kind of person that can detach themselves from the consequences of their decisions.

  18. Single Point Failure by PPH · · Score: 1

    For 911 services in 7 states? Set aside issues about the backup system for the moment (which may be a second server in the same data center): Why do all the 911 calls have to funnel through a single system? Emergency services are largely local. Not many people have to make 911 calls across a large region. So why isn't the energency call routing handled by local systems? And calls routed to local service centers?

    Even if there was a common software glitch in all the handling systems, I doubt everyone would hit some call limit in a database field simultaneously.

    --
    Have gnu, will travel.
    1. Re:Single Point Failure by Anonymous Coward · · Score: 0

      Actually there are mirrored data centers so a failure in a data center won't impact the call handling.

    2. Re:Single Point Failure by PPH · · Score: 1

      Actually there are mirrored data centers

      OK. That fixes an event that can take one data center out*. But why mirrored? Why aren't these systems distributed and colocated with the municipalities that they serve? And why a globally unique call ID that overflows at 4E7 records?

      * I helped develop a system that achieved its reliability through the use of distributed servers. And then had the IT people put all the servers in one rack, in one building. For cost savings, of course. The building sits right on top of the Seattle fault. So you'll have to excuse me when I don't take details like this for granted.

      --
      Have gnu, will travel.
  19. It will be fine by Anonymous Coward · · Score: 0

    The police are under no obligation to respond to 911 calls. Not accepting them due to a software problem just speeds up the process.

  20. Okay, I've been doing this for 30-something years by Trailer+Trash · · Score: 2

    And the number "40,000,000" doesn't come up on my list of "potential overflows to watch out for". What's special about 40 million?

  21. There's no need for people to own guns! by Anonymous Coward · · Score: 0

    All you have to do is dial 9... oh

  22. Or you can do like before there were phones... by Anonymous Coward · · Score: 0

    ...and prepare your life around the principle that nobody else is responsible for how you respond to things but you. It may still be troublesome if/when 911 goes down, but far less so when you realize that people lived for millenia without the telephone. And I say that as a heart patient who's been grateful for 911 in the past.

    1. Re:Or you can do like before there were phones... by umghhh · · Score: 1

      and they were dying like flies for millennia too. we have arranged and are paying for these services to help us out when need be. Some of those things that happen (fire or stroke) can kill or maim if not dealt with fast. This said it is indeed true that you need to accept that accidents happen also to such emergency centers. You can also expect that in this day and age such centers are handled in a way assuring redundancy. There many things that failed apparently. This is an occasion to improve. For some it is occasion to course and blame game (Putin would be a nice candidate to blame). For you this is just a statement of surprise why others are so shocked.

  23. I see a suspicious bag full of ebola-laced bombs by timothy · · Score: 1

    At least there are no current / recent worries what might make someone want to call 911 ...

    Actually, it's an interesting question -- just what is the threshold? Suspected ebola vomit? Suspicious bag on suburban street? Seems like it would be a very easy system to game, or even to unintentionally render useless. Takes a lot of goodwill and good behavior, all around.

    I took a CPR class last night, and the instructor (a firefighter in his dayjob) basically encouraged people to use 911 more, even for things about which the caller might be on the fence. "We'll sort it out. Don't call about your kid's math homework, but don't *not* call because you're not sure it's serious enough" was the gist.

    I've called 911 quite a few times, or asked others to (hypothermia case, guy on railroad tracks, gun shots, more gunshots, drunk drivers, etc), now I wonder what the mean / mode is for that ;)

    --
    jrnl: http://tinyurl.com/c2l8yr / foes: http://tinyurl.com/ckjno5
  24. I hate the word glitch by Anonymous Coward · · Score: 0

    It's a software error, stupid. It may be due to carelessness, oversight, lack or testing, or other reason. It may be trivial or catastrophic, but it is still an error. The implication is that it is harmless. If it's being reported, it certainly is not harmless.

  25. Re:Okay, I've been doing this for 30-something yea by danlip · · Score: 1

    As near as I can tell from the TFA somebody just put an arbitrary limit of 40 million in the code somewhere. Would be nice to have more technical details.

  26. Re:Okay, I've been doing this for 30-something yea by Anonymous Coward · · Score: 0

    Maybe a sequence value tied to a 32bit was shared with other tables resulting in 40 million not being the important number, just what was visible to the end user. Perhaps it was a sequence to a related but unknown table, say an auditing table, that overflowed - may have been a design intent that wasn't realized in engineering to have that table cycled every so often.

  27. Re:Hint by cyberchondriac · · Score: 1

    Well, what the gp suggests is pretty much impossible, but the truth is, software *is* the weakest link. OSes and apps are never or rarely bug free, and each new patch often introduces new bugs. Some things, like OSes, have tens of millions of lines of code, and where just a missing or misplaced semicolon can wreak random havoc, that's practically a guaranteed problem waiting to happen.
    Flexibility is proportional to complexity, and inversely proportional to reliability/stability. Dedicated hardware devices are the most stable, followed by firmware driven appliances. I've been in IT (repair, then administration) for about 20 years now, which is a fair amount of time, and I've seen far fewer hardware issues than I ever have with software.
    Then there are the issues of security; Windows, for one example, has been around for two and a half decades now, and there are still numerous bug fixes and security patches released on a weekly basis. It is permanently flawed.

    I sometimes wonder if it's really a language thing. I'm not a developer though. High level coding (that I'm aware of anyway) is in English, yet there is a pervasive attitude these days, even here at /. among IT and science people, against "grammar nazis", with the frequent defense of "you know what I meant". Well, computers don't know. And recently I've seen a surprising number of typos in SuSE/SLES conf file comments and whatnot.. it makes me wonder if developers are screwing up in the code too, syntactically.

    --

    Look back up at my post, now look back down, you're on the Internet. Now look back up. I'm a signature.
  28. I am a 911 systems expert by Anonymous Coward · · Score: 0

    I once joined an IBM business partner, where one of my first calls was to a police department in the Detroit suburbs. They wanted me to configure a printer on their AIX based high-availability 911 system. It all seemed odd. When I met the police chief he was very slow to shake my hand, and really left me hanging. He told the office people to watch everything I did - and not the "watch this expensive expert and learn!" sort of watching.

    It wasn't until years later that I became aware of the background check practice and realized they didn't actually need the printer configured. It was just a ruse to send me to do work for a police department, who would give me a thorough background check, courtesy of taxpayers.

    I did not drink the IBM Kool Aide, so the partner gig did not work out.

  29. i like this by Anonymous Coward · · Score: 0

    lmbo