Slashdot Mirror


Software Upgrade Crashes UK Air Traffic Control System

pitpe writes "Earlier today the computer system controlling most of the UK's airspace failed, after tests in preparation for an upgrade failed. The original failure occurred at the West Drayton centre, which is an old (70's) system, as opposed to the new system at Swanage, which has had its own problems. A system wide reboot to fix the system resulted in the entire system being taken down temporarily."

56 of 233 comments (clear)

  1. Damn... by Anonymous Coward · · Score: 2, Funny

    And I was going to put the blame on M$, but if it's a 70's system we're talking about I'll just shut up.

    1. Re:Damn... by not_a_product_id · · Score: 3, Funny

      Microsoft Air-traffic Control Software? Shudder...

      --

      ---
      We spoke for about a half an hour. I don't recall a thing we said. - Colorblind James Experience

    2. Re:Damn... by InShadows · · Score: 4, Funny

      Microsoft has been testing around with flight software since XP called MSFlight.. here's the article to prove it..

    3. Re:Damn... by Mr.+Bad+Example · · Score: 2, Funny

      > Microsoft Air-traffic Control Software? Shudder...

      It looks like you're trying to direct a pilot to land a plane. Would you like to:

      - Have the pilot land at Cirque du Soleil and tell him it's Denver International Airport?

      - Redirect the plane to Chicago, but send the luggage on to Orlando?

      - Adjust the ground level to send the plane and all aboard to fiery doom like that scene in Die Hard 2?

    4. Re:Damn... by LabRat007 · · Score: 2, Funny


      "Microsoft Air-traffic Control Software?" Shudder...

      I can see it now...


      Pilot: Air Traffic control (ATC) we've lost our nav con please advise!

      ATC: Please remain calm and click on start then control panel. Once in the control panel double-click(TM) on game controler - we need to reconfigure your yolk before we enter admin mode on the nav con system.

      Pilot: I don't see a |"start" button.

      ATC: Ok sir, I need you to find the windows key on your systems keyboard.

      Pilot: Whats a windows key?

      ATC: On most models the windows key is found on the lower left section of the keyboard. Occastionally it is found in the upper right section.

      Pilot: I still can't find it. I think we're lossing altitude.

      ATC: Just look key by key until you find it.

      Pilot: I don't think we have time for this.

      ATC: Just look sir.

      several minutes pass

      Pilot: Oh, there it is. Thats funny I was looking at it the whole time (nervous chuckle).

      Pilot: Hey, it doesnt do anything!

      ATC: What color is it sir?

      Pilot: Blue.

      ATC: Ok sir, I need you to find a function key of the same color. When pressed at the same time the desired effect will occure.

      Pilot: (Static)

      ATC: Sir?

      Pilot: (static)

      ATC: You seem to be off radar - sir are you there?

      --
      "Capital punishment makes the state into a murderer. Imprisonment makes the state into a gay dungeon-master"
    5. Re:Damn... by identity0 · · Score: 3, Funny

      Right, everyone knows that if you've got mission-critical, life-or-death systems that need ten nine's reliability, you should run them on iMacs.

      "I was directing air traffic on my ATC computer, giving a flight path to an KLM flight into Heathrow, when suddenly it was like *beep beep beep beep* and half of my planes were gone.

      "I was like, 'Uunh?'

      "It DEVOURED my flight path.

      "It was a really good flight path.

      "And then I had to send it again, and it wasn't as good 'cuase I had to do it fast. And the KLM crashed short of Heathrow.

      "It was... a bummer.

      "My name is Ellen Feiss, and I'm an air-traffic controller."

      Ph33r.

  2. Three fingers by panda+attack · · Score: 3, Funny

    Ah the good old reboot and hope for the best method :D

    1. Re:Three fingers by Fishstick · · Score: 4, Funny

      Hey, it worked for Samuel L Jackson...

      this is a UNIX system, I know this!!!

      --

      There is much cruelty in the universe, John.
      Yeah, we seem to have the tour map.

    2. Re:Three fingers by arkanes · · Score: 3, Informative

      Amazingly, it was also true. That was an SGI file browser running on Irix. See http://www.sgi.com/fun/freeware/3d_navigator.html

  3. Software doesn't rust... by Anonymous Coward · · Score: 5, Insightful

    "which is an old (70's) system". As long as it's not 30-year-old hardware then the software should still be fine. Why does everyone think that simply because software was written in the past it is bad?

    1. Re:Software doesn't rust... by Kredal · · Score: 2, Informative

      Because it IS old hardware.

      --
      Whoever stated that signature sizes should be limited to one hundred and twenty characters can just go ahead and kiss my
    2. Re:Software doesn't rust... by GoofyBoy · · Score: 5, Insightful

      >Why does everyone think that simply because software was written in the past it is bad?

      What is implied is that its being pushed to its limits. e.g. it was designed for 100 flights a day, when today there are 1200 flights a day.

      Those small things which you could get away with before start to become factors in usability and stability.

      --
      The surprise isn't how often we make bad choices; the surprise is how seldom they defeat us.
    3. Re:Software doesn't rust... by lennart78 · · Score: 3, Insightful

      Software written in the past also needs to be maintained. TFA states that testing for an upgrade and improvement to the West Drayton system was the cause of the problems.

      (Which I find strange, cause testing in a system as critical as this should be done in a separate environmnent.)

      I assume you've had no previous experience in maintaining a 'vintage' system like that? The code is probably written by a lot of different programmers, each with his own style, poorly documented, and thus very hard to read and understand.
      Software doesn't rust, but it clutters up and gets dirty over the years. It won't come apart by itself, but by the hands of a developer writing a necessary upgrade.

    4. Re:Software doesn't rust... by grub · · Score: 2, Interesting


      Yeah but the Y2K problem was "discovered" way back in the 70s. Banks doing 25 year mortgages in 1975 would extrapolate into 2000 and "whoops!" Any place which had Y2K problems gets no sympathy from me. :P

      --
      Trolling is a art,
    5. Re:Software doesn't rust... by Shimbo · · Score: 3, Informative

      "which is an old (70's) system". As long as it's not 30-year-old hardware then the software should still be fine. Why does everyone think that simply because software was written in the past it is bad?

      Sadly, it really is running on ~30 year old hardware, at least in part. I've spoken to some of the service engineers.

    6. Re:Software doesn't rust... by lennart78 · · Score: 2, Insightful

      I don't think the problem with the 'current generation' of programmers lies with their unwillingness to 'learn how to do some real programming'.
      Companies who develop software wanted the release yesterday. Development tools focus on allowing fast development of applications. M$ is taking a lot of criticism on itself by delaying Longhorn.

      You can no longer work on a product until 'it's done'. It has to ship, wether it's stable or not. If not, you issue a patch a week later. This is especially visible in games development.

      Companies tend to turn a blind eye to some aspects of life when it comes to making a buck, especially if no one's going to hold them responsible for it.
      I remember a quote from that horrible movie "Armageddon", where Steve Buscemi, sitting in a space shuttle of sorts, comments that he is sitting on 200.000 parts made by the lowest bidder, $BIGNUM gallons of fuel and a nuclear warhead. Makes you feel save heh?

      Who cares if you miss out on your holiday flight? Not the software manufacturer who just cashed in on his ATC-system...

    7. Re:Software doesn't rust... by Prendeghast · · Score: 4, Informative

      The 'fridge size boxes are 70's vintage (I suspect bits have been replaced over the years). The CPUs are only about five years old. The system consists of two identical computers for hot failover and they they had to get two custom CPUs made by the original manufacturer (IBM, I think) to deal with Y2K.

      As for the software? Written in some weird language called Jovial, and continually repatched - never rewritten.

      BTW, where the heck is Swanage? The new NATS center is in Swanwick!

    8. Re:Software doesn't rust... by general_re · · Score: 3, Informative
      ...Written in some weird language called Jovial...

      Muahaha. Languages from the stone-age. Jovial is an ancient semi-descendant of Algol, originally written especially for avionics systems. I'm not nearly old enough to have worked with it myself - Jovial's heyday was the mid-'70's or so - but I used to work with a couple of DoD greybeards who had done so, although even they hadn't touched the thing in years, as it's mostly been supplanted by Ada these days. The USAF can tell you a bit more about Jovial if you're having a slow day today ;)

      --
      ABSURDITY, n.: A statement or belief manifestly inconsistent with one's own opinion.
  4. Lucky in the US... by Kredal · · Score: 4, Informative

    Considering that up until about 2000, all of the major Air Traffic Control centers in the US were running on vacuum tubes, we were lucky nothing like this ever happened here. Sure, there were glitches at regional centers, that required controllers to do everything by hand, but nothing that required a full reboot of the entire country's ATC system.

    Hopefully the UK will get the new system tested and online before it causes more problems!

    --
    Whoever stated that signature sizes should be limited to one hundred and twenty characters can just go ahead and kiss my
    1. Re:Lucky in the US... by gowen · · Score: 2, Informative

      Its already back running (and has been since this morning, BST) Now the only delays are caused by clearing the backlog of grounded flights.

      --
      Athletic Scholarships to universities make as much sense as academic scholarships to sports teams.
    2. Re:Lucky in the US... by aldoman · · Score: 5, Informative

      There is 2 ATC centers in the UK - West Drayton which is for the 4 major London airports only (Heathrow, Standstead, Gatwick and London City). This is a 70s system and is due to be replaced by 2006. This is the one that crashed, but because a large percentage of UK air traffic is destined for London, then it caused the other one to go to a standstill.

      The other one at Swanage handles the ATC for everywhere else. This was replaced with a new system in 2002.

      But, by 2006 hopefully all ATC in the UK will be running on new systems.

    3. Re:Lucky in the US... by digrieze · · Score: 2

      "Considering that up until about 2000, all of the major Air Traffic Control centers in the US were running on vacuum tubes, we were lucky nothing like this ever happened here"

      That's a statement based on a totaly false premise. Simply because it was old or used tubes did not mean it was a bad design, in fact, the simplicity of the design and the shortness of the source code made it very a very easy system to debug and program. It wasn't as pretty, but it worked fine.

      The simple fact is that there's more bytes of code in most corporate spreadsheets than in the OS and application on those old systems. There is simply no way you can write the compact code on the newer 64 bit processors that you had on the old 8 bit units or debug or maintain them as easily, there are simply too many bytes and pipeline interactions. Sometimes newer IS worse, especially when you want to write reliable code.

      --
      It doesn't matter what you wrap your emotions around, Reality is a brick wall specifically designed to scramble eggs
    4. Re:Lucky in the US... by Anonymous Coward · · Score: 3, Informative

      Your regional centers cover about the same airspace as the West Drayton centre, so it is hardly fair to compare a US regional TRACON to the London FIR (comparing big regions to small countries....). Also, it wasn't a reboot of the entire countrys ATC system, it was a reboot of the host computer system at West Drayton, responsible for the printing of flight strips and processing of flight information at the Drayton centre - The radar at West Drayton was still functional and airports still have their own individual radar services, and this outage was only one FIR - the Scottish FIR was not technically affected by this crash, most of the effects were simply knock-on due to the fact that many flights simply pass into the London FIR or are destined to airports within that region, so you can hardly call this the "entire country's ATC system". Again, delays have been increased due to the route rotation most low cost carriers in the UK (and indeed the world) have been using; rather than have one aircraft doing on route, an aircraft may operate many different sectors during the day. Delays caused at one sector can cause other sectors and entirely different flights to be screwed up due to the lack of aircraft because of previous delays. You'll find that the majority of "mainstream" carriers delayed flights are clearing up quicker than those of the lo-co airlines.

      The only way this could be described as the entire countries ATC system is when you consider that London D&D (Distress and Diversion) is [in]conveniently located at West Drayton, and would be down during this outage. London D&D is responsible for monitoring 121.5 ("guard" frequency) across the whole country (Scottish and possibly Shannon FIR included) and becomes responsible for handling just about any aircraft emergency before handoff to local ATCO's.
      Whilst D&D wouldn't be affected technically, regulations require that the information provided by the HCS at West Drayton is always available to D&D, thus forcing a D&D shutdown in the event of the HCS crash. There is enough contingency to negate the use of D&D in an emergency, and handoff to local controllers immediately instead, but I don't think NATS and the CAA were willing to operate without D&D due to regulations and risk of litigation. Many GA flights (even those operating locally and un-filed under VFR) were also grounded due to the lack of D&D. It's a stupid system where one system needlessly relys on another then gets scared into submission when the other goes tits up.

    5. Re:Lucky in the US... by Anonymous Coward · · Score: 3, Informative

      This, like so many of today's reports (including the ones from the BBC), is inaccurate. England and Wales has 2 en-route ATC centres - that is centres that handle high-level traffic, outside the control of the terminal control areas round the airports. These are West Drayton and Swanick (not Swanage). The plan is that Swanick will handle all en-route traffic once flight data processing has moved from West Drayton. Swanick currently handles all en-route traffic control and routing, but if there is no flight data handling nothing can move.

      Swanick (formerly known as New EnRoute Centre - NERC) has just gone live after a very painful birth. This was largely because the system was based on the US AAS project which was cancelled before NERC was started.

  5. More problems... by Mz6 · · Score: 4, Informative
    I found a similar article on MSNBC.

    It seems they have been having problems with their computer systems since 2001 when it was "privatized".

    "The air traffic service has been beset by problems since it was partially privatized in 2001. A $484 million center at Swanwick in southern England opened five years late in 2002.

    The opening was delayed by problems with computer software, and the glitches continued for months afterward, as controllers misread aircraft altitudes and destinations because of hard-to-decipher computer screens. In at least one case, controllers mistook the Scottish city of Glasgow for Cardiff in Wales.

    Now.. that seems like a pretty big mistake for me.. especially for an air traffic controller to do. However, the article later states that:

    "Transport Secretary Alistair Darling said Thursday's problem did not lie at Swanwick but at the older West Drayton center, which is due to be closed by 2007."

    Thank goodness that old one is closing, however it doesn't sound like its replacement is doing any better!

    "If you want to know what is wrong with transport in this country it is that over decades successive governments did not spend enough on the infrastructure and air traffic control is no different," Darling told BBC radio."

    Excellent quote! While terrorism is on everyone's mind, we sometimes forget that safety of transportation should also be just as high. I couldn't imagine pilots relying on themselves to fly airplanes amid the thousands of others without the aid of traffic controllers and their computers.

    --
    Hmmm.
    1. Re:More problems... by Mr_Silver · · Score: 4, Interesting
      "If you want to know what is wrong with transport in this country it is that over decades successive governments did not spend enough on the infrastructure and air traffic control is no different," Darling told BBC radio."

      A dutch friend of mine once remarked that she didn't understand the mentality of the British. "You" she said, "have an amazing tendency to run things into the ground and then get around to fixing them rather than spending money on continually maintaining them so they never fall apart."

      It's a very good point.

      --
      Avantslash - View Slashdot cleanly on your mobile phone.
    2. Re:More problems... by Anonymous Coward · · Score: 5, Insightful
      A dutch friend of mine once remarked that she didn't understand the mentality of the British. "You" she said, "have an amazing tendency to run things into the ground and then get around to fixing them rather than spending money on continually maintaining them so they never fall apart."
      That's a pretty fair point. The trouble is, since 1977, politics has been obsessed with who can provide the biggest tax cuts, which has meant all the state funded / supported industries have gone to hell in a handcart.
      <rant>
      I blame Margaret fucking Thatcher, who let the hospitals fall apart and flogged off the viable bits of the infrastructure to her friends (at well below market value). [We're still feeling the effects of this on the railways, which the private sector has run into the ground] Corrupt old bitch.
    3. Re:More problems... by drwho · · Score: 2, Insightful
      A dutch friend of mine once remarked that she didn't understand the mentality of the British. "You" she said, "have an amazing tendency to run things into the ground and then get around to fixing them rather than spending money on continually maintaining them so they never fall apart."


      Funny, I noticed this about the U.S. system. But I figured it out. It has to do with the fact that civil maintenance is done by civil-service people with a union and a contract, while new equipment and construction contracts with companies are awarded to either the lowest bidder or some politically well connected company. So, the disposable roads/bridges and possibly airports system is a result of trying to contain socialist laborism. And I don't know how I feel about that.

    4. Re:More problems... by plopez · · Score: 2, Interesting

      The same is true in the private sector. No money for plant maintenence until something breaks and threatens a lucrative contract.

      Or the management mentality of 'Oh, security is too expensive right now we'll ship it and fix it later'.

      Politicians only look to the next election and managers only look to the next quarter. It is a typical attempt by non technical types to ignore entropy, expressed quite nicely in the old saying 'rust never sleeps.' If you want a bridge to last, paint it today, not after it has rusted out and collapsed. The analogy holds in many ways in many different areas.

      And we as voters and consumers let them get away with it.

      --
      putting the 'B' in LGBTQ+
    5. Re:More problems... by Anonymous Coward · · Score: 2, Informative
      Maggie is a paragon of Socialist Virtue.
      Care to tell us where all that North Sea Oil revenue has gone, then? Since 1997, public expenditure on health has risen by 50.6% Private spending has risen by 22.9% Which means Thatcher was more reliant on private money. Transport spending has risen by a similar amount.
      I doubt if Thatchers government would have blown nearly a billion quid on the Millenium Dome
      Well, no. That was Major's idea.
    6. Re:More problems... by Silburn_Luke · · Score: 3, Interesting

      To be fair (not that I hold any affection for Mrs T. in my heart) the rot stretches back a lot further than the 70s.

      I'd say the UK has been letting the infrastructure maintenance slide since at least WW2, maybe earlier. We inherited a fantastic installed base from the Victorians - the fact that it took 50 years of neglect to rot away is a tribute to how well they built - but the sad fact is this stuff was put together by a world-spanning Empire at the top of its game. What with paying for a couple of world wars and then trying to keep up Great Power appearances in the postwar world, we didn't have enough cash to keep this installed base up to scratch or replaced in anything like a timely fashion.

      Unfortunately what has taken 50 years to fall to pieces is likely to take about as long to put back together again and (I have it on very good authority) *that* is the real reason why Blair and Brown are so keen on PFIs, despite them being such a poor deal for UK plc in the long term. Its not because they are a cunning dodge to keep spending off the treasury books and plump up the bottom line numbers for the current electoral cycle (although that's a handy side-effect); its because they know that they or their like-minded successors cannot stay in charge for the decades that a full infrastructure overhaul is going to take and they want to make damned sure that nobody raids the infrastructure warchest after their watch.

      What one government gives another can take away after all, so from their perspective its no good kicking off a massive overhaul project now if a Conservative government is able to come along in a few years time and gut it for tax handouts just when its about to pay off. What handing out those juicy multi-decade PFI contracts does is lock in a powerful City-based constituency who will scream bloody murder if a future Chancellor tries to raid those revenue streams for a quick handout.

      It doesn't make much fiscal sense, but politically its quite astute.

      Regards
      Luke

      --
      #include witty_one_liner.h
  6. What WAS the System that crashed? by Spencerian · · Score: 4, Insightful

    Perhaps a person experienced in ATC software or hardware could enlighten us on the specific system in use, its OS and other trivial bits.

    It would help to reduce the coming surge of Microsoft jokes, which is very likely not relevant here.

    --
    Vos teneo officium eram periculosus ut vos recipero is.
    1. Re:What WAS the System that crashed? by orbitalia · · Score: 5, Informative
      Hi, I worked on exactly this system for 4 years.

      The hardware is an IBM 9020 family mainframe, the application is written in Jovial (one of , if not THE first algebraic language), and BAL assembler (for the monitor mostly). The monitor is the operating system so it effectively is a custom written operating system for this application.

      Although MVS is also used for testing. The I/O capabilities of the mainframe are superb which means it can handle 2000+ flights with only 14 Megs of RAM (if I remember rightly).

      I believe the NAS application came as a freebee from IBM when the UK purchased the hardware and was the same NAS (national airspace system) application used all over the US. It has been continously developed since then (no mean feat when you consider that all variables are global in Jovial, It uses holleriths instead of ascii, and you are limited to 5 or 6 characters per variable name). The hardware has also been upgraded several times over its lifetime.

      It doesn't often go down, last time was 2002 sometime, and you can tell how important it is because everyone screams when it does go down. The people I worked with are extremely dedicated to their job, but one cannot test a system like this for absolutely every eventuality. No doubt some patch was applied and some special case came up that caused a FLOP (functional loss of operation). It happens, Radar is usually unaffected, so the safety implications are not large, but flow is affected.

      The UK approach to handling NAS is much different to the US, the US tends to not touch the NAS software and develop external systems that enhance the usage of airspace, where as the UK tends to delve into NAS and improve things directly in NAS. Jovial is a very interesting language it has been used heavily by the US military and exists in such applications as Cruise missiles and many other aircraft and missile systems. Read about Jovial here if you are interested.

      I can't say too much about it for various NDA reasons (OSA) I think most of the above is in the public domain.

      HTH.

    2. Re:What WAS the System that crashed? by AlecC · · Score: 2, Informative

      It's an IBM 360/370 class mainframe: not sure what model. Somebody up the line said that the software was written in Jovial, which strikes me as very likely. Jovial was an Algol variant popular in defence/high reliability circles at about the time this lot was written.

      I think the system which crashed was only responsible for admitting new flight plans to the whole complex. Any flightplan already filed could carry on; it is just that no-one could file a new plan for the next flight.

      --
      Consciousness is an illusion caused by an excess of self consciousness.
    3. Re:What WAS the System that crashed? by orbitalia · · Score: 2, Interesting

      Oops I lost my link there - Jovial Lives!

  7. Links for reference by matthew.thompson · · Score: 4, Informative

    National Air Traffic Services http://www.nats.co.uk/services/index.html are the outfit responsible for this.

    They have a press release http://www.nats.co.uk/news/news_stories/2004_06_03 .html which explains quite nicely what they did and why.

    --
    Matt Thompson - Actuality - Insert product here.
  8. So what? by stratjakt · · Score: 5, Insightful

    There are redundant systems in place. Analog radar, humans with brains.

    At least there should be. Computers crash, break, have bugs, etc. They're a tool - a more efficient and convenient tool to be sure.

    But when they break, there are contingencies so that planes can still take off and land, and wont just fall out of the sky.

    This is also why Y2K was such a bunch of stupidity. We really aren't as reliant on computers as people think. We know they crash and are prepared to handle it when they do.

    --
    I don't need no instructions to know how to rock!!!!
    1. Re:So what? by Scooter · · Score: 3, Informative

      Hmm I don;t think there are those humans around. Cartinaly not in the quantities that would be required to manually guide the 1200 flights a day. We get dependent on the systems. We put the systems in because the load increases beyond the economic viability of an army of ATC guys, not to mention the communication overhead and possibility of error or mis communication. So we build a computer system to deal with it instead. That in turn allows us to up the load by an order of magnitude again. 30 years later, take the system away, and there's nothing.

      In scenarios like this, where load has increased whilst the computers systems were in place, we *are* reliant on them.

      Think of banks - time was when you had to almost plead on your knees to get a banck account, and they charged you for running it. This was becasue every account was written down manually in a book, and any calculations were performed by hoards of clerks. Then - computers. Now your new account is just one more record in a table somewhere, so the banks give out accounts to anyone who wants one, and do it for free. If for some reason your bank's computer system goes AWOL, there is no way they can process a month's interest calculations on the millions of balances and transactions - not to mention actually applying the transations that would now come in on bits of paper.

      I do agree that in a lot of cases, there remains a perfectly useable manual method, but where the computer system has enabled geometric increases in capacity over the manual system (which has been taken up) then, if you'll excuse the pun, it won't fly.

      You're right about the Y2k thing - I worked on a contract for a railway maintenance company in 1999 and the Y2K cordinator guy was tearing his hair out at the thousands of questions he got monthly such as "so, these nails, are they Y2K compliant?" He actually had solid steel track components called "chairs" that the rails sit on that had Y2K compliance stickers on them from the manufacturer. Presumably, they got fed up explaining it too, and decided it was easier to just stick the stickers on everything they made...

  9. Same in Ireland! by pixelbeat · · Score: 5, Informative

    Much the same thing happened last
    week in Dublin

  10. And the Wizard said: by the_twisted_pair · · Score: 5, Funny

    It appears you are trying to land a plane. Would you like to:

    [x] Allow Windows to detect new hardware ?
    [ ] Allow planes to circle in uncertainty ?

    [x] Show this window at all airports

  11. Re:This wouldn't have happened... by millahtime · · Score: 3, Insightful

    This wouldn't have happened had they been using Linux.

    This might have happened even if they were running linux. If the software that is used for the air traffic controlling was written badly it still could have crashed.

  12. This happened here in Houston about a month ago: by boschmorden · · Score: 2, Informative

    http://abclocal.go.com/ktrk/news/050404_local_airp ort.html

  13. Re:new linux distro idea by cpghost · · Score: 3, Funny

    Check out gflightcontrol-0.01, then run the usual:

    ./configure
    make
    make install
    make crash [plane] #optional

    Of course, it requires gnome 2.6 and all deps. Planes will have to circle while everything emerges.

    --
    cpghost at Cordula's Web.
  14. Re:This wouldn't have happened... by British · · Score: 3, Funny

    This wouldn't have happened had they been using Linux.

    No, the air traffic controllers would still be figuring out how to cut/copy/paste while a 747 is on it's final approach.

  15. It could have been alot worse... by b06r011 · · Score: 2, Interesting
    at least only the computers crashed

    as for the system crashing in the first place, it's unfortunate, but a good thing that they were able to cope and keep everyone safe - that's the main thing, right? (it's certainly my main concern)

    and as for the software not being up to the job, it may well not be. after all, air traffic has increased ever so slightly since the 1970's - is it reasonable to expect a program presumably designed for 70's hardware, and 70's air traffic loads to cope with heathrow in 2004?

  16. Swanwick not Swanage! by perly-king-69 · · Score: 4, Informative

    The new centre is at Swanwick in Hampshire, not Swanage in Dorset!!

    --

    --
    This sig is inoffensive.

  17. Swanwick, not Swanage! by Xilman · · Score: 4, Informative
    The new system is at Swanick near Southampton, not Swanage as posted here.

    Swanage is a pleasant little seaside resort. I know it well and stayed there a few nights when on my honeymoon.

    Finding Swanwick and Swanage on a map of southern England is left as a exercise. Hint: Mapquest may be a good place to start.

    Paul

    --
    Lasciate ogne speranza, voi ch'intrate
  18. Re:This wouldn't have happened... by LaCosaNostradamus · · Score: 3, Interesting

    Yes, I think that the software structure of a critical realtime system like ATC is much more important than which OS or language it's written in. It should be built like a strange composite stranded cable, with different strands of simple structure that can survive sporadic (even systemic) failure of its parts. In such a system, there should be no such thing as a system-wide reboot, since the only thing that is truly system-wide is the data.

    Without this structure, Linux would probably fail at an unacceptable rate too.

    --
    [You have a stable society when some nut guns down a schoolyard and the law doesn't change.]
  19. Re:new linux distro idea by Rakshasa+Taisab · · Score: 2, Funny

    Slackware has had ATC for years.

    --
    - These characters were randomly selected.
  20. Hang on a second... by Gordon+Bennett · · Score: 3, Interesting

    To quote from the NATS (National Air Traffic Services) press release:

    "The FDP was being tested overnight for a future upgrade. The system was successfully returned to service but at 06.03 errors were detected in the distribution of flight data between Centres. As a precaution, we decided to restart the FDP (known as a cold restart) causing an interruption to full service. The data processing system was restored at 06.42 and declared fully operational at 07.03. Flight capacity restrictions were lifted at 08.05. The system is now fully operational and we are confident that it is stable.

    Through the response team at West Drayton, we have been working with airports and airlines to clear the delayed departures, and expect the backlog to be cleared quickly.

    Our investigation into the cause of the problem is continuing."

    Let me get this straight: they ran a test on the FDP. The FDP glitched. They rebooted the FDP. They are still investigating the problem.
    Now, unless I am mistaken, I can only infer from their statement above that they are now running the FDP which is still susceptible to the problems highlighted by the test.

    1. Re:Hang on a second... by jrumney · · Score: 3, Insightful
      Now, unless I am mistaken, I can only infer from their statement above that they are now running the FDP which is still susceptible to the problems highlighted by the test.

      That's not the way I understand it. From their report, I understand the events went something like this:

      1. Overnight while it is quiet, new FDP software is brought online for testing.
      2. Testing was successful, and they brought the old FDP system back online (probably before 5:30 in time for the first arrivals at Heathrow).
      3. At 6:03 they noticed "errors in the distribution of flight data between Centres." I don't know what exactly this means, but if I had to guess, I'd say that the other Centre was not taken offline during the tests, and some test data leaked onto their live system as a result.
      So the problem is most likely not due to a bug in either the new or current software, more likely a bug in the testing procedure they were using.
  21. Golden rules.. by rf0 · · Score: 5, Insightful

    Never just test software upgrades on Live systems

    Rus

    1. Re:Golden rules.. by hughk · · Score: 2, Interesting

      A big bank did this, only they thought it was in UAT, however it registered itself with production as wanting to collect equity trades. It did, and very well too. They realised by the end of the day that the production backoffice was only seeing a fraction of the number of trades expected. Some poor bastard then had to trawl through the UAT database pulling out trades that were really intended to go to production and put them in the right place. I heard it took a couple of weeks. This is a shame because trades usually must be settled two business days after trading.

      --
      See my journal, I write things there
  22. Re:I always wondered... by YrWrstNtmr · · Score: 2, Interesting

    Yes they did, and no, using a cell phone is not a certainty to cause problems.

    It does, however, carry the potential to introduce errors in various systems.
    Would you want the altimeter to read 200 feet too high, or have an uncommanded left turn, because some numbnuts is yakking on the cellphone?

    "DC-9 flight crew experienced an involuntary turn by the autopilot during cruise. Autopilot reacted normally after the captain asked passengers to turn off any personal electronic devices. Crew later learned that a cell phone in an overhead bin was heard during the time of the autopilot problem."

  23. Windows Update by Anonymous Coward · · Score: 3, Funny

    Unconfirmed reports are stating that aparently one of the air traffic controllers accidently clicked on the "Windows update" icon. :P

  24. ATC software is scary (aka, Know Your Userbase) by Knowbuddy · · Score: 4, Insightful

    My dad is helping the FAA and the US military design and roll out the next gen ATC software here in the US. He comes home and tells stories that make my skin crawl.

    The first version of the software was built using standard current interface guidelines and widgets and the testing group that had no experience with older ATC systems were wowed at how simple and yet powerful it was. Pretty much any random person off the street could look at the screen and easily figure out what was going on and how to do various basic tasks. When that version was demoed to the ATC union the union freaked out at how different it was and thus began a cycle of making it more and more backwards.

    So, nowadays the next gen ATC software almost exactly replicates the UI of the old non-computerized and semi-computerized systems. On-screen toggle switches and dials, that sort of thing. The FAA and the ATC union have decided that retraining all of their ATCs to use modern computer interfaces would be a Bad Thing. When the computer screen doesn't exactly replicate the interface of the 50+-year-old systems, they freak out and scream bloody murder. On the flip side, kids coming into the field today that have been using computers most of their lives are finding the interface to be counterintuitive to the point of being almost unusable. Middle-aged workers who are both highly proficient ATCs and home computer users report that switching between the two types of interfaces each night when they go home requires conscious effort on their part, since they are so orthogonal.

    So who wins? Historical inertia, of course. Why fix the problem today when you can wait for your successors to fix it in 25 years?