Slashdot Mirror


The London Stock Exchange Goes Down For Whole Day

Colin Smith writes "TradElect, the Microsoft .Net based trading platform for the London Stock Exchange, was offline for about seven hours, meaning that their 5-nines SLAs are shot for approximately the next 100 years. The TradElect system was launched back in June of 2007 and was designed for increased speed and system capacity."

175 of 792 comments (clear)

  1. The London Stock Exchange Goes Down For Whole Day by Anonymous Coward · · Score: 5, Funny

    ...now if only my wife would do that! /rimshot!

  2. That's okay by sokoban · · Score: 5, Funny

    most of the american stock exchanges have been going down all year.

    --
    09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0 is the magic number.
    1. Re:That's okay by morgan_greywolf · · Score: 5, Funny

      most of the american stock exchanges have been going down all year

      My wife did that once. Nearly killed me. Come to think of it, it was just after we signed up for the life insurance ...

    2. Re:That's okay by Xugumad · · Score: 5, Funny

      Ya laugh, but I've been trying to describe this to people all day...

      "The FTSE has crashed!"
      "What, like another Black Monday?"
      "No, no, crashed, as in gone down!"
      "Errr..."

    3. Re:That's okay by Kreigaffe · · Score: 2, Informative

      I heard it was more like 37 times.

      --
      ... still waiting for this free-as-in-beer free beer I keep hearing about. :|
  3. 99.9967% Uptime if up the next 100 years by xmas2003 · · Score: 5, Informative

    Assuming 8.5 hour trading day (0700-1530) and 250 trading days/year. Maybe a squirrel caused the problem ... ;-)

    --
    Hulk SMASH Celiac Disease
    1. Re:99.9967% Uptime if up the next 100 years by Anonymous+Crowhead · · Score: 5, Funny

      Yeah, because they turn it on when trading starts and turn it off when trading ends.

    2. Re:99.9967% Uptime if up the next 100 years by The+Clockwork+Troll · · Score: 2, Funny

      Turn those machines back on!

      Turn Those Machines Back ON!

      --

      There are no karma whores, only moderation johns
  4. 100 years? by Anonymous Coward · · Score: 3, Insightful

    5 nines does not mean what you think it means.

    1. Re:100 years? by ShadowBlasko · · Score: 4, Funny

      99.999% uptime is something different?
      Guess that depends on what hours it is supposed to be working doesn't it?
      c/o User Friendly

      "Sid, Stef
      - Stef: How reliable is our network?
      Sid: As far as our customers are concerned, five nines.

      Stef: What does "five nines" mean?
      Sid: 99.999% uptime.

      Sid: Wait... Why?!
      Stef: So would "reliable to nine fives" in our newspaper ad be not very good?"

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order- Ed Howdershelt Via Tass
    2. Re:100 years? by julesh · · Score: 5, Informative

      5 nines does not mean what you think it means.

      No, you're right. By my calculation, the actual figure is more like 360 years.

      (Remember, this is a system that only operates 7.5 hours per day, 250 days per year)

    3. Re:100 years? by powerlord · · Score: 5, Funny

      Nah.

      They'll be back at "5 nines" by next week.

      The trick is to either redefine what the term means (so they are actually referring to 9.9999% uptime), or the timeframe (we've been at "5 nines" for the whole year" - said Jan 1 2009), or both ("so, we use 1 day as a data point, then if we've been up for any part of that day, we're good... so we've always operated at '5 nines' reliability")

      --
      This space for rent. All reasonable inquiries will be entertained at proprietors discretion.
    4. Re:100 years? by Anonymous Coward · · Score: 3, Funny

      Just different decimal point, Windows almost always has better than 9.9999% uptime

    5. Re:100 years? by ThePhilips · · Score: 4, Interesting

      In business, generally it means that solution provider (software + hardware) bears direct responsibility for all unplanned downtimes.

      If solution cannot provide such service availability, the solution provider has to be ready to cover all the damages. And it is often planned that way from day one: some downtimes are covers by the "5 nines", some are covered monetarily by solution providers.

      That's why 5 nines solutions cost as much as they cost: on one side to allow providers to bring quality of solution to desired level, on another side, in case of emergency, to let them to cover some downtimes with money.

      But covering seven(!) hours(!!) can be lethal to the solution provider. But again, it all depends on their support contract. Some (cheaper) 5 nines are delivered without any guarantees: they only theoretically 5 nines and provide only "best effort" service availability.

      --
      All hope abandon ye who enter here.
    6. Re:100 years? by RiotingPacifist · · Score: 4, Funny

      which vista version are you using?

      --
      IranAir Flight 655 never forget!
    7. Re:100 years? by ObsessiveMathsFreak · · Score: 4, Insightful

      It's called framing and it is making public debate in western society increasingly difficult.

      --
      May the Maths Be with you!
  5. Oh, my. by cp.tar · · Score: 4, Interesting

    So what happens when this happens again?

    --
    Ignore this signature. By order.
    1. Re:Oh, my. by gnick · · Score: 5, Funny

      The same thing that happened this time?

      --
      He's getting rather old, but he's a good mouse.
    2. Re:Oh, my. by Eudial · · Score: 5, Funny

      So what happens when this happens again?

      Well, first "Have you tried turning it off and on again?"
      Otherwise, "Are you sure it's plugged in?"

      --
      GAAH! MY PRINTER IS ON FIRE!!! PUT IT OUT! PUT IT OUT!
    3. Re:Oh, my. by im_thatoneguy · · Score: 5, Interesting

      Actually this is "again".

      The LSE used to run on HP-NonStop (w/ Cobol and C as far as I can find) but still managed to take itself down for 8 hours in 2000.

      If they're going to go down for a day every 7-8 years it might as well be cheaper and faster. (Articles quote the CTO as citing 10x performance increases).

      (All based on a quick google search)

      So before the hounds descend upon Microsoft it would seem the LSE has a history managing to bring down whatever system they run on.

    4. Re:Oh, my. by Coraon · · Score: 5, Insightful

      Followed by the youngest member of the team becoming the scape goat and being fired.

      --
      -Ours is the wisdom of Solomon, the magic of Merlyn, the fall of Icaris.
    5. Re:Oh, my. by Darkness404 · · Score: 4, Funny

      But new computer systems should make things more reliable, along with more experienced coders and better languages.

      --
      Taxation is legalized theft, no more, no less.
    6. Re:Oh, my. by im_thatoneguy · · Score: 5, Informative

      Which from the sounds of this article http://www.computerweekly.com/Articles/2008/06/12/231031/agile-trading-software-critical-to-london-stock-exchange.htm was the intent.

      One very interesting note is at the end of the article:

      Timeline for Tradelect upgrades

      18 June 2007: Tradelect launched, reducing the time taken to process trades from 140 milliseconds to 10 milliseconds. Capacity increased from 593 to 2,500 orders a second.

      November 2007: Version 2 upgrade. Trading time reduced from 10 milliseconds to about 6 milliseconds. Capacity increased by 70% from 2,500 to 4,200 orders a second. Introduced full suite of Mifid-compliant services.

      September 2008: Planned migration of Italian trades to Tradelect platform.

      September 2008: Tradelect Version 2 to launch. Plans to double trading capacity to 10,000 continuous messages per second. Aims to cut average time taken to complete a trade by half from 6 milliseconds to 3 milliseconds.

      Coincidence that this month was when they intended to release a new version?

    7. Re:Oh, my. by cp.tar · · Score: 3, Insightful

      Ah. Some blamecasting, after which everybody pretends it had never happened?

      --
      Ignore this signature. By order.
    8. Re:Oh, my. by GigaplexNZ · · Score: 4, Insightful

      These "better languages" are easier to use which allows for less experienced coders to perform the tasks. This is not an ideal world we live in.

    9. Re:Oh, my. by PitaBred · · Score: 4, Funny

      Well, it looks like it's hosed. You should probably reinstall the OS.

    10. Re:Oh, my. by cp.tar · · Score: 5, Funny

      Well, that gives a new meaning to opening Windows to Dungeon Dimensions.

      --
      Ignore this signature. By order.
    11. Re:Oh, my. by Anonymous Coward · · Score: 3, Interesting

      How the hell can you 'complete' a trade in 3 ms?

      Is there a hard drive out there that can even complete a write in 3ms?

    12. Re:Oh, my. by mrjb · · Score: 5, Insightful

      These "better languages" are easier to use which allows for less experienced coders to perform the tasks.

      I couldn't disagree more. Although automatic garbage collection is nice, this doesn't mean that you'll get "five nines uptime" systems by working with "less experienced" coders.

      If you're building a system that must guarantee 999.99% uptime, you wait until your best professionals become available, because it doesn't only involve code. You DON'T give the job to the less experienced ones, no matter how great the programming language. Five nines uptime requires a very robust design and very solid code quality running on a very solid platform which is running on a very solid OS on a very solid infrastructure. You'll want everything to be tested by unit tests, integration tests, regression tests, and whatnot. That involves a whole lot more than 'just' coders, but whoever works on it, they better be good at it.

      --
      Visit http://ringbreak.dnd.utwente.nl/~mrjb/growingbettersoftware to download your free copy of the book
    13. Re:Oh, my. by gbjbaanb · · Score: 2, Insightful

      yeah, 'cos that's what we really need - less experienced coders.

      Look on the bright side, they're probably much cheaper.

    14. Re:Oh, my. by gbjbaanb · · Score: 3, Interesting

      I think its all about network latency - ie the marketing machine says 3ms, but they are referring to the time taken to get the message to the stock exchange's switch.

      A Computer Weekly article (and its first link) explains it - basically, they replaced the old networks with new fibre-based ones and colocated servers for brokerages.

    15. Re:Oh, my. by bhtooefr · · Score: 4, Funny

      999.99% uptime: The system never crashes, and after you turn it off, it keeps running 9.9999 times as long as you had it running.

    16. Re:Oh, my. by e2d2 · · Score: 2, Insightful

      This is a funny ha ha right? Because it's like saying we should do away with new glass displays in airplanes because they make pilots lazy.

      Apologies if it's a joke but every time someone makes a coding error someone on here has to pull out the "back in my day we were men and coded without errors because the system exploded!" card. That's nonsense. Garbage in garbage out. A tool is only as good as it's user. It's the responsibility of the developer to use the tool properly.

    17. Re:Oh, my. by mr_mischief · · Score: 2, Insightful

      No, but there are likely plenty of programmers with 20+ years of experience who have learned .Net at some point in the last 5 years. The language experience isn't as important as the programming experience, despite what the headhunters think.

    18. Re:Oh, my. by Brigadier · · Score: 2, Informative

      unless they are union in which case it would be the last hire.

    19. Re:Oh, my. by tnk1 · · Score: 2, Funny

      and better languages.

      What? You mean Hindi?

    20. Re:Oh, my. by RiotingPacifist · · Score: 2, Funny

      Well, it looks like it's hosed. You should probably changethe OS.

      fixed

      --
      IranAir Flight 655 never forget!
    21. Re:Oh, my. by catmistake · · Score: 2, Funny

      You missed a step. Forgot "Try reinstalling the operating system and see if that helps." Then, when 6 hours after starting the installation the poor bastard that's still downloading updates realizes the system has already been compromised for at least the last 4 hours, begins screaming and yelling incoherently, he is escorted off the premises. Scrub, rinse, repeat.

    22. Re:Oh, my. by afidel · · Score: 2, Interesting

      WTF, 4,200 orders a second, is that some kind of joke? NYSE did 28,000 per second during the credit crunch last fall and they were designing for 64,000 by year end (can't find any info on whether they got there but I don't doubt it).

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    23. Re:Oh, my. by Hal_Porter · · Score: 2, Funny

      Ah. Some blamecasting, after which everybody pretends it had never happened?

      Everyone pretends what never happened?

      --
      echo -e 'global _start\n _start:\n mov eax, 2\n int 80h\n jmp _start' > a.asm; nasm a.asm -f elf; ld a.o -o a;
    24. Re:Oh, my. by tezza · · Score: 2, Informative

      I work as a contractor in London, which has a trading platform (not LSE).

      "Complete" is an industry term. They use "complete", "cross" and "trade" in the same way.

      What it means is:

      * that there is currently a set of offers and asks on the exchange. Other people have submitted those already
      * you want to buy/sell one of these as appropriate
      * you send down your legally binding request for selling/buying that amount
      * the matching algorithm sees that your sell/buy matches the buy/sell on the exchange
      * that particular offer is then "yours" and no-one else can have it.
      * you are notified of the success of your "trade"/"cross" which is "complete"

      So the "match" occurs in 3ms. This is important, because otherwise you have to wait longer to know if you need to look elsewhere. There's more to it of course, this is just a mega-simple case.

      --
      [% slash_sig_val.text %]
  6. Ugly Day by pyite · · Score: 4, Informative

    It was an ugly day of finger-pointing and near-fixes, but in the end, it just left all the financial firms standing there staring at the Exchange. Definitely was a big deal--and it seemed like a lot of volume spilled over to US markets, creating volume related issues here.

    --

    "Nature doesn't care how smart you are. You can still be wrong." - Richard Feynman

  7. MS should hurry up and patent.... by 3seas · · Score: 4, Funny

    .... a method of controlling the market.

  8. Patch Tuesday by caluml · · Score: 5, Funny

    But Patch Tuesday is tomorrow?

    1. Re:Patch Tuesday by Anarke_Incarnate · · Score: 2, Funny

      Is it "Talk like an Ass-Pirate Day" already? Wow thailor, I mutht've forgotten all about it, you thilly gooth.

  9. Reliability? by R2.0 · · Score: 5, Funny

    Looks like someone needs to brush up on their buzzwords, specifically "mission critical" and "services no longer required".

    --
    "As God is my witness, I thought turkeys could fly." A. Carlson
    1. Re:Reliability? by syousef · · Score: 4, Funny

      Looks like someone needs to brush up on their buzzwords, specifically "mission critical" and "services no longer required".

      More like "Would you like fries with that?" and "Would you like to upsize?"

      --
      These posts express my own personal views, not those of my employer
  10. Re:That's some strange math... by pyite · · Score: 5, Insightful

    Since when is 7 hours even close to "a whole day"? Maybe you meant "almost a whole business day"?

    It's a whole trading day--and that's all that really matters when it comes to a major market.

    --

    "Nature doesn't care how smart you are. You can still be wrong." - Richard Feynman

  11. single page by Anonymous Coward · · Score: 5, Informative

    I wish people would get into the habit of linking to the single page version of the FA.

  12. Misleading summary by denoir · · Score: 5, Informative

    The summary implies that TradElect was responsible for the shutdown, but according to the stock exchange itself, it wasn't the case. They say instead it was a network problem.

    1. Re:Misleading summary by Anonymous Coward · · Score: 2, Funny

      See, that's why they should have used J2EE instead. If they had, no one would have been able to tell the difference between the network working and the normal slow crawl that Java users are accustomed to.

      (Seriously, though, the summary needs to be rewritten. Slashdot is blaming TradeElect as a proxy for Microsoft due to a, uh, slight anti-Microsoft bias.)

    2. Re:Misleading summary by tgatliff · · Score: 5, Insightful

      Why the heck they were using MS Windows for this type of environment is stunning... Transactional processing which is the bulk of this type of setup is where Solaris and Linux excel. Any company that builds a system like that on .Net should be thown out on the street.

      In short.. Not to rock on Windows, but different platforms always offer different strengths..

    3. Re:Misleading summary by Hyppy · · Score: 4, Informative

      if it was a network problem, then they're in more trouble than the summary implies. It's relatively simple to get 100% uptime (minus a dropped packet or two) in a network. The key here is redundancy. If you throw enough hardware at it, yes, it will not break.

      Internal? Dual(+) homed servers, redundant switches, redundant AC, redundant power.
      External? BGP on 2 or more transits on separate physical runs.

      What, you say that you need to account for natural disasters? Then get a second site, at least a few hundred miles away, and repeat.

      Virtual 100% uptime is a solved problem in the networking world.

    4. Re:Misleading summary by japhering · · Score: 5, Insightful

      As is normally the case M$ threw lots of money at the exchange to get it to switch unix/linux base to windows net so that M$ can tout that a major exchange is running windows.

      Full page ads touting the switch and the reasons they cited were better through put and better up time.

      They even had ads touting it here on /.

    5. Re:Misleading summary by tgatliff · · Score: 4, Insightful

      No... Actually I deal with this everyday. Windows is great for places where you need desktop apps or such. It also does well when you must have generic developers for web development.

      Where Unix/Linux/BSD truly shines is on back office type transactional processing. There are many reasons for this, and have a long history at doing exactly this. Meaning, mainframes may not have every been considered sexy, but they ran critical systems in companies for decades with very little problems... Actually they built such a reputation that when they failed most instantly assumed it was a hardware failure... Working on them, however, takes a more polished developer...

    6. Re:Misleading summary by caluml · · Score: 5, Interesting
      Although:

      The Johannesburg Stock Exchange, which uses the LSE's trading platform TradElect, also suspended trading.

      Hmm. Smells like a new version to me.

    7. Re:Misleading summary by oliverthered · · Score: 3, Insightful

      The point is you shouldn't be running mission critical systems on new and shiney (it's bound to have bugs) you should be running it on old and reliable (or at least where the bugs and workarounds are well known)

      --
      thank God the internet isn't a human right.
    8. Re:Misleading summary by IgnoramusMaximus · · Score: 5, Informative

      Any company that includes Linux is RTP/STP should go out in the street with them. Though at least you got Solaris correct.

      You have no clue. When people mention Linux in these environments they mean Linux running on one of these, not a home-brew distro running on a $150 PC.

    9. Re:Misleading summary by Cillian · · Score: 4, Informative

      Everybody wasn't dropped. A few people had issues, and so they had to completely stop trading, else the people without issues had an unfair advantage.

      --
      -- All your booze are belong to us.
    10. Re:Misleading summary by mashade · · Score: 5, Interesting

      Yep, I remembered and laughed so hard I had to put the images next to each other:

      http://tipotheday.com/2008/09/08/microsofts-foot-in-mouth-london-stock-exchange/

      --
      Technology tips and tricks.
    11. Re:Misleading summary by mabhatter654 · · Score: 3, Insightful

      I thought unfair advantage was the whole point of capitalism...I have it you don't! what kind of communists run the place?

    12. Re:Misleading summary by Angostura · · Score: 2, Interesting

      Follow that logic rigorously and all mission-critical systems would be running on abacus.

    13. Re:Misleading summary by grassy_knoll · · Score: 2, Informative

      Well another poster pointed out this story with a juicy quote:

      The stock exchange realised it had a problem at 9.15am this morning and has been working since then to identify and fix the problem.

      A source close to the company said an upgrade had gone wrong. The stock exchange would not comment.

    14. Re:Misleading summary by tgatliff · · Score: 2, Insightful

      I certainly understand your point, but quality of developers are only half the picture. The OS design structure of Windows and Unix is just different, and both have their strengths and weaknesses. I am excluding OSX in this argument for several reasons...

      The Unix development approach is allot better at appliance like implementations. Meaning, you build it for a specific job (batch transactional processing in this case), and you treat the hardware as an appliance. You also limit ways a user can interact with the system, so patching is rare to never. Video interaction is rare, and even shell interactions are console menu based and/or web based. The prevents users from tinkering, and prevents the ability to exploit any vulnerabilities that may exist.

      The Windows development approach is more more flexible in nature. Meaning, primarily keyboard and mouse user interaction is guaranteed. The possibility of somehow getting a virus must be taken into consideration. Most interactions is RDC and/or VNC based. Web based servicing sometimes occurs, but Windows is impractical to build in an appliance fashion typically...

      I hope this Helps..

  13. Re:Still don't know why... by eggoeater · · Score: 4, Insightful

    Agreed. It's a bit of flame-bait mentioning them in the summary when the exchange is being tight-lipped about what the root-cause is (if they even know at this point.) I do a lot of .NET stuff and, like other platforms eg. Java, there's many things that could cause problems, like plain old programming bugs.

  14. Re:The London Stock Exchange Goes Down For Whole D by east+coast · · Score: 5, Funny

    Oh, she does... just not with you.

    nudge nudge, wink wink.

    --
    Dedicated Cthulhu Cultist since 4523 BC.
  15. Re:Good lord, they're running on Windows? Why? by fotbr · · Score: 2, Insightful

    Perhaps the bit you're missing is that windows isn't quite as bad as the /. crowd likes to say it is. Especially if its an older (translation: fixed & stable) variety like win2k or even nt4.

  16. performed as expected... by markana · · Score: 5, Funny

    "and was designed for increased speed and system capacity"

    and see - it went down far faster and more completely than the previous system would have been able to. So that's progress. It's all in how you present it.

  17. Potentially misleading summary by Reality+Master+201 · · Score: 3, Insightful

    The exchange insists the problem was connectivity, not the trading platform.

    Not to sound overly cynical, but I'd hardly expect them to acknowledge the problem if it were the trading platform that was the issue. That'd kind of be business suicide.

    1. Re:Potentially misleading summary by Angostura · · Score: 5, Informative

      Well, the Reuters article does say that trading started normally, but some traders were unable to connect, so the whole exchange was bought down to avoid unfair advantage/disadvantage occurring, so actually both stories are consistent.

  18. 5 nines? by andreyvul · · Score: 5, Funny

    So their 9.9999% uptime is screwed?

    --
    proud caffeine whore
    1. Re:5 nines? by pancake_lover · · Score: 5, Funny

      Maybe they should shoot for 9 fives instead. When the problem is too hard, just lower the goal posts.

      --
      Homer no function beer well without.
  19. Re:Good lord, they're running on Windows? Why? by Hyppy · · Score: 5, Insightful

    Perhaps the bit you're missing is that windows isn't quite as bad as the /. crowd likes to say it is. Especially if its an older (translation: fixed & stable) variety like win2k or even nt4.

    I'm not sure if you're serious or not, but surely you aren't trying to compare NT4 uptime with the 5 9s of a solid System z platform?

  20. Re:Good lord, they're running on Windows? Why? by erroneus · · Score: 5, Insightful

    Oh please. Persuasive marketers can get Windows installed just about anywhere including US war ships.

    While it is commonly accepted by many techies (and strongly denied by others) that Microsoft Windows is not a suitable platform for that level of computing, sales people often bypass the techies who know better and sell to managers and executives who still believe "you can't get fired for using Microsoft."

    With all this said, it will be quite some time (and possibly never) that we will ever know for certain what is at the root cause of the failure. You can be sure that Microsoft is all over this problem both technically and P.R.-wise. They won't let the facts get out if they are damaging. Recall the major power outage that many still believe was caused by a worm attacking Microsoft servers? As far as I can see, the true cause of that failure has yet to be revealed.

    But if this was a planned event, or an unplanned disaster resulting from a planned event gone bad (updates, upgrade, other maintenance), you would think they would have provided for mishaps in some way or another.

    But as this news story is all I have to go on, there is no indication of cause and so I will not presume this is a Microsoft problem. But it says a lot that NYSE runs on Linux and not Microsoft. It seems SOMEONE did listen to the techies.

  21. Nothing taxes can't fix by heroine · · Score: 5, Funny

    After the malfunction, TradElect was immediately bought by UK's government for $200 billion and all its debts waved. In an unrelated story, medicare tax was raised yet again because of an unexpected shortfall.

  22. .NET clearly to blame...... by heffrey · · Score: 3, Insightful

    .....I mean, there couldn't be any other possible cause for the problem.

    1. Re:.NET clearly to blame...... by The+End+Of+Days · · Score: 3, Funny

      It's impolite to interrupt all these people while they're masturbating.

  23. Re:The London Stock Exchange Goes Down For Whole D by cp.tar · · Score: 3, Funny

    Oh, she does... just not with you. nudge nudge, wink wink.

    Your wife -- does she go?

    More importantly, does she run?
    More specifically, does she run Linux?

    --
    Ignore this signature. By order.
  24. Re:Good lord, they're running on Windows? Why? by ArhcAngel · · Score: 2, Interesting

    Perhaps the bit you're missing is that windows isn't quite as bad as the /. crowd likes to say it is. Especially if its an older (translation: fixed & stable) variety like win2k or even nt4.

    A) Yes, in fact, it is quite that bad (just not as bad as when it was first released) and

    B) There is no "fixed and stable" version of .NET yet. At least none I would hinge my mission critical business on.

    --
    "A person is smart. People are dumb, panicky dangerous animals and you know it." - K
  25. What, no ads? by Bert64 · · Score: 5, Funny

    Does anyone else remember the "The london stock exchange chose windows 2003 for reliability, they didn't choose linux" ad banners that used to run all over the place, including slashdot if i remember?
    Funny how it's all come crashing down...

    "The london stock exchange chose windows, but after 7 hours of downtime wishes they had chosen linux".

    --
    http://spamdecoy.net - free throwaway anonymous email - avoid spam!
  26. 5-nines SLA by skeeto · · Score: 4, Informative

    "5-nines SLA"

    I had to look this up, so I imagine other people didn't know it either (I thought was was a stock exchange term). First Google search result reveals the answer,

    The Battle With "3 Nines" and The Goal of "5 Nines"

    1. Re:5-nines SLA by glwtta · · Score: 2, Insightful

      This is a good post, I had no idea what it meant.

      Yeah, me neither! If only we also had this "Google" thing.

      --
      sic transit gloria mundi
  27. Re:How many failures before.. by KernelMuncher · · Score: 5, Informative

    When I worked in academia I used to collaborate on a research project with a data architect from one of the major electronic exchanges. His whole shop is MS and .NET. I asked him why he didn't run Linux / Unix. He said that with competent guys the MS boxes had great uptime. Wall Street can afford to pay the top salaries so they attract guys who really know their stuff. Not just semi-competent people who managed to sit through an MSCE exam. [his words not mine]

    Also he said support was crucial for his company. If something went down, he wanted to be able to call someone immediately. He couldn't afford to just post a question on a message board and hope someone replies. He wanted contracts with 3rd party support that had experience with similar huge enterprise systems that he had.

    When I said there were companies who could provide excellent Linux support, he said his ass was on the line if something broke so he wanted to be able to justify his software choice to the the C-level guys. And those guys knew the name Microsoft. So he didn't see anything else as an option.

  28. Re:let me be the first to say by CnlPepper · · Score: 2, Informative

    bollocks

  29. Re:let me be the first to say by actionbastard · · Score: 2, Funny

    That's 'bollocks', mate. You'd know 'em if you had 'em.

    --
    Sig this!
  30. Re:Still don't know why... by morgan_greywolf · · Score: 5, Insightful

    Wait! Are you suggesting that downtime can be caused by application problems, network problems, hardware problems, dumbass systems administrators and a whole slew of other things completed unrelated to the platform on which it is running?

    I am *shocked*! *Shocked* I tell you!

  31. ketan by ketan324 · · Score: 5, Interesting

    The LSE going down is a big deal. The US exchanges have been trying very hard to displace LSE's strong hold in the EUROPEAN markets. With the merger of NYSE/Euronext and NASDAQ/OMX this cuts market share and faith in LSE as everyday passes. Additionally with continued tech issues, NASDAQ could reinvigorate their bid for LSE again! I work for a data major data vendor, and I know from experience the NYSE and NASDAQ are much more reliable than their European counterparts. Also LSE going down today is huge, considering the news on Fannie/Freddie, WAMU, Lehman, and the WRONG news on United Airlines. Many arbitrage opportunities were lost for LSE traders.

  32. Some info on the system itself by Jherek+Carnelian · · Score: 2, Informative

    http://www.computerweekly.com/Articles/2006/09/26/218637/city-prepares-to-test-new-trading-platform.htm

    I bet the fingers are pointing today - Accenture (formerly Arthur Andersen) India vs HP vs Microsoft.

  33. Quote .NET by Legion_SB · · Score: 4, Funny

    .NET garbage collector: "Oops, that wasn't garbage!"

    --
    'a';DROP TABLE users; SELECT * FROM DATA WHERE name LIKE '%'... if you're reading this, it didn't work.
  34. It's official. by RightSaidFred99 · · Score: 4, Insightful
    Most of you are morons. Let me get this straight. TradElect is .NET based. TradElect failed. Ergo, Windo$e sucks, M$ sucks. and .N$T sucks, etc... You'd think you were technically illiterate morons or something who think that all or even most system failures are caused by the platform or programming language.

    Let me explain computers to you. See, the developer uses a set of platforms, languages, integration components, etc.. to deliver his functionality to the end user. A failure at any level can cause the application to fail. It could be application logic, network issues, hardware issues, integration with third party systems, a dipship systems administrator, etc...

    And yet the 90-105 IQ SlashDweeb set comes out in numbers with no data and says "lolz Windoze! .NET haha!". Crikey.

    1. Re:It's official. by Alioth · · Score: 3, Insightful

      Well, no, it's just that Microsoft shouted long and hard about how reliable the LSE would be now it was running on Windows Server System 2003. So it's deliciously ironic that after all this trumpet tooting, it still fell flat on its face, regardless of the reason...since Microsoft's ads were obviously to get everyone to believe that the system would be highly reliable.

  35. Re:Good lord, they're running on Windows? Why? by SimonBelmont · · Score: 3, Interesting

    Even if MS is able to make Windows good at what it is and generally reliable, what it is is not a high-SLA platform intended for mission critical systems, so there's really no excuse. I don't think NSA/CIA/DoD would say, "The security model of Windows isn't quite as bad as the /. crowd likes to say it is. Sure, we haven't reviewed it, but the IT guy says it will help us leverage synergy to effect better ROI."

  36. Re:What do Brits say when stuff like this happens? by julesh · · Score: 3, Interesting

    You've seen the first scene of "four weddings and a funeral", surely?

  37. Do Not Dot Net by dspolleke · · Score: 2, Funny

    Was it back up before the bell rang and did the Micro$oft stock plumit? in a dutch dialect dot-net translates into "doesn't work" ;-)

  38. Re:The London Stock Exchange Goes Down For Whole D by cp.tar · · Score: 2, Funny

    Oh, she does... just not with you. nudge nudge, wink wink.

    Your wife -- does she go?

    More importantly, does she run?

    More specifically, does she run Linux?

    More relevantly, does she run TradElect?

    No, she goes down on it.

    --
    Ignore this signature. By order.
  39. You have your causes and effects reversed. by jcr · · Score: 2, Interesting

    Windows does suck, building any mission-critical system on a fundamentally botched foundation is begging for trouble, and knowing that TradElect was built on quicksand is prima facie evidence of negligence. IOW, it probably failed because windows sucks, not the other way around.

    Let me explain computers to you

    Let me explain stock exchanges to you: if they go down during a trading day, a lot of people lose a lot of money. In years past, this kind of work was typically done on Tandem, Stratus or IBM systems which were so reliable that any unscheduled reboot merited a visit from the factory.

    BTW, I've worked on trading systems for Salomon Brothers, Phibro Energy, JP Morgan, and UBS/Warburg. If anyone had suggested running mission-critical back-office apps (like the system of record of a major stock exchange) on windows, they would have been laughed out of the room. I'm astounded that the LSE could be so sloppy.

    -jcr

    --
    The only title of honor that a tyrant can grant is "Enemy of the State."
    1. Re:You have your causes and effects reversed. by metamatic · · Score: 2, Interesting

      So far I know that the source does not blame Microsoft or .NET for that matter. It appears to have been some sort of network problem.

      A network problem which simultaneously affected the Johannesburg Stock Exchange, which uses the same software as per TFA? That's some interesting network topology.

      --
      GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
  40. Re:That's some strange math... by Tibor+the+Hun · · Score: 4, Funny

    Well, I'm a state employee, and I can tell you that a few 7 hour days in a row would outright kill me.

    --
    If you don't know what AltaVista is (was), get off my lawn.
  41. Coincidence? by Shadow+Wrought · · Score: 2, Funny

    The exchange goes down at the same time that the US buys two huge, failing monoliths?

    "Turn the bloody thing off! We'll just blame Microsoft and see how the rest of the markets shake out, shall we?"

    --
    If brevity is the soul of wit, then how does one explain Twitter?
  42. Virtual 100% uptime? I call BS... by volxdragon · · Score: 2, Interesting

    What the hell are you smoking??!? I worked for one of the top switch/router manufacturers for 7 years and this is FAR from true in their shop and pretty much everyone else's. Talk to any technical call-center rep for the top 5 or so router manufacturers and I am sure they can tell you many horror stories...

  43. Re:It appears high load/usage crippled the system. by julesh · · Score: 4, Insightful

    No different then what can happen on a unix box I suppose.

    Note that the current system is built around a large cluster of 2.2GHz servers, while the unix-based system it replaced (which coped perfectly happily with a substantial portion of the same traffic) ran from a smaller cluster of much slower servers.

    The primary purpose for the new system, introduced less than a year ago, was to expand capacity. For it to have failed within a year due to lack of capacity basically means that it has failed in that objective.

  44. Re:How many failures before.. by stinerman · · Score: 5, Insightful

    When I said there were companies who could provide excellent Linux support, he said his ass was on the line if something broke so he wanted to be able to justify his software choice to the the C-level guys. And those guys knew the name Microsoft. So he didn't see anything else as an option.

    In other words, he used the "no one ever got fired for buying IBM" defense.

  45. Comment from an affected trader: by reverseengineer · · Score: 4, Funny

    President of Exchange: [Randolph Duke has just collapsed with shock] Mortimer, your brother is not well. We better call an ambulance.

    Mortimer Duke: Fuck him! Now, you listen to me! I want trading reopened right now. Get those brokers back in here! Turn those machines back on!

    [shouts - it echoes pathetically throughout the trading hall]

    Mortimer Duke: Turn those machines back on!

    --
    "FDA staff reviewers expressed concern about the number of patients who were left out of the study because they died."
  46. What We Do Know by tobiah · · Score: 2, Interesting

    Is that this is good news for most of /. readers. A lot of big corporatations are being reminded of how important it can be to not cut corners when hiring programmers and IT.

    --
    "The ability to delude yourself may be an important survival tool" - Jane Wagner -
  47. Re:In other NEWS... by Cap'nPedro · · Score: 5, Informative

    No, he'd waggle his arse .
    A fanny would be a vagina in Britain.

    Come on +5 informative!

  48. Get The Facts by moderators_are_w*nke · · Score: 3, Informative

    "In the past six years, there have been no production outages at the London Stock Exchange, and the new systems running on Microsoft technologies are critical to maintaining this 100 per cent reliability record."

    http://www.microsoft.com/casestudies/casestudy.aspx?casestudyid=200042

    --
    "XML is like violence. If it doesn't solve your problem, use more." - Anonymous Coward
    1. Re:Get The Facts by kesuki · · Score: 4, Insightful

      Right from your article "and be cheaper to manage"

      sounds like the LSE fired expensive. knowledgeable admins and went for 'cheaper' ones, there is your problem right there. windows server isn't perfect, but clearly they had good hardware, were running mission critical apps, but went with cheaper less experienced admins.

      also, your fine article specified there were 'no production outages', they don't claim the system ran 24/7/365 with no reboots or glitches, but that there was no production outages for six years. there is quite a bit of difference. the former states that admins and hardware were able to offer the specific services needed at the time it was needed for 6 years, but not on the amount of redundant hardware, etc required to accomplish everything.

      so given everything i've read here, under experienced windows admin approves an under tested system upgrade that epic fails, and takes down the production server for the first time in 6 years. no shock here, they wanted to cut corners on admin costs, they brought the epic fail on themselves.

    2. Re:Get The Facts by narcberry · · Score: 5, Interesting

      Interesting since they haven't been "running on Microsoft technologies" for "the past six years"...

      --
      Modding me -1 troll doesn't make me wrong.
  49. Re:d'OH by maino82 · · Score: 3, Funny

    yes, but how many times did they reboot it?

  50. Tee Hee by mengel · · Score: 5, Insightful

    Oh, ye of lesser cynicism. I also, long ago, used to believe that language features could improve software reliability. Nowadays the idea just makes me cackle -- in actuality the universe just invents better idiots.

    --
    - "History shows again and again how nature points out the folly of men" -- Blue Oyster Cult, 'Godzilla'
    1. Re:Tee Hee by arevos · · Score: 4, Insightful

      I also, long ago, used to believe that language features could improve software reliability. Nowadays the idea just makes me cackle

      Why? Certain languages have features that eliminate large classes of errors. Whilst its possible that programmers will find other ways to screw up, I'd have though that reducing the set of errors that are actually possible would go some way to improving reliability.

      Out of curiousity, what languages are you familiar with? Have you worked much in languages with very tough compile-time checks, like Haskell?

    2. Re:Tee Hee by natebarney · · Score: 3, Informative

      Certain languages have features that eliminate large classes of errors. Whilst its possible that programmers will find other ways to screw up, I'd have though that reducing the set of errors that are actually possible would go some way to improving reliability.

      With a general purpose programming language, the number of ways to screw up is effectively infinite. If you take another infinite set, say, the integers, and eliminate a large subset, say the even integers, you still have an infinite set left over. The GP is simply pointing out that there will always be programmers who screw up in ways that haven't been eliminated.

    3. Re:Tee Hee by arevos · · Score: 2, Insightful

      With a general purpose programming language, the number of ways to screw up is effectively infinite. If you take another infinite set, say, the integers, and eliminate a large subset, say the even integers, you still have an infinite set left over.

      By the same token, the number of ways to befall a terrible accident is effectively infinite. Does that therefore mean we should not bother putting railings on bridges, or seatbelts in cars, or fuses in sockets?

    4. Re:Tee Hee by the_raptor · · Score: 2, Insightful

      The average driver can't take out the whole bridge. The average programmer can. Software is just not understood as well as structural engineering, and most of the people we allow to do it couldn't even get into an engineering program.

      --

      ========
      CINC, 4th Penguin Legion
    5. Re:Tee Hee by Alex+Belits · · Score: 2, Insightful

      People actively AVOID getting into an accident, so while it's still possible to die horribly in all kinds of ways, a person will not likely to get to the point of actually getting into a fatal accident unless something is seriously broken (say, railing is unexpectedly missing on a bridge).

      Inexperienced and stupid programmers produce bugs at nearly constant rate -- give them something that makes some bugs impossible, and they will either improve their productivity (still with $deity-awful rate of bugs), or start making different kinds of bugs.

      --
      Contrary to the popular belief, there indeed is no God.
    6. Re:Tee Hee by Firehed · · Score: 2, Insightful

      This is just why the average programmer shouldn't be writing software that controls a stock market. When you're dealing with something that mission-critical with that much money on the line, you damn well better be pulling from the same pool as NASA.

      --
      How are sites slashdotted when nobody reads TFAs?
    7. Re:Tee Hee by forand · · Score: 2, Insightful

      Just because a languages reduces or even removes certain classes of problems does not mean it doesn't create new ones.

    8. Re:Tee Hee by Simon+Brooke · · Score: 5, Insightful

      I also, long ago, used to believe that language features could improve software reliability. Nowadays the idea just makes me cackle

      Why? Certain languages have features that eliminate large classes of errors. Whilst its possible that programmers will find other ways to screw up, I'd have though that reducing the set of errors that are actually possible would go some way to improving reliability.

      Out of curiousity, what languages are you familiar with? Have you worked much in languages with very tough compile-time checks, like Haskell?

      Y'know, I agree with the grandparent. On my first coding job there was a guy (Chris Burton) who'd worked on the Manchester Mark One. He was retirement age when I met him. We had a new model of inkjet printer, which had a new processor none of us had ever seen before. It printed characters, we needed it to print bitmaps.

      Chris took the datasheet for the printer and the datasheet for the processor home on the train with him, and came back next morning with new code for the printer PROM written out - in opcodes, not assembler mnemonics - in longhand on a pad of paper. That code was blown into the PROM and worked first time, and continued to work without any errors reported for the three years I was on that project.

      Programmers like that just don't seem to exist any more. Automatic memory allocation, bounds checking, type checking, etc. are great technology, and I wouldn't choose to live without them. But they mean we are all sloppy and careless, because we can get away with it, and when humans can, they do.

      --
      I'm old enough to remember when discussions on Slashdot were well informed.
  51. Bad upgrade by JShadow21 · · Score: 5, Informative

    The article here blames it on some sort of botched upgrade.

    1. Re:Bad upgrade by Locutus · · Score: 5, Insightful

      this same kind of thing( replace *nix with Windows ) is what took out the LAX comm system a few years ago and left dozens and dozens of airplanes in the air and on the ground at/over LAX without communications.

      What blows me away is that for years, UNIX systems were one of the defacto standards for mission critical OSs. Along comes a marketing company, Microsoft, and people are saying it is capable of mission critical use even when there are constant disruptions from virus attacks, Ctl-Alt-Del and BSoD are a well known features, and any of a hundred other reasons it is NOT ready for mission critical systems.

      What kinds of morons are running the show anyways? And it is about time people start getting fired for this junk. From my experience on operating systems, UNIX was the one OS where when you wrote code, you dealt with the business logic/code and not OS issues. Only once in a blue moon did an OS patch or structure tweak get in the way of coding the application(s). OS/2 was pretty good but not as good as UNIX and Windows was the worst. Gawd, I still hear people complaining about that little Windows Mobile OS crashing. They can't even get a small chunk of code working properly let alone the behemoth that is the Windows desktop and server OS.

      LoB

      --
      "Anyone who stands out in the middle of a road looks like roadkill to me." --Linus
  52. Re:It appears high load/usage crippled the system. by lgw · · Score: 4, Insightful

    No, actually the Windows system (10 ms per transaction) was a 13x speedup over the older system (135 ms per transaction), followed quickly by an addiditonal 50% speedup (6 ms per transaction). The Windows system was just recently updated to double performance again (3 ms per transaction), so it's now 45 times as fast as the unix-based system it replaced.

    You may be able to fault it on reliability (though the olde system wasn't perfect either), but you can't fault it on performance.

    --
    Socialism: a lie told by totalitarians and believed by fools.
  53. Re:That's some strange math... by phillous · · Score: 2, Funny

    You must be an American, since you have NO idea what irony is...

  54. It could be .. but wasn't :) by rs232 · · Score: 2, Informative

    "It could be application logic, network issues, hardware issues, integration with third party systems, a dipship systems administrator, etc"

    But it wasn't any of the above. The Stock Exchange failed after a failed upgrade of the Microsoft .Net based trading platform ..

    --
    davecb5620@gmail.com
    1. Re:It could be .. but wasn't :) by RightSaidFred99 · · Score: 4, Insightful

      Why did the upgrade fail, I guess is what an intelligent person would ask. You haven't asked that. You've hilariously assumed it's .NET or Microsoft's fault.

      As a matter of like for like, I'm going to assume it was because some Linux dweeb walked in and tripped over a network cable. Ergo, I now claim Linux dweebs are clumbsy oafs who should be banned from computer rooms.

  55. Re:How many failures before.. by Kjella · · Score: 2, Insightful

    In other words, he used the "no one ever got fired for buying IBM" defense.

    Yes, he did and as a matter of fact it's a valid defense and it's not a chicken or egg problem. Almost everything that ever wants to run mission-critical systems have to work their way up from missing-trivial through mission-sensitive and mission-important. Customers didn't just one day decide IBM was overpriced and threw out all their computers, they tried and tested clones and nothing bad happened. Can you point to any smaller exchanges that use Linux? Or do you expect that suddenly Linux should go from nowhere to running the biggest, most critical markets in the world economy? No offense, but it doesn't happen to anything else either, walk before you run. That, or bribe before you run but expect some trip-ups...

    --
    Live today, because you never know what tomorrow brings
  56. Re:Still don't know why... by Dan667 · · Score: 4, Informative

    Leaky abstractions (look it up, it is a good read). A lot of times for kitchen sink platforms like .Net and Java you get burned by the bugs buried in the underlying platform. If to many of these system are stacked it becomes really difficult to have any stability.

  57. To be fair by EmbeddedJanitor · · Score: 4, Informative
    Five 9s does not mean achieving five 9s at every installation. It means five 9s averaged across all installations. Having a 0.0001% chance of being hit by a bus is hardly consolation for the person that actually does get hit by the bus.

    Of course it is very unlikely that MS achieves five 9s on any installation, let alone as an average.

    --
    Engineering is the art of compromise.
    1. Re:To be fair by Anonymous Coward · · Score: 2, Interesting

      Typical MS hosted services aim for 99.5%. That means about 44 hours of downtime per year. Don't ask how I know. (Posting anonymously, of course.)

  58. Re:Good lord, they're running on Windows? Why? by Nerdfest · · Score: 2, Interesting

    You can talk up system z all you want, but when it comes right down to it, most of the outages problems are caused by incompetence, not hardware failure. Because of this, I've actually seen a Win2K based system beat zOS based systems a few years in a row. It frequently has little to do with the hardware, or even the OS.

  59. Why is Microsoft getting dragged into this discuss by rs232 · · Score: 2, Informative

    "The UK's major banks and hundreds of City trading firms will begin testing the London Stock Exchange's new core trading platform early next month, ahead of its planned launch in the summer of 2007 ..

    Accenture built the Tradelect platform in India between late 2004 and March this year .. Tradelect .. will rely on high-speed middleware developed in-house, which was created using Microsoft's C# programming language and the .net Framework"

    --
    davecb5620@gmail.com
  60. Re:Why is Microsoft getting dragged into this disc by gbjbaanb · · Score: 2, Funny

    d'oh. So it was all built on .NET 1.1, no wonder. They need to upgrade to .NET 3.5 and all will be good. promise.

  61. Re:It appears high load/usage crippled the system. by alexhs · · Score: 3, Funny

    6.40K transactions/second ought to be enough for everyone.

    --
    I have discovered a truly marvelous proof of killer sig, which this margin is too narrow to contain.
  62. Talk Like a Pirate Day by Mr.+Firewall · · Score: 2, Informative

    Is it "Talk like an Ass-Pirate Day" already?

    No, TLAP Day is next week.

    --
    In times of universal deceit, telling the truth gets you modded -1 Troll
  63. Why would you even do that? by Maxo-Texas · · Score: 2, Interesting

    Mainframes, AS/400 - pricey but high reliability with 2-3 decades of baseline.
    PC's / Windows - cheap, "good enough" for home use and light business use, new as hell and subject to unknown problems- and with a known history of issues too.

    I use a pc daily... but there is no way I would put any 99% uptime application on it where huge amounts of money or lives were at stake. It's fine as a client.

    --
    She was like chocolate when she drank... semi-sweet at first and then increasingly bitter.
  64. Link to incident status page by alexmin · · Score: 5, Informative

    Here: http://www.londonstockexchange.com/en-gb/products/membershiptrading/tradingservices/Incident/LIVE
    Notice that there were several unsuccessful attempts to bring it back up.
    What's really pitiful, LSE has just a fraction of data/trade volume of major US exchanges like Nasdaq or NYSE and still, their systems are regularly getting hosed, albeit not as much as today's meltdown.
    Hopefully in coming years LSE will lose market share to Nasdaq/Europe, BATS/Europe, Chi-X and other electronic markets - that should teach them well.

  65. Re:It appears high load/usage crippled the system. by Just+Some+Guy · · Score: 3, Informative

    I'm not sure I understand the distinction you're trying to draw,

    Latency versus throughput. If the new system processed those serially while the old could handle 130 in parallel, then the old system would be 10x faster even though the new was 10x quicker.

    but total transaction capacity of the system increased along the same lines.

    Yes, after throwing massive amounts of hardware at the problem.

    --
    Dewey, what part of this looks like authorities should be involved?
  66. Re:Good lord, they're running on Windows? Why? by gbjbaanb · · Score: 2, Interesting

    but it does have a lot to do with the development. They chucked the old, stable (but "obsolete" and slow) systems for something shiny and new. In this case, Windows and .NET 1.1 written by a consultancy with Indian developers.

    I doubt reliability really factored much into it, using the newest coolest stuff came first. Possibly for marketing reasons. Remember this was 4 years ago, .NET had pretty much just come out (we ignore v1.0 which was practically a preview release), you know it couldn't have been as good as the MS marketing man said it was.

  67. Re:How many failures before.. by metamatic · · Score: 4, Informative

    Can you point to any smaller exchanges that use Linux?

    No, but I can point to the New York Stock Exchange, which uses AIX and Linux.

    --
    GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
  68. Vietnam outperforms London by hallucination · · Score: 2, Informative

    7 hours? Is that all you can do? We managed a 3 day outage earlier this year at the Ho Chi Minh City stock exchange.

    http://www.bloomberg.com/apps/news?pid=20601087&sid=aCTlooFV6H0Y&refer=home

    1. Re:Vietnam outperforms London by synthespian · · Score: 3, Interesting

      Ok, so here's the tally I've seen so far:

      - LSE today (7 hours downtime)
      - Ho Chi Minh City stock exchange (3 days downtime)
      - Brazil futures, BM & F, aug 26, 2008 and Bovespa Nov, 30th, 2007.

      that I've heard of.

      It's incredible! This looks systemic and widespread.

      I guess it's a great marketing achievement for Microsoft.

      When will people in the financial sector wake up and learn they've been duped?

      --
      Main difference between the BSD license and the GPL license: one is from California and the other is from Massachusetts
  69. Re:Virtual 100% uptime? I call BS... by fishbowl · · Score: 3, Funny

    >It's called redundancy. Yes, a single router will fail. 2 at once? Likely not. 3? No.

    Saw triply redundant systems fail twice in my career as a net admin.

    I'm willing to bet your life on the reliability of triple redundancy.

    --
    -fb Everything not expressly forbidden is now mandatory.
  70. Re:The London Stock Exchange Goes Down For Whole D by mcpkaaos · · Score: 4, Funny

    If she's down then who will do the dishes and laundry? How do you reboot her? Does it really take 7 hours? Don't they make drugs for that?

    What.. what's a wife?

    It's like a mother, but requires less therapy.

    --
    It goes from God, to Jerry, to me.
  71. Re:How many failures before.. by Ilgaz · · Score: 2, Informative

    The IBM Z/OS and AIX, Sun Solaris are at such a level that they would alert the respective companies before anything catastrophic actually happens and while systems running on a parallel sysplex backup system without any employee figuring it, IBM/Sun engineers would be fixing the issue. Ask any serious bank why they keep buying/using mainframes.

    The term is "Autonomic computing" http://en.wikipedia.org/wiki/Autonomic_Computing

    I would investigate the decision maker having genius idea of running a financial, time critical process on Windows 2003 and .NET platform.

  72. /. is like the Keith Olbermann of the IT world by Liquidrage · · Score: 2, Insightful

    If the issue turned out to be network is /. going to post a retraction article? Both the summary and the comments are pretty pathetic.

    I fully appreciate they have an agenda and a cause. No problem. But for such a popular site, if they're going to throw stones they should do at least wait until the target is verified.

  73. A critical system shoud be RELIABLE! by mangu · · Score: 4, Insightful

    there's many things that could cause problems, like plain old programming bugs.

    It's about the same thing when people say that "XP does not crash, it's faulty device drivers that crash".

    If a system should be reliable, then it should be reliable, no excuses accepted. It does not matter if it's system bugs, application bugs, hardware failures or power outages, a system that pretends to achieve 99.999% availability should take all that into account.

    The operating system is not at fault if the power goes down, of course, it's a sloppy engineer that designs a system without redundant power supply. But, likewise, a sloppy engineer will prefer a system that lets him configure and operate it by click-and-drag, instead of a carefully designed and tested set of procedures.

    A critical system should NEVER depend on an operating system that does not have a proper batch language. That should be a compact and powerful script language, using TEXT files for configuration that can be hand edited if needed, that can be stored and archived in a version control system, so that bugs can be tracked.

  74. Re:Good lord, they're running on Windows? Why? by Alex+Pennace · · Score: 3, Interesting

    You have hit the nail right on the head.

    I'd like to suggest another factor: Stability-conscious developers -- those that know about race conditions, memory leaks, atomic transactions, and the like -- tend to gravitate towards operating systems that make it easy to put their ideas into practice.

    That isn't to say Windows is inherently unstable, it just means that it is more difficult to write a stable and reliable application on that platform. And even if you think you got all your bases covered, you can still get blindsided by depending on poorly written code churned out by some .NET developer who was happy enough to ship something that appeared to work most of the time.

    The good developers then shrug and say Windows is not suitable for critical computing, and go back to UNIX-ish platforms or whatever they are more comfortable with. Rushing into that void are legions of Windows developers who are also happy enough to ship something that appeared to work most of the time, and the cycle continues.

  75. Re:Good lord, they're running on Windows? Why? by narcberry · · Score: 2, Insightful

    I love to bash windows as much as the next guy, but I don't see the connection.

    Developing an application on .NET does not transfer all your responsibility to Microsoft.

    --
    Modding me -1 troll doesn't make me wrong.
  76. Latency is kinda pointless for this kind of stuff by Nicolas+MONNET · · Score: 4, Insightful

    I mean, that might be what they worked on, but it's kinda pointless; what's interesting is the # of transactions per second, and that can usually be improved at the expense of individual latency. For example, databases can be configured to wait a few milliseconds to group transactions, so as to write several to disk in one single write/sync.

  77. Re:The London Stock Exchange Goes Down For Whole D by qengho · · Score: 2, Informative

    /rimshot!

    Here you go.

  78. Choice quotes by DrSkwid · · Score: 3, Insightful

    Nick Illidge Financial Markets Sales Manager at Microsoft UK "We are delighted that the London Stock Exchange has selected the Windows platform to base a significant part of its business on. This is further evidence of the enterprise scalability of the Windows franchise. We see our relationship with the Exchange and Accenture as a strong partnership. The Exchange is bold in its technology vision, Accenture provides the capability to deliver this vision, and Microsoft is providing the core technology to help provide the business benefits that the Exchange is looking for."

    David Lester CIO at the LSE says ... that the LSE "is the only exchange in the world not to have had a single outage in six years."

    "This is all about the question, 'How are we going to take over the world?'" says Lester, "... I believe this system -- because it's fast, agile and reliable -- will help us compete better. Our current system has to go down for four hours every evening to get ready for the next day's trading," he says. "The batch processing is '80s and '90s technology. You can't run a global market with a system that has to be down for four hours."

    Here's a great factoid
    Before joining the Exchange in 2001, David worked for Thomson Financial and Accenture.

    --
    There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
  79. status page by Anonymous Coward · · Score: 2, Informative

    here their status page.
    http://www.londonstockexchange.com/en-gb/products/membershiptrading/tradingservices/Incident/LIVE

    same below, but may not render properly

    Incident Updates
    Time Market Status Exchange Action
    Client Impact
    Client action
    6.43pm Market Closed

    The Exchange regrets the earlier interruption to trading and is conducting further investigations. It is in the process of confirming all of the steps necessary to ensure trading can commence as scheduled tomorrow.

    Further updates this evening will be published on this website.
    Monitor this Website.
    4.49pm Market Closed Closing auction has now finished. Closing prices, where relevant, have been disseminated.

    4.21pm Closing Auction

    This is to inform you that due to on-going connectivity issues to resume a fair and stable market the Closing Auction will commence from 16:21 onwards. The Closing Auction will uncross as scheduled at 16:35 onwards (subject to a 30 second random period).

    4.00pm Continuous trading
    Standard trading schedule will be followed for the remainder of the day.

    3.45pm Auction

    The auction will uncross at 16:00 BST (subject to a 30 second random period) at which time continuous trading will resume.

    There will be no further change to the remainder of the trading day. Therefore, the Closing Auction will commence as scheduled at 16.30 and uncross at 16:35 (subject to a 30 second random period).

    From this time market maker quotes in both quote and order driven markets will be firm.
    Prepare to resume trading
    3.30pm Auction
    The International Order Book and International Bulletin Board will NOT be available for automatic execution for the rest of today.

    No closing prices will be issued in these trading segments (IOB, IOBU, ITBB and ITBU) today.

    The remaining Trading Segments will remain in an auction phase. A further update will be provided.

    3.11pm Auction

    We will be re-enabling connectivity from 3.15pm

    Connectivity will be phased and following completion all order book segments will remain in an auction phase.
    Once connectivity is established orders can be entered and deleted, but no electronic execution will occur until the uncrossing and commencement of continuous trading.

    2.38pm
    Auction To ensure consistent connectivity we are suspending connectivity to trading for a short period from 2.45pm

    Once connectivity is established orders can be entered and deleted, but no electronic execution will occur until the uncrossing and commencement of continuous trading.
    During this time customers are required to reset their log on connection status to ensure legitimate connections can be established once connectivity is re-enabled
    2.20pm Auction
    We are continuing to establish connectivity with our customers. This process is taking longer than expected.

    A further update will be provided.

    Once connectivity is established orders can be entered and deleted, but no electronic execution will occur until the uncrossing and commencement of continuous trading.
    1.13pm Auction
    We are continuing to establish connectivity with our customers.

    A further update will be provided shortly.

    Once connectivity is established orders can be entered and deleted, but no electronic execution will occur until the uncrossing and commencement of continuous trading.

    12.30 Auction
    We are continuing to establish connectivity with our customers.

    Continuous trading will re-commence at the end of the auction period. We will provide at least 15 minutes notice of when we plan to end the

  80. Back in the day - Not only London by synthespian · · Score: 4, Insightful

    IIRC, Brazil Bovespa had a small glitch last month or two.

    Back in the day when Wall Street and financial markets ran on Solaris systems (AFAIK), this shit wasn't common.

    Now it's probably going to become *acceptable* for stock exchanges and aviation reservation software to crash.

    Apparently, there's a new generation of a-holes on the system administration markets who grew up with Windows and the Blue Screen of Death, that thinks it's acceptable for operating systems to crash, once in a while. Is it evolution?

    --
    Main difference between the BSD license and the GPL license: one is from California and the other is from Massachusetts
  81. Re:Battery-backed write through cache by El_Oscuro · · Score: 4, Insightful

    Oh, yes.. battery backed write cache. With batteries produced by the lowest bidder. The warranty is for 3 years, and the battery lasts just that long before silently failing. When the power goes, well you really didn't need that data written to disk on your database server, did you?

    We now do not allow any server to be put into production with any kind of write cache on it. Ever.

    --
    "Be grateful for what you have. You may never know when you may lose it."
  82. Or VAX/VMS. by toby · · Score: 2, Insightful

    Windows is just consumer junk, and not even very good consumer junk.

    Kickbacks are almost certainly at work in a deployment like this.

    --
    you had me at #!
  83. Re:The London Stock Exchange Goes Down For Whole D by bursch-X · · Score: 4, Funny

    Maybe he meant rim-job...

    --
    There are two rules for success:
    1. Never tell everything you know.
  84. Re:Uhhh, what? by ozphx · · Score: 2, Interesting

    Haha. Having worked for a Super-Platinum-Alpha MS partner before I'll tell you how it works:

    You pay your enormous Partner fee every year. Occasionally MS will send someone out with a powerpoint presentation on a 6 month old MSDN article. If you are doing one of these projects, MS will never ever touch it, or help in any way, beyond paid support for a particular product (like anyone else can pay for). They will put an article in MSDN Magazine like "Microsoft and Fagware collaborate to make Some Awesome System". There will be a photo, with the MS rep shaking the clients hand, with your boss half cropped out. They will mention a whole bunch of MS tech that you (probably didnt) use.

    Its not like they ever see a single design doc, let alone line of code. They don't do a damn thing beyond telling everyone how they are collaborating with you to build Awesome-X Plus for Important Client.

    Also you get to put "Gold Partner" on your website, and MS occasionally refers clients that need someone to implement stuff.

    --
    3laws: No freebies, no backsies, GTFO.
  85. Using Microsoft for a 5-nines SLA? Is that a joke? by MrJerryNormandinSir · · Score: 5, Informative

    That was the their first mistake. What were they thinking? You need a 3 highly available Unix clusters with three SANs. You need three to elect a quorum. If you don't know what a quorum is you shouldn't be attempting to design system that is supposed to deliver on a 5-nine SLA. Each geographic location should include 1 cluster and 1 SAN. All three locations networked with dark fiber. fiber routing should be set up so that a cluster can fail over to a SAN in another location. As far as Hardware is concerned, I would go with a cluster of IBM P6-570 and use an EMC Symmetrix DMX SAN at each site.
    Who the heck designed this? .Net trading platform.. I have to laugh! Microsoft .net = 5.none SLA! .Net is only good for people who would like to create a light duty website. Under a load it breaks. The London Stock Exchange proves my point.

  86. Re:The London Stock Exchange Goes Down For Whole D by mikiN · · Score: 5, Funny

    What.. what's a wife?

    WIFE: Specialized form of WIFI, indicating one of two stations engaged in a (semi-)permanent point-to-point link, the other station typically called HUSBAND. Unsecured transmission often leads to packet loss 9 months after initial association, resulting in long-term elevated QoS requirements. Roaming is usually forbidden by link protocol, although experiments with mesh networks have been reported. DOS attacks often lead to severed links, litigation and possibly material and financial damages.

    --
    The Hacker's Guide To The Kernel: Don't panic()!
  87. Re:How many failures before.. by brianerst · · Score: 3, Informative

    I've been the lead architect and/or senior programmer on a couple of futures and options exchange trading platforms (CBOT's Order Routing System, CBOE's CBOEdirect) and pay the bills currently by connecting firms to various electronic trading platforms. Hardly anyone uses Microsoft except for GUI/user-facing applications. The back-end stuff is almost always UNIX/Linux.

    Off the top of my head, I know that all the LiffeConnect-based systems (London Financial Futures Exchange, EuroNext, Amsterdam, CBOT Metals Complex, Tokyo Futures Exchange, probably a couple of others) run on Linux (a relatively recent change from Sun boxen). NYSE now owns that codebase, and I'm pretty sure that the NYSE uses Linux and AIX on its own platform.

    The Chicago Mercantile Exchange's GLOBEX trading engine (running CME, CBOT non-Metals, NYMEX plus a couple smaller exchanges like Minneapolis and Kansas City) platform runs on Linux. They migrated from Solaris to Red Hat back in 2004.

    The Intercontinental Exchange's WebICE platform is written in Java and I believe it's running on Linux, but there may be some Solaris still around.

    The CBOEdirect system is Java but runs mostly on Sun Enterprise hardware. There is some Linux in the mix, and they certainly use it on some of their other trading systems.

    In the (futures and options) trading world, running on Windows servers is considered to be a sure sign of being bush-league. Demand for UNIX/Linux is huge. And I'm not saying this as a Java/UNIX/Linux snob - most of the systems I've written were Microsoft-based (for a variety of reasons - most started out as technology demonstrations that grew way beyond their intended lifespan - "the client's always right").

  88. Re:Good lord, they're running on Windows? Why? by Lost+Engineer · · Score: 2, Insightful

    "you can't get fired for using Microsoft."

    Maybe not but you sure can get fired when your system epic fails and stops trading for a day.

    Seriously though given the length of the downtime, I doubt this was caused by an operating system problem. If that were the case, you'd just reboot the affected system and hope it doesn't go down again before you can get a patch, no? Even Vista doesn't take that long to boot.

  89. Re:Using Microsoft for a 5-nines SLA? Is that a jo by afidel · · Score: 2, Insightful

    Bah, my YTD uptime for most of my systems is 99.995 without clustering running Windows 2003. We are a java shop not a .net shop though but java isn't perfect by any stretch of the imagination.

    --
    There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
  90. Re:Latency is kinda pointless for this kind of stu by ultranova · · Score: 3, Insightful

    Latency IS important, especially for institutional investors or trades on a mercantile exchange. One of the most critical being arbitrage, or buying something from one person and immediately selling to another at a higher price - instant profit. You just have to be the one to spot the price differential first, and it can come down to milliseconds.

    Based on this description, seems to me that "arbitrage" is a nice word for inserting yourself into a trade which has nothing to do with you for the purpose of bleeding both the seller and the buyer out of some profit without producing or contributing anything of value. Making it more difficult would make the actual productive parties in the trade better off, and likely help economy as a whole.

    Or, to put it even more bluntly: arbitrage, as described by you, is a nicer name for parasitism.

    --

    Forget magic. Any technology distinguishable from divine power is insufficiently advanced.

  91. Re:Using Microsoft for a 5-nines SLA? Is that a jo by hughk · · Score: 4, Interesting

    WTF did a moderator mark this as flamebait? The poster was right, HA is a) hard and b) expensive.

    I designed some of the HA stuff many years ago for Eurex. We used OpenVMS and had two clusters (over 40Km apart) for the main and standby with the standby system also being used for development with a flick of a switch the standby cluster could take over in production. We had no SANs in those days but used Digital's Hierarchical Storage Controllers. These days it runs with SANs but the host systems still run VMS and there are now product specific clusters.

    The next level down there are access points containing communications servers providing connectivity to member systems and routing to the hosts which are scattered around the globe. A member normally has connectivity to two access points. The only single point of failure for a member is where both lines come together for the last few metres into their building and some idiot digs a hole in the road.

    --
    See my journal, I write things there
  92. Re:Using Microsoft for a 5-nines SLA? Is that a jo by Aceticon · · Score: 4, Informative

    I work in London as a freelancer in IT in Investment Banking. My professional experience was mostly with IT Products/Services companies.

    Although I haven't worked in the LSE, from the places I've worked in around here I came out with the impression that most people in IT in this industry are amateurs (and that includes those in other geographical locations).

    Any kind of more advanced IT concepts such as technical analysis, software/hardware architecture, iterative software development processes are pretty much either not done or done by people you don't have clue about what they're doing.

    I'm hardly surprised with what happened in the LSE.

  93. Are London people crazy?? by GNUPublicLicense · · Score: 2

    A trading system without the control of the source code? Are they mad or insane?

  94. Re:Using Microsoft for a 5-nines SLA? Is that a jo by Anpheus · · Score: 2, Insightful

    Why modded troll? It is possible to get high uptime figures with a lone system. You can't take it offline, but hell, I could probably run my PC for a year end to end without issue. The problem occurs when I try to scale that and make, say, 200 PCs all run for a whole year without issue.

  95. Re:Latency is kinda pointless for this kind of stu by AlecC · · Score: 2, Informative

    It is not inserting into a trade, it is creating a trade when a difference in value is noticed. Arbitrage levels out prices between different exchanges, allowing people to trade on either without worrying if they would be getting a better price elsewhere. It is parasitic in the same sense that the oil pump is parasitic in a car - it doesn't add any power, but things turn much more freely with it in place, and it exacts a charge for doing so. Essentially, an arbitrageur is constantly shopping around for bargains an evening them out, meaning that ordinary traders don't need to do so.

    --
    Consciousness is an illusion caused by an excess of self consciousness.
  96. Re:Using Microsoft for a 5-nines SLA? Is that a jo by donscarletti · · Score: 2, Funny

    All three locations networked with dark fiber.

    Um, if you set up a network with this fiber, it would no longer be dark. I'm not sure what you think dark fiber is, fiber optics that are cool, edgy and a little bit menacing I suppose.

    --
    When Argumentum ad Hominem falls short, try Argumentum ad Matrem
  97. Re:Using Microsoft for a 5-nines SLA? Is that a jo by Capt+James+McCarthy · · Score: 5, Insightful

    I have a feeling that the 'normal' IT situation was to blame for this.

    Preamble: Technical Expertise provided a wonderful architecture that was HA and robust, fast, and scalable.

    Bean Counters looked at the cost and said "You Tech guys spend too much money."

    IT architects: "How much is your data worth?"

    Bean Counters: "Not this much. Look we don't really need all of these systems. My home system has been working for 4 years with no problems. And I've talked with Microsoft Execs and they will cut us a deal for their platform. Now go away, I've just decided how the architecture will be done. Why did we hire you anyways?"

    --
    There are no loopholes. It's either legal or it's not.
  98. Re:Using Microsoft for a 5-nines SLA? Is that a jo by splutty · · Score: 2, Funny

    You need to see this particular Dilbert cartoon, which is very much like what you describe :)

    http://www.dilbert.com/fast/2008-09-09/

    --
    Coz eternity my friend, is a long *ing time.
  99. microsoft as five nines? by nimbius · · Score: 2, Informative

    one more blackhearted sentiment from a seasoned IT guy. Microsoft has no business stating "five nines" of anything other than profit. not since Xenix.

    the last fortune 500 i worked for had to have the exchange cluster failed over at least twice a month, each failover costing in the neighborhood of 15 minutes (if it worked, which half the time it didnt.)

    the sharepoint server cluster routinely, with all the grace of a candelabra about the skull, would try to failover and fence but managed to fence more active nodes than dead ones. brings new meaning to STONITH.

    im sure Red Hat is having a good laugh over this NYSE mess.

    --
    Good people go to bed earlier.
  100. Re:Using Microsoft for a 5-nines SLA? Is that a jo by nicolas.kassis · · Score: 3, Insightful

    Um, wrong - Ever heard of the Mono Project?

    Mono provides the necessary software to develop and run .NET client and server applications on Linux, Solaris, Mac OS X, Windows, and Unix.

    http://www.mono-project.com/

    Glad you fell for Microsoft's marketing campaign. There is a reason they don't crush mono. It gives a illusion that there is choice. Name me

  101. Re:Latency is kinda pointless for this kind of stu by Quetzo · · Score: 2, Informative

    No, arbitrage is a necessary component of orderly markets. A perfectly balanced market has 0 arbitrage opportunity.

    Example: ETF Conversion/Redemption. The ETF is priced in real-time based on the price of its components. Fast systems are able to detect small discrepancies in the price of some of the components and the basket as a whole and are able to execute trades ( say BUY ) on the components and the contra trade ( SELL ) on the ETF and then redeem the ETF from the components to pair off the contra trade.

    This arbitrage always works to keep the components perfectly in line with the ETF itself. I think that is a good thing.

  102. Not getting it by SmallFurryCreature · · Score: 2, Interesting

    MOST of your systems, that says it all. If you design a system you must be a little bit more certain then that "most" of it will be up for the availability your are promising.

    I got a consumer HD that so far lasted for 6 years, non-stop. By your logic therefor consumer HD's are fine for a server enviroment because they last for 6 years running 24/7.

    If 100% of you systems run 99.999 availabibilty, THEN you can come back. The acceptable error-rate is 0.001% FOR ALL YOUR SYSTEMS. Lets say that you have 10 2003 systems and MOST is 9, then you only got 90% availabilty, you fail, back to the drawing board.

    It is hard to get true HA. MS can't do it, doesn't mean you can't use it for little projects. I have seen many a succesfull project launch on PC's thrown together at a local computer store and shoved into a rack. Worked fine, but only an idiot would guarantee any availability on it. Prove me wrong, put in your contracts that you guarantee HA for your clients. See you how your boss/lawyer reacts.

    --

    MMO Quests are like orgasms:

    You may solo them, I prefer them in a group.

  103. No National TV News Coverage? by s6plit4 · · Score: 2, Insightful

    I watched NBC and ABC national TV news last night and so absolutely no coverage of this event. There was even a mention of markets around the world with the news of Freddie and Fanny government backing. Anyone else notice this? This was a major event that went unnoticed for a good portion of the US.