Slashdot Mirror


London Stock Exchange Price Errors 'Emerged At Linux Launch'

DMandPenfold writes "Within the first 20 seconds of the London Stock Exchange's new matching engine going live on Monday, price data vendors began displaying incorrect prices, blank prices and wrong trading volumes, according to Computerworld UK sources. Thomson Reuters, Interactive Data and Netbuilder are among the largest data vendors, providing share prices to traders, that have been displaying pricing problems on some stocks throughout the week. Even the LSE's own data vendor, ProQuote, experienced problems. Concerns are being raised that there could be mistakenly setup connections or incorrect software interfaces at some of the large data vendors. Alternatively, there may be a data caching issue at the LSE that means data going out is not properly synchronised between different systems."

109 of 168 comments (clear)

  1. Testing? by Tubal-Cain · · Score: 2

    Within the first 20 seconds of the London Stock Exchange's new matching engine going live on Monday, price data vendors began displaying incorrect prices, blank prices and wrong trading volumes, according to Computerworld UK sources.

    Surely they would have run test data before letting it go live. Maybe even feed it the actual data and simply not publish its results.

    1. Re:Testing? by PNutts · · Score: 5, Informative

      Surely they would have run test data before letting it go live. Maybe even feed it the actual data and simply not publish its results.

      From TFA: "Meanwhile, the private view at the exchange is that the 15 month testing window should have been plenty of time for the vendors to interface perfectly with it."

    2. Re:Testing? by Davorama · · Score: 2

      Indeed, but please stop calling me Shirly.

      --

      Davo -- Free speech, free software, AND free beer.

    3. Re:Testing? by cronius · · Score: 5, Informative

      From this article a couple of days earlier http://www.computerworlduk.com/news/it-business/3261625/frustrations-mount-over-london-stock-exchange-data-interface-problems/ :

      Publicly, the LSE and the data vendors say they are working together as engineers scramble to fix the price data problems, even though some statements to clients suggest elements of blame.

      But behind the scenes, sources close to the several of the parties, including the exchange, told Computerworld UK they were immensely frustrated at the reputational impact of the problems.

      The LSE is taking the position that its data feeds are working correctly. Industry sources said the exchange had placed a great deal of emphasis on the launch, which was largely providing successful high-speed trading, and that it had allocated sufficient time - 15 months - for the vendors to be fully prepared for the new system.

      The LSE statement appeared to place some blame with the vendors. "Unfortunately a couple of market data vendors have experienced some specific issues aligning to the new Millennium Exchange platform and we are actively working with them to help resolve their issues," it said.

      "All other trading customers and vendors are successfully trading on the new platform, benefiting from Millennium Exchange's superior functionality and speed."

      Emphasize mine. Looks to me like the exchange is getting the flack for a couple of amateurish (or saboteurish?) vendors.

      --
      Life is Reality
    4. Re:Testing? by Anonymous Coward · · Score: 3, Funny

      When Apple released a phone they hadn't tested and it turned out "holding it a certain way" prevented calls, it suddenly changed to "holding it wrong", thus blaming the user. No scope for that here?

    5. Re:Testing? by Anonymous Coward · · Score: 2, Insightful

      First off, you can have 15 YEARS To test, if you are understaffed and underskilled you will discover jack squat.

      Second, THIS news comes out three days after Slashdot wrote what amounts to a slam piece on .Net's failures to handle this business. Hmm, what happened here? These problems are noted in the title of the article itself as emerging after Linux/C++ solutions were brought to bear.

      Lets face facts, the core of EVERY issue (the failures in the .Net system that led to its replacement, the failures we're witnessing now, the failure at switch, the failure to test properly, notice the trend?) isn't the language/platform in question but the general incompetence with which the issue is being handled.

    6. Re:Testing? by CaymanIslandCarpedie · · Score: 4, Insightful

      I think you can take off the tinfoil hat on the sabotage thoughts. For one LSE's own quote providing solution is experiencing the same issue. So unless they have some diabolical plan to sabotage themselves, I think that fact should rule that out. Also, this is effecting large players. These are billion dollar companies whose sole propose is to provide accurate data and provide it fast. Without that they simply don't exist. You'd have to live in a very conspiracy theory driven existence to think they'd all really all throw their own companies and livelihoods away in some dark plan to get the LSE or Linux.

      Now would all these large players all have made the exact same errors in their interfaces so they are seeing these same issues while the smaller players seem to have got it right? Sure. But the simplest explanation, of course is they are all experiencing the same issue because of some upstream problem with scaling to their large volumes. Those actually involved seem to point to some caching issue. Is that true? No clue, but they are certainly in a better position to offer analysis of the issue than you or I. That doesn't mean Linux is bad for christ's sake (so you don't have to go on some crazy conspiracy hunt to explain it away). If it is true (and we certainly don't know that it is, though the information currently available may point that way), you know what? There is software which just happens to be running on Linux, which may have an issue. No big deal. Happens every day on every platform. Not some big black eye on Linux if some software that happens to run on Linux has issues and no need to start jumping to conspiracies to explain it away.

      --
      "reality has a well-known liberal bias" - Steven Colbert
    7. Re:Testing? by johnsnails · · Score: 1

      why isint this modded funny? I found it funny... or at least informative.

    8. Re:Testing? by $pace6host · · Score: 1

      Emphasize mine. Looks to me like the exchange is getting the flack for a couple of amateurish (or saboteurish?) vendors.

      So you mean they asked their vendors to change a communication protocol and never tested before day 1 ? Honestly, that not how you test such a development. Sure, one can blame vendors for not correctly implementing a spec, but this is regular, expectable error in the software world. The real mistake is the lack of a test with the various sources.

      Huh? Did you skip the earlier sentence?

      The LSE is taking the position that its data feeds are working correctly. Industry sources said the exchange had placed a great deal of emphasis on the launch, which was largely providing successful high-speed trading, and that it had allocated sufficient time - 15 months - for the vendors to be fully prepared for the new system.

      I think he means it was available for test for 15 months. I don't think the LSE can force the vendors to use the new protocol, but giving them 15 months to do so seems fairly reasonable. On the other hand, it could be that the quantity of testing was fine, but the quality (as in how accurate with respect to important things like scaling) was poor. You can test for years, but if your testbed doesn't look anything like the real environment, what are you really testing?

    9. Re:Testing? by smallfries · · Score: 1

      Also, this is effecting large players

      Really? I hadn't heard about that aspect of the story. So previously there were only small players, but the effect of this bug is causing large players to be created. Gosh, I hope they are not also affected by the problem.

      --
      Slashdot: where don knuth is an idiot because he cant grasp the awesome power of php
    10. Re:Testing? by ISoldat53 · · Score: 1

      How did it work before? Did the old system run with zero errors?

    11. Re:Testing? by spydum · · Score: 1

      This is a common problem when you operate infrastructure that interfaces with multiple vendors. Sometimes you just have to forge ahead and fix the problems as they come up. I tend to agree with your assumption: lazy vendors not getting on the ball and doing their share of the legwork to stay compatible with the upgrade, who then spun stories about how the LSE's new system is "broken".

    12. Re:Testing? by mysidia · · Score: 1

      So you mean they asked their vendors to change a communication protocol

      More like they told vendors they need to change a communication protocol. It's not as if the exchange has to ask the vendors' permission.

      and never tested before day 1 ?

      If you think "15 month test period" counts as "never tested before day 1", then sure. Seems like the vendors had plenty of time to get their act together.

    13. Re:Testing? by hesaigo999ca · · Score: 1

      Unless they had bosses that were under the gun and willing to take that risk, and no matter how many time the architects told them not to do it, they said cut the corners and let's get it out there....happens to me all the time, and if it fails, guess who gets thrown under the bus.

  2. Re:Linux fails... AGAIN by Nuno+Sa · · Score: 2

    Google, Amazon, IBM, Redhat, ... Take your pick :-)

  3. Off by Anonymous Coward · · Score: 3, Funny

    Turn it off and on again.

    1. Re:Off by 93+Escort+Wagon · · Score: 2

      Turn it off and on again.

      Jiggle the cord!

      --
      #DeleteChrome
    2. Re:Off by mindwhip · · Score: 1

      defrag the disks!

      --
      [The Universe] has gone offline.
    3. Re:Off by justforgetme · · Score: 1

      dial 0

      --
      -- no sig today
    4. Re:Off by mysidia · · Score: 1

      Have you tried forcing an unexpected reboot?

  4. Re:Linux fails... AGAIN by TooMuchToDo · · Score: 5, Informative

    Don't forget the US DoD, The US Navy Submarine Fleet, the FAA, and Cisco:

    http://www.focus.com/fyi/information-technology/50-places-linux-running-you-might-not-expect/

    I don't see IBM running Watson on Windows 2008 Server because, well, you know....they want it to work.

  5. either sympathy or accusation by DreadPirateShawn · · Score: 5, Insightful

    My heart goes out to the devs "working long hours and night shifts" to suss this out.

    That being said, the line that catches my eye most is: "The fact the majority of smaller vendors were fine demonstrated that those having trouble had made mistakes." In my experience, that means one of two things:

    1) The devs configuring the system didn't properly account for the sheer scale of the stress on their systems.
    2) The smaller vendors took the change more seriously, and being smaller and more flexible, successfully updated their systems to interact properly with the new systems.

    Or, of course, both.

    1. Re:either sympathy or accusation by micheas · · Score: 4, Interesting

      Or the big vendors were doing excessive cost cutting to maintain profits during the recession.

      As long as this is just a one weekend nightmare, the execs that ordered the QA not done are looking like great business monkeys. they took a risk that cut costs and managed to defect the blame to the exchange when it blew up, and it looks like nothing bad will happen to the big vendors, unless one of the small vendors gets a really unethical sales monkey that steals a large chunk of business from a couple of the big vendors.

    2. Re:either sympathy or accusation by jmorris42 · · Score: 3, Interesting

      These are teething issues. And it apparently worked for the smaller traders/vendors so it looks more like a problem at the trader's end in that they didn't get their crap straight. But when changing out a large system like that it isn't that incredible there were problems not caught in testing.

      > I don't blame Linux at all, but nor was I partisan fool enough to blame Windows or .NET for a poor implementation of a trading tool.

      Of course Linux isn't to blame, unless they turn up kernel or other low level library problems, that system is a mass of spanking new code running atop linux. But we already know Linux can handle the transactional load reliability requirements and real time needs of a stock exchange since several are already doing it. The question is whether the guys working for the London Exchange have good enough code-fu.

      But apparently they blamed the performance of .NET/Windows or the cost benefit of .NET/Windows otherwise they wouldn't have embarked on an expensive rip and replace operation in the first place. Personally I'm shocked .NET ever managed such a task in the first place, even poorly.

      --
      Democrat delenda est
    3. Re:either sympathy or accusation by micheas · · Score: 1

      It is not at all clear that there was/is a problem with the new system.

      What is clear is that if you unfortunate enough to have signed up to 500 quid per workstation to Thomson Reuters per month you got garbage quotes from Thomson Reuters.

      Personally if I was one of the people that received garbage quotes / no quotes at those price levels I would be looking for a new vendor and talking to a solicitor.

    4. Re:either sympathy or accusation by fatp · · Score: 1

      Maybe the smaller vendors have fewer customer so there is smaller chance to detect the problem.

      Also, it said "the majority of smaller vendors", not all smaller vendors were fine.

    5. Re:either sympathy or accusation by jgrahn · · Score: 2

      I happen to know that the TR system is .NET/C++, so their problems might be related to known issues of TCP/IP message passing from Unix/BSD stack to a Win stack. Let's just say you can get the most mysterious error messages in .NET. Of this, I have first hand experience in my own projects.

      I really doubt the Linux IP stack (which isn't BSD-based) has problems talking to the Windows one. And even if it did, it couldn't have nothing to do with .NET (unless .NET comes with its own IP implementation).

      What *are* these known issues, according to you?

    6. Re:either sympathy or accusation by IainCartwright · · Score: 3, Insightful

      I can well believe you have had problems in your projects if you think that there are "known issues of TCP/IP message passing from Unix/BSD stack to a Win stack".

    7. Re:either sympathy or accusation by Anonymous Coward · · Score: 5, Interesting

      As I work for one of the vendors cited in the article, I can assure you that the reason is freaking simple: the systems that consume the feeds to move them into the trader's desktop are absolute mess of epic proportions riddled with mind boggling wtfs. Missing data, blank values, wrong numbers are common on those systems (root cause are mainly race conditions everywhere, and assumptions about missing data), assembled from random left and right acquisitions. There is no way to really know how they will behave under stress, and corporate culture will not allow managers to actually fix the issues (extremely costly, and corporate bonuses are not aligned to quality).

      In fact, it is much cheaper for everyone to just pretend this is a one time occurrence, to fire a couple of middle managers and continue frantically fixing bugs in the mess until it sorta works.

    8. Re:either sympathy or accusation by Evildonald · · Score: 1

      Those poor devs, who are earning 500-800 pounds a day. My heart goes out to them too.

    9. Re:either sympathy or accusation by Anonymous Coward · · Score: 1

      Unethical? Why couldn't they steal lots of business from the big guys by showing their customers that they handled the transition properly while the big guy botched it? Why would unethical sales be needed to steal business from an apparently incompetent incumbent?

    10. Re:either sympathy or accusation by Svartalf · · Score: 1

      Really? I thought it more along the lines of 270 pounds or so. That was about what I was getting paid when I worked for one of those companies doing this. Of course, that's converting it from the $440 a day I was making there in the States.

      --
      I am not merely a "consumer" or a "taxpayer". I am a Citizen of the State of Texas
    11. Re:either sympathy or accusation by LO0G · · Score: 1

      To be fair, NT4 service pack 1 had serious issues communicating with *nix systems over the internet. Of course that was in 1995 and the issues were fixed in NT4 SP2 (which was rushed to production to fix the issues).

      But hey, just because an issue was fixed 15 years ago, it shouldn't stop an anti-windows rant, should it?

    12. Re:either sympathy or accusation by Jayson · · Score: 1

      800 pounds seems about right. $440/day is only $88,000/yr. I made $480,000 last year doing HFT core and strategy coding. Of course my salary is technically only $100,000/yr and the rest is in bonus. I'm hoping to crack $650,000 this year since my strategies are producing better PNL and my Sharpe ratio is stratospheric.

      I will never understand the Slashdot and geek dislike of HFT. It pays coders more than any other industry, and you get exactly what you deserve. But I guess a lot of people here are lazy tools that expect the world to just hand them a living.

    13. Re:either sympathy or accusation by Timothy+Brownawell · · Score: 1

      I can well believe you have had problems in your projects if you think that there are "known issues of TCP/IP message passing from Unix/BSD stack to a Win stack".

      Oh, but there are issues. One side does "write(sockfd, &myobject, sizeof(myobject))" and the other does the equivalent (which .NET makes far more difficult that it needs to be, so I haven't room to write it here), and because the platforms are incompatible your program usually just crashes.

      Much better to use SOAP (or even just XML over HTTP if you have truly extreme performance requirements) and completely avoid all that TCP/IP mess.

      .

      Note to the humor impaired: you have no sense of humor.

    14. Re:either sympathy or accusation by ArchieBunker · · Score: 4, Interesting

      We don't like High Frequency Trading because it fucks up the economy for everyone else.

      --
      Only the State obtains its revenue by coercion. - Murray Rothbard
    15. Re:either sympathy or accusation by Jonboy+X · · Score: 1

      I think we'd all agree that a crappy app atop a good OS can lead to a crappy experience. Ergo, a crappy experience does not imply a crappy OS.

      --

      "In a 32-bit world, you're a 2-bit user. You've got your own newsgroup, alt.total.loser." -Weird Al
    16. Re:either sympathy or accusation by ToasterMonkey · · Score: 1

      But apparently they blamed the performance of .NET/Windows or the cost benefit of .NET/Windows otherwise they wouldn't have embarked on an expensive rip and replace operation in the first place.

      Or instead of saying geez, we need to replace our OS, they said, we need to replace our trading platform, and got a good quote for one that happened to use Linux.

    17. Re:either sympathy or accusation by smellotron · · Score: 1
      Let me add another (unfortunately realistic) possibility to your list:

      1) The devs configuring the system didn't properly account for the sheer scale of the stress on their systems.
      2) The smaller vendors took the change more seriously, and being smaller and more flexible, successfully updated their systems to interact properly with the new systems.

      3) Documentation provided to vendors is either incorrect or sufficiently vague that some software shops interpret the spec incorrectly and end up depending on the equivalent of "undefined behavior". The behavior appears correct up until the minute that someone with a fat wallet looks at it—at which point everything breaks.

    18. Re:either sympathy or accusation by smellotron · · Score: 1

      write(sockfd, &myobject, sizeof(myobject))

      Much better to use SOAP (or even just XML over HTTP if you have truly extreme performance requirements) and completely avoid all that TCP/IP mess.

      Dammit, you really had me going!

    19. Re:either sympathy or accusation by mysidia · · Score: 1

      unless one of the small vendors gets a really unethical sales monkey that steals a large chunk of business from a couple of the big vendors.

      Is it really unethical if the sales monkey tells the truth about big vendors' incompetence to grab a large chunk of business from big vendors?

    20. Re:either sympathy or accusation by badkarmadayaccount · · Score: 1

      Need a migraine pill for that Wi-Fi headache of yours?

      --
      I know tobacco is bad for you, so I smoke weed with crack.
    21. Re:either sympathy or accusation by IRWolfie- · · Score: 1

      With TCP one write does not correspond to one read on the other end.

  6. Re:Linux fails... AGAIN by Anonymous Coward · · Score: 3, Funny

    The amusing part of this troll is that it would be modded up to 5, instantly, if it were attacking Windows. The day when I see a pro-linux comment modded "troll" is the day that I take anything posted on Slashdot seriously. Until then, it's just self-important, self-congratulatory, mindless groupthink.

  7. Re:Linux fails... AGAIN by Nuno+Sa · · Score: 1, Interesting

    Btw, they are _upgrading_ to Linux, because the previous system (Windows and .net) failed several times, required MW of energy and was slow.

    This new _upgraded_ system is many orders of magnitude faster.

    Unfortunately, it appears, it wasn't properly tested... WTH? I'd expect for the system to run only 100x faster, instead of 1000x faster, during the first few weeks. Or maybe that some API from 1990 stopped working. Wrong data is just too much. Maybe the story isn't telling all the facts? ...Hmmm... Maybe the affected clients are running Windows?!!!!!!

  8. Re:Linux fails... AGAIN by MichaelSmith · · Score: 5, Interesting

    I work on large scale air traffic control systems which run Linux and I don't envy the LSE in their task. Most of our interfaces are relatively simple and go out to organisations with a good history of validating interfaces. This trading system seems to have to interface to a lot of little offices around the place running various implementations. Its no surprise some of the interfaces weren't tested to the point where they are known to work 100%, though they may be 100% correct.

  9. If only they had thought of that by raftpeople · · Score: 4, Funny

    Surely they would have run test data before letting it go live. Maybe even feed it the actual data and simply not publish its results.

    Quote from the project manager:
    "Dang. If only we had read Tubal-Cain's post before going live. Who would have thought to run test data through this darn thing?"

  10. Re:so... by darth+dickinson · · Score: 1

    No they had to recompile the kernel with the --WORK_NOW_DAMN_YOU flag enabled, but not until after they had downloaded the latest kernel source and repackaged it for their current distribution-of-the-month. After working out all the dependencies, and *their* dependencies, of course.

  11. Quote Services by Anonymous Coward · · Score: 5, Informative

    Article is a a bit misleading....
    Reuters, Netbuilder et al are quote services that run externally and fetch quotes (price/volume) from the LSE and are *not* part of what would be considered the main trading platform. The problem isnt the actual OS (Linux) or tradiing platform, which has been heralded as a big step forward from the MS solution, but rather how the resulting trade data that is pushed from the exchange to the 3rd party sites and then ultimately to the end users.

    1. Re:Quote Services by Anonymous Coward · · Score: 2, Informative

      Article is a a bit misleading....
      Reuters, Netbuilder et al are quote services that run externally and fetch quotes (price/volume) from the LSE and are *not* part of what would be considered the main trading platform. The problem isnt the actual OS (Linux) or tradiing platform, which has been heralded as a big step forward from the MS solution, but rather how the resulting trade data that is pushed from the exchange to the 3rd party sites and then ultimately to the end users.

      BTW... these protocols while well documented are complicted. To get an idea, check out either the NYSE ArcaEX Direct API or the NYSE Arca FIX specs

      If someone could point out the LSE equivalent, I'd be interested in seeing the differences

    2. Re:Quote Services by Anonymous Coward · · Score: 5, Interesting

      Hi,

      The NYSE Arca FIX docs are not quite relevant to this - as what we're talking about here is the market data API.

      The FIX protocol (although it does support market data transactions), is typically not the protocol of choice for market data. Typically it'll be non standard provider specific protocols which are optimised for delivering partial updates of information in a very bandwidth efficient manner.

      In the parent post someone mentions that Reuters, Netbuilder etc fetch quotes - which is also not entirely correct either. They accept push feeds from the exchange, which contain tick updates to the information about the instruments traded on that exchange. The generation of these ticks is triggered when a match occurs on the order book - which may (or may not) involve a change in price data. They then typically augment this information with other data - such as other identifiers, or derived information such as previous days close value / percentage etc.

      It is with these derived fields that the ongoing issues seem to lie, although for a certain period of time, the exchange was actually pushing out zero values for the bid/offer fields. As you can imagine, these plays merry hell with systems that attempt to drive automatic execution based on current price.

      I am informed by Netbuilder that they will have pushed a final fix to live within a subset of their servers - but it will probably be Tuesday before this is available across their whole system. This is over a week since the original identification of the issue, not good!

  12. Re:Linux fails... AGAIN by perryizgr8 · · Score: 2

    using a solid os won't protect you from human stupidity. i thought that was a given. perhaps letting microsoft setup everything would be better for stupid people like lse. more expensive but atleast no stupid errors.
    meanwhile, intelligent people continue using linux to their advantage.

    --
    Wealth is the gift that keeps on giving.
  13. And I thought my bug was bad by JazzXP · · Score: 2

    I thought I'd stuffed up big time when I hooked all of our users (around 1000 users at the time) up to test market prices rather than live data (Australian Stock Exchange) over market opening one morning (our product is aimed at day traders too). My stuff up seems minor in comparison to this.

    1. Re:And I thought my bug was bad by phantomfive · · Score: 1

      What happened to you after you did that? Reminds me of one time when our project manager (he was a decent programmer, too) was digging around our Oracle database install and found some useless 'backup' files and just deleted them. Somehow immediately after that, all customer data was gone. The ironic thing is we were an automatic backup company, and we didn't keep any backups.

      --
      "First they came for the slanderers and i said nothing."
    2. Re:And I thought my bug was bad by JazzXP · · Score: 1

      Not much really (had to buy the company donuts as punishment), but policies were changed to not allow changing our production network without going through proper channels. We had a similar database thing too where somebody accidentally removed all historical data for a project that was about to be rolled out, and due to the size, the techs had decided not to back it up. Yeah, policies are much better these days - major issues like this don't happen any more.

  14. Data by gravel+junkie · · Score: 1

    Data is always seems to be the least respected item in the IT stack (TBV). Yet its the lifeblood. Referential integrity, blah. At your peril.

  15. The offending code by billcopc · · Score: 1, Funny

    The offending code was traced back to :

            $stockprice = power( 2, rand()*10 + 0.0001 );

    Data vendors quickly addressed the issue by cutting the LSE out entirely, and rolling their own rand() function.

    Fucking stock markets... .what a joke!

    --
    -Billco, Fnarg.com
    1. Re:The offending code by mindwhip · · Score: 1

      You got that wrong... they use SuperRand()...

      http://thedailywtf.com/Articles/SuperRand.aspx

      --
      [The Universe] has gone offline.
    2. Re:The offending code by smellotron · · Score: 1

      $stockprice = power( 2, rand()*10 + 0.0001 );

      I considered arguing with your example on the basis of the presence of floating-point (prices are discrete - any sane market uses fixed-point integer arithmetic). However, since you did not provide a complete running program I have to admit that it's possible that your code is C++ and rand() is a function that returns an object, and the object's member function operator+(float) simply returns *this as a red-herring to confuse the interns.

  16. Why Am I Not Surprised by oakwine · · Score: 1

    On the other hand, a moon mission interfaced a host of complex systems including the lives of the astronauts. Difference is the level of integration of QA into the project.

  17. Effect on stockbrokers by Anonymous Coward · · Score: 5, Informative

    As someone on the receiving end of this problem (as head of Development for one of the largest UK stockbrokers), it has been immensly frustrating trying to get an admission of blame from anyone involved in this issue. Our clients are complaining to use because elements of the data we're displaying on our site are just so out of whack they are laughable. In the end we've just had to remove those elements of data from our website, and they remain removed until the fix is confirmed working on Monday. This is after several attempts to fix the issues during the week.

    It's also causing serious issues for things like our black box trading engine which is used to drive automatic execution of client orders when the price moves into range for execution. The data being bad means these systems just have to be turned off, and clients are being advised to simply delete their orders.

    This is, frankly, a pretty appalling situation as the service to our clients is being impacted pretty severely.

    For an exchange that is attempting to position itself as the true 'global' exchange, this is doing an immense amount of damage to it's reputation.

    1. Re:Effect on stockbrokers by kievit · · Score: 1

      Did you check your data during the 15-month testing period, and were the errors also apparent during that time? If yes, did you report the issues or take other actions to get the problems fixed during the testing period?

      If you did not check your data during the testing period: why not?

    2. Re:Effect on stockbrokers by Asic+Eng · · Score: 1

      "What is more important, an admission of blame or a fix?"

      Important for whom? His employer mainly needs the fix. However, closely followed by that they need to figure out whether they have the right people for the job. If they don't, then issues like this will keep appearing.

      He, on the other hand mainly needs the admission of blame because he wants to keep his job. He also needs a fix so that the company can keep paying him, but that's only relevant if they don't fire him.

  18. I mean by Dunbal · · Score: 1

    It's only a stock exchange. Why the fuck should they bother testing it first, right? This just goes to underline that QA does not exist in the software world.

    --
    Seven puppies were harmed during the making of this post.
    1. Re:I mean by rrossman2 · · Score: 1

      They did. They had 15 months of/to test. It's right in the article

    2. Re:I mean by Zoxed · · Score: 1

      > This just goes to underline that QA does not exist in the software world.

      My 24 years in the software industry tell me that QA is available: the customer just has to be prepared to pay for it !

    3. Re:I mean by jo_ham · · Score: 1

      It's only a news discussion site. Why the fuck should they bother reading the article before commenting on it? This just goes to underline that reading comprehension is non existent in the slashdot world, yet still they believe they have all the facts and can pass judgement.

    4. Re:I mean by Dunbal · · Score: 1

      We have a different definition of testing. Testing != compile it and see if it runs. Testing != hiring some minimum wage loser to tell you exactly what you want to hear.

      --
      Seven puppies were harmed during the making of this post.
  19. Re:Effect on stockbrokers (postmortem) by realxmp · · Score: 3, Interesting

    As with most major issues there's bound to be a big ol' postmortem on this. As head of Dev you've probably got a unique insight into this, I'm curious as to your perspective on this, what you think the cause of failure might be? More strategic or more technical? Poor interface specification? Inability to handle queries under full load? From TFA there was supposedly 15 months of testing. So I'm curious as to why it failed, whether the testing simply wasn't realistic and/or thorough enough, or it was something that just wouldn't come up except on the live system?

  20. What is your biggest mistake? Monetarily... by Colin+Smith · · Score: 1, Offtopic

    I of course don't make mistakes, but what do you think would happen if for example you wiped 8.5 billion dollars off of the value of your company?

    http://finance.yahoo.com/q/bc?s=NOK+Basic+Chart&t=3m

    Do we have a world record here Mr. Elop?
     

    --
    Deleted
    1. Re:What is your biggest mistake? Monetarily... by WalkingBear · · Score: 2

      The drop in stock price after the announcement of the Nokia Microsoft venture was sharp for a single day. It's also one of the highest volume days in the last 5 years.

      However, the drop is insignificant compared to the drop from late 2007 to now. Market Cap at Nokia is 1/4 what it was. Stock prices have dropped from mid 40's / share to ~ 10/share.

      Besides, the 8.5B drop in market cap isn't nearly as important to the bottom line as the billions in cash coming in from Microsoft as part of this deal.

  21. what does that have to do with Linux? by t2t10 · · Score: 3, Insightful

    If it didn't crash and didn't drop its network connections, Linux was doing its job.

    If the application software had bugs, then the application software developers are to blame.

    1. Re:what does that have to do with Linux? by $pace6host · · Score: 2, Insightful

      If it didn't crash and didn't drop its network connections, Linux was doing its job.

      If the application software had bugs, then the application software developers are to blame.

      Even if the OS itself isn't at fault, don't think that someone with an axe to grind won't blame it. You'll just hear something like "It's Too Hard to develop on Linux. A platform with more / better tools would have made it easier to develop and comprehensively test." I don't buy that either (regardless of what price you quote me! ;) ), but I'm sure someone will be saying it. Especially someone in marketing.

    2. Re:what does that have to do with Linux? by Blakey+Rat · · Score: 2, Insightful

      But when they had problems with the Windows-based system, obviously it was the fault of the operating system! Right? I seem to remember that being the case.

      Because Slashdot ain't anything if not hypocritical!

    3. Re:what does that have to do with Linux? by t2t10 · · Score: 1

      If it was OS crashes, yes, it was a Windows problem.

    4. Re:what does that have to do with Linux? by judeancodersfront · · Score: 1

      How dare you question the group think of Slashdot! Go hang out at the Microsoft section if you don't like it....oh wait it's the only tech site that doesn't have a Microsoft section.

  22. Re:If it was a Windows implementation... by t2t10 · · Score: 1

    Let me clear that up for you: system crashes, outages, BSODs -- fault of the operating system; bad data, bad XML, bad UI -- fault of the application programmers and application platform.

    Most likely, this was Java programmers screwing up.

  23. Nothing to do with Linux by Wowsers · · Score: 2

    An attempt to rubbish Linux.

    After 20 seconds they should have realised that they should have tested more, and fired the programmers for allowing such mistakes. This is nothing to do with Linux.

    You'd think the LSE would have learnt from their last computer system rollout which also had massive problems. No quality control = management problem.

    --
    Take Nobody's Word For It.
    1. Re:Nothing to do with Linux by JonySuede · · Score: 1

      The lack of testing, in a corporate setting, is not the programmers fault, it is a management failure and as such the managers should be fired.

      --
      Jehovah be praised, Oracle was not selected
    2. Re:Nothing to do with Linux by cynyr · · Score: 1

      Well, is it was caused by bugs in .Net's XML tools? sure MS's fault. Remember that linux is just a kernel, and the rest of the GNU software may or may not be running on this beast and most of the important software(the push server thingy) is likely to be custom in house stuff.

      --
      All of the above was encrypted with a Quad ROT-13 method. Unauthorized decryption is in violation of the DMCA.
    3. Re:Nothing to do with Linux by Dr_Barnowl · · Score: 1

      Indeed. If the coders are anything like myself, they are doing one of two things ; cringing uncontrollably because a piece of code that they wrote is causing problems. Or genuinely not giving a rats arse because they were ordered not to do an appropriate level of testing by the immediate management who feared that they would miss a deadline.

    4. Re:Nothing to do with Linux by judeancodersfront · · Score: 1

      Who do you think you are fooling?

      Any Windows solution reported here will receive bashing before the details are even provided. This is Slashdot, the group think opinion here is that Windows Server will crash in 5 minutes and any website you host will be attacked overnight. Oh and underneath it is DOS based and requires 12 cpus and 10 gb of RAM to do the same work as Linux running on 486.

  24. Re:Linux fails... AGAIN by Hylandr · · Score: 1

    Sure wish I had mod points, this man knows how to think.

    - Dan.

    --
    ~ People that think they are better than anyone else for any reason are the cause of all the strife in the world.
  25. Re:Look closely at the vendors having issues. by Jasonjk74 · · Score: 1

    Aaaaannnd.... there it is! I was about to write a post saying that I wonder how someone would try to turn this into an apropos-of-nothing, Microsoft-bashing event.

  26. Re:Look closely at the vendors having issues. by $pace6host · · Score: 1
    Well, eventually the root cause(s) will be found - and if the faults lies within their own systems, who will be embarrassed? It would seem to be an extremely poor strategy in the long term to do something like that. I suspect that it will turn out to be run-of-the-mill variety bugs in code, or in configuration, that managed to slip through on both sides, and were not caught previously perhaps as a result of testing with insufficient load. It may be that the larger vendors, having more complicated networks and more volume, were just the most vulnerable to having bad configurations. This just points out how important it is to test, test, test, test, test, test some more, and then test again - but most importantly, test realistically, with realistic configurations and realistic loads. Especially there is something valuable depending on the result, like lives, or the financial well being of millions of trusting pensioners.

    Good luck to the developers and integrators who will undoubtedly be working 16-hour days until this is fixed.

  27. Troll summary is troll. by GNUALMAFUERTE · · Score: 2

    That quoted phrase 'emerged at linux launch' is NOWHERE in the article. The article says "London Stock Exchange price data failures ‘emerged immediately at Millennium launch’". This is obviously an application issue, why is the summary sort of blaming the operating system?

    --
    WTF am I doing replying to an AC at 5 A.M on a Friday night?
  28. Re:Linux fails... AGAIN by Desler · · Score: 1

    Those *are* facts. The new system is better in every metric. The old system failed several times. Once it crashed and stayed down for the better part of the day.

    And yet you cite not a single source with your supposed metrics or stories showing it failed "several times". There is a single downtime that one can find any mention of. Care to provide actually evidence of your claims now instead of just asserting they are facts?

  29. Re:Should have kept Windows. by Ferzerp · · Score: 1, Insightful

    If this has been Windows story, there would be much frothing at the mouth and blame on Microsoft. Since it is Linux, the blame magically lies with the implementation.

    Hyporcrite much, Slashdot?

  30. Re:so... by cynyr · · Score: 1

    you repackage it?!!!??

    you just extract it(tbh anywhere you like, but convention says /usr/src/linux-$VERSION-$PATCH/) and then "cp ../$OLDKERNEL/.config ./ ; make oldconfig; make ; make modules_install; mount /boot; cp $ARCH/boot/bzImage /boot; $EDIT_BOOTLOADER_CONFIG; shutdown -r now"

    On most binary distros you need some sort of dev tools, that changes based on distro, but should be simple.

    P.S. Has Ubuntu fixed the deps on the kernel sources to include the build-utils yet? cause it's not like all i want to do when i install the sources is look at them.

    --
    All of the above was encrypted with a Quad ROT-13 method. Unauthorized decryption is in violation of the DMCA.
  31. LSE is a buerocratic mess by alexmin · · Score: 2

    No surprise here for anyone working with them. Their marked data specs have hundreds of pages while Chi-X EU have about 20. Chi-X EU quoting and trading volume is far greater. Coincidence? I say no - people voting with their feet and going from LSE to Chi-X EU (biggest trading platform in EU now), BATS EU, and other MTFs. Maybe the only thing keeping LSE still afloat is a fact that FTSE 100 index is computed using LSE prices. Once that gone, potentially as soon as MIFID II comes out, LSE is going to circle in the crapper.

    1. Re:LSE is a buerocratic mess by lisa.crockett · · Score: 1

      we should expect MIFID II to come out as soon as possible.

    2. Re:LSE is a buerocratic mess by F.Ultra · · Score: 1

      You are thinking about the old Infolect system, the new LSE system, Milenium, is using the same market data feed as CHI-X (Itch)

  32. Re:Linux fails... AGAIN by Smallpond · · Score: 1

    I work on large scale air traffic control systems which run Linux and I don't envy the LSE in their task. Most of our interfaces are relatively simple and go out to organisations with a good history of validating interfaces. This trading system seems to have to interface to a lot of little offices around the place running various implementations. Its no surprise some of the interfaces weren't tested to the point where they are known to work 100%, though they may be 100% correct.

    According to the article, the little organizations are running fine -- it's some of the big company systems that are having problems. My guess would be that their systems are so large and complex that they haven't been able to do sufficient testing, and that they weren't able to make the changes that they needed to make because the systems are all in use 24 x 7.

    Its like the story of why a heart surgeon is paid more than a car mechanic. Try fixing a car engine while its running.

  33. Re:Linux fails... AGAIN by spiffmastercow · · Score: 1

    If it is orders of magnitude faster, that's likely due more to better algorithms than choice of platform. The .net platform is no slouch, and I would be REALLY surprised to see anything greater than a 300% speed up if the same system was merely reimplemented under Linux/C++.

  34. Re:Linux fails... AGAIN by Anonymous Coward · · Score: 2, Insightful

    I would wager that the vast majority of problems in Windows are also due to user error, yet Linux fanatics instantly claim it's the OS.

  35. I've seen banking information systems by Nicolas+MONNET · · Score: 1

    They typically have decent to excellent central systems, and it feels like they're treating it seriously and spending lots of money to maintain and upgrade them.

    Now at the periphery ... the interconnections, for electronic payment and so on, they're a disgusting mess. It doesn't help that some protocols are truly horrible and should have died 40 years ago, but even when they've moved into the IP era, they keep on using outdated shit and/or idiotic settings.

  36. Simulated data is not a panacea by perpenso · · Score: 2

    At very least you'd expect a large scale dry-run with simulated data that should bring these errors out before becoming live.

    Maybe. Such testing only tests the type of errors you have thought of and are anticipating. I was once blessed with a competent and skilled QA department and months of pre-release testing, simulation, fuzzing, beta testing, and yet when a product goes live to millions bugs become apparent. You just can't think of and test for everything in a sufficiently complex system. The real failure of this LSE rollout may have been not running the old and new systems in parallel while looking for discrepancies in the publicly reported data.

  37. Re:Linux fails... AGAIN by bmcage · · Score: 1
    The fact is they switched. What other evidence do you need? If it was just a problem with the programming language, then they would not need to change OS. If it was just because Windows is too expensive, they would run CentOS.

    Seriously, what other fact do you need?

  38. Re:Should have kept Windows. by grantek · · Score: 1

    Nah, everyone knows Windows is just for games.

    They should have gone with OS/2

  39. Re:Should have kept Windows. by smellotron · · Score: 1

    If this has been Windows story, there would be much frothing at the mouth and blame on Microsoft. Since it is Linux, the blame magically lies with the implementation.

    Nobody is blaming the LSE's implementation. The LSE (and MilleniumIT) has provided a new system and interface. Many third-party vendors (Reuters, Trading Technology, Interactive Brokers, etc.) write adapter software which consumes that interface and exposes it through their own standard (but usually slower or less powerful) interfaces. It is typically cheaper and easier for traders to use these vendor interfaces to connect to a variety of exchanges, instead of forcing every trader to adopt the raw (high-performance and bandwidth-intensive) interfaces provided by the LSE. Those vendors now have bugs in their software, but the LSE is taking the heat. That's not right.

  40. Re:the real stupidity shown here by smellotron · · Score: 1

    the stock market has to be dismantled

    Pull your money out of the stock market and convince all of your friends to do the same. Find a way to invest your money in a way that you feel is proper. If enough people reject the market in its current incarnation, change will be forced. If not, at least you know that you're doing what you can.

    Just don't get upset when many people disagree with you. We all choose our own poison.

  41. Re:Should have kept Windows. by mabhatter654 · · Score: 1

    when multi-million dollar CEOs don't get their trades spot on.. or have to wait. (AAAHHH!) they fling the blame around.. from the golf course, to the call girls.. word gets around.

  42. Re:Linux fails... AGAIN by badkarmadayaccount · · Score: 1

    Some of the more important algorithms are in the kernel.

    --
    I know tobacco is bad for you, so I smoke weed with crack.
  43. Re:Linux fails... AGAIN by badkarmadayaccount · · Score: 1

    Crappy memory management, application protection and limitation, backwards compatibility issues, kitchen sink architecture...

    --
    I know tobacco is bad for you, so I smoke weed with crack.
  44. Linux isn't the issue by woboyle · · Score: 1

    This is strictly an application server implementation issue, not a Linux one. They had more (though probably different) problems with their previous MS Server infrastructure, which is, as I understand it, the reason behind the migration to Linux. So, don't tie this fiasco to the use of Linux, but rather to inadequate testing of the new systems in a real-world load scenario. This is really difficult for large scale distributed systems such as this is. I spent a LOT of time and engineering effort to build a testing framework for a major manufacturing execution system. It took the efforts of a fair number of really talented software engineers to emulate and resolve many of the issues (race conditions, hardware/software failures, etc) before we had a dead-bang reliable system that can run a major semiconductor FAB on a 365x24 basis. You must start thinking about these things during the design phase - not after the system has been implemented.

    --
    Sometimes, real fast is almost as good as real-time.
  45. Re:Linux fails... AGAIN by LingNoi · · Score: 1

    That's the thing, Microsoft helped setup the original LSE. It was well publised in their own get the facts campaign on why Microsoft's solutions are better then open source.

  46. Re:Linux fails... AGAIN by spiffmastercow · · Score: 1

    Again, back to the "orders of magnitude" claim. 2 orders of magnitude (the minimum required to be plural) indicates a 100x increase in speed. That kind of increase doesn't happen from changing operating systems.

  47. Reputational Impact by dave87656 · · Score: 1

    Microsoft has has had a lot of bad press lately as linux and java seem to be eating its lunch in the new world (high-performance, web base computing, small form factor and mobile computing). I suspect these issues are true but exaggerated.

    Remember that the LSE had serious problems using the MS-based system before. Remember the curious wording of the recent problems with the .NET-based system. Somehow, they tried to link this to the fact that the Linux-based system was being tested (of course it had nothing to with that).

    Windows Phone 7 is seriously buggy (many basic functions still don't work). Android (java) and IPhone (freebsd) have a lock on the smartphone market and MS's feeble attempt is too little, too late. Selling low-quality crap only works if you have a monopoly and it must be hard for them to see that they can't compete in the new emerging markets.

    Even Microsoft has to provide its updates via Akamai linux-based servers. It's hotmail servers couldn't handle the load when switched to Windows Server and had to be switched back.

    All high-end Web-based services (facebook, twitter, google, yahoo, etc) are all run on linux or freebsd systems.

    Does it surprise anyone that an issue with a high-profile linux system is going to be big news for the Windows world?

  48. Re:If it was a Windows implementation... by judeancodersfront · · Score: 1

    When they had a custom .NET stack Windows was blamed even though the details weren't provided.

    Rules of Slashdot:

    If a Linux solution fails, blame everything but Linux.

    If a Windows solution fails, blame Windows and vote down anyone who asks questions.