Slashdot Mirror


MS, CNET On 7-Day Messenger Outage

imipak writes: "Microsoft have finally commented on the recent seven day outage at their Messenger IM service -- some users have permanently lost data, and there's still no explanation of the cause. Interesting earlier story from CNet News. Key quote: "... an outage that lasts seven days with no valid explanation really starts to make you think about .Net, and about Microsoft's plans for the Internet. What if this were the new Office software verification service that was down?"" Here 's a story on MSNBC as well.

10 of 249 comments (clear)

  1. I'd like to believe you, but ... by daviddennis · · Score: 5
    why, then, did VeriSign basically turn over its entire authentication process to Microsoft and start deploying Windows 2000 servers in its core business?

    See this report from The Register for the grisly details.

    I suppose you could say this is because VeriSign and Network Solutions are insane, deranged companies, and there is most likely truth to this. But I'm not convinced; I HAVE TO deal with these idiots for my domain names, and now I have to rely on .NET to do it. Ick.

    D
    ----

  2. Possible causes? by powerlord · · Score: 5

    Ya know there are two possible causes from the minute information they've released (It was caused by a freak failure when a hard disk controler crashed).

    1) Caused by a freak failure when a hard disk controler crashed.
    2) They've said they have to restore from backups.

    If both are true, then it sounds like they were using a distributed database (or filesystem?) and one machine going down very badly managed to infect lots of others... doesn't bode well, especially when MS's solution to competing in the Server environment is traditionally to Cluster lots of machines together. The more you have the more chance one may have problems.

    If the first statement is false, then the only thing I can think of is that the system was infected by either an outside source, or some other malicious virus. Standard Operating Procedure in this case would be to disconnect the machines, diagnose the problem (so new machines wouldn't be infected), and then restore from backup. Its also possible someone over-reacted and they went into this mode when in actuallity Item 1 was true.

    Anybody else think we're hearing the whole story?

    --
    This space for rent. All reasonable inquiries will be entertained at proprietors discretion.
  3. I'm often reminded... by Dr.Evil · · Score: 5

    when these kind of outages happen, of Peter Deutsch's 8 Fallacies of Distributed Computing:

    1. The network is reliable
    2. Latency is zero
    3. Bandwidth is infinite
    4. The network is secure
    5. Topology doesn't change
    6. There is one administrator
    7. Transport cost is zero
    8. The network is homogeneous

    This is, of course, why the idea of remote authentication being necessary to use your word processor is a bad thing. Heck, even losing something as innocuous as an instant messaging program brought thousands of people to a screeching halt for a week. It seems to me that Microsoft (although they're certainly not the only ones) seem to believe these 8 fallacies blindly, espcially 1, 4, and (they're hoping) 6.

    --
    Right...
  4. Absolutely no excuse for this. by Pinball+Wizard · · Score: 5
    ...permanently lost data

    I am utterly amazed at times the things I hear about how system administration is performed at MS. Ever check their jobs page? They're really picky about who they hire, you know.

    Yet we repeatedly hear about security problems with their own servers, how all their DNS servers were on the same network segment, hotmail goes down and now this? Lost data??!!!

    I'm sorry, but as a former full-time sysadmin, there is absolutely no excuse for losing data. Preserving your companies data is the #1 priority of any sysadmin, regardless of the company. And preserving data with 100% certainty is acheivable by anyone who takes the time to set things up right.

    Oh well, I was never a fan of their passport/hailstorm idea anyway. Things like this can only cause more people to run away from using those services.

    --

    No, Thursday's out. How about never - is never good for you?

  5. Feel My Pain! by TOTKChief · · Score: 5
    Not only has Microsoft been struggling to restore full service, but on Thursday the company also shut down MSN Messenger as it restarted the network of servers that handle messaging traffic. That "reboot" failed to immediately fix the problem. [Emphasis added.]

    Bet they know how I feel at work every day now...

  6. Letting others handle your data. by Traicovn · · Score: 5

    Think about Hobbes social contract.
    'People give up certain rights and freedoms for a feeling of safety etc.'

    This is the same sort of situation kinda. People give up having their own servers for communications and data storage in technologies like .NET. It is the companies responsibility to give us fair service, and tell us what's going on.

    If we do not like what's going on, it is our right and responsibility to seek alternatives.

    Your always going to risk loss of data and loss of service if you let someone else handle your data, communications, authorization, etc. It's a risk that you take. You hope that the company is able to do a good job and maintain good service. Remember, if you start using .NET and using all of the authorization features to access Microsoft's sites that require Passport/Messenger, just like in Hobbes social contract you are giving up some rights and some control. Your taking a risk. But remember, their are other choices.

    [Something witty and intelligent should have appeared here.]

    --

    [Something witty and intelligent should have appeared here.]
    {Traicovn}
  7. Post Mortem Summary (aka Wishfull Thinking) by Fatal0E · · Score: 5

    As a (curious) sysadmin I wouldn't mind reading a post mortem like what the /. crew did a few weeks ago. I think MS is missing out on a lot of brownie points by not publishing a blow by blow summary of how an enterprise goes about troubleshooting/fixing a system like that. It would be possible to do something like that w/o disclosing sensitive information. Like I said, wishfull thinking.

  8. In related news... by ryanvm · · Score: 5
    In related news:
    Recent surveys show that employees that use Microsoft's popular Instant Messenger software are having one of the most productive weeks in recent years.

    Now if only Slashdot would have a week-long outage, I could get some work done.

  9. You didn't get it? by fiber_halo · · Score: 5

    We sent out an instant message to all the users letting them know about the outage.

  10. People will still use .NET in droves by UberOogie · · Score: 5
    Remember when AOL had huge outages several years ago?

    Remember when users couldn't get through because there were busy signals all the time?

    Remember how people said that there was going to be a mass exodus from AOL?

    Remember how that didn't happen?

    No matter how badly MS screws this incident up, no matter how many judgements get made against them, the average business drone and Joe User will still end up using .NET.

    --
    "Enough of this wretched, whining monkey life." -- Marcus Aurelius, _Meditations_, Book 9, 37