Microsoft Azure's Southern US Data Center Goes Down For Hours, Impacting Office365 and Active Directory Customers (geekwire.com)
New submitter courcoul alerted us to Azure outage, which is affecting several customers in many parts of the world: Some Microsoft Azure customers with workloads running in its South Central US data center are having big problems coming back from the holiday weekend Tuesday, after shutdown procedures were initiated following a spike in temperature inside one of its facilities. Around 230am Pacific Time, Microsoft identified problems with the cooling systems in one part of its Texas data center complex, which caused a spike in temperature and forced it to shut down equipment in order to prevent a more catastrophic failure, according to the Azure status page. These issues have also caused cascading effects for some Microsoft Office 365 users as well as those who rely on Microsoft Active Directory to log into their accounts. The cooling system is the most critical part of a modern data center, given the intense heat produced by thousands of servers cranking away in an enclosed area. More resources: The official status page of Azure; and third-party web tracking tool DownDetector's assessment. Further reading: Microsoft Azure suffers outage after cooling issue.
D'oh!
sounds like a Mr Robot episode, did you scan the network for rogue raspberry pi's?
I do not like software that requires you to phone home to the mothership. The second something go wrong outside of your control it borks all your work. Office 365 is a bad joke if I have ever seen one.
Aside: Yes I know video games do this a lot but games are games and work is work.
My employer was affected. Many employees could not authenticate to our third-party webapps because we use whatever the cloud-provided Active Directory SSO solution is. Ah, well. I wonder if this violated SLAs and we get some money back... My company is always concerned about not violating our SLAs to our customers (Saas), so hopefully we extract the same pound of flesh from our vendors.
4.5 hours downtime for us. Thanks Microsoft.
I don't think that word means what they think it means. They need to rethink their distributed model if one data center takes down customers. Isn't the pitch for those services that they basically bulletproof for businesses?
London Stock Exchange outage blamed on Microsoft
When all you have is a hammer, every problem starts to look like a thumb.
The cooling system is the most critical part of a modern data center, given the intense heat produced by thousands of servers cranking away in an enclosed area.
Hence the movement to ARM servers, and away from Intel.
The most ironic part is that the Azure Support Twitter account keeps pointing customers to the Azure status page. Which also happens to be down with 503 errors. Guess they could e-mail for support, unless they are using Office 365. Or request help via the Management Portal, but guess that's down too. lol.
Who wants the reliability of locally installed software when you can have Office 365 running on 'the cloud'. With Office 365 your staff can enjoy a nice break at your expense until the service comes back online. Think of the moral benefits this offers, and it's only available with 'the cloud'. As for Azure, what's a few hour outage if it's just some critical business systems you're running.
You don't want to have control over your IT infrastructure. It's far better to pay for subscription based services and let us fuck up on your behalf. And think of the cost saving. While an Office licence was around $200, now you can pay $1200 over ten years for something that's less reliable.
The cloud is your friend. Subscriptions are your friend. Keep paying the subscriptions.
Active Directory = DNS dependent as hell: Anyone want to take bets that DNS went down for AD & it all "shit the bed"? The Chinese know how to protect vs. it & WHO did it 1st: China or me? I did - dates are my proof http://theregister.co.uk/2017/...
* ... & it's NOT like China doesn't steal ideas - the FACT China rampantly STEALS U.S. Intellectual properties & military secrets IS proof enough too!
APK
P.S.=> I haven't read the article but I know enough about DIRECTORY SERVICES of this nation & they are DNS dependent + yes - DNS servers DO GO DOWN! apk
It is now "Office 364"
You have something that needs to be cool, and you put it in Texas?
Sent from my ASR33 using ASCII
Microsoft wants to be a cloud provider but they can't keep their infrastructure up worth a damned.
I honestly don't understand how they can be in second place considering how often they have major news-worthy failures.
I was unable to access some of my game saves for Fallout Shelter this morning saying the cloud sync was failing. (and it stupidly deleted the local save)
And this is why you don't keep critical files and documents only in the cloud.
Let me name just a few of the pet peeves I have with Asszure:
1) Can't rename a VM (or hell anything) without deleting it
2) Can't set the name of a VM (not the OS hostname, but the guest name) to anything more than 15 characters if it's a Windows VM
3) Can't peer with other people who aren't under your Azure Active Directory tenant
4) Functions take forever to execute. There's no reason it takes 10 goddamn minutes to just run a Restart command.
5) Can't use GRE
6) The whole blade/journey concept in the portal is fucking dumb
Microsoft fucked up with Asszure
I'm almost done setting up a hybrid configuration now :-(
Isn't this what backup generators and N+1 infrastructure is for? I can understand Joe's hosting and bait shop emporium going down, but power and HVAC are pretty well solved sciences. The weather in Texas is hot -- this is not a surprise. There are lightning storms in Texas, this is also not a surprise.
It seems like if you a positioning a data center in Texas (which there as some reasons for), you prepare for both heat and lightning. I could understand if there was an incredibly unusual weather event (asteroid landing on data center, or death rays from the moon) but hot is not unusual in Texas.
However, when major cloud service providers it does provide an excuse for everyone else who manages a data center -- even the biggest cloud provider occasionally has an outage, so when our data center has an issue it's no worse. We say thank you Microsoft/AWS/Joe's!
There is no cloud, just other people's computers.
Back in my day, we called this time sharing. Now you kids get off my lawn.
Obviously you know shit on directory services: Anything built on a directory service (NDS/AD) has DNS dependencies. I was speculating on that VERY POSSIBLE possibility is all (which is MORE than a dumbo shitbrain like you can manage, lol - that's certain).
* PLUS, Projecting you're broke AND on your "meds" (along w/ you PROVING you know SHIT on directory services too)?
LMAO!
APK
P.S.=> Figures you'd say something SO STUPID (that's you though) - especially on MY WORK users LIKE + USE by the 100's of 1,000's worldwide (you with ZERO to show for your big BLOWHARD mouth that spews crap, lol)... apk
I know! Lee's make our entire OS dependent on an active internet connection and permanent uptime of servers we don't control!
What could possibly go wrong?
Looking at you Microsoft...
Turned up the thermostat!!!
"The average reporter we talk to is 27 years old......They literally know nothing." - Ben Rhodes
Where I work all our source control is hosted by TFS on Azure. So all of our checkouts/checkins and code reviews are in limbo. Surely we aren't the only ones who have bought into the whole cloud idea. I may be too old fashioned, but I have a hard time putting all my eggs in someone else's basket. Besides, isn't the cloud supposed to prevent this from happening? I'm curious to know how many shops are affected by this.
My user name was a mistake. Input wasn't restricted, my bad.
It was the cooling equipment eh?
I think they are making excuses for patch Tuesday.... :)
"File to fit, pound to insert, paint to match" - Aircraft Maintenance 101
Live by the cloud, die by the cloud.
In them olden days, you know, back in the early 1990s, I used a mainframe terminal connection via microwave link with 200 other people to do my programming job. Whenever the network was down, we couldn't work. The microwave link was a shared 9600 baud for about 1/4mile to an office that had a wired link to the mainframe complex. This was in south Houston where heavy rain happens almost every afternoon for 30-60 minutes 4-6 months of the year. That is just the climate there. Heavy rain interrupts microwave data.
Live by the cloud, die by the cloud.
BTW, I self-host almost everything. Often when the internet connection is down, I won't notice because all the important stuff is locally hosted, including email.
Just sayin'.
Last week I moved some servers around:
Gosh, I bet Microsoft hates it when something critical for work unexpectedly becomes unavailable for some indeterminate period of time.
JUST LIKE WHEN WIN10 UPDATES, EH MICROSOFT?
My company has 77,000 O365 licenses and use it for our email hosting as well as storing documents on OneDrive for Business. I kept seeing warnings about the web page for o365 not working.
I would expect that a cloud vendor such as Microsoft or AWS would have multiple redundancies in place so that any one data center going down would not affect usage.
I just am still not a cloud fan. It works great but when it doesn't - there's nothing we can do.
The Kai's Semi-Updated Website Thingy
My favorite part of today was the executive screaming on the outage line about getting it back up. Sorry guys, you all put it in the cloud. Now you’ve had a longer outage than the last 5 years combined for the same price you paid for the people, process, and tech. At the same time they lost fileshares (which moved to OneDrive), workflow apps (which are on sharepoint), and every app you moved from the internal sso solution to azure ad. Super fun!
Impacting is a neutral word, except if it involves wisdom teeth.
In this case, is the outage resulting in a positive impact or a negative impact?
Some anti-MS posters would suggest positive.