The Schwartz Cloud Report

Blog archive

Microsoft Explains Recent Cloud Outage

Wondering what caused the outage that brought down some of Microsoft's cloud services two weeks ago? While Microsoft attributed it to a DNS error, that was all the company was saying at the time.

The three-hour outage occurred on the evening of Sept. 8 and affected Windows Live services such as SkyDrive, MSN and Hotmail for three hours. Also affected was Microsoft's Office 365 service.

Arthur de Haan, VP of Windows Live test and service engineering at Microsoft, elaborated on the incident in a blog post on Tuesday night, explaining a corrupt file in the company's DNS service was to blame.

Microsoft was in the process of updating a tool that helps balance network traffic and the update went awry, he noted. Consequently, the configurations were corrupted, resulting in the outage, he said.

"The file corruption was a result of two rare conditions occurring at the same time," de Haan said.  "The first condition is related to how the load balancing devices in the DNS service respond to a malformed input string (i.e., the software was unable to parse an incorrectly constructed line in the configuration file). The second condition was related to how the configuration is synchronized across the DNS service to ensure all client requests return the same response regardless of the connection location of the client. Each of these conditions was tracked to the networking device firmware used in the Microsoft DNS service."

He said Microsoft intends to further harden the DNS service to by providing greater redundancy and failover capability.

Posted by Jeffrey Schwartz on 09/21/2011 at 1:14 PM


Featured

  • Get More Out of Your Outlook Inbox with TakeNote

    Brien comes across a handy, but imperfect, feature in Outlook that lets you annotate specific e-mails. Its provenance is something of a mystery, though.

  • Microsoft Resumes Rerelease of Windows 10 Version 1809

    Microsoft on Wednesday once more resumed its general rollout of the Windows 10 version 1809 upgrade, also known as the "October 2018 Update."

  • Microsoft Ups Its Windows 10 App Compatibility Assurances

    Microsoft gave assurances this week that organizations adopting Windows 10 likely won't face application compatibility issues.

  • SharePoint Online Users To Get 'Modern' UI Push in April

    Microsoft plans to alter some of the tenant-level blocking capabilities that may have been set up by organizations and deliver its so-called "modern" user interface (UI) to Lists and Libraries for SharePoint Online users, starting in April.

comments powered by Disqus
Most   Popular

Office 365 Watch

Sign up for our newsletter.

Terms and Privacy Policy consent

I agree to this site's Privacy Policy.