Microsoft Confirms CrowdStrike Outage Root Cause, Outlines Plans to Improve Reliability

Microsoft has confirmed CrowdStrike’s analysis of the root cause of the July 19 global Windows outage – and outlined plans to work with anti-malware vendors to help prevent similar events in the future.

In a July 27 blog post by David Weston, Microsoft’s VP for Enterprise and OS Security, the software giant outlined four initiatives for helping anti-malware vendors roll out updates more safely:

Offering rollout guidance, best practices, and technologies “to make it safer to perform updates to security products.”
Reducing the need for kernel drivers to access security data.
Providing improved isolation and anti-tampering capabilities with technologies like Virtualization-based security (VBS) enclaves.
Enabling zero trust approaches like high integrity attestation, which can determine the security state of a machine based on the health of Windows native security features.
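
As an illustration of what safer rollout practices can look like, the following minimal sketch pushes an update through successive groups of machines and stops as soon as one group reports too many crashes. It is not taken from Microsoft's or CrowdStrike's tooling: the `Ring` class, the `staged_rollout` function, the ring names, and the crash-rate threshold are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Ring:
    name: str
    hosts: int     # number of machines in this ring
    crashes: int   # crash reports observed after the update reached the ring


def staged_rollout(rings, max_crash_rate=0.001):
    """Return (names of rings that received the update, halted flag).

    Rings are updated in order. The rollout stops at the first ring whose
    observed crash rate exceeds max_crash_rate, so later rings never
    receive the update.
    """
    updated = []
    for ring in rings:
        updated.append(ring.name)
        crash_rate = ring.crashes / ring.hosts if ring.hosts else 0.0
        if crash_rate > max_crash_rate:
            return updated, True   # halt: remaining rings keep the old version
    return updated, False


# Hypothetical numbers, chosen only to show the halt behaviour.
rings = [
    Ring("internal", hosts=500, crashes=0),
    Ring("early adopters", hosts=10_000, crashes=4_000),
    Ring("general", hosts=1_000_000, crashes=0),
]
updated, halted = staged_rollout(rings)
print(updated, halted)
```

In this sketch the "general" ring is never touched, because the second ring already exceeds the threshold.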

The Microsoft CrowdStrike response also emphasizes support for the Rust memory-safe programming language as a way “for security tools to detect and respond to emerging threats safely and securely.”

Weston’s blog post is the latest post-mortem on the faulty CrowdStrike update that brought down 8.5 million Windows machines around the world in what was possibly the largest cyber incident of all time.

Microsoft Confirms CrowdStrike Root Cause

Weston’s blog post confirms CrowdStrike’s version of the causes of the global “blue screen of death” outage before getting into Microsoft’s plans for making updates safer.

“Our observations confirm CrowdStrike’s analysis that this was a read-out-of-bounds memory safety error in the CrowdStrike developed CSagent.sys driver,” Weston wrote. Such errors “can lead to widespread availability issues when not combined with safe deployment practices.”
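
To make the term concrete: an out-of-bounds read happens when code accesses an array position that does not exist. The sketch below shows the kind of range check that prevents it. It is only an analogy, not CrowdStrike's code. Python raises an exception on a bad index instead of reading stray memory the way kernel-mode C code can, and the `read_field` helper and `fields` list are hypothetical.

```python
def read_field(fields, index):
    """Return fields[index], or None if index falls outside the list.

    Hypothetical helper: the explicit range check stands in for the
    validation a driver needs before reading from an array.
    """
    if 0 <= index < len(fields):
        return fields[index]
    return None


fields = ["a", "b", "c"]
print(read_field(fields, 2))   # last valid index
print(read_field(fields, 3))   # one past the end: rejected, not read
```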

He said csagent.sys is a file system filter driver used by anti-malware agents to receive notifications about file operations such as the creation or modification of a file, useful for scanning downloads and other new files.

File system filters can also be used as a signal for monitoring system behavior. “CrowdStrike noted in their blog that part of their content update was changing the sensor’s logic relating to data around named pipe creation,” he wrote. “The File System filter driver API allows the driver to receive a call when named pipe activity (e.g., named pipe creation) occurs on the system that could enable the detection of malicious behavior.”

Kernel Usage Important But Not Always Necessary

Microsoft generally defended the practice of using kernel drivers for their ability to provide system-wide visibility, to load early to detect threats like boot kits and rootkits, which can load before user-mode applications, and to monitor for events like file creation, deletion, or modification. Weston said kernel activity can also trigger callbacks that let drivers decide when to block activities like file or process creation, and many vendors use drivers to collect network information in the kernel using the NDIS driver class.

Microsoft noted tamper resistance and performance benefits too, but added, “There are many scenarios where data collection and analysis can be optimized for operation outside of kernel mode and Microsoft continues to partner with the ecosystem to improve performance and provide best practices to achieve parity outside of kernel mode.”

“It is possible today for security tools to balance security and reliability,” Weston wrote. Security vendors can use “minimal sensors” that run in kernel mode for data collection and enforcement, limiting exposure to availability issues.

Other key product functionality – managing updates, parsing content, and other operations – “can occur isolated within user mode where recoverability is possible. This demonstrates the best practice of minimizing kernel usage while still maintaining a robust security posture and strong visibility.” He included this image on where those functions might run:

Windows security: Kernel-mode and user-mode functionality
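
The recoverability point can be sketched in a few lines. In the hypothetical example below, an update is parsed in ordinary user-mode code, and a malformed update is caught so the caller keeps running with the last known good rules. The `load_content` function and the JSON format with a `rules` list are assumptions made for illustration, not any vendor's actual update format.

```python
import json


def load_content(new_blob, last_good):
    """Parse an update; fall back to last_good if the update is malformed.

    Hypothetical format: the update is JSON with a "rules" list. Returns
    (rules in effect, whether the new update was accepted).
    """
    try:
        parsed = json.loads(new_blob)
        rules = parsed["rules"]
        if not isinstance(rules, list):
            raise ValueError("rules must be a list")
        return rules, True
    except (ValueError, KeyError, TypeError):
        # A bad update is rejected here instead of taking the process down.
        return last_good, False


print(load_content('{"rules": ["r2"]}', ["r1"]))   # well-formed update
print(load_content('{"rules": ', ["r1"]))          # truncated update
```

The same failure inside a kernel driver has no such fallback, which is the contrast Weston draws.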

Best Practices for Windows Security and Stability

Weston also mentioned a number of best practices that can improve Windows security and availability, with App Control for Business and VBS memory integrity two of the more noteworthy ones.

App Control for Business (formerly Windows Defender Application Control) can be used to allow only trusted and business-critical apps. “Your policy can be crafted to deterministically and durably prevent nearly all malware and ‘living off the land’ style attacks. It can also specify which kernel drivers are allowed by your organization to durably guarantee that only those drivers will load on your managed endpoints.”
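
The idea behind an allow list can be shown with a short sketch: a driver loads only if a hash of its contents appears in a set the organization has approved. This does not reproduce the App Control for Business policy format or its rule types. The `is_driver_allowed` function and the sample byte strings are hypothetical.

```python
import hashlib


def is_driver_allowed(driver_bytes, allowed_hashes):
    """Return True only if the driver's SHA-256 digest is on the allow list."""
    digest = hashlib.sha256(driver_bytes).hexdigest()
    return digest in allowed_hashes


# Hypothetical driver images; a real check would hash the file on disk.
trusted = b"trusted driver image"
allowed = {hashlib.sha256(trusted).hexdigest()}

print(is_driver_allowed(trusted, allowed))
print(is_driver_allowed(b"unknown driver image", allowed))
```

Anything not explicitly listed is refused by default, which is what makes the guarantee durable.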

VBS offers memory integrity with a specific allow list policy to further protect the Windows kernel. “Combined with App Control for Business, memory integrity can reduce the attack surface for kernel malware or boot kits,” Weston wrote. “This can also be used to limit any drivers that might impact reliability on systems.”

Running as Standard User and using Device Health Attestation (DHA) are other important controls.

Microsoft CrowdStrike Response Could Involve MVI

Microsoft engages with third-party security vendors through the Microsoft Virus Initiative (MVI) “to define reliable extension points and platform improvements, as well as share information about how to best protect our customers.”

Presumably MVI will be involved in efforts to improve Windows reliability and availability in the wake of the CrowdStrike outage.
