July 21st, 2024

CrowdStrike IT Outage Explained by a Windows Developer [video]

The YouTube video discusses Crowd Strike causing blue screens on Windows PCs, stressing understanding the software, kernel driver issues, recent disruptive updates, and solutions based on Microsoft's crash management history. It emphasizes stress testing, debugging, and driver certification.

Read original articleLink Icon
CrowdStrike IT Outage Explained by a Windows Developer [video]

The YouTube video delves into the Crowd Strike problem triggering blue screens on Windows computers. It emphasizes the significance of comprehending Crowd Strike software, the repercussions of a kernel driver malfunction, and the factors leading to the recent disruptive software update. Additionally, it draws on past Microsoft crash management experiences from the 1990s to provide solutions for addressing the issue. The video highlights stress testing and machine debugging, underscoring the importance of kernel mode and WHQL certification for drivers.

Link Icon 4 comments
By @IntelMiner - 7 months
The title should probably say "former" Windows developer. Dave hasn't worked at Microsoft in over 20 years (he left in 2003)

It's probably also worth noting he's a serial liar who claims he wrote Space Cadet Pinball [1] and that "Linux has binary blobs that only Linus Torvalds has the source code to [2]

Also there was that time he got sued by the Washington State Attorney General for selling "Registry Cleaner" and "Internet Shield" junk apps [3]

I don't consider Dave a credible source for much of anything and the amount of clickbait he pumps out for Youtube in recent years has only solidified that opinion

[1] https://social.restless.systems/system/media_attachments/fil...

[2] https://social.restless.systems/system/media_attachments/fil...

[3] https://www.atg.wa.gov/news/news-releases/attorney-general-s...

By @dallas - 7 months
The "all this debugging was done in assembly language with minimal symbol table information" was basically true in 2009-2011 too. The (non-CrowdStrike, non-Microsoft) team I was on was developing Windows intermediate drivers which did network acceleration. I'm not sure how CrowdStrike works but we essentially MITM'd/proxied in the Windows networking stack (is CrowdStrike observe-only? I don't know). I would end up filling notebooks with register moves and subroutine calls to trace back bluescreens because Windows is closed source. Thank goodness for Windbag disassembly. Interop with other intermediate drivers like popular virus scanners was an interesting problem. I'm pretty proud of our work there in hindsight!
By @nitinreddy88 - 7 months
11:00 mark nailed it pretty much how many engineers here doesn't simply understand and blames Microsoft. I refrained from commenting earlier because people don't see these during chaos mode.
By @limejuice - 7 months
I want to know more about why the update was pushed to every single machine online at that time, ignoring CS sensor update policies that admins use to update test network first, before updating production.