CrowdStrike's impact on aviation

feyman_r · 2024-07-29T20:12:14.000000Z

>> Why were other airlines able to get back to normal so much faster than Delta?

I read somewhere that their crew tracking software was hit hard and took time to recover. Will look for source on that.

(Edited) source: https://news.delta.com/update-delta-customers-ceo-ed-bastian

“… and in particular one of our crew tracking-related tools was affected and unable to effectively process the unprecedented number of changes triggered by the system shutdown…”

crazytony · 2024-07-29T22:40:00.000000Z

One other compounding problem is that Delta's headquarters and main traffic patterns are on the east coast. Crowdstrike affected all the airlines at roughly the same time. This gave them roughly one to two fewer hours to respond before they hit their morning peak flights.

As someone else pointed out, they probably weren't ready by the time they needed their systems for the morning rush so they went to their business continuity strategy (manual). This has a throughput and recovery time penalty and obviously it compounds the longer they are in that mode.

I think what we're finding with the Southwest meltdown and now the Delta meltdown is that the big airlines just don't have the manpower or scheduling slack to accommodate going into business continuity. I do think this should be investigated. Hopefully financial penalties incentivize action but time will tell.

katbyte · 2024-07-30T04:23:03.000000Z

They prioritized stock buy backs instead of investing in a robust it operation

shiroiushi · 2024-07-30T04:25:09.000000Z

As well they should!

Which one profits the CEO more? Stock buy-backs or robust IT? Robust IT is only good for the company in the long term; however, with stock buy-backs or other skimping on IT, if disaster like this happens, the CEO just takes his golden parachute and leaves, but if no disaster happens, he gets a huge bonus to buy another private yacht.

throwaway2037 · 2024-07-30T00:57:29.000000Z

    > big airlines just don't have the manpower or scheduling slack to accommodate going into business continuity

Do small airlines have it? And, how much higher are you willing to pay in ticket prices to have this ability?

WalterBright · 2024-07-30T05:06:50.000000Z

> Hopefully financial penalties incentivize action

Delta already took a huge financial hit for this.

smileysteve · 2024-07-29T20:38:51.000000Z

Re Delta

It's not so much a severity as "hard"; but with the hub and spoke model that Delta uses, scheduling being down (at all on Friday), combined with FAA hour limits. It becomes exponentially difficult to reschedule flights.

Put more plainly, on Friday, your scheduling software is down for 4 hours in the morning, so you "borrow" any replacements you need for employees that are late or sick. This ruins the availability for the next flights, at which time you hope the system is up again; but if it's not, you borrow from the evening flights. Combine this with each flight that was late/cancelled as you were hoping to fill now affects the hours available for the employees that were available. Finally, as you've cascaded this, you head into a weekend trying to catalog how many hours each crew member did or did not log, and you're not sure how to get them back in time.

inferiorhuman · 2024-07-29T21:02:56.000000Z

Except for Southwest the other legacy airlines (United, American) also use a hub and spoke model. So does jetBlue.

crazytony · 2024-07-29T22:22:51.000000Z

Funny you should mention WN. Delta's meltdown is the exact same scenario as Southwest. Crew scheduling is messed up, they don't have a way of tracking where employees are, if the employee is legal, etc and so the operation grinds to a halt

rconti · 2024-07-30T05:10:03.000000Z

To clarify, Southwest's meltdown last year, which was all about the difficulties of crew scheduling and the knock-on effects of same.

sidewndr46 · 2024-07-29T22:09:40.000000Z

wouldn't this imply either an upper bound on down time (airline simply folds as it never catches up) or an upper bound on the duration of the impact ?

toast0 · 2024-07-30T00:58:17.000000Z

Worst case, with good weather, you can stop service for a few days: day 1 mandatory rest; day 2 fly crews to where they need to be to start service; day 3 mandatory rest; day 4 return to service. Then start rebooking passengers and picking up the pieces. Carriers with long haul international may need longer, and maybe you need more rest days to ensure everyone is ready for their normal shift, but that's a reasonable napkin estimate.

Otoh, Delta seemed to have recovered after about a week, and canceled about 1,000 out of about 4,000 flights for several days. It's way better to fly 75% of the daily flights than not. There's less wiggle room in a summer schedule for weather, but there's still some wiggle room.

Someone1234 · 2024-07-30T03:17:33.000000Z

They did a "reset." Cancel enough flights to reduce load, then manually recalibrate the crew tracking software to figure out where everyone is and their hours. Then start operations again.

rconti · 2024-07-30T05:11:13.000000Z

It's like stopping your in-place manual software recovery efforts and restoring from backup. You KNOW it's going to take a massive amount of time, but at least you know how much time it's expected to take, and what the expected result is, rather than "2 more hours... 2 more hours.... 2 more hours.." for a week.

brendoelfrendo · 2024-07-30T02:24:37.000000Z

There probably is an upper bound on down time, by which point the business has suffered some irreparable harm. It might not result in the business simply folding, but might result in significant expense or legal complications, long-term reputational damage, etc. In business continuity speak, that's the "maximum tolerable downtime," and while I don't know how Delta defines it for the impacted systems... I imagine they're not happy with how long they were down.

reaperducer · 2024-07-29T20:46:34.000000Z

>> Why were other airlines able to get back to normal so much faster than Delta?

I read somewhere that their crew tracking software was hit hard and took time to recover. Will look for source on that.

I heard on the radio (maybe NPR, not sure) it wasn't about the computers, it was about Delta's response.

According to the report, the other airlines delayed flights, while Delta cancelled them outright. That left Delta with more people and planes in the wrong places, making it harder to recover.

Onavo · 2024-07-29T20:13:32.000000Z

Because they used Windows 3.1

shagie · 2024-07-29T20:29:47.000000Z

I chased through this chain the other day...

https://www.tomshardware.com/software/windows/windows-31-sav...

https://www.forbes.com/sites/tedreed/2024/07/20/meltdown-wha...

> A story on the website govtech.com on Friday asked the question, “Why isn’t Southwest affected by the CrowdStrike/Microsoft outage?

> “That’s because major portions of the airline’s computer systems are still using Windows 3.1, a 32-year-old version of Microsoft’s computer operating software,” the website said. “It’s so old that the CrowdStrike issue doesn’t affect it so Southwest is still operating as normal. It’s typically not a good idea to wait so long to update, but in this one instance Southwest has done itself a favor.”

The govetech.com article is https://www.govtech.com/question-of-the-day/why-isnt-southwe...

which linked to https://www.digitaltrends.com/computing/southwest-cloudstrik...

which linked to an earlier Forbes article - https://www.forbes.com/sites/hershshefrin/2022/12/31/can-sou...

> The December 2022 scheduling fiasco was the result of skimping on information technology. I am old enough to remember when Microsoft introduced a new operating system called Windows 95, to replace its predecessor operating system Windows 3.1. The 95 in Windows 95 refers to the year of its introduction: 1995. By some accounts, major portions of Southwest’s scheduling system for pilots and flight attendants is built on the Windows 95 platform. That platform is now more than 25 years old.

JumpCrisscross · 2024-07-29T20:36:49.000000Z

Southwest does not run Windows 3.1:

“That’s it. That’s where all these stories can trace their origin to. These few paragraphs do not say that Southwest is still using ancient Windows versions; it just states that the systems they developed internally, SkySolver and Crew Web Access, look ‘historic like they were designed on Windows 95’.”

https://www.osnews.com/story/140301/no-southwest-airlines-is...

shagie · 2024-07-29T20:40:49.000000Z

The other day, I saw a screen capture from Tom's Hardware and so chased the series of links and quotes to try to find the earliest one that had reporting on it that was the source. That was the chain that I found.

I am not claiming that they run Windows 3.1 or Windows 95 ... but rather "this is where that story was sourced from" because everyone kept linking to somewhere else. The relevant XKCD is https://xkcd.com/978/

Modified3019 · 2024-07-29T20:49:39.000000Z

Funny enough, this cycle is close to what the Russian disinformation machine does deliberately to spread bullshit.

starspangled · 2024-07-30T01:22:59.000000Z

Is that actually true, or just something that's repeated until people believe it?

computerfriend · 2024-07-30T02:56:40.000000Z

https://en.m.wikipedia.org/wiki/Information_laundering

starspangled · 2024-07-30T02:59:47.000000Z

Yes, is there some evidence beyond the claims of "intelligence officials"?

tadfisher · 2024-07-30T03:43:08.000000Z

Also https://en.wikipedia.org/wiki/Woozle_effect

dave4420 · 2024-07-30T11:30:36.000000Z

I see what you did there.

red-iron-pine · 2024-07-30T18:22:49.000000Z

Russian approaches are well known and documented. None of this is new, and wasn't even really that new in 2016, it's just become better known.

Essentially modern versions of Soviet-style disinformation campaigns, but augmented with new technology (social media), and without the ideological hindrances of a Communist government (e.g. sell hard to both Right and Left).

RAND Corp calls it "the Russian Firehose" model: https://www.rand.org/pubs/perspectives/PE198.html

Similar approaches are also used by NK, Indian, Chinese, and other national-tier disinfo campaigns. This contrasts with models used by the West, which are often less about creating a disinformation clusterfuck, and more of a "watch our Disney / BBC / Scandinavian TV & movies and their implied messages about freedom and human rights and shit".

starspangled · 2024-08-02T04:57:37.000000Z

Not too sure they are. The "experts" insisted that Russia colluded with Trump to "hack the election", that they somehow faked or planted or were responsible for various laptop and email leaks, etc., all such things which have since been found to be false or at best no real evidence has ever been produced in support of.

Obiously Russian, Chinese, and all other governments engage in information campaigns, and obviously the US government knows a lot about what they are. But we the public does not necessarily have the same information. It's not really possible to distinguish the "well known and documented (by the military and espionage industrial complex)" operation of foreign countries from domestic propaganda developed by those corporations and agencies to influence their own citizens.

ZeWaka · 2024-07-29T20:24:34.000000Z

In the article it says Southwest used 3.1, not Delta (though, that's apparently incorrect according to other posters).

Someone1234 · 2024-07-29T20:28:30.000000Z

And Southwest had two crew-management outages in 2022[0], so let's not sing their praises for escaping the CrowdStrike disruption. Southwest has been widely critized for under-investment in technology, Delta on the other hand purchased one of the best security products on the market and that backfired.

[0] https://en.wikipedia.org/wiki/2022_Southwest_Airlines_schedu...

chgs · 2024-07-29T21:07:42.000000Z

Delta put all their eggs in one basket and had no DR capability

Someone1234 · 2024-07-29T21:40:41.000000Z

What basis do you have for saying that? It is likely their DR was running on a mirror of their production systems, and was similarly impacted by the Crowdstrike outage. So they fell back to Windows Servers similarly stuck in a boot-loop.

Keep in mind there was no way to opt out or delay CS Channel updates.

chgs · 2024-07-29T22:35:04.000000Z

If your DR system is susceptible to the same faults as your main system it’s not a DR system.

It would be like claiming raid1 is a backup.

TheDong · 2024-07-29T23:01:15.000000Z

Or it would be like claiming my backup isn’t a backup because both systems run openssh, so a remote code execution vuln there could take down both systems.

Any DR system will have to accept some risks, and those don’t necessarily invalidate it in general, just make it insufficient for some scenarios.

Conversely, if they ran the main system on windows with crowdstrike and the DR one on poorly configured linux with no security software, they probably would have needed more sysadmins, had more trouble maintaining software for both, and been vulnerable to risk from both linux and windows bugs, so I feel like they made the right tradeoff in general.

I’m sure you, who can deride this DR system, have devised your own system such that it is resilient to a meteor destroying the earth.

shagie · 2024-07-30T00:29:43.000000Z

> I’m sure you, who can deride this DR system, have devised your own system such that it is resilient to a meteor destroying the earth.

That reminds me one of Corey Quinn's comfortable AWS truths.

https://x.com/QuinnyPig/status/1173371749808783360

> If your DR plan assumes us-east-1 dies unrecoverably, what you're really planning for is 100 square miles of Northern Virginia no longer existing. Good luck with that ad farm in a nuclear wasteland, buddy!

dredmorbius · 2024-07-30T10:55:18.000000Z

As HN itself discovered a couple of years ago when a set of same-manufacturer, same-batch disks within both RAID arrays and backup server failed within a few hours of one another:

<https://news.ycombinator.com/item?id=32048148>

<https://news.ycombinator.com/item?id=32031243>

amluto · 2024-07-30T00:46:28.000000Z

One idea: build a DR system and turn it off. Ideally it would be cloneable, but even without that ability, one could test it every few months to make sure it boots adequately quickly and then turn it back off. The attack surface of a bunch of computers or instances that are powered down is pretty low.

compiler-guy · 2024-07-30T03:11:52.000000Z

Better yet, alternate between them every month or two.

freeopinion · 2024-07-29T22:41:56.000000Z

> Keep in mind there was no way to opt out or delay CS Channel updates.

Do CS updates somehow work over airgaps? You know, the kind that production systems have to prevent any access to or from external networks? Well... some production systems anyway.

nradov · 2024-07-30T00:04:00.000000Z

What's your point? An air gapped disaster recovery system would be useless. An airline operations application has to connect to a bunch of other external systems to be of any use.

shiroiushi · 2024-07-30T00:02:37.000000Z

>Delta on the other hand purchased one of the best security products on the market and that backfired.

It looks like it wasn't a good security product after all...

Zigurd · 2024-07-29T20:34:12.000000Z

I would like to know if a solid, up to date, well-rehearsed disaster recovery plan saved anyone's butt, or if we're all just raw dogging our machines whether IT is paying for backup and recovery or not?

ta1243 · 2024-07-29T20:37:04.000000Z

Our systems worked fine, we expect things to fail - including software like sentinal one, crowdstrike, etc, and have DR systems which can keep us limping along. We have DR systems which will work should other things happen - say the Thames barrier fails (i.e. no docklands)

Unfortunately some of our outsourced suppliers didn't have such attitudes.

red-iron-pine · 2024-07-30T18:49:12.000000Z

Sure has. HSRP and VRRP plus other SD-WAN features definitely made a difference when one of our sites had the fiber pulled by accident. data center tech screwed up bigly and took us plus at least one other of their customers down.

definitely saw a blip and stuff had issues for 10 minutes, e.g. pages timed out or had to restart a process, but generally sites failed over and were able to keep limping on while we did triage.

got something like a $19 ($21?) service credit and an apology from the data center. our CEO shouted a lot and threatened lawsuits but it never went anywhere. Director of IT Infra quietly thanked all of us for having failover that mostly worked.

berniedurfee · 2024-07-30T16:51:19.000000Z

They certainly have DR infrastructure primed and ready to go… with Crowdstrike pre-installed on every DR server.

paulddraper · 2024-07-30T15:05:27.000000Z

I've never seen it.

Obviously some selection bias there, but I'd love to hear some success stories.

Zigurd · 2024-07-29T20:36:15.000000Z

I see just moments after I posted, someone posted this: https://news.ycombinator.com/item?id=41103486

So, yeah, lack of DR is why Delta was so screwed.

pimlottc · 2024-07-30T04:46:46.000000Z

One thing I don't understand from these graphs - why was there a relative uptick in takeoffs starting a short time /before/ the CrowdStrike update was pushed? It's in the overall graph, as well as the graphs for United, American, and especially Delta. I can't think of any reason for this, maybe it's just random noise, or maybe there was something unusual about the previous week at the same time?

jjwiseman · 2024-07-30T19:36:22.000000Z

One reason I put charts for both absolute numbers of flights and percentage change is to help understand the larger context. Those relative upticks happened just before CrowdStrike hit, so around midnight Eastern time, when traffic was already very low. So a 25% increase might be 12 extra flights taking off in the U.S. in an hour. There's plenty of noise, for sure, and looking at absolute numbers and percent change together can help give you a sense of what was going on. Looking at two days worth of data is probably enough to give you the main themes of the CrowdStrike impact, but not enough to explain every variation.

mike_hearn · 2024-07-30T08:24:04.000000Z

It was widely reported to be the busiest travel day for quite a long time, which compounded problems.

account42 · 2024-07-30T08:32:18.000000Z

Yeah, shouldn't have been too hard to add a couple more weeks so you at least get an idea about variance.

jandrusk · 2024-07-30T15:19:05.000000Z

Will be most interesting how this lawsuit by Delta plays out against Microsoft & Crowdstrike:

https://www.marketwatch.com/story/delta-hires-law-firm-seeki...

rdtsc · 2024-07-29T20:34:07.000000Z

From the included link: https://www.techradar.com/pro/security/southwest-airlines-av...

> To give you an idea of just how outdated this operating system is, Windows 3.1 was originally launched in 1992, and Microsoft ended support for it on December 31, 2001, except for the embedded version, which was officially retired in 2008.

I keep hearing the Windows 3.1 story repeated. I mean here it comes from TechRadar and even has the "Pro" in the name, they can't possibly make stuff up, right? But still don't quite believe it.

Can anyone working at Southwest confirm that their main scheduling system is running on Windows 3.1?

JumpCrisscross · 2024-07-29T20:38:02.000000Z

> keep hearing the Windows 3.1 story repeated

It’s wrong [1] and serves as a litmus test for whether an outlet independently verifies its claims.

(“The systems [Southwest] developed internally, SkySolver and Crew Web Access, look ‘historic like they were designed on Windows 95’.” That got mangled into they run 3.1.)

[1] https://www.osnews.com/story/140301/no-southwest-airlines-is...

xp84 · 2024-07-29T21:12:04.000000Z

Wow, that’s even more frustrating considering it’s conflating an unfashionable UI (which I’d argue is a good thing, since all modern UI trends are towards slick, minimalism-worshiping messes which hide everything from users) and old, provably-flawed technological foundations (like a 16-bit system without things like filesystem access control or memory protection).

I knew this story was false immediately though because no company ever even in 1993 had production server systems which ran a desktop OS like Win 3.1. It just wasn’t up to the task. They would have used NT if anything.

btown · 2024-07-29T21:44:29.000000Z

http://www3.alpa.org/LinkClick.aspx?fileticket=IO7kd%2Bfm2Do... shows the system as of 2020. To the parent’s point, it’s actually quite a reasonable UX, with colored outputs, filter banks, and just enough abbreviations and whitespace to balance density with intuitiveness.

But that doesn’t mean this is the only modern design system that meets those requirements. And conflating all modern UI with consumer design trends is an equally frustratingly broad statement.

qingcharles · 2024-07-29T22:14:17.000000Z

OK, this is definitely unfashionable looking if your main exposure to apps is the latest doodah on your phone that was literally updated yesterday.

Very standard looking legacy Win32 looking app. Which, admittedly, would have probably look very similar had it been on Windows 3, but is probably running on LTSC Windows 10 or something in reality.

numpad0 · 2024-07-30T06:35:24.000000Z

Doesn't look Microsoft at all to me, just colored to mimic XP. Java on some Unix?

mjevans · 2024-07-30T15:08:34.000000Z

Page 7 (as labeled) of the slides. The tabs and checkboxes layout have a distinctly Win 9x era look/feel. I do agree that it's missing an obvious menu, and the theme for the window decorations reminds me of win 3.1, but that was probably an option for software of that era just as it is in this if someone pushes hard enough.

goodcanadian · 2024-07-30T09:37:31.000000Z

Perhaps you just aren't old enough? It looks very Windows 95 to me.

stoltzmann · 2024-07-30T14:30:13.000000Z

Age has nothing to do with it, the interface just doesn't look like Windows 95.

The button shapes, minimize/close window buttons, the titlebar are all looking wrong for Windows 95.

It looks significantly more like Swing, but then the buttons don't match that either.

Shorel · 2024-07-30T06:10:46.000000Z

It looks like every single hospital or car rental software I have managed to peek.

It's not old-fashioned, it is _timeless_ B)

veggieroll · 2024-07-29T22:09:39.000000Z

Link worked for me but took a long time to load. It just seems like their server is overloaded.

quotemstr · 2024-07-29T22:02:18.000000Z

Broken link

cjbprime · 2024-07-29T21:39:54.000000Z

Windows 95 is an "unfashionable" OS which has not received any security updates since 2001.

andrewxdiamond · 2024-07-29T21:46:17.000000Z

Yes and the fact that my software’s UI looks like Windows 95 makes it vulnerable to all the same security vulnerabilities.

/s

The systems don’t run on W95, they look like W95

spookie · 2024-07-29T21:27:24.000000Z

Being blasted by media for running your own software, incredible. As others have commented, just a single tweet was enough to propagate this story. Quite concerning how easy it is to fake reality nowadays.

madeofpalk · 2024-07-29T21:32:14.000000Z

This is the same as the "Olympic cardboard beds are anti-sex" fake story that persisted. Anyone who publishes it demonstrates they don't actually research.

jjwiseman · 2024-07-29T20:48:04.000000Z

Thanks, I updated the post.

kragen · 2024-07-29T21:21:48.000000Z

i miss the lemonodor blog

zitterbewegung · 2024-07-29T22:02:29.000000Z

I know this is a hot take but companies have to figure out if modernization of a UI will be worth it to retrain everyone in the new UI. Many people were involved with its creation and maintenance and due to its age the UI may have a large amount of glue code that can't be separated unless you build an API around the other software. Especially if there is some kind of change in the system that moving off the old one is meaningless. Southwest is also making changes to their operations so they probably might be in maintenance mode for the software especially when the outage of their current software was done since they will have to not have anyone choose any seat at this time. [1]

[1] https://www.cnn.com/2024/07/25/investing/southwest-airlines-...

stavros · 2024-07-29T22:50:21.000000Z

I don't know, I like the classic Windows UI. I don't think modern UIs are an improvement on that.

suzzer99 · 2024-07-29T23:16:04.000000Z

No no no. We must now have floating headers that don't give any indication they belong to the columns below them, much less that you can click them to sort the columns. 95% of possible actions must only appear when hovered over. Buttons should not look like buttons, nor should they provide any feedback that they've actually been clicked. Etc.

xarope · 2024-07-30T03:38:19.000000Z

to be fair, some of the java-era software with their default toolkits do look very windows 3.1/95'ish (all that blue and teal)

dsr_ · 2024-07-29T20:40:38.000000Z

Tech Radar quotes Tom's Hardware; Tom's Hardware quotes a tweet.

Not a tweet from Southwest, mind you. Not even a tweet from someone who says that they used to work for Southwest. Just... a tweet.

shombaboor · 2024-07-29T20:50:51.000000Z

I just wish there was some type of identifiable credit / penalty system for writing accurately as a news source. And this would include quotes / retweets. Never been a better time to be wrong about everything.

JumpCrisscross · 2024-07-29T20:52:15.000000Z

> wish there was some type of identifiable credit / penalty system for writing accurately as a news source

Good starting point is if the news is free. A shocking fraction of people get their news from solely free sources.

mewpmewp2 · 2024-07-29T21:09:31.000000Z

And why would someone put in effort for free?

cgriswald · 2024-07-30T00:43:02.000000Z

This is a misunderstanding of the problem. Effort is made in both cases. In one case effort is made to find verifiable truth as a service. In the other effort is made to provide eyeballs to advertisers.

kspacewalk2 · 2024-07-29T21:47:37.000000Z

What's "solely free"? Does the ad-driven model count as free? Why do you think an outlet that works for you will necessarily deliver better quality news that the one that works for advertisers? There are obvious bias downsides to both.

sxg · 2024-07-29T22:03:49.000000Z

The ad-driven model does count as free, and it's far less likely to deliver better quality news than a subscription service users pay for. The core metric for ad-driven news sites is maximizing views—it doesn't matter how you get views as long as you get them. This means free sites are heavily incentivized to be the first to break a news story even if the details are wrong or sparse. Sure, they'll issue corrections and updates later, but only a small percentage of the initial viewers will ever see these, and there's essentially zero cost for having made the mistake.

The core metric for subscription news sites is minimizing churn. A mistake will cost a subscription site subscribers who have a massive lifetime value. These sites are heavily incentivized to report high quality, accurate news even if they're not the first to break the story.

JumpCrisscross · 2024-07-30T00:00:53.000000Z

> What's "solely free"? Does the ad-driven model count as free?

Yes, in this context.

> Why do you think an outlet that works for you will necessarily deliver better quality news that the one that works for advertisers?

I can’t explain the mechanics precisely. But it’s pretty clear when I compare my subscription and non-subscription sources where the quality lies.

torginus · 2024-07-30T10:06:52.000000Z

Yet paying for news is a very weak guarantee of not being fed propaganda/inaccurate reporting.

If we held food safety to the same standards as paid news sources are held, people would get salmonella once a week.

Analemma_ · 2024-07-29T21:19:28.000000Z

Your cure is worse than the disease. The second such a system existed, it would be gamed to hell and back, and nobody would believe it anyway since they'd all angrily insist that "you shouldn't have counted X" or "you should've counted Y more" and it would just turn into a war over who got to control the system and use it to deplatform their enemies.

andrewflnr · 2024-07-29T21:57:46.000000Z

It doesn't have to, and indeed shouldn't, be a single system. We'd rather have a handful of independent news checker orgs, maybe some topic-specific ones. Funding remains an exercise for the reader.

treflop · 2024-07-29T21:57:10.000000Z

There just isn’t. You just have to read enough of one source to determine your own opinion.

Just like with anyone you meet: you are the judge if they are trustworthy, nice, mean, funny, etc.

That said, I think tech journalism is the bottom of the barrel. I just feel like they focus more on tech than journalism.

shombaboor · 2024-07-30T02:02:27.000000Z

the cost of producing bs is too low, back in the day it would at least require time and money to print / distribute.

mardifoufs · 2024-07-29T22:49:34.000000Z

Community notes on twitter is the closest thing to what you're describing I've seen yet. It's been very helpful too imo

thereddaikon · 2024-07-29T20:57:03.000000Z

A great example of why people don't trust journalists anymore. They don't even perform a basic amount of fact checking before publishing.

colechristensen · 2024-07-29T21:12:31.000000Z

Articles from the likes of Tech Radar or Toms Hardware I would trust to a higher standard than a random tweet, but really I wouldn't label them as "real journalists"

I question the ethics and standards of the New York Times at least a little at this point so it's not like great journalism is common.

torginus · 2024-07-30T10:10:41.000000Z

Also, there is the effect of a lie oft repeated becoming the truth - the times I've seen small outlets writing nonsense, and having it picked up by progressively bigger papers citing the smaller ones as credible sources is too much to count.

Generally there is a chain of trust in news publishing that goes nowhere and there's nothing we can do about it, as more often than not, someone credible repeats the hearsay nonsense down the line, at which point they count as a primary source.

So much of news publishing I would describe as not even wrong.

ThrowawayTestr · 2024-07-30T02:39:55.000000Z

People don't pay for news anymore so we get what we pay for.

shiroiushi · 2024-07-30T04:28:52.000000Z

People never paid for news really. If you're thinking of the days when you had to pay 25 cents for a newspaper at the convenience store, that didn't come even close to the cost of running a newspaper in those days. Your quarter only covered (maybe) the cost of the paper and printing it. These days, we don't need paper, and running a web service is probably cheaper per-reader than physical paper.

Newspapers got the bulk of their funding from advertising back then, just as they do now.

ThrowawayTestr · 2024-07-30T17:37:28.000000Z

The important thing is you were able to justify not paying for stuff.

jxy · 2024-07-29T21:25:03.000000Z

I don't trust any kind of generalization like this, which only serves further disinformation and misinformation.

There are bad journalists (if they can be called journalists at all) and good journalists. At this point in history, our only hope lies with diligent reporters from reputable publishers.

Dalewyn · 2024-07-29T22:39:14.000000Z

[flagged]

ThrowawayTestr · 2024-07-30T02:40:56.000000Z

When's the last time you paid for a newspaper or a subscription to a newspaper?

Bud · 2024-07-29T21:10:12.000000Z

It's unfair to pretend that all journalists have the same level of professionalism (or lack thereof) with regard to sourcing.

They don't.

Terr_ · 2024-07-29T21:12:09.000000Z

It's kind of depressing to think that we have had this world-spanning system of knowledge and "hyperlinks" for decades now, individual pieces that should've enabled an easy chain of attribution/citation...

Y_Y · 2024-07-29T22:09:52.000000Z

And encourage the reader to move away from your site‽ No self respecting PHB could condone such a thing.

nostromo · 2024-07-29T22:24:37.000000Z

I've started seeing this on Wikipedia.

Wikipedia sources an article from a semi-legit source. That semi-legit source either just says "sources" or points to something less-legit, like a Tweet.

You can bring new "facts" into existence by just laundering them from lower- and lower-quality sources.

Arrath · 2024-07-30T01:15:55.000000Z

Source-laundering is a bit catchy, I have to say.

qingcharles · 2024-07-29T22:08:28.000000Z

The guy that started it all said it was just a "troll tweet":

https://x.com/ArtemR/status/1815408553131426179

hn_throwaway_99 · 2024-07-29T20:38:09.000000Z

The "Southwest uses Windows 3.1" claim is false, and is a great example of how bullshit can spread on the Internet once some semi "reputable" organizations repeat the false rumor:

https://kotaku.com/southwest-airlines-windows-3-1-blue-scree...

technick · 2024-07-30T05:44:01.000000Z

I worked for SITA ( https://en.wikipedia.org/wiki/SITA_(business_services_compan... )back in the late 2000's. They had a massive X25 serial network connecting airlines across the globe. Some of its customers were still running Windows 3.11 in the data center on old AT system. We would buy old computers on craigslist and ebay to keep hardware around for when it failed. I wouldn't be surprised if those systems are still in use today.

brianpan · 2024-07-29T22:16:24.000000Z

The San Francisco subway runs off of 5-inch floppy disks.

https://sfstandard.com/2023/02/02/sfs-market-street-subway-r...

That article links to an (only slightly older) article about British Airways loading navigation updates every month off of the fancy new 3.5-inch floppy disks.

umvi · 2024-07-29T21:10:18.000000Z

> Can anyone working at Southwest confirm that their main scheduling system is running on Windows 3.1?

I can't confirm that, but I can certainly confirm lots of hospital equipment is still running Windows XP and lots of hospital personnel browse the internet with Internet Explorer.

ponector · 2024-07-29T21:49:26.000000Z

This story is another example how hallucinations from LLM can successfully replace many "news" portals.

camillomiller · 2024-07-30T07:14:08.000000Z

Berlin Brandenburg got hit hard. As a disgruntled BER user, I am NOT surprised they had one of the worse repercussion.

nicbou · 2024-07-30T07:17:51.000000Z

German IT is often hit hard by such things. Unless of course they're still running on paper.

At least they immediately mentioned it on their website, as a banner right at the top. The immigration office's appointment system has been down for over a month, and it took them 3 weeks to just acknowledge it.

camillomiller · 2024-07-30T17:07:44.000000Z

Thanks for your website btw. My partner’s currently renewing her work visa (she’s from Australia) and it’s insane how bad the situation is. To the point where it feels like it should just be illegal for Berlin to operate services like this. Appalling.

ta1243 · 2024-07-30T13:57:53.000000Z

I'm still shocked that Brandenburg is actually open!

beambot · 2024-07-29T21:56:15.000000Z

Lawsuits inbound. Delta appears to be gearing up for one already:

https://finance.yahoo.com/news/delta-air-lines-seek-compensa...

hypeatei · 2024-07-29T23:27:12.000000Z

> has hired a law firm and will seek compensation from Microsoft and CrowdStrike

Going after Microsoft seems like a misguided move here. What does Microsoft have to do with a third party driver installed by your own IT department?

bruce511 · 2024-07-30T03:49:28.000000Z

I suspect the lawsuit is created by lawyers, not techies.

Equally reporting on this whole issue seems to be by journalists, not techies. It's been framed (a lot) as a Windows issue not a Crowd Strike issue.

(8.5 million machines were affected, out of 1.4+ billion windows machines [1])

I have one affected customer (10k machines) who assumed I'd suffered like he did, and was surprised when I said we weren't affected. The reporting was consistently that it was a Windows issue, caused by an MS update.

Even this article leans into this narrative...

"as the New York Times put it, “It is more apt to ask what was not affected.” The answer is Linux, Macs, and phones."

Let me add "not to mention 99.4% of windows".

So the journalists don't know what happened, or who was affected, and felt "some computers have a problem" was a weak headline. The lawyers get that narrative and run with it.

And yes, it's easy to squint and claim the "OS should cope with this", but there's realistically limits on what an OS can do once you install a kernel-level driver on the machine. Should we go after Intel for making the chips?

[1] https://www.pcworld.com/article/608447/microsoft-delighted-b...

realusername · 2024-07-30T07:03:24.000000Z

Reports in the mainstream media were absolutely insane. All the language used points to some kind of unlucky event similar to a bad weather pattern. I knew the general IT knowledge isn't very high but I didn't expect newspapers to report on it like a tornado or an earthquake...

hilbert42 · 2024-07-31T15:02:05.000000Z

"Let me add "not to mention 99.4% of windows".

...And neither were any of my Windows 7 systems affected.

"And yes, it's easy to squint and claim the "OS should cope with this", but there's realistically limits on what an OS can do once you install a kernel-level driver on the machine."

What are those limits, and why are they limits? Are there solutions? Yes there are. For instance, Microsoft doesn't takes snapshots of the working system state before loading a kernel patch, which on crash it would automatically reload without patch, nor does it employ various other techniques that would solve the problem.

I've discussed these issues in other posts so I won't repeat them here.

fsflover · 2024-07-30T08:21:38.000000Z

It's arguably also a fault of MS: https://news.ycombinator.com/item?id=41096344

vel0city · 2024-07-30T14:06:41.000000Z

> Windows is unsuitable precisely because it can be brought down by third party updates

If I run bullshit on a Linux or MacOS box it can also be unstable and brought to its knees. Or is that poster really trying to argue there's no way you can get a Linux box to lock up?

fsflover · 2024-07-30T15:17:46.000000Z

Key quote from my link:

> Third party vendors are forced into writing unsafe kernel drivers because Microsoft does not provide sufficient user mode APIs.

AFAIK it's different on Linux, and the reliability is higher. Is this not the case?

vel0city · 2024-07-30T15:29:10.000000Z

https://forums.rockylinux.org/t/crowdstrike-freezing-rockyli...

https://access.redhat.com/solutions/7068083

https://lists.debian.org/debian-kernel/2024/04/msg00202.html

You can install buggy kernel modules in Linux as well. I can't even count how many times an apt upgrade/yum update made my system unbootable when using nvidia GPU drivers.

And besides, if you're really wanting that AV system to deeply know about everything the operating system is doing and hook into tons of syscalls, you pretty much can't be running exclusively in usermode. If someone compromised the root of the system you then can't trust the info the kernel is giving your usermode application. eBPF isn't usermode.

And in the end, the poster literally said Windows is unsuitable because third party updates can kill it. That was the key takeaway from their post. Well, third party updates can kill Linux, it can kill MacOS, it can kill darn near everything.

bruce511 · 2024-07-30T14:50:10.000000Z

It's an argument. I'm not sure it's a _good_ argument, but hey it's an argument.

travoc · 2024-07-30T00:45:17.000000Z

It was too tempting to include damages from accidentally enabling New New Outlook.

L-four · 2024-07-30T10:24:51.000000Z

You, want Microsoft named in the case so CrowdStrike can't defect that it's Microsoft's fault.

bandyaboot · 2024-07-30T00:00:30.000000Z

Anyone know why Minneapolis-St Paul began experiencing cancellations much earlier than other US airports?

miohtama · 2024-07-30T14:41:09.000000Z

How to avoid getting rekt

> Southwest wasn’t affected because they don’t use CrowdStrike

roshankhan28 · 2024-07-30T07:57:52.000000Z

i really dont understand, how can my social media have better backup and infrastructure as compared to an OS which is being used by worldwide?

goodcanadian · 2024-07-30T09:44:55.000000Z

Because IT is your social media's business. They know IT inside and out. They understand what can go wrong and how to mitigate it. The business for airlines (for example) is to fly planes. They are pretty damn good at it. IT, however, is just a tool to them that they buy elsewhere. They don't understand it in the same way as social media. They rely on outside contractors to do it right: outside contractors who get the job based on being the cheapest or convincing the buyers their service is "industry best practice."

owl57 · 2024-07-30T12:33:38.000000Z

I wouldn't overestimate FAANG's immunity to crash-the-world config updates. Facebook had everything including engineers' access to the datacenter down for hours in 2021:

https://news.ycombinator.com/item?id=28750894

> infrastructure as compared to an OS

By the way, I don't think quality of Microsoft's infrastructure is relevant here.

Nextgrid · 2024-07-30T13:55:45.000000Z

Compare the salaries, working conditions and prestige offered by tech jobs at a social media companies vs some large legacy company like a bank or airline.

In the former, you are paid well and have some sort of prestige and political capital. In the latter, you are underpaid and your prestige/political capital is often equivalent to the janitor's.

rr808 · 2024-07-30T10:48:59.000000Z

Meta is one of the most valuable companies in the world with the most resources to buy the best of everything. At 1,280 Billion dollars of market cap it is 30x bigger than American, Delta and United put together. It made $39 Billion last year compared to $7.8 Billion for all US airlines together. Of course it has better systems.

hiddencost · 2024-07-30T09:20:57.000000Z

Because one is in a data center that can be controlled, and the other is deployed to user owned hardware that cannot.

Ekaros · 2024-07-30T09:24:50.000000Z

Because they don't do mass rollouts on the servers. Then again those companies could fail if they had single point of failure with automatic mass deployments...

This could happen for anything that supports this type of automatic mass deployment. Just in this case that thing was popular enough and happened on one of the most popular platforms.

pjc50 · 2024-07-30T08:59:40.000000Z

Windows has always been terrible for reliability. Adding a "security" system which is invasive and always-updated makes the reliability worse.

firtoz · 2024-07-29T20:17:27.000000Z

Is there a similar global analysis?

jjwiseman · 2024-07-29T20:53:39.000000Z

Maybe I'll do a Part 2: The World.

opdahl · 2024-07-30T02:46:21.000000Z

As a non-American that would be very interesting.

jijji · 2024-07-30T03:05:51.000000Z

love to see the airlines using linux and what kind of problems, if any, they experienced that day

fullspectrumdev · 2024-07-29T20:25:05.000000Z

I’d love to have some solid numbers of “global cancellations due to” - I heard a bunch of varying figures so far.

jijji · 2024-07-30T03:03:45.000000Z

basically any airline using linux is not on that list

bruce511 · 2024-07-30T03:57:47.000000Z

It's more accurate to say "any airline not using Crowd Strike is not on that list."

Blaming Windows for this outage is like blaming Linux for Apache bugs. The two systems are distinct.

It just so happens that Crowd Strike was very successful at selling to large corporates. That includes some airlines.

99.4% of Windows machines were unaffected. Including those of airlines using Windows, but not Crowd Strike.

jijji · 2024-07-30T04:45:24.000000Z

ahhh yes you are correct

namdnay · 2024-07-30T09:20:14.000000Z

every airline in the world "uses linux", the core reservation and distribution systems were migrated from TPF to Linux over the past 20 years

misja111 · 2024-07-30T06:52:11.000000Z

Can you name one?

aftbit · 2024-07-29T20:29:57.000000Z

One interesting feature of this outage was that "PROD" was generally fine, on account of mostly running on Linux and/or ancient proprietary software, while "CORP" was generally wrecked, on account of mostly running Windows. In other words, the bank systems responsible for moving money mostly worked, while the systems responsible for allowing humans to interact with them (to issue approvals, change configuration, or other ops things) often did not.

7thaccount · 2024-07-29T20:33:21.000000Z

Same thing for a lot of industries actually. PROD runs on Linux and probably has some delay to prevent this. Corp gets hosed.

LeifCarrotson · 2024-07-29T21:32:05.000000Z

Yep, here in manufacturing production/OT PLCs run on Wind River VxWorks from Rockwell, Siemens, and others. The HMI (human-machine interface, basically a touchscreen used to display status and enter setpoints and other data) and SCADA/ERP systems run on Windows. Sometimes, this is an industrial fanless PC with eg. Ignition (Java+Python) software, other times it's a Rockwell Panelview which actually still run Windows CE 6.0.

This gets to be a problem when IT wants to get their hooks into OT networks. The PLC is meant to be left alone, and will happily send its Ethernet packet to that servo drive or digital IO card every 10ms for literal decades. There is no reason to update its firmware ever, just don't expose it to the Internet. But corporate wants everything on the Internet.

The PLC will reliably run its sequence when you close the contacts on the physical "Cycle Start" pushbutton. But if corporate is down, you can't know what part number you're supposed to make or how many of them, or get a serial number from and report test results to the traceability database.

aftbit · 2024-07-30T21:08:49.000000Z

On the flip side, there are a lot of physical production systems (think CNC mills or 3d printer farms) where remote observability and management would be very handy, or where you'd really like to upload gcode files directly from your workstation. However, because they've been air-gapped, you need to instead walk across the shop floor to "that one PC" that allows you to insert a USB stick, copy the files off the network drive to the USB, then walk back to the lab with the machine tools and insert the stick to feed the files over.

If you want to monitor, you need to sit in the lab and watch, or if you're lucky, leave a PC with a webcam pointed at the tool and remote into that machine from your desk or your laptop at home.

This works, but long cycle times kill productivity, and engineering twiddling their thumbs costs money. It's easy to end up spending multiple hours a week just walking back and forth doing this dumb dance. One would expect that with 40+ years of networking experience, we would have come up with a way to securely perform these tasks without simultaneously exposing our tooling to cyberattack. Perhaps some kind of segregated network that can't access the internet, but gets pull-only access to the file share? Or vice versa - a screencap feed that gets sent through a data diode so an engineer can monitor the tool from their phone or laptop without being able to affect it?

Perhaps such solutions exist and are just beyond the IT skills, budget, or complexity appetite of the sorts of production tooling shops that I'm familiar with.

Caveat tho - I don't work in this space, I'm just friends with people who do.

aftbit · 2024-07-30T21:10:43.000000Z

100% - banks were just a specific example that popped to mind immediately. If this bug had affected Linux Crowdstrike, we might have seen the opposite - people hard at work on their laptops trying to fix the production server outages. Probably nothing could have taken down the FAA systems though, on account of them being too old and bespoke to have a supported Crowdstrike module.

brazzy · 2024-07-29T20:33:20.000000Z

In the original thread there were some reports of people having their Linux systems taken down by Crowdstrike as well. At separate times, of course, and I supposed the greater heterogeneity of Linux distros prevents events of this magnitude. But that would be little consolation when it takes down your systems.

foobarchu · 2024-07-29T22:12:12.000000Z

Those should be considered coincidence until proven otherwise. Crowdstrike is intended to bring down systems when it believes there was an intrusion, after all.

brazzy · 2024-07-31T21:33:06.000000Z

Not by crashing the kernel...

mjevans · 2024-07-29T20:31:55.000000Z

Outsourcing a core business competency and surely also cutting the contracts to the bone as well to pocket the savings embrittled Delta and I seriously hope the compensation to customers costs more than any savings or profits they made in the interim. It MUST be painful enough that they do not repeat this mistake again.

The article quotes https://www.reddit.com/r/delta/comments/1edtfbh/why_did_delt... (with improper attribution)

topgun966Platinum wrote on Reddit """ These "experts" are completely wrong. The core issue was Delta did NOT have a proper DR plan ready and did NOT have a proper IT business continuity plan ready. UA, AA, and F9 recovered so fast because they had plans on stand-by and engaged them immediately. After the SWA IT problem, UA and AA put in robust DR plans staged everywhere from the server farms, to cloud solutions, to end-user stations at airports. They had plans on how to recover systems. DL outsources a lot of their IT. UA and AA engaged those plans quickly. They did not hold back paying OT for staff. UA and AA have just as much reliance on Windows as Delta. AA was recovered by end of data Friday and resumed normal operations Saturday. UA was about 12 hours behind them having it resolved by Saturday morning resuming normal schedules Saturday afternoon. The ONUS is 100% on DL C+ level in their IT decisions. The problem is that the lower level IT staff is going to get the brunt of the blame and the consequences. """

tiahura · 2024-07-30T15:06:22.000000Z

That’s why I think the suit against crowdstrike and Ms is mostly a dud. First you have to get around the waiver (much harder for business than a consumer) and then you have to deal with comparative fault - ie delta’s disaster recovery system sucked.

skrebbel · 2024-07-29T20:50:59.000000Z

I love that “CrowdStrike” is now a synonym for “global outage”. Not some cute hihi name like “heartbleed”, just the name of the company that did the screwup. Seems fair.

jraph · 2024-07-29T21:15:38.000000Z

Not sure it's fair, but I am certainly waiting for it to become a verb or a noun.

    crowdstrike. n.
     1. A set of major disruptions caused by an update that was not tested enough, pushed to many devices across the globe.
     2. The name of such an update.
     3. (by extension) a joke so bad it causes major disruptions.

     For instance:
       - Congrats for your crowdstrike! Now my weekend is ruined as I'll be the one who'll be asked to fix this mess.

    crowdstrike. v. (simple past crowdstruck or crowdstriked¹, past participle crowdstricken, or crowdstruck, or (obsolete, regionalism) crowdstroke²)
     1. Action of pushing an update to many devices that causes a global outage or major disruptions in various sectors.

     For instance:
       - We've been crowdstruck. Again.

    crowdstrike. adj.
     1. Qualifies an update that, when pushed to many devices across the world, causes major disruptions across the globe.
     2. Qualifies such a (set of) event(s).

    For instance:
       - We are sorry for the crowdstrike event we caused. We gently remind our kind customers and their end users that per our ToS, we will issue no refund, and that no liability can be held against us. Customers who don't try to contact us in the following month will get a discount for their next contract renewal. You will hear us speak before the Congress, who nicely invited us for some comedy in the hope it will appease you all. Make sure you like the related videos on the various online platforms. We wish you a nice end of the week and nice, relaxing summer holidays.

¹ people have differing but strong opinions on which simple past form is correct, mainly due to regional differences. Some avoid saying crowdstrike and say crowdhit instead.

² some people have tried to push crowdstricken, which first caught on in some areas or particular contexts. The idea that this form likens the qualified subject to the bearer of some sickness has eventually seduced a critical mass of people after some initial push back. Please also see the usage notes for strike for other, rarer, alternative forms [*].

[*] https://en.wiktionary.org/wiki/strike#Usage%20notes

(Thanks to the contributors in this thread)

arrakeen · 2024-07-29T21:45:06.000000Z

since nothing will happen to them except a slap on the wrist, and all our employers will continue to force this crapware on our machines, i think we should make a point to start using their name as a pejorative (similar to the 'santorum' neologism). any when they inevitably try to rebrand, use that term too

chasd00 · 2024-07-30T02:14:00.000000Z

> since nothing will happen to them except a slap on the wrist

I've already bought some of their stock, i'm pretty sure it's bottomed. I bet i make 30% a year from now. This always happens some "ohnoes!" event cuts a stock price off at the knees but then everyone forgets and in a year or so it's back to where it was before the event.

skrebbel · 2024-07-29T21:24:57.000000Z

“The intern crowdstruck half the customers”

jraph · 2024-07-29T21:29:23.000000Z

Exactly, by the way I added the irregular inflections and fixed the example for the verb. Thanks for your contribution.

LeifCarrotson · 2024-07-29T21:37:58.000000Z

I disagree, I think that the simple past should be "crowdstruck" but the participle should be "crowdstricken", as might apply to someone afflicted by an illness:

"The update wasn't tested, so the servers are all crowdstricken."

jraph · 2024-07-29T21:42:46.000000Z

Thanks, I added the documentation for this form, and added a second usage note. I initially wanted to tease you by documenting that people with bad taste tried to push for this form, but I really like this illness idea.

quectophoton · 2024-07-30T10:26:54.000000Z

> crowdstruck

    Said, "Yeah, it's all right
    We're doing fine"
    Yeah, it's all right
    We're doing fine, so fine

    Crowdstruck
    Yeah, yeah, yeah, crowdstruck
    Crowdstruck (crowdstruck)
    Whoa, baby, baby (crowdstruck)
    You've been crowdstruck

(AC/DC's Thunderstruck, but replacing "thunderstruck" with "crowdstruck")

defrost · 2024-07-30T10:37:22.000000Z

They do have a song about an insidious disabling virus you know: https://www.youtube.com/watch?v=6njy7mZbwdc

aragonite · 2024-07-30T02:29:34.000000Z

Does anyone know (or have any guesses as to) why the founder(s) named it "CrowdStrike"? What was (or might have been) the idea behind the name? I'm guessing it's not patterned after "crowdfunding" "crowdsourcing" "crowdlending", etc.

latentsea · 2024-07-30T06:29:16.000000Z

It's part of a trend where companies name themselves after a self-describing disaster they're going to cause. Oceangate also did this.

New investing strategy is to look for companies whose name also fits this pattern but who have not yet caused the disaster and short the stock.

Hemospectrum · 2024-07-30T02:40:54.000000Z

The cute name was Blue Friday, but it doesn't seem to have caught on.

stana · 2024-07-29T21:58:02.000000Z

Rebranding project coming up at CrowdStrike?

jraph · 2024-07-29T22:10:48.000000Z

That would be a shame, the name is so fitting, more than ever!

They struck a very big crowd real bad.

ks1723 · 2024-07-29T20:30:48.000000Z

I found it quite interesting, that crowdstrike actually exclude a bunch of services explicitly. They also basically say, don’t use, if it needs to be reliable. I don’t know if this is standard for software, but for me this was quite surprising.

From crowdstrike terms and services [1]: […] THERE IS NO WARRANTY THAT THE OFFERINGS OR CROWDSTRIKE TOOLS WILL BE ERROR FREE, OR THAT THEY WILL OPERATE WITHOUT INTERRUPTION OR WILL FULFILL ANY OF CUSTOMER’S PARTICULAR PURPOSES OR NEEDS. THE OFFERINGS AND CROWDSTRIKE TOOLS ARE NOT FAULT-TOLERANT AND ARE NOT DESIGNED OR INTENDED FOR USE IN ANY HAZARDOUS ENVIRONMENT REQUIRING FAIL-SAFE PERFORMANCE OR OPERATION. NEITHER THE OFFERINGS NOR CROWDSTRIKE TOOLS ARE FOR USE IN THE OPERATION OF AIRCRAFT NAVIGATION, NUCLEAR FACILITIES, COMMUNICATION SYSTEMS, WEAPONS SYSTEMS, DIRECT OR INDIRECT LIFE-SUPPORT SYSTEMS, AIR TRAFFIC CONTROL, OR ANY APPLICATION OR INSTALLATION WHERE FAILURE COULD RESULT IN DEATH, SEVERE PHYSICAL INJURY, OR PROPERTY DAMAGE. Customer agrees that it is Customer’s responsibility to ensure safe use of an Offering and the CrowdStrike Tools in such applications and installations. CROWDSTRIKE DOES NOT WARRANT ANY THIRD PARTY PRODUCTS OR SERVICES.

[1] section 8.6 of https://www.crowdstrike.com/terms-conditions/

objclxt · 2024-07-29T20:32:11.000000Z

> I don’t know if this is standard for software

This is pretty standard. There is almost identical language in the Windows and macOS EULAs, for example.

ale42 · 2024-07-29T20:50:08.000000Z

Same for datasheets of most electronic components. The manufacturers don't want the responsibility to avoid possible multi-million lawsuits.

SoftTalker · 2024-07-29T21:41:25.000000Z

So how does it get installed on all the endpoints in 911 dispatch centers?

EvanAnderson · 2024-07-29T22:37:29.000000Z

Because FBI CJIS requirements, adopted by state law enforcement bodies, require it. I support a Public Safety Answering Point (PSAP, aka a 911 call center) and I push back on as many of the inane requirements as I can with compensating controls.

Example: As of right now I am still required to expire passwords every 90 days. My state is considering the current guidance from NIST but FBI CJIS policy still mandates the expirations.

tgv · 2024-07-30T06:09:30.000000Z

I don't know what CJIS requirements entail precisely, but at a first glance, they seem reasonable. But it's weird that people then think they can comply by installing a product with a disclaimer against their intended use. It's just a token acknowledgment: "Yeah, we've read it, but we don't really care."

If that's also the interpretation of the courts, then each company would be invidivually liable, at least towards the government.

hypeatei · 2024-07-30T13:32:50.000000Z

Holy shit I cannot stand the password expiration requirements. Like you said, NIST literally recommends against it but so many regulations require it. So aggravating.

wrs · 2024-07-29T22:10:37.000000Z

Because no endpoint protection software exists that doesn’t have the same disclaimer clause. So you install this one and accept the lack of vendor liability.

(If such a thing did exist, it would cost a lot more!)

nemonemo · 2024-07-29T22:07:23.000000Z

What is the alternative? Have you considered a possibility that those could be the best out there for 911 despite their imperfections?

SoftTalker · 2024-07-30T03:26:36.000000Z

The data entry endpoints in a 911 dispatch center should not be running a general purpose consumer OS. They should be single purpose machines much closer to a dumb VT100 terminal than a personal computer. Maybe something like a stripped down hardened Chromebook. No internet connection. No personal email, web, or other use allowed or even possible. A product like crowdstrike should not be needed because it should not be possible to run anything but the dispatching software on those machines.

EvanAnderson · 2024-07-30T05:34:38.000000Z

That's what computer aided dispatch (CAD, in the industry) software was 30 years ago (my PSAP had an AS/400). The market has rejected it. Also, see my other comment re: FBI CJIS policy.

In the PSAP I support we have three dedicated PCs at each workstation to run the CAD, phones, and radio. Each of those has a dedicated VLAN, separate physical servers and storage, separate Active Directory forest for CAD (no AD for radios or phones-- standalone PCs), and default-deny ACLs for inbound and outbound traffic on the hosts and at the borders.

A fourth dedicated PC (VLAN, ACLs, physical servers, AD environment) does email, web browsing, etc. (All of it is shackled together with a nice KVM that supports a single keyboard and mouse controlling up to 5 PCs.)

Not every PSAP does this and I think that's insane. The law and fire agencies we interface with absolutely do put a single PC on a desk (or in a cruiser) and use it for everything (and we filter and monitor the traffic coming in from them over our VPN heavily and block access at the first sign of anomalous traffic). Often their budgets don't support the notion of using dedicated computers for task-oriented work. The marketers have pushed general purpose devices for this kind of application.

In the last 5 years all three "hardened" systems we use (all companies acquired by Motorola) have started requiring Internet access for various APIs they use, and for integration with third-party vendors (mapping, public information databases, and task instructions for telecommunications). I think it's ridiculous, but I don't get to decide the direction of the product roadmaps or what the business stakeholders want from a feature perspective.

Motorola (who makes the CAD software used by some of the largest US municipalities) is pushing for hosted CAD and integrating hosted features into on-prem systems. (Of course, they have a managed security product offering that they want to sell along side it.)

awad · 2024-07-30T01:38:14.000000Z

Usually the largest of companies will have their own customized T&Cs governed in their Master Services Agreement (MSA) which are often very modified versions of these publicly available ones

sidewndr46 · 2024-07-29T22:11:47.000000Z

My experience has been better legal counsel has the relevant terms struck before the deal is signed. In this case it would have been the terms around Aircraft and aviation

jojobas · 2024-07-30T03:19:58.000000Z

There often are limits to how much your can disclaim in your T&C. If under the same terms you cause damages deliberately you'll be held liable, and obvious gross negligence can be a factor as well.

There are often 3 opinions between any 2 lawyers so we have a chance to learn the outcome many months and millions of dollars later.

bustling-noose · 2024-07-30T03:56:17.000000Z

> The outage highlighted a different kind of digital divide. On one side, gmail, Facebook, and Twitter kept running, letting us post photos of blue screens located on the other side: the Windows machines responsible for actually doing things in the world like making appointments, opening accounts, and dispatching police.

At this point using windows for these tasks seems like using legacy software because training people to use an iPad or a web browser seems too complicated or because no one wants to move their age old systems to a more modern web based system because of costs. Native apps work great, but I think the world is moving to the cloud and that means web based everything should be the norm. Yes AWS AZURE outages can still happen but those can be fixed by spinning up a VM in different clouds.

This is also why software jobs aren’t going anywhere thanks for a while. Many systems need to be changed to more modern and robust clouds. It might take decades for this transformation across the globe.

amluto · 2024-07-30T06:40:03.000000Z

Your “modern and robust cloud” is my “why on Earth doesn’t this thing work offline”.

The world is absolutely full of things that have worked for decades to centuries without the Internet, are eventually more or less consistent (remember carbon paper credit card machines?), and did an amazing job of keeping the world running despite, wars, network partitions (the “network” would basically always be partitioned), mistakes, entire branches offline, etc.

Sure, a lot of things are easier when centralized, and “the cloud” is incredibly powerful. But it’s not necessarily more robust. Also, depending on any sort of cloud means you’re also depending on the network, and networks are far from infallable. There’s a reason that a lot of stored-value transit systems still track balances on the card and will let people in even if a fare gate cannot connect to a cloud service.

And CrowdStrike took out plenty of cloud instances, and recovering them can be worse than recovering physical hardware, as the “robust cloud” has an absolutely terrible ability to do anything outside the happy path of booting an instance normally.

dailykoder · 2024-07-30T08:35:16.000000Z

Okay this sounds all very reasonable, but how do you know when your washing machine is finished, when it's not connected to the cloud and you won't get notified in your app? It sure is not an easy thing and the cloud helps very much here

julian_t · 2024-07-30T09:32:44.000000Z

When the noise from the white box stops, then I know. And if I'm not at home to hear it, I'm not quite sure why I'd need to know.

mschuster91 · 2024-07-30T11:13:24.000000Z

Well, for people in an apartment it doesn't matter all that much, but if your laundry washer or dryer is in the basement, you don't necessarily hear it if you're out in the garden.

dailykoder · 2024-07-30T13:20:45.000000Z

Sure, it might be a "nice to have" thing. But the machines usually show how long they'll take. And even if it's a newer one with sensors that make the whole process vary in time. I'd still be like "Oh, okay it'll take about 3 hours, so ill be back at 6pm". It doesn't really matter if the clothes chill out for about an hour, especially the newer machines don't stink that fast. And on top of that, I don't think that it has to go over the internet if you needed some sorta notification. Local would be suffiecient.

If I buy something new like this and have a few choices, I intentionally pick the one with as few smart features as possible.

dTP90pN · 2024-07-30T11:30:46.000000Z

What happened to the good old tin can telephone down the side of the house to the washing room?

geoduck14 · 2024-07-30T12:34:12.000000Z

I think you are joking, but I'll reply with a serious answer.

Where I went to college, our dorms had (free) shared washing machines. This was "pre cloud", but wifi was throughout. One student rugged up a hall-effect sensor and attached it to each power cable. It could detect if the washers and driers were on. It sent this info to a specific website that the students could monitor to see if there were any available washers or driers.

mmikeff · 2024-07-30T12:44:04.000000Z

Wasn't the first webcam setup to show whether a coffee pot was full?

red-iron-pine · 2024-07-30T14:22:02.000000Z

Also the reason we got Hyper Text Coffee Pot Control Protocol (HTCPCP) in RFC 2324

nihzm · 2024-07-30T09:26:35.000000Z

I hope this is sarcasm, but if it isn't washing machine cycles have a fixed duration so a timer on your phone is more than enough, no cloud necessary.

4ad · 2024-07-30T09:28:43.000000Z

I wish washing machines had a fixed cycle duration. When I start the cycle my washing machines tells me the same duration, always, but in actuality it takes different amounts of time every time. Madness. I've been told this is a feature.

mschuster91 · 2024-07-30T11:15:35.000000Z

> Madness. I've been told this is a feature.

It actually is. Fixed length cycles haven't been a thing for many years now - modern washing machines adjust the washing cycle length by the weight of the laundry and its behavior during spin-drying, both its vibration behavior aka weight distribution (that can have multiple adjustment cycles to achieve reasonably even distribution) and how much water it loses - when no more water comes out during spinning, it will cut the cycle short to save energy.

anticensor · 2024-07-31T18:59:55.000000Z

Yes, newer machines shorten the cycle for lower loads and less dirty clothes.

throw0101b · 2024-07-30T11:59:11.000000Z

> When I start the cycle my washing machines tells me the same duration, always, but in actuality it takes different amounts of time every time.

If it says (e.g.) 43 minutes, but sometimes it takes 40 and sometimes 49 or 53, set your timer for 60 minutes and get on with life. Your laundry sitting for 17 or 7 minutes isn't the end of the world. If your timer goes off and it's still not done, set it for another 20 and do something else.

Of all the things to fill your head with worry and annoyance with, laundry is near the bottom of the list for me.

4ad · 2024-07-30T12:12:15.000000Z

Except when you live in a building with communal washing machines and where you need to book time for laundry, as it is common in many European cities.

krige · 2024-07-30T11:11:24.000000Z

My washing machine is kind enough to both indicate time to end in minutes, but also allows me to delay start so that the cycle is finished in [x] hours. It's not even that modern.