I prefer the walkthrough of the CAP theorem in “Designing Data-Intensive Applications,” which argues that the CAP “theorem” is an imprecisely defined and generally unhelpful concept in distributed systems. “Available,” for instance, has no formal definition.
And it’s confusing that it’s three letters, because it’s not “choose two”. It’s not that a system is “partition tolerant”; it’s that if there’s a network partition, you choose availability or consistency. And obviously you choose availability for the distributed systems you most commonly encounter.
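To make that trade-off concrete, here’s a toy sketch (all names hypothetical, not from any real system) of the choice a single replica faces when it can’t reach its peers: serve what it has locally, or refuse to answer until the partition heals.

```python
class Replica:
    """Toy replica illustrating the AP-vs-CP choice during a partition."""

    def __init__(self, prefer_availability: bool):
        self.prefer_availability = prefer_availability
        self.local_store = {}     # possibly-stale local copy of the data
        self.partitioned = False  # set when peers are unreachable

    def read(self, key):
        if not self.partitioned:
            return self.local_store.get(key)  # normal path: no trade-off needed
        if self.prefer_availability:
            # AP: answer from local state, accepting that it may be stale
            return self.local_store.get(key)
        # CP: refuse to answer rather than risk returning inconsistent data
        raise RuntimeError("partition detected; cannot guarantee consistency")
```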
> And obviously you choose availability for the distributed systems you most commonly encounter.
Failing open (rather than failing safe) is generally regarded as the most acceptable thing we can do as CRUD programmers; that’s true, but it’s categorically untrue that this is a foregone conclusion.
There are many cases (especially in financial services) where failing safe is the much preferable option, and having retry logic in the application is much preferred.
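A rough sketch of what that application-side retry might look like, assuming a backend that rejects requests while it’s failing safe (the `submit_transaction` callable and `TransientError` are hypothetical stand-ins, not any real API):

```python
import random
import time


class TransientError(Exception):
    """Hypothetical error raised while the backend is failing safe."""


def submit_with_retry(submit_transaction, txn, max_attempts=5, base_delay=0.2):
    """Retry a rejected transaction with exponential backoff and jitter."""
    for attempt in range(max_attempts):
        try:
            return submit_transaction(txn)
        except TransientError:
            if attempt == max_attempts - 1:
                raise  # give up after the final attempt
            # Back off exponentially, adding jitter to avoid thundering herds.
            time.sleep(base_delay * (2 ** attempt) + random.uniform(0, base_delay))
```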
In distributed banking back-ends, if one of your datacenters goes down, you don't take down the other one for safety. You don't force global consistency/linearizability of all transactions before allowing UI updates. There are delays in financial reconciliation all the time; the important thing is that they are eventually consistent with a ledger (sketched below), not that you stop the train the moment one thing fails. And the reality of distributed systems is that things fail constantly: hard drives, networks, bugs, clock drift...
This is in contrast to something like a supercomputer, or a distributed map-reduce job, where a single node failing mid-process can corrupt your data, and you have the luxury of stopping the whole thing, fixing the issue, and restarting the process.
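As a toy illustration of “eventually consistent with a ledger” (hypothetical names, nothing like a real banking backend): each site keeps accepting entries locally and drains its backlog into the central ledger whenever it can reach it, instead of halting when a peer is unreachable.

```python
from collections import deque


class LocalBranch:
    """Toy model of one datacenter that stays available during a partition."""

    def __init__(self):
        self.pending = deque()  # entries not yet confirmed by the central ledger

    def record(self, entry):
        # Stay available: accept the entry locally even if the ledger is unreachable.
        self.pending.append(entry)

    def reconcile(self, ledger_append):
        # Drain the backlog whenever the ledger is reachable again.
        # A real system would also need ordering and idempotency guarantees.
        while self.pending:
            ledger_append(self.pending[0])  # may raise if the ledger is still down
            self.pending.popleft()
```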
"No formal definition" includes where we are today, not the original, outdated idea. By the original (not useful) definition, an "available" distributed system can return a response 10 years later, which is not helpful nor relevant when thinking about distributed systems.
What has changed that makes the idea outdated? You can argue that it was always unhelpful, but I can't see how it could be outdated.
You're correct that the theorem doesn't address latency requirements. There are all kinds of things it doesn't address. The point of it is simply, as you say: you must give up consistency to have availability in the face of a partition, or vice versa. Some vendors of distributed systems would have us believe otherwise. The theorem gives us a framework for understanding, at a very basic level, the trade-offs that must be made in distributed system implementations. That isn't really very much for as much airtime as it has gotten, which I suspect is the source of your contention with it.