Mostly unrelated, but a fun fact about quorums that I enjoy noting whenever I ca...

elvinyung · on March 21, 2017

I would argue that by the time you've chosen Paxos or some other majority quorum commit protocol, you're already well aware that you're building a CP system, and that availability and latency aren't your main concern. A majority quorum is basically the most obvious (and somewhat brute-force) way of providing serializable consistency in the system.

The one non-majority quorum commit protocol that most people are probably already familiar with is the "sloppy quorum" replication in Dynamo systems[1] (e.g. Cassandra, Riak, Voldemort, etc.). Basically, since the quorum is configurable on a per-cluster basis instead of being inherent to the protocol, and usually isn't a majority of the cluster, the system can still make progress when half of the nodes are unreachable. (But of course, as the paper notes, this means that you need to resolve conflicts some other way, which adds a whole bunch of complexity.)
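
To make that trade-off concrete, here is a minimal Python sketch, assuming the usual Dynamo-style parameters N (replicas per key), R (read quorum) and W (write quorum). The function name `quorum_profile` and its result keys are made up for illustration; they don't come from any of the systems named above.

```python
def quorum_profile(n, r, w):
    """Summarize what a Dynamo-style (N, R, W) setting tolerates.

    Hypothetical helper for illustration only, not an API of Cassandra,
    Riak, or Voldemort.
    """
    return {
        # A write still succeeds while at least W replicas are reachable.
        "write_failures_tolerated": n - w,
        # A read still succeeds while at least R replicas are reachable.
        "read_failures_tolerated": n - r,
        # Every read set shares a replica with every write set only if R + W > N.
        "reads_overlap_writes": r + w > n,
    }


if __name__ == "__main__":
    print(quorum_profile(4, 2, 2))
    print(quorum_profile(3, 2, 2))
```

With N=4, R=2, W=2, writes keep going with half of the replicas unreachable, but R + W = N, so a read is not guaranteed to touch a replica that saw the latest write. That gap is where the conflict resolution comes in.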

1: http://www.allthingsdistributed.com/files/amazon-dynamo-sosp...

dllthomas · on March 21, 2017

> you're already well aware that you're building a CP system, and that availability and latency aren't your main concern

Assuming you've chosen correctly between CP and AP approaches, this tells us that availability and latency aren't as important as consistency. But there's nothing that says they aren't arbitrarily close...

elvinyung · on March 22, 2017

Yeah, definitely -- I agree that the decision doesn't mean you should just blindly throw away availability optimizations once you've decided that consistency is important.

Actually, invoking CAP probably didn't add to my message. What I meant to say is that people don't talk about non-majority quorum commits that much because the interesting part is that the serializability comes with majority/overlapping quorums.

dllthomas · on March 22, 2017

As I read it, the comment you were replying to was still restricting its discussion to overlapping quorums, and merely pointing out that that's not actually synonymous with majority.

elvinyung · on March 22, 2017

Fair enough :) I guess my mind latched onto the "it still seems under-explored" part and wanted to try and respond to that.

jasonwatkinspdx · on March 21, 2017

I think you'd be interested in Flexible Paxos[1] if you haven't run into it.

[1]: https://arxiv.org/abs/1608.06696
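
As I understand the paper's headline result, only the phase-1 (leader election) quorums need to intersect the phase-2 (replication) quorums, so neither has to be a majority. Below is a small brute-force check of that intersection property for fixed quorum sizes, in Python. The function name `quorums_always_intersect` is hypothetical, and this only checks set overlap; it is not an implementation of the protocol.

```python
from itertools import combinations


def quorums_always_intersect(n, q1_size, q2_size):
    """Return True if every q1_size-subset of n nodes shares at least one
    node with every q2_size-subset.

    Brute force over all subsets, so only sensible for small n.
    """
    nodes = range(n)
    return all(
        set(q1) & set(q2)  # a non-empty intersection is truthy
        for q1 in combinations(nodes, q1_size)
        for q2 in combinations(nodes, q2_size)
    )


if __name__ == "__main__":
    # 6 nodes: a replication quorum of 2 is well below the majority of 4,
    # but it still overlaps every election quorum of 5.
    print(quorums_always_intersect(6, 5, 2))
    print(quorums_always_intersect(6, 4, 2))
```

For size-based quorums like these, the enumeration agrees with the simple rule `q1_size + q2_size > n`.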

jakewins · on March 21, 2017

Oh man, that is a really cool paper, thanks a bunch for sharing that. I've got next week off and a real itch to try Go for network code... might try this out!

sbanach · on March 22, 2017

That's pretty interesting. Is there a concrete example of a quorum definition where the probability of an outlier is improved vs. a majority quorum? I'm struggling to come up with one. I've always assumed majority is optimal, since you can tolerate outliers in (n-1)/2 voters without seeing an outlier for the commit overall. E.g., for 3-node Raft, you'd need both followers to have an outlier before a client notices a slow commit.
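
One way to put numbers on this question is a back-of-the-envelope model in Python. It assumes a leader-based protocol where the leader counts toward its own quorum and is never the slow one, and where each follower is independently slow with probability `p_slow`. Those assumptions and the name `p_slow_commit` are mine, not from the thread or the paper.

```python
from math import comb


def p_slow_commit(n, quorum, p_slow):
    """Probability that a commit is slow in a cluster of n nodes.

    Simplified model (see assumptions above): the commit is fast as soon
    as quorum - 1 followers have acknowledged quickly.
    """
    followers = n - 1
    acks_needed = quorum - 1
    # The commit stays fast as long as at most this many followers are slow.
    max_slow_tolerated = followers - acks_needed
    # Binomial tail: probability that more followers than that are slow.
    return sum(
        comb(followers, k) * p_slow**k * (1 - p_slow) ** (followers - k)
        for k in range(max_slow_tolerated + 1, followers + 1)
    )


if __name__ == "__main__":
    print(p_slow_commit(3, 2, 0.01))  # 3 nodes, majority quorum
    print(p_slow_commit(5, 3, 0.01))  # 5 nodes, majority quorum
    print(p_slow_commit(5, 2, 0.01))  # 5 nodes, smaller replication quorum
```

For `n=3, quorum=2` the sum collapses to `p_slow**2`, which matches the 3-node Raft example: both followers have to be slow. In this model a smaller replication quorum at the same `n` lowers the tail probability further, at the price of a larger quorum elsewhere to keep the overlap.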

gosubpl · on March 22, 2017

You are right: https://blog.acolyer.org/2016/09/27/flexible-paxos-quorum-in... There is also sample code and TLA+ proofs: https://github.com/fpaxos/fpaxos-tlaplus