> Cockroach is a distributed key:value datastore *(SQL and structured data layer...

atombender · on June 5, 2015

I suspect they have figured out the hard part.

If you look at the documentation (eg., [1]), the design has been rather carefully thought out; it's just that they're implementing it from the bottom up.

According to their roadmap [2], they're aiming for KV functionality in 1.0 and aren't aiming for SQL until past version 2.0 (it's currently alpha).

Given the backgrounds of the technical people involved (including Google, as this project is inspired by Spanner), they should have a lot of experience with what they're trying to accomplish.

As for "done before", a core feature of Cockroach is true ACID transaction support, including snapshot isolation, something no distributed NoSQL database I know about supports. (ArangoDB does support transaction, but is mostly NoSQL in the sense of implementing a different query language than SQL.)

[1] https://github.com/cockroachdb/cockroach/blob/master/docs/de...

[2] https://github.com/cockroachdb/cockroach/wiki/Roadmap

tschottdorf · on June 5, 2015

Exactly right. The hard part is building a key-value store with a powerful notion of transactions (not just compare-and-set or the like), and that's what's mostly done. Structured data is still work, but on the shoulders of giants.

mbell · on June 5, 2015

> As for "done before", a core feature of Cockroach is true ACID transaction support, including snapshot isolation, something no distributed NoSQL database I know about supports.

Zookeeper has ACID transactions which I believe are linearizable (which trumps SI). The downside is the memory only working set, but given how cheap memory is, I'd still rather have a memory only Zookeeper with a rich query interface than a large storage data KV store with minimal query interface.

> ArangoDB does support transaction, but is mostly NoSQL in the sense of implementing a different query language than SQL

What is your definition of NoSQL?

atombender · on June 5, 2015

ZooKeeper is not a general-purpose database. I have heard of anyone using it as one, either.

> What is your definition of NoSQL?

I don't have one, and I think the term isn't terribly useful. But the whole idea of NoSQL started as an attempt to break free of the relational aspect of SQL, because things like joins, strict schemas, foreign keys, and normalization were perceived as getting in the way of distribution. ArangoDB supports joins (but not foreign keys, because it's schemaless) and an SQL-like query language, which makes it a lot closer to an SQL database than something like Redis or Cassandra.