Why Replication is Awesome

rdtsc · on Nov 7, 2013

CouchDB and by extension Cloudant are really cool.

Change feeds and replication are first class citizens if you wish, not some side-feature.

This means 3 things for me:

* Can build your own cluster topology. Replicate in a star pattern, in a circle with overlapped neighbors, in a hierarchy. You decide. Replication is explicit and configurable.

* Ideal for real-time applications. Having client poll for changes can grind things down. CouchDB support continuous updates and changes feed. It can go via long polling, event sourcing, can still to GET's with a 'since' parameters. Up to the user.

* Having to handle merge conflicts explicitly. MVCC semantics is not hidden or hand-waved away with a timestamp-last-writer-wins-unless-your-ntp-is-messed-up. Conflicts are first class citizens as well and one needs to know how to handle. That is good for some case but bad in others. I like it. For example one can attach an application specific conflict resolver that knows how to solve conflicts for that particular database in an application specific manner. Riak is another database that handles explicit conflicts well.

BTW: Just noticed CouchDB 1.5 was just released.

http://www.apache.org/dist/couchdb/notes/1.5.0/apache-couchd...

Will have to play with the new Admin UI and Node.JS query server.

skrebbel · on Nov 7, 2013

With regard to your 3rd point, conflicts that happen during replication aren't actually first class citizens, if memory serves me well.

If you change the same field on two databases at roughly the same time, some heuristic determines who wins.

rdtsc · on Nov 7, 2013

That is true but the conflict is preserved in the database. So a change feed set up to monitor conflicts will notice it immediately. You do have to set up the monitor and conflict resolver.

From the CAP standpoint your system is available but not consistent for a short period of time because initially on a GET during a conflicted document you might get one version but after the custom merge resolve runs, it could end up replacing with another.

That is the price to pay for availability, and it works best if applications on top are designed to work with it. It is not always possible so this, so it is a tool that is good in some cases but not in all cases.

mlmilleratmit · on Nov 7, 2013

There is indeed a deterministic but arbitrary choice of the "winner", but there is a big difference with other systems -- no data is discarded. The hash tree branches and both edits are saved. You can choose which branch of the tree you want to start editing, and you can listen to a feed for documents that have conflicting edits and resolve them according to your application logic. The real choice is to never discard data.

memracom · on Nov 7, 2013

Meh... Replication has been around for a long time. Couchdb was not anywhere near the first, in fact Couchdb was just leveraging capabilities that were in Erlang OTP for many years.

And replication is not perfect either. There are tradeoffs and corner cases that need to be understood. Riak is another cluster datastore that is written in Erlang OTP, like Couchdb, and the people who build Riak are quite open about the issues that they have to deal with and the corner cases. They often present at conferences and write blogs on these topics so if you really want to UNDERSTAND replication, google for blogs and conference presentations connected to Riak.

In any case, Couchdb is good, Riak is good, but even traditional RDBMSes like PostgreSQL are good and can do replication. In all these cases, the developers have wrestled with the math and computer science behind replication and have made something that mere mortals can use.

BigBlueHat · on Nov 7, 2013

To be clear, "replication" in this case is "multi-master replication."

Riak's Replication: http://docs.basho.com/riak/1.3.2/references/appendices/conce...

CouchDB's Replication: http://docs.couchdb.org/en/latest/replication/intro.html

Riak's is for data safety within the cluster.

CouchDB's is multi-master, cross-cluster, cross-device, etc. It can either be uni-directional (push or pull) or (if you do both) it can serve to synchronize two distributed master databases--each serving as a primary write point in the architecture.

That seems unique.

rdtsc · on Nov 7, 2013

> Replication has been around for a long time.

Well yeah data copying has been around as long as there is data. Peer-to-peer replication in the database world, with consistently and openly defined protocol has not.

http://dataprotocols.org/couchdb-replication/

I haven't seen too many databases that do this well. And well enough to have other database product interface with them (PouchDB for example).

> They often present at conferences and write blogs on these topics so if you really want to UNDERSTAND replication, google for blogs and conference presentations connected to Riak.

I don't remember having too many issues. What exact issues did you encounter with CouchDB's replication? You emphasize UNDERSTAND, do you have any indication CouchDB developers don't UNDERSTAND replication?

mlmilleratmit · on Nov 7, 2013

Sure, nobody claims that CouchDB is the first to have replication. MVCC was invented decades back, and hash trees have been around forever. I believe those two combined are compelling, and if you follow, e.g. risk, you'll see they're heading down the same path.

patrickmay · on Nov 7, 2013

> Replication has been around for a long time. Couchdb was not anywhere near the first

Indeed. I occasionally work with GigaSpaces -- they had replication at least five years before CouchDB existed. And they had competitors before then, too.

tony_landis · on Nov 7, 2013

I think it was Rich Hickey that said the document database is the worst of them all, because you are now married to that structure.

Having used Couchdb in production for two years, I have to agree with his analysis, and offer my own opinion that Couchdb is highly overrated. Not because it is not a good implementation of a document style database, but because the document store itself is not a good match for most use cases.

If the only requirement is a replicated JSON document store, it may work OK for you. But so would Riak, Postgres and some others.

If you need to update the data in those documents or ever need to query the data in ways you did not initially envision, you will quickly find yourself missing features which even traditional SQL databases are very good at. Development is slower.

Writing map/reduce for queries seems particularly cumbersome, particularly if you prefer not to use Javascript. And you have to plug them into a textarea in a webpage interface, or manually put them into Couchdb over http using curl or some library that abstracts this away. Either way it is a degree of separation that makes the data feel more out of reach than through a console interface like psql or mysql.

Consider the scenario where you want to update the value in an attribute on several thousand, or even just several documents that match some criteria. In SQL, you would simply jump in the console and in a few seconds or minutes complete that as a transaction with something like:

> update table set col=val where criteria.

There is no such feature in Couchdb. You will need to write code to filter and fetch each matching document, manipulate it as needed, then write the entire thing back. All to update a few bits that hopefully were not nested too deep as that really increases the complexity of the code you will need to write.

As memracom stated, the replication is not perfect. My experience even on a low latency network is the only safe way to ensure a client can immediately read back what they just wrote is to pass them through the likes of haproxy and use a sticky session. Otherwise you have a good probability of getting a 404 after a POST (create) or stale data after a PUT (update).

So for what it is worth, here is my advice on choosing a database from an ease of development standpoint:

1) has as many features as you can, even if you don't need them initially 2) has top notch libraries for your language / framework 3) has relation awareness - do not denormalize unless you must 4) supports consistency 5) supports in place updates - easily filter and change values (doesn't apply to Datomic) 6) has tools to make schema changes / reshaping data is easy, and can be done online

Maybe 2 years ago Couchdb was a great solution. But with memory and ssd storage being so cheap and so much innovation with traditional and NoSQL DBs, I don't foresee myself deploying Couchdb again. If I did need a place to dump some semi-structured data, I find Amazon's hosted offerings more attractive.

daleharvey · on Nov 7, 2013

I mostly agree, CouchDB's data and query model only works well for a subset of use cases and anything past that subset (the line at which is vague and only really understandable after being burnt) makes life hard. I do think that the query model and general capabilities can (and will) be vastly improved, but its already taken too long.

However I dont agree that many (or any) other things are suitable alternatives for replicatable json stores, in this case where replication means peer to peer stores that can operate offline for any period of time.

My particular interest these days is building web applications that work well offline, its why I build PouchDB, while its entirely possible to build a home made sync solution on top of your favourite database, its an extremely hard problem and something I see app after app try and fail constantly.

If I didnt need the ability sync data that would work offline, I wouldnt use CouchDB (pouchdb/cloudant etc), but since that is what I am interested in, right now I think its pretty much the only choice.

tony_landis · on Nov 7, 2013

I think the kind of thing you are doing is the sweet spot for couch. Perhaps over time it will be less so, but I now I think it is pretty unique.

mlmilleratmit · on Nov 7, 2013

I agree with some but not all of what you write.

0) CouchDB will never lose your data. Period. Not many other stores are 'append only, copy on write'. If you're data is transient, you may not care about that, but many apps expect the DB to never lose or corrupt data. Take it down with 'kill -9'? no problem, it's guaranteed to be consistent on disk.

1) I think document DB's are as good or better than a key value store like riak. It's great to have the choice, at a later point in time, to reach inside your documents, build indexes, etc.

2) The biggest wart with couchdb from a scaling point is the single server, master-slave, and master-master. There is no dynamo style clustering, ala cassandra, risk, etc. We added that in our own stack in '09 and it's finally hit the Apache CouchDB repo in a refined state, you'll see it in Apache CouchDB 2.0

3) Finally, the biggest wart from a usability standpoint is the need to build materialized views. Ad hoc queries are painful. In Apache CouchDB most folks use Elastic Search in conjunction. In Cloudant we embedded lucene into each cluster node so you can do the the obvious things: 'GET http://...?q=name:"Mik*" AND age:[25 TO 34] & sort...'

tony_landis · on Nov 7, 2013

Good points. Replication is certainly not painless to setup, and I've had trouble with continuous mode simply failing. I'm sure those warts will be worked out though.

Lucene and Elastic Search go a long way, it is just one more service to configure and maintain. Thats been an annoyance for me. If Lucene could be built right into couchdb that would be a major improvement.

However, that still doesn't let me cherry pick the values I want from a document. When the app is in a dynamic language, the cost of deserialization can add up.

Building some kind of xpath expressions to pull out specific parts of the doc would free up developers from spending as much time writing views, and would likely be much more performant to have that operation take place server-side. Maybe that should be an Elastic Search feature though and not Couch.

js4all · on Nov 7, 2013

When criticising CouchDB, don't forget about its killer features:

Replication: slave, master, multi-master, pull, push, single, continuous over http(s), you name it.

Update handlers: You don't have to fetch, modify and save in every case.

MVCC semantics: Lock-free write access. Never, ever database dead-locks.

tony_landis · on Nov 7, 2013

Here is the reference for update handlers for anyone wanting to check it out:

http://docs.couchdb.org/en/latest/ddocs.html#update-function...

It still requires writing code, and moving it into the database.

Once that is done, how do you call that function against an arbitrary list of documents and pass the new values to it without writing even more code somewhere?

This workflow of putting code/logic in the db is that it is forcing developers out of their preferred development environment, workflow, and most likely language.

Not to mention the fact that debugging all these couchdb functions and map/reduce calls becomes a nightmare. And testing - not sure how that could be done efficiently.

All of this this slows development.

It is possible to implement some web apps completely in static html, js, and couchdb, eliminating the need for anything server side. In those cases, couchdb is one of a kind.

gaius · on Nov 7, 2013

Back in the 90s, Sybase Replication Server was pretty sweet, and we ran topologies you wouldn't believe...

mwnz · on Nov 7, 2013

When I first read this I thought it said "Why Republican is Awesome". Now that would have been an entertaining read (not that the actual article isn't entertaining - I'm sure it is).