Interestingly the chart at the end suggests implementations for everything excep...

markelliot on Sept 25, 2012 | parent | context | favorite | on: Runaway complexity in Big Data... and a plan to st...

Interestingly the chart at the end suggests implementations for everything except the raw data, but generating batch views from an arbitrary store and keeping your raw data reliably are hard problems. The charts motivate dumping this stuff in Hadoop (or some distributed file system), but any reliable store would do. (@nathanmarz: would love recommendations)

nathanmarz on Sept 26, 2012 [–]

A distributed filesystem, such as HDFS or MapR, is ideal for the master dataset.