Hey could you provide some resources for learning Distributed Computing using Python. I was thinking of using Go but since I know python to a considerable extent, I wanna give Python a fighting chance for making distributed systems.
I'm not one for reading books I think there's more value in just building stuff:
Check out Prediction.IO - http://prediction.io/ it's an out of the box machine learning server, it uses hbase, but hbase in this configuration is sitting on top of the local filesystem, not HDFS -- so the first task I assigned myself was to learn how to setup hadoop, and then configure HBase to store data in HDFS.
Once I have that, I'm going to use PySpark to use spark to query the data. (Thats where the python comes in)
Another cool project (not related to the tech stack above) is Pyro https://pypi.python.org/pypi/Pyro4 ... this consumed a whole weekend of mine a few weeks back, building a small distributed search engine for querying compressed DNA sequences ... think a parallel boyer-moore. As a learning exercise try doing that, it's great fun!
Techs: HBase, MLLib and Spark (Most of my time)
Spatial SDR Reconstruction with NuPIC (1 day a week)
Spanish using Duolingo (20 mins a day)
Game development with ImpactJS (only 1 day a week)