The two I reach for first: Dask and SparkSQL Dask is super easy and quick to lea...

heinrichhartman · on Jan 31, 2020

Thanks for this. I did not know about Dask! wow this looks great. Love the web-based task visualizations: https://distributed.dask.org/en/latest/web.html

faizshah · on Feb 1, 2020

Check out the Dask Bag it’s my favorite feature, it helps you deal with non tabular data that also might not be structured consistently: https://examples.dask.org/bag.html

Everybody I show it to likes it even more than working with data frames once they grok it.