Dask doesn't solve that problem since it's a wrapper around pandas functions. If...

kiriliponi · on Feb 7, 2018

dask.dataframe might not help but dask.distributed could in that case.

I've had success using it on non-vanilla code (i.e. code that could not be converted to work natively with numpy/pandas structures).

As a bonus, the nice profiling tools built into dask have also helped me improve the performance of the code.
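
To illustrate the kind of thing meant here, a minimal sketch of running ordinary Python functions through dask.distributed. The `parse_record` function and the `run` helper are hypothetical stand-ins for code that has no numpy/pandas equivalent, and the in-process client is just an assumption to keep the example local:

```python
from dask.distributed import Client


def parse_record(line):
    """Hypothetical plain-Python work that does not map onto numpy/pandas."""
    key, _, value = line.partition("=")
    return key.strip(), int(value)


def run(lines):
    # processes=False keeps the scheduler and workers inside this process,
    # which is enough for a local experiment. dashboard_address=None turns
    # the diagnostics page off for this sketch; leave it at its default to
    # get the profiling views.
    with Client(processes=False, dashboard_address=None) as client:
        futures = client.map(parse_record, lines)  # one task per input line
        return client.gather(futures)              # block until all results arrive


if __name__ == "__main__":
    print(run(["a = 1", "b = 2", "c = 3"]))
```

On a real cluster you would pass the scheduler address to `Client` instead of starting one in-process.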

pletnes · on Feb 7, 2018

There’s dask.array which works on numpy arrays instead of dataframes. Otherwise, your argument holds.
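
A minimal sketch of what that looks like, assuming an array small enough to build in memory for the demo (the sizes and chunking are arbitrary choices, not recommendations):

```python
import numpy as np
import dask.array as da

# Ordinary numpy array used as the source data for the demo.
x = np.arange(1_000_000, dtype="float64")

# Split it into 10 chunks of 100,000 elements; each chunk is a numpy array.
dx = da.from_array(x, chunks=100_000)

# Builds a lazy task graph with numpy-style syntax; nothing runs yet.
lazy_mean = (dx ** 2).mean()

# compute() executes the graph chunk by chunk and returns a plain number.
value = lazy_mean.compute()
```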