Experiments with dask.bag
This post / notebook is a lightning talk presented at Python Users Berlin (13.10.2016)
dask(link) is a flexible dynamic computing library for analytic computingdask.bagprovides extremely useful distributed data analysis abstractions over files on disk