How can I transform my resulting dask.DataFrame into pandas.DataFrame (let's say I am done with heavy lifting, and just want to apply sklearn to my aggregate result)?
You can call the .compute() method to transform a dask.dataframe to a pandas dataframe:
df = df.compute()