Top "Dask" questions

Dask is a parallel computing and data analytics library for Python.

Make Pandas DataFrame apply() use all cores?

As of August 2017, Pandas DataFrame.apply() is unfortunately still limited to working with a single core, meaning that a multi-core …

pandas dask
Convert Pandas dataframe to Dask dataframe

Suppose I have a pandas DataFrame: df = pd.DataFrame({'a': [1, 2, 3], 'b': [4, 5, 6]}). When I convert it into a Dask DataFrame, what should …

python pandas dataframe data-conversion dask
In what situations can I use Dask instead of Apache Spark?

I am currently using Pandas and Spark for data analysis. I found that Dask provides parallelized NumPy arrays and Pandas DataFrames. …

python pandas apache-spark dask
A comparison between fastparquet and pyarrow?

After some searching I failed to find a thorough comparison of fastparquet and pyarrow. I found this blog post (a …

python parquet dask pyarrow fastparquet
python dask DataFrame, support for (trivially parallelizable) row apply?

I recently found the dask module, which aims to be an easy-to-use Python parallel processing module. A big selling point for me …

python pandas parallel-processing dask
Can dask parallelize reading from a csv file?

I'm converting a large text file to HDF storage in hopes of faster data access. The conversion works all right, …

python csv pandas dask
dask dataframe: how to convert a column to datetime with to_datetime

I am trying to convert one column of my dataframe to datetime. Following the discussion here https://github.com/dask/…

python pandas dask
simple dask map_partitions example

I read the following SO thread and am now trying to understand it. Here is my example: import dask.dataframe …

python parallel-processing dask
How to transform Dask.DataFrame to pd.DataFrame?

How can I transform my resulting dask.DataFrame into a pandas.DataFrame (let's say I am done with the heavy lifting, and …

python pandas dask
Default pip installation of Dask gives "ImportError: No module named toolz"

I installed Dask using pip like this: pip install dask and when I try to do import dask.dataframe as …

python installation pip importerror dask