Top "Pyarrow" questions

pyarrow is a Python interface for Apache Arrow.

error: command 'cmake' failed: No such file or directory

I am getting an error while installing vaex in PyCharm with Python 3.8. I installed the following before running this on my 64-bit Windows 10 machine: …

python cmake pyarrow python-3.8 vaex
Read Parquet file stored in S3 with AWS Lambda (Python 3)

I am trying to load, process and write Parquet files in S3 with AWS Lambda. My testing / deployment process is: …

python amazon-s3 aws-lambda parquet pyarrow
Assign schema to pa.Table.from_pandas()

I'm getting this error when converting a pandas DataFrame to Parquet using PyArrow: ArrowInvalid('Error converting from Python objects to …

python pandas parquet pyarrow
Using predicates to filter rows from pyarrow.parquet.ParquetDataset

I have a Parquet dataset stored on S3, and I would like to query specific rows from the dataset. I …

python pandas amazon-s3 parquet pyarrow
Unable to read a parquet file

I am racking my brain over this right now. I am new to Parquet files, and I am running …

python pandas parquet pyarrow fastparquet
Read Parquet files from HDFS using PyArrow

I know I can connect to an HDFS cluster via pyarrow using pyarrow.hdfs.connect(). I also know I can …

hdfs parquet pyarrow
Reading specific partitions from a partitioned parquet dataset with pyarrow

I have a somewhat large (~20 GB) partitioned dataset in parquet format. I would like to read specific partitions from the …

python parquet pyarrow apache-arrow
Failed building wheel for pyarrow

I am trying to install Superset with pip install apache-superset and am getting the error below: Traceback (most recent call last): File "…

cmake pyarrow apache-superset
pyarrow error: toPandas attempted Arrow optimization

When I set pyarrow to true using the Spark session and then run toPandas(), it throws the error: "toPandas …

pyspark pyarrow
Reading Parquet File with Array<Map<String,String>> Column

I'm using Dask to read a Parquet file that was generated by PySpark, and one of the columns is a …

python dask python-3.7 pyarrow fastparquet