Popular "parquet" questions | Page 9

I tried to concat() two parquet file with pandas in python . It can work , but when I try to write …

python pandas parquet

I'm processing events using Dataframes converted from a stream of JSON events which eventually gets written out as as Parquet …

apache-spark apache-spark-sql spark-streaming spark-dataframe parquet

I am trying to load, process and write Parquet files in S3 with AWS Lambda. My testing / deployment process is: …

python amazon-s3 aws-lambda parquet pyarrow

Hello I need to read the data from gz.parquet files but dont know how to?? Tried with impala but …

apache-spark hive apache-kafka parquet flume-twitter

Im getting this error when transforming a pandas.DF to parquet using pyArrow: ArrowInvalid('Error converting from Python objects to …

python pandas parquet pyarrow

Is there any performance benefit resulting from the usage of using nested data types in the Parquet file format? AFAIK …

apache-spark nested parquet data-files

I have a parquet dataset stored on s3, and I would like to query specific rows from the dataset. I …

python pandas amazon-s3 parquet pyarrow

I have a hive table like this: CREATE TABLE `abtestmsg_orc`( `eventname` string COMMENT 'AB测试方案上报事件：ABTest', `eventtime` string COMMENT '事件上报时间…

hive alter-table parquet orc

I am breaking my head over this right now. I am new to this parquet files, and I am running …

python pandas parquet pyarrow fastparquet

I know I can connect to an HDFS cluster via pyarrow using pyarrow.hdfs.connect() I also know I can …

hdfs parquet pyarrow

Top "Parquet" questions