Apache Parquet is a columnar storage format for Hadoop.
I am trying to test how to write data in HDFS 2.7 using Spark 2.1. My data is a simple sequence of …
[scala] [apache-spark] [apache-spark-sql] [parquet]

I'm using the following code to create ParquetWriter and to write records to it. ParquetWriter<GenericRecord> parquetWriter = new …
[java] [hadoop] [parquet]

Is it possible to save a pandas data frame directly to a parquet file? If not, what would be the …
[python-3.x] [hdfs] [parquet]

I have multiple small parquet files generated as the output of a Hive QL job; I would like to merge the output …
[hdfs] [parquet]

I am trying to use Spark SQL to write a parquet file. By default Spark SQL supports gzip, but it also …
[apache-spark] [gzip] [parquet] [snappy] [lzo]

I'm having trouble finding a library that allows Parquet files to be written using Python. Bonus points if I can …
[python] [apache-spark] [apache-spark-sql] [parquet] [snappy]

The parquet docs from Cloudera show examples of integration with Pig/Hive/Impala, but in many cases I want to …
[java] [parquet]

I have a quite hefty parquet file where I need to change values for one of the columns. One way …
[apache-spark] [parquet]

So I have just 1 parquet file I'm reading with Spark (using the SQL stuff) and I'd like it to be …
[scala] [apache-spark] [parquet]