Apache Parquet is a columnar storage format for Hadoop.
I need to check the classpath of the Hive service to see the location of the jars it loads while …
hadoop hive hortonworks-data-platform parquetI use dynamic frames to write a parquet file in S3 but if a file already exists my program append …
amazon-web-services parquet aws-glueI have a table that has partitions and I use avro files or text files to create and insert into …
text-files avro parquet impalaI have parquet data partitioned by date & hour, folder structure: events_v3 -- event_date=2015-01-01 -- event_…
apache-spark amazon-s3 apache-spark-sql partitioning parquetI have data saved as parquet files in Azure blob storage. Data is partitioned by year, month, day and hour …
azure hive parquet azure-hdinsightI'm receiving a set of (1 Mb) CSV/JSON files on S3 that I would like to convert to Parquet. I …
java scala amazon-web-services parquetI downloaded and built parquet-1.5.0 of https://github.com/apache/parquet-mr. I now want to run some commands on my …
maven hdfs parquet parquet-mrI'm trying to save dataframe in table hive. In spark 1.6 it's work but after migration to 2.2.0 it doesn't work anymore. …
apache-spark hive parquet apache-spark-2.0