Top "Hive" questions

Hive is a data warehouse system built on top of Hadoop that facilitates easy data summarization, ad-hoc queries, and the analysis of large datasets stored in Hadoop-compatible distributed file systems.

How to load data into Hive from HDFS without removing the source file?

When loading data from HDFS to Hive using the LOAD DATA INPATH 'hdfs_file' INTO TABLE tablename; command, it looks like …

hadoop hive
Hive load CSV with commas in quoted fields

I am trying to load a CSV file into a Hive table like so: CREATE TABLE mytable ( num1 INT, text1 …

hadoop hbase hive hdfs delimiter
Hive ParseException - cannot recognize input near 'end' 'string'

I am getting the following error when trying to create a Hive table from an existing DynamoDB table: NoViableAltException(88@[]) at …

hadoop mapreduce hive bigdata amazon-dynamodb
Hive Insert Overwrite Table

I'm new to Hive and I wanted to know if insert overwrite will overwrite an existing table I have created. …

sql hive insert overwrite
How to convert Unix epoch time to a date string in Hive

I have a log file which contains a timestamp column. The timestamp is in Unix epoch time format. I want to …

function hive database-partitioning
Is there any way to get the column names along with the output when executing a query in Hive?

In Hive, when we do a query (like: select * from employee), we do not get any column names in the …

hadoop hive rdbms
Query Hive table in PySpark

I am using CDH 5.5. I have a table created in the Hive default database and am able to query it from the …

hive pyspark
Explode the Array of Struct in Hive

Below is the Hive table: CREATE EXTERNAL TABLE IF NOT EXISTS SampleTable ( USER_ID BIGINT, NEW_ITEM ARRAY<…

hadoop mapreduce hive hiveql
Row number functionality in Hive

How can I generate row numbers for an existing table while running a select query? For example: select row_number(), * …

hive hiveql
How do you make a Hive table out of JSON data?

I want to create a Hive table out of some nested JSON data and run queries on it. Is this …

json hadoop hive amazon-emr emr