Top "Hadoop" questions

Hadoop is an Apache open-source project that provides software for reliable and scalable distributed computing.

How do you make a HIVE table out of JSON data?

I want to create a Hive table out of some JSON data (nested) and run queries on it? Is this …

json hadoop hive amazon-emr emr
out of Memory Error in Hadoop

I tried installing Hadoop following this http://hadoop.apache.org/common/docs/stable/single_node_setup.html document. When I …

java hadoop
Write a file in hdfs with Java

I want to create a file in HDFS and write data in that. I used this code: Configuration config = new …

java hadoop hdfs
How to find the size of a HDFS file

How to find the size of a HDFS file? What command should be used to find the size of any …

hadoop hdfs
Hive error: parseexception missing EOF

I am not sure what I am doing wrong here: hive> CREATE TABLE default.testtbl(int1 INT,string1 STRING) …

hadoop hive hiveql hcatalog
HDFS error: could only be replicated to 0 nodes, instead of 1

I've created a ubuntu single node hadoop cluster in EC2. Testing a simple file upload to hdfs works from the …

amazon-ec2 hadoop
Connecting to Hive using Beeline

I am trying to connect to hive installed in my machine through Beeline client. when I give the 'beeline' command &…

hadoop hive beeline
Is there a hdfs command to list files in HDFS directory as per timestamp

Is there a hdfs command to list files in HDFS directory as per timestamp, ascending or descending? By default, hdfs …

hadoop hdfs
Chaining multiple MapReduce jobs in Hadoop

In many real-life situations where you apply MapReduce, the final algorithms end up being several MapReduce steps. i.e. Map1 , …

hadoop mapreduce
Save Spark dataframe as dynamic partitioned table in Hive

I have a sample application working to read from csv files into a dataframe. The dataframe can be stored to …

hadoop apache-spark hive apache-spark-sql spark-dataframe