Top "Hadoop" questions

Hadoop is an Apache open-source project that provides software for reliable and scalable distributed computing.

What is Hive: Return Code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask

I am getting: FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask While trying to make …

hadoop mapreduce hive
How do I output the results of a HiveQL query to CSV?

We would like to write the results of a Hive query to a CSV file. I thought the command should …

database hadoop hive hiveql
Name node is in safe mode. Not able to leave

root# bin/hadoop fs -mkdir t mkdir: org.apache.hadoop.hdfs.server.namenode.SafeModeException: Cannot create directory /user/root/t. …

hadoop hdfs
Failed to locate the winutils binary in the hadoop binary path

I am getting the following error while starting the namenode for the latest hadoop-2.2 release. I didn't find winutils exe file in …

hadoop
How to Access Hive via Python?

https://cwiki.apache.org/confluence/display/Hive/HiveClient#HiveClient-Python appears to be outdated. When I add this to /etc/profile: …

python hadoop hive
Difference between Hive internal tables and external tables?

Can anyone tell me the difference between Hive's external tables and internal tables? I know the difference comes when dropping …

hadoop hive hiveql
Hive: how to show all partitions of a table?

I have a table with 1000+ partitions. The "Show partitions" command only lists a small number of partitions. How can I show …

hadoop hive
The way to check an HDFS directory's size?

I know du -sh in common Linux filesystems. But how do I do that with HDFS?

hadoop command-line directory hdfs
Setting the number of map tasks and reduce tasks

I am currently running a job. I fixed the number of map tasks to 20 but am getting a higher number. …

hadoop mapreduce
What is the difference between partitioning and bucketing a table in Hive?

I know both are performed on a column in the table, but how is each operation different?

hadoop hive