Popular "hadoop" questions | Page 11

I am new to spark, and I want to use group-by & reduce to find the following from CSV (one …

java apache-spark hadoop apache-spark-sql hdfs

Similar to SHOW TABLES command, do we have any such command to list all databases created so far?

hadoop hive hiveql

From any node in a Hadoop cluster, what is the command to identify the running namenode? identify all running datanodes? …

hadoop mapreduce

I want to debug a mapreduce script, and without going into much trouble tried to put some print statements in …

hadoop mapreduce

I have set up a multi node Hadoop Cluster. The NameNode and Secondary namenode runs on the same machine and …

ubuntu hadoop amazon-ec2 hdfs hadoop2

Are they supposed to be equal? but, why the "hadoop fs" commands show the hdfs files while the "hdfs dfs" …

hadoop hdfs

I'm trying to create a partitioned table using dynamic partitioning, but i'm facing an issue. I'm running Hive 0.12 on Hortonworks …

hadoop hive hiveql

I have 3 data nodes running, while running a job i am getting the following given below error , java.io.IOException: …

java hadoop mapreduce hive hdfs

I have a file stored in HDFS as part-m-00000.gz.parquet I've tried to run hdfs dfs -text dir/part-m-00000.…

hadoop apache-pig hdfs parquet

One of the main examples that is used in demonstrating the power of MapReduce is the Terasort benchmark. I'm having …

algorithm sorting parallel-processing hadoop mapreduce

Top "Hadoop" questions