Top "Mapreduce" questions

MapReduce is an algorithm for processing huge datasets on certain kinds of distributable problems using a large number of nodes

hadoop.mapred vs hadoop.mapreduce?

Why are there two separate packages map-reduce package in Apache's hadoop package tree: org.apache.hadoop.mapred http://javasourcecode.org/…

apache hadoop mapreduce
Difference between Application Manager and Application Master in YARN?

I understood how MRv1 works.Now I am trying to understand MRv2.. what's the difference between Application Manager and Application …

hadoop mapreduce yarn
Container exited with a non-zero exit code 1 error during mapreduce task

On executing jar in hadoop, I get the following error: 16/11/04 18:32:59 INFO mapreduce.Job: Task Id : attempt_1478261728730_0005_m_000000_2, Status : FAILED Exception …

hadoop mapreduce hipi
java.lang.IllegalArgumentException: Wrong FS: , expected: hdfs://localhost:9000

I am trying to implement reduce side join , and using mapfile reader to look up distributed cache but it is …

java hadoop mapreduce distributed-cache
Computing median in map reduce

Can someone example the computation of median/quantiles in map reduce? My understanding of Datafu's median is that the 'n' …

hadoop statistics mapreduce apache-pig median
hadoop: difference between 0 reducer and identity reducer?

I am just trying to confirm my understanding of difference between 0 reducer and identity reducer. 0 reducer means reduce step will …

hadoop mapreduce
describe extended table in Hive

I am storing the Table as a SequenceFile format and I am setting the below commands to enable Sequence with …

hadoop mapreduce hive lzo
Using MongoDB's map/reduce to "group by" two fields

I need something slightly more complex than the examples in the MongoDB docs and I can't seem to be able …

mongodb mapreduce
Renaming Part Files in Hadoop Map Reduce

I have tried to use the MultipleOutputs class as per the example in page http://hadoop.apache.org/docs/mapreduce/…

java hadoop mapreduce
Hadoop: Error in configuring object

I'm trying to run the Terasort benchmarks and i'm getting the following exception: java.lang.RuntimeException: Error in configuring object …

java exception hadoop mapreduce cloudera