Top "MapReduce" questions

MapReduce is a programming model for processing huge datasets on certain kinds of distributable problems using a large number of nodes.

What is the use of a grouping comparator in Hadoop MapReduce?

I would like to know why a grouping comparator is used in the secondary sort of MapReduce. According to the definitive guide …

hadoop mapreduce hadoop-partitioning
MapReduce job hangs, waiting for AM container to be allocated

I tried to run a simple word count as a MapReduce job. Everything works fine when run locally (all work done on …

hadoop mapreduce yarn
setup and cleanup methods of Mapper/Reducer in Hadoop MapReduce

Are the setup and cleanup methods called in each mapper and reducer task, respectively? Or are they called only once at …

hadoop mapreduce
Data not getting loaded into Partitioned Table in Hive

I am trying to create a partition for my table in order to update a value. This is my sample data 1,Anne,…

hadoop mapreduce hive partition
Type mismatch in key from map: expected org.apache.hadoop.io.Text, recieved org.apache.hadoop.io.LongWritable

I am trying to run a map/reducer in Java. Below are my files: WordCount.java package counter; public class …

java hadoop mapreduce
The import org.apache.hadoop.mapreduce cannot be resolved

I am trying to execute the code below: package test; import java.io.IOException; import java.util.*; import org.apache.…

hadoop mapreduce hive hadoop-streaming hadoop-plugins
Mapper input Key-Value pair in Hadoop

Normally, we write the mapper in the form: public static class Map extends Mapper<LongWritable, Text, Text, IntWritable> …

hadoop mapreduce key-value
Hadoop one Map and multiple Reduce

We have a large dataset to analyze with multiple reduce functions. All reduce algorithms work on the same dataset generated …

hadoop mapreduce
How to kill a mapred job started by hive?

I'm working with CDH 5.1 now. It starts normal Hadoop jobs via YARN, but Hive still works with mapred. Sometimes a …

hadoop mapreduce hive yarn cloudera-cdh
Check if every element in array matches condition

I have a collection of documents: date: Date users: [ { user: 1, group: 1 } { user: 5, group: 2 } ] date: Date users: [ { user: 1, group: 1 } { user: 3, group: 2 } ] …

mongodb mapreduce mongodb-query aggregation-framework