Top "Mapreduce" questions

MapReduce is an algorithm for processing huge datasets on certain kinds of distributable problems using a large number of nodes

Hadoop job fails, Resource Manager doesnt recognize AttemptID

Im trying to aggregate some data in an Oozie workflow. However the aggregation step fails. I found two points of …

hadoop mapreduce oozie
RavenDB Map-Reduce Example using .NET Client

I'm looking for an example of how to implement and use Map-Reduce within the RavenDB .NET Client. I'd like to …

.net mapreduce ravendb
Couchdb join two documents using key

I have two documents one with tree structure and the other one relation to the first doc. Im trying to …

javascript join mapreduce couchdb relational-algebra
Hadoop map-reduce operation is failing on writing output

I am finally able to start a map-reduce job on Hadoop (running on a single debian machine). However, the map …

java hadoop mapreduce debian file-access
Parsing PDF files in Hadoop Map Reduce

I have to parse PDF files , that are in HDFS in a Map Reduce Program in Hadoop. So i get …

pdf hadoop mapreduce pdf-parsing
saving json data in hdfs in hadoop

I have the following Reducer class public static class TokenCounterReducer extends Reducer<Text, Text, Text, Text> { public void …

java json hadoop mapreduce reducers
How can we automate incremental import in SQOOP?

How can we automate the incremental import in SQoop ? In incremental import, we need to give the --last-value to start …

hadoop mapreduce sqoop
How big data is "Bigdata"?

How much amount of data does qualify to be categorised as Bigdata? With what size of data can one decide …

hadoop mapreduce bigdata
Analytics and Mining of data sitting on Cassandra

We have a lot of user interaction data from various websites stored in Cassandra such as cookies, page-visits, ads-viewed, ads-clicked, …

hadoop mapreduce cassandra analytics
How does mapreduce sort and shuffle work?

I am using yelps MRJob library for achieving map-reduce functionality. I know that map reduce has an internal sort and …

hadoop mapreduce mrjob