Top "Mapreduce" questions

MapReduce is an algorithm for processing huge datasets on certain kinds of distributable problems using a large number of nodes

MongoDB aggregation comparison: group(), $group and MapReduce

I am somewhat confused about when to use group(), aggregate with $group or mapreduce. I read the documentation at http://…

mongodb mapreduce mongodb-query aggregation-framework
MySQL Cluster vs. Hadoop for handling big data

I want to know the advantages/disadvantages of using a MySQL Cluster and using the Hadoop framework. What is the …

hadoop mapreduce hive bigdata mysql-cluster
Standard practices for logging in MapReduce jobs

I'm trying to find the best approach for logging in MapReduce jobs. I'm using slf4j with log4j appender …

java hadoop mapreduce hadoop2 mapr
what is the basic difference between jobconf and job?

hi i wanted to know the basic difference between jobconf and job objects,currently i am submitting my job like …

hadoop mapreduce bigdata
Best way to do one-to-many "JOIN" in CouchDB

I am looking for a CouchDB equivalent to "SQL joins". In my example there are CouchDB documents that are list …

couchdb mapreduce
what are the options for hadoop on scala

We are starting a big-data based analytic project and we are considering to adopt scala (typesafe stack). I would like …

scala hadoop mapreduce jvm-languages
MapReduce Output ArrayWritable

I'm trying to get an output from an ArrayWritable in a simple MapReduce-Task. I found a few questions with a …

java apache hadoop mapreduce writable
Hadoop MapReduce provide nested directories as job input

I'm working on a job that processes a nested directory structure, containing files on multiple levels: one/ ├── three/ │   └── four/ │   ├── baz.…

hadoop nested mapreduce directory-walk
Querying embedded objects in Mongoid/rails 3 ("Lower than", Min operators and sorting)

I am using rails 3 with mongoid. I have a collection of Stocks with an embedded collection of Prices : class Stock …

ruby-on-rails mongodb mapreduce mongoid
How can I allot more memory to Python program? Its not consuming more than 64MB on 4GB RAM

I have a Python program running on some input data on 4GB RAM 32-bit 12.04 Ubuntu. The time and space complexity …

python ubuntu memory-management mapreduce mrjob