Popular "mapreduce" questions | Page 3

One of the main examples that is used in demonstrating the power of MapReduce is the Terasort benchmark. I'm having …

algorithm sorting parallel-processing hadoop mapreduce

I have a bunch of small files in an HDFS directory. Although the volume of the files is relatively small, …

java hadoop mapreduce distributed-computing

What is a container in YARN? Is it same as the child JVM in which the tasks on the nodemanager …

hadoop mapreduce yarn

Are there any dependencies between Spark and Hadoop? If not, are there any features I'll miss when I run Spark …

hadoop amazon-s3 apache-spark mapreduce mesos

This is a conceptual question involving Hadoop/HDFS. Lets say you have a file containing 1 billion lines. And for the …

hadoop mapreduce hdfs

So, I've been looking at Hadoop with keen interest, and to be honest I'm fascinated, things don't get much cooler. …

c# .net hadoop mapreduce

I've been trying to use MapReduce in MongoDB to do what I think is a simple procedure. I don't know …

mongodb mapreduce

Suppose I have a collection with some set of documents. something like this. { "_id" : ObjectId("4f127fa55e7242718200002d"), "id":1, "…

mongodb mapreduce duplicates aggregation-framework

For a Big Data project, I'm planning to use spark, which has some nice features like in-memory-computations for repeated workloads. …

java scala mapreduce gzip apache-spark

My program looks like public class TopKRecord extends Configured implements Tool { public static class MapClass extends Mapper<Text, Text, …

java hadoop mapreduce

Top "Mapreduce" questions