Top "Emr" questions

Amazon Elastic MapReduce (Amazon EMR) is a web service that enables businesses, researchers, data analysts, and developers to easily and cost-effectively process vast amounts of data.

How to install a GUI on Amazon AWS EC2 or EMR with the Amazon AMI

I need to run an application that requires a GUI to start and configure. I also need …

amazon-ec2 emr amazon-emr xfce
How to clean up the list of Terminated AWS EMR clusters?

I have about 88 EMR clusters that are terminated in my AWS account. How do I clean up the list of …

amazon-web-services emr
Why does YARN on EMR not allocate all nodes to running Spark jobs?

I'm running a job on Apache Spark on Amazon Elastic MapReduce (EMR). Currently I'm running on emr-4.1.0, which includes …

apache-spark yarn emr amazon-emr elastic-map-reduce
IllegalArgumentException The bucketName parameter must be specified. com.amazonaws.services.s3.AmazonS3Client.rejectNull

Running a Clojure jar on an AWS EMR cluster using (hfs-textline) and getting: IllegalArgumentException The bucketName parameter must be specified. com.amazonaws.…

hadoop clojure emr cascalog
File already exists error writing new files from dataframe

On EMR Spark, I am writing an RDD[String] to S3 via a DataFrame: rddString.toDF().coalesce(16).write.option("compression", "gzip").mode(…

apache-spark emr
How do you delete an AWS EMR Cluster?

I've been playing around with AWS EMR and I now have a few clusters that are terminated and that I …

amazon-web-services emr amazon-emr
java.lang.NoClassDefFoundError: org/apache/spark/sql/DataFrame

I am running my Spark code to save data to HBase on Amazon EMR 5.8.0, which has Spark 2.2.0 installed. Running in …

scala apache-spark hbase emr phoenix
How to suppress INFO messages for spark-sql running on EMR?

I'm running Spark on EMR as described in Run Spark and Spark SQL on Amazon Elastic MapReduce: This tutorial walks …

log4j apache-spark emr
Terminating a Spark step in AWS

I want to set up a series of Spark steps on an EMR Spark cluster, and terminate the current step …

hadoop amazon-web-services apache-spark emr
Force Server Side Encryption for S3 Bucket

I want to set an S3 bucket policy so that all requests to upload to that bucket will use server …

encryption amazon-web-services amazon-s3 emr