Top "Amazon-emr" questions

Amazon Elastic MapReduce (Amazon EMR) is a web service that enables businesses, researchers, data analysts, and developers to easily and cost-effectively process vast amounts of data.

How can I get Zeppelin to restart cleanly on an EMR cluster?

I am running an EMR cluster and trying to use a Zeppelin notebook for data analysis. Versions: Release label:emr-5.2.1 …

amazon-web-services hadoop pyspark amazon-emr apache-zeppelin
How to find spark master URL on Amazon EMR

I am new to spark and trying to install spark on Amazon cluster with version 1.3.1. when i do SparkConf sparkConfig = …

apache-spark spark-streaming amazon-emr
AWS VPC identify private and public subnet

I have a VPC in AWS account and there are 5 subnets associated with that VPC. Subnets are of 2 types - …

amazon-web-services amazon-emr amazon-vpc subnet
How to install a GUI on Amazon AWS EC2 or EMR with the Amazon AMI

I have a need to run an application that requires a GUI interface to start and configure. I also need …

amazon-ec2 emr amazon-emr xfce
Why does Yarn on EMR not allocate all nodes to running Spark jobs?

I'm running a job on Apache Spark on Amazon Elastic Map Reduce (EMR). Currently I'm running on emr-4.1.0 which includes …

apache-spark yarn emr amazon-emr elastic-map-reduce
Spark on Amazon EMR: "Timeout waiting for connection from pool"

I'm running a Spark job on a small three server Amazon EMR 5 (Spark 2.0) cluster. My job runs for an hour …

apache-spark amazon-emr
how to set livy.server.session.timeout on EMR cluster boostrap?

I am creating an EMR cluster, and using jupyter notebook to run some spark tasks. My tasks die after approximately 1 …

configuration amazon-emr livy
How do you delete an AWS EMR Cluster?

I've been playing around with AWS EMR and I now have a few clusters that are terminated and that I …

amazon-web-services emr amazon-emr
AWS Glue pricing against AWS EMR

I am doing some pricing comparison between AWS Glue against AWS EMR so as to chose between EMR & Glue. …

amazon-web-services amazon-emr aws-glue
Running EMR Spark With Multiple S3 Accounts

I have an EMR Spark Job that needs to read data from S3 on one account and write to another. …

apache-spark amazon-s3 amazon-emr