Spark UI on AWS EMR

gallamine picture gallamine · Jul 16, 2015 · Viewed 19.3k times · Source

I am running a AWS EMR cluster with Spark (1.3.1) installed via the EMR console dropdown. Spark is current and processing data but I am trying to find which port has been assigned to the WebUI. I've tried port forwarding both 4040 and 8080 with no connection. I'm forwarding like so

ssh -i ~/KEY.pem -L 8080:localhost:8080 hadoop@EMR_DNS

1) How do I find out what the Spark WebUI's assigned port is? 2) How do I verify the Spark WebUI is running?

Answer

ChristopherB picture ChristopherB · Jul 18, 2015

Spark on EMR is configured for YARN, thus the Spark UI is available by the application url provided by the YARN Resource Manager (http://spark.apache.org/docs/latest/monitoring.html). So the easiest way to get to it is to setup your browser with SOCKS using a port opened by SSH then from the EMR console open Resource Manager and click the Application Master URL provided to the right of the running application. Spark History server is available at the default port 18080.

Example of socks with EMR at http://docs.aws.amazon.com/ElasticMapReduce/latest/DeveloperGuide/emr-web-interfaces.html