Top "Yarn" questions

YARN (Yet Another Resource Negotiator) is a key component of second generation Apache Hadoop infrastructure.

Spark num-executors

I have setup a 10 node HDP platform on AWS. Below is my configuration 2 Servers - Name Node and Standby Name …

apache-spark yarn hortonworks-data-platform
Why does Spark job fail with "Exit code: 52"

I have had Spark job failing with a trace like this one: ./containers/application_1455622885057_0016/container_1455622885057_0016_01_000001/stderr-Container id: container_1455622885057_0016_01_000008 ./containers/application_1455622885057_0016/…

apache-spark yarn spark-dataframe
Why does my yarn application not have logs even with logging enabled?

I have enabled logs in the xml file: yarn-site.xml, and I restarted yarn by doing: sudo service hadoop-yarn-resourcemanager restart …

hadoop apache-spark logging yarn
How to deal with tasks running too long (comparing to others in job) in yarn-client?

We use a Spark cluster as yarn-client to calculate several business, but sometimes we have a task run too long …

apache-spark yarn parquet
How to change yarn scheduler configuration on aws EMR?

Unlike HortonWorks or Cloudera, AWS EMR does not seem to give any GUI to change xml configurations of various hadoop …

amazon-web-services yarn
Hadoop Nodemanager and Resourcemanager not starting

I am trying to setup the latest Hadoop 2.2 single node cluster on Ubuntu 13.10 64 bit. the OS is a fresh installation, …

hadoop bigdata yarn ubuntu-13.10
Hadoop namenode : Single point of failure

The Namenode in the Hadoop architecture is a single point of failure. How do people who have large Hadoop clusters …

hadoop mapreduce hdfs yarn hadoop2
Spark : multiple spark-submit in parallel

I have a generic question about Apache Spark : We have some spark streaming scripts that consume Kafka messages. Problem : they …

hadoop apache-spark cloudera yarn
Apache Hadoop Yarn vs. Kubernetes

Since versions 2.6 (Apache Hadoop) Yarn handles docker containers. Basically it distributes the requested amount of containers on a Hadoop cluster, …

docker kubernetes yarn
EMR Spark - TransportClient: Failed to send RPC

I'm getting this error, I tried to increase memory on cluster instances and in the executor and driver parameters without …

apache-spark yarn emr