How can I get Zeppelin to restart cleanly on an EMR cluster?

Andy Jobe picture Andy Jobe · Feb 3, 2017 · Viewed 11.7k times · Source

I am running an EMR cluster and trying to use a Zeppelin notebook for data analysis.

Versions: Release label:emr-5.2.1
Hadoop distribution: Amazon 2.7.3
Hive 2.1.0
Spark 2.0.2
Zeppelin 0.6.2

I am consistently having problems with Zeppelin hanging up while running a query and I can never recover it. I have tried:
- restarting the interpreter
- SSH'ing into the master node and running zeppelin_daemon.sh restart (have tried running as hadoop / root / zeppelin, and also running the script with options reload, start/stop, upstart)

Every time I use the daemon shell script, it tells me that it has stopped/started OK, but then I get this when I run a status:
Zeppelin running but process is dead [FAILED]

The only thing that I can seem to do is kill my cluster and stand up a new one, which is crazy on many levels.

So...what is the officially supported way to restart Zeppelin???

Answer

soaptree picture soaptree · Oct 17, 2017

I ssh'ed into the master node, then ran

$ sudo stop zeppelin
zeppelin stop/waiting

$ sudo start zeppelin
zeppelin start/running, process 24434

Immediately after, refreshing Zeppelin gave me a 503, but after ~2 minutes or so, the UI was available again. I checked the logs in /var/log/zeppelin and it takes some time to configure settings.