Apache Zeppelin is a web-based notebook for interactive data analytics. We already described how to install Apache Spark. If you have followed our any guide on big data tools installation, it is few minutes work to install and use Zeppelin. Zeppelin can be pre-built package or can be build from source. Here is how to install apache zeppelin on ubuntu 16.04 building from source.
What Are Needed To Install Apache Zeppelin On Ubuntu 16.04
For our guide Java, Scala, Apache Spark, Maven, npm, Node.js needed. We will describe to install all of them as steps. As reader, you can skip the things already installed.
Steps To Install Apache Zeppelin On Ubuntu 16.04
As we said above, we have to install Java, Scala and Spark. First install Java :
---
1 2 3 4 5 | apt update apt upgrade sudo apt-add-repository ppa:webupd8team/java apt update apt install oracle-java8-installer |
Check whether Java is correctly installed :
1 | java -version |
Next, we will install Scala :
1 2 3 | wget http://downloads.lightbend.com/scala/2.12.0/scala-2.12.0.tgz sudo mkdir /usr/local/src/scala tar -xvf scala-2.12.0.tgz -C /usr/local/src/scala/ |
Next we need to open the profile file :
1 | nano .bashrc |
Add these :
1 2 | export SCALA_HOME=/usr/local/src/scala/scala-2.12.0 export PATH=$SCALA_HOME/bin:$PATH |
And reload by sourcing it :
1 | source .bashrc |
Next we will check whether Scala is correctly installed and install Git, sbt :
1 2 3 4 5 6 7 | scala -version apt install git wget https://bintray.com/artifact/download/sbt/debian/sbt-0.13.6.deb sudo dpkg -i sbt-0.13.6.deb ## OR apt update apt install sbt |
Next we will install Apache Spark :
1 2 3 4 5 6 | wget http://d3kbcqa49mib13.cloudfront.net/spark-2.0.2-bin-hadoop2.7.tgz tar -xvf spark-2.0.2-bin-hadoop2.7.tgz mv spark-2.0.2-bin-hadoop2.7/ spark cd conf/ cp spark-env.sh.template spark-env.sh nano spark-env.sh |
Add the following lines to spark-env.sh
:
1 2 3 4 | JAVA_HOME=/usr/lib/jvm/java-8-oracle SPARK_WORKER_MEMORY=4g PYSPARK_PYTHON=/home/<username>/anaconda3/bin/python source spark-env.sh |
The run pyspark :
1 | pyspark |
Next we will install npm, Node.js, maven, Zeppelin notebook :
1 2 3 4 5 6 7 8 9 10 | apt update apt install npm apt install nodejs-legacy apt install libfontconfig wget http://www-eu.apache.org/dist/maven/maven-3/3.3.9/binaries/apache-maven-3.3.9-bin.tar.gz sudo tar -zxf apache-maven-3.3.9-bin.tar.gz -C /usr/local/ sudo ln -s /usr/local/apache-maven-3.3.9/bin/mvn /usr/local/bin/mvn node --version mvn --version git clone https://github.com/apache/zeppelin.git |
Next we need to open the profile file :
1 | nano .bashrc |
Add these :
1 2 | export M2_HOME=/usr/local/apache-maven-3.3.9 export PATH=${M2_HOME}/bin:${PATH} |
Then :
1 2 3 4 5 6 7 8 | source .bashrc sudo npm install -g bower mvn clean install -DskipTests cd zeppelin/zeppelin-web # debug bower install # Run mvn clean install -DskipTests |
Start the daemon.sh
:
1 | bin/zeppelin-daemon.sh start |
Next steps you can follow official website :
1 | https://zeppelin.apache.org/docs/0.6.0/install/install.html#building-from-source |