Apache Drill is a SQL query engine for Hadoop, NoSQL stores, cloud storage and file systems, including HBase, MongoDB, MapR-DB, HDFS, MapR-FS, Amazon S3, Azure Blob Storage, Google Cloud Storage, Swift, NAS and local files. For example, we can join a collection in MongoDB with a directory of event logs in Hadoop. Apache Drill features a JSON data model which enables queries on complex and nested data, and it has extensions to SQL that make such data easy to query. We can install Apache Drill on a single node or on a multi-node cluster. Here is how to install Apache Drill on a single Ubuntu 16.04 LTS cloud server. Drillbit is Drill’s daemon; it runs on each node of the cluster and uses ZooKeeper to coordinate the cluster. The Drillbit which receives a request from the client is called the Foreman.
Install Apache Drill on Ubuntu 16.04 LTS Single Cloud Server
Follow our basic server setup guide to create a user with root privileges and to enable the other optional basic security measures.
We need Java, so run:

    apt update
    apt upgrade
    apt install default-jdk
Test:

    java -version
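The Java check above can also be scripted so that a missing runtime is reported clearly. This is a minimal sketch; `check_java` is a hypothetical helper name, not something shipped with Drill or the JDK.

```shell
# check_java is a hypothetical helper: it reports whether a Java runtime
# is available on the PATH before we go on to install Drill.
check_java() {
    if command -v java >/dev/null 2>&1; then
        # java -version prints to stderr, so redirect it to stdout
        echo "java found: $(java -version 2>&1 | head -n 1)"
    else
        echo "java not found; install default-jdk first"
        return 1
    fi
}

# Run the check without aborting the surrounding script on failure
check_java || true
```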
Next, look up the newest version of Apache Drill on the download page:

    http://www.apache.org/dyn/closer.cgi/drill/
Example commands:

    cd /tmp
    wget http://getdrill.org/drill/download/apache-drill-1.11.0.tar.gz
    cp apache-drill-1.11.0.tar.gz /usr/local
    cd /usr/local
    tar -xzvf apache-drill-1.11.0.tar.gz
    mv apache-drill-1.11.0 drill
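Since the version number appears several times in the commands above, it may help to keep it in one variable when adapting them to a newer release. The sketch below only derives the file and directory names; `DRILL_VERSION`, `drill_tarball` and `drill_dir` are illustrative names, and the naming pattern is assumed from the 1.11.0 example rather than guaranteed for every release.

```shell
# DRILL_VERSION is an illustrative variable; set it to the release you picked.
DRILL_VERSION="1.11.0"

# Print the tarball file name for a given Drill version
# (pattern assumed from the 1.11.0 example).
drill_tarball() {
    echo "apache-drill-$1.tar.gz"
}

# Print the directory name the tarball unpacks to.
drill_dir() {
    echo "apache-drill-$1"
}

echo "tarball:    $(drill_tarball "$DRILL_VERSION")"
echo "unpacks to: $(drill_dir "$DRILL_VERSION")"
```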
Update the ~/.bashrc file with the following contents:

    #DRILL STARTS
    export DRILL_HOME=/usr/local/drill
    export PATH=$PATH:$DRILL_HOME/bin
    #DRILL ENDS
Reload the file so the new variables take effect in the current shell:

    source ~/.bashrc
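If this setup is scripted, the ~/.bashrc edit is worth making repeatable so that a second run does not append the same lines again. Here is a minimal sketch; `add_drill_env` is a hypothetical helper, and it relies on the `#DRILL STARTS` marker line to detect an earlier run.

```shell
# add_drill_env is a hypothetical helper: it appends the Drill environment
# block to the given file unless the marker line is already present.
add_drill_env() {
    rc_file="$1"
    # Skip the append when a previous run already added the block
    if [ -f "$rc_file" ] && grep -q '^#DRILL STARTS$' "$rc_file"; then
        return 0
    fi
    # The quoted 'EOF' keeps $PATH and $DRILL_HOME literal in the file
    cat >> "$rc_file" <<'EOF'
#DRILL STARTS
export DRILL_HOME=/usr/local/drill
export PATH=$PATH:$DRILL_HOME/bin
#DRILL ENDS
EOF
}
```

It could be called as `add_drill_env ~/.bashrc`, followed by `source ~/.bashrc`.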
To start Drill, run these commands (note that chmod -R 777 makes the directory world-writable, which is only reasonable on a test server):

    sudo chmod -R 777 /usr/local/drill
    sqlline -u jdbc:drill:zk=local
To verify the Drill installation, list the running Java processes from a second terminal:

    jps
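The jps output lists every running JVM, so a small filter can make the check easier to read. This is a sketch under an assumption: that the embedded shell shows up as `SqlLine` and a standalone daemon as `Drillbit`. Verify those names against your own jps output. `drill_processes` is a hypothetical helper name.

```shell
# drill_processes is a hypothetical helper: it filters jps-style output
# (one "PID Name" pair per line) read from stdin.
# Assumption: the embedded shell appears as "SqlLine", the daemon as "Drillbit".
drill_processes() {
    grep -E '^[0-9]+ (SqlLine|Drillbit)$'
}

# Only call jps when a JDK that provides it is installed
if command -v jps >/dev/null 2>&1; then
    jps | drill_processes || echo "no Drill JVM found"
fi
```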
Use the following command to log into the Drill command prompt:

    sqlline -u jdbc:drill:zk=local
To start the Drill shell in embedded mode, we can also use the drill-embedded command, which uses a JDBC connection string and identifies the local node as the ZooKeeper node. Navigate to the Drill installation directory (/usr/local/drill in our case) and run the following command:

    bin/drill-embedded
Drill's web interface will listen on port 8047:

    netstat -nlpt | grep 8047
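A plain `grep 8047` can also match a PID or another number that merely contains those digits. The sketch below matches the port only where it ends a local address; `has_listener` is a hypothetical helper that reads `netstat -nlt` style output on stdin.

```shell
# has_listener is a hypothetical helper: it reads netstat -nlt style output
# on stdin and succeeds when some address ends in ":<port>".
has_listener() {
    grep -Eq ":$1[[:space:]]"
}

# Only run the live check when netstat is available on this machine
if command -v netstat >/dev/null 2>&1; then
    if netstat -nlt 2>/dev/null | has_listener 8047; then
        echo "something is listening on port 8047"
    else
        echo "nothing is listening on port 8047"
    fi
fi
```

The same helper works for the ZooKeeper port by passing 2181 instead of 8047.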
To exit the Drill shell, run the !quit command.
To install Drill on multiple nodes, we need to configure a cluster ID and add ZooKeeper information. We need to install Apache ZooKeeper:

    apt install zookeeperd
ZooKeeper will be started as a daemon listening on port 2181:

    sudo netstat -nlpt | grep ':2181'
I am not describing the cluster configuration steps here, as they can be a separate guide in the future and are optional for testing purposes. You can access the Apache Drill web interface with this address pattern:

    http://IP.ADDRESS:8047
You can read the rest in Apache Drill’s official documentation.