How to Install Apache Druid (Single Server)

Abhishek Ghosh

By Abhishek Ghosh November 15, 2019 8:48 am Updated on November 15, 2019

How to Install Apache Druid (Single Server)

Apache Druid is a column oriented, analytics database which can be used to build a data warehouse. Druid has APIs (via HTTP and JDBC) for loading, managing, and querying data and the user can also interact with Druid via the console. A mere 4GB RAM of an instance or computer is sufficient to test-run nano-quickstart configuration of Druid, and 16GB RAM instance can run it at production. Druid can be used for clickstreams, view streams, and activity streams, for measuring user engagement, tracking A/B test data, understanding usage patterns, compute impressions, clicks, eCPM, and key conversion metrics. Other use cases of Druid includes business intelligence (OLAPOLAP), Internet of Things.

Druid is intended to run as a cluster of nodes to support a fault-tolerant architecture. The cluster will include external dependencies for Apache ZooKeeper (coordination), MySQL or PostgreSQL (metadata storage), and HDFS, or Amazon S3 for deep storage. Using Ambari, you configure Apache Druid (incubating) by setting parameters in a graphical user interface. Druid can be downloaded (and details read) from :

#
https://druid.apache.org/
https://github.com/apache/incubator-druid
#

1

2

3

4

#

https://druid.apache.org/

https://github.com/apache/incubator-druid

#

This is the link to 0.16.0 :

#
https://www.apache.org/dyn/closer.cgi?path=/incubator/druid/0.16.0-incubating/apache-druid-0.16.0-incubating-bin.tar.gz
#

1

2

3

#

https://www.apache.org/dyn/closer.cgi?path=/incubator/druid/0.16.0-incubating/apache-druid-0.16.0-incubating-bin.tar.gz

#

We have to extract and navigate to the main directory :

#
tar -xzf apache-druid-0.16.0-incubating-bin.tar.gz
cd apache-druid-0.16.0-incubating
#

1

2

3

4

#

tar -xzf apache-druid-0.16.0-incubating-bin.tar.gz

cd apache-druid-0.16.0-incubating

#

The directories contain the below :

bin/* – scripts for the quickstarts
conf/* – example configurations for single-server, clustered setup
extensions/* – core Druid extensions
hadoop-dependencies/* – Druid Hadoop dependencies
lib/* – libraries and dependencies for core Druid
quickstart/* – configuration files, sample data, and files for the quickstart tutorials

Apache Zookeeper is a dependency of Apache Druid :

curl https://archive.apache.org/dist/zookeeper/zookeeper-3.4.14/zookeeper-3.4.14.tar.gz -o zookeeper-3.4.14.tar.gz
tar -xzf zookeeper-3.4.14.tar.gz
mv zookeeper-3.4.14 zk

1

2

3

curl https://archive.apache.org/dist/zookeeper/zookeeper-3.4.14/zookeeper-3.4.14.tar.gz -o zookeeper-3.4.14.tar.gz

tar -xzf zookeeper-3.4.14.tar.gz

mv zookeeper-3.4.14 zk

The contents of the Zookeeper to be located at zk under the apache-druid-0.16.0-incubating package root. It is too easy to start Apache Druid :

#
./bin/start-micro-quickstart
#

1

2

3

#

./bin/start-micro-quickstart

#

Druid UI runs on http://localhost:8888/. It is important to read the tutorials on Druid to correctly configure depending on your need :

#
https://druid.apache.org/docs/latest/tutorials/
#

1

2

3

#

https://druid.apache.org/docs/latest/tutorials/

#

Druid needs a Relational DB as a Metadata Storage, i.e. MySQL. We have different guides on installing MySQLinstalling MySQL.

Start a CLI session: mysql -u root -p. Inside the session issue the following commands :

CREATE DATABASE druid DEFAULT CHARACTER SET utf8;

CREATE USER 'druid'@'%' IDENTIFIED BY 'druid-pass';
GRANT ALL PRIVILEGES ON *.* TO 'druid'@'%' WITH GRANT OPTION;

CREATE USER 'druid'@'localhost' IDENTIFIED BY 'druid-pass';
GRANT ALL PRIVILEGES ON *.* TO 'druid'@'localhost' WITH GRANT OPTION;

flush privileges;

1

2

3

4

5

6

7

8

9

CREATE DATABASE druid DEFAULT CHARACTER SET utf8;

CREATE USER 'druid'@'%' IDENTIFIED BY 'druid-pass';

GRANT ALL PRIVILEGES ON *.* TO 'druid'@'%' WITH GRANT OPTION;

CREATE USER 'druid'@'localhost' IDENTIFIED BY 'druid-pass';

GRANT ALL PRIVILEGES ON *.* TO 'druid'@'localhost' WITH GRANT OPTION;

flush privileges;

This is the minimum common settings for MySQL :

# Extensions (no deep storage model is listed - using local fs for deep storage - not recommended for production)
druid.extensions.coordinates=["io.druid.extensions:druid-examples","io.druid.extensions:druid-kafka-eight","io.druid.extensions:mysql-metadata-storage"]

# Zookeeper
druid.zk.service.host=localhost

# Metadata Storage (mysql)
druid.metadata.storage.type=mysql
druid.metadata.storage.connector.connectURI=jdbc\:mysql\://localhost\:3306/druid
druid.metadata.storage.connector.user=druid
druid.metadata.storage.connector.password=diurd

# Deep storage (local filesystem for examples - don't use this in production)
druid.storage.type=local
druid.storage.storageDirectory=/Users/hbutani/druid/localStorage

# Query Cache (we use a simple 10mb heap-based local cache on the broker)
druid.cache.type=local
druid.cache.sizeInBytes=10000000

# Indexing service discovery
druid.selectors.indexing.serviceName=overlord

# Monitoring (disabled for examples, if you enable SysMonitor, make sure to include sigar jar in your cp)
# druid.monitoring.monitors=["com.metamx.metrics.SysMonitor","com.metamx.metrics.JvmMonitor"]

# Metrics logging (disabled for examples - change this to logging or http in production)
druid.emitter=noop

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

# Extensions (no deep storage model is listed - using local fs for deep storage - not recommended for production)

druid.extensions.coordinates=["io.druid.extensions:druid-examples","io.druid.extensions:druid-kafka-eight","io.druid.extensions:mysql-metadata-storage"]

# Zookeeper

druid.zk.service.host=localhost

# Metadata Storage (mysql)

druid.metadata.storage.type=mysql

druid.metadata.storage.connector.connectURI=jdbc\:mysql\://localhost\:3306/druid

druid.metadata.storage.connector.user=druid

druid.metadata.storage.connector.password=diurd

# Deep storage (local filesystem for examples - don't use this in production)

druid.storage.type=local

druid.storage.storageDirectory=/Users/hbutani/druid/localStorage

# Query Cache (we use a simple 10mb heap-based local cache on the broker)

druid.cache.type=local

druid.cache.sizeInBytes=10000000

# Indexing service discovery

druid.selectors.indexing.serviceName=overlord

# Monitoring (disabled for examples, if you enable SysMonitor, make sure to include sigar jar in your cp)

# druid.monitoring.monitors=["com.metamx.metrics.SysMonitor","com.metamx.metrics.JvmMonitor"]

# Metrics logging (disabled for examples - change this to logging or http in production)

druid.emitter=noop

That ends this guide.

Tagged With apache druid in windows , apache spark druid , druid apache server , druid apache установка и настройка , https://thecustomizewindows com/2019/11/how-to-install-apache-druid-single-server/ , install apache drill ubuntu , install apache druid vm , postgresql-metadata-storage , run apache druid on windows

About Abhishek Ghosh

Here’s what we’ve got for you which might like :

Take The Conversation Further ...

Get new posts by email:

About Abhishek Ghosh

Here’s what we’ve got for you which might like :

Articles Related to How to Install Apache Druid (Single Server)

Take The Conversation Further ...

Get new posts by email: