The cluster environment has been configured zookeeper,hadoop. And distributed database hbase. In this stage, we will start the practice of building a data warehouse . Software involved :
mysql, hive .
background :
( I used to think that a data warehouse is a database of a database ..)
so ,hive The essence is a hadoop Tools for .
mysql Naturally, you don't have to search too much . It should be noted that it is in the centos7 But there are some fastidious installation . originally centos use yum
It can be very convenient to mysql Service installed , No environment variables need to be configured , But my network environment is not good , I have to download the installation package !!
stay ubutu Installed on mysql Of tar.gz package , stay windows It can also be downloaded from zip package , however centos The top needs the next
bundle Bag of , And pay attention to the version . adopt bundle Installation is still the first time , So there are a lot of problems .
step :
1. centos 7 Installation on mysql:
Download and unzip the downloaded bundle package , And adopt rpm Install .
Change password policy and change password :
mysql Some of its own operation task commands :
A lot of orders are Mysql Required by itself , Although it has been used many times , But every time you use it, you still need to check it .
It is estimated that it was not recited as an ancient poem in class , Or is it that your brain doesn't feel the need to remember ..
2.hive Server installation :
2.1 Extract and configure environment variables :
2.2 stay hive Server side , take mysql Client connections for : mysql-connector Copy to :hive Of lib lower .
be careful , this mysql-connector Just copy to hive Server side , Because only the server needs to follow the mysql signal communication .
hive The client only needs to communicate with the server , So there are no steps .
2.3 to configure hive Server configuration : conf/hive-env.sh
2.4 to configure hive Server configuration : conf/hive-site.xml
3. hive Client installation :
3.0 Client needs and hadoo signal communication . hadoop And hive Of jline Package versions need to be unified .
It can be downloaded from Hive Of lib Package will jline Of jar Package copy to hadoop Of lib position . Pay attention to the situation here : hive client , hadoop Server side .
3.1 Extract and configure environment variables .
3.2 modify conf/hive-env.sh, newly added hdoop route .
3.3 to configure conf/hive-site.xml:
5. hive Verification of data warehouse :
Server :
client :
Technology