Apache HBase is an open source, non-relational, distributed
database modeled after Google's BigTable and is written in Java. It is
developed as part of Apache Software Foundation's Apache Hadoop project
and runs on top of HDFS (Hadoop Distributed File System), providing
BigTable-like capabilities for Hadoop. That is, it provides a
fault-tolerant way of storing large quantities of sparse data (small
amounts of information caught within a large collection of empty or
unimportant data, such as finding the 50 largest items in a group of 2
billion records, or finding the non-zero items representing less than
0.1% of a huge collection).
Pre Requirements
1) A machine with Ubuntu 14.04 LTS operating system.
2) Apache Hadoop pre installed (How to install Hadoop on Ubuntu 14.04)
3) Apache HBase 1.2.3 Software (Download Here)
HBase Pseudo Distributed Mode Installation
Before proceeding with HBase, configure Hadoop and HDFS on your local system or on a remote system and make sure they are running.
A pseudo-distributed mode is simply a distributed mode run on a single host. Use this configuration testing and prototyping on HBase. Do not use this configuration for production nor for evaluating HBase performance.
HBase Installation Steps
Step 1 - Installing java 7. Open a terminal (CTRL + ALT + T) and type the following sudo command.
Verify Installation
Step 2 - Edit /etc/hosts file.
/etc/hosts file. Add machine IP address and hostname. Save and close.
Step 3 - Creating /usr/local/hbase directory.
Step 4 - Change the ownership and permissions of the directory /usr/local/hbase. Here 'hduser' is an Ubuntu username.
Step 5 - Creating /var/hbase/pids directory.
Step 6 - Change the ownership and permissions of the directory /var/hbase/pids. Here 'hduser' is an Ubuntu username.
Step 7 - Change the directory to /home/hduser/Desktop , In my case the downloaded hbase-1.2.3-bin.tar.gz file is in /home/hduser/Desktop folder. For you it might be in /downloads folder check it.
Step 8 - Untar the hbase-1.2.3-bin.tar.gz file.
Step 9 - Move the contents of hbase-1.2.3 folder to /usr/local/hbase
Step 10 - Edit $HOME/.bashrc file by adding the java and hadoop path.
$HOME/.bashrc file. Add the following lines
Step 11 - Reload your changed $HOME/.bashrc settings
Step 12 - Change the directory to /usr/local/hbase/conf
Step 13 - Edit hbase-env.sh file.
Step 14 - Add the below lines to hadoop-env.sh file. Save and Close.
Step 15 - Edit hbase-site.xml file.
Add the below lines to hbase-site.xml file. Save and Close.
Step 16 - Make a new /user/hduser/hbase directory in HDFS.
Step 17 - Make a new /user/hduser/zookeeper directory in HDFS.
Step 18 - Change the directory to /usr/local/hbase/bin
Step 19 - Start all hbase daemons.
Step 20 - The JPS (Java Virtual Machine Process Status Tool) tool is limited to reporting information on JVMs for which it has the access permissions.
Once the HBase is up and running check the web-ui of the components as described below
Step 21 - Change the directory to /usr/local/hbase
Step 22 - HBase creates its directory in HDFS. To see the created directory, browse to Hadoop bin and type the following command.
Step 23 - To start up the initial HBase cluster.
Step 24 - To start up an extra backup master(s) on the same server run. the '1' means use ports 60001 & 60011, and this backup master's logfile will be at logs/hbase--1-master-.log. You can start up to 9 backup masters (10 total).
Step 25 - To start up more regionservers.
where '1' means use ports 60201 & 60301 and its logfile will be at
logs/hbase--1-regionserver-$ {HOSTNAME}.log. This supports up to 99
extra regionservers (100 total).
To add 4 more regionservers in addition to the one you just started by running.
To Enter into HBase Shell
Step 26 - To stop an individual regionserver
Step 27 - To stop an individual backup master server
OR
Step 28 - To stop all HBase daemons.
Pre Requirements
1) A machine with Ubuntu 14.04 LTS operating system.
2) Apache Hadoop pre installed (How to install Hadoop on Ubuntu 14.04)
3) Apache HBase 1.2.3 Software (Download Here)
HBase Pseudo Distributed Mode Installation
Before proceeding with HBase, configure Hadoop and HDFS on your local system or on a remote system and make sure they are running.
A pseudo-distributed mode is simply a distributed mode run on a single host. Use this configuration testing and prototyping on HBase. Do not use this configuration for production nor for evaluating HBase performance.
HBase Installation Steps
Step 1 - Installing java 7. Open a terminal (CTRL + ALT + T) and type the following sudo command.
$ sudo apt-get install openjdk-7-jdk
$ java -version
$ sudo gedit /etc/hosts
127.0.0.1 localhost 127.0.0.1 praveen
$ sudo mkdir /usr/local/hbase
$ sudo chown -R hduser /usr/local/hbase
$ sudo chmod -R 755 /usr/local/hbase
$ sudo mkdir /var/hbase/pids
$ sudo chown -R hduser /var/hbase/pids
$ sudo chmod -R 755 /var/hbase/pids
$ cd /home/hduser/Desktop/
$ tar xzf hbase-1.2.3-bin.tar.gz
$ mv hbase-1.2.3/* /usr/local/hbase
$ sudo gedit $HOME/.bashrc
export HBASE_HOME=/usr/local/hbase PATH=$PATH:$HBASE_HOME/bin export CLASSPATH=$CLASSPATH:/usr/local/hbase/lib/*
$ source $HOME/.bashrc
$ cd /usr/local/hbase/conf
$ gedit hbase-env.sh
export JAVA_HOME=/usr/lib/jvm/java-7-openjdk-amd64 export HBASE_REGIONSERVERS=/conf/regionservers export HBASE_MANAGES_ZK=true export HBASE_PID_DIR=/var/hbase/pids
$ gedit hbase-site.xml
<property> <name>hbase.rootdir</name> <value>hdfs://localhost:9000/user/hduser/hbase</value> </property> <property> <name>hbase.cluster.distributed</name> <value>true</value> </property> <property> <name>hbase.zookeeper.quorum</name> <value>localhost</value> </property> <property> <name>hbase.master</name> <value>localhost:60010</value> </property> <property> <name>hbase.zookeeper.property.clientPort</name> <value>2181</value> </property> <property> <name>hbase.zookeeper.property.dataDir</name> <value>hdfs://localhost:9000/user/hduser/zookeeper</value> </property>
$ hdfs dfs -mkdir /user/hduser/hbase
$ hdfs dfs -mkdir /user/hduser/zookeeper
$ cd /usr/local/hbase/bin
$ ./start-hbase.sh
$ jps
http://localhost:16010
$ cd /usr/local/hbase
$ ./bin/hbase dfs -ls /user/hduser/hbase
Step 23 - To start up the initial HBase cluster.
$ ./start-hbase.sh
Step 24 - To start up an extra backup master(s) on the same server run. the '1' means use ports 60001 & 60011, and this backup master's logfile will be at logs/hbase--1-master-.log. You can start up to 9 backup masters (10 total).
$ ./bin/local-master-backup.sh start 1 2
$ ./bin/./local-regionservers.sh start 1
$ ./bin/./local-regionservers.sh start 2 3 4
$ hbase shell
Step 26 - To stop an individual regionserver
$ ./bin/./local-regionservers.sh stop 1
$ ./bin/local-master-backup.sh stop 1
Step 28 - To stop all HBase daemons.
$ ./bin/stop-hbase.sh
Comments
Post a Comment