I am working on an application with the following flow:
KAFKA PRODUCER --> SPARK STREAMING APPLICATION (KAFKA CONSUMER) --> HBASE --> PYTHON VIZ.
I have Spark, Kafka 2.3.0, HBase 1.3.5 and Zookeeper 3.4.13 installed. Spark, Kafka and Zookeeper are working successfully.
I'm having difficulty connecting HBase to Zookeeper.
I am using HBase in pseudo-distributed mode. Below is my hbase-site.xml:
<configuration>
<property>
<name>hbase.rootdir</name>
<value>hdfs://localhost:9000/hbase</value>
</property>
<property>
<name>hbase.zookeeper.property.clientPort</name>
<value>2181</value>
</property>
<property>
<name>hbase.zookeeper.property.dataDir</name>
<value>/usr/local/var/zookeeper</value>
</property>
<property>
<name>hbase.cluster.distributed</name>
<value>true</value>
</property>
</configuration>
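To double-check which settings HBase would read from this file, the properties can be listed with a short script. This is only a sketch: the helper name `read_hbase_site` is made up for illustration, and the XML is inlined here rather than read from the real hbase-site.xml path.

```python
import xml.etree.ElementTree as ET

# Inlined copy of the hbase-site.xml shown above (assumption: in practice
# this text would be read from the file under the HBase conf directory).
HBASE_SITE = """\
<configuration>
  <property>
    <name>hbase.rootdir</name>
    <value>hdfs://localhost:9000/hbase</value>
  </property>
  <property>
    <name>hbase.zookeeper.property.clientPort</name>
    <value>2181</value>
  </property>
  <property>
    <name>hbase.zookeeper.property.dataDir</name>
    <value>/usr/local/var/zookeeper</value>
  </property>
  <property>
    <name>hbase.cluster.distributed</name>
    <value>true</value>
  </property>
</configuration>
"""


def read_hbase_site(xml_text):
    """Return the <property> name/value pairs of a Hadoop-style config as a dict."""
    root = ET.fromstring(xml_text)
    props = {}
    for prop in root.findall("property"):
        name = prop.findtext("name")
        value = prop.findtext("value")
        if name is not None:
            props[name.strip()] = (value or "").strip()
    return props


if __name__ == "__main__":
    # Print every property so the Zookeeper-related ones are easy to spot.
    for name, value in sorted(read_hbase_site(HBASE_SITE).items()):
        print(f"{name} = {value}")
```

Note that hbase.zookeeper.quorum does not appear in my file, so HBase uses whatever default it has for that property.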
I have set HBASE_MANAGES_ZK=false in hbase-env.sh because I run the Zookeeper server separately, via kafka/bin/zookeeper-server-start.
jps command output:
4656 QuorumPeerMain
2977 HRegionServer
6083 NameNode
6694 Jps
6330 SecondaryNameNode
6524 ResourceManager
6191 DataNode
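To make the jps listing easier to compare against what I expect, here is a small sketch that reports which daemons are absent. The expected names are my assumption for a pseudo-distributed HBase with an external Zookeeper (HMaster and HRegionServer for HBase, QuorumPeerMain for Zookeeper, NameNode and DataNode for HDFS), and `missing_daemons` is a hypothetical helper name.

```python
# The jps output from above, pasted as-is.
JPS_OUTPUT = """\
4656 QuorumPeerMain
2977 HRegionServer
6083 NameNode
6694 Jps
6330 SecondaryNameNode
6524 ResourceManager
6191 DataNode
"""

# Assumption: the daemons I expect for this setup; adjust as needed.
EXPECTED = ["HMaster", "HRegionServer", "QuorumPeerMain", "NameNode", "DataNode"]


def missing_daemons(jps_output, expected=EXPECTED):
    """Return the expected daemon names that do not appear in the jps output."""
    running = set()
    for line in jps_output.splitlines():
        parts = line.split()
        # Each jps line is "<pid> <main class name>"; skip anything else.
        if len(parts) >= 2:
            running.add(parts[1])
    return [name for name in expected if name not in running]


if __name__ == "__main__":
    print(missing_daemons(JPS_OUTPUT))
```

With the listing above, HMaster is the only expected daemon that does not show up.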
I am following the steps below:
- Starting Hadoop
- Starting the Zookeeper server:
  kafka/bin/zookeeper-server-start /usr/local/Cellar/apache-zookeeper-3.5.5-bin/conf/zoo.cfg
- Running the Kafka producer and consumer applications
In the HBase shell, the list command returns the following error:
Error - Can't get master address from ZooKeeper; znode data == null
I am stuck at this point. Please help me understand what I am missing.