I have a pilot HBase cluster with 1 master and 5 slave nodes. I want to access (basically write ad impression data via GET's) the cluster via its REST API. I want to be able to run aggregated reports using Hadoop/Hive?Pig (TBD) later, so I want a single picture of the data.
Do I start the REST server on the master and just write to that single endpoint, or do I start a REST server instance on each slave node and load balance writes across the slave nodes?
(The latter doesn't seem right but I saw some mention in docs about that so am a little confused).