Below is my scenario:
- Initially load data to HBASE using Sqoop (This is done)
- Now, I will get batches of data on daily basis (around 600000 records) which is a combination of new data (for inserting the new records to HBASE) and old data (for updating the existing records of HBASE). Now my question is:
How can I perform this operation using Spark/scala to Hbase table.
Your early reply would be highly appreciated.
Thanks Souvik