I'm new at Big Data and Hadoop. I'm learning Hadoop and Hbase. I got a problem but still had no idea. Could you help me?
I've put 3 csv files to HDFS, include:
- File 1(Subscribe_info.txt): numID, active_date, status
- File 2(Recharge.txt): numID, recharge_history_date, amount, method
- File 3 (Charge.txt): numID, charge_date, amount_charge
All of them related with each other by: numID.
I need to load all data above into a Hbase table with row key is the numID and contain all columns related, like this:
Hbase table: rowkey= userID -- Column: active_date - status - recharge_history - amount_recharge - method - charge - history - amount
I use Java and I've created 3 classes to parse the data from 3 data files. But I don't know how I can read the file path from HDFS and parse to put it into the Hbase table.