I'm new to hive and read about it online too. But still having doubts which are not cleared.
for hive external tables, hive
keep table's metadata within HDFS, but not in its warehouse which is also in HDFS
. correct ?whether its internal or external table, in both cases
data of table
will be available in HDFS only butNOWHERE
else. Mean to say, data can taken from anywhere but has to be loaded in HDFS, because HIVE uses hadoop's processing engine to process data. Correct ?internal table,
table's metadata and table's data
both will be available in HIVE's data warehouse, and this data warehouse will be at nowhere else but in HDFS only. correct ?in external table,
table's metadata and table's data
both will beNOT
available in HIVE's data warehouse but in HDFS. But hive must be keeping some info with itself that where is table's metadata located and where is its data located in HDFS, correct ?
Can anyone share feedback to above understanding ?
THanks