I have two tables, src and dest, with the same schema. I would like to overwrite partitions of dest using the updated data from the src table.
Hive offers two different statements for this:
Insert Overwrite
Exchange Partition
Exchange partition is problematic because some of the partitions already exist in dest (as stated in the documentation, a partition cannot be exchanged if it already exists in the destination table). I also need to support concurrency and atomicity, which means that a drop followed by an exchange may not work, but I am not sure about that.
Insert overwrite is problematic because it takes a long time: a lot of data is copied, and there is no hot swap of the partition.
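For reference, this is roughly what the two statements look like in my case. It is only a sketch: the partition column dt, its value, and the columns id and value are made up for illustration, and the real tables differ.

```sql
-- Assumed layout (hypothetical): src and dest are both partitioned by dt
-- and have the regular columns id and value.

-- Option 1: rewrite the partition in dest from src.
-- Works when the partition already exists, but the data is copied.
INSERT OVERWRITE TABLE dest PARTITION (dt = '2020-01-01')
SELECT id, value
FROM src
WHERE dt = '2020-01-01';

-- Option 2: move the partition from src into dest.
-- Fails if dest already has the partition dt = '2020-01-01'.
ALTER TABLE dest EXCHANGE PARTITION (dt = '2020-01-01') WITH TABLE src;
```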
Is there a hybrid way to solve this, something like an exchange overwrite? If it does not exist, is there any plan to add that functionality to Hive?