I am currently doing a POC on Apache Hudi
with spark(scala).
I am facing a problem while saving a dataframe with partitioning.
Hudi saves the dataframe with path/valueOfPartitionCol1/valueOfPartitionCol2....
using the property PARTITIONPATH_FIELD_OPT_KEY
.
But my requirement is path/COL1=value/COL2=value....
Similar to the way spark partitions the data using partitionBy()
.
Anyone who has tried out custom partitioning with Hudi
can help me out?