[New to Spark]
After creating a DataFrame, I am trying to partition it by one of its columns. When I check the partitioner with data_frame.rdd.partitioner, I get None as output.
This is how I am partitioning:
data_frame.repartition("column_name")
According to the Spark documentation, the default partitioner is HashPartitioner. How can I confirm that?
Also, how can I change the partitioner?
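
For context, this is roughly what I understand hash partitioning to mean, written as a plain-Python sketch rather than Spark code. Everything in it is an assumption made for illustration: hash_partition and partition_rows are hypothetical helpers, CRC32 is only a stand-in for a stable hash (Spark uses its own hash function internally), and the sample rows are made up.

```python
import zlib
from collections import defaultdict


def hash_partition(key: str, num_partitions: int) -> int:
    """Map a key to a partition index (hypothetical helper, not a Spark API)."""
    # CRC32 is a stand-in for a stable hash; Spark's real hash function differs.
    return zlib.crc32(key.encode("utf-8")) % num_partitions


def partition_rows(rows, key_index, num_partitions):
    """Group rows by the partition index computed from their key column."""
    partitions = defaultdict(list)
    for row in rows:
        index = hash_partition(row[key_index], num_partitions)
        partitions[index].append(row)
    return dict(partitions)


# Made-up sample data: the first field plays the role of "column_name".
rows = [("a", 1), ("b", 2), ("a", 3), ("c", 4)]
parts = partition_rows(rows, key_index=0, num_partitions=4)

for index in sorted(parts):
    print(index, parts[index])
```

If that understanding is right, rows sharing the same value in the partitioning column should always land in the same partition, which is the behaviour I would like to verify on the real DataFrame.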