I have two RDDs. Each of them is a key-value pair:
rdd1:
('a', 1)
('b', 2)
rdd2:
('a', 3)
('c', 2)
I want to combine them in a pyspark sql dataframe such that:
a b c
rdd1 1 2 0
rdd2 3 0 2
Is there a way to do so? Or do I need to change the way I create my rdd1 and rdd2?
Thank you