I have a csv file with 3 columns.
Key,Branch,Account
a,213,234567
a,454,457900
a,562,340094
a,200,456704
b,400,850988
b,590,344433
c,565,678635
c,300,453432
c,555,563546
c,001,660905
I would like to iterate through each row and get distinct rows from the Key column (a,b & c) and split them into 3 different pyspark datagrams.
a,213,234567
a,454,457900
a,562,340094
a,200,456704
b,400,850988
b,590,344433
c,565,678635
c,300,453432
c,555,563546
c,001,660905