The base RDD that this dataframe works on is zipped with index. I'd like to split this original dataframe into multiple dataframes where the delimiter is based on the first column string (eg. "GCKN" in this case).
I assume if I get the individual dataframes, I can combine other values as well such as this..
A F G
GCKN:GCKN_cppr0/in:GCKN_cppr0/out:GCKN_cppr15/in.. -71 531
Is this possible. What is the best way to do this?