I have a list of lists formatted as:
testing_set = ["001,P01", "002,P01,P02", "003,P01,P02,P09", "004,P01,P03"]
I used re
to reformat the list as such:
[in] test_set1 = [ re.split(r',', line, maxsplit=5) for line in testing_set]
[out] ["001","P01"]
How can I create a dataframe where the index is (transaction_id) "001,002,003,004" and the p-values for each line are listed in the column (product_id).