Task1
Suppose I have the following dataset:
Name B C
0 James a a,b,c,d
1 James a NaN
2 Rudy b a,f
3 Karl c e,c
In column 'C', the values are comma-separated lists. I want to split them so that each value gets its own row, and drop the rows where column 'C' is NaN.
The output I want:
Name B C
0 James a a
1 James a b
2 James a c
3 James a d
4 Rudy b a
5 Rudy b f
6 Karl c e
7 Karl c c
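A minimal sketch of one way to get this output, assuming column 'C' holds plain comma-separated strings (not Python lists) and pandas 0.25 or later, where `DataFrame.explode` is available. The names `df` and `out` are just illustrative:

```python
import numpy as np
import pandas as pd

# Sample data from the question (assumed to be comma-separated strings in C).
df = pd.DataFrame({
    "Name": ["James", "James", "Rudy", "Karl"],
    "B": ["a", "a", "b", "c"],
    "C": ["a,b,c,d", np.nan, "a,f", "e,c"],
})

out = (
    df.dropna(subset=["C"])                          # drop rows where C is NaN
      .assign(C=lambda d: d["C"].str.split(","))     # "a,b,c,d" -> ["a", "b", "c", "d"]
      .explode("C")                                  # one row per list element
      .reset_index(drop=True)                        # renumber rows 0..n-1
)

print(out)
```

If the values contain spaces after the commas, splitting on `", "` or stripping each piece afterwards would probably be needed.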
Task2
I would like to label each row based on which of James, Rudy, and Karl have its column 'C' value.
Labeling rule (∩ means intersection):
Label   column 'C' value belongs to
0       James
1       Rudy
2       Karl
3       James ∩ Rudy
4       James ∩ Karl
5       Rudy ∩ Karl
6       James ∩ Rudy ∩ Karl
I want to label each row according to which names its column 'C' value belongs to.
The final result I want:
Name B C Label
0 James a a 3
1 James a b 0
2 James a c 4
3 James a d 0
4 Rudy b a 3
5 Rudy b f 1
6 Karl c e 2
7 Karl c c 4
For example, 'a' in column 'C' is labeled 3 because it appears under both James and Rudy.
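A rough sketch of how the labeling could be done, assuming the data is already in the one-value-per-row shape from Task 1. The idea is to collect, for each 'C' value, the set of names it appears under, then look that set up in a table built from the labeling rule. The names `label_of`, `owners`, and `value_to_label` are only illustrative:

```python
import pandas as pd

# Data in the shape produced by Task 1 (one C value per row).
out = pd.DataFrame({
    "Name": ["James"] * 4 + ["Rudy"] * 2 + ["Karl"] * 2,
    "B": ["a"] * 4 + ["b"] * 2 + ["c"] * 2,
    "C": ["a", "b", "c", "d", "a", "f", "e", "c"],
})

# Labeling rule from the question: set of names -> label.
label_of = {
    frozenset({"James"}): 0,
    frozenset({"Rudy"}): 1,
    frozenset({"Karl"}): 2,
    frozenset({"James", "Rudy"}): 3,
    frozenset({"James", "Karl"}): 4,
    frozenset({"Rudy", "Karl"}): 5,
    frozenset({"James", "Rudy", "Karl"}): 6,
}

# For each C value, the set of names that have it, e.g. "a" -> {"James", "Rudy"}.
owners = out.groupby("C")["Name"].agg(set)

# Turn each set of names into its label, then map it back onto the rows.
value_to_label = {value: label_of[frozenset(names)] for value, names in owners.items()}
out["Label"] = out["C"].map(value_to_label)

print(out)
```

With the sample data, 'a' is owned by {James, Rudy} and gets label 3, while 'c' is owned by {James, Karl} and gets label 4. A name that is not in the rule table would raise a `KeyError` here, so the table would need extending for other data.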
This is difficult for me. I'd appreciate any help.
Thank you for reading.