If I have a DataFrame like this (very minimal example)
col1 col2
0 a 1
1 a 2
2 b 1
3 b 2
4 b 4
5 c 1
6 c 2
7 c 3
and I want the intersection of all col2
values when they are related to their unique col1
values (so in this case, the intersection would be [1,2]
), how can I do so with Pandas? Another way to word this would be the values in col2
that exist for every unique value in col1
.
My (bad) solution was to get the unique col1
elements with unique
, and then build dictionaries from each unique element in col1
and then take the set intersection of those dictionary values. I feel like there is a mechanism I should be using to relate the columns together however that could make this much easier.