Let's say my data frame contains these data:
>>> df = pd.DataFrame({'a':['l1','l2','l1','l2','l1','l2'],
'b':['1','2','2','1','2','2']})
>>> df
a b
0 l1 1
1 l2 2
2 l1 2
3 l2 1
4 l1 2
5 l2 2
l1
should correspond to 1
whereas l2
should correspond to 2
.
I'd like to create a new column 'c
' such that, for each row, c = 1
if a = l1
and b = 1
(or a = l2
and b = 2
). If a = l1
and b = 2
(or a = l2
and b = 1
) then c = 0
.
The resulting data frame should look like this:
a b c
0 l1 1 1
1 l2 2 1
2 l1 2 0
3 l2 1 0
4 l1 2 0
5 l2 2 1
My data frame is very large so I'm really looking for the most efficient way to do this using pandas.