I have 2 sets of data:
df1
'Y'
A a
B b
C c
D d
E e
df2
'Y'
D d
E e
F f
I need to transform the first df1 to remove any data appearing in df2
I can easily do it using SQL, but that's not an option for me
The logic is 'select * from df1 where df1.Y not in (select Y from df2)'
I can't seem to get the implementation right using merge. Any ideas?
Desired result:
df3
'Y'
A a
B b
C c
with df2 unchanged