I have two dataframes with diffrenet sizes (df1 is larger than df2):
df1:
coulmn 1(contains duplicates) column 2 (unique)
A B
A C
G H
df2
coulmn 3(contains duplicates) column 4
A 1
A 1
G 0
I want to create a new dataframe where column 3 is equal to coulmn 1 (including duplicate matches) and I want to grab values from column 2 and coulmn 4. I want my resulting df to look like this:
column 2 column 4
B 1
C 1
H 0
what is the fastest way to implement this on millions of rows?