I have got two similar csv files. With similar I mean that both have the same headers and most of the lines are identical. I read them in using pd.read_csv
into df1
and df2
. Now I want the software to tell me which lines in df1
do not occur in df2
. Note that by using pd.read_csv
each line in df1
obtained an auto generated index which might differ from the index in df2
even if it is the same line. Hence, comparing the indicies would lead to the wrong result.
Asked
Active
Viewed 9 times
0

principal-ideal-domain
- 3,998
- 8
- 36
- 73