Which lines in one DataFrame do not occur in another?

Asked Jan 13 '23 at 07:13

Active Jan 13 '23 at 07:13

Viewed 9 times

I have got two similar csv files. With similar I mean that both have the same headers and most of the lines are identical. I read them in using pd.read_csv into df1 and df2. Now I want the software to tell me which lines in df1 do not occur in df2. Note that by using pd.read_csv each line in df1 obtained an auto generated index which might differ from the index in df2 even if it is the same line. Hence, comparing the indicies would lead to the wrong result.

asked Jan 13 '23 at 07:13

principal-ideal-domain

3,998
8
36
73

Which lines in one DataFrame do not occur in another?

0 Answers0