I have two pandas dataframe (let's call them df1
and df2
), with 132 columns each and one used as index (index
) and i need to merge df2
to df1
using these rules:
- Update If an index of
df2
is indf1
update the specific row ofdf1
with the values fromdf2
- Insert If a new
df2
index is found append the row todf1
The first dataframe has 500000 rows and this operation will be executed every day with a new df2
.
I'have tried with pandas.update()
but it seems to just solve the first point of my problem (Update) while pd.concat([df2[~df2["index"].isin(df1["index"])], df1])
seems to just solve the Insert problem.
Is there a smart way using Pandas or other packages to do this?