executemany() is updating the last row data to all the rows in the table

Question

I have a requirement where I have some input data in one DataFrame whose 'name' column needs to be compared with the same column in another DataFrame; if a match is found, I need to update those rows in my table.

Issue: The values from the last row of my data_to_update are being applied to all the rows.

What I have tried so far:

import pandas as pd
import numpy as np
import cx_Oracle

conn = cx_Oracle.connect('xxxxx', 'yyyyyy',dsn_tns)
cursor = conn.cursor()

data = [{'name': 'ABC', 'col1': 10, 'col2': 20, 'col3': 'John'},
        {'name': 'DEF', 'col1': 30, 'col2': 40, 'col3': 'Peter'},
        {'name': 'PQR', 'col1': 50, 'col2': 60, 'col3': 'Mary'},
        {'name': 'XYZ', 'col1': 70, 'col2': 80, 'col3': 'Robert'}]
df = pd.DataFrame(data)

data2 = [{'name': 'ABC', 'col1': 10, 'col2': 20000, 'col3': 'XXXX'},
        {'name': 'DEF', 'col1': 30, 'col2': 40, 'col3': 'Peter'},
        {'name': 'PQR', 'col1': 50, 'col2': 60, 'col3': 'Mary'},
        {'name': 'XYZ', 'col1': 70, 'col2': 80000, 'col3': 'YYYY'}]
df2 = pd.DataFrame(data2)
    
df['match'] = np.where(df['name'].isin(df2['name']), 1, 0)

exist_df = df[df['match'] == 1]
del exist_df['match']

new_df = df[df['match'] == 0]
del new_df['match']

update_list = exist_df['name'].tolist()

to_update =  "','".join(update_list)
to_update1 = "('" + to_update + "')"

data_to_update = [tuple(x) for x in exist_df[['col2','col3']].values]

update_query = ''' update mytable set col2 =: col2, col3 =: col3 where name in ''' + to_update1

cursor.executemany(update_query,data_to_update)
conn.commit()

My table data before is:

name  col1  col2  col3
ABC    10    20   John
DEF    30    40   Peter
PQR    50    60   Mary
XYZ    70    80   Robert

Data after running above code is:

name  col1  col2     col3
XYZ    70    80000   YYYY
XYZ    70    80000   YYYY
XYZ    70    80000   YYYY
XYZ    70    80000   YYYY

But the expected table data after the process:

name  col1  col2    col3
ABC    10   20000   XXXX
DEF    30    40     Peter
PQR    50    60     Mary
XYZ    70   80000   YYYY

Any help is highly appreciated, thanks in advance!

Your UPDATE syntax doesn't look correct, I'm surprised this is running at all. `=:` should be `=`. — Barmar, Apr 27 '21 at 19:31
Why are you setting `col2 = col2`? That doesn't change anything. — Barmar, Apr 27 '21 at 19:32
Where do you make the values to assign to each column depend on something else in the row? — Barmar, Apr 27 '21 at 19:33
It was a typo, the col2 value will be taken from the data_to_update 1st value similarly for col3 — Mr.B, Apr 27 '21 at 19:36
I think you mean `= :col2`, not `=: col2`. The `:` is part of the placeholder name. — Barmar, Apr 27 '21 at 19:41

score 2 · Answer 1 · edited Apr 27 '21 at 22:28


You need to execute a separate statement for each name. Change your tuple so it also includes the name column, then you can match that with a placeholder. executemany will then update each row with its corresponding values.

data_to_update = [tuple(x) for x in exist_df[['col2','col3', 'name']].values]
sql = 'UPDATE mytable SET col2 = :col2, col3 = :col3 WHERE name = :name'
cursor.executemany(sql, data_to_update)
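
As a minimal, self-contained sketch of the per-row approach: the example below uses Python's built-in sqlite3 module as a stand-in for Oracle (an assumption made only so it runs without a database server), and passes dictionaries so each named placeholder is bound by key. The table contents mirror the question. With cx_Oracle the statement and the executemany call should have the same shape.

```python
import sqlite3

# In-memory SQLite database standing in for the Oracle table (assumption).
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute(
    "CREATE TABLE mytable (name TEXT PRIMARY KEY, col1 INTEGER, col2 INTEGER, col3 TEXT)"
)
cur.executemany(
    "INSERT INTO mytable VALUES (?, ?, ?, ?)",
    [
        ("ABC", 10, 20, "John"),
        ("DEF", 30, 40, "Peter"),
        ("PQR", 50, 60, "Mary"),
        ("XYZ", 70, 80, "Robert"),
    ],
)

# One dict per row to update; 'name' selects the row, the rest are new values.
updates = [
    {"name": "ABC", "col2": 20000, "col3": "XXXX"},
    {"name": "XYZ", "col2": 80000, "col3": "YYYY"},
]

# The statement runs once per dict, and each run touches only the matching row.
cur.executemany(
    "UPDATE mytable SET col2 = :col2, col3 = :col3 WHERE name = :name",
    updates,
)
conn.commit()

rows = cur.execute(
    "SELECT name, col1, col2, col3 FROM mytable ORDER BY name"
).fetchall()
for row in rows:
    print(row)
```

Because the WHERE clause is bound per execution, rows whose name is not in the update list (DEF and PQR here) are left unchanged.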

answered Apr 27 '21 at 19:40 by Barmar · edited Apr 27 '21 at 22:28 by Christopher Jones

I have a question, please: what if name is just a string variable holding values like name = ('ABC', 'XYZ', 'LMN') and I need to use it in the WHERE ... IN clause? – Mr.B Apr 27 '21 at 19:49

See https://cx-oracle.readthedocs.io/en/latest/user_guide/bind.html#binding-multiple-values-to-a-sql-where-in-clause – Barmar Apr 27 '21 at 19:51

Adding a link for reference: in some data manipulation cases MERGE might be useful; for example, see https://stackoverflow.com/questions/67161376/is-there-a-way-to-improve-a-merge-query Beyond checking that it does what you want with the data, you have to assess whether it is more efficient to do the processing in Python or in the DB, and what the cost of transferring data from Python to the DB is. – Christopher Jones Apr 27 '21 at 22:37
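
For the WHERE ... IN question raised in the comments above, one common pattern is to generate one positional bind name per value and pass the values separately, rather than concatenating them into the SQL text. The sketch below only builds the statement string; `build_in_clause` is a hypothetical helper name, and the SELECT statement is an illustrative assumption rather than something from the thread.

```python
def build_in_clause(values):
    """Return a parenthesised list of positional bind names, e.g. "(:1, :2, :3)"."""
    if not values:
        raise ValueError("IN clause needs at least one value")
    bind_names = [":" + str(i + 1) for i in range(len(values))]
    return "(" + ", ".join(bind_names) + ")"


names = ["ABC", "XYZ", "LMN"]
sql = "SELECT name, col2, col3 FROM mytable WHERE name IN " + build_in_clause(names)
print(sql)

# With an open cx_Oracle cursor (not created here), the values would be
# passed separately from the SQL text:
#     cursor.execute(sql, names)
```

Keeping the values out of the SQL string avoids quoting problems and SQL injection, at the cost of a different statement text for each list length.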

score 0 · Answer 2 · answered Apr 27 '21 at 19:36


The =: syntax must be something special with the cx_Oracle connector. Remember that update_list contains every name in your dataframe, so executing a single UPDATE with WHERE NAME IN update_list is definitely going to set all the rows to a single value. You're going to need 4 separate UPDATE statements to update 4 rows to different values.
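
To see why every row ends up with the last set of values, here is a small reproduction using the standard-library sqlite3 module in place of Oracle (an assumption, chosen so it runs without a database server). Each execution of the statement matches all four names, so each one overwrites every row and the final tuple wins.

```python
import sqlite3

# In-memory SQLite database standing in for the Oracle table (assumption).
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.execute("CREATE TABLE mytable (name TEXT, col1 INTEGER, col2 INTEGER, col3 TEXT)")
cur.executemany(
    "INSERT INTO mytable VALUES (?, ?, ?, ?)",
    [
        ("ABC", 10, 20, "John"),
        ("DEF", 30, 40, "Peter"),
        ("PQR", 50, 60, "Mary"),
        ("XYZ", 70, 80, "Robert"),
    ],
)

# The WHERE clause is fixed and lists every name, as in the question.
bad_sql = (
    "UPDATE mytable SET col2 = ?, col3 = ? "
    "WHERE name IN ('ABC', 'DEF', 'PQR', 'XYZ')"
)
data_to_update = [(20000, "XXXX"), (80000, "YYYY")]

# Execution 1 sets all four rows to (20000, 'XXXX');
# execution 2 then sets all four rows to (80000, 'YYYY').
cur.executemany(bad_sql, data_to_update)
conn.commit()

overwritten = cur.execute(
    "SELECT name, col2, col3 FROM mytable ORDER BY name"
).fetchall()
for row in overwritten:
    print(row)
```
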

answered Apr 27 '21 at 19:36 by Tim Roberts

Forgive me, I'm a newbie. Can't we make a bulk update, like a bulk insert, using executemany()? – Mr.B Apr 27 '21 at 19:38

The syntax is actually `= :placeholder`. – Barmar Apr 27 '21 at 22:43
You would have to check the syntax on the cx_oracle connector. Maybe you can supply one UPDATE statement with multiple substitution tuples. – Tim Roberts Apr 27 '21 at 22:53
