Your question is very valid. This is a very common requirement. And most people get it wrong, due to what MySQL offers.
- The requirement: Insert unless the
PRIMARY
key exists, otherwise update.
- The common approach:
ON DUPLICATE KEY UPDATE
- The result of that approach, disturbingly: Insert unless the
PRIMARY
or any UNIQUE
key exists, otherwise update!
What can go horribly wrong with ON DUPLICATE KEY UPDATE
? You insert a supposedly new record, with a new PRIMARY
key value (say a UUID), but you happen to have a duplicate value for its UNIQUE
key.
What you want is a proper exception, indicating that you are trying to insert a duplicate into a UNIQUE
column.
But what you get is an unwanted UPDATE
! MySQL will take the conflicting record and start overwriting its values. If this happens unintentionally, you have mutilated an old record, and any incoming references to the old record are now referencing the new record. And since you probably won't tell the query to update the PRIMARY
column, your new UUID is nowhere to be found. If you ever encounter this data, it will probably make no sense and you will have no idea where it came from.
We need a solution to actually insert unless the PRIMARY
key exists, otherwise update.
We will use a query that consists of two statements:
- Update where the
PRIMARY
key value matches (affects 0 or 1 rows).
- Insert if the
PRIMARY
key value does not exist (inserts 1 or 0 rows).
This is the query:
UPDATE my_table SET
unique_name = 'one', update_datetime = NOW()
WHERE id = 1;
INSERT INTO my_table
SELECT 1, 'one', NOW()
FROM my_table
WHERE id = 1
HAVING COUNT(*) = 0;
Only one of these queries will have an effect. The UPDATE
is easy. As for the INSERT
: WHERE id = 1
results in a row if the id exists, or no row if it does not. HAVING COUNT(*) = 0
inverts that, resulting in a row if the id is new, or no row if it already exists.
I have explored other variants of the same idea, such as with a LEFT JOIN
and WHERE
, but they all looked more convoluted. Improvements are welcome.