Best way to test if a row exists in a MySQL table

Question

I'm trying to find out if a row exists in a table. Using MySQL, is it better to do a query like this:

SELECT COUNT(*) AS total FROM table1 WHERE ...

and check to see if the total is non-zero or is it better to do a query like this:

SELECT * FROM table1 WHERE ... LIMIT 1

and check to see if any rows were returned?

In both queries, the WHERE clause uses an index.

score 556 · Accepted Answer · edited Sep 04 '19 at 19:20

556

You could also try EXISTS:

SELECT EXISTS(SELECT * FROM table1 WHERE ...)

and per the documentation, you can SELECT anything.

Traditionally, an EXISTS subquery starts with SELECT *, but it could begin with SELECT 5 or SELECT column1 or anything at all. MySQL ignores the SELECT list in such a subquery, so it makes no difference.

edited Sep 04 '19 at 19:20

SovietFrontier

2,047
1
15
33

answered Nov 04 '09 at 20:59

Chris Thompson

35,167
12
80
109

32

Test with `...EXISTS( SELECT 1/0 FROM someothertable)`. For SQL Server & Oracle - it makes no difference to use *, 1 or NULL because EXISTS only tests for a boolean based on 1+ of the WHERE criteria matching. – OMG Ponies Nov 04 '09 at 23:28
84

Guys, it says right in the documentation linked to in this answer, 2nd paragraph, "Traditionally, an EXISTS subquery starts with SELECT *, but it could begin with SELECT 5 or SELECT column1 or anything at all. MySQL ignores the SELECT list in such a subquery, so it makes no difference." – mpen Feb 11 '12 at 00:39
14

@ChrisThompson : what happens when the statement is executed? I mean what does the result set contain? – Ashwin Nov 04 '12 at 11:37
14

@Ashwin, it contains whether a 0 (not exists) or 1 (exists). – fedorqui Feb 07 '13 at 09:09
Does the statement return bool type or unsigned char type? I am developing using the C API and I need to know this. Thanks. – Luka Feb 25 '14 at 06:43
17

I think your query is superfluous, I tested, and this query `SELECT 1 FROM table1 WHERE col = $var LIMIT 1` is more faster than your query. So what is the advantage of your query? – Shafizadeh Sep 28 '15 at 15:20
2

@Shafizadeh makes an excellent point, that is easy to overlook. Missing as an option above is `SELECT 1 FROM test WHERE ... LIMIT 1`, without `SELECT EXISTS` around it. Might be a hair faster that way. – ToolmakerSteve Sep 02 '16 at 15:06
2

To avoid a very long key, use this `SELECT EXISTS(SELECT 1 FROM table1 WHERE ...) as 'is_exist'` – Rahul Saini Jun 25 '19 at 04:55
5

@Shafizadeh The `EXISTS()` version has the advantage to return 0 instead of NULL when used as a scalar subquery such that you can add multiple of these expressions to create an "or" expression from it. – Tilman Vogel Mar 10 '22 at 21:46

score 241 · Answer 2 · edited Nov 03 '20 at 11:48

241

I have made some researches on this subject recently. The way to implement it has to be different if the field is a TEXT field, a non unique field.

I have made some tests with a TEXT field. Considering the fact that we have a table with 1M entries. 37 entries are equal to 'something':

SELECT * FROM test WHERE text LIKE '%something%' LIMIT 1 with mysql_num_rows() : 0.039061069488525s. (FASTER)
SELECT count(*) as count FROM test WHERE text LIKE '%something% : 16.028197050095s.
SELECT EXISTS(SELECT 1 FROM test WHERE text LIKE '%something%') : 0.87045907974243s.
SELECT EXISTS(SELECT 1 FROM test WHERE text LIKE '%something%' LIMIT 1) : 0.044898986816406s.

But now, with a BIGINT PK field, only one entry is equal to '321321' :

SELECT * FROM test2 WHERE id ='321321' LIMIT 1 with mysql_num_rows() : 0.0089840888977051s.
SELECT count(*) as count FROM test2 WHERE id ='321321' : 0.00033879280090332s.
SELECT EXISTS(SELECT 1 FROM test2 WHERE id ='321321') : 0.00023889541625977s.
SELECT EXISTS(SELECT 1 FROM test2 WHERE id ='321321' LIMIT 1) : 0.00020313262939453s. (FASTER)

edited Nov 03 '20 at 11:48

T.Todua

53,146
19
236
237

answered May 21 '12 at 15:23

Laurent W.

3,629
2
20
29

3

Thanks for the additional answer. Did you find the difference in time between the two fastest options for a TEXT field to be pretty consistent? The difference doesn't seem large, and using SELECT EXISTS(SELECT 1 ... LIMIT 1) seems to be pretty good in both cases. – Bernard Chen May 22 '12 at 07:15
1

You are right, the difference is not so important regarding the other results concerning the text field. Nevertheless, maybe the query would be better using `SELECT 1 FROM test WHERE texte LIKE '%something%' LIMIT 1` – Laurent W. Jun 04 '12 at 20:08
1

I tried on mysql and in the case you use `select 1 ... limit 1`, it is useless to surround with select exists – Adrien Horgnies May 15 '16 at 09:57
5

@LittleNooby there is difference. SELECT EXISTS ... gives true and false value (1 or 0), while SELECT 1 ... gives either 1 or empty. There are subtle difference between false value and empty set, depending on your situation. – Quickpick May 17 '16 at 23:30
@LittleNooby makes an excellent point, that is easy to overlook. Missing in the timing tests above is `SELECT 1 FROM test WHERE ...`, without `SELECT EXISTS` around it. Presumably is a hair faster that way. – ToolmakerSteve Sep 02 '16 at 15:06
I believe SELECT *... becomes slower as you add more columns. I would be curious how this all compares to SELECT `id`... – Nosajimiki Jul 13 '17 at 16:43
This is almost certainly because the PK select can be satisfied by the index for SELECT 1, but not for SELECT * which requires a row-lookup. The TEXT select needs a row lookup either way, because of the condition. `SELECT 1 FROM ... WHERE ... LIMIT 1` will give you the best of both worlds.. add the EXISTS wrapping if you require a boolean return for no matching rows. – Arth Sep 10 '18 at 11:03
3

Which version of MySQL did you use? At least in 5.5+ there is no difference between `EXISTS (SELECT ...)` and `EXISTS (SELECT ... LIMIT 1)`. MySQL is smart enough to insert this `LIMIT 1` by itself, because this is how `EXISTS` works: it stops when at least one result is found. – Ruslan Stelmachenko Jan 29 '20 at 04:05

score 36 · Answer 3 · answered Dec 19 '13 at 11:18

A short example of @ChrisThompson's answer

Example:

mysql> SELECT * FROM table_1;
+----+--------+
| id | col1   |
+----+--------+
|  1 | foo    |
|  2 | bar    |
|  3 | foobar |
+----+--------+
3 rows in set (0.00 sec)

mysql> SELECT EXISTS(SELECT 1 FROM table_1 WHERE id = 1);
+--------------------------------------------+
| EXISTS(SELECT 1 FROM table_1 WHERE id = 1) |
+--------------------------------------------+
|                                          1 |
+--------------------------------------------+
1 row in set (0.00 sec)

mysql> SELECT EXISTS(SELECT 1 FROM table_1 WHERE id = 9);
+--------------------------------------------+
| EXISTS(SELECT 1 FROM table_1 WHERE id = 9) |
+--------------------------------------------+
|                                          0 |
+--------------------------------------------+
1 row in set (0.00 sec)

Using an alias:

mysql> SELECT EXISTS(SELECT 1 FROM table_1 WHERE id = 1) AS mycheck;
+---------+
| mycheck |
+---------+
|       1 |
+---------+
1 row in set (0.00 sec)

While the accepted answer was helpful, I appreciated the alias syntax. Thanks! — krummens, Mar 15 '21 at 17:39

score 28 · Answer 4 · answered Feb 15 '17 at 06:32

28

In my research, I can find the result getting on following speed.

select * from table where condition=value
(1 total, Query took 0.0052 sec)

select exists(select * from table where condition=value)
(1 total, Query took 0.0008 sec)

select count(*) from table where condition=value limit 1) 
(1 total, Query took 0.0007 sec)

select exists(select * from table where condition=value limit 1)
(1 total, Query took 0.0006 sec)

answered Feb 15 '17 at 06:32

shihab mm

505
5
15

1

These numbers are meaningless unless you have complete control of the universe. For one thing try doing them in reverse order. Unless of course your point is there is no difference. In that case you probably right. – theking2 Jun 17 '21 at 17:48

Arth · Answer 5 · 2019-06-19T12:20:54.110

I feel it is worth pointing out, although it was touched on in the comments, that in this situation:

SELECT 1 FROM my_table WHERE *indexed_condition* LIMIT 1

Is superior to:

SELECT * FROM my_table WHERE *indexed_condition* LIMIT 1

This is because the first query can be satisfied by the index, whereas the second requires a row look up (unless possibly all the table's columns are in the index used).

Adding the LIMIT clause allows the engine to stop after finding any row.

The first query should be comparable to:

SELECT EXISTS(SELECT * FROM my_table WHERE *indexed_condition*)

Which sends the same signals to the engine (1/* makes no difference here), but I'd still write the 1 to reinforce the habit when using EXISTS:

SELECT EXISTS(SELECT 1 FROM my_table WHERE *indexed_condition*)

It may make sense to add the EXISTS wrapping if you require an explicit return when no rows match.

score 5 · Answer 6 · answered Nov 02 '12 at 15:21

5

Suggest you not to use Count because count always makes extra loads for db use SELECT 1 and it returns 1 if your record right there otherwise it returns null and you can handle it.

answered Nov 02 '12 at 15:21

fth

2,478
2
30
44

score 3 · Answer 7 · answered Mar 22 '16 at 06:50

3

At times it is quite handy to get the auto increment primary key (id) of the row if it exists and 0 if it doesn't.

Here's how this can be done in a single query:

SELECT IFNULL(`id`, COUNT(*)) FROM WHERE ...

answered Mar 22 '16 at 06:50

Zaxter

2,939
3
31
48

Why not just use `IFNULL(id, 0)` here instead of the `COUNT(*)`? – Ethan Hohensee Aug 28 '17 at 18:35

score 2 · Answer 8 · edited Mar 17 '15 at 17:39

2

A COUNT query is faster, although maybe not noticeably, but as far as getting the desired result, both should be sufficient.

edited Mar 17 '15 at 17:39

atiquratik

1,296
3
27
34

answered Nov 04 '09 at 20:58

jaywon

8,164
10
39
47

6

This is however DB specific. The COUNT(*) is known to be slow in PostgreSQL. Better would be to select the PK column and see if it returns any rows. – BalusC Nov 04 '09 at 21:05
6

COUNT(*) is slow in InnoDB though – Will Jun 21 '12 at 20:16

score -1 · Answer 9 · answered Nov 04 '09 at 21:12

-1

For non-InnoDB tables you could also use the information schema tables:

http://dev.mysql.com/doc/refman/5.1/en/tables-table.html

answered Nov 04 '09 at 21:12

davek

22,499
9
75
95

score -1 · Answer 10 · answered Nov 04 '09 at 21:15

-1

I'd go with COUNT(1). It is faster than COUNT(*) because COUNT(*) tests to see if at least one column in that row is != NULL. You don't need that, especially because you already have a condition in place (the WHERE clause). COUNT(1) instead tests the validity of 1, which is always valid and takes a lot less time to test.

answered Nov 04 '09 at 21:15

Felix

88,392
43
149
167

8

-1 This is wrong. COUNT(*) doesn't look at the column values - it just counts the number of rows. See my answer here: http://stackoverflow.com/questions/2876909/count-and-countcolumn-name-whats-the-diff – Mark Byers May 28 '10 at 21:35
6

COUNT() is much slower than EXISTS as EXISTS can return when it first finds a row – Will Jun 21 '12 at 20:17

user4193303 · Answer 11 · 2014-10-29T09:18:01.047

-1

Or you can insert raw sql part to conditions so I have 'conditions'=>array('Member.id NOT IN (SELECT Membership.member_id FROM memberships AS Membership)')

edited Oct 29 '14 at 09:18

answered Oct 29 '14 at 09:08

user4193303

1
1

score -3 · Answer 12 · answered Nov 04 '09 at 21:01

-3

COUNT(*) are optimized in MySQL, so the former query is likely to be faster, generally speaking.

answered Nov 04 '09 at 21:01

Arthur Reutenauer

2,622
1
17
15

2

Are you referring to the optimization that the MyISAM has for selecting the count for a whole table? I didn't think that helped if there was a WHERE condition. – Bernard Chen Nov 04 '09 at 21:14

Best way to test if a row exists in a MySQL table

12 Answers12

Linked

Related