"SELECT * FROM..." VS "SELECT ID FROM..." Performance

Question

As someone who is newer to many things SQL as I don't use it much, I'm sure there is an answer to this question out there, but I don't know what to search for to find it, so I apologize.

Question: if I had a bunch of rows in a database with many columns but only need to get back the IDs which is faster or are they the same speed?

SELECT * FROM...

vs

SELECT ID FROM...

when ever you use any aggregate function like `Max`, `Min` etc you need group by on that column. — शेखर, Jan 24 '14 at 04:54
Split the second question to its own post. You'll get better response. — Nick, Jan 24 '14 at 04:54
Wow tnx for all the great answers. My next focus is to go back and update all the SQL Select statements iv'e done. — zezba9000, Jan 24 '14 at 05:17
@zezba9000 Mark someone's as answer if it helped you, so that it will be useful for someone in future — Amarnath Balasubramanian, Jan 24 '14 at 05:22

J0e3gan · Accepted Answer · 2014-01-24T05:12:40.130

You asked about performance in particular vs. all the other reasons to avoid SELECT *: so it is performance to which I will limit my answer.

On my system, SQL Profiler initially indicated less CPU overhead for the ID-only query, but with the small # or rows involved, each query took the same amount of time.

I think really this was only due to the ID-only query being run first, though. On re-run (in opposite order), they took equally little CPU overhead.

Here is the view of things in SQL Profiler:

SQL Profiler Results - *, ID-Only, ID-Only, *

With extremely high column and row counts, extremely wide rows, there may be a perceptible difference in the database engine, but nothing glaring here.

Where you will really see the difference is in sending the result set back across the network! The ID-only result set will typically be much smaller of course - i.e. less to send back.

Marked this as the answer as it clearly proves there is a CPU performance gain. — zezba9000, Jan 24 '14 at 05:24

score 3 · Answer 2 · answered Jan 24 '14 at 04:57

3

Never use * to return all columns in a table–it’s lazy. You should only extract the data you need. so-> select field from is more faster

answered Jan 24 '14 at 04:57

Abhi Adr

1,264
2
11
27

What's the reason behind that? Just posting your POV sounds more like preaching. – Daksh Gargas Nov 10 '22 at 01:12

score 2 · Answer 3 · answered Jan 24 '14 at 05:00

There are several reasons you should never (never ever) use SELECT * in production code:

since you're not giving your database any hints as to what you want, it will first need to check the table's definition in order to determine the columns on that table. That lookup will cost some time - not much in a single query - but it adds up over time.
in SQL Server (not sure about other databases), if you need a subset of columns, there's always a chance a non-clustered index might be covering that request (contain all columns needed). With a SELECT *, you're giving up on that possibility right from the get-go. In this particular case, the data would be retrieved from the index pages (if those contain all the necessary columns) and thus disk I/O and memory overhead would be much less compared to doing a SELECT *.... query.

score 1 · Answer 4 · answered Jan 24 '14 at 04:52

1

Long answer short, selecting only the columns you need will always be faster. SELECT * requires scanning the whole table. This is a best practice thing that you should adopt very early on.

For the second part, you should probably post a seperate question instead of piggybacking off this one. Makes it easy to distinguish what you are asking about.

answered Jan 24 '14 at 04:52

TTeeple

2,913
1
13
22

Ok cool got it. And will do on second part. – zezba9000 Jan 24 '14 at 04:54

"SELECT * FROM..." VS "SELECT ID FROM..." Performance

4 Answers4