SELECT DISTINCT on one column

Question

Using SQL Server, I have...

ID  SKU     PRODUCT
=======================
1   FOO-23  Orange
2   BAR-23  Orange
3   FOO-24  Apple
4   FOO-25  Orange

I want

1   FOO-23  Orange
3   FOO-24  Apple

This query isn't getting me there. How can I SELECT DISTINCT on just one column?

SELECT 
[ID],[SKU],[PRODUCT]
FROM [TestData] 
WHERE ([PRODUCT] = 
(SELECT DISTINCT [PRODUCT] FROM [TestData] WHERE ([SKU] LIKE 'FOO-%')) 
ORDER BY [ID]

Can we assume that you don't care about the suffix on the SKU column data? I.e., you only care about "FOO-" and not "FOO-xx" – Kane, Jun 08 '09 at 18:18
What is your logic for choosing ID = 1, SKU = FOO-23 over the other values? It's easy to create a query that answers specifically for ID = 1 but fails for the general case – gbn, Jun 08 '09 at 18:20
gbn - this is an overly simplified example (obviously). What I am trying to show is one example that satisfies both criteria. There isn't (and need not be) logic to which one is chosen. — mmcglynn, Jun 08 '09 at 19:36

score 383 · Accepted Answer · answered Jun 08 '09 at 18:20

Assuming that you're on SQL Server 2005 or later, you can use ROW_NUMBER() in a derived table (a CTE would work equally well):

SELECT  *
FROM    (SELECT ID, SKU, Product,
                ROW_NUMBER() OVER (PARTITION BY PRODUCT ORDER BY ID) AS RowNumber
         FROM   MyTable
         WHERE  SKU LIKE 'FOO%') AS a
WHERE   a.RowNumber = 1
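
For comparison, the same logic written as an actual CTE might look like the sketch below. It is only an illustration, not part of the original answer: the sample rows are copied from the question, the CTE names TestData and Ranked are arbitrary, and the VALUES table constructor assumes SQL Server 2008 or later.

```sql
-- Sample rows copied from the question; the CTE names are illustrative.
WITH TestData (ID, SKU, Product) AS
(
    SELECT ID, SKU, Product
    FROM (VALUES
            (1, 'FOO-23', 'Orange'),
            (2, 'BAR-23', 'Orange'),
            (3, 'FOO-24', 'Apple'),
            (4, 'FOO-25', 'Orange')
         ) AS v (ID, SKU, Product)
),
Ranked AS
(
    -- Number the FOO- rows within each product, lowest ID first.
    SELECT ID, SKU, Product,
           ROW_NUMBER() OVER (PARTITION BY Product ORDER BY ID) AS RowNumber
    FROM   TestData
    WHERE  SKU LIKE 'FOO-%'
)
-- Keep the first row per product: (1, FOO-23, Orange) and (3, FOO-24, Apple).
SELECT ID, SKU, Product
FROM   Ranked
WHERE  RowNumber = 1
ORDER BY ID;
```

Changing the ORDER BY inside OVER (...) changes which row is kept for each product; ORDER BY ID keeps the lowest ID.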

Aaron Alton

You aren't using a [CTE](http://msdn.microsoft.com/en-us/library/ms190766.aspx) in your query. That's just a derived table. But you are right that you *could* have used a CTE here. – Mark Byers Dec 07 '11 at 09:13
Leave out "AS" for Oracle -> ...WHERE SKU LIKE 'FOO%') a WHERE a.RowNumber = 1 – Andre Nel Jun 21 '17 at 11:46
This works, although it's not a CTE ( ;WITH CTE ... ); it's more of a subquery with PARTITION BY inside. – Rohan K Jul 24 '19 at 08:24
This is a really useful approach for many kinds of duplication. Thank you. – Mustafa Salih ASLIM Jan 13 '20 at 08:08

Jakob Egger · Answer 2 · 2012-03-28T08:22:50.553

60

The simplest solution would be to use a subquery for finding the minimum ID matching your query. In the subquery you use GROUP BY instead of DISTINCT:

SELECT * FROM [TestData] WHERE [ID] IN (
   SELECT MIN([ID]) FROM [TestData]
   WHERE [SKU] LIKE 'FOO-%'
   GROUP BY [PRODUCT]
)

edited Mar 28 '12 at 08:22

answered Mar 28 '12 at 08:17

GROUP BY is not useful when you have several columns, because you need to list all of your columns in the GROUP BY clause. – MJBZA Jan 24 '22 at 13:29
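
One way to check the concern raised in the comment above: in the query from this answer, GROUP BY appears only inside the subquery, so the outer SELECT can return any number of columns. The sketch below assumes a hypothetical extra column, Price, with made-up values that are not part of the question.

```sql
-- Price is a hypothetical extra column; its values are invented for illustration.
DECLARE @Wide TABLE (ID int, SKU char(6), Product varchar(15), Price money);
INSERT INTO @Wide VALUES (1, 'FOO-23', 'Orange', 1.50);
INSERT INTO @Wide VALUES (2, 'BAR-23', 'Orange', 1.75);
INSERT INTO @Wide VALUES (3, 'FOO-24', 'Apple',  0.90);
INSERT INTO @Wide VALUES (4, 'FOO-25', 'Orange', 2.00);

-- Only the subquery groups; the outer query lists every column without a GROUP BY.
SELECT ID, SKU, Product, Price
FROM   @Wide
WHERE  ID IN (SELECT MIN(ID)
              FROM   @Wide
              WHERE  SKU LIKE 'FOO-%'
              GROUP BY Product);
```

With these rows the query returns IDs 1 and 3, each with its own SKU and Price.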

score 15 · Answer 3 · edited Mar 28 '12 at 08:20

Try this:

SELECT 
    t.*
    FROM TestData t
        INNER JOIN (SELECT
                        MIN(ID) as MinID
                        FROM TestData
                        WHERE SKU LIKE 'FOO-%'
                   ) dt ON t.ID=dt.MinID

EDIT
Once the OP corrected his sample output (it previously had only ONE result row and now shows all of them), this is the correct query:

declare @TestData table (ID int, sku char(6), product varchar(15))
insert into @TestData values (1 ,  'FOO-23'      ,'Orange')
insert into @TestData values (2 ,  'BAR-23'      ,'Orange')
insert into @TestData values (3 ,  'FOO-24'      ,'Apple')
insert into @TestData values (4 ,  'FOO-25'      ,'Orange')

--basically the same as @Aaron Alton's answer:
SELECT
    dt.ID, dt.SKU, dt.Product
    FROM (SELECT
              ID, SKU, Product, ROW_NUMBER() OVER (PARTITION BY PRODUCT ORDER BY ID) AS RowID
              FROM @TestData
              WHERE  SKU LIKE 'FOO-%'
         ) AS dt
    WHERE dt.RowID=1
    ORDER BY dt.ID

Ivan · Answer 4 · 2019-03-19T16:39:23.000

Here is a version, essentially the same as a couple of the other answers, that you can copy and paste into SQL Server Management Studio to test. Thanks to the inline values, it does not create any tables.

WITH [TestData]([ID],[SKU],[PRODUCT]) AS
(
    SELECT *
    FROM (
        VALUES
        (1,   'FOO-23',  'Orange'),
        (2,   'BAR-23',  'Orange'),
        (3,   'FOO-24',  'Apple'),
        (4,   'FOO-25',  'Orange')
    )
    AS [TestData]([ID],[SKU],[PRODUCT])
)

SELECT * FROM [TestData] WHERE [ID] IN 
(
    SELECT MIN([ID]) 
    FROM [TestData] 
    GROUP BY [PRODUCT]
)

Result

ID  SKU     PRODUCT
1   FOO-23  Orange
3   FOO-24  Apple

I have ignored the following ...

WHERE ([SKU] LIKE 'FOO-%')

as it's only part of the author's faulty code and not part of the question. It's unlikely to be helpful to people looking here.
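
Whether the filter matters depends on the data. With the question's rows, the lowest ID for each product happens to have a FOO- SKU, so both forms return the same result. The sketch below uses hypothetical rows, with the SKUs of IDs 1 and 2 swapped relative to the question, to show a case where the results could differ.

```sql
-- Hypothetical rows: the lowest ID for 'Orange' now carries a BAR- SKU.
DECLARE @Swapped TABLE (ID int, SKU char(6), Product varchar(15));
INSERT INTO @Swapped VALUES (1, 'BAR-23', 'Orange');
INSERT INTO @Swapped VALUES (2, 'FOO-23', 'Orange');
INSERT INTO @Swapped VALUES (3, 'FOO-24', 'Apple');
INSERT INTO @Swapped VALUES (4, 'FOO-25', 'Orange');

-- Without the filter: returns IDs 1 (BAR-23) and 3 (FOO-24).
SELECT * FROM @Swapped WHERE ID IN
(
    SELECT MIN(ID) FROM @Swapped GROUP BY Product
);

-- With the filter: returns IDs 2 (FOO-23) and 3 (FOO-24).
SELECT * FROM @Swapped WHERE ID IN
(
    SELECT MIN(ID) FROM @Swapped WHERE SKU LIKE 'FOO-%' GROUP BY Product
);
```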

A great idea for dev work and testing without having to create test tables. Thanks. — luisdev, Mar 16 '21 at 14:29

Bartosz X · Answer 5 · 2015-11-19T09:01:36.597

I know this was asked over 6 years ago, but knowledge is still knowledge. This is a different solution from all of the above, as I had to run it under SQL Server 2000:

DECLARE @TestData TABLE([ID] int, [SKU] char(6), [Product] varchar(15))
INSERT INTO @TestData values (1 ,'FOO-23', 'Orange')
INSERT INTO @TestData values (2 ,'BAR-23', 'Orange')
INSERT INTO @TestData values (3 ,'FOO-24', 'Apple')
INSERT INTO @TestData values (4 ,'FOO-25', 'Orange')

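-- ORDER BY makes TOP 1 deterministic, so ID and SKU come from the same (lowest-ID) row
SELECT DISTINCT  [ID] = ( SELECT TOP 1 [ID]  FROM @TestData Y WHERE Y.[Product] = X.[Product] ORDER BY Y.[ID])
                ,[SKU]= ( SELECT TOP 1 [SKU] FROM @TestData Y WHERE Y.[Product] = X.[Product] ORDER BY Y.[ID])
                ,[PRODUCT] 
            FROM @TestData X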

score 10 · Answer 6 · answered Jun 08 '09 at 20:01

-- Note: MIN(id) and MIN(sku) are computed independently, so they may come from different rows
SELECT MIN(id) AS [ID], MIN(sku) AS [SKU], Product
    FROM TestData
    WHERE sku LIKE 'FOO%' -- only if you want to restrict to SKUs matching FOO%
    GROUP BY Product
    ORDER BY [ID]

Was going to +1 this, because I think GROUP BY is the right way to go - but the minimum ID and the minimum SKU may not happen to belong to the same record. It's hard to determine what are the correct ID and SKU to report for a given PRODUCT. – Carl Manaster Jun 08 '09 at 20:17
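
The point in the comment above can be reproduced with a small sketch. The rows below are hypothetical (not the question's data) and are chosen so that the lowest ID and the lowest SKU for 'Orange' sit on different rows.

```sql
-- Hypothetical rows: lowest ID is on one row, lowest SKU on the other.
DECLARE @Demo TABLE (ID int, SKU char(6), Product varchar(15));
INSERT INTO @Demo VALUES (1, 'FOO-25', 'Orange');
INSERT INTO @Demo VALUES (2, 'FOO-23', 'Orange');

-- Independent aggregates: returns ID = 1 with SKU = 'FOO-23',
-- a combination that matches no single row in @Demo.
SELECT MIN(ID) AS ID, MIN(SKU) AS SKU, Product
FROM   @Demo
WHERE  SKU LIKE 'FOO-%'
GROUP BY Product;

-- Selecting the whole row by MIN(ID) keeps ID and SKU together:
-- returns (1, 'FOO-25', 'Orange').
SELECT ID, SKU, Product
FROM   @Demo
WHERE  ID IN (SELECT MIN(ID)
              FROM   @Demo
              WHERE  SKU LIKE 'FOO-%'
              GROUP BY Product);
```

With the question's own rows the two queries happen to agree, which is why the problem is easy to miss.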

score 6 · Answer 7 · edited Jul 24 '18 at 13:16

Try this:

SELECT * FROM [TestData] WHERE Id IN (SELECT MIN(Id) FROM [TestData] GROUP BY Product)

edited Jul 24 '18 at 13:16 by SE1986

answered Dec 14 '13 at 04:53 by Anna Karthi
