Optimizing MySQL subquery

Question

I'm working with the MTA API.

The stop_times table has two columns: trip_id and stop_id.

Each trip_id is repeated, with one row per stop_id. Example:

The goal is to select the trip_id of every train that we know will definitely stop at two specific stations. For stops 1 and 3, the result should be trips 1111 and 2222; for stops 1 and 2, it should be 1111 and 3333.

Here's what I wrote quickly, and of course it runs rather slowly:

SELECT trip_id 
FROM stop_times 
WHERE stop_id=## 
  AND trip_id IN (SELECT trip_id FROM stop_times WHERE stop_id=##)

Basically, I am trying to do the equivalent of MS SQL INTERSECT.

How can I optimize this to run better?

score 2 · Accepted Answer · answered Nov 11 '11 at 15:35

select trip_id 
from stop_times 
where stop_id in (111, 222)
group by trip_id
having count(distinct stop_id) = 2
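
How fast this runs depends largely on whether MySQL can find the matching stop_id rows through an index. A minimal sketch, assuming stop_times does not already have an index that starts with stop_id; the index name idx_stop_trip is hypothetical:

```sql
-- Hypothetical index name; assumes no existing index leads with stop_id.
-- With (stop_id, trip_id) the query can typically be answered from the index alone.
CREATE INDEX idx_stop_trip ON stop_times (stop_id, trip_id);

-- Inspect the plan to confirm whether the index is actually chosen.
EXPLAIN
SELECT trip_id
FROM stop_times
WHERE stop_id IN (111, 222)
GROUP BY trip_id
HAVING COUNT(DISTINCT stop_id) = 2;
```

Whether the index pays off depends on the data and the MySQL version, so it is worth comparing the EXPLAIN output before and after creating it.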

D'Arcy Rittich

score 0 · Answer 2 · edited May 23 '17 at 12:03

See this excellent answer on a variety of ways to accomplish this - plus performance tests:
how-to-filter-sql-results-in-a-has-many-through-relation

One way is this (assuming that the (trip_id, stop_id) combination is UNIQUE in your table):

SELECT a.trip_id 
FROM stop_times a
  JOIN stop_times b
    ON b.trip_id = a.trip_id
WHERE a.stop_id = #1 
  AND b.stop_id = #2
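
The uniqueness assumption matters because a duplicated (trip_id, stop_id) row would make the join return the same trip more than once. A small sketch of the join on a scratch table: the table name stop_times_demo, the INT column types, and the sample rows are all assumptions, with the rows chosen only to be consistent with the results described in the question:

```sql
-- Scratch table (hypothetical name and types); the primary key enforces
-- the uniqueness of (trip_id, stop_id) that the join relies on.
CREATE TABLE stop_times_demo (
  trip_id INT NOT NULL,
  stop_id INT NOT NULL,
  PRIMARY KEY (trip_id, stop_id)
);

-- Assumed sample rows, consistent with the question's expected results.
INSERT INTO stop_times_demo (trip_id, stop_id) VALUES
  (1111, 1), (1111, 2), (1111, 3),
  (2222, 1), (2222, 3),
  (3333, 1), (3333, 2);

-- Trips that stop at both 1 and 3.
SELECT a.trip_id
FROM stop_times_demo a
  JOIN stop_times_demo b
    ON b.trip_id = a.trip_id
WHERE a.stop_id = 1
  AND b.stop_id = 3
ORDER BY a.trip_id;
-- returns 1111 and 2222
```

Changing the second condition to b.stop_id = 2 returns 1111 and 3333 on the same rows.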

answered Nov 11 '11 at 15:35

ypercubeᵀᴹ

score 0 · Answer 3 · answered Nov 11 '11 at 15:37

SELECT trip_id FROM stop_times WHERE stop_id IN (##,##)
GROUP BY trip_id
HAVING count(DISTINCT stop_id)=2;

Michael Krelin - hacker

