python - given two tuples lists find the closest tuple between them (dist

Question

I have two lists with tuples (coordinates), for example:

some_pt1 = [(10.76,2.9),(3.24,4.28),(7.98,1.98),(3.21,9.87)]
some_pt2 = [(11.87,6.87), (67.87,8.88), (44.44, 6.78), (9.81, 1.09), (6.91, 0.56), (8.76, 8.97), (8.21, 71.66)]

each value in the tuples is a flat
the lists are not in the same length

I what to find the two closest points between the two lists. I don't know how, maybe it is possible to do it using distances. I hope there is a more efficient way to do this cause I need this function to work as fast as possible (it is a part of something bigger).

Is it a requirement that the two points are from _both_ lists or do you also include the distances from points of the same list? — Aemyl, Apr 13 '18 at 07:35

score 2 · Answer 1 · answered Apr 13 '18 at 08:17

Alternatively, by taking reference Tim Seed's code. this can be used.

from scipy.spatial import distance
some_pt1 = [(10.76,2.9),(3.24,4.28),(7.98,1.98),(3.21,9.87)]
some_pt2 = [(11.87,6.87), (67.87,8.88), (44.44, 6.78), (9.81, 1.09), (6.91, 0.56), (8.76, 8.97), (8.21, 71.66)]

empthy_dict = {}
for i in range(len(some_pt1)):
    for j in range(len(some_pt2)):
        dist = distance.euclidean(some_pt1[i],some_pt2[j])
        empthy_dict[dist] = [some_pt1[i],some_pt2[j]]

shortest = sorted(empthy_dict.keys())[0]
points = empthy_dict[shortest]
print('Shortest distance is ' ,shortest,' and points are ' ,points)

score 1 · Answer 2 · answered Apr 13 '18 at 07:42

How about this

from pprint import pprint

some_pt1 = [(10.76,2.9),(3.24,4.28),(7.98,1.98),(3.21,9.87)]
some_pt2 = [(11.87,6.87), (67.87,8.88), (44.44, 6.78), (9.81, 1.09), (6.91, 0.56), (8.76, 8.97), (8.21, 71.66)]


distance = {}
for x in some_pt1:
    for y in some_pt2:
        dist =abs(abs(x[0])-abs(y[0]))+abs(abs(x[1])-abs(y[1]))
        distance[dist]=[x,y]

shortest =sorted(distance.keys())[0]
print("Min Distance is {} Objects are  {} {} ".format(shortest, distance[shortest][0],distance[shortest][0]))

This is (almost, too many `abs`) the Manhattan metric, which is the wrong one :) — timgeb, Apr 13 '18 at 07:48

Jorge Rodriguez Molinuevo · Answer 3 · 2018-04-13T08:12:47.760

In any case you need to do all posible combinations, there are algorithms out there that can help you do it in the best order or avoiding repeating distances. If you want to do it fast you should use a special library that helps compiling with the or precompiling the arrays, this could be done with Numba or Cython. Other libraries such as scipy have special modules suh as scipy.spatial.distance. Look at this post for more doubts similar cuestion.

Example:

import scipy.spatial.distance as sd
import numpy as np
some_pt1 = [(10.76,2.9),(3.24,4.28),(7.98,1.98),(3.21,9.87)]
some_pt2 = [(11.87,6.87), (67.87,8.88), (44.44, 6.78), (9.81, 1.09), (6.91, 0.56), (8.76, 8.97), (8.21, 71.66)]
np.unravel_index(np.argmin(sd.cdist(some_pt1, some_pt2)), (len(some_pt1), len(some_pt2)))

results: (2, 4)

This code would return the position in the first list and in the second.

score 0 · Answer 4 · answered Apr 13 '18 at 07:40

0

By Euclidian distance:

>>> some_pt1 = [(10.76,2.9),(3.24,4.28),(7.98,1.98),(3.21,9.87)]
>>> some_pt2 = [(11.87,6.87), (67.87,8.88), (44.44, 6.78), (9.81, 1.09), (6.91, 0.56), (8.76, 8.97), (8.21, 71.66)]
>>> 
>>> def dist_sq(p1_p2):
...     p1, p2 = p1_p2
...     return sum(x*y for x,y in zip(p1, p2))
... 
>>> 
>>> min(((p1, p2) for p1 in some_pt1 for p2 in some_pt2), key=dist_sq)
((3.24, 4.28), (6.91, 0.56))

Which has runtime O(n*m) (where n, m are the lengths of your lists). Since you need to look at all the pairs, it won't get better than this.

Note that comparing the squared distances is enough, no need to compute the root.

answered Apr 13 '18 at 07:40

timgeb

76,762
20
123
145

Thanks, That looks like it could do the job good, but it makes me ask: is there any other way to do it more efficiently then by chacking every single one cause the real lists that I will have will be long and may have points that are extremely far. – eran halperin Apr 13 '18 at 07:57
@eranhalperin no, you can't reduce the complexity of the algorithm. You can make it faster with `numpy` and other techniques mentioned by Jorge Rodriguez Molinuevo but you need to compare every pair. – timgeb Apr 13 '18 at 07:59
Okay, so how do I use numpy (for example) to make it work faster – eran halperin Apr 13 '18 at 08:02

python - given two tuples lists find the closest tuple between them (dist

4 Answers4