OpenCV recoverPose camera coordinate system

Question

I'm estimating the translation and rotation of a single camera using the following code.

E, mask = cv2.findEssentialMat(k1, k2, 
                         focal = SCALE_FACTOR * 2868
                         pp = (1920/2 * SCALE_FACTOR, 1080/2 * SCALE_FACTOR), 
                         method = cv2.RANSAC, 
                         prob = 0.999, 
                         threshold = 1.0)

points, R, t, mask = cv2.recoverPose(E, k1, k2)

where k1 and k2 are my matching set of key points, which are Nx2 matrices where the first column is the x-coordinates and the second column is y-coordinates.

I collect all the translations over several frames and generate a path that the camera traveled like this.

def generate_path(rotations, translations):
    path = []
    current_point = np.array([0, 0, 0])

    for R, t in zip(rotations, translations):
        path.append(current_point)
        # don't care about rotation of a single point
        current_point = current_point + t.reshape((3,)

    return np.array(path)

So, I have a few issues with this.

The OpenCV camera coordinate system suggests that if I want to view the 2D "top down" view of the camera's path, I should plot the translations along the X-Z plane.

plt.plot(path[:,0], path[:,2])

This is completely wrong.

However, if I write this instead

plt.plot(path[:,0], path[:,1])

I get the following (after doing some averaging)

This path is basically perfect. So, perhaps I am misunderstanding the coordinate system convention used by cv2.recoverPose? Why should the "birds eye view" of the camera path be along the XY plane and not the XZ plane?

Another, perhaps unrelated issue is that the reported Z-translation appears to decrease linearly, which doesn't really make sense.

I'm pretty sure there's a bug in my code since these issues appear systematic - but I wanted to make sure my understanding of the coordinate system was correct so I can restrict the search space for debugging.

Have a look at https://stackoverflow.com/questions/37810218/is-the-recoverpose-function-in-opencv-is-left-handed and the references cited there. It might give you a hint. — Paul92, May 08 '19 at 21:35

score 2 · Answer 1 · answered May 21 '19 at 09:54

At the very beginning, actually, your method is not producing a real path. The translation t produced by recoverPose() is always a unit vector. Thus, in your 'path', every frame is moving exactly 1 'meter' from the previous frame. The correct method would be, 1) initialize:(featureMatch, findEssentialMatrix, recoverPose), then 2) track:(triangluate, featureMatch, solvePnP). If you would like to dig deeper, finding tutorials on Monocular Visual SLAM would help.

Secondly, you might have messed up with the camera coordinate system and world coordinate system. If you want to plot the trajectory, you would use the world coordinate system rather than camera coordinate system. Besides, the results of recoverPose() are also in world coordinate system. And the world coordinate system is: x-axis pointing to right, y-axis pointing forward, z-axix pointing up.Thus, when you would like to plot the 'bird view', it is correct that you should plot along the X-Y plane.

Are you sure about the X-axis and Y-axis direction in the world coordinate system? Because, at least in Robotics, X-axis points forward and the Y-axis points left! — Milan, Oct 03 '22 at 21:21

OpenCV recoverPose camera coordinate system

1 Answers1