I have an homography matrix H, by using keypoints correspondence, and I want to find the camera pose by using it.
For now, I know that you can get R1 and R2 from columns 1 and 2 of H, and R3 is the cross product of R1 and R2, while T is column 3 of H. So with it, I can build a [R|t] 3x4 matrix.
There are some things that still aren't clear for me:
What is this RT ? Camera to Keypoints, or Keypoints to camera?
I also know that you can decompose H and you get a list of possible rotation and translation vectors with the decomposeHomographyMat() function, but how I can select the correct ones.
By knowing that H=K[R|t], if I multiply calibration matrix (K) which is 3x3 in [R|t] which is 3x4, then I get a 3x4 matrix, and that's not equal to H. So I'm very confused about that.
Talking about SolvePnP() function, I don't know how to apply it here. Because I have the 2d points, which could be a random projected Keypoint obtained, but don't know how can I get the 3d object points. And, I also don't know if its necessary to do SolvePnp() because I'm supposed to have a "rvect" and "tvect" by extracting them of the Homography Matrix.
I know that maybe I am so lost in various concepts, but please have in mind that I am just starting.