Trying to accurately measure 3D distance from a 2D image

Question

I am trying to extract out 3D distance in mm between two known points in a 2D image. I am using square AR markers in order to get the camera coordinates relative to the markers in the scene. The points are the corners of these markers.

An example is shown below:

enter image description here

The code is written in C# and I am using XNA. I am using AForge.net for the CoPlanar POSIT The steps I take in order to work out the distance:

1. Mark corners on screen. Corners are represented in 2D vector form, Image centre is (0,0). Up is positive in the Y direction, right is positive in the X direction.

2. Use AForge.net Co-Planar POSIT algorithm to get pose of each marker:

    float focalLength = 640; //Needed for POSIT
    float halfCornerSize = 50; //Represents 1/2 an edge i.e. 50mm
    AVector[] modelPoints = new AVector3[]
    {
         new AVector3( -halfCornerSize, 0,  halfCornerSize ),
         new AVector3(  halfCornerSize, 0,  halfCornerSize ),
         new AVector3(  halfCornerSize, 0, -halfCornerSize ),
         new AVector3( -halfCornerSize, 0, -halfCornerSize ),
    };
    CoplanarPosit coPosit = new CoplanarPosit(modelPoints, focalLength);
    coPosit.EstimatePose(cornersToEstimate, out marker1Rot, out marker1Trans);

3. Convert to XNA rotation/translation matrix (AForge uses OpenGL matrix form):

    float yaw, pitch, roll;
    marker1Rot.ExtractYawPitchRoll(out yaw, out pitch, out roll);

    Matrix xnaRot = Matrix.CreateFromYawPitchRoll(-yaw, -pitch, roll);
    Matrix xnaTranslation = Matrix.CreateTranslation(marker1Trans.X, marker1Trans.Y, -marker1Trans.Z);
    Matrix transform = xnaRot * xnaTranslation;

4. Find 3D coordinates of the corners:

    //Model corner points
    cornerModel = new Vector3[]
    {
        new Vector3(halfCornerSize,0,-halfCornerSize),
        new Vector3(-halfCornerSize,0,-halfCornerSize),

        new Vector3(halfCornerSize,0,halfCornerSize),
        new Vector3(-halfCornerSize,0,halfCornerSize)
    };

    Matrix markerTransform =  Matrix.CreateTranslation(cornerModel[i].X, cornerModel[i].Y, cornerModel[i].Z);
    cornerPositions3d1[i] = (markerTransform * transform).Translation;

    //DEBUG: project corner onto screen - represented by brown dots
    Vector3 t3 = viewPort.Project(markerTransform.Translation, projectionMatrix, viewMatrix, transform);
    cornersProjected1[i].X = t3.X; cornersProjected1[i].Y = t3.Y;

5. Look at the 3D distance between two corners on a marker, this represents 100mm. Find the scaling factor needed to convert this 3D distance to 100mm. (I actually get the average scaling factor):

    for (int i = 0; i < 4; i++)
    {
        //Distance scale;
        distanceScale1 += (halfCornerSize * 2) / Vector3.Distance(cornerPositions3d1[i], cornerPositions3d1[(i + 1) % 4]);
    }
    distanceScale1 /= 4;

6. Finally I find the 3D distance between related corners and multiply by the scaling factor to get distance in mm:

    for(int i = 0; i < 4; i++)
    {
       distance[i] = Vector3.Distance(cornerPositions3d1[i], cornerPositions3d2[i]) * scalingFactor;
    }

The distances acquired are never truly correct. I used the cutting board as it allowed me easy calculation of what the distances should be. The above image calculated a distance of 147mm (expected 150mm) for corner 1 (red to purple). The image below shows 188mm (expected 200mm).

enter image description here

What is also worrying is the fact that when measuring the distance between marker corners sharing an edge on the same marker, the 3D distances obtained are never the same. Another thing I noticed is that the brown dots never seem to exactly match up with the colored dots. The colored dots are the coordinates used as input to the CoPlanar posit. The brown dots are the calculated positions from the center of the marker calculated via POSIT.

Does anyone have any idea what might be wrong here? I am pulling out my hair trying to figure it out. The code should be quite simple, I don't think I have made any obvious mistakes with the code. I am not great at maths so please point out where my basic maths might be wrong as well...

2D -> 3D the most obvious thing that comes to mind is stero vision. — ldog, May 28 '12 at 22:14
This is supposed to work with a single webcam hence the reason why I am using pose estimation on a known target. — Jkh2, May 28 '12 at 22:52
I think you're getting a lot of noise and rounding error from all the calculations. XNA math methods were meant for speed, not accuracy. Look into using doubles, and/or using custom math functions. — Julien Lebot, May 28 '12 at 23:14
I did consider rounding errors, I tried manually doing the calculations using doubles but it gave similar results. — Jkh2, May 29 '12 at 00:28
Have you checked for lens aberrations in your camera? What happens to the estimates if you move your camera in the plane perpendicular to the image axis without rotating it so that the object changes position in the image but stays the same otherwise? — jilles de wit, May 29 '12 at 08:50
I am trying to add camera undistortion now. I used GML toolbox http://graphics.cs.msu.ru/en/science/research/calibration/cpp to get the distortion parameters but I am unsure about how to use them. I was looking at this answer: http://stackoverflow.com/a/7076541/1055841 and I was wondering if that is distorting a corrected image. Also I think I have only K values from GML toolbox, could I set the p(tangential distortion) to 0? — Jkh2, May 29 '12 at 17:26

score 1 · Accepted Answer · answered May 30 '12 at 07:01

You are using way to many black boxes in your question. What is the focal length in the second step? Why go through ypr in step 3? How do you calibrate? I recommend to start over from scratch without using libraries that you do not understand.

Step 1: Create a camera model. Understand the errors, build a projection. If needed apply a 2d filter for lens distortion. This might be hard.

Step 2: Find you markers in 2d, after removing lens distortion. Make sure you know the error and that you get the center. Maybe over multiple frames.

Step 3: Un-project to 3d. After 1 and 2 this should be easy.

Step 4: ???

Step 5: Profit! (Measure distance in 3d and know your error)

score 0 · Answer 2 · answered May 07 '13 at 15:57

0

I think you need to have 3D photo (two photo from a set of distance) so you can get the parallax distance from image differences

answered May 07 '13 at 15:57

passerby

1

Trying to accurately measure 3D distance from a 2D image

2 Answers2