Computer Vision Project

Project 3 / Camera Calibration and Fundamental Matrix Estimation with RANSAC

Part 1

Part 1 was straightforward. To solve for the projection matrix I set up a system of linear equations and solved using SVD, enforcing the constraint that the solutions are not all 0.

To solve for the camera center, I followed the method from class which splits the transformation matrix into Q and m4 to find the world coordinates of the camera.

The output of my code is:

The projection matrix is:
    0.4583   -0.2947   -0.0140    0.0040
   -0.0509   -0.0546   -0.5411   -0.0524
    0.1090    0.1783   -0.0443    0.5968


The total residual is: <0.0445>

The estimated location of camera is: <-1.5127, -2.3517, 0.2826>

Part 2

To estimate the fundamental matrix I first normalized the points in each image so that they are centered around 0,0 and scaled from -1 to 1. This helps improve the accuracy of the estimation (as proposed by Hartly).

After normalization, I estimate F using the 8-point algorithm by solving a system of homogeneous linear equations using SVD. Then I enforce rank(F) = 2 by setting the smallest singular value to 0 and reconstructing it.

Results With Normalization

F_matrix =

   -0.0000    0.0000   -0.0001
    0.0000   -0.0000    0.0011
   -0.0000   -0.0015    0.0345

Results Without Normalization

Closeup difference

Original Normalized Difference

F_matrix =

   -0.0000    0.0000   -0.0019
    0.0000    0.0000    0.0172
   -0.0009   -0.0264    0.9995

If you look close enough, you will see that the epipolar lines from the normalized fundamental matrix are closer to the original points in the images. So it makes sense to normalize the points first to improve accuracy.

Part 3

For this part I implemented RANSAC to estimate the fundamental matrix using matched SIFT features. This process helps remove spurious matches in the SIFT pipeline by looking for the best fundamental matrix to describe the image transformation and the best inliers that fit the fundamental matrix.

The fundamental matrix can describe a transformation from one image to another. In the ideal scenario x^TFx' = 0. But since our fundamental matrix is an estimate, we are looking for inliers within a specific threshold. For most cases I set the threshold to 0.1.

There is nothing special about my implementation, but I do normalize the points for estimating the fundamental matrix. You can see the effect it has on the Gaudi image pair below. Since RANSAC is a randomized algorithm, we may want to know how many iterations it will require for 99% accuracy. There is a bound we can calculate, but intuitively the longer we run it the better we should do. For most cases I ran 100,000 iterations.

Mount Rushmore

This was the easiest image pair. My best results are shown below:

Sift Matches

RANSAC Matches

Epipolar Lines

Summary

The low number of bad SIFT matches leads to a pretty good fundamental matrix estimation. You can see RANSAC filtered most of the bad matches out, but there are still a couple. The epipolar lines are very reasonable, however. We should expect each corresponding line to cover the same points in each image. In this case the lines may not be perfect matches, but it is easy to see they are pretty close.

Project 3 / Camera Calibration and Fundamental Matrix Estimation with RANSAC

Part 1

Part 2

Results With Normalization

Results Without Normalization

Closeup difference

Part 3

Mount Rushmore

Sift Matches

RANSAC Matches

Epipolar Lines

Summary

Mount Rushmore 100 iterations

RANSAC Matches

Epipolar Lines

Summary

Notre Dame

Sift Matches

RANSAC Matches

Epipolar Lines

Summary

Gaudi

Sift Matches

RANSAC Matches

Epipolar Lines

Summary

Gaudi Without Normalization

Sift Matches

RANSAC Matches

Epipolar Lines

Summary

Woodies

Sift Matches

RANSAC Matches

Epipolar Lines

Summary

Woodies For 500000 Iterations

Sift Matches

RANSAC Matches

Epipolar Lines

Summary