CS 4495/7495: Computer Vision

Problem Set 4

Handed Out: Thursday. October 4, 2001
Due: Thursday October 18, 2001


  1. Write a matlab program "vowel.m" that uses binary image processing methods to find instances of the lower case letters 'e' in the following relatively noise-free scanned texts of the same font and size.

    Hint: Matlab has a lot of useful functions for this problem set. "help " describes particular functions and sometimes points to new functions that are useful. Check out

    Use the first image to create a set of training e's. Test on this training image and calculate your accuracy:

    Acc = (N - D - S - I) / N

    Where N is the number of samples, D are the deletions (failed to recognize the letter was present), S are the substitutions (incorrectly recognized the letter), I are insertions (labeled noise (i.e. something not a letter) as the particular letter).

    Use the second image as an independent test (i.e. use the same system that you developed for the first image on the second but without using any examples from the second image to train the system). What accuracy did you achieve?

    Do the same thing with the third image. What was the accuracy? Did your method generalize well? Try to use other features to help recognize the letter. What did you use? How much did it improve the accuracy?

    1. test image 1
    2. Click Right Mouse button (and choose Save As)to download in TIFF or in GIF, OR  copy from /net/www/classes/AY2002/cs4495_fall/ps/ps4/ps1-ex1.tiff
       

    3. test image 2
    4. Click Right Mouse button (and choose Save As) to download in TIFF or in GIF  OR  copy from /net/www/classes/AY2002/cs4495_fall/ps/ps4/ps1-ex2.tiff
       

    5. test image 3
    6. Click Right Mouse button (and choose Save As) to download in TIFF or in GIF  OR  copy from /net/www/classes/AY2002/cs4495_fall/ps/ps4/ps1-ex3.tiff
       

    7. Do the same for the letters 'a', 'i', 'o', and 'u'. In other words make the system general for all five of the vowels.

    8.  
    9. The problem set with the highest average accuracy for the 5 vowels will be awarded a box of Krispy Kremes.

    10.  
    Your output for each vowel should be three images in which the detected letters are highlighted.
     
  2. WHAT to TURN IN?
  3. You will submit your work electronically (no paper). Instead, you will submit an HTML file. Create your own web page in the spirit of the template provided for PS2. The format of the page is up to you. Feel free to add as many result images, intermediate results images, and result images for different values of the control parameters as you think is necessary to demonstrate your work. However, the page should contain at least the following:

    1. your name on top of the page,
    2. links to ALL of your matlab code,
    3. links to your result images converted to JPG format.

    Your matlab code should be reasonably commented if you want us to understand it and give you credit/comments. Also include test scripts for each of the two subproblems: test_e_only.m and test_vowels.m. These files should demonstrate your work for part 1 (letter e only), and part 2 (all vowels) of this problem set, respectively. test_vowels.m will be used to determine the winner for part 3. The scripts should:

    1. load your test images,
    2. call your other matlab functions that do the required image processing,
    3. show all results on the screen using multiple figures, and
    4. for your final result images color the pixels of the detected letters in some distinct color so we know that your code is indeed detecting them. Use different colors for different letter (see example below).



    5. save all result images in JPG format with appropriate names
    6. that can be loaded from the web page.

    To submit your work do the following:

    1. On a UNIX platform create a directory named after your CoC login_name
    2. Put ALL your matlab files, the three test scripts, the original images, and all result images in that directory.
    3. Also put a README file that explains any special needs of your code, and lists any KNOWN BUGS/FEATURES.
      For example if your code is running very slow try to explain why. If something is not working explain why.
      Also remember to put the required explanations of your code as described above.
    4. Create an HTML page according to the instructions given above. Save it as index.html in the same directory.
    5. CD to the parent directory.
    6. Create a tar archive using:
        tar cvf login_name.tar login_name
    7. Gzip the tar file using:
        gzip login_name.tar
    8. E-mail the login_name.tar.gz file to cs7495@cc.gatech.edu