CS 7321 Winter 1998

PS#1 Solutions by Jiqun Wang

Character Recognition Problem


How I solved it

  1. First I did this:
    1. The method is based on binary morphological operations, so I converted all the text images to binary format;
    2. Inverted the black-and-white image so that the characters are white, ready for the later steps;
    3. Cropped a sample of each vowel ("a", "e", "i", "o", and "u") from the binary images;
    4. Thickened the text image;
    5. Thinned the structuring element (SE), i.e. the cropped vowel;
    6. Opened the text image with the SE (erosion followed by dilation with the same SE) to find the vowel's positions in the image;
    7. Changed the colormap so that only the vowels found in the text image are shown in color.
  2. Then I did this:
    1. The procedure described above recognizes a single vowel in the image;
    2. Repeating the same routine finds each of the other four vowels in turn;
    3. Then, using image addition, I combined the marks from the individual single-vowel results;
    4. This intermediate result contains all five vowels recognized in the original text image. They are marked in red.
  3. Finally, I worked on ex3:
    1. I prepared a script file, "run.m", that rotates the vowel in 6-degree increments from -90 to 90 degrees;
    2. At each angle, the matching routine is essentially the same as above.
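
The steps above can be sketched in a few lines of code. The following is only an illustrative pure-Python sketch, not the original MATLAB code: the helper names (erode, dilate, opening, union, rotate), the toy 5x7 "page", and the two toy glyphs are all assumptions made for the example, and the thickening/thinning and colormap steps are left out. Images are lists of 0/1 rows with white (1) as foreground, and the SE is anchored at its top-left corner.

```python
import math

def erode(img, se):
    """Mark every position where the SE (anchored top-left) fits inside img."""
    h, w, sh, sw = len(img), len(img[0]), len(se), len(se[0])
    out = [[0] * w for _ in range(h)]
    for r in range(h - sh + 1):
        for c in range(w - sw + 1):
            if all(img[r + i][c + j]
                   for i in range(sh) for j in range(sw) if se[i][j]):
                out[r][c] = 1
    return out

def dilate(marks, se):
    """Stamp a copy of the SE at every marked position."""
    h, w = len(marks), len(marks[0])
    out = [[0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            if marks[r][c]:
                for i, row in enumerate(se):
                    for j, v in enumerate(row):
                        if v and r + i < h and c + j < w:
                            out[r + i][c + j] = 1
    return out

def opening(img, se):
    """Opening = erosion followed by dilation with the same SE."""
    return dilate(erode(img, se), se)

def union(a, b):
    """Combine two binary results (stand-in for the image addition step)."""
    return [[x | y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def rotate(mask, degrees):
    """Nearest-neighbour rotation about the centre, keeping the same size.
    A simplification: pixels rotated outside the frame are dropped."""
    h, w = len(mask), len(mask[0])
    cy, cx = (h - 1) / 2, (w - 1) / 2
    t = math.radians(degrees)
    cos_t, sin_t = math.cos(t), math.sin(t)
    out = [[0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            y, x = r - cy, c - cx  # map each output pixel back to its source
            sr = round(cy + y * cos_t - x * sin_t)
            sc = round(cx + y * sin_t + x * cos_t)
            if 0 <= sr < h and 0 <= sc < w:
                out[r][c] = mask[sr][sc]
    return out

# Hypothetical toy data: two "letters" (an L shape and a bar) on a tiny page.
L_SHAPE = [[1, 0], [1, 0], [1, 1]]
BAR = [[1, 1, 1]]
PAGE = [
    [1, 0, 0, 0, 1, 1, 1],
    [1, 0, 0, 0, 0, 0, 0],
    [1, 1, 0, 0, 1, 0, 0],
    [0, 0, 0, 0, 1, 0, 0],
    [0, 0, 0, 0, 1, 1, 0],
]

found_l = opening(PAGE, L_SHAPE)      # marks both L shapes
found_bar = opening(PAGE, BAR)        # marks the single bar
all_found = union(found_l, found_bar)

# Rotation angles for the ex3 step: -90 to 90 degrees in 6-degree steps.
ANGLES = list(range(-90, 91, 6))
rotated_ses = [rotate(L_SHAPE, a) for a in ANGLES]
```

For the ex3 step, each entry of rotated_ses would be passed to opening in place of the unrotated SE. Rotating inside a fixed-size frame is the simplest choice here, but it clips non-square glyphs; padding the SE to a square before rotating would avoid that.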


Assumptions and Weaknesses

Assumptions:
  1. The character I selected during cropping is a good sample, and the cropping itself is accurate (in practice it is sometimes poor).
  2. The standard erode/dilate functions are adequate for these opening/closing operations.
Weaknesses:
  1. Only one of the letters is a reasonably representative sample;
  2. Because the SE is not very accurate, the erosion/dilation produces many erroneous marks.
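
One way such erroneous marks can arise is that erosion only checks whether the SE fits inside the foreground, so any larger shape that contains the SE also matches. The sketch below illustrates this with made-up data; it is an assumption about the failure mode rather than an analysis of the actual results, and the names (erode, STROKE, PAGE) are hypothetical.

```python
def erode(img, se):
    """Mark every position where the SE (anchored top-left) fits inside img."""
    h, w, sh, sw = len(img), len(img[0]), len(se), len(se[0])
    out = [[0] * w for _ in range(h)]
    for r in range(h - sh + 1):
        for c in range(w - sw + 1):
            if all(img[r + i][c + j]
                   for i in range(sh) for j in range(sw) if se[i][j]):
                out[r][c] = 1
    return out

# Hypothetical SE: a thin vertical stroke, as a thinned letter might look.
STROKE = [[1], [1], [1]]

# Toy page: an L shape on the left, a real vertical stroke on the right.
PAGE = [
    [1, 0, 0, 1, 0],
    [1, 0, 0, 1, 0],
    [1, 1, 0, 1, 0],
]

hits = [(r, c)
        for r, row in enumerate(erode(PAGE, STROKE))
        for c, v in enumerate(row) if v]
print(hits)  # [(0, 0), (0, 3)]: the L shape at column 0 is a false mark
```

The stroke at column 3 is the intended match, but the L shape at column 0 is marked as well because it contains a vertical stroke.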



Improvements and Possible Future Work



Results

Five sample vowels, selected, cropped, and converted to inverted binary images.

Image 1

Figure 1: This is the original image.

Image 2

Figure 2: This is the original image.

Image 3

Figure 3: This is the original image.



Source Code