An Application of Handwriting Recognition System for Recognizing Student’s ID and Score on the Examination Paper Using WebCam

  

An Application of Handwriting Recognition System for

Recognizing Student’s ID and Score on the Examination

Paper Using WebCam

  well known that for engineering programs, most of examination given to students should be answered by handwriting on the paper. In this case, a data entry process is needed to enter the score and the corresponding

  our research, we propose a method to extract student’s ID

  The approaches for characters extraction described above are application dependent. It is difficult to find a general method to extract characters for all scenarios. In

  The appplications of character recognition described above require the high performance of the accuracy and speed. The problems of handwritten character recognition are more complex than the machine printed characters, due to the different writing styles. Many approaches have been proposed to recognize the handwritten characters [6]. In [6], they compared several popular classification techniques for handwritten character recognition, i.e. the k-nearest neighbor classifier, neural classifiers, learning vector quantization classifier, and support vector classifiers. They concluded that all the classifiers give high recognition accuracies. Furthermore, they noted that the feature extraction is primarily important to the performance of character recognition. A n innovative approach for feature extration called box-method is proposed by [7] to deal with the variability of writing styles. In [7], both fuzzy logic and neural network are used for character recognition.

  Handwriting recognition techniques have been implemented in several areas such as for automatic data entry of passport [1], to extract and recognize handwritten characters from application forms [2], to recognize handwritten postal codes for automatic sorting of mails [3]. In those applications, the characters are extracted from a whole document automatically, before they are recognized by the recognition system. Since the character extractions play an important process in the handwritten recognition system, they are also investigated intensively by researchers. In [4], they proposed a method by a syntactical structure of the numerical field to extract zip codes, phone numbers and customer codes from handwritten incoming mail documents. A robust connected component based character locating method is proposed in [5] for locating characters in scene image taken from digital camera.

  recognize the student’s ID and score written on the

  In this research, we propose an automatic data entry system to solve the above problems. The proposed system utilizes the handwriting recognition technique to

  that needs extra time and works, also facing the accuration problem due to the human error.

  student’s ID into computer

  the student’s ID and score on the examination paper. It is

  Aryuanto

  Recently, most of educational institutions have implemented the academic information sytem which provides the information of academic matters to the students easily and in short time. One key feature in this system is the application of computer and networking systems to store, process, and produce the data, resuts a paperless system. It requires that the data should be available (or converted) in digital format. Unfortunately, in several cases this requirement creates some burdens for implementation of a fully efficient academic information system. One of such problem is to convert or digitalize

  I. I NTRODUCTION

  I ndex Terms —automatic data entry, handwriting recognition, image projection, vector distance, box-method.

  I D and score. A simple effective image projection technique is employed to localize the student’s ID and score from an image, followed by the classsification technique using vector distance and box-method for classifying the numeral chartacters. The experimental results show that the algorithm works properly in finding and extracting student’s and score from the examination paper. Further, the classification technique shows a good result in classifying numeral handwriting characters.

  Abstract —This paper presents an automatic data entry system which is used to read and recognize the student’s ID and score on the examination paper automatically. The system uses a W ebcam to capture the image and a numeral handwriting recognition technique to recognize the student’s

  2 yusuf_nakhoda@yahoo.com

  1 aryuanto@gmail.com,

  2 Department of Electrical Engineering

Institut Teknologi Nasional Malang, Jl.Raya Karanglo Km. 2 Malang, I ndonesia

Tel. + 62-341-417636

  1 , Yusuf Ismail Nakhoda

  and score written on the examination paper. The proposed method utilizes the characteristics of examinee

  

localize and extract the student’s ID and score. The the fields of name, student’s ID, course, etc., is printed on

  simple and effective vertical and horizontal projections the top-right corner of the examination paper. There is no are employed for extraction process. field for score on that box, but usually examiner/lecturer To recognize the numeral handwriting characters, we writes the score on the left side of that box as depicted in propose to modify the handwritten recognition method Fig. 2. used in [7]. The method [7] uses the skeleton of a

  Examination

  character to extract the feature. This skeleton is obtained

  information fields Score Student's ID

  by a thinning algorithm which is rather complicated. In our research, to simplify the process we do not used the skeleten of a character, but uses a blob (binary image) of the character instead.

  The paper is organized as below. Section 2 presents the proposed handwriting recognition system. The characters extraction is described in section 3. In section 4, the characters recognition is described. The experimental results and discussions are covered in section 5. Finally, the conclusion is presented in section 6.

ROPOSED ANDWRITING RECOGNITION SYSTEM

II. P H

  The proposed system is depicted in Fig. 1. A Webcam is used to capture the image of examination papers where

  student’s ID and score to be recognized. The Webcam is Figure 2. Example of examination paper.

  used instead of the scanner, because of the low cost, fast reading, and easy installation. However, those benefits should be paid with the following problems: the captured

  Captured image

  image is affected by the lighting changes, the appearance of characters on the paper may degrade due to the improper position of the paper, eg. not in flat position,

  Student’s ID and

  folded, etc. Since our objective is to find and recognize

  Score localization the numeral characters, i.e. the student’s ID and score of

  the examination, not the whole handwriting texts, the Character

  extraction problems become less difficult. Character segmentation

  Character normalization Character Feature extraction recognition

  Character classification Figure 3. Block diagram of student’s ID and score recognition system.

  Figure 1. Hardware configuration of the proposed system.

  The process for recognizing student’s ID and score is

  depicted in Fig. 3. The recognition process is divided into two stages: character extraction and character recognition.

  In the research, the student’s ID and score to be

  recognized are captured from the examination paper used First step in the character extraction stage is the in National Institute of Technology (ITN), Malang, with

  localization of student’s ID and score that finds the

  the specific format as depicted in Fig. 2. A box contains locations are defined by the bounding boxes of the digits

  of student’s ID (seven digits) and score (one to three

  digits). Then the character segmentation step will separate

  the individual digit of student’s

  ID and score. After each character is extracted, they will be classified in the recognition stage.

  The extracted characters obtained in previous stage are scale varying and sometimes contain noise. Thus, the character normalization is required in the recognition stage. It resizes the characters in a standard size, and eliminates the outlier noise. After character is normalized, the feacture extraction using the box-method [7] is employed to extract the f eature of each character. Finally, the character classification process is used to classify the character (represented by its feature) into the reference numeral characters, i.

  e. “0” to “9”. The details processes will be described in the following sections.

  III. C HARACTER EX TRA CTION

  To localize the score, we search the area on the left side of the box of examination information fields. It is assumed that the area only contains the score. Therefore, the bounding box of the score could be determined by vertical and horizontal projection easily.

  L=LB+33x(line_up-line_down)/40 (1) R=LB+83x(line_up-line_down)/40 (2) T=line_down+26x(line_up-line_down)/40 (3) D=line_down+20x(line_up-line_down)/40 (4)

  following formulas:

  borders of the student’s ID are obtained using the

  3. The left (L), right (R), top (T) and bottom (B)

  line_down.

A. Student’s ID and Score Localization

  left_image, and perform a scanning from the top

  examination information fields, we first find the box in the image. The box is defined by the left border (LB), the right border (RB), the top border (TB), and bottom border (BB). RB is assigned as the right border of the paper, TB is assigned as the top border of the paper. While LB and BB are searched by analyzing the image projections as described in the following:

  (a) (b)

  locating the student’s ID as defined by Eqs. (1)-(4).

  The line_down corresponds with the last field. Hence, we could utilize the two parameters as the reference for

  line_up corresponds with the first field of the examination information fields (see Fig. 2 for the details).

  Fig. 5 depicts the box_image and the corresponding vertical projection. From the figure, it is clear that the

  Fig. 4(a) depitcs the gradient image obtained by Sobel operator in the horizontal direction. Since the examination paper uses the paper with horizontal lines printed on it, we could eliminate those horizontal lines by perform the gradient image only in the horizontal direction. The horizontal projection of the gradient image in Fig. 4(a) is depicted in Fig. 4(b). The peak in the figure corresponds with the left border of the box (LB).

  Figure 4. (a) The gradient image; (b) The horizontal projection of the gradient image.

  Since student’s ID is written in the provided box of the

  1. Convert the color image (captured image) into grayscale image.

  to the bottom to find the first transition from high peaks to zero indicating the bottom border of the box (BB). After LB,RB,TB and BB are determined, the bounding

  2. Find the Sobel gradient image in the horizontal direction.

  3. Compute the horizontal projection of the gradient image, and identify the column of the maximum peak as the left border of the box (LB).

  4. Define the left_image as the sub-image of the gradient image bounded with a short distance in the left and right of LB.

  to the bottom to find the first peak. Identify the corresponding row as line_up. Then perform a scanning from the bottom to the top to find the first peak, and identify the corresponding row as

  box_image, and perform a scanning from the top

  2. Compute the vertical projection of the

  5. Compute the vertical projection of the

  follows:

  box of student’s ID is defined by geometric analysis as

  1. Define the box_image as the gradient image bounded by LB, RB, TB and BB.

  Figure 5. The box_image and the corresponding vertical projection.

B. Character Segmentation

  The original approach [7] used the thinned image (skeleton image) to extract the feature. It requires a special thinning algorithm [9] in pre-processing step to provide the thinned image. In the research, instead of using the thinned image, we treat the binary image directly. Since the calculation of vector distance expressed by Eq. (7) is normalized by total pixels in the box, it suggests that the non-thinned image might be used too. Further, the non-thinned image is less sensitive to the

  (6) Then, a normalized vector distance is obatined by

  dividing the sum of distance of all ‘1’ pixels in a box with

  the total number of pixels on the box, yields b

  n k b k b b b d n

  1

  , 24 ,..., 1 ,

  1 (7) where n

  b

  is the the number of pixels in bth box. Fig. 6(c) depicts the extracted features

  for character ”4”, where the

  dot points denote the normalized vector distance for each box.

  e or shape compared to the thinned image. After features of the characters are extracted, the classification process is performed by calculating the distance of feature vectors between target image and the references. The distance is calculated using the following formula:

  variation of character’s styl

  The bounding box obtained in previous section defines

  24

  1

  2 b r b t b dist

  (8) where

  t b

  is the normalized vector distance for bth box of the target image, and

  r b

  is the normalized vector distance for bth box of the reference image. Given a target image, we calculate the distance for all reference images (digit

  “0” to “9”), and classify the

  target image to the reference

  line_up line_down

  b k

  ) ( j i d

  2

  2

  1

  2 /

  superimposed on the partitioned image. By taking the bottom left corner as the origin (0,0), the vector disctance for kth pixel in bth box at location (i,j) is calcucalted as

  character of number “4” shown in Fig. 6(a) is

  Feature extraction used in this research is the vector distance and box-method approached proposed by [7], [8]. The method divides a character image into 24 boxes of size 6 x 4 as depicted in Fig. 6(b), where a binary

  B. Feature Extraction and Classification

  When the normalized image is bigger than the original image, there will be pixels do not have corresponding pixels in the original image. In the case, we use the nearest neighbor interpolation to interpolate those pixels.

  m is the height of the original image, n is is the width of the original image.

  is the width of the normalized image,

  the boundarie of all seven digits of the student’s ID. To

  classify each digit or character, we should separate each digit individually. We employ a horizontal projection technique to separate them. Here, the projection is calculated from the binary image obtained by thresholding the grayscale image. To elimate the noise and connect the missing segment in the thresholded image, we perform the Gaussian smoothing and the morphological dilation operation.

  The horizontal projection method peforms well when the gap between two adjacent digits is clear enough. Thus create a valley in the image projection. However, in some cases two digits often touch each other, and the valley is not exist. It results a wrong segmentation. To overcome such problem, we utilize the ratio of height and width of the character to verify the segmented results.

IV. CHARACTER RECOGNITON

A. Character Normalization

  y n q y

  (5)

  x m p x '

  In the research, the normalized size of the character is chosen as 42 x 32 pixels. For normalization we employ the method used in [7]:

  written by the examiners will be totally different from one examiner to others. Theref ore, the size normalization should be performed before classification.

  without the provided field, the size of score’s characters

  nature of handwriting, the size of characters written by the different students will be different. Furthermore,

  height for writing the student’s ID. However due to the

  The examination information fields printed in the examination paper provides a specific field with a certain

  where

  x’

  is the x-coordinate of the normalized image,

  y ’ is the y -coordinate of the normalized image, x is the x-coordinate of the original image, p is the the height of the normalized image, q

  ' (6)

  Table 1. Results of the characters extraction Number of successed Tested items % images

  1. Box localization 19 95% 18 90%

  2. Student’s ID localization 9 45%

  3. Student’s ID extraction

  4. Score extraction 14 70% (a) (b)

  (a) (c) Figure 6. (a) A binary image of digit “4”; (b) Digit “4” enclosed in 6 x 4 boxes; (c) Pattern of digit “4” plotted using extracted features[7].

  (b)

XPERIMENTAL RESULTS AND

  ISCUSSIONS

V. E D

  To verify our proposed algorithm, we conducted the experiments using the examination paper images captured by 2.0 Megapixels Webcam. The algorithm was implemented using C++ and the OpenCV library.

  (c)

  We evaluated the characters extraction and classification stages separately. Twenty examination papers captured by Webcam are used for evaluating the character extraction process. Table 1 shows the results. From the table, we could see that the proposed algorithm

  is able to find the location of the student’s ID and score efficiently. However, the extraction rate of the student’s

  ID is low. From the observation, it is caused by the resolution of the Webcam which is relative low, hence the

  (d)

  quality of captured image is poor, i.e. the characters of

  student’s ID are not clear enough. The score extraction Figure 7. (a) Detected box of the examination information

  rate is higher than the student

  ’s ID because the character fields; (b) The bounding box of student’s ID;(c) Extracted

  size of score is usually larger, more than three times of student’s ID; (d) Extracted score.

  the student’s ID size.

  Fig. 7 shows the typical extraction results. The box shown on the top right corner in Fig. 7(a) is the detected The character classificati on algorithm was tested on box of the examination information fields. The box two hundred images of the numeral characters collected shown on the middle in Fig. 7(b) is the bounding box of from Internet, contain twenty images for each numeral

  the student’s ID. The digit extraction of the student’s ID

  character. The classification rate is shown in Table 2. It and score are shown in Fig. 7(c) and 7(d) respectively. could be clearly understood that the classification rate of numeral “1” is very high, because there are no many variations in the writing of number “1”. Fig. 8 depitcs the

  samples of tested numeral character images.

VI. CONCLUSION

  Proceedi ngs of International Conference on Computer and Information Technology , Dhaka, Banglasdesh, 2007.

  [9] N.J. Naccache, R. Shinghal, “SPTA: a proposed algorithm for thinning binary patterns”, IEEE Trans. Systems Man Cybernet. Vol 14, No. 3, pp. 409-418.

  V.K. Madasu, B.C. Lovell, M. Hanmandlu, “ Hand Printed Character Recognition Using Neural Networks”, Proceedings of International Conference on Cogniti on and Recognition , M ysore, India.

  [7] M . Hanmandlu, K.R.M . M ohan, S. Chakraborty, S. Goyal, and D.R. Choudhury, “Unconstrained handwritten character recognition based on fuzzy logic”, Pattern Recognition, Vol. 36, pp. 603-623. [8]

  C. Liu, K. Nakashima,H. Sako, and H. Fujisawa, “Handwritten digit recognition: benchmarking of state-of-the-art techniques”, Pattern Recognition , Vol. 36, pp. 2271-2285.

  [5] K. Wang and J.A. Kangas, “Character location in scene images from digital camera”, Pattern Recognition, Vol. 36, pp. 2287-2299. [6]

  International Workshop on Pattern Recognition in Information Systems, PRIS’2003, pp. 167-172.

  [4]

  [3] A.S.M .M. Morshed, M d. S. Islam, M d. A. Faisal, M d. A. Sattar, and M. Ishizuka, “Automatic Sorting of Mails by Recognizing Handwritten Postal Codes Using Ne ural Network Architectures”,

  Table 2. Results of the character classification Numeral character Classification rate “0” 85% “1” 100% “2” 75% “3” 90% “4” 95% “5” 60% “6” 95% “7” 90% “8” 85% “9” 80%

  E. Kavallieratos, N. Antoniades, N. Fakotakis and G. Kookinakis, “Extraction and Recognition of Handwritten Alphanumeric Characters From Application Forms”, Proceedings of 13 th International Conference on Digital Signal Processing , Santorini, Greece, 1997.

  , Kuala Lumpur, Malaysia, 2007. [2]

  R EFERENCES [1] M.M. Thin and M.M. Sein, “Implementation Automatic Data Entry of Passport With Handwr itten Recognition System”, Proceedings of 28 th Asi an Conference on Remote Sensing

  In future, we will investigate with the complex and a huge numbers of test images. Furthermore, the technique will be extended to deal with the general case of the examination paper.

  written on the examination paper. The proposed technique using image projection for locating those characters and vectore distance calculated using box- method, shows a good results on a limited test images.

  recognition technique to recognize student’s ID and score

  This paper presents the application of handwriting

  Overall 86% Figure 8. Some of the tested numeral character images.