A Zone Based Character Recognition Engine for Kannada and English Scripts

被引:1
|
作者
Mukarambi, Gururaj [1 ]
Dhandra, B. V. [1 ]
Hangarge, Mallikarjun [2 ]
机构
[1] Gulbarga Univ, Dept PG Studies & Res Comp Sci, Gulbarga 585106, Karnataka, India
[2] Sci & Commerece Coll, Karnatak Arts, Dept Comp Sci, Bidar 585401, Karnataka, India
关键词
OCR; SVM; document image analysis;
D O I
10.1016/j.proeng.2012.06.381
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, an Optical Character Recognition engine for Kannada and English character recognition is proposed based on zone features. The zone is one of the old concepts in case of document image analysis research. But this method is good in case of Kannada and English character recognition. The total of 2800 Kannada consonants and 2300 English lowercase alphabets sample images are classified based on the SVM classifier. All preprocessed images are normalized into 32 x 32 dimensions, it is optimum. Then the preprocessed image is divided into 64 zones of non overlapping and zone based pixel density is calculated for each of the 64 zones, there by generating 64 features. These features are fed to the SVM classifier for classification of character images. To test the performance of an algorithm 2 fold cross validation is used. The average recognition accuracy of 73.33% and 96.13% is obtained for Kannada consonants and English lowercase alphabets respectively. Further the average percentage of recognition accuracy of 83.02% is obtained for mixture input of both Kannada and English characters. The recognition accuracy obtained for Kannada consonants is low, because most of the characters are similar in shape. Hence, one may need to add some more dominating features to discriminating the characters. In this direction, the work is in progress. It is an initial attempt for mixture of Kannada and English characters recognition with single algorithm. The novelty of the algorithm is independent of thinning and slant of the characters. (C) 2012 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of Noorul Islam Centre for Higher Education
引用
收藏
页码:3292 / 3299
页数:8
相关论文
共 50 条
  • [21] Segmentation-free composite character recognition (CR) in bilingual handwritten text for Gurumukhi–English scripts
    Sukhandeep Kaur
    Seema Bawa
    Ravinder Kumar
    Soft Computing, 2023, 27 : 16159 - 16178
  • [22] Complete Kannada Optical Character Recognition with Syntactical Analysis of the script
    Sagar, B. M.
    Shobha, G.
    Kumar, Ramakanth P.
    ICCN: 2008 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING, 2008, : 484 - +
  • [23] A survey on optical character recognition for Bangla and Devanagari scripts
    Bag, Soumen
    Harit, Gaurav
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2013, 38 (01): : 133 - 168
  • [24] A survey on optical character recognition for Bangla and Devanagari scripts
    SOUMEN BAG
    GAURAV HARIT
    Sadhana, 2013, 38 : 133 - 168
  • [25] Review on OCR for Handwritten Indian Scripts Character Recognition
    Kumar, Munish
    Jindal, M. K.
    Sharma, R. K.
    ADVANCES IN DIGITAL IMAGE PROCESSING AND INFORMATION TECHNOLOGY, 2011, 205 : 268 - +
  • [26] Handwritten character recognition of popular south Indian scripts
    Pal, Umapada
    Sharma, Nabin
    Wakabayashi, Tetsushi
    Kimura, Fumitaka
    ARABIC AND CHINESE HANDWRITING RECOGNITION, 2008, 4768 : 251 - +
  • [27] Handwritten Kannada Character Recognition using Wavelet Transform and Structural Features
    Pasha, Salecm
    Padma, M. C.
    2015 INTERNATIONAL CONFERENCE ON EMERGING RESEARCH IN ELECTRONICS, COMPUTER SCIENCE AND TECHNOLOGY (ICERECT), 2015, : 346 - 351
  • [28] Kannada Confusing Character Recognition and Classification Using Random Forest and SVM
    Rani, Shobha N.
    Nair, Bipin B. J.
    Athira, M. R.
    Prajwal, M. L.
    ICSPC'21: 2021 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICPSC), 2021, : 537 - 541
  • [29] Segmentation-free composite character recognition (CR) in bilingual handwritten text for Gurumukhi-English scripts
    Kaur, Sukhandeep
    Bawa, Seema
    Kumar, Ravinder
    SOFT COMPUTING, 2023, 27 (21) : 16159 - 16178
  • [30] Kannada Character Recognition Using Multi-Class SVM Method
    Dutta, Kusumika Krori
    Swamy, Sunny Arokia.
    Banerjee, Anushua
    Rashi, Divya B.
    Chandan, R.
    Vaprani, Deepak
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 405 - 409