A Zone Based Character Recognition Engine for Kannada and English Scripts

被引:1
|
作者
Mukarambi, Gururaj [1 ]
Dhandra, B. V. [1 ]
Hangarge, Mallikarjun [2 ]
机构
[1] Gulbarga Univ, Dept PG Studies & Res Comp Sci, Gulbarga 585106, Karnataka, India
[2] Sci & Commerece Coll, Karnatak Arts, Dept Comp Sci, Bidar 585401, Karnataka, India
关键词
OCR; SVM; document image analysis;
D O I
10.1016/j.proeng.2012.06.381
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, an Optical Character Recognition engine for Kannada and English character recognition is proposed based on zone features. The zone is one of the old concepts in case of document image analysis research. But this method is good in case of Kannada and English character recognition. The total of 2800 Kannada consonants and 2300 English lowercase alphabets sample images are classified based on the SVM classifier. All preprocessed images are normalized into 32 x 32 dimensions, it is optimum. Then the preprocessed image is divided into 64 zones of non overlapping and zone based pixel density is calculated for each of the 64 zones, there by generating 64 features. These features are fed to the SVM classifier for classification of character images. To test the performance of an algorithm 2 fold cross validation is used. The average recognition accuracy of 73.33% and 96.13% is obtained for Kannada consonants and English lowercase alphabets respectively. Further the average percentage of recognition accuracy of 83.02% is obtained for mixture input of both Kannada and English characters. The recognition accuracy obtained for Kannada consonants is low, because most of the characters are similar in shape. Hence, one may need to add some more dominating features to discriminating the characters. In this direction, the work is in progress. It is an initial attempt for mixture of Kannada and English characters recognition with single algorithm. The novelty of the algorithm is independent of thinning and slant of the characters. (C) 2012 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of Noorul Islam Centre for Higher Education
引用
收藏
页码:3292 / 3299
页数:8
相关论文
共 50 条
  • [31] Isolated Kannada Character Recognition using Densely Connected Convolutional Network
    Sandhya, S.
    Geetha, V
    2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 137 - 142
  • [32] Offline Kannada Handwritten Character Recognition Using Convolutional Neural Networks
    Ramesh, G.
    Sharma, Ganesh N.
    Balaji, J. Manoj
    Champa, H. N.
    2019 5TH IEEE INTERNATIONAL WIE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (WIECON-ECE 2019), 2019,
  • [33] The optical character recognition of Urdu-like cursive scripts
    Naz, Saeeda
    Hayat, Khizar
    Razzak, Muhammad Imran
    Anwar, Muhammad Waqas
    Madani, Sajjad A.
    Khan, Samee U.
    PATTERN RECOGNITION, 2014, 47 (03) : 1229 - 1248
  • [34] English Handwritten Character Recognition Based on Ensembled Machine Learning
    Zanwar S.R.
    Bhosale Y.H.
    Bhuyar D.L.
    Ahmed Z.
    Shinde U.B.
    Narote S.P.
    Journal of The Institution of Engineers (India): Series B, 2023, 104 (05) : 1053 - 1067
  • [35] A Convolution Neural Networks Based Character and Word Recognition System for Similar Script Languages Kannada and Telugu
    Hebbi, Chandravva
    Mamatha, H. R.
    Sahana, Y. S.
    Dhage, Sagar
    Somayaji, Shriram
    PROCEEDINGS OF ICETIT 2019: EMERGING TRENDS IN INFORMATION TECHNOLOGY, 2020, 605 : 306 - 317
  • [36] Stroke-Based Data Augmentation for Enhancing Optical Character Recognition of Ancient Handwritten Scripts
    Ayyoob, M. P.
    Ilyas, P. Muhamed
    IEEE ACCESS, 2024, 12 : 186794 - 186802
  • [37] Character Recognition using Conditional Random Field based Matching Engine
    Ray, Anupama
    Chandawala, Ankit
    Chaudhary, Santanu
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 18 - 22
  • [38] Handwritten Kannada Numerals Recognition by Using Zone Features and CNN Classifier
    Hallur, Vishweshwarayya C.
    Hegadi, Rajendra S.
    Hegadi, Ravindra S.
    INTERNATIONAL JOURNAL OF TECHNOLOGY AND HUMAN INTERACTION, 2019, 15 (04) : 63 - 79
  • [39] Hybrid manifold smoothing and label propagation technique for Kannada handwritten character recognition
    Ramesh, G.
    Shreyas, J.
    Balaji, J. Manoj
    Sharma, Ganesh N.
    Gururaj, H. L.
    Srinidhi, N. N.
    Askar, S. S.
    Abouhawwash, Mohamed
    FRONTIERS IN NEUROSCIENCE, 2024, 18
  • [40] Handwritten Kannada Vowel Character Recognition Using Crack Codes and Fourier Descriptors
    Rajput, Ganapatsingh G.
    Horakeri, Rajeswari
    MULTI-DISCIPLINARY TRENDS IN ARTIFICIAL INTELLIGENCE, 2011, 7080 : 169 - 180