Hierarchical Recognition System for Machine Printed Kannada Characters

被引:0
|
作者
Achaya, Dinesh U. [1 ]
Reddy, N. V. Subba [1 ]
Krishnamoorthi [1 ]
机构
[1] Manipal Univ, Manipal Inst Technol, Dept Comp Sci & Engn, Manipal 576104, Karnataka, India
关键词
Character recognition; Structural features; Direction code; Binary decision tree; k-Nearest Neighbor; Multi-stage classifier;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extensive research has been done on optical character recognition in the last few decades. Most of the efforts were made to develop OCR systems for foreign languages like English, Japanese, Roman and Arabic characters. Many commercial OCR systems for these foreign languages are available in the market. In the context of Indian languages, majority of work is reported on Hindi and Bangla. And very few reports are available on South Indian languages. This paper describes a character recognition system that can handle machine printed text documents in Kannada, which is the official language of the South Indian state of Karnataka. Initially, the scanned image is preprocessed to remove noise. Lines, words and character components are segmented using two-stage segmentation technique. Classification of the character components is done in two stages. In the first stage, the character components are grouped into small subsets by a feature based tree classifier. In the second stage, characters in each group are recognized using a nearest neighbor classifier. We adopted this hybrid approach instead of using only a tree classifier because it is nearly impossible to find a set of stroke features that are simple to compute, robust and reliable to detect, and are sufficient to classify a large number of basic and complex shaped compound characters. The system is tested with the data set containing 8400 characters of different font and size. On average, the system recognizes characters with an accuracy of about 92.68%.
引用
收藏
页码:44 / 53
页数:10
相关论文
共 50 条
  • [1] Machine recognition of printed Kannada text
    Kumar, BV
    Ramakrishnan, AG
    [J]. DOCUMENT ANALYSIS SYSTEM V, PROCEEDINGS, 2002, 2423 : 37 - 48
  • [2] Symbolic representation of Kannada characters for recognition
    Vikram, T. N.
    Gowda, Chidananada K.
    Urs, Shalini R.
    [J]. PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, VOLS 1 AND 2, 2008, : 823 - 826
  • [3] Neural networks in the recognition of machine printed Arabic characters
    Bouslama, F
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 1999, 13 (03) : 395 - 414
  • [4] SEGMENTATION METHODS FOR RECOGNITION OF MACHINE-PRINTED CHARACTERS
    HOFFMAN, RL
    MCCULLOUGH, JW
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1971, 15 (02) : 153 - +
  • [5] Recognition of machine printed Arabic characters and numerals based on MCR
    Zidouri, ABC
    Chinveeraphan, S
    Sato, M
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1995, E78D (12): : 1649 - 1655
  • [6] MACHINE RECOGNITION OF PRINTED CHINESE CHARACTERS VIA TRANSFORMATION ALGORITHMS
    WANG, PP
    SHIAU, RC
    [J]. PATTERN RECOGNITION, 1973, 5 (04) : 303 - 321
  • [7] Recognition of Kannada characters using deep learning approach
    Indira, K.
    Karki, Maya, V
    Mallika, H.
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (03) : 2333 - 2346
  • [8] The CNN Based Machine-printed Traditional Mongolian Characters Recognition
    Hu, Hongwei
    Wei, Hongxi
    Liu, Zhenyu
    [J]. PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 3937 - 3941
  • [9] Pairwise coupling for machine recognition of hand-printed Japanese characters
    Roth, V
    Tsuda, K
    [J]. 2001 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2001, : 1120 - 1125
  • [10] RECOGNITION OF MACHINE PRINTED BROKEN ORIYA CHARACTERS USING SIFT FEATURES
    Sharma, Sandeepika
    Choudhray, Sneha
    Kumar, Bhupendra
    [J]. 6TH INTERNATIONAL CONFERENCE ON COMPUTER & COMMUNICATION TECHNOLOGY (ICCCT-2015), 2015, : 106 - 109