Hierarchical Recognition System for Machine Printed Kannada Characters

被引:0
|
作者
Achaya, Dinesh U. [1 ]
Reddy, N. V. Subba [1 ]
Krishnamoorthi [1 ]
机构
[1] Manipal Univ, Manipal Inst Technol, Dept Comp Sci & Engn, Manipal 576104, Karnataka, India
关键词
Character recognition; Structural features; Direction code; Binary decision tree; k-Nearest Neighbor; Multi-stage classifier;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extensive research has been done on optical character recognition in the last few decades. Most of the efforts were made to develop OCR systems for foreign languages like English, Japanese, Roman and Arabic characters. Many commercial OCR systems for these foreign languages are available in the market. In the context of Indian languages, majority of work is reported on Hindi and Bangla. And very few reports are available on South Indian languages. This paper describes a character recognition system that can handle machine printed text documents in Kannada, which is the official language of the South Indian state of Karnataka. Initially, the scanned image is preprocessed to remove noise. Lines, words and character components are segmented using two-stage segmentation technique. Classification of the character components is done in two stages. In the first stage, the character components are grouped into small subsets by a feature based tree classifier. In the second stage, characters in each group are recognized using a nearest neighbor classifier. We adopted this hybrid approach instead of using only a tree classifier because it is nearly impossible to find a set of stroke features that are simple to compute, robust and reliable to detect, and are sufficient to classify a large number of basic and complex shaped compound characters. The system is tested with the data set containing 8400 characters of different font and size. On average, the system recognizes characters with an accuracy of about 92.68%.
引用
收藏
页码:44 / 53
页数:10
相关论文
共 50 条
  • [31] Recognition of hand-printed Chinese characters using decision trees/machine learning C4.5 system
    Amin, A
    Singh, S
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 1998, 1 (02) : 130 - 141
  • [32] MACHINE RECOGNITION OF HANDPRINTED CHARACTERS
    COATES, CL
    NOE, P
    [J]. NAVAL RESEARCH REVIEWS, 1972, 25 (08): : 13 - &
  • [33] Handwritten and Machine Printed Text Separation from Kannada Document Images
    Pardeshi, Rajmohan
    Hangarge, Mallikarjun
    Doddamani, Srikanth
    Santosh, K. C.
    [J]. PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO'16), 2016,
  • [34] Radial basis function and subspace approach for printed Kannada text recognition
    Vijaykumar, B
    Ramakrishnan, AG
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: DESIGN AND IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS INDUSTRY TECHNOLOGY TRACKS MACHINE LEARNING FOR SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING SIGNAL PROCESSING FOR EDUCATION, 2004, : 321 - 324
  • [35] Kannada Speech Recognition System for Aphasic people
    Aishwarya, Jaya
    Kundapur, Poornima Panduranga
    Kumar, Sampath
    Hareesha, K. S.
    [J]. 2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, : 1753 - 1756
  • [36] Read and Recognition of old Kannada Stone Inscriptions Characters using Novel Algorithm
    Rajithkumar, B. K.
    Mohana, H. S.
    Uday, J.
    Bhavana, M. B.
    Anusha, L. S.
    [J]. 2015 INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICCICCT), 2015, : 284 - 288
  • [37] A NOTE ON RECOGNITION OF HAND-PRINTED CHARACTERS
    TUFFILL, HW
    [J]. INFORMATION AND CONTROL, 1961, 4 (2-3): : 197 - &
  • [38] A HEURISTIC ALGORITHM FOR THE RECOGNITION OF PRINTED CHINESE CHARACTERS
    CHUANG, CT
    TSENG, LY
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1995, 25 (04): : 710 - 717
  • [39] Kannada Word Recognition System Using HTK
    Ananthakrishna, T.
    Maithri, M.
    Shama, Kumara
    [J]. 2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [40] SEGMENTATION OF TOUCHING CHARACTERS IN PRINTED DOCUMENT RECOGNITION
    LIANG, S
    SHRIDHAR, M
    AHMADI, M
    [J]. PATTERN RECOGNITION, 1994, 27 (06) : 825 - 840