Machine recognition of printed Kannada text

被引:0
|
作者
Kumar, BV [1 ]
Ramakrishnan, AG [1 ]
机构
[1] Indian Inst Sci, Dept Elect Engn, Bangalore 560012, Karnataka, India
来源
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents the design of a full fledged OCR system for printed Kannada text. The machine recognition of Kannada characters is difficult due to similarity in the shapes of different characters, script complexity and non-uniqueness in the representation of diacritics. The document image is subject to line segmentation, word segmentation and zone detection. From the zonal information, base characters, vowel modifiers and consonant conjucts are separated. Knowledge based approach is employed for recognizing the base characters. Various features are employed for recognising the characters. These include the coefficients of the Discrete Cosine Transform, Discrete Wavelet Transform and Karhunen-Louve Transform. These features are fed to different classifiers. Structural features are used in the subsequent levels to discriminate confused characters. Use of structural features, increases recognition rate from 93% to 98%. Apart from the classical pattern classification technique of nearest neighbour, Artificial Neural Network (ANN) based classifiers like Back Propogation and Radial Basis Function (RBF) Networks have also been studied. The ANN classifiers are trained in supervised mode using the transform features. Highest recognition rate of 99% is obtained with RBF using second level approximation coefficients of Haar wavelets as the features on presegmented base characters.
引用
收藏
页码:37 / 48
页数:12
相关论文
共 50 条
  • [1] Hierarchical Recognition System for Machine Printed Kannada Characters
    Achaya, Dinesh U.
    Reddy, N. V. Subba
    Krishnamoorthi
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (11): : 44 - 53
  • [2] Wavelet descriptors for recognition of basic symbols in printed Kannada text
    Kunte, R. Sanjeev
    Samuel, R. D. Sudhaker
    [J]. INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2007, 5 (02) : 351 - 367
  • [3] Handwritten and Machine Printed Text Separation from Kannada Document Images
    Pardeshi, Rajmohan
    Hangarge, Mallikarjun
    Doddamani, Srikanth
    Santosh, K. C.
    [J]. PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO'16), 2016,
  • [4] Radial basis function and subspace approach for printed Kannada text recognition
    Vijaykumar, B
    Ramakrishnan, AG
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: DESIGN AND IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS INDUSTRY TECHNOLOGY TRACKS MACHINE LEARNING FOR SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING SIGNAL PROCESSING FOR EDUCATION, 2004, : 321 - 324
  • [5] OCR for printed Kannada text to Machine editable format using Database approach
    Sagar, B. M.
    Shobha, G.
    Ramakanth, P. Kumar
    [J]. PROCEEDINGS OF THE 9TH WSEAS INTERNATIONAL CONFERENCE ON AUTOMATION AND INFORMATION, 2008, : 322 - +
  • [6] MACHINE RECOGNITION AND CORRECTION OF PRINTED ARABIC TEXT
    AMIN, A
    MARI, JF
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1989, 19 (05): : 1300 - 1306
  • [7] A simple and efficient optical character recognition system for basic symbols in printed Kannada text
    Kunte, R. Sanjeev
    Samuel, R. D. Sudhaker
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2007, 32 (05): : 521 - 533
  • [8] MACHINE RECOGNITION OF OPTICALLY CAPTURED MACHINE PRINTED ARABIC TEXT
    ELKHALY, F
    SIDAHMED, MA
    [J]. PATTERN RECOGNITION, 1990, 23 (11) : 1207 - 1214
  • [9] Recognition of printed Arabic text via machine learning
    Amin, A
    [J]. INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, 1999, : 317 - 326
  • [10] Recognition of printed Arabic text using machine learning
    Amin, A
    [J]. DOCUMENT RECOGNITION V, 1998, 3305 : 62 - 71