Printed Text Recognition using BLSTM and MDLSTM for Indian languages

被引:0
|
作者
Chavan, Vishal [1 ]
Malage, Abhijit [1 ]
Mehrotra, Kapil [1 ]
Gupta, Manish Kumar [1 ]
机构
[1] C DAC, Pune, Maharashtra, India
关键词
Recurrent Neural Network; Optical Character Recognition; Bidirectional LSTM; Multidimensional LSTM; OCR SYSTEM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we evaluated the recognition performance of BLSTM (Bidirectional LSTM) and MDLSTM (two-dimensional LSTM) neural network architecture on printed documents. We also compare the performance of 2 architectures with tesseract on same test bed. We demonstrate our experimentation on 7 Indian languages i.e. Hindi, Marathi, Tamil, Kannada, Malayalam, Bangla and Gurumukhi. The input to both the architecture will be segmented lines. The data-set used contains approximate 5000 pages for each language which then divided into train, validation and test set. The Histogram of Gradients are extracted at line level to feed into the BLSTM network. Whereas MDLSTM processes 2D image (raw pixels) of each line. The level and number of hidden layers in both the architectures are empirically selected and kept same for all the languages. The output CTC layer will contain the number of unicode present in the evaluated languages and one blank label. The input layer was fully connected to hidden layers, and these were fully connected to themselves and to the output layer. The validated result shows MDLSTM outperforms both BLSTM and tesseract for all the languages included in our experimentation.
引用
收藏
页码:345 / 350
页数:6
相关论文
共 50 条
  • [41] ERIL: An Algorithm for Emotion Recognition from Indian Languages Using Machine Learning
    Mehra, Pramod
    Jain, Parag
    WIRELESS PERSONAL COMMUNICATIONS, 2022, 126 (03) : 2557 - 2577
  • [42] COMPUTER RECOGNITION OF HAND-PRINTED TEXT
    MUNSON, JH
    JOURNAL OF TYPOGRAPHIC RESEARCH, 1969, 3 (01): : 31 - +
  • [43] Database for Arabic Printed Text Recognition Research
    Jaiem, Faten Kallel
    Kanoun, Slim
    Khemakhem, Maher
    El Abed, Haikal
    Kardoun, Jihain
    IMAGE ANALYSIS AND PROCESSING (ICIAP 2013), PT 1, 2013, 8156 : 251 - 259
  • [44] ERIL: An Algorithm for Emotion Recognition from Indian Languages Using Machine Learning
    Pramod Mehra
    Parag Jain
    Wireless Personal Communications, 2022, 126 : 2557 - 2577
  • [45] On the problem of individual character recognition in a printed text
    Ardalionov, L.V.
    Simaranov, S.Yu.
    Izvestiya Akademii Nauk. Teoriya i Sistemy Upravleniya, 1993, (06): : 61 - 75
  • [46] Recognition of Handwritten Numerals of various Indian Regional Languages using Deep Learning
    Chaurasia, Saumya
    Agarwal, Suneeta
    2018 5TH IEEE UTTAR PRADESH SECTION INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING (UPCON), 2018, : 582 - 587
  • [47] Optical character recognition program for images of printed text using a neural network
    Ganapathy, Velappa
    Lean, Charles C. H.
    2006 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY, VOLS 1-6, 2006, : 1174 - +
  • [48] Multilingual Speech Recognition Using Language-Specific Phoneme Recognition as Auxiliary Task for Indian Languages
    Sailor, Hardik B.
    Hain, Thomas
    INTERSPEECH 2020, 2020, : 4756 - 4760
  • [49] Speech Emotion Recognition using XGBoost and CNN BLSTM with Attention
    He, Jingru
    Ren, Liyong
    2021 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, INTERNET OF PEOPLE, AND SMART CITY INNOVATIONS (SMARTWORLD/SCALCOM/UIC/ATC/IOP/SCI 2021), 2021, : 154 - 159
  • [50] An Algorithmic Approach for Text Recognition from Printed/Typed Text Images
    Agrawal, Neha
    Kaur, Arashdeep
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE CONFLUENCE 2018 ON CLOUD COMPUTING, DATA SCIENCE AND ENGINEERING, 2018, : 876 - 879