Printed Text Recognition using BLSTM and MDLSTM for Indian languages

被引:0
|
作者
Chavan, Vishal [1 ]
Malage, Abhijit [1 ]
Mehrotra, Kapil [1 ]
Gupta, Manish Kumar [1 ]
机构
[1] C DAC, Pune, Maharashtra, India
关键词
Recurrent Neural Network; Optical Character Recognition; Bidirectional LSTM; Multidimensional LSTM; OCR SYSTEM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we evaluated the recognition performance of BLSTM (Bidirectional LSTM) and MDLSTM (two-dimensional LSTM) neural network architecture on printed documents. We also compare the performance of 2 architectures with tesseract on same test bed. We demonstrate our experimentation on 7 Indian languages i.e. Hindi, Marathi, Tamil, Kannada, Malayalam, Bangla and Gurumukhi. The input to both the architecture will be segmented lines. The data-set used contains approximate 5000 pages for each language which then divided into train, validation and test set. The Histogram of Gradients are extracted at line level to feed into the BLSTM network. Whereas MDLSTM processes 2D image (raw pixels) of each line. The level and number of hidden layers in both the architectures are empirically selected and kept same for all the languages. The output CTC layer will contain the number of unicode present in the evaluated languages and one blank label. The input layer was fully connected to hidden layers, and these were fully connected to themselves and to the output layer. The validated result shows MDLSTM outperforms both BLSTM and tesseract for all the languages included in our experimentation.
引用
收藏
页码:345 / 350
页数:6
相关论文
共 50 条
  • [31] Recent Trends in Text to Speech Synthesis of Indian Languages
    Joshi, Sarang L.
    Bairagi, Vinayak K.
    HELIX, 2019, 9 (03): : 4931 - 4936
  • [32] IndicSpeech: Text-to-Speech Corpus for Indian Languages
    Srivastava, Nimisha
    Mukhopadhyay, Rudrabha
    Prajwal, K. R.
    Jawahar, C., V
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6417 - 6422
  • [33] LCS based Text Steganography through Indian Languages
    Changder, S.
    Ghosh, D.
    Debnath, N. C.
    PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (ICCSIT 2010), VOL 8, 2010, : 53 - 57
  • [34] Improving MDLSTM for Offline Arabic Handwriting Recognition Using Dropout at Different Positions
    Maalej, Rania
    Kherallah, Monji
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887 : 431 - 438
  • [35] CNN-BLSTM Model for Arabic Text Recognition in Unconstrained Captured Identity Documents
    Ghanmi, Nabil
    Belhakimi, Amine
    Awal, Ahmad-Montaser
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2023 WORKSHOPS, PT I, 2024, 14365 : 106 - 118
  • [36] Languages, books, and reading from the printed word to the digital text
    Chartier, R
    CRITICAL INQUIRY, 2004, 31 (01) : 133 - 152
  • [37] Optical Character Recognition of Arabic Printed Text
    Taha, Safwa
    Babiker, Yusra
    Abbas, Mohamed
    2012 IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2012,
  • [38] Optical character recognition of arabic printed text
    Electrical and Electronics Engineering Department, University of Khartoum, Sudan
    SCOReD - IEEE Stud. Conf. Res. Dev., (235-240):
  • [39] RECOGNITION OF PRINTED TEXT UNDER REALISTIC CONDITIONS
    PAVLIDIS, T
    PATTERN RECOGNITION LETTERS, 1993, 14 (04) : 317 - 326
  • [40] MACHINE RECOGNITION AND CORRECTION OF PRINTED ARABIC TEXT
    AMIN, A
    MARI, JF
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1989, 19 (05): : 1300 - 1306