Printed Text Recognition using BLSTM and MDLSTM for Indian languages

被引:0
|
作者
Chavan, Vishal [1 ]
Malage, Abhijit [1 ]
Mehrotra, Kapil [1 ]
Gupta, Manish Kumar [1 ]
机构
[1] C DAC, Pune, Maharashtra, India
关键词
Recurrent Neural Network; Optical Character Recognition; Bidirectional LSTM; Multidimensional LSTM; OCR SYSTEM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we evaluated the recognition performance of BLSTM (Bidirectional LSTM) and MDLSTM (two-dimensional LSTM) neural network architecture on printed documents. We also compare the performance of 2 architectures with tesseract on same test bed. We demonstrate our experimentation on 7 Indian languages i.e. Hindi, Marathi, Tamil, Kannada, Malayalam, Bangla and Gurumukhi. The input to both the architecture will be segmented lines. The data-set used contains approximate 5000 pages for each language which then divided into train, validation and test set. The Histogram of Gradients are extracted at line level to feed into the BLSTM network. Whereas MDLSTM processes 2D image (raw pixels) of each line. The level and number of hidden layers in both the architectures are empirically selected and kept same for all the languages. The output CTC layer will contain the number of unicode present in the evaluated languages and one blank label. The input layer was fully connected to hidden layers, and these were fully connected to themselves and to the output layer. The validated result shows MDLSTM outperforms both BLSTM and tesseract for all the languages included in our experimentation.
引用
收藏
页码:345 / 350
页数:6
相关论文
共 50 条
  • [1] Recognition of Printed Devanagari Text Using BLSTM Neural Network
    Sankaran, Naveen
    Jawahar, C. V.
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 322 - 325
  • [2] Collaborative Deep Neural Network for Printed Text Recognition of Indian Languages
    Mehrotra, Kapil
    Gupta, Manish Kumar
    Khajuria, Karan
    2019 FIFTH INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP 2019), 2019, : 252 - 256
  • [3] Text Recognition using Deep BLSTM Networks
    Ray, Anupama
    Rajeswar, Sai
    Chaudhury, Santanu
    2015 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2015, : 207 - +
  • [4] Transfer Learning for Scene Text Recognition in Indian Languages
    Gunna, Sanjana
    Saluja, Rohit
    Jawahar, C., V
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021 WORKSHOPS, PT I, 2021, 12916 : 182 - 197
  • [5] Transfer Learning for Scene Text Recognition in Indian Languages
    Gunna, Sanjana
    Saluja, Rohit
    Jawahar, C.V.
    arXiv, 2022,
  • [6] Text-Dependent Speaker Recognition System for Indian Languages
    Rao, R. Rajeswara
    Nagesh, A.
    Prasad, Kamakshi
    Babu, K. Ephraim
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2007, 7 (11): : 65 - 71
  • [7] BLSTM-based handwritten text recognition using Web resources
    Oprean, Cristina
    Likforman-Sulem, Laurence
    Mokbel, Chafic
    Popescu, Adrian
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 466 - 470
  • [8] Offline Handwritten Text Recognition Using Hybrid CNN-BLSTM Network
    Namdeo, Rahul Kumar
    Gupta, Chetan
    Shrivastava, Ritu
    Proceedings - 2022 IEEE 11th International Conference on Communication Systems and Network Technologies, CSNT 2022, 2022, : 318 - 323
  • [9] Improving Scene Text Recognition for Indian Languages with Transfer Learning and Font Diversity
    Gunna, Sanjana
    Saluja, Rohit
    Jawahar, Cheerakkuzhi Veluthemana
    JOURNAL OF IMAGING, 2022, 8 (04)
  • [10] Recognition of printed Arabic text using neural networks
    Amin, A
    Mansoor, W
    PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, 1997, : 612 - 615