An innovative network based on double receptive field and Recursive Bi-directional Long Short-Term Memory

被引:1
|
作者
Meng, Peng-fei [1 ]
Jia, Shuang-cheng [1 ]
Li, Qian [1 ]
机构
[1] Mogo Auto Intelligence & Telemat Informat Technol, Beijing, Peoples R China
关键词
D O I
10.1038/s41598-021-01520-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Sequence recognition of natural scene images has always been an important research topic in the field of computer vision. CRNN has been proven to be a popular end-to-end character sequence recognition network. However, the problem of wide characters is not considered under the setting of CRNN. The CRNN is less effective in recognizing long dense small characters. Aiming at the shortcomings of CRNN, we proposed an improved CRNN network, named CRNN-RES, based on BiLSTM and multiple receptive fields. Specifically, on the one hand, the CRNN-RES uses a dual pooling core to enhance the CNN network's ability to extract features. On the other hand, by improving the last RNN layer, the BiLSTM is changed to a shared parameter BiLSTM network using recursive residuals, which reduces the number of network parameters and improves the accuracy. In addition, we designed a structure that can flexibly configure the length of the input data sequence in the RNN layer, called the CRFC layer. Comparing the CRNN-RES network proposed in this paper with the original CRNN network, the extensive experiments show that when recognizing English characters and numbers, the parameters of CRNN-RES is 8197549, which decreased 133,752 parameters compare with CRNN. In the public dataset ICDAR 2003 (IC03), ICDAR 2013 (IC13), IIIT 5k-word (IIIT5k), and Street View Text (SVT), the CRNN-RES obtain the accuracy of 96.90%, 89.85%, 83.63%, and 82.96%, which higher than CRNN by 1.40%, 3.15%, 5.43%, and 2.16% respectively.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Arrhythmia Classification Based on Bi-Directional Long Short-Term Memory and Multi-Task Group Method
    Munawar, Shaik
    Angappan, Geetha
    Konda, Srinivas
    INTERNATIONAL JOURNAL OF E-COLLABORATION, 2023, 19 (01)
  • [32] Prediction and Diagnosis of Respiratory Disease by Combining Convolutional Neural Network and Bi-directional Long Short-Term Memory Methods
    Li, Li
    Ayiguli, Alimu
    Luan, Qiyun
    Yang, Boyi
    Subinuer, Yilamujiang
    Gong, Hui
    Zulipikaer, Abudureherman
    Xu, Jingran
    Zhong, Xuemei
    Ren, Jiangtao
    Zou, Xiaoguang
    FRONTIERS IN PUBLIC HEALTH, 2022, 10
  • [33] State of Charge Estimation of Lithium-Ion Batteries Using Long Short-Term Memory and Bi-directional Long Short-Term Memory Neural Networks
    Namboothiri K.M.
    Sundareswaran K.
    Nayak P.S.R.
    Simon S.P.
    Journal of The Institution of Engineers (India): Series B, 2024, 105 (01) : 175 - 182
  • [34] A deep bi-directional long-short term memory neural network-based methodology to enhance short-term electricity load forecasting for residential applications
    Atef, Sara
    Nakata, Kazuhide
    Eltawil, Amr B.
    COMPUTERS & INDUSTRIAL ENGINEERING, 2022, 170
  • [35] An energy prediction approach using bi-directional long short-term memory for a hydropower plant in Laos
    Kaewarsa, Suriya
    Kongpaseuth, Vanhkham
    ELECTRICAL ENGINEERING, 2024, 106 (03) : 2609 - 2625
  • [36] Runoff Forecasting using Convolutional Neural Networks and optimized Bi-directional Long Short-term Memory
    Junhao Wu
    Zhaocai Wang
    Yuan Hu
    Sen Tao
    Jinghan Dong
    Water Resources Management, 2023, 37 : 937 - 953
  • [37] Fake news detection system based on modified bi-directional long short term memory
    Chetan Agrawal
    Anjana Pandey
    Sachin Goyal
    Multimedia Tools and Applications, 2022, 81 : 24199 - 24223
  • [38] Deep learning reservoir porosity prediction method based on a spatiotemporal convolution bi-directional long short-term memory neural network model
    Wang, Jun
    Cao, Junxing
    Yuan, Shan
    GEOMECHANICS FOR ENERGY AND THE ENVIRONMENT, 2022, 32
  • [39] A novel hybrid model integrating residual structure and bi-directional long short-term memory network for tool wear monitoring
    Zhang, Ning
    Chen, Enping
    Wu, Yukang
    Guo, Baosu
    Jiang, Zhanpeng
    Wu, Fenghe
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2022, 120 (9-10): : 6707 - 6722
  • [40] Application of bi-directional long-short-term memory network in cognitive age prediction based on EEG signals
    Wong, Shi-Bing
    Tsao, Yu
    Tsai, Wen-Hsin
    Wang, Tzong-Shi
    Wu, Hsin-Chi
    Wang, Syu-Siang
    SCIENTIFIC REPORTS, 2024, 14 (01)