An innovative network based on double receptive field and Recursive Bi-directional Long Short-Term Memory

被引:1
|
作者
Meng, Peng-fei [1 ]
Jia, Shuang-cheng [1 ]
Li, Qian [1 ]
机构
[1] Mogo Auto Intelligence & Telemat Informat Technol, Beijing, Peoples R China
关键词
D O I
10.1038/s41598-021-01520-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Sequence recognition of natural scene images has always been an important research topic in the field of computer vision. CRNN has been proven to be a popular end-to-end character sequence recognition network. However, the problem of wide characters is not considered under the setting of CRNN. The CRNN is less effective in recognizing long dense small characters. Aiming at the shortcomings of CRNN, we proposed an improved CRNN network, named CRNN-RES, based on BiLSTM and multiple receptive fields. Specifically, on the one hand, the CRNN-RES uses a dual pooling core to enhance the CNN network's ability to extract features. On the other hand, by improving the last RNN layer, the BiLSTM is changed to a shared parameter BiLSTM network using recursive residuals, which reduces the number of network parameters and improves the accuracy. In addition, we designed a structure that can flexibly configure the length of the input data sequence in the RNN layer, called the CRFC layer. Comparing the CRNN-RES network proposed in this paper with the original CRNN network, the extensive experiments show that when recognizing English characters and numbers, the parameters of CRNN-RES is 8197549, which decreased 133,752 parameters compare with CRNN. In the public dataset ICDAR 2003 (IC03), ICDAR 2013 (IC13), IIIT 5k-word (IIIT5k), and Street View Text (SVT), the CRNN-RES obtain the accuracy of 96.90%, 89.85%, 83.63%, and 82.96%, which higher than CRNN by 1.40%, 3.15%, 5.43%, and 2.16% respectively.
引用
下载
收藏
页数:9
相关论文
共 50 条
  • [41] Application of bi-directional long-short-term memory network in cognitive age prediction based on EEG signals
    Wong, Shi-Bing
    Tsao, Yu
    Tsai, Wen-Hsin
    Wang, Tzong-Shi
    Wu, Hsin-Chi
    Wang, Syu-Siang
    SCIENTIFIC REPORTS, 2023, 13 (01):
  • [42] Fake news detection system based on modified bi-directional long short term memory
    Agrawal, Chetan
    Pandey, Anjana
    Goyal, Sachin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 24199 - 24223
  • [43] State of health estimation for lithium-ion battery based on Bi-directional long short-term memory neural network and attention mechanism
    Guo, Yu
    Yang, Dongfang
    Zhao, Kun
    Wang, Kai
    ENERGY REPORTS, 2022, 8 : 208 - 215
  • [44] Accurate Detection of Bearing Faults Using Difference Visibility Graph and Bi-Directional Long Short-Term Memory Network Classifier
    Roy, Sayanjit Singha
    Chatterjee, Soumya
    Roy, Saptarshi
    Bamane, Pradip
    Paramane, Ashish
    Rao, U. Mohan
    Nazir, Muhammad Tariq
    IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2022, 58 (04) : 4542 - 4551
  • [45] Application of bi-directional long-short-term memory network in cognitive age prediction based on EEG signals
    Shi-Bing Wong
    Yu Tsao
    Wen-Hsin Tsai
    Tzong-Shi Wang
    Hsin-Chi Wu
    Syu-Siang Wang
    Scientific Reports, 13 (1)
  • [46] Runoff Forecasting using Convolutional Neural Networks and optimized Bi-directional Long Short-term Memory
    Wu, Junhao
    Wang, Zhaocai
    Hu, Yuan
    Tao, Sen
    Dong, Jinghan
    WATER RESOURCES MANAGEMENT, 2023, 37 (02) : 937 - 953
  • [47] Bi-directional Long Short-Term Memory Model with Semantic Positional Attention for the Question Answering System
    Bi, Mingwen
    Zhang, Qingchuan
    Zuo, Min
    Xu, Zelong
    Jin, Qingyu
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (05)
  • [48] Deep Bi-directional Long Short-Term Memory Neural Networks for Sentiment Analysis of Social Data
    Ngoc Khuong Nguyen
    Anh-Cuong Le
    Hong Thai Pham
    INTEGRATED UNCERTAINTY IN KNOWLEDGE MODELLING AND DECISION MAKING, IUKM 2016, 2016, 9978 : 255 - 268
  • [49] A novel hybrid model integrating residual structure and bi-directional long short-term memory network for tool wear monitoring
    Ning Zhang
    Enping Chen
    Yukang Wu
    Baosu Guo
    Zhanpeng Jiang
    Fenghe Wu
    The International Journal of Advanced Manufacturing Technology, 2022, 120 : 6707 - 6722
  • [50] Opinion Summarisation using Bi-Directional Long-Short Term Memory
    Pabbi, Kethan
    Sindhu, C.
    2021 SIXTH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2021, : 256 - 259