An innovative network based on double receptive field and Recursive Bi-directional Long Short-Term Memory

被引:1
|
作者
Meng, Peng-fei [1 ]
Jia, Shuang-cheng [1 ]
Li, Qian [1 ]
机构
[1] Mogo Auto Intelligence & Telemat Informat Technol, Beijing, Peoples R China
关键词
D O I
10.1038/s41598-021-01520-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Sequence recognition of natural scene images has always been an important research topic in the field of computer vision. CRNN has been proven to be a popular end-to-end character sequence recognition network. However, the problem of wide characters is not considered under the setting of CRNN. The CRNN is less effective in recognizing long dense small characters. Aiming at the shortcomings of CRNN, we proposed an improved CRNN network, named CRNN-RES, based on BiLSTM and multiple receptive fields. Specifically, on the one hand, the CRNN-RES uses a dual pooling core to enhance the CNN network's ability to extract features. On the other hand, by improving the last RNN layer, the BiLSTM is changed to a shared parameter BiLSTM network using recursive residuals, which reduces the number of network parameters and improves the accuracy. In addition, we designed a structure that can flexibly configure the length of the input data sequence in the RNN layer, called the CRFC layer. Comparing the CRNN-RES network proposed in this paper with the original CRNN network, the extensive experiments show that when recognizing English characters and numbers, the parameters of CRNN-RES is 8197549, which decreased 133,752 parameters compare with CRNN. In the public dataset ICDAR 2003 (IC03), ICDAR 2013 (IC13), IIIT 5k-word (IIIT5k), and Street View Text (SVT), the CRNN-RES obtain the accuracy of 96.90%, 89.85%, 83.63%, and 82.96%, which higher than CRNN by 1.40%, 3.15%, 5.43%, and 2.16% respectively.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] An innovative network based on double receptive field and Recursive Bi-directional Long Short-Term Memory
    Pengfei Meng
    Shuangcheng Jia
    Qian Li
    [J]. Scientific Reports, 11
  • [2] Separating overlapping bat calls with a bi-directional long short-term memory network
    Zhang, Kangkang
    Liu, Tong
    Song, Shengjing
    Zhao, Xin
    Sun, Shijun
    Metzner, Walter
    Feng, Jiang
    Liu, Ying
    [J]. INTEGRATIVE ZOOLOGY, 2022, 17 (05): : 741 - 751
  • [3] Learning to Track by Bi-directional Long Short-Term Memory Networks
    Pan, Chen
    Shi, Dianxi
    Guan, Naiyang
    Zhang, Yongjun
    Wang, Liujing
    Jin, Songchang
    [J]. 2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 783 - 790
  • [4] Diabetes Prediction Using Bi-directional Long Short-Term Memory
    Jaiswal S.
    Gupta P.
    [J]. SN Computer Science, 4 (4)
  • [5] Daily Peak Load Prediction Based on Correlation Analysis and Bi-directional Long Short-term Memory Network
    Li Y.
    Liu X.
    Xing F.
    Wen G.
    Lu N.
    He H.
    Jiao R.
    [J]. Dianwang Jishu/Power System Technology, 2021, 45 (07): : 2719 - 2730
  • [6] Short-term Load Forecasting in Renewable Energy Grid Based on Bi-directional Long Short-term Memory Network Considering Feature Selection
    Yang L.
    Wu H.
    Ding M.
    Bi R.
    [J]. Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2021, 45 (03): : 166 - 173
  • [7] Deep Bi-directional Long Short-Term Memory Model for Short-Term Traffic Flow Prediction
    Wang, Jingyuan
    Hu, Fei
    Li, Li
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2017, PT V, 2017, 10638 : 306 - 316
  • [8] Sarcasm detection using optimized bi-directional long short-term memory
    Sukhavasi, Vidyullatha
    Sistla, Venkatrama Phani kumar
    Dondeti, Venkatesulu
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024, : 2771 - 2799
  • [9] Sensing Incipient Faults in Power Transformers Using Bi-Directional Long Short-Term Memory Network
    Das, Suchandan
    Paramane, Ashish
    Chatterjee, Soumya
    Rao, Ungarala Mohan
    [J]. IEEE SENSORS LETTERS, 2023, 7 (01)
  • [10] Classification of cardiac arrhythmia using a convolutional neural network and bi-directional long short-term memory
    Ul Hassan, Shahab
    Zahid, Mohd S. Mohd
    Abdullah, Talal A. A.
    Husain, Khaleel
    [J]. DIGITAL HEALTH, 2022, 8