Adaptive embedding gate for attention-based scene text recognition

被引:26
|
作者
Chen, Xiaoxue [1 ]
Wang, Tianwei [1 ]
Zhu, Yuanzhi [1 ]
Jin, Lianwen [1 ,2 ]
Luo, Canjie [1 ]
机构
[1] South China Univ Technol, Coll Elect & Informat Engn, Guangzhou, Peoples R China
[2] SCUT Zhuhai Inst Modern Ind Innovat, Zhuhai, Peoples R China
关键词
Deep learning; Scene text recognition; Attention mechanism; NEURAL-NETWORK;
D O I
10.1016/j.neucom.2019.11.049
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene text recognition has attracted particular research interest because it is a very challenging problem and has various applications. The most cutting-edge methods are attentional encoder-decoder frameworks that learn the alignment between the input image and output sequences. In particular, the decoder recurrently outputs predictions, using the prediction of the previous step as a guidance for every time step. In this study, we point out that the inappropriate use of previous predictions in existing attentional decoders restricts the recognition performance and brings instability. To handle this problem, we propose a novel module, namely adaptive embedding gate (AEG). The proposed AEG focuses on introducing high-order character language models to attentional decoders by controlling the information transmission between adjacent characters. AEG is a flexible module and can be easily integrated into the state-of-the-art attentional decoders for scene text recognition. We evaluate its effectiveness as well as robustness on a number of standard benchmarks, including the IIIT5K, SVT, SVT-P, CUTE80, and ICDAR datasets. Experimental results demonstrate that AEG can significantly boost recognition performance and bring better robustness. (C) 2019 The Author(s). Published by Elsevier B.V.
引用
收藏
页码:261 / 271
页数:11
相关论文
共 50 条
  • [1] An Attention-Based Convolutional Recurrent Neural Networks for Scene Text Recognition
    Alshawi, Adil Abdullah Abdulhussein
    Tanha, Jafar
    Balafar, Mohammad Ali
    [J]. IEEE ACCESS, 2024, 12 : 8123 - 8134
  • [2] STAN: A sequential transformation attention-based network for scene text recognition
    Lin, Qingxiang
    Luo, Canjie
    Jin, Lianwen
    Lai, Songxuan
    [J]. PATTERN RECOGNITION, 2021, 111
  • [3] Scene Text Recognition by Attention Network with Gated Embedding
    Wang, Cong
    Liu, Cheng-Lin
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [4] Attention-Based Deep Neural Network and Its Application to Scene Text Recognition
    He, Haizhen
    Li, Jiehan
    [J]. 2019 IEEE 11TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2019), 2019, : 672 - 677
  • [5] Attention-based Text Recognition in the Wild
    Yan, Zhi-Chen
    Yu, Stephanie A.
    [J]. PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON DEEP LEARNING THEORY AND APPLICATIONS (DELTA), 2020, : 42 - 49
  • [6] CarveNet: a channel-wise attention-based network for irregular scene text recognition
    Guibin Wu
    Zheng Zhang
    Yongping Xiong
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2022, 25 : 177 - 186
  • [7] CarveNet: a channel-wise attention-based network for irregular scene text recognition
    Wu, Guibin
    Zhang, Zheng
    Xiong, Yongping
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2022, 25 (3) : 177 - 186
  • [8] Attention-Based CNN-RNN Arabic Text Recognition from Natural Scene Images
    Butt, Hanan
    Raza, Muhammad Raheel
    Ramzan, Muhammad Javed
    Ali, Muhammad Junaid
    Haris, Muhammad
    [J]. FORECASTING, 2021, 3 (03): : 520 - 540
  • [9] Attention-Based Scene Text Detection on Dual Feature Fusion
    Li, Yuze
    Silamu, Wushour
    Wang, Zhenchao
    Xu, Miaomiao
    [J]. SENSORS, 2022, 22 (23)
  • [10] Short Text Embedding Autoencoders With Attention-Based Neighborhood Preservation
    Wei, Chao
    Zhu, Lijun
    Shi, Jiaoxiang
    [J]. IEEE ACCESS, 2020, 8 : 223156 - 223171