Deformable Mixed Domain Attention Network for Scene Text Recognition

被引:0
|
作者
Huang, Yangyang [1 ]
Fang, Wei [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Dept Comp Sci, Beijing, Peoples R China
关键词
scene text recognition; deformable convolution; attention mechanism; center loss;
D O I
10.1109/icsess49938.2020.9237645
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
As a hot research area in computer vision in recent years, scene text recognition is still challenging due to the large variance in irregular text. The current methods treat the recognition process as a sequence-to-sequence task and solve it by an encoder-decoder framework. In this work, we propose a DMDAN for robust scene text recognition. First, we utilize deformable convolution to strengthen the ability to adapt to irregular text. Then, mix domain visual attention and self-attention are respectively employed in the encoder and decoder, which can effectively alleviate the problem of "attention drifting". Finally, we integrate the center loss to reduce the intra-class distances and make each class easier to distinguish. Extensive experimental results show that our model outperforms the baseline CRNN a lot and achieves a comparable performance against existing attention-based methods on both regular and irregular datasets.
引用
收藏
页码:142 / 145
页数:4
相关论文
共 50 条
  • [21] STAN: A sequential transformation attention-based network for scene text recognition
    Lin, Qingxiang
    Luo, Canjie
    Jin, Lianwen
    Lai, Songxuan
    PATTERN RECOGNITION, 2021, 111
  • [22] Parallel Scale-wise Attention Network for Effective Scene Text Recognition
    Sajid, Usman
    Chow, Michael
    Zhang, Jin
    Kim, Taejoon
    Wang, Guanghui
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [23] Review network for scene text recognition
    Li, Shuohao
    Han, Anqi
    Chen, Xu
    Yin, Xiaoqing
    Zhang, Jun
    JOURNAL OF ELECTRONIC IMAGING, 2017, 26 (05)
  • [24] Text proposals with location-awareness-attention network for arbitrarily shaped scene text detection and recognition
    Zhong, Dajian
    Lyu, Shujing
    Shivakumara, Palaiahankote
    Pal, Umapada
    Lu, Yue
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 205
  • [25] SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text Recognition
    Zhong, Dajian
    Lyu, Shujing
    Shivakumara, Palaiahnakote
    Yin, Bing
    Wu, Jiajia
    Pal, Umapada
    Lu, Yue
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 464 - 480
  • [26] Attention-Based Deep Neural Network and Its Application to Scene Text Recognition
    He, Haizhen
    Li, Jiehan
    2019 IEEE 11TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2019), 2019, : 672 - 677
  • [27] CAMTNet: CTC-Attention Mechanism and Transformer Fusion Network for Scene Text Recognition
    Wang, Ling
    Luo, Kexin
    Wang, Peng
    Bai, Yane
    IAENG International Journal of Computer Science, 2024, 51 (11) : 1750 - 1760
  • [28] Sequential alignment attention model for scene text recognition
    Wu, Yan
    Fan, Jiaxin
    Tao, Renshuai
    Wang, Jiakai
    Qin, Haotong
    Liu, Aishan
    Liu, Xianglong
    Tao, Renshuai (rstao@buaa.edu.cn), 1600, Academic Press Inc. (80):
  • [29] FACLSTM: ConvLSTM with focused attention for scene text recognition
    Wang, Qingqing
    Huang, Ye
    Jia, Wenjing
    He, Xiangjian
    Blumenstein, Michael
    Lyu, Shujing
    Lu, Yue
    SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (02)
  • [30] SCENE TEXT RECOGNITION VIA GATED CASCADE ATTENTION
    Wang, Siwei
    Wang, Yongtao
    Qin, Xiaoran
    Zhao, Qijie
    Tang, Zhi
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1018 - 1023