Deformable Mixed Domain Attention Network for Scene Text Recognition

被引:0
|
作者
Huang, Yangyang [1 ]
Fang, Wei [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Dept Comp Sci, Beijing, Peoples R China
关键词
scene text recognition; deformable convolution; attention mechanism; center loss;
D O I
10.1109/icsess49938.2020.9237645
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
As a hot research area in computer vision in recent years, scene text recognition is still challenging due to the large variance in irregular text. The current methods treat the recognition process as a sequence-to-sequence task and solve it by an encoder-decoder framework. In this work, we propose a DMDAN for robust scene text recognition. First, we utilize deformable convolution to strengthen the ability to adapt to irregular text. Then, mix domain visual attention and self-attention are respectively employed in the encoder and decoder, which can effectively alleviate the problem of "attention drifting". Finally, we integrate the center loss to reduce the intra-class distances and make each class easier to distinguish. Extensive experimental results show that our model outperforms the baseline CRNN a lot and achieves a comparable performance against existing attention-based methods on both regular and irregular datasets.
引用
收藏
页码:142 / 145
页数:4
相关论文
共 50 条
  • [1] Scene Text Recognition with Cascade Attention Network
    Zhang, Min
    Ma, Meng
    Wang, Ping
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 385 - 393
  • [2] Arbitrary-Shaped Scene Text Recognition with Deformable Ensemble Attention
    Xu, Shuo
    Zhuang, Zeming
    Li, Mingjun
    Su, Feng
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), (237-253):
  • [3] Gaussian Constrained Attention Network for Scene Text Recognition
    Qiao, Zhi
    Qin, Xugong
    Zhou, Yu
    Yang, Fei
    Wang, Weiping
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3328 - 3335
  • [4] Scene Text Recognition by Attention Network with Gated Embedding
    Wang, Cong
    Liu, Cheng-Lin
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [5] DENSE CHAINED ATTENTION NETWORK FOR SCENE TEXT RECOGNITION
    Gao, Yunze
    Chen, Yingying
    Wang, Jinqiao
    Tang, Ming
    Lu, Hanqing
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 679 - 683
  • [6] Spatial attention contrastive network for scene text recognition
    Wang, Fan
    Yin, Dong
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
  • [7] A holistic representation guided attention network for scene text recognition
    Yang, Lu
    Wang, Peng
    Li, Hui
    Li, Zhen
    Zhang, Yanning
    NEUROCOMPUTING, 2020, 414 : 67 - 75
  • [8] Deep neural network with attention model for scene text recognition
    Li, Shuohao
    Tang, Min
    Guo, Qiang
    Lei, Jun
    Zhang, Jun
    IET COMPUTER VISION, 2017, 11 (07) : 605 - 612
  • [9] EPAN: Effective parts attention network for scene text recognition
    Huang, Yunlong
    Sun, Zenghui
    Jin, Lianwen
    Luo, Canjie
    NEUROCOMPUTING, 2020, 376 (376) : 202 - 213
  • [10] A Two-Level Rectification Attention Network for Scene Text Recognition
    Wu, Lintai
    Xu, Yong
    Hou, Junhui
    Chen, C. L. Philip
    Liu, Cheng-Lin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2404 - 2414