Deformable Mixed Domain Attention Network for Scene Text Recognition

被引:0
|
作者
Huang, Yangyang [1 ]
Fang, Wei [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Dept Comp Sci, Beijing, Peoples R China
关键词
scene text recognition; deformable convolution; attention mechanism; center loss;
D O I
10.1109/icsess49938.2020.9237645
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
As a hot research area in computer vision in recent years, scene text recognition is still challenging due to the large variance in irregular text. The current methods treat the recognition process as a sequence-to-sequence task and solve it by an encoder-decoder framework. In this work, we propose a DMDAN for robust scene text recognition. First, we utilize deformable convolution to strengthen the ability to adapt to irregular text. Then, mix domain visual attention and self-attention are respectively employed in the encoder and decoder, which can effectively alleviate the problem of "attention drifting". Finally, we integrate the center loss to reduce the intra-class distances and make each class easier to distinguish. Extensive experimental results show that our model outperforms the baseline CRNN a lot and achieves a comparable performance against existing attention-based methods on both regular and irregular datasets.
引用
收藏
页码:142 / 145
页数:4
相关论文
共 50 条
  • [41] Look back again: Dual parallel attention network for accurate and robust scene text recognition
    Fu, Zilong
    Xie, Hongtao
    Jin, Guoqing
    Guo, Junbo
    ICMR 2021 - Proceedings of the 2021 International Conference on Multimedia Retrieval, 2021, : 638 - 644
  • [42] Flexible scene text recognition based on dual attention mechanism
    Tian, Zhiqiang
    Wang, Chunhui
    Xiao, Youzi
    Lin, Yuping
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (22):
  • [43] Recurrent Highway Networks with Attention Mechanism for Scene Text Recognition
    Yang, Haodong
    Li, Shuohao
    Yin, Xiaoqing
    Han, Anqi
    Zhang, Jun
    2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 315 - 322
  • [44] Memory-Augmented Attention Model for Scene Text Recognition
    Wang, Cong
    Yin, Fei
    Liu, Cheng-Lin
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 62 - 67
  • [45] Scene Text Recognition Based on Corner Point and Attention Mechanism
    Wang, Hui
    Hu, Tao
    Geng, Xiaoke
    Li, Kai
    PRICAI 2024: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2025, 15281 : 170 - 181
  • [46] Text Enhancement Network for Cross-Domain Scene Text Detection
    Deng, Jinhong
    Luo, Xiulian
    Zheng, Jiawen
    Dang, Wanli
    Li, Wen
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2203 - 2207
  • [47] CHARACTER REGION AWARENESS NETWORK FOR SCENE TEXT RECOGNITION
    Shang, Mingyu
    Gao, Jie
    Sun, Jun
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [48] weakly supervised text attention network for generating text proposals in scene images
    Li Rong
    En Mengyi
    Li Jianqiang
    Zhang haibin
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 324 - 330
  • [49] Scene Text Recognition via Dual-path Network with Shape-driven Attention Alignment
    Hu, Yijie
    Dong, Bin
    Huang, Kaizhu
    Ding, Lei
    Wang, Wei
    Huang, Xiaowei
    Wang, Qiu-Feng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (04)
  • [50] Scene Recognition Based on Recurrent Memorized Attention Network
    Shao, Xi
    Zhang, Xuan
    Tang, Guijin
    Bao, Bingkun
    ELECTRONICS, 2020, 9 (12) : 1 - 19