Deformable Mixed Domain Attention Network for Scene Text Recognition

被引：0

作者：

Huang, Yangyang ^{[1
]}

Fang, Wei ^{[1
]}

机构：

[1] Beijing Univ Posts & Telecommun, Dept Comp Sci, Beijing, Peoples R China

来源：

PROCEEDINGS OF 2020 IEEE 11TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2020) | 2020年

关键词：

scene text recognition; deformable convolution; attention mechanism; center loss;

D O I：

10.1109/icsess49938.2020.9237645

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

As a hot research area in computer vision in recent years, scene text recognition is still challenging due to the large variance in irregular text. The current methods treat the recognition process as a sequence-to-sequence task and solve it by an encoder-decoder framework. In this work, we propose a DMDAN for robust scene text recognition. First, we utilize deformable convolution to strengthen the ability to adapt to irregular text. Then, mix domain visual attention and self-attention are respectively employed in the encoder and decoder, which can effectively alleviate the problem of "attention drifting". Finally, we integrate the center loss to reduce the intra-class distances and make each class easier to distinguish. Extensive experimental results show that our model outperforms the baseline CRNN a lot and achieves a comparable performance against existing attention-based methods on both regular and irregular datasets.

引用

页码：142 / 145

页数：4

共 50 条

[41] Look back again: Dual parallel attention network for accurate and robust scene text recognition
Fu, Zilong
Xie, Hongtao
Jin, Guoqing
Guo, Junbo
ICMR 2021 - Proceedings of the 2021 International Conference on Multimedia Retrieval, 2021, : 638 - 644
[42] Flexible scene text recognition based on dual attention mechanism
Tian, Zhiqiang
Wang, Chunhui
Xiao, Youzi
Lin, Yuping
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (22):
[43] Recurrent Highway Networks with Attention Mechanism for Scene Text Recognition
Yang, Haodong
Li, Shuohao
Yin, Xiaoqing
Han, Anqi
Zhang, Jun
2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 315 - 322
[44] Memory-Augmented Attention Model for Scene Text Recognition
Wang, Cong
Yin, Fei
Liu, Cheng-Lin
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 62 - 67
[45] Scene Text Recognition Based on Corner Point and Attention Mechanism
Wang, Hui
Hu, Tao
Geng, Xiaoke
Li, Kai
PRICAI 2024: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2025, 15281 : 170 - 181
[46] Text Enhancement Network for Cross-Domain Scene Text Detection
Deng, Jinhong
Luo, Xiulian
Zheng, Jiawen
Dang, Wanli
Li, Wen
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2203 - 2207
[47] CHARACTER REGION AWARENESS NETWORK FOR SCENE TEXT RECOGNITION
Shang, Mingyu
Gao, Jie
Sun, Jun
2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
[48] weakly supervised text attention network for generating text proposals in scene images
Li Rong
En Mengyi
Li Jianqiang
Zhang haibin
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 324 - 330
[49] Scene Text Recognition via Dual-path Network with Shape-driven Attention Alignment
Hu, Yijie
Dong, Bin
Huang, Kaizhu
Ding, Lei
Wang, Wei
Huang, Xiaowei
Wang, Qiu-Feng
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (04)
[50] Scene Recognition Based on Recurrent Memorized Attention Network
Shao, Xi
Zhang, Xuan
Tang, Guijin
Bao, Bingkun
ELECTRONICS, 2020, 9 (12) : 1 - 19

← 1 2 3 4 5 →