DA-STD: Deformable Attention-Based Scene Text Detection in Arbitrary Shape

被引:2
|
作者
Wu, Xing [1 ]
Qi, Yangyang [1 ]
Tang, Bin [1 ]
Liu, Hairan [2 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China
[2] Shanghai Tech Inst Elect & Informat, Shanghai, Peoples R China
关键词
Scene Text Detection; Deformable Attention; Transformer;
D O I
10.1109/PIC53636.2021.9687065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene Text Detection (STD) is important for developing many popular technologies, such as Security and Automatic Driving. However, the existing text detection models are based on unified text shape and single background, which does not accord with the text characteristics in the natural scene. To detect arbitrarily shaped text with a complex background, we proposed a method based on deformable attention mechanism and named DA-STD. At first, a feature enhancement module named FPEM is applied to enhance the image's ability of representation learning. In addition, unlike the attention in the vanilla Transformer, our method adopts the deformable attention module interested in the pixels around the sampling points rather than the global features to make relational modeling. Experiments show that not only can we effectively improve the performance of the model but also greatly save the computational cost in this way.
引用
收藏
页码:102 / 106
页数:5
相关论文
共 50 条
  • [1] CA-STD: Scene Text Detection in Arbitrary Shape Based on Conditional Attention
    Wu, Xing
    Qi, Yangyang
    Song, Jun
    Yao, Junfeng
    Wang, Yanzhong
    Liu, Yang
    Han, Yuexing
    Qian, Quan
    [J]. INFORMATION, 2022, 13 (12)
  • [2] Attention-Based Scene Text Detection on Dual Feature Fusion
    Li, Yuze
    Silamu, Wushour
    Wang, Zhenchao
    Xu, Miaomiao
    [J]. SENSORS, 2022, 22 (23)
  • [3] Arbitrary-Shaped Scene Text Recognition with Deformable Ensemble Attention
    Xu, Shuo
    Zhuang, Zeming
    Li, Mingjun
    Su, Feng
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), (237-253):
  • [4] Arbitrary Shape Natural Scene Text Detection Method Based on Soft Attention Mechanism and Dilated Convolution
    Qin, Xiao
    Jiang, Jianhui
    Yuan, Chang-An
    Qiao, Shaojie
    Fan, Wei
    [J]. IEEE ACCESS, 2020, 8 (08): : 122685 - 122694
  • [5] Arbitrary Shape Scene Text Detection with Adaptive Text Region Representation
    Wang, Xiaobing
    Jiang, Yingying
    Luo, Zhenbo
    Liu, Cheng-Lin
    Choi, Hyunsoo
    Kim, Sungjin
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6442 - 6451
  • [6] Scene Text Detection via Deep Semantic Feature Fusion and Attention-based Refinement
    Song, Yu
    Cui, Yuanshun
    Han, Hu
    Shan, Shiguang
    Chen, Xilin
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3747 - 3752
  • [7] AB-LSTM: Attention-based Bidirectional LSTM Model for Scene Text Detection
    Liu, Zhandong
    Zhou, Wengang
    Li, Houqiang
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (04)
  • [8] Adaptive embedding gate for attention-based scene text recognition
    Chen, Xiaoxue
    Wang, Tianwei
    Zhu, Yuanzhi
    Jin, Lianwen
    Luo, Canjie
    [J]. NEUROCOMPUTING, 2020, 381 : 261 - 271
  • [9] Arbitrary-shaped scene text detection with keypoint-based shape representation
    Shuxin Qin
    Lin Chen
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2022, 25 : 115 - 127
  • [10] Arbitrary-shaped scene text detection with keypoint-based shape representation
    Qin, Shuxin
    Chen, Lin
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2022, 25 (02) : 115 - 127