DA-STD: Deformable Attention-Based Scene Text Detection in Arbitrary Shape

被引:2
|
作者
Wu, Xing [1 ]
Qi, Yangyang [1 ]
Tang, Bin [1 ]
Liu, Hairan [2 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China
[2] Shanghai Tech Inst Elect & Informat, Shanghai, Peoples R China
关键词
Scene Text Detection; Deformable Attention; Transformer;
D O I
10.1109/PIC53636.2021.9687065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene Text Detection (STD) is important for developing many popular technologies, such as Security and Automatic Driving. However, the existing text detection models are based on unified text shape and single background, which does not accord with the text characteristics in the natural scene. To detect arbitrarily shaped text with a complex background, we proposed a method based on deformable attention mechanism and named DA-STD. At first, a feature enhancement module named FPEM is applied to enhance the image's ability of representation learning. In addition, unlike the attention in the vanilla Transformer, our method adopts the deformable attention module interested in the pixels around the sampling points rather than the global features to make relational modeling. Experiments show that not only can we effectively improve the performance of the model but also greatly save the computational cost in this way.
引用
收藏
页码:102 / 106
页数:5
相关论文
共 50 条
  • [21] Residual attention-based multi-scale script identification in scene text images
    Ma, Mengkai
    Wang, Qiu-Feng
    Huang, Shan
    Huang, Shen
    Goulermas, Yannis
    Huang, Kaizhu
    [J]. Neurocomputing, 2021, 421 : 222 - 233
  • [22] MorphText: Deep Morphology Regularized Accurate Arbitrary-Shape Scene Text Detection
    Xu, Chengpei
    Jia, Wenjing
    Wang, Ruomei
    Luo, Xiaonan
    He, Xiangjian
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4199 - 4212
  • [23] A Quantum-Based Attention Mechanism in Scene Text Detection
    Wu, Hao
    Zhou, Jun
    Zhang, Qiong
    Lei, Yang
    Yu, Kun
    An, Wenbo
    Zhang, Juntao
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VIII, 2024, 14432 : 3 - 14
  • [24] TCATD: Text Contour Attention for Scene Text Detection
    Hu, ZiLing
    Wu, Xingjiao
    Yang, Jing
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1083 - 1088
  • [25] Attention-Based Neural Text Segmentation
    Badjatiya, Pinkesh
    Kurisinkel, Litton J.
    Gupta, Manish
    Varma, Vasudeva
    [J]. ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018), 2018, 10772 : 180 - 193
  • [26] Attention-based Text Recognition in the Wild
    Yan, Zhi-Chen
    Yu, Stephanie A.
    [J]. PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON DEEP LEARNING THEORY AND APPLICATIONS (DELTA), 2020, : 42 - 49
  • [27] Text kernel calculation for arbitrary shape text detection
    Xu Han
    Junyu Gao
    Yuan Yuan
    Qi Wang
    [J]. The Visual Computer, 2024, 40 : 2641 - 2654
  • [28] Text kernel calculation for arbitrary shape text detection
    Han, Xu
    Gao, Junyu
    Yuan, Yuan
    Wang, Qi
    [J]. VISUAL COMPUTER, 2024, 40 (04): : 2641 - 2654
  • [29] Attention-Based CNN-RNN Arabic Text Recognition from Natural Scene Images
    Butt, Hanan
    Raza, Muhammad Raheel
    Ramzan, Muhammad Javed
    Ali, Muhammad Junaid
    Haris, Muhammad
    [J]. FORECASTING, 2021, 3 (03): : 520 - 540
  • [30] OPMP: An Omnidirectional Pyramid Mask Proposal Network for Arbitrary-Shape Scene Text Detection
    Zhang, Sheng
    Liu, Yuliang
    Jin, Lianwen
    Wei, Zhongrong
    Shen, Chunhua
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 454 - 467