Attention-Based Scene Text Detection on Dual Feature Fusion

被引:3
|
作者
Li, Yuze [1 ]
Silamu, Wushour [1 ]
Wang, Zhenchao [1 ]
Xu, Miaomiao [1 ]
机构
[1] Xinjiang Univ, Coll Informat Sci & Engn, Xinjiang Multilingual Informat Technol Res Ctr, Xinjiang Multilingual Informat Technol Lab, Urumqi 830017, Peoples R China
基金
中国国家自然科学基金;
关键词
scene text detection; feature pyramid network; spatial attention; multi-scale feature fusion; differentiable binarization;
D O I
10.3390/s22239072
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The segmentation-based scene text detection algorithm has advantages in scene text detection scenarios with arbitrary shape and extreme aspect ratio, depending on its pixel-level description and fine post-processing. However, the insufficient use of semantic and spatial information in the network limits the classification and positioning capabilities of the network. Existing scene text detection methods have the problem of losing important feature information in the process of extracting features from each network layer. To solve this problem, the Attention-based Dual Feature Fusion Model (ADFM) is proposed. The Bi-directional Feature Fusion Pyramid Module (BFM) first adds stronger semantic information to the higher-resolution feature maps through a top-down process and then reduces the aliasing effects generated by the previous process through a bottom-up process to enhance the representation of multi-scale text semantic information. Meanwhile, a position-sensitive Spatial Attention Module (SAM) is introduced in the intermediate process of two-stage feature fusion. It focuses on the one feature map with the highest resolution and strongest semantic features generated in the top-down process and weighs the spatial position weight by the relevance of text features, thus improving the sensitivity of the text detection network to text regions. The effectiveness of each module of ADFM was verified by ablation experiments and the model was compared with recent scene text detection methods on several publicly available datasets.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Text Detection Algorithm Based on Multi-Scale Attention Feature Fusion
    She, Xiangyang
    Liu, Zhe
    Dong, Lihong
    [J]. Computer Engineering and Applications, 2024, 60 (01) : 198 - 206
  • [22] Scene Text Detection Based on Multi-Scale Pooling and Bidirectional Feature Fusion
    Wei, Zheliang
    Li, Yueyang
    Luo, Haichi
    [J]. Computer Engineering and Applications, 2024, 60 (02) : 154 - 161
  • [23] Attention-based dual-path feature fusion network for automatic skin lesion segmentation
    He, Zhenxiang
    Li, Xiaoxia
    Chen, Yuling
    Lv, Nianzu
    Cai, Yong
    [J]. BIODATA MINING, 2023, 16 (01)
  • [24] Attention-based dual-path feature fusion network for automatic skin lesion segmentation
    Zhenxiang He
    Xiaoxia Li
    Yuling Chen
    Nianzu Lv
    Yong Cai
    [J]. BioData Mining, 16
  • [25] Selective attention-based novelty scene detection in dynamic environments
    Ban, Sang-Woo
    Lee, Minho
    [J]. NEUROCOMPUTING, 2006, 69 (13-15) : 1723 - 1727
  • [26] SCAF-Net: Scene Context Attention-Based Fusion Network for Vehicle Detection in Aerial Imagery
    Wang, Minghui
    Li, Qingpeng
    Gu, Yunchao
    Fang, Leyuan
    Zhu, Xiao Xiang
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [27] Hierarchical Feature Fusion With Text Attention For Multi-scale Text Detection
    Liu, Chao
    Zou, Yuexian
    Guan, Wenjie
    [J]. 2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [28] Infrared and Visible Image Fusion via Attention-Based Adaptive Feature Fusion
    Wang, Lei
    Hu, Ziming
    Kong, Quan
    Qi, Qian
    Liao, Qing
    [J]. ENTROPY, 2023, 25 (03)
  • [29] Attention-Based Deep Neural Network and Its Application to Scene Text Recognition
    He, Haizhen
    Li, Jiehan
    [J]. 2019 IEEE 11TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2019), 2019, : 672 - 677
  • [30] TAFFNet: Two-Stage Attention-Based Feature Fusion Network for Surface Defect Detection
    Cao, Jingang
    Yang, Guotian
    Yang, Xiyun
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2022, 94 (12): : 1531 - 1544