AB-LSTM: Attention-based Bidirectional LSTM Model for Scene Text Detection

Cited by: 8
Authors
Liu, Zhandong [1 ]
Zhou, Wengang [1 ]
Li, Houqiang [1 ]
Affiliations
[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Dept Elect Engn & Informat Sci, 443 Huangshan Rd, Hefei 230027, Peoples R China
Keywords
Scene text detection; bidirectional LSTM; feature fusion; attention; semantic segmentation; LOCALIZATION; RECOGNITION; COMPETITION;
DOI
10.1145/3356728
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Detecting scene text of arbitrary shape is a challenging task in computer vision. Most existing scene text detection methods use rectangular or quadrangular bounding boxes to denote the detected text, which cannot accurately fit text with arbitrary shapes, such as curved text. In addition, recent progress in scene text detection has benefited from Fully Convolutional Networks. Text cues contained in multi-level convolutional features are complementary for detecting scene text objects, but how to exploit these multi-level features is still an open problem. To tackle these issues, we propose an Attention-based Bidirectional Long Short-Term Memory (AB-LSTM) model for scene text detection. First, word stroke regions (WSRs) and text center blocks (TCBs) are extracted by two AB-LSTM models, respectively. Then, the union of WSRs and TCBs is used to represent text objects. To verify the effectiveness of the proposed method, we perform experiments on four public benchmarks (CTW1500, Total-Text, ICDAR2013, and MSRA-TD500) and compare it with existing state-of-the-art methods. Experimental results demonstrate that the proposed method achieves competitive results and handles scene text objects with arbitrary shapes (i.e., curved, oriented, and horizontal forms) well.
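The abstract states that the union of WSRs and TCBs represents text objects. The snippet below is a minimal sketch of that one step, assuming both outputs are available as binary masks of equal size; it is not the authors' implementation. The function names `union_mask` and `label_components`, the toy masks, and the use of 4-connected component labeling to separate text instances are all illustrative assumptions that do not come from the paper.

```python
from collections import deque

import numpy as np


def union_mask(wsr_mask, tcb_mask):
    """Pixel-wise union of two binary masks of equal shape (hypothetical helper)."""
    wsr = np.asarray(wsr_mask, dtype=bool)
    tcb = np.asarray(tcb_mask, dtype=bool)
    if wsr.shape != tcb.shape:
        raise ValueError("masks must have the same shape")
    return wsr | tcb


def label_components(mask):
    """Label 4-connected foreground regions with a breadth-first flood fill.

    Assumption: each connected region of the union is treated as one text
    object. The paper may group pixels differently.
    """
    mask = np.asarray(mask, dtype=bool)
    labels = np.zeros(mask.shape, dtype=np.int32)
    height, width = mask.shape
    count = 0
    for r in range(height):
        for c in range(width):
            if not mask[r, c] or labels[r, c]:
                continue
            count += 1
            labels[r, c] = count
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    inside = 0 <= ny < height and 0 <= nx < width
                    if inside and mask[ny, nx] and not labels[ny, nx]:
                        labels[ny, nx] = count
                        queue.append((ny, nx))
    return labels, count


# Toy masks standing in for the two model outputs (made-up data).
wsr = np.array([[1, 1, 0, 0, 0, 0],
                [0, 0, 0, 0, 0, 0],
                [0, 0, 0, 0, 1, 1]])
tcb = np.array([[0, 1, 1, 0, 0, 0],
                [0, 0, 0, 0, 0, 0],
                [0, 0, 0, 0, 0, 1]])

text_mask = union_mask(wsr, tcb)
labels, num_regions = label_components(text_mask)
```

Because the result is a pixel mask instead of a box, each labeled region can follow a curved or oriented text outline, which is the limitation of rectangular and quadrangular boxes that the abstract points out.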
Pages: 23