A Multi-level Progressive Rectification Mechanism for Irregular Scene Text Recognition

被引:1
|
作者
Liao, Qianying [1 ]
Lin, Qingxiang [3 ]
Jin, Lianwen [1 ,2 ]
Luo, Canjie [1 ]
Zhang, Jiaxin [1 ]
Peng, Dezhi [1 ]
Wang, Tianwei [1 ]
机构
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510641, Peoples R China
[2] Guangdong Artificial Intelligence & Digital Econ, Guangzhou, Peoples R China
[3] Tencent Technol Shenzhen Co Ltd, Shenzhen, Peoples R China
关键词
Optical character recognition (OCR); Deep learning; Irregular scene text recognition; NEURAL-NETWORK;
D O I
10.1007/978-3-030-86337-1_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Rectifying irregular texts into regular ones is a promising approach for improving scene text recognition systems. However, most existing methods only perform rectification at the image level once. This may be insufficient for complicated deformations. To this end, we propose a multi-level progressive rectification mechanism, which consists of global and local rectification modules at the image level and a refinement rectification module at the feature level. First, the global rectification module roughly rectifies the entire text. Then, the local rectification module focuses on local deformation to achieve a more fine-grained rectification. Finally, the refinement rectification module rectifies the feature maps to achieve supplementary rectification. In this way, the text distortion and interference from the background are gradually alleviated, thus benefiting subsequent recognition. The entire rectification stage is trained in an end-to-end weakly supervised manner, requiring only images and their corresponding text labels. Extensive experiments demonstrate that the proposed rectification mechanism is capable of rectifying irregular scene texts flexibly and accurately. The proposed method achieves state-of-the-art performance for three testing datasets including IIIT5K, IC13 and SVTP.
引用
收藏
页码:140 / 155
页数:16
相关论文
共 50 条
  • [11] Weakly Supervised Attention Rectification for Scene Text Recognition
    Gu, Chengyu
    Wang, Shilin
    Zhu, Yiwei
    Huang, Zheng
    Chen, Kai
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 779 - 786
  • [12] Scene Text Segmentation with Multi-level Maximally Stable Extremal Regions
    Tian, Shangxuan
    Lu, Shijian
    Su, Bolan
    Tan, Chew Lim
    [J]. 2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 2703 - 2708
  • [13] Symmetry-Constrained Rectification Network for Scene Text Recognition
    Yan, Mingkun
    Guan, Yushuo
    Liao, Minghui
    He, Xin
    Bian, Kaigui
    Bai, Song
    Yao, Cong
    Bai, Xiang
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9146 - 9155
  • [14] Character-Aware Sampling and Rectification for Scene Text Recognition
    Li, Ming
    Fu, Bin
    Zhang, Zhengfu
    Qiao, Yu
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 649 - 661
  • [15] MFECN: Multi-level Feature Enhanced Cumulative Network for Scene Text Detection
    Liu, Zhandong
    Zhou, Wengang
    Li, Houqiang
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (03)
  • [16] Scene Text Recognition with Orientation Rectification via IC-STN
    Naosekpam, Veronica
    Shishir, Ailneni Sai
    Sahu, Nilkanta
    [J]. 2021 IEEE REGION 10 CONFERENCE (TENCON 2021), 2021, : 664 - 669
  • [17] Study on multi-level progressive optimization of planetary roller screw mechanism
    Yao, Qin
    Zhang, Mengchuang
    Ma, Shangjun
    Jiang, Quansheng
    [J]. TRIBOLOGY INTERNATIONAL, 2023, 189
  • [18] An extended attention mechanism for scene text recognition
    Xiao, Zheng
    Nie, Zhenyu
    Song, Chao
    Chronopoulos, Anthony Theodore
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 203
  • [19] Text-Level Contrastive Learning for Scene Text Recognition
    Zhuang, Junbin
    Ren, Yixuan
    Li, Xia
    Liang, Zhanpeng
    [J]. 2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 231 - 236
  • [20] Progressive Raising in Multi-level IR
    Chelini, Lorenzo
    Drebes, Andi
    Zinenko, Oleksandr
    Cohen, Albert
    Vasilache, Nicolas
    Grosser, Tobias
    Corporaal, Henk
    [J]. CGO '21: PROCEEDINGS OF THE 2021 IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO), 2021, : 15 - 26