A Multi-level Progressive Rectification Mechanism for Irregular Scene Text Recognition

被引:1
|
作者
Liao, Qianying [1 ]
Lin, Qingxiang [3 ]
Jin, Lianwen [1 ,2 ]
Luo, Canjie [1 ]
Zhang, Jiaxin [1 ]
Peng, Dezhi [1 ]
Wang, Tianwei [1 ]
机构
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510641, Peoples R China
[2] Guangdong Artificial Intelligence & Digital Econ, Guangzhou, Peoples R China
[3] Tencent Technol Shenzhen Co Ltd, Shenzhen, Peoples R China
关键词
Optical character recognition (OCR); Deep learning; Irregular scene text recognition; NEURAL-NETWORK;
D O I
10.1007/978-3-030-86337-1_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Rectifying irregular texts into regular ones is a promising approach for improving scene text recognition systems. However, most existing methods only perform rectification at the image level once. This may be insufficient for complicated deformations. To this end, we propose a multi-level progressive rectification mechanism, which consists of global and local rectification modules at the image level and a refinement rectification module at the feature level. First, the global rectification module roughly rectifies the entire text. Then, the local rectification module focuses on local deformation to achieve a more fine-grained rectification. Finally, the refinement rectification module rectifies the feature maps to achieve supplementary rectification. In this way, the text distortion and interference from the background are gradually alleviated, thus benefiting subsequent recognition. The entire rectification stage is trained in an end-to-end weakly supervised manner, requiring only images and their corresponding text labels. Extensive experiments demonstrate that the proposed rectification mechanism is capable of rectifying irregular scene texts flexibly and accurately. The proposed method achieves state-of-the-art performance for three testing datasets including IIIT5K, IC13 and SVTP.
引用
收藏
页码:140 / 155
页数:16
相关论文
共 50 条
  • [1] Progressive rectification network for irregular text recognition
    Gao, Yunze
    Chen, Yingying
    Wang, Jinqiao
    Lu, Hanqing
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (02)
  • [2] Progressive rectification network for irregular text recognition
    Yunze GAO
    Yingying CHEN
    Jinqiao WANG
    Hanqing LU
    [J]. Science China(Information Sciences), 2020, 63 (02) : 7 - 20
  • [3] Progressive rectification network for irregular text recognition
    Yunze Gao
    Yingying Chen
    Jinqiao Wang
    Hanqing Lu
    [J]. Science China Information Sciences, 2020, 63
  • [4] Rethinking text rectification for scene text recognition
    Ke, Wenjun
    Wei, Jianguo
    Hou, Qingzhi
    Feng, Hui
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 219
  • [5] Multi-Level Ensemble Network for Scene Recognition
    Zhang, Longhao
    Li, Lingqiao
    Pan, Xipeng
    Cao, Zhiwei
    Chen, Qianyu
    Yang, Huihua
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (19) : 28209 - 28230
  • [6] Multi-Level Ensemble Network for Scene Recognition
    Longhao Zhang
    Lingqiao Li
    Xipeng Pan
    Zhiwei Cao
    Qianyu Chen
    Huihua Yang
    [J]. Multimedia Tools and Applications, 2019, 78 : 28209 - 28230
  • [7] A Two-Level Rectification Attention Network for Scene Text Recognition
    Wu, Lintai
    Xu, Yong
    Hou, Junhui
    Chen, C. L. Philip
    Liu, Cheng-Lin
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2404 - 2414
  • [8] Unattached irregular scene text rectification with refined objective
    Gong, Yanxiang
    Deng, Linjie
    Zhang, Zhiqiang
    Duan, Guozhen
    Ma, Zheng
    Xie, Mei
    [J]. NEUROCOMPUTING, 2021, 463 : 101 - 108
  • [9] Robust Scene Text Recognition with Automatic Rectification
    Shi, Baoguang
    Wang, Xinggang
    Lyu, Pengyuan
    Yao, Cong
    Bai, Xiang
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4168 - 4176
  • [10] Multi-granularity Deep Local Representations for Irregular Scene Text Recognition
    Gao, Hongchao
    Li, Yujia
    Dai, Jiao
    Wang, Xi
    Han, Jizhong
    Li, Ruixuan
    [J]. ACM/IMS Transactions on Data Science, 2021, 2 (02):