IFR: Iterative Fusion Based Recognizer for Low Quality Scene Text Recognition

被引:2
|
作者
Jia, Zhiwei [1 ]
Xu, Shugong [1 ]
Mu, Shiyi [1 ]
Tao, Yue [1 ]
Cao, Shan [1 ]
Chen, Zhiyong [1 ]
机构
[1] Shanghai Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai 200444, Peoples R China
关键词
Scene text recognition; Iterative collaboration; Feature fusion; NETWORK;
D O I
10.1007/978-3-030-88007-1_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although recent works based on deep learning have made progress in improving recognition accuracy on scene text recognition, how to handle low-quality text images in end-to-end deep networks remains a research challenge. In this paper, we propose an Iterative Fusion based Recognizer (IFR) for low quality scene text recognition, taking advantage of refined text images input and robust feature representation. IFR contains two branches which focus on scene text recognition and low quality scene text image recovery respectively. We utilize an iterative collaboration between two branches, which can effectively alleviate the impact of low quality input. A feature fusion module is proposed to strengthen the feature representation of the two branches, where the features from the Recognizer are Fused with image Restoration branch, referred to as RRF. Without changing the recognition network structure, extensive quantitative and qualitative experimental results show that the proposed method significantly outperforms the baseline methods in boosting the recognition accuracy of benchmark datasets and low resolution images in TextZoom dataset.
引用
收藏
页码:180 / 191
页数:12
相关论文
共 50 条
  • [1] Scene Text Detection and Recognition Based on Iterative Correction
    Xiong, Li
    Gui, Ziyan
    Ou, Ying
    Xu, Wenxia
    [J]. PROCEEDINGS OF 2022 5TH INTERNATIONAL CONFERENCE ON ROBOT SYSTEMS AND APPLICATIONS, ICRSA2022, 2022, : 7 - 10
  • [2] Adversarial learning based attentional scene text recognizer
    Zhao, Jinyuan
    Wang, Yanna
    Xiao, Baihua
    Shi, Cunzhao
    Jiang, Jingzhong
    Wang, Chunheng
    [J]. PATTERN RECOGNITION LETTERS, 2020, 138 : 217 - 222
  • [3] PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text Recognition
    Qiao, Zhi
    Zhou, Yu
    Wei, Jin
    Wang, Wei
    Zhang, Yuan
    Jiang, Ning
    Wang, Hongbin
    Wang, Weiping
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2046 - 2055
  • [4] IterVM: Iterative Vision Modeling Module for Scene Text Recognition
    Chu, Xiaojie
    Wang, Yongtao
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1393 - 1399
  • [5] Scene Text Detection Based On Fusion Network
    Zhao, Xuezhuan
    Zhou, Ziheng
    Li, Lingling
    Pei, Lishen
    Ye, Zhaoyi
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (10)
  • [6] Scene Text Recognition for Text-Based Traffic Signs
    Taki, Youssef
    Zemmouri, Elmoukhtar
    [J]. ADVANCES IN INTELLIGENT TRAFFIC AND TRANSPORTATION SYSTEMS, ICITT 2022, 2023, 34 : 67 - 77
  • [7] SCENE TEXT RECOGNITION IN MULTIPLE FRAMES BASED ON TEXT TRACKING
    Rong, Xuejian
    Yi, Chucai
    Yang, Xiaodong
    Tian, Yingli
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2014,
  • [8] Lightweight Scene Text Recognition Based on Transformer
    Luan, Xin
    Zhang, Jinwei
    Xu, Miaomiao
    Silamu, Wushouer
    Li, Yanbing
    [J]. SENSORS, 2023, 23 (09)
  • [9] A Vision Transformer Based Scene Text Recognizer with Multi-grained Encoding and Decoding
    Qiao, Zhi
    Ji, Zhilong
    Yuan, Ye
    Bai, Jinfeng
    [J]. FRONTIERS IN HANDWRITING RECOGNITION, ICFHR 2022, 2022, 13639 : 198 - 212
  • [10] Scene Text Recognition Based on Improved CRNN
    Yu, Wenhua
    Ibrayim, Mayire
    Hamdulla, Askar
    [J]. INFORMATION, 2023, 14 (07)