Parametric loss-based super-resolution for scene text recognition

Cited by: 0
Authors
Supatta Viriyavisuthisakul
Parinya Sanguansat
Teeradaj Racharak
Minh Le Nguyen
Natsuda Kaothanthong
Choochart Haruechaiyasak
Toshihiko Yamasaki
Affiliations
[1] Thammasat University, School of Management Technology, Sirindhorn International Institute of Technology
[2] Japan Advanced Institute of Science and Technology, School of Information Science
[3] Panyapiwat Institute of Management, Faculty of Engineering and Technology
[4] National Electronics and Computer Technology Center
[5] Department of Information and Communication Engineering
[6] The University of Tokyo
Keywords
Scene text image; Super-resolution; Parametric; Regularization; Loss function
Abstract
Scene text image super-resolution (STISR) is the process of enhancing low-resolution scene text images in order to improve text recognition accuracy. Recently, a text attention network was introduced to reconstruct high-resolution scene text images; its backbone combines convolutional neural network-based and transformer-based architectures. Although it can handle rotated and curved text, it still cannot properly handle images containing irregularly shaped text and blurred text regions, which can lead to incorrect predictions in the text recognition step. In this study, we propose applying multiple parametric regularizations and parametric weight parameters to the loss function of the STISR method to improve both scene text image quality and text recognition accuracy. We develop three variants of the approach: adding multiple parametric regularizations, modifying the parametric weight parameters, and combining parametric weights with multiple parametric regularizations. Experiments were conducted and the results compared with state-of-the-art models. Every proposed method showed a significant improvement. Moreover, our methods generated clearer and sharper edges than the baseline and achieved better image quality scores.
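The abstract does not give the form of the loss, so the following is only an illustrative sketch of what "weighted loss terms plus several parametric regularizers" could look like. The function names (`lp_penalty`, `combined_loss`), the choice of an L_p-style penalty with a tunable exponent, and all numeric values are assumptions made for illustration, not the authors' formulation.

```python
from typing import Sequence


def lp_penalty(residual: Sequence[float], p: float) -> float:
    """Hypothetical parametric regularizer: sum of |r|^p over the residual.

    The exponent p is the 'parameter'; this choice is an assumption,
    not taken from the paper.
    """
    return sum(abs(r) ** p for r in residual)


def combined_loss(
    loss_terms: Sequence[float],
    weights: Sequence[float],
    residual: Sequence[float],
    reg_strengths: Sequence[float],
    reg_exponents: Sequence[float],
) -> float:
    """Weighted sum of base loss terms plus several parametric regularizers.

    loss_terms    : already-computed base losses (e.g. pixel loss, text loss)
    weights       : one weight per base loss term
    residual      : flattened difference between prediction and target
    reg_strengths : one coefficient per regularizer
    reg_exponents : one exponent p per regularizer
    """
    if len(loss_terms) != len(weights):
        raise ValueError("loss_terms and weights must have the same length")
    if len(reg_strengths) != len(reg_exponents):
        raise ValueError("reg_strengths and reg_exponents must have the same length")

    # Variant "modifying parametric weights": only this weighted sum changes.
    weighted = sum(w * l for w, l in zip(weights, loss_terms))

    # Variant "adding multiple parametric regularizations": one penalty per (strength, p).
    penalty = sum(
        lam * lp_penalty(residual, p)
        for lam, p in zip(reg_strengths, reg_exponents)
    )

    # Variant "combining both": the sum of the two parts.
    return weighted + penalty


if __name__ == "__main__":
    total = combined_loss(
        loss_terms=[2.0, 4.0],
        weights=[0.5, 0.25],
        residual=[1.0, -2.0],
        reg_strengths=[0.1, 0.01],
        reg_exponents=[1.0, 2.0],
    )
    print(total)
```

In a real training setup the weights and exponents would typically be tuned or learned alongside the network; here they are plain numbers so that the three variants named in the abstract can be told apart by which arguments are varied.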