Parametric loss-based super-resolution for scene text recognition

被引:0
|
作者
Supatta Viriyavisuthisakul
Parinya Sanguansat
Teeradaj Racharak
Minh Le Nguyen
Natsuda Kaothanthong
Choochart Haruechaiyasak
Toshihiko Yamasaki
机构
[1] Thammasat University,School of Management Technology, Sirindhorn International Institute of Technology
[2] Japan Advanced Institute of Information Technology,School of Information Science
[3] Panyapiwat Institute of Management,Faculty of Engineering and Technology
[4] National Electronics and Computer Technology Center,undefined
[5] Department of Information and Communication Engineering,undefined
[6] The University of Tokyo,undefined
来源
关键词
Scene text image; Super-resolution; Parametric; Regularization; Loss function;
D O I
暂无
中图分类号
学科分类号
摘要
Scene text image super-resolution (STISR) is regarded as the process of improving the image quality of low-resolution scene text images to improve text recognition accuracy. Recently, a text attention network was introduced to reconstruct high-resolution scene text images; the backbone method involved the convolutional neural network-based and transformer-based architecture. Although it can deal with rotated and curved-shaped texts, it still cannot properly handle images containing improper-shaped texts and blurred text regions. This can lead to incorrect text predictions during the text recognition step. In this study, we propose the application of multiple parametric regularizations and parametric weight parameters to the loss function of the STISR method to improve scene text image quality and text recognition accuracy. We design and extend it into three types of methods: adding multiple parametric regularizations, modifying parametric weight parameters, and combining parametric weights and multiple parametric regularizations. Experiments were conducted and compared with state-of-the-art models. The results showed a significant improvement for every proposed method. Moreover, our methods generated clearer and sharper edges than the baseline with a better-quality image score.
引用
收藏
相关论文
共 50 条
  • [1] Parametric loss-based super-resolution for scene text recognition
    Viriyavisuthisakul, Supatta
    Sanguansat, Parinya
    Racharak, Teeradaj
    Le Nguyen, Minh
    Kaothanthong, Natsuda
    Haruechaiyasak, Choochart
    Yamasaki, Toshihiko
    [J]. MACHINE VISION AND APPLICATIONS, 2023, 34 (04)
  • [2] Scene Text Telescope: Text-Focused Scene Image Super-Resolution
    Chen, Jingye
    Li, Bin
    Xue, Xiangyang
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12021 - 12030
  • [3] Text Prior Guided Scene Text Image Super-Resolution
    Ma, Jianqi
    Guo, Shi
    Zhang, Lei
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1341 - 1353
  • [4] Parametric regularization loss in super-resolution reconstruction
    Viriyavisuthisakul, Supatta
    Kaothanthong, Natsuda
    Sanguansat, Parinya
    Le Nguyen, Minh
    Haruechaiyasak, Choochart
    [J]. MACHINE VISION AND APPLICATIONS, 2022, 33 (05)
  • [5] Parametric regularization loss in super-resolution reconstruction
    Supatta Viriyavisuthisakul
    Natsuda Kaothanthong
    Parinya Sanguansat
    Minh Le Nguyen
    Choochart Haruechaiyasak
    [J]. Machine Vision and Applications, 2022, 33
  • [6] TextSRNet: Scene Text Super-Resolution Based on Contour Prior and Atrous Convolution
    Ma, Jizhao
    Jin, Lianwen
    Zhang, Jiaxin
    Jiang, Jiajia
    Xue, Yang
    He, Mengchao
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3252 - 3258
  • [7] Batch-transformer for scene text image super-resolution
    Sun, Yaqi
    Xie, Xiaolan
    Li, Zhi
    Yang, Kai
    [J]. VISUAL COMPUTER, 2024, 40 (10): : 7399 - 7409
  • [8] Text Gestalt: Stroke-Aware Scene Text Image Super-resolution
    Chen, Jingye
    Yu, Haiyang
    Ma, Jianqi
    Li, Bin
    Xue, Xiangyang
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 285 - 293
  • [9] Rectification and Super-Resolution Enhancements for Forensic Text Recognition
    Blanco-Medina, Pablo
    Fidalgo, Eduardo
    Alegre, Enrique
    Alaiz-Rodriguez, Rocio
    Janez-Martino, Francisco
    Bonnici, Alexandra
    [J]. SENSORS, 2020, 20 (20) : 1 - 17
  • [10] A Benchmark for Chinese-English Scene Text Image Super-resolution
    Ma, Jianqi
    Liang, Zhetong
    Xiang, Wangmeng
    Yang, Xi
    Zhang, Lei
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 19395 - 19404