A Benchmark for Chinese-English Scene Text Image Super-resolution

被引:0
|
作者
Ma, Jianqi [1 ,2 ]
Liang, Zhetong [2 ]
Xiang, Wangmeng [1 ]
Yang, Xi [1 ,2 ]
Zhang, Lei [1 ,2 ]
机构
[1] Hong Kong Polytech Univ, Hong Kong, Peoples R China
[2] OPPO Res, Chengdu, Peoples R China
关键词
D O I
10.1109/ICCV51070.2023.01782
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene Text Image Super-resolution (STISR) aims to recover high-resolution (HR) scene text images with visually pleasant and readable text content from the given low-resolution (LR) input. Most existing works focus on recovering English texts, which have relatively simple character structures, while little work has been done on the more challenging Chinese texts with diverse and complex character structures. In this paper, we propose a real-world Chinese-English benchmark dataset, namely Real-CE, for the task of STISR with the emphasis on restoring structurally complex Chinese characters. The benchmark provides 1,935/783 real-world LR-HR text image pairs (contains 33,789 text lines in total) for training/testing in 2x and 4x zooming modes, complemented by detailed annotations, including detection boxes and text transcripts. Moreover, we design an edge-aware learning method, which provides structural supervision in image and feature domains, to effectively reconstruct the dense structures of Chinese characters. We conduct experiments on the proposed Real-CE benchmark and evaluate the existing STISR models with and without our edge-aware loss. The benchmark, including data and source code, is available at https://github.com/mjq11302010044/Real-CE.
引用
收藏
页码:19395 / 19404
页数:10
相关论文
共 50 条
  • [31] Light Field Super-Resolution: A Benchmark
    Cheng, Zhen
    Xiong, Zhiwei
    Chen, Chang
    Liu, Dong
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1804 - 1813
  • [32] ICDAR2015 Competition on Text Image Super-Resolution
    Peyrard, Clement
    Baccouche, Moez
    Mamalet, Franck
    Garcia, Christophe
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 1201 - 1205
  • [33] Anisotropic Total Variation Method for Text Image Super-Resolution
    Bayarsaikhan, Battulga
    Kwon, Younghee
    Kim, Jin Hyung
    [J]. PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 473 - 479
  • [34] ADVERSARIAL TEXT IMAGE SUPER-RESOLUTION USING SINKHORN DISTANCE
    Geng, Cong
    Chen, Li
    Zhang, Xiaoyun
    Gao, Zhiyong
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2663 - 2667
  • [35] Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text Image Super-Resolution
    Zhang, Wenyu
    Deng, Xin
    Jia, Baojun
    Yu, Xingtong
    Chen, Yifan
    Ma, Jin
    Ding, Qing
    Zhang, Xinming
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2168 - 2179
  • [36] Scene text image super-resolution using multi-scale convolutional neural network with skip connections
    Walha, Rim
    Aouini, Amal
    [J]. APPLIED INTELLIGENCE, 2024, : 5931 - 5943
  • [37] Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text Image Super-Resolution
    Zhang, Wenyu
    Deng, Xin
    Jia, Baojun
    Yu, Xingtong
    Chen, Yifan
    Ma, Jin
    Ding, Qing
    Zhang, Xinming
    [J]. MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia, 2023, : 2168 - 2179
  • [38] TextSRNet: Scene Text Super-Resolution Based on Contour Prior and Atrous Convolution
    Ma, Jizhao
    Jin, Lianwen
    Zhang, Jiaxin
    Jiang, Jiajia
    Xue, Yang
    He, Mengchao
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3252 - 3258
  • [39] Criteria Comparative Learning for Real-Scene Image Super-Resolution
    Shi, Yukai
    Li, Hao
    Zhang, Sen
    Yang, Zhijing
    Wang, Xiao
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8476 - 8485
  • [40] Hyperspectral Image Super-Resolution with RGB Image Super-Resolution as an Auxiliary Task
    Li, Ke
    Dai, Dengxin
    van Gool, Luc
    [J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 4039 - 4048