A Benchmark for Chinese-English Scene Text Image Super-resolution

被引：0

作者：

Ma, Jianqi ^{[1
,2
]}

Liang, Zhetong ^{[2
]}

Xiang, Wangmeng ^{[1
]}

Yang, Xi ^{[1
,2
]}

Zhang, Lei ^{[1
,2
]}

机构：

[1] Hong Kong Polytech Univ, Hong Kong, Peoples R China

[2] OPPO Res, Chengdu, Peoples R China

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年

关键词：

D O I：

10.1109/ICCV51070.2023.01782

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Scene Text Image Super-resolution (STISR) aims to recover high-resolution (HR) scene text images with visually pleasant and readable text content from the given low-resolution (LR) input. Most existing works focus on recovering English texts, which have relatively simple character structures, while little work has been done on the more challenging Chinese texts with diverse and complex character structures. In this paper, we propose a real-world Chinese-English benchmark dataset, namely Real-CE, for the task of STISR with the emphasis on restoring structurally complex Chinese characters. The benchmark provides 1,935/783 real-world LR-HR text image pairs (contains 33,789 text lines in total) for training/testing in 2x and 4x zooming modes, complemented by detailed annotations, including detection boxes and text transcripts. Moreover, we design an edge-aware learning method, which provides structural supervision in image and feature domains, to effectively reconstruct the dense structures of Chinese characters. We conduct experiments on the proposed Real-CE benchmark and evaluate the existing STISR models with and without our edge-aware loss. The benchmark, including data and source code, is available at https://github.com/mjq11302010044/Real-CE.

引用

页码：19395 / 19404

页数：10

共 50 条

[31] Light Field Super-Resolution: A Benchmark
Cheng, Zhen
Xiong, Zhiwei
Chen, Chang
Liu, Dong
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1804 - 1813
[32] ICDAR2015 Competition on Text Image Super-Resolution
Peyrard, Clement
Baccouche, Moez
Mamalet, Franck
Garcia, Christophe
[J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 1201 - 1205
[33] Anisotropic Total Variation Method for Text Image Super-Resolution
Bayarsaikhan, Battulga
Kwon, Younghee
Kim, Jin Hyung
[J]. PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 473 - 479
[34] ADVERSARIAL TEXT IMAGE SUPER-RESOLUTION USING SINKHORN DISTANCE
Geng, Cong
Chen, Li
Zhang, Xiaoyun
Gao, Zhiyong
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2663 - 2667
[35] Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text Image Super-Resolution
Zhang, Wenyu
Deng, Xin
Jia, Baojun
Yu, Xingtong
Chen, Yifan
Ma, Jin
Ding, Qing
Zhang, Xinming
[J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2168 - 2179
[36] Scene text image super-resolution using multi-scale convolutional neural network with skip connections
Walha, Rim
Aouini, Amal
[J]. APPLIED INTELLIGENCE, 2024, : 5931 - 5943
[37] Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text Image Super-Resolution
Zhang, Wenyu
Deng, Xin
Jia, Baojun
Yu, Xingtong
Chen, Yifan
Ma, Jin
Ding, Qing
Zhang, Xinming
[J]. MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia, 2023, : 2168 - 2179
[38] TextSRNet: Scene Text Super-Resolution Based on Contour Prior and Atrous Convolution
Ma, Jizhao
Jin, Lianwen
Zhang, Jiaxin
Jiang, Jiajia
Xue, Yang
He, Mengchao
[J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3252 - 3258
[39] Criteria Comparative Learning for Real-Scene Image Super-Resolution
Shi, Yukai
Li, Hao
Zhang, Sen
Yang, Zhijing
Wang, Xiao
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8476 - 8485
[40] Hyperspectral Image Super-Resolution with RGB Image Super-Resolution as an Auxiliary Task
Li, Ke
Dai, Dengxin
van Gool, Luc
[J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 4039 - 4048

← 1 2 3 4 5 →