Scene Text Recognition with Self-supervised Contrastive Predictive Coding

被引:0
|
作者
Jiang, Xinzhe [1 ]
Zhang, Jianshu [2 ]
Du, Jun [1 ]
Zhang, Zhenrong [1 ]
Wu, Jiajia [2 ]
机构
[1] Univ Sci & Technol China, Natl Engn Res Ctr Speech & Language Informat Proc, Hefei, Anhui, Peoples R China
[2] iFLYTEK Res, Hefei, Peoples R China
关键词
D O I
10.1109/ICPR56361.2022.9956631
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-supervised visual pre-training has recently emerged in scene text recognition (STR), which designs the pretext tasks and takes unlabeled data as input to obtain useful representations for STR. However, most current self-supervised methods do not pay special attention to the importance of sequence awareness. Accordingly, we propose a novel self-supervised STR method based on contrastive predictive coding (STR-CPC), which regards a text instance as a sequence from left to right and captures the visual sequence correlation. Considering the information overlap problem within the feature map induced by the deep convolutional neural network (CNN) encoder, we design a widthwise causal convolution during model pre-training and a progressive recovery training strategy (PRTS) during model fine-tuning to improve the STR performance. Experiments on scene text show that our STR-CPC method outperforms the existing self-supervised methods, which testifies the advantage of visual sequence correlation for STR. Additionally, STR-CPC observably boosts performance compared with supervised training when the amount of labeled data decreases.
引用
收藏
页码:1514 / 1521
页数:8
相关论文
共 50 条
  • [1] Self-supervised Underwater Source Localization based on Contrastive Predictive Coding
    Zhu, Xiaoyu
    Dong, Hefeng
    Rossi, Pierluigi Salvo
    Landro, Martin
    [J]. 2021 IEEE SENSORS, 2021,
  • [2] SELF-SUPERVISED LEARNING FOR SLEEP STAGE CLASSIFICATION WITH PREDICTIVE AND DISCRIMINATIVE CONTRASTIVE CODING
    Xiao, Qinfeng
    Wang, Jing
    Ye, Jianan
    Zhang, Hongjun
    Bu, Yuyan
    Zhang, Yiqiong
    Wu, Hao
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1290 - 1294
  • [3] Time Series Change Point Detection with Self-Supervised Contrastive Predictive Coding
    Deldari, Shohreh
    Smith, Daniel, V
    Xue, Hao
    Salim, Flora D.
    [J]. PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 3124 - 3135
  • [4] Self-Supervised Learning of Remote Sensing Scene Representations Using Contrastive Multiview Coding
    Stojnic, Vladan
    Risojevic, Vladimir
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1182 - 1191
  • [5] CONTRASTIVE SEPARATIVE CODING FOR SELF-SUPERVISED REPRESENTATION LEARNING
    Wang, Jun
    Lam, Max W. Y.
    Su, Dan
    Yu, Dong
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3865 - 3869
  • [6] Self-Supervised Learning on Graphs: Contrastive, Generative, or Predictive
    Wu, Lirong
    Lin, Haitao
    Tan, Cheng
    Gao, Zhangyang
    Li, Stan Z.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) : 4216 - 4235
  • [7] Shot Contrastive Self-Supervised Learning for Scene Boundary Detection
    Chen, Shixing
    Nie, Xiaohan
    Fan, David
    Zhang, Dongqing
    Bhat, Vimal
    Hamid, Raffay
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9791 - 9800
  • [8] Contrastive Self-Supervised Learning for Skeleton Action Recognition
    Gao, Xuehao
    Yang, Yang
    Du, Shaoyi
    [J]. NEURIPS 2020 WORKSHOP ON PRE-REGISTRATION IN MACHINE LEARNING, VOL 148, 2020, 148 : 51 - 61
  • [9] Self-Supervised EEG Representation Learning with Contrastive Predictive Coding for Post-Stroke Patients
    Xu, Fangzhou
    Yan, Yihao
    Zhu, Jianqun
    Chen, Xinyi
    Gao, Licai
    Liu, Yanbing
    Shi, Weiyou
    Lou, Yitai
    Wang, Wei
    Leng, Jiancai
    Zhang, Yang
    [J]. INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2023, 33 (12)
  • [10] A Review of Predictive and Contrastive Self-supervised Learning for Medical Images
    Wang, Wei-Chien
    Ahn, Euijoon
    Feng, Dagan
    Kim, Jinman
    [J]. MACHINE INTELLIGENCE RESEARCH, 2023, 20 (04) : 483 - 513