ProtoUDA: Prototype-based Unsupervised Adaptation for Cross-Domain Text Recognition

被引:0
|
作者
Liu X. [1 ]
Ding X. [1 ]
Luo X. [1 ]
Xu X. [1 ]
机构
[1] School of Software, Shandong University, Jinan
关键词
Adaptation models; contrastive learning; Decoding; domain adaptation; Feature extraction; prototype; Prototypes; Task analysis; text recognition; Text recognition; Unsupervised learning; Visualization;
D O I
10.1109/TKDE.2023.3344761
中图分类号
学科分类号
摘要
Text recognition reads from real scene text or handwritten text, facilitating many real-world applications such as driverless cars, visual Q&A, and image-based machine translation. Although impressive results have been achieved in single-domain text recognition, it still suffers from great challenges in cross-domain due to the domain gaps among the synthetic text, the real scene text, and the handwritten text. Existing standard unsupervised domain adaptation (UDA) methods struggle to solve the text recognition task since they view a domain or a text image (containing a character sequence) as a whole, ignoring the subunits that make up the sequence. In the paper, we present a Prototyped-based Unsupervised Domain Adaptation method for text recognition (ProtoUDA), where the class prototypes are computed from the source domain, target domain, and the mixed (source-target) domain, respectively. Technically, ProtoUDA initially extracts pseudo-labeled character features under word-level supervised information. Further, based on these character features, we propose two parallel and complementary modules to perform class-level and instance-level alignment, which explicitly transfer the knowledge learned in the source domain to the target domain. Among them, class-level alignment is to close the distance between the similar source prototypes and target prototypes. The instance-level alignment is based on contrastive learning, making the character instances of the mixed domain close to the corresponding class mixed prototype while staying away from other class mixed prototypes. To our knowledge, we are the first to adopt contrastive learning in UDA-based text recognition tasks. Extensive experiments on several benchmark datasets show the superiority of our method over state-of-the-art methods. IEEE
引用
收藏
页码:1 / 13
页数:12
相关论文
共 50 条
  • [41] STR Transformer: A Cross-domain Transformer for Scene Text Recognition
    Xing Wu
    Bin Tang
    Ming Zhao
    Jianjia Wang
    Yike Guo
    Applied Intelligence, 2023, 53 : 3444 - 3458
  • [42] Volumetric Body Composition Through Cross-Domain Consistency Training for Unsupervised Domain Adaptation
    Ali, Shahzad
    Lee, Yu Rim
    Park, Soo Young
    Tak, Won Young
    Jung, Soon Ki
    ADVANCES IN VISUAL COMPUTING, ISVC 2023, PT I, 2023, 14361 : 289 - 299
  • [43] NaCL: noise-robust cross-domain contrastive learning for unsupervised domain adaptation
    Jingzheng Li
    Hailong Sun
    Machine Learning, 2023, 112 : 3473 - 3496
  • [44] NaCL: noise-robust cross-domain contrastive learning for unsupervised domain adaptation
    Li, Jingzheng
    Sun, Hailong
    MACHINE LEARNING, 2023, 112 (09) : 3473 - 3496
  • [45] Discriminative Extreme Learning Machine with Cross-Domain Mean Approximation for Unsupervised Domain Adaptation
    Zang, Shaofei
    Li, Xinghai
    Ma, Jianwei
    Yan, Yongyi
    Lv, Jinfeng
    Wei, Yuan
    COMPLEXITY, 2022, 2022
  • [46] CONNECTING THE DOTS WITHOUT CLUES: UNSUPERVISED DOMAIN ADAPTATION FOR CROSS-DOMAIN VISUAL CLASSIFICATION
    Chen, Wei-Yu
    Hsu, Tzu-Ming Harry
    Hou, Cheng-An
    Yeh, Yi-Ren
    Wang, Yu-Chiang Frank
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 3997 - 4001
  • [47] STR Transformer: A Cross-domain Transformer for Scene Text Recognition
    Wu, Xing
    Tang, Bin
    Zhao, Ming
    Wang, Jianjia
    Guo, Yike
    APPLIED INTELLIGENCE, 2023, 53 (03) : 3444 - 3458
  • [48] A Fast, Efficient Domain Adaptation Technique for Cross-Domain Electroencephalography(EEG)-Based Emotion Recognition
    Chai, Xin
    Wang, Qisong
    Zhao, Yongping
    Li, Yongqiang
    Liu, Dan
    Liu, Xin
    Bai, Ou
    SENSORS, 2017, 17 (05)
  • [49] A Compressed Unsupervised Deep Domain Adaptation Model for Efficient Cross-Domain Fault Diagnosis
    Xu, Gaowei
    Huang, Chenxi
    Silva, Daniel Santos da
    Albuquerque, Victor Hugo C. de
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (05) : 6741 - 6749
  • [50] TraND: Transferable Neighborhood Discovery for Unsupervised Cross-domain Gait Recognition
    Zheng, Jinkai
    Liu, Xinchen
    Yan, Chenggang
    Zhang, Jiyong
    Liu, Wu
    Zhang, Xiaoping
    Mei, Tao
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,