Noisy-Aware Unsupervised Domain Adaptation for Scene Text Recognition

被引:0
|
作者
Liu, Xiao-Qian [1 ]
Zhang, Peng-Fei [2 ]
Luo, Xin [1 ]
Huang, Zi [2 ]
Xu, Xin-Shun [1 ]
机构
[1] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
[2] Univ Queensland, Sch Elect Engn & Comp Sci, Brisbane, Qld 4072, Australia
基金
中国国家自然科学基金;
关键词
Text recognition; domain adaptation; entropy; noisy-aware; consistency regularization; NETWORK;
D O I
10.1109/TIP.2024.3492705
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised Domain Adaptation (UDA) has shown promise in Scene Text Recognition (STR) by facilitating knowledge transfer from labeled synthetic text (source) to more challenging unlabeled real scene text (target). However, existing UDA-based STR methods fully rely on the pseudo-labels of target samples, which ignores the impact of domain gaps (inter-domain noise) and various natural environments (intra-domain noise), resulting in poor pseudo-label quality. In this paper, we propose a novel noisy-aware unsupervised domain adaptation framework tailored for STR, which aims to enhance model robustness against both inter- and intra-domain noise, thereby providing more precise pseudo-labels for target samples. Concretely, we propose a reweighting target pseudo-labels by estimating the entropy of refined probability distributions, which mitigates the impact of domain gaps on pseudo-labels. Additionally, a decoupled triple-P-N consistency matching module is proposed, which leverages data augmentation to increase data diversity, enhancing model robustness in diverse natural environments. Within this module, we design a low-confidence-based character negative learning, which is decoupled from high-confidence-based positive learning, thus improving sample utilization under scarce target samples. Furthermore, we extend our framework to the more challenging Source-Free UDA (SFUDA) setting, where only a pre-trained source model is available for adaptation, with no access to source data. Experimental results on benchmark datasets demonstrate the effectiveness of our framework. Under the SFUDA setting, our method exhibits faster convergence and superior performance with less training data than previous UDA-based STR methods. Our method surpasses representative STR methods, establishing new state-of-the-art results across multiple datasets.
引用
收藏
页码:6550 / 6563
页数:14
相关论文
共 50 条
  • [11] ProtoUDA: Prototype-Based Unsupervised Adaptation for Cross-Domain Text Recognition
    Liu, Xiao-Qian
    Ding, Xue-Ying
    Luo, Xin
    Xu, Xin-Shun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (12) : 9096 - 9108
  • [12] LAL: Linguistically Aware Learning for Scene Text Recognition
    Zheng, Yi
    Qin, Wenda
    Wijaya, Derry
    Betke, Margrit
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4051 - 4059
  • [13] Unsupervised domain adaptation for speech recognition with unsupervised error correction
    Mai, Long
    Carson-Berndsen, Julie
    INTERSPEECH 2022, 2022, : 5120 - 5124
  • [14] On robustness of unsupervised domain adaptation for speaker recognition
    Bousquet, Pierre-Michel
    Rouvier, Mickael
    INTERSPEECH 2019, 2019, : 2958 - 2962
  • [15] Domain Adaptation for Object Recognition: An Unsupervised Approach
    Gopalan, Raghuraman
    Li, Ruonan
    Chellappa, Rama
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 999 - 1006
  • [16] Unsupervised Domain Adaptation for Human Activity Recognition
    Barbosa, Paulo
    Garcia, Kemilly Dearo
    Mendes-Moreira, Joao
    de Carvalho, Andre C. P. L. F.
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2018, PT I, 2018, 11314 : 623 - 630
  • [17] Spectral Unsupervised Domain Adaptation for Visual Recognition
    Zhang, Jingyi
    Huang, Jiaxing
    Tian, Zichen
    Lu, Shijian
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9819 - 9830
  • [18] UNSUPERVISED DOMAIN ADAPTATION FOR DISGUISED FACE RECOGNITION
    Wu, Fangyu
    Yan, Shiyang
    Smith, Jeremy S.
    Lu, Wenjin
    Zhang, Bailing
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 537 - 542
  • [19] Discriminative and Geometry-Aware Unsupervised Domain Adaptation
    Luo, Lingkun
    Chen, Liming
    Hu, Shiqiang
    Lu, Ying
    Wang, Xiaofang
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (09) : 3914 - 3927
  • [20] Subtype-Aware Dynamic Unsupervised Domain Adaptation
    Liu, Xiaofeng
    Xing, Fangxu
    You, Jane
    Lu, Jun
    Kuo, C. -C. Jay
    El Fakhri, Georges
    Woo, Jonghye
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 2820 - 2834