Noisy-Aware Unsupervised Domain Adaptation for Scene Text Recognition

被引:0
|
作者
Liu, Xiao-Qian [1 ]
Zhang, Peng-Fei [2 ]
Luo, Xin [1 ]
Huang, Zi [2 ]
Xu, Xin-Shun [1 ]
机构
[1] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
[2] Univ Queensland, Sch Elect Engn & Comp Sci, Brisbane, Qld 4072, Australia
基金
中国国家自然科学基金;
关键词
Text recognition; domain adaptation; entropy; noisy-aware; consistency regularization; NETWORK;
D O I
10.1109/TIP.2024.3492705
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised Domain Adaptation (UDA) has shown promise in Scene Text Recognition (STR) by facilitating knowledge transfer from labeled synthetic text (source) to more challenging unlabeled real scene text (target). However, existing UDA-based STR methods fully rely on the pseudo-labels of target samples, which ignores the impact of domain gaps (inter-domain noise) and various natural environments (intra-domain noise), resulting in poor pseudo-label quality. In this paper, we propose a novel noisy-aware unsupervised domain adaptation framework tailored for STR, which aims to enhance model robustness against both inter- and intra-domain noise, thereby providing more precise pseudo-labels for target samples. Concretely, we propose a reweighting target pseudo-labels by estimating the entropy of refined probability distributions, which mitigates the impact of domain gaps on pseudo-labels. Additionally, a decoupled triple-P-N consistency matching module is proposed, which leverages data augmentation to increase data diversity, enhancing model robustness in diverse natural environments. Within this module, we design a low-confidence-based character negative learning, which is decoupled from high-confidence-based positive learning, thus improving sample utilization under scarce target samples. Furthermore, we extend our framework to the more challenging Source-Free UDA (SFUDA) setting, where only a pre-trained source model is available for adaptation, with no access to source data. Experimental results on benchmark datasets demonstrate the effectiveness of our framework. Under the SFUDA setting, our method exhibits faster convergence and superior performance with less training data than previous UDA-based STR methods. Our method surpasses representative STR methods, establishing new state-of-the-art results across multiple datasets.
引用
收藏
页码:6550 / 6563
页数:14
相关论文
共 50 条
  • [1] UNSUPERVISED DOMAIN ADAPTATION WITH IMBALANCED CHARACTER DISTRIBUTION FOR SCENE TEXT RECOGNITION
    Hung Tran Tien
    Thanh Duc Ngo
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3493 - 3497
  • [2] GA-DAN: Geometry-Aware Domain Adaptation Network for Scene Text Detection and Recognition
    Zhan, Fangneng
    Xue, Chuhui
    Lu, Shijian
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9104 - 9114
  • [3] Unsupervised Domain Adaptation via Class Aggregation for Text Recognition
    Liu, Xiao-Qian
    Ding, Xue-Ying
    Luo, Xin
    Xu, Xin-Shun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 5617 - 5630
  • [4] Prompt-Integrated Adversarial Unsupervised Domain Adaptation for Scene Recognition
    Yu, Yangyang
    Wang, Shengsheng
    Fu, Zihao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [5] BSAM: Bidirectional Scene-Aware Mixup for Unsupervised Domain Adaptation in Semantic Segmentation
    Xing, Congying
    Li, Gao
    Zhang, Lefei
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 54 - 66
  • [6] Margin-aware Unsupervised Domain Adaptation for Cross-lingual Text Labeling
    Zhang, Dejiao
    Nallapati, Ramesh
    Zhu, Henghui
    Nan, Feng
    dos Santos, Cicero Nogueira
    McKeown, Kathleen
    Xiang, Bing
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020,
  • [7] Unsupervised Method to Remove Noisy and Redundant Images in Scene Recognition
    Santos-Saavedra, David
    Iglesias, Roberto
    Pardo, Xose M.
    ROBOT 2015: SECOND IBERIAN ROBOTICS CONFERENCE: ADVANCES IN ROBOTICS, VOL 2, 2016, 418 : 695 - 704
  • [8] ROAD: Robust Unsupervised Domain Adaptation with Noisy Labels
    Feng, Yanglin
    Zhu, Hongyuan
    Peng, Dezhong
    Peng, Xi
    Hu, Peng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7264 - 7273
  • [9] Nighttime Road Scene Parsing by Unsupervised Domain Adaptation
    Song, Can
    Wu, Jin
    Zhu, Lei
    Zhang, Mei
    Ling, Haibin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (04) : 3244 - 3255
  • [10] Unsupervised urban scene segmentation via domain adaptation
    Gao, Lianli
    Zhang, Yiyue
    Zou, Fuhao
    Shao, Jie
    Lai, Junyu
    NEUROCOMPUTING, 2020, 406 : 295 - 301