Domain Adaptation Without Source Data

被引:83
|
作者
Kim Y. [1 ]
Cho D. [2 ]
Han K. [3 ]
Panda P. [1 ]
Hong S. [3 ]
机构
[1] The Department of Electrical Engineering, Yale University, New Haven, 06520, CT
[2] The Department of Electronics Engineering, Chungnam National University, Daejeon
[3] The Department of Electrical and Computer Engineering, Inha University, Incheon
来源
基金
新加坡国家研究基金会;
关键词
Class prototypes; data privacy; pseudolabels; self-entropy; source data free domain adaptation (SFDA);
D O I
10.1109/TAI.2021.3110179
中图分类号
学科分类号
摘要
Domain adaptation assumes that samples from source and target domains are freely accessible during a training phase. However, such an assumption is rarely plausible in the real world and possibly causes data privacy issues, especially when the label of the source domain can be a sensitive attribute as an identifier. To avoid accessing source data that could contain sensitive information, we introduce source data free domain adaptation (SFDA). Our key idea is to leverage a pretrained model from the source domain and progressively update the target model in a self-learning manner. We observe that target samples with lower self-entropy measured by the pretrained source model are more likely to be classified correctly. From this, we select the reliable samples with the self-entropy criterion and define these as class prototypes. We then assign pseudolabels for every target sample based on the similarity score with class prototypes. We further propose point-to-set distance-based filtering, which does not require any tunable hyperparameters to reduce uncertainty from the pseudolabeling process. Finally, we train the target model with the filtered pseudolabels with regularization from the pretrained source model. Surprisingly, without direct usage of labeled source samples, our SFDA outperforms conventional domain adaptation methods on benchmark datasets. Impact Statement-This study addresses the data privacy issue, especially in unsupervised domain adaptation. Based on our privacy-preserving domain adaptation, various stakeholders, including enterprises and government organizations, can be free of concern about privacy issues with their labeled source dataset. Furthermore, the proposed data-free approach can contribute to creating a positive social impact, especially in large-scale datasets. Recently, since the size of data across various fields has been scaling up, it is almost incapable for individual researchers to directly utilize a large scale of data during training. For this reason, a new social trend of sharing pretrained models, e.g., EfficientNet and BERT, led by global enterprises with their huge amount of resources has been rising up. From this viewpoint, our approach thus enables more people to participate in the domain adaptation field specifically when the source data are large scale and contain sensitive attributes. © 2021 IEEE.
引用
收藏
页码:508 / 518
页数:10
相关论文
共 50 条
  • [1] Model Adaptation: Unsupervised Domain Adaptation without Source Data
    Li, Rui
    Jiao, Qianfen
    Cao, Wenming
    Wong, Hau-San
    Wu, Si
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 9638 - 9647
  • [2] Unsupervised Robust Domain Adaptation without Source Data
    Agarwal, Peshal
    Paudel, Danda Pani
    Zaech, Jan-Nico
    Van Gool, Luc
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2805 - 2814
  • [3] Unsupervised Multi-source Domain Adaptation Without Access to Source Data
    Ahmed, Sk Miraj
    Raychaudhuri, Dripta S.
    Paul, Sujoy
    Oymak, Samet
    Roy-Chowdhury, Amit K.
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10098 - 10107
  • [4] Model Adaptation: Historical Contrastive Learning for Unsupervised Domain Adaptation without Source Data
    Huang, Jiaxing
    Guan, Dayan
    Xiao, Aoran
    Lu, Shijian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [5] Transformer-Based Multi-Source Domain Adaptation Without Source Data
    Li, Gang
    Wu, Chao
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [6] Domain Adaptation in the Absence of Source Domain Data
    Chidlovskii, Boris
    Clinchant, Stephane
    Csurka, Gabriela
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 451 - 460
  • [7] UNKNOWN CLASS FEATURE TRANSFORMATION FOR OPEN SET DOMAIN ADAPTATION WITHOUT SOURCE DATA
    Zhong, Jian
    Wu, Si
    Wong, Hau-San
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 405 - 409
  • [8] UNIVERSAL DOMAIN ADAPTATION WITHOUT SOURCE DATA FOR REMOTE SENSING IMAGE SCENE CLASSIFICATION
    Xu, Qingsong
    Shi, Yilei
    Zhu, Xiaoxiang
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 5341 - 5344
  • [9] Domain Impression: A Source Data Free Domain Adaptation Method
    Kurmi, Vinod K.
    Subramanian, Venkatesh K.
    Namboodiri, Vinay P.
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 615 - 625
  • [10] Unsupervised domain adaptation without source data for estimating occupancy and recognizing activities in smart buildings
    Dridi, Jawher
    Amayri, Manar
    Bouguila, Nizar
    ENERGY AND BUILDINGS, 2024, 303