Domain Adaptation Without Source Data

Cited by: 83
Authors
Kim Y. [1]
Cho D. [2]
Han K. [3]
Panda P. [1]
Hong S. [3]
Affiliations
[1] The Department of Electrical Engineering, Yale University, New Haven, 06520, CT
[2] The Department of Electronics Engineering, Chungnam National University, Daejeon
[3] The Department of Electrical and Computer Engineering, Inha University, Incheon
Funding
National Research Foundation, Singapore
Keywords
Class prototypes; data privacy; pseudolabels; self-entropy; source data free domain adaptation (SFDA)
DOI
10.1109/TAI.2021.3110179
Abstract
Domain adaptation assumes that samples from the source and target domains are freely accessible during training. However, this assumption is rarely plausible in the real world and can cause data privacy issues, especially when the source-domain label is a sensitive attribute that can serve as an identifier. To avoid accessing source data that could contain sensitive information, we introduce source data free domain adaptation (SFDA). Our key idea is to leverage a model pretrained on the source domain and progressively update the target model in a self-learning manner. We observe that target samples with lower self-entropy, as measured by the pretrained source model, are more likely to be classified correctly. Based on this observation, we select reliable samples using the self-entropy criterion and define them as class prototypes. We then assign a pseudolabel to every target sample based on its similarity score with the class prototypes. We further propose point-to-set distance-based filtering, which requires no tunable hyperparameters, to reduce uncertainty in the pseudolabeling process. Finally, we train the target model on the filtered pseudolabels, with regularization from the pretrained source model. Surprisingly, without direct use of labeled source samples, our SFDA outperforms conventional domain adaptation methods on benchmark datasets.
Impact Statement: This study addresses the data privacy issue in unsupervised domain adaptation. With our privacy-preserving domain adaptation, various stakeholders, including enterprises and government organizations, need not worry about privacy issues with their labeled source datasets. Furthermore, the proposed data-free approach can have a positive social impact, especially for large-scale datasets. As the size of data across various fields has grown, it has become nearly impossible for individual researchers to use large-scale data directly during training. For this reason, global enterprises with vast resources have led a growing trend of sharing pretrained models, e.g., EfficientNet and BERT. From this viewpoint, our approach enables more people to participate in the domain adaptation field, specifically when the source data are large scale and contain sensitive attributes. © 2021 IEEE.
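The pseudolabeling steps named in the abstract (self-entropy selection, class prototypes, similarity-based pseudolabels, point-to-set filtering) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the `per_class` prototype count, the use of cosine similarity, the entropy normalization, and the specific filtering rule (keep a sample only if its farthest own-class prototype is closer than the nearest prototype of any other class) are all assumptions made for illustration. The final training step with source-model regularization is not shown.

```python
import numpy as np

def self_entropy(probs, eps=1e-12):
    """Shannon entropy of each softmax row, divided by log(num_classes) (assumed normalization)."""
    num_classes = probs.shape[1]
    return -(probs * np.log(probs + eps)).sum(axis=1) / np.log(num_classes)

def select_prototypes(features, probs, per_class=2):
    """Per predicted class, keep the `per_class` lowest-entropy samples as prototypes."""
    entropy = self_entropy(probs)
    preds = probs.argmax(axis=1)
    prototypes = {}
    for c in range(probs.shape[1]):
        idx = np.flatnonzero(preds == c)
        if idx.size:
            prototypes[c] = features[idx[np.argsort(entropy[idx])[:per_class]]]
    return prototypes

def cosine_distance(a, b):
    """Pairwise (1 - cosine similarity) between rows of a and rows of b (rows assumed nonzero)."""
    a = a / np.linalg.norm(a, axis=1, keepdims=True)
    b = b / np.linalg.norm(b, axis=1, keepdims=True)
    return 1.0 - a @ b.T

def pseudolabel(features, prototypes):
    """Assign each sample the class whose prototypes are most similar on average."""
    classes = sorted(prototypes)
    sims = np.stack(
        [1.0 - cosine_distance(features, prototypes[c]).mean(axis=1) for c in classes],
        axis=1,
    )
    return np.array(classes)[sims.argmax(axis=1)]

def filter_confident(features, prototypes, labels):
    """Hypothetical hyperparameter-free point-to-set rule: keep a sample if its
    farthest own-class prototype is closer than the nearest other-class prototype."""
    keep = np.zeros(len(features), dtype=bool)
    for i, y in enumerate(labels):
        f = features[i:i + 1]
        d_own = cosine_distance(f, prototypes[int(y)]).max()
        d_rest = [cosine_distance(f, p).min() for c, p in prototypes.items() if c != int(y)]
        keep[i] = d_own < min(d_rest, default=np.inf)
    return keep

# Toy usage: features and probabilities stand in for source-model outputs on target data.
features = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9], [1.0, 0.95]])
probs = np.array([[0.99, 0.01], [0.95, 0.05], [0.01, 0.99], [0.05, 0.95], [0.55, 0.45]])
prototypes = select_prototypes(features, probs)
labels = pseudolabel(features, prototypes)
keep = filter_confident(features, prototypes, labels)
```

In this toy run the last sample lies between the two clusters, so the filter drops it even though it still receives a pseudolabel; only the retained samples would be used to train the target model.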
Pages: 508 - 518
Number of pages: 10