Multi-source Domain Adaptation Based on Data Selector with Soft Actor-Critic

被引:0
|
作者
Cui, Qiquan [1 ,2 ]
Jin, Xuanyu [1 ,2 ]
Dai, Weichen [1 ,2 ]
Kong, Wanzeng [1 ,2 ]
机构
[1] HangZhou DianZi Univ, Sch Comp Sci, Hangzhou, Peoples R China
[2] Key Lab Brain Machine Collaborat Intelligence Zhe, Hangzhou, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Multi-source domain adaptation; Reinforced learning data selector; Soft actor-critic;
D O I
10.1007/978-981-19-8222-4_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-source domain adaptation (MDA) aims to transfer the knowledge learned from multiple-sources domains to the target domain. Although the source domains are related to the target domain, the difference of data distribution between source and target domains may lead to negative transfer. Therefore, selecting the high-quality source data is conducive to mitigate the problem. However, the existing methods select the data with uniform criteria, neglecting the variety of multiple source domains. In this paper, we propose a reinforced learning Data Selector with the Soft Actor-Critic (DSAC) algorithm for MDA. Specifically, the Soft Actor-Critic (SAC) algorithm has two Q-value Critic networks, it can better judge the performance of the data. Select the data in multi-source domains to migrate with our target domain, and use the difference in loss both before and after the model to determine the quality of the data and whether it is retained. Extensive experiments on the representative benchmark demonstrate that our method performs favorably against the state-of-the-art approaches.
引用
收藏
页码:99 / 109
页数:11
相关论文
共 50 条
  • [1] Load Frequency Active Disturbance Rejection Control for Multi-Source Power System Based on Soft Actor-Critic
    Zheng, Yuemin
    Tao, Jin
    Sun, Hao
    Sun, Qinglin
    Chen, Zengqiang
    Dehmer, Matthias
    Zhou, Quan
    ENERGIES, 2021, 14 (16)
  • [2] Latent Context Based Soft Actor-Critic
    Pu, Yuan
    Wang, Shaochen
    Yao, Xin
    Li, Bin
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [3] Model-Based Soft Actor-Critic
    Chien, Jen-Tzung
    Yang, Shu-Hsiang
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 2028 - 2035
  • [4] Generative Adversarial Soft Actor-Critic
    Hwang, Hyo-Seok
    Kim, Yoojoong
    Seok, Junhee
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [5] An actor-critic model of saccade adaptation
    Manabu Inaba
    Tadashi Yamazaki
    BMC Neuroscience, 14 (Suppl 1)
  • [6] Soft Actor-Critic With Integer Actions
    Fan, Ting-Han
    Wang, Yubo
    2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2611 - 2616
  • [7] Cooperative optimal scheduling strategy of source and storage in microgrid based on soft actor-critic
    Liu L.
    Zhu J.
    Chen J.
    Ye H.
    Dianli Zidonghua Shebei/Electric Power Automation Equipment, 2022, 42 (01): : 79 - 85
  • [8] Transformer-Based Multi-Source Domain Adaptation Without Source Data
    Li, Gang
    Wu, Chao
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [9] Actor-Critic Algorithms with Online Feature Adaptation
    Prabuchandran, K. J.
    Bhatnagar, Shalabh
    Borkar, Vivek S.
    ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2016, 26 (04):
  • [10] Robot Skill Adaptation via Soft Actor-Critic Gaussian Mixture Models
    Nematollahi, Iman
    Rosete-Beas, Erick
    Roefer, Adrian
    Welschehold, Tim
    Valada, Abhinav
    Burgard, Wolfram
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 8651 - 8657