CoachGAN: Fast Adversarial Transfer Learning between Differently Shaped Entities

被引:1
|
作者
Mounsif, Mehdi [1 ]
Lengagne, Sebastien [1 ]
Thuilot, Benoit [1 ]
Adouane, Lounis [2 ]
机构
[1] Univ Clermont Auvergne, SIGMA Clermont, CNRS, Inst Pascal, F-63000 Clermont Ferrand, France
[2] Univ Technol Compiegne, Heudiasyc, CNRS, F-60200 Compiegne, France
关键词
Transfer Learning; Generative Adversarial Networks; Control; Differentiable Models;
D O I
10.5220/0009972200890096
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the last decade, robots have been taking an increasingly important place in our societies, and shall the current trend keep the same dynamic,their presence and activities will likely become ubiquitous. As robots will certainly be produced by various industrial actors, it is reasonable to assume that a very diverse robot population will be used by mankind for a broad panel of tasks. As such, it appears probable that robots with a distinct morphology will be required to perform the same task. As an important part of these tasks requires learning-based control and given the millions of interactions steps needed by these approaches to create a single agent, it appears highly desirable to be able to transfer skills from one agent to another despite a potentially different kinematic structure. Correspondingly, this paper introduces a new method, CoachGAN, based on an adversarial framework that allows fast transfer of capacities between a teacher and a student agent. The CoachGAN approach aims at embedding the teacher's way of solving the task within a critic network. Enhanced with the intermediate state variable (ISV) that translates a student state in its teacher equivalent, the critic is then able to guide the student policy in a supervised way in a fraction of the initial training time and without the student having any interaction with the target domain. To demonstrate the flexibility of this approach, CoachGAN is evaluated over a custom tennis task, using various ways to define the intermediate state variables.
引用
收藏
页码:89 / 96
页数:8
相关论文
共 50 条
  • [1] Identifying Named Entities of Adverse Drug Reaction with Adversarial Transfer Learning
    Han, Pu
    Zhong, Yule
    Lu, Haojie
    Ma, Shiwen
    [J]. Data Analysis and Knowledge Discovery, 2023, 7 (03) : 131 - 141
  • [2] Partial Transfer Learning for Fast Evolutionary Generative Adversarial Networks
    Liu, Zheping
    Sabar, Nasser
    Song, Andy
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [3] Identifying adverse drug reaction entities from social media with adversarial transfer learning model
    Zhang, Tongxuan
    Lin, Hongfei
    Ren, Yuqi
    Yang, Zhihao
    Wang, Jian
    Duan, Xiaodong
    Xu, Bo
    [J]. NEUROCOMPUTING, 2021, 453 : 254 - 262
  • [4] Identifying adverse drug reaction entities from social media with adversarial transfer learning model
    Zhang, Tongxuan
    Lin, Hongfei
    Ren, Yuqi
    Yang, Zhihao
    Wang, Jian
    Duan, Xiaodong
    Xu, Bo
    [J]. Neurocomputing, 2021, 453 : 254 - 262
  • [5] Quantum Adversarial Transfer Learning
    Wang, Longhan
    Sun, Yifan
    Zhang, Xiangdong
    [J]. ENTROPY, 2023, 25 (07)
  • [6] Adversarial training for fast arbitrary style transfer
    Xu, Zheng
    Wilber, Michael
    Fang, Chen
    Hertzmann, Aaron
    Jin, Hailin
    [J]. COMPUTERS & GRAPHICS-UK, 2020, 87 : 1 - 11
  • [7] Adversarial Vulnerability of Active Transfer Learning
    Mueller, Nicolas M.
    Boettinger, Konstantin
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS XIX, IDA 2021, 2021, 12695 : 116 - 127
  • [8] Fast and Adversarial Robust Kernelized SDU Learning
    Fan, Yajing
    Shi, Wanli
    Chang, Yi
    Gu, Bin
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [9] Transfer of Robot Perception Module With Adversarial Learning
    Sui, Hongjian
    Shang, Weiwei
    Li, Xiang
    [J]. IEEE ACCESS, 2019, 7 : 79726 - 79736
  • [10] Spectrum sensing based on adversarial transfer learning
    Miao, Jiawu
    Li, Yuebo
    Jing, Xiaojun
    Zhang, Fangpei
    Mu, Junsheng
    [J]. IET COMMUNICATIONS, 2022, 16 (17) : 2059 - 2069