CoachGAN: Fast Adversarial Transfer Learning between Differently Shaped Entities

被引：1

作者：

Mounsif, Mehdi ^{[1
]}

Lengagne, Sebastien ^{[1
]}

Thuilot, Benoit ^{[1
]}

Adouane, Lounis ^{[2
]}

机构：

[1] Univ Clermont Auvergne, SIGMA Clermont, CNRS, Inst Pascal, F-63000 Clermont Ferrand, France

[2] Univ Technol Compiegne, Heudiasyc, CNRS, F-60200 Compiegne, France

来源：

ICINCO: PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS | 2020年

关键词：

Transfer Learning; Generative Adversarial Networks; Control; Differentiable Models;

D O I：

10.5220/0009972200890096

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the last decade, robots have been taking an increasingly important place in our societies, and shall the current trend keep the same dynamic,their presence and activities will likely become ubiquitous. As robots will certainly be produced by various industrial actors, it is reasonable to assume that a very diverse robot population will be used by mankind for a broad panel of tasks. As such, it appears probable that robots with a distinct morphology will be required to perform the same task. As an important part of these tasks requires learning-based control and given the millions of interactions steps needed by these approaches to create a single agent, it appears highly desirable to be able to transfer skills from one agent to another despite a potentially different kinematic structure. Correspondingly, this paper introduces a new method, CoachGAN, based on an adversarial framework that allows fast transfer of capacities between a teacher and a student agent. The CoachGAN approach aims at embedding the teacher's way of solving the task within a critic network. Enhanced with the intermediate state variable (ISV) that translates a student state in its teacher equivalent, the critic is then able to guide the student policy in a supervised way in a fraction of the initial training time and without the student having any interaction with the target domain. To demonstrate the flexibility of this approach, CoachGAN is evaluated over a custom tennis task, using various ways to define the intermediate state variables.

引用

页码：89 / 96

页数：8

共 50 条

[1] Identifying Named Entities of Adverse Drug Reaction with Adversarial Transfer Learning
Han, Pu
Zhong, Yule
Lu, Haojie
Ma, Shiwen
[J]. Data Analysis and Knowledge Discovery, 2023, 7 (03) : 131 - 141
[2] Partial Transfer Learning for Fast Evolutionary Generative Adversarial Networks
Liu, Zheping
Sabar, Nasser
Song, Andy
[J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[3] Identifying adverse drug reaction entities from social media with adversarial transfer learning model
Zhang, Tongxuan
Lin, Hongfei
Ren, Yuqi
Yang, Zhihao
Wang, Jian
Duan, Xiaodong
Xu, Bo
[J]. NEUROCOMPUTING, 2021, 453 : 254 - 262
[4] Identifying adverse drug reaction entities from social media with adversarial transfer learning model
Zhang, Tongxuan
Lin, Hongfei
Ren, Yuqi
Yang, Zhihao
Wang, Jian
Duan, Xiaodong
Xu, Bo
[J]. Neurocomputing, 2021, 453 : 254 - 262
[5] Quantum Adversarial Transfer Learning
Wang, Longhan
Sun, Yifan
Zhang, Xiangdong
[J]. ENTROPY, 2023, 25 (07)
[6] Adversarial training for fast arbitrary style transfer
Xu, Zheng
Wilber, Michael
Fang, Chen
Hertzmann, Aaron
Jin, Hailin
[J]. COMPUTERS & GRAPHICS-UK, 2020, 87 : 1 - 11
[7] Adversarial Vulnerability of Active Transfer Learning
Mueller, Nicolas M.
Boettinger, Konstantin
[J]. ADVANCES IN INTELLIGENT DATA ANALYSIS XIX, IDA 2021, 2021, 12695 : 116 - 127
[8] Fast and Adversarial Robust Kernelized SDU Learning
Fan, Yajing
Shi, Wanli
Chang, Yi
Gu, Bin
[J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
[9] Transfer of Robot Perception Module With Adversarial Learning
Sui, Hongjian
Shang, Weiwei
Li, Xiang
[J]. IEEE ACCESS, 2019, 7 : 79726 - 79736
[10] Spectrum sensing based on adversarial transfer learning
Miao, Jiawu
Li, Yuebo
Jing, Xiaojun
Zhang, Fangpei
Mu, Junsheng
[J]. IET COMMUNICATIONS, 2022, 16 (17) : 2059 - 2069

← 1 2 3 4 5 →