Generative Adversarial Imitation Learning

Cited by: 0

Authors:

Ho, Jonathan [1]

Ermon, Stefano [2]

Affiliations:

[1] OpenAI, San Francisco, CA 94110 USA

[2] Stanford Univ, Stanford, CA 94305 USA

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016) | 2016 / Vol. 29

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Consider learning a policy from example expert behavior, without interaction with the expert or access to a reinforcement signal. One approach is to recover the expert's cost function with inverse reinforcement learning, then extract a policy from that cost function with reinforcement learning. This approach is indirect and can be slow. We propose a new general framework for directly extracting a policy from data as if it were obtained by reinforcement learning following inverse reinforcement learning. We show that a certain instantiation of our framework draws an analogy between imitation learning and generative adversarial networks, from which we derive a model-free imitation learning algorithm that obtains significant performance gains over existing model-free methods in imitating complex behaviors in large, high-dimensional environments.
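The analogy to generative adversarial networks mentioned in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes a linear logistic discriminator D(s, a) over hypothetical state-action feature vectors, synthetic Gaussian data standing in for expert and policy samples, and the sign convention in which D scores policy samples high and expert samples low, so that log D(s, a) can act as a surrogate cost for the policy. All function names here are made up for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def discriminator_objective(w, policy_sa, expert_sa):
    """E_policy[log D] + E_expert[log(1 - D)] with D(s, a) = sigmoid(w . x)."""
    d_policy = sigmoid(policy_sa @ w)
    d_expert = sigmoid(expert_sa @ w)
    return np.mean(np.log(d_policy)) + np.mean(np.log(1.0 - d_expert))

def discriminator_ascent_step(w, policy_sa, expert_sa, lr=0.1):
    """One gradient ascent step on the objective above."""
    d_policy = sigmoid(policy_sa @ w)
    d_expert = sigmoid(expert_sa @ w)
    # d/dw log sigmoid(z) = (1 - sigmoid(z)) x; d/dw log(1 - sigmoid(z)) = -sigmoid(z) x
    grad = ((1.0 - d_policy)[:, None] * policy_sa).mean(axis=0) \
         - (d_expert[:, None] * expert_sa).mean(axis=0)
    return w + lr * grad

def surrogate_cost(w, sa):
    """Cost handed to the policy optimizer: log D(s, a). Lower means more expert-like."""
    return np.log(sigmoid(sa @ w))

# Synthetic stand-ins for state-action features (assumption: 4-dimensional).
rng = np.random.default_rng(0)
expert_sa = rng.normal(loc=1.0, size=(256, 4))
policy_sa = rng.normal(loc=-1.0, size=(256, 4))

w = np.zeros(4)
before = discriminator_objective(w, policy_sa, expert_sa)
for _ in range(200):
    w = discriminator_ascent_step(w, policy_sa, expert_sa)
after = discriminator_objective(w, policy_sa, expert_sa)

print("objective before / after:", before, after)
print("mean cost on expert samples:", surrogate_cost(w, expert_sa).mean())
print("mean cost on policy samples:", surrogate_cost(w, policy_sa).mean())
```

In a full algorithm of this kind, the discriminator update would alternate with a model-free policy update that lowers the expected surrogate cost; that policy step is omitted here.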

Pages: 9

50 records in total

[1] Quantum generative adversarial imitation learning
Xiao, Tailong
Huang, Jingzheng
Li, Hongjing
Fan, Jianping
Zeng, Guihua
NEW JOURNAL OF PHYSICS, 2023, 25 (03):
[2] Deterministic generative adversarial imitation learning
Zuo, Guoyu
Chen, Kexin
Lu, Jiahao
Huang, Xiangsheng
NEUROCOMPUTING, 2020, 388 : 60 - 69
[3] A Bayesian Approach to Generative Adversarial Imitation Learning
Jeon, Wonseok
Seo, Seokin
Kim, Kee-Eung
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[4] Robot Manipulation Learning Using Generative Adversarial Imitation Learning
Jabri, Mohamed Khalil
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4893 - 4894
[5] A Survey of Imitation Learning Based on Generative Adversarial Nets
Lin J.-H.
Zhang Z.-Z.
Jiang C.
Hao J.-Y.
Jisuanji Xuebao/Chinese Journal of Computers, 2020, 43 (02): 326 - 351
[6] Ranking-Based Generative Adversarial Imitation Learning
Shi, Zhipeng
Zhang, Xuehe
Fang, Yu
Li, Changle
Liu, Gangfeng
Zhao, Jie
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (10): 8967 - 8974
[7] Generative Adversarial Imitation Learning from Failed Experiences
Zhu, Jiacheng
Lin, Jiahao
Wang, Meng
Chen, Yingfeng
Fan, Changjie
Jiang, Chong
Zhang, Zongzhang
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13997 - 13998
[8] TextGAIL: Generative Adversarial Imitation Learning for Text Generation
Wu, Qingyang
Li, Lei
Yu, Zhou
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14067 - 14075
[9] Multimodal Storytelling via Generative Adversarial Imitation Learning
Chen, Zhiqian
Zhang, Xuchao
Boedihardjo, Arnold P.
Dai, Jing
Lu, Chang-Tien
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3967 - 3973
[10] Multi-Agent Generative Adversarial Imitation Learning
Song, Jiaming
Ren, Hongyu
Sadigh, Dorsa
Ermon, Stefano
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
