Generative Adversarial Imitation Learning

被引：0

作者：

Ho, Jonathan ^{[1
]}

Ermon, Stefano ^{[2
]}

机构：

[1] OpenAI, San Francisco, CA 94110 USA

[2] Stanford Univ, Stanford, CA 94305 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016) | 2016年 / 29卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Consider learning a policy from example expert behavior, without interaction with the expert or access to a reinforcement signal. One approach is to recover the expert's cost function with inverse reinforcement learning, then extract a policy from that cost function with reinforcement learning. This approach is indirect and can be slow. We propose a new general framework for directly extracting a policy from data as if it were obtained by reinforcement learning following inverse reinforcement learning. We show that a certain instantiation of our framework draws an analogy between imitation learning and generative adversarial networks, from which we derive a model-free imitation learning algorithm that obtains significant performance gains over existing model-free methods in imitating complex behaviors in large, high-dimensional environments.

引用

页数：9

共 50 条

[41] Collaborative Robot-Assisted Endovascular Catheterization with Generative Adversarial Imitation Learning
Chi, Wenqiang
Dagnino, Giulio
Kwok, Trevor M. Y.
Anh Nguyen
Kundrat, Dennis
Abdelaziz, Mohamed E. M. K.
Riga, Celia
Bicknell, Colin
Yang, Guang-Zhong
2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 2414 - 2420
[42] Generative Adversarial Imitation Learning Based Bicycle Behaviors Simulation on Road Segments
Wei, Shuqiao
Ni, Ying
Sun, Jian
Qiu, Hongtong
Jiaotong Yunshu Xitong Gongcheng Yu Xinxi/Journal of Transportation Systems Engineering and Information Technology, 2024, 24 (04): : 105 - 115
[43] Restored Action Generative Adversarial Imitation Learning from observation for robot manipulator
Park, Jongcheon
Han, Seungyong
Lee, S. M.
ISA TRANSACTIONS, 2022, 129 : 684 - 690
[44] Goal Conditioned Generative Adversarial Imitation Learning Based on Dueling-DQN
Xu, Ziqi
Wang, Shaofan
Li, Ke
PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2365 - 2378
[45] A Mixed Generative Adversarial Imitation Learning Based Vehicle Path Planning Algorithm
Yang, Zan
Nai, Wei
Li, Dan
Liu, Lu
Chen, Ziyu
IEEE ACCESS, 2024, 12 : 85859 - 85879
[46] Generative adversarial interactive imitation learning for path following of autonomous underwater vehicle
Jiang, Dong
Huang, Jie
Fang, Zheng
Cheng, Chunxi
Sha, Qixin
He, Bo
Li, Guangliang
OCEAN ENGINEERING, 2022, 260
[47] TrajGAIL: Generating urban vehicle trajectories using generative adversarial imitation learning
Choi, Seongjin
Kim, Jiwon
Yeo, Hwasoo
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2021, 128
[48] An Enhanced Driving Trajectory Prediction Method Based on Generative Adversarial Imitation Learning
Liu, Ming
Lin, Fanrong
Zhang, Zhen
Jia, Yungang
Cui, Jianming
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT V, ICIC 2024, 2024, 14879 : 179 - 190
[49] Modelling flight trajectories with multi-modal generative adversarial imitation learning
Spatharis, Christos
Blekas, Konstantinos
Vouros, George A.
APPLIED INTELLIGENCE, 2024, : 7118 - 7134
[50] Generative Adversarial Neuroevolution for Control Behaviour Imitation
Le Clei, Maximilien
Bellec, Pierre
PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2023 COMPANION, 2023, : 663 - 666

← 1 2 3 4 5 →