Generative Adversarial Imitation Learning

Authors
Ho, Jonathan [1]
Ermon, Stefano [2]
Affiliations
[1] OpenAI, San Francisco, CA 94110 USA
[2] Stanford Univ, Stanford, CA 94305 USA
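Source
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016) | 2016 / Vol. 29
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract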
Consider learning a policy from example expert behavior, without interaction with the expert or access to a reinforcement signal. One approach is to recover the expert's cost function with inverse reinforcement learning, then extract a policy from that cost function with reinforcement learning. This approach is indirect and can be slow. We propose a new general framework for directly extracting a policy from data as if it were obtained by reinforcement learning following inverse reinforcement learning. We show that a certain instantiation of our framework draws an analogy between imitation learning and generative adversarial networks, from which we derive a model-free imitation learning algorithm that obtains significant performance gains over existing model-free methods in imitating complex behaviors in large, high-dimensional environments.
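The analogy with generative adversarial networks mentioned in the abstract can be illustrated with a minimal sketch of the discriminator side only. It assumes a logistic discriminator D(s, a) over concatenated state-action features, trained to score policy samples high and expert samples low, with log D(s, a) then serving as a surrogate cost for the policy. The function names, the linear model, the learning rate, and the synthetic Gaussian data are all illustrative assumptions, not the authors' implementation, and the policy-update step (reinforcement learning against the surrogate cost) is omitted.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def discriminator_loss_and_grad(w, policy_sa, expert_sa):
    """Loss = -(E_policy[log D] + E_expert[log(1 - D)]) with D = sigmoid(x @ w)."""
    d_pi = sigmoid(policy_sa @ w)
    d_ex = sigmoid(expert_sa @ w)
    eps = 1e-8  # avoids log(0)
    loss = -(np.mean(np.log(d_pi + eps)) + np.mean(np.log(1.0 - d_ex + eps)))
    # Analytic gradient of the loss with respect to w.
    grad = (-(policy_sa * (1.0 - d_pi)[:, None]).mean(axis=0)
            + (expert_sa * d_ex[:, None]).mean(axis=0))
    return loss, grad


# Hypothetical data: 2-D state-action features plus a constant bias column.
rng = np.random.default_rng(0)
n = 200
expert_sa = np.hstack([rng.normal(1.0, 1.0, size=(n, 2)), np.ones((n, 1))])
policy_sa = np.hstack([rng.normal(-1.0, 1.0, size=(n, 2)), np.ones((n, 1))])

w = np.zeros(3)
initial_loss, _ = discriminator_loss_and_grad(w, policy_sa, expert_sa)

# Plain gradient descent on the discriminator loss (assumed step size).
for _ in range(300):
    _, grad = discriminator_loss_and_grad(w, policy_sa, expert_sa)
    w -= 0.1 * grad

final_loss, _ = discriminator_loss_and_grad(w, policy_sa, expert_sa)

# Surrogate cost log D(s, a): higher where behavior looks unlike the expert.
cost_policy = np.log(sigmoid(policy_sa @ w) + 1e-8)
cost_expert = np.log(sigmoid(expert_sa @ w) + 1e-8)
```

In a full adversarial loop, this discriminator update would alternate with a policy update that lowers the expected surrogate cost, so that the policy's state-action samples become harder to tell apart from the expert's.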
Pages: 9
Related papers
50 in total
  • [41] Collaborative Robot-Assisted Endovascular Catheterization with Generative Adversarial Imitation Learning
    Chi, Wenqiang
    Dagnino, Giulio
    Kwok, Trevor M. Y.
    Anh Nguyen
    Kundrat, Dennis
    Abdelaziz, Mohamed E. M. K.
    Riga, Celia
    Bicknell, Colin
    Yang, Guang-Zhong
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020: 2414-2420
  • [42] Generative Adversarial Imitation Learning Based Bicycle Behaviors Simulation on Road Segments
    Wei, Shuqiao
    Ni, Ying
    Sun, Jian
    Qiu, Hongtong
    Jiaotong Yunshu Xitong Gongcheng Yu Xinxi/Journal of Transportation Systems Engineering and Information Technology, 2024, 24 (04): 105-115
  • [43] Restored Action Generative Adversarial Imitation Learning from observation for robot manipulator
    Park, Jongcheon
    Han, Seungyong
    Lee, S. M.
    ISA TRANSACTIONS, 2022, 129: 684-690
  • [44] Goal Conditioned Generative Adversarial Imitation Learning Based on Dueling-DQN
    Xu, Ziqi
    Wang, Shaofan
    Li, Ke
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010: 2365-2378
  • [45] A Mixed Generative Adversarial Imitation Learning Based Vehicle Path Planning Algorithm
    Yang, Zan
    Nai, Wei
    Li, Dan
    Liu, Lu
    Chen, Ziyu
    IEEE ACCESS, 2024, 12: 85859-85879
  • [46] Generative adversarial interactive imitation learning for path following of autonomous underwater vehicle
    Jiang, Dong
    Huang, Jie
    Fang, Zheng
    Cheng, Chunxi
    Sha, Qixin
    He, Bo
    Li, Guangliang
    OCEAN ENGINEERING, 2022, 260
  • [47] TrajGAIL: Generating urban vehicle trajectories using generative adversarial imitation learning
    Choi, Seongjin
    Kim, Jiwon
    Yeo, Hwasoo
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2021, 128
  • [48] An Enhanced Driving Trajectory Prediction Method Based on Generative Adversarial Imitation Learning
    Liu, Ming
    Lin, Fanrong
    Zhang, Zhen
    Jia, Yungang
    Cui, Jianming
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT V, ICIC 2024, 2024, 14879: 179-190
  • [49] Modelling flight trajectories with multi-modal generative adversarial imitation learning
    Spatharis, Christos
    Blekas, Konstantinos
    Vouros, George A.
    APPLIED INTELLIGENCE, 2024: 7118-7134
  • [50] Generative Adversarial Neuroevolution for Control Behaviour Imitation
    Le Clei, Maximilien
    Bellec, Pierre
    PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2023 COMPANION, 2023: 663-666