Generalization and Computation for Policy Classes of Generative Adversarial Imitation Learning

被引:3
|
作者
Zhou, Yirui [1 ]
Zhang, Yangchun [1 ]
Liu, Xiaowei [1 ]
Wang, Wanying [1 ]
Che, Zhengping [2 ]
Xu, Zhiyuan [2 ]
Tang, Jian [2 ]
Peng, Yaxin [1 ]
机构
[1] Shanghai Univ, Sch Sci, Dept Math, Shanghai 200444, Peoples R China
[2] Midea Grp, AI Innovat Ctr, Shanghai 201702, Peoples R China
关键词
Generative adversarial imitation learning; Generalization; Computation; Policy classes;
D O I
10.1007/978-3-031-14714-2_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generative adversarial imitation learning (GAIL) learns an optimal policy by expert demonstrations from the environment with unknown reward functions. Different from existing works that studied the generalization of reward function classes or discriminator classes, we focus on policy classes. This paper investigates the generalization and computation for policy classes of GAIL. Specifically, our contributions lie in: 1) We prove that the generalization is guaranteed in GAIL when the complexity of policy classes is properly controlled. 2) We provide an off-policy framework called the two-stage stochastic gradient (TSSG), which can efficiently solve GAIL based on the soft policy iteration and attain the sublinear convergence rate to a stationary solution. The comprehensive numerical simulations are illustrated in MuJoCo environments.
引用
收藏
页码:385 / 399
页数:15
相关论文
共 50 条
  • [21] Saliency Prediction on Omnidirectional Image With Generative Adversarial Imitation Learning
    Xu, Mai
    Yang, Li
    Tao, Xiaoming
    Duan, Yiping
    Wang, Zulin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2087 - 2102
  • [22] Imitation Learning for Playing Shogi Based on Generative Adversarial Networks
    Wan, Shanchuan
    Kaneko, Tomoyuki
    2017 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2017, : 92 - 95
  • [23] Imitating Agents in A Complex Environment by Generative Adversarial Imitation Learning
    Li, Wanxiang
    Hsueh, Chu-Hsuan
    Ikeda, Kokolo
    2020 IEEE CONFERENCE ON GAMES (IEEE COG 2020), 2020, : 702 - 705
  • [24] Domain Adaptation for Imitation Learning Using Generative Adversarial Network
    Duc, Tho Nguyen
    Tran, Chanh Minh
    Tan, Phan Xuan
    Kamioka, Eiji
    SENSORS, 2021, 21 (14)
  • [25] Joint Entity and Event Extraction with Generative Adversarial Imitation Learning
    Zhang, Tongtao
    Ji, Heng
    Sil, Avirup
    DATA INTELLIGENCE, 2019, 1 (02) : 99 - 120
  • [26] MusicGAIL: A Generative Adversarial Imitation Learning Approach for Music Generation
    Liao, Yusong
    Xu, Hongguang
    Xu, Ke
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 505 - 516
  • [27] Urban Vehicle Trajectory Generation Based on Generative Adversarial Imitation Learning
    Wang, Min
    Cui, Jianqun
    Wong, Yew Wee
    Chang, Yanan
    Wu, Libing
    Jin, Jiong
    IEEE Transactions on Vehicular Technology, 2024, 73 (12) : 18237 - 18249
  • [28] Sample-Efficient Imitation Learning via Generative Adversarial Nets
    Blonde, Lionel
    Kalousis, Alexandros
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [29] Unmanned surface vehicle navigation through generative adversarial imitation learning
    Chaysri, Piyabhum
    Spatharis, Christos
    Blekas, Konstantinos
    Vlachos, Kostas
    OCEAN ENGINEERING, 2023, 282
  • [30] Modeling Human Driving Behavior Through Generative Adversarial Imitation Learning
    Bhattacharyya, Raunak
    Wulfe, Blake
    Phillips, Derek J.
    Kuefler, Alex
    Morton, Jeremy
    Senanayake, Ransalu
    Kochenderfer, Mykel J.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (03) : 2874 - 2887