Addressing Delays in Reinforcement Learning via Delayed Adversarial Imitation Learning

被引:2
|
作者
Xie, Minzhi [1 ]
Xia, Bo [1 ]
Yu, Yalou [1 ]
Wang, Xueqian [1 ]
Chang, Yongzhe [1 ]
机构
[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen 518000, Peoples R China
关键词
Reinforcement Learning; Delays; Adversarial Imitation Learning;
D O I
10.1007/978-3-031-44213-1_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Observation and action delays occur commonly in many real-world tasks which violate Markov property and consequently degrade the performance of Reinforcement Learning methods. So far, there have been several efforts on delays in RL. Model-based methods train forward models to predict unknown current information while model-free approaches focus on state-augmentation to define new Markov Decision Processes. However, previous works suffer from difficult model fine-tuning and the curse of dimensionality that prevent them from solving delays. Motivated by the advantage of imitation learning, a novel idea is introduced that a delayed policy can be trained by imitating undelayed expert demonstrations. Based on the idea, we propose an algorithm named Delayed Adversarial Imitation Learning (DAIL). In DAIL, a few undelayed expert demonstrations are utilized to generate a surrogate delayed expert and a delayed policy is trained by imitating the surrogate expert using adversarial imitation learning. Moreover, a theoretical analysis of DAIL is presented to validate the rationality of DAIL and guide the practical design of the approach. Finally, experiments on continuous control tasks demonstrate that DAIL achieves much higher performance than previous approaches in solving delays in RL, where DAIL can converge to high performance with an excellent sample efficiency, even for substantial delays, while previous works cannot due to the divergence problems.
引用
收藏
页码:271 / 282
页数:12
相关论文
共 50 条
  • [41] Robotic Manipulation with Reinforcement Learning, State Representation Learning, and Imitation Learning
    Chen, Hanxiao
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15769 - 15770
  • [42] Learning from Suboptimal Demonstration via Trajectory-Ranked Adversarial Imitation
    Chen, Luyao
    Xie, Shaorong
    Pang, Tao
    Yu, Hang
    Luo, Xiangfeng
    Zhang, Zhenyu
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 486 - 493
  • [43] A Bayesian Approach to Generative Adversarial Imitation Learning
    Jeon, Wonseok
    Seo, Seokin
    Kim, Kee-Eung
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [44] Sample-efficient Adversarial Imitation Learning
    Jung, Dahuin
    Lee, Hyungyu
    Yoon, Sungroh
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [45] Combating False Negatives in Adversarial Imitation Learning
    Zolna, Konrad
    Saharia, Chitwan
    Boussioux, Leonard
    Hui, David Yu-Tung
    Chevalier-Boisvert, Maxime
    Bahdanau, Dzmitry
    Bengio, Yoshua
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [46] Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning
    Wang, Yunke
    Du, Bo
    Xu, Chang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 10262 - 10270
  • [47] Variational Adversarial Kernel Learned Imitation Learning
    Yang, Fan
    Vereshchaka, Alma
    Zhou, Yufan
    Chen, Changyou
    Dong, Wen
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 6599 - 6606
  • [48] Adversarial Imitation Learning with Trajectorial Augmentation and Correction
    Antotsiou, Dafni
    Ciliberto, Carlo
    Kim, Tae-Kyun
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 4724 - 4730
  • [49] Sample-efficient Adversarial Imitation Learning
    Jung, Dahuin
    Lee, Hyungyu
    Yoon, Sungroh
    Journal of Machine Learning Research, 2024, 25 : 1 - 32
  • [50] Self-Supervised Adversarial Imitation Learning
    Monteiro, Juarez
    Gavenski, Nathan
    Meneguzzi, Felipe
    Barros, Rodrigo C.
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,