Addressing Delays in Reinforcement Learning via Delayed Adversarial Imitation Learning

Cited by: 2

Authors:

Xie, Minzhi [1]

Xia, Bo [1]

Yu, Yalou [1]

Wang, Xueqian [1]

Chang, Yongzhe [1]

Affiliations:

[1] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen 518000, Peoples R China

Source:

ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT III | 2023 / Vol. 14256

Keywords:

Reinforcement Learning; Delays; Adversarial Imitation Learning;

DOI:

10.1007/978-3-031-44213-1_23

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Observation and action delays are common in real-world tasks; they violate the Markov property and consequently degrade the performance of reinforcement learning (RL) methods. Several efforts have addressed delays in RL. Model-based methods train forward models to predict the unknown current information, while model-free approaches rely on state augmentation to define new Markov decision processes. However, previous works suffer from difficult model fine-tuning and the curse of dimensionality, which prevent them from handling delays effectively. Motivated by the advantages of imitation learning, we introduce a novel idea: a delayed policy can be trained by imitating undelayed expert demonstrations. Based on this idea, we propose an algorithm named Delayed Adversarial Imitation Learning (DAIL). In DAIL, a few undelayed expert demonstrations are used to generate a surrogate delayed expert, and a delayed policy is trained by imitating the surrogate expert using adversarial imitation learning. Moreover, a theoretical analysis of DAIL is presented to validate its rationality and guide the practical design of the approach. Finally, experiments on continuous control tasks demonstrate that DAIL achieves much higher performance than previous approaches to delays in RL: DAIL converges to high performance with excellent sample efficiency, even under substantial delays, whereas previous works fail to do so because of divergence problems.
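The state-augmentation formulation mentioned in the abstract can be illustrated with a minimal sketch. It assumes a constant delay of d steps and uses the textbook construction in which the augmented state at time t is the last observed state s_{t-d} concatenated with the d actions taken since then. The function name `augment_trajectory` and the toy data are hypothetical, and this is not the DAIL algorithm itself, whose construction of the surrogate delayed expert is not described in this record.

```python
def augment_trajectory(states, actions, delay):
    """Build augmented states x_t = (s_{t-d}, a_{t-d}, ..., a_{t-1}).

    states  : list of state tuples s_0 .. s_T      (length T + 1)
    actions : list of action tuples a_0 .. a_{T-1} (length T)
    delay   : constant delay d in time steps (assumed known)
    """
    augmented = []
    for t in range(delay, len(actions) + 1):
        # The most recent state the agent can actually observe at time t.
        delayed_state = tuple(states[t - delay])
        # Actions already issued whose effects are not yet observed.
        pending = tuple(x for a in actions[t - delay:t] for x in a)
        augmented.append(delayed_state + pending)
    return augmented


# Toy 1-D example (hypothetical data).
states = [(0.0,), (1.0,), (2.0,), (3.0,)]
actions = [(0.1,), (0.2,), (0.3,)]
augmented = augment_trajectory(states, actions, delay=2)
```

Each augmented state has dimension state_dim + delay * action_dim, so the input size grows linearly with the delay. This growth is one way to read the curse of dimensionality that the abstract attributes to state-augmentation approaches.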

Pages: 271-282

Number of pages: 12
