Curiosity-Driven Variational Autoencoder for Deep Q Network

Cited by: 1

Authors:

Han, Gao-Jie [1]

Zhang, Xiao-Fang [1]

Wang, Hao [1]

Mao, Chen-Guang [1]

Affiliation:

[1] Soochow University, School of Computer Science and Technology, Suzhou, People's Republic of China

Source:

ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT I | 2020 / Vol. 12084

Funding:

National Natural Science Foundation of China;

Keywords:

Reinforcement learning; Deep Q learning; Exploration; Variational autoencoder;

DOI:

10.1007/978-3-030-47426-3_59

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In recent years, deep reinforcement learning (DRL) has achieved tremendous success in high-dimensional, large-scale control and sequential decision-making tasks. However, current model-free DRL methods suffer from low sample efficiency, a bottleneck that limits their performance. To alleviate this problem, some researchers have used generative models to model the environment, but a generative model may become inaccurate or even collapse if the state space has not been sufficiently explored. In this paper, we introduce the Curiosity-driven Variational Autoencoder (CVAE), a model that combines a variational autoencoder with curiosity-driven exploration. During training, the CVAE model improves sample efficiency, while curiosity-driven exploration ensures sufficient exploration of complex environments. We then propose a CVAE-based algorithm, DQN-CVAE, that scales CVAE to higher-dimensional environments. Finally, we evaluate the algorithm on several Atari 2600 games; the experimental results show that DQN-CVAE achieves better performance in terms of average reward per episode on these games.
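The abstract does not say how the curiosity signal is computed. A minimal sketch of one common way to pair a variational autoencoder with curiosity-driven exploration is given here: it assumes the intrinsic reward is the VAE loss on the observed state, so states the model reconstructs poorly earn a larger bonus. The function names, the unit-variance Gaussian decoder, and the mixing weight `beta` are illustrative assumptions, not details taken from the paper.

```python
import math


def gaussian_kl(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ) for a diagonal Gaussian."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv
                     for m, lv in zip(mu, log_var))


def reconstruction_error(x, x_hat):
    """Half the sum of squared errors between x and its reconstruction.

    Assumption: a unit-variance Gaussian decoder, so this equals the
    negative log-likelihood up to an additive constant.
    """
    return 0.5 * sum((a - b) ** 2 for a, b in zip(x, x_hat))


def vae_loss(x, x_hat, mu, log_var):
    """Single-sample VAE loss: reconstruction term plus KL term."""
    return reconstruction_error(x, x_hat) + gaussian_kl(mu, log_var)


def shaped_reward(extrinsic, x, x_hat, mu, log_var, beta=0.1):
    """Hypothetical mixing rule: extrinsic reward plus beta * VAE loss.

    beta is an assumed weighting hyperparameter, not a value from the paper.
    """
    return extrinsic + beta * vae_loss(x, x_hat, mu, log_var)


# Toy observation with a good and a poor reconstruction of it.
x = [1.0, 0.0]
well_modelled = shaped_reward(0.0, x, [1.0, 0.0], [0.0], [0.0])
poorly_modelled = shaped_reward(0.0, x, [0.0, 0.0], [0.0], [0.0])
```

In this toy case the poorly reconstructed state receives the larger bonus, which is the behaviour a curiosity term is meant to produce. In a DQN setting, a reward shaped this way would stand in for the environment reward when forming the temporal-difference target; whether DQN-CVAE does exactly this is not stated in the abstract.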

Pages: 764-775

Page count: 12
