Curiosity-Driven Variational Autoencoder for Deep Q Network

被引：1

作者：

Han, Gao-Jie ^{[1
]}

Zhang, Xiao-Fang ^{[1
]}

Wang, Hao ^{[1
]}

Mao, Chen-Guang ^{[1
]}

机构：

[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou, Peoples R China

来源：

ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT I | 2020年 / 12084卷

基金：

中国国家自然科学基金;

关键词：

Reinforcement learning; Deep Q learning; Exploration; Variational autoencoder;

D O I：

10.1007/978-3-030-47426-3_59

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, deep reinforcement learning (DRL) has achieved tremendous success in high-dimensional and large-scale space control and sequential decision-making tasks. However, the current model-free DRL methods suffer from low sample efficiency, which is a bottleneck that limits their performance. To alleviate this problem, some researchers used the generative model for modeling the environment. But the generative model may become inaccurate or even collapse if the state has not been sufficiently explored. In this paper, we introduce a model called Curiosity-driven Variational Autoencoder (CVAE), which combines variational autoencoder and curiosity-driven exploration. During the training process, the CVAE model can improve sample efficiency while curiosity-driven exploration can make sufficient exploration in a complex environment. Then, a CVAE-based algorithm is proposed, namely DQN-CVAE, that scales CVAE to higher dimensional environments. Finally, the performance of our algorithm is evaluated through several Atari 2600 games, and the experimental results show that the DQN-CVAE achieves better performance in terms of average reward per episode on these games.

引用

页码：764 / 775

页数：12

共 50 条

[41] Curiosity-driven exploration: foundations in neuroscience and computational modeling
Modirshanechi, Alireza
Kondrakiewicz, Kacper
Gerstner, Wulfram
Haesler, Sebastian
TRENDS IN NEUROSCIENCES, 2023, 46 (12) : 1054 - 1066
[42] Curiosity-Driven Exploration via Latent Bayesian Surprise
Mazzaglia, Pietro
Catal, Ozan
Verbelen, Tim
Dhoedt, Bart
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7752 - 7760
[43] Humans monitor learning progress in curiosity-driven exploration
Alexandr Ten
Pramod Kaushik
Pierre-Yves Oudeyer
Jacqueline Gottlieb
Nature Communications, 12
[44] Curiosity-Driven Salient Object Detection With Fragment Attention
Wang, Zheng
Wang, Pengzhi
Han, Yahong
Zhang, Xue
Sun, Meijun
Tian, Qi
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 5989 - 6001
[45] Conversational agents for fostering curiosity-driven learning in children
Abdelghani, Rania
Oudeyer, Pierre-Yves
Law, Edith
de Vulpillieres, Catherine
Sauzeon, Helene
INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2022, 167
[46] Towards Hierarchical Curiosity-Driven Exploration of Sensorimotor Models
Forestier, Sebastien
Oudeyer, Pierre-Yves
5TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING AND ON EPIGENETIC ROBOTICS (ICDL-EPIROB), 2015, : 234 - 235
[47] Leveraging ambient sensing for the estimation of curiosity-driven human crowd
Das, Anirban
Narayan, Kartik
Chakraborty, Suchetana
SYSCON 2022: THE 16TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE (SYSCON), 2022,
[48] Seeking Visual Discomfort: Curiosity-driven Representations for Reinforcement Learning
Aljalbout, Elie
Ulmer, Maximilian
Triebel, Rudolph
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 3591 - 3597
[49] The consequences of the extended gap between curiosity-driven and impact-driven research
Woxenius, Johan
TRANSPORT REVIEWS, 2015, 35 (04) : 401 - 403
[50] Optimal Curiosity-Driven Modular Incremental Slow Feature Analysis
Kompella, Varun Raj
Luciw, Matthew
Stollenga, Marijn Frederik
Schmidhuber, Juergen
NEURAL COMPUTATION, 2016, 28 (08) : 1599 - 1662

← 1 2 3 4 5 →