Curiosity-Driven Variational Autoencoder for Deep Q Network

被引:1
|
作者
Han, Gao-Jie [1 ]
Zhang, Xiao-Fang [1 ]
Wang, Hao [1 ]
Mao, Chen-Guang [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Deep Q learning; Exploration; Variational autoencoder;
D O I
10.1007/978-3-030-47426-3_59
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, deep reinforcement learning (DRL) has achieved tremendous success in high-dimensional and large-scale space control and sequential decision-making tasks. However, the current model-free DRL methods suffer from low sample efficiency, which is a bottleneck that limits their performance. To alleviate this problem, some researchers used the generative model for modeling the environment. But the generative model may become inaccurate or even collapse if the state has not been sufficiently explored. In this paper, we introduce a model called Curiosity-driven Variational Autoencoder (CVAE), which combines variational autoencoder and curiosity-driven exploration. During the training process, the CVAE model can improve sample efficiency while curiosity-driven exploration can make sufficient exploration in a complex environment. Then, a CVAE-based algorithm is proposed, namely DQN-CVAE, that scales CVAE to higher dimensional environments. Finally, the performance of our algorithm is evaluated through several Atari 2600 games, and the experimental results show that the DQN-CVAE achieves better performance in terms of average reward per episode on these games.
引用
收藏
页码:764 / 775
页数:12
相关论文
共 50 条
  • [1] CURIOSITY-DRIVEN RESEARCH
    HUGHBANKS, T
    CHEMICAL & ENGINEERING NEWS, 1995, 73 (49) : 5 - 5
  • [2] Curiosity-Driven Optimization
    Schaul, Tom
    Sun, Yi
    Wierstra, Daan
    Gomez, Fausino
    Schmidhuber, Juergen
    2011 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2011, : 1343 - 1349
  • [3] Random curiosity-driven exploration in deep reinforcement learning
    Li, Jing
    Shi, Xinxin
    Li, Jiehao
    Zhang, Xin
    Wang, Junzheng
    NEUROCOMPUTING, 2020, 418 : 139 - 147
  • [4] A neural network model of curiosity-driven infant categorization
    Twomey, Katherine E.
    Westermann, Gert
    5TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING AND ON EPIGENETIC ROBOTICS (ICDL-EPIROB), 2015, : 1 - 6
  • [5] Curiosity-driven method development
    Kaitlin McCardle
    Nature Computational Science, 2022, 2 : 542 - 544
  • [6] Curiosity-driven method development
    McCardle, Kaitlin
    Head-Gordon, Martin
    NATURE COMPUTATIONAL SCIENCE, 2022, 2 (09): : 542 - 544
  • [7] Securing UAV-to-Vehicle Communications: A Curiosity-Driven Deep Q-learning Network (C-DQN) Approach
    Fu, Fang
    Jiao, Qi
    Yu, F. Richard
    Zhang, Zhicai
    Du, Jianbo
    2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
  • [8] Encouraging Curiosity-Driven Science
    不详
    CHEMICAL ENGINEERING PROGRESS, 2014, 110 (06) : 20 - 21
  • [9] Curiosity-driven phonetic learning
    Moulin-Frier, Clement
    Oudeyer, Pierre-Yves
    2012 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING AND EPIGENETIC ROBOTICS (ICDL), 2012,
  • [10] Multi-objective virtual network embedding algorithm based on Q-learning and curiosity-driven
    Mengyang He
    Lei Zhuang
    Shuaikui Tian
    Guoqing Wang
    Kunli Zhang
    EURASIP Journal on Wireless Communications and Networking, 2018