Curiosity-Driven Variational Autoencoder for Deep Q Network

被引:1
|
作者
Han, Gao-Jie [1 ]
Zhang, Xiao-Fang [1 ]
Wang, Hao [1 ]
Mao, Chen-Guang [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Deep Q learning; Exploration; Variational autoencoder;
D O I
10.1007/978-3-030-47426-3_59
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, deep reinforcement learning (DRL) has achieved tremendous success in high-dimensional and large-scale space control and sequential decision-making tasks. However, the current model-free DRL methods suffer from low sample efficiency, which is a bottleneck that limits their performance. To alleviate this problem, some researchers used the generative model for modeling the environment. But the generative model may become inaccurate or even collapse if the state has not been sufficiently explored. In this paper, we introduce a model called Curiosity-driven Variational Autoencoder (CVAE), which combines variational autoencoder and curiosity-driven exploration. During the training process, the CVAE model can improve sample efficiency while curiosity-driven exploration can make sufficient exploration in a complex environment. Then, a CVAE-based algorithm is proposed, namely DQN-CVAE, that scales CVAE to higher dimensional environments. Finally, the performance of our algorithm is evaluated through several Atari 2600 games, and the experimental results show that the DQN-CVAE achieves better performance in terms of average reward per episode on these games.
引用
收藏
页码:764 / 775
页数:12
相关论文
共 50 条
  • [41] Curiosity-driven exploration: foundations in neuroscience and computational modeling
    Modirshanechi, Alireza
    Kondrakiewicz, Kacper
    Gerstner, Wulfram
    Haesler, Sebastian
    TRENDS IN NEUROSCIENCES, 2023, 46 (12) : 1054 - 1066
  • [42] Curiosity-Driven Exploration via Latent Bayesian Surprise
    Mazzaglia, Pietro
    Catal, Ozan
    Verbelen, Tim
    Dhoedt, Bart
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7752 - 7760
  • [43] Humans monitor learning progress in curiosity-driven exploration
    Alexandr Ten
    Pramod Kaushik
    Pierre-Yves Oudeyer
    Jacqueline Gottlieb
    Nature Communications, 12
  • [44] Curiosity-Driven Salient Object Detection With Fragment Attention
    Wang, Zheng
    Wang, Pengzhi
    Han, Yahong
    Zhang, Xue
    Sun, Meijun
    Tian, Qi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 5989 - 6001
  • [45] Conversational agents for fostering curiosity-driven learning in children
    Abdelghani, Rania
    Oudeyer, Pierre-Yves
    Law, Edith
    de Vulpillieres, Catherine
    Sauzeon, Helene
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2022, 167
  • [46] Towards Hierarchical Curiosity-Driven Exploration of Sensorimotor Models
    Forestier, Sebastien
    Oudeyer, Pierre-Yves
    5TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING AND ON EPIGENETIC ROBOTICS (ICDL-EPIROB), 2015, : 234 - 235
  • [47] Leveraging ambient sensing for the estimation of curiosity-driven human crowd
    Das, Anirban
    Narayan, Kartik
    Chakraborty, Suchetana
    SYSCON 2022: THE 16TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE (SYSCON), 2022,
  • [48] Seeking Visual Discomfort: Curiosity-driven Representations for Reinforcement Learning
    Aljalbout, Elie
    Ulmer, Maximilian
    Triebel, Rudolph
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 3591 - 3597
  • [50] Optimal Curiosity-Driven Modular Incremental Slow Feature Analysis
    Kompella, Varun Raj
    Luciw, Matthew
    Stollenga, Marijn Frederik
    Schmidhuber, Juergen
    NEURAL COMPUTATION, 2016, 28 (08) : 1599 - 1662