Curiosity-Driven Variational Autoencoder for Deep Q Network

被引:1
|
作者
Han, Gao-Jie [1 ]
Zhang, Xiao-Fang [1 ]
Wang, Hao [1 ]
Mao, Chen-Guang [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Deep Q learning; Exploration; Variational autoencoder;
D O I
10.1007/978-3-030-47426-3_59
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, deep reinforcement learning (DRL) has achieved tremendous success in high-dimensional and large-scale space control and sequential decision-making tasks. However, the current model-free DRL methods suffer from low sample efficiency, which is a bottleneck that limits their performance. To alleviate this problem, some researchers used the generative model for modeling the environment. But the generative model may become inaccurate or even collapse if the state has not been sufficiently explored. In this paper, we introduce a model called Curiosity-driven Variational Autoencoder (CVAE), which combines variational autoencoder and curiosity-driven exploration. During the training process, the CVAE model can improve sample efficiency while curiosity-driven exploration can make sufficient exploration in a complex environment. Then, a CVAE-based algorithm is proposed, namely DQN-CVAE, that scales CVAE to higher dimensional environments. Finally, the performance of our algorithm is evaluated through several Atari 2600 games, and the experimental results show that the DQN-CVAE achieves better performance in terms of average reward per episode on these games.
引用
收藏
页码:764 / 775
页数:12
相关论文
共 50 条
  • [31] Autistic traits foster effective curiosity-driven exploration
    Poli, Francesco
    Koolen, Maran
    Velazquez-Vargas, Carlos A.
    Ramos-Sanchez, Jessica
    Meyer, Marlene
    Mars, Rogier B.
    Rommelse, Nanda
    Hunnius, Sabine
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (10)
  • [32] Humans monitor learning progress in curiosity-driven exploration
    Ten, Alexandr
    Kaushik, Pramod
    Oudeyer, Pierre-Yves
    Gottlieb, Jacqueline
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [33] Curiosity-Driven Learning of Joint Locomotion and Manipulation Tasks
    Schwarke, Clemens
    Klemm, Victor
    van der Boon, Matthijs
    Bjelonic, Marko
    Hutter, Marco
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [34] Curiosity-driven Exploration by Self-supervised Prediction
    Pathak, Deepak
    Agrawal, Pulkit
    Efros, Alexei A.
    Darrell, Trevor
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [35] Curiosity-Driven and Victim-Aware Adversarial Policies
    Gong, Chen
    Yang, Zhou
    Bai, Yunpeng
    Shi, Jieke
    Sinha, Arunesh
    Xu, Bowen
    Lo, David
    Hou, Xinwen
    Fan, Guoliang
    PROCEEDINGS OF THE 38TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, ACSAC 2022, 2022, : 186 - 200
  • [36] Curiosity-driven learning of traversability affordance on a mobile robot
    Ugur, Emre
    Dogar, Mehmet R.
    Cakmak, Maya
    Sahin, Erol
    2007 IEEE 6TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, 2007, : 128 - 133
  • [37] Verification of Applying Curiosity-Driven to Fighting Game AI
    Inoue, Hideyasu
    Takano, Yoshina
    Thawonmas, Ruck
    Harada, Tomohiro
    2019 NICOGRAPH INTERNATIONAL (NICOINT), 2019, : 119 - 119
  • [38] Modular Active Curiosity-Driven Discovery of Tool Use
    Forestier, Sebastien
    Oudeyer, Pierre-Yves
    2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 3965 - 3972
  • [39] Ontology enhancing process for a situated and curiosity-driven robot
    Rea, Francesco
    Nefti-Meziani, Samia
    Manzoor, Umar
    Davis, Steve
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2014, 62 (12) : 1837 - 1847
  • [40] Curiosity-driven Exploration by Self-supervised Prediction
    Pathak, Deepak
    Agrawal, Pulkit
    Efros, Alexei A.
    Darrell, Trevor
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 488 - 489