Efficient Novelty Search Through Deep Reinforcement Learning

Cited by: 6
Authors
Shi, Longxiang [1]
Li, Shijian [1]
Zheng, Qian [2]
Yao, Min [1]
Pan, Gang [1]
Affiliations
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
Keywords
Reinforcement learning; novelty search; evolutionary computing; deep learning; NEURAL-NETWORKS;
DOI
10.1109/ACCESS.2020.3008735
Chinese Library Classification (CLC)
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Novelty search, inspired by the way nature evolves diverse creatures, has shown great potential for solving reinforcement learning (RL) tasks with sparse and deceptive rewards. However, most existing novelty search methods evolve their populations through hybridization and mutation, which is inefficient at diverging the populations. In this paper, we propose a method that incorporates deep RL into novelty search to improve the efficiency of diverging the populations. We first propose a strategy that uses reinforcement learning to improve the novelty of individuals generated by a genetic algorithm. Based on this strategy, we propose a framework that incorporates deep RL into novelty search, and then derive an algorithm that improves the search efficiency of novelty search for continuous control tasks. Our experimental results show that the method improves the search efficiency of novelty search and performs competitively with some existing novelty search methods. The implementation of our method is available at: https://github.com/shilx001/NoveltySearch_Improvement.
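The abstract does not define how novelty is measured. As a minimal sketch, the code below uses the formulation common in the novelty search literature: an individual's novelty is the mean distance from its behavior to its k nearest neighbors in an archive of previously seen behaviors. The function name `novelty_score`, the choice of Euclidean distance, the 2-D behavior descriptors, and the value of `k` are all assumptions made for illustration and are not taken from the paper or its repository.

```python
import math


def novelty_score(behavior, archive, k=3):
    """Mean Euclidean distance from `behavior` to its k nearest archive entries.

    Hypothetical helper: the paper's actual novelty measure may differ.
    """
    if not archive:
        raise ValueError("archive must contain at least one behavior")
    # Sort distances to every archived behavior, then keep the k smallest.
    distances = sorted(math.dist(behavior, other) for other in archive)
    nearest = distances[:k]
    return sum(nearest) / len(nearest)


# Illustrative archive of 2-D behavior descriptors (made-up values).
archive = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (5.0, 5.0)]

# A behavior in a crowded region scores low; one far from the archive scores high.
crowded = novelty_score((0.0, 0.0), archive[1:], k=2)
isolated = novelty_score((10.0, 10.0), archive, k=2)
```

Under this measure, individuals whose behavior lies far from everything in the archive are preferred during selection, which is the pressure toward diversity that the abstract refers to.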
Pages: 128809-128818
Page count: 10