Efficient Novelty Search Through Deep Reinforcement Learning

被引：6

作者：

Shi, Longxiang ^{[1
]}

Li, Shijian ^{[1
]}

Zheng, Qian ^{[2
]}

Yao, Min ^{[1
]}

Pan, Gang ^{[1
]}

机构：

[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China

[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore

来源：

IEEE ACCESS | 2020年 / 8卷

关键词：

Reinforcement learning; novelty search; evolutionary computing; deep learning; NEURAL-NETWORKS;

D O I：

10.1109/ACCESS.2020.3008735

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Novelty search, which was inspired by the nature that evolves creatures with diversity, has shown great potential in solving reinforcement learning (RL) tasks with sparse and deceptive rewards. However, most of the existing novelty search methods evolve the populations through hybrization and mutation, which is inefficient in diverging populations. In this paper, we propose a method which incorporates deep RL with novelty search to improve the efficiency of diverging the populations for novelty search. We first propose a strategy that improves the novelty of individuals generated by genetic algorithm using reinforcement learning. Based on this strategy, we propose a framework that incorporates deep RL with novelty search, and then derive an algorithm to improve the search efficiency of the novelty search for continuous control tasks. Our experimental results show that our method can improve the search efficiency of novelty search and can also provide a competitive performance compared to some of the existing novelty search methods. The implementation of our method is available at: https://github.com/shilx001/NoveltySearch_Improvement.

引用

页码：128809 / 128818

页数：10

共 50 条

[21] Using deep reinforcement learning to search reachability properties in systems specified through graph transformation
Mehrabi, Mohammad Javad
Rafe, Vahid
SOFT COMPUTING, 2022, 26 (18) : 9635 - 9663
[22] Using deep reinforcement learning to search reachability properties in systems specified through graph transformation
Mohammad Javad Mehrabi
Vahid Rafe
Soft Computing, 2022, 26 : 9635 - 9663
[23] Air conditioner component optimum operation point search through a deep reinforcement learning algorithm
Yoon, Myung-Sup
Yoon, Won-Sik
Seo, Myung-Kyo
Ryu, Seung-Yup
Lee, Jong-Seok
2020 20TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2020, : 365 - 372
[24] A novelty-search-based evolutionary reinforcement learning algorithm for continuous optimization problems
Chengyu Hu
Rui Qiao
Wenyin Gong
Xuesong Yan
Ling Wang
Memetic Computing, 2022, 14 : 451 - 460
[25] A novelty-search-based evolutionary reinforcement learning algorithm for continuous optimization problems
Hu, Chengyu
Qiao, Rui
Gong, Wenyin
Yan, Xuesong
Wang, Ling
MEMETIC COMPUTING, 2022, 14 (04) : 451 - 460
[26] PNS: Population-Guided Novelty Search for Reinforcement Learning in Hard Exploration Environments
Liu, Qihao
Wang, Yujia
Liu, Xiaofeng
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 5627 - 5634
[27] Learn to Steer through Deep Reinforcement Learning
Wu, Keyu
Esfahani, Mahdi Abolfazli
Yuan, Shenghai
Wang, Han
SENSORS, 2018, 18 (11)
[28] Autonomous exploration through deep reinforcement learning
Yan, Xiangda
Huang, Jie
He, Keyan
Hong, Huajie
Xu, Dasheng
INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2023, 50 (05): : 793 - 803
[29] Efficient reinforcement learning through symbiotic evolution
Moriarty, DE
Miikkulainen, R
MACHINE LEARNING, 1996, 22 (1-3) : 11 - 32
[30] Efficient Distributed Reinforcement Learning through Agreement
Varshavskaya, Paulina
Kaelbling, Leslie Pack
Rus, Daniela
DISTRIBUTED AUTONOMOUS ROBOTIC SYSTEMS 8, 2009, : 367 - 378

← 1 2 3 4 5 →