Automated State Feature Learning for Actor-Critic Reinforcement Learning through NEAT

被引：2

作者：

Peng, Yiming ^{[1
]}

Chen, Gang ^{[1
]}

Holdaway, Scott ^{[1
]}

Mei, Yi ^{[1
]}

Zhang, Mengjie ^{[1
]}

机构：

[1] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington, New Zealand

来源：

PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCO'17 COMPANION) | 2017年

关键词：

NeuroEvolution; NEAT; Actor-Critic; Reinforcement Learning; Feature Extraction; Feature Learning;

D O I：

10.1145/3067695.3076035

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Actor-Critic (AC) algorithms are important approaches to solving sophisticated reinforcement learning problems. However, the learning performance of these algorithms rely heavily on good state features that are often designed manually. To address this issue, we propose to adopt an evolutionary approach based on NeuroEvolution of Augmenting Topology (NEAT) to automatically evolve neural networks that directly transform the raw environmental inputs into state features. Following this idea, we have successfully developed a new algorithm called NEAT+AC which combines Regular-gradient Actor-Critic (RAC) with NEAT. It can simultaneously learn suitable state features as well as good policies that are expected to significantly improve the reinforcement learning performance. Preliminary experiments on two benchmark problems confirm that our new algorithm can clearly outperform the baseline algorithm, i.e., NEAT.

引用

页码：135 / 136

页数：2

共 50 条

[21] Deep Actor-Critic Reinforcement Learning for Anomaly Detection
Zhong, Chen
Gursoy, M. Cenk
Velipasalar, Senem
[J]. 2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
[22] MARS: Malleable Actor-Critic Reinforcement Learning Scheduler
Baheri, Betis
Tronge, Jacob
Fang, Bo
Li, Ang
Chaudhary, Vipin
Guan, Qiang
[J]. 2022 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE, IPCCC, 2022,
[23] Averaged Soft Actor-Critic for Deep Reinforcement Learning
Ding, Feng
Ma, Guanfeng
Chen, Zhikui
Gao, Jing
Li, Peng
[J]. COMPLEXITY, 2021, 2021
[24] Dynamic spectrum access and sharing through actor-critic deep reinforcement learning
Liang Dong
Yuchen Qian
Yuan Xing
[J]. EURASIP Journal on Wireless Communications and Networking, 2022
[25] Dynamic spectrum access and sharing through actor-critic deep reinforcement learning
Dong, Liang
Qian, Yuchen
Xing, Yuan
[J]. EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2022, 2022 (01)
[26] Forward Actor-Critic for Nonlinear Function Approximation in Reinforcement Learning
Veeriah, Vivek
van Seijen, Harm
Sutton, Richard S.
[J]. AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 556 - 564
[27] An Actor-Critic Hierarchical Reinforcement Learning Model for Course Recommendation
Liang, Kun
Zhang, Guoqiang
Guo, Jinhui
Li, Wentao
[J]. ELECTRONICS, 2023, 12 (24)
[28] ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR DYNAMIC MULTICHANNEL ACCESS
Zhong, Chen
Lu, Ziyang
Gursoy, M. Cenk
Velipasalar, Senem
[J]. 2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 599 - 603
[29] THE APPLICATION OF ACTOR-CRITIC REINFORCEMENT LEARNING FOR FAB DISPATCHING SCHEDULING
Kim, Namyong
Shin, IIayong
[J]. 2017 WINTER SIMULATION CONFERENCE (WSC), 2017, : 4570 - 4571
[30] Swarm Reinforcement Learning Method Based on an Actor-Critic Method
Iima, Hitoshi
Kuroe, Yasuaki
[J]. SIMULATED EVOLUTION AND LEARNING, 2010, 6457 : 279 - 288

← 1 2 3 4 5 →