Simultaneously Evolving Deep Reinforcement Learning Models using Multifactorial Optimization

被引：6

作者：

Martinez, Aritz D. ^{[1
]}

Osaba, Eneko ^{[1
]}

Del Ser, Javier ^{[1
,2
]}

Herrera, Francisco ^{[3
]}

机构：

[1] TECNALIA, Basque Res & Technol Alliance BRTA, Derio 48160, Bizkaia, Spain

[2] Univ Basque Country, Bilbao 48013, Bizkaia, Spain

[3] Univ Granada, DaSCI Andalusian Inst Data Sci & Computat Intelli, Granada 18071, Spain

来源：

2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC) | 2020年

关键词：

Multifactorial Optimization; Deep Reinforcement Learning; Transfer Learning; Evolutionary Algorithm;

D O I：

10.1109/cec48606.2020.9185667

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, Multifactorial Optimization (MFO) has gained a notable momentum in the research community. MFO is known for its inherent capability to efficiently address multiple optimization tasks at the same time, while transferring information among such tasks to improve their convergence speed. On the other hand, the quantum leap made by Deep Q Learning (DQL) in the Machine Learning field has allowed facing Reinforcement Learning (RL) problems of unprecedented complexity. Unfortunately, complex DQL models usually find it difficult to converge to optimal policies due to the lack of exploration or sparse rewards. In order to overcome these drawbacks, pre-trained models are widely harnessed via Transfer Learning, extrapolating knowledge acquired in a source task to the target task. Besides, meta-heuristic optimization has been shown to reduce the lack of exploration of DQL models. This work proposes a MFO framework capable of simultaneously evolving several DQL models towards solving interrelated RL tasks. Specifically, our proposed framework blends together the benefits of meta-heuristic optimization, Transfer Learning and DQL to automate the process of knowledge transfer and policy learning of distributed RL agents. A thorough experimentation is presented and discussed so as to assess the performance of the framework, its comparison to the traditional methodology for Transfer Learning in terms of convergence, speed and policy quality, and the intertask relationships found and exploited over the search process.

引用

页数：8

共 50 条

[21] Enhancing Deep Reinforcement Learning: A Tutorial on Generative Diffusion Models in Network Optimization
Du, Hongyang
Zhang, Ruichen
Liu, Yinqiu
Wang, Jiacheng
Lin, Yijing
Li, Zonghang
Niyato, Dusit
Kang, Jiawen
Xiong, Zehui
Cui, Shuguang
Ai, Bo
Zhou, Haibo
Kim, Dong In
IEEE Communications Surveys and Tutorials, 2024, 26 (04): : 2611 - 2646
[22] OSCAR: a Contention Window Optimization approach using Deep Reinforcement Learning
Grasso, Christian
Raftopoulos, Raoul
Schembra, Giovanni
ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 459 - 465
[23] Decoupling Optimization for Complex PDN Structures Using Deep Reinforcement Learning
Zhang, Ling
Jiang, Li
Juang, Jack
Yang, Zhiping
Li, Er-Ping
Hwang, Chulsoon
IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES, 2023, 71 (09) : 3773 - 3783
[24] Multi-echelon inventory optimization using deep reinforcement learning
Geevers, Kevin
van Hezewijk, Lotte
Mes, Martijn R. K.
CENTRAL EUROPEAN JOURNAL OF OPERATIONS RESEARCH, 2024, 32 (03) : 653 - 683
[25] PrefixRL: Optimization of Parallel Prefix Circuits using Deep Reinforcement Learning
Roy, Rajarshi
Raiman, Jonathan
Kant, Neel
Elkin, Ilyas
Kirby, Robert
Siu, Michael
Oberman, Stuart
Godil, Saad
Catanzaro, Bryan
2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 853 - 858
[26] Comparison of algorithms using deep reinforcement learning for optimization of hyperbolic metamaterials
Kenta Hamada
Hui-Hsin Hsiao
Wakana Kubo
Scientific Reports, 14 (1)
[27] Using Deep Reinforcement Learning with Hierarchical Risk Parity for Portfolio Optimization
Millea, Adrian
Edalat, Abbas
INTERNATIONAL JOURNAL OF FINANCIAL STUDIES, 2023, 11 (01):
[28] Cost Optimization at Early Stages of Design Using Deep Reinforcement Learning
Servadei, Lorenzo
Zheng, Jiapeng
Arjona-Medina, Jose
Werner, Michael
Esen, Volkan
Hochreiter, Sepp
Ecker, Wolfgang
Wille, Robert
PROCEEDINGS OF THE 2020 ACM/IEEE 2ND WORKSHOP ON MACHINE LEARNING FOR CAD (MLCAD '20), 2020, : 37 - 42
[29] Aircraft collision avoidance modeling and optimization using deep reinforcement learning
Park K.-W.
Kim J.-H.
Journal of Institute of Control, Robotics and Systems, 2021, 27 (09) : 652 - 659
[30] Multiobjective Offloading Optimization in Fog Computing Using Deep Reinforcement Learning
Mashal, Hojjat
Rezvani, Mohammad Hossein
Journal of Computer Networks and Communications, 2024, 2024

← 1 2 3 4 5 →