Simultaneously Evolving Deep Reinforcement Learning Models using Multifactorial Optimization

被引:6
|
作者
Martinez, Aritz D. [1 ]
Osaba, Eneko [1 ]
Del Ser, Javier [1 ,2 ]
Herrera, Francisco [3 ]
机构
[1] TECNALIA, Basque Res & Technol Alliance BRTA, Derio 48160, Bizkaia, Spain
[2] Univ Basque Country, Bilbao 48013, Bizkaia, Spain
[3] Univ Granada, DaSCI Andalusian Inst Data Sci & Computat Intelli, Granada 18071, Spain
关键词
Multifactorial Optimization; Deep Reinforcement Learning; Transfer Learning; Evolutionary Algorithm;
D O I
10.1109/cec48606.2020.9185667
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, Multifactorial Optimization (MFO) has gained a notable momentum in the research community. MFO is known for its inherent capability to efficiently address multiple optimization tasks at the same time, while transferring information among such tasks to improve their convergence speed. On the other hand, the quantum leap made by Deep Q Learning (DQL) in the Machine Learning field has allowed facing Reinforcement Learning (RL) problems of unprecedented complexity. Unfortunately, complex DQL models usually find it difficult to converge to optimal policies due to the lack of exploration or sparse rewards. In order to overcome these drawbacks, pre-trained models are widely harnessed via Transfer Learning, extrapolating knowledge acquired in a source task to the target task. Besides, meta-heuristic optimization has been shown to reduce the lack of exploration of DQL models. This work proposes a MFO framework capable of simultaneously evolving several DQL models towards solving interrelated RL tasks. Specifically, our proposed framework blends together the benefits of meta-heuristic optimization, Transfer Learning and DQL to automate the process of knowledge transfer and policy learning of distributed RL agents. A thorough experimentation is presented and discussed so as to assess the performance of the framework, its comparison to the traditional methodology for Transfer Learning in terms of convergence, speed and policy quality, and the intertask relationships found and exploited over the search process.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Enhancing Deep Reinforcement Learning: A Tutorial on Generative Diffusion Models in Network Optimization
    Du, Hongyang
    Zhang, Ruichen
    Liu, Yinqiu
    Wang, Jiacheng
    Lin, Yijing
    Li, Zonghang
    Niyato, Dusit
    Kang, Jiawen
    Xiong, Zehui
    Cui, Shuguang
    Ai, Bo
    Zhou, Haibo
    Kim, Dong In
    IEEE Communications Surveys and Tutorials, 2024, 26 (04): : 2611 - 2646
  • [22] OSCAR: a Contention Window Optimization approach using Deep Reinforcement Learning
    Grasso, Christian
    Raftopoulos, Raoul
    Schembra, Giovanni
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 459 - 465
  • [23] Decoupling Optimization for Complex PDN Structures Using Deep Reinforcement Learning
    Zhang, Ling
    Jiang, Li
    Juang, Jack
    Yang, Zhiping
    Li, Er-Ping
    Hwang, Chulsoon
    IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES, 2023, 71 (09) : 3773 - 3783
  • [24] Multi-echelon inventory optimization using deep reinforcement learning
    Geevers, Kevin
    van Hezewijk, Lotte
    Mes, Martijn R. K.
    CENTRAL EUROPEAN JOURNAL OF OPERATIONS RESEARCH, 2024, 32 (03) : 653 - 683
  • [25] PrefixRL: Optimization of Parallel Prefix Circuits using Deep Reinforcement Learning
    Roy, Rajarshi
    Raiman, Jonathan
    Kant, Neel
    Elkin, Ilyas
    Kirby, Robert
    Siu, Michael
    Oberman, Stuart
    Godil, Saad
    Catanzaro, Bryan
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 853 - 858
  • [26] Comparison of algorithms using deep reinforcement learning for optimization of hyperbolic metamaterials
    Kenta Hamada
    Hui-Hsin Hsiao
    Wakana Kubo
    Scientific Reports, 14 (1)
  • [27] Using Deep Reinforcement Learning with Hierarchical Risk Parity for Portfolio Optimization
    Millea, Adrian
    Edalat, Abbas
    INTERNATIONAL JOURNAL OF FINANCIAL STUDIES, 2023, 11 (01):
  • [28] Cost Optimization at Early Stages of Design Using Deep Reinforcement Learning
    Servadei, Lorenzo
    Zheng, Jiapeng
    Arjona-Medina, Jose
    Werner, Michael
    Esen, Volkan
    Hochreiter, Sepp
    Ecker, Wolfgang
    Wille, Robert
    PROCEEDINGS OF THE 2020 ACM/IEEE 2ND WORKSHOP ON MACHINE LEARNING FOR CAD (MLCAD '20), 2020, : 37 - 42
  • [29] Aircraft collision avoidance modeling and optimization using deep reinforcement learning
    Park K.-W.
    Kim J.-H.
    Journal of Institute of Control, Robotics and Systems, 2021, 27 (09) : 652 - 659
  • [30] Multiobjective Offloading Optimization in Fog Computing Using Deep Reinforcement Learning
    Mashal, Hojjat
    Rezvani, Mohammad Hossein
    Journal of Computer Networks and Communications, 2024, 2024