Simultaneously Evolving Deep Reinforcement Learning Models using Multifactorial Optimization

被引:6
|
作者
Martinez, Aritz D. [1 ]
Osaba, Eneko [1 ]
Del Ser, Javier [1 ,2 ]
Herrera, Francisco [3 ]
机构
[1] TECNALIA, Basque Res & Technol Alliance BRTA, Derio 48160, Bizkaia, Spain
[2] Univ Basque Country, Bilbao 48013, Bizkaia, Spain
[3] Univ Granada, DaSCI Andalusian Inst Data Sci & Computat Intelli, Granada 18071, Spain
关键词
Multifactorial Optimization; Deep Reinforcement Learning; Transfer Learning; Evolutionary Algorithm;
D O I
10.1109/cec48606.2020.9185667
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, Multifactorial Optimization (MFO) has gained a notable momentum in the research community. MFO is known for its inherent capability to efficiently address multiple optimization tasks at the same time, while transferring information among such tasks to improve their convergence speed. On the other hand, the quantum leap made by Deep Q Learning (DQL) in the Machine Learning field has allowed facing Reinforcement Learning (RL) problems of unprecedented complexity. Unfortunately, complex DQL models usually find it difficult to converge to optimal policies due to the lack of exploration or sparse rewards. In order to overcome these drawbacks, pre-trained models are widely harnessed via Transfer Learning, extrapolating knowledge acquired in a source task to the target task. Besides, meta-heuristic optimization has been shown to reduce the lack of exploration of DQL models. This work proposes a MFO framework capable of simultaneously evolving several DQL models towards solving interrelated RL tasks. Specifically, our proposed framework blends together the benefits of meta-heuristic optimization, Transfer Learning and DQL to automate the process of knowledge transfer and policy learning of distributed RL agents. A thorough experimentation is presented and discussed so as to assess the performance of the framework, its comparison to the traditional methodology for Transfer Learning in terms of convergence, speed and policy quality, and the intertask relationships found and exploited over the search process.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Adaptive Multifactorial Evolutionary Optimization for Multitask Reinforcement Learning
    Martinez, Aritz D.
    Del Ser, Javier
    Osaba, Eneko
    Herrera, Francisco
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2022, 26 (02) : 233 - 247
  • [2] Framework for design optimization using deep reinforcement learning
    Yonekura, Kazuo
    Hattori, Hitoshi
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2019, 60 (04) : 1709 - 1713
  • [3] Framework for design optimization using deep reinforcement learning
    Kazuo Yonekura
    Hitoshi Hattori
    Structural and Multidisciplinary Optimization, 2019, 60 : 1709 - 1713
  • [4] Fairness Testing of Machine Learning Models Using Deep Reinforcement Learning
    Xie, Wentao
    Wu, Peng
    2020 IEEE 19TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2020), 2020, : 121 - 128
  • [5] Deep Reinforcement Learning using Genetic Algorithm for Parameter Optimization
    Sehgal, Adarsh
    Hung Manh La
    Louis, Sushil J.
    Hai Nguyen
    2019 THIRD IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2019), 2019, : 596 - 601
  • [6] Optimization of Apparel Supply Chain Using Deep Reinforcement Learning
    Chong, Ji Won
    Kim, Wooju
    Hong, Jun Seok
    IEEE ACCESS, 2022, 10 : 100367 - 100375
  • [7] Improved Channel Equalization using Deep Reinforcement Learning and Optimization
    Katwal, Swati
    Bhatia, Vinay
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2022, 9 (35):
  • [8] Interterminal Truck Routing Optimization Using Deep Reinforcement Learning
    Adi, Taufik Nur
    Iskandar, Yelita Anggiane
    Bae, Hyerim
    SENSORS, 2020, 20 (20) : 1 - 20
  • [9] Fluid dynamic control and optimization using deep reinforcement learning
    Innyoung Kim
    Donghyun You
    JMST Advances, 2024, 6 (1) : 61 - 65
  • [10] VLSI Placement Parameter Optimization using Deep Reinforcement Learning
    Agnesina, Anthony
    Chang, Kyungwook
    Lim, Sung Kyu
    2020 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED-DESIGN (ICCAD), 2020,