A new distributed reinforcement learning algorithm for multiple objective optimization problems

被引:0
|
作者
Mariano, C
Morales, E
机构
[1] Inst Mexicano Tecnol Agua, Jiutepec 62550, Morelos, Mexico
[2] ITESM, Temixco 62589, Morelos, Mexico
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a new algorithm, called MDQL, for the solution of multiple objective optimization problems. MDQL is based on a new distributed Q-learning algorithm, called DQL, which is also introduced in this paper. In DQL a family of independent agents, exploring different options, finds a common policy in a common environment. Information about action goodness is transmitted using traces over state-action pairs. MDQL extends this idea to multiple objectives, assigning a family of agents for each objective involved. A non-dominant criterion is used to construct Pareto fronts and by delaying adjustments on the rewards MDQL achieves better distributions of solutions. Furthermore, an extension for applying reinforcement learning to continuous functions is also given. Successful results of MDQL on several test-bed problems suggested in the literature are described.
引用
收藏
页码:290 / 299
页数:10
相关论文
共 50 条
  • [31] A Distributed Multiple Populations Framework for Evolutionary Algorithm in Solving Dynamic Optimization Problems
    Luo, Xiong-Wen
    Wang, Zi-Jia
    Guan, Ren-Chu
    Zhan, Zhi-Hui
    Gao, Ying
    IEEE ACCESS, 2019, 7 : 44372 - 44390
  • [32] A Distributed Bi-Behaviors Crow Search Algorithm for Dynamic Multi-Objective Optimization and Many-Objective Optimization Problems
    Aboud, Ahlem
    Rokbani, Nizar
    Neji, Bilel
    Al Barakeh, Zaher
    Mirjalili, Seyedali
    Alimi, Adel M.
    APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [33] Multi-objective Optimization Service Function Chain Placement Algorithm Based on Reinforcement Learning
    Hongtai Liu
    Shengduo Ding
    Shunyi Wang
    Gang Zhao
    Chao Wang
    Journal of Network and Systems Management, 2022, 30
  • [34] Multi-objective Optimization Service Function Chain Placement Algorithm Based on Reinforcement Learning
    Liu, Hongtai
    Ding, Shengduo
    Wang, Shunyi
    Zhao, Gang
    Wang, Chao
    JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2022, 30 (04)
  • [35] Analytical adaptive distributed multi-objective optimization algorithm for optimal power flow problems
    Yin, Linfei
    Wang, Tao
    Zheng, Baomin
    ENERGY, 2021, 216
  • [36] Distributed reinforcement learning strategy for multi-objective optimization of fed-batch fermentation process
    Li D.
    Song T.
    Jin Q.
    Tan T.
    Huagong Xuebao/CIESC Journal, 2011, 62 (08): : 2243 - 2247
  • [37] A multi-start threshold accepting algorithm for multiple objective continuous optimization problems
    Dhouib, Souhail
    Kharrat, Aida
    Chabchoub, Habib
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 2010, 83 (11) : 1498 - 1517
  • [38] A new VPS-based algorithm for multi-objective optimization problems
    A. Kaveh
    M. Ilchi Ghazaan
    Engineering with Computers, 2020, 36 : 1029 - 1040
  • [39] New multi-objective genetic algorithm for nonlinear constrained optimization problems
    Liu, Chun-an
    2007 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS, VOLS 1-6, 2007, : 118 - 120
  • [40] A New Evolutionary Algorithm Based on Decomposition for Multi-objective Optimization Problems
    Dai, Cai
    Lei, Xiujuan
    PROCEEDINGS OF 2016 12TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2016, : 33 - 38