A new distributed reinforcement learning algorithm for multiple objective optimization problems

Citations: 0

Authors:

Mariano, C

Morales, E

Affiliations:

[1] Inst Mexicano Tecnol Agua, Jiutepec 62550, Morelos, Mexico

[2] ITESM, Temixco 62589, Morelos, Mexico

Source:

ADVANCES IN ARTIFICIAL INTELLIGENCE | 2000 / Vol. 1952

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper describes a new algorithm, called MDQL, for the solution of multiple objective optimization problems. MDQL is based on a new distributed Q-learning algorithm, called DQL, which is also introduced in this paper. In DQL, a family of independent agents, exploring different options, finds a common policy in a common environment. Information about action goodness is transmitted using traces over state-action pairs. MDQL extends this idea to multiple objectives, assigning a family of agents to each objective involved. A non-dominance criterion is used to construct Pareto fronts, and by delaying adjustments to the rewards, MDQL achieves better distributions of solutions. Furthermore, an extension for applying reinforcement learning to continuous functions is also given. Successful results of MDQL on several test-bed problems suggested in the literature are described.
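
The abstract mentions that a non-dominance criterion is used to construct Pareto fronts. As a rough illustration of that general idea only (not the paper's MDQL procedure, whose details are not given here), the following minimal Python sketch filters a set of candidate solutions down to its non-dominated subset. It assumes all objectives are minimized; the names `dominates` and `pareto_front` and the sample points are hypothetical.

```python
from typing import List, Sequence, Tuple


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if `a` dominates `b`, assuming every objective is minimized.

    `a` dominates `b` when it is no worse on all objectives and strictly
    better on at least one.
    """
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def pareto_front(points: List[Tuple[float, ...]]) -> List[Tuple[float, ...]]:
    """Keep only the points that no other point dominates (O(n^2) scan)."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


# Hypothetical objective vectors (f1, f2) for five candidate solutions.
candidates = [(1.0, 5.0), (2.0, 3.0), (3.0, 4.0), (4.0, 1.0), (5.0, 5.0)]
front = pareto_front(candidates)
print(front)
```

Here `(3.0, 4.0)` is dropped because `(2.0, 3.0)` is better on both objectives, and `(5.0, 5.0)` is dropped because `(1.0, 5.0)` is better on the first objective and equal on the second. The quadratic scan is chosen for clarity rather than speed.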

Pages: 290-299

Page count: 10

