Decomposed Multi-objective Method Based on Q-Learning for Solving Multi-objective Combinatorial Optimization Problem

被引：0

作者：

Yang, Anju ^{[1
]}

Liu, Yuan ^{[1
]}

Zou, Juan ^{[1
]}

Yang, Shengxiang ^{[2
]}

机构：

[1] Xiangtan Univ, Hunan Engn Res Ctr Intelligent Syst Optimizat & S, Xiangtan 411105, Peoples R China

[2] De Montfort Univ, Sch Comp Sci & Informat, Leicester LE1 9BH, Leics, England

来源：

BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS, PT 1, BIC-TA 2023 | 2024年 / 2061卷

基金：

中国国家自然科学基金;

关键词：

Reinforcement Learning; Q-learning; Temporal-Difference; Shared Q-table; Multi-objective Traveling Salesman Problem; ALGORITHM;

D O I：

10.1007/978-981-97-2272-3_5

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Neural combinatorial optimization has emerged as a promising technique for combinatorial optimization problems. However, the high representation of deep learning inevitably requires a lot of training overhead and computing resources, especially in large-scale decision making and multi-objective scenarios. This paper first provides a simple but efficient combinatorial optimization method that uses a traditional reinforcement learning (RL) paradigm to balance the computational cost and performance. We decompose the multi-objective problem into multiple scalar subproblems and only use the improved Q-learning for the sequential optimization of these subproblems. Our method employs the Temporal-Difference (TD) update strategy and provides a shared Q-table for all subproblems. The TD update strategy speeds up the optimization by learning while making decisions. The shared Q-table devotes a high-quality starting point to generate excellent solutions quickly for each subproblem. Both strategies promote the effectiveness and efficiency of the proposed method. After new solutions are generated, a selection operator keeps the historical optimal solution for each subproblem. We apply our method to various multi-objective traveling salesman problems involving up to 10 objectives and 200 decisions. Experiments demonstrate that only simple RL achieved comparable performance to state-of-the-art approaches.

引用

页码：59 / 73

页数：15

共 50 条

[11] Q-Learning Based Multi-objective Optimization Routing Strategy in UAVs Deterministic Network
Zhou, Zou
Chen, Longjie
Hu, Yu
Zheng, Fei
Liang, Caisheng
Li, Kelin
PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND NETWORKS, VOL II, CENET 2023, 2024, 1126 : 399 - 408
[12] An interactive heuristic method for multi-objective combinatorial optimization
Teghem, J
Tuyttens, D
Ulungu, EL
COMPUTERS & OPERATIONS RESEARCH, 2000, 27 (7-8) : 621 - 634
[13] A Distance Based Method for Solving Multi-Objective Optimization Problems
Kamal, Murshid
Jalil, Syed Aqib
Muneeb, Syed Mohd
Ali, Irfan
JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2018, 17 (01) : 2 - 23
[14] A New Method for Multi-Objective Optimization Problem
Jiang Hong
Yang Meng-fei
Zhang Shao-lin
Wang Ruo-chuan
2013 IEEE 4TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC), 2014, : 209 - 212
[15] Bidirectional Q-Learning based Multi-objective optimization Routing Protocol for Multi-Destination FANETs
Xue, Liang
Tang, Jie
Zhang, Jiaying
Hu, Juncheng
2024 IEEE INTERNATIONAL WORKSHOP ON RADIO FREQUENCY AND ANTENNA TECHNOLOGIES, IWRF&AT 2024, 2024, : 421 - 426
[16] Multi-objective chicken swarm optimization: A novel algorithm for solving multi-objective optimization problems
Zouache, Djaafar
Arby, Yahya Quid
Nouioua, Farid
Ben Abdelaziz, Fouad
COMPUTERS & INDUSTRIAL ENGINEERING, 2019, 129 : 377 - 391
[17] Multi-objective chemical reaction optimization based decomposition for multi-objective traveling salesman problem
Bouzoubia, Samira
Layeb, Abdesslem
Chikhi, Salim
PROCEEDINGS OF 2015 THIRD IEEE WORLD CONFERENCE ON COMPLEX SYSTEMS (WCCS), 2015,
[18] Solving Multi-Objective Portfolio Optimization Problem Based on MOEA/D
Zhao, Pengxiang
Gao, Shang
Yang, Nachuan
2020 12TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2020, : 30 - 37
[19] MOMPA: Multi-objective marine predator algorithm for solving multi-objective optimization problems
Jangir, Pradeep
Buch, Hitarth
Mirjalili, Seyedali
Manoharan, Premkumar
EVOLUTIONARY INTELLIGENCE, 2023, 16 (01) : 169 - 195
[20] A Multi-Objective Carnivorous Plant Algorithm for Solving Constrained Multi-Objective Optimization Problems
Yang, Yufei
Zhang, Changsheng
BIOMIMETICS, 2023, 8 (02)

← 1 2 3 4 5 →