Decomposed Multi-objective Method Based on Q-Learning for Solving Multi-objective Combinatorial Optimization Problem

Cited by: 0

Authors:

Yang, Anju [1]

Liu, Yuan [1]

Zou, Juan [1]

Yang, Shengxiang [2]

Affiliations:

[1] Xiangtan Univ, Hunan Engn Res Ctr Intelligent Syst Optimizat & S, Xiangtan 411105, Peoples R China

[2] De Montfort Univ, Sch Comp Sci & Informat, Leicester LE1 9BH, Leics, England

Source:

BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS, PT 1, BIC-TA 2023 | 2024 / Vol. 2061

Funding:

National Natural Science Foundation of China;

Keywords:

Reinforcement Learning; Q-learning; Temporal-Difference; Shared Q-table; Multi-objective Traveling Salesman Problem; ALGORITHM;

DOI:

10.1007/978-981-97-2272-3_5

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

Neural combinatorial optimization has emerged as a promising technique for combinatorial optimization problems. However, the high representational capacity of deep learning comes at the cost of substantial training overhead and computing resources, especially in large-scale decision-making and multi-objective scenarios. This paper provides a simple but efficient combinatorial optimization method that uses a traditional reinforcement learning (RL) paradigm to balance computational cost and performance. We decompose the multi-objective problem into multiple scalar subproblems and use only an improved Q-learning for the sequential optimization of these subproblems. Our method employs the Temporal-Difference (TD) update strategy and provides a shared Q-table for all subproblems. The TD update strategy speeds up the optimization by learning while making decisions. The shared Q-table gives each subproblem a high-quality starting point from which to generate excellent solutions quickly. Both strategies promote the effectiveness and efficiency of the proposed method. After new solutions are generated, a selection operator keeps the best solution found so far for each subproblem. We apply our method to various multi-objective traveling salesman problems involving up to 10 objectives and 200 decisions. Experiments demonstrate that simple RL alone achieves performance comparable to state-of-the-art approaches.
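
The pipeline the abstract outlines (decomposition into scalar subproblems, TD updates made while a tour is being built, one Q-table shared by all subproblems, and a selection step that keeps the best solution per subproblem) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The weighted-sum scalarization, the epsilon-greedy policy, the reward definition, the hyperparameter values, and the names `tour_costs` and `solve` are all assumptions made for illustration; the abstract does not specify the "improved" Q-learning variant.

```python
import random


def tour_costs(tour, dists):
    """Return the closed-tour length under each objective's distance matrix."""
    n = len(tour)
    return [sum(d[tour[i]][tour[(i + 1) % n]] for i in range(n)) for d in dists]


def solve(dists, weights, episodes=50, alpha=0.1, gamma=0.9, eps=0.1, seed=0):
    """Sequentially optimize weighted-sum subproblems with one shared Q-table.

    dists:   one distance matrix per objective (assumed input format)
    weights: one weight vector per scalar subproblem (assumed weighted sum)
    Returns a list of (scalar cost, tour), one entry per subproblem.
    """
    rng = random.Random(seed)
    n = len(dists[0])
    q = [[0.0] * n for _ in range(n)]  # shared table: q[city][next_city]
    best = [None] * len(weights)       # best solution so far per subproblem

    def edge(w, a, b):
        # Scalarized cost of travelling from city a to city b.
        return sum(wi * d[a][b] for wi, d in zip(w, dists))

    for _ in range(episodes):
        for k, w in enumerate(weights):
            tour, unvisited = [0], set(range(1, n))
            while unvisited:
                s = tour[-1]
                candidates = sorted(unvisited)
                if rng.random() < eps:   # explore
                    a = rng.choice(candidates)
                else:                    # exploit the shared table
                    a = max(candidates, key=lambda c: q[s][c])
                unvisited.remove(a)
                reward = -edge(w, s, a)
                # TD update applied immediately, while the tour is being built.
                if unvisited:
                    future = max(q[a][c] for c in unvisited)
                else:
                    future = -edge(w, a, 0)  # cost of closing the tour
                q[s][a] += alpha * (reward + gamma * future - q[s][a])
                tour.append(a)
            cost = sum(wi * ci for wi, ci in zip(w, tour_costs(tour, dists)))
            # Selection: keep the historically best tour for this subproblem.
            if best[k] is None or cost < best[k][0]:
                best[k] = (cost, tour)
    return best


if __name__ == "__main__":
    gen = random.Random(1)
    n = 8
    # Two objectives, each with its own random distance matrix.
    dists = [[[0 if i == j else gen.randint(1, 100) for j in range(n)]
              for i in range(n)] for _ in range(2)]
    weights = [(1.0, 0.0), (0.5, 0.5), (0.0, 1.0)]
    for cost, tour in solve(dists, weights):
        print(round(cost, 1), tour)
```

Because every subproblem reads from and writes to the same table, a subproblem visited later starts from values already shaped by its predecessors, which is the "high-quality starting point" idea in the abstract. How well that transfer works when the weight vectors differ strongly is not something this sketch evaluates.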


Pages: 59-73

Page count: 15
