Decomposed Multi-objective Method Based on Q-Learning for Solving Multi-objective Combinatorial Optimization Problem

被引：0

作者：

Yang, Anju ^{[1
]}

Liu, Yuan ^{[1
]}

Zou, Juan ^{[1
]}

Yang, Shengxiang ^{[2
]}

机构：

[1] Xiangtan Univ, Hunan Engn Res Ctr Intelligent Syst Optimizat & S, Xiangtan 411105, Peoples R China

[2] De Montfort Univ, Sch Comp Sci & Informat, Leicester LE1 9BH, Leics, England

来源：

BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS, PT 1, BIC-TA 2023 | 2024年 / 2061卷

基金：

中国国家自然科学基金;

关键词：

Reinforcement Learning; Q-learning; Temporal-Difference; Shared Q-table; Multi-objective Traveling Salesman Problem; ALGORITHM;

D O I：

10.1007/978-981-97-2272-3_5

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Neural combinatorial optimization has emerged as a promising technique for combinatorial optimization problems. However, the high representation of deep learning inevitably requires a lot of training overhead and computing resources, especially in large-scale decision making and multi-objective scenarios. This paper first provides a simple but efficient combinatorial optimization method that uses a traditional reinforcement learning (RL) paradigm to balance the computational cost and performance. We decompose the multi-objective problem into multiple scalar subproblems and only use the improved Q-learning for the sequential optimization of these subproblems. Our method employs the Temporal-Difference (TD) update strategy and provides a shared Q-table for all subproblems. The TD update strategy speeds up the optimization by learning while making decisions. The shared Q-table devotes a high-quality starting point to generate excellent solutions quickly for each subproblem. Both strategies promote the effectiveness and efficiency of the proposed method. After new solutions are generated, a selection operator keeps the historical optimal solution for each subproblem. We apply our method to various multi-objective traveling salesman problems involving up to 10 objectives and 200 decisions. Experiments demonstrate that only simple RL achieved comparable performance to state-of-the-art approaches.

引用

页码：59 / 73

页数：15

共 50 条

[31] MOCOVIDOA: a novel multi-objective coronavirus disease optimization algorithm for solving multi-objective optimization problems
Asmaa M. Khalid
Hanaa M. Hamza
Seyedali Mirjalili
Khaid M. Hosny
Neural Computing and Applications, 2023, 35 : 17319 - 17347
[32] A novel immune dominance selection multi-objective optimization algorithm for solving multi-objective optimization problems
Xiao, Jin-ke
Li, Wei-min
Xiao, Xin-rong
Cheng-zhong, L., V
APPLIED INTELLIGENCE, 2017, 46 (03) : 739 - 755
[33] Multi-objective sand cat swarm optimization based on adaptive clustering for solving multimodal multi-objective optimization problems
Niu, Yanbiao
Yan, Xuefeng
Zeng, Weiping
Wang, Yongzhen
Niu, Yanzhao
MATHEMATICS AND COMPUTERS IN SIMULATION, 2025, 227 : 391 - 404
[34] A novel immune dominance selection multi-objective optimization algorithm for solving multi-objective optimization problems
Jin-ke Xiao
Wei-min Li
Xin-rong Xiao
Cheng-zhong LV
Applied Intelligence, 2017, 46 : 739 - 755
[35] MULTI-OBJECTIVE COMBINATORIAL OPTIMIZATION DESIGN METHOD FOR THE COMPRESSOR SPLITTER
Gao, Limin
Deng, Xiaoming
Gao, Lei
Li, Ruiyu
Zeng, Ruihui
Liu, Cunliang
ASME TURBO EXPO: TURBINE TECHNICAL CONFERENCE AND EXPOSITION, 2015, VOL 2C, 2015,
[36] A Bi-population Multi-objective Algorithm for Continuous Multi-objective Optimization Problem
Chen, Lili
Wang, Hongfeng
PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, : 4830 - 4833
[37] Cognitive networks QoS multi-objective strategy based on Q-learning algorithm
Wang, B. (wangbowx@163.com), 1600, Advanced Institute of Convergence Information Technology, Myoungbo Bldg 3F,, Bumin-dong 1-ga, Seo-gu, Busan, 602-816, Korea, Republic of (07):
[38] Glowworm swarm optimization algorithm for solving multi-objective optimization problem
He Deng-xu
Liu Gui-qing
Zhu Hua-zheng
2013 9TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2013, : 11 - 15
[39] Solving Multi-Objective Energy Management of a DC Microgrid using Multi-Objective Multiverse Optimization
Lagouir, Marouane
Badri, Abdelmajid
Sayouti, Yassine
INTERNATIONAL JOURNAL OF RENEWABLE ENERGY DEVELOPMENT-IJRED, 2021, 10 (04): : 911 - 922
[40] A novel ε-dominance multi-objective evolutionary algorithms for solving DRS multi-objective optimization problems
Liu, Liu
Li, Minqiang
Lin, Dan
ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2007, : 96 - +

← 1 2 3 4 5 →