Decomposed Multi-objective Method Based on Q-Learning for Solving Multi-objective Combinatorial Optimization Problem

Cited by: 0
Authors
Yang, Anju [1]
Liu, Yuan [1]
Zou, Juan [1]
Yang, Shengxiang [2]
Affiliations
[1] Xiangtan Univ, Hunan Engn Res Ctr Intelligent Syst Optimizat & S, Xiangtan 411105, Peoples R China
[2] De Montfort Univ, Sch Comp Sci & Informat, Leicester LE1 9BH, Leics, England
Funding
National Natural Science Foundation of China;
Keywords
Reinforcement Learning; Q-learning; Temporal-Difference; Shared Q-table; Multi-objective Traveling Salesman Problem; ALGORITHM;
DOI
10.1007/978-981-97-2272-3_5
Chinese Library Classification number
TP39 [Computer applications];
Subject classification codes
081203 ; 0835 ;
Abstract
Neural combinatorial optimization has emerged as a promising technique for combinatorial optimization problems. However, the high representational capacity of deep learning comes at the price of substantial training overhead and computing resources, especially in large-scale decision making and multi-objective scenarios. This paper presents a simple but efficient combinatorial optimization method that uses a traditional reinforcement learning (RL) paradigm to balance computational cost and performance. We decompose the multi-objective problem into multiple scalar subproblems and use only an improved Q-learning to optimize these subproblems sequentially. Our method employs the Temporal-Difference (TD) update strategy and provides a shared Q-table for all subproblems. The TD update strategy speeds up the optimization by learning while making decisions. The shared Q-table gives each subproblem a high-quality starting point from which excellent solutions can be generated quickly. Both strategies improve the effectiveness and efficiency of the proposed method. After new solutions are generated, a selection operator keeps the best solution found so far for each subproblem. We apply our method to various multi-objective traveling salesman problems involving up to 10 objectives and 200 decision variables. Experiments demonstrate that this simple RL method achieves performance comparable to state-of-the-art approaches.
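The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea it describes, not the authors' implementation. It assumes a weighted-sum scalarization, an epsilon-greedy policy, a Q-table indexed by (current city, next city), and randomly generated symmetric cost matrices. All function names and parameter values (`make_instance`, `build_tour`, `solve`, `alpha`, `gamma`, `eps`) are hypothetical.

```python
import random


def make_instance(n_cities, n_objectives, seed=0):
    """Random symmetric cost matrices, one per objective (illustrative data only)."""
    rng = random.Random(seed)
    costs = []
    for _ in range(n_objectives):
        m = [[0.0] * n_cities for _ in range(n_cities)]
        for i in range(n_cities):
            for j in range(i + 1, n_cities):
                m[i][j] = m[j][i] = rng.random()
        costs.append(m)
    return costs


def tour_cost(tour, costs):
    """Objective vector of a closed tour: one total length per cost matrix."""
    n = len(tour)
    return [sum(m[tour[k]][tour[(k + 1) % n]] for k in range(n)) for m in costs]


def scalarize(objs, w):
    """Weighted-sum scalarization (an assumption; the abstract does not name one)."""
    return sum(wi * fi for wi, fi in zip(w, objs))


def build_tour(q, costs, w, rng, alpha=0.1, gamma=0.9, eps=0.1):
    """Construct one tour, applying a TD (Q-learning) update after every step."""
    current, tour = 0, [0]
    unvisited = set(range(1, len(q)))
    while unvisited:
        cand = sorted(unvisited)
        if rng.random() < eps:
            nxt = rng.choice(cand)                         # explore
        else:
            nxt = max(cand, key=lambda c: q[current][c])   # exploit
        # Reward is the negative scalarized cost of the chosen edge.
        reward = -sum(wi * m[current][nxt] for wi, m in zip(w, costs))
        unvisited.remove(nxt)
        if not unvisited:
            # Last step: also charge the edge that closes the tour.
            reward -= sum(wi * m[nxt][0] for wi, m in zip(w, costs))
        future = max((q[nxt][c] for c in unvisited), default=0.0)
        # Learn while deciding: update Q immediately, before the next move.
        q[current][nxt] += alpha * (reward + gamma * future - q[current][nxt])
        tour.append(nxt)
        current = nxt
    return tour


def solve(costs, weights, episodes=50, seed=0):
    """Optimize the subproblems one after another on a single shared Q-table."""
    rng = random.Random(seed)
    n = len(costs[0])
    q = [[0.0] * n for _ in range(n)]   # shared across all subproblems
    best = [None] * len(weights)
    for i, w in enumerate(weights):
        for _ in range(episodes):
            tour = build_tour(q, costs, w, rng)
            value = scalarize(tour_cost(tour, costs), w)
            # Selection: keep the best solution seen so far for subproblem i.
            if best[i] is None or value < best[i][0]:
                best[i] = (value, tour)
    return best
```

Calling `solve(make_instance(8, 2), [(1.0, 0.0), (0.5, 0.5), (0.0, 1.0)])` returns one `(scalar value, tour)` pair per weight vector. Because the Q-table is not reset between subproblems, each subproblem starts from the values learned for the previous ones, which is the role the abstract assigns to the shared Q-table.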
Pages: 59-73
Number of pages: 15
Related papers
50 results in total
  • [11] Q-Learning Based Multi-objective Optimization Routing Strategy in UAVs Deterministic Network
    Zhou, Zou
    Chen, Longjie
    Hu, Yu
    Zheng, Fei
    Liang, Caisheng
    Li, Kelin
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND NETWORKS, VOL II, CENET 2023, 2024, 1126 : 399 - 408
  • [12] An interactive heuristic method for multi-objective combinatorial optimization
    Teghem, J
    Tuyttens, D
    Ulungu, EL
    COMPUTERS & OPERATIONS RESEARCH, 2000, 27 (7-8) : 621 - 634
  • [13] A Distance Based Method for Solving Multi-Objective Optimization Problems
    Kamal, Murshid
    Jalil, Syed Aqib
    Muneeb, Syed Mohd
    Ali, Irfan
    JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2018, 17 (01) : 2 - 23
  • [14] A New Method for Multi-Objective Optimization Problem
    Jiang Hong
    Yang Meng-fei
    Zhang Shao-lin
    Wang Ruo-chuan
    2013 IEEE 4TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC), 2014, : 209 - 212
  • [15] Bidirectional Q-Learning based Multi-objective optimization Routing Protocol for Multi-Destination FANETs
    Xue, Liang
    Tang, Jie
    Zhang, Jiaying
    Hu, Juncheng
    2024 IEEE INTERNATIONAL WORKSHOP ON RADIO FREQUENCY AND ANTENNA TECHNOLOGIES, IWRF&AT 2024, 2024, : 421 - 426
  • [16] Multi-objective chicken swarm optimization: A novel algorithm for solving multi-objective optimization problems
    Zouache, Djaafar
    Arby, Yahya Quid
    Nouioua, Farid
    Ben Abdelaziz, Fouad
    COMPUTERS & INDUSTRIAL ENGINEERING, 2019, 129 : 377 - 391
  • [17] Multi-objective chemical reaction optimization based decomposition for multi-objective traveling salesman problem
    Bouzoubia, Samira
    Layeb, Abdesslem
    Chikhi, Salim
    PROCEEDINGS OF 2015 THIRD IEEE WORLD CONFERENCE ON COMPLEX SYSTEMS (WCCS), 2015,
  • [18] Solving Multi-Objective Portfolio Optimization Problem Based on MOEA/D
    Zhao, Pengxiang
    Gao, Shang
    Yang, Nachuan
    2020 12TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2020, : 30 - 37
  • [19] MOMPA: Multi-objective marine predator algorithm for solving multi-objective optimization problems
    Jangir, Pradeep
    Buch, Hitarth
    Mirjalili, Seyedali
    Manoharan, Premkumar
    EVOLUTIONARY INTELLIGENCE, 2023, 16 (01) : 169 - 195
  • [20] A Multi-Objective Carnivorous Plant Algorithm for Solving Constrained Multi-Objective Optimization Problems
    Yang, Yufei
    Zhang, Changsheng
    BIOMIMETICS, 2023, 8 (02)