Decomposed Multi-objective Method Based on Q-Learning for Solving Multi-objective Combinatorial Optimization Problem

被引:0
|
作者
Yang, Anju [1 ]
Liu, Yuan [1 ]
Zou, Juan [1 ]
Yang, Shengxiang [2 ]
机构
[1] Xiangtan Univ, Hunan Engn Res Ctr Intelligent Syst Optimizat & S, Xiangtan 411105, Peoples R China
[2] De Montfort Univ, Sch Comp Sci & Informat, Leicester LE1 9BH, Leics, England
基金
中国国家自然科学基金;
关键词
Reinforcement Learning; Q-learning; Temporal-Difference; Shared Q-table; Multi-objective Traveling Salesman Problem; ALGORITHM;
D O I
10.1007/978-981-97-2272-3_5
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Neural combinatorial optimization has emerged as a promising technique for combinatorial optimization problems. However, the high representation of deep learning inevitably requires a lot of training overhead and computing resources, especially in large-scale decision making and multi-objective scenarios. This paper first provides a simple but efficient combinatorial optimization method that uses a traditional reinforcement learning (RL) paradigm to balance the computational cost and performance. We decompose the multi-objective problem into multiple scalar subproblems and only use the improved Q-learning for the sequential optimization of these subproblems. Our method employs the Temporal-Difference (TD) update strategy and provides a shared Q-table for all subproblems. The TD update strategy speeds up the optimization by learning while making decisions. The shared Q-table devotes a high-quality starting point to generate excellent solutions quickly for each subproblem. Both strategies promote the effectiveness and efficiency of the proposed method. After new solutions are generated, a selection operator keeps the historical optimal solution for each subproblem. We apply our method to various multi-objective traveling salesman problems involving up to 10 objectives and 200 decisions. Experiments demonstrate that only simple RL achieved comparable performance to state-of-the-art approaches.
引用
收藏
页码:59 / 73
页数:15
相关论文
共 50 条
  • [1] Multi-objective route recommendation method based on Q-learning algorithm
    Yu, Qingying
    Xiao, Zhenxing
    Yang, Feng
    Gong, Shan
    Shi, Gege
    Chen, Chuanming
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (04) : 7009 - 7025
  • [2] Deep reinforcement learning for multi-objective combinatorial optimization: A case study on multi-objective traveling salesman problem
    Li, Shicheng
    Wang, Feng
    He, Qi
    Wang, Xujie
    SWARM AND EVOLUTIONARY COMPUTATION, 2023, 83
  • [3] A method for solving the multi-objective transit frequency optimization problem
    Giesen, Ricardo
    Martinez, Hector
    Mauttone, Antonio
    Urquhart, Maria E.
    JOURNAL OF ADVANCED TRANSPORTATION, 2016, 50 (08) : 2323 - 2337
  • [4] A Novel Multi-Objective Deep Q-Network: Addressing Immediate and Delayed Rewards in Multi-Objective Q-Learning
    Zhang, Youming
    IEEE ACCESS, 2024, 12 : 144932 - 144949
  • [5] Multi-Objective Factored Evolutionary Optimization and the Multi-Objective Knapsack Problem
    Peerlinck, Amy
    Sheppard, John
    2022 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2022,
  • [6] Inverse multi-objective combinatorial optimization
    Roland, Julien
    De Smet, Yves
    Figueira, Jose Rui
    DISCRETE APPLIED MATHEMATICS, 2013, 161 (16-17) : 2764 - 2771
  • [7] Evaluation of an effective solving method based on cooperative multi-objective differential evolution for multi-objective optimization problems
    Matsuzaki, Yusuke
    Matsuura, Takafumi
    Kimura, Takayuki
    IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2024, 15 (02): : 404 - 420
  • [8] A Decomposition based Memetic Multi-objective Algorithm for Continuous Multi-objective Optimization Problem
    Wang, Na
    Wang, Hongfeng
    Fu, Yaping
    Wang, Lingwei
    2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 896 - 900
  • [9] Multi-objective Jaya Algorithm for Solving Constrained Multi-objective Optimization Problems
    Naidu, Y. Ramu
    Ojha, A. K.
    Devi, V. Susheela
    ADVANCES IN HARMONY SEARCH, SOFT COMPUTING AND APPLICATIONS, 2020, 1063 : 89 - 98
  • [10] Multi-objective optimization of radiotherapy: distributed Q-learning and agent-based simulation
    Jalalimanesh, Ammar
    Haghighi, Hamidreza Shahabi
    Ahmadi, Abbas
    Hejazian, Hossein
    Soltani, Madjid
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2017, 29 (05) : 1071 - 1086