Decomposed Multi-objective Method Based on Q-Learning for Solving Multi-objective Combinatorial Optimization Problem

被引:0
|
作者
Yang, Anju [1 ]
Liu, Yuan [1 ]
Zou, Juan [1 ]
Yang, Shengxiang [2 ]
机构
[1] Xiangtan Univ, Hunan Engn Res Ctr Intelligent Syst Optimizat & S, Xiangtan 411105, Peoples R China
[2] De Montfort Univ, Sch Comp Sci & Informat, Leicester LE1 9BH, Leics, England
基金
中国国家自然科学基金;
关键词
Reinforcement Learning; Q-learning; Temporal-Difference; Shared Q-table; Multi-objective Traveling Salesman Problem; ALGORITHM;
D O I
10.1007/978-981-97-2272-3_5
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Neural combinatorial optimization has emerged as a promising technique for combinatorial optimization problems. However, the high representation of deep learning inevitably requires a lot of training overhead and computing resources, especially in large-scale decision making and multi-objective scenarios. This paper first provides a simple but efficient combinatorial optimization method that uses a traditional reinforcement learning (RL) paradigm to balance the computational cost and performance. We decompose the multi-objective problem into multiple scalar subproblems and only use the improved Q-learning for the sequential optimization of these subproblems. Our method employs the Temporal-Difference (TD) update strategy and provides a shared Q-table for all subproblems. The TD update strategy speeds up the optimization by learning while making decisions. The shared Q-table devotes a high-quality starting point to generate excellent solutions quickly for each subproblem. Both strategies promote the effectiveness and efficiency of the proposed method. After new solutions are generated, a selection operator keeps the historical optimal solution for each subproblem. We apply our method to various multi-objective traveling salesman problems involving up to 10 objectives and 200 decisions. Experiments demonstrate that only simple RL achieved comparable performance to state-of-the-art approaches.
引用
收藏
页码:59 / 73
页数:15
相关论文
共 50 条
  • [31] MOCOVIDOA: a novel multi-objective coronavirus disease optimization algorithm for solving multi-objective optimization problems
    Asmaa M. Khalid
    Hanaa M. Hamza
    Seyedali Mirjalili
    Khaid M. Hosny
    Neural Computing and Applications, 2023, 35 : 17319 - 17347
  • [32] A novel immune dominance selection multi-objective optimization algorithm for solving multi-objective optimization problems
    Xiao, Jin-ke
    Li, Wei-min
    Xiao, Xin-rong
    Cheng-zhong, L., V
    APPLIED INTELLIGENCE, 2017, 46 (03) : 739 - 755
  • [33] Multi-objective sand cat swarm optimization based on adaptive clustering for solving multimodal multi-objective optimization problems
    Niu, Yanbiao
    Yan, Xuefeng
    Zeng, Weiping
    Wang, Yongzhen
    Niu, Yanzhao
    MATHEMATICS AND COMPUTERS IN SIMULATION, 2025, 227 : 391 - 404
  • [34] A novel immune dominance selection multi-objective optimization algorithm for solving multi-objective optimization problems
    Jin-ke Xiao
    Wei-min Li
    Xin-rong Xiao
    Cheng-zhong LV
    Applied Intelligence, 2017, 46 : 739 - 755
  • [35] MULTI-OBJECTIVE COMBINATORIAL OPTIMIZATION DESIGN METHOD FOR THE COMPRESSOR SPLITTER
    Gao, Limin
    Deng, Xiaoming
    Gao, Lei
    Li, Ruiyu
    Zeng, Ruihui
    Liu, Cunliang
    ASME TURBO EXPO: TURBINE TECHNICAL CONFERENCE AND EXPOSITION, 2015, VOL 2C, 2015,
  • [36] A Bi-population Multi-objective Algorithm for Continuous Multi-objective Optimization Problem
    Chen, Lili
    Wang, Hongfeng
    PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, : 4830 - 4833
  • [37] Cognitive networks QoS multi-objective strategy based on Q-learning algorithm
    Wang, B. (wangbowx@163.com), 1600, Advanced Institute of Convergence Information Technology, Myoungbo Bldg 3F,, Bumin-dong 1-ga, Seo-gu, Busan, 602-816, Korea, Republic of (07):
  • [38] Glowworm swarm optimization algorithm for solving multi-objective optimization problem
    He Deng-xu
    Liu Gui-qing
    Zhu Hua-zheng
    2013 9TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2013, : 11 - 15
  • [39] Solving Multi-Objective Energy Management of a DC Microgrid using Multi-Objective Multiverse Optimization
    Lagouir, Marouane
    Badri, Abdelmajid
    Sayouti, Yassine
    INTERNATIONAL JOURNAL OF RENEWABLE ENERGY DEVELOPMENT-IJRED, 2021, 10 (04): : 911 - 922
  • [40] A novel ε-dominance multi-objective evolutionary algorithms for solving DRS multi-objective optimization problems
    Liu, Liu
    Li, Minqiang
    Lin, Dan
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2007, : 96 - +