Learning to Escape: Multi-mode Policy Learning for the Traveling Salesmen Problem

被引:0
|
作者
Ha, Myoung Hoon [1 ]
Chi, Seunggeun [2 ]
Lee, Sang Wan [3 ]
机构
[1] Korea Adv Inst Sci & Technol, Ctr Neurosci Inspired AI, Daejeon, South Korea
[2] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
[3] Korea Adv Inst Sci & Technol, Dept Brain Cognit Sci, Daejeon, South Korea
关键词
Traveling Salesmen Problem; Neural Combinatoric Optimization; Deep Reinforcement Learning; Transformer;
D O I
10.1109/EAIS58494.2024.10569999
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The traveling salesmen problem (TSP)-one of the most fundamental NP-hard problems in combinatorial optimization-has received considerable attention owing to its direct applicability to real-world routing. Recent studies on TSP have adopted a deep policy network to learn a stochastic acceptance rule. Despite its success in some cases, the structural and functional complexity of the deep policy networks makes it hard to explore the problem space while performing a local search at the same time. We found in our empirical analyses that searching processes are often stuck in the local region, leading to severe performance degradation. To tackle this issue, we propose a novel method for multi-mode policy learning. In the proposed method, a conventional exploration-exploitation scheme is reformulated as the problem of learning to escape from a local search area to induce exploration. We present a multi-mode Markov decision process, followed by policy and value design for local search and escaping modes. Experimental results show that the performance of the proposed method is superior to that of various baseline models, suggesting that the learned escaping policy allows the model to initiate a new local search in promising regions efficiently.
引用
收藏
页码:107 / 117
页数:11
相关论文
共 50 条
  • [31] Applying Deep Learning and Reinforcement Learning to Traveling Salesman Problem
    Miki, Shoma
    Yamamoto, Daisuke
    Ebara, Hiroyuki
    2018 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRONICS & COMMUNICATIONS ENGINEERING (ICCECE), 2018, : 65 - 70
  • [32] An optimal inspection and diagnosis policy for a multi-mode system
    Baohe, S
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2002, 76 (02) : 181 - 188
  • [33] A new heuristic for the multi-mode resource investment problem
    Hsu, CC
    Kim, DS
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2005, 56 (04) : 406 - 413
  • [34] Some appraisal criteria for multi-mode scheduling problem
    Abdel-Basset, Mohamed
    Atef, Asmaa
    Hussein, Abdel-Nasser
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2019, 10 (04) : 1641 - 1654
  • [35] Multi-mode hybrid electric vehicle routing problem
    Seyfi, Majid
    Alinaghian, Mahdi
    Ghorbani, Erfan
    catay, Buelent
    Sabbagh, Mohammad Saeid
    TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2022, 166
  • [36] A Critical Chain Project Scheduling Problem with Multi-mode
    Peng, W. L.
    Jiao, Q. J.
    ADVANCES IN MATERIALS MANUFACTURING SCIENCE AND TECHNOLOGY XIV, 2012, 697-698 : 541 - 545
  • [37] Some appraisal criteria for multi-mode scheduling problem
    Mohamed Abdel-Basset
    Asmaa Atef
    Abdel-Nasser Hussein
    Journal of Ambient Intelligence and Humanized Computing, 2019, 10 : 1641 - 1654
  • [38] Multi-mode hybrid electric vehicle routing problem
    Seyfi, Majid
    Alinaghian, Mahdi
    Ghorbani, Erfan
    Çatay, Bülent
    Saeid Sabbagh, Mohammad
    Transportation Research Part E: Logistics and Transportation Review, 2022, 166
  • [39] Radio-Frequency Multi-Mode OAM Detection Based on UCA Samples Learning
    Fan, Jiabei
    Chen, Rui
    Long, Wen-Xuan
    Moretti, Marco
    Li, Jiandong
    SPAWC 2021: 2021 IEEE 22ND INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (IEEE SPAWC 2021), 2020, : 56 - 60
  • [40] Multi-Mode Online Knowledge Distillation for Self-Supervised Visual Representation Learning
    Song, Kaiyou
    Xie, Jin
    Zhang, Shan
    Luo, Zimeng
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11848 - 11857