Simulated annealing with reinforcement learning for the set team orienteering problem with time windows

被引:2
|
作者
Yu, Vincent F. [1 ,2 ]
Salsabila, Nabila Yuraisyah [1 ]
Lin, Shih-Wei [3 ,4 ,5 ]
Gunawan, Aldy [6 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Ind Management, Taipei, Taiwan
[2] Natl Taiwan Univ Sci & Technol, Ctr Cyber Phys Syst Innovat, Taipei, Taiwan
[3] Chang Gung Univ, Dept Informat Management, Taoyuan, Taiwan
[4] Ming Chi Univ Technol, Dept Ind Engn & Management, Taipei, Taiwan
[5] Keelung Chang Gung Mem Hosp, Dept Emergency Med, Keelung City, Taiwan
[6] Singapore Management Univ, Sch Comp & Informat Syst, Singapore, Singapore
关键词
Team orienteering problem with time windows; Set orienteering problem; Simulated annealing; LOCAL SEARCH; HEURISTICS; ALGORITHMS;
D O I
10.1016/j.eswa.2023.121996
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This research investigates the Set Team Orienteering Problem with Time Windows (STOPTW), a new variant of the well-known Team Orienteering Problem with Time Windows and Set Orienteering Problem. In the STOPTW, customers are grouped into clusters. Each cluster is associated with a profit attainable when a customer in the cluster is visited within the customer's time window. A Mixed Integer Linear Programming model is formulated for STOPTW to maximizing total profit while adhering to time window constraints. Since STOPTW is an NP-hard problem, a Simulated Annealing with Reinforcement Learning (SARL) algorithm is developed. The proposed SARL incorporates the core concepts of reinforcement learning, utilizing the epsilon-greedy algorithm to learn the fitness values resulting from neighborhood moves. Numerical experiments are conducted to assess the performance of SARL, comparing the results with those obtained by CPLEX and Simulated Annealing (SA). For small instances, both SARL and SA algorithms outperform CPLEX by obtaining eight optimal solutions and 12 better solutions. For large instances, both algorithms obtain better solutions to 28 out of 29 instances within shorter computational times compared to CPLEX. Overall, SARL outperforms SA by resulting in lower gap percentages within the same computational times. Specifically, SARL outperforms SA in solving 13 large STOPTW benchmark instances. Finally, a sensitivity analysis is conducted to derive managerial insights.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] A simulated annealing heuristic for the team orienteering problem with time windows
    Lin, Shih-Wei
    Yu, Vincent F.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2012, 217 (01) : 94 - 107
  • [2] Set Team Orienteering Problem with Time Windows
    Gunawan, Aldy
    Yu, Vincent F.
    Sutanto, Andro Nicus
    Jodiawan, Panca
    LEARNING AND INTELLIGENT OPTIMIZATION, LION 15, 2021, 12931 : 142 - 149
  • [3] A simulated annealing heuristic for the multiconstraint team orienteering problem with multiple time windows
    Lin, Shih-Wei
    Yu, Vincent F.
    APPLIED SOFT COMPUTING, 2015, 37 : 632 - 642
  • [4] A reinforcement learning approach to the orienteering problem with time windows
    Gama, Ricardo
    Fernandes, Hugo L.
    COMPUTERS & OPERATIONS RESEARCH, 2021, 133
  • [5] Solving the team orienteering problem with time windows and mandatory visits by multi-start simulated annealing
    Lin, Shih-Wei
    Yu, Vincent F.
    COMPUTERS & INDUSTRIAL ENGINEERING, 2017, 114 : 195 - 205
  • [6] Team Orienteering Problem with Time Windows and Variable Profit
    Marzal, Eliseo
    Sebastia, Laura
    PROCEEDINGS OF THE 2022 17TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2022, : 347 - 350
  • [7] The Multiconstraint Team Orienteering Problem with Multiple Time Windows
    Souffriau, Wouter
    Vansteenwegen, Pieter
    Vanden Berghe, Greet
    Van Oudheusden, Dirk
    TRANSPORTATION SCIENCE, 2013, 47 (01) : 53 - 63
  • [8] A Composite Algorithm for the Team Orienteering Problem with Time Windows
    Konstantopoulos, Charalampos
    Orfanos, Dimitrios
    2016 7TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS & APPLICATIONS (IISA), 2016,
  • [9] Team orienteering problem with time windows and time-dependent scores
    Yu, Vincent F.
    Jewpanya, Parida
    Lin, Shih-Wei
    Redi, A. A. N. Perwira
    COMPUTERS & INDUSTRIAL ENGINEERING, 2019, 127 : 213 - 224
  • [10] Iterated local search for the team orienteering problem with time windows
    Vansteenwegen, Pieter
    Souffriau, Wouter
    Vanden Berghe, Greet
    Van Oudheusden, Dirk
    COMPUTERS & OPERATIONS RESEARCH, 2009, 36 (12) : 3281 - 3290